KR20190005516A

KR20190005516A - Image Patch Normalizing Method and System

Info

Publication number: KR20190005516A
Application number: KR1020170086282A
Authority: KR
Inventors: 김진형; 임성민; 조동현; 최원섭; 김민기; 김해동; 심은섭
Original assignee: 한국항공우주연구원
Priority date: 2017-07-07
Filing date: 2017-07-07
Publication date: 2019-01-16
Also published as: KR101958275B1

Abstract

The present invention relates to a method and a system for normalizing an image patch. According to the present invention, the method comprises the steps of: extracting, from an input image, an image patch in which a recognition target is included; normalizing the extracted image patch with a predetermined size based on distance information of the recognition target; and transmitting the normalized image patch to a recognizer to detect the recognition target. According to the present invention, by normalizing the image patch extracted from the input image with the predetermined size based on the distance information of the recognition target, the recognition target can be accurately detected by using a single recognizer, and a detection time of the recognition target can be remarkably reduced.

Description

[0001] Image patch normalization method and system [

본 발명은 영상 패치 정규화 방법 및 시스템에 관한 것으로, 보다 상세하게는 영상 기반 객체 검출을 위해 검출/인식기에 입력되는 영상 패치의 크기를 정규화(Normalization)된 크기로 생성할 수 있는 영상 패치 정규화 방법 및 시스템에 관한 것이다.The present invention relates to an image patch normalization method and system, and more particularly, to an image patch normalization method capable of generating a size of an image patch input to a detection / recognizer for normalization, &Lt; / RTI >

최근 들어, 다양한 산업 분야에서 영상의 사용이 증대됨에 따라 영상에 포함된 정보를 이용하기 위한 시도가 다양하게 이루어지고 있다. 특히, 영상에 포함된 객체는 영상 데이터베이스 인덱싱(indexing), 영상 검색 또는 객체 기반 영상 전송 등과 같이 다양한 응용 분야에서 활용되고 있으므로 영상을 기반으로 한 객체 검출 방법이 요구되고 있다.In recent years, as the use of images has increased in various industrial fields, various attempts have been made to use information contained in images. In particular, since objects included in images are utilized in various applications such as image database indexing, image retrieval, or object-based image transmission, an image-based object detection method is required.

현재 영상 처리 분야에서는 객체 단위의 영상 인식이 불가능하므로 입력 영상을 객체보다 작은 단위의 조각으로 분할한 후, 분할된 조각이 목표하는 객체인지 여부를 판별하는 방식으로 객체를 검출하고 있다.In the current image processing field, it is impossible to recognize the image per object, so the input image is divided into fragments smaller than the object, and then the object is detected by determining whether the fragment is a target object.

이와 같이, 입력 영상을 작은 단위의 조각으로 분할하는 과정에서 분할된 조각의 크기와 형태는 제각각이기 때문에 조각 내 특징 정보(색상, 채도, 휘도 등)를 추출하여 검출/인식기에 입력함으로써 조각 내 특징 정보가 목표하는 객체의 특징과 일치하는지 판단하여 목표하는 객체인지 여부를 판별할 수 있다.Since the size and shape of the divided pieces are different in the process of dividing the input image into the small unit pieces, the feature information (color, saturation, luminance, etc.) in the piece is extracted and inputted to the detection / It is possible to determine whether or not the information is a target object by judging whether the information matches the characteristics of the target object.

최근 딥 러닝(Deep learning)으로 통칭되는 인공신경망 기반 인식/검출 기술이 많이 연구되고 있는데, 인공신경망 기반 인식/검출 기술은 인공신경망을 모델로 하여 영상에 존재하는 객체의 특징을 추출한 후 추출된 특징을 이용하여 목표 객체 여부를 판별할 수 있다.Recently, recognition / detection technology based on artificial neural network, which is known as deep learning, has been studied extensively. Artificial neural network-based recognition / detection technology extracts characteristics of objects existing in an image using artificial neural network as a model, To determine whether the object is a target object.

그리고 인공신경망 기반 인식/검출 기술은 일정한 사이즈의 영상을 검출/인식기에 입력해야 하므로 일정한 크기를 가진 윈도우를 영상 내에서 슬라이딩하면서 검출/인식기에 입력해줄 수 있다.In addition, since artificial neural network based recognition / detection technology requires a certain size of image to be input to a detection / recognition device, a window having a predetermined size can be input to a detection / recognition device while sliding in an image.

하지만, 영상은 원근에 따라 객체의 크기가 변하기 때문에 윈도우의 크기를 여러 번 변환해가며 인식 및 검출을 시도해야 하는 문제점이 있었다.However, since the size of the object varies depending on the perspective of the image, there has been a problem that the size of the window must be converted several times to attempt recognition and detection.

이에 따라, 영상에서의 크기별 검출/인식기를 별도로 만들어야 하는 문제점이 있었다.Accordingly, there has been a problem that a detection / recognition unit for each size in an image has to be separately prepared.

한국공개특허공보 제2010-0034105호(공개일 2010. 04. 01)Korean Patent Publication No. 2010-0034105 (published on Apr. 01, 2010)

따라서 본 발명이 해결하고자 하는 기술적 과제는 입력 영상에서 추출된 영상 패치를 인식 대상의 거리 정보에 기초하여 미리 정해진 크기로 정규화함으로써 단일의 인식기를 사용하여 인식 대상을 정확하게 검출할 수 있으며, 인식 대상의 검출 시간을 획기적으로 감소시킬 수 있는 영상 패치 정규화 방법 및 시스템을 제공하는 것이다.SUMMARY OF THE INVENTION Accordingly, the present invention has been made in view of the above problems, and it is an object of the present invention to provide a method and apparatus for correcting a recognition target by using a single recognizer by normalizing an image patch extracted from an input image to a predetermined size based on distance information of a recognition target, And to provide a video patch normalization method and system capable of dramatically reducing detection time.

또한, 본 발명은 명시적으로 언급된 목적 이외에도, 후술하는 본 발명의 구성으로부터 달성될 수 있는 다른 목적도 포함한다.Further, the present invention includes other objects that can be achieved from the construction of the present invention described later, in addition to the objects explicitly mentioned.

상기한 기술적 과제를 해결하기 위한 본 발명의 실시예에 따른 영상 패치 정규화 방법은 입력 영상으로부터 인식 대상이 포함된 영상 패치를 추출하는 단계, 상기 추출된 영상 패치를 상기 인식 대상의 거리 정보에 기초하여 미리 정해진 크기로 정규화하는 단계, 그리고 상기 정규화된 영상 패치를 상기 인식 대상을 검출하기 위한 인식기에 전송하는 단계를 포함한다.According to an aspect of the present invention, there is provided an image patch normalization method including extracting an image patch including a recognition object from an input image, extracting the extracted image patch based on distance information of the recognition object, Normalizing the normalized image patch to a predetermined size, and transmitting the normalized image patch to a recognizer for detecting the recognition target.

상기 추출된 영상 패치를 상기 인식 대상의 거리 정보에 기초하여 미리 정해진 크기로 정규화하는 단계는 상기 추출된 영상 패치에 대한 대표 거리를 산출하는 단계, 상기 산출된 영상 패치에 대한 대표 거리 및 실제 인식 대상의 사전 검출 정보를 이용하여 실제 인식 대상에 대한 영상 평면으로의 사영의 폭을 계산하는 단계, 목표하는 패치의 크기 및 상기 계산된 실제 인식 대상에 대한 영상 평면으로의 사영의 폭을 이용하여 스케일을 계산하는 단계, 그리고 상기 추출된 영상 패치의 크기를 상기 계산된 스케일에 따라 조정하는 단계를 포함한다.Wherein the step of normalizing the extracted image patch to a predetermined size based on the distance information of the recognition target includes calculating a representative distance to the extracted image patch, calculating a representative distance to the calculated image patch, Calculating a width of a projected image on an image plane of an actual recognition target using the pre-detection information of the actual recognition target, calculating a scale using the size of the target patch and the projection width of the image plane on the calculated actual recognition target And adjusting the size of the extracted image patch according to the calculated scale.

상기 실제 인식 대상에 대한 영상 평면으로의 사영의 폭은 하기의 수식에 의해 계산되고,The width of the projected image on the image plane for the actual object to be recognized is calculated by the following equation,

는 상기 대표 거리,

는 상기 실제 인식 대상의 사전 검출 정보로서, 실제 인식 대상의 폭이고,

는 카메라의 초점 거리일 수 있다.

The representative distance,

Is the actual detection target pre-detection information, the actual detection target width,

May be the focal length of the camera.

상기 스케일은 하기의 수식에 의해 계산되고,The scale is calculated by the following equation,

는 상기 목표하는 패치의 크기로서, 목표하는 패치의 폭,

는 상기 실제 인식 대상에 대한 영상 평면으로의 사영의 폭일 수 있다.

Is the size of the target patch, the width of the target patch,

May be the width of the projection on the image plane for the actual recognition target.

상기 추출된 영상 패치를 상기 인식 대상의 거리 정보에 기초하여 미리 정해진 크기로 정규화하는 단계는 상기 영상 패치의 중심점을 상기 목표하는 패치의 중심으로 이동시키고, 상기 목표하는 패치의 크기를 벗어나는 영역은 제거하고, 상기 목표하는 패치 내에 삽입되지 않는 영역은 영 채우기(zero padding)를 수행하는 단계를 더 포함할 수 있다.Wherein the step of normalizing the extracted image patch to a predetermined size based on the distance information of the recognition target moves a center point of the image patch to the center of the target patch, And performing zero padding on an area not inserted in the target patch.

한편, 본 발명의 실시예에 따른 영상 패치 정규화 시스템은 입력 영상으로부터 인식 대상이 포함된 영상 패치를 추출하는 패치 추출부, 상기 추출된 영상 패치를 상기 인식 대상의 거리 정보에 기초하여 미리 정해진 크기로 정규화하는 정규화부, 그리고 상기 정규화된 영상 패치를 상기 인식 대상을 검출하기 위한 인식기에 전송하는 패치 전송부를 포함한다.Meanwhile, the image patch normalization system according to an embodiment of the present invention includes a patch extraction unit that extracts an image patch including an object to be recognized from an input image, a patch extraction unit that extracts the extracted image patch from a predetermined size And a patch transmission unit for transmitting the normalized image patch to a recognizer for detecting the recognition target.

상기 정규화부는 상기 추출된 영상 패치에 대한 대표 거리를 산출하고, 상기 산출된 영상 패치에 대한 대표 거리 및 실제 인식 대상의 사전 검출 정보를 이용하여 실제 인식 대상에 대한 영상 평면으로의 사영의 폭을 계산하고, 목표하는 패치의 크기 및 상기 계산된 실제 인식 대상에 대한 영상 평면으로의 사영의 폭을 이용하여 스케일을 계산하고, 상기 추출된 영상 패치의 크기를 상기 계산된 스케일에 따라 조정할 수 있다.The normalizing unit calculates the representative distance of the extracted image patch, calculates the width of the projected image on the image plane of the actual recognition target using the representative distance of the calculated image patch and the actually detected object detection information And calculates a scale using the size of the target patch and the width of the projected image on the image plane for the calculated actual recognition target, and adjusts the size of the extracted image patch according to the calculated scale.

는 상기 대표 거리,

는 카메라의 초점 거리일 수 있다.

The representative distance,

May be the focal length of the camera.

는 상기 목표하는 패치의 크기로서, 목표하는 패치의 폭,

Is the size of the target patch, the width of the target patch,

상기 정규화부는 상기 영상 패치의 중심점을 상기 목표하는 패치의 중심으로 이동시키고, 상기 목표하는 패치의 크기를 벗어나는 영역은 제거하고, 상기 목표하는 패치 내에 삽입되지 않는 영역은 영 채우기(zero padding)를 수행하는 단계를 더 포함할 수 있다.Wherein the normalizing unit moves the center point of the image patch to the center of the target patch, removes the area that is out of the target patch size, and performs zero padding on the area not inserted in the target patch The method comprising the steps of:

이와 같이 본 발명의 실시예에 따른 영상 패치 정규화 방법 및 시스템에 따르면, 입력 영상에서 추출된 영상 패치를 인식 대상의 거리 정보에 기초하여 미리 정해진 크기로 정규화함으로써 단일의 인식기를 사용하여 인식 대상을 정확하게 검출할 수 있으며, 인식 대상의 검출 시간을 획기적으로 감소시킬 수 있는 장점이 있다.As described above, according to the image patch normalization method and system according to the embodiment of the present invention, the image patch extracted from the input image is normalized to a predetermined size based on the distance information of the recognition object, Therefore, there is an advantage that the detection time of the recognition target can be drastically reduced.

즉 입력 영상 내 인식 대상을 검출하기 위해 크기별 인식기를 별도로 만들 필요 없이 단일의 인식기를 사용하여 인식 대상을 검출할 수 있는 장점이 있다.That is, there is an advantage that a recognition target can be detected by using a single recognizer without separately creating a size recognizer in order to detect the recognition target in the input image.

그리고 입력 영상 내 윈도우 슬라이딩을 단 한 번만 수행하면 되므로 인식 대상을 검출하기 위하여 상당량을 차지하는 탐색 시간을 극적으로 줄일 수 있으며, 인식 대상의 검출 정확도를 높일 수 있는 장점이 있다.Since window sliding in the input image needs to be performed only once, the search time occupying a considerable amount can be dramatically reduced in order to detect the recognition object, and the detection accuracy of the recognition object can be enhanced.

한편, 본 발명의 효과는 상술된 것에 국한되지 않고 후술하는 본 발명의 구성으로부터 도출될 수 있는 다른 효과도 본 발명의 효과에 포함된다.On the other hand, the effects of the present invention are not limited to those described above, and other effects that can be derived from the constitution of the present invention described below are also included in the effects of the present invention.

도 1은 본 발명의 일 실시예에 따른 영상 패치 정규화 시스템의 구성도이다.
도 2는 본 발명의 일 실시예에 따른 영상 패치 정규화 방법을 설명하기 위한 도면이다.
도 3은 본 발명의 일 실시예에 따른 정규화된 영상 패치의 예를 보여주는 도면이다.
도 4는 본 발명의 일 실시예에 따른 영상 패치 정규화 과정을 보여주는 동작 흐름도이다.FIG. 1 is a configuration diagram of an image patch normalization system according to an embodiment of the present invention.
FIG. 2 is a view for explaining an image patch normalization method according to an embodiment of the present invention.
3 is a diagram illustrating an example of a normalized image patch according to an embodiment of the present invention.
4 is a flowchart illustrating an image patch normalization process according to an exemplary embodiment of the present invention.

그러면 첨부한 도면을 참고로 하여 본 발명의 실시예에 대하여 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 상세히 설명한다.Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those skilled in the art can easily carry out the present invention.

도 1은 본 발명의 일 실시예에 따른 영상 패치 정규화 시스템의 구성도를 나타낸다.FIG. 1 is a block diagram of an image patch normalization system according to an embodiment of the present invention.

도 1에 도시한 바와 같이, 영상 패치 정규화 시스템(1)은 패치 추출부(100), 정규화부(200) 및 패치 전송부(300)를 포함하여 구성된다.As shown in FIG. 1, the image patch normalization system 1 includes a patch extraction unit 100, a normalization unit 200, and a patch transmission unit 300.

영상 패치 정규화 시스템(1)은 딥 러닝(Deep Learning)으로 통칭되는 인공신경망 기반 인식/검출 시스템 등에서 구현될 수 있는데, 딥 러닝은 인공신경망을 이용하여 데이터를 군집화하거나 분류하는데 사용하는 기술로서, 수많은 데이터 속에서 패턴을 발견하여 인간이 사물을 구분하듯 컴퓨터가 데이터를 나누는 것을 말한다. The image patch normalization system 1 can be implemented in an artificial neural network-based recognition / detection system commonly referred to as a deep learning. Deep learning is a technology used for clustering or classifying data using an artificial neural network. A pattern is found in the data, and the computer divides the data as if the human identifies the object.

딥 러닝 기술 중 CNN(Convolutional Neural Network) 기술은 인간의 시각을 처리하는 시신경 구조를 모사하여 만든 인공신경망 모델 중에서 가장 대표적인 기술로서, 합성곱 신경망이라고도 한다. 이미지를 이해하고 이로부터 고수준의 추상화된 정보를 추출하거나 새로운 질감을 가진 그림을 드리는 등과 같은 다양한 영상 처리 또는 컴퓨터 비전 분야 등에서 많이 연구되고 있으며, 비디오 영상 중에서 수천 가지의 서로 다른 물체를 정확하게 인식하고 이를 기반으로 새로운 물체를 인식하는 학습을 추가할 수도 있다.CNN (Convolutional Neural Network) technology is one of the artificial neural network models that simulate the optic nerve structure that processes human vision. It has been studied in various image processing or computer vision fields such as understanding images and extracting high-level abstracted information from them or giving pictures with new texture. In the field of computer vision, thousands of different objects are accurately recognized Based learning can be added to recognize new objects.

패치 추출부(100)는 입력 영상으로부터 인식 대상이 포함된 영상 패치를 추출할 수 있다. 입력 영상은 인식 대상이 포함된 영상으로서, 카메라 또는 영상 저장 장치 등과 같은 영상 획득 장치로부터 획득될 수 있다.The patch extraction unit 100 can extract an image patch including an object of recognition from an input image. The input image can be obtained from an image acquisition device such as a camera or an image storage device as an image including an object to be recognized.

패치 추출부(100)는 영상 분할 기법을 사용하여 입력 영상에서 인식 대상을 분리함으로써 영상 패치를 추출할 수 있다. The patch extracting unit 100 can extract an image patch by separating a recognition target from an input image using an image segmentation technique.

또는 패치 추출부(100)는 영상 처리 과정을 통하여 입력 영상에 포함된 인식 대상의 특징점들을 추출하고, 추출된 각각 특징점들이 포함된 영역을 구분하여 영상 패치를 추출할 수 있다. 즉 패치 추출부(100)는 인식 대상의 특징점을 중심으로 입력 영상을 잘라내어 영상 패치를 추출할 수 있다.Alternatively, the patch extraction unit 100 may extract the feature points of the recognition object included in the input image through the image processing process, and extract the image patches by dividing the region including the extracted feature points. That is, the patch extraction unit 100 can extract the image patch by cutting out the input image around the minutiae of the recognition target.

이외에 패치 추출부(100)는 다양한 방식을 사용하여 영상 패치를 추출할 수 있음은 물론이다.It is needless to say that the patch extraction unit 100 can extract image patches using various methods.

정규화부(200)는 추출된 영상 패치를 인식 대상의 거리 정보에 기초하여 미리 정해진 크기로 정규화할 수 있다.The normalization unit 200 can normalize the extracted image patch to a predetermined size based on the distance information of the recognition object.

보다 자세하게는, 정규화부(200)는 해당 영상 패치에 대한 대표 거리, 실제 인식 대상의 사전 검출 정보 및 카메라 내부 파라미터(카메라의 초점 거리)를 이용하여 영상 패치의 크기를 정규화된 크기로 생성할 수 있다.More specifically, the normalization unit 200 can generate a normalized size of the image patch using the representative distance of the image patch, the pre-detection information of the actual recognition target, and the camera internal parameter (camera focal length) have.

도 2는 본 발명의 일 실시예에 따른 영상 패치 정규화 방법을 설명하기 위한 도면으로서, 도 2에 도시한 바와 같이, 정규화부(200)는 추출된 영상 패치(IP)에 대한 대표 거리(

)를 산출할 수 있다. 여기서, 대표 거리(

)는 입력 영상을 촬영한 카메라로부터 영상 패치(IP)의 중심점까지의 거리 값, 영상 패치 전역 거리 값의 평균 값 또는 중간 값 등으로 이루어질 수 있다.FIG. 2 is a diagram for explaining an image patch normalization method according to an embodiment of the present invention. As shown in FIG. 2, the normalization unit 200 normalizes the representative distance

) Can be calculated. Here, the representative distance (

) May be a distance value from the camera that captured the input image to the center point of the image patch IP, an average value or an intermediate value of the image patch global distance value, and the like.

그리고 정규화부(200)는 산출된 영상 패치(IP)에 대한 대표 거리(

) 및 실제 인식 대상(RA)의 사전 검출 정보를 이용하여 실제 인식 대상에 대한 영상 평면으로의 사영의 폭(

)을 계산할 수 있다. 여기서 실제 인식 대상(RA)의 사전 검출 정보는 실제 인식 대상(RA)의 종횡비, 폭 또는 크기 등으로 이루어질 수 있다.Then, the normalization unit 200 calculates a representative distance (for example,

) And the actual detection target (RA), the width of the projected image to the image plane (

) Can be calculated. Here, the pre-detection information of the actual recognition target RA may be made of the aspect ratio, the width, or the size of the actual recognition target RA.

보다 자세하게는, 정규화부(200)는 산출된 영상 패치(IP)에 대한 대표 거리(

) 및 실제 인식 대상(RA)의 사전 검출 정보를 이용하여 해당 위치에 실제 인식 대상(RA)이 있을 경우 영상 평면으로의 사영의 폭(

)을 하기의 [수식 1]을 이용하여 계산할 수 있다.More specifically, the normalization unit 200 calculates a representative distance ("

) And the actual detection object (RA) at the corresponding position using the detection information of the actual detection object (RA), the width of the projection in the image plane

) Can be calculated using the following equation (1).

[수식 1][Equation 1]

는 영상 패치에 대한 대표 거리,

는 실제 인식 대상의 사전 검출 정보로서, 실제 인식 대상의 폭이고,

는 카메라의 초점 거리일 수 있다.

The representative distance of the image patch,

Is the actual detection subject pre-detection information, and is the actual detection subject width,

May be the focal length of the camera.

그런 다음, 정규화부(200)는 목표하는 패치(PP)의 크기 및 계산된 실제 인식 대상(RA)에 대한 영상 평면으로의 사영의 폭(

)을 이용하여 스케일(S)을 계산할 수 있다.Then, the normalization unit 200 calculates the size of the target patch PP and the width of the projected image on the image plane (for the calculated actual recognition target RA)

) Can be used to calculate the scale (S).

보다 자세하게는, 정규화부(200)는 인식기(50)에 입력되는 영상 패치의 폭과 대표 거리만큼 떨어져 있는 실제 인식 대상(RA)에 대한 영상 평면에서의 사영의 폭(

)의 비율인 스케일을 아래 [수식 2]를 사용하여 계산할 수 있다.More specifically, the normalization unit 200 calculates the width of the projected image in the image plane with respect to the actual recognition target RA that is distant from the width of the image patch input to the recognizer 50 by a representative distance

) Can be calculated using the following equation (2).

[수식 2] [Equation 2]

는 목표하는 패치의 크기로서, 목표하는 패치의 폭,

는 실제 인식 대상에 대한 영상 평면으로의 사영의 폭일 수 있다.

Is the size of the target patch, the width of the target patch,

May be the width of the projection onto the image plane for the actual object to be recognized.

정규화부(200)는 추출된 영상 패치의 크기를 계산된 스케일에 따라 조정할 수 있는데, 영상 패치의 크기를 스케일의 배율로 확대, 축소 및 보간을 수행할 수 있다. 즉 정규화부(200)는 추출된 영상 패치(IP)의 크기가 목표하는 패치(PP)의 크기가 되도록 스케일의 배율로 확대, 축소 및 보간을 수행할 수 있다.The normalization unit 200 can adjust the size of the extracted image patch according to the calculated scale, and can enlarge, reduce, and interpolate the size of the image patch at a scale of the scale. That is, the normalization unit 200 can perform enlargement, reduction, and interpolation at the scale of the scale so that the size of the extracted image patch IP becomes the size of the target patch PP.

그리고 정규화부(200)는 인식 대상이 아닌 패치가 영상 패치로서 들어올 수 있기 때문에 목표하는 패치에 알맞은 크기로 정확한 변환이 이루어지지 않을 수도 있다. 따라서 정규화부(200)는 확대, 축소 및 보간된 영상 패치의 중심점을 목표하는 패치의 중심으로 병진 이동시켜준 후, 목표하는 패치의 크기를 벗어나는 부분은 제거하고, 목표하는 패치 내에 삽입되지 않는 부분은 영 채우기(zero padding)를 수행할 수 있다.Since the non-recognition target patches may come in as an image patch, the normalization unit 200 may not correctly convert the size of the target patch to a proper size. Therefore, the normalization unit 200 translates the center point of the enlarged, reduced, and interpolated image patches to the center of the target patch, then removes the portion that deviates from the target patch size, Can perform zero padding.

패치 전송부(300)는 정규화된 영상 패치를 인식 대상을 검출하기 위한 인식기(50)에 전송할 수 있다.The patch transmitting unit 300 may transmit the normalized image patch to the recognizer 50 for detecting the recognition target.

도 3은 본 발명의 일 실시예에 따른 정규화된 복수의 영상 패치의 예를 보여주는 도면으로서, 도 3과 같이, 본 발명의 일 실시예에 의한 영상 패치 정규화 시스템에 따르면 각기 다른 크기로 추출된 영상 패치를 일정한 크기로 정규화함으로써 동일한 크기를 가지는 복수의 영상 패치를 인식기(50)에 입력할 수 있다.FIG. 3 is a diagram illustrating an example of a plurality of normalized image patches according to an exemplary embodiment of the present invention. As shown in FIG. 3, according to an image patch normalization system according to an exemplary embodiment of the present invention, A plurality of image patches having the same size can be input to the recognizer 50 by normalizing the patches to a predetermined size.

그러면, 인식기(50)는 예컨대, 주파수 영역에서 영상 패치 내부의 에지(Edge)에 대한 기하학적 특징을 추출하거나 미리 정의된 영상 패치 정보와 입력 벡터 정보 등을 비교하여 인식 대상에 대한 인식 및 검출을 수행할 수 있게 된다. Then, the recognizer 50 extracts a geometric feature for an edge in the image patch in the frequency domain or compares the pre-defined image patch information with input vector information to perform recognition and detection of the recognition target .

이와 같이, 본 발명의 일 실시예에 따른 영상 패치의 정규화 시스템에 따르면, 크게 두 가지의 응용을 기대할 수 있다. 첫째는 인공신경망 기반 인식기(50)를 학습시킬 때, 입력되는 영상/깊이 지도 패치의 크기를 자동으로 정규화할 수 있다. 그리고 입력 영상에서 검출/인식을 위한 영상 패치를 생성할 경우에도 해당 영상 패치의 거리 정보를 참조하여 가변적인 크기의 윈도우를 적용하여 인식기(50)에 적용할 수 있다.As described above, according to the image patch normalization system according to an embodiment of the present invention, two applications can be largely expected. First, when learning the artificial neural network based recognizer 50, the size of the inputted image / depth map patch can be automatically normalized. Also, when generating an image patch for detection / recognition in the input image, the image can be applied to the recognizer 50 by applying a window of a variable size with reference to the distance information of the image patch.

이하에서는 본 발명의 일 실시예에 따른 영상 패치 정규화 방법에 대하여 설명하도록 한다.Hereinafter, an image patch normalization method according to an embodiment of the present invention will be described.

도 4은 본 발명의 일 실시예에 따른 영상 패치 정규화 과정을 보여주는 동작 흐름도를 나타낸다.4 is a flowchart illustrating an image patch normalization process according to an exemplary embodiment of the present invention.

도 4에 도시한 바와 같이, 입력 영상으로부터 인식 대상이 포함된 영상 패치를 추출할 수 있다(S400). 입력 영상은 인식 대상이 포함된 영상으로서, 카메라 또는 영상 저장 장치 등과 같은 영상 획득 장치로부터 획득될 수 있다.As shown in FIG. 4, an image patch including an object to be recognized can be extracted from the input image (S400). The input image can be obtained from an image acquisition device such as a camera or an image storage device as an image including an object to be recognized.

보다 자세하게는, 영상 분할 기법을 사용하여 입력 영상에서 인식 대상을 분리함으로써 영상 패치를 추출할 수 있다. 또는 영상 처리 과정을 통하여 입력 영상에 포함된 인식 대상의 특징점들을 추출하고, 추출된 각각 특징점들이 포함된 영역을 구분하여 영상 패치를 추출할 수 있다. 즉 인식 대상의 특징점을 중심으로 입력 영상을 잘라내어 영상 패치를 추출할 수 있다.More specifically, an image patch can be extracted by separating a recognition object from an input image by using an image segmentation technique. Alternatively, the feature points of the recognition target included in the input image may be extracted through the image processing process, and the image patches may be extracted by dividing the regions containing the extracted feature points. That is, the input image can be cut around the minutiae of the recognition target to extract the image patch.

이외에 다양한 방식을 사용하여 영상 패치를 추출할 수 있음은 물론이다.It is needless to say that an image patch can be extracted using various methods.

그리고 추출된 영상 패치를 인식 대상의 거리 정보에 기초하여 미리 정해진 크기로 정규화할 수 있다(S410).Then, the extracted image patch can be normalized to a predetermined size based on the distance information of the recognition object (S410).

보다 자세하게는, 해당 영상 패치에 대한 대표 거리, 실제 인식 대상의 사전 검출 정보 및 카메라 내부 파라미터(카메라의 초점 거리)를 이용하여 영상 패치의 크기를 정규화된 크기로 생성할 수 있다.More specifically, the size of the image patch can be generated in a normalized size using the representative distance to the image patch, the pre-detection information of the actual recognition target, and the camera internal parameter (the focal distance of the camera).

도 2에 도시한 바와 같이, 정규화부(200)는 추출된 영상 패치(IP)에 대한 대표 거리(

)를 산출할 수 있다. 여기서, 대표 거리(

)는 입력 영상을 촬영한 카메라로부터 영상 패치(IP)의 중심점까지의 거리 값, 영상 패치 전역 거리 값의 평균 값 또는 중간 값 등으로 이루어질 수 있다.As shown in FIG. 2, the normalization unit 200 calculates a representative distance (for example,

) Can be calculated. Here, the representative distance (

그리고 산출된 영상 패치(IP)에 대한 대표 거리(

)을 계산할 수 있다. 여기서 실제 인식 대상(RA)의 사전 검출 정보는 실제 인식 대상(RA)의 종횡비, 폭 또는 크기 등으로 이루어질 수 있다.The representative distance (IP) of the image patch

보다 자세하게는, 산출된 영상 패치(IP)에 대한 대표 거리(

)을 하기의 [수식 1]을 이용하여 계산할 수 있다.More specifically, the representative distance ("

) Can be calculated using the following equation (1).

[수식 1][Equation 1]

는 영상 패치에 대한 대표 거리,

는 카메라의 초점 거리일 수 있다.

The representative distance of the image patch,

May be the focal length of the camera.

그런 다음, 목표하는 패치(PP)의 크기 및 계산된 실제 인식 대상(RA)에 대한 영상 평면으로의 사영의 폭(

)을 이용하여 스케일(S)을 계산할 수 있다.Then, the size of the target patch PP and the width of the projected image on the image plane for the calculated actual recognition object RA

) Can be used to calculate the scale (S).

보다 자세하게는, 인식기에 입력되는 영상 패치의 폭과 대표 거리만큼 떨어져 있는 실제 인식 대상(RA)에 대한 영상 평면에서의 사영의 폭(

)의 비율인 스케일을 아래 [수식 2]를 사용하여 계산할 수 있다.More specifically, the width of the projected image in the image plane with respect to the actual recognition object (RA), which is distant from the width of the image patch input to the recognizer by the representative distance

) Can be calculated using the following equation (2).

[수식 2] [Equation 2]

는 목표하는 패치의 크기로서, 목표하는 패치의 폭,

Is the size of the target patch, the width of the target patch,

다음으로, 추출된 영상 패치의 크기를 계산된 스케일에 따라 조정할 수 있는데, 영상 패치의 크기를 스케일의 배율로 확대, 축소 및 보간을 수행할 수 있다. 즉 추출된 영상 패치(IP)의 크기가 목표하는 패치(PP)의 크기가 되도록 스케일의 배율로 확대, 축소 및 보간을 수행할 수 있다.Next, the size of the extracted image patch can be adjusted according to the calculated scale. The size of the image patch can be enlarged, reduced, and interpolated at a scale magnification. That is, enlargement, reduction, and interpolation can be performed at a scale of the scale so that the size of the extracted image patch IP becomes the size of the target patch PP.

그리고 인식 대상이 아닌 패치가 영상 패치로서 들어올 수 있기 때문에 목표하는 패치에 알맞은 크기로 정확한 변환이 이루어지지 않을 수도 있다. 따라서 확대, 축소 및 보간된 영상 패치의 중심점을 목표하는 패치의 중심으로 병진 이동시켜준 후, 목표하는 패치의 크기를 벗어나는 부분은 제거하고, 목표하는 패치 내에 삽입되지 않는 부분은 영 채우기(zero padding)를 수행할 수 있다.Also, since non-recognition patches may come in as image patches, accurate conversion may not be made to the appropriate size for the target patch. Therefore, after the center point of the enlarged, reduced and interpolated image patches is translated to the center of the target patch, the portion deviating from the target patch size is removed, and the portion not inserted in the target patch is zero padded ). &Lt; / RTI >

그런 다음, 정규화된 영상 패치를 인식 대상을 검출하기 위한 인식기에 전송할 수 있다(S420). 즉 각기 다른 크기로 추출된 영상 패치를 일정한 크기로 정규화함으로써 동일한 크기를 가지는 복수의 영상 패치를 인식기에 입력할 수 있다.Then, the normalized image patch may be transmitted to the recognizer for detecting the recognition target (S420). That is, a plurality of image patches having the same size can be input to the recognizer by normalizing the image patches extracted at different sizes to a predetermined size.

그러면, 인식기는 예컨대, 주파수 영역에서 영상 패치 내부의 에지(Edge)에 대한 기하학적 특징을 추출하거나 미리 정의된 영상 패치 정보와 입력 벡터 정보 등을 비교하여 인식 대상에 대한 인식 및 검출을 수행할 수 있게 된다. Then, the recognizer can extract the geometric features of the edge in the image patch in the frequency domain or compare the pre-defined image patch information with the input vector information, for example, to recognize and detect the recognition target do.

본 발명의 실시예는 다양한 컴퓨터로 구현되는 동작을 수행하기 위한 프로그램 명령을 포함하는 컴퓨터로 읽을 수 있는 매체를 포함한다. 이 매체는 앞서 설명한 영상 패치 정규화 방법을 실행시키기 위한 프로그램을 기록한다. 이 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 이러한 매체의 예에는 하드디스크, 플로피디스크 및 자기 테이프와 같은 자기 매체, CD 및 DVD와 같은 광기록 매체, 플롭티컬 디스크(floptical disk)와 자기-광 매체, 롬, 램, 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 구성된 하드웨어 장치 등이 있다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다.Embodiments of the present invention include a computer-readable medium having program instructions for performing various computer-implemented operations. This medium records a program for executing the above-described image patch normalization method. The medium may include program instructions, data files, data structures, etc., alone or in combination. Examples of such media include magnetic media such as hard disks, floppy disks and magnetic tape, optical recording media such as CD and DVD, programmed instructions such as floptical disk and magneto-optical media, ROM, RAM, And a hardware device configured to store and execute the program. Examples of program instructions include machine language code such as those produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter or the like.

이상에서 본 발명의 바람직한 실시예에 대하여 상세하게 설명하였지만 본 발명의 권리범위는 이에 한정되는 것은 아니고 다음의 청구범위에서 정의하고 있는 본 발명의 기본 개념을 이용한 당업자의 여러 변형 및 개량 형태 또한 본 발명의 권리범위에 속하는 것이다.While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments, Of the right.

1: 영상 패치 정규화 시스템
100: 패치 추출부
200: 정규화부
300: 패치 전송부1: Image patch normalization system
100:
200: normalization unit
300: patch transmission unit

Claims

Extracting an image patch including an object to be recognized from an input image,
Normalizing the extracted image patch to a predetermined size based on the distance information of the recognition object, and
Transmitting the normalized image patch to a recognizer for detecting the recognition target
/ RTI >

The method of claim 1,
Wherein the step of normalizing the extracted image patch to a predetermined size based on the distance information of the recognition object comprises:
Calculating a representative distance of the extracted image patch,
Calculating a projection width of the image plane to the image plane using the representative distance of the image patch and the pre-detection information of the object to be recognized,
Calculating a scale using the size of the target patch and the width of the projected image on the image plane for the calculated actual recognition target, and
And adjusting the size of the extracted image patch according to the calculated scale.

3. The method of claim 2,
Wherein the projection width of the image plane to the actual object to be recognized,
Is calculated by the following equation,

A representative distance for the image patch, Is the actual detection target pre-detection information, the actual detection target width,

Is the focal distance of the camera.

3. The method of claim 2,
Wherein the scale comprises:
Is calculated by the following equation,

Is the size of the target patch, the width of the target patch,

Is a projection width of the image plane to the actual recognition target.

3. The method of claim 2,
Wherein the step of normalizing the extracted image patch to a predetermined size based on the distance information of the recognition object comprises:
Moving the center point of the image patch to the center of the target patch,
Removing an area outside the size of the target patch and performing zero padding on the area not inserted in the target patch.

A patch extracting unit for extracting an image patch including a recognition object from an input image,
A normalization unit for normalizing the extracted image patch to a predetermined size based on the distance information of the recognition object,
A patch transmission unit for transmitting the normalized image patch to a recognizer for detecting the recognition object,
The image patch normalization system comprising:

The method of claim 6,
The normalization unit may include:
A representative distance for the extracted image patch is calculated,
Calculating a projection width of the image plane to the image plane using the representative distance of the image patch and the pre-detection information of the object to be actually recognized,
The scale is calculated using the size of the target patch and the width of the projected image on the image plane for the calculated actual recognition target,
And adjusting the size of the extracted image patch according to the calculated scale.

8. The method of claim 7,
Wherein the projection width of the image plane to the actual object to be recognized,
Is calculated by the following equation,

A representative distance for the image patch,

Is the focal distance of the camera.

8. The method of claim 7,
Wherein the scale comprises:
Is calculated by the following equation,

Is the size of the target patch, the width of the target patch,

Is a projection width of the image plane to the actual recognition target.

8. The method of claim 7,
The normalization unit may include:
Moving the center point of the image patch to the center of the target patch,
Removing an area out of the size of the target patch and performing zero padding in an area not inserted in the target patch.