KR101827538B1

KR101827538B1 - Device and method for gender reconition

Info

Publication number: KR101827538B1
Application number: KR1020160181518A
Authority: KR
Inventors: 박강령; 뉴엔치엔잣; 홍형길
Original assignee: 동국대학교 산학협력단
Priority date: 2016-12-28
Filing date: 2016-12-28
Publication date: 2018-02-09

Abstract

The present invention relates to a gender recognizing technology and, more specifically, to a device and a method for recognizing a gender by using a convolutional neural network (CNN). According to an embodiment of the present invention, a gender can be accurately recognized by using a model which learns genders of various people based on deep learning.

Description

[0001] DEVICE AND METHOD FOR GENDER RECONITION [0002]

본 발명은 성별 인식 기술에 관한 것으로서, 구체적으로는 회선 신경망 기술(CNN; Convolutional Neural Network)을 이용한 딥 러닝 기반의 성별 인식 장치 및 방법에 관한 것이다.The present invention relates to gender recognition technology, and more particularly, to a device and method for gender recognition based on deep learning using a CNN (Convolutional Neural Network).

기존의 성별 인식 시스템은 근거리 환경에서 주로 사용한다. 또한, 몸과 옷에 대한 정보, 액세서리, 헤어스타일 등을 성별에 인식한다. 하지만, 근거리 환경의 제약으로 인하여 실내 환경 이외에 실외 환경에서 성별 인식을 할 때는 많은 문제가 발생한다.The existing gender recognition system is mainly used in the near environment. In addition, information on body and clothes, accessories, hairstyle, etc. are recognized by gender. However, due to the limitation of the near - field environment, there are many problems in gender recognition in the outdoor environment other than the indoor environment.

본 발명의 배경기술은 대한민국 공개특허공보 제10-2016-0122452호(2016.10.24)에 개시되어 있다.The background art of the present invention is disclosed in Korean Patent Laid-Open Publication No. 10-2016-0122452 (October 26, 2014).

본 발명은 실내 환경뿐만 아니라 실외 환경에서도 정확도가 향상된 성별 인식을 수행하는 딥 러닝 기반의 성별 인식 장치 및 방법을 제공하기 위한 것이다.The present invention is intended to provide a deep learning-based gender recognition apparatus and method for performing gender recognition with improved accuracy not only in an indoor environment but also in an outdoor environment.

본 발명은 근거리뿐만 아니라 원거리에서도 대상체인 사람의 몸 영역 정보를 이용한 학습을 통하여 신뢰성이 높은 성별 인식을 수행하는 딥 러닝 기반의 성별 인식 장치 및 방법을 제공하기 위한 것이다.The present invention is intended to provide a deep learning-based gender recognition apparatus and method that performs highly reliable gender recognition through learning using body region information of a person who is a target, not only in a near place but also in a remote place.

본 발명의 일 측면에 따르면, 성별 인식 장치가 제공된다.According to an aspect of the present invention, a gender recognition apparatus is provided.

본 발명의 일 실시 예에 따른 성별 인식 장치는 입력된 영상에서 대상체인 사람을 검출하고, 사람의 몸 영역 이미지를 추출하여 몸 영역 정보를 생성하는 몸 영역 정보 생성부, 생성한 몸 영역 정보를 딥 러닝 기반의 회선 신경망(Convolutional Neural Network; CNN) 모델을 이용하여 이미지 특징 정보를 추출하는 이미지 특징 추출부 및 추출된 이미지 특징을 이용하여 사람의 성별을 인식하는 성별 인식부를 포함할 수 있다.The gender recognition apparatus according to an embodiment of the present invention includes a body region information generation unit that detects a person who is a target in an input image and extracts an image of a body region of a person to generate body region information, An image feature extraction unit for extracting image feature information using a running-based Convolutional Neural Network (CNN) model, and a gender recognition unit for recognizing a sex of a person using the extracted image feature.

본 발명의 다른 일 측면에 따르면, 성별 인식 방법 및 이를 실행하는 컴퓨터 프로그램이 제공된다.According to another aspect of the present invention, a gender recognition method and a computer program for executing the method are provided.

본 발명의 일 실시 예에 따른 성별 인식 방법 및 이를 실행하는 컴퓨터 프로그램은 입력된 가시광선 영상으로부터 사람을 검출하고, 검출된 사람의 가시광선 몸 영역 이미지를 생성하는 단계, 입력된 열화상 영상으로부터 사람을 검출하고, 검출된 사람의 열화상 몸 영역 이미지를 생성하는 단계, 가시광선 몸 영역 이미지에 딥 러닝 기반의 회선 신경망 모델을 적용하여 가시광선 이미지 특징을 추출하는 단계, 열화상 몸 영역 이미지에 딥 러닝 기반의 회선 신경망 모델을 적용하여 열화상 이미지 특징을 추출하는 단계 및 추출한 가시광선 이미지 특징 및 열화상 이미지 특징을 조합하고 조합된 이미지 특징을 이용하여 검출된 사람의 성별을 인식하는 단계를 포함할 수 있다.A gender recognition method and a computer program for executing the same according to an embodiment of the present invention include a step of detecting a person from an inputted visible ray image and generating an image of a visible ray body region of the detected person, Applying a deep-run-based neural network model to the visible light body region image to extract a visible light image feature, extracting a visible light body region image, Applying a run-based circuit neural network model to extract thermal image features and combining the extracted visible light image features and thermal image features and using the combined image features to recognize the sex of the detected person .

본 발명의 실시 예에 따르면, 딥러닝(Deep Learning) 기반으로 다양한 사람의 몸 영역 정보를 이용하여 성별 구분에 대해 학습한 회선 신경망 모델을 이용하여 촬영된 사람의 성별을 정확하게 인식할 수 있다.According to the embodiment of the present invention, the gender of the photographed person can be accurately recognized by using the line neural network model that learns gender classification using various body region information based on Deep Learning.

또한, 본 발명의 실시 예에 따르면, 실내 환경뿐만 아니라 실외 환경의 근거리뿐만 아니라 원거리에서도 대상체인 사람의 몸 영역 정보를 이용하여 촬영된 사람에 대해 신뢰성이 높은 성별 인식을 수행할 수 있다. In addition, according to the embodiment of the present invention, it is possible to perform highly reliable gender recognition on a person who is photographed using body region information of a person who is a target, not only in an indoor environment but also in a remote area as well as in an outdoor environment.

도 1은 본 발명의 일 실시 예에 따른 성별 인식 시스템을 설명하기 위한 도면.
도 2 내지 6은 본 발명의 일 실시 예에 따른 성별 인식 장치를 설명하기 위한 도면들.
도 7 내지 도 9는 본 발명의 일 실시 예에 따른 성별 인식 방법을 설명하기 위한 도면들.
도 10 내지 도 14는 본 발명의 일 실시 예에 따른 회선 신경망 기반의 학습을 통한 성별 인식 방법의 성별 분류 결과를 나타낸 도면들.1 is a view for explaining a gender recognition system according to an embodiment of the present invention;
2 to 6 are views for explaining a gender recognition apparatus according to an embodiment of the present invention.
7 to 9 are views for explaining a gender recognition method according to an embodiment of the present invention.
10 to 14 are graphs showing gender classification results of a gender recognition method based on a line neural network based learning according to an embodiment of the present invention.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 실시 예를 가질 수 있는 바, 특정 실시 예들을 도면에 예시하고 이를 상세한 설명을 통해 상세히 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 본 발명을 설명함에 있어서, 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우 그 상세한 설명을 생략한다. 또한, 본 명세서 및 청구항에서 사용되는 단수 표현은, 달리 언급하지 않는 한 일반적으로 "하나 이상"을 의미하는 것으로 해석되어야 한다.While the present invention has been described in connection with certain exemplary embodiments, it is to be understood that the invention is not limited to the disclosed embodiments, but, on the contrary, is intended to cover various modifications and similarities. It should be understood, however, that the invention is not intended to be limited to the particular embodiments, but includes all modifications, equivalents, and alternatives falling within the spirit and scope of the invention. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, the present invention will be described in detail with reference to the accompanying drawings. In addition, the singular phrases used in the present specification and claims should be interpreted generally to mean "one or more " unless otherwise stated.

이하, 본 발명의 바람직한 실시 예를 첨부도면을 참조하여 상세히 설명하기로 하며, 첨부 도면을 참조하여 설명함에 있어, 동일하거나 대응하는 구성 요소는 동일한 도면번호를 부여하고 이에 대한 중복되는 설명은 생략하기로 한다.DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings, wherein like reference numerals refer to like or corresponding components throughout. .

도 1은 본 발명의 일 실시 예에 따른 성별 인식 시스템을 설명하기 위한 도면이다.1 is a view for explaining a gender recognition system according to an embodiment of the present invention.

도 1을 참조하면, 성별 인식 시스템은 영상 촬영 장치(10) 및 성별 인식 장치(20)를 포함한다. Referring to FIG. 1, the gender recognition system includes a photographing device 10 and a gender recognizing device 20. As shown in FIG.

영상 촬영 장치(10)는 대상체인 사람을 촬영하여 영상 정보를 입력한다. 영상 촬영 장치(10)는 예를 들면, 실내 환경 또는 실외 환경에 위치한 CCTV(Closed-Circuit Television　)일 수 있으나, 이에 한정되지 않으며, 영상을 촬영하는 장치를 모두 포함한다. 영상 촬영 장치는 가시광선 카메라 및 열화상 카메라 중 적어도 하나를 포함할 수 있다. 여기서, 열화상 카메라는 적외선을 이용하여 열을 감지하여 영상화하는 카메라일 수 있다. 즉, 영상 촬영 장치(10)는 가시광선 영상 정보 및 열화상 영상 정보 중 적어도 하나를 입력할 수 있다. The image photographing apparatus 10 photographs a person who is a target and inputs image information. The image capturing apparatus 10 may be, for example, a closed-circuit television (CCTV) located in an indoor environment or an outdoor environment, but is not limited thereto. The imaging device may include at least one of a visible light camera and a thermal imaging camera. Here, the thermal imaging camera may be a camera that senses heat and imaged using infrared rays. That is, the image capturing apparatus 10 can input at least one of visible light image information and thermal image information.

성별 인식 장치(20)는 입력된 영상에서 대상체인 사람의 성별을 인식한다. 성별 인식 장치(20)는 입력된 영상에서 대상체인 사람을 검출하고, 사람의 몸 영역 이미지 추출하여 몸 영역 정보를 생성한다. 성별 인식 장치(20)는 생성한 몸 영역 정보를 딥 러닝 기반의 회선 신경망 모델을 이용하여 이미지 특징 정보를 추출한다. 여기서, 성별 인식 장치(20)는 딥 러닝을 위하여 사람 영상을 포함한 데이터베이스를 이용한 학습 및 검증을 반복하여 수행한다. 성별 인식 장치(20)는 추출한 이미지 특징 정보를 이용하여 성별을 분류하는 성별 인식을 수행한다. 성별 인식 장치(20)는 이하 도 2 내지 도 6에서 더욱 상세히 설명하기로 한다.The gender recognition apparatus 20 recognizes the gender of the person who is the target in the input image. The gender recognition apparatus 20 detects a person who is a target in the input image, and extracts a body region image of a person to generate body region information. The gender recognition apparatus 20 extracts the image feature information using the deep-learning-based circuit neural network model with the generated body region information. Here, the gender recognition apparatus 20 repeatedly performs learning and verification using a database including a human image for deep learning. The gender recognition apparatus 20 performs gender recognition for classifying gender using the extracted image feature information. The gender recognition device 20 will be described in more detail below with reference to FIGS. 2 to 6. FIG.

도 2 내지 도 6은 본 발명의 일 실시 예에 따른 성별 인식 장치를 설명하기 위한 도면들이다. FIGS. 2 to 6 are views for explaining a gender recognition apparatus according to an embodiment of the present invention.

도 2를 참조하면, 성별 인식 장치(20)는 몸 영역 정보 생성부(100), 이미지 특징 추출부(200), 성별 인식부(300) 및 성별 인식 학습부(400)를 포함한다.Referring to FIG. 2, the gender recognizing apparatus 20 includes a body region information generating unit 100, an image feature extracting unit 200, a gender recognizing unit 300, and a gender aware learning unit 400.

몸 영역 정보 생성부(100)는 입력된 영상에서 대상체인 사람을 검출하고, 사람의 몸 영역 이미지를 추출하여 몸 영역 정보를 생성한다.The body region information generation unit 100 detects a person who is a target in the input image, and extracts a body region image of a person to generate body region information.

도 3을 참조하면, 몸 영역 정보 생성부(100)는 대상체 검출부(120) 및 몸 영역 이미지 생성부(140)를 포함한다.Referring to FIG. 3, the body region information generation unit 100 includes a body detection unit 120 and a body region image generation unit 140.

대상체 검출부(120)는 입력된 영상 정보에서 사람(대상체)을 검출한다. 대상체 검출부(120)는 공지의 대상체 감지 알고리즘을 이용하여 사람을 검출할 수 있다. 대상체 검출부(120)는 입력되는 영상으로 가시광선 영상 정보 및 열화상 영상 정보를 동시에 이용하는 경우에는 주변 환경 및 날씨의 영향을 덜 받고 정확도 높게 대상체인 사람을 검출할 수 있다.The object detecting unit 120 detects a person (object) from the input image information. The object detection unit 120 can detect a person using a known object detection algorithm. When the visible light ray image information and the thermal image information are simultaneously used as the input image, the object detecting unit 120 can detect the target chain person with less influence from the surrounding environment and the weather and with high accuracy.

몸 영역 이미지 생성부(140)는 검출된 사람 이미지에서 몸 영역 이미지를 생성한다. 몸 영역 이미지 생성부(140)는 가시광선 몸 영역 이미지 및 열화상 몸 영역 이미지 중 적어도 하나를 생성할 수 있다. 몸 영역 이미지 생성부(140)는 크기 정규화 과정(size normalization step)을 통하여 몸 영역 이미지를 미리 설정된 크기 예를 들면, 183 × 119 픽셀로 생성할 수 있다. 여기서, 크기 정규화 과정은 검출되는 사람에 대한 이미지가 근거리인 경우와 원거리인 경우의 차이를 보상하기 위한 것이다. 또한, 몸 영역 이미지 생성부(140)는 제로 중심 방법(zero-center method)을 수행하여 몸 영역 이미지의 조명(illumination)을 정규화(normalization) 할 수 있다. The body region image generating unit 140 generates a body region image from the detected human image. Body region image generating section 140 may generate at least one of a visible light body region image and a thermal image body region image. The body region image generating unit 140 may generate a body region image with a predetermined size, for example, 183 × 119 pixels through a size normalization step. Here, the size normalization process is to compensate for the difference between the case where the image of the person to be detected is near and the case where the image is far away. In addition, the body-area image generator 140 may normalize the illumination of the body-area image by performing a zero-center method.

이미지 특징 추출부(200)는 생성한 몸 영역 정보를 딥 러닝 기반의 회선 신경망(Convolutional Neural Network; CNN) 모델을 이용하여 이미지 특징 정보를 추출한다. 이미지 특징 추출부(200)는 가시광선 몸 영역 이미지 및 열화상 몸 영역 이미지 중 적어도 하나의 이미지를 입력하여 성별 인식을 위한 이미지 특징 정보를 추출할 수 있다. 이미지 특징 추출부(200)는 오버 피팅 문제(over-fitting problem)을 경감하기 위하여 데이터 확장 방법(data augmentation) 및 드롭아웃(dropout) 방법을 적용할 수 있다. 여기서, 데이터 확장 방법은 원본 이미지들과 약간 다른 몇 개의 확장 데이터를 더 추가하는 것으로, 원본 이미지의 좌우상하 측면에서 미리 설정된 픽셀들을 제거하고 이를 이용하여 가상의 확장된 데이터를 생성하여 추가하는 것이다. 드롭아웃 방법은 회선 신경망 모델의 완전 연결 레이어(Fully Connected Layer)에서 이전 레이어와 다음 레이어의 연결을 분리하며, 특정 확률을 가진 숨겨진 뉴런의 출력을 0으로 설정함에 의해 수행될 수 있다. 여기서, 특정 확률은 입력 데이터에 종속적이고 실험을 통하여 결정될 수 있다. The image feature extraction unit 200 extracts image feature information using the deep-learning-based Convolutional Neural Network (CNN) model. The image feature extraction unit 200 may extract image feature information for gender recognition by inputting at least one image of the visible light body region image and the thermal image body region image. The image feature extraction unit 200 may apply a data augmentation method and a dropout method in order to alleviate an over-fitting problem. Here, the data extension method is to add a few different extension data slightly different from the original images. In this case, the preset pixels are removed from the upper and lower sides of the original image, and a virtual extended data is generated and added using the pixels. The dropout method can be performed by separating the connections of the previous layer and the next layer in the fully connected layer of the circuit neural network model and setting the output of hidden neurons with a certain probability to zero. Here, the specific probability is dependent on the input data and can be determined through experimentation.

도 4 및 도 5를 참조하면, 이미지 특징 추출부(200)는 이미지 입력 레이어(220), 특징 추출 레이어(240) 및 분류 레이어(260)를 포함한다.4 and 5, the image feature extraction unit 200 includes an image input layer 220, a feature extraction layer 240, and a classification layer 260.

이미지 입력 레이어(220)는 미리 설정한 스케일의 몸 영역 이미지를 입력한다. 이미지 입력 레이어(220)는 예를 들면, 183 × 119 픽셀의 크기의 가시광선 몸 영역 이미지 또는 동일한 크기의 열화상 몸 영역 이미지를 입력할 수 있다. The image input layer 220 inputs a body region image of a predetermined scale. The image input layer 220 may, for example, input a visible light body region image of a size of 183 x 119 pixels or an image body region image of the same size.

특징 추출 레이어(240)는 입력한 몸 영역 이미지에 회선 신경망 모델을 적용하여 이미지 특징을 추출한다. 특징 추출 레이어(284)는 (1) 5개의 콘볼루션 레이어들을 포함할 수 있고, (2) 각각의 콘볼루션 레이어들은 교정 선형 유닛(Rectified Linear Unit); 이하 'ReLU') 레이어와 함께 하고, (3) 풀링(pooling) 레이어를 포함할 수 있다. The feature extraction layer 240 extracts image features by applying a line neural network model to the input body region image. Feature extraction layer 284 may include (1) five convolution layers, (2) each convolution layer may comprise a rectified linear unit; ('ReLU') layer, and (3) a pooling layer.

여기서, 제1 콘볼루션 레이어는 183 × 119 픽셀의 이미지(가시광선 몸 영역 이미지 또는 열화상 몸 영역 이미지)가 입력되며, 수평 및 수직 방향에서 스트라이드 2×2 픽셀로 크기 11×11 픽셀의 96개의 필터를 이용하여 콘볼루션화될 수 있다(Conv: 96 @ 11 × 11; Stride 2 × 2). 제1 콘볼루션 레이어는 크기 87 × 55 픽셀의 96 특징 맵들을 출력한다. CNN 구조를 이미지 변환에 더욱 강건하게 만들기 위하여 96 특징 맵들을 2 픽셀 스트라이드로 3×3 공간 이웃 픽셀의 최대 값으로 적용하는 제1 맥스 풀링 레이어에 의해 처리된 후 다운샘플링 된다. 결과적으로 제1 콘볼루션 레이어의 출력은 크기 43 × 27 픽셀 크기의 96개의 특징 맵들이다. Here, the first convolution layer is an image of 183 x 119 pixels (visible ray body region image or thermal image of the body region image) input, and in the horizontal and vertical directions, 96 x 11 pixels of size 11 x 11 pixels (Conv: 96 @ 11 x 11; Stride 2 x 2). The first convolution layer outputs 96 feature maps of size 87 x 55 pixels. The CNN structure is processed and downsampled by a first Max-Pulling layer that applies the 96 feature maps to a maximum of 3 x 3 spatial neighboring pixels in a two-pixel stride to make the image transformation more robust. As a result, the output of the first convolution layer is 96 feature maps of size 43 x 27 pixels in size.

제2 콘볼루션 레이어는 제1 콘볼루션 레이어를 미세하게 조정하기 위하여 2 픽셀의 패딩, 스트라이드 1 픽셀로 크기 5×5×96 크기의 128개의 필터를 포함한다(Conv: 128 @ 5 × 5 × 96; Stride: 1 × 1; Padding: 2 × 2). 제2 콘볼루션 레이어는 크기 21 × 13 픽셀 크기의 128개의 특징 맵들을 출력한다. 제1 콘볼루션 레이어 및 제2 콘볼루션 레이어는 예를 들면, 가장자리 부분들(edges) 또는 색깔 부분들(blobs)의 텍스처(texture) 특징 등 낮은 수준의 이미지 특징들을 추출하는 데 이용될 수 있다.The second convolution layer includes 128 filters of size 5x5x96 with 2 pixel padding, stride 1 pixel to fine tune the first convolution layer (Conv: 128 @ 5x5x96 ; Stride: 1x1; Padding: 2x2). The second convolution layer outputs 128 feature maps of size 21 x 13 pixels in size. The first convolution layer and the second convolution layer may be used to extract low-level image features, such as, for example, texture features of edge or color blobs.

제3 콘볼루션 레이어는 1×1 픽셀의 패딩, 스트라이드 1×1 픽셀로 크기 3×3×128 의 256개의 필터를 적용하여 크기 21 × 13 픽셀 크기의 256개의 특징 맵들을 출력한다(Conv: 256 @ 3 × 3 × 128; Stride 1 × 1; Padding: 1 × 1). The third convolution layer outputs 256 feature maps of size 21 × 13 pixels by applying 256 1 × 1 × 1 pixels with padding of 1 × 1 pixels and a size of 3 × 3 × 128 (Conv: 256 @ 3 x 3 x 128; Stride 1 x 1; Padding: 1 x 1).

제4 콘볼루션 레이어는 1×1 픽셀의 패딩, 스트라이드 1×1 픽셀로 크기 3×3×256 크기의 256개의 필터를 적용하여 크기 21 × 13 픽셀의 256개의 특징 맵들을 출력한다(Conv: 256 @ 3 × 3 × 256; Stride 1 × 1; Padding: 1 × 1). The fourth convolution layer outputs 256 feature maps of size 21 × 13 pixels (Conv: 256) by applying 256 pixels of size 3 × 3 × 256 with 1 × 1 pixel padding and stride 1 × 1 pixels @ 3 x 3 x 256; Stride 1 x 1; Padding: 1 x 1).

제5 콘볼루션 레이어는 1×1 픽셀의 패딩, 스트라이드 1×1 픽셀로 크기 3×3×256 크기의 128개의 필터를 적용하여 콘볼루션화하여 크기 10 × 6 픽셀의 128개의 특징 맵들을 출력한다(Conv: 128 @ 3 × 3 × 256; Stride 1 × 1; Padding: 1 × 1). The fifth convolution layer convolutionizes 1 × 1 pixel padding, stride 1 × 1 pixel, 128 filters of size 3 × 3 × 256, and outputs 128 feature maps of size 10 × 6 pixels (Conv: 128 @ 3 x 3 x 256; Stride 1 x 1; Padding: 1 x 1).

이후 특징 추출 레이어(240)는 제1 내지 제5 콘볼루션 레이어들을 이용하여 출력한 크기 10×6 픽셀의 128 특징 맵을 4096, 1024 및 2 뉴런(neurons)을 각각 포함하는 3개의 완전 연결 레이어(Fully Connected Layers)로 처리한다.Then, the feature extraction layer 240 extracts 128 feature maps of size 10 x 6 pixels output using the first through fifth convolution layers to three fully connected layers (1, 2, 3, 4, Fully Connected Layers).

분류 레이어(260)는 마지막 완전 연결 레이어로 단지 2개의 뉴런을 포함하며, 각각의 가시광선 몸 영상 이미지 또는 열화상 몸 영상 이미지를 위한 1024-차원 공간에서 이미지 특징 벡터를 추출할 수 있다. 분류 레이어(260)는 회선 신경망 모델을 적용하여 추출한 특징에 의해 2개의 성별로 분류한다. The classification layer 260 may include only two neurons in the last fully connected layer and may extract image feature vectors in a 1024-dimensional space for each visible light body image or a thermal image of the body image. The classification layer 260 is classified into two genders according to features extracted by applying a line neural network model.

다시 도 2를 참조하면, 성별 인식부(300)는 추출된 이미지 특징을 이용하여 사람의 성별을 인식한다. Referring again to FIG. 2, the gender recognition unit 300 recognizes the sex of a person using the extracted image characteristics.

성별 인식부(300)는 가시광선 몸 영역 이미지 및 열화상 몸 영역 이미지로부터 추출한 이미지 특징을 조합하고 서포트 벡터 머신(Support Vector Machine; SVM)을 이용하여 성별을 인식할 수 있다.The gender recognition unit 300 may combine the image features extracted from the visible light body region image and the thermal image body region image and recognize the gender using a support vector machine (SVM).

도 6을 참조하면, 성별 인식부(300)는 특징 레벨 융합부(320) 및 스코어 레벨 융합부(340)를 포함한다. Referring to FIG. 6, the gender recognition unit 300 includes a feature level fusion unit 320 and a score level fusion unit 340.

특징 레벨 융합부(320)는 주성분 분석(Principal Component Analysis; PCA)에 의해 추출한 가시광선 이미지 특징 및 열화상 이미지 특징 중 최적 특징을 선택하고 서포트 벡터 머신 방법을 이용하여 성별 인식을 수행한다. 여기서, 주성분 분석은 성별을 인식하기 이전에 노이즈를 제거하고 특징 차원을 축소하기 위함이다. 왜냐하면, 추출된 이미지 특징은 대상체의 몸 영역 이미지가 배경, 의류, 장식품, 몸 포즈, 헤어스타일 및 다른 요소의 차이점으로 인하여 매우 넓은 변형을 가질 수 있어 과잉 정보(redundant information)를 포함할 수 있기 때문이다. 서포트 벡터 머신 방법은 학습 데이터를 가지고 분류 파라미터들과 함께 여러 가지 서포트 벡터들, xi를 획득하기 위하여 지도 학습(supervised learning) 방법으로 기계를 학습시킨다. 성별 인식 장치(20)는 서포트 벡터 머신 방법에 의한 검증 이미지의 분류 레벨은 식 (1)에 의해 평가될 수 있다. The feature level fusion unit 320 selects an optimal feature among the visible light image feature and the thermal image feature extracted by the Principal Component Analysis (PCA) and performs the gender recognition using the support vector machine method. Here, the principal component analysis is to remove the noise and reduce the feature dimension before recognizing the sex. This is because the extracted image feature may include redundant information because the body region image of the object may have a very wide variation due to differences in background, clothing, ornament, body pose, hairstyle, and other factors to be. The support vector machine method learns the machine by supervised learning method to obtain various support vectors, xi with classification parameters with learning data. The classification level of the verification image by the support vector machine method of the gender recognition apparatus 20 can be evaluated by equation (1).

(1)

(One)

여기서, yi 는 서포트 벡터 xi에 상응하는 분류 레벨이고, K(×)는 저 차원에서 고차원 특징 공간으로 입력 데이터를 맵핑하기 위하여 이용되는 커널 함수임.Here, yi is a classification level corresponding to the support vector xi, and K (x) is a kernel function used for mapping the input data to the high dimensional feature space at low dimension.

본 발명에서는 식 (2) 의 선형 기반 함수 및 식 (3)의 방사 기반 함수(RBF; Radial Basis Function) 등 2가지의 다른 서포트 벡터 머신 커널들을 이용할 수 있다.In the present invention, two different support vector machine kernels may be used, such as the linear based function of equation (2) and the radial basis function (RBF) of equation (3).

Linear kernel:

(2)Linear kernel:

(2)

RBF kernel:

(3)RBF kernel:

(3)

또한, 본 발명에서는 서포트 벡터 머신 방법 적용을 위하여 오픈 소스 라이브러리인 LibSVM 라이브러리를 이용할 수 있다.Further, in the present invention, the LibSVM library which is an open source library can be used for applying the support vector machine method.

특징 레벨 융합부(320)는 회선 신경망 모델을 이용하여 추출한 가시광선 이미지 특징 및 열화상 이미지 특징을 다음 수식 (4)을 이용하여 연관시킬 수 있다. The feature level fusion unit 320 can associate the visible light image feature and the thermal image feature extracted using the circuit neural network model using the following equation (4).

(4)

여기서, fc 는 융합 특징으로 2가지 타입의 이미지의 이미지 특징을 연관(concatenation)하고, fv 및 ft는 가시광선 이미지 및 열화상 이미지에서 추출한 이미지 특징들임.Where fc concatenates image features of two types of images with fusing characteristics, and fv and ft are image features extracted from visible light and infrared images.

스코어 레벨 융합부(340)는 서포트 벡터 머신 방법을 이용하여 가시광선 이미지 특징 및 열화상 이미지 특징 각각에 대해 성별 인식을 수행한다. 스코어 레벨 융합부(340)는 가시광선 이미지 특징에 주성분 분석을 통하여 가시광선 최적 특징을 선택하고, 열화상 이미지 특징에 주성분 분석을 통하여 열화상 최적 특징을 선택한다. 스코어 레벨 융합부(340)는 가시광선 최적 특징 및 열화상 최적 특징 각각에 대해 서포트 벡터 머신 방법을 적용하여 산출한 가시광선 서포트 벡터 머신 점수(가시광선 이미지 특징에 대한 성별 확률(남성 확률 또는 여성 확률)에 대한 점수(가시광선 이미지 점수)) 및 열화상 서포트 머신 점수(열화상 이미지 특징에 대한 성별 확률에 대한 점수(열화상 이미지 점수))를 산출한다. 스코어 레벨 융합부(340)는 산출한 가시광선 서포트 벡터 머신 점수 및 열화상 서포트 벡터 머신 점수를 조합하여 새로운 특징인 스코어 융합 특징(서포트 벡터 머신 점수 조합)을 생성하고 이를 이용하여 성별 인식을 수행한다. 스코어 레벨 융합부(340)는 스코어 융합 특징 생성을 위하여 서포트 벡터 머신에 기반한 또 하나의 분류 레이어를 이용할 수 있다. The score level fusion unit 340 performs gender recognition for each of the visible light image feature and the thermal image feature using the support vector machine method. The score level fusion unit 340 selects the visible light optimum feature through principal component analysis on the visible light image feature and selects the thermal image optimal feature through principal component analysis on the thermal image feature. The score level fusion unit 340 may be configured to generate a visual ray support vector machine score (a sex probability (a male probability or a female probability) for visible light ray image features calculated by applying a support vector machine method for each of the visible light ray feature ) (Score of visible light image)) and the thermal image support machine score (the score for the sex probability (thermal image score) for the thermal image feature). The score level fusion unit 340 generates a score fusion feature (support vector machine score combination) that is a new feature by combining the calculated visible light support vector machine score and the thermal image support vector machine score, and performs gender recognition using the score fusion feature . The score level fusion unit 340 may use another classification layer based on the support vector machine for generating the score fusion characteristic.

다시 도 2를 참조하면, 성별 인식 학습부(400)는 사람 이미지에 대한 데이터베이스를 이용하여 사람의 성별 인식을 학습하고 검증한다. 여기서, 데이터베이스는 예를 들면, 254명의 남성과 158명의 영성을 포함한 412명의 사람의 이미지를 포함할 수 있다. 각각의 사람은 10장의 가시광선 이미지들과 10장의 열화상 이미지들을 포함할 수 있다. 결과적으로 데이터베이스는 4120장의 가시광선 이미지와 4120의 열화상 이미지를 포함한 8240장의 이미지를 포함할 수 있다. 포함된 모든 이미지는 연구실에서 감시 시스템의 실제 운영을 시뮬레이션 하기 위하여 관찰 장면과 비교하였을 때 대략적으로 6m의 높이에 위치한 듀얼 가시광선 및 열화상 카메라를 이용하여 캡쳐될 수 있다. 데이터베이스는 다양한 구성(texture)과 몸의 포즈를 포함할 수 있다. Referring again to FIG. 2, the gender aware learning unit 400 learns and verifies a person's gender recognition using a database of human images. Here, the database may include, for example, images of 412 persons, including 254 males and 158 spiritualities. Each person may include 10 visible light images and 10 thermal images. As a result, the database may contain 8240 images, including 4120 visible light images and 4120 thermal images. All included images can be captured using dual visible light and thermal cameras located approximately 6 meters high when compared to the observation scene to simulate the actual operation of the surveillance system in the laboratory. A database can contain various configurations of texture and body pose.

성별 인식 학습부(400)는 인식 시스템의 정확도를 측정하기 위하여 5배 교차 유효성 방법을 수행할 수 있다. 성별 인식 학습부(400)는 전체 데이터베이스를 분할하는 방법을 통하여 서브 데이터베이스의 학습 및 검증을 5번 수행할 수 있다. 각각의 분할 방법은 서브 데이터베이스 학습을 위하여 남성 및 여성의 대략 80%의 이미지를 이용할 수 있고, 남은 20%의 이미지는 검증에 할당될 수 있다. 결과적으로 성별 인식 학습부(400)는 5번의 학습 서브 데이터베이스 및 검증 서브 데이터베이스를 이용할 수 있다. 여기서, 각각의 학습 서브 데이터베이스는 204개의 남성 이미지와 127개의 여성 이미지를 포함하고, 각각의 검증 서브 데이터베이스는 50개의 남성 이미지와 31개의 여성 이미지를 포함할 수 있다. The gender awareness learning unit 400 may perform a 5x cross validation method to measure the accuracy of the recognition system. The gender awareness learning unit 400 can perform the learning and verification of the sub database five times by dividing the entire database. Each segmentation method can use approximately 80% of the images of male and female for sub-database learning, and the remaining 20% of images can be assigned to the verification. As a result, the gender aware learning unit 400 can use five learning subdatabases and a verification subdatabase. Here, each learning sub-database includes 204 male images and 127 female images, and each of the verification sub-databases may include 50 male images and 31 female images.

이용된 자체 구성 데이터베이스 Used self-configuration database DatabaseDatabase MalesMales Females Females TotalTotal Number of PersonsNumber of Persons 254254 158158 412 (persons)412 (persons) Number of ImagesNumber of Images 50805080 31603160 8240 (images)8240 (images)

성별 인식 학습부(400)는 앞 서 설명한 오버 피팅 문제를 줄이기 위하여 원본 데이터베이스에서 데이터베이스 크기를 확대하기 위하여 확장 데이터베이스를 생성할 수 있다. 성별 인식 학습부(400)는 남성 이미지 수가 여성 이미지 수보다 많은 경우 남성 이미지 수와 여성 이미지 수를 비슷하게 만들기 위하여 여성 이미지의 확장된 이미지들의 수보다 남성 이미지의 확장된 이미지들의 수를 더 적게 확장할 수 있다. 성별 인식 학습부(400)는 이를 위하여 원본 이미지의 좌우상하 측면으로부터 2 또는 4 픽셀을 제거하여 각 남성 이미지로부터 18 이미지를 더 생성할 수 있고 각 여성 이미지로부터 30 이미지를 더 생성할 수 있다. 성별 인식 학습부(400)는 결과적으로 표 2에 보여진 확장된 데이터베이스를 구성할 수 있다. The gender awareness learning unit 400 may generate an extended database to enlarge the database size in the original database in order to reduce the over-fitting problem described above. The gender awareness learning unit 400 may expand the number of expanded images of the male image to be smaller than the number of expanded images of the female image to make the number of male images and the number of images of the female similar to each other when the number of male images is larger than the number of female images . The gender awareness learning unit 400 may further generate 18 images from each male image by removing 2 or 4 pixels from the left and right and upper and lower sides of the original image to generate 30 images from each female image. The gender awareness learning unit 400 can construct the extended database shown in Table 2 as a result.

확장된 데이터베이스 구성Extended database configuration DatabaseDatabase MalesMales FemalesFemales TotalTotal Augmented DatabaseAugmented Database Learning DatabaseLearning Database Number of PersonsNumber of Persons 204 (persons)204 (persons) 127 (persons)127 (persons) 331 (persons)331 (persons) Number of ImagesNumber of Images 73,440 (204 × 20 × 18 images)73,440 (204 x 20 x 18 images) 76,200 (127 × 20 × 30 images)76,200 (127 x 20 x 30 images) 149,640 images149,640 images Testing DatabaseTesting Database Number of PersonsNumber of Persons 50 (persons)50 (persons) 31 (persons)31 (persons) 81 (persons)81 (persons) Number of ImagesNumber of Images 18,000 (50 × 20 × 18 images)18,000 (50 × 20 × 18 images) 18,600 (31 × 20 × 30 images)18,600 (31 x 20 x 30 images) 36,600 images36,600 images

도 7 내지 도 9는 본 발명의 일 실시 예에 따른 성별 인식 방법을 설명하기 위한 도면들이다.7 to 9 are diagrams for explaining a gender recognition method according to an embodiment of the present invention.

도 7을 참조하면, 본 발명의 일 실시 예에 따른 성별 인식 방법을 설명하면, 단계 S710에서 성별 인식 장치(20)는 입력된 가시광선 영상으로부터 사람을 검출하고, 검출된 사람의 가시광선 몸 영역 이미지를 생성한다.Referring to FIG. 7, a gender recognition method according to an embodiment of the present invention will be described. In step S710, the gender recognition apparatus 20 detects a person from the input visible light ray image, Create an image.

단계 S720에서 성별 인식 장치(20)는 입력된 열화상 영상으로부터 사람을 검출하고, 검출된 사람의 열화상 몸 영역 이미지를 생성한다.In step S720, the gender recognition device 20 detects a person from the input thermal image and generates an image of the detected person's thermal image.

단계 S730에서 성별 인식 장치(20)는 생성된 가시광선 몸 영역 이미지에 딥 러닝 기반의 회선 신경망 모델을 적용하여 가시광선 이미지 특징을 추출한다. In step S730, the gender recognition apparatus 20 applies a deep learning-based circuit neural network model to the generated visible light body region image to extract a visible light image feature.

단계 S740에서 성별 인식 장치(20)는 생성된 열화상 몸 영역 이미지에 딥 러닝 기반의 회선 신경망 모델을 적용하여 열화상 이미지 특징을 추출한다.In step S740, the gender recognition apparatus 20 applies the deep learning-based circuit neural network model to the generated thermal image area image to extract thermal image features.

단계 S750에서 성별 인식 장치(20)는 추출한 가시광선 이미지 특징 및 열화상 이미지 특징을 조합하고 조합된 이미지 특징을 이용하여 검출된 사람의 성별을 인식한다.In step S750, the gender recognition apparatus 20 combines the extracted visible light image characteristic and the thermal image characteristic, and recognizes the sex of the detected person using the combined image characteristic.

단계 S760에서 성별 인식 장치(20)는 인식된 성별 결과를 출력한다.In step S760, the gender recognition device 20 outputs the recognized gender result.

도 8을 참조하면, 본 발명의 일 실시 예에 따른 특징 레벨 융합을 통한 성별 인식 방법을 설명하면, 단계 S810에서 성별 인식 장치(20)는 입력된 가시광선 영상으로부터 사람을 검출하고, 검출된 사람의 가시광선 몸 영역 이미지를 생성한다.Referring to FIG. 8, in step S810, the gender recognition apparatus 20 detects a person from the input visible light ray image, Lt; / RTI > body-area image of the patient.

단계 S820에서 성별 인식 장치(20)는 입력된 열화상 영상으로부터 사람을 검출하고, 검출된 사람의 열화상 몸 영역 이미지를 생성한다.In step S820, the gender recognition device 20 detects a person from the input thermal image and generates an image of the detected person's thermal image.

단계 S830에서 성별 인식 장치(20)는 생성된 가시광선 몸 영역 이미지에 딥 러닝 기반의 회선 신경망 모델을 적용하여 가시광선 이미지 특징을 추출한다. In step S830, the gender recognition apparatus 20 applies a deep learning-based circuit neural network model to the generated visible light body region image to extract a visible light image feature.

단계 S840에서 성별 인식 장치(20)는 생성된 열화상 몸 영역 이미지에 딥 러닝 기반의 회선 신경망 모델을 적용하여 열화상 이미지 특징을 추출한다.In step S840, the gender recognition apparatus 20 applies a deep learning-based circuit neural network model to the generated thermal image area image to extract thermal image features.

단계 S850에서 성별 인식 장치(20)는 주성분 분석(Principal Component Analysis; PCA)에 의해 추출한 가시광선 이미지 특징 및 열화상 이미지 특징 중 최적 특징을 선택하고 서포트 벡터 머신 방법을 이용하여 성별 인식을 수행한다. In step S850, the gender recognition apparatus 20 selects an optimal feature among the visible light image feature and the thermal image feature extracted by the Principal Component Analysis (PCA), and performs gender recognition using the support vector machine method.

단계 S860에서 성별 인식 장치(20)는 인식된 성별 결과를 출력한다.In step S860, the gender recognition device 20 outputs the recognized gender result.

도 9을 참조하면, 본 발명의 일 실시 예에 따른 스코어 레벨 융합을 통한 성별 인식 방법을 설명하면, 단계 S910에서 성별 인식 장치(20)는 입력된 가시광선 영상으로부터 사람을 검출하고, 검출된 사람의 가시광선 몸 영역 이미지를 생성한다.Referring to FIG. 9, the gender recognition method using the score level convergence according to an embodiment of the present invention will be described. In step S910, the gender recognition apparatus 20 detects a person from the inputted visible ray image, Lt; / RTI > body-area image of the patient.

단계 S920에서 성별 인식 장치(20)는 입력된 열화상 영상으로부터 사람을 검출하고, 검출된 사람의 열화상 몸 영역 이미지를 생성한다.In step S920, the gender recognition apparatus 20 detects a person from the input thermal image and generates a thermal image of the body region of the detected person.

단계 S930에서 성별 인식 장치(20)는 생성된 가시광선 몸 영역 이미지에 딥 러닝 기반의 회선 신경망 모델을 적용하여 가시광선 이미지 특징을 추출하고 주성분 분석(Principal Component Analysis; PCA)에 의해 가시광선 최적 특징을 선택한다.In step S930, the gender recognition apparatus 20 extracts visible light image features by applying a deep learning-based line neural network model to the generated visible light body region image, and obtains a visible light optimal feature by Principal Component Analysis (PCA) .

단계 S940에서 성별 인식 장치(20)는 생성된 열화상 몸 영역 이미지에 딥 러닝 기반의 회선 신경망 모델을 적용하여 열화상 이미지 특징을 추출하고, 주성분 분석(Principal Component Analysis; PCA)에 의해 열화상 최적 특징을 선택한다.In step S940, the gender recognition apparatus 20 extracts the thermal image feature by applying a deep learning-based circuit neural network model to the generated thermal image of the body region of the thermal image, and calculates the thermographic image optimum by Principal Component Analysis (PCA) Select a feature.

단계 S950에서 성별 인식 장치(20)는 선택한 가시광선 최적 특징에 대해 서포트 벡터 머신 방법을 이용하여 성별 인식을 수행하여 가시광선 서포트 벡터 머신 점수를 산출한다.In step S950, the gender recognizing device 20 performs gender recognition using the support vector machine method for the selected visible ray optimum feature to calculate the visible ray support vector machine score.

단계 S960에서 성별 인식 장치(20)는 선택한 열화상 최적 특징에 대해 서포트 벡터 머신 방법을 이용하여 열화상 서포트 벡터 머신 점수를 산출한다.In step S960, the gender recognition apparatus 20 calculates the thermal image support vector machine score using the support vector machine method for the selected thermal image best feature.

단계 S970에서 성별 인식 장치(20)는 산출된 가시광선 서포트 벡터 머신 점수 및 열화상 서포트 벡터 머신 점수를 이용하여 서포트 벡터 머신 점수 조합을 생성하고 생성된 서포트 벡터 머신 점수 조합을 이용하여 성별 인식을 수행한다.In step S970, the gender recognition apparatus 20 generates a support vector machine score combination using the calculated visual beam support vector machine score and the thermal image support vector machine score, and performs gender recognition using the generated support vector machine score combination do.

단계 S980에서 성별 인식 장치(20)는 인식된 성별 결과를 출력한다.In step S980, the gender recognition device 20 outputs the recognized gender result.

도 10 내지 도 14는 본 발명의 일 실시 예에 따른 회선 신경망 기반의 학습을 통한 성별 인식 방법의 성별 분류 결과를 나타낸 도면들이다.FIGS. 10 to 14 illustrate gender classification results of the gender recognition method based on the circuit neural network based learning according to the embodiment of the present invention.

본 발명은 가시광선 몸 영역 이미지 및 열화상 몸 영역 이미지에 대해 각각 회선 신경망 기반의 이미지 특징 추출 모델을 학습시키기 위하여 학습 및 검증 절차를 수행하였다. 학습 및 검증 데이터베이스는 가시광선 이미지 및 열화상 이미지를 포함하며 74,820 이미지들 (36,720 남성 이미지들 (204 × 10 × 18) 및 38,100 여성 이미지들 (127 × 10 × 30))를 학습 데이터로 이용하였고, 18,300 이미지들 (9000 남성 이미지들 (50 × 10 × 18) 및 9300 여성 이미지들 (31 × 10 × 30))을 검증 데이터로 이용하였다. 또한, 회선 신경망 모델을 학습하기 MATLAB 실행을 이용하였다. 또한, 회선 신경망 모델의 에폭의 수는 60으로 설정하였으며, 최초 학습 레이트(rate)은 각 20 에폭 이후에 0.1의 학습 레이트 드롭 팩터와 함께 0.01로 설정하였다. The present invention performs learning and verification procedures to learn an image feature extraction model based on a line neural network for a visible light body region image and a thermal image body region image, respectively. The learning and validation database used were 74,820 images (36,720 male images (204 x 10 x 18) and 38,100 female images (127 x 10 x 30)) as learning data, including visible light and thermal images, 18,300 images (9000 male images (50 x 10 x 18) and 9300 female images (31 x 10 x 30)) were used as verification data. In addition, MATLAB execution was used to study the neural network model. In addition, the number of epochs in the circuit neural network model was set to 60, and the initial learning rate was set to 0.01 with a learning rate drop factor of 0.1 after each 20 epochs.

도 10을 참조하면, 본 발명에 따른 성별 인식 결과로 ROC(Receiver-Operating Curve) 곡선에서 가시광선 이미지만을 이용한 회선 신경망 기반의 평균 EER(Equal Error Rate)은 대략 17.216%이고, 열화상 이미지만을 이용한 회선 신경망 기반의 평균 EER은 대략 16.610%였다. 여기서, EER(Equal Error Rate)은 동일 오류 율로 대상자를 다른 사람으로 인식하거나 인식을 하지 못하는 비율을 의미한다. Referring to FIG. 10, the average EER (Equal Error Rate) based on the circuit neural network using only the visible ray image in the ROC (Receiver-Operating Curve) curve is about 17.216% as a result of the gender recognition according to the present invention, The average EER based on the circuit neural network was approximately 16.610%. Here, the Equal Error Rate (EER) means a rate at which the subject is recognized as another person or can not be recognized at the same error rate.

도 11을 참조하면, 본 발명에 따른 성별 인식 결과로 ROC(Receiver-Operating Curve) 곡선에서 가시광선 이미지에서 주성분 분석 및 서포트 벡터 머신 방법을 적용한 회선 신경망 기반의 평균 EER은 대략 17.064%이고, 열화상 이미지에서 주성분 분석 및 서포트 벡터 머신 방법을 적용한 회선 신경망 기반의 평균 EER은 대략 16.114%였다.11, the average EER based on the circuit neural network based on the principle component analysis and the support vector machine method in the visible light image in the ROC (Receiver-Operating Curve) curve is about 17.064% The average EER based on the circuit neural network using principal component analysis and support vector machine method in the image was approximately 16.114%.

도 12를 참조하면, 본 발명에 따른 성별 인식 결과로 ROC(Receiver-Operating Curve) 곡선에서 가시광선 이미지 및 열화상 이미지를 특징 레벨 융합 방법을 적용한 회선 신경망 기반의 평균 EER은 대략 11.439%이고, 스코어 레벨 융합 방법을 적용한 회선 신경망 기반의 평균 EER은 대략 11.713%였다. 도 12에서 보여진 바와 같이, 가시광선 이미지 및 열화상 이미지를 조합하는 경우 가시광선 이미지 또는 열화상 이미지를 단독으로 이용하는 경우보다 인식 정확도가 더 높아짐을 알 수 있다. Referring to FIG. 12, the average EER based on the line-by-line neural network based on the receiver-operating curve (ROC) curves of the visible light image and the thermal image according to the present invention is about 11.439% The average EER based on the line neural network using the level fusion method was approximately 11.713%. As shown in Fig. 12, when the visible light image and the thermal image are combined, the recognition accuracy is higher than when the visible light image or the thermal image is used alone.

본 발명에 따른 성별 인식 방법과 다른 방법과의 인식 정확도 비교Comparison of recognition accuracy between gender recognition method and other methods according to the present invention MethodMethod Using Single Visible Light ImagesUsing Single Visible Light Images Using Single Thermal ImagesUsing Single Thermal Images Feature Level FusionFeature Level Fusion Score Level FusionScore Level Fusion HOG+SVM HOG + SVM 17.81717.817 20.46320.463 16.63216.632 16.22716.227 EWHOG+SVM EWHOG + SVM 15.11315.113 19.19819.198 14.76714.767 14.13514.135 wHOG+SVM wHOG + SVM 15.21915.219 18.25718.257 14.81914.819 13.06013.060 본 발명Invention 17.06417.064 16.11416.114 11.43911.439 11.71311.713

표 3을 참조하면, 본 발명에 따른 성별 인식 방법이 종래의 다른 방법들에 비해 인식 정확도가 많이 향상됨을 확인할 수 있다. Referring to Table 3, it can be seen that the gender recognition method according to the present invention improves recognition accuracy significantly compared to other methods.

도 13 및 도 14는 본 발명에 따른 성별 인식 방법에 의한 성별 인식 결과를 나타내는 샘플을 보여준다.13 and 14 show a sample showing the result of sex recognition by the gender recognition method according to the present invention.

도 13을 참조하면, 본 발명에 따른 성별 인식 방법은 사람들의 앞모습 또는 뒷모습 이미지인 경우에도 사람들의 성별을 정확하게 인식하였다. Referring to FIG. 13, the gender recognition method according to the present invention correctly recognizes the sex of people even in the case of the front view or the rear view image of people.

도 14를 참조하면, 본 발명에 따른 성별 인식 방법의 오류로 도 14 (a) 내지 도 14 (c)는 남성 이미지가 여성으로 잘못 인식된 예들이고, 도 14 (d) 내지 도 14 (f)는 여성 이미지가 남성으로 잘못 인식된 예들이다. 하지만, 도 14의 몸 영역 이미지는 매우 모호하며, 뒷모습에 두꺼운 겨울 의류를 입고 있는 경우도 있어 결과적으로 일반 사람들의 인식에 의한다 할 지라도 성별을 인식하기 쉽지 않을 수 있다. 더욱이 일반 감시 시스템은 카메라에서 대략 10m 거리에서 이미지를 촬영하므로 촬영된 이미지의 질이 일반적으로 좋지 않아 인식 시스템의 오류를 일으키는 부정적인 영향을 미칠 수 있다. 14 (a) through 14 (c) are examples in which the male image is mistakenly recognized as a woman, and FIGS. 14 (d) through 14 (f) Are examples of female images mistakenly recognized as males. However, the image of the body region shown in Fig. 14 is very vague and sometimes wearing a thick winter clothing in the rear view, so that it may not be easy to recognize sex even if it is based on the recognition of the general public. Moreover, since the general surveillance system shoots images at a distance of about 10 meters from the camera, the quality of the captured images is generally poor, which can have a negative impact on the recognition system.

본 발명의 실시 예에 따른 성별 인식 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 컴퓨터 판독 가능 매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 분야 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media) 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다.프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. 상술한 하드웨어 장치는 본 발명의 동작을 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다.The gender recognition method according to an embodiment of the present invention may be implemented in the form of a program command that can be executed through various computer means and recorded in a computer-readable medium. The computer readable medium may include program instructions, data files, data structures, and the like, alone or in combination. Program instructions to be recorded on a computer-readable medium may be those specially designed and constructed for the present invention or may be available to those skilled in the computer software arts. Examples of computer-readable media include magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD-ROMs and DVDs; magnetic media such as floppy disks; - Hardware devices specifically configured to store and execute program instructions, such as magneto-optical media and ROM, RAM (Random Access Memory), flash memory, etc. Examples of program instructions include those made by a compiler As well as high-level language code that can be executed by a computer using an interpreter or the like. The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the present invention, and vice versa.

이제까지 본 발명에 대하여 그 실시 예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시 예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.The embodiments of the present invention have been described above. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the disclosed embodiments should be considered in an illustrative rather than a restrictive sense. The scope of the present invention is defined by the appended claims rather than by the foregoing description, and all differences within the scope of equivalents thereof should be construed as being included in the present invention.

10: 영상 촬영 장치
20: 성별 인식 장치10: Video shooting device
20: Gender recognition device

Claims

In the gender recognition apparatus,
A body region information generating unit for detecting a person who is a target in the input image and extracting a body region image of a person to generate body region information;
An image feature extracting unit for extracting image feature information by using a deep-learning-based Convolutional Neural Network (CNN) model; And
And a gender recognition unit for recognizing a gender of a person using the extracted image feature,
The image feature extraction unit
An image input layer for inputting a body region image of a preset scale;
A feature extraction layer for extracting image features by applying a line neural network model to the input body image; And
And a classification layer for classifying the two sexes by using the extracted image feature,
The feature extraction layer
Five convolution layer sets and three fully connected layer sets,
The five convolution layer sets
A first convolution layer for inputting an image of 183 × 119 pixels and converting the 96 feature maps of size 43 × 27 pixels into convolutions by applying 96 filters of size 11 × 11,
A second convolution layer for transforming the 96 feature maps of 43 × 27 pixels into convolution using 128 filters of size 5 × 5 × 96 and outputting 128 feature maps of size 21 × 13 pixels;
A third convolution layer for convolutionizing the 128 feature maps of 21x13 pixels by applying 256 filters of size 3x3x128 to output 256 feature maps of size 21x13 pixels;
A fourth convolution layer for convolutionizing 256 feature maps of 21x13 pixels by applying 256 filters of size 3x3x256 to output 256 feature maps of size 21x13 pixels; And
And a fifth convolution layer for convolutionizing 256 feature maps of 21x13 pixels by applying 128 filters of size 3x3x256 to output 128 feature maps of size 10x6 pixels, Recognition device.

The method according to claim 1,
The body-area-information generating unit
A target object detection unit for detecting a target person in the input image information; And
And a body region image generating unit for generating a body region image from the detected person image.

delete

The method according to claim 1,
Wherein the body region information is visible light body image information and thermal image body image information,
The gender-
The visible ray image feature extracted from the visible light body image information through the circuit neural network model and the thermal image feature extracted from the thermal image body image information through the circuit neural network model are selected using the principal component analysis and the support vector machine method To perform gender recognition.

The method according to claim 1,
Wherein the body region information is visible light body image information and thermal image body image information,
The gender-
In the visible light body image information, the visible light ray feature extracted from the line neural network model is selected through principal component analysis,
The thermal image optimum feature is selected through the principal component analysis on the thermal image feature extracted through the circuit neural network model in the thermal image body image information,
A support vector machine score combination is generated using the visual beam support vector machine score and the thermal image support machine score calculated by applying the support vector machine method for each,
And a gender recognition device for performing gender recognition using the generated support vector machine score combination.

The method according to claim 1,
And a gender awareness learning unit that learns and verifies gender recognition of a person using a database of human images.

delete