KR20220036250A

KR20220036250A - Method for weakly supervised learning of neural network using verification information of identification card and system for the method

Info

Publication number: KR20220036250A
Application number: KR1020200118587A
Authority: KR
Inventors: 이동열
Original assignee: 주식회사 카카오뱅크
Priority date: 2020-09-15
Filing date: 2020-09-15
Publication date: 2022-03-22
Also published as: KR102528816B1

Abstract

본 발명은 신경망의 학습 방법에 관한 것으로 본 발명에 따른 신경망 학습 방법은 사용자의 신분증 이미지에 포함된 객체의 인식을 수행하는 신경망을 이용하여 인식 결과를 출력하는 단계; 상기 인식 결과의 적어도 일부에 대한 상담원의 검수 결과 또는 상기 사용자에 대해 수집된 신분 정보를 입력 받는 단계; 및 상기 검수 결과 또는 신분 정보를 각각 대응되는 상기 인식 결과의 정답 값(Ground-Truth)으로 이용하여 신경망을 학습시키는 단계를 포함한다. 본 발명에 따르면 신분증에 포함된 다양한 객체의 인식을 위한 신경망의 일부 객체에 대한 인식 결과 만으로도 추가적인 학습을 수행하여 신경망의 전체 예측 성능을 높일 수 있다.The present invention relates to a learning method of a neural network. The neural network learning method according to the present invention includes the steps of outputting a recognition result using a neural network that recognizes an object included in a user's identification card image; receiving a counselor's inspection result for at least part of the recognition result or inputting identity information collected about the user; And a step of training a neural network by using the inspection result or the identity information as the ground truth value of the corresponding recognition result. According to the present invention, the overall prediction performance of the neural network can be improved by performing additional learning with only the recognition results for some objects of the neural network for recognizing various objects included in the ID card.

Description

{Method for weakly supervised learning of neural network using verification information of identification card and system for the method}

본 발명은 신경망의 학습 방법에 관한 것으로, 신분증 인식을 위한 신경망의 간소화된 학습 방법에 관한 것이다.The present invention relates to a learning method of a neural network, and to a simplified learning method of a neural network for ID recognition.

이동통신단말기, 특히 스마트폰과 모바일 통신 기술의 발달에 힘입어 모바일 어플리케이션을 통해 과거에 오프라인을 통해 수행하던 기능을 지금은 언제 어디서나 간편하고 편리하게 이용할 수 있다.Thanks to the development of mobile communication terminals, especially smartphones, and mobile communication technology, functions that were previously performed offline can now be easily and conveniently used anytime, anywhere through mobile applications.

특히, 최근 들어 사용자의 편의성을 향상시키는 측면에서 어플리케이션을 통해 금융 업무까지 비대면(非對面)으로 처리하는 기술이 개발되어 서비스되고 있으며 비대면 상황에서 사용자의 신분을 확인하기 위한 다양한 방식들이 고안되고 있다.In particular, in recent years, in order to improve user convenience, technology has been developed and serviced to process financial transactions non-face-to-face through applications, and various methods have been devised to verify the user's identity in non-face-to-face situations. there is.

또한, 인공지능 기술의 발달로 다양한 분야에 인공지능 기술 들이 적용되고 있으며, 기존의 영상 처리 방식을 대신하여 신경망 모델을 통해 영상 내 객체를 검출하고 추적하는 방법들이 개발되고 있다. 이러한 인공지능 기술을 신분증 인식에 적용하여 사용자가 직접 신분 정보를 입력하는 대신 스마트 폰의 카메라로부터 촬영된 신분증 이미지의 정보들을 인식하고, 이를 통해 사용자의 신분을 인증하는 방식들이 금융 서비스 상에서 실제 적용되고 있다.In addition, with the development of artificial intelligence technology, artificial intelligence technologies are being applied to various fields, and methods of detecting and tracking objects in images through neural network models are being developed instead of existing image processing methods. By applying this artificial intelligence technology to ID recognition, the user recognizes the information in the ID card image taken from the smartphone camera instead of directly entering the ID information, and through this, the methods of authenticating the user's identity are actually applied in financial services. there is.

하지만, 신분증 이미지에는 인식하고자 하는 다양한 객체와 신분증 자체의 오염이나 위조 방지를 위해 코팅과 홀로그램 등의 다양한 객체들이 함께 표현되므로, 실제 사용환경에 따라 발생가능한 오류들을 반영하여 신경망을 학습시킬 필요가 있다.However, since various objects to be recognized and various objects such as coatings and holograms are expressed in the ID card image to prevent contamination or counterfeiting of the ID card itself, it is necessary to learn the neural network by reflecting errors that may occur depending on the actual usage environment. .

본 발명은 신분증 인식을 위한 신경망의 학습 방법을 제안하는 것을 목적으로 한다. The purpose of the present invention is to propose a neural network learning method for ID recognition.

또한, 본 발명은 실 사용과정에서 신분증의 인식된 정보의 검수 시 발생하는 정보를 이용하는 효율적인 학습 방법을 제안하는 것을 목적으로 한다.In addition, the purpose of the present invention is to propose an efficient learning method that uses information generated when verifying recognized information of an ID card during actual use.

상기 기술적 과제를 해결하기 위한 본 발명에 따른 신경망 학습 방법은 사용자의 신분증 이미지에 포함된 객체의 인식을 수행하는 신경망을 이용하여 인식 결과를 출력하는 단계; 상기 인식 결과의 적어도 일부에 대한 상담원의 검수 결과 또는 상기 사용자에 대해 수집된 신분 정보를 입력 받는 단계; 및 상기 검수 결과 또는 신분 정보를 각각 대응되는 상기 인식 결과의 정답 값(Ground-Truth)으로 이용하여 신경망을 학습시키는 단계를 포함한다.A neural network learning method according to the present invention for solving the above technical problem includes outputting a recognition result using a neural network that recognizes an object included in a user's ID card image; receiving a counselor's inspection result for at least part of the recognition result or inputting identity information collected about the user; And a step of training a neural network by using the inspection result or the identity information as the ground truth value of the corresponding recognition result.

상기 신경망은 회귀(Regression) 분석에 따른 객체의 존재 영역을 정의하는 제1 인식 결과 및 분류(Classification)에 따른 객체의 인식 값을 정의하는 제2 인식 결과를 출력하는 것이 바람직하다.The neural network preferably outputs a first recognition result that defines the area in which the object exists according to regression analysis and a second recognition result that defines the recognition value of the object according to classification.

상기 입력 받는 단계는 상기 제2 인식 결과에 대한 상담원의 검수 결과를 입력 받는 것이 바람직하다.In the input receiving step, it is preferable to receive the counselor's inspection result for the second recognition result.

상기 입력 받는 단계는 상기 제1 인식 결과에 대한 상담원의 오류 판단 결과를 입력 받는 것이 바람직하다.It is preferable that the input receiving step receives the counselor's error judgment result for the first recognition result.

상기 학습시키는 단계는 상기 제1 인식 결과에 대한 손실(loss)을 상수 값으로 고정(freezing)하여 신경망을 학습시키는 것이 바람직하다.In the learning step, it is preferable to train the neural network by freezing the loss of the first recognition result to a constant value.

상기 학습시키는 단계는 상기 제1 인식 결과에 대한 손실을 고정하고, 상기 제2 인식 결과에 대한 상기 검수 결과 또는 신분 정보를 정답 값으로 이용하여 산출된 손실을 이용하여 신경망을 학습시키는 것이 바람직하다.In the learning step, it is preferable to fix the loss for the first recognition result and train the neural network using the loss calculated by using the inspection result or identity information for the second recognition result as the correct value.

상기 학습시키는 단계는 미리 결정된 주기에 따라 수집된 신분증 이미지의 상기 인식 결과와 상기 정답 값의 쌍으로 구성된 학습 셋을 단위로 신경망을 학습시키는 것이 바람직하다.In the learning step, it is preferable to train the neural network based on a learning set consisting of pairs of the recognition result and the correct value of the ID card image collected according to a predetermined cycle.

상기 주기는 배치(batch) 크기로 결정되는 것이 바람직하다.The cycle is preferably determined by the batch size.

상기 기술적 과제를 해결하기 위한 본 발명에 따른 신경망 학습 서버는 사용자의 신분증 이미지에 포함된 객체의 인식을 수행하는 신경망을 이용하여 인식 결과를 출력하는 인식 결과 출력부; 상기 인식 결과의 적어도 일부에 대한 상담원의 검수 결과 또는 상기 사용자에 대해 수집된 신분 정보를 입력 받는 검수 결과 입력부; 및 상기 검수 결과 또는 신분 정보를 각각 대응되는 상기 인식 결과의 정답 값(Ground-Truth)으로 이용하여 신경망을 학습시키는 학습부를 포함한다.A neural network learning server according to the present invention for solving the above technical problem includes a recognition result output unit that outputs a recognition result using a neural network that recognizes an object included in a user's ID card image; an inspection result input unit that receives a counselor's inspection result for at least part of the recognition result or identity information collected about the user; and a learning unit that trains a neural network by using the inspection result or the identity information as a ground truth value of the corresponding recognition result.

상기 검수 결과 입력부는 상기 제2 인식 결과에 대한 상담원의 검수 결과를 입력 받는 것이 바람직하다.Preferably, the inspection result input unit receives the counselor's inspection result for the second recognition result.

상기 검수 결과 입력부는 상기 제1 인식 결과에 대한 상담원의 오류 판단 결과를 입력 받는 것이 바람직하다.Preferably, the inspection result input unit receives the counselor's error judgment result for the first recognition result.

상기 학습부는 상기 제1 인식 결과에 대한 손실을 상수 값으로 고정하여 신경망을 학습시키는 것이 바람직하다.It is preferable that the learning unit trains the neural network by fixing the loss for the first recognition result to a constant value.

상기 학습부는 미리 결정된 주기에 따라 수집된 신분증 이미지의 상기 인식 결과와 상기 정답 값의 쌍으로 구성된 학습 셋을 단위로 신경망을 학습시키는 것이 바람직하다.Preferably, the learning unit trains the neural network based on a learning set consisting of pairs of the recognition result and the correct answer value of the ID card image collected according to a predetermined cycle.

본 발명에 따르면 신분증에 포함된 다양한 정보를 인식할 수 있는 신경망을 효율적으로 학습시킬 수 있다.According to the present invention, a neural network capable of recognizing various information contained in an identification card can be efficiently trained.

본 발명에 따르면 신분증 인식을 위한 신경망의 일부 객체에 대한 인식 결과 만으로도 추가적인 학습을 수행하여 신경망의 전체 예측 성능을 높일 수 있다.According to the present invention, the overall prediction performance of the neural network can be improved by performing additional learning with only the recognition results of some objects of the neural network for ID recognition.

또한, 본 발명은 신분증 인식 시 지도학습이 어려운 부분에 대해 신분증의 검수 과정에서 획득된 검수 결과를 학습을 위한 정답으로 활용함으로써 학습 절차를 효율화 할 수 있다.In addition, the present invention can streamline the learning process by using the inspection results obtained during the identification process as the correct answer for learning in areas where supervised learning is difficult when recognizing an ID card.

도 1은 본 발명의 일 실시예에 따른 신경망 학습 시스템을 나타낸 예시도이다.
도 2는 본 발명의 일 실시예에 따른 신경망 학습 방법을 나타낸 흐름도이다.
도 3 내지 4는 본 발명의 일 실시예에 따른 신경망의 학습을 위한 출력 데이터를 예시하는 도이다.
도 5는 본 발명의 일 실시예에 따른 신경망의 인식 결과에 대한 검수 결과를 예시하는 도이다.
도 6은 본 발명의 일 실시예에 따른 신경망의 학습을 위한 손실 함수의 구성을 나타낸 도이다.
도 7은 본 발명의 일 실시예에 따른 신경망의 학습을 위한 학습 데이터 셋을 예시하는 도이다.
도 8은 본 발명의 일 실시예에 따른 학습 서버의 구성을 나타낸 블록도이다.Figure 1 is an exemplary diagram showing a neural network learning system according to an embodiment of the present invention.
Figure 2 is a flowchart showing a neural network learning method according to an embodiment of the present invention.
3 and 4 are diagrams illustrating output data for learning of a neural network according to an embodiment of the present invention.
Figure 5 is a diagram illustrating the inspection results of the recognition results of the neural network according to an embodiment of the present invention.
Figure 6 is a diagram showing the configuration of a loss function for learning a neural network according to an embodiment of the present invention.
Figure 7 is a diagram illustrating a learning data set for learning a neural network according to an embodiment of the present invention.
Figure 8 is a block diagram showing the configuration of a learning server according to an embodiment of the present invention.

이하의 내용은 단지 발명의 원리를 예시한다. 그러므로 당업자는 비록 본 명세서에 명확히 설명되거나 도시 되지 않았지만 발명의 원리를 구현하고 발명의 개념과 범위에 포함된 다양한 장치를 발명할 수 있는 것이다. 또한, 본 명세서에 열거된 모든 조건부 용어 및 실시 예들은 원칙적으로, 발명의 개념이 이해되도록 하기 위한 목적으로만 명백히 의도되고, 이외같이 특별히 열거된 실시 예들 및 상태들에 제한적이지 않는 것으로 이해되어야 한다. The following merely illustrates the principles of the invention. Therefore, a person skilled in the art can invent various devices that embody the principles of the invention and are included in the concept and scope of the invention, although not clearly described or shown herein. In addition, all conditional terms and embodiments listed in this specification are, in principle, clearly intended only for the purpose of ensuring that the inventive concept is understood, and should be understood as not limiting to the specifically listed embodiments and states. .

상술한 목적, 특징 및 장점은 첨부된 도면과 관련한 다음의 상세한 설명을 통하여 보다 분명해질 것이며, 그에 따라 발명이 속하는 기술분야에서 통상의 지식을 가진 자가 발명의 기술적 사상을 용이하게 실시할 수 있을 것이다. The above-mentioned purpose, features and advantages will become clearer through the following detailed description in relation to the attached drawings, and accordingly, those skilled in the art in the technical field to which the invention pertains will be able to easily implement the technical idea of the invention. .

또한, 발명을 설명함에 있어서 발명과 관련된 공지 기술에 대한 구체적인 설명이 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우에 그 상세한 설명을 생략하기로 한다. 이하에는 첨부한 도면을 참조하여 본 발명의 바람직한 실시 예에 대해 상세하게 설명한다.Additionally, when describing the invention, if it is determined that a detailed description of the known technology related to the invention may unnecessarily obscure the gist of the invention, the detailed description will be omitted. Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the attached drawings.

도 1은 본 발명의 일 실시예에 따른 신경망의 학습 방법이 수행되는 학습 시스템의 구성을 나타내는 도이다.1 is a diagram showing the configuration of a learning system in which a neural network learning method is performed according to an embodiment of the present invention.

도 1을 참조하면, 본 실시예에 따른 학습 시스템은 사용자(10)가 비대면 금융 서비스의 이용을 위해 이용하는 사용자 단말(1000)과 사용자 단말(1000)로부터 입력된 사용자(10)의 신분 정보를 수신하고 이를 통해 본인 인증을 수행하는 학습 서버(100)로 구성될 수 있다.Referring to FIG. 1, the learning system according to this embodiment includes a user terminal 1000 that the user 10 uses to use a non-face-to-face financial service, and the identity information of the user 10 input from the user terminal 1000. It may be configured as a learning server 100 that receives information and performs identity authentication through it.

학습 서버(100)는 비대면 기반의 본인 인증을 수행하기 위하여 사용자(10)로부터 신분증 이미지(50)를 수신하고, 신분증 이미지(50)에 포함된 인증에 필요한 객체들을 학습된 신경망을 이용하여 직접 인식하여 결과(70)로 출력할 수 있다.The learning server 100 receives the ID card image 50 from the user 10 in order to perform non-face-to-face identity authentication, and directly uses the learned neural network to identify objects required for authentication included in the ID card image 50. It can be recognized and output as a result (70).

예를 들어, 신분증 이미지(50)에 포함된 사용자(10)의 성명, 주민등록번호, 주소 정보에 포함된 텍스트들을 신경망을 이용하여 인식할 수 있다.For example, the text included in the name, resident registration number, and address information of the user 10 included in the ID card image 50 can be recognized using a neural network.

구체적으로 학습된 신경망은 인식하고자 하는 객체의 존재 영역에 대한 위치 정보를 인식 결과로 출력할 수 있다.Specifically, the learned neural network can output location information about the area where the object to be recognized exists as a recognition result.

일 예로, 신분증 이미지(50) 전체의 영역으로 신분증 테두리에 대한 정보를 이미지 상의 좌표로 출력할 수 있다. 또는, 신분증 이미지(50) 내의 객체로 예를 들어 소유자 성명, 주민등록번호 등의 텍스트들이 존재하는 영역에 대한 값을 검출하여 출력할 수 있다.As an example, information about the border of the ID card can be output to the entire area of the ID card image 50 as coordinates on the image. Alternatively, a value for an area where text such as the owner's name or resident registration number exists as an object in the ID card image 50 can be detected and output.

이때 영역은 사각형 형태의 바운딩 박스(Bounding Box)로 정의될 수 있으며, 구체적으로 바운딩 박스는 중심점의 x, y축 좌표와 테두리의 폭 w, 높이 h 값의 4차원 벡터 값으로 정의될 수 있다.At this time, the area can be defined as a rectangular bounding box. Specifically, the bounding box can be defined as a 4-dimensional vector value of the x- and y-axis coordinates of the center point and the width w and height h of the border.

또한, 신분증에는 코팅과 홀로그램과 같은 위변조 방지 장치들이 적용되는 데 이와 같은 장치가 비대면 상황의 신분증 촬영 시 반사나 가림 현상을 발생시켜 객체의 인식을 저해할 수 있다. 따라서 신경망은 이러한 저해 요소들의 존재여부를 판단하여 촬영된 신분증 이미지의 이용 가능 여부를 판단하도록 하는 것도 가능하다.In addition, anti-counterfeiting devices such as coatings and holograms are applied to ID cards, and such devices can hinder recognition of objects by causing reflection or blocking when photographing ID cards in non-face-to-face situations. Therefore, it is also possible for the neural network to determine whether the captured ID image can be used by determining the presence or absence of these hindering factors.

또한, 신분증 이미지(50)에는 동일인의 식별을 위해 인쇄된 안면 사진이 포함될 수 있는데 안면 사진으로부터 추출되는 특징 정보들도 출력으로 생성될 수 있다.Additionally, the ID image 50 may include a printed facial photo for identification of the same person, and feature information extracted from the facial photo may also be generated as output.

즉, 신분증 이미지(50)에는 신분 인증을 위한 다양한 정보들이 포함되어 있으며 따라서 신경망이 포함된 모든 객체들을 정확하게 인식하여야만 인증에 이용할 수 있다.In other words, the ID card image 50 contains various information for identity authentication, and therefore, it can be used for authentication only when all objects included in the neural network are accurately recognized.

하지만, 잘 학습된 신경망의 경우에도 신분증 이미지(50)를 촬영하는 환경, 위조 변조 방지를 위한 장치들에 따른 영향으로 객체 인식에 오류가 발생할 수 있으므로 실제 비대면 기반의 서비스를 제공하는 사업 환경에서는 추가적인 검수 절차를 통해 오류를 보완하고 있다.However, even in the case of a well-trained neural network, errors may occur in object recognition due to the influence of the environment in which the ID card image 50 is taken and devices to prevent forgery and tampering, so in a business environment that provides actual non-face-to-face services, Errors are being corrected through additional inspection procedures.

따라서 본 실시예에서 학습 서버(100)는 인식 결과에 대하여 검수 절차의 결과 값을 이용하여 신경망의 학습을 수행하고자 한다. 학습 서버(100)는 검수자의 검수 결과를 수신하고 검수 결과를 학습 데이터의 정답(Ground-Truth)로 이용하여 학습을 수행한다.Therefore, in this embodiment, the learning server 100 seeks to perform neural network learning using the result value of the verification procedure for the recognition result. The learning server 100 receives the inspector's inspection results and performs learning using the inspection results as the ground truth of the learning data.

다만, 검수자는 검수라는 본연의 업무 목적상 신경망으로부터 인식된 모든 객체에 대한 오류 검수를 수행할 필요가 없으며, 신분 인증에 영향을 미치는 객체들에 대한 오류 여부를 검수하므로, 이에 따른 신경망의 학습 방법을 제안한다.However, for the purpose of the original task of inspection, the inspector does not need to check for errors on all objects recognized by the neural network. Since the inspector checks for errors in objects that affect identity authentication, the learning method of the neural network accordingly suggests.

이하, 도 2를 참조하여 본 실시예에 따른 학습 방법에 대하여 보다 상세히 설명한다.Hereinafter, the learning method according to this embodiment will be described in more detail with reference to FIG. 2.

먼저, 사용자(10)의 신분증 이미지(50)에 포함된 객체의 인식을 수행하는 신경망을 이용하여 인식 결과를 출력한다(S100).First, the recognition result is output using a neural network that recognizes the object included in the ID card image 50 of the user 10 (S100).

신경망은 신분증 이미지(50)에 대한 객체의 인식 결과를 출력하도록 미리 학습(pre-training)될 수 있다.The neural network may be pre-trained to output object recognition results for the ID card image 50.

구체적으로 신경망은 신분증 이미지(50)에 포함된 객체들의 위치 정보에 대한 제1 인식 결과와 객체들의 분류 결과로 제2 인식 결과를 각각 출력할 수 있다.Specifically, the neural network may output a first recognition result regarding the location information of objects included in the ID card image 50 and a second recognition result as a classification result of the objects, respectively.

신경망은 객체들이 이미지 상에 존재하는 영역에 대한 정확한 판단을 전제로 영역 내에 위치하는 객체의 분류를 수행할 수 있으므로 제1 인식 결과와 제2 인식 결과는 출력 과정에서 서로 상관관계를 가지며 공통된 신경망으로부터 출력될 수 있다.Since the neural network can classify objects located within the area on the premise of accurate judgment of the area in which the objects exist in the image, the first recognition result and the second recognition result have a correlation with each other in the output process and are derived from a common neural network. can be printed.

이를 위해 신경망은 학습을 위한 학습 데이터로 이미지와 이미지에 포함된 객체들에 대한 영역과 분류가 미리 레이블링 된 학습 데이터 셋을 이용하여 지도 학습(Supervised Learning)을 수행한다.For this purpose, the neural network performs supervised learning using a training data set in which the regions and classifications of images and objects included in the images are pre-labeled as training data.

다만, 본 실시예에서는 신경망의 학습에 있어서 미리 레이블링 된 데이터 외 신분증 이미지(50)의 검수 과정에서 수집되는 정보를 이용하여 학습을 추가 수행함으로써 실제 환경에 보다 강인한 성능을 갖도록 하는 것을 목적으로 한다.However, in this embodiment, the purpose is to provide more robust performance in the real environment by additionally performing learning using information collected during the inspection process of the ID card image 50 in addition to pre-labeled data in learning the neural network.

도 3을 참조하면, 신경망(30)은 회귀(Regression) 분석에 따른 객체의 존재 영역을 정의하는 제1 인식 결과(70-1) 및 분류(Classification)에 따른 객체의 인식 값을 정의하는 제2 인식 결과(70-2)를 출력할 수 있다.Referring to FIG. 3, the neural network 30 provides a first recognition result (70-1) that defines the presence area of the object according to regression analysis and a second recognition result (70-1) that defines the recognition value of the object according to classification. Recognition results (70-2) can be output.

본 실시예에서 신경망(30)은 합성곱 연산을 수행하는 레이어의 적층 구조로 구성된 CNN(Convolution Neural Network) 모델로 구현될 수 있다.In this embodiment, the neural network 30 may be implemented as a CNN (Convolution Neural Network) model composed of a stacked structure of layers that perform convolution operations.

CNN 모델 내부의 합성곱 레이어들은 입력된 이미지 데이터 내의 특징 값을 설정된 필터 들에 따라 합성곱 연산을 수행하며, 출력으로 결정된 차원에 따른 특징 맵을 생성할 수 있다.The convolution layers within the CNN model perform a convolution operation on the feature values in the input image data according to the set filters, and can generate a feature map according to the dimension determined as the output.

또한, 출력된 특징 맵은 완전 연결 레이어를 거치면서 본 실시예에 따른 회귀 분석 결과로서 제1 인식 결과(70-1)와 분류 결과로 제2 인식 결과(70-2)로 출력될 수 있다.Additionally, the output feature map may be output as a first recognition result (70-1) as a result of regression analysis according to this embodiment and a second recognition result (70-2) as a classification result while passing through a fully connected layer.

이때 신경망은 상술한 CNN 모델 외에 인식하고자 하는 객체의 속성에 따라 R-CNN(Region with CNN), RNN(Recurrent Neural Network), LSTM(Long Short-Term Memory) 등의 다양한 알고리즘들이 적용될 수 있으며 신경망의 구현 방법이 이에 한정되는 것은 아니다.At this time, in addition to the CNN model described above, various algorithms such as R-CNN (Region with CNN), RNN (Recurrent Neural Network), and LSTM (Long Short-Term Memory) can be applied to the neural network depending on the properties of the object to be recognized. The implementation method is not limited to this.

이하, 도 4를 참조하여 본 실시예에 따른 신분증 이미지로부터 출력되는 결과에 대하여 보다 설명한다.Hereinafter, the results output from the ID card image according to this embodiment will be described in more detail with reference to FIG. 4.

먼저, 제1 인식 결과(70-1)는 신분증 이미지에 포함된 객체들의 위치 정보로, 바운딩 박스로 정의된 좌표 값일 수 있다.First, the first recognition result 70-1 is location information of objects included in the ID card image, and may be coordinate values defined as a bounding box.

따라서, 신분증 이미지 내 신분증 테두리 영역 정보, 성명이나 주민등록번호 등의 텍스트 부분의 영역 정보를 제1 인식 결과(70-1)로 출력할 수 있다.Accordingly, information on the border area of the ID card within the ID image and area information on the text portion, such as name or resident registration number, can be output as the first recognition result (70-1).

반면 제2 인식 결과(70-2)는 신분증 이미지에 포함된 객체들의 분류 결과 값일 수 있다.On the other hand, the second recognition result (70-2) may be a classification result value of objects included in the ID card image.

예를 들어, 성명 중 성이 '홍'으로 인식되는 텍스트 분류 결과 직접 출력할 수 있으며 주민등록번호의 숫자 값들을 숫자 도메인 상에서 분류 결과로 출력할 수 있다.For example, text classification results in which the last name in the name is recognized as 'Hong' can be directly output, and numeric values of the resident registration number can be output as classification results in the numeric domain.

그 외, 신분인증에 영향을 미치는 요소로서 그림자 포함여부, 홀로그램의 존재여부를 판단하여 인식 결과로 출력하는 것도 가능하다. 또한 안면 정보로 신분증에 포함된 사진 상으로 사용자가 안경을 착용하였는지, 머리 스타일에 대한 판단 정보를 분류 결과로 출력할 수도 있다. In addition, it is possible to determine whether a shadow is included or whether a hologram is present as factors that affect identity authentication and output them as recognition results. In addition, facial information can be used to output judgment information about whether the user is wearing glasses and hair style in the photo included in the ID card as a classification result.

이때, 제1 인식 결과(70-1)는 객체의 영역으로 바운딩 박스의 좌표 정보는 신분 인증에는 영향을 미치지 않는 값이므로 상담원 본연의 업무 목적 상 좌표 값 자체에 대한 오류에 대한 검수가 불필요할 수 있다.At this time, the first recognition result (70-1) is the area of the object, and the coordinate information of the bounding box is a value that does not affect identity authentication, so checking for errors in the coordinate value itself may not be necessary for the agent's original work purpose. there is.

따라서, 제1 인식 결과(70-1)에 대한 정답 값의 설정, 예를 들어 영역의 좌표 값을 정 위치로 교정하는 것은 부가적인 프로세스가 될 수 있으므로 상담원의 최소한의 정오(True/False) 판단에 대한 결과를 이용하여 약지도 학습을 수행할 수 있도록 한다.Therefore, setting the correct value for the first recognition result (70-1), for example, correcting the coordinate value of the area to the correct position, may be an additional process, so the agent's minimum true/false judgment Weakly supervised learning can be performed using the results.

즉 검수 과정에서 객체들이 존재하는 신분증 이미지 내 위치 좌표 값의 정답을 설정하는 대신, 추출된 영역의 맞고 틀림에 대한 이분화 된 판단을 요청할 수 있다.In other words, during the inspection process, instead of setting the correct answer to the location coordinate value in the ID card image where the objects exist, a dichotomous judgment can be requested as to whether the extracted area is correct or incorrect.

예를 들어 신분증 이미지 내 추출된 신분증 테두리의 모서리 4개점에 대한 좌표 각각의 오류를 판단하는 대신 테두리 자체의 맞고 틀림에 대한 판단을 약지도 학습에 이용할 수 있도록 한다.For example, instead of determining the error of each of the coordinates of the four corners of the ID border extracted from the ID image, the judgment of whether the border itself is correct or incorrect can be used for weak map learning.

그 외 추출 가능한 영역 값으로써 신분증 이미지 내 품질의 문제가 있는 곳에 대한 부분이 인식 결과로 출력되는 경우, 해당 부분의 좌표를 직접 상담원이 교정하는 대신 품질 자체의 판단에 대한 맞고 틀림을 판단함으로써 약지도 학습에 이용하는 것도 가능하다.In addition, if a part of the ID card image with a quality problem is output as a recognition result as an extractable area value, instead of the counselor directly correcting the coordinates of the part, a weak map is provided by judging the rightness or error of the judgment of the quality itself. It is also possible to use it for learning.

반면, 제2 인식 결과(70-2)는 사용자의 성명, 주민등록번호 등에 대한 직접적인 값이므로 신분증 발급 기관에서 부여한 주민등록번호와 동일한지에 따라 정상 사용자로 판단하는 기준이 될 수 있다. 즉 인식 결과에 따라 본인 인증 여부가 결정되므로 상담원은 제2 인식 결과(70-2)에 대한 검수를 수행하고 검수 결과를 생성한다.On the other hand, since the second recognition result (70-2) is a direct value for the user's name, resident registration number, etc., it can be a standard for judging a normal user depending on whether it is the same as the resident registration number assigned by the ID issuing agency. In other words, since identity verification is determined based on the recognition result, the agent inspects the second recognition result (70-2) and generates the inspection result.

또한, 제2 인식 결과(70-2)는 신분증 이미지 내의 안면 특징 정보 또는 신분증 이미지의 빛 가림, 홀로그램, 빛 반사 유무에 의한 정상 상태에 대한 분류 결과를 포함할 수 있으며 이에 대해서는 상담원의 검수 결과를 이용하여 지도 학습을 수행할 수 있다.In addition, the second recognition result (70-2) may include facial feature information in the ID card image or a classification result for the normal state based on the presence or absence of light occlusion, hologram, or light reflection of the ID card image, and the agent's inspection results for this may be included. You can use supervised learning to perform supervised learning.

이때의 검수 결과는 신경망의 분류 결과에 대한 정답 값이므로 지도 학습에 직접 이용될 수 있다.즉, 상담원은 신경망의 인식 결과의 오류 여부를 판단함으로써 정상 사용자가 신분 인증이 거절되는 오거절율을 낮출 수 있으며 이와 동시에 본 실시예에서는 생성된 검수 결과를 이용하여 신경망을 학습시킨다. The inspection result at this time is the correct value for the classification result of the neural network, so it can be used directly for supervised learning. In other words, the agent determines whether there is an error in the neural network's recognition result, thereby lowering the false rejection rate in which normal users are rejected for identity authentication. At the same time, in this embodiment, the neural network is trained using the generated inspection results.

따라서 학습 서버는 인식 결과의 적어도 일부에 대한 상담원의 검수 결과 또는 사용자(10)에 대해 수집된 신분 정보를 입력 받는다(S200).Accordingly, the learning server receives the agent's inspection result for at least part of the recognition result or the identity information collected about the user 10 (S200).

검수 결과는 학습 서버(100)로부터 수신된 신분증 이미지(50)의 인식 결과를 검수하는 상담원의 검수 장치(미도시)로부터 수신될 수 있다.The inspection result may be received from a counselor's inspection device (not shown) that inspects the recognition result of the ID card image 50 received from the learning server 100.

상담원은 검수 장치를 통해 직접 육안으로 신분증 이미지(50)에 인쇄된 객체들을 인식하고 신경망의 인식 결과와 비교하여 오인식 여부를 판단한다.The counselor recognizes the objects printed on the ID card image 50 with the naked eye through an inspection device and compares them with the recognition results of the neural network to determine whether there is a misrecognition.

이때 상담원은 모든 인식 결과의 오류 여부를 판단하지 않고 인식 결과 중 일부에 대한 오류 여부를 판단한다.At this time, the counselor does not judge whether all recognition results are errors, but judges whether some of the recognition results are errors.

구체적으로 도 5를 참조하면, 신분증 이미지에 대한 인식결과로 성명 중 성이 '항'으로 인식되었으나 상담원이 육안으로 확인한 결과는 '홍'인 경우 이에 대한 오류의 정정을 검수 결과로 생성할 수 있다.Specifically, referring to Figure 5, if the last name in the name is recognized as 'Hang' as a result of recognition of the ID card image, but the result visually confirmed by the counselor is 'Hong', correction of the error can be generated as an inspection result. .

또한, 안면 정보로서 사용자의 사진 상에 나타나는 머리 스타일에 가르마가 없는 것으로 인식되었으나 상담원의 확인 결과로 가르마가 있는 것으로 판단된 경우 이를 검수 결과로 생성하는 것도 가능하다.Additionally, if the hair style shown in the user's photo as facial information is recognized as having no parting, but is determined to have parting as a result of the counselor's confirmation, it is also possible to generate this as an inspection result.

하지만, 신분증 테두리에 대한 영역 정보에 대해서는 상담원이 별도의 오차 검수를 진행하지 않으므로 검수 결과는 신경망의 출력과 다른 형태로 생성될 수 있다.However, since the agent does not perform a separate error check on the area information about the border of the ID card, the check result may be generated in a different form from the output of the neural network.

즉 약지도 학습을 위하여 신분증 테두리에 대한 영역 정보에 대해서는 상담원이 구체적인 정답은 아니지만 검출된 테두리 영역의 맞고 틀림(True/False)에 대한 오류 판단을 검수 결과로 생성할 수 있으며, 판단 결과에 따라 고정된 상수 값을 손실로 정의함으로써 신경망의 학습에 이용한다.In other words, for weakly supervised learning, the agent can generate an error judgment about the truth/false of the detected border area as a result of inspection, although it is not a specific correct answer regarding the area information about the border of the ID card, and it is fixed according to the judgment result. It is used for learning neural networks by defining the constant value as the loss.

즉, 학습 서버(100)는 신경망으로부터 출력된 인식 결과 중 제2 인식 결과(70-2)에 대한 상담원의 검수 결과를 정답으로 손실을 산출하여 학습에 이용할 수 있으며, 제1 인식 결과(70-1)에 대해서는 정답은 아니지만 상담원의 오류 판단 결과를 통해 결정된 고정된 손실 값으로 학습에 이용할 수 있다.In other words, the learning server 100 can use the agent's inspection result for the second recognition result (70-2) among the recognition results output from the neural network to calculate the loss as the correct answer and use it for learning, and the first recognition result (70-2) Although it is not the correct answer to 1), it can be used for learning as a fixed loss value determined through the agent's error judgment results.

또한, 학습 서버는 추가로 직접 사용자(100)가 사용자 단말(1000)을 통해 입력한 정보를 수신하여 오류 여부를 판단하는 것도 가능하다. 예를 들어 사용자는 사용자 단말의 입력 인터페이스를 이용하여 자신의 성명이나 주민 번호 등의 신상 정보를 입력할 수 있는데, 이때의 입력 정보를 수신하여 학습에 이용하는 것도 가능하다.In addition, the learning server may additionally directly receive information entered by the user 100 through the user terminal 1000 and determine whether there is an error. For example, a user can input personal information such as his or her name or social security number using the input interface of the user terminal. It is also possible to receive the input information and use it for learning.

또는, 신분증을 관리하는 기관의 외부 서버로부터 신분증 발급 정보로서 신분증 소유자의 성명이나 발급 번호 등의 정보를 수신하여 이용하는 것도 가능하다.Alternatively, it is also possible to receive and use information such as the ID card holder's name or issuance number as ID issuance information from an external server of an organization that manages ID cards.

다음 학습 서버는 검수 결과 또는 신분 정보를 각각 대응되는 인식 결과의 정답 값(Ground-Truth)으로 이용하여 신경망을 학습시킨다(S300).Next, the learning server uses the inspection result or identity information as the ground truth value of the corresponding recognition result to learn the neural network (S300).

즉, 인식 결과 중 분류 결과로 나타나는 제2 인식 결과(70-2)에 대한 상담원의 검수 결과(80)로 정답 값과의 차이를 신경망의 손실 함수(loss function)로 정의하여 학습을 수행할 수 있다.In other words, learning can be performed by defining the difference from the correct answer value as the agent's inspection result (80) for the second recognition result (70-2), which appears as a classification result among the recognition results, as the loss function of the neural network. there is.

도 6을 참조하면, 신경망으로부터 2가지의 인식 결과가 출력될 수 있는데 본 실시예에서는 제2 인식 결과(70-2)에 대해서는 검수 과정에서 정답 값으로 검수 결과(80)를 상담원으로부터 수신하므로 이를 이용하여 결과 간의 차이를 계산하고 오차를 역전파 함으로써 신경망 내부의 레이어를 갱신할 수 있다.Referring to FIG. 6, two recognition results can be output from the neural network. In this embodiment, for the second recognition result (70-2), the inspection result (80) is received from the counselor as the correct value during the inspection process. You can update the layers inside the neural network by calculating the difference between results and back-propagating the error.

다만, 본 실시예에서는 일부 인식 결과에 대해서는 정답 값이 설정되지 않는 약지도학습(weakly supervised learning)을 수행하기 위해, 인식 결과(70) 중 검수 결과 또는 신분 정보에 대응되지 않는(정답이 정의되지 않은) 값에 대한 손실을 고정(freezing)시킴으로써 신경망(30)을 학습시킨다. However, in this embodiment, in order to perform weakly supervised learning in which the correct answer value is not set for some recognition results, among the recognition results 70, those that do not correspond to the inspection result or identity information (the correct answer is not defined) The neural network 30 is trained by freezing the loss for the (unknown) values.

구체적으로 인식 결과(70) 중 제1 인식 결과(70-1)에 대해서는 출력 값에 대한 정답 값이 정의되는 대신, 전체 출력 값의 개략적인 맞고 틀림에 대한 판단이 수행되므로 이를 손실로 정의하여 학습에 이용할 수 있다.Specifically, for the first recognition result (70-1) among the recognition results (70), instead of defining the correct answer value for the output value, a rough judgment of whether the entire output value is correct or incorrect is made, so this is defined as a loss and learned. It is available for use.

예를 들어, 상담원의 검수 결과 인식된 영역의 위치 좌표 값 들이 정답에 유사하여 개략적으로 맞다고 판단되는 경우 제1 인식 결과(70-1)에 대한 손실은 상수 값 0으로 오차가 없는 것처럼 설정함으로써 제1 인식 결과(70-1)의 출력과는 무관하게 제2 인식 결과(70-2)에 영향을 미치는 신경망(30)의 내부 레이어들의 파라미터들을 선별하여 갱신할 수 있다.For example, as a result of the agent's inspection, if the location coordinate values of the recognized area are similar to the correct answer and are judged to be roughly correct, the loss for the first recognition result (70-1) is set to a constant value of 0 as if there is no error. Regardless of the output of the first recognition result 70-1, the parameters of the internal layers of the neural network 30 that affect the second recognition result 70-2 can be selected and updated.

반대로, 인식된 객체의 위치 좌표 값 들이 틀린 것으로 판단되는 경우 손실을 상수 값 1로 설정하여 제1 인식 결과(70-1) 및 제2 인식 결과(70-2)의 오류에 영향을 미치는 내부 레이어들을 갱신하도록 할 수 있다.Conversely, if the position coordinate values of the recognized object are judged to be incorrect, the internal layer sets the loss to a constant value of 1 to affect the error of the first recognition result (70-1) and the second recognition result (70-2). You can have them updated.

이상, 학습 서버(100)는 신분증에 포함된 다양한 정보를 인식할 수 있는 신경망(30)을 효율적으로 학습시킬 수 있으며, 신분증의 검수 과정에서 획득된 검수 결과(80)를 학습을 위한 정답으로 활용함으로써 학습 절차를 효율화 할 수 있다.As above, the learning server 100 can efficiently learn the neural network 30 that can recognize various information contained in the ID card, and uses the inspection result 80 obtained during the inspection process of the ID card as the correct answer for learning. By doing so, the learning process can be streamlined.

또한, 실제 신분 인증의 과정에서 발생되는 오류를 정정하는 방향으로 신경망이 학습되므로 사용 환경에 더욱 강인한 성능을 갖도록 할 수 있다.In addition, since the neural network is trained to correct errors that occur during the actual identity authentication process, it can have more robust performance in the use environment.

나아가, 보다 학습의 효율을 높이기 위하여 학습 서버(100)는 미리 결정된 주기에 따라 수집된 신분증 이미지(50)의 인식 결과와 정답 값의 쌍으로 구성된 학습 셋을 단위로 신경망을 학습시킬 수 있다.Furthermore, in order to further increase the efficiency of learning, the learning server 100 may train a neural network based on a learning set consisting of pairs of recognition results and correct answer values of the ID card image 50 collected according to a predetermined cycle.

구체적으로 주기는 배치 사이즈(batch size)로 결정될 수 있으며, 배치 사이즈는 수집되는 학습 데이터의 수로 결정될 수 있다.Specifically, the cycle can be determined by the batch size, and the batch size can be determined by the number of training data collected.

도 7을 참조하면 최소 학습 데이터의 수로 결정된 N개 이상의 신분증 이미지(50')와 검수 결과 쌍을 누적하고 신경망(30)을 배치 단위로 학습시킬 수 있다.Referring to FIG. 7, N or more ID card image 50' and inspection result pairs determined by the minimum number of learning data can be accumulated and the neural network 30 can be trained in batch units.

예를 들어 학습 데이터를 모아두는 버퍼를 통해 상담원이 검수 결과로 정답을 레이블링 한 결과를 누적하고, 누적된 학습 데이터가 N개 이상이 되면 학습 스텝을 수행하도록 할 수 있다. 또한 누적 이후 또는 이전에 쌓여 있던 학습 데이터들도 추가로 불러와서 전체 데이터를 재 학습시키는 것도 가능하다.For example, through a buffer that collects learning data, the agent can accumulate the results of labeling the correct answers as inspection results and perform the learning step when the accumulated learning data exceeds N. It is also possible to re-train the entire data by additionally loading learning data that was accumulated after or before accumulation.

다만, 결정된 임계값 N에 따라 학습 데이터의 누적 주기가 매우 빠를 수 있어 N개 데이터를 하나의 세트(학습하는 기본 단위)로 설정하여 다수의 세트를 기존의 데이터와 섞어서 재 학습하는 것도 가능하다. However, depending on the determined threshold N, the accumulation cycle of learning data can be very fast, so it is possible to set N pieces of data as one set (basic unit for learning) and re-learn by mixing multiple sets with existing data.

또한, 시간을 고려하여 추가적인 학습 기준을 설정할 수 있으며, 예를 들어 하루 이상 학습 데이터를 누적하진 않음으로써 학습 데이터 세트 간의 유의미한 특징을 학습 하도록 할 수 있다.Additionally, additional learning criteria can be set considering time. For example, by not accumulating training data for more than one day, meaningful features between training data sets can be learned.

또한, 보다 효율적인 학습을 위해 수집된 학습 데이터 셋 내의 특징 값들은 정규화 될 수 있으며 결정된 배치를 단위로 정규화(batch normalization)를 수행하여 신경망(30)에 입력될 수 있다. 신경망은 정규화된 입력 값과 정답 값 간의 오차를 역전파(back-propagation)함으로써 내부 레이어들의 파라미터를 고르게 갱신할 수 있다.Additionally, for more efficient learning, the feature values in the collected learning data set can be normalized and input into the neural network 30 by performing batch normalization on a determined batch basis. A neural network can evenly update the parameters of internal layers by back-propagating the error between the normalized input value and the correct answer value.

이때 배치 사이즈는 제2 인식 결과(70-2)의 종류나 개수에 따라 결정될 수 있으며 또는 특정 기간이나 신경망(30)의 업데이트 주기 등에 따라 결정될 수 있다.At this time, the batch size may be determined according to the type or number of the second recognition results 70-2, or may be determined according to a specific period or update cycle of the neural network 30, etc.

이하, 본 발명의 일 실시예에 따른 신경망 학습 방법을 수행하는 학습 서버(100)의 구성에 대하여 설명한다.Hereinafter, the configuration of the learning server 100 that performs the neural network learning method according to an embodiment of the present invention will be described.

도 8을 참조하면, 학습 서버(100)는 인식 결과 출력부(110), 검수 결과 입력부(120), 및 학습부(130)를 포함할 수 있다.Referring to FIG. 8 , the learning server 100 may include a recognition result output unit 110, an inspection result input unit 120, and a learning unit 130.

먼저, 인식 결과 출력부(110)는 신분증 이미지(50)에 대한 객체 인식을 수행하는 신경망(30)을 이용하여 인식 결과를 출력한다.First, the recognition result output unit 110 outputs a recognition result using the neural network 30 that performs object recognition on the ID card image 50.

인식 결과 출력부(110)는 신분증 이미지(50) 중 인식하고자 하는 객체에 대한 존재 영역에 대한 분석 결과와 객체의 분류 결과를 출력할 수 있다.The recognition result output unit 110 may output an analysis result of the presence area of the object to be recognized among the ID card image 50 and a classification result of the object.

구체적으로 신경망은 회귀 분석에 따른 객체의 존재 영역을 정의하는 제1 인식 결과(70-1) 및 분류에 따른 객체의 인식 값을 정의하는 제2 인식 결과(70-2)를 출력할 수 있다.Specifically, the neural network may output a first recognition result 70-1 that defines the area in which the object exists according to regression analysis and a second recognition result 70-2 that defines the recognition value of the object according to classification.

검수 결과 입력부(120)는 인식 결과의 적어도 일부에 대한 상담원의 검수 결과 또는 사용자에 대해 수집된 신분 정보를 입력 받는다.The inspection result input unit 120 receives the counselor's inspection result for at least part of the recognition result or the identity information collected about the user.

본 실시예에서 검수 결과(80)는 신분증 이미지(50)의 인식 결과를 검수하는 상담원의 검수 장치로부터 생성될 수 있다.In this embodiment, the inspection result 80 may be generated from the agent's inspection device that inspects the recognition result of the ID card image 50.

다만, 이때 상담원은 모든 인식 결과의 오류 여부를 판단하지 않고 인식 결과 중 일부에 대한 오류 여부를 판단할 수 있다.However, at this time, the counselor may judge whether some of the recognition results are error-free rather than determining whether all recognition results are error-free.

구체적으로 회귀 분석에 따른 제1 인식 결과(70-1)는 객체의 영역으로 바운딩 박스의 좌표 정보이므로 신분 인증에는 영향을 미치지 않으나 분류에 따른 제2 인식 결과(70-2)는 사용자의 성명, 주민등록번호 등에 대한 인증에 필요한 직접적인 값이므로 이에 대한 오류 여부를 판단한다.Specifically, the first recognition result (70-1) based on regression analysis is the coordinate information of the bounding box in the area of the object, so it does not affect identity authentication, but the second recognition result (70-2) based on classification is the user's name, Since it is a direct value required for authentication of resident registration numbers, etc., it is determined whether there is an error.

나아가 검수 결과 입력부(120)는 상담원의 검수 결과 외에 사용자 단말을 통해 직접 입력한 신상 정보 값을 수신하거나 외부 서버로부터 발급 정보로서 신분증 소유자의 성명이나 발급 번호 등의 정보를 수신하여 정답 값으로 이용하는 것도 가능하다.Furthermore, the inspection result input unit 120 receives the personal information value directly entered through the user terminal in addition to the agent's inspection result, or receives information such as the name or issuance number of the ID card holder as issuance information from an external server and uses it as the correct value. possible.

다음 학습부(130)는 검수 결과 또는 수신된 신분 정보를 각각 대응되는 인식 결과의 정답 값으로 이용하여 신경망(30)을 학습시킨다.Next, the learning unit 130 trains the neural network 30 by using the inspection result or the received identity information as the correct answer value of the corresponding recognition result.

학습부(130)는 인식 결과 중 분류 결과로 나타나는 제2 인식 결과(70-2)에 대한 상담원의 검수 결과(80)를 정답 값으로 인식 결과 간의 차이를 신경망의 손실 함수로 정의하여 학습을 수행할 수 있다.The learning unit 130 performs learning by using the agent's inspection result (80) for the second recognition result (70-2), which appears as a classification result among the recognition results, as the correct value and defining the difference between the recognition results as the loss function of the neural network. can do.

또한 학습부(130)는 보다 학습의 효율을 높이기 위하여 매 검수 결과의 생성 마다 신경망(30)을 학습시키는 대신 미리 결정된 주기에 따라 누적하여 수집된 신분증 이미지의 인식 결과와 정답 값의 쌍으로 구성된 학습 셋을 구성함으로써 효율적으로 신경망을 학습시킬 수 있다.In addition, in order to increase the efficiency of learning, the learning unit 130 trains the neural network 30 for each generation of inspection results, and instead performs learning consisting of pairs of recognition results of ID images collected cumulatively according to a predetermined cycle and the correct answer value. By configuring the set, you can efficiently train a neural network.

구체적으로 주기는 배치 사이즈(batch size)로 결정된 최소 학습 데이터의 수로 결정되거나 기간, 신경망(30)의 배포 주기에 따라 결정될 수 있다.Specifically, the cycle may be determined by the minimum number of learning data determined by the batch size, or may be determined by the period or distribution cycle of the neural network 30.

또한, 보다 효율적인 학습을 위해 수집된 학습 데이터 셋 내의 특징 값들은 정규화 될 수 있으며 결정된 배치를 단위로 정규화되어 신경망에 입력될 수 있다.Additionally, for more efficient learning, the feature values in the collected learning data set can be normalized and input into the neural network by normalizing the determined batch as a unit.

이상 본 발명에 따르면 신분증에 포함된 다양한 객체의 인식을 위한 신경망의 일부 객체에 대한 인식 결과 만으로도 추가적인 학습을 수행하여 신경망의 전체 예측 성능을 높일 수 있다.According to the present invention, the overall prediction performance of the neural network can be increased by performing additional learning with only the recognition results for some objects of the neural network for recognizing various objects included in the ID card.

또한, 본 발명은 학습을 위한 학습 데이터를 신분증의 검수 과정에서 획득하여 부가적인 절차의 발생을 방지할 수 있다.In addition, the present invention can prevent the occurrence of additional procedures by obtaining learning data for learning during the inspection of ID cards.

나아가, 여기에 설명되는 다양한 실시예는 예를 들어, 소프트웨어, 하드웨어 또는 이들의 조합된 것을 이용하여 컴퓨터 또는 이와 유사한 장치로 읽을 수 있는 기록매체 내에서 구현될 수 있다.Furthermore, various embodiments described herein may be implemented in a recording medium readable by a computer or similar device, for example, using software, hardware, or a combination thereof.

하드웨어적인 구현에 의하면, 여기에 설명되는 실시예는 ASICs (application specific integrated circuits), DSPs (digital signal processors), DSPDs (digital signal processing devices), PLDs (programmable logic devices), FPGAs (field programmable gate arrays, 프로세서(processors), 제어기(controllers), 마이크로 컨트롤러(micro-controllers), 마이크로 프로세서(microprocessors), 기타 기능 수행을 위한 전기적인 유닛 중 적어도 하나를 이용하여 구현될 수 있다. 일부의 경우에 본 명세서에서 설명되는 실시예들이 제어 모듈 자체로 구현될 수 있다.According to hardware implementation, the embodiments described herein include application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), and field programmable gate arrays (FPGAs). In some cases, it may be implemented using at least one of processors, controllers, micro-controllers, microprocessors, and other electrical units for performing functions. The described embodiments may be implemented as a control module itself.

소프트웨어적인 구현에 의하면, 본 명세서에서 설명되는 절차 및 기능과 같은 실시예들은 별도의 소프트웨어 모듈들로 구현될 수 있다. 상기 소프트웨어 모듈들 각각은 본 명세서에서 설명되는 하나 이상의 기능 및 작동을 수행할 수 있다. 적절한 프로그램 언어로 쓰여진 소프트웨어 어플리케이션으로 소프트웨어 코드가 구현될 수 있다. 상기 소프트웨어 코드는 메모리 모듈에 저장되고, 제어모듈에 의해 실행될 수 있다.According to software implementation, embodiments such as procedures and functions described in this specification may be implemented as separate software modules. Each of the software modules may perform one or more functions and operations described herein. Software code can be implemented as a software application written in an appropriate programming language. The software code may be stored in a memory module and executed by a control module.

이상의 설명은 본 발명의 기술 사상을 예시적으로 설명한 것에 불과한 것으로서, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자라면 본 발명의 본질적인 특성에서 벗어나지 않는 범위 내에서 다양한 수정, 변경 및 치환이 가능할 것이다. The above description is merely an illustrative explanation of the technical idea of the present invention, and various modifications, changes, and substitutions can be made by those skilled in the art without departing from the essential characteristics of the present invention. will be.

따라서, 본 발명에 개시된 실시 예 및 첨부된 도면들은 본 발명의 기술 사상을 한정하기 위한 것이 아니라 설명하기 위한 것이고, 이러한 실시 예 및 첨부된 도면에 의하여 본 발명의 기술 사상의 범위가 한정되는 것은 아니다. 본 발명의 보호 범위는 아래의 청구 범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 발명의 권리 범위에 포함되는 것으로 해석되어야 할 것이다.Accordingly, the embodiments disclosed in the present invention and the accompanying drawings are not intended to limit the technical idea of the present invention, but are for illustrative purposes, and the scope of the technical idea of the present invention is not limited by these embodiments and the attached drawings. . The scope of protection of the present invention should be interpreted in accordance with the claims below, and all technical ideas within the equivalent scope should be construed as being included in the scope of rights of the present invention.

Claims

In the neural network learning method,
Outputting a recognition result using a neural network that recognizes an object included in the user's ID card image;
receiving a counselor's inspection result for at least part of the recognition result or inputting identity information collected about the user; and
A neural network learning method comprising the step of training a neural network by using the inspection result or the identity information as a ground truth value of the corresponding recognition result.

According to claim 1,
The neural network learning method is characterized in that the neural network outputs a first recognition result defining the area in which the object exists according to regression analysis and a second recognition result defining the recognition value of the object according to classification.

According to claim 2,
The neural network learning method characterized in that the input receiving step involves receiving a counselor's inspection result for the second recognition result.

According to claim 3,
The neural network learning method characterized in that the input receiving step involves receiving an error judgment result of the counselor regarding the first recognition result.

According to claim 3,
The learning step is a neural network learning method characterized in that the neural network is trained by freezing the loss for the first recognition result to a constant value.

According to claim 5,
The learning step is characterized by fixing the loss for the first recognition result and training the neural network using the loss calculated by using the inspection result or identity information for the second recognition result as the correct value. Neural network learning methods.

According to claim 1,
The learning step is a neural network learning method, characterized in that the neural network is trained on a learning set consisting of pairs of the recognition result and the correct value of the ID card image collected according to a predetermined cycle.

According to claim 7,
A neural network learning method, characterized in that the cycle is determined by the batch size.

In a server that performs neural network learning,
a recognition result output unit that outputs recognition results using a neural network that recognizes objects included in the user's ID card image;
an inspection result input unit that receives a counselor's inspection result for at least part of the recognition result or identity information collected about the user; and
A server that performs neural network learning, comprising a learning unit that trains a neural network by using the inspection result or the identity information as the correct answer value of the corresponding recognition result.

According to clause 9,
The neural network outputs a first recognition result defining a region of the object according to regression analysis and a second recognition result defining a recognition value of the object according to classification.

According to claim 10,
A server performing neural network learning, wherein the inspection result input unit receives a counselor's inspection result for the second recognition result.

According to claim 11,
A server performing neural network learning, wherein the inspection result input unit receives a counselor's error judgment result for the first recognition result.

According to claim 12,
A server that performs neural network learning, wherein the learning unit trains the neural network by fixing the loss for the first recognition result to a constant value.

According to clause 9,
A server that performs neural network learning, wherein the learning unit trains a neural network based on a learning set consisting of pairs of the recognition result and the correct answer value of the ID card image collected according to a predetermined cycle.

A computer-readable recording medium storing a program for performing the neural network learning method according to any one of claims 1 to 8.