KR20210057845A

KR20210057845A - Deep Learning Frame Work-Based Image Recognition Method and System Using Training Image Data

Info

Publication number: KR20210057845A
Application number: KR1020190143935A
Authority: KR
Inventors: 박경남
Original assignee: 이지스로직 주식회사
Priority date: 2019-11-12
Filing date: 2019-11-12
Publication date: 2021-05-24

Abstract

The present invention provides a deep learning artificial neural network-based image recognition method and a system used therefor. The method includes the steps of: training a deep learning model based on a deep learning framework using the training image data to obtain at least two deep learning models; selecting a predetermined number of deep learning models corresponding to the number of the obtained deep learning models or less from the deep learning models obtained in descending order of recognition accuracy for the verified image data; and recognizing image data to be recognized using at least one of the selected deep learning models. According to the present invention, it is possible to partially solve or alleviate the technical problems of the deep learning framework-related technology by the deep learning image recognition method and apparatus, and a computer-readable storage medium.

Description

Deep Learning Frame Work-Based Image Recognition Method and System Using Training Image Data

본 발명은 영상 인식 방법 및 시스템에 관한 것으로서, 보다 상세하게는 딥 러닝 인공신경망 기반 영상 인식 방법 및 시스템에 관한 것이다.The present invention relates to an image recognition method and system, and more particularly, to a deep learning artificial neural network-based image recognition method and system.

일반적으로, 딥러닝은 트레이닝 데이터를 통해 학습된 속성을 기반으로 예측 및 분류하는 기계학습의 한 분야로서 다양한 상황에 대해 프로그램이 근사적인 판단을 내릴 수 있도록 하는 기술이다.In general, deep learning is a field of machine learning that predicts and classifies based on attributes learned through training data, and is a technology that enables programs to make approximate judgments about various situations.

관찰된 값(예를 들어, 영상)은 다양한 픽셀의 강도 값의 벡터로, 또는 더 추상적으로, 일련의 에지, 특정 형상을 갖는 영역 등으로 다양한 방식으로 표현될 수 있다.Observed values (eg, images) can be expressed in various ways as vectors of intensity values of various pixels, or more abstractly, as a series of edges, regions with specific shapes, and so on.

딥 러닝의 이점은 기능 학습 및 계층적 기능 추출을 위해 효율적인 자율적 또는 반 자율적 알고리즘으로 기능의 수동 획득을 대체하는 것이다.The advantage of deep learning is to replace manual acquisition of functions with efficient autonomous or semi-autonomous algorithms for function learning and hierarchical function extraction.

딥 러닝은 이론과 실제를 결합한 규율이다. 새로운 알고리즘 이론이 등장함에 따라 다양한 딥 러닝 프레임 워크가 계속 등장하고 있지만 딥 러닝 프레임 워크에 의해 제공되는 기능은 비교적 단순하고 사용자 경험의 저하를 초래시킨다.Deep learning is a discipline that combines theory and practice. With the emergence of new algorithmic theories, various deep learning frameworks continue to appear, but the functions provided by deep learning frameworks are relatively simple and lead to deterioration of the user experience.

따라서 딥러닝 영상 인식 방법 및 장치, 및 컴퓨터 판독 가능 저장 매체에 의해 딥 러닝 프레임 워크 관련 기술의 기술적 문제를 부분적으로 해결하거나 완화시킬 필요가 있다.Therefore, there is a need to partially solve or alleviate the technical problem of the technology related to the deep learning framework by a deep learning image recognition method and apparatus, and a computer-readable storage medium.

한국공개특허 제10-2014-0000559호의 "영상인식 방법 및 그 시스템", 2014.01.03 공개Korean Patent Laid-Open Patent No. 10-2014-0000559 "Image recognition method and system", 2014.01.03

상기한 바와 같은 종래 기술의 문제점을 해결하기 위하여, 본 발명은 딥러닝 영상 인식 방법 및 장치, 및 컴퓨터 판독 가능 저장 매체에 의해 딥러닝 프레임 워크 관련 기술의 기술적 문제를 부분적으로 해결하거나 완화시키는데 목적이 있다.In order to solve the problems of the prior art as described above, an object of the present invention is to partially solve or alleviate the technical problem of the deep learning framework related technology by using a deep learning image recognition method and apparatus, and a computer-readable storage medium. have.

본 발명의 다른 목적들은 이하의 실시례에 대한 설명을 통해 쉽게 이해될 수 있을 것이다.Other objects of the present invention will be easily understood through the description of the following embodiments.

상기한 바와 같은 목적을 달성하기 위해, 본 발명의 일측면에 따르면, 딥 러닝 기반 영상 인식 방법에 있어서, 트레이닝 영상 데이터를 사용하여 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝하여 적어도 2 개의 딥 러닝 모델을 얻는 단계; 검증된 영상 데이터에 대한 인식 정확도의 내림차순으로 획득된 딥 러닝 모델로부터 상기 획득된 딥 러닝 모델의 수 이하에 해당하는 미리 결정된 수의 딥 러닝 모델을 선택하는 단계; 및 선택된 딥 러닝 모델 중 적어도 하나를 사용하여 인식될 영상 데이터를 인식하는 단계;를 포함하는, 딥 러닝 인공신경망 기반 영상 인식 방법이 제공된다.In order to achieve the above object, according to an aspect of the present invention, in a deep learning-based image recognition method, at least two deep learning models are trained based on a deep learning framework using training image data. Obtaining a running model; Selecting a predetermined number of deep learning models corresponding to the number or less of the acquired deep learning models from the deep learning models obtained in descending order of recognition accuracy for the verified image data; And recognizing image data to be recognized using at least one of the selected deep learning models.

상기 트레이닝 영상 데이터, 상기 검증된 영상 데이터 및 상기 인식될 영상 데이터 중 적어도 하나에 대해 영상 전처리를 수행하는 단계를 더 포함할 수 있다.The method may further include performing image preprocessing on at least one of the training image data, the verified image data, and the image data to be recognized.

상기 영상 전처리를 수행하는 단계는, 상기 영상 전처리가 랜덤 자르기, 회전, 뒤집기, 밝기 조정 및 대비 조정 중 적어도 하나를 포함할 수 있다.In the performing of the image preprocessing, the image preprocessing may include at least one of random cropping, rotation, flipping, brightness adjustment, and contrast adjustment.

상기 영상 전처리된 영상 데이터를 사전 확립된 메모리 데이터베이스에 저장하는 단계를 더 포함할 수 있다.The method may further include storing the pre-processed image data in a pre-established memory database.

상기 인식될 영상 데이터를 인식하는 단계는, 상기 선택된 딥 러닝 모델을 사용자에게 제공하는 단계; 및 사용자에 의해 선택된 딥 러닝 모델을 획득하고, 사용자에 의해 선택된 딥 러닝 모델을 사용하여 인식될 영상 데이터를 인식하는 단계;를 포함할 수 있다.Recognizing the image data to be recognized may include providing the selected deep learning model to a user; And acquiring a deep learning model selected by the user and recognizing image data to be recognized using the deep learning model selected by the user.

상기 적어도 2 개의 딥 러닝 모델을 얻는 단계는, 트레이닝 프로세스에 대한 상태 정보를 사용자에게 푸시하는 단계를 포함할 수 있다.Obtaining the at least two deep learning models may include pushing state information on a training process to a user.

상기 적어도 2 개의 딥 러닝 모델을 얻는 단계는, 실시간으로 웹 애플리케이션 프로그래밍 인터페이스를 사용하여 트레이닝중인 딥 러닝 모델에 대한 성능 곡선을 그리는 단계를 포함할 수 있다.The step of obtaining the at least two deep learning models may include drawing a performance curve for the deep learning model being trained using a web application programming interface in real time.

상기 적어도 2 개의 딥 러닝 모델을 얻는 단계는, 다수의 딥 러닝 프레임워크가 동일하거나 2개 이상일 수 있다.In the step of obtaining the at least two deep learning models, a plurality of deep learning frameworks may be the same or two or more.

본 발명의 다른 측면에 따르면, 딥 러닝 기반 영상 인식 장치에 있어서, 프로세서; 및 상기 프로세서에 의해 실행될 때, 상기 프로세서로 하여금, 트레이닝 영상 데이터를 사용함으로써, 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝시켜 적어도 2 개의 딥 러닝 모델을 획득하도록 하는 명령어가 저장된 메모리;를 포함하고, 상기 프로세서는, 검증된 영상 데이터에 대한 인식 정확도의 내림차순으로 획득된 딥 러닝 모델로부터 미리 결정된 수의 딥 러닝 모델을 선택하고, 상기 미리 결정된 수가 획득된 딥 러닝 모델의 수보다 작거나 같도록 하며, 선택된 딥 러닝 모델 중 적어도 하나를 사용하여 인식될 영상 데이터를 인식하도록 하는, 딥 러닝 인공신경망 기반 영상 인식 시스템이 제공된다.According to another aspect of the present invention, there is provided a deep learning-based image recognition apparatus, comprising: a processor; And a memory storing instructions that, when executed by the processor, cause the processor to acquire at least two deep learning models by training a deep learning model based on a deep learning framework by using the training image data. And, the processor selects a predetermined number of deep learning models from the deep learning models obtained in descending order of recognition accuracy for the verified image data, and the predetermined number is less than or equal to the number of acquired deep learning models. And, there is provided an image recognition system based on a deep learning artificial neural network that recognizes image data to be recognized using at least one of the selected deep learning models.

상기 명령어는, 상기 프로세서에 의해 실행될 때, 상기 프로세서로 하여금, 트레이닝 영상 데이터, 검증 영상 데이터 및 인식될 영상 데이터 중 적어도 하나에 대해 영상 전처리를 수행하게 할 수 있다.When executed by the processor, the command may cause the processor to perform image preprocessing on at least one of training image data, verification image data, and image data to be recognized.

상기 영상 전처리는, 랜덤 자르기, 회전, 뒤집기, 밝기 조정 및 콘트라스트 조정 중 적어도 하나를 포함할 수 있다.The image preprocessing may include at least one of random cropping, rotation, flipping, brightness adjustment, and contrast adjustment.

상기 명령어는, 상기 프로세서에 의해 실행될 때, 상기 프로세서로 하여금 메모리 데이터베이스를 확립하고, 사전 처리된 영상 데이터베이스를 사전 설정된 메모리 데이터베이스에 저장하도록 할 수 있다.When executed by the processor, the command may cause the processor to establish a memory database and store a pre-processed image database in a preset memory database.

상기 명령어는, 상기 프로세서에 의해 실행될 때, 상기 프로세서로 하여금, 선택된 딥 트레이닝 모델을 사용자에게 제공하고, 사용자에 의해 선택된 딥 러닝 모델을 획득하며, 사용자에 의해 선택된 딥 러닝 모델을 사용하여 인식될 영상 데이터를 인식하도록 할 수 있다.The instruction, when executed by the processor, causes the processor to provide a selected deep training model to a user, obtain a deep learning model selected by the user, and an image to be recognized using the deep learning model selected by the user. You can make your data aware.

상기 명령어는, 상기 프로세서에 의해 실행될 때, 상기 프로세서로 하여금 적어도 2 개의 딥 러닝 모델을 얻기 위해 트레이닝 영상 데이터를 사용함으로써 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝하는 과정에서, 트레이닝 과정에 대한 상태 정보를 사용자에게 푸시하도록 할 수 있다.The instruction, when executed by the processor, causes the processor to train a deep learning model based on a deep learning framework by using training image data to obtain at least two deep learning models. You can have the status information pushed to the user.

상기 명령어는, 상기 프로세서에 의해 실행될 때, 상기 프로세서로 하여금 적어도 2 개의 딥 러닝 모델을 얻기 위해 트레이닝 영상 데이터를 사용함으로써 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝하는 과정에서, 웹 애플리케이션 프로그래밍 인터페이스를 사용하여 현재 학습되고 있는 딥 러닝 모델의 성능 곡선을 실시간으로 그리도록 할 수 있다. The instruction, when executed by the processor, causes the processor to train a deep learning model based on a deep learning framework by using training image data to obtain at least two deep learning models, a web application programming interface It can be used to draw the performance curve of the deep learning model currently being trained in real time.

본 발명에 따른 딥 러닝 인공신경망 기반 영상 인식 방법 및 시스템에 의하면, 딥러닝 영상 인식 방법 및 장치, 및 컴퓨터 판독 가능 저장 매체에 의해 딥 러닝 프레임 워크 관련 기술의 기술적 문제를 부분적으로 해결하거나 완화시키는 효과를 가진다.According to the deep learning artificial neural network-based image recognition method and system according to the present invention, the deep learning image recognition method and device, and the effect of partially solving or mitigating the technical problem of the deep learning framework related technology by a computer-readable storage medium Have.

도 1은 본 발명의 일 실시례에 따른 딥 러닝 기반 영상 인식 방법을 도시한 흐름도이다.
도 2는 본 발명의 다른 실시례에 따른 딥 러닝 기반 영상 인식 방법을 도시한 흐름도이다.
도 3은 본 발명의 또 다른 실시례에 따른 딥 러닝 기반 영상 인식 방법을 도시한 흐름도이다.
도 4는 본 발명의 또 다른 실시례에 따른 딥 러닝 기반 영상 인식 방법을 도시한 흐름도이다.
도 5는 본 발명의 일 실시례에 따른 딥 러닝 기반 영상 인식 장치를 도시한 구성도이다.
도 6은 본 발명의 다른 실시례에 따른 딥 러닝 기반 영상 인식 장치를 도시한 구성도이다.
도 7은 본 발명의 일 실시례에 따른 컴퓨터 장치를 도시한 구성도이다.1 is a flowchart illustrating a deep learning-based image recognition method according to an embodiment of the present invention.
2 is a flowchart illustrating a deep learning-based image recognition method according to another embodiment of the present invention.
3 is a flowchart illustrating a deep learning-based image recognition method according to another embodiment of the present invention.
4 is a flowchart illustrating a deep learning-based image recognition method according to another embodiment of the present invention.
5 is a block diagram illustrating a deep learning-based image recognition apparatus according to an embodiment of the present invention.
6 is a block diagram illustrating a deep learning-based image recognition apparatus according to another embodiment of the present invention.
7 is a block diagram illustrating a computer device according to an embodiment of the present invention.

본 발명은 다양한 변경을 가할 수 있고, 여러 가지 실시례를 가질 수 있는 바, 특정 실시례들을 도면에 예시하고, 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니고, 본 발명의 기술 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 식으로 이해되어야 하고, 여러 가지 다른 형태로 변형될 수 있으며, 본 발명의 범위가 하기 실시례에 한정되는 것은 아니다. In the present invention, various modifications may be made and various embodiments may be provided, and specific embodiments will be illustrated in the drawings and described in detail. However, this is not intended to limit the present invention to a specific embodiment, and should be understood as including all changes, equivalents, or substitutes included in the technical spirit and scope of the present invention, and may be modified in various other forms. It can be, and the scope of the present invention is not limited to the following examples.

이하, 첨부된 도면을 참조하여 본 발명에 따른 실시례를 상세히 설명하며, 도면 부호에 관계없이 동일하거나 대응하는 구성요소에 대해서는 동일한 참조 번호를 부여하고, 이에 대해 중복되는 설명을 생략하기로 한다.Hereinafter, embodiments according to the present invention will be described in detail with reference to the accompanying drawings, and the same reference numerals are assigned to the same or corresponding components regardless of the reference numerals, and redundant descriptions thereof will be omitted.

도 1은 본 발명의 일 실시례에 따른 딥 러닝 기반 영상 인식 방법을 도시한 흐름도이다.1 is a flowchart illustrating a deep learning-based image recognition method according to an embodiment of the present invention.

도 1을 참조하면, 단계 101은 선택적인 단계로서, 처리될 영상 데이터에 대해 영상 전처리가 수행하게된다. 일부 실시례들에서, 처리될 영상 데이터는 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝하기 위한 트레이닝 영상 데이터, 트레이닝된 딥 러닝 모델을 검증하기 위한 검증 영상 데이터 및 인식될 영상 데이터 중 하나 이상을 포함할 수 있다. 그러나 이 방법에서는 영상 전처리 프로세스가 필요하지 않다는 것이 예시되어야한다. 즉, 본 방법은 원본 영상을 직접 처리하거나, 제 3 자가 처리한 영상 데이터를 처리할 수 있다. 이와 같이, 전처리 수행단계(101)는 선택사항이다.Referring to FIG. 1, step 101 is an optional step, and image preprocessing is performed on image data to be processed. In some embodiments, the image data to be processed includes at least one of training image data for training a deep learning model, verification image data for verifying the trained deep learning model, and image data to be recognized based on a deep learning framework. Can include. However, it should be exemplified that the image preprocessing process is not required in this method. That is, the present method may directly process an original image or may process image data processed by a third party. As such, the pre-processing step 101 is optional.

단계 102에서, 딥 러닝 모델은 트레이닝된 영상 데이터를 사용하여 딥 러닝 프레임 워크에 기초하여 트레이닝된 적어도 2 개의 딥 학습 모델을 획득한다. 일부 실시례에서, 딥 러닝 프레임 워크의 수는 2 개 이상일 수 있다.In step 102, the deep learning model acquires at least two deep learning models trained based on the deep learning framework by using the trained image data. In some embodiments, the number of deep learning frameworks may be two or more.

본 실시례에서 딥 러닝 프레임 워크는 PyTorch, Tensorflow, Caffe, Keras, MXNet 등일 수 있으며, 본 실시례에 제한되지 않는다. 트레이닝 영상 데이터를 이용하여 딥 학습 프레임 워크에 기초한 딥 러닝 모델을 트레이닝함으로써 적어도 2 개의 딥 러닝 모델이 획득될 수 있다. 예를 들어, 상이한 초기 파라미터를 설정하고 동일한 트레이닝 영상 데이터를 사용함으로써 상이한 딥 러닝 모델이 얻어질 수 있다.In this embodiment, the deep learning framework may be PyTorch, Tensorflow, Caffe, Keras, MXNet, and the like, and is not limited to this embodiment. At least two deep learning models may be obtained by training a deep learning model based on a deep learning framework using the training image data. For example, different deep learning models can be obtained by setting different initial parameters and using the same training image data.

선택적으로, 복수의 상이한 딥 러닝 모델은 동일한 트레이닝 영상 데이터를 이용하여 상이한 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝하거나, 동일한 트레이닝 영상 데이터를 이용하여, 상이한 활성화 기능을 갖는 딥 러닝 모델을 트레이닝함으로써, 또는 다른 트레이닝 영상 데이터 등을 사용하여 동일하거나 다른 딥 러닝 모델을 트레이닝함으로써 상이한 딥 러닝 모델이 얻어질 수 있다.Optionally, a plurality of different deep learning models train deep learning models based on different deep learning frameworks using the same training image data, or train deep learning models with different activation functions using the same training image data. By doing so, or by training the same or different deep learning models using different training image data or the like, different deep learning models can be obtained.

이들 상이한 딥 러닝 모델은 대응하는 초기 파라미터 설정 하에서 대응하는 트레이닝 영상 데이터로부터 대응하는 딥 러닝 프레임 워크에 의해 학습된 특징을 반영할 수 있어서, 딥 러닝 모델은 동일한 검증 영상 데이터에 대해 상이한 인식 정확도를 가질 수 있다.These different deep learning models can reflect the features learned by the corresponding deep learning framework from the corresponding training image data under the corresponding initial parameter setting, so that the deep learning model will have different recognition accuracy for the same verification image data. I can.

단계 103에서, 검증된 영상 데이터에 대한 인식 정확도의 내림차순으로 획득된 딥 러닝 모델로부터 미리 결정된 수의 딥 러닝 모델이 선택된다.In step 103, a predetermined number of deep learning models are selected from deep learning models obtained in descending order of recognition accuracy for the verified image data.

미리 정해진 수는 획득된 딥 러닝 모델의 수보다 작거나 같을 수 있으며, 미리 정해진 수는 특정 구현에서 시스템 성능 및 또는 구현 요구 사항 등에 따라 설정될 수 있으며, 본 발명에 제한되지 않는다. 예를 들어, 미리 정해진 수는 2일 수 있다.The predetermined number may be less than or equal to the number of acquired deep learning models, and the predetermined number may be set according to system performance and/or implementation requirements in a specific implementation, and is not limited to the present invention. For example, the predetermined number may be 2.

본 실시례에서, 트레이닝을 통해 복수의 딥 러닝 모델이 획득된 후, 종종 검증 세트 또는 검증 영상 데이터에 대해 트레이닝된 딥 러닝 모델의 인식 정확도를 검증할 필요가 있다. 따라서, 본 실시례에서, 검증 영상 데이터에 대한 트레이닝을 통해 획득된 각 딥 러닝 모델의 인식 정확도에 따라, 미리 결정된 수의 딥 러닝 모델이 인식 정확도의 내림차순으로 선택된다. 예를 들어, 소정 개수가 2인 경우, 검증 세트를 위한 트레이닝을 통해 획득된 딥 러닝 모델의 인식 정확도에 따라 트레이닝을 통해 획득된 딥 러닝 모델 중에서 정확도가 가장 높은 딥 러닝 모델 및 두 번째로 정확도가 높은 딥 러닝 모델을 선택할 수 있다.In this embodiment, after a plurality of deep learning models are acquired through training, it is often necessary to verify the recognition accuracy of the deep learning model trained on the verification set or verification image data. Accordingly, in this embodiment, according to the recognition accuracy of each deep learning model acquired through training on the verification image data, a predetermined number of deep learning models are selected in descending order of recognition accuracy. For example, if the predetermined number is 2, the deep learning model with the highest accuracy among the deep learning models acquired through training according to the recognition accuracy of the deep learning model obtained through training for the verification set and the second highest accuracy. You can choose a high deep learning model.

미리 결정된 수의 딥 러닝 모델을 선택하는 단계 이후, 선택된 딥 러닝 모델 중 적어도 하나를 사용하여 인식될 영상 데이터를 인식하는 단계를 수행할 수 있는데, 이는 선택된 딥 러닝 모델을 사용자에게 제공하는 단계(104)와, 사용자에 의해 선택된 딥 러닝 모델을 획득하고, 사용자에 의해 선택된 딥 러닝 모델을 사용하여 인식될 영상 데이터를 인식하는 단계(105)를 포함할 수 있다.After the step of selecting a predetermined number of deep learning models, the step of recognizing image data to be recognized using at least one of the selected deep learning models may be performed, which provides the selected deep learning model to a user (104 ), acquiring a deep learning model selected by the user, and recognizing image data to be recognized using the deep learning model selected by the user (105).

단계 104는 선택적인 단계로서, 선택된 딥 러닝 모델이 사용자에게 제공될 수 있다.Step 104 is an optional step, and the selected deep learning model may be provided to the user.

본 실시례에서, 검증 세트에 대한 인식 정확도의 내림차순으로 트레이닝을 통해 획득된 딥 러닝 모델로부터 소정의 딥 러닝 모델이 선택된 후, 선택된 딥 러닝 모델이 사용자에게 제공될 수 있다.In this embodiment, after a predetermined deep learning model is selected from a deep learning model acquired through training in descending order of recognition accuracy for a verification set, the selected deep learning model may be provided to the user.

특정 구현에서, 미리 정해진 수의 딥 러닝 모델이 검증 세트에 대한 인식 정확도의 내림차순으로 선택된 후에, 선택된 딥 러닝 모델이 저장될 수 있고, 저장된 딥 러닝 모델이 사용자에게 제공되어 사용자가 사용할 딥 러닝 모델을 선택할 수 있다. 예를 들어, 디스플레이 상에 디스플레이되는 사용자 인터페이스에 복수의 옵션 및 대응하는 설명이 제공되어, 사용자가 사용하기 위한 옵션을 선택할 수 있다.In certain implementations, after a predetermined number of deep learning models are selected in descending order of recognition accuracy for the validation set, the selected deep learning model may be stored, and the stored deep learning model is provided to the user to determine the deep learning model to be used by the user You can choose. For example, a plurality of options and corresponding descriptions are provided in a user interface displayed on the display, so that the user can select an option for use.

단계 105는 선택적인 단계로서, 사용자에 의해 선택된 딥 러닝 모델이 획득되고, 인식될 영상 데이터는 사용자에 의해 선택된 딥 러닝 모델을 사용하여 인식될 수 있으며, 이러한 구체적인 과정에 의하여 선택적인 단계에 해당될 수 있다.Step 105 is an optional step. A deep learning model selected by the user is obtained, and image data to be recognized can be recognized using a deep learning model selected by the user. I can.

본 실시례에서, 새로운 영상 데이터가 수신된 후, 사용되는 딥 러닝 모델은 예를 들어, 사용자 인터페이스 상에서 사용자에 의해 선택될 수 있는 옵션에 따라 결정될 수 있고, 인식될 영상 데이터는 딥 러닝 모델을 사용하여 인식될 수 있다.In this embodiment, after new image data is received, the deep learning model to be used may be determined according to an option that can be selected by the user on, for example, a user interface, and the image data to be recognized uses a deep learning model. Can be recognized.

보다 일반적으로, 획득된 딥 러닝 모델들로부터 소정 개수의 딥 러닝 모델들이 검증 영상 데이터에 대한 인식 정확도의 내림차순으로 선택된 후, 인식될 영상 데이터는 선택된 딥 러닝 모델중 적어도 하나를 사용하여 인식될 수 있다. 예를 들어, 적어도 하나의 딥 러닝 모델은 사용자에 의해 수동으로 선택되지 않을 수 있지만, 컴퓨터, 서버 등에 의해 자동으로 선택될 수 있다(예를 들어, 최고 정확도를 갖는 딥 러닝 모델, 트레이닝 수렴 속도가 가장 빠른 딥 러닝 모델, 인식 속도가 가장 빠른 딥 러닝 모델 또는 다른 조건을 만족하는 딥 러닝 모델). 즉, 이 단계는 사용자의 참여없이 완전히 자동화될 수 있다.More generally, after a predetermined number of deep learning models are selected from the acquired deep learning models in descending order of recognition accuracy for the verification image data, the image data to be recognized may be recognized using at least one of the selected deep learning models. . For example, at least one deep learning model may not be manually selected by the user, but may be automatically selected by a computer, server, etc. (e.g., a deep learning model with the highest accuracy, a training convergence rate is The fastest deep learning model, the deepest learning model with the fastest recognition speed, or a deep learning model that satisfies other conditions). In other words, this step can be fully automated without user participation.

딥 러닝 기반 영상 인식 방법에서, 처리될 영상 데이터에 대해 영상 전처리가 수행된 후, 딥 러닝 모델은 트레이닝 영상 데이터를 사용하여 적어도 2 개의 딥 러닝 모델을 획득함으로써 딥 러닝 프레임 워크에 기초하여 트레이닝될 수 있다. 검증 세트에 대한 인식 정확도의 내림차순으로 트레이닝을 통해 획득된 딥 러닝 모델로부터 미리 결정된 수의 딥 러닝 모델이 선택되고, 선택된 딥 러닝 모델이 사용자에게 제공되고, 사용자에 의해 선택된 딥 러닝 모델이 획득된다. 인식될 영상 데이터는 사용자에 의해 선택된 딥 러닝 모델을 사용하여 인식된다.In the deep learning-based image recognition method, after image preprocessing is performed on the image data to be processed, the deep learning model can be trained based on the deep learning framework by acquiring at least two deep learning models using the training image data. have. A predetermined number of deep learning models are selected from the deep learning models acquired through training in descending order of recognition accuracy for the verification set, the selected deep learning models are provided to the user, and the deep learning models selected by the user are obtained. Image data to be recognized is recognized using a deep learning model selected by a user.

이러한 방식으로, 딥 러닝 프레임워크를 제공하기 위한 전반적인 솔루션이 실현될 수 있으며, 이는 사용자가 딥 러닝 모델을 획득한 다음, 획득된 딥 러닝 모델을 통해 수신된 영상 데이터를 인식하는 데 편리하고, 영상 인식의 정확성을 향상시키고 사용자 경험을 향상시킬 수 있다. In this way, an overall solution for providing a deep learning framework can be realized, which is convenient for a user to acquire a deep learning model and then recognize the image data received through the acquired deep learning model, and It can improve recognition accuracy and improve user experience.

일부 실시례에서, 딥 러닝 프레임 워크의 수는 2 개 이상일 수 있는데, 선택된 딥 러닝 모델을 사용자에게 제공하는 단계(102)에서, 다수의 딥 러닝 프레임워크가 동일하거나 2개 이상일 수 있다.In some embodiments, the number of deep learning frameworks may be two or more, and in step 102 of providing the selected deep learning model to a user, the plurality of deep learning frameworks may be the same or two or more.

도 2는 본 발명의 다른 실시례에 따른 딥 러닝 기반 영상 인식 방법을 도시한 흐름도이다.2 is a flowchart illustrating a deep learning-based image recognition method according to another embodiment of the present invention.

도 2에 도시된 바와 같이, 도 1에 도시된 실시례의 단계 101은 다음 단계를 포함할 수 있다.As shown in FIG. 2, step 101 of the embodiment shown in FIG. 1 may include the following steps.

단계 201에서, 처리될 영상 데이터에 대해 랜덤 자르기, 회전, 뒤집기, 밝기 조정 및 대비 조정 중 하나 이상의 동작이 수행된다.In step 201, one or more operations of random cropping, rotation, flipping, brightness adjustment, and contrast adjustment are performed on the image data to be processed.

본 실시례에서, 영상 데이터가 트레이닝되기 전에, 먼저 처리될 영상 데이터에 대해 영상 전처리를 수행해야 하고, 이는 처리될 영상 데이터에 대해 랜덤 자르기, 회전, 뒤집기, 밝기 조정 및 또는 콘트라스트 조정 등과 같은 동작을 수행하는 것을 포함한다.In this embodiment, before the image data is trained, it is necessary to first perform image pre-processing on the image data to be processed, which performs operations such as random cropping, rotation, flipping, brightness adjustment, and contrast adjustment, etc. Includes performing.

또한, 단계 201 뒤에, 본 방법은 다음 단계를 더 포함할 수 있다.Further, after step 201, the method may further include the following steps.

단계 202에서, 전처리된 영상 데이터는 사전 확립된 메모리 데이터베이스에 저장된다.In step 202, the preprocessed image data is stored in a pre-established memory database.

본 실시례에서, 미리 확립된 메모리 데이터베이스는 Lightning Memory-Mapped Database(이하 간략히 LMDB) 또는 LevelDB 등과 같은 딥 러닝 데이터베이스 일 수 있다.In this embodiment, the pre-established memory database may be a deep learning database such as Lightning Memory-Mapped Database (hereinafter, simply LMDB) or LevelDB.

물론, 다른 유형의 데이터베이스가 상기 메모리 데이터베이스로서 사용될 수도 있다. 사용되는 메모리 데이터베이스의 특정 유형은 본 실시례에서 제한되지 않는다.Of course, other types of databases could also be used as the memory database. The specific type of memory database used is not limited in this embodiment.

본 실시례에서, 처리될 영상 데이터에 대해 영상 전처리가 수행된 후, 단계 102는 직접 수행될 수 있고, 또는 단계 202가 먼저 수행될 수 있고, 단계 102가 수행되는 경우, 단계 102에서, 딥 러닝 모델을 얻기 위해, 메모리 데이터베이스에 저장된 트레이닝 영상 데이터를 사용함으로써, 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델이 트레이닝될 수 있다.In this embodiment, after image preprocessing is performed on the image data to be processed, step 102 may be directly performed, or step 202 may be performed first, and if step 102 is performed, in step 102, deep learning In order to obtain a model, a deep learning model can be trained based on a deep learning framework by using training image data stored in a memory database.

도 3은 본 발명의 또 다른 실시 예에 따른 딥 러닝 기반 영상 인식 방법을 도시한 흐름도이다.3 is a flowchart illustrating a deep learning-based image recognition method according to another embodiment of the present invention.

도 3에 도시된 바와 같이, 도 1에 도시된 실시례의 단계 102 뒤에, 본 방법은 다음 단계를 더 포함할 수 있다As shown in Fig. 3, after step 102 of the embodiment shown in Fig. 1, the method may further include the following steps:

단계 301에서, 트레이닝 영상 데이터를 사용하여 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝 하는 프로세스에서, 적어도 2 개의 딥 러닝 모델을 획득하기 위해, 트레이닝 프로세스에 대한 상태 정보가 사용자에게 푸시된다.In step 301, in the process of training the deep learning model based on the deep learning framework using the training image data, in order to obtain at least two deep learning models, state information on the training process is pushed to the user.

본 실시례에서, 사용자가 트레이닝 과정에 집중하는 것을 용이하게 하기 위해, 트레이닝 영상 데이터를 사용하여 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝시키는 과정에서 딥 러닝 모델을 얻는 단계, 오류, 정보 또는 경고 등과 같은 트레이닝 프로세스에 대한 상태 정보는 WeChat 또는 QQ 등과 같이 사용자에 의해 등록된 인스턴트 메시징 소프트웨어의 계정으로 실시간으로 푸시될 수 있다.In this embodiment, in order to facilitate the user to concentrate on the training process, the step of obtaining a deep learning model in the process of training a deep learning model based on a deep learning framework using training image data, error, information or Status information about the training process, such as alerts, can be pushed in real time to an account of instant messaging software registered by the user, such as WeChat or QQ.

물론, 트레이닝 과정에 대한 상태 정보는 사용자가 등록한 이메일 계정으로 실시간으로 푸시될 수도 있고, 트레이닝 과정에 대한 상태 정보는 또한 짧은 메시지. 트레이닝 과정에 대한 상태 정보를 사용자에게 푸시할 수 있는 한, 트레이닝 과정에 대한 상태 정보를 푸시하는 방식은 본 실시례에서 제한되지 않는다.Of course, the status information about the training course can be pushed in real time to the email account registered by the user, and the status information about the training course can also be sent in short messages. As long as the state information about the training process can be pushed to the user, the method of pushing the state information about the training process is not limited in this embodiment.

도 4는 본 발명의 또 다른 실시례에 따른 딥 러닝 기반 영상 인식 방법의 예시적인 흐름도이다.4 is an exemplary flowchart of a deep learning-based image recognition method according to another embodiment of the present invention.

도 4는 본 발명의 또 다른 실시 예에 따른 딥 러닝 기반 영상 인식 방법을 도시한 흐름도이다.4 is a flowchart illustrating a deep learning-based image recognition method according to another embodiment of the present invention.

단계 401에서, 트레이닝 영상 데이터를 사용하여 딥 러닝 프레임 워크를 기반으로 딥 러닝 모델을 트레이닝시키는 과정에서, 적어도 2 개의 딥 러닝 모델을 획득하는 단계에서, 현재 트레이닝되고 있는 딥 러닝 모델에 대한 성능 곡선이 웹 애플리케이션 프로그래밍 인터페이스(API)를 사용하여 실시간으로 도출된다.In step 401, in the process of training the deep learning model based on the deep learning framework using the training image data, in the step of acquiring at least two deep learning models, a performance curve for the deep learning model currently being trained is Derived in real time using a web application programming interface (API).

단계 402에서, 도출된 성능 곡선이 제시된다.In step 402, the derived performance curve is presented.

본 실시례에서, 트레이닝 영상 데이터를 이용하여 딥 러닝 프레임 워크를 기반으로 딥 러닝 모델을 트레이닝 \는 과정에서 딥 러닝 모델, 트레이닝 손실과 같은 성능 곡선, 트레이닝 정확도 및 또는 컨퓨전 매트릭스(confusion matrix) 등을 얻는다. 현재 학습되고 있는 딥 러닝 모델 중 하나는 웹 API(예를 들어, 크레용 또는 텐서 보드 등)를 사용하여 실시간으로 그려 질 수 있고,사용자에게 제시된다.In this embodiment, in the process of training a deep learning model based on a deep learning framework using training image data, a deep learning model, a performance curve such as training loss, training accuracy, or a confusion matrix, etc. Get One of the deep learning models currently being trained can be drawn in real time using a web API (for example, a crayon or a tensor board) and presented to the user.

본 발명의 일부 실시례에 따른 딥 러닝 기반 영상 인식 방법은 사용자가 딥 러닝 모델을 획득하기에 편리한 딥 러닝 프레임 워크의 전반적인 솔루션을 제공하고, 이를 통해 수신된 영상 데이터의 인식을 실현할 수 있다. 딥 러닝 모델을 획득하여 영상 인식의 정확성을 향상시키고,사용자 경험을 향상시킨다.The deep learning-based image recognition method according to some embodiments of the present invention provides an overall solution of a deep learning framework that is convenient for a user to acquire a deep learning model, and through this, it is possible to realize the recognition of received image data. Acquire a deep learning model to improve the accuracy of image recognition and improve user experience.

도 5는 본 발명의 일 실시례에 따른 딥 러닝 기반 영상 인식 장치를 도시한 구성도이다. 본 발명의 일 실시례에 따른 딥 러닝 기반 영상 인식 장치는 본 발명의 실시례에 따른 딥 러닝 기반 영상 인식 방법을 수행할 수 있다. 5 is a block diagram illustrating a deep learning-based image recognition apparatus according to an embodiment of the present invention. The deep learning-based image recognition apparatus according to an embodiment of the present invention may perform the deep learning-based image recognition method according to the embodiment of the present invention.

도 5에 도시된 바와 같이, 딥 러닝 기반 영상 인식 장치는 선택적 영상 전처리 모듈(51), 트레이닝 모듈(52), 모델 스크리닝 모듈(53), 선택적 제공 모듈(54) 및 인식 모듈(55)을 포함할 수 있다.As shown in FIG. 5, the deep learning-based image recognition apparatus includes an optional image preprocessing module 51, a training module 52, a model screening module 53, an optional providing module 54, and a recognition module 55. can do.

선택적 영상 전처리 모듈(51)은 처리될 영상 데이터에 대해 영상 전처리를 수행하도록 구성된다.The optional image preprocessing module 51 is configured to perform image preprocessing on image data to be processed.

트레이닝 모듈(52)은 트레이닝 영상 데이터를 사용하여 딥 러닝 프레임 워크에 기초한 딥 러닝 모델을 트레이닝시켜 적어도 2 개의 딥 러닝 모델을 획득하도록 구성된다. 본 실시례에서 딥 러닝 프레임 워크는 PyTorch, Tensorflow, Caffe, Keras, MXNet 등일 수 있으며, 본 실시례에 제한되지 않는다. 트레이닝 모듈(52)은 트레이닝 영상 데이터를 이용하여 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝시키며, 이는 적어도 2 개의 딥 러닝 모델을 얻을 수 있다.The training module 52 is configured to acquire at least two deep learning models by training a deep learning model based on the deep learning framework using the training image data. In this embodiment, the deep learning framework may be PyTorch, Tensorflow, Caffe, Keras, MXNet, and the like, and is not limited to this embodiment. The training module 52 trains a deep learning model based on a deep learning framework using the training image data, which can obtain at least two deep learning models.

모델 스크리닝 모듈(53)은 검증 영상 데이터에 대한 인식 정확도의 내림차순으로 트레이닝 모듈(52)에 의해 획득된 딥 러닝 모델로부터 미리 결정된 수의 딥 러닝 모델을 선택하도록 구성되고; 여기서 미리 결정된 수는 획득된 딥 러닝 모델의 수 이하이다. 소정의 수는 특정 구현에서 시스템 성능 및 또는 구현 요구 사항 등에 따라 설정될 수 있으며, 본 실시례에 제한되지 않는다. 예를 들어, 미리 정해진 수는 2일 수 있다.The model screening module 53 is configured to select a predetermined number of deep learning models from the deep learning models obtained by the training module 52 in descending order of recognition accuracy for the verification image data; Here, the predetermined number is less than or equal to the number of acquired deep learning models. The predetermined number may be set according to system performance and/or implementation requirements in a specific implementation, and is not limited to this embodiment. For example, the predetermined number may be 2.

본 실시례에서, 트레이닝 모듈(52)은 딥 학습 모델을 얻기 위해 트레이닝을 수행 한 후, 검증 세트에 대한 딥 학습 모델의 인식 정확도를 검증할 필요가 있다. 따라서, 본 실시례에서, 모델 스크리닝 모듈(53)은 검증 세트에 대한 트레이닝을 통해 획득된 딥 러닝 모델의 인식 정확도의 내림차순으로 미리 결정된 수의 딥 러닝 모델을 선택한다.In this embodiment, after performing training to obtain the deep learning model, the training module 52 needs to verify the recognition accuracy of the deep learning model with respect to the verification set. Accordingly, in this embodiment, the model screening module 53 selects a predetermined number of deep learning models in descending order of the recognition accuracy of the deep learning model obtained through training on the verification set.

예를 들어, 기 설정된 개수가 2인 경우, 모델 스크리닝 모듈(53)은 획득된 딥 러닝 모델의 인식 정확도에 따라 트레이닝을 통해 획득된 딥 러닝 모델 중에서 가장 높은 인식 정확도와 두 번째로 높은 인식 정확도를 갖는 딥 러닝 모델을 검증 세트에 대한 교육을 통해 선택할 수 있다.For example, when the preset number is 2, the model screening module 53 provides the highest recognition accuracy and the second highest recognition accuracy among the deep learning models acquired through training according to the recognition accuracy of the acquired deep learning model. You can select the deep learning model you have through training on the validation set.

선택적 제공 모듈(54)은 모델 스크리닝 모듈(53)에 의해 선택된 딥 러닝 모델을 사용자에게 제공하도록 구성된다. 본 실시례에서, 모델 스크리닝 모듈(53)은 검증 세트에 대한 인식 정확도의 내림차순으로 트레이닝을 통해, 획득된 딥 러닝 모델로부터 미리 결정된 수의 딥 러닝 모델을 선택한 후, 제공 모듈(54)은 선택된 딥러닝 모델을 사용자에게 제공할 수 있다.The optional providing module 54 is configured to provide the deep learning model selected by the model screening module 53 to the user. In this embodiment, the model screening module 53 selects a predetermined number of deep learning models from the acquired deep learning models through training in descending order of recognition accuracy for the verification set, and then the providing module 54 selects the selected deep learning models. The running model can be provided to the user.

특정 구현에서, 모델 스크리닝 모듈(53)은 검증 세트에 대한 인식 정확도의 내림차순으로 미리 결정된 수의 딥 러닝 모델을 선택한 후에, 모델 스크리닝 모듈(53)은 선택된 딥 러닝 모델을 제공한 다음 제공 모델을 저장할 수 있다. 모델 스크리닝 모델(53)에 의해 저장된 딥 러닝 모델을 사용자에게 제공하여, 사용자가 사용하기 위한 딥 러닝 모델을 선택할 수 있도록 한다.In a specific implementation, the model screening module 53 selects a predetermined number of deep learning models in descending order of recognition accuracy for the validation set, and then the model screening module 53 provides the selected deep learning model and then stores the provided model. I can. The deep learning model stored by the model screening model 53 is provided to the user, so that the user can select a deep learning model to use.

선택적 인식 모듈(55)은 사용자에 의해 선택된 딥 러닝 모델을 획득하고, 사용자에 의해 선택된 딥 러닝 모델을 통해 인식될 영상 데이터를 인식하도록 구성된다.The optional recognition module 55 is configured to acquire a deep learning model selected by a user and recognize image data to be recognized through the deep learning model selected by the user.

본 실시례에서, 새로운 영상 데이터가 수신된 후, 인식 모듈(55)은 사용자에 의해 선택된 딥 러닝 모델을 획득하고, 사용자에 의해 선택된 딥 러닝 모델을 통해 수신된 영상 데이터의 인식을 실현할 수 있다.In this embodiment, after the new image data is received, the recognition module 55 may acquire a deep learning model selected by the user and realize recognition of the image data received through the deep learning model selected by the user.

상술 한 바와 같이, 인식 모듈(55)은 획득된 딥 러닝 모델 중 소정 개수의 딥 러닝 모델이 선택된 후, 사용자의 참여없이 선택된 딥 러닝 모델 중 적어도 하나를 이용하여 인식될 영상 데이터를 인식할 수 있다. 검증 영상 데이터에 대한 인식 정확도의 내림차순. 예를 들어, 적어도 하나의 딥 러닝 모델은 사용자에 의해 수동으로 선택되지 않을 수 있지만, 컴퓨터, 서버 등에 의해 자동으로 선택될 수 있다(예를 들어, 정확도가 가장 높은 딥 러닝 모델, 트리이닝 수렴 속도가 가장 빠른 딥 러닝 모델, 인식 속도가 가장 빠른 딥 러닝 모델 또는 다른 조건을 만족하는 딥 러닝 모델이 이러한 딥 러닝 모델에서 인식된다.).As described above, after a predetermined number of deep learning models are selected from the acquired deep learning models, the recognition module 55 may recognize image data to be recognized using at least one of the selected deep learning models without user participation. . Descending order of recognition accuracy for verification image data. For example, at least one deep learning model may not be manually selected by the user, but may be automatically selected by a computer, server, etc. Is the fastest deep learning model, the deepest learning model with the fastest recognition speed, or a deep learning model that satisfies other conditions is recognized by these deep learning models).

딥 러닝 기반 영상 인식 장치에서, 선택적 영상 전처리 모듈(51)은 처리될 영상 데이터에 대해 영상 전처리를 수행한 후, 트레이닝 모듈(52)은 적어도 트리이닝 영상 데이터를 이용하여 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트리이닝시킨다. 두 개의 딥 러닝 모델에 이어, 모델 스크리닝 모듈(53)은 검증 세트에 대한 인식 정확도의 내림차순으로 트레이닝을 통해 획득된 딥 러닝 모델로부터 미리 결정된 수의 딥 기대 모델을 선택하고, 선택적인 제공 모듈(54)은 선택된 딥 러닝을 제공한다. 선택적 인식 모듈(55)은 사용자에 의해 선택된 딥 러닝 모델을 획득하고, 사용자에 의해 선택된 딥 러닝 모델을 사용하여 수신된 영상 데이터를 인식할 수 있다. 이러한 방식으로, 딥 러닝 프레임 워크를 제공하기위한 전반적인 솔루션이 실현될 수 있으며, 이는 사용자가 딥 러닝 모델을 획득 한 다음, 획득된 딥 러닝 모델을 통해 수신된 영상 데이터의 인식을 실현함으로써 영상의 정확성을 향상 시킨다.In the deep learning-based image recognition apparatus, after the selective image preprocessing module 51 performs image preprocessing on the image data to be processed, the training module 52 uses at least the treering image data based on the deep learning framework. Tree deep learning models. Following the two deep learning models, the model screening module 53 selects a predetermined number of deep expectation models from the deep learning models acquired through training in descending order of recognition accuracy for the verification set, and an optional provision module 54 ) Provides the selected deep learning. The selective recognition module 55 may acquire the deep learning model selected by the user and recognize the received image data using the deep learning model selected by the user. In this way, an overall solution for providing a deep learning framework can be realized, which means that the user acquires a deep learning model, and then realizes the recognition of the image data received through the acquired deep learning model, resulting in the accuracy of the image. Improves.

도 6은 본 발명의 다른 실시례에 따른 딥 러닝 기반 영상 인식 장치의 예시적인 구조도이다.6 is an exemplary structural diagram of an image recognition apparatus based on deep learning according to another embodiment of the present invention.

도 6은 도 5에 도시된 딥 러닝 기반 영상 인식 장치와 상이하다. 영상 전처리 모듈(51)은 처리될 영상 데이터에 대해 랜덤 자르기, 회전, 뒤집기, 밝기 조정 및 콘트라스트 조정과 같은 다음의 동작 중 하나 또는 조합을 수행하도록 구체적으로 구성될 수 있다.6 is different from the deep learning-based image recognition device shown in FIG. 5. The image preprocessing module 51 may be specifically configured to perform one or a combination of the following operations such as random cropping, rotation, flipping, brightness adjustment, and contrast adjustment on image data to be processed.

본 실시례에서, 트레이닝 모듈(52)이 영상 데이터를 트레이닝시키기 전에, 영상 전처리 모듈(51)은 먼저 처리될 영상 데이터에 대해 영상 전처리를 수행해야하고, 이는 처리될 영상 데이터에 대해 랜덤 자르기, 회전, 뒤집기, 밝기 조정 및 또는 콘트라스트 조정 등과 같은 동작을 수행하는 것을 포함한다.In this embodiment, before the training module 52 trains the image data, the image pre-processing module 51 must first perform image pre-processing on the image data to be processed, which is a random cropping and rotating image data to be processed. , Flipping, adjusting brightness and/or adjusting contrast, and the like.

또한, 딥 러닝 기반 영상 인식 장치는 데이터베이스 설정 모듈(56) 및 저장 모듈(57)을 포함하는데, 데이터베이스 설정 모듈(56)은 메모리 데이터베이스를 구축하도록 구성되고, 저장 모듈(57)은 영상 전처리 모듈(51)이 처리될 영상 데이터에 대해 영상 전처리를 수행한 후, 데이터베이스 설정 모듈(56)에 의해 사전 확립된 메모리 데이터베이스에 처리된 영상 데이터를 저장하도록 구성된다.In addition, the deep learning-based image recognition apparatus includes a database setting module 56 and a storage module 57, the database setting module 56 is configured to build a memory database, and the storage module 57 is an image preprocessing module ( 51) is configured to store the processed image data in a memory database previously established by the database setting module 56 after performing image pre-processing on the image data to be processed.

본 실시례에서, 데이터 설정 모듈(56)에 의해 사전 구축된 메모리 데이터베이스는 LMDB 또는 LevelDB 등과 같은 딥 러닝 데이터베이스 일 수 있다. 물론, 다른 유형의 데이터베이스도 상기 메모리로서 사용될 수 있다. 사용되는 메모리 데이터베이스의 특정 유형은 본 실시례에서 제한되지 않는다.In this embodiment, the memory database pre-built by the data setting module 56 may be a deep learning database such as LMDB or LevelDB. Of course, other types of databases can also be used as the memory. The specific type of memory database used is not limited in this embodiment.

영상 전처리 모듈(51)은 처리될 영상 데이터에 대해 영상 전처리를 수행한 후, 저장 모듈(57)은 데이터베이스 설정 모듈(56)에 의해 사전 확립된 메모리 데이터베이스에 처리된 영상 데이터를 저장할 수 있다.After the image preprocessing module 51 performs image preprocessing on the image data to be processed, the storage module 57 may store the processed image data in a memory database previously established by the database setting module 56.

또한, 딥 러닝 기반 영상 인식 장치는 메시지 푸싱 모듈(58)을 포함할 수 있다. 메시지 푸싱 모듈(58)은 딥 러닝 모델을 얻기 위해 트레이닝 영상 데이터를 사용하는 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝시키는 과정에서 트레이닝 과정에 대한 상태 정보를 사용자에게 푸시하도록 구성된다.In addition, the deep learning-based image recognition device may include a message pushing module 58. The message pushing module 58 is configured to push state information about a training process to a user in a process of training a deep learning model based on a deep learning framework that uses training image data to obtain a deep learning model.

본 실시례에서, 사용자가 트레이닝 과정에 집중하는 것을 용이하게 하기 위해, 트레이닝 영상 데이터를 이용하여 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝시키는 과정에서, 딥 러닝 모델을 얻기 위해, 메시지 푸싱 모듈(58)은 에러, 정보 또는 경고 등과 같은 트레이닝 프로세스에 대한 상태 정보를 WeChat 또는 QQ 등과 같이 사용자에 의해 등록된 인스턴트 메시징 소프트웨어의 계정으로 실시간으로 푸시할 수 있다.In this embodiment, in order to facilitate the user to concentrate on the training process, in the process of training a deep learning model based on a deep learning framework using training image data, in order to obtain a deep learning model, a message pushing module 58 can push status information about the training process, such as errors, information or warnings, in real time to an account of instant messaging software registered by the user, such as WeChat or QQ.

물론, 메시지 푸싱 모듈(58)은 또한 트레이닝 프로세스에 대한 상태 정보를 사용자에 의해 등록된 이메일의 계정으로 실시간 푸시할 수 있거나, 메시지 푸싱 모듈(58)은 또한 트레이닝 프로세스에 대한 상태 정보를 단문 메시지를 통한 사용자의 휴대 전화. 메시지 푸싱 모듈(58)에 의해 트레이닝 과정에 대한 상태 정보를 푸시하는 방식은 트레이닝 과정에 대한 상태 정보가 사용자에게 푸시될 수 있는 한, 본 실시례에서 제한되지 않는다.Of course, the message pushing module 58 can also push status information about the training process in real time to an account in an email registered by the user, or the message pushing module 58 can also send a short message about status information about the training process. Through the user's mobile phone. The method of pushing the status information for the training process by the message pushing module 58 is not limited in this embodiment as long as the status information for the training process can be pushed to the user.

또한, 딥 러닝 기반 영상 인식 장치는 실시간 모니터링 모듈(59) 및 프리젠테이션 모듈(510)을 더 포함할 수 있다.In addition, the deep learning-based image recognition device may further include a real-time monitoring module 59 and a presentation module 510.

실시간 모니터링 모듈(59)은 트레이닝 영상 데이터를 사용하여 트레이닝 모듈(52)에 의해 딥 러닝 프레임 워크에 기초하여 딥 러닝 모델을 트레이닝시키는 과정에서 딥 러닝 모델을 얻기 위해 웹 API를 사용하여 실시간으로 딥 러닝 모델에 대한 성능 곡선을 도출되도록 구성된다. The real-time monitoring module 59 uses the training image data to train the deep learning model based on the deep learning framework by the training module 52 in real time using a web API to obtain a deep learning model. It is configured to derive a performance curve for the model.

프리젠테이션 모듈(510)은 실시간 모니터링 모듈(59)에 의해 도출된 성능 곡선을 제시하도록 구성된다.The presentation module 510 is configured to present a performance curve derived by the real-time monitoring module 59.

본 실시례에서, 트레이닝 영상 데이터를 이용하여 딥 러닝 프레임 워크를 기반으로 딥 러닝 모델을 트레이닝하는 과정에서 딥 러닝 모델, 트레이닝 손실과 같은 성능 곡선, 트레이닝 정확도, 및 또는 컨퓨전 매트릭스 등을 얻는다. 현재 트레이닝되고 있는 딥 러닝 모델의 웹 인터페이스는 실시간 모니터링 모듈(59)에 의해 웹 API(예를 들어, 크레용 또는 텐서 보드 등)를 사용하여 실시간으로 도출될 수 있고 사용자에게 제시된다.In this embodiment, in the process of training a deep learning model based on a deep learning framework using training image data, a deep learning model, a performance curve such as training loss, training accuracy, and/or a confusion matrix are obtained. The web interface of the deep learning model currently being trained can be derived in real time by using a web API (eg, crayon or tensor board) by the real-time monitoring module 59 and presented to the user.

본 발명의 일부 실시례에 따른 딥 러닝 기반 영상 인식 장치는 딥 러닝 모델을 획득하기에 편리한 딥 러닝 프레임 워크의 전반적인 솔루션을 제공하고, 이를 통해 수신된 영상 데이터의 인식을 실현할 수 있다. 딥 러닝 모델을 획득하여 영상 인식의 정확성을 향상시키고, 사용자 경험을 향상시킨다.The deep learning-based image recognition apparatus according to some embodiments of the present invention provides an overall solution of a deep learning framework that is convenient for acquiring a deep learning model, and through this, it is possible to realize the recognition of received image data. Acquire a deep learning model to improve the accuracy of image recognition and improve user experience.

도 7은 본 발명의 일 실시례에 따른 컴퓨터 장치를 도시한 구성도이다.7 is a block diagram illustrating a computer device according to an embodiment of the present invention.

도 7에서와 같이, 컴퓨터 장치는 메모리에 저장되고 프로세서상에서 실행 가능한 메모리, 프로세서 및 컴퓨터 프로그램을 포함할 수 있다. 프로세서에 의해 실행될 때, 컴퓨터 프로그램은 본 개시의 일부 실시례에 따른 딥 러닝 기반 영상 인식 방법을 수행할 수 있다.As shown in FIG. 7, the computer device may include a memory, a processor, and a computer program stored in a memory and executable on a processor. When executed by the processor, the computer program may perform a deep learning-based image recognition method according to some embodiments of the present disclosure.

컴퓨터 장치는 단말 장치 또는 서버일 수 있다. 컴퓨터 장치의 특정 형태는 본 실시례에서 제한되지 않는다.The computer device may be a terminal device or a server. The specific form of the computer device is not limited in this embodiment.

도 7에 도시된 컴퓨터 장치(12)는 예시일 뿐이며, 본 발명의 실시례들의 기능 및 사용 범위에 어떠한 제한도 부과해서는 안된다.The computer device 12 shown in FIG. 7 is for illustrative purposes only, and no limitations should be imposed on the functions and scope of use of the embodiments of the present invention.

컴퓨터 장치(12)는 범용 컴퓨팅 장치의 형태로 구현된다. 컴퓨터 장치(12)의 구성 요소는 하나 이상의 프로세서 또는 처리 장치(16), 시스템 메모리(28), 상이한 시스템 구성 요소(시스템 메모리(28) 및 처리 장치(16) 포함)를 연결하기 위한 버스(18)를 포함할 수 있지만, 이에 제한되지는 않는다.The computer device 12 is implemented in the form of a general-purpose computing device. Components of the computer device 12 include one or more processors or processing units 16, system memory 28, and a bus 18 for connecting different system components (including system memory 28 and processing unit 16). ), but is not limited thereto.

컴퓨터 장치(12)는 전형적으로 다양한 컴퓨터 시스템 판독 가능 매체를 포함한다. 이들 매체는 휘발성 및 비 휘발성 매체, 이동식 및 비 이동식 매체를 포함하여 컴퓨터 장치(12)에 의해 액세스될 수있는 임의의 이용 가능한 매체 일 수 있다.Computer device 12 typically includes a variety of computer system readable media. These media may be any available media that can be accessed by computer device 12, including volatile and non-volatile media, removable and non-removable media.

시스템 메모리(28)는 랜덤 액세스 메모리(RAM)(30) 및 또는 캐시 메모리(32)와 같은 휘발성 메모리 형태의 컴퓨터 시스템 판독 가능 매체를 포함할 수 있다. 컴퓨터 장치(12)는 다른 이동식 / 비- 이동식 및 휘발성 / 비 휘발성 컴퓨터 시스템 저장 매체. 단지 예로서, 저장 시스템(34)은 비 이동식 및 비 휘발성 자기 매체(도 7에 도시되지 않음)(일반적으로 "하드 드라이버"로 지칭 됨)로부터 판독 및 기록하는데 사용될 수 있다. 도 7에 도시된 바와 같이, 이동식 및 비 휘발성 자기 디스크(예를 들어, "플로피 디스크")로부터 읽고 쓰기 위한 자기 디스크 드라이버 및 이동식 및 비 휘발성 광 디스크에서 읽고 쓰기 위한 광학 디스크 드라이버(예를 들어, CD-ROM(Compact Disc Read Only Memory), 이하에서는 DVD-ROM(Digital Video Disc Read Only Memory) 또는 기타 광 매체(Optical Media)와 같은 경우가 있다. 하나 이상의 데이터 매체 인터페이스를 통한 버스(18), 메모리(28)는 본 개시의 다양한 실시례의 기능을 수행하도록 구성된(예를 들어, 적어도 하나의) 프로그램 모듈 그룹을 갖는 적어도 하나의 프로그램 제품을 포함할 수 있다.The system memory 28 may include a computer system readable medium in the form of a volatile memory such as random access memory (RAM) 30 and or cache memory 32. Computer device 12 is another removable/non-removable and volatile/non-volatile computer system storage medium. By way of example only, storage system 34 can be used to read and write from non-removable and non-volatile magnetic media (not shown in FIG. 7) (generally referred to as “hard drivers”). As shown in Fig. 7, a magnetic disk driver for reading and writing from a removable and nonvolatile magnetic disk (e.g., "floppy disk") and an optical disk driver for reading and writing from a removable and nonvolatile optical disk (e.g., There are cases such as CD-ROM (Compact Disc Read Only Memory), hereinafter, DVD-ROM (Digital Video Disc Read Only Memory) or other optical media, etc. Bus 18 through one or more data medium interfaces, The memory 28 may include at least one program product having a group of (eg, at least one) program modules configured to perform the functions of various embodiments of the present disclosure.

적어도 하나의 프로그램 모듈(42)을 갖는 프로그램 / 유틸리티(40)는 예를 들어 메모리(28)에 저장될 수 있다. 이러한 프로그램 모듈(42)은 운영 체제, 하나 이상의 애플리케이션, 다른 프로그램 모듈 및 프로그램 데이터를 포함하지만, 이에 제한되지 않으며, 이들 예의 각각 또는 일부 조합은 네트워크 환경의 구현을 포함할 수 있다. 프로그램 모듈(42)은 일반적으로 본 개시의 실시례에서 설명된 기능 및 또는 방법을 수행한다.The program/utility 40 having at least one program module 42 may be stored in the memory 28, for example. Such program modules 42 include, but are not limited to, an operating system, one or more applications, other program modules, and program data, and each or some combination of these examples may include an implementation of a network environment. Program module 42 generally performs functions and/or methods described in embodiments of the present disclosure.

컴퓨터 장치(12)는 또한 하나 이상의 외부 장치(14)(예를 들어, 키보드, 포인팅 장치, 디스플레이 등)와 통신할 수 있고, 또한 사용자가 컴퓨터와 상호 작용할 수 있게 하는 하나 이상의 장치와 통신할 수 있다. 컴퓨터 장치(12)가 하나 이상의 다른 컴퓨팅 장치와 통신할 수 있게 하는 장치(12) 및 또는 임의의 장치(예를 들어, 네트워크 카드, 모뎀 등)를 포함할 수 있다. 이 통신은 입 / 출력(I / O) 인터페이스(22)를 통해 수행될 수 있다. 또한, 컴퓨터 장치(12)는 하나 이상의 네트워크(예를 들어, 근거리 통신망(LAN), 광역 통신 장치)와 통신할 수도 있다. 네트워크 어댑터(20)를 통한 네트워크(이하 간단히 WAN) 및 또는 공공 네트워크(예를 들어, 인터넷)를 가질 수 있다. 도 7에 도시된 바와 같이, 네트워크 어댑터(20)는 버스(18)를 통해 컴퓨터 장치(12)의 다른 모듈과 통신한다. Computer device 12 may also communicate with one or more external devices 14 (e.g., keyboards, pointing devices, displays, etc.), and may also communicate with one or more devices that allow a user to interact with the computer. have. It may include device 12 and or any device (eg, network card, modem, etc.) that enables computer device 12 to communicate with one or more other computing devices. This communication can be performed through an input/output (I/O) interface 22. Further, the computer device 12 may communicate with one or more networks (eg, a local area network (LAN), a wide area communication device). It is possible to have a network (hereinafter simply WAN) and a public network (eg, the Internet) through the network adapter 20. As shown in FIG. 7, network adapter 20 communicates with other modules of computer device 12 via bus 18.

처리 유닛(16)은 시스템 메모리(28)에 저장된 프로그램, 예를 들어 본 발명의 실시례에 따른 딥 러닝 기반 영상 인식 방법을 실행함으로써 다양한 종류의 기능적 애플리케이션 및 데이터 처리를 실행한다.The processing unit 16 executes various types of functional applications and data processing by executing a program stored in the system memory 28, for example, a deep learning-based image recognition method according to an embodiment of the present invention.

본 발명의 실시례는 컴퓨터 프로그램이 저장되는 비 일시적 컴퓨터 판독 가능 저장 매체를 추가로 제공한다. 컴퓨터 프로그램은 프로세서에 의해 실행될 때, 본 개시의 실시례에 따른 딥 러닝 기반 영상 인식 방법을 수행할 수 있다.An embodiment of the present invention further provides a non-transitory computer-readable storage medium in which a computer program is stored. When the computer program is executed by a processor, it may perform a deep learning-based image recognition method according to an exemplary embodiment of the present disclosure.

컴퓨터 판독 가능 신호 매체는 기저 대역에서 또는 반송파의 일부로서 전파하는 데이터 신호를 포함할 수 있고, 데이터 신호는 컴퓨터 판독 가능 프로그램 코드를 운반한다. 전파된 데이터 신호는 전자기 신호, 광학 신호 또는 이들의 임의의 적절한 조합을 포함하지만, 이에 제한되지 않는 다양한 형태를 취할 수 있다. 컴퓨터 판독 가능 신호 매체는 또한 컴퓨터 판독 가능 저장 매체 이외의 임의의 컴퓨터 판독 가능 매체 일 수 있으며, 이는 명령 실행 시스템, 장치 또는 장치에 의해 또는 이와 관련하여 사용하기 위한 프로그램을 송신, 전파 또는 송신할 수 있다.Computer-readable signal media may include data signals that propagate in baseband or as part of a carrier wave, the data signals carrying computer-readable program code. The propagated data signal can take a variety of forms including, but not limited to, electromagnetic signals, optical signals, or any suitable combination thereof. The computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which can transmit, propagate, or transmit a program for use by or in connection with an instruction execution system, device, or device. have.

컴퓨터 판독 가능 매체에 구현된 프로그램 코드는 무선, 유선, 광섬유 케이블, RF 등 또는 이들의 임의의 적절한 조합을 포함하지만, 이에 제한되지 않는 임의의 적절한 매체를 사용하여 전송될 수 있다.Program code embodied in a computer-readable medium may be transmitted using any suitable medium including, but not limited to, wireless, wired, fiber optic cable, RF, or the like, or any suitable combination thereof.

본 개시의 실시례에 따른 동작을 수행하기위한 컴퓨터 프로그램 코드는 하나 이상의 프로그래밍 언어, 또는 Java, 스몰 토크, C ++와 같은 객체 지향 프로그래밍 언어를 포함하고 또한 종래의 절차적 프로그래밍을 포함하는 이들의 조합을 사용하여 작성될 수 있다. "C"언어 또는 유사한 프로그래밍 언어와 같은 언어. 프로그램 코드는 사용자 컴퓨터에서 완전히 실행될 수 있거나, 사용자 컴퓨터에서 부분적으로 실행될 수 있거나, 독립형 소프트웨어 패키지로서 실행될 수 있거나, 사용자 컴퓨터와 부분적으로 원격 컴퓨터에서 실행될 수 있거나, 또는 원격 컴퓨터 또는 서버에서 완전히 실행된다. 원격 컴퓨터의 경우, 원격 컴퓨터는 근거리 네트워크(이하, LAN) 또는 광역 네트워크(이하, 짧은 WAN)를 포함하는 임의의 종류의 네트워크를 통해 사용자 컴퓨터에 연결되거나 연결될 수 있다. Computer program code for performing an operation according to an embodiment of the present disclosure includes one or more programming languages, or object-oriented programming languages such as Java, Smalltalk, C++, and also includes conventional procedural programming thereof. It can be written using combinations. Languages such as "C" language or similar programming language. The program code can be fully executed on the user computer, partially executed on the user computer, executed as a standalone software package, executed partially on the user computer and partially on a remote computer, or fully executed on a remote computer or server. In the case of a remote computer, the remote computer may be connected or connected to the user computer through any kind of network including a local area network (hereinafter, referred to as a LAN) or a wide area network (hereinafter referred to as a short WAN).

본 발명의 실시례는 컴퓨터 프로그램 제품을 제공한다. 컴퓨터 프로그램 제품의 명령은 프로세서에 의해 실행될 때, 본 개시의 실시례에 따른 딥 러닝 기반 영상 인식 방법을 실행할 수 있다.An embodiment of the present invention provides a computer program product. When the instructions of the computer program product are executed by the processor, the deep learning-based image recognition method according to the exemplary embodiment of the present disclosure may be executed.

이와 같이 본 발명에 대해서 첨부된 도면을 참조하여 설명하였으나, 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 다양한 수정 및 변형이 이루어질 수 있음은 물론이다. 그러므로, 본 발명의 범위는 설명된 실시례에 한정되어서는 아니되며, 후술하는 특허청구범위뿐만 아니라 이러한 특허청구범위와 균등한 것들에 의해 정해져야 한다.Although the present invention has been described with reference to the accompanying drawings, various modifications and variations can be made without departing from the technical spirit of the present invention. Therefore, the scope of the present invention should not be limited to the described embodiments, and should be defined by the claims and equivalents as well as the claims to be described later.

12 : Computer device 14: 외부장치
16 : 처리 유닛 18 : 버스
20 : 네트워크 어댑터 22 : 입출력 인터페이스
28 : 시스템 메모리 30 : 랜덤 액세스 메모리(RAM)
32 : 캐시 메모리 34 : 저장 시스템
40 : 유틸리티 42 : 프로그램 모듈
51 : 영상 전처리 모듈 52 : 트레이닝 모듈
53 : 모델 스크리닝 모듈 54 : 선택적 제공 모듈
55 : 인식 모듈 56 : 데이터베이스 구축 모듈
57 : 저장 모듈 58 : 메시지 푸싱 모듈
59 : 실시간 모니터링 모듈 510 : 프리젠테이션 모듈12: Computer device 14: External device
16: processing unit 18: bus
20: network adapter 22: input/output interface
28: system memory 30: random access memory (RAM)
32: cache memory 34: storage system
40: utility 42: program module
51: image preprocessing module 52: training module
53: model screening module 54: optional provided module
55: recognition module 56: database building module
57: storage module 58: message pushing module
59: real-time monitoring module 510: presentation module

Claims

In the deep learning-based image recognition method,
Training a deep learning model based on a deep learning framework using the training image data to obtain at least two deep learning models;
Selecting a predetermined number of deep learning models corresponding to the number or less of the acquired deep learning models from the deep learning models obtained in descending order of recognition accuracy for the verified image data; And
Recognizing image data to be recognized using at least one of the selected deep learning models;
Containing, deep learning artificial neural network-based image recognition method.

The method according to claim 1,
The deep learning artificial neural network-based image recognition method further comprising performing image preprocessing on at least one of the training image data, the verified image data, and the image data to be recognized.

The method according to claim 2,
The step of performing the image pre-processing,
The image recognition method based on a deep learning artificial neural network, wherein the image preprocessing includes at least one of random cropping, rotation, flipping, brightness adjustment, and contrast adjustment.

The method of claim 3,
The image recognition method based on a deep learning artificial neural network, further comprising the step of storing the pre-processed image data in a pre-established memory database.

The method according to claim 1,
Recognizing the image data to be recognized,
Providing the selected deep learning model to a user; And
Acquiring a deep learning model selected by a user and recognizing image data to be recognized using the deep learning model selected by the user;
Containing, deep learning artificial neural network-based image recognition method.

The method according to claim 1,
The step of obtaining the at least two deep learning models,
Deep learning artificial neural network based image recognition method comprising the step of pushing the state information on the training process to the user.

The method according to claim 1,
The step of obtaining the at least two deep learning models,
A deep learning artificial neural network based image recognition method comprising the step of drawing a performance curve for a deep learning model being trained using a web application programming interface in real time.

The method according to claim 1,
The step of obtaining the at least two deep learning models,
A deep learning artificial neural network-based image recognition method in which multiple deep learning frameworks are the same or more than one.

In the deep learning-based image recognition device,
Processor; And
A memory storing instructions that, when executed by the processor, cause the processor to acquire at least two deep learning models by training a deep learning model based on a deep learning framework by using the training image data; and ,
The processor,
Select a predetermined number of deep learning models from deep learning models acquired in descending order of recognition accuracy for verified image data, and make the predetermined number less than or equal to the number of acquired deep learning models,
A deep learning artificial neural network based image recognition system for recognizing image data to be recognized using at least one of the selected deep learning models.

The method of claim 9,
The command is:
A deep learning artificial neural network based image recognition system that, when executed by the processor, causes the processor to perform image preprocessing on at least one of training image data, verification image data, and image data to be recognized.

The method of claim 10,
The image pre-processing,
A deep learning artificial neural network based image recognition system comprising at least one of random cropping, rotation, flipping, brightness adjustment, and contrast adjustment.

The method of claim 9,
The above command is:
The deep learning artificial neural network based image recognition system, when executed by the processor, causes the processor to establish a memory database and store the pre-processed image database in a preset memory database.

The method of claim 9,
The command is:
When executed by the processor, to cause the processor to provide the selected deep training model to the user, to obtain the deep learning model selected by the user, and to recognize the image data to be recognized using the deep learning model selected by the user. A deep learning artificial neural network based image recognition system.