KR20210127448A

KR20210127448A - Product classification system and method for filtering similar images using deep learning

Info

Publication number: KR20210127448A
Application number: KR1020200045311A
Authority: KR
Inventors: 손정일
Original assignee: 빅베이스 주식회사
Priority date: 2020-04-14
Filing date: 2020-04-14
Publication date: 2021-10-22
Also published as: KR102395876B1

Abstract

Disclosed are a product classification system which can filter and provide a matching image similar to a target image selected by a user by using a classification model learned through deep learning, and a method thereof. To this end, the product classification system comprises a management server for: inputting a target image provided by a user to an analysis model trained with a convolutional neural network (CNN) algorithm to extract a feature map and a feature vector of the target image; extracting a matching image similar to the target image from an image database in which a plurality of images are stored by using the feature map and the feature vector as search conditions; comparing and analyzing the feature vectors of the target image and the matching image to calculate a vector score for the matching image; and classifying the matching images with vector scores matching image filtering ranking values. Accordingly, accurate classification of a product selected by a user is possible, so that images of other products are not included in a result value. In addition, the product classification system provides manufacturers and sellers with an opportunity to easily check at what price their products are sold in open markets or global markets.

Description

Product classification system and method that can filter similar images using deep learning

본 발명은 사용자에 의해 선택된 타겟 이미지와 유사한 이미지를 분류할 수 있는 상품 분류 시스템 및 방법에 관한 것으로, 보다 상세하게는 딥러닝을 통해 학습된 분류모델을 이용하여 사용자에 의해 선택된 타겟 이미지와 유사한 매칭 이미지를 필터링하여 제공할 수 있는 상품 분류 시스템 및 방법에 관한 것이다. The present invention relates to a product classification system and method capable of classifying images similar to a target image selected by a user, and more particularly, matching similar to a target image selected by a user using a classification model learned through deep learning It relates to a product classification system and method that can filter and provide images.

정보통신 기술 및 반도체 기술의 발전으로 각종 전자장치들이 다양한 멀티미디어 서비스를 제공하는 멀티미디어 장치로 발전하고 있다. 예를 들어, 전자장치는 메신저 서비스, 방송 서비스, 무선 인터넷 서비스, 카메라 서비스 및 음악 재생 서비스와 같은 다양한 멀티미디어 서비스를 제공하고 있다.With the development of information and communication technology and semiconductor technology, various electronic devices are developing into multimedia devices that provide various multimedia services. For example, electronic devices provide various multimedia services such as a messenger service, a broadcasting service, a wireless Internet service, a camera service, and a music reproduction service.

또한, 전자장치는 이미지를 분류하고, 검색할 수 있는 기능을 제공하고 있다. 이러한 전자장치는 기 설정된 분류 기준을 이용하여, 사용자의 이미지들을 소정의 그룹으로 분류할 수 있으나, 일관된 분류 기준을 이용함으로써, 사용자에게 최적화된 분류 결과를 제공할 수 없다는 문제점이 있다.In addition, the electronic device provides a function for classifying and searching images. Such an electronic device may classify the user's images into a predetermined group by using a preset classification criterion, but there is a problem in that an optimized classification result cannot be provided to the user by using a consistent classification criterion.

아울러, 최근 영상 처리 분야에도 인공지능 시스템이 도입되고 있다.In addition, artificial intelligence systems have recently been introduced into the image processing field.

상기 인공지능(Artificial Intelligence, AI) 시스템은 인간 수준의 지능을 구현하는 컴퓨터 시스템이며, 기존 Rule 기반 스마트 시스템과 달리 기계가 스스로 학습하고 판단하며 똑똑해지는 시스템이다. 상기 인공지능 시스템은 사용할수록 인식률이 향상되고 사용자 취향을 보다 정확하게 이해할 수 있게 되어, 기존 Rule 기반 스마트 시스템은 점차 딥러닝 기반 인공지능 시스템으로 대체되고 있다.The artificial intelligence (AI) system is a computer system that implements human-level intelligence, and unlike the existing rule-based smart system, the machine learns, judges, and becomes smarter by itself. As the artificial intelligence system is used, the recognition rate is improved and the user's taste can be understood more accurately, and the existing rule-based smart system is gradually being replaced by a deep learning-based artificial intelligence system.

특히, 인공지능 기술은 기계학습(딥러닝) 및 기계학습을 활용한 요소 기술들로 구성된다.In particular, artificial intelligence technology consists of machine learning (deep learning) and element technologies using machine learning.

이러한 기계학습은 입력 데이터들의 특징을 스스로 분류/학습하는 알고리즘 기술이며, 요소기술은 딥러닝 등의 기계학습 알고리즘을 활용하여 인간 두뇌의 인지, 판단 등의 기능을 모사하는 기술로서, 언어적 이해, 시각적 이해, 추론/예측, 지식 표현, 동작 제어 등의 기술 분야로 구성된다. Such machine learning is an algorithm technology that categorizes/learns the characteristics of input data by itself, and element technology uses machine learning algorithms such as deep learning to simulate functions such as cognition and judgment of the human brain. It consists of technical fields such as visual understanding, reasoning/prediction, knowledge expression, and motion control.

상기 언어적 이해는 인간의 언어/문자를 인식하고 응용/처리하는 기술로서, 자연어 처리, 기계 번역, 대화시스템, 질의응답, 음성 인식/합성 등을 포함한다. The linguistic understanding is a technology for recognizing and applying/processing human language/text, and includes natural language processing, machine translation, dialogue system, question answering, voice recognition/synthesis, and the like.

상기 시각적 이해는 사물을 인간의 시각처럼 인식하여 처리하는 기술로서, 객체 인식, 객체 추적, 영상 검색, 사람 인식, 장면이해, 공간 이해, 영상 개선 등을 포함한다. The visual understanding is a technology for recognizing and processing objects like human vision, and includes object recognition, object tracking, image search, person recognition, scene understanding, spatial understanding, image improvement, and the like.

상기 추론 예측은 정보를 판단하여 논리적으로 추론하고 예측하는 기술로서, 지식/확률 기반 추론, 최적화 예측, 선호 기반 계획, 추천 등을 포함한다. 지식 표현은 인간의 경험정보를 지식데이터로 자동화 처리하는 기술로서, 지식 구축(데이터 생성/분류), 지식 관리(데이터 활용) 등을 포함한다. The inference prediction is a technique for logically reasoning and predicting by judging information, and includes knowledge/probability-based reasoning, optimization prediction, preference-based planning, recommendation, and the like. Knowledge expression is a technology that automatically processes human experience information into knowledge data, and includes knowledge construction (data generation/classification) and knowledge management (data utilization).

상기 동작 제어는 차량의 자율 주행, 로봇의 움직임을 제어하는 기술로서, 움직임 제어(항법, 충돌, 주행), 조작 제어(행동 제어) 등을 포함한다.The motion control is a technology for controlling autonomous driving of a vehicle and motion of a robot, and includes motion control (navigation, collision, driving), manipulation control (action control), and the like.

최근에는 온라인을 통한 상품의 구매가 증가되고 있으며, 사용자 개인화된 취향을 분석하는 기술이 발전하는 중이고, 제조사나 판매자는 자신의 상품이 오픈마켓이나 글로벌 마켓에서 어떤 가격에 어떻게 팔리는지를 찾기 원하기 때문에 제조사나 판매자를 위한 인공지능 기술의 필요성이 요구되고 있는 실정이다.Recently, online product purchases are increasing, and technology to analyze user's personalized tastes is developing. The need for artificial intelligence technology for manufacturers and sellers is being demanded.

이에 따라, 온라인 쇼핑몰에서 상품을 비교 검색할 때 사용하는 새로운 방법으로 이미지의 특징 벡터를 기반으로 검색하는 내용 기반 이미지 검색(content-based retrieval) 기술이 개발되고 있는 실정이다. Accordingly, a content-based image retrieval technique for searching based on a feature vector of an image as a new method used when comparing and retrieving products in an online shopping mall is being developed.

그러나, 이 방법은 데이터의 용량이 커지고 전처리 과정을 위한 비용도 크기 때문에 대용량 이미지 검색에 한계가 있으며, 기술적인 한계로 인해 만족할 만한 검색결과를 얻지 못하고 있다. 특히 이미지가 준비되지 않은 경우에 정확한 검색을 수행하기가 어렵다는 문제가 있었다.However, this method has limitations in large-capacity image search because the data capacity is large and the cost for the pre-processing process is large, and due to technical limitations, satisfactory search results are not obtained. In particular, there was a problem in that it was difficult to perform an accurate search when the image was not prepared.

또한, 제조사나 판매자의 주요 니즈를 충족하기 위해 유사도 값으로 랭킹을 지정한 후 유사도 값이 높은 순서로 정렬하여 결과값을 출력하는 경우, 유사도가 99%에 해당하더라도 실제 상품으로 비교하면 전혀 다른 상품(다른 브랜드, 가품)인 경우가 발생되는 문제가 있었다.In addition, in order to satisfy the main needs of manufacturers or sellers, if the ranking is specified by the similarity value and the results are sorted in the order of the highest similarity value, even if the similarity is 99%, it is a completely different product ( There was a problem in the case of other brands or fakes).

대한민국 공개특허 제10-2020-0002332호(2020.01.08 공고)Republic of Korea Patent Publication No. 10-2020-0002332 (announced on Aug. 8, 2020) 대한민국 공개특허 제10-2019-0098801호(2019.08.23 공개)Republic of Korea Patent Publication No. 10-2019-0098801 (published on August 23, 2019) 대한민국 공개특허 제10-2019-0052028호(2019.05.15 공개)Republic of Korea Patent Publication No. 10-2019-0052028 (published on May 15, 2019) 대한민국 공개특허 제10-2020-0010672호(2020.01.31 공개)Republic of Korea Patent Publication No. 10-2020-0010672 (published on January 31, 2020)

따라서, 본 발명의 제1 목적은 이미지의 CNN(Convolutional Neural Network) 분석 결과의 특성 값과 벡터 값에 대한 임계 범위를 지정하여 해당 범위에 들어올 경우에만 검출하여 분류 정확도를 개선시킬 수 있는 상품 분류 시스템을 제공하는데 있다.Therefore, a first object of the present invention is a product classification system that can improve classification accuracy by designating a threshold range for a characteristic value and a vector value of a CNN (Convolutional Neural Network) analysis result of an image and detecting only when it falls within the corresponding range. is to provide

또한, 본 발명의 제2 목적은 이미지 분석용 딥러닝 알고리즘으로 특징 맵과 특징 벡터를 추출하고, 상기 특징 맵과 특징 벡터를 기반으로 매칭 이미지를 추출하며, 매칭 이미지의 특징 벡터의 집합과 지도 학습 결과를 비교하여 만들어진 일치 영역에 대한 벡터 점수를 필터로 사용하여 동일유사 이미지 검출에 사용하는 상품 분류 방법을 제공하는데 있다.In addition, a second object of the present invention is to extract a feature map and a feature vector with a deep learning algorithm for image analysis, extract a matching image based on the feature map and the feature vector, and set the feature vector of the matching image and the supervised learning result The goal is to provide a product classification method that is used to detect identical images by using the vector score for the matching area created by comparing .

상술한 본 발명의 제1 목적을 달성하기 위하여, 본 발명의 일 실시예에서는 사용자로부터 제공된 타겟 이미지를 CNN(Convolutional Neural Network) 계열 알고리즘으로 학습된 분석 모델에 입력하여 상기 타겟 이미지의 특징 맵과 특징 벡터를 추출하고, 상기 특징 맵과 특징 벡터를 검색 조건으로 하여 복수의 이미지가 저장된 이미지 데이터베이스로부터 상기 타겟 이미지와 유사한 매칭 이미지를 추출하며, 상기 타겟 이미지와 매칭 이미지의 특징 벡터를 비교분석하여 매칭 이미지의 벡터 점수를 산출하고, 이미지 필터링 랭킹 값에 부합하는 벡터 점수를 갖는 매칭 이미지를 분류하는 관리서버를 포함하는 상품 분류 시스템을 제공한다.In order to achieve the first object of the present invention described above, in an embodiment of the present invention, a target image provided by a user is input to an analysis model learned by a CNN (Convolutional Neural Network)-based algorithm, and a feature map and features of the target image are input. extracting a vector, extracting a matching image similar to the target image from an image database in which a plurality of images are stored using the feature map and the feature vector as a search condition, and comparing and analyzing the feature vector of the target image and the matching image to obtain a matching image It provides a product classification system including a management server for calculating a vector score of and classifying a matching image having a vector score matching the image filtering ranking value.

또한, 본 발명의 제2 목적을 달성하기 위하여, 본 발명의 일 실시예에서는 관리서버가 사용자 단말기로부터 타겟 이미지를 입력받아 상기 타겟 이미지의 특징 맵과 특징 벡터를 추출하는 특징 벡터 추출단계와, 상기 관리서버가 상기 타겟 이미지의 특징 맵 및 특징 벡터에 매칭된 특징 맵 및 특징 벡터를 갖는 매칭 이미지를 기존 이미지들로부터 추출하는 매칭 이미지 검색단계와, 상기 관리서버가 상기 타겟 이미지의 특징 맵 및 특징 벡터와 매칭 이미지의 특징 맵과 특징 벡터를 비교분석하여 매칭 이미지의 벡터 점수를 산출하는 벡터 점수 산출단계, 및 상기 관리서버가 상기 매칭 이미지의 벡터 점수가 이미지 필터링 랭킹 값에 부합하는지를 확인하고, 상기 이미지 필터링 랭킹 값에 부합하는 매칭 이미지를 타겟 이미지의 동일유사 이미지로 분류하는 이미지 분류단계를 포함하는 상품 분류 방법을 제공한다.In addition, in order to achieve the second object of the present invention, in one embodiment of the present invention, the management server receives a target image from a user terminal and extracts a feature map and a feature vector of the target image, the feature vector extraction step, A matching image search step in which the management server extracts a matching image having a feature map and a feature vector matched to the feature map and feature vector of the target image from existing images, and the management server performs the feature map and feature vector of the target image A vector score calculation step of calculating a vector score of the matching image by comparing and analyzing the feature map and the feature vector of the matching image, and the management server checks whether the vector score of the matching image matches the image filtering ranking value, and the image It provides a product classification method including an image classification step of classifying a matching image matching the filtering ranking value as the same-similar image of the target image.

본 발명에 의하면, 사용자에 의해 선택된 상품에 대한 정확한 분류가 가능하여 다른 상품의 이미지가 결과 값에 포함되지 않는다. According to the present invention, accurate classification of the product selected by the user is possible, so that images of other products are not included in the result value.

또한, 본 발명은 제조사나 판매자에게 자신의 상품이 오픈 마켓이나 글로벌 마켓에서 어떤 가격에 팔리는지 쉽게 확인할 수 있는 기회를 제공한다.In addition, the present invention provides a manufacturer or seller with an opportunity to easily check at what price their product is sold in an open market or a global market.

아울러, 본 발명은 목적 이미지와 동일하거나 유사한 이미지를 검색하되 다른 브랜드의 이미지와 가품의 이미지가 제외된 검색 결과를 제공해 줄 수 있다.In addition, the present invention can provide a search result that searches for the same or similar image as the target image, but excludes images of other brands and images of fakes.

게다가, 본 발명은 정품의 사진과 유통 중인 상품의 사진을 비교분석하여 유통 중인 상품을 정품과 가품으로 분류할 수 있으며, 제조사나 판매자가 자신의 상품과 유사한 디자인을 갖는 상품의 존재여부를 쉽게 확인할 수 있다.In addition, the present invention can classify a product in circulation into a genuine product and a counterfeit product by comparing and analyzing the photograph of the genuine product and the photograph of the product in circulation, and the manufacturer or the seller can easily check whether there is a product having a design similar to their own product. can

도 1은 본 발명에 따른 상품 분류 시스템을 설명하기 위한 구성도이다.
도 2는 본 발명에 따른 상품 분류 시스템을 설명하기 위한 블록도이다.
도 3은 본 발명에 따른 분석엔진의 동작을 설명하기 위한 흐름도이다.
도 4는 본 발명에 따른 벡터 점수의 산출 및 이미지 필터링 동작의 개념을 설명하기 위한 개략도이다.
도 5는 본 발명에 따른 상품 분류 방법을 설명하기 위한 순서도이다. 1 is a configuration diagram for explaining a product classification system according to the present invention.
2 is a block diagram illustrating a product classification system according to the present invention.
3 is a flowchart for explaining the operation of the analysis engine according to the present invention.
4 is a schematic diagram for explaining the concept of a vector score calculation and image filtering operation according to the present invention.
5 is a flowchart illustrating a product classification method according to the present invention.

이하, 첨부도면을 참조하여 본 발명의 바람직한 실시예들에 의한 딥러닝을 이용하여 유사한 이미지를 필터링할 수 있는 상품 분류 시스템(이하, '딥러닝 기반 상품 분류 시스템'이라 약칭함)을 상세하게 설명한다. Hereinafter, a product classification system (hereinafter, abbreviated as 'deep learning-based product classification system') that can filter similar images using deep learning according to preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. do.

도 1은 본 발명에 따른 상품 분류 시스템을 설명하기 위한 구성도이며, 도 2는 본 발명에 따른 상품 분류 시스템을 설명하기 위한 블록도이다.1 is a block diagram illustrating a product classification system according to the present invention, and FIG. 2 is a block diagram illustrating a product classification system according to the present invention.

도 1을 참조하면, 본 발명에 따른 딥러닝 기반 상품 분류 시스템은 CNN(Convolutional Neural Network) 계열 알고리즘으로 학습된 분석 모델로 타겟 이미지의 특징 맵(feature map)과 특징 벡터(feature vector)를 추출하고, 상기 특징 맵과 특징 벡터로 매칭 이미지를 추출하며, 상기 타겟 이미지와 매칭 이미지의 특징 벡터를 비교분석하여 동일하거나 유사한 매칭 이미지를 분류하는 관리서버(100)를 포함하며, 선택적으로 관리서버(100)에 타겟 이미지를 제공하는 사용자 단말기(미도시)를 더 포함할 수 있다.1, the deep learning-based product classification system according to the present invention extracts a feature map and a feature vector of a target image with an analysis model learned by a CNN (Convolutional Neural Network)-based algorithm, and , extracting a matching image using the feature map and the feature vector, and comparing and analyzing the feature vector of the target image and the matching image to categorize the same or similar matching image, and optionally the management server 100 ) may further include a user terminal (not shown) that provides the target image.

상기 사용자 단말기는 사용자로부터 타겟 이미지나 영상을 입력받고, 상기 타겟 이미지나 영상을 관리서버(100)로 전송하는 것으로, 관리서버(100)와 유무선 통신네트워크(이하, '통신네트워크'라 약칭함)를 통해 연결된다.The user terminal receives a target image or image from a user, and transmits the target image or image to the management server 100, and the management server 100 and a wired/wireless communication network (hereinafter abbreviated as 'communication network') connected through

일 실시 양태로서, 본 발명에 따른 사용자 단말기는 사용자에 의해 선택된 타겟 이미지나 영상의 파일 형식을 변환시키지 않고 그대로 관리서버(100)에 전송할 수 있다. As an embodiment, the user terminal according to the present invention may transmit the file format of the target image or video selected by the user to the management server 100 as it is without converting the file format.

다른 실시 양태로서, 본 발명에 따른 사용자 단말기는 사용자에 의해 선택된 타겟 이미지나 영상의 파일 형식을 미리 설정된 파일 형식으로 포맷한 다음 관리서버(100)에 전송할 수 있다. 이때, 사용자 단말기는 관리서버(100)로 전송하는 타겟 이미지나 영상의 파일 용량이 줄어들도록 MPG, MP4 등 저용량의 파일 형식으로 포맷을 진행할 수 있다.As another embodiment, the user terminal according to the present invention may format the file format of the target image or video selected by the user into a preset file format and then transmit it to the management server 100 . In this case, the user terminal may format the target image or video to be transmitted to the management server 100 in a low-capacity file format such as MPG or MP4 to reduce the file size.

이와 같이, 사용자 단말기는 사용자로부터 입력된 타겟 이미지나 영상의 포맷을 미리 지정된 포맷으로 변환시키고 관리서버(100)로 전송할 수 있다. In this way, the user terminal may convert the format of the target image or video input from the user into a predetermined format and transmit it to the management server 100 .

필요에 따라, 사용자 단말기에는 관리서버(100)의 접속을 도와주며, 타겟 이미지나 영상의 파일 형식을 변환시켜줄 수 있도록 상품분류 어플리케이션이나 상품분류 프로그램이 설치될 수 있다.If necessary, a product classification application or a product classification program may be installed in the user terminal to help access the management server 100 and to convert a file format of a target image or video.

구체적으로, 상기 상품분류 어플리케이션은 사용자의 요청에 따라 통신네트워크를 통해 관리서버(100)에 접속하고, 타겟 이미지나 영상을 관리서버(100)로 전송한다.Specifically, the product classification application connects to the management server 100 through a communication network according to a user's request, and transmits a target image or video to the management server 100 .

또한, 상품분류 어플리케이션은 통신네트워크를 통해 관리서버(100)부터 동일유사 이미지의 리스트가 전송되면, 상기 동일유사 이미지의 리스트를 사용자 단말기의 화면에 출력한다. In addition, when a list of identical images is transmitted from the management server 100 through the communication network, the product classification application outputs the list of identical images to the screen of the user terminal.

아울러, 상품분류 어플리케이션은 안드로이드 마켓이나 앱스토어 또는 웹 페이지를 통해 다운로드 받아 사용자 단말기에 설치될 수 있다.In addition, the product classification application may be downloaded through an Android market, an app store, or a web page and installed in the user terminal.

이러한 사용자 단말기는 다수의 기지국과 기지국 제어기, 이동 통신 교환기 및 왑(WAP) 게이트웨이 등을 포함하는 무선 통신망과 인터넷 등의 유선 통신망을 통해 딥러닝 기반 상품 분류 서비스를 제공하는 관리서버(100)에 타겟 이미지를 전송할 수 있는 수단을 의미하며, 무선 단말기와 유선 단말기를 사용할 수 있다.This user terminal is targeted to the management server 100 that provides a deep learning-based product classification service through a wired communication network such as the Internet and a wireless communication network including a number of base stations and base station controllers, mobile communication exchanges and WAP gateways, etc. It means a means for transmitting an image, and a wireless terminal and a wired terminal can be used.

이때, 무선 단말기로는 스마트폰, 태블릿 PC, 노트북 컴퓨터 등이 사용될 수 있으며, 유선 단말기로는 인터넷 접속 수단을 갖는 데스크탑 컴퓨터, 스마트 TV, 유선전화 등이 사용될 수 있다. In this case, a smart phone, a tablet PC, a notebook computer, etc. may be used as a wireless terminal, and a desktop computer having an Internet connection means, a smart TV, a wire phone, etc. may be used as a wired terminal.

한편, 상기 관리서버(100)는 딥러닝을 이용해 사용자로부터 제공된 타겟 이미지와 유사한 이미지를 분류하는 것으로, 타겟 이미지와 유사하거나 동일한 상품의 리스트를 타겟 이미지를 제공한 사용자에 전송한다. Meanwhile, the management server 100 categorizes images similar to the target image provided by the user using deep learning, and transmits a list of products similar to or identical to the target image to the user providing the target image.

이를 위해, 관리서버(100)는 타겟 이미지를 CNN 계열 알고리즘으로 학습된 분석 모델에 입력하여 상기 타겟 이미지의 특징 맵과 특징 벡터를 추출하고, 상기 특징 맵과 특징 벡터를 검색 조건으로 하여 복수의 이미지가 저장된 이미지 데이터베이스(122)로부터 상기 타겟 이미지와 유사한 매칭 이미지를 추출하며, 상기 타겟 이미지와 매칭 이미지의 특징 벡터를 비교분석하여 매칭 이미지의 벡터 점수를 산출한다. 여기서, 벡터 점수는 0~1 사이의 수의 행렬로 구성될 수 있고, 행렬의 크기는 이미지 분석 알고리즘에 따라 결정될 수 있으며, 이미지 분석 알고리즘의 특징 벡터 값의 행렬의 크기가 일치한다. To this end, the management server 100 inputs a target image to an analysis model learned by a CNN-based algorithm, extracts a feature map and a feature vector of the target image, and uses the feature map and the feature vector as a search condition for a plurality of images. extracts a matching image similar to the target image from the stored image database 122, and compares and analyzes feature vectors of the target image and the matching image to calculate a vector score of the matching image. Here, the vector score may consist of a matrix of numbers between 0 and 1, and the size of the matrix may be determined according to an image analysis algorithm, and the size of the matrix of the feature vector values of the image analysis algorithm matches.

그리고 관리서버(100)는 이미지 필터링 랭킹 값에 부합하는 벡터 점수를 갖는 매칭 이미지를 분류한다. 여기서, 상기 이미지 필터링 랭킹 값은 지도학습결과 정확히 일치하는 이미지와 불일치 이미지의 벡터 점수를 비교 분석하여 가장 작은 오차를 보인 벡터 점수를 순환 신경망(Recurrent Neural Network)으로 학습된 분석 모델에 입력하여 산출한 평균 값을 의미한다. And the management server 100 classifies the matching image having a vector score that matches the image filtering ranking value. Here, the image filtering ranking value is calculated by comparing and analyzing the vector scores of the images that exactly match and the inconsistent images as a result of supervised learning, and input the vector score showing the smallest error into the analysis model trained with the Recurrent Neural Network. mean the average value.

이러한 이미지 필터링 랭킹 값은 최대 값이 1 미만이기 때문에 매칭 이미지의 벡터 점수가 1에 근접한다고 하더라도 동일유사 이미지의 분류에서 제외될 수 있다. 다시 말해, 본 발명은 벡터 점수가 최상위인 매칭 이미지라 하더라도 이미지 필터링 기준을 벗어나면 타겟 이미지와 비유사 이미지로 분류할 수 있다. Since the maximum value of this image filtering ranking value is less than 1, even if the vector score of the matching image is close to 1, it may be excluded from the classification of the same-similar image. In other words, according to the present invention, even a matching image having the highest vector score can be classified into a target image and a dissimilar image if the image filtering criterion is exceeded.

또한, CNN 계열 알고리즘은 FRCNN(Fast Region-based Convolutional Neural Network), 합성곱신경망(CNN), RCNN(Region-based Convolutional Neural Network), SPPnet(Spatial Pyramid Pooling in Deep Convolutional Networks) 중 어느 하나이다. In addition, the CNN-based algorithm is any one of FRCNN (Fast Region-based Convolutional Neural Network), Convolutional Neural Network (CNN), RCNN (Region-based Convolutional Neural Network), and SPPnet (Spatial Pyramid Pooling in Deep Convolutional Networks).

이하, 도면을 참조하여 관리서버의 구성요소별로 보다 구체적으로 설명한다.Hereinafter, each component of the management server will be described in more detail with reference to the drawings.

도 1 및 도 2를 참조하면, 본 발명에 따른 관리서버(100)는 상품 분류 엔진(110)을 포함한다. 1 and 2 , the management server 100 according to the present invention includes a product classification engine 110 .

상기 상품 분류 엔진(110)은 이미지 전처리부(111)와 영상 변환부(112) 및 분석엔진(113)이 포함된 것으로, 선택적으로 상품옵션 입력부(114), 단위 이미지 추출부(115), 또는 이들 모두를 더 포함할 수 있다.The product classification engine 110 includes an image preprocessing unit 111, an image conversion unit 112, and an analysis engine 113, and optionally a product option input unit 114, a unit image extraction unit 115, or All of these may be further included.

상기 이미지 전처리부(111)는 사용자로부터 제공된 타겟 이미지나 영상의 이미지 전처리(image preprocessing)를 수행하는 것으로, 이미지 전처리가 수행된 타겟 이미지를 영상 변환부(112)나 분석엔진(113)으로 제공한다. The image preprocessing unit 111 performs image preprocessing of the target image or image provided by the user, and provides the image preprocessing target image to the image conversion unit 112 or the analysis engine 113 . .

이때, 이미지 전처리(image preprocessing)로는 포맷 변환이나 노이즈 제거 등이 사용될 수 있다. 이러한 포맷 변환은 분석엔진(113)에서 이미지 분석을 쉽게 진행하도록 하는 이미지 전처리로, RAW 포맷 등이 사용될 수 있다. In this case, format conversion or noise removal may be used as image preprocessing. This format conversion is an image pre-processing that facilitates image analysis in the analysis engine 113, and a RAW format may be used.

이러한 이미지 전처리부(111)는 사용자로부터 제공된 영상이 동영상인 경우, 상기 동영상을 프레임별로 분리하고 상기 프레임에 포함된 특징 이미지 영역을 추출하여 타겟 이미지를 생성한다.When the image provided by the user is a moving image, the image preprocessor 111 separates the moving image for each frame and extracts a feature image region included in the frame to generate a target image.

상기 영상 변환부(112)는 이미지 전처리부(111)를 통과한 타겟 이미지를 미리 지정된 해상도로 변환(up scale 또는 down scale)하는 것으로, 해상도가 변환된 타겟 이미지를 분석엔진(113)으로 제공한다. 예를 들면, 영상 변환부(112)는 타겟 이미지의 해상도를 640*480이나 240*400 등으로 변환시킬 수 있다.The image conversion unit 112 converts the target image that has passed through the image preprocessor 111 to a predetermined resolution (up scale or down scale), and provides the resolution-converted target image to the analysis engine 113 . . For example, the image converter 112 may convert the resolution of the target image to 640*480, 240*400, or the like.

상기 상품옵션 입력부(114)는 타겟 이미지와 함께 상품 검색에 사용될 상품검색옵션을 사용자로부터 입력받아 분석엔진(113)부로 제공하는 것으로, 사용자 단말기나 관리서버(100)의 입력장치로부터 상품검색옵션이 입력되면 이를 타겟 이미지와 연계시켜 분석엔진(113)으로 제공한다. 여기서, 상품검색옵션은 상품의 제품명, 판매자명, 제조사명, 판매자 연락처, 가격, 크기, 색상, 제질, 무개, 부피로 이루어진 군으로부터 선택된 어느 하나 이상이다. The product option input unit 114 receives a product search option to be used for product search together with a target image from a user and provides it to the analysis engine 113 unit. When inputted, it is provided to the analysis engine 113 in association with the target image. Here, the product search option is any one or more selected from the group consisting of product name, seller name, manufacturer name, seller contact information, price, size, color, quality, color, and volume of the product.

이러한 상품옵션 입력부(114)는 상품검색옵션에 포함된 불용어 제거 등의 정규화 과정을 통해 상품검색 키워드를 선별하고, 상기 상품검색 키워드를 함께 입력된 타겟 이미지와 연계시켜 분석엔진(113)으로 제공할 수 있다.The product option input unit 114 selects product search keywords through a normalization process such as removing stopwords included in the product search option, and links the product search keyword with the input target image to provide it to the analysis engine 113. can

상기 단위 이미지 추출부(115)는 상기 영상 변환부(112)로부터 제공된 타겟 이미지에서 분석 대상의 특징이 포함된 이미지 영역만을 추출하여 단위 이미지를 생성하는 것으로, 상기 단위 이미지를 분석엔진(113)으로 제공한다.The unit image extraction unit 115 generates a unit image by extracting only the image region including the characteristics of the analysis target from the target image provided from the image conversion unit 112 , and converts the unit image to the analysis engine 113 . to provide.

상기 분석엔진(113)은 이미지 전처리부(111) 또는 영상 변환부(112)로부터 제공된 타겟 이미지나 단위 이미지 추출부(115)로부터 제공된 단위 이미지를 CNN 알고리즘으로 학습된 분석 모델에 입력하여 특징 맵과 특징 벡터를 추출하고, 상기 특징 맵과 특징 벡터를 검색 조건으로 하여 복수의 이미지가 저장된 이미지 데이터베이스(122)로부터 타겟 이미지와 유사한 매칭 이미지를 추출하며, 타겟 이미지와 매칭 이미지의 특징 벡터를 비교분석하여 매칭 이미지의 벡터 점수를 산출하고, 이미지 필터링 랭킹 값에 부합하는 벡터 점수를 갖는 매칭 이미지를 분류하는 기능을 제공한다.The analysis engine 113 inputs the target image provided from the image pre-processing unit 111 or the image conversion unit 112 or the unit image provided from the unit image extraction unit 115 into the analysis model learned by the CNN algorithm to generate a feature map and Extracting a feature vector, extracting a matching image similar to the target image from the image database 122 in which a plurality of images are stored using the feature map and the feature vector as a search condition, and comparing and analyzing the feature vector of the target image and the matching image It provides a function of calculating a vector score of a matching image and classifying a matching image having a vector score matching the image filtering ranking value.

상기 분석엔진(113)은 상품검색옵션이 연계된 타겟 이미지가 수신되는 경우, 상품검색옵션과 특징 맵 및 특징 벡터를 검색 조건으로 하여 복수의 이미지가 저장된 이미지 데이터베이스(122)로부터 상기 타겟 이미지와 유사한 매칭 이미지를 추출할 수 있다.When a target image to which the product search option is linked is received, the analysis engine 113 uses the product search option, the feature map, and the feature vector as a search condition from the image database 122 in which a plurality of images are stored similar to the target image. Matching images can be extracted.

도 3은 본 발명에 따른 분석엔진의 동작을 설명하기 위한 흐름도이며, 도 4는 본 발명에 따른 벡터 점수의 산출 및 이미지 필터링 동작의 개념을 설명하기 위한 개략도이다.3 is a flowchart for explaining the operation of the analysis engine according to the present invention, and FIG. 4 is a schematic diagram for explaining the concept of the vector score calculation and image filtering operation according to the present invention.

특정 양태로서, 본 발명에 따른 분석엔진(113)은 도 3에 도시된 바와 같이 CNN 분석기 및 벡터점수 추출부를 포함하여 구성될 수 있다.As a specific aspect, the analysis engine 113 according to the present invention may be configured to include a CNN analyzer and a vector score extractor as shown in FIG. 3 .

상기 CNN 분석기는 타겟 이미지를 CNN 계열 알고리즘으로 학습된 분석 모델에 입력하여 상기 타겟 이미지의 특징 맵과 특징 벡터를 추출한다.The CNN analyzer extracts a feature map and a feature vector of the target image by inputting the target image to an analysis model trained with a CNN-based algorithm.

상기 벡터점수 추출부는 타겟 이미지의 특징 맵과 특징 벡터를 검색 조건으로 하여 복수의 이미지가 저장된 이미지 데이터베이스(DB)로부터 상기 타겟 이미지와 유사한 매칭 이미지를 추출하며, 매칭 이미지의 특징 벡터를 타겟 이미지의 특징 벡터와 비교분석하여 매칭 이미지의 벡터 점수를 산출하고, 이미지 필터링 랭킹 값에 부합하는 벡터 점수를 갖는 매칭 이미지를 동일유사 이미지로 분류한다.The vector score extractor extracts a matching image similar to the target image from an image database (DB) in which a plurality of images are stored using a feature map and a feature vector of the target image as search conditions, and sets the feature vector of the matching image as the feature of the target image The vector score of the matching image is calculated by comparison with the vector, and the matching image having a vector score matching the image filtering ranking value is classified as the same-similar image.

여기서, 특징 맵 및 특징 벡터는 이미지의 유사여부를 판단하는 변수이며, 벡터 점수는 특징 맵과 특징 벡터가 유사한 패턴일 경우에 이미지 필터링의 통과 여부를 결정하는 변수이다. Here, the feature map and the feature vector are variables that determine whether images are similar, and the vector score is a variable that determines whether image filtering is passed when the feature map and the feature vector are similar patterns.

또한, 상기 이미지 필터링 랭킹 값은 상품의 종류에 따라 변화되기 때문에 사용자에 의해 타겟 이미지가 선택되면, 타겟 이미지의 종류에 따라 초기 값이 결정되며, 분석엔진(113)의 자기 학습을 통해 변화된다. In addition, since the image filtering ranking value changes according to the type of product, when a target image is selected by the user, the initial value is determined according to the type of the target image, and is changed through self-learning of the analysis engine 113 .

필요에 따라, 벡터점수 추출부는 특징 맵과 특징 벡터의 이미지 매칭 결과에 따른 가중치를 특징 벡터에 적용하여 매칭 이미지의 벡터 점수를 산출할 수 있다.If necessary, the vector score extractor may calculate a vector score of the matching image by applying a weight according to the image matching result of the feature map and the feature vector to the feature vector.

도 1 및 도 2를 참조하면, 본 발명에 따른 관리서버(100)는 빅데이터 스토리지(120)를 더 포함할 수 있다.1 and 2 , the management server 100 according to the present invention may further include a big data storage 120 .

상기 빅데이터 스토리지(120)는 빅데이터 엔진(121)과 이미지 DB(122)가 포함된 것으로, 선택적으로는 문서 DB(123), HDFS(Hadoop Distributed File System, 124)를 더 포함할 수 있다.The big data storage 120 includes a big data engine 121 and an image DB 122 , and may optionally further include a document DB 123 and a Hadoop Distributed File System (HDFS) 124 .

상기 빅데이터 엔진(121)은 각 DB에 접속하여 질의 결과를 제공하는 통합 스토리지 미들웨어로, 분석엔진(113)의 요청에 따라 이미지 데이터베이스(122)를 검색하여 매칭 이미지를 추출한 다음 분석엔진(113)으로 제공한다.The big data engine 121 is an integrated storage middleware that accesses each DB to provide query results, searches the image database 122 according to the request of the analysis engine 113, extracts matching images, and then the analysis engine 113. provided as

또한, 빅데이터 엔진(121)은 분석엔진(113)으로부터 타겟 이미지와 함께 상기 타겟 이미지의 특징 맵과 특징 벡터 및 특징 ID가 제공되면, 이를 이미지 데이터베이스(122)에 저장한다. In addition, when the big data engine 121 provides a feature map, a feature vector, and a feature ID of the target image together with the target image from the analysis engine 113 , it is stored in the image database 122 .

상기 이미지 DB(122)는 전통적인 RDBMS, DBMS, File DB등의 sql 문으로 질의 가능한 데이터베이스와 파일 기반 저장소를 의미하며, 문서 DB(123)는 비정형 문서형 데이터의 저장공간을 의미한다. The image DB 122 means a database and file-based storage that can be queried with sql statements such as traditional RDBMS, DBMS, and File DB, and the document DB 123 means a storage space for unstructured document data.

상기 HDFS(124)는 하둡파일 시스템으로, 분산 클러스터 파일의 저장소이며, 주로 FRCNN 분석 결과와 학습데이터, 테스트 데이터, 이미지 색인 등의 인공지능 분석 데이터가 저장된다.The HDFS 124 is a Hadoop file system, which is a storage of distributed cluster files, and mainly stores FRCNN analysis results, training data, test data, and artificial intelligence analysis data such as image indexes.

도 5는 본 발명에 따른 상품 분류 방법을 설명하기 위한 순서도이다. 5 is a flowchart illustrating a product classification method according to the present invention.

도 5를 참조하면, 본 발명에 따른 상품 분류 방법은 사용자 단말기로부터 타겟 이미지를 입력받아 특징 맵과 특징 벡터를 추출하는 특징 벡터 추출단계(S100)와, 상기 타겟 이미지의 특징 맵 및 특징 벡터에 매칭된 특징 맵 및 특징 벡터를 갖는 매칭 이미지를 기존 이미지들로부터 추출하는 매칭 이미지 검색단계(S200)와, 상기 타겟 이미지와 매칭 이미지의 특징 벡터를 비교분석하여 매칭 이미지의 벡터 점수를 산출하는 벡터 점수 산출단계(S300), 및 상기 매칭 이미지의 벡터 점수를 이미지 필터링 랭킹 값으로 필터링 하여 필터링에 통과한 매칭 이미지를 동일유사 이미지로 분류하는 이미지 분류단계(S400)를 포함한다.Referring to FIG. 5 , the product classification method according to the present invention includes a feature vector extraction step ( S100 ) of extracting a feature map and a feature vector by receiving a target image from a user terminal, and matching the feature map and feature vector of the target image Matching image search step (S200) of extracting a matching image having the specified feature map and feature vector from existing images, and calculating a vector score of calculating a vector score of the matching image by comparing and analyzing the feature vectors of the target image and the matching image It includes a step (S300), and an image classification step (S400) of filtering the vector score of the matching image with an image filtering ranking value and classifying the matching image that has passed the filtering as the same-similar image.

본 발명에 따른 상품 분류 방법은 선택적으로 상기 특징 벡터 추출단계(S100) 이전에 상품 이미지 수집단계, 분석모델 학습단계, 전처리 단계가 더 포함될 수 있다.The product classification method according to the present invention may optionally further include a product image collection step, an analysis model learning step, and a pre-processing step before the feature vector extraction step ( S100 ).

상기 상품 이미지 수집단계는 관리서버(100)가 통신네트워크를 통해 접속 가능한 웹 사이트에서 상품의 이미지를 크롤링을 통해 수집하는 단계로, 파이썬 언어 기반으로 작성된 Scrapy 라이브러리를 사용하여 아마존, 알리익스프레스, 쿠팡, 11번가 등과 같은 대표적인 인터넷 쇼핑몰 사이트에서 상품들을 대상으로 크롤링을 통해 상품 이미지를 수집한 후 이미지 데이터베이스(122)에 저장한다.The product image collection step is a step in which the management server 100 collects product images from a web site accessible through a communication network through crawling. After collecting product images through crawling of products in a representative Internet shopping mall site such as 11st, it is stored in the image database 122 .

상기 분석모델 학습단계는 상품 이미지 수집단계를 통해 수집된 복수의 상품 이미지를 각각 분석 모델에 입력하여 분석 모델을 학습시키는 단계이다. 이 경우, 분석 모델은 딥 러닝 기법 중 하나인 CNN(Convolutional Neural Network) 계열 알고리즘에 기반한 분석 모델이다. 즉, 상품 이미지 각각을 CNN 계열 알고리즘에 입력하면, 파라미터의 조절 등을 통해 CNN의 각 레이어층에서 형태, 색깔, 패턴 등 상품 이미지의 자질이 추출될 수 있다. The analysis model learning step is a step of learning the analysis model by inputting a plurality of product images collected through the product image collection step into the analysis model, respectively. In this case, the analysis model is an analysis model based on a Convolutional Neural Network (CNN) series algorithm, which is one of the deep learning techniques. That is, if each product image is input to a CNN-based algorithm, the qualities of the product image such as shape, color, and pattern can be extracted from each layer of CNN through parameter adjustment.

필요에 따라, 분석 모델은 CNN 계열 알고리즘을 여러 층으로 적층하여 구성될 수 있다. 이러한 신경망 구조는 GoogleNet, ResNet과 같이 학습이 완료되어 공개된 구조를 사용하거나, TensorFlow, Torch, Keras와 같은 인공 신경망 라이브러리를 이용하여 구현된다. If necessary, the analysis model can be constructed by stacking CNN-based algorithms in several layers. Such a neural network structure is implemented by using an open structure that has been trained, such as GoogleNet and ResNet, or artificial neural network libraries such as TensorFlow, Torch, and Keras.

일 실시 양태로서, 본 발명에 따른 전처리 단계에서는 사용자 단말기가 사용자로부터 입력된 타겟 이미지의 포맷을 미리 지정된 포맷으로 변환시키고 관리서버(100)로 전송할 수 있다.As an embodiment, in the pre-processing step according to the present invention, the user terminal may convert the format of the target image input from the user into a predetermined format and transmit it to the management server 100 .

다른 실시 양태로서, 본 발명에 따른 전처리 단계에서는 관리서버(100)가 사용자 단말기로부터 입력된 타겟 이미지의 포맷을 미리 지정된 포맷으로 변환시킬 수 있다.As another embodiment, in the pre-processing step according to the present invention, the management server 100 may convert the format of the target image input from the user terminal into a predetermined format.

상기 특징 벡터 추출단계(S100)에서는 관리서버(100)가 사용자 단말기로부터 타겟 이미지를 입력받아 상기 타겟 이미지의 특징 맵과 특징 벡터를 추출한다. 이때, 관리서버(100)는 추출된 타겟 이미지의 특징 맵과 특징 벡터를 빅데이터 스토리지(120)의 이미지 데이터베이스(122)에 저장할 수 있다.In the feature vector extraction step (S100), the management server 100 receives a target image from the user terminal and extracts a feature map and a feature vector of the target image. In this case, the management server 100 may store the feature map and the feature vector of the extracted target image in the image database 122 of the big data storage 120 .

상기 특징 벡터 추출단계(S100)에서는 관리서버(100)가 사용자 단말기로부터 타겟 이미지를 직접 입력받을 수도 있으며, 사용자 단말기로부터 동영상을 입력받은 후 상기 동영상으로부터 타겟 이미지를 추출할 수도 있다.In the feature vector extraction step (S100), the management server 100 may directly receive the target image from the user terminal, or may extract the target image from the video after receiving the video from the user terminal.

상기 특징 벡터 추출단계(S100)는 관리서버(100)가 사용자 단말기로부터 타겟 이미지를 입력받아 저장하는 이미지 입력과정과, 상기 타겟 이미지를 미리 지정된 해상도로 변환시키는 영상 분석과정과, 해상도가 변환된 타겟 이미지를 기반으로 특징 이미지 영역이 포함된 단위 이미지를 생성하는 단위 이미지 생성과정, 및 상기 단위 이미지의 특징 맵과 특징 벡터를 추출하는 특징 벡터 추출과정을 포함할 수 있다.The feature vector extracting step (S100) includes an image input process in which the management server 100 receives and stores a target image from a user terminal, an image analysis process in which the target image is converted to a predetermined resolution, and a target whose resolution is converted. It may include a unit image generation process of generating a unit image including a feature image region based on the image, and a feature vector extraction process of extracting a feature map and a feature vector of the unit image.

상기 매칭 이미지 검색단계(S200)에서는 상기 관리서버(100)가 상기 타겟 이미지의 특징 맵 및 특징 벡터에 매칭된 특징 맵 및 특징 벡터를 갖는 매칭 이미지를 이미지 데이터베이스(122)에 저장된 기존 이미지들로부터 추출한다. 이때, 기존 이미지들은 자신의 특징 맵과 특징 벡터가 산출되어 연계된 상태로 이미지 데이터베이스(122)에 저장될 수 있다. In the matching image search step (S200), the management server 100 extracts a matching image having a feature map and a feature vector matched to the feature map and feature vector of the target image from existing images stored in the image database 122 do. In this case, the existing images may be stored in the image database 122 in a state in which their feature maps and feature vectors are calculated and linked.

상기 특징 벡터 추출단계(S100)에서는 관리서버(100)가 사용자 단말기로부터 상품검색옵션을 입력받고, 상기 매칭 이미지 검색단계에서는 관리서버(100)가 상기 타겟 이미지의 상품검색옵션과 특징 맵 및 특징 벡터에 매칭된 상품검색옵션과 특징 맵 및 특징 벡터를 갖는 매칭 이미지를 기존 이미지들로부터 추출할 수 있다.In the feature vector extraction step (S100), the management server 100 receives a product search option from the user terminal, and in the matching image search step, the management server 100 provides a product search option, a feature map, and a feature vector of the target image. A matching image having a product search option, a feature map, and a feature vector matched to can be extracted from existing images.

상기 벡터 점수 산출단계(S300)에서는 상기 관리서버(100)가 상기 타겟 이미지의 특징 벡터와 매칭 이미지의 특징 벡터를 비교분석하여 매칭 이미지의 벡터 점수를 산출한다. In the vector score calculation step (S300), the management server 100 compares and analyzes the feature vector of the target image and the feature vector of the matching image to calculate a vector score of the matching image.

상기 벡터 점수 산출단계(S300)에서는 상품검색옵셥에 따라 타겟 이미지의 특징 벡터의 행렬과 매칭 이미지의 특징 벡터의 행렬에 미리 설정된 가중치를 부여하고, 벡터 점수를 산출할 수 있다. In the vector score calculation step ( S300 ), preset weights may be given to the matrix of feature vectors of the target image and the matrix of feature vectors of the matching image according to the product search option, and a vector score may be calculated.

상기 이미지 분류단계(S400)에서는 관리서버(100)가 매칭 이미지의 벡터 점수가 이미지 필터링 랭킹 값에 부합하는지를 확인하고, 상기 이미지 필터링 랭킹 값에 부합하는 매칭 이미지를 타겟 이미지의 동일유사 이미지로 분류한다. 이때, 관리서버(100)는 동일유사 이미지로 분류된 매칭 이미지를 별도로 관리하여 동일유사 이미지 리스트를 생성하고, 상기 동일유사 이미지 리스트를 타겟 이미지를 제공한 사용자 단말기로 전송할 수 있다.In the image classification step (S400), the management server 100 checks whether the vector score of the matching image matches the image filtering ranking value, and classifies the matching image matching the image filtering ranking value as the same-similar image of the target image. . In this case, the management server 100 may separately manage the matching images classified as identical and similar images to generate the identical similar image list, and transmit the identical similar image list to the user terminal providing the target image.

이상에서 본 발명의 바람직한 실시예를 참조하여 설명하였지만, 해당 기술분야의 숙련된 당업자는 하기의 특허청구범위에 기재된 본 발명의 사상 및 영역으로부터 벗어나지 않는 범위 내에서 본 발명을 다양하게 수정 및 변경시킬 수 있음을 이해할 수 있을 것이다.Although described above with reference to the preferred embodiments of the present invention, those skilled in the art can variously modify and change the present invention within the scope without departing from the spirit and scope of the present invention described in the claims below. You will understand that you can.

100 : 관리서버 110 : 상품 분류 엔진
111 : 이미지 전처리부 112 : 영상 변환부
113 : 분석엔진 114 : 상품옵션 입력부
115 : 단위 이미지 추출부 120 : 빅데이터 스토리지
121 : 빅데이터 엔진 122 : 이미지 데이터베이스
123 : 문서 데이터베이스 124 : HDFS 100: management server 110: product classification engine
111: image pre-processing unit 112: image conversion unit
113: analysis engine 114: product option input unit
115: unit image extraction unit 120: big data storage
121: big data engine 122: image database
123: document database 124: HDFS

Claims

A target image provided from a user is input to an analysis model trained with a CNN (Convolutional Neural Network)-based algorithm, a feature map and a feature vector of the target image are extracted, and a plurality of images are obtained by using the feature map and the feature vector as a search condition. A matching image similar to the target image is extracted from the stored image database, a vector score of the matching image is calculated by comparing and analyzing feature vectors of the target image and the matching image, and a matching image having a vector score matching the image filtering ranking value. A product classification system including a management server to classify.

The method of claim 1, wherein the CNN-based algorithm is
Product classification system, characterized in that any one of FRCNN (Fast Region-based Convolutional Neural Network), CNN, RCNN (Region-based Convolutional Neural Network), and SPPnet (Spatial Pyramid Pooling in Deep Convolutional Networks).

The method of claim 1, wherein the image filtering ranking value is
Product classification system, characterized in that it is the average value calculated by inputting the value showing the smallest error by comparing and analyzing the vector scores of the matching image matching the target image and the inconsistent matching image with the target image as a result of supervised learning into an analysis model trained with a recurrent neural network .

The method of claim 1, wherein the management server
an image preprocessing unit that performs image preprocessing of a target image provided by a user;
an image conversion unit converting the target image on which the image pre-processing has been performed into a predetermined resolution;
The resolution-converted target image is input to the analysis model trained with a CNN-based algorithm to extract a feature map and a feature vector of the target image, and a plurality of images are stored from an image database using the feature map and feature vector as a search condition. Analysis of extracting a matching image similar to the target image, comparing and analyzing feature vectors of the target image and the matching image to calculate a vector score of the matching image, and classifying matching images having a vector score matching the image filtering ranking value A product classification system, characterized in that it comprises an engine.

The method of claim 4, wherein the management server
Further comprising a product option input unit for receiving a product search option to be used for product search together with the target image and providing it to the analysis engine,
and the analysis engine extracts a matching image similar to the target image from an image database in which a plurality of images are stored using the product search option, the feature map, and the feature vector as search conditions.

The method of claim 5, wherein the product search option is
Product classification system, characterized in that at least one selected from the group consisting of product name, seller name, manufacturer name, seller contact information, price, size, color, quality, color, and volume of the product.

5. The method of claim 4, wherein the image preprocessor
When the video provided by the user is a video, the video is divided into frames, and a target image is generated by extracting a feature image area included in the frame.

5. The method of claim 4, wherein the analysis engine is
a CNN analyzer for extracting a feature map and a feature vector of the target image by inputting the target image to an analysis model trained with a CNN algorithm; and
Using the feature map and feature vector as a search condition, a matching image similar to the target image is extracted from an image database in which a plurality of images are stored, and a weight according to the image matching result of the feature map and feature vector is applied to the feature vector to obtain a matching image Product classification system, characterized in that it is composed of a vector score extractor that calculates a vector score of and classifies a matching image having a vector score that matches the image filtering ranking value.

a feature vector extraction step in which the management server receives a target image from a user terminal and extracts a feature map and a feature vector of the target image;
a matching image search step in which the management server extracts a matching image having a feature map and a feature vector matched to the feature map and feature vector of the target image from existing images;
a vector score calculation step in which the management server compares and analyzes the feature map and feature vector of the target image and the feature map and feature vector of the matching image to calculate a vector score of the matching image; and
Product classification method comprising an image classification step of the management server confirming whether the vector score of the matching image matches the image filtering ranking value, and classifying the matching image matching the image filtering ranking value as the same-similar image of the target image .

10. The method of claim 9, wherein before the feature vector extraction step
and a pre-processing step of converting, by the user terminal, the format of the target image input from the user into a predetermined format and transmitting the format to a management server.

10. The method of claim 9, wherein before the feature vector extraction step
Product classification method, characterized in that the management server further comprises a pre-processing step of converting the format of the target image input from the user terminal into a predetermined format.

10. The method of claim 9, wherein the feature vector extraction step
An image input process in which the management server receives and stores the target image from the user terminal;
An image analysis process of converting the target image to a predetermined resolution;
A unit image generation process of generating a unit image including a feature image area based on the resolution-converted target image, and
and a feature vector extraction process of extracting a feature map and a feature vector of the unit image.

10. The method of claim 9,
In the feature vector extraction step, the management server receives a product search option input from the user terminal, and in the matching image search step, the management server performs a product search option and a feature map of the target image and a product search option and feature map matched to the feature vector. and extracting a matching image having a feature vector from existing images.

14. The method of claim 13,
The vector score calculation step applies a preset weight to the matrix of the feature vector of the target image and the matrix of the feature vector of the matching image according to the product search option, and calculating the vector score.