KR101910825B1

KR101910825B1 - Method, apparatus, system and computer program for providing aimage retrieval model

Info

Publication number: KR101910825B1
Application number: KR1020160170161A
Authority: KR
Inventors: 이영구; 안 투 구엔
Original assignee: 경희대학교 산학협력단
Priority date: 2016-12-14
Filing date: 2016-12-14
Publication date: 2018-10-24
Also published as: KR20180068455A

Abstract

본 발명은 서비스 서버에서, 이미지 검색을 수행하는 방법에 대한 것으로, 쿼리 이미지를 수신하면, 상기 쿼리 이미지에서 제 1 레벨 코드 북을 적용하여 제 1 디스크립터 벡터를 생성하고, 데이터베이스 이미지와 비교하여 상기 쿼리 이미지와 유사한 후보 이미지를 적어도 하나 이상 추출하는 A 단계; 상기 쿼리 이미지에서 상기 제 1 레벨 코드북보다 고차원의 제 2 레벨 코드북을 적용하여 제 2 디스크립터 벡터를 추출하고, 상기 제 2 디스크립터 벡터에 주제 영역과 배경 영역을 구별하기 위한 토픽 모델을 적용하여 상기 쿼리 이미지의 주제 영역을 추정하는 B 단계; 상기 후보 이미지에 상기 토픽 모델을 적용하여 상기 후보 이미지의 주제 영역을 추정하는 C 단계; 및 상기 쿼리 이미지의 주제 영역과 상기 후보 이미지의 주제 영역의 유사도를 계산하고, 상기 유사도에 따라 상기 후보 이미지의 유사도 랭킹을 설정하는 D 단계를 포함하는 것을 특징으로 한다. A method for performing an image search in a service server, the method comprising: generating a first descriptor vector by applying a first level codebook in the query image to a database image, An A step of extracting at least one candidate image similar to the image; Extracting a second descriptor vector by applying a second level codebook having a higher dimension than the first level codebook in the query image and applying a topic model for distinguishing a subject area and a background area to the second descriptor vector, A step B of estimating a subject area of the subject; C) estimating a subject area of the candidate image by applying the topic model to the candidate image; And a step D of calculating a degree of similarity between a subject area of the query image and a subject area of the candidate image and setting a similarity ranking of the candidate image according to the degree of similarity.

Description

METHOD, APPARATUS, SYSTEM AND COMPUTER PROGRAM FOR PROVIDING AIMAGE RETRIEVAL MODEL < RTI ID = 0.0 >

본 발명은 이미지 검색 방법에 대한 것이다. 보다 구체적으로 본 발명은 대형 이미지 데이터베이스에서 효율적으로 동작할 수 있는 이미지 검색 방법을 제공하는 방법 및 장치에 대한 것이다. The present invention relates to an image retrieval method. More particularly, the present invention relates to a method and apparatus for providing an image retrieval method capable of efficiently operating in a large image database.

디지털 촬상 장치의 보급 확대와 네트워크 환경 발달은 영상 데이터의 급속한 증가로 이어지고 있으며, 각종 모바일 기기를 통한 영상 정보의 검색이 증가하고 있다. The expansion of digital imaging devices and the development of network environment are leading to a rapid increase in image data, and the search for image information through various mobile devices is increasing.

이미지 데이터베이스의 크기가 급격히 증가하는 이와 같은 상황에서 더 편리하고 정확한 검색 방법에 대한 수요가 증가하고 있다. 이미지 검색에는 두개의 프레임워크가 존재하는데, 하나는 텍스트 기반의 이미지 검색이며 다른 하나는 컨텐츠 기반의 이미지 검색에 해당한다. In this situation, where the size of the image database is rapidly increasing, the demand for a more convenient and accurate search method is increasing. There are two frameworks for image retrieval, one for text-based image retrieval and the other for content-based image retrieval.

텍스트 기반의 이미지 검색은 데이터베이스의 모든 이미지는 키워드로 라벨링되며, 라벨 기반으로 이미지를 검색하게 된다. 그러나 이는 라벨링 과정에서 사람의 주관이 개입되게 되어 수학적인 결과를 도출할 수 없는 문제가 있다. Text-based image retrieval is performed by labeling all images in the database as keywords and searching for images on a label basis. However, there is a problem that the subject of the human being is intervened in the labeling process and the mathematical result can not be derived.

한편, 컨텐츠 기반의 이미지 검색은 이미지들 사이의 시각적 유사성을 이용하여 검색을 수행하게 된다. 특히 컨텐츠 기반의 정보 검색은 사용자가 키워드를 모를 경우, 사용자의 편의성을 만족시킬 수 있어 그 효과가 크다. On the other hand, content-based image retrieval performs retrieval using visual similarity between images. In particular, content-based information retrieval can satisfy the user's convenience when the user does not know the keyword, which is effective.

그런데 키워드 등 별도의 텍스트 도움 없이 컨텐츠 기반으로 이미지를 검색하는 방법은 쿼리 이미지를 하나의 벡터로 인코딩하고 이를 인덱싱하는 방식으로 일반적으로 진행된다. 이러한 방식은 초기 결과 추출을 위해서는 효율적이다. However, the method of retrieving images based on content without separate text help such as keywords is generally performed by encoding the query image into one vector and indexing the same. This approach is efficient for extracting initial results.

그러나 이미지를 하나의 벡터로 인코딩하는 종래 검색 방식을 따르면, 오브젝트의 위치, 크기, 방향 등에 대한 기하학적 정보가 부족하기 때문에 오브젝트에 대한 폐쇄, 시점 변화, 변형 등의 변경에 매우 민감하게 반응하여, 검색 결과에 대한 정확도가 떨어지는 문제가 있다. However, according to the conventional search method of encoding an image into one vector, since geometric information about the position, size, and orientation of the object is insufficient, it is very sensitive to changes in closure, viewpoint change, There is a problem that the accuracy of the result is low.

본 발명은 대형 이미지 데이터베이스에서 효율적으로 동작할 수 있는 이미지 검색 방법을 제공하는 것을 목적으로 한다. An object of the present invention is to provide an image retrieval method capable of efficiently operating in a large image database.

보다 구체적으로 본 발명은 비주얼 검색 기술에서 VT (Vocabulary Tree)를 2 레벨로 적용하고, 제 1 레벨에서는 쿼리 이미지와 유사한 데이터베이스 이미지의 그룹 (초기 검색 리스트)을 추출하며, 이미지의 통계적 특성을 반영한 제 2 레벨에서는 토픽 모델을 적용하여 쿼리 이미지의 유사도를 기준으로, 상기 그룹에 속하는 이미지들을 유사도 랭킹을 계산하는 방법을 제공하는 것을 목적으로 한다. More specifically, the present invention applies VT (Vocabulary Tree) at two levels in a visual search technology, extracts a group of database images (initial search list) similar to the query image at the first level, And a method for calculating the similarity rankings of images belonging to the group based on the similarity of the query image by applying the topic model at the second level.

본 발명의 실시예를 따르는 서비스 서버에서, 이미지 검색을 수행하는 방법은, 쿼리 이미지를 수신하면, 상기 쿼리 이미지에서 제 1 레벨 코드 북을 적용하여 제 1 디스크립터 벡터를 생성하고, 데이터베이스 이미지와 비교하여 상기 쿼리 이미지와 유사한 후보 이미지를 적어도 하나 이상 추출하는 A 단계; 상기 쿼리 이미지에서 상기 제 1 레벨 코드북보다 고차원의 제 2 레벨 코드북을 적용하여 제 2 디스크립터 벡터를 추출하고, 상기 제 2 디스크립터 벡터에 주제 영역과 배경 영역을 구별하기 위한 토픽 모델을 적용하여 상기 쿼리 이미지의 주제 영역을 추정하는 B 단계; 상기 후보 이미지에 상기 토픽 모델을 적용하여 상기 후보 이미지의 주제 영역을 추정하는 C 단계; 및 상기 쿼리 이미지의 주제 영역과 상기 후보 이미지의 주제 영역의 유사도를 계산하고, 상기 유사도에 따라 상기 후보 이미지의 유사도 랭킹을 설정하는 D 단계를 포함하는 것을 특징으로 한다. A method for performing image search in a service server according to an embodiment of the present invention includes receiving a query image, generating a first descriptor vector by applying a first level codebook in the query image, comparing the first descriptor vector with a database image An A step of extracting at least one candidate image similar to the query image; Extracting a second descriptor vector by applying a second level codebook having a higher dimension than the first level codebook in the query image and applying a topic model for distinguishing a subject area and a background area to the second descriptor vector, A step B of estimating a subject area of the subject; C) estimating a subject area of the candidate image by applying the topic model to the candidate image; And a step D of calculating a degree of similarity between a subject area of the query image and a subject area of the candidate image and setting a similarity ranking of the candidate image according to the degree of similarity.

나아가 복수의 이미지를 수집하고, 수집된 이미지에 상기 제 1 레벨 코드 북을 적용하여 제 1 디스크립터 벡터를 생성하는 단계; 상기 수집된 이미지에 상기 제 2 레벨 코드북을 적용하여 제 2 디스크립터 벡터를 생성하는 단계; 상기 제 1 디스크립터 벡터 및/또는 상기 제 2 디스크립터 벡터를 인덱스 값으로 해당 이미지에 인덱싱하는 단계를 포함할 수 있다. 나아가 상기 C 단계는, 상기 후보 이미지의 상기 제 2 디스크립터 벡터에 상기 토픽 모델을 적용하여 상기 후보 이미지의 주제 영역을 추정하는 단계를 포함할 수 있다. 나아가 상기 D단계는, 상기 후보 이미지의 주제 영역에 대한 디스크립터에 가중치를 적용하여 상기 후보 이미지와 상기 쿼리 이미지의 유사도를 추정하는 단계를 포함할 수 있다. Further comprising: collecting a plurality of images and applying the first level codebook to the collected images to generate a first descriptor vector; Generating a second descriptor vector by applying the second level codebook to the collected image; And indexing the first descriptor vector and / or the second descriptor vector as an index value into the corresponding image. Further, the step C may include estimating a subject area of the candidate image by applying the topic model to the second descriptor vector of the candidate image. Further, the step D may include estimating the similarity between the candidate image and the query image by applying a weight to the descriptor of the subject area of the candidate image.

한편, 본 발명의 실시예를 따르는서비스 서버에서, 이미지 검색을 수행하는 방법은, 복수의 이미지를 수집하고, 수집된 이미지에서 제 1 레벨 코드 북을 적용하여 제 1 디스크립터를 추출하고, 상기 제 1 디스크립터에 대한 벡터를 인덱스 값으로 해당 이미지에 인덱싱한 데이터베이스를 구축하는 단계; 쿼리 이미지를 수신하면, 상기 쿼리 이미지에서 상기 제 1 레벨 코드 북을 적용하여 제 1 디스크립터 벡터를 생성하고, 상기 벡터와 상기 데이터베이스 이미지의 인덱스 값을 비교하여 상기 쿼리 이미지와 유사한 후보 이미지를 적어도 하나 이상 추출하는 단계; 상기 쿼리 이미지에서 상기 제 1 레벨 코드북보다 고차원의 제 2 레벨 코드북을 적용하여 제 2 디스크립터를 추출하고, 상기 제 2 디스크립터에 주제 영역과 배경 영역을 구별하기 위한 토픽 모델을 적용하여 상기 쿼리 이미지의 주제 영역을 추정하는 단계; 및 상기 후보 이미지에 상기 제 2 레벨 코드북을 적용하여 추출한 제 2 디스크립터에 상기 토픽 모델을 적용하여 상기 후보 이미지의 주제 영역을 추정하는 단계를 포함하는 것을 특징으로 한다. Meanwhile, in a service server according to an embodiment of the present invention, a method of performing an image search includes collecting a plurality of images, extracting a first descriptor by applying a first level codebook to the collected images, Constructing a database in which a vector for the descriptor is indexed to the image as an index value; Generating a first descriptor vector by applying the first level codebook in the query image and comparing the vector with an index value of the database image to generate a candidate image similar to the query image, Extracting; Extracting a second descriptor by applying a second level codebook having a higher dimension than the first level codebook in the query image and applying a topic model for distinguishing a subject area and a background area to the second descriptor, Estimating a region; And estimating a subject area of the candidate image by applying the topic model to a second descriptor extracted by applying the second level codebook to the candidate image.

이때 상기 쿼리 이미지의 주제 영역과 상기 후보 이미지의 주제 영역의 유사도를 계산하고, 상기 유사도에 따라 상기 후보 이미지의 유사도 랭킹을 설정하는 단계를 더 포함할 수 있다. The method may further include calculating a degree of similarity between a subject area of the query image and a subject area of the candidate image and setting a similarity ranking of the candidate image according to the degree of similarity.

한편 본 발명의 실시예를 따르는 이미지 검색을 수행하는 장치는, 적어도 하나 이상의 이미지를 포함하는 데이터베이스; 쿼리 이미지를 수신하는 통신부; 상기 쿼리 이미지에서 제 1 레벨 코드 북을 적용하여 제 1 디스크립터 벡터를 생성하고, 상기 제 1 레벨 코드북보다 고차원의 제 2 레벨 코드북을 적용하여 제 2 디스크립터 벡터를 추출하는 디스크립터 추출부; 상기 쿼리 이미지의 상기 제 2 디스크립터 벡터에 주제 영역과 배경 영역을 구별하기 위한 토픽 모델을 적용하여 상기 쿼리 이미지의 주제 영역을 추정하고, 상기 후보 이미지에 상기 토픽 모델을 적용하여 상기 후보 이미지의 주제 영역을 추정하는 주제 영역 추정부; 및 상기 제 1 디스크립터 벡터와 상기 데이터베이스와 비교하여 상기 쿼리 이미지와 유사한 후보 이미지를 적어도 하나 이상 추출하고, 상기 쿼리 이미지의 주제 영역과 상기 후보 이미지의 주제 영역의 유사도를 계산하고, 상기 유사도에 따라 상기 후보 이미지의 유사도 랭킹을 설정하는 유사도 계산부를 포함하는 것을 특징으로 한다. According to another aspect of the present invention, there is provided an apparatus for performing image search, including: a database including at least one image; A communication unit for receiving a query image; A descriptor extracting unit for generating a first descriptor vector by applying a first level codebook in the query image and extracting a second descriptor vector by applying a second level codebook having a higher dimension than the first level codebook; Estimating a subject area of the query image by applying a topic model for distinguishing a subject area and a background area to the second descriptor vector of the query image, applying the topic model to the candidate image, A subject area estimator for estimating a subject area; And extracting at least one candidate image that is similar to the query image in comparison with the first descriptor vector and the database, calculating a similarity between a subject area of the query image and a subject area of the candidate image, And a degree-of-similarity calculation unit for setting a degree-of-similarity ranking of the candidate image.

이때 상기 디스크립터 추출부는, 상기 데이터베이스에 포함된 이미지에 상기 제 1 레벨 코드 북을 적용하여 제 1 디스크립터 벡터를 생성하고, 상기 제 2 레벨 코드북을 적용하여 제 2 디스크립터 벡터를 생성할 수 있으며, 상기 제 1 디스크립터 벡터 및/또는 상기 제 2 디스크립터 벡터를 인덱스 값으로 해당 이미지에 인덱싱하는 인덱스부를 포함할 수 있다.Here, the descriptor extracting unit may generate the first descriptor vector by applying the first level codebook to the image included in the database, generate the second descriptor vector by applying the second level codebook, And an index unit for indexing the first descriptor vector and / or the second descriptor vector into the corresponding image using the index value.

한편 본 발명의 실시예를 따르는 이미지 검색을 수행하는 장치는, 복수의 이미지에서 제 1 레벨 코드 북을 적용하여 제 1 디스크립터를 추출하고, 상기 제 1 디스크립터에 대한 벡터를 인덱스 값으로 해당 이미지에 인덱싱한 이미지 데이터베이스; 쿼리 이미지를 수신하는 통신부; 및 상기 쿼리 이미지에서 상기 제 1 레벨 코드 북을 적용하여 제 1 디스크립터 벡터를 생성하고, 상기 벡터와 상기 데이터베이스 이미지의 인덱스 값을 비교하여 상기 쿼리 이미지와 유사한 후보 이미지를 적어도 하나 이상 추출하고, 상기 쿼리 이미지에서 상기 제 1 레벨 코드북보다 고차원의 제 2 레벨 코드북을 적용하여 제 2 디스크립터를 추출하고, 상기 제 2 디스크립터에 주제 영역과 배경 영역을 구별하기 위한 토픽 모델을 적용하여 상기 쿼리 이미지의 주제 영역을 추정하며, 상기 후보 이미지에 상기 제 2 레벨 코드북을 적용하여 추출한 제 2 디스크립터에 상기 토픽 모델을 적용하여 상기 후보 이미지의 주제 영역을 추정하는 제어부를 포함할 수 있다. Meanwhile, an apparatus for performing image search according to an embodiment of the present invention extracts a first descriptor by applying a first level code book in a plurality of images, and indexes the vector for the first descriptor as an index value into the corresponding image An image database; A communication unit for receiving a query image; And generating a first descriptor vector by applying the first level codebook in the query image, extracting at least one candidate image similar to the query image by comparing the vector with an index value of the database image, Extracting a second descriptor by applying a second level codebook having a higher dimension than the first level codebook in the image and applying a topic model for distinguishing the subject area and the background area to the second descriptor, And a controller for estimating a subject area of the candidate image by applying the topic model to a second descriptor extracted by applying the second level codebook to the candidate image.

나아가 상기 제어부는, 상기 쿼리 이미지의 주제 영역과 상기 후보 이미지의 주제 영역의 유사도를 계산하고, 상기 유사도에 따라 상기 후보 이미지의 유사도 랭킹을 설정하는 것을 특징으로 한다. Further, the control unit may calculate the similarity between the subject area of the query image and the subject area of the candidate image, and set the similarity ranking of the candidate image according to the similarity.

한편, 본 발명의 실시예를 따르는 서비스 서버에서, 이미지 검색을 수행하기 위하여 컴퓨터 판독가능한 저장 매체에 기록된 컴퓨터 프로그램은, 복수의 이미지를 수집하는 기능; 상기 수집한 이미지에 대한 데이터베이스를 구축하는 기능; 쿼리 이미지를 수신하면, 상기 쿼리 이미지에서 상기 제 1 레벨 코드 북을 적용하여 제 1 디스크립터를 추출하고, 상기 제 1 레벨 디스크립터에 대한 벡터를 생성하여, 상기 벡터와 상기 데이터베이스 이미지의 인덱스 값을 비교하여 상기 쿼리 이미지와 유사한 후보 이미지를 적어도 하나 이상 추출하는 기능; 상기 쿼리 이미지에서 상기 제 1 레벨 코드북보다 고차원의 제 2 레벨 코드북을 적용하여 제 2 디스크립터를 추출하고, 상기 제 2 디스크립터에 주제 영역과 배경 영역을 구별하기 위한 토픽 모델을 적용하여 상기 쿼리 이미지의 주제 영역을 추정하는 기능; 상기 후보 이미지의 상기 제 2 디스크립터에 상기 토픽 모델을 적용하여 상기 후보 이미지의 주제 영역을 추정하는 기능; 상기 쿼리 이미지의 주제 영역과 상기 후보 이미지의 주제 영역의 유사도를 계산하고, 상기 유사도에 따라 상기 후보 이미지의 유사도 랭킹을 설정하는 기능을 수행하는 것을 특징으로 한다. Meanwhile, in a service server according to an embodiment of the present invention, a computer program recorded on a computer-readable storage medium for performing image search includes a function of collecting a plurality of images; A function of building a database of the collected images; Upon receipt of the query image, the first level code book is extracted from the query image to extract a first descriptor, a vector for the first level descriptor is generated, and the vector and the index value of the database image are compared A function of extracting at least one candidate image similar to the query image; Extracting a second descriptor by applying a second level codebook having a higher dimension than the first level codebook in the query image and applying a topic model for distinguishing a subject area and a background area to the second descriptor, A function of estimating a region; Estimating a subject area of the candidate image by applying the topic model to the second descriptor of the candidate image; A degree of similarity between a subject area of the query image and a subject area of the candidate image is calculated and a similarity ranking of the candidate image is set according to the degree of similarity.

나아가 본 발명의 실시예를 따르는 서비스 서버에서, 이미지 검색을 수행하기 위하여 컴퓨터 판독가능한 저장 매체에 기록된 컴퓨터 프로그램은, 이미지 데이터베이스를 생성하는 기능; 쿼리 이미지를 수신하면, 상기 쿼리 이미지에서 제 1 레벨 코드 북을 적용하여 제 1 디스크립터 벡터를 생성하고, 상기 데이터베이스 이미지와 비교하여 상기 쿼리 이미지와 유사한 후보 이미지를 적어도 하나 이상 추출하는 기능; 상기 쿼리 이미지에서 상기 제 1 레벨 코드북보다 고차원의 제 2 레벨 코드북을 적용하여 제 2 디스크립터 벡터를 추출하고, 상기 제 2 디스크립터 벡터에 주제 영역과 배경 영역을 구별하기 위한 토픽 모델을 적용하여 상기 쿼리 이미지의 주제 영역을 추정하는 기능; 상기 후보 이미지에 상기 토픽 모델을 적용하여 상기 후보 이미지의 주제 영역을 추정하는 기능; 및 상기 쿼리 이미지의 주제 영역과 상기 후보 이미지의 주제 영역의 유사도를 계산하고, 상기 유사도에 따라 상기 후보 이미지의 유사도 랭킹을 설정하는 기능을 수행하는 것을 특징으로 한다. Furthermore, in a service server according to an embodiment of the present invention, a computer program recorded on a computer-readable storage medium for performing an image search includes a function of generating an image database; Receiving a query image, generating a first descriptor vector by applying a first level codebook in the query image, and extracting at least one candidate image similar to the query image by comparing with the database image; Extracting a second descriptor vector by applying a second level codebook having a higher dimension than the first level codebook in the query image and applying a topic model for distinguishing a subject area and a background area to the second descriptor vector, A function of estimating a subject area of the subject; Applying the topic model to the candidate image to estimate a subject area of the candidate image; And calculating a degree of similarity between a subject area of the query image and a subject area of the candidate image, and setting a similarity ranking of the candidate image according to the degree of similarity.

본 발명에 따르면, 대형 이미지 데이터베이스에서 이미지 검색을 빠르고 효율적으로 수행할 수 있는 효과가 있다. 보다 구체적으로 본 발명을 따르면 이미지에서 오브젝트를 효과적으로 후출할 수 있으며, 나아가 대형 데이터베이스에서 쿼리 이미지의 오브젝트와 유사한 오브젝트를 포함하는 데이터베이스 이미지를 빠르고 정확하게 검색할 수 있는 효과가 있다. According to the present invention, image retrieval can be performed quickly and efficiently in a large image database. More specifically, according to the present invention, an object can be efficiently extracted from an image, and further, a database image including an object similar to an object of a query image in a large database can be retrieved quickly and accurately.

이러한 구성을 따르면 텍스트에 의존하지 않고 이미지만으로 유사한 이미지를 적은 네트워크 리소스만으로 검색할 수 있어 웹 기반 어플리케이션 및 모바일 기반 어플리케이션에 적용될 수 있는 효과가 있다. According to such a configuration, a similar image can be searched by only a small amount of network resources without relying on text, thereby being applicable to web-based applications and mobile-based applications.

나아가 본 발명에 따르면, 클라이언트에 표시된 이미지와 서버의 데이터베이스에 저장된 이미지 사이의 동일 유사 여부를 효율적이고 정확하게 검색할 수 있는 효과가 있다. 따라서 이미지에 검색을 위한 태그를 미리 삽입할 필요가 없으며, 별도 텍스트의 참조 없이 자동으로 유사한 이미지들을 매칭할 수 있다.Further, according to the present invention, it is possible to efficiently and accurately search for the same similarity between the image displayed on the client and the image stored in the database of the server. Therefore, you do not need to insert the tag for search into the image in advance, and you can automatically match similar images without reference to separate text.

도 1은 쿼리 이미지에서 특징점을 추출하고 디스크립터를 계산하는 과정을 설명하기 위한 순서도
도 2는 본 발명의 실시예에 따라 쿼리 이미지와 유사한 데이터베이스 이미지의 후보 그룹을 추출하는 과정을 설명하기 위한 순서도
도 3은 본 발명의 실시예에 따라 토픽 모델을 적용하여 이미지 후보 그룹의 랭킹을 설정하는 과정을 설명하기 위한 순서도
도 4은 본 발명의 실시예를 따르는 이미지 검색 프레임 워크의 구성을 설명하기 위한 도면
도 5는 본 발명의 실시예에 따라 이미지에서 토픽 모델을 적용하는 예시를 설명하기 위한 도면
도 6은 본 발명의 실시예에 따르는 SLTMB에 적용되는 변수 및 파라미터의 관계를 설명하기 위한 도면
도 7은 본 발명의 다른 실시예에 따라 진행되는 유사 이미지 검색 방법의 예시를 설명하기 위한 순서도
도 8은 본 발명의 실시예에 따르는 쿼리 이미지를 전송하고 유사 이미지를 수신하는 클라이언트의 구성을 설하기 위한 블록도
도 9는 본 발명의 실시예에 따르는 이미지 검색 엔진으로 동작하는 서버의 구성을 설명하기 위한 블록도1 is a flowchart for explaining a process of extracting a feature point from a query image and calculating a descriptor
2 is a flowchart for explaining a process of extracting a candidate group of a database image similar to a query image according to an embodiment of the present invention
3 is a flowchart illustrating a process of setting a ranking of an image candidate group by applying a topic model according to an embodiment of the present invention
4 is a diagram for explaining the configuration of an image retrieval framework according to an embodiment of the present invention;
5 is a diagram for explaining an example of applying a topic model in an image according to an embodiment of the present invention;
6 is a diagram for explaining the relationship between parameters and parameters applied to the SLTMB according to the embodiment of the present invention;
7 is a flowchart for explaining an example of a similar image retrieval method according to another embodiment of the present invention
8 is a block diagram for illustrating a configuration of a client for transmitting a query image and receiving a similar image according to an embodiment of the present invention
9 is a block diagram illustrating a configuration of a server operating as an image search engine according to an embodiment of the present invention.

본 발명은 이하에 기재되는 실시예들의 설명 내용에 한정되는 것은 아니며, 본 발명의 기술적 요지를 벗어나지 않는 범위 내에서 다양한 변형이 가해질 수 있음은 자명하다. 그리고 실시예를 설명함에 있어서 본 발명이 속하는 기술 분야에 널리 알려져 있고 본 발명의 기술적 요지와 직접적으로 관련이 없는 기술 내용에 대해서는 설명을 생략한다. It is to be understood that the present invention is not limited to the description of the embodiments described below, and that various modifications may be made without departing from the technical scope of the present invention. In the following description, well-known functions or constructions are not described in detail since they would obscure the invention in unnecessary detail.

한편, 첨부된 도면에서 동일한 구성요소는 동일한 부호로 표현된다. 그리고 첨부 도면에 있어서 일부 구성요소는 과장되거나 생략되거나 개략적으로 도시될 수도 있다. 이는 본 발명의 요지와 관련이 없는 불필요한 설명을 생략함으로써 본 발명의 요지를 명확히 설명하기 위함이다. 이하 첨부된 도면을 참조하여 본 발명의 바람직한 실시 예들을 상세히 설명한다. In the drawings, the same components are denoted by the same reference numerals. And in the accompanying drawings, some of the elements may be exaggerated, omitted or schematically illustrated. It is intended to clearly illustrate the gist of the present invention by omitting unnecessary explanations not related to the gist of the present invention. Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.

이미지 검색은 다양한 방식으로 활용될 수 있다. 예를 들어 상품 정보 검색이 그것이다. Image retrieval can be utilized in various ways. For example, product information search.

예를 들어 데스크탑을 이용하여 블로그를 열람하던 사용자는 웹페이지에 표시되어 있는 이미지 중 유아용품의 이름이나 가격 등이 궁금할 수 있다. 이 경우, 본 발명의 실시예를 따르면 사용자는 디바이스에서 해당 객체를 캡처하고 캡처 이미지를 쿼리로 하여 이미지 검색 서버에 쿼리에 포함된 유아용품과 유사한 이미지 및 상기 유사한 이미지에 연결된 상품 정보에 접근할 수 있다. For example, a user viewing a blog using a desktop may be curious about the name or price of a baby article displayed on a web page. In this case, according to the embodiment of the present invention, the user can capture the object in the device, query the captured image, access the image search server with the image similar to the baby goods contained in the query and the goods information linked with the similar image have.

이를 위하여 쿼리 생성을 위한 익스텐션 프로그램이 플러그인, 엑티브엑스, 툴바 등의 형식으로 웹브라우징 툴에 설치되거나 모바일 디바이스에서 SDK의 형태로 특정 어플리케이션에 삽입되거나 독립된 어플리케이션으로 동작할 수 있다.To this end, an extension program for generating a query may be installed in a web browsing tool in the form of a plug-in, an ActiveX, a tool bar, or the like, or may be embedded in a specific application in the form of an SDK in a mobile device or may operate as an independent application.

서비스 서버는 쿼리에 포함된 이미지 객체와 유사한 이미지를 상품 데이터베이스에서 검색하고, 검색 결과를 사용자 디바이스에 제공할 것이다. The service server will retrieve an image similar to the image object contained in the query from the merchandise database and provide the search results to the user device.

위와 같은 서비스의 기본이 되는 이미지 검색은 최근 모바일 기기의 검색 요청 급증하고 있다. 모바일 기기의 사용자는 검색하고자 하는 대상을 촬영하거나 촬영된 이미지를 활용하여 미리 구축된 데이터베이스 검색을 통해 촬영 대상과 연계된 정보를 얻게 된다. Image retrieval, which is the basis of the above services, is rapidly increasing in search requests for mobile devices. The user of the mobile device acquires the information associated with the object to be photographed through the pre-established database search by photographing the object to be searched or utilizing the photographed image.

예를 들어 상품 검색의 경우, 사용자는 검색하려는 상품을 촬영하고, 이를 데이터베이스에서 검색하여 상품명, 가격, 기능 및 판매처 등의 부가 정보를 억을 수 있다. 또한 서비스 제공자는 거리의 건물을 촬영하고 건물을 인식하여 건물 내의 정보나 주변 정보를 제공하는 서비스를 기획할 수 있다. For example, in the case of a product search, a user may photograph a product to be searched and search it in a database to suppress additional information such as a product name, a price, a function, and a selling place. In addition, the service provider can plan a service that takes a building on the street and recognizes the building and provides information in the building or surrounding information.

그런데 이미지 검색과 관련된 다양한 서비스 시나리오를 만족시키기 위해 이미지 검색 기술은 i) 조명 및 환경 변화, 촬영 시점의 변화에 강인해야 하며 ii) 서로 다른 객체를 포함한 영상 특징을 구별할 수 있어야 하고 iii) 하나의 특징으로 여러 응용분야에 활용 가능해야 하고 iv) 네트워크 환경 및 다양한 단말을 지원할 수 있도록 특징 크기를 계층적으로 구성하여 상호 호환성을 확보하고 v) 대용량 영상 데이터를 효율적으로 검색할 수 있는 구조를 지원해야 하는 검색 효율성 vi) 고속 특징 추출과 검색을 이한 하드웨어 구현을 지원해야 특징이 요구된다. However, in order to satisfy various service scenarios related to image retrieval, image retrieval technology should be robust to changes in lighting and environment, change in shooting time, ii) to be able to distinguish image features including different objects, and iii) It should be able to be applied to various applications. Iv) It should be structured hierarchically to support network environment and various terminals to ensure mutual compatibility. V) Support structure for efficiently searching large image data. And vi) hardware features that support high-speed feature extraction and retrieval.

그러나 종래의 이미지 검색은 쿼리 이미지를 하나의 벡터로 인코딩하고 이를 인덱싱하여 데이터베이스를 검색하고 정합 및 검증하는 방식을 취했다. 이러한 방식은 초기 결과 추출을 위해서는 일반적으로 효율적으로 동작할 수 있다. 그러나 이미지를 하나의 벡터로 인코딩하는 종래 검색 방식을 따르면, 오브젝트의 위치, 크기, 방향 등에 대한 기하학적 정보가 부족하기 때문에, 오브젝트에 대한 폐쇄, 변형, 시점 변화 등의 변경에 매우 민감하게 반응하게 되는 문제가 있었다.However, conventional image retrieval encodes the query image into a single vector and indexes it to retrieve the database, and match and verify it. This approach is generally efficient for extracting initial results. However, according to the conventional search method of encoding an image into one vector, geometric information about the position, size, orientation, and the like of the object is insufficient, so that it is very sensitive to changes such as closing, There was a problem.

본 발명은 이와 같은 문제를 해결하기 위한 것이다. 이를 위해 본 발명은 이미지의 특징을 추출한 후, 인코딩 프레이즈를 거처 초기 검색 리스트를 생성하고, 다음으로 토픽 모델링 프레이즈를 거쳐 상기 리스트의 랭킹을 조정하는 방식으로 대용량 데이터베이스의 검색 결과를 제공하는 구성을 포함할 수 있다. The present invention is intended to solve such a problem. To this end, the present invention includes a configuration for extracting characteristics of an image, generating an initial search list through an encoding phrase, and then providing a search result of a large-volume database by way of adjusting a ranking of the list via a topic modeling phrase can do.

보다 구체적으로, 본 발명의 실시예를 따르면, 이미지의 특징점에 대한 디스크립터를 인코딩하기 위한 VT (Vocabulary Tree)를 제 1 레벨과 제 2 레벨로 구별하여 적용할 수 있다. More specifically, according to an embodiment of the present invention, a VT (Vocabulary Tree) for encoding a descriptor for a feature point of an image can be distinguished from a first level and a second level.

제 1 레벨 VT는 이미지 디스크립터의 값들을 Bow (Back of Words)에서의 단어의 개념으로 생각하고 검색의 대상이 되는 이미지로부터 대략적인 코드 북 (Coarse Codebook)를 구성하여 작성할 수 있다. 쿼리 이미지와 데이터베이스 이미지에 상기 제 1 레벨 VT를 적용하면 특징의 빈도에 따라 후보군을 생성하고 정합 및 검증 방법에 의해 초기 검색 리스트를 생성할 수 있다.The first level VT can be formed by constructing a rough code book from the image to be searched by considering the values of the image descriptor as the concept of words in Bow (Back of Words). If the first level VT is applied to the query image and the database image, candidate groups can be generated according to the frequency of the feature, and an initial search list can be generated by the matching and verification method.

나아가 본 발명의 실시예를 따르는 제 2 레벨 VT는 이미지 디스크립터의 로컬 영역에서의 기하학의 통계적 특성을 반영하여 보다 세밀하게 구성한 정제된 코드북 (Refined Codebook)을 구성하는 방식으로 작성할 수 있다. Further, the second level VT according to the embodiment of the present invention can be created in a manner of constructing a refined codebook constituting a finer structure reflecting the statistical characteristics of the geometry in the local area of the image descriptor.

본 발명의 실시예를 따르면 쿼리 이미지와 초기 검색 리스트에 포함된 이미지에 상기 제 2 레벨 VT를 적용하여 유사도를 계산하고, 토픽 모델을 적용하여 초기 검색 리스트에 대해서 실제 일치하는 정보가 있는지 정합 및/또는 검증할 수 있다. According to an embodiment of the present invention, the second level VT is applied to an image included in a query image and an initial search list to calculate a similarity, and a topic model is applied to determine whether there is actual matching information for an initial search list, Or verification.

상기 토픽 모델은 이미지에서 배경과 객체를 구별하는 토픽 모델을 가정하고, 이 모델이 적용되는지를 반복적으로 검증하여 모델 검증 확률을 계산하기 위한 것이다. 상기 토픽 모델은 대량의 이미지에서 형태 분포, 배경 분포를 추출하여 해당 이미지의 객체를 구별하는 기계 학습의 방식으로 학습될 수 있다.The topic model is intended to calculate a model verification probability by repeatedly verifying whether a topic model is used to distinguish between background and object in an image. The topic model can be learned by a machine learning method of extracting a morphological distribution and background distribution from a large number of images and distinguishing objects of the image.

본 발명의 실시예에 따라 인코딩 단계와 함께 토픽 모델이 적용되면 이미지의 객체 형태의 큰 변화에 대처하는 효율적인 순위 조정이 가능하다. 나아가 공간 배열과 객체, 배경들의 관계에 대한 수학적 계산이 가능할 수 있다.According to the embodiment of the present invention, when the topic model is applied together with the encoding step, it is possible to perform efficient ranking adjustment to cope with a large change in the object shape of the image. Furthermore, it is possible to perform mathematical calculations on the relationship between spatial arrangements and objects and backgrounds.

이와 같은 본 발명의 구성을 구성을 따르면 대형 데이터베이스에서 유사한 오브젝트 이미지를 빠르고 정확하게 효율적으로 검색할 수 있는 효과가 있다. 나아가 본 발명의 실시예를 따르는 이미지 검색 프레임워크를 이용하면 이미지에서 오브젝트가 효과적으로 추출될 수 있다. According to the configuration of the present invention as described above, it is possible to search a similar object image quickly, accurately, and efficiently in a large database. Furthermore, an object can be effectively extracted from an image by using an image search framework according to an embodiment of the present invention.

본 발명의 실시예를 따르는 이미지 검색 방법은 첨부된 도면을 참고하여 보다 자세하게 후술된다. An image retrieval method according to an embodiment of the present invention will be described in more detail below with reference to the accompanying drawings.

도 1은 도 1은 쿼리 이미지에서 특징점을 추출하고 디스크립터를 계산하는 과정을 설명하기 위한 순서도이다.FIG. 1 is a flowchart for explaining a process of extracting feature points from a query image and calculating a descriptor.

단계 110에서 이미지 검색 서비스 서버는, 이미지 데이터베이스를 구축할 수 있다. In step 110, the image search service server may build an image database.

보다 구체적으로, 서비스 서버는 크롤러, 파서, 인덱서를 구비하여, 온라인 에서 웹 문서를 수집하고, 웹 문서에 포함된 이미지 및 이미지에 연결된 각종 정보에 접근할 수 있다. More specifically, the service server includes a crawler, a parser, and an indexer to collect web documents on-line, and access various information linked to images and images included in the web documents.

예를 들어 크롤러는 관심 분야의 웹 주소 목록을 수집하고, 웹사이트를 확인하여 링크를 추적하는 방식으로 이미지와 관련된 데이터를 서비스 서버로 전달할 수 있다. 이때 파서는 크롤링 과정 중에 수집된 웹 문서를 해석하여 페이지에 포함된 이미지, 상기 이미지와 연결된 정보를 추출하며, 인덱서는 해당 위치와 의미를 색인할 수 있다. For example, a crawler might collect a list of web addresses of interest, pass the image-related data to the service server by checking the website and tracking the link. At this time, the parser analyzes the web document collected during the crawling process, extracts the image included in the page, information associated with the image, and the indexer can index the corresponding position and meaning.

나아가 서비스 서버는 수집된 이미지를 프로세싱할 수 있다. 이는 다른 텍스트 정보에 의존하지 않고, 이미지를 기준으로 객체를 레이블링하고 나아가 이미지의 유사도를 기준으로 검색 결과를 제공하기 위한 것이다. Further, the service server can process the collected images. This is for labeling the object based on the image and further providing the search result based on the similarity of the image, without relying on other text information.

이를 위해 서비스 서버는 수집된 이미지의 특징을 추출하고, 검색을 효율성을 위해 이미지들의 특징 정보를 구조화 (indexing)할 수 있다. To do this, the service server extracts features of the collected images and indexes feature information of the images for efficiency in searching.

보다 구체적으로 서비스 서버는 이미지들의 특징 영역을 탐지(Interest Point Detection)할 수 있다. 특징 영역이란, 이미지들 사이의 동일 유사 여부를 판단하기 위한 이미지의 특징에 대한 기술자, 즉 특징 기술자(Feature Descriptor)를 추출하는 주요 영역을 말한다. More specifically, the service server can detect the feature region of the images (Interest Point Detection). The feature region refers to a main region for extracting a descriptor, that is, a feature descriptor, of an image for judging whether or not there is a similarity among images.

본 발명의 실시예에 따르면 이러한 특징 영역은 이미지가 포함하고 있는 윤곽선, 윤곽선 중에서도 코너 등의 모퉁이, 주변 영역과 구분되는 블롭(blob), 이미지의 변형에 따라 불변하거나 공변하는 영역, 또는 주변 밝기보다 어둡거나 밝은 특징이 있는 극점일 수 있으며 이미지의 패치(조각) 또는 이미지 전체를 대상으로 할 수 있다. According to the embodiment of the present invention, such a feature region is a region including a corner, a corner or the like out of the contour including the image, a blob distinguished from the surrounding region, a region which is unchanged or crooked according to the deformation of the image, It can be a dark or bright polar point and can be a patch (piece) of an image or an entire image.

나아가 서비스 서버는 특징 영역에서 특징 기술자를 추출(Descriptor Extraction)할 수 있다. 특징 기술자는 이미지의 특징들을 벡터 값으로 표현한 것이다. Furthermore, the service server can extract the descriptor from the feature region (Descriptor Extraction). The feature descriptor is a vector value representing the characteristics of the image.

본 발명의 실시예에 따르면, 이러한 특징 기술자는 해당 이미지에 대한 특징 영역의 위치, 또는 특징 영역의 밝기, 색상, 선명도, 그라디언트, 스케일 또는 패턴 정보를 이용하여 계산할 수 있다. 예를 들어 특징 기술자는 특징 영역의 밝기 값, 밝기의 변화 값 또는 분포 값 등을 벡터로 변환하여 계산할 수도 있다. According to an embodiment of the present invention, the feature descriptor may be calculated using the position of the feature region for the image, or the brightness, color, sharpness, gradient, scale, or pattern information of the feature region. For example, the feature descriptor may be calculated by converting the brightness value, the brightness value, or the distribution value of the feature region into a vector.

한편 본 발명의 실시예에 따르면 이미지에 대한 특징 기술자는 위와 같이 특징 영역에 기반한 지역 기술자(Local Descriptor) 뿐 아니라, 전역 기술자(Global descriptor), 빈도 기술자(Frequency Descriptor), 바이너리 기술자(Binary Descriptor) 또는 신경망 기술자(Neural Network descriptor)로 표현될 수 있다. According to the embodiment of the present invention, the feature descriptor for the image may include not only a local descriptor based on the feature region, but also a global descriptor, a frequency descriptor, a binary descriptor, It can be expressed as a Neural Network descriptor.

보다 구체적으로, 특징 기술자는 이미지 전체 또는 이미지를 임의의 기준으로 분할한 구역 각각, 또는 특징 영역 각각의 밝기, 색상, 선명도, 그라디언트, 스케일, 패턴 정보 등을 벡터값으로 변환하여 추출하는 전역 기술자 (Global descriptor)를 포함할 수 있다. More specifically, the feature descriptor may include a global descriptor (e.g., a global descriptor) for converting the brightness, color, sharpness, gradient, scale, pattern information, etc. of each of the regions divided by the entire image or an arbitrary reference, Global descriptor).

예를 들어 특징 기술자는 미리 구분한 특정 기술자들이 이미지에 포함되는 횟수, 종래 정의된 색상표와 같은 전역적 특징의 포함 횟수 등을 벡터값으로 변환하여 추출하는 빈도 기술자 (Frequency Descriptor), 각 기술자들의 포함 여부 또는 기술자를 구성하는 각 요소 값들의 크기가 특정값 보다 크거나 작은지 여부를 비트 단위로 추출한 뒤 이를 정수형으로 변환하여 사용하는 바이너리 기술자 (Binary descriptor), 신경망(Neural Network)의 레이어에서 학습 또는 분류를 위해 사용되는 영상정보를 추출하는 신경망 기술자(Neural Network descriptor)를 포함할 수 있다. For example, the feature descriptor may include a frequency descriptor for converting the number of times the specific descriptors classified into the image are included in the image, the number of inclusions of the global feature such as the previously defined color table, etc. into a vector value, A binary descriptor which extracts whether the size of each element value constituting the descriptor is larger or smaller than a specific value and then transforms it into an integer type and uses it in a layer of a neural network Or a neural network descriptor for extracting image information used for classification.

나아가 본 발명의 실시예를 따르면, 서비스 서버는 이미지로부터 추출한 디스크립터를 인코딩하기 위한 VT (Vocabulary Tree)를 제 1 레벨과 제 2 레벨로 구별하여 적용할 수 있다. Further, according to an embodiment of the present invention, the service server can apply a VT (Vocabulary Tree) for encoding the descriptor extracted from the image to the first level and the second level separately.

제 1 레벨 VT는 이미지 디스크립터의 값들을 Bow (Back of Words)에서의 단어의 개념으로 생각하고 검색의 대상이 되는 이미지로부터 대략적인 코드 북 (Coarse Codebook)를 구성하여 작성할 수 있다. 나아가 본 발명의 실시예를 따르면, 서비스 서버는 수집한 이미지에 대해 상기 제 1 레벨의 VT를 적용하고, 제 1 레벨의 VT로 작성된 디스크립터를 하나의 벡터로 통합하여 저차원 벡터로 구성하고 이를 인덱싱할 수 있다. The first level VT can be formed by constructing a rough code book from the image to be searched by considering the values of the image descriptor as the concept of words in Bow (Back of Words). Further, according to an embodiment of the present invention, the service server applies the first level VT to the collected image, composes the descriptors created with the first level VT into one vector, constructs the low dimensional vector, can do.

나아가 본 발명의 실시예를 따르는 서비스 서버는 이미지 로컬 영역에서 기하학의 통계적 특성을 반영하여 보다 세밀하게 구성한 정제된 코드북 (Refined Codebook)을 구성하는 방식으로 데이터베이스 이미지에 대해 제 2 레벨 VT를 작성하고 이를 해당 이미지의 메타 데이터로 작성하고 이를 해당 이미지에 인덱싱할 수 있다. Furthermore, the service server according to the embodiment of the present invention creates a second level VT for the database image in a manner of constructing a refined codebook that is more refined by reflecting the statistical characteristics of the geometry in the image local area, It can be created with the metadata of the image and indexed into that image.

상기 제 1 레벨 VT 및 제 2 레벨 VT를 생성하는 보다 구체적인 설명은 첨부된 도 2 및 도 3에 대한 설명에서 후술된다. A more detailed description of generating the first level VT and the second level VT will be described later in the description of FIGS. 2 and 3 attached hereto.

단계 120에서 서비스 서버는 클라이언트로부터 이미지 검색을 문의하는 쿼리를 수신할 수 있다. In step 120, the service server may receive a query to query the client for an image search.

이를 위해 클라이언트는 쿼리 생성 및 결과 수신을 위한 프로그램을 실행할 수 있다. 상기 프로그램은 플러그인, 엑티브엑스 등의 형식으로 모바일 단말 또는 데스크탑의 웹브라우징 툴에 설치되는 익스텐션 프로그램일 수 있으며, 또는 모바일 단말에 설치되는 어플리케이션일 수 있다. To do this, the client can run a program to generate queries and receive results. The program may be an extension program installed in a mobile terminal or a web browsing tool of a desktop in the form of a plug-in, an active X, or the like, or may be an application installed in a mobile terminal.

상기 프로그램은 클라이언트가 본원발명의 실시예를 따르는 서비스 서버로부터 내려받기하고 설치할 수 있으며, 나아가 서비스 서버로부터 온라인 어플리케이션 마켓 등의 웹사이트 서버를 통하여 내려받기하고 설치할 수도 있다. The program can be downloaded and installed from a service server according to an embodiment of the present invention, and further downloaded from a service server through a website server such as an online application market.

서비스 서버는 쿼리를 수신하면, 상기 쿼리에 포함된 이미지를 확인하고, 이후의 검색을 위해 해당 이미지를 프리 프로세싱할 수 있다. 쿼리 이미지에 대한 프리 프로세싱은 이후의 이미지 프로세싱과 달리, 검색을 위해 쿼리 이미지 자체의 스케일을 미리 설정된 포멧에 따라 변경하거나 사이즈를 변경하거나 해상도를 변경하는 것이다. When the service server receives the query, it can identify the image included in the query and pre-process the image for later retrieval. The pre-processing of the query image, unlike the subsequent image processing, is to change the scale of the query image itself for retrieval according to a predetermined format, or to change the size or resolution.

나아가 단계 130에서 서비스 서버는 쿼리 이미지에서 특징점을 추출할 수 있다. 본 발명의 실시예를 따르는 서비스 서버는 쿼리 이미지에서 영상 변형에 불변성을 가지는 위치를 상기 특징점으로 선택하기 위해 이미지가 포함하고 있는 윤곽선, 윤곽선 중에서도 코너 등의 모퉁이, 주변 영역과 구분되는 블롭(blob), 이미지의 변형에 따라 불변하거나 공변하는 영역, 또는 주변 밝기보다 어둡거나 밝은 특징이 있는 극점일 수 있으며 이미지의 패치(조각) 또는 이미지 전체를 대상으로 추출할 수 있다. Further, in step 130, the service server may extract feature points from the query image. The service server according to an embodiment of the present invention includes a blob that is divided into corners such as a corner and a surrounding area among contours and contours included in an image in order to select a location having invariance in image transformation in the query image, , A region that is unchanged or crooked according to the deformation of the image, or a pole having a darker or lighter characteristic than the surrounding brightness, and can extract a patch (image) of the image or the entire image.

나아가 단계 140에서 서비스 서버는 쿼리 이미지의 특징 영역에서 특징 기술자를 추출(Descriptor Extraction)할 수 있다. 특징 기술자는 이미지의 특징들을 벡터 값으로 표현한 것이다. Further, in step 140, the service server may extract the descriptor from the feature region of the query image (Descriptor Extraction). The feature descriptor is a vector value representing the characteristics of the image.

도 2는 본 발명의 실시예에 따라 쿼리 이미지와 유사한 데이터베이스 이미지의 후보 그룹을 추출하는 과정을 설명하기 위한 순서도이다. FIG. 2 is a flowchart illustrating a process of extracting a candidate group of a database image similar to a query image according to an embodiment of the present invention.

도 2에서 서비스 서버는 쿼리 이미지와 유사한 데이터베이스 이미지의 후보 리스트를 추출할 수 있으며, 이는 도 1에서 추출한 데이터베이스 이미지와 쿼리 이미지의 디스크립터를 이용하여 진행될 수 있다. (단계 210)In FIG. 2, the service server can extract a candidate list of a database image similar to a query image, and this can be performed using a database image extracted in FIG. 1 and a descriptor of a query image. (Step 210)

단계 220에서 서비스 서버는 쿼리 이미지의 디스크립터의 값들을 Bow (Back of Words)에서의 단어의 개념으로 생각하고 대략적인 코드북 (Coarse Codebook)를 구성할 수 있다. 본 발명에서는 이를 제 1 레벨 VT라고 명칭하였다. In step 220, the service server may consider the values of the descriptors of the query image as a concept of words in Bow (Back of Words) and construct a rough codebook. In the present invention, this is referred to as a first level VT.

나아가 서비스 서버는 상기 제 1 레벨 VT에 포함된 로컬 피처들을 하나로 통합하여 저차원 벡터를 형성하고 이를 인덱싱할 수 있다. (단계 220 내지 240)Further, the service server may integrate the local features included in the first level VT into one to form a low dimensional vector and index it. (Steps 220-240)

상기 인덱스 생성 기준은 도 1의 단계 110에서 데이터베이스 이미지에도 그대로 적용되어 데이터베이스에는 이미지 각각에 제 1 레벨 VT에 대한 인덱스가 형성되어 있다. The index creation criterion is applied to the database image at step 110 of FIG. 1, and an index for the first level VT is formed in each of the images in the database.

따라서 서비스 서버는 쿼리 이미지의 인덱스 값과 데이터베이스 이미지의 인덱스 값을 비교하여 미리 설정된 범위 이하의 거리 값을 가지는 이미지를 유사한 것으로 보아, 유사 이미지 후보 그룹을 형성할 수 있다. (단계 250)Therefore, the service server can form the similar image candidate group by comparing the index value of the query image with the index value of the database image and regarding the images having the distance value less than the predetermined range as similar. (Step 250)

도 3은 본 발명의 실시예에 따라 토픽 모델을 적용하여 이미지 후보 그룹의 랭킹을 설정하는 과정을 설명하기 위한 순서도이다. 3 is a flowchart illustrating a process of setting a ranking of an image candidate group by applying a topic model according to an embodiment of the present invention.

도 2에서 서비스 서버는 쿼리 이미지와 유사한 데이터베이스 이미지의 후보 리스트를 추출할 수 있으며, 이는 도 1에서 추출한 데이터베이스 이미지와 쿼리 이미지의 디스크립터를 이용하여 진행될 수 있다. (단계 310)In FIG. 2, the service server can extract a candidate list of a database image similar to a query image, and this can be performed using a database image extracted in FIG. 1 and a descriptor of a query image. (Step 310)

보다 구체적으로 서비스 서버는 도 2에서 추출한 유사 이미지 리스트의 랭킹을 설정하기 위하여 쿼리 이미지의 디스크립터의 값들을 로컬 영역에서의 기하학의 통계적 특성을 반영하여 보다 세밀하게 구성한 정제된 코드북 (Refined Codebook)으로 구성할 수 있다. 본 발명에서는 이를 제 2 레벨 VT라고 명칭하였다. (단계 310)More specifically, in order to set ranking of the similar image list extracted in FIG. 2, the service server configures the values of the descriptors of the query image into a refined codebook in which the statistical characteristics of the geometry in the local area are refined, can do. In the present invention, this is referred to as a second level VT. (Step 310)

나아가 서비스 서버는 쿼리 이미지의 제 2 레벨 코드북과 도 2에서 생성한 후보 리스트 이미지의 제 2 레벨 코드북에 토픽 모델을 적용할 수 있다. (단계 320) Furthermore, the service server may apply the topic model to the second level codebook of the query image and the second level codebook of the candidate list image generated in Fig. (Step 320)

상기 토픽 모델은 이미지에서 배경과 객체를 구별하는 모델을 가정하고, 이 모델이 적용되는지를 반복적으로 검증하여 모델 검증 확률을 계산하기 위한 것이다. 상기 토픽 모델은 대량의 이미지에서 형태 분포, 배경 분포를 추출하여 해당 이미지의 객체를 구별하는 기계 학습의 방식으로 학습될 수 있다.The topic model is for estimating the model verification probability by repeatedly verifying whether or not the model is applied, assuming a model that distinguishes the background from the object in the image. The topic model can be learned by a machine learning method of extracting a morphological distribution and background distribution from a large number of images and distinguishing objects of the image.

상기 토픽 모델을 적용하면 쿼리 이미지와 후보 리스트 이미지에서 주제 영역과 배경 영역이 구분되어, 쿼리 이미지 토픽 오브젝트와 후보 리스트 이미지의 토픽 오브젝트의 유사 여부가 비교될 수 있다. (단계 330, 340)When the topic model is applied, the subject area and the background area are distinguished from each other in the query image and the candidate list image, and the similarity between the query image topic object and the topic object of the candidate list image can be compared. (Steps 330 and 340)

본 발명의 실시예를 따르는 토픽 모델은 첨부된 도 5 및 도 6을 참고하여 보다 구체적으로 설명한다. A topic model according to an embodiment of the present invention will be described in more detail with reference to Figs. 5 and 6 attached hereto.

도 5는 본 발명의 실시예에 따라 이미지에서 토픽 모델을 적용하는 예시를 설명하기 위한 도면이다. 5 is a diagram for explaining an example of applying a topic model in an image according to an embodiment of the present invention.

도 5는 기준 이미지와 비교 대상 이미지의 GV (Geometric Verification)를 비교 하고 있다. 5 compares the GV (Geometric Verification) of the reference image with the comparison image.

도 5a에서 GV의 전역 변환은 건물의 특성 일치가 다른 개체의 기능 일치를 지배하기 때문에 "건물"의 변형을 반영하고 있다. 이러한 일치 사항은 다른 일치하는 개체 (도 5a의 예에서 비행기, 자동차)를 포함하고 있지만, 파란색으로 표시된 일치 점은 정합하는 것으로 간주되나, 노란색으로 표시된 일치점을 정합하지 않은 것으로 처리될 수 있다. In Figure 5A, the global transformation of GV reflects the transformation of the "building" because the property match of the building dominates the functional conformity of the other entity. This correspondence includes other matching entities (airplanes, automobiles in the example of FIG. 5A), but the matchpoints indicated in blue are considered to be matched, but can be treated as not matching the matchpoints indicated in yellow.

도 5b는 본 발명의 실시예에 따라 토픽 레이블을 적용한 국부적인 기하학적 일관성을 추정하는 예시이다. 도 5b의 예에서 토픽 레이블은 먼저 동일한 객체와 연관된 지형지물에 할당될 수 있다. 본 발명의 실시예를 따르면 토픽 레이블을 먼저 오브젝트에 할당하고, 다음에 각각의 로컬 주제 영역의 기하학적 일관성을 추정하기 때문에, 도 5a와 달리 객체들의 일치점은 정합하는 것으로 처리될 수 있는 효과가 있다. 5B is an illustration of estimating local geometric consistency applying a topic label according to an embodiment of the present invention. In the example of FIG. 5B, the topic label may first be assigned to a feature associated with the same object. According to the embodiment of the present invention, since the topic label is first assigned to the object, and then the geometric consistency of each local subject area is estimated, unlike FIG. 5A, the matching points of the objects can be processed as matching.

도 6은 본 발명의 실시예에 따르는 SLTMB (Spatial Latent Topic Model with Background Distribution)에 적용되는 변수 및 파라미터의 관계를 설명하기 위한 도면이다. FIG. 6 is a diagram for explaining a relationship between parameters and parameters applied to an SLTMB (Spatial Latent Topic Model with Background Distribution) according to an embodiment of the present invention.

본 발명의 실시예를 따르는 토픽 모델은 배경의 분산과 공간의 잠재적인 토픽을 추정하기 위한 것으로, 기계 학습 방식으로 학습될 수 있다. A topic model according to an embodiment of the present invention is for estimating the background of the variance and potential topics of the space, and can be learned by a machine learning method.

도 6에 도시된 변수와 파라미터는 아래의 표 1을 참고하여 설명될 수 있다. The parameters and parameters shown in FIG. 6 can be described with reference to Table 1 below.

나아가 본 발명의 실시예를 따르는 토픽 모델은 아래의 단계를 통해 업데이트될 수 있다. Further, a topic model according to an embodiment of the present invention may be updated through the following steps.

각각의 토픽 t에 대해 형태 분포 Φt ~ Dir(β1) 및 배경 분포 Ω ~ Dir (β2) 작성For each topic t, the morphological distribution Φt ~ Dir (β1) and the background distribution Ω ~ Dir (β2)

각각의 이미지 Id (d = 1 … D)에 대해, 워드 타입 분산 값 λd ~ 및 함수 (γ)를 작성하고, 토픽 비율 θd ~ Dir(α)를 계산하고, 각각의 토픽 t (t = 1,2,…, T)에 대해 위치 분포 ~ NW(μ0, κ, ν, Q)를 계산할 수 있다.(T = 1, ..., D) for each image Id (d = 1 ... D) 2, ..., T), the position distribution ~ NW (μ0, κ, ν, Q) can be calculated.

나아가 각각의 워드 위치값 i∈ {1,2, ..., N_d}에 대해 스위치 샘플 s_di ~ 함수 (λ_d)을 계산할 수 있다. 나아가 s_di = 1에서 토픽 z_di ~ 함수 (θ_d)를 계산하고, 비주얼 워드 (visual word) w_di ~ 함수 (Φ_zdi)를 계산하고, 워드 위치값 l_di 를 계산할 수 있다. 다음으로 s_di = 2에서 비주얼 워드 w_di ~ 함수 (Ω)를 계산하고, 워드 위치값 l_di ~ 를 계산할 수 있다. Further, switch sample s _di ~ function (λ _d ) can be calculated for each word position value iε {1,2, ..., N _d }. Further, it is possible to calculate the topic z _di ~ function (? _D ) at s _di = 1, calculate the visual word w _di ~ function (? _Zdi ), and calculate the word position value l _di . Next, we can calculate the visual word w _di ~ function (Ω) at s _di = 2 and calculate the word position value l _di ~.

나아가 본 발명의 실시예를 따르는 토픽모델은 아래의 수학식 1에 따라 계산될 수 있다. Further, a topic model according to an embodiment of the present invention can be calculated according to the following equation (1).

다시 도 3에 대한 설명으로 복귀하면, 서비스 서버는 토픽 모델을 적용하여 후보 이미지 그룹과 쿼리 이미지의 유사도를 기준으로 후보 이미지 그룹의 랭킹을 설정할 수 있다. (단계 350)Referring again to FIG. 3, the service server can apply the topic model to set the ranking of the candidate image group based on the similarity degree between the candidate image group and the query image. (Step 350)

이때 본 발명의 실시예를 따르는 유사도 계산은 아래의 수학식 2 내지 6을 이용하여 계산될 수 있다. At this time, the similarity calculation according to the embodiment of the present invention can be calculated using the following Equations (2) to (6).

이후 서비스 서버는 이미지 검색 결과를 클라이언트에 전송할 수 있다. (단계 360)The service server may then send the image search results to the client. (Step 360)

도 4은 본 발명의 실시예를 따르는 이미지 검색 프레임 워크의 구성을 설명하기 위한 도면이다. 4 is a diagram for explaining a configuration of an image retrieval framework according to an embodiment of the present invention.

본 발명의 실시예를 따르는 이미지 검색 프레임 워크는 미리 프로세싱된 이미지 데이터 베이스 460을 포함할 수 있다.An image search framework in accordance with an embodiment of the present invention may include a preprocessed image database 460.

나아가 쿼리 이미지 410을 수신하면 상기 데이터베이스 이미지 460와 비교하여 유사한 이미지를 순위를 이미지 리스트 480를 제시할 수 있다. Further, upon receiving the query image 410, the image list 480 can be presented in a similar image order as compared with the database image 460.

보다 구체적으로, 본 발명의 실시예를 따르는 이미지 검색 프레임워크는 쿼리 이미지를 수신하면 피처 (특징점에 대한 디스크립터, 420)를 추출하고 이를 2단계로 처리할 수 있다. More specifically, the image retrieval framework according to an embodiment of the present invention extracts a feature (descriptor 420 for a feature point) upon receiving a query image and processes it in two steps.

먼저 이미지 검색 프레임워크는 쿼리 이미지에 대해 2 레벨 VT를 작성할 수 있다. First, the image search framework can create a two-level VT for the query image.

제 1 코드북은 대략적으로 작성되어 로컬 피처에 대해 통합된 (445) 저차원 백터 (447)을 형성할 수 있다. 이는 인덱싱 모듈 (465)에서 쿼리 이미지에 대해 인덱싱되고, 데이터베이스 이미지의 인덱스 값과 비교하여 후보군 (470)을 추출하는데 사용될 수 있다. (인코딩 프레이즈, 440)The first codebook may be roughly created to form a low dimensional vector 447 integrated (445) for the local feature. Which can be indexed for the query image in the indexing module 465 and used to extract the candidate set 470 by comparing it with the index value of the database image. (Encoding phrase, 440)

제 2 코드북은 이미지의 로컬 영역에서 기하학의 통계적 특성을 반영하여 보다 세밀하게 구성된 정제된 코드북을 의미한다. 이는 제 2 레벨 VT로 작성되고 (455) 상기 제 2 레벨 VT에 토픽 모델 (457)을 적용하여 상기 후보 이미지 리스트와 쿼리 이미지의 유사도를 계산 (475)하는데 사용될 수 있다. (토픽 모델링 프레이즈, 450)The second codebook refers to a refined codebook that is more refined by reflecting the statistical properties of the geometry in the local area of the image. This can be used to generate a second level VT 455 and apply the topic model 457 to the second level VT to calculate 475 the similarity of the candidate image list to the query image. (Topic Modeling Phrase, 450)

본 발명의 실시예를 따르는 이미지 검색 프레임워크는 이와 같이 인코딩 단계 및 토픽 모델링 단계를 거쳐 이미지의 유사도가 높은 순위로 검색 결과 (480)를 제시할 수 있다. The image retrieval framework according to the embodiment of the present invention may present the retrieval result 480 in the order of high similarity of images through the encoding step and the topic modeling step.

도 7은 본 발명의 다른 실시예에 따라 클라이언트에서 검색 결과가 표시되는 예시를 설명하기 위한 도면이다. 7 is a diagram for explaining an example in which search results are displayed in a client according to another embodiment of the present invention.

특히 도 7의 실시예는 쿼리 이미지의 특징 정보를 생성하는 처리를 클라이언트에서 담당하여, 네트워크로 이동되는 데이터 양을 줄여, 전체 시스템의 속도를 개선할 수 있는 방법에 대한 것이다In particular, the embodiment of FIG. 7 relates to a method capable of improving the speed of the entire system by taking charge of processing by the client to generate feature information of the query image, thereby reducing the amount of data moved to the network

도 7의 단계 710, 단계 720, 단계 755 및 단계 760의 내용은 앞서 설명한 도 1 내지 도 6에 대한 내용과 실질적으로 동일하므로 이하에서는 차이점 위주로 도 7에 대해 설명된다. The contents of step 710, step 720, step 755, and step 760 of FIG. 7 are substantially the same as those of FIGS. 1 to 6 described above.

단계 725에서 클라이언트는 디스플레이된 이미지에 포함된 객체들을 구분하는 처리를 진행할 수 있다. 예를 들어 클라이언트는 디스플레이된 이미지의 윤곽선 등을 이용하여 배경, 사람, 커피캔, 로션 등의 객체로 구분할 수 있다. In step 725, the client can proceed to distinguish objects included in the displayed image. For example, a client can distinguish objects such as a background, a person, a coffee can, and a lotion using the outline of the displayed image.

단계 730에서 클라이언트는 구분된 이미지 객체에 대한 사용자 선택 입력을 수신할 수 있다. 이때 클라이언트는 구현의 목적에 따라 배경이나 사람 등의 객체는 사용자가 선택할 수 없도록 하고, 매매 가능한 상품에 대한 객체만 선택하도록 설정할 수 있다. In step 730, the client may receive a user selection input for the delimited image object. At this time, the client can set the object such as the background or the person to be selected by the user according to the purpose of the implementation, and to select only the object for the merchandise available for sale.

단계 740에서 클라이언트는 사용자가 선택한 객체 이미지를 쿼리 이미지로 하여 이에 대한 특징 정보를 생성하는 처리를 할 수 있다. 보다 구체적으로 클라이언트는 사용자가 선택한 이미지 객체의 특징 영역을 탐지하고, 특징 기술자를 추출할 수 있으며, 구체적인 내용은 도 1 내지 도 6 에 대한 설명에서 전술한 것과 같다. 한편, 쿼리 이미지의 특징 영역의 개수와 특징 기술자의 개수는 프로세스의 성능을 고려하여 클라이언트마다 달리 설정할 수 있다. In step 740, the client may perform a process of generating feature information on the object image selected by the user as a query image. More specifically, the client can detect the feature region of the image object selected by the user and extract the feature descriptor. The concrete contents are the same as those described in the description of FIG. 1 to FIG. On the other hand, the number of characteristic regions and the number of characteristic descriptors in the query image can be set differently for each client considering the performance of the process.

이후 단계 745에서 클라이언트는 생성한 쿼리 이미지의 특징 정보를 서버에게 전송할 수 있다. Thereafter, in step 745, the client can transmit the feature information of the generated query image to the server.

도 7의 실시예를 따르면 클라이언트에 표시된 이미지 자체가 아니라, 사용자가 선택한 객체 이미지에 대한 특징 정보만이 네트워크를 통해 이동할 수 있다. 따라서 네트워크로 이동되는 데이터 양이 대폭 감소할 수 있으며, 업로드 및 다운로드에 소요되는 시간이 감소하여 전체 시스템의 속도를 개선되는 효과가 있다. According to the embodiment of FIG. 7, not only the image displayed on the client but only the feature information of the object image selected by the user can be moved through the network. Therefore, the amount of data moved to the network can be greatly reduced, and the time required for uploading and downloading is reduced, thereby improving the speed of the entire system.

한편, 단계 765에서 서비스 서버는 데이터베이스 이미지의 특징 정보를 분류한 것과 동일한 기준에 따라 수신한 쿼리 이미지 특징 정보를 변환하는 처리를 할 수 있다. On the other hand, in step 765, the service server can perform the process of converting the received query image feature information according to the same criterion as that of classifying the feature information of the database image.

이후 단계 790, 단계 740의 내용은 도 1 내지 도 6에 대한 설명과 실질적으로 동일하다. 단계 7750에서 클라이언트는 검색 결과를 표시할 수 있다. The contents of steps 790 and 740 are substantially the same as those described with reference to Figs. At step 7750, the client may display the search results.

도 8은 본 발명의 실시예를 따르는 클라이언트의 구성을 설명하기 위한 블록도이다. 본 명세서에서는 스마트폰, 테블릿, PDA 등 모바일 단말에 대해 설명하지만 본 발명은 이에 한정되지 않으며, 본 발명의 실시예를 따르는 클라이언트는 퍼스널 컴퓨터나 노트북일 수 있다. 8 is a block diagram illustrating a configuration of a client according to an embodiment of the present invention. Although a mobile terminal such as a smart phone, a tablet, and a PDA is described in this specification, the present invention is not limited thereto, and a client according to an embodiment of the present invention may be a personal computer or a notebook computer.

도 8에서 도시되는 바와 같이, 본 발명의 실시예에 따른 클라이언트(800)는 통신부(810), 터치스크린부(830), 입력부(820), 저장부(840) 및 제어부(860)를 포함할 수 있으며, 도 8에 도시된 것은 아니지만 오디오 처리부, 비디오 처리부를 더 포함할 수 있다. 8, the client 800 according to the embodiment of the present invention includes a communication unit 810, a touch screen unit 830, an input unit 820, a storage unit 840, and a control unit 860 And may further include an audio processing unit and a video processing unit, though not shown in FIG.

통신부(810)는 클라이언트(800)의 유무선 통신을 위한 해당 데이터의 송수신 기능을 수행하며, 유무선 채널을 통해 데이터를 수신하여 제어부(850)로 출력하고, 제어부(850)로부터 출력된 데이터를 무유선 채널을 통해 전송하는 기능을 수행한다. The communication unit 810 performs a function of transmitting and receiving corresponding data for wired / wireless communication of the client 800. The communication unit 810 receives data through a wired / wireless channel and outputs the received data to the control unit 850, And transmits the data through the channel.

터치스크린부(830)는 터치패널(833) 및 표시부(836)를 포함한다. 특히 본 발명의 실시예를 따르는 터치패널(833)은 표시부(836)에 표시된 이미지 중 상품 객체에 대한 사용자의 선택 입력을 수신하고, 표시부(836)는 상품 정보를 디스플레이하는 기능을 수행할 수 있다. The touch screen unit 830 includes a touch panel 833 and a display unit 836. Particularly, the touch panel 833 according to the embodiment of the present invention can receive the user's selection input of the goods object among the images displayed on the display unit 836, and the display unit 836 can display the goods information .

입력부(820)는 클라이언트(800)를 제어하기 위한 사용자의 입력을 수신하고 입력 신호를 생성하여 제어부(850)에 전달할 수 있다. 본 발명의 선호되는 실시예에서 클라이언트(800)는 터치스크린부(830)만으로 모든 조작이 가능하다. 이 경우 터치패널(833)이 입력부(820)의 기능을 수행할 수 있다. The input unit 820 receives an input of a user for controlling the client 800, generates an input signal, and transmits the input signal to the controller 850. In the preferred embodiment of the present invention, the client 800 is fully operable only with the touch screen portion 830. [ In this case, the touch panel 833 can perform the function of the input unit 820.

저장부(840)는 클라이언트(800)의 동작에 필요한 프로그램 및 데이터를 저장하는 역할을 수행하며, 프로그램 영역과 데이터 영역으로 구분될 수 있다. 본 발명의 실시예를 따르는 저장부(840)의 데이터 영역은 쿼리 이미지 또는 쿼리 이미지의 특징 정보를 저장할 수 있다. 나아가 본 발명의 실시예를 따르는 저장부(840)의 프로그램 영역은 상품 정보를 표시하는 처리를 클라이언트에서 실행시키기 위한 어플리케이션을 저장할 수 있다. 상기 어플리케이션은 표시부(836)에 디스플레이된 이미지를 쿼리 이미지(Query Image)로서 서버로 전송하는 기능, 상기 쿼리 이미지와 동일하거나 유사한 상품 정보를 상기 서버로부터 수신하는 기능, 및 상기 상품 정보를 표시하는 기능을 수행할 수 있다.The storage unit 840 stores programs and data necessary for the operation of the client 800, and can be divided into a program area and a data area. The data area of the storage unit 840 according to an embodiment of the present invention may store feature information of a query image or a query image. Further, the program area of the storage unit 840 according to the embodiment of the present invention may store an application for causing the client to execute the process of displaying the product information. The application may include a function of transmitting an image displayed on the display unit 836 to a server as a query image, a function of receiving product information identical or similar to the query image from the server, a function of displaying the product information Can be performed.

제어부(850)는 클라이언트의 각 구성 요소에 대한 전반적인 동작을 제어한다. 특히, 본 발명의 실시예에 따르는 클라이언트의 제어부(850)는 표시부(836)에 디스플레이된 이미지를 쿼리 이미지로서 서버로 전송하고, 상기 쿼리 이미지와 동일하거나 유사한 검색 결과를 상기 서버로부터 수신하도록 통신부(810)를 제어하며, 상기 검색 결과를 표시하도록 표시부(836)를 제어할 수 있다. The control unit 850 controls the overall operation of each component of the client. In particular, the control unit 850 of the client according to the embodiment of the present invention transmits the image displayed on the display unit 836 to the server as a query image, and transmits a search result, which is the same as or similar to the query image, 810, and controls the display unit 836 to display the search result.

도 9는 본 발명의 실시예를 따르는 서버(900)의 구성을 설명하기 위한 블록도이다. 도 9에서 도시되는 바와 같이, 본 발명의 실시예에 따른 서버(900)는 통신부(910), 저장부(930) 및 제어부(920)를 포함할 수 있으며, 도 9에 도시된 것은 아니지만 입력부와 표시부를 더 포함할 수 있다. 9 is a block diagram illustrating a configuration of a server 900 according to an embodiment of the present invention. 9, the server 900 according to the embodiment of the present invention may include a communication unit 910, a storage unit 930, and a control unit 920, and may include an input unit And may further include a display unit.

도 9에서 서버(900)는 통신부(910), 저장부(930) 및 제어부(920)를 포함하는 것으로 도시되어 있지만, 각 블록은 물리적으로 분리되어 존재할 수 있다. 예를 들어 저장부(930)는 데이터베이스(미도시)에 물리적으로 존재하고, 통신부(910)를 통하여 서버(900)의 제어부(920)와 연결될 수 있다. 9, the server 900 is illustrated as including a communication unit 910, a storage unit 930, and a control unit 920, but each block may be physically separated. For example, the storage unit 930 physically exists in a database (not shown) and may be connected to the controller 920 of the server 900 through the communication unit 910.

통신부(910)는 서버(900)의 유무선 통신을 위한 데이터의 송수신 기능을 수행하며, 유무선 채널을 통해 데이터를 수신하여 제어부(920)로 출력하고, 제어부(920)로부터 출력된 데이터를 무유선 채널을 통해 전송하는 기능을 수행한다. 특히 본 발명의 실시예를 따르는 통신부(910)는 클라이언트로부터 쿼리 이미지(Query Image)를 수신하는 기능을 수행할 수 있다.The communication unit 910 performs a function of transmitting and receiving data for wired / wireless communication of the server 900. The communication unit 910 receives data through a wired / wireless channel and outputs the data to the control unit 920. The control unit 920 transmits the data output from the control unit 920 to a non- Through the network. In particular, the communication unit 910 according to the embodiment of the present invention can perform a function of receiving a query image from a client.

저장부(930)는 서버(900)의 동작에 필요한 프로그램 및 데이터를 저장하는 역할을 수행하며, 프로그램 영역과 데이터 영역으로 구분될 수 있다. 본 발명의 실시예를 따르는 저장부(930)의 데이터 영역에서 저장부(930)는 이미지 데이터베이스, 상기 이미지와 연결된 텍스트 정보 및 쿼리 이미지 특징 정보를 저장할 수 있다. The storage unit 930 stores programs and data necessary for the operation of the server 900, and can be divided into a program area and a data area. In the data area of the storage unit 930 according to an embodiment of the present invention, the storage unit 930 may store an image database, text information associated with the image, and query image feature information.

나아가 본 발명의 실시예를 따르는 저장부(940)의 프로그램 영역은 클라이언트에게 이미지 검색 결과를 제공하는 처리를 서버에서 실행시키는 컴퓨터 프로그램을 저장할 수 있다. 상기 컴퓨터 프로그램은 복수의 이미지에 대한 데이터 베이스를 생성하고, 상기 데이터 베이스에 포함되는 이미지의 특징 정보를 추출하는 기능, 클라이언트에서 디스플레이된 이미지를 쿼리 이미지(Query Image)로 하여 상기 클라이언트로부터 수신하는 기능, 상기 데이터 베이스 이미지의 특징 정보를 검색하여 상기 쿼리 이미지의 특징 정보와 매칭되는 매칭 이미지를 판단하는 기능 및 상기 매칭 이미지에 대한 정보를 상기 클라이언트에게 제공하는 기능을 수행할 수 있다. Further, the program area of the storage unit 940 according to the embodiment of the present invention may store a computer program that causes the server to execute a process of providing an image search result to a client. The computer program includes a function of generating a database for a plurality of images and extracting feature information of an image included in the database, a function of receiving from the client an image displayed on the client as a query image, A function of determining a matching image matching the feature information of the query image by searching the feature information of the database image, and providing the client with information about the matching image.

제어부(920)는 서버(900)의 각 구성 요소에 대한 전반적인 동작을 제어한다. 특히, 본 발명의 실시예에 따르는 제어부(920)는 복수의 이미지에 대한 데이터 베이스를 생성하고, 상기 데이터 베이스에 포함되는 이미지의 특징 정보를 추출하며, 상기 데이터 베이스 이미지의 특징 정보를 검색하여 상기 쿼리 이미지의 특징 정보와 매칭되는 이미지를 판단하는 역할을 수행하며, 검색 결과에 대한 정보를 상기 클라이언트에게 제공하도록 통신부(910)를 제어할 수 있다The control unit 920 controls the overall operation of each component of the server 900. Particularly, the controller 920 according to the embodiment of the present invention generates a database for a plurality of images, extracts feature information of the images included in the database, searches the feature information of the database image, It is possible to control the communication unit 910 to determine the matching image with the feature information of the query image and provide the client with information about the search result

본 명세서와 도면에 개시된 본 발명의 실시 예들은 본 발명의 기술 내용을 쉽게 설명하고 본 발명의 이해를 돕기 위해 특정 예를 제시한 것뿐이며, 본 발명의 범위를 한정하고자 하는 것은 아니다. 여기에 개시된 실시 예들 이외에도 본 발명의 기술적 사상에 바탕을 둔 다른 변형 예들이 실시 가능하다는 것은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에게 자명한 것이다. The embodiments of the present invention disclosed in the present specification and drawings are merely illustrative of specific embodiments of the present invention and are not intended to limit the scope of the present invention in order to facilitate understanding of the present invention. It is to be understood by those skilled in the art that other modifications based on the technical idea of the present invention are possible in addition to the embodiments disclosed herein.

800 : 클라이언트
810: 클라이언트의 통신부
820 : 클라이언트의 입력부
830 : 클라이언트의 터치스크린
840 : 클라이언트의 저장부
850 : 클라이언트의 제어부
800 : 서버
810 : 서버의 통신부
820 : 서버의 제어부
830: 서버의 저장부800: Client
810:
820: Input section of the client
830: Client's touch screen
840: Storage portion of the client
850:
800: Server
810:
820:
830: Storage section of the server

Claims

In a service server, a method for performing an image search,
Generating a first descriptor vector by applying a first level codebook in the query image and extracting at least one candidate image similar to the query image by comparing the query image with a database image;
Extracting a second descriptor vector by applying a second level codebook having a higher dimension than the first level codebook in the query image and applying a topic model for distinguishing a subject area and a background area to the second descriptor vector, A step B of estimating a subject area of the subject;
C) estimating a subject area of the candidate image by applying the topic model to the candidate image; And
Calculating a degree of similarity between a subject area of the query image and a subject area of the candidate image; and setting a similarity ranking of the candidate image according to the degree of similarity.

The method according to claim 1,
Collecting a plurality of images and applying the first level codebook to the collected images to generate a first descriptor vector;
Generating a second descriptor vector by applying the second level codebook to the collected image;
And indexing the first descriptor vector and / or the second descriptor vector into an image using an index value.

3. The method according to claim 2,
And estimating a subject area of the candidate image by applying the topic model to the second descriptor vector of the candidate image.

4. The method according to claim 3, wherein in step (D)
And estimating a similarity between the candidate image and the query image by applying a weight to a descriptor of a subject area of the candidate image.

In a service server, a method for performing an image search,
Collecting a plurality of images, extracting a first descriptor by applying a first level code book to the collected images, and constructing a database indexing the vectors of the first descriptors into corresponding images as index values;
And generating a first descriptor vector by applying the first level codebook in the query image, comparing the vector with an index value of the database image to extract a candidate image similar to the query image, ;
Extracting a second descriptor by applying a second level codebook having a higher dimension than the first level codebook in the query image and applying a topic model for distinguishing a subject area and a background area to the second descriptor, Estimating a region; And
And estimating a subject area of the candidate image by applying the topic model to a second descriptor extracted by applying the second level codebook to the candidate image.

6. The method of claim 5,
Calculating a degree of similarity between a subject area of the query image and a subject area of the candidate image, and setting a similarity ranking of the candidate image according to the degree of similarity.

An apparatus for performing an image search,
A database comprising at least one image;
A communication unit for receiving a query image;
A descriptor extracting unit for generating a first descriptor vector by applying a first level codebook in the query image and extracting a second descriptor vector by applying a second level codebook having a higher dimension than the first level codebook;
Estimating a subject area of the query image by applying a topic model for distinguishing a subject area and a background area to the second descriptor vector of the query image, applying the topic model to the candidate image, Estimating a subject area estimator; And
Extracting at least one candidate image that is similar to the query image as compared with the first descriptor vector and the database, calculating the similarity between the subject region of the query image and the subject region of the candidate image, And a degree-of-similarity calculation unit that sets a degree of similarity rank of the image.

8. The method of claim 7,
Wherein the descriptor-
Generating a first descriptor vector by applying the first level codebook to an image included in the database and applying a second level codebook to generate a second descriptor vector,

9. The method of claim 8,
And an index unit for indexing the first descriptor vector and / or the second descriptor vector into an image using an index value.

An apparatus for performing an image search,
An image database for extracting a first descriptor by applying a first level codebook in a plurality of images and indexing a vector for the first descriptor into an image using an index value;
A communication unit for receiving a query image; And
Generating a first descriptor vector by applying the first level codebook in the query image, extracting at least one candidate image similar to the query image by comparing the vector with an index value of the database image, Extracts a second descriptor by applying a second level codebook of a higher level than the first level codebook and estimates a subject area of the query image by applying a topic model for distinguishing the subject area and the background area to the second descriptor And a controller for applying the topic model to a second descriptor extracted by applying the second level codebook to the candidate image to estimate a subject area of the candidate image.

11. The method of claim 10,
Wherein,
Wherein the similarity degree calculating unit calculates a degree of similarity between a subject area of the query image and a subject area of the candidate image and sets a similarity ranking of the candidate image according to the degree of similarity.

A computer program recorded on a recording medium for causing a computer to execute a method of retrieving an image in a service server,
A function of collecting a plurality of images;
A function of building a database of the collected images;
A first level descriptor is applied to the query image to extract a first descriptor, a vector for a first level descriptor is generated, and an index value of the database image is compared with the vector, A function of extracting at least one candidate image similar to the image;
Extracting a second descriptor by applying a second level codebook having a higher dimension than the first level codebook in the query image and applying a topic model for distinguishing a subject area and a background area to the second descriptor, A function of estimating a region;
Estimating a subject area of the candidate image by applying the topic model to the second descriptor of the candidate image; And
Calculating a degree of similarity between a subject area of the query image and a subject area of the candidate image, and setting a similarity ranking of the candidate image according to the degree of similarity.

A computer program recorded on a recording medium for causing a computer to execute a method of retrieving an image in a service server,
The ability to create an image database;
Generating a first descriptor vector by applying a first level codebook in the query image and extracting at least one candidate image similar to the query image by comparing with the database image;
Extracting a second descriptor vector by applying a second level codebook having a higher dimension than the first level codebook in the query image and applying a topic model for distinguishing a subject area and a background area to the second descriptor vector, A function of estimating a subject area of the subject;
Applying the topic model to the candidate image to estimate a subject area of the candidate image; And
Calculating a degree of similarity between a subject area of the query image and a subject area of the candidate image, and setting a similarity ranking of the candidate image according to the degree of similarity.