KR100626838B1

KR100626838B1 - High speed image compare, retrieval system based on near maximum flow matching and method thereof

Info

Publication number: KR100626838B1
Application number: KR1020040025649A
Authority: KR
Inventors: 이상엽
Original assignee: 주식회사 엘피에이
Priority date: 2004-04-14
Filing date: 2004-04-14
Publication date: 2006-09-20
Also published as: KR20050100729A

Abstract

여기에 개시된 영상 검색 시스템 및 방법은, 캐릭터 영상 객체를 중점으로부터 n개의 섹터로 분할하고, 분할된 영상의 색상 히스토그램을 양자화하여 특징 값을 추출하고 비교할 때 통계적인 방법을 이용하여 문턱값을 추출한다. 그리고, 추출된 소정의 문턱값을 이용하여 최적 유사도를 측정한다. 그 결과, 정확하고도 빠른 정합을 수행할 수 있게 된다. The image retrieval system and method disclosed herein divides a character image object into n sectors from a midpoint, and extracts a threshold value using a statistical method when extracting and comparing feature values by quantizing a color histogram of the divided image. . The optimum similarity is measured using the extracted predetermined threshold value. As a result, accurate and fast matching can be performed.

Description

HIGH SPEED IMAGE COMPARE, RETRIEVAL SYSTEM BASED ON NEAR MAXIMUM FLOW MATCHING AND METHOD THEREOF}

도 1은 본 발명의 바람직한 실시예에 따른 영상 검색 시스템의 전체 블록도;1 is an overall block diagram of an image retrieval system according to a preferred embodiment of the present invention;

도 2는 도 1에 도시된 영상 분할부에서 얻어진 영상 분할 결과와, 분할된 각 영역의 히스토그램에 대한 양자화 과정을 설명하기 위한 도면;FIG. 2 is a diagram for describing a quantization process of an image segmentation result obtained by the image segmenter illustrated in FIG. 1 and a histogram of each segmented region; FIG.

도 3은 캐릭터 영상의 컬러 히스토그램의 예를 보여주는 도면;3 shows an example of a color histogram of a character image;

도 4는 가중치 Bipartite Graph에서 최소 비용을 계산하는 방법을 설명하기 위한 도면;4 is a view for explaining a method of calculating a minimum cost in a weighted Bipartite Graph;

도 5는 WBM의 구성 과정을 설명하기 위한 도면;5 is a view for explaining a configuration process of the WBM;

도 6은 행렬의 최소값에 덧셈과 뺄셈을 이용하여 최적해를 구하는 헝가리안 방법을 설명하기 위한 도면;FIG. 6 is a diagram for explaining a Hungarian method for finding an optimal solution using addition and subtraction at a minimum value of a matrix; FIG.

도 7은 가중치 Bipartite Graph에서 좌측의

의 각 정점이 0, 1, 2, 3으로 설정되고, 우측의

정점이 A, B, C, D로 설정되었을 때, 헝가리안 트리로 모든 연결 가능한 경우를 보여주는 도면;7 shows the left side of the weighted Bipartite Graph.

Each vertex of is set to 0, 1, 2, 3, and

A diagram showing all possible connections to a Hungarian tree when the vertices are set to A, B, C, D;

도 8(a) 내지 8(c)는 너비 우선 검색을 사용하는 헝가리안 트리의 이용 예를 보여주는 도면;8 (a) to 8 (c) show examples of the use of a Hungarian tree using breadth-first search;

도 9(a) 내지 9(c)는 깊이 우선 검색을 사용하는 헝가리안 트리의 이용 예를 보여주는 도면;9 (a) to 9 (c) show examples of the use of a Hungarian tree using depth first search;

도 10은 동일 섹터 영상의 분포 곡선;10 is a distribution curve of the same sector image;

도 11은 도 1에 도시된 다중 특징 정합부에서 수행되는, 본 발명의 바람직한 실시예에 따른 NMFM 정합 방법을 보여주는 흐름도;11 is a flowchart showing an NMFM matching method according to a preferred embodiment of the present invention, performed in the multiple feature matching section shown in FIG. 1;

도 12는 NMFM의 정합 예를 보여주는 도면;12 shows an example of registration of an NMFM;

도 13은 도 1에 도시된 다중 특징 정합부에서 유사도 정렬을 위해 NMFM를 사용하는 방법을 설명하기 위한 도면;FIG. 13 is a view for explaining a method of using NMFM for similarity alignment in the multiple feature matching unit shown in FIG. 1; FIG.

도 14 내지 도16은 본 발명에 대한 성능 평가 그래프;14-16 are graphs of performance evaluation for the present invention;

도 17은 GHM, WBM, 및 NMFM의 양자화 레벨의 수에 따른 성능 평가 그래프; 그리고17 is a performance evaluation graph according to the number of quantization levels of GHM, WBM, and NMFM; And

도 18은 본 발명에 따른 영상 검색 시스템에 대한 WBM과 NMFM의 검색 속도 성능 평가 그래프이다.18 is a graph for evaluating search speed performance of WBM and NMFM for an image search system according to the present invention.

*도면의 주요 부분에 대한 부호의 설명** Description of the symbols for the main parts of the drawings *

100 : 영상 검색 시스템 110 : 전처리부100: image retrieval system 110: preprocessing unit

120 : 영상 분할부 130 : 양자화부120: image segmentation unit 130: quantization unit

140 : 특징 추출부 170 : 다중 특징 정합부140: feature extraction unit 170: multiple feature matching unit

190 : 데이터베이스190: database

본 발명은 데이터 검색 시스템 및 방법에 관한 것으로, 좀 더 구체적으로는 다중 특징 정보를 이용한 캐릭터 영상 검색 시스템 및 방법에 관한 것이다.The present invention relates to a data retrieval system and method, and more particularly, to a character image retrieval system and method using multiple feature information.

디지털 데이터의 기반이 텍스트에서 멀티미디어 정보로 확대되면서 애니메이션 데이터 및 동영상 데이터의 중요성이 커지고 있다. 애니메이션 데이터를 작성하거나 또는 애니메이션 콘텐츠 및, 이와 연관된 지식기반 콘텐츠를 제작함에 있어서, 캐릭터 사용은 날로 증가하고 있다. 새로운 캐릭터의 제작에 있어 기존의 캐릭터들의 현황을 검색할 필요성이 증가되고 있으며, 현재까지 유명 캐릭터 외에 수많은 캐릭터들은 데이터베이스화 되어있지 않기 때문에 정보 공유 및 디지털 콘텐츠 창조에 문제점으로 대두되었다. 또한 새로운 캐릭터를 제작하였을 경우 검색 시스템이 구성되어 있지 않아서 유사한 캐릭터의 제작이 발생되고 지적 사용권 분쟁이 발생하고 있다. 최근 들어 인터넷 서비스에서 아바타를 이용한 자신의 가상 표현이 늘어나고 있다. 또한 교육, 게임 등의 많은 디지털 서비스 산업에서 캐릭터 영상을 사용하기 시작했다. 수많은 디지털 서비스에서 이미 같은 또는 비슷한 형태의 캐릭터를 재 제작하여 사용하는 경우가 많은 이유는 적합한 캐릭터 영상을 검색하고 공급하는 시스템의 부재 때문이다. 따라서 캐릭터 정보를 데이터베이스화하고 해당 캐릭터를 검색하는 시스템을 구성하게 되면 디지털 콘텐츠의 비용이 감소되면서 발전된 디지털 정보를 구축할 수 있게 된다. As the basis of digital data is expanded from text to multimedia information, the importance of animation data and video data is increasing. Character use is increasing day by day in creating animation data or in producing animation content and related knowledge-based content. In the production of new characters, the necessity of searching for the current status of existing characters is increasing. To date, many characters other than famous characters have not been made into a database, which has caused problems in information sharing and digital content creation. In addition, when a new character is produced, a search system is not constructed, and thus a similar character is produced and a intellectual license dispute arises. Recently, the virtual expression of oneself using an avatar in the internet service is increasing. In addition, many digital services industries, such as education and games, have begun using character video. Many digital services already recreate and use characters of the same or similar form because of the lack of a system for searching and supplying suitable character images. Therefore, if the character information is made into a database and a system is searched for the character, the digital content can be reduced and the advanced digital information can be constructed.

그러나, 캐릭터 영상은 그 형태와 색상이 다양하게 변화되는 특징을 가지고 있기 때문에, 형태의 정보나 색상 정보 한가지를 이용하여 검색하기가 매우 어렵고, 색상과 형태의 정보를 함께 사용하여 검색한다 하더라도 검색 성능이 떨어지는 문제점이 있다. 이 외에도, 캐릭터 영상은 서로 다른 캐릭터 영상이라 하더라도 그 형태와 칼라 색상 히스토그램이 같은 영상들이 존재하기 때문에 일반 영상 검색 방법으로 구별하기 힘든 문제점이 있고, 일반 자연 영상과 달리 패턴이 없으며 단색의 형태를 가지고 있기 때문에 패턴을 이용한 검색방법이 어려운 문제점이 있다. However, since the character image has various characteristics of changing its shape and color, it is very difficult to search by using the form information or one color information, and even if the search is performed by using the color and form information together, the search performance is high. There is a falling problem. In addition, the character image has a problem that it is difficult to distinguish it by the normal image search method because the image and the color histogram having the same color even if the character image is different, there is no pattern and has a monochrome form unlike the general natural image There is a problem that the search method using the pattern is difficult.

본 발명이 이루고자 하는 기술적 과제는, 형태의 정보나 색상 정보 한가지를 이용하여 검색하기가 어려운 캐릭터 영상을 적어도 둘 이상의 특징 정보들이 복합된 다중 특징 정보를 이용하여 효과적으로 검색하는 영상 검색 시스템 및 방법을 제공하는데 있다.SUMMARY OF THE INVENTION The present invention provides an image retrieval system and method for efficiently retrieving a character image that is difficult to retrieve using one type of form information or color information using multi-feature information including at least two feature information. It is.

본 발명이 이루고자 하는 기술적 과제는, 정확하고도 빠른 정합을 수행할 수 있는 영상 검색 시스템 및 방법을 제공하는데 있다. An object of the present invention is to provide an image retrieval system and method that can perform accurate and fast matching.

상기의 과제를 이루기 위하여 본 발명에 의한 영상 검색 시스템은, 검색에 사용될 복수의 데이터가 저장되어 있는 데이터베이스; 검색하고자 하는 대상을 다중 특징 정보를 가진 복수 개의 영역들로 분할하고, 상기 분할된 영역들에 대한 특징 값들을 구하는 전처리부; 및 통계적인 방법에 의해 문턱값을 결정하고, 상기 복수의 데이터 중 상기 문턱값의 범위 내에 속하는 데이터와 상기 분할된 영역의 특징 값들에 대한 유사도를 비교하여 상기 검색 대상에 대한 최적 유사도를 결정하는 다중특징 정합부를 포함하는 것을 특징으로 한다.In order to achieve the above object, the image retrieval system according to the present invention includes a database in which a plurality of data to be used for retrieval are stored; A preprocessing unit dividing an object to be searched into a plurality of areas having multiple feature information, and obtaining feature values of the divided areas; And determining a threshold value by a statistical method, and comparing the similarity of the data within the range of the threshold value with feature values of the divided region to determine an optimal similarity level for the search target. Characterized in that it comprises a matching portion.

바라직한 실시예에 있어서, 상기 다중특징 정합부는 상기 데이터베이스에 저장된 데이터 중 상기 문턱값의 범위에 속하는 데이터가 존재하지 않을 경우, 더 이상 정합을 수행하지 않거나, 또는 WBM(Weighted Bipartite Matching) 정합을 수행하여 유사한 영상을 검색하는 것을 특징으로 한다.In a preferred embodiment, the multi-feature matching unit no longer performs matching or weighted bipartite matching (WBM) matching when there is no data belonging to the threshold range among the data stored in the database. It is characterized by searching for similar images.

바람직한 실시예에 있어서, 상기 다중특징 정합부는 상기 데이터베이스에 저장된 데이터 중 상기 문턱값의 범위에 속하는 데이터가 존재하는 경우, NMFM(Near Maximum Flow Matching) 정합을 수행하여 유사한 영상을 검색하는 것을 특징으로 한다.In a preferred embodiment, the multi-feature matching unit searches for a similar image by performing Near Maximum Flow Matching (NMFM) matching when there is data belonging to the range of the threshold value among the data stored in the database. .

상기의 과제를 이루기 위하여 본 발명에 의한 영상 검색 방법은, (a) Bipartite Graph에서 연결 가능한 모든 연결을 설정하는 단계; (b) 최소 비용을 얻는 쌍을 추출하는 단계; (c) 상기 최소 비용을 얻는 쌍의 가중치와 통계적으로 얻어진 소정의 문턱값과의 차이를 계산하는 단계; (d) 계산된 상기 전체 쌍의 가중치가 상기 문턱값의 범위를 벗어난 경우, WBM(Weighted Bipartite Matching) 정합을 수행하여 최적의 흐름과 정합 쌍을 구하는 단계; 및 (e) 계산된 상기 전체 쌍의 가중치가 상기 문턱값의 범위를 벗어나지 않은 경우, 상기 문턱값 범위에 속하는 상기 쌍에 대해 NMFM(Near Maximum Flow Matching) 정합을 수행하여 대략적 최대 흐름을 구하는 단계를 포함하는 것을 특징으로 한다.In order to achieve the above object, the image retrieval method according to the present invention comprises the steps of: (a) setting all connections that can be connected in the Bipartite Graph; (b) extracting the pair with the lowest cost; (c) calculating a difference between the weight of the pair for obtaining the minimum cost and a predetermined threshold obtained statistically; (d) performing weighted bipartite matching (WBM) matching to obtain an optimal flow and matched pair when the calculated weights of the entire pair are outside the range of the threshold value; And (e) obtaining an approximate maximum flow by performing Near Maximum Flow Matching (NMFM) matching on the pair belonging to the threshold range when the calculated weight of the entire pair does not deviate from the range of the threshold value. It is characterized by including.

바람직한 실시예에 있어서, 상기 영상 검색 방법은, 상기 (a) 단계 이전에 검색하고자 하는 대상을 다중 특징 정보를 가진 복수 개의 영역들로 분할하고, 상 기 분할된 영역들에 대한 특징 값들을 구하는 전처리 단계를 더 포함하는 것을 특징으로 한다.In a preferred embodiment, the image retrieval method comprises: pre-processing a target to be searched before the step (a) into a plurality of areas having multiple feature information and obtaining feature values for the divided areas. It further comprises a step.

(실시예)(Example)

본 발명의 신규한 영상 검색 시스템 및 방법은, 캐릭터 영상 객체를 중점으로부터 n개의 섹터로 분할하고, 분할된 영상의 색상 히스토그램을 양자화하여 특징 값을 추출하고 비교할 때 통계적인 방법을 이용하여 문턱값을 추출한다. 그리고, 추출된 소정의 문턱값을 이용하여 최적 유사도를 측정한다. 그 결과, 정확하고도 빠른 정합을 수행할 수 있게 된다. The novel image retrieval system and method of the present invention divides a character image object into n sectors from a midpoint, and quantizes a color histogram of the divided image to extract and compare feature values, thereby using a statistical method to determine threshold values. Extract. The optimum similarity is measured using the extracted predetermined threshold value. As a result, accurate and fast matching can be performed.

특히, 본 발명에 따른 영상 검색 시스템은, 캐릭터 영상의 다중 특징 정보를 이용하여 동일한 캐릭터를 찾아내는 것에 대해 설명하고 있으나, 이는 다중 특징 정보를 이용한 검색 시스템의 일 예에 불과하며, 다량의 정보가 저장되어 있는 데이터베이스로부터 특정 정보를 검색하는 어떠한 검색 시스템에도 본 발명을 적용할 수 있다. 예를 들어, 다수의 학생들에 대한 성적이 저장된 데이터베이스에서 특정 조건(특히, 복수 개의 조건에 부합된 다중 특징)을 만족하는 학생을 검색하고자 할 때, 본 발명을 적용하면 보다 짧은 시간에 보다 정확한 검색 결과를 얻을 수 있다. In particular, the image retrieval system according to the present invention has been described for finding the same character by using the multi-characteristic information of the character image, but this is only an example of a retrieval system using the multi-feature information, a large amount of information is stored The present invention can be applied to any retrieval system for retrieving specific information from an established database. For example, if you want to search for students who meet certain criteria (especially multiple features that match multiple criteria) in a database where scores for multiple students are stored, applying the present invention can provide more accurate search in a shorter time. You can get the result.

이하 본 발명에 따른 실시예를 첨부된 도면을 참조하여 상세히 설명한다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명의 바람직한 실시예에 따른 영상 검색 시스템(100)의 전체 블록도이고, 도 2는 도 1에 도시된 영상 분할부(120)에서 얻어진 영상 분할 결과와, 분할된 각 영역의 히스토그램에 대한 양자화부(130)의 양자화 과정을 설명하기 위한 도면이다.1 is an overall block diagram of an image retrieval system 100 according to an exemplary embodiment of the present invention, and FIG. 2 is an image segmentation result obtained by the image divider 120 shown in FIG. 1 and a histogram of each divided region. A diagram for describing a quantization process of the quantization unit 130 with respect to FIG.

도 1을 참조하면, 본 발명에 따른 영상 검색 시스템(100)은 크게 전처리부(110), 다중 특징 정합부(170), 및 데이터베이스(190)로 구성되어, 형태와 색상 변화와 같이 서로 다른 두 가지 이상의 정보가 결합된 다중 특징 정보를 이용한 영상 검색(또는 데이터 검색)을 수행한다. 여기서, 데이터베이스(190)는 영상 검색 시스템(100)과 함께 로컬 영역 상에 구비될 수도 있고, 원격지에 구비되어 온라인으로 연결될 수도 있다. 그리고, 상기 데이터베이스(190)는 영상 검색 시스템(100)에서 자체적으로 제공될 수도 있고, 타 기관이나 시스템에서 제공되는 어떠한 데이터베이스도 적용이 가능하다.Referring to FIG. 1, the image retrieval system 100 according to the present invention is largely composed of a preprocessor 110, a multi-feature matching unit 170, and a database 190. Image search (or data search) using multiple feature information combined with at least two kinds of information is performed. Here, the database 190 may be provided on the local area together with the image retrieval system 100 or may be provided at a remote location and connected online. In addition, the database 190 may be provided by itself in the image retrieval system 100, or any database provided by another organization or system may be applied.

일반적으로, 캐릭터와 같은 영상 데이터의 검색에 다중 특징 정보를 이용하게 되면, 형태나 색상 정보 하나만으로 구성된 단일 정보의 검색에 비해 구현이 어렵고 연산 시간도 긴 문제점이 있다. 그러나, 본 발명에 따른 영상 검색 시스템(100)은 다중 특성 정보를 이용한 검색에 NMFM(Near Maximum Flow Matching) 정합 방법을 사용하기 때문에, 기존의 단일 정보 검색 방법과 유사한 성능을 가지면서도 매우 빠른 속도로 정합을 계산할 수 있다. 이와 같은 본 발명의 각 구성을 살펴보면 다음과 같다. In general, when multiple feature information is used for retrieving image data such as a character, there is a problem that it is difficult to implement and computation time is long as compared with retrieval of a single piece of information consisting only of shape or color information. However, since the image retrieval system 100 according to the present invention uses a NMFM (Near Maximum Flow Matching) matching method for retrieval using multi-characteristic information, the image retrieval system 100 has a performance similar to that of a conventional single information retrieval method and at a very high speed. The match can be calculated. Looking at each configuration of the present invention as follows.

먼저, 영상 검색 시스템(100)에 구비된 전처리부(110)의 구성 및 동작에 대해 살펴보면, 전처리부(110)는 영상 분할부(120)와, 양자화부(130), 그리고 특징 추출부(140)로 구성된다.First, the configuration and operation of the preprocessor 110 included in the image retrieval system 100 will be described. The preprocessor 110 may include an image divider 120, a quantizer 130, and a feature extractor 140. It is composed of

도 1 및 도 2를 참조하면, 영상 분할부(120)는 객체 영상을 복수 개의 영역으로 분할하여 분할된 영상에 대한 색상 히스토그램을 계산한다. 색상 히스토그램 을 가지고 캐릭터 영상의 형태 변화에 적응하면서 영상을 비교하기 위해서는, 변화가 큰 부분 영상과 변화가 크지 않은 부분 영상의 정합에 가중치가 달라지게 하는 경우 가장 최적으로 비교가 된다. 영상 분할부(120)를 통해 객체 영상을 n개의 섹터 영상으로 분할하는 것은 영상의 중점으로부터 변화되는 영역과 변화되지 않는 영역을 분리하여 최대 흐름도를 보기 위함이다. 그리고, 영상을 복수 개의 영역(즉, 섹터)으로 분할하여 지역(local)의 영상 정보를 보는 것은 분할된 각 영상의 형태 정보를 고려하기 위함이다. 따라서, 객체 영상을 복수 개의 섹터로 나누게 되면 형태 정보와 색상 정보를 함께 가지는 다원적인 정보가 구성된다. 1 and 2, the image divider 120 divides an object image into a plurality of regions and calculates a color histogram of the divided image. In order to compare the images while adapting the shape change of the character image with the color histogram, the comparison is most optimal when the weights are different for matching the partial image with the large change and the partial image with the small change. The division of the object image into n sector images through the image divider 120 is to view a maximum flow chart by separating an area that is not changed from an area of the image and an area that is not changed. In addition, segmenting an image into a plurality of regions (ie, sectors) to view local image information is to consider shape information of each segmented image. Therefore, when the object image is divided into a plurality of sectors, plural pieces of information having both shape information and color information are formed.

양자화부(130)는 영상 분할부(120)에서 획득된 n개의 분할 영상에서 구해진 색상 히스토그램을 삼각형 형태의 파젠(Parzen) 윈도우를 이용하여 양자화한다. 만약, 영상 분할부(120)에서 얻어진 섹터 영상의 히스토그램 정보를 그대로 정합에 이용하게 되면, 한 픽셀 당 3 바이트의 정보가 되기 때문에 데이터 양이 크다. 예를 들어, RGB 레벨이 256일 경우 한 개의 섹터 영상에서 총 768개의 빈(bin)이 필요하고, 6개의 섹터영상으로 분할하였을 경우 총 4608개의 빈이 필요하게 된다. 따라서, 정보를 줄이기 위해 양자화가 필요하다. 캐릭터 영상은 히스토그램이 특정 지역에서 피크가 나타나는 형태이기 때문에 양자화를 하여도 정보의 유실이 많지 않은 특징을 가지고 있다. 그러나, 특정 지역에 피크가 나타나는 경우 그 지역이 양자화의 경계(boundary)일 경우에는 오차가 발생하게 된다. 이런 문제를 보안하기 위해서 본 발명에서는 양자화를 할 때 Parzen 윈도우를 삼각형 형태로 겹치게 하여 분할하였다. 이렇게 하면 양자화의 경계에서 발생되는 오차가 줄어들게 된다. 양자 화부(130)의 구성 및 기능을 보다 구체적으로 살펴보면 다음과 같다. The quantization unit 130 quantizes the color histogram obtained from the n divided images acquired by the image divider 120 using a parzen window in the form of a triangle. If the histogram information of the sector image obtained by the image dividing unit 120 is used for matching, the data amount is large because it becomes 3 bytes of information per pixel. For example, when the RGB level is 256, a total of 768 bins are required in one sector image, and when a six-sector image is divided, a total of 4608 bins are required. Thus, quantization is needed to reduce the information. Since the character image is a form in which the histogram shows a peak in a specific region, the character image has a feature of not losing much information even when quantized. However, if a peak appears in a specific region, an error occurs if the region is a boundary of quantization. In order to secure such a problem, in the present invention, when the quantization is performed, the Parzen windows are overlapped and divided into triangles. This reduces the error that occurs at the boundary of quantization. Looking at the configuration and function of the quantization unit 130 in more detail as follows.

도 3은 캐릭터 영상의 컬러 히스토그램의 예를 보여주는 도면이다. 도 2 및 도 3을 참조하면, 캐릭터 영상은, 특징 값이 한쪽에 몇 개의 피크로 나타나는 경우가 많다. 도 2 및 도 3에 도시된 바와 같이, 영상의 색상 정보가 전역에 고루 분포된 것이 아니라 특정 지역에 밀집되어 나타난다. 3 is a diagram illustrating an example of a color histogram of a character image. 2 and 3, in the character image, a characteristic value often appears as several peaks on one side. As shown in FIG. 2 and FIG. 3, the color information of the image is not uniformly distributed throughout, but is concentrated in a specific area.

예를 들어, 전체 256 레벨의 색상 분포를 총 n개의 빈으로 나누었을 경우 한 개의 빈의 경계에 있는 값들은 크게 변화 되게 된다. 만일 경계 영역의 중앙에 색상 히스토그램이 밀집되어 나타날 경우 빈의 수를 작게 하였을 때 한 개의 빈에서 다른 빈의 차이는 크게 기여되기 때문에 오차가 높게 나타나게 된다. 이 같은 문제점을 해결하기 위해 Parzen 윈도우가 사용된다. Parzen 윈도우의 기본 식은 다음과 같다.For example, if the total 256-level color distribution is divided into n bins, the values at the boundaries of one bin will change significantly. If the color histogram is concentrated in the center of the boundary area, the error is high because the difference between one bin and the other bin is greatly contributed when the number of bins is reduced. To solve this problem, the Parzen window is used. The basic formula for the Parzen window is:

[수학식1]에서,

은 양자화 영역의 길이,

는 현재 영역에서 떨어진 크기, varphi 는 양자화 윈도우, 그리고,

은

의 부피(volume) 함수를 각각 나타낸다. [수학식1]은

윈도우를 사용하여 영역에 있는 값을 통합하여 특정 값으로 변환할 때 사용한다. 이때 어떤 함수를 사용하는가에 따라서 그 영역의 값은 다양하게 나타난다. 오차를 최소화하기 위해서 가장 이상적인 방법은 가우시안(Gaussian) 윈도우를 사용하는 것이다. 그러나 가우시안 윈도우를 이용하게 되면 연산량이 늘어나는 단점이 있다. 우리가 목적으로 하는 것은 캐릭터 영상에서 특징 값들이 몇 개의 피크로 나타나고, 그 피크가 경계선에 있을 때 오차를 최소화하는 것이다. 따라서, 연상량은 줄이고 오차를 최소화하기 위해서 본 발명에서는 가우시안 형태 대신 삼각형의 Parzen 윈도우를 사용하였으며, 이를 수학식으로 표시하면 다음과 같다.In [Equation 1],

Is the length of the quantization region,

Is the size away from the current region, varphi is the quantization window, and

silver

Represent each volume function of. Equation 1 is

It is used when window is used to integrate the values in the area and convert them to specific values. The value of the range varies depending on which function you use. The ideal way to minimize errors is to use a Gaussian window. However, using a Gaussian window has the disadvantage of increasing the amount of computation. Our goal is to minimize the error when the feature values appear in a few peaks in the character image and those peaks are at the boundary. Therefore, in order to reduce the association amount and minimize the error, the present invention uses a triangular Parzen window instead of a Gaussian form, which is expressed as follows.

[수학식2]에서,

은 영상의 원래 히스토그램 레벨이고,

는 양자화할 히스토그램 레벨이다. 그리고,

은 히스토그램의 값들이고,

는 결과로 반영되는 히스토그램의 값이다. Q는 양자화 레벨 값인데, 이 값은 칼라의 가중치 값을 이용하여 계산된다. 캐릭터 영상의 값들은 몇 개의 부분에 밀집해서 있는 형태이기 때문에 [수학식2]를 이용해도 가우시안 윈도우를 사용한 경우와 크게 차이가 나지 않게 된다. 이와 같이 양자화부(130)에 의해 각각의 분할 영역에 대한 색상 히스토그램이 양자화되고 나면, 특징 추출부(140)를 통해 양자화된 각각의 영역에 대한 특징 값이 추출된다.In Equation 2,

Is the original histogram level of the image,

Is the histogram level to quantize. And,

Is the value of the histogram,

Is the value of the resulting histogram. Q is a quantization level value, which is calculated using the weight value of the color. Since the values of the character images are concentrated in a few parts, the equations 2 do not differ significantly from the Gaussian window. As described above, after the color histogram of each divided region is quantized by the quantization unit 130, the feature value of each quantized region is extracted by the feature extractor 140.

다시 도 1을 참조하면, 전처리부(110)에 의해서 형태 정보와 색상 정보를 포함하는 다중 정보에 대한 특징 값들이 추출되고 나면, 다중 특징 정합부(170)는 WBM(Weighted Bipartite Matching) 또는 NMFM(Near Maximum Flow Matching) 정합을 실행하여 영상의 색상 및 형태 정보에 대한 유사도를 비교하고, 최적 유사도를 획득한다. Referring back to FIG. 1, after feature values for multiple information including shape information and color information are extracted by the preprocessing unit 110, the multiple feature matching unit 170 may perform weighted bipartite matching (WBM) or NMFM ( Near Maximum Flow Matching) is performed to compare the similarity with respect to the color and shape information of the image and obtain an optimal similarity.

다중 특징 정합부(170)에서 수행되는 정합은, 색상 및 형태적 특징이 결합된 다원화된 특징에 대한 정합이다. 최적화된 특징 값을 비교할 때 다중 특징 정보를 이용하면, 단일 특징 정보에 비해 정확도가 높고 영상의 단일 특징에 변화가 있어도 검색이 용이한 장점이 있다. 그러나 다중 특징 정보는 그 정보의 속성이 다양하기 때문에 비교에 있어 단일 특징 비교에 비해 연산의 어려움이 크다. 이러한 다중 특징 정보를 비교하기 위해 일반적으로 탐색 방법과 결합된 정합 방법을 사용할 수 있으나, 다중 특징 정보를 활용한 영상의 정합은 2원화된 정보의 비교 정합으로 볼 수 있기 때문에, 본 발명에서는 Bipartite Graph의 정합을 이용하여 다중 특징 정보를 비교한다. Matching performed in the multi-feature matching unit 170 is a matching for a pluralized feature combining color and morphological features. The use of multiple feature information when comparing optimized feature values has the advantage of higher accuracy than single feature information and easy retrieval even when there is a change in a single feature of an image. However, the multi-feature information has a variety of attributes, which is more difficult to compare than the single feature comparison. Generally, a matching method combined with a search method may be used to compare the multi-feature information. However, since the matching of an image using the multi-feature information can be regarded as a comparison matching of binary information, in the present invention, the Bipartite Graph Comparing multiple feature information using the matching of.

Bipartite Graph를 이용해서 두 쌍의 다중 특징 정보를 비교하여 유사도를 계산하기 위해서는 최대 흐름 정합 (max flow matching) 방법을 이용할 수 있다. 그리고, 특징 정보의 속성에 따라 서로 다른 가중치를 부여하여 비교를 수행하게 되면, Bipartite Graph의 최대 흐름 정합은 가중치가 적용된 Bipartite Graph 문제, 즉 WBM(Weighted Bipartite Matching) 문제로 간주할 수 있게 된다.In order to calculate similarity by comparing two pairs of multiple feature information using a bipartite graph, a max flow matching method may be used. When the comparison is performed by assigning different weights according to the attributes of the feature information, the maximum flow matching of the bipartite graph may be regarded as a weighted bipartite graph problem, that is, a weighted bipartite matching (WBM) problem.

WBM을 정합에 이용하게 되면 형태의 변화가 발생하여도 그 변화에 적응하여 영상을 비교 할 수 있는 장점을 가진다. 그러나, WBM은 복잡도가 큰 편에 속하기 때문에, 대용량 데이터베이스의 비교 및 실시간 비교에서 많은 연산량과 시간을 요하는 단점을 가진다. 따라서, 본 발명에서는 이와 같은 WBM의 단점을 극복하기 위 해서 대략적 최대 흐름의 값을 유추하는 NMFM을 제안한다. 이 방법은 오차가 발생할 가능성이 높은 영역을 제거하고 최대 정합이 발생 할 수 있는 최상 단 노드쌍 경우의 흐름 값을 유추하여 대략적으로 최대 흐름을 계산해 내는 방법으로서, WBM과 같은 성능을 가지면서 속도가 빠른 장점을 가지고 있어, 실시간으로 영상을 검색하는 시스템에서 사용이 용이하다.If WBM is used for registration, it has the advantage of being able to compare the image by adapting to the change even if the shape change occurs. However, since WBM is of high complexity, it requires a large amount of computation and time in comparing large databases and real-time comparison. Therefore, the present invention proposes an NMFM that infers the approximate maximum flow value in order to overcome such disadvantages of WBM. This method calculates the approximate maximum flow by eliminating the areas where error is likely to occur and by inferring the flow value of the uppermost node pair where maximum matching can occur. It has a fast advantage, making it easy to use in a system that searches for images in real time.

먼저, 최대 흐름 정합을 수행하는 WBM 정합 방법을 살펴보면 다음과 같다.First, a WBM matching method for performing maximum flow matching is as follows.

도 4는 가중치 Bipartite Graph에서 최소 비용을 계산하는 방법을 설명하기 위한 도면으로, 최소 소요 비용이 발생되는 유사도 비교 그래프가 도시되어 있다. FIG. 4 is a diagram illustrating a method of calculating a minimum cost in a weighted bipartite graph, and a similarity comparison graph in which a minimum required cost is generated is illustrated.

도 4를 참조하면, Bipartite Graph에 각 정점에 연결된 정점마다 가중치가 부여된 그래프를 가중치 Bipartite Graph라고 한다. 가중치 Bipartite Graph에서좌측의

의 각 정점이 우측의

정점과 정합을 하는 문제가 발생되는데, 이와 같은 정합의 문제에서 두 그래프의 유사도를 구하는 부분을 최대흐름문제(Maximum flow problem)로 변환 할 수 있다.

의 각 정점을 출발지라 하고

의 정점들을 도착지라 하면,

과

사이의 연결선은 두 정점을 잇는 경로로 간주할 수 있고, 출발지로부터 도착지로 화물을 수송했을 때 각 경로의 용량의 합을 최대화하는 문제를 최대 흐름 문제라 할 수 있다.Referring to FIG. 4, a weighted graph for each vertex connected to each vertex in the bipartite graph is referred to as a weighted bipartite graph. Weighted Bipartite Graph

Each vertex on the right

The problem of matching with vertices occurs, and the similarity of two graphs can be converted into the maximum flow problem.

Each vertex of is called the origin

The vertices of

and

The connecting line between the two peaks can be regarded as a path connecting two peaks, and the problem of maximizing the sum of the capacity of each path when the cargo is transported from the starting point to the destination is the maximum flow problem.

가중치 Bipartite Graph에서 가중치의 값이 최대가 되는 때를 최대 흐름의 조건에 의한 정합이라고 하며, 최대 흐름의 조건에 의한 정합을 WBM이라고 한다. 최대 흐름 정합을 수학식으로 표시하면 다음과 같다.Weight When the value of the weight in the Bipartite Graph reaches the maximum, matching by the condition of the maximum flow is called matching, and matching by the condition of the maximum flow is called WBM. The maximum flow match is represented by the following equation.

[수학식3]에서, 그래프 G=(V, E)에서 정합된 한 개의 정점은 E₀이고, 이 때 E₀가 가지는 가중치 값은

이고, 각 정점에 연결된 에지의 가중치 합은

로 나타낸다. 그리고, [수학식3]의 G=(V, E)는 완전그래프(Complete Graph)에서 임의의 온전그래프(Perfect Graph)를 의미하며, 온전그래프의 가중치 합이 최대가 될 때, 그 때의 그래프가 최대 흐름 정합을 가지는 그래프가 된다. 이 같은 온전그래프를 찾아 정합을 하는 것이 바로 WBM 정합이다. WBM 정합을 이용하지 않고 순차적으로 연산을 수행한다면, 모든 온전 그래프의

의 값을 구하여야 하며, 이 경우 n!번의

을 구하기 때문에, 매우 큰 복잡도를 가지게 된다. 따라서, 모든 가능한 온전그래프의 가중치 합을 구하지 않고 최소의 연산으로 최대 흐름도를 찾는 알고리즘인 WBM을 이용한다.In Equation 3, one vertex matched in the graph G = (V, E) is E ₀ , and the weight value of E ₀ is

The sum of the weights of the edges connected to each vertex is

Represented by In addition, G = (V, E) in [Equation 3] means an arbitrary perfect graph in the complete graph, and when the sum of the weights of the whole graph becomes the maximum, the graph at that time Becomes the graph with the largest flow match. It is WBM registration that finds and matches such an intact graph. If you perform the operations sequentially without using WBM matching, all of the whole graph

Must be obtained, in which case n!

Since we find, we have a very large complexity. Therefore, we use WBM, which is an algorithm that finds the maximum flow chart with minimum computations without finding the sum of the weights of all possible intact graphs.

WBM 정합에서는 헝가리안 메소드 등을 이용하여 최소 소요 비용을 얻어내고, 이를 통해 정합을 찾아낸다. 헝가리안 메소드의 기본적인 개념은 각각의 에지에서 최소값을 빼도 최적화 흐름에는 관계하지 않는다는 전제에서 출발한다. 헝가리안 메소드에서는 최대 흐름 값을 얻기 위해 최소 비용 값을 이용한다.In WBM matching, the Hungarian method is used to obtain the minimum cost, which is used to find the matching. The basic concept of the Hungarian method starts from the premise that subtracting the minimum value from each edge does not affect the optimization flow. The Hungarian method uses the minimum cost value to get the maximum flow value.

최소 소요 비용이란, 특정 정점에서 다른 정점으로의 이동에 드는 비용을 의미한다. G=(V, E) 그래프에서 임의의 에지(a,b)의 최대 흐름이 W_max이고 (a,b)의 가중치가 W_(a,b)라 할 때, 최소흐름 비용 W_min은 [수학식4]와 같다.The minimum required cost means the cost of moving from one vertex to another. In the G = (V, E) graph, when the maximum flow of any edge (a, b) is W _max and the weight of _{(a, b)} is W _{(a, b)} , the minimum flow cost W _min is Equation 4]

W_min = W_max - W_(a,b) W _min = W _max -W _{(a, b)}

[수학식4]에서 W_min을 이용하여 최소 소요 비용을 찾아내는 것은 최대 흐름의 정합쌍을 찾는 것과 같다. 이와 같이 최소 소요 비용을 찾는 방법을 이용하는 것은 유사도 비교에 많이 이용된다. In Equation 4, finding the minimum required cost using W _min is equivalent to finding the matched pair of maximum flow. This method of finding the minimum cost is often used for similarity comparison.

WBM 정합에서는 W_max를 가지고 있지 않아도 유사도 비교에서 W_min을 얻을 수 있다. 예컨대, G=(V, E)에서

과

의 각 정점이 고유의 값을 가지고 있고,

과

가 같은 속성을 가진 그룹이라고 할 때, 두 그룹의 유사도가 얼마나 비슷한지는 최소 소요 비용을 이용하여 결정된다. In WBM matching, you can get W _min from the similarity comparison even if you do not have W _max . For example, at G = (V, E)

and

Each vertex in has a unique value,

and

When is a group with the same property, how similarity between the two groups is determined by using minimum cost.

도 4에는 하나의 온전 Bipartite Graph가 설정된 경우, 각 정점이 가지고 있는 고유의 값의 차이를 이용하여 최소 흐름을 나타내고 있다. 예를 들어, 한 정점의 값이 7이고, 비교 정점의 값이 5일 경우 두 개의 유사도 차는 2가 되며 이 2의 값이 최소 흐름 비용을 나타내는 것이다. In FIG. 4, when one intact Bipartite Graph is set, a minimum flow is illustrated using a difference in inherent values of each vertex. For example, if one vertex has a value of seven and a comparison vertex has a value of five, then the two similarities will be two, which represents the minimum flow cost.

가능성 있는 모든 온전그래프의 모든 값을 구한 후에 그 중 최대 흐름도를 구성하는 온전그래프를 선택하게 되면 많은 연산을 하게 된다. 하지만 몇 개의 정점이 연결된 상태에서 나머지 정점들이 모두 최대값으로 연결되었다고 가정하면 해당 온전그래프의 가능성 있는 최대값을 예측할 수 있다. 이 예측값이 다른 온전그 래프 정합 쌍의 예측값 보다 작을 경우 해당 온전그래프는 최대 흐름값을 가질 수가 없기 때문에 더 이상 연산을 하지 않는다. 이런 방법을 이용하여 최대 흐름도가 되는 정합쌍을 구하는 것이 WBM 알고리즘이다. After getting all the values of all possible intact graphs and selecting the intact graph that constitutes the maximum flow chart, many operations are performed. However, assuming that a few vertices are connected and all the remaining vertices are connected to their maximum values, we can predict the maximum potential of the intact graph. If the predicted value is smaller than the predicted value of other ontographic matching pairs, the intact graph cannot have any further operation because it cannot have the maximum flow value. Using this method, the WBM algorithm finds the matching pair that is the maximum flow chart.

도 5는 WBM의 구성 과정을 설명하기 위한 도면으로서, WBM을 할 때 발생되는 4가지 경우의 예가 각각 도시되어 있다. 도 5의 (a)에는 한 개의 노드 (1, d) 가 설정된 상태에서 연결 가능한 경로 {(2, a), (3, c), } 설정된 경우가 도시되어 있고, (b)에는 두 개의 노드{(1, d), (2, a)}만 연결되었을 경우가 도시되어 있다. 그리고, (c)에는 한 개의 노드 (1, c) 가 완전히 연결되고 한 개의 노드 (2, a)가 연결되었다 가정할 때, 다른 연결 가능한 경로들이 도시되어 있고, (d)에는 온전한 정합 상태가 각각 도시되어 있다. FIG. 5 is a view for explaining a configuration process of the WBM, and examples of four cases generated when the WBM is shown, respectively. FIG. 5A illustrates a case where a connectable path {(2, a), (3, c),} is set in a state where one node (1, d) is set, and (b) two nodes. The case where only {(1, d), (2, a)} is connected is shown. And, in (c), assuming that one node (1, c) is fully connected and one node (2, a) is connected, other connectable paths are shown, and (d) shows an intact mating state. Each is shown.

한 개의 노드가 완전히 연결된 상태라면 도 5의 (a)와 같은 온전그래프가 (n-1)!개 만들어지며, 각각의 비용 값이 예측된다. 이 때 X 그룹의 정점 2가 a에 설정될 때 가장 작은 최소 비용이 나올 경우, (1, d), (2, a) 가 연결된 상태로 볼 수 있다. 이 경우 나머지 정점들도 다양한 여러 형태로 연결될 수 있으며, 도 5의 (c)가 두 개의 정점이 연결된 이후에 연결 가능한 쌍을 보여주는 것이다. 가능성 있는 온전그래프에서 (a)와 같은 형태가 여러 개가 나올 수 있는데, 이중에서 최소 비용이 나올 수 없는 온전그래프를 제거하고 (b)단계로 넘어가며, (b)단계에서 가능성 있는 연결을 예측하는 (c) 단계가 되고, 이 때 같은 방법으로 최소 비용이 나올 수 없는 온전그래프를 제거하는 방법으로 노드를 만들어 가면서 최소 비용이 나오는 정합쌍을 만들 수가 있다.If one node is completely connected, (n-1)! Of whole graphs as shown in (a) of FIG. 5 are generated, and each cost value is predicted. In this case, when vertex 2 of group X is set to a, when the smallest minimum cost is obtained, (1, d) and (2, a) can be regarded as connected. In this case, the remaining vertices may be connected in various forms, and FIG. 5C shows a pair that can be connected after two vertices are connected. In a potential intact graph, there may be several forms such as (a), of which the elimination of the inexpensive intact graph is skipped to step (b), and in step (b) In step (c), we can create a matched pair with the least cost as we create the node by removing the intact graph, which cannot be costed in the same way.

아래의 [수학식5]는 WBM에서 임의의 에지 쌍을 설정하였을 때 전체 흐름 값을 계산하는 식이다.Equation 5 below calculates the total flow value when an arbitrary pair of edges is set in the WBM.

[수학식5]에서 E는 현재 설정되어 있는 에지 쌍의 총 흐름값을 의미하고, Exy는 현재 최대 정합으로 설정된 에지의 흐름값이 된다.

는 새롭게 최대 정합의 후보로 설정된 에지의 흐름값을 의미하며,

는 정합되어 있지 않는 나머지 노드들 중에서 연결 가능한 최소값을 의미한다. [수학식5]에서 알 수 있듯이 WBM은 흐름값이 최소가 되는 모든 에지 쌍을 후보로 선정하면서 반복적으로 노드를 증가하면서 E 값을 구하고, 노드가 증가되기 이전의 E 값이 현재의 E 값보다 흐름 값이 크게 되면 상위의 정합 쌍은 연결 불가능 경로가 된다.In Equation 5, E denotes the total flow value of the currently set edge pair, and Exy becomes the flow value of the edge currently set to the maximum match.

Means the flow value of the edge newly set as the candidate of maximum matching,

Is the minimum connectable value among the remaining nodes that are not matched. As can be seen from Equation 5, WBM selects all edge pairs whose flow value is the minimum and obtains the E value by repeatedly increasing the node, and the E value before the node is increased is greater than the current E value. If the flow value becomes large, the upper matched pair becomes an unconnectable path.

Bipartite Graph에서 WBM을 수행하기 위한 대표적인 방법은, 쿤(Konig)에 의해서 고안된 헝가리안 방법(Hungarian Method)이다. 이 방법은 헝가리안 트리를 이용하여 뿌리에서 가장 큰 영양분의 흐름을 가진 줄기는 계속 자라고 그 외 작은 영양분을 흐르게 하는 가지들은 더 이상 자라지 않는 형태를 알고리즘에 적용한 것이다. 이 방법은 행렬을 표기하여 최소값을 더하거나 빼 주는 방법이 있으며, 트리를 이용하여 연산하는 방법이 있다. 행렬을 이용하는 방법은 기본적으로 각 행이나 열에서 같은 값을 더해주거나 빼주어도 최적 해는 변하지 않는다는 점에 착안하고 있다. 따라서, 각행이나 열에서 적당한 값을 빼거나 더해주어, 가중치의 모든 값이 0 보다 크거나 같으면서 0이 많이 들어 있도록 바꾸어 나가게 된다. 만일 이렇게 하여 가중치가 0이 되는 것으로만 할당할 수 있게 되면, 이것이 바로 최적해가 되는 것이다. A representative method for performing WBM on a Bipartite Graph is the Hungarian Method, which was devised by Konig. This method uses a Hungarian tree to apply the algorithm to the stems with the largest flow of nutrients in the roots, while the branches that carry the smallest nutrients no longer grow. This method adds or subtracts the minimum value by writing a matrix, and there is a method using a tree. The idea of using matrices is basically that even if you add or subtract the same value in each row or column, the optimal solution does not change. Thus, by subtracting or adding the appropriate values in each row or column, all weight values are greater than or equal to zero and replaced with many zeros. If we can only assign this to a weight of zero, this is the optimal solution.

도 6은 행렬의 최소값에 덧셈과 뺄셈을 이용하여 최적해를 구하는 헝가리안 방법을 설명하기 위한 도면이다. 도 6을 참조하면, 도 6의 (a)에는 각각의 노드들이 연결되었을 때의 가중치를 설정한 예가 도시되어 있다. (a)에서는 각 행에 대하여 행에 있는 비용 중 최소값을 선택하여 그 행의 비용에 감한 값을 설정하는 동작을 수행한다. (a)에서 각 행의 최소값을 선택하고, 그 값으로 나머지 행의 값을 빼 주게 되면 (b)의 형태가 된다. (b)에서는 각 열에 대하여 열에 있는 비용 중 최소값을 구하여 그 열의 각 비용을 감한 값을 설정하는 동작을 수행한다. (b)에서 각 열의 최소값을 선택하고, 그 값으로 나머지 열의 값을 빼주게 되면 (c)의 형태가 된다. FIG. 6 is a diagram illustrating a Hungarian method for obtaining an optimal solution using addition and subtraction at a minimum value of a matrix. Referring to FIG. 6, FIG. 6A illustrates an example of setting weights when respective nodes are connected. In (a), for each row, the minimum value among the costs in the row is selected to set a value subtracted from the cost of the row. If you select the minimum value of each row in (a) and subtract the value of the remaining rows by that value, it becomes the form of (b). In (b), for each column, the minimum value of the costs in the column is obtained, and the operation of setting the subtracted cost of each column is performed. If you select the minimum value of each column in (b) and subtract the value of the remaining columns with the value, it becomes the form of (c).

(a)와 (b)의 과정을 수행하고 나면, 0이 여러 개 생기게 된다. 이때 이 0의 값이 모두 지워질 수 있도록 행과 열을 직선으로 연결하는 방법들 중에서 직선의 수가 최소가 되는 경우를 결정한다. 그리고, 결정된 직선의 수가 행의 숫자보다 적으면 다음 단계를 수행하고, 직선의 수가 행의 숫자와 같으면 최적해가 만들어지게 된다. 예를 들어, (c)에서 0을 모두 지울 수 있는 선을 긋게 되면 1행과 4열이 여기에 해당된다. 이 때, 그어진 선은 2개로서 전체 행의 숫자인 4 보다 작으므로, 다음 단계를 수행하게 된다. 즉, (c)에서는 직선의 수가 행의 숫자 보다 적을 경우, 먼저 직선으로 지워지지 않는 비용 중에서 최소 비용을 구하고, 이 비용을 직 선으로 지워지지 않는 비용에서 빼주고 직선으로 두 번 교차된 영역에는 더해주는 동작을 다시 수행하게 된다.After performing steps (a) and (b), you get several zeros. At this time, it is determined whether the number of straight lines is the minimum among the methods of connecting rows and columns in a straight line so that all the values of 0 can be deleted. If the determined number of straight lines is less than the number of rows, the next step is performed. If the number of straight lines is the same as the number of rows, an optimal solution is created. For example, if you draw a line to erase all zeros in (c), this would be 1 row and 4 columns. At this time, since the drawn line is 2 and smaller than 4, which is the number of all lines, the next step is performed. That is, in (c), if the number of straight lines is smaller than the number of rows, first obtain the minimum cost among the costs that cannot be erased by the straight line, subtract this cost from the cost not erased by the straight line, and add it to the area crossed twice by the straight line. Will run again.

도 6의 (c)에서, 선을 긋지 않고 남은 부분 중에서 최소값은 3행 1열의 1이 된다. 이 경우, 선을 긋지 않은 나머지 부분에서 1을 빼주고 두 번 겹치게 된 1행 4열 의 0의 값에는 1을 더해주는 동작을 수행하며, 그 결과는 (d)가 된다. 계속해서, (d)에서도 (c)에서 수행된 것과 같은 방법으로 0을 지우는 직선을 긋게 되면, 3개의 직선이 나오게 된다. 이 경우, 같은 방법으로 최소값 2를 구하고, 두 선이 만나는 부분에는 2를 더하고, 그리고 직선이 지나지 않는 부분에서 2 값을 빼면, (e)와 같은 결과가 얻어지게 된다.(e)에서는 그을 수 있는 직선이 모두 4개이므로, 모든 방법이 끝나게 된다. 최적 해는 도 6의 (e)에서 0이 한번 나타나는 행 또는 열을 구하여 0값을 갖는 칸에 1을 채워 나가면 된다. In FIG. 6C, the minimum value is 1 in 3 rows and 1 column among the portions remaining without drawing a line. In this case, 1 is subtracted from the rest of the undrawn line, and 1 is added to the value of 0 in the first row and the fourth column that overlaps twice, and the result is (d). Subsequently, if a straight line for erasing zero is drawn in the same manner as that performed in (c) in (d), three straight lines appear. In this case, if you obtain the minimum value of 2 in the same way, add 2 to the point where the two lines meet, and subtract 2 from the point where the straight line does not pass, you get the same result as (e). Since there are four straight lines, all methods are over. The optimal solution is to find the row or column in which 0 appears once in FIG.

아래에서는 온전한 정합을 찾기 위해 사용되는 헝가리안 트리가 설명된다.Below is the Hungarian tree used to find the perfect match.

도 7은 가중치 Bipartite Graph에서 좌측의

의 각 정점이 0, 1, 2, 3으로 설정되고, 우측의

정점이 A, B, C, D로 설정되었을 때, 헝가리안 트리로 모든 연결 가능한 경우를 보여주는 도면이다.7 shows the left side of the weighted Bipartite Graph.

Each vertex of is set to 0, 1, 2, 3, and

When vertices are set to A, B, C, and D, they show all possible connections to Hungarian trees.

도 7을 참조하면, 도 7의 첫 번째 레벨은 A와 0, 1, 2, 3 이 결합하고, 나머지 값은 최소로 설정되었을 때의 비용이 된다. 두 번째 레벨은 A와 첫 번째 레벨에서 설정된 항목을 제외하고 가능성 있는 경로를 B에 할당하고 나머지 부분에서 최소값을 설정한다. 같은 방법으로 세 번째 레벨은 두 개의 결합이 이루어진 나머지 두 개의 결합을 설정하였을 때의 최소 비용이다. 각 레벨의 특징은 하위 레벨은 상위 레벨의 비용보다 큰 비용이 나올 수 없는 것이다. 상위 레벨은 현재 설정된 쌍 외에 나머지에서 쌍의 결정 없이 최소의 비용을 선택하기 때문에 하위 레벨의 비용들이 상위 레벨의 비용과 같거나 큰값이 나오게 되어 있다. 하위 레벨보다 큰 비용을 가지고 있는 상위 레벨이 있다면 상위 레벨은 더 이상 작은 비용을 발생하는 하위 레벨을 가질 수가 없기 때문에, 헝가리안 트리가 더 이상 자랄 필요가 없게 된다. 따라서 하나의 루트를 선택하고 가지를 만들었을 때 그 가지가 다른 루트 보다 작은 값이 설정되었다면 해당 루트의 가지는 만들 필요가 없게 된다. 이 방법을 이용하여 더 이상 자랄 수 없는 부모 레벨의 연산을 하지 않음으로 빠르게 정합을 유추할 수가 있다.Referring to FIG. 7, the first level of FIG. 7 is a cost when A is combined with 0, 1, 2, and 3, and the remaining values are set to minimum. The second level, except for the items set in A and the first level, assigns possible paths to B and sets the minimum in the rest. In the same way, the third level is the minimum cost of setting up the remaining two joins with two joins. The characteristic of each level is that lower levels cannot cost more than higher levels. Since the upper level selects the minimum cost without determining the pair in the rest other than the currently set pair, the lower level costs are equal to or greater than the upper level cost. If there is a higher level that has a higher cost than a lower level, the Hungarian tree no longer needs to grow, because the higher level can no longer have a lower level incurring smaller costs. Therefore, if you select one route and create a branch, and the branch is set smaller than the other route, you do not need to create the branch of that route. Using this method, we can quickly infer matching by not performing parent-level operations that can no longer grow.

헝가리안 트리를 이용할 때, 같은 레벨의 루트를 먼저 연산하는 너비 우선 검색(Breath First Search) 방법과 하나의 루트를 선택하고 자식노드를 만들어 가는 깊이 우선 검색(Depth First Search) 방법이 사용될 수 있다. 너비 우선검색을 이용하였을 경우 자식 노드의 값이 최소값으로 설정되면 매우 빠른 속도로 계산이 완료된다. 반면 많은 레벨의 연산 후에 결과가 결정되는 경우는 깊이 우선 검색 방법이 빠르다.When using a Hungarian tree, a breadth first search method that calculates routes of the same level first and a depth first search method of selecting one route and creating child nodes may be used. In case of using width priority search, the calculation is completed very fast when the value of child node is set to the minimum value. On the other hand, if the result is determined after many levels of operation, the depth-first search is faster.

캐릭터 검색 시스템에서는 비교되는 영상의 색상 히스토그램을 이용하기 때문에 유사성이 높은 값들이 나타나게 된다. 이럴 경우 너비 우선 검색을 이용하게 되면 많은 레벨의 이동 후에 자라지 못하는 부모 노드가 설정되기 때문에, 깊이 우선 검색 방법을 이용하는 것이 용이하다. 그러나 영상의 특징이 완벽하게 분리되는 그룹에서는 너비 우선 검색 방법을 이용하는 것이 용이하다.Since the character search system uses the color histogram of the images to be compared, high similarity values appear. In this case, it is easy to use the depth-first search method because the width-first search sets up a parent node that cannot grow after many levels of movement. However, it is easy to use the breadth-first search method in the group where the features of the image are completely separated.

도 8(a) 내지 8(c)는 너비 우선 검색을 사용하는 헝가리안 트리의 이용 예를 보여주는 도면이다.8 (a) to 8 (c) are diagrams showing an example of using a Hungarian tree using breadth-first search.

도 8(a)을 참조하면, 도 8(a)에는 너비 우선 검색의 첫 번째 스텝을 보여준다. 도 8(a)는 (A, 0)이 결합되고 나머지 값이 모두 최소가 되는 형태로, 노드가 연결되었을 때 가중치 값이 101이 되고, 나머지 루트들은 105, 99, 108형태의 값을 가지게 된다. 헝가리안 트리에서 최초 루트는 가장 최소 비용이 드는 곳을 선택하는 것이 일반적이다. Referring to FIG. 8 (a), FIG. 8 (a) shows the first step of the breadth-first search. In FIG. 8 (a), (A, 0) is combined and the remaining values are all minimum. When the nodes are connected, the weight value is 101, and the remaining routes have values of 105, 99, and 108 type. . The first route in a Hungarian tree is usually the one with the least cost.

3개의 연결 노드 중에서 (A, 2), (B, 1)인 경우가 103일 때 가장 작은 비용을 가지게 된다. 여기에서 상위 레벨의 노드 값들이 103보다 큰 루트는 더 이상 작은 값이 나올 수 없는 상태이기 때문에 더 자랄 수 없는 경로로 설정된다. 도 8(a)에서 (A, 1) 과 (A, 3)는 103 보다 큰 값을 가지기 때문에 더 이상 자식 노드를 만들지 않게 된다.Among the three connection nodes, (A, 2), (B, 1) has the smallest cost when 103. Here, a root whose upper level node values are greater than 103 is set as a path that cannot grow because a smaller value can no longer come out. In FIG. 8 (a), since (A, 1) and (A, 3) have a value greater than 103, no further child nodes are created.

너비 우선 검색은 자식노드로 한 레벨 내려갔을 때 활성화 되어있는 노드도 함께 자식노드를 만드는 형태이다. 도 8(b)에는 (A, 0) 쌍에 자식노드가 만들어진 형태가 도시되어 있다. 도 8(b)를 참조하면, (A, 0) 과 (A, 2)의 쌍에서 가장 작은 비용이 드는 노드가 (A, 2) (B, 1)이 된다. 따라서 가장 작은 비용이 드는 쌍을 우선 선택하여 자식노드를 만들어가게 된다. In breadth-first search, the child node is also created when the child node goes down one level. FIG. 8 (b) shows a form in which child nodes are formed in pairs (A, 0). Referring to FIG. 8 (b), the node having the smallest cost in the pair of (A, 0) and (A, 2) becomes (A, 2) (B, 1). Therefore, the lowest cost pair is selected first to create a child node.

그리고, 도 8(c)에는 (A, 2) (B, 1)에 자식노드가 만들어진 형태가 도시되어 있다. 도 8(c)를 참조하면, (A, 2), (B, 1), (C, 0), (D, 3) 은 103의 최소 값을 가지게 되어, 103 보다 큰 값을 가지는 상위의 모든 노드들은 더 이상 자랄 필요가 없게 되어, 정합은 종료된다. 8 (c) shows a form in which child nodes are made in (A, 2) (B, 1). Referring to FIG. 8 (c), (A, 2), (B, 1), (C, 0), and (D, 3) have a minimum value of 103, and all of the upper values having a value greater than 103 are represented. The nodes no longer need to grow, so the match ends.

계속해서, 깊이 우선 검색 알고리즘을 살펴보면 다음과 같다.Going forward, the depth-first search algorithm looks like this:

도 9(a) 내지 9(c)는 깊이 우선 검색을 사용하는 헝가리안 트리의 이용 예를 보여주는 도면이다. 깊이 우선 검색 알고리즘은, 최소 비용이 설정된 하나의 노드를 만들고, 그 노드의 자식 노드를 최소 비용이 설정되는 노드로 만들어 간 후에, 최소 비용에 해당하는 자식 노드가 레벨 끝에 갔을 경우 부모 노드로 올라오면서 그 다음 최소 비용의 자식 노드를 만들어 가는 방법이다. 9 (a) to 9 (c) are diagrams showing an example of using a Hungarian tree using depth first search. The depth-first search algorithm creates a node with a minimum cost, makes the child node of that node the node with the minimum cost, and then goes to the parent node when the child node corresponding to the minimum cost has reached the end of the level. The next step is to create the lowest cost child node.

도 9(a)를 참조하면, 최소 비용에 해당하는 (A, 2) 노드에서 자식 노드를 만들고, 자식노드에서 가장 작은 비용이 설정된 (A, 2), (B, 1)이 선택되면, (A, 1)과 (A, 3)는 자랄 수 없는 노드가 된다. 이때 (A, 2), (B, 1)이 선택된 후에 (A, 2), (B, 1) 다음으로 자식 노드를 만들어 깊이로 우선 탐색을 하게 된다. 도 9(b)에서는 (A, 2), (B, 1)에서 다음 노드를 만들게 되는데, (A, 2), (B, 1), (C, 0), (D, 3)의 값이 최소값으로 설정되면, (A, 0)의 루트 노드를 제외한 모든 노드들은 더 이상 자랄 수 없는 노드가 된다. 이 같이 최소 비용의 루트 노드를 선택하고 깊이 탐색이 끝나게 되면, 자랄 수 있는 상위 노드를 선택하여 다시 깊이 탐색을 수행하게 된다. 도 9(c)는 루트 노드인 (A, 0)에서 자식 노드를 만든 형태이다. 자식 노드를 만든 후에, 각 노드의 최소 비용을 계산했을 때 최소 비용이 (A, 2)의 최 하단 자식 노드의 값보다 크기 때문에 더 이상 자랄 수 없는 노드가 되어서 (A, 2), (B, 1), (C, 0), (D, 3)의 최적화된 결과가 된다.Referring to FIG. 9 (a), if (A, 2), (B, 1) is selected in which a child node is created at node (A, 2) corresponding to the minimum cost and the smallest cost is set in the child node, (( A, 1) and (A, 3) become nodes that cannot grow. At this time, after (A, 2) and (B, 1) are selected, (A, 2) and (B, 1) are created next, and child nodes are searched first by depth. In FIG. 9 (b), the next node is made from (A, 2) and (B, 1), and the values of (A, 2), (B, 1), (C, 0) and (D, 3) When set to the minimum value, all nodes except the root node of (A, 0) become nodes that can no longer grow. When the root node of the least cost is selected and the depth search is finished, the depth node is selected again by selecting the parent node that can grow. 9 (c) shows a form in which a child node is created from (A, 0) which is a root node. After creating the child nodes, when the minimum cost of each node is calculated, it becomes a node that can no longer grow because the minimum cost is greater than the value of the lowest child node of (A, 2), so that (A, 2), (B, 1), (C, 0), (D, 3) is the optimized result.

이상에서, 깊이 우선 검색은 깊이 탐색을 해가면서 노드가 늘어날 수 있는 경로인가를 확인하기 때문에 너비 우선 검색과 같이 자랄 수 없는 트리를 검색하기 위한 새로운 연결이 필요가 없다. 깊이 우선 검색은 먼저 루트를 만든 다음 해당 루트의 최소 비용의 루트를 선택한 후에 해당 루트에서 자식 노드를 만들고 자식 노드를 만든 각각의 자식 노드들이 또 자식 노드를 만드는 재귀화 방법을 이용하게 된다. 자식 노드를 만들기 전에 자식 노드가 만들어져야 하는 필요성의 검사가 필요하다. 이렇게 하기 위해서 자식 노드를 만들고 최소 비용을 계산한 다음 그 값이 지금의 현재 최소 비용보다 작고 그리고 현재 최소 비용을 가진 노드 보다 하위 노드일 경우 해당 노드를 최소 비용 노드로 설정하는 모듈이 우선된다.In the above, the depth-first search does not need a new connection to search a tree that cannot be grown, such as the breadth-first search, because the depth search verifies whether the node can grow in the depth search. Depth-first retrieval uses a recursion method that first creates a root, then selects the least cost root of that route, then creates a child node from that route, and each child node that creates a child node again. Before creating a child node, you need to check the need for the child node to be created. To do this, the module that creates the child node, calculates the minimum cost, and sets that node as the least expensive node takes precedence if its value is less than the current minimum cost and it is a lower node than the node with the current minimum cost.

앞에서 설명한 너비 우선 검색 알고리즘 및 깊이 우선 알고리즘이 적용되는 WBM은, 지금까지 네트워크 분야에서 최대 흐름 및 최대 흐름 정합쌍을 유추하는데 이용되어 왔으며, 영상 검색 시스템에서는 많이 이용되지 않았다. 그러나, 본 발명과 같이 형태 정보와 색상 정보를 포함하는 다중 정보를 이용한 검색일 경우, WBM 은 매우 효과적인 검색 결과를 도출해 낼 수 있다. 뿐만 아니라, WBM을 이용하면 최대 흐름도와 최대 흐름 정합쌍 두 개의 정보를 동시에 얻을 수 있다. WBM to which the breadth-first search algorithm and the depth-first algorithm described above have been applied has been used to infer the maximum flow and the maximum flow matched pair in the network field so far, and has not been widely used in the image search system. However, in the case of a search using multiple information including shape information and color information as in the present invention, the WBM can derive a very effective search result. In addition, with WBM, you can get information from two of the largest flow and maximum flow matched pairs simultaneously.

그러나, 영상 비교에서 있어서 유사도 여부만을 확인한다면 최대 흐름 정합쌍을 구할 필요가 없기 때문에 이 부분의 연산 부분을 제거할 수 있다면 WBM과 같은 성능을 가지면서도 매우 빠른 검색을 수행할 수 있게 된다. 따라서, 다중 특징 정합부(170)에서는 최대 흐름 정합쌍을 구하지 않고 영상의 유사도 여부만을 확인하는 NMFM 정합을 다중 특징 정합에 이용한다. 이 방법은 오차가 발생할 가능성이 높은 영역을 제거하고 최대 정합이 발생 할 수 있는 최상 단 노드쌍 경우의 흐름 값을 유추하여 대략적으로 최대 흐름을 계산해 내는 방법으로서, WBM과 같은 성능을 가지면서 속도가 빠른 장점을 가지고 있어, 실시간으로 영상을 검색하는 시스템에서 사용이 용이하다.However, it is not necessary to obtain the maximum flow matched pair if only checking the similarity in image comparison, so if the computation part of this part can be eliminated, it is possible to perform a very fast search while having the same performance as WBM. Therefore, the multi-feature matching unit 170 uses NMFM matching, which checks only the similarity of images without obtaining the maximum flow matching pair, for multi-feature matching. This method calculates the approximate maximum flow by eliminating the areas where error is likely to occur and by inferring the flow value of the uppermost node pair where maximum matching can occur. It has a fast advantage, making it easy to use in a system that searches for images in real time.

NMFM 정합의 개념을 살펴보면 다음과 같다. The concept of NMFM matching is as follows.

본 발명에 사용되는 NMFM 정합은, 영상 객체에서 다중 특징을 추출하고 비교할 때 통계적인 방법을 이용하여 문턱값을 추출하고 이 값을 이용하여 유사도 측정을 한다. 통계 적인 방법을 이용할 때는 많은 영상 데이터의 비교 실험을 통하여 영상간의 거리차 분포를 구한 다음, 대부분 같은 영상으로 구성되어 있는 경우의 특정 거리차 σ를 구하고, 질의 영상과의 비교 값이 σ이하 일 때 같은 영상으로 처리한다. NMFM 정합에서 사용되는 대부분의 영상 비교는 통계적인 방법을 이용하고 있으며, 판별을 요구할 때 통계의 의해 결정지어진 특정 문턱값으로 σ를 이용한다. 따라서, 본 발명에서는 WBM을 이용하여 영상을 비교할 때에도 같은 영상으로 판별하기 위해서는 실험을 통해 σ값을 구해야 한다. σ값이 없을 경우 WBM은 단순히 유사도가 큰 순서로 영상들을 정렬 할 수 밖에 없다. 영상 검색의 목적은 영상의 유사도 정렬이 아니고, 같은 영상 또는 유사도가 특정값 미만인 영상을 검색하는 것이다. 따라서 WBM을 이용하여 영상을 비교하는 경우에도 실험을 통해 σ 값을 구해야 한다.In the NMFM matching used in the present invention, when extracting and comparing multiple features in an image object, a threshold value is extracted using a statistical method and similarity is measured using this value. When the statistical method is used, the distribution of distance difference between images is obtained by comparing experiments with a large number of image data, and then a specific distance difference σ is obtained when most images are composed of the same image, and when the comparison value with the query image is σ or less. The same image is processed. Most of the image comparisons used in NMFM matching use a statistical method, which uses σ as a specific threshold determined by statistics when a discrimination is required. Therefore, in the present invention, when comparing images using WBM, the σ value must be obtained through experiments in order to distinguish the same image. If there is no σ value, WBM simply sorts the images in the order of similarity. The purpose of the image search is not to sort the similarity of the images, but to search for the same image or an image having a similarity less than a specific value. Therefore, even when comparing images using WBM, σ values should be obtained through experiments.

지금까지 특정 문턱값으로 σ를 이용하는 방법은, 영상의 비교 결과 후에 σ 값을 같은 영상인가 아닌가를 결정하는데 이용하였으나, 본 발명에서 제안되는 NMFM은 σ값을 예측 지수로 활용을 하는 특징을 가진다. 그리고, 문턱값 σ는 고정된 값이 아니라, 데이터의 누적에 따라 통계적으로 계속적으로 변화되는 값이며, 회로의 구성에 따라서는 사용자의 조절에 의해 임의로 소정의 범위 내에서 그 값을 조절할 수도 있다.Until now, the method of using sigma as a specific threshold has been used to determine whether the sigma value is the same image after the comparison result of the image, but the NMFM proposed in the present invention has the characteristic of using the sigma value as a prediction index. The threshold value σ is not a fixed value but a value that is continuously changed statistically according to the accumulation of data, and may be arbitrarily adjusted within a predetermined range by the user's adjustment depending on the configuration of the circuit.

특정 문턱값 σ는 영상의 유사도 분포를 이용하여 통계적인 방법으로 계산된다. 문턱값 σ를 구하기 위한 유사도 분포식은 아래의 [수학식6]과 같다.The specific threshold σ is calculated in a statistical manner using the similarity distribution of the images. The similarity distribution equation for obtaining the threshold value σ is as shown in Equation 6 below.

Di = Ws -WdDi = Ws -Wd

여기서, Di는 질의 영상인 Ws와 데이터베이스 영상인 Wd와의 거리차를 의미한다. 그리고, N(Di )는 총 n개의 영상을 모두 비교하였을 때 i 값의 거리차가 있는 수를 표시하고, 전체적인 분포는 N(Di ) 값의 총 합을 의미하며, 이 값이 S(D)이다. Here, Di means a distance difference between Ws, which is a query image, and Wd, which is a database image. N (Di) represents the distance difference of i value when all n images are compared, and the overall distribution is the sum of N (Di) values, which is S (D). .

도 10은 [수학식6]으로부터 획득된 동일 섹터 영상의 분포 곡선이다. 도 10에 도시된 문턱값 σ는, 실제 실험 후 표준 편차를 결정할 때 시스템에 따라서 달라질 수 있다. 예를 들어, 95%의 정확도를 요구하는 시스템일 경우, 같은 영상의 95%가 있는 영역의 끝점이 σ로 결정되며, NMFM에서 σ는 Bipartite Graph에서 더 이상 자라지 않는 노드 설정에 이용된다.10 is a distribution curve of the same sector image obtained from Equation (6). The threshold σ shown in FIG. 10 may vary from system to system when determining the standard deviation after the actual experiment. For example, in a system requiring 95% accuracy, the end point of the region where 95% of the same image is located is determined as σ. In NMFM, σ is used to set up a node that no longer grows in the Bipartite Graph.

NMFM에서는 N개의 에지를 설정하고, 각 에지의 최소 비용을 행렬로 설정한 후, 최소의 비용을 가지는 쌍을 구하게 되는데, 가장 최소의 흐름을 찾아내는 방법 은 [수학식7]과 같다. In NMFM, N edges are set, the minimum cost of each edge is set in a matrix, and a pair having the minimum cost is obtained. The method of finding the minimum flow is shown in [Equation 7].

M(x,y) = min(sigma -W_x'y' )M (x, y) = min (sigma -W _{x'y '} )

[수학식7]에서 M(x,y)는 최소 흐름을 가지는 쌍이고, W_x'y' 는 특정 섹터에 연결된 가중치 값들이다. 최소 흐름 쌍이 정해지게 되면 해당 쌍을 제거 한 후, [수학식7]을 이용하여 다음 쌍을 반복하여 구해 나간다. 이와 같이, NMFM는 최소 비용을 가지는 쌍이 구해지게 되면, 다음 총 흐름을 얻어내고, 이 흐름을 대략적 최대 흐름으로 보게 된다. NMFM 알고리즘에서 만일 전체 쌍의 가중치가 모두 σ값 보다 클 경우(즉, 어떠한 최소 흐름쌍도 존재하지 않는 경우에는), NMFM 대신 WBM을 이용하여 최적화 흐름을 유도한다. 이와 같은 NMFM의 전체 알고리즘을 살펴보면 다음과 같다.In Equation 7, M (x, y) is a pair having a minimum flow, and W _{x'y '} is weight values connected to a specific sector. Once the minimum flow pair is determined, remove the pair and then repeat the next pair using Equation (7). As such, the NMFM obtains the next total flow when the pair with the least cost is found and sees this flow as the approximate maximum flow. In the NMFM algorithm, if the weights of the entire pairs are all greater than σ (i.e. no minimum flow pairs exist), WBM is used instead of NMFM to derive the optimized flow. The overall algorithm of NMFM is as follows.

도 11은 도 1에 도시된 다중 특징 정합부(170)에서 수행되는, 본 발명의 바람직한 실시예에 따른 NMFM 정합 방법을 보여주는 흐름도이고, 도 12는 NMFM의 정합 예를 보여주는 도면이다. FIG. 11 is a flowchart illustrating an NMFM matching method according to a preferred embodiment of the present invention, which is performed by the multiple feature matching unit 170 illustrated in FIG. 1, and FIG. 12 is a diagram illustrating an example of matching NMFM.

도 11 및 도 12를 참조하면, 본 발명에 따른 NMFM 정합 방법은 먼저 Bipartite Graph에서 연결 가능한 모든 연결을 설정하고(7110 단계), 최소 비용을 얻는 쌍을 추출한다(7120 단계, 도 12의 (a) 참조). 이때 최소 비용을 얻는 쌍의 가중치와 σ의 차이 값을 계산한다(7130 단계). 여기서, σ는 통계적인 방법을 이용하여 계산된 변수로서, 많은 영상 데이터의 비교 실험을 통하여 영상간의 거리차 분포를 구한 후 대부분 같은 영상으로 구성되어 있는 경우로부터 얻어진 특정 거리차이다. 계속해서, 전체 쌍의 가중치가 모두 σ 보다 큰지의 여부(즉, 영상간의 거리차 분포가 σ의 범위를 벗어났는지의 여부)가 판별된다(7150 단계). 7150 단계에서의 판별 결과, 전체 쌍의 가중치가 모두 σ 보다 크면(즉, 영상간의 거리차 분포가 σ의 범위를 벗어났으면) WBM 정합을 수행하여 최적의 흐름과 정합쌍을 구한다(7200 단계). 그리고, 7150 단계에서의 판별 결과, 전체 쌍의 가중치가 모두 σ 보다 크지 않으면(즉, 영상간의 거리차 분포가 σ의 범위를 벗어나지 않았으면) 계속해서 연결 쌍을 구하게 된다(7300 단계, 도 12의 (b) 참조). 이 때, 7300 단계에서 구해지는 연결 쌍은 [수학식7]에 의해 결정되며, 오차의 범위 내에서 무시할 수 있는 에지의 계수를 설정하고, 순서적으로 해당 계수를 넘는 쌍은 무시하게 된다. 이어서, 해당 흐름 값을 연결된 에지의 값으로 나누어서 평균 흐름 값을 계산하여 대략적 최대 흐름을 구한다(7400 단계, 도 12의 (c) 참조).11 and 12, the NMFM matching method according to the present invention first establishes all connectable connections in the Bipartite Graph (step 7110), and extracts a pair that obtains the minimum cost (step 7120, FIG. 12A). ) Reference). In this case, the difference between the weight of the pair having the minimum cost and σ is calculated (step 7130). Here, σ is a variable calculated using a statistical method, and is a specific distance difference obtained from a case where most distances are composed of the same image after obtaining a distribution of distance differences between images through comparison experiments of many image data. Subsequently, it is determined whether the weights of all pairs are greater than σ (i.e., whether the distribution of the distance difference between images is out of the range of σ) (step 7150). As a result of the determination in step 7150, if the weights of all pairs are larger than σ (that is, the distribution of distance difference between images is out of the range of σ), WBM matching is performed to find an optimal flow and matched pair (step 7200). . As a result of the determination in step 7150, if the weights of all the pairs are not larger than σ (that is, the distribution of the distance difference between the images does not deviate from the range of σ), the connection pair is continuously obtained (step 7300, FIG. 12). (b)). At this time, the connection pair obtained in step 7300 is determined by Equation (7), sets a negligible edge coefficient within the error range, and sequentially ignores the pair over the corresponding coefficient. Then, the average flow value is calculated by dividing the flow value by the value of the connected edge to obtain an approximate maximum flow (step 7400, see FIG. 12C).

즉, 영상이 σ의 구간 안에 속할 경우에는 도 12의 7300 및 7400 단계와 같은 NMFM 정합을 수행하여 대략적 최대 흐름을 구하고, 영상이 σ의 범위를 벗어났을 경우 또는 σ구간에 있는 영상이 하나도 없을 경우에는 해당 영상을 동일하지 않은 영상으로 처리하게 된다. 이와 같은 특성에 의해 NMFM은 다음의 두 가지 경우에서 시간을 줄일 수 있다. 첫 번째는, σ값의 영역 내에 있는 영상이 없을 경우 더 이상 계산을 하지 않고 전혀 다른 영상으로 처리함으로써, 계산을 하지 않고 비교하면서 빠르게 연산할 수 있게 되는 경우이다. 그리고, 두 번째는, 모든 영상에서 거리차를 계산하여 정렬하는 경우로서, σ안에 있는 영상들에 대해서는 NMFM을 이용하여 연산을 수행하여 시간을 절약하고, 나머지 부분에 대해서는 WBM을 이용하여 연산을 수행하여 유사한 영상을 정렬할 수 있게 된다. That is, if the image falls within the interval of sigma, NMFM matching is performed as in steps 7300 and 7400 of FIG. 12 to obtain an approximate maximum flow, and if the image is out of the range of sigma, or if there are no images in the sigma range. In this case, the image is processed as an unequal image. This feature allows the NMFM to save time in two cases: In the first case, when there is no image in the region of σ value, the calculation is performed as a completely different image without any further calculation, so that the calculation can be performed quickly without comparison. Second, in the case of calculating and aligning the distance difference in all images, the operation saves time by performing an operation using NMFM for the images in σ, and performs the operation using WBM for the remaining parts. Similar images can be sorted.

그러나, 섹터영상을 에지로 두었을 경우, 변화가 큰 영상에서는 특정 빈에서 거리차가 매우 크게 나타난다. 이런 거리차의 값은 영상의 유사도에 큰 오차 값으로 나타난다. NMFM에서는 거리차가 매우 크게 나타나는 섹터 영상은 무시하고, 나머지 섹터만 가지고 최대 흐름을 유지한다. [수학식8]은 최대 흐름을 구하는 식이다.However, when the sector image is placed at the edge, the distance difference in a specific bin is very large in the image with a large change. The value of this distance difference appears as a large error value in the similarity of the image. The NMFM ignores sector images that show very large distance differences and maintains maximum flow with only the remaining sectors. [Equation 8] is to calculate the maximum flow.

여기서,

는 무시하고자 하는 섹터의 개수를 의미한다.here,

Denotes the number of sectors to be ignored.

도 11에서 알 수 있는 바와 같이, 본 발명에 따른 영상 검색 방법에서는 전체 쌍의 가중치가 모두 σ 보다 큰지 여부를 판별하여, 전체 쌍의 가중치가 모두 σ 보다 큰 경우에는(즉, σ값의 영역 내에 속하는 영상들이 존재하지 않는 경우) WBM을 수행하여 최적 흐름을 구하고, σ 값의 영역 내에 속하는 영상들이 존재하는 경우에는 NMFM을 수행하여 대략적 최대 흐름을 구하게 된다. As can be seen from Fig. 11, the image retrieval method according to the present invention determines whether the weights of all pairs are greater than sigma, and if the weights of all pairs are greater than sigma (i.e., within the region of σ values). If there is no image belonging) WBM is performed to obtain an optimal flow. If there are images belonging to the region of σ, NMFM is performed to obtain an approximate maximum flow.

이러한 NMFM과 WBM 간의 큰 차이점은, 정합쌍을 추출하는가와 그렇지 않은가에 있다. WBM은 최대 흐름도와 함께 최대 흐름도가 이루어 질 때의 정합쌍도 동시에 구한다. 그러나 NMFM은 최대 흐름도를 추출하지만 정합쌍을 구하지는 않는다. 영상 정합의 목적은 정합쌍을 찾는 것이 아니고, 최대흐름도를 구하는 데 있다. 정 합쌍을 찾는 불필요한 연산을 제거하고 WBM과 같은 흐름도를 예측하기 때문에 WBM 보다 연산 복잡도가 매우 작다. N개의 정점을 WBM 알고리즘을 이용하면 가장 빠른 방법이 n-1개의 순회를 하는 것이다. NMFM의 n-1개의 순회 중 σ값보다 작거나 같은 흐름도가 나오면 더 이상 순회를 할 필요가 없기 때문에 NMFM은 언제나 WBM의 연산과 같거나 작게 된다. 따라서 NMFM의 연산은 WBM보다 빠르다. 아래의 실험 결과에서 자세히 설명되겠지만, NMFM은 실제 실험 시 WBM보다 2배정도 빠르게 나타난다.The big difference between these NMFMs and WBMs is whether or not to extract matched pairs. In addition to the maximum flow chart, WBM also finds matching pairs when the maximum flow chart is achieved. However, NMFM extracts the maximum flow diagram but does not find a matched pair. The purpose of image matching is not to find matching pairs, but to find the maximum flow chart. The computational complexity is much smaller than that of WBM because it eliminates unnecessary operations for finding matched pairs and predicts flow diagrams like WBM. Using the WBM algorithm for N vertices, the fastest way is to do n-1 traversals. If the flow chart is less than or equal to σ of n-1 traversals of the NMFM, the NMFM is always equal to or less than the operation of the WBM because no further traversal is necessary. Therefore, the operation of NMFM is faster than WBM. As will be explained in detail in the experimental results below, NMFM appears twice as fast as WBM in actual experiments.

이 같은 특성을 가지는 WBM 및 NMFM 알고리즘이 어떻게 캐릭터 영상 검색에 적용되는지를 살펴보기로 하자.Let's look at how the WBM and NMFM algorithms with these characteristics are applied to character image retrieval.

WBM을 캐릭터 영상 검색에 이용하기 위해서는 비교하고자 하는 정보를 에지로 설정해야 한다. 캐릭터 검색 시스템에서 영상의 정보를 n개의 섹터로 나눈다. n개의 섹터로 나누고 해당 섹터의 색상 히스토그램 정보가 각 섹터의 가중치 값이 된다. 각 섹터의 가중치는 R, G, B 3개의 값이 되며 이 3개의 값을 포함한 하나의 섹터가 WBM의 에지가 된다. 따라서 WBM의 에지에 비교하고자 하는 정보가 3개의 영역 값이 된다. In order to use WBM to search for a character image, information to be compared should be set as an edge. The character search system divides the information of the image into n sectors. It is divided into n sectors, and color histogram information of the corresponding sector becomes a weight value of each sector. Each sector has a weight of R, G, and B, and one sector including these three values becomes the edge of the WBM. Therefore, the information to be compared to the edge of the WBM becomes three area values.

하나의 섹터에 기여되는 색상 값들은 R, G, B 의 가중치로 설정해야 한다. 가중치를 설정하지 않으면 R 값이 큰 섹터 이미지와 G 값이 큰 섹터 이미지가 있고, 섹터의 전체 값이 같을 경우 색상의 크기는 같게 나오기 때문이다. [수학식9]는 R, G, B 의 각각의 전체 값을 구하는 식으로서, [수학식9]를 수행하기 이전에 도 1에서 설명된 전처리부(110)에서 섹터 영상에 대한 색상 히스토그램이 얻어지 고, 그것에 대한 양자화가 선행된다. Color values contributing to one sector should be set with weights of R, G, and B. This is because if the weight is not set, there is a sector image having a large R value and a sector image having a large G value. If the total values of the sectors are the same, the color sizes are the same. [Equation 9] is a formula for obtaining the total value of each of R, G, and B. Before performing Equation 9, the color histogram of the sector image is obtained by the preprocessor 110 illustrated in FIG. This is followed by quantization.

[수학식9]에서,

는 섹터에 있는 R, G, B 색상의 크기 값이고,

는 각각의 빈에 해당하는 번호이며,

는 해당 번호의 히스토그램값이다. 양자화된 히스토그램에서 각각의 히스토그램의 빈의 번호와 해당 빈의 히스토그램 수를 곱하게 되면 섹터 안에 있는

값을 얻게 된다. 이 값은 섹터 이미지에 있는 각 색상의 크기이다. [수학식10]은 이 크기의 비율을 값을 얻기 위한 수학식이다.In Equation 9,

Is the magnitude value of the R, G, and B colors in the sector,

Is the number corresponding to each bin,

Is the histogram of the number. In a quantized histogram, multiply the number of bins in each histogram by the number of histograms in that bin.

You get a value. This value is the size of each color in the sector image. [Equation 10] is an equation for obtaining a value of this size ratio.

여기서,

는 R, G, B의 각각의 가중치 값이고,

는 [수학식10]에서 얻어진 R, G, B의 수치이며,

는

의 값을 모두 더한 수치가 된다. 이 경우, [수학식10]에서 얻어진 각각의 가중치 값을 이용하여 거리차를 구한 다음 해당 거리차에 가중치 값을 곱해야만 기여되는 색상의 거리차를 얻을 수가 있다.here,

Is a weight value of each of R, G, and B,

Is a numerical value of R, G, B obtained from [Equation 10],

Is

Is the sum of all the values of. In this case, the distance difference can be obtained by calculating the distance difference using each weight value obtained in [Equation 10] and then multiplying the distance difference by the weight value.

WBM을 이용하여 하나의 에지에서 다른 에지로의 최소 비용을 구하고자 할 경우 히스토그램의 거리차를 구해야 한다. WBM에서 하나의 에지에 여러 개의 값을 구성할 수는 있으나 경로가 형성될 때 해당 경로의 소요 비용은 한 개의 값으로 처리 되어야 한다. 영상에서의 색상 거리차는 R, G, B 3개의 거리차로 나타나지고 이렇게 나타난 거리차를 하나의 수치로 전환해야 한다. 각 섹터의 히스토그램을 이용한 거리차를 계산하는 방법은 다음과 같다.If you want to find the minimum cost from one edge to another using WBM, you need to find the distance difference of the histogram. Although multiple values can be configured on one edge in WBM, the cost of the path should be treated as one value when the path is formed. The color distance difference in the image is represented by three distance differences of R, G, and B, and this distance difference must be converted into one value. The method of calculating the distance difference using the histogram of each sector is as follows.

[수학식11]에서 보여 주듯이 각각의 히스토그램값을 이용하여 거리차를 구하면 R, G, B 각각의 거리차가 나오게 된다. WBM을 이용하기 위해서는 이 각각의 값을 가지고 하나의 가중치 값을 유도해야 한다. As shown in [Equation 11], when the distance difference is obtained by using the histogram value, the distance difference of each of R, G, and B comes out. In order to use WBM, one weight value must be derived with each of these values.

[수학식12]는 하나의 가중치 값을 유도하는 식이다.Equation 12 is a formula for deriving one weight value.

[수학식12]에서 W 값은 각 섹터 이미지의 가중치가 되고 해당 가중치를 이용하여 정합을 할 수가 있다. In Equation 12, the W value becomes a weight of each sector image and can be matched using the weight.

영상의 비교에 WBM을 사용할 경우, 깊이 우선 검색이 너비 우선 검색에 비해 좀 더 유리하다. 그 이유는 하나의 노드로부터 최소 비용을 계산할 때 거리차를 이용하는데 영상에서 거리차를 이용할 때는 R, G, B 의3개의 영역의 히스토그램의 거리차와 그리고 각각의 가중치 값을 곱해서 사용하기 때문에 최소 거리차를 계산하는데 있어 사용하는 정보가 단일정보가 아니기 때문이다. 따라서 하나의 섹터의 거리차가 최소화되는 부분이 같은 이미지로 볼 수가 있기 때문에 특정 섹터의 값을 선택하고 나머지 부분에서 최소의 수치를 보았을 때 결과는 최소 비용 흐름과 유사하게 나온다. WBM은 트리를 이용하여도 그 복잡도가 큰 편에 속한다. 영상에서의 특징을 이용하고 깊이 우선 검색 알고리즘이 너비 우선검색 알고리즘 보다 속도 성능이 뛰어나다는 부분을 이용하면 좀더 빠른 알고리즘을 개발할 수 있으며 이 알고리즘을 이용하여 실시간 영상 검색에 이용할 수 있다.When using WBM to compare images, depth-first search is more advantageous than width-first search. The reason is to use the distance difference when calculating the minimum cost from one node. When using the distance difference in the image, the distance difference of the histogram of the three areas of R, G, and B is multiplied by each weight value and the minimum value. This is because the information used to calculate the distance difference is not single information. Therefore, the part where the distance difference between one sector is minimized can be viewed as the same image. When selecting a value of a certain sector and looking at the minimum value in the remaining part, the result is similar to the minimum cost flow. WBM is also one of the more complex using trees. By using the features in the image and using the part where the depth-first search algorithm is faster than the width-first search algorithm, a faster algorithm can be developed and used for real-time video search.

WBM을 사용할 경우 최대 흐름 값을 유추하기 위한 목적과 최대 흐름이 이루어지도록 최적화된 에지의 연결이 목적인 두 가지 형태로 나누어진다. 본 발명에서는 최대 흐름의 값을 유추하는 것이 목적이다. 대략적 최대 흐름을 유도하여도 영상의 거리차를 구할 수가 있기 때문에 정확한 정합을 유도 할 필요가 없다. NMFM은 두 갈래 그래프에서 최대 흐름을 계산하는 것이 아니라 대략적 최대 흐름 값을 계산하게 된다. 온전한 정합(complete matching)을 만들기 위해서 트리 형태의 노드를 이용하여 계산을 하는데 이 트리는 부모 노드의 흐름이 자식 노드의 흐름보다 크지 않고 자식 노드의 흐름보다 작은 부모 노드는 최대 흐름에서 제거되는 특징을 가지고 있다. NMFM의 이 특징을 이용하여 트리의 부모에서 미리 대략적 흐름을 예측하고 예측된 결과의 최대 흐름 값을 얻어내는 것이다. WBM 방법은 최적의 쌍을 유도하나, 최적의 쌍은 본 시스템에서 무의미하기 때문에 빠르게 동작하는 시스템이 필요하다. 캐릭터 영상에서는 WBM과 그 성능이 같은 형태로 나타났다. 데이터베이스 영상 외에 캐릭터 검색은 다양한 영역에서 이용된다. 동영상에서 실시간으로 캐릭터 검색을 할 경우 매우 빠른 정합 방법을 사용해야 한다. 본 발명에서 제안하는 NMFM은 예측값을 이용하여 WBM의 검색 단계를 모두 탐색하지 않고 탐색 중간에 결정을 내리기 때문에 WBM 보다 약 1/2의 연산량이 줄어든다. 영상 데이터베이스에서는 유사도가 비슷한 많은 객체 영상이 있으나 동영상 안에서는 그렇지 않다. 따라서, 실시간 데이터의 검색을 위해서는 NMFM을 사용하여 검색을 하는 것이 용이하다.When using WBM, it is divided into two types: the purpose of inferring the maximum flow value and the connection of the edges optimized for the maximum flow. It is an object of the present invention to infer the value of the maximum flow. Even if you derive the approximate maximum flow, the distance difference of the image can be obtained, so it is not necessary to derive the exact matching. NMFM does not calculate the maximum flow in a two-pronged graph, but rather the approximate maximum flow value. In order to make complete matching, the calculation is done using tree-type nodes, in which the parent node's flow is not larger than the child node's flow and the parent node's smaller than the child node's flow is removed from the maximum flow. have. This feature of the NMFM is used to predict the approximate flow in advance of the tree's parent and to obtain the maximum flow value of the predicted result. The WBM method derives the optimal pair, but because the optimal pair is meaningless in this system, a fast operating system is needed. In character image, WBM and its performance appeared in the same form. In addition to the database image, character search is used in various areas. When searching for a character in real time in a video, you must use a very fast matching method. The NMFM proposed in the present invention reduces the calculation amount of about 1/2 of the WBM because the NMFM makes a decision in the middle of the search without searching all the search steps of the WBM. There are many object images with similarity in the image database, but not in the image. Therefore, it is easy to search using NMFM to search for real-time data.

영상 검색 시스템의 검색 기능은 유사도에 따라 정렬할 경우와 정렬하지 않을 경우의 두 가지 동작으로 구분될 수 있다. 영상을 유사도에 따라 정렬할 경우에는, 비교하는 영상의 거리차를 모두 설정해주어야 한다. 그리고, 정렬할 필요가 없이 유사한 영상만 검색하고자 할 경우에는, NMFM을 사용하고 σ값의 영역 안에 있는 섹터가 있는가를 확인한다. 그리고 나서 σ영역 안에 있는 섹터가 없을 경우 서로 다른 영상으로 간주하게 된다. 그러나, 유사도 정렬을 할 경우에는 서로 같지 않은 영상일 경우에도 거리차 값을 설정해주어야 한다. The search function of the image retrieval system can be divided into two operations, one for sorting according to the similarity and one for not sorting. When the images are sorted according to the similarity, all distance differences of the images to be compared should be set. In the case of searching only similar images without the need for sorting, NMFM is used and it is checked whether there are sectors in the area of? Value. Then, if there are no sectors in the σ area, they are regarded as different images. However, in the case of similarity alignment, the distance difference value should be set even when the images are not the same.

도 13은 도 1에 도시된 다중 특징 정합부(170)에서 유사도 정렬을 위해 NMFM를 사용하는 방법을 설명하기 위한 도면이다. FIG. 13 is a diagram for describing a method of using NMFM for similarity alignment in the multiple feature matching unit 170 shown in FIG. 1.

도 13을 참조하면, 다중 특징 정합부(170)가 영상의 섹터들을 비교한 후에 비교 결과 값의 행렬 안에 σ값 영역 안에 있는 섹터가 하나라도 있게 되면, 해당 영상은 NMFM의 방법을 그대로 사용하게 된다. 도 13에서 상단에 있는 행렬 데이터는 σ영역 안의 데이터가 존재하는 형태이다. 이럴 경우 NMFM 의 값을 그대로 이용하여 거리차(W=1.64)가 나오게 된다. 도 13의 하단부에 있는 행렬 데이터는 σ영역 안에 값이 하나도 존재하지 않는 경우이다. 이 경우 WBM으로 들어가는 첫 단계를 시도하여 가능성 있는 최소비용값을 구하고, 동시에 NMFM에서 σ를 생각하지 않고 최소 비용의 값 순서대로 쌍을 연결하고 해당 값과의 차이를 구했을 때 평균 표준 편차 안에 있으면 해당 값을 최소 비용으로 간주한다. 그렇지 않을 경우 WBM을 이용하여 최소 비용을 얻어낸다. Referring to FIG. 13, when the multi-feature matching unit 170 compares sectors of an image and there is at least one sector in the sigma value region in the matrix of comparison results, the image uses the NMFM method as it is. . The matrix data at the top of FIG. 13 is in the form of data in the sigma region. In this case, the distance difference (W = 1.64) comes out by using the value of NMFM. The matrix data in the lower part of FIG. 13 is a case where no value exists in the sigma region. In this case, try the first step into WBM to find a possible minimum cost value, and at the same time, if the NMFM does not think about σ, connect the pairs in order of the minimum cost, and find the difference between them, if they are within the mean standard deviation. Consider the value as the minimum cost. If not, use WBM to get the minimum cost.

이하에서는 본 발명에 따른 영상 검색 시스템(100)에 대한 실험 결과 및 그것의 성능이 설명될 것이다. 설명에 앞서 영상 검색 시스템(100)의 성능을 평가하는 평가 척도에 대해 살펴보면 다음과 같다.Hereinafter, the experimental results and the performance thereof for the image retrieval system 100 according to the present invention will be described. Before describing the evaluation scale for evaluating the performance of the image retrieval system 100 as follows.

내용기반 검색 시스템에서 평가 척도는 시스템의 정확성을 알려주는 중요한 지표가 된다. 본 발명에서는 캐릭터 검색에 있어서 평가 척도를 BEP(Bulls Eye Performance), NMRR(Normalized Modified Retrieval Rank), 그리고 NDS(Normalized Distance Sum)가 이용된다. In content-based retrieval systems, evaluation scales are important indicators of the accuracy of the system. In the present invention, in the character search, evaluation scales are used as BEP (Bulls Eye Performance), NMRR (Normalized Modified Retrieval Rank), and NDS (Normalized Distance Sum).

먼저, BEP는 MPEG-7의 모양 및 움직임 서술자의 평가 척도로서, 검색 영상에서 검색 영상과 같은 영상이 들어있는 전체 수의 2배수 안에 영상이 들어와 있는 백분율 값을 의미한다. 아래의 [수학식13]은 BEP의 계수를 구하는 식이다.First, the BEP is an evaluation measure of the shape and motion descriptor of MPEG-7, and means a percentage value in which an image is contained within twice the total number of images including the search image in the search image. Equation 13 below calculates the coefficient of BEP.

[수학식13]에서 NR(q)는 2 배수 안에 들어와 있는 같은 영상 이미지의 수이고, NG(q)는 2배수의 전체 이미지 수이다. 예를 들어, 10개의 같은 영상이 존재할 때 BEP 평가 척도에 1에 근사하기 위해서는 20개의 영상 안에 모든 영상이 존재해야 한다.In Equation (13), NR (q) is the number of the same video image within 2 multiples, and NG (q) is the total multiple number of multiple images. For example, when 10 identical images exist, all images must exist within 20 images to approximate 1 to the BEP evaluation scale.

계속해서, NMRR은 MPEG-7의 색상 및 질감 서술자의 평가 척도로서, B. Kroepelien, A. Vailaya 및 A. K. Jain에 의해, 1996년, Proc. of IEEE Int. Conf. on Pattern Recognition, vol. 3, pp. 370-374에 실린 논문, "Image database: A case study in Norwegian silver authentication" 등에 NMRR의 개념 등이 개시되어 있다. NMRR은 전체 영상 NG(q)에서 질의된 영상과 같은 영상을 NR(q)라고 하였을 때, 순서적으로 보았을 때 얼마나 차례대로 질의된 영상과 같은 영상이 들어오는 가를 계산하는 방식이다. BEP의 척도는 동영상에서 많이 쓰이는 척도이기 때문에 순서적인 값이 기록되지 않지만 NMRR은 정지 영상의 패턴 및 형태를 검색하는 부분이기 때문에 얼마나 많은 영상들이 찾은 영상의 순서에서 앞에 있는가를 이용하여 평가 척도를 설정한다. NMRR의 평가 척도는 [수학식14]와 같다.Subsequently, NMRR is a measure of the color and texture descriptors of MPEG-7, by B. Kroepelien, A. Vailaya and A. K. Jain, 1996, Proc. of IEEE Int. Conf. on Pattern Recognition, vol. 3, pp. The papers in 370-374, "Image database: A case study in Norwegian silver authentication," and the like describe the concept of NMRR. NMRR is a method of calculating how the same video as the queried image is sequentially received when the NR (q) is the same as the image queried in the entire image NG (q). Since BEP is a commonly used measure of video, no ordered values are recorded, but since NMRR is a part of searching for patterns and shapes of still images, the evaluation scale is set based on how many images are in front of the sequence of images found. . The evaluation scale of NMRR is shown in [Equation 14].

[수학식14]에서 GTM은 데이터베이스 영상에서 모든 질의 영상에서 max{MG(q)} 값을 의미한다. In Equation 14, GTM means max {MG (q)} value in all query images in the database image.

그리고, NDS는 2000년, 서창덕, 김회율에 의해 한국방송공학회 발명지, 제5권 제1호, pp. 68-81에 실린 논문, "영상 데이터 베이스에서의 의미 기반 유사 상표 검색 및 새로운 검색 효율 평가 척도"에서 제안한 평가 척도로써, 0으로 근접할수록 좋은 성능을 나타내며 평가 조건을 모두 고려하고 세밀한 측정이 가능하다는 장점이 있다. In 2000, the NDS was published by Seo Chang-deok and Kim Ho-ryul in the Journal of the Korean Society of Broadcast Engineers, Vol. 5, No. 1, pp. The evaluation scale proposed in the paper in 68-81, "Meaning-based Similar Trademark Search and New Search Efficiency Evaluation Scale in Image Databases," which shows that the closer to 0, the better the performance, and all the evaluation conditions are considered. There is an advantage.

본 발명에 대한 실험은, 39개의 같은 영상의 그룹으로 이루어진 500개의 영상과 그룹화되어 있지 않는 2500개의 영상을 이용하여 수행되었다. 먼저, 그룹화된 영상만을 이용하여 다른 그룹의 영상들과 비교 검색하였고, 그룹화되어 있지 않는 다양한 영상 2500개를 그룹화되어 있는 영상과 함께 결합시켜 임의로 비교하여 얼마나 많은 같은 영상을 찾는가를 알아보았다.Experiments with the present invention were performed using 500 images consisting of 39 groups of 39 images and 2500 images not grouped. First, we compared and searched the images of other groups using only the grouped images, and then examined how many identical images were found by randomly comparing 2500 ungrouped images with the grouped images.

그룹화된 영상들은 크게 세 개의 파트로 나누어져 있다. 색상이 단색 계통으로 비슷한 형태의 영상 객체, 다양한 색상이 결합된 영상 객체, 새로운 형태가 부과된 영상 객체이다. 실험에서는 39개의 그룹에서 3개의 그룹을 각각 13개씩 두었는데, 이는 캐릭터 검색 시스템에서 3가지 특징을 가지고 있는 영상 객체를 이용하여 제안된 시스템과 기존의 시스템의 성능을 보다 정확하게 평가하기 위함이다. 그리고, 그룹화된 영상에서만 실험을 했을 경우 정확한 정보를 얻지 못할 수도 있기 때문에 그룹화 되어있지 않는 2000의 일반 영상을 혼합하였다.Grouped images are divided into three parts. It is a video system with a similar color with a single color system, a video object with various colors combined, and a video object with a new form. In the experiment, three groups of 39 groups were placed in each of 13 groups. This is to evaluate the performance of the proposed system and the existing system more accurately by using image objects having three characteristics in the character retrieval system. In addition, since the experiment may be impossible to obtain accurate information when the experiment is performed only on the grouped images, 2000 general images that are not grouped are mixed.

실험에서, 컴퓨터 시스템은 Pentium III 800 Mz를 이용하였으며 운영체제로는 프로텍트모드 32비트 Windows XP를 사용하였다. 언어는 C/C++를 사용하였고 GUI 결과를 유추하기 위해서 Visual C++와 MFC를 사용하였다. 실험에 필요한 검색 알고리즘은 모두 직접 제작하였으며 성능 평가를 정확하게 하기 위해서 특수한 하드웨어 가속기를 사용하지 않고 표준 어플리케이션 레벨에서 동작하도록 하였다.In the experiment, the computer system used Pentium III 800 Mz and protected mode 32 bit Windows XP. The language used C / C ++ and Visual C ++ and MFC to infer the GUI results. All the search algorithms required for the experiment were fabricated by hand and operated at the standard application level without using special hardware accelerators for accurate performance evaluation.

본 발명에 대한 성능 평가 결과는 [표 1] 및 도 14 내지 도 16과 같다.Performance evaluation results for the present invention are shown in [Table 1] and 14 to 16.

본 실험은 Swain의 색상 색인 방법과 제안되는 알고리즘인 WBM 및 NMFM과 Gevers가 제안하는 형태와 색상의 결합 실험 평가 결과로서, [표 1]은 4개의 알고리즘(즉, 전역 색상 색인, WBM, NMFM(Swain의 색상 색인), PicToSeek(Gevers의 제안한 형태와 색상 정보 결합)) 을 평가한 결과이고, 도 14 내지 도 16은 각각 [표 1]에 표시된 4개의 방법들에 대한 성능 평가 그래프이다. NMRR과 NDS는 수치가 작을수록 성능이 좋은 것이며, BEP는 1에 가까울수록 성능이 좋은 것이다. This experiment is the result of Swain's color indexing method and the proposed algorithm, WBM and NMFM and Gevers's combination and color combination test results. [Table 1] shows four algorithms (ie, global color index, WBM, NMFM ( Swain's color index) and PicToSeek (Gevers' proposed form and color information combination) are evaluated, and FIGS. 14 to 16 are performance evaluation graphs for the four methods shown in Table 1, respectively. The smaller the numbers, the better the NMRR and NDS. The closer to 1, the better the BEP.

[표 1] 및 도 14 내지 도 16에서 알 수 있는 바와 같이, WBM과 NMFM은 각각 비슷한 결과를 유출하면서 성능이 좋게 나왔으며, PicToSeek가 그 다음으로 Swain의 방법은 가장 낮은 성능 결과를 보였다. 전체적인 색상을 볼 때 Swain의 방법은 색상의 변화가 많은 부분에서 성능이 저하되는 결과를 보여 주었다. Gevers의 방법은 Swain의 방법보다는 성능이 좋게 결과가 나왔으나 형태 정보를 기하 정보를 이용하였기 때문에 캐릭터 영상에서 형태가 변하게 되면 기하 정보의 비교에서 거리차가 크게 나기 때문에, 본 발명에서 제안하고 있는 검색 방법 보다 성능이 떨어진다. As can be seen from Table 1 and FIGS. 14 to 16, WBM and NMFM yielded good performances, respectively, with similar results, and PicToSeek followed Swain's method with the lowest performance. In terms of overall color, Swain's method has shown poor performance in areas with many color variations. Gevers's method has better performance than Swain's method, but since the shape information is used as geometric information, when the shape is changed in the character image, the distance difference is large compared to the geometric information. The performance is lower.

도 17은 GHM, WBM, 및 NMFM의 양자화 레벨의 수에 따른 성능 평가 그래프이 다.17 is a performance evaluation graph according to the number of quantization levels of GHM, WBM, and NMFM.

도 17을 참조하면, 색상 히스토그램을 양자화 할 때 빈의 수가 문턱값 레벨 이상이 될 경우에 검색의 성능을 보장하였으며, 색상을 이용한 모든 검색 방법들이 최소 8빈 이상일 경우에 최적화된 성능을 유지하였다. GHM의 경우가 WBM과 NMFM 보다 빈의 수가 약간 적어도 최적화된 성능이 나오는 결과가 보였는데, 이 이유는 GHM은 섹터분할이 없이 색상 히스토그램을 만들었을 때 단색으로 되어있는 캐릭터 영상에 효과적으로 적응된 결과 때문이다.Referring to FIG. 17, the performance of the search is guaranteed when the number of bins exceeds the threshold level when quantizing the color histogram, and the optimized performance is maintained when all search methods using the color are at least 8 bins. In the case of GHM, the results show that the number of bins is at least slightly better than WBM and NMFM. The reason is that GHM effectively adapts to the image of characters that are monochromatic when creating color histogram without sector division. to be.

WBM과 NMFM의 검색 속도 성능 평가는 아래의 [표 2] 및 도 18과 같다.Search speed performance evaluation of WBM and NMFM is shown in Table 2 below and FIG. 18.

도 18은 본 발명에 따른 영상 검색 시스템(100)에 대한 WBM과 NMFM의 검색 속도 성능 평가 그래프이다. 18 is a graph illustrating a search speed performance evaluation of WBM and NMFM for the image search system 100 according to the present invention.

[표 2] 및 도 18에서 알 수 있는 바와 같이, 본 발명에 의하면 NMFM이 WBM에 비해 약 2배정도 빠른 영상 검색 속도를 보임을 알 수 있다. 100개의 영상을 비교할 때는 속도가 비슷하게 나온 이유는 시스템에서 측정할 수 있는 최소치가 나오기 때문이다. 적은 영상의 수일 경우에는 속도 측정에서 비슷하게 나오나 많은 영상을 비교할 때는 그 속도의 차이가 현저하게 나타난다. 동영상에서 캐릭터 영상을 검색 하고자 할 때는 초당 30프레임의 영상을 검색하여야 한다. 이때 선처리 작업으로 영상에서 검색하고자 하는 객체를 분할하여 추출하는 작업이 들어가고 이후에 검색작업이 들어가기 때문에 빠른 속도의 검색이 필수적이다. 이럴 경우 NMFM이 WBM보다 속도가 빠르기 때문에 적용 성능이 뛰어나게 된다.As can be seen from Table 2 and FIG. 18, it can be seen that according to the present invention, NMFM shows an image search speed about 2 times faster than that of WBM. The reason for the similar speed when comparing 100 images is that there is a minimum that the system can measure. In the case of a small number of images, the speed measurement is similar, but when comparing a large number of images, the speed difference is remarkable. If you want to search for the character image in the video, you have to search for 30 frames per second. At this time, as the pre-processing process involves dividing and extracting the object to be searched from the image, and afterwards, the search process is required. In this case, NMFM is faster than WBM, so the application performance is excellent.

이상에서, 본 발명에 따른 회로의 구성 및 동작을 상기한 설명 및 도면에 따라 도시하였지만 이는 예를 들어 설명한 것에 불과하며 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 다양한 변화 및 변경이 가능함은 물론이다. In the above, the configuration and operation of the circuit according to the present invention are shown in accordance with the above description and drawings, but this is merely described, for example, and various changes and modifications are possible without departing from the spirit of the present invention. .

특히, 본 발명에 따른 영상 검색 시스템(100)은, 캐릭터 영상의 다중 특징 정보를 이용하여 동일한 캐릭터를 찾아내는 것에 대해 설명하고 있으나, 이는 다중 특징 정보를 이용한 검색 시스템의 일 예에 불과하며, 다량의 정보가 저장되어 있는 데이터베이스로부터 특정 정보를 검색하는 어떠한 검색 시스템에도 본 발명을 적용할 수 있다. 예를 들어, 다수의 학생들에 대한 성적이 저장된 데이터베이스에서 특정 조건(특히, 복수 개의 조건에 부합된 다중 특징)을 만족하는 학생을 검색하고자 할 때, 본 발명을 적용하면 보다 짧은 시간에 보다 정확한 검색 결과를 얻을 수 있다. 이 경우, 상기 응용 예는 검색하고자 하는 객체를 n개의 섹터(예를 들면, 각 과목별 성적)로 분할하고, 통계적인 방법을 이용하여 얻어진 문턱값 σ을 이용하여 최적 유사도를 측정할 수 있게 된다. 이 경우, 검색에 사용되는 정보가 다중 특성 정보이므로, 단일 특성 정보를 검색 할 때에 비해서 원하는 데이터를 보다 다양하게 검색할 수 있으며, 본 발명에서는 다중 특성 정보의 검색에 적합하도록 NMFM 방식을 사용하기 때문에, 보다 빠르고 정확한 검색 결과를 얻을 수 있게 된다.In particular, the image retrieval system 100 according to the present invention has been described for finding the same character by using the multi-characteristic information of the character image, but this is only an example of a retrieval system using the multi-feature information, a large amount of The present invention can be applied to any retrieval system for retrieving specific information from a database in which information is stored. For example, if you want to search for students who meet certain criteria (especially multiple features that match multiple criteria) in a database where scores for multiple students are stored, applying the present invention can provide more accurate search in a shorter time. You can get the result. In this case, the application example divides the object to be searched into n sectors (eg, grades for each subject), and measures the optimal similarity using the threshold value σ obtained using a statistical method. . In this case, since the information used for the search is the multi-characteristic information, the desired data can be searched more diversely than when searching for the single-characteristic information. Since the present invention uses the NMFM method to be suitable for the search of the multi-characteristic information. The result is faster and more accurate search results.

본 발명은 또한 컴퓨터로 읽을 수 있는 기록매체에 컴퓨터가 읽을 수 있는 코드로서 구현하는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록매체는 컴퓨터 시스템에 의하여 읽혀질 수 있는 데이터가 저장되는 모든 종류의 기록장치를 포함한다. 컴퓨터가 읽을 수 있는 기록매체의 예로는 ROM, RAM, CD-ROM, 자기 테이프, 플로피디스크, 광데이터 저장장치 등이 있으며, 또한 캐리어 웨이브(예를 들어 인터넷을 통한 전송)의 형태로 구현되는 것도 포함한다. 또한 컴퓨터가 읽을 수 있는 기록매체는 네트워크로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 컴퓨터가 읽을 수 있는 코드로 저장되고 실행될 수 있다.The invention can also be embodied as computer readable code on a computer readable recording medium. The computer-readable recording medium includes all kinds of recording devices in which data that can be read by a computer system is stored. Examples of computer-readable recording media include ROM, RAM, CD-ROM, magnetic tape, floppy disk, optical data storage, and the like, and may also be implemented in the form of a carrier wave (for example, transmission over the Internet). Include. The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.

이상과 같은 본 발명에 의하면, 형태의 정보나 색상 정보 한가지를 이용하여 검색하기가 어려운 캐릭터 영상을 적어도 둘 이상의 특징 정보들이 복합된 다중 특징 정보를 이용하여 효과적으로 검색할 수 있으며, 캐릭터 영상에 대한 검색 속도 및 정확도가 개선된다. According to the present invention as described above, it is possible to effectively search for a character image that is difficult to search using one type of information or color information by using the multi-feature information combined with at least two feature information, and search for the character image. Speed and accuracy are improved.

Claims

A database in which a plurality of data are stored;

A data dividing unit dividing each of the plurality of data into a plurality of regions;

A quantizer configured to quantize a color histogram of the divided regions using a Parzen window configured to partially overlap the adjacent windows and a weight value of the color;

A feature extractor for extracting feature values for the regions from the quantization result; And

The threshold value is determined from a distribution of the distance difference between the query data and the similar data, and the similarity between the query data and the plurality of data is determined by detecting whether the feature value is within a range of the threshold value among the plurality of data. Including a multi-feature matching unit,

The multi-feature matching unit,

Establish all connectable connections in the Hungarian tree for the plurality of data; Extract the pair with the lowest cost in the connection; Calculate a difference between the threshold given to the weighted pair and the threshold; And if the calculation result does not deviate from a predetermined range, an approximate maximum flow is obtained by iteratively obtaining a pair having the minimum cost among the connections except the pair having the minimum cost.

The method of claim 1,

Each of the divided regions includes both color and morphological features of the data.

The method of claim 1,

The pazen window is

Consists of,

Is the original histogram level,

Is the histogram level to be quantized,

Are values of the histogram corresponding to the divided region,

Is a histogram value corresponding to the quantization result for the divided region, and Q is calculated using a weight value of color as a quantization level value.

The method of claim 1,

The multi-feature matching unit does not perform matching any more when data belonging to the range of the threshold value does not exist among the plurality of data stored in the database.

The method of claim 1,

The multi-feature matching unit searches for a similar image by performing weighted bipartite matching (WBM) when there is no data belonging to the range of the threshold value among the plurality of data stored in the database. Video retrieval system.

The method of claim 1,

The multi-feature matching unit searches for similar images by performing Near Maximum Flow Matching (NMFM) when there is data belonging to the range of the threshold value among the plurality of data stored in the database. Video retrieval system.

Dividing each of the plurality of data stored in the database into a plurality of regions;

Quantizing a color histogram of the divided regions using a Parzen window configured to partially overlap adjacent windows and a weight value of the color;

Extracting feature values for the regions from the quantization result; And

The threshold value is determined from a distribution of the distance difference between the query data and the similar data, and the similarity between the query data and the plurality of data is determined by detecting whether the feature value is within a range of the threshold value among the plurality of data. Including the steps of:

Determining the similarity,

Establishing all connectable connections in the Hungarian tree for the plurality of data; Extracting the pair with the lowest cost in the connection; Calculating a difference between a weight given to the pair of minimum cost and the threshold; And if the calculation result does not deviate from a predetermined range, obtaining an approximate maximum flow by iteratively obtaining a pair having the minimum cost among the connections except the pair having the minimum cost. .

The method of claim 7, wherein

The pazen window is

Consists of,

Is the original histogram level,

Is the histogram level to be quantized,

Are values of the histogram corresponding to the divided region,

The method of claim 7, wherein

The determining of the similarity may further include performing no matching when the calculation result is out of a predetermined range.

The method of claim 7, wherein

The determining of the similarity may further include performing weighted bipartite matching (WBM) to determine the approximate maximum flow and data pair when the calculation result is out of a predetermined range. How to search video.

A computer-readable recording medium having recorded thereon a program for executing the image retrieval method according to any one of claims 7 to 11.