KR100810002B1

KR100810002B1 - Normalization method for shape descriptors and image retrieval method using the same

Info

Publication number: KR100810002B1
Application number: KR1020010019229A
Authority: KR
Inventors: 김회율
Original assignee: 김회율
Priority date: 2001-04-11
Filing date: 2001-04-11
Publication date: 2008-03-07
Also published as: KR20020080073A

Abstract

1. 청구범위에 기재된 발명이 속한 기술분야1. TECHNICAL FIELD OF THE INVENTION

본 발명은 모양 기술자 계산을 위한 정규화 방법 및 그를 이용한 영상 검색 방법과, 상기 방법들을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 관한 것임.The present invention relates to a normalization method for calculating shape descriptors, an image retrieval method using the same, and a computer-readable recording medium recording a program for realizing the above methods.

2. 발명이 해결하려고 하는 기술적 과제2. The technical problem to be solved by the invention

본 발명은, 광범위한 영상 데이터베이스 내에서 사용자가 요구하는 특정 영상을 검색하는데 쓰였던 일반적인 모양 기술자가 영상내 잡음이나 한쪽 방향으로만 일그러짐, 원근법 적용 등 어파인 변환된 객체에 대해 취약하므로, 이를 보완하기 위해 어파인 변환에 불변한 모양 기술자를 얻을 수 있는 정규화 방법 및 그를 이용한 영상 검색 방법과, 상기 방법들을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공하고자 함.In order to compensate for this, the general shape descriptor used to search for a specific image required by the user in a wide image database is vulnerable to affine transformed objects such as noise in the image, distortion in one direction, and application of perspective. A normalization method capable of obtaining a shape descriptor invariant to affine transformation, an image retrieval method using the same, and a computer-readable recording medium recording a program for implementing the methods.

3. 발명의 해결방법의 요지 3. Summary of Solution to Invention

본 발명은, 영상 정규화 방법에 있어서, 영상내 객체의 이동, 회전, 크기 변화, 잡음, 또는 일그러짐으로 인한 변형시에, 객체 영상을 표준영상으로 변환 처리하여, 그 결과 영상에 여러 가지 모양 기술자를 적용시켜 어파인 변환에 불변한 모양 기술자를 얻되, 변형중 잡음, 이동, 크기 변화, 또는 일그러짐에 대해서 컴팩트 알고리즘을 적용하여 영상을 정규화하는 제1 영상 정규화 단계; 및 회전각도에 불변하게 유사한 모양의 객체를 추출하기 위해 상기 컴팩트 알고리즘을 거친후 저니키 기반 모멘트나 에이알(AR) 변환을 이용한 회전불변 모양기술자를 계산하여 영상을 정규화하는 제2 영상 정규화 단계를 포함함.In the image normalization method, an object image is converted into a standard image in a deformation due to movement, rotation, size change, noise, or distortion of an object in the image, and as a result, various shape descriptors are added to the image. A first image normalization step of applying a shape descriptor that is invariant to the affine transformation, and normalizing the image by applying a compact algorithm to noise, shift, size change, or distortion during deformation; And a second image normalization step of normalizing an image by calculating a rotation invariant shape descriptor using Jonikey-based moments or AR (AR) transforms after going through the compact algorithm to extract an object of invariably similar shape to the rotation angle. Included.

4. 발명의 중요한 용도4. Important uses of the invention

본 발명은 캐릭터 영상의 검색, 상표 검색 등에 이용됨.The present invention is used for character image search, trademark search, and the like.

모양 기술자, 영상 검색, AR 변환, 어파인 변환, 정규화, 컴팩트 알고리즘Shape Descriptor, Image Retrieval, AR Transform, Affine Transform, Normalization, Compact Algorithm

Description

Normalization method for shape descriptors and image retrieval using the same

도 1 은 본 발명이 적용되는 내용기반 영상 검색 시스템의 구성 예시도.1 is an exemplary configuration of a content-based image retrieval system to which the present invention is applied.

도 2 는 본 발명에 따른 영상 검색 방법에 대한 일실시예 흐름도.2 is a flowchart of an embodiment of an image retrieval method according to the present invention;

도 3 은 본 발명의 실시예에 따라 MPEG-7 국제 표준용 실험 데이터에 대한 성능 향상 실험 결과 예시도.
3 is a diagram illustrating performance improvement test results for experimental data for MPEG-7 international standard according to an exemplary embodiment of the present invention.

*도면의 주요 부분에 대한 부호의 설명* Explanation of symbols for the main parts of the drawings

10 : 영상 입력 장치 20 : 저장 장치10: video input device 20: storage device

30 : 질의 응답 장치
30: Q & A device

본 발명은 내용기반 영상 검색 시스템에서 모양 기술자 계산을 위한 정규화 방법 및 그를 이용한 영상 검색 방법과 상기 방법들을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 관한 것으로, 보다 상세하게는 내용기반 영상 검색에 사용되는 회전불면 모양기술자 계산을 위한 전처리 과정에 관한 것으로서 영상의 정규화 방법에 관한 것이다. The present invention relates to a normalization method for calculating shape descriptors in a content-based image retrieval system, an image retrieval method using the same, and a computer-readable recording medium recording a program for realizing the above methods. The present invention relates to a preprocessing process for calculating a revolving blower shape descriptor used for retrieval.

일반적으로, 영상에서 객체의 모양 정보는 객체(object)를 인식하는데 매우 유용하게 쓰인다. 이러한 모양정보의 일반적인 기술자들은 객체의 회전, 이동, 크기변화, 잡음에 매우 강인하다. 그러나, 이러한 일반적인 모양 기술자들은 한쪽으로 일그러지거나 비스듬히 기울어진 객체, 원근법이 적용된 객체, 즉 어파인(Affine) 변환된 객체에는 매우 취약한 면을 나타내므로 같은 객체라 하더라도 서로 다른 객체로 인식되어지기 쉽다. In general, the shape information of an object in an image is very useful for recognizing an object. General descriptors of this shape information are very robust to object rotation, movement, size change and noise. However, these general shape descriptors are very vulnerable to objects that are distorted or obliquely to one side, objects to which perspective is applied, that is, Affine transformed objects, and therefore, the same objects are likely to be recognized as different objects.

그러나, 종래에는 회전불변 모양 기술자를 계산하는데 있어서, 객체의 정규화 과정이 아예 없거나, 있어도 비스듬히 기울어진 객체, 원근법이 적용된 객체 등과 같이 객체의 전반적인 형태를 변환시키는 어파인 변환에 불변하게 하는 정규화 과정은 없었다. However, in the conventional calculation of the rotation invariant descriptor, the normalization process of changing the overall shape of the object, such as an obliquely inclined object or an object to which the perspective is applied, is absent even if the object is not normalized. There was no.

따라서, 특히 정지 영상의 검색에 있어서 쓰이는 여러 가지 기술자중 검색 성능을 높일 수 있는 모양 기술자를 얻기 위해서는, 모양 기술자 계산의 전처리 과정으로 어파인 변환에 불변한 기술자를 얻을 수 있는 방안이 필수적으로 요구된다.
Therefore, in order to obtain a shape descriptor that can improve retrieval performance among various descriptors used in retrieval of still images, a method of obtaining an invariant descriptor for affine conversion is required as a preprocessing process of shape descriptor calculation. .

본 발명은 상기한 바와 같은 요구에 부응하기 위하여 제안된 것으로서, 광범 위한 영상 데이터베이스 내에서 사용자가 요구하는 특정 영상을 검색하는데 쓰였던 일반적인 모양 기술자가 영상내 잡음이나 한쪽 방향으로만 일그러짐, 원근법 적용 등 어파인 변환된 객체에 대해 취약하므로, 이를 보완하기 위해 어파인 변환에 불변한 모양 기술자를 얻을 수 있는 정규화 방법 및 그를 이용한 영상 검색 방법과 상기 방법들을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공하는데 그 목적이 있다.
The present invention has been proposed to meet the above-mentioned demands, and a general shape descriptor used to search for a specific image requested by a user in an extensive image database may have noise in the image, distortion in only one direction, and application of perspective. Because it is vulnerable to phosphorous-converted objects, to compensate for this, a normalization method that can obtain a shape descriptor that is invariant to affine transformation, an image retrieval method using the same, and a computer-readable recording medium recording a program for realizing the methods. The purpose is to provide.

상기 목적을 달성하기 위한 본 발명은, 영상 정규화 방법에 있어서, 영상내 객체의 이동, 회전, 크기 변화, 잡음, 또는 일그러짐으로 인한 변형시에, 객체 영상을 표준영상으로 변환 처리하여, 그 결과 영상에 여러 가지 모양 기술자를 적용시켜 어파인 변환에 불변한 모양 기술자를 얻되, 변형중 잡음, 이동, 크기 변화, 또는 일그러짐에 대해서 컴팩트 알고리즘을 적용하여 영상을 정규화하는 제1 영상 정규화 단계; 및 회전각도에 불변하게 유사한 모양의 객체를 추출하기 위해 상기 컴팩트 알고리즘을 거친후 저니키 기반 모멘트나 에이알(AR) 변환을 이용한 회전불변 모양기술자를 계산하여 영상을 정규화하는 제2 영상 정규화 단계를 포함하여 이루어진 것을 특징으로 한다. In order to achieve the above object, the present invention provides a method for normalizing an image, wherein the object image is converted into a standard image when the object is transformed due to movement, rotation, size change, noise, or distortion in the image. A first image normalization step of applying various shape descriptors to a shape descriptor to obtain an invariant shape descriptor for affine transformation, and normalizing an image by applying a compact algorithm to noise, shift, size change, or distortion during deformation; And a second image normalization step of normalizing an image by calculating a rotation invariant shape descriptor using Jonikey-based moments or AR (AR) transforms after going through the compact algorithm to extract an object of invariably similar shape to the rotation angle. Characterized in that the made up.

한편, 본 발명은, 내용기반 영상 검색 시스템에서의 영상 검색 방법에 있어서, 영상내 객체의 이동, 회전, 크기 변화, 잡음, 또는 일그러짐 변형시에, 컴팩트 알고리즘을 적용해 영상을 정규화하여 객체의 이동, 크기 변화, 또는 일그러짐에 불변한 정규화된 영상을 얻는 영상 정규화 단계; 상기 정규화된 영상에서 모양 기술자를 계산하는 단계; 및 상기 계산된 모양 기술자를 이용하여 영상 데이터베이스 내에서 유사도를 계산하여 영상을 검색하는 단계를 포함하여 이루어진 것을 특징으로 한다. On the other hand, the present invention, in the image retrieval method in the content-based image retrieval system, the movement of the object by normalizing the image by applying a compact algorithm during the movement, rotation, size change, noise, or distortion of the object in the image An image normalization step of obtaining a normalized image that is invariant to size change, or distortion; Calculating a shape descriptor from the normalized image; And searching for an image by calculating a similarity level in the image database using the calculated shape descriptor.

상기 목적을 달성하기 위한 본 발명은, 프로세서를 구비한 영상 정규화 시스템에, 영상내 객체의 이동, 회전, 크기 변화, 잡음, 또는 일그러짐으로 인한 변형시에, 객체 영상을 표준영상으로 변환 처리하여, 그 결과 영상에 여러 가지 모양 기술자를 적용시켜 어파인 변환에 불변한 모양 기술자를 얻되, 변형중 잡음, 이동, 크기 변화, 또는 일그러짐에 대해서 컴팩트 알고리즘을 적용하여 영상을 정규화하는 기능; 및 회전각도에 불변하게 유사한 모양의 객체를 추출하기 위해 상기 컴팩트 알고리즘을 거친후 저니키 기반 모멘트나 에이알(AR) 변환을 이용한 회전불변 모양기술자를 계산하여 영상을 정규화하는 기능을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공한다.The present invention for achieving the above object, in the image normalization system equipped with a processor, by transforming the object image to a standard image at the time of deformation due to the movement, rotation, size change, noise, or distortion of the object in the image, As a result of applying various shape descriptors to the image to obtain an invariant shape descriptor for affine transformation, and normalizing the image by applying a compact algorithm to noise, shift, size change, or distortion during deformation; And a method for normalizing an image by calculating a rotation invariant shape descriptor using Jonikey-based moment or AR transform after going through the compact algorithm to extract an object invariably similar to the rotation angle. Provide a computer readable recording medium having recorded thereon.

한편, 본 발명은, 프로세서를 구비한 내용기반 영상 검색 시스템에, 영상내 객체의 이동, 회전, 크기 변화, 잡음, 또는 일그러짐 변형시에, 컴팩트 알고리즘을 적용해 영상을 정규화하여 객체의 이동, 크기 변화, 또는 일그러짐에 불변한 정규화된 영상을 얻는 기능; 상기 정규화된 영상에서 모양 기술자를 계산하는 기능; 및 상기 계산된 모양 기술자를 이용하여 영상 데이터베이스 내에서 유사도를 계산하여 영상을 검색하는 기능을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공한다.Meanwhile, the present invention provides a content-based image retrieval system equipped with a processor to normalize an image by applying a compact algorithm when moving, rotating, changing size, noise, or distortion in an object in the image. The ability to obtain a normalized image that is invariant to changes, or distortions; Calculating a shape descriptor from the normalized image; And a computer-readable recording medium having recorded thereon a program for realizing a function of searching for an image by calculating similarity in the image database using the calculated shape descriptor.

본 발명은 정지 및 동영상의 검색에 있어서 쓰이는 여러 가지 기술자중 모양 기술자의 계산의 전처리 과정으로 쓰일 수 있는 것으로서, 좀 더 검색 성능을 높일 수 있는 모양 기술자를 얻을 수 있도록 한다. The present invention can be used as a preprocessing process of the calculation of the shape descriptor among various descriptors used in the retrieval of still images and moving images, and it is possible to obtain a shape descriptor that can further improve the search performance.

본 정규화 방법은, 영상 검색에 있어서 객체의 모양을 기준으로 한 검색(캐릭터 이미지, 상표 이미지 검색 등이 이에 해당)을 행할 때 흔히 나타나기 쉬운 객체 영상의 일그러짐 또는 원근법에 의한 기울어짐 등이 발생하여도 동일한 객체로 인식할 수 있도록 하여 준다.In the normalization method, even when a search based on the shape of an object (such as a character image or a trademark image search) is performed in the image search, distortion of the object image or inclination due to perspective may occur. It can be recognized as the same object.

이렇게 본 정규화 방법을 거친 후, 모양 기술자를 계산하면 어파인 변환에 불변한 기술자를 얻어낼 수 있다. 즉, 입력된 영상에 본 정규화 방법을 적용하면 표준영상으로 변환되고 이 표준영상에 여러 형태의 회전불변 모양 기술자를 적용시키면, 결국 어파인 변환에 불변한 모양 기술자를 얻게 된다. 이렇게 얻어진 모양 기술자를 이용하여 여러 가지 영상의 검색에 사용할 수 있다.After this normalization method, the shape descriptor can be calculated to obtain a descriptor invariant to the affine transformation. That is, if the normalization method is applied to the input image, it is converted into a standard image, and if various types of rotation invariant descriptors are applied to the standard image, the shape descriptor invariant to the affine transformation is obtained. The shape descriptor thus obtained can be used to search for various images.

따라서, 본 정규화 방법을 모양 기술자 계산 이전의 전처리 과정으로 이용함으로써 영상내에서 한쪽 방향으로만 확대/축소되거나 원근법에 의한 기울어짐 등을 보상할 수 있다.Therefore, by using the normalization method as a preprocessing process before calculating the shape descriptor, it is possible to compensate for the inclination by the enlargement / reduction in one direction or the perspective in the image.

본 발명으로 전처리되어진 모양 기술자는 여러 분야에서 응용되어질 수 있는데, 영상내의 객체를 중요시하는 모든 내용기반 검색에 유용하게 쓰여질 수 있고, 특히 캐릭터 이미지 검색, 상표 검색 등 고부가가치의 지적 재산권을 갖는 영상의 검색에도 쓰여질 수 있다. The shape descriptor preprocessed by the present invention can be applied in various fields, and can be usefully used for all content-based searches that focus on objects in the image. In particular, image descriptors having high value-added intellectual property such as character image search and trademark search can be used. Can also be used for searching.

상술한 목적, 특징들 및 장점은 첨부된 도면과 관련한 다음의 상세한 설명을 통하여 보다 분명해 질 것이다. 이하, 첨부된 도면을 참조하여 본 발명에 따른 바람직한 일실시예를 상세히 설명한다.The above objects, features and advantages will become more apparent from the following detailed description taken in conjunction with the accompanying drawings. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1 은 본 발명이 적용되는 내용기반 영상 검색 시스템의 구성 예시도로서, 영상 유사도 가중치 조절을 통한 내용기반 영상 검색 시스템을 일 예로 들어 설명한다. 1 is an exemplary configuration diagram of a content-based image retrieval system to which the present invention is applied, and will be described by taking a content-based image retrieval system through image similarity weight adjustment as an example.

도 1에 도시된 바와 같이, 본 발명의 영상 유사도 가중치 조절을 통한 내용기반 영상 검색 시스템은, 운용자로부터 영상을 입력받아 저장하고, 저장된 영상의 특징(모양 기술자)을 추출하여 저장하며, 유사도를 계산하여 저장하고, 일정한 크기의 소영상을 생성하여 저장하기 위한 영상 입력 장치(10)와, 영상, 특징 벡터, 소영상, 유사도 테이블을 저장하기 위한 저장 장치(20)와, 사용자로부터 질의를 입력받아 질의의 유사도에 따른 소영상을 출력하고, 출력된 소영상에 대한 사용자의 선택 정보를 피드백받아 유사도 가중치를 변화시켜 그 결과에 따른 영상을 출력하기 위한 질의 응답 장치(30)를 포함한다.As shown in FIG. 1, the content-based image retrieval system using the image similarity weight control according to the present invention receives an image from an operator, stores the image, extracts and stores a feature (shape descriptor) of the stored image, and calculates the similarity. And a video input device 10 for generating and storing a small image having a predetermined size, a storage device 20 for storing an image, a feature vector, a small image, and a similarity table, and receiving a query from a user. It includes a query response device 30 for outputting a small image according to the similarity of the query, receiving the user's selection information on the output small image to change the similarity weight and output the image according to the result.

상기 영상 입력 장치(10)는 운용자로부터의 데이터베이스 입력 요구가 입력됨에 따라 영상을 분석하여 데이터베이스화하는 데이터베이스 입력부(11)와, 데이터베이스 입력부(11)의 제어에 의해 현재 입력되는 영상을 소형 영상으로 나누어 소형 영상 디렉토리 데이터베이스(21)에 저장하는 소형 영상 생성부(12)와, 데이터베이스 입력부(11)의 제어에 의해 현재 입력되는 영상에 대하여 일정 개수의 추출 사양별로 특징 벡터를 만들어 특징 벡터 디렉토리 데이터베이스(24)에 저장하는 특 징 추출부(13)와, 추출된 일정 개수의 특징 벡터를 이용하여 영상 디렉토리 데이터베이스(22)내의 다른 영상의 특징 벡터와의 각각의 특징에 대해 유사도를 계산하고, 계산된 각각의 특징에 대한 유사도를 동일한 중요도를 갖게 정규화시키며, 각 특징에서의 정규화된 유사도를 단일의 유사도로 결합하여 인덱스를 구성한 후에, 구성된 인덱스를 인덱스 데이터베이스(23)에 저장하는 유사도 계산부(14)를 구비한다.The image input apparatus 10 divides the image currently input under the control of the database input unit 11 into a small image and a database input unit 11 for analyzing and databaseting the image as a database input request from the operator is input. The feature vector directory database 24 generates a feature vector for a certain number of extraction specifications for the image currently input under the control of the database input unit 11 and the small image generation unit 12 stored in the small image directory database 21. The feature extractor 13 for storing the information in the feature extractor 13 and a predetermined number of extracted feature vectors, and calculates the similarity with respect to each feature of the feature vector of the other image in the image directory database 22, Normalize the similarity of the features to have the same importance, and normalize the similarity in each feature Is combined with a single similarity to form an index, and then the similarity calculator 14 stores the constructed index in the index database 23.

상기 저장 장치(20)는 소형 영상 디렉토리를 저장하고 있는 소형 영상 디렉토리 데이터베이스(21)와, 영상 디렉토리를 저장하고 있는 영상 디렉토리 데이터베이스(22)와, 유사도 테이블을 저장하고 있는 인덱스 데이터베이스(23)와, 특징 벡터 디렉토리를 저장하고 있는 특징 벡터 디렉토리 데이터베이스(24)를 구비한다.The storage device 20 includes a small image directory database 21 storing a small image directory, an image directory database 22 storing an image directory, an index database 23 storing a similarity table, A feature vector directory database 24 for storing feature vector directories is provided.

상기 질의 응답 장치(30)는 질의 영상을 분석하여 인덱스 데이터베이스(23)로부터 유사도 테이블을 출력시키는 질의부(32)와, 질의부(32)에 의해 출력되는 유사도 테이블에 해당하는 소형 영상을 소형 영상 디렉토리 데이터베이스(21)에서 독취하여 검색 사용자에게 보여주는 결과 표시부(33)와, 사용자로부터 입력되는 피드백 신호를 입력받아 가중치를 조절하고, 조절된 가중치에 따른 유사도를 구하여 해당하는 인덱스 데이터베이스(23)로부터 유사도 테이블을 결과 표시부(23)로 출력하기 위한 가중치 조절부(31)를 구비한다. The query response device 30 analyzes the query image to output a similarity table from the index database 23, and a small image corresponding to the similarity table output by the query unit 32. The result display unit 33 reads from the directory database 21 and shows it to the search user, and receives a feedback signal input from the user, adjusts the weight, obtains the similarity according to the adjusted weight, and obtains the similarity from the corresponding index database 23. A weight adjusting unit 31 for outputting the table to the result display unit 23 is provided.

상기한 바와 같은 구성을 갖는 본 발명이 적용되는 가중치 조절을 통한 내용기반 영상 검색 시스템의 동작을 상세히 살펴보면 다음과 같다.The operation of the content-based image retrieval system through weight adjustment to which the present invention having the above configuration is applied will be described in detail as follows.

데이터베이스 입력부(11)는 처리해야 할 영상을 영상 디렉토리 데이터베이스(22)에 저장한 후에, 소형 영상 생성부(12)를 구동시켜 새로운 소형 영상(Thumbnail, 즉 화면표시를 용이하게 하기 위해 큰 영상을 100 ×100 정도의 작은 영상으로 만든 것)을 만들어 소형 영상 디렉토리 데이터베이스(21)에 저장한다. The database input unit 11 stores the image to be processed in the image directory database 22 and then drives the small image generating unit 12 to display a new large image (Thumbnail, that is, a large image to facilitate screen display). A small image of about 100 pixels) is stored in the small image directory database 21.

그리고, 새로운 영상에 대한 특징 벡터는 없으므로 특징 추출부(13)를 구동시켜 새로운 객체 영상의 모양에 대한 회전불변 특징량을 추출하여 특징 벡터를 만들고, 그 각각의 특징별로 추출된 특징 벡터를 특징 벡터 디렉토리 데이터베이스(24)에 저장한다.In addition, since there is no feature vector for the new image, the feature extractor 13 is driven to extract the invariant feature quantities for the shape of the new object image to form the feature vector, and the feature vector is extracted for each feature. Stored in the directory database 24.

유사도 계산부(14)는 특징 추출부(13)에 의해 추출된 일정 개수의 특징 벡터를 이용하여 영상 디렉토리 데이터베이스(22)내의 다른 영상의 특징 벡터와의 각각의 특징에 대해 유사도를 계산하고, 계산된 각각의 특징에 대한 유사도를 동일한 중요도를 갖도록 정규화시키며, 각 특징에서의 정규화된 유사도를 단일의 유사도로 결합시킨 후에 테이블로 만들어 인덱스를 생성하여 인덱스 데이터베이스(23)에 저장한다.The similarity calculator 14 calculates and calculates similarity for each feature with the feature vector of another image in the image directory database 22 using a predetermined number of feature vectors extracted by the feature extractor 13. The similarity for each feature is normalized to have the same importance, the normalized similarity in each feature is combined into a single similarity, and then made into a table to generate an index and stored in the index database 23.

그리고, 유사도 계산부(14)는 운용자가 입력한 모든 새로운 영상에 대해 특징 추출부(13)에 의한 특징 추출 동작이 완료되면 데이터베이스 입력부(11)의 제어에 의해 구동된다.The similarity calculator 14 is driven by the control of the database input unit 11 when the feature extraction operation by the feature extractor 13 is completed for all new images input by the operator.

상기한 바와 같이 모든 영상의 각 특징에서의 정규화된 유사도가 유사도 계산부(14)에서 단일의 결합된 유사도로 계산되어 테이블로 만들어지고, 그 유사도 계산부(14)에서 인덱스를 생성하게 되면 영상의 데이터베이스화는 종료된다. As described above, the normalized similarity in each feature of all images is calculated as a single combined similarity in the similarity calculator 14 to form a table, and when the similarity calculator 14 generates an index, Databaseization ends.

질의부(32)는 일반 사용자가 이미 입력된 영상 디렉토리 데이터베이스(22)에서 영상을 검색하고자 할 때 사용하는 질의도구를 나타내는데, 사용자의 검색 요구에 맞게 질의 영상을 분석한 후에, 특징 추출부(13)와 유사도 계산부(14)를 이용하여 인덱스 데이터베이스(23)로부터 검색된 영상을 결과 표시부(33)로 보내준다. The query unit 32 represents a query tool used when a general user wants to search for an image in the image directory database 22 that has been input. After analyzing the query image according to a user's search request, the feature extractor 13 ) And the similarity calculator 14 transmits the image retrieved from the index database 23 to the result display unit 33.

여기서, 사용자는 결과 표시부(33)에 보여진 결과에 대해 원하는 영상과 원하지 않는 영상을 선택하여 다시 질의하면, 가중치 조절부(31)가 사용자의 의견을 분석하여 각각의 특징에 대한 가중치를 조절하고, 다시 인덱스 데이터베이스(23)로부터 변화된 가중치에 의한 결과를 도출하여 결과 표시부(33)로 보내준다. In this case, when the user selects a desired image and an undesired image and inquires again about the result shown on the result display unit 33, the weight adjusting unit 31 analyzes the user's opinion and adjusts the weight for each feature, Again, the result of the changed weight is derived from the index database 23 and sent to the result display unit 33.

상기 질의부(32)는 결과 표시부(33)가 임의로 선택해서 보여준 샘플 영상일 수도 있고, 사용자가 그린 영상(즉, 그림) 또는 스캐너 등의 외부 영상 입력 장치로부터 입력받은 또 다른 영상 데이터 또는 사용자가 지정한 색상과 그 비율 등이 될 수도 있다.The query unit 32 may be a sample image that is arbitrarily selected and displayed by the result display unit 33, or another image data or a user input from an external image input apparatus such as an image drawn by the user (ie, a picture) or a scanner. It can be a specified color and its ratio.

그리고, 결과 표시부(33)는 인덱스 데이터베이스(23)로부터 검색되어 입력되는 영상을 받아 소형 영상 디렉토리 데이터베이스(21)에서 소형 영상을 가져와 검색 사용자에게 보여준다.In addition, the result display unit 33 receives the input image searched from the index database 23 and retrieves the small image from the small image directory database 21 and shows it to the search user.

이제, 내용 기반 영상 검색에 사용되는 회전불변 모양 기술자인 Zernike Moment를 이용한 모양기술자나 혹은 AR 변환 모양 기술자 계산의 전처리 과정으로서, 어파인(Affine) 변환에 불변한 모양 기술자를 얻을 수 있는 정규화 방법에 대해 보다 상세히 설명한다. Now, as a preprocessing of the shape descriptor using Zernike Moment, a rotation invariant shape descriptor used for content-based image retrieval, or the AR transform shape descriptor calculation, the normalization method to obtain the shape descriptor invariant to the affine transformation It will be described in more detail.

영상의 정규화는 왜곡되거나 변형된 영상을 일정한 형태의 표준화 영상으로 만드는 과정을 말한다. 이러한 변형에는 영상내 객체에 잡음, 객체의 이동, 회전, x-y중 단방향 혹은 양방향의 크기 변화, 원근변화 등이 있다. 이러한 변형은 컴팩트 알고리즘을 사용하여 정규화한다. 정규화된 객체영상으로부터 회전불변 모양기술자인 Zernike Moment 혹은 AR 변환 모양기술자를 계산할 수 있다. Normalization of an image refers to a process of transforming a distorted or deformed image into a standardized image of a certain form. Such deformations include noise in an object in the image, movement of the object, rotation, size change in one or two directions during x-y, and perspective change. This variant is normalized using a compact algorithm. From the normalized object image, we can calculate the Zernike Moment or AR transform shape descriptors, which are rotation invariant shape descriptors.

2차원 영상에 대한 어파인 변환은 다음의 (수학식 1)과 같이 표현되어질 수 있다.
The affine transformation on the 2D image may be expressed as Equation 1 below.

상기 (수학식 1)에서, [u v 1]^T는 점[x y 1]^T의 어파인 변환된 점이고, A는 어파인 행렬이다. 따라서, 상기 (수학식 1)에서는 크기 변화, 회전, 원근법에 의한 기울어짐, 변환이 나타난다. In Equation 1, [uv 1] ^T is an affine transformed point of point [xy 1] ^T and A is an affine matrix. Therefore, in the above Equation 1, size change, rotation, inclination due to perspective, and transformation appear.

상기 (수학식 1)은 어파인 행렬에 의해 연관된 두 점들간의 관계를 나타낸다. 그러나, 여기서 종종 어파인 계수를 얻기가 어려울 수가 있는데, 이는 두 개의 이미지상에서 해당되는 두 점을 찾는 것이 명백하지가 않을 뿐 아니라, 노이즈에 의한 변화에도 민감하기 때문이다. 그러므로, 다음의 (수학식 2)가 어파인 관계의 새로운 형식을 제공해 준다(모멘트의 식을 이용).
Equation 1 shows a relationship between two points associated by an affine matrix. However, it is often difficult to obtain affine coefficients here, because it is not only obvious to find the corresponding two points on two images, but also sensitive to changes caused by noise. Therefore, Equation 2 below provides a new form of affine relationship (using the equation of moment).

상기 (수학식 2)에서, m_pq는 정규화된 이미지 f(x,y)에서 pq차의 모멘트이고, m'_pq는 어파인 변환된 이미지 f(u,v)의 pq차의 모멘트이다. pq차의 모멘트 정의는 다음의 (수학식 3)과 같다.
In Equation (2), m _pq is the moment of the pq order in the normalized image f (x, y), m ' _pq is the moment of the pq order of the affine transformed image f (u, v). The moment definition of the pq order is shown in Equation 3 below.

컴팩트 알고리즘의 목적은 알고리즘을 거친 영상의 공분산 행렬이 항등 행렬이 되도록 두 개의 선형 변환을 통해 영상을 조정하는 것이다. The purpose of the compact algorithm is to adjust the image through two linear transformations so that the covariance matrix of the algorithmic image is an identity matrix.

한 영상에서 공분산 행렬은 다음의 (수학식 4)와 같이 계산되어질 수 있다.
In one image, the covariance matrix may be calculated as shown in Equation 4 below.

행렬 C의 고유벡터가 행을 이루는 행렬을 행렬 E라 하자(가장 작은 고유치에 해당하는 고유벡터를 가장 마지막 행으로 놓는다). 결과적으로 회전 변환 행렬 E를 다음의 (수학식 5)와 같이 만들 수 있다.
Assume that matrix E consists of rows of eigenvectors of matrix C (the eigenvectors corresponding to the smallest eigenvalues are placed as the last row). As a result, the rotation transformation matrix E can be made as shown in Equation 5 below.

여기서, e_ix와 e_iy는 각각의 고유치에 해당되는 고유 벡터이다. 변환된 이미지의 공분산 행렬 C'는 다음의 (수학식 6)과 같이 된다.
Here, e _ix and e _iy are eigenvectors corresponding to respective eigenvalues. The covariance matrix C 'of the transformed image is expressed by Equation 6 below.

결과적으로 변환된 이미지는 새로운 좌표계에서는 상관성이 없어진 상태의 이미지가 된다. As a result, the converted image becomes an image with no correlation in the new coordinate system.

다음 단계에서는 좌표계를 원점을 이미지의 중심으로 이동하고 좌표를 행렬 E와 곱해서 변환시킨다. 이렇게 해서 새로운 좌표계는 C의 고유 벡터와 일치하게 된다. [u,v]^T를 새로운 좌표라고 하면 다음의 (수학식 7)과 같은 식이 성립한다. The next step is to transform the coordinate system by moving the origin to the center of the image and multiplying the coordinates by the matrix E. In this way, the new coordinate system matches the eigenvectors of C. If [u, v] ^T is the new coordinate, the following equation (7) is established.

상기 (수학식 7)에서는 새로운 좌표계에서의 상관성을 제거하기 위해서 좌표계를 회전 이동시켰다. 그러므로, 각각의 원소들의 크기를 독립적으로 다룰 수 있다. 최종적인 목표는 바로 크기가 변환된 항등 행렬과 같은 공분산 행렬을 갖는 이미지를 얻는 것이다. 그래서, 고유치에 해당되는 새로운 두 개의 좌표의 크기를 (수학식 8)과 같이 변환시킨다.
In Equation (7), the coordinate system is rotated to remove the correlation in the new coordinate system. Therefore, the size of each element can be dealt with independently. The final goal is to obtain an image with a covariance matrix, such as a transformed identity matrix. Thus, the size of the two new coordinates corresponding to the eigenvalues is converted as shown in Equation (8).

여기서, 크기 변환 행렬은 다음의 (수학식 9)와 같이 정의된다.
Here, the magnitude transformation matrix is defined as follows.

상기 (수학식 9)에서, c는 크기 변환 상수이다. In Equation (9), c is a size conversion constant.

이제 어파인 변환된 영상 f(u,v)로부터 (수학식 10)과 같은 컴팩트 알고리즘을 거친 영상 f(x',y')를 구할 수 있는 알고리즘을 얻을 수 있다.
Now, from the affine-converted image f (u, v), an algorithm for obtaining an image f (x ', y') that has undergone a compact algorithm such as Equation (10) can be obtained.

이로써, 상기 (수학식 10)을 이용한 객체의 이동, 크기 변화, 일그러짐에 불변한 정규화된 영상을 얻어낼 수 있다.As a result, a normalized image invariant to the movement, the change in size, and the distortion of the object using Equation 10 may be obtained.

위의 정규화된 영상으로부터 AR 변환을 이용하여 향상된 검색 효율을 얻을 수 있다.Improved retrieval efficiency can be obtained by using AR transform from the normalized image.

본 발명에 따르면 다음과 같은 효과가 있다. According to the present invention has the following effects.

첫째, 영상 데이터베이스 검색시 검색 효율을 높일 수 있다. 정규화 방법 사용시 성능이 향상됨을 도 3에서 보여준다. 이는 엠펙7(MPEG-7) 국제 표준용 실험 데이터에 대한 성능 향상 실험 결과를 나타낸다(평가척도 : BEP).First, the search efficiency can be improved when searching the image database. 3 shows that the performance is improved when using the normalization method. This shows the results of performance improvement experiments on experimental data for the MPEG-7 (MPEG-7) international standard (scale of evaluation: BEP).

보통 영상 검색에서는 크게 색상, 모양, 질감 등 3가지 정도의 기술자를 이용하여 검색하는데 이 중 모양을 이용한 검색시 그 효율을 높일 수 있다. 모양 기술자가 중요시 되는 영상검색으로는 캐릭터 이미지 검색, 상표 이미지 검색 등이 있다.In general, image search is performed by using three kinds of descriptors such as color, shape, and texture. Among them, the efficiency of searching by shape is improved. Image search, which is important for shape technicians, includes character image search and trademark image search.

둘째, 본 발명은 모양 기술자를 적용하기 이전의 전처리 과정인 정규화 방법 이므로 이 단계를 거친 뒤에는 어떠한 모양 기술자를 적용시키더라도 그 검색 효율의 향상을 기대할 수 있다.Second, since the present invention is a normalization method which is a preprocessing process before applying the shape descriptor, after the step, any shape descriptor can be applied to improve the search efficiency.

그 일예시로서, 도 2를 참조하여 본 발명에 따른 영상 검색 방법에 대해 살펴보기로 한다. As an example, an image search method according to the present invention will be described with reference to FIG. 2.

도 2에 도시된 바와 같이, 본 발명에 따른 영상 검색 방법은, 먼저 원영상을 입력받아(201), 내용 기반 영상 검색에 사용되어지는 AR 변환 모양 기술자 계산의 전처리 과정으로서 원영상의 어파인 불변인 정규화 방법을 수행한다(202). 영상의 정규화는 왜곡되거나 변형된 영상을 정규화된 영상으로 만드는 과정을 말한다. 이러한 변형에는 영상내 객체에 잡음, 객체의 이동, 회전, 크기 변화, 일그러짐 등이 있다. 이러한 변형중 잡음, 이동, 크기 변화, 일그러짐에 대해서 컴팩트 알고리즘을 사용하여 정규화한다. 회전에 대한 정규화는 컴팩트 알고리즘을 거친 후 ART 기반 형태기술자나 저니키 모멘트 기반 형태기술자를 계산함으로서 행해질 수 있다.As shown in FIG. 2, the image retrieval method according to the present invention first receives an original image (201), and performs affinity of an original image as a preprocessing process of calculating an AR transform shape descriptor used for content-based image retrieval. A normalization method is performed (202). Normalization of an image refers to a process of making a distorted or deformed image into a normalized image. Such deformations include noise, movement, rotation, size change, and distortion of objects in the image. Noise, shifts, magnitude variations, and distortions during these strains are normalized using a compact algorithm. Normalization for rotation can be done by going through a compact algorithm and computing either ART-based shape descriptors or Jonikey moment-based shape descriptors.

이후, 정규화된 영상에서 AR 변환을 이용하여 모양 기술자를 계산(특징을 추출)하고(203), 계산된 모양 기술자를 이용하여 영상 데이터베이스 내에서 유사도를 계산하여(204) 영상을 검색하여 검색된 영상을 출력한다(205).Then, the shape descriptor is calculated (feature extraction) using the AR transform on the normalized image (203), the similarity is calculated in the image database using the calculated shape descriptor (204), and the image is searched and retrieved. Output 205.

상술한 바와 같은 본 발명의 방법은 프로그램으로 구현되어 컴퓨터로 읽을 수 있는 기록매체(씨디롬, 램, 롬, 플로피 디스크, 하드 디스크, 광자기 디스크 등)에 저장될 수 있다.The method of the present invention as described above may be implemented as a program and stored in a computer-readable recording medium (CD-ROM, RAM, ROM, floppy disk, hard disk, magneto-optical disk, etc.).

이상에서 설명한 본 발명은, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에 있어 본 발명의 기술적 사상을 벗어나지 않는 범위내에서 여러 가지 치 환, 변형 및 변경이 가능하므로 전술한 실시예 및 첨부된 도면에 한정되는 것이 아니다.
The present invention described above is capable of various substitutions, modifications, and changes without departing from the technical spirit of the present invention for those skilled in the art to which the present invention pertains. It is not limited to the drawings shown.

상기한 바와 같은 본 발명은, 영상 검색에 있어서 객체의 모양을 기준으로 한 검색(캐릭터 이미지, 상표 이미지 검색 등)을 행할 때 흔히 나타나기 쉬운 객체의 일그러짐, 원근법에 의한 기울어짐 등이 발생하여도 동일한 객체로 인식할 수 있도록 함으로써, 자연 영상 검색을 제외한 영상내의 객체를 중요시하는 내용기반 검색에 유용하게 쓰여질 수 있는 효과가 있다.The present invention as described above is the same even if the distortion of the object, the inclination by perspective, etc., which are often prone to occur when conducting a search based on the shape of the object (character image, trademark image search, etc.) in the image search. By allowing the object to be recognized as an object, there is an effect that can be usefully used for a content-based search that focuses on the objects in the image except the natural image search.

Claims

In the image normalization method,

In case of deformation due to movement, rotation, size change, noise, or distortion of the object in the image, the object image is converted to the standard image, and as a result, various shape descriptors are applied to the image, resulting in an invariant shape descriptor for affine transformation. But get

A first image normalization step of normalizing an image by applying a compact algorithm to noise, movement, magnitude change, or distortion during deformation; And

A second image normalization step of normalizing an image by calculating a rotation invariant shape descriptor using Jonikey-based moments or AR (AR) transforms after going through the compact algorithm to extract an object invariably similar to the rotation angle.

Normalization method for calculating the shape descriptor comprising a.

The method of claim 1,

The compact algorithm,

Adjust the image through two linear transformations so that the covariance matrix of the image that has undergone the algorithm is an identity matrix, and move, change, or distort the object from the affine transformed image f (u, v) by the following equation. And a normalized image f (x ', y') which is invariant to.

(here,

(Where c is the size conversion constant),

(Where e _ix and e _iy are eigenvectors corresponding to their respective eigenvalues), m _pq is the moment of order _pq in the normalized image f (x, y), and m ' _pq is the affine transformed image f (u, moment of order pq of v)

)being)

The method according to claim 1 or 2,

The first image normalization step,

Using the compact algorithm, noise, object movement, rotation, simple size change on the xy coordinate axis, or size change only on one side, or obliquely tilted or perspective are applied to the object in the image. In the case of distortion, a normalization method for calculating a shape descriptor, wherein the object image is first normalized to a standard image by adjusting the covariance matrix calculated from the coordinate values of the object on the xy coordinate axis to be the same.

The method of claim 3, wherein

The second image normalization step,

A normalization method for calculating a shape descriptor, characterized by normalizing a feature amount of an object image by calculating a feature amount for a rotation invariant shape descriptor such as a Jonikey moment or an AR transformation.

An image retrieval method in a content-based image retrieval system,

An image normalization step of normalizing an image by applying a compact algorithm when moving, rotating, changing size, noise, or distortion in an object in the image to obtain a normalized image invariant to movement, size change, or distortion of the object;

Calculating a shape descriptor from the normalized image; And

Searching for an image by calculating a similarity level in an image database using the calculated shape descriptor

Image search method comprising a.

The method of claim 5,

The shape descriptor,

Image retrieval method, characterized in that the rotation invariant descriptor.

The method according to claim 5 or 6,

The image normalization step,

An image retrieval method characterized by normalizing by applying a compact algorithm for shifting, size change, noise, or distortion during deformation to obtain a shape descriptor that is invariant to affine transformation for the image after preprocessing.

The method of claim 7, wherein

The compact algorithm,

Adjust the image through two linear transformations so that the covariance matrix of the image that has undergone the algorithm is an identity matrix, but the object's movement, size change, and distortion can be changed from the affine transformed image f (u, v) by the following equation. And a constant normalized image f (x ', y').

(here,

(Where c is the size conversion constant),

)being)

In an image normalization system with a processor,

A function of normalizing an image by applying a compact algorithm to noise, shift, size change, or distortion during deformation; And

Function to normalize the image by calculating rotation invariant shape descriptor using Jonikey-based moment or AR (AR) transformation after going through the compact algorithm to extract an object of invariably similar shape to rotation angle

A computer-readable recording medium having recorded thereon a program for realizing this.

In the content-based image retrieval system having a processor,

A function of obtaining a normalized image invariant to movement, size change, or distortion of an object by applying a compact algorithm to normalize the image when the object moves, rotates, changes in size, noise, or distortion in the image;

Calculating a shape descriptor from the normalized image; And

A function of searching for an image by calculating a similarity in an image database using the calculated shape descriptor