KR100667156B1

KR100667156B1 - Apparatus and method for character recognition by selecting character region in camera document image captured by portable camera

Info

Publication number: KR100667156B1
Application number: KR1020040103979A
Authority: KR
Inventors: 김계경; 지수영; 정연구; 이재연
Original assignee: 한국전자통신연구원
Priority date: 2004-12-10
Filing date: 2004-12-10
Publication date: 2007-01-12
Also published as: KR20060065197A

Abstract

1. Technical field of the claimed invention

The present invention relates to an apparatus and method for character recognition through character region selection in a text image captured by a portable camera.

2. Technical problem to be solved by the invention

An object of the present invention is to provide a character recognition apparatus and method that improve character extraction and recognition performance by selecting a character region from a text image captured by a portable camera, locally binarizing that region, and then recognizing the characters.

3. Summary of the solution

The present invention provides an apparatus for character recognition through character region selection in a text image captured by a portable camera, comprising: text image input means for receiving a text image captured with a portable camera; image preprocessing means for preprocessing the camera text image received through the text image input means to enhance the image; edge detection means for detecting an edge image from the text image enhanced by the image preprocessing means; pixel dilation means for dilating the pixels of the edge image detected by the edge detection means; virtual character region extraction means for labeling the pixel regions of the text image dilated by the pixel dilation means and extracting a virtual character region that forms a long line with a height at or above a predetermined threshold; binarization means for binarizing the character region extracted by the virtual character region extraction means; character extraction means for extracting characters from the character region binarized by the binarization means; feature extraction means for extracting features of the characters extracted by the character extraction means; and character recognition means for recognizing the characters using the features extracted by the feature extraction means.

4. Important uses of the invention

The present invention is used in fields such as character recognition, a branch of pattern recognition within image processing.

Portable camera, text image acquisition, character region selection, local binarization, character recognition

Description

Apparatus and method for character recognition by selecting character region in camera document image captured by portable camera

FIG. 1 is a block diagram of an apparatus for character recognition through character region selection in a text image captured by a portable camera, according to an embodiment of the present invention;

FIG. 2 is a flowchart of a method for character recognition through character region selection in a text image captured by a portable camera, according to an embodiment of the present invention;

FIGS. 3A and 3B illustrate text images captured by a portable camera;

FIG. 4 shows a text image preprocessed according to the present invention;

FIGS. 5A to 5C show an edge image, a pixel-dilated image, and an image in which only the character region is selected, according to the present invention; and

FIGS. 6A and 6B compare the binarization result for the character region selected from a camera text image with the binarization result for the entire camera text image.

* Explanation of reference numerals for the main parts of the drawings

100: character recognition apparatus
110: text image input unit
120: image preprocessing unit
130: edge detection unit
140: pixel dilation unit
150: virtual character region extraction unit
160: local binarization unit
170: character extraction unit
180: feature extraction unit
190: character recognition unit

The present invention relates to an apparatus and method for character recognition through character region selection in a text image captured by a portable camera. More particularly, it relates to a character recognition apparatus and method that automatically select a character region in a text image captured with the portable camera of a mobile device such as a PDA (personal digital assistant) or mobile phone, and then perform local binarization on the selected character region to recognize the characters.

Until now, character recognition has mostly involved scanning paper documents with a scanner, recognizing them, and converting the result into a text file or electronic document. Recent advances in camera technology, however, have made it possible to build cameras into mobile devices such as PDAs and mobile phones and to use them as a means of acquiring information. As the use of mobile devices grows, so does the demand for processing text images captured by a camera for the convenience of general users.

Unlike a scanner, which has mainly been used to input paper documents, camera character recognition can easily capture any form of text found in the field. It can capture and recognize not only paper documents but also text that cannot be fed into a scanner, and the recognition result can then be put to use. This is what distinguishes it from conventional scanner-based character recognition. Targets of camera character recognition include tourist site notices, explanatory texts for exhibits and materials, monuments, signboards, business cards, and menus. Because characters can be captured and recognized with a portable camera regardless of the medium on which they are recorded, camera character recognition has recently become an actively researched area of character recognition.

However, unlike scanner text images, camera text images are captured under unconstrained conditions and are strongly affected by ambient lighting, so they are known to be much harder to recognize than scanner-based input. Previously published methods for camera character recognition are as follows.

In the first conventional method, characters written on street signboards are captured with a PDA camera and then recognized. The user manually selects only the character region from the background of the captured image, and the text image is then sent to a server. The recognition result was used for information retrieval or translated into a foreign language and served back to the user.

The second conventional method concerns building a database of paper documents using a camera. Characters are extracted from the document image through binarization and then recognized. However, this method does not propose any specific processing for preprocessing or character extraction in camera document image recognition.

In general, camera text image recognition cannot guarantee stable recognition performance, because characters are captured in unconstrained environments and the types of characters to be recognized are highly varied.

A method is therefore needed that extracts and recognizes the character region in a camera text image correctly and robustly against ambient lighting. Memory and processing time have also been identified as the main obstacles to running camera character recognition on a mobile device. A camera character recognition technology is therefore needed that addresses these problems and ensures a stable recognition rate.

The present invention has been proposed to solve the above problems and meet the above needs. Its object is to provide a character recognition apparatus and method that improve character extraction and recognition performance by selecting a character region from a text image captured by a portable camera, locally binarizing that region, and then recognizing the characters.

That is, an object of the present invention is to provide a character recognition apparatus and method that select a character region from a text image captured with a portable camera attached to a mobile device or the like, determine the virtual character size from that region, derive from it the size of the sub-window used for binarization, and recognize the characters after local binarization, thereby improving character extraction and recognition performance, reducing memory use, and shortening processing time.

Other objects and advantages of the present invention can be understood from the following description and will become clearer from the embodiments of the present invention. It will also be readily appreciated that the objects and advantages of the present invention can be realized by the means indicated in the claims and combinations thereof.

To achieve the above object, the apparatus of the present invention is an apparatus for character recognition through character region selection in a text image captured by a portable camera, comprising: text image input means for receiving a text image captured with a portable camera; image preprocessing means for preprocessing the camera text image received through the text image input means to enhance the image; edge detection means for detecting an edge image from the text image enhanced by the image preprocessing means; pixel dilation means for dilating the pixels of the edge image detected by the edge detection means; virtual character region extraction means for labeling the pixel regions of the text image dilated by the pixel dilation means and extracting a virtual character region that forms a long line with a height at or above a predetermined threshold; binarization means for binarizing the character region extracted by the virtual character region extraction means; character extraction means for extracting characters from the character region binarized by the binarization means; feature extraction means for extracting features of the characters extracted by the character extraction means; and character recognition means for recognizing the characters using the features extracted by the feature extraction means.

The method of the present invention is a method for character recognition through character region selection in a text image captured by a portable camera, comprising: an input step of receiving a text image captured with a portable camera; an image preprocessing step of preprocessing the received text image so that the character portions can be extracted correctly; a step of extracting an edge image from the preprocessed text image; a step of dilating the pixels of the edge image in order to extract a character region; a character region extraction step of extracting a character region from the pixel-dilated text image; a binarization step of performing binarization within the extracted character region; a character extraction step of extracting characters from the binarized character region; a step of extracting features from the extracted characters; and a character recognition step of recognizing the extracted characters.

In this way, the present invention selects a character region from a text image captured by a portable camera and applies local binarization to the selected region. To do so, the character height is obtained from the selected character region and used as the window size for local binarization. Because local binarization is applied only to the partial region where characters are written, the result is better than binarizing the entire text image. In addition, because the image to be processed is smaller, the associated memory and processing time problems are addressed at the same time. Real-time recognition therefore becomes possible, and the invention can be deployed not only on user computers but also on mobile devices such as mobile phones and PDAs. In short, because the invention reduces the memory and processing time required for camera text image processing, it can be put to practical use on portable devices such as PDAs and mobile phones.

The above objects, features, and advantages will become clearer from the following detailed description taken together with the accompanying drawings, so that a person of ordinary skill in the art to which the present invention pertains can readily practice its technical idea. In describing the present invention, detailed descriptions of related known technology are omitted where they could unnecessarily obscure the gist of the invention. A preferred embodiment of the present invention is described in detail below with reference to the accompanying drawings.

FIG. 1 is a block diagram of an apparatus for character recognition through character region selection in a text image captured by a portable camera, according to an embodiment of the present invention.

As shown in FIG. 1, the character recognition apparatus (100) according to the present invention includes: a text image input unit (110) for receiving a text image captured with a portable camera; an image preprocessing unit (120) for preprocessing the camera text image received through the text image input unit (110) to enhance the image; an edge detection unit (130) for detecting an edge image from the text image enhanced by the image preprocessing unit (120); a pixel dilation unit (140) for dilating the pixels of the edge image detected by the edge detection unit (130); a virtual character region extraction unit (150) for extracting, from the text image output by the pixel dilation unit (140), a virtual character region that has a certain width and forms a long line; a local binarization unit (160) for performing local binarization on the virtual character region extracted by the virtual character region extraction unit (150); a character extraction unit (170) for extracting words and individual characters from the character region binarized by the local binarization unit (160), using structural feature information of the individual characters; a feature extraction unit (180) for extracting features of the characters extracted by the character extraction unit (170); and a character recognition unit (190) for classifying the character type using the features extracted by the feature extraction unit (180) and then recognizing the characters.

The operation of each component and examples thereof are described in detail with reference to FIGS. 2 to 6.

FIG. 2 is a flowchart of a method for character recognition through character region selection in a text image captured by a portable camera, according to an embodiment of the present invention.

First, the user captures various kinds of characters to be recognized with a portable camera and passes them to the character recognition apparatus (100) according to the present invention. The text image input unit (110) of the character recognition apparatus (100) receives the text image (210).

To test recognition performance on a variety of camera text images, sample images are captured with a mobile phone camera at varying distances between the camera and the document and with a variety of character fonts. Samples may also be taken from text images (document images) in which various character fonts are recorded (see FIG. 3A). The text images used in the simulation were obtained under unconstrained lighting conditions. Text images of printed characters are thus captured in indoor and outdoor environments regardless of the medium on which the characters are recorded (see FIG. 3B).

When a text image captured by a portable camera is input, however, lens distortion and focus problems can prevent the character portions from being recognized correctly. Moreover, because the images come from a low-resolution camera, the text is often blurred or overlaps with neighboring characters, which makes it very difficult to segment and recognize the character region. In the present invention, therefore, the character region is extracted first, and binarization, character extraction, and recognition then proceed as described below.

Next, an image preprocessing algorithm is applied to the received text image so that the character portions can be extracted correctly (220). Unlike a text image input through a scanner, a camera text image suffers from vignetting at the edges of the captured image and from blurring, both caused by ambient lighting. Because these factors lead to misrecognition, a method is needed that extracts and recognizes the character portions correctly.

Accordingly, the color image is converted to a gray-level (brightness) image, and gray-level normalization is then applied: pixel values that occupy only part of the gray-level range are spread evenly over the full range. This preprocessing emphasizes the character region against the background. The resulting enhanced image is shown in FIG. 4.
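The preprocessing step can be sketched as follows. This is a minimal illustration, not the patented implementation: the patent names gray-level normalization but gives no formulas, so the BT.601 luma weights and the linear min-max stretch are assumptions, and `to_gray` and `normalize_gray` are hypothetical names.

```python
def to_gray(rgb):
    """Convert rows of (r, g, b) tuples to rows of 0-255 gray levels.

    Assumption: ITU-R BT.601 luma weights; the patent does not specify
    how the color image is converted.
    """
    return [[int(round(0.299 * r + 0.587 * g + 0.114 * b)) for (r, g, b) in row]
            for row in rgb]


def normalize_gray(gray):
    """Linearly stretch the occupied gray range [lo, hi] onto [0, 255]."""
    lo = min(min(row) for row in gray)
    hi = max(max(row) for row in gray)
    if hi == lo:
        # Flat image: there is no contrast to stretch.
        return [row[:] for row in gray]
    # Integer arithmetic keeps the mapping exact and reproducible.
    return [[(v - lo) * 255 // (hi - lo) for v in row] for row in gray]
```

A low-contrast patch with values between 100 and 200 is mapped onto the full 0 to 255 range, which widens the gap between dark strokes and a brighter background.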

Next, an edge image is extracted from the preprocessed text image (230). That is, an edge image such as the one in FIG. 5A is extracted from the text image to which the preprocessing algorithm has been applied. The purpose is to find the regions of the input text image where characters are present.
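A rough sketch of the edge extraction step is given below. The patent does not say which edge operator is used; a Sobel gradient with a fixed threshold is swapped in here as an assumption, and `sobel_edges` and its `threshold` parameter are hypothetical.

```python
def sobel_edges(gray, threshold=128):
    """Return a binary edge map (1 = edge) from a gray-level image.

    Assumption: Sobel operator with an L1 gradient magnitude; border
    pixels are left as non-edge for simplicity.
    """
    h, w = len(gray), len(gray[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal gradient: right column minus left column.
            gx = (gray[y - 1][x + 1] + 2 * gray[y][x + 1] + gray[y + 1][x + 1]
                  - gray[y - 1][x - 1] - 2 * gray[y][x - 1] - gray[y + 1][x - 1])
            # Vertical gradient: bottom row minus top row.
            gy = (gray[y + 1][x - 1] + 2 * gray[y + 1][x] + gray[y + 1][x + 1]
                  - gray[y - 1][x - 1] - 2 * gray[y - 1][x] - gray[y - 1][x + 1])
            if abs(gx) + abs(gy) >= threshold:
                edges[y][x] = 1
    return edges
```

Character strokes produce dense clusters of edge pixels, while smooth background produces few, which is what makes the edge map useful for locating text.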

Next, the pixels of the edge image are dilated in order to extract the character region (240). That is, the pixels belonging to character regions in the binarized image are dilated so that discontinuous edge points become connected. The resulting image is shown in FIG. 5B.
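The dilation step might look like the following sketch. The structuring element is not specified in the patent; a square of side `2 * radius + 1` is assumed, and `dilate` and `radius` are hypothetical names.

```python
def dilate(binary, radius=1):
    """Binary dilation with a (2*radius+1) x (2*radius+1) square element.

    Assumption: square structuring element; the patent only states that
    edge pixels are dilated so that broken edge points join up.
    """
    h, w = len(binary), len(binary[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if binary[y][x]:
                # Paint the whole neighborhood of every foreground pixel.
                for ny in range(max(0, y - radius), min(h, y + radius + 1)):
                    for nx in range(max(0, x - radius), min(w, x + radius + 1)):
                        out[ny][nx] = 1
    return out
```

Two edge points separated by a one-pixel gap merge into a single blob after one pass, so the strokes of neighboring characters on a line tend to fuse into one elongated region.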

Next, long line regions of a certain height are treated as virtual character regions and extracted (250). That is, the pixel regions of the dilated text image are labeled, and long line-shaped regions whose height is at or above a predetermined threshold are treated as virtual character regions and extracted. The resulting image is shown in FIG. 5C.
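Labeling and region selection can be sketched as below. The patent gives only the height criterion and the notion of a "long line"; the 8-connectivity, the aspect-ratio test used to decide what counts as long, and the names `label_regions`, `select_text_lines`, `min_height`, and `min_aspect` are all assumptions for illustration.

```python
from collections import deque


def label_regions(binary):
    """Return bounding boxes (top, left, bottom, right) of 8-connected regions."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if binary[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue = deque([(y, x)])
                top, left, bottom, right = y, x, y, x
                while queue:  # breadth-first flood fill of one region
                    cy, cx = queue.popleft()
                    top, bottom = min(top, cy), max(bottom, cy)
                    left, right = min(left, cx), max(right, cx)
                    for ny in range(max(0, cy - 1), min(h, cy + 2)):
                        for nx in range(max(0, cx - 1), min(w, cx + 2)):
                            if binary[ny][nx] and not seen[ny][nx]:
                                seen[ny][nx] = True
                                queue.append((ny, nx))
                boxes.append((top, left, bottom, right))
    return boxes


def select_text_lines(boxes, min_height=3, min_aspect=3.0):
    """Keep boxes that are tall enough and much wider than they are tall."""
    selected = []
    for (top, left, bottom, right) in boxes:
        height = bottom - top + 1
        width = right - left + 1
        if height >= min_height and width >= min_aspect * height:
            selected.append((top, left, bottom, right))
    return selected
```

The height of each selected box doubles as an estimate of the character height, which is the quantity the description uses to size the binarization window.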

Next, local binarization is performed within the extracted virtual character region (260). That is, a binarization threshold is computed within each extracted virtual character region and locally adaptive binarization is performed, so that characters can be extracted with as little loss of character strokes as possible. As noted above, portable camera character recognition has the advantage that many kinds of real-world text can be captured and input easily, regardless of the medium on which it is written. Unlike a text image acquired by a scanner, however, a text image acquired by a camera is affected by ambient lighting, so the character region is often extracted poorly and recognition fails. To address this, a locally adaptive binarization scheme is applied: the text image is divided into sub-regions, and each sub-region is binarized according to the brightness distribution of the pixels inside it. In locally adaptive binarization, the size of the sub-window used to take neighboring pixels into account is closely tied to the size of the target characters.
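A minimal sketch of region-wise local binarization follows. The patent ties the sub-window size to the character height but does not give the threshold formula; the block-wise mean threshold, the dark-text-on-light-background polarity, and the name `binarize_region` are assumptions.

```python
def binarize_region(gray, box):
    """Binarize one virtual character region with square sub-windows.

    box is (top, left, bottom, right), inclusive. The sub-window side is
    the region height, used as the character-height estimate.
    Assumptions: per-window mean as threshold; text is darker than the
    background, so pixels below the threshold become 1.
    """
    top, left, bottom, right = box
    win = bottom - top + 1
    out = [[0] * (right - left + 1) for _ in range(win)]
    for x0 in range(left, right + 1, win):
        x1 = min(x0 + win, right + 1)
        block = [gray[y][x] for y in range(top, bottom + 1) for x in range(x0, x1)]
        threshold = sum(block) / len(block)
        for y in range(top, bottom + 1):
            for x in range(x0, x1):
                if gray[y][x] < threshold:
                    out[y - top][x - left] = 1
    return out
```

Because each window computes its own threshold, a stroke in a brightly lit part of the line and a stroke in a shadowed part can both be separated from their local background, which a single global threshold may fail to do.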

Next, words and individual characters are extracted from the locally binarized character region using a merge-and-split algorithm (270). That is, for character segmentation, words are first extracted using the vertical projection and blank-space information, and individual characters are then extracted using the merge-and-split algorithm. Unlike other scripts, Hangul forms each character by combining vowels and consonants, so individual characters can be extracted by applying the merge-and-split algorithm to connected components. Connected components that are not characters, such as tables, pictures, and non-character components produced by local binarization, are treated as noise and removed. In particular, for text images that mix Korean and English, individual characters are extracted using the structural feature information of Hangul and English letters.
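The projection-based part of this step can be sketched as follows. Only the vertical projection and gap test are shown; the merge-and-split algorithm itself is not detailed in the patent and is not reproduced here. The names `vertical_projection`, `split_on_gaps`, and `min_gap` are hypothetical.

```python
def vertical_projection(binary):
    """Count foreground pixels in each column of a binary image."""
    return [sum(col) for col in zip(*binary)]


def split_on_gaps(profile, min_gap=1):
    """Return inclusive (start, end) column spans separated by runs of at
    least min_gap empty columns."""
    spans = []
    start = None
    gap = 0
    for x, count in enumerate(profile):
        if count > 0:
            if start is None:
                start = x
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                # The span ended just before this run of empty columns.
                spans.append((start, x - gap))
                start = None
                gap = 0
    if start is not None:
        spans.append((start, len(profile) - 1 - gap))
    return spans
```

A large `min_gap` separates words, while a small one yields candidate character pieces; in the description, such pieces would then be merged or split using the structural properties of the script.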

Next, features are extracted from the extracted individual characters (280). That is, for individual character recognition, features of each character are extracted using mesh features, distance-information features, and contour information.
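As an illustration of one of these feature types, the sketch below computes a mesh feature in its commonly used form: the character image is divided into a grid and the foreground density of each cell is recorded. The patent does not define its mesh feature, so this reading, the grid size, and the name `mesh_feature` are assumptions; the distance and contour features are not shown.

```python
def mesh_feature(char, rows=2, cols=2):
    """Return per-cell foreground densities of a binary character image,
    listed row by row.

    Assumption: the "mesh feature" is taken to be grid-cell pixel density.
    """
    h, w = len(char), len(char[0])
    feature = []
    for r in range(rows):
        y0, y1 = r * h // rows, (r + 1) * h // rows
        for c in range(cols):
            x0, x1 = c * w // cols, (c + 1) * w // cols
            cell = [char[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            # Empty cells can occur when the image is smaller than the grid.
            feature.append(sum(cell) / len(cell) if cell else 0.0)
    return feature
```

The resulting fixed-length vector can be fed to a classifier regardless of the original character size.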

이후, 상기 추출한 개별 문자를 인식하여 인식 결과를 얻는다(290). 즉, 상기 추출한 개별 문자의 유형을 분류한 후에 문자를 인식한다. 이 때, 한글, 영어, 기호, 숫자가 혼용된 문자 인식을 위하여 한글에 대하여 여섯 가지 문자유형으로 분류하고 영어, 기호, 숫자를 비 한글 유형으로 분류하여 각 문자 유형별로 문자들을 인식한다.Thereafter, the extracted individual character is recognized to obtain a recognition result (290). That is, after classifying the type of the extracted individual characters, the characters are recognized. In this case, for the recognition of characters mixed with Korean, English, symbols, and numbers, Korean characters are classified into six types of letters, and English, symbols, and numbers are classified into non-Hangul types.

As described above, the present invention proposes a method that acquires a character image with a portable camera, extracts virtual character regions in advance, and then applies local binarization suited to each region in order to extract and recognize characters. To reduce the influence of ambient lighting, an image enhancement algorithm is applied, and the virtual character regions are extracted in advance so that a local binarization method suited to each region can be applied selectively. In addition, individual characters are extracted using a combination and separation algorithm and recognized by type, which reduces the burden on the recognizer and thereby reduces misrecognition. The present invention also addresses the problem that camera character images have been regarded as difficult recognition targets because of ambient lighting and camera lens effects, and it resolves the processing-time and memory problems that arise when a camera character recognition algorithm is installed on a mobile phone or similar device. Beyond the processing-time and memory problems that existing camera character recognition must solve, the invention improves camera character recognition performance by selectively choosing character regions and performing local binarization on them. Finally, the present invention allows camera character recognition technology to be installed on mobile phones or other mobile devices so that character recognition can be put to practical use.
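The pre-extraction of virtual character regions summarized above (edge detection, pixel dilation, labeling, and selection of long lines of sufficient height) can be sketched as follows. This is a minimal pure-Python illustration under stated assumptions, not the patented implementation: the simple gradient edge detector, the rectangular dilation element, and the parameters `thresh`, `rx`, `ry`, `min_height`, and `min_aspect` are all hypothetical choices, and the image is a list of rows of gray levels.

```python
from collections import deque


def edge_map(gray, thresh=40):
    """Mark pixels where the step to the right or lower neighbor exceeds thresh."""
    h, w = len(gray), len(gray[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            gx = abs(gray[y][min(x + 1, w - 1)] - gray[y][x])
            gy = abs(gray[min(y + 1, h - 1)][x] - gray[y][x])
            edges[y][x] = 1 if max(gx, gy) > thresh else 0
    return edges


def dilate(mask, rx=2, ry=1):
    """Binary dilation with a (2*ry+1) x (2*rx+1) rectangle; merges nearby strokes."""
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if mask[y][x]:
                for yy in range(max(0, y - ry), min(h, y + ry + 1)):
                    for xx in range(max(0, x - rx), min(w, x + rx + 1)):
                        out[yy][xx] = 1
    return out


def label_boxes(mask):
    """Label 4-connected components and return boxes (top, left, bottom, right)."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if mask[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue = deque([(y, x)])
                top, left, bottom, right = y, x, y, x
                while queue:
                    cy, cx = queue.popleft()
                    top, bottom = min(top, cy), max(bottom, cy)
                    left, right = min(left, cx), max(right, cx)
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                        if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                boxes.append((top, left, bottom, right))
    return boxes


def virtual_character_regions(gray, min_height=5, min_aspect=2.0):
    """Keep components tall enough and elongated enough to look like a text line."""
    regions = []
    for top, left, bottom, right in label_boxes(dilate(edge_map(gray))):
        height, width = bottom - top + 1, right - left + 1
        if height >= min_height and width >= min_aspect * height:
            regions.append((top, left, bottom, right))
    return regions
```

The height of each returned box gives an estimate of the virtual character size, which later stages can use without processing the rest of the image.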

The method of the present invention described above may be implemented as a program and stored on a computer-readable recording medium (CD-ROM, RAM, ROM, floppy disk, hard disk, magneto-optical disk, etc.). Since this process can be readily carried out by a person of ordinary skill in the art to which the present invention pertains, it is not described in further detail.

The present invention described above is not limited by the foregoing embodiments and the accompanying drawings, since a person of ordinary skill in the art to which the present invention pertains can make various substitutions, modifications, and changes without departing from the technical spirit of the present invention.

As described above, the present invention selects a character region in a character image obtained with a portable camera attached to a mobile device or the like, determines the virtual character size, uses this information to determine the size of the sub-window to be binarized, performs local binarization, and then recognizes the characters. This improves character extraction and recognition performance, reduces memory usage, and shortens processing time.

That is, in order to recognize camera character images with less influence from ambient lighting and regardless of character size, the present invention first locates the regions where characters exist, extracts the character strings, and obtains the virtual character size. It then sets the sub-window size from the virtual character size, locally binarizes the character region, and recognizes the characters. This yields better binarization results than conventional methods and thus improves camera character recognition performance. Moreover, when camera character recognition runs on a portable terminal such as a PDA or mobile phone, the image to be processed is smaller, so the memory required for camera character image processing is reduced and the processing time is shortened.
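The idea of deriving the sub-window size from the virtual character size and binarizing only the selected region can be sketched as follows. The specifics are assumptions made for illustration: the sub-window side is taken as `window_scale` times the character height, each sub-window is thresholded at the midpoint of its darkest and brightest gray levels, sub-windows with contrast below `min_contrast` are treated as background, and characters are assumed to be darker than the background. The text does not fix any of these choices.

```python
def local_binarize(gray, region, char_height, window_scale=2, min_contrast=30):
    """Binarize `region` sub-window by sub-window; 1 marks a character pixel.

    gray        : image as a list of rows of gray levels (0..255)
    region      : (top, left, bottom, right), inclusive, e.g. a virtual character region
    char_height : virtual character size used to set the sub-window side (assumed rule)
    """
    top, left, bottom, right = region
    win = max(1, window_scale * char_height)
    mask = [[0] * (right - left + 1) for _ in range(bottom - top + 1)]
    for y0 in range(top, bottom + 1, win):
        for x0 in range(left, right + 1, win):
            y1 = min(y0 + win, bottom + 1)
            x1 = min(x0 + win, right + 1)
            values = [gray[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            lo, hi = min(values), max(values)
            if hi - lo < min_contrast:
                continue                      # flat sub-window: no strokes here
            thresh = (lo + hi) / 2            # threshold local to this sub-window
            for y in range(y0, y1):
                for x in range(x0, x1):
                    if gray[y][x] < thresh:   # dark-on-light assumption
                        mask[y - top][x - left] = 1
    return mask
```

Because only the pixels inside the region are visited and each threshold is computed from a window comparable in size to a character, uneven illumination across the image affects each sub-window independently, and the working buffer is limited to the region rather than the full frame.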

Furthermore, because the present invention reduces the memory required for camera character image processing and shortens processing time as described above, it has the significant advantage that it can be installed on portable devices such as PDAs and mobile phones and put to practical use.

Claims

(Claim 1 deleted)

A character recognition apparatus using character region selection in a character image obtained by a portable camera, comprising:

character image input means for receiving a character image obtained using a portable camera;

image preprocessing means for preprocessing the camera character image received through the character image input means to improve the image;

edge detection means for detecting an edge image from the character image improved by the image preprocessing means;

pixel dilation means for dilating pixels of the edge image detected by the edge detection means;

virtual character region extraction means for labeling pixel regions of the character image dilated by the pixel dilation means and extracting, as a virtual character region, a region that forms a long line having a height equal to or greater than a predetermined threshold;

binarization means for binarizing the character region extracted by the virtual character region extraction means;

character extraction means for extracting characters from the character region binarized by the binarization means;

feature extraction means for extracting features of the characters extracted by the character extraction means; and

character recognition means for recognizing the characters using the features extracted by the feature extraction means.

The apparatus of claim 2,

wherein the image preprocessing means converts a color image into a brightness image and then performs image preprocessing using gray-level normalization, which spreads pixel values concentrated in a partial range of the brightness image evenly over the entire range of gray levels, so that the character region is emphasized.

The apparatus of claim 2 or 3,

wherein the binarization means calculates a threshold value for binarization in each extracted virtual character region and performs local adaptive binarization, so that the character extraction means can extract characters with as little loss of character strokes as possible.

The apparatus of claim 4,

wherein the character extraction means extracts words and individual characters from the character region locally binarized by the binarization means, using structural feature information such as network features, distance information features, and contour information according to the jamo arrangement of the individual characters, and

wherein the character recognition means classifies Hangul into six character types according to the features of the characters extracted by the feature extraction means, classifies English letters, symbols, and digits as a non-Hangul type, and recognizes the characters separately for each character type.

The apparatus of claim 5,

wherein, for character segmentation, the character extraction means extracts words using vertical projection and margin (blank space) information, then extracts individual characters by applying a combination and separation algorithm to the connected pixels, and removes the remaining pixels as noise.

A character recognition method using character region selection in a character image obtained by a portable camera, comprising:

an input step of receiving a character image obtained using a portable camera;

an image preprocessing step of preprocessing the received character image so that the character portions are extracted correctly;

an edge extraction step of extracting an edge image from the preprocessed character image;

a pixel dilation step of dilating pixels of the edge image in order to extract a character region;

a character region extraction step of extracting a character region from the character image on which the pixel dilation has been performed;

a binarization step of binarizing the extracted character region;

a character extraction step of extracting characters from the binarized character region;

a feature extraction step of extracting features from the extracted characters; and

a character recognition step of recognizing the extracted characters.

The method of claim 7,

wherein the input step receives character images obtained with a mobile phone camera, the images containing characters in various fonts and being captured while varying the distance between the camera and the characters.

The method of claim 7,

wherein the image preprocessing step converts a color image into a brightness image and then performs image preprocessing using gray-level normalization, which spreads pixel values concentrated in a partial range of the brightness image evenly over the entire range of gray levels, so that the character region is emphasized.

The method of any one of claims 7 to 9,

wherein the character region extraction step labels pixel regions of the character image on which the pixel dilation has been performed and extracts, as a virtual character region, a region that forms a long line having a height equal to or greater than a predetermined threshold.

The method of claim 10,

wherein the binarization step calculates a threshold for binarization in the extracted virtual character region and performs local adaptive binarization.

The method of claim 11,

wherein the character extraction step extracts words using vertical projection and margin (blank space) information for character segmentation, and then extracts individual characters using a combination and separation algorithm, according to structural feature information such as network features, distance information features, and contour information based on the jamo arrangement of the individual characters.

The method of claim 12,

wherein the character recognition step classifies the type of each extracted individual character and then recognizes the character.