KR101577824B1

KR101577824B1 - Apparatus and method for character recognition using camera

Info

Publication number: KR101577824B1
Application number: KR1020090041707A
Authority: KR
Inventors: 김지훈; 오상욱; 김상호; 황성택; 김현수
Original assignee: 삼성전자주식회사
Priority date: 2009-05-13
Filing date: 2009-05-13
Publication date: 2015-12-29
Also published as: KR20100122688A

Abstract

카메라가 구비된 단말기에서 문자 인식 성능을 향상시키기 위한 장치에 있어서, 영상 변환 모드의 진입 또는 종료를 제어하고, 상기 카메라를 통하여 수신되는 고해상도 정지 영상을 송신하며, 상기 영상 변환 모드로 진입한 경우 상기 카메라를 통하여 수신되는 프리뷰 저해상도 영상 및 저해상도 정지 화면 영상을 송신하는 사용자 조작 및 시스템 제어부와, 상기 사용자 조작 및 시스템 제어부로부터 상기 프리뷰 저해상도 영상을 기 설정된 프레임 단위로 일정한 시간 간격으로 수신하여 글로벌 모션 벡터를 검출하는 글로벌 모션 벡터 검출부와, 상기 검출된 글로벌 모션 벡터를 이용하여 상기 저해상도 정지 화면 영상을 변환하여 상기 정지 화면 영상의 왜곡을 보정하는 영상 변환부를 포함함을 특징으로 한다.An apparatus for enhancing character recognition performance in a terminal equipped with a camera, the apparatus comprising: a controller for controlling entry or termination of an image conversion mode, transmitting a high resolution still image received through the camera, A user operation and system control unit for transmitting a preview low resolution image and a low resolution still screen image received through a camera and a global motion vector receiving unit for receiving the preview low resolution image from the user operation and system control unit at predetermined time intervals, A global motion vector detecting unit for detecting a global motion vector of the still picture and a global motion vector detecting unit for detecting the global motion vector;

문자 인식, 원근법 변환, 글로벌 모션 벡터 Character recognition, perspective transformation, global motion vector

Description

[0001] APPARATUS AND METHOD FOR CHARACTER RECOGNITION USING CAMERA [0002]

본 발명은 문자 인식을 위한 것으로, 특히 카메라를 이용하여 촬영 시 왜곡된 영상을 보정하는 장치 및 방법에 관한 것이다.BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to character recognition, and more particularly, to an apparatus and method for correcting a distorted image at the time of photographing using a camera.

최근 카메라 기술의 발달로 인하여 PDA(Personal Digital Assistants)나 휴대폰 등과 같은 이동 통신 단말기에 카메라 기능을 내장시켜 정보 획득의 수단으로 활용할 수 있게 되었다. 이러한 이동 통신 단말기에 장착된 카메라를 이용한다면 실시간으로 입력되는 영상으로부터 어떠한 형태의 문자 정보도 쉽게 획득할 수 있으며, 입력된 영상으로부터 문자를 인식 및 처리함으로써 손쉽게 문자 인식 결과를 얻을 수 있다. 이러한 문자 인식 방법은 종래에 종이 문서를 스캐닝한 후 이를 인식하여 텍스트 파일이나 전자 문서의 형태로 변환하였던 방법과 달리 종이 문서뿐만 아니라 스캐너로 입력이 불가능한 문자들도 손쉽게 획득 가능하다는 점이 기존의 스캐너 기반 문자 인식과 차별화된다.Recently, due to the development of camera technology, it has become possible to embed camera functions in mobile communication terminals such as PDA (Personal Digital Assistants) and mobile phones as a means of acquiring information. If a camera mounted on the mobile communication terminal is used, any type of character information can be easily obtained from an image input in real time, and character recognition results can be easily obtained by recognizing and processing characters from the input image. Unlike a conventional method in which a paper document is scanned and then recognized as a text file or an electronic document, characters that can not be input by a scanner as well as a paper document can be easily acquired. It differs from character recognition.

상기한 바와 같이 카메라를 이용한 문자 인식 방법은 입력 대상 문자들이 기록되어 있는 매체에 상관없이 문자 획득 및 인식이 가능할 뿐만 아니라 휴대가 용이 한 이동 통신 단말기에서도 활용할 수 있기 때문에 최근 활발히 연구되고 있는 문자 인식 분야 중의 하나이다. 이동 통신 단말기와 같은 문자 인식 장치에 장착된 카메라를 이용한 스냅샷(Snapshot) 기반 영상 문자 인식 방법은 사용자가 원하는 영상을 촬영하여 이를 저장한 후에 저장된 영상 내에 포함된 하나 이상의 문자를 인식하는 방법이다. 인식된 하나 이상의 문자는 단어로 조합되고 사전 DB를 통하여 조합된 단어의 해당 의미를 검출하여 화면에 표시해 준다. 이 때에 촬영된 정지 영상에 포함된 각각의 단어에 대해 사용자가 직접 포인팅하거나 클릭하여 그 의미를 확인할 수 있다.As described above, since the character recognition method using the camera can acquire and recognize the character regardless of the medium on which the input target characters are recorded and can be utilized in the mobile communication terminal which is easy to carry, Lt; / RTI > A Snapshot-based image character recognition method using a camera mounted on a character recognition device such as a mobile communication terminal is a method of capturing a desired image of a user and recognizing one or more characters contained in the stored image after storing the captured image. The recognized one or more characters are combined into words, and the corresponding meaning of the word combined through the dictionary DB is detected and displayed on the screen. At this time, the user can directly point or click on each word included in the still image photographed to check the meaning thereof.

이하, 일반적인 스냅샷 기반의 문자 인식 과정을 도 1을 참조하여 살펴보면 다음과 같다. 도 1은 기존의 문자 인식 과정을 나타낸 흐름도이다. 도 1을 참조하면, 101단계에서 사용자는 문자가 포함된 영상을 촬영하기 전에 문자 인식 장치에 장착된 카메라로부터 수신되는 프리뷰 영상을 보면서 인식 대상이 제대로 화면에 나타나는지를 판단하여 원하는 문자가 포함된 영상의 촬영을 준비한다. 프리뷰 영상은 실시간으로 문자 인식 장치의 화면에 표시된다. 103단계에서 사용자는 자신이 원하는 문자가 포함된 영상을 촬영한다. 105단계에서 촬영된 영상은 정지 영상으로 획득되어 문자 인식 장치의 메모리부에 저장된다. 107단계로 진행하여 문자 인식 장치는 메모리부에 저장되어 있는 정지 영상에 포함된 문자 인식을 수행한다. 109단계에서 문자 인식의 결과를 문자 인식 장치의 화면에 표시한다.Hereinafter, a general snapshot-based character recognition process will be described with reference to FIG. FIG. 1 is a flowchart illustrating an existing character recognition process. 1, in step 101, a user views a preview image received from a camera mounted on a character recognition apparatus to determine whether a recognition target is displayed properly on a screen before capturing an image including a character, . The preview image is displayed on the screen of the character recognition device in real time. In step 103, the user captures an image including a desired character. The image photographed in step 105 is acquired as a still image and stored in the memory unit of the character recognition apparatus. In step 107, the character recognition apparatus recognizes the character included in the still image stored in the memory unit. In step 109, the result of the character recognition is displayed on the screen of the character recognition apparatus.

상기와 같이 카메라를 이용한 스냅샷 기반 영상 문자 인식의 경우에는 인식하고자하는 문자 영상의 입력이 용이하고 실시간으로 입력된 문자 인식을 수행한 결과를 문자 인식 장치의 화면에 표시할 수 있다는 점에서 손쉽게 문자 인식이 가능한 이점이 있다. 그러나 이러한 스냅샷 기반 영상 문자 인식의 경우 촬영 당시 카메라의 방향과 인식 대상의 방향이 기울어지면 오인식이 생길 수 있다. 즉 카메라의 방향과 문자가 포함된 영상의 방향이 서로 수직을 이루어야만 제대로 된 인식 결과를 얻을 수 있다. 만약 방향이 기울어진 상태에서 촬영을 할 경우 영상에 포함된 문자에 왜곡이 생길 수 있으며 이는 오인식이 발생하는 주요한 원인이 된다. In the case of the snapshot-based image character recognition using the camera as described above, since it is easy to input the character image to be recognized and the result of the character recognition performed in real time can be displayed on the screen of the character recognition device, There is an appreciable benefit. However, in the case of the snapshot-based image character recognition, if the direction of the camera and the direction of the recognition object are tilted at the time of shooting, That is, if the direction of the camera and the direction of the image including the characters are perpendicular to each other, a proper recognition result can be obtained. If you take a picture with a tilted direction, the characters included in the image may be distorted, which is a major cause of misreading.

도 2는 기존의 영상을 촬영하는 기울기를 나타낸 예시도이다. 도 2의 (a)와 같이 사용자가 촬영하기 편리한 자세로 영상의 촬영을 수행하였을 경우 인식 대상과 카메라의 각도가 비스듬하게 되어 영상의 왜곡이 발생한다. 하지만 도 2의 (b)와 같이 인식 대상과 카메라의 각도가 수직을 이루는 상태에서 촬영을 수행하였을 경우 영상의 왜곡은 현저하게 줄어들 수는 있으나 사용자에게 불편한 자세를 요구하게 된다.2 is an exemplary view showing a slope for photographing an existing image. As shown in (a) of FIG. 2, when the user shoots an image in a posture that is convenient for shooting, the angle of the camera to be recognized and the angle of the camera become obliquely distorted. However, if the photographing is performed in a state where the angle of the camera to be recognized and the angle of the camera to be recognized are perpendicular to each other as shown in FIG. 2 (b), distortion of the image may be remarkably reduced, but the user may feel uncomfortable.

따라서 사용자가 촬영하기 편리한 자세로 영상의 촬영을 수행하여도 간단한 보정으로 촬영 영상의 왜곡을 줄일 수 있는 방법이 요구된다.Accordingly, there is a need for a method that can reduce the distortion of the photographed image by simple correction even if the user shoots the image in a posture that is easy to photograph.

본 발명의 목적은 커뮤니케이션을 위한 향상된 장치 및 방법을 제공하는 것이고, 본 발명의 다른 목적은 특성화된 음성메일 인사 메시지를 생성 가능한 장치 및 방법을 제공하는 것이다.It is an object of the present invention to provide an improved apparatus and method for communication, and another object of the present invention is to provide an apparatus and method capable of generating a personalized voice mail greeting message.

본 발명의 일 견지에 따르면, 카메라가 구비된 단말기에서 문자 인식 성능을 향상시키기 위한 장치에 있어서, 영상 변환 모드의 진입 또는 종료를 제어하고, 상기 카메라를 통하여 수신되는 고해상도 정지 영상을 송신하며, 상기 영상 변환 모드로 진입한 경우 상기 카메라를 통하여 수신되는 프리뷰 저해상도 영상 및 저해상도 정지 화면 영상을 송신하는 사용자 조작 및 시스템 제어부와, 상기 사용자 조작 및 시스템 제어부로부터 상기 프리뷰 저해상도 영상을 기 설정된 프레임 단위로 일정한 시간 간격으로 수신하여 글로벌 모션 벡터를 검출하는 글로벌 모션 벡터 검출부와, 상기 검출된 글로벌 모션 벡터를 이용하여 상기 저해상도 정지 화면 영상을 변환하여 상기 정지 화면 영상의 왜곡을 보정하는 영상 변환부를 포함함을 특징으로 한다.According to an aspect of the present invention, there is provided an apparatus for enhancing character recognition performance in a terminal equipped with a camera, the apparatus comprising: a controller for controlling entry or termination of an image conversion mode, transmitting a high resolution still image received through the camera, A user operation and system control unit for transmitting a preview low-resolution image and a low-resolution still image received through the camera when the image input mode is entered; and a display unit for displaying the preview low- A global motion vector detection unit that receives the global motion vector at intervals and detects a global motion vector and an image conversion unit that corrects distortion of the still image by transforming the low resolution still image using the detected global motion vector, do.

본 발명의 다른 견지에 따르면, 카메라가 구비된 단말기에 문자 인식 성능을 향상시키기 위한 방법에 있어서, 사용자 입력에 의해 저해상도 정지 화면 영상과 고해상도 정지 화면 영상을 저장하는 과정과, 상기 저해상도 정지 화면 영상에 왜곡이 있는지 판단하는 과정과, 왜곡이 있는 경우 상기 카메라로부터 수신되는 프리 뷰 저해상도 영상을 이용하여 글로벌 모션 벡터를 추출하는 과정과, 상기 추출된 글로벌 모션 벡터를 이용하여 상기 저해상도 및 고해상도 정지 화면 영상을 변환하여 왜곡을 보정하는 과정과, 상기 왜곡이 보정된 고해상도 정지 화면 영상 내에 포함된 문자를 인식한 후 인식 결과를 출력하는 과정을 포함함을 특징으로 한다.According to another aspect of the present invention, there is provided a method for enhancing character recognition performance in a terminal equipped with a camera, the method comprising: storing a low resolution still image and a high resolution still image by user input; Extracting a global motion vector using a pre-view low-resolution image received from the camera when there is a distortion; extracting a low-resolution and high-resolution still image using the extracted global motion vector; And outputting the recognition result after recognizing the characters included in the high-resolution still picture image in which the distortion is corrected.

본 발명은 촬영 기반 문자 인식을 수행하기 전에 영상 변환 모드를 추가 선택하는 방법을 사용함으로써, 사용자에게 제한된 촬영 자세를 요구하는 방법과는 달리 문자를 인식하기 전에 수신된 영상을 보정하여 사용자 편의성 증대와 인식률 향상시키는 효과가 있다.The present invention uses a method of additionally selecting an image conversion mode before performing shooting-based character recognition. Unlike the method of requesting a limited photographing posture, the present invention corrects a received image before recognizing a character, There is an effect of improving the recognition rate.

이하 본 발명에 따른 바람직한 실시 예를 첨부한 도면을 참조하여 상세히 설명한다. 하기 설명에서는 구체적인 구성 소자 등과 같은 특정 사항들이 나타나고 있는데 이는 본 발명의 보다 전반적인 이해를 돕기 위해서 제공된 것일 뿐 이러한 특정 사항들이 본 발명의 범위 내에서 소정의 변형이나 혹은 변경이 이루어질 수 있음은 이 기술 분야에서 통상의 지식을 가진 자에게는 자명하다 할 것이다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. It will be appreciated that those skilled in the art will readily observe that certain changes in form and detail may be made therein without departing from the spirit and scope of the present invention as defined by the appended claims. To those of ordinary skill in the art.

본 발명은 이동 통신 단말기에 장착된 카메라를 이용하여 문자 인식의 성능을 향상시키기 위한 방법을 제안한다. 이를 위해 본 발명은 기존의 스냅샷 기반의 영상 문자 인식을 수행하는 과정과 입력된 영상을 변환하는 과정으로 이루어진다. 특히 본 발명에서 입력된 영상을 변환하여 왜곡을 최소화 하는 방법을 제안함으로써 사용자의 촬영 자세에 구애받지 않고도 인식률이 향상된 문자 인식 성능을 기대할 수 있다.The present invention proposes a method for improving the performance of character recognition using a camera mounted on a mobile communication terminal. To this end, the present invention comprises a process of performing recognition of an existing snapshot-based image character and a process of converting the input image. In particular, by proposing a method of minimizing distortion by converting an input image in the present invention, it is possible to expect a character recognition performance with an improved recognition rate without being affected by the photographing posture of a user.

상기한 바와 같은 기능이 구현된 문자 인식 장치의 구성 요소 및 그 동작을 도 3을 참조하여 설명한다. 도 3은 본 발명의 일 실시 예에 따른 문자 인식 장치의 내부 블록 구성도이다. 도 3을 살펴보면, 문자 인식 장치는 카메라를 통한 동작 제어 사용자 인터페이스(Gesture Control UI)를 사용하여 장치 자체의 움직임만으로 왜곡된 영상을 제어하는 기능을 수행한다. 이러한 기능을 제공하기 위한 문자 인식 장치는 크게 카메라부(301), 사용자 조작 및 시스템 제어부(303), 움직임 벡터 검출부(305), 메모리(307), 문자 인식부(309), 영상 변환부(311)로 구성된다. The components of the character recognition apparatus having the functions as described above and their operation will be described with reference to FIG. 3 is an internal block diagram of a character recognition apparatus according to an embodiment of the present invention. Referring to FIG. 3, the character recognition apparatus performs a function of controlling a distorted image only by the movement of the apparatus itself using a gesture control UI through a camera. The character recognition apparatus for providing such functions includes a camera unit 301, a user operation and system control unit 303, a motion vector detection unit 305, a memory 307, a character recognition unit 309, an image conversion unit 311 ).

먼저 카메라부(301)는 매 프레임마다 실시간으로 영상을 받아들여 프리뷰(Preview) 영상으로 문자 인식 장치의 화면에 표시해주는 역할을 한다. 이 때에 실시간으로 영상에 표시되는 프리뷰 영상은 저해상도로 화면에 표시되고, 사용자가 특정 정지 영상을 촬영하기 위한 신호를 보낸 경우 해당 장면을 미리 설정된 고해상도로 촬영한다.First, the camera unit 301 receives an image in every frame in real time and displays the preview image on the screen of the character recognition apparatus. At this time, the preview image displayed on the image in real time is displayed on the screen with a low resolution, and when the user sends a signal for capturing a specific still image, the preview image is shot with a preset high resolution.

메모리부(307)는 사용자가 촬영을 시작하여 획득한 고해상도 정지 영상을 저장하는 역할을 한다. 또한 촬영 직전에 화면에 표시된 프리뷰 저해상도 정지 화면 영상도 저장한다. 촬영 직전에 화면에 표시된 프리뷰 저해상도 정지 화면 영상은 영상 변환 모드에서 고해상도 정지 화면 영상의 보간을 위해 사용된다. 도면에는 따로 표시하지 않았지만 메모리(307)는 문자 인식 장치 외부에 따로 위치 할 수도 있으며, 프리뷰 저해상도 영상은 임시 메모리와 같은 저장 공간에 저장될 수도 있다.The memory unit 307 stores the high-resolution still image acquired by the user at the start of shooting. It also saves the preview low resolution still image displayed on the screen immediately before shooting. The preview low resolution still image displayed on the screen immediately before shooting is used for interpolation of high resolution still image in image conversion mode. Although the memory 307 is not separately shown in the drawing, the memory 307 may be separately located outside the character recognition apparatus, and the preview low resolution image may be stored in a storage space such as a temporary memory.

사용자 조작 및 시스템 제어부(303)는 사용자가 영상 변환 모드로 진입하였을 경우 문자 인식 장치의 움직임에 따라 영상을 변환하는 역할을 한다. 사용자는 획득한 영상의 변환이 필요하다고 판단되면 사용자 선택에 따라 영상 변환 모드로 진입을 결정할 수 있으며, 이는 문자 인식 장치 내에 위치한 특정 키 버튼과 같은 입력으로 진입 여부를 결정할 수 있다. 사용자가 영상 진입 모드로 진입하여 문자 인식 장치를 움직이면 카메라부(301)는 실시간으로 수신되는 프리뷰 저해상도 영상을 움직임 벡터 검출부(305)에 전달한다.The user operation and system control unit 303 converts the image according to the movement of the character recognition apparatus when the user enters the image conversion mode. If it is determined that the acquired image needs to be converted, the user can determine the entry into the image conversion mode according to the user selection, and can determine whether to enter the input with the specific key button located in the character recognition device. When the user enters the image entry mode and moves the character recognition apparatus, the camera unit 301 delivers the preview low resolution image received in real time to the motion vector detection unit 305. [

움직임 벡터 검출부(305)는 매 프레임마다 카메라부(301)로부터 수신되는 프리뷰 저해상도 영상을 수신하여 글로벌 모션 벡터(Global Motion Vector)를 검출하는 역할을 한다. The motion vector detection unit 305 receives a preview low-resolution image received from the camera unit 301 every frame and detects a global motion vector.

영상 변환부(311)에서는 상기와 같이 검출한 움직임 벡터의 방향에 따라서 기 설정된 방법에 따라서 영상을 변환하는 역할을 한다. 최종 변환된 프리뷰 저해상도 영상에 맞춰 메모리부(307) 또는 임시 메모리에 저장된 저해상도 정지 화면 영상을 변환한다. 사용자는 변환된 저해상도 정지 화면 영상을 문자 인식 장치의 화면에서 확인하고 영상 변환이 추가로 필요하다고 판단되면 다시 영상 변환을 수행할 수 있다. 만약 영상 변환이 필요없다고 판단되면 저해상도 정지 화면 영상의 변환에 사용한 계수를 고해상도 정지 화면 영상에 적용하여 고해상도 정지 화면 영상을 변환한다.The image transforming unit 311 transforms the image according to a predetermined method according to the direction of the detected motion vector. Resolution still picture image stored in the memory unit 307 or the temporary memory in accordance with the finally-converted preview low-resolution image. The user can confirm the converted low resolution still image on the screen of the character recognition device and perform image conversion again if it is determined that the image conversion is further needed. If it is judged that the image conversion is not necessary, the coefficients used for the conversion of the low resolution still image are applied to the high resolution still image to convert the high resolution still image.

문자 인식부(309)는 메모리부(307)에 저장된 고해상도 정지 화면 영상에 포함되어 있는 문자를 기 설정된 방법으로 인식하여 인식 결과를 문자 인식 장치의 화면에 표시한다. The character recognition unit 309 recognizes characters included in the high-resolution still picture image stored in the memory unit 307 by a predetermined method and displays the recognition result on the screen of the character recognition apparatus.

상기와 같은 구성을 가지는 문자 인식 장치는 사용자의 입력에 따라서 영상 변환을 실행하면서 기존보다 문자 인식 성능을 향상시킬 수 있으며, 특히 이동 통신 단말기와 같은 휴대용 기기에서의 실시간 문자 인식을 실행할 경우 상기와 같은 영상 변환 기술은 매우 중요한 역할을 하게 된다.The character recognition apparatus having the above configuration can improve the character recognition performance while performing the image conversion according to the input of the user. In particular, when real-time character recognition is performed in a portable device such as a mobile communication terminal, Image conversion technology plays a very important role.

위와 같은 영상 변환을 위해 이하 설명되는 본 발명의 주요한 기술적 특징을 다음과 같이 구분할 수 있다. 첫 번째 단계는 사용자에 의해 영상 변환이 시작되면 실시간으로 수신되는 프리뷰 저해상도 영상을 이용하여 글로벌 모션 벡터를 획득하는 단계이다. 두 번째 단계는 획득한 모션 벡터를 이용하여 원근법에 따른 영상 변환을 실행하여 왜곡된 부분을 보정하는 단계이다. 이러한 단계들을 통해 본 발명에서는 보다 높은 문자 인식 성능을 얻을 수 있게 된다.The main technical features of the present invention, which will be described below, for the above-mentioned image conversion can be classified as follows. The first step is to acquire the global motion vector using the preview low resolution image received in real time when the image conversion is started by the user. The second step is to perform the image transformation according to the perspective method using the obtained motion vector to correct the distorted part. Through these steps, higher character recognition performance can be obtained in the present invention.

이하, 본 발명의 일 실시 예에 따른 문자 인식 과정을 도 4를 참조하여 설명한다. 도 4는 본 발명의 일 실시 예에 따른 문자 인식 과정을 나타낸 흐름도이다. Hereinafter, a character recognition process according to an embodiment of the present invention will be described with reference to FIG. 4 is a flowchart illustrating a character recognition process according to an exemplary embodiment of the present invention.

도 4를 참조하면, 401단계에서 문자 인식 장치는 카메라부를 이용하여 실시간으로 프리뷰 저해상도 영상을 얻는다. 이 프리뷰 저해상도 영상은 문자 인식 장치의 화면에 표시된다. 403단계에서 사용자가 원하는 정지 화면 영상을 문자 인식 장치의 카메라부를 통하여 촬영한다. 촬영된 정지 영상은 문자 인식을 위한 고해상도 정지 화면 영상과 영상 변환 모드에 사용될 저해상도 정지 화면 영상으로 구분되고 405단계에서 촬영된 고해상도 정지 화면 영상은 문자 인식 장치의 메모리부에 저장되고, 저해상도 정지 화면 영상은 메모리부 또는 임시 메모리에 저장된다. 문자 인식 장치의 화면에는 저해상도 정지 화면 영상이 표시된다.Referring to FIG. 4, in step 401, the character recognition apparatus obtains a preview low-resolution image in real time using a camera unit. This preview low-resolution image is displayed on the screen of the character recognition apparatus. In step 403, the still image desired by the user is photographed through the camera unit of the character recognition apparatus. The captured still image is divided into a high resolution still image for character recognition and a low resolution still image for use in the image conversion mode. The high resolution still image captured at step 405 is stored in the memory of the character recognition device. Is stored in the memory unit or in the temporary memory. A low-resolution still picture image is displayed on the screen of the character recognition apparatus.

407단계에서 사용자는 화면에 표시된 저해상도 정지 화면 영상의 왜곡이 있는지를 판단하여 정지 화면 영상의 변환이 필요한지를 판단한다. 만약 정지 화면 영상의 변환이 필요하지 않다면, 문자 인식을 수행하는 409단계로 진행하여 문자 인식 장치가 고해상도 정지 화면 영상에 포함된 문자를 인식한다. 문자 인식이 끝나면 411단계에서 문자 인식 결과를 문자 인식 장치의 화면에 출력한다.In step 407, the user determines whether there is distortion in the low-resolution still picture image displayed on the screen, and determines whether the still picture image needs to be transformed. If the conversion of the still image is not required, the character recognition apparatus recognizes the character included in the high resolution still image by proceeding to step 409 for performing character recognition. When the character recognition is completed, the character recognition result is output to the screen of the character recognition apparatus in step 411.

만약 정지 화면 영상의 변환이 필요하다면, 프리뷰 저해상도 영상을 수신하는 413단계로 진행한다. 문자 인식 장치는 카메라부를 이용하여 프리뷰 저해상도 영상을 수신한다. 단 수신되는 프리뷰 저해상도 영상은 사용자에게 보여 지기 위한 것이 아니라 글로벌 모션 벡터의 검출하여 저해상도 정지 화면 영상을 변환하기 위한 것이므로 문자 인식 장치의 화면에 출력할 필요가 없고 변환된 저해상도 정지 화면 영상이 출력된다. 415단계에서 프리뷰 저해상도 영상은 기 설정된 FPS(Frame Per Second)에 따라서 일정 시간 간격을 두고 입력되는 프레임 영상으로 획득된다. 이 프레임 영상은 문자 인식 장치의 움직임 변화에 따라 다양하게 입력될 수 있다.If it is necessary to convert the still image, the flow advances to step 413 to receive the preview low resolution image. The character recognition apparatus receives the preview low-resolution image using the camera unit. The received preview low resolution image is not intended to be displayed to the user but to detect the global motion vector and convert the low resolution still image, so that it is not necessary to output the image on the screen of the character recognition device and the converted low resolution still image is output. In step 415, the preview low-resolution image is acquired as a frame image input at predetermined time intervals according to a predetermined frame per second (FPS). This frame image can be inputted variously according to the movement change of the character recognition device.

417단계에서 문자 인식 장치는 하나 이상의 프레임 영상, 즉 이전 프레임 영상과 현재 프레임 영상 내부에 존재하는 객체 및 특징들을 비교하여 글로벌 모션 벡터를 획득한다. 여기서 프레임 영상 내부에 존재하는 객체는 영상 내에 존재하는 사물들(책상, 꽃, 건물 등)을 의미하고, 프레임 영상 내부에 존재하는 특징은 이러한 사물들이 변화하는 정도를 의미한다. 만약 카메라부를 통해서 들어온 영상이 단색의 배경이거나 흰색의 벽인 경우, 이전 프레임 영상과 현재 프레임 영상 사이의 변화를 찾을 수 없기 때문에 글로벌 모션 벡터를 찾을 수 없게 된다. 이러한 글로 벌 모션 벡터는 프레임 영상 전체가 어느 방향으로 움직이는지, 즉 문자 인식 장치 움직임 변화를 알려주는 척도가 된다. 글로벌 모션 벡터의 검출 과정은 문자 인식 장치의 화면에는 출력되지 않으며, 글로벌 모션 벡터를 구하는 방법은 하기의 도 5 및 도 6를 참조하여 좀 더 상세히 설명하기로 한다. In step 417, the character recognition apparatus compares at least one frame image, that is, an object and features existing in the current frame image with the previous frame image to obtain a global motion vector. Here, an object existing in a frame image means objects (desk, flower, building, etc.) existing in the image, and a characteristic existing in a frame image means a degree of change of such objects. If the image input through the camera section is a solid background or a white wall, the global motion vector can not be found because no change can be found between the previous frame image and the current frame image. This global motion vector is a measure of the direction in which the frame image moves in its entirety, that is, the movement of the character recognition device. The detection process of the global motion vector is not output to the screen of the character recognition apparatus, and a method of obtaining the global motion vector will be described in more detail with reference to FIGS. 5 and 6. FIG.

419단계에서 문자 인식 장치는 글로벌 모션 벡터에 따라서 저해상도 정지 화면 영상의 변환을 수행한다. 예를 들어 문자 인식 장치가 위 방향으로 움직인 이후 프레임 영상을 획득하였다면 이를 현제 프레임 영상으로 하여 위 방향으로 움직이기 전 획득된 이전 프레임 영상과 비교하여 글로벌 모션 벡터를 검출한다. 검출된 모션 벡터는 저해상도 정지 화면 영상에 적용되어 실시간으로 문자 인식 장치의 화면에 출력된다.In step 419, the character recognition apparatus performs conversion of the low resolution still image according to the global motion vector. For example, if the character recognition apparatus obtains a frame image after moving in the upward direction, the current frame image is compared with the previous frame image obtained before moving in the upward direction to detect a global motion vector. The detected motion vector is applied to the low resolution still image and displayed on the screen of the character recognition device in real time.

421단계에서 문자 인식 장치는 최종 변환된 저해상도 정지 화면 영상을 문자 인식 장치의 화면에 출력한다. 423단계에서 사용자는 변환된 저해상도 정지 화면 영상이 추가적인 변환이 필요한지의 여부를 판단한다. 만약 추가적인 변환이 더 필요하다면 413단계에서 421단계의 과정을 반복적으로 수행하고, 추가적인 변환이 더 필요하지 않다면 425단계로 진행하여 최종 변환된 저해상도 정지 화면 영상의 계수를 고해상도 정지 화면 영상에 적용하여 문자 인식 장치의 메모리부에 저장한다. 이 후 409단계에서 변환된 고해상도 정지 화면 영상에 포함된 문자의 인식이 수행되고, 411단계에서 문자 인식 결과를 문자 인식 장치의 화면에 출력한다.In step 421, the character recognition apparatus outputs the finally converted low resolution still image on the screen of the character recognition apparatus. In step 423, the user determines whether the converted low resolution still picture image needs further conversion. If additional conversion is required, the process of steps 413 and 421 is repeatedly performed. If additional conversion is not required, the process proceeds to step 425 where the coefficient of the final low-resolution still image is applied to the high- And stores it in the memory unit of the recognition apparatus. Thereafter, in step 409, recognition of the characters included in the converted high-resolution still image is performed, and in step 411, the character recognition result is output to the screen of the character recognition apparatus.

도 5는 본 발명의 일 실시 예에 따른 방향에 따른 영상 변환을 나타낸 예시도이다. 본 도면은 문자 인식 장치의 방향에 따라서 원근법에 따른 변환(Perspective Transform)을 적용하여 메모리부에 저장된 정지 영상을 변환하는 과정을 도시하고 있다. 5 is a diagram illustrating an image conversion according to a direction according to an embodiment of the present invention. This figure shows a process of converting a still image stored in a memory unit by applying a perspective transformation according to a perspective method according to the direction of a character recognition apparatus.

도 5의 501은 문자 인식 장치가 위 방향으로 움직였을 경우 정지 영상의 아래 부분이 줄어들고 위 부분이 늘어나는 변환이 실행된다. 이 때에 비스듬하게 촬영된 정지 영상을 카메라부가 영상을 수직으로 촬영한 영상으로 변환하여야 하므로 정지 영상의 위 부분은 아래 방향으로 업 스케일링(Up-Scaling) 보간이 수행되고, 정지 영상의 아래 부분은 도 5의 501에 도시된 바와 같이 양 쪽에서 좁아지는 다운 스케일링(Down scaling) 보간이 수행된다. 이는 정지 영상에서 글로벌 모션 벡터의 방향과 반대 방향의 부분이 화면에서 멀어지는 효과를 가지고 반대로 글로벌 모션 벡터의 방향과 같은 방향은 화면에서 가까워지는 효과를 가져 화면 전체의 균형을 맞추는 역할을 한다. In FIG. 5, when the character recognition apparatus is moved upward, the lower part of the still image is reduced and the upper part is expanded. Up-scaling interpolation is performed in the upper part of the still image and up-scaling is performed in the lower part of the still image because the still image photographed obliquely at this time is converted into the vertically photographed image of the camera additional image. Down scaling interpolation that is narrowed on both sides is performed as shown in 501 of FIG. In the still image, the direction opposite to the direction of the global motion vector moves away from the screen. On the other hand, the direction of the global motion vector has the effect of approaching the screen, thereby balancing the entire screen.

이와 같이 나머지 503, 505, 507 또한 각각의 방향에 따라서 원근법 변환이 수행된다. 이 때에 변환의 크기 또는 변환의 정도는 글로벌 모션 벡터의 크기에 따라서 변화시킬 수 있고, 방향은 오른쪽을 기준(0도)으로 하여 모든 방향을 각도로 표현할 수 있다. 기준 각도로 지정되는 방향은 변경이 가능하다. Thus, the remaining 503, 505, and 507 and perspective transformation are performed along the respective directions. At this time, the magnitude of the transformation or the degree of transformation can be changed according to the size of the global motion vector, and the direction can be expressed as an angle with the right side as the reference (0 degree). The direction specified by the reference angle can be changed.

상기와 같은 과정을 통하여 변환된 정지 영상은 실시간으로 화면에 표시될 수 있으며, 사용자는 화면을 보고 영상의 왜곡이 충분히 보정되었는지의 여부를 판단한다. 충분히 보정되었다면 영상 변환 모드를 종료하게 되고, 충분히 보정되지 않았다면 보정이 필요한 방향으로 원근법 변환을 다시 수행하여 정지 영상을 변환한다. 영상 변환 모드의 종료는 터치스크린 또는 버튼의 입력 등으로 실행이 가능하 다. 글로벌 모션 벡터에 의한 원근법 변환의 구체적인 설명은 하기에서 설명하기로 한다.The converted still image can be displayed on the screen in real time, and the user can see the screen to determine whether the distortion of the image is sufficiently corrected. If the image has been sufficiently corrected, the image conversion mode is terminated. If the image is not sufficiently corrected, the perspective transformation is performed again in the direction requiring correction to convert the still image. The end of image conversion mode can be done by inputting touch screen or button. A detailed description of the perspective transformation by the global motion vector will be described below.

도 6은 본 발명의 일 실시 예에 따른 글로벌 모션 벡터에 의한 원근법 변환을 이용하여 정지 영상을 변환하는 과정을 나타낸 흐름도이다. FIG. 6 is a flowchart illustrating a still image conversion process using perspective transformation using a global motion vector according to an exemplary embodiment of the present invention. Referring to FIG.

도 6을 참조하면, 601단계에서 문자 인식 장치의 카메라부는 저장된 정지 영상의 변환을 위하여 프리뷰 저해상도 영상을 수신한다. 프리뷰 저해상도 영상은 기 설정된 FPS(Frame Per Second)에 따라서 일정 시간 간격을 두고 입력되는 프레임 영상으로 획득되며, 이러한 프레임 영상은 사용자에게 보여 지기 위한 것이 아니라 글로벌 모션 벡터의 검출하여 정지 영상을 변환하기 위해 사용되므로 문자 인식 장치의 화면에 출력되지는 않는다. 603단계에서 문자 인식 장치는 수신되는 프레임 영상들로부터 글로벌 모션 벡터를 검출한다. 글로벌 모션 벡터의 검출은 수신되는 프레임 영상들 중 이전 프레임 영상과 현재 프레임 영상을 비교하여 영상 내부에 존재하는 객체 및 특징들로부터 글로벌 모션 벡터를 검출할 수 있다.Referring to FIG. 6, in step 601, the camera unit of the character recognition apparatus receives a preview low-resolution image for conversion of a stored still image. The preview low-resolution image is obtained as a frame image input with a predetermined time interval according to a predetermined frame per second (FPS), and the frame image is not displayed for the user but for detecting a global motion vector and converting the still image It is not output to the screen of the character recognition apparatus. In step 603, the character recognition apparatus detects a global motion vector from received frame images. The global motion vector may be detected by comparing a previous frame image and a current frame image among the received frame images to detect a global motion vector from objects and features existing in the image.

605단계에서는 이렇게 검출된 글로벌 모션 벡터의 크기와 각도를 계산한다. 글로벌 모션 벡터는 크기와 각도에 따라 모든 2차원 상의 위치를 표현할 수 있으며, 0도에서부터 360도 사이에서 모든 방향을 나타낼 수 있다. 예를 들어 오른쪽 방향을 0도라고 가정했을 경우 반시계 방향으로 위쪽 방향은 90도, 왼쪽 방향은 180도, 아래쪽 방향은 270도가 될 수 있다. 607단계에서 검출된 글로벌 모션 벡터의 크기는 매우 가변적이므로 기 설정된 임계값과 검출된 글로벌 모션 벡터의 크기를 비교한다. 만약 기 설정된 임계값 이상으로 글로벌 모션 벡터가 검출된다면 609단계에 서 글로벌 모션 벡터의 크기는 변환 계수와 비례하여 증감한다. 여기서 변환 계수는 원근법 변환의 변환 정도를 의미하고 원근법 변환의 변환 정도에 따라서 정지 영상이 변환된다. 예를 들어 도 5의 501과 같이 위쪽 방향으로 보간이 필요한 경우 문자 인식 장치를 위쪽 방향으로 원근법 변환하여 검출된 변환 계수에 따라서 글로벌 모션 벡터의 크기가 증감하고 증감된 크기가 기 설정된 임계값 이상인 경우 이를 바탕으로 정지 영상의 변환이 수행된다. 만약 기 설정된 임계값 이상으로 글로벌 모션 벡터가 검출되지 않았다면 601단계부터 605단계까지를 반복한다.In step 605, the size and angle of the detected global motion vector are calculated. The global motion vector can represent all two-dimensional positions according to size and angle, and can represent all directions between 0 and 360 degrees. For example, assuming that the right direction is 0 degree, the upward direction in the counterclockwise direction may be 90 degrees, the left direction 180 degrees, and the downward direction 270 degrees. Since the size of the global motion vector detected in step 607 is very variable, a predetermined threshold value is compared with the size of the detected global motion vector. If the global motion vector is detected to exceed the predetermined threshold value, the magnitude of the global motion vector increases or decreases in proportion to the transform coefficient. Here, the transform coefficient means the degree of transform of the perspective transform, and the still image is transformed according to the transform degree of the perspective transform. For example, when interpolation is required in the upward direction as shown in 501 of FIG. 5, if the size of the global motion vector increases or decreases according to the detected transformation coefficient by converting the character recognition apparatus in the upward direction, Based on this, still image conversion is performed. If the global motion vector is not detected above the predetermined threshold value, steps 601 to 605 are repeated.

영상 변환 모드가 종료되면, 메모리부에 저장되어 있는 고해상도 정지 화면 영상을 변환한다. 이 때에 저해상도 정지 화면 영상 변환에 사용한 계수를 고해상도 정지 화면 영상에 적용한다. 왜곡이 보정된 고해상도 정지 화면 영상은 문자 인식부로 수신되어 기 설정된 방법으로 고해상도 정지 화면 영상에 포함되어 있는 문자를 인식하여 인식 결과를 문자 인식 장치의 화면에 표시한다. When the image conversion mode ends, the high resolution still image stored in the memory unit is converted. At this time, the coefficients used for the low resolution still image conversion are applied to the high resolution still image. The high-resolution still picture image whose distortion is corrected is received by the character recognizing unit, recognizes the characters included in the high-resolution still picture image by a predetermined method, and displays the recognition result on the screen of the character recognizing apparatus.

상기와 같이 본 발명의 일 실시 예에 따른 카메라를 이용한 문자 인식 장치 및 방법의 구성 및 동작이 이루어질 수 있으며, 한편 상기한 본 발명의 설명에서는 구체적인 실시 예에 관해 설명하였으나 여러 가지 변형이 본 발명의 범위를 벗어나지 않고 실시될 수 있다. 따라서 본 발명의 범위는 설명된 실시 예에 의하여 정할 것이 아니고 청구범위와 청구범위의 균등한 것에 의하여 정하여져야 할 것이다.As described above, the configuration and operation of the apparatus and method for recognizing a character using a camera according to an embodiment of the present invention can be performed. In the meantime, although the embodiments of the present invention have been described in detail, Can be carried out without departing from the scope. Accordingly, the scope of the present invention should not be limited by the illustrated embodiments, but should be determined by equivalents of the claims and the claims.

도 1은 기존의 문자 인식 과정을 나타낸 흐름도1 is a flowchart showing an existing character recognition process.

도 2는 기존의 영상을 촬영하는 기울기를 나타낸 예시도FIG. 2 is a view showing an example of a slope for photographing an existing image

도 3은 본 발명의 일 실시 예에 따른 문자 인식 장치의 내부 블록 구성도3 is an internal block diagram of a character recognition apparatus according to an embodiment of the present invention.

도 4는 본 발명의 일 실시 예에 따른 문자 인식 과정을 나타낸 흐름도4 is a flowchart illustrating a character recognition process according to an embodiment of the present invention.

도 5는 본 발명의 일 실시 예에 따른 방향에 따른 영상 변환을 나타낸 예시도5 is a diagram illustrating an image conversion according to a direction according to an embodiment of the present invention.

도 6은 본 발명의 일 실시 예에 따른 글로벌 모션 벡터에 의한 원근법 변환을 이용하여 정지 영상을 변환하는 과정을 나타낸 흐름도6 is a flowchart illustrating a process of converting a still image using perspective transformation using a global motion vector according to an exemplary embodiment of the present invention.

Claims

An apparatus for improving character recognition performance in a terminal equipped with a camera,

A preview resolution image and a low resolution still image, which are received through the camera when the image conversion mode is entered, are transmitted through the camera, A user operation and a system control unit,

A global motion vector detection unit receiving the preview low resolution image from the user operation and system control unit at predetermined time intervals in units of a predetermined frame to detect a global motion vector,

And an image transform unit for transforming the low-resolution still image by using the detected global motion vector to correct the distortion of the still image,

Wherein the conversion of the low resolution still image is repeatedly performed and when the conversion of the low resolution still image is completed, the high resolution still image is converted using the global motion vector applied to the conversion of the low resolution still image, A device for improving the character recognition performance.

The method according to claim 1,

A memory unit receiving the preview low resolution image and the low resolution high resolution still image from the user operation and system control unit,

And a character recognition unit for recognizing a character included in the converted high-resolution still picture image.

The method according to claim 1,

Wherein the low-resolution still picture image can be stored in a storage space different from the high-resolution still picture image.

delete

The system according to claim 1, wherein the user operation and system control unit

Wherein the terminal has a touch screen or a key button in order to enter or exit the image conversion mode.

The apparatus of claim 1, wherein the global motion vector detection unit

And comparing the current preview low-resolution image and the previous preview low-resolution image among the one or more preview low-resolution images received on a frame-by-frame basis to extract a global motion vector.

The method according to claim 6,

Wherein the comparison of the preview low resolution images is performed by comparing the moving direction of the current preview low resolution image and the degree of movement of the current preview low resolution image based on the previous preview low resolution image.

The image processing apparatus according to claim 1,

Wherein the low resolution still image is transformed using perspective transformation according to the detected global motion vector.

A method for improving character recognition performance in a terminal equipped with a camera,

Storing a low resolution still image and a high resolution still image by user input;

Determining whether the low-resolution still image is distorted;

Extracting a global motion vector using a preview low-resolution image received from the camera when there is distortion;

Transforming the low-resolution and high-resolution still picture images using the extracted global motion vectors to correct distortions;

And outputting a recognition result after recognizing the characters included in the high-resolution still picture image in which the distortion is corrected,

Wherein the conversion of the low resolution still image is repeatedly performed and when the conversion of the low resolution still image is completed, the high resolution still image is converted using the global motion vector applied to the conversion of the low resolution still image, A method for improving the character recognition performance.

delete

The method of claim 9, wherein the step of extracting the global motion vector comprises:

Receiving the preview low-resolution image at a predetermined time interval according to a predetermined frame per second (FPS)

Comparing the current preview low resolution image with the previous preview low resolution image as a center;

And extracting a global motion vector according to the comparison result.

12. The method of claim 11, wherein comparing the preview low resolution images comprises:

And comparing the movement direction and the movement degree of the current preview low resolution image with reference to the previous preview low resolution image.

10. The method of claim 9, wherein the step of correcting the distortion comprises:

Detecting the magnitude and angle of the extracted global motion vector,

Determining whether the size of the extracted global motion vector is greater than or equal to a preset threshold value,

Applying a magnitude and an angle of the extracted global motion vector to the low resolution still image if the threshold value is greater than the threshold,

And applying the size and angle of the global motion vector applied to the finally converted still picture image to the high resolution still picture image to correct the distortion when the conversion of the low resolution still picture image is completed A method for improving character recognition performance.