KR20150137752A

KR20150137752A - Character recognition method and apparatus thereof

Info

Publication number: KR20150137752A
Application number: KR1020140066190A
Authority: KR
Inventors: 이승준; 박찬우; 강소영; 김정민
Original assignee: 삼성에스디에스 주식회사
Priority date: 2014-05-30
Filing date: 2014-05-30
Publication date: 2015-12-09
Also published as: WO2015183015A1; KR102208683B1

Abstract

The present invention relates to a method and a device for recognizing characters from a low-quality input image including the characters. According to an embodiment of the present invention, the character recognition method includes the steps of: extracting feature values from the input image; using font shape information for each font and the feature value to determine the application font with respect to the characters displayed in the input image; using the font feature information of the application font to divide the input image into respective character area images; comparing the character area images with each character template of the application font to calculate the similarity between the respective character area images and the respective character templates; and using the similarity to determine the characters corresponding to the character area images.

Description

[0001] The present invention relates to a character recognition method and apparatus,

본 발명은 문자 인식 방법 및 그 장치에 관한 것이다. 보다 자세하게는, 문자가 포함 된 저품질의 입력 이미지, 즉 즉 외부 요인에 의하여 문자 부분이 오염되거나 문자 구성이 불균일한 비정형적 형태를 가진 입력 이미지를 인식하는 방법 및 그 방법이 적용 된 장치에 관한 것이다.The present invention relates to a character recognition method and apparatus therefor. More specifically, the present invention relates to a method of recognizing an input image having a low quality including characters, that is, an input image having an irregular shape in which a character portion is contaminated by an external factor or a character configuration is uneven, and an apparatus to which the method is applied .

문자 인식 방법은 크게 패턴 정합법(pattern matching)과 구조 분석법(structure analysis)으로 분류할 수 있다. 전자는 주로 인쇄 문자의 인식에, 후자는 주로 필기문자의 인식에 사용된다. 양자의 중간적인 문자인식 방식으로 특징 정합법(feature matching) 및 획 분석법(stroke analysis) 등이 있으며, 각각의 용도나 목적에 맞추어 구분하여 사용하고 있다.The character recognition method can be largely divided into pattern matching and structure analysis. The former is mainly used for recognition of printed characters, and the latter is mainly used for recognition of handwritten characters. There are feature matching methods and stroke analysis methods as intermediate character recognition methods of both, and they are used in accordance with the purpose or purpose of each.

한편, 강재, 목재 등 자재의 표면에 표시된 문자를 인식해야 하는 경우, 종래의 문자 인식 방법은 적용되기 어렵다. 자재의 표면이 종이와 달리 고르지 않거나, 공정의 특성상 글자 부분이 부식되거나 오염, 손상되는 변수가 많기 때문이다. 또한, 도트(dot) 타각(打刻) 문자도 상당수를 차지하고 있어 문자의 형태가 다양하다. 따라서, 자재의 표면에 표시된 문자가 포함 된 이미지 등 문자 표시 상태가 좋지 않은 이미지로부터 문자를 인식하는 방법의 제공이 요구 된다.On the other hand, when characters displayed on the surface of a material such as steel or wood are to be recognized, the conventional character recognition method is difficult to apply. This is because the surface of the material is not unlike the paper, or the characters of the process are corroded or contaminated or damaged. In addition, the number of dot embossing characters also occupies a considerable number, and the shape of characters varies. Therefore, it is required to provide a method of recognizing a character from an image in which the character display state is poor, such as an image including a character displayed on the surface of the material.

한국 공개 특허 제2014-0007764호Korea Patent Publication No. 2014-0007764

본 발명이 해결하고자 하는 기술적 과제는 문자가 포함된 저품질의 입력 이미지로부터 문자를 인식하는 방법 및 그 장치를 제공 하는 것이다.SUMMARY OF THE INVENTION The present invention provides a method and apparatus for recognizing characters from a low-quality input image including characters.

본 발명이 해결하고자 하는 다른 기술적 과제는 문자가 포함된 입력 이미지의 특징 값을 이용하여 문자의 폰트를 판정하고, 상기 입력 이미지를 각각의 문자 영역 이미지로 구분 한 후, 문자 영역 이미지를 판정 된 폰트에 포함된 각 문자 템플릿과 비교하고, 비교 결과를 이용하여 문자를 인식하는 방법 및 그 장치를 제공 하는 것이다.According to another aspect of the present invention, there is provided an image processing method comprising the steps of: determining a font of a character using characteristic values of an input image including characters; dividing the input image into individual character area images; And a method and apparatus for recognizing a character using the comparison result.

본 발명이 해결하고자 하는 또 다른 기술적 과제는 이미지 내의 문자가 도트 타입(dot type)으로 표시 된 경우에도 그 문자를 인식 할 수 있는 방법 및 그 장치를 제공 하는 것이다.It is another object of the present invention to provide a method and apparatus for recognizing a character in an image even when the character is displayed as a dot type.

본 발명의 기술적 과제들은 이상에서 언급한 기술적 과제들로 제한되지 않으며, 언급되지 않은 또 다른 기술적 과제들은 아래의 기재로부터 당업자에게 명확하게 이해 될 수 있을 것이다.The technical objects of the present invention are not limited to the above-mentioned technical problems, and other technical subjects not mentioned can be clearly understood by those skilled in the art from the following description.

상기 기술적 과제를 달성하기 위한 본 발명의 일 실시예에 따른 데이터 확인 방법은 입력 이미지에서 특징 값을 추출 하는 단계, 상기 특징 값과 각 폰트의 폰트 형상 정보를 이용하여 상기 입력 이미지에 표시 된 문자에 대한 적용 폰트를 결정하는 단계, 상기 적용 폰트의 폰트 특징 정보를 이용하여, 상기 입력 이미지를 각각의 문자 영역 이미지로 구분 하는 단계, 상기 문자 영역 이미지를 상기 적용 폰트의 각 문자 템플릿과 비교하여, 상기 문자 영역 이미지의 각 문자 템플릿에 대한 유사도를 연산하는 단계, 및 상기 유사도를 이용하여 상기 문자 영역 이미지에 대응하는 문자를 결정하는 단계를 포함한다.According to another aspect of the present invention, there is provided a data checking method including extracting a feature value from an input image, extracting feature values from a character displayed on the input image using the feature value and font shape information of each font, Determining an applied font for the application font, dividing the input image into respective character area images using the font feature information of the applied font, comparing the character area image with each character template of the applied font, Computing the similarity for each character template in the character area image, and determining the character corresponding to the character area image using the similarity.

상기 특징 값을 추출 하는 단계는, 상기 입력 이미지에서 고품질의 에지(edge)를 추출 하기 위하여 상기 입력 이미지에 대하여 소정의 보정 루틴을 적용 하는 단계, 및 상기 입력 이미지에서 에지(edge)를 추출 하는 단계를 포함할 수 있다. Wherein extracting the feature value comprises applying a predetermined correction routine to the input image to extract a high quality edge from the input image and extracting an edge from the input image . &Lt; / RTI >

상기 적용 폰트를 결정하는 단계는, 상기 추출 된 에지를 이용하여 상기 입력 이미지에 표시 된 각 문자의 크기 및 문자를 구성하는 획의 굵기를 결정 하는 단계, 및 상기 문자의 크기 및 획의 굵기에 부합하는 폰트 형상 정보를 가진 폰트를 상기 적용 폰트로 결정하는 단계를 포함할 수 있다.Wherein the step of determining the applied font comprises the steps of: determining the size of each character displayed on the input image using the extracted edge and the thickness of a stroke constituting the character; And determining the font having the font shape information as the applied font.

상기 적용 폰트를 결정하는 단계는, 상기 추출 된 에지 중 원형 에지가 기 지정 된 개수 이상 존재 하는 경우, 도트 타입 폰트로 판정 하는 단계, 인접한 원형 에지를 그룹화 하는 단계, 하나의 그룹에 속한 각각의 원형 에지를 구성하는 픽셀의 좌표를 이용하여 각 문자의 크기를 결정 하는 단계, 각각의 원형 에지의 지름을 이용하여 도트의 굵기를 결정 하는 단계, 및 상기 문자의 크기 및 도트의 굵기에 부합하는 폰트 형상 정보를 가진 도트 타입 폰트를 상기 적용 폰트로 결정하는 단계를 포함할 수 있다.The method of claim 1, wherein determining the applied font comprises: determining a dot type font if there are more than a predetermined number of round edges of the extracted edges; grouping adjacent circular edges; Determining the size of each character using the coordinates of pixels constituting the edge, determining the thickness of the dot by using the diameter of each circular edge, and determining the font size corresponding to the size of the character and the thickness of the dot Determining a dot type font having information as the applied font.

상기 유사도를 연산하는 단계는, 상기 문자 영역 이미지의 크기를 상기 적용 폰트의 각 문자 템플릿의 이미지의 크기와 일치시키는 단계를 포함할 수 있다.The calculating of the similarity may include matching the size of the character area image with the size of the image of each character template of the applied font.

상기 유사도를 연산하는 단계는, 상기 문자 영역 이미지와 상기 적용 폰트의 각 문자 템플릿의 이미지 사이의 픽셀 값 일치 비율을 연산 하여 유사도를 연산하는 단계, 및 상기 문자 영역 이미지가 상기 적용 폰트의 각 문자 템플릿의 특징 요건을 만족하는 지 여부에 의하여 상기 유사도를 조정하는 단계를 포함할 수도 있다.Calculating the degree of similarity by calculating a pixel value matching ratio between the character area image and the image of each character template of the applied font to calculate a degree of similarity, And adjusting the degree of similarity based on whether or not the characteristic requirement of the first embodiment is satisfied.

상기 구분 하는 단계는, 상기 적용 폰트의 폰트 특징 정보 중 자간, 장평, 폰트 이미지 넓이 및 폰트 이미지 높이를 이용하여 상기 입력 이미지에 문자 분리 가이드 라인을 설정 하는 단계, 및 상기 문자 분리 가이드 라인을 기준으로 상기 입력 이미지를 각각의 문자 영역 이미지로 구분 하는 단계를 포함할 수 있다.Wherein the dividing step includes setting a character separation guideline on the input image by using a character spacing, a long pillar, a font image width, and a font image height in the font characteristic information of the applied font, And dividing the input image into respective character area images.

상기 기술적 과제를 달성하기 위한 본 발명의 다른 실시예에 따른 데이터 확인 방법은 폰트 특징 정보를 이용하여, 입력 이미지를 각각의 문자 영역 이미지로 구분 하는 단계, 상기 문자 영역 이미지를 각 문자 템플릿과 비교하여, 상기 문자 영역 이미지의 각 문자 템플릿에 대한 유사도를 연산하는 단계, 및 상기 유사도를 이용하여 상기 문자 영역 이미지에 대응하는 문자를 결정하는 단계를 포함한다. 이 때, 상기 유사도를 연산하는 단계는, 상기 문자 영역 이미지가 각 문자 템플릿의 특징 요건을 만족하는 지 여부를 반영하는 단계를 포함한다.According to another aspect of the present invention, there is provided a data checking method comprising: dividing an input image into individual character area images using font characteristic information; comparing the character area image with each character template; Calculating a similarity for each character template in the character area image, and determining the character corresponding to the character area image using the similarity. Wherein calculating the similarity comprises reflecting whether the character area image satisfies a feature requirement of each character template.

상기 기술적 과제를 달성하기 위한 본 발명의 또 다른 실시예에 따른 문자 인식 장치는, 입력 이미지에서 특징 값을 추출 하는 이미지 특징 추출부, 상기 특징 값과 각 폰트의 폰트 형상 정보를 이용하여 상기 입력 이미지에 표시 된 문자에 대한 적용 폰트를 결정 하는 폰트 결정부, 상기 적용 폰트의 폰트 특징 정보를 이용하여, 상기 입력 이미지를 각각의 문자 영역 이미지로 구분 하는 문자 영역 구획부, 및 상기 문자 영역 이미지를 상기 적용 폰트의 각 문자 템플릿과 비교하여 상기 문자 영역 이미지의 각 문자 템플릿에 대한 유사도를 연산하고, 상기 유사도를 이용하여 상기 문자 영역 이미지에 대응하는 문자를 결정하는 문자 결정부를 포함한다.According to another aspect of the present invention, there is provided a character recognition apparatus including an image feature extraction unit for extracting a feature value from an input image, A character region dividing unit for dividing the input image into respective character region images by using the font characteristic information of the applied font; And a character determination unit for calculating a similarity to each character template of the character area image by comparing the character template of each of the applied fonts and determining a character corresponding to the character area image using the similarity.

일 실시예에 따르면, 상기 문자 인식 장치는 상기 문자 영역 이미지에 대하여 소정의 화질 보정용 이미지 프로세싱을 수행하고, 이미지 프로세싱이 완료된 상기 문자 영역 이미지를 이진화(binarization)를 수행하며, 이진화 된 상기 문자 영역 이미지를 상기 문자 결정부에 제공하는 문자 결정부를 더 포함할 수 있다.According to one embodiment, the character recognition apparatus performs predetermined image quality correction for the character area image, performs binarization of the character area image on which the image processing is completed, and outputs the binarized character area image To the character determination unit.

상기와 같은 본 발명에 따르면, 자재의 표면에 표시된 문자가 포함 된 이미지 등 문자 표시 상태가 좋지 않은 이미지로부터 문자를 인식할 수 있는 효과가 있다. 특히 문자 표시 부분에 오염이 존재하거나, 구성이 균일하지 않은 비정형적 문자일 경우에도 효과적으로 인식할 수 있는 효과가 있다.According to the present invention as described above, characters can be recognized from an image having a poor character display state, such as an image including characters displayed on the surface of a material. Particularly, even when there is contamination in a character display portion or in an irregular character whose configuration is not uniform, the effect can be effectively recognized.

또한, 도트 타입(dot type)으로 표시된 문자가 포함 된 이미지로부터 문자를 인식할 수 있는 효과가 있다.Further, there is an effect that a character can be recognized from an image including a character represented by a dot type.

도 1은 본 발명의 효과를 설명하기 위한 도면이다.
도 2는 본 발명의 일 실시예에 따른 문자 인식 방법의 순서도이다.
도 3은 도 2에 도시된 문자 인식 방법 중 일부 단계에서 수행 되는 이미지 구분의 결과를 나타내는 도면이다.
도 4는 도 2에 도시된 문자 인식 방법 중 일부 단계를 상세히 설명하기 위한 상세 순서도이다.
도 5는 도 2에 도시된 문자 인식 방법 중 일부 단계를 설명하기 위한 개념도이다.
도 6은 도 2내지 도 5를 참조하여 설명한 방법이 수행 된 결과 표시되는 사용자 인터페이스의 일 예이다.
도 7은 본 발명의 다른 실시예에 따른 문자 인식 장치를 나타낸 도면이다.
도 8은 본 발명의 또 다른 실시예에 따른 문자 인식 장치의 블록도이다.
도 9는 도 8에 표시된 문자 인식 장치의 일부 블록에 대한 상세 블록도이다.
도 10은 본 발명의 또 다른 실시예에 따른 문자 인식 장치의 하드웨어 구성을 나타낸 도면이다.1 is a diagram for explaining the effect of the present invention.
2 is a flowchart of a character recognition method according to an embodiment of the present invention.
FIG. 3 is a diagram showing the results of image classification performed at some stages in the character recognition method shown in FIG. 2. FIG.
FIG. 4 is a detailed flowchart for explaining some steps of the character recognition method shown in FIG. 2 in detail.
FIG. 5 is a conceptual diagram for explaining some steps of the character recognition method shown in FIG. 2. FIG.
FIG. 6 is an example of a user interface displayed as a result of performing the method described with reference to FIGS.
7 is a diagram illustrating a character recognition apparatus according to another embodiment of the present invention.
8 is a block diagram of a character recognition apparatus according to another embodiment of the present invention.
FIG. 9 is a detailed block diagram of some blocks of the character recognition apparatus shown in FIG. 8; FIG.
10 is a diagram illustrating a hardware configuration of a character recognition apparatus according to another embodiment of the present invention.

이하, 첨부된 도면을 참조하여 본 발명의 바람직한 실시예를 상세히 설명한다. 본 발명의 이점 및 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 후술되어 있는 실시 예들을 참조하면 명확해질 것이다. 그러나 본 발명은 이하에서 게시되는 실시 예들에 한정되는 것이 아니라 서로 다른 다양한 형태로 구현될 수 있으며, 단지 본 실시 예들은 본 발명의 게시가 완전하도록 하고, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 발명의 범주를 완전하게 알려주기 위해 제공되는 것이며, 본 발명은 청구항의 범주에 의해 정의될 뿐이다. 명세서 전체에 걸쳐 동일 참조 부호는 동일 구성 요소를 지칭한다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. BRIEF DESCRIPTION OF THE DRAWINGS The advantages and features of the present invention and the manner of achieving them will become apparent with reference to the embodiments described in detail below with reference to the accompanying drawings. The present invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art. Is provided to fully convey the scope of the invention to those skilled in the art, and the invention is only defined by the scope of the claims. Like reference numerals refer to like elements throughout the specification.

다른 정의가 없다면, 본 명세서에서 사용되는 모든 용어(기술 및 과학적 용어를 포함)는 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 공통적으로 이해될 수 있는 의미로 사용될 수 있을 것이다. 또 일반적으로 사용되는 사전에 정의되어 있는 용어들은 명백하게 특별히 정의되어 있지 않는 한 이상적으로 또는 과도하게 해석되지 않는다. 본 명세서에서 사용된 용어는 실시예들을 설명하기 위한 것이며 본 발명을 제한하고자 하는 것은 아니다. 본 명세서에서, 단수형은 문구에서 특별히 언급하지 않는 한 복수형도 포함한다. 명세서에서 사용되는 "포함한다(comprises)" 및/또는 "포함하는(comprising)"은 언급된 구성요소, 단계, 동작 및/또는 소자는 하나 이상의 다른 구성요소, 단계, 동작 및/또는 소자의 존재 또는 추가를 배제하지 않는다.Unless defined otherwise, all terms (including technical and scientific terms) used herein may be used in a sense commonly understood by one of ordinary skill in the art to which this invention belongs. Also, commonly used predefined terms are not ideally or excessively interpreted unless explicitly defined otherwise. The terminology used herein is for the purpose of illustrating embodiments and is not intended to be limiting of the present invention. In the present specification, the singular form includes plural forms unless otherwise specified in the specification. It is noted that the terms "comprises" and / or "comprising" used in the specification are intended to be inclusive in a manner similar to the components, steps, operations, and / Or additions.

본 명세서에서 로직(LOGIC) 또는 루틴(ROUTINE)은 프로세서에 의하여 수행 될 수 있는 일련의 명령어(OPERATION)를 의미하며, 특정 프로그래밍 언어에 의하여 작성 된 것으로 한정 되지 않는다.In this specification, a logic or a routine refers to a series of instructions that can be executed by a processor, and is not limited to being written by a specific programming language.

먼저, 도 1을 참조하여 본 발명의 실시예들에 따른 문자 인식 방법 및 그 장치가 인식 할 수 있는, 문자가 포함된 이미지 및 그 인식 결과를 제시 한다.First, referring to FIG. 1, a character recognition method according to embodiments of the present invention and an image including characters and recognition results thereof that can be recognized by the apparatus are presented.

이미 언급 된 바와 같이, 목재, 판재, 강재 등 자재의 표면에 그 자재의 일련 번호 등 관련 정보가 타각(打刻)될 수 있다. 타각 되는 문자는 연결 된 획으로 구성된 일반 문자일 수 있으나, 타각의 효율성을 위하여 도트 타입(dot type)의 문자일 수도 있다. 또한, 공정에 발생한 여러 상황에 따라 타각 된 글자가 비정형적 형상을 가질 수도 있다. 또한, 자재의 표면이 종이와 달리 고르지 않거나, 공정의 특성상 글자 부분이 부식되거나 오염, 손상될 수도 있다. 도 1은 도트 타입으로 문자가 타각 된 자재의 표면을 촬영한 이미지(10)를 표시 한다.As already mentioned, related information, such as the serial number of the material, can be embossed on the surface of the material, such as wood, plate, or steel. A character to be embossed may be a general character composed of linked strokes, but may be a dot type character for efficiency of embossing. Also, depending on various situations occurring in the process, the embossed characters may have an irregular shape. In addition, the surface of the material may be uneven from the paper, or character portions of the material may be corroded, contaminated, or damaged due to the nature of the process. Fig. 1 shows an image 10 of a dot-type image of a surface of a material embossed with characters.

본 발명의 실시예들은 도 1에 도시된 것과 같은 이미지(10)로부터 문자를 인식(20) 할 수 있다. 이하, 본 발명의 실시예들을 설명한다. 다만, 본 발명의 실시예들이 도 1에 도시된 것과 같은 타각 된 문자를 인식하거나, 도트 타입의 문자를 인식하는 것으로만 한정되지는 않으며, 본 발명의 실시예들은 문자를 인식 하는 방법에 포함 된 각각의 동작 또는 문자를 인식 하는 장치의 구성을 기준으로 이해 되어야 한다.Embodiments of the present invention may recognize (20) characters from an image 10 as shown in FIG. Hereinafter, embodiments of the present invention will be described. However, the embodiments of the present invention are not limited to recognizing embossed characters such as shown in FIG. 1, or recognizing dot type characters, and embodiments of the present invention are not limited to those embodied in a method of recognizing characters But should be understood on the basis of the configuration of a device that recognizes each operation or character.

도 2를 참조하여, 본 발명의 일 실시예에 따른 문자 인식 방법을 설명한다. 도 2에는 입력 이미지가 이미 생성 된 경우에 대한 순서도가 기재되어 있으나, 본 실시예는, 문자가 프린트 되어 있거나 문자가 타각 되어 있는 피사체를 촬영 하여 상기 입력 이미지를 생성하는 단계를 더 포함할 수 있다. 본 실시예는, 촬영 된 이미지를 디스플레이 하는 단계 및 사용자로부터 디스플레이 된 이미지 상의 문자 표시 영역을 지정 받는 단계를 더 포함할 수도 있다. 문자 표시 영역을 사용자로부터 지정 받은 경우, 상기 문자 표시 영역에서만 문자를 인식함으로써 문자 인식의 오류 발생 가능성을 낮출 수 있다.Referring to FIG. 2, a character recognition method according to an embodiment of the present invention will be described. Although FIG. 2 shows a flowchart for the case where an input image has already been generated, this embodiment may further include a step of photographing a subject in which characters are printed or characters are embossed to generate the input image . The present embodiment may further include a step of displaying the photographed image and a step of designating a character display area on the displayed image from the user. When a character display area is designated by a user, it is possible to reduce the possibility of error in character recognition by recognizing the character only in the character display area.

상기 입력 이미지에서 특징 값(feature)이 추출 된다(S100). 추출 된 상기 특징 값은 상기 입력 이미지에 문자의 폰트(font)를 결정하는데 기초 자료로 사용 될 수 있다(S102). 상기 특징 값을 추출하기 전에, 상기 입력 이미지에서 상기 특징 값이 잘 추출 될 수 있도록, 상기 입력 이미지에 대한 선처리(pre-process)가 수행 될 수 있다. 예를 들어, 상기 선처리는 레벨 조정(level adjusting)을 포함할 수 있다. 예를 들어, 색상을 밝게 하고 명암 대비를 조정할 수 있다.Feature values are extracted from the input image (S100). The extracted feature value may be used as a basic data for determining a font of a character in the input image (S102). A pre-process may be performed on the input image so that the feature value can be extracted well in the input image before extracting the feature value. For example, the pre-processing may include level adjusting. For example, you can lighten the hue and adjust the contrast.

본 실시예에서는, 이미지의 특징 값으로 사용 되는 다양한 요소들이 추출 될 수 있다. 예를 들어, 상기 입력 이미지의 에지(edge), 또는 윤곽선(contour)이 추출 될 수 있다. 또한, 상기 입력 이미지를 구성하는 각 픽셀의 색상 값에 대한 히스토그램(histogram)이 연산 될 수 있다.In this embodiment, various elements used as feature values of an image can be extracted. For example, an edge or contour of the input image may be extracted. In addition, a histogram of color values of pixels constituting the input image may be calculated.

추출 된 상기 입력 이미지의 에지는 상기 입력 이미지에 포함된 각 문자(character)의 크기를 결정하는데 사용 될 수 있다. 일반적으로, 문자의 외곽선은 주위와 비교하여 현저한 색상 변화를 구성하므로, 문자가 포함된 이미지에 대하여 소정의 에지 추출 로직을 적용하면 문자의 외곽선이 에지를 형성하게 된다. 따라서, 한 획(stroke)이 하나의 문자를 구성하는 숫자(0~9) 및 소문자 i, j 제외한 알파벳에 대하여는, 하나의 에지가 차지하는 영역이 곧 문자의 영역인 것으로 간주 된다. (또한 도트형태의 문자 포함한 획으로 구성된 문자도 입력 이미지 품질에 따라 분리될 수 있으므로 에지의 간격이 기준 값 이내일 경우 같은 문자영역으로 간주한다.)The extracted edges of the input image may be used to determine the size of each character included in the input image. Generally, since the outline of a character constitutes a remarkable color change compared to the surroundings, when the predetermined edge extraction logic is applied to an image including a character, the outline of the character forms an edge. Therefore, for an alphabet except a digit (0 to 9) and a lowercase letter i, j in which a stroke constitutes one character, the area occupied by one edge is regarded as a character area. (Also, characters composed of strokes including dot-shaped characters can be separated according to the input image quality, so if the interval of the edge is within the reference value, it is regarded as the same character area.)

본 명세서에서, 에지는 인접한 픽셀들로 구성 된, 끊기지 않은 기준 개수 및 크기 이상의 픽셀의 집합을 지칭한다.In this specification, an edge refers to a set of pixels that are made up of adjacent pixels and that are not less than the reference number and size.

추출된 에지에 대하여 위치 보정이 수행 될 수 있다. 예를 들어, 에지의 아웃라인(outline) 제거 및 평행성 보정 또는 기울기 보정 등이 수행 될 수 있다.Position correction can be performed on the extracted edge. For example, edge outline removal and parallel or slope correction may be performed.

문자의 크기는 넓이와 높이로 정의 될 수 있다.The size of a character can be defined as width and height.

예를 들어, 하나의 에지가 차지하는 픽셀의 x축 최소 좌표가 30이고, x축 최대 좌표가 100이며, y축 최소 좌표가 100이고 y축 최대 좌표가 200이라면, 상기 연결 된 에지가 가리키는 문자의 크기는 가로 70, 세로 100인 것으로 결정 될 수 있을 것이다.For example, if the x coordinate minimum coordinate of one edge is 30, the x coordinate maximum coordinate is 100, the y coordinate minimum coordinate is 100, and the y coordinate maximum coordinate is 200, the character indicated by the connected edge The size can be determined to be 70 in the width and 100 in the width.

한글, 한자, 알파벳 소문자 i, j 및 기타 문자의 경우, 2 이상의 획이 하나의 문자를 구성한다. 이런 경우라도, 하나의 문자를 구성하는 획들은 서로 인접하게 될 것이다. 이러한 성질을 이용하여, 소정의 기준을 만족하는 정도로 인접한 에지들로 구성 된 에지 그룹은 하나의 문자를 구성하는 것으로 간주하고, 상기 에지 그룹이 차지 하는 영역이 곧 문자의 영역인 것으로 간주 된다.Hangul, Kanji, and Alphabets For lowercase i, j, and other characters, two or more strokes constitute one character. Even in this case, the strokes constituting one character will be adjacent to each other. Using this property, an edge group composed of adjacent edges to the extent satisfying a predetermined criterion is regarded as constituting one character, and an area occupied by the edge group is regarded as a character area.

예를 들어, 제1 에지가 차지하는 픽셀의 x축 최소 좌표가 40이고, x축 최대 좌표가 70이며, y축 최소 좌표가 48이고 y축 최대 좌표가 108이며, 상기 제1 에지와 함께 하나의 문자를 구성하는 제2 에지가 차지하는 픽셀의 x축 최소 좌표가 74이고, x축 최대 좌표가 84이며, y축 최소 좌표가 40이고 y축 최대 좌표가 115라면, 제1 에지와 제2 에지가 구성하는 문자의 크기는 가로 44, 세로 75인 것으로 결정 될 것이다.For example, if the x-axis minimum coordinate of the first edge is 40, the x-axis maximum coordinate is 70, the y-axis minimum coordinate is 48 and the y-axis maximum coordinate is 108, If the x-axis minimum coordinate of the pixel occupied by the second edge constituting the character is 74, the x-axis maximum coordinate is 84, the y-axis minimum coordinate is 40 and the y-axis maximum coordinate is 115, the first and second edges It is determined that the size of the constituent characters is 44 in the horizontal direction and 75 in the vertical direction.

한편, 문자의 개수가 복수인 경우, 각 문자의 크기는 서로 다르게 결정 될 수 있다. 이러한 경우, 각 문자 크기의 평균치를 사용할 수 있다.On the other hand, when the number of characters is plural, the size of each character can be determined differently. In this case, the average value of each character size can be used.

추출 된 상기 입력 이미지의 에지는 상기 입력 이미지에 포함된 각 문자를 구성하는 획(stroke)의 굵기를 결정하는 데에도 사용 될 수 있다. 에지는 각 획의 윤곽선으로 볼 수 있으므로, 에지를 구성하는 각 픽셀의 좌표를 분석하여 획의 굵기가 얻어질 수 있다. 획의 굵기에도 오차가 발생하는 등의 이유로 각 획마다 차이가 발생할 수 있다. 이러한 경우, 각 획의 굵기의 평균치를 사용할 수 있다.The edge of the extracted input image can also be used to determine the thickness of a stroke constituting each character included in the input image. Since the edge can be seen as the outline of each stroke, the thickness of the stroke can be obtained by analyzing the coordinates of each pixel constituting the edge. A difference may occur in each stroke due to an error in the thickness of the stroke. In this case, the average value of the thickness of each stroke can be used.

문자의 크기 및 획의 굵기가 얻어지면, 기 저장 된 각 폰트의 폰트 형상 정보 중 상기 문자 크기 및 획의 굵기에 부합하는 것이 있는 지 조회 한다. 이를 위해, 본 실시예는 인식하고자 하는 각 폰트의 폰트 형상 정보를 미리 저장 하는 단계를 더 포함할 수 있다.If the size of the character and the thickness of the stroke are obtained, inquiry is made as to whether or not the font shape information of each of the previously stored fonts matches the character size and the thickness of the stroke. To this end, the present embodiment may further include storing font shape information of each font to be recognized in advance.

상기 폰트 형상 정보에는 폰트 타입, 자간, 장평, 문자 크기 정보, 획 굵기 정보(도트 타입 폰트의 경우, 도트 굵기 정보) 등이 포함 될 수 있다. 상기 폰트 타입은 도트 타입인지 여부를 가리킨다.The font shape information may include a font type, a character spacing, a major plane, character size information, stroke information (dot thickness information in the case of a dot type font), and the like. The font type indicates whether it is a dot type.

문자 촬영 거리에 따라 문자의 크기가 달라질 수 있으므로, 일부 실시예에 따르면 입력 이미지에 포함된 문자의 크기와 획의 굵기 사이의 비율이 적용 폰트를 결정하기 위한 팩터(factor)로써 더 사용될 수 있다. 예를 들어, 입력 이미지의 문자 넓이와 획의 굵기 사이의 비율이 15:1이고, 문자 높이와 획의 굵기 사이의 비율이 18:1으로 측정 된 경우, 폰트 형상 정보에서 이러한 비율에 부합하는 것이 있는지 조회할 수 있다.According to some embodiments, the ratio between the size of a character included in an input image and the thickness of a stroke may be further used as a factor for determining an applied font. For example, if the ratio between the character width of the input image and the width of the stroke is 15: 1 and the ratio between the height of the character and the thickness of the stroke is 18: 1, .

정리하면, 본 실시예에서 입력 이미지에서 에지 등 특징 값을 추출 하는 단계(S100)와, 상기 특징 값과 각 폰트의 폰트 형상 정보를 이용하여 상기 입력 이미지에 표시 된 문자에 대한 적용 폰트를 결정하는 단계(S102)가 수행 되고, 상기 적용 폰트를 결정하는 단계는, 상기 추출 된 특징 값을 이용하여 상기 입력 이미지에 표시 된 각 문자의 크기 및 문자를 구성하는 획의 굵기를 결정 하는 단계와, 상기 문자의 크기 및 획의 굵기에 부합하는 폰트 형상 정보를 가진 폰트를 상기 적용 폰트로 결정하는 단계를 포함할 수 있다. 이 때, 상기 특징 값을 이용하여 상기 입력 이미지에 표시 된 각 문자의 크기 및 문자를 구성하는 획의 굵기를 결정 하는 단계는, 추출 된 각 에지 중 기 정의 된 최소 길이에 미달 하는 길이를 가지는 에지를 노이즈 처리 하는 단계와, 노이즈 처리 되지 않은 각각의 에지를 구성하는 픽셀의 좌표를 이용하여 각 문자의 크기 및 획의 굵기를 결정 하는 단계를 포함할 수 있다.In other words, in the present exemplary embodiment, a feature value such as an edge is extracted from an input image (S100), and a font applied to a character displayed on the input image is determined using the feature value and font shape information of each font Wherein the step of determining the applied font comprises the steps of: determining a size of each character displayed on the input image using the extracted feature value and a thickness of a stroke constituting the character; And determining the font having the font shape information corresponding to the size of the character and the thickness of the stroke as the applied font. In this case, the step of determining the size of the character and the thickness of the strokes constituting the character displayed on the input image using the feature value may include determining the width of the edge of the edge, And determining the size of each character and the thickness of the stroke using the coordinates of the pixels constituting each of the edges that are not subjected to the noise processing.

본 실시예에서, 입력 이미지의 색상 값에 대한 히스토그램이 문자의 외곽선을 나타내는 에지를 보다 정확하게 추출하는데 적용될 수 있다. 문자의 각 획이 동일하거나 유사한 색상으로 구성되고, 하나의 획은 동일 또는 유사 색상으로 구성 되는 이상, 문자 색상은 히스토그램에서 높은 도수를 가질 것이다. 이러한 점을 이용하여, 상기 입력 이미지를 구성하는 픽셀들 중, 상기 히스토그램에서 소정 값 이상의 비율을 차지하는 색상 값을 가지는 문자 구성 후보 픽셀과 그 인접 픽셀들을 대상으로, 에지를 추출할 수 있다. 이러한 경우, 문자의 외곽선이 아닌 곳에서 에지가 추출 되어, 상기 에지가 문자의 외곽선으로 다루어지고, 그 결과 문자의 크기가 잘못 결정 되는 것을 방지할 수 있다.In this embodiment, a histogram of the color values of the input image can be applied to more accurately extract the edges representing the outline of the characters. The character color will have a high frequency in the histogram, so long as each stroke of the character is composed of the same or similar color, and one stroke is composed of the same or similar color. Using this point, an edge can be extracted with respect to a character constituent candidate pixel having a color value occupying a ratio equal to or larger than a predetermined value in the histogram and adjacent pixels among the pixels constituting the input image. In this case, an edge is extracted from the outline of the character, and the edge is treated as the outline of the character, thereby preventing the size of the character from being erroneously determined.

이하, 입력 이미지에 도트 타입의 문자가 포함 된 경우의 본 발명의 일 실시예에 따른 폰트 결정 방법에 대하여 설명한다.Hereinafter, a font determination method according to an embodiment of the present invention in which a dot type character is included in an input image will be described.

상기 입력 이미지에 도트 타입의 문자가 포함 된 경우라면, 각 도트의 외곽선이 에지를 형성하게 된다. 따라서, 도트 타입 문자를 포함한 이미지에 대하여 에지 추출 로직을 적용하면, 원형의 에지가 복수 개 추출 될 것이다. 예를 들어, 20개의 도트로 구성 된 제1 문자와 18개의 도트로 구성 된 제2 문자가 포함 된 이미지에 대하여 에지 추출 로직을 적용하면, 원형의 에지가 38개 추출 될 것이다. 따라서, 본 실시예에서는 에지 추출의 결과 원형의 에지가 기 지정 된 개수 이상 존재 하는 경우, 상기 적용 폰트가 도트 타입 폰트 중 하나인 것으로 판정한다.If a dot type character is included in the input image, the outline of each dot forms an edge. Therefore, if edge extraction logic is applied to an image including a dot type character, a plurality of circular edges will be extracted. For example, when edge extraction logic is applied to an image including a first character composed of 20 dots and a second character composed of 18 dots, 38 circular edges will be extracted. Therefore, in the present embodiment, when the number of the circular edges is equal to or greater than the predetermined number as a result of the edge extraction, it is determined that the applied font is one of the dot type fonts.

상기 도트의 형상은 원형인 것이 대부분이나, 도트의 형상은 비(非)원형, 예를 들어 사각형, 삼각형 등일 수도 있다. 이러한 경우 본 실시예는 동일한 형상의 에지가 기 지정된 개수 이상 존재 하는 경우, 적용 폰트를 도트 타입 폰트로 판정 하는 것으로 일반화 될 수 있다.The shape of the dot is mostly circular, but the shape of the dot may be a non-circular shape, for example, a square, a triangle, or the like. In this case, this embodiment can be generalized to determine the applied font as a dot type font when there are more than a predetermined number of edges of the same shape.

하나의 문자를 구성하는 도트들은 서로 소정의 기준에 부합하는 정도로 인접할 것이므로, 서로 인접한 도트들로 하나의 그룹을 형성하고, 형성 된 도트 그룹이 차지하는 영역을 이용하여 상기 도트 그룹이 가리키는 문자의 크기를 결정할 수 있을 것이다.Since the dots constituting one character are adjacent to each other to a degree matching with a predetermined reference, a group is formed of dots adjacent to each other, and the size of a character indicated by the dot group is determined by using a region occupied by the formed dot group . &Lt; / RTI >

예를 들어, 하나의 도트 그룹에 포함 된 각 도트의 외곽선 에지가 차지하는 픽셀의 x축 최소 좌표가 30이고, x축 최대 좌표가 100이며, y축 최소 좌표가 100이고 y축 최대 좌표가 200이라면, 상기 도트 그룹이 가리키는 문자의 크기는 가로 70, 세로 100인 것으로 결정 될 수 있을 것이다.For example, if the x-axis minimum coordinate of the pixel occupied by the outline edge of each dot included in one dot group is 30, the x-axis maximum coordinate is 100, the y-axis minimum coordinate is 100, and the y- , And the size of the character indicated by the dot group may be determined to be 70 in the horizontal direction and 100 in the vertical direction.

추출 된 상기 입력 이미지의 에지는 상기 입력 이미지에 포함된 각 문자를 구성하는 도트의 굵기를 결정하는 데에도 사용 될 수 있다. 에지는 각 도트의 윤곽선으로 볼 수 있으므로, 에지를 구성하는 각 픽셀의 좌표를 분석하여 도트의 굵기가 얻어질 수 있다. 도트의 굵기에도 오차가 발생하는 등의 이유로 각 도트마다 차이가 발생할 수 있다. 이러한 경우, 각 도트의 굵기의 평균치를 사용할 수 있다.The edge of the extracted input image can also be used to determine the thickness of the dots constituting each character included in the input image. Since the edge can be seen as the outline of each dot, the thickness of the dot can be obtained by analyzing the coordinates of each pixel constituting the edge. A difference may occur in each dot due to an error in the thickness of the dot or the like. In this case, the average value of the thicknesses of the dots can be used.

문자 촬영 거리에 따라 문자의 크기가 달라질 수 있으므로, 일부 실시예에 따르면 입력 이미지에 포함된 문자의 크기와 도트의 굵기 사이의 비율이 적용 폰트를 결정하기 위한 팩터(factor)로써 더 사용될 수 있다. 예를 들어, 입력 이미지의 문자 넓이와 도트 굵기 사이의 비율이 22:1이고, 문자 높이와 획의 굵기 사이의 비율이 28:1으로 측정 된 경우, 폰트 형상 정보에서 이러한 비율에 부합하는 것이 있는지 조회할 수 있다.According to some embodiments, the ratio between the size of the characters included in the input image and the thickness of the dot may be further used as a factor for determining the font to be applied. For example, when the ratio between the character width and the dot thickness of the input image is 22: 1 and the ratio between the character height and the stroke width is 28: 1, it is determined whether there is such a ratio in the font shape information .

정리하면, 적용 폰트를 결정하는 단계는, 인접한 원형 에지를 그룹화 하는 단계와, 하나의 그룹에 속한 각각의 원형 에지를 구성하는 픽셀의 좌표를 이용하여 각 문자의 크기를 결정 하는 단계와, 각각의 원형 에지의 지름을 이용하여 도트의 굵기를 결정 하는 단계와, 상기 문자의 크기 및 도트의 굵기에 부합하는 폰트 형상 정보를 가진 도트 타입 폰트를 상기 적용 폰트로 결정하는 단계를 포함할 수 있다.In summary, the step of determining the applied font comprises the steps of grouping adjacent circular edges, determining the size of each character using the coordinates of the pixels constituting each circular edge belonging to a group, Determining a thickness of the dot using the diameter of the circular edge; and determining a dot type font having font size information and font shape information corresponding to the size of the character and the size of the dot as the application font.

상기 설명 된 동작을 거쳐 입력 이미지에 포함된 문자에 적용 된 폰트가 결정 되면, 상기 적용 폰트의 폰트 형상 정보를 이용하여, 상기 입력 이미지를 각각의 문자 영역 이미지로 구분 한다(S104). 이미 언급한 바와 같이, 상기 폰트 형상 정보에는 자간 및 장평 정보와, 각 문자의 크기 정보가 포함 되어 있으므로, 에지의 추출을 통해 이미 구한 각 문자의 영역 정보에 상기 자간 및 장평 정보를 더 반영 하여 도 3에 도시 된 것과 같은 문자 분리 가이드 라인(30)이 상기 입력 이미지에 설정 될 수 있다. 그 후, 상기 문자 분리 가이드 라인을 기준으로 상기 입력 이미지를 각각의 문자 영역 이미지로 구분할 수 있다.If the font applied to the characters included in the input image is determined through the above-described operation, the input image is divided into the respective character area images using the font shape information of the applied font (S104). As already mentioned, since the font shape information includes the character spacing and the length information and the size information of each character, even if the character spacing and the length information are further reflected in the area information of each character already obtained through edge extraction A character separation guideline 30 such as shown in Fig. 3 may be set in the input image. Thereafter, the input image may be divided into individual character region images based on the character separation guideline.

다음으로, 각각의 문자 영역 이미지 별로 어떠한 문자인지 인식하는 단계가 수행 된다(S106). 이 때, 이미 적용 폰트는 결정 된 바 있으므로, 상기 문자 영역 이미지를 상기 적용 폰트의 각 문자 템플릿과 비교하여, 상기 문자 영역 이미지의 각 문자 템플릿에 대한 유사도를 연산하여, 상기 유사도를 바탕으로 상기 문자 영역 이미지에 대응하는 문자를 결정 한다. 이하, 도 4를 참조하여 보다 자세히 설명한다.Next, a step of recognizing any character for each character area image is performed (S106). At this time, since the already applied font has been determined, the character area image is compared with each character template of the applied font to calculate the similarity to each character template of the character area image, and based on the similarity, The character corresponding to the area image is determined. Hereinafter, this will be described in more detail with reference to FIG.

먼저, 인식의 정확도를 높이기 위하여 상기 문자 영역 이미지에 대하여 이미지 전처리가 수행 된다. 선명도를 높이는 등의 일련의 이미지 처리가 수행 되고, 이진화(binarization)가 수행 될 수 있다(S160). 이진화에 의하여, 문자 영역 이미지는 0 또는 1의 픽셀 값을 가지게 된다.First, image preprocessing is performed on the character area image to improve recognition accuracy. A series of image processing such as increasing sharpness is performed, and binarization may be performed (S160). By binarization, the character area image has a pixel value of 0 or 1.

다음으로, 상기 문자 영역 이미지의 크기를 상기 적용 폰트의 각 문자 템플릿의 이미지의 크기와 일치시킨다(S162).Next, the size of the character area image is matched with the size of the image of each character template of the applied font (S162).

다음으로, 상기 문자 영역 이미지와 상기 적용 폰트의 각 문자 템플릿의 이미지 사이의 픽셀 값 일치 비율을 연산 하여 유사도를 연산한다(S164). 상기 문자 영역 이미지의 크기와 상기 문자 템플릿 이미지의 크기가 일치하므로, 동일한 위치의 픽셀 간의 픽셀 값이 1:1 매치 될 수 있다. 문자 템플릿 이미지 역시 0, 1의 픽셀 값을 가지는 픽셀들로 구성 된 것으로 전제한다.Next, a pixel value matching ratio between the character area image and the image of each character template of the applied font is calculated to calculate the similarity (S164). Since the size of the character area image is identical to the size of the character template image, pixel values between pixels at the same position can be matched 1: 1. The character template image is also assumed to be composed of pixels having a pixel value of 0, 1.

다음으로, 상기 문자 영역 이미지가 상기 적용 폰트의 각 문자 템플릿의 특징 요건을 만족하는 지 여부에 의하여 상기 유사도를 조정한다(S166). 이하, 도 5를 참조하여, 상기 문자 영역 이미지를 상기 적용 폰트의 각 문자 템플릿과 비교하여, 상기 문자 영역 이미지의 각 문자 템플릿에 대한 유사도를 연산하는 동작을 보다 자세히 설명한다.Next, the degree of similarity is adjusted according to whether or not the character area image satisfies the feature requirement of each character template of the applied font (S166). Hereinafter, the operation of comparing the character area image with each character template of the applied font to calculate the similarity to each character template of the character area image will be described in more detail.

도 5에는 5X8(넓이X높이) 픽셀로 구성 된 가상의 문자 영역 이미지 및 특정 폰트의 ‘0’ 템플릿이 도시 되어 있다. 본 발명의 실시예들에 따르면, 인식 대상 폰트에 대한 정보로써, 이미 설명한 폰트 형상 정보 이외에, 각 문자의 템플릿 이미지 및 특징 요건 정보가 더 저장 될 수 있다. 도 5에는 숫자 ‘0’에 대한 템플릿 이미지(36) 및 특징 요건 정보(37)가 도시 되어 있다. 특징 요건 정보(37)는 해당 문자로 인식 되기 위하여, 문자 영역 이미지 중 특정 영역의 픽셀 값이 만족해야 하는 요건(condition)을 의미한다.In FIG. 5, a virtual character area image composed of 5X8 (wide X height) pixels and a '0' template of a specific font are shown. According to the embodiments of the present invention, as the information on the recognition target font, in addition to the font shape information already described, the template image and feature requirement information of each character can be further stored. Figure 5 shows a template image 36 and feature requirement information 37 for the number ' 0 '. The feature requirement information 37 indicates a condition that a pixel value of a specific area in the character area image must be satisfied in order to be recognized as a corresponding character.

먼저, 문자 영역 이미지(35)와 숫자 ‘0’에 대한 템플릿 이미지(36)의 픽셀 값의 유사도가 연산 된다. 이 때, 동일한 위치의 픽셀들끼리, 픽셀 값의 일치 여부를 판정 하고, 픽셀 값이 일치하는 픽셀의 비율이 상기 유사도가 될 수 있다. 다음으로, 문자 영역 이미지(35)가 숫자 '0'에 대한 템플릿의 특징 요건을 만족하는지 여부에 대한 평가가 수행 된다.First, the similarity of the pixel values of the character area image 35 and the template image 36 to the number '0' is calculated. At this time, it is judged whether or not the pixel values of the pixels at the same position coincide with each other, and the ratio of the pixels whose pixel values match can be the similarity. Next, an evaluation is made as to whether the character area image 35 satisfies the feature requirement of the template for the number '0'.

도 5에는 숫자 ‘0’에 대한 가상의 특징 요건으로 3가지가 예시되고 있다. 제1 특징 요건은 제1 영역(31)의 픽셀이 모두 0일 것을 요구한다. 인식 대상인 문자 영역 이미지(35)의 경우, 이 조건을 만족하지 못한다. 제2 특징 요건은 제2 영역(32)의 픽셀 중 적어도 하나는 1일 것을 요구한다. 문자 영역 이미지(35)의 경우, 이 조건을 만족한다. 제3 특징 요건은 제3 영역(33)의 픽셀 중 적어도 하나는 1일 것을 요구한다. 문자 영역 이미지(35)의 경우, 이 조건을 만족한다. 즉, 도 5에 도시된 사안의 경우, 문자 영역 이미지(35)가 제1 특징 요건 만을 만족 시키지 못했으므로, 이미지 간의 비교를 통하여 연산 된 상기 유사도에서, 제1 특징 요건에 부여 된 수치가 차감 되거나, 제1 특징 요건에 부여 된 비율만큼 감점 될 수 있다.In FIG. 5, three virtual feature requirements for the number '0' are illustrated. The first feature requirement requires that the pixels of the first region 31 be all zeros. In the case of the character area image 35 to be recognized, this condition is not satisfied. The second feature requirement requires that at least one of the pixels of the second region 32 be one. In the case of the character area image 35, this condition is satisfied. The third feature requirement requires that at least one of the pixels of the third region 33 be one. In the case of the character area image 35, this condition is satisfied. That is, in the case shown in FIG. 5, since the character area image 35 does not satisfy only the first feature requirement, in the similarity calculated through comparison between images, the value given to the first feature requirement is subtracted , It can be deducted by the ratio given to the first feature requirement.

즉, 각 문자 템플릿은 복수의 특징 요건을 가질 수 있고, 각각의 특징 요건은 서로 다른 정도의 중요도를 가질 수 있으며, 그 결과 특징 요건을 만족하지 못하는 경우, 유사도에서 차감 되는 수치 또는 비율이 특징 요건 마다 서로 다르게 할당 될 수 있다.That is, each character template may have a plurality of feature requirements, and each feature requirement may have a different degree of importance, so that if the feature requirements are not satisfied, Can be assigned differently.

일 실시예에 따르면, 상기 특징 요건의 만족 시 기 연산 된 유사도가 증가 될 수도 있다. 즉, 상기 문자 영역 이미지가 상기 적용 폰트의 각 문자 템플릿의 특징 요건을 만족하는 지 여부에 의하여 상기 유사도를 조정할 때, 특징 요건 불만족 시 유사도를 차감하거나, 특징 요건 만족 시 유사도를 증가시킬 수 있다.According to one embodiment, the calculated degree of similarity may be increased upon satisfaction of the feature requirement. That is, when the similarity is adjusted according to whether or not the character area image satisfies the feature requirement of each character template of the applied font, the degree of similarity may be subtracted or the degree of similarity may be increased when the feature requirement is satisfied.

다른 실시예에 따르면, 상기 특징 요건으로, 특정 영역에 대한 픽셀 값 일치 여부가 사용 될 수 있다. 예를 들어, 숫자 '0'의 문자 템플릿은, 픽셀 값이 '0'일 것(예를 들어 흰색을 의미)을 요하는 영역(31), 픽셀 값이 '1'일 것(예를 들어 검은색을 의미)을 요하는 영역(32, 33)을 상기 특징 요건으로 가질 수 있다. 이 때, 상기 특징 요건에 대한 평가 결과는, 문자 영역 이미지(35)가 영역(31, 32, 33) 내의 지정 된 픽셀 값을 어느 정도 만족 시켰는지에 의하여 연산 될 수 있다. 이 때, 상기 평가 결과는 만족, 불만족의 2가지로만 결정 되는 것이 아니라, 만족 정도에 따라 조정 수치를 결정할 수 있다. 상기 조정 수치는 + 값을 가질 수도 있고, - 값을 가질 수도 있다. 상기 조정 수치는 기 연산 된 상기 유사도에 반영 된다. 본 실시예에 따르면, 문자 프린트 또는 타각 상태가 비정형적인 경우에도 유연하게 문자 인식이 가능하도록 하는 효과가 있다.According to another embodiment, with the above feature requirement, pixel value matching for a specific area can be used. For example, a character template of the number '0' may include a region 31 requiring a pixel value of '0' (for example, white), a pixel value of '1' (32, 33), which require color (meaning color). At this time, the evaluation result on the feature requirement can be calculated by how much the character area image 35 satisfies the designated pixel value in the area 31, 32, 33. At this time, the result of the evaluation is not determined only as satisfaction or dissatisfaction, but the adjustment value can be determined according to the degree of satisfaction. The adjustment value may have a positive value or a negative value. The adjustment value is reflected in the pre-calculated degree of similarity. According to the present embodiment, there is an effect that the character recognition can be performed flexibly even when the character printing or the embossing state is irregular.

하나의 문자 영역 이미지(35)는 적용 폰트에 포함된 모든 문자에 대하여 상기 유사도를 산정할 수 있다. 그 결과 가장 높은 유사도를 가지는 문자가 상기 문자 영역 이미지에 표시된 문자인 것으로 인식 할 수 있을 것이다(S168).One character area image 35 may calculate the similarity for all characters included in the applied font. As a result, it can be recognized that the character having the highest similarity is the character displayed in the character area image (S168).

다시, 도 2로 돌아가서 설명한다. 상기 설명한 문자 영역 이미지에 대한 문자 인식 동작은 입력 이미지에 포함 된 모든 문자 영역 이미지에 대하여 반복 수행 된다. 그 결과, 입력 이미지에 포함 된 모든 문자를 인식 할 수 있다.Again, referring back to Fig. The character recognition operation for the character area image described above is repeated for all the character area images included in the input image. As a result, all characters included in the input image can be recognized.

본 실시예에 따른 문자 인식 방법은, 입력 이미지의 특징 값에 의하여 적용 폰트를 확인하고, 입력 이미지를 문자 영역 이미지로 분리하며, 각 문자 영역 이미지를 적용 폰트의 문자 템플릿과 일일이 비교하여, 각 문자 템플릿과의 유사도 랭킹을 통해 문자를 인식함으로써, 입력 이미지에 문자가 표시 된 상태가 나쁘더라도 문자를 정확하게 인식 할 수 있는 효과가 있다. 또한, 도트 타입의 문자라도, 에지 추출을 통해 복수의 도트를 정확하게 인식하여 도트 타입의 폰트임을 인지하고, 도트 타입의 폰트 형상 데이터 및 각 문자에 대한 이미지, 특징 요건과의 비교를 통해 문자를 정확히 인식할 수 있는 효과가 있다.The character recognition method according to the present embodiment checks an applied font according to a feature value of an input image, separates an input image into a character area image, compares each character area image with a character template of the applied font, By recognizing characters through similarity ranking with the template, even if the characters are displayed in the input image is poor, the characters can be correctly recognized. In addition, even if a dot type character is recognized by a plurality of dots correctly by edge extraction, it is recognized that the font is a dot type font, and the characters are accurately displayed by comparing dot type font shape data, images for each character, There is a recognizable effect.

인식된 문자 또는 문자열은 디스플레이 되거나, 저장 되거나, 네트워크 인터페이스를 통하여 외부 장치로 송신 될 수 있다(S108). 본 실시예에 따른 문자 인식 방법이 적용 된 장치는 도 6에 도시된 것과 같은 사용자 인터페이스를 제공 할 수 있다. 상기 사용자 인터페이스는 입력 이미지를 확인할 수 있도록 디스플레이 하는 영역(40), 인식 된 결과를 확인할 수 있도록 텍스트가 디스플레이 하는 영역(50) 및 인식 된 결과를 처리 하기 위한 메뉴 영역(60)을 포함할 수 있다. 사용자는 상기 사용자 인터페이스를 통해, 자재 표면에 표기 되거나 타각 된 자재 관련 정보의 촬영, 인식, 인식 결과 관리까지를 일괄적으로 처리할 수 있다.The recognized character or string may be displayed, stored, or transmitted to an external device via a network interface (S108). The apparatus to which the character recognition method according to the present embodiment is applied can provide a user interface as shown in FIG. The user interface may include an area 40 for displaying an input image so that the input image can be recognized, an area 50 for displaying the text to identify the recognized result, and a menu area 60 for processing the recognized result . Through the user interface, the user can collectively process the shooting, recognition, and recognition result management of the material-related information displayed on the surface of the material or embossed.

도 7을 참조하여 본 발명의 다른 실시예에 따른 문자 인식 장치를 설명하기로 한다. 본 실시예에 따른 문자 인식 장치(70)는 파지가 편리하도록 손잡이가 구비 된 핸드헬드(hand-held) 단말일 수 있다. 문자 인식 장치(70)에는 이미지를 센싱하는 이미지 센싱부(71) 및 도 6에 도시 된 것과 같은 사용자 인터페이스를 디스플레이 하는 내장 디스플레이(72) 및 제어부(미도시), 저장부(미도시), 네트워크 인터페이스(미도시)를 포함할 수 있다. 상기 제어부에는 도 2 내지 도 5를 참조하여 설명한 방법을 수행하는 컴퓨터 프로그램을 실행 하는 프로세서 및 상기 컴퓨터 프로그램의 실행 코드를 저장하는 저장 수단이 구비 된다.The character recognition apparatus according to another embodiment of the present invention will be described with reference to FIG. The character recognition device 70 according to the present embodiment may be a hand-held terminal having a handle for easy gripping. The character recognition apparatus 70 is provided with an image sensing unit 71 for sensing an image and a built-in display 72 for displaying a user interface such as shown in Fig. 6, a control unit (not shown), a storage unit And an interface (not shown). The control unit is provided with a processor for executing a computer program for performing the method described with reference to FIGS. 2 to 5 and a storage unit for storing an execution code of the computer program.

도 8을 참조하여 본 발명의 또 다른 실시예에 따른 문자 인식 장치를 설명하기로 한다. 본 실시예에 따른 문자 인식 장치(100)는, 예를 들어 노트북 피씨, 데스크톱 피씨, 태블릿 피씨, 스마트 폰 등의 단말 장치일 수 있다. 본 실시예에 따른 문자 인식 장치(100)는, 이미지 획득부(102), 비휘발성 메모리 또는 HDD 등의 보조 저장 장치로 구성 되는 저장부(104), 외부 장치와 네트워크를 통하여 연결 되어 데이터를 송수신 하는 네트워크 인터페이스(108), 디스플레이(110) 및 문자 인식부(106)를 포함할 수 있다.A character recognition apparatus according to another embodiment of the present invention will be described with reference to FIG. The character recognition apparatus 100 according to the present embodiment may be a terminal device such as a notebook PC, a desktop PC, a tablet PC, or a smart phone. The character recognition apparatus 100 according to the present embodiment includes a storage unit 104 composed of an image acquisition unit 102, a nonvolatile memory or an auxiliary storage device such as a HDD, a storage unit 104 connected to an external device via a network, A display 110, and a character recognition unit 106. The network interface 108 may include a network interface 108, a display 110,

이미지 획득부(102)는, 예를 들어 렌즈와 이미지 센서 및 구동 컨트롤러를 포함할 수 있다. 이미지 획득부(102)는 렌즈를 통해 인입 된 광 신호를 디지털 데이터로 변환하여, 입력 이미지를 생성하여 문자 인식부(106)에 제공 한다.The image acquiring unit 102 may include, for example, a lens, an image sensor, and a drive controller. The image acquiring unit 102 converts the optical signal input through the lens into digital data, generates an input image, and provides the input image to the character recognizing unit 106.

문자 인식부(106)에는 도 2 내지 도 5를 참조하여 설명한 방법을 수행하는 컴퓨터 프로그램을 실행 하는 프로세서 및 상기 컴퓨터 프로그램의 실행 코드를 저장하는 저장 수단이 구비 되며, 이미지 획득부(102)로부터 제공 된 입력 이미지에서 문자를 인식하여, 인식 된 문자를 가리키는 디지털 출력 데이터를 생성한다. 문자 인식부(106)는 상기 디지털 출력 데이터가 저장 되도록 저장부(104)에 저장하거나, 상기 디지털 출력 데이터가 디스플레이 되도록 디스플레이(110)에 제공하거나, 상기 디지털 출력 데이터가 외부 장치에 송신 되도록 네트워크 인터페이스(108)에 제공할 수 있다.The character recognition unit 106 is provided with a processor that executes a computer program that performs the method described with reference to FIGS. 2 to 5 and a storage unit that stores an execution code of the computer program. Recognizes the character in the input image, and generates digital output data indicating the recognized character. The character recognition unit 106 stores the digital output data in the storage unit 104 so that the digital output data is stored or provides the digital output data to the display 110 so that the digital output data is displayed, (108).

도 9를 참조 하여 본 발명의 또 다른 실시예에 따른 문자 인식 장치를 설명하기로 한다. 도 9에 도시 된 문자 인식 장치(106)는 도 8에 도시 된 문자 인식 장치에 하나의 구성 요소로 채용 될 수도 있다. 이에 따라, 도 8에는 문자 인식부(106)로 지칭 되어 설명 된 바 있다. 도 9에 도시 된 바와 같이, 본 실시예에 따른 문자 인식 장치(106)는 이미지 특징 추출부(160), 폰트 저장부(161), 폰트 결정부(162), 문자 영역 구획부(163), 이미지 보정부(164) 및 문자 결정부(165)를 포함할 수 있다.A character recognition apparatus according to another embodiment of the present invention will be described with reference to FIG. The character recognition apparatus 106 shown in Fig. 9 may be employed as one component in the character recognition apparatus shown in Fig. Accordingly, the character recognition unit 106 has been described with reference to FIG. 9, the character recognition apparatus 106 according to the present embodiment includes an image feature extraction unit 160, a font storage unit 161, a font determination unit 162, a character region dividing unit 163, An image correction unit 164 and a character determination unit 165. [

이미지 특징 추출부(160)는 입력 이미지를 제공 받아 특징 값을 추출한다. 상기 특징 값은, 예를 들어 에지, 색상에 대한 히스토그램 등일 수 있다.The image feature extraction unit 160 extracts a feature value by receiving an input image. The feature value may be, for example, an edge, a histogram for a color, or the like.

폰트 저장부(161)는 인식을 지원하는 폰트에 대하여, 폰트 형상 정보 및 각 문자의 템플릿을 저장한다. 상기 문자 템플릿은, 문자의 이미지 및 문자의 특징 요건에 대한 정보를 포함한다. 상기 폰트 형상 정보에는 폰트 타입, 자간, 장평, 문자 크기 정보, 획 굵기 정보(도트 타입 폰트의 경우, 도트 굵기 정보) 등이 포함 될 수 있다. 상기 폰트 타입은 도트 타입인지 여부를 가리킨다.The font storage unit 161 stores font shape information and a template of each character for a font that supports recognition. The character template includes information on characteristics of character images and characters. The font shape information may include a font type, a character spacing, a major plane, character size information, stroke information (dot thickness information in the case of a dot type font), and the like. The font type indicates whether it is a dot type.

폰트 결정부(162)는 상기 특징 값과 폰트 저장부(161)로부터 제공 받은 각 폰트의 폰트 형상 정보를 이용하여 상기 입력 이미지에 표시 된 문자에 대한 적용 폰트를 결정한다.The font determination unit 162 determines an applied font for the character displayed on the input image using the feature value and the font shape information of each font provided from the font storage unit 161. [

문자 영역 구획부(163)는 상기 적용 폰트의 폰트 형상 정보를 이용하여, 상기 입력 이미지를 각각의 문자 영역 이미지로 구분 한다.The character area dividing unit 163 divides the input image into the respective character area images using the font shape information of the applied font.

이미지 보정부(164)는 상기 문자 영역 이미지에 대하여 소정의 화질 보정용 이미지 프로세싱을 수행하고, 이미지 프로세싱이 완료된 상기 문자 영역 이미지를 이진화(binarization)를 수행하며, 이진화 된 상기 문자 영역 이미지를 문자 결정부(165)에 제공한다.The image correcting unit 164 performs predetermined image quality correction on the character area image, performs binarization of the character area image on which the image processing is completed, and outputs the binarized character area image to the character decision unit (Not shown).

문자 결정부(165)는 상기 문자 영역 이미지를 상기 적용 폰트의 각 문자 템플릿과 비교하여 상기 문자 영역 이미지의 각 문자 템플릿에 대한 유사도를 연산하고, 상기 유사도를 이용하여 상기 문자 영역 이미지에 대응하는 문자를 결정한다.The character determination unit 165 compares the character region image with each character template of the applied font to calculate a similarity degree to each character template in the character region image, and calculates a character corresponding to the character region image .

도 9의 각 구성요소는 소프트웨어(software) 또는, FPGA(field-programmable gate array)나 ASIC(application-specific integrated circuit)과 같은 하드웨어(hardware)를 의미할 수 있다. 그렇지만 상기 구성요소들은 소프트웨어 또는 하드웨어에 한정되는 의미는 아니며, 어드레싱(addressing)할 수 있는 저장 매체에 있도록 구성될 수도 있고 하나 또는 그 이상의 프로세서들을 실행시키도록 구성될 수도 있다. 상기 구성요소들 안에서 제공되는 기능은 더 세분화된 구성요소에 의하여 구현될 수 있으며, 복수의 구성요소들을 합하여 특정한 기능을 수행하는 하나의 구성요소로 구현할 수도 있다.Each component in FIG. 9 may refer to software or hardware such as a field-programmable gate array (FPGA) or an application-specific integrated circuit (ASIC). However, the components are not limited to software or hardware, and may be configured to be in an addressable storage medium and configured to execute one or more processors. The functions provided in the components may be implemented by a more detailed component or may be implemented by a single component that performs a specific function by combining a plurality of components.

도 10은 본 발명의 또 다른 실시예에 따른 문자 인식 장치의 하드웨어 구성을 나타낸 도면이다. 도 9의 문자 인식 장치(106)는 도 10에 도시 된 것과 같은 하드웨어 구조로 구현 될 수 있다.10 is a diagram illustrating a hardware configuration of a character recognition apparatus according to another embodiment of the present invention. The character recognition apparatus 106 of FIG. 9 can be implemented in a hardware structure as shown in FIG.

도 10을 참조하면, 본 실시예에 따른 데이터 처리 장치(200)는 시스템 버스(202), 프로세서(212), RAM(Random Access Memory)(210), 스토리지(204), 외부 장치와의 통신 또는 입출력 장치와의 연결을 위한 인터페이스부(206) 및 디스플레이(208)를 포함할 수 있다. 본 발명에 따른 문자 인식 방법을 구현하는 컴퓨터 프로그램 코드는 스토리지(204)에 저장되어 RAM(210)에 로드 되고 프로세서(212)에 의해 실행될 수 있다. 상기 컴퓨터 프로그램 코드의 실행 결과, 문자 인식 결과를 가리키는 출력 데이터가 RAM(210) 또는 스토리지(204)에 저장 되거나, 네트워크 인터페이스(206)를 통하여 외부 장치에 송신 되거나, 디스플레이(208)를 통해 표시 될 수 있다.10, the data processing apparatus 200 according to the present embodiment includes a system bus 202, a processor 212, a random access memory (RAM) 210, a storage 204, An interface unit 206 for connection with the input / output device, and a display 208. [ The computer program code embodying the character recognition method according to the present invention may be stored in the storage 204, loaded into the RAM 210 and executed by the processor 212. [ As a result of execution of the computer program code, output data indicating a character recognition result is stored in the RAM 210 or the storage 204, transmitted to an external device via the network interface 206, or displayed via the display 208 .

지금까지 도 1 내지 도 10을 참조하여 설명된 본 발명의 개념은 컴퓨터가 읽을 수 있는 매체 상에 컴퓨터가 읽을 수 있는 코드로 구현될 수 있다. 상기 컴퓨터로 읽을 수 있는 기록 매체는, 예를 들어 이동형 기록 매체(CD, DVD, 블루레이 디스크, USB 저장 장치, 이동식 하드 디스크)이거나, 고정식 기록 매체(ROM, RAM, 컴퓨터 구비 형 하드 디스크)일 수 있다. 상기 컴퓨터로 읽을 수 있는 기록 매체에 기록 된 상기 컴퓨터 프로그램은 인터넷 등의 네트워크를 통하여 다른 컴퓨팅 장치에 전송되어 상기 다른 컴퓨팅 장치에 설치될 수 있고, 이로써 상기 다른 컴퓨팅 장치에서 사용될 수 있다.The concepts of the invention described above with reference to Figures 1 to 10 can be implemented in computer readable code on a computer readable medium. The computer readable recording medium may be, for example, a removable recording medium (CD, DVD, Blu-ray disk, USB storage device, removable hard disk) . The computer program recorded on the computer-readable recording medium may be transmitted to another computing device via a network such as the Internet and installed in the other computing device, thereby being used in the other computing device.

이상 첨부된 도면을 참조하여 본 발명의 실시예들을 설명하였지만, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자는 본 발명이 그 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 실시될 수 있다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다.While the present invention has been described in connection with what is presently considered to be practical exemplary embodiments, it is to be understood that the invention is not limited to the disclosed embodiments, but, on the contrary, You will understand. It is therefore to be understood that the above-described embodiments are illustrative in all aspects and not restrictive.

Claims

Extracting a feature value from an input image;
Determining an applied font for a character displayed on the input image using the feature value and font shape information of each font;
Dividing the input image into respective character area images using font shape information of the applied font;
Comparing the character area image with each character template of the applied font to calculate a similarity degree to each character template of the character area image; And
And determining the character corresponding to the character area image using the similarity.
Character recognition method.

The method according to claim 1,
Wherein determining the applied font comprises:
Determining a size of each character displayed on the input image using the extracted feature value and a thickness of a stroke constituting the character; And
Determining a font having the font shape information corresponding to the size of the character and the thickness of the stroke as the applied font,
Character recognition method.

3. The method of claim 2,
Wherein the step of extracting feature values from the input image comprises:
Extracting an edge from the input image,
Wherein the step of determining the size of each character displayed in the input image using the feature value and the thickness of the stroke constituting the character,
Noise processing the edge having a length less than a predetermined minimum length among the extracted edges; And
And determining the size of each character and the thickness of the stroke using the coordinates of the pixels constituting each edge that are not noise-
Character recognition method.

The method of claim 3,
The step of extracting the edge comprises:
Generating a color histogram of the input image; And
And extracting the edge from the pixels constituting the input image, the candidate pixel having a color value occupying a ratio equal to or greater than a predetermined value in the histogram and adjacent pixels thereof,
Character recognition method.

The method according to claim 1,
Wherein the step of extracting feature values from the input image comprises:
Extracting an edge from the input image,
Wherein determining the applied font comprises:
Determining a dot type font if there are more than a predetermined number of circular edges among the extracted edges;
Character recognition method.

6. The method of claim 5,
Wherein determining the applied font comprises:
Grouping adjacent circular edges;
Determining the size of each character using coordinates of pixels constituting each circular edge belonging to a group;
Determining the thickness of the dot using the diameter of each circular edge; And
Determining a dot type font having the font shape information corresponding to the size of the character and the thickness of the dot as the application font,
Character recognition method.

The method according to claim 1,
The step of calculating the degree of similarity may include:
And matching the size of the character area image with the size of the image of each character template of the applied font.
Character recognition method.

The method according to claim 1,
The step of calculating the degree of similarity may include:
Computing a pixel value matching ratio between the character area image and the image of each character template of the applied font to calculate the similarity; And
And adjusting the similarity by whether the character area image satisfies a feature requirement of each character template of the applied font.
Character recognition method.

9. The method of claim 8,
The character template of the applied font has the first feature requirement and the second feature requirement,
Wherein the step of adjusting the degree of similarity comprises:
Wherein the character region image does not satisfy the first feature requirement, subtracts the first value from the calculated similarity, and when the character region image does not satisfy the second feature requirement, Subtracting the figures,
Character recognition method.

9. The method of claim 8,
The feature requirement is that the pixel value of a specific region must satisfy,
Character recognition method.

The method according to claim 1,
Wherein the distinguishing step comprises:
Setting a character separation guideline on the input image using the character spacing and the length of the font characteristic information of the applied font; And
And separating the input image into respective character area images based on the character separation guideline.
Character recognition method.

Dividing an input image into respective character area images using font feature information;
Comparing the character region image with each character template to calculate a similarity degree to each character template of the character region image; And
Determining a character corresponding to the character area image using the similarity,
The step of calculating the degree of similarity may include:
And reflecting whether the character area image satisfies a feature requirement of each character template.
Character recognition method.

A computer program for performing the method of any one of claims 1 to 12, wherein the computer program is recorded.

An image feature extraction unit for extracting a feature value from an input image;
A font determining unit for determining an applied font for a character displayed on the input image using the feature value and font shape information of each font;
A character region dividing unit for dividing the input image into respective character region images using font shape information of the applied font; And
And a character determination unit for comparing the character region image with each character template of the applied font to calculate a similarity degree for each character template in the character region image and determining a character corresponding to the character region image using the similarity degree doing,
Character recognition device.

15. The method of claim 14,
An image correction unit that performs predetermined image quality correction for the character area image, performs binarization of the character area image that has been subjected to image processing, and provides the character area image that is binarized to the character determination unit Further included,
Character recognition device.