KR940007615B1

KR940007615B1 - Candidate character classifying method using left and right distance shape in recognizing character

Info

Publication number: KR940007615B1
Application number: KR1019920021515A
Authority: KR
Inventors: 최재균
Original assignee: 주식회사 금성사; 이헌조
Priority date: 1992-11-16
Filing date: 1992-11-16
Publication date: 1994-08-22
Also published as: KR940012182A

Abstract

내용 없음.No content.

Description

Candidate Character Classification Using Left and Right Distances in Character Recognition

제 1 도는 통계적 문자인식방법에 사용되는 8×8메쉬의 예를 나타낸 도면.1 is a diagram showing an example of an 8x8 mesh used in the statistical character recognition method.

제 2 도는 통계적 문자인식방법에 사용되는 문자에 외접하는 틀을 나타낸 도면.2 is a diagram showing a frame circumscribed to a character used in a statistical character recognition method.

제 3 도는 구조적 문자인식시 두께를 갖는 문자를 두께 1인 문자로 세선화하는 방법을 나타낸 도면.3 is a diagram illustrating a method of thinning a character having a thickness when the character is structurally recognized into a character having a thickness of 1;

제 5 도는 본 발명의 좌, 우 거리모양을 이용한 후보문자 분류동작시 문자의 특징 추출을 설명하기 위한 플로우 챠트.5 is a flowchart for explaining feature extraction of characters during candidate character classification operation using left and right distance shapes of the present invention.

제 6 도는 좌, 우측에서 본 거리를 설명하기 위한 도면.6 is a view for explaining the distance seen from the left and the right.

제 7 도는 본 발명에 따른 구간 판정의 예를 나타낸 도면.7 is a diagram showing an example of section determination according to the present invention.

제 8 도는 본 발명에 따른 분류나무의 제 1 예를 나타낸 도면.8 is a view showing a first example of a classification tree according to the present invention.

제 9 도는 본 발명의 좌, 우 거리모양을 이용한 후보문자 분류장치의 구성도.9 is a block diagram of a candidate character classification apparatus using left and right distance shapes of the present invention.

* 도면의 주요부분에 대한 부호의 설명* Explanation of symbols for main parts of the drawings

11 : 이미지 스캐너 12 : 인터페이스 메모리11: image scanner 12: interface memory

13 : 롬 14 : 기준 데이타 제공부13: Romans 14: Reference data provider

15 : 분류 데이타 제공부 16 : 중앙처리장치15: classification data providing unit 16: central processing unit

본 발명은 문자인식에 있어서 후보문자를 분류하기 위한 방법에 관한 것으로서, 특히 문자의 좌우에서 바라본 문자의 모양으로 1차 후보문자를 프로젝션 프로파일(Projection Profile)기법을 사용하여 2차 후보문자를 분류하여 정확한 후보문자를 분류하고 높은 인식률을 얻을 수 있는 후보문자 분류방법에 관한 것이다.The present invention relates to a method for classifying candidate characters in character recognition, and in particular, by classifying secondary candidate characters by using a projection profile technique, the primary candidate characters in the shape of the characters viewed from the left and right of the characters. The present invention relates to a candidate character classification method capable of classifying accurate candidate characters and obtaining a high recognition rate.

일반적으로, 문자인식방법은 크게 2가지로 분류할 수 있는데, 그 하나는 문자의 통계적 특징을 이용하여 문자를 분류하고 인식하는 통계적인 방법이고, 다른 하나는 문자를 이루고 있는 구조적 특성을 이용하여 인식하는 구조적인 인식방법이 있다.In general, the character recognition method can be largely classified into two types, one of which is a statistical method of classifying and recognizing characters using the statistical characteristics of the character, and the other of the character recognition methods using the structural characteristics of the characters. There is a structural recognition method.

전자의 통계적 문자인식방법은 대부분 제 1 도에 도시된 바와 같은 메쉬(Mesh)를 사용하여 각 메쉬내에 포함되어 있는 화소의 분포, 예를 들어 화소수, 분포 밀도 등을 이용하여 모든 문자에 대해 통계치를 구하고, 구해진 통계치에 따라 후보문자를 분류하며, 분류된 같은 문자군에 포함된 글자들 중에서 가장 유사한 문자를 선택하여 인식하는 방법이다.The former statistical character recognition method mostly uses a mesh as shown in FIG. 1, and uses statistical meshes for all characters by using the distribution of pixels included in each mesh, for example, the number of pixels and the distribution density. The candidate characters are classified according to the obtained statistical values, and the most similar characters are selected and recognized among the characters included in the same classified character group.

통계적 문자인식방법에 사용되는 메쉬란 제 2 도에 도시된 바와같이 입력된 문자에 외접하는 틀(Box)을 만들어 주고, 이를 기준으로 등간격으로 좌, 우 및 상, 하방향으로 나누었을 때 이루는 조그만 격자들의 집합이다.The mesh used in the statistical character recognition method creates a box that circumscribes the inputted characters as shown in FIG. 2, and is divided into equally spaced left, right, up, and down directions. It is a collection of small grids.

각 메쉬의 위치를 (i, j)i=0, 1, 2,..., n j=0, 1, 2,..., m으로 하였을 때 모든 문자의 각(i, j) 위치에 대해서 그 통계치를 구한다.For each mesh's position (i, j) i = 0, 1, 2, ..., nj = 0, 1, 2, ..., m Get the statistics.

예를들면, 모든 인식대상 문자에서 (1, 1)위치의 메쉬 평균값을 A라 할때, 입력된 문자의 메쉬(1, 1)의 값이 메쉬(1, 1)의 평균값 A보다 크면, 특정값(1, 1)=1, 작으면 특정값(1, 1)=0으로 분류한다.For example, if the mesh mean value of the position (1, 1) is A in all the recognition target characters, and the value of the mesh (1, 1) of the input character is larger than the mean value A of the mesh (1, 1), If the value (1, 1) = 1, the value is small, the specific value (1, 1) = 0 is classified.

만일 8×8의 메쉬를 정의하였다면 한 문자에 대한 특징은 8×8, 즉 64개의 특징(Feature)이 나오고 이 특징을 이용하여 후보문자의 분류가 가능하다.If 8 × 8 meshes are defined, the feature for one character is 8 × 8, that is, 64 features are used, and candidate characters can be classified using this feature.

후자의 구조적 인식방법은 문자를 이루고 있는 획구조를 이용하여 인식하는 방법으로, 가장 널리 쓰이는 방법으로 세선화(Thinning 또는 Skeletoning) 방법을 사용하여 획의 방향, 크기, 위치, 획간의 연결도 등을 고려하여 문자를 인식하는 방법이다.The latter structural recognition method recognizes using the stroke structure of letters. The most widely used method is thinning (thinning or skeletoning) method, and the direction of stroke, size, position, connection between strokes, etc. It is a method of recognizing a character in consideration.

즉, 먼저 제 3 도에 도시된 바와같이 두께를 가진 문자를 두께 1인 문자로 세선화하고, 세선화된 문자의 각각이 획의 길이와 방향을 제 4 도에 도시된 바와같이 8방향 코드로 정의해서 코드를 추출하며, 그 코드에 맞는 문자로 최종적으로 인식한다.That is, first, the character having a thickness as shown in FIG. 3 is thinned to a character having a thickness of 1, and each of the thinned characters is set to an eight-way code as shown in FIG. The code is defined and extracted, and finally recognized as a character that matches the code.

상기한 문자인식방법에 있어서, 통계적 문자인식방법은 인식문자가 복잡하고 독취한 문자의 영상에 잡음이 있더라도 비교적 인식이 잘되지만 폰트(Font)가 달라짐에 따라 인식율이 현저히 저하되는 단점이 있었다.In the above character recognition method, the statistical character recognition method is relatively well recognized even if the recognition character is complicated and there is noise in the image of the read character, but the recognition rate is significantly reduced as the font is changed.

또한 인식에 필요한 기준자료(reference file)가 많이 필요하여 소요되는 메모리가 많고 유사한 문자가 입력되었을 때 혼동될 확률도 높다.In addition, it requires a lot of reference files for recognition, so it takes a lot of memory and is likely to be confused when similar characters are entered.

한편, 구조적 문자인식방법은 문자의 구성을 이루는 획을 추출함으로써 인식하기 때문에 폰트에 구애됨없이 비교적 높은 인식률을 얻을 수 있으나 문자가 복잡한 경우에 획을 추출하기에 어려움이 많고 세선화를 하는 경우 시간이 많이 소요되는 단점이 있었다.On the other hand, the structural character recognition method recognizes by extracting the strokes that make up the characters, so that a relatively high recognition rate can be obtained regardless of the font, but it is difficult to extract the strokes when the characters are complex, and time is thinning. This had a lot of drawbacks.

따라서 본 발명의 목적은 문자의 좌우에서 바라본 문자의 모양으로 1차 후보문자를 분류하고 프로젝션 프로파일 기법을 사용하여 2차 후보문자를 분류하여 정확한 후보문자를 분류하고 높은 인식률을 얻을 수 있는 후보문자 분류방법을 제공함에 있다.Therefore, the object of the present invention is to classify the candidate candidates in the shape of the characters viewed from the left and right of the characters and to classify the candidate candidates using the projection profile technique to classify the candidate candidates and obtain a high recognition rate. In providing a method.

이와같은 본 발명의 목적은 입력한 문자에 외접하는 틀을 만들어 주기 위한 단계와, 틀의 좌, 우측으로부터 처음으로 문자와 만날때까지의 거리를 구하는 단계와, 이웃하는 좌측간 및 우측간의 거리를 비교하여 좌, 우측 거리의 변화량을 구하는 단계와, 상기 변화량에 따라 직선, 증가 및 감소 또는 +점프 및 -점프구간을 판정하는 단계와, 상기 판정된 구간을 근거로 하여 문자의 특징을 추출하는 단계와, 추출된 문자의 특징에 따라 분류 나무를 구성하여 후보문자를 분류하는 단계로 이루어짐으로써 달성되는 것으로, 이하 본 발명의 실시예를 첨부된 도면에 의거하여 상세히 설명한다.The object of the present invention as described above is to compare the distance between the left and right sides of the step of making a frame to circumscribe the input character, the distance from the left and right sides of the frame to the first encounter with the character, Determining a change amount of the left and right distances; determining a straight line, an increase and a decrease, or a + jump and a-jump section according to the change amount; and extracting a feature of a character based on the determined section; , Which is achieved by forming a classification tree according to the characteristics of the extracted characters to classify the candidate characters. Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

본 발명의 좌, 우 거리모양을 이용한 후보문자 분류장치를 도시한 것으로서, 크게 좌우 거리특징에 의해서 각 구간을 나누고 그 구간의 모양에 적합한 코드를 부여하는 특징 추출부와, 이미 추출된 여러개의 특징을 합쳐서 어떤 문자군에 있는지를 판단하는 특징 합성부인 알고리즘을 바탕으로 구성한 하드웨어 모듈인 프로세싱부로 구성되었다.A candidate character classification apparatus using the left and right distance shapes of the present invention is shown. The feature extraction unit divides each section by a left and right distance feature and gives a code suitable for the shape of the section. It consists of a processing module, which is a hardware module based on an algorithm, which is a feature synthesizing unit that determines which character group is combined.

제 9 도는 본 발명의 좌, 우 거리모양을 이용한 후보문자 분류장치의 구성도를 도시한 것으로서, 입력된 문자의 영상을 스캐닝하기 위한 이미지 스캐너(Image scanner)(11)와, 이미지 스캐너(11)를 통해 입력된 문자의 영상 데이타를 저장하기 위한 인터페이스 메모리(Interface memory)(12)와, 본 발명의 좌, 우 거리모양을 이용한 후보문자 분류방법에 대한 알고리즘이 내장되어 있는 롬(13)과, 문자의 특징 추출시 기준데이타를 제공하기 위한 기준 데이타 제공부(14)와, 문자분류시 문자분류에 대한 데이타를 제공하기 위한 문자분류 데이타 제공부(15) 및 롬(13)내에 저장된 알고리즘을 수행하여 인터페이스 메모리(12)로부터 한 문자씩 가져와 기준 데이타 제공부(14)의 기준 데이타와 비교하여 입력된 문자의 특징을 추출하고 추출된 특징을 합성하여 분류데이타 제공부(15)로부터 제공되는 분류데이타로부터 문자를 분류하기 위한 중앙처리장치(16)로 구성되었다.9 is a block diagram of a candidate character classification apparatus using left and right distances of the present invention, and includes an image scanner 11 and an image scanner 11 for scanning an image of an input character. An interface memory 12 for storing image data of a character input through the ROM, a ROM 13 having a built-in algorithm for a candidate character classification method using the left and right distance shapes of the present invention; Perform a reference data providing unit 14 for providing reference data when character features are extracted, and an algorithm stored in the character classification data providing unit 15 and ROM 13 for providing data on character classification during character classification. By taking characters one by one from the interface memory 12, extracting the characteristics of the input characters by comparing them with the reference data of the reference data providing unit 14, and synthesizing the extracted features to provide classification data. It was composed of a central processing unit (16) for classifying the characters from the classification data provided from the 15.

본 발명의 좌, 우 거리모양을 이용한 후보문자 분류방법은 크게 입력된 문자의 특징을 추출하는 단계와 추출된 문자의 특징을 합성하여 문자를 분류하는 단계로 이루어졌다.The candidate character classification method using the left and right distance shapes of the present invention consists of extracting the characteristics of the input characters largely and classifying the characters by synthesizing the characteristics of the extracted characters.

문자의 특징을 추출하는 단계는 입력된 문자에 외접하는 틀을 만들어 주기 위한 단계와, 틀의 좌, 우측으로부터 처음으로 문자와 만날때까지의 거리를 구하는 단계와, 이웃하는 거리간의 차를 구하여 거리의 변화량을 구하는 단계와, 변화량에 따라 직선구간, 증가 및 감소구간 또는 +, - 점프구간을 판정하는 단계로 이루어졌으며, 문자를 분류하는 단계는 추출된 문자의 특징, 즉 판정된 구간에 따라 분류나무(Tree)를 구성하는 단계와, 이에 따라 문자를 분류하는 단계로 이루어졌다.The step of extracting the feature of the character comprises the steps of creating a frame that circumscribes the input character, obtaining a distance from the left and right sides of the frame to the first encounter with the character, and obtaining a difference between neighboring distances. Comprising a step of determining the amount of change, and the step of determining the straight line, increase and decrease sections or +,-jump section according to the change amount, the step of classifying the character is classified according to the characteristics of the extracted character, that is, the determined section (Tree) is constituted, and accordingly, the characters are classified.

본 발명의 후보문자 분류방법중 먼저 문자의 특징을 추출하는 동작을 설명한다.Among the candidate character classification methods of the present invention, an operation of extracting a feature of a character will be described first.

문자를 구성하는 특징은 여러가지가 있으나 본 발명에서 사용한 특징은 좌우거리를 이용하였는데, 좌, 우거리를 측정하여 구간을 판정하기 위해서는 다음과 같은 순서에 따라 구해진다.Characters constituting the character is various, but the features used in the present invention used the left and right distance, in order to determine the section by measuring the left, right distance is obtained in the following order.

(단계 1) 먼저, 문자에 외접하는 틀을 제 2 도와 같이 구한다.(Step 1) First, a frame for circumscribing a character is obtained as shown in the second diagram.

(단계 2) 상기 만들어진 틀을 등간격으로 횡방향으로 나누어주고, 틀의 상부에서 하부방향으로 좌측 또는 우측에서 처음으로 문자와 만나는 거리를 제 6 도에 도시된 바와같은 방법으로 계산하여 저장한다.(Step 2) The resulting frame is divided in the transverse direction at equal intervals, and the distance that meets the character for the first time from the left or right in the upper direction to the lower direction of the frame is calculated and stored in the manner as shown in FIG.

이때, 좌측에서 본 거리 특징은 PL[i]라 하고, 우측에서 본 거리 특징을 PR[i]라 하며, i값은 문자의 높이에 해당한다.At this time, the distance feature seen from the left is called PL [i], the distance feature seen from the right is called PR [i], and the i value corresponds to the height of the character.

즉, 문자의 높이가 20화소이면 20까지 좌측 및 우측에서 본 거리 PL[i]와, PR[i]를 각각 계산한다.That is, if the height of the character is 20 pixels, the distances PL [i] and PR [i] viewed from the left and the right to 20 are calculated, respectively.

(단계 3) 좌측 및 우측에서 본 거리를 계산한 후, PL[i]와 이웃하는 PL[i]간의 차이 PDL[i] 및 PR[i]와 이웃하는 PR[i]간의 차이 PDR[i]를 계산하고, 그 차이를 기록한다.(Step 3) After calculating the distances seen from the left and the right, the difference PDL [i] between PL [i] and neighboring PL [i] and the difference PDR [i] between PR [i] and neighboring PR [i] Calculate and record the difference.

즉, (단계 3)은 (단계 2)에서 구해진 좌, 우거리를 기준으로 그 값의 변화량을 결정하기 위한 단계로서, 이웃하는 거리간의 차이 PDL[i]와 PDR[i]은 각각 변화량을 의미하는 것으로 식(1)과 같이 표현할 수 있다.That is, (step 3) is a step for determining the amount of change of the value based on the left and right distances obtained in (step 2), and the difference PDL [i] and PDR [i] between neighboring distances respectively indicate the amount of change. It can be expressed as formula (1).

PDL[i] = PL[i] - PL[i - 1]················(1)PDL [i] = PL [i]-PL [i-1] ·············· (1)

PDR[i] = PR[i] - PR[i - 1]PDR [i] = PR [i]-PR [i-1]

이때, 변화량이 "+"이면 거리가 증가하고 있는 것을 의미하고, "-"이면 거리가 감소하고 있는 것을 의미하며, "0"이면 거리의 증감이 없는 것을 각각 의미한다.At this time, if the change amount is "+", it means that the distance is increasing, "-" means that the distance is decreasing, and "0" means that there is no increase or decrease of the distance.

또한, 변화량의 크기가 일정치이상으로 크기나 작은 것은 거리가 일정이상으로 점프(Jump)하고 있음을 의미한다.In addition, if the magnitude of the change amount is larger or smaller than a certain value, it means that the distance is jumped by a certain amount or more.

(단계 4) 이어서 (단계 3)에서 추출된 PDL[i]과 PDR[i]을 이용하여 구간을 결정한다.(Step 4) Then, the interval is determined using PDL [i] and PDR [i] extracted in (Step 3).

즉, 증가구간, 감소구간 및 직선구간을 결정하고 특히 증가나 감소가 어떤 임계치(Threshold value) 이상으로 큰 구간이 발견되면 점프구간으로 결정한다.That is, the increase section, the decrease section, and the straight section are determined, and in particular, when a section in which the increase or decrease is larger than a certain threshold value is determined, the jump section is determined.

따라서, 최종적으로 추출될 수 있는 특징은 다음과 같이 정의될 수 있다.Therefore, the features that can be finally extracted can be defined as follows.

① 만약 PDL[i], PDR[i]가 +임계치 보다 작은 경우에는 -점프구간으로 판정한다.① If PDL [i], PDR [i] is less than + threshold, it is determined as minus jump section.

② 만약 PDL[i], PDR[i]가 -임계치 보다 작은 경우에는 -점프구간으로 판정한다.② If PDL [i], PDR [i] is less than -threshold, it is determined as -jump section.

③ 만약 PDL[i], PDR[i]가 0보다 큰 경우에는 +증가구간으로 판정한다.③ If PDL [i] or PDR [i] is greater than 0, it is determined as + increment.

④ 만약 PDL[i], PDR[i]가 0보다 작은 경우에는 -감소구간으로 판정한다.④ If PDL [i], PDR [i] is less than 0, it is determined as -reduction section.

⑤ 만약 PDL[i], PDR[i]가 0과 같은 경우에는 직선구간으로 판정한다.⑤ If PDL [i], PDR [i] is equal to 0, it is determined as a straight line section.

제 7 도는 후보문자 인식시 구간판정에 대한 일례를 도시한 것으로서, 좌측에서 본 거리에 대한 구간 판정만을 예로서 도시하였다.FIG. 7 illustrates an example of section determination in recognition of a candidate character, and shows only section determination for a distance viewed from the left as an example.

문자의 높이(i)가 12라고 가정할 때, 좌측에서 본 거리의 변화량 PDL[i]은 식(1)에 의하여 구하여진다.Assuming that the height i of the character is 12, the change amount PDL [i] of the distance seen from the left side is obtained by equation (1).

그러므로, 좌측에서 본 거리의 변화량 PDL[i]을 구한 다음 구간을 판정하게 되는데, i가 0에서 1까지의 구간은 변화량이 0이므로 직선구간으로 판정하고, i가 1에서 6까지의 구간에서는 변화량이 +로 되므로 증가구간으로 판정하며, i가 6에서 7까지이 구간은 변화량이 +임계치 이상으로 변하므로 +점프구간으로 판정하고, i가 7에서 8까지의 구간은 변화량이 -임계치 이상으로 변하므로 -점프구간으로 판정하며, i가 8에서 11까지의 구간은 변화량이 -로 되므로 감소구간으로 각각 판정을 하게 된다.Therefore, after determining the change amount PDL [i] of the distance seen from the left side, the section is determined. In the section from 0 to 1, the section is determined to be a straight section because the change amount is 0, and in the section from 1 to 6, the change amount is determined. Since it becomes +, it is determined to be an increase section, and i is determined to be + jump section because the change amount is greater than + threshold value from 6 to 7, and the range from i to 7 to 8 is changed to more than-threshold value. -It is decided by the jump section, and the interval from i to 8 to 11 is changed to-, so it is determined as the reduction section.

다음은 문자의 특징 추출에 이은 문자의 특징을 합성하는 단계로 이에 대하여 설명한다.The following is a step of synthesizing the feature of the character following the feature extraction of the character.

(단계 5) 특징 추출부에서 특징이 추출되면 그 특징을 종합하여 과연 그 특징을 갖는 문자군이 어떤 것인가를 결정해야 한다.(Step 5) When the feature is extracted from the feature extracting unit, the feature must be synthesized to determine which character group has the feature.

즉, "A"라는 글자는 왼쪽, 오른쪽 특징이 모두 감소(-)특성을 갖기 때문에 특징은 (왼쪽)→감소, (오른쪽)→감소라는 특징으로 압축된다.That is, since the letter "A" has a reduction (-) characteristic of both the left and the right features, the feature is compressed into a feature of (left) → reduction, (right) → reduction.

마찬가지로 "B"라는 문자는 (왼쪽)→직선, (오른쪽)→감소, 증가, 감소, 증가라는 특징으로 정의된다.Similarly, the letter "B" is defined as (left) → straight, (right) → decrement, increase, decrease, increase.

(단계 6) 따라서, 앞에서 추출된 특징을 바탕으로 제 8 도와 같이 분류나무(Tree)를 구성하여 입력된 문자를 분류하게 된다.(Step 6) Accordingly, based on the features extracted earlier, a classification tree is constructed as shown in FIG. 8 to classify the input characters.

상기한 바와같은 본 발명에 따르면, 문자인식의 정확도를 높이기 위해서 문자를 선분류하는 문자분류 방법으로 종래의 인식방법이 통계적 혹은 구조적인 방법중 택일하여 인식하는 방식을 탈피하여 통계적인 인식방법과 구조적 인식방법을 혼합하여 구성한 것이 특징이다.According to the present invention as described above, in order to increase the accuracy of character recognition, the character classification method that pre-classifies the character, the conventional recognition method is a statistical recognition method and a structural recognition method by avoiding the alternative of the statistical or structural method. It is characterized by a mixture of recognition methods.

즉, 문자의 좌우에서 바라본 문자의 모양을 바탕으로 구성하였기 때문에 문자체(Font)의 바뀜에 의한 오분류의 영향이 적으며 고속으로 분류할 수 있는 잇점이 있으며 분류에 사용된 방법과 결과가 인식시에도 사용될 수 있는 잇점이 있다.That is, since it is composed based on the shape of the characters seen from the left and right of the characters, there is less influence of misclassification due to the change of the font, and it can be classified at high speed. There is also an advantage that can be used.

Claims

A step for creating a frame circumscribed to the input characters, a step for obtaining a distance from the left and right sides of the frame to the first encounter with the letter, and a change amount between the distance between the neighboring left and the distance on the right Determining a section as a straight section, an increasing and decreasing section, or a + jump and a -jump section according to the obtained change amount, extracting a feature of a character according to the determined section, and extracting a feature of the extracted character. Candidate character recognition method using left and right distance shape during character recognition, comprising the step of classifying candidate characters by constructing a classification tree based on the classification.