KR19980025571A

KR19980025571A - Hangul Recognition System Using Regular Noise

Info

Publication number: KR19980025571A
Application number: KR1019960043740A
Authority: KR
Inventors: 방승양; 권오준; 이진수
Original assignee: 이준; 한국전기통신공사
Priority date: 1996-10-02
Filing date: 1996-10-02
Publication date: 1998-07-15
Also published as: KR100199296B1

Abstract

1. 청구 범위에 기재된 발명이 속한 기술분야1. TECHNICAL FIELD OF THE INVENTION

규칙적인 잡음을 이용한 한글 인식 시스템Hangul Recognition System Using Regular Noise

2. 발명이 해결하려고 하는 기술적 과제2. The technical problem to be solved by the invention

규칙적인 잡음을 이용하여 오인식 문제를 해결함으로써 높은 인식률을 갖고, 더나아가서는 인식대상의 문자 수와 폰트 수에 대하여 뛰어난 확장성을 가지는 한글 인식 시스템을 제공하고자 함.By solving the problem of misrecognition using regular noise, we want to provide a Korean recognition system that has high recognition rate and further expands the number of characters and fonts.

3. 발명의 해결방법의 요지3. Summary of Solution to Invention

한글을 6가지 유형으로 분류하는 제 1 단계; 모음영역에 모음 이외의 자음영역을 포함시켜 자소영역을 결정한 후에 결정된 자소영역에 따라 모음영역과 자음영역을 학습하여 자소를 인식하고, 5번 유형의 문자일 경우에 수평성분의 세로위치를 찾아내어 자소영역을 분리한 후에 신경망 학습을 통하여 자소를 인식하는 제 2 단계; 및 인식 결과에 따라 상기 제 1 단계와 제 2 단계를 재인식하는 제 3 단계를 포함한다.A first step of classifying Korean into six types; After the consonant area other than the vowel is included in the vowel area to determine the phoneme area, the vowel area and the consonant area are learned according to the determined phoneme area to recognize the phoneme, and in case of type 5 characters, find the vertical position of the horizontal component. Separating the phoneme region and recognizing the phoneme through neural network learning; And a third step of recognizing the first step and the second step according to the recognition result.

4. 발명의 중요한 용도4. Important uses of the invention

인쇄체 한글을 인식하는 한글 인식 시스템에 이용됨.It is used for Hangul Recognition System to recognize printed Hangul.

Description

Hangul Recognition System Using Regular Noise

본 발명은 규칙적인 잡음을 이용하여 한글의 자소 인식률을 크게 향상시킨 한글 인식 시스템에 관한 것으로, 특히, 기존의 자동문서인식기(OCR)의 가장 큰 문제점인 문자의 낮은 인식률 문제를 크게 개선시킴으로써 인쇄체 문자 인식의 문서들에 대한 자동 디지틀화를 통하여 문서의 보관 및 지능 정보 검색에 이용되고, 고속의 통신 등에 매우 광범위하게 응용되며, 폰트 수와 문자수의 증가에 무관하게 높은 인식률을 유지할 수 있는 한글 인식 시스템에 관한 것이다.The present invention relates to a Hangul recognition system that greatly improves the phoneme recognition rate of Hangul using regular noise, and in particular, by greatly improving the low recognition rate problem of characters, which is the biggest problem of the conventional OCR. It is used for document archiving and intelligent information retrieval through automatic digitization of recognition documents, and it is widely applied to high speed communication, etc., and it can maintain high recognition rate regardless of increase of font number and number of characters. It's about the system.

한글은 초성, 중성, 종성이 결합되어 하나의 문자를 이루는 특성을 가졌기 때문에 문자수가 많고 그 인식이 어렵다. 따라서, 최근에는 자소별로 인식하려는 시도가 많이 있었고 그 중에서도 특히 92년 정보과학회 논문지(B)에 수록된 권재욱, 조성배, 김진형 교수팀의 계층적 신경망을 이용한 다중 크기의 다중활자체 한글 문서인식에서는 한글을 자소의 위치에 따라 6가지 유형으로 나누어 신경망을 이용하여 인식하는 방법을 사용하여 비교적 좋은 결과를 보고하고 있다.Hangul has a number of characters and is difficult to recognize because it has the characteristics of forming a single character by combining initial, neutral, and Jongsung. Therefore, in recent years, there have been many attempts to recognize each element by phoneme. Among them, Hangeul is a phoneme in multi-letter multi-letter Korean document recognition using the hierarchical neural network of Prof. Jae-Wook Kwon, Sung-Bae Kim, and Jin-Hyung Kim in the Journal of the Korean Institute of Information Scientists and Engineers (B). According to the location of the, it is divided into six types and using the neural network recognition method, and reports relatively good results.

하지만, 상기 종래 방법은 각 자소를 인식하기 위해 고정된 자소 영역을 자소 인식기의 입력으로 사용하기 때문에 해당 자소 이외의 부분까지도 입력으로 사용되어 신경망의 학습이 어렵게 되고, 또한 인식률도 떨어지게 되며, 이러한 현상이 인식 대상 문자의 폰트와 크기가 다양해 질수록 더 두드러지게 나타나는 문제점이 있었다.However, since the conventional method uses a fixed phoneme region as an input of a phoneme recognizer to recognize each phoneme, even parts other than the phoneme are used as inputs, which makes it difficult to learn neural networks, and also decreases the recognition rate. As the font and size of the character to be recognized vary, there is a problem that appears more prominently.

또한, 상기 종래 방법은 한글의 유형중 5번 유형의 경우에 수평 모음의 위치가 매우 다양하여 고정된 자소영역으로는 그 학습이 어려운 문제점이 있었다.In addition, the conventional method has a problem in that it is difficult to learn the fixed vowel region because the position of the horizontal vowel is very diverse in the case of type 5 of the Korean type.

마지막으로 상기 종래 방법은 유형분류와 자소인식이라는 2 단계에 걸쳐 인식을 시도하므로 어느 한 단계라도 오류가 발생하면 전체적으로 오류가 발생하여 인식률이 크게 저하되는 문제점이 있었다.Finally, the conventional method attempts to recognize in two stages of type classification and self-perception, so that if an error occurs in any one of the steps, an error occurs as a whole and the recognition rate is greatly reduced.

상기 제반 문제점을 해결하기 위하여 안출된 본 발명은, 주어진 유형에서 해당 자소들을 포함하는 최소한의 영역을 자소영역으로 결정하는 종래 방법과는 달리, 결합되는 자소에 따라 잡음들이 규칙적으로 나타나는 점을 이용하기 위하여 모음영역에 모음 이외의 자음영역을 포함시켜 자소영역을 결정함으로써 신경망 학습시에 결합되는 다른 자소의 유형까지도 학습하게 하고, 5번 유형의 문자일 경우에 수평성분의 세로위치를 찾아내어 그 위치를 기준으로 자소영역을 분리하는 알고리즘을 사용하여 수평모음의 위치를 고정시키며, 유형분류와 자소인식이라는 두 단계를 인식 결과에 따라 재인식하도록 하여 두 단계의 인식 모듈이 상호 보완적인 역할을 하게 함으로써 전체적인 오류를 감소시킨 한글 인식 시스템을 제공하는데 그 목적이 있다.The present invention devised to solve the above problems, unlike the conventional method of determining the minimum region including the corresponding phonemes in a given type as the phoneme region, takes advantage of the fact that the noise appears regularly according to the phoneme to be combined In order to include the consonant areas other than the vowel in the vowel area to determine the phoneme area, it also learns other phoneme types that are combined in neural network learning, and in case of type 5 characters, finds the vertical position of the horizontal component By using the algorithm that separates the phoneme domains based on the position, the position of the horizontal collection is fixed, and the two stages of recognition module play a complementary role by recognizing the two stages of type classification and self-recognition according to the recognition result. The purpose is to provide a Hangul recognition system with reduced errors.

즉, 본 발명은 종래의 신경망을 이용한 자소별 인식방법에서 해결하지 못한 여러 가지 오인식 문제를 규칙적인 잡음을 이용하여 해결함으로써 높은 인식률을 갖고, 더나아가서는 인식대상의 문자 수와 폰트 수에 대하여 뛰어난 확장성을 가지는 한글 인식 시스템을 제공하는데 그 목적이 있다.That is, the present invention has a high recognition rate by solving various misrecognition problems which are not solved in the conventional recognition method using a neural network using regular noise, and further has an excellent recognition rate for the number of characters and fonts to be recognized. Its purpose is to provide a Hangul recognition system with extensibility.

도 1 은 본 발명에 따른 한글 인식 시스템의 전체 처리도,1 is an overall processing diagram of a Hangul recognition system according to the present invention,

도 2 는 본 발명에 따른 유형 분류기의 상세 처리도,2 is a detailed processing diagram of the type classifier according to the present invention;

도 3a 내지 3f 는 본 발명에 따른 자소 인식기의 상세 처리도,3A to 3F are detailed processing diagrams of the phoneme recognizer according to the present invention;

도 4a 및 4b 는 모음영역에 대한 설명도,4A and 4B are explanatory diagrams of a vowel region;

상기 목적을 달성하기 위하여 본 발명은, 한글을 일정한 자소영역을 갖는 6가지 유형으로 분류하여 각 자소영역별로 자소인식기를 이용하여 한글을 인식하는 한글 인식 시스템에 있어서, 결합되는 한글의 자소에 따라 잡음들이 규칙적으로 나타나는 점을 이용하기 위하여 자소 결합 유형에 따라 한글을 6가지 유형으로 분류하는 제 1 단계; 모음영역에 모음 이외의 자음영역을 포함시켜 자소영역을 결정한 후에 자소인식을 위한 신경망 학습시에 결정된 자소영역에 따라 모음영역과 자음영역을 학습하여 자소를 인식하고, 5번 유형의 문자일 경우에 수평성분의 세로위치를 찾아내어 그 위치를 기준으로 자소영역을 분리하는 알고리즘을 사용하여 수평모음의 위치를 고정시킨 후에 신경망 학습을 통하여 자소를 인식하는 제 2 단계; 및 유형을 분류하는 상기 제 1 단계와 자소를 인식하는 상기 제 2 단계의 인식 모듈이 상호 보완적인 역할을 수행하여 전체적인 오류를 감소시키도록 하기 위하여 인식 결과에 따라 상기 제 1 단계와 제 2 단계를 재인식하는 제 3 단계를 포함한다.In order to achieve the above object, the present invention, in the Hangul recognition system that classifies Hangul into six types having a certain phoneme region and recognizes Hangul using a phoneme recognizer for each phoneme region, A first step of classifying Hangul into six types according to the phoneme coupling type in order to take advantage of the fact that they appear regularly; After the consonant area other than the vowel is included in the vowel area to determine the phoneme area, the vowel area and the consonant area are learned according to the phoneme area determined at the time of neural network learning. A second step of recognizing the phoneme through neural network learning after fixing the position of the horizontal vowel using an algorithm for finding a vertical position of the horizontal component and separating the phoneme region based on the position; And the first and second steps according to the recognition result in order that the recognition module of the first step of classifying a type and the second step of recognizing a phoneme play a complementary role to reduce the overall error. And a third step of recognizing.

이하, 첨부된 도면을 참조하여 본 발명에 따른 일실시예를 상세히 설명한다.Hereinafter, with reference to the accompanying drawings will be described an embodiment according to the present invention;

도 1 은 본 발명에 따른 한글 인식 시스템의 전체 처리도이다.1 is an overall processing diagram of a Hangul recognition system according to the present invention.

먼저, 전처리 부분을 살펴보면 도 1 에서 문서스캐닝부터 크기정규화까지가 이에 해당된다. 문자열로 이루어진 문서를 300dpi 스캐너로 스캔하여 이미지 형식(.tiff)으로 저장한다(1). 이 문자열 이미지를 80x80 크기의 한 문자 이미지들로 세그먼트화한다(2). 이후, 80x80 크기의 한 문자 이미지를 40x40 크기로 크기 정규화한다(3).First, the preprocessing part corresponds to document scanning to size normalization in FIG. 1. Scan a text document with a 300 dpi scanner and save it in image format (.tiff) (1). This string image is segmented into 80x80 single character images (2). Subsequently, the size normalization of a single character image of 80x80 size to 40x40 size is performed (3).

인식과정은 크게 유형분류와 자소인식의 두 단계로 나누어진다. 먼저, 유형분류기를 이용하여 정규화된 문자 이미지를 한글의 자소 영역에 따라 6가지 유형으로 분류한다(4). 후술되는 도 4a 및 4b 는 한글의 6가지 유형을 타나내고 있으며, 후술되는 도 2 는 유형분류기의 상세 구조를 나타내고 있다.The recognition process is largely divided into two stages: classification and self-awareness. First, the normalized text image is classified into six types according to the phoneme region of the Hangul using the type classifier (4). 4A and 4B described below represent six types of Korean characters, and FIG. 2 described below shows a detailed structure of a type classifier.

유형 분류기의 출력 노드 번호는 그 출력치가 큰 순서대로 소팅(sorting)되어 저장되고, 가장 큰 출력치를 갖는 출력 노드 번호만이 선택되어 다음 단계인 해당 유형의 자소인식기를 인에이블(enable)시키기 위한 인에이블 신호가 출력된다(5).The output node numbers of the type classifier are sorted and stored in ascending order of their outputs, and only the output node numbers with the largest outputs are selected to enable the next class of self-identifier. The enable signal is output (5).

각 유형의 자소인식기들은 결합되는 자소에 따라 잡음들이 규칙적으로 나타나는 점을 이용하기 위하여 모음영역에 모음 이외의 자음영역을 포함시켜 자소영역을 결정함으로써 신경망 학습시에 결합되는 다른 자소의 유형까지도 학습하고, 특히 5번 유형의 문자일 경우에 수평성분의 세로위치를 찾아내어 그 위치를 기준으로 자소영역을 분리하는 알고리즘을 사용하여 수평모음의 위치를 고정시킨 후에 학습한다(6 내지 11). 각 유형의 자소인식기는 후술되는 도 3a 내지 3f 에 상세하게 나타나 있으며, 각 유형의 자소인식기는 2개 이상의 자소인식기로 구성되어 각 자소를 인식한다.Each type of phoneme recognizer learns other phoneme types that are combined in neural network learning by determining the phoneme area by including consonant areas other than vowels in the vowel area in order to take advantage of the fact that noises appear regularly according to the phoneme to be combined. For example, in the case of the letter of type 5, the vertical position of the horizontal component is found and then the position of the horizontal collection is fixed by using an algorithm that separates the phoneme region based on the position (6 to 11). Each type of phoneme recognizer is shown in detail in FIGS. 3a to 3f described below, and each type of phoneme recognizer is composed of two or more phoneme recognizers to recognize each phoneme.

이후, 각 유형의 자소인식기중 선택되어 인에이블된 유형의 자소인식기들로부터 출력되는 출력값의 평균을 취한 후에(12) 평균 출력치가 0.5보다 큰지를 판단한다(13).Thereafter, after taking the average of the output values selected from each type of the self-identifying device selected from each type of self-identifying device (12), it is determined whether the average output value is greater than 0.5 (13).

판단 결과, 평균 출력치가 0.5보다 작을 경우에는 후보 유형을 고려하기 위해 상기 출력값을 소팅하여 최대값 출력 노드 번호를 인에이블시키는 과정(5)으로 천이하여 다음 최대값 출력 노드 번호를 최대값 출력 노드 번호로 선택하여, 즉 후보 유형을 다시 선택된 유형으로 놓고 그 유형의 자소인식기를 동작시킨다. 이때, 유형분류기의 후보 유형 자체가 매우 낮은 출력치를 가질 때에는 맨 처음의 결과를 그대로 이용하기 위해 다음 최대값 출력 노드의 출력값이 0.5보다 클 경우에만 최대값 출력 노드를 다음 최대값 출력 노드로 대치한다. 이러한 동작은 유형분류기의 오류를 해결하는 방법이 되는데, 그 원리를 상세히 설펴보면 다음과 같다.As a result of the determination, when the average output value is less than 0.5, the process transitions to the process of enabling the maximum value output node number by sorting the output value in order to consider the candidate type, and converting the next maximum value output node number to the maximum value output node number. Select to, i.e., put the candidate type back into the selected type and operate the self-identifying device of that type. At this time, when the candidate type of the classifier itself has a very low output value, the maximum value output node is replaced with the next maximum value output node only when the output value of the next maximum value output node is larger than 0.5 so as to use the first result as it is. . This operation is a way to solve the error of type classifier. The principle is explained in detail as follows.

일반적인 2단계 인식 방법에서는 전 단계에 오류가 발생하면 다음 단계와 관계없이 최종 결과적로서 오류가 발생된다. 이러한 오류를 방지하기 위하여 다음 단계의 결과에 대한 확신도에 따라 역추적(backtracking)을 허용한다. 본 발명에 따른 한글 인식 시스템에서는 유형분류 신경망의 출력중 가장 높은 출력치를 가지는 유형을 입력문자의 유형이라고 보고, 이를 해당되는 자소인식 신경망들에 입력하여 인식하도록 한다. 그런데, 만일 전단계에서 오류가 발생하여 잘못된 유형으로 인식하고 있다면 현재의 자소인식 신경망이 입력으로 사용하는 영역들은 해당 자소의 영역이 아닌 여러 자소가 섞인 임의의 자소영역이 될 것이다. 그러므로, 각 자소인식기들은 자신이 학습한 적이 없는 전혀 다른 이미지를 인식하게 되므로 그 출력값들은 어떠한 노드도 높은 출력치를 가지지 못한다. 따라서, 이와 같이 가장 높은 값을 가지는 노드의 출력값이 제1 기준치(threshold 1)을 넘지 못하면 현재의 유형이 잘못된 유형이라고 보고 후보유형을 고려하도록 한다. 후보유형이란 이전의 유형분류 단계에서 현재의 유형 다음으로 높은 출력값을 갖는 유형을 말한다. 이러한 과정을 자소인식 신경망들의 출력값이 어느 정도 높은 출력치를 가질 때까지 반복한다. 만일 6가지 유형 모두에 대해 인식했는도 어느 하나 확신할 정도로 높은 출력치를 가진 경우가 발생하지 않으면 그 중에서 가장 높은 출력치를 가졌던 유형을 답이라고 생각한다.In the general two-stage recognition method, if an error occurs in the previous step, an error occurs as a final result regardless of the next step. To prevent this error, backtracking is allowed according to the confidence of the result of the next step. In the Hangul recognition system according to the present invention, the type having the highest output value among the outputs of the type classification neural network is regarded as the type of the input character, and is inputted to the corresponding self-aware neural networks. However, if an error occurs in the previous step and recognizes it as an incorrect type, the areas used by the current self-aware neural network as inputs will be arbitrary phoneme areas mixed with several phonemes instead of the corresponding phoneme areas. Therefore, each phoneme recognizer recognizes a completely different image that it has never learned, so its outputs do not have any output at all. Therefore, if the output value of the node having the highest value does not exceed the first threshold (1), the current type is considered to be an incorrect type, and the candidate type is considered. Candidate type is the type with the highest output value after the current type in the previous classification stage. This process is repeated until the output values of the self-aware neural networks have a somewhat higher output value. If you are aware of all six types and you do not have a high output that you are sure of, then the answer is the one with the highest output.

지금까지는 자소인식기의 결과만을 믿고 후보유형을 고려한 경우에 대한 설명이었는데 여기에 유형분류기의 결과까지도 같이 고려할 수 있다. 후보유형을 취할 때 그 후보유형에 해당하는 확신도가 어느 정도보다 낮을 때에는 아무리 자소인식 신경망이 후보유형을 고려하려고 해도 유형분류기의 판단을 고려하여 그냥 앞의 유형을 답이라고 생각할 수 있다. 유형분류기의 확신도가 낮다는 것은 그 후보유형에 해당하는 노드의 출력값이 제2 기준치(threshold 2)보다 낮은 것을 의미한다. 이때, 유형분류기의 확신도와 자소인식기의 확신도 둘중에 어디에 더 큰 비중을 두느냐에 따라 결과가 달라질 수 있는데, 실험적으로 자소인식 신경망의 비중을 더 큰 것으로 두는 것이 좋은 결과를 나타낸다. 이것은 제1 기준치(threshold 1)의 값을 제2 기준치(threshold 2)의 값보다 크게 잡음으로써 이러한 효과를 가질 수 있다.So far, this has been a description of the case where the candidate type is considered and only the result of the classifier is considered, and the result of the type classifier can be considered here. When the candidate type is taken and the confidence level corresponding to the candidate type is lower than a certain level, no matter how the self-aware neural network attempts to consider the candidate type, the previous type may be considered as the answer considering the judgment of the classifier. The low certainty of the type classifier means that the output value of the node corresponding to the candidate type is lower than the second threshold (threshold 2). At this time, the confidence level of the classifier and the confidence level of the self-identifier can vary depending on where the greater weight is placed. Experimentally, the greater the weight of the self-aware neural network, the better the result. This may have this effect by making the value of the first threshold 1 greater than the value of the second threshold 2.

판단 결과, 평균 출력치가 0.5이상인 경우에 선택된 각 자소인식기로부터 출력되는 각 신경망의 선택된 출력 노드 번호를 다시 해당되는 자소코드로 맵핑(mapping)한 후에(14) 자소코드에 따라 자소를 조합하여(15) 조합형 한글 코드로 출력함으로써 인식과정을 마치게 된다.As a result of the determination, when the average output value is 0.5 or more, after mapping the selected output node number of each neural network output from each selected phoneme recognizer to the corresponding phoneme code (14), the phonemes are combined according to the phoneme code (15). ) The recognition process is completed by outputting the combined Hangul code.

도 2 는 본 발명에 따른 유형 분류기의 상세 처리도이다.2 is a detailed processing diagram of the type classifier according to the present invention.

유형 분류기에서 사용되는 모든 입력피쳐는 전체 입력이미지를 대상으로 망사형 벡터(mesh vector)를 사용한다. 망사형 벡터(mesh vector)란 N x M 크기의 망사형(mesh) 윈도우를 입력이미지의 좌측 상단에서부터 이동하면서 그 윈도우내에 포함된 검은 픽셀(pixel) 수를 피쳐로 한다. 따라서, 전체 입력이미지의 크기가 X, Y라면 망사형 벡터(mesh vector)의 차원은 (X/N) x (Y/M)가 된다. 이때, 본 발명에서 망사형 윈도우(mesh window)의 크기는 5x5이고, 망사형 벡터(mesh vector)의 차원은 64차원이 된다.All input features used in the type classifier use mesh vectors for the entire input image. The mesh vector is characterized by the number of black pixels contained in the window moving the mesh window of size N × M from the upper left of the input image. Therefore, if the size of the entire input image is X and Y, the dimension of the mesh vector becomes (X / N) x (Y / M). At this time, the size of the mesh window (mesh window) in the present invention is 5x5, the dimension of the mesh vector (mesh vector) is 64 dimensions.

실제로 64차원의 망사형 벡터(mesh vector)의 각 벡터값은 신경망의 입력으로 사용되기 전에 0과 1사이로 정규화된다.In practice, each vector of a 64-dimensional mesh vector is normalized between 0 and 1 before being used as the input of the neural network.

유형 분류 신경망은 다계층 퍼셉트론(MLP : Multi-Layer-Perceptron)을 사용하며, 그 구조는 64개의 입력노드수와 35개의 은닉노드수와 6개의 출력노드수를 가진다.Type classification neural network uses Multi-Layer-Perceptron (MLP), and its structure has 64 input nodes, 35 hidden nodes, and 6 output nodes.

도 3a 내지 3f 는 본 발명에 따른 자소 인식기의 상세 처리도이다.3A to 3F are detailed processing diagrams of the phoneme recognizer according to the present invention.

각 유형의 자소인식기에서 사용되는 모든 입력피쳐는 전체 입력이미지를 대상으로 망사형 벡터(mesh vector)를 사용하며, 각 유형의 자소인식기에 사용되는 각 신경망에 따라 망사형 윈도우 크기(mesh window size)가 다르다. 이때, 모든 자소인식기는 신경망으로 다계층 퍼셉트론(MLP)를 사용한다.All input features used in each type of recognizer use a mesh vector for the entire input image, and mesh window size according to each neural network used in each type of recognizer. Is different. At this time, all the self-identifiers use multi-layered perceptron (MLP) as a neural network.

여기에서 중요한 것은 자소인식기 신경망의 입력 영역이다. 전체 입력영역의 가로, 세로가 각각 0에서 49까지라고 했을 때 각 유형별 자소영역이 각 유형의 도면에 좌표로 표시되어 있다. 이 영역은 종래의 방법과는 달리 모음영역의 경우에 자음영역을 포함한 크기이다. 이것이 이러한 구조의 인식모델의 단점을 해결할수 있는 방법으로 작용한다.What is important here is the input region of the autonomic recognizer neural network. When the width and length of the entire input area are 0 to 49, respectively, the phoneme areas of each type are indicated by coordinates in the drawings of each type. Unlike the conventional method, this area is the size including the consonant area in the case of the vowel area. This works as a way to solve the shortcomings of this recognition model.

각 유형에 따른 자소인식기의 구조를 상세히 살펴보면 다음과 같다.Looking at the structure of the phoneme recognizer according to each type in detail as follows.

먼저, 제1 유형의 자소인식기는 입력영역의 좌표가 (X1,Y1,X2,Y2)로 표시될때 초성의 좌표는 (0, 2, 26, 34)이고, 중성의 좌표는 (4, 0, 39, 38)이다. 따라서, 제1 유형의 자소인식기는 초성과 중성을 각각 인식하기 위하여 두개의 자소인식기로 구성되며, 초성을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 99차원의 망사형 벡터를 생성하여 99개의 입력노드수와 20개의 은닉노드수와 19개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하고, 중성을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 156차원의 망사형 벡터를 생성하여 156개의 입력노드수와 32개의 은닉노드수와 9개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력한다.First, the first type of self-identification recognizer has an initial coordinate of (0, 2, 26, 34) when the coordinate of the input area is represented as (X1, Y1, X2, Y2), and the neutral coordinate is (4, 0, 39, 38). Therefore, the first type of phoneme recognizer is composed of two phoneme recognizers to recognize the first and the neutral, respectively, and the phoneme recognizer to recognize the initiality is a 3x3 mesh window and a 99-dimensional mesh vector. And input it into a multi-layer perceptron (MLP) having 99 input nodes, 20 hidden nodes, and 19 output nodes, and an autonomous recognizer for recognizing neutrality is a 3 × 3 mesh window. A 156-dimensional mesh vector is generated and input into a multi-layer perceptron (MLP) having 156 input nodes, 32 hidden nodes, and 9 output nodes.

제2 유형의 자소인식기는 입력영역의 좌표가 (X1,Y1,X2,Y2)로 표시될때 초성의 좌표는 (2, 1, 39, 29)이고, 중성의 좌표는 (2, 2, 37, 39)이다. 따라서, 제2 유형의 자소인식기는 초성과 중성을 각각 인식하기 위하여 두개의 자소인식기로 구성되며, 초성을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 130차원의 망사형 벡터를 생성하여 130개의 입력노드수와 25개의 은닉노드수와 19개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하고, 중성을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 156차원의 망사형 벡터를 생성하여 156개의 입력노드수와 15개의 은닉노드수와 5개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력한다.In the second type of self-identification recognizer, when the coordinate of the input area is expressed as (X1, Y1, X2, Y2), the coordinate of the initial is (2, 1, 39, 29), and the neutral is (2, 2, 37, 39). Accordingly, the second type of phoneme recognizer is composed of two phoneme recognizers to recognize the first and the neutral, respectively, and the phoneme recognizer to recognize the initiality is a mesh window of 3x3 size and has a 130-dimensional mesh vector. To generate a multi-layer perceptron (MLP) having 130 input nodes, 25 hidden nodes, and 19 output nodes, and a self-identifying device for recognizing neutrality is a 3x3 mesh window. A 156-dimensional mesh vector is generated and input into a multi-layer perceptron (MLP) having 156 input nodes, 15 hidden nodes, and 5 output nodes.

제3 유형의 자소인식기는 입력영역의 좌표가 (X1,Y1,X2,Y2)로 표시될때 초성의 좌표는 (1, 3, 29, 32)이고, 수평모음의 좌표는 (0, 3, 28, 37)이며, 수직모음의 좌표는 (2, 8, 39, 37)이다. 따라서, 제3 유형의 자소인식기는 초성과 수평모음 및 수직모음을 각각 인식하기 위하여 세개의 자소인식기로 구성되며, 초성을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 100차원의 망사형 벡터를 생성하여 100개의 입력노드수와 45개의 은닉노드수와 19개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하고, 수평모음을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 120차원의 망사형 벡터를 생성하여 120개의 입력노드수와 10개의 은닉노드수와 3개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하며, 수직모음을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 130차원의 망사형 벡터를 생성하여 130개의 입력노드수와 15개의 은닉노드수와 5개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력한다.In the third type of self-identification recognizer, when the coordinates of the input area are represented as (X1, Y1, X2, Y2), the initial coordinates are (1, 3, 29, 32), and the horizontal collection coordinates are (0, 3, 28). , 37), and the coordinates of the vertical collection are (2, 8, 39, 37). Accordingly, the third type of phoneme recognizer is composed of three phoneme recognizers to recognize the first consonants, the horizontal vowels and the vertical vowels, respectively, and the phoneme recognizer for recognizing the consonants is a three-dimensional mesh window of 3x3 size. Generates a mesh-like vector and inputs it into a multi-layer perceptron (MLP) having 100 input nodes, 45 hidden nodes, and 19 output nodes, and a self-identifying device for recognizing horizontal vowels has a 3x3 mesh type. Create a 120-dimensional mesh vector as a mesh and input it to a multi-layer perceptron (MLP) with 120 input nodes, 10 hidden nodes, and 3 output nodes, and recognizes vertical vowels. Phoneme recognizer is a 3x3 mesh window that creates a 130-dimensional mesh vector and has a multi-layer perceptor with 130 input nodes, 15 hidden nodes, and 5 output nodes. Enter in the loan (MLP).

제4 유형의 자소인식기는 입력영역의 좌표가 (X1,Y1,X2,Y2)로 표시될때 초성의 좌표는 (0, 1, 26, 26)이고, 중성의 좌표는 (1, 1, 39, 26)이며, 종성의 좌표는 (3, 25, 38, 39)이다. 따라서, 제4 유형의 자소인식기는 초성과 중성 및 종성을 각각 인식하기 위하여 세개의 자소인식기로 구성되며, 초성을 인식하기 위한 자소인식기는 2x2 크기의 망사형 윈도우(mesh window)로 182차원의 망사형 벡터를 생성하여 182개의 입력노드수와 38개의 은닉노드수와 19개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하고, 중성을 인식하기 위한 자소인식기는 3x2 크기의 망사형 윈도우(mesh window)로 169차원의 망사형 벡터를 생성하여 169개의 입력노드수와 35개의 은닉노드수와 9개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하며, 종성을 인식하기 위한 자소인식기는 2x2 크기의 망사형 윈도우(mesh window)로 144차원의 망사형 벡터를 생성하여 144개의 입력노드수와 39개의 은닉노드수와 26개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력한다.In the fourth type of phoneme recognizer, when the coordinate of the input area is expressed as (X1, Y1, X2, Y2), the initial coordinate is (0, 1, 26, 26), and the neutral coordinate is (1, 1, 39, 26), and the final coordinates are (3, 25, 38, 39). Therefore, the fourth type of self-identification device is composed of three self-identification devices for recognizing the initial, neutral, and final species, respectively, and the self-identifying device for recognizing the first generation is a mesh window of 2x2 size and has a mesh of 182 dimensions. Generates a type vector and inputs it to a multi-layer perceptron (MLP) having 182 input nodes, 38 hidden nodes, and 19 output nodes, and a self-identifying device for recognizing neutrality is a 3 × 2 mesh window (mesh). window) creates a 169-dimensional mesh vector and inputs it to a multi-layer perceptron (MLP) with 169 input nodes, 35 hidden nodes, and 9 output nodes. A mesh size of 144 is created by a mesh window of size and input to a multi-layer perceptron (MLP) having 144 input nodes, 39 hidden nodes, and 26 output nodes. .

5번 유형의 문자들은 같은 모음이라도 위아래에 결합하는 자소의 종류에 따라 수평모음의 위치가 매우 다르다. 예를 들면 룬과 국이 있다. 이렇게 세로 위치가 완전히 다른 두 모음을 일반적인 고정된 모음영역으로 분리하고 나면 그 두 영역내에 보이는 획의 위치와 분포는 같은 모음이라도 완전히 다르게 된다. 즉, 룬의 경우에는 ㅜ가 영역의 맨 아래 부분에 위치하고 중간에는 오히려 ㄹ 자음의 일부가 차지하게 되고, 국의 경우에는 ㅜ가 맨 위쪽에 위치하고 중간부분에는 받침인 ㄱ의 일부가 차지하게 된다. 이렇게 되면 신경망이 학습하기가 어려워지고 인식률 또한 저하된다. 이러한 현상을 막기 위하여 모음영역의 중심에는 항상 수평모음의 가로획이 위치하도록 고정하는 알고리즘을 사용함으로써 신경망의 학습도 쉽고 인식률도 좋게 만들 수 있다.Characters of type 5 have very different horizontal collections, depending on the type of phonemes that are combined above and below the same vowel. For example, there are runes and soups. After separating two vowels with completely different vertical positions into a general fixed vowel area, the position and distribution of the strokes in the two areas are completely different even with the same vowel. That is, in the case of runes, TT is located at the bottom of the area, and in the middle, a part of the r consonants is occupied. In the case of bureau, TT is located at the top and a part of the base a is occupied. This makes the neural network harder to learn and reduces its recognition rate. In order to prevent this phenomenon, it is possible to make the neural network easier to learn and to recognize by using an algorithm that fixes the horizontal bar of horizontal vowels at the center of the vowel area.

5번 유형의 문자이미지에서 수평모음의 세로 위치를 찾아내는 알고리즘은 다음과 같다. 문자이미지는 가로와 세로가 각각 40 픽셀(pixel)들로 이루어져 있다. 먼저 문자이미지의 맨 왼쪽 열부터 검사하여 처음으로 검은 픽셀이 존재하는 열을 찾아낸다. 이때, 5번 유형의 문자들은 잡음만 없으면 수평모음의 맨 왼쪽 점이 곧 그 문자의 맨 왼쪽에 위치한 점이 된다. 찾아낸 열에서의 검은 픽셀이 과연 수평모음의 일부인 점인지 또는 잡음인지를 확인하기 위하여 검은 픽셀이 위치하는 세로 위치에서 가로 행의 픽셀들을 검사한다. 만일 수평모음의 일부라면 가로 행의 처음과 끝 부분, 그리고 중간 부분에 연속된 검은 픽셀들이 존재(잡음이 없다면 모든 행의 점들이 검은 픽셀일 것이나 잡음이 있을 수 있으므로 이와 같은 알고리즘을 작용시킴)의 조건을 만족할 것이다.The algorithm to find the vertical position of the horizontal collection in the text image of type 5 is as follows. The text image consists of 40 pixels of width and height respectively. First, the leftmost column of the text image is examined to find the first column of black pixels. In this case, if the noise of type 5 is no noise, the leftmost point of the horizontal collection is the point located to the leftmost of the character. The pixels in the horizontal row are examined at the vertical position where the black pixels are located to determine if the black pixels in the found columns are points or noise that are part of the horizontal collection. If it is part of the horizontal vowel, there are consecutive black pixels at the beginning, end, and middle of the horizontal row (if there is no noise, all the rows will be black pixels, but this may work as an algorithm). Will satisfy the conditions.

이렇게 찾아낸 세로 위치를 기준으로 초성 하단좌표 = y + Δy1, 종성 상단좌표 = y + Δy2, 중성 상단좌표 = y + Δy1', 중성 하단좌표 = y + Δy2'와 같이 모음과 두 자음영역을 분리한다. 이때, Δyl, Δy2, Δy1', Δy2'의 값들은 -1, 4, -12, 15이다. 이러한 방법으로 영역을 분리하게 되면 중성영역의 크기는 항상 일정하나 초성과 종성영역의 크기는 찾아진 세로 위치에 따라 달라지므로 크기를 정규화한다. 위와 같이 수행함으로써 5번 유형의 모음위치가 글자나 폰트에 따라 다양하여 인식을 어렵게 하는 것을 해결할 수 있다.The vowel and the two consonant regions are separated as follows: initial lower coordinate = y + Δy1, final upper coordinate = y + Δy2, neutral upper coordinate = y + Δy1 ', neutral lower coordinate = y + Δy2' . At this time, the values of Δyl, Δy2, Δy1 ', and Δy2' are -1, 4, -12, and 15. When the regions are separated in this way, the size of the neutral region is always constant, but the size of the initial and final regions depends on the found vertical position, so the size is normalized. By doing as above, the vowel position of type 5 varies depending on letters or fonts, thus making it difficult to recognize.

제5 유형의 자소인식기는 입력영역의 좌표가 (X1,Y1,X2,Y2)로 표시될때 초성의 좌표는 (0, 0, 39, Y-1)이고, 중성의 좌표는 (1, Y-12, 38, Y+15)이며, 종성의 좌표는 (0, Y+4, 39, 39)이다. 따라서, 제5 유형의 자소인식기는 초성과 중성 및 종성을 각각 인식하기 위하여 세개의 자소인식기로 구성되며, 초성을 인식하기 위한 자소인식기는 40x20 크기로 크기를 정규화하고 3x2 크기의 망사형 윈도우(mesh window)로 140차원의 망사형 벡터를 생성하여 140개의 입력노드수와 27개의 은닉노드수와 19개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하고, 중성을 인식하기 위한 자소인식기는 38x28 크기로 크기를 정규화하고 3x2 크기의 망사형 윈도우(mesh window)로 182차원의 망사형 벡터를 생성하여 182개의 입력노드수와 25개의 은닉노드수와 5개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하며, 종성을 인식하기 위한 자소인식기는 40x20 크기로 크기를 정규화하고 3x2 크기의 망사형 윈도우(mesh window)로 140차원의 망사형 벡터를 생성하여 140개의 입력노드수와 35개의 은닉노드수와 24개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력한다.The fifth type of self-identification recognizer has an initial coordinate of (0, 0, 39, Y-1) when the coordinate of the input area is represented by (X1, Y1, X2, Y2), and the neutral coordinate is (1, Y- 12, 38, Y + 15), and the final coordinates are (0, Y + 4, 39, 39). Thus, the fifth type of self-identification machine is composed of three self-identification devices to recognize the initial, neutral, and final species, respectively, and the self-identifying device for recognizing the primitive is normalized to 40x20 size and mesh-shaped windows of 3x2 size. window) to generate a 140-dimensional mesh vector and input it to a multi-layer perceptron (MLP) having 140 input nodes, 27 hidden nodes, and 19 output nodes, and a self-identifying device for recognizing neutrality is 38x28. Multi-layer perceptron (MLP) with 182 input nodes, 25 hidden nodes, and 5 output nodes by normalizing size to size and generating a 182-dimensional mesh vector with a 3x2 mesh window. ), And the phoneme recognizer to recognize the finality is normalized to 40x20 size and creates 140-dimensional mesh vector with 3x2 mesh window. It can force node and having the number of hidden nodes 35 and 24 can be input to the output node layer perceptron (MLP).

제6 유형의 자소인식기는 입력영역의 좌표가 (X1,Y1,X2,Y2)로 표시될때 초성의 좌표는 (2, 1, 27, 23)이고, 수평모음의 좌표는 (1, 13, 29, 27)이며, 수직모음의 좌표는 (13, 2, 39, 27)이고, 종성의 좌표는 (4, 24, 38, 39)이다. 따라서, 제6 유형의 자소인식기는 초성, 수평모음, 수직모음 및 종성을 각각 인식하기 위하여 네개의 자소인식기로 구성되며, 초성을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 72차원의 망사형 벡터를 생성하여 72개의 입력노드수와 20개의 은닉노드수와 19개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하고, 수평모음을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 54차원의 망사형 벡터를 생성하여 54개의 입력노드수와 17개의 은닉노드수와 3개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하며, 수직모음을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 54차원의 망사형 벡터를 생성하여 54개의 입력노드수와 17개의 은닉노드수와 5개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력하며, 종성을 인식하기 위한 자소인식기는 3x3 크기의 망사형 윈도우(mesh window)로 72차원의 망사형 벡터를 생성하여 72개의 입력노드수와 10개의 은닉노드수와 9개의 출력노드수를 가지는 다계층 퍼셉트론(MLP)에 입력한다.In the sixth type of phoneme recognizer, when the coordinates of the input area are expressed as (X1, Y1, X2, Y2), the initial coordinates are (2, 1, 27, 23), and the horizontal collection coordinates are (1, 13, 29). , 27), the vertical coordinates are (13, 2, 39, 27) and the final coordinates are (4, 24, 38, 39). Therefore, the sixth type of phoneme recognizer is composed of four phoneme recognizers for recognizing the consonants, the horizontal vowels, the vertical vowels, and the finality, respectively, and the recognizer for recognizing the consonants is a 3x3 mesh window. Creates a 72-dimensional mesh vector and inputs it into a multi-layer perceptron (MLP) having 72 input nodes, 20 hidden nodes, and 19 output nodes. A 54-dimensional mesh vector is created with a mesh window and input to a multi-layer perceptron (MLP) having 54 input nodes, 17 hidden nodes, and 3 output nodes, and recognizes a vertical collection. Multi-layer perceptor with 54 input nodes, 17 hidden nodes and 5 output nodes by generating a 54-dimensional mesh vector with a 3x3 mesh window. The self-identification device for entering the MLP, which recognizes the finality, creates a 72-dimensional mesh vector with a mesh window of 3x3 size, allowing 72 input nodes, 10 hidden nodes, and 9 Input to multi-layer perceptron (MLP) which has the number of output nodes.

도 4a 및 4b 는 모음영역에 대한 설명도로서, 한글을 일정한 자소영역을 갖는 6가지 유형으로 분류한 형태를 나타낸다.4A and 4B are explanatory diagrams of a vowel region and show a form in which Korean characters are classified into six types having a certain phoneme region.

한편, 본 발명에 따른 한글 인식 시스템은 명조체와 고딕체에 대해서는 99.65%의 인식률을 보였고, 신명조체, 중고딕체, 궁서체를 포함하더라도 평균 98.9%정도의 인식률을 유지하였다. 또한, 인식대상 문자 수에 대해서도 500문자에서 1405자까지 확장하여 인식한 결과 인식률의 차이는 0.1%미만임을 확인할 수 있었다.On the other hand, the Hangul recognition system according to the present invention showed a 99.65% recognition rate for the Myungjo and Gothic fonts, and maintained an average recognition rate of 98.9% even when including the Shinmyung, Middle and Korean fonts. In addition, as a result of recognizing the number of characters to be recognized by extending from 500 to 1405 characters, the difference in recognition rate was confirmed to be less than 0.1%.

이상에서 설명한 본 발명은, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에 있어 본 발명의 기술적 사상을 벗어나지 않는 범위내에서 여러 가지 치환, 변형 및 변경이 가능하므로 전술한 실시예 및 첨부된 도면에 한정되는 것이 아니다.The present invention described above is capable of various substitutions, modifications, and changes without departing from the spirit of the present invention for those skilled in the art to which the present invention pertains, and the above-described embodiments and accompanying It is not limited to the drawing.

상기와 같은 본 발명은 다음과 같은 효과가 있다.The present invention as described above has the following effects.

첫번째로, 자소별 인식과 해당 자소영역의 확대로 인하여 모호성을 해결하고, 확장성을 증가시킬 수 있는 효과가 있다. 실제로 모음의 경우에 종래의 방법보다 73%의 인식률 개선이 있었고, 폰트 수를 증가할 경우에 종래의 방법보다 인식률의 저하 정도가 매우 낮았으며, 문자 수를 증가할 경우에 거의 인식률에는 변화가 없었다.First, there is an effect of resolving ambiguity and increasing extensibility due to the recognition of each phoneme and the expansion of the corresponding phoneme area. In the case of vowels, the recognition rate was improved by 73% compared to the conventional method. When the number of fonts was increased, the recognition rate was much lower than the conventional method, and when the number of characters was increased, the recognition rate was almost unchanged. .

두번째로, 5번 유형 문자의 경우에 그 위치를 고정시킴으로써 학습 시간을 단축하고 인식률을 증가시킬 수 있는 효과가 있다. 종래의 연구에서 가장 많은 오인식이 발생하는 경우가 5번 유형의 모음이다. 본 발명에 따른 한글 인식 시스템에서는 이러한 알고리즘에 의해 53%의 인식률 개선을 얻을 수 있다.Secondly, in the case of type 5 characters, the position can be fixed to shorten the learning time and increase the recognition rate. The most common misrecognition occurs in the previous studies of type 5 vowels. In the Hangul recognition system according to the present invention, the recognition rate can be improved by 53%.

세번째로, 신경망의 확신도에 따른 후보유형을 고려함으로써 다단계 인식기의 가장 큰 단점인 각 단계별 오류의 곱에 비례하는 전체 오류 증가를 해결할 수 있는 효과가 있다. 종래의 방법에서는 유형분류 신경망에서 오류가 발생하면 자소인식 신경망과는 관계없이 문자를 오인식하게 된다. 이렇게 전체 오류가 앞단계에서 오류와 다음 단계에서의 오류의 곱으로 나타나는 것이 일반적인데 본 발명에 따른 한글 인식 시스템에서는 후보유형을 고려함으로써 실재로는 유형분류 신경망의 자체 오류보다도 더 적은 수의 유형 오류가 나타난다.Third, by considering the candidate type according to the neural network's confidence level, it is possible to solve the total error increase proportional to the product of each step error, which is the biggest disadvantage of the multi-level recognizer. In the conventional method, when an error occurs in the type classification neural network, the character is misrecognized regardless of the self-aware neural network. It is common for the whole error to appear as the product of the error in the previous step and the error in the next step. In the Hangul recognition system according to the present invention, considering the candidate type, the number of type errors is actually smaller than that of the type classification neural network. Appears.

Claims

In the Hangul Recognition System, which classifies Hangul into six types with a certain phoneme region, and recognizes Hangul using a phoneme recognizer for each phoneme region,

A first step of classifying Hangul into six types according to the phoneme combining type in order to take advantage of the fact that noises appear regularly according to the phoneme of the Hangul to be combined;

After the consonant area other than the vowel is included in the vowel area to determine the phoneme area, the vowel area and the consonant area are learned according to the phoneme area determined at the time of neural network learning. A second step of recognizing the phoneme through neural network learning after fixing the position of the horizontal vowel using an algorithm for finding a vertical position of the horizontal component and separating the phoneme region based on the position; And

Recognizing the first and second steps according to the recognition result in order that the recognition module of the first step of classifying the type and the second step of recognizing the phoneme play a complementary role to reduce the overall error. Hangul recognition system comprising a third step to.

The method of claim 1,

The first step is,

Segmenting the string image into single character images after scanning the document consisting of the character string with a scanner and storing the image in an image format;

Classifying the normalized text image into six types according to the phoneme region of the Hangul using a type classifier after normalizing the one text image to a predetermined size; And

Sort and store the output node numbers of the type classifier in the order of their largest output values, and select an output node number having the largest output value to enable an enable signal for enabling the self-identifying identifier of the type. Hangul recognition system comprising the step of outputting.

The method of claim 2,

The type classifier,

Generating a 64 dimensional mesh vector with a 5 × 5 mesh window;

Normalizing each vector value of the 64-dimensional mesh vector between 0 and 1 before using it as an input of the neural network; And

Multi-Layer-Perceptron (MLP), which has 64 input nodes, 35 hidden nodes, and 6 output nodes, receives and learns the normalized vector value and outputs the result. Hangul recognition system comprising a.

The method of claim 2,

The third step,

Determining whether the average output value is greater than the first predetermined reference value after taking an average of the output values selected from each type of the self-identifiers and outputting from the enabled self-identifiers;

As a result of the determination, when the average output value is smaller than the first predetermined reference value, the output value is sorted only when the output value of the next largest maximum output node is larger than the second predetermined reference value in order to consider the candidate type. Transitioning to enabling the number, selecting the next largest maximum value output node number as a new maximum value output node number to operate the corresponding phoneme recognizer; And

As a result of the determination, when the average output value is greater than or equal to the second predetermined reference value, mapping an output node number output from each selected phoneme recognizer to a corresponding phoneme code, and then outputting a combination of phonemes according to the phoneme code; Hangul recognition system characterized in that.

The method according to any one of claims 1 to 4,

Phoneme recognizer for recognizing the first type of Hangul,

When the coordinates of the initial area of the input area are (0, 2, 26, 34) and the neutral coordinates are (4, 0, 39, 38), it is composed of two autonomous recognizers to recognize the initial and neutral, respectively. In order to recognize the initial consonants, the autonomous recognizer generates a 99-dimensional mesh vector with a 3x3 mesh window and has a multi-layer with 99 input nodes, 20 hidden nodes, and 19 output nodes. The self-identifier for perceptron (MLP) and the recognition of neutrality is a 3x3 mesh window that generates 156-dimensional mesh vectors to generate 156 input nodes, 32 hidden nodes and 9 Hangul recognition system having a structure for inputting to the multi-layer perceptron (MLP) having the number of output nodes.

The method of claim 5,

Phoneme recognizer for recognizing the second type of Hangul,

When the coordinates of the initial area of the input area are (2, 1, 39, 29) and the neutral coordinates are (2, 2, 37, 39), two self-identifiers are recognized to recognize the initial and neutral stars respectively. In order to recognize the initial consonants, the autonomous recognizer is a 3x3 mesh window that generates a 130-dimensional mesh vector and has a multi-layer with 130 input nodes, 25 hidden nodes, and 19 output nodes. The self-identifier for perceptron (MLP) and the recognition of neutrality is a 3x3 mesh window that generates a 156-dimensional mesh vector to generate 156 input nodes, 15 hidden nodes, and 5 Hangul recognition system having a structure for inputting to the multi-layer perceptron (MLP) having the number of output nodes.

The method of claim 6,

Phoneme recognizer for recognizing the third type of Hangul,

The initial coordinate of the input area is (1, 3, 29, 32), the horizontal coordinate is (0, 3, 28, 37), and the vertical coordinate is (2, 8, 39, 37). In order to recognize the first consonants, the horizontal vowels and the vertical vowels, three self-identifiers are used. The self-identifiers for the recognition of the first consonants are generated by generating a 100-dimensional mesh vector with a 3x3 mesh window. A multi-layer perceptron (MLP) with 100 input nodes, 45 hidden nodes, and 19 output nodes, and a self-identifying device for recognizing horizontal vowels is a 3x3 mesh window. 3D mesh is generated by multi-layer perceptron (MLP) with 120 input nodes, 10 hidden nodes, and 3 output nodes. Create a 130-dimensional mesh vector with a mesh window 130 and the input node 15 can be hidden node with multi-layer perceptron (MLP) Hangul recognition system, characterized in that with a structure in which the input to obtain a five-number of output nodes.

The method of claim 7, wherein

Phoneme recognizer for recognizing the fourth type of Hangul,

In the case where the initial coordinate of the input area is (0, 1, 26, 26), the neutral coordinate is (1, 1, 39, 26), and the final coordinate is (3, 25, 38, 39), It is composed of three self-identification devices to recognize the initial, neutral, and final species, respectively. The self-identification device for recognizing the first castle is a 2x2 mesh window that generates 182-dimensional mesh vector to create 182 input nodes. Input to multi-layer perceptron (MLP) with number, 38 hidden nodes, and 19 output nodes, and the autonomous recognizer for recognizing neutrality is a 3x2 mesh window, which is a 169-dimensional mesh vector Is generated and input into a multi-layer perceptron (MLP) having 169 input nodes, 35 hidden nodes, and 9 output nodes, and a self-identifying device for recognizing the finality is a 2x2 mesh window. Create a 144-dimensional mesh vector with 144 input nodes and 39 The number of hidden nodes with multi-layer system Hangul recognition, characterized in that with a structure in which the input perceptron (MLP), which will have the output node 26.

The method of claim 8,

In the case of type 5 characters, the process of fixing the position of the horizontal collection using an algorithm that finds the vertical position of the horizontal component and separates the phoneme region based on the position,

From the leftmost column of the text image, the first column of black pixels is found, and then the horizontal position in the vertical position where the black pixels are positioned to determine whether the black pixels in the found columns are a point or noise that is part of the horizontal collection. Examine the pixels in the row, and if there are consecutive black pixels at the beginning, end, and middle of the horizontal row, recognize them as a horizontal collection, then the initial lower coordinate = y + Δy1, the final upper coordinate = y + Δy2, the neutral top The vowel and the two consonant areas are separated (Δyl, Δy2, Δy1 ', and Δy2') such as coordinates = y + Δy1 'and neutral bottom coordinates = y + Δy2' (-1, 4, -12, 15). Hangul recognition system, characterized in that to normalize.

The method of claim 9,

Phoneme recognizer for recognizing the fifth type of Hangul,

The initial coordinate of the input area is (0, 0, 39, Y-1), the neutral coordinate is (1, Y-12, 38, Y + 15), and the final coordinate is (0, Y + 4, 39, 39), it consists of three self-identification devices to recognize the initial, neutral, and final species, respectively, and the self-identification device for recognizing the initials is normalized to 40x20 size and mesh window of 3x2 size (mesh window). ) Creates a 140-dimensional mesh vector and inputs it to a multi-layer perceptron (MLP) having 140 input nodes, 27 hidden nodes, and 19 output nodes, and a self-identifying device for detecting neutrality is 38x28 size. Multi-layer perceptron (MLP) with 182 input nodes, 25 hidden nodes, and 5 output nodes by normalizing the size and generating a 182-dimensional mesh vector with a 3x2 mesh window. And a phoneme recognizer to recognize the species, and normalizes the size to 40x20. Generates a 140-dimensional mesh vector with a high 3x2 mesh window and inputs it to a multi-layer perceptron (MLP) having 140 input nodes, 35 hidden nodes, and 24 output nodes. Hangul recognition system characterized in that having a.

The method of claim 10,

Phoneme recognizer for recognizing the sixth type of Hangul,

The initial coordinates of the input area are (2, 1, 27, 23), the horizontal coordinates are (1, 13, 29, 27), the vertical coordinates are (13, 2, 39, 27), When the coordinates of the finality are (4, 24, 38, 39), four phoneme recognizers are used to recognize the first consonants, the horizontal vowels, the vertical vowels, and the final consonants, respectively. Create a 72-dimensional mesh vector with a mesh window and input it into a multi-layer perceptron (MLP) with 72 input nodes, 20 hidden nodes, and 19 output nodes, and recognizes horizontal vowels. The multi-layer perceptron (MLP) has 54 input nodes, 17 hidden nodes, and 3 output nodes by generating a 54-dimensional mesh vector with a 3x3 mesh window. Is a 3x3 mesh window. It generates a 54-dimensional mesh vector and inputs it into a multi-layer perceptron (MLP) having 54 input nodes, 17 hidden nodes, and 5 output nodes. Creates a 72-dimensional mesh vector with a mesh window and inputs it to a multi-layer perceptron (MLP) having 72 input nodes, 10 hidden nodes, and 9 output nodes. Hangul Recognition System.