KR900005141B1

KR900005141B1 - Handwritter character recognizing device

Info

Publication number: KR900005141B1
Application number: KR1019850002125A
Authority: KR
Inventors: 요시가쯔 나까무라; 요시아끼 구로사와; 유다까 히다이; 가쯔노리 오오이
Original assignee: 가부시끼가이샤 도오시바; 사바 쇼오이찌
Priority date: 1984-04-12
Filing date: 1985-03-29
Publication date: 1990-07-20
Also published as: KR850007707A; JPS60217482A

Abstract

The handwritten letter recognition system comprises a memory circuit (4) for storing coordinates of stroke on a tablet, a characteristic patern memory circuit (7) stored with standard characteristic pattern, a characteristic point extraction circuit (5a) to decide the number of the charateristic point and extract the position coordinates of the characteristic point, and an identification circuit (5c) to identify the written letter by comparing the position coordinates of the characteristic point with that of the standard characteristic pattern.

Description

Character Recognition Device

제1도 내지 제3도는 실효시간내에서 문자를 인식하는 대표적인 방법을 설명하기 위한 도면.1 to 3 are diagrams for explaining a representative method of recognizing a character within the effective time.

제4도는 본 발명의 실시예 1에 따른 문자인식장치를 개략적으로 도시해 놓은 구성도.4 is a configuration diagram schematically showing a character recognition device according to a first embodiment of the present invention.

제5도는 제4도에서의 인식처리부의 구성예를 도시해 놓은 도면.FIG. 5 is a diagram showing an example of the configuration of the recognition processing unit in FIG.

제6도는 제5도에서의 식별회로의 구성예를 도시해 놓은 구성도.6 is a configuration diagram showing an example of the configuration of the identification circuit in FIG.

제7도는 스트록크에 대한 추출특징점수의 제어개념을 설명하기 위한 도면.7 is a view for explaining a control concept of the extracted feature score for the stroke.

제8도는 스트록크를 표현하는 특징점의 모양을 도시해 놓은 도면.8 shows the shape of feature points representing strokes.

제9도는 유사스트록크에 대한 문제점을 설명하기 위한 도면.9 is a diagram for explaining a problem with pseudo stroke.

제10도는 국부특징점의 추출처리회로를 도시해 놓은 구성도.10 is a block diagram showing an extraction processing circuit of a local feature point.

제11도 및 제12도는 각각 국부스트록크의 국부특징점의 모양을 도시해 놓은 도면이다.11 and 12 show the shapes of the local feature points of the local stroke, respectively.

* 도면의 주요부분에 대한 부호의 설명* Explanation of symbols for main parts of the drawings

1 : 타블렛장치 2 : 좌표검출회로1 tablet device 2 coordinate detection circuit

3 : 전단계처리회로 4 : 좌표계열 기억회로3: previous step processing circuit 4: coordinate sequence memory circuit

5 : 획수 검출회로 5a : 특징점 추출회로5: stroke number detection circuit 5a: feature point extraction circuit

5b : 특징점계열 기억회로 5c : 식별회로5b: feature point sequence memory circuit 5c: identification circuit

6 : 인식처리부 7 : 표준특징패턴 기억부6: recognition processing unit 7: standard feature pattern storage unit

8 : 해답결정회로 11 : 선두번지 테이블8: Answer determination circuit 11: Head address table

12,13,14 : 레지스터 15 : 멀티플렉서12,13,14: Register 15: Multiplexer

16 : 가산기 17 : 연산제어회로16: adder 17: operation control circuit

18,19 : 레지스터18,19: register

[산업상의 이용분야][Industrial use]

본 발명은 좌표입력장치를 거쳐서 입력되는 문자나 기호를 인식하도록 된 장치에 관한 것으로서, 특히 입력되는 문자나 기호를 용이하면서도 고정밀도로 그 입력과 동시에 인식할 수 있도록 된 문자인식장치에 관한 것이다.The present invention relates to a device for recognizing a character or a symbol input through a coordinate input device, and more particularly, to a character recognition device capable of easily and precisely recognizing an input character or symbol at the same time.

[종래의 기술 및 그 문제점][Traditional Technology and Problems]

계산기에 대한 문자정보의 입력장치로서, 종래에는 서류에 기재되어 있는 영문자나 일본 가나문자등을 광학적으로 판독한 다음, 그 문자패턴을 표준패턴과 비교함으로써 문자를 인식하도록된 문자 판독장치(OCR)가 개발되어 있다.Character input device (OCR) for inputting character information for a calculator, which conventionally reads alphabetic characters or Japanese Kana characters, etc. described in documents, and compares the character pattern with a standard pattern. Is developed.

그러나, 일본어를 구성하고 있는 문자의 종류로서는 한자나 가나문자를 합쳐 수천종에 이르고 있기 때문에, 이렇게 다양한 문자를 상기 OCR을 이용해서 광학적으로 판독한 다음, 그 문자 패턴을 분석하여 문자를 인식하기 위해서는 대단히 많은 인식처리가 필요하게 되고, 또한 인식의 정밀도를 높이기 위해서는 복잡한 패턴처리가 필요하게 된다. 이 때문에 널리 보급되고 있는 퍼스널컴퓨터와 같은 소형 계산기의 문자정보 입력수단으로서 상기 OCR을 이용한 문자인식장치를 채택한다는 것은 지극히 곤란하였다.However, since there are thousands of kinds of characters constituting Japanese, including Kanji and Kana characters, it is necessary to optically read these various characters using the OCR, and then analyze the character patterns to recognize the characters. A great deal of recognition processing is required, and complicated pattern processing is required to increase the accuracy of recognition. For this reason, it was extremely difficult to adopt the character recognition device using the OCR as the character information input means of a small calculator, such as a personal computer, which is widely used.

한편, 최근에 이르러서는 반도체기술이 진보됨에 따라 대폭 향상된 상기 퍼스널컴퓨터 등의 정보처리능력을 유효하게 살리면서, 저렴한 가격으로 입수할 수 있는 감압 타블렛장치(感壓 tablet 裝置)등과 같은 좌표 입력장치를 이용하여 문자정보를 필기입력(筆記入力)하고, 이 문자정보를 좌표정보로 즉시 검출하여 문자의 종류를 검출하도록 된 문자정보 입력장치가 개발되고 있다.On the other hand, in recent years, as the semiconductor technology is advanced, coordinate input devices such as a pressure-sensitive tablet device, etc., which can be obtained at a low price while effectively utilizing the information processing capability of the personal computer and the like which have been greatly improved. Character information input devices have been developed in which handwriting input of character information is used to detect the character information by immediately detecting the character information as coordinate information.

상기와 같은 문자정보 입력방식을 채용하게 되면, 한자를 포함한 수천종이 넘는 문자정보를 간편하게 인식할 수가 있을 뿐만 아니라, 그 인식결과를 즉시 확인하면서 입력시킬 수 있게 되므로 상기 퍼스널컴퓨터 등의 문자정보입력수단으로서 실용성이 매우 높게 된다.By adopting the character information input method as described above, not only can easily recognize thousands of kinds of character information including Chinese characters, but also can immediately input the character information input means such as the personal computer. As a result, practicality becomes very high.

이와 같이 좌표입력장치를 통해서 필기입력되는 문자정보를 즉시 인식하게 되는 문자인식장치에 쓰여지는 인식방식으로서는 종래 다음과 같은 방식들이 제안되어 있다.As described above, the following methods have been proposed as a recognition method used in a character recognition device that recognizes character information input by hand through a coordinate input device.

즉, 그 하나의 대표적인 인식방법으로서는, 예컨대 제1도에 나타낸 바와같이「英」이라는 문자가 필기입력될때, 그 문자를 형성하고 있는 복수의 스트록크(stroke)의 형상을 그 스트록크마다 판정해서 미리 준비되어 있는 소수의 표준스트록크 형상으로 코드화(code)하고, 그 형상코드계열과 각 스트록크의 위치정보로 부터 상기 입력문자를 식별하는 방법을 들 수 있다.That is, as one representative recognition method, for example, as shown in FIG. 1, when the letter "英" is handwritten, the shapes of a plurality of strokes forming the letter are determined for each stroke. And a method of identifying the input character from the shape code sequence and the positional information of each stroke.

그러나, 이와 같은 인식방법에 있어서, 인식성능을 향상시키기 위해서는 필기자의 습관에 따라 변형되어지는 스트록크의 갖가지 형상에 대처하기 위해 20∼30종류 정도의 표준스트록크 형상을 준비해 둘 필요가 있고, 더우기 소위 리터닝 스트록크(returning stroke)나 점핑 스트록크(jumping stroke)의 변형잡음에 의한 스트록크현상의 변형을 흡수해서 문자를 정밀하게 인식하고자 하는 경우에는 준비되어야 할 스트록크형상의 패턴이 매우 복잡화되어 그 수효는 필연적으로 매우 많아지게 된다.However, in such a recognition method, in order to improve the recognition performance, it is necessary to prepare about 20-30 types of standard stroke shapes in order to cope with various shapes of strokes that are deformed according to the habits of the writer. In order to precisely recognize the characters by absorbing the deformation of the stroke caused by the deformation noise of the so-called returning stroke or jumping stroke, the stroke-shaped pattern to be prepared is very complicated. The number is inevitably very large.

따라서, 제2도(a)에 나타낸 바와 같이, 정해진 방향코를 이용해서 인식대상문자의 필기 스트록크를 차례로 방향코드화(方向 code 化)하고, 예컨대 제2도(b)에 나타낸 문자「日」의 필기스트록크를 순차적으로 방향코드화하고, 이 방향 코드에 길이정보 및 필기구의 업/다운정보를 부가하여 필기스트록크를 제2도(c)에 나타낸 바와 같은 방향 코드 계열로 표현함으로써, 필기스트록크를 표현하는 로드 정보를 줄여서 문자인식처리의 간략화를 도모하는 방식도 제안되어 있다(단, 제2도에서 ①~④는 상호 대응되는 부분을 나타낸 것이다).Therefore, as shown in Fig. 2 (a), the writing stroke of the character to be recognized is sequentially coded using the determined direction nose, for example, the character " 日 " shown in Fig. 2 (b). The writing stroke of the writing stroke is sequentially coded, and the writing stroke is represented by the direction code sequence as shown in Fig. 2C by adding length information and up / down information of the writing instrument to the direction code. A method of simplifying the character recognition process by reducing the load information representing the size is also proposed (however, in Fig. 2, ① to ④ indicate corresponding parts).

그러나. 상기한 인식방식에서는 필기순서가 틀려서 입력문자의 필기스트록크 순서가 변화되거나, 혹은 필기상 잡음이 지나친 스트록크가 입력되는 경우에는 입력문자를 나타내는 스트록크의 계열정보가 크게 변화되기 때문에, 필기순서의 잘못이나 잡음을 흡수하여 안정되게 문자를 인식할 수 없다는 불편함이 따르게 된다.But. In the above recognition method, the writing order of the input characters is changed due to the wrong writing order, or when the stroke is inputted with excessive noise, the sequence information of the stroke representing the input characters is greatly changed. The inconvenience is that you can not recognize the character stably by absorbing the error or noise.

이러한 불편함에 대처하기 위해서, 예컨대 제3도에 나타낸 바와 같이, 문자패턴의 각 스트록크를 그 스트록크상의 복수의 특징점으로 표현되는 문자인식방식이 제안되어 있다.In order to cope with such inconvenience, for example, as shown in FIG. 3, a character recognition method is proposed in which each stroke of a character pattern is represented by a plurality of feature points on the stroke.

이 문자인식방식에서는 문자를 구성하는 스트록크의 수효에 따라서 각 스트록크의 분할점 수효를 정하고, 이 분할점 수효에 따라서 필기입력되는 문자의 각 스트록크를 같은 간격으로 분할하며, 이러한 등간격 분할점과 그 스트록크의 시작점 및 종료점을 상기 각 스트록크를 나타내는 특징점의 위치좌표로서 표현한다. 그리고, 이들 스트록크의 각 특징점의 위치좌표와 미리 준비되어 있는 표준패턴 특징점의 위치좌표간에 매칭처리를 실행함으로써 문자인식을 행하게 된다.In this character recognition method, the number of split points of each stroke is determined according to the number of strokes constituting the character, and the number of strokes of the handwritten characters is divided at equal intervals according to the number of split points. The point and the start point and the end point of the stroke are expressed as the position coordinates of the feature points representing the strokes. Then, character recognition is performed by performing a matching process between the position coordinates of the feature points of these strokes and the position coordinates of the standard pattern feature points prepared in advance.

예를 들여, 곡선성분이 많고 획수가 적은 히라가나등의 경우에는 그 스트록크의 길이를 각각 5등분해서 각 스트록크를 6점 근사시킨 특징점위치좌표로 표현하고, 또 한자처럼 획수가 많은 문자에 대해서는 그 스트록크를 각각 2등분해서 각 스트록크를 3점 근사시킨 특징점의 위치좌표로 표현한다. 여기에서 스트록크의 3점 근사는 2개의 직선으로 표시되는 각 스트록크가 한자패턴 가운데 비교적 많이 존재하기 때문에 그 스트록크를 확실하게 근사시키기 위한 것이다.For example, in the case of Hiragana, which has many curved components and few strokes, the stroke length is divided into five parts, and each stroke is expressed as a feature point position coordinate that approximates six strokes. The strokes are divided into two, and each stroke is expressed by the position coordinates of the feature points approximated by three points. Here, the three-point approximation of the strokes is intended to approximate the strokes reliably because each stroke represented by two straight lines exists relatively in the Chinese character pattern.

그러나, 문자를 구성하는 대다수의 스트록크는 그 균등 분할점의 정보를 필요로 하지 않는 상태로 유사하게 표현될 수 있는데, 이러한 경우에는 상술한 바와 같이 각 스트록크는 그 분할점을 포함하는 복수의 특징점에 근사(近사)하게 되므로, 이때의 인식처리를 위해서는 표준특징패턴의 수가 극히 많이 필요하게되고, 또 그러한 표준특징패턴에 대해서 각각 매칭(matching)처리를 할 필요가 있기 때문에 처리시간이 매우 길어지게 되는 문제가 생기게 된다.However, the majority of the strokes constituting the character may be similarly represented without requiring the information of their even dividing points, in which case each stroke is assigned to a plurality of feature points including the dividing points. Since it is approximated, the number of standard feature patterns is extremely large for the recognition process at this time, and the processing time becomes very long because each of the standard feature patterns needs to be matched with each other. There is a problem.

또한 각 스트록크는 각각 같은 갯수의 특징점으로 표시되기 때문에, 예컨대 짧은 스트록크나 직선적인 스트록크의 특징점으로부터 구해지는 매칭결과는 과대평가되기 쉬운 반면, 문자식별을 위한 정보를 많이 가지고 있는 스트록크의 특징점으로부터 얻어지는 매칭결과는 과소평가되기 쉬우므로 적절하지 않게 되는 바, 이 때문에 문자를 잘못 인식하거나 인식할 수 없게 되는 경우가 많이 발생하게 된다.In addition, since each stroke is represented by the same number of feature points, the matching result obtained from, for example, a short stroke or a linear stroke is likely to be overestimated, whereas from the feature points of the stroke having a lot of information for character identification. The matching result obtained is easily underestimated and therefore not appropriate. Therefore, there are many cases where characters are misrecognized or unrecognized.

또한, 상술한 바와 같은 스트록크의 특징점 추출에 있어서는, 제3도에 나타낸 바와 같은 유사문자의 경우, 예컨대 히라가나의 「わ」와 「ね」 같이 스트록크에 국부적인 차이밖에 없는 경우에는 이것을 확실히 식별할 수가 없었다. 때문에 종래에는 별도의 방식에 따른 특징추출수단을 설치해서 상기한 불합리에 대처토록 하고 있으나, 이와 같이 하게 되면 인식처리의 알고리즘이 쓸데없이 복잡하게 되고 준비해야할 표준특징 패턴이 증대되게 되므로 그 문자를 인식하기 위한 장치의 구성이 복잡하게 된다는 문제점이 있게 된다. 이러한 이유로 인하여 상기 수단은 문자의 작성과 동시에(실효시간에) 이를 입력시킬 수 있는 문자인식장치로서는 적합하지 않게 된다.In addition, in the extraction of the feature points of the stroke described above, in case of similar characters as shown in FIG. 3, if there is only a local difference in the stroke such as "わ" and "ね" of Hiragana, this is certainly identified. I could not. Therefore, conventionally, a feature extraction means according to a separate method is installed to cope with the above irrationality. However, in this case, the algorithm of recognition processing becomes uselessly complicated and the standard feature pattern to be prepared is increased, thus recognizing the character. There is a problem that the configuration of the apparatus for doing so becomes complicated. For this reason, the above means are not suitable as a character recognition device capable of inputting them at the same time as the creation of the characters (effective time).

[발명의 목적][Purpose of invention]

이에, 본 발명은 상기한 결점을 제거하기 위해 발명된 것으로, 필기문자의 각 스트록크의 분할점수(특징점의 수)를 충분하게 지정함으로써 필기문자의 인식률이 높은 문자 인식장치를 제공함에 그 목적이 있다.Accordingly, the present invention has been invented to eliminate the above-described drawbacks, and the object of the present invention is to provide a character recognition apparatus having a high recognition rate of handwritten characters by designating a sufficient number of split points (number of feature points) of each stroke of the handwritten characters. have.

[발명의 구성][Configuration of Invention]

상기 목적을 실현하기 위한 본 발명은 타블렛장치를 거쳐서 필기입력되는 문자나 기호의 필기스트록크의 시계열 좌표를 기억하는 좌표계열 기억회로와, 미리 준비되어 있는 인식대상 카테고리의 표준특징패턴에 관한 정보를 기억하는 표준특징패턴 기억부, 상기 표준특징패턴의 각 스트록크마다 정해진 각 스트록크를 표현하는 특징점의 수효 정보를 근거로 상기 필기입력된 문자나 기호의 각 스트록크를 표현하는 특징점의 수를 결정하고, 이들 각 특징점의 위치좌표를 상기 좌표계열기억회로에 기억된 스트록크의 시계열좌표로부터 구하는 특징점추출회로, 이 특징점추출회로에서 구한 상기 필기입력된 문자나 기호의 각 스트록크의 특징점 위치좌표와 상기 필기 입력된 문자나 기호를 식별하는 식별회로를 구비한 구성으로 되어 있다.The present invention for achieving the above object is a coordinate system storage circuit for storing the time series coordinates of the writing stroke of a character or symbol handwritten input through a tablet device, and information on the standard feature pattern of the recognition target category prepared in advance The number of feature points representing each stroke of the written character or symbol is determined based on the number information of the standard feature pattern storage unit to be stored and the feature points representing each stroke determined for each stroke of the standard feature pattern. And a feature point extraction circuit for obtaining the position coordinates of each feature point from the time series coordinates of the stroke stored in the coordinate sequence memory circuit, and a feature point position coordinate of each stroke of the handwritten input character or symbol obtained by the feature point extraction circuit. It has a structure provided with the identification circuit which identifies the said handwritten character or symbol.

[작용][Action]

상기한 구성으로 된 본 발명에 의하면, 인식대상문자의 카테고리나 그 문자를 구성하는 부수(部首)등에 대응해서 입력문자의 각 스트록크에 대하여 최소로 필요한 수효의 특징점을 구하고, 이러한 특징점으로 입력스트록크를 적절하게 표현한 뒤에 이러한 각 스트록크를 표현하는 특징점의 위치좌표와 표준특징패턴에 관한 특징점의 위치정보를 매칭처리함으로써 입력문자를 실효시간내에 간편하고 정도높게 인식할 수 있게 된다. 즉 입력스트록크의 점근사 정확도를 충분히 높여서 정도높은 인식결과를 얻을 수 있게 된다.According to the present invention having the above-described configuration, the minimum required number of feature points is obtained for each stroke of the input character corresponding to the category of the character to be recognized, the number of copies constituting the character, and the like. After the stroke is appropriately expressed, the matching process of the position coordinates of the feature points representing each stroke and the feature information on the standard feature pattern makes it possible to recognize the input characters easily and accurately within the effective time. That is, the accuracy of the approximation of the input stroke is sufficiently increased to obtain a high recognition result.

또한 스트록크를 표현하는 특징점의 수효를 각각 필요한 만큼 정하기 때문에 특징점의 수를 충분히 줄일 수 있게 된다. 따라서 준비해야 하는 표준특징패턴의 수효도 적어져서 기억부의 구성을 간편하게 할 수 있게 되고, 이와 함께 대조하게될 특징점의 수효가 적어지게 되므로 인식처리시간을 대폭적으로 단축시킬 수 있다.In addition, since the number of feature points representing the stroke is determined as necessary, the number of feature points can be sufficiently reduced. Therefore, the number of standard feature patterns to be prepared is also reduced, so that the structure of the storage unit can be simplified, and the number of feature points to be contrasted with this is reduced, thereby significantly reducing the recognition processing time.

[실시예]EXAMPLE

이하 예시도면을 참고하면 본 발명의 실시예를 상세히 설명한다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

제4도는 본 발명의 실시예 1에 따른 문자인식장치를 개략적으로 도시해 놓은 구성도로, 타블렛장치(1)상에 필기구(1a)를 이용해서 필기입력시키는 문자나 기호의 정보는 타블렉장치(1)에 부속된 좌표검출회로(2)에 의하여 그 스트록크를 나타내는 연속된 좌표위치의 시계열위치좌표데이터(time series data)로 검출된다.4 is a configuration diagram schematically showing a character recognition device according to a first embodiment of the present invention, wherein the information of a character or a symbol to be inputted by using the writing instrument 1a on the tablet device 1 is a tablet device ( Coordinate detection circuit 2 attached to 1) detects time series position coordinate data (time series data) of continuous coordinate positions representing the stroke.

그리고, 상기한 좌표검출회로(2)에 의해서 검출된 시계열위치좌표데이터는 전단계 처리회로(3)에 입력되어 타블렛장치(1)에서의 검출오차나 점핑스트록크, 리터닝스트록크 등과 같은 미소한 잡음이 제거된 후. 좌표계열 기억회로(4)에 순서대로 기억된다. 또한 상기한 전단계처리회로(3)는, 예컨대 필기입력된 문자중 1문자분의 구절이 좌표검출회로(2)에 의해서 검출되었을때, 상기 좌표계열기억회로(4)에 기억되는 시계위치좌표데이터의 문자크기를 정규화하는 등의 전단계 처리도 실시하게 된다.Then, the time-series position coordinate data detected by the coordinate detection circuit 2 is inputted to the previous stage processing circuit 3 so as to detect minute errors such as a detection error, a jumping stroke, a turning stroke, etc. in the tablet device 1. After the noise is removed. It is stored in the coordinate series memory circuit 4 in order. In addition, the preceding step processing circuit 3, for example, the clock position coordinate data stored in the coordinate sequence memory circuit 4 when a passage for one character of the handwritten characters is detected by the coordinate detection circuit 2, for example. The previous step is also performed, such as normalizing the character size.

이렇게 하여 좌표계열 기억회로(4)에는 상기한 바와 같이 전단계 처리된 1문자분의 필기스트록크를 나타내는 시계열 위치좌표데이터가 입력문자데이터로서 기억된다.In this way, the coordinate sequence memory circuit 4 stores time series position coordinate data indicating the handwritten stroke for one character processed as described above as input character data.

또한 회수검출회로(5)는 좌표검출회로(2)의 출력으로부터 타블렛장치(1)의 표면에 대한 필기구(1a)의 업/다운을 검출함으로써 필기입력된 문자의 스트록크수, 즉 획수를 검출한다. 그리고 이 획수검출회로(5)에 의해서 검출된 스트록크의 수효에 관한 정보는 인식처리부(6) 및 표준특징패턴기억부(7)에 입력된다.The number detecting circuit 5 also detects the number of strokes, i.e., strokes, of the handwritten characters by detecting the up / down of the writing instrument 1a with respect to the surface of the tablet device 1 from the output of the coordinate detecting circuit 2. . Information about the number of strokes detected by the stroke number detection circuit 5 is input to the recognition processing section 6 and the standard feature pattern storage section 7.

그리고, 상기한 표준특징패턴기억부(7)에는 획수정보에 따라서 사용되는 모든 문자 ·기호 등의 표준패턴이 격납되어 있고, 획수검출회로(5)로부터의 획수정보에 따라서 인식처리해야할 회수의 표준특징패턴만을 독출할 수 있도록 되어 있다. 또한 상기 인식처리부(6)는 상기한 획수정보에 따라서 표준특징패턴기억부(7)로부터 해당 획수의 표준 특징패턴을 순서대로 독출함과 동시에 좌표계열 기억회로(4)에 기억되어 있는 시계열 위치좌표 데이터를 판독하여 표준특징패턴에 대응되는 각 스트록크 특징점의 위치 좌표를 구한 후, 표준특징패턴 기억부(7)로부터 독출한 표준특징패턴의 복수의 문자를 순차 상기 입력필기문자패턴의 각 스트록크를 나타내는 특징점의 위치좌표 데이터와 비교하여 매칭처리를 실행하고, 그 매칭처리결과를 해답결정회로(8)에 출력한다. 이어, 상기 해답결정회로(8)는 각 매칭처리결과를 식별·판정하여 인식결과로서 가장 확실할 것 같은 표준특징패턴의 카테고리를 필기입력된 문자나 기호의 식별결과로 구하게 된다. 이 경우 상기 인식결과로서 유사도(類似渡)가 높은 복수의 인식문자를 결과후보로서 출력하도록 해도 된다.The standard feature pattern storage unit 7 stores standard patterns such as all characters and symbols used in accordance with the stroke number information, and standardizes the number of times to be recognized and processed according to the stroke number information from the stroke detection circuit 5. Only feature patterns can be read. In addition, the recognition processing section 6 reads out the standard feature pattern of the corresponding stroke number in order from the standard feature pattern storage unit 7 according to the stroke number information, and at the same time stores the time series position coordinates stored in the coordinate sequence memory circuit 4. After reading the data to obtain the position coordinates of each stroke feature point corresponding to the standard feature pattern, a plurality of characters of the standard feature pattern read out from the standard feature pattern storage unit 7 are sequentially sequenced to each stroke of the input written character pattern. The matching process is executed by comparing the position coordinate data of the feature point indicating and the matching result is output to the answer determination circuit 8. Then, the answer determination circuit 8 identifies and judges each matching processing result, and obtains the category of the standard feature pattern that is most likely to be the recognized result as the identification result of the handwritten characters or symbols. In this case, as the recognition result, a plurality of recognition characters with high similarity may be output as result candidates.

상기한 인식처리부(6)는 예컨대 제5도에 도시된 바와 같이 특징점추출회로(5a)와 특징점계열 기억회로(5b) 및 식별회로(5c)로 구성되는데, 여기서 상기 특징점 추출회로(5a)는 좌표계열 기억회로(4)로부터 시계열로된 위치좌표 데이터를 독출하여, 우선 미리 설정된 각 스트록크의 최대 특징점수효에 따라서 각 스트록크의 시계열 데이터를 같은 간격으로 분할한 후, 그 분할점을 각각 특징점으로 해서 각 스트록크의 시작점과 종료점을 포함하는 복수의 특징점을 추출하게 된다. 예를 들어 특징점의 최대수효가 9로 주어질 경우 입력문자패턴의 각 스트록크를 각각 8등분해서 그 스트록크의 8등분점인 7개의 분할점과, 스트록크의 시작점과 종료점을 포함하는 도합 9개의 특징점 위치좌표를 각 스트록크마다 구하게 된다. 특징점추출회로(5a)는 이들의 각 특징점의 위치좌표를 상기 좌표계열 기억회로(4)에 기억되어 있는 데이터로부터 구한다. 따라서 이때 각 스트록크는 각각 9개의 특징점으로 표현되게 되고, 또 그 특징점의 위치좌표 데이터는 특징점계열기억회로(5b)에 기억되게 된다.The recognition processing section 6 is composed of a feature point extraction circuit 5a, a feature point sequence storage circuit 5b, and an identification circuit 5c, as shown in FIG. 5, for example. The positional coordinate data in time series is read out from the coordinate sequence memory circuit 4, and the time series data of each stroke is divided at equal intervals according to the maximum number of feature points set in advance for each stroke. As a result, a plurality of feature points including the start point and the end point of each stroke are extracted. For example, if the maximum number of feature points is 9, each stroke of the input character pattern is divided into eight equal parts, seven split points that are eight equal points of the stroke, and nine total points including the start and end points of the stroke. The feature point position coordinates are obtained for each stroke. The feature point extraction circuit 5a obtains the position coordinates of these feature points from the data stored in the coordinate sequence memory circuit 4. Therefore, each stroke is represented by nine feature points, and the position coordinate data of the feature points are stored in the feature point sequence storage circuit 5b.

상기 특징점검출장치는 입력문자의 한 구절이 검출될 때까지 각 스트록크마다 되풀이하여 실시된다. 그리고 1문자분의 각 스트록크의 특징점위치정보가 상기 특징점계열기억회로(5b)에 기억되었을때, 상기 식별회로(5c)에 의해 상기 추출된 특징점의 데이터와 표준특징패턴기억부(7)에 등록되어 있는 표준특징패턴과의 매칭처리가 실행되게 된다.The feature point detection device is repeated for each stroke until one phrase of the input character is detected. When the feature point position information of each stroke for one character is stored in the feature point sequence storage circuit 5b, the extracted feature point data and the standard feature pattern storage section 7 are stored in the feature point sequence storage circuit 5b. Matching processing with the registered standard feature pattern is executed.

제6도는 상기한 식별회로(5c)와 표준특징패턴기억부(7) 및 특징점계열기억회로(5b)간의 구체적인 결합구성관계를 나타낸 것으로서, 도면중 11은 표준특징패턴기억부(7)에 대한 선두번지 테이블(先頭番地 table)을 나타낸다.FIG. 6 shows a specific coupling configuration relationship between the identification circuit 5c, the standard feature pattern storage unit 7 and the feature point sequence storage circuit 5b. In FIG. 11, FIG. Represents the head address table.

상기 선두번지 테이블(11)은 그 표준패턴을 문자의 회수에 따라서 분류등록하고 있는 상기 표준특징패턴기억부(7)의 각 복수목록중, 최초로 등록된 표준특징패턴의 등록번지를 입력문자의 획수에 따라서 기억하는 것이다. 그리고 상기 획수검출회로(5)에서 검출된 획수의 선두번지데이터는 선두번지테이블(11)에 셋트된다. 이 선두번지테이블(11)로부터의 선두번지데이터는 레지스터(12)를 통해서 상기 표준특징패턴기억부(7)로 데이터독출어드레스로서 인가된다.The head address table 11 registers the number of input characters of the first registered address of the standard feature pattern among the plurality of lists of the standard feature pattern storage unit 7 which classify and register the standard pattern according to the number of letters. So remember. The head address data of the number of strokes detected by the stroke number detection circuit 5 is set in the head address table 11. The head address data from the head address table 11 is applied as a data read address to the standard feature pattern storage section 7 via the register 12.

상기한 어드레스에 의해 지정된 표준특징패턴기억부(7)의 선두번지에는 그 획수의 최초 문자중 최초 스트록크에 대한 표준특징 좌표계열이 특징패턴으로서 기억되어 있는 데, 이 표준특징 좌표계열은, 예컨대 그 문자의 카테고리를 j로 할 경우 제1의 특징점좌표는 (XOj, YOj)로, 제2의 특징점좌표는 (X1j, Y1j)로, 제3의 특징점좌표는 (X2i, Y2j)‥‥로 표시되게 된다.In the first address of the standard feature pattern storage unit 7 designated by the above-mentioned address, a standard feature coordinate sequence for the first stroke of the first characters of the stroke number is stored as a feature pattern. When the character category is j, the first feature point coordinates are represented by (XOj, YOj), the second feature point coordinates are represented by (X1j, Y1j), and the third feature point coordinates are represented by (X2i, Y2j). Will be.

또한 상기 선두번지의 헤더(header)에는 표준특징 좌표계열에 관련되어 그 스트록크에 대한 특징점수효의 피치정보(pitch data)와, 특징점의 위치정보, 스트록크번호, 그 스트록크가 문자의 최종적인 스트록크인가 아닌가에 대한 정보, 이러한 정보들이 그 특수블록에 있어서 최종 문자 카테고리에 관련된 것인가에 대한 정보, 또 상기 표준특징좌표계열이 그 문자의 전체적인 특징인가 아니면 국부적인 특징인가에 대한 정보등이 기억되어 있다.In addition, the header of the first address includes pitch information of the number of feature points for the stroke, the position information of the feature point, the stroke number, and the stroke in relation to the standard feature coordinate system. Information about whether or not a stroke is present, whether such information relates to the final character category in that special block, and whether the standard feature coordinate series is the overall or local character of the character. It is.

위와 같은 헤더와 함께 해당 획수에 대한 각 문자카테고리의 표준특징패턴이 순서대로 표준특징패턴기억부(7)에 등록되어 있다. 그리고 이러한 각 문자카테고리의 헤더와 표준특징패턴은 명령신호에 따라 순차적으로 출력되는 상기 레지스터(12)로부터의 어드레스에 따라서 선두번지에 격납된 것부터 차례로 독출되도록 되어있다.Along with the header as described above, the standard feature pattern of each character category for the number of strokes is registered in the standard feature pattern storage unit 7 in order. The header and standard feature pattern of each character category are read out sequentially from the one stored at the head address according to the address from the register 12 which is sequentially output in accordance with the command signal.

그리고, 표준특징패턴기억부(7)로부터 독출된 특징점수효의 피치데이터 및 스트록크번호는 레지스터(13)에 전송되고, 이러한 전송타이밍에 동기해서 레지스터(14)에 격납되어 있던 데이터가 클리어됨과 동시에, 멀티플렉서(15)의 모우드가 절환된다. 즉. 입력문자에 대한 각 스트록크의 특징점정보를 특징점계열기억회로(5b)에 기록하기 위한 동작모우드로부터 특징점계열기억회로(5b)에 기억된 특징점의 데이터를 독출하는 동작 모우드로 절환된다. 그리고 상기 레지스터(14)의 출력에 따라 특징점계열기억회로(4b)의 데이터독출 어드레스가 지정되게 되는데, 이 경우 상술한 바와 같이, 우선 최초 레지스터(14)가 클리어되어 있으므로 제1스트록크의 제1특징점(시작점)의 위치좌표데이터가 상기 기억회로(5b)로부터 독출되게 된다. 그리고 이 제1스트록크의 제1특징점(시작점)의 위치좌표데이터는 식별회로(5c)에 입력되어 표준특징패턴 기억부(7)로부터 독출되는 표준특징패턴의 제1스트록크의 제1특징점(시작점)의 위치좌표 데이터와의 거리가 계산되게 된다.The pitch data and stroke number of the number of feature points read out from the standard feature pattern storage unit 7 are transferred to the register 13, and the data stored in the register 14 is cleared at the same time as this transfer timing. The mode of the multiplexer 15 is switched. In other words. The operation mode for recording the feature point information of each stroke for the input character into the feature point sequence storage circuit 5b is switched from the operation mode for reading data of the feature points stored in the feature point sequence memory circuit 5b. According to the output of the register 14, the data read address of the feature point sequence storage circuit 4b is designated. In this case, as described above, the first register 14 is first cleared. The positional coordinate data of the feature point (starting point) is read out from the memory circuit 5b. The position coordinate data of the first feature point (starting point) of the first stroke is input to the identification circuit 5c and read from the standard feature pattern memory 7 to the first feature point of the first stroke of the standard feature pattern ( The distance from the position coordinate data of the starting point) is calculated.

상기한 과정을 거쳐서 특징점계열기억회로(5b)로부터 1개의 특징점의 좌표데이터가 출력되었을때, 상기 레지스터(14)에 격납된 특징점 피치의 데이터가 가산기(16)를 통해서 상기 레지스터(14)에 격납된 데이터에 가산되어 레지스터(14)에 재차 셋트된다. 이에 따라 특징점계열기억회로(5b)로부터 다음에 독출되는 특징점이 상기 피치에서 나타나는 수효만큼 진행되어 지정된다. 그리고 이와 마찬가지로 그 특징점의 위치좌표 데이터가 독출되어 식별회로(5c)에서는 표준특징패턴의 특징점좌표의 거리가 계산되게 된다. 이러한 처리는 특징점의 피치에 따라서 순서대로 선택되는 특징점마다 순차 반복실행되어 각 특징점마다 구해지는 거리가 순차 가산되게 된다. 그리고 모든 스트록크의 모든 특징점에 대해서 거리가 계산되었을때, 그 거리의 가산치가 그 표준특징패턴과 입력문자와의 유사결과(類似結果)로 출력되고, 그 후 표준특징패턴기억부(7)로부터 다음 카테고리의 표준특징패턴이 독출되어, 그 다음의 표준특징 패턴에 대한 매칭처리가 전술한 것과 같은 방법으로 실행된다.When the coordinate data of one feature point is outputted from the feature point sequence storage circuit 5b through the above process, the data of the feature point pitch stored in the register 14 is stored in the register 14 through the adder 16. The data is added to the data and set in the register 14 again. As a result, the next feature point to be read out from the feature point sequence storage circuit 5b is designated by advancing by the number appearing in the pitch. Similarly, the position coordinate data of the feature point is read out, and the identification circuit 5c calculates the distance of the feature point coordinate of the standard feature pattern. This process is repeatedly executed for each feature point selected sequentially in accordance with the pitch of the feature points, so that the distances obtained for each feature point are sequentially added. When the distance is calculated for all the characteristic points of all the strokes, the addition value of the distance is output as a result of the similarity between the standard feature pattern and the input character, and then from the standard feature pattern storage unit (7). The standard feature pattern of the next category is read out, and the matching process for the next standard feature pattern is executed in the same manner as described above.

상기한 처리과정은 상기 헤더정보에 의해 처리중에 있는 표준특징패턴이 해당 획수의 최후문자 카테고리로 표시될 때까지 순차 반복해서 실행되게 된다.The above-described processing is repeatedly executed by the header information until the standard feature pattern being processed is displayed as the last character category of the number of strokes.

제7도는 상술한 특징점 피치에 의해서 가변적으로 제어되는 특징점수효의 개념을 나타낸 것이다. 상술한 바와 같이 각 스트록크는, 예컨대 그 스트록크의 길이를 8등분한 7개의 분할점과 시작점 및 종료점을 포함하는 도합 9개의 특징점으로 표시되어 특징점계열기억회로(5b)에 기억된다.7 shows the concept of the number of feature points variably controlled by the feature point pitch described above. As described above, each stroke is stored in the feature point sequence storage circuit 5b, for example, represented by a total of nine feature points including seven divided points obtained by dividing the length of the stroke into eight equal parts and a start point and an end point.

이와 같은 특징점계열에 대해서 피치 데이터는 특징점의 추출피치를 지정하는 것으로, 예컨대 피치가 1로 주어지는 경우에는 제7도(d)에 나타낸 바와 같이 시작점으로부터 모든 특징점이 순서대로 추출되게 된다. 이에 반하여 피치가 2로 주어지는 경우에는 제7도(c)에 나타낸 바와 같이 특징점이 2개째마다 선택되고, 피치가 4로 주어지는 경우에는 제7도(b)에 나타낸 바와 같이 특징점이 4개째마다 선택된다. 단 스트록크를 1개의 특징점만으로 표현할 수 있는 경우에는 제7도의 (e)에 나타낸 바와 같이 1개의 특징점만을 선택·지지할 수 있는데, 이렇게 1개의 특징점의 스트록크를 표현할 수 있는 일례로서 문자「點」에 있어서 부수「火」의 4개 스트록크('''')를 표현하는 경우를 들 수 있다.For this feature point sequence, the pitch data designates the extraction pitch of the feature point. For example, when the pitch is given as 1, all the feature points are sequentially extracted from the starting point as shown in FIG. On the other hand, if the pitch is given as 2, the feature points are selected every second as shown in Fig. 7 (c). If the pitch is given as 4, the feature points are selected every fourth as shown in Fig. 7 (b). do. However, when the stroke can be expressed by only one feature point, only one feature point can be selected and supported as shown in (e) of FIG. 7. As an example in which the stroke of one feature point can be expressed, the letter “點” ", The four strokes (" ") of the incident" fire "can be mentioned.

상기한 과정을 통해서 표준특징패턴기억부(7)에 미리 등록된 피치정보에 따라 각 스트록크를 표현하는데 충분한 특징점의 수효가 지정되어, 그 수효에 대응되는 입력스트록크의 특징점이 선택적으로 추출되게 된다. 즉, 예를 들어 제8도(a)에 나타낸 바와 같은 입력필기패턴의 3개 스트록크는 제8도(b)에 나타낸 표준특징패턴의 각 스트록크를 표현하는 특징점에 대응하여 제1스트록크 및 제2스트록크를 2개의 특징점(시작점과 종료점)으로서 추출되게 되고, 제3스트록크는 6개의 특징점으로 추출하게 된다.Through the above process, the number of feature points sufficient to represent each stroke is specified according to the pitch information registered in advance in the standard feature pattern storage unit 7, so that the feature points of the input stroke corresponding to the number are selectively extracted. do. That is, the three strokes of the input writing pattern as shown in FIG. 8A, for example, correspond to the first stroke and the feature points representing the respective strokes of the standard feature pattern shown in FIG. 8B. The second stroke is extracted as two feature points (start and end points), and the third stroke is extracted as six feature points.

그리고 이러한 양패턴의 상호간에 대응되는 특징점간의 거리가 각각 계산되어 그 유사성이 평가되게 된다.The distances between the feature points corresponding to the two patterns are calculated, respectively, and the similarity is evaluated.

한편, 표준특징패턴기억부(7)에 격납되어 있는 각 스트록크의 순서(스트록크번호)와 필기입력되는 스트록크의 순서가 항상 일치된다는 보장은 없으나, 이 점에 대해서는 예컨대 유사스트록크를 순서대로 대조시키는 방법으로 대처할 수가 있다. 그리고 이러한 융통성을 부여함으로써 획수가 많은 한자의 인식처리에 충분하게 대처할 수 있게 된다. 즉, 그 일부의 대조결과가 어떤 스트록크의 범위에서 미리 정해진 평가치로부터 크게 벗어나는 경우에는 그 이후의 인식처리를 중단시킴으로써 처리효율의 향상을 도모함에 유용하게 된다.On the other hand, there is no guarantee that the order (stroke number) of the strokes stored in the standard feature pattern storage unit 7 and the order of the strokes inputted by hand are always matched. You can cope with it by contrast. By providing such flexibility, it is possible to cope with the recognition process of Chinese characters with a large number of strokes. That is, when a part of the collation result deviates greatly from the predetermined evaluation value in a range of strokes, it becomes useful to improve the processing efficiency by stopping the subsequent recognition process.

상기한 본 장치에 따르면, 문자패턴의 각 스트록크를 각각 필요충분한 수효의 특징점으로 표현해서, 즉 그 인식대상 카테고리의 스트록크가 갖는 특징량에 따라서 설정되는 수의 특징점으로 입력스트록크를 표현하여 인식처리를 실행하므로, 직선과 같은 단순한 스트록크로부터 얻어지는 특정점간의 거리가 과대하게 평가되지 않게 되고, 입력문자 패턴에 대해 전반적으로 직절한 평가를 얻을 수 있게 된다. 즉 직선등의 간단한 스트록크에 대해서는 특징점의 수효를 줄이고, 곡선과 같은 스트록크에 대해서는 그 스트록크를 표현하기 위해 필요한 수효의 특징점을 부여하게 되므로, 특징점에 의한 스트록크의 근사 정확도를 높여서 문자인식의 정밀도를 높일 수 있게 된다. 또한 상술한 바와 같이 각 스트록크가 각각 적절한 수효의 특징점으로 표현되므로 문자전체의 특징점수를 줄일 수 있게 되고, 이로부터 그 계산량을 줄여서 인식처리의 고속화를 도모할 수 있게 된다. 따라서 종래의 문자인식방법에서 볼 수 있는 불합리성을 제거하여 실용적인 잇점을 극대화할 수 있다.According to the above-described apparatus, each stroke of the character pattern is represented by a necessary and sufficient number of feature points, that is, the input stroke is represented by a number of feature points set according to the feature amount of the stroke of the recognition target category. By performing the recognition process, the distance between the specific points obtained from a simple stroke such as a straight line is not excessively evaluated, and an overall straightforward evaluation of the input character pattern can be obtained. In other words, the number of feature points is reduced for simple strokes such as straight lines, and the number of feature points necessary for expressing the strokes is given to strokes such as curves, so that the approximate accuracy of the strokes by the feature points is increased. It is possible to increase the precision of. Further, as described above, each stroke is represented by an appropriate number of feature points, so that the number of feature points of the entire character can be reduced, thereby reducing the amount of computation and thereby speeding up the recognition process. Therefore, it is possible to maximize the practical advantage by removing the irrationality seen in the conventional character recognition method.

그런데 제9도에 나타낸 바와 같이 문자「わ」가 필기입력된 경우, 상술한 문자인식처리만으로는 충분히 신뢰성이 높은 인식결과를 얻지 못할 수 있다. 즉, 문자패턴이 제9도(a)와 같이 표시되는 것에 대하여 문자「わ」와 「ね」의 표준특징패턴이 제9도(b)와 제9도(c)에 나타낸 바와 같이 부여되는 경우에는 특징점간의 거리는 거의 큰 차이가 생기지 않게 된다. 다시 말하여 상기한 경우에는 제3스트록크의 종료점 부근에만 틀리는 곳이 있을 뿐이므로 특징점에 따라 근사되는 스트록크의 차이점은 아주 미소하게 된다. 이 때문에 상기 2개의 표준특징패턴에 대한 입력문자의 인식평가결과가 같은 정도로 될 것이 예상된다.However, as shown in FIG. 9, when the letter "WA" is handwritten, it may not be possible to obtain a sufficiently reliable recognition result only by the above-described character recognition process. That is, when the character pattern is displayed as shown in Fig. 9 (a), the standard characteristic patterns of the letters "わ" and "ね" are given as shown in Figs. 9 (b) and 9 (c). There is little difference in the distance between the feature points. In other words, in the above case, since there is only a wrong place near the end point of the third stroke, the difference between the approximated strokes according to the feature points becomes very small. For this reason, it is expected that the recognition evaluation result of the input characters for the two standard feature patterns will be about the same.

이와 같은 불합리성에 대처하기 위해, 본 장치에서는 스트록크의 국부적인 특징점을 추출해서 인식처리를 재평가하도록 되어 있다.In order to cope with such irrationalities, the apparatus extracts the local feature points of the stroke and reevaluates the recognition process.

제10도는 상기한 불합리성을 해결하기 위한 제어회로의 구성을 나타낸 것으로, 상술한 인식처리에 의해서 얻어진 복수의 문자인식후보에 대하여 충분한 식별을 할 수 없는 경우에 기동되게 된다.Fig. 10 shows the configuration of a control circuit for solving the above irrationality, which is activated when sufficient identification cannot be made for the plurality of character recognition candidates obtained by the above-described recognition process.

상기 표준특징패턴기억부(7)에는 이와 같은 국부적인 스트록크 특징을 식별해내기 위하여 상술한 국부적인 특징인가의 여부를 나타내는 정보에 의해 구별되도록 국부적 특징점의 좌표데이터가 등록되어 있다. 그리고 이 경우에는 상기한 식별정보가 표준특징패턴기억부(7)로부터 독출되어 연산제어회로(17)에 인가되는데, 이 정보에 의해 특징점의 수를 나타내는 피치와 그 특징점위치의 데이터가 레지스터(18)(19)에 셋트된다. 그리고 이 레지스터(18)에 셋트된 정보에 따라 상기 좌표계열기억회로(4)로부터 국부적인 부분 스트록크의 특징점의 좌표데이터가 새로운 특징점으로서 독출되어, 상기 레지스터(19)에 셋트된 데이터에 의한 어드레스의 제어하에 상기 특징점계열기억회로(5a)에 격납된다. 이 경우에도 상술한 특징점의 선택적인 추출처리에서와 같이 국부적인 부분스트록크가, 예컨대 먼저 구해진 특징점 중의 인접특징점의 사이로서 지정되고, 또 그 부분스트록크를 균등분할해서 소정수의 국부특징점을 추출하는 것처럼 제어된다. 그리고 이러한 국부특징점에 대해서 상술한 방식으로 특징점의 피치를 지정하여 필요한 수효의 특징점을 선택함으로써 표준특징패턴의 특징점과의 거리를 계산하게 된다.The standard feature pattern storage section 7 registers the coordinate data of the local feature points so as to be distinguished by the information indicating whether or not the above-described local feature is identified in order to identify such a local stroke feature. In this case, the above-mentioned identification information is read out from the standard feature pattern storage unit 7 and applied to the operation control circuit 17. The information indicating the number of feature points and the data of the feature point positions are stored in the register 18. Set to (19). According to the information set in this register 18, the coordinate data of the feature point of the local partial stroke is read out as a new feature point from the coordinate sequence memory circuit 4, and the address by the data set in the register 19 is read. Is stored in the feature point sequence storage circuit 5a under the control of?. Also in this case, as in the selective extraction process of the feature point described above, the local partial stroke is designated as, for example, between adjacent feature points among the feature points obtained first, and the partial stroke is equally divided to extract a predetermined number of local feature points. It is controlled as if The distance from the feature points of the standard feature pattern is calculated by selecting the required number of feature points by designating the pitch of the feature points in the above-described manner.

즉, 상술한 바 있는 전체적인 특징점추출에 의한 인식처리와 마찬가지로 국부적인 스트록크부분의 국부특징점에 따른 인식처리가 실행되게 된다.In other words, the recognition processing according to the local feature points of the local stroke portion is performed similarly to the recognition processing by the whole feature point extraction described above.

상기한 처리방식에 의해, 예컨대 제11도(a)에 나타낸 바와 같이 입력문자의 각 스트록크를 표현하는 특징점이 각각 추출되고, 제11도(b)와 제11도(b)에 나타낸 것과 같이 국부특징점의 좌표데이터를 갖춘 2개의 유사문자패턴의 표준특징패턴이 각 특징점과 각각 대조되게 된다. 따라서 이와 같은 국부특징점을 포함하여 인식처리된 결과는 상기 2개의 표준패턴에 대해서 크게 다르게 되어 그 식별을 행할 수 있게 된다. 이와 같은 예는 한자의 경우에도 마찬가지인 바, 예컨대 제12도에 나타낸 바와 같이 문자 「閔」과 「問」간의 식별이 충분히 정도높게 실시되지 않는 경우에는 「門」의 내부에 기재되는 스트록에 대해서 국부특징점을 추출하여 인식처리를 행하면 된다.By the above processing method, for example, as shown in Fig. 11A, feature points representing respective strokes of the input characters are extracted, and as shown in Figs. 11B and 11B. The standard feature patterns of the two similar character patterns with the coordinate data of the local feature points are contrasted with each feature point. Therefore, the result of the recognition processing including the local feature points is greatly different with respect to the two standard patterns so that the identification can be performed. The same is true of the Chinese characters. For example, as shown in FIG. 12, when the identification between the characters "閔" and "問" is not sufficiently high, localization is performed on the stroke described in "門". The recognition may be performed by extracting the feature points.

이와 같이 국부특징점의 정보를 적절하게 이용함으로써 문자인식의 정확도를 비약적으로 향상시킬 수 있게 된다. 더우기 이러한 국부특징점의 추출은 상술한 바와 같이 스트록크 전체에 대한 특징점의 추출과 마찬가지로 실행할 수 있게 되므로 장치의 구성이 복잡화되지 않고 그 인식처리 알고리즘을 정연하게 할 수 있게 된다. 따라서 장치 구성이 용이함은 물론 표준특징패턴의 제작을 용이하게 하는 등 실용상 큰 효과가 얻어지게 된다.Thus, by using the information of the local feature point appropriately it is possible to dramatically improve the accuracy of the character recognition. Furthermore, the extraction of the local feature points can be performed in the same manner as the extraction of the feature points for the entire stroke as described above, so that the structure of the device is not complicated and the recognition processing algorithm can be refined. Therefore, not only the device configuration is easy but also the production of a standard feature pattern can be easily achieved.

또한, 본 발명은 상기 실시예에 한정되지 않고, 예컨대 국부적인 특징점계열을 단일 스트록크만이 아닌 복수의 스트록크에 걸쳐서 특징점을 설정함으로써 작성하거나, 한개의 스트록크중 복수의 부분 스트록크에 대해서 국부특징점을 정해서 작성해도 된다. 또한 국부특징점에 의한 인식 처리를 각 표준 특징패턴마다 실행해도 되지만, 모든 표준 특징패턴에 대한 스트록크 전체의 인식처리를 종료하고 나서 유사성이 높은 것에 대해서만 국부특징점에 의한 인식 처리를 실행해도 된다. 또한 인접스트록크 사이를 결합해서 국부특징점의 추출을 행하는 것도 유용하고, 이밖에 본 발명의 요지를 벗어나지 않는 범위내에서 변형시켜 실시할 수 있다.In addition, the present invention is not limited to the above embodiments, and for example, a local feature point sequence is created by setting a feature point over a plurality of strokes instead of a single stroke, or for a plurality of partial strokes of one stroke. Local feature points may be selected and created. In addition, although the recognition process by a local feature point may be performed for every standard feature pattern, the recognition process by a local feature point may be performed only about the thing with high similarity after complete | finishing the whole stroke recognition with respect to all the standard feature patterns. It is also useful to extract local feature points by joining adjacent strokes, and may be modified and carried out without departing from the gist of the present invention.

[발명의 효과][Effects of the Invention]

이상 설명한 바와 같이 본 발명에 의하면, 문자전체의 스트록크에 관한 특징점으로부터 문자를 인식하는데 그치지 않고, 동일수단을 이용하여 스트록크의 국부적인 특징점추출을 효율있게 실시하여 그 평가결과를 문자인식에 이용함으로써 정확도가 매우 높은 인식결과를 얻을 수 있다. 따라서, 예컨대 감압타블렛장치를 거쳐서 필기체로 입력되는 문자를 높은 정확도로 실효시간내에 인식해서 퍼스널컴퓨터등에 입력시키는 것이 가능하게 되는 등 실용적인 효과를 많이 거둘 수 있다.As described above, according to the present invention, not only the character is recognized from the characteristic point of the stroke of the entire character, but also the local feature point extraction of the stroke is efficiently carried out using the same means, and the evaluation result is used for the character recognition. By doing so, highly accurate recognition results can be obtained. Therefore, for example, it is possible to realize a large number of practical effects, such that it is possible to recognize characters input by handwriting through a decompression tablet device with high accuracy within the effective time and input them to a personal computer.

Claims

Coordinate system memory circuit 4 for storing time-series coordinates of handwritten strokes of letters or symbols handwritten through the tablet device 1, and standards for storing information on standard feature patterns of categories to be recognized in advance. The feature pattern storage unit 7 determines the number of feature points representing each stroke of the written character or symbol based on the number information of the feature points representing each stroke determined for each stroke of the standard feature pattern. And a feature point extracting circuit 5a which obtains the position coordinates of each feature point from the time series coordinates of the stroke stored in the coordinate sequence memory circuit 4, and the handwritten input characters or symbols obtained by the feature point extracting circuit 5a. An identification circuit 5c for identifying the handwritten input characters or symbols by matching the feature point position coordinates of the respective strokes with the feature point position coordinates of the respective strokes of the standard feature pattern. Character recognition device characterized in that provided.

2. The feature point extraction circuit (5a) according to claim 1, wherein the feature point extraction circuit (5a) obtains the position coordinates for the feature points of the handwritten characters or symbols, and equally divides the handwritten strokes according to the number of feature points to obtain the position coordinates of the stroke positions. Character recognition device characterized in that.

The character recognition apparatus according to claim 1, wherein the number of feature points representing the stroke is determined for each stroke according to the characteristics of the stroke.

Coordinate system memory circuit 4 for storing time series coordinates of handwritten strokes relating to characters and symbols that are handwritten and input through the tablet device 1, and information on the standard feature patterns of categories to be recognized in advance and localized. A standard feature pattern storage unit (7) for storing identification information representing a feature, and each of the letters or symbols inputted by the handwriting based on the number information of the feature points representing each stroke determined for each stroke of the standard feature pattern A feature point extraction circuit 5a which determines the number of feature points representing the stroke and obtains the positional information of each feature point from the time series coordinates of the stroke stored in the coordinate sequence memory circuit 4, and this feature point extraction circuit ( A feature point sequence storage circuit 5b for storing feature point position coordinates relating to each stroke of the handwritten input character or symbol obtained by 5a), and this feature An identification circuit 5c for identifying the handwritten input characters or symbols by matching the feature point coordinates from the point sequence memory circuit 5b with the feature point position coordinates for the respective strokes of the standard feature pattern, and the identification circuit When the reliability of identification by (5c) is low, the arithmetic control circuit 17 in the feature point extraction circuit 5a generates feature point position information by locally extracting the feature points of the stroke according to the identification information. And the handwritten input character or symbol is identified again in the identification circuit 5c by comparing the locally extracted feature point position coordinates with the feature point position coordinates set locally in the standard feature pattern. Character recognition device.

The feature point extraction circuit (53) according to claim 4, wherein the feature point extraction circuit (53) obtains the position coordinates of the feature points with respect to the handwritten characters or symbols, and obtains the position coordinates of the stroke positions determined by equally dividing the writing strokes according to the number of feature points. Character recognition device characterized in that the.

The character recognition apparatus according to claim 4, wherein the number of feature points representing the stroke is determined for each stroke according to the characteristics of the stroke.

5. The method of claim 4, wherein the feature points of the strokes that are extracted locally are increased by increasing the number of extraction feature points for the strokes or by subdividing partial strokes between the feature points set by dividing the strokes according to the number of feature points. Character recognition device characterized in that.