JPS62229383A

JPS62229383A - On-line character recognizing device

Info

Publication number: JPS62229383A
Application number: JP61070712A
Authority: JP
Inventors: Shizuo Nagata; 永田　静男; Hideo Tanimoto; 谷本　英雄; Yoshimi Yamada; 義美山田; Yasuo Shimizu; 安雄清水
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1986-03-31
Filing date: 1986-03-31
Publication date: 1987-10-08
Also published as: JPH0531796B2

Abstract

PURPOSE:To decrease the influence of the variance of a character shape by providing a large group classifying part by the number of the strokes of a writing character and a middle group classifying part by the vector between part patterns, selecting a candidate character and executing the sequencing with a part pattern Q value. CONSTITUTION:A large group classifying part 5 execute the large group classification by the stroke of the writing character of the output of a stroke coding part 4. A middle group classifying part 6 calculates the center of gravity of respective part patterns from the segment information to link the section between characteristics points extracted by a characteristic point extracting part 3, obtains the vector between part patterns between the respective centers of gravity centers, executes the matching with the vector between the part patterns prepared beforehand from a registering pattern and selects a candidate character. A part pattern Q value matching part 7 executes the matching with the part pattern Q value to show the characteristic of a part pattern and determines the sequence of the candidate character. A part pattern stroke code part matching part 8 determines a final sequence by the part pattern stroke code distribution.

Description

【発明の詳細な説明】（産業上の利用分野）本発明は、実時間にて筆記文字を識別するオンライン文
字認識装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to an online character recognition device that identifies written characters in real time.

（従来の技術）オンライン文字認識装置における認識方式としては以下
の方法が一般的である。筆記入力されたストローク（ペ
ンオンからペンオフまでの筆記部分）のデータ列をＸ、
Ｙ座標あるいはｒ、θ極座標で表わし、これらのデータ
（以下入カバターンと称す）から特徴点を抽出する。こ
の抽出された特徴点の情報を、あらかじめ同じ方法で特
徴点を抽出し登録しておいたパターン（以下登録パター
ンと称す）の情報とマッチングし、文字認識を行なって
いた。(Prior Art) The following method is generally used as a recognition method in an online character recognition device. The data string of the input stroke (the written part from pen-on to pen-off) is
The feature points are expressed in Y coordinates or r and θ polar coordinates, and feature points are extracted from these data (hereinafter referred to as input cover patterns). Character recognition was performed by matching information on the extracted feature points with information on a pattern (hereinafter referred to as registered pattern) in which feature points were extracted and registered in advance using the same method.

以下、従来のオンライン文字認識方式について第１０図
〜第１２図を参照して説明する。Hereinafter, a conventional online character recognition method will be explained with reference to FIGS. 10 to 12.

先ず、第１０図に特徴点を各ストロークの始点、終点で
表わした一例を示す。筆記入力されたｊストロークデー
タ列ＩＤｊをｒＤｊ＝（（ｘｌ、ｙｓ）ｊ）、ｉ＝１．２．、、ｎ−
＝（１）ｎｊ”ｊストロークのデータ数とすると、始点（Ｘｓ　＋　ｙｓ）’は（ＸＩ　ｒ　ｙ
ｔ　）　ｊで、終点（Ｘｅ　＋　ｙｅ）ｊは（Ｘｎｊ　
ｌ　ｙｎｊ）Δで求められる。First, FIG. 10 shows an example in which feature points are expressed by the start and end points of each stroke. The j stroke data string IDj input by hand is rDj=((xl, ys)j), i=1.2. ,,n-
= (1) nj'' If the number of data for j strokes is, the starting point (Xs + ys)' is (XI r y
t ) j, the end point (Xe + ye)j is (Xnj
l ynj) Δ.

ここで、同様の方法で求め登録しておいた登録パターン
を（ｘｓ　、　ｙｓ）ＪＩと（Ｘｅ、ｙｅ）、ＪＩとす
る。単純なマッチング方式を採用した場合、文字のスト
ローク数がＮのとき、入カバターンと登録パターンの距
離ｄは、にしたがって算出される。そして、各登録パターンとの
マッチングが行なわれ、そのうち距離ｄが小のものが認
識結果として出力される。Here, the registered patterns found and registered in a similar manner are assumed to be (xs, ys) JI and (Xe, ye) JI. When a simple matching method is adopted, when the number of character strokes is N, the distance d between the input cover pattern and the registered pattern is calculated according to the following. Then, matching with each registered pattern is performed, and among them, one with a small distance d is output as a recognition result.

（発明が解決しようとする問題点）しかしながら、このようなマッチング方式を用いた従来
のオンライン文字認識方式では、以下に述べるような問
題点があった。(Problems to be Solved by the Invention) However, the conventional online character recognition method using such a matching method has the following problems.

この方式によれば、登録パターンと同様な形状で筆記し
たものは、はぼ１００％近くの認識率で認識される。と
ころが、一般には、筆記入力されたパターンが登録パタ
ーンと同様の形状でないケースが多々ある。第１１図（
ａ）に登録パターンに対して形状が変形している”河”
の入カバターン例を示し、第１１図（ｂ）にこれと対応
する登録パターンを示す。この場合、”シ”、”「が上
下にずれ、それぞれの大きさも大きく変動している。従
って、入カバターンと登録パターンとの距離ｄは大きく
なり、誤認識となる可能性が大となる。このように、従
来方式によれば、登録パターンに対して変形された形状
で筆記入力が行われた場合、誤認識率が大きくなるとい
う問題点があった。According to this method, a handwritten item with a shape similar to the registered pattern is recognized with a recognition rate of nearly 100%. However, in general, there are many cases where the handwritten input pattern does not have the same shape as the registered pattern. Figure 11 (
A) “River” whose shape is deformed with respect to the registered pattern
An example of the input cover pattern is shown in FIG. 11(b), and a registered pattern corresponding to this is shown in FIG. In this case, "shi" and "" are shifted up and down, and their respective sizes vary greatly. Therefore, the distance d between the input cover pattern and the registered pattern becomes large, increasing the possibility of misrecognition. As described above, according to the conventional method, there is a problem in that when a handwritten input is performed in a shape that is modified from the registered pattern, the misrecognition rate increases.

また、（２）式に基づくような単純な筆記ストローク順
のマッチングでは、入力文字の筆順が異なった場合、距
離ｄが大となる。そのため、実際は、マッチングするス
トロークの最適な対応づけを行なう処理が必要となる。Furthermore, in a simple matching of the order of written strokes based on equation (2), the distance d becomes large when the stroke order of the input characters is different. Therefore, in reality, processing is required to optimally associate matching strokes.

すなわち、筆記入力された各ストロークを、登録パター
ンの各ストロークのどのストロークとマッチングすれば
よいかを決定することが必要となる。そしてこの処理に
多大な時間が費やされることになる。That is, it is necessary to determine which stroke of the registered pattern should be matched with each handwritten stroke. This process consumes a lot of time.

例えば第１２図（ａ）に示したような”シ′°を筆記入
力したときの最適な対応づけを考えて見る。第１２図（
ｂ）はこのときの対応登録パターンを示す図、第１２図
（Ｃ）は最適ストローク対応づけ処理説明図である。For example, let's consider the optimal correspondence when "shi'°" as shown in Fig. 12(a) is input by hand. Fig. 12(a)
b) is a diagram showing the correspondence registration pattern at this time, and FIG. 12(C) is an explanatory diagram of the optimum stroke correspondence processing.

対応づけの一つの方法としては、まず、入カバターンの
第１ストローク（第１２図（ａ）中■）と登録バ斧ターンの第１ストローク（第１２図（ｂ）中■）との距
離ｄｌｌを（２）式により算出し、次に登録パターン■
との距離ｄ、２．登録パターン■との距離ｄ１３を算出
し、ｄＩＩ　　＋　ｄ１２　＋　ｄ１３の最小のストロ
ークを対応づける。One way to make the correspondence is to first find the distance dll between the first stroke of the incoming turn (■ in Figure 12(a)) and the first stroke of the registered bat-axe turn (■ in Figure 12(b)). is calculated using formula (2), and then the registered pattern ■
distance d, 2. The distance d13 from the registered pattern ■ is calculated, and the minimum stroke of dII + d12 + d13 is associated.

この例の場合ｄ１１が最小とな９■−吊１を最適なスト
ロークとして対応づける。同様に、入カバターン第２ス
トローク（第１２図（ａ）中■）と、すでに対応づけら
れた登録パターンのストロークを除く（この場合■）残
りのス）ｏ−りとの対応づけを行なう。そしてそれぞれ
の最適なストロークの対応づけを行なう。In this example, d11 is the minimum, and 9■-hang 1 is associated as the optimal stroke. Similarly, the second stroke of the input cover pattern (■ in FIG. 12(a)) is associated with the remaining strokes excluding the registered pattern strokes that have already been associated (■ in this case). Then, the respective optimal strokes are associated with each other.

この処理は、ストローク数が少ない場合には処理時間は
少ないが、ストローク数大の文字（画数大の文字）では
、その処理数Ｍはストローク数をＮとすると、Ｍ＝　Σ　ｉ　　　　　　　　・・・（３）ｉ−＋の比率で増大するため、膨大な処理量となる。この処理
量の増大をカバーするために高速なＡＬＵ等を使用する
必要があるため、従来方式によるオンライン文字認識装
置では小型化ができず、またコストアップとなるという
欠点があった。This processing takes less time when the number of strokes is small, but for characters with a large number of strokes (characters with a large number of strokes), the number of processing M is as follows, where the number of strokes is N, M= Σ i ... ( 3) Since it increases at the ratio of i-+, the amount of processing becomes enormous. In order to compensate for this increase in processing amount, it is necessary to use a high-speed ALU or the like, so conventional online character recognition devices cannot be miniaturized and have the disadvantage of increasing costs.

このように、従来の方式は筆記入力される入カバターン
の形状の変化に弱く、誤認識となりやすく、また筆順を
緩和するために最適なストローク対応づけを行なう処理
が必要で処理量が多大であるという欠点があった。As described above, the conventional method is sensitive to changes in the shape of the input cover pattern input by handwriting, and is prone to misrecognition.Additionally, it requires processing to perform optimal stroke correspondence in order to relax the stroke order, which requires a large amount of processing. There was a drawback.

本発明は、以上述べた従来技術の問題点を解決し、この
筆記入力される入カバターンの変形に強く、しかも処理
量が少なくてすむ非常に優れたオンライン文字認識装置
を提供することを目的とする。SUMMARY OF THE INVENTION An object of the present invention is to solve the problems of the prior art described above, and to provide an excellent online character recognition device that is resistant to deformation of input cover patterns input by hand, and that requires less processing. do.

（問題点を解決するための手段）本発明は、タブレットに筆記入力して得られた座標デー
タ列の不要データを除去し、直線化処理を施す前処理部
と、前処理部により直線化された座標データ列から、筆
記文字を構成するストロークの特徴を表わす特徴点を抽
出する特徴点抽出部と、特徴点抽出部で抽出された特徴
点の位置関係により各ストロークをコード化するストロ
ークコード化部とを設け、特徴点抽出部またはストロー
クコード化部の出力データを、あらかじめ登録されてい
る登録パターンデータと比較して文字認識を行なうオン
ライン文字認識装置を対象とし、前記従来技術の問題点
を解決するため、ストロークコード化部の出力を受取り
、筆記文字のストローク数によ勺大分類を行なう大分類
部と；大分類部で選択された候補文字に対し、筆記上一
連のものとして筆記する部分を部分パターンとし、特徴
点抽出部で抽出した特徴点間を結ぶセグメントの情報よ
り各部分パターンの重心を算出し、算出した重心データ
に基づき、１の部分パターンの重心と別の部分パターン
の重心をそれぞれ始点及び終点とする部分パターン間ベ
クトルを求め、この部分パターン間ベクトルと、あらか
じめ同様にして登録パターンより作成しておいた部分パ
ターン間ベクトルとのマッチングを行ない、マッチング
結果にしたがって中分類を行なう中分類部と；中分類部
により絞られた候補文子に対し、部分パターンの各スト
ローク各セグメント情報より抽出した部分パターンの特
徴を表わす部分パターンＱ値を求め、この部分パターン
Ｑ値と、あらかじめ同様にして登録パターンより作成し
ておいた部分パターンＱ値とのマッチングを行ない、こ
のマッチング結果と、中分類部によるマッチング結果に
したがって候補文字の順位づけを行なう部分パターンＱ
値マッチング部と；部分パターンＱ値マッチング部で順
位づけされた上位候補文字に対し、部分パターン毎に、
ストロークコード化部で抽出したストロークコードの分
布を算出し、このストロークコード分布と、あらかじめ
同様にして登録パターンより作成しておいた部分パター
ンストロークコード分布とのマッチングを行ない、この
マッチング結果にしたがって候補文字の最終的な順位づ
けを行なう部分パターンストロークコード分布マッチン
グ部とを設けたものである。(Means for Solving the Problems) The present invention includes a preprocessing unit that removes unnecessary data from a coordinate data string obtained by handwriting input on a tablet, and performs linearization processing; A feature point extraction unit that extracts feature points representing characteristics of strokes that make up a written character from a coordinate data string, and a stroke coding unit that codes each stroke based on the positional relationship of the feature points extracted by the feature point extraction unit. The present invention aims at an online character recognition device that performs character recognition by comparing the output data of the feature point extraction section or the stroke coding section with registered pattern data registered in advance, and solves the problems of the conventional technology. In order to solve the problem, there is a large classification section that receives the output of the stroke encoding section and roughly categorizes the written characters according to the number of strokes; The part is treated as a partial pattern, and the center of gravity of each partial pattern is calculated from the information on the segments connecting the feature points extracted by the feature point extraction section. Based on the calculated center of gravity data, the center of gravity of one partial pattern and that of another partial pattern are calculated. Find vectors between sub-patterns with the center of gravity as the starting point and end point, respectively, match these inter-sub-pattern vectors with vectors between sub-patterns previously created from registered patterns in the same way, and perform medium classification according to the matching results. For the candidate sentences narrowed down by the intermediate classification unit, a partial pattern Q value representing the characteristics of the partial pattern extracted from each stroke segment information of the partial pattern is determined, and this partial pattern Q value and The partial pattern Q is matched with the partial pattern Q value created in advance from the registered pattern in the same way, and the candidate characters are ranked according to this matching result and the matching result by the intermediate classification section.
For each partial pattern, for the top candidate characters ranked by the value matching section and the partial pattern Q value matching section,
The stroke code distribution extracted by the stroke coding unit is calculated, and this stroke code distribution is matched with a partial pattern stroke code distribution previously created from registered patterns in the same way, and candidates are generated according to this matching result. The system also includes a partial pattern stroke code distribution matching section that performs final ranking of characters.

（作用）本発明では以上のようにオンライン文字認識装置を構成
したので各技術手段は以下のとおり作用する。(Function) In the present invention, since the online character recognition device is configured as described above, each technical means functions as follows.

前処理部はタブレットに筆記入力された文字の座標デー
タ列に対し、ノイズ除去処理、移動平均処理、平滑化処
理等を施すことにょシ、データを直線化する。特徴点抽
出部はたとえば直線化されたデータ列のデータ間のＸｔ
　Ｙ方向のサインを求めサインの状態の変化点を特徴点
とする等の方法により特徴点を抽出する。ストロークコ
ード化部は抽出された特徴点情報に基づき、たとえば各
セグメントのサイン、セグメントの角度、セグメント間
の回転角度によりストロークを分類し、コード化する。The preprocessing unit performs noise removal processing, moving average processing, smoothing processing, etc. on the character coordinate data string handwritten on the tablet, and linearizes the data. For example, the feature point extraction unit may be configured to
Feature points are extracted by a method such as finding a sign in the Y direction and using a change point in the state of the sign as a feature point. The stroke encoding unit classifies strokes based on the extracted feature point information, for example, by the sign of each segment, the angle of the segments, and the rotation angle between the segments, and encodes the strokes.

このコード化されたストロークデータは以下の本発明の
特徴である各手段の分類、順位づけのために供される。This coded stroke data is used for the following classification and ranking of each means, which is a feature of the present invention.

大分類部は筆記された文字のストローク数すなわち画数
により大分類を行なう。このため、あらかじめ画数毎に
その画数とな９うる文字をたとえば文字辞書に格納して
おく。The major classification section performs major classification based on the number of strokes of written characters. For this reason, for each number of strokes, characters corresponding to the number of strokes are stored in advance in a character dictionary, for example.

中分類部はストローク数により大分類された候補文字に
対し、部分パターン間ベクトルのマッチングに１よる分
類を施し、更に候補文字の絞り込みを行なう。具体的に
は候補文字につき部分パターン間ベクトルをそれぞれ求
め、これをたとえば文字辞書のパターン間ベクトルと比
較してその差を両ベクトル間の距離として求める。この
距離を所定の閾値と比較し、該閾値より大きければ候補
文字からはずし、該閾値以下であれば候補文字のままと
する。The intermediate classification section classifies candidate characters roughly classified according to the number of strokes by 1 based on the matching of vectors between partial patterns, and further narrows down the candidate characters. Specifically, a vector between partial patterns is obtained for each candidate character, and this is compared with, for example, an inter-pattern vector in a character dictionary, and the difference is obtained as the distance between both vectors. This distance is compared with a predetermined threshold value, and if it is larger than the threshold value, it is removed from the candidate characters, and if it is less than the threshold value, it is left as a candidate character.

部分パターンＱ値マッチング部は中分類された候補文字
について部分パターンＱ値のマッチングによる順位づけ
を行なう。このため登録パターンからあらかじめ求めた
部分パターンＱ値をたとえば部分パターン辞書に格納し
ておく。なおこの順位づけには中分類部のマッチングデ
ータも利用される。部分パターンＱ値は各セグメントの
長さ、方向、位置等を表わしておシ、これらの情報を有
効に利用することにより効果的な順位づけが行なわれる
。The partial pattern Q value matching unit ranks the medium-classified candidate characters by matching the partial pattern Q values. For this reason, the partial pattern Q value obtained in advance from the registered pattern is stored in, for example, a partial pattern dictionary. Note that matching data from the intermediate classification section is also used for this ranking. The partial pattern Q value represents the length, direction, position, etc. of each segment, and by effectively using this information, effective ranking can be performed.

部分パターンストロークコード分布マッチング部は、こ
のように順位づけされた上位候補文字に対し、部分パタ
ーンのストロークコード分布（７）マッチングにより最
終的な順位づけを行なう。The partial pattern stroke code distribution matching section performs final ranking on the thus ranked top candidate characters by matching the partial pattern stroke code distribution (7).

したがって筆記入力される文字形状のバラツキによる変
動に対処でき、筆順の違いによるストロークの最適な対
応づけ処理を不要とし、処理量が軽減され、前記従来技
術の問題点が解決される。Therefore, it is possible to deal with fluctuations due to variations in the shapes of characters input by hand, eliminate the need for optimum correspondence processing of strokes due to differences in stroke order, reduce the amount of processing, and solve the problems of the prior art.

（実施例）以下、本発明の一実施例のオンライン文字認識装置につ
いて詳細に説明する。(Embodiment) Hereinafter, an online character recognition device according to an embodiment of the present invention will be described in detail.

第１図は本実施例のオンライン文字認識装置の構成を示
すブロック図である。同図に示すように、このオンライ
ン文字認識装置は、タブレット１、前処理部２、特徴点
抽出部３、ストロークコード化部４、大分類部５、中分
類部６、部分パターンＱ値マッチング部７及び部分パタ
ーンストロークコード分布マッチング部８から構成され
る。なお図中９は図示しない表示器等への出力端子を示
す。FIG. 1 is a block diagram showing the configuration of the online character recognition device of this embodiment. As shown in the figure, this online character recognition device includes a tablet 1, a preprocessing section 2, a feature point extraction section 3, a stroke encoding section 4, a major classification section 5, a medium classification section 6, and a partial pattern Q value matching section. 7 and a partial pattern stroke code distribution matching section 8. Note that 9 in the figure indicates an output terminal to a display device (not shown) or the like.

第２図は第１図の装置の動作の概略を示すフローチャー
トで、前処理部２はステップ１０１を実行し、特徴点抽
出部３はステップ１０２を実行し、ストロークコード化
部４はステップ１０３を実行し、大分類部５はステップ
１０４を実行し、中分類部６はステップ１０５ないし１
０８を実行し、部分パターンＱ値マッチング部７はステ
ップ１０９ないし１１２を実行し、部分パターンストロ
ークコード分布マッチング部８はステップ１１３ないし
１１６を実行する。FIG. 2 is a flowchart showing an outline of the operation of the apparatus shown in FIG. The major classification section 5 executes step 104, and the intermediate classification section 6 executes steps 105 to 1.
08, the partial pattern Q value matching unit 7 executes steps 109 to 112, and the partial pattern stroke code distribution matching unit 8 executes steps 113 to 116.

第３図は本実施例の装置が備えている文字辞書の構成を
示す図で、この辞書は画数（ストローク数）により文字
の選択ができるようになっている。FIG. 3 is a diagram showing the structure of a character dictionary included in the apparatus of this embodiment, and this dictionary allows characters to be selected based on the number of strokes.

また、第４図は本実施例の装置が備えている部分パター
ン辞書の構成を示す図である。Further, FIG. 4 is a diagram showing the configuration of a partial pattern dictionary included in the apparatus of this embodiment.

以下、本実施例の装置の動作を各部ごとに順を追って詳
しく説明する。Hereinafter, the operation of the apparatus of this embodiment will be explained in detail step by step for each part.

先ず、タブレットｌは文字を筆記入力するためのもので
、文字が筆記入力されると第５図（ａ）のように筆記デ
ータ列（（ｘｉ＋　ｙｉ　）、ｉ＝１．２−ｎｊ）ｊ　
が抽出され、前処理部２に送られる。First, the tablet l is used to input characters by hand, and when characters are input by hand, a string of writing data ((xi+yi), i=1.2-nj)j is generated as shown in FIG. 5(a).
is extracted and sent to the preprocessing section 2.

前処理部２は送られてきた筆記データ列に対し、ノイズ
除去処理、移動平均処理、平滑化処理を行うことにより
、第５図（ｂ）のようにデータを直線化し、特徴点抽出
部３に出力する。The preprocessing unit 2 linearizes the data as shown in FIG. 5(b) by performing noise removal processing, moving average processing, and smoothing processing on the received writing data string, and then the feature point extraction unit 3 Output to.

次に特徴点抽出部３の動作について述べる。特徴点抽出
部３の行なう特徴点抽出処理としてはいくつかの方法が
あるが、ここでは直線化されたデータ列（（Ｘｉ　＋　
ｙｉ　）ｉ＝１．２．−ｎｊｌｊのデータ間のＸ。Next, the operation of the feature point extraction section 3 will be described. There are several methods for feature point extraction processing performed by the feature point extraction unit 3, but here we will use a linearized data string ((Xi +
yi)i=1.2. -X between data of njlj.

Ｘ方向のサイン（正、負、０の符号）を算出し、サイン
の状態の変化点を特徴点として抽出する方法について述
べる。A method of calculating the sign (positive, negative, and 0 signs) in the X direction and extracting points of change in the sign state as feature points will be described.

データ間のＸ、Ｘ方向のサイン　ＸＳ・、ＹＳｉをで求
め、＋、０．−で表現する。このようにして求めた各デ
ータ間のＸ方向、Ｘ方向のサインを前データ間のサイン
と比較し、同じであれば特徴点として登録せず、異なっ
た場合には状態が変わったとして特徴点として登録する
。第５図（Ｃ）にこのようにして求めた点の他に始点、
終点を加えた特徴点を示す。一般にはこの処理を直線近
似化と称す場合もある。この特徴点間を以下セグメント
と称し、特徴点を（（Ｘ；　、　Ｙ；　）、ｊ−”１　
、、、、ｔｊ）ｊで表わすことにする。Find the X, X direction sine between the data, XS・, YSi, +, 0. Expressed with -. Compare the signs in the X direction and X direction between each piece of data obtained in this way with the signs between the previous data, and if they are the same, they are not registered as feature points, and if they are different, the state is considered to have changed and the feature point is Register as. In addition to the points obtained in this way, Fig. 5 (C) shows the starting point,
Shows the feature points plus the end point. Generally, this process is sometimes called linear approximation. The space between these feature points is hereinafter referred to as a segment, and the feature points are ((X; , Y; ), j−”1
, , , tj)j.

以上のようにして得られた特徴点情報はストロークコー
ド化部４及び中分類部６に出力される。The feature point information obtained as described above is output to the stroke coding section 4 and the medium classification section 6.

次にストロークコード化部４の動作について述べる。ス
トロークコード化部４は特徴点抽出部３により得られた
特徴点情報に基づき各ストロークをコード化する。この
コード化には数多くの方法があるが、一般には各セグメ
ントのＸ、Ｙサイン、セグメントの角度、セグメント間
の回転角度により分類し、コード化を行なう。第６図は
このコード化処理の説明図で、θ２．θ２．θ３はセグ
メントの角度（＋ｘＸ方向なす角度）を示し、ｉｚ、　
、　ａ２は隣９あうセグメント間の回転角度を示す。コ
ード化されたストロークデータは大分類部５及び部分パ
ターンストロークコード分布マッチング部８に出力され
る。Next, the operation of the stroke coding section 4 will be described. The stroke encoding section 4 encodes each stroke based on the feature point information obtained by the feature point extraction section 3. There are many methods for this encoding, but in general, the encoding is performed by classifying the X and Y signs of each segment, the angle of the segment, and the rotation angle between the segments. FIG. 6 is an explanatory diagram of this encoding process, in which θ2. θ2. θ3 indicates the angle of the segment (the angle formed in the +xX direction), iz,
, a2 indicates the rotation angle between nine adjacent segments. The coded stroke data is output to the major classification section 5 and the partial pattern stroke code distribution matching section 8.

次に大分類部５の動作について述べる。大分類部５はス
トロークコード化部４の出力を受け、ストｏ−り数によ
り対象文字に対する大分類を行なう。そのため、あらか
じめ画数（ストローク数）毎にその画数とな９うる文字
を第３図のように文字辞書に用意しておく。たとえば筆
記入力された文字パターンのストローク数が１０画であ
ったとする。この場合、文字辞書に格納されている文字
のうち第３図に示すような１０画となり得る文字“口亜
”、“挨″、“逢″・・・を候補文字として選択する。Next, the operation of the major classification section 5 will be described. The major classification section 5 receives the output from the stroke encoding section 4 and performs major classification of the target characters based on the number of strokes. Therefore, for each number of strokes, nine characters corresponding to the number of strokes are prepared in advance in a character dictionary as shown in FIG. For example, assume that the number of strokes of a character pattern input by hand is 10 strokes. In this case, among the characters stored in the character dictionary, characters "kuchia", "koi", "ai", etc., which can have 10 strokes as shown in FIG. 3, are selected as candidate characters.

次に中分類部６の動作について述べる。中分類部６は、
大分類部５にて画数により大分類して得た候補文字を以
下に説明する部分パターン間ベクトルにより更に中分類
する。ここで部分パターンとは１つの文字のうち筆記上
一連のものとして筆記する部分をいうものとし、部分パ
ターン間ベクトルとは１の部分パターンの重心と別の部
分パターンの重心をそれぞれ始点、終点とするベクトル
をいうものとする。Next, the operation of the intermediate classification section 6 will be described. The middle classification section 6 is
The candidate characters obtained by major classification according to the number of strokes in the major classification section 5 are further divided into medium classifications using vectors between partial patterns, which will be described below. Here, a partial pattern refers to a part of one character that is written as a series of characters, and a vector between partial patterns refers to the center of gravity of one partial pattern and the center of gravity of another partial pattern as the starting point and ending point, respectively. is the vector.

先ず、部分パターン間ベクトルの算出法の一例を述べる
。部分パターン中の各セグメントのＸ。First, an example of a method for calculating vectors between partial patterns will be described. X for each segment in the partial pattern.

ｙ成分を（ｄｘｉ　、　ｄｙｉ　）とすると各セグメン
トの長さｄｌｌは、ｄｌｉ＝　（’；；ア７Ｉ丁　　　・・・（５）で表わ
される。また文字幅ＨＸ、ＨＹで除することにより正規
化した各セグメントの中心座標を（Ｘｉ　＋ｙｉ）とす
ると、部分パターンの重心座標（ＸＷ　　。When the y component is (dxi, dyi), the length dll of each segment is expressed as dli = ('; Let the center coordinates of each segment be (Xi + yi), then the barycenter coordinates of the partial pattern (XW).

Ｙｗ）はで求められる。以上の方法で各部分パターンの重心を求
め、１の部分パターンの重心と別の部分パターンの重心
をそれぞれ始点、終点として部分パターンベクトルを求
める。なお、ここでは部分バターン間ベクトルはＸ方向
とＸ方向についてそれぞれ分けて考えるものとする。第
７図に文字“ａｌｌ”の部分パターン“口”、“ｔ″の
各重心及び部分パターン間ベクトルＸ（（ＩＩ　ｌ止）
、ｙ（ａ、亜）の例を示す。Yw) is calculated by Using the above method, the center of gravity of each partial pattern is determined, and a partial pattern vector is determined using the center of gravity of one partial pattern and the center of gravity of another partial pattern as the starting point and end point, respectively. Note that here, the inter-partial pattern vectors are considered separately for the X direction and the X direction. Figure 7 shows the partial pattern “mouth” of the character “all”, each center of gravity of “t”, and the vector between partial patterns X ((II l stop)
, y(a, sub).

中分類部６では、以上説明した部分パターン間ベクトル
により、大分類部５で選択された候補文字を絞シこみ中
分類を行なうわけであるが、ここで−例として”・逢”
が筆記入力された場合を考え、以下この入力文字に対す
る中分類の手順を説明する。The medium classification section 6 performs medium classification by narrowing down the candidate characters selected by the main classification section 5 using the inter-partial pattern vectors explained above.
Let us consider the case where the input character is input by hand, and the procedure for medium classification for this input character will be explained below.

筆記入力された文字”逢”は１０画であるので、先ず第
３図に示すごとき文字辞書の１０画部分を参照する。す
るとここには文字”ａｔ”が第１番目に配されておムそ
の欄には“ａｔ”を構成する部分パターン、部分パター
ンの筆記類と各部分パターンのストローク数情報（以下
カット位置と称す）、及び登録パターンよりあらかじめ
算出した各部分パターン間ベクトル値が示されている。Since the handwritten character "Ai" has 10 strokes, first, the 10 stroke part of the character dictionary shown in FIG. 3 is referred to. Then, the character "at" is placed first here, and in that column there are the partial patterns that make up "at", the handwriting of the partial patterns, and the information on the number of strokes for each partial pattern (hereinafter referred to as cut position). ), and vector values between each partial pattern calculated in advance from the registered pattern are shown.

以下順に゛挨′”、”逢″・・・の文字について同様の
情報が並んでおり、中分類部６はこの文字頑にしたがい
候補とすべきか否かをそれぞれ判定し以下のように中分
類を行なう。In the following order, similar information is arranged for the characters ``゛welt'', ``逢'', etc., and the intermediate classification unit 6 determines whether or not each character should be a candidate according to the character, and divides it into intermediate classifications as follows. Do the following.

先ず、筆記入力した文字が０＆＝＝であるとして部分パ
ターン間ベクトルのマッチング距１ｉｔｌｔｄｖｅｃを
求める。第３図の文字辞書にかかれているように、”ｏ
ｆ　”はカット位置が（３，７）すなわち第１ストロー
ク〜第３ストロークで”Ｏ”が形成され、第４ストロー
ク〜第１０ストロークで”晩パが形成されるが、本例で
は入カバターンが”逢″であるのでこのカット位置で１
逢”について部分パターン間ベクトルを考えると第８図
のようになる。第８図のように求められた入カバターン
の部分スペクト間ペクト〜ｘ（ｏ、臣）、ｉ（ｏ、ｔ）
と辞書にかかれている部分パターン間ベクトルＸ（Ｏ、
亜＞。First, assuming that the characters input by hand are 0&==, the matching distance 1itltdvec of vectors between partial patterns is determined. As written in the character dictionary in Figure 3, “o
f'', the cut position is (3,7), that is, the 1st to 3rd strokes form an ``O'', and the 4th to 10th strokes form a ``late hole'', but in this example, the input cover turn is Since it is “Ai”, 1 at this cut position.
If we consider the inter-subpattern vector for ``Ai'', it becomes as shown in Fig. 8.The inter-partial vector of the input pattern obtained as shown in Fig. 8 ~ x (o, Omi), i (o, t)
and the vector between partial patterns written in the dictionary X(O,
A>.

７（Ｃ１，Ｉ）との差がマッチング距離ａｖｅｃであり
、ｄｖｅｃ　＝　ｌ　ｘ（σ；、味）−マ（ロ、仕）１
＋１ブ（σ１ｇ−）−ブ（ロ、亜）１　・・・（力で算
出される。The difference from 7(C1, I) is the matching distance avec, and dvec = l x (σ;, taste) - ma (b, taste) 1
+1 bu (σ1g-) - bu (ro, sub) 1 ... (calculated by force).

一般に、筆記した文字の部分パターン数が複数の場合、
部分パターン数ＢＰＮで正規化を行ないマッチング距離
ａｖｅｃは、にしたがって算出される。Generally, when the number of partial patterns of a written character is multiple,
The matching distance avec is calculated according to the following by normalizing with the number of partial patterns BPN.

ここである閾値ＶＥＣＲＥＪを設定し、算出したｄｖｅ
ｃがＶＥＣＩＪより大きいか否かを判定する。Here, a certain threshold VECREJ is set, and the calculated dve
It is determined whether c is larger than VECIJ.

そしてｄｖｅｃ＞ｖＥＣＲＥＪのときは、参照した文字
（この場合”ａｔ”）ではないとして次の文字の部分パ
ターン間ベクトルのマッチングを行なう。When dvec>vECREJ, it is assumed that the character is not the referenced character (in this case, "at"), and matching of vectors between partial patterns of the next character is performed.

ａｖｅｃ≦ＶＥＣＲＥＪのときは、“０臣”らしいとし
て次に説明する部分パターンＱ値の算出及びマッチング
を行なう。When avec≦VECREJ, it is assumed that the partial pattern is likely to be “0 minister”, and calculation and matching of the partial pattern Q value, which will be explained next, is performed.

次に部分パターンＱ値マッチング部７の動作について述
べる。部分パターンＱ値マッチング部７は中分類部６に
おける部分パターン間ベクトルによる中分類で残った候
補文字について部分パターンＱ値を算出し、第４図に示
す部分パターン辞書中の部分パターンＱ値とマッチング
を行なう。この部分パターン辞書の部分パターンＱ値は
、登録パターンよりあらかじめ作成され格納されている
ものである。ここで部分パターンＱ値とは、各セグメン
トの長さ、方向及びその位置を表わす特徴パラメータを
いう。オンライン文字認識では筆記するペンの動きとし
てＸ、Ｙ方向、＋またば−の方向が重要な情報として得
られ、この情報を有効につかったのがこの部分パターン
Ｑ値である。Next, the operation of the partial pattern Q value matching section 7 will be described. The partial pattern Q value matching unit 7 calculates the partial pattern Q value for the candidate characters remaining after the intermediate classification using the inter-partial pattern vectors in the intermediate classification unit 6, and matches it with the partial pattern Q value in the partial pattern dictionary shown in FIG. Do the following. The partial pattern Q values of this partial pattern dictionary are created and stored in advance from registered patterns. Here, the partial pattern Q value refers to a characteristic parameter representing the length, direction, and position of each segment. In online character recognition, the X, Y directions, + and - directions are obtained as important information regarding the movement of the writing pen, and this partial pattern Q value is an effective use of this information.

先ず、部分パターンＱ値の算出法を説明する。First, a method for calculating the partial pattern Q value will be explained.

なお下式においてΣは、全ストローク、全セグメントに
関する加算を示す。またＨＸ、ＨＹは文字幅を示す。Note that in the formula below, Σ indicates addition regarding all strokes and all segments. Further, HX and HY indicate character width.

■　＋Ｘ方向成分のＸ方向位置 ■　−Ｘ方向成分のＸ方向位置 ■　＋ｙＸ方向成分ｙ方向位置 ■　−ｙ方向成分のＸ方向位置 ■　＋Ｘ方向成分のＸ方向位置 ■　−Ｘ方向成分のＸ方向位置 ■　＋ｙ方向成分のＸ方向位置 ■　−ｙ方向成分のＸ方向位置 ■〜■の場合は原点を左下に設定したときの各方向位置
の値であるが、このとき原点近くにあるものは乗算に供
するとＯとなってしまうため、０となるのを防ぐため原
点を入れかえ原点を右上に設定したときの各方向位置の
値Ｑ９〜Ｑ１８についても同様に記述し、Ｑ１〜Ｑ工の
合計１６個の値により対象文字の各ストロークのセグメ
ントの長さ、方向及び位置全表わすものとする。■ X-direction position of +X-direction component■ -X-direction position of X-direction component■ +yX-direction component y-direction position■ -X-direction position of y-direction component■ +X-direction position of X-direction component■ -X-direction position of X-direction component ■ +X-direction position of the y-direction component■ -X-direction position of the y-direction component■~■ In the case of the origin, it is the value of each direction position when the origin is set to the lower left, but in this case, the value near the origin is multiplied. When the origin is set to the upper right, the values Q9 to Q18 in each direction are written in the same way, and a total of 16 values are obtained from Q1 to Q. The length, direction, and position of each stroke segment of the target character are fully expressed by the value of .

部分パターンＱ値マッチング部７では、部分パターン間
ベクトルによる分類により残ったものに対し、以上説明
した部分パターンＱ値金算出するのであるが、例えば“
逢”を車記入力して“挨°′が部分パターン間ベクトル
による分類により残ったとする。この場合、“挨”のカ
ット位ｔｌｌ［第３図に示すように（３，２，５）でめ
り、部分パターンは１＋ｈ＋天であるので、入カバター
ンｔカット位置（３，２，５）でカットし、各々Ｑ１〜
斧ＱＣｓを算出する。このとき、本例の場合第９図に示す
ように“人”が”ｒに、“二“が“ハ”に、“辻′。The partial pattern Q value matching unit 7 calculates the partial pattern Q value described above for those remaining after classification based on vectors between partial patterns.
Suppose that ``逢'' is input into the car, and ``farewell'' remains after classification based on vectors between partial patterns. In this case, the cut position of "dust" is tll [as shown in Figure 3, it is cut at (3, 2, 5), and the partial pattern is 1+h+height, so the incoming cover turn t cut position is (3, 2, 5). Cut with Q1~
Calculate ax QCs. At this time, in this example, as shown in FIG. 9, "person" becomes "r", "two" becomes "ha", and "tsuji".

が“粱”にそれぞれ対応している。各々算出した部分パ
ターンＱ値岨〜Ｑ＋ａと第４図の部分パターン辞書にあ
る部分パターンＱ値とのマッチングを行なう。すなわち
先ず、“人″で算出したＱ　Ｉ−Ｑ　＋ａと部分パター
ン辞書１４″のＱ１〜Ｑｌａをマッチングさせ、次に“
二”で算出したＱ１〜Ｑ＋ｅと部分パターン辞書”ハ”
のＱ１〜Ｑｔａをマッチングさせ、さらに“辻″で算出
したＱ、〜Ｑ、ａと部分パターン辞書“休″のＱ１〜Ｑ
１６をマッチングさせる。これらのマッチングにおける
差を合計したものをマッチング距離ｄＢＰとする。この
とき距離ｄＢＰは入カバターン“逢”が１挨”にどれだ
け近いかを表わす。correspond to "粱" respectively. The calculated partial pattern Q values ˜Q+a are matched with the partial pattern Q values in the partial pattern dictionary shown in FIG. 4. That is, first, Q I-Q +a calculated for "person" is matched with Q1 to Qla of the partial pattern dictionary 14", and then "
Q1~Q+e calculated in “2” and partial pattern dictionary “Ha”
Matching Q1~Qta of ``Tsuji'', Q1~Q of partial pattern dictionary ``Kyu'' with Q, ~Q, a calculated by "Tsuji"
Match 16. The sum of the differences in these matchings is defined as the matching distance dBP. At this time, the distance dBP represents how close the input cover turn "Ai" is to "1Ki".

一般には、各部分パターンのストａ−り数ＢｓｊによＩ
）（１７）式のように重みづけを行ない、それをマッチ
ング距離ｄＢＰとする。Generally, I is determined by the number of stores Bsj of each partial pattern.
) Weighting is performed as shown in equation (17), and this is taken as the matching distance dBP.

そして、以上のように求めた距離ｄＢＰと、前ステップ
で求めた部分パターン間ベクトルのマッチングにより得
られたｄＶｅＣとを加算した距離ｄｉを求める。Then, a distance di is obtained by adding the distance dBP obtained as above and dVeC obtained by matching the inter-partial pattern vectors obtained in the previous step.

ｄｉ　＝　ｄｖｅｃ　＋　ｄｎｐ　　　　　　　　−（
１８１以上の操作を部分パターン間ベクトルによる分類
で残った全ての候補文字について行ない、ｄｉによるソ
ーティングを行なう。di = dvec + dnp −(
The operations above 181 are performed on all candidate characters remaining after classification based on inter-subpattern vectors, and sorting based on di is performed.

次に部分パターンストロークコード分布マッチング部８
の動作について述べる。部分パターンストロークコード
分布マッチング部８は部分パターンＱ値マッチング部７
及びストロークコード化部４の出力を受け、中分類によ
り絞られた候補文字につき部分パターンストロークコー
ド分布を求め、この分布と登録パターンよりあらかじめ
作成され第４図の部分パターン辞書に格納されている部
分パターンストロークコード分布とのマッチングを行な
い、さらに上位候補の順位づけを行なう。この順位づけ
を行なう対象の範囲は例えばｄｉのソーティングで得ら
れた第１候補の距離ｄ、との比率で決める。すなわちｄ
ｊ／ｄ、≦ＺＲＡＴＥの候補文字までを対象範囲として
順位づけを行なう。Next, partial pattern stroke code distribution matching section 8
We will describe the operation of. The partial pattern stroke code distribution matching section 8 is the partial pattern Q value matching section 7.
In response to the output from the stroke coding unit 4, a partial pattern stroke code distribution is obtained for the candidate characters narrowed down by the intermediate classification, and the partial pattern stroke code distribution is created in advance from this distribution and the registered pattern and stored in the partial pattern dictionary shown in FIG. Matching with the pattern stroke code distribution is performed, and the top candidates are further ranked. The range of targets for this ranking is determined, for example, by the ratio to the distance d of the first candidate obtained by sorting di. i.e. d
Ranking is performed with candidate characters of j/d, ≦ZRATE as the target range.

ここで部分パターンストロークコード分布の算出法につ
いて説明する。−例として入カバターンが”逢”で第１
候補として選ばれた文字が“逢”であったとする。第３
図の文字辞書より候補文字”逢″のカット位置は（３、
４、’３　）で部分パターンはＬ＋’ｌ−＋’５である
ことがわかるので、この位置で入カバターン”逢”をカ
ットする。この場合カットして得た部分パターンは文字
辞書の内容と同じであるが、それぞれの部分パターン毎
に、ストロークコード化部４により得られたストローク
コードの本数の分布を算出する。例えば”幻の部分パタ
ーンを見た場合“〆”、“Ｚ”、“−”であるからｓｏ
ｌが１本、　ｓ０３が１本、　ｓ０５が１本という形で
ストロークコード分布が求められる。Here, a method for calculating the partial pattern stroke code distribution will be explained. -For example, the first cover pattern is “Ai”.
Assume that the character selected as a candidate is "Ai". Third
From the character dictionary in the figure, the cut position of the candidate character “Ai” is (3,
4, '3) shows that the partial pattern is L+'l-+'5, so cut the incoming cover turn "Ai" at this position. In this case, the partial patterns obtained by cutting are the same as the contents of the character dictionary, but the distribution of the number of stroke codes obtained by the stroke coding section 4 is calculated for each partial pattern. For example, if you look at the phantom partial patterns, they are "〆", "Z", and "-", so
The stroke chord distribution is determined in the form of one line for l, one line for s03, and one line for s05.

このようにして算出された部分パターンストロークコー
ド分布は、あらかじめ数個の登録パターンから同様な手
順により算出し平均化して作成しておいた第４図の部分
パターン辞書の部分パターンストロークコード分布とマ
ッチングされる。本例の場合、先ず、以上のようにして
算出した”又°”の部分パターンストロークコード分布
と、第４図の部分パターン辞書の”久″のストロークコ
ード分布との差をマッチング距離ｄｓ（、Ｌ）として得
る。The partial pattern stroke code distribution calculated in this way matches the partial pattern stroke code distribution of the partial pattern dictionary shown in Figure 4, which was created by calculating and averaging several registered patterns in advance using the same procedure. be done. In the case of this example, first, the difference between the partial pattern stroke code distribution of "mata °" calculated as above and the stroke code distribution of "ku" in the partial pattern dictionary in Fig. 4 is calculated by the matching distance ds (, L).

すなわち、入カバターン　　　部分パターンｓＱｌ・・・１本　　　ｓｏｌ・・・０．９本３０２・
・・０本　　３０２・・・０．１本ｓ０３・・・１本　
　　ｓ０３・・・０．４本ｓ０４　＝−０本　　ｓ０４
−＝　０．６本３０５・・・１本　　ｓ０５・・・１本
の差０．１　＋　０．１　＋　０．６　＋　０．６　＝
　１．４本がｄ　ｓ　（４）として求められる。同様に
Ｉｌｌ　Ｉ　ｎ　、　＠ｌ、”についてマッチングを行
ないｄｓ（１）、ｄ　ｓ　（Ｌ）を得、各部分パターン
９マッチング距離の合計の距離ｄ８を、ｄｓ”ｄｓ（又
）＋ｄｓ（））十ｄｓ（Ｌ）　　　・−ｕ９より算出す
る。That is, input cover turn partial pattern sQl...1 line sol...0.9 line 302.
...0 pieces 302...0.1 pieces s03...1 piece
s03...0.4 pieces s04 =-0 pieces s04
-= 0.6 lines 305...1 line s05...1 line difference 0.1 + 0.1 + 0.6 + 0.6 =
1.4 lines are obtained as d s (4). Similarly, matching is performed for Ill I n, @l,'' to obtain ds(1), ds(L), and the total distance d8 of each partial pattern 9 matching distance is ds''ds(also)+ds()) Calculated from 10ds(L) -u9.

部分パターンストロークコード分布マッチング部８では
、このｄｓをｄｊ／ｄ、≦ＺＲＡＴＥの各候補文字につ
き算出し、その算出結果にしたがって候補文字の順位づ
けを行なう。そして、その認識結果を出力端子９より図
示しない表示器等へ出力する。The partial pattern stroke code distribution matching section 8 calculates this ds for each candidate character of dj/d, ≦ZRATE, and ranks the candidate characters according to the calculation result. Then, the recognition result is outputted from the output terminal 9 to a display device (not shown) or the like.

なお、上記実施例ではふれなかったが、順位づけされた
認識結果において筆記形状が非常に類似したもの、例え
ば”挨”と”埃”などについて部分パターン”＋”、”
あ”の違いとして○印の交差があれば”挨”、なければ
”埃″を認識結果とする詳細チェック部を設けるように
すると、より一層確実な認識結果が得られる。Although not mentioned in the above embodiment, partial patterns "+" and "dust" are used for those whose handwriting shapes are very similar in the ranked recognition results, such as "dust" and "dust".
A more reliable recognition result can be obtained by providing a detailed check section that recognizes "dust" if there is an intersection of ○ marks as a difference between "A" and "dust" if not.

また、上記実施例の説明では部分パターンＱ値マッチン
グ部７においてα八代にて示したａｖｅｃ　＋（ｌａｐ
の加算値により候補の順位づけを行なうとして説明した
が、ｄＢＰのみで候補の順位づけを行なうことも可能で
ある。In addition, in the explanation of the above embodiment, the partial pattern Q value matching unit 7 avec + (lap
Although it has been described that the candidates are ranked based on the added value of dBP, it is also possible to rank the candidates only based on dBP.

−さらに、中分類部６において部分パターン間ベクトル
のマッチングによる候補の絞シ込みに加え、部分パター
ンストロークコード分布マッチングにより距離がある閾
値以上のとき候補としない処理を追加し、後処理を行な
う候補文字を絞るようにすることも可能である。-Furthermore, in addition to narrowing down the candidates by matching vectors between partial patterns in the medium classification unit 6, a process is added to exclude candidates when the distance is greater than a certain threshold by partial pattern stroke code distribution matching, and candidates are subjected to post-processing. It is also possible to narrow down the characters.

（発明の効果）以上詳細に説明したように、本発明によれば、部分ハタ
ーン毎の特徴パラメータの抽出及びマッチングを行なう
ことにより、パターンマッチング法の欠点である筆記入
力される形状のバラツキ（特に文字を構成する部分パタ
ーン間の位置のずれ）による距離値の変動が吸収される
ので、この変動に起因する誤認識を防止できる。また、
部分パターン間ベクトルのマッチングによる候補の絞シ
込み及び部分パターン毎に各ストローク各セグメントの
情報を表わした部分パラメータＱ値を用いたマッチング
により、最適ストローク対応づけ処理が不要となり、処
理が短縮化される。このように本発明は筆記形状のバラ
ツキに強く、シかも処理量が少なくてすむ優れたオンラ
イン文字認識装置を提供することができる。(Effects of the Invention) As described in detail above, according to the present invention, by extracting and matching feature parameters for each partial pattern, it is possible to eliminate variations in shapes input by hand, which is a drawback of pattern matching methods (especially Since fluctuations in distance values due to positional deviations between partial patterns constituting characters are absorbed, erroneous recognition caused by these fluctuations can be prevented. Also,
Narrowing down the candidates by matching vectors between partial patterns and matching using partial parameter Q values representing information on each segment of each stroke for each partial pattern eliminates the need for optimal stroke matching processing and shortens the processing time. Ru. As described above, the present invention can provide an excellent online character recognition device that is resistant to variations in writing shapes and that requires less processing.

[Brief explanation of drawings]

第１図は本発明の一実施例のオンライン文字認識装置の
構成を示すブロック図、第２図は第１図の装置の動作フ
ローチャート、第３図は文字辞書の構成例を示す図、第
４図は部分パターン辞書の構成例を示す図、第５図は前
処理の説明図、第６図はストロークコード化処理の説明
図、第７図は部分パターン間ベクトルの説明図、第８図
は部分パターン間ベクトル算出の説明図、第９図は部分
パターンＱ値算出の説明図、第１０図は入カバターンと
登録パターンの特徴点を示す図、第１１図は変形入カバ
ターンの説明図、第１２図はストロークの対応づけ処理
説明図である。１・・・タブレット、２・・・前処理部、３・・・特徴
点抽出部、４・・・ストロークコード化部、５・・・大
分類部、６・・・中分類部、７・・・部分パターンＱ値
マッチング部、８・・・部分パターンストロークコード
分布マッチング部、９・・・出力端子。FIG. 1 is a block diagram showing the configuration of an online character recognition device according to an embodiment of the present invention, FIG. 2 is an operation flowchart of the device shown in FIG. 1, FIG. 3 is a diagram showing an example of the configuration of a character dictionary, and FIG. The figure shows an example of the structure of a partial pattern dictionary, FIG. 5 is an explanatory diagram of preprocessing, FIG. 6 is an explanatory diagram of stroke encoding processing, FIG. 7 is an explanatory diagram of vectors between partial patterns, and FIG. FIG. 9 is an explanatory diagram of calculation of vectors between partial patterns. FIG. 9 is an explanatory diagram of partial pattern Q value calculation. FIG. 10 is a diagram showing feature points of input cover turns and registered patterns. FIG. 12 is an explanatory diagram of stroke association processing. DESCRIPTION OF SYMBOLS 1... Tablet, 2... Preprocessing part, 3... Feature point extraction part, 4... Stroke encoding part, 5... Main classification part, 6... Middle classification part, 7. . . . Partial pattern Q value matching section, 8 . . . Partial pattern stroke code distribution matching section, 9 . . . Output terminal.

Claims

[Scope of Claims] A preprocessing unit that removes unnecessary data from a coordinate data string obtained by handwritten input on a tablet and performs a linearization process, and a written character from the coordinate data string linearized by the preprocessing unit. A feature point extraction unit that extracts feature points representing the characteristics of the strokes that make up the stroke, and a stroke coding unit that codes each stroke based on the positional relationship of the feature points extracted by the feature point extraction unit. An online character recognition device that performs character recognition by comparing the output data of the stroke encoding section or stroke encoding section with registered pattern data registered in advance, receives the output of the stroke encoding section and roughly categorizes the written characters according to the number of strokes. The major classification section performs the following: The part of the candidate character selected by the major classification section that is written as a series in handwriting is defined as a partial pattern, and each character is extracted from the segment information connecting the feature points extracted by the feature point extraction section. Calculate the center of gravity of the partial patterns, and based on the calculated center of gravity data, find a vector between partial patterns whose starting and ending points are the center of gravity of one partial pattern and the center of gravity of another partial pattern. The intermediate classification section performs matching with vectors between partial patterns created from the registered patterns in the same way, and performs intermediate classification according to the matching results. A partial pattern Q value representing the characteristics of the partial pattern extracted from each segment information of each stroke is calculated, and this partial pattern Q value and a partial pattern Q previously created from registered patterns in the same manner are calculated.
A partial pattern Q value matching unit performs matching with the value and ranks candidate characters according to this matching result and the matching result by the intermediate classification unit, and the top candidates ranked by the partial pattern Q value matching unit. For each character, the stroke code distribution extracted by the stroke coding unit is calculated for each partial pattern, and this stroke code distribution is matched with the partial pattern stroke code distribution previously created from registered patterns in the same way. and a partial pattern stroke code distribution matching section that performs the following: and performs a final ranking of candidate characters according to the matching results.