JPH0981690A

JPH0981690A - Handwritten character on-line recognition device and character style registering and learning method

Info

Publication number: JPH0981690A
Application number: JP7239932A
Authority: JP
Inventors: Masaki Nakagawa; 正樹中川; Ban Touu Ree; バントゥーレー
Original assignee: Individual
Current assignee: Individual
Priority date: 1995-09-19
Filing date: 1995-09-19
Publication date: 1997-03-28
Anticipated expiration: 2015-09-19
Also published as: JP2994992B2

Abstract

PROBLEM TO BE SOLVED: To highly efficiently recognize characters with a small dictionary registration number even for the peculiar form and abbreviated forms, etc., of individuals. SOLUTION: When dictionary registration is performed for an input character pattern 'GO' including the peculiar form or the like in a part (left-hand side of character, for instance) the input character pattern is compared with a standard pattern corresponding to it, a partial pattern (left-hand side of character) unsmilar to a radical pattern for constituting the standard pattern is detected, the partial pattern is automatically divided from the input character pattern and the partial pattern is registered as the pattern of a partinent radical. When 'HANASHI' whose left-hand side part is turned to the similar peculiar form is written and inputted, recognition rate is improved when the standard pattern matched with it is constituted by combining the 'left-hand side' of the peculiar form and the standard pattern 'SHITA'.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明はタブレットから手書
き入力された文字をオンラインで認識する手書き文字オ
ンライン認識装置に係り、特に、標準的な字体と異なる
手書き文字を少ないユーザ登録辞書から高効率で認識す
るのに好適な字体登録・学習方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a handwritten character online recognition device for recognizing characters handwritten on a tablet online, and particularly to recognize handwritten characters different from standard typefaces with high efficiency from a small user registration dictionary. The present invention relates to a suitable font registration / learning method.

【０００２】[0002]

【従来の技術】タブレットから手書き入力された文字パ
ターンを認識する装置では、入力された手書き文字に対
し、予め装置に登録されている標準字体の辞書を用いた
マッチング処理が行われる。手書き入力文字が、標準字
体のように正しい筆順でしかも丁寧に楷書で書かれてい
れば、その認識率は高くなる。しかし、手書き入力文字
の筆順が間違っていると、その入力文字の特徴点列の順
序が標準字体の特徴点列の順序と大きく異なってしま
い、認識率が低くなる。2. Description of the Related Art In a device for recognizing a character pattern handwritten from a tablet, a matching process is performed on an input handwritten character using a dictionary of standard fonts registered in advance in the device. If the handwritten input characters are written in the right stroke order and carefully in a regular style like a standard typeface, the recognition rate will be high. However, if the stroke order of the handwritten input characters is incorrect, the order of the feature point sequences of the input characters will be significantly different from the order of the feature point sequences of the standard font, and the recognition rate will be low.

【０００３】そこで、例えば、情報処理学会第５０回
（平成７年前期）全国大会の予稿集２−６１頁の４Ｄ−
４「ストローク数非依存の高速オンライン手書き文字認
識手法」レー・バン・トゥー等（東京農工大学）に記載
されているように、構造化字体表現を併用する技術が提
案されている。この構造化字体表現を、図１２，図１３
を用いて説明する。Therefore, for example, Proceedings of the 50th National Convention of the Information Processing Society of Japan (the first half of 1995), page 2-61, 4D-
4 "High-speed online handwritten character recognition method that does not depend on the number of strokes" As described in Lei Van Too et al. (Tokyo University of Agriculture and Technology), a technology that uses structured font expression in combination has been proposed. This structured font representation is shown in FIG. 12 and FIG.
This will be described with reference to FIG.

【０００４】図１２に示した例では、「明」という標準
字体パターンを、「日」というパターンと、「月」とい
うパターンとから、下記手順(1)〜(4)により求め、これ
を入力文字と比較するようにしている。 (1) 構成要素「日」の標準パターンを、文字「明」の第
一部分の位置（この場合、偏の位置）とサイズに変換す
る。 (2) 構成要素「月」の標準パターンを、文字「明」の第
二部分の位置（この場合、旁の位置）とサイズに変換す
る。 (3) 上記(1)で変換された標準パターンの終点Ｅを、(2)
で変換された標準パターンの始点Ｓにつなげる。 (4) できたパターンを、１文字のサイズに変換する。In the example shown in FIG. 12, the standard font pattern "light" is obtained from the pattern "day" and the pattern "month" by the following steps (1) to (4), and this is input. I try to compare it with the letters. (1) Convert the standard pattern of the component "day" into the position (in this case, the biased position) and size of the first part of the character "bright". (2) Convert the standard pattern of the component "month" into the position (in this case, the position of the straw) and the size of the second part of the character "bright". (3) Set the end point E of the standard pattern converted in (1) above to (2)
It connects to the starting point S of the standard pattern converted by. (4) Convert the created pattern to the size of one character.

【０００５】上述したように、文字Ａ（上記例では
「明」）が、構成要素Ｂ（上記例では「日」）と構成要
素Ｃ（上記例では「月」）とからなり、文字Ａの標準パ
ターンとして登録された数（筆順や筆跡の違うパターン
数）がｋ個あり、構成要素Ｂとして登録された数がｍ個
あり、構成要素Ｃとして登録された数がｎ個ある場合、
入力文字と比較する標準パターンの全数は、ｍ×ｎ＋ｋ
となる。As described above, the character A (“bright” in the above example) is composed of the component B (“day” in the above example) and the component C (“month” in the above example). When the number of patterns registered as standard patterns (the number of patterns with different stroke order or handwriting) is k, the number registered as component B is m, and the number registered as component C is n,
The total number of standard patterns to be compared with the input characters is m × n + k
Becomes

【０００６】図１３は、「猫」の標準パターン形成の例
を示す図である。獣偏の標準パターン数が３個あり、草
冠の標準パターン数が３個あり、「田」の標準パターン
数が２個あり、「猫」として１個の標準パターンが登録
されていたとすると、「猫」の全体の標準パターン数
は、３×３×２＋１＝１９個となる。FIG. 13 is a diagram showing an example of standard pattern formation of "cat". If there are 3 standard patterns of beast bias, 3 standard patterns of canopy, 2 standard patterns of "rice field", and 1 standard pattern is registered as "cat", The total number of standard patterns of “cat” is 3 × 3 × 2 + 1 = 19.

【０００７】[0007]

【発明が解決しようとする課題】上述した構造化字体表
現を採用することで、全ての文字（漢字でいえば、第１
水準および第２水準の合計６３００有余の文字）をユー
ザ登録しなくても、手書き文字の認識率を向上させるこ
とができる。しかし、個人の癖字や略字は、くずした文
字でしかも一筆書きや二筆書き等で書かれるため、上記
の構造化字体表現を採用しても、基になる辞書パターン
が楷書等で書かれていると、全く認識できないことにな
る。また、異字体なども、認識不可能である。このよう
な略字や異字体を高効率で認識できるようにするには、
文字の全てについて個人の癖字等を登録しておく必要が
あり、登録作業に多大な時間を費やさなければならない
という問題がある。By adopting the above-described structured font expression, all characters (Kanji characters have the first character
It is possible to improve the recognition rate of handwritten characters without user registration of a total of 6300 surplus characters of the level and the second level. However, personal habits and abbreviations are written in single-stroke or double-stroke, etc., so even if the above structured font expression is adopted, the underlying dictionary pattern will be written in regular writing. Then you will not be able to recognize it at all. In addition, different characters cannot be recognized. In order to be able to recognize such abbreviations and variant characters with high efficiency,
There is a problem in that it is necessary to register individual habit characters for all the characters, and a great deal of time is required for registration work.

【０００８】本発明の目的は、個人の癖字や略字等も少
ない辞書登録数で高効率で認識可能にする字体登録・学
習方法と手書き文字オンライン認識装置を提供すること
にある。It is an object of the present invention to provide a font registration / learning method and a handwritten character online recognition device which enable highly efficient recognition with a small number of dictionary registrations for individual habit characters and abbreviations.

【０００９】[0009]

【課題を解決するための手段】上記目的は、登録されて
いる各種部首パターンの組み合わせで構成される標準パ
ターンと、筆跡を座標データ列として出力するタブレッ
トを用いて取得した入力文字パターンとのマッチング処
理を行って該入力文字パターンの認識を行う手書き文字
オンライン認識装置において、入力文字パターンについ
て辞書登録が行われたとき、該入力文字パターンとこれ
に対応する前記標準パターンとを比較し、該標準パター
ンを構成する部首パターンに類似しない部分パターンを
前記入力文字パターンから自動的に分割し、該部分パタ
ーンを該当する部首のパターンとして登録することで、
達成される。[Means for Solving the Problems] The above object is to provide a standard pattern composed of a combination of various radical patterns registered and an input character pattern acquired by using a tablet which outputs handwriting as a coordinate data string. In a handwritten character online recognition device that performs matching processing to recognize the input character pattern, when a dictionary registration is performed for the input character pattern, the input character pattern is compared with the standard pattern corresponding thereto, By automatically dividing a partial pattern that is not similar to the radical pattern forming the standard pattern from the input character pattern and registering the partial pattern as the pattern of the corresponding radical,
Achieved.

【００１０】人間は、ある部首など文字の一部について
特異な筆記をする場合、その部首などを字形の一部に含
むすべての文字において、系統的にその特異な筆記をす
る場合が多い。もしある文字でその特異な筆記をする部
分が特定され、その特定部分を登録できれば、上述した
構造化字体表現を採用することで、文字登録の労力は大
幅に軽減されることになる。When a person makes a peculiar writing on a part of a character such as a radical, in many cases, a person systematically makes the peculiar writing on all characters including the radical as a part of the character shape. . If the part where the peculiar handwriting is performed is specified by a certain character and the specific part can be registered, the labor of character registration will be greatly reduced by adopting the structured font expression described above.

【００１１】そこで、筆跡を座標データ列として出力す
るタブレットを用いて取得した文字パターンデータに基
づき筆記文字の認識を行うに際し、標準字体と異なる文
字パターンが登録されたとき、この登録文字パターンに
ついて、更に以下の処理を行う。Therefore, when recognizing a written character based on the character pattern data acquired using a tablet that outputs a handwriting as a coordinate data string, when a character pattern different from the standard font is registered, regarding this registered character pattern, Further, the following processing is performed.

【００１２】登録文字パターン（例えば、「語」という
文字で、言偏部分が略字となっている文字パターンとす
る。）と辞書パターンとを照合し、一致する部分パター
ン（「言」「吾」）があるか否かを判定する。一致する
部分パターンがあれば、残りの部分同士が対応するもの
として登録する（上記例で、「吾」部分が一致すれば、
残りの略字部分パターンと「言」とが一致するものとし
て登録する）。そして、この略字部分を、同様の略字部
分を含む他の文字（「詳」「論」等の文字）の構造化字
体表現による認識処理のときに使用する。The registered character pattern (for example, a character pattern in which the word bias is abbreviated with the word "word") is collated with the dictionary pattern, and the matching partial pattern ("word""go"). ) Is determined. If there is a matching partial pattern, register the remaining parts as corresponding (if the "go" part matches in the above example,
Register the remaining abbreviation pattern and the word as matching.) Then, this abbreviation part is used at the time of recognition processing of other characters including the same abbreviation part (characters such as "detailed" and "theory") by the structured font representation.

【００１３】[0013]

【発明の実施の形態】以下、本発明の一実施例を図面を
参照して説明する。図３は、本発明の一実施例に係る手
書き文字オンライン認識装置のうち文字認識のための字
体登録機能を示す構成図である。この文字オンライン認
識装置は、入力文字パターンを分割せずに登録（学習）
するパターン全体学習部１と、入力文字パターンを自動
的に分割し登録（学習）すべき部分を登録（学習）する
自動分割学習部２と、構造化字体表現辞書３と、入力文
字パターンが基本部首のみであるか否かを判定し基本部
首のみのとき（＝分割できないとき）には入力文字パタ
ーンをパターン全体学習部１に渡しそうでない場合には
入力文字パターンを自動分割学習部２に渡す判定部４
と、自動分割学習部２による入力文字の自動分割が失敗
しアプリケーションに対して分割要求が出力され（アプ
リケーションは、この分割要求により、ユーザに対して
自動分割の失敗文字を分割して入力するように促すガイ
ダンスを出力する。）てから入力された分割文字を学習
して構造化字体表現辞書３に登録する被分割指示パター
ン学習部５とからなる。BEST MODE FOR CARRYING OUT THE INVENTION An embodiment of the present invention will be described below with reference to the drawings. FIG. 3 is a configuration diagram showing a font registration function for character recognition in the handwritten character online recognition device according to an embodiment of the present invention. This character online recognition device registers (learns) without dividing the input character pattern.
Pattern whole learning unit 1, an automatic division learning unit 2 that automatically divides an input character pattern and registers (learns) a portion to be registered (learned), a structured font expression dictionary 3, and an input character pattern It is determined whether or not there is only radicals. If only basic radicals (= cannot be divided), the input character pattern is not passed to the overall pattern learning unit 1. If not, the input character pattern is automatically divided into learning units 2. Determination unit 4
Then, the automatic division learning unit 2 fails to automatically divide the input character, and a division request is output to the application (the application requests the user to divide and input the automatic division failure character by this division request). And outputs a guidance for prompting the user to input the divided character. Then, the input character learning unit 5 learns the input divided characters and registers them in the structured font expression dictionary 3.

【００１４】図４は、自動分割学習部２で行われる自動
分割学習処理手順を示すフローチャートである。この図
４で用いている変数と記号の定義は、 In_ptt：正規化された入力パターン B,C：認識結果として出力してほしい文字（被登録文
字） A の構成要素 Cat(A)：文字または部首 A のカテゴリパターンセット
（ A を表す標準パターンの集合） Des_Score：誤認識したときの入力パターンと誤認識結
果との類似度 MaxScore(In_ptt,Cat(A))：入力パターンと Cat(A) の
それぞれとの類似度の最大値である。FIG. 4 is a flowchart showing an automatic division learning processing procedure performed by the automatic division learning unit 2. The definitions of variables and symbols used in this Figure 4 are: In_ptt: Normalized input pattern B, C: Character (registered character) that you want to output as a recognition result (registered character) A component Cat (A): Character or Category pattern set of radical A (set of standard patterns that represent A) Des_Score: Similarity between input pattern and misrecognition result when misrecognized MaxScore (In_ptt, Cat (A)): Input pattern and Cat (A) Is the maximum value of the similarity with each of.

【００１５】この図４の処理手順では、まず、入力パタ
ーン In_Ptt から被登録文字Ａ（構成要素ＢとＣから成
る。）の一方の構成要素Ｂの標準パターンと似ている部
分が検出できるか否かを判定する（ステップ１）。この
判定結果が否定（類似しない）であれば後述するステッ
プ５に進み、構成要素Ｂと類似する部分が検出できる場
合には、残りの部分を切り出しこれを構成要素Ｃとして
登録してみる（ステップ２）。そしてこの登録により、
入力パターンと被登録文字のカテゴリパターンセットと
の最大類似度 MaxScore(In_Ptt,Cat(A)) が目的類似度
Des_Score（入力パターンに対して新しい標準パターン
を登録しても、その入力パターンと新しい標準パターン
との類似度が、登録前に誤認識していた標準パターンと
の類似度を越えなければ、登録後も類似度の一番大きい
ものは依然として誤認識していた標準パターンになって
しまう。従って、それを越えることが必要なことから、
Des_Scoreは、目的類似度となる。）より大きくなるか
否かを判定する（ステップ３）。この判定結果が肯定(y
es)であれば学習を終了し、否定(No)であればステップ
２での登録をキャンセルし（ステップ４）、ステップ５
に進む。In the processing procedure of FIG. 4, first, whether or not a portion similar to the standard pattern of one constituent element B of the registered character A (consisting of constituent elements B and C) can be detected from the input pattern In_Ptt. It is determined (step 1). If this determination result is negative (not similar), the process proceeds to step 5 described later, and if a part similar to the component B can be detected, the remaining part is cut out and registered as the component C (step 2). And by this registration,
The maximum similarity between the input pattern and the registered character category pattern set is MaxScore (In_Ptt, Cat (A)).
Des_Score (If a new standard pattern is registered for an input pattern, if the similarity between the input pattern and the new standard pattern does not exceed the similarity with the standard pattern that was misrecognized before registration, However, the one with the highest similarity still becomes the standard pattern that was erroneously recognized, so it is necessary to exceed it,
Des_Score is the target similarity. ) It is determined whether it becomes larger (step 3). This judgment result is affirmative (y
If it is es), the learning is terminated, and if negative (No), the registration in step 2 is canceled (step 4), step 5
Proceed to.

【００１６】ステップ５以下では、他方の構成要素Ｃに
対して前述したステップ１〜４と同じ処理を行い、分割
ができなかった場合（ステップ５での判定結果が否定ま
たはステップ８に進んだ場合）には、自動分割学習が失
敗したとしてアプリケーションに対して分割要求を出力
する。In step 5 and subsequent steps, the same processing as in steps 1 to 4 described above is performed on the other constituent element C, and division is not possible (if the judgment result in step 5 is negative or if step 8 is reached). ), A division request is output to the application because the automatic division learning has failed.

【００１７】図５は、図４のステップ１（構成要素Ｂに
似ている部分を検出する処理）の詳細手順を示すフロー
チャートである。図５の処理では、構成要素Ｂのカテゴ
リパターンセットと比較し、入力パターンのうち最も似
ている部分パターンを検出し、この部分パターンと残り
の部分とを区切る区切り点を返す処理を行う。この図５
のフローチャートの各ステップ内に記載した変数と記号
の定義は、次の通りである。FIG. 5 is a flow chart showing a detailed procedure of step 1 of FIG. 4 (processing for detecting a part similar to the component B). In the process of FIG. 5, the process is compared with the category pattern set of the component B, the most similar partial pattern of the input patterns is detected, and a process of returning a delimiter point that separates this partial pattern from the remaining part is performed. This Figure 5
The definitions of variables and symbols described in each step of the flowchart of are as follows.

【００１８】ストローク（ペンをおろしてからあげるま
でに書かれる一つの筆画）の j 番目筆点（公知技術で
求めた特徴点と同じ）は始点であるとき Is_Start(j) = 1 終点であるとき Is_End(j) = 1 Part(X：s〜e)：パターン X の第 s 点から第 e 点ま
での部分パターン Points(X)：パターン X の筆点数 Points1(X)：パターン X の第１部分の筆点数 Points2(X)：パターン X の第２部分の筆点数 n：構成要素 B のカテゴリパターンセットの標準パター
ン数 i : B のカテゴリパターンセット内の標準パターンを指
す指数 PB(i) : B のカテゴリパターンセットの i 番目の標準
パターン Cut_start = 2 ：第１部分の筆点数が最小のときのその
終点 Cut_end = Points(In_Ptt)-3：第１部分の筆点数が最大
のときのその終点 Cut_run : 第１部分の終点を探すための変数 Cut_candidate：第１部分の終点候補 Part_score：入力パターンの第１部分とB カテゴリの標
準パターンとの類似度 Part_maxscore：Part_score の最大値 OK_score：二つのパターンが似ているかどうかを判断す
るための類似度のしきい値以上が定義である。When the j-th writing point (same as the feature point obtained by the known technique) of the stroke (one stroke drawn from the time when the pen is lowered) is the starting point Is_Start (j) = 1 When the ending point Is_End (j) = 1 Part (X: s to e): Part of pattern X from the s-th point to the e-th point Pattern Points (X): Number of writing points of pattern X Points1 (X): First part of pattern X Number of writing points Points2 (X): Number of writing points in the second part of pattern X n: Number of standard patterns in the category pattern set of component B i: Index PB (i): B indicating the standard pattern in the category pattern set of B -Th standard pattern in the category pattern set of Cut_start = 2: end point when the number of strokes in the first part is the minimum Cut_end = Points (In_Ptt) -3: end point when the number of strokes in the first part is the maximum Cut_run : Variable for searching the end point of the first part Cut_candidate: End point candidate of the first part Part_score: Similarity between the first part of the input pattern and the standard pattern of B category Part_maxscore: Maximum value of Part_score OK_score: Similarity threshold for determining whether two patterns are similar or more is the definition .

【００１９】この図５の処理では、入力したパターンが
ある点で二つの部分に分けられ、第１部分は構成要素Ｂ
のある標準パターンに似ていると仮定して、その区切り
点を返す。構成要素Ｂのカテゴリパターンセットのすべ
ての標準パターン PB(i) について、入力パターンの第
１部分の終点（区切り点）が、筆点（特徴点）の２番目
（本実施例では、筆点の始点を０番目としているため、
２番目とは始点から数えて３番目の筆点）から Points
(In_ptt)-3 番目の点（同様に、筆点の最終点から２番
目）までの間にあるとしてその部分パターンを切り出
し、PB(i) とマッチングし、類似度が一番高い標準パタ
ーンと入力パターンの区切り点とを決めて、区切り点を
返す。In the processing of FIG. 5, the input pattern is divided into two parts, and the first part is the component B.
Returns its breakpoint, assuming that it looks like some standard pattern of. For all the standard patterns PB (i) of the category pattern set of the component B, the end point (break point) of the first part of the input pattern is the second writing point (feature point) (in this embodiment, Since the starting point is 0th,
2nd is the 3rd writing point counting from the starting point) to Points
(In_ptt) -The partial pattern is cut out as being between the 3rd point (similarly from the last point of the writing point to the 2nd point), matched with PB (i), and the standard pattern with the highest similarity is obtained. Determine the breakpoints in the input pattern and return the breakpoints.

【００２０】第１部分を入力パターンの始点から区切り
点までの部分とし、区切り点の次点（もし、それがスト
ロークの終点ならば、さらにその次の点）から入力パタ
ーンの終点までの部分を第２部分として返す。ただし、
この類似度が“０”または小さすぎる（しきい値 OK_sc
ore より小さい）場合は、似ている標準パターンが見つ
からないとして返す。The first part is the part from the start point of the input pattern to the break point, and the part from the next point of the break point (if it is the end point of the stroke, then the next point) to the end point of the input pattern. Return as the second part. However,
This similarity is “0” or too small (Threshold OK_sc
If it is smaller than ore), a similar standard pattern is returned as not found.

【００２１】以下、図５の詳細手順を説明する。まずス
テップ１０で、変数ｉと変数Part_maxscoreの値を
“０”とする初期化を行い、次のステップ１１で、変数
ｉの値が、構成要素Ｂの標準パターン数ｎより小さいか
否かを判定する。変数ｉの値は、後述するステップ１８
で“１”づつインクリメントされ、以下の処理が標準パ
ターン数ｎだけ繰り返される。最初はｉ＝０のため、ス
テップ１１の判定結果はYesとなってステップ１２に進
む。ステップ１２では、区切り点を探すための変数Cut_
runの値として変数Cut_startの値（今の場合、“２”）
を入れ、ステップ１３に進む。変数Cut_runの値は、後
述するステップ１７で“１”づつインクリメントされ
る。The detailed procedure of FIG. 5 will be described below. First, in step 10, initialization is performed by setting the values of the variable i and the variable Part_maxscore to “0”, and in the next step 11, it is determined whether or not the value of the variable i is smaller than the standard pattern number n of the component B. To do. The value of the variable i is set in step 18 described later.
Is incremented by 1, and the following process is repeated by the number n of standard patterns. Since i = 0 at first, the determination result of step 11 is Yes and the process proceeds to step 12. In step 12, a variable Cut_ for searching for a break point
The value of the variable Cut_start as the value of run (in this case, "2")
And enter step 13. The value of the variable Cut_run is incremented by "1" in step 17 described later.

【００２２】ステップ１３では、変数Cut_runの値が変
数Cut_endの値以下であるか否かが判定される。本実施
例では、入力パターンの筆点（特徴点）のうちどれが区
切り点になるかを調べるため、区切り点と仮定する点を
入力パターンの筆点上で１点づつ始点側から終点側にず
らしていき、区切り点より前の部分パターンを標準パタ
ーンと比較してその類似度を求めるものである。従っ
て、ステップ１３では、その仮定した区切り点が第１部
分の終点（入力パターンの第２部分の筆点は最小でも２
点あるため、第１部分の終点の最大は、入力パターンの
終点の２点前となる。）に達したか否かを判定する。In step 13, it is judged whether or not the value of the variable Cut_run is less than or equal to the value of the variable Cut_end. In the present embodiment, in order to check which of the writing points (feature points) of the input pattern will be the break point, one point is assumed to be the breaking point from the starting point side to the end point side on the writing point of the input pattern. It shifts and compares the partial pattern before the break point with the standard pattern to obtain the degree of similarity. Therefore, in step 13, the assumed break point is the end point of the first part (the writing point of the second part of the input pattern is at least 2).
Since there are dots, the maximum end point of the first portion is two points before the end point of the input pattern. ) Is reached.

【００２３】ステップ１３での判定結果がYesの場合に
は、次のステップ１４に進み、入力パターンの始点から
現在の変数Cut_runの値の筆点までの部分パターンと、
構成要素Ｂのｉ番目の標準パターンとの類似度を求め
る。そして、次のステップ１５では、この類似度が現在
までの類似度スコアの最大値を示す変数Part_maxscore
の値より大きいか否かを判定する。この判定により類似
度が大きくなった場合には、ステップ１６に進み、現在
のCut_runの値を区切り点の終点候補を示す変数Cut_can
didateの値として入れると共に、ステップ１４で求めた
類似度Part_scoreの値を変数Part_maxscoreの値として
入れる。そして、ステップ１７に進み、Cut_runの値を
１つインクリメントしてステップ１３に戻る。ステップ
１５での判定で類似度が小さくなった場合には、ステッ
プ１６を飛び越してステップ１７に進む。If the result of the determination in step 13 is Yes, the process proceeds to the next step 14 and the partial pattern from the start point of the input pattern to the writing point of the current value of Cut_run,
The degree of similarity of the component B with the i-th standard pattern is calculated. Then, in the next step 15, this similarity is a variable Part_maxscore indicating the maximum value of the similarity scores to date.
Is larger than the value of. If the degree of similarity becomes large as a result of this determination, the process proceeds to step 16 and the current value of Cut_run is set as a variable Cut_can indicating the end point candidate of the break point.
The value of didate and the value of the similarity Part_score obtained in step 14 are input as the value of the variable Part_maxscore. Then, the process proceeds to step 17, the value of Cut_run is incremented by 1, and the process returns to step 13. If the degree of similarity is low in the determination in step 15, the process skips step 16 and proceeds to step 17.

【００２４】ステップ１３からステップ１７までの処理
を繰り返し、Cut_runの値が第１部分の終点に達したと
きは、ステップ１３からステップ１８に進んで変数ｉの
値をインクリメントし、比較対象とする標準パターンを
替え、以降、ステップ１１，１２と進み、ステップ１３
からステップ１７までの処理を繰り返す。When the value of Cut_run reaches the end point of the first part by repeating the processing from step 13 to step 17, the process proceeds from step 13 to step 18 to increment the value of the variable i and make it the standard to be compared. After changing the pattern, the process proceeds to steps 11 and 12 and step 13
The processes from to 17 are repeated.

【００２５】比較対象とする標準パターン数すべてにつ
いて処理を終了すると、ステップ１１での判定結果が否
定(No)となり、ステップ１９に進む。このステップ１９
では、変数Part_maxscoreの値がOK_scoreの値（しきい
値）より大きいか否かを判定し、しきい値より大きい場
合には構成要素Ｂの標準パターンに類似する部分パター
ンが見つかったとし、しきい値より小さい場合には標準
パターンに類似する部分パターンは見つからなかったと
判断する。When the processing is completed for all the numbers of standard patterns to be compared, the determination result in step 11 is negative (No), and the process proceeds to step 19. This step 19
Then, it is determined whether the value of the variable Part_maxscore is larger than the value of OK_score (threshold value), and if it is larger than the threshold value, it is assumed that a partial pattern similar to the standard pattern of the component B is found, and the threshold value is set. When it is smaller than the value, it is determined that a partial pattern similar to the standard pattern was not found.

【００２６】以上が、図４のステップ１の詳細手順であ
る。図４のステップ５の詳細手順は、基本的には図５の
処理手順と同様なため、その詳細な説明は省略する。こ
のステップ５では、標準パターン（構成要素Ｃ）と類似
する第２部分（旁の部分）を検出する。このステップ５
では、仮定した区切り点から終点までの部分を第２部分
として、カテゴリ C の標準パターンとマッチングを繰
り返し、区切り点を探す。そして、成功した場合には、
入力パターンの始点から区切り点の前点（もしそれがス
トロークの始点であれば、さらにその前の点）までの部
分を第１部分として返す。The above is the detailed procedure of step 1 in FIG. Since the detailed procedure of step 5 of FIG. 4 is basically the same as the processing procedure of FIG. 5, detailed description thereof will be omitted. In this step 5, the second part (straw part) similar to the standard pattern (component C) is detected. This step 5
Now, with the part from the assumed break point to the end point as the second part, the matching with the standard pattern of category C is repeated to search for the break point. And if successful,
The part from the start point of the input pattern to the previous point of the break point (if it is the start point of the stroke, further before it) is returned as the first part.

【００２７】図４に示す自動分割学習方法で、入力パタ
ーンの第１部分あるいは第２部分と似ている構成要素の
標準パターンが辞書にあれば、両部分間が筆画でつなげ
られた入力パターンに対しても、自動分割は成功する。
図１に、具体的な例を示す。図１において、左の箱は略
字の「言」偏で書かれた「語」の入力パターンである。
この入力パターンを自動分割学習したいとする。In the automatic division learning method shown in FIG. 4, if a standard pattern of constituent elements similar to the first part or the second part of the input pattern is in the dictionary, the input pattern in which both parts are connected by a stroke is formed. On the other hand, the automatic division succeeds.
FIG. 1 shows a specific example. In FIG. 1, the box on the left is an input pattern of "words" written in abbreviated "words".
Suppose that you want to automatically divide and learn this input pattern.

【００２８】「語」という漢字は「言」と「吾」からな
るため、それらの標準パターンを決められたサイズ
（「言」は偏のサイズに、「吾」は旁のサイズ）に変換
する。今の場合、「吾」のカテゴリパターンセットは標
準パターンが一個だけあるとする。Since the kanji "word" consists of "word" and "go", those standard patterns are converted into a predetermined size ("word" is a biased size, and "go" is a straw size). . In this case, it is assumed that the category pattern set of "go" has only one standard pattern.

【００２９】入力パターンから「吾」の標準パターンと
似ている部分を検出するには次のように行う。Cut_run
を Cut_start から Cut_end までの範囲で、仮定の区切
り点により、入力パターンを二つの部分に分けて、第２
部分を「吾」の標準パターンとマッチングする。類似度
が一番高く、また OK_Score より大きくなる点を最後的
な区切り点とする。図１の例では、点Ｃが区切り点であ
ると決定し、入力パターンの始点からＣの前点Ｃ'まで
の部分を第１部分のパターンとして返す。To detect a portion similar to the standard pattern of "go" from the input pattern, the following process is performed. Cut_run
In the range from Cut_start to Cut_end, the input pattern is divided into two parts by the assumed break point, and the second
Match the part with the standard pattern of "go". The point with the highest similarity and greater than OK_Score is the final breakpoint. In the example of FIG. 1, the point C is determined to be a break point, and the portion from the start point of the input pattern to the point C ′ before C is returned as the pattern of the first portion.

【００３０】上述した実施例は、漢字を入力パターンと
した例であるが、本発明は英単語の認識にも適用するこ
とができる。英単語を認識する場合には，接頭語や語
幹，活用語尾などの単位を単語の構成要素と考えること
で，上述した漢字の場合と同様に適用できる。図２にそ
の具体例を示す。単語 <software> は，要素 <soft>と
<ware> からなり，それぞれに，図２の右側に示した標
準パターンが登録されているとする。英単語の場合、単
語の標準パターンは、その構成要素の標準パターンを連
接することで生成される。Although the above-mentioned embodiment is an example in which a kanji is used as an input pattern, the present invention can be applied to the recognition of English words. When recognizing English words, by considering units such as prefixes, word stems, and inflection endings as word constituents, it can be applied in the same way as in the case of the above-mentioned kanji. FIG. 2 shows a specific example thereof. The word <software> and the element <soft>
It is assumed that the standard pattern shown in the right side of FIG. 2 is registered in each of the <ware>. In the case of English words, the standard pattern of words is generated by concatenating the standard patterns of its constituent elements.

【００３１】入力パターンとして、左側に示した個人の
癖字による筆記体のパターンが記入され，それを自動分
割して登録する。この場合、入力パターンから <soft>
または <ware> の標準パターンと似ている部分が検出で
きるか否かを判定する。上述した漢字の例と同様に、Cu
t_run の筆点を Cut_start から Cut_end までの範囲で
の仮定の区切り点として入力パターンを二つの部分に分
け、第１部分を <soft> の標準パターンとマッチング
し、または、第２部分を <ware> の標準パターンとマッ
チングする。そして、類似度が一番高く、また、OK_Sco
re より大きくなる点を最終的な区切り点として、例え
ば第２部分 <ware> を切り出し、残りの部分（癖字がひ
どく辞書パターンとは似ていない部分）を <soft> とし
て構造化字体表現辞書に登録する。As the input pattern, the cursive pattern of the individual habit character shown on the left side is entered, which is automatically divided and registered. In this case, from the input pattern <soft>
Or it judges whether the part similar to the standard pattern of <ware> can be detected. Like the example of Kanji mentioned above, Cu
The input pattern is divided into two parts by using the writing point of t_run as an assumed break point in the range from Cut_start to Cut_end, and the first part is matched with the standard pattern of <soft>, or the second part is <ware>. Matches the standard pattern of. And the similarity is highest, and OK_Sco
For example, the second part <ware> is cut out with the point that is larger than re as the final delimiter, and the remaining part (the part where the peculiar character is not very similar to the dictionary pattern) is <soft> and the structured font expression dictionary. Register with.

【００３２】図６は、図３に示す被分割指示パターン学
習部５の詳細処理手順を示すフローチャートである。こ
の被分割指示パターンの学習処理は、自動分割学習が成
功しなかったときアプリケーションによりユーザにガイ
ダンスが出力され、ユーザがそれに基づいて分割したパ
ターンを入力したときに起動される。この学習部５に
は、文字全体の入力パターンと、２つの部分パターンの
合わせて３つのパターンが渡される。図６に示すフロー
チャートの各ステップ中に使用した変数と記号の定義
は、以下の通りである。FIG. 6 is a flowchart showing a detailed processing procedure of the divided instruction pattern learning unit 5 shown in FIG. The learning processing of the divided instruction pattern is started when the application outputs guidance to the user when the automatic division learning is not successful, and when the user inputs the divided pattern based on the guidance. The learning unit 5 is provided with an input pattern of the entire character and a total of three patterns including the two partial patterns. The definitions of variables and symbols used in each step of the flowchart shown in FIG. 6 are as follows.

【００３３】In_Ptt：正規化された入力パターン In_Ptt1, In_Ptt2：正規化された二つの入力部分パター
ン Score1：In_Ptt1 を登録した後の In_Ptt と Cat(A) と
の最大類似度 Score1：In_Ptt2 を登録した後の In_Ptt と Cat(A) と
の最大類似度 Score：In_Ptt1 及び In_Ptt2 を登録した後の In_Ptt
と Cat(A) との最大類似度。In_Ptt: normalized input pattern In_Ptt1, In_Ptt2: two normalized input partial patterns Score1: In_Ptt1 after registration of the maximum similarity between In_Ptt and Cat (A) Score1: In_Ptt2 after registration Maximum similarity score between In_Ptt and Cat (A) of: In_Ptt after registering In_Ptt1 and In_Ptt2
The maximum similarity between and Cat (A).

【００３４】まず、ステップ２１で Score1,Score2 を
“０”とする初期化を行う。そして、次のステップ２２
で In_Ptt1 を構成要素Ｂとして登録してみる。次にス
テップ２３で、被登録文字のカテゴリパターンセットと
マッチングしてみてその最大類似度 MaxScore の値を S
core1 に入れてから、構成要素Ｂとしての登録をキャン
セルする（ステップ２４）。First, in step 21, Score1 and Score2 are initialized to "0". And the next step 22
Then register In_Ptt1 as component B. Next, in step 23, matching with the category pattern set of the registered character is performed, and the maximum similarity score MaxScore is set to S.
After entering core1, registration as component B is canceled (step 24).

【００３５】次に、ステップ２５で、In_Ptt2 を構成要
素Ｃとして登録してみる。そして、次のステップ２６
で、被登録文字のカテゴリパターンセットとマッチング
してみてその最大類似度 MaxScore の値を Score2 に入
れてから、構成要素Ｃとしての登録をキャンセルする
（ステップ２７）。Next, in step 25, In_Ptt2 is registered as the constituent element C. Then, the next step 26
Then, after matching with the category pattern set of the registered character and putting the value of the maximum similarity MaxScore into Score2, the registration as the constituent element C is canceled (step 27).

【００３６】次のステップ２８で、Score1 ＞ Score2
であるか否かを判定し、Yesであればステップ２９で Sc
ore1 ＞ Des_Score であるか否かを、Noであればステッ
プ３０で Score2 ＞ Des_Score であるか否かを夫々判
定する（Des_Score については図４のフローチャートの
定義参照）。ステップ２９での判定結果がYesであれば
次のステップ３１にて In_Ptt1 を構成要素Ｂとして登
録し、処理を終了する。ステップ３０での判定結果がYe
sであれば次のステップ３２にて In_Ptt2 を構成要素Ｃ
として登録し、処理を終了する。In the next step 28, Score1> Score2
If Yes, then in step 29, Sc
It is determined whether ore1> Des_Score or not, and if No, it is determined in step 30 whether or not Score2> Des_Score is satisfied (for Des_Score, refer to the definition of the flowchart of FIG. 4). If the determination result in step 29 is Yes, In_Ptt1 is registered as the constituent element B in the next step 31, and the process ends. The judgment result in step 30 is Ye
If s, In_Ptt2 is the constituent element C in the next step 32.
Then, the process ends.

【００３７】ステップ２９，ステップ３０での判定結果
がNoのときはステップ３３に進み、In_Ptt1 を構成要素
Ｂとして登録すると共に In_Ptt2 を構成要素Ｃとして
登録し、ステップ３４で、この登録した文字を標準パタ
ーンのカテゴリパターンセットとマッチングしてその最
大類似度を Score に入れる。そして、ステップ３５
で、Score ＞ Des_Score であるか否かを判定し、Yesの
場合には処理を終了し、Noの場合にはステップ３３での
登録をキャンセルし（ステップ３６）、全体のパターン
を登録して（ステップ３７）処理を終了する。When the judgment results in Steps 29 and 30 are No, the process proceeds to Step 33, In_Ptt1 is registered as the constituent element B and In_Ptt2 is registered as the constituent element C. In Step 34, the registered characters are standardized. Match the pattern category set to the maximum similarity in Score. And step 35
Then, it is determined whether or not Score> Des_Score. If Yes, the process ends. If No, the registration in step 33 is canceled (step 36), and the entire pattern is registered ( (Step 37) The process ends.

【００３８】上述したように、本発明の実施例に係る自
動分割学習では、入力パターンを辞書の標準パターンと
比較しながら、最適な自動分割つまり区切り点を決定す
る。入力パターンが複数の構成要素に分割されない基本
的な字形の漢字の場合には、図１に示すパターン全体学
習部が機能することが多いが、分割可能な字形（漢字）
のパターンに対しては、自動分割学習が効果を発揮す
る。なぜなら、入力パターンが略字または、標準でない
筆順を含んでいても、ほとんどの場合、少なくとも一つ
の部分が辞書の標準パターンと似ているからである。As described above, in the automatic division learning according to the embodiment of the present invention, the optimum automatic division, that is, the break point is determined while comparing the input pattern with the standard pattern of the dictionary. In the case of a basic kanji character whose input pattern is not divided into a plurality of constituent elements, the pattern whole learning unit shown in FIG. 1 often works, but the divisible character shape (kanji)
Automatic division learning is effective for this pattern. This is because even if the input pattern includes abbreviated characters or non-standard stroke order, in most cases, at least one part is similar to the standard pattern in the dictionary.

【００３９】図７に示す様に、「言」の略字を含んだ文
字（上段）と「木」の異字体を含んだ文字（下段）をペ
ンで手書き入力した場合、それらの入力パターンは辞書
の標準パターンと似ていないため、認識されない。そこ
で、適当に選んだ文字例えば「語」と「休」の手書き入
力パターンについて、上述した本発明実施例に係る自動
分割学習により略字の「言」偏と異字体の「木」とを分
離して登録すると、図８に示す様に、認識率が極めて高
くなる。As shown in FIG. 7, when a character including the abbreviations of "Koto" (upper row) and a character including the different characters of "Ki" (lower row) are handwritten with a pen, the input patterns are the dictionary. It is not recognized because it is not similar to the standard pattern of. Therefore, for a handwritten input pattern of appropriately selected characters, for example, "word" and "pause", the abbreviation "word" bias and the heterograph "tree" are separated by the automatic division learning according to the embodiment of the present invention described above. If registered with, the recognition rate becomes extremely high as shown in FIG.

【００４０】入力パターンと比較する辞書パターンは、
多いほど認識率は高くなるが、一方で認識処理に要する
時間が多大になってしまうという問題がある。そこで、
多数の辞書パターンの中から入力パターンに類似する確
率の高そうな辞書パターンの候補をふるいにかけて取り
出す処理（以下、この処理を「大分類処理」という。）
を、文字認識処理の前に行うのが好ましい。以下、この
大分類処理について説明する。The dictionary pattern to be compared with the input pattern is
The larger the number, the higher the recognition rate, but on the other hand, there is a problem that the time required for the recognition processing becomes long. Therefore,
A process of sieving out a dictionary pattern candidate having a high probability of being similar to the input pattern from a large number of dictionary patterns (hereinafter, this process is referred to as "major classification process").
Is preferably performed before the character recognition processing. Hereinafter, this large classification process will be described.

【００４１】大分類処理では、構造化字体表現辞書から
構成要素を組み合わせて形成される文字パターン、ある
いは、学習用の文字パターンから、それを偏や旁、冠な
どの構成部分に分割しないで得られる特徴（非構造化特
徴）を抽出して標準パターンを作成する。In the large classification process, a character pattern formed by combining constituent elements from a structured font expression dictionary or a learning character pattern is obtained without dividing it into constituent parts such as bias, straw, and crown. A standard pattern is created by extracting the feature (unstructured feature).

【００４２】まず、図９（ａ）の様に、文字パターンを
４×４のメッシュに分割する。次に、図９（ｃ）の領域
ACMK、BDNL、CEOM、FHRP のような、隣同士が重なり合
った小領域を考える。そして、それぞれを P00、P01、P
02、P10 というように Pyxと呼ぶことにする。Pyx の x
と y は、左上の小領域を P00 として、x は１つ右に
ずれるごとに、y は１つ下にずれるごとに、夫々１ずつ
大きくする。これにより、P00 から P22 までの九つの
小領域ができる。図９（ｂ）は、図９（ａ）から P03
を取り出した例である。First, as shown in FIG. 9A, the character pattern is divided into a 4 × 4 mesh. Next, the area of FIG.
Consider a small area with adjacent neighbors, such as ACMK, BDNL, CEOM, and FHRP. And each one is P00, P01, P
I will call it Pyx like 02 and P10. Pyx x
For y and y, the small area in the upper left is P00, and x is incremented by 1 each time it is shifted to the right, and y is incremented by 1 each time it is shifted down by 1. This creates nine subregions from P00 to P22. Fig. 9 (b) shows P03 from Fig. 9 (a).
Is an example of taking out.

【００４３】そして、各小領域毎に、入力パターンのス
トロークの各成分（４方向の成分（｜、＼、―、／））
が夫々どれくらい含まれているかを調べる。図１０に、
その値の例を示す。この値を一列にしたものを方向特徴
ベクトルと呼ぶ。Then, for each small area, each component of the stroke of the input pattern (components in four directions (|, \,-, /))
Find out how much each is included. In FIG.
An example of the value is shown. A series of these values is called a direction feature vector.

【００４４】大分類処理は、入力パターンからも同様の
特徴を抽出し、これを各文字の標準パターンの特徴とマ
ッチングし、類似度の高い標準パターンの文字カテゴリ
を複数個選択して候補として返す。In the large classification process, similar characteristics are extracted from the input pattern, the characteristics are matched with the characteristics of the standard pattern of each character, and a plurality of character categories of the standard pattern having a high degree of similarity are selected and returned as candidates. .

【００４５】先に述べた文字認識処理での登録で、どの
構成要素への入力パターンのどこの部分パターンの登録
が必要かは分かっている。文字認識処理における構造化
字体表現辞書では、この構成要素へ新たな部分パターン
をただ１つ登録するだけで、これを字形の一部に含む文
字カテゴリすべてに登録する効果がある。しかし、構成
要素を共有しない非構造化字体表現辞書では、このよう
な方法は実現できない。そこで、登録対象の構成要素を
字形に含んでいる文字カテゴリすべて（どの文字がその
構成要素を字形に含んでいるかは構造化字体表現辞書か
ら分かる）に対して、追加登録したい部分パターンを含
む文字パターンを構造化字体表現辞書から形成し、上に
述べた非構造化特徴を抽出して、その特徴を許すように
非構造化字体表現辞書を更新する。In the character recognition processing described above, it is known which partial pattern of the input pattern to which constituent element needs to be registered. In the structured font representation dictionary in the character recognition process, it is effective to register only one new partial pattern in this component and register it in all the character categories included in a part of the character shape. However, such a method cannot be realized with an unstructured font representation dictionary that does not share constituent elements. Therefore, for all the character categories that include the component to be registered in the glyph (you can see from the structured font expression dictionary which characters include that component), the character that includes the partial pattern you want to additionally register. The pattern is formed from the structured font representation dictionary, the unstructured features described above are extracted, and the unstructured font representation dictionary is updated to allow the features.

【００４６】図１１に例を用いて示す。「語」に対して
図１１（ａ）が筆記され、その登録が行われるとする。
この場合、略字の言偏部分が切り出され登録される。こ
の言偏を含む漢字（例えば図１１（ｃ）に示す「論」）
は構造化字体表現辞書から得られる。そこで、切り出さ
れた略字の「言」（図１１（ｂ））や他の言偏の漢字一
つ一つに対して、その方向特徴ベクトルを抽出し、非構
造化字体表現辞書を更新する。FIG. 11 shows an example. It is assumed that FIG. 11A is written for the “word” and that the registration is performed.
In this case, the word deviation portion of the abbreviation is cut out and registered. Kanji including this bias (for example, "ron" shown in Fig. 11 (c))
Is obtained from the structured font representation dictionary. Therefore, the direction feature vector is extracted for each of the cut-out abbreviations “words” (FIG. 11B) and other kanji of the word bias, and the unstructured font representation dictionary is updated.

【００４７】[0047]

【発明の効果】本発明によれば、略字，異字体などを一
部に含む字形を登録したときに略字，異字体などの部分
パターンを切り出して登録するため、同じ略字，異字体
などを含む他の字形をすべて登録しなくても、この登録
した部分パターンを利用することで認識率を向上させる
ことが可能となる。According to the present invention, when a character shape including a part of an abbreviated character or an atypical character is registered, a partial pattern of the abbreviated character or an atypical character is cut out and registered. Even if all the other character shapes are not registered, the recognition rate can be improved by using this registered partial pattern.

[Brief description of drawings]

【図１】本発明の一実施例に係る手書き文字オンライン
認識装置における漢字の自動分割を具体的に説明する図
である。FIG. 1 is a diagram specifically illustrating automatic division of Chinese characters in a handwritten character online recognition device according to an embodiment of the present invention.

【図２】本発明の一実施例に係る手書き文字オンライン
認識装置における英単語の自動分割を具体的に説明する
図である。FIG. 2 is a diagram specifically illustrating automatic division of English words in the handwritten character online recognition device according to an embodiment of the present invention.

【図３】本発明の一実施例に係る手書き文字オンライン
認識装置のうち文字認識のための字体登録機能を示す構
成図である。FIG. 3 is a configuration diagram showing a font registration function for character recognition in the handwritten character online recognition device according to an embodiment of the present invention.

【図４】図３に示す自動分割学習部で行われる自動分割
学習処理手順を示すフローチャートである。FIG. 4 is a flowchart showing an automatic division learning processing procedure performed by the automatic division learning unit shown in FIG.

【図５】図４のステップ１（構成要素Ｂに似ている部分
を検出する処理）の詳細手順を示すフローチャートであ
る。5 is a flowchart showing a detailed procedure of step 1 (processing for detecting a part similar to the component B) in FIG. 4;

【図６】図３に示す被分割指示パターン学習部の詳細処
理手順を示すフローチャートである。6 is a flowchart showing a detailed processing procedure of a divided instruction pattern learning unit shown in FIG.

【図７】自動分割による学習前における手書き入力パタ
ーンに対する認識結果を示す図である。FIG. 7 is a diagram showing a recognition result for a handwritten input pattern before learning by automatic division.

【図８】自動分割による学習後における手書き入力パタ
ーンに対する認識結果を示す図である。FIG. 8 is a diagram showing a recognition result for a handwritten input pattern after learning by automatic division.

【図９】大分類処理における文字パターン小領域を説明
する図である。FIG. 9 is a diagram illustrating a character pattern small area in a large classification process.

【図１０】大分類処理における方向特徴ベクトルの一例
を示す図である。FIG. 10 is a diagram showing an example of a direction feature vector in a large classification process.

【図１１】言偏を含む文字カテゴリへの非構造化字体表
現辞書の説明図である。FIG. 11 is an explanatory diagram of an unstructured font expression dictionary for character categories including word bias.

【図１２】「明」の標準パターンの形成例を示す図であ
る。FIG. 12 is a diagram showing an example of forming a standard pattern of “bright”.

【図１３】「猫」のパターン構成を示す図である。FIG. 13 is a diagram showing a pattern configuration of “cat”.

[Explanation of symbols]

１…パターン全体学習部、２…自動分割学習部、３…構
造化字体表現辞書、４…判定部、５…被分割指示パター
ン学習部。1 ... Pattern whole learning unit, 2 ... Automatic division learning unit, 3 ... Structured character expression dictionary, 4 ... Judgment unit, 5 ... Divided instruction pattern learning unit.

【手続補正書】[Procedure amendment]

【提出日】平成７年１２月２８日[Submission date] December 28, 1995

【手続補正１】[Procedure amendment 1]

【補正対象書類名】図面[Document name to be amended] Drawing

【補正対象項目名】図４[Correction target item name] Fig. 4

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【図４】 FIG. 4

Claims

[Claims]

1. A standard pattern composed of a combination of various radical patterns registered, and an input character pattern acquired by using a tablet which outputs a handwriting as a coordinate data string are subjected to matching processing to obtain the input character. In a handwritten character online recognition device for recognizing a pattern, when an input character pattern is registered in a dictionary, the input character pattern is compared with the corresponding standard pattern, and a radical pattern forming the standard pattern is compared. A character registration / learning method for a handwritten character online recognition device, characterized in that a partial pattern not similar to is automatically divided from the input character pattern and the partial pattern is registered as a corresponding radical pattern.

2. The input character pattern by performing a matching process between a standard pattern composed of a combination of registered partial glyph patterns and an input character pattern acquired using a tablet that outputs handwriting as a coordinate data string. In the handwritten character online recognition device for recognizing, when a dictionary is registered for an input character pattern, the input character pattern is compared with the standard pattern corresponding to the input character pattern, and a partial glyph pattern forming the standard pattern is obtained. A font registration / learning method for a handwritten character online recognition device, wherein a dissimilar partial pattern is automatically divided from the input character pattern and the partial pattern is registered as a corresponding partial character shape pattern.

3. The input character by performing a matching process between a standard pattern composed of a combination of various radical patterns registered and an input character pattern acquired using a tablet which outputs handwriting as a coordinate data string. In the handwritten character online recognition device for recognizing a pattern, when performing dictionary registration for an input character pattern, a partial pattern is input from the tablet for each radical constituting the input character pattern, and for each input partial pattern,
A standard pattern is generated by using the input partial pattern as a part of the standard pattern, and a maximum similarity is calculated by performing matching processing between each standard pattern corresponding to each input partial pattern and the input character pattern. A font registration / learning method for a handwritten character online recognition device, characterized in that an input partial pattern giving the maximum similarity is registered as a radical pattern when the value exceeds the similarity.

4. The method according to claim 3, wherein when the maximum similarity does not reach the target similarity, the similarity between the standard pattern formed by the combination of the input partial patterns and the input character pattern is calculated, and the similarity is the target similarity. A font registration / learning method for a handwritten character online recognition device, characterized in that a combination of input partial patterns is registered when the value exceeds a degree.

5. The font registration / learning method of the handwritten character online recognition device according to claim 4, wherein when the similarity does not reach the target similarity, the entire input character pattern is registered.

6. The input character is obtained by performing a matching process between a standard pattern composed of a combination of various radical patterns registered and an input character pattern acquired using a tablet which outputs handwriting as a coordinate data string. In a handwritten character online recognition device for recognizing a pattern, a means for comparing the input character pattern with the standard pattern corresponding thereto when the dictionary is registered for the input character pattern, and the standard pattern is configured as a result of the comparison. And a means for automatically dividing the partial pattern from the input character pattern when a partial pattern not similar to the radical pattern is detected, and a means for registering the partial pattern as a corresponding radical pattern. An online recognition device featuring handwritten characters.

7. The input character pattern by performing a matching process between a standard pattern composed of a combination of registered partial glyph patterns and an input character pattern acquired using a tablet that outputs handwriting as a coordinate data string. In a handwritten character on-line recognition device for recognizing, when a dictionary is registered for an input character pattern, a means for comparing the input character pattern with the corresponding standard pattern, and as a result of the comparison, constructs the standard pattern. And a means for automatically dividing the partial pattern from the input character pattern when a partial pattern not similar to the partial character pattern is detected, and a means for registering the partial pattern as a corresponding partial character pattern. Handwriting character online recognition device.

8. The input character is obtained by performing a matching process between a standard pattern composed of a combination of various radical patterns registered and an input character pattern acquired using a tablet which outputs handwriting as a coordinate data string. In a handwritten character online recognition device for recognizing a pattern, when performing dictionary registration for an input character pattern, a means for inputting a partial pattern for each radical constituting the input character pattern from the tablet to obtain a coordinate data string, A means for generating a standard pattern by using the input partial pattern as a part of the standard pattern for each input partial pattern, performing matching processing between the standard pattern corresponding to each input partial pattern and the input character pattern, and obtaining a maximum similarity. When the maximum similarity exceeds the target similarity, the maximum similarity is given. And a means for registering an input partial pattern as a corresponding radical pattern.

9. The method according to claim 8, wherein when the maximum similarity does not reach the target similarity, a means for determining the similarity between the standard pattern composed of a combination of the input partial patterns and the input character pattern, and the similarity is the target similarity. An online recognition device for handwritten characters, comprising: means for registering a combination of input partial patterns when the value exceeds a degree.

10. The handwritten character online recognition device according to claim 9, further comprising means for registering the entire input character pattern when the similarity does not reach the target similarity.

11. When the characteristic amount of a standard pattern of each character shape is stored in a dictionary for each character shape and a matching process between a large number of standard patterns and an input character pattern is performed to perform character recognition of the input character pattern. Prior to the character recognition processing, a candidate for a standard pattern having a feature amount similar to the feature amount of the input character pattern is searched in the dictionary and preselected, and a matching process between the standard pattern candidate and the input character pattern is performed. In the handwritten character online recognition device for performing the above, when a partial pattern is registered by the font registration / learning method according to any one of claims 1 to 3, the feature amount of a standard pattern having the partial pattern is obtained to obtain the dictionary. An on-line recognition device for handwritten characters, comprising: means for updating.