JPH08249424A

JPH08249424A - Handwriting recognition device

Info

Publication number: JPH08249424A
Application number: JP5152395A
Authority: JP
Inventors: Yasushi Ishizuka; 康司石塚; Hideaki Tanaka; 秀明田中; Minako Kuwata; みな子桑田; Yoshihiro Kitamura; 義弘北村
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1995-03-10
Filing date: 1995-03-10
Publication date: 1996-09-27
Anticipated expiration: 2018-09-08
Also published as: JP3445680B2

Abstract

PURPOSE: To recognize a handwritten character including a (running hand character) and a (simplified form character) stably at a high recognition rate. CONSTITUTION: A dictionary storage part 10 stores a reference pattern in unit of sub-pattern easy to write with one stroke. A stroke processing part 5 extracts a feature point representing the feature of a curvature part in advance before normalization. A feature resampling part 8 performs the normalization, interpolation and noise elimination at every chain of stroke, and interpolates an off- stroke section with the feature point. A character recognition part 9 performs DP matching using the reference pattern in unit of sub-pattern. In such a way, the feature of the curvature part is prevented from being lost due to the normalization by sampling the feature point of the curvature part before the normalization. Also, the accuracy of the DP matching is improved and a time is reduced by the feature point extracted by using both an equal division method and an angle method and a standard pattern for every sub-pattern. Also, it is possible to deal with the (running hand character) and the (simplified form character) by performing write with one stroke by interpolating the off-stroke section.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、ワードプロセッサや
パーソナルコンピュータ等に用いられて、タブレットか
らペン入力された手書き文字等を認識する手書き認識装
置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a handwriting recognizing device which is used in a word processor, a personal computer or the like to recognize a handwritten character or the like input from a tablet by a pen.

【０００２】[0002]

【従来の技術】現在、電子手帳,ワードプロセッサある
いはパーソナルコンピュータ等に対して文字を入力する
際にキーボードを用いる方法が主流である。ところが、
このキーボードを用いた文字入力には、一定の技量が必
要であり、文字入力の対象となる機器が電子手帳のよう
に小型化が進んでいるためにキーボードを設ける領域を
確保できない等の問題が生じてきている。そこで、タブ
レット等を用いたオンライン手書き文字入力が注目され
始めている。2. Description of the Related Art At present, a method of using a keyboard for inputting characters to an electronic notebook, a word processor, a personal computer or the like is mainly used. However,
Character input using this keyboard requires a certain amount of skill, and there is a problem that the area for the keyboard cannot be secured because the target device for inputting characters is becoming smaller like an electronic notebook. Is happening. Therefore, online handwritten character input using a tablet or the like has begun to attract attention.

【０００３】オンライン手書き文字入力された文字を認
識する際には、通常、タブレット上に書き込まれた手書
き文字はタブレット上における座標(ｘ,ｙ)で表現され
る。そして、手書き文字のストロークを認識の基本単位
として、ストローク毎に特徴抽出を行っている。そし
て、文字はストロークの集合であるとして、カテゴリ番
号に対応付けたストロークの特徴列を標準パターンとし
てストローク辞書に登録しておき、認識対象手書き文字
から抽出された特徴列と上記標準パターンとを照合し
て、認識対象手書き文字を構成するストロークのカテゴ
リを認識する。そして、文字辞書を用いてストロークの
組み合わせを認識することによって入力文字を認識する
のである。尚、上記辞書に標準パターンを登録する際
に、文字パターンを１個以上の「きへん」,「ごんべん」あ
るいは「ふるとり」等の部首に相当するサブパターンに分
割して登録する方法もある。On-line handwritten characters When recognizing an input character, the handwritten character written on the tablet is usually represented by coordinates (x, y) on the tablet. Then, the stroke of the handwritten character is used as a basic unit for recognition, and feature extraction is performed for each stroke. Then, assuming that the character is a set of strokes, the feature sequence of strokes associated with the category number is registered in the stroke dictionary as a standard pattern, and the feature sequence extracted from the recognition-target handwritten character is collated with the standard pattern. Then, the category of strokes forming the recognition target handwritten character is recognized. Then, the input character is recognized by recognizing the combination of strokes using the character dictionary. When registering a standard pattern in the dictionary, a character pattern is divided into one or more sub patterns such as "kihen", "gonben", or "furutori" and registered as subpatterns. There is also a method.

【０００４】上記ストローク毎の特徴抽出法としては、
以下のような特徴抽出法がある。 (１) ストロークの始点から終点までを所定数に分割し
て、各分割点の特徴を抽出する。 (２) ストロークの始点から終点までを等間隔に分割し
て、各分割点の特徴を抽出する。 (３) ストロークを追跡して所定角度以上の曲がり点を
特徴点として検出し、各特徴点の特徴を抽出する。As the feature extraction method for each stroke,
There are the following feature extraction methods. (1) The stroke start point to end point is divided into a predetermined number, and the features of each division point are extracted. (2) The stroke start point to end point are divided at equal intervals, and the features of each division point are extracted. (3) A stroke is traced, a bending point having a predetermined angle or more is detected as a feature point, and the feature of each feature point is extracted.

【０００５】上述の分類は、従来の手書き認識を特徴抽
出法から見た分類である。一方、手書き認識法全体から
見ると、「楷書」から「若干の続け字」までを認識対象とし
た手書き認識装置が提案されており、今後は「続け字」か
ら「崩し字」への発展が予想されている。しかしながら、
このような研究はかなり前から行われているにも拘わら
ず、実用レベルに達するものは未だ出現していない。ま
た、オンライン手書き認識の際には、上述のストローク
の変形の問題の他に、「筆順変動」という解決すべき大き
な問題が存在する。The above classification is a classification in which conventional handwriting recognition is viewed from the feature extraction method. On the other hand, from the perspective of the handwriting recognition method as a whole, a handwriting recognition device has been proposed that recognizes from "regular characters" to "some continuous characters". Is expected. However,
Despite such research being carried out for a long time, none that has reached a practical level has yet appeared. In addition, in the case of online handwriting recognition, in addition to the above-mentioned problem of stroke deformation, there is a big problem to be solved, that is, “stroke order variation”.

【０００６】現在まで提案されている「続け字」,「崩し
字」に対処する研究として、次のようなものがある。 [i] 変形ベクトル場に基づく文字変形予測を用いたオ
ンライン手書き文字認識（電子通信学会論文誌 '８６/
１２Ｖol.Ｊ６９−ＤＮo.１２) 入力パターンと辞書の標準パターンとのストローク対応
を求めた後、ＤＰマッチングでストローク対毎に対応特
徴点を決める。そして、この対応特徴点間の変形ベクト
ル場から求めた予測変形ベクトルを属性毎に重畳して標
準パターンを変形し、この変形した標準パターンと入力
パターンとの間でＤＰマッチングを行って距離を求める
のである。[0006] The following studies have been made to deal with the "continuation character" and the "collapsed character" that have been proposed so far. [i] Online handwritten character recognition using deformation vector prediction based on deformation vector field (IEICE Transactions '86 /
12 Vol.J69-D No.12) After determining the stroke correspondence between the input pattern and the standard pattern in the dictionary, the corresponding feature points are determined for each stroke pair by DP matching. Then, the predicted deformation vector obtained from the deformation vector field between the corresponding feature points is superimposed for each attribute to deform the standard pattern, and DP matching is performed between the deformed standard pattern and the input pattern to obtain the distance. Of.

【０００７】[ii] ファジィ推論によるくずし字の実時
間認識（電子通信学会ＰＲＵ８９−２９(１９８９)）ストローク毎のｘ,ｙ座標列にフーリエ変換を行ない、
各係数(文献ではｘ,ｙ共に３つ)のメバーシップ関数に
基づいてファジィ化データを求める。そして、このファ
ジィ化データと同様にファジィ化データで表された辞書
データとをストローク毎に対応付けしながら認識するの
である。尚、上記辞書データとしては、各文字毎に可能
性がある全ての続け字パターンを用意する。[Ii] Real-time recognition of comb characters by fuzzy inference (The Institute of Electronics and Communication Engineers PRU 89-29 (1989)) Fourier transform is performed on the x, y coordinate sequence for each stroke,
The fuzzy data is obtained based on the Mebership function of each coefficient (3 in x and y in the literature). Then, similarly to the fuzzy data, the dictionary data represented by the fuzzy data is recognized for each stroke. As the dictionary data, all possible continuous character patterns are prepared for each character.

【０００８】[iii] Attributed String Matching by S
plit-and-Merge for On-Line ChineseCharacter Recogn
ition（IEEE TRANSACTIONS ON PATTERN ANALYSIS AND M
ACTINEINTELLIGENSE VOL 15，NO.2 FEBRUARY 1993）例えば、同一人物のみが使用するような筆順固定の条件
の下で、入力のオフストローク部分と辞書のオフストロ
ーク部分とを対応付けて(結局ストロークを対応付け
て)、対応するストローク毎に詳細なＤＰマッチングを
行なって距離を求める。[Iii] Attributed String Matching by S
plit-and-Merge for On-Line ChineseCharacter Recogn
ition (IEEE TRANSACTIONS ON PATTERN ANALYSIS AND M
ACTINEINTELLIGENSE VOL 15, NO.2 FEBRUARY 1993) For example, under a fixed stroke order condition that only the same person uses, the offstroke part of the input is associated with the offstroke part of the dictionary (after all, the strokes are associated with each other). Then, detailed DP matching is performed for each corresponding stroke to obtain the distance.

【０００９】以上のように、従来の「続け字」,「崩し字」
のオンライン認識法は、以下の３つの基本的なアプロー
チに大別できる。 (４) 筆順フリー方式であって、筆順をストローク対応
付けによって吸収し、詳細な特徴や手法を用いて認識す
る。 (５) 筆順固定方式であって、例えば同一ユーザ等のよ
うに筆順を無視できる条件下や、ある程度のバリエーシ
ョン数に限定して認識する。As described above, the conventional "continuous character" and "collapsed character"
The online recognition method of can be roughly divided into the following three basic approaches. (4) The stroke order is a free method, and the stroke order is absorbed by the stroke correspondence, and is recognized by using detailed features and methods. (5) It is a fixed stroke order method, and is recognized under the condition that the stroke order can be ignored, such as the same user, or limited to a certain number of variations.

【００１０】また、取り込まれた文字データを画像デー
タとして取り扱い、ＯＣＲ(光学的文字認識装置)におけ
る文字認識手法を用いて手書き文字をパターン認識する
方法も提案されている。例えば、入力文字画像を１２×
１２の升目に分割して各升目内における表示画素数を求
め、表示画素の分布を特徴パターンとし、辞書に登録さ
れた標準パターンと照合することによって入力文字を認
識するのである。A method has also been proposed in which the captured character data is treated as image data and a pattern recognition of a handwritten character is performed by using a character recognition method in an OCR (optical character recognition device). For example, if the input character image is 12 ×
The number of display pixels in each cell is divided into 12 cells, the distribution of the display pixels is used as a characteristic pattern, and the input character is recognized by collating with the standard pattern registered in the dictionary.

【００１１】[0011]

【発明が解決しようとする課題】しかしながら、上記従
来の手書き文字認識装置には以下のような問題がある。 (Ａ) 上記ストローク毎の特徴抽出方法(１)において
は、ストローク全体を所定数に分割するために、対象ス
トロークの長さが非常に長い場合には特徴点の間隔が長
くなり過ぎ、特徴点の連なりから元のストロークの形状
が再現できない場合が生ずる。その場合には、ストロー
ク辞書との照合の際に一致度が低下して高い認識率が得
られないという問題がある。However, the above conventional handwritten character recognition device has the following problems. (A) In the feature extraction method (1) for each stroke, since the entire stroke is divided into a predetermined number, when the target stroke is very long, the feature points become too long, In some cases, the shape of the original stroke cannot be reproduced due to the sequence of. In that case, there is a problem in that the degree of coincidence decreases during collation with the stroke dictionary and a high recognition rate cannot be obtained.

【００１２】(Ｂ) 上記ストローク毎の特徴抽出方法
(２)においては、ストローク全体を等間隔に分割するた
めに、対象ストロークの長さが非常に長い場合には特徴
点の数が非常に多くなる。分割間隔を適当な範囲で短く
設定することによって、複雑な漢字等を１画で手書き入
力されても特徴点の連なりから元のストロークの形状を
忠実に再現できる。ところが、その反面、特徴点が多い
ためにストローク辞書との照合に時間が掛かるという問
題が生ずることになる。(B) Feature extraction method for each stroke
In (2), since the entire stroke is divided at equal intervals, the number of feature points becomes very large when the length of the target stroke is very long. By setting the division interval to be short within an appropriate range, the original stroke shape can be faithfully reproduced from a series of feature points even if a complicated Chinese character or the like is input by handwriting in one stroke. However, on the other hand, since there are many characteristic points, it takes time to collate with the stroke dictionary.

【００１３】(Ｃ) 上記ストローク毎の特徴抽出法(３)
においては、所定角度以上の曲がり点を特徴点として検
出するために、検出される特徴点数は角度の閾値に依存
することになる。その結果、角度の閾値を大きくする
と、特徴点数は少なくなって照合時間が早くなるが認識
率が低下する。一方、角度の閾値を小さくすると、認識
率を高めることができるが特徴点が多くなって照合時間
が長くなるという問題がある。また、上記角度の閾値を
適度に設定したとしても、仮名文字のように曲線が多い
ストロークの場合には曲率の個人差によって特徴点の位
置が安定しない。したがって、特徴点の座標を特徴量と
して用いる場合には高い認識率を得ることが難しい。(C) Feature extraction method for each stroke (3)
In the above, since the bending point having a predetermined angle or more is detected as the characteristic point, the number of detected characteristic points depends on the threshold value of the angle. As a result, when the angle threshold value is increased, the number of feature points is decreased and the matching time is shortened, but the recognition rate is reduced. On the other hand, when the angle threshold is reduced, the recognition rate can be increased, but there is a problem that the number of feature points increases and the matching time increases. In addition, even if the threshold value of the angle is appropriately set, the position of the characteristic point is not stable due to individual differences in curvature in the case of a stroke having many curved lines such as a kana character. Therefore, it is difficult to obtain a high recognition rate when the coordinates of the feature points are used as the feature amount.

【００１４】(Ｄ) 上記各特徴抽出法(１)および(２)に
おいては、正規化された複数ストロークから成る入力座
標列から特徴抽出を行うと、抽出された特徴量は入力文
字の特徴を表さなくなる場合がある。そこで、特徴を的
確に表す特徴点を確実に抽出するには特徴点を多数抽出
する必要があり、辞書照合に時間が掛かることになる。
一方、特徴点の抽出数を少なくすると特徴を的確に表わ
せる確率が低くなり、認識率が低下するという問題があ
る。例えば、図２８(a)において、正規化された入力座
標列(実線で表示)上から特徴を抽出する場合を考える。
その場合には、入力座標列から抽出される特徴点(“点"
で表示)は、入力座標列には屈曲部(α)が在るにも拘わ
らず、図２８(b)に示すごとく、あたかも直線上(１列
上)に配列されているような特徴点が抽出される。その
場合には、手書き文字の部首“ぎょうにんべん"の認識
結果が“にんべん"となってしまうのである。(D) In the above feature extraction methods (1) and (2), when feature extraction is performed from an input coordinate sequence consisting of a plurality of normalized strokes, the extracted feature amount indicates the feature of the input character. It may disappear. Therefore, it is necessary to extract a large number of feature points in order to surely extract the feature points that accurately represent the features, and it takes time for dictionary matching.
On the other hand, if the number of extracted feature points is reduced, there is a problem that the probability of accurately expressing the feature decreases, and the recognition rate decreases. For example, in FIG. 28A, consider a case where a feature is extracted from a normalized input coordinate sequence (displayed by a solid line).
In that case, feature points (“points”) extracted from the input coordinate sequence
28), there are characteristic points as if they are arranged on a straight line (one row above) as shown in FIG. 28 (b), even though there is a bent portion (α) in the input coordinate sequence. To be extracted. In that case, the recognition result of the handwritten character radical "Gyoninben" becomes "Ninben".

【００１５】(Ｅ) 上記各特徴抽出法(１)〜(３)の何れ
においても、ストローク単位で認識するので、筆者が意
図しないストロークである「入り」や「跳ね」と文字の構成
に欠くことのできないストロークとを区別ができず、認
識率が低下するという問題がある。さらに、上記ストロ
ークが持つ屈曲部の特徴が正規化によって消滅する場合
があり、その場合には正規化後のストロークから屈曲部
を表す特徴を正確に抽出することは不可能となる。(E) In each of the above-mentioned feature extraction methods (1) to (3), since the stroke is recognized as a unit, the strokes that the writer does not intend, "entry" and "bounce", are not included in the character configuration. There is a problem that the recognition rate is lowered because the strokes that cannot be distinguished cannot be distinguished from each other. Further, the characteristic of the bent portion of the stroke may disappear by normalization, and in that case, it is impossible to accurately extract the characteristic of the bent portion from the normalized stroke.

【００１６】また、上記従来の「続け字」,「崩し字」のオ
ンライン認識法には以下のような問題がある。 (Ｆ) 上記オンライン認識法(４)の筆順フリー方式で
は、ａ．ストローク対応付け自体の処理に時間が掛かり過
ぎ、全体として処理速度が遅い。ｂ．過度の変形や省略があった場合にはストローク対応
付けの性能が低下してしまい、認識率が低下する。という問題が生ずる。ここで、最終目的が「文字の認識」
であるにも拘わらず文字認識以前のストローク対応付け
の処理量が大となってしまう問題は、例えば、電子手帳
のようなプアーなハード構成における「続け字」,「崩し
字」のオンライン認識を実現不可能にしまうのである。
また、過度の変形や省略におけるストローク対応付けの
性能が低下する問題は、「続け字」のオンライン認識から
「崩し字」のオンライン認識への発展を望めなくしてしま
う。Further, the above-mentioned conventional "continuous character" and "collapsed character" online recognition methods have the following problems. (F) In the stroke order free method of the above online recognition method (4), a. It takes too much time to process the stroke association itself, and the processing speed is slow as a whole. b. If there is excessive deformation or omission, the performance of stroke correspondence deteriorates, and the recognition rate decreases. The problem arises. Here, the final purpose is "character recognition".
However, there is a problem that the amount of stroke matching processing before character recognition becomes large, for example, for online recognition of "continuation characters" and "collapsed characters" in a poor hardware configuration such as an electronic notebook. It will not be possible.
Further, the problem that the stroke matching performance deteriorates due to excessive deformation or omission makes it difficult to expect the development of online recognition of "continuous characters" from online recognition of "collapsed characters".

【００１７】(Ｇ) 上記オンライン認識法(５)の筆順固
定方式では、「続け字」,「崩し字」の認識性能は比較的よ
いのであるが筆順変動には極端に弱いという問題があ
る。この問題に対する最も簡単で唯一の解決方法は、筆
順変動のバリエーションを全て辞書に持つことである
が、辞書容量がバリエーションの数だけ単調増加するの
で一般的なシステム構築は事実上不可能となってしま
う。(G) In the fixed stroke order method of the above-mentioned online recognition method (5), although the recognition performance of "successive characters" and "collapsed characters" is relatively good, there is a problem that it is extremely weak against fluctuations in the stroke order. The simplest and only solution to this problem is to have all stroke order variations in the dictionary, but since the dictionary capacity monotonically increases by the number of variations, it is virtually impossible to build a general system. I will end up.

【００１８】(Ｈ) ＯＣＲを用いた文字認識方法を用い
る場合には、文字全体のパターンを認識対象としている
ために、「続け字」や「崩し字」を正確に認識する際には多
くの変形パターンを標準パターンとして辞書に登録して
おく必要があり、照合に長時間を有する。そこで、スト
ローク単位で認識しようとすると、ストロークの切れ目
が明確ではなく、認識率の低下を来すという問題があ
る。(H) When the character recognition method using OCR is used, since the pattern of the entire character is the recognition target, many characters are required to accurately recognize the "continuation character" or the "collapsed character". It is necessary to register the modified pattern as a standard pattern in the dictionary, and it takes a long time for matching. Therefore, there is a problem in that, when trying to recognize the stroke unit, the stroke break is not clear and the recognition rate is lowered.

【００１９】(Ｉ) また、上記オンライン認識法(４),
(５)およびＯＣＲを用いた文字認識方法に共通して、文
字単位の認識処理であるためにユーザーの入力が完了し
ないと認識処理が開始できないという問題がある。通
常、人間の筆記時間は２秒程度掛かる。そこで、この筆
記時間を利用して前認識処理を行うことは、電子手帳の
ようなプアーな装置では特に効果的である。ところが、
従来の認識法ではこの筆記時間を利用した前認識処理の
実現が困難なのである。(I) In addition, the online recognition method (4),
In common with (5) and the character recognition method using OCR, there is a problem that the recognition process cannot be started unless the user's input is completed because it is a character-by-character recognition process. Normally, human writing takes about 2 seconds. Therefore, performing the pre-recognition process using this writing time is particularly effective in a poor device such as an electronic notebook. However,
It is difficult for the conventional recognition method to realize the pre-recognition process using this writing time.

【００２０】そこで、この発明の目的は、安定して高い
認識率で「続け字」や「崩し字」を含む手書き文字等を高速
に認識できる手書き認識装置を提供することにある。SUMMARY OF THE INVENTION An object of the present invention is to provide a handwriting recognition device capable of stably recognizing handwritten characters including "continuation characters" and "collapsed characters" at a high recognition rate at a high speed.

【００２１】[0021]

【課題を解決するための手段】上記目的を達成するた
め、請求項１に係る発明は、手書き入力された筆跡の座
標を時系列に配列して成る座標データ列を生成するタブ
ッレトと,上記座標データ列から筆跡パターンの特徴を
表す特徴点とその特徴点の特徴量を抽出する特徴抽出部
と,抽出された特徴量を有する特徴点の列で成る特徴パ
ターンと辞書との照合を行って筆跡パターンを認識する
認識部と,この認識部による認識結果を表示する結果表
示部を有する手書き認識装置において、上記辞書は,代
表パターンを統計上一筆書きされ易いパターンに分離し
て成るサブパターンおよび各サブパターンの変形パター
ンから抽出された特徴量を有する特徴点の列を標準パタ
ーンとするサブパターン辞書であり、上記特徴抽出部
は,複数ストロークの座標データ列から特徴点と特徴量
を抽出する際に,各ストローク間のオフストロークを特
徴点で補間するオフストローク補間手段を備えて、上記
認識部は,上記複数のストロークとオフストロークとに
係る特徴量を有する特徴点列を一つの特徴パターンと見
なして上記サブパターン辞書との照合を行うことを特徴
としている。In order to achieve the above object, the invention according to claim 1 is a tablet for generating a coordinate data string formed by arranging the coordinates of handwriting input by handwriting in time series, and the coordinates. A feature extraction unit that extracts the feature points that represent the features of the handwriting pattern from the data sequence and the feature amount of the feature points, and the feature pattern that is a sequence of feature points that has the extracted feature amount and the dictionary are matched to make the handwriting. In a handwriting recognition device having a recognition unit for recognizing a pattern and a result display unit for displaying a recognition result by the recognition unit, the dictionary has subpatterns formed by separating a representative pattern into patterns that are easy to write in a statistical manner and each pattern. This is a sub-pattern dictionary in which a sequence of feature points having feature quantities extracted from a modified pattern of a sub-pattern is used as a standard pattern. When the feature points and the feature amount are extracted from the data sequence, the recognition unit includes off-stroke interpolation means for interpolating the off-stroke between strokes with the feature points, and the recognition unit is characterized by the strokes and the off-strokes. It is characterized in that a feature point sequence having a quantity is regarded as one feature pattern and collated with the sub-pattern dictionary.

【００２２】また、請求項２に係る発明は、請求項１に
かかる発明の手書き認識装置において、上記特徴抽出部
は、上記タブレットで生成された座標データ列から筆跡
パターンにおける屈曲部の特徴点と特徴量を抽出する第
１特徴抽出部と、上記第１特徴抽出部によって抽出され
た特徴点列を正規化し,この正規化後の特徴点列と上記
座標データの正規化値とから屈曲部の特徴点間を等間隔
近似する特徴点と特徴量を抽出する第２特徴抽出部を備
えたことを特徴としている。According to a second aspect of the present invention, in the handwriting recognition device according to the first aspect of the present invention, the feature extraction unit determines the feature points of the bent portion in the handwriting pattern from the coordinate data string generated by the tablet. The first feature extraction unit that extracts the feature amount and the feature point sequence extracted by the first feature extraction unit are normalized, and the bent portion is calculated based on the normalized feature point sequence and the normalized value of the coordinate data. A feature is that a feature point that approximates the feature points at equal intervals and a second feature extraction unit that extracts a feature amount are provided.

【００２３】また、請求項３に係る発明は、請求項１に
かかる発明の手書き認識装置において、上記特徴抽出部
で得られた特徴パターン上の特徴点の特徴量と上記標準
パターン上の特徴点の特徴量と両特徴量の差の値とを対
応付けて成るテーブルを格納したテーブル格納部を備え
て、上記認識部は,上記特徴パターン上の各特徴点と標
準パターン上の各特徴点とから成る特徴点対の特徴量に
基づいて,上記テーブルを引いて上記各特徴点対に係る
特徴量の差を求め,この各差の値に基づいて上記特徴パ
ターンと標準パターンとの距離を算出する距離算出手段
を備えたことを特徴としている。According to a third aspect of the invention, in the handwriting recognition device of the first aspect of the invention, the feature amount of the feature points on the feature pattern obtained by the feature extraction section and the feature point on the standard pattern are obtained. A table storing unit that stores a table in which the feature amount and the value of the difference between the two feature amounts are associated with each other, and the recognizing unit includes each feature point on the feature pattern and each feature point on the standard pattern. Based on the feature amount of the feature point pair, the table is drawn to find the difference in the feature amount for each feature point pair, and the distance between the feature pattern and the standard pattern is calculated based on the value of each difference. It is characterized in that it is provided with a distance calculating means.

【００２４】また、請求項４に係る発明は、請求項１に
かかる発明の手書き認識装置において、上記認識部は,
照合の対象となるパターン上における特徴点間毎に変動
重みを付けたＤＰマッチング処理を行うＤＰマッチング
手段を備えたことを特徴としている。The invention according to claim 4 is the handwriting recognition apparatus according to claim 1, wherein the recognition section is
It is characterized by including a DP matching means for performing a DP matching process in which a variation weight is given to each feature point on a pattern to be matched.

【００２５】また、請求項５に係る発明は、請求項４に
かかる発明の手書き認識装置において、上記特徴抽出部
およびＤＰマッチング手段を制御すると共に,予め用意
された学習パターンから上記特徴抽出部によって特徴量
を有する特徴点列で成る学習用特徴パターンを生成し,
この学習用特徴パターンと上記標準パターンとのＤＰマ
ッチング処理を上記ＤＰマッチング手段で行ってマッチ
ングパス上の点に係るマッチング距離を求め,上記標準
パターンにおける特徴点間毎に全学習用特徴パターンに
係るマッチング距離のばらつきの度合いを算出して上記
変動重みとする変動重み算出部を備えたことを特徴とし
ている。According to a fifth aspect of the present invention, in the handwriting recognition apparatus of the fourth aspect of the invention, the feature extracting unit and the DP matching unit are controlled, and the feature extracting unit extracts a learning pattern prepared in advance by the feature extracting unit. Generate a learning feature pattern consisting of a sequence of feature points with features,
The DP matching processing of the learning feature pattern and the standard pattern is performed by the DP matching means to obtain the matching distance for the point on the matching path, and the feature distance for all learning is associated with each feature point in the standard pattern. The present invention is characterized in that a variation weight calculation unit that calculates the degree of variation in matching distance and uses the variation weight as the variation weight is provided.

【００２６】また、請求項６に係る発明は、請求項４に
かかる発明の手書き認識装置において、上記特徴抽出部
によって抽出された特徴パターン上の特徴点から次の特
徴点までの距離と当該特徴パターンにおける全特徴点を
入力順に結ぶ距離との比の値を算出して,当該特徴点に
係る区間の上記変動重みとする変動重み算出部を備えた
ことを特徴としている。According to a sixth aspect of the present invention, in the handwriting recognition apparatus according to the fourth aspect of the invention, the distance from the feature point on the feature pattern extracted by the feature extraction unit to the next feature point and the feature A feature is that a variation weight calculation unit that calculates a value of a ratio to a distance connecting all feature points in a pattern in an input order and sets the value as the variation weight of a section related to the feature point is provided.

【００２７】また、請求項７に係る発明は、請求項１に
かかる発明の手書き認識装置において、代表パターンと
この代表パターンを構成するサブパターンとを対応付け
て成る代表パターン構成テーブルを格納した代表パター
ン構成テーブル格納部を備えると共に、上記認識部は、
上記特徴パターンとサブパターン辞書とを照合して上記
筆跡パターンを上記サブパターン毎に認識するサブパタ
ーン認識手段と、上記サブパターン認識手段による認識
の結果得られたサブパターン候補の組み合わせと上記代
表パターン構成テーブルとを照合して,上記筆跡パター
ンを認識する筆跡パターン認識手段を備えたことを特徴
としている。According to a seventh aspect of the present invention, in the handwriting recognition apparatus according to the first aspect of the present invention, a representative pattern configuration table in which a representative pattern and sub-patterns forming the representative pattern are associated with each other is stored. With the pattern configuration table storage unit, the recognition unit,
Sub-pattern recognition means for recognizing the handwriting pattern for each sub-pattern by comparing the characteristic pattern with a sub-pattern dictionary, a combination of sub-pattern candidates obtained as a result of recognition by the sub-pattern recognition means, and the representative pattern It is characterized in that a handwriting pattern recognition means for recognizing the handwriting pattern by collating with the configuration table is provided.

【００２８】また、請求項８に係る発明は、請求項７に
かかる発明の手書き認識装置において、上記特徴抽出部
は、上記座標データ列における座標点を入力順に結んで
成る直線の方向が所定値以上変化する変化点を特徴点と
して抽出する屈曲部特徴抽出手段と、上記曲部特徴抽出
手段によって抽出された特徴点間を等間隔近似する特徴
点を抽出する直線部特徴抽出手段と、上記両特徴抽出手
段によって抽出された各特徴点における次の特徴点への
方向を表す方向値を特徴量の一つとして算出する方向値
算出手段を備えたことを特徴としている。According to an eighth aspect of the present invention, in the handwriting recognition apparatus according to the seventh aspect of the present invention, the feature extraction unit has a straight line direction connecting the coordinate points in the coordinate data sequence in a predetermined value. Bent portion feature extraction means for extracting the changing points as the feature points, straight line feature extraction means for extracting the feature points that approximate the feature points extracted by the bent portion feature extraction means at equal intervals, and The present invention is characterized by including a direction value calculation unit that calculates, as one of the feature amounts, a direction value that represents the direction to the next feature point in each feature point extracted by the feature extraction unit.

【００２９】また、請求項９に係る発明は、請求項８に
かかる発明の手書き認識装置において、上記屈曲部特徴
抽出手段によって抽出された屈曲部の特徴点の数から上
記筆跡パターンの近似直線数を求め,この近似直線数と
当該筆跡パターンのストローク数とから特徴パターンの
ストローク数の範囲を定めるストローク数限定部を備え
ると共に、上記サブパターン辞書は,各標準パターン毎
に,当該標準パターンの元となる代表パターンのストロ
ーク数で成るヘッダ情報を有し、上記認識部は,上記ス
トローク数限定部によって定められた範囲内のストロー
ク数をヘッダ情報とする標準パターンのみを上記サブパ
ターン辞書から選出する特徴パターン選出手段を備えた
ことを特徴としている。According to a ninth aspect of the invention, in the handwriting recognition device of the eighth aspect of the invention, the approximate straight line number of the handwriting pattern is calculated from the number of the characteristic points of the bent portion extracted by the bent portion feature extracting means. The number of strokes of the feature pattern is defined from the approximate number of straight lines and the number of strokes of the handwriting pattern, and the sub-pattern dictionary is provided for each standard pattern with the original pattern of the standard pattern. And has header information consisting of the number of strokes of the representative pattern, and the recognizing unit selects from the sub-pattern dictionary only standard patterns whose header information is the number of strokes within the range defined by the stroke number limiting unit. It is characterized in that a feature pattern selecting means is provided.

【００３０】また、請求項１０に係る発明は、請求項８
に係る発明の手書き認識装置において、上記サブパター
ン辞書は,各標準パターン毎に,当該標準パターンの最終
特徴点から次ストロークにおける先頭特徴点への方向値
の有効範囲で成るヘッダ情報を有し、上記認識部は,上
記方向値算出手段によって算出された当該特徴パターン
における最終特徴点から次ストロークにおける先頭特徴
点への方向値が入る有効範囲をヘッダ情報とする標準パ
ターンのみを有効標準パターンと判定する有効標準パタ
ーン判定手段を備えたことを特徴としている。The invention according to claim 10 is the invention according to claim 8.
In the handwriting recognition device of the invention according to, the sub-pattern dictionary, for each standard pattern, has header information consisting of the effective range of the direction value from the final feature point of the standard pattern to the leading feature point in the next stroke, The recognition unit determines that only a standard pattern having header information as an effective range in which the direction value from the final feature point in the feature pattern calculated by the direction value calculation means to the first feature point in the next stroke is used as the valid standard pattern. It is characterized in that it is provided with an effective standard pattern determining means.

【００３１】また、請求項１１に係る発明は、請求項７
に係る発明の手書き認識装置において、上記認識部は,
認識済み筆跡パターンのサブパターン候補連鎖に続くこ
とが可能なサブパターンを上記代表パターン構成テーブ
ルを参照して求め,得られたサブパターンの標準パター
ンのみを上記サブパターン辞書から選出する標準パター
ン選出手段を備えたことを特徴としている。The invention according to claim 11 is the invention according to claim 7.
In the handwriting recognition device of the invention according to, the recognition unit,
A standard pattern selection means for obtaining a sub pattern that can follow the sub pattern candidate chain of the recognized handwriting pattern by referring to the representative pattern configuration table and selecting only the standard pattern of the obtained sub pattern from the sub pattern dictionary. It is characterized by having.

【００３２】また、請求項１２に係る発明は、請求項１
に係る発明の手書き認識装置において、上記タブレット
によって生成された座標データ列が書き込まれる一方,
この書き込まれている座標データ列が上記特徴抽出部に
よって読み出されるデュアルポート・メモリで成る入力
座標バッファと、上記特徴抽出部によって抽出された特
徴パターンが書き込まれる一方,この書き込まれている
特徴パターンが上記認識部によって読み出されるデュア
ルポート・メモリで成る特徴バッファと、上記特徴抽出
部および認識部を制御して,上記特徴抽出部による特徴
抽出処理および上記認識部による認識処理と上記タブレ
ットによる座標データ列生成処理とを平行して実行する
制御部を備えたことを特徴としている。The invention according to claim 12 is the invention according to claim 1.
In the handwriting recognition device of the invention according to, while the coordinate data string generated by the tablet is written,
The input coordinate buffer consisting of a dual-port memory from which the written coordinate data sequence is read by the feature extraction unit and the feature pattern extracted by the feature extraction unit are written, while the written feature pattern is A feature buffer composed of a dual port memory read by the recognition unit, a feature extraction process by the feature extraction unit and a recognition process by the recognition unit, and a coordinate data string by the tablet by controlling the feature extraction unit and the recognition unit. It is characterized by having a control unit that executes the generation process in parallel.

【００３３】[0033]

【作用】請求項１に係る発明では、タブレットに手書き
入力されると座標データ列が生成される。そして、この
生成された座標データ列が複数ストロークの座標データ
列である場合には、特徴抽出部のオフスロトーク補間手
段によって、オフスロトークが特徴点によって補間され
る。こうして、上記特徴抽出部によって、複数ストロー
クの座標データ列から、各ストロークとオフストローク
との特徴点と特徴量が抽出される。そうすると、認識部
によって、上記複数ストロークとオフスロトークとに係
る各特徴量を有する特徴点の列が１つの特徴パターンと
見なされてサブパターン辞書との照合が行われ、上記筆
跡パターンが認識される。In the invention according to claim 1, when the tablet is handwritten, a coordinate data string is generated. Then, when the generated coordinate data string is a coordinate data string of a plurality of strokes, the off-slot talk is interpolated by the feature points by the off-slot talk interpolation means of the feature extraction unit. In this way, the feature extraction unit extracts the feature points and the feature amount of each stroke and the off-stroke from the coordinate data sequence of a plurality of strokes. Then, the recognition unit considers the sequence of feature points having the feature amounts related to the plurality of strokes and the off-slot talk as one feature pattern, performs matching with the sub-pattern dictionary, and recognizes the handwriting pattern. It

【００３４】こうして、特徴抽出の際にオフスロトーク
が補間される結果、「入り」や「跳ね」や「続け」等を有する
変形パターンの標準パターンを上記サブパターン辞書に
登録しておくことによって、容易に、「続け字」や「崩し
字」の筆跡パターンが認識される。In this way, as a result of off-slot talk being interpolated at the time of feature extraction, a standard pattern of a modified pattern having "entry", "bounce", "continuation", etc. is registered in the sub-pattern dictionary. , It is easy to recognize the handwriting pattern of "continuation character" or "collapsed character".

【００３５】また、請求項２に係る発明では、上記タブ
レットからの座標データ列が特徴抽出部に入力される
と、先ず、第１特徴抽出部によって、上記座標データ列
から筆跡パターンにおける屈曲部の特徴点と特徴量が抽
出される。次に、第２特徴抽出部によって、上記屈曲部
の特徴点列が正規化され、この正規化後の特徴点間を等
間隔近似する特徴点と特徴量が抽出される。こうして、
正規化前に屈曲部の特徴点と特徴量が抽出されることに
よって、屈曲部の特徴が正規化によって消滅したとして
も確実に屈曲部を表す特徴点と特徴量が抽出される。Further, in the invention according to claim 2, when the coordinate data string from the tablet is input to the feature extraction unit, first, the first feature extraction unit extracts the bent portion in the handwriting pattern from the coordinate data string. Feature points and feature quantities are extracted. Next, the second feature extraction unit normalizes the feature point sequence of the bent portion, and extracts feature points and feature amounts that approximate the normalized feature points at equal intervals. Thus
By extracting the feature points and the feature amount of the bent portion before the normalization, even if the feature of the bent portion disappears due to the normalization, the feature points and the feature amounts representing the bent portion can be reliably extracted.

【００３６】また、請求項３に係る発明では、上記認識
部の距離算出手段によって、上記特徴パターン上の特徴
点と標準パターン上の特徴点とから成る特徴点対の特徴
量に基づいて、テーブルを引いて上記特徴点対に係る特
徴量の差が求められる。そして、この各差の値に基づい
て上記特徴パターンと標準パターンとの距離が算出され
る。こうして、上記認識部による上記特徴パターンと標
準パターンとの照合が、テーブルを引くという簡単な操
作によって迅速に行われる。Further, in the invention according to claim 3, the distance calculating means of the recognizing unit calculates the table based on the characteristic amount of the characteristic point pair consisting of the characteristic points on the characteristic pattern and the characteristic points on the standard pattern. Is subtracted to obtain the difference between the feature quantities associated with the feature point pair. Then, the distance between the characteristic pattern and the standard pattern is calculated based on the value of each difference. In this way, the matching between the characteristic pattern and the standard pattern by the recognition unit is quickly performed by a simple operation of pulling a table.

【００３７】また、請求項４に係る発明では、上記認識
部のＤＰマッチング手段によって、照合の対象となるパ
ターン上における特徴点間毎に変動重みを付けたＤＰマ
ッチング処理が行われる。こうして、ＤＰマッチング時
における時間伸長特性が上記筆跡パターンのストローク
に沿った特徴の現れ方に応じて変更される。Further, in the invention according to claim 4, the DP matching means of the recognition section performs the DP matching processing in which the variation weight is given to each feature point on the pattern to be matched. In this way, the time extension characteristic during DP matching is changed according to the appearance of the feature along the stroke of the handwriting pattern.

【００３８】また、請求項５に係る発明では、変動重み
算出部によって上記特徴抽出部およびＤＰマッチング手
段が制御されて、予め用意された学習パターンから学習
用特徴パターンが生成され、この学習用特徴パターンと
標準パターンとのＤＰマッチング処理によってマッチン
グパス上の点に係るマッチング距離が求められ、上記標
準パターンにおける特徴点間毎に全学習用特徴パターン
に係るマッチング距離のばらつきの度合いが上記変動重
みとして算出される。こうして、安定して特徴点が得ら
れる区間が強調されたＤＰマッチングが行われる。Further, in the invention according to claim 5, the feature extracting section and the DP matching means are controlled by the variation weight calculating section to generate a learning characteristic pattern from a learning pattern prepared in advance, and the learning characteristic pattern is generated. By the DP matching process between the pattern and the standard pattern, the matching distance related to the points on the matching path is obtained, and the degree of variation in the matching distance related to all the learning characteristic patterns is calculated as the variation weight for each characteristic point in the standard pattern. It is calculated. In this way, DP matching is performed in which the section in which the characteristic points are stably obtained is emphasized.

【００３９】また、請求項６に係る発明では、変動重み
算出部によって、上記特徴抽出部によって抽出された特
徴パターン上の特徴点から次の特徴点までの距離と当該
特徴パターンにおける全特徴点を入力順に結ぶ距離との
比の値が、当該特徴点に係る区間の上記変動重みとして
算出される。こうして、上記筆跡パターンにおける屈曲
部の特徴が強調されたＤＰマッチングが行われる。Further, in the invention according to claim 6, the variation weight calculation unit calculates the distance from the feature point on the feature pattern extracted by the feature extraction unit to the next feature point and all the feature points in the feature pattern. The value of the ratio to the distance connecting in the input order is calculated as the variation weight of the section related to the feature point. In this way, DP matching in which the characteristic of the bent portion in the handwriting pattern is emphasized is performed.

【００４０】また、請求項７に係る発明では、認識部の
サブパターン認識手段によって、上記特徴パターンと上
記サブパターン辞書とが照合されて上記筆跡パターンが
サブパターン毎に認識される。そして、筆跡パターン認
識手段によって、上記サブパターン認識手段による認識
の結果得られたサブパターン候補の組み合わせと上記代
表パターン構成テーブルとが照合されて、上記筆跡パタ
ーンが認識される。こうして、統計上一筆書きされ易い
サブパターン単位で認識処理が行われ、少ない認識候補
で迅速に上記筆跡パターンの認識が行われる。In the invention according to claim 7, the sub-pattern recognizing means of the recognizing unit collates the characteristic pattern with the sub-pattern dictionary to recognize the handwriting pattern for each sub-pattern. Then, the handwriting pattern recognition means collates the combination of the subpattern candidates obtained as a result of the recognition by the subpattern recognition means with the representative pattern configuration table to recognize the handwriting pattern. In this way, the recognition process is performed in units of sub-patterns that are statistically easy to draw with one stroke, and the handwriting pattern is quickly recognized with a small number of recognition candidates.

【００４１】また、請求項８に係る発明では、上記特徴
抽出部の屈曲部特徴抽出手段によって、上記座標データ
列における座標点を入力順に結んで成る直線の方向が所
定値以上変化する変化点が屈曲部の特徴点として抽出さ
れる。さらに、直線部特徴抽出手段によって、上記屈曲
部の特徴点間を等間隔近似する特徴点が抽出される。そ
うすると、方向値算出手段によって、上記両特徴抽出手
段によって抽出された各特徴点における次の特徴点への
方向を表す方向値が特徴量の一つとして算出される。こ
うして、角度法と等分割法とによって特徴点が得られ、
辞書照合時に用いられる特徴量の一つとして各特徴点の
上記方向値が用いられる。Further, in the invention according to claim 8, the bending point feature extraction means of the feature extraction unit detects a change point at which the direction of the straight line connecting the coordinate points in the coordinate data sequence changes in a predetermined value or more. It is extracted as a characteristic point of the bent portion. Further, the straight line feature extraction means extracts feature points that approximate the feature points of the bent portion at equal intervals. Then, the direction value calculation means calculates the direction value representing the direction to the next feature point among the feature points extracted by the feature extraction means as one of the feature amounts. Thus, the characteristic points are obtained by the angle method and the equal division method,
The direction value of each feature point is used as one of the feature amounts used at the time of dictionary matching.

【００４２】また、請求項９に係る発明では、ストロー
ク数限定部によって、上記屈曲部の特徴点の数から筆跡
パターンの近似直線数が求められ、この近似直線数と当
該筆跡パターンのストローク数とから特徴パターンのス
トローク数の範囲が定められる。そうすると、上記認識
部の特徴パターン選出手段によって、上記定められた範
囲内のストローク数をヘッダ情報とする標準パターンが
上記サブパターン辞書から選出され、この選出された標
準パターンのみが用いられて上記認識部によって上記辞
書照合が迅速に行われる。Further, in the invention according to claim 9, the stroke number limiting unit obtains the approximate straight line number of the handwriting pattern from the number of characteristic points of the bent portion, and the approximate straight line number and the stroke number of the handwriting pattern. From the range of the number of strokes of the characteristic pattern is determined. Then, the characteristic pattern selecting means of the recognizing unit selects a standard pattern having the number of strokes within the defined range as header information from the sub-pattern dictionary, and uses only the selected standard pattern to perform the recognition. The above-mentioned dictionary collation is quickly performed by the department.

【００４３】また、請求項１０に係る発明では、上記特
徴量抽出部の方向値算出手段によって、当該特徴パター
ンにおける最終特徴点から次ストロークにおける先頭特
徴点への方向値が算出される。そして、上記認識部の有
効標準パターン判定手段によって、上記算出された方向
値が入る有効範囲をヘッダ情報とする標準パターンが有
効標準パターンであると判定される。そうすると、上記
認識部によって、上記有効標準パターンと判定された標
準パターンのみが用いられて上記辞書照合が迅速に行わ
れる。According to the tenth aspect of the invention, the direction value calculating means of the feature amount extracting section calculates the direction value from the final feature point in the feature pattern to the leading feature point in the next stroke. Then, the valid standard pattern determination means of the recognition unit determines that the standard pattern having the valid range in which the calculated direction value is included as header information is the valid standard pattern. Then, the recognition unit uses only the standard pattern determined to be the effective standard pattern to quickly perform the dictionary matching.

【００４４】また、請求項１１に係る発明では、上記認
識部の標準パターン選出手段によって、代表パターン構
成テーブルが参照されて、認識済み筆跡パターンのサブ
パターン候補連鎖に続くことが可能なサブパターンが求
められ、この得られたサブパターンの標準パターンが上
記サブパターン辞書から選出される。そうすると、選出
された標準パターンのみが用いられて上記認識部によっ
て上記辞書照合が迅速に行われる。According to the eleventh aspect of the invention, the standard pattern selection means of the recognition section refers to the representative pattern configuration table to identify subpatterns that can follow the subpattern candidate chain of the recognized handwriting pattern. A standard pattern of the obtained sub-patterns is selected from the sub-pattern dictionary. Then, only the selected standard pattern is used, and the recognition unit quickly performs the dictionary matching.

【００４５】また、請求項１２に係る発明では、制御部
の制御によって、上記特徴抽出部および認識部がタブレ
ットと平行して動作される。そうすると、上記タブレッ
トによって生成された座標データ列がデュアルポート・
メモリで成る入力座標バッファに入力ポートから書き込
まれる。そして、この書き込まれた座標データ列が上記
特徴抽出部によって上記書き込みと平行して出力ポート
から読み出されて、上記特徴抽出が行われる。さらに、
抽出された特徴パターンがデュアルポート・メモリで成
る特徴バッファに入力ポートから書き込まれる。そし
て、この書き込まれた特徴パターンが上記認識部によっ
て上記書き込みと平行して出力ポートから読み出され、
上記辞書との照合が行われる。こうして、上記特徴抽出
処理および認識処理が座標データ列生成処理と平行して
行われる。According to the twelfth aspect of the invention, the feature extraction section and the recognition section are operated in parallel with the tablet under the control of the control section. Then, the coordinate data string generated by the tablet will be
It is written from the input port to the input coordinate buffer consisting of memory. Then, the written coordinate data string is read from the output port by the feature extraction unit in parallel with the writing, and the feature extraction is performed. further,
The extracted feature pattern is written from the input port to the feature buffer composed of the dual port memory. Then, the written feature pattern is read from the output port by the recognition unit in parallel with the writing,
Collation with the dictionary is performed. In this way, the feature extraction process and the recognition process are performed in parallel with the coordinate data string generation process.

【００４６】[0046]

【実施例】以下、この発明を図示の実施例により詳細に
説明する。尚、第１実施例〜第４実施例は主に特徴点お
よび特徴量の抽出法に関し、第５実施例は主に認識法に
関する。＜第１実施例＞図１は本実施例における手書き認識装置
のブロック図である。この手書き認識装置は、認識部
１,タブレット２および結果表示部３で概略構成され
る。The present invention will be described in detail below with reference to the embodiments shown in the drawings. It should be noted that the first to fourth examples mainly relate to the extraction method of the characteristic points and the characteristic amounts, and the fifth example mainly relates to the recognition method. <First Embodiment> FIG. 1 is a block diagram of a handwriting recognition apparatus in this embodiment. This handwriting recognition device is roughly configured by a recognition unit 1, a tablet 2, and a result display unit 3.

【００４７】上記タブレット２は、例えば表示一体型に
形成されており、筆記者がペン入力した筆跡データを電
気信号で表現される座標データ列に変換する。認識部１
は、得られた座標データ列に基づいて、後に詳述するよ
うな手書き認識処理を行って手書き文字等を認識する。
結果表示部３は、液晶表示装置等で構成されて認識部１
による認識結果を表示する。The tablet 2 is formed as a display-integrated type, for example, and converts handwriting data pen-input by a writer into a coordinate data string represented by an electric signal. Recognition unit 1
Recognizes a handwritten character or the like by performing a handwriting recognition process, which will be described in detail later, based on the obtained coordinate data string.
The result display unit 3 is composed of a liquid crystal display device or the like, and includes a recognition unit 1.
Display the recognition result by.

【００４８】上記認識部１は、ストローク終了判定部
４,ストローク処理部５,特徴格納部６,文字終了判定部
７,特徴再抽出部８および文字認識部９等によって構成
される。以下、順次各部の機能および動作について説明
する。The recognition unit 1 is composed of a stroke end determination unit 4, a stroke processing unit 5, a feature storage unit 6, a character end determination unit 7, a feature re-extraction unit 8 and a character recognition unit 9. Hereinafter, the function and operation of each unit will be sequentially described.

【００４９】上記ストローク終了判定部４は、タブレッ
ト２から順次送出されてくる座標データ列を取り込み、
この取り込まれた座標データ列に基づいてペンアップを
検知してストローク終了符号を付加する。ストローク処
理部５は、ストロークを構成している座標データ列に基
づいて、ストロークの屈曲部の座標点を特徴点として抽
出する。こうして得られたストローク終了符号が付加さ
れた特徴点の座標データ列は、特徴格納部６に格納され
る。上記文字終了判定部７は、タブレット２からの座標
データ列に基づいて１文字の終了を検知して、文字終了
情報を出力する。The stroke end judging section 4 fetches the coordinate data sequence sequentially sent from the tablet 2,
Pen-up is detected on the basis of the coordinate data string taken in and a stroke end code is added. The stroke processing unit 5 extracts the coordinate points of the bent portion of the stroke as the characteristic points based on the coordinate data string forming the stroke. The coordinate data string of the feature point to which the stroke end code is added thus obtained is stored in the feature storage unit 6. The character end determination unit 7 detects the end of one character based on the coordinate data string from the tablet 2 and outputs the character end information.

【００５０】上記特徴再抽出部８は、文字終了判定部７
からの文字終了情報に基づいて、特徴格納部６に格納さ
れている特徴点列から１文字分の特徴点列を切り出して
ストローク連鎖を生成して正規化し、各特徴点間の距離
と方向を揃える為に特徴点の追加/削除を行う。さら
に、実際にペン入力されていない区間(以下、オフスト
ロークと言う)にもストロークがあるものと想定して特
徴点を挿入する。上記文字認識部９は、特徴再抽出部８
で最終的に得られた特徴点列と辞書格納部１０に登録さ
れている標準パターンとの重み付けＤＰマッチングを行
ってサブパターンを認識し、さらに、辞書格納部１０に
登録されている文字辞書を用いて手書き入力文字を認識
する。そして、形状では区別できない文字に対して特別
分類処理を行う。The feature re-extracting section 8 includes a character end determining section 7
Based on the character end information from, the feature point sequence for one character is cut out from the feature point sequence stored in the feature storage unit 6, a stroke chain is generated and normalized, and the distance and direction between each feature point are calculated. Add / delete feature points to align. Further, the feature points are inserted on the assumption that there is a stroke even in a section where the pen is not actually input (hereinafter referred to as an off stroke). The character recognition unit 9 includes a feature re-extraction unit 8
In order to recognize the sub-pattern by performing the weighted DP matching between the feature point sequence finally obtained in step 3 and the standard pattern registered in the dictionary storage unit 10, the character dictionary registered in the dictionary storage unit 10 Recognize handwritten input characters using. Then, special classification processing is performed on characters that cannot be distinguished by their shapes.

【００５１】ここで、上記辞書格納部１０に登録されて
いる標準パターンは、ペンの「入り」や「跳ね」あるいは
「続け文字」や「崩し文字」にも対処可能なように、次のよ
うにして生成される。一般的に、文字が手書きされる場
合には、以下のような経験則がある。・漢字の“偏"と“旁"とのように、左右に分離できるパ
ターン内では一筆書きされ易い。・低画数の文字は一筆書きされ易い。・経験的,実験的あるいは指導的に一筆書きされる文字
がある。そこで、本実施例においては、上記経験則に基づいて１
文字のパターンを構成するサブパターンの定義を次のよ
うに定める。・基本的に左右に分離できるパターンをサブパターンと
する。・低画数のパターンおよび統計上一筆書きされ易いパタ
ーンは全体でサブパターンとする。Here, the standard patterns registered in the dictionary storage unit 10 are as follows so as to be able to deal with "entry", "bounce", "continuation character" and "collapse character" of the pen. Will be generated. Generally, when characters are handwritten, there are the following rules of thumb.・ It is easy to write with a single stroke in a pattern that can be separated into left and right, such as Kanji characters "bias" and "straw".・ Low stroke number characters are easy to write with one stroke.・ Some characters are written empirically, experimentally, or instructively. Therefore, in the present embodiment, 1
The definition of the sub-patterns that make up the character pattern is defined as follows. -Basically, a pattern that can be separated into left and right is a sub pattern.・ Patterns with a low stroke count and patterns that are statistically easy to write with a single stroke are sub-patterns.

【００５２】このような定義に従ってサブパターンを設
定し、さらに、各サブパターンの筆順違いや変形バリエ
ーションを用意(以下、個々の筆順違いのパターンや変
形バリエーションのパターンをもサブパターンと言う)
する。そして、各サブパターンにおけるオフストローク
を補間して、後に上述するストローク処理部５および特
徴再抽出部８による特徴抽出処理と同様の処理を行って
特徴点と特徴量を抽出し、特徴量が付加された特徴点の
パターンを標準パターンとして辞書格納部１０のサブパ
ターン辞書に登録する。図２は、上記サブパターンの一
例を示す。尚、図２(a)および図２(b)はサブパターン
「言」であり、図２(c)および図２(d)はサブパターン「日」
であり、図２(e)はサブパターン「早」である。尚、図中
の番号は、ストローク番号である。Sub-patterns are set according to such a definition, and further, the stroke order differences and deformation variations of each sub-pattern are prepared (hereinafter, individual stroke-order patterns and deformation variation patterns are also referred to as sub-patterns).
To do. Then, the off-stroke in each sub-pattern is interpolated, the same processing as the feature extraction processing by the stroke processing unit 5 and the feature re-extraction unit 8 described later is performed to extract the feature point and the feature amount, and the feature amount is added. The pattern of the extracted feature points is registered in the sub-pattern dictionary of the dictionary storage unit 10 as a standard pattern. FIG. 2 shows an example of the sub pattern. 2 (a) and 2 (b) are sub-patterns "words", and FIGS. 2 (c) and 2 (d) are sub-patterns "day".
2E is the sub-pattern “Haya”. The numbers in the figure are stroke numbers.

【００５３】ここで、本実施例におけるサブパターンと
して、基本的に左右に分離できるパターンを１つのサブ
パターンとするのは次の理由による。すなわち、図３に
示すように、上下に分離できるパターン「果」におけるパ
ターン「田」と「木」とは、一般的に「続き書き」され易い。
したがって、サブパターンとして「田」と「木」とを登録し
た場合には、図３(b)および図３(c)のような手書きパタ
ーンが入力された際には認識されないことになる。しか
しながら、本実施例においては、「果」を１つのサブパタ
ーンとして登録しており、しかも、その筆順違いをもサ
ブパターンとしてとしているので、「続き書き」されても
高い認識率を得ることができるのである。Here, as the sub-pattern in this embodiment, a pattern that can be basically divided into right and left is made one sub-pattern for the following reason. That is, as shown in FIG. 3, the patterns “rice” and “tree” in the pattern “fruit” that can be separated into upper and lower parts are generally easy to be “continued”.
Therefore, when "Tar" and "Thu" are registered as the sub-patterns, they will not be recognized when the handwritten pattern as shown in FIGS. 3B and 3C is input. However, in the present embodiment, the “result” is registered as one sub-pattern, and the difference in the stroke order is also used as the sub-pattern, so that a high recognition rate can be obtained even if “continuation” is performed. You can do it.

【００５４】尚、上述のように、統計上一筆書きされ易
いパターンを１つのサブパターンとして登録しているの
で、その変形バリエーションはある程度限られる。した
がって、１文字をＤＰマッチングの単位とする場合によ
りも変形バリエーション数は少なくてよい。つまり、本
実施例のサブパターンによれば、従来のようにストロー
クや１文字をＤＰマッチングの単位とする場合に比較し
て辞書の必要記憶容量を少なくできる。逆に言えば、同
じ記憶容量で従来よりも多くのパターンを登録できるの
で変形バリエーションを更に詳細に設定でき、高い認識
率を得ることができるのである。As described above, since a pattern that is statistically easy to write with one stroke is registered as one sub-pattern, its deformation variation is limited to some extent. Therefore, the number of deformation variations may be small even when one character is used as a unit for DP matching. That is, according to the sub-pattern of the present embodiment, the required storage capacity of the dictionary can be reduced as compared with the conventional case where a stroke or one character is used as a unit for DP matching. Conversely, since more patterns can be registered with the same storage capacity than before, the deformation variation can be set in more detail and a high recognition rate can be obtained.

【００５５】また、上記辞書格納部１０に登録されてい
る文字辞書は、図４に示すように、文字とこの文字を構
成するサブパターンとを対応付けたものであり、上記Ｄ
Ｐマッチングによって認識されたサブパターン候補を組
み合わせて文字候補を生成する際に参照される。尚、本
実施例においては上述のように複数ストロークの集合で
あるサブパターンの組み合わせで文字を構成しているの
で、従来のようにストロークの組み合わせで文字を構成
する場合よりも文字の構成要素数を大幅に減らすことが
でき、文字辞書の容量を少なくできるのである。As shown in FIG. 4, the character dictionary registered in the dictionary storage unit 10 is made by associating a character with a sub-pattern constituting this character, and
It is referred to when generating the character candidate by combining the sub-pattern candidates recognized by the P matching. In this embodiment, since the character is formed by the combination of the sub-patterns, which is a set of a plurality of strokes as described above, the number of constituent elements of the character is larger than that in the case of forming the character by the combination of strokes as in the conventional case. Can be greatly reduced, and the capacity of the character dictionary can be reduced.

【００５６】図５は、上記構成を有する手書き文字認識
装置によって実行される手書き認識処理動作のフローチ
ャートである。以下、図５に従って、手書き認識処理動
作に付いて詳細に説明する。FIG. 5 is a flow chart of the handwriting recognition processing operation executed by the handwritten character recognition device having the above configuration. Hereinafter, the handwriting recognition processing operation will be described in detail with reference to FIG.

【００５７】ステップＳ1で、上記タブレット２によっ
て、ペン入力があるか否かが判別される。そして、ペン
入力があればステップＳ2に進む。ステップＳ2で、上記
タブレット２によって、入力された文字の筆跡パターン
が座標データ列に変換される。そうすると、上記ストロ
ーク終了判定部４によってストロークの終了が判定され
てストローク終了符号が付加された座標データ列が、ス
トローク処理部５に送出される。ここで、上記ストロー
ク終了判定部４によるストローク終了の判定は、次のよ
うにして行われる。すなわち、タブレット２から順次送
出されてくる座標データ列が取り込まれ、この取り込み
間隔がタブレット２における走査タイミングの間隔を越
えたことによってペンアップが検知されて、１つのスト
ロークが終了したと判定される。そして、既に取り込ま
れた座標データ列の最終にストローク終了符号が付加さ
れるのである。In step S1, the tablet 2 determines whether or not there is a pen input. If there is a pen input, the process proceeds to step S2. In step S2, the tablet 2 converts the input handwriting pattern of the character into a coordinate data string. Then, the stroke end determination unit 4 determines the end of the stroke, and the coordinate data string to which the stroke end code is added is sent to the stroke processing unit 5. Here, the stroke end determination by the stroke end determination unit 4 is performed as follows. That is, the coordinate data sequence sequentially sent out from the tablet 2 is fetched, and the pen-up is detected when the fetching interval exceeds the scanning timing interval in the tablet 2, and it is determined that one stroke is completed. . Then, the stroke end code is added to the end of the coordinate data sequence that has already been fetched.

【００５８】ステップＳ3で、上記ストローク処理部５
によって、ストローク終了判定部４で付加されたストロ
ーク終了符号が参照されて特徴抽出処理が行われ、スト
ローク単位で抽出された特徴点列が特徴格納部６に格納
される。ここで、上記ストローク処理部５は、次のよう
にしてストロークの特徴抽出を行う。すなわち、先ず、
上記取り込まれた座標データ列からストローク終了符号
を検出し、このストローク終了符号の次に取り込まれた
座標データをストロークの始点の座標とする。次に、上
記始点に注目して、この始点から次の座標点までの距離
を算出する。そして、この距離が所定距離Ｎlmin(例え
ば０.２５mm)以下の場合には、さらに次の座標点までの
距離を算出する。こうして、距離が上記所定距離Ｎlmin
より大きくなった際の座標点に注目点を移行する。以
後、同様の処理を次のストローク終了符号を取り込むま
で行い、得られた注目点を特徴点とし、その座標を特徴
量の一つとする。In step S3, the stroke processing unit 5
Thus, the stroke end code added by the stroke end determination unit 4 is referred to, the feature extraction processing is performed, and the feature point sequence extracted in stroke units is stored in the feature storage unit 6. Here, the stroke processing unit 5 performs stroke feature extraction as follows. That is, first,
A stroke end code is detected from the fetched coordinate data string, and the coordinate data fetched next to this stroke end code is used as the coordinates of the start point of the stroke. Next, paying attention to the starting point, the distance from this starting point to the next coordinate point is calculated. If this distance is less than or equal to the predetermined distance Nlmin (for example, 0.25 mm), the distance to the next coordinate point is calculated. Thus, the distance is the above-mentioned predetermined distance Nlmin.
The point of interest is moved to the coordinate point when it becomes larger. After that, similar processing is performed until the next stroke end code is fetched, the obtained target point is set as a feature point, and its coordinates are set as one of the feature amounts.

【００５９】次に、図６(a)に示すように、ストローク
を構成する特徴点列における始点Ｐ０の次の特徴点Ｐ１
を注目点とし、始点Ｐ0から注目点(特徴点Ｐ1)への方向
と注目点Ｐ1から特徴点Ｐ2への方向との差を算出し、こ
の差の値が所定角度Ｎdist(例えば３０度)より小さい場
合には注目点Ｐ1を除去する。以下、上記注目点を各特
徴点Ｐ2,Ｐ3,…,Ｐn,…へ順次移行して、特徴点Ｐn-1か
ら注目点Ｐnへの方向と注目点Ｐnから特徴点Ｐn+1への
方向との差を算出し、この差の値に基づいて注目点Ｐn
を除去するか否かの判定を行う。その結果、図７(a)に
示すような入力座標列から図７(b)に示すような特徴点
列が作成される。こうして、正規化によって特徴が失わ
れる前にストロークの屈曲部の特徴点を抽出することに
よって、正規化後にも屈曲部の特徴が保持されるのであ
る。Next, as shown in FIG. 6A, the characteristic point P1 next to the starting point P0 in the characteristic point sequence forming the stroke
Is set as the point of interest, the difference between the direction from the starting point P0 to the point of interest (feature point P1) and the direction from the point of interest P1 to the feature point P2 is calculated, and the value of this difference is calculated from the predetermined angle Ndist (for example, 30 degrees). If it is smaller, the point of interest P1 is removed. Hereinafter, the above-mentioned point of interest is sequentially shifted to each of the feature points P2, P3, ..., Pn, ..., The direction from the feature point Pn-1 to the point of interest Pn and the direction from the point of interest Pn to the feature point Pn + 1. Is calculated, and the point of interest Pn is calculated based on the value of this difference.
Is determined. As a result, a feature point sequence as shown in FIG. 7B is created from the input coordinate sequence as shown in FIG. 7A. In this way, by extracting the characteristic points of the bent portion of the stroke before the characteristic is lost by the normalization, the characteristic of the bent portion is retained even after the normalization.

【００６０】ステップＳ4で、上記特徴再抽出部８によ
って、文字終了判定部７からの文字終了情報に基づいて
１文字入力が終了したか否かが判別される。その結果、
終了していればステップＳ5に進む。一方、終了してい
なければ上記ステップＳ1に戻って、文字入力およびス
トロークの特徴抽出が続行される。尚、上記文字終了判
定部７による文字終了判定は次のようにして行われる。
すなわち、タブレット２から順次送出されてくる座標デ
ータ列を取り込む際に、最後に座標データを取り込んで
から次の座標データを取り込むまで所定時間Ｎtime(例
えば１秒)が経過したことによって１つの手書き文字に
係る座標データ列の終了を検知する。そして、文字終了
を表す上記文字終了情報を特徴再抽出部８に送出するの
である。In step S4, the feature re-extracting section 8 determines whether or not one character has been input based on the character end information from the character end determining section 7. as a result,
If it is completed, the process proceeds to step S5. On the other hand, if not completed, the process returns to step S1 to continue the character input and the stroke feature extraction. The character end determination by the character end determination unit 7 is performed as follows.
That is, when the coordinate data sequence sequentially sent from the tablet 2 is fetched, one handwritten character is generated because a predetermined time Ntime (for example, 1 second) has elapsed from the last coordinate data fetch to the next coordinate data fetch. The end of the coordinate data sequence related to is detected. Then, the character end information indicating the end of the character is sent to the feature re-extracting section 8.

【００６１】ステップＳ5で、上記特徴再抽出部８によ
って、特徴格納部６に格納された特徴点の座標データ列
から、上記文字終了情報に基づいて、１文字区間内のス
トロークの特徴点に付加されている座標データ列が読み
出される。そして、以下のようにして特徴再抽出処理が
行われる。In step S5, the feature re-extracting unit 8 adds the feature point coordinate data stored in the feature storing unit 6 to the feature point of the stroke in one character section based on the character end information. The coordinate data string being read is read. Then, the feature re-extraction processing is performed as follows.

【００６２】上記特徴格納部６から読み出された複数の
特徴点が、入力されたストロークの順に並べられる。そ
して、最初のストロークから最終ストロークに向かって
順次ストロークを接続し、順次得られるストローク連鎖
毎に正規化する。この正規化は、次のようにして行われ
る。すなわち、対象となるストローク連鎖におけるｘ軸
の最大座標値ｘmaxおよびｙ軸の最大座標値ｙmaxが求め
られる。さらに、ｘ軸の最小座標値ｘminおよびｙ軸の
最小座標値ｙminが求められる。そして、式(１),(２)に
従って、各特徴点の座標がｘ軸,ｙ軸夫々に０〜Ｎnorm
(例えば１２８ドット)の範囲に分布するように正規化さ
れる。ｘn'＝{(ｘn−ｘmin)/(ｘmax−ｘmin)}・Ｎnorm …（１）ｙn'＝{(ｙn−ｙmin)/(ｙmax−ｙmin)}・Ｎnorm …（２）ここで、ｘn ：特徴点ｎのｘ座標ｘn'：特徴点ｎの正規化後のｘ座標ｙn ：特徴点ｎのｙ座標ｙn'：特徴点ｎの正規化後のｙ座標A plurality of feature points read from the feature storage unit 6 are arranged in the order of input strokes. Then, the strokes are sequentially connected from the first stroke to the final stroke, and normalized for each sequentially obtained stroke chain. This normalization is performed as follows. That is, the maximum coordinate value xmax of the x axis and the maximum coordinate value ymax of the y axis in the target stroke chain are obtained. Further, the minimum coordinate value xmin of the x-axis and the minimum coordinate value ymin of the y-axis are obtained. Then, according to the equations (1) and (2), the coordinates of each feature point are 0 to Nnorm on the x-axis and the y-axis, respectively.
It is normalized so as to be distributed in the range (for example, 128 dots). xn ′ = {(xn−xmin) / (xmax−xmin)} · Nnorm (1) yn ′ = {(yn−ymin) / (ymax−ymin)} · Nnorm (2) where xn: feature X coordinate of point n xn ': x coordinate of feature point n after normalization yn: y coordinate of feature point n yn': y coordinate after normalization of feature point n

【００６３】上述のようにして正規化された各特徴点に
おける注目特徴点と直後の特徴点との間の距離が求めら
れる。この距離が所定距離Ｎfmax(例えば４mm)以上であ
る場合には、当該特徴点間に相当する座標データ列の中
から選択して正規化した特徴点を新たに追加して、当該
区間における特徴点間の距離が所定距離Ｎflen(例えば
２mm)以下になるようにする。こうして、上記屈曲部の
特徴点間を補間して等間隔近似を行うことによって、よ
り適切に特徴点を抽出でき、以後のＤＰマッチング時間
を短縮すると共に高い認識率を得ることができる。ま
た、上記距離が所定距離Ｎfmin(例えば１mm)以下であっ
て、且つ、上記注目特徴点から直前の注目特徴点への方
向と注目特徴点から直後の特徴点への方向との方向差の
うち小さい方の値が所定方向差Ｎddst(例えば９０度)よ
り小さい場合には、注目特徴点はノイズであると見なし
て特徴点列から削除する。The distance between the feature point of interest and the feature point immediately after it in each feature point normalized as described above is obtained. If this distance is equal to or greater than the predetermined distance Nfmax (for example, 4 mm), a feature point selected from the coordinate data sequence corresponding to the feature points and normalized is newly added to the feature point in the section. The distance between them is set to a predetermined distance Nflen (for example, 2 mm) or less. Thus, by interpolating between the characteristic points of the bent portion and performing equal-interval approximation, the characteristic points can be extracted more appropriately, and the subsequent DP matching time can be shortened and a high recognition rate can be obtained. Further, among the directional differences between the direction from the target feature point to the immediately preceding target feature point and the direction from the target feature point to the immediately subsequent feature point, when the distance is equal to or less than a predetermined distance Nfmin (for example, 1 mm). When the smaller value is smaller than the predetermined direction difference Nddst (for example, 90 degrees), the target feature point is regarded as noise and is deleted from the feature point sequence.

【００６４】ここで、上記ストローク連鎖が形成される
に際して、ストローク間に上記オフストロークが存在す
る場合には、オフストローク区間には実際にストローク
が存在しているものと想定して、当該オフストローク区
間を所定距離Ｎflen以下になるように等分割する特徴点
が追加される。このように、上記オフストローク区間を
補間することによって、筆者が意図しない「入り」や「跳
ね」の部分あるいは「続け文字」や「崩し文字」の特徴をも
適確に抽出できるのである。Here, when the above-mentioned stroke chain is formed, if the above-mentioned off-stroke exists between the strokes, it is assumed that the stroke actually exists in the off-stroke section, and the off-stroke is concerned. Feature points that divide the section into equal parts or less Nflen or less are added. In this way, by interpolating the off-stroke section, it is possible to accurately extract the "entry" or "bounce" portion or the "continuation character" or "collapse character" that the writer does not intend.

【００６５】以上のような特徴点の追加/削除動作が、
新たに特徴点が追加されなくなるまで繰り返される。そ
の結果、図７(b)に示すような特徴点列から、図７(c)お
よび図７(d)に示すようなｘ座標,ｙ座標および絶対方向
の３要素を有する特徴点列が作成されるのである。尚、
図７(c)は第１ストロークａ1と第２ストロークｂ1との
連鎖の特徴点列の例であり、破線ｅは補間されたオフス
トローク区間を示す。また、上記絶対方向は、各特徴点
における次特徴点への方向であり、基準方向からのずれ
の度合いで表す。The above-mentioned feature point addition / deletion operations are
The process is repeated until no new feature point is added. As a result, a feature point sequence having three elements of x-coordinate, y-coordinate and absolute direction as shown in FIGS. 7 (c) and 7 (d) is created from the feature point sequence as shown in FIG. 7 (b). Is done. still,
FIG. 7C shows an example of a series of characteristic points in the chain of the first stroke a1 and the second stroke b1, and the broken line e shows the interpolated off-stroke section. The absolute direction is the direction of each feature point to the next feature point, and is represented by the degree of deviation from the reference direction.

【００６６】ステップＳ6で、上記文字認識部９によっ
て、上記ストローク連鎖から得られた特徴点列と辞書格
納部１０のサブパターン辞書に登録されている標準パタ
ーンとの重み付けＤＰマッチング処理が行われる。本実
施例におけるＤＰマッチングの際に用いる特徴パターン
と標準パターンとの対応特徴点間のマッチング距離(以
下、部分マッチング距離と言う)Δｄは、上記特徴パタ
ーンと標準パターンとの対応特徴点間のユークリッド距
離ΔＬと方向差ΔＨとを用いて式(３)で求める。 Δｄ＝Ｗp・ΔＬ＋Ｗd・ΔＨ …（３）但し、Ｗp：距離に係る重み(例えば、全特徴点間でＮwl
en（＝１)) 但し、Ｗd：方向に係る重み(例えば、全特徴点間でＮwd
st（＝８))In step S6, the character recognition unit 9 performs a weighted DP matching process between the feature point sequence obtained from the stroke chain and the standard pattern registered in the sub-pattern dictionary of the dictionary storage unit 10. The matching distance (hereinafter referred to as partial matching distance) Δd between the corresponding feature points of the characteristic pattern and the standard pattern used in the DP matching in the present embodiment is the Euclidean between the corresponding feature points of the characteristic pattern and the standard pattern. The distance ΔL and the direction difference ΔH are used to obtain the value by the equation (3). Δd = Wp · ΔL + Wd · ΔH (3) However, Wp: Weight related to distance (for example, Nwl between all feature points
en (= 1)) However, Wd: weight related to the direction (for example, Nwd between all feature points)
st (= 8))

【００６７】尚、上記ユークリッド距離ΔＬは式(４)で
与えられる。また、上記方向差ΔＨは、注目特徴点の直
前の特徴点から直後の特徴点への方向間の差であり、式
(５)で与えられる。ここで、始点と終点とにおける方向
差ΔＨの値は“０"とする。 ΔＬ＝{(ｘm−ｘn)²＋(ｙm−ｙn)²}^1/2 …（４） ΔＨ＝|ｄn−ｄm| …（５）但し、ｘn：特徴パターン上における特徴点ｎのｘ座標ｙn：特徴点ｎのｙ座標ｘm：標準パターン上における特徴点ｍのｘ座標ｙm：特徴点ｍのｙ座標ｄn：特徴点ｎの直前の特徴点(ｎ−１)から直後の特徴
点(ｎ＋１)への方向ｄm：特徴点ｍの直前の特徴点(ｍ−１)から直後の特徴
点(ｍ＋１)への方向The Euclidean distance ΔL is given by the equation (4). Further, the direction difference ΔH is a difference between the directions from the feature point immediately before the feature point of interest to the feature point immediately after,
It is given in (5). Here, the value of the direction difference ΔH between the start point and the end point is “0”. ΔL = {(xm−xn) ² + (ym−yn) ² } ^1/2 (4) ΔH = | dn−dm | (5) where xn: x coordinate yn of the characteristic point n on the characteristic pattern : Y coordinate of the feature point n xm: x coordinate of the feature point m on the standard pattern ym: y coordinate of the feature point m dn: feature point (n-1) immediately before the feature point n to the feature point (n + 1) immediately after the feature point n Direction dm: direction from the feature point (m-1) immediately before the feature point m to the feature point (m + 1) immediately after

【００６８】そして、上記部分マッチング距離Δｄを用
いた重み付けＤＰマッチングによるマッチングパスに沿
った距離の総和(以下、単に「マッチング距離」と言う)Ｄ
に基づいて、サブパターン候補ラティスが生成される。Then, the sum of the distances along the matching path by the weighted DP matching using the partial matching distance Δd (hereinafter, simply referred to as "matching distance") D
Sub-pattern candidate lattices are generated based on

【００６９】ここで、上記特徴再抽出部８による特徴再
抽出処理と文字認識部９による重み付けＤＰマッチング
とは、上記ストローク連鎖を順次成長させながら連続し
て行われるのである。尚、その際におけるストローク連
鎖の成長は、上記サブパターン辞書に登録されているサ
ブパターンを構成するストローク数を考慮して設定され
た所定ストローク数(例えば、１６ストローク)までとす
る。以下、図７(b),図７(c)および図７(d)に従って、上
記特徴再抽出処理および重み付けＤＰマッチングとに付
いて、より具体的に説明する。Here, the feature re-extracting process by the feature re-extracting unit 8 and the weighted DP matching by the character recognizing unit 9 are continuously performed while sequentially growing the stroke chain. Note that the growth of the stroke chain at that time is up to a predetermined stroke number (for example, 16 strokes) set in consideration of the stroke numbers forming the sub patterns registered in the sub pattern dictionary. Hereinafter, the feature re-extraction processing and the weighted DP matching will be described more specifically with reference to FIGS. 7B, 7C, and 7D.

【００７０】先ず、上記特徴再抽出部８は、サブパター
ン「月」を有する入力文字パターンに基づいてストローク
処理部５で作成された特徴点列(図７(b)参照)のうち、
第１ストロークａ1の特徴点列を読み出し、上述のよう
な正規化および特徴再抽出処理を行って第１ストローク
ａ1の特徴パターンａ2(図７(c)参照)を得る。次に、文
字認識部９は、第１ストロークａ1の特徴パターンａ2と
上記サブパターン辞書に登録されている標準パターンと
の重み付けＤＰマッチングを行ってマッチング距離Ｄが
所定値以下のサブパターン候補を得、得られたサブパタ
ーン候補をマッチング距離Ｄを付加して内部バッファに
記憶する。First, the feature re-extracting section 8 selects one of the feature point sequences (see FIG. 7B) created by the stroke processing section 5 based on the input character pattern having the sub-pattern "month".
The characteristic point sequence of the first stroke a1 is read out, and the above-described normalization and characteristic re-extraction processing is performed to obtain the characteristic pattern a2 of the first stroke a1 (see FIG. 7C). Next, the character recognition unit 9 performs weighted DP matching between the characteristic pattern a2 of the first stroke a1 and the standard pattern registered in the sub-pattern dictionary to obtain a sub-pattern candidate whose matching distance D is a predetermined value or less. The obtained sub-pattern candidate is added to the matching distance D and stored in the internal buffer.

【００７１】次に、上記特徴再抽出部８は、第１ストロ
ークａ1と第２ストロークｂ1の特徴点列に対して正規化
を行った後、第１ストロークａ1と第２ストロークｂ1と
の間にオフストローク区間が存在するので当該オフスト
ローク区間を補間する。そして、特徴再抽出処理を行っ
て、第１ストロークａ2,オフストローク区間ｅおよび第
２ストロークｂ2から成るストローク連鎖の特徴パター
ンを得る。そして、文字認識部９で重み付けＤＰマッチ
ングを行ってマッチング距離Ｄが所定値以下のサブパタ
ーン候補を得、得られたサブパターン候補をマッチング
距離Ｄを付加して内部バッファに記憶する。Next, the feature re-extracting section 8 normalizes the feature point sequence of the first stroke a1 and the second stroke b1 and then, between the first stroke a1 and the second stroke b1. Since there is an off-stroke section, the off-stroke section is interpolated. Then, characteristic re-extraction processing is performed to obtain a characteristic pattern of a stroke chain composed of the first stroke a2, the off-stroke section e, and the second stroke b2. Then, the character recognition unit 9 performs weighted DP matching to obtain a sub-pattern candidate whose matching distance D is a predetermined value or less, and adds the matching distance D to the obtained sub-pattern candidate and stores it in an internal buffer.

【００７２】以下、同様にして、第１ストロークａ1と
第２ストロークｂ1とのストローク連鎖に順次ストロー
クｃ1,ｄ1を付加し、オフストローク区間ｆ,ｇで補間し
て、特徴再抽出処理および重み付けＤＰマッチングを行
う。このような処理が、ストローク連鎖数が所定ストロ
ーク数“１６"に至るか、あるいは、特徴点列が終了す
るまで続行されて、文字認識部９の上記内部バッファに
はサブパターン候補ラティスが生成されるのである。図
７(d)は、こうして、上記特徴再抽出部８によって最終
的に抽出されたサブパターン「月」の特徴パターンであ
る。Thereafter, similarly, strokes c1 and d1 are sequentially added to the stroke chain of the first stroke a1 and the second stroke b1, and the strokes are interpolated in the off-stroke sections f and g, and the feature re-extraction processing and weighting DP are performed. Match. Such processing is continued until the number of stroke chains reaches the predetermined number of strokes "16" or the feature point sequence ends, and a sub pattern candidate lattice is generated in the internal buffer of the character recognition unit 9. It is. FIG. 7D is the characteristic pattern of the sub-pattern “month” finally extracted by the characteristic re-extracting unit 8 in this way.

【００７３】ステップＳ7で、上記文字認識部９によっ
て、上記ステップＳ6における重み付けＤＰマッチング
の結果得られたサブパターン候補ラティスに基づいて、
辞書格納部１０に登録されている上記文字辞書が参照さ
れて文字候補が生成される。そして、各文字候補を構成
するサブパターンに係るマッチング距離Ｄの合計値が最
も小さい文字候補から順に、例えば１０位までの文字候
補が認識結果として出力される。ステップＳ8で、文字
「や」と文字「ゃ」のように、ストロークの形状では区別で
きない文字に対する特別分類処理が実行される。例え
ば、上記文字「や」と文字「ゃ」との分類の場合には、全く
形状は同じであるから、正規化後に抽出された特徴パタ
ーンを用いるＤＰマッチングでは識別が不可能である。
そこで、上記ステップＳ3においてストローク処理部５
によって抽出された特徴点の座標に基づいて認識対象文
字のｙ軸方向の最大距離(文字の高さ)が求められ、この
文字の高さが直前に認識された文字の１/２以下であれ
ば小文字「ゃ」が第１位の認識結果候補であると決定され
る。In step S7, based on the sub-pattern candidate lattice obtained as a result of the weighted DP matching in step S6 by the character recognition unit 9,
The character dictionary registered in the dictionary storage unit 10 is referenced to generate character candidates. Then, from the character candidate having the smallest total value of the matching distances D associated with the sub-patterns forming each character candidate, for example, the character candidates up to the tenth place are output as the recognition result. In step S8, special classification processing is performed on characters such as the characters "ya" and "ya" that cannot be distinguished by the shape of the stroke. For example, in the case of the classification of the characters “ya” and the characters “ya”, since the shapes are completely the same, it is impossible to identify them by DP matching using the characteristic pattern extracted after normalization.
Therefore, in step S3, the stroke processing unit 5
The maximum distance in the y-axis direction (character height) of the character to be recognized is calculated based on the coordinates of the feature points extracted by, and the height of this character must be less than 1/2 of the character recognized immediately before. For example, the small letter “ya” is determined to be the first recognition result candidate.

【００７４】ステップＳ9で、上記結果表示部３によっ
て、タブレット２の下側に一体に積層されている表示パ
ネル上に第１位の認識結果候補が表示される。ステップ
Ｓ10で、手書き認識処理動作の終了が指示されているか
否かが判別される。その結果、指示されていなければ上
記ステップＳ1に戻って次のペン入力があるのを待ち、
指示されていれば手書き認識処理動作を終了する。In step S9, the result display unit 3 displays the first recognition result candidate on the display panel integrally laminated on the lower side of the tablet 2. In step S10, it is determined whether or not the end of the handwriting recognition processing operation has been instructed. As a result, if not instructed, the process returns to step S1 and waits for the next pen input,
If instructed, the handwriting recognition processing operation ends.

【００７５】上述のように、本実施例においては、上記
ストローク処理部５によって、入力されたストローク毎
に座標点間の方向差が所定値Ｎdistより大きい座標点を
特徴点として抽出するようにしている。つまり、本実施
例においては、正規化される前に屈曲部の特徴を表す特
徴点と特徴量を予め抽出するようにしている。したがっ
て、後の正規化に際して屈曲部の特徴が失われたとして
も、屈曲部を表す特徴量と特徴点を確実に抽出すること
ができるのである。As described above, in the present embodiment, the stroke processing unit 5 extracts the coordinate points whose direction difference between the coordinate points is larger than the predetermined value Ndist for each input stroke as feature points. There is. That is, in the present embodiment, the characteristic points and the characteristic amounts representing the characteristic of the bent portion are extracted in advance before being normalized. Therefore, even if the characteristic of the bent portion is lost during the later normalization, the feature amount and the characteristic point representing the bent portion can be reliably extracted.

【００７６】さらに、上記特徴再抽出部８によって、１
文字パターンを構成するストローク連鎖毎に、正規化さ
れた後の特徴点間の距離が所定値Ｎfminより大きい特徴
点間を補間し、ノイズを除去する。そして更に、上記オ
フストローク区間を等分割するように特徴点で補間する
ようにしている。このように、上記正規化後に再度特徴
点を抽出して等間隔近似を行うようにしている。つま
り、本実施例における特徴点の抽出は、上述した従来の
特徴抽出方法(２)の等分割法と特徴抽出方法(３)の角度
法とを併用したものである。したがって、上記従来の特
徴抽出方法(２),(３)の欠点を補って最適に特徴点を抽
出でき、後のＤＰマッチングの精度を高め時間を短縮で
きる。また、オフストロークを補間することによって、
上記ペンの「入り」や「跳ね」あるいは「続け文字」や「崩し
文字」にも対処できるようになる。Further, the feature re-extracting section 8 sets 1
Noise is removed by interpolating between feature points in which the distance between normalized feature points is greater than a predetermined value Nfmin for each stroke chain forming a character pattern. Further, the off-stroke section is interpolated at the characteristic points so as to be equally divided. In this way, the feature points are extracted again after the above-mentioned normalization, and the equal interval approximation is performed. That is, the extraction of the feature points in this embodiment is a combination of the equal division method of the conventional feature extraction method (2) and the angle method of the feature extraction method (3) described above. Therefore, the feature points can be optimally extracted by compensating for the drawbacks of the conventional feature extraction methods (2) and (3), and the accuracy of the subsequent DP matching can be improved and the time can be shortened. Also, by interpolating the off stroke,
It becomes possible to deal with "entry" and "bounce" or "continuation character" and "collapse character" of the above pen.

【００７７】さらに、上記文字認識部９によって、一筆
書きされ易いサブパターン単位での標準パターンを用い
たＤＰマッチングを行うので、生成されるラティス数が
少なく、文字候補への展開を容易に且つ高速に行うこと
ができる。したがって、本実施例によれば、認識時間を
短縮し、認識率を高めることができる。また、上記サブ
パターン辞書および文字辞書の記憶容量を少なくでき
る。Furthermore, since the character recognition unit 9 performs DP matching using a standard pattern in units of sub-patterns that are easy to write with one stroke, the number of lattices generated is small, and expansion into character candidates is easy and fast. Can be done. Therefore, according to this embodiment, the recognition time can be shortened and the recognition rate can be increased. Also, the storage capacity of the sub-pattern dictionary and the character dictionary can be reduced.

【００７８】＜第２実施例＞本実施例は、第１実施例に
おける上記文字認識部９で重み付けＤＰマッチングの際
に用いられるユークリッド距離ΔＬと方向差ΔＨの生成
に関する。<Second Embodiment> This embodiment relates to generation of the Euclidean distance ΔL and the direction difference ΔH used in the weighted DP matching in the character recognition unit 9 in the first embodiment.

【００７９】図８は、本実施例における手書き認識装置
のブロック図である。タブレット１２,結果表示部１３,
ストローク終了判定部１４,ストローク処理部１５,特徴
格納部１６,文字終了判定部１７,特徴再抽出部１８およ
び辞書格納部２０は、第１実施例におけるタブレット
２,結果表示部３,ストローク終了判定部４,ストローク
処理部５,特徴格納部６,文字終了判定部７,特徴再抽出
部８および辞書格納部１０と同じ機能を有して同様に動
作する。FIG. 8 is a block diagram of the handwriting recognition device in this embodiment. Tablet 12, result display unit 13,
The stroke end determination unit 14, the stroke processing unit 15, the feature storage unit 16, the character end determination unit 17, the feature re-extraction unit 18, and the dictionary storage unit 20 are the tablet 2, the result display unit 3, and the stroke end determination in the first embodiment. The unit 4, the stroke processing unit 5, the feature storage unit 6, the character end determination unit 7, the feature re-extraction unit 8 and the dictionary storage unit 10 have the same functions and operate in the same manner.

【００８０】記憶部２１には、文字認識部１９によって
ユークリッド距離ΔＬおよび方向差ΔＨを生成する際に
用いられる絶対差テーブル,距離テーブルおよび方向差
テーブルが格納されている。上記絶対差テーブルおよび
距離テーブルはユークリッド距離ΔＬ生成の際に用いら
れる。ここで、ユークリッド距離ΔＬを求める際の特徴
点は正規化後の特徴点であるから、特徴点のｘ,ｙ座標
は上記０〜Ｎnormの範囲内に収まることになる。そこ
で、例えば、Ｎnorm＝１２８ドットである場合には、図
９(a)および図９(b)に示すように、特徴パターン上にお
ける特徴点ｎと標準パターン上における点ｍとの座標の
差の絶対値を要素とする１２８×１２８の絶対差テーブ
ルをｘ座標およびｙ座標毎に作成する。また、上記座標
差の絶対値の値は０〜１２７の範囲内に収まる。そこ
で、図９(c)に示すようなユークリッド距離ΔＬを要素
とする１２８×１２８の距離テーブルを作成するのであ
る。The storage unit 21 stores an absolute difference table, a distance table and a direction difference table used when the character recognition unit 19 generates the Euclidean distance ΔL and the direction difference ΔH. The absolute difference table and the distance table are used when generating the Euclidean distance ΔL. Here, since the characteristic points when the Euclidean distance ΔL is obtained are the normalized characteristic points, the x and y coordinates of the characteristic points fall within the range of 0 to Nnorm. Therefore, for example, when Nnorm = 128 dots, as shown in FIGS. 9A and 9B, the difference between the coordinates of the feature point n on the feature pattern and the coordinate m on the standard pattern is calculated. A 128 × 128 absolute difference table having absolute values as elements is created for each x coordinate and y coordinate. The absolute value of the coordinate difference is within the range of 0 to 127. Therefore, a 128 × 128 distance table having the Euclidean distance ΔL as an element as shown in FIG. 9C is created.

【００８１】上記方向差テーブルは方向差ΔＨ生成の際
に用いられる。ここで、上記特徴パターン上における特
徴点ｎあるいは標準パターン上における特徴点ｍの絶対
方向を、ある方向を“０"として１回転が３６０度にな
るように定義し、“０〜３５９"の整数値で表現する。
そして、図１０に示すように、上記特徴パターン上にお
ける特徴点(ｎ−１)と標準パターン上における特徴点
(ｍ−１)との方向差ΔＨを要素とする３６０×３６０の
方向差テーブルを作成する。The direction difference table is used when generating the direction difference ΔH. Here, the absolute direction of the characteristic point n on the characteristic pattern or the characteristic point m on the standard pattern is defined so that one rotation is 360 degrees with a certain direction being "0", and an absolute value of "0-359" is set. Express with a numerical value.
Then, as shown in FIG. 10, the feature point (n-1) on the feature pattern and the feature point on the standard pattern
A 360 × 360 direction difference table having the direction difference ΔH from (m−1) as an element is created.

【００８２】上記構成において、文字認識部１９は、以
下のようにしてユークリッド距離ΔＬと方向差ΔＨとを
生成する。すなわち、先ず、特徴パターン上における特
徴点ｎのｘ座標“ｘn"と標準パターン上における特徴点
ｍのｘ座標“ｘm"との差の絶対値を上記絶対差テーブル
を引いて求める。同様にして、当該特徴点ｎのｙ座標
“ｙn"と当該特徴点ｍのｙ座標“ｙm"との差の絶対値を
上記絶対差テーブルを引いて求める。そして、得られた
両絶対値“|ｘn−ｘm|"と“|ｙn−ｙm|"に基づいて、上
記距離テーブルを引いてユークリッド距離ΔＬを求め
る。次に、上記特徴パターン上の特徴点(ｎ−１)におけ
る特徴点(ｎ＋１)への方向ｄnと標準パターン上の特徴
点(ｍ−１)における特徴点(ｍ＋１)への方向ｄmとの差
ΔＨを上記方向差テーブルを引いて求めるのである。In the above structure, the character recognition unit 19 generates the Euclidean distance ΔL and the direction difference ΔH as follows. That is, first, the absolute value of the difference between the x coordinate "xn" of the characteristic point n on the characteristic pattern and the x coordinate "xm" of the characteristic point m on the standard pattern is obtained by looking up the absolute difference table. Similarly, the absolute value of the difference between the y-coordinate “yn” of the characteristic point n and the y-coordinate “ym” of the characteristic point m is obtained by drawing the absolute difference table. Then, the Euclidean distance ΔL is obtained by drawing the distance table based on the obtained absolute values “| xn−xm |” and “| yn−ym |”. Next, the difference between the direction dn toward the feature point (n + 1) at the feature point (n-1) on the feature pattern and the direction dm toward the feature point (m + 1) at the feature point (m-1) on the standard pattern. ΔH is obtained by drawing the direction difference table.

【００８３】以後は、上述のようにして得られたユーク
リッド距離ΔＬと方向差ΔＨとを用いて式(３)に従って
部分マッチング距離Δｄを算出し、この部分マッチング
距離Δｄを用いてＤＰマッチングを行うのである。After that, the Euclidean distance ΔL and the direction difference ΔH obtained as described above are used to calculate the partial matching distance Δd according to the equation (3), and DP matching is performed using this partial matching distance Δd. Of.

【００８４】このように、本実施例においては、上記絶
対差テーブル,距離テーブルおよび方向差テーブルが登
録された記憶部２１を設ける。そして、文字認識部１９
は、上記絶対差テーブルおよび距離テーブルを用いて、
特徴パターン上の特徴点ｎと標準パターン上の特徴点ｍ
とのユークリッド距離ΔＬを求める。さらに、上記方向
差テーブルを用いて、特徴パターン上の特徴点(ｎ−１)
から特徴点(ｎ＋１)への方向と標準パターン上の特徴点
(ｍ−１)から特徴点(ｍ＋１)への方向との方向差ΔＨを
求めるようにしている。したがって、本実施例によれ
ば、上記記憶部２１に格納された夫々のテーブルをアク
セスするという簡単な処理によってユークリッド距離Δ
Ｌと方向差ΔＨとを求めることができ、部分マッチング
距離Δｄの生成処理速度、延いてはＤＰマッチング処理
速度の高速化を図ることができる。As described above, in this embodiment, the storage unit 21 in which the absolute difference table, the distance table and the direction difference table are registered is provided. Then, the character recognition unit 19
Using the above absolute difference table and distance table,
Feature point n on the feature pattern and feature point m on the standard pattern
And the Euclidean distance ΔL is calculated. Further, using the direction difference table, the feature point (n-1) on the feature pattern
To the feature point (n + 1) and the feature point on the standard pattern
The direction difference ΔH from the direction from (m−1) to the feature point (m + 1) is calculated. Therefore, according to the present embodiment, the Euclidean distance Δ can be obtained by a simple process of accessing each table stored in the storage unit 21.
It is possible to obtain L and the direction difference ΔH, and it is possible to increase the generation processing speed of the partial matching distance Δd, and thus the DP matching processing speed.

【００８５】＜第３実施例＞本実施例は、第１実施例に
おける上記文字認識部９で重み付けＤＰマッチングを行
う際に用いられる重みの変化に関する。<Third Embodiment> This embodiment relates to a change in weight used when performing the weighted DP matching in the character recognition unit 9 in the first embodiment.

【００８６】図１１は、本実施例における手書き認識装
置のブロック図である。タブレット３２,結果表示部３
３,ストローク終了判定部３４,ストローク処理部３５,
特徴格納部３６,文字終了判定部３７および特徴再抽出
部３８は、第１実施例におけるタブレット２,結果表示
部３,ストローク終了判定部４,ストローク処理部５,特
徴格納部６,文字終了判定部７および特徴再抽出部８と
同じ機能を有して同様に動作する。FIG. 11 is a block diagram of the handwriting recognition apparatus in this embodiment. Tablet 32, result display unit 3
3, stroke end determination unit 34, stroke processing unit 35,
The feature storage unit 36, the character end determination unit 37, and the feature re-extraction unit 38 are the tablet 2, the result display unit 3, the stroke end determination unit 4, the stroke processing unit 5, the feature storage unit 6, and the character end determination in the first embodiment. It has the same functions as the unit 7 and the feature re-extracting unit 8 and operates in the same manner.

【００８７】本実施例においては、上記標準パターンに
おける特徴点(ｍ−１)〜(ｍ＋１)間(以下、区間ｍと言
う)毎に重みを求め、この求められた重みを当該標準パ
ターンの特徴点ｍに付加して辞書格納部４０に登録する
のである。In the present embodiment, the weight is calculated for each of the feature points (m-1) to (m + 1) (hereinafter referred to as section m) in the standard pattern, and the obtained weight is the feature of the standard pattern. It is added to the point m and registered in the dictionary storage unit 40.

【００８８】以下、上記各区間毎の重みの算出方法につ
いて詳細に述べる。本実施例においては、文字認識部３
９での部分マッチング距離Δｄ算出の際に用いる距離に
関する重みＷpと方向に関する重みＷdを、式(６)および
式(７)とする。Ｗp＝Ｎwlen・Ｗlwm …（６）Ｗd＝Ｎwdst・Ｗdwm …（７）ここで、Ｎwlen,Ｎwdst：第１実施例と同様の全特徴点
間で均一の重みＷlwm,Ｗdwm：標準パターンにおける区間ｍにおける変
動重みThe method of calculating the weight for each section will be described in detail below. In the present embodiment, the character recognition unit 3
The weight Wp related to the distance and the weight Wd related to the direction used in the calculation of the partial matching distance Δd in 9 are represented by Expression (6) and Expression (7). Wp = Nwlen · Wlwm (6) Wd = Nwdst · Wdwm (7) where Nwlen, Nwdst: uniform weights among all feature points as in the first embodiment Wlwm, Wdwm: in section m in the standard pattern Fluctuation weight

【００８９】上記変動重みＷlwm,Ｗdwmは、重み算出部
４１の制御の下に、以下のようにして求められる。すな
わち、先ず、総ての認識対象文字における種々の筆跡パ
ターンの座標データ列で成る学習文字セットを用意す
る。そして、個々の認識対象文字に係る１つのサブパタ
ーンの座標データ列を任意に選出してストローク終了判
定部３４に入力し、第１実施例の場合と同様にして特徴
抽出/再抽出を行い、得られた特徴列を初期標準パター
ンとして重み算出用バッファ４２に登録しておく。次
に、上記学習文字セット中におけるある学習文字のサブ
パターンのある座標データ列をストローク終了判定部３
４に入力し、第１実施例の場合と同様にして特徴抽出/
再抽出を行って学習用特徴パターンを得る。文字認識部
３９は、当該サブパターンに係る初期標準パターンを重
み算出用バッファ４２から読み出し、当該サブパターン
の初期標準パターンと学習用特徴パターンとのＤＰマッ
チングを行う。その際におけるＤＰマッチングは、第１
実施例における式(３)の部分マッチング距離Δｄを用い
たＤＰマッチングである。The above-mentioned fluctuation weights Wlwm and Wdwm are obtained as described below under the control of the weight calculator 41. That is, first, a learning character set including coordinate data strings of various handwriting patterns for all recognition target characters is prepared. Then, a coordinate data string of one sub-pattern relating to each recognition target character is arbitrarily selected and input to the stroke end determination unit 34, and feature extraction / re-extraction is performed in the same manner as in the first embodiment, The obtained feature string is registered in the weight calculation buffer 42 as an initial standard pattern. Next, the stroke end determination unit 3 determines a coordinate data string having a sub-pattern of a learning character in the learning character set.
4 and the feature extraction / similar to the case of the first embodiment.
Re-extraction is performed to obtain a learning feature pattern. The character recognition unit 39 reads the initial standard pattern related to the sub-pattern from the weight calculation buffer 42 and performs DP matching between the initial standard pattern of the sub-pattern and the learning feature pattern. DP matching at that time is the first
This is DP matching using the partial matching distance Δd of Expression (3) in the embodiment.

【００９０】そして、上記ＤＰマッチングの結果、マッ
チングパスに沿った距離の総和が所定値より大きい場合
には、当該学習用特徴パターンは当該サブパターンのカ
テゴリの範疇ではないとして変動重み算出の対象から外
す。As a result of the DP matching, if the sum of the distances along the matching path is larger than a predetermined value, it is determined that the learning feature pattern is not in the category of the sub-pattern, and the variation weight is calculated. remove.

【００９１】当該学習用特徴パターンが変動重み算出の
対象となる場合には、上記ＤＰマッチング時に算出され
たマッチングパス上の各点に係るユークリッド距離ΔＬ
と方向差ΔＨとを、当該初期標準パターン上における当
該特徴点に係る上記区間に対応付けて重み算出用バッフ
ァ４２に格納する。When the learning feature pattern is to be the object of variable weight calculation, the Euclidean distance ΔL for each point on the matching path calculated during the DP matching is calculated.
And the direction difference ΔH are stored in the weight calculation buffer 42 in association with the section related to the feature point on the initial standard pattern.

【００９２】上述の処理を学習文字セット総ての認識対
象文字における総ての座標データ列に付いて行い、上記
各区間毎のユークリッド距離ΔＬと方向差ΔＨとの分布
を求める。そして、上記重み算出部４１は、重み算出用
バッファ４２に格納されているユークリッド距離ΔＬと
方向差ΔＨとに基づいて、上記各区間毎にユークリッド
距離ΔＬの分散ρLおよび方向差ΔＨの分散ρHを算出
し、区間ｍに係るユークリッド距離ΔＬの分散ρLmを上
記変動重みＷlwmとする。一方、区間ｍに係る方向差Δ
Ｈの分散ρHmを上記変動重みＷdwmとするのである。こ
うして得られた変動重みＷlwm,Ｗdwmは上記初期標準パ
ターンにおける特徴点に付加されて、標準パターンとそ
の変動重みとして辞書格納部４０に格納される。The above-described processing is performed for all the coordinate data strings in the recognition target characters of all the learning character sets to obtain the distribution of the Euclidean distance ΔL and the direction difference ΔH for each section. Then, the weight calculation unit 41 calculates the variance ρL of the Euclidean distance ΔL and the variance ρH of the directional difference ΔH for each section based on the Euclidean distance ΔL and the direction difference ΔH stored in the weight calculation buffer 42. The variance ρLm of the Euclidean distance ΔL related to the section m is calculated and used as the fluctuation weight Wlwm. On the other hand, the direction difference Δ in the section m
The variance ρHm of H is used as the fluctuation weight Wdwm. The variation weights Wlwm and Wdwm thus obtained are added to the feature points in the initial standard pattern and stored in the dictionary storage unit 40 as the standard pattern and its variation weights.

【００９３】尚、本実施例においては、上記各区間毎に
ユークリッド距離ΔＬの分散ρLおよび方向差ΔＨの分
散ρHを算出して変動重みＷlwm,Ｗdwmを求めているが、
ユークリッド距離ΔＬの平均μLおよび方向差ΔＨの平
均μHを変動重みＷlwm,Ｗdwmとしても差し支えない。要
は、初期標準パターンに対する学習文字セットのばらつ
きの度合を表すことができればよいのである。In the present embodiment, the variance weights Wlwm and Wdwm are calculated by calculating the variance ρL of the Euclidean distance ΔL and the variance ρH of the direction difference ΔH for each section.
The average μL of the Euclidean distance ΔL and the average μH of the direction difference ΔH may be used as the fluctuation weights Wlwm and Wdwm. The point is that it is only necessary to be able to represent the degree of variation in the learning character set with respect to the initial standard pattern.

【００９４】このように、本実施例においては、予め、
総ての認識対象文字の学習文字セットを用意し、個々の
認識対象文字に係る１つの座標データ列を任意に選出し
て特徴列を求めて初期標準パターンとする。そして、上
記学習文字セット中における個々の座標データ列からの
学習用特徴パターンと当該学習文字に係る初期標準パタ
ーンとのＤＰマッチングを文字認識部３９で行い、マッ
チングパスに沿った各点に係るユークリッド距離ΔＬと
方向差ΔＨとを求める。そして、初期標準パターンにお
ける区間ｍにある特徴点に係るユークリッド距離ΔＬの
分散ρLおよび方向差ΔＨの分散ρHを変動重みＷlwm,Ｗ
dwmとして得る。以後、上記文字認識部３９による手書
き認識の際には、このようにして得られた変動重みＷlw
m,Ｗdwmを用いた重み付けＤＰマッチングを行う。Thus, in this embodiment, in advance,
A learning character set of all recognition target characters is prepared, one coordinate data string relating to each recognition target character is arbitrarily selected, and a feature string is obtained to be an initial standard pattern. Then, the character recognition unit 39 performs DP matching between the learning feature pattern from each coordinate data string in the learning character set and the initial standard pattern related to the learning character, and the Euclidean character corresponding to each point along the matching path. The distance ΔL and the direction difference ΔH are calculated. Then, the variance ρL of the Euclidean distance ΔL and the variance ρH of the direction difference ΔH related to the feature points in the section m in the initial standard pattern are set as the variation weights Wlwm, W.
Get as dwm. After that, when the handwriting is recognized by the character recognition unit 39, the variation weight Wlw thus obtained is obtained.
Weighted DP matching using m and Wdwm is performed.

【００９５】したがって、本実施例によれば、手書き文
字の曲線部分の特徴点が安定して抽出できない場合に
は、その不安定度に応じた重みを付加してＤＰマッチン
グを行うことによって、より高い認識性能を確保できる
のである。Therefore, according to the present embodiment, when the characteristic points of the curved portion of the handwritten character cannot be stably extracted, the DP matching is performed by adding the weight according to the instability. High recognition performance can be secured.

【００９６】＜第４実施例＞本実施例は、変動重みＷlw
n,Ｗdwnを用いて重み付けＤＰマッチングを行う他の実
施例に関する。図１２は、本実施例における手書き認識
装置のブロック図である。タブレット５２,結果表示部
５３,ストローク終了判定部５４,ストローク処理部５
５,特徴格納部５６,文字終了判定部５７,特徴再抽出部
５８および辞書格納部６０は、第１実施例におけるタブ
レット２,結果表示部３,ストローク終了判定部４,スト
ローク処理部５,特徴格納部６,文字終了判定部７,特徴
再抽出部８および辞書格納部１０と同じ機能を有して同
様に動作する。<Fourth Embodiment> In this embodiment, the variation weight Wlw
The present invention relates to another embodiment in which weighted DP matching is performed using n and Wdwn. FIG. 12 is a block diagram of the handwriting recognition device in this embodiment. Tablet 52, result display unit 53, stroke end determination unit 54, stroke processing unit 5
5, the feature storage unit 56, the character end determination unit 57, the feature re-extraction unit 58, and the dictionary storage unit 60 are the tablet 2, the result display unit 3, the stroke end determination unit 4, the stroke processing unit 5, and the feature in the first embodiment. The storage unit 6, the character end determination unit 7, the feature re-extraction unit 8 and the dictionary storage unit 10 have the same functions and operate in the same manner.

【００９７】本実施例においては、文字認識部５９での
部分マッチング距離Δｄ算出の際に用いる距離に関する
重みＷpと方向に関する重みＷdを、式(８)および式(９)
とする。Ｗp＝Ｎwlen・Ｗlwn …（８）Ｗd＝Ｎwdst・Ｗdwn …（９）ここで、Ｗlwn,Ｗdwn：特徴パターンにおける特徴点ｎ
に係る変動重みIn this embodiment, the weight Wp related to the distance and the weight Wd related to the direction used in the calculation of the partial matching distance Δd in the character recognition unit 59 are expressed by the equations (8) and (9).
And Wp = Nwlen · Wlwn (8) Wd = Nwdst · Wdwn (9) where Wlwn, Wdwn: characteristic point n in the characteristic pattern
Variation weight for

【００９８】重み算出部６１は、上記特徴再抽出部５８
で得られた特徴パターンに基づいて変動重みＷlwn,Ｗdw
nを算出し、得られた変動重みＷlwn,Ｗdwnを当該特徴パ
ターンに付加して文字認識部５９に送出する。そうする
と、文字認識部５９は、重み算出部６１からの特徴パタ
ーンおよび変動重みＷlwn,Ｗdwnと辞書格納部６０に格
納されたサブパターン辞書の標準パターンとに基づい
て、第３実施例と同様の重み付けＤＰマッチングを行う
のである。その際における上記重み算出部６１による変
動重みＷlwn,Ｗdwnの算出は、以下のようにして行われ
る。The weight calculation section 61 has the feature re-extraction section 58.
Variation weights Wlwn, Wdw based on the feature pattern obtained in
n is calculated, and the obtained variation weights Wlwn and Wdwn are added to the feature pattern and sent to the character recognition unit 59. Then, the character recognition unit 59 performs the same weighting as in the third embodiment based on the characteristic pattern and the variation weights Wlwn, Wdwn from the weight calculation unit 61 and the standard pattern of the sub-pattern dictionary stored in the dictionary storage unit 60. DP matching is performed. At this time, the calculation of the variation weights Wlwn and Wdwn by the weight calculation unit 61 is performed as follows.

【００９９】上記特徴再抽出部５８からの特徴パターン
における各特徴点ｎについて、特徴点(ｎ−１)と特徴点
(ｎ＋１)との距離を算出する。また、当該特徴点ｎが属
するストロークを構成する全特徴点の座標に基づいて当
該ストロークの全特徴点を入力順に結ぶ距離(以下、単
に“全長"と言う)を算出する。そして、式(８)に従って
変動重みＷlwn,Ｗdwnを算出する。Ｗlwn＝Ｗdwn＝(特徴点(ｎ−１)と特徴点(ｎ＋１)との距離) /(ストロークの全長) …（１０）For each feature point n in the feature pattern from the feature re-extracting unit 58, the feature point (n-1) and the feature point
The distance from (n + 1) is calculated. In addition, the distance (hereinafter, simply referred to as "total length") connecting all the feature points of the stroke in the input order is calculated based on the coordinates of all the feature points forming the stroke to which the feature point n belongs. Then, the variation weights Wlwn and Wdwn are calculated according to the equation (8). Wlwn = Wdwn = (distance between feature point (n-1) and feature point (n + 1)) / (total length of stroke) (10)

【０１００】このように、本実施例においては、上記重
み付けＤＰマッチングの際の変動重みＷlwn,Ｗdwnの値
を、特徴点ｎの両隣の特徴点間の距離のストローク長に
対する割合で求めるようにしている。したがって、直線
部にある特徴点間の部分マッチング距離Δｄが遠ざけら
れることになり、特徴パターンと標準パターンとにおけ
る屈曲部の特徴点の座標のずれを容易に吸収できるので
ある。すなわち、本実施例によれば、手書き入力パター
ンの局所変形に強い手書き認識装置を実現できるのであ
る。As described above, in the present embodiment, the values of the fluctuation weights Wlwn and Wdwn in the above weighted DP matching are calculated by the ratio of the distance between the characteristic points on both sides of the characteristic point n to the stroke length. There is. Therefore, the partial matching distance Δd between the characteristic points on the straight line portion is increased, and the deviation of the coordinates of the characteristic points of the bent portion between the characteristic pattern and the standard pattern can be easily absorbed. That is, according to this embodiment, it is possible to realize a handwriting recognition device that is resistant to local deformation of the handwriting input pattern.

【０１０１】＜第５実施例＞本実施例は入力パターンの
認識法に関する。尚、本実施例においては、説明を簡単
にするために、手書き文字パターンを認識対象とした手
書き文字認識装置を例に説明するが、例えば、ゼスチャ
等の文字以外のパターンへの適用も同様にして行うこと
ができる。<Fifth Embodiment> This embodiment relates to an input pattern recognition method. In the present embodiment, in order to simplify the description, a handwritten character recognition device that recognizes a handwritten character pattern will be described as an example, but the same applies to a pattern other than a character such as a gesture. Can be done by

【０１０２】以下、発明の概要を簡単に述べた後、デー
タ構造や認識方式等について詳細に説明する。図１３は
本実施例の手書き認識装置におけるブロック図である。
この手書き認識装置は、認識部７１,タブレット７２,特
徴抽出用記憶部７３，認識用記憶部７４，サブパターン
辞書７５および結果表示部７６で概略構成される。ここ
で、本実施例における上記サブパターンの定義は、第１
実施例における定義と全く同じである。After briefly describing the outline of the invention, the data structure, the recognition method and the like will be described in detail. FIG. 13 is a block diagram of the handwriting recognition device in this embodiment.
This handwriting recognition device is roughly configured by a recognition unit 71, a tablet 72, a feature extraction storage unit 73, a recognition storage unit 74, a sub-pattern dictionary 75, and a result display unit 76. Here, the definition of the sub-pattern in this embodiment is as follows:
It is exactly the same as the definition in the embodiment.

【０１０３】上記タブレット７２は、第１実施例〜第４
実施例と同様に、例えば表示一体型に形成されて、筆記
者がペン入力した筆跡データを電気信号で表現される座
標データ列に変換する。認識部７１は、タブレット７２
から入力される座標データ列の特徴を抽出し、サブパタ
ーン辞書７５を用いて手書き文字を認識して結果表示部
７６に表示する。特徴抽出用記憶部７３には、入力され
た座標データ列の特徴を抽出する際に用いられる種々の
バッファが格納される。また、認識用記憶部７４には、
手書き認識処理時に用いられる種々のバッファが格納さ
れる。尚、上記認識用記憶部７４における文字構成テー
ブルは、第１実施例で言うところの文字辞書と同じであ
り、図４に示す構成を有している。The tablet 72 is the first to fourth embodiments.
Similar to the embodiment, for example, it is formed as a display-integrated type and converts the handwriting data pen-input by the writer into a coordinate data string represented by an electric signal. The recognition unit 71 is a tablet 72.
The feature of the coordinate data string input from is extracted, the handwritten character is recognized using the sub-pattern dictionary 75, and displayed on the result display unit 76. The feature extraction storage unit 73 stores various buffers used when extracting features of the input coordinate data sequence. Further, in the recognition storage unit 74,
Various buffers used during the handwriting recognition process are stored. The character structure table in the recognition storage unit 74 is the same as the character dictionary in the first embodiment and has the structure shown in FIG.

【０１０４】上記タブレット７２から入力された座標デ
ータ列は、入力部７７を介して正規化部７８に入力され
て１つ以上のストローク連鎖毎に正規化される。そし
て、特徴抽出部７９によって、正規化後のストローク連
鎖に対して、後に詳述するようにして特徴点とその特徴
量が抽出される。方向差判定部８０は、上記特徴抽出部
７９で抽出された各特徴点のうちストロークの最終特徴
点に係る方向差を後に詳述するようにして求める。そし
て、この方向差に基づいて、ＤＰマッチングに用いる標
準パターンの有効性を判定する。ストローク数限定部８
１は、上記タブレット７２からの入力パターンの正規の
ストローク数をある範囲に限定する。こうして、入力パ
ターンのストローク数を限定することによって後に行わ
れるＤＰマッチングの高速化を図るのである。The coordinate data string input from the tablet 72 is input to the normalization unit 78 via the input unit 77 and is normalized for each one or more stroke chains. Then, the feature extraction unit 79 extracts feature points and their feature amounts from the normalized stroke chain as described later. The direction difference determination unit 80 obtains the direction difference related to the final feature point of the stroke among the feature points extracted by the feature extraction unit 79 as described in detail later. Then, the validity of the standard pattern used for DP matching is determined based on this direction difference. Stroke limited part 8
1 limits the number of regular strokes of the input pattern from the tablet 72 to a certain range. By limiting the number of strokes of the input pattern in this way, the speed of the DP matching performed later can be increased.

【０１０５】ＤＰマッチング部８２は、上記特徴抽出部
７９で抽出された特徴量を有する特徴点列のパターン
(特徴パターン)とサブパターン辞書７５に登録されてい
るサブパターン単位の標準パターンとのＤＰマッチング
を行ってサブパターンを認識する。文字生成部８４は、
サブパターン候補ソーティング部８３でソーティングさ
れたサブパターン候補を組み合わせて文字候補を生成す
る。出力部８５は、生成された文字候補を類似度の高い
順序で結果表示部７６に送出して表示させる。制御部８
６は、上記入力部７７,正規化部７８,特徴抽出部７９,
方向差判定部８０,ストローク数限定部８１,ＤＰマッチ
ング部８２,サブパターン候補ソーティング部８３,文字
生成部８４および出力部８５を制御して、手書き文字認
識処理を実行する。The DP matching unit 82 has a pattern of a feature point sequence having the feature quantity extracted by the feature extracting unit 79.
The sub-pattern is recognized by performing DP matching between the (feature pattern) and the standard pattern in sub-pattern units registered in the sub-pattern dictionary 75. The character generator 84
The sub pattern candidate sorting unit 83 combines the sub pattern candidates to generate a character candidate. The output unit 85 sends the generated character candidates to the result display unit 76 in order of high similarity and displays them. Control unit 8
6 is the input unit 77, the normalization unit 78, the feature extraction unit 79,
The handwriting character recognition process is executed by controlling the direction difference determination unit 80, the stroke number limiting unit 81, the DP matching unit 82, the sub pattern candidate sorting unit 83, the character generation unit 84, and the output unit 85.

【０１０６】図１４は、上記制御部８６の制御の下に行
われる手書き文字認識処理動作のゼネラルフローを示す
フローチャートである。以下、図１４に従って、手書き
文字認識処理動作の概要について説明する。ステップＳ
11で、上記入力部７７によって、タブレット７２からの
座標データ列が取り込まれて特徴抽出用記憶部７３の入
力座標バッファに格納される。ステップＳ12で、上記正
規化部７８,特徴抽出部７９,方向差判定部８０,ストロ
ーク数限定部８１,ＤＰマッチング部８２およびサブパ
ターン候補ソーティング部８３によって、順次スロトー
ク連鎖が生成されて正規化,特徴抽出,ストローク数限
定,次ストロークへのオフ方向差計算,サブパターン認識
および第１サブパターン候補ソーティングが行われて、
第１サブパターン候補が決定される。尚、上記第１サブ
パターンとは、入力文字パターンを構成する最初のサブ
パターンであり、通常は文字の偏に相当するものであ
る。ステップＳ13で、上記正規化部７８,特徴抽出部７
９,方向差判定部８０,ストローク数限定部８１,ＤＰマ
ッチング部８２,サブパターン候補ソーティング部８３
および文字生成部８４によって、上記決定された第１サ
ブパターン候補に基づいて以降のサブパターンを限定し
ながらサブパターン認識を行ない、得られた各サブパタ
ーン候補を組み合わせて文字候補が生成される。ステッ
プＳ14で、上記出力部８５によって、文字生成部８４の
文字候補ソーティング部８８で類似度の高い順にソーテ
ィングされた文字候補が類似度の高い順に結果表示部７
６に出力されて表示される。FIG. 14 is a flowchart showing the general flow of the handwritten character recognition processing operation performed under the control of the control unit 86. The outline of the handwritten character recognition processing operation will be described below with reference to FIG. Step S
At 11, the coordinate data string from the tablet 72 is fetched by the input unit 77 and stored in the input coordinate buffer of the feature extraction storage unit 73. In step S12, the normalization unit 78, the feature extraction unit 79, the direction difference determination unit 80, the stroke number limiting unit 81, the DP matching unit 82, and the sub-pattern candidate sorting unit 83 sequentially generate a Slotalk chain for normalization, Feature extraction, stroke number limitation, off direction difference calculation to the next stroke, sub pattern recognition and first sub pattern candidate sorting are performed,
The first sub-pattern candidate is determined. The first sub-pattern is the first sub-pattern that constitutes the input character pattern, and usually corresponds to the deviation of characters. In step S13, the normalization unit 78 and the feature extraction unit 7
9, direction difference determination unit 80, stroke limit unit 81, DP matching unit 82, sub-pattern candidate sorting unit 83
And the character generation unit 84 performs sub-pattern recognition while limiting subsequent sub-patterns based on the determined first sub-pattern candidate, and combines the obtained sub-pattern candidates to generate a character candidate. In step S14, the output unit 85 sorts the character candidates sorted in descending order of similarity by the character candidate sorting unit 88 of the character generation unit 84 in descending order of similarity.
It is output to 6 and displayed.

【０１０７】以下、上記ゼネラルフローにおける第１サ
ブパターンの認識処理および文字生成処理について詳細
に説明する。ここで、上記第１サブパターンの認識処理
および文字生成処理の説明を容易にするために、正規化
部７８による正規化処理、特徴抽出部７９による特徴抽
出処理、方向差判定部８０による方向差判定処理、スト
ローク数限定部８１によるストローク数限定処理、ＤＰ
マッチング部８２によるＤＰマッチング処理、サブパタ
ーン辞書７５の構造について、予め説明しておく。The first sub-pattern recognition processing and character generation processing in the above general flow will be described in detail below. Here, in order to facilitate the description of the recognition process and the character generation process of the first sub-pattern, the normalization process by the normalization unit 78, the feature extraction process by the feature extraction unit 79, the direction difference by the direction difference determination unit 80. Judgment process, stroke number limiting process by stroke number limiting unit 81, DP
The DP matching process by the matching unit 82 and the structure of the sub-pattern dictionary 75 will be described in advance.

【０１０８】(I) 正規化処理１以上のストローク分のストロークデータ(可変長の座
標列データ)に対して、このストロークデータ全体の外
接枠が縦横夫々Ｎnormドットになるように正規化を行な
う。尚、上記ドット数“Ｎnorm"は認識装置の能力に応
じて随意に決定できる値であり、本実施例ではＮnorm＝
２００ドットである。(I) Normalization processing The stroke data for one or more strokes (variable length coordinate sequence data) is normalized so that the circumscribing frame of the entire stroke data becomes Nnorm dots vertically and horizontally. The number of dots "Nnorm" is a value that can be arbitrarily determined according to the ability of the recognition device. In this embodiment, Nnorm =
It is 200 dots.

【０１０９】(II) 特徴抽出処理正規化後におけるストロークデータに対して、オフスト
ローク区間を適当な座標データの正規化値で補間して入
力パターンの一筆化を行う。そうした後、全ての座標デ
ータを走査して、閾値以上に方向が変化している点を屈
曲点として求める。そして、屈曲点間の距離が所定値以
上である場合にはｗ(例えば“７０")ドット間隔で補間
点を求め、屈曲点と補間点(等間隔点)とを合わせて特徴
点として抽出する。そして、各特徴点における次特徴点
への方向値ｄを求め、座標値(ｘ,ｙ)と合わせて特徴量
Ｆnとする。したがって、特徴量Ｆnは、式(１１)で表さ
れる可変長の方向付座標列となる。Ｆn＝(ｘ₁,ｙ₁,ｄ₁),(ｘ₂,ｙ₂,ｄ₂),…,(ｘ_n,ｙ_n,ｄ_n）…（１１）(II) Feature Extraction Processing With respect to the stroke data after normalization, the off-stroke section is interpolated with the normalization value of appropriate coordinate data to make a single stroke of the input pattern. After that, all coordinate data are scanned, and a point whose direction has changed by a threshold value or more is obtained as a bending point. When the distance between the bending points is equal to or greater than a predetermined value, interpolation points are obtained at w (for example, "70") dot intervals, and the bending points and the interpolation points (equal spacing points) are combined and extracted as feature points. . Then, the direction value d of each feature point to the next feature point is obtained and combined with the coordinate value (x, y) to obtain the feature amount Fn. Therefore, the feature amount Fn is a variable-length coordinate sequence with a direction represented by the equation (11). _{_{Fn = (x 1, y 1}} , d 1), (x 2, y 2, d 2), ..., (x n, y n, d n) ... (11)

【０１１０】尚、上記方向値ｄは、ある方向を“１"と
して１回転が３６０度になるように定義し、１〜Ｄ(例
えば“６４")の整数値で表現する。上記座標(ｘ,ｙ)の
有効範囲は正規化枠のドット数Ｎnormに依存する一方、
方向値ｄの有効範囲は方向の量子化数Ｄに依存する。但
し、上記ドット数Ｎnorm,量子化数Ｄおよび補間間隔ｗ
は認識装置の能力に応じて随意に決定できる値である。The direction value d is defined so that one rotation is 360 degrees with a certain direction being "1", and is expressed by an integer value of 1 to D (for example, "64"). While the effective range of the coordinates (x, y) depends on the number of dots Nnorm in the normalized frame,
The effective range of the direction value d depends on the quantization number D of the direction. However, the dot number Nnorm, the quantization number D, and the interpolation interval w
Is a value that can be arbitrarily determined according to the ability of the recognition device.

【０１１１】(III) 方向差判定処理本処理は、上記ＤＰマッチング処理の高速化を図るため
のスキップ処理を行うか否かを判定する処理である。具
体的には、あるストロークの最終特徴点から次のストロ
ークの先頭特徴点への２つの方向差を求め、この方向差
の値とサブパターン辞書７５に各標準パターン毎に格納
されている方向評価値とで有効標準パターンの判定を行
なう処理である。尚、この処理で有効標準パターンでな
いと判定された標準パターンが在る場合には、その標準
パターンを用いたＤＰマッチング処理は行なわずに次の
標準パターンヘスキップする。(III) Directional Difference Determination Processing This processing is processing for determining whether or not skip processing for speeding up the DP matching processing is to be performed. Specifically, two directional differences from the final feature point of one stroke to the leading feature point of the next stroke are obtained, and the value of this directional difference and the directional evaluation stored in the sub-pattern dictionary 75 for each standard pattern are evaluated. This is a process of determining an effective standard pattern with a value. If there is a standard pattern which is determined not to be an effective standard pattern in this process, the DP matching process using the standard pattern is not performed and the next standard pattern is skipped.

【０１１２】ここで、上記方向差を次のように定義す
る。すなわち、方向値ｄは循環値であるためにその差分
(Ｄdiff(ｄ1,ｄ2))を式(１２)で定義する。Ｄdiff(ｄ1,ｄ2)＝min(│ｄ1−ｄ2│,Ｄ−│ｄ1−ｄ2│) …（１２）但し、ｄ1,ｄ2は、ある特徴点における２方向の方向値
(１≦ｄ1≦Ｄ，１≦ｄ2≦Ｄ)であり、min(ａ,ｂ)はａ,
ｂのうち小さい方の値を取ることを意味する。Here, the direction difference is defined as follows. That is, since the direction value d is a cyclic value, its difference
(Ddiff (d1, d2)) is defined by equation (12). Ddiff (d1, d2) = min (| d1-d2 |, D- | d1-d2 |) (12) where d1 and d2 are direction values in two directions at a certain feature point.
(1≤d1≤D, 1≤d2≤D), and min (a, b) is a,
This means taking the smaller value of b.

【０１１３】上述のように方向値ｄは循環値である。そ
のために、その平均や標準偏差も直接求めることはでき
ない。そこで、本実施例においては、直交する２つの軸
(方向値)を用意し、この両軸と方向値ｄとの方向差を求
め、求められた２つの方向差の平均値と標準偏差値とで
代用するのである。すなわち、ｄ_iをｉ(１≦ｉ≦Ｉ)番
目の方向値、第１,第２軸の方向値をＲ^j(ｊ＝１,２)、
第１,第２軸との方向差をdd^jとすると、方向差dd^jは式
(１３)で求められ、方向差dd^jの平均ｍ^jは式(１４)で求
められ、方向差dd^jの標準偏差ｖ^jは式(１５)で求められ
る。As described above, the direction value d is a cyclic value. Therefore, the average and standard deviation cannot be directly calculated. Therefore, in this embodiment, two orthogonal axes are used.
(Direction value) is prepared, the direction difference between these two axes and the direction value d is calculated, and the average value and standard deviation value of the two calculated direction differences are substituted. That is, d _i is the i (1 ≦ i ≦ I) th direction value, the direction values of the first and second axes are R ^j (j = 1, 2),
If the direction difference between the first and second axes is dd ^j , the direction difference dd ^j is
(13) is determined, the mean m ^j direction difference dd ^j given by Equation (14), the standard deviation v ^j direction difference dd ^j is determined by Equation (15).

【数１】上記第１,第２軸は直交２軸であり、認識装置の能力に
応じて随意に決定できる。本実施例では、Ｒ¹＝１,Ｒ²
＝１６である。[Equation 1] The first and second axes are two orthogonal axes, and can be arbitrarily determined according to the ability of the recognition device. In this embodiment, R ¹ = 1 and R ²
= 16.

【０１１４】尚、上記方向差判定処理結果を用いた上記
スキップ処理は、次のようにして行う。例えば、文字
「賭」におけるサブパターン「貝」では、次のサブパターン
「者」へ至るオフストロークの方向(以下、オフ方向と言
う)が特定の範囲(右上方向)に限定されている。そこ
で、このオフ方向の情報を用いて、現特徴点に係る次ス
トロークへのオフ方向が、現標準パターンに付加されて
いるオフ方向の許容範囲内であるかをチェックして、許
容範囲外である場合には現標準パターンをＤＰマッチン
グ処理の対象とはせず、次の標準パターンへスキップす
るのである。The skip process using the direction difference determination process result is performed as follows. For example, in the sub-pattern “shellfish” in the character “bet”, the direction of the off-stroke to the next sub-pattern “person” (hereinafter, referred to as the off direction) is limited to a specific range (upper right direction). Therefore, by using this off-direction information, it is checked whether the off-direction to the next stroke related to the current feature point is within the allowable range of the off-direction added to the current standard pattern. In some cases, the current standard pattern is not targeted for DP matching processing, and the next standard pattern is skipped.

【０１１５】上記スキップ処理は、具体的には次のよう
にして行う。先ず、上記標準パターンにおける最終スト
ロークの最終特徴点から次ストロークへのオフ方向を求
め、このオフ方向と第１軸(方向値ｄ1)および第２軸(方
向値ｄ2)との方向差dd¹,dd²を求める。次に、上記サブ
パターン辞書７５における現標準パターンのヘッダ部か
ら、第１軸との方向差平均値ｍ¹と第２軸との方向差平
均値ｍ²、および、第１軸との方向差標準偏差値ｖ¹と第
２軸との方向差標準偏差値ｖ²を読み出して、式(１６),
式(１７)を満たすか否かを判定する。そして、満たさな
い場合には、次標準パターンへスキップするのである。 |dd¹−ｍ¹|＜ｖ¹・Ｃs …（１６） |dd²−ｍ²|＜ｖ²・Ｃs …（１７）ここで、Ｃs：定数(例えば“３")Specifically, the skip processing is performed as follows. First, the off direction from the final feature point of the final stroke in the standard pattern to the next stroke is obtained, and the direction difference dd ¹ , between the off direction and the first axis (direction value d1) and the second axis (direction value d2). Find dd ² . Next, from the header part of the current standard pattern in the sub-pattern dictionary 75, the direction difference average value m ¹ with the first axis and the direction difference average value m ² with the second axis, and the direction difference with the first axis. The standard deviation value v ¹ and the directional difference standard deviation value v ² between the second axis are read out to obtain the equation (16),
It is determined whether Expression (17) is satisfied. If not satisfied, the process skips to the next standard pattern. | dd ¹ −m ¹ | <v ¹ · Cs… (16) | dd ² −m ² | <v ² · Cs… (17) where Cs: a constant (eg, “3”)

【０１１６】尚、上記サブパターン辞書７５に書き込ま
れている第１,第２軸との方向差平均値ｍ¹,ｍ²および第
１,第２軸との方向差標準偏差値ｖ¹,ｖ²は、注目サブパ
ターンから次サブパターンへのオフ方向値に対する第
１,第２軸との方向差平均値と標準偏差値であり、大量
の手書き文字の座標データ列に基づいて各サブパターン
毎に求めたものである。The average value m ¹ , m ² of the direction difference between the first and second axes and the standard deviation value v ¹ , v of the direction difference between the first and second axes written in the sub-pattern dictionary 75. ² is the direction difference average value and the standard deviation value with respect to the off-direction value from the target sub-pattern to the next sub-pattern, and is the standard deviation value, based on the coordinate data string of a large amount of handwritten characters for each sub-pattern. Is what I asked for.

【０１１７】(IV) ストローク数限定処理ストローク数限定処理は、入力パターンの正規のストロ
ーク数をある範囲に限定(予想)しておく処理であり、Ｄ
Ｐマッチング処理の高速化を図るための処理である。上
記ストローク数slの限定範囲は、式(１８)に示すよう
に、下限を入力パターンの入力ストローク数rsとし、上
限を上記特徴抽出処理の途中で求められた屈曲点数pcか
ら計算できる予想値とする。 rs≦sl≦pc＋１ …（１８）(IV) Stroke Number Limiting Process The stroke number limiting process is a process of limiting (estimating) the regular stroke number of the input pattern to a certain range.
This is a process for speeding up the P matching process. As shown in Expression (18), the limit range of the stroke number sl is such that the lower limit is the input stroke number rs of the input pattern, and the upper limit is an expected value that can be calculated from the number of bending points pc obtained during the feature extraction processing. To do. rs ≦ sl ≦ pc + 1 (18)

【０１１８】上記ストローク数限定処理結果によるＤＰ
マッチング処理の高速化は次のようにして行われる。す
なわち、予め、上記サブパターン辞書７５を正規のスト
ローク数順にソートしておく。そして、ＤＰマッチング
処理に際しては、総ての標準パターンを用いるのではな
く、注目特徴パターンに係るストロークslの限定範囲内
に入る正規のストローク数を有するサブパターンの標準
パターンのみを用いてＤＰマッチング処理を行うのであ
る。DP according to the above-mentioned stroke number limiting processing result
The speed of the matching process is increased as follows. That is, the sub-pattern dictionary 75 is sorted in advance in the order of regular stroke numbers. Then, in the DP matching processing, all the standard patterns are not used, but only the standard pattern of the sub-patterns having a regular stroke number within the limited range of the stroke sl related to the feature pattern of interest is used. To do.

【０１１９】(V) ＤＰマッチング処理入力パターンの座標データ列からの特徴パターンとサブ
パターン辞書７５に登録されている標準パターンとの距
離計算を行う処理である。このＤＰマッチング処理は長
さの異なる２つのデータ間の距離を求めるものであり、
マッチングパスの定義や整合窓の幅やマッチング距離の
定義は、認識装置の能力に応じて随意に決定できる。(V) DP matching process This is a process for calculating the distance between the characteristic pattern from the coordinate data string of the input pattern and the standard pattern registered in the sub pattern dictionary 75. This DP matching processing is to obtain the distance between two data having different lengths.
The definition of the matching path, the width of the matching window, and the definition of the matching distance can be arbitrarily determined according to the ability of the recognition device.

【０１２０】本実施例においては、上記部分マッチング
距離d(m,n)を式(１９)で定義し、マッチングパスに沿っ
た距離の部分和Ｄ(m,n)を式(２０)で定義する。In this embodiment, the partial matching distance d (m, n) is defined by the equation (19), and the partial sum D (m, n) of the distances along the matching path is defined by the equation (20). To do.

【数２】 [Equation 2]

【０１２１】ＤＰマッチングの整合窓の幅wwは可変長で
あり、特徴パターンの特徴点数Ｎおよび標準パターンの
特徴点数Ｍに基づいて、式(２１)で算出される。 ww＝max(Ｃw,|Ｍ−Ｎ|) …（２１）上記max(ａ,ｂ)はａ,ｂのうちの大きい方の値を取るこ
とを意味する。また、Ｃwは認識装置の能力に応じて随
意に設定できる定数であり、本実施例においては“８"
である。つまり、本実施例におけるＤＰマッチング処理
では、１≦ｍ≦Ｍ,１≦ｎ≦Ｎ及び整合窓wwの範囲内で
マッチングパスに沿った距離の部分和Ｄ(m,n)を求め、
最終的にＤ(Ｍ,Ｎ)の値をＮで割った値がマッチング距
離ｄistとなる。ｄist＝Ｄ(Ｍ,Ｎ)/Ｎ …（２２）The width ww of the matching window of DP matching is variable, and is calculated by the equation (21) based on the number N of characteristic points of the characteristic pattern and the number M of characteristic points of the standard pattern. ww = max (Cw, | MN |) (21) The above max (a, b) means to take the larger value of a and b. Further, Cw is a constant that can be arbitrarily set according to the ability of the recognition device, and is “8” in this embodiment.
Is. That is, in the DP matching processing in the present embodiment, the partial sum D (m, n) of the distances along the matching path within the range of 1 ≦ m ≦ M, 1 ≦ n ≦ N and the matching window ww is obtained,
Finally, the value obtained by dividing the value of D (M, N) by N becomes the matching distance dist. dist = D (M, N) / N (22)

【０１２２】(VI) サブパターン辞書７５の構造上記サブパターン辞書７５には各サブパターン単位での
標準パターンを登録する。標準パターンのデータは、上
述の特徴パターンの特徴量Ｆnと同様の可変長の方向付
座標列であり、予め用意したサブパターンの座標データ
列に対して正規化部７８による正規化処理と特徴抽出部
７９による特徴抽出処理を行なって作成したものであ
る。また、上記サブパターン辞書７５には、各サブパタ
ーンの各種情報がヘッダ情報として付加されている。ヘ
ッダ情報には図１５に示すようなものがある。図１５に
おいて、“ストローク数部"には上記正規のストローク
数が格納され、“第１方向差平均部",“第２方向差平均
部"には上記第１軸との方向差平均値ｍ¹,第２軸との方
向差平均値ｍ²が格納され、“第１方向差偏差部",“第
２方向差偏差値"には上記第１軸との方向差標準偏差値
ｖ¹,第２軸との方向差標準偏差値ｖ²が格納される。上
記サブパターン辞書７５の辞書構造は、一つのサブパタ
ーン辞書当たり上記ヘッダ部と可変長の座標データ列で
構成されるのである。(VI) Structure of Sub-Pattern Dictionary 75 Standard patterns in each sub-pattern unit are registered in the sub-pattern dictionary 75. The data of the standard pattern is a variable-length coordinated coordinate sequence similar to the feature amount Fn of the above-described feature pattern, and the normalization processing and feature extraction by the normalization unit 78 are performed on the coordinate data sequence of the sub-pattern prepared in advance. It is created by performing the feature extraction processing by the unit 79. Further, various information of each sub-pattern is added to the sub-pattern dictionary 75 as header information. Some header information is as shown in FIG. In FIG. 15, the "stroke number part" stores the regular stroke number, and the "first direction difference average part" and the "second direction difference average part" have the direction difference average value m with the first axis. ¹ , the direction difference average value m ² with the second axis is stored, and the direction difference standard deviation value v ¹ with the first axis is stored in the “first direction difference deviation section” and the “second direction difference deviation value”. The directional difference standard deviation value v ^{2 from} the second axis is stored. The dictionary structure of the sub-pattern dictionary 75 is composed of the header section and variable-length coordinate data string per sub-pattern dictionary.

【０１２３】以下、第１サブパターンの認識処理につい
て説明する。本処理は、入力された座標データ列から抽
出された特徴パターンに対するＤＰマッチング処理によ
って得られたマッチング距離ｄistの近い順に、第５０
位までの第１サブパターン候補を作成する処理である。
通常、オンライン手書き文字認識を行う場合は、この第
１サブパターン認識処理はユーザの入力動作と並列に前
認識処理として実現されるのであるが、本実施例におい
ては、説明を簡単にするために入力処理後の動作として
説明する(並列処理の方法については後述する)。The recognition process of the first sub-pattern will be described below. This process is performed in the order of the matching distance dist, which is obtained by the DP matching process with respect to the characteristic pattern extracted from the input coordinate data sequence, in the order of increasing
This is a process of creating first sub-pattern candidates up to the rank.
Normally, when performing online handwritten character recognition, this first sub-pattern recognition processing is realized as pre-recognition processing in parallel with the user's input operation. However, in the present embodiment, in order to simplify the explanation. The operation after the input processing will be described (the method of parallel processing will be described later).

【０１２４】図１６は、上記正規化部７８,特徴抽出部
７９,方向差判定部８０,ストローク数限定部８１,ＤＰ
マッチング部８２およびサブパターン候補ソーティング
部８３による第１サブパターン認識処理動作のフローチ
ャートである。以下、図１６に従って、第１サブパター
ン認識処理動作について詳細に説明する。ステップＳ21
で、上記特徴抽出用記憶部７３に設定された開始ストロ
ーク・カウンタststrkおよび終了ストローク・カウンタen
strkの内容が、夫々“１"に初期設定される。ステップ
Ｓ22で、上記認識用記憶部７４のサブパターン候補バフ
ァの内容が初期化される。尚、上記サブパターン候補バ
ッファは、１候補当たり図１７に示すような構造を有し
ている。また、本実施例において求められるサブパター
ン候補の数は“５０"である。したがって、本実施例に
おけるサブパターン候補バファは、図１７に示す構造が
５０個並んだメモリ容量/構造を有している。ここで、
上記サブパターン候補バファの初期化では、５０個総て
のＤＰマッチング距離の値に最大距離値maxdist(本実施
例では“２５５")をセットするのである。FIG. 16 shows the normalizing unit 78, the feature extracting unit 79, the direction difference determining unit 80, the stroke number limiting unit 81, and the DP.
9 is a flowchart of a first sub-pattern recognition processing operation by the matching unit 82 and the sub-pattern candidate sorting unit 83. Hereinafter, the first sub-pattern recognition processing operation will be described in detail with reference to FIG. Step S21
, The start stroke counter ststrk and the end stroke counter en set in the feature extraction storage unit 73.
The contents of strk are initialized to "1" respectively. At step S22, the contents of the sub-pattern candidate buffer in the recognition storage section 74 are initialized. The sub-pattern candidate buffer has a structure as shown in FIG. 17 for each candidate. In addition, the number of sub-pattern candidates obtained in this embodiment is “50”. Therefore, the sub-pattern candidate buffer in this embodiment has a memory capacity / structure in which 50 structures shown in FIG. 17 are arranged. here,
In the initialization of the sub-pattern candidate buffer, the maximum distance value maxdist (“255” in this embodiment) is set as the value of all 50 DP matching distances.

【０１２５】ステップＳ23で、上記正規化部７８によっ
て、“ststrk"番目のストロークから“enstrk"番目のス
トロークまでのストロークデータに対して、上述の正規
化処理が行われる。ステップＳ24で、上記特徴抽出部７
９によって、上記ステップＳ23における正規化処理後の
ストロークデータに対して上述の特徴抽出処理が行わ
れ、特徴パターンが得られる。こうして得られた特徴パ
ターンは、特徴抽出用記憶部７３の特徴バッファに格納
される。ステップＳ25で、上記ストローク数限定部８１
によって上記ストローク数限定処理が行われる。すなわ
ち、上記ステップＳ24において抽出された特徴量の１つ
である方向値ｄに基づいて屈曲点数pcが求められ、式
(１８)に従ってストローク数slの限定範囲が算出される
のである。こうして得られたストローク数slの限定範囲
は、特徴抽出用記憶部７３の入力データバッファに格納
される。ステップＳ26で、上記方向差判定部８０によっ
て次ストロークへのオフ方向差が計算される。即ち、
“enstrk"番目のストロークの最終特徴点から“enstrk
＋１"番目のストロークの先頭特徴点へのオフ方向が求
められ、このオフ方向と第１,第２軸の方向差dd¹,dd²が
求められるのである。こうして得られた方向差dd¹,dd²
の値は、特徴抽出用記憶部７３の入力データバッファに
格納される。In step S23, the normalization unit 78 performs the above-described normalization processing on the stroke data from the "ststrk" th stroke to the "enstrk" th stroke. In step S24, the feature extraction unit 7
In step 9, the above-mentioned feature extraction process is performed on the stroke data after the normalization process in step S23 to obtain a feature pattern. The feature pattern thus obtained is stored in the feature buffer of the feature extraction storage unit 73. In step S25, the stroke number limiting unit 81
The above-described stroke number limiting process is performed. That is, the bending point number pc is obtained based on the direction value d, which is one of the feature quantities extracted in step S24,
The limited range of the stroke number sl is calculated according to (18). The limited range of the stroke number sl thus obtained is stored in the input data buffer of the feature extraction storage unit 73. In step S26, the off-direction difference to the next stroke is calculated by the direction difference determination unit 80. That is,
"Enstrk" from the last feature point of the "enstrk" th stroke
The off direction to the leading feature point of the +1 "th stroke is obtained, and the direction differences dd ¹ and dd ² between this off direction and the first and second axes are obtained. The direction differences dd ¹ and dd ²
The value of is stored in the input data buffer of the feature extraction storage unit 73.

【０１２６】ステップＳ27で、上述の各ステップで得ら
れて上記特徴バッファおよび入力データ・バッファ等に
格納されている各特徴点の座標データ列,ストローク数s
lの限定範囲,２つの方向差dd¹・dd²,開始ストローク番号
および終了ストローク番号を入力データとして、上記Ｄ
Ｐマッチング部８２によって、後に詳述するようなサブ
パターン認識処理サブルーチンが実行される。ステップ
Ｓ28で、終了ストローク・カウンタenstrkの内容がイン
クリメントされる。ステップＳ29で、上記終了ストロー
ク・カウンタenstrkの内容が、特徴抽出用記憶部７３の
入力座標バッファにストロークデータが格納されている
ストローク数以下であるか否かが判別される。その結
果、入力ストローク数rsより大きければステップＳ30に
進む。一方、入力ストローク数rs以下であれば上記ステ
ップＳ23に戻って、“ststrk"番目のストロークからイ
ンクリメント後の“enstrk"番目のストロークまでのス
トロークデータに対する正規化処理に移行する。こうし
て、ストローク数が上記入力ストローク数rsに至るまで
順次ストローク連鎖を成長しつつ、正規化処理,特徴抽
出処理,ストローク数限定処理,オフ方向差算出処理およ
びサブパターン認識処理が繰り返されるのである。In step S27, the coordinate data string of each feature point and the stroke number s obtained in the above-mentioned steps and stored in the feature buffer, the input data buffer, etc.
The limited range of l, the difference of two directions dd ¹ and dd ² , the start stroke number and the end stroke number are input data, and the above D
The P matching unit 82 executes a sub-pattern recognition processing subroutine, which will be described in detail later. In step S28, the content of the end stroke counter enstrk is incremented. In step S29, it is determined whether or not the content of the end stroke counter enstrk is less than or equal to the number of strokes in which stroke data is stored in the input coordinate buffer of the feature extraction storage unit 73. As a result, if it is larger than the input stroke number rs, the process proceeds to step S30. On the other hand, if the number of input strokes is less than or equal to rs, the process returns to step S23, and shifts to the normalization process for the stroke data from the "ststrk" th stroke to the incremented "enstrk" th stroke. In this way, the normalization process, the feature extraction process, the stroke number limitation process, the off-direction difference calculation process, and the sub-pattern recognition process are repeated while the stroke chain is sequentially grown until the stroke number reaches the input stroke number rs.

【０１２７】ステップＳ30で、サブパターン候補ソーテ
ィング部８３によって、上記ステップＳ27におけるサブ
パターン認識処理の結果上記サブパターン候補バッファ
内に生成された第１サブパターン候補が、マッチング距
離ｄistの小さい順にソーティングされる。そうした
後、第１サブパターン認識処理動作を終了して文字生成
処理動作に移行する。In step S30, the first sub-pattern candidates generated in the sub-pattern candidate buffer as a result of the sub-pattern recognition processing in step S27 are sorted by the sub-pattern candidate sorting section 83 in ascending order of matching distance dist. It After that, the first sub-pattern recognition processing operation is ended and the operation proceeds to the character generation processing operation.

【０１２８】図１８は、上記サブパターン認識処理サブ
ルーチンのフローチャートである。以下、図１８に従っ
て、上記ＤＰマッチング部８２によって実行されるサブ
パターン認識処理動作について詳細に説明する。ステッ
プＳ31で、上記入力データ・バッファに格納されている
ストローク数slの限定範囲が読み出され、この限定範囲
に基づいて、以下のようにしてサブパターン辞書からの
標準パターンを読み出す際の読み出し開始アドレスと読
み出し終了アドレスとが設定される。ここで、上記サブ
パターン辞書７５は、上記正規のストローク数の小さい
順にソートされている。そして、このサブパターン辞書
７５には、登録されている同一ストローク数を有する標
準パターンの先頭アドレスと終了アドレスとがストロー
ク数に対応付けられて成るテーブルが、付加情報として
登録されている。そこで、上記ストローク数slの限定範
囲の下限値に基づいて上記テーブルが参照されて、標準
パターンを読み出す際の読み出し開始アドレスが得られ
るのである。こうして設定された読み出し開始アドレス
が内部メモリに設定された開始ポインタdicpにセットさ
れる。ステップＳ32で、上記ストローク数slの限定範囲
の上限値に基づいて上記テーブルが参照されて標準パタ
ーンを読み出す際の読み出し終了アドレスが設定され
る。そして、上記内部メモリに設定された終了ポインタ
endpにセットされる。ステップＳ33で、上記サブパター
ン候補バッファにおける“距離部"内から最大距離が検
索される。そして、得られた最大距離が最大距離レジス
タmaxdistに格納される。FIG. 18 is a flowchart of the sub pattern recognition processing subroutine. Hereinafter, the sub-pattern recognition processing operation performed by the DP matching unit 82 will be described in detail with reference to FIG. In step S31, the limited range of the number of strokes sl stored in the input data buffer is read out, and based on this limited range, the reading start when reading the standard pattern from the sub-pattern dictionary is started as follows. The address and the read end address are set. Here, the sub-pattern dictionary 75 is sorted in ascending order of the regular stroke number. Then, in the sub-pattern dictionary 75, a table in which the start address and the end address of the registered standard pattern having the same number of strokes are associated with the number of strokes is registered as additional information. Therefore, the table is referred to based on the lower limit value of the limited range of the stroke number sl to obtain the read start address when reading the standard pattern. The read start address thus set is set in the start pointer dicp set in the internal memory. In step S32, the table is referred to based on the upper limit value of the limited range of the number of strokes sl, and the read end address for reading the standard pattern is set. And the end pointer set in the internal memory
Set to endp. In step S33, the maximum distance is searched from within the "distance part" in the sub-pattern candidate buffer. Then, the obtained maximum distance is stored in the maximum distance register maxdist.

【０１２９】ステップＳ34で、上記サブパターン辞書７
５における開始ポインタdicpで指定されるアドレスから
ヘッダ部が読み出されて、内部メモリに格納される。ス
テップＳ35で、上記第１サブパターン認識処理動作のフ
ローチャート(図１６参照)におけるステップＳ26におい
て求められた当該標準パターンの方向差dd¹,dd²と上記
内部メモリ内の方向差情報とに基づいて、上述したスキ
ップ処理が行われる。そして、当該ヘッダ部に係る標準
パターンをＤＰマッチング処理の対象とせずに、次の標
準パターンにスキップするか否かが判定される。その結
果、スキップする場合にはステップＳ40に進み、スキッ
プしない場合にはステップＳ36に進む。ステップＳ36
で、当該特徴パターンと当該標準パターンとの上記ＤＰ
マッチング処理が実行されて、得られたマッチング距離
ｄistは距離レジスタdistに格納される。ステップＳ37
で、上記距離レジスタdistの内容が最大距離レジスタma
xdistの内容以下であるか否かが判別される。その結
果、以下であればステップＳ38に進み、そうでなければ
ステップＳ40に進む。In step S34, the sub-pattern dictionary 7
The header part is read from the address designated by the start pointer dicp in 5 and stored in the internal memory. In step S35, based on the direction difference information dd ¹ , dd ^{2 of the} standard pattern and the direction difference information in the internal memory obtained in step S26 in the flowchart of the first sub-pattern recognition processing operation (see FIG. 16). The skip processing described above is performed. Then, it is determined whether or not the standard pattern related to the header part is skipped to the next standard pattern without being the target of the DP matching process. As a result, if skipped, the process proceeds to step S40, and if not skipped, the process proceeds to step S36. Step S36
Then, the DP of the characteristic pattern and the standard pattern
The matching process is executed, and the obtained matching distance dist is stored in the distance register dist. Step S37
Then, the contents of the above distance register dist is the maximum distance register ma
It is determined whether or not it is less than or equal to the content of xdist. As a result, if the following is true, the process proceeds to step S38, and if not, the process proceeds to step S40.

【０１３０】ステップＳ38で、上記サブパターン候補バ
ッファにおける最大距離maxdistを呈するサブパターン
候補の情報が当該標準パターンの情報に書き換えられ
る。この情報書き換えは、次のように行われる。すなわ
ち、“距離部"には距離レジスタdistの内容が格納さ
れ、“開始ストローク部"には開始ストローク・カウンタ
ststrkの内容が格納され、“ストローク数部"には（終
了ストローク・カウンタenstrkの内容−開始ストローク・
カウンタststrk＋１）の演算結果が格納され、“サブパ
ターン・コード部"には当該ヘッダ部のサブパターン・コ
ードが格納される。ここで、上記最大距離maxdistを呈
するサブパターン候補が複数存在する場合には、最小ア
ドレスのサブパターン候補の情報が書き換えられる。ス
テップＳ39で、上記サブパターン候補バッファにおける
“距離部"内から最大距離が検索される。そして、得ら
れた最大距離が最大距離レジスタmaxdistに格納され
る。In step S38, the information of the sub-pattern candidate having the maximum distance maxdist in the sub-pattern candidate buffer is rewritten with the information of the standard pattern. This rewriting of information is performed as follows. That is, the "distance part" stores the contents of the distance register dist, and the "start stroke part" contains the start stroke counter.
The contents of ststrk are stored, and the "stroke part" is (contents of end stroke counter enstrk-start stroke
The calculation result of the counter ststrk + 1) is stored, and the "sub-pattern code part" stores the sub-pattern code of the header part. Here, when there are a plurality of sub-pattern candidates having the maximum distance maxdist, the information of the sub-pattern candidate having the minimum address is rewritten. In step S39, the maximum distance is searched from within the "distance portion" in the sub-pattern candidate buffer. Then, the obtained maximum distance is stored in the maximum distance register maxdist.

【０１３１】ステップＳ40で、上記開始ポインタdicpの
内容がインクリメントされる。ステップＳ41で、上記開
始ポインタdicpの内容が終了ポインタendpに内容より小
さいか否かが判別される。その結果、小さければ上記ス
テップＳ34に戻って当該特徴パターンに対する次の標準
パターンを用いたＤＰマッチング処理に移行する。一
方、終了ポインタendp以上であれば、サブパターン認識
処理サブルーチンを終了してメインルーチンにリターン
する。At step S40, the content of the start pointer dicp is incremented. In step S41, it is determined whether or not the content of the start pointer dicp is smaller than the content of the end pointer endp. As a result, if it is smaller, the process returns to the step S34 to shift to the DP matching process using the next standard pattern for the feature pattern. On the other hand, if it is equal to or more than the end pointer endp, the sub pattern recognition processing subroutine is ended and the process returns to the main routine.

【０１３２】こうして、上記サブパターン認識処理サブ
ルーチンが終了すると、上記サブパターン候補バッファ
内には当該特徴パターンに類似した標準パターンから成
る最大５０個のサブパターン候補が格納されるのであ
る。Thus, when the sub-pattern recognition processing subroutine is completed, a maximum of 50 sub-pattern candidates consisting of standard patterns similar to the characteristic pattern are stored in the sub-pattern candidate buffer.

【０１３３】図１９は、上記正規化部７８,特徴抽出部
７９,ストローク数限定部８１,ＤＰマッチング部８２,
文字生成部８４,文字候補展開部８７および文字候補ソ
ーティング部８８による文字生成処理動作のフローチャ
ートである。以下、図１９に従って、文字生成処理動作
について詳細に説明する。上記第１サブパターン認識処
理動作が終了すると文字生成処理動作がスタートする。FIG. 19 shows the normalizing unit 78, the feature extracting unit 79, the stroke number limiting unit 81, the DP matching unit 82,
9 is a flowchart of a character generation processing operation by a character generation unit 84, a character candidate expansion unit 87, and a character candidate sorting unit 88. Hereinafter, the character generation processing operation will be described in detail with reference to FIG. When the first sub-pattern recognition processing operation is completed, the character generation processing operation is started.

【０１３４】ステップＳ51で、上記第１サブパターン認
識処理動作によって得られて上記サブパターン候補バッ
ファ内に格納されている第１サブパターン認識結果の内
容によって、認識用記憶部７４の文字候補バッファが初
期化される。この初期化は、次のように行われる。すな
わち、図２０において、“距離部"には、上記サブパタ
ーン候補バッファの“距離部"の内容である“ＤＰマッ
チング距離"が格納される。以下、同様に、“終了スト
ローク部"には、“ストローク数部"の内容である“入力
のストローク数"が格納される。また、“サブパターン・
コード部"には、“サブパターン・コード部"の内容であ
る“サブパターン・コード"が格納される。そして、“サ
ブパターン数部"には“１"が格納される。尚、上記サブ
パターン・コード部のデータ長における記号Ｎspは、１
文字を構成する最大サブパターン数であり、サブパター
ンの定義によって変動する(本実施例においては
“８")。In step S51, the character candidate buffer of the recognition storage unit 74 is determined by the contents of the first sub-pattern recognition result obtained by the first sub-pattern recognition processing operation and stored in the sub-pattern candidate buffer. It is initialized. This initialization is performed as follows. That is, in FIG. 20, “DP matching distance”, which is the content of “distance part” of the sub-pattern candidate buffer, is stored in “distance part”. Similarly, the "end stroke part" stores the "input stroke count", which is the content of the "stroke count part". In addition, "sub pattern
The "code part" stores "sub-pattern code" which is the contents of the "sub-pattern code part", and "1" is stored in the "sub-pattern number part". The symbol Nsp in the data length of the pattern code part is 1
It is the maximum number of sub-patterns that form a character and varies depending on the definition of the sub-pattern ("8" in this embodiment).

【０１３５】ここで、上記文字候補バッファは、図２０
に示す構成が２００個並ぶメモリ容量/構造である。ま
た、上記サブパターン候補バッファには最大５０個の第
１サブパターン候補が格納されている。したがって、文
字候補バッファが初期化された段階では最大５０個の文
字候補情報がＤＰマッチング距離ｄistの近い順に書き
込まれることになる。図２１は、文字「故」が“９"スト
ロークで手書き入力された際における初期化直後の文字
候補バッファの内容を模式的に示す。尚、図２１では、
見やすいように、図２０の項目を並べ変えている。The character candidate buffer is shown in FIG.
The memory capacity / structure in which the 200 structures shown in FIG. Further, a maximum of 50 first sub-pattern candidates are stored in the sub-pattern candidate buffer. Therefore, at the stage when the character candidate buffer is initialized, a maximum of 50 pieces of character candidate information are written in the ascending order of the DP matching distance dist. FIG. 21 schematically shows the contents of the character candidate buffer immediately after the initialization when the character "behind" is input by handwriting with a "9" stroke. In addition, in FIG.
The items in FIG. 20 are rearranged for easier viewing.

【０１３６】ステップＳ52で、初期化後の上記文字候補
バッファにおける“終了ストローク数部"内から最小終
了ストローク数が検索される。そして、得られた最小終
了ストローク数が最小終了ストローク数レジスタminens
trkに格納される。その後、再度上記文字候補バッファ
が検索されて、最小終了ストローク数を有する第１サブ
パターン候補が選出される。ステップＳ53で、上記最小
終了ストローク数レジスタminenstrkの内容が入力スト
ローク数に等しいか否かが判別される。その結果、等し
ければステップＳ65に進み、等しくなければステップＳ
54に進む。図２１に示す例においては、最小終了ストロ
ーク数“２"を有する第１サブパターン候補「十」,「ナ」お
よび「亠」が検索されて、入力ストローク数“９"に等し
くないと判定される。In step S52, the minimum end stroke number is searched from the "end stroke number portion" in the initialized character candidate buffer. Then, the obtained minimum end stroke number is the minimum end stroke number register minens.
Stored in trk. Then, the character candidate buffer is searched again, and the first sub-pattern candidate having the minimum end stroke number is selected. In step S53, it is determined whether or not the content of the minimum end stroke number register minenstrk is equal to the input stroke number. As a result, if they are equal, the process proceeds to step S65, and if they are not equal, the process proceeds to step S65.
Proceed to 54. In the example shown in FIG. 21, the first sub-pattern candidates “ten”, “na”, and “jam” having the minimum end stroke number “2” are searched and it is determined that they are not equal to the input stroke number “9”. It

【０１３７】ステップＳ54で、上記認識用記憶部７４の
文字構成テーブルが参照されて、上記ステップＳ52にお
いて選択された第１サブパターン候補に続く第２サブパ
ターン候補が限定される。例えば、第１サブパターン候
補「十」,「ナ」および「亠」に続く第２サブパターン候補と
しては、第１サブパターン候補第２サブパターン候補「十」 → 「力(協)」,「兄(克)」,… 「ナ」 → 「ム(雄)」,「工(左)」,… 「亠」 → 「田(畝)」,「里(裏)」… 等に限定される。こうすることによって、第２サブパタ
ーン以降の入力パターンの認識の際のＤＰマッチング処
理の高速化を図るのである。In step S54, the character configuration table in the recognition storage section 74 is referred to, and the second sub-pattern candidates following the first sub-pattern candidate selected in step S52 are limited. For example, as the second sub-pattern candidates following the first sub-pattern candidates "ten", "na" and "ja", the first sub-pattern candidate is the second sub-pattern candidate "ten" → "force (cooperative)", " My brother (Katsu) ", ..." Na "→" Mu (male) "," Engineering (left) ", ..." Weapon "→" Ta (ridge) "," Sato (back) "... etc. By doing so, the DP matching process at the time of recognizing the input patterns after the second sub-pattern can be speeded up.

【０１３８】ステップＳ55で、上記開始ストローク・カ
ウンタststrkに“最小終了ストローク数レジスタminens
trkの内容＋１"がセットされる。ステップＳ56で、上記
終了ストローク・カウンタenstrkに“最小終了ストロー
ク数レジスタminenstrkの内容＋１"がセットされる。ス
テップＳ57で、上記第１サブパターン認識処理動作にお
ける上記ステップＳ22と同様にして、サブパターン候補
バファの内容が初期化される。ステップＳ58で、上記終
了ストローク・カウンタenstrkの内容が上記入力ストロ
ーク数よりも小さいか否かが判別される。その結果、入
力ストローク数よりも小さければステップＳ59に進み、
入力ストローク数以上であればステップＳ64に進む。In step S55, the start stroke counter ststrk is set to "minimum end stroke number register minens".
The content of trk + 1 is set. In step S56, the content of the minimum stroke count register minenstrk + 1 is set in the end stroke counter enstrk. In step S57, the first sub-pattern recognition processing operation is performed. The contents of the sub-pattern candidate buffer are initialized in the same manner as in step S22, and it is determined in step S58 whether the contents of the end stroke counter enstrk are smaller than the number of input strokes. , If it is smaller than the number of input strokes, proceed to step S59,
If it is equal to or larger than the number of input strokes, the process proceeds to step S64.

【０１３９】ステップＳ59〜ステップＳ63で、上記第１
サブパターン認識処理動作における上記ステップＳ23〜
ステップＳ25,ステップＳ27,ステップＳ28と同様にし
て、上記正規化処理,特徴抽出処理,ストローク数限定処
理,サブパターン認識処理サブルーチンおよび終了スト
ローク・カウンタenstrkインクリメントが実行されて、
上記ステップＳ58に戻る。尚、その際に、上記サブパタ
ーン認識処理時におけるＤＰマッチングでは、上記ステ
ップＳ54において限定されたサブパターンの標準パター
ンのみを用いて行われる。こうして、ＤＰマッチングに
要する時間を極力少なくして、認識時間が短縮されるの
である。In steps S59 to S63, the first
From step S23 in the sub-pattern recognition processing operation
Similar to step S25, step S27 and step S28, the normalization process, the feature extraction process, the stroke number limiting process, the sub pattern recognition process subroutine and the end stroke counter enstrk increment are executed,
Return to step S58. At that time, the DP matching at the time of the sub-pattern recognition processing is performed using only the standard pattern of the sub-patterns limited in step S54. In this way, the time required for DP matching is minimized and the recognition time is shortened.

【０１４０】こうして、上述の動作が、終了ストローク
・カウンタenstrkの内容が上記入力ストローク数以上に
なるまで行われる。そして、上記ステップＳ58において
終了ストローク・カウンタenstrkの内容が上記入力スト
ローク数以上であると判定されると、ステップＳ64に進
む。In this way, the above operation is performed until the content of the end stroke counter enstrk becomes equal to or more than the number of input strokes. If it is determined in step S58 that the content of the end stroke counter enstrk is equal to or greater than the number of input strokes, the process proceeds to step S64.

【０１４１】ステップＳ64で、上記文字候補展開部８７
によって、後に詳述するような文字展開処理サブルーチ
ンが実行されて上記ステップＳ52に戻る。以後、上述の
動作が、最小終了ストローク数レジスタminenstrkの内
容が入力ストローク数に等しくなるまで行われる。そし
て、上記ステップＳ53において最小終了ストローク数レ
ジスタminenstrkの内容が入力ストローク数に等しいと
判定されるとステップＳ65に進む。ステップＳ65で、上
記文字候補ソーティング部８８によって、上記文字候補
バッファ内に生成された文字候補が距離の小さい順にソ
ーティングされて認識結果が得られ、文字生成処理動作
動作を終了する。In step S64, the character candidate expansion section 87 is used.
As a result, a character expansion processing subroutine, which will be described in detail later, is executed and the process returns to step S52. After that, the above operation is performed until the content of the minimum end stroke number register minenstrk becomes equal to the input stroke number. If it is determined in step S53 that the content of the minimum end stroke number register minenstrk is equal to the input stroke number, the process proceeds to step S65. In step S65, the character candidate sorting unit 88 sorts the character candidates generated in the character candidate buffer in ascending order of distance to obtain a recognition result, and ends the character generation processing operation.

【０１４２】つまり、本文字生成処理においては、上記
第１サブパターン認識処理の結果得られた第１サブパタ
ーン候補のうち最小ストローク数の第１サブパターン候
補を選出し、この最小ストローク数の次のストローク数
を新たな開始ストローク数と見なして図１６の第１スト
ローク認識処理動作を実行し、第１サブパターン候補に
続くサブパターン候補を求め、得られたサブパターン候
補に基づいて文字候補を生成するのである。That is, in this character generation process, the first sub-pattern candidate having the minimum stroke number is selected from the first sub-pattern candidates obtained as a result of the first sub-pattern recognition process, and the next sub-stroke number is selected. 16 is executed as the new starting stroke number, the first stroke recognition processing operation of FIG. 16 is executed, a sub pattern candidate following the first sub pattern candidate is obtained, and a character candidate is obtained based on the obtained sub pattern candidate. To generate.

【０１４３】図２２は、上記文字展開処理サブルーチン
のフローチャートである。以下、図２２に従って、上記
文字候補展開部８７によって実行される文字展開処理動
作について詳細に説明する。この文字展開処理は、上記
文字候補バッファとサブパターン候補バッファとの内容
を入力とし、上記文字構成テーブルを参照して、新たな
文字候補を作成する処理である。上記文字生成処理動作
のフローチャートの上記ステップＳ58において終了スト
ローク・カウンタenstrkの内容が入力ストローク数に等
しいと判別されると文字展開処理動作がスタートする。FIG. 22 is a flow chart of the character expansion processing subroutine. Hereinafter, the character expansion processing operation executed by the character candidate expansion unit 87 will be described in detail with reference to FIG. This character expansion process is a process of inputting the contents of the character candidate buffer and the sub-pattern candidate buffer and referring to the character configuration table to create a new character candidate. When it is determined in step S58 of the flowchart of the character generation processing operation that the content of the end stroke counter enstrk is equal to the number of input strokes, the character expansion processing operation starts.

【０１４４】ステップＳ71で、上記文字候補バッファに
おける“距離部"から最大距離が検索され、最大距離レ
ジスタmaxdistにセットされる。ステップＳ72で、上記
認識用記憶部７４に設定された新文字候補カウンタnew c
countに、“現文字候補カウンタc countの内容(文字候
補バッファ内に生成された文字候補数)＋１"がセットさ
れる。ステップＳ73で、上記認識用記憶部７４に設定さ
れた文字候補カウンタccに初期値“１"がセットされ
る。In step S71, the character candidate buffer is stored.
The maximum distance is searched from the "distance part" in the
Set to Dista maxdist. In step S72, above
New character candidate counter new set in the recognition storage unit 74 c
In count, "current character candidate counter c contents of count (character
The number of character candidates generated in the complementary buffer) +1 "is set.
Be done. In step S73, it is set in the recognition storage unit 74.
The initial value "1" is set in the selected character candidate counter cc.
It

【０１４５】ステップＳ74で、文字候補カウンタccの内
容が現文字候補カウンタc countの内容以下であるか否
かが判別される。その結果、以下であればステップＳ75
に進み、そうでなければステップＳ86に進む。ステップ
Ｓ75で、上記文字候補バッファにおける“cc"番目の文
字候補の“終了ストローク数部"の内容(すなわち、終了
ストローク数)が、上記最小終了ストローク数レジスタm
inenstrkの内容(以下、注目終了ストローク数と言う)に
等しいか否かが判別される。その結果、等しい場合には
ステップＳ76に進み、等しくない場合にはステップＳ85
に進む。At step S74, the contents of the character candidate counter cc is changed to the current character candidate counter c. It is determined whether or not it is less than or equal to the content of count. As a result, if below, step S75
Otherwise, to step S86. In step S75, the content of the "end stroke number part" of the "cc" th character candidate in the character candidate buffer (that is, the end stroke number) is stored in the minimum end stroke number register m.
It is determined whether or not it is equal to the content of inenstrk (hereinafter referred to as the noticed end stroke number). As a result, if they are equal, the process proceeds to step S76, and if they are not equal, the step S85.
Proceed to.

【０１４６】ステップＳ76で、上記認識用記憶部７４に
設定されたサブパターン候補カウンタscに初期値“１"
がセットされる。尚、上記サブパターン候補カウンタに
は、図１９に示す文字生成処理動作の上記ステップＳ62
におけるサブパターン認識によって、第２サブパターン
以降のサブパターン候補が格納されている。ステップＳ
77で、サブパターン候補カウンタscの内容がサブパター
ン候補数以下であるか否かが判別される。その結果、以
下であればステップＳ78に進み、そうでなければステッ
プＳ85に進む。ステップＳ78で、上記文字候補バッファ
における“cc"番目の文字候補(複数存在する場合あり)
の“サブパターン・コード部"の内容と上記サブパターン
候補バッファにおける“sc"番目のサブパターン候補の
“サブパターン・コード部"の内容とから成るサブパター
ン・コード列が生成される。つまり、上記注目終了スト
ローク数を呈する第１サブパターン候補とこれに続くサ
ブパターン候補とのサブパターン・コード列が生成され
るのである。In step S76, the initial value "1" is set in the sub-pattern candidate counter sc set in the recognition storage section 74.
Is set. The sub-pattern candidate counter is set to the step S62 of the character generation processing operation shown in FIG.
Sub-pattern candidates after the second sub-pattern are stored by the sub-pattern recognition in. Step S
At 77, it is determined whether or not the content of the sub-pattern candidate counter sc is less than or equal to the number of sub-pattern candidates. As a result, if it is below, the process proceeds to step S78, and if not, the process proceeds to step S85. In step S78, the "cc" th character candidate in the character candidate buffer (there may be more than one)
, And the content of the "sub-pattern code part" of the "sc" -th sub-pattern candidate in the sub-pattern candidate buffer is generated. That is, a sub-pattern code string of the first sub-pattern candidate having the number of strokes of the attention end and the subsequent sub-pattern candidates is generated.

【０１４７】ステップＳ79で、上記文字構成テーブルが
参照されて、上記ステップＳ78において生成されたサブ
パターン・コード列を構成サブパターンとする文字が存
在するか否かが判別される。その結果、存在すればステ
ップＳ80に進み、無ければステップＳ84に進む。ステッ
プＳ80で、上記ステップＳ79において検索された文字に
係る平均距離c distが、式(２３)に従って算出される。 c dist＝〔{ＣＤ_p-1・(ｐ−１)}＋ＳＤ_p〕/ｐ …（２３）ここで、ｐ：対象とする文字を構成するサブパタ
ーン数ＣＤ_p-1：対象とする文字を構成する(ｐ−１)番目まで
のサブパターン列で成る文字候補の平均距離ＳＤ_p：対象とする文字を構成するｐ番目のサブパター
ン候補のＤＰマッチング距離In step S79, the above-mentioned character composition table is
By referring to the sub generated in step S78,
There is a character whose pattern code string is a constituent sub-pattern.
It is determined whether or not it exists. As a result, if it exists,
If not, the process proceeds to step S84. Step
In step S80, select the character searched in step S79.
Average distance c dist is calculated according to equation (23). c dist = [{CD_p-1・ (P-1)} + SD_p] / P (23) where, p: sub-pattern forming the target character
Number of CDs_p-1: Up to the (p-1) th which constitutes the target character
Average distance of character candidates consisting of subpattern sequence of SD_p: P-th sub-pattern that makes up the target character
DP matching distance of candidate

【０１４８】ステップＳ81で、上記ステップＳ80におい
て算出された平均距離c distの値が最大距離レジスタma
xdistの内容よりも大きいか否かが判別される。その結
果、大きければステップＳ84に進み、そうでなければス
テップＳ82に進む。ステップＳ82で、上記ステップＳ79
において検索された文字が、現在得られている文字候補
における最大距離よりも小さいので、上記ステップＳ78
において作成されたサブパターン・コード列に基づいて
新たな文字候補(新文字候補)を作成して文字候補に追加
する。尚、新文字候補作成は具体的には以下のようにし
て行われる。In step S81, the average distance c calculated in step S80 is calculated. The value of dist is the maximum distance register ma
It is determined whether or not it is larger than the content of xdist. As a result, if it is larger, the process proceeds to step S84, and if not, the process proceeds to step S82. In step S82, the above step S79
Since the character searched in is smaller than the maximum distance of the character candidates currently obtained, the above step S78
A new character candidate (new character candidate) is created on the basis of the sub-pattern code string created in (3) and added to the character candidate. Note that the new character candidate creation is specifically performed as follows.

【０１４９】すなわち、上記文字候補バッファにおける
“new c count”番目の文字候補の“終了ストローク数
部"および“サブパターン数部"の内容として、“cc"番
目の文字候補の“終了ストローク数部"および“サブパ
ターン数部"の内容が複写される。次に、上記サブパタ
ーン候補バッファにおける“sc"番目のサブパターン候
補の内容に基づいて以下に示す処理が行われる。・上記文字候補バッファにおける“new c count"番目の
文字候補(以下、新文字候補と言う)の“距離部"には、
上記ステップＳ80において算出された平均距離c distが
格納される。・上記新文字候補の“終了ストローク数部"には、“sc"
番目のサブパターン候補における“ストローク数部"の
内容が加算される。・上記新文字候補の“サブパターン数部"の内容に“１"
が加算される。・上記新文字候補の“サブパターン・コード部"の内容の
終端に、“sc"番目のサブパターン候補における“サブ
パターン・コード部"の内容が追加される。・上記新文字候補の“サブパターン・コード部"の内容に
“sc"番目のサブパターン候補における“サブパターン・
コード部"の内容を追加して得られたサブパターン・コー
ド列が上記文字構成テーブルにおける唯一のサブパター
ン・コード列である場合には、上記文字構成テーブルに
おける当該サブパターン・コード列で構成される文字の
コードが上記新文字候補の“文字コード部"に格納され
る。That is, "new" in the character candidate buffer c The contents of "end stroke number part" and "sub pattern number part" of "cc" th character candidate are copied as the contents of "end stroke number part" and "sub pattern number part" of the count "th character candidate. Next, the following processing is performed based on the content of the "sc" th sub-pattern candidate in the sub-pattern candidate buffer: "new" in the character candidate buffer c The "distance part" of count "th character candidate (hereinafter referred to as new character candidate)" is
Average distance c calculated in step S80 dist is stored.・ "Sc" is added to the "end stroke part" of the above new character candidates.
The contents of the "stroke part" in the th sub-pattern candidate are added. -"1" in the content of "number of sub-patterns" of the above new character candidate
Is added. -The contents of the "subpattern code part" in the "sc" th subpattern candidate are added to the end of the contents of the "subpattern code part" of the new character candidate. The content of the "subpattern code part" of the above new character candidate is "subpattern" in the "sc" th subpattern candidate.
When the sub-pattern code string obtained by adding the contents of the "code part" is the only sub-pattern code string in the character configuration table, it is composed of the sub-pattern code string in the character configuration table. The code of the character to be stored is stored in the "character code part" of the new character candidate.

【０１５０】ステップＳ83で、上記新文字候補カウンタ
new c countの内容がインクリメントされる。ステップ
Ｓ84で、上記サブパターン候補カウンタscの内容がイン
クリメントされた後上記ステップＳ77に戻る。こうし
て、上述の動作がサブパターン候補カウンタscの内容が
上記サブパターン候補数より大きくなるまで行われる。
そして、上記ステップＳ77においてサブパターン候補数
scの内容が上記サブパターン候補数より大きいと判定さ
れると、ステップＳ85に進む。ステップＳ85で、上記文
字候補カウンタccの内容がインクリメントされた後上記
ステップＳ74に戻る。こうして、上述の動作が、上記文
字候補カウンタccの内容が上記現文字候補カウンタc co
untの内容より大きくなるまで行われる。そして、上記
ステップＳ74において上記現文字候補カウンタc count
の内容より大きいと判定されると、ステップＳ86に進
む。In step S83, the new character candidate counter
new c The contents of count are incremented. After the content of the sub-pattern candidate counter sc is incremented in step S84, the process returns to step S77. In this way, the above operation is performed until the content of the sub pattern candidate counter sc becomes larger than the number of sub pattern candidates.
Then, in the above step S77, the number of sub-pattern candidates
If it is determined that the content of sc is larger than the number of sub-pattern candidates, the process proceeds to step S85. After the content of the character candidate counter cc is incremented in step S85, the process returns to step S74. Thus, the above-mentioned operation is performed such that the contents of the character candidate counter cc is the same as the current character candidate counter c. co
It is performed until it becomes larger than the contents of unt. Then, in step S74, the current character candidate counter c count
If it is determined that it is larger than the content of, the process proceeds to step S86.

【０１５１】ステップＳ86で、文字候補の詰め込みが行
われる。ここで、上記文字候補の詰め込みは、以下のよ
うにして行われる。すなわち、上記文字展開処理が実行
されることによって、上記注目終了ストローク数を有す
る文字候補は、第２サブパターン以降のサブパターンが
付加されて、上記新文字候補として生まれ変わって新た
に登録されている。したがって、もはや上記注目終了ス
トローク数を有する文字候補は必要とはしないのであ
る。そこで、上記文字候補バッファにおける先頭の文字
候補から“c count"番目の文字候補までの文字候補のう
ち、上記注目終了ストローク数以下の終了ストローク数
を有する文字候補の数をカウントして文字候補消去数カ
ウンタe countにセットする。そうした後、上記注目終
了ストローク数以下の終了ストローク数を有する文字候
補を削除し、未消去の文字候補の格納領域を低アドレス
側の空き領域に順次移動するのである。ステップＳ87
で、上記文字候補消去数カウンタe countの内容に基づ
いて、現文字候補カウンタc countの内容が式(２４)に
従って更新される。 C COUNT＝(NEW COUNT)−(E COUNT) …（２４）ここで、 C COUNT：現文字候補カウンタc countの内容 NEW COUNT：新文字候補カウンタnew c countの内容 E COUNT：文字候補消去数カウンタe countの内容そうした後、文字展開処理サブルーチンを終了して、上
記文字生成処理動作ルーチンにリターンする。In step S86, character candidates are packed. Here, the filling of the character candidates is performed as follows. That is, when the character expansion process is executed, the character candidates having the number of end strokes of interest are added to the second and subsequent sub-patterns and reborn as the new character candidates and newly registered. . Therefore, the character candidates having the number of end strokes of interest are no longer required. Therefore, from the first character candidate in the character candidate buffer above, "c Of the character candidates up to the "count" th character candidate, the number of character candidates having an end stroke number less than or equal to the noted end stroke number is counted to erase the character candidate counter e. Set to count. After that, the character candidates having the end stroke number equal to or less than the noted end stroke number are deleted, and the storage area of the non-erased character candidates is sequentially moved to the empty area on the low address side. Step S87
Then, the above character candidate erase count counter e Based on the contents of count, the current character candidate counter c The contents of count are updated according to equation (24). C COUNT ＝ (NEW COUNT) − (E COUNT) (24) where C COUNT: Current character candidate counter c contents of count NEW COUNT: New character candidate counter new c contents of count E COUNT: Character candidate erase count counter e Contents of count After that, the character expansion processing subroutine is terminated, and the processing returns to the character generation processing operation routine.

【０１５２】こうして、上記文字展開処理サブルーチン
が実行されることによって、上記第１サブパターン候補
のうち入力ストローク数に満たないサブパターン候補に
第２サブパターン候補以降のサブパターン候補が付加さ
れて、文字候補が得られるのである。By executing the character expansion processing sub-routine in this manner, the sub-pattern candidates after the second sub-pattern candidate are added to the sub-pattern candidates less than the number of input strokes among the first sub-pattern candidates, Character candidates are obtained.

【０１５３】以下、上述の文字展開処理について、文字
「功」がストローク数“５"で手書き入力された場合を例
に、更に具体的に説明する。ここで、上記第１サブパタ
ーン認識処理による結果、上記文字候補バッファの内容
が、図２３に示すように初期化されているものとする。
また、上記文字生成処理が最小終了スローク数minenstr
k＝３まで終了して、上記サブパターン候補バッファの
内容が図２４に示すようになっているものとする。Hereinafter, the above-mentioned character expansion processing will be described more concretely, taking as an example the case where the character “gaku” is input by handwriting with the stroke number “5”. Here, as a result of the first sub-pattern recognition processing, it is assumed that the contents of the character candidate buffer are initialized as shown in FIG.
Also, the above-mentioned character generation processing ends at the minimum number of minks.
It is assumed that the contents of the sub-pattern candidate buffer are as shown in FIG. 24 after completion of k = 3.

【０１５４】先ず、上記最大距離レジスタmaxdist＝８
０がセットされ、新文字候補カウンタnew c count＝現
文字候補カウンタc count＋１＝４がセットされる。…
ステップＳ71,Ｓ72 次に、上記文字候補カウンタcc＝１がセットされ、この
セット値が現文字候補カウンタc count＝４以下である
ので、文字候補バッファから“cc＝１"番目のサブパタ
ーン候補「工」が読み出される。そして、当該サブパター
ン候補「工」の現在の終了ストローク数“３"が注目終了
ストローク数(＝minenstrk＝３)に一致するので、サブ
パターン候補カウンタscを“１"から“サブパターン候
補数＝３"までインクリメントしつつサブパターン候補
バッファから第１サブパターン候補「工」に続くサブパタ
ーン候補が検索される。そして、検索されたサブパター
ン候補「力」,「方」,「并」が第１サブパターン候補「工」に付
加されて、サブパターン・コード列「工」,「力」と「工」,
「方」と「工」,「并」とが生成される。ところが、この場合
には、サブパターン・コード列「工」,「方」と「工」,「并」は
文字構成テーブルには存在しないので、サブパターン・
コード列「工」,「力」のみの平均距離c distが式(２３)に
従って算出される。こうして、算出された平均距離c dis
t＝３６が最大距離レジスタmaxdist＝８０より小さいの
で、新文字候補「工,力」が作成されて、図２５に示すよ
うに、“new c count＝４"番目の文字候補として登録さ
れる。また、その“ストローク数部"には文字候補「工」
の終了ストローク数＝３にサブパターン候補「力」のスト
ローク数＝２を加算した“５"が格納され、“サブパタ
ーン数部"には文字候補「工」のサブパターン数＝１に
“１"を加算した値“２"が格納され、新文字候補カウン
タnew c countが“５"にインクリメントされる。そうし
た後、上記文字候補カウンタccがインクリメントされ
る。…ステップＳ73〜Ｓ84,ステップＳ77,ステップＳ８
５First, the maximum distance register maxdist = 8
0 is set, new character candidate counter new c count = current
Character candidate counter c count + 1 = 4 is set. …
Steps S71, S72 Next, the character candidate counter cc = 1 is set, and
The set value is the current character candidate counter c count = 4 or less
Therefore, the "cc = 1" th sub-pattern from the character candidate buffer
The candidate "engine" is read. And the sub putter
The current end stroke number “3” of the candidate “Engine” ends
Since it matches the number of strokes (= minenstrk = 3),
Set the pattern candidate counter sc from "1" to "sub pattern
Sub-pattern candidates while incrementing to complement = 3 "
Sub pattern following the first sub-pattern candidate "Kou" from the buffer
The candidate candidates are searched. And the searched sub putter
The candidate "power", "one", "parallel" is attached to the first sub-pattern candidate "engine"
In addition, the sub-pattern code string "work", "force" and "work",
“Person”, “work”, and “parallel” are generated. However, in this case
Is a sub-pattern code string "Kou", "kata" and "Kou", "parallel"
Since it does not exist in the character composition table, the subpattern
Average distance c only for code string "Engineer" and "Force" dist becomes the formula (23)
Therefore, it is calculated. Thus, the average distance c calculated dis
t = 36 is less than maximum distance register maxdist = 80
Then, a new character candidate "Kou, Riki" is created and shown in Fig. 25.
Sea urchin "new c registered as count = 4 "th character candidate
Be done. In addition, the character candidate "Kou" is included in the "stroke part".
End stroke number = 3 and the sub-pattern candidate "power" strikes
"5" that is the sum of the number of roques = 2 is stored, and "sub pattern" is stored.
The number of sub-patterns of the character candidate "Kou" is set to 1
The value "2" obtained by adding "1" is stored, and the new character candidate count
New c count is incremented to "5". So
After that, the above character candidate counter cc is incremented
It ... Steps S73 to S84, Step S77, Step S8
5

【０１５５】“ｃｃ＝２"番目のサブパターン候補「土」
の現在の終了ストローク数“３"は注目終了ストローク
数“３"に一致するので、第１サブパターン候補「土」に
続くサブパターン候補が検索される。そして、サブパタ
ーン・コード列「土」,「力」と「土」,「方」と「土」,「并」とが
生成される。ところが、この場合には、サブパターン・
コード列「土」,「力」と「土」,「并」は文字構成テーブルには
存在しないので、サブパターン・コード列「土」,「方」のみ
の平均距離c distが算出される。そして、得られた平均
距離c dist＝６１が最大距離レジスタmaxdist＝８０よ
り小さいので新文字候補「土,方」が作成されて、図２６
に示すように、“new c count＝５"番目の文字候補とし
て登録され、その“終了ストローク数部"および“サブ
パターン数部"の内容が格納され、新文字候補カウンタn
ew c countが“６"にインクリメントされる。そうした
後、上記文字候補カウンタccがインクリメントされる。
…ステップＳ74〜Ｓ84,ステップＳ77,ステップＳ85"Cc = 2" th sub-pattern candidate "Sat"
Since the current end stroke number “3” of “3” matches the target end stroke number “3”, a sub pattern candidate following the first sub pattern candidate “Sat” is searched. Then, the sub-pattern / code sequence “Soil”, “Force” and “Sat”, “One” and “Sat”, and “Parallel” are generated. However, in this case, the subpattern
Since the code strings "earth", "force" and "earth", "parallel" do not exist in the character composition table, the average distance c of only the sub-pattern code series "earth", "one" dist is calculated. And the obtained average distance c Since dist = 61 is smaller than the maximum distance register maxdist = 80, a new character candidate “Sat, kata” is created, as shown in FIG.
As shown in “new c count = 5 Registered as the 5th character candidate, the contents of the "end stroke number part" and the "subpattern number part" are stored, and the new character candidate counter n
ew c count is incremented to "6". After that, the character candidate counter cc is incremented.
... Steps S74 to S84, Step S77, Step S85

【０１５６】“cc＝３"番目のサブパターン候補「气」が
読み出される。ところが、当該サブパターン候補「气」に
おける現在の終了ストローク数“４"が注目終了ストロ
ーク数＝３に一致しないので、文字候補カウンタccがイ
ンクリメントされる。…ステップＳ74,Ｓ75,Ｓ85 以下、同様の処理が行われ、図２６に示すような文字候
補が上記文字候補バッファ内に展開される。The "cc = 3" th sub-pattern candidate "gas" is read. However, since the current end stroke number “4” in the sub-pattern candidate “gas” does not match the target end stroke number = 3, the character candidate counter cc is incremented. Steps S74, S75, S85 and thereafter, similar processing is performed, and character candidates as shown in FIG. 26 are developed in the character candidate buffer.

【０１５７】次に、図２６に示す文字候補バッファ内の
文字候補における“終了ストローク数"が注目終了スト
ローク数(＝minenstrk＝３)に等しい文字候補「工」,「土」
が削除されて残った文字候補が詰め込まれる。…ステッ
プＳ86 そうした後、上記文字生成処理動作にリターンして文字
候補がソーティングされた後、終了ストローク数が入力
ストローク数“５"に満たない文字候補「气」が削除され
る。その結果、図２７に示すような文字候補が得られる
のである。…ステップＳ65Next, in the character candidates in the character candidate buffer shown in FIG. 26, the "end stroke number" is equal to the noticed end stroke number (= minenstrk = 3).
Is deleted and the remaining character candidates are packed. Step S86 After that, after returning to the above character generation processing operation to sort the character candidates, the character candidate "gas" whose number of end strokes is less than the number "5" of input strokes is deleted. As a result, character candidates as shown in FIG. 27 are obtained. … Step S65

【０１５８】上述のようにして第１サブパターン認識処
理,文字生成処理および文字展開処理が行われて、図２
７に示すような文字候補が上記文字候補バッファ内に形
成されると、出力部８５によって、第１位の文字候補
「工,力」から生成された認識結果「功」の文字コードが結
果表示部７６に送出されて表示されるのである。As described above, the first sub-pattern recognition process, the character generation process, and the character expansion process are performed, and
When a character candidate as shown in 7 is formed in the character candidate buffer, the output unit 85 displays the character code of the recognition result “gong” generated from the first character candidate “work, strength”. It is sent to the unit 76 and displayed.

【０１５９】このように、本実施例においては、上記サ
ブパターン辞書７５に登録する標準パターンにはヘッダ
部を設け、このヘッダ部には、２つの軸との方向差の平
均値ｍ¹,ｍ²と標準偏差値ｖ¹,ｖ²を格納しておく。そし
て、上記方向差判定部８０によって、次ストロークへの
オフ方向差をdd¹,dd²を式(１３)で算出し、この算出結
果dd¹,dd²とサブパターン辞書７５から読み出した標準
パターンに付加されている上記方向差の平均値ｍ¹,ｍ²
および標準偏差値ｖ¹,ｖ²とが式(１６),(１７)を満たさ
ないような標準パターンは、ＤＰマッチングに有効では
ないと判定するようにしている。したがって、後にＤＰ
マッチング部８２によって行われるＤＰマッチング処理
は、有効な標準パターンのみを用いて迅速に行うことが
できるのである。As described above, in the present embodiment, the standard pattern registered in the sub-pattern dictionary 75 is provided with the header portion, and the header portion has the average value m ¹ , m of the directional difference between the two axes. ² and standard deviation values v ¹ and v ² are stored. Then, the direction difference determination unit 80 calculates the difference in the off direction to the next stroke with dd ¹ and dd ² by the equation (13), and the calculated results dd ¹ and dd ² and the standard pattern read from the sub-pattern dictionary 75. the average value of the direction difference is added to m ^1, m ²
It is determined that the standard pattern in which the standard deviation values v ¹ and v ² do not satisfy the equations (16) and (17) is not effective for DP matching. Therefore, later DP
The DP matching process performed by the matching unit 82 can be quickly performed using only valid standard patterns.

【０１６０】また、上記ストローク数限定部８１によっ
て、式(１８)に従って、ＤＰマッチング処理の対象とな
っている特徴パターンの正規のストローク数をある範囲
に限定する。そして、ＤＰマッチング部８２によるＤＰ
マッチング処理の際には、上記限定範囲内に入る標準パ
ターンのみを選択して用いるようにしている。したがっ
て、上記特徴パターンに類似した標準パターンのみを用
いて、上記ＤＰマッチング処理を更に迅速に行うことが
できる。Further, the stroke number limiting unit 81 limits the regular stroke number of the characteristic pattern which is the target of the DP matching processing to a certain range according to the equation (18). Then, the DP by the DP matching unit 82
In the matching process, only standard patterns within the above-mentioned limited range are selected and used. Therefore, the DP matching process can be performed more quickly using only the standard pattern similar to the characteristic pattern.

【０１６１】また、文字と構成サブパターンとを対応付
けてなる文字構成テーブルを認識用記憶部７４に登録
し、上記文字生成部８４による文字生成処理に際して
は、上記文字構成テーブルを参照して、第１サブパター
ンの認識結果に続くサブパターンの候補を限定するよう
にしている。したがって、上記文字生成処理時における
ＤＰマッチングに際して用いる標準パターンを上記限定
結果に従って絞り込むことができ、上記文字認識処理を
迅速に行うことができる。Further, a character structure table in which characters and structure sub-patterns are associated with each other is registered in the recognition storage unit 74, and in the character generation processing by the character generation unit 84, the character structure table is referred to, Sub-pattern candidates following the recognition result of the first sub-pattern are limited. Therefore, the standard pattern used for DP matching in the character generation processing can be narrowed down according to the limited result, and the character recognition processing can be performed quickly.

【０１６２】すなわち、本実施例によれば、手書き入力
された文字の認識処理を高速に実施することが可能にな
る。このことは、単に認識結果を早く得ることを可能に
するのみならず、サブパターン辞書２５に登録する標準
パターンのバリエーション数を増加して「続け字」や「崩
し字」に対する認識率を高めることを可能にするのであ
る。That is, according to the present embodiment, it is possible to perform the recognition processing of the characters input by handwriting at high speed. This not only makes it possible to quickly obtain the recognition result, but also increases the number of variations of the standard pattern registered in the sub-pattern dictionary 25 to increase the recognition rate for "continuation characters" and "collapsed characters". Is possible.

【０１６３】上記各実施例では、説明を簡単にするため
に、第１〜第４実施例における特徴再抽出部８,１８,３
８,５８による特徴再抽出処理、あるいは、第５実施例
における特徴抽出処理や第１サブパターン認識処理は、
１文字分の入力パターンから抽出された特徴点の座標デ
ータ列が特徴格納部６,１６,３６,５６に格納された
後、あるいは、１文字分の座標データ列や特徴パターン
が上記入力座標バッファや特徴バッファに格納された後
に、実行されるとして説明している。ところが、通常、
人間の１文字の筆記には２秒程度掛かるので、その筆記
時間を利用して上記特徴再抽出処理あるいは特徴抽出処
理や第１サブパターン認識処理を上述の入力処理と並列
して行うようにすれば、オンライン手書き文字認識が可
能となる。In each of the above-mentioned embodiments, the feature re-extracting units 8, 18, 3 in the first to fourth embodiments are described for the sake of simplicity.
The feature re-extraction processing by 8, 58, the feature extraction processing and the first sub-pattern recognition processing in the fifth embodiment are
After the coordinate data string of the feature point extracted from the input pattern for one character is stored in the feature storage unit 6, 16, 36, 56, or the coordinate data string or feature pattern for one character is input to the input coordinate buffer. It is described as being executed after being stored in the feature buffer. However, usually
Since it takes about 2 seconds to write one character by a human, the writing time may be used to perform the feature re-extracting process or the feature extracting process and the first sub-pattern recognition process in parallel with the above-mentioned input process. For example, online handwriting recognition is possible.

【０１６４】その際には、第１〜第４実施例における特
徴格納部６,１６,３６,５６、あるいは、第５実施例に
おける上記入力座標バッファ,特徴バッファ,入力データ
・バッファ等をデュアルポート・メモリで構成して、情報
の書き込みと読み出しとを平行に実行すればよい。尚、
このようなオンライン手書き文字認識処理は、上述のよ
うに、手書き文字の認識処理を高速に実行可能になって
初めて実現可能となるのである。In this case, the feature storage units 6, 16, 36, 56 in the first to fourth embodiments, or the input coordinate buffer, feature buffer, input data buffer, etc. in the fifth embodiment are dual-ported. -A memory may be used to write and read information in parallel. still,
Such online handwritten character recognition processing can be realized only when the handwritten character recognition processing can be executed at high speed as described above.

【０１６５】上記第１実施例における座標点間距離Ｎim
in,方向差Ｎdist,ストローク終了判定時間Ｎtime,正規
化範囲Ｎnorm,補間判定距離Ｎfmax,特徴点間限界距離Ｎ
flen,ノイズ判定距離Ｎfmin,ノイズ判定方向差Ｎddst、
あるいは、第５実施例における正規化範囲Ｎnorm,補間
間隔ｗ,方向値量子化数Ｄ,直交２軸の方向値Ｒ^j,定数Ｃ
s,方向差重みＷd,定数Ｃw,整合窓範囲ww等の数値はほん
の一例であり、認識精度や使用状況に応じて適切に設定
すればよい。また、上記各実施例は単独に実施されても
構わないが、適宜に組み合わせても一向に差し支えな
い。例えば、上記正規化前と正規化後とに特徴を抽出す
る第１実施例における前処理を第５実施例の前処理に適
用すること、あるいは、第３,第４実施例におけるＤＰ
マッチングの重み変化を第５実施例におけるＤＰマッチ
ングに適用すること等が考えられる。Distance between coordinate points Nim in the first embodiment.
in, direction difference Ndist, stroke end determination time Ntime, normalized range Nnorm, interpolation determination distance Nfmax, inter-feature point limit distance N
flen, noise judgment distance Nfmin, noise judgment direction difference Nddst,
Alternatively, the normalization range Nnorm, the interpolation interval w, the direction value quantization number D, the direction value R ^{j of} two orthogonal axes, and the constant C in the fifth embodiment.
Numerical values such as s, direction difference weight Wd, constant Cw, matching window range ww, etc. are merely examples, and may be appropriately set according to the recognition accuracy and the usage situation. Further, each of the above-mentioned embodiments may be carried out independently, but may be appropriately combined with each other. For example, the preprocessing in the first embodiment for extracting the features before and after the normalization is applied to the preprocessing in the fifth embodiment, or the DP in the third and fourth embodiments is applied.
It is conceivable to apply the matching weight change to the DP matching in the fifth embodiment.

【０１６６】尚、第１実施例における前処理を第５実施
例の前処理に適用した際には、以下のように、続け字に
対して高い認識結果を得ることができる。すなわち、４
万字分の楷書の筆跡データから作成された約１万４千の
サブパターンの標準パターンをサブパターン辞書に登録
する。そして、複数画が続けて記入された約２万７千字
分の漢字の筆跡データ(約４００字×６７名)を入力し
て、手書き文字認識を行った。その結果、表１に示すよ
うに、第１位の認識候補での正解率は９６.０％と高
く、第２位以下の認識候補を含めれば更に高く９８.７
％の正解率を得ることができた。この正解率は、従来の
手書き認識装置（１９９３年電子情報通信学会全国大
会，Ｄ５７８で発表）に比較して格段に高い値である。When the preprocessing in the first embodiment is applied to the preprocessing in the fifth embodiment, it is possible to obtain a high recognition result for continuous characters as follows. Ie 4
The standard patterns of about 14,000 sub-patterns created from handwriting data of ten thousand characters are registered in the sub-pattern dictionary. Then, handwriting data was recognized by inputting handwriting data (about 400 characters × 67 names) of about 27,000 kanji characters in which a plurality of strokes were continuously written. As a result, as shown in Table 1, the correct answer rate for the first-ranked recognition candidate is as high as 96.0%, and it is even higher if the second-ranked and lower-ranked recognition candidates are included.
We were able to obtain a correct answer rate of%. This correct answer rate is significantly higher than that of a conventional handwriting recognition device (announced at D578, National Convention of the Institute of Electronics, Information and Communication Engineers, 1993).

【表１】 [Table 1]

【０１６７】また、上述したように、複数ストロークの
集合であるサブパターンを認識の単位とすることによっ
て辞書の容量を小さくできるのであるが、このことを具
体例を上げて説明する。ここで、・認識対象パターンは「早」である。・パターン「早」を手書きする際に発生し得るストローク
の種類の数は“１８"である。・パターン「早」を構成するに際して組み合わせ得るスト
ローク数とその場合の組み合わせ数は下記の通りであ
る。ストローク数組み合わせ数ストローク数組み合わせ数６１３１０５５２５４１０１１と仮定する。As described above, the capacity of the dictionary can be reduced by using the sub-pattern, which is a set of a plurality of strokes, as a recognition unit. This will be described with a specific example. Here, the recognition target pattern is “fast”. The number of types of strokes that can occur when handwriting the pattern "Haya" is "18". -The number of strokes that can be combined when forming the pattern "Haya" and the number of combinations in that case are as follows. Number of strokes Number of combinations Number of strokes Number of combinations 6 1 3 10 5 5 2 5 4 4 10 1 1 is assumed.

【０１６８】(ａ) 従来の辞書(スロトーク辞書＋パタ
ーン辞書) 従来の辞書では、パターン「早」を手書きする際に発生し
得る全てのストロークをストローク辞書に登録し、パタ
ーン「早」を構成する全てのストロークの組み合わせをパ
ターン辞書に登録している。・スロトーク辞書１ストローク当たりの記憶容量をストローク・コード＝２バイトストロークを表現する１６区間の方向列＝１６バイト合計＝１８バイトとすると、発生し得るストローク数は“１８"であるか
ら、ストローク辞書の必要最小容量は１８バイト×１８ストローク＝３２４バイトとなる。(A) Conventional Dictionary (Slotok Dictionary + Pattern Dictionary) In the conventional dictionary, all strokes that can occur when handwriting the pattern “Haya” are registered in the stroke dictionary to form the pattern “Haya”. All stroke combinations are registered in the pattern dictionary. Slotalk dictionary The storage capacity per stroke is stroke code = 2 bytes Directional sequence of 16 sections expressing stroke = 16 bytes Total = 18 bytes, the number of possible strokes is "18", so the stroke dictionary The minimum required capacity is 18 bytes x 18 strokes = 324 bytes.

【０１６９】・パターン辞書１パターン候補当たりの記憶容量をパターン・コード＝２バイトストローク数＝１バイト方向情報＝４バイトストローク・コード列＝２バイト×ストローク数ストローク間の位置情報＝１バイト×ストローク数合計＝７バイト＋３バイト×ストローク数とすると、組み合わせ得るストローク数とその場合に発
生する組み合わせ数とは上述の通りであるから、文字辞
書の必要最小容量は次のようになる。１×(７バイト＋３バイト×６ストローク) ＋５×(７バイト＋３バイト×５ストローク) ＋１０×(７バイト＋３バイト×４ストローク) ＋１０×(７バイト＋３バイト×３ストローク) ＋５×(７バイト＋３バイト×２ストローク) ＋１×(７バイト＋３バイト×１ストローク) ＝５６０バイトとなる。Pattern dictionary: Storage capacity per pattern candidate: pattern code = 2 bytes Stroke number = 1 byte Direction information = 4 bytes Stroke code string = 2 bytes x number of strokes Position information between strokes = 1 byte x stroke Assuming that the total number = 7 bytes + 3 bytes × stroke number, the number of strokes that can be combined and the number of combinations generated in that case are as described above. Therefore, the minimum required capacity of the character dictionary is as follows. 1 x (7 bytes + 3 bytes x 6 strokes) + 5 x (7 bytes + 3 bytes x 5 strokes) + 10 x (7 bytes + 3 bytes x 4 strokes) + 10 x (7 bytes + 3 bytes x 3 strokes) + 5 x (7 bytes + 3 Byte x 2 strokes) + 1 x (7 bytes + 3 bytes x 1 stroke) = 560 bytes.

【０１７０】・従来の辞書の必要最小容量ストローク辞書の必要最小容量＋パターン辞書の必要最小容量＝３２４バイト＋５６０バイト＝８８４バイトRequired minimum capacity of conventional dictionary Required minimum capacity of stroke dictionary + Minimum required capacity of pattern dictionary = 324 bytes +560 bytes = 884 bytes

【０１７１】(ｂ) 本実施例の辞書(サブパターン辞書) 本実施例の辞書では、図２(e)に示すサブパターン「早」
をサブパターン辞書に登録している。・サブパターン辞書１サブパターン当たりの記憶容量をサブパターン・コード＝２バイトストローク数＝１バイト座標数＝１バイト方向情報＝４バイト特徴データ＝４バイト×特徴点数合計＝８バイト＋４バイト×特徴点数とすると、図２(e)における特徴点数は“２０"であるか
ら、サブパターン辞書の必要最小容量は８バイト＋４バイト×２０点＝８８バイトとなる。(B) Dictionary of this Embodiment (Sub-Pattern Dictionary) In the dictionary of this embodiment, the sub-pattern “Haya” shown in FIG.
Is registered in the sub-pattern dictionary.・ Sub-pattern dictionary Storage capacity per sub-pattern Sub-pattern code = 2 bytes Stroke number = 1 byte Coordinate number = 1 byte Direction information = 4 bytes Feature data = 4 bytes x number of feature points Total = 8 bytes + 4 bytes x Features 2 (e), the minimum required capacity of the sub-pattern dictionary is 8 bytes + 4 bytes × 20 points = 88 bytes.

【０１７２】上述のように、統計上一筆書きされ易いパ
ターンをサブパターンとすることによって辞書容量を大
幅に縮小できるのである。したがって、サブパターンの
変形パターンを多量に登録することが可能になり、筆跡
パターンの認識率を大幅に向上できるのである。As described above, the dictionary capacity can be greatly reduced by using subpatterns that are patterns that are statistically easy to write with one stroke. Therefore, it becomes possible to register a large number of modified patterns of sub patterns, and the recognition rate of handwriting patterns can be greatly improved.

【０１７３】[0173]

【発明の効果】以上より明らかなように、請求項１に係
る発明の手書き認識装置は、サブパターン及び各サブパ
ターンの変形パターンから求めた標準パターンを格納し
たサブパターン辞書と、オフストロークを補間するオフ
ストローク補間手段を有して、認識部は、複数スロトー
クとそのオフストロークとに係る各特徴量を有する特徴
点の列を一つの特徴パターンと見なして上記サブパター
ン辞書との照合を行うので、「入り」,「跳ね」,「続け」等を
有する変形パターンの標準パターンを予め上記サブパタ
ーン辞書に登録しておけば、安定して高い認識率で「続
け字」や「崩し字」を含む手書き文字等を認識できるので
ある。さらに、統計上一筆書きされ易いサブパターンを
辞書照合の単位としているので、ストロークや１文字を
単位とする場合よりも照合辞書の容量を少なくでき、照
合速度を速めることができる。As is apparent from the above, the handwriting recognition apparatus according to the first aspect of the present invention interpolates the off-stroke and the sub-pattern dictionary that stores the standard pattern obtained from the sub-pattern and the modified pattern of each sub-pattern. Since the recognizing unit has the off-stroke interpolating means for performing the collation with the sub-pattern dictionary, the sequence of the feature points having the feature amounts related to the plurality of slot strokes and the off-strokes is regarded as one feature pattern. If you register a standard pattern of transformation patterns that include, "enter", "bounce", "continue", etc. in the above sub-pattern dictionary in advance, you can consistently recognize "continuation characters" and "collapsed characters" with a high recognition rate. It is possible to recognize handwritten characters and the like. Further, since the sub-pattern that is easily written in one stroke statistically is used as the unit for dictionary matching, the capacity of the matching dictionary can be reduced and the matching speed can be increased as compared with the case where strokes or one character are used as the unit.

【０１７４】すなわち、この発明によれば、電子手帳の
ようなプアーなハード構成における「続け字」や「崩し字」
のオンライン認識を容易に可能にするのである。That is, according to the present invention, the "continuation character" or the "collapsed character" in a poor hardware structure such as an electronic notebook.
It makes it easy to recognize online.

【０１７５】また、請求項２に係る発明の手書き認識装
置における特徴抽出部は、筆跡パターンにおける屈曲部
の特徴点と特徴量を抽出する第１特徴抽出部と、正規化
後の特徴点間を等間隔近似する特徴点と特徴量を抽出す
る第２特徴抽出部を有するので、正規化前の座標データ
列から屈曲部の特徴点と特徴量を抽出できる。したがっ
て、後の正規化によって屈曲部の特徴が消滅したとして
も屈曲部を表す特徴点と特徴量を確実に抽出できる。さ
らに、上記正規化によって屈曲部の特徴が消滅しないた
めに、等分割法と角度法とを併用した特徴抽出を行うこ
とができ、筆跡ストロークの特徴を適確に表す特徴量列
を得ることができる。Further, the feature extraction unit in the handwriting recognition apparatus of the invention according to claim 2 connects between the first feature extraction unit for extracting the feature point and the feature amount of the bent portion in the handwriting pattern and the normalized feature point. Since the second feature extraction unit that extracts the feature points and the feature amount that are approximated at equal intervals is included, the feature point and the feature amount of the bent portion can be extracted from the coordinate data sequence before normalization. Therefore, even if the feature of the bent portion disappears due to the later normalization, the feature point and the feature amount representing the bent portion can be reliably extracted. Further, since the feature of the bent portion is not eliminated by the above normalization, feature extraction using both the equal division method and the angle method can be performed, and a feature amount sequence that accurately represents the feature of the handwriting stroke can be obtained. it can.

【０１７６】また、請求項３に係る発明の手書き認識装
置における認識部は、距離算出手段によって、テーブル
を引いて上記特徴パターン上の各特徴点と標準パターン
上の各特徴点との特徴量の差を求め、この各差の値に基
づいて上記両パターンの距離を算出するので、テーブル
を引くという簡単な操作のみで両パターンの距離算出用
の変数値を求めることができる。したがって、この発明
によれば、高速な辞書照合が可能となる。Further, the recognizing unit in the handwriting recognizing device of the invention according to claim 3 draws a table by the distance calculating means to calculate the feature amount of each feature point on the feature pattern and each feature point on the standard pattern. Since the difference is calculated and the distance between both patterns is calculated based on the value of each difference, the variable value for distance calculation of both patterns can be calculated only by a simple operation of drawing a table. Therefore, according to the present invention, high-speed dictionary matching can be performed.

【０１７７】また、請求項４に係る発明の手書き認識装
置における認識部は、ＤＰマッチング手段によって、照
合の対象となるパターン上における特徴点間毎に変動重
みを付けたＤＰマッチング処理を行うので、上記筆跡パ
ターンのストロークに沿った特徴の現れ方に応じて、Ｄ
Ｐマッチング時における時間伸長特性を変更できる。し
たがって、この発明によれば、上記筆跡パターンの特徴
を強調したＤＰマッチングが可能となり、更なる認識率
の向上を図ることができるのである。Further, the recognizing unit in the handwriting recognizing apparatus of the invention according to claim 4 performs the DP matching processing in which the DP matching means carries out the DP weighting with the variation weight for each feature point on the pattern to be collated. Depending on the appearance of the feature along the stroke of the handwriting pattern, D
The time extension characteristic during P matching can be changed. Therefore, according to the present invention, it is possible to perform DP matching that emphasizes the characteristics of the handwriting pattern, and it is possible to further improve the recognition rate.

【０１７８】また、請求項５に係る発明の手書き認識装
置は、変動重み算出部によって、複数の学習用特徴パタ
ーンと上記標準パターンとのＤＰマッチング処理を行っ
た際のマッチングパス上の点に係るマッチング距離を求
め、全学習用特徴パターンに係るマッチング距離のばら
つきの度合いを上記変動重みとするので、安定して特徴
点が得られる区間を強調したＤＰマッチングが行われ
る。したがって、この発明によれば、安定して高い認識
率を得ることができる。Further, the handwriting recognition apparatus of the invention according to claim 5 relates to the point on the matching path when the DP matching processing of the plurality of learning feature patterns and the standard pattern is performed by the variation weight calculation section. Since the matching distance is obtained and the degree of variation in the matching distances related to all learning feature patterns is used as the variation weight, DP matching is performed by emphasizing a section in which stable feature points are obtained. Therefore, according to the present invention, a high recognition rate can be stably obtained.

【０１７９】また、請求項６に係る発明の手書き認識装
置は、変動重み算出部によって、上記特徴パターン上の
特徴点間の距離と当該特徴パターンにおける全特徴点を
入力順に結ぶ距離との比の値を上記変動重みとするの
で、上記筆跡パターンにおける屈曲部の特徴を強調した
ＤＰマッチングが行われる。したがって、この発明によ
れば、屈曲部のずれを吸収して高い認識率を得ることが
できる。Further, in the handwriting recognition apparatus of the invention according to claim 6, the variation weight calculation unit calculates the ratio of the distance between the characteristic points on the characteristic pattern and the distance connecting all the characteristic points in the characteristic pattern in the input order. Since the value is used as the variation weight, DP matching that emphasizes the characteristic of the bent portion in the handwriting pattern is performed. Therefore, according to the present invention, it is possible to absorb the deviation of the bent portion and obtain a high recognition rate.

【０１８０】また、請求項７に係る発明の手書き認識装
置における認識部は、サブパターン認識手段によって上
記筆跡パターンをサブパターン毎に認識し、筆跡パター
ン認識手段によって、得られたサブパターン候補の組み
合わせと代表パターン構成テーブルとを照合して上記筆
跡パターンを認識するので、一筆書きされ易いサブパタ
ーン単位で認識処理を行うことができる。したがって、
この発明によれば、少ない認識候補で迅速に上記筆記パ
ターンの認識を行うことができる。さらに、「続け字」や
「崩し字」に対して強い手書き認識処理装置を提供でき
る。Further, the recognizing unit in the handwriting recognizing device of the invention according to claim 7 recognizes the handwriting pattern for each subpattern by the subpattern recognizing means, and combines the subpattern candidates obtained by the handwriting pattern recognizing means. Since the above-mentioned handwriting pattern is recognized by comparing with the representative pattern configuration table, it is possible to perform the recognition process in units of sub-patterns that are easy to write with one stroke. Therefore,
According to the present invention, it is possible to quickly recognize the writing pattern with a small number of recognition candidates. Further, it is possible to provide a handwriting recognition processing device that is strong against "continuation characters" and "collapsed characters".

【０１８１】また、請求項８に係る発明の手書き認識装
置における特徴抽出部は、屈曲部特徴抽出手段によっ
て、上記座標データ列における座標点間を結んで成る直
線の方向が所定値以上変化する変化点を特徴点として抽
出し、直線部特徴抽出手段によってこの特徴点間を等間
隔近似する特徴点を抽出し、方向値算出手段によって各
特徴点の方向値を特徴量の一つとして算出するので、等
分割法と角度法とを併用した特徴点抽出を行うことがで
き、筆跡パターンの特徴を適確に表す特徴パターンを抽
出することができる。In the feature extraction unit of the handwriting recognition apparatus according to the eighth aspect of the present invention, the bending feature extraction unit changes the direction of the straight line connecting the coordinate points in the coordinate data string by a predetermined value or more. Since the points are extracted as feature points, the straight-line feature extraction means extracts feature points that approximate the feature points at equal intervals, and the direction value calculation means calculates the direction value of each feature point as one of the feature amounts. The feature points can be extracted by using the equal division method and the angle method together, and the feature pattern that accurately represents the features of the handwriting pattern can be extracted.

【０１８２】また、請求項９に係る発明の手書き認識装
置における認識部は、特徴パターン選出手段によって、
ストローク数限定部で定められた範囲内のストローク数
をヘッダ情報とする標準パターンのみをサブパターン辞
書から選出するので、類似性の高い標準パターンのみを
用いて辞書照合を迅速に行うことができる。The recognizing unit in the handwriting recognizing device of the invention according to claim 9 is characterized by the feature pattern selecting means.
Since only the standard pattern having the number of strokes within the range defined by the stroke number limiting unit as the header information is selected from the sub-pattern dictionary, dictionary matching can be performed quickly using only the highly similar standard pattern.

【０１８３】また、請求項１０に係る発明の手書き認識
装置における認識部は、方向値算出手段によって、当該
特徴パターンの最終特徴点から次ストロークにおける先
頭特徴点への方向値を算出し、有効標準パターン判定手
段によって、上記算出された方向値が入る有効範囲をヘ
ッダ情報とする標準パターンのみを有効標準パターンと
判定するので、有効な標準パターンのみを用いて辞書照
合を迅速に行うことができる。Further, the recognizing unit in the handwriting recognizing device of the invention according to claim 10 calculates the direction value from the final characteristic point of the characteristic pattern to the leading characteristic point in the next stroke by the direction value calculating means, and the effective standard Since the pattern determining unit determines only the standard pattern having the effective range in which the calculated direction value is included as the header information as the effective standard pattern, the dictionary matching can be quickly performed using only the effective standard pattern.

【０１８４】また、請求項１１に係る発明の手書き認識
装置における認識部は、標準パターン選出手段によっ
て、認識済みサブパターン候補連鎖に続くことが可能な
サブパターンを代表パターン構成テーブルを参照して求
め、得られたサブパターンの標準パターンのみをサブパ
ターン辞書から選出するので、有効な標準パターンのみ
を用いて辞書照合を迅速に行うことができる。Further, the recognizing unit in the handwriting recognizing device of the invention according to claim 11 obtains, by the standard pattern selecting means, a sub pattern which can follow the recognized sub pattern candidate chain by referring to the representative pattern configuration table. Since only the standard pattern of the obtained sub-patterns is selected from the sub-pattern dictionary, it is possible to quickly perform dictionary matching using only the valid standard pattern.

【０１８５】また、請求項１２に係る発明の手書き認識
装置は、デュアルポート・メモリでなる入力座標バッフ
ァおよび特徴バッファを有し、制御部の制御によって、
上記特徴抽出部および認識部を上記タブレットと平行動
作させるので、上記入力座標バッファに対する上記タブ
レットによる座標データ列の書き込みと上記特徴抽出部
による読み出し、および、上記特徴バッファに対する上
記特徴抽出部による特徴パターンの書き込みと上記認識
部による読み出しを、平行して行うことができる。した
がって、この発明によれば、上記特徴抽出処理および認
識処理と上記座標データ列生成処理とを平行して実行す
ることができ、オンライン筆跡パターン認識を高速に行
うことができる。すなわち、この発明は、電子手帳のよ
うなプアーなハード構成による「続け字」や「崩し字」のオ
ンライン認識を容易に実現させるのである。Further, the handwriting recognition apparatus of the invention according to claim 12 has an input coordinate buffer and a feature buffer which are dual port memories, and is controlled by the control section.
Since the feature extracting unit and the recognizing unit are operated in parallel with the tablet, writing of the coordinate data string by the tablet to the input coordinate buffer, reading by the feature extracting unit, and feature pattern by the feature extracting unit for the feature buffer. And the reading by the recognition unit can be performed in parallel. Therefore, according to the present invention, the feature extraction processing and the recognition processing and the coordinate data string generation processing can be executed in parallel, and online handwriting pattern recognition can be performed at high speed. In other words, the present invention easily realizes online recognition of "continuation characters" or "collapsed characters" with a poor hardware configuration such as an electronic notebook.

[Brief description of drawings]

【図１】この発明の手書き認識装置の第１実施例におけ
るブロック図である。FIG. 1 is a block diagram of a handwriting recognition device according to a first embodiment of the present invention.

【図２】サブパターンの一例を示す図である。FIG. 2 is a diagram showing an example of a sub pattern.

【図３】上下に分離可能なサブパターンにおける認識結
果の一例を示す図である。FIG. 3 is a diagram showing an example of a recognition result of sub patterns that are vertically separable.

【図４】文字辞書の構成例を示す図である。FIG. 4 is a diagram showing a configuration example of a character dictionary.

【図５】第１実施例における手書き認識処理動作のフロ
ーチャートである。FIG. 5 is a flowchart of a handwriting recognition processing operation in the first embodiment.

【図６】図１におけるストローク処理部による特徴点抽
出の説明図である。FIG. 6 is an explanatory diagram of feature point extraction by a stroke processing unit in FIG. 1.

【図７】入力座標列と特徴点列との対応図である。FIG. 7 is a correspondence diagram of an input coordinate sequence and a feature point sequence.

【図８】この発明の手書き認識装置の第２実施例におけ
るブロック図である。FIG. 8 is a block diagram of a handwriting recognition device according to a second embodiment of the present invention.

【図９】絶対差テーブルおよび距離テーブルの説明図で
ある。FIG. 9 is an explanatory diagram of an absolute difference table and a distance table.

【図１０】方向差テーブルの説明図である。FIG. 10 is an explanatory diagram of a direction difference table.

【図１１】この発明の手書き認識装置の第３実施例にお
けるブロック図である。FIG. 11 is a block diagram of a handwriting recognition device according to a third embodiment of the present invention.

【図１２】この発明の手書き認識装置の第４実施例にお
けるブロック図である。FIG. 12 is a block diagram of a handwriting recognition device according to a fourth embodiment of the present invention.

【図１３】この発明の手書き認識装置の第５実施例にお
けるブロック図である。FIG. 13 is a block diagram of a handwriting recognition device according to a fifth embodiment of the present invention.

【図１４】図１３における制御部の制御の下に行われる
手書き文字認識処理動作のゼネラルフローを示すフロー
チャートである。FIG. 14 is a flowchart showing a general flow of a handwritten character recognition processing operation performed under the control of the control unit in FIG.

【図１５】図１３におけるサブパターン辞書に付加され
たヘッダの構造の説明図である。15 is an explanatory diagram of a structure of a header added to the sub pattern dictionary in FIG.

【図１６】第１サブパターン認識処理動作のフローチャ
ートである。FIG. 16 is a flowchart of a first sub-pattern recognition processing operation.

【図１７】サブパターン候補バッファの構造の説明図で
ある。FIG. 17 is an explanatory diagram of a structure of a sub pattern candidate buffer.

【図１８】サブパターン認識処理サブルーチンのフロー
チャートである。FIG. 18 is a flowchart of a sub pattern recognition processing subroutine.

【図１９】文字生成処理動作のフローチャートである。FIG. 19 is a flowchart of a character generation processing operation.

【図２０】文字候補バッファの構造の説明図である。FIG. 20 is an explanatory diagram of a structure of a character candidate buffer.

【図２１】初期化後における文字候補バッファの内容の
一例を示す模式図である。FIG. 21 is a schematic diagram showing an example of the contents of a character candidate buffer after initialization.

【図２２】文字展開処理サブルーチンのフローチャート
である。FIG. 22 is a flowchart of a character expansion processing subroutine.

【図２３】図２１とは異なる初期化後における文字候補
バッファの内容を示す模式図である。FIG. 23 is a schematic diagram showing the contents of a character candidate buffer after initialization different from that of FIG. 21.

【図２４】図２３に示す文字候補バッファの内容に対応
するサブパターン候補バッファの内容を示す模式図であ
る。24 is a schematic diagram showing the contents of the sub-pattern candidate buffer corresponding to the contents of the character candidate buffer shown in FIG. 23.

【図２５】文字展開処理動作時における図２３に示す文
字候補バッファの内容の変遷を示す図である。25 is a diagram showing changes in the contents of the character candidate buffer shown in FIG. 23 during the character expansion processing operation.

【図２６】図２５に続く文字候補バッファの内容の変遷
を示す図である。FIG. 26 is a diagram showing a transition of the contents of the character candidate buffer subsequent to FIG. 25.

【図２７】図２６に示す文字候補バッファの内容に対し
て文字候補の詰め込みおよびソーティングが行われた際
の文字候補バッファの内容を示す図である。27 is a diagram showing the contents of the character candidate buffer when the character candidates are packed and sorted with respect to the contents of the character candidate buffer shown in FIG. 26.

【図２８】従来の特徴点に対する正規化処理の影響の説
明図である。FIG. 28 is an explanatory diagram of the influence of normalization processing on a conventional feature point.

[Explanation of symbols]

１,１１,３１,５１,７１…認識部、２,１２,３２,５２,
７２…タブレット、３,１３,３３,５３,７６…結果表示
部、５,１５,３５,５５…ストローク処理部、６,１６,
３６,５６…特徴格納部、８,１８,３８,５８…特徴再抽
出部、９,１９,３９,５９…文字認識部、１０,２０,４
０,６０…辞書格納部、４１,６１…重み算出部、
７３…特徴抽出用記憶部、７４…認識用記憶部、
７５…サブパターン辞書、７８…正規化
部、７９…特徴抽出部、８０…方向
差判定部、８１…ストローク数限定部、
８２…ＤＰマッチング部、８４…文字生成
部、８６…制御部、８７…文字候
補展開部。1, 11, 31, 51, 71 ... Recognition unit, 2, 12, 32, 52,
72 ... Tablet, 3,13,33,53,76 ... Result display section 5,15,35,55 ... Stroke processing section 6,16,
36, 56 ... Feature storage unit, 8, 18, 38, 58 ... Feature re-extraction unit, 9, 19, 39, 59 ... Character recognition unit, 10, 20, 4
0,60 ... dictionary storage unit, 41, 61 ... weight calculation unit,
73 ... Feature extraction storage unit, 74 ... Recognition storage unit,
75 ... Sub-pattern dictionary, 78 ... Normalization unit, 79 ... Feature extraction unit, 80 ... Direction difference determination unit, 81 ... Stroke number limiting unit,
82 ... DP matching unit, 84 ... Character generation unit, 86 ... Control unit, 87 ... Character candidate expansion unit.

フロントページの続き (72)発明者北村義弘大阪府大阪市阿倍野区長池町22番22号シャープ株式会社内Front page continued (72) Inventor Yoshihiro Kitamura 22-22 Nagaike-cho, Abeno-ku, Osaka-shi, Osaka

Claims

[Claims]

1. A tablet that generates a coordinate data string formed by arranging the coordinates of handwriting input by handwriting in time series, a feature point representing a feature of a handwriting pattern from the coordinate data string, and a feature amount of the feature point. A feature extraction unit to extract, a recognition unit that recognizes a handwriting pattern by matching a feature pattern consisting of a sequence of feature points having the extracted feature amount with a dictionary, and a result display that displays the recognition result by this recognition unit In the handwriting recognition device having a section, the dictionary standardizes a sequence of feature points having a feature amount extracted from a subpattern formed by separating a representative pattern into a pattern that is statistically easy to write with one stroke and a modified pattern of each subpattern. It is a sub-pattern dictionary for patterns, wherein the feature extraction unit extracts the feature points and the feature amount from the coordinate data sequence of a plurality of strokes, and The substroke dictionary includes off-stroke interpolation means for interpolating off-strokes with feature points, and the recognition unit considers a feature point sequence having feature amounts related to the plurality of strokes and off-strokes as one feature pattern. A handwriting recognition device characterized by performing collation with.

2. The handwriting recognition device according to claim 1, wherein the feature extraction unit extracts a feature point and a feature amount of a bent portion in a handwriting pattern from the coordinate data sequence generated by the tablet. And a feature point sequence extracted by the first feature extraction unit, and a feature that approximates the feature points of the bent portion at equal intervals from the normalized feature point sequence and the normalized value of the coordinate data. A handwriting recognition device comprising a second feature extraction unit for extracting points and feature amounts.

3. The handwriting recognition apparatus according to claim 1, wherein the feature amount of the feature points on the feature pattern obtained by the feature extraction unit, the feature amount of the feature points on the standard pattern, and the difference between the two feature amounts. And a table storage unit that stores a table in which the values are associated with each other, and the recognition unit determines a feature amount of a feature point pair including each feature point on the feature pattern and each feature point on the standard pattern. Based on the above, the table is drawn to obtain the difference between the feature amounts for each of the feature point pairs, and a distance calculating means for calculating the distance between the feature pattern and the standard pattern based on the value of each difference is provided. Characterized handwriting recognition device.

4. The handwriting recognition device according to claim 1, wherein the recognition unit includes a DP matching unit that performs a DP matching process in which a variation weight is given to each feature point on a pattern to be matched. A handwriting recognition device characterized by the above.

5. The handwriting recognition apparatus according to claim 4, wherein the feature extraction unit and the DP matching unit are controlled, and the feature extraction unit comprises a feature point sequence having a feature amount from the learning pattern prepared in advance. A learning characteristic pattern is generated, and a DP matching process between the learning characteristic pattern and the standard pattern is performed by the DP matching means to obtain a matching distance associated with a point on a matching path. 1. A handwriting recognition apparatus, further comprising: a variation weight calculation unit that calculates a degree of variation in matching distances related to all learning feature patterns and uses the variation weight as the variation weight.

6. The handwriting recognition device according to claim 4, wherein the distance from a feature point on the feature pattern extracted by the feature extraction unit to the next feature point and all feature points in the feature pattern are connected in the input order. A handwriting recognition device comprising: a variation weight calculation unit that calculates a value of a ratio to a distance and uses the variation weight for the section related to the feature point.

7. The handwriting recognition apparatus according to claim 1, further comprising: a representative pattern configuration table storage unit that stores a representative pattern configuration table that associates a representative pattern with sub-patterns that form the representative pattern, The recognition unit collates the feature pattern with a sub-pattern dictionary to recognize the handwriting pattern for each sub-pattern, and a sub-pattern candidate obtained as a result of recognition by the sub-pattern recognition unit. A handwriting recognition device comprising a handwriting pattern recognition means for recognizing the handwriting pattern by collating a combination with the representative pattern configuration table.

8. The handwriting recognition apparatus according to claim 7, wherein the feature extraction unit uses a change point at which a direction of a straight line connecting the coordinate points in the coordinate data string changes in a predetermined value or more as a feature point. Bent portion feature extracting means for extracting, straight line feature extracting means for extracting feature points that approximate the feature points extracted by the bend portion feature extracting means at equal intervals, and each feature extracted by both the feature extracting means A handwriting recognition device comprising a direction value calculation means for calculating a direction value representing a direction to a next characteristic point at one point as one of the characteristic amounts.

9. The handwriting recognition apparatus according to claim 8, wherein an approximate straight line number of the handwriting pattern is obtained from the number of characteristic points of the bent portion extracted by the bent portion feature extracting means, and the approximate straight line number and the approximate straight line number are calculated. The stroke number limiting unit that determines the range of the stroke number of the characteristic pattern from the stroke number of the handwriting pattern is provided, and the sub-pattern dictionary includes, for each standard pattern, the stroke number of the representative pattern that is the source of the standard pattern. It has header information, and the recognition unit includes a characteristic pattern selection unit that selects only a standard pattern having the number of strokes within the range defined by the stroke number limiting unit as header information from the sub-pattern dictionary. Characterized handwriting recognition device.

10. The handwriting recognition apparatus according to claim 8, wherein the sub-pattern dictionary includes, for each standard pattern, an effective range of a direction value from a final feature point of the standard pattern to a leading feature point in a next stroke. The standard pattern having the header information is a valid range in which the direction value from the final feature point in the feature pattern calculated by the direction value calculating means to the head feature point in the next stroke is included in the recognition unit. A handwriting recognition device comprising an effective standard pattern determining means for determining only an effective standard pattern.

11. The handwriting recognition apparatus according to claim 7, wherein the recognition unit obtains a sub-pattern that can follow a sub-pattern candidate chain of recognized handwriting patterns by referring to the representative pattern configuration table, A handwriting recognition apparatus comprising a standard pattern selection means for selecting only the standard pattern of the obtained sub patterns from the sub pattern dictionary.

12. The dual-port memory according to claim 1, wherein the coordinate data sequence generated by the tablet is written, and the written coordinate data sequence is read by the feature extracting unit. An input coordinate buffer, a feature buffer extracted from the feature extraction unit, and a feature buffer including a dual port memory from which the written feature pattern is read by the recognition unit; and the feature extraction unit. And a control unit that controls the recognition unit to execute the feature extraction process by the feature extraction unit and the recognition process by the recognition unit and the coordinate data string generation process by the tablet in parallel. Recognition device.