JPH07271917A

JPH07271917A - Method and apparatus for creating handwritten character recognition dictionary

Info

Publication number: JPH07271917A
Application number: JP6064786A
Authority: JP
Inventors: Yoshifumi Sakai; 良文坂井; Yoshitaka Ikeda; 佳隆池田; Toru Yamaguchi; 徹山口
Original assignee: Nippon Steel Corp
Current assignee: Nippon Steel Corp
Priority date: 1994-04-01
Filing date: 1994-04-01
Publication date: 1995-10-20
Anticipated expiration: 2016-05-14
Also published as: JP3164962B2

Abstract

(57)【要約】【目的】離散フーリエ変換や離散コサイン変換等の直交
関数展開を使用して入力文字パターンから特徴ベクトル
を算出し、この特徴ベクトルを用いて手書き文字認識を
行なう場合に使用される辞書において、ユーザによる追
加登録が可能で、コンパクトかつ認識率の高い辞書を作
成できるようにする。【構成】入力文字パターンの特徴ベクトルを算出し（ス
テップ１０２）、特徴空間内におけるその特徴ベクトル
と辞書内に登録された代表点との「近さ」を判定する
（ステップ１０５）。両者が「近い」場合には、算出さ
れた特徴ベクトルに近付くように、登録されている代表
点を修正する（ステップ１０６）。両者が「近くない」
場合には、算出された特徴ベクトルを新たに代表点とし
て追加登録する（ステップ１０７）。 (57) [Abstract] [Purpose] It is used when handwritten character recognition is performed by calculating a feature vector from an input character pattern using orthogonal function expansion such as discrete Fourier transform or discrete cosine transform. It is possible to create a compact dictionary with a high recognition rate that can be additionally registered by the user. [Structure] A feature vector of an input character pattern is calculated (step 102), and "closeness" between the feature vector in the feature space and the representative point registered in the dictionary is determined (step 105). If both are "close", the registered representative point is corrected so as to approach the calculated feature vector (step 106). Both are not near
In this case, the calculated feature vector is additionally registered as a representative point (step 107).

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、手書き文字認識を行な
う際に使用される辞書を作成する方法および装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and apparatus for creating a dictionary used in handwritten character recognition.

【０００２】[0002]

【従来の技術】手書き文字認識を行なう方法の一つとし
て、入力された手書き文字に対する特徴量を算出し、算
出された特徴量に応じて入力された手書き文字がどの文
字であるかを判断する方法がある。この方法は、各文字
ごとに、予めその文字についての代表となるべき特徴量
を求めて辞書に格納しておき、手書き文字認識を行なう
際に、入力された文字の特徴量に最も近いものを辞書に
格納された特徴量の中から検索し、検索された特徴量に
対応する文字をもって認識結果とするものである。特徴
量による方法は、筆順や画数を参照しないでも文字認識
を行なえるため、オンラインおよびオフラインのいずれ
の手書き文字入力装置にも適用でき、さらに、印刷文字
などの読み取りにも適用することができる。なお、特徴
量による方法では、オンライン手書き文字認識による場
合には、画数に関する情報を入手することが可能になる
ので、画数ごとに辞書を編成しておき、入力文字パター
ンの画数に応じて該当する画数の辞書のみを検索するよ
うにすることによって、認識時間の短縮とさらなる認識
率の向上を図ることができる。2. Description of the Related Art As one of methods for recognizing handwritten characters, a feature amount of an input handwritten character is calculated, and which input handwritten character is determined according to the calculated feature amount. There is a way. In this method, for each character, the characteristic amount that should be the representative of that character is obtained in advance and stored in the dictionary, and when recognizing the handwritten character, the one that is closest to the input characteristic amount of the character is calculated. The feature amount stored in the dictionary is searched, and the character corresponding to the searched feature amount is used as the recognition result. Since the method based on the feature amount can perform character recognition without referring to the stroke order or the number of strokes, it can be applied to both online and offline handwritten character input devices, and can also be applied to reading printed characters. In addition, in the method based on the feature amount, it is possible to obtain information on the number of strokes in the case of online handwritten character recognition. Therefore, a dictionary is organized for each number of strokes, and the dictionary is applied according to the number of strokes of the input character pattern. By searching only the stroke number dictionary, the recognition time can be shortened and the recognition rate can be further improved.

【０００３】本明細書において、手書き文字や印刷文字
などの文字認識の対象となるパターンないし図形のこと
を入力文字パターンという。典型的には、入力文字パタ
ーンは、書き手が手書き文字としてタブレットなどを用
いて入力した線の集合として表わされる。また、文字と
は、文字コードと１対１で対応するものである。入力文
字パターンに対して文字認識を行なった結果、その入力
文字パターンが特定の文字に同定されることになる。In the present specification, a pattern or figure which is a target of character recognition such as a handwritten character or a printed character is called an input character pattern. Typically, the input character pattern is represented as a set of lines input by the writer as a handwritten character using a tablet or the like. A character corresponds to a character code on a one-to-one basis. As a result of performing character recognition on the input character pattern, the input character pattern is identified as a specific character.

【０００４】特徴量を用いて手書き文字認識を行なう場
合、特徴量として、スカラー量を使用するのではなく、
ベクトル量を使用するのが一般的である。ここではベク
トル量で表わされた特徴量を特徴ベクトルと呼ぶことに
する。When handwritten character recognition is performed using the feature amount, the scalar amount is not used as the feature amount, but
It is common to use vector quantities. Here, the feature amount represented by the vector amount is called a feature vector.

【０００５】特徴ベクトルによる方法では、入力された
入力文字パターンに対し前処理を施した後、例えば、離
散フーリエ変換や離散コサイン変換などの直交関数展開
を施して各次の係数を求め、算出された各係数を順に配
置することにより、特徴ベクトルが求められる。この特
徴ベクトルはｎ次元空間（この空間を特徴空間という）
内の点を表わすことになる。ｎは１以上の整数である。
一方、辞書には、文字ごとにその文字を代表する特徴ベ
クトルが代表点として登録されている。したがって代表
点も特徴空間内の点である。そして、手書き文字認識を
行なう場合には、入力された入力文字パターンに対する
特徴ベクトルからの距離が最も小さい代表点を辞書から
探し出し、探し出された代表点に対応する文字が、認識
結果として出力される。したがって、例えば「甲」とい
う文字が認識されるためには、予め、辞書内に「甲」と
いう文字に対する代表点が登録されていなければならな
い。In the method using the feature vector, after preprocessing the inputted input character pattern, for example, orthogonal function expansion such as discrete Fourier transform or discrete cosine transform is applied to obtain and calculate coefficients of each degree. A feature vector is obtained by sequentially arranging the respective coefficients. This feature vector is an n-dimensional space (this space is called a feature space)
It represents a point inside. n is an integer of 1 or more.
On the other hand, in the dictionary, a feature vector representing each character is registered as a representative point for each character. Therefore, the representative point is also a point in the feature space. Then, when performing handwritten character recognition, the representative point having the smallest distance from the feature vector to the input character pattern that has been input is searched from the dictionary, and the character corresponding to the found representative point is output as the recognition result. It Therefore, for example, in order for the character "Kou" to be recognized, the representative point for the character "Kou" must be registered in the dictionary in advance.

【０００６】ところで、手書き文字認識を行なう場合、
筆者ごとに書き癖がかなり異なるので、書き手によるば
らつきによらずに認識率が高く維持されるような辞書を
作成することが重要である。このため、同一文字につい
て、多くの書き手によって書かれた多数の入力文字パタ
ーンのサンプルを収集し、これに基づいて辞書を作成す
るのが一般的である。この場合、具体的な辞書の作成方
法としては、(1)採取した入力文字パターンについての
特徴ベクトルを各文字ごとに平均し、平均された特徴ベ
クトルを代表点とする、すなわち、各文字ごとに１つの
代表点を設定する方法、(2)採取した各入力文字パター
ンの特徴ベクトルを算出し、算出した特徴ベクトルの全
部を代表点として登録する、すなわち各文字ごとに多数
の代表点を設定する方法、(3)採取した各入力文字パタ
ーンについて特徴ベクトルを計算し、各文字ごとに平均
を求めるとともにばらつきや特徴量（特徴ベクトルの成
分の値）間の相関を表わす統計量を求め、平均を代表点
として登録するとともに統計量も登録する、すなわち各
文字ごとに１つの代表点と統計パラメータとを設定する
方法がある。By the way, when performing handwritten character recognition,
Since writing habits vary considerably among authors, it is important to create a dictionary that maintains a high recognition rate regardless of writer variations. Therefore, it is common to collect a large number of input character pattern samples written by many writers for the same character and create a dictionary based on the collected sample. In this case, as a specific dictionary creation method, (1) average the feature vectors of the collected input character patterns for each character, and use the averaged feature vector as the representative point, that is, for each character. Method of setting one representative point, (2) Calculate the feature vector of each sampled input character pattern, and register all the calculated feature vectors as representative points, that is, set a large number of representative points for each character. Method, (3) Calculate the feature vector for each sampled input character pattern, calculate the average for each character, and also calculate the statistic that represents the variation and the correlation between the feature quantities (feature vector component values) and calculate the average. There is a method of registering not only a representative point but also a statistic, that is, one representative point and a statistical parameter are set for each character.

【０００７】また、特定のユーザが連続して手書き文字
入力を行なうような場合には、辞書の学習を行なえるよ
うにすることが望ましい。ここで学習とは、その特定の
ユーザの手書き文字に基づく特徴ベクトルを代表点とし
て辞書に登録することである。この学習を行なうことに
より、その特定のユーザの書き癖が辞書に反映されるこ
ととなって、その特定のユーザに対する手書き文字認識
の認識率のさらなる向上を図ることができる。Further, when a specific user continuously inputs handwritten characters, it is desirable to be able to learn a dictionary. Here, learning means registering a feature vector based on the handwritten character of the specific user as a representative point in the dictionary. By carrying out this learning, the writing habit of the specific user is reflected in the dictionary, and the recognition rate of handwritten character recognition for the specific user can be further improved.

【０００８】[0008]

【発明が解決しようとする課題】手書き文字認識におけ
る上述した従来の辞書作成方法には、以下に述べるよう
な問題点がある。上記(1)に示す各文字ごとに１つの代
表点を設定する方法では、書き手や書き方によるばらつ
きが代表点に反映されにくいので、誤認識が多くなりが
ちである。上記(2)に示す入力文字パターンごとに代表
点を設定する方法では、１つの文字を表現するために必
要な情報量が多いので、辞書のサイズが極めて大きくな
り、また、手書き文字認識時の検索に要する時間が長く
なる。上記(3)に示す統計パラメータを付随させる方法
では、この統計パラメータを修正するためには、修正前
の統計パラメータの計算に使用した全ての特徴ベクトル
を必要とするため、事実上、事後的に特定のユーザのた
めの辞書の学習を行なうことができない。The above-mentioned conventional dictionary creating method in handwritten character recognition has the following problems. In the method of setting one representative point for each character shown in (1) above, since the variation due to the writer or the writing style is difficult to be reflected in the representative point, erroneous recognition tends to increase. In the method of setting a representative point for each input character pattern shown in (2) above, since the amount of information required to express one character is large, the size of the dictionary becomes extremely large. It takes a long time to search. In the method of attaching the statistical parameter shown in (3) above, in order to modify this statistical parameter, all the feature vectors used in the calculation of the statistical parameter before modification are required, so in fact Unable to learn a dictionary for a particular user.

【０００９】本発明の目的は、辞書サイズを大きくする
ことなく、書き手や書き方によるばらつきを十分に表現
できかつユーザが自分の手書き文字を登録することので
きる辞書の作成方法および装置を提供することにある。An object of the present invention is to provide a method and apparatus for creating a dictionary in which variations due to writers and writing methods can be sufficiently expressed and a user can register his or her own handwritten character without increasing the dictionary size. It is in.

【００１０】[0010]

【課題を解決するための手段】本発明の手書き文字認識
辞書作成方法は、入力された入力文字パターンに対して
特徴ベクトルを算出し前記特徴ベクトルに応じて前記入
力文字パターンに対応する文字を識別して前記入力文字
パターンの認識を行なう文字認識において使用され、各
文字ごとに当該文字に対応する代表的な特徴ベクトルを
代表点として保持する辞書の作成方法であって、前記辞
書に新たな入力文字パターンを登録することが要求され
た場合に、前記新たな入力文字パターンを入力して前記
新たな入力文字パターンに対する特徴ベクトルを算出す
る第１の工程と、前記辞書を検索して前記特徴ベクトル
が前記特徴ベクトルに対応する文字の前記代表点に近い
かどうかを判定し、近い場合には前記特徴ベクトルに近
付くように前記対応する文字の代表点を修正し、それ以
外の場合には前記特徴ベクトルを前記対応する文字の代
表点として前記辞書に追加する第２の工程とを有する。A handwritten character recognition dictionary creating method of the present invention calculates a feature vector for an input input character pattern and identifies a character corresponding to the input character pattern according to the feature vector. A method of creating a dictionary which is used in character recognition for recognizing the input character pattern and holds a representative feature vector corresponding to the character for each character as a representative point, in which a new input is made to the dictionary. A first step of inputting the new input character pattern and calculating a feature vector for the new input character pattern when it is requested to register a character pattern; and searching the dictionary for the feature vector Is close to the representative point of the character corresponding to the feature vector, and if it is close, the pair is moved so as to approach the feature vector. To correct the representative point of the character, in other cases, and a second step of adding to the dictionary as a representative point of the character to the corresponding said feature vector.

【００１１】本発明の手書き文字認識辞書作成装置は、
入力された入力文字パターンに対して特徴ベクトルを算
出し前記特徴ベクトルに応じて前記入力文字パターンに
対応する文字を識別して前記入力文字パターンの認識を
行なう文字認識において使用され、各文字ごとに当該文
字に対応する代表的な特徴ベクトルを代表点として保持
する辞書の作成装置であって、新たな入力文字パターン
を入力して前記新たな入力文字パターンに対する特徴ベ
クトルを特徴ベクトル算出手段と、前記辞書を検索して
前記特徴ベクトルが前記特徴ベクトルに対応する文字の
前記代表点に近いかどうかを判定する判定手段と、前記
判定手段での判定の結果近い場合には前記特徴ベクトル
に近付くように前記対応する文字の代表点を修正し、そ
れ以外の場合には前記特徴ベクトルを前記対応する文字
の代表点として前記辞書に追加する更新手段とを有す
る。The handwritten character recognition dictionary creating apparatus of the present invention is
Used in character recognition to calculate a feature vector for an input input character pattern, identify a character corresponding to the input character pattern according to the feature vector, and recognize the input character pattern, and for each character A dictionary creating apparatus that holds a representative feature vector corresponding to the character as a representative point, wherein a new input character pattern is input, and a feature vector for the new input character pattern is calculated as a feature vector calculating unit, Determination means for determining whether or not the feature vector is close to the representative point of the character corresponding to the feature vector by searching a dictionary, and if the result of the determination by the determination means is close, approach the feature vector Correct the representative point of the corresponding character, otherwise use the feature vector as the representative point of the corresponding character. And an update means that you want to add to the dictionary.

【００１２】[0012]

【作用】辞書に既に登録されている代表点と新たな入力
文字パターンの特徴ベクトルとを比較し、特徴空間にお
いて両者が「近い」場合には、新たな入力文字パターン
の特徴ベクトルに近付くように既に登録されている代表
点を修正し、両者が「近くない」場合には、新たな入力
文字パターンの特徴ベクトルを新たな代表点として追加
登録するので、代表点の数を極端に増やすことなく認識
率の高い辞書を作成することが可能となる。したがっ
て、ユーザによる追加登録が可能であって、辞書サイズ
を大きくすることなく、かつ書き手や書き方によるばら
つきを十分に表現できる辞書を作成することができる。The representative point already registered in the dictionary is compared with the feature vector of the new input character pattern, and if the two are "close" in the feature space, the feature vector of the new input character pattern is approached. If the representative points that have already been registered are corrected and both are not close, the feature vector of the new input character pattern is additionally registered as a new representative point, so there is no need to increase the number of representative points extremely. It is possible to create a dictionary with a high recognition rate. Therefore, it is possible to create a dictionary that can be additionally registered by the user and can sufficiently express the variation due to the writer and the writing style without increasing the dictionary size.

【００１３】本発明において、「近い」か「近くない」
かの判定方法としては、各種のものが考えられるが、計
算量や作成される辞書の認識率の観点から、例えば、
(1)代表点と特徴ベクトルとの差ベクトルのユークリッ
ドノルムと与えられたしきい値とを比較する、あるい
は、(2)代表点と特徴ベクトルとの差ベクトルの各次元
の成分の絶対値のうちの最大のものと与えられたしきい
値とを比較する、などの方法とすることが望ましい。ま
た、特徴ベクトルに近付くように代表点を修正する場
合、各種の方法が考えられるが、既に入力された入力文
字パターンのうち何個のものにその代表点が対応してい
るかを加味する方法を用いることが望ましい。対応する
入力文字パターンの個数を加味する方法は、例えば、そ
の個数分の重みを代表点につけた上で、代表点と特徴ベ
クトルとの平均をもって修正後の代表点とする方法があ
る。In the present invention, "close" or "not close"
There are various possible methods for determining whether or not, but from the viewpoint of the amount of calculation and the recognition rate of the created dictionary, for example,
(1) Compare the Euclidean norm of the difference vector between the representative point and the feature vector with a given threshold value, or (2) Calculate the absolute value of the component of each dimension of the difference vector between the representative point and the feature vector. It is desirable to use a method such as comparing the maximum one of them with a given threshold. In addition, when modifying the representative point so as to approach the feature vector, various methods are conceivable. However, a method that considers how many of the already-input character patterns the representative point corresponds to is used. It is desirable to use. As a method of adding the number of corresponding input character patterns, for example, a weight is added to the representative points and the average of the representative points and the feature vector is used as the corrected representative points.

【００１４】さらに、上述のように作成された辞書を用
いた場合の認識率をさらに向上させるために、辞書の改
良を行なうことができる。辞書の改良は、例えば、辞書
の作成後、辞書の作成に使用された入力文字パターンに
対して作成された辞書を用いて文字認識を行ない、誤認
識となった場合には当該入力文字パターンに対する特徴
ベクトルを新たに代表点として作成された辞書に追加登
録することによって行なわれる。Further, in order to further improve the recognition rate when the dictionary created as described above is used, the dictionary can be improved. To improve the dictionary, for example, after the dictionary is created, character recognition is performed using the created dictionary for the input character pattern used to create the dictionary, and if incorrect recognition is performed, the input character pattern This is performed by additionally registering the feature vector in a dictionary newly created as a representative point.

【００１５】[0015]

【実施例】次に、本発明の実施例について図面を参照し
て説明する。図１(a)は本発明の一実施例の手書き文字
認識辞書作成装置の構成を示すブロック図である。Embodiments of the present invention will now be described with reference to the drawings. FIG. 1 (a) is a block diagram showing the configuration of a handwritten character recognition dictionary creating apparatus according to an embodiment of the present invention.

【００１６】この手書き文字認識辞書作成装置は、各文
字ごとの代表点を格納する辞書２５を作成するものであ
って、書き手の手書き文字が入力文字パターンとして入
力する手書き文字入力表示装置１０と、入力した入力文
字パターンに対して前処理や直交関数展開などを施すこ
とによって特徴ベクトルを算出する特徴ベクトル算出部
１２と、各入力文字パターンごとに算出された特徴ベク
トルを格納する特徴ベクトル格納部２２と、算出された
特徴ベクトルと辞書２５に格納されている代表点とを比
較する比較部２３と、比較部２３での比較結果に応じて
辞書２５を更新する更新部２４とを備えている。特徴ベ
クトル算出部２１で算出された特徴ベクトルは、比較部
２３に直接送られるとともに、同時に特徴ベクトル格納
部２２にも送られるようになっており、比較部２３は、
特徴ベクトル算出部２１と特徴ベクトル格納部２２のい
ずれからも特徴ベクトルを受け取れるように構成されて
いる。This handwritten character recognition dictionary creating device creates a dictionary 25 for storing representative points for each character, and includes a handwritten character input / display device 10 for inputting handwritten characters of a writer as an input character pattern, A feature vector calculation unit 12 that calculates a feature vector by performing preprocessing or orthogonal function expansion on an input input character pattern, and a feature vector storage unit 22 that stores the feature vector calculated for each input character pattern. And a comparing unit 23 that compares the calculated feature vector with the representative point stored in the dictionary 25, and an updating unit 24 that updates the dictionary 25 according to the comparison result of the comparing unit 23. The feature vector calculated by the feature vector calculation unit 21 is directly sent to the comparison unit 23, and at the same time, sent to the feature vector storage unit 22.
The feature vector calculation unit 21 and the feature vector storage unit 22 are both configured to receive the feature vector.

【００１７】手書き文字入力表示装置１０は、例えば、
液晶表示パネルと入力用タブレットを入力エリア１４と
して一体化させたものであり、ペン１１を用いて文字を
書くように押圧することによって、入力文字パターン１
３の入力が行なわれ、同時にその入力文字パターン１３
が入力箇所にそのまま表示されるようになっている。ま
た、入力エリア１４の図示右下の隅には、学習文字１３
が表示されている。ここで学習文字とは、辞書２５の学
習の対象となる文字のことである。図示した例では、ひ
らがなの「あ」が学習文字であって、これに対応し、書
き手は「あ」を入力文字パターンとして入力している。The handwritten character input / display device 10 is, for example,
The liquid crystal display panel and the input tablet are integrated as the input area 14, and the input character pattern 1 is pressed by pressing the pen 11 to write characters.
3 is input, and at the same time, the input character pattern 13
Is displayed as it is in the input area. Further, in the lower right corner of the input area 14 in the figure, the learning character 13
Is displayed. Here, the learning character is a character to be learned by the dictionary 25. In the illustrated example, the hiragana "a" is the learning character, and corresponding to this, the writer inputs "a" as the input character pattern.

【００１８】比較部２３は、特徴ベクトル算出部２１あ
るいは特徴ベクトル格納部２３から特徴ベクトルを受け
取り、その特徴ベクトルに対応する文字について辞書２
５を検索し、辞書２５内に格納されている当該文字の代
表点の中で受け取った特徴ベクトルに対して最も近い代
表点を選択し、選択された代表点と受け取った特徴ベク
トルとを比較し、両者が相互に近いかそうでないかを判
定する。代表点も特徴ベクトルもｎ次元空間（特徴空
間）内のベクトルとして表わされるから、判定の基準と
しては、(1)両者の差ベクトルのユークリッドノルムが
与えられたしきい値ｋ₁以下であるかどうかや、(2)差ベ
クトルのｎ個の成分（各次元の成分）それぞれの絶対値
を求めこの絶対値のうちの最大のものの値が与えられた
しきい値ｋ ₂以下であるかどうかを用いることができ
る。差ベクトルのユークリッドノルムがしきい値ｋ₁と
いう判定基準は、ｎ次元の特徴空間において、選択され
た代表点を中心とする半径ｋ₁の超球の表面あるいは内
部に特徴ベクトルがあるかどうかと同値である。また、
差ベクトルの成分の絶対値の最大がしきい値ｋ₂以下と
いう条件は、特徴空間において代表点を中心とし各辺が
それぞれ座標軸に平行であって１辺の長さが２・ｋ₂であ
る超立方体の表面または内部に特徴ベクトルがあるかど
うかと同値である。しきい値ｋ₁,ｋ₂は、一定の値に固
定しておいてもよいし、数字や漢字、英字などの区別に
応じて変化させるようにしてもよいし、あるいは、作成
された辞書による文字認識の認識率を評価して適応的に
定めるようにしてもよい。The comparison unit 23 includes the feature vector calculation unit 21.
It receives a feature vector from the feature vector storage unit 23.
Dictionary 2 for the character corresponding to the feature vector
5 is searched, and the substitution of the character stored in the dictionary 25
The closest value to the feature vector received among the points
Select surface points, select selected representative points and received feature vectors
Compare with Toll to see if they are close to each other or not.
Set. Both the representative point and the feature vector are in n-dimensional space (feature sky
It is expressed as a vector within
Then, (1) the Euclidean norm of the difference vector between the two is
Given threshold k₁Whether or not it is below (2)
Absolute value of each of n components (each dimension component) of the cuttle
And given the value of the largest of these absolute values
Threshold k ₂Can be used if
It Euclidean norm of difference vector is threshold k₁When
The criterion is said to be selected in the n-dimensional feature space.
Radius k around the representative point₁On or within the hypersphere of
It is the same as whether there is a feature vector in the part. Also,
The maximum absolute value of the components of the difference vector is the threshold value k₂With
The condition is that each side is centered on the representative point in the feature space.
Each is parallel to the coordinate axis and the length of one side is 2 · k.₂And
Whether there is a feature vector on the surface or inside the hypercube
It is the same value as Kauka. Threshold k₁, k₂Is fixed to a certain value
You can specify it to distinguish numbers, kanji, English letters, etc.
May be changed according to
Adaptively by evaluating the recognition rate of character recognition by the created dictionary
You may decide.

【００１９】このほか、(3)選択された代表点を中心と
する特徴空間内の超直方体の表面または内部に特徴ベク
トルが存在するかどうか、あるいは(4)代表点を中心と
する特徴空間内の超楕円体の表面または内部に特徴ベク
トルが存在するかどうかを判定基準とすることもでき
る。これらは、上記(1),(2)の条件において、特徴ベク
トルの各次元の成分にそれぞれ重みをつけたものに相当
する。Besides, (3) whether or not a feature vector exists on the surface or inside of the hypercube in the feature space centered on the selected representative point, or (4) in the feature space centered on the representative point. It is also possible to determine whether or not a feature vector exists on the surface or inside of the hyperellipsoid of. These correspond to the weighted components of each dimension of the feature vector under the conditions (1) and (2).

【００２０】辞書２５の構成例が図１(b)に示されてい
る。辞書２５では、登録される代表点ごとに、その代表
点に対応する文字の文字コードと、代表点のベクトル値
と、その代表点が何個の特徴ベクトルに基づいて定めら
れているかを表わす回数とが、格納されている。１個の
文字に対し複数の代表点が存在し得るようになってい
る。図に示した例では、文字コードとしてＪＩＳの区点
コードが使用されており、文字「あ」（区点コード：０
４０２）に対して２つの代表点が登録されている。An example of the structure of the dictionary 25 is shown in FIG. In the dictionary 25, for each representative point to be registered, the character code of the character corresponding to the representative point, the vector value of the representative point, and the number of times the representative point is determined based on the number of feature vectors And are stored. A plurality of representative points can exist for one character. In the example shown in the figure, the JIS kuten code is used as the character code, and the character "A" (kuten code: 0
402), two representative points are registered.

【００２１】更新部２４は、比較部２３での判定結果に
基づいて辞書２５を更新するものであり、比較部２３に
入力した特徴ベクトルが辞書２５内から選択された代表
点から「近い」と判定された場合には入力した特徴ベク
トルに近付くようにその代表点の値を修正し、「近くな
い」と判定された場合には、その入力した特徴ベクトル
を対応する文字に関する代表点として辞書２５に追加登
録する。「近い」と判定された場合の代表点の修正方法
としては、例えば、辞書２５を検索してその代表点が何
個の特徴ベクトルに基づいて定められているかを調べ、
ｍ個の特徴ベクトルに基づいている場合（その代表点の
「回数」欄がｍとなっている場合）には、代表点とその
入力した特徴ベクトルとを１：ｍに内分する点にその代
表点を移動させ、移動後の値に辞書２５を書き換える。
そして、その代表点の「回数」の欄の値に１を加算する
方法がある。このほか、常に一定の値だけ近付ける方法
などもある。The updating unit 24 updates the dictionary 25 based on the judgment result of the comparing unit 23, and the feature vector input to the comparing unit 23 is "close" to the representative point selected from the dictionary 25. If it is determined, the value of the representative point is corrected so as to approach the input feature vector, and if it is determined that it is "not close", the input feature vector is set as the representative point for the corresponding character in the dictionary 25. Add to. As a method of correcting the representative point when it is determined to be “close”, for example, the dictionary 25 is searched to check how many feature vectors the representative point is determined based on,
In the case of being based on m feature vectors (when the "number of times" column of the representative point is m), the representative point and the input feature vector are divided into 1: m points. The representative point is moved, and the dictionary 25 is rewritten with the moved value.
Then, there is a method of adding 1 to the value in the "number of times" column of the representative point. In addition to this, there is also a method of always approaching a fixed value.

【００２２】一方、比較部２３において「近くない」と
判定された場合には、更新部２４は、その入力した特徴
ベクトルを代表点として辞書２５に登録するとともに、
この新規に登録した代表点の「回数」欄に１を設定す
る。なお。入力した特徴ベクトルに対応する文字の代表
点が１個も辞書２５に登録されていない場合には、「回
数」を１として、その特徴ベクトルを当該文字に対する
代表点として登録する。On the other hand, when the comparison unit 23 determines that the feature vector is not “near”, the updating unit 24 registers the input feature vector in the dictionary 25 as a representative point and
1 is set in the "number of times" column of the newly registered representative point. Incidentally. If no representative point of the character corresponding to the input feature vector is registered in the dictionary 25, the number of times is set to 1 and the feature vector is registered as the representative point for the character.

【００２３】次に、この手書き文字認識辞書作成装置を
用いた辞書作成の手順について、図２(a)のフローチャ
ートを用いて説明する。Next, a procedure for creating a dictionary using this handwritten character recognition dictionary creating apparatus will be described with reference to the flowchart of FIG.

【００２４】手書き文字入力表示装置１０を介して学習
文字に対応する入力文字パターンが入力すると（ステッ
プ１０１）、特徴ベクトル算出部２１によってその入力
文字パターンの特徴ベクトルが算出される（ステップ１
０２）。算出された特徴ベクトルは、比較部２３に送ら
れるとともに、特徴ベクトル格納部２２にも格納され
る。入力文字パターンに対応する文字が、その登録対象
文字として最初のもの、すなわち辞書２５に未だ１個も
登録されていない文字であるかどうかが判定され（ステ
ップ１０３）、未登録の文字の場合にはステップ１０７
に移行する。この判定は、比較部２３が辞書２５を検索
し、入力文字パターンに対応する学習文字についての代
表点の有無を調べることによって行なわれる。When an input character pattern corresponding to a learned character is input via the handwritten character input / display device 10 (step 101), the feature vector calculation unit 21 calculates the feature vector of the input character pattern (step 1).
02). The calculated feature vector is sent to the comparison unit 23 and also stored in the feature vector storage unit 22. It is determined whether or not the character corresponding to the input character pattern is the first character to be registered, that is, a character that has not been registered in the dictionary 25 yet (step 103). Is step 107
Move to. This determination is performed by the comparison unit 23 searching the dictionary 25 and checking whether or not there is a representative point for the learning character corresponding to the input character pattern.

【００２５】ステップ１０３において最初のものでない
と判定された場合すなわち既に登録されている文字であ
る場合、比較部２３は、算出された特徴ベクトルに対応
する文字の代表点の中でその算出された特徴ベクトルに
最も近い代表点を辞書２５から探索する（ステップ１０
４）。そして、その探索された代表点と算出された特徴
ベクトルとの「近さ」を求めて基準値と比較し、両者が
「近い」か「近くない」かを判定する（ステップ１０
５）。この判定の基準は、上述した通りである。If it is determined in step 103 that the character is not the first one, that is, if it is a character that has already been registered, the comparing unit 23 calculates the representative point of the character corresponding to the calculated feature vector. The representative point closest to the feature vector is searched from the dictionary 25 (step 10).
4). Then, the "closeness" between the searched representative point and the calculated feature vector is obtained and compared with the reference value to determine whether they are "close" or "not close" (step 10).
5). The criteria for this determination are as described above.

【００２６】「近さ」≦基準値である場合すなわち「近
い」と判定された場合には、更新部２４は、算出された
特徴ベクトルに近付くように探索された代表点の値を上
述したように修正し（ステップ１０６）、処理を終了す
る。これに対し、「近さ」＞基準値である場合すなわち
「近くない」と判定された場合や、ステップ１０３にお
いて未登録の文字であると判定された場合には、更新部
２４は、算出された特徴ベクトルをその登録対象文字の
代表点として辞書２５に新たに登録し（ステップ１０
７）、処理を終了する。When "closeness" ≤reference value, that is, when it is determined that the value is "close", the updating unit 24 sets the value of the representative point searched so as to approach the calculated feature vector as described above. (Step 106), and the process ends. On the other hand, when “closeness”> reference value, that is, when it is determined that “not close” or when it is determined that the character is an unregistered character in step 103, the updating unit 24 calculates. The feature vector is newly registered in the dictionary 25 as a representative point of the character to be registered (step 10).
7), the process ends.

【００２７】手書き文字入力表示装置に順次入力する入
力文字パターンに対し、ここで述べたステップ１０１〜
ステップ１０７の処理を次々と実行することにより、辞
書２５の学習が進行し、一連の入力文字パターンに基づ
く辞書が完成する。With respect to the input character patterns sequentially input to the handwritten character input / display device, steps 101 to
By successively executing the processing of step 107, learning of the dictionary 25 proceeds, and a dictionary based on a series of input character patterns is completed.

【００２８】ところで、上述の辞書作成過程において、
比較部２３における「近い」、「近くない」の判定結果
は、その判定を行なうまでに修正を受けた代表点に依存
する。したがって、同一の入力文字パターンの組を使用
した場合であっても、入力順によって、作成される辞書
の内容が異なることになる。すなわち、入力文字パター
ンの組に対して最適化されていない辞書が作成されるこ
とになる。また、よく似た文字間で正しく文字認識を行
なうためには、特徴空間内でのこれら文字に対応する領
域間の境界の形状が正しく辞書に反映されていなければ
ならない。しかしながら、上述したように作成された辞
書の場合、特徴空間内での境界の形状が十分には反映さ
れていないことがある。そこで、図２(b)のフローチャ
ートに示されるように、特徴ベクトル格納部２２に格納
しておいた特徴ベクトルを用いて辞書２５の改良を行な
うことが望ましい。以下、辞書２５の改良の手順につい
て説明する。By the way, in the above-mentioned dictionary creation process,
The determination result of “close” or “not close” in the comparison unit 23 depends on the representative point that has been modified before the determination. Therefore, even when the same set of input character patterns is used, the contents of the created dictionary differ depending on the input order. That is, a dictionary that is not optimized for the set of input character patterns is created. Further, in order to correctly recognize characters between similar characters, the shape of the boundary between the regions corresponding to these characters in the feature space must be correctly reflected in the dictionary. However, in the case of the dictionary created as described above, the shape of the boundary in the feature space may not be sufficiently reflected. Therefore, as shown in the flowchart of FIG. 2B, it is desirable to improve the dictionary 25 by using the feature vector stored in the feature vector storage unit 22. The procedure for improving the dictionary 25 will be described below.

【００２９】作成された辞書２５を使用して、特徴ベク
トル格納部２５に格納されている全ての特徴ベクトルす
なわち全ての入力文字パターンのそれぞれに対して、文
字認識を実行する。ある入力文字パターンについて、文
字認識の結果、その入力文字パターンが本来表わしてい
る文字以外の文字として認識し、誤認識を生じた場合に
は、その誤認識を生じた特徴ベクトルをその本来表わし
ている文字の代表点として登録する（ステップ１１
１）。終了条件を満足するまでこのステップ１１１を繰
り返すことにより（ステップ１１２）、辞書２５の改良
が逐次進行する。終了条件としては、ステップ１１１を
所定の回数実行したかどうか、あるいは誤認識の発生頻
度が所定の値を下回るようになったかどうかなどが、例
示される。Using the created dictionary 25, character recognition is executed for all feature vectors stored in the feature vector storage unit 25, that is, for all input character patterns. As a result of character recognition, an input character pattern is recognized as a character other than the character that the input character pattern originally represents, and if an erroneous recognition occurs, the feature vector that caused the erroneous recognition is originally represented. Registered as a representative point of existing characters (step 11)
1). By repeating this step 111 until the end condition is satisfied (step 112), the dictionary 25 is successively improved. Examples of the termination condition include whether step 111 has been executed a predetermined number of times, or whether the frequency of occurrence of erroneous recognition has fallen below a predetermined value.

【００３０】以下、具体例を図示しながら、本実施例を
さらに説明する。なお、特徴空間を図示する都合上、特
徴空間は２次元であるものとする。実際には、特徴空間
はより高次元の空間である。The present embodiment will be further described below with reference to specific examples. Note that, for convenience of illustration of the feature space, the feature space is assumed to be two-dimensional. In reality, the feature space is a higher dimensional space.

【００３１】図３は、比較部２３における判定基準が、
代表点を中心とする半径ｋ₁の超球（図示の例では円）
の表面（円周）または内部に特徴ベクトルが存在する場
合に、「近い」とする場合を説明するものである。図３
(a)に示されるように、代表点３２を中心とする半径ｋ₁
の円（図示実線の円）内に代表点３２が位置する場合、
比較部２３によって「近い」と判断され、その結果、図
３(b)に示されるように、特徴ベクトル３３に近付くよ
うに代表点が移動し、代表点３２'となる。移動前の代
表点の位置が"＋"印で、移動前の円が一点鎖線で示され
ている。一方、図３(c)に示されるように、代表点３２
を中心とする半径ｋ₁の円の外部に特徴ベクトル３３が
存在する場合には、図３(d)に示されるように、その特
徴ベクトル３３が新たな代表点３４として追加される。
もとの代表点３２はそのまま保存されている。In FIG. 3, the judgment standard in the comparison unit 23 is
Hypersphere with radius k ₁ centered on a representative point (circle in the example shown)
In the case where a feature vector exists on the surface (circle) or inside of, the case will be described as "close". Figure 3
As shown in (a), a radius k ₁ centered on the representative point 32
When the representative point 32 is located within the circle (the circle indicated by the solid line in the figure),
The comparison unit 23 determines that the distance is “close”, and as a result, as shown in FIG. 3B, the representative point moves so as to approach the feature vector 33 and becomes the representative point 32 ′. The position of the representative point before the movement is indicated by "+", and the circle before the movement is indicated by a chain line. On the other hand, as shown in FIG.
When the feature vector 33 exists outside the circle with the radius k ₁ centered at, the feature vector 33 is added as a new representative point 34, as shown in FIG.
The original representative point 32 is preserved as it is.

【００３２】図４は、比較部２３における判定基準が、
代表点を中心とする１辺が２・ｋ₂の超直方体（図示の例
では正方形）の表面（辺）または内部に特徴ベクトルが
存在する場合に、「近い」とする場合を説明するもので
ある。図４(a)に示されるように、代表点４２を中心と
する上述の正方形（図示実線の正方形）内に代表点４２
が位置する場合、比較部２３によって「近い」と判断さ
れ、その結果、図４(b)に示されるように、特徴ベクト
ル４３に近付くように代表点が移動し、代表点４２'と
なる。移動前の代表点の位置が"＋"印で、移動前の正方
形が一点鎖線で示されている。一方、図４(c)に示され
るように、代表点４２を中心とする１辺が２・ｋ₂の正方
形の外部に特徴ベクトル４３が存在する場合には、図４
(d)に示されるように、その特徴ベクトル４３が新たな
代表点４４として追加される。もとの代表点４２はその
まま保存されている。In FIG. 4, the judgment standard in the comparison unit 23 is
A case where a feature vector exists on the surface (side) or inside of a hypercube (square in the example shown) having one side of 2 · k ₂ around the representative point will be described as “close”. is there. As shown in FIG. 4A, the representative point 42 is placed in the above-described square (the solid line square in the figure) centered on the representative point 42.
4 is located, the comparison unit 23 determines that it is “close”, and as a result, as shown in FIG. 4B, the representative point moves so as to approach the feature vector 43 and becomes the representative point 42 ′. The position of the representative point before the movement is indicated by "+", and the square before the movement is indicated by a chain line. On the other hand, as shown in FIG. 4C, when the feature vector 43 exists outside the square centered on the representative point 42 and having one side of 2 · k ₂ ,
As shown in (d), the feature vector 43 is added as a new representative point 44. The original representative point 42 is preserved as it is.

【００３３】図５(a)〜(d)は辞書の改良過程を説明する
図である。ここでは、相互によく似た字として、数字の
「２」、漢字の「乙」、英字の「Ｚ」を取り上げる。ま
た、特徴空間は２次元であるとして説明を行なう。5 (a) to 5 (d) are diagrams for explaining the process of improving the dictionary. Here, the numbers "2", the kanji "Otsu", and the alphabet "Z" are taken up as characters that are very similar to each other. Further, the description will be given assuming that the feature space is two-dimensional.

【００３４】辞書２５を作成したところ、特徴空間５１
内において、「２」に対する代表点５２と「乙」に対す
る代表点５３と「Ｚ」に対する代表点５４が図５(a)に
示されるように配置されたとする。特徴ベクトルによる
文字認識では、入力文字パターンの特徴ベクトルに最も
近い代表点に応じて文字認識が行なわれるから、特徴空
間５１内において各代表点間の垂直２等分線が、各文字
に対応する領域の境界となる。図５(a)においては、Ｙ
字型の境界５５ということになる。また、各文字の領域
がそれぞれ「２」,「乙」,「Ｚ」で示されている。When the dictionary 25 is created, the feature space 51
In the figure, it is assumed that the representative point 52 for "2", the representative point 53 for "Otsu" and the representative point 54 for "Z" are arranged as shown in FIG. 5 (a). In the character recognition using the feature vector, the character recognition is performed according to the representative point closest to the feature vector of the input character pattern. Therefore, the vertical bisector between the representative points in the feature space 51 corresponds to each character. It becomes the boundary of the area. In FIG. 5 (a), Y
This is the border 55 of the character shape. The area of each character is indicated by "2", "Otsu", and "Z", respectively.

【００３５】ところで、文字「２」の領域内の特徴ベク
トル５６（図示×印で示す）が、本当は「Ｚ」に対応し
ていたとする。したがって、この辞書を用いてこの特徴
ベクトル５６の文字認識を行なうと、誤認識となる。こ
れにより、図５(b)に示されるように、この特徴ベクト
ル５６は、「Ｚ」についての２番目の代表点５４ａとな
る。これにより、少なくとも、「Ｚ」と「２」との境界
は文字「２」の代表点５２側にずれて、境界５５ａとな
る。ところが、今度はこの境界５５ａよりは「Ｚ」側に
ある特徴ベクトル５７が、本当は文字「２」に対応して
いたとする。すると、同様にして、図５(c)に示される
ように、この特徴ベクトル５７が、「２」についての２
番目の代表点５２ａとなり、「２」と「Ｚ」との境界
は、「Ｚ」の代表点５４側にずれて境界５５ｂとなる。
以下同様にして、全ての入力文字パターンについて誤認
識が生ずるかどうかを確かめ、誤認識となった場合には
その特徴ベクトルを新たに代表点として加えることによ
り、図５(d)において黒丸で示すように代表点が配置さ
れ、境界５５の形状が、これらよく似た文字間で正しく
文字認識を行なうためにふさわしいものとなる。By the way, it is assumed that the feature vector 56 (indicated by a mark x in the figure) in the area of the character "2" actually corresponds to "Z". Therefore, if the character recognition of this feature vector 56 is performed using this dictionary, it will be misrecognized. As a result, as shown in FIG. 5B, the feature vector 56 becomes the second representative point 54a for "Z". As a result, at least the boundary between "Z" and "2" is shifted toward the representative point 52 side of the character "2" and becomes the boundary 55a. However, it is assumed that the feature vector 57 on the "Z" side of the boundary 55a actually corresponds to the character "2". Then, similarly, as shown in FIG. 5 (c), this feature vector 57 is 2 for "2".
The second representative point 52a is formed, and the boundary between "2" and "Z" is shifted to the representative point 54 side of "Z" to become the boundary 55b.
Similarly, it is confirmed whether or not erroneous recognition occurs with respect to all input character patterns, and in the case of erroneous recognition, the feature vector is newly added as a representative point, which is indicated by a black circle in FIG. 5 (d). The representative points are arranged as described above, and the shape of the boundary 55 is suitable for correctly performing character recognition between these similar characters.

【００３６】以上本発明の実施例について、手書き文字
認識用の辞書の作成を例に挙げて説明したが、印刷文字
などの認識用の辞書の作成にも、本発明は応用すること
ができる。Although the embodiments of the present invention have been described by taking the creation of a dictionary for recognition of handwritten characters as an example, the present invention can be applied to the creation of a dictionary for recognition of print characters and the like.

【００３７】[0037]

【発明の効果】以上説明したように本発明は、辞書に既
に登録されている代表点と新たな入力文字パターンの特
徴ベクトルとを比較し、両者が「近い」場合には、新た
な入力文字パターンの特徴ベクトルに近付くように既に
登録されている代表点を修正し、両者が「近くない」場
合には、新たな入力文字パターンの特徴ベクトルを新た
な代表点として追加登録することにより、ユーザによる
追加登録が可能であって、辞書サイズを大きくすること
なく、かつ書き手や書き方によるばらつきを十分に表現
できる辞書を作成することができるという効果がある。As described above, according to the present invention, the representative points already registered in the dictionary are compared with the feature vector of the new input character pattern. If the representative points that have already been registered are corrected so that they approach the feature vector of the pattern, and if they are not “near”, the feature vector of the new input character pattern is additionally registered as a new representative point. It is possible to create a dictionary that can be additionally registered, and that can sufficiently express variations due to writers and writing methods without increasing the dictionary size.

【００３８】また、辞書の作成後、辞書の作成に使用さ
れた入力文字パターンに対して作成された辞書を用いて
文字認識を行ない、誤認識となった場合には当該入力文
字パターンに対する特徴ベクトルを新たに代表点として
作成された辞書に追加登録することにより、辞書の改良
が行なわれ、この辞書を用いて文字認識を行なった場合
の認識率をさらに向上させることができる。After the dictionary is created, character recognition is performed using the dictionary created for the input character pattern used to create the dictionary, and if misrecognition occurs, a feature vector for the input character pattern is obtained. Is additionally registered in the dictionary newly created as the representative point, the dictionary is improved, and the recognition rate when character recognition is performed using this dictionary can be further improved.

[Brief description of drawings]

【図１】(a)は本発明の一実施例の手書き文字認識辞書
作成装置の構成を示すブロック図、(b)は辞書の構成の
一例を示す図である。FIG. 1A is a block diagram showing a configuration of a handwritten character recognition dictionary creating apparatus according to an embodiment of the present invention, and FIG. 1B is a diagram showing an example of a configuration of a dictionary.

【図２】(a),(b)は辞書作成の過程を説明するフローチ
ャートである。2 (a) and 2 (b) are flowcharts for explaining a process of creating a dictionary.

【図３】(a)〜(d)は代表点の更新の実例を説明する図で
ある。3A to 3D are diagrams illustrating an example of updating a representative point.

【図４】(a)〜(d)は代表点の更新の実例を説明する図で
ある。FIG. 4A to FIG. 4D are diagrams illustrating an example of updating representative points.

【図５】(a)〜(d)は辞書の改良の実例を説明する図であ
る。5 (a) to 5 (d) are diagrams for explaining an actual example of dictionary improvement.

[Explanation of symbols]

１０手書き文字入力表示装置１１ペン１２学習文字１３入力文字パターン１４入力エリア２１特徴ベクトル算出部２２特徴ベクトル格納部２３比較部２４更新部２５辞書３１,４１,５１特徴空間３２,３２',３４,４２,４２',４４,５２〜５４,５２ａ,
５４ａ代表点３３,５６,５７特徴ベクトル５５,５５ａ,５５ｂ境界１０１〜１０７,１１１,１１２ステップ10 handwritten character input display device 11 pen 12 learning character 13 input character pattern 14 input area 21 feature vector calculation unit 22 feature vector storage unit 23 comparison unit 24 update unit 25 dictionary 31,41,51 feature space 32,32 ', 34, 42, 42 ', 44, 52 to 54, 52a,
54a Representative point 33,56,57 Feature vector 55,55a, 55b Boundary 101-107,111,112 steps

Claims

[Claims]

1. A character recognition method for calculating a feature vector for an input input character pattern, identifying a character corresponding to the input character pattern according to the feature vector, and recognizing the input character pattern. , A method of creating a dictionary that holds, for each character, a representative feature vector corresponding to the character as a representative point, wherein when a new input character pattern is requested to be registered in the dictionary, A different input character pattern to calculate a feature vector for the new input character pattern, and searching the dictionary to determine whether the feature vector is close to the representative point of the character corresponding to the feature vector. If it is close, the representative point of the corresponding character is corrected so as to come close to the feature vector. Handwriting recognition dictionary generating method and a second step of adding to the dictionary vectors as the representative point of the corresponding character.

2. The handwritten character according to claim 1, wherein the determination of the closeness is performed by comparing a Euclidean norm of a difference vector between the representative point and the feature vector with a given threshold value. How to create a recognition dictionary.

3. The determination of the closeness is performed by comparing a maximum value of absolute values of components of each dimension of a difference vector between the representative point and the feature vector with a given threshold value. The method for creating a handwritten character recognition dictionary according to claim 1, which is performed by.

4. The method according to claim 1 is executed to create a dictionary, and then the created dictionary is used for the input character pattern used to create the dictionary. A handwritten character recognition dictionary creating method in which character recognition is performed as a result, and in the case of erroneous recognition, a feature vector for the input character pattern is additionally registered in the dictionary as a new representative point.

5. A character recognition method for calculating a feature vector for an input input character pattern, identifying a character corresponding to the input character pattern according to the feature vector, and recognizing the input character pattern. A device for creating a dictionary that holds, for each character, a representative feature vector corresponding to the character as a representative point, wherein a new input character pattern is input, and the feature vector for the new input character pattern is set as a feature vector. Calculation means, determination means for searching the dictionary to determine whether the feature vector is close to the representative point of the character corresponding to the feature vector, and if the result of the determination by the determination means is close to the feature, The representative point of the corresponding character is corrected so as to approach the vector, and in other cases, the feature vector of the corresponding character is changed. A handwriting character recognition dictionary creation device having update means for adding to the dictionary as a representative point.

6. The handwritten character recognition dictionary according to claim 5, wherein the determination means makes a determination by comparing a Euclidean norm of a difference vector between the representative point and the feature vector with a given threshold value. Creation device.

7. The determination means compares the maximum value of the absolute values of the components of each dimension of the difference vector between the representative point and the feature vector with a given threshold value. The handwritten character recognition dictionary creating device according to claim 6.