JP3466903B2

JP3466903B2 - How to create a multi-template dictionary for character recognition

Info

Publication number: JP3466903B2
Application number: JP02247798A
Authority: JP
Inventors: 浩司黒川; 克仁藤本
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1998-02-04
Filing date: 1998-02-04
Publication date: 2003-11-17
Anticipated expiration: 2018-02-04
Also published as: JPH11219410A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の実施の形態】本発明は文字認識エンジン用のマ
ルチテンプレート辞書作成方法に関する。文字認識にお
いてマルチフォントの活字や手書き文字を扱う場合，認
識率が悪くなるため，それに対応する一つの方法として
字種毎に複数の辞書を設ける方法（マルチテンプレート
法）がある。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a method for creating a multi-template dictionary for a character recognition engine. When dealing with multi-font typed characters and handwritten characters in character recognition, the recognition rate deteriorates. Therefore, as one method to deal with this, there is a method of providing multiple dictionaries for each character type (multi-template method).

【０００２】そのようなマルチテンプレート辞書を時間
を要することなく作成することが望まれている。It is desired to create such a multi-template dictionary without taking time.

【０００３】[0003]

【従来の技術】文字認識の技術について，辞書のマルチ
テンプレート化に関する提案が論文『カテゴリー間分布
を考慮した文字認識用マルチテンプレート辞書の構成
法』（電子情報通信学会論文誌−II Vol.J79-D-II No.9
pp.1525-1533 1996年９月）に記載されている。その内
容について図６乃至図８を用いて以下に概説する。2. Description of the Related Art Regarding the technology of character recognition, a proposal for creating a multi-template for a dictionary was published in a paper "Construction of Multi-Template Dictionary for Character Recognition Considering Distribution between Categories" (IEICE Transactions-II Vol.J79- D-II No.9
pp.1525-1533 September 1996). The contents will be outlined below with reference to FIGS. 6 to 8.

【０００４】カテゴリー（文字種）間分布を調べるた
め，各字種間の特徴量の重心の集合を主成分分析し，重
心ベクトル及び個々のサンプルパターンをプロットして
分布形状を確認する。この場合，横軸が第１主成分，縦
軸が第２主成分であり，方向線素特徴量を用いた場合の
文字特徴量の分布が広がっている。In order to investigate the distribution between categories (character types), principal component analysis is performed on the set of centroids of feature amounts between each character type, and the centroid vector and individual sample patterns are plotted to confirm the distribution shape. In this case, the horizontal axis is the first principal component and the vertical axis is the second principal component, and the distribution of the character feature amount when the directional line element feature amount is used is wide.

【０００５】マルチテンプレート辞書作成の基本技術と
して，文字パターンの分布形状を考慮した字種適応クラ
スタリング法を用いる。図６は分割する条件の説明図で
あり，図６のＡ．は文字種ｃ１（例えば，ひらがなの
「あ」）とｃ２（例えば，ひらがなの「お」）について
の，誤認識が生じる典型例を示し，Ａ．の２個の楕円
ｊ，ｋはそれぞれ文字ｃ１と文字ｃ２のクラスタ（テン
プレート）であり，クラスタｊの重心はｍ^j，クラスタ
ｋの重心はｍ ^kである。文字ｃ２のサンプルパターンの
一つである点ｘを注目すると，この点ｘは本来は文字ｃ
２に属するが，文字ｃ１の重心との距離ｄ_jは文字ｃ２
の重心までの距離ｄ_kより短く，点ｘのパターンを文字
ｃ１と誤認識する。このように，誤認識の可能性のある
クラスタを含むカテゴリーを分割してクラスタ数を増や
す。Basic techniques for creating a multi-template dictionary and
Then, the character type adaptive class considering the distribution pattern of the character pattern
The Stirling method is used. FIG. 6 is an explanatory diagram of the conditions for division.
Yes, in FIG. Is the character type c1 (for example, hiragana
"A") and c2 (for example, hiragana "o")
The following shows a typical example in which misrecognition occurs. Two ellipses
j and k are clusters (ten) of the characters c1 and c2, respectively.
Plate) and the center of gravity of cluster j is m^j,cluster
the center of gravity of k is m ^kIs. Of the sample pattern of the character c2
Focusing on one point x, this point x is originally the character c.
2, but the distance d from the center of gravity of the character c1_jIs the character c2
Distance to the center of gravity of_kShorter pattern of points x
It is mistakenly recognized as c1. In this way, there is a possibility of misrecognition
Increase the number of clusters by dividing the category containing clusters
You

【０００６】カテゴリーを分割する条件は，クラスタｋ
の重心ｍ^k，第ｉ主成分の固有値をλ_i ^k，固有ベル
（単位ベクトル）をα_i ^kとする。固有値は主成分軸上
での分散を表すから，適当な定数αを用いて，次の式
(1),(2) の２点でこの特徴領域の第ｉ主成分軸上の両端
を表せる。The condition for dividing a category is a cluster k.
The center of gravity m ^k , the eigenvalue of the i-th principal component is λ _i ^k , and the eigenbell (unit vector) is α _i ^k . Since the eigenvalue represents the variance on the principal component axis, use the appropriate constant α
Two points (1) and (2) can represent both ends of the feature region on the i-th principal component axis.

【０００７】[0007]

【数１】 [Equation 1]

【０００８】ここで，α＝３．３とした。これは，軸上
の分布が正規分布である場合に「クラスタに属するパタ
ーンの９９．９％が含まれる値である。第Ｌ主成分軸ま
で考慮することにすれば，あるｉ（１≦ｉ≦Ｌ）に対し
て次の式(3),(4) が成り立つ時，クラスタｋの第ｉ主成
分軸を，「誤認識の可能性のある軸」と定義し，クラス
タｋを含むカテゴリーを分割する必要があるとみなす。
ここでのＥはユークリッド距離である。Here, α = 3.3. This is a value that "99.9% of the patterns belonging to the cluster are included when the distribution on the axis is a normal distribution. Considering up to the Lth principal component axis, some i (1≤i When the following equations (3) and (4) hold for ≤ L), the i-th principal component axis of the cluster k is defined as "the axis that may be misrecognized", and the category including the cluster k is defined as Consider it necessary to split.
E here is the Euclidean distance.

【０００９】[0009]

【数２】 [Equation 2]

【００１０】但し，ｃはクラスタｋの字種であり，ｊ
(c) はｃ以外の全ての字種のクラスタの集合で，Ｎ(c)
は字種ｃを構成するクラスタの数である。Ｍは定数であ
り，一つのカテゴリーが必要以上に細かく分割されるの
を防ぐために設定される。図６のＡ．の距離ｄ_jとｄ_k
は次の式(5),(6) である。Where c is the character type of cluster k, and j
(c) is the set of clusters of all character types except c, and N (c)
Is the number of clusters forming the character type c. M is a constant and is set to prevent one category from being divided more than necessary. A. of FIG. Distances d _j and d _k
Is the following equations (5) and (6).

【００１１】[0011]

【数３】 [Equation 3]

【００１２】図６のＡ．では点ｘは文字ｃ１に誤認識さ
れるが，この場合ｄ_j＜ｄ_kであり，上記式(3) が満た
されるから，クラスタｋは分割される。図６のＢ．のよ
うにｋが２つのクラスタ（テンプレート）ｋ₁とｋ₂に
分割されると，同じ点ｘからクラスタｋ₁の重心までの
距離の方が短くなり，正しく文字ｃ２に認識されるよう
になると考えられる。なお，ｋ１とｋ２は同じ文字ｃ２
（カテゴリー）を認識するものであり，例えば，明朝体
の「お」とゴシック体の「お」に対応する。Referring to FIG. In this case, the point x is erroneously recognized as the character c1, but in this case d _j <d _k and the above equation (3) is satisfied, so the cluster k is divided. B. of FIG. When k is divided into _two clusters (templates) k ₁ and k ₂ as in the above, the distance from the same point x to the center of gravity of the cluster k ₁ becomes shorter, and the character c2 is correctly recognized. Conceivable. Note that k1 and k2 are the same character c2
(Category) is recognized, and for example, it corresponds to Mincho type "O" and Gothic type "O".

【００１３】字種適応クラスタリング法のアルゴリズム
は次のように定める。まず，各種のサンプルパターンか
ら特徴量（ベクトル）を求め，各字種ごとの特徴量の集
合をそれぞれ一つのクラスタとみなす。各クラスタ毎に
式(3),(4) を満たす軸が存在するかどうかを調べる。そ
して，式(3) を満たす（ｋ，ｊ）のうち，第ｉ主成分の
固有値λ_i ^kの値が最も大きいものを探し（分割するク
ラスタを選択する基準としては，クラスタに含まれるサ
ンプル数が最も多いものを探し），そのクラスタを含む
字種のクラスタ数を１増加する（対象クラスタを２分割
するのではなく，カテゴリー全体を再分割する）。The algorithm of the character type adaptive clustering method is defined as follows. First, the feature quantity (vector) is obtained from various sample patterns, and the set of feature quantities for each character type is regarded as one cluster. Examine whether there is an axis that satisfies Eqs. (3) and (4) for each cluster. Then, of (k, j) satisfying the equation (3), the one having the largest eigenvalue λ _i ^k of the i-th principal component is searched (the criterion for selecting the cluster to be divided is the number of samples included in the cluster). , The number of clusters of the character type including that cluster is increased by 1 (the target cluster is not divided into two, but the entire category is divided again).

【００１４】ここで，λ_i ^kの値は誤読し易すさを表
し，これを誤読度と定義する。この操作を式(3),式(4)
を満たすクラスタがなくなるまで繰り返す。最終的に得
られた各クラスタごとに重心を求め, 辞書のテンプレー
トとする。分割の手法としてはＫ─平均法を用いた。ｋ
が字種ｃのクラスタであるとき，再分割のための初期ク
ラスタ中心は，ｍ^k±εα_i ^kと，ｋ以外のｃのクラス
タの重心（存在すれば）である（εは微小長さ0.001 で
ある）。Here, the value of λ _i ^k represents the ease of misreading, and this is defined as the misreading degree. This operation is expressed by Equation (3) and Equation (4).
Repeat until there are no more satisfying clusters. The centroid of each finally obtained cluster is calculated and used as a dictionary template. The K-means method was used as the dividing method. k
Is a cluster of character type c, the initial cluster center for subdivision is m ^k ± εα _i ^k and the center of gravity (if any) of a cluster of c other than k (ε is a minute length 0.001). Is).

【００１５】図７は初期クラスタ中心の取り方を示し，
図７のＡ．のクラスタｋ₁が第１主成分を考慮したとき
に分割すると判断された場合には，初期クラスタの中心
をクラスタｋ₁の重心からｋ₁の第１主成分の方向に±
εだけ移動した２点とし，全てのベクトルを，Ｋ─平均
法を用いてクラスタリングする。その結果はＢ．のよう
になる。以下，Ｂ．のクラスタｋ₂の第１主成分，Ｃ．
のクラスタｋ₂の第２主成分が上記判断法により選択さ
れたとものとすると，最終的にＤ．のようにクラスタ構
成となる。FIG. 7 shows how to take the initial cluster center,
A. of FIG. If it is determined that the cluster k ₁ of k ₁ is divided when the first principal component is considered, the center of the initial cluster is ± ± from the center of gravity of the cluster k ₁ toward the first principal component of k ₁ .
Two points moved by ε are set, and all vectors are clustered using the K-means method. The result is B. become that way. Hereinafter, B. The first principal component of the cluster k ₂ of C.
Assuming that the second principal component of the cluster k ₂ is selected by the above judgment method, finally D. It becomes a cluster configuration like.

【００１６】図８は従来の分割対象カテゴリーを推定す
る処理フローである。この処理フローは上記図６，図７
に示すような分割において実行される。最初にｉ＝０と
して（図８のＳ１），ｉを誤読カテゴリー（認識の対象
となる文字種）とし（同Ｓ２），次にｊ＝０とし（同Ｓ
３），ｊを被誤読カテゴリー（文字種ｉをｊと誤読する
場合のカテゴリーｊ）とする（同Ｓ４）。ここでｉをｊ
と誤読するか判別し（同Ｓ５），誤読する場合はｊに対
する誤読度がより高い場合はｉの誤読度を更新する（同
Ｓ６）。誤読しない場合及びステップＳ６の処理の後，
ｊを＋１し（図８のＳ７），ｊがカテゴリー数（認識対
象の文字種数）に達してないか判別し（同Ｓ８），達し
てない場合はＳ３に戻り次のｊについて同様の処理を行
う。達した場合は，ｉを＋１して（図７のＳ９），次の
カテゴリーに移行し，次にｉが認識対象のカテゴリー数
に達しないか判別し（図８のＳ１０），達しない場合は
Ｓ２に戻り，上記と同様の処理を実行する。達した場合
はおのおのの誤読カテゴリーに対する誤読度のうち最大
のものを持つカテゴリーを分割する（同Ｓ１１）。この
ように分割する対象となるカテゴリーを選択して分割を
行う。FIG. 8 is a processing flow for estimating a conventional division target category. This processing flow is as shown in FIGS.
It is executed in the division as shown in. First, i = 0 (S1 in FIG. 8), i is set as a misreading category (character type to be recognized) (S2), and then j = 0 (S1).
3) and j are misread categories (category j when the character type i is misread as j) (S4). Where i is j
If it is misread, the misreading degree of i is updated (step S6). If not misread and after the processing of step S6,
j is incremented by 1 (S7 in FIG. 8), it is determined whether or not j has reached the number of categories (the number of character types to be recognized) (at step S8), and if not, the process returns to S3 and the same process is performed for the next j. To do. If it has reached i, i is incremented by 1 (S9 in FIG. 7) and the process moves to the next category. Then, it is determined whether i reaches the number of categories to be recognized (S10 in FIG. 8). Returning to S2, the same processing as above is executed. When it reaches, the category having the highest degree of misreading for each misreading category is divided (at step S11). In this way, the category to be divided is selected and divided.

【００１７】上記図８により誤読カテゴリーチェックに
対してループが２重（ｉとｊについて）になっており，
実行時間はカテゴリー数をＮ（例えば，Ｎ＝３６００に
なる場合もある）とすると，ループの回数はＮの２乗の
オーダとなり，処理時間が長くなる。According to FIG. 8 above, the loop is doubled (for i and j) for the misread category check,
Assuming that the number of categories is N (for example, N = 3600 in some cases), the number of loops becomes the order of N squared and the processing time becomes long.

【００１８】図９は従来の誤読カテゴリーチェックの回
数の説明図であり，上記図８に示すように横軸を被誤読
カテゴリー，縦軸を誤読カテゴリーしすると，毎回ｉ×
ｊ回全てを再計算していた。FIG. 9 is an explanatory diagram of the number of times of the conventional misread category check. As shown in FIG. 8, when the horizontal axis is the misread category and the vertical axis is the misread category, i ×
Recalculated all j times.

【００１９】[0019]

【発明が解決しようとする課題】上記した従来の方式で
は，辞書作成にあたり単純に基本手法通りにテンプレー
トを分割することにより辞書ツールを作成すると，大容
量のデータを入力した時に入力データの増大により誤認
識チェックの実行時間が実用的限界を越え，事実上実行
不可能となる。In the conventional method described above, when a dictionary tool is created by simply dividing the template according to the basic method when creating a dictionary, the input data increases when a large amount of data is input. The execution time of the misrecognition check exceeds the practical limit, making it virtually impossible to execute.

【００２０】本発明は従来技術では不可能であった大容
量データを入力しても短い処理時間で精度の高いマルチ
テンプレート化辞書を作成することができる文字認識用
マルチテンプレート辞書作成方法を提供することを目的
とする。The present invention provides a method for creating a multi-template dictionary for character recognition, which is capable of creating a highly accurate multi-template dictionary in a short processing time even if a large amount of data is input, which was impossible with the prior art. The purpose is to

【００２１】[0021]

【課題を解決するための手段】図１は本発明の原理構成
を示す。図中，１は前回分割結果保持部，１ａは前回分
割カテゴリー保持手段，１ｂはカテゴリー対応の被誤読
カテゴリー保持手段，２は分割対象カテゴリーを被誤読
カテゴリーとする誤読カテゴリーを判定する第１の誤読
チェック部，２ａは誤読カテゴリーが１ｂに保持する被
誤読カテゴリーと同一の場合に起動して誤読のチェック
を行う特定誤読チェック部，３は被誤読カテゴリーを変
化させて分割対象のカテゴリーと誤読するかの判定を行
う第２の誤読チェック部，４は最大誤読度のカテゴリー
を検出する検出部，５は上記２，２ａ，３の各部で更新
された誤読度等の情報を保持する保持部である。FIG. 1 shows the principle configuration of the present invention. In the figure, 1 is a previous division result holding unit, 1a is a previous division category holding means, 1b is a category-corresponding misread category holding means, and 2 is a first misreading for determining a misreading category in which a division target category is a misreading category. Checking unit, 2a is a specific misreading checking unit that is activated when the misreading category is the same as the misreading category held in 1b, and checks misreading, and 3 changes the misreading category and misreads it as a division target category The second erroneous reading check unit for making the determination, 4 is a detecting unit for detecting the category of the maximum erroneous reading degree, and 5 is a holding unit for holding information such as the erroneous reading degree updated by the respective units 2, 2a and 3 above. .

【００２２】最初に従来法によりカテゴリー（テンプレ
ート辞書）分割の処理を行って，前回分割結果保持部１
の前回分割カテゴリー保持手段１ａに分割対象のカテゴ
リー（ｍｃとする）を保持し，更に各カテゴリーについ
て誤読度の高い被誤読カテゴリーを被誤読カテゴリー保
持手段１ｂに保持する。この後，第１の誤読チェック部
２が起動し，各誤読カテゴリー（ｉとする）を変化さ
せ，誤読カテゴリーが分割対象のカテゴリーと誤読する
かどうか判定して，誤読する場合は誤読度を更新して情
報保持部５に記録する。そして，誤読カテゴリー（ｉ）
と被誤読カテゴリーと分割対象カテゴリーが同一カテゴ
リーの場合に特定誤読チェック部２ａが起動し，被誤読
カテゴリーを変化させて，誤読カテゴリー（ｉ）を被誤
読カテゴリーと誤読するか否かのチェックを行い，誤読
する場合は誤読度を更新し，情報保持部５に記録する。First, a category (template dictionary) division process is performed by the conventional method, and the previous division result holding unit 1
The previous division category holding means 1a holds the division target category (referred to as mc), and further holds the misread category having a high misreading degree for each category in the misread category holding means 1b. After that, the first misreading check unit 2 is activated to change each misreading category (referred to as i), determine whether the misreading category is misread with the division target category, and if misreading, update the misreading degree. Then, it is recorded in the information holding unit 5. And misreading category (i)
When the misread category and the division target category are the same category, the specific misreading check unit 2a is activated, the misread category is changed, and it is checked whether or not the misread category (i) is misread as the misread category. When misreading, the misreading degree is updated and recorded in the information holding unit 5.

【００２３】第１の誤読チェック部２または特定誤読チ
ェック部２ａの動作が終了すると，第２の誤読チェック
部３が起動し，前回分割カテゴリー保持手段１ａが保持
するカテゴリー（ｍｃ）を誤読カテゴリーとして，被誤
読カテゴリーを変化させて被誤読カテゴリーをｍｃと誤
読するか判定し，誤読する場合は誤読度を更新して情報
保持部５に記録する。最後に検出部４は情報保持部５に
記録された更新された各誤読度から最大誤読度のカテゴ
リーを検出する。検出されたカテゴリーについて分割が
行われる。When the operation of the first misreading checking unit 2 or the specific misreading checking unit 2a is completed, the second misreading checking unit 3 is activated, and the category (mc) held by the previously divided category holding unit 1a is set as the misreading category. The misread category is changed to determine whether the misread category is misread as mc. If misread, the misreading degree is updated and recorded in the information holding unit 5. Finally, the detection unit 4 detects the maximum misreading degree category from each updated misreading degree recorded in the information holding unit 5. The division is performed for the detected category.

【００２４】このように，従来はカテゴリー数Ｎの２乗
の単位の処理を繰り返してテンプレート（辞書）を分割
していたが，Ｎ回の単位で計算を行うだけでよいため，
計算量を削減でき，高精度のマルチテンプレート化辞書
を作成することができる。As described above, conventionally, the template (dictionary) is divided by repeating the processing in the unit of the square of the number of categories N, but since the calculation only needs to be performed in the unit of N times,
The amount of calculation can be reduced and a highly accurate multi-template dictionary can be created.

【００２５】[0025]

【発明の実施の形態】図２は本発明の基本の処理フロー
である。図２において，最初に文字特徴量の分布状況を
調査する（図２のＳ１）。この場合，各文字（カテゴリ
ー）に対応した特徴量（ベクトル等）を抽出する。次に
２字種間で誤認識するか判別する（同Ｓ２）。誤認識し
ないことが分かると終了するが，誤認識する場合は，誤
認識する字種について新しいテンプレート（文字認識用
辞書）を作成し（図２のＳ３），次に新しいテンプレー
トを考慮して字種間分布を再計算する（同Ｓ４）。この
再計算の後，ステップＳ２に戻り，再度２字種間で誤認
識するか判別する。このＳ２〜Ｓ４，Ｓ２のループ処理
は２字種間で誤認識がなくなるまで繰り返す。2 is a basic processing flow of the present invention. In FIG. 2, first, the distribution state of the character feature amount is investigated (S1 in FIG. 2). In this case, the feature amount (vector or the like) corresponding to each character (category) is extracted. Next, it is determined whether the two character types are erroneously recognized (S2). The process ends when it is found that no misrecognition occurs, but if misrecognition occurs, a new template (dictionary for character recognition) is created for the misrecognized character type (S3 in FIG. 2), and the new template is taken into consideration. The inter-species distribution is recalculated (S4). After this recalculation, the process returns to step S2, and it is determined again whether the two character types are erroneously recognized. The loop process of S2 to S4 and S2 is repeated until there is no misrecognition between the two character types.

【００２６】図３，図４は本発明の実施例の処理フロー
（その１），（その２）であり，図５は本発明による誤
読カテゴリーのチェックの例を示す図である。開始する
と，一回目は従来法で分割を行う（図３のＳ１）。この
従来法は，上記図７に示した方法であり，読取対象のカ
テゴリーが３５００の場合は，二重ループの処理により
その２乗回数の処理を繰り返す。次に，一回目で分割対
象となったカテゴリーを変数ｍｃとして記憶する（図３
のＳ２）。次に，おのおののカテゴリー（例えば，読取
対象の文字種ｉについて）について，ｉを他の文字種ｊ
と誤読する場合，ｊを被誤読カテゴリーとしてそれぞれ
記憶する。この時に計算した各誤読度はメモリ（図示省
略）に記憶する（図３のＳ３）。以上のＳ１〜Ｓ３は，
本発明による分割対象カテゴリーを決定する前処理に相
当し，一回目の処理の結果の必要なデータを記憶してお
き，この後の２回目以降の処理では，１回目から変化し
ないデータについては処理せず，変化した部分について
だけ処理を行うことにより処理を短縮するようにした。FIGS. 3 and 4 are process flows (1) and (2) of the embodiment of the present invention, and FIG. 5 is a diagram showing an example of checking misreading categories according to the present invention. Once started, the first division is performed by the conventional method (S1 in FIG. 3). This conventional method is the method shown in FIG. 7 above, and when the category to be read is 3500, the processing of the number of times of squaring is repeated by the processing of the double loop. Next, the category that is the division target in the first time is stored as a variable mc (see FIG. 3).
S2). Next, for each category (for example, for the character type i to be read), i is the other character type j
If misread and stores each j as the misreading category. Each misreading degree calculated at this time is stored in a memory (not shown) (S3 in FIG. 3). The above S1 to S3 are
Corresponding to the pre-processing for determining the division target category according to the present invention, necessary data as a result of the first processing is stored, and in the second and subsequent processings thereafter, data that does not change from the first processing is processed. Instead, the processing is shortened by processing only the changed parts.

【００２７】すなわち，大きく分けて，Ａブロックの処
理とこれに続くＢブロックの処理とで構成され，Ａブロ
ックはＡ１のブロックまたはＡ２のブロックにより構成
される。That is, it is roughly divided into a process of an A block and a process of a B block which follows it, and the A block is composed of an A1 block or an A2 block.

【００２８】ＡブロックのＡ１では，誤読カテゴリーを
ｉからＮまで変化させる。すなわち，最初にｉ＝０とし
て（図３のＳ４），ｉを誤読カテゴリー（対象となる文
字種）とする（同Ｓ５）。次にこのｉを被誤読カテゴリ
ー＝ｍｃになるか（ｉをｍｃと誤読するか）を判別する
（図３のＳ６）。ここで，前回の処理で記憶しておいた
ｉの被誤読カテゴリーとｍｃが同一のカテゴリーと一致
するか判別し，一致した場合は，Ａ２の処理に移行する
が，一致しない場合は，ｍｃを被誤読カテゴリーにし
（図３のＳ７），カテゴリーｉをカテゴリーｍｃと誤読
するか判別する（同Ｓ８）。誤読する場合は，ｍｃに対
する誤読度がより高い場合はｉの誤読度を更新し（図３
のＳ９），ｉの被誤読カテゴリーを更新する（同Ｓ１
０）。その後，ｉを＋１して（図３のＳ１１），ｉがカ
テゴリーの個数に達しないか判別し（同Ｓ１２），達し
ない場合はＳ５に戻り，達した場合はグループＢの処理
に移る。At A1 of the A block, the misread category is changed from i to N. That is, first, i = 0 is set (S4 in FIG. 3), and i is set as a misreading category (target character type) (S5). Next, it is determined whether or not this i becomes the misread category = mc (i is misread as mc) (S6 in FIG. 3). Here, it is determined whether the misread category of i and mc stored in the previous process match the same category, and if they match, the process moves to A2, but if they do not match, mc is changed. The misread category is set (S7 in FIG. 3), and it is determined whether or not the category i is misread as the category mc (S8). When misreading, when the misreading degree for mc is higher, the misreading degree of i is updated (see FIG. 3).
S9), and the misread category of i is updated (S1 of the same).
0). After that, i is incremented by 1 (S11 in FIG. 3), and it is determined whether i reaches the number of categories (S12). If i is not reached, the process returns to S5, and if i is reached, group B processing is performed.

【００２９】上記Ａ１におけるステップＳ６において，
ｉの被誤読カテゴリーがｍｃと一致した場合，Ａ２グル
ープの処理が実行される。このＡ２では，被誤読カテゴ
リーをｍｃとせずｊとし，被誤読カテゴリーｊをＮまで
変化させｉをｊと誤読するかどうかの判定をする。ｉを
何らかのカテゴリーと誤読し，そのカテゴリーに対する
誤読度がより高い場合は，ｉの誤読度を更新する。At step S6 in the above A1,
If the misread category of i matches mc, the processing of the A2 group is executed. In A2, the misread category is set to j instead of mc, the misread category j is changed to N, and it is determined whether or not i is misread as j. If i is misread as some category and the misreading level for that category is higher, the misreading level of i is updated.

【００３０】すなわち，前回の分割により変更されたカ
テゴリーに対する誤読度等の値について，これらが被誤
読カテゴリーになる場合には，変更された誤読度等につ
いてだけ変更するものであり，それ以外の値については
変更されないので処理の対象としない。すなわち，分割
の対象となった誤読カテゴリーｉを被誤読カテゴリーと
する各カテゴリーだけについて処理を行うものである。That is, regarding the values such as the misreading degree for the category changed by the previous division, if these are the misread categories, only the changed misreading degree is changed, and other values are used. Is not subject to processing because it is not changed. That is, processing is performed only for each category in which the misread category i that is the target of division is the misread category.

【００３１】最初にｊ＝０にして（図３のＳ１３），ｊ
を被誤読カテゴリーとし（同Ｓ１４），ｉをｊと誤認識
するか判別する（同Ｓ１５）。ここで，ｉをｊと誤認識
する場合は，ｊに対する誤読度がより高い場合は，ｊの
誤読度を更新し（同Ｓ１６），ｊの被誤読カテゴリーを
更新する（同Ｓ１７）。続いて，ｊを＋１して（同Ｓ１
８），ｊがカテゴリー数を越えたか判別し（図３のＳ１
９），越えないとステップＳ１３に戻りループ動作を繰
り返す。ｊがカテゴリー数に達すると，ｉを＋１して
（同Ｓ２０），グループＡのＳ１２に戻る。First, j = 0 is set (S13 in FIG. 3), j
Is the misread category (S14), and it is determined whether i is misrecognized as j (S15). Here, in the case of erroneously recognizing i as j, if the misreading degree for j is higher, the misreading degree of j is updated (at step S16) and the misread category of j is updated (at step S17). Then, j is incremented by 1 (the same S1
8), j is determined whether it exceeds the number of categories (S1 in FIG. 3).
9) If not exceeded, the process returns to step S13 to repeat the loop operation. When j reaches the number of categories, i is incremented by 1 (at step S20) and the process returns to step S12 of group A.

【００３２】Ｓ１２において，ｊがカテゴリー数に達し
たことが分かると，図４に示すグループＢへ移行する。
グループＢでは，前回の分割対象となったカテゴリーｍ
ｃを誤読カテゴリーとして，このカテゴリーに誤読され
る被誤読カテゴリーｊをＮまで変化させ，ｍｃをｊと誤
読するかどうかの判定をする。When it is determined in S12 that j has reached the number of categories, the group B shown in FIG. 4 is entered.
In group B, the category m that was the target of the previous division
With c as the misread category, the misread category j misread by this category is changed to N, and it is determined whether or not mc is misread as j.

【００３３】すなわち，ｍｃを誤読カテゴリーとし（図
４のＳ２１），ｊ＝０に設定し（同Ｓ２２），ｊを被誤
読カテゴリーにする（同Ｓ２３）。次にｍｃをｊと誤読
するか判別し（図４のＳ２４），誤読する場合は，ｊに
対する誤読度がより高い場合はｍｃの誤読度を更新し
（同Ｓ２５），ｍｃの被誤読カテゴリーを更新する（同
Ｓ２６）。Ｓ２４及びＳ２６に続いてｊを＋１し（図４
のＳ２７），そのｊがカテゴリー数を越えないか判別し
（同Ｓ２８），越えない場合はＳ２３に戻って，ループ
動作を繰り返す。ｊがカテゴリー数に達すると，おのお
のの誤読カテゴリーに対する誤読度のうち最大のものを
持つカテゴリーを分割する（図４のＳ２９）。That is, mc is set as the misread category (S21 in FIG. 4), j = 0 is set (S22 in the same), and j is set as the misread category (S23). Next, it is determined whether mc is misread as j (S24 in FIG. 4). When misreading is performed, if the misreading degree for j is higher, the misreading degree of mc is updated (S25 in the same), and the misread category of mc is set. It is updated (at step S26). Subsequent to S24 and S26, j is incremented by 1 (see FIG. 4).
S27), and it is determined whether or not j exceeds the number of categories (S28), and if it does not exceed j, the process returns to S23 to repeat the loop operation. When j reaches the number of categories, the category having the largest misreading degree for each misreading category is divided (S29 in FIG. 4).

【００３４】上記Ａ２の処理はごくまれに実行されるだ
けであるため，実質の処理時間はブロックＡ１のループ
の処理時間＋ブロックＢのループの処理時間となり，そ
れぞれがカテゴリー数Ｎについてのループが１度だけな
ので，実行時間はＮのオーダに削減される。Since the processing of A2 is executed only rarely, the actual processing time is the processing time of the loop of block A1 + the processing time of the loop of block B. Since it is only once, the execution time is reduced to N orders.

【００３５】上記図３，図４の処理を図５に示す例によ
り説明する。図５の各行には誤読カテゴリーｉが配置さ
れ，(1) 〜(9) の各行はそれぞれ文字種であり，それぞ
れ例えば「あ」，「い」，「う」等の文字種に対応す
る。各列には被誤読カテゴリーが配置され，(1) 〜(9)
の各列はそれぞれ各行の(1) 〜(9) と同じ文字種であ
り，同じ数字の文字同士が交差する点を除いて，相互の
関係を表す。図５の(3) 行目（ｉ＝３）が分割対象のカ
テゴリーとする（図３のＳ２）。このｉを被誤読カテゴ
リー＝ｍｃ（＝３）とし，このカテゴリーを図５の３列
目に網掛けで示すように被誤読カテゴリーとして，図５
のグループＡ１の処理が実行される。図５の例ではこの
時，(3) のｍｃの文字として誤読する誤読カテゴリーｉ
が(5) 行のカテゴリーと(8) 行のカテゴリーの２つであ
る。すなわち，横方向の第５行と第８行の網掛けの領域
について上記図３のグループＡ２の処理が実行され，誤
読度が更新される。３列目との交差点の黒丸で表す，
は，被誤読カテゴリーが存在することを表している。The processing of FIGS. 3 and 4 will be described with reference to the example shown in FIG. A misreading category i is arranged in each line of FIG. 5, and each line of (1) to (9) is a character type, which corresponds to a character type such as “A”, “I”, and “U”. Misread categories are placed in each column, and (1) to (9)
Each column of has the same character type as (1) to (9) of each row, and expresses the mutual relationship except that the characters of the same number intersect. The line (3) in FIG. 5 (i = 3) is the category to be divided (S2 in FIG. 3). This i is set as the misread category = mc (= 3), and this category is set as the misread category as shown in the third column of FIG.
The processing of group A1 is executed. In the example of FIG. 5, at this time, the misreading category i is misread as the character mc of (3).
Are two categories, line (5) and line (8). That is, the processing of group A2 in FIG. 3 is executed for the shaded areas of the fifth row and the eighth row in the horizontal direction, and the misreading degree is updated. Expressed by the black circle at the intersection with the third column,
Indicates that there is a misread category.

【００３６】更に，ｍｃを誤読カテゴリーとして，この
ｍｃを他のカテゴリーとして誤読する場合の誤読度を求
めるグループＢの処理は，図５の３行目の網掛けの領域
に対して，列方向に交差する点が被誤読カテゴリーとな
り, 誤読度を更新している。この例では３行のカテゴリ
ーを(8) 列のカテゴリーと誤読している（黒丸で表
す）。Further, the processing of the group B for obtaining the degree of misreading when mc is misread as a misreading category and this mc is misread as another category is performed in the column direction with respect to the shaded area in the third row in FIG. The crossing point becomes the misread category, and the misreading degree is updated. In this example, the category in 3 rows is misread as the category in column (8) (represented by a black circle).

【００３７】[0037]

【発明の効果】本発明によれば誤認識の可能性のある字
種を推定する際に使用する誤読度を記憶しておき，変更
があった箇所のみ再計算するようにして，誤認識チェッ
クの計算量を大幅に削減することができる。これによ
り，カテゴリーを分割する処理を高速化して精度の高い
テンプレート辞書を高速に作成することができる。According to the present invention, the misreading degree used for estimating a character type that may be misrecognized is stored, and only the changed portion is recalculated to check the misrecognition. The calculation amount of can be significantly reduced. This makes it possible to speed up the process of dividing the categories and create a highly accurate template dictionary at high speed.

[Brief description of drawings]

【図１】本発明の原理構成を示す図である。FIG. 1 is a diagram showing a principle configuration of the present invention.

【図２】本発明の基本の処理フローである。FIG. 2 is a basic processing flow of the present invention.

【図３】実施例の処理フロー（その１）を示す図であ
る。FIG. 3 is a diagram showing a processing flow (No. 1) of the embodiment.

【図４】実施例の処理フロー（その２）を示す図であ
る。FIG. 4 is a diagram showing a processing flow (No. 2) of the embodiment.

【図５】本発明による誤読カテゴリーのチェックの例を
示す図である。FIG. 5 is a diagram showing an example of checking a misread category according to the present invention.

【図６】分割する条件の説明図である。FIG. 6 is an explanatory diagram of a dividing condition.

【図７】初期クラスタ中心の取り方を示す図である。FIG. 7 is a diagram showing how to take the center of an initial cluster.

【図８】従来の分割対象カテゴリーを推定する処理フロ
ーを示す図である。FIG. 8 is a diagram showing a processing flow for estimating a conventional division target category.

【図９】従来の誤読カテゴリーチェックの回数の説明図
である。FIG. 9 is an explanatory diagram of the number of times of conventional misread category check.

[Explanation of symbols]

１前回分割結果保持部１ａ前回分割カテゴリー保持手段１ｂ被誤読カテゴリー保持手段２第１の誤読チェック部２ａ特定誤読チェック部３第２の誤読チェック部４検出部５保持部 1 Previous division result holding unit 1a Previously divided category holding means 1b Misreading category holding means 2 First misreading check section 2a Specific misreading check section 3 Second misreading check section 4 detector 5 holding part

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06K 9/00 - 9/76 ─────────────────────────────────────────────────── ─── Continuation of the front page (58) Fields surveyed (Int.Cl. ⁷ , DB name) G06K 9/00-9/76

Claims

(57) [Claims]

1. In a character template multi-template dictionary creation method, a category (mc) that was previously divided, a holding unit that holds misreading degree and misread character as misread information at the time of previous division, and a misread category are set. Fixed to the previous division category (mc), the misread category is checked for misreading only for all categories, and the first check unit for updating and holding the misread degree and the misread character according to the check result, Fixed to, the misreading category is checked for misreading only for all categories, and the second check part holds the misreading degree and the misread character according to the result of the check, and the misreading degree for each of the held misreading categories And a detection unit for detecting a category having the largest one, and dividing the category having the largest misreading degree. Multi-template dictionary creation method for character recognition to.

2. The specific misreading check unit according to claim 1, further comprising a specific misreading check unit that is activated when the misread category of the misread category and the division target category are detected in the first check unit. The misreading checking unit changes the misreading category, checks whether the misreading category is misread as the misreading category, and if misreading, updates the misreading degree and records it. How to create a multi-template dictionary.