JPH0512493A

JPH0512493A - Character recognizing device

Info

Publication number: JPH0512493A
Application number: JP3159316A
Authority: JP
Inventors: Daisuke Nishiwaki; 大輔西脇
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1991-07-01
Filing date: 1991-07-01
Publication date: 1993-01-22
Anticipated expiration: 2014-06-21
Also published as: JP2906743B2

Abstract

PURPOSE:To simultaneously reduce the number of misreading and the number of abandonment by using plural character classifying means at the same time. CONSTITUTION:Codes Ci (d1(i) and d2(i)) are generated from a distance value to the first candidate character code Ci of a first classifying means 14 and the distance value of a second character classifying means 15 to the code Ci, and codes Cj (d1(j), d2(j)) are generated from the distance value of the means 15 to the first candidate character code Cj and the distance value of the means 14 to the code Cj. Then, evaluation values Ei and Ej to the candidates Ci and Cj are generated from a first main axis, which is calculated from the distribution of vectors to the learning pattern of each candidate according to a main component analyzing method, and the codes Ci (di(i), d2(i)) and Cj (d1(j), d2(j)), and a pair judging means 16 judges which candidate is adopted and abandonned according to those evaluation values Ei and Ej. In this case, even when the first candidate character codes of the both means are equal, the abandonment is judged by using the evaluation values calculated by the method at the pair judging means 16.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、２種類及び３種類以上
の文字分類手段の併用した文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device which uses two or more kinds of character classification means in combination.

【０００２】[0002]

【従来の技術】従来のパターン整合法による文字認識装
置では、未知文字と認識辞書部に記憶されている標準文
字との類似度を計算し、最大類似度が予めその標準文字
に対して設定した閾値以上であり、かつ最大類似度と次
大類似度との差が予め設定したもうひとつの閾値以上の
場合に、未知文字は最大類似度を示す標準文字と同一の
字種という認識結果を出力する。各字種毎に、これらの
２個の閾値を適当に設定することより、誤読する未知文
字の数を抑えるとともに認識不能となる未知文字の数も
減らすことができる。（特公昭５８−１７８４８４号公
報）。2. Description of the Related Art In a conventional character recognition device using a pattern matching method, the similarity between an unknown character and a standard character stored in a recognition dictionary is calculated, and the maximum similarity is set in advance for that standard character. If the difference is greater than or equal to the threshold and the difference between the maximum similarity and the next largest similarity is equal to or greater than another preset threshold, the unknown character outputs the recognition result that it is the same character type as the standard character showing the maximum similarity. To do. By appropriately setting these two thresholds for each character type, it is possible to suppress the number of unknown characters that are misread and reduce the number of unknown characters that cannot be recognized. (Japanese Patent Publication No. 58-178484).

【０００３】また、複数の文字分類手段を併用した文字
認識装置であって前述のような閾値を使う装置が知られ
ている（特願平３−５２６号）。この装置における判定
アルゴリズムの概要を図７に示す。これによれば、複数
種の分類手段が出力する分類結果の第一候補が異なった
場合に、それぞれの分類手段が出力する分類結果の第一
候補に対して評価値を計算し、これによりどちらの分類
手段が出力する分類結果の第一候補を最終認識結果とす
るかを判定する。また、この時それぞれの分類手段が出
力する分類結果の第一候補に対する評価値に対し閾値を
設定し、これを越える分類結果の第一候補に対しては棄
却を行っている。There is also known a character recognition device which uses a plurality of character classification means in combination and uses a threshold value as described above (Japanese Patent Application No. 3-526). FIG. 7 shows the outline of the determination algorithm in this device. According to this, when the first candidates of the classification results output by the plurality of types of classification means are different, the evaluation value is calculated for the first candidates of the classification results output by the respective classification means, which It is determined whether or not the first candidate of the classification result output by the classification unit is to be the final recognition result. At this time, a threshold value is set for the evaluation value for the first candidate of the classification result output by each classification means, and the first candidate for the classification result exceeding this is rejected.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、前述の
複数の文字の分類手段を併用することにより、認識率の
低下を防ぐ装置は、それぞれの分類手段が出力する分類
結果の第一候補が同一であった場合には、棄却の判定を
行なわずその候補を直ちに最終認識結果としてしまう。
この場合には、誤読を起こしやすい。そこで、それぞれ
の分類手段が出力する分類結果の第一候補が同一だった
場合でもそれを最終認識結果とするか、もしくは、棄却
するかを判定し認識精度を向上させることが必要とな
る。However, in the device for preventing the deterioration of the recognition rate by using the above-mentioned plurality of character classification means together, the first candidates of the classification results output by the respective classification means are the same. If there is, the decision is not made and the candidate is immediately set as the final recognition result.
In this case, misreading is likely to occur. Therefore, even if the first candidates of the classification results output by the respective classification means are the same, it is necessary to improve the recognition accuracy by determining whether to use the first candidate as the final recognition result or to reject it.

【０００５】本発明の目的は、誤読又は棄却する未知文
字の数を減少させることができる文字認識装置を提供す
ることにある。An object of the present invention is to provide a character recognition device capable of reducing the number of unknown characters that are misread or rejected.

【０００６】[0006]

【課題を解決するための手段】第１の発明の文字認識装
置は、２種類の文字分類手段を有し、それぞれの分類手
段により選択された第一候補の文字コードから最終認識
結果を出力する文字認識装置において、（１）第一の分類手段により選択された第一候補の文字
コードＣｉとその距離値ｄ１（ｉ）と、第二の分類手段
における、前記文字コードＣｉに対する距離値ｄ２
（ｉ）とから、２次元ベクトルＣｉ（ｄ１（ｉ），ｄ２
（ｉ））を生成する手段と、（２）第二の分類手段により選択された第一候補の文字
コードＣｊとその距離値ｄ２（ｊ）と、第一の分類手段
における、前記文字コードＣｊに対する距離値ｄ１
（ｊ）とから、２次元ベクトルＣｊ（ｄ１（ｊ），ｄ２
（ｊ））を生成する手段と、（３）前記２次元ベクトルＣｉ（ｄ１（ｉ），ｄ２
（ｉ））とＣｊ（ｄ１（ｊ），ｄ２（ｊ））とから、前
記文字コードＣｉに対する評価値Ｅｉと前記文字コード
Ｃｊに対する評価値Ｅｊとを計算する手段において、分
類対象となる字種Ｋに属する学習用の文字パターンＤｋ
に対する距離値であるｄ１（ｋ）とｄ２（ｋ）とを座標
軸とした平面上における、２次元ベクトルＣｋ（ｄ１
（ｋ），ｄ２（ｋ））の分布の主軸をＶｋ（ｐ１
（ｋ），ｐ２（ｋ）））を、主成分分析法を用い各ベク
トルの要素ｄ１（ｋ）、ｄ２（ｋ）を変量として得られ
る２次元の分散共分散行列の最大固有値から求め、前記
Ｖｋ（ｐ１（ｋ），ｐ２（ｋ））と前記２次元ベクトル
Ｃｋ（ｄ１（ｋ），ｄ２（ｋ））とから前記文字コード
Ｃｋに対する評価値Ｅｋを決定することにより前記評価
値ＥｉとＥｊとを計算する手段と（４）前記文字コードＣｉと前記文字コードＣｊとが同
一の場合に前記評価値Ｅｉを用いて前記文字コードＣｉ
を最終認識結果として出力するか、若しくは棄却するか
を判定し、又は前記文字コードＣｉと前記文字コードＣ
ｊとが異なる場合には、前記評価値Ｅｊとを用いてどち
らの文字コードを最終認識結果として出力するか若しく
はどちらも棄却するかを対判定する手段とから構成され
ることを特徴とする。The character recognition device of the first invention has two types of character classification means, and outputs the final recognition result from the character code of the first candidate selected by each classification means. In the character recognition device, (1) the first candidate character code Ci selected by the first classification means and its distance value d1 (i), and the distance value d2 with respect to the character code Ci in the second classification means.
From (i), the two-dimensional vector Ci (d1 (i), d2
(I)) generating means, (2) the first candidate character code Cj selected by the second classifying means and its distance value d2 (j), and the character code Cj in the first classifying means. Distance value d1 to
From (j), the two-dimensional vector Cj (d1 (j), d2
(J)) generating means, and (3) the two-dimensional vector Ci (d1 (i), d2
(I)) and Cj (d1 (j), d2 (j)), the means for calculating the evaluation value Ei for the character code Ci and the evaluation value Ej for the character code Cj are the character types to be classified. Learning character pattern Dk belonging to K
Two-dimensional vector Ck (d1 on a plane having coordinate values of d1 (k) and d2 (k), which are distance values to
The main axis of the distribution of (k) and d2 (k) is Vk (p1
(K), p2 (k))) is obtained from the maximum eigenvalue of a two-dimensional variance-covariance matrix obtained by using the principal component analysis method with the elements d1 (k) and d2 (k) of each vector as variables. The evaluation values Ei and Ej are determined by determining the evaluation value Ek for the character code Ck from Vk (p1 (k), p2 (k)) and the two-dimensional vector Ck (d1 (k), d2 (k)). And (4) when the character code Ci and the character code Cj are the same, the character code Ci is calculated using the evaluation value Ei.
Is output as the final recognition result or is rejected, or the character code Ci and the character code C are determined.
When j is different from the above, it is characterized by comprising means for making a pair decision by using the evaluation value Ej which character code is to be output as the final recognition result or both are to be rejected.

【０００７】第２の発明の文字認識装置は、２種類の文
字分類手段を有し、それぞれの分類手段により選択され
た第一候補の文字コードから最終認識結果を出力する文
字認識装置において、（１）第一の分類手段により選択された第一候補の文字
コードＣｉとその距離値ｄ１（ｉ）と、第二の分類手段
における、前記文字コードＣｉに対する距離値ｄ２
（ｉ）とから２次元ベクトルＣｉ（ｄ１（ｉ），ｄ２
（ｉ））を生成する手段と、（２）第二の分類手段により選択された第一候補の文字
コードＣｊとその距離値ｄ２（ｊ）と、第一の分類手段
における、前記文字コードＣｊに対する距離値ｄ１
（ｊ）とから２次元ベクトルＣｊ（ｄ１（ｊ），ｄ２
（ｊ））を生成する手段と、（３）前記２次元ベクトルＣｉ（ｄ１（ｉ），ｄ２
（ｉ））とＣｊ（ｄ１（ｊ），ｄ２（ｊ）とから、前記
文字コードＣｉに対する評価値Ｅｉと前記文字コードＣ
ｊに対する評価値Ｅｊとを計算する手段において、分類
対象となる字種ｋに属する学習用の文字パターンＣｋに
対する距離値であるｄ１（ｋ）とｄ２（ｋ）とを座標軸
とした平面上における、２次元ベクトルＣｋ（ｄ１
（ｋ），ｄ２（ｋ））の分布の主軸をＶｋ（ｐ１
（ｋ），ｐ２（ｋ）））を、主成分分析法を用い各ベク
トルの要素ｄ１（ｋ）、ｄ２（ｋ）を変量として得られ
る２次元の相関行列の最大固有値から求め、前記Ｖｋ
（ｐ１（ｋ），ｐ２（ｋ））と前記２次元ベクトルＣｋ
（ｄ１（ｋ），ｄ２（ｋ））とから文字コードＣｋに対
する評価値Ｅｋを決定することにより前記評価値Ｅｉと
Ｅｊとを計算する手段と（４）前記文字コードＣｉと前記文字コードＣｊとが同
一の場合に前記評価値Ｅｉを用いて前記文字コードＣｉ
を最終認識結果として出力するか、若しくは棄却するか
を判定し、又は前記文字コードＣｉと前記文字コードＣ
ｊとが異なる場合には、前記評価値Ｅｉと前記評価値Ｅ
ｊとを用いてどちらの文字コードを最終認識結果として
出力するか若しくはどちらも棄却するかを対判定する手
段とから構成されることを特徴とする。The character recognition device of the second invention is a character recognition device which has two types of character classification means and outputs a final recognition result from the character code of the first candidate selected by each classification means, 1) The first candidate character code Ci selected by the first classifying means and its distance value d1 (i), and the distance value d2 with respect to the character code Ci in the second classifying means.
From (i), the two-dimensional vector Ci (d1 (i), d2
(I)) generating means, (2) the first candidate character code Cj selected by the second classifying means and its distance value d2 (j), and the character code Cj in the first classifying means. Distance value d1 to
(J) and the two-dimensional vector Cj (d1 (j), d2
(J)) generating means, and (3) the two-dimensional vector Ci (d1 (i), d2
(I)) and Cj (d1 (j), d2 (j)), the evaluation value Ei for the character code Ci and the character code C
In the means for calculating the evaluation value Ej for j, the distance values d1 (k) and d2 (k), which are distance values for the learning character pattern Ck belonging to the character type k to be classified, are coordinate axes. Two-dimensional vector Ck (d1
The main axis of the distribution of (k) and d2 (k) is Vk (p1
(K), p2 (k))) is calculated from the maximum eigenvalue of a two-dimensional correlation matrix obtained by using the principal component analysis method with the elements d1 (k) and d2 (k) of each vector as a variate.
(P1 (k), p2 (k)) and the two-dimensional vector Ck
Means for calculating the evaluation values Ei and Ej by determining the evaluation value Ek for the character code Ck from (d1 (k), d2 (k)), and (4) the character code Ci and the character code Cj. The same character code Ci using the evaluation value Ei
Is output as the final recognition result or is rejected, or the character code Ci and the character code C are determined.
When j is different, the evaluation value Ei and the evaluation value E
It is characterized by comprising a means for making a pair determination by using j and which character code is to be output as the final recognition result or which is to be rejected.

【０００８】第３の発明の文字認識装置は、３種類以上
全ｎ種類の文字分類手段を有し、それぞれの分類手段に
より選択された第一候補の文字コードから最終認識結果
を出力する文字認識装置において、（１）ｎを越えないｍに対する第ｍ番目の分類手段によ
り選択された第一候補の文字コードＣｈとその距離値ｄ
ｍ（ｈ）と、第ｍ番目以外のｎ−１個の分類手段におけ
る、前記文字コードＣｈに対する距離値とからそれらを
分類手段の順に並べたｎ次元ベクトルＣｈ（ｄ１
（ｈ），ｄ２（ｈ），．．．，ｄｍ（ｈ），．．．，ｄ
ｎ（ｈ））を生成することを全分類手段の第一候補に対
して行なう手段と、（２）前記ｎ次元ベクトルＣｈ（ｄ１（ｈ），ｄ２
（ｈ），．．．，ｄｍ（ｈ），．．．，ｄｎ（ｈ））か
ら、全ｎ個の分類手段の各々により選択された第一候補
の文字コードに対して評価値を計算する手段において、
分類対象となる字種ｋに属する学習用の文字パターンＣ
ｈから得たｎ次元ベクトルＣｋ（ｄ１（ｋ），ｄ２
（ｋ），．．．，ｄｍ（ｋ），．．．，ｄｎ（ｋ））の
各要素を座標軸とした空間内におけるｎ次元ベクトルＣ
ｋ（ｄ１（ｋ），ｄ２（ｋ），．．．，ｄｍ
（ｋ），．．．，ｄｎ（ｋ））の分布の主軸Ｖｋ（ｐ１
（ｋ），ｐ２（ｋ），．．．，ｐｍ（ｋ），．．．，ｐ
ｎ（ｋ））を、主成分分析法を用い各ベクトルの要素ｄ
ｍ（ｋ）を変量として得られるｎ次元の分散共分散行列
の最大固有値から求め、前記Ｖｋ（ｐ１（ｋ），ｐ２
（ｋ），．．．，ｐｍ（ｋ），．．．，ｐｎ（ｋ））と
前記ｎ次元ベクトルＣｋ（ｄ１（ｋ），ｄ２
（ｋ），．．．，ｄｍ（ｋ），．．．，ｄｎ（ｋ））と
から前記文字コードＣｋに対する評価値Ｅｋを決定する
ことにより前記評価値を計算する手段と、（３）第ｎ番目までの全分類手段により選択された第一
候補の文字コードが同一の場合には、第一候補の文字コ
ードに対する評価値を用いて候補の文字コードを最終認
識結果として選択するか、若しくは棄却するかを判定
し、又は第ｎ番目までの分類手段に対する分類結果の第
一候補の文字コードが異なる場合において、全ｎ個の第
一候補の文字コードに対するそれぞれの評価値からどの
候補を最終認識結果として出力するか、若しくはどちら
も棄却するかを判定する手段とから構成されることを特
徴とする。The character recognition device of the third invention has three or more kinds of all n kinds of character classification means, and outputs the final recognition result from the first candidate character code selected by each classification means. In the apparatus, (1) the first candidate character code Ch selected by the m-th classification means for m not exceeding n and its distance value d
An n-dimensional vector Ch (d1) in which m (h) and distance values with respect to the character code Ch in the n-1 number of classification means other than the m-th one are arranged in the order of the classification means
(H), d2 (h) ,. ．． , Dm (h) ,. ．． , D
(2) the n-dimensional vector Ch (d1 (h), d2) for generating n (h)) for the first candidate of all classification means;
(H) ,. ．． , Dm (h) ,. ．． , Dn (h)), the means for calculating an evaluation value for the character code of the first candidate selected by each of the n total classification means,
A learning character pattern C belonging to the character type k to be classified
n-dimensional vector Ck (d1 (k), d2 obtained from h
(K) ,. ．． , Dm (k) ,. ．． , Dn (k)) is the n-dimensional vector C in the space with the coordinate axes.
k (d1 (k), d2 (k), ..., dm
(K) ,. ．． , Dn (k)) main axis Vk (p1
(K), p2 (k) ,. ．． , Pm (k) ,. ．． , P
n (k)) is the element d of each vector using the principal component analysis method
Vk (p1 (k), p2 is obtained from the maximum eigenvalue of the n-dimensional variance-covariance matrix obtained by using m (k) as a variable
(K) ,. ．． , Pm (k) ,. ．． , Pn (k)) and the n-dimensional vector Ck (d1 (k), d2
(K) ,. ．． , Dm (k) ,. ．． , Dn (k)) and means for calculating the evaluation value Ek for the character code Ck, and (3) first candidate character selected by all classification means up to the n-th When the codes are the same, it is determined whether the candidate character code is selected as the final recognition result or rejected using the evaluation value for the first candidate character code, or the classification means up to the n-th When the character code of the first candidate of the classification result is different, it is determined which candidate is to be output as the final recognition result or both are rejected from the respective evaluation values for all n character codes of the first candidate. And means.

【０００９】第４の発明の文字認識装置は、３種類以上
全ｎ種類の文字分類手段を有し、それぞれの分類手段に
より選択された第一候補の文字コードから最終認識結果
を出力する文字認識装置において、（１）ｎを越えないｍに対する第ｍ番目の分類手段によ
り選択された第一候補の文字コードＣｈとその距離値ｄ
ｍ（ｈ）と、第ｍ番目以外のｎ−１個の分類手段におけ
る、前記文字コードＣｈに対する距離値とから、それら
を分類手段の順に並べたｎ次元ベクトルＣｈ（ｄ１
（ｈ），ｄ２（ｈ），．．．，ｄｍ（ｈ），．．．，ｄ
ｎ（ｈ））を生成することを全分類手段の第一候補に対
して行なう手段と、（２）前記ｎ次元ベクトルＣｈ（ｄ１（ｈ），ｄ２
（ｈ），．．．，ｄｍ（ｈ），．．．，ｄｎ（ｈ））か
ら、全ｎ個の分類手段の各々により選択された第一候補
の文字コードに対して評価値を計算する手段において、
分類対象となる字種ｋに属する学習用の文字パターンＣ
ｋから得たｎ次元ベクトルＣｋ（ｄ１（ｋ），ｄ２
（ｋ），．．．，ｄｍ（ｋ），．．．，ｄｎ（ｋ））の
各要素を座標軸とした空間内におけるｎ次元ベクトルＣ
ｋ（ｄ１（ｋ），ｄ２（ｋ），．．．，ｄｍ
（ｋ），．．．，ｄｎ（ｋ））の分布の主軸Ｖｋ（ｐ１
（ｋ），ｐ２（ｋ），．．．，ｐｍ（ｋ），．．．，ｐ
ｎ（ｋ））を、主成分分析法を用い各ベクトルの要素ｄ
ｍ（ｋ）を変量として得られるｎ次元の相関行列の最大
固有値から求め、前記Ｖｋ（ｐ１（ｋ），ｐ２
（ｋ），．．．，ｐｍ（ｋ），．．．，ｐｎ（ｋ））と
前記ｎ次元ベクトルＣｋ（ｄ１（ｋ），ｄ２
（ｋ），．．．，ｄｍ（ｋ），．．．，ｄｎ（ｋ））と
から前記文字コードＣｋに対する評価値Ｅｋを決定する
ことにより前記評価値を計算する手段と、（３）第ｎ番目までの全分類手段により選択された第一
候補の文字コードが同一の場合には、第一候補の文字コ
ードに対する評価値を用いて候補の文字コードを最終認
識結果として選択するか、若しくは棄却するかを判定
し、又は第ｎ番目までの分類手段に対する分類結果の第
一候補の文字コードが異なる場合において、全ｎ個の第
一候補の文字コードに対するそれぞれの評価値からどの
候補を最終認識結果として出力するか、もしくはどちら
も棄却するかを判定する手段とから構成されることを特
徴とする。A character recognition apparatus according to a fourth aspect of the present invention has three or more kinds of all n kinds of character classifying means, and outputs a final recognition result from the first candidate character code selected by each classifying means. In the apparatus, (1) the first candidate character code Ch selected by the m-th classification means for m not exceeding n and its distance value d
An n-dimensional vector Ch (d1) in which m (h) and distance values with respect to the character code Ch in n-1 classification means other than the m-th one are arranged in the order of the classification means.
(H), d2 (h) ,. ．． , Dm (h) ,. ．． , D
(2) the n-dimensional vector Ch (d1 (h), d2) for generating n (h)) for the first candidate of all classification means;
(H) ,. ．． , Dm (h) ,. ．． , Dn (h)), the means for calculating an evaluation value for the character code of the first candidate selected by each of the n total classification means,
A learning character pattern C belonging to the character type k to be classified
n-dimensional vector Ck (d1 (k), d2 obtained from k
(K) ,. ．． , Dm (k) ,. ．． , Dn (k)) is the n-dimensional vector C in the space with the coordinate axes.
k (d1 (k), d2 (k), ..., dm
(K) ,. ．． , Dn (k)) main axis Vk (p1
(K), p2 (k) ,. ．． , Pm (k) ,. ．． , P
n (k)) is the element d of each vector using the principal component analysis method
Vk (p1 (k), p2 is obtained from the maximum eigenvalue of the n-dimensional correlation matrix obtained by using m (k) as a variable.
(K) ,. ．． , Pm (k) ,. ．． , Pn (k)) and the n-dimensional vector Ck (d1 (k), d2
(K) ,. ．． , Dm (k) ,. ．． , Dn (k)) and means for calculating the evaluation value Ek for the character code Ck, and (3) first candidate character selected by all classification means up to the n-th When the codes are the same, it is determined whether the candidate character code is selected as the final recognition result or rejected using the evaluation value for the first candidate character code, or the classification means up to the n-th When the character code of the first candidate in the classification result is different, it is determined which candidate is to be output as the final recognition result or both are rejected from the respective evaluation values for all n character codes of the first candidate. And means.

【００１０】[0010]

【作用】本発明の作用を図３の２種類の分類手段を併用
した場合を例として説明する。The operation of the present invention will be described by taking as an example the case where two kinds of classification means shown in FIG. 3 are used together.

【００１１】第一の分類手段が出力する分類結果の第一
候補の文字コードＣｉに対する評価値Ｅｉと第二の分類
手段が出力する分類結果の第一候の文字コードＣｉに対
する評価値Ｅｊを計算する手段において、分類対象とな
る字種ｋ毎に、第一の分類手段、第二の分類手段の距離
値をそれぞれ軸とする平面に、字種ｋに属する学習用の
文字パターンＣｋに対して得られた第一の分類手段、第
二の分類手段の距離値ｄ１（ｋ）、ｄ２（ｋ）をプロッ
トすると、図中の閉じた曲線Ｓの内部のように分布す
る。この分布の傾きを代表する主軸Ｖｋ（ｐ１（ｋ），
ｐ２（ｋ））は、同図に示されるようになり、主成分分
析法により求まる。The evaluation value Ei for the first candidate character code Ci of the classification result output by the first classifying means and the evaluation value Ej for the first symptom character code Ci of the classification result output by the second classifying means are calculated. In the means for learning, for each character type k to be classified, a learning character pattern Ck belonging to the character type k is placed on a plane whose axes are distance values of the first classifying means and the second classifying means. When the obtained distance values d1 (k) and d2 (k) of the first classifying means and the second classifying means are plotted, they are distributed like the inside of the closed curve S in the figure. The principal axis Vk (p1 (k), which represents the slope of this distribution,
p2 (k)) is as shown in the same figure, and is obtained by the principal component analysis method.

【００１２】この主軸Ｖｋに対し、図中Ｌで示すような
主軸Ｖｋに直交し、かつ前述の分布Ｓに接する直線を設
定する。候補の文字コードＣｋの２次元ベクトルＣｋ
（ｄ１（ｋ），ｄ２（ｋ））がこれより原点側にある場
合に、Ｃｋは字種ｋに属するとすることにより、図中
四角印で示すような一方の距離値が小さいものも字種ｋ
に属すると判定する。また、図中三角印で示す候補の文
字コードＣ’ｋに対する２次元ベクトルＣ’ｋ（ｄ１’
（ｋ），ｄ２’（ｋ））は斜線の領域外であるから、字
種ｋに属するとは判断せずに、これを棄却する。またＥ
ｋｍａｘは、この例においては前述の直線Ｌと先に求め
た主軸Ｖｋの交点に対する主軸Ｖｋ上の値であり、これ
を候補の文字コードＣｋが字種ｋに属すると考えてよい
か否かの判定閾値とする。A straight line which is orthogonal to the main axis Vk and which is in contact with the above-described distribution S is set with respect to the main axis Vk. Two-dimensional vector Ck of candidate character code Ck
When (d1 (k), d2 (k)) is closer to the origin than this, it is assumed that Ck belongs to the character type k. Seed k
Is determined to belong to. Also, the two-dimensional vector C'k (d1 'for the candidate character code C'k indicated by the triangle mark in the figure
Since (k) and d2 ′ (k) are outside the shaded area, they are not judged to belong to the character type k and are discarded. Also E
In this example, kmax is a value on the main axis Vk with respect to the intersection of the above-described straight line L and the main axis Vk previously obtained, and whether or not it can be considered that the candidate character code Ck belongs to the character type k. Use as a judgment threshold.

【００１３】第一の分類手段、第二の分類手段をそれぞ
れ単独で閾値を設定した場合を考え、その時の第一の分
類手段、第二の分類手段に対する閾値をそれぞれｔ１、
ｔ２とする。ｔ１、ｔ２は学習用のパタンに対する距離
値の最大値とすると、これらをどちらも越えない領域は
図において、縦軸、横軸と一点鎖線、二点鎖線で囲まれ
る長方形の領域となり、必要以上の棄却領域を設定する
ことになり認識率が低下する。この比較から明らかなよ
うに、本手法によれば、棄却されない領域が広くとれ、
一方の分類手法の第一候補の文字コードが非常に近い距
離値を出力しているものに対しては閾値以下であれば、
その字種と判定することが可能となる。Consider the case where the threshold values are set individually for the first classifying means and the second classifying means, and the threshold values for the first classifying means and the second classifying means at that time are t1 and
Let t2. If t1 and t2 are the maximum distance values for the learning pattern, the area that does not exceed either of them is a rectangular area surrounded by the vertical axis, horizontal axis and one-dot chain line, two-dot chain line in the figure, Since the rejection area of is set, the recognition rate decreases. As is clear from this comparison, according to this method, the area that is not rejected can be wide,
If the first candidate character code of one of the classification methods outputs a distance value that is very close, it is less than or equal to the threshold,
It is possible to determine that character type.

【００１４】第一の分類手段の第一候補の文字コードＣ
ｉと第二の分類手段の第一候補の文字コードＣｊに対し
て、どちらを出力するか、もしくは棄却するかを判定す
る手段においては、それぞれの分類手段の第一候補の文
字コードＣｉ、Ｃｊに対して計算された評価値Ｅｉ、Ｅ
ｊに対し、どちらとも閾値Ｅｉｍａｘ、Ｅｊｍａｘを越
えていれば、第一の分類手段と第二の分類手段の出力す
る第一の候補の文字コードＣｉ、Ｃｊを棄却する。それ
以外の場合においては、この評価値は小さい方が原点に
近いことから、評価値の小さい方が、その字種に対する
判定閾値を越えていない時、その評価値に対する第一候
補の文字コードを最終認識結果とし、そうでない場合は
第一の分類手段と第二の分類手段の出力する第一の候補
の文字コードＣｉ、Ｃｊを棄却とする。第一の分類手段
の第一候補の文字コードＣｉと第二の分類手段の第一候
補の文字コードＣｊが同一である場合においては、該当
の文字コードに対して評価値を計算し、それがその字種
に対する判定閾値を越えていない場合、その文字コード
を最終認識結果とし、逆に判定閾値を越えている場合そ
の文字コードを棄却する。これにより、併用する分類手
法が全て同じ候補を出力している場合においても棄却判
定ができ、誤読の低減が可能となる。The first candidate character code C of the first classification means
i and the first candidate character code Cj of the second classifying means, in the means for determining which should be output or rejected, the first candidate character code Ci, Cj of the respective classifying means. Evaluation values Ei, E calculated for
If both the threshold values Eimax and Ejmax are exceeded for j, the first candidate character codes Ci and Cj output by the first classifying unit and the second classifying unit are rejected. In other cases, the smaller evaluation value is closer to the origin, so when the smaller evaluation value does not exceed the judgment threshold for that character type, the character code of the first candidate for that evaluation value is set. As the final recognition result, if not, the first candidate character codes Ci and Cj output by the first classifying means and the second classifying means are rejected. When the character code Ci of the first candidate of the first classifying means and the character code Cj of the first candidate of the second classifying means are the same, an evaluation value is calculated for the corresponding character code and If the judgment threshold for the character type is not exceeded, the character code is used as the final recognition result, and conversely, if the judgment threshold is exceeded, the character code is rejected. As a result, rejection judgment can be performed and misreading can be reduced even when the classification methods used together output the same candidate.

【００１５】３種の分類手段を併用する際も同一の作用
を持つ。The same effect is obtained when the three kinds of classification means are used together.

【００１６】[0016]

【実施例】以下図面により本発明の実施例について説明
する。図１は本発明の一実施例を示すブロック図であ
る。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of the present invention.

【００１７】文字パターン入力手段１は帳票上の文字イ
メージを取り込む手段であり、通常のイメージスキャナ
ーでよい。取り込まれた文字イメージは、第一の分類手
段１４と第二の分類手段１５とで分類される。対判定手
段１６は、第一の分類手段１４と第二の分類手段１５と
が各々出力する第一候補の文字コードのどちらを最終認
識結果をとして採用するか、もしくはどちらも棄却する
かを判定し、判定結果を最終認識結果出力手段１３に送
る。これら文字パターン入力手段１、第一の分類手段１
４、第二の分類手段１５、対判定手段１６、最終認識結
果出力手段１３の制御は制御手段１７が行う。The character pattern input means 1 is a means for taking in a character image on a form, and may be an ordinary image scanner. The captured character images are classified by the first classifying means 14 and the second classifying means 15. The pair determination means 16 determines which of the first candidate character codes output by the first classification means 14 and the second classification means 15 is to be adopted as the final recognition result, or which is rejected. Then, the determination result is sent to the final recognition result output means 13. These character pattern input means 1 and first classification means 1
4, the control means 17 controls the second classification means 15, the pair determination means 16, and the final recognition result output means 13.

【００１８】分類手段１４、１５に係わる内部の要素と
しては、例えば図４、図５に示すような構成が考えられ
る。図４のように、第一の分類手段は、第一の文字認識
手段２、第一の文字認識手段２が参照する第一の分類辞
書メモリ３及び第一の文字認識手段２が出力する分類結
果を記憶する第一の分類結果格納メモリ４から構成され
る。又、図５に示すように、第二の分類手段は、第二の
文字認識手段５、第二の文字認識手段５が参照する第二
の分類辞書メモリ６及び第二の文字認識手段５が出力す
る分類結果を記憶する第二の分類結果格納メモリ７から
構成される。第一の分類辞書メモリ３と第二の分類辞書
メモリ６はそれぞれ第一の文字認識手段２と第二の文字
認識手段５が参照する特徴量を記憶する。なお、この特
徴量は、予め学習用のパターンから作成しておく。第一
の分類手段１４と第二の分類手段１５は、それぞれの分
類辞書メモリに記憶されている分類対象字種ｋに対する
標準と未知文字パターンとの距離計算を行ない、距離の
小さい順に、距離を該当する文字コードとともに、分類
結果格納メモリ４と分類結果格納メモリ７に記憶する。As the internal elements related to the classifying means 14 and 15, for example, the configurations shown in FIGS. 4 and 5 can be considered. As shown in FIG. 4, the first classification means includes the first character recognition means 2, the first classification dictionary memory 3 referred to by the first character recognition means 2, and the classification output by the first character recognition means 2. It is composed of a first classification result storage memory 4 for storing the results. Further, as shown in FIG. 5, the second classification means includes a second character recognition means 5, a second classification dictionary memory 6 referred to by the second character recognition means 5, and a second character recognition means 5. It is composed of a second classification result storage memory 7 for storing the output classification result. The first classification dictionary memory 3 and the second classification dictionary memory 6 store the feature amounts referred to by the first character recognition means 2 and the second character recognition means 5, respectively. Note that this feature amount is created in advance from a learning pattern. The first classifying means 14 and the second classifying means 15 calculate the distance between the standard and unknown character patterns for the classification target character type k stored in the respective classification dictionary memories, and calculate the distance in ascending order of distance. It is stored in the classification result storage memory 4 and the classification result storage memory 7 together with the corresponding character code.

【００１９】対判定手段１６に係わる内部の要素として
は、例えば図６に示すような構成が考えられる。As an internal element related to the pair determination means 16, for example, a configuration as shown in FIG. 6 can be considered.

【００２０】分類結果統合手段８は、第一の分類手段１
４と第二の分類手段１５の処理が終了すると、制御手段
１７により起動される。The classification result integrating means 8 is the first classification means 1
When the processing of 4 and the second classification means 15 is completed, it is started by the control means 17.

【００２１】起動された分類結果統合手段８は、ルート
ｒ１経由で、分類結果格納メモリ４から第一の分類手段
１４における第一候補の文字コードＣｉと距離値ｄ１
（ｉ）を抽出する。さらに第二の分類結果格納メモリ７
から文字コードＣｉに対応する距離値ｄ２（ｉ）を抽出
する。そして、Ｃｉ、ｄ１（ｉ）、ｄ２（ｉ）を２次元
のベクトル形式に変換し、統合結果格納レジスタ１０に
格納する。The activated classification result integrating means 8 passes the route r1 from the classification result storage memory 4 to the first candidate character code Ci and the distance value d1 in the first classification means 14.
Extract (i). Further, the second classification result storage memory 7
The distance value d2 (i) corresponding to the character code Ci is extracted from. Then, Ci, d1 (i), and d2 (i) are converted into a two-dimensional vector format and stored in the integrated result storage register 10.

【００２２】同様に、分類結果統合手段８は、ルート２
経由で、分類結果格納メモリ７から文字コードＣｊと距
離値ｄ２（ｊ）を抽出する。さらに第一の分類結果格納
メモリ４から文字コードＣｊに対応する距離値ｄ１
（ｊ）を抽出する。そしてＣｊ、ｄ２（ｊ）、ｄ１
（ｊ）を２次元のベクトル形式に変換し、統合結果格納
レジスタ１０に格納する。Similarly, the classification result integrating means 8 uses the route 2
The character code Cj and the distance value d2 (j) are extracted from the classification result storage memory 7 via. Further, the distance value d1 corresponding to the character code Cj from the first classification result storage memory 4
Extract (j). And Cj, d2 (j), d1
(J) is converted into a two-dimensional vector format and stored in the integrated result storage register 10.

【００２３】評価値計算手段９は、分類結果統合手段８
の処理が終了した後、制御手段１７により起動され、統
合結果格納レジスタ１０と評価値計算用辞書メモリ１１
を参照し、第一の分類手段１４、第二の分類手段１５が
出力した候補の文字コードに対する評価値Ｅｉ、Ｅｊを
例えば次式（１）、（２）により計算する。The evaluation value calculation means 9 is the classification result integration means 8
After completion of the above process, the control means 17 activates the integrated result storage register 10 and the evaluation value calculation dictionary memory 11.
With reference to, the evaluation values Ei and Ej for the candidate character codes output by the first classifying unit 14 and the second classifying unit 15 are calculated by the following equations (1) and (2), for example.

【００２４】Ｅｉ＝ｄ１（ｉ）ｐ１（ｉ）＋ｄ２（ｉ）ｐ２（ｉ）Ｅｊ＝ｄ１（ｊ）ｐ１（ｊ）＋ｄ２（ｊ）ｐ２（ｊ）第一の分類手段１４と第二の分類手段１５の分類結果の
第一候補が同一である場合は式（１）、式（２）いずれ
かを用いて評価値を計算する。Ei = d1 (i) p1 (i) + d2 (i) p2 (i) Ej = d1 (j) p1 (j) + d2 (j) p2 (j) The first classification means 14 and the second classification. When the first candidates of the classification result of the means 15 are the same, the evaluation value is calculated by using either the formula (1) or the formula (2).

【００２５】評価値計算用辞書メモリ１１には、前述の
分散共分散行列から求められた主軸Ｖｋ（ｐ１（ｋ），
ｐ２（Ｋ））と判定用の閾値Ｅｋｍａｘを字種ｋ毎に記
憶する。ここで各字種ｋに対する主軸Ｖｋの計算は、先
のｄ１（ｋ）、ｄ２（ｋ）を変量とした場合の２次元の
分散共分散行列を使うが、第一の分類手段１４、第二の
分類手段１５が出力する距離値のスケールが大きく異な
る場合には、相関行列を代わりに用いてもよい。これに
よって、評価値の計算時に距離値の分散の大きな方の分
散手段の評価値に重みがかかる作用をなくすことがで
き、第一の分類手段１４、第二の分類手段１５の出力す
る距離値を閾値判定手段１２において対等に評価でき
る。In the evaluation value calculation dictionary memory 11, the principal axis Vk (p1 (k), obtained from the above-mentioned variance-covariance matrix,
p2 (K)) and the threshold value Ekmax for determination are stored for each character type k. Here, the calculation of the main axis Vk for each character type k uses a two-dimensional variance-covariance matrix when the variables d1 (k) and d2 (k) are used as variables. If the scales of the distance values output by the classifying unit 15 are significantly different, the correlation matrix may be used instead. As a result, it is possible to eliminate the effect of weighting the evaluation value of the dispersion means having the larger dispersion of the distance values when calculating the evaluation value, and the distance values output by the first classification means 14 and the second classification means 15 can be eliminated. Can be equally evaluated by the threshold determination means 12.

【００２６】閾値判定手段１２は評価値計算手段９の処
理が終了すると、制御手段１７によって起動され、評価
値計算手段９によって計算された第一の分類手段１４、
第二の分類手段１５が出力した式（１）、式（２）より
得られる、それぞれの候補の文字コードに対する評価値
のＥｉ、Ｅｊの比較を評価値計算用辞書メモリ１１に記
憶されている閾値との比較により行い、どちらの分類結
果を採用するか、もしくはどちらの分類結果も棄却する
かを判定する。そのアルゴリズムの一例を図２を用いて
説明する。第一の分類手段１４、第二の分類手段１５の
出力する候補の文字コードが同一である場合は、判断ｄ
１で式（１）または式（２）ひとつを用いて評価値を計
算する。同一でない場合は、式（１）、式（２）両方を
用いて各第一候補に対して評価値を計算する。これらの
計算は評価値計算手段９が行う。閾値判定手段１２は、
第一の分類手段１４と第二の分類手段１５とが出力する
第一候補の文字コードが同一でない場合には、判断ｄ２
以下の処理を行う。評価値計算手段９で得た、第一の分
類手段１４、第二の分類手段１５の出力した第一候補の
文字コードに対する評価値が、どちらも評価値計算用辞
書メモリ１１の閾値を越えている場合には、どちらの候
補の文字コードも棄却する。そうでない場合には、閾値
を越えていない方の候補の文字コードを選択し、判断ｄ
３によりその評価値がもう一方の評価値以下であれば、
該当する候補の文字コードを最終認識結果として採用し
その文字コードを最終認識結果出力手段１３に送り、そ
うでなければ該当する候補の文字コードを棄却し、棄却
に対応する予め設定してある棄却コードを最終認識結果
出力手段１３に送る。また、第一の分類手段１４、第二
の分類手段１５の出力する候補の文字コードが同一であ
る場合には、判断ｄ４以下の処理を行う。評価値計算手
段９で得た評価値が、評価値計算用辞書メモリ１１の閾
値を越えている場合には、その候補の文字コードを棄却
し、棄却に対応する予め設定してある棄却コードを最終
認識結果出力手段１３に送る。閾値を越えていない場合
には、該当する候補の文字コードを最終認識結果として
採用しその文字コードを最終認識結果出力手段１３に送
る。When the processing of the evaluation value calculation means 9 is completed, the threshold value judgment means 12 is started by the control means 17 and the first classification means 14 calculated by the evaluation value calculation means 9,
The comparison of the evaluation values Ei and Ej with respect to the respective candidate character codes obtained from the expressions (1) and (2) output from the second classification means 15 is stored in the evaluation value calculation dictionary memory 11. By comparing with a threshold value, it is determined which classification result is adopted or which classification result is rejected. An example of the algorithm will be described with reference to FIG. When the character codes of the candidates output by the first classifying unit 14 and the second classifying unit 15 are the same, the determination d
In 1, the evaluation value is calculated using one of the formula (1) or the formula (2). If they are not the same, the evaluation value is calculated for each first candidate using both Expression (1) and Expression (2). The evaluation value calculation means 9 performs these calculations. The threshold determination means 12 is
When the character codes of the first candidates output by the first classifying means 14 and the second classifying means 15 are not the same, the judgment d2
The following processing is performed. The evaluation values for the first candidate character codes output by the first classifying unit 14 and the second classifying unit 15 obtained by the evaluation value calculating unit 9 both exceed the threshold value of the evaluation value calculating dictionary memory 11. If so, the character code of either candidate is rejected. If not, the candidate character code that does not exceed the threshold is selected, and the determination d
According to 3, if the evaluation value is less than or equal to the other evaluation value,
The character code of the corresponding candidate is adopted as the final recognition result, and the character code is sent to the final recognition result output means 13. Otherwise, the character code of the corresponding candidate is rejected, and a preset rejection corresponding to the rejection is rejected. The code is sent to the final recognition result output means 13. If the candidate character codes output by the first classifying unit 14 and the second classifying unit 15 are the same, the process of determination d4 and below is performed. When the evaluation value obtained by the evaluation value calculation means 9 exceeds the threshold value of the evaluation value calculation dictionary memory 11, the candidate character code is rejected, and a preset rejection code corresponding to the rejection is set. It is sent to the final recognition result output means 13. If the threshold is not exceeded, the character code of the corresponding candidate is adopted as the final recognition result, and the character code is sent to the final recognition result output means 13.

【００２７】最終認識結果出力手段１３は、制御手段１
７の命令に応じて、対判定手段１６の出力する最終認識
結果を文字コードとして出力すればよい。The final recognition result output means 13 is the control means 1
The final recognition result output by the pair determination means 16 may be output as a character code in accordance with the command of 7.

【００２８】以上、本発明に係わる２種類の分類手段を
併用した場合の実施例を説明してきたが、第一の分類手
段１４、第二の分類手段１５で計算する距離の尺度は文
字認識において通常使用されるものに限らず、いずれの
距離尺度、または類似性の尺度でも対応が可能であり、
またそれらの混用も可能である。また、３種類以上の分
類手段を併用した場合においても容易に本実施例に示す
装置が構成可能である。これらの各機能はハードウェ
アまたはソフトウェアいずれの手段によって実現しても
よい。The embodiment in which two types of classifying means according to the present invention are used together has been described above. The distance measure calculated by the first classifying means 14 and the second classifying means 15 is used in character recognition. It is possible to correspond to any distance measure or similarity measure, not limited to those commonly used,
It is also possible to mix them. Further, even when three or more kinds of classification means are used together, the device shown in this embodiment can be easily constructed. Each of these functions may be realized by means of either hardware or software.

【００２９】[0029]

【発明の効果】以上説明したように、本発明によれば、
２種または３種類以上の文字の分類手段を併用すること
で誤読される文字数を減らすと同時に、各分類手段が出
力する候補の文字コードに対して計算された評価値の大
小比較と、１回の閾値判定処理で棄却判定を行なうこと
により、２種類の文字分類手段に対して、個別に棄却判
定用の閾値を設定する場合に比べ、棄却される文字数を
減らすことができるので、認識精度の向上が可能とな
る。この時、従来の技術で述べた「文字認識方式および
装置」（特願平３−５２６）に比べ、誤読数の一層の低
減が可能である。As described above, according to the present invention,
By using two or three or more types of character classification means together, the number of misread characters is reduced, and at the same time, the evaluation values calculated for the candidate character codes output by each classification means are compared with each other. By performing the rejection determination in the threshold determination process of, the number of rejected characters can be reduced compared to the case where the thresholds for rejection determination are individually set for the two types of character classification means. It is possible to improve. At this time, the number of erroneous readings can be further reduced as compared with the “character recognition method and apparatus” (Japanese Patent Application No. 3-526) described in the related art.

[Brief description of drawings]

【図１】本発明に係わる文字認識装置の一実施例を説明
するためのブロック図である。FIG. 1 is a block diagram for explaining an embodiment of a character recognition device according to the present invention.

【図２】図１の処理例を示すフローチャート図である。FIG. 2 is a flowchart showing the processing example of FIG.

【図３】本手法の作用を説明するための図である。FIG. 3 is a diagram for explaining the operation of this method.

【図４】第一文字分類手段１４の一構成例であるFIG. 4 is a configuration example of a first character classification means 14.

【図５】第二文字分類手段１５の一構成例である。5 is an example of a configuration of a second character classification means 15. FIG.

【図６】対判定手段１６の一構成例である。FIG. 6 is a configuration example of pair determination means 16.

【図７】従来方式のフローチャート図である。FIG. 7 is a flowchart of a conventional method.

[Explanation of symbols]

１文字パターン入力手段２、５文字認識手段３、６分類辞書メモリ４、７分類結果格納メモリ８分類結果統合手段９評価値計算手段１０統合結果格納レジスタ１１評価値計算用辞書メモリ１２閾値判定手段１３最終認識結果出力手段１４、１５文字分類手段１６対判定手段１７制御手段 1-character pattern input means 2, 5 character recognition means 3, 6 classification dictionary memory 4, 7 Classification result storage memory 8 Classification result integration means 9 Evaluation value calculation means 10 Integrated result storage register 11 Evaluation value dictionary memory 12 Threshold judgment means 13 Final recognition result output means 14, 15 character classification means 16 Pair judging means 17 Control means

Claims

[Claims]

1. A character recognition device which has two types of character classification means and outputs a final recognition result from the character code of the first candidate selected by each of the classification means, (1) The first classification means The selected first candidate character code Ci and its distance value d1 (i), and the distance value d2 with respect to the character code Ci in the second classification means.
From (i), the two-dimensional vector Ci (d1 (i), d2
(I)) generating means, (2) the first candidate character code Cj selected by the second classifying means and its distance value d2 (j), and the character code Cj in the first classifying means. Distance value d1 to
From (j), the two-dimensional vector Cj (d1 (j), d2
(J)) generating means, and (3) the two-dimensional vector Ci (d1 (i), d2
(I)) and Cj (d1 (j), d2 (j)), the evaluation value Ei for the character code Ci and the character code C
In the means for calculating the evaluation value Ej for j, the learning character pattern C belonging to the character type K to be classified
A two-dimensional vector Ck (d1 on a plane having coordinate axes of d1 (k) and d2 (k), which are distance values with respect to k.
The main axis of the distribution of (k) and d2 (k) is Vk (p1
(K), p2 (k))) is obtained from the maximum eigenvalue of the two-dimensional variance-covariance matrix obtained by using the principal component analysis method with the elements d1 (k) and d2 (k) of each vector as variables. V
k (p1 (k), p2 (k)) and the two-dimensional vector C
The character code C from k (d1 (k), d2 (k))
means for calculating the evaluation values Ei and Ej by determining the evaluation value Ek for k; and (4) the character code using the evaluation value Ei when the character code Ci and the character code Cj are the same. Ci
Is output as the final recognition result or is rejected, or when the character code Ci and the character code Cj are different, which character code is finally recognized using the evaluation value Ej. A character recognition device comprising: a means for determining whether to output as a result or to reject either of them.

2. A character recognition device which has two types of character classification means and outputs a final recognition result from the character code of the first candidate selected by each of the classification means, (1) By the first classification means The selected first candidate character code Ci and its distance value d1 (i), and the distance value d2 with respect to the character code Ci in the second classification means.
From (i), the two-dimensional vector Ci (d1 (i), d2
(I)) generating means, (2) the first candidate character code Cj selected by the second classifying means and its distance value d2 (j), and the character code Cj in the first classifying means. Distance value d1 to
From (j), the two-dimensional vector Cj (d1 (j), d2
(J)) generating means, and (3) the two-dimensional vector Ci (d1 (i), d2
(I)) and Cj (d1 (j), d2 (j)), the evaluation value Ei for the character code Ci and the character code C
In the means for calculating the evaluation value Ej for j, the learning character pattern C belonging to the character type K to be classified
A two-dimensional vector Ck (d1 on a plane having coordinate axes of d1 (k) and d2 (k), which are distance values with respect to k.
(K), d2 (k)) distribution main axis Vk (p1 (k),
p2 (k))) is calculated from the maximum eigenvalue of a two-dimensional correlation matrix obtained by using the principal component analysis method as the elements d1 (k) and d2 (k) of each vector, and Vk (p1 (k),
p2 (k)) and the two-dimensional vector Ck (d1 (k),
d2 (k)) and the evaluation value Ek for the character code Ck
Means for calculating the evaluation values Ei and Ej by determining (4) the character code Ci using the evaluation value Ei when the character code Ci and the character code Cj are the same.
Is output as the final recognition result or is rejected, or when the character code Ci and the character code Cj are different, which character code is finally recognized using the evaluation value Ej. A character recognition device comprising: a means for determining whether to output as a result or to reject either of them.

3. A character recognition device which has three or more kinds of all n kinds of character classification means and outputs a final recognition result from the character code of the first candidate selected by each classification means, wherein (1) n The character code Ch of the first candidate selected by the m-th classifying means for m that does not exceed and the distance value d thereof
An n-dimensional vector Ch (d1) in which m (h) and distance values with respect to the character code Ch in n-1 classification means other than the m-th one are arranged in the order of the classification means.
(H), d2 (h) ,. ．． , Dm (h) ,. ．． , D
(2) the n-dimensional vector Ch (d1 (h), d2) for generating n (h)) for the first candidate of all classification means;
(H) ,. ．． , Dm (h) ,. ．． , Dn (h)), the means for calculating an evaluation value for the character code of the first candidate selected by each of the n total classification means,
A learning character pattern C belonging to the character type k to be classified
n-dimensional vector Ck (d1 (k), d2 obtained from h
(K) ,. ．． , Dm (k) ,. ．． , Dn (k)) is the n-dimensional vector C in the space with the coordinate axes.
k (d1 (k), d2 (k), ..., dm
(K) ,. ．． , Dn (k)) main axis Vk (p1
(K), p2 (k) ,. ．． , Pm (k) ,. ．． , P
n (k)) is the element dm of each vector using the principal component analysis method
Vk (p1 (k), p2 is calculated from the maximum eigenvalue of the n-dimensional variance-covariance matrix obtained by using (k) as a variable.
(K) ,. ．． , Pm (k) ,. ．． , Pn (k)) and the n-dimensional vector Ck (d1 (k), d2
(K) ,. ．． , Dm (k) ,. ．． , Dn (k)) and means for calculating the evaluation value Ek for the character code Ck, and (3) first candidate character selected by all classification means up to the n-th If the codes are the same, it is determined whether to select the candidate character code as the final recognition result using the evaluation value for the first candidate character code or to reject it,
Or, when the character codes of the first candidates of the classification results for the n-th classification means are different, which candidate is output as the final recognition result from the respective evaluation values for the character codes of all the n first candidates, Alternatively, a character recognition device comprising a means for determining whether to reject either.

4. A character recognition device which has three or more kinds of all n kinds of character classification means and which outputs a final recognition result from the character code of the first candidate selected by each classification means, wherein (1) n The character code Ch of the first candidate selected by the m-th classifying means for m that does not exceed and the distance value d thereof
An n-dimensional vector Ch (d1) in which m (h) and distance values with respect to the character code Ch in the n−1 classification means other than the m-th one are arranged in the order of the classification means.
(H), d2 (h) ,. ．． , Dm (h) ,. ．． , D
(2) the n-dimensional vector Ch (d1 (h), d2) for generating n (h)) for the first candidate of all classification means;
(H) ,. ．． , Dm (h) ,. ．． , Dn (h)), the means for calculating an evaluation value for the character code of the first candidate selected by each of the n total classification means,
A learning character pattern C belonging to the character type k to be classified
n-dimensional vector Ck (d1 (k), d2 obtained from k
(K) ,. ．． , Dm (k) ,. ．． , Dn (k)) is the n-dimensional vector C in the space with the coordinate axes.
k (d1 (k), d2 (k), ..., dm
(K) ,. ．． , Dn (k)) main axis Vk (p1
(K), p2 (k) ,. ．． , Pm (k) ,. ．． , P
n (k)) is the element d of each vector using the principal component analysis method
Vk (p1 (k), p2 is obtained from the maximum eigenvalue of the n-dimensional correlation matrix obtained by using m (k) as a variable.
(K) ,. ．． , Pm (k) ,. ．． , Pn (k)) and the n-dimensional vector Ck (d1 (k), d2
(K) ,. ．． , Dm (k) ,. ．． , Dn (k)) to calculate the evaluation value Ek for the character code Ck, and (3) a first candidate character selected by all classification means up to the n-th When the codes are the same, it is determined whether the candidate character code is selected as the final recognition result or rejected using the evaluation value for the first candidate character code, or the classification means up to the n-th When the character code of the first candidate in the classification result is different, it is determined which candidate is to be output as the final recognition result or both are rejected from the respective evaluation values for all n character codes of the first candidate. And a character recognition device.