JPS60138689A

JPS60138689A - Character recognizing method

Info

Publication number: JPS60138689A
Application number: JP58251904A
Authority: JP
Inventors: Masahiro Nakamura; 昌弘中村
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1983-12-26
Filing date: 1983-12-26
Publication date: 1985-07-23

Abstract

PURPOSE:To avoid the misdecision for recognition of characters by retrieving successively plural dictionaries with the feature variable constant of an input character to detect a character most approximate to the feature variable constant and specifying a font from the distance between the variable constant of said detected character and the feature variable constant. CONSTITUTION:The character number for discrimination of font is limited to Nmax, and a dictionary DCalpha is first retrieved when a feature variable constant Pn is supplied. Then a character most approximate to the Pn is detected in the dictionary DCalpha. Then the distance between a variable constant of the retrieved character and the Pn is obtained and added to an adder ADalpha. Then a dictionary DCbeta is retieved to obtain a character having the variable constant most approximate to the Pn. The distance between the variable constant of the obtained character and the Pn is obtained and added to an adder ABbeta. In the same way, the similar distance is obtained also for an adder ADgamma. The contents of these adders ADalpha, ADbeta and ADgamma are divided by the contents of counters CNalpha, CNbeta and CNgamma to obtain values Dalpha, Dbeta and Dgamma respectively. Then a font corresponding to the minimum one of said three values is specified.

Description

【発明の詳細な説明】［技術分野］本発明は、字体の異なる複数種の文字に対して自動的に
文字認識を行なうマルチフォント文字認識に関し、特に
誤認識の防止に関する。DETAILED DESCRIPTION OF THE INVENTION [Technical Field] The present invention relates to multi-font character recognition that automatically performs character recognition for a plurality of types of characters with different fonts, and particularly to prevention of misrecognition.

［従来技術］英文字では、例えばエリート文字、バイ力文字。[Prior art] For English letters, for example, elite letters and biryoku letters.

イタリック文字、ＯＣＲ用文字等様々なフォントの活字
が存在する。従って、これらの様々なフォントの文字を
認識しなければならないマルチフォント文字認識におい
ては、一般シ；各々のフォントに対応する辞書をそれぞ
れ備えている。そして、個々の認識文字に対して、全て
の文字フォントの辞書を検索し、それぞれ最も特徴パラ
メータの近いものを候補として選択し、それらの中で最
も近いものを最終候補にしている。There are various typefaces such as italic characters and OCR characters. Therefore, in multi-font character recognition in which characters of these various fonts must be recognized, dictionaries corresponding to each font are provided. Then, for each recognized character, dictionaries of all character fonts are searched, and the one with the closest characteristic parameters is selected as a candidate, and the closest one among them is made the final candidate.

しかし、この種のマルチフォント文字認識では、異なる
フォント同志の間で、互いに似かよった文字が存在し、
一部の文字については互いに形状が相似で大きさのみ異
なる場合、あるいは完全に同一の場合がある。このよう
な場合、認識する文字と辞書のデータとの距離が、フォ
ントの互いに異なる複数の辞書について同一になり、フ
ォントの選択を誤まる誤認識が生じ易い。However, in this type of multi-font character recognition, there are characters that are similar to each other between different fonts.
Some characters may have similar shapes and differ only in size, or may be completely the same. In such a case, the distance between the recognized character and the data in the dictionary will be the same for a plurality of dictionaries with different fonts, and erroneous recognition that may lead to incorrect font selection is likely to occur.

従来技術の１つに、読取不能コードが出力された回数を
フォント毎にめてその結果で認識結果を特定する方法が
あるが、互いにパラメータの似たフォントの間では、こ
の値を比較してもフォントを判別できない。One of the conventional techniques is to count the number of times an unreadable code has been output for each font and use that result to identify the recognition result.However, between fonts with similar parameters, this value is compared. can't identify the font either.

フォントの誤認識が生ずると、例えば検索時間を短縮す
るために、最初の文字認識の結果によって、以後使用す
る辞書を特定するような方式の文字認識においては、誤
認識が多発する。When erroneous font recognition occurs, for example, in order to shorten search time, erroneous recognition occurs frequently in a character recognition system in which a dictionary to be used thereafter is specified based on the result of initial character recognition.

［目的］本発明は、確実にフォントを識別して誤認識を防止する
ことを目的とする。[Purpose] The present invention aims to reliably identify fonts and prevent misrecognition.

［構成］認識を行なう一連の文字列が例えば１種類のフォントの
文字のみで構成されていれば、全体として見れば、認識
をする文字の特徴パラメータとの距離が最も小さいデー
タの存在する辞書は明らかに１つである。つまり、フォ
ントを識別するのに十分な複数の文字パターンについて
、各々の辞書を検索して最も近いデータを調べ、その距
離の平均的な値を辞書毎にめれば、その値の最も小さい
辞書のフォントが、一連の文字列のフォントであると判
別しうる。[Structure] If a series of character strings to be recognized is composed of only characters of one type of font, for example, the dictionary containing the data with the smallest distance from the feature parameters of the characters to be recognized is the one that Obviously one. In other words, for multiple character patterns sufficient to identify a font, if you search each dictionary to find the closest data, and find the average distance value for each dictionary, you can find the dictionary with the smallest value. can be determined to be a font of a series of character strings.

平均的な距離の値は１例えば、Ｎ個の文字についてそれ
ぞれ距離Ｄ　ｎ　（ｎ　＝　１−Ｎ）をめてその総和を
計算し、その結果をＮで割れば得られる。The average distance value is 1. For example, it can be obtained by calculating the distance D n (n = 1-N) for each of N characters, calculating the sum, and dividing the result by N.

しかし、この場合、文字を誤認識すればその結果は無意
味であり、フォントの判定の基準となる距離の総和に対
して大きな誤差になる。したがって、認識結果が正解で
あったフォントの距離に対して大きな重み付けを行なえ
ば、この種の誤差を小さくしうる。However, in this case, if a character is erroneously recognized, the result is meaningless and results in a large error with respect to the sum of distances, which is the standard for font determination. Therefore, if a large weight is given to the distance of a font whose recognition result is correct, this type of error can be reduced.

一般に、複数フォントｐ辞書に対して検索を行なう場合
、複数のフォントで同一の結果が得られることが多く、
その結果が正解であることが多い。Generally, when searching multiple font p-dictionaries, the same result is often obtained for multiple fonts.
The results are often correct.

そこで、本発明の１つの好ましい態様においては。Therefore, in one preferred embodiment of the present invention.

各々のフォントの辞書を検索して得られた結果すなわち
第１候補文字から、多数決により第２候補文字を決定し
、第２候補文字と同一の第１候補文字が得られたフォン
トについて距離データの重み付け（すなわちフォント判
定に対する寄与率）を大きくする。From the results obtained by searching the dictionary for each font, that is, the first candidate character, a second candidate character is determined by majority vote, and distance data is calculated for fonts for which the first candidate character that is the same as the second candidate character is obtained. Increase the weighting (that is, the contribution rate to font determination).

以下、図面を参照して本発明の詳細な説明する。Hereinafter, the present invention will be described in detail with reference to the drawings.

第１図に１本発明を一態様で実施する文字認識装置の概
略構成を示す。第１図を参照すると、図示しない原稿上
の文字情報はスキャナＳＣＡによフて光学的に読み取ら
れ、微小画素毎に光電変換されて２値データになる。こ
のデータは、各文字パターン毎に前処理部ＰＰＲで所定
の処理を受けた後、特徴抽出部ＰＰＵに印加される。特
徴抽出部ＰＰＵでは、その文字パターンの様々な特徴パ
ラメータを数値データとして抽出する。FIG. 1 shows a schematic configuration of a character recognition device implementing one aspect of the present invention. Referring to FIG. 1, character information on a document (not shown) is optically read by a scanner SCA, and photoelectrically converted into binary data for each minute pixel. This data is applied to the feature extraction unit PPU after being subjected to a predetermined process in the preprocessing unit PPR for each character pattern. The feature extraction unit PPU extracts various feature parameters of the character pattern as numerical data.

検索制御部ＣＴＲは、各部を制御し、特徴抽出部ＰＰＵ
で抽出される各文字毎の特徴パラメータに最も近い文字
を、辞書から検索して、それを認識結果にする。この例
では、３種類の文字フォントα、βおよびＴを扱うため
、それぞれのフォントの辞書ＤＣα、ＤＣβおよびＤＣ
γが用意されている。検索制御部ＣＴＲには、特徴抽出
部ＰＰＵが出力する特徴パラメータと各々の辞書から得
られる特徴パラメータとの距離を演算する距離演算部ｃ
：ＰＤ、バッファメモリＢＦＦ、３つの加算器ＡＤα、
ＡＤβおよびＡＤγ、３つのカウンタＣＮα、ＣＮβお
よびＣＮγが接続されている。The search control unit CTR controls each unit, and the feature extraction unit PPU
The dictionary is searched for the character closest to the characteristic parameter for each character extracted in , and this is used as the recognition result. In this example, three types of character fonts α, β, and T are handled, so the respective font dictionaries DCα, DCβ, and DC
γ is available. The search control unit CTR includes a distance calculation unit c that calculates the distance between the feature parameters output by the feature extraction unit PPU and the feature parameters obtained from each dictionary.
: PD, buffer memory BFF, three adders ADα,
ADβ and ADγ and three counters CNα, CNβ and CNγ are connected.

第２図に、フォント判別手順すなわち第１図の検索制御
部ＣＴＲの動作の一部を示す。第１図および第２図を参
照して説明する。まず加算器ＡＤα、ＡＤβおよびＡＤ
γ、ならびにカウンタＣＮα、ＣＮβおよびＣＮγの内
容をクリアする。文字数をカウントするカウンタＮを初
期値にセットし、特徴パラメータが特徴抽出部ＰＰＵか
ら入力される毎にこれを＋１する。FIG. 2 shows a part of the font discrimination procedure, that is, the operation of the search control unit CTR shown in FIG. This will be explained with reference to FIGS. 1 and 2. First, adders ADα, ADβ and AD
γ, and the contents of counters CNα, CNβ, and CNγ. A counter N for counting the number of characters is set to an initial value, and is incremented by 1 each time a feature parameter is input from the feature extraction unit PPU.

ここではフォント識別に用いる文字の数をＮｍａｘ（例
えば１０）に限定する。特徴パラメータＰｎを入力した
ら、まず辞書ＤＣαを検索し、その中で最もＰｎに近い
文字（αの第１候補文字）を見つける。検索した文字の
パラメータとＰｎとの距離をめ、それに加算器ＡＤαに
加算する。Here, the number of characters used for font identification is limited to Nmax (for example, 10). When the feature parameter Pn is input, the dictionary DCα is first searched to find the character closest to Pn (the first candidate character for α). The distance between the parameter of the searched character and Pn is determined and added to the adder ADα.

次に、辞書ＤＣβを検索してその中でパラメータが最も
Ｐｎに近い文字（βの第１候補文字）を見つける。検索
した文字のパラメータとＰｎとの距離をめ、その結果を
加算器ＡＤβに加算する。Next, the dictionary DCβ is searched to find a character whose parameter is closest to Pn (first candidate character for β). The distance between the parameter of the searched character and Pn is determined, and the result is added to the adder ADβ.

同様に、辞書ＤＣγに対してもパラメータがＰｎに最も
近い文字（γの第１候補文字）を検索して、そのパラメ
ータとＰｎとの距離をめ、その結果を加算器ＡＤγに加
算する。Similarly, the dictionary DCγ is searched for the character whose parameter is closest to Pn (the first candidate character for γ), the distance between the parameter and Pn is calculated, and the result is added to the adder ADγ.

このようにして得られた３つの第１候補文字の中から、
多数決で第２候補文字を決定する。すなわち、例えばα
の第１候補文字とγの第１候補文字が同一で、それらと
βの第１候補文字が異なる場合、αおよびγの第１候補
文字を第２候補文字とする。もし、３つの第１候補文字
が全て異なる場合には、多数決で決めることができない
ので、３つの第１候補文字の中で最も特徴パラメータＰ
ｎに近いものを第２候補にする。From the three first candidate characters obtained in this way,
The second candidate character is determined by majority vote. That is, for example α
If the first candidate characters of α and γ are the same, and the first candidate characters of β are different from each other, the first candidate characters of α and γ are set as the second candidate characters. If the three first candidate characters are all different, it cannot be decided by majority vote, so if the characteristic parameter P is the highest among the three first candidate characters,
The one closest to n is selected as the second candidate.

このようにして得られる第２候補文字と各々の第１候補
文字とを比較する。第２候補文字とαの第１候補文字と
が等しい場合にはカウンタＣＮαを＋１するが異なる場
合にはそのままにする。同様に、第２候補文字とβの第
１候補文字とを比較して等しければカウンタＣＮβを＋
１し、第２候補文字とγの第１候補文字とが等しければ
カウンタＣＮγを＋１する。The second candidate characters obtained in this way are compared with each first candidate character. If the second candidate character and the first candidate character of α are equal, the counter CNα is incremented by 1, but if they are different, the counter CNα is left unchanged. Similarly, the second candidate character and the first candidate character of β are compared, and if they are equal, the counter CNβ is incremented.
1, and if the second candidate character and the first candidate character of γ are equal, the counter CNγ is incremented by 1.

また、特徴抽出部ＰＰＵから出力された特徴パラメータ
の内容は、後で使用するのでバッファメモリＢＦＦに一
時格納する。Further, the contents of the feature parameters output from the feature extraction unit PPU are temporarily stored in the buffer memory BFF for later use.

以上の動作をＮ　＞　Ｎ　ｍａｘになるまで続けると、
加算器ＡＤα、ＡＤβおよびＡＤγには、それぞれフォ
ントα、βおよびγの辞書から得られた第１候補文字と
Ｎｆｆｌａｘ個の認識文字のそれぞれとの特徴パラメー
タの距離の総和が格納され、カウンタＣＮα、ＣＮβお
よびＣＮγには、それぞれフォントα、βおよびγの正
解文字数が格納される。If we continue the above operation until N > N max,
Adders ADα, ADβ, and ADγ store the sum of feature parameter distances between the first candidate character obtained from the dictionaries of fonts α, β, and γ, respectively, and each of Nfflax recognized characters, and counters CNα, CNβ and CNγ store the number of correct characters of fonts α, β, and γ, respectively.

そこで、　Ｎｍａｘ個の読取が終了した時点で、加算器
ＡＤαの内容をカウンタＣＮαの内容で割った値Ｄα、
加算器ＡＤβの内容をカウンタＣＮβの内容で割った値
Ｄβおよび加算器ＡＤγの内容をカウンタＣＮγの内容
で割った値Ｄγをめる。Therefore, when Nmax readings are completed, the value Dα is obtained by dividing the content of the adder ADα by the content of the counter CNα,
A value Dβ obtained by dividing the contents of the adder ADβ by the contents of the counter CNβ and a value Dγ obtained by dividing the contents of the adder ADγ by the contents of the counter CNγ are calculated.

但し、カウンタの内密が０の場合にはそれを無視する。However, if the secret of the counter is 0, it is ignored.

各々の値Ｄα、ＤβおよびＤγは、それぞれフォントα
、βおよびγの辞書の内容と、Ｎｍａｘ個の認識文字の
パラメータとの平均的な距離の値である。例えば認識す
る文字のフォントがαである場合に、３つの辞書で同一
の文字（正解文字）が認識されたとすると、それらの中
で特徴パラメータＰｎとの距離が最も小さいデータが得
られる辞書はＤＣαである。従って、Ｎｍａｘ個につい
て距離を平均した値が最も小さい辞書のフォントが認識
する文字のフォントに一致する。Each value Dα, Dβ and Dγ is the font α
, β and γ, and the parameters of Nmax recognized characters. For example, if the font of the character to be recognized is α, and the same character (correct character) is recognized in three dictionaries, the dictionary that provides data with the smallest distance from the feature parameter Pn is DCα. It is. Therefore, the dictionary font with the smallest average value of Nmax distances matches the font of the recognized character.

もし辞書ＤＣαおよびＤＣβで正解文字が得られ辞書、
ＤＣγが誤認識文字を候補にすると、多数決の結果によ
りフォントγに対応するカウンタＣＮγの内容が変化し
ないので、結果的にパラメータＤγをめる場合の除数が
小さくなってＤγを大きくするので、辞書ＤＣγから得
られる距離（誤差量）のフォント判定の際に影響する割
合が小さくなる。If the correct character is obtained in the dictionaries DCα and DCβ, the dictionary,
When DCγ selects a misrecognized character as a candidate, the content of the counter CNγ corresponding to the font γ does not change due to the result of the majority vote.As a result, the divisor for adding the parameter Dγ becomes smaller and Dγ becomes larger. The influence of the distance (error amount) obtained from DCγ on font determination is reduced.

得られたパラメータＤα、ＤβおよびＤγを互いに比較
して最も小さいものに対応するフォントの辞書を選択す
る。フォント判定が終了したら、既に認識したＮ　ｗａ
ｘ個の文字について、選択した辞書のみを用いて再度文
字認識を行なう。この場合に、再度スキャナを最初の位
置に戻して文字データ読取、前処理、特徴抽出等を行な
うのでは時間がかかるので、予めバッファメモリＢＦＦ
に格納しておいたＮｍａｘ個の文字そわぞれの特徴パラ
メータを１つずつ読んで文字ｉｇｍを行なう・Ｎｍａｘ
個の文字の再認識が済んだら１選択した辞書を用いて、
通常の方法で文字認識を続行する。The obtained parameters Dα, Dβ, and Dγ are compared with each other, and the dictionary of the font corresponding to the smallest one is selected. Once the font determination is complete, the already recognized N wa
Character recognition is performed again for x characters using only the selected dictionary. In this case, it takes time to return the scanner to the initial position and perform character data reading, preprocessing, feature extraction, etc.
Perform character igm by reading the feature parameters of each of the Nmax characters stored in Nmax one by one.
After re-recognizing the characters, use the selected dictionary,
Continue character recognition in the normal way.

第３図に、第２図の変形実施例を示す。第３図を参照す
ると、この実施例においては、フォント識別のためのＮ
ｍａｘ個の文字認識の際に、各々の文字について、特徴
パラメータＰｎ、第２候補と一致しない第１候補が得ら
れたフォント、および第２候補文字をバッファメモリに
一時記憶しておく。フォントが判明すると、パップアメ
モリＢＦＦの内容を読んで、各認識文字毎に次のような
処理を行なう。FIG. 3 shows a modified embodiment of FIG. 2. Referring to FIG. 3, in this embodiment, N
When max characters are recognized, for each character, the feature parameter Pn, the font from which the first candidate that does not match the second candidate is obtained, and the second candidate character are temporarily stored in a buffer memory. Once the font is known, the contents of the papa memory BFF are read and the following processing is performed for each recognized character.

まず、選択した辞書のフォントと第２候補が得られたフ
ォントが一致するかどうかをチェックする。First, it is checked whether the font of the selected dictionary matches the font obtained as the second candidate.

もし一致しなければ、バッファメモリＢＦＦに格納され
ているその第２候補は誤りであるから、その文字の特徴
パラメータＰｎを読んで選択した辞書を用いて再度認識
を行ない、その結果で第２候補を更新する。一致する場
合には、バッファメモリＢＦＦに格納されている第２候
補をそのまま認識結果として出力する。従って、この実
施例においては、誤認識のあった文字についてだけ再認
識を行なうので１文字数Ｎ　１ｌＩａｘが大きい場合に
再認識の時間が短縮される。If they do not match, the second candidate stored in the buffer memory BFF is incorrect, so the feature parameter Pn of the character is read and recognition is performed again using the selected dictionary, and the result is the second candidate. Update. If they match, the second candidate stored in the buffer memory BFF is output as is as the recognition result. Therefore, in this embodiment, only characters that have been erroneously recognized are re-recognized, so that the re-recognition time is shortened when the number of characters N1lIax is large.

次に本発明のもう１つの実施例を説明する。第４図を参
照すると、この実施例においては、前記バッファメモリ
ＢＦＦを省略し、各々のフォントに対応付けた３つのメ
モリＭα２ＭβおよびＭγを使用する。第５図を、参照
して、この実施例の認識手順を、前記実施例と異なる部
分のみについて説明する。Next, another embodiment of the present invention will be described. Referring to FIG. 4, in this embodiment, the buffer memory BFF is omitted and three memories Mα2Mβ and Mγ associated with each font are used. With reference to FIG. 5, the recognition procedure of this embodiment will be explained only with respect to the parts that are different from the previous embodiment.

この実施例では、フォント識別のためのＮｍａｘ個の文
字認識の際に、各々の文字の第１候補文字を各フォント
毎にそれぞれメモリＭα２ＭβおよびＭγに格納する。In this embodiment, when recognizing Nmax characters for font identification, the first candidate character of each character is stored in memories Mα2Mβ and Mγ for each font, respectively.

そして、フォントが判明すると、そのフォントに対応付
けたメモリＭα２Ｍβ又はＭγの内容を読んで、それを
認識結果にする。従ってこの実施例では再認識をする必
要がない。When the font is determined, the content of the memory Mα2Mβ or Mγ associated with the font is read and used as a recognition result. Therefore, in this embodiment, there is no need for re-recognition.

なお、上記実施例においては、Ｎｍａｘ個の文字につい
てフォントが判明した後でもう１度チェックを行なって
いるが、多数決判定又は距離判定により十分が認識結果
が得られる場合には、このような処理は省略してもよい
。Note that in the above embodiment, another check is performed after the fonts for Nmax characters are determined, but if a sufficient recognition result is obtained by majority judgment or distance judgment, such processing may be performed. may be omitted.

また、実施例ではフォント判別を行なうために使用する
文字を数値Ｎｍａｘとして特定したが、例えば１行分の
文字列に特定したり５頁単位に特定してもよい。また、
実施例では最初にＮｍａｘ個の文字についてフォント判
定をした後はフォント判定をしないが１例えば原稿の頁
が更新されたら、Ｎを初期値に戻してフォント判定を行
なうように変更してもよい。Further, in the embodiment, the characters used for font discrimination are specified as the numerical value Nmax, but the characters may be specified, for example, in one line of character strings or in units of five pages. Also,
In the embodiment, font determination is not performed after first performing font determination for Nmax characters; however, for example, when a page of a manuscript is updated, the font determination may be performed by returning N to the initial value.

更に、実施例では加算器、カウンタ、メ毫り等をそれぞ
れ独立した構成要素にしたが、例えばマイクロコンピュ
ータ等を用いる場合には、それ自体の機能を利用するこ
とにより、特別な構成要素を付加することなしに本発明
を実施しうる。Furthermore, in the embodiment, the adder, counter, message, etc. are each made into independent components, but when using a microcomputer, for example, special components can be added by utilizing its own functions. The present invention can be practiced without having to do so.

［効果］以上のとおり、本発明によれば、複数の文字について、
各々の辞書のデータと文字の特徴パラメータとの距離を
平均化した値でフォントを判別するので、誤判定がなく
なる。[Effect] As described above, according to the present invention, for a plurality of characters,
Since the font is determined based on the average distance between the data in each dictionary and the characteristic parameter of the character, misjudgment is eliminated.

[Brief explanation of the drawing]

第１図は、本発明を１態様で実施する文字認識装置の構
成を示すブロック図である。第２図は、第１図の装置におけるフォント判別手順を示
すフローチャートである。第３図は、第２図の実施例の変形例を示すフローチャー
トである。第４図は、本発明をもう１つの態様で実施する文字認識
装置の構成を示すブロック図である。第５図は、第４図の装置におけるフォント判別手順を示
すフローチャートである、ＤＣα、ＤＣβ、ＤＣγ：辞書ＡＤα、ＡＤβ、ＡＤγ：加算器ＣＮα、ＣＮβ、ＣＮγ：カウンタＢＦＦ：バッファメモリＭα２Ｍβ２Ｍγ：メモリFIG. 1 is a block diagram showing the configuration of a character recognition device implementing one aspect of the present invention. FIG. 2 is a flowchart showing a font discrimination procedure in the apparatus shown in FIG. FIG. 3 is a flowchart showing a modification of the embodiment shown in FIG. FIG. 4 is a block diagram showing the configuration of a character recognition device implementing another embodiment of the present invention. FIG. 5 is a flowchart showing the font discrimination procedure in the apparatus of FIG. 4. DCα, DCβ, DCγ: Dictionaries ADα, ADβ, ADγ: Adders CNα, CNβ, CNγ: Counter BFF: Buffer memory Mα2Mβ2Mγ: Memory

Claims

[Claims]

(1) In character recognition where multiple types of character fonts are recognized; multiple characters are recognized, and for each character, a dictionary of different character fonts is used as the first candidate character according to the characteristic parameters of the character to be recognized. At the same time, the distance value between the feature parameter of each first candidate and the feature parameter of the recognized character is calculated and accumulated for each character font, and the most appropriate one is selected from among the first candidates. is selected as the second candidate, calculates a value for each font depending on whether or not there is a match between the first candidate character and the second candidate character, and calculates the value and the cumulative distance value for each font,
A character recognition method characterized by comparing the results between fonts to identify the font.

(2) The selection of the second candidate is determined by majority vote from among the first candidates, and if the second candidate cannot be selected, the second candidate is determined to be the one closest to the characteristics of the character to be recognized among the first candidates. A character recognition method according to claim (1).

(3) For each of a plurality of characters to be recognized for font identification, characteristic parameters are stored, and after font identification, the characteristic parameters are read and character recognition is performed again for the plurality of characters. A character recognition method according to claim (1).

(4) For each of a plurality of characters recognized for font identification, its characteristic parameters, second candidate characters, and whether or not the first candidate character and second candidate character match, and the relationship with the font are stored. If the identified font and the obtained font of the second candidate character do not match, read the feature parameters for that character and perform re-recognition.
A character recognition method according to claim (1).

(5) For each of the multiple characters to be recognized for font identification, the first candidate character of each font is memorized, and when the font is identified, the first candidate character of that font is output as the recognition result. , a character recognition method according to claim (1).