JPH03260886A

JPH03260886A - Character recognizing method

Info

Publication number: JPH03260886A
Application number: JP2057995A
Authority: JP
Inventors: Tetsuo Kiuchi; 木内　哲夫; Kazushi Yoshida; 收志吉田
Original assignee: Fuji Electric Co Ltd; Fuji Facom Corp
Current assignee: Fuji Electric Co Ltd; Fuji Facom Corp
Priority date: 1990-03-12
Filing date: 1990-03-12
Publication date: 1991-11-20

Abstract

PURPOSE:To improve the rate of recognition by calculating a ratio to a standard character in a document including a character to be recognized and comparing this ratio with a reference value calculated in advance concerning at least one of the character width, character height, area of circumscribed rectangle and aspect ratio of the character to be recognized. CONSTITUTION:Remaining difference Hn of the normalized character height to be defined by expressions I and II, and remaining difference Wn of the normalized character width is calculated. In the expressions I and II, H and W show the character height and character width to be measured from the character picture, Hs and Ws show the standard character height and standard character width in the II document, Hmuj and Hdeltaj show the average value and standard deviation of the ratio to the standard character height for each character form of a character type (j), and Wmuj and Wdeltaj show the mean value and standard deviation of the ratio to the standard character height for each character form of this character type (j). Next, remaining difference NVn of a normalized longitudinal discrete number and remaining difference NHn of a lateral discrete number is calculated. Namely, concerning one of the character width, character height, area of the circumscribed rectangle and aspect ratio of the character to be recognized at least, the ratio to the respectively corresponding amount of the standard character in the document including the character to be recognized is estimated from the character size. Thus, recognition is enabled with high accuracy.

Description

【発明の詳細な説明】〔産業上の利用分野〕この発明は、文字やマークなど（以下、単に文字等とい
う）を認識するための文字認識方法、特に高精度な文字
認識が可能な文字認識方法に関する。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to a character recognition method for recognizing characters, marks, etc. (hereinafter simply referred to as characters, etc.), and in particular a character recognition method capable of highly accurate character recognition. Regarding the method.

[Conventional technology]

従来、この種の方法として、例えば文書画像から文字行
または文字列（以下、単に文字行という）を切り出した
後、文字らしきものを仮文字として抽出し、この仮文字
から所定の文字サイズを基準として全角文字（漢字等）
を切り出し、文字サイズから全角文字と確定できないも
のは、半角文字として有効に成立し得る文字か否かを調
べ、取立すれば半角文字として認識する方法がある。し
かし、このような方法では文字によって全角文字が２つ
以上の半角文字として誤Ｅ？２１にされるおそれがある
。そこで、出願人は次のような方法を提案している（特
願昭６３−２９２４４５号；提案済み方法１とも云う）
。Conventionally, as a method of this kind, for example, after cutting out a character line or character string (hereinafter simply referred to as a character line) from a document image, what looks like characters is extracted as a temporary character, and a predetermined character size is set as a standard from this temporary character. as full-width characters (kanji, etc.)
If a character cannot be determined to be a full-width character based on the font size, it can be checked to see if it can be effectively used as a half-width character, and then recognized as a half-width character. However, with this method, depending on the character, a full-width character may be mistakenly treated as two or more half-width characters. There is a risk of being made 21. Therefore, the applicant has proposed the following method (Japanese Patent Application No. 63-292445; also referred to as Proposed Method 1).
.

第６図は提案済みの方法１を説明するためのフローチャ
ートである。FIG. 6 is a flowchart for explaining proposed method 1.

これは、図示されない画像処理装置の処理手順を示すも
ので、まず文書画像データを入力しく■参照）、その水
平方向の投影値をとることにより、各文字行を切り出す
（■参照）。これにより、行の幅寸法を求め、全角文字
の大きさに相当する量（文字サイズ）を得る。なお、こ
こでは横書きの場合を想定しているが、縦書きの場合も
同様である。This shows the processing procedure of an image processing device (not shown). First, document image data is input (see (2)), and each character line is cut out by taking its horizontal projection value (see (2)). As a result, the width of the line is determined, and the amount (character size) corresponding to the size of a full-width character is obtained. Note that although horizontal writing is assumed here, the same applies to vertical writing.

次に、各行に垂直な方向の投影値を調べ、文字サイズを
考慮することにより、各文字行から文字らしきもの、す
なわち仮文字群を切り出しく■参照）、シかる後この仮
文字群の中から上記文字サイズを利用して全角文字を選
出する（■参照）。Next, by examining the projection value in the direction perpendicular to each line and taking the character size into account, we cut out what appears to be characters, that is, a group of temporary characters, from each line of characters (see ■). Select full-width characters using the above font size (see ■).

全角文字として選出する条件は次のとおりである。The conditions for selecting characters as full-width characters are as follows.

イ）それ単独で文字サイズが全角文字サイズのもの、即
ち他の仮文字と結合する余地の全くないもの。b) The font size is full-width font size by itself, that is, there is no room for combining with other temporary characters.

口）句読点。oral) punctuation.

ハ）それ単独では半角サイズであるが、隣り合う他の半
角サイズの仮文字と結合させてみると全角サイズとなる
もの。c) It is a half-width size by itself, but becomes a full-width size when combined with other adjacent half-width temporary characters.

二）それ単独ではサイズが全角サイズよりも小さいが、
隣り合う他の半角サイズの仮文字との間に距離があり過
ぎ、これらを無理に結合させると全角文字サイズをこえ
るもの。2) Although the size alone is smaller than the full-width size,
There is too much distance between adjacent half-width temporary characters, and if you forcefully combine them, the size will exceed the full-width character size.

以上の如き条件に従って全角文字を全て選出した後、あ
とに残った仮文字について、これを統合または分離して
統合文字１分離文字を作威しく０参照）、シかる後これ
らの統合文字１分離文字をＯＣＲ（文字読取装置）によ
り、辞書パターンとの類似度を利用して認識する（■参
照）。After selecting all full-width characters according to the above conditions, merge or separate the remaining temporary characters (refer to 1 integrated character 1 separated character (0)), and then combine these integrated characters 1 separated character. Characters are recognized using an OCR (character reading device) based on their similarity to dictionary patterns (see ■).

次に、その認識結果に対して次のような矛盾処理を実行
する（■参照）。Next, the following contradiction processing is performed on the recognition result (see ■).

ａ）例えば認識すべき対象が分離文字であるにも関わら
ず、ＯＣＲによる認識結果が全角サイズの漢字を示すも
のとすれば互いに矛盾するので、かかる認識結果は採用
しない。a) For example, even though the target to be recognized is a separated character, if the recognition result by OCR indicates a full-width kanji character, this recognition result is not adopted because it contradicts each other.

ｂ）上記とは逆に、認識すべき対象か統合文字であるに
もかかわらず、ＯＣＲによる認識結果が英字、数字等の
半角サイズ文字を示す場合。b) Contrary to the above, when the recognition result by OCR shows half-width size characters such as alphabets and numbers, even though the target to be recognized is an integrated character.

そして、最後に残された文字につき、これを統合文字と
すべきか分離文字とすべきかを、ＯＣＲにより相対類似
度を用いて判別する（■参照）。Then, regarding the last remaining character, whether it should be used as an integrated character or a separated character is determined by OCR using relative similarity (see ■).

なお、類似度ｘｌは、類似度Ｘと類似度の平均値ｍとの
比に、或る定数（例えば、１０２４）を掛けたものとし
て定義する。すなわち、ｘｌ＝ｘ／ｍ一定数（１０２４
）である。Note that the degree of similarity xl is defined as the ratio of the degree of similarity X and the average value m of the degrees of similarity multiplied by a certain constant (for example, 1024). That is, xl=x/m constant number (1024
).

上述の方法では類似度から認識結果を決定するようにし
ているので、例えば「う」と「ろ」、「テ」と「チｊの
如く非常に似ているものは、文字画像の僅かな違い、例
えば文字等の太さや字体の相違、文字等のつぶれやかす
れ等により誤認識するおそれがある。また、同様の理由
から、「は」の相対類似度よりも「（ｊと「よ」の部分
の相対ＭｉｆＪ度の方が高くなり、ｆ４　認識してしま
うことがある。このことにつき、以下に具体的に説明す
る。In the above method, recognition results are determined based on the degree of similarity, so for example, when characters are very similar, such as ``u'' and ``ro'', or ``te'' and ``chij'', slight differences in the character images may be detected. , for example, there is a risk of misrecognition due to differences in the thickness and font of the characters, crushed or blurred characters, etc. Also, for the same reason, the relative similarity of ``(j and yo'' is more important than the relative similarity of ``wa''). The relative MifJ degree of the part may be higher and f4 may be recognized.This will be explained in detail below.

第７図は入力文字「は」について、これを統合文字１と
して処理した場合と、分離文字２．３として処理した場
合の類似度Ｘ、相対類似度ｘｉを示すものである。すな
わち、類似度Ｘ、類似度の平均値ｍとが図示の如く得ら
れたものとすると、類似度とその平均値との比に一定数
（１０２４）を乗じて得られる相対類似度は、統合文字
１の場合はｒ９５４．、分離文字２，３の場合はそれぞ
れ「１０２８３．　　「８９８．でその平均値は「９６
３」となり、９５４＜９６３であることから、入力文字「は」は「（」（前括弧）と
「はヨからなるものとして誤認識されることになる。な
お、定数はｒ１０２４」に限らないことは勿論である。FIG. 7 shows the similarity X and relative similarity xi of the input character "ha" when it is processed as an integrated character 1 and when it is processed as a separated character 2.3. That is, assuming that the similarity X and the average value m of the similarity are obtained as shown in the figure, the relative similarity obtained by multiplying the ratio of the similarity and the average value by a constant number (1024) is the integrated For character 1, r954. , in the case of separator characters 2 and 3, respectively "10283."898.The average value is "96.
3", and since 954<963, the input character "wa" will be misrecognized as consisting of "(" (front parenthesis) and "hayo".The constant is not limited to "r1024". Of course.

このような問題を解決するため、出願人はさらに次のよ
うな方法を提案している（特願平１−３９３０８号；提
案済み方法２ともいう）。In order to solve these problems, the applicant has further proposed the following method (Japanese Patent Application No. 1-39308; also referred to as Proposed Method 2).

第８図は提案済み方法２を説明するためのフローチャー
トで、第６図に示すものに対し形状特徴照合ステップ［
相］〜［相］を付加した点が特徴である。FIG. 8 is a flowchart for explaining proposed method 2, in which the shape feature matching step [
It is characterized by the addition of [phase] to [phase].

この方法の要点は、認識すべき文字の縦横比と縦方向お
よび横方向にそれぞれいくつに分離できるかを示す形状
特徴量も辞書として予め記憶しておき、入力文字の形状
特徴量をその認識結果と対応する候補文字の予め記憶さ
れている形状特徴量と比較して、類似度に基づく認識結
果を採用するか否かを決定するようにした点にある。The key point of this method is that the aspect ratio of the character to be recognized and the shape features that indicate how many parts can be separated in the vertical and horizontal directions are also stored in advance as a dictionary, and the shape features of the input character are used as the recognition results. The present invention is characterized in that it is determined whether or not to adopt a recognition result based on the degree of similarity by comparing the shape feature amount of the corresponding candidate character with the previously stored shape feature amount.

この例につき、以下に具体的に説明する。This example will be specifically explained below.

第９図は入力画像（または入力文字）が「う」で、人力
画像から得られる形状特徴は「縦長で、縦方向に２つに
分離する」であり、これに対して認識結果の候補文字の
第１〜第１０位のうちの第１位が「ろ」となった例であ
る。この場合、辞書に予め記憶されている「ろ」の形状
特徴は「縦長でも横長でもなく、縦横に分離しない」で
ある。In Figure 9, the input image (or input character) is ``u'', and the shape feature obtained from the human image is ``vertically long and separated into two vertically'', and the recognition result candidate character This is an example in which the first place among the first to tenth places is "ro". In this case, the shape characteristics of "ro" stored in the dictionary in advance are "neither vertically nor horizontally long, nor separated vertically and horizontally."

従って、縦長・横長チエツクのステップ（第８図［相］
参照）でＮｏ”となるため、第２位の候補文字のチエツ
クが行なわれる。第２位の候補文字は「う」であり、形
状特徴量が一致するので、これを認識結果として採用す
る。Therefore, the step of vertical/horizontal check (Fig. 8 [phase]
), the second candidate character is checked.The second candidate character is "u", and since the shape features match, this is adopted as the recognition result.

第１０図は入力文字「は」を統合文字として処理する場
合と、分離文字ｒ（、、ｒよ」として処理する場合の例
である。すなわち、同図（イ）の如く統合文字として処
理する場合は、認識結果の第１候補「は」が形状特徴量
の点からも適合するので、「はｊが採用される。同様に
、分離文字「（」は同図（ロ）の如く第１位候補文字「
（」と適合するが、分離文字「よ」は同図（ハ）の如く
、その形状特徴量から第１位候補文字「は」とは適合せ
ず、結局第２位の候補文字「まＪと適合する。Figure 10 shows an example where the input character "wa" is processed as an integrated character and when it is processed as a separate character r (,, ryo). In other words, it is processed as an integrated character as shown in (a) of the same figure. In this case, the first candidate "ha" in the recognition result is also suitable from the point of view of the shape feature, so "haj" is adopted.Similarly, the separated character "(" is the first candidate as shown in (b) of the same figure. Position candidate character ``
However, as shown in the same figure (c), the separated character ``yo'' does not match the first candidate character ``ha'' due to its shape features, and in the end, the second candidate character ``maJ'' Compatible with

そし＝、同図（ニ）に示す如く、「は」として認識した
ときの相対類似度はｒ９５４」、「（」と「ま」に分離
されるものとしたときの相対類似度はｒｃ＋４６」とな
り、９４６＜９５４から１は」が採用されることになる。つまり、入力文字
は「は」と認識され、「（」と「ま」に誤認識されるお
それをなくすことができる。As shown in the same figure (d), the relative similarity when recognized as "wa" is r954, and the relative similarity when it is separated into "(" and "ma" is rc+46). , 946<954, then 1 is adopted.In other words, the input character is recognized as ``ha'', and the possibility of it being mistakenly recognized as ``('' and ``ma'' can be eliminated).

[Problem to be solved by the invention]

上述の形状特徴は限定された条件では有効であるが、−
船釣に扱おうとすると困難である。例えば、文字の縦横
比は字体によって異なるため縦横比が２倍以上必要とさ
れることから、「−」や「１」のように適用可能な字種
が限定されることになる。例えば、片仮名の「口」と漢
字の「口」は文字画像が相似であるが、大きさを比べる
と漢字の「日二の方が大きく、片仮名の「力」と漢字の
１カニ、片仮名の「工ｊと漢字のｒ工、も同様である。Although the above shape features are effective under limited conditions, −
It is difficult to use it for boat fishing. For example, since the aspect ratio of a character differs depending on the font, the aspect ratio must be twice or more, which limits the types of characters that can be applied, such as "-" and "1". For example, the character images of the katakana ``口'' and the kanji ``mouth'' are similar, but when comparing the sizes, the kanji ``日二'' is larger; ``The same is true for the kanji ``ko j'' and the kanji ``r''.

また、英字の小文字と大文字ではｒ（、とｒｃ４．　ｒ
Ｋ」とｒｋ、、　ｒ□、と７０；、　ｒｐ。Also, for lowercase and uppercase letters, r(, and rc4.r
K'' and rk,, r□, and 70;, rp.

と７ＰＪ、　’Ｓｊと’ＳＪ、　’ＵＪとｒｔＢ、ｒＶ
ごと’ｖ二、ｒＪ、とｒＷＪ、　７ＸＪと「Ｘ−１「Ｚ
：とｚＢが同様である。さらに、英字の「Ｉ」と漢字の
「工」、漢字の「８二とｒ日二は縦横比を変えると相似
となる。and 7PJ, 'Sj and 'SJ, 'UJ and rtB, rV
Goto'v2, rJ, and rWJ, 7XJ and "X-1"Z
: and zB are similar. Furthermore, the English letter ``I'', the kanji ``工'', and the kanji ``82'' and ``r日2'' become similar when the aspect ratio is changed.

一方、分離数については「二ｊは確実に１＠１方向に２
つに分離する」が、「は」は常に「横方向に２つに分離
する」とは限らない。つまり、文字種により分離数を確
実に指定することはできない。On the other hand, regarding the number of separations, ``2j is definitely 1@2 in one direction.
"to separate into two", but "wa" does not always mean "to separate into two laterally". In other words, it is not possible to reliably specify the number of separations depending on the character type.

このように、文字画像のノイズは勿論のこと、文字の傾
き１文字の大きさ（級数）、変形率等が分離数に影響す
るので、やはり適用可能な字種が限定されることになる
。In this way, the number of separations is affected not only by noise in the character image, but also by the inclination of the characters, the size (series) of each character, the deformation rate, etc., and thus the applicable character types are limited.

したがって、この発明の目的は形状特徴を広く適用でき
るように改良して以上の如き欠点を除去し、認識率を向
上させることにある。Therefore, an object of the present invention is to improve the shape features so that they can be widely applied, eliminate the above-mentioned drawbacks, and improve the recognition rate.

[Means to solve the problem]

文書を画像処理して個々の文字を切り出し各文字毎に辞
書パターンとの類似度を求めて文字を認識するに当たり
、認識すべき文字の文字幅１文字高さ、外接矩形の面積
、縦横比の少なくとも１つについて、認識すべき文字を
含む文書内の標準的な文字との比率を求め、これを予め
求めておいた基準値と比較することにより、或る文字の
字形が他の文字の字形を縦方向、横方向または縦横両方
向に伸張または圧縮して得られる文字と類似するときの
判別を容易にする。このとき、前記各比率の平均値、標
準偏差またはこれらから導かれる量を求めるとともに、
その各々に対する基準値を予め求めてこれを辞書として
所定のメモリに記憶しておくことができる。When recognizing characters by image processing a document, cutting out individual characters, and determining the similarity of each character with a dictionary pattern, we calculate the character width, character height, circumscribed rectangle area, and aspect ratio of the character to be recognized. By determining the ratio of at least one of the characters to standard characters in the document containing the character to be recognized and comparing this with a predetermined reference value, the glyph shape of a certain character can be determined from the glyph shape of another character. To facilitate discrimination when similar to characters obtained by expanding or compressing in the vertical direction, horizontal direction, or both vertical and horizontal directions. At this time, find the average value, standard deviation, or amount derived from these for each ratio, and
Reference values for each of these can be determined in advance and stored in a predetermined memory as a dictionary.

また、文書を画像処理して個々の文字を切り出し各文字
毎に辞書パターンとの類似度を求めて文字を認識するに
当たり、認識すべき文字の縦置離数、横分離数をその文
字サイズから予測し、これを基準値と比較することによ
り、或る文字が分離文字になり易い字形であるときの判
別を容易にする。このとき、前記縦分離数、横分離数の
平均値。In addition, when recognizing characters by image processing a document, cutting out individual characters, and determining the similarity of each character with a dictionary pattern, we can calculate the number of vertical and horizontal separations of the characters to be recognized based on the character size. By predicting and comparing this with a reference value, it is easy to determine when a certain character has a shape that is likely to become a separated character. At this time, the average value of the number of vertical separations and the number of horizontal separations.

標準偏差またはこれらから導かれる量を求めるとともに
、その各々に対する基準値を予め求めてこれを辞書とし
て所定のメモリに記憶しておくことができる。In addition to determining standard deviations or quantities derived therefrom, reference values for each of them can be determined in advance and stored in a predetermined memory as a dictionary.

[Effect]

一般に、例えば片仮名の「口」と漢字の１口」を判別す
る場合、明朝体を除いてはその文字単独では困難である
。しかし、片仮名の「口、と漢字の「口」の２つの文字
を比較できれば、小さい方が片仮名の「口ｊで、大きい
方が漢字の「口」である。一般の文書を読んでいる場合
、そのような直接比較は出来ないが、他の文字と比較し
てその文字が大きめのものに属するか、小さめのものに
属するかで判定することができる。すなわち、その文字
の標準文字サイズ（変形率も含めて）を知り、文字画像
の大きさを量的に比較することにより、類似文字の判定
を正確に行なうことができる。In general, for example, it is difficult to distinguish between the katakana ``kuchi'' and the kanji ``ichikuchi'' based on the characters alone, except in Mincho fonts. However, if we can compare the two characters ``guchi'' in katakana and ``mouth'' in kanji, the smaller one is the katakana ``guchi j'' and the larger one is the kanji ``mouth''. When reading a general document, such direct comparison is not possible, but it is possible to determine whether a character belongs to a larger or smaller size by comparing it with other characters. That is, by knowing the standard character size (including the deformation rate) of the character and quantitatively comparing the sizes of character images, it is possible to accurately determine similar characters.

また、文字サイズが小さくなって０に近づくと、どんな
文字でも文字画像は潰れて塊になるので、分離数は１に
収束し、文字サイズが大きくなると、文字は潰れにくく
なって分離数は増加傾向となり、さらに大きくなると分
離数はその字種の分離し得る最大値に収束する。そこで
、分離文字を文字サイズの連続関数とみなし、分離数の
期待値からどれくらい外れているかを求めることにより
、統合文字２分離文字の判定を正確に行なうことができ
る。Also, as the font size decreases and approaches 0, the character image of any character becomes crushed and lumped, so the number of separations converges to 1, and as the font size increases, the characters become less likely to be crushed and the number of separations increases. As the number of separations increases further, the number of separations converges to the maximum value that can be separated for that character type. Therefore, by regarding the separated characters as a continuous function of the character size and determining how much the number of separated characters deviates from the expected value, it is possible to accurately determine whether the integrated character is a separated character or not.

つまり、認識すべき文字の文字幅２文字高さ。In other words, the character width and height of the character to be recognized.

外接矩形の面積、縦横比の少なくとも１つについての認
識すべき文字を含む文書内の標準的な文字の各対応する
量との比率、または縦分離数、横分離数をその文字サイ
ズから予測することにより、より高精度の認識を可能に
するものである。Predict the area of a circumscribing rectangle, the ratio of at least one of the aspect ratios to the corresponding amount of standard characters in a document containing the character to be recognized, or the number of vertical and horizontal separations from the character size. This enables more accurate recognition.

〔Example〕

第１図はこの発明の実施例を示すフローチャートである
。なお、ステップ■〜■は従来と同様なので、ステップ
■〜［相］について以下に説明する。FIG. 1 is a flowchart showing an embodiment of the invention. Incidentally, steps ① to ① are the same as those of the conventional method, so steps ② to [phase] will be explained below.

まず、次式で定義される正規化した文字高さの残差Ｈｎ
、および正規化した文字幅の残差Ｗｎを求める（■、［
相］参照）。First, the normalized character height residual Hn defined by the following formula
, and find the residual Wn of the normalized character width (■, [
phase]).

二二に、Ｈ，Ｗは文字画像から測定した文字高さ５文字
幅である。第２図（イ）に入力画像のサンプルデータを
示す。Ｈｓ、Ｗｓは入力文書（文章２段落１行）の標準
文字高さ、標準文字幅で、第２図（ロ）にそのデータを
文書データとして示している。或る字体の標準文字高さ
、標準文字幅シよその字体の最大文字高さ、最大文字幅
であり、その字体の漢字の最大文字高さ、最大文字幅と
考えて差支えない。なお、標準文字サイズは字体や変形
率１級数等の違いによって生しる誤差を吸収するための
もので、安定な値が得られるのであれば最大値に限らず
、平均値または最頻値でも良いことは勿論である。入力
文書の標準文字高さ、標単文字幅は文書を読取る前に外
部から指定されるか、読取り中に周囲の文字から統計的
に求めることができるので、その詳細は省略する。22. H and W are the character height and 5 character width measured from the character image. FIG. 2(a) shows sample data of the input image. Hs and Ws are the standard character height and standard character width of the input document (two paragraphs of text, one line), and the data is shown as document data in FIG. 2 (b). The standard character height and standard character width of a certain font are the maximum character height and maximum character width of other fonts, and can be thought of as the maximum character height and maximum character width of Kanji characters of that font. Note that the standard font size is used to absorb errors caused by differences in font, deformation rate, etc., and if a stable value can be obtained, it is not limited to the maximum value, but can also be the average value or mode. Of course it's a good thing. The standard character height and standard character width of the input document can be specified externally before reading the document, or can be statistically determined from surrounding characters during reading, so their details will be omitted.

また、標準文字高さＨｓ、標準文字幅Ｗｓは文字級数を
Ｑ１垂直変形率をαＶ、水平変形率をαｈ、レターフェ
ースと正体の標準文字高さ、標準文字幅との比率をそれ
ぞれβＶ、βｈとすれば、次式の関係にある。In addition, for the standard character height Hs and standard character width Ws, the character series is Q1, the vertical deformation rate is αV, the horizontal deformation rate is αh, and the ratios between the letter face and the original standard character height and standard character width are βV and βh, respectively. Then, we have the following relationship.

Ｈｓ＝β７．Ｑ、αＶ　　　　　　　・・・（３）Ｗ　
ｓ　＝βｈ−にｌαｈ　　　　　　　　＝（４ｊＨμｊ
、Ｈσｊは字種ｊの字体毎の標準文字高さとの比率の平
均値、標準偏差を示し、Ｗ、ｃｚｊ。Hs=β7. Q, αV...(3)W
s = βh− to lαh = (4jHμj
, Hσj represent the average value and standard deviation of the ratio of character type j to the standard character height for each font, and W, czzj.

Ｗσｊは字種ｊの字体毎の標準文字高さとの比率の平均
値、標準偏差を示す。これらの求め方につき、以下に説
明する。Wσj represents the average value and standard deviation of the ratio of character type j to the standard character height for each font. How to obtain these values will be explained below.

例えば、ゴシック体と明朝体とでは、同し級数の同し字
種を比べると、一般に明朝体の方が大きい。片仮名の「
口ｊと漢字の「口」とを大きさで判別するため、このよ
うな字体間の文字サイズの違いを補正しておくことが望
ましい。そこで、例えばゴシック体１３級の「あ」を、
ゴシック体１３級の標準文字高さで割った値を求める。For example, when comparing Gothic and Mincho fonts with the same character type in the same series, Mincho fonts are generally larger. Katakana “
In order to distinguish between 口 j and the kanji character ``口'' based on their size, it is desirable to correct this difference in character size between fonts. So, for example, "a" in Gothic grade 13,
Find the value divided by the standard character height of Gothic grade 13.

次に、明朝体１６級の「あ」を、明朝体１６級の標準文
字高さで割った値を求める。このようにして、次々と「
あ」について文字セント（字体）毎の標準文字高さとの
比率を求め、これらから平均値「Ｈμあ」と標準偏差「
Ｈσあ：が求められる。これらＨｕｊ、Ｈσｊ、Ｈｕｊ
、Ｗσｊは、文書を読取るに先立って読取り辞書を作成
するとともに計算しておき、字種ｊによって簡単に引け
るように表にして第３図の如く記憶しておく。なお、第
３図は字種に１の特徴量を詳細に示しているが、他の字
種についても同様である。また、以上では文字高さ１文
字幅を用いたが、これと等価な外接矩形と縦横比を用い
ても良い。Next, calculate the value of "A" in the 16th grade Mincho typeface divided by the standard character height of the 16th grade Mincho typeface. In this way, one after another,
Find the ratio of standard character height for each character cent (font) for ``A'', and calculate the average value ``HμA'' and the standard deviation ``A'' from these.
Hσa: is required. These Huj, Hσj, Huj
, Wσj are calculated while creating a reading dictionary prior to reading the document, and are stored in a table as shown in FIG. 3 so that they can be easily retrieved by character type j. Note that although FIG. 3 shows the feature amount for character type 1 in detail, the same applies to other character types. Furthermore, although character height and one character width are used in the above example, a circumscribing rectangle and aspect ratio equivalent to this may be used.

次に、次式の如く定義される正規化した縦分離数の残差
ＮＶｎ、および正規化した横分離数の残差ＮＨｎを求め
る（■、ａ２）参照）。Next, the residual difference NVn of the normalized number of vertical separations and the residual difference NHn of the normalized number of horizontal separations are determined as defined by the following equation (see ■, a2).

ＮＶ　σ　ｊ二二に、ＮＶ、ＮＨは入力文字画像から測定した縦置離
数、横分離数で、これらは、縦分離数の平均　　二Ｎ■μｊ＝Ｆ１　（Ｌ　　Ｈ）縦
分離数の標準偏差二ＮＶσｊ＝Ｆ２　（ｊ、Ｈ）横分離
数の平均　　：ＮＨμｊ＝Ｆ３　（ｊ、Ｗ）横分離数の
標準偏差：ＮＨｔｊｊ＝Ｆ・４　（ｊ、Ｗ）のように字
種ｊと文字幅Ｗ１文字高さＨとの関数である。NV σ j Second, NV and NH are the number of vertical separations and the number of horizontal separations measured from the input character image, and these are the average of the number of vertical separations 2Nμj=F1 (L H) Standard of the number of vertical separations Deviation2 NVσj=F2 (j, H) Average number of horizontal separations: NHμj=F3 (j, W) Standard deviation of number of horizontal separations: NHtjj=F・4 (j, W) for character type j and character width W1 is a function of character height H.

例えシヨ、ゴシック体１０級の「は」の横分離数は「２
」、教科書体１０級の「は」の横分離数は「１」、明朝
体１０級の「は」の横分離数はＦ２」であったとすると
、その平均値と標準偏差は、ＮＨμは＝Ｆ３（は、１０
級）＝、（２＋１＋２）／３＝１．６６７ＮＨσは＝Ｆ４（は、１０級）＝　［（４＋１＋４）／３−　（１，６６７）”］　”
２＝０．４７となる。このようにして、１０級のもっと多くの字体に
対してｒＮＨμはＪ、ｒＮＨσはコを求める。次に、１
２級についても同様にしてｒＮＨμはｊ＋　　ｒＮＨσ
は」を求め、以後同様の操作を繰り返し、補間法により
Ｆｌ（は、　　Ｈ）、Ｆ２　ｅまＨ）、Ｆ３（は、Ｗ）
、Ｆ４（は　Ｗ）を求める。For example, the horizontal separation number of "ha" in Gothic grade 10 is "2".
'', the number of horizontal separations for ``ha'' in the 10th grade textbook is ``1'', and the number of horizontal separations for ``ha'' in the 10th grade Mincho font is F2'', then the average value and standard deviation are, NHμ is =F3(is, 10
class) =, (2+1+2)/3 = 1.667 NHσ = F4 (is, 10th class) = [(4+1+4)/3- (1,667)”] ”
2=0.47. In this way, rNHμ is determined to be J, and rNHσ is determined to be C for more fonts in grade 10. Next, 1
Similarly for the secondary level, rNHμ is j + rNHσ
After that, repeat the same operation and use the interpolation method to obtain Fl (ha, H), F2 e ma H), F3 (ha, W)
, F4 (is W).

そして、全ての字種についてかかる操作を実施すれば、
関数Ｆｌ　（ｊ、Ｈ）、Ｆ２　（ｊ、Ｈ）、Ｆ３　（ｊ
、Ｗ）　、Ｆ４　（ｊ、Ｗ）が求められることになる。Then, if you perform this operation for all character types,
Function Fl (j, H), F2 (j, H), F3 (j
, W) , F4 (j, W) will be found.

第４図に縦分離数の平均関数とその標準偏差Ｆ１、Ｆ２
および横分離数の平均関数とその標準偏差Ｆ３．Ｆ４の
例を示す。Figure 4 shows the average function of the number of vertical separations and its standard deviations F1 and F2.
and the average function of the number of lateral separations and its standard deviation F3. An example of F4 is shown.

縦分離数の平均関数Ｆｌ　（ｊ、Ｈ）は同図（イ）のよ
うに文字種ｊ１文字高さＨの関数であり、（ａ　３　　
　　　　　　　　　（ａ　２≦Ｈ）（７）で表わされる。その標準偏差の関数Ｆ２　（ｊ、Ｈ）は
同図（ロ）から、［６（Ｈ≦ｂｌ）ＮＶσ＝Ｊ、　（ｂ５−ｂ３）　（Ｈ−ｂ２）／　（ｂ
・・・　（８）で表わされる。同様に、・・・　（９）・・・　（１０）で表わされる。The average function Fl (j, H) of the number of vertical separations is a function of the character type j1 character height H as shown in the same figure (a), and (a 3
It is expressed as (a 2 ≦H) (7). The standard deviation function F2 (j, H) is obtained from the same figure (b) as [6 (H≦bl) NVσ=J, (b5-b3) (H-b2)/ (b
...(8) It is expressed as: Similarly, it is expressed as ... (9) ... (10).

次のステップ＠では、次式で定義されるノルム（パター
ンの長さに相当する）を求める。In the next step @, the norm (corresponding to the length of the pattern) defined by the following equation is determined.

Ｄ　　ｊ　　＝　　［Ｈｎ”＋Ｗｎｚ＋ＮＶｎ”＋ＮＨ
ｎ２コ　ｌ／２・・・（１１）上記ではＨｎ、Ｗｎ、ＮＶｎ、ＮＨｎの４つの量でノル
ムを表現するようにしているか、そのうちの１つ以上を
用いて表現することができる。D j = [Hn”+Wnz+NVn”+NH
n2 co l/2 (11) In the above, the norm is expressed using four quantities: Hn, Wn, NVn, and NHn, or it can be expressed using one or more of them.

ステップ［相］では、類似度の判定結果と統合して文字
種ｊの総合判定結果Ｔｊを得る。すなわち、相関値であ
る類似度Ｒｊを、次式の如く形状特徴の結果Ｄｊと結合
する。In step [phase], the result of similarity determination is integrated to obtain a comprehensive determination result Tj of character type j. That is, the similarity Rj, which is a correlation value, is combined with the shape feature result Dj as shown in the following equation.

Ｔｊ＝ｗｌ　・Ｒｊ＋ｗ２・　［ｍａｘ　（Ａ”Ｄｊ”
、Ｏ）　　コ　１″　　　　　　　　　　　・・・　（
１２）ここに、ｗｌ、ｗ２は定数で、類似度Ｒｊと形状
特徴の結果Ｄｊの重みを示し、Ａは形状特徴の効果に限
度を与えるための定数である。なお、上式（１２）は、Ｔｊ＝Ｗ１・Ｒｊ＋ｗ２・Ｄｊ　　　・・・（１３）の
ように簡略化することができる。Tj=wl ・Rj+w2・ [max (A"Dj"
, O) ko 1″ ... (
12) Here, wl and w2 are constants indicating the weight of the similarity Rj and the shape feature result Dj, and A is a constant for giving a limit to the effect of the shape feature. Note that the above equation (12) can be simplified as Tj=W1.Rj+w2.Dj (13).

形状特徴を類似度の補助として用いる場合、すなわち類
似字形がなく類似度だけで充分判定が可能で形状特徴を
必要としない字種の場合は、上記多量の標準偏差の値を
実際よりも大きめにしておけぽＤｊの値は非常に小さく
なり、形状特徴の効果を弱めることができる。When using shape features as an aid to similarity, in other words, when there are no similar glyphs, and the similarity alone is enough to determine the character type, and shape features are not required, the value of the large number of standard deviations mentioned above should be set larger than the actual value. The value of Dj becomes very small, and the effect of the shape feature can be weakened.

なお、分離文字をより詳細に表現するには、文字を部分
図形と空白の組み合わせと見做して表わすこともできる
。例えば第５図（イ）に示すように漢字「洲」は６つの
部分図形の大きさと５つの空白の大きさのリストとして
同図（ロ）、（ハ）のように表現し、標準文字サイズで
正規化したときに空白の大きさが１画素以下になったら
潰れるとしてリスト間でパターンマツチングをとるよう
にしても良い。ただし、どの空白から潰れるかは確率的
な問題となるので、パターンマツチングをとる手法が複
雑となる。Note that in order to express the separated characters in more detail, the characters can also be expressed as a combination of partial figures and blank spaces. For example, as shown in Figure 5 (a), the kanji ``zu'' is expressed as a list of six partial figure sizes and five blank sizes as shown in (b) and (c) in the same figure, and the standard character size is If the size of the blank space becomes less than 1 pixel when normalized with , it will collapse, and pattern matching may be performed between the lists. However, since it is a probabilistic problem as to which blank space is to be collapsed, the pattern matching method becomes complicated.

〔Effect of the invention〕

この発明によれぽ、標準文字高さＨｓ、標準文字幅Ｗｓ
は字体間の文字の大きさの違いを補正する量であり、１
つの文書では路間−の字体を用いていることから、字体
間の文字の大きさの違いを詳細かつ正確に判定すること
ができる。これにより、或る文字の字形が他の文字の字
形を縦方向。With this invention, standard character height Hs, standard character width Ws
is the amount to correct the difference in character size between fonts, and 1
Since the two documents use the font ``Rima-'', it is possible to determine the difference in character size between the fonts in detail and accurately. This allows the glyph shape of one character to vertically change the glyph shape of another character.

横方向または縦横両方向に伸張または圧縮して得られる
類似文字であるときも、両者の判別が可能となる。さら
に、字種毎の平均値、標準偏差を大きさのバラツキを評
価するための評価基準に組み込むことにより、判定がよ
り正確となる。Even when similar characters are obtained by expanding or compressing in the horizontal direction or both vertical and horizontal directions, it is possible to distinguish between the two. Furthermore, by incorporating the average value and standard deviation for each character type into the evaluation criteria for evaluating size variations, the determination becomes more accurate.

その結果、片仮名の「口ごと漢字の「口」は文字画像が
相似であるが、大きさを比べると漢字の「口」の方が大
きく、片仮名の「力」と漢字の「力」、片仮名の「工」
と漢字の「工」も同様である。また、英字の小文字と大
文字では「Ｃ」と「Ｃｌ、　ｒＫ」とｒｋ、、　ｒＯ」
とｒｏ」、ｒｐ」と’ＰＪ、　ｒ３．と「Ｓ」、「Ｕ」
とｒｕ」、ｒＶ、と「■」、「Ｗ」と「Ｗ」、「Ｘ」と
「Ｘ」「Ｚ」と「ｚ」が同様である。さらに、英字の「
Ｉ」と漢字の「工」、漢字の「日」と「日」は縦横比を
変えると相似となるが、以上のような類似の文字の判別
が容易となる。As a result, the character images of the kanji ``mouth'' in katakana are similar, but when comparing the sizes, the kanji ``mouth'' is larger; ``Technology''
The same is true for the kanji ``工''. In addition, lowercase and uppercase letters are "C", "Cl, rK", rk,, rO"
and ro'', rp'' and 'PJ, r3. and "S", "U"
The same applies to "and ru", rV, and "■", "W" and "W", "X" and "X", and "Z" and "z". In addition, the alphabet “
The kanji ``I'' and the kanji ``工'' and the kanji ``日'' and ``日'' become similar when the aspect ratio is changed, but it becomes easier to distinguish the similar characters as described above.

また、文字の分離の仕方を文字サイズによって予測し、
その予測結果を基準値と比較して判定するようにすれば
、文字の統合２分離をより正確に判定することができる
。さらに、分離の仕方の字種毎の平均値、標準偏差を大
きさのバラツキを評価するための評価基準に組み込み、
これを文字サイズの近似式または関数として利用すれば
、判定がより正確となる。In addition, the method of separating characters can be predicted based on the font size.
If the prediction result is compared with a reference value for determination, it is possible to more accurately determine whether characters are integrated into two or separated. In addition, we incorporated the average value and standard deviation of the separation method for each type of character into the evaluation criteria for evaluating the variation in size.
If this is used as an approximation formula or function for character size, the determination will be more accurate.

その結果、「二」のように確実に「縦方向に２つに分離
するｊ文字は勿論のこと、ｒは」のように常に「横方向
に２つに分離する」とは限らない文字種についても統合
２分離判定が容易となる。As a result, for character types that are not always separated into two horizontally such as ``r'', as well as ``j'' that is reliably separated into two vertically, such as ``ni''. Also, it becomes easy to make an integrated and two-separate judgment.

[Brief explanation of drawings]

第１図はこの発明の実施例を示すフローチャート、第２
図はサンプルデータおよび文書データを説明するための
説明図、第３図は辞書データを説明するための説明図、
第４図は縦分離数、横分離数の各平均、標準偏差を説明
するための説明図、第５図は或る文字の一例とその文字
分離リストを説明するための説明図、第６図は提案済み
方法１を説明するためのフローチャート、第７図は提案
済み方法ｌを具体的に説明するための説明図、第８図１
ま提案済み方法２を説明するためのフローチャート、第
９図および第１０図はいずれも提案済み方法２を具体的
に説明するための説明図である。符号説明１・−・統合文字、２．３・・・分離文字。FIG. 1 is a flowchart showing an embodiment of the invention, and FIG.
The figure is an explanatory diagram for explaining sample data and document data, FIG. 3 is an explanatory diagram for explaining dictionary data,
Figure 4 is an explanatory diagram to explain the average and standard deviation of the number of vertical separations and the number of horizontal separations; Figure 5 is an explanatory diagram to explain an example of a certain character and its character separation list; Figure 6 is a flowchart for explaining proposed method 1, FIG. 7 is an explanatory diagram for concretely explaining proposed method 1, and FIG. 8 is a flowchart for explaining proposed method 1.
The flow chart for explaining the proposed method 2, and FIGS. 9 and 10 are explanatory diagrams for specifically explaining the proposed method 2. Code explanation 1.--Integrated character, 2.3.--Separation character.

Claims

[Claims] 1) When recognizing characters by image processing a document to cut out individual characters and determining the degree of similarity with a dictionary pattern for each character, the character width, character height of the character to be recognized, area of circumscribed rectangle,
For at least one of the aspect ratios, the ratio of each corresponding amount of standard characters in the document containing the character to be recognized is determined, and this is compared with a predetermined reference value to determine the character of a certain character. A character recognition method, characterized in that it facilitates discrimination when a character shape is similar to a character obtained by expanding or compressing the shape of another character vertically, horizontally, or both vertically and horizontally. 2) An average value, a standard deviation, or a quantity derived from these of each of the ratios is determined, and a reference value for each of the ratios is determined in advance and is stored in a predetermined memory as a dictionary. ) Character recognition method described in 3) Image processing a document to extract individual characters and calculate the similarity of each character with a dictionary pattern to predict the number of vertical and horizontal separations of characters to be recognized based on the character size. A character recognition method characterized in that by comparing this with a reference value, it is easy to determine when a certain character has a shape that is likely to become a separated character. 4) The average value and standard deviation of the number of vertical separations and the number of horizontal separations, or quantities derived from these are determined, and reference values for each of them are determined in advance and stored as a dictionary in a predetermined memory. The character recognition method according to claim 3).