JPH03260886A - Character recognizing method - Google Patents

Character recognizing method

Info

Publication number
JPH03260886A
JPH03260886A JP2057995A JP5799590A JPH03260886A JP H03260886 A JPH03260886 A JP H03260886A JP 2057995 A JP2057995 A JP 2057995A JP 5799590 A JP5799590 A JP 5799590A JP H03260886 A JPH03260886 A JP H03260886A
Authority
JP
Japan
Prior art keywords
character
characters
standard
width
recognized
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2057995A
Other languages
Japanese (ja)
Inventor
Tetsuo Kiuchi
木内 哲夫
Kazushi Yoshida
收志 吉田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuji Electric Co Ltd
Fuji Facom Corp
Original Assignee
Fuji Electric Co Ltd
Fuji Facom Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuji Electric Co Ltd, Fuji Facom Corp filed Critical Fuji Electric Co Ltd
Priority to JP2057995A priority Critical patent/JPH03260886A/en
Publication of JPH03260886A publication Critical patent/JPH03260886A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To improve the rate of recognition by calculating a ratio to a standard character in a document including a character to be recognized and comparing this ratio with a reference value calculated in advance concerning at least one of the character width, character height, area of circumscribed rectangle and aspect ratio of the character to be recognized. CONSTITUTION:Remaining difference Hn of the normalized character height to be defined by expressions I and II, and remaining difference Wn of the normalized character width is calculated. In the expressions I and II, H and W show the character height and character width to be measured from the character picture, Hs and Ws show the standard character height and standard character width in the II document, Hmuj and Hdeltaj show the average value and standard deviation of the ratio to the standard character height for each character form of a character type (j), and Wmuj and Wdeltaj show the mean value and standard deviation of the ratio to the standard character height for each character form of this character type (j). Next, remaining difference NVn of a normalized longitudinal discrete number and remaining difference NHn of a lateral discrete number is calculated. Namely, concerning one of the character width, character height, area of the circumscribed rectangle and aspect ratio of the character to be recognized at least, the ratio to the respectively corresponding amount of the standard character in the document including the character to be recognized is estimated from the character size. Thus, recognition is enabled with high accuracy.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 この発明は、文字やマークなど(以下、単に文字等とい
う)を認識するための文字認識方法、特に高精度な文字
認識が可能な文字認識方法に関する。
[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to a character recognition method for recognizing characters, marks, etc. (hereinafter simply referred to as characters, etc.), and in particular a character recognition method capable of highly accurate character recognition. Regarding the method.

〔従来の技術〕[Conventional technology]

従来、この種の方法として、例えば文書画像から文字行
または文字列(以下、単に文字行という)を切り出した
後、文字らしきものを仮文字として抽出し、この仮文字
から所定の文字サイズを基準として全角文字(漢字等)
を切り出し、文字サイズから全角文字と確定できないも
のは、半角文字として有効に成立し得る文字か否かを調
べ、取立すれば半角文字として認識する方法がある。し
かし、このような方法では文字によって全角文字が2つ
以上の半角文字として誤E?21にされるおそれがある
。そこで、出願人は次のような方法を提案している(特
願昭63−292445号;提案済み方法1とも云う)
Conventionally, as a method of this kind, for example, after cutting out a character line or character string (hereinafter simply referred to as a character line) from a document image, what looks like characters is extracted as a temporary character, and a predetermined character size is set as a standard from this temporary character. as full-width characters (kanji, etc.)
If a character cannot be determined to be a full-width character based on the font size, it can be checked to see if it can be effectively used as a half-width character, and then recognized as a half-width character. However, with this method, depending on the character, a full-width character may be mistakenly treated as two or more half-width characters. There is a risk of being made 21. Therefore, the applicant has proposed the following method (Japanese Patent Application No. 63-292445; also referred to as Proposed Method 1).
.

第6図は提案済みの方法1を説明するためのフローチャ
ートである。
FIG. 6 is a flowchart for explaining proposed method 1.

これは、図示されない画像処理装置の処理手順を示すも
ので、まず文書画像データを入力しく■参照)、その水
平方向の投影値をとることにより、各文字行を切り出す
(■参照)。これにより、行の幅寸法を求め、全角文字
の大きさに相当する量(文字サイズ)を得る。なお、こ
こでは横書きの場合を想定しているが、縦書きの場合も
同様である。
This shows the processing procedure of an image processing device (not shown). First, document image data is input (see (2)), and each character line is cut out by taking its horizontal projection value (see (2)). As a result, the width of the line is determined, and the amount (character size) corresponding to the size of a full-width character is obtained. Note that although horizontal writing is assumed here, the same applies to vertical writing.

次に、各行に垂直な方向の投影値を調べ、文字サイズを
考慮することにより、各文字行から文字らしきもの、す
なわち仮文字群を切り出しく■参照)、シかる後この仮
文字群の中から上記文字サイズを利用して全角文字を選
出する(■参照)。
Next, by examining the projection value in the direction perpendicular to each line and taking the character size into account, we cut out what appears to be characters, that is, a group of temporary characters, from each line of characters (see ■). Select full-width characters using the above font size (see ■).

全角文字として選出する条件は次のとおりである。The conditions for selecting characters as full-width characters are as follows.

イ)それ単独で文字サイズが全角文字サイズのもの、即
ち他の仮文字と結合する余地の全くないもの。
b) The font size is full-width font size by itself, that is, there is no room for combining with other temporary characters.

口)句読点。oral) punctuation.

ハ)それ単独では半角サイズであるが、隣り合う他の半
角サイズの仮文字と結合させてみると全角サイズとなる
もの。
c) It is a half-width size by itself, but becomes a full-width size when combined with other adjacent half-width temporary characters.

二)それ単独ではサイズが全角サイズよりも小さいが、
隣り合う他の半角サイズの仮文字との間に距離があり過
ぎ、これらを無理に結合させると全角文字サイズをこえ
るもの。
2) Although the size alone is smaller than the full-width size,
There is too much distance between adjacent half-width temporary characters, and if you forcefully combine them, the size will exceed the full-width character size.

以上の如き条件に従って全角文字を全て選出した後、あ
とに残った仮文字について、これを統合または分離して
統合文字1分離文字を作威しく0参照)、シかる後これ
らの統合文字1分離文字をOCR(文字読取装置)によ
り、辞書パターンとの類似度を利用して認識する(■参
照)。
After selecting all full-width characters according to the above conditions, merge or separate the remaining temporary characters (refer to 1 integrated character 1 separated character (0)), and then combine these integrated characters 1 separated character. Characters are recognized using an OCR (character reading device) based on their similarity to dictionary patterns (see ■).

次に、その認識結果に対して次のような矛盾処理を実行
する(■参照)。
Next, the following contradiction processing is performed on the recognition result (see ■).

a)例えば認識すべき対象が分離文字であるにも関わら
ず、OCRによる認識結果が全角サイズの漢字を示すも
のとすれば互いに矛盾するので、かかる認識結果は採用
しない。
a) For example, even though the target to be recognized is a separated character, if the recognition result by OCR indicates a full-width kanji character, this recognition result is not adopted because it contradicts each other.

b)上記とは逆に、認識すべき対象か統合文字であるに
もかかわらず、OCRによる認識結果が英字、数字等の
半角サイズ文字を示す場合。
b) Contrary to the above, when the recognition result by OCR shows half-width size characters such as alphabets and numbers, even though the target to be recognized is an integrated character.

そして、最後に残された文字につき、これを統合文字と
すべきか分離文字とすべきかを、OCRにより相対類似
度を用いて判別する(■参照)。
Then, regarding the last remaining character, whether it should be used as an integrated character or a separated character is determined by OCR using relative similarity (see ■).

なお、類似度xlは、類似度Xと類似度の平均値mとの
比に、或る定数(例えば、1024)を掛けたものとし
て定義する。すなわち、xl=x/m一定数(1024
) である。
Note that the degree of similarity xl is defined as the ratio of the degree of similarity X and the average value m of the degrees of similarity multiplied by a certain constant (for example, 1024). That is, xl=x/m constant number (1024
).

上述の方法では類似度から認識結果を決定するようにし
ているので、例えば「う」と「ろ」、「テ」と「チjの
如く非常に似ているものは、文字画像の僅かな違い、例
えば文字等の太さや字体の相違、文字等のつぶれやかす
れ等により誤認識するおそれがある。また、同様の理由
から、「は」の相対類似度よりも「(jと「よ」の部分
の相対MifJ度の方が高くなり、f4 認識してしま
うことがある。このことにつき、以下に具体的に説明す
る。
In the above method, recognition results are determined based on the degree of similarity, so for example, when characters are very similar, such as ``u'' and ``ro'', or ``te'' and ``chij'', slight differences in the character images may be detected. , for example, there is a risk of misrecognition due to differences in the thickness and font of the characters, crushed or blurred characters, etc. Also, for the same reason, the relative similarity of ``(j and yo'' is more important than the relative similarity of ``wa''). The relative MifJ degree of the part may be higher and f4 may be recognized.This will be explained in detail below.

第7図は入力文字「は」について、これを統合文字1と
して処理した場合と、分離文字2.3として処理した場
合の類似度X、相対類似度xiを示すものである。すな
わち、類似度X、類似度の平均値mとが図示の如く得ら
れたものとすると、類似度とその平均値との比に一定数
(1024)を乗じて得られる相対類似度は、統合文字
1の場合はr954.、分離文字2,3の場合はそれぞ
れ「10283.  「898.でその平均値は「96
3」となり、 954<963 であることから、入力文字「は」は「(」(前括弧)と
「はヨからなるものとして誤認識されることになる。な
お、定数はr1024」に限らないことは勿論である。
FIG. 7 shows the similarity X and relative similarity xi of the input character "ha" when it is processed as an integrated character 1 and when it is processed as a separated character 2.3. That is, assuming that the similarity X and the average value m of the similarity are obtained as shown in the figure, the relative similarity obtained by multiplying the ratio of the similarity and the average value by a constant number (1024) is the integrated For character 1, r954. , in the case of separator characters 2 and 3, respectively "10283."898.The average value is "96.
3", and since 954<963, the input character "wa" will be misrecognized as consisting of "(" (front parenthesis) and "hayo".The constant is not limited to "r1024". Of course.

このような問題を解決するため、出願人はさらに次のよ
うな方法を提案している(特願平1−39308号;提
案済み方法2ともいう)。
In order to solve these problems, the applicant has further proposed the following method (Japanese Patent Application No. 1-39308; also referred to as Proposed Method 2).

第8図は提案済み方法2を説明するためのフローチャー
トで、第6図に示すものに対し形状特徴照合ステップ[
相]〜[相]を付加した点が特徴である。
FIG. 8 is a flowchart for explaining proposed method 2, in which the shape feature matching step [
It is characterized by the addition of [phase] to [phase].

この方法の要点は、認識すべき文字の縦横比と縦方向お
よび横方向にそれぞれいくつに分離できるかを示す形状
特徴量も辞書として予め記憶しておき、入力文字の形状
特徴量をその認識結果と対応する候補文字の予め記憶さ
れている形状特徴量と比較して、類似度に基づく認識結
果を採用するか否かを決定するようにした点にある。
The key point of this method is that the aspect ratio of the character to be recognized and the shape features that indicate how many parts can be separated in the vertical and horizontal directions are also stored in advance as a dictionary, and the shape features of the input character are used as the recognition results. The present invention is characterized in that it is determined whether or not to adopt a recognition result based on the degree of similarity by comparing the shape feature amount of the corresponding candidate character with the previously stored shape feature amount.

この例につき、以下に具体的に説明する。This example will be specifically explained below.

第9図は入力画像(または入力文字)が「う」で、人力
画像から得られる形状特徴は「縦長で、縦方向に2つに
分離する」であり、これに対して認識結果の候補文字の
第1〜第10位のうちの第1位が「ろ」となった例であ
る。この場合、辞書に予め記憶されている「ろ」の形状
特徴は「縦長でも横長でもなく、縦横に分離しない」で
ある。
In Figure 9, the input image (or input character) is ``u'', and the shape feature obtained from the human image is ``vertically long and separated into two vertically'', and the recognition result candidate character This is an example in which the first place among the first to tenth places is "ro". In this case, the shape characteristics of "ro" stored in the dictionary in advance are "neither vertically nor horizontally long, nor separated vertically and horizontally."

従って、縦長・横長チエツクのステップ(第8図[相]
参照)でNo”となるため、第2位の候補文字のチエツ
クが行なわれる。第2位の候補文字は「う」であり、形
状特徴量が一致するので、これを認識結果として採用す
る。
Therefore, the step of vertical/horizontal check (Fig. 8 [phase]
), the second candidate character is checked.The second candidate character is "u", and since the shape features match, this is adopted as the recognition result.

第10図は入力文字「は」を統合文字として処理する場
合と、分離文字r(、、rよ」として処理する場合の例
である。すなわち、同図(イ)の如く統合文字として処
理する場合は、認識結果の第1候補「は」が形状特徴量
の点からも適合するので、「はjが採用される。同様に
、分離文字「(」は同図(ロ)の如く第1位候補文字「
(」と適合するが、分離文字「よ」は同図(ハ)の如く
、その形状特徴量から第1位候補文字「は」とは適合せ
ず、結局第2位の候補文字「まJと適合する。
Figure 10 shows an example where the input character "wa" is processed as an integrated character and when it is processed as a separate character r (,, ryo). In other words, it is processed as an integrated character as shown in (a) of the same figure. In this case, the first candidate "ha" in the recognition result is also suitable from the point of view of the shape feature, so "haj" is adopted.Similarly, the separated character "(" is the first candidate as shown in (b) of the same figure. Position candidate character ``
However, as shown in the same figure (c), the separated character ``yo'' does not match the first candidate character ``ha'' due to its shape features, and in the end, the second candidate character ``maJ'' Compatible with

そし=、同図(ニ)に示す如く、「は」として認識した
ときの相対類似度はr954」、「(」と「ま」に分離
されるものとしたときの相対類似度はrc+46」とな
り、 946<954 から1は」が採用されることになる。つまり、入力文字
は「は」と認識され、「(」と「ま」に誤認識されるお
それをなくすことができる。
As shown in the same figure (d), the relative similarity when recognized as "wa" is r954, and the relative similarity when it is separated into "(" and "ma" is rc+46). , 946<954, then 1 is adopted.In other words, the input character is recognized as ``ha'', and the possibility of it being mistakenly recognized as ``('' and ``ma'' can be eliminated).

〔発明が解決しようとする課題〕[Problem to be solved by the invention]

上述の形状特徴は限定された条件では有効であるが、−
船釣に扱おうとすると困難である。例えば、文字の縦横
比は字体によって異なるため縦横比が2倍以上必要とさ
れることから、「−」や「1」のように適用可能な字種
が限定されることになる。例えば、片仮名の「口」と漢
字の「口」は文字画像が相似であるが、大きさを比べる
と漢字の「日二の方が大きく、片仮名の「力」と漢字の
1カニ、片仮名の「工jと漢字のr工、も同様である。
Although the above shape features are effective under limited conditions, −
It is difficult to use it for boat fishing. For example, since the aspect ratio of a character differs depending on the font, the aspect ratio must be twice or more, which limits the types of characters that can be applied, such as "-" and "1". For example, the character images of the katakana ``口'' and the kanji ``mouth'' are similar, but when comparing the sizes, the kanji ``日二'' is larger; ``The same is true for the kanji ``ko j'' and the kanji ``r''.

また、英字の小文字と大文字ではr(、とrc4. r
K」とrk、、 r□、と70;、 rp。
Also, for lowercase and uppercase letters, r(, and rc4.r
K'' and rk,, r□, and 70;, rp.

と7PJ、 ’Sjと’SJ、 ’UJとrtB、rV
ごと’v二、rJ、とrWJ、 7XJと「X−1「Z
:とzBが同様である。さらに、英字の「I」と漢字の
「工」、漢字の「8二とr日二は縦横比を変えると相似
となる。
and 7PJ, 'Sj and 'SJ, 'UJ and rtB, rV
Goto'v2, rJ, and rWJ, 7XJ and "X-1"Z
: and zB are similar. Furthermore, the English letter ``I'', the kanji ``工'', and the kanji ``82'' and ``r日2'' become similar when the aspect ratio is changed.

一方、分離数については「二jは確実に1@1方向に2
つに分離する」が、「は」は常に「横方向に2つに分離
する」とは限らない。つまり、文字種により分離数を確
実に指定することはできない。
On the other hand, regarding the number of separations, ``2j is definitely 1@2 in one direction.
"to separate into two", but "wa" does not always mean "to separate into two laterally". In other words, it is not possible to reliably specify the number of separations depending on the character type.

このように、文字画像のノイズは勿論のこと、文字の傾
き1文字の大きさ(級数)、変形率等が分離数に影響す
るので、やはり適用可能な字種が限定されることになる
In this way, the number of separations is affected not only by noise in the character image, but also by the inclination of the characters, the size (series) of each character, the deformation rate, etc., and thus the applicable character types are limited.

したがって、この発明の目的は形状特徴を広く適用でき
るように改良して以上の如き欠点を除去し、認識率を向
上させることにある。
Therefore, an object of the present invention is to improve the shape features so that they can be widely applied, eliminate the above-mentioned drawbacks, and improve the recognition rate.

〔課題を解決するための手段〕[Means to solve the problem]

文書を画像処理して個々の文字を切り出し各文字毎に辞
書パターンとの類似度を求めて文字を認識するに当たり
、認識すべき文字の文字幅1文字高さ、外接矩形の面積
、縦横比の少なくとも1つについて、認識すべき文字を
含む文書内の標準的な文字との比率を求め、これを予め
求めておいた基準値と比較することにより、或る文字の
字形が他の文字の字形を縦方向、横方向または縦横両方
向に伸張または圧縮して得られる文字と類似するときの
判別を容易にする。このとき、前記各比率の平均値、標
準偏差またはこれらから導かれる量を求めるとともに、
その各々に対する基準値を予め求めてこれを辞書として
所定のメモリに記憶しておくことができる。
When recognizing characters by image processing a document, cutting out individual characters, and determining the similarity of each character with a dictionary pattern, we calculate the character width, character height, circumscribed rectangle area, and aspect ratio of the character to be recognized. By determining the ratio of at least one of the characters to standard characters in the document containing the character to be recognized and comparing this with a predetermined reference value, the glyph shape of a certain character can be determined from the glyph shape of another character. To facilitate discrimination when similar to characters obtained by expanding or compressing in the vertical direction, horizontal direction, or both vertical and horizontal directions. At this time, find the average value, standard deviation, or amount derived from these for each ratio, and
Reference values for each of these can be determined in advance and stored in a predetermined memory as a dictionary.

また、文書を画像処理して個々の文字を切り出し各文字
毎に辞書パターンとの類似度を求めて文字を認識するに
当たり、認識すべき文字の縦置離数、横分離数をその文
字サイズから予測し、これを基準値と比較することによ
り、或る文字が分離文字になり易い字形であるときの判
別を容易にする。このとき、前記縦分離数、横分離数の
平均値。
In addition, when recognizing characters by image processing a document, cutting out individual characters, and determining the similarity of each character with a dictionary pattern, we can calculate the number of vertical and horizontal separations of the characters to be recognized based on the character size. By predicting and comparing this with a reference value, it is easy to determine when a certain character has a shape that is likely to become a separated character. At this time, the average value of the number of vertical separations and the number of horizontal separations.

標準偏差またはこれらから導かれる量を求めるとともに
、その各々に対する基準値を予め求めてこれを辞書とし
て所定のメモリに記憶しておくことができる。
In addition to determining standard deviations or quantities derived therefrom, reference values for each of them can be determined in advance and stored in a predetermined memory as a dictionary.

〔作用〕[Effect]

一般に、例えば片仮名の「口」と漢字の1口」を判別す
る場合、明朝体を除いてはその文字単独では困難である
。しかし、片仮名の「口、と漢字の「口」の2つの文字
を比較できれば、小さい方が片仮名の「口jで、大きい
方が漢字の「口」である。一般の文書を読んでいる場合
、そのような直接比較は出来ないが、他の文字と比較し
てその文字が大きめのものに属するか、小さめのものに
属するかで判定することができる。すなわち、その文字
の標準文字サイズ(変形率も含めて)を知り、文字画像
の大きさを量的に比較することにより、類似文字の判定
を正確に行なうことができる。
In general, for example, it is difficult to distinguish between the katakana ``kuchi'' and the kanji ``ichikuchi'' based on the characters alone, except in Mincho fonts. However, if we can compare the two characters ``guchi'' in katakana and ``mouth'' in kanji, the smaller one is the katakana ``guchi j'' and the larger one is the kanji ``mouth''. When reading a general document, such direct comparison is not possible, but it is possible to determine whether a character belongs to a larger or smaller size by comparing it with other characters. That is, by knowing the standard character size (including the deformation rate) of the character and quantitatively comparing the sizes of character images, it is possible to accurately determine similar characters.

また、文字サイズが小さくなって0に近づくと、どんな
文字でも文字画像は潰れて塊になるので、分離数は1に
収束し、文字サイズが大きくなると、文字は潰れにくく
なって分離数は増加傾向となり、さらに大きくなると分
離数はその字種の分離し得る最大値に収束する。そこで
、分離文字を文字サイズの連続関数とみなし、分離数の
期待値からどれくらい外れているかを求めることにより
、統合文字2分離文字の判定を正確に行なうことができ
る。
Also, as the font size decreases and approaches 0, the character image of any character becomes crushed and lumped, so the number of separations converges to 1, and as the font size increases, the characters become less likely to be crushed and the number of separations increases. As the number of separations increases further, the number of separations converges to the maximum value that can be separated for that character type. Therefore, by regarding the separated characters as a continuous function of the character size and determining how much the number of separated characters deviates from the expected value, it is possible to accurately determine whether the integrated character is a separated character or not.

つまり、認識すべき文字の文字幅2文字高さ。In other words, the character width and height of the character to be recognized.

外接矩形の面積、縦横比の少なくとも1つについての認
識すべき文字を含む文書内の標準的な文字の各対応する
量との比率、または縦分離数、横分離数をその文字サイ
ズから予測することにより、より高精度の認識を可能に
するものである。
Predict the area of a circumscribing rectangle, the ratio of at least one of the aspect ratios to the corresponding amount of standard characters in a document containing the character to be recognized, or the number of vertical and horizontal separations from the character size. This enables more accurate recognition.

〔実施例〕〔Example〕

第1図はこの発明の実施例を示すフローチャートである
。なお、ステップ■〜■は従来と同様なので、ステップ
■〜[相]について以下に説明する。
FIG. 1 is a flowchart showing an embodiment of the invention. Incidentally, steps ① to ① are the same as those of the conventional method, so steps ② to [phase] will be explained below.

まず、次式で定義される正規化した文字高さの残差Hn
、および正規化した文字幅の残差Wnを求める(■、[
相]参照)。
First, the normalized character height residual Hn defined by the following formula
, and find the residual Wn of the normalized character width (■, [
phase]).

二二に、H,Wは文字画像から測定した文字高さ5文字
幅である。第2図(イ)に入力画像のサンプルデータを
示す。Hs、Wsは入力文書(文章2段落1行)の標準
文字高さ、標準文字幅で、第2図(ロ)にそのデータを
文書データとして示している。或る字体の標準文字高さ
、標準文字幅シよその字体の最大文字高さ、最大文字幅
であり、その字体の漢字の最大文字高さ、最大文字幅と
考えて差支えない。なお、標準文字サイズは字体や変形
率1級数等の違いによって生しる誤差を吸収するための
もので、安定な値が得られるのであれば最大値に限らず
、平均値または最頻値でも良いことは勿論である。入力
文書の標準文字高さ、標単文字幅は文書を読取る前に外
部から指定されるか、読取り中に周囲の文字から統計的
に求めることができるので、その詳細は省略する。
22. H and W are the character height and 5 character width measured from the character image. FIG. 2(a) shows sample data of the input image. Hs and Ws are the standard character height and standard character width of the input document (two paragraphs of text, one line), and the data is shown as document data in FIG. 2 (b). The standard character height and standard character width of a certain font are the maximum character height and maximum character width of other fonts, and can be thought of as the maximum character height and maximum character width of Kanji characters of that font. Note that the standard font size is used to absorb errors caused by differences in font, deformation rate, etc., and if a stable value can be obtained, it is not limited to the maximum value, but can also be the average value or mode. Of course it's a good thing. The standard character height and standard character width of the input document can be specified externally before reading the document, or can be statistically determined from surrounding characters during reading, so their details will be omitted.

また、標準文字高さHs、標準文字幅Wsは文字級数を
Q1垂直変形率をαV、水平変形率をαh、レターフェ
ースと正体の標準文字高さ、標準文字幅との比率をそれ
ぞれβV、βhとすれば、次式の関係にある。
In addition, for the standard character height Hs and standard character width Ws, the character series is Q1, the vertical deformation rate is αV, the horizontal deformation rate is αh, and the ratios between the letter face and the original standard character height and standard character width are βV and βh, respectively. Then, we have the following relationship.

Hs=β7.Q、αV       ・・・(3)W 
s =βh−にlαh        =(4jHμj
、Hσjは字種jの字体毎の標準文字高さとの比率の平
均値、標準偏差を示し、W、czj。
Hs=β7. Q, αV...(3)W
s = βh− to lαh = (4jHμj
, Hσj represent the average value and standard deviation of the ratio of character type j to the standard character height for each font, and W, czzj.

Wσjは字種jの字体毎の標準文字高さとの比率の平均
値、標準偏差を示す。これらの求め方につき、以下に説
明する。
Wσj represents the average value and standard deviation of the ratio of character type j to the standard character height for each font. How to obtain these values will be explained below.

例えば、ゴシック体と明朝体とでは、同し級数の同し字
種を比べると、一般に明朝体の方が大きい。片仮名の「
口jと漢字の「口」とを大きさで判別するため、このよ
うな字体間の文字サイズの違いを補正しておくことが望
ましい。そこで、例えばゴシック体13級の「あ」を、
ゴシック体13級の標準文字高さで割った値を求める。
For example, when comparing Gothic and Mincho fonts with the same character type in the same series, Mincho fonts are generally larger. Katakana “
In order to distinguish between 口 j and the kanji character ``口'' based on their size, it is desirable to correct this difference in character size between fonts. So, for example, "a" in Gothic grade 13,
Find the value divided by the standard character height of Gothic grade 13.

次に、明朝体16級の「あ」を、明朝体16級の標準文
字高さで割った値を求める。このようにして、次々と「
あ」について文字セント(字体)毎の標準文字高さとの
比率を求め、これらから平均値「Hμあ」と標準偏差「
Hσあ:が求められる。これらHuj、Hσj、Huj
、Wσjは、文書を読取るに先立って読取り辞書を作成
するとともに計算しておき、字種jによって簡単に引け
るように表にして第3図の如く記憶しておく。なお、第
3図は字種に1の特徴量を詳細に示しているが、他の字
種についても同様である。また、以上では文字高さ1文
字幅を用いたが、これと等価な外接矩形と縦横比を用い
ても良い。
Next, calculate the value of "A" in the 16th grade Mincho typeface divided by the standard character height of the 16th grade Mincho typeface. In this way, one after another,
Find the ratio of standard character height for each character cent (font) for ``A'', and calculate the average value ``HμA'' and the standard deviation ``A'' from these.
Hσa: is required. These Huj, Hσj, Huj
, Wσj are calculated while creating a reading dictionary prior to reading the document, and are stored in a table as shown in FIG. 3 so that they can be easily retrieved by character type j. Note that although FIG. 3 shows the feature amount for character type 1 in detail, the same applies to other character types. Furthermore, although character height and one character width are used in the above example, a circumscribing rectangle and aspect ratio equivalent to this may be used.

次に、次式の如く定義される正規化した縦分離数の残差
NVn、および正規化した横分離数の残差NHnを求め
る(■、a2)参照)。
Next, the residual difference NVn of the normalized number of vertical separations and the residual difference NHn of the normalized number of horizontal separations are determined as defined by the following equation (see ■, a2).

NV σ j 二二に、NV、NHは入力文字画像から測定した縦置離
数、横分離数で、これらは、 縦分離数の平均  二N■μj=F1 (L  H)縦
分離数の標準偏差二NVσj=F2 (j、H)横分離
数の平均  :NHμj=F3 (j、W)横分離数の
標準偏差:NHtjj=F・4 (j、W)のように字
種jと文字幅W1文字高さHとの関数である。
NV σ j Second, NV and NH are the number of vertical separations and the number of horizontal separations measured from the input character image, and these are the average of the number of vertical separations 2Nμj=F1 (L H) Standard of the number of vertical separations Deviation2 NVσj=F2 (j, H) Average number of horizontal separations: NHμj=F3 (j, W) Standard deviation of number of horizontal separations: NHtjj=F・4 (j, W) for character type j and character width W1 is a function of character height H.

例えシヨ、ゴシック体10級の「は」の横分離数は「2
」、教科書体10級の「は」の横分離数は「1」、明朝
体10級の「は」の横分離数はF2」であったとすると
、その平均値と標準偏差は、NHμは=F3(は、10
級) =、(2+1+2)/3=1.667 NHσは=F4(は、10級) = [(4+1+4)/3− (1,667)”] ”
2=0.47 となる。このようにして、10級のもっと多くの字体に
対してrNHμはJ、rNHσはコを求める。次に、1
2級についても同様にしてrNHμはj+  rNHσ
は」を求め、以後同様の操作を繰り返し、補間法により
Fl(は、  H)、F2 eまH)、F3(は、W)
、F4(は W)を求める。
For example, the horizontal separation number of "ha" in Gothic grade 10 is "2".
'', the number of horizontal separations for ``ha'' in the 10th grade textbook is ``1'', and the number of horizontal separations for ``ha'' in the 10th grade Mincho font is F2'', then the average value and standard deviation are, NHμ is =F3(is, 10
class) =, (2+1+2)/3 = 1.667 NHσ = F4 (is, 10th class) = [(4+1+4)/3- (1,667)”] ”
2=0.47. In this way, rNHμ is determined to be J, and rNHσ is determined to be C for more fonts in grade 10. Next, 1
Similarly for the secondary level, rNHμ is j + rNHσ
After that, repeat the same operation and use the interpolation method to obtain Fl (ha, H), F2 e ma H), F3 (ha, W)
, F4 (is W).

そして、全ての字種についてかかる操作を実施すれば、
関数Fl (j、H)、F2 (j、H)、F3 (j
、W) 、F4 (j、W)が求められることになる。
Then, if you perform this operation for all character types,
Function Fl (j, H), F2 (j, H), F3 (j
, W) , F4 (j, W) will be found.

第4図に縦分離数の平均関数とその標準偏差F1、F2
および横分離数の平均関数とその標準偏差F3.F4の
例を示す。
Figure 4 shows the average function of the number of vertical separations and its standard deviations F1 and F2.
and the average function of the number of lateral separations and its standard deviation F3. An example of F4 is shown.

縦分離数の平均関数Fl (j、H)は同図(イ)のよ
うに文字種j1文字高さHの関数であり、(a 3  
         (a 2≦H)(7) で表わされる。その標準偏差の関数F2 (j、H)は
同図(ロ)から、 [6 (H≦bl) NVσ=J、 (b5−b3) (H−b2)/ (b
・・・ (8) で表わされる。同様に、 ・・・ (9) ・・・ (10) で表わされる。
The average function Fl (j, H) of the number of vertical separations is a function of the character type j1 character height H as shown in the same figure (a), and (a 3
It is expressed as (a 2 ≦H) (7). The standard deviation function F2 (j, H) is obtained from the same figure (b) as [6 (H≦bl) NVσ=J, (b5-b3) (H-b2)/ (b
...(8) It is expressed as: Similarly, it is expressed as ... (9) ... (10).

次のステップ@では、次式で定義されるノルム(パター
ンの長さに相当する)を求める。
In the next step @, the norm (corresponding to the length of the pattern) defined by the following equation is determined.

D  j  =  [Hn”+Wnz+NVn”+NH
n2コ l/2・・・(11) 上記ではHn、Wn、NVn、NHnの4つの量でノル
ムを表現するようにしているか、そのうちの1つ以上を
用いて表現することができる。
D j = [Hn”+Wnz+NVn”+NH
n2 co l/2 (11) In the above, the norm is expressed using four quantities: Hn, Wn, NVn, and NHn, or it can be expressed using one or more of them.

ステップ[相]では、類似度の判定結果と統合して文字
種jの総合判定結果Tjを得る。すなわち、相関値であ
る類似度Rjを、次式の如く形状特徴の結果Djと結合
する。
In step [phase], the result of similarity determination is integrated to obtain a comprehensive determination result Tj of character type j. That is, the similarity Rj, which is a correlation value, is combined with the shape feature result Dj as shown in the following equation.

Tj=wl ・Rj+w2・ [max (A”Dj”
、O)  コ 1″           ・・・ (
12)ここに、wl、w2は定数で、類似度Rjと形状
特徴の結果Djの重みを示し、Aは形状特徴の効果に限
度を与えるための定数である。なお、上式(12)は、 Tj=W1・Rj+w2・Dj   ・・・(13)の
ように簡略化することができる。
Tj=wl ・Rj+w2・ [max (A"Dj"
, O) ko 1″ ... (
12) Here, wl and w2 are constants indicating the weight of the similarity Rj and the shape feature result Dj, and A is a constant for giving a limit to the effect of the shape feature. Note that the above equation (12) can be simplified as Tj=W1.Rj+w2.Dj (13).

形状特徴を類似度の補助として用いる場合、すなわち類
似字形がなく類似度だけで充分判定が可能で形状特徴を
必要としない字種の場合は、上記多量の標準偏差の値を
実際よりも大きめにしておけぽDjの値は非常に小さく
なり、形状特徴の効果を弱めることができる。
When using shape features as an aid to similarity, in other words, when there are no similar glyphs, and the similarity alone is enough to determine the character type, and shape features are not required, the value of the large number of standard deviations mentioned above should be set larger than the actual value. The value of Dj becomes very small, and the effect of the shape feature can be weakened.

なお、分離文字をより詳細に表現するには、文字を部分
図形と空白の組み合わせと見做して表わすこともできる
。例えば第5図(イ)に示すように漢字「洲」は6つの
部分図形の大きさと5つの空白の大きさのリストとして
同図(ロ)、(ハ)のように表現し、標準文字サイズで
正規化したときに空白の大きさが1画素以下になったら
潰れるとしてリスト間でパターンマツチングをとるよう
にしても良い。ただし、どの空白から潰れるかは確率的
な問題となるので、パターンマツチングをとる手法が複
雑となる。
Note that in order to express the separated characters in more detail, the characters can also be expressed as a combination of partial figures and blank spaces. For example, as shown in Figure 5 (a), the kanji ``zu'' is expressed as a list of six partial figure sizes and five blank sizes as shown in (b) and (c) in the same figure, and the standard character size is If the size of the blank space becomes less than 1 pixel when normalized with , it will collapse, and pattern matching may be performed between the lists. However, since it is a probabilistic problem as to which blank space is to be collapsed, the pattern matching method becomes complicated.

〔発明の効果〕〔Effect of the invention〕

この発明によれぽ、標準文字高さHs、標準文字幅Ws
は字体間の文字の大きさの違いを補正する量であり、1
つの文書では路間−の字体を用いていることから、字体
間の文字の大きさの違いを詳細かつ正確に判定すること
ができる。これにより、或る文字の字形が他の文字の字
形を縦方向。
With this invention, standard character height Hs, standard character width Ws
is the amount to correct the difference in character size between fonts, and 1
Since the two documents use the font ``Rima-'', it is possible to determine the difference in character size between the fonts in detail and accurately. This allows the glyph shape of one character to vertically change the glyph shape of another character.

横方向または縦横両方向に伸張または圧縮して得られる
類似文字であるときも、両者の判別が可能となる。さら
に、字種毎の平均値、標準偏差を大きさのバラツキを評
価するための評価基準に組み込むことにより、判定がよ
り正確となる。
Even when similar characters are obtained by expanding or compressing in the horizontal direction or both vertical and horizontal directions, it is possible to distinguish between the two. Furthermore, by incorporating the average value and standard deviation for each character type into the evaluation criteria for evaluating size variations, the determination becomes more accurate.

その結果、片仮名の「口ごと漢字の「口」は文字画像が
相似であるが、大きさを比べると漢字の「口」の方が大
きく、片仮名の「力」と漢字の「力」、片仮名の「工」
と漢字の「工」も同様である。また、英字の小文字と大
文字では「C」と「Cl、 rK」とrk、、 rO」
とro」、rp」と’PJ、 r3.と「S」、「U」
とru」、rV、と「■」、「W」と「W」、「X」と
「X」「Z」と「z」が同様である。さらに、英字の「
I」と漢字の「工」、漢字の「日」と「日」は縦横比を
変えると相似となるが、以上のような類似の文字の判別
が容易となる。
As a result, the character images of the kanji ``mouth'' in katakana are similar, but when comparing the sizes, the kanji ``mouth'' is larger; ``Technology''
The same is true for the kanji ``工''. In addition, lowercase and uppercase letters are "C", "Cl, rK", rk,, rO"
and ro'', rp'' and 'PJ, r3. and "S", "U"
The same applies to "and ru", rV, and "■", "W" and "W", "X" and "X", and "Z" and "z". In addition, the alphabet “
The kanji ``I'' and the kanji ``工'' and the kanji ``日'' and ``日'' become similar when the aspect ratio is changed, but it becomes easier to distinguish the similar characters as described above.

また、文字の分離の仕方を文字サイズによって予測し、
その予測結果を基準値と比較して判定するようにすれば
、文字の統合2分離をより正確に判定することができる
。さらに、分離の仕方の字種毎の平均値、標準偏差を大
きさのバラツキを評価するための評価基準に組み込み、
これを文字サイズの近似式または関数として利用すれば
、判定がより正確となる。
In addition, the method of separating characters can be predicted based on the font size.
If the prediction result is compared with a reference value for determination, it is possible to more accurately determine whether characters are integrated into two or separated. In addition, we incorporated the average value and standard deviation of the separation method for each type of character into the evaluation criteria for evaluating the variation in size.
If this is used as an approximation formula or function for character size, the determination will be more accurate.

その結果、「二」のように確実に「縦方向に2つに分離
するj文字は勿論のこと、rは」のように常に「横方向
に2つに分離する」とは限らない文字種についても統合
2分離判定が容易となる。
As a result, for character types that are not always separated into two horizontally such as ``r'', as well as ``j'' that is reliably separated into two vertically, such as ``ni''. Also, it becomes easy to make an integrated and two-separate judgment.

【図面の簡単な説明】[Brief explanation of drawings]

第1図はこの発明の実施例を示すフローチャート、第2
図はサンプルデータおよび文書データを説明するための
説明図、第3図は辞書データを説明するための説明図、
第4図は縦分離数、横分離数の各平均、標準偏差を説明
するための説明図、第5図は或る文字の一例とその文字
分離リストを説明するための説明図、第6図は提案済み
方法1を説明するためのフローチャート、第7図は提案
済み方法lを具体的に説明するための説明図、第8図1
ま提案済み方法2を説明するためのフローチャート、第
9図および第10図はいずれも提案済み方法2を具体的
に説明するための説明図である。 符号説明 1・−・統合文字、2.3・・・分離文字。
FIG. 1 is a flowchart showing an embodiment of the invention, and FIG.
The figure is an explanatory diagram for explaining sample data and document data, FIG. 3 is an explanatory diagram for explaining dictionary data,
Figure 4 is an explanatory diagram to explain the average and standard deviation of the number of vertical separations and the number of horizontal separations; Figure 5 is an explanatory diagram to explain an example of a certain character and its character separation list; Figure 6 is a flowchart for explaining proposed method 1, FIG. 7 is an explanatory diagram for concretely explaining proposed method 1, and FIG. 8 is a flowchart for explaining proposed method 1.
The flow chart for explaining the proposed method 2, and FIGS. 9 and 10 are explanatory diagrams for specifically explaining the proposed method 2. Code explanation 1.--Integrated character, 2.3.--Separation character.

Claims (1)

【特許請求の範囲】 1)文書を画像処理して個々の文字を切り出し各文字毎
に辞書パターンとの類似度を求めて文字を認識するに当
たり、 認識すべき文字の文字幅、文字高さ、外接矩形の面積、
縦横比の少なくとも1つについて、認識すべき文字を含
む文書内の標準的な文字の各対応量との比率を求め、こ
れを予め求めておいた基準値と比較することにより、或
る文字の字形が他の文字の字形を縦方向、横方向または
縦横両方向に伸張または圧縮して得られる文字と類似す
るときの判別を容易にしてなることを特徴とする文字認
識方法。 2)前記各比率の平均値、標準偏差またはこれらから導
かれる量を求めるとともに、その各々に対する基準値を
予め求めてこれを辞書として所定のメモリに記憶してお
くことを特徴とする請求項1)に記載の文字認識方法。 3)文書を画像処理して個々の文字を切り出し各文字毎
に辞書パターンとの類似度を求めて文字を認識するに当
たり、 認識すべき文字の縦分離数、横分離数をその文字サイズ
から予測し、これを基準値と比較することにより、或る
文字が分離文字になり易い字形であるときの判別を容易
にしてなることを特徴とする文字認識方法。 4)前記縦分離数、横分離数の平均値、標準偏差または
これらから導かれる量を求めるとともに、その各々に対
する基準値を予め求めてこれを辞書として所定のメモリ
に記憶しておくことを特徴とする請求項3)に記載の文
字認識方法。
[Claims] 1) When recognizing characters by image processing a document to cut out individual characters and determining the degree of similarity with a dictionary pattern for each character, the character width, character height of the character to be recognized, area of circumscribed rectangle,
For at least one of the aspect ratios, the ratio of each corresponding amount of standard characters in the document containing the character to be recognized is determined, and this is compared with a predetermined reference value to determine the character of a certain character. A character recognition method, characterized in that it facilitates discrimination when a character shape is similar to a character obtained by expanding or compressing the shape of another character vertically, horizontally, or both vertically and horizontally. 2) An average value, a standard deviation, or a quantity derived from these of each of the ratios is determined, and a reference value for each of the ratios is determined in advance and is stored in a predetermined memory as a dictionary. ) Character recognition method described in 3) Image processing a document to extract individual characters and calculate the similarity of each character with a dictionary pattern to predict the number of vertical and horizontal separations of characters to be recognized based on the character size. A character recognition method characterized in that by comparing this with a reference value, it is easy to determine when a certain character has a shape that is likely to become a separated character. 4) The average value and standard deviation of the number of vertical separations and the number of horizontal separations, or quantities derived from these are determined, and reference values for each of them are determined in advance and stored as a dictionary in a predetermined memory. The character recognition method according to claim 3).
JP2057995A 1990-03-12 1990-03-12 Character recognizing method Pending JPH03260886A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2057995A JPH03260886A (en) 1990-03-12 1990-03-12 Character recognizing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2057995A JPH03260886A (en) 1990-03-12 1990-03-12 Character recognizing method

Publications (1)

Publication Number Publication Date
JPH03260886A true JPH03260886A (en) 1991-11-20

Family

ID=13071591

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2057995A Pending JPH03260886A (en) 1990-03-12 1990-03-12 Character recognizing method

Country Status (1)

Country Link
JP (1) JPH03260886A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06342471A (en) * 1993-06-01 1994-12-13 Hiroshi Kuyama Method for analyzing shape characteristic of graphic or the like
US5999647A (en) * 1995-04-21 1999-12-07 Matsushita Electric Industrial Co., Ltd. Character extraction apparatus for extracting character data from a text image

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62190575A (en) * 1986-02-18 1987-08-20 Mitsubishi Electric Corp Character pattern segmenting device
JPS62247485A (en) * 1986-04-21 1987-10-28 Ricoh Co Ltd Adding method for information of object character
JPS6316392A (en) * 1986-07-08 1988-01-23 Matsushita Electric Ind Co Ltd Character recognizing device
JPH01114991A (en) * 1987-10-29 1989-05-08 Fuji Electric Co Ltd Method for discriminating capital letter/small letter

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62190575A (en) * 1986-02-18 1987-08-20 Mitsubishi Electric Corp Character pattern segmenting device
JPS62247485A (en) * 1986-04-21 1987-10-28 Ricoh Co Ltd Adding method for information of object character
JPS6316392A (en) * 1986-07-08 1988-01-23 Matsushita Electric Ind Co Ltd Character recognizing device
JPH01114991A (en) * 1987-10-29 1989-05-08 Fuji Electric Co Ltd Method for discriminating capital letter/small letter

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06342471A (en) * 1993-06-01 1994-12-13 Hiroshi Kuyama Method for analyzing shape characteristic of graphic or the like
US5999647A (en) * 1995-04-21 1999-12-07 Matsushita Electric Industrial Co., Ltd. Character extraction apparatus for extracting character data from a text image
US6064769A (en) * 1995-04-21 2000-05-16 Nakao; Ichiro Character extraction apparatus, dictionary production apparatus and character recognition apparatus, using both apparatuses
US6141443A (en) * 1995-04-21 2000-10-31 Matsushita Electric Industrial Co., Ltd. Character extraction apparatus, dictionary production apparatus, and character recognition apparatus using both apparatuses

Similar Documents

Publication Publication Date Title
US5848191A (en) Automatic method of generating thematic summaries from a document image without performing character recognition
US5892842A (en) Automatic method of identifying sentence boundaries in a document image
US5390259A (en) Methods and apparatus for selecting semantically significant images in a document image without decoding image content
JPH08166970A (en) Method for highlight enphasis of document image by using coded word token
US20110299779A1 (en) Methods and Systems for Detecting Numerals in a Digital Image
EP0779592B1 (en) Automatic method of identifying drop words in a document image without performing OCR
US8229248B2 (en) Methods and systems for identifying the orientation of a digital image
JPH07200745A (en) Comparison method of at least two image sections
JP2001283152A (en) Device and method for discrimination of forms and computer readable recording medium stored with program for allowing computer to execute the same method
US7099507B2 (en) Method and system for extracting title from document image
JP2006031546A (en) Character direction identifying device, character processing device, program and storage medium
US8787702B1 (en) Methods and apparatus for determining and/or modifying image orientation
JP2000315247A (en) Character recognizing device
US8340428B2 (en) Unsupervised writer style adaptation for handwritten word spotting
JPH0440749B2 (en)
JPH03260886A (en) Character recognizing method
US11361529B2 (en) Information processing apparatus and non-transitory computer readable medium
JP3187899B2 (en) Character recognition device
JP7382544B2 (en) String recognition device and string recognition program
Bhyrapuneni et al. Word Recognition Method Using Convolution Deep Learning Approach Used in Smart Cities for Vehicle Identification.
Chen et al. Detection and location of multicharacter sequences in lines of imaged text
JP3384634B2 (en) Character type identification method
JPH0950488A (en) Method for reading different size characters coexisting character string
JPH0528310A (en) Form type document identification device
KR910007032B1 (en) A method for truncating strings of characters and each character in korean documents recognition system