JPH01187684A - Character recognizing device - Google Patents

Character recognizing device

Info

Publication number
JPH01187684A
JPH01187684A JP63011424A JP1142488A JPH01187684A JP H01187684 A JPH01187684 A JP H01187684A JP 63011424 A JP63011424 A JP 63011424A JP 1142488 A JP1142488 A JP 1142488A JP H01187684 A JPH01187684 A JP H01187684A
Authority
JP
Japan
Prior art keywords
character
direction code
outermost
frequency distribution
distribution value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP63011424A
Other languages
Japanese (ja)
Inventor
Fumio Yoda
依田 文夫
Keiji Kobayashi
啓二 小林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Priority to JP63011424A priority Critical patent/JPH01187684A/en
Publication of JPH01187684A publication Critical patent/JPH01187684A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To correctly recognize even a character in which a crush is apt to occur by classifying a character based on the direction code frequency distribution value of the outermost point which is not influenced by the crush of the character. CONSTITUTION:In an outermost direction code detecting means 21, a black picture element to meet first by scanning from the upper, the lower, the right and the left of a character frame 8a toward a character part in directions perpendicular to the character frame 8a is detected as the outermost point strong against the crush and the direction code 18 of the outermost point is determined. After that, the outermost direction code frequency distribution value obtained by counting the appearance frequency the direction code 18 of the outermost point to meet by scanning from points on respective divided character frames into which the character frame 8a is halved lengthwise and breadthwise is matched with a standard feature value by a matching means 24. Thus, even a character pattern in which the crush occurs can be correctly recognized.

Description

【発明の詳細な説明】 [産業上の利用分野] この発明は多くの字種を読み取る文字認識装置に関し、
特に字体のっぷれが発生し易い漢字を高゛い精度で認識
する文字認識装置に関するものである。
[Detailed Description of the Invention] [Industrial Application Field] This invention relates to a character recognition device that reads many character types.
In particular, the present invention relates to a character recognition device that recognizes kanji characters, which are prone to font variations, with high accuracy.

[従来の技術] 日本語情報を計算機に入力する手段として1手書き漢字
を直接入力でき文字を認識する文字認識装置への期待が
高まっている。このような文字認識装置における文字認
識方式としては構造解析的手法で文字の特徴を求め1重
ね合せ法で文字を認識する方式などが提案されている。
[Prior Art] As a means of inputting Japanese information into a computer, there are increasing expectations for a character recognition device that can directly input a single handwritten kanji character and recognize characters. As a character recognition method for such a character recognition device, a method has been proposed that uses a structural analysis method to find character features and recognizes the characters using a one-superposition method.

文字を高い精度で読み取るには1字体の違いや文字線の
太さの違いなどの変化にも安定した特徴を用いて文字の
分類を行う必要がある。
In order to read characters with high accuracy, it is necessary to classify characters using features that are stable even when changes such as differences in font or thickness of character lines occur.

例えば、第4図は電子通信学会研究会資料PRL82−
3Or大局的特徴を併用したストロークマツチング法に
よる手書き漢字認識の検討」に示された従来の文字認識
装置の構成を示すブロック図である。大局的特徴を併用
したストロークマツチング法とは、2値パターンの輪郭
点に水平、垂直、右上がり、左上がりの4種数の方向付
けを行い、その発生数を求めた方向コード分布特徴を抽
出し、また、2値パターンを細線化し、端点・分岐点・
屈折点などの特徴点を抽出した後、その特徴点の数を求
めた特徴点分布特徴を抽出する方法をいう。第4図にお
いて、1は文字が記入された用紙、2は用紙1に記入さ
れた文字を光学的に走査して光電変換する走査手段、3
は光電変換された文字パターンの輪郭点の方向コードを
検出する輪郭方向コード検出手段、4は輪郭方向コード
検出手段3で検出した輪郭点の方向コードの数を示す輪
郭方向コード度数分布値を算出する輪郭方向コード度数
分布値算出手段、5は認識対象文字を代表する輪郭方向
コード度数分布値を標準特徴値として記憶した標準特徴
値記憶手段、6は輪郭方向コード度数分布値算出手段4
で算出した輪郭方向コード度数分布値と標準特徴値記憶
手段5に記憶された標準特徴値とを整合させ両者の整合
度合を求める整合手段、7は整合手段6で求められた整
合度合(相違度)に基づいて文字を分類しすべての認識
対象文字の相違度のうち最も小さい値をとる文字を認識
結果として選択し外部装置に出力する分類手段である。
For example, Figure 4 shows the Institute of Electronics and Communication Engineers study group material PRL82-
FIG. 2 is a block diagram showing the configuration of a conventional character recognition device shown in "Study of Handwritten Kanji Recognition Using Stroke Matching Method Using 3Or Global Features". The stroke matching method that uses global features is a method in which the contour points of a binary pattern are given four types of orientation: horizontal, vertical, upward-to-the-right, and upward-to-the-left, and the direction code distribution feature is used to calculate the number of occurrences. Extract and thin the binary pattern to find end points, branch points,
This is a method of extracting feature points such as refraction points, and then extracting the feature point distribution feature by calculating the number of feature points. In FIG. 4, 1 is a sheet of paper with characters written on it, 2 is a scanning means for optically scanning the characters written on the sheet 1 and photoelectrically converting the characters, 3
4 is a contour direction code detection means for detecting the direction code of the contour points of the photoelectrically converted character pattern, and 4 is a contour direction code frequency distribution value indicating the number of direction codes of the contour points detected by the contour direction code detection means 3. 5 is a standard feature value storage means that stores contour direction code frequency distribution values representative of characters to be recognized as standard feature values; 6 is a contour direction code frequency distribution value calculation means 4;
7 is a matching means for matching the contour direction code frequency distribution value calculated by the standard feature value stored in the standard feature value storage means 5 and determining the degree of matching between the two; ), and selects the character with the smallest difference value among all recognition target characters as a recognition result, and outputs it to an external device.

第5図(A)〜(D>は第4図に示す従来装置の動作を
説明するための動作説明図である。第5図(A)に示す
8は用紙1に記入された文字パターンの例、第5図(B
)に示す9は文字パターン8の輪郭点に方向コードを割
り当てた例、第5図(C)に示す10は文字パターン8
の領域を4分割した例である。第5図(D)に示す11
は分割した各領域内の輪郭点の方向コード度数分布値の
例であり1図中のH,R,V、Lは文字パターン8の対
応する輪郭点がそれぞれ水平、右上がり、垂直、左上が
りであることを示す。
5(A) to (D>) are operation explanatory diagrams for explaining the operation of the conventional device shown in FIG. 4. 8 shown in FIG. Example, Figure 5 (B
9 shown in ) is an example of assigning direction codes to the contour points of character pattern 8, and 10 shown in FIG. 5(C) is character pattern 8.
This is an example in which the area is divided into four. 11 shown in Figure 5(D)
is an example of the direction code frequency distribution value of contour points in each divided area. H, R, V, and L in figure 1 indicate that the corresponding contour points of character pattern 8 are horizontal, upward-rightward, vertically, and upward-leftward, respectively. .

第6図は輪郭点が水平、右上がり、垂直および左上がり
のいずれの方向であるがを決定するための3×3のマス
クパターンの例であり9図中の12は着目する黒画素で
ある。また、第7図(A)〜(C)はつぶれの発生した
文字に対する第4図に示す従来装置の処理結果の例であ
り1図中の13はつぶれの発生した文字パターンの例、
14は文字パターンの輪郭点に方向コードを割り当てた
例、15は輪郭点の方向コード度数分布値の例を示す。
Figure 6 is an example of a 3x3 mask pattern for determining whether the contour points are horizontal, upward to the right, vertical, or upward to the left, and 12 in figure 9 is the black pixel of interest. . 7(A) to (C) are examples of the processing results of the conventional apparatus shown in FIG. 4 for characters with blurring, and 13 in FIG. 1 is an example of a character pattern with blurring.
14 shows an example in which direction codes are assigned to contour points of a character pattern, and 15 shows an example of direction code frequency distribution values of contour points.

次に第5図と第6図を用いて第4図に示す従来装置の動
作について説明する。まず、用紙1上の文字を走査手段
2で光電変換して得た文字パターン8を輪郭方向コード
検出手段3へ転送する。次に、上記輪郭方向コード検出
手段3では9文字パターン8を左上から右下へマスクス
キャンしながら、第6図の例に示す3×3のマスクパタ
ーンを用いて文字パターン8のすべての輪郭点に対して
各輪郭点が4方向のうちのどの方向コードに割り当てら
れるかを検出し、第5図(B)の例に示すように文字パ
ターンのすべての輪郭点に対する方向コード9を決定す
る。上記輪郭方向コード検出手段3で求めた輪郭点の方
向コード9は、上記輪郭方向コード度数分布値算出手段
4に送られる。そして上記輪郭方向コード度数分布値算
出手段4では、第5図(C)の例に示すように文字パタ
ーンを2×2の領域に分割し、各分割領域内に存在する
水平H1右上がりR1垂直Vおよび左上がりLの方向コ
ード数を求め、第5図(D)の11で示すようなM次元
(M=16>の輪郭方向コード度数分布値F(fl、 
f2.−・−、fM)=(20,4,8,2,−−−,
14,7,3,8)を算出する。次に上記整合手段6で
は、上記輪郭方向コード度数分布値算出手段4で算出し
た輪郭方向コード度数分布値F(fl、f2.・・・、
fl4)と上記標準特徴記憶手段5に記憶した認識対象
文字kを代表する輪郭方向コード度数分布値Tk(jl
、t2.・・・、シM)とを整合させ7両者の整合の度
合(相違度Dk)を求める。ここで相違度Dkとしては
式(1)に示す絶対値距離などを用いる。
Next, the operation of the conventional device shown in FIG. 4 will be explained using FIGS. 5 and 6. First, the character pattern 8 obtained by photoelectrically converting the characters on the paper 1 by the scanning means 2 is transferred to the contour direction code detecting means 3. Next, while mask-scanning the 9-character pattern 8 from the upper left to the lower right, the contour direction code detecting means 3 detects all the contour points of the character pattern 8 using the 3×3 mask pattern shown in the example of FIG. It is detected which direction code each contour point is assigned to among the four directions, and the direction code 9 for all the contour points of the character pattern is determined as shown in the example of FIG. 5(B). The direction code 9 of the contour point obtained by the contour direction code detection means 3 is sent to the contour direction code frequency distribution value calculation means 4. Then, the contour direction code frequency distribution value calculation means 4 divides the character pattern into 2×2 areas as shown in the example of FIG. Find the number of direction codes for V and left-upward L, and calculate the contour direction code frequency distribution value F(fl,
f2. −・−, fM)=(20,4,8,2,−−,
14, 7, 3, 8). Next, in the matching means 6, the contour direction code frequency distribution value F(fl, f2...,
fl4) and the contour direction code frequency distribution value Tk (jl
, t2. . Here, as the degree of difference Dk, the absolute value distance shown in equation (1) is used.

最後に、上記分類手段7ではすべての認識対象文字の相
違度のうち最も小さい値をとる文字を認識結果として選
択し外部装置に出力する。
Finally, the classification means 7 selects the character having the smallest difference value among all the characters to be recognized as the recognition result and outputs it to an external device.

[発明が解決しようとする課題] 従来の文字認識装置は以上のように文字パターンのすべ
ての輪郭点の方向コードの数に基づいて文字を認識する
ように構成されていたので9文字線がつぶれて文字の内
部の輪郭形状が変形した文字パターンから抽出した輪郭
方向コード度数分布値とつぶれなどの発生してない文字
パターンから抽出した輪郭方向コード度数分布値とでは
その値が大きくかわるという問題点があった。例えば。
[Problems to be Solved by the Invention] Conventional character recognition devices are configured to recognize characters based on the number of direction codes of all outline points of a character pattern as described above, so nine character lines are collapsed. The problem is that the contour direction code frequency distribution value extracted from a character pattern in which the internal contour shape of the character has been deformed is significantly different from the contour direction code frequency distribution value extracted from a character pattern without collapse etc. was there. for example.

第7図(A)に示すつぶれの発生して文字パターン13
では文字内部の輪郭点が存在しなくなるため。
Character pattern 13 with collapse as shown in FIG. 7(A)
In this case, the contour points inside the character no longer exist.

従来の文字認識装置では第7図(B)に示すように方向
コード14が割り付けられる。この結果得られる第7図
(C)の15で示す輪郭方向コード度数分布値F’(r
i 、 f2′、・・・、fM)=(6,1,8,0,
・・・、8,4,3.8>は。
In a conventional character recognition device, a direction code 14 is assigned as shown in FIG. 7(B). As a result, the contour direction code frequency distribution value F'(r
i, f2', ..., fM) = (6, 1, 8, 0,
..., 8, 4, 3.8>.

第5図(D)の11で示すつぶれのない文字パターンの
輪郭方向コード度数分布値F(fl、f2.・・・、f
M) =(20,4,8,2,・・・、14,7,3.
8>と大きく異なる。
Contour direction code frequency distribution value F (fl, f2..., f
M) = (20, 4, 8, 2,..., 14, 7, 3.
8> is significantly different.

このように、従来の装置では文字のすべての輪郭点に割
り当てた方向コードの度数を認識のための特徴値として
用いたため、つぶれなどの発生した文字パターンを正し
く認識できないという問題点があった。
As described above, the conventional device uses the frequency of the direction code assigned to all outline points of a character as a feature value for recognition, which has caused the problem that character patterns with blurring or the like cannot be correctly recognized.

この発明は上記のような問題点を解消するためになされ
たもので、つぶれ等の発生した文字でも正しく認識でき
る文字認識装置を提供することを目的とする。
The present invention has been made to solve the above-mentioned problems, and it is an object of the present invention to provide a character recognition device that can correctly recognize even characters that are distorted or otherwise distorted.

[課題を解決するための手段] この発明に係る文字認識装置は1文字パターン8に外接
する文字枠8a上の点から内部の文字部に向かって文字
枠8aと直交する方向に走査して最初に出会う黒画素が
示す最外郭点の方向コード18を検出する最外郭方向コ
ード検出手段21と、この最外郭方向コード検出手段2
1で検出した最外郭点の方向コード18の情報に基づい
て最外郭方向コードの度数分布値を算出する最外郭方向
コード度数分布値算出手段22と、認識対象文字を代表
する最外郭方向コード度数分布値を標準特徴値として記
憶する標準特徴値記憶手段23と、最外郭方向コード度
数分布値算出手段22で算出した最外郭方向コード度数
分布値と標準特徴値記憶手段23に記憶された標準特徴
値とを整合させ両者の整合度合を求める整合手段24と
を有し、上記整合度合に基づいて用紙1などに記入され
た文字を認識することを特徴とするものである。
[Means for Solving the Problems] A character recognition device according to the present invention scans from a point on a character frame 8a circumscribing a single character pattern 8 toward an internal character portion in a direction perpendicular to the character frame 8a. outermost direction code detection means 21 for detecting the direction code 18 of the outermost point indicated by the black pixel that encounters the outermost direction code detection means 2;
outermost direction code frequency distribution value calculation means 22 that calculates the frequency distribution value of the outermost direction code based on the information of the direction code 18 of the outermost point detected in step 1; The standard feature value storage means 23 stores distribution values as standard feature values, and the outermost direction code frequency distribution value calculated by the outermost direction code frequency distribution value calculation means 22 and the standard feature stored in the standard feature value storage means 23 The apparatus is characterized in that it has a matching means 24 that matches the values and determines the degree of matching between the two, and recognizes characters written on paper 1 or the like based on the degree of matching.

[作用] 上記最外郭方向コード検出手段Z/は、上下左右の文字
枠8aから内部の文字部に向かって文字枠8aと直交す
る方向に走査して最初に出会う黒画素を最外郭点とし、
その最外郭点の方向コード18を検出する。最外郭方向
コード度数分布値算出手段22はその検出情報に基づい
て最外郭方向コードの度数分布値を算出する。整合手段
24は、最外郭方向コード度数分布値と標準特徴値とを
整合し2両者の整合度合を求める。したがって、用紙1
に記入された文字は整合度合(相違度)のうち最も小さ
い値をとる文字であると認識される。
[Operation] The outermost direction code detection means Z/ scans from the upper, lower, left, and right character frames 8a toward the inner character portion in a direction perpendicular to the character frame 8a, and sets the first black pixel encountered as the outermost point;
The direction code 18 at the outermost point is detected. The outermost direction code frequency distribution value calculating means 22 calculates the frequency distribution value of the outermost direction code based on the detected information. The matching means 24 matches the outermost code frequency distribution value and the standard feature value to determine the degree of matching between the two. Therefore, paper 1
The character written in is recognized as the character having the smallest value of the degree of consistency (degree of difference).

[発明の実施例コ 第1図はこの発明の一実施例に係る文字認識装置の構成
を示すブロック図である。図において。
Embodiment of the Invention FIG. 1 is a block diagram showing the configuration of a character recognition device according to an embodiment of the invention. In fig.

1は文字が記入された用紙、2は用紙1に記入された文
字を光学的に走査して光電変換する走査手段、 21は
文字パターンに外接する文字枠上の点から内部の文字部
に向かって文字枠と直交する方向に走査して最初に出会
う黒画素が示す最外郭点の方向コードを検出する最外郭
方向コード検出手段。
1 is a sheet of paper on which characters are written; 2 is a scanning means that optically scans the characters written on the sheet 1 and converts them photoelectrically; 21 is a scanning device that scans the characters written on the sheet 1 from a point on the character frame circumscribing the character pattern toward the internal character area; outermost direction code detection means for scanning in a direction orthogonal to the character frame and detecting the direction code of the outermost point indicated by the first black pixel encountered.

22は最外郭方向コード検出手段21で検出した最外郭
点の方向コードの情報に基づいて最外郭方向コードの度
数分布値を算出する最外郭方向コード度数分布値算出手
段、23は認識対象文字を代表する最外郭方向コード度
数分布値を標準特徴値として記憶する標準特徴値記憶手
段124は最外郭方向コード度数分布値算出手段22で
算出した最外郭方向コード度数分布値と標準特徴値記憶
手段23に記憶された標準特徴値とを整合させ両者の整
合度合を求める整合手段、25は整合手段24で求めら
れた整合度合(相違度)に基づいて文字を分類しすべて
の認識対象文字の相違度のうち最も小さい値をとる文字
を認識結果として選択し外部装置に出力する分類手段で
ある。
22 is outermost direction code frequency distribution value calculation means for calculating the frequency distribution value of the outermost direction code based on the information of the direction code of the outermost point detected by the outermost direction code detection means 21; 23 is a means for calculating the frequency distribution value of the outermost direction code; The standard feature value storage means 124 stores representative outermost direction code frequency distribution values as standard feature values, and the outermost direction code frequency distribution values calculated by the outermost direction code frequency distribution value calculation means 22 and the standard feature value storage means 23 A matching means 25 matches the standard feature values stored in , and calculates the degree of matching between the two, and 25 classifies the characters based on the matching degree (difference degree) obtained by the matching means 24 and calculates the degree of dissimilarity of all characters to be recognized. This classification means selects the character with the smallest value as a recognition result and outputs it to an external device.

第2図(A)〜(C)はこの実施例の動作を説明するた
めの動作説明図であり1図中の8は用紙1に記入された
文字パターン、 8aは文字パターン8に外接する文字
枠、18は文字パターン8の最外郭点に割り当てた方向
コードの例、19は手編・横の文字枠を分割して求めた
最外郭方向コード度数分布値の例を示す。なお、第2図
中のH,R,V、Lは第5図の例と同じくそれぞれ水平
、右上がり。
Figures 2 (A) to (C) are operation explanatory diagrams for explaining the operation of this embodiment. 8 in Figure 1 is a character pattern written on paper 1, and 8a is a character circumscribing character pattern 8. Frame 18 shows an example of the direction code assigned to the outermost point of character pattern 8, and 19 shows an example of the outermost direction code frequency distribution value obtained by dividing the hand-knit horizontal character frame. Note that H, R, V, and L in Figure 2 are horizontal and upward to the right, respectively, as in the example in Figure 5.

垂直、左上がりであることを示す。また、第3図(A)
、(B)はつぶれの発生した文字パターンに対するこの
実施例の処理結果の例を示した図であり。
Indicates that it is vertical, rising to the left. Also, Figure 3 (A)
, (B) are diagrams illustrating an example of the processing results of this embodiment for a character pattern in which distortion has occurred.

図中の13はつぶれの発生した文字パターンの例。13 in the figure is an example of a character pattern in which collapse has occurred.

20は最外郭方向コード度数分布値の例を示す。20 shows an example of the outermost direction code frequency distribution value.

次に第2図および第3図を用いて第1図に示す実施例の
動作を説明する。この実施例において最外郭方向コード
検出手段21は、走査手段2で光電変換された文字パタ
ーン8に外接する文字枠8a上の各点から内部の文字部
に向かって文字枠8aと直交する方向に走査して最初に
出会う黒画素を最外郭点と決定し、この最外郭点に方向
コード18を割り当て7その方向コード18を検出する
ものとする。
Next, the operation of the embodiment shown in FIG. 1 will be explained using FIGS. 2 and 3. In this embodiment, the outermost direction code detection means 21 detects a direction perpendicular to the character frame 8a from each point on the character frame 8a circumscribing the character pattern 8 photoelectrically converted by the scanning means 2 toward the inner character part. The first black pixel encountered during scanning is determined to be the outermost point, and a direction code 18 is assigned to this outermost point.7 The direction code 18 is detected.

また、最外郭方向コード度数分布値算出手段22は。Further, the outermost direction code frequency distribution value calculation means 22 is as follows.

文字枠8aを縦・横にN等分した各分割文字枠上の各点
から走査して出会う最外郭点の方向コードの度数を各分
割文字枠領域毎に求めることにより最外郭方向コード度
数分布値を算出するものとする。
The outermost direction code frequency distribution is obtained by scanning from each point on each divided character frame obtained by dividing the character frame 8a vertically and horizontally into N equal parts, and finding the frequency of the direction code of the outermost point encountered for each divided character frame area. The value shall be calculated.

まず、用紙1上の文字を走査手段2で光電変換して得た
文字パターン8を最外郭方向コード検出手段21へ転送
する。次に、上記最外郭方向コード検出手段21では1
文字パターン8に外接する文字枠8a上の上端の各点か
ら文字部に向がって垂直方向に走査して、最初に出会っ
た黒画素を最外郭点として検出し、従来装置と同様に3
×3のマスクパターンを用いてこの黒画素の方向コード
を決定する。以下同様に1文字枠8aの下端、左端およ
び右端から順次内部の文字部に向がって走査して。
First, the character pattern 8 obtained by photoelectrically converting the characters on the paper 1 by the scanning means 2 is transferred to the outermost direction code detection means 21. Next, in the outermost direction code detection means 21, 1
Scanning is performed in the vertical direction from each point on the upper end of the character frame 8a circumscribing the character pattern 8 toward the character part, and the first black pixel encountered is detected as the outermost point.
The direction code of this black pixel is determined using a ×3 mask pattern. Thereafter, in the same manner, scanning is performed sequentially from the lower end, left end, and right end of the one-character frame 8a toward the inner character portion.

第2図(B)の例に示すように文字パターン8のすべて
の最外郭点に対して水平H1右上がりR1垂直Vおよび
左上がりLの4方向の方向コード18を割り当てる。上
記最外郭方向コード検出手段21で検出した最外郭点の
方向コード18の情報は、上記最外郭方向コード度数分
布値算出手段22に送られる。そして、上記最外郭方向
コード度数分布値算出手段22では第2図(C)の例に
示すように文字パターンを縮・横に2等分し2分割した
各分割文字枠領域内の点から走査して出会う最外郭点の
方向コードの度数を求め、19で示す最外郭方向コード
度数分布値G(gl、g2.・・・、gN)・(0,1
,8,帆・・・、3,1,0゜5)を算出する。ただし
、N=32である。
As shown in the example of FIG. 2(B), direction codes 18 in four directions, ie, horizontal H1, upward-rightward R1, vertical V, and upward-leftward L, are assigned to all the outermost points of the character pattern 8. Information on the direction code 18 of the outermost point detected by the outermost direction code detection means 21 is sent to the outermost direction code frequency distribution value calculation means 22. Then, the outermost direction code frequency distribution value calculating means 22 reduces and horizontally divides the character pattern into two equal parts, and scans from a point within each divided character frame area, as shown in the example of FIG. 2(C). Find the frequency of the direction code of the outermost point encountered by
, 8, sail..., 3, 1, 0°5). However, N=32.

第3図は、つぶれの発生した文字パターン13に対して
上記最外郭方向コード度数分布値算出手段22において
20で示す最外郭方向コード度数分布値G(gl 、 
g2’、・・・、gN)・(0,1,8,0,・・・、
3.2,0.4)を算出した例である。ただし、N=3
2である。最外郭方向コード度数分布値は、つぶれの影
響を受けにくい文字パターンの最外郭点の情報を用いて
いるため、つぶれの発生した第3図(A)の例に示す文
字部してない第2図(A)の例に示す文字パターン8の
最外郭方向コード度数分布値G(gl、g2.・・・、
gN)=<0.1,8,0.・・・、3,1,0.5>
とは、はぼ同じ値をとる。
FIG. 3 shows the outermost direction code frequency distribution value G (gl,
g2',..., gN)・(0,1,8,0,...,
3.2, 0.4) is calculated. However, N=3
It is 2. Since the outermost direction code frequency distribution value uses information on the outermost point of the character pattern that is less susceptible to collapse, the second point without the character part shown in the example of Figure 3 (A) where collapse occurs Outermost direction code frequency distribution value G(gl, g2...,
gN)=<0.1, 8, 0. ..., 3, 1, 0.5>
and take almost the same value.

すなわち1両者の相違の度合を示す相違度りを式1式% となり、最外郭方向コード度数分布値を抽出することに
より1文字のつぶれに影響を受けにくい安定した特徴値
を得ることができる。
In other words, the degree of difference indicating the degree of difference between the two is expressed as Equation 1%. By extracting the outermost direction code frequency distribution value, stable feature values that are not easily affected by the collapse of one character can be obtained.

以上のようにして求めた最外郭方向コード度数分布値G
(gl、、g2.・・・、gN)は、上記整合手段24
に送られる。そして次に上記整合手段24では、上記最
外郭方向コード度数分布値算出手段22で算出した最外
郭方向コード度数分布値G(gl、g2.・・・、gN
)と上記標準特徴記憶手段23に格納した認識対象文字
kを代表する最外郭方向コード度数分布値Pk(pi。
Outermost direction code frequency distribution value G obtained as above
(gl,, g2..., gN) is the matching means 24
sent to. Then, in the matching means 24, the outermost direction code frequency distribution value G (gl, g2..., gN
) and the outermost direction code frequency distribution value Pk (pi.

p2.・・・、 pN>とを従来の装置と同様な方法で
整合させ1両者の整合の度合(相違度Dk>を求め。
p2. . . , pN> in the same manner as in the conventional apparatus, and the degree of matching (difference Dk>) between the two is determined.

最後に上記分類手段25ですべての認識対象文字の相違
度のうち最も小さい値をとる文字を認識結果として選択
し外部装置に出力する。
Finally, the classification means 25 selects the character having the smallest value among all the recognition target characters as a recognition result and outputs it to an external device.

なお、上記実施例では2等分した縦・横の文字枠領域に
対する最外郭点の方向コード度数分布を求める例につい
て説明したが、この発明はこれに限らず、N等分(N≠
2)した縦・横の文字枠領域に対する最外郭点の方向コ
ード度数分布を求めてもよい。
In addition, in the above embodiment, an example was explained in which the direction code frequency distribution of the outermost point for the vertical and horizontal character frame regions divided into two equal parts is obtained, but the present invention is not limited to this.
2) The direction code frequency distribution of the outermost point for the vertical and horizontal character frame areas may be determined.

上記実施例によれば、最外郭方向コード検出手段21に
おいて上下左右の文字枠8aから内部の文字部に向かっ
て文字枠8aと直交する方向に走査して最初に出会う黒
画素をつぶれに強い最外郭点として検出し、この最外郭
点の方向コード18を決定した後9文字枠8aを縦・横
に2等分した各分割文字枠上の点から走査して出会う最
外郭点の方向コードの出現度数を各分割文字枠領域毎に
計数して得た最外郭方向コード度数分布値と、標準特徴
値とを整合することにより、つぶれの発生した文字パタ
ーン13でも正確に認識することができる。
According to the above embodiment, the outermost direction code detecting means 21 scans from the upper, lower, left, and right character frames 8a toward the inner character part in a direction perpendicular to the character frame 8a, and selects the first black pixel encountered that is the most resistant to crushing. After detecting it as an outer point and determining the direction code 18 of this outermost point, the direction code of the outermost point that is encountered by scanning from a point on each divided character frame obtained by dividing the 9 character frame 8a vertically and horizontally into two is determined. By matching the outermost direction code frequency distribution value obtained by counting the frequency of appearance for each divided character frame region with the standard feature value, even a character pattern 13 with blurring can be accurately recognized.

[発明の効果] 以上のように本発明によれば1文字パターンに外接する
文字枠上の点から内部の文字部に向かって文字枠と直交
する方向に走査して最初に出会う黒画素が示す最外郭点
の方向コードを検出し、この方向コードの情報に基づい
て最外郭方向コード度数分布値を算出し、この最外郭方
向コード度数分布値と標準特徴値とを整合させ両者の整
合度合を求めるという構成にしたので1文字のつぶれに
影響されない最外郭点の方向コード度数分布値に基づい
て文字を分類することができ、したがってつぶれの発生
し易い文字であっても正確に認識でき1文字認識精度が
向上するという効果が得られる。
[Effects of the Invention] As described above, according to the present invention, the first black pixel encountered when scanning from a point on a character frame circumscribing a single character pattern toward an internal character part in a direction perpendicular to the character frame indicates The direction code of the outermost point is detected, the outermost direction code frequency distribution value is calculated based on the information of this direction code, and the outermost direction code frequency distribution value is matched with the standard feature value to determine the degree of consistency between the two. Since this configuration allows characters to be classified based on the direction code frequency distribution value of the outermost point, which is not affected by the distortion of a single character, even characters that are likely to be distorted can be accurately recognized. This has the effect of improving recognition accuracy.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図はこの発明の一実施例に係る文字認識装置の構成
を示すブロック図、第2図(A)、(B)、(C)はこ
の実施例の動作を説明するための動作説明図。 第3図(A)、(B>はつぶれの発生した文字に対する
この実施例の処理結果の例を示す図、第4図は従来の文
字認識装置の構成を示すブロック図、第5図(A)、(
B)、(C)、(D)はこの従来例の動作を説明するた
めの動作説明図、第6図は実施例および従来例において
最外郭点の方向を決定するためのマスクパターンの例を
示す図、第7図(A) 、 (B) 、 (C)はつぶ
れの発生した文字に対する従来装置の処理結果の例を示
す図である。 1・・・・・・用紙、8・・・・・・文字パターン、 
8a・・・・・・文字枠、18・・・・・・最外郭点の
方向コード721・・・・・・最外郭方向コード検出手
段、22・・・・・・最外郭方向コード度数分布値算出
手段、23・・・・・・標準特徴値記憶手段。 24・・・・・・整合手段。
FIG. 1 is a block diagram showing the configuration of a character recognition device according to an embodiment of the present invention, and FIGS. 2 (A), (B), and (C) are operation explanatory diagrams for explaining the operation of this embodiment. . 3(A) and 3(B) are diagrams showing an example of the processing results of this embodiment for characters with blurring, FIG. 4 is a block diagram showing the configuration of a conventional character recognition device, and FIG. ), (
B), (C), and (D) are operation explanatory diagrams for explaining the operation of this conventional example, and FIG. 6 is an example of a mask pattern for determining the direction of the outermost point in the embodiment and the conventional example. The figures shown in FIGS. 7(A), 7(B), and 7(C) are diagrams showing examples of processing results of the conventional apparatus for characters with blurring. 1...Paper, 8...Character pattern,
8a...Character frame, 18...Outermost point direction code 721...Outermost direction code detection means, 22...Outermost direction code frequency distribution Value calculation means, 23...Standard feature value storage means. 24... Coordination means.

Claims (1)

【特許請求の範囲】[Claims] 文字パターンに外接する文字枠上の点から内部の文字部
に向かって文字枠と直交する方向に走査して最初に出会
う黒画素が示す最外郭点の方向コードを検出する最外郭
方向コード検出手段と、この最外郭方向コード検出手段
で検出した最外郭点の方向コードの情報に基づいて最外
郭方向コードの度数分布値を算出する最外郭方向コード
度数分布値算出手段と、認識対象文字を代表する最外郭
方向コード度数分布値を標準特徴値として記憶する標準
特徴値記憶手段と、上記最外郭方向コード度数分布値算
出手段で算出した最外郭方向コード度数分布値と上記標
準特徴値記憶手段に記憶された標準特徴値とを整合させ
両者の整合度合を求める整合手段とを有し、上記整合度
合に基づいて用紙などに記入された文字を認識すること
を特徴とする文字認識装置。
Outermost direction code detection means for detecting the direction code of the outermost point indicated by the first black pixel encountered by scanning from a point on the character frame circumscribing the character pattern toward the inner character part in a direction perpendicular to the character frame. and outermost direction code frequency distribution value calculation means for calculating the frequency distribution value of the outermost direction code based on the information of the direction code of the outermost point detected by the outermost direction code detection means, and standard feature value storage means for storing the outermost direction code frequency distribution value as a standard feature value; and the outermost direction code frequency distribution value calculated by the outermost direction code frequency distribution value calculation means and the standard feature value storage means. What is claimed is: 1. A character recognition device comprising: matching means for matching stored standard feature values to determine a degree of matching between the two, and recognizing characters written on paper or the like based on the degree of matching.
JP63011424A 1988-01-21 1988-01-21 Character recognizing device Pending JPH01187684A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63011424A JPH01187684A (en) 1988-01-21 1988-01-21 Character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63011424A JPH01187684A (en) 1988-01-21 1988-01-21 Character recognizing device

Publications (1)

Publication Number Publication Date
JPH01187684A true JPH01187684A (en) 1989-07-27

Family

ID=11777681

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63011424A Pending JPH01187684A (en) 1988-01-21 1988-01-21 Character recognizing device

Country Status (1)

Country Link
JP (1) JPH01187684A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03240885A (en) * 1990-02-19 1991-10-28 Fuji Electric Co Ltd Setting pattern detecting method

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5369542A (en) * 1976-12-03 1978-06-21 Fujitsu Ltd Graphic recognizing equipment

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5369542A (en) * 1976-12-03 1978-06-21 Fujitsu Ltd Graphic recognizing equipment

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03240885A (en) * 1990-02-19 1991-10-28 Fuji Electric Co Ltd Setting pattern detecting method

Similar Documents

Publication Publication Date Title
USRE47889E1 (en) System and method for segmenting text lines in documents
US4998285A (en) Character recognition apparatus
US4556985A (en) Pattern recognition apparatus
JPH01187684A (en) Character recognizing device
JP3095470B2 (en) Character recognition device
JP2576491B2 (en) Feature extraction method
JPS58165178A (en) Character reader
JP2715930B2 (en) Line detection method
JPH0877293A (en) Character recognition device and generating method for dictionary for character recognition
JP2576494B2 (en) Feature extraction method
JPS63238686A (en) Feature extracting system
JPS634231B2 (en)
Zhang et al. Using Orientation Voting to Extract Text Lines with Various Mixed Directions from a Document Image
JPH07109612B2 (en) Image processing method
JP2979089B2 (en) Character recognition method for scene images
JPH03175591A (en) Character recognizing device
JPH05120483A (en) Character recognizing device
JP2000207490A (en) Character segmenting device and character segmenting method
CN115965572A (en) Three-stage tube appearance defect detection method based on template comparison
JPH05174179A (en) Document image processor
JPH01201789A (en) Character reader
JPH0799536B2 (en) Character figure recognition method
JPS58182791A (en) Feature extracting and sorting method of character pattern
JPS6019287A (en) Character recognizing method
JP2002251591A (en) Character recognition method and character recognition program