JPS60146380A

JPS60146380A - Character recognition device

Info

Publication number: JPS60146380A
Application number: JP59002801A
Authority: JP
Inventors: Masahiro Shimizu; 正博清水
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1984-01-11
Filing date: 1984-01-11
Publication date: 1985-08-02

Abstract

PURPOSE:To execute character recognition efficiently by characteristic quantity of a stroke at every code in the every direction by comparing a directional code of a point concerned and its adjoining point and giving the stroke number of the adjoining point having a coincide directional code for a point concerned. CONSTITUTION:A directional code giving part 13 reads out sequentially character patterns in a memory 12, and decides whether or not a point concerned of a character pattern belongs to a continuous black point string having more than reference length, and gives a directional code for the point concerned when it belongs to the black string having more than N number of picture elements. A stroke extracting part 15 extracts a stroke in accordance with the directional code of the point concerned by the directional code giving part 13. A characteristic quantity calculating part 16 obtains characteristic quantity of the character pattern obtained in the stroke extracting part 15. The characteristic quantity at every stroke obtained in the characteristic quantity calculating part 16 is compared with a recognition dictionary at every character pattern and a recognition part 17 recognizes a character pattern.

Description

【発明の詳細な説明】産業上の利用分野本発明は２値文字パターンの方向毎のストロークを抽出
し、各ストローク毎の壽微抽出を行なう文字認識装置に
関するものである。DETAILED DESCRIPTION OF THE INVENTION FIELD OF INDUSTRIAL APPLICATION The present invention relates to a character recognition device that extracts strokes in each direction of a binary character pattern, and performs a characteristic extraction for each stroke.

従来例の構成とその問題点従来は、認識対象文字を水平走査して得られた「０」と
「１」の極性をもつ２値ディジタルデータ列を入力とし
、認識対象文字の方向毎のストロークを抽出する場合、
細線化の手法が使われており非常に複雑で多大な処理時
間を要し、ハード化も困難であった。Configuration of conventional example and its problems Conventionally, the input is a binary digital data string with polarities of "0" and "1" obtained by horizontally scanning the character to be recognized, and the strokes in each direction of the character to be recognized are If you want to extract
It uses a thinning method, which is extremely complex and requires a lot of processing time, and is difficult to implement in hardware.

発明の目的本発明は前記問題点を解消し、各方向毎のストロークを
細線化する事なく簡単に抽出し、各ストロークの特徴量
を計算し、その特徴量を用い対象−文字を認識する事が
できる文字認識装置を提供する事を目的とする。Purpose of the Invention The present invention solves the above problems, easily extracts strokes in each direction without thinning them, calculates the feature amount of each stroke, and recognizes the target character using the feature amount. The purpose is to provide a character recognition device that can.

発明の構成本発明は、認識対象文字パターンの着目点の方向コード
を決定する方向コード付与部と、着目点の方向コードと
、この着目点が隣接する点との方向コードを比較して、
方向コードの一致する隣接点のストローク番号を着目点
に付与することによりストロークを抽出し、各方向コー
ド毎のストロークの特徴量を用いて文字認識を効率的に
行なうものである。Structure of the Invention The present invention includes a direction code assigning unit that determines the direction code of a point of interest in a character pattern to be recognized, and a direction code of the point of interest and a direction code of a point adjacent to the point of interest, which is compared with the direction code of the point of interest.
Strokes are extracted by assigning stroke numbers of adjacent points with matching direction codes to points of interest, and character recognition is efficiently performed using stroke features for each direction code.

実施例の説明第１図は本発明における一実施例の文字認識装置の構成
を示すブロック図である。DESCRIPTION OF THE EMBODIMENTS FIG. 1 is a block diagram showing the configuration of a character recognition device according to an embodiment of the present invention.

第１図において、１０は文字パターン入力信号で、１１
は文字パターン入力信号１０を２値化する２値化部、１
２は文字パターンを格納するメモリ部、１３は文字パタ
ーンの各点の方向を決定する方向コード付与部、１４は
得られた方向コードを１水平走査線分蓄える方向コード
バッファ、１６は文字パターンにおいて連結した同一方
向コードを持つ各点より構成されるストロークを抽出す
るストローク抽出部、１６は各ストロークの数、長さ、
面積等全計算する特微量計算部、１７はめた特徴量を用
いて文字認識を行なう認識部でろ４以上のように構成さ
れた文字認識装置について以下その動作について説明す
る。In FIG. 1, 10 is a character pattern input signal, and 11 is a character pattern input signal.
1 is a binarization unit that binarizes the character pattern input signal 10;
2 is a memory unit for storing character patterns; 13 is a direction code assigning unit for determining the direction of each point of the character pattern; 14 is a direction code buffer for storing the obtained direction code for one horizontal scanning line; 16 is a part for storing the character pattern; A stroke extraction unit extracts strokes made up of points having connected codes in the same direction; 16 is the number and length of each stroke;
The operation of a character recognition device having four or more components, including a feature amount calculation section for calculating all areas, etc., and a recognition section for performing character recognition using 17 feature amounts will be described below.

第１図において、認識すべき文字パターン信号１０は２
値化部１１で白黒に２値化され文字パターンとしてメモ
リ部１２に格納される。In FIG. 1, the character pattern signal 10 to be recognized is 2
It is binarized into black and white in the digitization section 11 and stored in the memory section 12 as a character pattern.

方向コード付与部１３では、メモリ部１２０文字パター
ンを水平方向に順次読出し、文字パターンの着目点が第
２図に示す各方向コードの方向に、方向を決定するだめ
の基準長Ｎ以上の連続する黒点列の一部かどうかを判断
し、Ｎ画素以上の黒点列の一部の場合は該当する方向コ
ード全着目点に付与する。例えば第３図（ａ）の文字パ
ターンは、基準長Ｎ　＝　ｓの場合、各点の方向コード
は第３図（ｂ）のようになる。The direction code assigning unit 13 sequentially reads out the character patterns from the memory unit 120 in the horizontal direction, and selects consecutive character patterns whose points of interest are in the direction of each direction code shown in FIG. It is determined whether the point is part of a black dot string, and if it is a part of a black dot string of N or more pixels, a corresponding direction code is given to all points of interest. For example, in the character pattern shown in FIG. 3(a), when the reference length N=s, the direction code of each point becomes as shown in FIG. 3(b).

方向コード付与部１３でめられた各点の方向コードは方
向コードバッファ１４に１水平走査分順次蓄えられる。The direction code of each point determined by the direction code adding section 13 is sequentially stored in the direction code buffer 14 for one horizontal scan.

ストローク抽出部１６では方向コード付与部１３による
着目点の方向コードと、第４図に示すような着目点の１
画素前の点の方向コードおよび方向コードバッフ７部よ
り得られる１水平走査線前の各点の方向コードとを比較
し、少くとも１点の方向コードと一致すれば着目点に上
記該痛点の属するストローク番号を付与し、全く一致し
ない場合は着目点に新しいストローク番号全付与する。The stroke extraction unit 16 uses the direction code of the point of interest provided by the direction code adding unit 13 and the direction code of the point of interest as shown in FIG.
The direction code of the point in front of the pixel and the direction code of each point one horizontal scanning line before obtained from the direction code buffer 7 are compared, and if the direction code of the point in front of the pixel matches the direction code of at least one point, the point of interest is assigned to the point of interest. The stroke number to which it belongs is assigned, and if they do not match at all, all new stroke numbers are assigned to the point of interest.

例えば第６図（ａ）の文字パターンの各方向毎のストロ
ークは第６図Φ）　、（ｃ）　、　（ｄ）　、　（ｅ）
の通りであることがわかる。For example, the strokes in each direction of the character pattern in Figure 6(a) are Φ), (c), (d), (e) in Figure 6.
It turns out that this is true.

また特微量計算部１６ではストローク抽出部１６でめた
文字パターンの各ストローク毎の長さ、面積、数、位置
情報等の特徴量をめる。特微量計算部１６で得られた各
ストローク毎の特徴量を各文字パターン毎の認識用辞書
と比較し、認識部、１７により認識を行なう。In addition, the feature amount calculation section 16 calculates feature amounts such as length, area, number, position information, etc. for each stroke of the character pattern drawn by the stroke extraction section 16. The feature amount for each stroke obtained by the feature amount calculation section 16 is compared with a recognition dictionary for each character pattern, and recognition is performed by the recognition section 17.

例えば第６図において、文字パターン（ａ）の各方向コ
ード毎ストロークは躯６図（ｂ）　、　（Ｃ）　ｌ　（
ｄ）　、　（ｅ）の通りであシ、第５図（ａ）と第６図
（−）の各文字パターンの方向毎のストロークの特徴量
は第７図の通りであり、第６図（ａ）と第６図（ａ）の
各方向毎のストロークの数、長さは明らかに異なり、各
文字に対応して各方向コード毎のストローク数、長さ等
の特徴量を辞書として持てば文字の認識を行なえる事が
容易にわかる。For example, in Fig. 6, the strokes for each direction code of the character pattern (a) are shown in Fig. 6 (b), (C) l (
As shown in d) and (e), the stroke feature amount for each direction of each character pattern in Figures 5 (a) and 6 (-) is as shown in Figure 7, and as shown in Figure 6 ( The number and length of strokes in each direction in a) and FIG. It is easy to see that character recognition is possible.

また本実施例では第２図に示す方向コードは４方向とし
たが任意の数、方向でも艮い。Further, in this embodiment, the direction codes shown in FIG. 2 are set to four directions, but any number or directions may be used.

発明の効果本発明の文字認識装置は、文字パターンの各点の方向コ
ード？付与し、各点の方向コードをもとに各方向コード
毎のストロークを抽出し、抽出した各方向毎のストロー
クの特徴量より文字の認識を行なうように構成↓たもの
で、従来の方式のように細線化の必要はなく、ハード構
成も簡単であり、その実用的価値は高い。Effects of the Invention The character recognition device of the present invention recognizes the direction code of each point of a character pattern. It is configured to extract strokes for each direction code based on the direction code of each point, and recognize characters based on the feature amount of the extracted strokes for each direction, which is different from the conventional method. There is no need for line thinning, the hardware configuration is simple, and its practical value is high.

[Brief explanation of the drawing]

第１図は本発明における一実施例の文字認識装置の構成
を示すブロック図、第２図は方向コードの例を示す図、
第３図は方向コード付与の例を示す図、第４図はストロ
ーク抽出における連結性をチェックする点の説明図、第
６図、第６図はストローフ抽出例を示す図、第７図は２
種類の文字パターンの各ストロークの特徴量の比較図で
ある。１０・・・・・・文字パターン入力信号、１１・・・・
・・２値化部、１２・・・・・・メセリ部、１３・・・
・・・方向コード付与部、１４・・・・・・方向コード
バッフ１部、１６・・・・・・ストローク抽出部、１６
・・・・・・特徴量計算部、１７・・・・・・認識部。代理人の氏名　弁理士　中　尾　敏　男　ほか１名ＷＡ
　Ｉ　図第　３５１’　（（Ｌ） −第（Ｉｏ−り走査線ｅ　（Ｌ　。＠１走査限ａ− 第　５　図（′ｂン第６図（ｂ）FIG. 1 is a block diagram showing the configuration of a character recognition device according to an embodiment of the present invention, FIG. 2 is a diagram showing an example of a direction code,
Fig. 3 is a diagram showing an example of adding direction codes, Fig. 4 is an explanatory diagram of checking connectivity in stroke extraction, Figs.
It is a comparison diagram of the feature amount of each stroke of different types of character patterns. 10...Character pattern input signal, 11...
...Binarization section, 12...Meseri section, 13...
... Direction code adding section, 14 ... Direction code buff 1 part, 16 ... Stroke extraction section, 16
. . . Feature value calculation section, 17 . . . Recognition section. Name of agent: Patent attorney Toshio Nakao and one other WA
I Figure 351' ((L) -th (Io-ri scanning line e (L. @1 scan limit a- Figure 5 ('b) Figure 6 (b)

Claims

[Claims]

The binary character pattern data string obtained by scanning the character pattern area to be recognized is input, and the j-th point of interest P0 on the i-th horizontal scanning line is the reference length N for determining the direction code.
A direction code assigning unit that determines whether the point of interest PiJ is included in a part of the black dot array in any continuous direction and assigns the direction code to the point of interest PiJ; If the direction code of at least one point matches the direction code of the point one horizontal scanning line before and the point one pixel before the point of interest, the stroke number to which the matching point belongs is assigned to the point of interest, and if there is no match at all. In this case, a stroke extraction section that assigns a new stroke number to the point of interest, and a stroke extraction section that calculates the number of strokes in each direction, length, area, etc., are used to calculate the number of strokes, length, area, etc. of the target character. A character recognition device that performs recognition.