JPS5852783A

JPS5852783A - Feature extraction system

Info

Publication number: JPS5852783A
Application number: JP56151051A
Authority: JP
Inventors: Michiaki Nakanishi; 道明中西
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1981-09-24
Filing date: 1981-09-24
Publication date: 1983-03-29
Also published as: JPH0139156B2

Abstract

PURPOSE: To perform character recognition accurately and quickly by extracting the line segments that form the contour of a character pattern portion hidden behind another character pattern portion, and using them as character features. CONSTITUTION: A character pattern is scanned horizontally or vertically, and the line segments forming the contour of the character pattern are extracted as sets of black-white change points Q1-Q4 between the background and the character and used as character features for character recognition. In addition, a line segment l6, which forms the contour of a character pattern portion hidden behind the character pattern portion that appears first in the horizontal or vertical scanning direction, is extracted as a set of odd-numbered black-white change points (Q3, ...) other than the first change point, and is likewise used as a character feature.

Description

[Detailed Description of the Invention] The present invention relates to a method of extracting the character features used for character recognition, and aims to make it possible to extract features of portions that remain hidden under the conventional method.

In character recognition, the features of a character as seen from the left and as seen from above are obtained and used as data for recognition. Take the numeral 3 shown in Fig. 1 as an example. The handwritten numeral 3 is scanned with a line scanner or the like to obtain a binary video signal, which is temporarily stored in memory. The video signal of the numeral 3 is then cut out of that memory (it has generally been scanned and stored together with other handwritten characters). On each horizontal scanning line, scanned from left to right, the point at which the signal first changes from 0 (background) to 1 (character) is found, and the set of these points is taken as the contour feature of the numeral 3 as seen from the left.
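The left-to-right scan described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the bitmap `GLYPH_3`, the function name `left_feature`, and the use of `None` for a scanning line with no 0 to 1 change are all hypothetical choices, not taken from the patent.

```python
def left_feature(image):
    """Column of the first 0 -> 1 change on each row, or None if the row is blank."""
    profile = []
    for row in image:
        first = None
        previous = 0  # every scan starts in the background
        for x, pixel in enumerate(row):
            if previous == 0 and pixel == 1:
                first = x
                break
            previous = pixel
        profile.append(first)
    return profile


# Hypothetical toy bitmap of a "3" ('#' = character, '.' = background).
GLYPH_3 = [
    ".####.",
    "#....#",
    "#....#",
    "..###.",
    "#....#",
    "#....#",
    ".####.",
]
image_3 = [[1 if c == "#" else 0 for c in row] for row in GLYPH_3]

print(left_feature(image_3))
```

For this toy glyph the result is one X value per scanning line; the rows whose left tip curls over a hollow report the tip, not the stroke behind it.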

Under this logic, line segments l1, l2 and l3 become the feature of the character as seen from the left (hereinafter, the left feature).

Similarly, finding the set of points at which the signal first changes from 0 to 1 on each vertical scanning line, scanned from top to bottom, yields line segments l4 and l5 of Fig. 2, which constitute the feature of the character as seen from above (the upper feature). These left and upper features, together with the right and lower features obtained in the same way, are easy to extract. They also make good feature data for characters with simple patterns such as the numerals 0, 1, 2, ..., and can be used effectively for character recognition. For example, among the numerals 0 to 9 only the 3 has a line segment l2 recessed between line segments l1 and l3, so if the target is known to be one of 0 to 9, the numeral 3 can be recognized from this one feature alone. However, many features are hidden in the shadow and are not extracted. In the left feature, for example, the deeply recessed portions P1, P2 and so on are not extracted. In the upper feature, only the part that forms, so to speak, a roof or eaves is extracted, and everything beneath it is hidden and lost. Feature extraction is therefore insufficient for characters with somewhat more complex patterns, such as kana, to say nothing of kanji, and some characters cannot be separated and distinguished from others.
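To make the four conventional features and their blind spot concrete, here is a minimal Python sketch. The helper names `first_hit` and `directional_features`, the toy bitmap, and the convention of measuring right and lower features from the right and bottom edges are assumptions made for illustration.

```python
def first_hit(line):
    """Index of the first character pixel met along a scan, or None."""
    for pos, pixel in enumerate(line):
        if pixel == 1:
            return pos
    return None


def directional_features(image):
    """First-hit profiles seen from the left, right, top and bottom."""
    rows = [list(r) for r in image]
    cols = [list(c) for c in zip(*image)]  # transpose for vertical scans
    return {
        "left": [first_hit(r) for r in rows],
        "right": [first_hit(r[::-1]) for r in rows],  # distance from right edge
        "upper": [first_hit(c) for c in cols],
        "lower": [first_hit(c[::-1]) for c in cols],  # distance from bottom edge
    }


# Hypothetical toy bitmap of a "3" ('#' = character, '.' = background).
GLYPH_3 = [
    ".####.",
    "#....#",
    "#....#",
    "..###.",
    "#....#",
    "#....#",
    ".####.",
]
image_3 = [[1 if c == "#" else 0 for c in row] for row in GLYPH_3]

for name, profile in directional_features(image_3).items():
    print(name, profile)
```

In this toy case the upper profile only ever reports rows 0 and 1, the "roof" of the glyph; the middle bar on row 3 never appears, which is the kind of loss the text describes.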

The present invention therefore aims to enable accurate and rapid character recognition by making it possible to extract the features of the hidden portions as well. That is, the present invention is a feature extraction method in which a character pattern is scanned in the horizontal or vertical direction, the line segments constituting the contour of the character pattern are extracted as sets of black-white change points between the background and the character, and these are used as the character features for character recognition; it is characterized in that the line segments forming the contour of a character pattern portion hidden behind the character pattern portion that appears first in the scanning direction are extracted as sets of the odd-numbered black-white change points, other than the first, along each horizontal or vertical scanning line, and are used as said character features. This is described in detail below with reference to the drawings.

The line segments l1, l2, ... described above are recognized as follows. The scanning lines differ in Y coordinate, so finding, for each Y coordinate, the coordinate (X coordinate) of the first point at which a 0→1 inversion occurs yields a data group such as (Y0, 0), (Y1, 0), (Y2, 0), (Y3, X3), (Y4, X4), .... Here 0 means "there is no 0→1 inversion point"; if such entries are also written simply as X_i (i = 0, 1, 2, ...), the data can be expressed as (X_i, Y_i). These data are easy to process if each X_i is recorded at memory address Y_i. The address counter is incremented by one at a time, the data X1, X2, ... at addresses Y1, Y2, Y3, ... are read out in succession, and ΔX = X_{i+1} - X_i is computed. Within line segments l1, l2 and l3 the difference ΔX is small, and from this each can be judged to be a continuous line segment. Furthermore, if ΔX < 0 the segment slopes down to the left, and if ΔX > 0 it slopes down to the right.
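The difference test can be illustrated with a short Python sketch. The profile values, the function names, and above all the threshold `jump=3` that separates a "small" ΔX from a break are hypothetical; the text gives no numeric threshold.

```python
def delta_x(xs):
    """Differences X[i+1] - X[i] between successive scanning lines."""
    return [b - a for a, b in zip(xs, xs[1:])]


def describe(dx, jump=3):
    """Label one difference; `jump` is an assumed continuity threshold."""
    if abs(dx) >= jump:
        return "break"        # too large: the line segment is interrupted
    if dx < 0:
        return "down-left"    # X shrinks as Y grows
    if dx > 0:
        return "down-right"   # X grows as Y grows
    return "vertical"


# Hypothetical first-inversion X values, indexed by scanning line (address Y).
xs = [5, 4, 3, 3, 9, 10, 11, 4, 3]

dxs = delta_x(xs)
print(dxs)
print([describe(d) for d in dxs])
```

Indexing the list by scanning line plays the role of storing X_i at address Y_i, so reading the list in order corresponds to stepping the address counter.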

At the boundaries between line segments l1 and l2, and between l2 and l3, ΔX suddenly becomes large. In such a case the line segment can be said to be interrupted, or at most joined only by a segment parallel to the horizontal scanning lines. The ends of a line segment are determined by these discontinuity points and by the points at which X_i changes from 0 to some value. Line segment l1 has the latter as its start and the former as its end, line segment l2 has the former at both ends, and line segment l3 has the former as its start and the latter as its end. When both ends are discontinuity points, as with line segment l2, then in a character that is connected into one piece, as the numerals 0 to 9 are, there are character pattern portions at both ends; that is, both ends are closed. Line segments l1 and l3, by contrast, can be said to be open at both ends. Having a line segment that is closed at both ends is, as noted above, a major characteristic of the numeral 3.
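A possible way to split the profile into segments and record what kind of point bounds each end is sketched below in Python. The labels `"edge"` (the value appears from, or returns to, "no inversion") and `"jump"` (a discontinuity in X), the threshold `jump=3`, and the sample data are assumptions for illustration; `None` stands in for the 0 that the text uses to mark a line without an inversion.

```python
def split_segments(xs, jump=3):
    """Split a profile into segments and label how each end is bounded.

    xs[y] is the first-inversion X on scanning line y, or None if there is none.
    """
    segments = []
    current = None
    previous = None
    for y, x in enumerate(xs):
        if x is None:
            if current is not None:          # value disappears: segment ends
                current["end"] = "edge"
                segments.append(current)
                current = None
        elif current is None:                # value appears: segment starts
            current = {"rows": [y], "start": "edge"}
        elif abs(x - previous) >= jump:      # discontinuity: close and reopen
            current["end"] = "jump"
            segments.append(current)
            current = {"rows": [y], "start": "jump"}
        else:
            current["rows"].append(y)
        previous = x
    if current is not None:
        current["end"] = "edge"
        segments.append(current)
    return segments


# Hypothetical profile: three runs separated by two large jumps.
xs = [None, 1, 1, 2, 8, 8, 7, 1, 2, 2, None]
segments = split_segments(xs)
closed = [s for s in segments if s["start"] == "jump" and s["end"] == "jump"]

for s in segments:
    print(s)
```

With this sample the middle run is the only one bounded by a jump at both ends, matching the role the text gives to l2.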

The numeral 3 also has the hollows P1 and P2, and detecting these as well would capture the features of the numeral 3 even better. These hollows could not be extracted because they lie in the shadow of line segments l1 and l3. If, however, the extraction logic is changed from "the first 0→1 change point" to "the third (more generally, an odd-numbered) 0→1 change point", the shadowed portions can be extracted.

That is, as shown in Fig. 3, the 0-1 inversions along a scanning line la passing through the hollow P1 are the four points Q1, Q2, Q3 and Q4, and the inversion point Q3, which defines the contour of the hidden portion l6, is the third. With this "third inversion point" logic the line segments l6 and l7 can be extracted. Combining them with line segment l2, obtained by the method of Fig. 1, that is, by the "first inversion point" logic, yields a line segment l8 that reaches into the deepest part of the hollows. Using this line segment l8 makes recognition of the numeral "3" still more reliable and easier. When ΔX is computed along this line segment it changes positive, negative, positive, negative, which expresses the characteristics of the numeral 3 well. Line segment l8 is then combined with line segments l1 and l3: in the vertical direction the segments occur in the order l1, l8, l3, and in the horizontal direction line segment l8 lies to the right of line segments l1 and l3 with its two ends overlapping them. With this logic, the numeral 3 can be separated and distinguished from other characters even when it is handwritten quite roughly.
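The "third inversion point" logic can be sketched as follows. This is an illustrative Python reading of the text, not the patent's circuit: the names `change_points` and `nth_change`, the toy bitmap, the choice to count a character run that reaches the edge of the line as closing there, and the simple rule "use the third change point where one exists, otherwise the first" for the combined contour are all assumptions. Note that change points are counted over both 0→1 and 1→0 inversions, so the third one is the second entry into the character.

```python
def change_points(line):
    """Positions of every 0 -> 1 and 1 -> 0 inversion along one scanning line."""
    points = []
    previous = 0  # the scan starts in the background
    for pos, pixel in enumerate(line):
        if pixel != previous:
            points.append(pos)
            previous = pixel
    if previous == 1:
        points.append(len(line))  # assumed: a run reaching the edge ends there
    return points


def nth_change(line, n):
    """The n-th inversion point (1-based), or None if the line has fewer."""
    points = change_points(line)
    return points[n - 1] if len(points) >= n else None


# Hypothetical toy bitmap of a "3" whose left tips overhang two hollows.
GLYPH_3 = [
    ".####.",
    "#....#",
    "#....#",
    "..###.",
    "#....#",
    "#....#",
    ".####.",
]
image_3 = [[1 if c == "#" else 0 for c in row] for row in GLYPH_3]

first = [nth_change(row, 1) for row in image_3]   # conventional left feature
third = [nth_change(row, 3) for row in image_3]   # contour hidden behind it
deep = [t if t is not None else f for f, t in zip(first, third)]
dx = [b - a for a, b in zip(deep, deep[1:])]
signs = ["+" if d > 0 else "-" for d in dx if d != 0]

print(change_points(image_3[1]))  # the four points playing the role of Q1..Q4
print(third)
print(deep, signs)
```

On this blocky glyph the nonzero differences along the combined contour alternate in sign, which is the pattern the text associates with the numeral 3; real handwriting would need a tolerance on ΔX.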

Fig. 4 shows an example of a handwritten "チ" and Fig. 5 an example of a handwritten "テ". It is not unusual for the presence or absence of the protrusion R to be the only point that distinguishes the two. Because R is hidden by the upper character pattern portion B, however, it cannot be extracted with the "first change point" logic of Fig. 1. If instead the logic of the third change point is used, specifically the third change point on each vertical scanning line, line segments l9 and l10 can be extracted (their two end portions are those extracted by the "first change point"). Once line segments l9 and l10 have been extracted, the presence or absence of the protrusion R can be checked by asking whether the change in the Y coordinate is uniform (taking the vertical direction as X), and テ and チ can thus be told apart. There are many confusable characters. Katakana pairs such as つ and 力, り and つ, ミ and シ, り and ン, and 二 and ン, for example, are hard to distinguish by the simple left feature, upper feature and the like when they are written roughly. These too can be expected to become distinguishable if the shadowed portions are extracted with the "odd-numbered" logic, or if that result is combined with the simple left feature and so on.
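The チ/テ check can be sketched with vertical scans. The two bitmaps below are hypothetical, heavily simplified stand-ins for the handwritten characters of Figs. 4 and 5, and the uniformity test (all third change points at the same height) is an assumed reading of the check on whether the coordinate change is uniform; real handwriting would need a tolerance.

```python
def third_change(line):
    """Position of the third inversion along one scanning line, or None."""
    changes = []
    previous = 0  # the scan starts in the background
    for pos, pixel in enumerate(line):
        if pixel != previous:
            changes.append(pos)
            previous = pixel
    return changes[2] if len(changes) >= 3 else None


def hidden_upper_contour(glyph):
    """Third change point on each top-to-bottom scan of a '#'/'.' bitmap."""
    image = [[1 if c == "#" else 0 for c in row] for row in glyph]
    return [third_change(list(col)) for col in zip(*image)]


def has_protrusion(glyph):
    """True if the contour hidden under the top stroke is not level."""
    heights = [y for y in hidden_upper_contour(glyph) if y is not None]
    return len(set(heights)) > 1


# Hypothetical toy bitmaps: the stem of CHI pokes above the second bar.
TE = [
    ".#####.",
    ".......",
    ".......",
    "#######",
    "...#...",
    "..#....",
]
CHI = [
    ".#####.",
    ".......",
    "...#...",
    "#######",
    "...#...",
    "..#....",
]

print(hidden_upper_contour(TE), has_protrusion(TE))
print(hidden_upper_contour(CHI), has_protrusion(CHI))
```

A first-change-point scan from above would return the same values for both bitmaps, since the top stroke covers the protrusion in each column that matters.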

In the recognition of handwritten numerals and kana, a large number of features, for example 1000 kinds, are used. They make up a tree circuit of about 50 stages, and character recognition is carried out by following that tree. The features extracted by the present invention are used as further additions to this set. If the features are well chosen, a result can be reached after following only a relatively small number of stages, which raises the speed of character recognition. In this respect the method of the present invention is highly effective.

[Brief Description of the Drawings]

Figs. 1 and 2 are explanatory diagrams of the conventional method, and Figs. 3 to 5 are explanatory diagrams of the method of the present invention. In the drawings, l1 to l10 are line segments, and l6, l7, l9 and l10 are the line segments of character pattern portions hidden in the shadow. Applicant: Fujitsu Ltd. Agent: Patent Attorney Minoru Aoyagi.

Claims

[Claims] A feature extraction method in which a character pattern is scanned in the horizontal or vertical direction, line segments constituting the contour of the character pattern are extracted as sets of black-white change points between the background and the character, and these line segments are used as character features for character recognition, the method being characterized in that line segments forming the contour of a character pattern portion hidden behind the character pattern portion that appears first in the scanning direction are extracted as sets of the odd-numbered black-white change points, other than the first, along each horizontal or vertical scanning line, and are used as said character features.