JPH0420228B2

JPH0420228B2 -

Info

Publication number: JPH0420228B2
Application number: JP58164141A
Authority: JP
Inventors: Yoshuki Yamashita
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1983-09-08
Filing date: 1983-09-08
Publication date: 1992-04-02
Also published as: JPS6057474A

Description

【発明の詳細な説明】（技術分野）本発明は高速で精度の良い文字図形パターンの
特徴抽出方法に関するものである。DETAILED DESCRIPTION OF THE INVENTION (Technical Field) The present invention relates to a method for extracting features of character/graphic patterns at high speed and with high accuracy.

（従来技術）従来の文字認識装置においては文字パターンよ
りストロークを抽出し、それら抽出されたストロ
ークの位置、長さ、ストローク間の相互関係等を
用いて認識する方式が多く採用されている。この
種の装置においては、(1)文字図形の輪郭を追跡す
ることにより検出された輪郭点系列について曲率
を計算し、その曲率の大きな値の点を分割点とし
て輪郭系列を分割し、分割された系列を組合わせ
ることによりストロークを抽出するか、又は(2)文
字図形パターンの細線化処理を行なつて骨格化
し、その骨格パターンの連結性及び骨格パターン
を追跡し急激な角度の変化点等を検出してストロ
ークを抽出し、該抽出されたストロークについて
幾何学的な特徴等を抽出し文字図形の識別を行な
つていた。しかしながら、(1)の方法は、文字図形
パターンが大きくなり又文字図形パターンが複雑
化すると、その処理量が増大しそのため処理速度
の低下を招き、(2)の方法は、文字図形パターンを
細線化する必要があり又その細線化によるパター
ンのひずみ、ヒゲ等の問題があり、その後の処理
が複雑なものとなる欠点がある。また、これらの
欠点を除去する為に文字図形パターン内の各点か
ら各方向へ走査線を出し、走査線と文字線との交
差数を当該注目点の特徴として抽出する方法が提
案されているが、抽出する前記交差数という特徴
は文字線の傾斜によつてばらつくので抽出する特
徴が不安定であり、又、単に交差数のみでは文字
の構造を反映するのに不充分である。(Prior Art) Conventional character recognition devices often employ a method of extracting strokes from a character pattern and recognizing them using the positions, lengths, mutual relationships between strokes, etc. of the extracted strokes. This type of device (1) calculates the curvature of a contour point series detected by tracing the contour of a character figure, divides the contour series using points with a large value of curvature as dividing points; (2) Extract the strokes by combining the series, or (2) perform thinning processing on the character/figure pattern to create a skeleton, and track the connectivity and skeleton pattern of the skeleton pattern to detect points of sudden angle change, etc. The strokes are extracted by detecting the strokes, and the geometric features and the like are extracted from the extracted strokes to identify the characters and figures. However, in method (1), when the character/graphic pattern becomes large or complex, the amount of processing increases, resulting in a decrease in processing speed. Furthermore, there are problems such as pattern distortion and whiskers due to thinning, and subsequent processing becomes complicated. In addition, in order to eliminate these defects, a method has been proposed in which scanning lines are drawn in each direction from each point in a character figure pattern, and the number of intersections between the scanning line and the character line is extracted as a feature of the point of interest. However, the feature of the number of intersections to be extracted varies depending on the slope of the character line, so the extracted feature is unstable, and the number of intersections alone is insufficient to reflect the structure of the character.

（発明の目的及び構成）本発明はこれらの欠点を改善するもので、文字
図形パターンの所望の方向のストローク成分をあ
らわすサブパターンを抽出し、サブパターンの各
点もしくは任意の点から所定の方向をみた文字線
の分布を表わす特徴として、前記所定の方向と直
角をなす文字線成分を含む前記サブパターンのみ
に着目して、注目点と前記所定の方向における文
字線との距離のＫ乗和を抽出し、もつて文字図形
パターンの特徴となすことにより、従来技術にお
ける前記特徴のばらつきの度合を軽減して高精度
の文字認識装置に供し得る特徴抽出方法を提供す
ることを目的とする。(Objects and Structure of the Invention) The present invention aims to improve these drawbacks by extracting a sub-pattern representing a stroke component in a desired direction of a character/figure pattern, and extracting a sub-pattern representing a stroke component in a desired direction from each point or arbitrary point of the sub-pattern. As a feature representing the distribution of character lines when looking at It is an object of the present invention to provide a feature extraction method that can reduce the degree of variation in the features in the prior art and can be used in a high-precision character recognition device by extracting these features and treating them as features of a character/figure pattern.

（発明の実施例）第１図は本発明の実施例である文字認識装置を
示したものである。第１図において、１は光電変
換部、２はパターンレジスタ、３は線幅計算部、
４は文字枠検出部、５は分割点決定部、６は垂直
サブパターン（以下VSPとも称す）抽出部、７
は垂直サブパターンメモリ、８はVSP水平特徴
抽出部、９はVSP水平特徴パターンメモリ、１
０は水平サブパターン（以下HSPとも称す）抽
出部、１１は水平サブパターンメモリ、１２は
HSP垂直特徴抽出部、１３はHSP垂直特徴パタ
ーンメモリ、１４は右斜めサブパターン（以下
RSPとも称す）抽出部、１５は右斜めサブパタ
ーンメモリ、１６はRSP左斜め特徴抽出部、１
７はRSP左斜め特徴パターンメモリ、１８は左
斜めサブパターン（以下LSPとも称す）抽出部、
１９は左斜めサブパターンメモリ、２０はLSP右
斜め特徴抽出部、２１はLSP右斜め特徴パターン
メモリ、２２はVSP水平特徴ベクトル抽出部、
２３はHSP垂直特徴ベクトル抽出部、２４は
RSP左斜め特徴ベクトル抽出部、２５はLSP右
斜め特徴ベクトル抽出部、２６は識別部、２７は
辞書、２８は文字名出力である。(Embodiment of the invention) FIG. 1 shows a character recognition device that is an embodiment of the invention. In FIG. 1, 1 is a photoelectric conversion unit, 2 is a pattern register, 3 is a line width calculation unit,
4 is a character frame detection unit, 5 is a division point determination unit, 6 is a vertical sub-pattern (hereinafter also referred to as VSP) extraction unit, 7
is a vertical sub-pattern memory, 8 is a VSP horizontal feature extraction unit, 9 is a VSP horizontal feature pattern memory, 1
0 is a horizontal sub-pattern (hereinafter also referred to as HSP) extractor, 11 is a horizontal sub-pattern memory, and 12 is a
HSP vertical feature extraction unit, 13 is HSP vertical feature pattern memory, 14 is right diagonal sub pattern (hereinafter referred to as
15 is a right diagonal sub-pattern memory; 16 is an RSP left diagonal feature extraction unit; 1
7 is an RSP left diagonal feature pattern memory; 18 is a left diagonal sub-pattern (hereinafter also referred to as LSP) extraction unit;
19 is a left diagonal sub-pattern memory, 20 is an LSP right diagonal feature extraction unit, 21 is an LSP right diagonal feature pattern memory, 22 is a VSP horizontal feature vector extraction unit,
23 is an HSP vertical feature vector extraction unit; 24 is an HSP vertical feature vector extraction unit;
25 is an LSP right diagonal feature vector extraction unit, 26 is an identification unit, 27 is a dictionary, and 28 is a character name output.

以下第１の実施例の動作につき説明する。 The operation of the first embodiment will be explained below.

まず、読取機構にセツトされた帳票上の文字・
図形パターンは光電変換部１において２値の量子
化されたデイジタル電気信号に変換され、パター
ンレジスタ２に格納される。それと同時に線幅計
算部３において入力パターンの線幅が計算され
る。垂直サブパターン抽出部６はパターンレジス
タ２について水平走査を全面行なつて、黒ビツト
（文字線部を黒ビツト・文字背景部を白ビツトと
する）の連続の長さと線幅計算部３において計算
された線幅との関係により垂直サブパターン
（VSP）を抽出し垂直サブパターンメモリ７に格
納する。同様に水平サブパターン抽出部１０、右
斜めサブパターン抽出部１４、左斜めサブパター
ン抽出部１８はそれぞれ水平走査により水平サブ
パターン（HSP）、右斜め45°走査により右斜めサ
ブパターン（RSP）、左斜め45°走査により左斜め
サブパターン（LSP）を抽出し各サブパターンメ
モリに格納する。第２図は原パターンと各サブパ
ターンの例であり、(a)は原パターン、(b)は垂直サ
ブパターン（VSP）、(c)は水平サブパターン
（HSP）、(d)は右斜めサブパターン（RSP）、(e)は
左斜めサブパターン（LSP）である。 First, the characters and characters on the form set in the reading mechanism.
The graphic pattern is converted into a binary quantized digital electrical signal in the photoelectric conversion section 1 and stored in the pattern register 2. At the same time, the line width of the input pattern is calculated in the line width calculating section 3. The vertical sub-pattern extractor 6 performs horizontal scanning over the entire pattern register 2, and calculates the continuous length and line width of black bits (character lines are black bits and character background parts are white bits). A vertical sub-pattern (VSP) is extracted based on the relationship with the determined line width and stored in the vertical sub-pattern memory 7. Similarly, the horizontal sub-pattern extraction unit 10, right diagonal sub-pattern extraction unit 14, and left diagonal sub-pattern extraction unit 18 generate a horizontal sub-pattern (HSP) by horizontal scanning, a right-diagonal sub-pattern (RSP) by scanning 45 degrees to the right, respectively. A left diagonal subpattern (LSP) is extracted by scanning diagonally left at 45° and stored in each subpattern memory. Figure 2 shows an example of the original pattern and each sub-pattern; (a) is the original pattern, (b) is the vertical sub-pattern (VSP), (c) is the horizontal sub-pattern (HSP), and (d) is diagonal to the right. subpattern (RSP), (e) is a left diagonal subpattern (LSP).

尚、本実施例ではパターンレジスタ２並びに各
サブパターンメモリ７，１１，１５，１９はそれ
ぞれ128×128ビツトのメモリ、つまりＸ方向の幅
は128ビツト、Ｙ方向の幅が128ビツトのパターン
メモリである。文字枠検出部４はパターンレジス
タ２内の文字図形パターンに外接する方形の枠
（以後文字枠と称する）を検出し、パターンレジ
スタ２で定義される２次元平面における前記文字
枠を規定する為の位置座標を分割点決定部５へ送
出する。 In this embodiment, the pattern register 2 and each of the sub-pattern memories 7, 11, 15, and 19 are 128 x 128-bit memories, that is, pattern memories with a width of 128 bits in the X direction and a width of 128 bits in the Y direction. be. The character frame detection unit 4 detects a rectangular frame (hereinafter referred to as a character frame) circumscribing the character figure pattern in the pattern register 2, and detects a rectangular frame (hereinafter referred to as a character frame) that defines the character frame in a two-dimensional plane defined by the pattern register 2. The position coordinates are sent to the division point determining section 5.

以後の説明においては、パターンレジスタ２で
定義される２次元平面における左下を原点とする
座標系を使用し、水平方向をＸ方向、垂直方向を
Ｙ方向とする。また、各サブパターンメモリにお
いて文字線部を黒点、それ以外を白点とする。
VSP水平特徴抽出部８は垂直サブパターンメモ
リ７について水平走査を全面行なうことにより垂
直サブパターンメモリ７内の各点について水平特
徴を抽出する。 In the following description, a coordinate system with the origin at the lower left of a two-dimensional plane defined by the pattern register 2 will be used, with the horizontal direction being the X direction and the vertical direction being the Y direction. Further, in each sub-pattern memory, character line portions are designated as black dots, and other portions are designated as white dots.
The VSP horizontal feature extractor 8 extracts horizontal features for each point in the vertical sub-pattern memory 7 by performing horizontal scanning over the entire vertical sub-pattern memory 7.

第３図はVSP水平特徴抽出部等の特徴抽出部
の構成、並びに特徴抽出部と、サブパターンメモ
リ、特徴パターンメモリとの接続関係を示したも
のである。第３図の点線枠内は特徴抽出部を示し
たもので、ここではVSP水平特徴抽出部８を示
しているものとする。尚、HSP垂直特徴抽出部
１２、RSP左斜め特徴抽出部、LSP右斜め特徴
抽出部２０も同様の構成となつている。第３図の
VSP水平特徴抽出部において、３１は当該特徴
抽出部の全体を制御する制御回路、３２はサブパ
ターンメモリを走査中にサブパターンメモリの内
容を参照して白色から黒色へ変化する変化点を検
出する変化点検出回路、３３は白点から黒点へ変
化した時の黒点のＸ座標を変化点として格納する
ための変化点メモリ、３４は変化点メモリの番地
指定を行う変化点カウンタ、３５はパターンメモ
リアドレスカウンタ、３６は比較器、３７は演算
回路である。また３８及び３９はそれぞれレジス
タＡ及びレジスタＢであり、各特徴抽出部におい
て抽出する２方向の特徴、例えばVSP水平特徴
抽出部においては左右両方向の特徴を抽出すると
きに使用するものであり、左方向レジスタ
（REGL）、右方向レジスタ（REGR）と呼ぶこと
にする。 FIG. 3 shows the configuration of a feature extraction unit such as the VSP horizontal feature extraction unit, and the connection relationship between the feature extraction unit, sub-pattern memory, and feature pattern memory. The area within the dotted line frame in FIG. 3 shows the feature extraction section, and here it is assumed that the VSP horizontal feature extraction section 8 is shown. Note that the HSP vertical feature extraction section 12, RSP left diagonal feature extraction section, and LSP right diagonal feature extraction section 20 have similar configurations. Figure 3
In the VSP horizontal feature extraction unit, 31 is a control circuit that controls the entire feature extraction unit, and 32 is a control circuit that refers to the contents of the sub-pattern memory while scanning the sub-pattern memory to detect a change point where the color changes from white to black. A changing point detection circuit, 33 a changing point memory for storing the X coordinate of a black point when it changes from a white point to a black point as a changing point, 34 a changing point counter for specifying the address of the changing point memory, and 35 a pattern memory. An address counter, 36 a comparator, and 37 an arithmetic circuit. Further, 38 and 39 are registers A and B, respectively, which are used when extracting features in two directions in each feature extraction section, for example, features in both left and right directions in the VSP horizontal feature extraction section. We will call them the direction register (REGL) and right direction register (REGR).

第３図及び４図のフローチヤートを用いて
VSP水平特徴抽出部における水平特徴の抽出動
作について次に説明する。第４図は１個の注目点
に関する水平特徴を計算する過程をフローチヤー
トで示したものでありCM(i)は後述する変化点メ
モリの内容を表わし（ｉ＝１、……、Ｉ，Ｉは当
該注目点を含む１回目の水平走査において検出し
て変化点メモリに登録した変化点の個数）、ｘは
当該注目点のＸ座標でありREGLは左方向レジス
タであり、REGRは右方向レジスタである。 Using the flowcharts in Figures 3 and 4
The horizontal feature extraction operation in the VSP horizontal feature extraction unit will be described next. Figure 4 is a flowchart showing the process of calculating horizontal features regarding one point of interest, and CM(i) represents the contents of the changing point memory (to be described later) (i = 1, ..., I, I is the number of change points detected and registered in the change point memory in the first horizontal scan including the relevant point), x is the X coordinate of the relevant point, REGL is the left direction register, and REGR is the right direction register. It is.

垂直サブパターンメモリ７のＹ軸上の点から水
平走査を開始し、変化点検出回路３３を使用して
白点から黒点への変化を検出すると白点から黒点
へ変化した時の黒点（白黒変化点と称する）のＸ
座標を変化点メモリ３３に登録しながら、また、
変化点カウンタ３４を歩進しながら垂直サブパタ
ーンメモリ７の右端まで走査を行う。 When horizontal scanning is started from a point on the Y axis of the vertical sub-pattern memory 7 and a change from a white point to a black point is detected using the change point detection circuit 33, the black point (black and white change) when the white point changes to a black point is detected. (referred to as a point)
While registering the coordinates in the changing point memory 33,
Scanning is performed to the right end of the vertical sub-pattern memory 7 while incrementing the change point counter 34.

次に前記水平走査を開始した点から再度水平走
査を開始し、走査線上のすべての点について第４
図のフローチヤートに示す処理を行う。すなわ
ち、先ず変化的カウンタ３４に前記水平走査で検
出された変化点の個数（）を設定し（ステツプ
401）、16ビツトの左方向レジスタ（REGL）３８
及び右方向レジスタ（REGR）３９を初期化（ス
テツプ402）した後、当該注目点のＸ座標（第４
図のフローチヤートにｘで示す）と前記変化点メ
モリに登録された内容（第４図のフローチヤート
にCM(i)で示す）を使用して前記変化点メモリの
内容である変化点のＸ座標と当該注目点のＸ座標
（垂直サブパターンメモリを走査する時に使用す
るパターンメモリアドレスカウンタ３５より出力
されるＸ座標）との大小関係を比較器３６により
判定し（ステツプ404）、当該注目点のＸ座標が変
化点メモリ３３の内容より大きい場合、すなわち
変化点メモリ３３に格納されている内容が当該注
目点より左側にある文字線の位置を示す場合は当
該注目点のＸ座標と変化点メモリ３３の内容との
差のＫ乗（Ｋは定数、本実施例ではＫ＝２）すな
わち注目点と文字線との距離のＫ乗の値を16ビツ
トの左方向レジスタ（REGL）３８に演算器３７
を使用して加算し（ステツプ405）、また当該注目
点のＸ座標が変化点メモリの内容より小さい場合
は当該注目点のＸ座標と変化点メモリの内容との
差のＫ乗すなわち距離のＫ乗の値を16ビツトの右
方向レジスタ（REGR）３９に加算するという処
理（ステツプ406）を、ステツプ403，407に示す
ように、変化点メモリ３３に格納されている変化
点の個数分、変化点メモリの番地指定を行なう変
化点カウンタ３４を歩進しながら行なうことによ
り抽出し最終的に残された左方向レジスタ３８と
右方向レジスタ３９の内容を当該注目点に関する
左方向水平特徴及び右方向水平特徴として、当該
注目点の垂直サブパターンメモリ７内の位置に対
応するVSP水平特徴パターンメモリ９内の位置
に格納する。第７図Ａ，ＢはVSP水平特徴の説
明図であり、図中黒丸は注目点、白丸は白黒変化
点を示している。第７図Ａでは着目点の左側の距
離１０離れた位置に文字線V₁が存在し、右側の
距離５離れた位置に文字線V₂が存在する場合に
ついて、前述の左方向水平特徴の値（100）及び
右方向水平特徴の値（25）を示したものであり、
第７図Ｂは、着目点の左側に文字線V₃、右側に
文字線V₄，V₅が存在する場合について前記同様
に示したものである。 Next, start the horizontal scan again from the point where the horizontal scan started, and perform the fourth scan on all the points on the scanning line.
The process shown in the flowchart in the figure is performed. That is, first, the number of change points () detected in the horizontal scanning is set in the change counter 34 (step
401), 16-bit left register (REGL) 38
After initializing the right direction register (REGR) 39 (step 402), the X coordinate (fourth
X of the changing point which is the contents of the changing point memory using the contents registered in the changing point memory (indicated by CM(i) in the flowchart of FIG. 4) The comparator 36 determines the magnitude relationship between the coordinates and the X coordinate of the point of interest (the X coordinate output from the pattern memory address counter 35 used when scanning the vertical sub-pattern memory) (step 404), and the point of interest is determined by the comparator 36 (step 404). If the X coordinate of the point of interest is larger than the contents of the change point memory 33, that is, if the contents stored in the change point memory 33 indicate the position of the character line to the left of the point of interest, the X coordinate of the point of interest and the point of change The value of the difference with the content of the memory 33 raised to the K power (K is a constant; in this embodiment, K=2), that is, the distance between the point of interest and the character line raised to the K power, is calculated in the 16-bit left direction register (REGL) 38. Vessel 37
(step 405), and if the X coordinate of the point of interest is smaller than the contents of the change point memory, the difference between the X coordinate of the point of interest and the contents of the change point memory to the K power, that is, the distance K The process of adding the multiplication value to the 16-bit right direction register (REGR) 39 (step 406) is performed as shown in steps 403 and 407. The contents of the left direction register 38 and right direction register 39 extracted by incrementing the change point counter 34 that specifies the address of the point memory are used as the left direction horizontal feature and the right direction regarding the point of interest. As a horizontal feature, it is stored at a position in the VSP horizontal feature pattern memory 9 that corresponds to the position in the vertical sub-pattern memory 7 of the point of interest. FIGS. 7A and 7B are explanatory diagrams of VSP horizontal features, in which black circles indicate points of interest and white circles indicate points of black and white change. In FIG. 7A, the value of the left horizontal feature described above is for the case where a character line V ₁ exists at a distance of 10 to the left of the point of interest, and a character line V ₂ exists at a distance of 5 to the right of the point of interest. (100) and the value of the rightward horizontal feature (25),
FIG. 7B shows the case where the character line V ₃ exists on the left side of the point of interest, and the character lines V ₄ and V ₅ exist on the right side.

ここで右方向水平特徴について着目すると、文
字線の増加に伴つて特徴値が25から125に増加し
ているのがわかる。 If we pay attention to the rightward horizontal feature, we can see that the feature value increases from 25 to 125 as the number of character lines increases.

この様にしてVSP水平特徴抽出部８は垂直サ
ブパターンメモリについてＹ軸上の各点から水平
走査を２回行なつて走査線上のすべての点につい
て水平特徴を抽出しVSP水平特徴パターンメモ
リ９に格納する。このVSP水平特徴パターンメ
モリ９及び他の各特徴パターンメモリ１３，１
７，２１の構成は第５図に示すようにＸ方向の幅
が128ビツト、Ｙ方向の幅が128ビツトのパターン
メモリであり、また各特徴パターンメモリのこの
Ｘ座標及びＹ座標で指定される各点は特徴（例え
ば左方向水平特徴＋右方向水平特徴）をあらわす
もので32ビツトの幅を持つ。第６図はこの32ビツ
トであらわされる各点の特徴を示したもので、例
えば垂直サブパターンメモリ内の各点について抽
出する32ビツトの水平特徴をあらわし、上位16ビ
ツトは左方向水平特徴、下位16ビツトは右方向水
平特徴をあらわす。またHSP垂直特徴抽出部１
２、RSP左斜め特徴抽出部１６、LSP右斜め特
徴抽出部２０もVSP水平特徴抽出部８と同様の
動作を行う。 In this way, the VSP horizontal feature extraction unit 8 performs horizontal scanning twice from each point on the Y axis on the vertical sub-pattern memory, extracts horizontal features for all points on the scanning line, and stores the horizontal features in the VSP horizontal feature pattern memory 9. Store. This VSP horizontal feature pattern memory 9 and each other feature pattern memory 13, 1
As shown in Fig. 5, the structure of 7 and 21 is a pattern memory with a width of 128 bits in the X direction and a width of 128 bits in the Y direction, and is specified by the X and Y coordinates of each feature pattern memory. Each point represents a feature (for example, left horizontal feature + right horizontal feature) and has a width of 32 bits. Figure 6 shows the characteristics of each point represented by these 32 bits. For example, it represents the 32-bit horizontal feature extracted for each point in the vertical subpattern memory, the upper 16 bits are the left horizontal feature, and the lower 16 bits represent the rightward horizontal feature. Also, HSP vertical feature extraction unit 1
2. The RSP left diagonal feature extraction section 16 and the LSP right diagonal feature extraction section 20 also perform the same operation as the VSP horizontal feature extraction section 8.

以下概略を説明する。HSP垂直特徴抽出部１
２は水平サブパターン抽出部１０において抽出さ
れた水平サブパターン（HSP）が格納されてい
る水平サブパターンメモリ１１についてＸ軸上の
各点から垂直走査を２回行なう過程において走査
線上の各点についての、当該注目点より上方に存
在する文字線と当該注目点との距離のＫ乗和を上
方向垂直特徴として、又、当該注目点より下方に
存在する文字線と当該注目点との距離のＫ乗和を
下方向垂直特徴として抽出しVSP水平特徴パタ
ーンメモリ９と同一構造のHSP垂直特徴パター
ンメモリ１３に格納する。RSP左斜め特徴抽出
部１６は右斜めサブパターン抽出部１４において
抽出された右斜めサブパターン（RSP）が格納
されている右斜めサブパターンメモリ１５につい
て、Ｙ軸上の各点及び右斜めサブパターンメモリ
１５の上辺上の各点から左斜め45°走査を２回行
なう過程において走査線上の各点についての、当
該注目点より左45°上方に存在する文字線と当該
注目点との距離のＫ乗和を上方向左斜め特徴とし
て、又当該注目点より左45°下方に存在する文字
線と当該注目点との距離のＫ乗和を下方向左斜め
特徴として抽出しVSP水平特徴パターンメモリ
９と同一構造のRSP左斜め特徴パターンメモリ
１７に格納する。LSP右斜め特徴抽出部２０は左
斜めサブパターン抽出部１８において抽出された
左斜めサブパターン（LSP）が格納されている左
斜めサブパターンメモリ１９について、Ｙ軸上の
各点及びＸ軸上の各点から右斜め45°走査を２回
行なう過程において走査線上の各点についての、
当該注目点より右45°上方に存在する文字線と当
該注目点との距離のＫ乗和を上方向右斜め特徴と
して、当該注目点より右45°下方に存在する文字
線と当該注目点との距離のＫ乗和を下方向斜め特
徴として抽出しVSP水平特徴パターンメモリ９
と同一構造のLSP右斜め特徴パターンメモリ２１
に格納する。 The outline will be explained below. HSP vertical feature extraction part 1
2 is a horizontal sub-pattern memory 11 in which the horizontal sub-pattern (HSP) extracted by the horizontal sub-pattern extraction unit 10 is stored, and in the process of performing vertical scanning twice from each point on the X axis, each point on the scanning line is The K sum of the distances between the character line that exists above the point of interest and the point of interest is the upward vertical feature, and the distance between the character line that exists below the point of interest and the point of interest. The sum of K powers is extracted as a downward vertical feature and stored in the HSP vertical feature pattern memory 13 having the same structure as the VSP horizontal feature pattern memory 9. The RSP left diagonal feature extraction unit 16 extracts each point on the Y axis and the right diagonal subpattern with respect to the right diagonal subpattern memory 15 in which the right diagonal subpattern (RSP) extracted by the right diagonal subpattern extraction unit 14 is stored. In the process of performing two 45° left diagonal scans from each point on the upper side of the memory 15, the distance K between the character line existing 45° to the left above the point of interest and the point of interest for each point on the scanning line. The sum of the powers is extracted as an upward left diagonal feature, and the K sum of the distances between the character line existing 45 degrees below the left of the point of interest and the point of interest is extracted as a downward left diagonal feature, and the VSP horizontal feature pattern memory 9 It is stored in the RSP left diagonal feature pattern memory 17 having the same structure as . The LSP right diagonal feature extraction unit 20 extracts the left diagonal subpattern (LSP) extracted by the left diagonal subpattern extraction unit 18 from each point on the Y axis and the X axis. In the process of scanning 45 degrees diagonally to the right from each point twice, each point on the scanning line is
The K sum of the distances between the character line that is 45 degrees above the right of the point of interest and the point of interest is the upward right diagonal feature, and the character line that is 45 degrees below the point of interest to the right and the point of interest are The K-th power sum of distances is extracted as a downward diagonal feature and VSP horizontal feature pattern memory 9
LSP right diagonal feature pattern memory 21 with the same structure as
Store in.

なお本実施例ではＫ＝２としたが、仮名文字等
のように文字線の数（文字線量とも称する）の比
較的少ない文字においてはＫ＝１としても良好な
結果が得られる。 Although K=2 is used in this embodiment, good results can also be obtained with K=1 for characters with a relatively small number of character lines (also referred to as character dose), such as kana characters.

分割点決定部５は文字枠検出部４より受けた文
字枠の位置に関する座標を参照し、文字枠内領域
をＭ×Ｎ個の部分領域に分割する為の分割点座標
を計算し、VSP水平特徴ベクトル抽出部２２、
HSP垂直特徴ベクトル抽出部２３、RSP左斜め
特徴ベクトル抽出部２４、LSP右斜め特徴ベクト
ル抽出部２５、へ前記計算した分割点座標を送出
する。VSP水平特徴ベクトル抽出部２２は文字
枠検出部４より受けた文字枠の位置座標及び分割
決定部５より受けた分割点座標を使用し、VSP
水平特徴パターンメモリ内の文字枠内領域をＭ×
Ｎ個の部分領域に分解し、各分割領域内の各点の
右方向水平特徴及び左方向水平特徴それぞれの相
加平均を要素とするＭ×Ｎ×２次元の水平特徴ベ
クトルを抽出する。前記Ｍ×Ｎ×２次元の水平特
徴ベクトルの要素である各部分領域の、右方向水
平特徴の相加平均及び左方向水平特徴の相加平均
をそれぞれHl_n,oHR_n,o（ｍ＝１，……，Ｍ、ｎ＝
１，……，Ｎ）と表わし、当該領域内の各点の右
方向水平特徴及び左方向水平特徴をそれぞれl_x,y，
r_x,yと表わし、当該領域内の総点数をT_n,oと表わ
すと、前記HL_n,o及びHR_n,oは(1)式及び(2)式で定
義される。 The dividing point determining unit 5 refers to the coordinates regarding the position of the character frame received from the character frame detecting unit 4, calculates the dividing point coordinates for dividing the area within the character frame into M×N partial areas, and calculates the dividing point coordinates for dividing the area within the character frame into M×N partial areas. feature vector extraction unit 22,
The calculated division point coordinates are sent to the HSP vertical feature vector extraction section 23, the RSP left diagonal feature vector extraction section 24, and the LSP right diagonal feature vector extraction section 25. The VSP horizontal feature vector extraction unit 22 uses the character frame position coordinates received from the character frame detection unit 4 and the division point coordinates received from the division determination unit 5, and uses the VSP
The area within the character frame in the horizontal feature pattern memory is M×
It is divided into N partial regions, and an M×N×2-dimensional horizontal feature vector whose elements are the arithmetic averages of the rightward horizontal features and leftward horizontal features of each point in each divided region is extracted. Hl _n,o HR _n,o (m= 1,...,M,n=
1,...,N), and the rightward horizontal feature and leftward horizontal feature of each point in the region are respectively expressed as l _x,y ,
When expressed as r _x,y and the total number of points in the area as T _n,o , the HL _n,o and HR _n,o are defined by equations (1) and (2).

HL_n,o＝ΣΣl_x,y／T_n,o (1) HR_n,o＝ΣΣr_x,y／T_n,o (2) HSP垂直特徴ベクトル抽出部２３、RSP左斜
め特徴ベクトル抽出部２４、LSP右斜め特徴ベク
トル抽出部２５はそれぞれHSP垂直特徴パター
ンメモリ１３、RSP左斜め特徴パターンメモリ
１７LSP右斜め特徴パターンメモリ２１をそれぞ
れ参照し、前記VSP水平特徴ベクトル抽出部２
２と同様に各特徴パターンメモリ内の文字枠内領
域をＭ×Ｎ個の部分領域に分割してそれぞれ各部
分領域内の各点の両方向の垂直特徴又は左斜め特
徴又は右斜め特徴のそれぞれの相加平均の算出を
VSP水平特徴ベクトル抽出部３２と同様に行な
いそれぞれの特徴ベクトル抽出部においてＭ×Ｎ
×２次元の特徴ベクトルを抽出する。 HL _n,o =ΣΣl _x,y /T _n,o (1) HR _n,o =ΣΣr _x,y /T _n,o (2) HSP vertical feature vector extraction unit 23, RSP left diagonal feature vector extraction unit 24 , the LSP right diagonal feature vector extraction unit 25 refers to the HSP vertical feature pattern memory 13, the RSP left diagonal feature pattern memory 17, and the LSP right diagonal feature pattern memory 21, respectively, and extracts the VSP horizontal feature vector extraction unit 2.
Similarly to 2, the area within the character frame in each feature pattern memory is divided into M×N partial areas, and each of the vertical features in both directions, the left diagonal feature, or the right diagonal feature of each point in each partial area is divided. Calculating the arithmetic mean
In the same manner as the VSP horizontal feature vector extraction unit 32, M×N
×Extract a two-dimensional feature vector.

識別部２６は前記４種の特徴ベクトル抽出部に
おいて抽出されたＭ×Ｎ×２×４次元の特徴ベク
トルと、辞書２７に格納されているあらかじめ用
意された前記特徴ベクトルと同一形式で記述され
た複数の標準文字パターンとの間の距離すなわち
２本のベクトル間のユークリツド距離を計算し、
その距離が最小の値を与える標準文字パターンの
文字名を文字名出力２８に出力する。 The identification unit 26 identifies the M×N×2×4-dimensional feature vectors extracted by the four types of feature vector extraction units and the feature vectors that are written in the same format as the feature vectors prepared in advance and stored in the dictionary 27. Calculate the distance between multiple standard character patterns, that is, the Euclidean distance between two vectors,
The character name of the standard character pattern whose distance gives the minimum value is output to the character name output 28.

以上詳細に説明したように、本発明の特徴抽出
方法において抽出する特徴は各点から見た各方向
における文字線の分布、つまり、各点から見て垂
直な線分が右側及び左側のどの程度の距離に存在
し、水平な線分が上側及び下側のどの程度の距離
に存在し、左斜め及び右斜めの線分が右上、左下
及び左上、右下の方向のどの程度の距離に存在す
るかという分布、を示すもので前記距離のＫ乗和
により前記各方向の文字線量をも反映させている
ので、文字の幾何学的な構造を充分に反映してい
るものである。又、各方向における文字線の検出
過程でサブパターンを使用して当該方向と直角を
なす文字線にのみ着目しているので手書文字にお
ける筆記者の違いによる文字線の傾斜の影響等を
無視して安定な特徴抽出が可能である。又各点に
ついての特徴抽出を単純な走査という処理のみに
より行なつているので装置の小型化を図ることが
できる利点がある。 As explained in detail above, the features to be extracted in the feature extraction method of the present invention are the distribution of character lines in each direction viewed from each point, that is, the extent to which the perpendicular line segment is to the right and left when viewed from each point. , and at what distance do horizontal line segments exist above and below, and at what distance do diagonal left and right line segments exist in the upper right, lower left, upper left, and lower right directions? Since the character dose in each direction is also reflected by the K-th power sum of the distances, it sufficiently reflects the geometric structure of the characters. In addition, in the process of detecting character lines in each direction, sub-patterns are used to focus only on character lines that are perpendicular to the relevant direction, so the influence of the slope of character lines due to differences in handwritten characters is ignored. stable feature extraction is possible. Furthermore, since the feature extraction for each point is performed only by a simple scanning process, there is an advantage that the apparatus can be made smaller.

（発明の効果）本発明は文字パターンの各点から見た各方向の
文字線の分布を示す特徴として、サブパターンを
使用して注目点と文字線との距離のＫ乗和を抽出
しているので文字線の距離の分布のみならず文字
線量をも反映した特徴となつており、文字全体の
幾何学的な構造を充分に反映した安定な特徴抽出
が可能であり、また前記特徴抽出を単純な走査と
いう処理により実現しているので、高速で精度の
良い文字認識装置に利用することができる。(Effects of the Invention) The present invention uses subpatterns to extract the sum of K powers of the distances between a point of interest and a character line as a feature indicating the distribution of character lines in each direction as seen from each point of a character pattern. Therefore, it is a feature that reflects not only the distance distribution of character lines but also the character dose, and it is possible to perform stable feature extraction that fully reflects the geometric structure of the entire character. Since it is realized by a simple scanning process, it can be used in a high-speed and highly accurate character recognition device.

[Brief explanation of drawings]

第１図は本発明の文字認識装置における実施例
の構成図、第２図は原パターンとサブパターンの
例を示した図、第３図は各特徴抽出部の構成図、
第４図は各点についての水平特徴計算のフローチ
ヤート、第５図は各特徴パターンメモリの構成
図、第６図は各特徴パターンメモリにおける各点
の特徴構成図、第７図はVSP水平特徴の説明図
である。１……光電変換部、２……パターンレジスタ、
３……線幅計算部、４……文字枠検出部、５……
分割点決定部、６……垂直サブパターン抽出部、
７……垂直サブパターンメモリ、８……VSP水
平特徴抽出部、９……VSP水平特徴パターンメ
モリ、１０……水平サブパターン抽出部、１１…
…水平サブパターンメモリ、１２……HSP垂直
特徴抽出部、１３……HSP垂直特徴パターンメ
モリ、１４……右斜めサブパターン抽出部、１５
……右斜めサブパターンメモリ、１６……RSP
左斜め特徴抽出部、１７……RSP左斜め特徴パ
ターンメモリ、１８……左斜めサブパターン抽出
部、１９……左斜めサブパターンメモリ、２０…
…LSP右斜め特徴抽出部、２１……LSP右斜め特
徴パターンメモリ、２２……VSP水平特徴ベク
トル抽出部、２３……HSP垂直特徴ベクトル抽
出部、２４……RSP左斜め特徴ベクトル抽出部、
２５……LSP右斜め特徴ベクトル抽出部、２６…
…識別部、２７……辞書、２８……文字名出力、
３１……制御回路、３２……変化点検出回路、３
３……変化点メモリ、３４…変化点カウンタ、３
５……パターンメモリアドレスカウンタ、３６…
…比較器、３７…演算回路、３８……レジスタ
Ａ，３９……レジスタＢ。 FIG. 1 is a block diagram of an embodiment of the character recognition device of the present invention, FIG. 2 is a diagram showing an example of an original pattern and a sub-pattern, and FIG. 3 is a block diagram of each feature extraction unit.
Figure 4 is a flowchart of horizontal feature calculation for each point, Figure 5 is a configuration diagram of each feature pattern memory, Figure 6 is a feature configuration diagram of each point in each feature pattern memory, and Figure 7 is a VSP horizontal feature. FIG. 1...Photoelectric conversion unit, 2...Pattern register,
3...Line width calculation unit, 4...Character frame detection unit, 5...
Division point determination section, 6... Vertical sub-pattern extraction section,
7...Vertical sub-pattern memory, 8...VSP horizontal feature extraction unit, 9...VSP horizontal feature pattern memory, 10...Horizontal sub-pattern extraction unit, 11...
...Horizontal sub-pattern memory, 12...HSP vertical feature extraction section, 13...HSP vertical feature pattern memory, 14...Right diagonal sub-pattern extraction section, 15
...Right diagonal sub-pattern memory, 16...RSP
Left diagonal feature extraction unit, 17...RSP left diagonal feature pattern memory, 18...Left diagonal sub pattern extraction unit, 19...Left diagonal sub pattern memory, 20...
...LSP right diagonal feature extraction unit, 21...LSP right diagonal feature pattern memory, 22...VSP horizontal feature vector extraction unit, 23...HSP vertical feature vector extraction unit, 24...RSP left diagonal feature vector extraction unit,
25... LSP right diagonal feature vector extraction unit, 26...
...Identification unit, 27...Dictionary, 28...Character name output,
31...Control circuit, 32...Change point detection circuit, 3
3... Change point memory, 34... Change point counter, 3
5...Pattern memory address counter, 36...
...Comparator, 37...Arithmetic circuit, 38...Register A, 39...Register B.

Claims

[Claims] 1. Scanning a character figure pattern in a predetermined direction, detecting a cross section of a character line in the scanning direction, and extracting a cross section whose cross section length is sufficiently longer than the character line width of the character figure pattern. Extract sub-patterns in multiple directions by scan in the direction, extract the sum of K powers (K is a constant) of the distances between all character lines on the scanning line on one side from the origin and the origin as the first feature, and The sum of K powers (K is a constant) of the distances between all the character lines on the scanning line on the side and the origin is extracted as a second feature, and the first feature obtained for all the sub-patterns and the A feature extraction method characterized in that the second feature is a character/figure pattern feature.