JPS59231682A

JPS59231682A - Character discriminating device

Info

Publication number: JPS59231682A
Application number: JP58105490A
Authority: JP
Inventors: Jiichirou Takahashi; 高橋時市郎; Seiichiro Naito; 増田功; Isao Masuda; 内藤誠一郎
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1983-06-13
Filing date: 1983-06-13
Publication date: 1984-12-26

Abstract

PURPOSE:To discriminate a candidate character category with high precision by extracting a difference between categories on a standard pattern as to the category of a candidate for an input character pattern, and detecting whether the difference is in a stroke extracted from the input character pattern. CONSTITUTION:The character pattern from an input part 1 is passed through a preprocessing circuit 2, and a rough classifying device 3 narrows down a candidate character category and passes to a structure difference extracting circuit 5, which extracts the differences from the standard patterns of two optional categories read out of a standard pattern storage circuit 4. A stroke extracting circuit 6 receives the character pattern from the circuit 2 and the structure difference extracted by the circuit 5 to calculate similarity on the basis of feature point information, and sends the value, etc., to a structure deciding circuit 7 when the similarity is larger than a threshold value. The circuit 7 decides whether the structure difference pattern received from the circuit 5 is present in a character line extracted by the circuit 6 or not. Thus, the structural difference between the candidate character categories is detected to improve character discriminating performance greatly.

Description

【発明の詳細な説明】（１）　　発明の属する分野の説明文字読取装置、特に手書き漢字を読取り対象とする文字
読取装置で杜、処理を「大分類→識別」。[Detailed Description of the Invention] (1) Description of the field to which the invention pertains A character reading device, particularly a character reading device that reads handwritten kanji, processes “major classification → identification”.

らるーＩｉ［大分類→細分類→識別」というように多段
階に分けることが常套手段である。大分類あるいは細分
類の段階で祉容易に抽出で鳶る特徴を抽出し、簡便なバ
タン整合によって候補文字カテゴリを絞る。その後、限
定された候補文字カテゴリの中から構造解析により入力
文字）（タンの属するカテゴリを決定し、識別するとい
う構成をとる仁とが多い。本発明状類似した候補文字カ
テゴリを精度良く識別できる文字識別装置に関するもの
である。It is a common practice to divide into multiple stages such as Raru-Ii [Major classification → Subclassification → Identification]. At the stage of major classification or detailed classification, features that can be extracted easily are extracted, and candidate character categories are narrowed down by simple click matching. After that, the category to which the input character (Tan) belongs is determined and identified by structural analysis from among the limited candidate character categories.This invention is capable of identifying similar candidate character categories with high accuracy. The present invention relates to a character recognition device.

（２）　　従来の技術の説明従来、類似候補文字カテゴリを判別するのに。(2) Description of conventional technology Conventionally, to determine similar candidate character categories.

以下の如き方法によって−た。類似した候補文字カテゴ
リの組を予めシミュレーション実験等により求める。こ
の場合、該類似した候補文字カテゴリ同士を判別するた
めの特徴が存在するかどうかを検出するための処理を２
個々の場合に応じて予め定めておく必要がある。仁のよ
うに該方法を実施するためには、多大な労力を要すると
いう問題点がある。一方９手書き漢字を読取対象とした
場合２手書き変形が大きいため、入力文字バーンの候補
文字カテゴリが前記予め用意された類似候補カテゴリの
組の中に存在しないような場合が生じる。かかる場合、
予め用意されていない候補文字カテゴリについては識別
処理が実行されず、誤認識の原因となることが多く、該
方法の弱点となっていた。It was done by the following method. A set of similar candidate character categories is determined in advance through a simulation experiment or the like. In this case, 2 processes are performed to detect whether or not there are features for distinguishing between the similar candidate character categories.
It is necessary to determine it in advance depending on each individual case. There is a problem in that a great deal of effort is required to implement this method as described above. On the other hand, when 9 handwritten kanji characters are to be read, the 2 handwriting deformation is large, so there may be a case where the candidate character category of the input character burn does not exist in the set of similar candidate categories prepared in advance. In such case,
Identification processing is not performed for candidate character categories that are not prepared in advance, which often causes misrecognition, which has been a weakness of this method.

（３）　　発明の目的本発明は上述した従前方法の問題点に鑑みてなされたも
のであり、その目的とするところは、簡便にしてかつい
かなる候補文字カテゴリの組合せにも対処できる文字識
別装置を提供することにある。以下、実施例について詳
細に説明する。(3) Purpose of the Invention The present invention has been made in view of the problems of the conventional methods described above, and its purpose is to provide a character identification device that is simple and can deal with any combination of candidate character categories. It is about providing. Examples will be described in detail below.

（４）　　発明の構成および作用の説明第１図は本発明
の文字識別装置の一実施例の構成ブロック図である。図
中１は入力文字バタンを受ける入力部である。２は公知
の前処理回路であって、入力部１から入力された入力文
字パタ／に。(4) Description of structure and operation of the invention FIG. 1 is a block diagram of the structure of an embodiment of the character identification device of the invention. In the figure, numeral 1 is an input section that receives an input character button. Reference numeral 2 denotes a known preprocessing circuit, which processes input character patterns inputted from the input section 1.

２値化９位置及び大きさの正規化、平滑化等の前処理を
施す。該前処理を施した入力文字パタンを次段装置３，
６へ渡す。３は大分類装置であって。Binarization 9 Perform preprocessing such as position and size normalization and smoothing. The preprocessed input character pattern is sent to the next device 3,
Pass it to 6. 3 is a major classification device.

例えば線密度特徴等、公知の大分類手法を用いて入力文
字パタンの候補文字カテゴリ数を数１０個程度にまで絞
る。該候補文字カテゴリを次段回路５へ渡す。For example, the number of candidate character categories of the input character pattern is narrowed down to about several dozen using a known major classification method such as line density feature. The candidate character category is passed to the next stage circuit 5.

４は標準バタン記憶回路である。標準バタンは文字バタ
ンを構成するストロークを適尚に分割して得られるスト
ロークの開始点、屈折点、交点。4 is a standard button memory circuit. Standard strokes are the starting points, bending points, and intersection points of the strokes obtained by appropriately dividing the strokes that make up the character strokes.

及び終点の位置座標の系列で与える。上記開始点。and a series of position coordinates of the end point. Starting point above.

屈折点、交点、及び終点を以下暗点と呼ぶ。第２図はカ
テゴリωの標準パタンの記４＆形式の説明図である。カ
テゴリωＦｉ、Ｍ木のストロークから成り。The refraction point, intersection point, and end point are hereinafter referred to as scotoma. FIG. 2 is an explanatory diagram of the notation 4 & format of the standard pattern of category ω. Category ωFi, consisting of M-tree strokes.

各ス）０−りはそれぞれＮ１　ｅ・・・、Ｎ、、・・・
、Ｎｏ　個の筆点から成ることを示している。第ｍ番目
のスト胃−りの第ｎ番目の絡点ｙ□紘、その位置座標（
Ｘ□、ｙ□）と、該筆点り、が開始点、Ｍ折点、交点、
終点のうちのいずれであるかを表わす幾何学　□的属性
ｇ□、及び前記特徴点の場合と同様に該筆点Ｙｍｍと該
筆点１１イ÷１とを結ぶ直線（以下標準セグメン）？＋
７□と呼ぶ）の上下左右にある文字線数それぞれφ□、
φ□、φ□、φ−１組で表わす◇第１図の５は構造差抽
出回路でろって、前記大分類により得られた候補文字カ
テゴリのうち、任意の２つのカテゴリの標準パタンを記
憶回路４から読出し、該読出した二つの標準パタンの間
の差異を抽出して９次段６，７へ渡すのが主要な機能で
あって９次の手段により実現する。Each step) 0-ri is N1 e..., N,,...
, No number of writing points. The n-th node y□Hiro of the m-th striker, its position coordinates (
X□, y□) and the writing point are the starting point, M-fold point, intersection point,
A geometrical attribute g□ indicating which of the end points it is, and a straight line connecting the writing point Ymm and the writing point 11i÷1 (hereinafter referred to as standard segment), as in the case of the feature point? +
The number of character lines on the top, bottom, left and right of 7□) is φ□,
Represented by φ□, φ□, φ-1 set ◇ 5 in Figure 1 is a structural difference extraction circuit, which stores standard patterns of any two categories among the candidate character categories obtained by the above-mentioned major classification. The main function is to read out the standard patterns from the circuit 4, extract the difference between the two read standard patterns, and pass it to the ninth stage 6 and 7, and this is realized by the ninth stage means.

前記記憶回路４から読出した２つの標準パタンのうち、
標準パタンを構成する標準セグメントの本数の多い方を
旧、他方をＲ′とする。この時、ｒＲの標準セグメント
の１木９例えばｌｌ７１，１ｗＩと、　ＩＲ’のすべて
の標準セグメン）　（ｌ１７ｍイ）とについて９次式に
より標準セグメント間距離ｄを求める。ここでＪｍｒ＋
＝Ｙｍｎ＋ｓ　Ｖｍｎ　＊　ｔｑｒｒ：ｎ’ツＹｍ’ｎ
’＋　ｓ　　Ｖｍｊであるから。Of the two standard patterns read out from the memory circuit 4,
The standard pattern with the larger number of standard segments is designated as old, and the other is designated as R'. At this time, the standard inter-segment distance d is calculated using the 9th equation for the standard segment tree 9 of rR, for example, 1171, 1wI, and all standard segments of IR' (117m a). Jmr+ here
=Ymn+s Vmn * tqrr:n'tsuYm'n
'+s Vmj.

ｄ　　（Ｖｍｎ　　＊　　岬ｍｔｉ’）　　−１１）ｍ
ｍ’　　　Ｙｍａ　ｌｌ　＋　ｌｌ　Ｙｍ’ｎ’＋　ｔ
−１１’ｍｎ＋Ｉ　　１１＋α・（１１７ｍｎ　、　’
ｌｌＦｍ’ｎ’）　　　　　　・・・曲・・・・・・・
・内用　（１１ここで１１１１１はベクトルｉのノルム
、　（Ｖ、に’）　ｔｉベクトル賃とｌ′との内積、α
はある定数である。θｍｓが所定閾値である時。d (Vmn * Cape mti') -11) m
m' Yma ll + ll Ym'n'+ t
-11'mn+I 11+α・(117mn, '
llFm'n') ...Song...
・Inner use (11Here, 11111 is the norm of vector i, (V, to') Inner product of ti vector rent and l', α
is a constant. When θms is a predetermined threshold.

選択する。該Ｆｑｌｖｌｎ　＊　ｈｉｍ−をＣ−セグメ
ントとして次段回路６，７へ渡す。式（２）を満たすｗ
：ａ”が財′に存在しない時、ダ□、に対応する取′の
標準セグメントは存在しないと判定する。該幻ｆｆｉわ
が二つの標準パタンｔｎ、ｔｕ’の構造差となる。該Ｊ
１１ｍｍを標準パタンＩＲのＰ−セグメント、またｔＲ
’のＮ−セグメントであるとし９次段回路６．７へ渡す
。ＩＲを構成するすべての標準セグメントについて、上
記処理を行な、つた後、：Ｒ′の標準セグメントのうち
、［Ｒの標準セグメントと対応していないものがあれば
、該標準セグメントを匹′のＰ−セグメント、＠のＮ　
−セグメントとして次段回路６，７へ渡す。前記Ｐ−セ
グメント、Ｃ−セグメント、Ｎ−セグメントから成るバ
タンを構造差バタンと呼ぶ。select. The Fqlvln*him- is passed to the next stage circuits 6 and 7 as a C-segment. w that satisfies formula (2)
:a'' does not exist in goods', it is determined that there is no standard segment of take' corresponding to da
11mm as standard pattern IR P-segment, also tR
' is assumed to be the N-segment of ', and is passed to the ninth stage circuit 6.7. After performing the above processing on all the standard segments constituting the IR, if there is a standard segment of R' that does not correspond to a standard segment of P-segment, @N
- Pass it to the next stage circuits 6 and 7 as a segment. The batan made up of the P-segment, C-segment, and N-segment is called a structural difference batan.

第３図状標準バタン間の構造差抽出例を説明する説明図
で６って、（ａ）は「天Ｊ　ｔ　（ｂｌは「夫」の標準
パタンである。図中１１．１２．・・・、１Ｂ、２１，
２２．・・・。In the explanatory diagram illustrating an example of structural difference extraction between standard patterns in the third figure, 6 (a) is the standard pattern of "Ten J t (bl is the standard pattern of "husband"). 11.12...・,1B,21,
22. ....

２９は鎖点２缶点間を結ぶ実線の矢印拡標準セグメント
である。標準バタン「天」、「夫」を構成する標準セグ
メント数は各々７木、８木である。従って、上述の手段
で、ＩＲ−ｒ夫Ｊ　、　ｆｆｔ’＝　「天」として。29 is a solid line arrow enlarged standard segment connecting two chain points. The standard number of segments constituting the standard batan ``ten'' and ``husband'' is 7 trees and 8 trees, respectively. Therefore, by the means described above, as IR-rhuJ, fft'='heaven'.

各々の標準セグメント間の対応を求めると、同図（ｃｌ
の結果が得られる。侭−「夫」の第３ス）ｏ−りの最初
の標準セグメント（同図（ｂｌで差点２７から２２へ向
かうベクトルで記された標準セグメント）に対応する叡
′＝「天」の標準セグメントが存在せず。When determining the correspondence between each standard segment, the same figure (cl
The result is obtained. The standard segment of 叡 = ``天'', which corresponds to the first standard segment of 侭 - ``husband'' (the standard segment marked by the vector from difference point 27 to 22 in the same figure) does not exist.

該標準セグメントが獣のＰ−セグメント、　ｔＲ’のＮ
−セグメントとなる。一方１　（Ｒ’−ｒ天」の標準セ
グメントはすべて９世−「夫」のいずれかの標準セグメ
ントと対応付けられているので、［Ｒ′のＰ−セグメン
ト、ＩＲのＮ−セグメントは存在しない。第４図には、
該構造差抽出回路５により標準バタン「天」、「夫」か
ら得られたＰ−セグメント、Ｃ−セグメント、Ｎ−セグ
メント、すなわち構造差パタンの例を示している。The standard segment is the P-segment of the beast, N of tR'
-becomes a segment. On the other hand, all standard segments of 1 (R'-rten) are associated with standard segments of 9th generation-'husband', so there are no P-segments of [R' and N-segments of IR]. .In Figure 4,
Examples of P-segments, C-segments, and N-segments, that is, structural difference patterns, obtained from the standard batons "Ten" and "Husband" by the structural difference extraction circuit 5 are shown.

第１図に示した６はストローク抽出装置であって、これ
を実現するには９例えば特開昭５７−１８２８７８号公
報、あるいは文献（高橋、増田：１筆点の生起順序を利
用したストローク抽出法による手書き漢字認識の基礎検
討”、電子通信学会論文誌（ＤＪ、ｖｏｌ、Ｊ−６５Ｄ
、ｎｏ、１０．ｐｐ、１２９４−１３０１（１９８２−
１０））等で開示された手段を用いることができる。Reference numeral 6 in FIG. 1 is a stroke extraction device, and in order to realize this, 9 for example, JP-A-57-182878 or literature (Takahashi, Masuda: Stroke extraction using the order of occurrence of one stroke point. ``Basic study of handwritten kanji recognition using methods'', Transactions of the Institute of Electronics and Communication Engineers (DJ, vol. J-65D)
, no, 10. pp, 1294-1301 (1982-
10)) etc. can be used.

第５図は一実施例の構成ブロック図である。図中３３は
公知の特徴抽出回路であって９文字線の幾何学的形状を
表わす特徴点を公知の手法により前記細線化バタンから
抽出するものである。該抽出した特徴点に１から順に通
し番号を付与する。該抽出した特徴点の位置座橢２種類
（端点９分岐点。FIG. 5 is a block diagram of the configuration of one embodiment. In the figure, numeral 33 is a known feature extraction circuit that extracts feature points representing the geometric shape of nine character lines from the thinning button using a known method. Serial numbers are assigned to the extracted feature points in order from 1. There are two types of position control for the extracted feature points (end points and 9 branch points).

交点、孤立点、サンプリング点の別）及び該特徴点と接
続する特徴点に付与されている番号等を。(Intersection points, isolated points, sampling points) and numbers assigned to feature points connected to the feature point.

以下単に特徴点情報と呼ぶ。当該回路３５で得た上記特
徴点情報を次段回路３５へ渡す。Hereinafter, this will be simply referred to as feature point information. The feature point information obtained by the circuit 35 is passed to the next stage circuit 35.

３５は特徴点記憶回路であって、前記特徴点抽出回路３
５で抽出された特徴点情報を記憶する。35 is a feature point storage circuit, which includes the feature point extraction circuit 3.
The feature point information extracted in step 5 is stored.

３７は対応候補点選択回路であって、前記特徴点抽出回
路３５で抽出された特徴点の中から、所定条件を満たす
所定個数以内の特徴点を、前記構造差抽出回路５から読
出した構造差パタンの標準セグメントの各筆点に対応す
る候補として選び出す。該回路３７を実現する手段とし
ては前記特開昭５７−１８２８７８号公報あるいは前記
文献の中に開示した手段がある。該選び出された特徴点
を対応候補点と呼ぶ。該対応候補点を次段回路５８へ渡
す。Reference numeral 37 denotes a corresponding candidate point selection circuit, which selects feature points within a predetermined number that satisfy a predetermined condition from among the feature points extracted by the feature point extraction circuit 35 and selects the structural difference read out from the structural difference extraction circuit 5. It is selected as a candidate corresponding to each writing point in the standard segment of the pattern. Means for realizing the circuit 37 include the means disclosed in the above-mentioned Japanese Patent Laid-Open No. 57-182878 or the above-mentioned documents. The selected feature points are called corresponding candidate points. The corresponding candidate points are passed to the next stage circuit 58.

３Ｂは連結性判定回路であって９次の機能を有する。前
記筆点ｙ□の対応候補点としてＬ個の特徴点Ｗ’　＋　
ｆｔ２＋　”・＋　ｔｆＬが、筆点１ｒｎ、ｌｌに続く
筆点１ｒｙｙ１ｎ＋１の対応候補点としてＬ′個の特徴
点ｔｒ”、叡−・・・、’　Ｌｌが選択されたとする。3B is a connectivity determination circuit having a ninth-order function. L feature points W' + as corresponding candidate points for the writing point y□
Suppose that ft2+ ``·+tfL selects L' feature points tr'', 叡-..., 'Ll as corresponding candidate points for the writing point 1ryy1n+1 following the writing points 1rn, ll.

箔点Ｖユｍ　＊　Ｗ＋ｎｎ＋１　の対応候補点の中から
それぞれ１個ずつ対応候補点１例えばｎ’、１”を選び
出し、枦ｉｐ　＝　（ダ１．館１５なる対応候補点対を
つくる（ｉ＝１．２．・・・、Ｌ：ｊ’−１’、２’、
・・・、ｒ、）。Select one corresponding candidate point 1, for example, n', 1'' from among the corresponding candidate points of the foil point Vyum*W+nn+1, and create a pair of corresponding candidate points such as ip=(da1.kan15(i= 1.2...., L:j'-1', 2',
..., r,).

ｒト点１ｒｌ、Ｉｎ、ｒｆｆｌ□、に対する対応候補点
はそれぞれＬ個、Ｌ′個ずつ選択されているから、Ｌｘ
Ｌ’組の対応候補点対がイｑられる。このＬｘＬ″組の
対応候補点対すべてについて、対応候補点対を成す対応
候補点同士が入力文字バタンの文字線上で連結している
かどうかを、先ず判定する。次に、対応候補点同士が連
結している対応候補点対についてのみ、対応候補点同士
を最短径路で結ぶ文字線を求める。欣求められた最短径
路（文字線）と前記箔点賃。、、γ４、＋１とを結ぶ標
準セグメント初□−１ｒｍａ＋ｉ　　ｌｒｍｎとの間の
類似度を計算する機能が回路５８の主要な機能であって
、以下の方法で実現する。第６図は”該回路５８の構成
ブロック図である。Since L and L' corresponding candidate points are selected for the r point 1rl, In, rffl□, respectively, Lx
L' pairs of corresponding candidate points are qqed. For all the pairs of corresponding candidate points in this LxL'' set, it is first determined whether the corresponding candidate points forming the pair of corresponding candidate points are connected on the character line of the input character button.Next, the corresponding candidate points are connected to each other. Find a character line that connects the corresponding candidate points by the shortest path only for the pair of corresponding candidate points that are the same. The main function of the circuit 58 is to calculate the similarity between the first □-1rma+i lrmn, and is realized by the following method. FIG. 6 is a block diagram of the configuration of the circuit 58.

６１は対応候補点記憶回路であって、各筆点に対する対
応候補点数と、対応候補点として選択された前ｓＬ！特
徴点の番号を記憶している。61 is a corresponding candidate point storage circuit, which stores the number of corresponding candidate points for each writing point and the previous sL selected as a corresponding candidate point! Memorizes feature point numbers.

６２は文字線追跡回路であって、前記特徴点記憶回路６
５に記憶されてｉる特徴点間の接−続関係から、前記回
路６１から読出した対応候補点同士が入力文字パタンの
文字線上で連結しているかどうか判定し、連結している
ならば対応候補点同士を結ぶ最短径路を求める。これに
は、グラフ理論における最短径路問題として公知の技術
で実現できる。対応候補点同士が連結している対応候補
点対のみ、該対応候補点同士を結ぶ最短径路の長さを求
め９次段回路６３へ渡す。62 is a character line tracing circuit, and the feature point storage circuit 6
Based on the connection relationship between i feature points stored in 5, it is determined whether the corresponding candidate points read from the circuit 61 are connected on the character line of the input character pattern, and if they are connected, it is determined that they correspond. Find the shortest path connecting candidate points. This can be realized using a technique known as the shortest path problem in graph theory. Only for pairs of corresponding candidate points in which corresponding candidate points are connected, the length of the shortest path connecting the corresponding candidate points is determined and passed to the ninth stage circuit 63.

６３は類似度計算回路であって１例えば、゛前記対応候
補点対１ｐ”’　＝　（１１’　＋　ｆｆ”）を成す対
応候補点９１゜１゛が入力文字バタンの文字線上で連結
していて。Reference numeral 63 denotes a similarity calculation circuit 1. For example, corresponding candidate points 91゜1゛ forming the pair of corresponding candidate points 1p"' = (11' + ff") are connected on the character line of the input character button. .

かつＷ’　ｔ　ｔＩ′を結ぶ最短径路の長さがｈＩＩ′
でちることが前記回路６２で得られている場合、前記標
準セグメン）　Ｊ’１７ｍｎとの間の類似度を計算する
ものである。ここで入力セグメントｌ　−ｆｆｊ’−ｆ
ｉｌとすると。And the length of the shortest path connecting W' t tI' is hII'
If the result is obtained by the circuit 62, the degree of similarity between the standard segment) J'17mn is calculated. Here, the input segment l −ffj'−f
If it is il.

類似度ｒは次式で与えられる。The similarity r is given by the following equation.

ここで１１１７１１　、　ＩＩ押、−ＩＩはべ／　）　
ｈｌ（ｊ　＊ｒＱｒｎｎｃｏ　／　ルＡ　テ；６す、Ｗ
は重み関数、δは閾値でらる。第７図は式（３）の説明
図であって２図中二重丸で囲ったａ、ｂは標準バタンの
筆点、−型光で囲ったｃ、ｄ、・・・。Here 111711, press II, -II habe/)
hl(j *rQrnnco / le A te;6su, W
is a weight function and δ is a threshold value. FIG. 7 is an explanatory diagram of formula (3), and in FIG. 2, a and b surrounded by double circles are the writing points of a standard button, and c, d, . . . are surrounded by -type lights.

ｈは入力文字パタンから抽出された特徴点である。h is a feature point extracted from the input character pattern.

ａからｂへ向かう実線の矢印７１紘標準セグメントであ
る。Ｃとｄ、ｄとｅ、・・・９ｇとｈを結ぶ実＄９７２
　、７５　、・・・、７６は文字線である。ここでａ。A solid line arrow 71 is a standard segment pointing from a to b. Fruit connecting C and d, d and e,...9g and h $972
, 75, . . . , 76 are character lines. Here a.

ｂを前記＠　ａｖｍａ　ｓ　ｋｍｎ＋ｉ　ｓまたｃ、ｄ
を対応候補点１．１′とすれば、７１は標準セグメン）
　幻、、、　。b as above @ avma s kmn + i s also c, d
If 1.1' is the corresponding candidate point, then 71 is a standard segment)
Illusion...

Ｃからｈへ向かう一点鎖線のベクトル７７が前記第７図
に示したような大きく屈曲している文字線標準セグメン
トとの類似度が不当に高くならないようにするための方
策である。This is a measure to prevent the vector 77 of the dashed-dotted line from C to h from becoming unduly high in similarity to the character line standard segment which is greatly curved as shown in FIG. 7.

上記得られた類似度の値ｒがｒ≧ｒ’ｒｉｉ　＜　ｒ’
ｒｉｔは閾値）である場合、その類似度の値と、対応候
補点対を成す対応候補点、該対応候補点同士を結ぶ最短
径路上の特徴点の番号を次段回Ｍ７へ渡す。The similarity value r obtained above is r≧r'rii <r'
rit is a threshold value), the value of the similarity, the corresponding candidate points forming the corresponding candidate point pair, and the number of the feature point on the shortest path connecting the corresponding candidate points are passed to the next stage M7.

第１ｒ５Ｊ図示の７紘構造判定回路であって、前記構造
差抽出回路５より受取った前記荷造差バタンか。Is it the packing difference slam received from the structure difference extraction circuit 5 in the 7-loan structure determination circuit shown in Figure 1r5J?

前記ストローク抽出装置６によって抽出された最短径路
、すなわち文字線の中に存在しているかと　・うかを判
定するのが主要な機能であって、以下の手段により実現
する。第８図は該回路７の構成ブロック図である。The main function is to determine whether the shortest path extracted by the stroke extraction device 6 exists within the character line, and this is realized by the following means. FIG. 8 is a block diagram of the circuit 7. As shown in FIG.

図中５３紘水平軸をＸ軸、垂直軸をｙ軸として。In the figure, the horizontal axis is the X axis and the vertical axis is the y axis.

それぞれｎ（例えば１２Ｂ）画素の分解能をもった文字
記憶回路であって、前記前処理回路２で前処理された入
力文字パタンを記憶する。Each character storage circuit has a resolution of n (for example, 12B) pixels, and stores the input character pattern preprocessed by the preprocessing circuit 2.

５４および５５は前記文字記憶回路５３に記憶された文
字を、水平あるいは垂直軸と平行に走査し１文字線の本
数を計数する文字線数計数回路でらる。ある座標値Ｘ’
　＊　Ｙ’が指定された時、計数回路５４はｙ′＋１〜
ｎの間、及び１〜ｙ′−１の間にある文字線数をそれぞ
れ計数し、一方、計数回路５５は１〜ｘ’−１の間、及
びｘ′＋１〜ｎの間にある文字線数をそれぞれ計数し９
次段回路５６に渡す。Reference numerals 54 and 55 are character line number counting circuits that scan the characters stored in the character storage circuit 53 horizontally or parallel to the vertical axis and count the number of lines in one character. A certain coordinate value X'
* When Y' is specified, the counting circuit 54
The counting circuit 55 counts the number of character lines between 1 and x'-1 and between x'+1 and n, respectively. Count each number and make 9
It is passed to the next stage circuit 56.

５６は文字線数記憶回路でＩＳ′）て、前記文字線数計
数回路５４．５５の計数結果を記憶し９次段回路５７へ
書込む。56 is a character line number storage circuit IS') which stores the counting results of the character line number counting circuits 54 and 55 and writes them into the ninth stage circuit 57.

５７は対判定回路である。前記構造差抽出回路５から前
記抽出された構造差バタンを続出す。該構造差パタンを
成すすべての標準セグメントについて、以下の処理を繰
返す。着目している標準セグメントを初、ｎとする。Ｔ
ＩＩ□７に対して、前記スト四−り抽出装置６で得られ
た類似度の値が最も高い類似度を与える対応候補点対か
ら順に以下の沈埋を行なう。前記装ｇｔ、６によって１
例えば対応候補点対−ｏｆを成す対応候補点同士を結ぶ
最短径路上のすべての特徴点の位置座標の値を前記５５
へ書込み、各特徴点の上下左右各走査方向別の文字線の
本数を前記５６より読込む。該文字線の本数の平均値を
上下左右各走査方向別に求め、各々φυ。57 is a pair determination circuit. The extracted structural difference patterns are successively output from the structural difference extraction circuit 5. The following process is repeated for all standard segments forming the structural difference pattern. Let n be the standard segment of interest. T
For II□7, the following embedding is performed in order from the corresponding candidate point pair that gives the highest similarity value obtained by the above-mentioned straight extraction device 6. 1 by gt, 6
For example, the value of the position coordinates of all the feature points on the shortest path connecting the corresponding candidate points forming the corresponding candidate point pair -of is
The number of character lines for each feature point in each of the upper, lower, left, and right scanning directions is read from 56 above. The average value of the number of character lines is determined for each scanning direction (up, down, left, right, left, and right), and φυ is calculated for each.

ψ０．ψ１．ψ冨とする。前記標準セグメントｒ１１．
．．．の上下左右にある文字線数祉φ魯７．φλイ、φ
瓢□φ翫イである。ある走査方向ｚ　（ｚ−Ｕ、Ｄ、Ｌ
、Ｒ）につイテ。ψ0. ψ1. Let ψfu. The standard segment r11.
．．．．．． The number of character lines on the top, bottom, left and right of φ 7. φλi, φ
瓢□φ翫ii. A certain scanning direction z (z-U, D, L
, R) Nitsuite.

次式で与えられる相違度ｄ　（ｚｌを計算する。Calculate the degree of dissimilarity d (zl) given by the following formula.

ｄ（り−１φ−一？＋　　　　　叩・・曲・・・・・叩
・・（４）ζ仁でｄ　（ｚ）≦ｄｔｉ（ｚ）　（ｄｔｍ
　ｔｚ）は閾値）ならば、構造差パタンとなっている標
準セグメントｔｙ＋□に対応する文字線が入力文字バタ
ン上に存在したと判定する。一方＊　ｄ（ｚ）＞　ｄｔ
ｉ（ｚ）の場合、該最短径路（文字線）紘該ｔＱ□に対
応しないとして棄却され２次に類似度の大きい対応候補
点対が同様に調べられる。最終的にすべての対応候補点
対が棄却された時、該標準セグメントに対応する文字線
は入力文字バタン上に存在しなかったと判定される。d(ri-1φ-1?+ Hit...song...hit...(4) ζjin d (z)≦dti(z) (dtm
tz) is the threshold value), it is determined that a character line corresponding to the standard segment ty+□, which is a structural difference pattern, exists on the input character button. On the other hand * d(z) > dt
In the case of i(z), the shortest path (character line) is rejected as not corresponding to the tQ□, and corresponding candidate point pairs having a quadratic degree of similarity are similarly examined. When all the corresponding candidate point pairs are finally rejected, it is determined that the character line corresponding to the standard segment does not exist on the input character button.

上述の処理により、前記二つの標準バタン獣。Through the above-mentioned processing, the two standard bang beasts.

獣′の構造差バタンのすべでの標準セグメントが入力文
字バタン上に存在したか否か判定される。It is determined whether all the standard segments of the beast' structural difference button were present on the input character button.

ＮＰ：入力文字バタン上に存在したＲのＰ−セグメントの本数Ｎｃ：＃　　　　　　存在したＲのＣ−〃ＮＮ：　　　　ｔｒ　　　　　　存在しなかった獣のＮ
−〃である時、標準バタンｌ＜と入力文字バタンとの一致鹿
ｆ町）を次式で定義する。NP: Number of P-segments of R that existed on the input character button Nc: # C- of R that existed NN: tr N of the beast that did not exist
−〃, the match between the standard button l< and the input character button f is defined by the following equation.

ｆ　ｌｌ？ｌ＝　ｇ（Ｎｐ、　Ｎｃ、　ＮＮ）　　　　
・・・・・・・・・・・・・・・・・・・・・（５）こ
こでｇとしては２例えばｇ（Ｎｐ＋Ｎｃ＋ＮＮ）＝ｗｐ・Ｎｐ＋ｗ６ａＮ（＋ｗ
ＨａＮＨ＝−１６１等を用いる仁とができる（但し＋　
ｗｐ、　Ｗｃ、　ｗｗ　　ｈ重み関数である）。ｆｌＲ
）＞ｆ鵠ならば、入力文字バタンはカテゴリ獣′よりも
カテゴリ叡に構造的に類似していると判定し、その結果
を出力部８へ渡す。fll? l=g(Np, Nc, NN)
・・・・・・・・・・・・・・・・・・・・・・・・(5) Here, g is 2 For example, g(Np+Nc+NN)=wp・Np+w6aN(+w
HaNH=-161 etc. can be used (however, +
wp, Wc, ww h are weight functions). flR
)>f, the input character BATA is determined to be structurally more similar to the category E than to the category BUT, and the result is passed to the output unit 8.

（５）　　効果の説明以上説明したように１本発明の装置によれば。(5) Explanation of effects As explained above, according to the apparatus of the present invention.

いかなる候補文字カテゴリの組合せについても。For any combination of candidate character categories.

ス）ｏ−り抽出を用いた識別処理が可能となり。S) Identification processing using o-ri extraction becomes possible.

かつまた、該候補文字カテゴリ同士の構造的な差異とな
っている文字線を検出し、しかる後、該文字線が人力文
字バタン上に存在するか否かを、該文字線の上下左右に
ある文字線数を参照することにより９文字線相互間の位
置関係までを前照して判定する構成であるから２文字識
別能力を大幅に向上できる。In addition, it detects character lines that are structurally different between the candidate character categories, and then determines whether or not the character line exists on the human-powered character button. By referring to the number of character lines, the positional relationship between nine character lines can be determined in advance, so that the ability to identify two characters can be greatly improved.

[Brief explanation of drawings]

第１図は本文字識別装置の構成ブロック図、第２図１標
準パタンの記録形式を説明する説明図。第６図は構造差抽出例を説明する説明図、第４図は構造
差バタンを説明する説明図、第５図はスト田−り抽出装
置の一実施例の構成ブロック図、第６図は連結性判定回
路の構成ブロック図、第７図は類似度を説明する説明図
、第８図は構造判定回路の構成ブロック図である。１・・・入力部、２・・・前処理回路、３・・・大分類
装置。４・・・標準バタン記憶回路、５・・・構造差抽出回路
。６・・・ストローク抽出装置、７・・・構造判定回路、
Ｂ・・・出力部、１１，１２．・・・、１Ｂ、２１，２
２．・・・、２９・・・消点、３３・・・特徴点抽出回
路、３５・・・特徴点記憶回路、３７・・・対応候補点
選択回路、３Ｂ・・・連結性判定回路、６１・・・対応
候補点記憶回路、６２・・・文字線追跡回路、６３・・
・類似度計算回路、７１・・・標準セグメン）、７２，
７３．・・・、７６・・・文字線、７７・・・入力セグ
メント、ａ、ｂ・・・線点、ｃ、ｄ、・・・、ｈ・・・
特徴点、５３・・・文字記憶回路、５７・・・対判定回
路。５４．５５・・・文字線数計数回路、５６・・・文字線
数記憶回路。特許出願人　日本電信電話公社代理人弁理士　森　１）　　寛第　１（２１第　２（２］（Ｑ）　　　　　　　　　　　　　　　　（ｂ）第３１
２１第　６　図第７図手続補正書（自発）１．事件の表示　昭和５８　年特許願第１０５４９０　
　号２、発明の名称　文字識別装置３、補正をする者事件との関係　特許出願人住　所東京都千代田区内幸町１丁目１番６号氏　名（４
２２）日本電信電話公社代表者　真　藤　　　恒４、代理人住　　所　　東京都荒川区西Ｅ、ｌ　Ｍ里４丁目１７番
１号佐原マンン、ン３ＦＣ氏　名　（７４８４）弁理士　森　１）　　寛５、補正
により増加する発明の数なし６、補正の対象図面第１図および第５図７、補正の内容
別紙の通り補正の内容（１）図面第１図および図面第５図を別紙の如く補正す
る。以上FIG. 1 is a block diagram of the structure of the present character identification device, and FIG. 2 is an explanatory diagram illustrating the recording format of the standard pattern. FIG. 6 is an explanatory diagram for explaining an example of structural difference extraction, FIG. 4 is an explanatory diagram for explaining structural difference slam, FIG. FIG. 7 is an explanatory diagram for explaining similarity, and FIG. 8 is a block diagram of the structure determination circuit. 1... Input unit, 2... Preprocessing circuit, 3... Major classification device. 4... Standard button memory circuit, 5... Structural difference extraction circuit. 6... Stroke extraction device, 7... Structure determination circuit,
B...output section, 11, 12. ..., 1B, 21, 2
2. ..., 29... Vanishing point, 33... Feature point extraction circuit, 35... Feature point storage circuit, 37... Corresponding candidate point selection circuit, 3B... Connectivity determination circuit, 61. ...Corresponding candidate point storage circuit, 62...Character line tracing circuit, 63...
・Similarity calculation circuit, 71...standard segment), 72,
73. ..., 76... Character line, 77... Input segment, a, b... Line point, c, d,..., h...
Feature points, 53...Character storage circuit, 57...Pair determination circuit. 54.55...Character line number counting circuit, 56...Character line number storage circuit. Patent applicant Nippon Telegraph and Telephone Public Corporation Patent attorney Mori 1) Hiroshi 1 (21 2(2) (Q) (b) 31
21 Figure 6 Figure 7 Procedural amendment (voluntary) 1. Display of case: 1982 Patent Application No. 105490
No. 2, Title of the invention Character identification device 3, Relationship with the case of the person making the amendment Patent applicant address 1-1-6 Uchisaiwai-cho, Chiyoda-ku, Tokyo Name (4)
22) Nippon Telegraph and Telephone Public Corporation Representative Tsune Shinfuji 4, Agent address 4-17-1 M-ri, Nishi E, Arakawa-ku, Tokyo 3FC Sawara Mann Name (7484) Patent attorney Mori 1) Kan 5 , The number of inventions will not increase due to the amendment 6. Figures 1 and 5 of the drawings to be amended. 7. Contents of the amendment. Contents of the amendment (1) Figures 1 and 5 of the drawings are amended as shown in the attached sheet. do. that's all

Claims

[Claims]

In a character recognition device equipped with a function of extracting strokes, a standard stroke memory circuit stores standard strokes given by the position coordinates of end points, and a difference between multiple categories selected as candidates for input character strokes is detected. A structural difference extraction circuit that extracts on a standard button, detects whether the extracted difference is in the input character button or an extracted stroke, and determines whether the input character button belongs to any candidate character category. A character identification device characterized by comprising a structure determination circuit that determines the structure of a character.