JPH03291781A

JPH03291781A - Character recognizing device

Info

Publication number: JPH03291781A
Application number: JP2093720A
Authority: JP
Inventors: Hiroshi Kameyama; 博史亀山
Original assignee: Glory Ltd
Current assignee: Glory Ltd
Priority date: 1990-04-09
Filing date: 1990-04-09
Publication date: 1991-12-20

Abstract

PURPOSE: To recognize characters without misrecognition, even when they closely resemble one another, by verifying candidates on the basis of block arrangement type candidates and block attribute candidates. CONSTITUTION: The image data of a check is read and input by an image input part 2, and a preprocessing part 3 performs preprocessing and line thinning and inputs the data to a global variable (memory) 20. An inference mechanism 40 matches the condition parts of the rule section of a knowledge base 10 against the contents of a working memory 30 to control the firing of rules. The results of execution parts 12B-12E are input to the working memory 30 and displayed on a display part 4 via the global variable 20.

Description

[Detailed Description of the Invention] Purpose of the Invention (Industrial Field of Application) The present invention relates to a character recognition device that uses character arrangement types to recognize handwritten characters efficiently and reliably.

(Prior Art) A conventional approach to recognizing handwritten characters is, for example, the character recognition method described in Japanese Patent Laid-Open No. 1-233585, proposed by the present applicant. In that method, the character data is first preprocessed (smoothing and thinning) and subjected to line structure analysis. Then, as global structure recognition, candidate blocks judged to contain one or more characters are extracted, and character recognition is performed on each block. For blocks that could not be recognized, segmentation, line structure analysis, or data reconstruction is repeated until recognition is complete.

(Problem to Be Solved by the Invention) However, with the conventional handwritten character recognition method described above, a case such as the U.S. check shown in FIG. 4 ultimately could not be recognized. Specifically, when recognition of the handwritten characters of FIG. 4 proceeds, blocks B1.1 to B1.8 are finally obtained, as shown in FIG. 6. When recognition is then performed block by block, blocks B1.1, B1.2, B1.3, B1.5, B1.7 and B1.8 can be recognized, but blocks B1.4 and B1.6 are misrecognized.

In other words, block B1.4 is recognized as the digit "1", and block B1.6 is also recognized as the digit "1".

Furthermore, even if symbol recognition information for cent bars, cent points and the like were added to the conventional method, it could not determine whether block B1.4 is the digit "1" or a cent bar, nor whether block B1.6 is the digit "1" or a cent point. That is, even though the blocks were extracted correctly, recognition ultimately failed whenever a block admitted two or more possible characters.

As FIG. 4 shows, the amount written on a U.S. check is composed of a dollar order, a cent order, a cent bar and a cent point. Although handwriting varies from person to person, almost all handwritten amounts can be classified into one of a small number of types.

In practice, almost all handwritten amounts on U.S. checks belong to one of types A to F in FIG. 7. This is not limited to U.S. checks: the same kind of regularity can be found in any writing whose format is fixed to some degree.

The present invention was made in view of the above circumstances. Its object is to provide a character recognition device that, for handwritten characters whose format is fixed to some degree, uses arrangement types into which such handwriting has been classified and reliably recognizes the characters on the basis of the arrangement type.

Structure of the Invention (Means for Solving the Problems) The present invention relates to a character recognition device that recognizes characters using a knowledge base, which stores the rules needed for recognition derived from information about the group of characters to be recognized, and an inference mechanism that makes inferences using the knowledge base. The above object is achieved by providing: block extraction means for extracting individual character blocks from the group of characters to be recognized and assigning a block candidate to each character block; block arrangement type extraction means for extracting, as block arrangement type candidates, those of a plurality of predetermined block arrangement types that approximate the arrangement of the character blocks extracted by the block extraction means; attribute assignment means for assigning block attribute candidates to each block on the basis of the block arrangement type extracted by the block arrangement type extraction means; and verification means for verifying the candidates on the basis of the block arrangement type candidates and the block attribute candidates.

(Operation) In the present invention, the line structure of the group of characters to be recognized is analyzed and block features are extracted. A currency unit type is then hypothesized for the blocks. A currency unit type is a display type, registered in advance, that classifies by manner of notation how the part of an amount above the decimal point and the part below it are written (yen and sen for the yen, dollars and cents for the dollar, pounds and pence for the pound). Here, for convenience, the dollar is taken as the subject and the type is called the cent type. The hypothesis is then verified and the recognition result is displayed, so that handwritten characters, in particular digits handwritten on checks and the like, are recognized accurately.
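The hypothesize-then-verify flow described above can be sketched as follows. This is a minimal illustration only: the names `Hypothesis`, `Verifier` and `surviving` are hypothetical and do not appear in the patent, and the patent does not state how a result is chosen when several hypotheses survive.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Hypothesis:
    """A hypothesized cent type plus the block attributes it implies."""
    cent_type: str                                             # e.g. "E" or "B"
    attributes: Dict[str, str] = field(default_factory=dict)   # block id -> attribute


# A verifier returns True when the hypothesis survives the check.
Verifier = Callable[[Hypothesis], bool]


def surviving(hypotheses: List[Hypothesis], verifiers: List[Verifier]) -> List[Hypothesis]:
    """Keep only the hypotheses that no verifier rejects."""
    return [h for h in hypotheses if all(check(h) for check in verifiers)]
```

In the worked example later in the description, two hypotheses (cent types E and B) are generated and one of them is rejected during verification, which corresponds to a single survivor here.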

(Embodiment) FIG. 1 shows a configuration example of the present invention. A control unit 1, consisting of a CPU or the like, controls the whole device. Handwritten characters written on a check or the like are read optically by an image input unit 2, subjected to thinning and other processing in a preprocessing unit 3, and input to a global variable (memory) 20. Connected to the global variable 20 are a block extraction unit 21, a block feature extraction unit 22 and a relative position code creation unit 23, as well as a working memory 30 that holds variables such as block features, block attributes, cent type hypotheses and information representing the flow of recognition. An inference mechanism 40 matches the condition parts of the rule section of a knowledge base 10 against the contents of the working memory 30 and controls the firing of rules. The knowledge base 10 has condition parts 11B to 11E and execution parts 12B to 12E for rule groups B to E. The results of the execution parts 12B to 12E are input to the working memory 30 and displayed on a display unit 4 via the global variable 20.
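The interplay of knowledge base, working memory and inference mechanism resembles a small production system. The sketch below is an assumption-laden illustration, not the device's actual control logic: `Rule`, `run_rules` and the "first matching rule wins" conflict resolution are all hypothetical choices made for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

WorkingMemory = Dict[str, object]


@dataclass
class Rule:
    name: str
    condition: Callable[[WorkingMemory], bool]   # condition part
    action: Callable[[WorkingMemory], None]      # execution part


def run_rules(rules: List[Rule], wm: WorkingMemory, max_cycles: int = 100) -> List[str]:
    """Fire each rule at most once, repeating until no unfired rule matches."""
    fired: List[str] = []
    for _ in range(max_cycles):
        ready = [r for r in rules if r.name not in fired and r.condition(wm)]
        if not ready:
            break
        rule = ready[0]        # assumed conflict resolution: first match wins
        rule.action(wm)        # the execution part writes back to working memory
        fired.append(rule.name)
    return fired
```

Because each execution part writes to the working memory, a rule that could not fire in one cycle may become eligible in the next, which is how the attribute, hypothesis and verification stages can chain.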

The recognition method of the present invention is based on Japanese Patent Laid-Open Nos. 1-116781 and 1-116782 by the present applicant. FIG. 2 is a flowchart showing an example of the operation of the present invention, and the recognition operation is described below with reference to it.

First, the image data of a check such as that shown in FIG. 4 is read and input by the image input unit 2 (step S1), and the preprocessing unit 3 performs preprocessing and thinning as shown in FIG. 5 (step S2). Line structure analysis then extracts blocks B1.1 to B1.8 as shown in FIG. 6 (step S3). Block extraction is performed on valid blocks. Let EF1 be the average height of the blocks having 4 or more dots, and let EF2 be the average height of the blocks whose height is at least EF1 × 0.9. A block is valid if its height is greater than EF2 × 1/3 or its number of dots is greater than EF2 × 1/2.

Features are then extracted from blocks B1.1 to B1.8 (step S4). The block features are the block height, block width, block aspect ratio, horizontal center-of-gravity position, character recognition result, bar-likeness, underbar-likeness, upperbar-likeness, middlebar-likeness, diagonal-underbar-likeness and diagonal-upperbar-likeness.

The degree of height is expressed as α1 = block height / EF2: "extremely high" when α1 > 1.3, "fairly high" when α1 > 1.1, "high" when α1 > 0.9, "slightly low" when α1 > 0.7, "low" when α1 > 0.5, "fairly low" when α1 > 0.3, and "extremely low" when α1 ≥ 0. The degree of width is expressed as β1 = block width / EF2: "fairly wide" when β1 > 0.9, "wide" when β1 > 0.5, and "narrow" when β1 ≥ 0.
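The valid-block test and the height and width grades can be written out directly from the thresholds in the text. The sketch below makes two assumptions that the text does not settle: EF2 is averaged over all blocks (not only those with 4 or more dots), and each grade is the first threshold exceeded when checking from the largest down. Function names are hypothetical.

```python
from typing import List, Tuple


def effective_heights(blocks: List[Tuple[float, int]]) -> Tuple[float, float]:
    """blocks holds (height, dot_count) pairs; returns (EF1, EF2)."""
    tall = [h for h, dots in blocks if dots >= 4]
    if not tall:
        raise ValueError("no block has 4 or more dots")
    ef1 = sum(tall) / len(tall)
    # Assumption: EF2 is taken over every block at least 0.9 * EF1 high.
    taller = [h for h, _ in blocks if h >= ef1 * 0.9]
    ef2 = sum(taller) / len(taller)
    return ef1, ef2


def is_valid(height: float, dots: int, ef2: float) -> bool:
    """Valid if taller than EF2/3 or holding more than EF2/2 dots."""
    return height > ef2 / 3 or dots > ef2 / 2


def height_grade(height: float, ef2: float) -> str:
    alpha = height / ef2
    for limit, name in [(1.3, "extremely high"), (1.1, "fairly high"), (0.9, "high"),
                        (0.7, "slightly low"), (0.5, "low"), (0.3, "fairly low")]:
        if alpha > limit:
            return name
    return "extremely low"


def width_grade(width: float, ef2: float) -> str:
    beta = width / ef2
    if beta > 0.9:
        return "fairly wide"
    return "wide" if beta > 0.5 else "narrow"
```

Normalizing by EF2 rather than by absolute pixel size is what lets the same rule thresholds apply to checks written at different scales.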

Furthermore, the aspect ratio is expressed as γ1 = height / width: "fairly horizontally long" when γ1 < 0.67, "horizontally long" when γ1 < 0.91, "medium" when γ1 < 1.43, and "vertically long" otherwise. The horizontal center-of-gravity position is expressed as a value from 0 to 100, as shown in FIG. 8.
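The aspect-ratio grade can be sketched in the same way. The threshold list in the source text is partly duplicated by extraction, so the four-way reading used here (0.67, 0.91, 1.43) is an assumption, and the function name is hypothetical.

```python
def aspect_grade(height: float, width: float) -> str:
    """Grade gamma1 = height / width using the assumed four-way split."""
    gamma = height / width
    if gamma < 0.67:
        return "fairly horizontally long"
    if gamma < 0.91:
        return "horizontally long"
    if gamma < 1.43:
        return "medium"
    return "vertically long"
```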

When block feature extraction is complete, rule group B in the knowledge base 10 is applied to pre-assign block attributes (step S5). In this rule group, the condition part 11B is a description of block features and the execution part 12B assigns block attributes. The blocks B1.1 to B1.8 shown in FIG. 6 are given the following attributes.

That is, as shown in FIG. 3(A), block B1.1 is a number (rule B2), block B1.2 is a number (rule B2), block B1.3 is a number (rule B2), block B1.4 is a diagonal bar (rule B10), block B1.4 is a number (rule B2), block B1.5 is a number (rule B2), block B1.6 is a number (rule B2), block B1.7 is a number (rule B2), and block B1.8 is a number (rule B2). The rules are described later.

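After the block attributes have been assigned in this way, a cent type such as those shown in FIG. 7(A) to (G) is hypothesized on the basis of rule group C (step S6). That is, rule group C in the knowledge base 10 is applied as shown in FIG. 3(B); its condition part 11C is a description in terms of block features, block attributes and relative position codes, and its execution part 12C sets a cent type hypothesis. In FIG. 7(A) to (G), the dollar order is the part of the amount above the decimal point (cent point), and the cent order is the part below the cent point. Cent types C to F in FIG. 7(C) to (F) include a cent bar, and cent types D to F in FIG. 7(D) to (F) also include a cent symbol such as "100" or "XX". For the character recognition data of FIG. 6, cent type E or B, as shown in FIG. 7(E) or (B), is hypothesized, and attributes are newly assigned and updated for the hypothesized cent type (step S10). That is, as the cent type hypothesis (step S6), two types are hypothesized: cent type E by rule C3d3 and cent type B by rule CB1, both described later. For cent type E, blocks B1.4 and B1.6 are given the attributes diagonal cent bar and cent symbol; for cent type B, blocks B1.7 and B1.8 are each given the attribute cent order digit.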
A cent type hypothesis as shown in (6) is made based on rule group C (step S6). That is, the condition part 11G for setting the layout type hypothesis in the knowledge base IO is a description using block characteristics, block attributes, and relative position codes.
The execution unit 12G sets the cent type hypothesis as shown in FIG.
This is done by applying rule group C as shown in B). Figure 7 (A
) ~ (G), the dollar order is the m currency notation part of 1 decimal point (cent point) or more, and the sen 1 ~ order is the currency notation part of 1 decimal point or less, and in (C) ~
The cent type C-F in (1:) has a cent bar CII or (t-t!), and the cent bar type D-F in Fig. 7 (D) to (F) has a 100°', " A cent signal such as 'XX"' [:hl or
J: A cent type E or B is hypothesized, and a new attribute is updated for the hypothesized cent type (step 510). zu 2t, sen 1
- As a type hypothesis (step 56), cent type E and rule CBI are determined by rule C3d3, which will be described later.
Accordingly, two scales, Sen 1 to Type B, are temporarily constructed, and for Cent type E, diagonal cent bars and cent symbols are given as attributes of blocks BL4 and B1, 6, and for cent type B, blocks old and 7 are added. And a fidget cent order number is given as an attribute of BL8.

Next, rule group D is applied and the assigned attributes are updated.

In hypothesis verification 1, the condition part 11D is a description in terms of the cent type hypothesis, block attributes, block features and relative position codes, and the execution part 12D updates the block attributes. If a valid block exists that matches none of the rules in rule group D, and is therefore given no attribute as a component of the cent type, the cent type hypothesis is rejected (hypothesis verification 1). As shown in FIG. 3(C), the attributes are updated on the basis of cent type E as follows: block B1.7 is a cent symbol (rule DK3), block B1.8 is a cent symbol (rule DK4), block B1.3 is a cent order digit (rule Dd1), block B1.5 is a cent order digit (rule Dd1), block B1.1 is a dollar order digit (rule D1), and block B1.2 is a dollar order digit (rule D1).
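The rejection criterion of hypothesis verification 1 amounts to a coverage check: every valid block must end up with an attribute that makes it a component of the hypothesized cent type. A minimal sketch, with a hypothetical function name and a plain dictionary standing in for the working memory:

```python
from typing import Dict, Iterable


def passes_verification_1(valid_blocks: Iterable[str], component_attr: Dict[str, str]) -> bool:
    """True when every valid block was given a component attribute by rule group D."""
    return all(block in component_attr for block in valid_blocks)
```

A hypothesis that leaves even one valid block unexplained is dropped, so a cent type only survives if it accounts for everything written in the amount field.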

The attributes are likewise updated on the basis of cent type B: block B1.6 is a cent point (rule DP1), block B1.1 is a dollar order digit (rule D1), block B1.2 is a dollar order digit (rule D1-1), block B1.4 is a dollar order digit (rule D1-1), block B1.3 is a dollar order digit (rule D1), and block B1.5 is a dollar order digit (rule D1-1). After this attribute update and hypothesis verification (step S10), segment processing and verification 2 based on the overall arrangement are performed (step S20). As shown in FIG. 3(D), the cent type B hypothesis is checked by rule E-PO56 and rejected because the relative positional relationship between blocks B1.4, B1.6 and B1.8 is not appropriate. Verification 2 ends once it has been determined whether any rejection rule applies. The recognition result is then displayed on the display unit 4 (step S30) and the required check processing (for example, deposit processing) is performed (step S31).

The attributes given to the extracted blocks are as follows. The block attributes are "number", "includes underbar", "bar", "consecutive digits", "cent order low consecutive digits", "includes upperbar" and "unknown". The attribute "number" is updated to "dollar order digit", "cent order digit", "cent symbol" or "cent point". The attribute "includes underbar" is updated to "cent order 1 character including underbar" or "cent order 2 characters including underbar". "Cent order 2 characters including underbar" is updated to "cent order 2 digits" and "cent bar"; the source repeats this entry a second time with the result "cent order 2 digits", "cent order digit" and "cent bar". The attribute "bar" is updated to "cent bar". The attribute "consecutive digits" is updated to "dollar order consecutive digits" or "cent order consecutive digits"; "dollar order consecutive digits" is updated to two "dollar order digit" attributes, and "cent order consecutive digits" to two "cent order digit" attributes. The attribute "cent order low consecutive digits" is given two "cent order digit" attributes. Finally, the attribute "unknown" is given "dollar order digit", "cent order digit", "cent symbol" or "cent point".
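The attribute updates listed above form a small refinement table: each coarse attribute may only be replaced by certain finer ones. The sketch below encodes the unambiguous entries only. The table name, the function name and the decision to leave out the garbled underbar entries are choices made for this example.

```python
from typing import Dict, Set

# Coarse attribute -> finer attributes it may be updated to (subset of the text).
REFINEMENTS: Dict[str, Set[str]] = {
    "number": {"dollar order digit", "cent order digit", "cent symbol", "cent point"},
    "bar": {"cent bar"},
    "consecutive digits": {"dollar order consecutive digits", "cent order consecutive digits"},
    "unknown": {"dollar order digit", "cent order digit", "cent symbol", "cent point"},
}


def can_refine(current: str, proposed: str) -> bool:
    """True when the table allows updating `current` to `proposed`."""
    return proposed in REFINEMENTS.get(current, set())
```

A rule engine could consult such a table before applying an execution part, so that, for example, a block pre-assigned "bar" is never turned into a digit.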

Detection of attributes from relative positional relationships is illustrated by the relative position map in FIG. 9 and by block positions and shapes in FIG. 10. FIG. 11 shows an example in which block B1.4 is taken as the base block. For block B1.5, in the horizontal direction x_min = center, x center of gravity = center and x_max = center, and in the vertical direction y_min = middle, y center of gravity = middle and y_max = middle, so from FIG. 12 its relative position code is (26, 26). For block B1.7, in the horizontal direction x_min = center, x center of gravity = right and x_max = right, and in the vertical direction y_min = middle, y center of gravity = middle and y_max = middle, so from FIG. 12 its relative position code is (29, 26). For block B1.8, in the horizontal direction x_min = right, x center of gravity = right and x_max = out-R, and in the vertical direction y_min = middle, y center of gravity = middle and y_max = middle, and from FIG. 12 its relative position code is (33, 25). As FIG. 12 shows, there are 35 combinations of placement in each of the vertical and horizontal directions, and the position of block B1.6 relative to block B1.4, for example, is expressed as a relative position code of the form (2[illegible], 26). This representation makes it possible to express, simply and accurately, the position of digits or a cent symbol relative to a cent bar in particular.

The relative position codes between blocks are created by the relative position code creation unit 23 and held in the working memory 30 via the global variable 20. They are then matched against the codes in the condition parts of the pre-written rules and are used in the pre-assignment of block attributes (step S5), the cent type hypothesis (step S6), the assignment and updating of attributes based on the cent type (step S10), and the verification by segment processing and overall arrangement (step S20). As in FIG. 13, the relative position codes of blocks b and c with respect to block a are both x = 26, y = 26. To express the above/below relationship with respect to block a, a scan is made upward or downward from an arbitrary point of block b or c. As in FIG. 14, if the scan meets block a above, u-d = d; if it meets block a below, u-d = u. Note that u-d is used only when x = 26 and y = 26.
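The figure of 35 combinations per axis is what one gets by classifying a block's minimum, center of gravity and maximum into five zones each: there are exactly 35 non-decreasing triples over five zones. The sketch below numbers those triples in lexicographic order. This numbering is an inference, since FIG. 12 is not reproduced in the text, but it does reproduce the horizontal codes 26, 29 and 33 quoted above. The zone names other than "center", "right" and "out-R", the cut points passed to `zone_of`, and all function names are hypothetical.

```python
from bisect import bisect_right
from itertools import combinations_with_replacement
from typing import Sequence

ZONES = ("out-L", "left", "center", "right", "out-R")

# Non-decreasing (min, center of gravity, max) zone triples, numbered from 1.
CODE_OF = {t: i + 1 for i, t in enumerate(combinations_with_replacement(range(5), 3))}


def zone_of(value: float, cuts: Sequence[float]) -> int:
    """Zone index 0..4 for a coordinate, given four ascending cut points (assumed)."""
    return bisect_right(list(cuts), value)


def axis_code(z_min: str, z_cog: str, z_max: str) -> int:
    """Relative position code for one axis from three zone names."""
    return CODE_OF[(ZONES.index(z_min), ZONES.index(z_cog), ZONES.index(z_max))]
```

Under this numbering, three identical "center" (or "middle") zones give 26, which matches the codes quoted for blocks B1.5 and B1.7. The vertical code 25 quoted for block B1.8 does not follow from it, which may reflect an extraction error or a detail of FIG. 12 that is not available here.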

Next, examples of the rules in the knowledge base 10 used in the present invention are described.

Rule B10.
Condition part: 1. The block is a valid block. 2. Height = "extremely high", "fairly high" or "high". 3. Horizontal center-of-gravity position > 30. 4. Diagonal-bar-likeness > 72. A block having block features 1 to 4 above exists.
Execution part: Give that block the attribute "diagonal bar".

Rule B2.
Condition part: 1. The block is a valid block. 2. Height = "extremely high", "fairly high", "high", "slightly low", "low" or "fairly low". 3. Character-likeness = "may be a number" or "is a number".
Execution part: Give the attribute "number".

Rule C3d3.
Condition part: There is a block with the attribute "diagonal bar" and a valid block at its lower right, provided that the horizontal position of the "diagonal bar" block > 40.
Execution part: Treat that valid block as a "cent symbol" and generate a cent type E hypothesis.
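Rules B10 and B2 are simple conjunctions over block features, so each condition part can be written as a predicate. The sketch below is illustrative only: the `BlockFeatures` record and the function names are hypothetical, and the numeric thresholds are taken as read from the text.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class BlockFeatures:
    valid: bool                    # passed the valid-block test
    height_grade: str              # e.g. "high", "slightly low"
    x_center_of_gravity: float     # horizontal position, 0 to 100
    diagonal_bar_likeness: float   # score from the feature extractor
    character_likeness: str        # e.g. "is a number", "may be a number"


TALL = {"extremely high", "fairly high", "high"}
NOT_TINY = TALL | {"slightly low", "low", "fairly low"}


def rule_b10(b: BlockFeatures) -> bool:
    """Condition part of rule B10 (attribute "diagonal bar")."""
    return (b.valid and b.height_grade in TALL
            and b.x_center_of_gravity > 30 and b.diagonal_bar_likeness > 72)


def rule_b2(b: BlockFeatures) -> bool:
    """Condition part of rule B2 (attribute "number")."""
    return (b.valid and b.height_grade in NOT_TINY
            and b.character_likeness in {"may be a number", "is a number"})


def pre_assign(b: BlockFeatures) -> List[str]:
    """Collect every attribute whose rule fires; a block may receive more than one."""
    attrs = []
    if rule_b10(b):
        attrs.append("diagonal bar")
    if rule_b2(b):
        attrs.append("number")
    return attrs
```

A tall, slanted stroke that also looks like a "1" satisfies both predicates and so carries both candidate attributes into the hypothesis stage, which is the ambiguity the cent type hypotheses are meant to resolve.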

Rule CB1.
Condition part: 1. The attributes of the first (β) and second (α) valid blocks from the right end are "number" or "small number". 2. Both have height "slightly low", "low", "fairly low" or "extremely low"; these are blocks α and β, with (X center of gravity of α) < (X center of gravity of β). 3. Near the left side of blocks α and β there is a block γ with the attribute "number" or "consecutive digits", and (X center of gravity of α) − (X center of gravity of γ) < (height of γ) × 3. 4. The height of γ is at least 1.5 times the height of α or β.
Execution part: Treat blocks α and β as cent order digits and generate a cent type B hypothesis.
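The condition part of rule CB1 can be sketched as one predicate over the three blocks. Several readings here are assumptions: "near the left side" is taken to mean that γ lies to the left of α, the phrase "the height of α or β" is taken as the smaller of the two, and the factor 1.5 assumes a decimal point lost in extraction. The `Block` record and function name are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Block:
    attribute: str
    height: float
    height_grade: str
    x_cog: float        # X center of gravity in image coordinates


LOW = {"slightly low", "low", "fairly low", "extremely low"}


def rule_cb1(alpha: Block, beta: Block, gamma: Block) -> bool:
    """True when alpha, beta (cent digits) and gamma (dollar part) fit cent type B."""
    small_digits = all(b.attribute in {"number", "small number"} and b.height_grade in LOW
                       for b in (alpha, beta))
    ordered = alpha.x_cog < beta.x_cog
    gamma_kind = gamma.attribute in {"number", "consecutive digits"}
    gamma_left = gamma.x_cog < alpha.x_cog                         # assumed reading of "left side"
    gamma_near = alpha.x_cog - gamma.x_cog < gamma.height * 3
    gamma_tall = gamma.height >= 1.5 * min(alpha.height, beta.height)  # assumed reading
    return small_digits and ordered and gamma_kind and gamma_left and gamma_near and gamma_tall
```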

Rule DK3.
Condition part: The cent type is E, and there is a valid block whose positional relationship to the block with the attribute "diagonal cent bar" is lower right (ichi-s-bar).
Execution part: Set that block's attribute to "cent symbol", making it the cent symbol block of cent type E.

Rule Dd1.
Condition part: The cent type is E, there is a block with the attribute "diagonal cent bar", and at its upper left (ichi-s-Dd1) there is a block whose attribute is "number" or "small number".
Execution part: Set that block's attribute to "cent order digit", making it a cent order digit block of cent type E.

Rule D1.
Condition part: For all cent types, there are two "cent order digit" blocks (α and β) and a block γ with the attribute "number" whose position satisfies the following inequalities: (X center of gravity of γ) < (X centers of gravity of α and β); (height of γ) > 0.9 × (height of α or β).
Execution part: Make γ a dollar order digit.

Rule D1-1.
Condition part: For all cent types, there are a "dollar order digit" α and "cent order digits", and there is a "number" δ in the following positional relationship: (X center of gravity of α) < (X center of gravity of δ); the relative position code of α as seen from δ is code-y = 11, 20, 21, 26 or 27. [A further condition involving β and γ is illegible in the source.]
Execution part: Make δ a "dollar order digit".

Rule DP1.
Condition part: For all cent types, two "cent order digit" blocks (α, β) are known, with (X center of gravity of α) < (X center of gravity of β), and there are two "number" blocks (γ, δ), with (X center of gravity of γ) < (X center of gravity of δ) < (X center of gravity of α). δ is "1", "2" or "6"; δ is smallish and also small compared with γ. As to position, (y_min of δ) > (y center of gravity of γ).
Execution part: Make δ a cent point.

Rule E-PO56.
Condition part: The cent type is B, there are "dollar order digits" α and β and a "cent order digit" γ, and α, β and γ are in the following positional relationship: the relative position code of γ as seen from β is ichi-s-bar; the relative position code of α as seen from β is ichi-s-Dd1. Here, ichi-s-bar is the positional relationship between a diagonal underbar and a block below it, namely (x = 26, y = 26, u-d = d), (x = 26, y = 27), (x = 27, y = 2[illegible]) or (x = 29, y = 26).
Execution part: Reject the cent type hypothesis.

Effects of the Invention. As described above, according to the character recognition device of the present invention, for handwritten characters whose format is fixed to some degree, arrangement types into which such handwriting has been classified are used: the arrangement type to which the writing belongs is hypothesized, and recognition proceeds on the basis of that hypothesis. Even very similar shapes, such as the digit "1" and a diagonal bar, can therefore be recognized reliably without misrecognition.

[Brief explanation of drawings]

FIG. 1 is a block diagram showing the basic configuration of the present invention; FIG. 2 is a flowchart showing an example of its operation; FIGS. 3 to 6 are diagrams for explaining the present invention in detail; FIGS. 7(A) to (G) show the kinds of cent type; FIG. 8 is a diagram for explaining the horizontal center-of-gravity position; FIGS. 9 to 11 are diagrams for explaining the detection of attributes from relative positional relationships; FIG. 12 shows the relative position codes; and FIGS. 13 and 14 are diagrams for explaining ichi-s-bar.

Description of symbols: 1: control unit; 2: image input unit; 3: preprocessing unit; 4: display unit; 10: knowledge base; 20: global variable; 21: block extraction unit; 22: block feature extraction unit; 23: relative position code creation unit; 30: working memory; 40: inference mechanism.

Claims

[Claims]

1. A character recognition device that recognizes characters using a knowledge base storing rules necessary for recognition derived from information regarding a group of characters to be recognized, and an inference mechanism that makes inferences using the knowledge base, the device comprising: block extraction means for extracting individual character blocks from the group of characters to be recognized and assigning a block candidate to each character block; block arrangement type extraction means for extracting, as block arrangement type candidates, those of a plurality of predetermined block arrangement types that approximate the arrangement of the character blocks extracted by the block extraction means; attribute assigning means for assigning block attribute candidates to each block on the basis of the block arrangement type extracted by the block arrangement type extraction means; and verification means for verifying the candidates on the basis of the block arrangement type candidates and the block attribute candidates.