JPS58146974A

JPS58146974A - Reader of character

Info

Publication number: JPS58146974A
Application number: JP57029333A
Authority: JP
Inventors: Toshio Tsutsumida; 敏夫堤田; Yasuhiro Yamada; 山田　康宏
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1982-02-25
Filing date: 1982-02-25
Publication date: 1983-09-01

Abstract

PURPOSE:To attain character discrimination processing highly accurately and simply, by performing character discrimination processing independently of character discrimination processing to be collectively applied to the whole categories. CONSTITUTION:After storing a character pattern 11 in a pattern memory 10, a feature group is extracted by a feature group forming part 20 and inputted to a character discriminating part 30 through a feature group register 21. The character discriminatig part 30 matches the contents of combination logics with that of the feature group register 21 in each subcategory on a dictionary memory 40 and outputs a candedate category to an individual character discriminating part 60 through a candidate category register 31. The individual character discriminating part 60 discriminates characters on the basis of an individual feature group extracted from an individual feature group forming part 50 and the elements of successive combination logics from an individual dictionary memory 90.

Description

【発明の詳細な説明】（１）　　発明の属する分野の説明本発明は０文字読取処理、特に字形の類似した文字を読
取る処理において１個々の類似文字を区別するために、
微細な字形上の差異を拠り所とした高精度な識別処理を
行なう文字読取装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION (1) Description of the field to which the invention pertains The present invention provides a method for distinguishing individual similar characters in 0 character reading processing, particularly in reading characters with similar glyph shapes.
The present invention relates to a character reading device that performs highly accurate identification processing based on minute differences in character shape.

（り　　従来の技術の説明カナ、英字、数字等を混在させて記述した文書・特に手
書き文書では１字形のよく似た文字０例えばカナのｌチ
′と数字の１４′、カナのｌ工ｌと英字のＩｌｌ、数字
の１７１　とカナのｌりＩ９１ワＩ等が多く存在する。(Explanation of conventional technology) Documents written using a mixture of kana, alphabetic characters, numbers, etc. Especially in handwritten documents, characters with similar shapes such as kana lchi' and numeral 14', kana l ko l There are many examples such as the alphabetic character Ill, the number 171, and the kana character I91 wa I.

これら類似文字を含めて読取対象とした従来の文字読取
装置では２手書文字の変形に応じてカテゴリ内を複数の
サブカテゴリに分割し０種々の識別用特徴による組合せ
論理によって各サブカテゴリを記述し、混同され易い他
カテゴリとの間に存在する字形の差異を表わした識別用
特徴を、上記組合せ論理の中に含めた辞書を構成するこ
とによって、他カテゴリとの間にある手書き変形に関す
るサブカテゴリの守備範囲を微妙に調整していた。Conventional character reading devices that include these similar characters in the reading target divide a category into multiple subcategories according to the transformation of handwritten characters, and describe each subcategory using combinatorial logic using various identification features. By constructing a dictionary that includes in the above combinatorial logic the discriminative features that represent differences in character shapes that exist between other categories that are easily confused, it is possible to protect subcategories related to handwritten deformation between them and other categories. The range was slightly adjusted.

このような装置では、高精度な読取性能を確保する上で
、類似文字を含めた種々のカテゴリｆＲ時に区別するた
めに、サブカテゴリ毎の論理数や。In such a device, in order to ensure highly accurate reading performance, and to distinguish between various categories including similar characters, a logical number is set for each subcategory.

手書き変形に対するサブカテゴリ数が大きくなり。The number of subcategories for handwritten transformations has increased.

装置が高価になること９文字判定処理に時間がかかるこ
と、また、読取字種の拡大に伴う辞書の修正が困難なこ
となどの欠点があった。The disadvantages include that the device is expensive, that it takes time to process nine characters, and that it is difficult to modify the dictionary as the number of characters that can be read increases.

これに対し、全カテゴリを一括した文字判定処理におい
て、入力文字に対する候補カテゴリが複数出現した場合
に、これらカテゴリ関に固有な字形的差興を新たに僅別
特徴として抽出し、カテゴシ対電に類似文字を区別する
文字読取装置がある。On the other hand, when multiple candidate categories appear for an input character in character judgment processing for all categories at once, the glyph differences unique to these categories are extracted as new distinct features, and the There are character reading devices that distinguish between similar characters.

このような装置ではあるカテゴリの組合せにおいて類似
文字として発生する文字バタンか手書き変形により一様
でないため単純な文字判定処理が困難なこと、候補カテ
ゴリを得た後、その組合せ内容に応じ字形上の差異を詳
細に抽出するため処理に時間がかかることなどの欠点が
あった。With such devices, it is difficult to perform simple character recognition processing because the characters are not uniform due to similar character slams or handwritten deformations in a certain combination of categories. This method has the disadvantage that it takes time to extract the differences in detail.

（１）　　発明の目的本発明はこれらの欠点を除去したもので、全カテゴνを
一括した文字判定処理とは別に類似文字であるカテゴリ
の組合せに対し９個別に文字判定処理を行なうこと、カ
テゴリ対毎にではなく手書を変形に応じたサブカテゴリ
対電に詳細な文字判定処理を行なうこと、標準字形に近
いサブカテゴνを他に優先させることを特徴としており
、・文字読取装置における字形の類似した文字バタンの
だめの文字判定処理を高精度かつ単純な処理で実現する
ことを目的とする。(1) Purpose of the Invention The present invention eliminates these drawbacks, and includes character determination processing for nine individual combinations of categories that are similar characters, in addition to character determination processing for all categories ν at once. It is characterized by performing detailed character judgment processing not on a pair-by-pair basis but on subcategory pairs according to handwriting deformation, and giving priority to subcategories ν that are close to standard glyph shapes over others.・Similar glyph shapes in character reading devices The purpose of this invention is to realize character judgment processing for character slams with high precision and simple processing.

（４）発明の構成および作用の説明オ１図は本発明の実麺例であって、１０はバタンメモリ
、２０は特徴群作成部、２１は特徴群し、ジスタ、３０
は文字判定部、４０は辞書メモリ。(4) Description of structure and operation of the invention Figure 1 shows an actual example of the present invention, in which 10 is a button memory, 20 is a feature group creation section, 21 is a feature group, a register, 30
Reference numeral 40 indicates a character determination unit, and 40 a dictionary memory.

３１は候補カテゴリ・レジスタ、５０は個別特徴群作成
部、５１は個別特徴群レジスタ、６０は個別文字判定部
、７０はカテゴリ対テーブル、８０はサブカテゴリ対テ
ーブル、９０は個別辞書メモリ、ｌｌは入力された文字
バタン、６１は文字判定結果である。31 is a candidate category register, 50 is an individual feature group creation unit, 51 is an individual feature group register, 60 is an individual character determination unit, 70 is a category pair table, 80 is a subcategory pair table, 90 is an individual dictionary memory, ll is an input 61 is the character determination result.

これを動作するには、先ず読取るべき入力されり文字バ
タンｌｌをバタンメモリｌＯに格納する。To operate this, first the input character button 11 to be read is stored in the button memory 10.

特徴群作成部２０では、バタンメモｕｌＯにアル文字バ
タンから、その字形を記述する特徴群の抽出を行ない、
抽出結果を特徴群レジスタ２１に格納する。特徴群は読
取対象とする全カテゴリを一括した文字判定処理におい
て９個々のサブカテゴリを他カテゴリと分離する上で効
果的な特徴によって構成されている。The feature group creation unit 20 extracts a feature group that describes the character shape from the al character batan in the batan memo ulO,
The extraction result is stored in the feature group register 21. The feature group is made up of features that are effective in separating each of the nine subcategories from other categories in a character determination process for all categories to be read at once.

辞書メモリ４０は、読取対象であるカテゴνについて、
各カテゴリ内を手書き変形に応じたサブカテゴリに分割
し１判定のための組合せ論理を格納しておく０本実施例
における辞書メモリ４０上の各サブカテゴリの組合せ論
理要素は、特徴群レジスタ２１の特定の特徴内容を指示
するアドレスとその特徴の値の＃Ｖ−谷範囲とからなり
、また各サブカテゴリの最終アドレスには、そのサブカ
テゴリが属するカテゴリ・フード、サブカテゴリ・コー
ドとそのカテゴリの標準字形に対する変形の少なさの程
度を表わす重み値とを格納しておく。Regarding the category ν to be read, the dictionary memory 40 stores
Each category is divided into subcategories according to handwriting deformation, and combinatorial logic for 1 determination is stored. In this embodiment, the combinatorial logic elements of each subcategory on the dictionary memory 40 are It consists of an address indicating the feature content and the #V-valley range of the value of that feature, and the final address of each subcategory contains the category hood to which the subcategory belongs, the subcategory code, and the variation of the standard glyph for that category. A weight value representing the degree of smallness is stored.

文字判定部３０は、辞書メモリ上の各サブカテゴリ毎に
１組合せ論理内容と特徴群レジスタ２１の内容とを煕合
し、論理条件を満足したサブカテゴリの属するカテゴリ
を候補カテゴリとする０本実施例においては、辞書メモ
リ４０における各サブカテゴリの格納順序として、カテ
ゴリ毎に標準字形に近いものから並べておき１文字判定
処理の際に該文字パタンの特徴群がカテゴリ毎に初めて
組合せ論理を満足したサブカテゴリのみをＮ択ｊるよう
に構成する。従って候補カテゴリ毎に１組のカテゴリ・
フード、サブカテゴリ・コード、重み値を候補カテゴリ
・レジスタ３１に格納する。In this embodiment, the character determination unit 30 matches one combinational logical content for each subcategory on the dictionary memory with the content of the feature group register 21, and selects the category to which the subcategory that satisfies the logical condition belongs as a candidate category. As the storage order of each subcategory in the dictionary memory 40, the characters are arranged in descending order of the standard shape for each category, and only the subcategories for which the feature group of the character pattern satisfies the combinatorial logic for the first time in each category are selected during the single character determination process. It is configured so that there are N choices. Therefore, one set of categories/categories for each candidate category.
The food, subcategory code, and weight value are stored in the candidate category register 31.

次に個別特徴群作成部５０は１字形の類似したカテゴリ
間に存在する種々の字形上の差異を記述した個別特徴を
抽出する。ここでは様々な類似文字であるカテゴリ対に
有効なものを一括して抽出し９個別特徴群として個別特
徴群レジスタ５１に格納する。Next, the individual feature group creation unit 50 extracts individual features that describe various differences in character shape that exist between categories of similar characters. Here, various similar characters that are valid for category pairs are extracted all at once and stored in the individual feature group register 51 as nine individual feature groups.

次に個別文字判定部６０では、該入力文字パタンに対す
る候補カテゴリ数を候補カテゴリ・レジスタ３１より受
けて、それが１カテゴリ以下の時そのカテゴリ・コード
を文字判定出力６１とする。Next, the individual character determination section 60 receives the number of candidate categories for the input character pattern from the candidate category register 31, and when the number is less than one category, the category code is set as the character determination output 61.

候補カテゴリ数が２力テゴリ以上の時、先ず候補カテゴ
リ・レジスタ３１にある各カテゴリに関する重み値の小
ざい（ＩＩ準字形に近いサブカテゴリに対応している）
ものを優先して採り上げ、その結果候補カテゴリ数が１
カテゴリならそのカテゴν・コードを文字判定出力６１
とし、ひき続いて２力テゴリ以上であれば以下の個別文
字判定処理・：・、、。When the number of candidate categories is more than 2 categories, first, the weight value for each category in the candidate category register 31 is determined to be small (corresponding to subcategories close to the subcategories II).
As a result, the number of candidate categories is 1.
If it is a category, output that category ν code for character judgment 61
Then, if the power category is 2 or above, the following individual character judgment processing is performed.

を行なう。Do this.

個別文字判定処理は、処理を単、純にするため。The purpose of individual character determination processing is to simplify the process.

本実施例では２つの候補カテゴリが得られた時に。In this embodiment, when two candidate categories are obtained.

個別辞書メモリ９０にある組合せ論理を基に、二者択一
するものとする。カテゴリ対テーブル７０゜サブカテゴ
リ対テーブル８０は、任意のカテゴリ対に対応する組合
せ論理を１個別辞書メモリ９０上で高速に検索するため
のマツピング・テーブルである。ただし、テーブル・メ
モリ量を大幅に削減できることから類似文字に相当しな
いカテゴリ対およびサブカテゴリ対は含ませない。It is assumed that a choice is made based on the combinatorial logic in the individual dictionary memory 90. The category pair table 70.subcategory pair table 80 is a mapping table for quickly searching one individual dictionary memory 90 for combinational logic corresponding to any category pair. However, category pairs and subcategory pairs that do not correspond to similar characters are not included because this can significantly reduce the amount of table memory.

先ず０個別文字判定部６０で処理を必要としたカテゴリ
対の内容・６２を基に、カテゴリ対テーブル７０を検索
する。同一のカテゴリ対の内容を格納したテーブル要素
があれば、そのテーブル要素に含まれたサブカテゴリ対
テーブル８０のアドレス指定情報７１を出力する。First, the category pair table 70 is searched based on the content 62 of the category pair that required processing in the 0 individual character determination unit 60. If there is a table element storing the contents of the same category pair, the addressing information 71 of the subcategory pair table 80 included in that table element is output.

次に、候補カテゴリ・レジスタ３１から個別文字判定部
６０を通して得たサブカテゴリ対の内容６３を基に、ア
ドレス指定情報７１で指示されたテーブル要素からサブ
カテゴリ対テーブル８０を検索する。同一のサブカテゴ
リ対の内容を格納したテーブル要素があれば、そのテー
ブル要素に含まれた個別辞書メそす９０のアドレス指定
情報８１を出力する６個別辞書メモリ９０における該サ
ブカテゴリ対に関する組合せ論理の格納開始アドレスは
、前記のアドレス指定情報８１によって得られるように
構成してお（１次に、順次０組合せ論理要素９１が個別
辞書メモリ９０より個別文字判定部６０に送られ０個別
特徴群レジスタ５１の内容と照合される。該サブカテゴ
リ対の最後の組合せ論理要素９１は終了フラグを盛り込
んだものとし１個別文字判定部６０は、これを受けるま
で照合処理を繰り返す、その結果、該サブカテゴリ対の
何れか一力のカテゴリが選択できた時は、そのカテゴリ
・コードを、そうでない時はりジェクト・コードを文字
判定結果６１として出力する。Next, based on the content 63 of the subcategory pair obtained from the candidate category register 31 through the individual character determination unit 60, the subcategory pair table 80 is searched from the table element designated by the addressing information 71. If there is a table element storing the contents of the same subcategory pair, outputting the addressing information 81 of the individual dictionary memory 90 included in that table element.6 Storing the combinational logic regarding the subcategory pair in the individual dictionary memory 90. The start address is configured to be obtained from the address designation information 81 (first, the 0 combination logical elements 91 are sequentially sent from the individual dictionary memory 90 to the individual character determination unit 60 and the 0 individual feature group register 51 The last combinational logic element 91 of the subcategory pair includes an end flag, and the 1 individual character determination unit 60 repeats the matching process until it receives this flag.As a result, any of the subcategory pairs When a category with a certain power can be selected, the category code is outputted as the character determination result 61, and when not, the eject code is outputted as the character determination result 61.

この様な構成になっているから１字形の類似した複数の
カテゴリが候補となった時に、入力文字パタンの手書き
変形の内容に応じたサブカテゴリを各カテゴリ毎に対応
づけること、各サブカテゴリ対を文字判定するための組
合せ論理を高速に検索すること、各サブカテゴリの標準
字形への近さ　　。Because of this structure, when multiple categories of similar character shapes are candidates, it is necessary to associate each category with a subcategory according to the content of the handwritten transformation of the input character pattern, and to assign each subcategory pair to the character A fast search for combinatorial logic to determine the proximity of each subcategory to a standard glyph.

に応じ優先して選択することが可能となり、類似文字を
個別に文字判定する際に細かな処理が実現できる。This makes it possible to prioritize and select similar characters depending on the character, and it is possible to realize detailed processing when determining similar characters individually.

第２図は１本実施例における辞書メモリの構成であって
、これを基に１つのサブカテゴリの内容を具体的に説明
する。４０は辞書メモ！ｊ、　　４０１゜４０２．４０
３は特徴群レジスタ２１上の特定の特徴を指示するアド
レス情報ＡＬ　Ａ％、Ａ％、　４１１゜４１２．４１３
はＡＬ　Ａ惰、Ａ％でアドレスが指示された特徴値の許
容条件Ｂｊ１．　Ｂｍ、　Ｂｓ、　４２０は上記の組合
せ論理に関するカテゴリ・コード−Ｃｔ。FIG. 2 shows the structure of the dictionary memory in this embodiment, and based on this, the contents of one subcategory will be specifically explained. 40 is a dictionary memo! j, 401°402.40
3 is address information indicating a specific feature on the feature group register 21. A%, A%, 411°412.413
is the permissible condition Bj1. of the feature value whose address is specified by AL A inertia, A%. Bm, Bs, 420 is the category code-Ct regarding the above combinational logic.

４３０は上記の組合せ論理に関するカテゴリ・Ｃｍ内の
ｊｔ目のサブカテゴリ・コード・ＳＢ、４４０は上記の
サブカテゴリの標準字形への近さを表わす重み値Ｗ（ｊ
である。430 is the jt-th subcategory code SB in the category Cm related to the above combinatorial logic, and 440 is the weight value W(j
It is.

辞書メモリ４０は、シーケンシャルなメモリ構造とし、
カテゴリ毎にそのカテゴリへ属す全すプカテゴνに関す
る情報を連続して格納する。第２図における組合せ論理
要素（ＡＩＢりから（Ｃｊ。The dictionary memory 40 has a sequential memory structure,
For each category, information regarding all Pcategos ν belonging to that category is continuously stored. The combinatorial logic element (AIB) in FIG.

’　Ｃｔｊ、Ｗｊ／）の間がサブカテゴリ・Ｓｔｊに関
する辞書情報である。' Ctj, Wj/) is dictionary information regarding the subcategory Stj.

これらの動作は１個々のサブカテゴリの照合処理が繰り
返される過程の中で１文字判定部３０からのアクセスに
よってなされる。These operations are performed by access from the single character determination unit 30 in the process of repeating the matching process for each subcategory.

先ず１文字判定部３０に組合せ論理要素（ＡＬＢＩＬ）
がとり込まれ６次に特徴群レジスタのアドレス八２の内
容と許容条件Ｂｆｆｉとを比較する。　この比較におい
て条件が満足されれば次の組合せ論理要素（Ａｓ　、　
Ｂｓ　）について同一処理を繰り返し。First, the combinational logic element (ALBIL) is input to the single character determination unit 30.
is taken in, and the contents of address 82 of the feature group register are compared with the allowable condition Bffi. If the condition is satisfied in this comparison, the next combinatorial logic element (As,
Repeat the same process for Bs).

満足されなければ該サブカテゴリの照合処理を終了して
次サブカテゴリの照合処理を行なう、最終の組合せ論理
要素（Ａ％、Ｂｍ）まで全ての条件が満足されれば０次
アドレスに格納された（Ｃ（。If not satisfied, the matching process for the subcategory is ended and the matching process for the next subcategory is performed.If all the conditions are satisfied up to the final combinational logical element (A%, Bm), it is stored at the 0th address (C (.

Ｓｉｊ、Ｗ（ｊ）を候補カテゴリ・レジスタ３１に送出
される。従って、単純な動作により８組合せ論理ヲ満足
したサブカテゴリの内容を簡単に検索することができる
。Sij, W(j) is sent to the candidate category register 31. Therefore, the contents of subcategories that satisfy the 8 combinatorial logics can be easily searched for by simple operations.

第３図は類似文字に関わるサブカテゴリの一例　　゛で
ある。カナの′チ′を中心に数字の１４１．カナのｒケ
ｌ、カナの′テ′との関係を示したもので。Figure 3 is an example of a subcategory related to similar characters. The number 141 is centered around the kana 'chi'. This shows the relationship between kana rkel and kana 'te'.

ＩチｌとＩ４１　　の間にはループ構造の有無によって
２種考えられる。これら手書き変形毎に字形を記述した
組合せ論理によってサブカテゴリを作成し、辞書メそす
４０に格納しておく。図には′チＩにチ１．チ雰、チ１
．チ４の４種　１４１に４ｔ　、　４ｍノ２１［、’ケ
Ｉのケｌ＋’テｌのテ、のサブカテゴリ例を示した。こ
れらの誉ブカテゴリが本実施例における儂別文字判定部
６０の処理対象となるものの一例であり、単にカテゴリ
・コードだけでなく。There are two possible types between Ichil and I41, depending on the presence or absence of a loop structure. A subcategory is created using combinatorial logic that describes the character shape for each of these handwritten transformations, and is stored in the dictionary meso 40. The figure shows ``Chi I to Chi 1.'' Chi atmosphere, Chi1
．． 4 types of Chi 4 141 shows examples of subcategories of 4t, 4mノ21[, 'KeI of KE+'TE of Te. These honorable categories are an example of what is processed by the separate character determination section 60 in this embodiment, and are not just category codes.

サブカテゴリ・フードの内容を導くことにより。By leading to the content of the subcategory food.

２つのカテゴリ間の字形上の差異に関する着眼点が明確
になる。The point of view regarding the difference in glyph shape between the two categories becomes clear.

倒えば（チ１．４ｔ）では端点ｌと端点２の上下関係、
ストロークｌの屈曲、（チ鵞、４ｇ）では端点３近傍の
方向、ストローク２の屈曲、（チ３．ケり。If it falls down (chi 1.4t), the vertical relationship between endpoint l and endpoint 2,
The bending of stroke l, (chi, 4g) is the direction near the end point 3, the bending of stroke 2, (chi 3. keri).

（チ４．テ１）では交叉点の様子である０本実施例では
０個別特徴群作成部５０でこれらの特徴を抽出しておき
個別文字判定処理における照合データとする。In (H4.TE1), the appearance of the intersection point is 0. In this embodiment, the 0 individual feature group creation unit 50 extracts these features and uses them as collation data in the individual character determination process.

第４図は１本実施例における候補カテゴリ・レジスタの
構成を示す。３１は候補カテゴリ・レジスタ、４２１，
４３１．４４１はそれぞれ辞書メモリから得たカテゴリ
・コード、サブカテゴリ・フード、重み値を格納するレ
ジスタ列であり、３１１゜３１２．３１３，３１４．３
１５は最大５種までの候補カナコリ情報を格納するレジ
スタ行である１図における格納例では、辞書メモリ上の
カテゴ’）ＣｌａＣｍ　、　Ｃｍの各々に関するサブカ
テゴリＳ　ｌ＊　Ｓ　＊　ｅＳ３の３通りの組合せ論理
を入力文字バタンの特徴群が満たしていた場合のもので
９重み値ｗｉｔＷ！　、　Ｗｓと併せて３種の候補カテ
ゴリ情報が格納されている。FIG. 4 shows the configuration of the candidate category register in this embodiment. 31 is a candidate category register, 421,
431.441 are register strings that store the category code, subcategory food, and weight value obtained from the dictionary memory, respectively, and are 311°, 312.313, and 314.3.
15 is a register line that stores up to five types of candidate Kanakori information. In the storage example in Figure 1, there are three combinational logics of subcategories S l * S * eS3 for each of the categories ')ClaCm and Cm on the dictionary memory. If the feature group of the input character slam satisfies the 9 weight value witW! , Ws as well as three types of candidate category information are stored.

第５図はサブカテゴリの重み値の説明図であって、標準
字形のサブカテゴリを０．混同確率の小さいす／カテゴ
リを１．混同確率の大きいサブカテゴリを２というよう
に５値の亀み値を用いた例である。上記の候補カテゴリ
情報が個別文字判定処理の入力情報となる。FIG. 5 is an explanatory diagram of the weight values of subcategories, and shows the subcategories of standard glyphs as 0. Set the category/category with the lowest probability of confusion to 1. This is an example in which a five-value scale value is used, such that the subcategory with a high probability of confusion is set to 2. The above candidate category information becomes input information for the individual character determination process.

第６図は１本実施例における個別辞書メそνの検索手順
の具体的な説明図である。６２は文字判定結束に関する
カテゴリ対情報（Ｃ＝、Ｃｊ）、６３は文字判定結果に
関するサブカテゴリ対情報（Ｓｊｍ。FIG. 6 is a concrete explanatory diagram of the search procedure for the individual dictionary menu ν in this embodiment. 62 is category pair information (C=, Cj) regarding character determination unity; 63 is subcategory pair information (Sjm) regarding character determination results.

５ｊｓ）、７０はカテゴリ対テーブル、７１０はカテゴ
リ対の格納列、７２０はアドレス指定情報の格納列、８
０はサブカテゴリ対テーブル、８１Ｏは境界を示す終了
フラグの格納列、８２０はサブカテゴリ対の格納列、８
３０はアドレス指定情報の格納列、９０は個別辞書メモ
！Ｊ、９１０は境界を示す終了フラグの格納列、９２０
は個別特徴群レジスタの特定要素を引き出すためのアド
レス格納列、９３０はカテゴリ対情報における前者のカ
ナゴ！ｊ（Ｃ？）の許容条件格納列、９４０はカテゴリ
対情報における後者のカテゴリＣＣ１）の許容条件格納
列、７１はサブカテゴリ対テーブル８０の検索開始アド
レス指定情報、８１は個別辞書メモν９０の検索開始ア
ドレス指定情報、９１は個別文字判定部６０へ送出され
る組合せ論理要素である。5js), 70 is a category pair table, 710 is a storage column for category pairs, 720 is a storage column for addressing information, 8
0 is a subcategory pair table, 81O is a storage column for an end flag indicating a boundary, 820 is a storage column for subcategory pairs, 8
30 is a storage column for addressing information, and 90 is an individual dictionary memo! J, 910 is a storage column of end flags indicating boundaries, 920
930 is the address storage column for extracting a specific element of the individual feature group register, and 930 is the former name in the category pair information! 940 is the permissible condition storage column for the latter category CC1) in the category pair information, 71 is the search start address designation information of the subcategory pair table 80, 81 is the search start for the individual dictionary memo ν90 Addressing information 91 is a combinational logic element sent to the individual character determination section 60.

これを動作するには、先ずカテゴリ対テーブル７０にお
いて、カテゴリ対情報６２と一致するテーブル要素を検
索する。一致するものが無い時は個別文字判定処理を行
なわない、もし一致するものが有る時には、オ６図のよ
うに同一行に格納しでおいたＡ（／をアドレス指定情報
７１として出力する。次にサブカテゴリ対テーブル８０
において。To operate this, first, the category pair table 70 is searched for a table element that matches the category pair information 62. If there is no match, no individual character determination processing is performed. If there is a match, the A(/ stored in the same line as shown in Figure 6) is output as address designation information 71.Next Table 80 vs. subcategories
In.

アドレス指定情報７１によるアドレスＡ　ｔ　ｊからす
プカナゴリ対情報６３と一致するテーブル要素を検索す
る。終了フラグＥｌｆであるアドレスＡ（ｊ＋％まで検
索して一致するものが無い時は個別文字判定処理を終了
する。A table element matching the address A t j based on the address designation information 71 and the pair information 63 is searched. If there is no match after searching up to address A (j+%), which is the end flag Elf, the individual character determination process ends.

オ６図では、アドレスＡ＜７＋ｚで一致しており。In Figure 6, there is a match at address A<7+z.

同一行に格納したｌ！に、ｔｊ＊を個別辞書メモリ９０
の検索開始アドレス８１として出力する。この後に該カ
テゴリ対の個々のカテゴリに対し１ｍ別辞書メモリ９０
の組合せ論理を基にした照合を行なう。l stored on the same line! , tj* is stored in the individual dictionary memory 90.
is output as the search start address 81. After this, a 1m separate dictionary memory 90 is provided for each category of the category pair.
Verification is performed based on combinatorial logic.

個別辞書メモリ９０のアドレスＡ　ｉ　ｊ　２から、終
了フラグがＩｌｌの７ドレスＡｉｊ２＋算’　までが該
カテゴリ対の組合せ論理である。この間の各組合せ　　
　゛論理を順次個別文字判定部６０へ送出する０組合せ
論理要素はＣＡＬ、　Ｂｉｂ、Ｂｊｋ、ただしｈ＝０．
１゜・・・、Ｓ′）の３種の情報からなり１個別特徴群
レジスタのアドレスＡ＆の個別特徴の値を基にＢｔｈ。The combinatorial logic of the category pair is from the address A i j 2 of the individual dictionary memory 90 to the 7 address A ij2 + arithmetic' where the end flag is Ill. Each combination between these
゛The zero combination logic elements that sequentially send the logic to the individual character determination unit 60 are CAL, Bib, Bjk, where h=0.
Bth based on the value of the individual feature at address A& of one individual feature group register.

Ｂｌｈの２つの許容条件に対する照合処理が個別文字判
定部６０で行なわれる。The individual character determination unit 60 performs a matching process for the two allowable conditions of Blh.

また、一つのカテゴリに属す複数のサブカテゴリを他の
カテゴリと区別する際に０字形の差異が共通しており１
個別特徴群を基にした組合せ論理が同じ内容となる場合
には、ｌｉＩ別辞別辞上メモリ規模えるために、以下の
動作が本実施例において可能な構成となっている。サブ
カテゴリ・テーブル８０のテーブル要素において、サブ
カテゴリ対格納列８２０に格納するサブカテゴリ・コー
ドの何れか、または双方を、対応するカテゴリに属する
すべてのサブカテゴリを一括して扱う場合に。In addition, when distinguishing multiple subcategories belonging to one category from other categories, there is a common difference in the 0 character shape and 1
When the combinational logic based on the individual feature groups has the same content, the following operations are possible in this embodiment in order to increase the memory size for different features. In the table element of the subcategory table 80, when one or both of the subcategory codes stored in the subcategory pair storage column 820 is used to collectively handle all subcategories belonging to the corresponding category.

例えばＮ　ｓ　Ｉｔ　４１フードとする８個別文字判定
処理では、サブカテゴリ対情報６３とサブカテゴリ・テ
ーブル８０の各テーブル要素のサブカテゴリ・コードと
の一致をとる時にＮ％ロフードが格納してあれば一致し
たものとみなし、　Ｎ、Ｑｎコードの欄に対応したカテ
ゴリに属すすべてのサブカテゴリに１個別辞書メモリ上
の共通した一つの組合せ論理で個別特徴群と照合処理を
行なう。For example, in the 8-individual character determination process for N s It 41 food, when matching the subcategory pair information 63 with the subcategory code of each table element of the subcategory table 80, if N%ro food is stored, a match is made. Then, all subcategories belonging to the category corresponding to the N and Qn code columns are compared with the individual feature group using one common combinational logic on one individual dictionary memory.

以上の如く一連の動作によって、任意のカテゴリ対が候
補カテゴリ対として発生した時に、それらの手書き変形
の内容に関するサブカテゴリ対の情報と、必要な情報だ
けで構成した少容量のマツピング・テーブルを基に１ｇ
Ａ別辞書メモリ′９０の関連する部分を高速かつ簡単に
検索することができる。Through the series of operations described above, when an arbitrary category pair is generated as a candidate category pair, a mapping table is created based on the information of the subcategory pair regarding the contents of the handwritten transformation and a small mapping table consisting only of the necessary information. 1g
Relevant portions of the A-separate dictionary memory '90 can be searched quickly and easily.

（５）　　効果の説明以上説明したように１本発明では、複数カテゴリが読取
結果として候補出力となった時に、入力文字の変形の様
子、即ちカテゴリ間のサブカテゴリの内容に応じ０字形
上の差異を記述した組合せ論理を高速かつ簡単に検索・
照合処理を行なうことが可能であるから、類似文字を個
別に文字判定する際に０両カテゴリ間の字形上の差異に
期する着眼点が明確になること９組合せ論理の検索と照
合処理からなる個別文字判定の処理時間が短いこと、な
どの利点がある。また、サブカテゴリ毎に標準字形への
字形上の近さに関する重み値を設けたことから、１！１
み値を比較することで、＃Ｉ４Ｉ４形に近いカテゴリを
簡単に取捨選択できる利点がある。(5) Description of effects As explained above, in the present invention, when multiple categories are output as candidates as a reading result, the deformation of the input character, that is, the difference in character shape depending on the content of subcategories between categories. You can quickly and easily search for combinational logic written in
Since it is possible to perform matching processing, it becomes clear to focus on differences in glyph shapes between categories when identifying similar characters individually. 9 Consists of combinatorial logic search and matching processing It has advantages such as short processing time for individual character determination. In addition, since we set a weight value for the closeness of the glyph to the standard glyph for each subcategory, we calculated a weight value of 1!1
By comparing the values, there is an advantage that categories close to the #I4I4 type can be easily selected.

特に、カナ、英字、数字などの多字種を混在させて読取
る文字読取装置において、類似文字の個別文字判定処理
を、単純な構成で、さほど処理時間を増加させることな
く実現でき、高精度な読取結果を導く利点がある。In particular, in character reading devices that read a mixture of multiple character types such as kana, alphabets, and numbers, it is possible to perform individual character determination processing for similar characters with a simple configuration and without significantly increasing processing time. It has the advantage of guiding the reading results.

[Brief explanation of the drawing]

第１図は本発明の実施例における文字読取装置の構成図
、Ｉｖ２図は本実施例における辞書メモリの構成図、第
３図は類字文字に関するサブカテゴリの例を示す説明図
、第４図は本実施例における候補カテゴリ・レジスタの
構成図、第５図はサブカテゴリの重み値に関する説明図
、第６図は個別辞書メモリ検索手順の説明図である。図中、ｌＯはバタン・メモリ、２０は特徴群作成部、３
０は文字判定部、４０は辞書メモリ、５０は個別特徴群
作成部、６０は個別文字判定部、７０はカテゴリ対テー
ブル、８０はサブカテゴリ対テーブル、９０は個別辞書
メモリ、１１は入力文字バタン、２１は特徴群レジスタ
、３１は候補カテゴリ・レジスタ、５１は個別特徴群レ
ジスタ、６１は文字判定出力、６２はカテゴリ対情報、
６３はサブカテゴリ対情報、７１．８１はアドレス指定
情報を示す。特許出願人　日本電信電話公社代理人弁理士　　森　１）　　　寛FIG. 1 is a block diagram of a character reading device according to an embodiment of the present invention, FIG. Iv2 is a block diagram of a dictionary memory in this embodiment, FIG. FIG. 5 is an explanatory diagram of the weight values of subcategories, and FIG. 6 is an explanatory diagram of the individual dictionary memory search procedure in this embodiment. In the figure, IO is a button memory, 20 is a feature group creation unit, 3
0 is a character determination unit, 40 is a dictionary memory, 50 is an individual feature group creation unit, 60 is an individual character determination unit, 70 is a category pair table, 80 is a subcategory pair table, 90 is an individual dictionary memory, 11 is an input character button, 21 is a feature group register, 31 is a candidate category register, 51 is an individual feature group register, 61 is a character judgment output, 62 is category pair information,
63 indicates subcategory pair information, and 71.81 indicates addressing information. Patent applicant Hiroshi Mori, patent attorney representing Nippon Telegraph and Telephone Public Corporation

Claims

[Scope of Claims] (Li) A character memory for storing an input character pattern; a feature group creation unit for extracting a feature group describing a character shape from the input characters stored in the pattern memory; A character comprising a dictionary memory that collects and stores subcategories made up of feature combination logic, and an individual feature group creation unit that extracts an individual feature group that describes the differences between similar categories of the I (tan memorica) shape. The reading device selects a subcategory ν whose feature group of the character pattern satisfies a certain combinatorial logic, and outputs the category code and subcategory code to which the subcategory belongs. A category pair table that collects and stores combinations of similar categories/foods with parts and 1 glyphs, and category pairs/foods with similar glyphs included in different categories.
A subcategory pair table that collects and stores code combinations, an individual dictionary memory that collects and stores combinatorial logic created for subcategory pair separation based on the individual feature group, and information on the subcategory pair table listed in #I. An individual character determination unit is provided for comparing the combinational logic obtained from the individual dictionary memory with the individual feature group of the character pattern. When there is a plurality of category codes obtained from the character determination section, the category codes extracted from the character pattern are extracted from the character pattern based on the information in the category pair table and subcategory pair table and the information obtained from the character determination section. A character reading device characterized in that a character is determined by separating subcategories between categories whose feature groups are satisfied by combinatorial logic in the individual dictionary memory. (Phantom) In the character reading device recited in claim E (1), the one-character determination unit stores a weight value representing the small amount of deformation from the S-subscript form at the final address of each subcategory on the dictionary memory. By doing so, the combinational logic in the dictionary memory is configured to select a subcategory νf that satisfies the feature group of the input character pattern, and simultaneously output the category code, subcategory code, and the weight value. A character reading device characterized in that the character determination unit is configured to output a category code having a smaller 0 weight value with priority based on the size difference of *power distribution values.