JPH02212990A

JPH02212990A - Character reader

Info

Publication number: JPH02212990A
Application number: JP1033157A
Authority: JP
Inventors: Katsumi Yaguchi; 矢口　克己
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1989-02-13
Filing date: 1989-02-13
Publication date: 1990-08-24

Abstract

PURPOSE:To execute character reading processing by using a dictionary corresponding to sub set and to easily change the contents of the dictionary by reading a standard pattern, with which collation is executed in correspondence to the sub set, at random from the dictionary in which the dictionaries of plural categories are stored without being overlapped. CONSTITUTION:In a standard pattern designation information storage part 16, standard pattern designation information are classified and stored for each sub set to designate the standard pattern of the specified category out of the plural standard patterns of the dictionary for large group classification processing stored in a large group classification dictionary memory 15. The plural standard patterns corresponding to a reading candidate character group, which is designated by character group designation information, are read from this storage part 16 based on the standard pattern designation information. Then, a character written to a slip is read by collating the read standard pattern with a character pattern which is the object of character reading processing. Thus, the number of the sub set dictionaries is increased without widely increasing the capacity of the dictionary and the maintenance of the sub set dictionary can be easily executed.

Description

【発明の詳細な説明】［発明の目的］（産業上の利用分野）本発明は、文字パターンと予め設定された標準パターン
との照合を行なうことによって文字を読取る文字読取装
置に関する。DETAILED DESCRIPTION OF THE INVENTION [Object of the Invention] (Industrial Application Field) The present invention relates to a character reading device that reads characters by comparing character patterns with preset standard patterns.

（従来の技術）一般に、文字読取装置において文字読取処理が行なわれ
る帳票は、文字記入位置を示す複数の読取フィールドが
設けられている。そして、文字読取装置には、帳票に設
けられた各読取フィールドに応じて、記入される文字の
文字種等から成る特定の群を指定する情報（以下、サブ
セットと称する）が文字読取処理が行なわれる前に予め
設定される。文字読取装置は、サブセットに応じた専用
の辞書（以下、サブセット辞書と称する）を用いて文字
読取処理を実行する。(Prior Art) Generally, a form on which character reading processing is performed by a character reading device is provided with a plurality of reading fields indicating positions where characters are written. Then, the character reading device performs a character reading process on information specifying a specific group consisting of the character type of the characters to be written (hereinafter referred to as a subset) according to each reading field provided in the form. preset before. The character reading device executes character reading processing using a dedicated dictionary according to the subset (hereinafter referred to as a subset dictionary).

例えば、文字読取処理が行なわれる帳票の読取フィール
ドが、金額欄等のように数字のみが記入されることが指
定されている場合、数字を指定するサブセットを予め設
定することにより、数字に関する標準パターンが格納さ
れたサブセット辞書を用いて文字読取処理が行なわれる
。また、読取フィールドが、フリガナ欄のようにカタカ
ナのみが記入されることが指定されている場合、カタカ
ナを指定するサブセットを予め設定することにより、カ
タカナに関する標準パターンが格納されたサブセット辞
書を用いて文字読取処理が行なわれる。For example, if the reading field of a form where character reading processing is performed is specified to contain only numbers, such as an amount field, by setting a subset that specifies numbers in advance, a standard pattern for numbers can be created. Character reading processing is performed using the subset dictionary in which the characters are stored. In addition, if the reading field is specified to be filled in only katakana, such as in the furigana column, by setting a subset that specifies katakana in advance, it is possible to use a subset dictionary that stores standard patterns related to katakana. Character reading processing is performed.

また、サブセットは、文字種を指定するものに限らず、
漢字が記入される読取フィールド等については、記入内
容に応じた特定の群を指定することができるようにして
いる。例えば、読取フィールドが、氏名欄や住所欄等の
ように漢字が記入されることが指定されている場合、氏
名欄については氏名によく用いられる漢字用に設計され
たサブセット辞書、住所欄については住所によく用いら
れる文字用に設計された辞書を指定するそれぞれのサブ
セットを予め設定している。In addition, subsets are not limited to those that specify character types;
Regarding the reading field where kanji are written, it is possible to specify a specific group according to the content of the entry. For example, if a reading field is specified to be filled with kanji, such as a name field or an address field, a subset dictionary designed for the kanji characters often used in names will be used for the name field, and a subset dictionary designed for the kanji characters often used in names will be used for the name field, and a dictionary for the address field will be used for the name field. Each subset is preconfigured to specify a dictionary designed for characters commonly used in addresses.

これは、漢字の字数が非常に多いために（ＪＩＳ−第１
水準だけでも約２０００文字以上）、文字読取処理の処
理速度、読取精度等を向上させるために読取フィールド
に記入される内容に応じた特定の群に関するサブセット
辞書を用いるものである。This is because there are so many kanji characters (JIS-1
In order to improve the processing speed, reading accuracy, etc. of character reading processing, a subset dictionary is used for a specific group according to the content entered in the reading field.

（発明が解決しようとする課題）ところが、各読取フィールドに記入される内容に応じた
サブセット辞書を構成すると、各サブセット辞書におい
て重複した辞書情報が含まれることがある。この具体的
な例を説明する。金融機関（銀行）の振込み依頼書を処
理対象とする帳票として読取りを行なう場合には、銀行
名、及び支店名に関する辞書が用いられている。ここで
、銀行名読取用、及び支店名読取用の辞書にそれぞれ７
００文字、１５００文字の辞書容量が必要であると、全
サブセット辞書の辞書容量は合計２２００文字程文字種
必要となる。通常、銀行名と支店名に共通して用いられ
る文字は多数あることから、全サブセット辞書中には、
２２００文字の内、約４００文字程度の重複した辞書情
報が存在している。(Problems to be Solved by the Invention) However, if subset dictionaries are configured according to the contents entered in each reading field, duplicate dictionary information may be included in each subset dictionary. A specific example of this will be explained. When reading a transfer request form from a financial institution (bank) as a form to be processed, a dictionary regarding bank names and branch names is used. Here, the dictionaries for reading the bank name and for reading the branch name each contain 7.
If a dictionary capacity of 00 characters and 1500 characters is required, the total dictionary capacity of all subset dictionaries will be about 2200 character types. Since there are usually many characters commonly used in bank names and branch names, all subset dictionaries include
There is about 400 duplicate dictionary information out of 2200 characters.

このように、漢字の読取処理を行なう場合、各読取フィ
ールドに対応するサブセット辞書を構成すると、共通す
る辞書情報が複数のサブセット辞書に重複して格納され
ることがあった。このため、複数のサブセット辞書を扱
う場合、辞書容量が大幅に増加するという問題があった
。また、ある文字に関する辞書情報の内容を変更する場
合には、変更される文字の辞書情報を含む全てのサブセ
ット辞書を変更する必要があった。In this way, when performing the reading process of Chinese characters, if subset dictionaries are configured to correspond to each reading field, common dictionary information may be stored redundantly in a plurality of subset dictionaries. For this reason, when dealing with a plurality of subset dictionaries, there is a problem in that the dictionary capacity increases significantly. Furthermore, when changing the contents of dictionary information regarding a certain character, it is necessary to change all subset dictionaries that include the dictionary information of the character to be changed.

本発明は前記のような点に鑑みてなされたもので、辞書
容量を大幅に増加させることなくサブセット辞書数を増
加し、またサブセット辞書のメインテナンスを容易に行
なうことが可能な文字読取装置を提供することを目的と
する。The present invention has been made in view of the above points, and provides a character reading device that can increase the number of subset dictionaries without significantly increasing the dictionary capacity, and can easily maintain the subset dictionaries. The purpose is to

［発明の構成］（課題を解決するための手段）本発明は、処理対象とする文字パターンとの照合が行な
われる標準パターンが重複することなく格納された標準
パターン格納手段を用い、同格納手段から文字グループ
指定情報により指定された読取候補文字群に対応する複
数の標準パターンを、文字グループ指定情報によって指
定される文字グループ毎に分類された特定の標準パター
ンを指定する標準パターン指定情報に基づいて読出し、
読出された標準パターンと文字読取処理の対象とする文
字パターンとの照合を行なうことによって帳票に記入さ
れた文字を読取るように構成するものである。[Structure of the Invention] (Means for Solving the Problems) The present invention uses standard pattern storage means in which standard patterns to be compared with character patterns to be processed are stored without duplication. Based on the standard pattern specification information that specifies a specific standard pattern classified into each character group specified by the character group specification information, multiple standard patterns corresponding to the reading candidate character group specified by the character group specification information are specified. and read it out,
The system is configured to read characters written on a form by comparing the read standard pattern with the character pattern to be subjected to character reading processing.

（作用）このようにして構成される文字読取装置においては、文
字グループ指定情報によって指定される複数の標準パタ
ーンが、文字グループ指定情報によって指定される文字
グループ毎に分類された標準パターン指定情報に基づい
て読出される。このため、一部の文字が重複することの
多い文字グループ毎に標準パターン辞書（サブセット辞
書）を用意せずに、同一の標準パターンが重複すること
なく格納された唯一の辞書（標準パターン格納手段）を
用いるようにしても何等間迩とはならない。(Operation) In the character reading device configured in this manner, a plurality of standard patterns specified by the character group specification information are classified into standard pattern specification information classified by character group specified by the character group specification information. It is read out based on the For this reason, instead of preparing a standard pattern dictionary (subset dictionary) for each character group in which some characters often overlap, the only dictionary that stores the same standard pattern without duplication (standard pattern storage means ) is used, it will not be a change in any way.

（実施例）以下、図面を参照して本発明の一実施例を説明する。第
１図は同実施例に係わる文字読取装置の文字読取処理を
行なう部分の構成を示すブロック図である。同図におい
て、読取制御部ｉｔは、装置を構成する各部の制御を司
り、文字読取処理を実行する。主記憶メモリ１２には、
読取制御部１１１；おいて実行される文字読取処理の処
理プログラム、各種情報等が格納される。読取制御部１
１は、主記憶メモリ１２に格納された処理プログラムを
読出し、これに従って処理を実行する。大分類標本化メ
モリ１３は、入力文字パターンについて前処理、標本化
等を行なうことによって得られた文字読取処理の対象と
なる標準パターンを格納するものである。(Example) Hereinafter, an example of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a portion of a character reading device that performs character reading processing according to the same embodiment. In the figure, a reading control unit it controls the various parts constituting the device and executes character reading processing. In the main memory 12,
A processing program for character reading processing executed in the reading control unit 111, various information, etc. are stored. Reading control section 1
1 reads a processing program stored in the main memory 12 and executes processing according to the program. The major classification sampling memory 13 stores standard patterns to be subjected to character reading processing, which are obtained by performing preprocessing, sampling, etc. on input character patterns.

大分類部１４は、後述する大分類辞書メモリ１５に格納
された辞書と大分類標本化メモリ１３に格納された標本
化パターンとの照合によって、標本化パターンに類似し
た個別認識処理の対象となる候補文字を選択するための
大分類処理を行なう。大分類辞書メモリ１５は、大分類
処理用の辞書を格納している。大分類辞書メモリ１５に
格納される大分類処理用の辞書は、前記標本化パターン
との照合が行なわれる文字の類（以下、カテゴリと称す
る）毎に設計された標準パターンが重複することなく格
納されている。標準パターン指定情報格納部１Ｂは、大
分類辞書メモリ１５に格納された大分類処理用の辞書の
複数の標準パターンから、特定のカテゴリの標準パター
ンを指定する標準パターン指定情報がサブセット毎に分
類されて格納されている。個別認識標本化メモリ１７は
、大分類処理が終了した大分類標本化メモリ１３に格納
された標本化パターンを、大分類処理が終了した後に行
なわれる個別認識処理の対象として格納するものである
。個別認識部１ｇは、後述する個別認識辞書メモリ１９
に格納された辞書と個別認識標本化メモリ１７に格納さ
れた標本化パターンとの照合によって、標本化パターン
が示している１つの文字を確定するための個別認識処理
を行なう。個別認識辞書メモリ１９は、個別認識処理用
の辞書が格納されており、後述する大分類候補リストメ
モリ２０に格納された内容に基づいて読出される。大分
類候補リストメモリ２０は、大分類処理によって得られ
た個別認識処理の候補文字に対応するカテゴリの標準パ
ターンを個別認識辞書メモリ１９から指定する情報を格
納するものであり、大分類処理の処理結果に応じて読取
制御部１１によって書込まれる。The major classification unit 14 performs individual recognition processing similar to the sampling pattern by comparing the dictionary stored in the major classification dictionary memory 15 to be described later with the sampling pattern stored in the major classification sampling memory 13. Performs major classification processing to select candidate characters. The major classification dictionary memory 15 stores a dictionary for major classification processing. The dictionary for major classification processing stored in the major classification dictionary memory 15 stores standard patterns designed for each type of character (hereinafter referred to as a category) to be compared with the sampling pattern without duplication. has been done. The standard pattern designation information storage unit 1B classifies standard pattern designation information for designating a standard pattern of a specific category into subsets from among the plurality of standard patterns of the dictionary for major classification processing stored in the major classification dictionary memory 15. is stored. The individual recognition sampling memory 17 stores the sampling pattern stored in the major classification sampling memory 13 for which the major classification process has been completed, as a target of the individual recognition process to be performed after the major classification process has been completed. The individual recognition unit 1g includes an individual recognition dictionary memory 19, which will be described later.
By comparing the dictionary stored in the memory 17 with the sampling pattern stored in the individual recognition sampling memory 17, individual recognition processing is performed to determine one character indicated by the sampling pattern. The individual recognition dictionary memory 19 stores a dictionary for individual recognition processing, and is read out based on the contents stored in the major classification candidate list memory 20, which will be described later. The major classification candidate list memory 20 stores information for specifying, from the individual recognition dictionary memory 19, the standard pattern of the category corresponding to the candidate character for the individual recognition process obtained by the major classification process. It is written by the reading control unit 11 according to the result.

第２図は、第１図に示す大分類標本化メモリ１３、大分
類部１４、大分類辞書メモリ１５、及び標準パターン指
定情報格納部１８の関係を示す図である。大分類標本化
メモリ１３の標本化パターン格納部３１は、標本化パタ
ーンデータを格納するものであり、アドレスカウンタ３
２の値に応じて大分類ｉ１４に読出される。大分類辞書
メモリＩ５の標準パターン格納部３３は、処理対象とす
る文字の標準パターンデータを格納しており、アドレス
カウンタ３４の値に応じて大分類部１４に読出される。FIG. 2 is a diagram showing the relationship among the major classification sampling memory 13, the major classification section 14, the major classification dictionary memory 15, and the standard pattern designation information storage section 18 shown in FIG. The sampling pattern storage unit 31 of the major classification sampling memory 13 stores sampling pattern data, and
According to the value of 2, the main classification i14 is read out. The standard pattern storage section 33 of the major classification dictionary memory I5 stores standard pattern data of characters to be processed, and is read out to the major classification section 14 according to the value of the address counter 34.

この標準パターン格納部３３の詳細な構成を第４図に示
している。アドレスカウンタ３２．３４は、１カテゴリ
中の１文字分の標準パターンとの照合が終了した際に大
分類部１４から出力される次元クロックによってインク
リメントされる。標準パターン指定情報格納部１６のサ
ブセットテーブルメモリ３５は、標準パターン格納部３
３に格納された特定のカテゴリの標準パターンデータの
先頭アドレス（標準パターン指定情報）が、サブセット
に応じて格納されている。このサブセットテーブルメモ
リ３５の詳細な構成を第３図に示している。アドレスカ
ウンタ３２．３４．３６は、大分類部１４から１力テゴ
リ分の標準パターンとの照合が終了した際に出力される
文字終了クロックを入力する。アドレスカウンタ３Ｂは
、読取制御部１１からのロード信号、アドレス信号を入
力し、各信号に応じてサブセットテーブルメモリ３５に
格納されたアドレスデータを指定する。The detailed configuration of this standard pattern storage section 33 is shown in FIG. The address counters 32 and 34 are incremented by the dimensional clock output from the major classification unit 14 when the comparison with the standard pattern for one character in one category is completed. The subset table memory 35 of the standard pattern designation information storage section 16 is connected to the standard pattern storage section 3.
The start address (standard pattern designation information) of the standard pattern data of a specific category stored in No. 3 is stored according to the subset. The detailed structure of this subset table memory 35 is shown in FIG. The address counters 32, 34, and 36 receive the character end clock output from the major classification section 14 when the comparison with the standard pattern for one category is completed. The address counter 3B receives a load signal and an address signal from the reading control section 11, and specifies address data stored in the subset table memory 35 in accordance with each signal.

次に、同実施例の動作を説明する。Next, the operation of this embodiment will be explained.

はじめに、第３図に示すサブセットテーブルメモリ３５
と第４図に示す標準パターン格納部３３との関係につい
て説明する。標準パターン格納部３８は、各カテゴリ毎
の標準パターンデータが、重複することなく格納されて
いる。また、各カテゴリの辞書には、カテゴリ中に含ま
れる複数の文字分の標準パターンデータが格納されてい
る。すなわち、形状の異なる複数の同一文字について、
それぞれ標準パターンが設計されているものである。各
カテゴリの辞書（標準パターン）は、その先頭アドレス
（“ａａａａ　　ｂｂｂｂ”・・・）よって指定される
ものである。サブセットテーブルメモリ３５は、標準パ
ターン格納部３３に格納された各カテゴリの辞書を指定
するアドレス（１１準パターン指定情報）が、サブセッ
ト毎に分類されて格納されている。また、サブセット毎
に分類されたアドレスの集合は、その先頭アドレス（“
ＡＡＡＡ”　　“ＢＢＢＢ”　）によって指定されるも
のである。なお、サブセットテーブルメモリ３５のアド
レス“ＡＡＡＡ’から始まる領域には、サブセットによ
って指定される「色」に関する漢字の標準パターンが格
納された辞書を指定するアドレスが格納されており、ア
ドレス“ＢＢＢＢ”から始まる領域には、サブセットに
よって指定される「曜日」に関する漢字の標準パターン
が格納された辞書を指定するアドレスが格納されている
ものとする。First, the subset table memory 35 shown in FIG.
The relationship between this and the standard pattern storage section 33 shown in FIG. 4 will be explained. The standard pattern storage section 38 stores standard pattern data for each category without duplication. Further, the dictionary for each category stores standard pattern data for a plurality of characters included in the category. In other words, for multiple identical characters with different shapes,
Standard patterns are designed for each. The dictionary (standard pattern) for each category is specified by its start address ("aaaa bbbb"...). The subset table memory 35 stores addresses (11 quasi-pattern designation information) that designate dictionaries for each category stored in the standard pattern storage section 33, classified for each subset. In addition, the set of addresses classified into each subset is the first address (“
``AAAA'', ``BBBB''). Furthermore, in the area starting from the address ``AAAA'' of the subset table memory 35, a dictionary storing standard patterns of kanji related to the ``color'' specified by the subset is stored. It is assumed that an address to be specified is stored, and an area starting from the address "BBBB" stores an address that designates a dictionary storing standard patterns of Kanji characters related to "day of the week" designated by the subset.

まず、帳票に記入された文字の読取処理を行なう前に、
予め各読取フィールドに記入される文字の群がサブセッ
トによりそれぞれ指定される。ここでは、ある読取フィ
ールドについて「色」に関する漢字が指定されたとする
。読取制御部工１は、指定されたサブセットに応じて標
準パターン指定情報格納部１８のサブセットテーブルメ
モリ３５に、第３図に示すように、標準パターン格納部
３３の特定のカテゴリの辞書を指定するアドレスを設定
する。First, before reading the characters written on the form,
A group of characters to be written in each reading field is specified in advance by a subset. Here, it is assumed that a kanji related to "color" is specified for a certain reading field. The reading control unit 1 specifies a dictionary of a specific category in the standard pattern storage unit 33, as shown in FIG. 3, in the subset table memory 35 of the standard pattern designation information storage unit 18 according to the specified subset. Set address.

文字読取処理が起動されると、図示してない走査機構部
において例えば帳票を光学的に走査されて得られた帳票
イメージから、読取フィールドに記入された文字の文字
パターンが検出される。検出された文字パターンは、読
取制御部１１によって前処理、標本化されて大分類標本
化メモリ１３に格納される。大分類標本化メモリ１８の
標本化パターン格納部８１に標本化パターンが格納され
ると、この標本化パターンについて大分類処理が行なわ
れる。この大分類処理について第２図を参照しながら説
明する。読取制御部１１は、処理対象とする文字パター
ンが記入されていた読取フィールドに対して指定されて
いたサブセットに対応するサブセットテーブルメモリ３
５のアドレスを示すアドレス信号（ここでは°ＡＡＡＡ
“を示すものとする）、及びロード信号を標準パターン
指定情報格納部ＩＢに出力する。アドレスカウンタ３Ｂ
に読取制御部１１からの各信号が人力されると、サブセ
ットテーブルメモリ３５の“ＡＡＡＡ”が示す領域に格
納された辞書を指定する　ａａａａ“が読出され、大分
類辞書メモＩノ１５のアドレスカウンタ３４にロードさ
れる。アドレスカウンタ３４にアドレス　ａａａａ”が
ロードされると、このアドレス“ａａａａ”で指定され
ると、標準パターン格納部３３の漢字赤°のカテゴリの
辞書が大分類部１４に読出される。また、読取制御部０
は、標本化パターン格納部３１に格納された標本化パタ
ーンデータを大分類部Ｉ４に読出させる。大分類部１４
は、標準パターン格納部３３から読出された標準パター
ンデータと標本化パターン格納部３１からの標本化パタ
ーンデータとの照合を行なう。When the character reading process is started, the character pattern of the characters written in the reading field is detected from the form image obtained by optically scanning the form, for example, in a scanning mechanism section (not shown). The detected character patterns are preprocessed and sampled by the reading control unit 11 and stored in the major classification sampling memory 13. When a sampling pattern is stored in the sampling pattern storage section 81 of the major classification sampling memory 18, a major classification process is performed on this sampling pattern. This major classification process will be explained with reference to FIG. The reading control unit 11 reads the subset table memory 3 corresponding to the subset specified for the reading field in which the character pattern to be processed is written.
Address signal indicating address 5 (here, °AAAA
”) and a load signal to the standard pattern designation information storage unit IB.Address counter 3B
When each signal from the reading control unit 11 is input manually, "aaaa" which specifies the dictionary stored in the area indicated by "AAAA" of the subset table memory 35 is read out, and the address counter of the major dictionary memo I No. 15 is read out. 34. When the address "aaaa" is loaded into the address counter 34, when the address "aaaa" is specified, the dictionary of the kanji red category in the standard pattern storage section 33 is read out to the major classification section 14. be done. In addition, the reading control unit 0
causes the major classification unit I4 to read out the sampling pattern data stored in the sampling pattern storage unit 31. Major classification section 14
compares the standard pattern data read from the standard pattern storage section 33 with the sampling pattern data from the sampling pattern storage section 31.

大分類部１４は、照合を行なうことによって得られた読
取結果（標準パターンデータとの類似度値）を読取＃Ａ
ｓ部１１に出力する。大分類部Ｉ４は、“赤”のカテゴ
リ中の１文字の標準パターンとの照合が終了すると、ア
ドレスカウンタ３２．３４に対して次元クロックを出力
する。これによって各カウンタ３２、３４の値がインク
リメントされ、標本化パターン格納部３１．及び標準パ
ターン格納部３３に格納された次アドレスのデータが大
分類部１４に読出されて処理が行なわれる。すなわち、
標準パターン格納部３３の“赤°のカテゴリ中の他の特
徴を示す標準パターンデータが順次読出されて処理が行
なわれる。こうして、“赤１のカテゴリの辞書について
処理が終了すると、大分類部１４はアドレスカウンタ３
２．３４．３５に対して文字終了クロックを出力する。The major classification unit 14 reads the reading result (similarity value with the standard pattern data) obtained by performing the matching #A
It is output to the s section 11. When the major classification unit I4 finishes matching one character in the "red" category with the standard pattern, it outputs a dimensional clock to the address counters 32 and 34. As a result, the values of each counter 32, 34 are incremented, and the sampling pattern storage section 31. The next address data stored in the standard pattern storage section 33 is then read out to the major classification section 14 and processed. That is,
The standard pattern data indicating other features in the "Red ° category" in the standard pattern storage section 33 are sequentially read out and processed. In this way, when the processing for the "Red 1" category dictionary is completed, the main classification section 14 is address counter 3
Output character end clock for 2.34.35.

これにより、アドレスカウンタ３Ｂはインクリメントさ
れ、”ＡＡＡＡ＝の次アドレスの領域に格納された’　
ｂｂｂｂ’を指定する。そして、アドレスカウンタ３４
には、文字終了クロックが入力されたことによって“ｂ
ｂｂｂ”がロードされる。また、アドレスカウンタ３２
は、文字終了クロックによってクリアされる。As a result, the address counter 3B is incremented and "stored in the area of the next address of AAAA="
Specify bbbb'. And address counter 34
“b” is input by inputting the character end clock.
bbb” is loaded. Also, the address counter 32
is cleared by the character end clock.

次に、アドレスカウンタ３４に“青２のカテゴリの辞書
の先頭アドレス”　ｂｂｂｂ’がロードされると、前記
で説明したようにして、大分類部１４は、青。Next, when the address counter 34 is loaded with "the first address of the dictionary for the blue 2 category"bbbb', the major classification unit 14 selects the blue category as described above.

のカテゴリ中の標準パターンと標本化パターンとの照合
を順次行ない、読取結果を読取制御部ｔｉに出力する。The standard pattern in the category and the sampling pattern are sequentially compared, and the reading result is output to the reading control unit ti.

こうしして、サブセットによって指定される「色」に関
する漢字の標準パターンとの照合（°白°　“黄”水゛
　°緑゛・・・）が終了すると、読取制御部１１は、大
分類部１４からの読取結果に基づいて、標本化パターン
と類似度が所定の値より高い個別認識処理の対象となる
カテゴリ（候補文字）を選択する。読取制御部１１は、
個別認識辞書メモリ１９に格納された各候補文字に対応
する個別認識処理用の辞書（各カテゴリの標準パターン
）を示す先頭アドレスを大分類候補リストメモリ２０に
書込む。また、個別認識標本化メモリ１７に個別認識処
理用の標本化パターンが格納される。個別認識処理は、
大分類候補リストメモリ２０に格納された特定のカテゴ
リの辞書を指定するアドレスに基づいて読出された標準
パターンと、個別認識標本化メモリ１７に格納されただ
標本化パターンとの照合が個別認識部１Ｂにおいて行な
われる。個別認識部１８は、個別認識処理の処理結果を
読取制御部１１に出力する。読取制御部１１は、個別認
識部１８からの処理結果から類似度の最も高いものを文
字読取結果として確定する。In this way, when the comparison with the standard pattern of kanji related to "color" specified by the subset (°white° "yellow" water "°green"...) is completed, the reading control section 11 starts the main classification section. Based on the reading results from 14, categories (candidate characters) to be subjected to individual recognition processing whose similarity to the sampling pattern is higher than a predetermined value are selected. The reading control unit 11
The starting address indicating the dictionary for individual recognition processing (standard pattern for each category) corresponding to each candidate character stored in the individual recognition dictionary memory 19 is written into the major classification candidate list memory 20. Further, a sampling pattern for individual recognition processing is stored in the individual recognition sampling memory 17. Individual recognition processing is
The individual recognition unit 1B compares the standard pattern read out based on the address specifying the dictionary of a specific category stored in the major classification candidate list memory 20 with the sampling pattern stored in the individual recognition sampling memory 17. It will be held in The individual recognition unit 18 outputs the processing result of the individual recognition process to the reading control unit 11. The reading control unit 11 determines the one with the highest degree of similarity from the processing results from the individual recognition unit 18 as the character reading result.

ここで、ある読取フィールドについて「曜日」に関する
漢字がサブセットとして設定された場合について説明す
る。この場合、読取制御部１１は、標準パターン指定情
報格納部１Ｂに、第３図に示すように、先頭アドレス“
ＢＢＢＢ”から始まる領域に「曜日」に関するカテゴリ
の辞書を指定するアドレス“ＰＰＰＰ″　ｑｑｑｑ　　
・・・を設定する。この設定されたアドレスには、サブ
セットによって指定される「色」と「曜日」に関する漢
字の群に共通する「水」という文字の辞書を指定するア
ドレスｅｅｅｅ″が含まれている。これによって、標準
パターン格納部３３に格納されている各カテゴリの辞書
から、“水°の辞書がランダムに読出される。Here, a case will be described in which kanji related to "day of the week" are set as a subset for a certain reading field. In this case, the reading control section 11 stores the starting address "" in the standard pattern designation information storage section 1B as shown in FIG.
Address “PPPP” qqqqq that specifies a dictionary of categories related to “day of the week” in the area starting with “BBBB”
Set... This set address includes the address eeee'' that specifies the dictionary of the character ``water'' that is common to the group of kanji related to ``color'' and ``day of the week'' specified by the subset. From the dictionaries of each category stored in the pattern storage section 33, a dictionary of "water" is read out at random.

すなわち、第４図に示すアドレス　ｅｅｅｅ“の領域に
格納された“水”の辞書は、サブセットテーブルメモリ
３５のサブセット「色」及び「曜日」に対応する領域（
先頭アドレス’　ＡＡＡＡ’　　°ＢＢＢＢ”からの各
領域）にそれぞれ格納された　ｅｅｅｅ”によって共通
して読出される。このため、標準パターン格納部３３に
は、サブセットによって指定される複数の群に共通する
文字の辞書が複数設定される必要がない。That is, the dictionary for "water" stored in the area of address "eeee" shown in FIG.
eeee" stored in each area from the start address 'AAAA'°BBBB". Therefore, it is not necessary to set a plurality of dictionaries of characters common to a plurality of groups specified by the subset in the standard pattern storage section 33.

なお、前記実施例においては、大分類処理を行なった後
に詳細な個別認識処理を行なう２段階の構成としたが、
１段階で鹸終的な個別認識を行なうようにしても良い。Note that in the above embodiment, a two-stage configuration was used in which detailed individual recognition processing was performed after performing major classification processing.
Final individual recognition may be performed in one step.

［発明の効果］以上のように本発明によれば、複数のカテゴリの辞書が
重複することなく格納された辞書から、サブセットに応
じて特定のカテゴリの辞書を指定する情報が設定される
サブセットテーブルメモリの内容に基づいて、照合が行
なわれる標準パターンをランダムに読出すようにしたの
で、辞書の容量を増加させることなくサブセットに応じ
た辞書を用いて文字読取処理を行なうことができる。ま
た、ある文字の辞書の内容を変更等が必要な場合であっ
ても、変更対象とする辞書のみを訂正するだけで良いた
め、容易に行なうことが可能となるものである。[Effects of the Invention] As described above, according to the present invention, a subset table is created in which information specifying a dictionary of a specific category is set according to a subset from a dictionary in which dictionaries of a plurality of categories are stored without duplication. Since standard patterns to be checked are read out at random based on the contents of the memory, character reading processing can be performed using a dictionary corresponding to a subset without increasing the capacity of the dictionary. Furthermore, even if it is necessary to change the contents of a dictionary for a certain character, it is possible to do so easily because it is only necessary to correct the dictionary to be changed.

[Brief explanation of the drawing]

第１図は本発明の一実施例に係わる文字読取装置の文字
読取処理を行なう部分の構成を示すブロック図、第２図
は第１図に示す構成の一部の詳細を示すブロック図、第
３図はサブセットテーブルメモリに格納された内容を示
す図、第４図は標準パターン格納部の一例を示す図であ
る。１１・・・読取制御部、１２・・・主記憶メモリ、１３
・・・大分類標本化メモリ、１４・・・大分類部、１５
・・・大分類辞書メモリ、ＩＢ・・・標準パターン指定
情報格納部、１７・・・個別認識標本化メモリ、１８・
・・個別認識部、１９・・・個別認識辞書メモリ、２０
・・・大分類候補リストメモリ、３１・・・標本化パタ
ーン格納部、３３・・・標準パターン格納部、３５・・
・サブセットテーブルメモリ。FIG. 1 is a block diagram showing the configuration of a part that performs character reading processing of a character reading device according to an embodiment of the present invention, FIG. 2 is a block diagram showing details of a part of the configuration shown in FIG. 1, and FIG. FIG. 3 is a diagram showing the contents stored in the subset table memory, and FIG. 4 is a diagram showing an example of the standard pattern storage section. 11...Reading control unit, 12...Main memory, 13
...Large classification sampling memory, 14...Large classification section, 15
...Main classification dictionary memory, IB...Standard pattern designation information storage section, 17...Individual recognition sampling memory, 18.
...Individual recognition unit, 19...Individual recognition dictionary memory, 20
... Major classification candidate list memory, 31... Sampling pattern storage section, 33... Standard pattern storage section, 35...
- Subset table memory.

Claims

[Scope of Claims] A preset character for specifying a character written in a form to be subjected to character reading processing as one of a plurality of character groups consisting of a character pattern of the character and a reading candidate character group. In a character reading device that reads by comparing group designation information with a plurality of standard patterns, the standard pattern storage means stores the standard patterns without duplication, and the identification information stored in the standard pattern storage means standard pattern designation information storage means storing standard pattern designation information that designates standard patterns for each of the character groups; and a plurality of standards corresponding to the reading candidate character groups designated by the character group designation information. standard pattern reading means for reading out a pattern from the standard pattern storage means based on the standard pattern designation information stored in the standard pattern designation information storage means; A character reading device that reads characters written on the form by comparing the pattern with a character pattern to be subjected to character reading processing.