JPH04280394A

JPH04280394A - Character recognizing device and recognized character fixing method

Info

Publication number: JPH04280394A
Application number: JP3043844A
Authority: JP
Inventors: Shigeki Ozawa; 茂樹小澤
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1991-03-08
Filing date: 1991-03-08
Publication date: 1992-10-06

Abstract

PURPOSE: To provide a character recognition device and a recognized-character determination method that determine recognized characters by processing (post-processing) character data from an OCR (optical character reader), and that achieve correct character recognition with a higher probability. CONSTITUTION: A character reader 2 reads the characters written in each of a plurality of items of a slip 1, and a recognition processing device 3 determines the recognized characters from the character data read by the character reader 2. The recognition processing device 3 identifies, for each item, the character candidates to be recognized as the characters written in that item, and converts the character candidates for a first item into character candidates for a second item related to the first item. The character candidates identified for the second item are then compared with the converted character candidates, and the matching candidate is recognized and confirmed as the characters written in the second item.

Description

[Detailed description of the invention]

[0001]

[Industrial field of application] The present invention relates to a character recognition device and a recognized-character determination method that determine recognized characters by processing (post-processing) character data from an OCR (optical character reading) reader. As OCR readers and similar devices become widespread, it is desirable to process the read data without operator correction and to recognize handwritten characters more accurately.

[0002]

[Prior art] FIG. 6 illustrates a conventional character recognition device and its operation. Reference numeral 1 denotes an example of a transaction slip used at a bank or the like, which has a plurality of item entry fields (items No. 1 to No. 3).

[0003] Item No. 1 is the field for the furigana (phonetic reading) of the name. Item No. 2 is the field in which the name is written in kanji. Item No. 3 is the address field. Characters are normally handwritten in these fields.

[0004] At banks and similar institutions, data is read from such handwritten slips or forms by the OCR reader 2, and business processing is then performed. Handwritten characters must therefore be read and recognized accurately; otherwise the operator's correction workload grows and business processing becomes inefficient.

[0005] The characters written in each item field of the handwritten slip 1 are read by the OCR reader 2 and passed to the recognition processing device 3. The recognition processing device 3 identifies recognition candidate characters for each item field.

[0006] In item No. 1, the four katakana characters "ヤマモト" (Yamamoto) are handwritten. Based on the data read by the OCR reader 2, the recognition processing device 3 identifies "ヤ" (ya) or "ア" (a) as candidates for the character "ヤ". For the character "マ" (ma), "ア" or "マ" is identified. For the character "モ" (mo), "モ" and "サ" (sa) are identified, and for "ト" (to), only "ト" is identified as a candidate.

[0007] The recognition probability is 90% for candidate character column a, 80% for column b, and 70% for column c. Combining the candidates for each character yields eight candidate combinations (7) for the four characters "ヤマモト", ranging from "ヤアモト" to "ママサト".

[0008] The recognition processing device 3 then searches a memory dictionary (not shown) for each of these eight combinations (8). In the search results, combinations that do not exist in the dictionary are marked "×" and those that exist are marked "○".

[0009] That is, "ヤマモト" (Yamamoto) and "ヤマサト" (Yamasato) exist in the dictionary. The recognition probability of "ヤマモト" is 90% × 70% × 90% × 90%, and that of "ヤマサト" is 90% × 70% × 80% × 90%. Therefore, among the combinations that exist in the dictionary, "ヤマモト" has the highest recognition probability.
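The conventional flow of paragraphs [0006] to [0009] (enumerate combinations, look each one up in the dictionary, rank by the product of per-character probabilities) can be sketched roughly as follows. This is only an illustrative sketch: the probabilities for ヤ, マ, モ, サ, and ト follow the products quoted above, while the probabilities assigned to the two ア candidates, the `NAME_DICTIONARY` set, and all function names are assumptions made for the example.

```python
from itertools import product
from math import prod

# Per-character candidates as (character, recognition probability).
# Values for ヤ, マ, モ, サ, ト follow the products quoted in the text;
# the values for the two ア candidates are assumed for illustration.
CANDIDATES = [
    [("ヤ", 0.9), ("ア", 0.8)],
    [("ア", 0.9), ("マ", 0.7)],
    [("モ", 0.9), ("サ", 0.8)],
    [("ト", 0.9)],
]

# Hypothetical stand-in for the memory dictionary (not shown in the figures).
NAME_DICTIONARY = {"ヤマモト", "ヤマサト"}


def enumerate_combinations(candidates):
    """Return every candidate string with the product of its character probabilities."""
    results = []
    for combo in product(*candidates):
        text = "".join(ch for ch, _ in combo)
        probability = prod(p for _, p in combo)
        results.append((text, probability))
    return results


def best_dictionary_match(candidates, dictionary):
    """Keep combinations found in the dictionary and return the most probable one."""
    found = [(t, p) for t, p in enumerate_combinations(candidates) if t in dictionary]
    return max(found, key=lambda entry: entry[1]) if found else None


combinations = enumerate_combinations(CANDIDATES)
best = best_dictionary_match(CANDIDATES, NAME_DICTIONARY)
print(len(combinations), best[0])  # 8 ヤマモト
```

With these inputs, "ヤマモト" scores 0.9 × 0.7 × 0.9 × 0.9 and "ヤマサト" scores 0.9 × 0.7 × 0.8 × 0.9, so the former is selected, in line with the text.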

[0010] In item No. 2, the kanji "山本" (Yamamoto) is handwritten. It is processed in the same way as the handwritten characters of item No. 1. For the character "山", the candidates "小" and "山" are identified, and for the character "本", the candidates "本" and "木" are identified. These candidates yield four combinations (7), from "小本" to "山木".

[0011] These four combinations are then looked up in the dictionary (8). As a result, the combinations "小木" (Ogi), "山本" (Yamamoto), and "山木" (Yamaki) are found in the dictionary. The recognition probabilities of "小木" and "山本" are equal, and that of "山木" is the lowest.

[0012] In the conventional example, the candidate with the highest recognition probability among those present in the dictionary is determined as the recognized characters (9). For item No. 1, "ヤマモト" is therefore determined. For item No. 2, "小木" and "山本" have the same recognition probability; in this case the combination with the higher frequency of occurrence takes precedence, and "小木" is determined as the recognized characters.

[0013] Thus, with the recognized-character determination method of the conventional device, the recognition result for item No. 2 is wrong even though items No. 1 and No. 2 are related to each other, and the contradiction between the recognition results for items No. 1 and No. 2 goes unnoticed.

[0014]

[Problem to be solved by the invention] In the conventional device, as explained with reference to FIG. 6, the final recognized characters are determined on the basis of recognition probability alone. When there are many candidate characters, it is therefore difficult to expect correct character recognition with high probability. An object of the present invention is to solve this problem and to provide a character recognition device and a recognized-character determination method that achieve correct character recognition with a higher probability.

[0015]

[Means for solving the problem] FIG. 1 shows the principle of the present invention. In the figure, 1 is a slip or form on which characters are handwritten in a plurality of items (items 1 to 3). Items 1 and 2 contain handwritten item data that are related to each other, for example a name and its furigana.

[0016] Reference numeral 2 denotes an OCR reader, which reads the handwritten characters written in each item of the slip 1. Reference numeral 3 denotes a recognition processing device, which post-processes the character data read by the OCR reader 2 and determines the recognized characters.

[0017] In the present invention, the recognition processing device 3 has means 4 for identifying, item by item, the character candidates to be recognized as the characters written in each of the plurality of items, from the character data read by the OCR reader 2. A plurality of character candidates is given for each handwritten character.

[0018] It further has means 5 for converting the plurality of character candidates to be recognized as the characters written in the first item (item 1) into character candidates to be recognized as the characters written in a second item related to the first item. For example, when the second item contains a name in kanji and the first item contains its furigana, means 5 converts the furigana candidates for the first item into kanji character candidates for the second item.

[0019] Next, it has means 6, which compares the character candidates identified by means 4 for the second item with the kanji character candidates converted by means 5, and recognizes and confirms the matching character candidate as the characters written in the second item.

[0020]

[Operation] The present invention converts the character candidates for the first item into character candidates for the related second item and compares them with the character candidates identified for the second item. For example, the characters written in the first item are katakana or hiragana, and those written in the second item are the kanji related to them.

[0021] The katakana or hiragana of the first item is converted into the related kanji and compared with the kanji character candidates for the second item. The kanji candidate that matches in this comparison is confirmed as the characters written in the second item. Katakana and hiragana are more likely than kanji to be read and recognized correctly.

[0022] Because the present invention thus uses the relationship between the characters written in the first item and those written in the second item, it can increase the reliability with which the characters in the second item are read and recognized.

[0023]

[Embodiment] FIG. 2 shows an embodiment of the present invention. Parts that are the same as or similar to those in FIG. 1 (the principle diagram) and FIG. 6 (the conventional example) bear the same reference numerals. As in the conventional example of FIG. 6, 1 is an example slip, on which the name in kanji is handwritten in item No. 2, its furigana in item No. 1, and the address in item No. 3.

[0024] Reference numeral 2 denotes the OCR reader, which reads the handwritten characters of each item of the slip 1 and sends them to the recognition processing device 3. The recognition processing device 3 performs post-processing to recognize and confirm the characters read by the OCR reader 2. In the recognition processing device 3, the recognition processing operation that constitutes the post-processing function of the present invention is carried out according to software instructions stored in a ROM (read-only memory, not shown). The operation flow based on these software instructions is shown in FIGS. 3 and 4.

[0025] The post-processing of the recognition processing device 3 is executed according to the definition body for each item of the slip 1. FIG. 5 shows an example of the definition bodies for the items of slip 1 (10). Each definition body specifies the item type (A), the row position of the item on the slip (B), the column position (C), and the related item No. (D).
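One way to picture a definition body with fields (A) to (D) is a small record type keyed by item number. The sketch below is illustrative only: the field names, the item type labels, and the row and column values are hypothetical, and the text states a related item only for item No. 1, so the other entries are assumed to have none.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ItemDefinition:
    """Hypothetical in-memory form of one definition body."""
    item_no: int
    item_type: str                   # (A) type of item
    row: int                         # (B) row position on the slip (assumed value)
    column: int                      # (C) column position on the slip (assumed value)
    related_item_no: Optional[int]   # (D) related item No., if any


# Assumed layout for the three items of slip 1.
SLIP_DEFINITIONS = {
    1: ItemDefinition(1, "name_furigana", row=1, column=1, related_item_no=2),
    2: ItemDefinition(2, "name_kanji", row=2, column=1, related_item_no=None),
    3: ItemDefinition(3, "address", row=3, column=1, related_item_no=None),
}


def related_definition(definitions, item_no):
    """Return the definition of the item related to item_no, or None."""
    related = definitions[item_no].related_item_no
    return definitions.get(related) if related is not None else None


print(related_definition(SLIP_DEFINITIONS, 1).item_type)  # name_kanji
```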

[0026] Item No. 1 contains the furigana of the name. Its related item No. (D) therefore designates item No. 2, which contains the name in kanji. The definition bodies for the items of slip 1 are stored on the magnetic disk 11. They are read from the magnetic disk 11 and stored (loaded) in a RAM (writable memory, not shown) of the recognition processing device 3.

[0027] Accordingly, the recognition processing device 3 first checks whether the definition bodies have already been loaded into the RAM (step S1 in FIG. 3); if not, it reads them from the magnetic disk 11 and loads them into the RAM (step S2 in FIG. 3). It then obtains, from the data read by the OCR reader 2 and according to the definition body, the item information for the item to be processed (step S3 in FIG. 3).
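Steps S1 to S3 amount to a load-once lookup followed by fetching the item's data. A minimal sketch, assuming the disk read can be modelled as a callable and the OCR output as a dictionary keyed by item number (the class, function names, and sample data are all hypothetical):

```python
class DefinitionStore:
    """Holds definition bodies in memory and loads them from disk only once."""

    def __init__(self, load_from_disk):
        self._load_from_disk = load_from_disk  # stand-in for reading magnetic disk 11
        self._in_memory = None                 # stand-in for the RAM copy
        self.disk_reads = 0

    def get(self):
        if self._in_memory is None:                   # S1: already loaded?
            self._in_memory = self._load_from_disk()  # S2: read and load
            self.disk_reads += 1
        return self._in_memory


def get_item_info(store, ocr_data, item_no):
    """S3: pair the item's definition with the OCR data for that item."""
    definition = store.get()[item_no]
    return {"definition": definition, "ocr": ocr_data[item_no]}


def fake_disk_read():
    # Assumed content; the real definition bodies are shown in FIG. 5.
    return {1: {"type": "name_furigana", "related_item_no": 2}}


store = DefinitionStore(fake_disk_read)
ocr_data = {1: ["ヤマモト", "ヤマサト"]}
first = get_item_info(store, ocr_data, 1)
second = get_item_info(store, ocr_data, 1)
print(store.disk_reads)  # 1
```

The second call reuses the in-memory copy, so the simulated disk is read only once.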

[0028] That is, when item No. 1 is the item to be processed, the data read by the OCR reader 2 for the written characters "ヤマモト" is input, the candidate character combinations (FIG. 2, 7) are derived from the candidates for each character, and the dictionary is searched for each combination (FIG. 2, 8). A combination found in the dictionary is marked "○" and one not found is marked "×". This processing is the same as that of the conventional device shown in FIG. 6.

[0029] The definition body of item No. 1 specifies item No. 2 as its related item. As the next step, the recognition processing device 3 therefore obtains the item information of the related item No. 2 (step S4 in FIG. 3).

[0030] Item No. 2 contains the name in kanji, "山本". The data read by the OCR reader 2 for "山本" is input, the candidate character combinations (FIG. 2, 7) are derived from the candidates for each character, and the dictionary is searched for each combination (FIG. 2, 8). As with item No. 1, a combination found in the dictionary is marked "○" and one not found is marked "×".

[0031] According to the present invention, the candidate characters (katakana) of item No. 1, the item being processed, are next converted into the characters (kanji) of the related item No. 2 (step S5 in FIG. 4). As a result, the candidate "ヤマモト" is converted into "山本" and "山元", and the candidate "ヤマサト" is converted into "山里" (FIG. 2, 5).

[0032] The converted strings "山本", "山元", and "山里" then serve as the comparison source data, while the candidates "小木", "山本", and "山木" found by the dictionary search for item No. 2 serve as the comparison target data, and the two sets are compared one by one (steps S6, S7, S8, S9).

[0033] When a comparison source entry and a comparison target entry match, the matching string is determined as the recognized characters of item No. 2 (steps S10, S11). If no match is found in the end, the candidate with the highest recognition probability is determined as the recognized characters for each of item No. 1 and item No. 2 (steps S10, S12, S7, S8, S6, S13). In the embodiment of FIG. 2, a match is obtained on "山本", so "ヤマモト" is determined as the recognized characters for item No. 1 and "山本" for item No. 2 (FIG. 2, 6).
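The comparison of paragraphs [0031] to [0033] can be sketched as below. This is a hedged illustration, not the flowchart of FIGS. 3 and 4: the reading table `READINGS_TO_KANJI`, the numeric probabilities for the kanji candidates (chosen only so that "小木" and "山本" tie and "山木" is lowest), the order in which candidates are tried, and the function names are all assumptions.

```python
# Hypothetical conversion table from katakana readings to kanji names (means 5).
READINGS_TO_KANJI = {
    "ヤマモト": ["山本", "山元"],
    "ヤマサト": ["山里"],
}

# Candidates that passed the dictionary search, as (text, recognition probability).
KANA_CANDIDATES = [("ヤマモト", 0.5103), ("ヤマサト", 0.4536)]
KANJI_CANDIDATES = [("小木", 0.72), ("山本", 0.72), ("山木", 0.63)]  # assumed values


def probability(entry):
    return entry[1]


def confirm_by_cross_check(kana_candidates, kanji_candidates, readings_to_kanji):
    """Return (item 1 text, item 2 text) using the related-item comparison."""
    kanji_texts = {text for text, _ in kanji_candidates}
    # Try the more probable kana candidates first (an assumed ordering).
    for kana, _ in sorted(kana_candidates, key=probability, reverse=True):
        for converted in readings_to_kanji.get(kana, []):
            if converted in kanji_texts:
                return kana, converted  # match: confirm both items
    # No match: fall back to the most probable candidate for each item.
    best_kana = max(kana_candidates, key=probability)[0]
    best_kanji = max(kanji_candidates, key=probability)[0]
    return best_kana, best_kanji


print(confirm_by_cross_check(KANA_CANDIDATES, KANJI_CANDIDATES, READINGS_TO_KANJI))
# ('ヤマモト', '山本')
```

In this sketch the fallback breaks ties by list order rather than by frequency of occurrence, so with an empty conversion table it returns "小木" for item No. 2, which happens to mirror the conventional result.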

[0034] In the above embodiment the comparison is made in kanji, but the same result is obtained if the kanji candidates of item No. 2 are converted into katakana and compared with the katakana candidates of item No. 1.

[0035] Furthermore, a "postal code" or "bank code" field may be used as the first item and a field such as name or address as the second item. Because the recognition rate for a "postal code" or "bank code" is as high as 98 to 99%, this arrangement is particularly advantageous for increasing the probability of correct recognition of the second item.

[0036]

[Effect] As described in detail above, according to the present invention, even when there are multiple recognition candidates for character data read by an OCR reader, an appropriate candidate can be selected by comparison with the character candidates of a related item. This provides a character recognition device capable of more accurate character recognition.

[Brief explanation of the drawing]

FIG. 1 is a diagram showing the principle of the present invention.

FIG. 2 shows an embodiment of the present invention.

FIG. 3 is the first half of the operation flowchart of the embodiment of FIG. 2, which is common to the operation of the conventional example.

FIG. 4 is the second half of the operation flowchart of the embodiment of FIG. 2, showing the portion that characterizes the present invention.

FIG. 5 is an explanatory diagram of the definition bodies for the items of the slip in the embodiment of FIG. 2.

FIG. 6 shows an example of a conventional recognition device.

[Explanation of symbols]

1: Slip
2: Character reading device
3: Recognition processing device

Claims

[Claims]

[Claim 1] A character recognition device comprising a character reading device (2) that reads the characters written in each of a plurality of items of a slip (1), and a recognition processing device (3) that determines recognized characters from the character data read by the character reading device (2), wherein the recognition processing device (3) comprises: means (4) for identifying, for each item, character candidates to be recognized as the characters written in each of the plurality of items; means (5) for converting the character candidates to be recognized as the characters written in a first item into character candidates to be recognized as the characters written in a second item related to the first item; and means (6) for comparing the character candidates for the second item identified by the identifying means (4) with the character candidates converted by the converting means (5), and for recognizing and confirming the matching character candidate as the characters written in the second item.

[Claim 2] A recognized-character determination method that reads the characters written in each of a plurality of items of a slip (1) and recognizes and confirms the read characters for each item, the method comprising: identifying, for each item, character candidates to be recognized as the characters written in each of the plurality of items (4); converting the character candidates to be recognized as the characters written in a first item into character candidates to be recognized as the characters written in a second item related to the first item (5); and comparing the character candidates identified for the second item with the converted character candidates, and recognizing and confirming the matching character candidate as the characters written in the second item (6).

[Claim 3] The character recognition device according to claim 1, wherein the characters written in the first item are katakana or hiragana, and the characters written in the second item are kanji corresponding to the characters written in the first item.

[Claim 4] The recognized-character determination method according to claim 2, wherein the characters written in the first item are katakana or hiragana, and the characters written in the second item are kanji corresponding to the characters written in the first item.