JP2996823B2

JP2996823B2 - Character recognition device

Info

Publication number: JP2996823B2
Application number: JP4353042A
Authority: JP
Inventors: 英明磯貝
Original assignee: 英明磯貝
Priority date: 1992-12-11
Filing date: 1992-12-11
Publication date: 2000-01-11
Anticipated expiration: 2015-01-11
Also published as: JPH06180767A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of Industrial Application] The present invention relates to a character recognition device that cuts out characters written at predetermined locations from image data read by an image scanner or transmitted by a facsimile machine, and performs character recognition processing on them.

[0002]

[Prior Art] The OCR (optical character reader) has long been known as a device for inputting required data into a computer. An OCR recognizes targets such as characters, symbols, and marks written at predetermined positions on a form by character recognition processing or position detection processing, and converts them into predetermined data that can be input to a computer. Recently, in addition to feeding forms directly into an OCR, systems have also come into use that cut out recognition targets from the image data of a form transmitted from a remote location by a facsimile machine or the like and perform recognition processing on them.

[0003] In a typical character recognition device, the character information portion is detected in the image data and cut out, and the recognition target is then identified by performing character recognition and word recognition. In this case, if even one character cannot be identified, word recognition becomes impossible.

[0004] For postal code readers and the like, on the other hand, a method has been proposed that uses two types of character information that complement each other in identifying the recognition target, such as a postal code and an address. When the postal code cannot be read, it is identified from the address recognition result, which improves the reading accuracy of the postal code (Japanese Patent Laid-Open No. 2-242389).

[0005]

[Problem to be Solved by the Invention] However, the conventional method described above requires that one of the two types of character information be reliably recognized. Consequently, if both types of character information are unreadable, recognition fails as well.

[0006] The present invention was made to solve this problem. Its object is to provide a character recognition device that can recognize character strings even when none of a plurality of types of character strings that complement each other in identifying a recognition target is readable on its own, and that can thereby markedly improve the character string recognition rate.

[0007]

[Means for Solving the Problem] A character recognition device according to the present invention comprises: a plurality of character recognition processing means, each of which performs character recognition processing on the image of one of a plurality of types of character strings that complement each other in identifying a recognition target, and outputs an incomplete recognition result of that character string containing the characters recognized with at least a predetermined certainty together with information on their positions; a supplement dictionary that stores each type of character string that can be a recognition target in association with the character strings of the other types; and supplement processing means that, based on the incomplete recognition result of each type of character string obtained by each character recognition processing means, looks up in the supplement dictionary the character strings of the same type that contain the recognized characters at the recognized positions, extracts from the supplement dictionary the corresponding character string candidates of the other types, and supplies these candidates to the respective character recognition processing means so that the types supplement one another. Each character recognition processing means repeats recognition processing, narrowing the range of recognition targets with the character string candidates supplied from the supplement processing means, until a complete recognition result of the character string is determined.

[0008] Another character recognition device according to the present invention comprises: a plurality of character recognition processing means that each perform character recognition processing on the image of one of a plurality of types of character strings that complement each other in identifying a recognition target; and a supplement processing table that yields a complete recognition result for each type of character string from the combination of the incomplete recognition results of the respective types obtained by the character recognition processing means.

[0009]

[Operation] According to the present invention, images of a plurality of types of character strings that complement each other in identifying a recognition target are processed by a plurality of character recognition processing means. Even when none of these means obtains a complete recognition result, several character string candidates for the other types of character strings are extracted from each incomplete recognition result. Each character recognition processing means then refers to the candidates derived from the incomplete results of the other means, narrows down step by step the character string it is recognizing, and finally obtains a complete recognition result. Thus, even when only incomplete recognition results are available from the individual character recognition processing means, the mutual complementarity of the multiple types of character strings makes it possible to arrive at a complete recognition result in the end.

[0010] According to the other character recognition device of the present invention, a supplement processing table yields the complete recognition result of each type of character string in a single step from the combination of the incomplete recognition results for the multiple types of character strings, so the recognition result can be obtained at high speed.

[0011] Note that "a plurality of character recognition processing means" here does not necessarily mean that several are physically provided. Where the recognition processing of each type of character string is executed in sequence, the recognition processing step for each character string corresponds to one character recognition processing means.

[0012]

[Embodiments] Embodiments of the present invention are described below with reference to the accompanying drawings. FIG. 1 is a functional block diagram showing a character recognition device according to one embodiment of the present invention. Image data obtained by reading a given form with an image scanner (not shown) is supplied to the character detection and cutout unit 1. The character detection and cutout unit 1 detects and cuts out characters from the X-direction and Y-direction projections of specific regions of the image data. Of the image data cut out by the character detection and cutout unit 1, character strings of the first type are supplied to the character recognition unit 2, and character strings of the second type are supplied to the character recognition unit 3. The character recognition units 2 and 3 refer to the character recognition dictionaries 4 and 5, respectively, and recognize each character from the image data of the cut-out character strings. The character recognition results from the character recognition units 2 and 3 are supplied to the word recognition units 6 and 7, respectively. The word recognition units 6 and 7 refer to the word recognition dictionaries 8 and 9, respectively, and perform word recognition processing based on the character recognition results. The character recognition unit 2, the character recognition dictionary 4, the word recognition unit 6, and the word recognition dictionary 8 constitute the first character recognition processing means 15; the character recognition unit 3, the character recognition dictionary 5, the word recognition unit 7, and the word recognition dictionary 9 constitute the second character recognition processing means 16.
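The projection-based cutout performed by the character detection and cutout unit 1 can be illustrated with a minimal sketch. The patent text only states that X- and Y-direction projections are used; the binary-image representation, the function names `runs` and `segment_characters`, and the rule that any non-zero projection value counts as ink are assumptions made here for illustration.

```python
from typing import List, Tuple


def runs(profile: List[int]) -> List[Tuple[int, int]]:
    """Return [start, end) index pairs of consecutive non-zero entries."""
    spans, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i
        elif not value and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def segment_characters(region: List[List[int]]) -> List[Tuple[int, int, int, int]]:
    """Cut a binary region (1 = ink) into character boxes (x0, x1, y0, y1)."""
    # Projection onto the X axis: ink count per column.
    col_profile = [sum(col) for col in zip(*region)]
    boxes = []
    for x0, x1 in runs(col_profile):
        # Projection onto the Y axis, restricted to this column run.
        row_profile = [sum(row[x0:x1]) for row in region]
        ys = runs(row_profile)
        boxes.append((x0, x1, ys[0][0], ys[-1][1]))
    return boxes


# Hypothetical 4x6 region containing two separate blobs of ink.
REGION = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 0, 0, 1, 1],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
BOXES = segment_characters(REGION)
```

Each box would then be handed to the character recognition unit for the string type it belongs to. Real forms need noise handling and touching-character splitting, which this sketch leaves out.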

[0013] The word recognition result of the word recognition unit 6 is supplied to the word supplement processing unit 10, and the word recognition result of the word recognition unit 7 is supplied to the word supplement processing unit 11. The word supplement processing unit 10 refers to the supplement dictionary 12 to obtain character string candidates for the first type of character string from the incomplete word recognition result of the word recognition unit 6, and supplies the second-type character string candidates identified from these candidates to the character recognition dictionary 5, the word recognition unit 7, and the word recognition dictionary 9. Similarly, the word supplement processing unit 11 refers to the supplement dictionary 12 to obtain character string candidates for the second type of character string from the incomplete recognition result of the word recognition unit 7, and supplies the first-type character string candidates identified from these candidates to the character recognition dictionary 4, the word recognition unit 6, and the word recognition dictionary 8.

[0014] The character recognition units 2 and 3 and the word recognition units 6 and 7 then execute recognition processing again based on the supplied first-type and second-type character string candidates, narrowing the candidates down.

[0015] Suppose now that the first type of character string is a postal code and the second type is a municipality name (ward, city, town, or village). The device then carries out recognition as follows. Suppose the first recognition pass of the character recognition unit 2 yields "??7" and the first recognition pass of the character recognition unit 3 yields "?川?". From these results alone, the word recognition units 6 and 7 cannot immediately obtain the desired recognition results. The word supplement processing unit 11 therefore searches the supplement dictionary 12 for municipality names that are three characters long and have 川 as their second character. If this search returns, for example, 立川市 (Tachikawa City), 秋川市 (Akigawa City), 荒川区 (Arakawa Ward), and 品川区 (Shinagawa Ward), the word supplement processing unit 11 next searches the supplement dictionary 12 for the first-type character string candidates identified by these strings, that is, their postal codes. If that search returns "190", "197", "116", and "142", these become the first-type character string candidates.
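The lookup performed by the word supplement processing unit 11 can be sketched as a positional wildcard match against the dictionary. In the sketch below, `SUPPLEMENT_DICT` is a hypothetical five-entry excerpt built from the names and codes quoted in this description, and `matches` and `postal_candidates` are illustrative helper names, not terms from the patent. An ASCII `?` stands for an unreadable character.

```python
from typing import Dict, List

# Hypothetical excerpt of supplement dictionary 12: municipality -> postal code.
SUPPLEMENT_DICT: Dict[str, str] = {
    "立川市": "190",
    "秋川市": "197",
    "荒川区": "116",
    "品川区": "142",
    "福生市": "197",
}


def matches(reading: str, word: str, unknown: str = "?") -> bool:
    """True if word has the same length as reading and agrees at every legible position."""
    return len(reading) == len(word) and all(
        r == unknown or r == w for r, w in zip(reading, word)
    )


def postal_candidates(name_reading: str) -> List[str]:
    """Postal codes of all municipalities consistent with a partial name reading."""
    return [code for name, code in SUPPLEMENT_DICT.items() if matches(name_reading, name)]


CANDIDATES = postal_candidates("?川?")
```

With this excerpt, `CANDIDATES` holds the four codes named in the paragraph above; 福生市 is excluded because its second character is not 川.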

[0016] The first-type character string candidates "190", "197", and so on obtained in this way are supplied to the word recognition unit 6. From its own recognition result "??7" and the candidates "190", "197", "116", and "142" supplied by the word supplement processing unit 11, the word recognition unit 6 recognizes the first-type character string as "197".

[0017] Next, the word supplement processing unit 10 searches the supplement dictionary 12 for the second-type character strings identified by the postal code "197", the first-type character string recognized by the word recognition unit 6, and obtains the candidates 秋川市 (Akigawa City) and 福生市 (Fussa City). These second-type character string candidates are supplied to the word recognition unit 7. From its own recognition result "?川?" and the candidates 秋川市 and 福生市 supplied by the word supplement processing unit 10, the word recognition unit 7 recognizes the second-type character string as 秋川市. In this way, the first-type and second-type character string candidates supplement the incomplete recognition results of each type, so that complete recognition results can be obtained from incomplete ones.
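The full exchange between the two recognition paths in this worked example can be traced with a short sketch. `PAIRS` is a hypothetical excerpt of the supplement dictionary 12 using only the entries quoted in the description, and `mutual_supplement` is an illustrative name; the comments map each step to the unit that performs it in the text. This is one possible reading of the procedure, not the patented implementation.

```python
from typing import List, Tuple

# Hypothetical excerpt of supplement dictionary 12 as (postal code, municipality) pairs.
PAIRS: List[Tuple[str, str]] = [
    ("190", "立川市"),
    ("197", "秋川市"),
    ("116", "荒川区"),
    ("142", "品川区"),
    ("197", "福生市"),
]


def matches(reading: str, word: str, unknown: str = "?") -> bool:
    """True if word agrees with reading at every legible position."""
    return len(reading) == len(word) and all(
        r == unknown or r == w for r, w in zip(reading, word)
    )


def mutual_supplement(code_reading: str, name_reading: str) -> Tuple[List[str], List[str]]:
    """Narrow both readings against each other, following the worked example."""
    # Unit 11: names fitting the partial name reading, then their postal codes.
    names = [n for _, n in PAIRS if matches(name_reading, n)]
    code_candidates = [c for c, n in PAIRS if n in names]
    # Unit 6: keep the code candidates that fit the partial code reading.
    codes = sorted({c for c in code_candidates if matches(code_reading, c)})
    # Unit 10: names registered under the surviving codes.
    name_candidates = [n for c, n in PAIRS if c in codes]
    # Unit 7: keep the name candidates that fit the partial name reading.
    names = [n for n in name_candidates if matches(name_reading, n)]
    return codes, names


CODES, NAMES = mutual_supplement("??7", "?川?")
```

If either returned list still holds more than one entry, the same steps could be repeated with further evidence, which is the repetition "until a complete recognition result is determined" that the description calls for.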

[0018] To let the word supplement processing units 10 and 11 search for character string candidates at high speed, index files can be used for the supplement dictionary 12. Specifically, as shown in FIG. 2, index files 21₁, 21₂, 21₃, …, 21ₙ are created, one for each combination of conditions on the second-type character string, such as only one character being legible, only two characters being legible, the positions of those characters, and the characters themselves. These index files 21₁ to 21ₙ store indexes, pointers, or the like to the corresponding character strings stored in the master database file 22. This allows first-type character string candidates to be retrieved immediately from an incomplete second-type recognition result. The same file structure can be used when retrieving second-type character string candidates from first-type recognition results.
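One way to approximate the index files is an inverted index keyed on (string length, position, character), with each key pointing at row numbers in the master file. This is a swapped-in simplification: the description keeps one index file per combination of conditions, whereas the sketch intersects per-character postings at query time. `MASTER`, `build_index`, and `lookup_codes` are hypothetical names, and the rows are the entries quoted in the description.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple

# Hypothetical excerpt of master database file 22: (postal code, municipality).
MASTER: List[Tuple[str, str]] = [
    ("190", "立川市"),
    ("197", "秋川市"),
    ("116", "荒川区"),
    ("142", "品川区"),
    ("197", "福生市"),
]

Index = Dict[Tuple[int, int, str], Set[int]]


def build_index(master: List[Tuple[str, str]]) -> Index:
    """Map (name length, position, character) to row numbers in the master file."""
    index: Index = defaultdict(set)
    for row, (_, name) in enumerate(master):
        for pos, ch in enumerate(name):
            index[(len(name), pos, ch)].add(row)
    return index


def lookup_codes(index: Index, master: List[Tuple[str, str]],
                 reading: str, unknown: str = "?") -> List[str]:
    """Postal codes of rows whose name agrees with every legible character."""
    rows = None
    for pos, ch in enumerate(reading):
        if ch == unknown:
            continue
        hit = index.get((len(reading), pos, ch), set())
        rows = hit if rows is None else rows & hit
    if rows is None:
        # Nothing legible: every name of the right length remains a candidate.
        rows = {r for r, (_, n) in enumerate(master) if len(n) == len(reading)}
    return [master[r][0] for r in sorted(rows)]


INDEX = build_index(MASTER)
```

The lookup touches only the postings for legible characters instead of scanning every dictionary entry, which is the speed-up the paragraph is after.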

[0019] To increase the search speed further, a supplement processing table 31 can be created, for example as shown in FIG. 3, that takes the incomplete recognition results of the first and second types as input and outputs the complete recognition results of the first and second types identified by that combination.
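A table of this kind could be precomputed by enumerating every masked variant of every dictionary entry and keeping only the combinations that identify exactly one entry. The sketch below is an assumption about how such a table might be built; the description does not say how table 31 is generated. `PAIRS`, `masks`, and `build_table` are hypothetical names, and the enumeration grows exponentially with string length, so it only suits short strings such as three-digit codes.

```python
from itertools import product
from typing import Dict, List, Set, Tuple

# Hypothetical excerpt of the dictionary as (postal code, municipality) pairs.
PAIRS: List[Tuple[str, str]] = [
    ("190", "立川市"),
    ("197", "秋川市"),
    ("116", "荒川区"),
    ("142", "品川区"),
    ("197", "福生市"),
]


def masks(word: str, unknown: str = "?") -> Set[str]:
    """All readings of word with any subset of characters replaced by unknown."""
    options = [(ch, unknown) for ch in word]
    return {"".join(choice) for choice in product(*options)}


def build_table(pairs: List[Tuple[str, str]]) -> Dict[Tuple[str, str], Tuple[str, str]]:
    """Map (partial code, partial name) to the single entry it identifies."""
    table: Dict[Tuple[str, str], Tuple[str, str]] = {}
    ambiguous: Set[Tuple[str, str]] = set()
    for code, name in pairs:
        for key in product(masks(code), masks(name)):
            if key in table and table[key] != (code, name):
                ambiguous.add(key)
            table[key] = (code, name)
    # Drop combinations that fit more than one entry.
    for key in ambiguous:
        del table[key]
    return table


TABLE = build_table(PAIRS)
```

Once the table exists, resolving a pair of incomplete readings is a single dictionary lookup such as `TABLE.get(("??7", "?川?"))`, with no iteration between the two recognition paths.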

[0020] To raise the recognition rate further, recognition processing can be executed again with doubtful candidates included, that is, candidates that did not reach the predetermined matching level in the character recognition units 2 and 3 and the word recognition units 6 and 7. For example, suppose that the recognition results for each character of a character string in the character recognition unit 2 are as shown in Table 1 below.

[0021]

[Table 1]

[0022] In this case, if the threshold for outputting a character as a recognition result is 0.75, the recognition result output from the character recognition unit 2 is "??7", as described above. When the word supplement processing unit 11 then supplies, for example, "190", "197", "116", and "142" as character string candidates, the total of the matching levels is calculated for each candidate string. For example, the total for "190" is 1.05 and the total for "197" is 1.63. The candidate with the highest total is taken as the recognition result. This further improves the recognition rate. In addition, if such recognition processing shows that the matching level for a particular character tends to be consistently low, the character recognition dictionaries 4 and 5 and the word recognition dictionaries 8 and 9 may be updated so as to raise that matching level.
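The scoring rule can be sketched as follows. The values of Table 1 are not reproduced in this text, so the per-position matching levels in `LEVELS` are invented for illustration; they were chosen only so that the totals for "190" and "197" agree with the 1.05 and 1.63 quoted above and so that only the final "7" clears the 0.75 threshold. The function names are likewise hypothetical.

```python
from typing import Dict, List

# Hypothetical matching levels per character position. NOT the values of Table 1.
LEVELS: List[Dict[str, float]] = [
    {"1": 0.40, "7": 0.30},
    {"9": 0.45, "4": 0.35, "1": 0.25},
    {"7": 0.78, "0": 0.20, "2": 0.15, "6": 0.10},
]
THRESHOLD = 0.75


def thresholded_reading(levels: List[Dict[str, float]], threshold: float,
                        unknown: str = "?") -> str:
    """Emit the top-scoring character per position, or unknown if below threshold."""
    out = []
    for scores in levels:
        best = max(scores, key=scores.get)
        out.append(best if scores[best] >= threshold else unknown)
    return "".join(out)


def total_level(candidate: str, levels: List[Dict[str, float]]) -> float:
    """Sum of the matching levels of the candidate's characters, position by position."""
    return sum(scores.get(ch, 0.0) for ch, scores in zip(candidate, levels))


def best_candidate(candidates: List[str], levels: List[Dict[str, float]]) -> str:
    """Candidate with the highest total matching level."""
    return max(candidates, key=lambda c: total_level(c, levels))


READING = thresholded_reading(LEVELS, THRESHOLD)
WINNER = best_candidate(["190", "197", "116", "142"], LEVELS)
```

Summing levels lets below-threshold evidence, such as the weak "1" and "9" here, still contribute to the choice among the supplied candidates instead of being discarded.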

[0023] In the embodiment above, the first-type character string candidates "190", "197", and so on were extracted first from the second-type recognition result "?川?". Alternatively, second-type character string candidates may be extracted first from the first-type recognition result "??7". Which is done first can be set as appropriate, for example according to the number of candidates.

[0024] When recognition uses a larger number of mutually complementary types of character strings, for example a postal code, an address, its phonetic reading (furigana), and the area code of a telephone number, the device can be configured as shown in FIG. 4. Character recognition processing systems 48₁ to 48ₙ, each consisting of a character recognition unit 42, a character recognition dictionary 43, a word recognition unit 44, and a word recognition dictionary 45, are provided in a number equal to the number of character string types, and re-recognition is executed using the larger set of character string candidates fed back from the word supplement processing unit 46. Recognition can thus be performed with higher accuracy on the basis of more information.
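The extension to more than two string types can be sketched by storing one record per recognition target and keeping the records that agree with every partial reading supplied. `RECORDS` is a hypothetical three-field excerpt (postal code, municipality, katakana reading) and `narrow` is an illustrative name; none of this layout comes from the patent.

```python
from typing import Dict, List

# Hypothetical multi-field dictionary: one record per recognition target.
RECORDS: List[Dict[str, str]] = [
    {"postal_code": "190", "municipality": "立川市", "reading": "タチカワシ"},
    {"postal_code": "197", "municipality": "秋川市", "reading": "アキガワシ"},
    {"postal_code": "116", "municipality": "荒川区", "reading": "アラカワク"},
    {"postal_code": "142", "municipality": "品川区", "reading": "シナガワク"},
    {"postal_code": "197", "municipality": "福生市", "reading": "フッサシ"},
]


def matches(reading: str, word: str, unknown: str = "?") -> bool:
    """True if word agrees with reading at every legible position."""
    return len(reading) == len(word) and all(
        r == unknown or r == w for r, w in zip(reading, word)
    )


def narrow(records: List[Dict[str, str]], readings: Dict[str, str]) -> List[Dict[str, str]]:
    """Keep the records consistent with every partial reading supplied."""
    return [
        rec for rec in records
        if all(matches(partial, rec[field]) for field, partial in readings.items())
    ]


TWO_FIELDS = narrow(RECORDS, {"postal_code": "19?", "municipality": "?川?"})
THREE_FIELDS = narrow(
    RECORDS,
    {"postal_code": "19?", "municipality": "?川?", "reading": "??ガ??"},
)
```

In this example two fields leave two candidates, and the third field removes the remaining ambiguity, which illustrates why additional string types can raise accuracy.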

[0025]

[Effects of the Invention] As described above, according to the present invention, even when only incomplete recognition results are obtained from the individual character recognition processing means, the recognition processing of each character string is supplemented by the character string candidates derived from that incomplete information. Character strings can therefore be recognized even when none of the plurality of types of character strings that complement each other in identifying the recognition target is readable on its own, and the character string recognition rate can be markedly improved.

[Brief description of the drawings]

FIG. 1 is a functional block diagram of a character recognition device according to an embodiment of the present invention.

FIG. 2 is a diagram showing a configuration example of the supplement dictionary in the device.

FIG. 3 is a functional block diagram showing another configuration example of the supplement processing unit in the device.

FIG. 4 is a functional block diagram of a character recognition device according to another embodiment of the present invention.

[Explanation of symbols]

1, 41: character detection and cutout unit
2, 3, 42: character recognition unit
4, 5, 43: character recognition dictionary
6, 7, 44: word recognition unit
8, 9, 45: word recognition dictionary
10, 11, 46: word supplement processing unit
12, 47: word supplement dictionary
15: first character recognition processing means
16: second character recognition processing means
48₁ to 48ₙ: character recognition processing means

Claims

(57) [Claims]

1. A character recognition device comprising: a plurality of character recognition processing means, each of which performs character recognition processing on the image of one of a plurality of types of character strings that complement each other in identifying a recognition target, and outputs an incomplete recognition result of that character string containing the characters recognized with at least a predetermined certainty and information on their positions; a supplement dictionary that stores each type of character string that can be a recognition target in association with the character strings of the other types; and supplement processing means that, based on the incomplete recognition result of each type of character string obtained by each of the character recognition processing means, looks up in the supplement dictionary the character strings of the same type that contain the recognized characters at the recognized positions, extracts from the supplement dictionary the corresponding character string candidates of the other types, and supplies them to the respective character recognition processing means; wherein each of the character recognition processing means repeats recognition processing, narrowing the range of recognition targets with the character string candidates supplied from the supplement processing means, until a complete recognition result of the character string is determined.