JP2000293626A

JP2000293626A - Method and device for recognizing character and storage medium

Info

Publication number: JP2000293626A
Application number: JP11102841A
Authority: JP
Inventors: Takeshi Hasegawa; 武司長谷川
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1999-04-09
Filing date: 1999-04-09
Publication date: 2000-10-20
Anticipated expiration: 2019-04-09
Also published as: JP3485020B2

Abstract

PROBLEM TO BE SOLVED: To improve performance in a way optimized for the objects actually processed: when a reject or an error is output as the final processing result, the correct character string is entered, knowledge for obtaining the correct preprocessing result is acquired using that string as a key, and the knowledge used in preprocessing is updated with the acquired knowledge. SOLUTION: A character segmenting part 24 uses knowledge stored in a preprocessing knowledge storing part 22 to divide a detected character string area image into the blocks considered most suitable as characters. An individual character recognizing part 25 recognizes each divided image as a character. A knowledge processing part 26 applies previously given knowledge about character strings to the individual character recognition results, constructs an appropriate character string, and outputs it as the final processing result. A correction processing part 27 is used for manual correction when a reject or an error is output as the final processing result. A learning mechanism part 20 updates the stored knowledge in order to learn it.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a character recognition method and a character recognition device (so-called OCR; Optical Character Reader) that optically read characters written on a processing object and recognize them. In particular, it relates to a character recognition method and device that handle free-format recognition targets: from the many character strings, patterns, and other marks contained in the image of the recognition target, they detect the recognition target area containing the character strings that are actually to be recognized, and then perform character and character string recognition.

[0002]

2. Description of the Related Art

An optical character recognition device, generally called OCR, reads an object on which handwritten characters are written or printed characters are printed with an image input device (scanner) to obtain a recognition target image, then recognizes the character strings contained in that image and the characters in them. The image input device itself is general-purpose technology, and character recognition can also be performed on recognition target image data that already exists; what characterizes an optical character recognition device is therefore the extraction of character strings from the recognition target image and the recognition of their characters. In that sense, this specification does not dwell on the word "optical" and deals with techniques for recognizing the characters contained in an image.

[0003] When characters are written or printed one at a time in entry boxes whose position and size are fixed in advance, it suffices to perform individual character recognition on each character. In free format, however, that is, when recognizing characters written or printed with no particular format prescribed (for example, when extracting the character string that forms the address from a mail item and recognizing the characters in it), processing is first needed to extract the individual characters to be recognized from the recognition target image (this is called preprocessing). When free-format character recognition is performed to sort mail, preprocessing must extract, as the recognition target area, the block-shaped area (address description area) in which the characters and character strings of the destination address (or name), not the sender's address, are written; extract a character string image for each line from that area; and cut out the characters one by one from each character string image. Preprocessing also includes, as needed, steps that make the subsequent individual character recognition easier: filling in faint parts of characters, removing noise caused by stains and dirt on the mail surface, correcting skewed character images, detecting and removing underlines, and equalizing (normalizing) character size. In character segmentation it is important to avoid erroneous cuts, such as splitting what should be the single character "記" into "言" and "己", or cutting out what should be the two characters "三原" as a single character corresponding to "源".
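The segmentation ambiguity described above can be illustrated with a minimal sketch. It assumes, purely for illustration, that a line image has already been split into left-to-right connected components and that one character may span up to `max_group` consecutive components. The function name and parameters are hypothetical and are not taken from the patent, which does not specify how candidates are enumerated.

```python
from typing import List, Tuple

Grouping = List[Tuple[int, int]]  # (start, end) component ranges, end exclusive


def grouping_hypotheses(n_components: int, max_group: int = 2) -> List[Grouping]:
    """Enumerate every way to group consecutive components into character blocks."""
    if n_components == 0:
        return [[]]
    hypotheses: List[Grouping] = []
    for size in range(1, min(max_group, n_components) + 1):
        # Take the first `size` components as one block, then group the rest.
        for rest in grouping_hypotheses(n_components - size, max_group):
            shifted = [(start + size, end + size) for start, end in rest]
            hypotheses.append([(0, size)] + shifted)
    return hypotheses


# A widely written "記" may arrive as two components, "言" and "己".
components = ["言", "己"]
for grouping in grouping_hypotheses(len(components)):
    print(["".join(components[s:e]) for s, e in grouping])
```

For two components the sketch yields two hypotheses: two separate blocks, or one merged block. Choosing between them is exactly the decision that segmentation knowledge has to get right.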

[0004] It is known that giving a learning function to the individual character recognition performed after character segmentation is effective for coping with differences in writing habits and typefaces (fonts). Many proposals have already been made for learning functions in the individual character recognition of character recognition devices. For example, JP-A-8-180141 "Character Recognition System", JP-A-5-94565 "Handwritten Character Recognition Method", and JP-A-5-054196 "License Plate Recognition Device" each disclose learning applied to individual character recognition itself.

[0005] The use of a knowledge base in character recognition is also known. For example, JP-A-10-198764 "Character String Recognition Device and Knowledge Database Learning Method" discloses a technique in which, when a recognition candidate character string is an unregistered word, that string is registered in the knowledge database, enabling automatic learning of the knowledge database applied to character recognition processing. JP-A-7-6203 discloses a character recognition device that performs learning for specific formats called forms. Japanese Patent No. 2751865 (JP-A-8-287188) applies a knowledge base to character segmentation in preprocessing.

[0006] FIG. 12 is a block diagram showing the configuration of a conventional character recognition device. A character recognition device used in a mail sorting machine is described here. The device comprises: an image input unit 91 that captures the processing object (a mail item) as binary or multi-valued recognition target image data; a preprocessing knowledge storage unit 92 that stores in advance knowledge given as a format database, processing parameters, or the like; a character string area detection unit 93 that uses the knowledge stored in the preprocessing knowledge storage unit 92 to detect the address description area (recognition target area) in the recognition target image data acquired by the image input unit 91 and extract character string images; a character segmentation unit 94 that likewise uses the knowledge stored in the preprocessing knowledge storage unit 92 to divide each detected character string image into the blocks considered most suitable as characters; an individual character recognition unit 95 that recognizes each divided image as a character; and a knowledge processing unit 96 that applies previously given knowledge about character strings (place name information and the like) to the individual character recognition results, constructs an appropriate character string, and outputs it as the final processing result. When the knowledge processing unit 96 finds no appropriate character string, that is, when it judges that a correct recognition result cannot be obtained, it outputs a reject as the final processing result. The device is further provided with a correction processing unit 97, in which, when a reject or an error is output as the final processing result, an operator corrects the result by hand and manually enters the correct destination of the mail item (for example, the 7-digit postal code and the chome, block, house number, building, and room information) into the mail sorting machine.

[0007] A mail sorting machine using this character recognition device sorts mail according to the output of the knowledge processing unit 96; when a reject or other error is detected in that output, it sorts the mail according to the correction made in the correction processing unit 97. In this conventional device, the method described in the above-mentioned Japanese Patent No. 2751865 can be used for character segmentation in the character segmentation unit 94. Character recognition methods that have a learning function and perform knowledge processing can be applied to the processing in the individual character recognition unit 95 and the knowledge processing unit 96.

[0008]

PROBLEMS TO BE SOLVED BY THE INVENTION

The conventional character recognition device shown in FIG. 12 performs preprocessing by knowledge processing and performs individual character recognition that uses knowledge processing and has a learning function. However, no learning is performed for the preprocessing (detection of the recognition target area and character segmentation), which is especially important in free-format character recognition. A major reason is that character recognition technology has so far developed around fixed-format forms, so little attention was paid to detecting the recognition target area or to character segmentation. As demand for free-format character recognition devices grows, however, performance problems in the preprocessing stage have become more serious, and learning that optimizes this processing for the material actually handled has become essential.

[0009] The performance of a character recognition device is explained here. Character recognition by such devices has not yet reached human level, so improving reading performance remains a major technical challenge. Put most simply, performance is how much was recognized correctly. Concrete indicators include, for example, the accuracy of the "address description area detection" in preprocessing, the accuracy of "character segmentation" (for multi-candidate processing, the rate at which the correct answer is contained among the candidates), the correct reading rate of individual character recognition, and the accuracy of knowledge processing.
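The indicators listed above are simple ratios, and a small sketch may make them concrete. The record layout (`ItemOutcome`) and the key names are assumptions introduced here for illustration; the patent names the indicators but does not define a data format for them.

```python
from dataclasses import dataclass
from typing import Dict, Iterable


@dataclass
class ItemOutcome:
    """Per-item evaluation record (hypothetical layout)."""
    area_correct: bool                 # address area detected correctly
    segmentation_contains_truth: bool  # correct cut is among the candidates
    chars_total: int                   # characters in the ground truth
    chars_correct: int                 # characters read correctly
    final_correct: bool                # knowledge processing output is correct


def stage_rates(outcomes: Iterable[ItemOutcome]) -> Dict[str, float]:
    """Compute one rate per processing stage over a set of evaluated items."""
    items = list(outcomes)
    if not items:
        raise ValueError("at least one outcome is required")
    n = len(items)
    total_chars = sum(o.chars_total for o in items)
    return {
        "area_detection": sum(o.area_correct for o in items) / n,
        "segmentation_containment": sum(o.segmentation_contains_truth for o in items) / n,
        "char_read": (sum(o.chars_correct for o in items) / total_chars) if total_chars else 0.0,
        "knowledge_processing": sum(o.final_correct for o in items) / n,
    }
```

Keeping the rates separate per stage shows where items are lost, which is the reason the text distinguishes preprocessing accuracy from individual character and knowledge processing accuracy.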

[0010] A problem common to free-format character recognition is that it is difficult to optimize a product in advance for the variations and writing conditions of the items that will be processed in actual operation. One free-format character recognition device now in practical use is, as noted above, the device that reads addresses written on mail. For it too, detecting the address character string among the various pictures, text, logos, and so on is a major challenge. Many mail items cannot be handled by the preprocessing prepared in advance, such as detection of the address description area (recognition target area) and character segmentation, and the addresses that predominantly appear differ with the region where the device operates. Each individual device therefore needs its performance improved through learning and optimization matched to its actual operation.

[0011] However, methods for optimizing this processing to individual operating conditions through automatic learning, and thereby improving performance, have not been studied so far.

[0012] An object of the present invention is therefore to provide a character recognition method and device that learn the preprocessing in free-format character recognition, such as recognition target area detection and character segmentation, and thereby make possible performance improvements optimized for the items actually processed.

[0013]

MEANS FOR SOLVING THE PROBLEMS

In free-format character recognition, rejects and errors are generally caused by failures in preprocessing: detection of the recognition target area, extraction of character string areas, and character segmentation. On the other hand, even when an item is ultimately rejected, it is rare that no character string candidate could be generated at all. More often a candidate was produced but rejected in the end as unreliable, or several candidates were generated and the item was rejected because none could be settled on when one had to be chosen. Misrecognition is similar: in many cases a wrong candidate was ultimately selected from among several.

[0014] One reason the correct candidate cannot be kept as the final candidate is the problem that it is difficult to optimize a product in advance for the variations and writing conditions of the items processed in actual operation. Many methods for optimizing individual character recognition to operating conditions have already been published; if the preprocessing as well could be optimized to actual operating conditions, free-format character recognition performance could be improved markedly.

[0015] The present invention therefore uses the information obtained when rejected or misrecognized items are corrected by hand to optimize, for actual operation, the knowledge used in the candidate detection and selection stages of preprocessing, and thereby improves performance.

[0016] That is, the character recognition method of the present invention performs character recognition on a recognition target image in which characters are written in free format, and comprises: a preprocessing step of cutting out from the recognition target image, by knowledge processing, the blocks considered most suitable as characters, as divided images; an individual character recognition step of performing individual character recognition on each divided image; a character string construction step of applying previously given knowledge about character strings to the individual character recognition results to construct an appropriate character string, and outputting it as the final processing result; a correction processing step of entering the correct character string when a reject or an error is output as the final processing result; and a learning step of, when the correction processing step has been performed, acquiring knowledge for obtaining the correct preprocessing result using the correct character string as a key, and updating the knowledge used in the preprocessing step with the acquired knowledge.
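The control flow of the five steps can be sketched as follows. This is a minimal outline under stated assumptions, not the claimed implementation: every callable (`segment`, `recognize_char`, `build_string`, `ask_operator`, `learn`) is a hypothetical placeholder supplied by the caller, a reject is represented by `None`, and only rejects trigger correction here, because an erroneous (non-rejected) result can only be identified by a person.

```python
from typing import Any, Callable, List, Optional


def recognize_item(
    image: Any,
    knowledge: Any,
    segment: Callable[[Any, Any], List[Any]],
    recognize_char: Callable[[Any], str],
    build_string: Callable[[List[str]], Optional[str]],
    ask_operator: Callable[[Any], str],
    learn: Callable[[Any, str, Any], None],
) -> str:
    """Run one item through the steps; fall back to manual correction on reject."""
    blocks = segment(image, knowledge)                   # preprocessing step
    chars = [recognize_char(block) for block in blocks]  # individual character recognition step
    result = build_string(chars)                         # character string construction step
    if result is not None:
        return result
    truth = ask_operator(image)                          # correction processing step
    learn(image, truth, knowledge)                       # learning step updates preprocessing knowledge
    return truth
```

Passing the stages in as callables keeps the sketch independent of any particular segmentation or recognition technique, in line with the statement that the preprocessing itself may use existing methods.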

[0017] The character recognition device of the present invention performs character recognition on a recognition target image in which characters are written in free format, and comprises: preprocessing knowledge storage means for storing the knowledge needed for the preprocessing of character recognition; character string area detection means for detecting the desired character string area in the recognition target image using knowledge given in advance to the preprocessing knowledge storage means; character segmentation means for using the knowledge stored in the preprocessing knowledge storage means to divide the detected character string area image into the blocks considered most suitable as characters and obtain divided images; individual character recognition means for recognizing each divided image as a character; knowledge processing means for applying previously given knowledge about character strings to the individual character recognition results, constructing an appropriate character string, and outputting it as the final processing result; correction processing means for making corrections when a reject or an error is output as the final processing result; and learning means for learning the knowledge in the preprocessing knowledge storage means. When correction has been performed by the correction processing means, knowledge for obtaining the correct preprocessing result is acquired using the correct data as a key, and on the basis of the acquired knowledge, knowledge is accumulated in the preprocessing knowledge storage means and the knowledge there is updated.

[0018] That is, in the present invention the character string area detection means detects the desired character string area in a free-format recognition target, such as an address written on an envelope, using knowledge given in advance to the preprocessing knowledge storage means as a database, processing parameters, or the like. The character string portion obtained by the character string area detection means is then divided by the character segmentation means, likewise using the knowledge stored in the preprocessing knowledge storage means, into the blocks considered most suitable as characters. Next, the individual character recognition means performs character recognition, and the knowledge processing means constructs the most appropriate character string on the basis of its stored knowledge, such as addresses and names, and outputs it as the final processing result. Because recognition by a character recognition device can end in a reject or contain errors, the characters or character strings that the OCR could not recognize correctly are entered by hand in the correction processing means to produce the correct recognition result.

[0019] In the present invention, the character recognition device is given a function that, using the correct data obtained by the correction processing means as a key, repeats and re-executes the series of processes starting from preprocessing to acquire the knowledge, parameters, and so on that yield the correct preprocessing result, and then accumulates and updates them in the preprocessing knowledge storage means. The preprocessing itself uses existing techniques, yet the invention provides a character recognition method and device that achieve preprocessing best suited to the conditions of actual operation and to the images actually input.

[0020]

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Preferred embodiments of the present invention will now be described with reference to the drawings. FIG. 1 is a block diagram showing the configuration of a character recognition device according to a first embodiment of the present invention.

[0021] The description here takes as an example the case of free-format character recognition performed to sort mail. FIG. 2 shows a sealed letter 10 to which a stamp is affixed, on which a company logo (a company emblem or the like) is printed, and on which an address 11 and a sender 12 are written. In this description, the block-shaped area in which the address is written (the address description area 13 enclosed by the broken line in the figure) is detected, as the recognition target area, in the recognition target image obtained by scanning this letter; the character string area of each line of the address is then extracted from the address description area 13; character segmentation is applied to each character string area; and the written character strings are recognized. Although the address description area 13 is drawn with a broken line in FIG. 2, no such broken line (area) appears on the actual letter 10; the block area is recognized and extracted inside the character recognition device only as a result of the preprocessing described later. In the example of FIG. 2, the character string areas of the address lines are four in all: the area containing "〒183-0036", the area containing "東京都府中市日新町1-10" (1-10 Nisshincho, Fuchu-shi, Tokyo), the area containing "○△○△株式会社" (○△○△ Co., Ltd.), and the area containing "府中事業場第１技術部御中" (Attn: 1st Engineering Department, Fuchu Plant).

[0022] The character recognition device shown in FIG. 1 comprises: an image input unit 21 that reads the processing object (here, the sealed letter 10) with a photoelectric conversion element such as a CCD (charge-coupled device) sensor and captures it as a recognition target image, which is binary or multi-valued digital image data; a preprocessing knowledge storage unit 22 that stores in advance the knowledge used for preprocessing, in the form of a format database, processing parameters, or the like; a character string area detection unit 23 that uses the knowledge stored in the preprocessing knowledge storage unit 22 to extract the address description area 13 as a block-shaped area from the recognition target image acquired by the image input unit 21 and to detect character string areas in the extracted address description area 13; a character segmentation unit 24 that likewise uses the knowledge stored in the preprocessing knowledge storage unit 22 to divide each detected character string area image into the blocks considered most suitable as characters; an individual character recognition unit 25 that recognizes each divided image as a character; and a knowledge processing unit 26 that applies previously given knowledge about character strings (place name information and the like) to the individual character recognition results, constructs an appropriate character string, and outputs it as the final processing result. The device further comprises a correction processing unit 27, in which corrections are made by hand when the knowledge processing unit 26 outputs a reject or an error as the final processing result, and a learning mechanism unit 20 that updates the knowledge stored in the preprocessing knowledge storage unit 22 in order to learn it.

[0023] In this character recognition device, the units from the character string area detection unit 23 through the knowledge processing unit 26 can all exchange information with one another. In particular, the processing result of the knowledge processing unit 26 is fed back to the character string area detection unit 23 and the character segmentation unit 24, and when correction has been performed in the correction processing unit 27, the content of that correction is also fed back to the character string area detection unit 23 and the character segmentation unit 24. The learning mechanism unit 20 learns the knowledge in the preprocessing knowledge storage unit 22 according to what is fed back to the character string area detection unit 23 and the character segmentation unit 24.

[0024] The knowledge processing unit 26 has, for example, a place name dictionary as its knowledge base, and by knowledge processing it outputs the final recognized characters (string) from the individual character recognition results output by the individual character recognition unit 25. With knowledge processing, even if the individual character recognition unit 25 misrecognizes the character "王" as "玉", the string can still be recognized correctly as "八王子市" (Hachioji City, a city in western Tokyo), because the place name "八王子市" exists while "八玉子市" does not.
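A minimal sketch of this kind of dictionary-based correction follows. The matching rule used here, a per-position mismatch count (Hamming distance) against same-length dictionary entries, is an assumption chosen for brevity; the patent does not specify how the place name dictionary is matched. The function name, the `max_mismatch` parameter, and the three-entry dictionary are likewise illustrative.

```python
from typing import Iterable, Optional


def correct_with_dictionary(
    recognized: str,
    place_names: Iterable[str],
    max_mismatch: int = 1,
) -> Optional[str]:
    """Return the unique closest dictionary entry, or None (reject) if there is none."""
    best: Optional[str] = None
    best_dist = max_mismatch + 1
    tie = False
    for name in place_names:
        if len(name) != len(recognized):
            continue  # only same-length entries are compared in this sketch
        dist = sum(a != b for a, b in zip(name, recognized))
        if dist < best_dist:
            best, best_dist, tie = name, dist, False
        elif dist == best_dist and name != best:
            tie = True  # two equally close entries: ambiguous
    return None if (best is None or tie) else best


names = ["八王子市", "府中市", "三鷹市"]
print(correct_with_dictionary("八玉子市", names))  # the misread "玉" is replaced via the dictionary
```

Returning `None` when no entry is close enough, or when two entries are equally close, corresponds to the reject output described for the knowledge processing unit.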

[0025] In particular, the knowledge processing unit 26 of the character recognition device described here stores, as knowledge, information about formats together with information about character strings, and it has a function for generating preprocessing candidates top-down by knowledge processing, feeding back directly from the knowledge processing unit 26 to the character string area detection unit 23 and the character segmentation unit 24. That is, when this device performs free-format character recognition, at the stage where knowledge processing constructs a character string it judges whether the preprocessing result that produced the string is appropriate and feeds that judgment back to preprocessing. Because this top-down processing is used, the flow of information from the character string area detection unit 23 to the knowledge processing unit 26 is not a single fixed path. For example, processing up to knowledge processing may be carried out on several recognition target area candidates to find the most appropriate recognition target area, after which the final result is obtained by running again from preprocessing with the optimal parameters, processes, and processing order.
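The example of evaluating several recognition target area candidates can be sketched as a simple selection loop. The scoring interface is an assumption: `run_pipeline` is a hypothetical callable that runs a candidate area through knowledge processing and returns the constructed string (or `None` for a reject) together with a confidence score, and `min_score` is an illustrative threshold.

```python
from typing import Callable, Optional, Sequence, Tuple, TypeVar

Region = TypeVar("Region")


def select_region(
    candidates: Sequence[Region],
    run_pipeline: Callable[[Region], Tuple[Optional[str], float]],
    min_score: float = 0.5,
) -> Optional[Tuple[Region, str]]:
    """Pick the candidate area whose knowledge processing result scores highest."""
    best: Optional[Tuple[Region, str]] = None
    best_score = min_score
    for region in candidates:
        text, score = run_pipeline(region)
        if text is None or score < min_score:
            continue  # rejected or too unreliable to consider
        if best is None or score > best_score:
            best, best_score = (region, text), score
    return best
```

After the best area is chosen this way, the text describes a second pass from preprocessing with the parameters that suit that area; the sketch covers only the selection.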

[0026] In the character recognition device shown in FIG. 1, the correct character string obtained by the correction processing unit 27 is supplied and the series of processes starting from the character string area detection unit 23 is executed repeatedly, so that the correct character string is detected and recognized from a processing object (such as a mail item) for which the correct answer could not initially be obtained. Using the preprocessing knowledge in effect when the correct answer was obtained, the learning mechanism unit 28 adds new knowledge to the preprocessing knowledge storage unit 22 or updates the knowledge stored there.

[0027] Next, the operation of the character recognition device shown in FIG. 1 will be described with reference to the flowchart shown in FIG. 3.

[0028] First, the image input unit 21 scans the object to be read (for example, a sealed letter) and captures it as a recognition target image, which is binary or multi-valued digital image data (step 101). For this recognition target image, the character string area detection unit 23 uses the knowledge stored in the preprocessing knowledge storage unit 22 to determine the block-shaped area in which the address is considered to be written (the destination address area 13 in FIG. 2) (step 102), and extracts from that area the character string area of each line as a character string area image (step 103). Next, the character segmentation unit 24 uses the knowledge stored in the preprocessing knowledge storage unit 22 to divide the character string area image into blocks, each considered optimal as a single character (step 104). The processing of step 104 is usually called character segmentation. The processing of steps 102 to 104 is generally referred to collectively as preprocessing.

[0029] The preprocessing knowledge storage unit 22 stores the knowledge that the character string area detection unit 23 needs to detect the block-shaped destination address area: for example, typical layout formats (layout patterns) estimated in advance for the processing objects, processing parameters needed to select the most appropriate format from several formats, or the processing procedures themselves. A layout format or layout pattern is knowledge data such as the following: for a certain type of envelope, the address is written within an area of a given height and width whose upper-left vertex lies a given number of centimeters below and a given number of centimeters to the right of the upper-left vertex of the envelope. Because the layout format cannot always be estimated in advance, the preprocessing knowledge storage unit 22 preferably also stores parameters and processing procedures used to estimate and detect suitable candidate areas from, for example, the character size.
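One way to picture a layout format of this kind is as a small record holding the offset and size of the address block. The sketch below is only an assumption about how such knowledge data might be represented; the class name `LayoutFormat`, its fields, and the sample values are hypothetical and do not come from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LayoutFormat:
    """A hypothetical layout-format record: where the address block sits."""
    name: str
    offset_down_cm: float   # distance below the envelope's upper-left vertex
    offset_right_cm: float  # distance to the right of that vertex
    height_cm: float
    width_cm: float

    def to_pixels(self, dpi: int):
        """Convert to a (top, left, bottom, right) pixel box at the given DPI."""
        px = dpi / 2.54  # pixels per centimeter
        top = round(self.offset_down_cm * px)
        left = round(self.offset_right_cm * px)
        bottom = round((self.offset_down_cm + self.height_cm) * px)
        right = round((self.offset_right_cm + self.width_cm) * px)
        return top, left, bottom, right


# Illustrative values only: a block 1 inch down, 2 inches right, 1 x 4 inches.
fmt = LayoutFormat("window_envelope", 2.54, 5.08, 2.54, 10.16)
box = fmt.to_pixels(dpi=100)
```

Storing the format in physical units and converting at scan time keeps the record independent of scanner resolution.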

[0030] The preprocessing knowledge storage unit 22 further stores, as knowledge used by the character segmentation unit 24, parameters such as the character size assumed during segmentation and the method of estimating character pitch, as well as the segmentation procedure itself. Here the parameters used for knowledge-based preprocessing are stored together in a single preprocessing knowledge storage unit 22, but in actual operation they may be stored separately for each individual process. When a plurality of processing methods are available for preprocessing, the preprocessing knowledge storage unit 22 may also store knowledge for selecting a processing method according to the recognition target image.

[0031] The individual divided images produced by the character segmentation unit 24 in step 104 are then subjected to character recognition by the individual character recognition unit 25 (step 105), assembled into an appropriate character string by the knowledge processing unit 26 (step 106), and output as the final processing result (step 107).

[0032] The processing up to step 107 yields the final result of the character recognition device (OCR). However, a failure at any stage (detection of the destination address area in step 102, character string area detection in step 103, character segmentation in step 104, individual character recognition in step 105, or knowledge processing in step 106) may cause the final output to be rejected or to contain errors. This character recognition device therefore determines whether the result is a reject or contains an error (step 108). As in ordinary character recognition devices, a correction processing unit 27 is provided; for characters or character strings that were not recognized correctly by the processing up to step 107, an operator manually enters the correct characters or character strings as correction processing, producing a correct recognition result (step 109). If it is determined in step 108 that the result is neither a reject nor erroneous, the processing ends.

[0033] The correction processing unit 27 generally consists of a display device (for example, a CRT) that shows the image read in step 101 and the intermediate results of each process up to the final result output, and an input device such as a keyboard for entering the correct characters or character strings. Typical correction methods are for a person to read the characters or character string and enter them directly, or to select the correct one from a plurality of candidates presented by the character recognition device.

[0034] In a conventional character recognition device, processing ends once a correct recognition result has been obtained, including any manual correction (correction processing). In the character recognition device of this embodiment, when the device alone cannot obtain a correct result and the correct result is entered or corrected manually, the correct answer data entered manually at the correction processing unit 27 is fed back and the series of recognition processes from the individual character recognition unit 25 onward is performed again (step 110).

[0035] To recover from misjudgments or rejects in the preprocessing, the reprocessing in step 110 takes every possible preprocessing candidate, in descending order of the probability estimated from the preprocessing knowledge held at that point, and repeats the processing through the individual character recognition unit 25 and the knowledge processing unit 26 until it is determined in step 111 that the correct answer given by the correction processing unit 27 has been obtained. A preprocessing candidate is one of the candidate areas listed, each with a priority (likelihood), for the area (or divided image) to be extracted (detected) in each of the processes of steps 102 to 104. That is, the candidates are the areas extracted as candidates for the destination address area, the areas extracted as candidates for character string areas, and the areas extracted as candidates for single-character areas (divided images). If each of steps 102 to 104 as already executed extracted only one area, the preprocessing (steps 102 to 104) is executed again after the correction processing of step 109 so that a plurality of candidate areas are found for each. Alternatively, when the processing of step 110 is repeated, the preprocessing may be executed with different knowledge on each iteration.
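The retry loop of steps 110 and 111 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function name `relearn_from_correct_answer`, the `(likelihood, preprocessing)` pair representation, and the `recognize` callable (standing in for individual character recognition plus knowledge processing) are all hypothetical.

```python
def relearn_from_correct_answer(candidates, recognize, correct_text):
    """Try preprocessing candidates in descending likelihood (step 110).

    candidates: list of (likelihood, preprocessing) pairs.
    recognize: callable that runs recognition and knowledge processing on
        one preprocessing candidate and returns a string (None on reject).
    Returns the preprocessing that reproduces correct_text (step 111),
    or None if no candidate does.
    """
    ranked = sorted(candidates, key=lambda c: c[0], reverse=True)
    for _, preprocessing in ranked:
        if recognize(preprocessing) == correct_text:
            return preprocessing
    return None


# Stub recognizer: each candidate region maps to the text it would yield.
outputs = {"region_15": "東京都足立区", "region_16": "横浜市港南区"}
found = relearn_from_correct_answer(
    [(0.7, "region_15"), (0.3, "region_16")], outputs.get, "横浜市港南区"
)
```

The preprocessing returned here is what a learning step would then record as the correct preprocessing for this kind of object.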

[0036] If step 111 determines that the correct answer has been reached, the preprocessing that produced the correct answer is the correct preprocessing for that processing object. Accordingly, depending on what the character string area detection unit 23 and the character segmentation unit 24 did, the learning mechanism unit 28 stores new preprocessing knowledge corresponding to the correct preprocessing in the preprocessing knowledge storage unit 22, or updates the knowledge in the preprocessing knowledge storage unit 22 to correspond to the correct preprocessing. In other words, it learns the knowledge used for preprocessing.

[0037] The structure of the knowledge stored in the preprocessing knowledge storage unit 22 and the way it is stored depend on the existing preprocessing in use. For example, a new format may be stored, parameters may be changed, or the criteria for selecting a process at each stage of the preprocessing may be changed.

[0038] To avoid learning irregular formats that appear from time to time in free-format writing but do not normally occur, the preprocessing knowledge storage unit 22 may hold an appropriate threshold for each item of knowledge. Alternatively, to avoid storing rarely occurring processing objects as knowledge, the unit may be structured so that knowledge to be learned is reflected as preprocessing knowledge only once it has been obtained for a plurality of processing objects.
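The idea of adopting knowledge only after it has been seen for several processing objects could be sketched as a simple counter with a threshold. The class name `ThresholdedKnowledgeStore` and the default threshold of 2 are assumptions for illustration; the text does not give a concrete threshold or data structure.

```python
from collections import Counter


class ThresholdedKnowledgeStore:
    """Adopt a knowledge item only after enough independent observations."""

    def __init__(self, min_observations=2):
        self.min_observations = min_observations
        self._pending = Counter()  # items seen but not yet adopted
        self.knowledge = set()     # items adopted as preprocessing knowledge

    def observe(self, item):
        """Record one observation; return True once the item is adopted."""
        if item in self.knowledge:
            return True
        self._pending[item] += 1
        if self._pending[item] >= self.min_observations:
            self.knowledge.add(item)
            del self._pending[item]
            return True
        return False


store = ThresholdedKnowledgeStore(min_observations=2)
first = store.observe("format_A")   # seen once: not yet adopted
second = store.observe("format_A")  # seen twice: adopted
```

A one-off irregular format stays in the pending counter and never influences preprocessing unless it recurs.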

[0039] With this learning, the next time a similar processing object is presented, a correct result can be obtained from the data in the preprocessing knowledge storage unit 22, which now holds the newly learned knowledge. By repeating this and learning more patterns, preprocessing optimized for the operating conditions can be realized. Moreover, even if the preprocessing knowledge appropriate to the processing objects changes over the long term, the optimal preprocessing knowledge can be maintained automatically at all times.

[0040] The embodiment is described below using concrete examples.

[0041] FIG. 4 shows a case in which the address side of a sealed letter 10 contains two areas that appear to hold an address and an addressee name: area 15 ("〒123-4567 Adachi-ku, Tokyo ...") and area 16 ("〒234-5678 Konan-ku, Yokohama ..."). One of areas 15 and 16 is the destination address area and the other is the area holding the sender's address and name. It is generally held that the position on the sealed letter 10 reveals which is the true destination address area, but in some cases (direct mail in particular) the destination address area extraction performed by the character string area detection unit 23 cannot readily determine which one it is. By applying the character recognition method of this embodiment, which of areas 15 and 16 is the true destination address area is learned as preprocessing knowledge, so that the destination address area can thereafter be detected reliably when a similar sealed letter appears. For direct mail, where many similar sealed letters arrive in succession, learning from the first item allows the second and subsequent items to be processed quickly, without rejects or errors.

[0042] The knowledge (parameters) used to detect the destination address area can include the position of the area on the sealed letter (its two-dimensional position relative to one vertex of the letter taken as the reference point) and its size.

[0043] FIG. 5 shows another example: cutting out individual characters from a handwritten address. Parts (a) and (b) show handwritten examples of 「宇都宮市」 (Utsunomiya City, a city in Tochigi Prefecture) and 「八王子市」 (Hachioji City), respectively. According to the inventors' findings, the four kanji of 「宇都宮市」 tend to be written at roughly the same size, whereas in 「八王子市」 the character 「王」 tends to be written smaller than the others. If the character segmentation unit 24 assumes that all characters are written at roughly the same size, 「宇都宮市」 is segmented correctly but 「八王子市」 may be segmented incorrectly; for example, 「八」 and 「王」 may be merged so that the result is 「全子市」. With the character recognition device of this embodiment, when 「八王子市」 appears and results in a reject or an error, 「八王子市」 is entered during correction processing and the knowledge in the preprocessing knowledge storage unit 22 is learned, so that subsequent occurrences of 「八王子市」 are recognized correctly. When the device is used for mail sorting, sorting efficiency can thus be improved in regions with many mail items addressed to Hachioji City, such as the Tama area of Tokyo, without preparing region-specific preprocessing knowledge in advance.
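To make the effect of the uniform-size assumption concrete, the sketch below splits a text line into character boxes according to relative width weights. Equal weights correspond to the uniform-size assumption; the unequal profile is a purely hypothetical example of what learned segmentation knowledge might look like. The function name `cut_positions` and all numbers are assumptions, not values from the text.

```python
def cut_positions(line_width, relative_widths):
    """Split a line of the given pixel width into character boxes.

    relative_widths: one positive weight per expected character; equal
    weights reproduce the uniform-size assumption.
    Returns a list of (start, end) pixel ranges.
    """
    total = sum(relative_widths)
    boxes, start, acc = [], 0, 0.0
    for w in relative_widths:
        acc += w
        end = round(line_width * acc / total)
        boxes.append((start, end))
        start = end
    return boxes


uniform = cut_positions(360, [1, 1, 1, 1])
# Hypothetical learned profile: the second character is written narrower.
learned = cut_positions(360, [1, 0.6, 1, 1])
```

With the uniform profile the first cut falls at 90 pixels, while the hypothetical learned profile places the cuts at 100 and 160 pixels, so a smaller second character gets a narrower box of its own.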

[0044] FIG. 6 shows a modification of the character recognition device shown in FIG. 1. The character recognition device of the present invention need not be built integrally with an image input unit or image input device such as a scanner; it can perform character recognition on image data read in advance at another location. FIG. 6 shows such a device, in which an image storage unit 31 that stores recognition target images replaces the image input unit of the device shown in FIG. 1. Recognition target images are accumulated in the image storage unit 31 via, for example, a network or a removable storage medium (such as a magneto-optical disk), and are output from the image storage unit 31 to the character string area detection unit 23.

[0045] FIG. 7 shows yet another modification of the character recognition device shown in FIG. 1. In recent years, mail items with large amounts of advertising text on the address side, direct mail in particular, have been increasing. For such items, extraction of the destination address area yields many block-shaped areas that are regarded as sets of characters or character strings, and knowledge processing that uses only the position and size of the destination address area as parameters often finds it extremely difficult to identify the true destination address area. FIG. 8 shows an example of such a mail item as a character recognition target. Each rectangle drawn with a broken line represents a block-shaped area regarded as a set of characters or character strings. Only one of these rectangles is the true destination address area.

[0046] Conventionally, mail items like the one in FIG. 8 could only be sorted by hand, one at a time, which greatly reduced work efficiency. In the character recognition device shown in FIG. 7, the part of the preprocessing knowledge storage unit 22 of FIG. 1 that stores layout patterns is therefore separated out as a pattern storage unit 32, in which patterns can be updated and newly registered. Treating the arrangement of the areas regarded as sets of characters or character strings as a pattern, the pattern storage unit 32 stores each pattern together with information indicating which area in that pattern is the destination address area (recognition target area).

[0047] In this character recognition device, when the character string area detection unit 23 extracts the destination address area, it first extracts from the recognition target image the areas regarded as sets of characters or character strings. If at least a predetermined number of such areas are detected, it matches their arrangement against the patterns stored in the pattern storage unit 32 to determine which pattern the recognition target image belongs to, reads the information on the destination address area for that pattern from the pattern storage unit 32, and uses that information to extract the destination address area from among the block-shaped areas in the recognition target image. The destination address area can thus be extracted accurately even when there are many block-shaped areas that could be mistaken for it.
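The matching step can be pictured with a deliberately simple sketch: regions are boxes normalized to the image size, and a stored pattern matches when it has the same number of boxes and every coordinate agrees within a tolerance. The matching criterion, the `match_pattern` name, the `min_regions` and `tol` parameters, and the pattern representation are all assumptions; the text does not say how the pattern matching is performed. Sorting the boxes before comparison is a shortcut that can misalign boxes whose leading coordinates are nearly equal.

```python
def match_pattern(regions, patterns, min_regions=3, tol=0.05):
    """Return the destination address region, or None if no pattern matches.

    regions: (x, y, w, h) boxes normalized to the image size.
    patterns: list of (boxes, address_index) pairs, a hypothetical stand-in
        for the contents of the pattern storage unit 32.
    """
    if len(regions) < min_regions:
        return None  # too few regions: pattern matching is not attempted
    ordered = sorted(regions)
    for boxes, address_index in patterns:
        if len(boxes) != len(ordered):
            continue
        # Indices of the stored boxes in the same sorted order as `ordered`.
        ref = sorted(range(len(boxes)), key=lambda i: boxes[i])
        if all(abs(a - b) <= tol
               for i, r in zip(ref, ordered)
               for a, b in zip(boxes[i], r)):
            return ordered[ref.index(address_index)]
    return None


stored = [([(0.1, 0.1, 0.3, 0.1), (0.5, 0.4, 0.4, 0.2), (0.1, 0.7, 0.8, 0.2)], 1)]
detected = [(0.11, 0.69, 0.8, 0.2), (0.5, 0.41, 0.4, 0.2), (0.1, 0.1, 0.3, 0.1)]
address = match_pattern(detected, stored)
```

Here the second stored box is marked as the destination address area, so the detected box closest to it is returned.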

[0048] In the character recognition device shown in FIG. 7, the pattern storage unit 32, like the preprocessing knowledge storage unit 22, has new patterns added and existing patterns updated by the learning mechanism unit 28. That is, when the wrong area is extracted as the destination address area and the final recognition result is therefore a reject or an error, the operator designates the correct destination address area through the correction processing unit 27, and on that basis the learning mechanism unit 28 adds a pattern to the pattern storage unit 32 or updates one. Because patterns are learned in this way, the destination address area can be extracted accurately from the second item onward even for direct mail whose destination address area is difficult to extract.

[0049] Next, a character recognition device according to another embodiment of the present invention will be described with reference to FIG. 9. This device has almost the same configuration as the one shown in FIG. 1, but is configured so that the learning mechanism unit 29 trains the preprocessing knowledge storage unit 22 directly, according to the input to the correction processing unit 27, without going through the character string area detection unit 23 and the character segmentation unit 24.

[0050] As described above, the data finally output by the knowledge processing unit 26 is normally the final processing result of the character recognition device. By that stage, however, a plurality of character string area candidates and combinations of character segmentation candidates have often already been obtained, and free-format character recognition devices generally use these intermediate results in top-down processing as needed to arrive at a correct result. Thus, even when the final output is a reject or an error, the data passed to the correction processing unit 27 often includes information on a plurality of processing results that could be candidates. The device shown in FIG. 1 obtains the correct preprocessing knowledge by reprocessing everything from the input image on the basis of the character string entered at the correction processing unit 27. Instead, the plurality of processing result candidates that the knowledge processing unit 26 could not output as the final result can be passed to the correction processing unit 27 together with the corresponding preprocessing information, such as formats and processing parameters. If the correct answer entered at the correction processing unit 27 is among those candidates, the preprocessing knowledge to be learned can be determined from the preprocessing information attached to that candidate, without reprocessing everything. In the device shown in FIG. 9, therefore, when the correct answer entered at the correction processing unit 27 is among the processing result candidates, that candidate and its preprocessing information are fed back to the preprocessing knowledge storage unit 22 via the learning mechanism unit 29, thereby training the preprocessing knowledge storage unit 22.

[0051] FIG. 10 is a flowchart explaining the character recognition processing in the character recognition device shown in FIG. 9.

[0052] As in the procedure shown in FIG. 3, after the image is read (step 121), the preprocessing is executed: extraction of the destination address area (step 122), extraction of the character string areas (step 123), and character segmentation (step 124). Individual characters are then recognized (step 125), the recognized character string is determined by knowledge processing (step 126), the final result is output (step 127), and it is determined whether the result is a reject or contains an error (step 128). If it is neither, the processing ends.

[0053] If, on the other hand, step 128 determines that the result is a reject or contains an error, correction processing is performed at the correction processing unit 27 by selecting from among the processing result candidates (step 129), the learning mechanism unit 29 updates the knowledge in the preprocessing knowledge storage unit 22 according to the selected candidate (step 130), and the processing ends.
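Steps 129 and 130 amount to looking up the correct answer among candidates already produced in the first pass and recording the preprocessing information attached to the match. The sketch below illustrates that idea only; the function name `learn_from_candidates`, the `(text, preprocessing_info)` pair layout, and the use of a plain list as the knowledge store are assumptions.

```python
def learn_from_candidates(candidates, correct_text, knowledge_store):
    """Learn from first-pass candidates without reprocessing the image.

    candidates: list of (text, preprocessing_info) pairs kept from the
        first pass.
    knowledge_store: a list standing in for the preprocessing knowledge
        storage unit 22.
    Returns True if the correct answer was among the candidates.
    """
    for text, info in candidates:
        if text == correct_text:
            knowledge_store.append(info)  # step 130: update the knowledge
            return True
    return False  # correct answer was never a candidate: nothing is learned


store = []
first_pass = [
    ("全子市", {"char_size": "uniform"}),
    ("八王子市", {"char_size": "variable"}),
]
learned = learn_from_candidates(first_pass, "八王子市", store)
```

The `False` branch corresponds to the case where the correct result was never generated as a candidate, in which this approach cannot learn anything.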

[0054] Because the character recognition device shown in FIG. 9 does not need to reprocess everything, learning takes little time, which makes the device well suited to learning during operation. However, the output information generated as candidates tends to become large, and if the correct result was never adopted as a candidate because the knowledge originally provided rated it as extremely unlikely, no learning takes place no matter how many items are processed.

[0055] Character recognition devices according to preferred embodiments of the present invention have been described above. Each of these devices can also be realized by loading a computer program that implements it into a computer such as a workstation or personal computer and executing the program. The program for performing character recognition is loaded into the computer from a recording medium such as a magnetic tape or CD-ROM. FIG. 12 is a block diagram showing the configuration of a computer that executes the character recognition processing described above.

[0056] This computer consists of an image input device 50 such as a scanner, a central processing unit (CPU) 51, a hard disk device 52 for storing programs and data, a main memory 53, an input device 54 such as a keyboard or mouse, a display device 55 such as a CRT, and a reading device 56 that reads a recording medium 57 such as a magnetic tape or CD-ROM. The image input device 50, hard disk device 52, main memory 53, input device 54, display device 55, and reading device 56 are all connected to the central processing unit 51. A recording medium 57 storing the program for character recognition processing is mounted in the reading device 56, and the program is read from the recording medium 57 and stored on the hard disk device 52. The central processing unit 51 then executes the program stored on the hard disk device 52, so that character recognition processing based on the procedures described above is performed on a recognition target image captured via the image input device 50 or stored in advance on the hard disk device 52 or elsewhere. The character recognition result is output from the central processing unit 51 for use by another device (not shown), such as a sorting machine.

[0057] The present invention is not limited to the embodiments described above. The character recognition method and apparatus of the present invention can also be applied to character recognition other than that for sorting mail, for example, character recognition for processing various slips and forms, or character recognition for reading various reports and in-house documents and automatically recognizing and classifying character strings that serve as keywords. In addition, the language (character type) subject to character recognition is not limited to Japanese (numerals, kana, kanji, and the like); for example, the present invention can also be applied to sorting mail whose addresses are written in the Roman alphabet, as in English.

【００５８】[0058]

[Effects of the Invention] As described above, the present invention concerns a character recognition method and a character recognition apparatus that process a free-format recognition target, detect the desired character string candidate area from among the many character strings, patterns, and the like contained in the target image, and perform character and character string recognition. By automatically learning the knowledge used in preprocessing from the data entered during manual calibration processing, the invention makes it possible to improve performance after the start of operation, and in particular to improve preprocessing performance, which has conventionally been difficult and required many man-hours.

[0059] In particular, because learning is based on the actual recognition targets, performance improvement optimized for the targets being processed becomes possible. Furthermore, even when the preprocessing knowledge appropriate for the processing targets changes over the long term, the optimum preprocessing knowledge can be retained automatically at all times. Moreover, because performance can be improved by optimizing the existing processing to suit operating conditions, performance gains can be achieved without adding hardware or modifying the processing program.
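As an illustration only, the learning effect described above can be sketched as a loop that retries each preprocessing candidate until the recognized string matches the operator's correct string, and then reinforces the candidate that succeeded. This is a minimal sketch under stated assumptions, not the patented implementation: the weight-based knowledge store, the rule names, and the toy recognizer are all hypothetical and do not appear in the specification.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class PreprocessingKnowledge:
    """Hypothetical knowledge store: one preference weight per preprocessing rule."""
    weights: Dict[str, float] = field(default_factory=dict)

    def rank(self, candidates: List[str]) -> List[str]:
        # Rules with a higher learned weight are tried first; ties keep input order.
        return sorted(candidates, key=lambda r: self.weights.get(r, 0.0), reverse=True)

    def reinforce(self, rule: str, step: float = 1.0) -> None:
        # Raise the weight of a rule that led to the correct string.
        self.weights[rule] = self.weights.get(rule, 0.0) + step


def learn_from_correction(
    image: str,
    correct_text: str,
    candidates: List[str],
    recognize: Callable[[str, str], str],
    knowledge: PreprocessingKnowledge,
) -> Optional[str]:
    """Re-run recognition per preprocessing candidate until the output matches
    the manually entered correct string, then update the stored knowledge."""
    for rule in knowledge.rank(candidates):
        if recognize(image, rule) == correct_text:
            knowledge.reinforce(rule)
            return rule
    return None  # no candidate reproduced the correct string; knowledge is unchanged


def toy_recognize(image: str, rule: str) -> str:
    """Stand-in for segmentation, individual recognition, and string construction.
    The 'image' is just text here, and the rules only differ in gap handling."""
    if rule == "merge_narrow_gaps":
        return image.replace(" ", "")
    return image  # "keep_spaces": leave the gaps as they are


if __name__ == "__main__":
    knowledge = PreprocessingKnowledge()
    rules = ["keep_spaces", "merge_narrow_gaps"]
    winner = learn_from_correction("1 0 8-0 0 1 4", "108-0014", rules, toy_recognize, knowledge)
    print(winner, knowledge.rank(rules))
```

In this sketch the learned rule is simply tried first on later inputs. A real system would need a richer knowledge representation than a single weight per rule, which the specification leaves open here.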

[Brief description of the drawings]

FIG. 1 is a block diagram illustrating the configuration of a character recognition device according to a first embodiment of the present invention.

FIG. 2 is a diagram illustrating an example of an image to be recognized.

FIG. 3 is a flowchart illustrating the procedure of character recognition processing using the character recognition device shown in FIG. 1.

FIG. 4 is a diagram illustrating an example of character recognition.

FIG. 5 is a diagram illustrating an example of character recognition.

FIG. 6 is a block diagram showing a modification of the character recognition device shown in FIG. 1.

FIG. 7 is a block diagram showing another modification of the character recognition device shown in FIG. 1.

FIG. 8 is a diagram illustrating a reading target having a large number of block-shaped areas.

FIG. 9 is a block diagram illustrating the configuration of a character recognition device according to a second embodiment of the present invention.

FIG. 10 is a flowchart illustrating the procedure of character recognition processing using the character recognition device shown in FIG. 9.

FIG. 11 is a block diagram showing a computer system suitably used in configuring the character recognition device of the present invention.

FIG. 12 is a block diagram showing the configuration of a conventional character recognition device.

[Explanation of symbols]

10 Envelope
11 Address
12 Sender
13 Address description area
21 Image input unit
22 Preprocessing knowledge storage unit
23 Character string area detection unit
24 Character cutout unit
25 Individual character recognition unit
26 Knowledge processing unit
27 Calibration processing unit
28, 29 Learning mechanism unit
31 Image file storage unit
32 Pattern storage unit

Claims


1. A character recognition method for performing character recognition on a recognition target image in which characters are written in free format, the method comprising: a preprocessing step of cutting out, by knowledge processing, blocks considered optimal as characters from the recognition target image as divided images; an individual character recognition step of performing individual character recognition on each of the divided images; a character string construction step of applying knowledge given in advance about character strings to the results of the individual character recognition, constructing an appropriate character string, and outputting it as a final processing result; a calibration processing step of inputting a correct character string when a rejection or an error is output as the final processing result; and a learning step of acquiring, with the correct character string as a key, knowledge for obtaining a correct preprocessing result, and updating the knowledge used in the preprocessing step with the acquired knowledge.

2. The character recognition method according to claim 1, wherein the preprocessing step includes: a step of extracting, by knowledge processing, a recognition target area, which is the area to be subjected to character recognition, from the recognition target image; a step of extracting, by knowledge processing, a character string area corresponding to the character string of each line from the recognition target area; and a step of obtaining the divided images by dividing, by knowledge processing, the image of the character string area into blocks considered optimal as characters.

3. The character recognition method according to claim 1, wherein a plurality of preprocessing candidates are generated in the preprocessing step, and in the learning step, the individual character recognition step and the character string construction step are performed on the basis of each of the preprocessing candidates until a character string that matches the correct character string input in the calibration processing step is constructed, and the knowledge used in the preprocessing step is then updated on the basis of the knowledge corresponding to the preprocessing candidate with which the matching character string was constructed.

4. The character recognition method according to claim 1, wherein a plurality of pieces of processing result information that can be candidates for the recognition result are output together with the final processing result, and when one of the pieces of processing result information is selected as the correct character string in the calibration processing step, the knowledge used in the preprocessing step is updated in the learning step on the basis of the preprocessing information corresponding to the selected processing result information.

5. The character recognition method according to claim 2, wherein arrangements of a plurality of areas that are candidates for the recognition target area are stored in advance as patterns, and the recognition target area is extracted by pattern matching between the stored patterns and the area arrangement extracted from the recognition target image.

6. A character recognition apparatus for performing character recognition on a recognition target image in which characters are written in free format, comprising: preprocessing knowledge storage means for storing knowledge required for the preprocessing of character recognition; character string area detection means for detecting a desired character string area from the recognition target image using the knowledge given in advance to the preprocessing knowledge storage means; character cutout means for dividing, using the knowledge stored in the preprocessing knowledge storage means, the image of the detected character string area into blocks considered optimal as characters to obtain divided images; individual character recognition means for recognizing each divided image as a character; knowledge processing means for applying knowledge given in advance about character strings to the individual character recognition results, constructing an appropriate character string, and outputting it as a final processing result; and calibration processing means for performing calibration when a rejection or an error is output as the final processing result, wherein, when the calibration processing means performs the calibration processing, knowledge for obtaining a correct preprocessing result is acquired using the correct data as a key, and the acquired knowledge is accumulated in the preprocessing knowledge storage means to update the knowledge in the preprocessing knowledge storage means.

7. The character recognition apparatus according to claim 6, further comprising image input means for capturing a recognition target as binary or multi-valued image data and supplying the captured data as the recognition target image.

8. The character recognition apparatus according to claim 6, wherein the character string area detection means extracts, by knowledge processing, a recognition target area, which is the area to be subjected to character recognition, from the recognition target image, and extracts, from the recognition target area, a character string area corresponding to the character string of each line.

9. The character recognition apparatus according to claim 8, further comprising a pattern storage unit for storing in advance, as patterns, arrangements of a plurality of areas that are candidates for the recognition target area, wherein the character string area detection means extracts the recognition target area by pattern matching between the patterns stored in the pattern storage unit and the area arrangement extracted from the recognition target image, and when the calibration processing is performed by the calibration processing means, a correct preprocessing result is obtained using the correct data as a key, and the patterns in the pattern storage unit are updated on the basis of the result obtained.

10. A computer-readable recording medium storing a program for causing a computer to execute: a preprocessing step of cutting out, by knowledge processing, blocks considered optimal as characters, as divided images, from a recognition target image in which characters are written in free format; an individual character recognition step of performing individual character recognition on each of the divided images; a character string construction step of applying knowledge given in advance about character strings to the results of the individual character recognition, constructing an appropriate character string, and outputting it as a final processing result; a calibration processing step of inputting a correct character string when a rejection or an error is output as the final processing result; and a learning step of acquiring, with the correct character string as a key, knowledge for obtaining a correct preprocessing result, and updating the knowledge used in the preprocessing step with the acquired knowledge.

11. The recording medium according to claim 10, wherein the preprocessing step includes: a step of extracting, by knowledge processing, a recognition target area, which is the area to be subjected to character recognition, from the recognition target image; a step of extracting, by knowledge processing, a character string area corresponding to the character string of each line from the recognition target area; and a step of obtaining the divided images by dividing, by knowledge processing, the image of the character string area into blocks considered optimal as characters.

12. The recording medium according to claim 10, wherein a plurality of preprocessing candidates are generated in the preprocessing step, and in the learning step, the individual character recognition step and the character string construction step are performed on the basis of each of the preprocessing candidates until a character string that matches the correct character string input in the calibration processing step is constructed, and the knowledge used in the preprocessing step is then updated on the basis of the knowledge corresponding to the preprocessing candidate with which the matching character string was constructed.

13. The recording medium according to claim 10, wherein a plurality of pieces of processing result information that can be candidates for the recognition result are output together with the final processing result, and when one of the pieces of processing result information is selected as the correct character string in the calibration processing step, the knowledge used in the preprocessing step is updated in the learning step on the basis of the preprocessing information corresponding to the selected processing result information.