JP3466899B2

JP3466899B2 - Character recognition device and method, and program storage medium

Info

Publication number: JP3466899B2
Application number: JP00220698A
Authority: JP
Inventors: 眞紀矢吹
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1998-01-08
Filing date: 1998-01-08
Publication date: 2003-11-17
Anticipated expiration: 2018-01-08
Also published as: JPH11203405A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、１文字入力欄を持
つ文書に記入される文字を認識する文字認識装置及び方
法と、その文字認識装置の実現に用いられるプログラム
が記憶されるプログラム記憶媒体とに関し、特に、文字
認識精度の向上を実現する文字認識装置及び方法と、そ
の文字認識装置の実現に用いられるプログラムが記憶さ
れるプログラム記憶媒体とに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device and method for recognizing a character entered in a document having a one-character input field, and a program storage medium for storing a program used to implement the character recognition device. In particular, the present invention relates to a character recognition device and method for improving the accuracy of character recognition, and a program storage medium in which a program used for realizing the character recognition device is stored.

【０００２】１文字入力欄を持つ文書に金額などの文字
を記入させることが多い。これから、１文字入力欄に記
入される文字を認識する文字認識装置が必要となる。こ
のような文字認識装置は、文字認識精度の向上を図るこ
とで実用的なものにしていく必要がある。[0002] In many cases, a document having a one-character input field is made to be filled with characters such as amount of money. From this, a character recognition device for recognizing a character entered in the one-character input field is required. It is necessary to make such a character recognition device practical by improving the character recognition accuracy.

【０００３】[0003]

【従来の技術】１文字入力欄を持つ文書に記入される手
書き文字は、図１９に示すように、入力欄の指定する枠
を超えて記入されたり、文字と文字とが繋がる形態で記
入されることが多く、認識が非常に難しいという問題点
を抱えている。2. Description of the Related Art As shown in FIG. 19, handwritten characters entered in a document having a one-character input field are entered beyond the frame specified by the input field or in a form in which characters are connected to each other. However, it has a problem that recognition is very difficult.

【０００４】更に説明するならば、この問題点は、ファ
ックスやコピーなどで活字文字の画質が劣化する場合に
も起こるし、精度の悪いプリンタで印刷を行う場合にも
起こる。To explain further, this problem occurs when the image quality of printed characters is deteriorated by faxing or copying, and also when printing is performed by a printer with poor accuracy.

【０００５】このようなことを背景にして、本出願人
は、一連の特許出願で開示してきたように、１文字入力
欄を構成する枠と、１文字入力欄に記入された文字とを
分離するという技術を開発して、そのように分離された
文字を文字同士が分離していることを前提とする切り出
しアルゴリズムを使って切り出して、その文字切り出し
技術により切り出された文字を認識するという技術を開
発してきた。Against this background, the present applicant separates the frame forming the one-character input box from the characters entered in the one-character input box, as disclosed in a series of patent applications. A technology that develops a technology to cut out characters, cuts out such separated characters using a cutting out algorithm that assumes that the characters are separated, and recognizes the characters cut out by the character cutting out technology. Has been developed.

【０００６】そして、そのように分離された文字を処理
対象として文字と文字との間の続き文字部分を検出し、
それを削除することで文字を切り出すという技術を開発
して、その文字切り出し技術により切り出された文字を
認識するという技術を開発してきた。Then, the character separated in such a manner is processed, and a continuous character portion between the characters is detected,
We have developed a technology that cuts out characters by deleting them, and has developed a technology that recognizes the characters cut out by the character cutting technology.

【０００７】また、本出願人以外も、１文字入力欄に記
入される文字を認識する様々な技術が提案されている。[0007] In addition to the applicant, various techniques for recognizing a character entered in a one-character input box have been proposed.

【０００８】[0008]

【発明が解決しようとする課題】確かに、本出願人が開
発した文字認識技術は、１文字入力欄に記入された文字
を高精度に読み取れるという特徴がある。Certainly, the character recognition technology developed by the present applicant is characterized in that a character entered in a one-character input box can be read with high accuracy.

【０００９】しかしながら、手書き文字というものは実
に多様なものであり、また、活字文字の画質の劣化など
も実に多様なものである。これから、特定のアルゴリズ
ムに従っていたのでは、自ずと認識精度が限界が出るこ
とを避けられない。However, handwritten characters are very diverse, and the deterioration of the image quality of printed characters is also diverse. From now on, if the specific algorithm is followed, it is unavoidable that the recognition accuracy is limited.

【００１０】これから、従来技術に従っていると、１文
字入力欄に記入された文字の認識精度が必ずしも十分で
ないことが起こるという問題点を抱えていた。本発明は
かかる事情に鑑みてなされたものであって、１文字入力
欄を持つ文書に記入される文字を認識する構成を採ると
きにあって、その文字認識精度の向上を実現する新たな
文字認識装置及び方法の提供と、その文字認識装置の実
現に用いられるプログラムが記憶される新たなプログラ
ム記憶媒体の提供とを目的とする。Therefore, according to the prior art, there is a problem that the recognition accuracy of the character entered in the one-character input box may not always be sufficient. The present invention has been made in view of the above circumstances, and has a new character that realizes an improvement in the character recognition accuracy when adopting a configuration for recognizing a character entered in a document having a one-character input field. An object of the present invention is to provide a recognition device and method, and to provide a new program storage medium in which a program used to realize the character recognition device is stored.

【００１１】[0011]

【課題を解決するための手段】図１に本発明の原理構成
を図示する。図中、１は本発明を具備する文字認識装置
であって、罫線で区切られる１文字入力欄に記入される
文字を認識するもの、２は文字認識装置１に接続される
イメージスキャナであって、文書画像を読み取って文字
認識装置１に入力するものである。FIG. 1 shows the principle configuration of the present invention. In the figure, 1 is a character recognition device equipped with the present invention, which recognizes a character entered in a 1-character input field separated by ruled lines, and 2 is an image scanner connected to the character recognition device 1. The document image is read and input to the character recognition device 1.

【００１２】本発明の文字認識装置１は、イメージメモ
リ１０と、抽出手段１１と、取出手段１２と、第１の認
識手段１３と、第２の認識手段１４と、決定手段１５と
を備える。The character recognition device 1 of the present invention comprises an image memory 10, an extraction means 11, a takeout means 12, a first recognition means 13, a second recognition means 14 and a determination means 15.

【００１３】イメージメモリ１０は、罫線で区切られる
１文字入力欄に文字の記入される文書画像（２値化され
ている）を格納する。抽出手段１１は、入力された文書
画像の持つ罫線を抽出する。取出手段１２は、抽出手段
１１の抽出する罫線と入力された文書画像の持つ文字と
の接触文字部分を特定するとともに、その特定結果に従
って、入力された文書画像から文字のみを取り出す。The image memory 10 stores a document image (binarized) in which characters are entered in a 1-character input field delimited by ruled lines. The extraction means 11 extracts the ruled lines of the input document image. The extraction unit 12 specifies the contact character portion between the ruled line extracted by the extraction unit 11 and the character of the input document image, and extracts only the character from the input document image according to the specification result.

【００１４】第１の認識手段１３は、文字サイズに従っ
て、取出手段１２の特定する接触文字部分を有効とする
のか否かを決定することで、入力された文書画像から文
字を１文字ずつ切り出して認識処理を実行する。The first recognizing means 13 cuts out characters one by one from the input document image by determining whether or not the contact character portion specified by the extracting means 12 is valid according to the character size. Perform recognition processing.

【００１５】第２の認識手段１４は、取出手段１２の取
り出す文字の続き文字部分を検出し、それを削除するこ
とで、入力画像から文字を１文字ずつ切り出して認識処
理を実行する。The second recognizing means 14 detects the subsequent character portion of the character extracted by the extracting means 12 and deletes it, thereby cutting out the character one by one from the input image and executing the recognition processing.

【００１６】決定手段１５は、第１の認識手段１３の認
識結果と、第２の認識手段１４の認識結果とから、最終
的な文字の認識結果を決定する。ここで、本発明の文字
認識装置１の持つ機能は具体的にはプログラムで実現さ
れるものであり、このプログラムは、フロッピィディス
クなどに記憶されたり、サーバなどのディスクなどに記
憶され、それらから文字認識装置１にインストールされ
てメモリ上で動作することで、本発明を実現することに
なる。The deciding means 15 decides a final character recognition result from the recognition result of the first recognizing means 13 and the recognition result of the second recognizing means 14. Here, the function of the character recognition device 1 of the present invention is specifically realized by a program, and this program is stored in a floppy disk or the like, or is stored in a disk or the like of a server, etc. The present invention is realized by being installed in the character recognition device 1 and operating on the memory.

【００１７】このように構成される本発明の文字認識装
置１では、抽出手段１１が入力された文書画像の持つ罫
線を抽出すると、取出手段１２は、抽出された罫線と入
力された文書画像の持つ文字との接触文字部分を特定す
るとともに、その特定結果に従って、入力された文書画
像から文字のみを取り出す。In the character recognition apparatus 1 of the present invention thus constructed, when the extracting means 11 extracts the ruled lines of the input document image, the extracting means 12 extracts the ruled lines and the input document image. In addition to identifying the contact character portion with the possessed character, only the character is extracted from the input document image according to the identification result.

【００１８】これを受けて、第１の認識手段１３は、文
字同士が分離していることを前提する文字切り出しアル
ゴリズムに従い、文字サイズを使って、取出手段１２の
特定する接触文字部分を有効とするのか否かを決定する
ことで、入力された文書画像から文字を１文字ずつ切り
出して認識処理を実行する。In response to this, the first recognizing means 13 validates the contact character portion specified by the extracting means 12 by using the character size in accordance with the character cutting algorithm which assumes that the characters are separated from each other. By deciding whether or not to do so, the characters are cut out one by one from the input document image and the recognition processing is executed.

【００１９】一方、第２の認識手段１４は、文字サイズ
を考慮せずに、取出手段１２の取り出す文字の続き文字
部分を検出し、それを削除することで、入力画像から文
字を１文字ずつ切り出して認識処理を実行する。On the other hand, the second recognizing means 14 detects the subsequent character portion of the character extracted by the extracting means 12 without considering the character size, and deletes it to delete the character one by one from the input image. Cut out and execute the recognition process.

【００２０】このとき、第２の認識手段１４は、文字の
続き文字部分が存在しないことを判断するときには、図
２に示すように、文字の認識処理を実行しないように処
理する。At this time, when the second recognizing means 14 determines that there is no subsequent character portion of the character, the second recognizing means 14 does not execute the character recognizing process as shown in FIG.

【００２１】そして、決定手段１５は、この第１の認識
手段１３の認識結果と、第２の認識手段１４の認識結果
とを受けて、例えば、距離値の小さい方の認識結果を最
終的な認識結果として決定する。Then, the deciding means 15 receives the recognition result of the first recognizing means 13 and the recognition result of the second recognizing means 14, and finally gives the recognition result of the smaller distance value, for example. Determined as a recognition result.

【００２２】このように、本発明の文字認識装置１で
は、１文字入力欄の罫線を削除しつつ文字を認識する構
成を採るときにあって、図３に示すように、文字同士が
分離していることを前提とする文字切り出しアルゴリズ
ムを使って切り出される文字を認識対象（図３の右側）
として、文字の認識処理を実行するとともに、文字の続
き文字部分を削除することで切り出される文字を認識対
象（図３の左側）として、文字の認識処理を実行する構
成を採って、その２つの認識結果から最終的な文字の認
識結果を得るようにすることから、文字認識精度を従来
よりも向上できるようになる。As described above, in the character recognition device 1 of the present invention, when the character recognition is performed while deleting the ruled line of the one-character input field, the characters are separated from each other as shown in FIG. Recognition target character that is cut out using the character cutting algorithm that assumes that
, The character recognition processing is executed, and the character recognition processing is executed with the character cut out by deleting the subsequent character portion of the character as the recognition target (left side in FIG. 3). Since the final character recognition result is obtained from the recognition result, the character recognition accuracy can be improved as compared with the conventional case.

【００２３】[0023]

【発明の実施の形態】以下、金融文書に記入される手書
き数字（０〜９）を認識対象とする実施の形態に従って
本発明を詳細に説明する。BEST MODE FOR CARRYING OUT THE INVENTION The present invention will be described in detail below with reference to an embodiment in which handwritten numbers (0 to 9) entered in a financial document are recognized.

【００２４】図４に、本発明を具備する文字認識装置１
の一実施例を図示する。この実施例に従う本発明の文字
認識装置１は、図１で説明したイメージスキャナ２と、
図１で説明したイメージメモリ１０と、認識制御プログ
ラム２０と、第１の認識プログラム２１と、第２の認識
プログラム２２と、枠抽出プログラム２３と、接触文字
部分抽出プログラム２４とを備えている。FIG. 4 shows a character recognition device 1 having the present invention.
1 illustrates an example. The character recognition device 1 of the present invention according to this embodiment includes the image scanner 2 described in FIG.
The image memory 10 described in FIG. 1, the recognition control program 20, the first recognition program 21, the second recognition program 22, the frame extraction program 23, and the contact character portion extraction program 24 are provided.

【００２５】ここで、文字認識装置１に展開されるプロ
グラムは、フロッピィディスクや回線などを介してイン
ストールされることになる。この認識制御プログラム２
０は、イメージメモリ１０に格納される文書画像の持つ
手書き文字（罫線で区切られる１文字入力欄に記入され
ている）の認識要求が発行されると、図５の処理フロー
に示すように、先ず最初に、ステップ１で、枠抽出プロ
グラム２３を起動する。Here, the program developed in the character recognition device 1 is installed via a floppy disk or a line. This recognition control program 2
When 0 is issued as a recognition request for handwritten characters (written in a 1-character input field delimited by ruled lines) of the document image stored in the image memory 10, as shown in the processing flow of FIG. First, in step 1, the frame extraction program 23 is activated.

【００２６】このようにして起動されると、枠抽出プロ
グラム２３は、後述する構成に従って、文書画像の持つ
１文字入力欄の枠（罫線で区切られる文字の枠）を抽出
する。When activated in this way, the frame extracting program 23 extracts the frame of the one-character input field (the frame of characters separated by ruled lines) of the document image in accordance with the configuration described later.

【００２７】続いて、ステップ２で、枠抽出プログラム
２３による抽出処理が終了するのを待って、この抽出処
理が終了すると、ステップ３に進んで、接触文字部分抽
出プログラム２４を起動する。Subsequently, in step 2, after waiting for the extraction processing by the frame extraction program 23 to be completed, when this extraction processing is completed, the routine proceeds to step 3 to activate the contact character portion extraction program 24.

【００２８】このようにして起動されると、接触文字部
分抽出プログラム２４は、後述する構成に従って、枠抽
出プログラム２３の抽出結果を参照しつつ、１文字入力
欄の枠部分と重なる手書き文字部分（枠に接触する文字
部分）を抽出するとともに、その抽出結果に従って、文
書画像の持つ手書き文字のみを抽出する。When activated in this way, the contact character portion extraction program 24 refers to the extraction result of the frame extraction program 23 according to the configuration described later, while referring to the extraction result of the frame extraction program 23, a handwritten character portion ( (Character portion that touches the frame) is extracted, and only the handwritten character of the document image is extracted according to the extraction result.

【００２９】続いて、ステップ４で、接触文字部分抽出
プログラム２４による抽出処理が終了するのを待って、
この抽出処理が終了すると、ステップ５に進んで、第１
の認識プログラム２１を起動する。Then, in step 4, waiting for the extraction processing by the contact character portion extraction program 24 to end,
When this extraction process is completed, the process proceeds to step 5 and the first
The recognition program 21 is started.

【００３０】このようにして起動されると、第１の認識
プログラム２１は、接触文字部分抽出プログラム２４の
抽出結果を参照しつつ、後述する構成に従って、第２の
認識プログラム２２とは別の文字切り出しアルゴリズム
に従って手書き文字を１文字ずつ切り出して認識する。When activated in this manner, the first recognition program 21 refers to the extraction result of the contact character portion extraction program 24 and, in accordance with the configuration described later, a character different from the second recognition program 22. The handwritten characters are cut out one by one and recognized according to the cutout algorithm.

【００３１】続いて、ステップ６で、第１の認識プログ
ラム２１による手書き文字の認識処理が終了するのを待
って、この認識処理が終了すると、ステップ７に進ん
で、第２の認識プログラム２２を起動する。Then, in step 6, the handwriting character recognition process by the first recognition program 21 is completed, and when this recognition process is completed, the process proceeds to step 7 and the second recognition program 22 is executed. to start.

【００３２】このようにして起動されると、第２の認識
プログラム２２は、接触文字部分抽出プログラム２４の
抽出結果を参照しつつ、後述する構成に従って、第１の
認識プログラム２１とは別の文字切り出しアルゴリズム
に従って手書き文字を１文字ずつ切り出して認識する。When activated in this way, the second recognition program 22 refers to the extraction result of the contact character portion extraction program 24 and, in accordance with the configuration described later, a character different from the first recognition program 21. The handwritten characters are cut out one by one and recognized according to the cutout algorithm.

【００３３】続いて、ステップ８で、第２の認識プログ
ラム２２による手書き文字の認識処理が終了するのを待
って、この認識処理が終了すると、ステップ９に進ん
で、第１の認識プログラム２１の認識結果と、第２の認
識プログラム２２の認識結果とから、最終的な手書き文
字の認識結果を決定する。Subsequently, in step 8, the handwriting character recognition process by the second recognition program 22 is waited for, and when this recognition process is completed, the process proceeds to step 9 to execute the first recognition program 21. The final recognition result of the handwritten character is determined from the recognition result and the recognition result of the second recognition program 22.

【００３４】次に、枠抽出プログラム２３について説明
する。上述したように、枠抽出プログラム２３は、文書
画像の持つ１文字入力欄の枠を抽出する処理を行う。Next, the frame extraction program 23 will be described. As described above, the frame extracting program 23 performs the process of extracting the frame of the one-character input field of the document image.

【００３５】この枠抽出プログラム２３は、文書画像の
持つ枠を抽出するものであればどのような構成に従うも
のでもよいが、例えば、本出願人が出願した特開平６-3
09498 号や、特開平７-28937号や、特開平８-305796 号
や、特開平９-50527号で開示した構成のものを用いるこ
とが可能である。The frame extracting program 23 may have any structure as long as it extracts the frame of the document image. For example, Japanese Patent Laid-Open No. 6-3 filed by the present applicant.
It is possible to use those having the configurations disclosed in 09498, JP-A-7-28937, JP-A-8-305796, and JP-A-9-50527.

【００３６】図６に、本出願人が特開平８-305796 号な
どで開示した枠抽出プログラム２３の機能ブロック図を
図示する。図６の構成に従う枠抽出プログラム２３は、
連結パターン抽出部２３０と、投影部２３１と、直線検
出部２３２と、第１の４辺検出部２３３と、追跡部２３
４と、第２の４辺検出部２３５と、枠抽出部２３６とで
構成されている。FIG. 6 shows a functional block diagram of the frame extraction program 23 disclosed by the present applicant in Japanese Patent Laid-Open No. 8-305796. The frame extraction program 23 according to the configuration of FIG.
The connection pattern extraction unit 230, the projection unit 231, the straight line detection unit 232, the first four side detection unit 233, and the tracking unit 23.
4, a second four-side detection unit 235, and a frame extraction unit 236.

【００３７】連結パターン抽出部２３０は、イメージメ
モリ１０に格納される文書画像を入力して、縦、横、斜
めの８方向のいずれかで繋がっている８連結の連結パタ
ーンをラベリング処理により抽出する。このとき得られ
る連結パターンとしては、文字の接触していない枠、
枠に接触していない文字又はその一部、枠に接触し
ている文字又はその一部のいずれかである。The connection pattern extraction unit 230 inputs the document image stored in the image memory 10 and extracts an 8-connection connection pattern which is connected in any of eight directions of vertical, horizontal, and diagonal by a labeling process. . The connection pattern obtained at this time is a frame in which characters do not touch,
Either the character that is not in contact with the frame or a part thereof, or the character that is in contact with the frame or a part thereof.

【００３８】この抽出された連結パターンの中に、１文
字入力欄の文字枠を構成する直線が含まれている。投影
部２３１は、連結パターン抽出部２３０により抽出され
た連結パターンを水平方向と垂直方向に投影する。直線
検出部２３２は、投影部２３１により得られた水平方向
／垂直方向の投影情報から、水平線／垂直線を検出す
る。第１の４辺検出部２３３は、直線検出部２３２によ
り検出された水平線／垂直線から検出される矩形につい
て、その４辺を検出する。The extracted connection patterns include straight lines that form the character frame of the one-character input field. The projection unit 231 projects the connection pattern extracted by the connection pattern extraction unit 230 in the horizontal direction and the vertical direction. The straight line detection unit 232 detects horizontal lines / vertical lines from the horizontal / vertical projection information obtained by the projection unit 231. The first four-side detecting unit 233 detects the four sides of the rectangle detected from the horizontal line / vertical line detected by the straight line detecting unit 232.

【００３９】追跡部２３４は、直線検出部２３２／第１
の４辺検出部２３３で検出できない線幅の細い直線を求
めるために、設定される数ライン（例えば２〜３ライ
ン）の幅内で８連結する水平線と垂直線を追跡する。第
２の４辺検出部２３５は、追跡部２３４で追跡された水
平線／垂直線から検出される矩形について、その４辺を
検出する。The tracking unit 234 is a straight line detecting unit 232 / first
In order to obtain a straight line with a thin line width that cannot be detected by the four-side detection unit 233, the horizontal line and the vertical line that connect eight within the width of a set number of lines (for example, 2 to 3 lines) are traced. The second four-side detecting unit 235 detects the four sides of the rectangle detected from the horizontal / vertical lines tracked by the tracking unit 234.

【００４０】枠抽出部２３６は、第１の４辺検出部２３
３で検出された４辺から１文字入力欄の枠を抽出すると
ともに、第２の４辺検出部２３５で検出された４辺から
１文字入力欄の枠を抽出する。The frame extracting section 236 is provided in the first four side detecting section 23.
The frame of the one-character input field is extracted from the four sides detected in 3, and the frame of the one-character input field is extracted from the four sides detected by the second four-side detection unit 235.

【００４１】この図６の構成に従って、枠抽出プログラ
ム２３は、文書画像の持つ１文字入力欄の枠に関する予
備知識を持たなくても、文書画像の持つ１文字入力欄の
枠を抽出できるようになる。According to the configuration of FIG. 6, the frame extracting program 23 can extract the frame of the one-character input field of the document image without prior knowledge about the frame of the one-character input field of the document image. Become.

【００４２】次に、接触文字部分抽出プログラム２４に
ついて説明する。上述したように、接触文字部分抽出プ
ログラム２４は、１文字入力欄の枠部分と重なる手書き
文字部分（枠に接触する文字部分）を抽出するととも
に、その抽出結果に従って文書画像の持つ手書き文字の
みを抽出する処理を行う。Next, the contact character portion extraction program 24 will be described. As described above, the contact character portion extraction program 24 extracts the handwritten character portion (the character portion that contacts the frame) overlapping the frame portion of the one-character input field, and only the handwritten character of the document image is extracted according to the extraction result. Perform extraction processing.

【００４３】後述するように、第１及び第２の認識プロ
グラム２１，２２は、罫線の取り除かれた文書画像の持
つ手書き文字に対して認識処理を施すことになるが、こ
の手書き文字が枠と接触する場合には、枠を削除すると
手書き文字部分が欠落してしまうことになる。そこで、
この欠落部分を補間するために、接触文字部分抽出プロ
グラム２４を使って、１文字入力欄の枠部分と重なる手
書き文字部分を抽出する処理を行うのである。As will be described later, the first and second recognition programs 21 and 22 perform recognition processing on the handwritten characters of the document image from which the ruled lines are removed. In the case of contact, if the frame is deleted, the handwritten character portion will be lost. Therefore,
In order to interpolate this missing portion, the contact character portion extraction program 24 is used to perform processing for extracting a handwritten character portion that overlaps the frame portion of the one-character input field.

【００４４】この接触文字部分抽出プログラム２４は、
１文字入力欄の枠部分と重なる手書き文字部分を抽出す
るものであればどのような構成に従うものでもよいが、
例えば、本出願人が出願した特開平６-309498 号や、特
開平７-28937号や、特開平８-305796 号や、特開平９-5
0527号で開示した構成のものを用いることが可能であ
る。The contact character portion extraction program 24 is
Any configuration may be used as long as it extracts a handwritten character portion that overlaps the frame portion of the one-character input field.
For example, JP-A-6-309498, JP-A-7-28937, JP-A-8-305796, and JP-A-9-5 filed by the present applicant.
The configuration disclosed in No. 0527 can be used.

【００４５】図７に、本出願人が特開平８-305796 号な
どで開示した接触文字部分抽出プログラム２４の機能ブ
ロック図を図示する。図７の構成に従う接触文字部分抽
出プログラム２４は、連結パターン属性付加部２４０
と、枠分離部２４１と、交点算出部２４２と、交点対応
付け部２４３と、接触文字部分抽出部２４４とで構成さ
れている。FIG. 7 shows a functional block diagram of the contact character portion extraction program 24 disclosed by the present applicant in Japanese Patent Laid-Open No. 8-305796. The contact character portion extraction program 24 according to the configuration of FIG.
And a frame separation unit 241, an intersection calculation unit 242, an intersection correspondence unit 243, and a contact character portion extraction unit 244.

【００４６】連結パターン属性付加部２４０は、枠抽出
プログラム２３の連結パターン抽出部２３０により抽出
された連結パターンに対して、「枠」、「文字パターン
又はその一部」、「枠と文字パターン又はその一部との
接触パターン（接触文字パターン）」のいずれかの属性
を付加する。The concatenation pattern attribute adding unit 240, for the concatenation pattern extracted by the concatenation pattern extracting unit 230 of the frame extracting program 23, "frame", "character pattern or part thereof", "frame and character pattern or Any attribute of "contact pattern (contact character pattern) with a part thereof" is added.

【００４７】枠分離部２４１は、連結パターン属性付加
部２４０で「枠」又は「文字と枠との接触文字パター
ン」という属性の付加された連結パターンから枠を分離
する。具体的には、枠部分の辺の幅を算出し、それに基
づいて枠を除去する。そして、枠を除去したパターンに
ついて再びラベリングを施して、面積の小さいパターン
を雑音として除去し、連結パターン属性付加部２４０で
属性の付加されなかったパターンの内、枠を除去しても
残るパターンについては「接触文字パターン」の属性を
付加し、枠を除去したら何も残らないパターンについて
は「枠」だけの属性を付加することで行う。The frame separation unit 241 separates the frame from the connection pattern to which the connection pattern attribute adding unit 240 has added the attribute "frame" or "contact character pattern of character and frame". Specifically, the width of the side of the frame portion is calculated, and the frame is removed based on the calculated width. Then, the pattern from which the frame is removed is re-labeled to remove the pattern having a small area as noise, and among the patterns to which the attribute is not added by the concatenated pattern attribute adding unit 240, the patterns that remain after the frame is removed Is performed by adding the attribute of "contact character pattern", and adding the attribute of "frame" only to the pattern in which nothing remains after removing the frame.

【００４８】交点算出部２４２は、先ず最初に、接触文
字パターンについて、枠と文字との交点を算出し、続い
て、それらの全ての交点について、その位置から枠外方
向へ枠幅分程度まで文字線分を探索して枠外の交点を算
出するとともに、その探索した文字線分の面積を求め
る。続いて、この求めた文字線分の面積が閾値以下であ
るときには、その文字線分を雑音とみなして除去すると
ともに、その交点が文字と枠との交点でないと判断する
ことで行う。The intersection calculating unit 242 first calculates the intersections of the frame and the character in the contact character pattern, and then, at all of these intersections, from the position to the outside of the frame up to about the width of the character. The line segment is searched to calculate an intersection outside the frame, and the area of the searched character line segment is obtained. Subsequently, when the obtained area of the character line segment is equal to or smaller than the threshold value, the character line segment is regarded as noise and removed, and the intersection is determined not to be the intersection of the character and the frame.

【００４９】交点対応付け部２４３は、交点算出部２４
２で得られた交点情報に基づいて、枠と接触している文
字線分の方向性を求める。更に、枠の両側に接触してい
る２つの文字線分間の距離を求める。そして、この求め
た方向性及び距離と、この方向性に基づく文字線分の連
続性の条件とにより、文字と枠との各交点を対応付け
る。この対応付け処理により、図８（ａ）の例で説明す
るならば、交点Ａと交点Ｃとが対応付けられ、交点Ｂと
交点Ｄとが対応付けられることになる。The intersection associating unit 243 is used by the intersection calculating unit 24.
Based on the intersection point information obtained in 2, the directionality of the character line segment that is in contact with the frame is obtained. Further, the distance between the two character lines contacting both sides of the frame is calculated. Then, each intersection point between the character and the frame is associated with the obtained directionality and distance and the condition of the continuity of the character line segment based on this directionality. With this association processing, if explained in the example of FIG. 8A, the intersection A and the intersection C are associated with each other, and the intersection B and the intersection D are associated with each other.

【００５０】接触文字部分抽出部２４４は、交点対応付
け部２４３により対応付けられた交点により規定される
枠部分の画像を文字成分と判断する。そして、その判断
結果に従って、枠に影響されない形で文字パターンのみ
を抽出する。The contact character portion extraction unit 244 determines that the image of the frame portion defined by the intersections associated by the intersection association unit 243 is a character component. Then, according to the determination result, only the character pattern is extracted without being influenced by the frame.

【００５１】この図７の構成に従って、接触文字部分抽
出プログラム２４は、図８（ａ）に示す接触文字パター
ンから、図８（ｂ）に示すように、１文字入力欄の枠部
分と重なる手書き文字部分（図中のドット部分）を抽出
できるようになる。そして、この接触文字部分の抽出結
果に従って、図９に示すように、文書画像の持つ１文字
入力欄の枠を除去して、その１文字入力欄に記入される
手書き文字のみを抽出できるようになる。In accordance with the configuration of FIG. 7, the contact character portion extraction program 24 uses the contact character pattern shown in FIG. 8 (a) to handwrite as shown in FIG. The character part (dot part in the figure) can be extracted. Then, according to the extraction result of the contact character portion, as shown in FIG. 9, the frame of the one-character input field of the document image is removed so that only the handwritten character entered in the one-character input field can be extracted. Become.

【００５２】次に、第１の認識プログラム２１の実行す
る手書き文字認識処理について説明する。第１の認識プ
ログラム２１は、認識制御プログラム２０より起動され
ると、接触文字部分抽出プログラム２４の抽出結果を参
照しつつ、文字と文字とが分離していることを前提とす
る文字切り出しアルゴリズムを使って、文書画像から手
書き文字を切り出し、それに対する認識処理を実行す
る。Next, the handwritten character recognition processing executed by the first recognition program 21 will be described. When the first recognition program 21 is activated by the recognition control program 20, the first recognition program 21 refers to the extraction result of the contact character portion extraction program 24 and executes a character cutout algorithm on the assumption that characters are separated from each other. Using it, the handwritten character is cut out from the document image, and the recognition process for it is executed.

【００５３】この文字切り出しアルゴリズムとしては、
どのようなものを用いてもよいが、例えば、本出願人が
特開平８-305796 号で開示したものが使える。図１０
に、第１の認識プログラム２１の実行する処理フローの
一実施例を図示する。次に、この処理フローに従って、
第１の認識プログラム２１の実行する認識処理について
説明する。As this character segmentation algorithm,
Although any one may be used, for example, the one disclosed in Japanese Patent Application Laid-Open No. 8-305796 by the present applicant can be used. Figure 10
An example of a processing flow executed by the first recognition program 21 is illustrated in FIG. Then, according to this processing flow,
The recognition process executed by the first recognition program 21 will be described.

【００５４】第１の認識プログラム２１は、認識制御プ
ログラム２０より起動されると、図１０の処理フローに
示すように、先ず最初に、ステップ１で、接触文字部分
抽出プログラム２４の抽出情報を入手する。When the first recognition program 21 is activated by the recognition control program 20, as shown in the processing flow of FIG. 10, first, in step 1, the extraction information of the contact character portion extraction program 24 is obtained. To do.

【００５５】続いて、ステップ２で、文書画像に記入さ
れる手書き文字の平均文字サイズを算出する。この平均
文字サイズは、特開平８-305796 号では、接触文字部分
抽出プログラム２４により抽出された各文字のサイズ
（外接矩形で近似して求める）から、文書画像に記入さ
れる手書き文字の平均文字サイズを算出する構成を採っ
たが、簡略的な方法として、枠抽出プログラム２３で抽
出された１文字入力欄の大きさから算出するような構成
を採ることも可能である。Then, in step 2, the average character size of handwritten characters written in the document image is calculated. In Japanese Patent Laid-Open No. 8-305796, this average character size is the average size of handwritten characters written in the document image from the size of each character extracted by the contact character part extraction program 24 (approximately obtained by a circumscribed rectangle). Although the structure for calculating the size is adopted, as a simple method, a structure for calculating from the size of the one-character input field extracted by the frame extracting program 23 can be adopted.

【００５６】続いて、ステップ３で、接触文字部分抽出
プログラム２４により抽出された各文字のサイズを、ス
テップ２で算出した平均サイズと比較することで、接触
文字部分抽出プログラム２４で抽出した枠部分と重なる
手書き文字部分（接触文字部分）が本来の文字部分であ
るのか否かを判断して、本来の文字部分でないことを判
断するときには、それを削除していくことで文字を切り
出す。Subsequently, in step 3, the size of each character extracted by the contact character part extraction program 24 is compared with the average size calculated in step 2 to extract the frame part extracted by the contact character part extraction program 24. It is determined whether or not the handwritten character portion (contact character portion) overlapping with is the original character portion, and when it is determined that it is not the original character portion, the character is cut out by deleting it.

【００５７】１文字入力欄が設けられていることから、
金融文書に記入される「０」〜「９」の文字は、基本的
には、１文字入力欄の枠内に記入されることで他の文字
と切り離されているのであるが、文字の一部が枠を飛び
出して記入されたり、隣の文字と繋がって記入されるこ
とがある。そこで、接触文字部分抽出プログラム２４で
抽出した接触文字部分を有効とした場合に、１つの文字
サイズに収まらなくなってしまうときには、その抽出さ
れた接触文字部分を無効として削除し、１つの文字サイ
ズに収まるときには、その抽出された接触文字部分を有
効としていくことで正確な文字を切り出すように処理す
る。Since the one-character input field is provided,
The characters "0" to "9" entered in a financial document are basically separated from other characters by being entered in the frame of the one-character input field. Part may be filled out of the frame, or may be filled in by connecting with the adjacent character. Therefore, when the contact character portion extracted by the contact character portion extraction program 24 is valid and does not fit in one character size, the extracted contact character portion is invalidated and deleted, and the contact character portion is converted into one character size. When it fits, the extracted contact character portion is validated so that an accurate character is cut out.

【００５８】例えば、図１１（ａ）に示すように、
「２」と「５」と「３」とが続けて記入されることで、
１文字入力欄の枠を横切るようなときには、文字サイズ
が平均文字サイズよりも大きくなることを考慮して、接
触文字部分抽出プログラム２４で抽出した接触文字部分
を削除することで、「２」と「５」と「３」とを正確に
切り出すのである。For example, as shown in FIG.
By entering "2", "5", and "3" in succession,
When it crosses the frame of the one-character input box, the contact character portion extracted by the contact character portion extraction program 24 is deleted in consideration of the fact that the character size becomes larger than the average character size. "5" and "3" are cut out accurately.

【００５９】また、図１１（ｂ）に示すように、「１」
と「０」の一部が１文字入力欄の範囲を外れている場合
には、文字サイズが平均文字サイズに収まることを考慮
して、接触文字部分抽出プログラム２４で抽出した接触
文字部分を本来の文字として扱うことで、「１」と
「０」とを正確に切り出すのである。Further, as shown in FIG. 11B, "1"
If a part of "0" and "0" is out of the range of the 1-character input field, the contact character part extracted by the contact character part extraction program 24 is originally taken into consideration in that the character size fits within the average character size. "1" and "0" are accurately cut out by treating them as characters of.

【００６０】このようにして１文字入力欄に記入される
手書き文字を１文字ずつ切り出すと、続いて、ステップ
４に進んで、切り出した文字の中から未処理の文字を１
つ選択し、続くステップ５で、全ての文字を選び出した
のか否かを判断する。After the handwritten characters entered in the one-character input box are cut out one by one in this way, the process proceeds to step 4 and the unprocessed characters are cut out from the cut out characters.
Then, in step 5, it is determined whether or not all the characters have been selected.

【００６１】このステップ５で、全ての文字を選び出し
ていないことを判断するとき、すなわち、ステップ４
で、切り出した文字の中から未処理の文字を１つ選択で
きたことを判断するときには、ステップ６に進んで、そ
の選択した文字の持つ文字認識に用いる特徴量を算出す
る。このとき算出する特徴量としては、従来技術の文字
認識処理で提案されているどのようなものを用いてもよ
い。At this step 5, when it is judged that all the characters have not been selected, that is, step 4
Then, when it is determined that one unprocessed character can be selected from the cut out characters, the process proceeds to step 6, and the feature amount used for character recognition of the selected character is calculated. As the feature amount calculated at this time, any feature proposed in the conventional character recognition processing may be used.

【００６２】続いて、ステップ７で、その算出した特徴
量を使って、認識対象となる登録文字との間の距離を測
定する。金融文書では、「０」〜「９」の１０個の文字
が登録文字となるので、これらの１０個の登録文字との
間の距離を測定するのである。Then, in step 7, the calculated feature amount is used to measure the distance to the registered character to be recognized. In a financial document, 10 characters "0" to "9" are registered characters, and therefore the distance between these 10 characters is measured.

【００６３】続いて、ステップ８で、ステップ４で選択
した文字の認識結果として、ステップ７で測定した最も
距離の小さい登録文字を決定してから、次の文字の認識
に進むべくステップ４に戻っていく。Subsequently, in step 8, as the recognition result of the character selected in step 4, the registered character having the smallest distance measured in step 7 is determined, and then the process returns to step 4 to proceed to recognition of the next character. To go.

【００６４】そして、ステップ４ないしステップ８の処
理を繰り返すことで、ステップ５で、全ての文字を選び
出したことを判断するとき、すなわち、ステップ３で切
り出した手書き文字の文字認識を終了することを判断す
ると、ステップ９に進んで、ステップ８で決定した認識
結果の登録文字との間の距離の合計値を算出して、全処
理を終了する。Then, by repeating the processing of steps 4 to 8, when it is judged in step 5 that all the characters have been selected, that is, the character recognition of the handwritten characters cut out in step 3 is terminated. When it is determined, the process proceeds to step 9, the total value of the distance between the recognition result and the registered character determined in step 8 is calculated, and the entire process is ended.

【００６５】このようにして、第１の認識プログラム２
１は、１文字入力欄の枠部分に重なる文字部分を有効な
ものとしたり無効なものとしながら、文字と文字とが分
離していることを前提とする文字切り出しアルゴリズム
を使って、文書画像から手書き文字を１文字ずつ切り出
して、その切り出した文字に対して認識処理を施すこと
で文字認識を実行するのである。In this way, the first recognition program 2
1 is a document image that uses a character segmentation algorithm that presumes that characters are separated, while validating and invalidating the character portion that overlaps the frame portion of the 1-character input field. Characters are recognized by cutting out handwritten characters one by one and performing a recognition process on the cut out characters.

【００６６】この第１の認識プログラム２１の認識処理
により、例えば、図３の上段に示す続き文字の形態で記
入される「１００００」という手書き文字は、図３の右
側のように切り出されて、「１８８８６」と認識される
ことになる。そして、この認識結果に対して、各文字の
認識結果の距離の合計値として「５００」が算出される
ことになる。By the recognition processing of the first recognition program 21, for example, the handwritten character "10000" entered in the form of continuous characters shown in the upper part of FIG. 3 is cut out as shown on the right side of FIG. It will be recognized as "18886". Then, with respect to this recognition result, "500" is calculated as the total value of the distances of the recognition results of each character.

【００６７】次に、第２の認識プログラム２２の実行す
る認識処理について説明する。第２の認識プログラム２
２は、認識制御プログラム２０より起動されると、接触
文字部分抽出プログラム２４により１文字入力欄の枠の
取り外された手書き文字を認識対象として、文字と文字
とを繋ぐ続き文字部分を検出して、それを削除すること
で文書画像から手書き文字を切り出し、それに対する認
識処理を実行する。Next, the recognition processing executed by the second recognition program 22 will be described. Second recognition program 2
2, when the recognition control program 20 is activated, the contact character portion extraction program 24 detects a handwritten character from which the frame of the one-character input box is removed as a recognition target, and detects a continuous character portion connecting characters. , The handwritten character is cut out from the document image by deleting it, and the recognition process for it is executed.

【００６８】この文字切り出しアルゴリズムとしては、
どのようなものを用いてもよいが、例えば、本出願人が
特開平７-192094 号で開示したものが使える。図１２及
び図１３に、第２の認識プログラム２２の実行する処理
フローの一実施例を図示する。次に、この処理フローに
従って、第２の認識プログラム２２の実行する認識処理
について説明する。As the character segmentation algorithm,
Although any one may be used, for example, the one disclosed in Japanese Patent Application Laid-Open No. 7-192094 by the present applicant can be used. 12 and 13 illustrate an example of a processing flow executed by the second recognition program 22. Next, the recognition processing executed by the second recognition program 22 will be described according to this processing flow.

【００６９】第２の認識プログラム２１は、認識制御プ
ログラム２０より起動されると、図１２及び図１３の処
理フローに示すように、先ず最初に、ステップ１で、接
触文字部分抽出プログラム２４により抽出された手書き
文字のみの抽出情報を入手することで、手書き文字を構
成する連結パターンを入手する。When the second recognition program 21 is started by the recognition control program 20, as shown in the processing flows of FIGS. 12 and 13, first, in step 1, the contact character portion extraction program 24 extracts the second character recognition program 21. By obtaining the extracted information of only the written handwritten characters, the connection pattern forming the handwritten characters is obtained.

【００７０】続いて、ステップ２で、複数の文字が繋が
っている続き文字の候補として、ステップ１で読み取っ
た連結パターンの中から横長の連結パターンを抽出す
る。この抽出処理は、連結パターン毎に外接矩形を求め
て、その外接矩形の縦横の比率を算出し、所定の閾値と
比較することで行う。Subsequently, in step 2, a horizontally long concatenated pattern is extracted from the concatenated patterns read in step 1 as a candidate for a continuous character in which a plurality of characters are connected. This extraction processing is performed by obtaining a circumscribing rectangle for each connection pattern, calculating a vertical-horizontal ratio of the circumscribing rectangle, and comparing it with a predetermined threshold value.

【００７１】続いて、ステップ３で、ステップ２で抽出
した横長パターンの持つ水平続き線（続き文字部分を形
成する線）を抽出する処理を行う。このステップ３で
は、先ず最初に、「文字パターン面積／外接矩形の面
積」を算出し、その値に従って、抽出する直線の長さを
決定する。具体的には、「文字パターン面積／外接矩形
の面積」の値が大きいときには、長い水平線を抽出し、
この値が小さいときには、短い水平線を抽出する。Subsequently, in step 3, a process of extracting a horizontal continuous line (a line forming a continuous character portion) of the horizontally long pattern extracted in step 2 is performed. In this step 3, first, "character pattern area / area of circumscribing rectangle" is calculated, and the length of the straight line to be extracted is determined according to the value. Specifically, when the value of "character pattern area / area of circumscribed rectangle" is large, a long horizontal line is extracted,
When this value is small, a short horizontal line is extracted.

【００７２】すなわち、「文字パターン面積／外接矩形
の面積」の値が大きいということは、文字パターンのパ
ターン幅が大きいことを意味する。このようなときに、
短い直線を抽出するようにすると、本来の文字部分にも
多数の直線が存在し、それらが抽出されてしまうことに
なるからてある。また、「文字パターン面積／外接矩形
の面積」の値が小さいということは、文字パターンのパ
ターン幅が小さいことを意味する。このようなときに、
長い直線を抽出するようにすると、本来の水平線が抽出
されなくなってしまうからである。That is, a large value of "character pattern area / area of circumscribing rectangle" means that the pattern width of the character pattern is large. At times like this,
This is because if a short straight line is extracted, a large number of straight lines will be present in the original character portion and these will be extracted. A small value of "character pattern area / circumscribing rectangle area" means that the pattern width of the character pattern is small. At times like this,
This is because if a long straight line is extracted, the original horizontal line will not be extracted.

【００７３】このステップ３では、続いて、図１４
（ａ）に示すように、抽出する直線の長さに従って、文
字パターンを縦方向に分割し、その分割した範囲内で投
影の処理を行う。このとき、斜め線の存在を考慮して、
周囲の行の投影値を足し合わせる形で横方向に投影（い
わゆる隣接投影法）を行って、その投影値が所定の閾値
以上であるときには、その部分に直線が存在すると認識
して、その範囲を矩形近似して矩形の直線を形成する。In step 3, the process shown in FIG.
As shown in (a), the character pattern is vertically divided according to the length of the straight line to be extracted, and the projection process is performed within the divided range. At this time, considering the existence of diagonal lines,
When the projection values of surrounding lines are added together in the horizontal direction (so-called adjacent projection method) and the projection value is equal to or more than a predetermined threshold value, it is recognized that a straight line exists in that part, and the range Is approximated to a rectangle to form a rectangular straight line.

【００７４】このステップ３では、続いて、図１４
（ｂ）に示すように、接触する矩形直線を統合すること
で長い直線を抽出し、その中で最も長い直線を水平続き
線とする処理を行う。In step 3, the process shown in FIG.
As shown in (b), a long straight line is extracted by integrating contacting rectangular straight lines, and the longest straight line among them is processed as a horizontal continuation line.

【００７５】ステップ３で、水平続き線（続き文字部分
を形成する線）を抽出すると、続いて、ステップ４で、
水平続き線が検出されたのか否かを判断して、水平続き
線が検出されないことを判断するときには、ステップ５
に進んで、認識処理を実行しない旨を記録して処理を終
了する。When a horizontal continuation line (a line forming a continuation character portion) is extracted in step 3, then in step 4,
When it is determined whether or not the horizontal continuation line is detected and it is determined that the horizontal continuation line is not detected, step 5
Then, the process proceeds to step S3 and records that the recognition process is not executed, and ends the process.

【００７６】一方、ステップ４で、水平続き線が検出さ
れたことを判断するときには、ステップ６に進んで、垂
直分離線を決定する。この垂直分離線の決定処理は、図
１５（ａ）に示すように、矩形近似された水平続き線の
下辺の一方の端点から水平続き線を辿ることで文字パタ
ーンとの交差点を見つけ、そこから文字パターンの輪郭
の探索を開始して、水平続き線に辿りついたら輪郭の探
索を一時終了する。続いて、水平続き線を辿ることで次
の文字パターンとの交差点を見つけ、そこから文字パタ
ーンの輪郭の探索を再び開始して、水平続き線に辿りつ
いたら輪郭の探索を一時終了する。これを矩形近似され
た水平続き線の下辺のもう一方の端点に辿りつくまで繰
り返し行う。On the other hand, when it is judged in step 4 that the horizontal continuation line is detected, the process proceeds to step 6 to determine the vertical separation line. As shown in FIG. 15A, this vertical separating line determination processing finds an intersection with the character pattern by tracing the horizontal continuation line from one end point of the lower side of the rectangular continuous horizontal continuation line, The contour search of the character pattern is started, and when the horizontal continuation line is reached, the contour search is temporarily stopped. Subsequently, the horizontal continuation line is traced to find an intersection with the next character pattern, the search for the contour of the character pattern is restarted from that point, and when the horizontal continuation line is reached, the contour search is temporarily stopped. This is repeated until the other end point on the lower side of the horizontal continuous line approximated by a rectangle is reached.

【００７７】最終的に、輪郭探索を行った回数が文字数
となり、輪郭探索の開始点から終了点までが１文字の存
在する領域である。垂直分離線は、文字と文字とを分離
する垂直線であり、図１５（ｂ）に示すように、輪郭探
索の終了点と開始点との間で、かつ、矩形近似された水
平続き線の幅値を持つ位置で決定する。Finally, the number of times the contour search is performed becomes the number of characters, and the region from the start point to the end point of the contour search is one character. The vertical separation line is a vertical line that separates characters from each other, and as shown in FIG. 15B, is a horizontal continuous line between the end point and the start point of the contour search and which is approximated by a rectangle. Determined at the position that has the width value.

【００７８】ステップ６で垂直分離線を決定すると、続
いて、ステップ７で、ゼロ判定を行う。このゼロ判定処
理は、図１６に示すように、垂直分離線と水平続き線に
囲まれた１文字領域内において、水平続き線と文字パタ
ーンとに囲まれた空白部分から、複数方向に放射状に探
索を行うことでループ構造を持つのか否かを調べること
で行う。When the vertical separation line is determined in step 6, subsequently, zero determination is performed in step 7. As shown in FIG. 16, this zero determination processing is performed in a single character area surrounded by vertical separation lines and horizontal continuation lines, starting from a blank part surrounded by horizontal continuation lines and character patterns, and radially in multiple directions. The search is performed to check whether or not it has a loop structure.

【００７９】ステップ７でゼロ判定を行うと、続いて、
ステップ８で、文字を分離する処理を行う。この文字分
離処理は、不要な水平続き線を削除することで行う。す
なわち、ゼロと判定した文字では、水平続き線は不必要
な線であるので、これを削除するのである。この削除処
理は、図１７（ａ）に示すように、垂直分離線を除去す
るとともに、続き線の太さが急激に変化する部分や、続
き線の傾きやその微分値が急激に変化する部分まで削除
することで行う。一方、ゼロでないと判定された文字に
ついては、図１７（ｂ）に示すように、垂直分離線の部
分で他の文字との分離を行うが続き線の削除は行わな
い。When the zero judgment is made in step 7,
In step 8, the process of separating characters is performed. This character separation processing is performed by deleting unnecessary horizontal continuation lines. That is, in the character determined to be zero, the horizontal continuation line is an unnecessary line and is deleted. In this deletion process, as shown in FIG. 17A, the vertical separation line is removed, and the part where the thickness of the continuation line changes abruptly, the part where the slope of the continuation line and the derivative value thereof change abruptly. By deleting up to. On the other hand, as for the character determined to be not zero, as shown in FIG. 17B, the vertical separation line is separated from other characters, but the continuation line is not deleted.

【００８０】このようにして、第２の認識プログラム２
２は、ステップ１ないしステップ８の処理に従って、文
字と文字とを繋げる不要な続き文字部分（金融文書の場
合、そのほとんどが「０」と「０」とを連続的に記入す
るときに発生する）を削除することで、文書画像から手
書き文字を切り出すのである。In this way, the second recognition program 2
2 is an unnecessary continuous character portion (in the case of a financial document, most of them are written when "0" and "0" are continuously written in succession in the case of a financial document) according to the processing of steps 1 to 8. ) Is deleted, the handwritten character is cut out from the document image.

【００８１】なお、この処理フローでは詳細に説明しな
かったが、一部分の文字しか続き文字部分を持たないと
き（すでに分離されている文字がある）にも、ステップ
５には進まずに、ステップ６ないしステップ８の処理に
進んで、その続き文字部分を削除する処理を行うことに
なる。Although not described in detail in this processing flow, even when only a part of characters has a continuous character part (a character already separated is present), the process does not proceed to step 5, The process proceeds from 6 to step 8 to delete the subsequent character portion.

【００８２】ステップ８で文書画像から手書き文字を切
り出すと、続いて、ステップ９に進んで、切り出した文
字の中から未処理の文字を１つ選択し、続くステップ１
０で、全ての文字を選び出したのか否かを判断する。When the handwritten character is cut out from the document image in step 8, the process proceeds to step 9, and one unprocessed character is selected from the cut out characters, and the following step 1
At 0, it is determined whether all the characters have been selected.

【００８３】このステップ１０で、全ての文字を選び出
していないことを判断するとき、すなわち、ステップ９
で、未処理の文字を１つ選択できたことを判断するとき
には、ステップ１１（図１３の処理フロー）に進んで、
第１の認識プログラム２１の用いた算出手法と同一の算
出手法を用いて、その選択した文字の持つ文字認識に用
いる特徴量を算出する。In this step 10, when it is judged that all the characters have not been selected, that is, step 9
Then, when it is determined that one unprocessed character can be selected, the process proceeds to step 11 (process flow of FIG. 13),
Using the same calculation method as that used by the first recognition program 21, the feature amount used for character recognition of the selected character is calculated.

【００８４】続いて、ステップ１２で、その算出した特
徴量を使って認識対象となる登録文字との間の距離を測
定する。金融文書では、「０」〜「９」の１０個の文字
が認識対象の登録文字となるので、これらの１０個の登
録文字との間の距離を測定するのである。Then, in step 12, the calculated feature amount is used to measure the distance to the registered character to be recognized. In a financial document, 10 characters “0” to “9” are registered characters to be recognized, and therefore the distance between these 10 registered characters is measured.

【００８５】続いて、ステップ１３で、ステップ９で選
択した文字の認識結果として、ステップ１２で測定した
最も距離の小さい登録文字を決定してから、次の文字の
認識に進むべくステップ９に戻っていく。Then, in step 13, as the recognition result of the character selected in step 9, the registered character with the smallest distance measured in step 12 is determined, and then the process returns to step 9 to proceed to recognition of the next character. To go.

【００８６】そして、ステップ９ないしステップ１３の
処理を繰り返すことで、ステップ１０で、全ての文字を
選び出したことを判断するとき、すなわち、ステップ８
で切り出した手書き文字の文字認識を終了することを判
断すると、ステップ１４に進んで、ステップ１３で決定
した認識結果の登録文字との間の距離の合計値を算出し
て、全処理を終了する。When it is judged in step 10 that all the characters have been selected by repeating the processing of steps 9 to 13, that is, step 8
When it is determined that the character recognition of the handwritten character cut out in step is to be ended, the process proceeds to step 14, the total value of the distance between the recognition result and the registered character determined in step 13 is calculated, and the entire process is ended. .

【００８７】このようにして、第２の認識プログラム２
２は、接触文字部分抽出プログラム２４により１文字入
力欄の枠の取り外された手書き文字を認識対象として、
文字と文字とを繋ぐ続き文字部分を検出して、それを削
除することで文書画像から手書き文字を切り出し、それ
に対する認識処理を実行するのである。In this way, the second recognition program 2
2 designates the handwritten character with the frame of the one-character input field removed by the contact character portion extraction program 24 as a recognition target,
By detecting a continuous character portion connecting characters and deleting it, a handwritten character is cut out from the document image, and recognition processing for the handwritten character is executed.

【００８８】この第２の認識プログラム２２の認識処理
により、例えば、図３の上段に示す続き文字の形態で記
入される「１００００」という手書き文字は、図３の左
側のように切り出されて、「１００００」と認識される
ことになる。そして、この認識結果に対して、認識結果
の距離の合計値として「２０」が算出されることにな
る。By the recognition processing of the second recognition program 22, for example, the handwritten character "10000" entered in the form of continuous characters shown in the upper part of FIG. 3 is cut out as shown on the left side of FIG. It will be recognized as "10000". Then, for this recognition result, “20” is calculated as the total value of the distances of the recognition result.

【００８９】図５の処理フローで説明したように、認識
制御プログラム２０は、第１及び第２の認識プログラム
２１，２２が認識処理を終了すると、最終的な認識結果
を決定する処理を行う。As described in the processing flow of FIG. 5, the recognition control program 20 performs the processing of determining the final recognition result when the first and second recognition programs 21 and 22 finish the recognition processing.

【００９０】図１８に、この認識制御プログラム２０の
実行する決定処理の一実施例を図示する。次に、この処
理フローについて説明する。認識制御プログラム２０
は、第１及び第２の認識プログラム２１，２２が認識処
理を終了すると、図１８の処理フローに示すように、先
ず最初に、ステップ１で、第２の認識プログラム２２が
認識処理を実行したのか否かを判断する。上述したよう
に、第２の認識プログラム２２は、続き文字部分が存在
しないときには、認識処理を実行せずにその旨を記録す
るだけの処理を行うので、この記録が残されているのか
否かを判断することで、第２の認識プログラム２２が認
識処理を実行したのか否かを判断するのである。FIG. 18 shows an example of the determination process executed by the recognition control program 20. Next, this processing flow will be described. Recognition control program 20
When the first and second recognition programs 21 and 22 finish the recognition process, first, in step 1, the second recognition program 22 executes the recognition process as shown in the process flow of FIG. Or not. As described above, the second recognition program 22 does not perform the recognition process when the continuous character portion does not exist, but does not perform the recognition process but only records the fact, so whether or not this record is left. By determining, it is determined whether or not the second recognition program 22 has performed the recognition process.

【００９１】この判断処理により、第２の認識プログラ
ム２２が認識処理を実行しなかったことを判断するとき
には、ステップ２に進んで、第１の認識プログラム２１
の認識結果を最終的な認識結果として出力して処理を終
了する。When it is judged by this judgment processing that the second recognition program 22 has not executed the recognition processing, the routine proceeds to step 2 and the first recognition program 21 is executed.
The recognition result of is output as the final recognition result, and the process ends.

【００９２】一方、この判断処理により、第２の認識プ
ログラム２２が認識処理を実行したことを判断するとき
には、ステップ３に進んで、第１の認識プログラム２１
の出力する距離合計値と、第２の認識プログラム２２の
出力する距離合計値との大小を比較する。On the other hand, when it is judged by this judgment processing that the second recognition program 22 has executed the recognition processing, the routine proceeds to step 3 and the first recognition program 21 is executed.
And the total distance value output by the second recognition program 22 are compared.

【００９３】この比較処理により、第１の認識プログラ
ム２１の出力する距離合計値の方が小さいと判断すると
きには、ステップ４に進んで、第１の認識プログラム２
１の認識結果を最終的な認識結果として出力する。一
方、第２の認識プログラム２２の出力する距離合計値の
方が小さいと判断するときには、ステップ２に進んで、
第２の認識プログラム２２の認識結果を最終的な認識結
果として出力する。When it is determined by this comparison process that the total distance value output by the first recognition program 21 is smaller, the routine proceeds to step 4, where the first recognition program 2
The recognition result of 1 is output as the final recognition result. On the other hand, when it is determined that the total distance value output by the second recognition program 22 is smaller, the process proceeds to step 2,
The recognition result of the second recognition program 22 is output as the final recognition result.

【００９４】このようにして、認識制御プログラム２０
は、第１の認識プログラム２１の認識結果と、第２の認
識プログラム２２の認識結果とを受け取ると、距離値の
合計値の小さい方、すなわち、より類似していると判断
した認識結果の方を最終的な認識結果として選択して出
力するのである。In this way, the recognition control program 20
When the recognition result of the first recognition program 21 and the recognition result of the second recognition program 22 are received, the one with the smaller total distance value, that is, the recognition result judged to be more similar Is selected and output as the final recognition result.

【００９５】この決定処理により、図３の例で説明する
ならば、第２の認識プログラム２２の認識結果である
「１００００」が最終的な認識結果として出力されるこ
とになる。With this determination processing, as will be described with reference to the example of FIG. 3, the recognition result “10000” of the second recognition program 22 is output as the final recognition result.

【００９６】第１の認識プログラム２１の用いる文字切
り出しアルゴリズムは、文字と文字とが分離しているこ
とを前提するものであり、これから、続き文字部分を除
去できないことが起こる。このようなときには、続き文
字部分を除去することで文字を切り出す第２の認識プロ
グラム２２の認識結果の方が正解の可能性が高い。The character cutout algorithm used by the first recognition program 21 is based on the premise that the characters are separated from each other, and from this, it may happen that the succeeding character part cannot be removed. In such a case, the recognition result of the second recognition program 22 that cuts out characters by removing the subsequent character portion is more likely to be the correct answer.

【００９７】一方、第２の認識プログラム２２の用いる
文字切り出しアルゴリズムは、文字と文字とが分離して
いることを前提としていないものであり、これから、１
文字入力欄に記入されることで本来は文字と文字とが分
離しているにもかかわらず、続き文字部分を誤って検出
することが起こる。このようなときには、文字と文字と
が分離していることを前提として文字を切り出す第１の
認識プログラム２１の認識結果の方が正解の可能性が高
い。On the other hand, the character segmentation algorithm used by the second recognition program 22 does not assume that characters are separated from each other.
Although the characters are originally separated by being filled in the character input field, the succeeding character portion may be erroneously detected. In such a case, the recognition result of the first recognition program 21 that cuts out a character on the premise that the character is separated is more likely to be the correct answer.

【００９８】このような特性の違いを考慮して、本発明
では、認識制御プログラム２０が、第１の認識プログラ
ム２１の認識結果と第２の認識プログラム２２の認識結
果とから、より正解き可能性の高い方の認識結果を最終
的な認識結果とする構成を採るのである。この構成を採
ることで、従来よりも文字認識精度を著しくる向上でき
るようになる。In consideration of such a difference in characteristics, in the present invention, the recognition control program 20 can make a more correct answer from the recognition result of the first recognition program 21 and the recognition result of the second recognition program 22. The recognition result of the one having a higher property is used as the final recognition result. By adopting this configuration, it becomes possible to remarkably improve the character recognition accuracy as compared with the conventional case.

【００９９】図示実施例に従って本発明を説明したが、
本発明はこれに限定されるものではない。例えば、実施
例では、第１の認識プログラム２１と第２の認識プログ
ラム２２とが手書き文字を切り出した後、同一の認識ア
ルゴリズムを使って文字認識を実行する構成を採った
が、両者の認識結果を比較できるものであるならば、異
なる認識アルゴリズムを使ってもよい。Although the present invention has been described with reference to the illustrated embodiment,
The present invention is not limited to this. For example, in the embodiment, the first recognition program 21 and the second recognition program 22 cut out handwritten characters and then perform character recognition using the same recognition algorithm. Different recognition algorithms may be used as long as they can be compared.

【０１００】また、実施例では、金融文書を想定して、
「０」〜「９」を認識対象とすることを想定したが、本
発明は、その適用が「０」〜「９」に認識対象とするも
のに限られるものではない。Further, in the embodiment, assuming a financial document,
Although it is assumed that "0" to "9" are the recognition targets, the present invention is not limited to the application of "0" to "9" as the recognition targets.

【０１０１】また、実施例では、手書き文字を認識対象
とすることを想定したが、画質の劣化した活字文字や、
精度の悪いプリンタにより印刷された文字などに対して
も、本発明はそのまま適用できるものである。In the embodiment, it is assumed that handwritten characters are to be recognized. However, print characters whose image quality is deteriorated,
The present invention can be directly applied to characters printed by a printer with low accuracy.

【０１０２】[0102]

【発明の効果】以上説明したように、本発明の文字認識
装置では、１文字入力欄の罫線を削除しつつ文字を認識
する構成を採るときにあって、文字同士が分離している
ことを前提とする文字切り出しアルゴリズムを使って切
り出される文字を認識対象として、文字の認識処理を実
行するとともに、文字の続き文字部分を削除することで
切り出される文字を認識対象として、文字の認識処理を
実行する構成を採って、その２つの認識結果から最終的
な文字の認識結果を得るようにすることから、文字認識
精度を従来よりも向上できるようになる。As described above, in the character recognition apparatus of the present invention, when the character recognition is performed while deleting the ruled line of the one-character input field, the characters are separated from each other. Performs character recognition processing with the characters cut out using the presumed character cutting algorithm as the recognition target, and also performs character recognition processing with the characters cut out by deleting the subsequent characters of the characters as the recognition target. By adopting the configuration described above and obtaining the final character recognition result from the two recognition results, the character recognition accuracy can be improved as compared with the conventional case.

[Brief description of drawings]

【図１】本発明の原理構成図である。FIG. 1 is a principle configuration diagram of the present invention.

【図２】本発明の説明図である。FIG. 2 is an explanatory diagram of the present invention.

【図３】本発明の説明図である。FIG. 3 is an explanatory diagram of the present invention.

【図４】本発明の一実施例である。FIG. 4 is an example of the present invention.

【図５】認識制御プログラムの実行する処理フローであ
る。FIG. 5 is a processing flow executed by a recognition control program.

【図６】枠抽出プログラムの説明図である。FIG. 6 is an explanatory diagram of a frame extraction program.

【図７】接触文字部分抽出プログラムの説明図である。FIG. 7 is an explanatory diagram of a contact character portion extraction program.

【図８】接触文字部分の抽出処理の説明図である。FIG. 8 is an explanatory diagram of a contact character portion extraction process.

【図９】文字抽出処理の説明図である。FIG. 9 is an explanatory diagram of character extraction processing.

【図１０】第１の認識プログラムの実行する処理フロー
である。FIG. 10 is a processing flow executed by a first recognition program.

【図１１】文字切り出し処理の説明図である。FIG. 11 is an explanatory diagram of character cutting processing.

【図１２】第２の認識プログラムの実行する処理フロー
である。FIG. 12 is a processing flow executed by a second recognition program.

【図１３】第２の認識プログラムの実行する処理フロー
である。FIG. 13 is a processing flow executed by a second recognition program.

【図１４】水平続き線の抽出処理の説明図である。FIG. 14 is an explanatory diagram of a horizontal continuation line extraction process.

【図１５】垂直分離線の決定処理の説明図である。FIG. 15 is an explanatory diagram of a vertical separation line determination process.

【図１６】ゼロ判定処理の説明図である。FIG. 16 is an explanatory diagram of zero determination processing.

【図１７】文字分離処理の説明図である。FIG. 17 is an explanatory diagram of character separation processing.

【図１８】認識制御プログラムの実行する処理フローで
ある。FIG. 18 is a processing flow executed by a recognition control program.

【図１９】１文字入力欄に記入される手書き文字の説明
図である。FIG. 19 is an explanatory diagram of handwritten characters entered in the one-character input field.

[Explanation of symbols]

１文字認識装置２イメージスキャナ１０イメージメモリ１１抽出手段１２取出手段１３第１の認識手段１４第２の認識手段１５決定手段 1 character recognition device 2 image scanner 10 image memory 11 Extraction means 12 Means of taking out 13 First recognition means 14 Second recognition means 15 Determining means

フロントページの続き (56)参考文献特開平９−54813（ＪＰ，Ａ) 特開昭63−251874（ＪＰ，Ａ) 特開平７−282190（ＪＰ，Ａ) 特開平６−333089（ＪＰ，Ａ) 特開平７−192094（ＪＰ，Ａ) 特開平８−305796（ＪＰ，Ａ) 特開平11−25220（ＪＰ，Ａ) 特開平10−162104（ＪＰ，Ａ) 特開平８−202822（ＪＰ，Ａ) 特開昭59−98283（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06K 9/00 - 9/82 Continuation of front page (56) Reference JP-A-9-54813 (JP, A) JP-A-63-251874 (JP, A) JP-A-7-282190 (JP, A) JP-A-6-333089 (JP , A) JP 7-192094 (JP, A) JP 8-305796 (JP, A) JP 11-25220 (JP, A) JP 10-162104 (JP, A) JP 8-202822 (JP, A) JP-A-59-98283 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G06K 9/00-9/82

Claims

(57) [Claims]

1. A character recognition device for recognizing a character entered in a one-character input field separated by a ruled line, an extracting means for extracting a ruled line of an input image, and a ruled line extracted by the extracting means and an input image. By identifying the contact character portion with the character and determining the extraction means for extracting only the character according to the identification result and determining whether the contact character portion is valid according to the character size, the character is extracted from the input image. The first recognition means for recognizing the character by cutting it out character by character, and the continuous character part of the character extracted by the extracting means are detected,
A decision to determine the final character recognition result from the second recognition unit that deletes it and recognizes the characters one by one from the input image and the recognition results of the first and second recognition units. A character recognition device characterized by comprising means.

2. The character recognizing device according to claim 1, wherein the second recognizing means performs processing so as not to execute the character recognizing processing when determining that there is no continuous character portion. Character recognition device.

3. A character recognition method for recognizing a character entered in a one-character input field separated by a ruled line, the first process step of extracting a ruled line of an input image, and the ruled line extracted according to the character size. By deciding whether or not to validate the contact character portion with the character that the input image has, the second processing step in which the character is cut out and recognized one by one from the input image, and the extracted ruled lines and the input image A character is extracted from the input image one by one by recognizing the characters that come in contact with the character that it has, by extracting only the character from the input image, detecting the subsequent character part of the character, and deleting it. A character recognition method comprising: the third processing step; and a fourth processing step that determines a final character recognition result from the recognition results obtained in the second and third processing steps.

4. A program storage medium for storing a program used for realizing a character recognition device for recognizing a character entered in a one-character input field separated by a ruled line, and extracting for extracting a ruled line of an input image. Processing, the contact character portion of the ruled line extracted by the extraction processing and the character of the input image is specified, the extraction processing of extracting only the character according to the specification result, and the contact character portion is validated according to the character size. By deciding whether or not to do so, the first recognition processing for recognizing the characters by cutting out the characters one by one from the input image and the succeeding character portion of the characters extracted by the extraction processing are detected
A decision to determine the final character recognition result from the second recognition processing by cutting out the characters one by one from the input image and recognizing them by deleting them and the recognition results of the first and second recognition processing. A program storage medium characterized in that a program for causing a computer to execute processing is stored.