JPH0816719A

JPH0816719A - Character segmenting character recognition method, method and device by using same

Info

Publication number: JPH0816719A
Application number: JP6147631A
Authority: JP
Inventors: Hiroaki Ikeda; 裕章池田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1994-06-29
Filing date: 1994-06-29
Publication date: 1996-01-19
Anticipated expiration: 2019-07-28
Also published as: JP3548234B2

Abstract

PURPOSE:To improve the recognition rate of characters by enabling the segmentation of characters even concerning a character string in a column direction when segmenting characters contained in lines in a row direction. CONSTITUTION:An image block 201 is provided with a standard size and becomes a character image block as it is. An image block 202 contains laterally typeset characters and is provided with lateral width (w1) wider than standard character width. A standard block 203 expresses the standard size of the character image block and is calculated from the image block segmented from a character image by a prescribed method. A threshold value T for identifying the image block including the laterally typeset characters is provided by multiplying a prescribed coefficient to lateral width ws of this standard block 203. Then, the lateral width of each segmented image block is compared with the threshold value T, the image block equipped with the lateral width larger than T is defined as the image block including the laterally typeset characters and concerning this block, dividing processing for laterally typeset character string is performed.

Description

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は文書画像より文字単位の
画像ブロックの取り出しを行って、ブロック単位で文字
認識を行うための文字切り出し方法及びこれを用いた文
字認識方法及び装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character segmentation method for extracting an image block in character units from a document image and performing character recognition in block units, and a character recognition method and apparatus using the character segmentation method.

【０００２】[0002]

【従来の技術】一般に、光学式文字認識装置（ＯＣＲ）
では、光学的に読み取った文書画像より文字単位の画像
ブロックを取り出して文字の切り出しを行い、切り出さ
れた各文字毎に文字認識が行われる。一般的なＯＣＲで
行なわれる認識処理の一例を図７及び図８を用いて説明
する。2. Description of the Related Art Generally, an optical character recognition device (OCR)
Then, an image block for each character is taken out from the optically read document image, the character is cut out, and character recognition is performed for each cut out character. An example of a recognition process performed by a general OCR will be described with reference to FIGS. 7 and 8.

【０００３】図８は一般的なＯＣＲによる文字認識処理
の手順を表すフローチャートである。まず、ステップＳ
７０１において、イメージスキャナ等を用いて文書画像
を入力する。次に、ステップＳ７０２で、入力された文
書画像から画像ブロックの取り出しを行なう。FIG. 8 is a flow chart showing a procedure of character recognition processing by a general OCR. First, step S
At 701, a document image is input using an image scanner or the like. Next, in step S702, an image block is extracted from the input document image.

【０００４】図９は一般的な文字認識処理において用い
られる画像ブロックの取り出し方法を説明する図であ
る。図９においては、射影を用いた縦書き文書の画像ブ
ロック取り出しの例が示されており、まず、上下方向に
射影８０１を取ることで行を抽出する。そして、行を抽
出後、各行に対して行方向と直角方向に射影８０２を取
ることで画像ブロック８０３を取り出すことができる。
このままでは、２つ以上の文字が接触していることによ
り複数の文字を含むブロック８０４が存在したり、分離
した部分を有する文字について分離されたブロック８０
５が存在したりする。FIG. 9 is a diagram for explaining a method of extracting an image block used in a general character recognition process. FIG. 9 shows an example of image block extraction of a vertically written document using projection. First, a projection 801 is taken in the vertical direction to extract a line. After extracting the rows, the image block 803 can be extracted by taking a projection 802 on each row in a direction perpendicular to the row direction.
In this state, there is a block 804 containing a plurality of characters due to the contact of two or more characters, or a block 80 separated for a character having a separated part.
There are 5 or so.

【０００５】このため、ステップＳ７０３で文字単位の
画像ブロック（文字画像ブロック）を作成する。まず、
各行で各画像ブロックの高さの平均をとったり、画像ブ
ロックの高さの分布からもっとも頻度の大きい高さを抽
出する等して標準文字高を求める。そして、画像ブロッ
クの高さが標準文字高の整数倍であれば、その整数で高
さを等分して画像ブロックの分離を行い文字画像ブロッ
クとする。更に、複数の画像ブロックを結合した場合に
その高さが標準文字高になるのであればそれらの画像ブ
ロックを結合することで文字画像ブロックを生成する。
例えば、図８において、画像ブロック８０４なら２分割
して２つの文字画像ブロックが生成され、画像ブロック
８０５なら２つの画像ブロックを結合することで１つの
文字画像ブロックが生成される。Therefore, in step S703, a character-by-character image block (character image block) is created. First,
The standard character height is obtained by taking the average of the heights of the image blocks in each row or extracting the height with the highest frequency from the distribution of the heights of the image blocks. Then, if the height of the image block is an integral multiple of the standard character height, the height is equally divided by the integer and the image blocks are separated into character image blocks. Furthermore, if the height of a plurality of image blocks is the standard character height when combined, the character image blocks are generated by combining those image blocks.
For example, in FIG. 8, an image block 804 is divided into two to generate two character image blocks, and an image block 805 is generated by combining two image blocks to generate one character image block.

【０００６】以上のようにして、ステップＳ７０２とス
テップＳ７０３により文字画像ブロックを生成すること
により文字の切り出しが行われるので、ステップＳ７０
４で各文字画像ブロック毎に識別演算を行ない、類似度
が最も大きいカテゴリ（文字）を認識結果とするように
構成されている。As described above, the character is cut out by generating the character image block in steps S702 and S703.
In 4, the identification calculation is performed for each character image block, and the category (character) having the highest degree of similarity is set as the recognition result.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら、上記の
手法による文字の切り出しにおいては、図１０にあるよ
うな縦書き文書に含まれる横組みの文字については、９
０１のように、それらが１つの文字画像ブロックとして
取り出されてしまうため、文字単位に切り出せず、認識
不能となったり、誤認識の原因になったりするという問
題がある。However, in the character segmentation by the above method, the horizontal character set included in the vertically written document as shown in FIG.
Like 01, they are taken out as one character image block, so that they cannot be cut out in character units, and there is a problem that they cannot be recognized or cause erroneous recognition.

【０００８】本発明はこのような問題に鑑みてなされた
ものであり、行方向の行中に含まれる文字の切り出し時
に、列方向の文字列についても文字の切り出しを行うこ
とを可能とし、文字の認識率を向上する文字切り出し方
法とこれを用いた文字認識方法及び装置を提供すること
を目的とする。The present invention has been made in view of such a problem, and when cutting out a character included in a line in a row direction, it is possible to cut out a character string in a column direction. It is an object of the present invention to provide a character segmentation method for improving the recognition rate of, and a character recognition method and device using the same.

【０００９】[0009]

【課題を解決するための手段】及び[Means for Solving the Problems] and

【作用】上記の目的を達成するための本発明による文字
認識装置は以下の構成を備える。即ち、画像より文字の
認識を行う文字認識装置であって、前記画像の行方向の
射影に基づいて行を抽出し、抽出された行に直交する列
方向の射影に基づいて認識対象文字を含む矩形領域を獲
得する獲得手段と、前記矩形領域の夫々について列方向
の文字列を含むか否かを判定する判定手段と、前記判定
手段により列方向の文字列を含むと判定された矩形領域
について、列方向の分割を行って新たな矩形領域を生成
する分割手段と、前記獲得手段及び分割手段により得ら
れた矩形領域毎に文字認識処理を行う認識手段とを備え
る。The character recognition apparatus according to the present invention for achieving the above object has the following configuration. That is, a character recognition device for recognizing a character from an image, wherein a row is extracted based on a projection of the image in a row direction, and a character to be recognized is included based on a projection in a column direction orthogonal to the extracted row. Regarding acquisition means for acquiring a rectangular area, determination means for determining whether or not each of the rectangular areas includes a character string in the column direction, and rectangular areas determined by the determination means to include a character string in the column direction A dividing means for dividing a column direction to generate a new rectangular area, and a recognizing means for performing a character recognizing process for each rectangular area obtained by the obtaining means and the dividing means.

【００１０】行方向及び列方向の射影により切り出され
た矩形領域について、列方向に並ぶ複数の文字で構成さ
れた列方向の文字列が存在するか否かを判定し、列方向
の文字列が存在すると判定された矩形領域については更
に列方向の分割を行う。このようにして切り出された矩
形ブロックの夫々について文字認識処理が実行される。It is determined whether or not a column-direction character string composed of a plurality of characters arranged in the column direction exists in the rectangular area cut out by the projection in the row direction and the column direction, and the character string in the column direction is determined. The rectangular area determined to exist is further divided in the column direction. Character recognition processing is executed for each of the rectangular blocks cut out in this manner.

【００１１】又、好ましくは、前記獲得手段は、前記画
像の行方向の射影に基づいて行を抽出し、抽出された行
に直交する列方向の射影に基づいて認識対象文字を切り
出し、該認識対象文字を内包するほぼ最少の矩形領域を
獲得し、前記判定手段は、前記矩形領域の夫々につい
て、その列方向の大きさに基づいて列方向の文字列を含
むか否かを判定する。列方向に複数の文字が並ぶ場合、
通常の文字よりもその列方向の大きさが大きくなるの
で、これを利用することにより容易に列方向文字列を含
む矩形領域を識別できるからである。Further, preferably, the acquisition means extracts a row based on a projection of the image in a row direction, cuts out a recognition target character based on a projection in a column direction orthogonal to the extracted row, and performs the recognition. The substantially smallest rectangular area that contains the target character is acquired, and the determination means determines whether or not each of the rectangular areas includes a character string in the column direction based on the size in the column direction. When multiple characters are lined up in the column direction,
This is because the size in the column direction is larger than that of a normal character, and by using this, it is possible to easily identify the rectangular area including the column direction character string.

【００１２】又、好ましくは、前記判定手段は、前記獲
得手段により得られた矩形領域の平均の行方向長さに基
づいて矩形領域の標準の列方向長さを生成し、前記矩形
領域の夫々について、その列方向長さと該標準の列方向
長さとに基づいて列方向の文字列を含むか否かを判定す
る。列方向の標準の大きさを各文字の行方向の大きさに
基づいて設定することが可能となり、文字の大きさが異
なる各種の文書画像に柔軟に対応することが可能となる
からである。Further, preferably, the determination means generates a standard column-direction length of the rectangular area based on the average length of the rectangular area in the row direction obtained by the acquisition means, and each of the rectangular areas is generated. Is determined based on the length in the column direction and the standard length in the column direction. This is because the standard size in the column direction can be set based on the size in the line direction of each character, and it is possible to flexibly deal with various document images having different character sizes.

【００１３】又、好ましくは、前記分割手段は、前記判
定手段により列方向の文字列を含むと判定された矩形領
域について行方向の射影をとることにより認識対象文字
の切り出しを行って新たな矩形領域を生成する。Further, preferably, the dividing means performs a projection in the row direction on the rectangular area determined by the determining means to include a character string in the column direction, thereby cutting out a character to be recognized to form a new rectangle. Generate a region.

【００１４】更に、上記の目的を達成するための本発明
の他の構成による文字認識装置は例えば以下の構成を備
える。即ち、画像より文字の認識を行う文字認識装置で
あって、前記画像の行方向の射影に基づいて行を抽出
し、抽出された行に直交する列方向の射影に基づいて認
識対象文字を含む矩形領域を獲得する獲得手段と、前記
獲得手段により獲得された矩形領域毎に標準パターンと
の類似度を獲得して文字認識を行う第１認識手段と、前
記第１認識手段により獲得された類似度が所定値よりも
小さい矩形領域について列方向の分割を行い分割された
矩形領域を獲得する分割手段と、前記分割された矩形領
域の夫々について文字認識を行う第２認識手段とを備え
る。Further, a character recognition device according to another structure of the present invention for achieving the above object has, for example, the following structure. That is, a character recognition device for recognizing a character from an image, wherein a row is extracted based on a projection of the image in a row direction, and a character to be recognized is included based on a projection in a column direction orthogonal to the extracted row. Acquisition means for acquiring a rectangular area, first recognition means for performing character recognition by acquiring a similarity with a standard pattern for each rectangular area acquired by the acquisition means, and similarity acquired by the first recognition means A dividing unit that divides a rectangular region whose degree is smaller than a predetermined value in the column direction to obtain the divided rectangular region, and a second recognizing unit that performs character recognition on each of the divided rectangular regions are provided.

【００１５】行方向及び列方向の射影により抽出された
矩形領域について文字認識処理を行い、その類似度が所
定値以下であった場合に当該矩形領域について更に列方
向の分割を行って文字認識処理を行う。Character recognition processing is performed on the rectangular area extracted by the projection in the row direction and the column direction, and when the similarity is less than a predetermined value, the rectangular area is further divided in the column direction to perform the character recognition processing. I do.

【００１６】又、好ましくは、列方向の文字と類似した
形状を有する文字を記憶する記憶手段を更に備え、前記
分割手段は、前記第１認識手段により獲得された類似度
が所定値よりも小さい場合、もしくは前記第１認識手段
による認識結果が前記記憶手段に記憶された文字である
場合、当該矩形領域について列方向の分割を行って矩形
領域を獲得する。横方向の文字列と誤認識しやすい文字
を登録しておくことにより、このような文字であると認
識された場合には、その類似度が所定値以上であっても
列方向の分割と文字認識処理を行うことにより、認識の
精度が更に向上する。[0016] Preferably, the storage device further comprises storage means for storing a character having a shape similar to a character in the column direction, and the dividing means has a degree of similarity acquired by the first recognizing means smaller than a predetermined value. In this case, or when the recognition result by the first recognition means is a character stored in the storage means, the rectangular area is divided in the column direction to obtain the rectangular area. By registering characters that are likely to be erroneously recognized as a horizontal character string, if such a character is recognized, even if the similarity is greater than or equal to a predetermined value, character division in the column direction and character By performing the recognition process, the recognition accuracy is further improved.

【００１７】又、好ましくは、前記分割手段により分割
されて得られた矩形領域の少なくとも１つにおいて前記
第２認識手段により得られた類似度が所定の値以下であ
る場合、該分割された矩形領域の分割前の矩形領域に対
する前記第１認識手段の認識結果を選択する選択手段を
更に備える。分割手段による更なる分割により類似度が
却って低下した場合は、その分割前の認識結果を採用す
ることで、認識率の低下が防止できるからである。Further, preferably, in at least one of the rectangular areas obtained by dividing by the dividing means, when the similarity obtained by the second recognizing means is equal to or less than a predetermined value, the divided rectangle is obtained. It further comprises selection means for selecting a recognition result of the first recognition means for the rectangular area before the area is divided. This is because, if the degree of similarity rather decreases due to further division by the dividing unit, the recognition rate before the division is adopted to prevent the reduction of the recognition rate.

【００１８】尚、本発明において、行方向及び列方向
は、縦書き文書の場合は夫々縦方向、横方向に対応す
る。又、横書き文書にあっては、行方向は横方向、列方
向は縦方向に夫々対応する。In the present invention, the row direction and the column direction correspond to the vertical direction and the horizontal direction in the case of a vertically written document, respectively. In a horizontally written document, the row direction corresponds to the horizontal direction and the column direction corresponds to the vertical direction.

【００１９】[0019]

【実施例】以下に添付の図面を参照して本発明の好適な
実施例を説明する。Preferred embodiments of the present invention will be described below with reference to the accompanying drawings.

【００２０】［実施例１］図１は本発明を実施するため
の文字認識装置の構成を示すブロック図である。１０１
はＣＰＵであり、ＲＯＭ１０２に格納されている制御プ
ログラムに従って本装置における各種の制御を行なう。
１０２はＲＯＭであり、ＣＰＵ１０１が実行する制御プ
ログラムや各種データを格納する。後述するフローチャ
ートで示される処理を実現する制御プログラムもこのＲ
ＯＭ１０２に格納されている。１０３はＲＡＭであり、
ＣＰＵ１０１が各種の処理を実行する際の作業エリアを
提供する。又、ＲＡＭ１０３は、光学的に読み取られた
文書画像などを格納する領域も含む。１０４はディスプ
レイであり、読取画像や、各種認識結果等を表示する。
１０５はキーボードであり、各種指示やデータの入力を
行う。１０６はイメージスキャナであり、画像を光学的
に読み取る。[Embodiment 1] FIG. 1 is a block diagram showing the configuration of a character recognition apparatus for carrying out the present invention. 101
Is a CPU, which performs various controls in this apparatus according to a control program stored in the ROM 102.
A ROM 102 stores a control program executed by the CPU 101 and various data. The control program that realizes the processing shown in the flow chart described later is also R
It is stored in the OM 102. 103 is a RAM,
It provides a work area when the CPU 101 executes various processes. The RAM 103 also includes an area for storing an optically read document image and the like. A display 104 displays a read image and various recognition results.
A keyboard 105 is used to input various instructions and data. An image scanner 106 optically reads an image.

【００２１】次に、図１に示す構成の文字認識装置が実
行する本実施例の動作について説明する。Next, the operation of the present embodiment executed by the character recognition device having the configuration shown in FIG. 1 will be described.

【００２２】図２は、本実施例における文字画像ブロッ
クの切り出しの概要を説明する図である。同図におい
て、２０１は標準的な大きさを有する画像ブロックであ
り、そのまま文字画像ブロックとなる。２０２は横組み
文字に対する画像ブロックであり、標準の文字幅よりも
広い横幅（ｗ１）を有する。２０３は文字画像ブロック
の標準サイズを表す標準ブロックであり、スキャナ１０
６より読み取られた文字画像より切り出した画像ブロッ
クより所定の方法で算出される。この標準ブロック２０
３の横幅ｗｓに所定の係数をかけることで横組み文字を
含む画像ブロックを識別するための閾値Ｔが得られる。
そして、切り出された各画像ブロックの横幅と閾値Ｔと
を比較して、Ｔよりも大きい横幅を有する画像ブロック
が横組み文字を含む画像ブロックであるとして、横組み
文字用の分割処理を行う。例えば図２においては、画像
ブロック２０２がｗ１＞Ｔとなり、この画像ブロックに
ついて更に行方向に射影を行い、その結果３つのブロッ
クに再分割される状態が示されている。FIG. 2 is a diagram for explaining the outline of cutting out a character image block in this embodiment. In the figure, 201 is an image block having a standard size, which is a character image block as it is. Reference numeral 202 denotes an image block for horizontally typed characters, which has a width (w1) wider than the standard character width. Reference numeral 203 is a standard block representing the standard size of the character image block, and the scanner 10
It is calculated by a predetermined method from the image block cut out from the character image read from No. 6. This standard block 20
By multiplying the horizontal width ws of 3 by a predetermined coefficient, the threshold value T for identifying the image block including the horizontal typesetting character can be obtained.
Then, the horizontal width of each clipped image block is compared with the threshold value T, and an image block having a horizontal width larger than T is determined to be an image block including a horizontal writing character, and division processing for horizontal writing characters is performed. For example, FIG. 2 shows a state in which the image block 202 has w1> T, and the image block is further projected in the row direction, and as a result, the image block is subdivided into three blocks.

【００２３】次に、図３及び図４を用いて本実施例１の
動作を更に詳細に説明する。Next, the operation of the first embodiment will be described in more detail with reference to FIGS. 3 and 4.

【００２４】図３は、実施例１の文字認識装置による文
字認識処理を表すフローチャートである。まず、ステッ
プＳ２０１でイメージスキャナ１０６を用いて文書画像
を入力し、得られた画像データをＲＡＭ１０３に格納す
る。このとき入力画像をディスプレイ１０４に表示する
ようにしてもよい。FIG. 3 is a flow chart showing the character recognition processing by the character recognition device in the first embodiment. First, in step S201, a document image is input using the image scanner 106, and the obtained image data is stored in the RAM 103. At this time, the input image may be displayed on the display 104.

【００２５】次に、縦書き文書について縦書き用文字切
り出しを行ない、その結果をＲＡＭ１０３に格納する。
次にステップＳ２０２において、画像ブロックの切り出
しを行う。縦書き用文字の切り出しは、従来例で説明し
たようにステップＳ２０２で縦方向の射影をとって行を
抽出し、各行について横方向の射影をとることで画像ブ
ロックを取り出す。但し、このままでは行内に横組み文
字が存在した場合、横組み文字の横幅が行の幅となって
しまう。従って、本例では、切り出された各画像ブロッ
クについて左右両側の余白部分の切り落しを行い、各画
像ブロックの横幅を修正する。Next, the vertical writing characters are cut out from the vertical writing document, and the result is stored in the RAM 103.
Next, in step S202, the image block is cut out. In the vertical character cutting, as described in the conventional example, in step S202, a vertical projection is taken to extract a row, and a horizontal projection is taken for each row to extract an image block. However, as it is, if there are horizontal characters in a line, the horizontal width of the horizontal characters becomes the width of the line. Therefore, in this example, the margins on the left and right sides of the cut-out image blocks are cut off to correct the width of each image block.

【００２６】次に、ステップＳ２０３において、文字単
位の画像ブロック（文字画像ブロック）を生成する。こ
こでは、従来技術のところで説明したように、各画像ブ
ロックについて分離、結合を施すことで、１文字単位の
画像ブロックを生成する。Next, in step S203, a character-by-character image block (character image block) is generated. Here, as described in the related art, each image block is separated and combined to generate an image block for each character.

【００２７】続くステップＳ２０４〜ステップＳ２０８
の処理により、全文字画像ブロックについて横組み文字
か否かの判定を行い、横組み文字と判定された文字画像
ブロックについては更に分離を行う。Subsequent steps S204 to S208
By this process, it is determined whether or not all the character image blocks are horizontal typesetting characters, and the character image blocks determined to be horizontal typesetting characters are further separated.

【００２８】ステップＳ２０４において、文字画像ブロ
ックを一つずつ注目していき、当該文字画像ブロックが
横組み文字を含むか否かを調べる。注目する文字画像ブ
ロックが横組み文字を含むと判定されたならば、その文
字画像ブロックに対して横書き文字用の切り出し処理を
施すべくステップＳ２０５へ進む。一方、注目している
文字画像ブロックが横組み文字を含まないと判定されれ
ばステップＳ２０７へ進む。尚、ステップＳ２０４にお
ける横組み文字か否かの判定方法については後述する。In step S204, the character image blocks are focused one by one to check whether or not the character image block includes horizontal characters. If it is determined that the character image block of interest includes horizontal typesetting characters, the process proceeds to step S205 to perform a horizontal character cutting process on the character image block. On the other hand, if it is determined that the character image block of interest does not include horizontal typesetting characters, the process proceeds to step S207. The method of determining whether or not the letter is a horizontal typeset character in step S204 will be described later.

【００２９】横書き文字用の切り出し処理では、まずス
テップＳ２０５で画像ブロックを取り出す。一般的に
は、横書き文字用の切り出し処理も、縦書き文字の切り
出し処理と同様の手順がとられる。即ち、入力画像に対
して左右方向の射影を取って横方向の行を抽出し、その
後、各行に対して縦方向に射影を取ることで画像ブロッ
クの切り出しが行なわれる。但し、本実施例の場合は、
注目画像ブロックに対しての処理なので、既に横方向の
行が抽出されているとして処理を始めてよい。In the clipping process for horizontally-written characters, first, in step S205, an image block is taken out. In general, the clipping process for horizontally-written characters has the same procedure as the clipping process for vertically-written characters. That is, the horizontal and horizontal projections are taken for the input image to extract horizontal rows, and then the vertical projections are taken for each row to cut out the image block. However, in the case of this embodiment,
Since the process is for the image block of interest, the process may be started assuming that the row in the horizontal direction has already been extracted.

【００３０】次に、ステップＳ２０６で文字単位の画像
ブロックを生成する。これは、縦書き文字画像における
文字画像ブロックの切り出しにおける画像ブロックの
「高さ」を、「幅」に置き換えることで、縦書きの場合
と同様に行うことができる。但し、縦書き文書の横組み
文字は文字数が少ないので、標準文字幅が精度良く求ま
らないことも多い。従って、本処理（ステップＳ２０
６）を割愛して、ステップＳ２０５で得られた画像ブロ
ックをそのまま文字画像ブロックとして用いてもよい。
そして、以上の結果をＲＡＭ１０３に格納する。Next, in step S206, an image block for each character is generated. This can be performed in the same manner as in the case of vertical writing by replacing "height" of the image block in cutting out the character image block in the vertically written character image with "width". However, since the number of horizontally typed characters in a vertically written document is small, the standard character width is often not obtained accurately. Therefore, this process (step S20
6) may be omitted and the image block obtained in step S205 may be used as it is as a character image block.
Then, the above result is stored in the RAM 103.

【００３１】ステップＳ２０７では注目している文字画
像ブロックを次のブロックに更新する。ステップＳ２０
８において、すべての文字画像ブロックについてステッ
プＳ２０４の調査を終えたか否かを判定し、終了してい
なければステップＳ２０４へ戻り上述の処理を繰り返
す。In step S207, the focused character image block is updated to the next block. Step S20
In 8, it is determined whether or not the investigation in step S204 has been completed for all the character image blocks. If not completed, the process returns to step S204 and the above-described processing is repeated.

【００３２】一方、すべてのブロックについてステップ
Ｓ２０４の調査が終了していれば、ステップＳ２１０へ
進む。ステップＳ２１０では、各文字画像ブロックにつ
いて識別演算を行ない、類似度が最も大きいカテゴリ
（文字）を認識結果とし、ＲＡＭ１０３に格納する。こ
こで、認識結果をディスプレイ１０４に表示するように
してもよい。On the other hand, if the investigation in step S204 has been completed for all blocks, the process proceeds to step S210. In step S210, identification calculation is performed for each character image block, and the category (character) having the highest similarity is stored as a recognition result in the RAM 103. Here, the recognition result may be displayed on the display 104.

【００３３】次に、ステップＳ２０４で行われる横組み
文字かどうかの判定について、図４のフローチャートを
用いてさらに詳細な説明を行なう。図４は、文字画像ブ
ロックが横組み文字を含むか否かを判定するための処理
手順を表すフローチャートである。Next, the determination as to whether the character is a horizontally set character in step S204 will be described in more detail with reference to the flowchart of FIG. FIG. 4 is a flowchart showing a processing procedure for determining whether or not a character image block includes horizontal characters.

【００３４】まずステップＳ３０１において全ての文字
画像ブロックの幅を求める。次に、ステップＳ３０２に
おいて、例えば、行内の文字画像ブロックの幅の平均を
求め、標準文字幅を求める。ここで、句読点や記号等、
幅の著しく小さいものが含まれると、標準文字幅の精度
が低下してしまうので例えば以下の処理を施す。First, in step S301, the widths of all character image blocks are obtained. Next, in step S302, for example, the average of the widths of the character image blocks in a line is calculated, and the standard character width is calculated. Where punctuation marks, symbols, etc.
If a character having a remarkably small width is included, the accuracy of the standard character width is reduced, so the following processing is performed, for example.

【００３５】例えば、先のステップＳ２０３における文
字画像ブロックの生成において求めた標準文字高Ｈと、
予め経験的に求められた値ｍ及びｎ（０＜ｍ＜ｎ）によ
り、画像ブロック幅ｗがｍＨ＜ｗ＜ｎＨの範囲にある文
字画像ブロックを抽出する。そして、これら抽出された
文字画像ブロックを用いて平均値の算出を行い、これを
標準文字幅とすることにより、標準文字幅の精度を向上
することができる。また、文字画像ブロックの幅に関す
る分布を取り、頻度が最大となるクラスの代表値を標準
文字幅としてもよい。ここで、上述のｍ及びｎの値とし
ては、ｍ＝０．７、ｎ＝１．３程度が好ましい。For example, the standard character height H obtained in the generation of the character image block in the previous step S203,
A character image block having an image block width w in the range of mH <w <nH is extracted based on the values m and n (0 <m <n) obtained empirically in advance. Then, the average value is calculated using these extracted character image blocks, and this is set as the standard character width, whereby the accuracy of the standard character width can be improved. Further, the distribution regarding the width of the character image block may be taken, and the representative value of the class having the highest frequency may be set as the standard character width. Here, the values of m and n described above are preferably about m = 0.7 and n = 1.3.

【００３６】ステップＳ３０３では、求めた標準文字幅
ｗを用いて、各文字画像ブロックが横組み文字を含むか
否かを判断するための閾値Ｔを決定する。一般に、縦書
き文書内において、横組み文字を含む文字画像ブロック
の横幅は標準文字幅ｗｓより大きくなる。従って、例え
ば、経験的に求めた値ｔ（ｔ＞１）を用いて閾値Ｔ＝ｔ
×ｗｓを求めておき、この閾値Ｔと各文字画像ブロック
の横幅とを比較することにより、各文字画像ブロックが
横組み文字列を含むか否かを判定できる。ここで、上述
のｔの値としては、ｔ＝１．２程度が好ましい。In step S303, the standard character width w thus obtained is used to determine a threshold value T for judging whether or not each character image block includes a horizontally set character. Generally, in a vertically written document, the horizontal width of a character image block including horizontal typesetting characters is larger than the standard character width ws. Therefore, for example, using the value t (t> 1) empirically obtained, the threshold value T = t
By determining xws and comparing the threshold value T with the width of each character image block, it can be determined whether or not each character image block includes a horizontal character string. Here, the value of t is preferably about t = 1.2.

【００３７】従って、ステップＳ３０４では、各画像ブ
ロックの幅ｗと閾値Ｔを比較する。ここで、ｗ＞Ｔなら
ば、当該文字画像ブロックは横組み文字を含むとしてス
テップＳ３０５に進む。一方、ｗ≦Ｔであればステップ
Ｓ３０６へ進み、通常の文字画像ブロックとする。Therefore, in step S304, the width w of each image block is compared with the threshold value T. Here, if w> T, it is determined that the character image block includes horizontal typesetting characters, and the process proceeds to step S305. On the other hand, if w ≦ T, the process proceeds to step S306, and a normal character image block is set.

【００３８】以上説明したように、本実施例１によれ
ば、縦書き内の横組み文字の文字画像ブロックの幅が通
常の文字画像ブロックの幅に比べて大きいことを利用
し、横組み文字を含む文字画像ブロックを抽出すること
が可能となる。更に、その文字画像ブロックに横書き用
の文字切り出し処理を行なうことで、縦書き文書の中に
含まれる横組み文字についても、文字単位に切り出すこ
とが可能となる。As described above, according to the first embodiment, by utilizing the fact that the width of the character image block of horizontal typesetting characters in vertical writing is larger than the width of a normal character image block, horizontal typesetting characters are used. It is possible to extract a character image block including a. Further, by performing the character cutting process for horizontal writing on the character image block, it is possible to cut out even horizontal typesetting characters included in the vertically written document in character units.

【００３９】なお、上記実施例は、本発明を実施するた
めの一構成例であり、各種の応用が可能であることはい
うまでもない。例えば、オペレータが認識領域を指定し
たり、認識結果を修正する操作ができるように構成され
ていたり、認識領域をオペレータが介在することなく自
動的に決定したり、本処理後に誤認識を減少させるため
の処理が加わるように構成されていてもよい。It is needless to say that the above-mentioned embodiment is a structural example for carrying out the present invention and various applications are possible. For example, it is configured so that the operator can specify a recognition area or perform an operation to correct the recognition result, automatically determine the recognition area without the operator's intervention, or reduce false recognition after this processing. The processing may be added.

【００４０】更に、閾値Ｔを各文字画像ブロックの高さ
に応じて変化させるように構成してもよい。上記実施例
１では、標準文字高さから求められた標準文字幅により
閾値Ｔを１つ決定しているが、全角文字の中に４倍角の
文字が混在した場合に、この４倍角文字を横組み文字と
して認識する可能性がある。従って、閾値Ｔを文字画像
ブロックの高さの関数（例えば閾値Ｔ＝ｋ×ｈ、ここで
ｋは定数、ｈは文字画像ブロックの高さ）とすれば、こ
のような不具合を解消できる。Further, the threshold value T may be changed according to the height of each character image block. In the first embodiment, one threshold value T is determined based on the standard character width obtained from the standard character height. However, when a quadruple-width character is mixed in a full-width character, this quadruple-width character is set to the horizontal direction. It may be recognized as a typeface. Therefore, if the threshold value T is a function of the height of the character image block (for example, threshold value T = k × h, where k is a constant and h is the height of the character image block), such a problem can be solved.

【００４１】また、汎用コンピュータに、本発明を実施
する処理を行なうプログラムを外部から提供し、ＲＡＭ
に本装置の制御プログラムを格納するように構成されて
いてもよいことはいうまでもない。A program for performing the processing of the present invention is externally provided to a general-purpose computer, and RAM is provided.
Needless to say, it may be configured to store the control program of the present apparatus.

【００４２】［実施例２］次に、実施例２について説明
する。上記実施例１では文字画像ブロックの大きさに基
づいて横組み文字を含む文字画像ブロックか否かを判定
している。本実施例２では、まず各文字画像ブロックに
ついて識別演算を行い、その類似度が所定値よりも小さ
いカテゴリしか存在しない文字画像ブロックについて、
横組み文字としての処理を施す。尚、実施例２の文字認
識装置の構成は実施例１（図１）と同様であるのでここ
では説明を省略する。[Second Embodiment] Next, a second embodiment will be described. In the above-described first embodiment, it is determined whether or not the character image block includes the horizontal typesetting character based on the size of the character image block. In the second embodiment, first, an identification calculation is performed on each character image block, and a character image block in which the degree of similarity exists only in a category smaller than a predetermined value,
Performs processing as horizontal characters. The configuration of the character recognition device according to the second embodiment is similar to that of the first embodiment (FIG. 1), and thus the description thereof is omitted here.

【００４３】図５は実施例２における文字認識の手順を
表すフローチャートである。前記実施例と同様、まずス
テップＳ４０１で文書画像を入力し、ステップＳ４０２
で画像ブロックを取り出し、更にステップＳ４０３で文
字単位の画像ブロック（文字画像ブロック）にする。そ
の後ステップＳ４０４で各文字画像ブロックについて識
別演算を行なう。FIG. 5 is a flow chart showing the procedure of character recognition in the second embodiment. Similar to the above-described embodiment, first, a document image is input in step S401, and then step S402.
In step S403, the image block is taken out, and further, in step S403, the image block is made in character units (character image block). Then, in step S404, an identification calculation is performed for each character image block.

【００４４】次に、文字画像ブロックの一つ一つについ
て注目し、各文字画像ブロックについて、横組み文字と
して文字の切り出し及び識別演算を行なうかどうかの判
定を行なう（ステップＳ４０５）。ここで、注目してい
る文字画像ブロックについて横組み文字として再処理を
行なうと判定された場合は、ステップＳ４０６へ進む。
又、横組み文字としての再処理を実行しない場合はステ
ップＳ４１０へ進む。このステップＳ４０５における判
定処理については図６のフローチャートにより後述す
る。Next, attention is paid to each of the character image blocks, and it is determined whether or not to perform character segmentation and identification calculation as horizontal characters for each character image block (step S405). Here, if it is determined that the remarked character image block is to be reprocessed as a horizontal typesetting character, the process proceeds to step S406.
If the re-processing for horizontal typesetting characters is not executed, the process proceeds to step S410. The determination process in step S405 will be described later with reference to the flowchart of FIG.

【００４５】ステップＳ４０６では、当該文字画像ブロ
ックに対して横書き用の文字切り出し処理を行なって新
たに画像ブロックを取り出す。更に、ステップＳ４０７
において、ステップＳ４０６で得られた画像ブロックか
ら文字画像ブロックの生成を行う。但し、実施例１でも
説明したように、ステップＳ４０７における画像ブロッ
クの分離、結合による文字画像ブロックの生成は、標準
文字幅が精度よく求められないので、このステップを省
略し、ステップＳ４０６で得られた画像ブロックをその
まま文字画像ブロックとしてもよい。In step S406, a character cutting process for horizontal writing is performed on the character image block and a new image block is taken out. Further, step S407
At, a character image block is generated from the image block obtained in step S406. However, as described in the first embodiment, since the standard character width cannot be accurately obtained in the generation of the character image block by separating and combining the image blocks in step S407, this step is omitted, and the standard character width is obtained in step S406. The image block may be used as it is as the character image block.

【００４６】ステップＳ４０８では、ステップＳ４０
６、ステップＳ４０７にて新たに切り出された文字画像
ブロックについて識別演算を行う。そして、ステップＳ
４０９において、再文字切り出し前の文字画像ブロック
（ステップＳ４０２〜ステップＳ４０３で切り出された
文字画像ブロック）と再文字切り出し後の文字画像ブロ
ック（ステップＳ４０６〜ステップＳ４０７で切り出さ
れた文字画像ブロック）とを比較し、信頼度が大きい方
の結果を採用し、これを認識結果とする。このステップ
Ｓ４０９における信頼度の判定は図７のフローチャート
を用いて後述する。In step S408, step S40
6. In step S407, identification calculation is performed on the newly cut out character image block. And step S
In 409, the character image block before re-character cutting (the character image block cut out in step S402 to step S403) and the character image block after the re-character cutting (character image block cut out in step S406 to step S407) are extracted. The result of comparison is adopted and the one with the higher reliability is adopted as the recognition result. The determination of the reliability in step S409 will be described later with reference to the flowchart of FIG.

【００４７】ステップＳ４１０では、注目文字画像ブロ
ックを次に進める。そして、ステップＳ４１１で全ての
文字画像ブロックを調べたか否かを判定し、まだ未処理
の文字画像ブロックがあればステップＳ４０５へ戻る。
一方、全ての文字画像ブロックについて処理を終了した
ならば本処理を終了する。In step S410, the focused character image block is advanced. Then, in step S411, it is determined whether or not all the character image blocks have been examined, and if there is an unprocessed character image block, the process returns to step S405.
On the other hand, when the processing is completed for all the character image blocks, this processing is completed.

【００４８】次に、上述のステップＳ４０５における判
定方法を説明する。図６は、各文字画像ブロックについ
て、横組み文字として再処理するか否かを判定する手順
を表すフローチャートである。Next, the determination method in step S405 described above will be described. FIG. 6 is a flowchart showing a procedure for determining whether or not to reprocess each character image block as a horizontal typeface character.

【００４９】ステップＳ４０４の識別演算において、全
カテゴリと各文字画像ブロックとの類似度を計算し、類
似度が最大のカテゴリを識別演算の結果としている。文
字の切り出しが誤っている場合のように、類似した文字
が存在しない画像を識別しようとした場合、その類似度
は、正しく切り出しが行われている場合に比べて、一般
的に低くなる。本例ではこの性質を用いて再処理が必要
か否かを判定する。ここで、文字切り出しの再処理が必
要か否かを判断するための類似度の閾値をＶとする。In the identification calculation of step S404, the similarity between all categories and each character image block is calculated, and the category with the maximum similarity is used as the result of the identification calculation. When trying to identify an image in which similar characters do not exist, such as when the character is cut out incorrectly, the degree of similarity is generally lower than when the cutout is performed correctly. In this example, this property is used to determine whether reprocessing is necessary. Here, the threshold value of the degree of similarity for determining whether or not the character cutout reprocessing is necessary is set to V.

【００５０】まず、ステップＳ５０１において、予め設
定されている類似度の閾値Ｖと注目する文字画像ブロッ
クの類似度ｖとを比較する。ここで、ｖ＜Ｖならばステ
ップＳ５０３に進み、再度処理を行なう文字画像ブロッ
クであると判定する。また、ｖ≧ＶであればステップＳ
５０２へ進む。ステップＳ５０２では、「１０」と
「Ю」や「００」と「∞」などのように誤認識しやすい
文字を予め記憶した誤認識リストを参照し、ステップＳ
４０４による認識結果この誤認識リストに含まれるかど
うかを調べる。認識結果が誤認識リストに存在したなら
ばステップＳ５０３に進み、当該文字画像ブロックを再
度処理するものと判定する。尚、誤認識リストはＲＯＭ
１０２もしくはＲＡＭ１０３に格納されている。First, in step S501, the preset similarity threshold V is compared with the similarity v of the character image block of interest. If v <V, the process advances to step S503 to determine that the character image block is to be processed again. If v ≧ V, step S
Proceed to 502. In step S502, an erroneous recognition list in which characters such as "10" and "Ю" or "00" and "∞" that are likely to be erroneously recognized are stored in advance is referred to.
It is checked whether the recognition result by 404 is included in this misrecognition list. If the recognition result exists in the false recognition list, the process advances to step S503 to determine that the character image block is to be processed again. The misrecognition list is in ROM
102 or RAM 103.

【００５１】以上のようにして、再び文字切り出し処理
を行なうべき文字画像ブロックが選択される。As described above, the character image block to be subjected to the character cutting process is selected again.

【００５２】また、ステップＳ４０９において、再文字
切り出し前後の認識結果のどちらを採用するかを決定す
るが、以下にこの処理について説明する。図７は、文字
切り出しの再処理（即ち横書き用の文字切り出し）が行
なわれた文字画像ブロックについて、再処理前後のどち
らの結果を採用するかを決定する手順を表すフローチャ
ートである。Further, in step S409, which of the recognition results before and after the re-character segmentation is to be adopted is determined. This process will be described below. FIG. 7 is a flowchart showing a procedure for determining which of the result before and after the re-processing is adopted for the character image block on which the re-processing of the character cutting (that is, the character cutting for horizontal writing) is performed.

【００５３】まず、ステップＳ６０１において、文字切
り出しの再処理（ステップＳ４０６〜ステップＳ４０７
による横書き用の文字切り出し）により分割された画像
ブロックに対する識別演算（ステップＳ４０８）の結
果、類似度が予め定めた閾値Ｖより小さいものが存在す
るか否かを判断する。類似度の小さいものが１つでも存
在する場合はステップＳ６０２に進み、再処理される前
の元の結果（即ちステップＳ４０４で得られた結果）を
最終的な認識結果とする。一方、新たに切り出された全
ての文字画像ブロックにおける類似度が閾値Ｖより大き
ければ、横組みの文字切り出しが成功したとして、ステ
ップＳ６０３で再処理された後の結果（即ちステップＳ
４０８で得られた結果）を最終結果とする。First, in step S601, the re-processing of the character segmentation (steps S406 to S407).
As a result of the identification calculation (step S408) for the image blocks divided by the horizontal writing character segmentation, it is determined whether or not there is a similarity degree smaller than a predetermined threshold value V. If even one with a low degree of similarity exists, the process proceeds to step S602, and the original result before reprocessing (that is, the result obtained in step S404) is set as the final recognition result. On the other hand, if the degree of similarity in all the newly cut out character image blocks is larger than the threshold value V, it is determined that the horizontal character cutting has succeeded, and the result after reprocessing in step S603 (that is, step S603).
The result obtained in 408) is the final result.

【００５４】以上説明したように本実施例２によれば、
識別演算による類似度を、縦書き内横組み文字のための
処理を行なうかどうかの判定に用いることで、縦書き文
書の中に含まれる横組み文字部分を認識することが可能
となる。さらに、縦書き内の横組文字の処理が終了した
あとに、その結果を採用するかどうかを判定すること
で、誤認識を減少させ、認識精度を向上させる効果があ
る。As described above, according to the second embodiment,
By using the similarity calculated by the discrimination operation to determine whether or not to perform the process for horizontal writing in vertical writing, it becomes possible to recognize the horizontal writing part included in the vertical writing document. Further, after the processing of the horizontal writing characters in the vertical writing is completed, it is determined whether or not the result is adopted, which has the effect of reducing the erroneous recognition and improving the recognition accuracy.

【００５５】また実施例１で説明した、文字画像ブロッ
クの大きさを用いて再処理を行うか否かの判断を行う処
理を本実施例２に加えた構成で実施してもよいことはい
うまでもない。Further, it is to be understood that the processing for determining whether or not to perform the reprocessing using the size of the character image block, which has been described in the first embodiment, may be implemented by the configuration added to the second embodiment. There is no end.

【００５６】以上説明したように、縦書き文書の認識を
行なう場合、縦書き内横組文字を判定し、その部分に対
し横書き用の処理を実施することで、これまで正しく行
なうことができなかった縦書き内横組み文字の認識を可
能にし、ＯＣＲによる入力作業をより正確に行なえ、誤
認識や認識不能文字の修正を軽減する効果がある。As described above, in the case of recognizing a vertically written document, it is impossible to correctly perform it until now by determining the horizontal writing character in the vertical writing and performing the processing for horizontal writing on that portion. In addition, it is possible to recognize vertical writing in horizontal writing mode, more accurately perform input work by OCR, and reduce erroneous recognition and correction of unrecognizable characters.

【００５７】尚、上記実施例においては縦書き文書にお
ける横組み文字の混在を説明したが、横書き文書におけ
る立て組み文字の混在に対しても上述と同様の概念で実
現できることはいうまでもない。In the above embodiment, the mixture of horizontal typesetting characters in a vertically written document has been described, but it goes without saying that the mixture of vertical typesetting characters in a horizontal writing document can be realized by the same concept as described above.

【００５８】尚、本発明は、複数の機器から構成される
システムに適用しても１つの機器からなる装置に適用し
ても良い。また、本発明はシステム或いは装置に本発明
により規定される処理を実行させるプログラムを供給す
ることによって達成される場合にも適用できることはい
うまでもない。The present invention may be applied to a system composed of a plurality of devices or an apparatus composed of a single device. Further, it goes without saying that the present invention can also be applied to a case where it is achieved by supplying a program that causes a system or an apparatus to execute the processing defined by the present invention.

【００５９】[0059]

【発明の効果】以上説明したように、本発明によれば、
行方向の行中に含まれる文字の切り出し時に、列方向の
文字列についても文字の切り出しを行うことが可能とな
り、文字の認識率が向上する。As described above, according to the present invention,
When a character included in a line in the row direction is cut out, it is possible to cut out a character string in the column direction, and the character recognition rate is improved.

【００６０】[0060]

[Brief description of drawings]

【図１】本発明を実施するための文字認識装置の構成を
示すブロック図である。FIG. 1 is a block diagram showing a configuration of a character recognition device for carrying out the present invention.

【図２】本実施例における文字画像ブロックの切り出し
の概要を説明する図である。FIG. 2 is a diagram illustrating an outline of clipping a character image block according to the present embodiment.

【図３】実施例１の文字認識装置による文字認識処理を
表すフローチャートである。FIG. 3 is a flowchart showing a character recognition process by the character recognition device in the first embodiment.

【図４】文字画像ブロックが横組み文字を含むか否かを
判定するための処理手順を表すフローチャートである。FIG. 4 is a flowchart showing a processing procedure for determining whether or not a character image block includes horizontal characters.

【図５】実施例２における文字認識の手順を表すフロー
チャートである。FIG. 5 is a flowchart showing a procedure of character recognition in the second embodiment.

【図６】各文字画像ブロックについて横組み文字として
再処理するか否かを判定する手順を表すフローチャート
である。FIG. 6 is a flowchart showing a procedure for determining whether or not to re-process each character image block as horizontal typesetting characters.

【図７】再処理前後のどちらの認識結果を採用するかを
決定する手順を表すフローチャートである。FIG. 7 is a flowchart showing a procedure for determining which recognition result before and after reprocessing is adopted.

【図８】一般的なＯＣＲによる文字認識処理の手順を表
すフローチャートである。FIG. 8 is a flowchart showing a procedure of character recognition processing by general OCR.

【図９】一般的な文字認識処理において用いられる画像
ブロックの取り出し方法を説明する図である。FIG. 9 is a diagram illustrating a method of extracting an image block used in a general character recognition process.

【図１０】横組みの文字を含む縦書き文書の例を表す図
である。FIG. 10 is a diagram illustrating an example of a vertically written document including horizontally typed characters.

[Explanation of symbols]

１０１ＣＰＵ１０２ＲＯＭ１０３ＲＡＭ１０４ディスプレイ１０５キーボード１０６イメージスキャナ 101 CPU 102 ROM 103 RAM 104 Display 105 Keyboard 106 Image Scanner

Claims

[Claims]

1. A character recognition device for recognizing a character from an image, wherein a row is extracted based on a projection of the image in a row direction, and a recognition target is based on a projection in a column direction orthogonal to the extracted row. Acquisition means for acquiring a rectangular area containing characters, determination means for determining whether or not each of the rectangular areas includes a column-direction character string, and the determination means is determined to include a column-direction character string The rectangular area includes division means for dividing the rectangular area in a column direction to generate a new rectangular area, and recognition means for performing character recognition processing for each rectangular area obtained by the acquisition means and the division means. Character recognition device.

2. The acquisition means extracts a row based on a projection in the row direction of the image, cuts out a recognition target character based on a projection in a column direction orthogonal to the extracted row, and extracts the recognition target character. The substantially smallest rectangular area to be included is obtained, and the determination means determines whether or not each of the rectangular areas includes a character string in the column direction based on the size in the column direction. The character recognition device according to claim 1.

3. The determining means generates a standard column-direction length of the rectangular area based on the average row-direction length of the rectangular area obtained by the acquiring means, and the standard column-direction length of the rectangular area is calculated for each of the rectangular areas. The character recognition device according to claim 2, wherein it is determined whether or not a character string in a column direction is included based on the column direction length and the standard column direction length.

4. The division means performs a projection in the row direction on a rectangular area determined to include a character string in the column direction by the determination means to cut out a recognition target character to generate a new rectangular area. The character recognition device according to claim 1, wherein:

5. A character recognition device for recognizing a character from an image, wherein a row is extracted based on a projection of the image in a row direction, and a recognition target is based on a projection in a column direction orthogonal to the extracted row. Acquisition means for acquiring a rectangular area including a character; first recognition means for performing character recognition by acquiring the similarity to a standard pattern for each rectangular area acquired by the acquisition means; and acquisition by the first recognition means A dividing unit that divides a rectangular region whose similarity is smaller than a predetermined value in the column direction to obtain a divided rectangular region, and a second recognizing unit that performs character recognition on each of the divided rectangular regions. A character recognition device characterized by comprising.

6. A storage means for storing characters having a shape similar to a character in a column direction is further provided, wherein the dividing means has a similarity degree smaller than a predetermined value acquired by the first recognition means, or The character recognition according to claim 5, wherein when the recognition result by the first recognition means is a character stored in the storage means, the rectangular area is obtained by dividing the rectangular area in the column direction. apparatus.

7. If at least one of the rectangular areas obtained by the division by the dividing means has a degree of similarity obtained by the second recognizing means that is equal to or less than a predetermined value, the division of the divided rectangular areas is performed. The character recognition device according to claim 5, further comprising a selection unit that selects a recognition result of the first recognition unit for the previous rectangular area.

8. A character recognition method for recognizing a character from an image, wherein a row is extracted based on a projection in a row direction of the image, and a recognition target is based on a projection in a column direction orthogonal to the extracted row. An acquisition step of acquiring a rectangular area containing a character, a determination step of determining whether or not each of the rectangular areas includes a column-direction character string, and the determination step is determined to include a column-direction character string. It is characterized by comprising a dividing step of dividing a rectangular area in the column direction to generate a new rectangular area, and a recognition step of performing character recognition processing for each rectangular area obtained by the acquisition step and the dividing step. Character recognition method.

9. A character recognition method for recognizing a character from an image, wherein a row is extracted based on a projection in a row direction of the image, and a recognition target is based on a projection in a column direction orthogonal to the extracted row. An acquisition step of acquiring a rectangular area containing a character; a first recognition step of performing character recognition by acquiring a similarity with a standard pattern for each rectangular area acquired by the acquisition step; and an acquisition step of the first recognition step. A dividing step of dividing a rectangular area whose similarity is smaller than a predetermined value in the column direction to obtain a divided rectangular area; and obtaining a similarity with a standard pattern for each of the divided rectangular areas. A second recognition step of performing character recognition, the character recognition method comprising:

10. A character cutout method for cutting out a character from an image, wherein a row is extracted based on a projection in a row direction of the image, and a recognition target is based on a projection in a column direction orthogonal to the extracted row. An acquisition step of acquiring a rectangular area containing a character, a determination step of determining whether or not each of the rectangular areas includes a column-direction character string, and the determination step is determined to include a column-direction character string. A character segmentation method comprising: dividing a rectangular area in a column direction to generate a new rectangular area.