JPH096906A

JPH096906A - Method and device for image processing

Info

Publication number: JPH096906A
Application number: JP7153313A
Authority: JP
Inventors: Hiroaki Ikeda; 裕章池田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1995-06-20
Filing date: 1995-06-20
Publication date: 1997-01-10

Abstract

PURPOSE: To handle ruby (small annotation characters printed beside the main text) separately from, for example, the main text when an image of a document containing ruby is input and characters are segmented from the document and recognized. CONSTITUTION: A document image is input (S201), character image blocks are extracted from the document image (S202), and it is determined whether each extracted character image block is ruby (S204); identification information that distinguishes ruby is added to each image determined to be ruby, and the images are stored (S206). In recognition processing, only the character images other than ruby, selected according to the identification information, are recognized (S207 and S208).

Description

Detailed Description of the Invention

[0001]

【Field of Industrial Application】 The present invention relates to an image processing method and apparatus capable of extracting character regions from the image information of a document.

[0002]

【Prior Art】 Conventionally, a character recognition apparatus is configured so that, after an image is input, it first takes a projection 801 in the line direction, as shown in FIG. 8, and extracts lines as indicated by the dotted rectangles 802. It then takes a projection 803 in the direction perpendicular to the first projection and extracts an image block for each character, as indicated by the solid rectangles 804. In addition, in order to correctly extract characters that have been split apart and characters that touch each other, some apparatuses correct the character image blocks obtained by projection. They derive a reference character size for each line from the average width of the image blocks, or from the most frequent value in the distribution of image block widths. When several image blocks, once combined, have a width equal to the reference character size, those blocks are merged; when an image block is an integer multiple of the reference character size, it is divided into that number of equal parts to determine the character-unit image blocks. Furthermore, when it is judged that an image block containing ruby has been extracted, a reference ruby separation size is derived for each line from the average height of the image blocks, or from the most frequent value in the distribution of image block heights. Image blocks whose height exceeds the ruby separation size are treated as main text, image blocks whose height does not exceed it are judged to be ruby, and character-unit image blocks are determined with the main text and the ruby separated.
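The two-pass projection described above can be sketched as follows. This is a minimal illustration, not the apparatus's actual implementation: it assumes a binary image stored as a list of rows (1 = ink, 0 = background) and horizontal writing, and the function names `runs_of_ink`, `extract_lines`, and `extract_characters` are hypothetical.

```python
from typing import List, Tuple

def runs_of_ink(profile: List[int]) -> List[Tuple[int, int]]:
    """Return (start, end) pairs, end exclusive, for each run of non-zero entries."""
    runs, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i                      # a run of ink begins
        elif not value and start is not None:
            runs.append((start, i))        # the run ended at the previous index
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs

def extract_lines(image: List[List[int]]) -> List[Tuple[int, int]]:
    """First projection: sum each pixel row, then split into line extents."""
    return runs_of_ink([sum(row) for row in image])

def extract_characters(image: List[List[int]], top: int, bottom: int) -> List[Tuple[int, int]]:
    """Second projection: sum each pixel column inside one line, then split."""
    width = len(image[0])
    profile = [sum(image[y][x] for y in range(top, bottom)) for x in range(width)]
    return runs_of_ink(profile)
```

The merge and split corrections based on the reference character size would operate on the extents returned by `extract_characters`; they are omitted here to keep the sketch short.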

[0003] After the image blocks have been extracted, a discrimination operation is performed to compute the similarity between each block and a recognition dictionary that stores the standard features of each character, and the character with the highest similarity is output as the recognition result.
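The discrimination operation can be illustrated with a small nearest-template sketch. The text does not say which similarity measure or feature representation is used, so the cosine similarity, the plain feature vectors, and the names `cosine_similarity` and `recognize` are all assumptions made for illustration.

```python
import math
from typing import Dict, Sequence

def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between two feature vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def recognize(features: Sequence[float], dictionary: Dict[str, Sequence[float]]) -> str:
    """Return the dictionary character whose standard features are most similar."""
    return max(dictionary, key=lambda ch: cosine_similarity(features, dictionary[ch]))
```

Any other similarity or distance measure could be substituted without changing the surrounding flow, since only the best-scoring character is kept.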

[0004]

【Problem to Be Solved by the Invention】 In the conventional example described above, however, when an image of a document containing ruby is input and characters are segmented and recognized, the distinction between ruby and main text is lost once the image blocks have been extracted. As a result, lines consisting only of ruby appear in the output, or ruby is output mixed into the main text.

[0005] An object of the present invention is to prevent ruby and main text from being output mixed together even when ruby is extracted from the input document image. A further object is to correctly identify lines that consist only of ruby.

[0006]

【Means for Solving the Problem】 To solve the conventional problems described above, the present invention provides an image processing method and apparatus that extract character image blocks from an input document image and determine that an extracted character image is ruby.

[0007] To solve the conventional problems described above, the present invention preferably stores the extracted character images in storage means, and stores each character image determined to be ruby in the storage means together with identification information that identifies it as ruby.

[0008] To solve the conventional problems described above, the present invention preferably outputs the extracted character image blocks.

[0009] To solve the conventional problems described above, the present invention preferably controls the output so that character images determined to be ruby are not output.

[0010] To solve the conventional problems described above, the present invention preferably controls the output so that, among the extracted character image blocks, the image blocks other than those determined to be ruby are output.

[0011] To solve the conventional problems described above, the present invention preferably recognizes the output character image blocks.

[0012] To solve the conventional problems described above, the present invention preferably displays the recognition result on display means.

[0013] To solve the conventional problems described above, the present invention preferably prints the recognition result with printing means.

[0014] To solve the conventional problems described above, the present invention preferably performs control so that, among the character images stored in the storage means, the character images to which no identification information indicating ruby has been added are selected and recognized.

[0015] To solve the conventional problems described above, the present invention preferably recognizes the character images extracted by the character image extracting means and, from the recognition results, selects and outputs the results obtained from character images judged not to be ruby.

[0016] To solve the conventional problems described above, the present invention preferably stores the input document image in storage means and deletes the character images judged to be ruby from the storage means.

[0017] To solve the conventional problems described above, the present invention preferably extracts line image blocks from the input document image instead of extracting character images, and the ruby determination determines that an extracted line image is a ruby line.

[0018] To solve the conventional problems described above, in the present invention the ruby determination is preferably made according to whether lines exist before and after the line of interest; when it is judged that there are no lines before or after it, the image of the line of interest is determined not to be ruby.

[0019] To solve the conventional problems described above, in the present invention the ruby determination is preferably made by comparing the line width of the line of interest with the line width of the preceding line; when the value relative to the preceding line is smaller than a reference value, the image of the line of interest is determined to be ruby.

[0020] To solve the conventional problems described above, in the present invention the ruby determination is preferably made by comparing the line width of the line of interest with the line width of the following line; when the value relative to the following line is smaller than a reference value, the image of the line of interest is determined to be ruby.

[0021] To solve the conventional problems described above, in the present invention the ruby determination is preferably made from the spacing between the line of interest and the lines before and after it; when the spacing to the preceding line is larger than the spacing to the following line, the image of the line of interest is determined to be ruby.

[0022] To solve the conventional problems described above, in the present invention the ruby determination is preferably made from the position of the line of interest; when the line of interest is judged to lie within a reference range, the line of interest is determined to be ruby.

[0023]

【Embodiments】

(First Embodiment) FIG. 1 is a block diagram showing the configuration of an image processing apparatus for carrying out the present invention. Reference numeral 101 denotes a CPU that controls the entire apparatus according to a control program stored in a ROM 102. 102 denotes the ROM, which stores the control programs of the apparatus executed by the CPU 101, including the processing shown in the flowcharts described later. 103 denotes a RAM that stores document images, the dictionary used for character recognition, and parameters used during processing, such as the parameter a. 104 denotes an external storage device such as a magnetic disk, 105 a display such as a CRT or liquid crystal display, 106 a keyboard, 107 a pointing device such as a mouse, and 108 an image scanner for optically reading images. 109 denotes a printer such as an LBP or inkjet printer. 110 denotes a communication I/F that controls data communication with external devices over a public line or the like, and also allows images to be input and results to be output. 111 denotes a data bus used to transfer data between the components.

[0024] The processing of this embodiment, executed by the image processing apparatus configured as shown in FIG. 1, will be described with reference to the flowchart of FIG. 2.

[0025] In S201, a document image read optically by the scanner 108 or a similar device is input. In S202, character-unit image blocks are extracted from the input document image as described in the prior art section. In S203, information such as the positions of the extracted lines and the positions and sizes of the character image blocks cut out of each line rectangle is stored in the RAM 103. In S204, it is then determined whether each extracted image block is ruby. One example of the detailed processing of the ruby determination in S204 is described with reference to the flowchart of FIG. 3.

[0026] The ruby determination method described below identifies ruby character images by determining whether a line extracted in S202 consists only of ruby.

[0027] In S301, the line containing the image block of interest is found, and it is determined whether lines exist before and after that line. This can be determined from the position information of the line rectangles stored in the RAM 103 in S203. If there are no lines before or after the line of interest and it is judged to be a standalone line, the line can be judged not to consist of ruby image blocks (S307). In S302, the width of the line of interest is compared with the width of the preceding line multiplied by a predetermined value a that is less than 1, and it is checked whether the value for the line of interest is the smaller of the two. The value a is a parameter preset as the reference value for identifying ruby lines by comparing line widths with the neighboring lines; it is stored in the RAM 103 and takes a value of about 0.65 in this embodiment. The line width is the vertical size of the line rectangle if the document in the input image is written horizontally, and the horizontal size if it is written vertically. Alternatively, instead of the width of the block cut out as the line rectangle, the line width may be the maximum size, in the line-width direction, of the image blocks judged to belong to that line. In S303, the same comparison that was made against the preceding line is made against the following line. If it is judged in S302 or S303 that the width of the line of interest is not sufficiently small compared with the width of the preceding or following line (NO), the line of interest can be judged not to consist of ruby image blocks (S307).
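The checks in S301 to S303 can be sketched as a single predicate. This is only an illustration under stated assumptions: the function name `passes_width_tests` is hypothetical, and the handling of a line that has only one neighbor (the comparison against the missing neighbor is skipped) is an assumption, since the text only covers the case where no neighboring line exists at all.

```python
from typing import Optional

def passes_width_tests(width: float,
                       prev_width: Optional[float],
                       next_width: Optional[float],
                       a: float = 0.65) -> bool:
    """Return True if the line of interest is still a ruby candidate after S301-S303.

    None stands for "no such neighboring line".
    """
    # S301: a standalone line with no neighbors is not ruby.
    if prev_width is None and next_width is None:
        return False
    # S302 / S303: the line must be narrower than a times each existing neighbor.
    # Assumption: a missing neighbor is skipped rather than failing the test.
    for neighbour in (prev_width, next_width):
        if neighbour is not None and not width < a * neighbour:
            return False
    return True
```

With a = 0.65, a line of width 10 between lines of width 30 and 32 remains a candidate, while a line of width 25 between the same neighbors is rejected.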

[0028] Next, in S304, the spacing to the preceding line is compared with the spacing to the following line. Using the position information of the line rectangles stored in the RAM 103, the spacing between the line of interest and the preceding line is compared with the spacing between the line of interest and the following line. If the line of interest is a ruby line, its spacing to the following line should be the narrower of the two. Therefore, if the spacing to the preceding line is judged to be larger than the spacing to the following line, processing proceeds to the next ruby determination step; if it is judged to be smaller (NO), the line of interest can be judged not to be ruby (S307). In S305, the extent of the line is then examined. For horizontal writing, the extent is the span between the left edge of the first block in the line and the right edge of the last block. A ruby line does not extend beyond the extent of the following line, so the test is whether the line of interest lies within the extent of the following line. Because ruby attached to the characters at either end of a line may extend slightly beyond the main text, the extent of the following line may be widened on the left and right by a predetermined amount. That amount may be set to the standard character size.
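The spacing test (S304) and the extent test (S305) can be sketched as follows for horizontal writing. The `LineRect` class, its field names, and the function names are hypothetical; the coordinate system (y increasing downward, so the preceding line has the smaller `top`) is an assumption.

```python
from dataclasses import dataclass

@dataclass
class LineRect:
    top: int     # y of the upper edge
    bottom: int  # y of the lower edge
    left: int    # x of the left edge of the first block
    right: int   # x of the right edge of the last block

def gap(upper: LineRect, lower: LineRect) -> int:
    """Vertical spacing between two consecutive lines."""
    return lower.top - upper.bottom

def closer_to_next(prev: LineRect, cur: LineRect, nxt: LineRect) -> bool:
    """S304: a ruby candidate sits closer to the following line than to the preceding one."""
    return gap(prev, cur) > gap(cur, nxt)

def within_next_extent(cur: LineRect, nxt: LineRect, margin: int = 0) -> bool:
    """S305: the candidate lies inside the following line's extent, widened by margin."""
    return nxt.left - margin <= cur.left and cur.right <= nxt.right + margin
```

Passing the standard character size as `margin` corresponds to the widening mentioned above for ruby attached to the characters at the ends of a line.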

[0029] A line of interest that has passed every test up to this point may be judged to be a ruby line. In S306 the block of interest is marked as a ruby image block; in all other cases it is judged in S307 not to be a ruby image block. The processing of S301 to S307 is repeated for all the line rectangles extracted in S202 and stored in the RAM 103.

[0030] The other image blocks in the line of interest are also treated as ruby image blocks, so the determination need not be repeated for them. In addition, the ruby determination steps S301 to S305 described with the flowchart of FIG. 3 need not always all be performed; they may be performed selectively.

[0031] When the ruby image block determination by the processing shown in the flowchart of FIG. 3 is complete, S205 checks, for each line rectangle, whether its image blocks were determined to be ruby in S204. If so, S206 adds to each such image block an attribute or similar marker indicating that it is ruby, so that it can be distinguished from the other image blocks, and stores it in the RAM 103. The processing of S205 and S206 is repeated for all the line rectangles processed in S204.

[0032] In S207, the extracted image blocks other than those determined to be ruby are selected from the RAM 103 on the basis of the attributes added to the image blocks. In S208, features are extracted from the selected image blocks and compared with the character recognition dictionary in the RAM 103, which stores the standard features of each character, in a discrimination operation that computes the similarity. In S209, the character with the highest similarity in the discrimination operation is displayed on the display 105 as the recognition result for that character image block.
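The flow of S205 to S208 (tag ruby blocks, then run recognition only on the untagged ones) can be sketched as follows. The `CharBlock` class, its fields, and the function names are hypothetical, and the recognizer is passed in as a callable because the actual discrimination operation is not specified in detail.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Set

@dataclass
class CharBlock:
    image_id: str          # stand-in for the stored block image
    line_index: int        # index of the line rectangle the block was cut from
    is_ruby: bool = False  # attribute added in S206

def tag_ruby(blocks: Iterable[CharBlock], ruby_lines: Set[int]) -> None:
    """S205/S206: mark every block that belongs to a line judged to be ruby."""
    for block in blocks:
        block.is_ruby = block.line_index in ruby_lines

def recognize_body_text(blocks: Iterable[CharBlock],
                        recognize: Callable[[CharBlock], str]) -> List[str]:
    """S207/S208: select non-ruby blocks by attribute and recognize only those."""
    return [recognize(block) for block in blocks if not block.is_ruby]
```

Because the filter runs before `recognize` is called, the recognizer is never invoked on ruby blocks, which is the source of the reduced processing load in this embodiment.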

[0033] As described above, by determining the ruby image blocks first and then performing the discrimination operation only on the non-ruby blocks, the discrimination operation is never run on ruby whose recognition results are not wanted in the display, and the amount of processing is reduced.

[0034] The above embodiment has been described with the minimum configuration needed to carry out the present invention. However, the apparatus may also be configured, for example, so that data such as a program that performs the processing of the present invention is supplied to a general-purpose computer from outside, or is stored in advance in the external storage device 104, and is then loaded into the RAM 103. It goes without saying that the ruby determination method for carrying out the present invention is not limited to the one in this embodiment. It can also be readily inferred that the present invention can be carried out for both horizontally written and vertically written documents.

[0035] The recognition result of the document image displayed in S209 is printed by the printer 109 in response to an instruction.

[0036] (Second Embodiment) The processing of the second embodiment, executed by the image processing apparatus configured as shown in FIG. 1, will be described with reference to the flowchart of FIG. 4.

[0037] In S401, a document image read optically by the scanner 108 or a similar device is input. In S402, character-unit image blocks are extracted from the input document image as described in the prior art section. In S403, information such as the positions of the extracted line rectangles and the positions and sizes of the character image blocks cut out of each line rectangle is stored in the RAM 103. In S404, it is determined whether each extracted image block is ruby, using the method described in detail for S204. In S405, the image blocks determined to be ruby in S404 are identified, and in S406 an attribute or similar marker indicating that the block is ruby, so that it can be distinguished from the other image blocks, is added to each identified image block and stored in the RAM 103.

[0038] In S407, the same discrimination operation as in S208 is performed on all the image blocks extracted in S402. In S408, on the basis of the attributes added when the image blocks were stored in the RAM 103, the results of the discrimination operation obtained from the image blocks other than those determined to be ruby are selected, and they are displayed on the display 105 (S409).
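The ordering in S407 and S408 (recognize every block, then filter the results by the ruby attribute) can be sketched as follows. The dictionary keys `glyph` and `is_ruby` and the function name are hypothetical, and the recognizer is again a caller-supplied stand-in.

```python
from typing import Callable, Dict, List

def recognize_then_filter(blocks: List[Dict],
                          recognize: Callable[[Dict], str]) -> List[str]:
    """Recognize all blocks (S407), then keep only non-ruby results (S408)."""
    results = [(block, recognize(block)) for block in blocks]  # S407: every block
    return [text for block, text in results if not block["is_ruby"]]  # S408: filter
```

Unlike a flow that filters before recognition, this version spends recognition effort on ruby blocks too, but the ruby results remain available in `results` should they be needed later.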

[0039] As described above, displaying only the recognition results for blocks other than those determined to be ruby in the ruby image block determination prevents ruby from being displayed mixed into the text.

[0040] (Third Embodiment) The processing of the third embodiment, executed by the image processing apparatus configured as shown in FIG. 1, will be described with reference to the flowchart of FIG. 5.

[0041] In S501, a document image read optically by the scanner 108 or a similar device is input. In S502, character-unit image blocks are extracted from the input document image as described in the prior art section. In S503, information such as the positions of the extracted line rectangles and the positions and sizes of the character image blocks cut out of each line rectangle is stored in the RAM 103, and in S504 the same discrimination operation as in S208 is performed on all the extracted image blocks.

[0042] At this point, whether each extracted image block is ruby is determined by the method described in detail for S204 (S505). In S506, only the results of the discrimination operation obtained from image blocks other than those determined to be ruby in S505 are selected, and in S507 they are displayed on the display 105.

[0043] As described above, by determining whether a block is ruby at the stage where the recognition results are displayed, the same result as in the third embodiment can be obtained without a step that stores the fact that a block is ruby.

[0044] (Fourth Embodiment) The processing of the fourth embodiment, executed by the image processing apparatus configured as shown in FIG. 1, will be described with reference to the flowchart of FIG. 6.

[0045] In S601, a document image read optically by the scanner 108 or a similar device is input. In S602, character-unit image blocks are extracted from the input document image as described in the prior art section. In S603, information such as the positions of the extracted line rectangles and the positions and sizes of the character image blocks cut out of each line rectangle is stored in the RAM 103. In S604, whether each extracted image block is ruby is determined by the method described in detail for S204. In S605, the image blocks determined to be ruby in S604 are selected, and the images of the selected image blocks are deleted from the information stored in the RAM 103 in S603 (S606). In S607, the same discrimination operation as in S208 is performed on the image blocks remaining in the RAM 103, and in S608 the results of the discrimination operation are displayed on the display 105.

[0046] As described above, deleting the image blocks judged to be ruby from the storage area reduces the amount of storage used.

[0047] (Fifth Embodiment) The processing of the fifth embodiment, executed by the image processing apparatus configured as shown in FIG. 1, will be described with reference to the flowchart of FIG. 7.

[0048] In S701, a document image read optically by the scanner 108 or a similar device is input. In S702, lines are extracted from the input document image as described in the prior art section. As explained in the first embodiment, once the positions of the lines are known it is possible to determine whether a line is a ruby line, so in S703 the ruby line determination is performed by the method described in detail for S204. If it is judged in S704 that the line of interest is not a ruby line, character-unit image blocks are extracted in S705 as described in the prior art section, and in S706 information such as the positions and sizes of the extracted character image blocks is stored in the RAM 103. If the line of interest is judged in S704 to be a ruby line, the ruby image blocks are extracted in S707, and in S708 the extracted character image block information is stored in the RAM 103 together with an attribute indicating that the character image blocks are ruby.

[0049] After that, the same discrimination operation as in S208 is performed in S709 on the character image blocks stored in the RAM 103, and if the block of interest is found in S710 not to be ruby, its result is displayed on the display 105 in S711.
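The line-level flow of S703 to S711 can be sketched as follows: the ruby decision is made once per line, and every block cut from that line inherits it. All names here are hypothetical, and the line classifier, segmenter, and recognizer are caller-supplied stand-ins for the steps described in the text.

```python
from typing import Callable, Dict, List, Sequence

def segment_with_line_level_ruby(lines: Sequence[str],
                                 is_ruby_line: Callable[[int], bool],
                                 segment: Callable[[str], List[str]]) -> List[Dict]:
    """Decide ruby once per line, then cut blocks that inherit the decision."""
    blocks = []
    for index, line in enumerate(lines):
        ruby = is_ruby_line(index)              # S703/S704: one decision per line
        for block in segment(line):             # S705 (body text) or S707 (ruby)
            blocks.append({"block": block, "is_ruby": ruby})  # S706 / S708
    return blocks

def displayable(blocks: List[Dict], recognize: Callable[[str], str]) -> List[str]:
    """S709-S711: recognize every stored block, display only the non-ruby results."""
    shown = []
    for entry in blocks:
        text = recognize(entry["block"])        # S709
        if not entry["is_ruby"]:                # S710
            shown.append(text)                  # S711
    return shown
```

In this sketch `is_ruby_line` is called once per line rather than once per character block, which is where the reduction in the number of determinations comes from.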

[0050] As described above, determining ruby at the time of line extraction reduces the number of ruby determinations.

[0051] In the above embodiment, the present invention can also be carried out without extracting image blocks from ruby lines, or without performing the discrimination operation on them.

[0052] Each of the embodiments so far gives only the minimum description needed to carry out the present invention, but it can be readily inferred that the present invention can also be carried out by combining them.

[0053] In the first to fifth embodiments above, the character image blocks detected as ruby are not output, but instruction means for outputting ruby together with the main text may additionally be provided. With such means, the character image blocks detected as ruby are output with "ruby" identification information added, so that character image blocks carrying the "ruby" identification information can be handled separately from the character image blocks of the main text. Then, only when an instruction is given to output ruby together with the main text, or separately from it, the character image blocks carrying the "ruby" identification information are output, and the recognition results for the ruby recognized on the basis of this information may be output beside the main text in an output mode that distinguishes them from the main text (a different color, a different character size, a hatching pattern, and so on).
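The optional ruby output can be sketched as a rendering function with a flag that plays the role of the instruction means. The function name `render`, the `include_ruby` flag, and the use of parentheses as the distinguishing output mode are all assumptions for illustration; the text itself suggests color, character size, or hatching.

```python
from typing import List, Tuple

def render(results: List[Tuple[str, bool]], include_ruby: bool = False) -> str:
    """Join recognition results; each item is (text, is_ruby).

    Ruby results are dropped unless include_ruby is set, in which case they
    are wrapped in parentheses as a stand-in for a distinguishable style.
    """
    parts = []
    for text, is_ruby in results:
        if not is_ruby:
            parts.append(text)          # main text is always output
        elif include_ruby:
            parts.append(f"({text})")   # ruby only on explicit instruction
    return "".join(parts)
```

With `include_ruby` left at its default, the output matches the behavior of the first to fifth embodiments, where ruby is suppressed.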

[0054]

【Effects of the Invention】 As described above, according to the present invention, character image blocks are extracted from a document image and an extracted character image is determined to be ruby, so that ruby and non-ruby information in an input image containing both ruby and main text can be distinguished and processed separately.

【００５５】以上説明したように、本発明によれば、抽
出された文字画像を記憶手段に記憶し、前記ルビである
と判定された文字画像には、ルビであることを識別でき
る識別情報を付加して前記記憶手段に記憶することによ
り、ルビとして記憶された情報を利用してルビとルビ以
外の情報とを区別して処理することができる。As described above, according to the present invention, the extracted character image is stored in the storage means, and the character image determined to be the ruby has the identification information for identifying the ruby. By additionally storing the information in the storage means, it is possible to process the ruby and the information other than the ruby by using the information stored as the ruby.

【００５６】以上説明したように、本発明によれば、前
記抽出された文字画像ブロックを出力することにより、
文字画像毎の処理が可能となる。As described above, according to the present invention, outputting the extracted character image blocks makes it possible to process each character image individually.

【００５７】以上説明したように、本発明によれば、前
記ルビであると判定された文字画像を出力しないように
前記出力を制御することにより、ルビを除いた画像の処
理が容易となる。As described above, according to the present invention, controlling the output so that character images determined to be ruby are not output makes it easy to process the image with the ruby excluded.

【００５８】以上説明したように、本発明によれば、前
記抽出された文字の画像ブロックのうち、前記ルビであ
ると判定された画像ブロック以外の画像ブロックを出力
するよう出力制御することにより、ルビ以外の画像ブロ
ックを処理することが容易となる。As described above, according to the present invention, the output is controlled so that, among the image blocks of the extracted characters, the image blocks other than those determined to be ruby are output, which makes it easy to process the image blocks other than ruby.

【００５９】以上説明したように、本発明によれば、前
記出力された文字画像ブロックを認識することにより、
所望の文字画像の認識を行うことができる。As described above, according to the present invention, recognizing the output character image blocks makes it possible to recognize the desired character images.

【００６０】以上説明したように、本発明によれば、前
記認識した結果を表示手段に表示することにより、認識
結果を表示上で確認することができる。As described above, according to the present invention, the recognition result can be confirmed on the display by displaying the recognized result on the display means.

【００６１】以上説明したように、本発明によれば、前
記認識した結果を印字手段に印字することにより、認識
結果を記録媒体上に出力したものを得ることができる。As described above, according to the present invention, printing the recognized result with the printing means yields the recognition result output on a recording medium.

【００６２】以上説明したように、本発明によれば、前
記記憶手段に記憶された文字画像の内、ルビであること
を示す識別情報が付加されていない文字画像を選択して
認識を行うよう制御することにより、認識処理を効率よ
く行うことができる。As described above, according to the present invention, control is performed so that, among the character images stored in the storage means, the character images to which no identification information indicating ruby is attached are selected and recognized, so that the recognition processing can be performed efficiently.
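
The selection step described above can be sketched as a simple filter over the stored entries. This is only an illustration: the dictionary layout and the `is_ruby` key are assumed here as a stand-in for the identification information.

```python
from typing import Dict, List


def select_for_recognition(stored: List[Dict]) -> List[Dict]:
    """Return only the stored entries that carry no ruby flag."""
    return [entry for entry in stored if not entry.get("is_ruby", False)]


# Hypothetical stored entries; only the flag matters for the selection.
stored = [
    {"id": 0, "is_ruby": False},
    {"id": 1, "is_ruby": True},
    {"id": 2, "is_ruby": False},
]
main_text = select_for_recognition(stored)
```

Because ruby entries never reach the recognizer, no recognition work is spent on characters that will not be output.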

【００６３】以上説明したように、本発明によれば、前
記文字画像抽出手段より抽出した文字画像を認識し、該
認識した結果の内、ルビではないと判断された文字画像
から認識された結果を選択して出力することにより、認
識結果の出力を効率よく行うことができる。As described above, according to the present invention, the character images extracted by the character image extracting means are recognized and, among the recognized results, the results recognized from character images judged not to be ruby are selected and output, so that the recognition results can be output efficiently.

【００６４】以上説明したように、本発明によれば、前
記入力された文書画像を記憶手段に記憶し、前記ルビで
あると判定された文字画像を該記憶手段より削除するこ
とにより、効率よく処理できる画像情報を提供すること
ができる。As described above, according to the present invention, the input document image is stored in the storage means and the character images determined to be ruby are deleted from the storage means, so that image information that can be processed efficiently is provided.

【００６５】以上説明したように、本発明によれば、前
記文字画像の抽出に替えて、前記入力された文書画像か
ら行の画像ブロックを抽出し、前記ルビの判定は前記抽
出された行画像がルビ行であることを判定することによ
り、行抽出の時点でルビを判定するので、その後の処理
を効率よく行うことができる。As described above, according to the present invention, instead of extracting character images, image blocks of lines are extracted from the input document image, and the ruby determination determines that an extracted line image is a ruby line. Because ruby is determined at the time of line extraction, the subsequent processing can be performed efficiently.
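
To illustrate line extraction, the sketch below finds lines with a projection profile, the technique the background section attributes to conventional apparatus. It assumes a binary image given as a list of rows of 0/1 values and horizontal lines of text; for vertical text the same routine would be applied to columns. The function name is hypothetical.

```python
from typing import List, Tuple


def extract_lines(image: List[List[int]]) -> List[Tuple[int, int]]:
    """Return (start, end) row indices of each run of rows containing ink."""
    profile = [sum(row) for row in image]  # projection onto the vertical axis
    lines: List[Tuple[int, int]] = []
    start = None
    for i, count in enumerate(profile):
        if count and start is None:
            start = i                      # a line begins
        elif not count and start is not None:
            lines.append((start, i - 1))   # a line ends at the previous row
            start = None
    if start is not None:                  # line touching the bottom edge
        lines.append((start, len(profile) - 1))
    return lines
```

Each returned pair gives a line's position and width, which are the quantities the ruby determinations in the following paragraphs compare.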

【００６６】以上説明したように、本発明によれば、前
記ルビの判定は、注目行の前後の行が存在するか否かに
より判定し、前後に行がないと判断される場合に該注目
行の画像はルビではないと判定することにより、前後の
行の位置関係によりルビを正しく判定することができ
る。As described above, according to the present invention, the ruby determination is made according to whether lines exist before and after the line of interest, and when it is judged that there is no line before and after it, the image of the line of interest is determined not to be ruby. Ruby can thus be correctly determined from the positional relationship with the preceding and following lines.
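
A minimal sketch of the neighbor check follows. It reads the condition as "no line on either side", which is an interpretation made for this example; the function name and the index-based representation of lines are likewise hypothetical.

```python
def can_be_ruby(index: int, line_count: int) -> bool:
    """Return False when the line at `index` has no neighboring line at all."""
    has_previous = index > 0
    has_next = index < line_count - 1
    # A line with no neighbor has no main text to annotate, so it is
    # treated as not ruby; otherwise it remains a candidate for later tests.
    return has_previous or has_next
```

A result of True only means the line is still a candidate; the width, spacing, or position tests would decide whether it is actually ruby.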

【００６７】以上説明したように、本発明によれば、前
記ルビの判定は、注目行の行幅と前の行の行幅との比較
により判定し、前の行との相対値が基準値より小さい場
合に、該注目行の画像はルビであると判定することによ
り、前の行との行幅によりルビを正しく判定することが
できる。As described above, according to the present invention, the ruby determination is made by comparing the line width of the line of interest with the line width of the preceding line, and when the value relative to the preceding line is smaller than a reference value, the image of the line of interest is determined to be ruby. Ruby can thus be correctly determined from the line width relative to the preceding line.

【００６８】以上説明したように、本発明によれば、前
記ルビの判定は、注目行の行幅と後の行の行幅との比較
により判定し、後の行との相対値が基準値より小さい場
合に、該注目行の画像はルビであると判定することによ
り、後の行の行幅によりルビを正しく判定することがで
きる。As described above, according to the present invention, the ruby determination is made by comparing the line width of the line of interest with the line width of the subsequent line, and when the value relative to the subsequent line is smaller than a reference value, the image of the line of interest is determined to be ruby. Ruby can thus be correctly determined from the line width relative to the subsequent line.
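
The two width comparisons (against the preceding line and against the subsequent line) share the same form, sketched below. The function name and the default reference ratio of 0.6 are assumptions chosen for illustration; this section of the specification gives no numeric reference value.

```python
def is_ruby_by_width(target_width: float, neighbor_width: float,
                     reference_ratio: float = 0.6) -> bool:
    """Judge a line as ruby when it is much narrower than a neighboring line.

    `reference_ratio` is an assumed value, not one taken from the source.
    """
    if neighbor_width <= 0:
        return False  # no usable neighbor width to compare against
    return target_width / neighbor_width < reference_ratio
```

For example, a 10-pixel line next to a 24-pixel line has a relative value of about 0.42, which is below the assumed reference and would be judged ruby.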

【００６９】以上説明したように、本発明によれば、前
記ルビの判定は、注目行と前後の行との行間隔により判
定し、前の行との行間隔が後の行との行間隔より大きい
場合に、該注目行の画像はルビであると判定することに
より、行間隔によりルビを正しく判定することができ
る。As described above, according to the present invention, the ruby determination is made based on the line spacing between the line of interest and the lines before and after it, and when the line spacing to the preceding line is larger than the line spacing to the subsequent line, the image of the line of interest is determined to be ruby. Ruby can thus be correctly determined from the line spacing.
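
The spacing comparison can be sketched as follows. Each line is represented by a hypothetical `(start, end)` extent measured perpendicular to the line direction, with coordinates assumed to increase in reading order from the preceding line to the subsequent line.

```python
from typing import Tuple

Line = Tuple[int, int]  # (start, end) extent of a line, an assumed representation


def is_ruby_by_spacing(previous: Line, target: Line, following: Line) -> bool:
    """Judge ruby when the gap to the previous line exceeds the gap to the next."""
    gap_before = target[0] - previous[1]
    gap_after = following[0] - target[1]
    return gap_before > gap_after
```

The intuition is that ruby sits close to the main-text line it annotates, so it is nearer to the subsequent line than to the preceding one.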

【００７０】以上説明したように、本発明によれば、前
記ルビの判定は、注目行の位置により判定し、注目行が
基準となる範囲に位置していると判定される場合に、該
注目行がルビであると判定することにより、行の位置範
囲によりルビを正しく判定することができる。As described above, according to the present invention, the ruby determination is made based on the position of the line of interest, and when the line of interest is judged to be located within a reference range, the line of interest is determined to be ruby. Ruby can thus be correctly determined from the position range of the line.
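
A position-based check might look like the sketch below. How the reference range is obtained is not stated in this section, so it is passed in as a parameter; requiring the whole line extent to lie inside the range is also an assumption of this example.

```python
def is_ruby_by_position(line_start: int, line_end: int,
                        range_start: int, range_end: int) -> bool:
    """Judge ruby when the whole line lies inside the reference range."""
    return range_start <= line_start and line_end <= range_end
```

A caller would derive the range from the layout, for instance from the band beside a main-text line where ruby is expected to appear.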

[Brief description of drawings]

【図１】第１の実施例に係る画像処理装置のブロック
図。FIG. 1 is a block diagram of an image processing apparatus according to a first embodiment.

【図２】第１の実施例に係る画像処理装置が行う文字認
識処理のフローチャート。FIG. 2 is a flowchart of character recognition processing performed by the image processing apparatus according to the first embodiment.

【図３】第１の実施例に係るルビの判定方法のフローチ
ャート。FIG. 3 is a flowchart of a ruby determination method according to the first embodiment.

【図４】第２の実施例に係る画像処理装置が行う文字認
識処理のフローチャート。FIG. 4 is a flowchart of character recognition processing performed by the image processing apparatus according to the second embodiment.

【図５】第３の実施例に係る画像処理装置が行う文字認
識処理のフローチャート。FIG. 5 is a flowchart of character recognition processing performed by the image processing apparatus according to the third embodiment.

【図６】第４の実施例に係る画像処理装置が行う文字認
識処理のフローチャート。FIG. 6 is a flowchart of character recognition processing performed by the image processing apparatus according to the fourth embodiment.

【図７】第５の実施例に係る画像処理装置が行う文字認
識処理のフローチャート。FIG. 7 is a flowchart of character recognition processing performed by the image processing apparatus according to the fifth embodiment.

【図８】従来例における文字画像ブロック抽出を説明す
る図。FIG. 8 is a diagram illustrating extraction of a character image block in a conventional example.

Claims

1. An image processing method, comprising: extracting an image block of a character from an input document image, and determining that the extracted character image is ruby.

2. The image processing method according to claim 1, wherein the extracted character image is stored in a storage unit, and a character image determined to be ruby is stored in the storage unit with identification information attached that identifies it as ruby.

3. The image processing method according to claim 1, wherein the extracted character image block is output.

4. The image processing method according to claim 3, wherein the output is controlled so that the character image determined to be the ruby is not output.

5. The image processing method according to claim 3, wherein the output is controlled so as to output, among the image blocks of the extracted characters, the image blocks other than those determined to be ruby.

6. The image processing method according to claim 3, wherein the output character image block is recognized.

7. The image processing method according to claim 6, wherein the recognized result is displayed on a display unit.

8. The image processing method according to claim 6, wherein the recognized result is printed on a printing unit.

9. The character image stored in the storage means is controlled so as to select and recognize a character image to which no identification information indicating that it is ruby is added. The image processing method described in.

10. The image processing method according to claim 1, wherein the character images extracted by the character image extracting means are recognized, and output is controlled so that, among the recognized results, the results recognized from character images judged not to be ruby are selected and output.

11. The image processing method according to claim 1, wherein the input document image is stored in a storage unit, and the character image determined to be the ruby is deleted from the storage unit.

12. The image processing method according to claim 1, wherein, instead of extracting the character image, an image block of a line is extracted from the input document image, and the ruby determination determines that the extracted line image is a ruby line.

13. The image processing method according to claim 1, wherein the ruby determination is made according to whether lines exist before and after a line of interest, and when it is judged that there is no line before and after it, the image of the line of interest is determined not to be ruby.

14. The image processing method according to claim 1, wherein the ruby determination is made by comparing the line width of a line of interest with the line width of the preceding line, and when the value relative to the preceding line is smaller than a reference value, the image of the line of interest is determined to be ruby.

15. The image processing method according to claim 1, wherein the ruby determination is made by comparing the line width of a line of interest with the line width of the subsequent line, and when the value relative to the subsequent line is smaller than a reference value, the image of the line of interest is determined to be ruby.

16. The image processing method according to claim 1, wherein the ruby determination is made based on the line spacing between a line of interest and the lines before and after it, and when the line spacing to the preceding line is larger than the line spacing to the subsequent line, the image of the line of interest is determined to be ruby.

17. The image processing method according to claim 1, wherein the ruby determination is made based on the position of a line of interest, and when the line of interest is judged to be located within a reference range, the line of interest is determined to be ruby.

18. An image processing apparatus, comprising: document image input means for inputting a document image; character image extracting means for extracting an image block of a character from the document image input by the document image input means; and ruby determining means for determining that the character image extracted by the character image extracting means is ruby.

19. The image processing apparatus according to claim 1, further comprising storage means for storing the character image extracted by the character image extraction means, wherein the character image determined to be ruby by the ruby determination means is stored in the storage means with identification information attached that identifies it as ruby.

20. The image processing apparatus according to claim 18, further comprising output means for outputting the character image block extracted by the character image extraction means.

21. The image processing apparatus according to claim 20, further comprising an output control unit that controls the output unit so as not to output the character image determined to be ruby by the ruby determination unit.

22. The image processing apparatus according to claim 20, further comprising output control means for controlling the output means so as to output, among the image blocks of the characters extracted by the character image extraction means, the image blocks other than those determined to be ruby by the ruby determination means.

23. The image processing apparatus according to claim 20, further comprising a recognition unit that recognizes the character image block output by the output unit.

24. The image processing apparatus according to claim 23, further comprising display means for displaying a result recognized by the recognition means.

25. The image processing apparatus according to claim 23, further comprising a printing unit that prints a result recognized by the recognition unit.

26. The image processing apparatus according to claim 20, further comprising recognition control means for performing control so as to select and recognize, from among the character images stored in said storage means, a character image to which no identification information indicating ruby is added.

27. The image processing apparatus according to claim 18, further comprising control means for recognizing the character images extracted by the character image extracting means and performing control so as to select and output, among the recognized results, the results recognized from character images judged not to be ruby.

28. The image processing apparatus described in (6), further comprising storage means for storing the document image input by the document image input means, wherein the character image determined to be ruby by the ruby determination means is deleted from the storage means.

29. The image processing apparatus according to claim 18, wherein line image extracting means for extracting an image block of a line from the input document image is provided in place of the character image extracting means, and the ruby determining means determines that the line image extracted by the line image extracting means is a ruby line.

30. The image processing apparatus according to claim 18, wherein the ruby determining means determines ruby according to whether lines exist before and after a line of interest, and when it is judged that there is no line before and after it, determines that the image of the line of interest is not ruby.

31. The image processing apparatus according to claim 18, wherein the ruby determining means determines ruby by comparing the line width of a line of interest with the line width of the preceding line, and when the value relative to the preceding line is smaller than a reference value, determines that the image of the line of interest is ruby.

32. The image processing apparatus according to claim 18, wherein the ruby determining means determines ruby by comparing the line width of a line of interest with the line width of the subsequent line, and when the value relative to the subsequent line is smaller than a reference value, determines that the image of the line of interest is ruby.

33. The image processing apparatus according to claim 18, wherein the ruby determining means determines ruby based on the line spacing between a line of interest and the lines before and after it, and when the line spacing to the preceding line is larger than the line spacing to the subsequent line, determines that the image of the line of interest is ruby.

34. The image processing apparatus according to claim 18, wherein the ruby determining means determines ruby based on the position of a line of interest, and when the line of interest is judged to be located within a reference range, determines that the line of interest is ruby.