JPH064704A - Ruled line discriminating method and area discriminating method - Google Patents

Ruled line discriminating method and area discriminating method

Info

Publication number
JPH064704A
JPH064704A JP4160866A JP16086692A JPH064704A JP H064704 A JPH064704 A JP H064704A JP 4160866 A JP4160866 A JP 4160866A JP 16086692 A JP16086692 A JP 16086692A JP H064704 A JPH064704 A JP H064704A
Authority
JP
Japan
Prior art keywords
rectangle
ruled line
horizontal
vertical
extracted
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP4160866A
Other languages
Japanese (ja)
Other versions
JP3215163B2 (en
Inventor
Michiyoshi Tachikawa
道義 立川
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ricoh Co Ltd
Original Assignee
Ricoh Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ricoh Co Ltd filed Critical Ricoh Co Ltd
Priority to JP16086692A priority Critical patent/JP3215163B2/en
Publication of JPH064704A publication Critical patent/JPH064704A/en
Application granted granted Critical
Publication of JP3215163B2 publication Critical patent/JP3215163B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Abstract

PURPOSE:To exactly discriminate and extract a horizontal or vertical ruled line by going through stages of selection of ruled line rectangle candidates, extraction of a ruled line rectangle from ruled line rectangle candidates, and condition discrimination of the ruled line rectangle. CONSTITUTION:Circumscribed rectangles of black linking components are extracted from an inputted document picture by a rectangle extracting part 102. A size W in the horizontal direction and a size H in the vertical direction of each of these extracted rectangles are compared with respective thresholds RLHTB and RLVTH by a rectangle classifying part 106 to discriminate the rectangles satisfying conditions W>RLHTH and H>RLVTH as horizontal ruled line rectangle candidates. Pictures in ranges of these discriminated rectangles are scanned in the horizontal direction to extract black runs longer than a threshold by a horizontal ruled line extracting part 108, and rectangles circumscribed to linking components of these black runs are extracted as ruled line rectangles. The rectangles satisfying a prescribed condition out of these rectangles is finally discriminated as a horizontal ruled line by a horizontal ruled line approving part 112.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、文書画像中の罫線や表
領域等の抽出に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to extraction of ruled lines and table areas in a document image.

【0002】[0002]

【従来の技術】図表と文字等が混在する一般文書画像の
データを記録メディアにファイリングしたりG4ファク
シミリで伝送するような場合、文書画像中の水平罫線、
垂直罫線、罫線による囲み枠領域、罫線による表領域等
を自動的に識別できると都合がよいが、このような目的
に最適な技術は未だ確立されていない。
2. Description of the Related Art In the case of filing general document image data in which charts and characters are mixed in a recording medium or transmitting by G4 facsimile, horizontal ruled lines in the document image,
It is convenient to be able to automatically identify vertical ruled lines, a frame area surrounded by ruled lines, a table region defined by ruled lines, etc. However, the optimum technique for such a purpose has not been established yet.

【0003】なお、これまで知られている関連技術とし
て、原稿をスキャンし2値化した画像から黒連結の矩形
を抽出し、その大きさを閾値と比較することによって、
文字の矩形と線図形の矩形を判別する画像抽出方式(特
開昭55−162177号)、黒ランを矩形統合し、一
定以上の大きさの矩形の内部の一定以上の長さのランを
統合して水平罫線を抽出し、水平罫線の個数より、当該
矩形が表領域であるか否かを判別する表領域識別方法
(特願平1−143456号)がある。
As a related technique known so far, a black connected rectangle is extracted from a binarized image by scanning an original and comparing its size with a threshold value.
An image extraction method for discriminating a character rectangle from a line figure rectangle (Japanese Patent Laid-Open No. 55-162177), a black run is integrated into a rectangle, and a run of a predetermined length or more inside a rectangle of a predetermined size or larger is integrated. Then, there is a table area identification method (Japanese Patent Application No. 1-143456) for extracting horizontal ruled lines and determining whether or not the rectangle is a table area based on the number of horizontal ruled lines.

【0004】[0004]

【発明が解決しようとする課題】前記画像抽出方式は、
線図形に関し水平罫線、垂直罫線、表、囲み枠等に詳細
に識別するものではない。また、前記表領域識別方法も
水平罫線だけを抽出し、その個数によって表領域の判定
を行なうため、表領域の識別精度が不十分な場合があ
り、また内部に罫線のない囲み枠等の識別ができない。
The image extraction method is
The line figure is not specifically identified as a horizontal ruled line, a vertical ruled line, a table, a surrounding frame, or the like. In the table area identification method, only the horizontal ruled lines are extracted, and the table areas are determined based on the number of horizontal ruled lines. Therefore, the table area identification accuracy may be insufficient. I can't.

【0005】よって本発明の目的は、水平罫線、垂直罫
線、表領域、囲み枠領域、その他領域のより高精度な識
別・抽出のための技術を提供することにある。
Therefore, an object of the present invention is to provide a technique for identifying and extracting a horizontal ruled line, a vertical ruled line, a table area, a surrounding frame area, and other areas with higher accuracy.

【0006】[0006]

【課題を解決するための手段】請求項1または2の発明
方法によれば、文書画像またはその縮小画像から抽出し
た黒連結成分の外接矩形の中から、水平方向及び垂直方
向の大きさに基づいて水平罫線矩形候補または垂直罫線
矩形候補を選ぶ。水平罫線の識別の場合には水平方向
に、垂直罫線の識別の場合に垂直方向に、それぞれの矩
形候補の範囲、内の文書画像またはその縮小画像をスキ
ャンすることにより、ある閾値以上の長さの黒ランから
なる水平罫線矩形または垂直罫線矩形を抽出する。そし
て、各方向罫線矩形候補と、それより抽出された各方向
罫線矩形の水平方向及び垂直方向の大きさの関係に基づ
き(請求項1の発明)、あるいは水平方向及び垂直方向
の位置関係に基づき(請求項2の発明)、該罫線矩形が
水平または垂直罫線であるか否かを判定する。
According to the method of the first or second aspect of the present invention, based on the horizontal and vertical sizes of the circumscribed rectangle of the black connected component extracted from the document image or its reduced image. To select horizontal ruled line rectangle candidates or vertical ruled line rectangle candidates. By scanning the document image or its reduced image within the range of each rectangle candidate in the horizontal direction for horizontal ruled line identification and in the vertical direction for vertical ruled line identification, a length equal to or greater than a certain threshold value is scanned. Extract a horizontal ruled line rectangle or a vertical ruled line rectangle consisting of black runs. Then, based on the relationship between each direction ruled line rectangle candidate and the size of each direction ruled line rectangle extracted in the horizontal direction and the vertical direction (the invention of claim 1) or based on the positional relationship between the horizontal direction and the vertical direction. (Invention of Claim 2) It is determined whether the ruled line rectangle is a horizontal or vertical ruled line.

【0007】請求項3または4の発明によれば、文書画
像またはその縮小画像から抽出した黒連結成分の外接矩
形の中から、水平方向及び垂直方向の大きさに基づき表
領域矩形候補を選ぶ。各表領域矩形候補の範囲につい
て、文書画像またはその縮小画像を水平方向にスキャン
することによって、ある閾値以上の長さの黒ランからな
る水平罫線矩形を抽出し、該水平罫線矩形に対して水平
罫線の条件判定を行なうことによって水平罫線を抽出
し、また文書画像またはその縮小画像を垂直方向にスキ
ャンし、ある閾値以上の長さの黒ランからなる垂直罫線
を抽出し、該垂直罫線矩形に対して垂直罫線の条件判定
を行なうことによって垂直罫線を抽出する。そして、請
求項3の発明にあっては、表領域矩形候補から抽出され
た水平罫線の本数及び垂直罫線の本数を少なくとも含む
判定条件により、表領域矩形候補を表領域であるか否か
を判定する。請求項4の発明にあっては、判定条件に表
領域矩形候補の特定範囲における水平罫線または垂直罫
線の有無も含み、該判定条件によって該表領域矩形候補
を表領域、囲み枠領域、またはその他領域のいずれであ
るかを判定する。
According to the third or fourth aspect of the invention, the table area rectangle candidate is selected from the circumscribed rectangles of the black connected components extracted from the document image or the reduced image thereof based on the horizontal and vertical sizes. By scanning the document image or its reduced image in the horizontal direction for each table area rectangle candidate range, a horizontal ruled line rectangle consisting of black runs having a length equal to or longer than a certain threshold is extracted, and a horizontal ruled line rectangle is horizontally extracted. Horizontal ruled lines are extracted by judging the ruled line conditions, and the document image or its reduced image is scanned in the vertical direction, and vertical ruled lines consisting of black runs having a length equal to or greater than a certain threshold are extracted, and the vertical ruled line rectangle is formed. On the other hand, the vertical ruled line is extracted by performing the condition determination of the vertical ruled line. Then, according to the invention of claim 3, it is determined whether or not the table area rectangular candidate is a table area based on a determination condition that includes at least the number of horizontal ruled lines and the number of vertical ruled lines extracted from the table area rectangular candidate. To do. According to the invention of claim 4, the judgment condition includes the presence or absence of a horizontal ruled line or a vertical ruled line in a specific range of the table area rectangular candidate, and the table area rectangular candidate is a table area, a surrounding frame area, or other depending on the judgment condition. It is determined which of the areas.

【0008】[0008]

【作用】請求項1または2の発明の方法は、罫線矩形候
補の選択、罫線矩形候補内の罫線矩形の抽出、罫線矩形
の条件判定という段階を経ることによって、水平または
垂直罫線の正確な識別、抽出が可能である。
According to the method of the present invention, the horizontal and vertical ruled lines are accurately identified by the steps of selecting ruled line rectangle candidates, extracting ruled line rectangles from the ruled line rectangle candidates, and judging the ruled line rectangle conditions. , Can be extracted.

【0009】請求項3または4の発明の領域識別方法
は、表領域矩形候補の選択、表領域矩形候補内の水平罫
線と垂直罫線の抽出、表領域矩形候補内の水平罫線及び
垂直罫線の本数を含む条件の判定(請求項3の発明)、
または表領域矩形候補内の水平及び垂直罫線の本数に加
え表領域矩形候補の特定範囲における水平または垂直罫
線の有無を含む条件の判定(請求項4の発明)、という
段階を経ることによって、表領域の正確な識別が可能で
あり、また請求項4の発明によれば表領域以外の囲み枠
領域とその他領域の識別も可能である。
The area identification method according to the invention of claim 3 or 4 is to select a table area rectangle candidate, extract horizontal and vertical ruled lines from the table area rectangle candidate, and determine the number of horizontal and vertical ruled lines in the table area rectangle candidate. Determination of a condition including (invention of claim 3),
Alternatively, by performing a step of judging the condition including the presence or absence of horizontal or vertical ruled lines in the specific range of the table area rectangle candidate in addition to the number of horizontal and vertical ruled lines in the table area rectangle candidate (the invention of claim 4), The area can be accurately identified, and according to the invention of claim 4, the surrounding frame area other than the table area and the other area can be identified.

【0010】[0010]

【実施例】実施例1 本実施例は、図1に示すように、スキャナ等によって入
力された文書画像データを記憶する文書画像メモリ10
0、このメモリ100内の文書画像データの黒連結成分
の外接矩形を抽出する矩形抽出部102、これにより抽
出された矩形の情報を記録するための矩形メモリ10
4、この矩形メモリ内の矩形情報を基に、抽出された矩
形の幅、高さを予め与えられた閾値と比較することによ
り矩形の種類分けを行なう矩形分類部106、この矩形
分類部106によって水平セパレータ(水平罫線)と判
定された矩形の情報(矩形メモリ104に格納されてい
る)を参照し、この矩形の画像(文書画像メモリ100
に格納されている)に対し水平方向スキャンを行なって
水平方向の黒ランを抽出し、予め与えられた閾値以上の
長さの黒ランから矩形を生成する水平罫線抽出部10
8、これによって抽出された矩形の情報を記憶する水平
罫線矩形メモリ110、このメモリ110に格納された
矩形の情報を基に、その矩形が水平セパレータ(水平罫
線)かどうかの最終判定を行なう水平罫線検定部11
2、各処理部102,106,108,112の制御を
行なう制御部114からなる機能構成を有する。
EXAMPLE 1 In this example, as shown in FIG. 1, a document image memory 10 for storing document image data input by a scanner or the like.
0, a rectangle extraction unit 102 for extracting a circumscribed rectangle of a black connected component of the document image data in the memory 100, and a rectangle memory 10 for recording information of the rectangle extracted by the rectangle extraction unit 102.
4. Based on the rectangular information in the rectangular memory, the rectangular classifying unit 106 that classifies the rectangular type by comparing the width and height of the extracted rectangular with a predetermined threshold value. Information on a rectangle determined to be a horizontal separator (horizontal ruled line) (stored in the rectangle memory 104) is referred to, and an image of this rectangle (document image memory 100
The horizontal ruled line extraction unit 10 generates a rectangle from a black run having a length equal to or larger than a threshold value given in advance by performing a horizontal scan on
8. A horizontal ruled line rectangle memory 110 for storing the information of the rectangles extracted thereby, and a horizontal determination for determining whether or not the rectangle is a horizontal separator (horizontal ruled line) based on the information of the rectangles stored in the memory 110. Ruled line inspection section 11
2. It has a functional configuration including a control unit 114 that controls each processing unit 102, 106, 108, 112.

【0011】なお、矩形抽出部102の前段に画像縮小
部を設け、これによって入力文書画像を縮小した画像を
対象として矩形抽出以下の各処理を行なってもよい。ま
た、この画像縮小部を含め各処理部は、ハードウエアと
して実現されるか、あるいはコンピュータシステム上で
ソフトウエアによって実現される。
An image reducing unit may be provided in front of the rectangle extracting unit 102 to perform the following processes of rectangle extraction on an image obtained by reducing the input document image. Further, each processing unit including the image reducing unit is realized as hardware or software on a computer system.

【0012】本実施例における水平パラメータ(水平罫
線)判定処理のフローは図2のように示される。以下、
この処理の詳細について説明するが、図3は説明中で適
宜参照される処理説明図である。
The flow of the horizontal parameter (horizontal ruled line) determination processing in this embodiment is shown in FIG. Less than,
The details of this process will be described, but FIG. 3 is a process explanatory diagram that is appropriately referred to in the description.

【0013】まず、入力された文書画像に対し、矩形抽
出部102において黒連結成分の外接矩形の抽出を行な
う(ステップ200)。この処理は、例えば画像をスキ
ャンしながら接続した黒ランの外接矩形を抽出し、これ
を一定距離内にあるものについて統合する操作を繰り返
すことによって行なうことができる。
First, the rectangle extraction unit 102 extracts a circumscribed rectangle of a black connected component from the input document image (step 200). This process can be performed by, for example, repeating an operation of extracting a circumscribed rectangle of a connected black run while scanning an image and integrating the extracted rectangles within a certain distance.

【0014】この抽出された矩形に対し、矩形分類部1
06において、矩形の水平方向の大きさW,垂直方向の
大きさHと、それぞれの閾値RLHTH,RLVTHと
の比較判定を行ない、 W>RLHTH かつ H>RLVTH の条件を満たす矩形を水平罫線矩形候補たる矩形(1)
と判定する(ステップ210,215)。
For the extracted rectangle, the rectangle classification unit 1
In 06, the size W in the horizontal direction and the size H in the vertical direction are compared with the respective threshold values RLHTH, RLVTH, and the judgment is made. Barrel rectangle (1)
(Steps 210 and 215).

【0015】図3(a)及び(b)において、300は
矩形(1)の例を示し、その内部の網掛け部は文字や罫
線等の黒連結成分である。なお、矩形抽出部102にお
いては、このような文字や罫線等の黒連結成分の統合を
行なうことによって矩形を抽出するが、かかる処理つい
ては公知であるので詳細説明は省略する。
In FIGS. 3 (a) and 3 (b), reference numeral 300 indicates an example of a rectangle (1), and the shaded portion inside thereof is a black connected component such as characters and ruled lines. The rectangle extracting unit 102 extracts a rectangle by integrating the black connected components such as characters and ruled lines, but since such a process is known, detailed description thereof will be omitted.

【0016】なお、本実施例では閾値RLHTH,RL
VTHを固定しているが、適応的に自動設定するように
してもよい。一例を挙げれば、矩形抽出処理200の際
に文字とみなしえる黒連結成分の垂直方向の大きさの
(横書き文書の場合)のヒストグラムを作成し、このヒ
ストグラムに基づいて標準文字サイズを決定し、この標
準文字サイズに適当な係数を掛けることによって閾値R
LHTH,RLVTHを算出する。ただし、これはあく
まで一例に過ぎない。
In this embodiment, the threshold values RLHTH and RL are set.
Although VTH is fixed, it may be adaptively and automatically set. As an example, a histogram of the vertical size (in the case of a horizontal writing document) of the black connected component that can be regarded as a character in the rectangle extraction processing 200 is created, and the standard character size is determined based on this histogram. A threshold R is obtained by multiplying this standard character size by an appropriate coefficient.
Calculate LHTH and RLVTH. However, this is just an example.

【0017】次に水平罫線抽出部108において、矩形
(1)の範囲の画像を水平方向にスキャンすることによ
り、閾値RUNHTH以上の長さの黒ランを抽出し、こ
の長い黒ランのみの連結成分に外接する矩形(矩形
(2))を罫線矩形として抽出する(ステップ21
5)。図3(b)において、302は矩形(1)300
から抽出された長い黒ランのみから生成される矩形
(2)であり、H1とW1は垂直方向と水平方向の大き
さである。なお、閾値RUNHTHは、固定値として予
め与えられるが、上記RLHTH,RLVTHと同様に
適応的に自動設定するようにしてもよい。
Next, the horizontal ruled line extraction unit 108 horizontally scans the image in the range of the rectangle (1) to extract a black run having a length equal to or longer than the threshold value RUNHTH, and a connected component of only this long black run. A rectangle (rectangle (2)) circumscribing to is extracted as a ruled line rectangle (step 21).
5). In FIG. 3B, 302 is a rectangle (1) 300.
It is a rectangle (2) generated only from the long black run extracted from, and H1 and W1 are the sizes in the vertical direction and the horizontal direction. The threshold value RUNHTH is given in advance as a fixed value, but may be adaptively and automatically set similarly to the above RLHTH and RLVTH.

【0018】このような矩形(2)に対し、水平罫線検
定部112において、 W1/W>閾値(例えば0.8) かつ H1>H>閾
値(例えば0.8) の条件判定を行ない(ステップ220,225)、この
条件を満たすときに矩形(2)を最終的に水平罫線であ
ると判定する(ステップ230)。図3(b)に示した
矩形(2)302は水平罫線と最終判定される。
With respect to such a rectangle (2), the horizontal ruled line verification unit 112 makes a condition determination of W1 / W> threshold value (for example 0.8) and H1>H> threshold value (for example 0.8) (step). 220, 225), the rectangle (2) is finally determined to be a horizontal ruled line when this condition is satisfied (step 230). The rectangle (2) 302 shown in FIG. 3B is finally determined as a horizontal ruled line.

【0019】制御部114は、矩形抽出部200により
抽出された矩形に対する処理の終了判定(ステップ23
5)を行ない、未処理の矩形が残っている場合は、ステ
ップ205以下の処理を再開させる。
The control unit 114 determines whether to end the process for the rectangle extracted by the rectangle extraction unit 200 (step 23).
5) is performed, and if there are unprocessed rectangles, the processing from step 205 onward is restarted.

【0020】実施例2 本実施例は、実施例1と同様に図1に示す機能的構成を
有する。処理内容も、水平罫線検定部112の処理を除
いて実施例1と同様である。
Embodiment 2 This embodiment has the functional configuration shown in FIG. 1 as in Embodiment 1. The processing content is the same as that of the first embodiment except the processing of the horizontal ruled line verification unit 112.

【0021】すなわち、実施例1においては、水平罫線
検定部112は、矩形(1)と矩形(2)の水平方向及
び垂直方向の大きさの比によって水平罫線の検定を行な
った(図2のステップ220〜230)。これに対し本
実施例においては、図3(c)に示すように、矩形
(1)300と矩形(2)302の始終点の水平方向の
位置の差ΔW1,ΔW2、垂直方向の位置の差ΔH1,
ΔH2が、 ΔW1<閾値 かつ ΔW2<閾値 かつ Δh1<閾
値 かつΔH2<閾値 のときに、矩形(2)302を最終的に水平罫線と判定
する。
That is, in the first embodiment, the horizontal ruled line inspection unit 112 performs the horizontal ruled line inspection based on the ratio of the sizes of the rectangle (1) and the rectangle (2) in the horizontal and vertical directions (see FIG. 2). Steps 220-230). On the other hand, in the present embodiment, as shown in FIG. 3C, the horizontal position differences ΔW1 and ΔW2 between the start and end points of the rectangle (1) 300 and the rectangle (2) 302, and the vertical position difference. ΔH1,
When ΔH2 is ΔW1 <threshold, ΔW2 <threshold, Δh1 <threshold and ΔH2 <threshold, the rectangle (2) 302 is finally determined as a horizontal ruled line.

【0022】実施例3 本実施例は図4に示すような機能的構成を有する。本実
施例は垂直罫線を抽出する関係から、実施例1と次の点
が相違する。
Embodiment 3 This embodiment has a functional structure as shown in FIG. This embodiment is different from the first embodiment in the following points due to the relationship of extracting vertical ruled lines.

【0023】矩形分類部106において、矩形の水平方
向の大きさをW、垂直方向の大きさ)Hとし、W<RL
HTHかつH>RLVTHの条件が満たされる場合に、
その矩形を垂直罫線矩形候補たる矩形(1)に分類す
る。なお、垂直罫線を対象としているので、水平罫線を
対象とした実施例1,2とは閾値RLHTH,RLVT
Hの大小関係が逆になる。
In the rectangle classifying unit 106, the size of the rectangle in the horizontal direction is W and the size in the vertical direction is H, and W <RL
When the condition of HTH and H> RLVTH is satisfied,
The rectangle is classified into a rectangle (1) which is a vertical ruled line rectangle candidate. Since the vertical ruled lines are targeted, the thresholds RLHTH and RLVT are different from those of the first and second embodiments targeted for the horizontal ruled lines.
The magnitude relationship of H is reversed.

【0024】垂直罫線抽出部408において、矩形
(1)の範囲の文書画像を垂直方向にスキャンして黒ラ
ンを抽出し、予め与えられた閾値RUNHTH以上の長
さの黒ランのみの連結成分からなる矩形(2)を抽出
し、その情報を垂直罫線矩形メモリ410に格納する。
In the vertical ruled line extraction unit 408, the document image in the range of rectangle (1) is vertically scanned to extract black runs, and from the connected components of only black runs having a length equal to or greater than a predetermined threshold RUNHTH. Rectangle (2) is extracted and the information is stored in the vertical ruled line rectangle memory 410.

【0025】垂直罫線検定部412は、矩形(2)の水
平方向の大きさW1,垂直方向の大きさH1と矩形
(1)の水平方向の大きさW,垂直方向の大きさHとの
間に、 W1/W>閾値(例えば0.8) かつ H1/H>閾
値(例えば0.8) の条件が成立する場合に、矩形(2)を垂直罫線である
と最終判定する。
The vertical ruled line verification unit 412 is arranged between the horizontal size W1 and the vertical size H1 of the rectangle (2) and the horizontal size W and the vertical size H of the rectangle (1). Then, when the conditions of W1 / W> threshold value (for example 0.8) and H1 / H> threshold value (for example 0.8) are satisfied, the rectangle (2) is finally determined as a vertical ruled line.

【0026】なお、本実施例においても、入力文書画像
を縮小した画像を処理対象として垂直罫線抽出を行なっ
てもよい。
Also in this embodiment, the vertical ruled line extraction may be performed with the image obtained by reducing the input document image as the processing target.

【0027】実施例4 本実施例は、実施例3と同様に図4に示す機能的構成を
有する。処理内容も、垂直罫線検定部412の処理を除
いて実施例3と同様である。
Fourth Embodiment This embodiment has the functional configuration shown in FIG. 4 similarly to the third embodiment. The processing content is the same as that of the third embodiment except for the processing of the vertical ruled line inspection unit 412.

【0028】すなわち、本実施例においては、垂直罫線
検定部412は、矩形(1)と矩形(2)の始終点の水
平方向の位置の差ΔW1,ΔW2、垂直方向の位置の差
ΔH1,ΔH2が、 ΔW1<閾値 かつ ΔW2<閾値 かつ Δh1<閾
値 かつΔH2<閾値 のときに、矩形(2)を最終的に垂直罫線と判定する。
That is, in the present embodiment, the vertical ruled line inspection unit 412 has the horizontal position differences ΔW1 and ΔW2 between the start and end points of the rectangle (1) and the rectangle (2) and the vertical position differences ΔH1 and ΔH2. However, when ΔW1 <threshold, ΔW2 <threshold, Δh1 <threshold and ΔH2 <threshold, the rectangle (2) is finally determined as a vertical ruled line.

【0029】なお、本実施例においても、入力文書画像
の縮小画像を対象として処理を行なうこともできる。
In the present embodiment as well, the processing can be performed on the reduced image of the input document image.

【0030】実施例5 図5は本実施例の機能ブロック図である。ただし、図1
または図4の同等部分は同符号により示されている。5
06は矩形分類部であり、これは図1または図4の矩形
分類部106と分類条件が異なる。514は追加された
領域判定部である。図6は本実施例の処理フローチャー
ト、図7は処理説明用の図である。以下、処理内容を説
明する。
Embodiment 5 FIG. 5 is a functional block diagram of this embodiment. However,
Alternatively, the equivalent parts in FIG. 4 are designated by the same reference numerals. 5
Reference numeral 06 denotes a rectangular classification unit, which has different classification conditions from the rectangular classification unit 106 shown in FIG. 1 or 4. Reference numeral 514 is an added area determination unit. FIG. 6 is a process flowchart of this embodiment, and FIG. 7 is a diagram for explaining the process. The processing contents will be described below.

【0031】矩形抽出部102において、文書画像より
黒連結成分の外接矩形を抽出し、矩形の情報を矩形メモ
リ104に格納する(ステップ600)。なお、入力文
書画像の縮小処理を行ない、縮小処理について矩形抽出
以下の処理を実行することも可能である。
The rectangle extraction unit 102 extracts the circumscribed rectangle of the black connected component from the document image, and stores the rectangle information in the rectangle memory 104 (step 600). It is also possible to perform the reduction processing of the input document image and execute the processing after the rectangle extraction for the reduction processing.

【0032】矩形分類部506において、矩形の水平方
向の大きさW、垂直方向の大きさHと、水平方向の閾値
LARGEHTH、垂直方向の閾値LARGEVTHと
の比較判定を行ない、 W>LARGEHTH かつ H>LARGEVTH の条件を満たす矩形を表領域矩形候補たる矩形(1)と
判定する(ステップ605,610)。
In the rectangle classification unit 506, the horizontal size W and the vertical size H of the rectangle are compared with the horizontal threshold LARGEHTH and the vertical threshold LARGEVTH, and W> LARGEH and H>. A rectangle satisfying the condition of LARGEVTH is determined to be a rectangle (1) which is a table area rectangle candidate (steps 605 and 610).

【0033】なお、ここでは水平方向の罫線と垂直方向
の罫線からなる、ある大きさ以上の表領域を識別するこ
とを目的としているので、閾値LARGEHTH,LA
RGEVTHは、そのような識別対象の表領域の最小サ
イズを考慮し予め決定される。ただし、これらの閾値
を、例えば矩形の高さのヒストグラム等に基づいて適応
的に自動決定するようにしてもよい。
Since the purpose here is to identify a table area consisting of horizontal ruled lines and vertical ruled lines and having a certain size or more, the threshold values LARGEHTH and LA are set.
RGEVTH is predetermined in consideration of the minimum size of the table area to be identified. However, these thresholds may be adaptively automatically determined based on, for example, a histogram of the height of a rectangle.

【0034】次に、水平罫線抽出部108と水平罫線検
定部112により矩形(1)から水平罫線を抽出する
(ステップ615)。実施例1と同様に、水平罫線抽出
部108において表領域矩形候補たる矩形(1)の範囲
内の画像を水平スキャンし、所定値以上の長さの黒ラン
のみから生成される水平罫線矩形たる矩形(2)を抽出
する。水平罫線検定部112において、矩形(2)の水
平方向の大きさW1、垂直方向の大きさH1、矩形
(1)の水平方向の大きさWに関して、 W1/W>閾値(例えば0.8) かつ H1<RLH
eightTH の条件を満たすときに、矩形(2)を水平罫線として抽
出する。なお、閾値RLHeightは固定値として
も、適応的に決定される可変値としてもよい。
Next, the horizontal ruled line extraction unit 108 and the horizontal ruled line verification unit 112 extract horizontal ruled lines from the rectangle (1) (step 615). Similar to the first embodiment, the horizontal ruled line extraction unit 108 horizontally scans an image within the rectangle (1), which is a candidate for a table area rectangle, and is a horizontal ruled line rectangle generated from only black runs having a length equal to or greater than a predetermined value. Rectangle (2) is extracted. In the horizontal ruled line verification unit 112, regarding the horizontal size W1 of the rectangle (2), the vertical size H1, and the horizontal size W of the rectangle (1), W1 / W> threshold value (for example, 0.8) And H1 <RLH
When the condition eightTH is satisfied, the rectangle (2) is extracted as a horizontal ruled line. The threshold value RLHeight may be a fixed value or a variable value adaptively determined.

【0035】また、垂直罫線抽出部408と垂直罫線検
定部412により表領域矩形候補たる矩形(1)から垂
直罫線を抽出する(ステップ620)。垂直罫線抽出部
408により、実施例3と同様に、矩形(1)の範囲の
画像を垂直スキャンし、所定値以上の長さの黒ランのみ
から生成される垂直罫線矩形たる矩形(2)を抽出し、
垂直罫線検定部412により、矩形(2)の水平方向の
大きさW1、垂直方向の大きさH1、矩形(1)の垂直
方向の大きさHに関し、 H1/H>閾値(例えば0.8) かつ W1<RLW
idthTH の条件を満たすときに、矩形(2)を垂直罫線として抽
出する。なお、閾値RLwidthTHは固定値として
も、あるいは適応的に決定される可変値としてもよい。
Further, the vertical ruled line extracting section 408 and the vertical ruled line checking section 412 extract vertical ruled lines from the rectangle (1) which is a candidate for the table area rectangle (step 620). Similarly to the third embodiment, the vertical ruled line extraction unit 408 vertically scans the image in the range of the rectangle (1) to obtain a rectangle (2) which is a vertical ruled line rectangle generated from only black runs having a length equal to or larger than a predetermined value. Extract and
With respect to the horizontal size W1 of the rectangle (2), the vertical size H1, and the vertical size H of the rectangle (1), the vertical ruled line verification unit 412 sets H1 / H> threshold (for example, 0.8). And W1 <RLW
When the condition of idthTH is satisfied, the rectangle (2) is extracted as a vertical ruled line. The threshold value RLwidthTH may be a fixed value or a variable value adaptively determined.

【0036】次に、領域判定部517において、矩形分
類部506で分類された表領域矩形候補たる矩形(1)
毎に、それより抽出された水平罫線と垂直罫線の本数と
閾値とを比較する(ステップ625,630)。ここで
は、閾値として3を用いるものとすると、3本以上の水
平罫線と3本以上の垂直罫線の両方が抽出された矩形
(1)を表領域と判定する(ステップ635)。なお、
各方向罫線検定部112,412より抽出した罫線の位
置座標が出され、領域判定部517はこの位置座標と矩
形メモリ104に格納されている矩形(1)の情報との
比較によって、各罫線がどの矩形(1)に属するもので
あるかを認識する。
Next, in the area determination section 517, the rectangle (1) which is the table area rectangle candidate classified by the rectangle classification section 506.
For each time, the number of horizontal ruled lines and vertical ruled lines extracted therefrom and the threshold value are compared (steps 625 and 630). Here, assuming that 3 is used as the threshold value, the rectangle (1) in which both three or more horizontal ruled lines and three or more vertical ruled lines are extracted is determined as the table area (step 635). In addition,
The position coordinates of the extracted ruled lines are output from the direction ruled line verification units 112 and 412, and the area determination unit 517 compares each position coordinate with the information of the rectangle (1) stored in the rectangular memory 104 to determine each ruled line. Recognize which rectangle (1) it belongs to.

【0037】図7において、700は表領域矩形候補た
る矩形(1)の例である。この例では、矩形(1)70
0は3本の水平罫線と3本の垂直罫線からなる表領域で
ある。水平罫線は矩形(2)702a〜702cとして
抽出され、垂直罫線は矩形(2)704a〜704cと
して抽出され、それぞれ最終的に水平罫線または垂直罫
線と判定されるので、この矩形(1)700は各罫線の
本数の条件を満足し表領域と判定されることになる。
In FIG. 7, reference numeral 700 is an example of a rectangle (1) which is a table area rectangle candidate. In this example, rectangle (1) 70
Reference numeral 0 is a table area composed of three horizontal ruled lines and three vertical ruled lines. The horizontal ruled lines are extracted as rectangles (2) 702a to 702c, and the vertical ruled lines are extracted as rectangles (2) 704a to 704c. Since they are finally determined to be horizontal ruled lines or vertical ruled lines, the rectangle (1) 700 is The condition of the number of each ruled line is satisfied and the table area is determined.

【0038】実施例6 本実施例は実施例5と同様の機能構成を有する。本実施
例の処理内容は、領域判定部517の判定条件(図6の
ステップ625,630対応)が実施例5の場合と異な
るが、その他は同様である。
Sixth Embodiment This embodiment has the same functional configuration as the fifth embodiment. The processing content of this embodiment is the same as that of the fifth embodiment, except that the determination condition of the area determination unit 517 (corresponding to steps 625 and 630 in FIG. 6) is different from that of the fifth embodiment.

【0039】領域判定部517の処理について、図7
(b)により説明する。矩形(1)700の上辺からあ
る範囲RangeUTHにある最も上の水平罫線(70
2a)をupper、矩形(1)700の下辺からある
範囲RangeLTHにある最も下の水平罫線(702
c)をlowerとする。このようなupperとlo
werが存在し、かつ実施例5と同様に水平罫線、垂直
罫線がともに3本以上存在する場合に、矩形(1)を表
領域と判定する。
Regarding the processing of the area determination unit 517, FIG.
This will be described with reference to (b). The uppermost horizontal ruled line (70 in a range RangeUTH from the upper side of the rectangle (1) 700)
2a) is upper, and the lowermost horizontal ruled line (702) in a range RangeLTH from the lower side of the rectangle (1) 700.
Let c) be lower. Such upper and lo
If there are wers and there are three or more horizontal ruled lines and vertical ruled lines as in the fifth embodiment, the rectangle (1) is determined as the table area.

【0040】実施例7 本実施例は実施例5と同様の機能構成を有する。本実施
例の処理内容は、領域判定部517の判定条件が実施例
5の場合と異なるが、その他は同様である。
Seventh Embodiment This embodiment has the same functional configuration as the fifth embodiment. The processing content of this embodiment is the same as that of the fifth embodiment, except that the determination condition of the area determination unit 517 is different from that of the fifth embodiment.

【0041】領域判定部517の処理について、図7に
より説明する。実施例6の場合と同様に、upperと
lowerを求める。さらに、矩形(1)700の左辺
からRangeLFTHkの範囲と矩形(1)の右辺か
らRangeRTHの範囲を除いた中間の範囲にある垂
直罫線(704b)をmiddleとする。
The processing of the area determination unit 517 will be described with reference to FIG. Similar to the case of the sixth embodiment, upper and lower are calculated. Further, the vertical ruled line (704b) in the middle range excluding the range of RangeLFTHk from the left side of the rectangle (1) 700 and the range of Range RTH from the right side of the rectangle (1) is set as a middle.

【0042】そして、水平罫線が3本以上あり、かつ、
upperとlower(実施例6参照)が存在し、か
つ、middleが1本以上ある場合に矩形(1)を表
領域と判定する。
There are three or more horizontal ruled lines, and
If upper and lower (see Example 6) exist and one or more middles exist, the rectangle (1) is determined as the table area.

【0043】実施例8 本実施例は実施例5と同様の機能構成を有する。本実施
例の処理内容は、領域判定部517の判定条件が実施例
5の場合と異なるが、その他は同様である。
Eighth Embodiment This embodiment has the same functional configuration as the fifth embodiment. The processing content of this embodiment is the same as that of the fifth embodiment, except that the determination condition of the area determination unit 517 is different from that of the fifth embodiment.

【0044】表領域判定部517の処理を図7を用いて
説明する。実施例6の場合と同様に、upperとlo
werを求める。さらに、矩形(1)700の左辺から
RangeLFTHの範囲にある最も左の垂直罫線(7
04a)をleft、矩形(1)の右辺からRange
RTHの範囲にある最も右の垂直罫線(704c)をr
ight、それらの範囲以外の中間の範囲にある垂直罫
線(704b)をmiddleとして求める。
The processing of the table area determination unit 517 will be described with reference to FIG. Similar to the case of the sixth embodiment, upper and lo
ask for a wer. Further, the leftmost vertical ruled line (7) in the range of RangeLFTH from the left side of the rectangle (1) 700.
04a) to the left, and Range from the right side of the rectangle (1)
R the rightmost vertical ruled line (704c) in the RTH range
A vertical ruled line (704b) in an intermediate range other than the light and those ranges is obtained as a middle.

【0045】そして、領域(1)において、実施例5,
6,7のいずれかの表領域判定条件に当てはまる場合は
領域(1)を表領域と判定する。それに当てはまらない
らない場合、upper,lower,left,ri
ghtの一つ以上が存在するときは領域(1)を囲み領
域と判定するが、upper,lower,left,
rightのいずれも存在しないときは領域(1)を表
領域でも囲み枠領域でもない、その他領域(図、写真
等)と判定する。
Then, in the region (1), the fifth embodiment
If any of the table area determination conditions 6 and 7 is satisfied, the area (1) is determined to be the table area. If not, upper, lower, left, ri
When one or more ghts exist, the area (1) is determined to be an enclosing area, but upper, lower, left,
If none of the light is present, the area (1) is determined as the other area (drawing, photograph, etc.) that is neither the table area nor the surrounding frame area.

【0046】[0046]

【発明の効果】以上の詳細説明から明らかなように、請
求項1または2の発明によれば水平または垂直罫線を正
確に識別できるという効果を得られ、請求項3の発明に
よれば表領域を正確に識別できるという効果を得られ、
また請求項4の発明によれば表領域の正確な識別と囲み
枠領域、その他領域の識別が可能になるという効果を得
られる。
As is clear from the above detailed description, according to the invention of claim 1 or 2, the effect that the horizontal or vertical ruled line can be accurately identified can be obtained, and according to the invention of claim 3, the table area is obtained. The effect of being able to accurately identify
Further, according to the invention of claim 4, it is possible to obtain an effect that the table area can be accurately identified and the surrounding frame area and other areas can be identified.

【図面の簡単な説明】[Brief description of drawings]

【図1】実施例1及び実施例2の機能ブロック図であ
る。
FIG. 1 is a functional block diagram of a first embodiment and a second embodiment.

【図2】実施例1の処理フローチャートである。FIG. 2 is a processing flowchart of the first embodiment.

【図3】(a)水平罫線矩形候補の抽出例を示す図であ
る。 (b)実施例1における水平罫線判定の説明図である。 (c)実施例2における水平罫線判定の説明図である。
FIG. 3A is a diagram showing an example of extraction of horizontal ruled line rectangle candidates. FIG. 7B is an explanatory diagram of horizontal ruled line determination in the first embodiment. (C) It is explanatory drawing of the horizontal ruled line determination in Example 2.

【図4】実施例3及び実施例4の機能ブロック図であ
る。
FIG. 4 is a functional block diagram of third and fourth embodiments.

【図5】実施例5,6,7及び8の機能ブロック図であ
る。
5 is a functional block diagram of Examples 5, 6, 7 and 8. FIG.

【図6】実施例5の処理フローチャートである。FIG. 6 is a processing flowchart of the fifth embodiment.

【図7】(a)表領域矩形候補の例を示す図である。 (b)水平罫線に関する説明図である。 (c)垂直罫線に関する説明図である。FIG. 7A is a diagram showing an example of a table area rectangle candidate. (B) It is explanatory drawing regarding a horizontal ruled line. (C) It is explanatory drawing regarding a vertical ruled line.

【符号の説明】[Explanation of symbols]

100 文書画像メモリ 102 矩形抽出部 104 矩形メモリ 106 矩形分類部 108 水平罫線抽出部 110 水平罫線矩形メモリ 112 水平罫線検定部 114 制御部 300 水平罫線矩形候補(矩形(1)) 302 水平罫線矩形(矩形(2)) 408 垂直罫線抽出部 410 垂直罫線矩形メモリ 412 垂直罫線検定部 506 矩形分類部 517 領域判定部 700 表領域矩形候補(矩形(1)) 702a〜702c 水平罫線矩形(矩形(2)) 704a〜704c 垂直罫線矩形(矩形(2)) 100 Document Image Memory 102 Rectangle Extraction Unit 104 Rectangle Memory 106 Rectangle Classification Unit 108 Horizontal Ruled Line Extraction Unit 110 Horizontal Ruled Line Rectangle Memory 112 Horizontal Ruled Line Verification Unit 114 Control Unit 300 Horizontal Ruled Line Rectangle Candidate (Rectangle (1)) 302 Horizontal Ruled Line Rectangle (Rectangle) (2)) 408 vertical ruled line extraction unit 410 vertical ruled line rectangle memory 412 vertical ruled line inspection unit 506 rectangular classification unit 517 region determination unit 700 table region rectangle candidate (rectangle (1)) 702a to 702c horizontal ruled line rectangle (rectangle (2)) 704a to 704c Vertical ruled line rectangle (rectangle (2))

Claims (4)

【特許請求の範囲】[Claims] 【請求項1】 (ア)文書画像またはその縮小画像から
黒連結成分の外接矩形を抽出し、 (イ)上記(ア)で抽出された矩形より、その水平方向
及び垂直方向の大きさに基づき、水平方向または垂直方
向いずれか特定の方向の罫線矩形候補を選び、 (ウ)上記(イ)で選ばれた罫線矩形候補の範囲につい
て文書画像またはその縮小画像を特定方向にスキャン
し、ある閾値以上の長さの黒ランから罫線矩形を生成
し、 (エ)上記(イ)で選ばれた罫線矩形候補と、それより
上記(ウ)で得られた罫線矩形の水平方向及び垂直方向
の大きさの関係に基づき、該罫線矩形が特定方向の罫線
であるか否かを判定する罫線識別方法。
1. (a) A circumscribed rectangle of a black connected component is extracted from a document image or a reduced image thereof, and (b) based on the size of the rectangle extracted in (a) above in the horizontal and vertical directions. Select a ruled line rectangle candidate in a specific direction, either horizontal or vertical, and (c) scan the document image or its reduced image in the specified direction for the range of ruled line rectangle candidates selected in (a) above, and set a certain threshold. A ruled line rectangle is generated from the black run having the above length, and (d) the ruled line rectangle candidate selected in (a) above and the size of the ruled line rectangle obtained in (c) above in the horizontal and vertical directions. A ruled line identification method for determining whether or not the ruled line rectangle is a ruled line in a specific direction based on the relationship of height.
【請求項2】 (ア)文書画像またはその縮小画像から
黒連結成分の外接矩形を抽出し、 (イ)上記(ア)で抽出された矩形より、その水平方向
及び垂直方向の大きさに基づき、水平方向または垂直方
向いずれか特定の方向の罫線矩形候補を選び、 (ウ)上記(イ)で選ばれた罫線矩形候補の範囲につい
て文書画像またはその縮小画像を特定方向にスキャン
し、ある閾値以上の長さの黒ランから罫線矩形を生成
し、 (エ)上記(イ)で選ばれた罫線矩形候補と、それより
上記(ウ)で得られた罫線矩形の水平方向及び垂直方向
の位置関係に基づき、該罫線矩形が特定方向の罫線であ
るか否かを判定する罫線識別方法。
2. (a) A circumscribed rectangle of a black connected component is extracted from the document image or its reduced image, and (b) based on the size of the rectangle extracted in (a) above in the horizontal and vertical directions. Select a ruled line rectangle candidate in a specific direction, either horizontal or vertical, and (c) scan the document image or its reduced image in the specified direction for the range of ruled line rectangle candidates selected in (a) above, and set a certain threshold. A ruled line rectangle is generated from the black run having the above length, and (d) the ruled line rectangle candidate selected in (a) above and the position of the ruled line rectangle obtained in (c) above in the horizontal and vertical directions. A ruled line identification method for determining whether or not the ruled line rectangle is a ruled line in a specific direction based on the relationship.
【請求項3】 (ア)文書画像またはその縮小画像から
黒連結成分の外接矩形を抽出し、 (イ)上記(ア)で抽出された矩形より、その水平方向
及び垂直方向の大きさに基づき、表領域矩形候補を選
び、 (ウ)上記(イ)で選ばれた表領域矩形候補の範囲につ
いて文書画像またはその縮小画像を水平方向にスキャン
することによって、ある閾値以上の長さの黒ランからな
る水平罫線矩形を抽出し、該水平罫線矩形に対して水平
罫線の条件判定を行なうことによって水平罫線を抽出
し、 (エ)上記(イ)で選ばれた表領域矩形候補の範囲につ
いて文書画像またはその縮小画像を垂直方向にスキャン
し、ある閾値以上の長さの黒ランからなる垂直罫線を抽
出し、該垂直罫線矩形に対して垂直罫線の条件判定を行
なうことによって垂直罫線を抽出し、 (オ)上記(イ)で選ばれた表領域矩形候補から上記
(ウ)で抽出された水平罫線の本数及び上記(エ)で抽
出された垂直罫線の本数を少なくとも含む判定条件によ
り、該表領域矩形候補を表領域であるか否かを判定する
領域識別方法。
3. (a) A circumscribed rectangle of a black connected component is extracted from a document image or a reduced image thereof, and (b) based on the size in the horizontal and vertical directions of the rectangle extracted in (a) above. , A table area rectangle candidate is selected, and (c) a document image or a reduced image thereof is horizontally scanned within the range of the table area rectangle candidate selected in (a) above to obtain a black run having a length equal to or greater than a certain threshold. The horizontal ruled line rectangle is extracted, and the horizontal ruled line is extracted by performing the condition determination of the horizontal ruled line with respect to the horizontal ruled line rectangle, and (d) the range of candidate table area rectangles selected in (a) above is documented. The image or its reduced image is scanned in the vertical direction, a vertical ruled line consisting of black runs having a length equal to or greater than a certain threshold is extracted, and the vertical ruled line is extracted by performing a condition determination of the vertical ruled line on the vertical ruled line rectangle. , (E) by the judgment condition including at least the number of horizontal ruled lines extracted in (c) above and the number of vertical ruled lines extracted in (d) above from the table area rectangle candidate selected in (b) above, An area identification method for determining whether a table area rectangle candidate is a table area.
【請求項4】 請求項3記載の領域識別方法において、
上記(ウ)の判定条件に表領域矩形候補の特定範囲にお
ける水平罫線または垂直罫線の有無も含み、該判定条件
によって、該表領域矩形候補を表領域、囲み枠領域、ま
たはその他領域のいずれであるかを判定することを特徴
とする領域識別方法。
4. The area identification method according to claim 3,
The determination condition of (c) above also includes the presence or absence of a horizontal ruled line or a vertical ruled line in a specific range of the table area rectangle candidate. Depending on the determination condition, the table area rectangle candidate is selected as a table area, a surrounding frame area, or another area. An area identification method characterized by determining whether there is an area.
JP16086692A 1992-06-19 1992-06-19 Ruled line identification method and area identification method Expired - Lifetime JP3215163B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP16086692A JP3215163B2 (en) 1992-06-19 1992-06-19 Ruled line identification method and area identification method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP16086692A JP3215163B2 (en) 1992-06-19 1992-06-19 Ruled line identification method and area identification method

Publications (2)

Publication Number Publication Date
JPH064704A true JPH064704A (en) 1994-01-14
JP3215163B2 JP3215163B2 (en) 2001-10-02

Family

ID=15724070

Family Applications (1)

Application Number Title Priority Date Filing Date
JP16086692A Expired - Lifetime JP3215163B2 (en) 1992-06-19 1992-06-19 Ruled line identification method and area identification method

Country Status (1)

Country Link
JP (1) JP3215163B2 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7995869B2 (en) 2006-05-02 2011-08-09 Ricoh Company, Ltd. Information processing apparatus, information processing method, and information storing medium
US8125683B2 (en) 2005-06-01 2012-02-28 Ricoh Company, Ltd. Image preview processing apparatus, image preview processing method, and image preview computer product
US8260057B2 (en) 2007-07-12 2012-09-04 Ricoh Company, Limited Image processing apparatus that obtains a ruled line from a multi-value image
US8321367B2 (en) 2008-03-17 2012-11-27 Ricoh Company. Ltd. Data processing apparatus, method, and computer program product for user objective prediction
US9858512B2 (en) 2015-07-30 2018-01-02 Kyocera Document Solutions Inc. Image processing apparatus with an improved detection of ruled lines
US10106919B2 (en) 2012-12-28 2018-10-23 Kuraray Co., Ltd. Drawing device and drawing method
US10650493B2 (en) 2017-06-05 2020-05-12 Kyocera Document Solutions, Inc. Image processing apparatus

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8254669B2 (en) 2007-09-19 2012-08-28 Ricoh Company, Ltd. Data processing apparatus, computer program product, and data processing method for predicting an optimum function based on a case database and image feature values calculated by a feature-value calculating unit

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8125683B2 (en) 2005-06-01 2012-02-28 Ricoh Company, Ltd. Image preview processing apparatus, image preview processing method, and image preview computer product
US7995869B2 (en) 2006-05-02 2011-08-09 Ricoh Company, Ltd. Information processing apparatus, information processing method, and information storing medium
US8260057B2 (en) 2007-07-12 2012-09-04 Ricoh Company, Limited Image processing apparatus that obtains a ruled line from a multi-value image
US8321367B2 (en) 2008-03-17 2012-11-27 Ricoh Company. Ltd. Data processing apparatus, method, and computer program product for user objective prediction
US10106919B2 (en) 2012-12-28 2018-10-23 Kuraray Co., Ltd. Drawing device and drawing method
US9858512B2 (en) 2015-07-30 2018-01-02 Kyocera Document Solutions Inc. Image processing apparatus with an improved detection of ruled lines
US10650493B2 (en) 2017-06-05 2020-05-12 Kyocera Document Solutions, Inc. Image processing apparatus

Also Published As

Publication number Publication date
JP3215163B2 (en) 2001-10-02

Similar Documents

Publication Publication Date Title
US6512848B2 (en) Page analysis system
US6574375B1 (en) Method for detecting inverted text images on a digital scanning device
US5335290A (en) Segmentation of text, picture and lines of a document image
JP4516778B2 (en) Data processing system
US6674900B1 (en) Method for extracting titles from digital images
US7379594B2 (en) Methods and systems for automatic detection of continuous-tone regions in document images
US5452374A (en) Skew detection and correction of a document image representation
JP3278471B2 (en) Area division method
JP2002208007A (en) Automatic detection of scanned document
JP2000207489A (en) Character extracting method and device and record medium
KR100923935B1 (en) Method and system for evaluating document image automatically for optical character recognition
US6360006B1 (en) Color block selection
JP3606500B2 (en) Rectangle classification method
JPH064704A (en) Ruled line discriminating method and area discriminating method
JPH0418351B2 (en)
JP3285686B2 (en) Area division method
JPH08190690A (en) Method for determining number plate
JP3544324B2 (en) CHARACTER STRING INFORMATION EXTRACTION DEVICE AND METHOD, AND RECORDING MEDIUM CONTAINING THE METHOD
JP4409713B2 (en) Document image recognition apparatus and recording medium
JP3187895B2 (en) Character area extraction method
JP2000357287A (en) Method and device for number plate recognition
JPH07141465A (en) Method for detecting inclination of document image
JPH07230525A (en) Method for recognizing ruled line and method for processing table
JP3220226B2 (en) Character string direction determination method
JP2877548B2 (en) Document image attribute discrimination method

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20070727

Year of fee payment: 6

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080727

Year of fee payment: 7

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090727

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090727

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20100727

Year of fee payment: 9

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20110727

Year of fee payment: 10

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120727

Year of fee payment: 11

EXPY Cancellation because of completion of term