JP2001331763A

JP2001331763A - Method for recognizing table

Info

Publication number: JP2001331763A
Application number: JP2001064970A
Authority: JP
Inventors: Junji Kashioka; 潤二柏岡; Satoshi Naoi; 聡直井
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2000-03-17
Filing date: 2001-03-08
Publication date: 2001-11-30
Anticipated expiration: 2021-03-08
Also published as: JP3904397B2

Abstract

PROBLEM TO BE SOLVED: To precisely separate a group of characters in a tubular document having no ruled lines (or where ruled lines are deleted remarkably) and to precisely recognize table structure even when a table is tilted. SOLUTION: Characters whose pitches are within a fixed value in a table are integrated to obtain the group of the characters and the average of the pitches of the characters in the group of the characters is obtained to re-divide the group of the characters based on the average. Next, a group of characters where the mutual positional relation between the re-divided groups of the characters is within fixed inclination are registered as a row. The groups of characters having a relation where coordinates in the horizontal direction of the re-divided groups of the characters are overlapped with each other are registered as a column. Then, the groups of the characters of respective rows and respective columns obtained like above are recognized to prepare data which can be read into a table.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】近年、入力周辺機器として文
書認識処理の需要が増加している。本発明は、この文書
認識処理において、より高い精度で罫線のない表を認識
することができる表認識方法に関する。BACKGROUND OF THE INVENTION In recent years, demand for document recognition processing as input peripheral equipment has been increasing. The present invention relates to a table recognition method capable of recognizing a table without ruled lines with higher accuracy in the document recognition processing.

【０００２】[0002]

【従来の技術】表形式文書の文字認識結果を例えば表計
算ソフト等で使用する場合には、表構造を認識し、表中
の文字群が何列目の何行目に記録された文字群であるか
を認識する必要がある。罫線がある表形式文書の場合に
は、通常、罫線構造を認識し、罫線構造から各行、各列
の文字群を識別していた。一方、図１１（ａ）のような
罫線のない表形式文書を認識する場合には、従来、文字
サイズの平均値、文字ピッチ等の平均値より文字間隔を
定め、該一定の文字間隔の文字を統合して文字群を求
め、文字群の縦方向、横方向それぞれの座標が重なる関
係にある場合に、行、列として登録する処理が用いられ
ていた。2. Description of the Related Art When a character recognition result of a tabular document is used in, for example, spreadsheet software, a table structure is recognized, and a character group recorded in a column and a row of a character group in the table is recognized. Needs to be recognized. In the case of a table format document having ruled lines, the ruled line structure is usually recognized, and the character group of each row and each column is identified from the ruled line structure. On the other hand, when recognizing a tabular document having no ruled lines as shown in FIG. 11A, conventionally, the character spacing is determined based on the average value of the character size, the character pitch, and the like. Has been used to obtain a character group, and when the vertical and horizontal coordinates of the character group overlap each other, the character group is registered as a row or a column.

【０００３】[0003]

【発明が解決しようとする課題】図１１（ａ）の表では
上記のように一定の文字間隔の文字を統合することによ
り文字群を正確に推定することができるが、図１１
（ｂ）のような表では、上記一定の文字間隔で文字群を
推定すると、図中領域のように本来複数の文字群とな
るべきものが一つの文字群として認識される場合があっ
た。また、図１１（ｃ）のように表が傾いていた場合に
は、行間隔が狭いと複数の行が一つの行として認識され
る場合があった。本発明は、上記事情に鑑みなされたも
のであって、本発明は、罫線がないもしくは罫線が大幅
に削除されている表形式文書における表認識において、
従来の方法では分離することができなかった文字群を精
度よく分離することができ、また表が傾いている場合で
あっても精度良く表を認識することができる表認識方法
を提供するこを目的とする。In the table of FIG. 11 (a), a character group can be accurately estimated by integrating characters at a fixed character interval as described above.
In a table such as that shown in (b), when character groups are estimated at the above-mentioned constant character interval, a character group that should originally be a plurality of character groups, such as an area in the drawing, may be recognized as one character group. When the table is inclined as shown in FIG. 11C, a plurality of rows may be recognized as one row if the row interval is small. The present invention has been made in view of the above circumstances, and the present invention provides a method for recognizing a table in a tabular document in which there is no ruled line or the ruled line is largely deleted.
Provided is a table recognition method that can accurately separate a character group that cannot be separated by the conventional method, and that can accurately recognize a table even when the table is inclined. Aim.

【０００４】[0004]

【課題を解決するための手段】図１は本発明の概要を説
明する図である。同図に示すように本発明においては、
次の〜のようにして表を認識する。表内の文字間隔が一定以内の文字を統合して文字群
を求める。上記文字群内の文字の文字間隔の平均値を求めて、
該平均値に基づき上記文字群を再分割する。上記再分割した文字群同士の位置関係が一定の傾き
以内の文字群を行として登録する。上記再分割した文字群同士の横方向の座標が重なる
関係をもつ文字群を列として登録する。また、罫線が一部省かれた表を認識する場合には、画像
中に含まれる罫線を除去してから、上記〜の表認識
を行う。本発明においては、上記のように表を認識して
いるので、表中に他の文字群の間隔より、間隔の狭い文
字群が混在していても、これらの文字群を複数の文字群
として正確に認識することができる。また、傾きのある
表に対しても正確に表のを認識することができる。FIG. 1 is a diagram for explaining the outline of the present invention. As shown in FIG.
The table is recognized as follows. A character group is obtained by integrating characters in a table with a character spacing within a certain range. Find the average value of the character spacing of the characters in the above character group,
The character group is subdivided based on the average value. A character group in which the positional relationship between the subdivided character groups is within a certain inclination is registered as a line. A character group having a relationship in which the horizontal coordinates of the subdivided character groups overlap each other is registered as a column. When recognizing a table in which the ruled lines are partially omitted, the ruled line included in the image is removed, and then the above table recognition is performed. In the present invention, since the table is recognized as described above, even if a character group with a narrower interval is mixed in the table than the space between other character groups, these character groups are used as a plurality of character groups. Can be accurately recognized. In addition, even a table having a slope can be recognized accurately.

【０００５】[0005]

【発明の実施の形態】図２に本発明の第１の実施例の処
理ブロック図を示す。なお、本発明は、ＣＰＵ、メモ
リ、外部記憶装置、入出力装置、画像読み取りを行うス
キャナ、記録媒体読み取り装置、通信インタフェース等
を備えた通常の計算機システムにより実現することがで
き、上記外部記憶装置等に本発明の文字認識処理を行う
プログラムが格納され、実行時、上記プログラムがメモ
リに読み込まれ、スキャナ等で読み取った画像につい
て、本発明の文字認識処理による文字認識が行われ、文
字認識結果が上記入出力装置から出力される。FIG. 2 is a processing block diagram of a first embodiment of the present invention. Note that the present invention can be realized by a normal computer system including a CPU, a memory, an external storage device, an input / output device, a scanner for reading an image, a recording medium reading device, a communication interface, and the like. The program for performing the character recognition process of the present invention is stored in the memory. When the program is executed, the program is read into a memory, and the image read by the scanner or the like is subjected to character recognition by the character recognition process of the present invention. Is output from the input / output device.

【０００６】次に本発明の第１の実施例について説明す
る。本実施例は例えば、前記図１１（ｂ）のように、罫
線がない表形式文書の認識に適用される。本実施例で
は、図２に示すように、まず、文字群抽出部１１で文字
群を囲む矩形座標を求める。すなわち、罫線のない表と
して与えられた範囲に対して、文字を形成する画素の連
結性より連結成分を囲む矩形を抽出し、更に矩形間の重
なりや位置関係から矩形を統合して文字を推定し、文字
を囲む矩形を抽出する。次いで、抽出された各文字を矩
形の座標から、文字と文字との間の長さ（文字間隔）が
一定の閾値以内にあるものを統合して文字群を抽出し、
文字群を囲む矩形座標を求める。図３（ａ）に示す罫線
のない表から上記文字群抽出部１１により抽出された文
字群を図３（ｂ）に示す。同図に示すように、「（１
月）〜（５月）」は文字間隔が短いので一つの文字群と
して抽出されている。Next, a first embodiment of the present invention will be described. This embodiment is applied to, for example, recognition of a tabular document having no ruled line as shown in FIG. In this embodiment, as shown in FIG. 2, first, the character group extracting unit 11 obtains rectangular coordinates surrounding the character group. In other words, for a range given as a table without ruled lines, rectangles surrounding connected components are extracted from the connectivity of the pixels forming the characters, and then the rectangles are integrated from the overlap and positional relationship between the rectangles to estimate the characters. Then, a rectangle surrounding the character is extracted. Next, a character group is extracted by integrating the extracted characters from the coordinates of the rectangle by integrating the characters having a length (character interval) between the characters within a certain threshold value,
Find the rectangular coordinates surrounding the character group. FIG. 3B shows a character group extracted from the table without ruled lines shown in FIG. As shown in FIG.
"Month) to (May)" are extracted as one character group because the character interval is short.

【０００７】次に、文字群再分割部１２で文字群再分割
処理を行なう。すなわち、後述する図７のフローチャー
トに示すように、文字群抽出部１１で抽出した文字群を
一つずつ取り出し、隣接する文字同士の文字間隔を求め
てその平均Ｗａを求める。そして、各文字がＷａに対し
て一定以上の長さで離れているかを調べ、該当する場合
には、文字群を再分割する。図３（ａ）の表に対して、
文字群再分割処理で抽出された文字群を図３（ｃ）に示
す。同図に示すように、「（１月）〜（５月）」はそれ
ぞれ文字群として再分割されている。次いで、行抽出部
１３において行抽出処理を行なう。行抽出部１３では、
後述する図８のフローチャートに示すように、文字群間
の傾きを考慮して行を抽出する。すなわち、まず文字群
を左から順にソートし、文字群をソートした順に一つ取
り出し番号をｉとする。この時点で行が無い場合には新
規に行を作成し、文字群ｉをその行に登録する。例えば
図３（ｃ）の例では「合計」が最も左にあるので、新規
に行が登録される。Next, the character group subdivision section 12 performs a character group subdivision process. That is, as shown in the flowchart of FIG. 7 described later, the character groups extracted by the character group extraction unit 11 are extracted one by one, the character spacing between adjacent characters is calculated, and the average Wa is calculated. Then, it is checked whether each character is apart from Wa by a certain length or more, and if so, the character group is subdivided. With respect to the table of FIG.
FIG. 3C shows a character group extracted by the character group subdivision process. As shown in the figure, “(January) to (May)” are subdivided as character groups. Next, the row extracting unit 13 performs a row extracting process. In the row extraction unit 13,
As shown in the flowchart of FIG. 8 described later, lines are extracted in consideration of the inclination between the character groups. That is, first, the character groups are sorted in order from the left, and one extracted number is set to i in the order in which the character groups are sorted. If there is no line at this point, a new line is created and the character group i is registered in that line. For example, in the example of FIG. 3C, “sum” is at the leftmost position, so a new row is registered.

【０００８】上記のようにして行が登録されると、その
行に存在する文字群のうち、文字群ｉと横方向に一番近
い文字群と文字群ｉとの傾きを求め、既に抽出した行の
中で最小の傾きを与える行番号（ｇ）と傾き（θｍｉ
ｎ）を記憶する。ここで傾きは図４に示すように文字群
を囲む矩形座標から中心点をそれぞれ求めて、中心点間
を結ぶ線が水平線となす傾きを採用すればよい。次に、
上記傾きθｍｉｎが閾値（θｔｈ）より小さいか調べ、
θｍｉｎがθｔｈより大きい場合には、新しく行を作成
する。また、θｍｉｎが閾値（θｔｈ）より小さい場合
には、ｇ行に文字群ｉを登録する。この処理を全ての文
字群に対して行ない、行抽出を完了する。以上のように
して行を抽出することにより、例えば図６（ａ）に示す
ように、行と文字群が対応付けられる。When the line is registered as described above, the inclination between the character group i and the character group i closest to the character group i in the horizontal direction among the character groups existing in the line is obtained and already extracted. The row number (g) giving the minimum slope and the slope (θmi
n) is stored. Here, as the inclination, as shown in FIG. 4, a center point may be obtained from the rectangular coordinates surrounding the character group, and a line connecting the center points with a horizontal line may be used. next,
Check whether the inclination θmin is smaller than a threshold value (θth),
If θmin is larger than θth, a new row is created. If θmin is smaller than the threshold (θth), the character group i is registered in the g-th line. This process is performed for all the character groups, and the line extraction is completed. By extracting the lines as described above, the lines and the character groups are associated with each other, for example, as shown in FIG.

【０００９】次に列抽出部１４において列抽出を行な
う。列抽出は、例えば、横方向の座標が重なる関係をも
つ文字群を列として登録する。すなわち、横方向をＸ
軸、縦方向をＹ軸としたとき、文字群を囲む矩形のＸ軸
方向の座標値が重なる文字群（Ｘ軸方向の座標値の少な
くとも一部が同じ値である文字群）を列として登録す
る。例えば、前記図３（ｃ）の例では、図５の破線で囲
まれた領域内の文字群のＸ軸方向の座標値が重なってい
るので各領域内の文字群は同じ列に登録される。これに
より、例えば図６（ｂ）に示すように列と文字群が対応
付けられる。表データ作成部１５では、各文字群の文字
認識を行い、行と列の関係からセルデータを作成し、セ
ルデータに文字群を登録する。すなわち、文字群の認識
結果と、図６（ａ）に示す行抽出結果、図６（ｂ）に示
す列抽出結果から、図６（ｃ）に示すような表に書き込
み可能なデータが作成される。Next, a column extraction section 14 performs column extraction. In the column extraction, for example, a character group having a relationship in which the coordinates in the horizontal direction overlap is registered as a column. That is, the horizontal direction is X
When the axis and the vertical direction are the Y axis, a character group in which the coordinate values in the X axis direction of the rectangle surrounding the character group overlap (a character group in which at least a part of the coordinate values in the X axis direction have the same value) is registered as a column. I do. For example, in the example of FIG. 3C, since the coordinate values in the X-axis direction of the character groups in the area surrounded by the broken line in FIG. 5 overlap, the character groups in each area are registered in the same column. . Thereby, for example, as shown in FIG. 6B, the columns are associated with the character groups. The table data creation unit 15 performs character recognition for each character group, creates cell data from the relationship between rows and columns, and registers the character group in the cell data. That is, data that can be written in a table as shown in FIG. 6C is created from the character group recognition result, the row extraction result shown in FIG. 6A, and the column extraction result shown in FIG. 6B. You.

【００１０】次に、前記文字群再分割部１２における文
字群再分割処理を、図７のフローチャートにより説明す
る。まず、文字群抽出部１１で抽出した文字群を一つず
つ取り出し、取り出した文字群ｉに含まれる文字（個
数：Ｎｃ）をその座標に基づいて左から順にソートす
る。そして隣接する文字同士の文字間隔を求めて、その
平均Ｗａを得る。すなわち、図７に示すように、文字群
数をＮｇに設定し、ｉ＝１とする（ステップＳ１）。つ
いで、ｉ≦Ｎｇであるかを調べ、ｉ＞Ｎｇの場合は処理
を終了する。また、ｉ≦Ｎｇの場合には、Ｎｃをｉ番目
の文字群の文字数とする（ステップＳ３）。そして、ｉ
番目の文字群に含まれる文字を左から順にソートする
（ステップＳ４）。ついで、Ｗａをｉ番目の文字の文字
群の平均文字間隔に設定する（ステップＳ５）。Next, the character group subdivision processing in the character group subdivision section 12 will be described with reference to the flowchart of FIG. First, the character groups extracted by the character group extraction unit 11 are extracted one by one, and the characters (number: Nc) included in the extracted character group i are sorted in order from the left based on the coordinates. Then, the character interval between adjacent characters is obtained, and the average Wa is obtained. That is, as shown in FIG. 7, the number of character groups is set to Ng, and i = 1 (step S1). Then, it is checked whether i ≦ Ng, and if i> Ng, the process is terminated. If i ≦ Ng, Nc is set to the number of characters of the i-th character group (step S3). And i
The characters included in the th character group are sorted in order from the left (step S4). Next, Wa is set to the average character spacing of the character group of the i-th character (step S5).

【００１１】次いで、ソートした順に文字ｊと次の文字
ｊ＋１との間隔を求め、Ｗａに対して一定以上の長さで
離れているかを調べ、該当する場合にはその文字ｊと文
字ｊ＋１の間で文字群を再分割する。この際に文字群数
Ｎを一つ増やし、ｉ＋１番目以降の文字群の番号を一つ
増やす。そしてｉ番目の文字群には１〜ｊまでの文字を
登録し、登録した文字を包含する矩形座標を得る。次い
で、ｉ＋１番目の文字群に新たにｊ＋１〜Ｎｃまでの文
字を登録し．登録した文字を包含する矩形座標を得る。
そしてｉを一つ増やし、次の文字群の再分割処理を繰り
返していき、全ての文字群に対して再分割処理を行な
う。すなわち、図７に示すように、ｊ＝１として、ｊ＜
Ｎｃであるかを調べ（ステップＳ７）、ｊ≧Ｎｃの場合
には、ステップＳ１５においてｉ＝ｉ＋１としてステッ
プＳ２に戻り、ｊ＜Ｎｃの場合には、文字ｊと文字ｊ＋
１の間隔をＷとして（ステップＳ８）、文字間隔Ｗ＞平
均文字間隔Ｗａ×〔一定値〕であるかを調べる（ステッ
プＳ９）。上記条件を満たさない場合には、ステップＳ
１０でｊ＝ｊ＋１としてステップＳ７に戻る。また、文
字間隔Ｗ＞平均文字間隔Ｗａ×〔一定値〕の場合には、
文字群数を１増やし（ステップＳ１１）、ｉ＋１番目以
降の文字群の番号を１増やす。さらに、ｉ番目の文字群
に１〜ｊ番目の文字を登録し、文字数、矩形座標を算出
し（ステップＳ１３）、ｉ＋１番目の文字群にｊ＋１〜
Ｎｃ番目の文字を登録し、文字数、矩形座標を算出する
（ステップＳ１４）。以上の処理が終わるとステップＳ
１５でｉ＝ｉ＋１としてステップＳ２に戻る。Next, the distance between the character j and the next character j + 1 is determined in the sorted order, and it is checked whether the character j is separated from Wa by a certain length or more. Subdivides the character group. At this time, the number N of character groups is increased by one, and the number of the character group after the (i + 1) th is increased by one. Then, characters 1 to j are registered in the i-th character group, and rectangular coordinates including the registered characters are obtained. Next, characters from j + 1 to Nc are newly registered in the (i + 1) th character group. Get the rectangular coordinates containing the registered character.
Then, i is incremented by one, and the subdivision process of the next character group is repeated, and the subdivision process is performed for all the character groups. That is, as shown in FIG. 7, j = 1 and j <
Nc is checked (step S7). If j ≧ Nc, i = i + 1 is set in step S15 and the process returns to step S2. If j <Nc, the character j and the character j +
The interval of 1 is set to W (step S8), and it is checked whether or not the character interval W> the average character interval Wa × [constant value] (step S9). If the above conditions are not satisfied, step S
At 10, j = j + 1 is set and the process returns to step S7. When the character spacing W> the average character spacing Wa × [constant value],
The number of character groups is increased by 1 (step S11), and the numbers of the i + 1th and subsequent character groups are increased by 1. Further, the first to j-th characters are registered in the i-th character group, the number of characters and the rectangular coordinates are calculated (step S13), and j + 1 to j + 1 are registered in the (i + 1) -th character group.
The Nc-th character is registered, and the number of characters and rectangular coordinates are calculated (step S14). When the above processing is completed, step S
At 15, i = i + 1, and the process returns to step S2.

【００１２】次に、前記行抽出部１３における行抽出処
理を、図８のフローチャートにより説明する。図８にお
いて、まず文字群を左から順にソートする（ステップＳ
１）、次いで、θｔｈを傾きの閾値（予め設定され
る）、Ｎｇを文字群数、Ｃを行数（最初は０に設定）と
し（ステップＳ２）、ｉ＝０に設定する（ステップＳ
３）。次に、ｉがＮｇ（文字群数）より小さいかを調
べ、小さくない場合には処理を終了する。ｉがＮｇより
小さければ、Ｃ＝０であるかを調べる（ステップＳ
５）。Ｃ＝０であれば、ステップＳ１０に行きＣに１を
加え、Ｃ番目の行に文字群ｉを登録する（ステップＳ１
１）。これにより、例えば図３（ｃ）の例では、文字群
「合計」が登録される（文字群「合計」が最も左の位置
にあるため）。以上のようにして最初の行が登録され
る。Next, the row extracting process in the row extracting section 13 will be described with reference to the flowchart of FIG. In FIG. 8, first, character groups are sorted in order from the left (step S
1) Next, θth is set as a threshold of inclination (preset), Ng is set as the number of character groups, C is set as the number of lines (set to 0 at first) (step S2), and i is set to 0 (step S2).
3). Next, it is checked whether i is smaller than Ng (the number of character groups), and if not, the process is terminated. If i is smaller than Ng, it is checked whether C = 0 (step S).
5). If C = 0, go to step S10, add 1 to C, and register character group i in the C-th line (step S1).
1). Thereby, for example, in the example of FIG. 3C, the character group “total” is registered (since the character group “total” is at the leftmost position). The first line is registered as described above.

【００１３】次にｉに１を加え（ステップＳ１２）、ス
テップＳ４に戻り、前記したようにその行に存在する文
字群のうち、文字群ｉと横方向に一番近い文字群と文字
群ｉとの傾きを求め、既に抽出した行の中で最小の傾き
を与える行番号（ｇ）と傾き（θｍｉｎ）を記憶する。
すなわち、ステップＳ４において、ｉがＮｇ（文字群
数）より小さいかを調べ、小さい場合には、ステップＳ
５でＣ＝０であるかを調べる。今回はＣ＝０でないの
で、ステップＳ６に行き、ｊ＝１、ｇ＝０（ｇは行番
号）、θｍｉｎを無限大に設定し、ｊ≦Ｃであるかを調
べる（ステップＳ７）。一回目はｊ≦Ｃであるので、ス
テップＳ１３に行き、ｊ行目の文字群のうちの文字群ｉ
に横方向に最も近い文字群と文字群ｉの傾きθを求め
る。ステップＳ１４において、｜θ｜＜θｍｉｎである
かを調べ、｜θ｜＜θｍｉｎであれば、ステップＳ１５
に行き、ｇ＝ｊ、θｍｉｎ＝｜θ｜として、ステップＳ
１６でｊに１を加えてステップＳ７に戻る。また、｜θ
｜＜θｍｉｎなければステップＳ１６に行きｊに１を加
えてステップＳ７に戻る。上記処理を行うことにより、
既に抽出した行の中で最小の傾きを与える行番号（ｇ）
と傾き（θｍｉｎ）が求まる。Next, 1 is added to i (step S12), and the process returns to step S4. As described above, of the character groups existing on the line, the character group closest to the character group i in the horizontal direction and the character group i , And the row number (g) and the slope (θmin) that give the smallest slope among the already extracted rows are stored.
That is, it is determined whether or not i is smaller than Ng (the number of character groups) in step S4.
In step 5, it is checked whether C = 0. Since C is not 0 this time, the procedure goes to step S6, where j = 1, g = 0 (g is a row number), θmin is set to infinity, and it is checked whether j ≦ C (step S7). Since the first time is j ≦ C, the process goes to step S13, where the character group i of the character group on the j-th line
Then, the inclination θ between the character group closest to the horizontal direction and the character group i is obtained. In step S14, it is checked whether | θ | <θmin, and if | θ | <θmin, step S15
And set g = j, θmin = | θ |
In step 16, 1 is added to j, and the process returns to step S7. Also, | θ
If | <θmin, go to step S16, add 1 to j, and return to step S7. By performing the above processing,
Line number (g) that gives the minimum slope among the lines already extracted
And the inclination (θmin) are obtained.

【００１４】次に、上記傾きθｍｉｎが閾値（θｔｈ）
より小さいか調べ、θｍｉｎがθｔｈより大きい場合に
は、新しく行を作成する。また、θｍｉｎが閾値（θｔ
ｈ）より小さい場合には、ｇ行に文字群ｉを登録する。
すなわち、ステップＳ７において、ｊ≦Ｃであるかを調
べ、ｊ≦Ｃでなければ、ステップＳ８に行き、θｍｉｎ
＜θｔｈであるかを調べる。θｍｉｎ＜θｔｈでなけれ
ば、ステップＳ１０に行き、Ｃに１を加えＣ番目の行に
文字群ｉを登録する（ステップＳ１１，１２）。これに
より文字群ｉが新たな行に登録される。また、θｍｉｎ
＜θｔｈであれば、ステップＳ９において、ｇ行に文字
群ｉを登録し、ステップＳ１２でｉに１を加えて、ステ
ップＳ４に戻る。これにより、文字群ｉが既に登録され
ている行に追加される。以上の処理を全ての文字群に対
して行ないうことにより、各行が抽出され、前記図６
（ａ）に示したように各行と各文字群が対応付けられ
る。Next, the inclination θmin is a threshold (θth)
A check is made to see if the value is smaller than θth, and if θmin is greater than θth, a new row is created. Also, θmin is a threshold (θt
h) If smaller, the character group i is registered in the g line.
That is, in step S7, it is checked whether j ≦ C, and if not j ≦ C, the process goes to step S8 and θmin
Check whether <θth. If θmin <θth is not satisfied, the process proceeds to step S10, where 1 is added to C, and the character group i is registered in the Cth line (steps S11 and S12). As a result, the character group i is registered on a new line. Also, θmin
If <θth, the character group i is registered in the g-th line in step S9, 1 is added to i in step S12, and the process returns to step S4. Thereby, the character group i is added to the already registered line. By performing the above processing for all the character groups, each line is extracted, and FIG.
Each line is associated with each character group as shown in FIG.

【００１５】図９に本発明の第２の実施例の処理ブロッ
ク図を示す。本実施例は、図１０に示すように罫線が大
幅に省略された表を認識する場合の実施例を示してお
り、前記第１の実施例の図２のブロック図に罫線抽出部
１６、罫線除去部１７を追加したものである。図９にお
いて、まず、罫線抽出部１６で表の領域から罫線を抽出
し、その長さ、位置等の罫線情報を得る。罫線抽出処理
としては特開平９−５０５２７記載の公知の方法が利用
できる。次いで、罫線除去部１７で画像上から罫線を構
成する画素を消去する。以下の処理は前記第１の実施例
と同じであり、文字群抽出部１１で文字群を囲む矩形座
標を求める。次に、文字群再分割部１２で前記図７のフ
ローチャートで説明したように文字群再分割処理を行な
う。次いで、行抽出部１３において行抽出処理を行な
う。行抽出部１３では、前述した前記図８のフローチャ
ートで説明したように、文字群間の傾きを考慮して行を
抽出する。FIG. 9 is a processing block diagram of a second embodiment of the present invention. This embodiment shows an embodiment for recognizing a table in which ruled lines are largely omitted as shown in FIG. 10. The ruled line extracting unit 16 and the ruled line extracting unit 16 shown in the block diagram of FIG. The removal unit 17 is added. In FIG. 9, first, a ruled line extracting unit 16 extracts ruled lines from a table area, and obtains ruled line information such as its length and position. As the ruled line extraction processing, a known method described in JP-A-9-50527 can be used. Next, the pixels forming the ruled line are erased from the image by the ruled line removing unit 17. The following processing is the same as in the first embodiment, and the character group extracting unit 11 obtains rectangular coordinates surrounding the character group. Next, the character group subdivision unit 12 performs the character group subdivision processing as described in the flowchart of FIG. Next, the row extracting unit 13 performs a row extracting process. As described in the flowchart of FIG. 8 described above, the line extracting unit 13 extracts a line in consideration of the inclination between the character groups.

【００１６】次に列抽出部１４において列抽出を行な
う。表データ作成部１５では、各文字群の文字認識を行
い、各列、各行に文字群を登録する。これにより、前記
図６（ｃ）に示したように表に書き込み可能なデータが
作成される。そして、行と列の関係からセルデータを作
成し、セルデータに文字群を登録する。以上のように本
実施例では、一部に罫線の残っている表に対しても表が
認識できる。なお、認識した表のセルのそれぞれについ
て上下左右方向に罫線が近接する場合には、罫線情報を
セルに付加することにより、ワードプロセッサなどのア
プリケーションに表を再現する際には、罫線も含めて表
を再現することができる。Next, a column is extracted in a column extracting section 14. The table data creation unit 15 performs character recognition for each character group, and registers the character group in each column and each row. As a result, data that can be written to the table is created as shown in FIG. Then, cell data is created from the relationship between rows and columns, and a character group is registered in the cell data. As described above, in the present embodiment, a table can be recognized even for a table in which ruled lines remain partially. When ruled lines approach each other in the recognized table cells in the up, down, left, and right directions, by adding ruled line information to the cells, when reproducing the table in an application such as a word processor, the table including the ruled lines is displayed. Can be reproduced.

【００１７】[0017]

【発明の効果】以上説明したように本発明においては、
以下の効果を得ることができる。（１）一定の文字間隔の文字を統合して文字群を求める
従来方法では分離することが困難だった文字群を精度よ
く分離することが可能となる。このため、罫線のない表
を精度よく認識する事が可能となる。（２）文字群同士の位置関係が一定の傾き以内の文字群
を行として登録しているので、表が傾いている場合でも
精度よく表を認識することができる。（３）罫線を除去して表を認識することにより、罫線が
一部省かれた表であっても正確に表を認識することがで
きる。As described above, in the present invention,
The following effects can be obtained. (1) It is possible to accurately separate a character group that was difficult to separate by the conventional method of obtaining a character group by integrating characters at a fixed character interval. For this reason, it is possible to accurately recognize a table without ruled lines. (2) Since a character group in which the positional relationship between the character groups is within a certain inclination is registered as a line, the table can be recognized with high accuracy even when the table is inclined. (3) By recognizing the table by removing the ruled lines, the table can be accurately recognized even if the ruled lines are partially omitted.

[Brief description of the drawings]

【図１】本発明の概要を説明する図である。FIG. 1 is a diagram illustrating an outline of the present invention.

【図２】本発明の第１の実施例の処理ブロック図であ
る。FIG. 2 is a processing block diagram of the first embodiment of the present invention.

【図３】本発明の実施例の認識対象となる表の一例を示
す図である。FIG. 3 is a diagram illustrating an example of a table to be recognized in an embodiment of the present invention.

【図４】文字群間の傾き抽出方法を示す概念図である。FIG. 4 is a conceptual diagram illustrating a method of extracting a tilt between character groups.

【図５】本発明の実施例の列抽出処理の概念図である。FIG. 5 is a conceptual diagram of a column extraction process according to the embodiment of the present invention.

【図６】行抽出結果と列抽出結果と、これらから得られ
た表に書き込み可能なデータの一例を示す図である。FIG. 6 is a diagram illustrating an example of a row extraction result and a column extraction result, and data that can be written in a table obtained from the result.

【図７】本発明の実施例の文字群再分割処理のフローチ
ャートである。FIG. 7 is a flowchart of a character group subdivision process according to the embodiment of the present invention.

【図８】本発明の実施例の行抽出処理のフローチャート
である。FIG. 8 is a flowchart of a row extracting process according to the embodiment of the present invention.

【図９】本発明の第２の実施例の処理ブロック図であ
る。FIG. 9 is a processing block diagram of a second embodiment of the present invention.

【図１０】罫線が大幅に省略された表の一例を示す図で
ある。FIG. 10 is a diagram showing an example of a table in which ruled lines are largely omitted.

【図１１】従来方法で構造を認識できる表とできない表
の一例を示す図である。FIG. 11 is a diagram showing an example of a table whose structure can be recognized by a conventional method and an example of a table whose structure cannot be recognized.

[Explanation of symbols]

１１文字群抽出部１２文字群再分割部１３行抽出部１４列抽出部１５表データ作成部１６罫線抽出部１７罫線除去部 11 Character Group Extraction Unit 12 Character Group Subdivision Unit 13 Row Extraction Unit 14 Column Extraction Unit 15 Table Data Creation Unit 16 Rule Line Extraction Unit 17 Rule Line Removal Unit

Claims

[Claims]

1. A method for recognizing a table structure from a table image without ruled lines, wherein a character group is obtained by integrating characters having a fixed character spacing within the table, and a character spacing of characters in the character group is obtained. The character group is subdivided based on the average value, and the character group in which the positional relationship between the subdivided character groups is within a certain slope is registered as a line. A table recognition method characterized by registering, as a column, a character group having a relationship in which horizontal coordinates of the character groups overlap each other.

2. The table recognition method according to claim 1, wherein the table recognition according to claim 1 is performed after removing a ruled line included in the image.

3. A program for recognizing a table from a table image having no ruled lines, wherein the program obtains a character group by integrating characters in the table with a character spacing of less than a predetermined value. A process of re-dividing the character group based on the average value of the character spacing of the character group, and registering a character group in which the positional relationship between the re-divided character groups is within a certain slope as a line, A table recognition program for causing a computer to execute a process of registering, as a column, a character group having a relationship in which the horizontal coordinates of the subdivided character groups overlap each other.

4. A program for recognizing a table structure according to claim 3, wherein a ruled line included in the image is removed before performing the table structure recognition.

5. A recording medium recording a program for recognizing a table from a table image without ruled lines, wherein the program integrates characters in the table with a character interval within a certain range to obtain a character group. An average value of the character spacing of the characters in the character group is obtained, the character group is subdivided based on the average value, and a character group in which the positional relationship between the subdivided character groups is within a certain inclination is registered as a line. A recording medium storing a table recognition program, wherein a character group having a relationship in which the horizontal coordinates of the subdivided character groups overlap each other is registered as a column.

6. A recording medium storing a table structure recognition program according to claim 3, wherein the ruled line included in the image is removed before the table structure recognition is performed.