JP2001331763A - Method for recognizing table - Google Patents

Method for recognizing table

Info

Publication number
JP2001331763A
JP2001331763A JP2001064970A JP2001064970A JP2001331763A JP 2001331763 A JP2001331763 A JP 2001331763A JP 2001064970 A JP2001064970 A JP 2001064970A JP 2001064970 A JP2001064970 A JP 2001064970A JP 2001331763 A JP2001331763 A JP 2001331763A
Authority
JP
Japan
Prior art keywords
character
character group
characters
group
groups
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2001064970A
Other languages
Japanese (ja)
Other versions
JP3904397B2 (en
Inventor
Junji Kashioka
潤二 柏岡
Satoshi Naoi
聡 直井
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to JP2001064970A priority Critical patent/JP3904397B2/en
Publication of JP2001331763A publication Critical patent/JP2001331763A/en
Application granted granted Critical
Publication of JP3904397B2 publication Critical patent/JP3904397B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Character Input (AREA)
  • Character Discrimination (AREA)

Abstract

PROBLEM TO BE SOLVED: To precisely separate a group of characters in a tubular document having no ruled lines (or where ruled lines are deleted remarkably) and to precisely recognize table structure even when a table is tilted. SOLUTION: Characters whose pitches are within a fixed value in a table are integrated to obtain the group of the characters and the average of the pitches of the characters in the group of the characters is obtained to re-divide the group of the characters based on the average. Next, a group of characters where the mutual positional relation between the re-divided groups of the characters is within fixed inclination are registered as a row. The groups of characters having a relation where coordinates in the horizontal direction of the re-divided groups of the characters are overlapped with each other are registered as a column. Then, the groups of the characters of respective rows and respective columns obtained like above are recognized to prepare data which can be read into a table.

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【0001】[0001]

【発明の属する技術分野】近年、入力周辺機器として文
書認識処理の需要が増加している。本発明は、この文書
認識処理において、より高い精度で罫線のない表を認識
することができる表認識方法に関する。
BACKGROUND OF THE INVENTION In recent years, demand for document recognition processing as input peripheral equipment has been increasing. The present invention relates to a table recognition method capable of recognizing a table without ruled lines with higher accuracy in the document recognition processing.

【0002】[0002]

【従来の技術】表形式文書の文字認識結果を例えば表計
算ソフト等で使用する場合には、表構造を認識し、表中
の文字群が何列目の何行目に記録された文字群であるか
を認識する必要がある。罫線がある表形式文書の場合に
は、通常、罫線構造を認識し、罫線構造から各行、各列
の文字群を識別していた。一方、図11(a)のような
罫線のない表形式文書を認識する場合には、従来、文字
サイズの平均値、文字ピッチ等の平均値より文字間隔を
定め、該一定の文字間隔の文字を統合して文字群を求
め、文字群の縦方向、横方向それぞれの座標が重なる関
係にある場合に、行、列として登録する処理が用いられ
ていた。
2. Description of the Related Art When a character recognition result of a tabular document is used in, for example, spreadsheet software, a table structure is recognized, and a character group recorded in a column and a row of a character group in the table is recognized. Needs to be recognized. In the case of a table format document having ruled lines, the ruled line structure is usually recognized, and the character group of each row and each column is identified from the ruled line structure. On the other hand, when recognizing a tabular document having no ruled lines as shown in FIG. 11A, conventionally, the character spacing is determined based on the average value of the character size, the character pitch, and the like. Has been used to obtain a character group, and when the vertical and horizontal coordinates of the character group overlap each other, the character group is registered as a row or a column.

【0003】[0003]

【発明が解決しようとする課題】図11(a)の表では
上記のように一定の文字間隔の文字を統合することによ
り文字群を正確に推定することができるが、図11
(b)のような表では、上記一定の文字間隔で文字群を
推定すると、図中領域のように本来複数の文字群とな
るべきものが一つの文字群として認識される場合があっ
た。また、図11(c)のように表が傾いていた場合に
は、行間隔が狭いと複数の行が一つの行として認識され
る場合があった。本発明は、上記事情に鑑みなされたも
のであって、本発明は、罫線がないもしくは罫線が大幅
に削除されている表形式文書における表認識において、
従来の方法では分離することができなかった文字群を精
度よく分離することができ、また表が傾いている場合で
あっても精度良く表を認識することができる表認識方法
を提供するこを目的とする。
In the table of FIG. 11 (a), a character group can be accurately estimated by integrating characters at a fixed character interval as described above.
In a table such as that shown in (b), when character groups are estimated at the above-mentioned constant character interval, a character group that should originally be a plurality of character groups, such as an area in the drawing, may be recognized as one character group. When the table is inclined as shown in FIG. 11C, a plurality of rows may be recognized as one row if the row interval is small. The present invention has been made in view of the above circumstances, and the present invention provides a method for recognizing a table in a tabular document in which there is no ruled line or the ruled line is largely deleted.
Provided is a table recognition method that can accurately separate a character group that cannot be separated by the conventional method, and that can accurately recognize a table even when the table is inclined. Aim.

【0004】[0004]

【課題を解決するための手段】図1は本発明の概要を説
明する図である。同図に示すように本発明においては、
次の〜のようにして表を認識する。 表内の文字間隔が一定以内の文字を統合して文字群
を求める。 上記文字群内の文字の文字間隔の平均値を求めて、
該平均値に基づき上記文字群を再分割する。 上記再分割した文字群同士の位置関係が一定の傾き
以内の文字群を行として登録する。 上記再分割した文字群同士の横方向の座標が重なる
関係をもつ文字群を列として登録する。 また、罫線が一部省かれた表を認識する場合には、画像
中に含まれる罫線を除去してから、上記〜の表認識
を行う。本発明においては、上記のように表を認識して
いるので、表中に他の文字群の間隔より、間隔の狭い文
字群が混在していても、これらの文字群を複数の文字群
として正確に認識することができる。また、傾きのある
表に対しても正確に表のを認識することができる。
FIG. 1 is a diagram for explaining the outline of the present invention. As shown in FIG.
The table is recognized as follows. A character group is obtained by integrating characters in a table with a character spacing within a certain range. Find the average value of the character spacing of the characters in the above character group,
The character group is subdivided based on the average value. A character group in which the positional relationship between the subdivided character groups is within a certain inclination is registered as a line. A character group having a relationship in which the horizontal coordinates of the subdivided character groups overlap each other is registered as a column. When recognizing a table in which the ruled lines are partially omitted, the ruled line included in the image is removed, and then the above table recognition is performed. In the present invention, since the table is recognized as described above, even if a character group with a narrower interval is mixed in the table than the space between other character groups, these character groups are used as a plurality of character groups. Can be accurately recognized. In addition, even a table having a slope can be recognized accurately.

【0005】[0005]

【発明の実施の形態】図2に本発明の第1の実施例の処
理ブロック図を示す。なお、本発明は、CPU、メモ
リ、外部記憶装置、入出力装置、画像読み取りを行うス
キャナ、記録媒体読み取り装置、通信インタフェース等
を備えた通常の計算機システムにより実現することがで
き、上記外部記憶装置等に本発明の文字認識処理を行う
プログラムが格納され、実行時、上記プログラムがメモ
リに読み込まれ、スキャナ等で読み取った画像につい
て、本発明の文字認識処理による文字認識が行われ、文
字認識結果が上記入出力装置から出力される。
FIG. 2 is a processing block diagram of a first embodiment of the present invention. Note that the present invention can be realized by a normal computer system including a CPU, a memory, an external storage device, an input / output device, a scanner for reading an image, a recording medium reading device, a communication interface, and the like. The program for performing the character recognition process of the present invention is stored in the memory. When the program is executed, the program is read into a memory, and the image read by the scanner or the like is subjected to character recognition by the character recognition process of the present invention. Is output from the input / output device.

【0006】次に本発明の第1の実施例について説明す
る。本実施例は例えば、前記図11(b)のように、罫
線がない表形式文書の認識に適用される。本実施例で
は、図2に示すように、まず、文字群抽出部11で文字
群を囲む矩形座標を求める。すなわち、罫線のない表と
して与えられた範囲に対して、文字を形成する画素の連
結性より連結成分を囲む矩形を抽出し、更に矩形間の重
なりや位置関係から矩形を統合して文字を推定し、文字
を囲む矩形を抽出する。次いで、抽出された各文字を矩
形の座標から、文字と文字との間の長さ(文字間隔)が
一定の閾値以内にあるものを統合して文字群を抽出し、
文字群を囲む矩形座標を求める。図3(a)に示す罫線
のない表から上記文字群抽出部11により抽出された文
字群を図3(b)に示す。同図に示すように、「(1
月)〜(5月)」は文字間隔が短いので一つの文字群と
して抽出されている。
Next, a first embodiment of the present invention will be described. This embodiment is applied to, for example, recognition of a tabular document having no ruled line as shown in FIG. In this embodiment, as shown in FIG. 2, first, the character group extracting unit 11 obtains rectangular coordinates surrounding the character group. In other words, for a range given as a table without ruled lines, rectangles surrounding connected components are extracted from the connectivity of the pixels forming the characters, and then the rectangles are integrated from the overlap and positional relationship between the rectangles to estimate the characters. Then, a rectangle surrounding the character is extracted. Next, a character group is extracted by integrating the extracted characters from the coordinates of the rectangle by integrating the characters having a length (character interval) between the characters within a certain threshold value,
Find the rectangular coordinates surrounding the character group. FIG. 3B shows a character group extracted from the table without ruled lines shown in FIG. As shown in FIG.
"Month) to (May)" are extracted as one character group because the character interval is short.

【0007】次に、文字群再分割部12で文字群再分割
処理を行なう。すなわち、後述する図7のフローチャー
トに示すように、文字群抽出部11で抽出した文字群を
一つずつ取り出し、隣接する文字同士の文字間隔を求め
てその平均Waを求める。そして、各文字がWaに対し
て一定以上の長さで離れているかを調べ、該当する場合
には、文字群を再分割する。図3(a)の表に対して、
文字群再分割処理で抽出された文字群を図3(c)に示
す。同図に示すように、「(1月)〜(5月)」はそれ
ぞれ文字群として再分割されている。次いで、行抽出部
13において行抽出処理を行なう。行抽出部13では、
後述する図8のフローチャートに示すように、文字群間
の傾きを考慮して行を抽出する。すなわち、まず文字群
を左から順にソートし、文字群をソートした順に一つ取
り出し番号をiとする。この時点で行が無い場合には新
規に行を作成し、文字群iをその行に登録する。例えば
図3(c)の例では「合計」が最も左にあるので、新規
に行が登録される。
Next, the character group subdivision section 12 performs a character group subdivision process. That is, as shown in the flowchart of FIG. 7 described later, the character groups extracted by the character group extraction unit 11 are extracted one by one, the character spacing between adjacent characters is calculated, and the average Wa is calculated. Then, it is checked whether each character is apart from Wa by a certain length or more, and if so, the character group is subdivided. With respect to the table of FIG.
FIG. 3C shows a character group extracted by the character group subdivision process. As shown in the figure, “(January) to (May)” are subdivided as character groups. Next, the row extracting unit 13 performs a row extracting process. In the row extraction unit 13,
As shown in the flowchart of FIG. 8 described later, lines are extracted in consideration of the inclination between the character groups. That is, first, the character groups are sorted in order from the left, and one extracted number is set to i in the order in which the character groups are sorted. If there is no line at this point, a new line is created and the character group i is registered in that line. For example, in the example of FIG. 3C, “sum” is at the leftmost position, so a new row is registered.

【0008】上記のようにして行が登録されると、その
行に存在する文字群のうち、文字群iと横方向に一番近
い文字群と文字群iとの傾きを求め、既に抽出した行の
中で最小の傾きを与える行番号(g)と傾き(θmi
n)を記憶する。ここで傾きは図4に示すように文字群
を囲む矩形座標から中心点をそれぞれ求めて、中心点間
を結ぶ線が水平線となす傾きを採用すればよい。次に、
上記傾きθminが閾値(θth)より小さいか調べ、
θminがθthより大きい場合には、新しく行を作成
する。また、θminが閾値(θth)より小さい場合
には、g行に文字群iを登録する。この処理を全ての文
字群に対して行ない、行抽出を完了する。以上のように
して行を抽出することにより、例えば図6(a)に示す
ように、行と文字群が対応付けられる。
When the line is registered as described above, the inclination between the character group i and the character group i closest to the character group i in the horizontal direction among the character groups existing in the line is obtained and already extracted. The row number (g) giving the minimum slope and the slope (θmi
n) is stored. Here, as the inclination, as shown in FIG. 4, a center point may be obtained from the rectangular coordinates surrounding the character group, and a line connecting the center points with a horizontal line may be used. next,
Check whether the inclination θmin is smaller than a threshold value (θth),
If θmin is larger than θth, a new row is created. If θmin is smaller than the threshold (θth), the character group i is registered in the g-th line. This process is performed for all the character groups, and the line extraction is completed. By extracting the lines as described above, the lines and the character groups are associated with each other, for example, as shown in FIG.

【0009】次に列抽出部14において列抽出を行な
う。列抽出は、例えば、横方向の座標が重なる関係をも
つ文字群を列として登録する。すなわち、横方向をX
軸、縦方向をY軸としたとき、文字群を囲む矩形のX軸
方向の座標値が重なる文字群(X軸方向の座標値の少な
くとも一部が同じ値である文字群)を列として登録す
る。例えば、前記図3(c)の例では、図5の破線で囲
まれた領域内の文字群のX軸方向の座標値が重なってい
るので各領域内の文字群は同じ列に登録される。これに
より、例えば図6(b)に示すように列と文字群が対応
付けられる。表データ作成部15では、各文字群の文字
認識を行い、行と列の関係からセルデータを作成し、セ
ルデータに文字群を登録する。すなわち、文字群の認識
結果と、図6(a)に示す行抽出結果、図6(b)に示
す列抽出結果から、図6(c)に示すような表に書き込
み可能なデータが作成される。
Next, a column extraction section 14 performs column extraction. In the column extraction, for example, a character group having a relationship in which the coordinates in the horizontal direction overlap is registered as a column. That is, the horizontal direction is X
When the axis and the vertical direction are the Y axis, a character group in which the coordinate values in the X axis direction of the rectangle surrounding the character group overlap (a character group in which at least a part of the coordinate values in the X axis direction have the same value) is registered as a column. I do. For example, in the example of FIG. 3C, since the coordinate values in the X-axis direction of the character groups in the area surrounded by the broken line in FIG. 5 overlap, the character groups in each area are registered in the same column. . Thereby, for example, as shown in FIG. 6B, the columns are associated with the character groups. The table data creation unit 15 performs character recognition for each character group, creates cell data from the relationship between rows and columns, and registers the character group in the cell data. That is, data that can be written in a table as shown in FIG. 6C is created from the character group recognition result, the row extraction result shown in FIG. 6A, and the column extraction result shown in FIG. 6B. You.

【0010】次に、前記文字群再分割部12における文
字群再分割処理を、図7のフローチャートにより説明す
る。まず、文字群抽出部11で抽出した文字群を一つず
つ取り出し、取り出した文字群iに含まれる文字(個
数:Nc)をその座標に基づいて左から順にソートす
る。そして隣接する文字同士の文字間隔を求めて、その
平均Waを得る。すなわち、図7に示すように、文字群
数をNgに設定し、i=1とする(ステップS1)。つ
いで、i≦Ngであるかを調べ、i>Ngの場合は処理
を終了する。また、i≦Ngの場合には、Ncをi番目
の文字群の文字数とする(ステップS3)。そして、i
番目の文字群に含まれる文字を左から順にソートする
(ステップS4)。ついで、Waをi番目の文字の文字
群の平均文字間隔に設定する(ステップS5)。
Next, the character group subdivision processing in the character group subdivision section 12 will be described with reference to the flowchart of FIG. First, the character groups extracted by the character group extraction unit 11 are extracted one by one, and the characters (number: Nc) included in the extracted character group i are sorted in order from the left based on the coordinates. Then, the character interval between adjacent characters is obtained, and the average Wa is obtained. That is, as shown in FIG. 7, the number of character groups is set to Ng, and i = 1 (step S1). Then, it is checked whether i ≦ Ng, and if i> Ng, the process is terminated. If i ≦ Ng, Nc is set to the number of characters of the i-th character group (step S3). And i
The characters included in the th character group are sorted in order from the left (step S4). Next, Wa is set to the average character spacing of the character group of the i-th character (step S5).

【0011】次いで、ソートした順に文字jと次の文字
j+1との間隔を求め、Waに対して一定以上の長さで
離れているかを調べ、該当する場合にはその文字jと文
字j+1の間で文字群を再分割する。この際に文字群数
Nを一つ増やし、i+1番目以降の文字群の番号を一つ
増やす。そしてi番目の文字群には1〜jまでの文字を
登録し、登録した文字を包含する矩形座標を得る。次い
で、i+1番目の文字群に新たにj+1〜Ncまでの文
字を登録し.登録した文字を包含する矩形座標を得る。
そしてiを一つ増やし、次の文字群の再分割処理を繰り
返していき、全ての文字群に対して再分割処理を行な
う。すなわち、図7に示すように、j=1として、j<
Ncであるかを調べ(ステップS7)、j≧Ncの場合
には、ステップS15においてi=i+1としてステッ
プS2に戻り、j<Ncの場合には、文字jと文字j+
1の間隔をWとして(ステップS8)、文字間隔W>平
均文字間隔Wa×〔一定値〕であるかを調べる(ステッ
プS9)。上記条件を満たさない場合には、ステップS
10でj=j+1としてステップS7に戻る。また、文
字間隔W>平均文字間隔Wa×〔一定値〕の場合には、
文字群数を1増やし(ステップS11)、i+1番目以
降の文字群の番号を1増やす。さらに、i番目の文字群
に1〜j番目の文字を登録し、文字数、矩形座標を算出
し(ステップS13)、i+1番目の文字群にj+1〜
Nc番目の文字を登録し、文字数、矩形座標を算出する
(ステップS14)。以上の処理が終わるとステップS
15でi=i+1としてステップS2に戻る。
Next, the distance between the character j and the next character j + 1 is determined in the sorted order, and it is checked whether the character j is separated from Wa by a certain length or more. Subdivides the character group. At this time, the number N of character groups is increased by one, and the number of the character group after the (i + 1) th is increased by one. Then, characters 1 to j are registered in the i-th character group, and rectangular coordinates including the registered characters are obtained. Next, characters from j + 1 to Nc are newly registered in the (i + 1) th character group. Get the rectangular coordinates containing the registered character.
Then, i is incremented by one, and the subdivision process of the next character group is repeated, and the subdivision process is performed for all the character groups. That is, as shown in FIG. 7, j = 1 and j <
Nc is checked (step S7). If j ≧ Nc, i = i + 1 is set in step S15 and the process returns to step S2. If j <Nc, the character j and the character j +
The interval of 1 is set to W (step S8), and it is checked whether or not the character interval W> the average character interval Wa × [constant value] (step S9). If the above conditions are not satisfied, step S
At 10, j = j + 1 is set and the process returns to step S7. When the character spacing W> the average character spacing Wa × [constant value],
The number of character groups is increased by 1 (step S11), and the numbers of the i + 1th and subsequent character groups are increased by 1. Further, the first to j-th characters are registered in the i-th character group, the number of characters and the rectangular coordinates are calculated (step S13), and j + 1 to j + 1 are registered in the (i + 1) -th character group.
The Nc-th character is registered, and the number of characters and rectangular coordinates are calculated (step S14). When the above processing is completed, step S
At 15, i = i + 1, and the process returns to step S2.

【0012】次に、前記行抽出部13における行抽出処
理を、図8のフローチャートにより説明する。図8にお
いて、まず文字群を左から順にソートする(ステップS
1)、次いで、θthを傾きの閾値(予め設定され
る)、Ngを文字群数、Cを行数(最初は0に設定)と
し(ステップS2)、i=0に設定する(ステップS
3)。次に、iがNg(文字群数)より小さいかを調
べ、小さくない場合には処理を終了する。iがNgより
小さければ、C=0であるかを調べる(ステップS
5)。C=0であれば、ステップS10に行きCに1を
加え、C番目の行に文字群iを登録する(ステップS1
1)。これにより、例えば図3(c)の例では、文字群
「合計」が登録される(文字群「合計」が最も左の位置
にあるため)。以上のようにして最初の行が登録され
る。
Next, the row extracting process in the row extracting section 13 will be described with reference to the flowchart of FIG. In FIG. 8, first, character groups are sorted in order from the left (step S
1) Next, θth is set as a threshold of inclination (preset), Ng is set as the number of character groups, C is set as the number of lines (set to 0 at first) (step S2), and i is set to 0 (step S2).
3). Next, it is checked whether i is smaller than Ng (the number of character groups), and if not, the process is terminated. If i is smaller than Ng, it is checked whether C = 0 (step S).
5). If C = 0, go to step S10, add 1 to C, and register character group i in the C-th line (step S1).
1). Thereby, for example, in the example of FIG. 3C, the character group “total” is registered (since the character group “total” is at the leftmost position). The first line is registered as described above.

【0013】次にiに1を加え(ステップS12)、ス
テップS4に戻り、前記したようにその行に存在する文
字群のうち、文字群iと横方向に一番近い文字群と文字
群iとの傾きを求め、既に抽出した行の中で最小の傾き
を与える行番号(g)と傾き(θmin)を記憶する。
すなわち、ステップS4において、iがNg(文字群
数)より小さいかを調べ、小さい場合には、ステップS
5でC=0であるかを調べる。今回はC=0でないの
で、ステップS6に行き、j=1、g=0(gは行番
号)、θminを無限大に設定し、j≦Cであるかを調
べる(ステップS7)。一回目はj≦Cであるので、ス
テップS13に行き、j行目の文字群のうちの文字群i
に横方向に最も近い文字群と文字群iの傾きθを求め
る。ステップS14において、|θ|<θminである
かを調べ、|θ|<θminであれば、ステップS15
に行き、g=j、θmin=|θ|として、ステップS
16でjに1を加えてステップS7に戻る。また、|θ
|<θminなければステップS16に行きjに1を加
えてステップS7に戻る。上記処理を行うことにより、
既に抽出した行の中で最小の傾きを与える行番号(g)
と傾き(θmin)が求まる。
Next, 1 is added to i (step S12), and the process returns to step S4. As described above, of the character groups existing on the line, the character group closest to the character group i in the horizontal direction and the character group i , And the row number (g) and the slope (θmin) that give the smallest slope among the already extracted rows are stored.
That is, it is determined whether or not i is smaller than Ng (the number of character groups) in step S4.
In step 5, it is checked whether C = 0. Since C is not 0 this time, the procedure goes to step S6, where j = 1, g = 0 (g is a row number), θmin is set to infinity, and it is checked whether j ≦ C (step S7). Since the first time is j ≦ C, the process goes to step S13, where the character group i of the character group on the j-th line
Then, the inclination θ between the character group closest to the horizontal direction and the character group i is obtained. In step S14, it is checked whether | θ | <θmin, and if | θ | <θmin, step S15
And set g = j, θmin = | θ |
In step 16, 1 is added to j, and the process returns to step S7. Also, | θ
If | <θmin, go to step S16, add 1 to j, and return to step S7. By performing the above processing,
Line number (g) that gives the minimum slope among the lines already extracted
And the inclination (θmin) are obtained.

【0014】次に、上記傾きθminが閾値(θth)
より小さいか調べ、θminがθthより大きい場合に
は、新しく行を作成する。また、θminが閾値(θt
h)より小さい場合には、g行に文字群iを登録する。
すなわち、ステップS7において、j≦Cであるかを調
べ、j≦Cでなければ、ステップS8に行き、θmin
<θthであるかを調べる。θmin<θthでなけれ
ば、ステップS10に行き、Cに1を加えC番目の行に
文字群iを登録する(ステップS11,12)。これに
より文字群iが新たな行に登録される。また、θmin
<θthであれば、ステップS9において、g行に文字
群iを登録し、ステップS12でiに1を加えて、ステ
ップS4に戻る。これにより、文字群iが既に登録され
ている行に追加される。以上の処理を全ての文字群に対
して行ないうことにより、各行が抽出され、前記図6
(a)に示したように各行と各文字群が対応付けられ
る。
Next, the inclination θmin is a threshold (θth)
A check is made to see if the value is smaller than θth, and if θmin is greater than θth, a new row is created. Also, θmin is a threshold (θt
h) If smaller, the character group i is registered in the g line.
That is, in step S7, it is checked whether j ≦ C, and if not j ≦ C, the process goes to step S8 and θmin
Check whether <θth. If θmin <θth is not satisfied, the process proceeds to step S10, where 1 is added to C, and the character group i is registered in the Cth line (steps S11 and S12). As a result, the character group i is registered on a new line. Also, θmin
If <θth, the character group i is registered in the g-th line in step S9, 1 is added to i in step S12, and the process returns to step S4. Thereby, the character group i is added to the already registered line. By performing the above processing for all the character groups, each line is extracted, and FIG.
Each line is associated with each character group as shown in FIG.

【0015】図9に本発明の第2の実施例の処理ブロッ
ク図を示す。本実施例は、図10に示すように罫線が大
幅に省略された表を認識する場合の実施例を示してお
り、前記第1の実施例の図2のブロック図に罫線抽出部
16、罫線除去部17を追加したものである。図9にお
いて、まず、罫線抽出部16で表の領域から罫線を抽出
し、その長さ、位置等の罫線情報を得る。罫線抽出処理
としては特開平9−50527記載の公知の方法が利用
できる。次いで、罫線除去部17で画像上から罫線を構
成する画素を消去する。以下の処理は前記第1の実施例
と同じであり、文字群抽出部11で文字群を囲む矩形座
標を求める。次に、文字群再分割部12で前記図7のフ
ローチャートで説明したように文字群再分割処理を行な
う。次いで、行抽出部13において行抽出処理を行な
う。行抽出部13では、前述した前記図8のフローチャ
ートで説明したように、文字群間の傾きを考慮して行を
抽出する。
FIG. 9 is a processing block diagram of a second embodiment of the present invention. This embodiment shows an embodiment for recognizing a table in which ruled lines are largely omitted as shown in FIG. 10. The ruled line extracting unit 16 and the ruled line extracting unit 16 shown in the block diagram of FIG. The removal unit 17 is added. In FIG. 9, first, a ruled line extracting unit 16 extracts ruled lines from a table area, and obtains ruled line information such as its length and position. As the ruled line extraction processing, a known method described in JP-A-9-50527 can be used. Next, the pixels forming the ruled line are erased from the image by the ruled line removing unit 17. The following processing is the same as in the first embodiment, and the character group extracting unit 11 obtains rectangular coordinates surrounding the character group. Next, the character group subdivision unit 12 performs the character group subdivision processing as described in the flowchart of FIG. Next, the row extracting unit 13 performs a row extracting process. As described in the flowchart of FIG. 8 described above, the line extracting unit 13 extracts a line in consideration of the inclination between the character groups.

【0016】次に列抽出部14において列抽出を行な
う。表データ作成部15では、各文字群の文字認識を行
い、各列、各行に文字群を登録する。これにより、前記
図6(c)に示したように表に書き込み可能なデータが
作成される。そして、行と列の関係からセルデータを作
成し、セルデータに文字群を登録する。以上のように本
実施例では、一部に罫線の残っている表に対しても表が
認識できる。なお、認識した表のセルのそれぞれについ
て上下左右方向に罫線が近接する場合には、罫線情報を
セルに付加することにより、ワードプロセッサなどのア
プリケーションに表を再現する際には、罫線も含めて表
を再現することができる。
Next, a column is extracted in a column extracting section 14. The table data creation unit 15 performs character recognition for each character group, and registers the character group in each column and each row. As a result, data that can be written to the table is created as shown in FIG. Then, cell data is created from the relationship between rows and columns, and a character group is registered in the cell data. As described above, in the present embodiment, a table can be recognized even for a table in which ruled lines remain partially. When ruled lines approach each other in the recognized table cells in the up, down, left, and right directions, by adding ruled line information to the cells, when reproducing the table in an application such as a word processor, the table including the ruled lines is displayed. Can be reproduced.

【0017】[0017]

【発明の効果】以上説明したように本発明においては、
以下の効果を得ることができる。 (1)一定の文字間隔の文字を統合して文字群を求める
従来方法では分離することが困難だった文字群を精度よ
く分離することが可能となる。このため、罫線のない表
を精度よく認識する事が可能となる。 (2)文字群同士の位置関係が一定の傾き以内の文字群
を行として登録しているので、表が傾いている場合でも
精度よく表を認識することができる。 (3)罫線を除去して表を認識することにより、罫線が
一部省かれた表であっても正確に表を認識することがで
きる。
As described above, in the present invention,
The following effects can be obtained. (1) It is possible to accurately separate a character group that was difficult to separate by the conventional method of obtaining a character group by integrating characters at a fixed character interval. For this reason, it is possible to accurately recognize a table without ruled lines. (2) Since a character group in which the positional relationship between the character groups is within a certain inclination is registered as a line, the table can be recognized with high accuracy even when the table is inclined. (3) By recognizing the table by removing the ruled lines, the table can be accurately recognized even if the ruled lines are partially omitted.

【図面の簡単な説明】[Brief description of the drawings]

【図1】本発明の概要を説明する図である。FIG. 1 is a diagram illustrating an outline of the present invention.

【図2】本発明の第1の実施例の処理ブロック図であ
る。
FIG. 2 is a processing block diagram of the first embodiment of the present invention.

【図3】本発明の実施例の認識対象となる表の一例を示
す図である。
FIG. 3 is a diagram illustrating an example of a table to be recognized in an embodiment of the present invention.

【図4】文字群間の傾き抽出方法を示す概念図である。FIG. 4 is a conceptual diagram illustrating a method of extracting a tilt between character groups.

【図5】本発明の実施例の列抽出処理の概念図である。FIG. 5 is a conceptual diagram of a column extraction process according to the embodiment of the present invention.

【図6】行抽出結果と列抽出結果と、これらから得られ
た表に書き込み可能なデータの一例を示す図である。
FIG. 6 is a diagram illustrating an example of a row extraction result and a column extraction result, and data that can be written in a table obtained from the result.

【図7】本発明の実施例の文字群再分割処理のフローチ
ャートである。
FIG. 7 is a flowchart of a character group subdivision process according to the embodiment of the present invention.

【図8】本発明の実施例の行抽出処理のフローチャート
である。
FIG. 8 is a flowchart of a row extracting process according to the embodiment of the present invention.

【図9】本発明の第2の実施例の処理ブロック図であ
る。
FIG. 9 is a processing block diagram of a second embodiment of the present invention.

【図10】罫線が大幅に省略された表の一例を示す図で
ある。
FIG. 10 is a diagram showing an example of a table in which ruled lines are largely omitted.

【図11】従来方法で構造を認識できる表とできない表
の一例を示す図である。
FIG. 11 is a diagram showing an example of a table whose structure can be recognized by a conventional method and an example of a table whose structure cannot be recognized.

【符号の説明】[Explanation of symbols]

11 文字群抽出部 12 文字群再分割部 13 行抽出部 14 列抽出部 15 表データ作成部 16 罫線抽出部 17 罫線除去部 11 Character Group Extraction Unit 12 Character Group Subdivision Unit 13 Row Extraction Unit 14 Column Extraction Unit 15 Table Data Creation Unit 16 Rule Line Extraction Unit 17 Rule Line Removal Unit

Claims (6)

【特許請求の範囲】[Claims] 【請求項1】 罫線のない表画像から表構造を認識する
方法であって、 上記表内の文字間隔が一定以内の文字を統合して文字群
を求め、 上記文字群内の文字の文字間隔の平均値を求めて、該平
均値に基づき上記文字群を再分割し、 上記再分割した文字群同士の位置関係が一定の傾き以内
の文字群を行として登録し、また、上記再分割した文字
群同士の横方向の座標が重なる関係をもつ文字群を列と
して登録することを特徴とする表認識方法。
1. A method for recognizing a table structure from a table image without ruled lines, wherein a character group is obtained by integrating characters having a fixed character spacing within the table, and a character spacing of characters in the character group is obtained. The character group is subdivided based on the average value, and the character group in which the positional relationship between the subdivided character groups is within a certain slope is registered as a line. A table recognition method characterized by registering, as a column, a character group having a relationship in which horizontal coordinates of the character groups overlap each other.
【請求項2】 画像中に含まれる罫線を除去してから請
求項1記載の表認識を行うことを特徴とする表認識方
法。
2. The table recognition method according to claim 1, wherein the table recognition according to claim 1 is performed after removing a ruled line included in the image.
【請求項3】 罫線のない表画像から表を認識するプロ
グラムであって、 上記プログラムは、上記表内の文字間隔が一定以内の文
字を統合して文字群を求め、上記文字群内の文字の文字
間隔の平均値を求めて、該平均値に基づき上記文字群を
再分割する処理と、 上記再分割した文字群同士の位置関係が一定の傾き以内
の文字群を行として登録し、また、上記再分割した文字
群同士の横方向の座標が重なる関係をもつ文字群を列と
して登録する処理をコンピュータに実行させることを特
徴とする表認識プログラム。
3. A program for recognizing a table from a table image having no ruled lines, wherein the program obtains a character group by integrating characters in the table with a character spacing of less than a predetermined value. A process of re-dividing the character group based on the average value of the character spacing of the character group, and registering a character group in which the positional relationship between the re-divided character groups is within a certain slope as a line, A table recognition program for causing a computer to execute a process of registering, as a column, a character group having a relationship in which the horizontal coordinates of the subdivided character groups overlap each other.
【請求項4】 画像中に含まれる罫線を除去してから請
求項3記載の表構造認識を行うことを特徴とする表構造
認識プログラム。
4. A program for recognizing a table structure according to claim 3, wherein a ruled line included in the image is removed before performing the table structure recognition.
【請求項5】 罫線のない表画像から表を認識するプロ
グラムを記録した記録媒体であって、 上記プログラムは、上記表内の文字間隔が一定以内の文
字を統合して文字群を求め、上記文字群内の文字の文字
間隔の平均値を求めて、該平均値に基づき上記文字群を
再分割し、 上記再分割した文字群同士の位置関係が一定の傾き以内
の文字群を行として登録し、また、上記再分割した文字
群同士の横方向の座標が重なる関係をもつ文字群を列と
して登録することを特徴とする表認識プログラムを記録
した記録媒体。
5. A recording medium recording a program for recognizing a table from a table image without ruled lines, wherein the program integrates characters in the table with a character interval within a certain range to obtain a character group. An average value of the character spacing of the characters in the character group is obtained, the character group is subdivided based on the average value, and a character group in which the positional relationship between the subdivided character groups is within a certain inclination is registered as a line. A recording medium storing a table recognition program, wherein a character group having a relationship in which the horizontal coordinates of the subdivided character groups overlap each other is registered as a column.
【請求項6】 画像中に含まれる罫線を除去してから請
求項3記載の表構造認識を行うことを特徴とする表構造
認識プログラムを記録した記録媒体。
6. A recording medium storing a table structure recognition program according to claim 3, wherein the ruled line included in the image is removed before the table structure recognition is performed.
JP2001064970A 2000-03-17 2001-03-08 Table recognition method Expired - Fee Related JP3904397B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2001064970A JP3904397B2 (en) 2000-03-17 2001-03-08 Table recognition method

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2000075627 2000-03-17
JP2000-75627 2000-03-17
JP2001064970A JP3904397B2 (en) 2000-03-17 2001-03-08 Table recognition method

Publications (2)

Publication Number Publication Date
JP2001331763A true JP2001331763A (en) 2001-11-30
JP3904397B2 JP3904397B2 (en) 2007-04-11

Family

ID=26587764

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2001064970A Expired - Fee Related JP3904397B2 (en) 2000-03-17 2001-03-08 Table recognition method

Country Status (1)

Country Link
JP (1) JP3904397B2 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008257737A (en) * 2008-04-14 2008-10-23 Toshiba Corp Machine translation device and machine translation program
JP2010002991A (en) * 2008-06-18 2010-01-07 Canon Inc Image processor, image processing method, and computer program
CN105592244A (en) * 2014-11-06 2016-05-18 京瓷办公信息系统株式会社 Image Processing Apparatus
KR101811581B1 (en) * 2016-11-15 2017-12-26 주식회사 셀바스에이아이 Aparatus and method for cell decomposition for a table recognition in document image
CN109325415A (en) * 2018-08-22 2019-02-12 吴昌议 A method of all target areas are predicted based on image column alignment feature
CN112651331A (en) * 2020-12-24 2021-04-13 万兴科技集团股份有限公司 Text table extraction method, system, computer device and storage medium

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008257737A (en) * 2008-04-14 2008-10-23 Toshiba Corp Machine translation device and machine translation program
JP2010002991A (en) * 2008-06-18 2010-01-07 Canon Inc Image processor, image processing method, and computer program
US8472078B2 (en) 2008-06-18 2013-06-25 Canon Kabushiki Kaisha Image processing apparatus for determining whether a region based on a combined internal region is a table region
CN105592244A (en) * 2014-11-06 2016-05-18 京瓷办公信息系统株式会社 Image Processing Apparatus
KR101811581B1 (en) * 2016-11-15 2017-12-26 주식회사 셀바스에이아이 Aparatus and method for cell decomposition for a table recognition in document image
CN109325415A (en) * 2018-08-22 2019-02-12 吴昌议 A method of all target areas are predicted based on image column alignment feature
CN112651331A (en) * 2020-12-24 2021-04-13 万兴科技集团股份有限公司 Text table extraction method, system, computer device and storage medium
CN112651331B (en) * 2020-12-24 2024-04-16 万兴科技集团股份有限公司 Text form extraction method, system, computer device and storage medium

Also Published As

Publication number Publication date
JP3904397B2 (en) 2007-04-11

Similar Documents

Publication Publication Date Title
JPH11219407A (en) Document image recognizing device and storage medium for document image recognizing program
US5046114A (en) Method and structure for separating joined patterns for use in pattern and character recognition system
JP2004139484A (en) Form processing device, program for implementing it, and program for creating form format
US8103099B2 (en) Method and system for recognizing characters and character groups in electronically represented text
JP2001331763A (en) Method for recognizing table
CN113343658A (en) PDF file information extraction method and device and computer equipment
JP2003067738A (en) Method and system for removing net point
JP4849883B2 (en) Row direction determination program, method and apparatus
JPH08320914A (en) Table recognition method and device
CN109409370B (en) Remote desktop character recognition method and device
JP2550867B2 (en) Structure analysis method of mixed figure image
JP2796561B2 (en) Tabular document recognition method
JP2768249B2 (en) Document image layout analyzer
JP2993533B2 (en) Information processing device and character recognition device
JPS62121589A (en) Character segmenting system
JP3083609B2 (en) Information processing apparatus and character recognition apparatus using the same
JPH11242716A (en) Image processing method and storage medium
JP3402755B2 (en) Area division method
JP2022051198A (en) Ocr processor, ocr processing method, and program
JPH117493A (en) Character recognition processor
JP2962525B2 (en) Text block recognition method
JP3376931B2 (en) Character line extraction method and apparatus
CN117574814A (en) Layout content rapid analysis method based on site distribution
JP2576080B2 (en) Character extraction method
KR950004219B1 (en) Method and apparatus for font storage

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20040419

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20061017

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20061208

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20070109

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20070109

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20110119

Year of fee payment: 4

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20110119

Year of fee payment: 4

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120119

Year of fee payment: 5

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130119

Year of fee payment: 6

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130119

Year of fee payment: 6

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20140119

Year of fee payment: 7

LAPS Cancellation because of no payment of annual fees