JP3582734B2

JP3582734B2 - Table vectorizer

Info

Publication number: JP3582734B2
Application number: JP17402193A
Authority: JP
Inventors: 守人塩原
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1993-07-14
Filing date: 1993-07-14
Publication date: 2004-10-27
Anticipated expiration: 2019-10-27
Also published as: JPH0728939A

Description

[0001]
[Industrial applications]
The present invention relates to a table vectorizing apparatus for vectorizing a table such as a form or a drawing.
[0002]
Vectorizing tables such as forms and drawings is an important technology in the fields of OCR (optical character reading device), map processing, document processing, and the like. For example, in OCR, it is required to recognize characters from not only a form dedicated to OCR but also a form available on the market.
[0003]
Automatically vectorizing the tables in a form makes it possible to identify and extract them, which improves the OCR recognition rate and adds value. For drawings, vectorizing tables such as parts lists is indispensable.
[0004]
[Prior art]
FIG. 10 is an explanatory diagram of the prior art. In FIG. 10, 1 denotes a table image, 2 denotes a table portion, and 3 denotes a projected image (dot count).
[0005]
Conventionally, in a table vectorizing apparatus, when vectorizing a table portion 2 from a table image 1 such as a form or drawing, vectorization is performed by extracting ruled lines of the table.
In this case, on the assumption that the ruled lines constituting the table are horizontal or vertical with respect to the table image 1, the ruled lines were obtained from the projected images (dot counts) 3 onto the vertical axis (Y axis) and the horizontal axis (X axis). The outline of this vectorization processing is as follows.
[0006]
First, a form or the like is read with a scanner or similar device to obtain a table image 1 (an image in memory). Then the projected images 3 of the table image 1 in the vertical-axis and horizontal-axis directions are obtained.
Each projected image 3 is the value obtained by accumulating the number of dots along the corresponding axis (X or Y) of the X-Y coordinates (for example, if a dot in a black portion is “1” and a dot in a white portion is “0”, the accumulated count of “1” dots).
[0007]
The projected image 3 includes not only the ruled lines of the table but also the dots of characters, numerals, and the like in the table portion 2 (dots other than the ruled lines are included as noise).
Therefore, in order to extract only the ruled lines of the table portion 2, it is determined whether the projected image (dot count) 3 is equal to or larger than a predetermined threshold, and if so, that position is extracted as a ruled line. In this way, only the ruled lines can be extracted.
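The conventional procedure described above can be sketched as follows. This is a minimal illustration, not the implementation of any particular device: the image is assumed to be a list of rows of 0/1 values, and the function names and threshold values are hypothetical.

```python
def projection_profiles(image):
    """Return (row_counts, col_counts): the number of '1' pixels per row and per column."""
    row_counts = [sum(row) for row in image]
    col_counts = [sum(col) for col in zip(*image)]
    return row_counts, col_counts


def lines_above_threshold(counts, threshold):
    """Positions whose accumulated dot count reaches the threshold are taken as ruled lines."""
    return [i for i, count in enumerate(counts) if count >= threshold]


# A 4 x 6 table image: a frame, one inner vertical rule, and one stray "character" dot.
image = [
    [1, 1, 1, 1, 1, 1],
    [1, 0, 0, 1, 0, 1],
    [1, 0, 1, 1, 0, 1],
    [1, 1, 1, 1, 1, 1],
]
rows, cols = projection_profiles(image)
horizontal_rules = lines_above_threshold(rows, 5)  # assumed threshold for rows
vertical_rules = lines_above_threshold(cols, 4)    # assumed threshold for columns
```

Note that the row counts include the dots of the vertical rules and of the stray dot, which is the noise the text refers to.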
[0008]
[Problems to be solved by the invention]
The above-described conventional device has the following problems.
Forms and drawings are slightly inclined when read by an image input device such as a scanner. In this case, a large inclination can be corrected by coordinate conversion.
[0009]
However, even with an inclination of, for example, 1° or less, quantization error causes a straight line in the image to be offset by several pixels, and this residual inclination could not be corrected. When the inclination cannot be corrected, the conventional method described above cannot uniquely determine the threshold used for extracting ruled lines.
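A short worked example may make the size of this effect concrete. The line length L and angle θ below are assumed values chosen for illustration; they do not come from the specification.

```latex
\Delta y = L \tan\theta, \qquad
L = 500,\ \theta = 0.5^\circ
\;\Rightarrow\;
\Delta y \approx 500 \times 0.00873 \approx 4.4 \ \text{pixels}
```

Under these assumptions a horizontal ruled line 500 pixels long drifts across about five rows, so each row it crosses accumulates on the order of L/Δy ≈ 115 dots instead of 500. The peak in the projected image therefore depends on the unknown inclination, which is why a single threshold is hard to fix in advance.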
[0010]
An object of the present invention is to solve such a conventional problem, and to accurately vectorize a table portion even when an input table image is slightly inclined.
[0011]
[Means for Solving the Problems]
FIG. 1 is a diagram illustrating the principle of the present invention. In FIG. 1, the same components as those in FIG. 10 are denoted by the same reference numerals. In addition, 5 denotes a table vectorizing device, 6 an image input unit, 7 a projection unit, 8 a mask image generation unit, 9 a ruled line search unit, 10 a first frame memory, and 11 a second frame memory.
[0012]
The present invention is configured as follows in order to solve the above problems.
(a): A table vectorizing device that obtains projected images onto the vertical axis (Y axis) and the horizontal axis (X axis) from an input table image 1 and vectorizes the table portion 2, the device comprising: a projection unit that, for the pixels corresponding to the table portion of the table image 1, obtains the projected images of the ruled lines constituting the table by extracting the projected image on the horizontal axis (X axis) from vertical line segments only, ignoring horizontal line segments, and extracting the projected image on the vertical axis (Y axis) from horizontal line segments only, ignoring vertical line segments; a mask image generation unit 8 that draws straight lines having the same width as the projected images of the ruled lines in a memory (second frame memory 11) in the horizontal and vertical directions and generates the result as a mask image; and a ruled line search unit 9 that, in accordance with the mask image generated by the mask image generation unit, searches for ruled lines based on the ratio between the length between intersections and the number of pixels between intersections, and vectorizes the table portion.
[0014]
(b): The table vectorizing device of configuration (a), wherein, when searching for ruled lines in accordance with the mask image, the ruled line search unit 9 extracts the intersections of the straight lines from the mask image and determines whether a ruled line exists between intersections from the ratio of the number of pixels to the distance between the extracted intersections.
[0015]
(c): The table vectorizing device of configuration (a), wherein, when searching for ruled lines in accordance with the mask image, the ruled line search unit 9 searches for ruled lines using a tree structure of straight lines.
(d): The table vectorizing device of any of configurations (a) to (c), wherein the vectorizing device is provided with a horizontal component processing unit that performs vectorization processing of the horizontal component of the table image 1 and a vertical component processing unit that performs vectorization processing of the vertical component, and the input table image 1 is divided into a horizontal component and a vertical component that are processed in parallel by the horizontal component processing unit and the vertical component processing unit.
[0016]
[Action]
The operation of the present invention based on the above configuration will be described with reference to FIG.
A form, drawing, or the like is input from the image input unit 6, and the following processing is performed on the table image 1 stored in the first frame memory 10.
[0017]
First, the projection unit 7 extracts the projected images of the vertical and horizontal ruled lines for the pixels of the table image 1 that correspond to the table portion 2. Here, only vertical line segments are projected onto the horizontal axis (X axis) (horizontal line segments are ignored), and conversely only horizontal line segments are projected onto the vertical axis (Y axis) (vertical line segments are ignored).
[0018]
As a result, projected images that conventionally touched one another and could not be separated can now be clearly distinguished.
Next, labeling is performed on each projected image on the vertical axis (Y axis) and the horizontal axis (X axis). In this process, the number of pixels (the number of dots) is counted for each label, and only an image reaching a certain threshold is left (noise component is removed).
[0019]
Subsequently, in accordance with the projected images, straight lines having the width of each image are drawn across the full image in the vertical and horizontal directions to generate a mask image. The intersections of the ruled lines of the table are then calculated by extracting the coordinates of the intersections of the straight lines from the mask image. Finally, the ruled lines are searched to vectorize the table portion. In this case, coordinates are also extracted for the intersections between the edges of the table image and the straight lines.
[0020]
Then, it is checked whether or not a ruled line exists between the intersections. As a result, if a ruled line exists, the coordinates of both ends are recorded.
As described above, it is possible to accurately vectorize the table portion even for a subtle inclination of the input table image. In addition, when the vectorization processing of the table portion is performed, the processing of the horizontal component and the vertical component is performed in parallel, so that the processing speed can be increased.
[0021]
[Embodiments]
Hereinafter, embodiments of the present invention will be described with reference to the drawings.
FIGS. 2 to 9 are diagrams showing an embodiment of the present invention. In FIGS. 2 to 9, the same components as those in FIGS. 1 and 10 are denoted by the same reference numerals. Reference numeral 12 denotes a first projection memory, and reference numeral 13 denotes a second projection memory.
[0022]
§1: Description of the configuration of the table vectorizing device ... see FIG. 2
FIG. 2 is a device configuration diagram of the embodiment. Hereinafter, the configuration of the table vectorizing device will be described with reference to FIG. 2.
[0023]
The table vectorizing device 5 is a device provided in various processing devices such as a personal computer, a workstation, or an OCR device.
The table vectorizing device 5 includes an image input unit 6, a projection unit 7, a mask image generation unit 8, a ruled line search unit 9, a first frame memory 10, a second frame memory 11, a first projection memory 12, a second projection memory 13, and the like.
[0024]
The image input unit 6 reads a form, a drawing, and the like and inputs a table image, and is configured by an optical device such as a scanner or a camera.
The projection unit 7 projects the input table image in the vertical axis (Y axis) and horizontal axis (X axis) directions. The mask image generation unit 8 generates a mask image, and the ruled line search unit 9 performs a ruled line search process according to the mask image (details will be described later).
[0025]
§2: Outline of the processing of the embodiment ... see FIG. 3
FIG. 3 is a processing flowchart of the embodiment. Hereinafter, the processing outline of the embodiment will be described with reference to FIG. 3. Note that S1 to S5 in FIG. 3 indicate the processing step numbers.
[0026]
The following processing is performed on a table image (see a conventional example) obtained by inputting a form or drawing from a scanner.
In this case, the input table image is a binary digital image, denoted f(x, y), in which pixels of the table portion (ruled line portion) have the value “1” and all other pixels have the value “0”. It is also assumed that the inclination of the table portion has already been corrected (to the extent that a large inclination is correctable) by processing such as coordinate transformation of the table image.
[0027]
In the input table image, a process (S1) of extracting a projected image (the number of dots) of a vertical / horizontal ruled line is performed on pixels corresponding to the table portion.
Specifically, only vertical line segments are projected onto the horizontal axis (X axis) (horizontal line segments are ignored), and conversely only horizontal line segments are projected onto the vertical axis (Y axis) (vertical line segments are ignored). As a result, projected images that conventionally touched one another and could not be separated can now be clearly distinguished.
[0028]
Next, labeling is performed on each of the projected images on the vertical axis and the horizontal axis (S2). In this process, the number of pixels (the number of dots) is counted for each label, and only an image reaching a certain threshold is left (noise component is removed). When setting the threshold value, the ruled line of the table is sufficiently longer than the characters and the like, and thus can be easily determined.
[0029]
Subsequently, in accordance with the projected images, straight lines having the width of each image are drawn across the full image to generate a mask image (S3). The image in which the straight lines have been drawn in this way is called the “mask image”. Thereafter, the intersections of the ruled lines of the table are calculated by extracting the coordinates of the intersections of the straight lines from the mask image (S4).
[0030]
Finally, a ruled line is searched (S5), and the table portion is vectorized. In this case, the coordinates are extracted also for the intersection of the end of the table image and the straight line. Then, it is checked whether or not a ruled line exists between the intersections. As a result, if a ruled line exists, the coordinates of both ends are recorded.
[0031]
§3: Description of the processing of the embodiment
FIG. 4 is process explanatory diagram 1 of the embodiment (projected image extraction process), FIG. 5 is process explanatory diagram 2 (mask image generation process), FIG. 6 is process explanatory diagram 3 (ruled line intersection calculation process), FIG. 7 is process explanatory diagram 4 (ruled line search process), and FIG. 8 is process explanatory diagram 5 (vector hierarchization process). Hereinafter, each process of the embodiment will be specifically described.
[0032]
(1): Description of the table image input process
The table image input process is performed by the image input unit 6. That is, the image input unit 6 optically reads a form or drawing and converts the table into a digital image (table image).
[0033]
The input table image is a binary image in which the table portion has the value “1” and all other portions have the value “0”.
In this example, the size of the table image is N × M pixels. The input table image is temporarily stored in the first frame memory 10. In this case, the size of the first frame memory is assumed to be the same as the size of the table image (N × M pixels).
[0034]
(2): Description of the projected image extraction process ... see FIG. 4
The projection unit 7 performs a projected image extraction process on the input table image. In this process, the projection unit 7 sets masks of, for example, 9 × 1 pixels and 1 × 9 pixels, and scans the first frame memory 10 separately in the horizontal and vertical directions.
[0035]
That is, when scanning in the horizontal direction, a mask of 9 × 1 pixels (9 pixels in the horizontal direction and 1 pixel in the vertical direction) is used, and when scanning in the vertical direction, a mask of 1 × 9 pixels (1 pixel in the horizontal direction and 9 pixels in the vertical direction) is used.
[0036]
When the first frame memory 10 is scanned with the mask, the center pixel of the mask is kept if, for example, pixels with the value “1” make up 80% or more of the total number of pixels in the mask, or if there is a run of consecutive “1” pixels that includes the center pixel of the mask and is at least half the number of pixels in the mask long.
[0037]
However, pixels to be left in the horizontal axis (X-axis) direction are left in the second frame memory 11, and pixels to be left in the vertical axis (Y-axis) direction are left in the first frame memory 10. In this case, the original table image (image at the time of input) stored in the first frame memory 10 is deleted.
[0038]
With the above processing, pixels that would become noise are removed, and the pixels kept for the horizontal direction and the pixels kept for the vertical direction are stored separately in different frame memories.
After that, projection processing is performed on the vertical axis and the horizontal axis from the table images stored in the first frame memory 10 and the second frame memory 11, and a projection image is obtained (the number of pixels is accumulated).
[0039]
Then, as shown in FIG. 4, the first frame memory 10 is scanned in the horizontal direction, and the number of pixels having the value “1” in each column is stored in the first projection memory 12. At this time, the number of pixels (the number of dots) is stored in the address corresponding to each column.
[0040]
Further, the second frame memory 11 is scanned in the vertical direction, and the number of pixels having the value “1” in each column is stored in the second projection memory 13. At this time, the number of pixels (the number of dots) is stored in the address corresponding to each column.
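The projected image extraction described in this section can be sketched roughly as below. This is one reading of the text under stated assumptions, not the patented implementation: images are lists of rows of 0/1 values, pixels outside the image are treated as 0, the two returned profiles stand in for the projection memories, and all function names are hypothetical. A mask length of 3 is used in the example only to keep the data small; the text's example uses 9.

```python
def keep_center(window, fill_ratio=0.8):
    """Decide whether the center pixel of a 1-D mask window is kept."""
    n = len(window)
    if sum(window) >= fill_ratio * n:      # rule 1: at least 80% of the mask is '1'
        return True
    center = n // 2
    if window[center] != 1:
        return False
    left = right = center                  # rule 2: run of '1's through the center
    while left > 0 and window[left - 1] == 1:
        left -= 1
    while right < n - 1 and window[right + 1] == 1:
        right += 1
    return (right - left + 1) >= n / 2


def filter_rows(image, size=9):
    """Slide a size x 1 horizontal mask along every row (zero padding at the edges)."""
    half = size // 2
    result = []
    for row in image:
        padded = [0] * half + list(row) + [0] * half
        result.append([1 if keep_center(padded[x:x + size]) else 0
                       for x in range(len(row))])
    return result


def transpose(image):
    return [list(col) for col in zip(*image)]


def directional_projections(image, size=9):
    """Project vertical segments onto the X axis and horizontal segments onto the Y axis."""
    horizontal_only = filter_rows(image, size)
    vertical_only = transpose(filter_rows(transpose(image), size))
    y_profile = [sum(row) for row in horizontal_only]
    x_profile = [sum(col) for col in zip(*vertical_only)]
    return x_profile, y_profile


# One horizontal rule (top row) meeting one vertical rule (left column).
image = [
    [1, 1, 1, 1, 1],
    [1, 0, 0, 0, 0],
    [1, 0, 0, 0, 0],
    [1, 0, 0, 0, 0],
    [1, 0, 0, 0, 0],
]
x_profile, y_profile = directional_projections(image, size=3)
```

For this image a plain projection would give 5, 1, 1, 1, 1 on both axes, because each rule leaks one dot into every position of the other axis. With the directional filter both profiles contain a single peak of 5 and zeros elsewhere.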
[0042]
(3): Description of the image labeling process ... see FIG. 4
The mask image generation unit 8 performs a labeling process on the projected images stored in the first projection memory 12 and the second projection memory 13 by assigning a number to each cluster (a contiguous portion in which dot counts are present).
[0043]
For example, in FIG. 4, one label (for example, a number) is assigned to a portion where the projected image forms a single cluster (three pixels wide), such as “3, 9, 1”.
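The labeling step, together with the noise threshold mentioned in paragraph [0028], might look like the sketch below. The dictionary layout, the function name, and the threshold value are assumptions made for illustration.

```python
def label_clusters(profile, min_pixels):
    """Label contiguous runs of nonzero counts; keep runs whose total reaches min_pixels."""
    clusters = []
    start = None
    # A trailing 0 is appended as a sentinel so a run touching the end is closed too.
    for i, count in enumerate(list(profile) + [0]):
        if count > 0 and start is None:
            start = i
        elif count == 0 and start is not None:
            total = sum(profile[start:i])
            if total >= min_pixels:
                clusters.append({
                    "label": len(clusters) + 1,
                    "start": start,
                    "width": i - start,
                    "total": total,
                })
            start = None
    return clusters


profile = [0, 3, 9, 1, 0, 0, 2, 0, 4, 8, 0]
clusters = label_clusters(profile, min_pixels=10)
```

Here the run “3, 9, 1” becomes label 1 with a width of three pixels, the isolated count of 2 is discarded as noise, and the run “4, 8” becomes label 2.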
(4): Description of the mask image generation process ... see FIG. 5
Next, as shown in FIG. 5, straight lines (pixel value = 1) having the same width as each cluster are drawn horizontally and vertically in the second frame memory 11 (the lines are drawn across the full extent of the memory).
[0044]
In this case, for example, if the cluster is 3 pixels wide, a straight line 3 pixels wide is drawn. The straight-line portions drawn in the horizontal and vertical directions in this way (a cross-shaped image) are called the “mask image”.
The mask image is generated as described above.
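A minimal sketch of the mask image generation follows. It assumes each cluster is given as a (start, width) pair and that the mask is a fresh list-of-rows buffer of the same size as the table image; both the representation and the function name are hypothetical.

```python
def make_mask(width, height, x_clusters, y_clusters):
    """Draw full-span lines: one vertical band per X cluster, one horizontal band per Y cluster."""
    mask = [[0] * width for _ in range(height)]
    for start, band in y_clusters:          # horizontal lines across the whole width
        for y in range(start, start + band):
            mask[y] = [1] * width
    for start, band in x_clusters:          # vertical lines across the whole height
        for row in mask:
            for x in range(start, start + band):
                row[x] = 1
    return mask


# One vertical cluster 2 pixels wide at x = 1, one horizontal cluster 1 pixel wide at y = 2.
mask = make_mask(5, 4, x_clusters=[(1, 2)], y_clusters=[(2, 1)])
```

Because every line spans the whole buffer, the mask marks where ruled lines could be, not where they actually are. The later search step decides which stretches really contain a ruled line.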
[0045]
(5): Description of the ruled line intersection calculation process ... see FIG. 6
From the mask image generated as described above, the coordinates (x, y) of the intersections of the straight lines are extracted as shown in FIG. 6. In this case, the intersection of two straight lines is taken to be the pair of positions at which the projected images used to draw the vertical and horizontal straight lines have their highest values (largest dot counts).
[0046]
That is, in the example of FIG. 6, the coordinates of the intersection of the straight line drawn at the position "9" of the projected image and the straight line drawn at the position "8" of the projected image are extracted. In this way, the intersections of the ruled lines are extracted.
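The intersection rule can be sketched as below, assuming the profiles and (start, width) clusters from the earlier steps. The names are hypothetical, ties are resolved by taking the first peak, and the intersections with the image edges mentioned in paragraph [0030] are left out for brevity.

```python
def peak_position(profile, start, width):
    """Position of the largest dot count inside one cluster (first one on ties)."""
    segment = profile[start:start + width]
    return start + segment.index(max(segment))


def intersections(x_profile, x_clusters, y_profile, y_clusters):
    """Pair every vertical line's peak x with every horizontal line's peak y."""
    xs = [peak_position(x_profile, s, w) for s, w in x_clusters]
    ys = [peak_position(y_profile, s, w) for s, w in y_clusters]
    return [(x, y) for y in ys for x in xs]


x_profile = [0, 3, 9, 1, 0, 0, 2, 8, 0]
y_profile = [0, 8, 2, 0]
points = intersections(x_profile, [(1, 3), (6, 2)], y_profile, [(1, 2)])
```

In this example the cluster “3, 9, 1” contributes x = 2 (the position of the 9), mirroring the “9” and “8” positions used in the FIG. 6 example.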
[0047]
(6): Description of the ruled line search process ... see FIG. 7
After the intersections of the ruled lines have been detected as described above, the ruled line search unit 9 performs the ruled line search process as follows.
[0048]
For example, as shown in FIG. 7, for a straight line of width A, the first frame memory 10 is referenced, and if a pixel with the value “1” exists within the width A, a ruled line is assumed to exist at that position.
[0049]
If, between two intersections, the number of pixels at which a ruled line is assumed to exist accounts for at least a predetermined ratio (for example, 80%) of the length between the intersections (the shortest pixel count), a ruled line is judged to exist between those intersections, and the coordinates of the end points of the ruled line are stored in the memory 14. This process is repeated for all intersections.
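The existence test can be illustrated for the horizontal case as follows. This sketch assumes the band of width A is given by its first row and its height, that the length between intersections is counted inclusively, and that the function name and arguments are hypothetical.

```python
def has_rule(image, y_start, band_width, x0, x1, ratio=0.8):
    """True if enough columns between x0 and x1 contain a '1' somewhere inside the band."""
    length = x1 - x0 + 1
    covered = sum(
        1
        for x in range(x0, x1 + 1)
        if any(image[y][x] == 1 for y in range(y_start, y_start + band_width))
    )
    return covered >= ratio * length


# A slightly inclined line that steps down one row at a time.
image = [
    [1, 1, 1, 1, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 1, 1, 1],
]
whole_band = has_rule(image, y_start=0, band_width=3, x0=0, x1=9)
single_row = has_rule(image, y_start=0, band_width=1, x0=0, x1=9)
```

Checking the whole band finds the inclined line because every column has a “1” somewhere within the width A, whereas a one-row check covers only 4 of 10 columns and fails. This is the sense in which the band tolerates a small inclination.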
[0050]
In order to obtain a ruled line that is as long as possible, the ruled line search unit 9 sequentially selects intersections connected by straight lines, starting from the distant ones, and inspects whether a ruled line exists between them.
[0051]
(7): Description of the vector hierarchization process ... see FIG. 8
For example, as shown in FIG. 8, the straight lines are arranged in a hierarchy using the length of the straight line between intersections as the measure.
[0052]
For example, if there is a straight line AE, the straight line AE is composed of a slightly shorter straight line AD and a straight line BE (they may partially overlap). The straight line AD includes a straight line AC and a straight line BD. When such a relationship is represented by a tree structure, a hierarchical structure is obtained as shown in FIG.
[0053]
In this case, if no ruled line exists on the straight line AE, the existence of a ruled line is checked on the straight lines AD and BE, and then in the same way on each of their component straight lines. If a ruled line does exist on the straight line AE, the straight lines that make it up are not checked.
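The longest-first search over this hierarchy might be sketched as below. Intersections on one straight line are assumed to be indexed 0 to n-1 in order (A = 0, ..., E = 4), `exists(i, j)` is a hypothetical callback that runs the ratio test between intersections i and j, and skipping segments that lie inside an already accepted ruled line is an assumption added here to avoid duplicate records.

```python
def search_rules(n_points, exists):
    """Test segments level by level, longest first; descend only where no rule was found."""
    found = []
    pending = [(0, n_points - 1)]
    seen = set(pending)
    while pending:
        next_level = []
        for i, j in pending:
            # Skip segments contained in a ruled line that was already accepted.
            if any(a <= i and j <= b for a, b in found):
                continue
            if exists(i, j):
                found.append((i, j))
                continue
            # No rule on (i, j): queue its two slightly shorter components.
            for child in ((i, j - 1), (i + 1, j)):
                if child[1] - child[0] >= 1 and child not in seen:
                    seen.add(child)
                    next_level.append(child)
        pending = next_level
    return found


# Example: among intersections A..E, a ruled line actually runs from B to E.
def rule_from_b_to_e(i, j):
    return 1 <= i and j <= 4


result = search_rules(5, rule_from_b_to_e)
```

With this input AE fails, BE succeeds on the next level, and the shorter segments inside BE are never recorded, so the result is the single longest ruled line.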
[0054]
§4: Description of the parallel processing device ... see FIG. 9
FIG. 9 is a configuration diagram of the parallel processing device of the embodiment. By using, for example, a parallel processing device as shown in FIG. 9 as the device that performs the vectorization processing, the processing speed can be increased.
[0055]
This apparatus divides an input table image into a horizontal component and a vertical component, and performs horizontal ruled line extraction processing and vertical ruled line extraction processing as parallel processing.
As shown in the figure, the table vectorizing device 5 is provided with a horizontal component processing unit 15 and a vertical component processing unit 16, and each of these processing units is provided with a projection unit 7, a mask image generation unit 8, and a ruled line search unit 9.
[0056]
In this apparatus, the table image input from the image input unit 6 is input to the horizontal component processing unit 15 and the vertical component processing unit 16, and the above-described processing is performed as parallel processing for each of the horizontal component and the vertical component. The function of each unit is the same as that of the apparatus shown in FIG.
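The two-unit arrangement can be mimicked in software as in the sketch below. The two worker functions are simplified stand-ins (they only report fully filled rows or columns) rather than the full projection, mask, and search chain, and all names are hypothetical. In CPython a thread pool mainly shows the structure; a process pool would be the usual choice if real CPU parallelism were needed.

```python
from concurrent.futures import ThreadPoolExecutor


def horizontal_component(image):
    """Stand-in for horizontal component processing unit 15: rows that are entirely '1'."""
    return [y for y, row in enumerate(image) if all(row)]


def vertical_component(image):
    """Stand-in for vertical component processing unit 16: columns that are entirely '1'."""
    return [x for x, col in enumerate(zip(*image)) if all(col)]


def vectorize_parallel(image):
    """Submit both components at once and wait for both results."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        horizontal = pool.submit(horizontal_component, image)
        vertical = pool.submit(vertical_component, image)
        return horizontal.result(), vertical.result()


image = [
    [1, 1, 1],
    [1, 0, 0],
    [1, 1, 1],
]
rows, cols = vectorize_parallel(image)
```

The two workers share the input image read-only and produce independent results, which is what allows the horizontal and vertical processing to run side by side.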
[0057]
[Effects of the Invention]
As described above, the present invention has the following effects.
(1) Even if the input table image is slightly inclined, the table can be accurately vectorized.
[0058]
(2) By using a tree structure of straight lines, the ruled lines constituting a table can be made as long as possible.
(3) The processing speed can be increased by using a parallel processing device as the device that performs the table vectorization process.
[Brief description of the drawings]
FIG. 1 is a diagram illustrating the principle of the present invention.
FIG. 2 is an apparatus configuration diagram of an embodiment.
FIG. 3 is a processing flowchart of the embodiment.
FIG. 4 is process explanatory diagram 1 of the embodiment (projected image extraction process).
FIG. 5 is process explanatory diagram 2 of the embodiment (mask image generation process).
FIG. 6 is process explanatory diagram 3 of the embodiment (ruled line intersection calculation process).
FIG. 7 is process explanatory diagram 4 of the embodiment (ruled line search process).
FIG. 8 is process explanatory diagram 5 of the embodiment (vector hierarchization process).
FIG. 9 is a configuration diagram of a parallel processing device according to an embodiment.
FIG. 10 is an explanatory diagram of a conventional technique.
[Explanation of symbols]
1 Table image
2 Table portion
5 Table vectorizing device
6 Image input unit
7 Projection unit
8 Mask image generation unit
9 Ruled line search unit
10 First frame memory
11 Second frame memory

Claims

1. A table vectorizing apparatus that obtains projected images onto a vertical axis and a horizontal axis from an input table image and vectorizes a table portion, comprising:
a projection unit that, for pixels corresponding to the table portion of the table image, obtains projected images of the ruled lines constituting the table by extracting the projected image on the horizontal axis (X axis) from vertical line segments only, ignoring horizontal line segments, and extracting the projected image on the vertical axis (Y axis) from horizontal line segments only, ignoring vertical line segments;
a mask image generation unit that draws straight lines having the same width as the projected images of the ruled lines in a memory in the horizontal and vertical directions and generates the result as a mask image; and
a ruled line search unit that, in accordance with the mask image generated by the mask image generation unit, searches for ruled lines based on the ratio between the length between intersections and the number of pixels between intersections, and vectorizes the table portion.

2. The table vectorizing apparatus according to claim 1, wherein, when searching for ruled lines in accordance with the mask image, the ruled line search unit extracts the intersections of the straight lines from the mask image and determines whether a ruled line exists between the intersections from the ratio of the number of pixels to the distance between the extracted intersections.

3. The table vectorizing apparatus according to claim 1, wherein the ruled line data used when the ruled line search unit searches for ruled lines in accordance with the mask image is represented by a tree structure of straight lines.

4. The table vectorizing apparatus, wherein the vectorizing apparatus includes a horizontal component processing unit that performs vectorization processing of the horizontal component of the table image and a vertical component processing unit that performs vectorization processing of the vertical component, and the input table image is divided into a horizontal component and a vertical component that are processed in parallel by the horizontal component processing unit and the vertical component processing unit.