JP2007328652A

JP2007328652A - Image processing device and image processing program

Info

Publication number: JP2007328652A
Application number: JP2006160462A
Authority: JP
Inventors: Toshiya Koyama; 俊哉小山
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2006-06-09
Filing date: 2006-06-09
Publication date: 2007-12-20

Abstract

<P>PROBLEM TO BE SOLVED: To solve the problem that a constitution for applying a Hough transform and histogram generation to an entire image results in the erroneous detection of the inclination of lines of paragraphs as the skew angle of the entire image if, for example, vertically or horizontally adjacent paragraphs included in the image have evenly shifted lines. <P>SOLUTION: A white area search part 202 searches an input image for white areas, and an area division part 203 divides the input image into a plurality of areas on the basis of such white areas. Every divided area is Hough-transformed by a Hough transform part 204 and angle is etected by an angle calculation part 205. An overall angle calculation part 206 calculates the skew angle of the entire image from the angles detected on the divided area basis, and a skew correction part 207 corrects the skew of the input image according to the skew angle. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、画像処理装置および画像処理プログラムに関し、特に画像読み取りの際に傾いて読み込まれた文書画像の傾き角度を検出し、その傾き補正を行う画像処理装置および画像処理プログラムに関する。 The present invention relates to an image processing apparatus and an image processing program, and more particularly to an image processing apparatus and an image processing program for detecting an inclination angle of a document image read in an inclined manner when reading an image and correcting the inclination.

文書画像をイメージスキャナで読み取った際に得られる画像データや、ファクシミリ装置が受信する画像データを解析し、文書画像から領域の切り出しを行うとともに、文書中に含まれる画像の種類や属性を判別し、そのうち文字と判別された領域に対しては文字認識処理を施し、画像として読み取った文書から文字あるいは文章を抽出する技術が知られている。 Analyzes the image data obtained when the document image is read by the image scanner and the image data received by the facsimile machine, extracts the region from the document image, and determines the type and attribute of the image included in the document. A technique is known in which character recognition processing is performed on an area determined as a character, and a character or a sentence is extracted from a document read as an image.

この種の画像処理の分野において、領域の切り出しや文字認識などの処理を正しく実行するためには、イメージスキャナや送信側のファクシミリ装置で読み取った文書画像に傾きが無いことが前提となる。したがって、読み込まれた画像に傾きがある場合には、領域の切り出しや文字認識などの処理を行う前処理として、画像の傾き角度（以下、「スキュー角」と記述する）を検出してその傾きを補正する、いわゆるスキュー（ｓｋｅｗ）補正を行う必要がある。 In the field of this type of image processing, in order to correctly execute processing such as segmentation and character recognition, it is premised that the document image read by the image scanner or the facsimile machine on the transmission side has no inclination. Therefore, when the read image has a tilt, as a pre-process for processing such as segmentation and character recognition, the tilt angle of the image (hereinafter referred to as “skew angle”) is detected and the tilt is detected. It is necessary to perform so-called skew correction.

従来、文書画像のスキュー角を検出し、その検出結果を基にしてスキュー補正を行う技術として、ハフ（Ｈｏｕｇｈ）変換を用いた技術が知られている（例えば、特許文献１参照）。この特許文献１記載の従来技術では、スキュー角の検出に当たって次のような処理を行うようにしている。 Conventionally, a technique using Hough transformation is known as a technique for detecting a skew angle of a document image and performing skew correction based on the detection result (see, for example, Patent Document 1). In the prior art described in Patent Document 1, the following processing is performed when detecting the skew angle.

先ず、入力画像に対してフィルタリング処理を行い、当該フィルタリング処理によって濃淡差が強調された画像に対して２値化処理を行い２値画像を生成する。次に、生成した２値画像の各画素に対してハフ変換を行い、ハフ空間上にヒストグラムを作成し、ハフ空間上で頻度が所定の閾値以上となる座標を抽出する。そして、抽出した座標をグループ化した後、グループごとに代表点座標を抽出し、抽出した座標から画像のスキュー角を推定する。 First, a filtering process is performed on an input image, and a binarization process is performed on an image whose density difference is enhanced by the filtering process to generate a binary image. Next, a Hough transform is performed on each pixel of the generated binary image, a histogram is created in the Hough space, and coordinates whose frequency is equal to or higher than a predetermined threshold in the Hough space are extracted. Then, after the extracted coordinates are grouped, representative point coordinates are extracted for each group, and the skew angle of the image is estimated from the extracted coordinates.

また、特許文献１には、ハフ空間上で頻度が所定の閾値以上となる座標を抽出した後、抽出した座標の個数を角度ごとに積算してヒストグラムを生成し、頻度が最大となる角度を画像のスキュー角として推定する技術も記載されている。 Further, in Patent Document 1, after extracting coordinates having a frequency equal to or higher than a predetermined threshold in the Hough space, the number of extracted coordinates is integrated for each angle to generate a histogram, and the angle at which the frequency is maximum is obtained. A technique for estimating the skew angle of an image is also described.

特開平１１−３２８４０８号公報JP 11-328408 A

しかしながら、上述した従来技術では、文書画像全体に対してハフ変換処理やヒストグラム生成処理を行う構成を採っているために、例えば図７に示すように、文書画像に含まれる段落において上下あるいは左右に隣接する段落相互間で行が均等に傾いている（ずれている）場合に、その段落相互間での行の傾きを文書画像全体の傾き角度（スキュー角）と誤検出してしまったり、あるいは、文書画像全体として１つの傾き角度を決定することになるために、例えば図８に示すように、見開き原稿のように、左右ページで画像の傾き角度が異なる場合に、左右ページの両画像に対してスキュー補正を正しく行うことができなかったりする場合がある。 However, since the conventional technique described above employs a configuration in which the Hough transform process and the histogram generation process are performed on the entire document image, for example, as shown in FIG. If the lines are evenly inclined (shifted) between adjacent paragraphs, the line inclination between the paragraphs may be mistakenly detected as the inclination angle (skew angle) of the entire document image, or In order to determine one inclination angle for the entire document image, for example, as shown in FIG. 8, when the left and right pages have different inclination angles, as shown in FIG. On the other hand, skew correction may not be performed correctly.

本発明は、上記課題に鑑みてなされたものであって、その目的とするところは、画像のスキュー角検出およびその補正を高精度にて実現可能にした画像処理装置および画像処理プログラムを提供することにある。 The present invention has been made in view of the above problems, and an object of the present invention is to provide an image processing apparatus and an image processing program capable of detecting a skew angle of an image and correcting the image with high accuracy. There is.

上記目的を達成するために、本発明は、次のような構成を採っている。すなわち、入力画像中から白領域を探索し、この探索した白領域を基準に前記入力画像を複数の領域に分割し、この分割した領域単位にハフ変換による角度検出を行う。そして、この検出した領域単位の角度を基にスキュー角を求め、この求めたスキュー角を基に入力画像に対するスキュー補正を行うようにする。 In order to achieve the above object, the present invention adopts the following configuration. That is, a white area is searched from the input image, the input image is divided into a plurality of areas based on the searched white area, and angle detection is performed by Hough transform for each divided area. Then, a skew angle is obtained based on the detected angle of the region unit, and skew correction is performed on the input image based on the obtained skew angle.

スキュー角検出・補正を行う画像処理において、画像中から白領域を見つけ、当該白領域で区切られた分割領域単位にハフ変換による角度検出を行う。このように、分割領域単位で角度検出を行うことにより、他の領域との干渉を防ぎつつ分割領域個々の角度検出を行うことができる。そして、分割領域単位で検出した角度から画像全体の又は領域単位のスキュー角を算出し、当該スキュー角を基にスキュー補正を行う。 In the image processing for detecting and correcting the skew angle, a white area is found from the image, and angle detection is performed by Hough transform for each divided area divided by the white area. In this manner, by performing angle detection in units of divided areas, it is possible to detect the angles of individual divided areas while preventing interference with other areas. Then, a skew angle for the entire image or for each region is calculated from the angle detected for each divided region, and skew correction is performed based on the skew angle.

本発明によれば、他の領域との干渉を防ぎつつ分割領域個々の角度検出を行うことで、画像全体の又は領域単位のスキュー角をより正確に検出することができるために、スキュー角の検出およびその補正を高精度にて実現できる。 According to the present invention, by detecting the angle of each divided region while preventing interference with other regions, it is possible to more accurately detect the skew angle of the entire image or of the region unit. Detection and correction thereof can be realized with high accuracy.

以下、本発明の実施の形態について図面を参照して詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１は、本発明による画像処理装置を含む画像処理システムの構成例を示すブロック図である。図１に示すように、本画像処理システムは、画像データ入力部１０、画像処理部２０および画像データ出力部３０を有する構成となっている。 FIG. 1 is a block diagram showing a configuration example of an image processing system including an image processing apparatus according to the present invention. As shown in FIG. 1, the image processing system includes an image data input unit 10, an image processing unit 20, and an image data output unit 30.

画像データ入力部１０は、イメージスキャナ等の画像読取装置からなり、文書原稿の画像を読み取ってその画像データを画像処理部２０に入力する。なお、イメージスキャナ等の画像読取装置で読み取った画像データをサーバなどの記憶装置に一旦格納しておくシステム構成を採る場合には、サーバが画像データ入力部１０として機能することになる。また、ファクシミリ装置にあっては、受信側のファクシミリ装置が画像データ入力部１０として機能することになる。 The image data input unit 10 includes an image reading device such as an image scanner, reads an image of a document original, and inputs the image data to the image processing unit 20. In the case of adopting a system configuration in which image data read by an image reading device such as an image scanner is temporarily stored in a storage device such as a server, the server functions as the image data input unit 10. In the facsimile apparatus, the receiving facsimile apparatus functions as the image data input unit 10.

画像処理部２０は、本発明による画像処理装置に相当し、画像データ入力部１０から入力される画像データに基づく画像（入力画像）のスキュー角（傾き角度）を検出し、その検出角度を基に画像データに対してスキュー補正を行う。 The image processing unit 20 corresponds to the image processing apparatus according to the present invention, detects a skew angle (tilt angle) of an image (input image) based on the image data input from the image data input unit 10, and based on the detected angle. In addition, skew correction is performed on the image data.

画像処理部２０は、ＣＰＵ(Central Processing Unit:中央演算装置)２１、Ｉ／Ｏ回路２２、ＲＯＭ２３、ＲＡＭ２４およびＨＤＤ（ハードディスクドライブ）装置２５などを有し、これら構成要素がバスライン２６を介して相互に通信可能に接続された構成となっている。 The image processing unit 20 includes a CPU (Central Processing Unit) 21, an I / O circuit 22, a ROM 23, a RAM 24, an HDD (Hard Disk Drive) device 25, and the like, and these components are connected via a bus line 26. It is the structure connected so that mutual communication was possible.

ＣＰＵ２１は、演算処理を含む本画像処理部２０全体の処理の制御を行う。Ｉ／Ｏ回路２２は、画像データ入力部１０および画像データ出力部３０を含む周辺機器との入出力を管理する。ＲＯＭ２３は、ＣＰＵ２１による制御の下に実行される各種処理の処理プログラムを格納する。ＲＡＭ２４は、各種処理の実行時に使用される一次記憶装置である。ＨＤＤ２５は、ＣＰＵ２１による制御の下に処理された画像データや、外部から取り込んだ画像データなどを記憶する。 The CPU 21 controls processing of the entire image processing unit 20 including calculation processing. The I / O circuit 22 manages input / output with peripheral devices including the image data input unit 10 and the image data output unit 30. The ROM 23 stores processing programs for various processes executed under the control of the CPU 21. The RAM 24 is a primary storage device used when executing various processes. The HDD 25 stores image data processed under the control of the CPU 21 and image data captured from the outside.

画像データ出力部３０は、画像処理部２０でスキュー補正された画像データを、次段の例えば文字認識装置などの画像処理装置（図示せず）に対して出力する。なお、画像データ出力部３０をプリンタやディスプレイ等の出力装置とその制御手段によって構成し、画像処理部２０でスキュー補正された画像データに基づく画像を印刷（記録）紙に印刷出力したり、あるいは、表示画面上に表示出力したりすることも可能である。 The image data output unit 30 outputs the image data whose skew has been corrected by the image processing unit 20 to an image processing device (not shown) such as a character recognition device in the next stage. The image data output unit 30 includes an output device such as a printer or a display and its control means, and prints out an image based on the image data skew-corrected by the image processing unit 20 on printing (recording) paper, or It is also possible to display on the display screen.

［第１実施形態］
図２は、本発明の第１実施形態に係る画像処理装置（画像処理部）２０Ａの機能構成例を示すブロック図である。 [First Embodiment]
FIG. 2 is a block diagram illustrating a functional configuration example of the image processing apparatus (image processing unit) 20A according to the first embodiment of the present invention.

図２に示すように、本実施形態に係る画像処理装置２０Ａは、２値化部２０１、白領域探索部２０２、領域分割部２０３、ハフ（Ｈｏｕｇｈ）変換部２０４、角度算出部２０５、全体角度算出部２０６およびスキュー補正部２０７を有する構成となっている。 As shown in FIG. 2, the image processing apparatus 20A according to the present embodiment includes a binarization unit 201, a white region search unit 202, a region division unit 203, a Hough conversion unit 204, an angle calculation unit 205, and an overall angle. The calculation unit 206 and the skew correction unit 207 are included.

２値化部２０１は、図１の画像データ入力部１０から入力される、濃淡が複数ビットで表されている画像データを、入力画像中に含まれる例えば文字や線、絵柄や写真などの前景に属する画素を画素値が論理「１」のＯＮ画素（例えば、黒画素）、背景領域に属する画素を画素値が論理「０」のＯＦＦ画素（例えば、白画素）とした各画素値が１ビットの２値画像データに２値化して白領域探索部２０２および領域分割部２０３に供給する。 The binarization unit 201 converts the image data input from the image data input unit 10 in FIG. 1 and representing the shading with a plurality of bits into foregrounds such as characters, lines, patterns, and photographs included in the input image. Each pixel value is 1 when the pixel belonging to is an ON pixel (for example, a black pixel) whose pixel value is logic “1” and the pixel belonging to the background area is an OFF pixel (for example, a white pixel) whose pixel value is logic “0” The binarized binary image data is supplied to the white area searching unit 202 and the area dividing unit 203.

白領域探索部２０２は、２値化部２０１から供給される２値画像データを基に、当該画像データに基づく２値画像中から白領域（白帯）を探索する。この白領域探索部２０２は、白領域探索を行う際に、２値画像に対して論理和縮小、具体的にはｍ×ｎ画素（ｍ，ｎは２以上の任意の整数）、例えば４×４画素を単位とし、当該単位中の１画素でもＯＮ画素ならその単位を１画素のＯＮ画素とすることによって画像の縮小を行う。 Based on the binary image data supplied from the binarization unit 201, the white region search unit 202 searches for a white region (white band) from the binary image based on the image data. The white area search unit 202 performs logical sum reduction on a binary image when performing a white area search, specifically m × n pixels (m and n are arbitrary integers of 2 or more), for example, 4 ×. If four pixels are used as a unit, and even one pixel in the unit is an ON pixel, the image is reduced by setting the unit as one ON pixel.

この画像縮小については、図３に示すように、原画像（Ａ）を段階的に、例えば縮小画像（Ｂ）→縮小画像（Ｃ）のように２段階で縮小しても良いし、１回で所定の倍率、例えば原画像（Ａ）を縮小画像（Ｃ）に縮小しても良い。 As for this image reduction, as shown in FIG. 3, the original image (A) may be reduced in two steps, for example, reduced image (B) → reduced image (C). A predetermined magnification, for example, the original image (A) may be reduced to a reduced image (C).

領域分割部２０３は、２値化部２０１で２値化された画像を、白領域探索部２０２で探索された白領域を基準にして複数の領域に分割する。ここで、画像を複数の領域に分割する際に、完全に白領域で区切られている必要は無い。白領域を基準として大体領域分割されていれば良く、文字領域などについてはある程度なら分断されていても構わない。 The area dividing unit 203 divides the image binarized by the binarizing unit 201 into a plurality of areas based on the white area searched by the white area searching unit 202. Here, when the image is divided into a plurality of areas, it is not necessary to completely divide the image into white areas. It suffices that the area is roughly divided on the basis of the white area, and the character area or the like may be divided to some extent.

ハフ変換部２０４は、領域分割部２０３で区切られた（分割された）領域単位で周知のハフ変換処理を行う。ここで、ハフ変換処理について説明する。画素の位置をｘ座標とｙ座標とで表した場合、原点から画素までの直線距離をρ、原点から画素への直線とｘ軸とのなす角度をθとすると、ｘ−ｙ座標上において座標（ｘ，ｙ）に位置する画素を通る全ての直線は、
ρ＝ｘｃｏｓθ＋ｙｓｉｎθ （０≦θ＜π）
なる式で表すことができる。 The Hough conversion unit 204 performs a well-known Hough conversion process for each region divided (divided) by the region dividing unit 203. Here, the Hough conversion process will be described. When the position of a pixel is represented by an x coordinate and ay coordinate, if the straight line distance from the origin to the pixel is ρ, and the angle between the straight line from the origin to the pixel and the x axis is θ, the coordinate on the xy coordinate All straight lines passing through the pixel located at (x, y) are
ρ = x cos θ + ysin θ (0 ≦ θ <π)
It can be expressed by the following formula.

そして、座標（ｘ，ｙ）の位置に位置する画素について、上記式の角度θを０〜πまで順次変化させ、この角度θの変化に対応して得られる距離ρをρ−θ座標上にプロットしていくと、ある画素を通る全ての直線を、ρ−θ座標上（極座標上）で曲線として表すことができる。この曲線がハフ曲線であり、このハフ曲線を求める処理がハフ変換である。 Then, for the pixel located at the position of the coordinate (x, y), the angle θ of the above equation is sequentially changed from 0 to π, and the distance ρ obtained corresponding to the change of the angle θ is put on the ρ−θ coordinate. When plotted, all straight lines passing through a certain pixel can be represented as curves on the ρ-θ coordinates (on the polar coordinates). This curve is a Hough curve, and the process for obtaining this Hough curve is the Hough transform.

角度算出部２０５は、ハフ変換部２０４における領域単位でのハフ変換の結果を基に、分割領域単位で画像の傾き角度を算出する。具体的には、角度算出部２０５は、ハフ変換部２０４においてハフ変換されたｘ−ｙ平面における直線の傾き角度を入力画像のスキュー角として分割領域ごとに算出する。この角度算出部２０５はハフ変換部２０４と共に、特許請求の範囲の角度検出手段を構成している。 The angle calculation unit 205 calculates the inclination angle of the image in units of divided regions based on the result of the Hough conversion in units of regions in the Hough conversion unit 204. Specifically, the angle calculation unit 205 calculates the inclination angle of the straight line in the xy plane subjected to the Hough transform by the Hough transform unit 204 as the skew angle of the input image for each divided region. The angle calculation unit 205, together with the Hough conversion unit 204, constitutes the angle detection means in the claims.

全体角度算出部２０６は、特許請求の範囲のスキュー角算出手段に相当し、角度算出部２０５で算出された分割領域ごとの画像の傾き角度を基に分割前の画像全体のスキュー角を算出する。 The overall angle calculation unit 206 corresponds to the skew angle calculation means in the claims, and calculates the skew angle of the entire image before division based on the inclination angle of the image for each divided region calculated by the angle calculation unit 205. .

全体角度算出部２０６における画像全体のスキュー角を算出する手法としては、次のような方式が考えられる。ただし、これらは一例に過ぎず、これらに限定されるものではない。 As a method for calculating the skew angle of the entire image in the overall angle calculation unit 206, the following method can be considered. However, these are merely examples, and the present invention is not limited to these.

方式１：算出された角度の平均を画像全体のスキュー角とする。
方式２：算出された角度のうち、算出頻度の大きかった角度を画像全体のスキュー角とする。
方式３：算出された角度に対して、分割領域のサイズを係数とした重み付けの平均を画像全体のスキュー角とする。
方式４：算出された角度に対して、分割領域中のＯＮ画素数を係数とした重み付けの平均を画像全体のスキュー角とする。 Method 1: The average of the calculated angles is set as the skew angle of the entire image.
Method 2: Of the calculated angles, the angle with the highest calculation frequency is set as the skew angle of the entire image.
Method 3: With respect to the calculated angle, an average of weights using the size of the divided area as a coefficient is set as the skew angle of the entire image.
Method 4: With respect to the calculated angle, an average of weighting using the number of ON pixels in the divided area as a coefficient is set as the skew angle of the entire image.

スキュー補正部２０７は、全体角度算出部２０６で算出された画像全体のスキュー角を基に、当該スキュー角がゼロになるように入力画像を回転させることによってスキュー補正を行う。画像を回転させる手法としては、例えば、アフィン（Ａｆｆｉｎｅ）変換などの周知の手法を用いることができる。 Based on the skew angle of the entire image calculated by the overall angle calculation unit 206, the skew correction unit 207 performs skew correction by rotating the input image so that the skew angle becomes zero. As a method for rotating the image, for example, a known method such as affine transformation can be used.

上記構成の画像処理装置２０Ａの各構成要素、即ち２値化部２０１、白領域探索部２０２、領域分割部２０３、ハフ変換部２０４、角度算出部２０５、全体角度算出部２０６およびスキュー補正部２０７については、ＰＣ（パーソナルコンピュータ）のように、所定プログラムを実行することによって情報記憶処理、画像処理、演算処理等の各機能を実行するコンピュータ機器を利用してソフトウェア構成によって実現することが考えられる。 Each component of the image processing apparatus 20A having the above configuration, that is, a binarizing unit 201, a white region searching unit 202, a region dividing unit 203, a Hough transform unit 204, an angle calculating unit 205, an overall angle calculating unit 206, and a skew correcting unit 207. As for PC, it is conceivable to realize a software configuration using a computer device that executes functions such as information storage processing, image processing, and arithmetic processing by executing a predetermined program, such as a PC (personal computer). .

ただし、ソフトウェア構成による実現に限られるものではなく、ハードウェア構成、あるいはハードウェアとソフトウェアの複合構成によって実現することも可能である。ソフトウェア構成によって実現する場合、２値化部２０１、白領域探索部２０２、領域分割部２０３、ハフ変換部２０４、角度算出部２０５、全体角度算出部２０６およびスキュー補正部２０７としてコンピュータを機能させるプログラムが本発明による画像処理プログラムとなる。 However, the present invention is not limited to the implementation by software configuration, and can also be implemented by a hardware configuration or a combined configuration of hardware and software. When realized by a software configuration, a program that causes a computer to function as the binarization unit 201, the white region search unit 202, the region division unit 203, the Hough transform unit 204, the angle calculation unit 205, the overall angle calculation unit 206, and the skew correction unit 207 Is an image processing program according to the present invention.

また、以下に説明する画像処理シーケンスにおける各ステップの処理を実行するプログラムが、本発明による画像処理プログラムであるとも言える。そして、これらの画像処理プログラムについては、あらかじめコンピュータ内にインストールしておくことが考えられる。ただし、あらかじめインストールされているのではなく、コンピュータ読み取り可能な記憶媒体に格納されて提供されるものであっても良く、または有線若しくは無線による通信手段を介して配信されるものであっても良い。 It can also be said that the program for executing the processing of each step in the image processing sequence described below is the image processing program according to the present invention. These image processing programs may be installed in the computer in advance. However, it may be provided by being stored in a computer-readable storage medium instead of being installed in advance, or distributed via wired or wireless communication means. .

（スキュー角検出・補正処理）
次に、入力画像のスキュー角を検出し、その検出角度を基にスキュー補正を行う第１実施形態に係るスキュー角検出・補正処理の手順の一例について、図４のフローチャートを用いて説明する。 (Skew angle detection / correction processing)
Next, an example of a procedure of skew angle detection / correction processing according to the first embodiment in which a skew angle of an input image is detected and skew correction is performed based on the detected angle will be described with reference to the flowchart of FIG.

先ず、入力画像データを取り込み（ステップＳ１１）、次いで、この取り込んだ画像データを２値化して２値画像データを作成する（ステップＳ１２）。次に、２値画像データを基に当該画像データに基づく２値画像中から白領域（白帯）を探索し（ステップＳ１３）、次いで、この探索した白領域を基準にして２値化画像複数の領域に分割する（ステップＳ１４）。 First, input image data is captured (step S11), and then the captured image data is binarized to generate binary image data (step S12). Next, based on the binary image data, a white area (white band) is searched from the binary image based on the image data (step S13), and then a plurality of binarized images are searched based on the searched white area. (Step S14).

ここで、分割領域の数をｎとするとき、分割領域の番号ｉを１にセットし（ステップＳ１５）、ｉ番目の分割領域に対してハフ変換処理を行い（ステップＳ１６）、次いで、ハフ変換結果を基にｉ番目の分割領域の傾き角度を算出し（ステップＳ１７）、次いで、分割領域の番号ｉをインクリメントする（ステップＳ１８）。 Here, when the number of divided areas is n, the divided area number i is set to 1 (step S15), the Hough transform process is performed on the i-th divided area (step S16), and then the Hough transform is performed. Based on the result, the inclination angle of the i-th divided region is calculated (step S17), and then the number i of the divided region is incremented (step S18).

次に、分割領域の番号ｉが分割領域数ｎを越えたか否かを判断し（ステップＳ１９）、分割領域数ｎを越えていなければ、ステップＳ１６に戻ってｎ番目の分割領域まで、各分割領域ごとにステップＳ１６からステップＳ１８までの処理を繰り返して実行する。 Next, it is determined whether or not the number i of the divided areas exceeds the number n of divided areas (step S19). If the number i does not exceed the number n of divided areas, the process returns to step S16 to reach the nth divided area. The processing from step S16 to step S18 is repeatedly executed for each region.

全ての分割領域について傾き角度の算出処理が終わったら、算出した分割領域ごとの傾き角度を基に分割前の画像全体のスキュー角を算出する（ステップＳ２０）。そして、算出した画像全体のスキュー角を基に入力画像を回転させることによってスキュー補正を行い（ステップＳ２１）、一連のスキュー角検出・補正処理を終了する。 When the calculation process of the inclination angle is completed for all the divided areas, the skew angle of the entire image before the division is calculated based on the calculated inclination angle for each divided area (step S20). Then, the skew correction is performed by rotating the input image based on the calculated skew angle of the entire image (step S21), and the series of skew angle detection / correction processing ends.

上述したように、画像中から白領域（白帯）を見つけ、当該白領域で区切られた領域単位にハフ変換による角度検出を行い、分割領域単位で検出した角度から画像全体のスキュー角を算出することにより、他の領域との干渉を防ぎつつ分割領域個々の角度検出を行うことができ、その結果、画像全体のスキュー角をより正確に検出できるために、スキュー角の検出およびスキュー補正を高精度にて実現できる。 As described above, a white area (white band) is found in the image, angle detection is performed by Hough transform for each area divided by the white area, and the skew angle of the entire image is calculated from the angle detected for each divided area. By doing so, it is possible to detect the angle of each divided area while preventing interference with other areas, and as a result, the skew angle of the entire image can be detected more accurately. Can be realized with high accuracy.

例えば、図７に示すように、画像に含まれる段落において上下あるいは左右に隣接する段落間で行が均等に傾いている（ずれている）場合であっても、分割領域ごとに、即ち段落ごとに傾き角度を検出することにより、他の段落との干渉を防ぎつつ段落個々の角度検出を行うことができるために、段落間での行の傾きを画像全体のスキュー角度と誤検出してしまうのを回避できる。これにより、画像全体のスキュー角をより正確に検出できるために、スキュー角の検出およびスキュー補正を高精度にて実現できる。 For example, as shown in FIG. 7, even if the lines included in the image are evenly inclined (shifted) between adjacent paragraphs in the vertical and horizontal directions, for each divided region, that is, for each paragraph By detecting the tilt angle, it is possible to detect the angle of each paragraph while preventing interference with other paragraphs. Therefore, the tilt of the line between paragraphs is erroneously detected as the skew angle of the entire image. Can be avoided. Thereby, since the skew angle of the whole image can be detected more accurately, the detection of the skew angle and the skew correction can be realized with high accuracy.

［第２実施形態］
図５は、本発明の第２実施形態に係る画像処理装置（画像処理部）２０Ｂの機能構成例を示すブロック図である。 [Second Embodiment]
FIG. 5 is a block diagram illustrating a functional configuration example of an image processing apparatus (image processing unit) 20B according to the second embodiment of the present invention.

図５に示すように、本実施形態に係る画像処理装置２０Ｂは、２値化部２１１、白領域探索部２１２、領域分割部２１３、ハフ変換部２１４、角度算出部２１５、領域統合部２１６、画像分割部２１７、スキュー補正部２１８および画像合成部２１９を有する構成となっている。 As shown in FIG. 5, the image processing apparatus 20B according to the present embodiment includes a binarization unit 211, a white region search unit 212, a region division unit 213, a Hough transform unit 214, an angle calculation unit 215, a region integration unit 216, The image division unit 217, the skew correction unit 218, and the image composition unit 219 are included.

２値化部２１１は、図１の画像データ入力部１０から入力される画像データを基に、画像中に含まれる例えば文字や線、絵柄や写真等の前景に属する画素をＯＮ画素（例えば、黒画素）、背景領域に属する画素をＯＦＦ画素（例えば、白画素）とした各画素１ｂｉｔの２値画像データを作成して白領域探索部２１２および領域分割部２１３に供給する。 Based on the image data input from the image data input unit 10 in FIG. 1, the binarizing unit 211 converts pixels belonging to the foreground such as characters, lines, patterns, photographs, and the like included in the image to ON pixels (for example, Black pixel) and 1-bit binary image data in which pixels belonging to the background region are OFF pixels (for example, white pixels) are generated and supplied to the white region search unit 212 and the region division unit 213.

白領域探索部２１２は、２値化部２１１から供給される２値画像データを基に、当該画像データに基づく２値画像中から白領域（白帯）を探索する。この白領域探索部２１２は、白領域探索を行う際に、２値画像に対して論理和縮小、具体的にはｍ×ｎ画素（ｍ，ｎは２以上の任意の整数）、例えば４×４画素を単位とし、当該単位中の１画素でもＯＮ画素ならその単位を１画素のＯＮ画素とすることによって画像の縮小を行う。このとき、第１実施形態の場合と同様に、段階的に縮小しても良いし、１回で所定の倍率に縮小しても良い。 Based on the binary image data supplied from the binarization unit 211, the white region search unit 212 searches for a white region (white band) from the binary image based on the image data. When the white area search unit 212 performs a white area search, the logical area reduction is performed on the binary image, specifically, m × n pixels (m and n are arbitrary integers of 2 or more), for example, 4 ×. If four pixels are used as a unit, and even one pixel in the unit is an ON pixel, the image is reduced by setting the unit as one ON pixel. At this time, as in the case of the first embodiment, the image may be reduced in stages, or may be reduced to a predetermined magnification at a time.

領域分割部２１３は、２値化部２１１で２値化された画像を、白領域探索部２１２で探索された白領域を基準にして複数の領域に分割する。 The area dividing unit 213 divides the image binarized by the binarizing unit 211 into a plurality of areas based on the white area searched by the white area searching unit 212.

ハフ変換部２１４は、領域分割部２１３で区切られた（分割された）領域単位で周知のハフ変換処理を行う。角度算出部２１５は、ハフ変換部２１４における領域単位でのハフ変換の結果を基に、分割領域単位で画像の傾き角度を算出する。この角度算出部２１５はハフ変換部２１４と共に、特許請求の範囲の角度検出手段を構成しており、第１実施形態の場合と同様にして角度を検出（算出）する。 The Hough transform unit 214 performs a well-known Hough transform process for each region divided (divided) by the region dividing unit 213. The angle calculation unit 215 calculates the inclination angle of the image in units of divided regions based on the result of the Hough conversion in units of regions in the Hough conversion unit 214. The angle calculation unit 215, together with the Hough conversion unit 214, constitutes an angle detection unit in the scope of claims, and detects (calculates) an angle in the same manner as in the first embodiment.

領域統合部２１６は、特許請求の範囲のスキュー角算出手段に相当し、角度算出部２１５で分割領域単位に算出された傾き角度を分割領域相互間で比較し、その比較結果が同一（概ね同一を含む）の傾き角度の場合に、同一の傾き角度ごとに分割領域を統合するとともに、同一の傾き角度を統合した領域のスキュー角とする。 The region integration unit 216 corresponds to the skew angle calculation means in the claims, compares the inclination angles calculated for each divided region by the angle calculation unit 215 between the divided regions, and the comparison result is the same (substantially the same). In addition, the divided areas are integrated for each of the same inclination angles, and the skew angle of the integrated area is set to the same inclination angle.

画像分割部２１７は、領域統合部２１６での統合結果を基に、統合した領域を単位として入力画像を分割する。 The image dividing unit 217 divides the input image in units of the integrated regions based on the integration result in the region integration unit 216.

スキュー補正部２１８は、画像分割部２１７で分割された画像ごとに、領域統合部２１６で統合した領域（分割した画像に対応）のスキュー角を基に、当該スキュー角がゼロになるように分割画像を回転させることによってスキュー補正を行う。画像を回転させる手法としては、例えば、アフィン（Ａｆｆｉｎｅ）変換などの周知の手法を用いることができる。 The skew correction unit 218 divides each image divided by the image division unit 217 so that the skew angle becomes zero based on the skew angle of the region (corresponding to the divided image) integrated by the region integration unit 216. Skew correction is performed by rotating the image. As a method for rotating the image, for example, a known method such as affine transformation can be used.

画像合成部２１９は、スキュー補正部２１８で各々スキュー補正された分割画像を合成して入力画像に対応したスキュー補正後の画像を生成して出力する。 The image synthesis unit 219 generates a skew-corrected image corresponding to the input image by combining the divided images that have been skew-corrected by the skew correction unit 218, and outputs the generated image.

上記構成の画像処理装置２０Ｂの各構成要素、即ち２値化部２１１、白領域探索部２１２、領域分割部２１３、ハフ変換部２１４、角度算出部２１５、領域統合部２１６、画像分割部２１７、スキュー補正部２１８および画像合成部２１９については、ＰＣのように、所定プログラムを実行することによって情報記憶処理、画像処理、演算処理等の各機能を実行するコンピュータ機器を利用してソフトウェア構成によって実現することが考えられる。 Each component of the image processing apparatus 20B having the above configuration, that is, a binarizing unit 211, a white region searching unit 212, a region dividing unit 213, a Hough transform unit 214, an angle calculating unit 215, a region integrating unit 216, an image dividing unit 217, The skew correction unit 218 and the image composition unit 219 are realized by a software configuration using a computer device that executes functions such as information storage processing, image processing, and arithmetic processing by executing a predetermined program like a PC. It is possible to do.

ただし、ソフトウェア構成による実現に限られるものではなく、ハードウェア構成、あるいはハードウェアとソフトウェアの複合構成によって実現することも可能である。ソフトウェア構成によって実現する場合、２値化部２１１、白領域探索部２１２、領域分割部２１３、ハフ変換部２１４、角度算出部２１５、領域統合部２１６、画像分割部２１７、スキュー補正部２１８および画像合成部２１９としてコンピュータを機能させるプログラムが本発明による画像処理プログラムとなる。 However, the present invention is not limited to the implementation by software configuration, and can also be implemented by a hardware configuration or a combined configuration of hardware and software. When realized by a software configuration, the binarization unit 211, the white region search unit 212, the region division unit 213, the Hough transform unit 214, the angle calculation unit 215, the region integration unit 216, the image division unit 217, the skew correction unit 218, and the image A program that causes a computer to function as the combining unit 219 is an image processing program according to the present invention.

（スキュー角検出・補正処理）
次に、入力画像のスキュー角を検出し、その検出角度を基にスキュー補正を行う第２実施形態に係るスキュー角検出・補正処理の手順の一例について、図６のフローチャートを用いて説明する。 (Skew angle detection / correction processing)
Next, an example of a procedure of skew angle detection / correction processing according to the second embodiment in which a skew angle of an input image is detected and skew correction is performed based on the detected angle will be described with reference to a flowchart of FIG.

先ず、入力画像データを取り込み（ステップＳ３１）、次いで、この取り込んだ画像データを２値化して２値画像データを作成する（ステップＳ３２）。次に、２値画像データを基に当該画像データに基づく２値画像中から白領域（白帯）を探索し（ステップＳ３３）、次いで、この探索した白領域を基準にして２値化画像複数の領域に分割する（ステップＳ３４）。 First, input image data is captured (step S31), and then the captured image data is binarized to generate binary image data (step S32). Next, based on the binary image data, a white area (white band) is searched from the binary image based on the image data (step S33), and then a plurality of binarized images are searched based on the searched white area. (Step S34).

ここで、分割領域の数をｎとするとき、分割領域の番号ｉを１にセットし（ステップＳ３５）、ｉ番目の分割領域に対してハフ変換処理を行い（ステップＳ３６）、次いで、ハフ変換結果を基にｉ番目の分割領域の傾き角度を算出し（ステップＳ３７）、次いで、分割領域の番号ｉをインクリメントする（ステップＳ３８）。 Here, when the number of divided areas is n, the divided area number i is set to 1 (step S35), the Hough transform process is performed on the i-th divided area (step S36), and then the Hough transform is performed. Based on the result, the inclination angle of the i-th divided region is calculated (step S37), and then the number i of the divided region is incremented (step S38).

次に、分割領域の番号ｉが分割領域数ｎを越えたか否かを判断し（ステップＳ３９）、分割領域数ｎを越えていなければ、ステップＳ３６に戻ってｎ番目の分割領域まで、各分割領域ごとにステップＳ３６からステップＳ３８までの処理を繰り返して実行する。 Next, it is determined whether or not the number i of the divided areas exceeds the number n of divided areas (step S39). If the number i does not exceed the number n of divided areas, the process returns to step S36 to the nth divided area. The processing from step S36 to step S38 is repeated for each region.

全ての分割領域について傾き角度の算出処理が終わったら、分割領域単位に算出した傾き角度を分割領域相互間で比較し、その比較結果が同一（概ね同一を含む）の傾き角度の場合に、同一の傾き角度ごとに分割領域を統合するとともに、同一の傾き角度を統合した領域のスキュー角とする（ステップＳ４０）。 After the calculation of the tilt angle is completed for all the divided areas, the tilt angles calculated for each divided area are compared between the divided areas. If the comparison results are the same (including almost the same), the same. The divided areas are integrated for each inclination angle, and the skew angle of the integrated area is set to the same inclination angle (step S40).

次に、分割領域についての統合結果を基に、統合した領域を単位として入力画像を分割し（ステップＳ４１）、次いで、分割した画像ごとに、統合した領域のスキュー角を基に分割画像を回転させることによってスキュー補正を行い（ステップＳ４２）、次いで、スキュー補正した各分割画像を合成し（ステップＳ４３）、一連のスキュー角検出・補正処理を終了する。 Next, based on the integration result for the divided areas, the input image is divided in units of the integrated areas (step S41), and then, for each divided image, the divided images are rotated based on the skew angle of the integrated area. Thus, the skew correction is performed (step S42), and then the divided images subjected to the skew correction are synthesized (step S43), and the series of skew angle detection / correction processing is completed.

上述したように、画像中から白領域（白帯）を見つけ、当該白領域で区切られた領域単位にハフ変換による角度検出を行い、分割領域単位で検出した角度からスキュー角を算出することにより、他の領域との干渉を防ぎつつ分割領域個々の角度検出を行うことができ、さらに同一の角度が検出された領域を統合して検出角度を統合領域のスキュー角として検出することができ、その結果、画像全体のスキュー角をより正確に検出できるために、スキュー角の検出およびスキュー補正を高精度にて実現できる。 As described above, by finding a white area (white band) from the image, performing angle detection by Hough transform for each area segmented by the white area, and calculating a skew angle from the angle detected for each divided area , It is possible to detect the angle of each of the divided areas while preventing interference with other areas, furthermore, by integrating the areas where the same angle is detected, the detection angle can be detected as the skew angle of the integrated area, As a result, since the skew angle of the entire image can be detected more accurately, the detection of the skew angle and the skew correction can be realized with high accuracy.

例えば、図８に示すように、見開き原稿のように、左右ページで画像の傾き角度が異なる場合であっても、統合した領域ごとに、即ち左右ページの両画像ごとにスキュー角を求めるとともに、統合した領域を基に入力画像を分割し、各分割画像ごとスキュー補正を行うことにより、他のページとの干渉を防ぎつつページ個々の角度検出を行い、かつスキュー補正を行うことができるために、左右ページの両画像ごとにスキュー角の検出およびスキュー補正を高精度にて実現できる。 For example, as shown in FIG. 8, even when the image tilt angle is different between the left and right pages as in a spread document, the skew angle is obtained for each integrated region, that is, for both the left and right pages, By dividing the input image based on the integrated area and performing skew correction for each divided image, it is possible to detect the angle of each page and prevent skew correction while preventing interference with other pages. In addition, skew angle detection and skew correction can be realized with high accuracy for both the left and right page images.

本発明による画像処理装置を含む画像処理システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the image processing system containing the image processing apparatus by this invention. 本発明の第１実施形態に係る画像処理装置の機能構成例を示すブロック図である。It is a block diagram which shows the function structural example of the image processing apparatus which concerns on 1st Embodiment of this invention. 白領域探索の際の画像縮小についての説明図である。It is explanatory drawing about image reduction in the case of a white area search. 第１実施形態に係るスキュー角検出・補正処理の手順の一例を示すフローチャートである。It is a flowchart which shows an example of the procedure of the skew angle detection and correction process which concerns on 1st Embodiment. 本発明の第２実施形態に係る画像処理装置の機能構成例を示すブロック図である。It is a block diagram which shows the function structural example of the image processing apparatus which concerns on 2nd Embodiment of this invention. 第２実施形態に係るスキュー角検出・補正処理の手順の一例を示すフローチャートである。It is a flowchart which shows an example of the procedure of the skew angle detection and correction process which concerns on 2nd Embodiment. 画像に含まれる段落において上下あるいは左右に隣接する段落間で行が均等に傾いている状態を示す図である。It is a figure which shows the state in which the line inclines equally between the paragraphs adjacent to the upper and lower sides or right and left in the paragraph contained in an image. 左右ページで画像の傾き角度が異なる状態を示す図である。It is a figure which shows the state from which the inclination angle of an image differs in a right-and-left page.

Explanation of symbols

１０…画像データ入力部、２０，２０Ａ，２０Ｂ…画像処理装置（画像処理部）、３０…画像データ出力部、２１…ＣＰＵ、２２…Ｉ／Ｏ回路、２３…ＲＯＭ、２４…ＲＡＭ、２５…ＨＤＤ（ハードディスクドライブ）装置、２６…バスライン、２０１，２１１…２値化部、２０２，２１２…白領域探索部、２０３，２１３…領域分割部、２０４，２１４…ハフ変換部、２０５，２１５…角度算出部、２０６…全体角度算出部、２０７，２１８…スキュー補正部、２１６…領域統合部、２１７…画像分離部、２１９…画像合成部 DESCRIPTION OF SYMBOLS 10 ... Image data input part, 20, 20A, 20B ... Image processing apparatus (image processing part), 30 ... Image data output part, 21 ... CPU, 22 ... I / O circuit, 23 ... ROM, 24 ... RAM, 25 ... HDD (Hard Disk Drive) device, 26 ... bus line, 201, 211 ... binarization unit, 202, 212 ... white area search unit, 203, 213 ... area division unit, 204, 214 ... Hough conversion unit, 205, 215 ... Angle calculation unit, 206 ... Overall angle calculation unit, 207, 218 ... Skew correction unit, 216 ... Area integration unit, 217 ... Image separation unit, 219 ... Image composition unit

Claims

Search means for searching for a white area in the input image;
Area dividing means for dividing the input image into a plurality of areas based on the white area searched by the search means;
Angle detection means for performing angle detection by Hough transform for each area divided by the area dividing means;
A skew angle calculating means for obtaining a skew angle based on an angle of a region unit detected by the angle detecting means;
An image processing apparatus comprising: a skew correction unit that performs skew correction on the input image based on the skew angle obtained by the skew angle calculation unit.

The image processing according to claim 1, wherein the skew angle calculation unit calculates a skew angle of the entire image before the division by the region dividing unit based on the angle of the region unit detected by the angle detection unit. apparatus.

The skew angle calculation means compares the angle of the area unit detected by the angle detection means between areas, and in the case of the same angle, the areas are integrated for each same angle and the same angle is integrated. The image processing apparatus according to claim 1, wherein a skew angle of the region is used.

A search step for searching for a white region in the input image;
A region dividing step of dividing the input image into a plurality of regions based on the white region searched in the searching step;
An angle detection step of performing angle detection by Hough transform on the region units divided in the region division step;
A skew angle calculating step for obtaining a skew angle based on the angle of the region unit detected in the angle detecting step;
An image processing program that causes a computer to execute each processing of a skew correction step of performing skew correction on the input image based on the skew angle obtained in the skew angle calculation step.