JP6272219B2

JP6272219B2 - Image processing apparatus, image processing method, and program

Info

Publication number: JP6272219B2
Application number: JP2014265573A
Authority: JP
Inventors: 誠榎本; 洋介五十嵐
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2014-12-26
Filing date: 2014-12-26
Publication date: 2018-01-31
Anticipated expiration: 2034-12-26
Also published as: JP2016126446A

Description

本発明は、撮像画像から得られた領域候補の中から正しい領域を特定する技術に関する。 The present invention relates to a technique for specifying a correct region from region candidates obtained from a captured image.

カメラ付き携帯電話の普及により、ポスターやホワイトボード、名刺、配布資料など、様々な紙面をカメラによってスキャン（撮像）し、電子文書化する技術が開発されている。しかしながら、撮像された紙面の画像にはカメラと紙面の位置関係に応じて台形状の歪みが生じる。このような歪みを補正し、カメラを従来スキャナの代替として利用する為には、撮像画像における紙面の境界を示す紙面領域を正確に取得する必要がある。 With the widespread use of camera-equipped mobile phones, a technology has been developed that scans (images) various papers such as posters, whiteboards, business cards, and handouts with a camera, and converts them into electronic documents. However, a trapezoidal distortion occurs in the captured image of the paper according to the positional relationship between the camera and the paper. In order to correct such distortion and use the camera as an alternative to the conventional scanner, it is necessary to accurately acquire a paper area indicating the boundary of the paper surface in the captured image.

特許文献１では、画像から原稿の境界を検出するために、領域候補全体の内側と外側の画素値を用いて、領域候補の内側と外側の分離度を求め、分離度の高い領域候補を選択する方法が開示されている。 In Patent Document 1, in order to detect a document boundary from an image, the inner and outer pixel values of the entire area candidate are used to determine the inner and outer separation degrees of the area candidate, and the area candidate having a high degree of separation is selected. A method is disclosed.

特開２０１３−１０５２７６号公報JP 2013-105276 A

しかし、特許文献１の方法では、領域候補全体の内側と外側の画素値を用いて分離度を求めている。環境光などの影響によって領域候補の内側と外側の領域において画素値のバラツキが大きい（分散値が高い）場合や、背景の画素値がそもそも均一ではない場合に領域候補の内側と外側の画素値の分離度が低くなってしまう場合がる。 However, in the method of Patent Document 1, the degree of separation is obtained using pixel values inside and outside the entire region candidate. Pixel values on the inside and outside of the area candidate when the variation in pixel values is large (the variance value is high) in the area inside and outside the area candidate due to the influence of ambient light, etc., or when the background pixel value is not uniform in the first place The degree of separation may be low.

その場合に、領域の境界を正しく示す紙面領域候補であっても、特許文献１の方法で領域候補全体の内側と外側の画素値の分離度を算出すると、低い分離度となってしまい、領域候補を正しく選択できない問題がある。 In that case, even if it is a paper area candidate that correctly indicates the boundary of the area, if the degree of separation between the inner and outer pixel values of the whole area candidate is calculated by the method of Patent Document 1, the degree of separation becomes low. There is a problem that candidates cannot be selected correctly.

本発明はこのような問題点に鑑みなされたもので、背景が均一ではない環境において撮像された領域の画像に対しても、領域の境界を正しく示す紙面領域候補から注目領域を精度良く特定することを目的とする。 The present invention has been made in view of such problems, and for a region imaged in an environment where the background is not uniform, the region of interest is accurately identified from the paper region candidates that correctly indicate the boundary of the region. For the purpose.

上記の課題を解決するために、本発明の画像処理装置は、複数の線分で構成される領域を領域候補として入力する入力手段と、前記領域候補を構成する各線分における、所定大きさ以下の複数の領域において、前記線分を境界とする両側の画像特徴量の関係を示す第一情報と、前記領域候補の各角における所定大きさ以下の領域において前記領域候補の内側と外側の画像特徴量の関係を示す第二情報と、を出力する出力手段と、前記複数の領域のそれぞれに対応する前記第一情報と前記第二情報とを用いて、前記領域候補を注目領域として特定する特定手段と、を有する。 In order to solve the above-described problems, an image processing apparatus according to the present invention includes an input unit that inputs a region composed of a plurality of line segments as a region candidate, and a predetermined size or less in each line segment that forms the region candidate. In the plurality of regions, the first information indicating the relationship between the image feature amounts on both sides with the line segment as a boundary, and images on the inside and outside of the region candidate in a region having a predetermined size or less at each corner of the region candidate The region candidate is identified as a region of interest using output means for outputting second information indicating the relationship between feature amounts, and the first information and the second information corresponding to each of the plurality of regions. And a specifying means.

本発明によれば、背景が均一ではない環境において撮像された領域の画像に対しても、領域の境界を正しく示す紙面領域を精度良く特定することができる。 According to the present invention, it is possible to accurately specify a paper area that correctly indicates the boundary of an area even for an image of an area captured in an environment where the background is not uniform.

画像処理装置の構成を示す図である。It is a figure which shows the structure of an image processing apparatus. 紙面検出処理の概要を示す図である。It is a figure which shows the outline | summary of a paper surface detection process. 紙面検出処理を示すフローチャートである。It is a flowchart which shows a paper surface detection process. エッジ検出処理を行う際の機能構成を示す図である。It is a figure which shows the function structure at the time of performing an edge detection process. エッジ検出処理を示すフローチャートである。It is a flowchart which shows an edge detection process. 第１のノイズ判定処理を説明する図である。It is a figure explaining 1st noise determination processing. 第２のノイズ判定処理を説明する図である。It is a figure explaining the 2nd noise judgment processing. 線分出力処理を示すフローチャートである。It is a flowchart which shows a line segment output process. 線分フィッティング処理を示すフローチャートである。It is a flowchart which shows a line segment fitting process. 線分フィッティングにおける分離度の概念を示す図である。It is a figure which shows the concept of the isolation | separation in line segment fitting. 線分フィッティング処理の具体例を示す図である。It is a figure which shows the specific example of a line segment fitting process. 線分候補選択処理の具体例を示す図である。It is a figure which shows the specific example of a line segment candidate selection process. エッジ画素の追加処理の概要を示す図である。It is a figure which shows the outline | summary of the addition process of an edge pixel. 線分出力処理を行う際の機能構成を示す図である。It is a figure which shows the function structure at the time of performing a line segment output process. 紙面領域特定処理を示すフローチャートである。It is a flowchart which shows a paper surface area | region identification process. 紙面領域特定処理を行う際の機能構成図である。It is a functional block diagram at the time of performing a paper surface area | region specific process. 紙面領域候補の辺近傍領域の分割例を示す図である。It is a figure which shows the example of a division | segmentation of the edge region of a paper surface area candidate. 分割領域のスコア算出処理を示すフローチャートである。It is a flowchart which shows the score calculation process of a division area. 分割領域の評価値の例を示す図である。It is a figure which shows the example of the evaluation value of a division area. 紙面領域候補の分割領域例を示す図である。It is a figure which shows the example of the division area of a paper surface area | region candidate. 分割領域の評価値の例を示す図である。It is a figure which shows the example of the evaluation value of a division area. 紙面領域候補の選択処理の具体例を示す図である。It is a figure which shows the specific example of the selection process of a paper surface area candidate. 第２の実施形態の紙面領域候補の例を示す図である。It is a figure which shows the example of the paper surface area | region candidate of 2nd Embodiment. 第３の実施形態における分割領域のスコア算出処理を示すフローチャートである。It is a flowchart which shows the score calculation process of the division area in 3rd Embodiment. 第３の実施形態の紙面領域候補の例を示す図である。It is a figure which shows the example of the paper surface area | region candidate of 3rd Embodiment.

以下、本発明の実施形態について、図面を参照して説明する。以下では本発明を適用する際の一例を説明するが、本実施の形態で説明される特徴の全てが本発明に必須のものとは限らない。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. In the following, an example when the present invention is applied will be described. However, not all the features described in the present embodiment are necessarily essential to the present invention.

＜第１の実施形態＞
●装置構成
図１は、本実施形態における画像処理装置の構成を示す図である。本実施形態の画像処理装置は、撮像された画像データの入力を行う画像入力部１０１と、ＣＰＵ１０２、ＲＡＭ１０３、記憶部１０４、を有する。ＣＰＵ１０２は画像データに本実施形態の線分出力処理を施す画像処理プログラムを実行する。ＲＡＭ１０３は該プログラムを実行する際のワークメモリやデータの一時保存などに利用される。該プログラムやデータは、記憶部１０４に格納される。 <First Embodiment>
Apparatus Configuration FIG. 1 is a diagram illustrating a configuration of an image processing apparatus according to the present embodiment. The image processing apparatus according to the present embodiment includes an image input unit 101 that inputs captured image data, a CPU 102, a RAM 103, and a storage unit 104. The CPU 102 executes an image processing program for performing line segment output processing of the present embodiment on image data. The RAM 103 is used for temporary storage of work memory and data when executing the program. The program and data are stored in the storage unit 104.

なお、図１に示す画像処理装置の構成は一例であり、ここで示した以外の構成要素を含んでいても良い。また、外部の汎用コンピュータなどを用いて画像処理を実行しても良いし、電子回路上で画像処理を実行しても良い。さらに、画像入力部１０１に入力する画像データは撮像画像に限らず、人工的に作成された画像でも良い。 The configuration of the image processing apparatus illustrated in FIG. 1 is an example, and may include components other than those illustrated here. Further, image processing may be executed using an external general-purpose computer or the like, or image processing may be executed on an electronic circuit. Furthermore, the image data input to the image input unit 101 is not limited to a captured image, and may be an artificially created image.

●紙面検出処理概要
図２は、本実施形態の画像処理装置による紙面検出処理の概要を示す図である。図２（ａ）は画像入力部１０１によって取得する撮像画像データの例であり、複数枚の紙面が配置されている。この撮像画像データに対して本実施形態による紙面検出処理を施すと、図２（ｂ）に示すように各紙面の領域を表す四角形が得られる。以下、このような紙面と背景との境界を表す直線が成す四角形の領域を、紙面領域と称する。また本実施形態では、撮像画像の各画素は輝度値により表現されるものとして説明を行うが、これにより本発明が輝度値による画像データに限定されるものではない。 Outline of Paper Surface Detection Process FIG. 2 is a diagram showing an outline of the paper face detection process by the image processing apparatus of the present embodiment. FIG. 2A is an example of captured image data acquired by the image input unit 101, and a plurality of sheets of paper are arranged. When the paper surface detection process according to the present embodiment is performed on the captured image data, a quadrangle representing the area of each paper surface is obtained as shown in FIG. Hereinafter, a rectangular area formed by a straight line representing the boundary between the paper surface and the background is referred to as a paper surface region. In the present embodiment, each pixel of the captured image is described as being expressed by a luminance value. However, the present invention is not limited to image data by a luminance value.

図３は、本実施形態における紙面検出処理を示すフローチャートである。上述したように本処理は、ＣＰＵ１０２がプログラムを実行することによって実現される。 FIG. 3 is a flowchart showing the paper surface detection process in the present embodiment. As described above, this processing is realized by the CPU 102 executing the program.

まず、ステップＳ３０１において、画像入力部１０１で取得された撮像画像データに対してエッジ検出を行う。この処理によって、撮像画像データにおいて輝度勾配の強度が大きい画素がエッジ画素として得られる。エッジ検出処理では、ノイズ相当のエッジを除去する。ここでノイズ相当のエッジとは、文書の辺を構成するエッジではないと推定されるエッジである。 First, in step S301, edge detection is performed on the captured image data acquired by the image input unit 101. By this processing, pixels having a high intensity gradient in the captured image data are obtained as edge pixels. In the edge detection process, an edge corresponding to noise is removed. Here, the edge corresponding to noise is an edge that is estimated not to constitute an edge of the document.

次に、ステップＳ３０２において、ステップＳ３０１で得たエッジ画素の集合からエッジ画素の点列によって構成される直線を生成する。直線生成は、ハフ変換やラドン変換など公知の方法によって行う。続いて、ステップＳ３０３において、ステップＳ３０２で生成された直線とエッジ画素の集合に基づき、線分群を取得する。本実施形態では取得された複数の線分から信頼度の高い線分を紙面領域の境界を示す線分として出力する。なお、本実施形態では、直線は端点を持たず、線分は２つの端点を持つとして区別する。 Next, in step S302, a straight line constituted by a sequence of edge pixels is generated from the set of edge pixels obtained in step S301. The straight line is generated by a known method such as Hough transform or Radon transform. Subsequently, in step S303, a line segment group is acquired based on the set of straight lines and edge pixels generated in step S302. In the present embodiment, a highly reliable line segment is output as a line segment indicating the boundary of the paper area from the plurality of acquired line segments. In this embodiment, a straight line has no end points, and a line segment is distinguished as having two end points.

続いて、ステップＳ３０４において、各線分の位置関係をもとに線分を４本ずつ選択し、紙面領域の候補となる四角形を生成する。そして、最後にステップＳ３０５で、ステップＳ３０４で生成した四角形から、紙面領域を示す四角形を特定する。 Subsequently, in step S304, four line segments are selected on the basis of the positional relationship of each line segment, and a quadrilateral that is a candidate for the paper area is generated. Finally, in step S305, a quadrangle indicating a paper area is specified from the quadrangle generated in step S304.

以下、図３に示すフローチャートにおける各処理について、詳細に説明する。 Hereinafter, each process in the flowchart shown in FIG. 3 will be described in detail.

●エッジ検出処理
ステップＳ３０１におけるエッジ検出処理について、図４〜７を用いて詳細に説明する。図４は、本実施形態の画像処理装置においてエッジ検出処理を行う際の機能構成を示す図であり、図５は本実施形態のエッジ検出処理を示すフローチャートである。 Edge detection processing The edge detection processing in step S301 will be described in detail with reference to FIGS. FIG. 4 is a diagram illustrating a functional configuration when performing edge detection processing in the image processing apparatus of the present embodiment, and FIG. 5 is a flowchart illustrating edge detection processing of the present embodiment.

まず、ステップＳ５０１で撮像画像入力部４０１が、処理対象画像を入力する。次に、ステップＳ５０２でエッジ画像作成部４０２が、入力画像からエッジ画素を抽出したエッジ画像を作成する。このエッジ画像の作成には、エッジ検出のキャニー法（ＪｏｈｎＦ．Ｃａｎｎｙ）等、周知の手法を用いるとする。そして、ステップＳ５０３で連結画素ブロック作成部４０３が、画素と画素が連なっている連結画素ブロックを作成する。ここで連結画素ブロックとは、エッジ画素が８近傍で連結している画素集合のことである。 First, in step S501, the captured image input unit 401 inputs a processing target image. Next, in step S502, the edge image creation unit 402 creates an edge image obtained by extracting edge pixels from the input image. It is assumed that a known method such as an edge detection canny method (John F. Canny) is used to create the edge image. In step S503, the connected pixel block creation unit 403 creates a connected pixel block in which pixels are connected. Here, the connected pixel block is a pixel set in which edge pixels are connected in the vicinity of eight.

そして、ステップＳ５０４で外接矩形作成部４０４が、ステップＳ５０３で作成された連結画素ブロックのそれぞれに外接する矩形を作成する。この時点で、例えば図２（ａ）に示す入力画像であれば、紙面と背景の境界による紙面端エッジと、背景のテクスチャや文書中の文字によるノイズとすべきエッジとの両方が存在している。撮像画像から紙面領域を検出する際には、このノイズとすべきエッジを除去して紙面端エッジのみを抽出することが望ましい。 In step S504, the circumscribed rectangle creating unit 404 creates a rectangle circumscribing each of the connected pixel blocks created in step S503. At this time, for example, in the case of the input image shown in FIG. 2A, there are both the edge of the paper surface due to the boundary between the paper surface and the background, and the edge that should be noise due to the background texture and characters in the document. Yes. When detecting a paper area from a captured image, it is desirable to remove only the edge that should be noise and extract only the edge of the paper.

そこで、本実施形態ではステップＳ５０５以降において、外接矩形の特性を示す各種パラメータから、外接矩形が対応する連結画素ブロックがノイズ相当か否かを判定する。ノイズ相当と判定されれば、この連結画素が除去される。これは、一般に紙面の境界領域に相当する紙面端エッジは直線であり、ノイズによるエッジよりも長い、もしくはノイズによるエッジは一直線に伸びるものではない、という性質を利用したものである。 Therefore, in this embodiment, after step S505, it is determined from the various parameters indicating the characteristics of the circumscribed rectangle whether or not the connected pixel block corresponding to the circumscribed rectangle is equivalent to noise. If it is determined that the noise is equivalent, the connected pixel is removed. This is because the edge of the paper surface corresponding to the boundary region of the paper surface is generally a straight line and is longer than the edge due to noise, or the edge due to noise does not extend in a straight line.

以下、本実施形態におけるステップＳ５０５〜ステップＳ５０９のノイズ除去処理について、詳細に説明する。 Hereinafter, the noise removal processing in steps S505 to S509 in the present embodiment will be described in detail.

まず、ステップＳ５０５で外接矩形パラメータ比較部４０５が、外接矩形の長辺の長さを所定の閾値（第１の閾値）と比較する。第１ノイズ判定部４０６がこの比較結果から、外接矩形が紙面端エッジを構成する領域（以下、境界領域）の候補であるか否かを判断する。これは、紙面端エッジの外接矩形であれば一方向に長く伸びている可能性が高いという性質を利用している。具体的には、外接矩形の長辺が第１の閾値より短いものは境界領域ではなく、抽出対象外のノイズであると判断してステップＳ５０９に進み、第１ノイズ除去部４０７がこの連結画素ブロックの除去を行う。一方、外接矩形の長辺の長さが第１の閾値以上であれば、境界領域の候補であるとしてこの連結画素ブロックを残したまま、第２のノイズ判定処理であるステップＳ５０６に進む。 First, in step S505, the circumscribed rectangle parameter comparison unit 405 compares the length of the long side of the circumscribed rectangle with a predetermined threshold (first threshold). From the comparison result, the first noise determination unit 406 determines whether or not the circumscribed rectangle is a candidate for a region (hereinafter referred to as a boundary region) constituting the edge of the paper surface. This utilizes the property that a rectangle that circumscribes the edge of the paper surface has a high possibility of extending in one direction. Specifically, if the long side of the circumscribed rectangle is shorter than the first threshold value, it is determined that it is not a boundary region but noise that is not to be extracted, and the process proceeds to step S509, where the first noise removal unit 407 Remove block. On the other hand, if the length of the long side of the circumscribed rectangle is equal to or greater than the first threshold value, the process proceeds to step S506, which is the second noise determination process, while leaving this connected pixel block as a boundary region candidate.

ここで図６に、ステップＳ５０５によるノイズ判定処理の具体例を示す。この例では、連結画素ブロック６０１に対する外接矩形６０２の長辺６０３の長さについては、その長さが第１の閾値よりも短いため、この外接矩形６０２を持つ連結画素ブロック６０１はノイズであると判断され、除去される。一方、連結画素ブロック６０４に対する外接矩形６０５の長辺６０６は第１の閾値よりも長いため、この外接矩形６０５を持つ連結画素ブロック６０４は境界領域の候補として残される。 FIG. 6 shows a specific example of the noise determination process in step S505. In this example, since the length of the long side 603 of the circumscribed rectangle 602 with respect to the connected pixel block 601 is shorter than the first threshold, the connected pixel block 601 having the circumscribed rectangle 602 is noise. Judged and removed. On the other hand, since the long side 606 of the circumscribed rectangle 605 for the connected pixel block 604 is longer than the first threshold value, the connected pixel block 604 having the circumscribed rectangle 605 is left as a candidate for the boundary region.

次に、ステップＳ５０６で外接矩形パラメータ比較部４０５が、外接矩形の縦横比（長辺の長さ／短辺の長さ）で所定の閾値（第２の閾値）と比較し、第１ノイズ判定部４０６がこの比較結果から、外接矩形が境界領域候補であるか否かを判断する。ここで、上記ステップＳ５０５の外接矩形の長辺の長さによるノイズでは、背景の大きな模様がある場合等、サイズの大きいノイズでは外接矩形の長辺の長さも長くなり、ノイズと判別されないことがある。これに対しステップＳ５０６のノイズ判定では、紙面の境界領域ならば一方向に長く伸びている可能性が高く、外接矩形が長方形に近くなることに対し、ノイズの場合の外接矩形は正方形に近くなるという性質を利用する。 Next, in step S506, the circumscribed rectangle parameter comparison unit 405 compares the circumscribed rectangle with an aspect ratio (long side length / short side length) with a predetermined threshold (second threshold) to determine the first noise. The unit 406 determines from the comparison result whether the circumscribed rectangle is a boundary region candidate. Here, in the noise due to the length of the long side of the circumscribed rectangle in step S505, the length of the long side of the circumscribed rectangle becomes long when the size of the noise is large, such as when there is a large background pattern, and may not be determined as noise. is there. On the other hand, in the noise determination in step S506, it is highly likely that the boundary region of the paper surface is elongated in one direction. The circumscribed rectangle is close to a rectangle, whereas the circumscribed rectangle in the case of noise is close to a square. This property is used.

具体的には、外接矩形の縦横比（長辺の長さ／短辺の長さ）は第２の閾値を超えていれば、境界領域の候補であると判断してステップＳ５０７に進み、この連結画素ブロックを残す。一方、外接矩形の縦横比が第２の閾値以下であればノイズの可能性があるとし、ステップＳ５０８のノイズ判定に進む。 Specifically, if the aspect ratio of the circumscribed rectangle (long side length / short side length) exceeds the second threshold, it is determined as a boundary region candidate, and the process proceeds to step S507. Leave connected pixel blocks. On the other hand, if the aspect ratio of the circumscribed rectangle is equal to or smaller than the second threshold, it is determined that there is a possibility of noise, and the process proceeds to the noise determination in step S508.

ここで図７に、ステップＳ５０６によるノイズ判定処理の具体例を示す。この例では、連結画素ブロック７０１に対する外接矩形７０２の縦横比（長辺の長さ／短辺の長さ）は第２の閾値よりも大きくなるため、境界領域の候補であると判断してこの連結画素ブロックを残す。一方、連結画素ブロック７０３に対する外接矩形７０４の縦横比は第２の閾値以下となり、ノイズの可能性ありと判定される。ただし、この判定方法では斜め方向に伸びた境界領域の連結画素ブロック７０５についても、その外接矩形７０６の縦横比は第２の閾値以下となってしまう。このような斜め方向に伸びた連結画素ブロック７０５については紙面端エッジである可能性も高いため、ここではまだノイズと判断せず、ステップＳ５０８の第３のノイズ判定に進む。 FIG. 7 shows a specific example of the noise determination process in step S506. In this example, since the aspect ratio (long side length / short side length) of the circumscribed rectangle 702 with respect to the connected pixel block 701 is larger than the second threshold, it is determined that this is a candidate for the boundary region. Leave connected pixel blocks. On the other hand, the aspect ratio of the circumscribed rectangle 704 with respect to the connected pixel block 703 is equal to or less than the second threshold value, and it is determined that there is a possibility of noise. However, in this determination method, the aspect ratio of the circumscribed rectangle 706 also becomes the second threshold value or less for the connected pixel block 705 in the boundary region extending in the oblique direction. Since there is a high possibility that the connected pixel block 705 extending in the oblique direction is an edge on the paper, it is not yet determined as noise, and the process proceeds to the third noise determination in step S508.

ステップＳ５０８で外接矩形パラメータ比較部４０５が、外接矩形内の連結画素ブロックの密度を所定の閾値（第３の閾値）と比較し、第１ノイズ判定部４０６がこの比較結果から、外接矩形が境界領域候補であるか否かの判断を行う。上述したように、斜めに伸びている画素ブロックが紙面端エッジであるかノイズであるかは外接矩形の縦横比では判定できないため、外接矩形内の連結画素ブロックの密度による判定を行う。 In step S508, the circumscribed rectangle parameter comparing unit 405 compares the density of the connected pixel blocks in the circumscribed rectangle with a predetermined threshold (third threshold), and the first noise determining unit 406 determines that the circumscribed rectangle is a boundary from the comparison result. It is determined whether or not it is a region candidate. As described above, whether the pixel block extending diagonally is an edge of the paper or noise cannot be determined by the aspect ratio of the circumscribed rectangle, and therefore, determination is performed based on the density of the connected pixel blocks in the circumscribed rectangle.

これは以下の性質を利用している。すなわち、外接矩形が紙面端エッジに対応する境界領域であれば、該矩形内では対角線上に連結画素ブロックがあるだけであるから、該矩形内での画素密度は低くなる。一方、ノイズであれば矩形内のいたるところ、少なくとも境界領域の場合よりも広く画素が分布するので、画素密度が高くなる。外接矩形内の連結画素ブロックの密度が第３の閾値以上であればノイズと判定してステップＳ５０９に進み、この連結画素ブロックの除去を行う。一方、密度が第３の閾値未満であれば境界領域候補と判定してステップＳ５０７に進み、この画素ブロックを残す。 This utilizes the following properties: That is, if the circumscribed rectangle is a boundary region corresponding to the edge of the paper surface, there is only a connected pixel block on the diagonal line in the rectangle, so that the pixel density in the rectangle is low. On the other hand, in the case of noise, the pixels are distributed everywhere in the rectangle more widely than at least in the boundary region, so that the pixel density is increased. If the density of the connected pixel block in the circumscribed rectangle is equal to or higher than the third threshold value, it is determined as noise and the process proceeds to step S509, and the connected pixel block is removed. On the other hand, if the density is less than the third threshold, it is determined as a boundary region candidate, and the process proceeds to step S507 to leave this pixel block.

ステップＳ５０８によるノイズ判定処理の具体例を、図７に示す。この例では、連結画素ブロック７０１についてはステップＳ５０６で既に境界領域候補として判定されているため、ステップＳ５０８での処理対象外である。連結画素ブロック７０３については、その外接矩形７０４内に連結画素ブロックが広く分布していることから画素密度が高く、ノイズであると判断されて該画素ブロックが除去される。一方、連結画素ブロック７０５については、その外接矩形７０６内での画素ブロックの広がりが対角線上のみであることから画素密度が低く、境界領域候補として判断されてこの画素ブロックが残される。 A specific example of the noise determination process in step S508 is shown in FIG. In this example, since the connected pixel block 701 has already been determined as a boundary region candidate in step S506, it is not subject to processing in step S508. Regarding the connected pixel block 703, since the connected pixel blocks are widely distributed within the circumscribed rectangle 704, the pixel density is high and the pixel block is determined to be noise. On the other hand, the connected pixel block 705 has a low pixel density because the spread of the pixel block within the circumscribed rectangle 706 is only on the diagonal line, and is determined as a boundary region candidate, and this pixel block remains.

以上、図５のフローチャートに示した一連の処理により、処理対象画像からノイズであると判定されたエッジが除去され、境界領域の候補である紙面端エッジが残された画像として出力される。このエッジ検出処理を経た画像であれば、後段の直線・線分出力処理によって境界領域を正しく検出することができる。 As described above, the edge determined to be noise is removed from the processing target image by the series of processing shown in the flowchart of FIG. 5, and the image is output as an image in which the edge of the paper surface that is a candidate for the boundary region remains. In the case of an image that has undergone this edge detection processing, the boundary region can be correctly detected by the subsequent straight line / line segment output processing.

●線分出力処理
ステップＳ３０３における線分出力処理について、図８〜１４を用いて詳細に説明する。 Line Segment Output Process The line segment output process in step S303 will be described in detail with reference to FIGS.

図１４は、本実施形態の画像処理装置において線分出力処理を行う際の機能構成を示す図である。撮像画像入力部１４０１は、撮像画像入力部４０１と同様に、処理対象画像を入力する。エッジ画像作成部１４０２は、ステップＳ５０２〜ステップＳ５０９の処理を行う。直線生成部１４０３は、ステップＳ３０２の直線を生成する。線分出力部１４０４は、本実施形態の線分出力処理を行う。また、線分出力部１４０４を複数のより詳細な機能構成部分に分けることができる。さらに、本実施形態は上述した構成に限定されず、例えば、エッジ画像作成部１４０２は、図４のエッジ画像作成部４０２と同様にステップＳ５０２のエッジ画像の作成処理だけを行ってもよい。 FIG. 14 is a diagram illustrating a functional configuration when performing line segment output processing in the image processing apparatus of the present embodiment. Similar to the captured image input unit 401, the captured image input unit 1401 inputs a processing target image. The edge image creation unit 1402 performs the processing from step S502 to step S509. The straight line generation unit 1403 generates a straight line in step S302. The line segment output unit 1404 performs line segment output processing according to the present embodiment. Further, the line segment output unit 1404 can be divided into a plurality of more detailed functional components. Furthermore, the present embodiment is not limited to the above-described configuration. For example, the edge image creation unit 1402 may perform only the edge image creation processing in step S502 as in the edge image creation unit 402 in FIG.

次に、本実施形態の詳細な機能構成部分について説明する。図８は、線分出力処理を示すフローチャートである。 Next, detailed functional components of the present embodiment will be described. FIG. 8 is a flowchart showing line segment output processing.

まず、ステップＳ８０１において、ステップＳ３０２で得た直線群に基づき、ステップＳ５０２で得たエッジ画素群からエッジ画素を選択し、ステップＳ３０１で得たエッジ画素群にこのエッジ画素を追加する。ステップＳ３０１で得られるエッジ画素群（エッジ画像）は、精度良く直線を生成する為にノイズ除去が行われている。しかしながら本実施形態における線分出力処理では、直線近傍にあるエッジ画素をもとに線分を生成するため、直線近傍のエッジ画素はノイズとして除去されないのが望ましい。そこでステップＳ８０１では、エッジ画素追加部１４０５が、ステップＳ５０２で得たエッジ画像に対して各直線から所定距離内（Ｎ１）にあるエッジ画素を、ステップＳ３０１で得たエッジ画像に追加する。得られたエッジ画素群、を以降の処理で用いる。なお、所定距離Ｎ１としては、例えば２ｍｍに相当する画素数を用いる。 First, in step S801, an edge pixel is selected from the edge pixel group obtained in step S502 based on the straight line group obtained in step S302, and this edge pixel is added to the edge pixel group obtained in step S301. The edge pixel group (edge image) obtained in step S301 is subjected to noise removal in order to generate a straight line with high accuracy. However, in the line segment output process in the present embodiment, line segments are generated based on edge pixels in the vicinity of the straight line, and therefore it is desirable that the edge pixels in the vicinity of the straight line are not removed as noise. Therefore, in step S801, the edge pixel adding unit 1405 adds edge pixels within a predetermined distance (N1) from each straight line to the edge image obtained in step S502 to the edge image obtained in step S301. The obtained edge pixel group is used in the subsequent processing. For example, the number of pixels corresponding to 2 mm is used as the predetermined distance N1.

エッジ画素追加処理の概略について、図１３を用いて説明する。図１３（ａ）は、ステップＳ５０２の処理によって作成されたエッジ画像の一例を示す。図１３（ｂ）は、ステップＳ５０７の処理によって残されたエッジ画素１３０１および、ステップＳ５０９の処理によってノイズとして除去されたエッジ画素１３０２を示す。このエッジ画素１３０１だけを含むエッジ画像がステップ３０１で得られるエッジ画像である。 An outline of the edge pixel addition processing will be described with reference to FIG. FIG. 13A shows an example of an edge image created by the process of step S502. FIG. 13B shows the edge pixel 1301 left by the process of step S507 and the edge pixel 1302 removed as noise by the process of step S509. An edge image including only the edge pixel 1301 is the edge image obtained in step 301.

図１３（ｃ）は、ステップＳ３０２の処理によって生成された直線１３０５を示す。直線１３０５に対して、エッジ画素が存在する部分に線分が存在し、エッジ画素が存在しない部分に線分が存在しないと推定する。直線１３０５の近傍（直線１３０５の周囲）において、エッジ画素とエッジ画素の間隔が所定値ｎ以上の場合は、そのエッジ画素を境界に直線１３０５を線分に分割する。直線１３０５を分割すると、図１３（ｄ）に示すように、短い線分１３０６が得られる。ここで、エッジ画素追加部１４０５が、直線１３０５から所定距離内にあって、ノイズとして除去されたエッジ画素１３０４をエッジ画素１３０１のエッジ画像に追加してから、直線１３０５を線分に分割すると、正しい線分１３０７が得られる。直線を線分に分割する処理は、後述するステップＳ８０２において線分抽出部１４０６が行う本実施形態の線分出力処理の一部である。 FIG. 13C shows a straight line 1305 generated by the process of step S302. With respect to the straight line 1305, it is estimated that a line segment exists in a portion where an edge pixel exists and a line segment does not exist in a portion where no edge pixel exists. In the vicinity of the straight line 1305 (around the straight line 1305), when the interval between the edge pixels is equal to or greater than a predetermined value n, the straight line 1305 is divided into line segments with the edge pixel as a boundary. When the straight line 1305 is divided, a short line segment 1306 is obtained as shown in FIG. Here, when the edge pixel adding unit 1405 adds the edge pixel 1304 that is within a predetermined distance from the straight line 1305 and removed as noise to the edge image of the edge pixel 1301, and then divides the straight line 1305 into line segments, A correct line segment 1307 is obtained. The process of dividing the straight line into line segments is a part of the line segment output process of the present embodiment performed by the line segment extraction unit 1406 in step S802 described later.

図８を用いて、線分出力処理を説明する。正しい線分を抽出するために、ステップＳ８０１では、ステップＳ３０２で得られたエッジ画素群に対して、ステップ３０３で生成された直線の近傍にあって、エッジ画像からノイズとして除去された画素群を加えたエッジ画素群を生成する。次に、ステップＳ８０２において、線分抽出部１４０６が、ステップＳ３０２で直線生成部１４０３によって生成された各直線を複数の線分に分割する。具体的には、まずステップＳ８０１で得られたエッジ画素群から、直線から所定距離内（Ｎ２）内にエッジ画素を抽出する。そして、抽出されたエッジ画素間の距離が所定値ｎより小さいのであれば、それらは同一線分の要素であると判定する。エッジ画素間の距離が所定値ｎより大きいのであれば、それらは異なる線分の要素であると判定する。 The line segment output process will be described with reference to FIG. In order to extract a correct line segment, in step S801, a pixel group that is in the vicinity of the straight line generated in step 303 and is removed as noise from the edge image with respect to the edge pixel group obtained in step S302. An added edge pixel group is generated. Next, in step S802, the line segment extraction unit 1406 divides each straight line generated by the straight line generation unit 1403 in step S302 into a plurality of line segments. Specifically, first, an edge pixel is extracted within a predetermined distance (N2) from the straight line from the edge pixel group obtained in step S801. If the distance between the extracted edge pixels is smaller than the predetermined value n, it is determined that they are elements of the same line segment. If the distance between the edge pixels is greater than the predetermined value n, it is determined that they are different line segment elements.

ここで、距離が所定値ｎより大きい２点のエッジ画素をそれぞれエッジ画素ｐｉ、エッジ画素ｐｊとする。エッジ画素ｐｊとエッジ画素ｐｊは、それぞれ異なる線分の端点であると判定し、エッジ画素ｐｉ、ｐｊを直線に正射影してえる座標を端点とする線分を生成する。所定距離Ｎ２は、例えばステップＳ８０１で定めた所定距離Ｎ１と同値とする。また所定値ｎとしては、例えば図２（ａ）に示す複数紙面のように、処理対象画像における紙面間の最小間隔以上の値を設定する必要があり、例えば５ｃｍに相当する画素数を設定する。 Here, two edge pixels whose distance is larger than the predetermined value n are defined as an edge pixel pi and an edge pixel pj, respectively. The edge pixel pj and the edge pixel pj are determined to be the end points of different line segments, and a line segment whose end points are coordinates obtained by orthogonally projecting the edge pixels pi and pj into a straight line is generated. The predetermined distance N2 is set to the same value as the predetermined distance N1 determined in step S801, for example. Further, as the predetermined value n, for example, it is necessary to set a value that is equal to or larger than the minimum interval between the paper surfaces in the processing target image, such as a plurality of paper surfaces shown in FIG. .

次に、ステップＳ８０３で、ステップＳ８０２で得た各線分について線分近傍（線分の周囲）のエッジ画素に対するフィッティングを行う。フィッティング処理は、分割処理で得られた各線分に対して、その近傍のエッジ画素を用いて、さらに線分を生成して、線分を出力する処理である。このフィッティング処理の詳細は、図９〜１１を用いて後述する。 Next, in step S803, fitting is performed on edge pixels in the vicinity of the line segment (around the line segment) for each line segment obtained in step S802. The fitting process is a process of generating a line segment for each line segment obtained by the division process and using the edge pixels in the vicinity thereof, and outputting the line segment. Details of the fitting process will be described later with reference to FIGS.

ステップＳ８０２及びステップＳ８０３の処理を全直線及び全線分について行った後に、ステップＳ８０４で線分候補選択部１４１２が選択処理の対象となる２本の線分から複数の線分候補を生成し、この複数の線分候補から１本の線分を選択する。２本の線分の線分間距離、線分間角度がいずれも閾値以下である場合に、線分候補選択部１４１２がこの２本の線分を選択処理の対象として判定する。この判定は、ステップＳ８０３の線分フィッティング処理で得られた線分の全てに対して行う。ここで、例えば画像中の紙面枚数が１枚であることが既知の場合には、同一直線上にある２本の線分に対して、積極的に選択処理する必要がある。この場合、線分間距離の閾値を大きくすると、閾値以内の距離で並列した２線分も選択処理の対象となり、意図しない結果が得られてしまう。そこで、選択条件として、両線分の中点同士を結んだ線分と、２線分それぞれが成す角度に対して閾値を与えることで、延長線上にある線分のみを選択処理の対象とすることができる。この選択処理の具体例については、図１２を用いて後述する。 After performing the processing of step S802 and step S803 for all the straight lines and all the line segments, in step S804, the line segment candidate selecting unit 1412 generates a plurality of line segment candidates from the two line segments to be selected, and the plurality of line segment candidates. One line segment is selected from the line segment candidates. When both the line segment distance and the line segment angle are equal to or less than the threshold value, the line segment candidate selection unit 1412 determines the two line segments as a target of selection processing. This determination is performed for all the line segments obtained by the line segment fitting process in step S803. Here, for example, when it is known that the number of sheets in the image is one, it is necessary to positively select two line segments on the same straight line. In this case, if the threshold of the line segment distance is increased, two line segments arranged in parallel at a distance within the threshold are also subject to selection processing, and an unintended result is obtained. Therefore, as a selection condition, a threshold is given to the line segment connecting the midpoints of both line segments and the angle formed by each of the two line segments, so that only the line segment on the extension line is selected. be able to. A specific example of this selection process will be described later with reference to FIG.

選択処理によって選択された線分は、ステップＳ８０５で再度フィッティング処理を行う。これを全線分について行い、線分出力処理を終了する。 The line segment selected by the selection process is subjected to the fitting process again in step S805. This is performed for all line segments, and the line segment output process is terminated.

●線分フィッティング処理
ここで、ステップＳ８０３およびステップＳ８０５における線分フィッティング処理について、図９のフローチャートを用いて詳細に説明する。 Line Segment Fitting Process Here, the line segment fitting process in steps S803 and S805 will be described in detail with reference to the flowchart of FIG.

まず、ステップＳ９０１で、信頼度算出部１４０８が、処理対象の線分（線分１）の信頼度を、［線分長］×［分離度］／［平均二乗誤差］によって定める計算式を用いて算出する。ここで、［線分長］は、処理対象の線分の長さである。［分離度］は、線分の両面の輝度がどの程度分離しているかを表す指標である。［平均二乗誤差］は、エッジ画素が線分近傍（線分の周囲の所定範囲）に密集している度合い（密集度合い）を表す指標である。ここで、［分離度］は以下の式（１）によって算出される。 First, in step S901, the reliability calculation unit 1408 uses a calculation formula that determines the reliability of the line segment to be processed (line segment 1) by [line segment length] × [separation degree] / [mean square error]. To calculate. Here, [Line segment length] is the length of the line segment to be processed. [Separation degree] is an index representing how much the luminance of both sides of a line segment is separated. [Mean square error] is an index that represents the degree (density degree) of edge pixels being densely adjacent to a line segment (a predetermined range around the line segment). Here, [Separation degree] is calculated by the following equation (1).

ここで、Ｎ_１は領域１の画素数、Ｎ_２は領域２の画素数
ｍ_１は領域１の平均輝度値、ｍ_２は領域２の平均輝度値
ｍは領域１−領域２の平均輝度値
ｓ_ｉは画素ｉの輝度値
式（１）において領域１は、線分を一辺とする幅ｗの一方の領域とし、領域２は、この線分を一辺とする領域１と重ならない幅ｗのもう一方の領域とする。また、入力画像データが多値データ（例えば、ＲＧＢの３値データ）である場合にも信頼度を算出できる。例えば、多値空間（ＲＧＢ空間）において、領域１の平均値と領域２の平均値を通る直線に全画素値を射影して得る値を輝度値に置き換えて、式１により分離度などを算出して、信頼度を算出してもよい。 Here, N ₁ is the number of pixels in region 1, N ₂ is the number of pixels in region 2, m ₁ is the average luminance value of region 1, m ₂ is the average luminance value of region 2, and m is the average luminance value of region 1 to region 2. s _i is the luminance value of the pixel i In the expression (1), the region 1 is one region having a width w having a line segment as one side, and the region 2 has a width w that does not overlap the region 1 having the line segment as a side. The other area. The reliability can also be calculated when the input image data is multivalued data (for example, RGB ternary data). For example, in a multi-value space (RGB space), the value obtained by projecting all pixel values onto a straight line passing through the average value of region 1 and the average value of region 2 is replaced with a luminance value, and the degree of separation is calculated using Equation 1. Then, the reliability may be calculated.

［平均二乗誤差］は、線分と線分から距離Ｄ以内にあるエッジ画素の距離を二乗した平均について、さらに平方根を取った値である。本実施形態では、ｗ及びＤを、ステップＳ８０１のエッジ画素群抽出の際に用いた所定距離Ｎ１と同じ値とする。 [Mean square error] is a value obtained by further taking the square root of the average obtained by squaring the distance between edge segments within the distance D from the line segment. In the present embodiment, w and D are set to the same value as the predetermined distance N1 used in the edge pixel group extraction in step S801.

即ち、領域の境界を示す線分としての信頼度は、線分近傍のエッジ画素の分布と、線分を境界とする二つの領域の画像特徴量の関係に基づいて算出する。また、信頼度は、線分近傍のエッジ画素の分布のみ、又は線分を境界とする二つの領域の画像特徴量の関係のみに基づいて算出してもよい。線分近傍のエッジ画素の分布は、例えば、エッジ画素が線分の周囲の所定範囲に密集している度合いである平均二乗誤差を用いて表すことができる。また、線分を境界とする二つの領域の画像特徴量の関係は、例えば、線分の両側の輝度がどの程度分離しているかの指標である分離度を用いて表すことができる。 That is, the reliability as a line segment indicating the boundary of the region is calculated based on the relationship between the distribution of edge pixels near the line segment and the image feature amount of the two regions having the line segment as a boundary. In addition, the reliability may be calculated based only on the distribution of edge pixels in the vicinity of the line segment, or only on the relationship between the image feature amounts of the two regions having the line segment as a boundary. The distribution of edge pixels in the vicinity of a line segment can be expressed by using, for example, a mean square error that is a degree of density of edge pixels in a predetermined range around the line segment. In addition, the relationship between the image feature amounts of two regions having a line segment as a boundary can be expressed by using, for example, a degree of separation that is an index of how much the luminance on both sides of the line segment is separated.

ここで図１０に分離度の概念図を示す。図６（ａ）において、線分１００１を境界とする領域１００２ａ、１００２ｂの輝度ヒストグラムが、１００３ａ、１００３ｂである。同様に、図６（ｂ）の各領域から得たヒストグラムを１００４ａ、１００４ｂに示した。この例の場合、図１０（ｂ）よりも図１０（ａ）の方は分離度が高い値となる。 FIG. 10 shows a conceptual diagram of the degree of separation. In FIG. 6A, the luminance histograms of the regions 1002a and 1002b having the line segment 1001 as a boundary are 1003a and 1003b. Similarly, histograms obtained from the respective regions in FIG. 6B are shown in 1004a and 1004b. In this example, the degree of separation is higher in FIG. 10 (a) than in FIG. 10 (b).

線分の信頼度は、線分が長く、線分を境界とする両面で輝度の変化が大きく、エッジ画素が線分近傍に密集している場合に大きくなる指標である。該信頼度は式（１）に限らず、その変形又は別の式を用いても良い。例えば、［線分長］を［距離Ｄ以内のエッジ画素数］に置き換えられる。また［分離度］を［領域１と領域２の最大輝度差（あるいは平均輝度差）］に置き換えられる。また［平均二乗誤差］を［エッジ画素と線分の距離総和］に置き換えられる。また［平均二乗誤差］を［距離Ｄ以内のエッジ画素群の輝度勾配方向の分散］に置き換えられる。また、線分の信頼度に［距離Ｄ以内のエッジ画素群の輝度勾配強度の総和］を係数として掛ける、などの変形が可能であり、さらに上述の変形方法に限らない。 The reliability of a line segment is an index that increases when the line segment is long, the luminance changes greatly on both sides with the line segment as a boundary, and the edge pixels are close to the line segment. The reliability is not limited to the equation (1), and a modification thereof or another equation may be used. For example, [line segment length] can be replaced with [number of edge pixels within distance D]. [Separation degree] is replaced with [maximum luminance difference (or average luminance difference) between region 1 and region 2]. [Mean square error] can be replaced with [Total sum of distances between edge pixels and line segments]. In addition, [mean square error] is replaced with [dispersion in luminance gradient direction of edge pixel group within distance D]. Further, it is possible to modify the line segment reliability by multiplying [the sum of luminance gradient intensities of edge pixel groups within the distance D] as a coefficient, and the present invention is not limited to the above-described modification method.

また、信頼度算出部１４０８を、さらに、線分長算出部、分離度算出部と平均二乗誤差算出部に分けることができる。線分長、分離度と平均二乗誤差のそれぞれに重み付けを行ってもよい。例えば、比較対象となる２本の線分について、線分長、平均二乗誤差の何れも差が小さい場合は、分離度算出部が算出する分離度を信頼度として用いてもよい。 The reliability calculation unit 1408 can be further divided into a line segment length calculation unit, a separation degree calculation unit, and a mean square error calculation unit. Each of the line segment length, the degree of separation, and the mean square error may be weighted. For example, regarding the two line segments to be compared, when the difference between the line segment length and the mean square error is small, the degree of separation calculated by the degree of separation calculation unit may be used as the reliability.

図９に戻り、ステップＳ９０２でエッジ画素取得部１４０９が線分１から所定距離（Ｄ）内のエッジ画素群Ｅ１を取得する。次に、ステップＳ９０３で、線分取得部１４１０がエッジ画素群Ｅ１から新たに線分２を求める。まず、エッジ画素群Ｅ１に対して、主成分分析などの処理を適用して、直線を生成する。主成分分析は、点群の分散が最も大きくなる方向を求める方法である。次に、このエッジ画素群Ｅ１から生成された直線と、エッジ画素群Ｅ１を包含する外接矩形の交点を端点とする線分を線分２として取得する。 Returning to FIG. 9, in step S <b> 902, the edge pixel acquisition unit 1409 acquires the edge pixel group E <b> 1 within a predetermined distance (D) from the line segment 1. Next, in step S903, the line segment acquisition unit 1410 newly obtains a line segment 2 from the edge pixel group E1. First, a straight line is generated by applying processing such as principal component analysis to the edge pixel group E1. Principal component analysis is a method for obtaining a direction in which the variance of a point group is maximized. Next, a line segment whose end point is an intersection of a straight line generated from the edge pixel group E1 and a circumscribed rectangle including the edge pixel group E1 is acquired as a line segment 2.

次に、ステップＳ９０４で、ステップＳ９０１で求めた線分１の信頼度と同様に線分２の信頼度を求める。そして、ステップＳ９０５で、線分選択部１４１１が線分１の信頼度と線分２の信頼度を比較し、大きい方を線分Ａとして選択する。ここで線分Ａとして線分１が選択された場合にはステップＳ９０６に進み、所定距離ＤをＤ以下の値に更新する。例えば、Ｄ＝Ｄ／２とする。線分Ａとして線分２が選択された場合には、所定距離Ｄを小さい値に更新せず、そのままステップＳ９０７に進む。 Next, in step S904, the reliability of line 2 is obtained in the same manner as the reliability of line 1 obtained in step S901. In step S905, the line segment selection unit 1411 compares the reliability of the line segment 1 with the reliability of the line segment 2, and selects the larger one as the line segment A. Here, when the line segment 1 is selected as the line segment A, the process proceeds to step S906, and the predetermined distance D is updated to a value equal to or less than D. For example, D = D / 2. When the line segment 2 is selected as the line segment A, the predetermined distance D is not updated to a small value, and the process directly proceeds to step S907.

ステップＳ９０７では、線分Ａから所定距離Ｄ以下のエッジ画素群Ｅ２を取得する。線分Ａが線分１である場合、エッジ画素群Ｅ２はエッジ画素群Ｅ１に完全に包含される点群となる。一方、線分Ａが線分２である場合、エッジ画素群Ｅ２はエッジ画素群Ｅ１に重複する点群となる。ステップＳ９０７で得たエッジ画素群Ｅ２がエッジ画素群Ｅ１と等しい場合、処理が収束したとみなし、線分フィッティング処理を終了する。一方、エッジ画素群Ｅ２とエッジ画素群Ｅ１が異なる場合にはステップＳ９０８に進み、エッジ画素群Ｅ２を新たなエッジ画素群Ｅ１とし、線分Ａを新たな線分１としてステップステップＳ９０３に戻り、処理を続行する。 In step S907, an edge pixel group E2 having a predetermined distance D or less from the line segment A is acquired. When the line segment A is the line segment 1, the edge pixel group E2 is a point group that is completely included in the edge pixel group E1. On the other hand, when the line segment A is the line segment 2, the edge pixel group E2 becomes a point group overlapping with the edge pixel group E1. If the edge pixel group E2 obtained in step S907 is equal to the edge pixel group E1, it is considered that the process has converged, and the line segment fitting process is terminated. On the other hand, if the edge pixel group E2 and the edge pixel group E1 are different, the process proceeds to step S908, the edge pixel group E2 is set as a new edge pixel group E1, the line segment A is set as a new line segment 1, and the process returns to step S903. continue processing.

以上、図９のフローチャートに示した線分フィッティングの処理は一例に過ぎず、線分フィッティング処理はこの例に限定されない。例えば、ステップＳ９０２及びステップＳ９０７において、線分１の角度に類似した方向に輝度勾配を持つエッジ画素のみを対象として取得することで、ステップＳ９０３の精度が向上できる。 The line segment fitting process shown in the flowchart of FIG. 9 is merely an example, and the line segment fitting process is not limited to this example. For example, in step S902 and step S907, by acquiring only edge pixels having a luminance gradient in a direction similar to the angle of line segment 1, the accuracy of step S903 can be improved.

また、ステップＳ９０５で選択した線分Ａをそのままフィッティング処理の結果として出力してもよいし、所定のループ回数で強制的に計算を打ち切り、その時点での線分Ａを出力してもよい。また、予め所定距離Ｄの値Ｄｉを複数用意し、ステップＳ９０２及びステップＳ９０３の時点でそれぞれの所定距離Ｄｉについてエッジ画素群Ｅｉ及び線分ｉを求め、信頼度が最大となる線分ｉを選択してもよい。あるいはステップＳ９０３で、主成分分析以外の手法を用いてもよい。例えば、外接矩形に接する２点や、距離が最大となる２点を端点としてもよい。また、周知の最小二乗法、ハフ変換、ＲＡＮＳＡＣ等の方法を用いてもよい。さらに、これらを全て同時に使用して複数の線分を求め、いずれかを選択、あるいは信頼度が閾値以上となる線分を全て出力してもよい。 Further, the line segment A selected in step S905 may be output as it is as a result of the fitting process, or the calculation may be forcibly terminated at a predetermined number of loops, and the line segment A at that time may be output. Also, a plurality of values Di of the predetermined distance D are prepared in advance, and the edge pixel group Ei and the line segment i are obtained for each predetermined distance Di at the time of Step S902 and Step S903, and the line segment i having the maximum reliability is selected. May be. Alternatively, a method other than principal component analysis may be used in step S903. For example, two points in contact with the circumscribed rectangle or two points having the maximum distance may be set as the end points. Further, a known method such as the least square method, the Hough transform, or RANSAC may be used. Further, all of these may be used simultaneously to obtain a plurality of line segments, and either one may be selected, or all line segments whose reliability is equal to or greater than a threshold may be output.

ここで図１１に、線分フィッティング処理の具体例を示す。図１１（ａ）では、３枚の紙面を撮像した画像から得られたエッジ１１０１ａ〜１１０１ｃと、線状ノイズであるが、除去できなかった画素１１０２とから、ステップＳ３０２の直線生成処理で直線１１０３が得られたとする。ここで、エッジ１１０１ｃは、紙面Ｃの上辺に対応する一つのエッジであるが、エッジ１１０１ａとエッジ１１０１ｂは、環境光などの影響により、それぞれ紙面Ａと紙面Ｂの上辺に対応する途切れた二本のエッジである。 FIG. 11 shows a specific example of the line segment fitting process. In FIG. 11A, from the edges 1101a to 1101c obtained from images obtained by imaging three sheets of paper and the pixels 1102 that are linear noise but could not be removed, a straight line 1103 is generated by the straight line generation processing in step S302. Is obtained. Here, the edge 1101c is one edge corresponding to the upper side of the paper surface C, but the two edges 1101a and 1101b correspond to the upper side of the paper surface A and the paper surface B, respectively, due to the influence of ambient light or the like. Is the edge.

直線１１０３を、ステップＳ８０２の処理によって、直線１１０３近傍（線分の周囲の所定範囲）のエッジ画素をもとに分割すると、図１１（ｂ）に示す線分１１０４ａ〜１１０４ｃが得られる。直線１１０３は複数の紙面のエッジ画素およびノイズ相当のエッジ画素から得られたので、線分１１０４ａ〜１１０４ｃは単に直線１１０３を分割した結果であり、いずれも各紙面の上辺から位置、傾き共にズレがある。これらの線分１１０４ａ〜１１０４ｃに対して、線分フィッティング処理を行う。 When the straight line 1103 is divided based on the edge pixels in the vicinity of the straight line 1103 (a predetermined range around the line segment) by the process of step S802, line segments 1104a to 1104c shown in FIG. 11B are obtained. Since the straight line 1103 is obtained from a plurality of paper edge pixels and edge pixels corresponding to noise, the line segments 1104a to 1104c are simply the result of dividing the straight line 1103, both of which are shifted in position and inclination from the upper side of each paper surface. is there. Line segment fitting processing is performed on these line segments 1104a to 1104c.

図１１（ｃ）は、線分１１０４ａ〜１１０４ｃ（線分１）それぞれに対して、ステップＳ９０２及びステップＳ９０３の処理を施した結果である。エッジ１１０１ｂとエッジ１１０１ｃとのそれぞれから、紙面Ｂと紙面Ｃの上辺に適した線分１１０５ｂと線分１１０５ｃとが線分２として得られた。一方で、線分１１０５ａは、線状ノイズ画素１１０２の影響を受けて紙面Ａの上辺からズレがある線分（線分２）となっている。ここでステップＳ９０５の処理を行い、線分１１０４ａと線分１１０５ａ、線分１１０４ｂと線分１１０５ｂ、線分１１０４ｃと線分１１０５ｃのそれぞれの信頼度を算出して比較する。 FIG. 11C shows the result of applying the processing of step S902 and step S903 to each of the line segments 1104a to 1104c (line segment 1). A line segment 1105b and a line segment 1105c suitable for the upper side of the paper surface B and the paper surface C were obtained as the line segment 2 from the edge 1101b and the edge 1101c, respectively. On the other hand, the line segment 1105a is a line segment (line segment 2) that is displaced from the upper side of the paper surface A due to the influence of the linear noise pixel 1102. Here, the processing of step S905 is performed, and the reliability of each of the line segment 1104a and the line segment 1105a, the line segment 1104b and the line segment 1105b, and the line segment 1104c and the line segment 1105c is calculated and compared.

その結果、線分１１０４ａ、線分１１０５ｂおよび線分１１０５ｃがそれぞれ信頼度の大きい線分として選択される。線分１１０５ｂおよび線分１１０５ｃは紙面領域の境界を示しているので、処理が収束し、線分フィッティング処理を終了する。一方、線分２として選択された線分１１０４ａは直線１１０３を分割して得た線分であるため、ステップＳ９０６に進んで所定距離Ｄをより小さな値に更新し、再度エッジ画素群を求める。 As a result, the line segment 1104a, the line segment 1105b, and the line segment 1105c are each selected as a line segment with high reliability. Since the line segment 1105b and the line segment 1105c indicate the boundaries of the paper area, the process converges and the line segment fitting process ends. On the other hand, since the line segment 1104a selected as the line segment 2 is a line segment obtained by dividing the straight line 1103, the process proceeds to step S906, the predetermined distance D is updated to a smaller value, and the edge pixel group is obtained again.

図１１（ｄ）に示すように、線分１１０４ａの周囲の更新前の所定範囲は、線分１１０４ａを中心線として、線分１１０４ａから所定距離Ｄ内の範囲である。図１１（ｅ）は、線分１１０４ａの周囲の更新後の所定範囲を示した図である。所定距離Ｄに対応する所定範囲１１０６と、更新後の所定距離Ｄ／２に対応するより小さくした所定範囲１１０７を示した。所定距離をＤ／２とすることで、線状ノイズ画素１１０２が除かれて、エッジ１１０１ａのみを含むエッジ画素群が得られる。このエッジ画素群をステップＳ９０７におけるエッジ画素群Ｅ２として取得して、ステップＳ９０８及びステップＳ９０３に進み、主成分分析などの処理を行う。そして、図１１（ｆ）に示すように、途切れた二本のエッジ１１０１ａを連結して、正しい紙面領域の境界を示す線分１１０８ａが得られる。 As shown in FIG. 11D, the predetermined range before the update around the line segment 1104a is a range within the predetermined distance D from the line segment 1104a with the line segment 1104a as the center line. FIG. 11E is a diagram showing the updated predetermined range around the line segment 1104a. A predetermined range 1106 corresponding to the predetermined distance D and a smaller predetermined range 1107 corresponding to the updated predetermined distance D / 2 are shown. By setting the predetermined distance to D / 2, the linear noise pixels 1102 are removed, and an edge pixel group including only the edge 1101a is obtained. This edge pixel group is acquired as the edge pixel group E2 in step S907, and the process proceeds to step S908 and step S903 to perform processing such as principal component analysis. Then, as shown in FIG. 11F, two segmented edges 1101a are connected to obtain a line segment 1108a indicating the boundary of the correct paper surface area.

図１１（ｅ）では、線分１１０４ａの周囲の所定範囲（線分の法線方向の所定範囲）を小さくする方法として、線分１１０４ａを境界に、その両側に線分１１０４ａからの所定距離Ｄを所定距離Ｄ／２に変更する一例を示した。ただし、この方法では、線状ノイズ画素１１０２が除去できない場合は、線分１１０８ａが正しく得られず、また信頼度の低い線分が得られてしまう可能性がある。 In FIG. 11 (e), as a method of reducing a predetermined range around the line segment 1104a (a predetermined range in the normal direction of the line segment), a predetermined distance D from the line segment 1104a on both sides of the line segment 1104a as a boundary. An example in which is changed to a predetermined distance D / 2 is shown. However, with this method, if the linear noise pixel 1102 cannot be removed, the line segment 1108a may not be obtained correctly, and a line segment with low reliability may be obtained.

そこで、上述した方法によって得られた線分の信頼度が線分１１０４ａより低い場合、大きさを小さくした所定範囲の位置を、線分１１０４ａの法線方向に移動し、移動した所定範囲にあるエッジ画素群を再度取得して、線分を生成する。生成された線分と線分１１０４ａとの信頼度を比較することによって、線状ノイズの除去を行う。なお、大きさを小さくした所定範囲の移動は複数回行って、線分１１０４ａより信頼度の高い線分が見つかるまで処理を繰り返す。 Therefore, when the reliability of the line segment obtained by the above-described method is lower than the line segment 1104a, the position of the predetermined range whose size is reduced is moved in the normal direction of the line segment 1104a, and is in the moved predetermined range. The edge pixel group is acquired again, and a line segment is generated. By comparing the reliability of the generated line segment with the line segment 1104a, linear noise is removed. Note that the movement within a predetermined range with a reduced size is performed a plurality of times, and the process is repeated until a line segment with higher reliability than the line segment 1104a is found.

なお、本実施形態の線分フィッティング処理は、１枚の紙面しかない場合にも適用できる。例えば、図１１（ｇ）〜（ｉ）及び図１１（ｄ）〜（ｆ）のように、紙面Ｂと紙面Ｃはなく、紙面Ａしかない場合にも、上述した方法は同様に適用できる。 It should be noted that the line segment fitting process of the present embodiment can also be applied when there is only one sheet. For example, as shown in FIGS. 11 (g) to 11 (i) and FIGS. 11 (d) to 11 (f), the above-described method can be similarly applied even when there is no paper surface B and paper surface C but only paper surface A.

●線分候補の選択
ここで図１２を用いて、ステップＳ８０４における線分候補の選択処理を説明する。 Selection of Line Segment Candidate Line segment candidate selection processing in step S804 will be described with reference to FIG.

図１２（ａ）には、選択処理を行う前の２本の線分である線分１２０１と線分１２０２と、線分１２０１の端点１２０１ａ、１２０１ｂと、線分１２０２の端点１２０２ａ、１２０２ｂを示す。 FIG. 12A shows two line segments 1201 and 1202, which are two line segments before selection processing, end points 1201a and 1201b of the line segment 1201, and end points 1202a and 1202b of the line segment 1202. .

線分候補の選択処理では、図１２（ｂ）〜（ｅ）に示すように、まず端点１２０１ａまたは１２０２ａと、端点１２０１ｂまたは１２０２ｂを端点とする線分候補１２０３〜１２０６を求める。ここでは、線分候補１２０３と線分候補１２０４は、もとの線分１２０２と線分１２０１と同じである。線分候補１２０５と線分候補１２０６は、新たに生成された線分である。次にこれらの候補線分について、ステップＳ９０１と同様の信頼度を求め、最も信頼度が高くなる線分候補を選択した結果として出力する。 In the line segment candidate selection process, as shown in FIGS. 12B to 12E, first, line segment candidates 1203 to 1206 having the end point 1201a or 1202a and the end point 1201b or 1202b as end points are obtained. Here, the line segment candidate 1203 and the line segment candidate 1204 are the same as the original line segment 1202 and the line segment 1201. A line segment candidate 1205 and a line segment candidate 1206 are newly generated line segments. Next, the reliability similar to step S901 is calculated | required about these candidate line segments, and it outputs as a result of having selected the line segment candidate with the highest reliability.

ここで出力される線分が線分候補１２０５または線分候補１２０６の場合、これらは新たに生成されたので、ステップＳ８０５で線分フィッティングが再度行われる。また、線分候補は図１２（ｂ）〜（ｅ）に示す線分に限らず、例えば端点１２０１ａと端点１２０２ａの中点と、端点１２０１ｂと端点１２０２ｂの中点を新たに定め、それらを端点とする線分候補を求めてもよい。 When the line segment output here is the line segment candidate 1205 or the line segment candidate 1206, since these are newly generated, the line segment fitting is performed again in step S <b> 805. Further, the line segment candidates are not limited to the line segments shown in FIGS. 12B to 12E. For example, the midpoints of the end points 1201a and 1202a and the midpoints of the end points 1201b and 1202b are newly determined, and the end points are defined as the end points. Line segment candidates may be obtained.

紙面の僅かな歪みや環境光により生じる紙面の影などにより、同一の紙面領域の境界（同一の辺）から、複数の線分が出力される場合がある。これに対して線分候補の選択処理を行うことで、近接した２本の線分から新たに１本の線分が選択され、正しい紙面領域の境界を示す線分を出力することが可能となる。 A plurality of line segments may be output from the boundary (same side) of the same paper region due to slight distortion of the paper surface or shadow of the paper surface caused by ambient light. On the other hand, by performing line segment candidate selection processing, a new line segment is selected from two adjacent line segments, and a line segment indicating the boundary of the correct paper area can be output. .

●四角形生成処理
ステップＳ３０４における四角形生成処理では、ステップＳ３０３によって得られた線分群から、四角形の集合を生成する。 Quadrangle Generation Process In the rectangle generation process in step S304, a set of rectangles is generated from the line segment group obtained in step S303.

まず、全ての線分の角度を求め、角度が０度以上４５度未満の線分を水平方向の線分（以下、水平線分）に分類し、４５度以上９０度未満の線分を垂直方向の線分（以下、垂直線分）に分類する。次に、水平線分群から任意の線分を第１の水平線分として選択し、それと距離が一定以上離れた第２の水平線分を選択する。続いて、垂直線分群から、第１の水平線分と略直交（例えば９０度±３度）する垂直線分を第１の垂直線分として選択し、それと距離が一定以上離れた第２の垂直線分を選択する。最後に、第１、第２の水平線分と第１、第２の垂直線分をそれぞれ延長して得る４つの交点を頂点とする四角形を出力し、これを全ての線分の組み合わせについて探索することで四角形を生成する。 First, the angles of all line segments are obtained, line segments whose angles are 0 degree or more and less than 45 degrees are classified into horizontal line segments (hereinafter referred to as horizontal line segments), and line segments that are 45 degrees or more and less than 90 degrees are vertically aligned. To the line segment (hereinafter referred to as the vertical line segment). Next, an arbitrary line segment is selected from the horizontal line segment group as a first horizontal line segment, and a second horizontal line segment that is separated by a certain distance or more is selected. Subsequently, a vertical line segment that is substantially orthogonal to the first horizontal line segment (for example, 90 degrees ± 3 degrees) is selected from the vertical line segment group as the first vertical line segment, and the second vertical line that is more than a certain distance away from the vertical line segment is selected. Select a line segment. Finally, a quadrangle whose vertexes are four intersections obtained by extending the first and second horizontal line segments and the first and second vertical line segments is output, and this is searched for a combination of all line segments. This creates a rectangle.

なお、本実施形態では四角形を検出対象として説明したが、これに限定するものではない。例えばＮ角形（Ｎは奇数）が検出対象となる場合、線分を９０／Ｎ度刻みにＮ分類し、分類された各群から任意の線分を１本ずつ取得し組み合わせることでＮ角形を生成することが可能である。Ｍ角形（Ｍは偶数）の場合、線分を９０／Ｍ度刻みにＭ分類し、分類された各群から任意の線分を２本ずつ取得し組み合わせることでＭ角形を生成することが可能である。 In the present embodiment, a quadrangle is described as a detection target, but the present invention is not limited to this. For example, when an N-gon (N is an odd number) is an object to be detected, the line segment is classified into N at 90 / N degree increments, and an arbitrary line segment is acquired from each classified group and combined to form the N-gon. It is possible to generate. In the case of M square (M is an even number), it is possible to generate an M square by classifying the line segment into 90 / M degree increments, and obtaining and combining two arbitrary line segments from each classified group. It is.

●紙面領域特定処理
以下、ステップＳ３０５における紙面領域特定処理について、図１５〜２２を用いて詳細に説明する。ここで説明上、四角形の紙面領域候補を例として説明するが、領域候補の形状が四角形に限らず、角と辺から構成される多角形であればよい。また、領域候補は紙面に限らず、プラスチックなどの材料で製作されたシート状のものでもよい。 Paper Area Specifying Process Hereinafter, the paper area specifying process in step S305 will be described in detail with reference to FIGS. Here, for explanation, a quadrilateral paper area candidate is described as an example, but the shape of the area candidate is not limited to a quadrilateral, and may be a polygon composed of corners and sides. Further, the region candidate is not limited to the paper surface, but may be a sheet shape made of a material such as plastic.

図１６は、本実施形態の画像処理装置において紙面領域特定処理を行う際の機能構成を示す図である。撮像画像入力部１６０１は、撮像画像入力部４０１と同様に、処理対象画像を入力する。紙面領域候補入力部１６０２は、紙面領域の候補を入力する機能ブロックである。本実施形態の紙面領域候補入力部１６０２は、エッジ画像作成部１６０３、直線生成部１６０４、線分出力部１６０５と四角形生成部１６０６によって構成される。 FIG. 16 is a diagram illustrating a functional configuration when performing the paper surface area specifying process in the image processing apparatus of the present embodiment. Similar to the captured image input unit 401, the captured image input unit 1601 inputs a processing target image. The paper area candidate input unit 1602 is a functional block for inputting paper area candidates. The paper area candidate input unit 1602 of this embodiment includes an edge image creation unit 1603, a straight line generation unit 1604, a line segment output unit 1605, and a quadrangle generation unit 1606.

エッジ画像作成部１６０３は、エッジ画像作成部１４０２と同様に、ステップＳ５０２〜ステップＳ５０９の処理を行う。直線生成部１６０４は、直線生成部１４０３と同様に、ステップＳ３０２の直線を生成する。線分出力部１６０５は、線分出力部１４０４と同様に、線分出力処理を行う。四角形生成部１６０６は、出力された線分を用いて、ステップＳ３０４の処理を行い、紙面領域の紙面領域候補となる四角形を生成する。 The edge image creation unit 1603 performs the processing from step S502 to step S509 in the same manner as the edge image creation unit 1402. The straight line generation unit 1604 generates the straight line in step S302 as with the straight line generation unit 1403. Similar to the line segment output unit 1404, the line segment output unit 1605 performs line segment output processing. The quadrangle generation unit 1606 performs the process of step S304 using the output line segment, and generates a quadrilateral that is a paper area candidate for the paper area.

情報出力部１６０７は、四角形生成部１６０６で生成された紙面領域候補に関する情報を出力する。領域特定部１６０８は、紙面領域特定処理を行う。 The information output unit 1607 outputs information regarding the paper area candidate generated by the quadrangle generation unit 1606. An area specifying unit 1608 performs a paper area specifying process.

次に、図１５を用いて紙面領域特定処理のフローを説明する。まずステップＳ１５０１で、ステップＳ３０４で四角形として生成された紙面領域候補Ｏ１からＯｎを順次、処理対象の紙面領域候補Ｏとして、ステップＳ１５０２〜ステップＳ１５０７までの処理を繰り返す。それぞれの紙面領域候補Ｏに対する処理は独立しているため、最終的に得られる結果は処理の順序により変動しない。したがって、それぞれの紙面領域候補Ｏの処理を並列に行ってもよい。 Next, the flow of the paper area specifying process will be described with reference to FIG. First, in step S1501, the processing from step S1502 to step S1507 is repeated with the paper region candidates O1 to On generated as squares in step S304 sequentially as the paper region candidate O to be processed. Since the processing for each paper area candidate O is independent, the finally obtained result does not vary depending on the processing order. Therefore, the processing of each paper area candidate O may be performed in parallel.

ステップＳ１５０２では、紙面領域候補Ｏの辺近傍をｍ個に分割して分割領域Ｂ１、Ｂ２、…、Ｂｍを作成する。ここでは、紙面領域候補Ｏの辺近傍領域の分割について、図１７を用いて説明する。 In step S1502, the vicinity of the side of the paper region candidate O is divided into m pieces to create divided regions B1, B2,. Here, the division of the region near the side of the paper region candidate O will be described with reference to FIG.

図１７は、四角形生成部１６０６によって生成された紙面領域候補Ｏの辺近傍領域の分割について説明する図である。図１７（ａ）に示すように紙面領域候補は、４つの線分１７０１、１７０２、１７０３、１７０４から構成される四角形である。 FIG. 17 is a diagram for explaining the division of the region near the side of the paper region candidate O generated by the quadrangle generation unit 1606. As shown in FIG. 17A, the paper surface area candidate is a quadrangle composed of four line segments 1701, 1702, 1703, and 1704.

図１７（ｂ）は、図１７（ａ）に示す４つの線分１７０１、１７０２、１７０３、１７０４を分割した分割点の例を示す。本実施形態では、四角形１６０１の長辺にあたる線分１７０１、１７０３を５等分し、短辺にあたる線分１７０２、１７０４を３等分するように、分割点１７０５〜１７１６を設定している。さらに、各辺の両端、即ち四角形１６０１の角の部分にあたる分割点１７１７、１７１８、１７１９、１７２０を併せて、全１６個の分割点を作成した例を示す。なお、図１７に示す分割点は一例に過ぎず、各辺で異なる個数への分割を行ってもよいし、全辺同数に分割してもよい。また、辺を等分に分割する方法の他、各辺の端から一定間隔で分割点を決定してもよい。 FIG. 17B shows an example of dividing points obtained by dividing the four line segments 1701, 1702, 1703, and 1704 shown in FIG. In this embodiment, the dividing points 1705 to 1716 are set so that the line segments 1701 and 1703 corresponding to the long side of the quadrangle 1601 are equally divided into five and the line segments 1702 and 1704 corresponding to the short side are equally divided into three equal parts. Further, an example in which a total of 16 division points are created by combining the division points 1717, 1718, 1719, and 1720 corresponding to both ends of each side, that is, the corners of the quadrangle 1601, is shown. Note that the dividing points shown in FIG. 17 are merely examples, and each side may be divided into different numbers, or may be divided into the same number on all sides. In addition to the method of dividing the sides equally, the dividing points may be determined at regular intervals from the ends of the sides.

そして図１７（ｃ）に示すように、各分割点を中心とした正方形の分割領域１７２１〜１７３６を作成する。各分割領域は、その中心を紙面領域候補である四角形の辺上に持ち、四角形の辺で内部と外部とが区切られることで、四角形の辺の内側と外側の両方の領域が構成される。なお、この分割領域は一例に過ぎず、例えば分割領域の形状については正方形ではなく長方形でもよいし、円形やその他の形であってもよい。また、各分割領域のサイズも一定でなくても良く、所定大きさ以下であればよい。また、後述する分割領域の評価値に応じて、最も評価値の高い所定大きさを求めて、その所定大きさ以下に各分割領域のサイズを変更してもよい。 Then, as shown in FIG. 17C, square divided areas 1721 to 1736 centering on each dividing point are created. Each divided area has its center on a quadrilateral side that is a paper area candidate, and the inside and outside of the quadrangular side are configured by dividing the inside and the outside by the quadrangular side. This divided area is merely an example. For example, the shape of the divided area may be a rectangle instead of a square, or may be a circle or other shapes. Further, the size of each divided region may not be constant, and may be not more than a predetermined size. Further, a predetermined size having the highest evaluation value may be obtained in accordance with an evaluation value of a divided region described later, and the size of each divided region may be changed to the predetermined size or less.

そしてステップＳ１５０３では、ステップＳ１５０２で作成した分割領域について、紙面領域候補の確からしさ（確度）を得るための評価値を算出する。分割領域ごとの評価値算出方法については、図１８を用いて後述する。 In step S1503, an evaluation value for obtaining the certainty (accuracy) of the paper area candidate is calculated for the divided area created in step S1502. The evaluation value calculation method for each divided area will be described later with reference to FIG.

そしてステップＳ１５０４では、ステップＳ１５０３で算出した分割領域ごとの評価値に基づき、紙面領域候補全体の確からしさ（確度）のスコアを算出し、ステップＳ１５０５へ進む。紙面領域候補のスコアは、辺に位置する分割領域の評価値の４辺の辺ごとの平均を求め、それらの平均と、角に位置する分割領域の評価値の平均を加算して求める。 In step S1504, a probability (accuracy) score of the entire paper area candidate is calculated based on the evaluation value for each divided area calculated in step S1503, and the process proceeds to step S1505. The score of the paper region candidate is obtained by calculating the average of the evaluation values of the divided regions located on the sides for each of the four sides and adding the average of the evaluation values of the divided regions located on the corners.

図１７に示す分割領域の例では、分割領域１７２１〜１７３２が辺に位置し、分割領域１７３３〜１７３６が角に位置する。そして辺ごとの評価値の平均とはすなわち、分割領域１７２１〜１７２４の平均と、分割領域１７２５と分割領域１７２６の平均と、分割領域１７２７〜１７３０の平均と、分割領域１７３１と分割領域１７３２の平均である。 In the example of the divided area shown in FIG. 17, the divided areas 1721 to 1732 are located on the sides, and the divided areas 1733 to 1736 are located on the corners. The average evaluation value for each side means that the average of the divided areas 1721 to 1724, the average of the divided areas 1725 and 1726, the average of the divided areas 1727 to 1730, and the average of the divided areas 1731 and 1732 It is.

紙面領域候補が正しく紙面領域であれば、紙面領域候補の内部、すなわち紙面と、その外部のそれぞれの色差により、分離度は高い値を示す。ステップＳ１５０４で説明した角部と辺部それぞれの平均値を加算する方法は、本実施形態の説明のための一例であり、分割領域の分離度が高いほど紙面領域候補の確度が高くなることが本質であるので、スコアを算出する具体的な方法を変更してもよい。スコアを算出する方法の変更例として、途中の計算結果に重みを付けて算出することができる。例えば、ステップＳ１５０４でスコアを算出する際に、角部にある分割領域の分離度に重みを付けてもよい、その平均値に重みを付けてもよい。また、辺部にあるそれぞれの分割領域の分離度に重みを付けてもよい、その平均値に重みを付けてもよい。重みを付ける方法として、所定係数を分離度又は分離度の平均値に掛けることができる。 If the paper area candidate is a correct paper area, the degree of separation shows a high value due to the color difference between the inside of the paper area candidate, that is, the paper surface and the outside thereof. The method of adding the average values of the corners and sides described in step S1504 is an example for describing the present embodiment, and the higher the degree of separation of the divided regions, the higher the accuracy of the paper region candidate. Since it is essential, the specific method for calculating the score may be changed. As an example of a change in the method for calculating the score, a calculation result in the middle can be weighted and calculated. For example, when the score is calculated in step S1504, the degree of separation of the divided areas at the corners may be weighted, or the average value may be weighted. In addition, a weight may be given to the degree of separation of each divided region in the side, or an average value thereof may be weighted. As a weighting method, a predetermined coefficient can be multiplied by the degree of separation or the average value of the degree of separation.

なお、本実施形態の分離度は式（１）によって算出されることができるが、式（１）に限定されない。分割領域１７２１〜１７３２の分離度は、辺に対応する線分を境界とする両側の画像特徴量の関係を示す情報として、情報出力部１６０７により出力される。また、分割領域１７３３〜１７３６の分離度は、領域候補の内側と外側の画像特徴量の関係を示す情報として、情報出力部１６０７により出力される。 Note that the degree of separation according to the present embodiment can be calculated by Expression (1), but is not limited to Expression (1). The degree of separation of the divided regions 1721 to 1732 is output by the information output unit 1607 as information indicating the relationship between the image feature amounts on both sides with the line segment corresponding to the side as a boundary. Further, the degree of separation of the divided regions 1733 to 1736 is output by the information output unit 1607 as information indicating the relationship between the inner and outer image feature amounts of the region candidates.

そしてステップＳ１５０５で、対象紙面領域候補Ｏのスコアが閾値以上か判定し、閾値以上であればステップＳ１５０６へ進み、対象紙面領域候補Ｏを紙面領域として選択してステップＳ１５８へ進む。一方、閾値未満であればステップＳ１５０７へ進み、対象紙面領域候補Ｏは紙面領域でないとして、紙面領域候補群から削除してステップＳ１５８へ進む。なお、本実施形態ではこの閾値を１．８０とするが、この閾値は一例であり、その他の固定値であってもよいし、実際の処理対象の種類に応じて調整可能な値でもよい。 In step S1505, it is determined whether the score of the target paper area candidate O is equal to or greater than the threshold value. If the score is equal to or greater than the threshold value, the process proceeds to step S1506. On the other hand, if it is less than the threshold value, the process advances to step S1507, and the target paper area candidate O is not a paper area, is deleted from the paper area candidate group, and the process advances to step S158. In the present embodiment, this threshold value is 1.80, but this threshold value is only an example, and may be another fixed value or a value that can be adjusted according to the type of the actual processing target.

ステップＳ１５０５〜ステップＳ１５０７では、閾値を用いて紙面であるか否かの判定を行ったが、入力画像内に一つの紙面領域しか存在しないことが明らかである場合は、スコアが最も高い四角形領域候補のみを残してもよい。 In steps S1505 to S1507, it is determined whether or not it is a paper surface using a threshold value. If it is clear that there is only one paper region in the input image, a rectangular region candidate having the highest score is obtained. You may leave only.

そしてステップＳ１５０８で、紙面領域候補の全てに対する処理が終了していれば、図１５に示す紙面領域特定処理を終了するが、未処理紙面領域候補が残っていれば、ステップＳ１５０１へ戻って、次の紙面領域候補の処理を開始する。 If the processing for all the paper area candidates is completed in step S1508, the paper area specifying process shown in FIG. 15 is ended. If there are unprocessed paper area candidates remaining, the process returns to step S1501, The processing of the paper area candidate is started.

●分割領域評価値の算出処理
以下、上記ステップＳ１５０３における分割領域の評価値の算出処理について、図１８のフローチャートを用いて説明する。 Division Area Evaluation Value Calculation Processing Hereinafter, the division region evaluation value calculation processing in step S1503 will be described with reference to the flowchart of FIG.

まずステップＳ１８０１で、ステップＳ１５０２で作成した分割領域Ｂ１〜Ｂｍを順次、処理対象の分割領域Ｂとして、ステップＳ１８０２〜ステップＳ１８０３の処理を繰り返す。それぞれの分割領域Ｂに対する処理は独立しているため、最終的に得られる結果は処理の順序により変動しない。したがって、それぞれの分割領域Ｂの処理を並列に行ってもよい。 First, in step S1801, the divided areas B1 to Bm created in step S1502 are sequentially set as the divided areas B to be processed, and the processes in steps S1802 to S1803 are repeated. Since the processing for each divided region B is independent, the finally obtained result does not vary depending on the processing order. Therefore, the processing of each divided region B may be performed in parallel.

ステップＳ１８０２では、分割領域Ｂの内部領域と外部領域の画素値（例えば輝度値）の分離度を算出する。分離度の算出には上記式（１）を用いればよい。ここで分離度は、線分を境界とする線分の両側の画素値（例えば輝度値）がどれだけ分離しているかを表す指標である。紙面領域候補Ｏが正しく紙面領域であれば、紙面領域と背面領域（例えば原稿と原稿台）の画素値の差（例えば輝度値の差）により、分離度が高く算出される。そして、ステップＳ１８０３で、ステップＳ１８０２で求めた分離度を分割領域Ｂの評価値として設定し、ステップＳ１８０４へ進む。 In step S1802, the degree of separation between pixel values (for example, luminance values) in the inner area and the outer area of the divided area B is calculated. The above formula (1) may be used to calculate the degree of separation. Here, the degree of separation is an index indicating how much pixel values (for example, luminance values) on both sides of a line segment with the line segment as a boundary are separated. If the paper area candidate O is a correct paper area, the degree of separation is calculated to be high based on the difference in pixel values (eg, the difference in luminance values) between the paper area and the back area (eg, the original and the original table). In step S1803, the degree of separation obtained in step S1802 is set as the evaluation value of the divided region B, and the process proceeds to step S1804.

従来では、紙面領域候補Ｏの辺近傍領域の全体で分離度を求めるが、一般に広い領域では環境光などの影響により、輝度のヒストグラムの分布が広くなる。したがって、広い領域内の２領域間における分離度としては、２領域の画素値（輝度値）の差が小さので、２領域のヒストグラムが重なりあってしまい、分離度も小さい値が算出されてしまう。そこで、本実施形態では、背景と紙面の画素値（輝度値）の差が小さい場合にも正しい紙面領域候補を特定できるように、広い領域を小さい領域に分割して分離度を算出することにより、正しい紙面領域候補はより高い分離度を得ることができる。 Conventionally, the degree of separation is obtained for the entire region in the vicinity of the side of the paper region candidate O. Generally, in a wide region, the distribution of the luminance histogram becomes wide due to the influence of ambient light or the like. Therefore, as the degree of separation between the two areas in the wide area, the difference between the pixel values (luminance values) of the two areas is small, so that the histograms of the two areas overlap and a value with a small degree of separation is calculated. . Therefore, in this embodiment, by dividing the wide area into small areas and calculating the degree of separation so that the correct paper area candidate can be specified even when the difference between the pixel value (luminance value) between the background and the paper is small. The correct paper area candidate can obtain a higher degree of separation.

また、分離度をより大きく得るために、分割領域のサイズを変更してもよい。例えば、紙面内の余白が狭い箇所では、分割領域のサイズによっては文字などの余計な画素が入るため、分離度が小さい値をとってしまうことがある。その場合は、分割領域のサイズを変更して、分離度の高い方を評価値として採用すればよい。 Further, the size of the divided area may be changed in order to obtain a higher degree of separation. For example, in a portion where the margin in the paper is narrow, extra pixels such as characters may be included depending on the size of the divided area, and the separation degree may take a small value. In that case, it is only necessary to change the size of the divided area and adopt the higher degree of separation as the evaluation value.

ステップＳ１８０４では、全ての分割領域について処理が終了していれば、図１８に示す評価値の算出処理を終了するが、未処理の分割領域が残っていればステップＳ１８０１へ戻って、次の分割領域の処理を開始する。 In step S1804, if all the divided areas have been processed, the evaluation value calculation process shown in FIG. 18 is ended. If there is an unprocessed divided area, the process returns to step S1801, and the next divided area is determined. Start processing the region.

●評価値の算出例
以下、図１９に、図１７に示す紙面領域候補に対して、図１５のステップＳ１５０３により算出した評価値の例示す。ステップＳ１８０２、Ｓ１８０３において分割領域１７２１〜１７３２の分離度算出した結果を、表１９０１の評価値の列に記載されている。また、各辺の評価値の平均を計算した結果が、各辺平均として各辺の平均評価値の列に、角部１７３３〜１７３６の評価値の平均を計算した結果が、角部の平均評価値の列に記載されている。 Evaluation Value Calculation Example FIG. 19 shows an example of evaluation values calculated in step S1503 in FIG. 15 for the paper area candidate shown in FIG. The result of calculating the degree of separation of the divided regions 1721 to 1732 in steps S1802 and S1803 is described in the evaluation value column of Table 1901. Moreover, the result of calculating the average of the evaluation values of each side is the result of calculating the average of the evaluation values of the corners 1733 to 1736 in the column of the average evaluation value of each side as the average of each side. It is listed in the value column.

図１９に示す平均評価値をもとに、ステップＳ１５０４でこの紙面領域候補の確度のスコアを計算すると１．８２となる。このスコアは情報出力部１６０７から出力される。そして、ステップＳ１５０５でスコアが閾値以上であると判定され、ステップＳ１５０６へ進む。ステップＳ１５０６で、スコアの判定結果に基づいて、領域特定部１６０８によって、図１７に示す紙面領域候補は紙面領域又は注目領域として特定される。 Based on the average evaluation value shown in FIG. 19, the score of the accuracy of this paper area candidate is calculated to be 1.82 in step S1504. This score is output from the information output unit 1607. In step S1505, it is determined that the score is equal to or greater than the threshold value, and the process advances to step S1506. In step S1506, based on the score determination result, the area specifying unit 1608 specifies the paper area candidate shown in FIG. 17 as the paper area or the attention area.

以下、同じ入力画像に対する、図１７に示す紙面領域候補とは異なる紙面領域候補を例にした、評価値算出例を示す。図２０（ａ）に示す紙面領域候補は、紙面上の黒帯部分の上端２００１と、紙面の三辺にある線分２００２、線分２００３と線分２００４から構成されており、紙面として選択するのが好ましくない。図２０（ｂ）は、上記図１５のステップＳ１５０２で図２０（ａ）に示す４つの線分２００１〜２００４を分割した例を示す。同図において、辺部の分割領域２００４〜２０１３、角部の分割領域２０１４〜２０１７に分割された。 In the following, an example of evaluation value calculation for the same input image will be shown by taking a paper area candidate different from the paper area candidate shown in FIG. 17 as an example. The paper surface area candidate shown in FIG. 20A is composed of the upper end 2001 of the black belt portion on the paper surface, the line segment 2002 on the three sides of the paper surface, the line segment 2003 and the line segment 2004, and is selected as the paper surface. It is not preferable. FIG. 20B shows an example in which the four line segments 2001 to 2004 shown in FIG. 20A are divided in step S1502 of FIG. In the same figure, it was divided into divided areas 2004-2013 on the side and divided areas 2014-2017 on the corner.

図２０（ｃ）は、角部の分割領域２０１４の外部領域を拡大した図である。分割領域２０１４の外部領域は、背景２０１８と紙面２０１９から構成される。この外部領域についての輝度ヒストグラムを求めると、背景２０１８と紙面２０１９を構成する画素群によって発生した二つのピークの幅が広くなるため、外部領域の分散値が高くなり、外部領域と内部領域の分離度が低い値を示す。つまり角部分割領域２０１４の評価値が小さくなる。分割領域２０１５についても同様である。 FIG. 20C is an enlarged view of the outer region of the corner divided region 2014. The outer area of the divided area 2014 is composed of a background 2018 and a paper surface 2019. When the luminance histogram for the outer region is obtained, the width of two peaks generated by the pixel group constituting the background 2018 and the paper surface 2019 is widened, so that the variance value of the outer region is increased and the outer region and the inner region are separated. Indicates a low value. That is, the evaluation value of the corner portion divided area 2014 becomes small. The same applies to the divided areas 2015.

図２１は、図２０の紙面領域候補の分割領域に対して、図１５のステップ１５０２、ステップＳ１５０３により算出した評価値の例示す。表２１０１において、分割領域２０１４と２０１５の評価値が低い値を示している。図２１に示す処理結果をもとに、ステップＳ１５０４でこの紙面領域候補の確度のスコアを計算すると１．７２となる。この値は、ステップＳ１５０５において閾値を満たさないと判定され、ステップＳ１５０７でこの紙面領域候補は正解ではない領域として削除される。 FIG. 21 shows an example of evaluation values calculated in step 1502 and step S1503 in FIG. 15 for the divided areas of the paper area candidate in FIG. In Table 2101, the evaluation values of the divided areas 2014 and 2015 are low. Based on the processing result shown in FIG. 21, the score of the accuracy of this paper area candidate is calculated to be 1.72 in step S1504. This value is determined not to satisfy the threshold value in step S1505, and in step S1507, the paper area candidate is deleted as an area that is not correct.

図２２（ａ）と図２２（ｂ）を用いて、紙面領域候補の特定処理の具体例について説明する。図２２（ａ）に示すような、環境光の影響で撮像画像の全体にグラデーションが発生した場合に、前景領域と背景領域全体の分離度を求める方法では、紙面領域候補の内側と外側双方の分散値が環境光により高い値となり、内側と外側の分離度が低い値を示す。本実施形態の構成や方法では、各辺を小領域に分割することで、各小領域における紙面領域候補の内外の分離度が高い値を示す。 A specific example of the paper area candidate specifying process will be described with reference to FIGS. 22A and 22B. When a gradation occurs in the entire captured image due to the influence of ambient light as shown in FIG. 22 (a), the method for obtaining the degree of separation between the foreground area and the entire background area can be performed both inside and outside the paper area candidate. The dispersion value becomes a high value due to ambient light, and the separation degree between the inside and the outside is low. In the configuration and method of the present embodiment, dividing each side into small areas shows a high value of the internal / external separation degree of the paper area candidates in each small area.

図２２（ｂ）は、上部に色つき（黒やその他の色）の帯領域２２０１が存在する紙面領域を撮影した場合である。この場合であっても、正しい紙面領域候補の色つきの帯領域２２０１部分にある各分割領域の内部と外部の色差が存在するため、各分割領域の内外分離度は高い値を示す。そのため、領域候補は高いスコアが算出される。 FIG. 22B shows a case where a paper area where a colored (black or other color) band area 2201 is present is photographed. Even in this case, since there is a color difference between the inside and outside of each divided area in the colored band area 2201 portion of the correct paper area candidate, the inside / outside separation degree of each divided area shows a high value. Therefore, a high score is calculated for the region candidate.

また、図２２（ｂ）に示すように、色つき（黒やその他の色）のブロック２２１０を含む紙面領域を撮影する場合がある。撮像画像にあるブロック２２１０から生成された紙面領域候補でも、その内側と外側の分離度が高い値を示す。この場合は、さらに、各紙面領域候補の面積を算出して比較し、その面積が相対的に小さい紙面領域候補は正解ではない領域として削除される。算出された各紙面領域候補の面積を比較する方法として、例えば、最も大きい紙面領域候補に対する面積比を求めて、その面積比が閾値以下の紙面領域候補は削除する対象として判定することができる。 In addition, as shown in FIG. 22B, a paper area including a colored (black or other color) block 2210 may be photographed. Even in the paper area candidate generated from the block 2210 in the captured image, the degree of separation between the inner side and the outer side shows a high value. In this case, the areas of the respective paper area candidates are further calculated and compared, and the paper area candidate having a relatively small area is deleted as a non-correct area. As a method for comparing the calculated areas of the respective paper area candidates, for example, an area ratio with respect to the largest paper area candidate can be obtained, and a paper area candidate whose area ratio is equal to or smaller than a threshold value can be determined as an object to be deleted.

本実施形態によれば、紙面領域候補を構成する各線分の周辺領域を分割した各分割領域において、紙面領域候補の外部及び内部の画素値（例えば輝度値）のヒストグラムの分離度に基づき、紙面の確からしさ（確度）をスコアとして算出する。よって、実際の紙面領域ではない誤った紙面領域候補を選択せず、正しい紙面領域候補を紙面領域又は注目領域として特定できる効果がある。 According to the present embodiment, in each divided region obtained by dividing the peripheral region of each line segment constituting the paper region candidate, based on the degree of separation of the histograms of the pixel values (for example, luminance values) outside and inside the paper region candidate. Is calculated as a score. Therefore, there is an effect that the correct paper area candidate can be specified as the paper area or the attention area without selecting an incorrect paper area candidate that is not an actual paper area.

＜第２の実施形態＞
第１の実施形態では、紙面領域候補のスコアを算出し、閾値以上か否かを判定することによって紙面領域候補の選択を行ったが、本実施形態では、複数の紙面領域候補のスコアの相対値より紙面領域候補を選択する。たとえば、入力画像中にＮ個の紙面領域しか含まれないことが予め分かっていれば、閾値を用いずに全紙面領域候補のスコアの上位Ｎ個の紙面領域候補を紙面領域又は注目領域として特定する。本実施形態のその他の処理は、第１の実施形態と同じであるので、説明を省略する。 <Second Embodiment>
In the first embodiment, the paper region candidate score is calculated and the paper region candidate is selected by determining whether or not the threshold value is equal to or greater than the threshold value. However, in the present embodiment, the relative scores of the plurality of paper region candidate candidates are selected. A paper area candidate is selected from the value. For example, if it is known in advance that the input image includes only N paper areas, the top N paper area candidates in the scores of all paper area candidates are identified as the paper area or the attention area without using a threshold value. To do. Other processes in the present embodiment are the same as those in the first embodiment, and thus the description thereof is omitted.

また、２枚の紙面が重なっているために、隠れている紙面については選択したくない場合については、領域の重なる二つの紙面領域候補については、スコアの高い方のみを選択してもよい。 In addition, since the two sheets of paper overlap each other, if the user does not want to select the hidden paper, only the one with the higher score may be selected for the two paper region candidates whose areas overlap.

図２３は、２枚の紙面領域が重なった例を示す。紙面領域候補２３０１は、紙面領域候補２３０２により一部分が隠れている。分割領域２３０３は、紙面領域候補２３０１に対し前述第１の実施形態で説明した処理によって設定された分割領域の例である。この分割領域２３０３においては、紙面領域候補２３０１の内部領域、外部領域ともに、紙面領域候補２２０２内の紙面であるため、分離度が低い値となる。また、紙面領域候補２３０１全体に関しても、その内部領域、外部領域ともに紙面領域２３０２に隠れているため、画素値が均一ではなくなり分散値がそれぞれ高くなり、紙面領域候補全体の分離度が低い値となる。その結果、紙面領域候補２３０１のスコアが紙面領域候補２３０２に比べて低い値となる。この場合、領域特定部１６０８によって、スコアの最も高い紙面領域候補２３０２のみが紙面領域又は注目領域として特定される。 FIG. 23 shows an example in which two sheet areas overlap. The paper area candidate 2301 is partially hidden by the paper area candidate 2302. A divided area 2303 is an example of a divided area set by the processing described in the first embodiment with respect to the paper area candidate 2301. In this divided area 2303, since both the inner area and the outer area of the paper area candidate 2301 are the paper face in the paper area candidate 2202, the separation degree is a low value. Also, the entire paper area candidate 2301 is also hidden in the paper area 2302 in both the internal area and the external area, so that the pixel values are not uniform and the variance value is high, and the separation degree of the entire paper area candidate is low. Become. As a result, the score of the paper area candidate 2301 is lower than that of the paper area candidate 2302. In this case, the area specifying unit 1608 specifies only the paper area candidate 2302 having the highest score as the paper area or the attention area.

ユーザの特定したい紙面領域又は注目領域の数が予め分かっている場合は、本実施形態によれば、複数の紙面候補領域のスコア順位に基づいて、所定個数の紙面候補領域を特定する。また、図２３のような重なりによって一部が隠れている紙面領域候補を特定しないことができる。 When the number of paper areas or attention areas to be specified by the user is known in advance, according to the present embodiment, a predetermined number of paper candidate areas are specified based on the score ranking of the plurality of paper candidate areas. Further, it is not possible to specify a paper area candidate that is partially hidden by overlapping as shown in FIG.

＜第３の実施形態＞
第１の実施形態では、各分割領域の紙面候補領域の内側と外側の分離度を元に紙面領域としての確からしさ（確度）のスコアを算出した。本実施形態では、撮像した紙面の背景領域（原稿台など）の画素値が一様である場合、分散値をスコアの指標として用いる。本実施形態のその他の処理は、第１の実施形態と同じであるので、説明を省略する。 <Third Embodiment>
In the first embodiment, a probability (accuracy) score as a paper area is calculated based on the inner and outer separation degrees of the paper area candidate areas of each divided area. In the present embodiment, when the pixel values of the background area (such as the document table) of the imaged paper are uniform, the variance value is used as a score index. Other processes in the present embodiment are the same as those in the first embodiment, and thus the description thereof is omitted.

図２４は、本実施形態におけるステップＳ１５０３の分割領域の評価値を算出する処理を示すフローチャートである。 FIG. 24 is a flowchart showing the process of calculating the evaluation value of the divided area in step S1503 in this embodiment.

ステップＳ２４０１では、分割領域Ｂ１からＢｍをそれぞれ処理対象の分割領域ＢとしてＳ２４０２へ進み、Ｓ２４０６までの処理を繰り返す。 In step S2401, the divided areas B1 to Bm are set as the divided areas B to be processed, the process proceeds to S2402, and the processes up to S2406 are repeated.

分割領域Ｂの位置によって、その後の処理が決まるので、ステップＳ２４０２では、まず分割領域Ｂが角に位置しているかを判定する。分割領域Ｂが辺に位置していると判定したら、ステップＳ２４０３とステップＳ２４０４の処理を行い、分割領域Ｂが角に位置していると判定したら、ステップＳ２４０５とステップＳ２４０６の処理を行う。 Since the subsequent processing is determined by the position of the divided area B, in step S2402, it is first determined whether the divided area B is located at a corner. If it is determined that the divided area B is located on the side, the processes in steps S2403 and S2404 are performed. If it is determined that the divided area B is located in the corner, the processes in steps S2405 and S2406 are performed.

ステップＳ２４０３とステップＳ２４０４との処理はステップＳ１８０２とステップＳ１８０３との処理と同様であるため、ここで説明を省略する。ステップＳ２４０５では、情報出力部１６０７は、以下の式（２）を用いて角部にある分割領域における紙面候補領域の外部領域の分散値σを算出して出力する。 Since the process of step S2403 and step S2404 is the same as the process of step S1802 and step S1803, description is abbreviate | omitted here. In step S2405, the information output unit 1607 calculates and outputs the variance value σ of the outer area of the paper candidate area in the divided area at the corner using the following equation (2).

ここで、Ｎは画素数
ｍは領域の平均輝度値
ｓ_ｉは画素ｉの輝度値
式（２）では分散値σの算出に輝度値を用いているが、輝度値に限定しない。入力画像が多値（例えばＲＧＢ値）の場合には、多値空間（ＲＧＢ空間）内で全画素値に対して主成分分析を行い、第一主成分得られる主軸に対して全画素値を射影して得る値を輝度値の替わりに式（２）に適用することで分散値σを算出することもできる。 Here, N is the number of pixels, m is the average luminance value of the region, s _i is the luminance value of the pixel i, and the luminance value is used to calculate the variance value σ in Equation (2), but is not limited to the luminance value. When the input image is multi-valued (for example, RGB values), the principal component analysis is performed on all the pixel values in the multi-valued space (RGB space), and all the pixel values are calculated with respect to the main axis obtained from the first principal component. The variance value σ can also be calculated by applying the value obtained by projection to the equation (2) instead of the luminance value.

ステップＳ２４０６では、ステップＳ２４０５で求めた外部領域の分散値を角部の分割領域Ｂの評価値とし、ステップＳ２４０７へ進む。 In step S2406, the variance value of the outer area obtained in step S2405 is set as the evaluation value of the divided area B at the corner, and the process proceeds to step S2407.

ステップＳ２４０７では、全分割領域Ｂに対して処理が完了していれば終了し、未処理領域が残っていればステップＳ２４０１へ戻る。 In step S2407, if the process is completed for all the divided areas B, the process ends. If an unprocessed area remains, the process returns to step S2401.

本実施形態では、情報出力部１６０７は、ステップＳ１５０４のスコア算出処理では、以下の式（３）を用いてスコアを算出して出力する。 In the present embodiment, the information output unit 1607 calculates and outputs a score using the following equation (3) in the score calculation process in step S1504.

ここで、
η_ｍはＳ１５０３で求めた分割領域の各辺の内外分離度の平均
σ_ｍはＳ１５０３で求めた分割領域の各角の外部分散の平均
式（３）は、本実施形態のスコア算出処理の一例であり、各辺の分割領域の分離度が高いほど紙面領域候補の確度が高くなり、角部の外部の分散値が低いほど紙面領域候補の確度が高くなることが本質であり、式（３）のスコア算出に別の重みをつけてもよい。また、分離度又は分散値だけでスコアを算出するなど、その他の方法でスコアを求めてもよい。 here,
η _m is the average of the inside / outside separation degree of each side of the divided region obtained in S1503 σ _m is the average of the external variance of each corner of the divided region obtained in S1503 Equation (3) is an example of the score calculation process of this embodiment It is essential that the accuracy of the paper region candidate increases as the degree of separation of the divided regions on each side increases, and the accuracy of the paper region candidate increases as the variance value outside the corner decreases. ) May be given a different weight. Moreover, you may obtain | require a score by other methods, such as calculating a score only by the degree of separation or a dispersion value.

そして、ステップＳ１５０５で情報出力部１６０７から出力されたスコアの判定が行われ、その判定結果に基づいて、ステップＳ１５０６で領域特定部１６０８は図２５に示す紙面領域候補２５０１を紙面領域又は注目領域として特定する。 In step S1505, the score output from the information output unit 1607 is determined. On the basis of the determination result, in step S1506, the area specifying unit 1608 uses the paper area candidate 2501 shown in FIG. 25 as the paper area or attention area. Identify.

図２５（ａ）は、入力される撮像画像と撮像画像から得られた紙面領域候補の例を示す。紙面領域候補２５０１は、角部が丸みを帯びた紙面の領域候補である。ステップＳ３０４の四角形生成処理では、四辺の候補となる線分を伸長し四角形を生成するため、角部が丸い紙面であっても、この様な四角形が紙面候補として形成される。 FIG. 25A shows an example of an input captured image and a paper area candidate obtained from the captured image. A paper area candidate 2501 is a paper area candidate with rounded corners. In the quadrangle generation processing in step S304, since the line segments that are candidates for the four sides are expanded to generate a quadrangle, such a quadrangle is formed as a paper surface candidate even if the corner has a round paper surface.

分割領域２５０２は、ステップＳ１５０２で生成された、紙面領域候補２６０１の角部に位置する分割領域である。 The divided area 2502 is a divided area located at the corner of the paper area candidate 2601 generated in step S1502.

図２５（ｂ）は、分割領域２５０２を拡大して表した図である。分割領域２５０２には紙面領域候補の内部領域２５０３、外部領域２５０４が存在している。この様に角に丸みを帯びた紙面の場合、角部の分割領域の内部領域２５０３に実際の紙面が一部しか入らない場合や、まったく入らない場合がある。その場合、内外領域の画素分散値に差がでないため、分離度が低い値がでてしまい、第１の実施形態で説明した方法では、正しい紙面領域であっても紙面領域候補のスコアが低い値をなってしまう。分散値を用いる方法では、背景の画素値が一様であれば、分散値が低い値となり、紙面領域候補のスコアを高く算出することが可能である。 FIG. 25B is an enlarged view of the divided area 2502. In the divided area 2502, there are an inner area 2503 and an outer area 2504 of paper area candidates. In the case of a paper surface with rounded corners in this way, there may be a case where only a part of the actual paper surface enters the inner region 2503 of the corner divided region, or there is no case. In that case, since there is no difference in the pixel dispersion values in the inner and outer regions, a low separation value is obtained. With the method described in the first embodiment, the score of the paper region candidate is low even in the correct paper region. Become a value. In the method using the variance value, if the background pixel value is uniform, the variance value is low, and the score of the paper area candidate can be calculated high.

以上、本実施形態によれば、背景領域の画素値が均一であることを利用し、紙面領域候補の角部の外部領域の分散値を用いることで、角に丸みを帯びた紙面に対しても紙面領域の検出を精度よく行うことが可能である。 As described above, according to the present embodiment, by using the fact that the pixel values of the background region are uniform and using the variance value of the outer region of the corner of the paper region candidate, the paper surface with rounded corners is used. In addition, it is possible to accurately detect the paper region.

＜その他の実施形態＞
本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 <Other embodiments>
The present invention supplies a program that realizes one or more functions of the above-described embodiments to a system or apparatus via a network or a storage medium, and one or more processors in the computer of the system or apparatus read and execute the program This process can be realized. It can also be realized by a circuit (for example, ASIC) that realizes one or more functions.

１０１画像入力部
１０２ＣＰＵ
１０３ＲＡＭ
１０４記憶部 101 Image input unit 102 CPU
103 RAM
104 Storage unit

Claims

An input means for inputting a region composed of a plurality of line segments as a region candidate;
First information indicating the relationship between image feature amounts on both sides of the line segment as a boundary in a plurality of regions having a predetermined size or less in each line segment constituting the region candidate, and a predetermined size at each corner of the region candidate Output means for outputting the second information indicating the relationship between the image feature amount inside and outside the area candidate in the area below
Using the first information and the second information corresponding to each of the plurality of regions, specifying means for specifying the region candidate as a region of interest;
An image processing apparatus comprising:

2. The image according to claim 1, wherein the first information is a first degree of separation of pixel values on both sides with the line segment as a boundary in each of the plurality of regions in each line segment. Processing equipment.

The image processing apparatus according to claim 2 , wherein the specifying unit specifies the region candidate as a region of interest based on an average value of the first degree of separation.

The image processing apparatus according to claim 1 , wherein the second information is a second degree of separation of pixel values inside and outside the area candidate in the area at each corner.

The image processing apparatus according to claim 4 , wherein the specifying unit specifies the region candidate as a region of interest based on the first information and the average value of the second degree of separation.

The first information is a first degree of separation of pixel values on both sides with the line segment as a boundary in each of the plurality of regions in the line segments, and the second information is the first information at the corners. A second degree of separation of pixel values inside and outside the region candidate in a region,
The specifying unit, based on the average value of the second separation and the average value of the first degree of separation, the image processing apparatus according to claim 1, wherein the identifying the area candidate region of interest .

The specifying unit specifies the region candidate as a region of interest based on the average value of the first degree of separation weighted by the average value of the second degree of separation and the average value of the second degree of separation. The image processing apparatus according to claim 6 .

The specifying unit specifies the region candidate as a region of interest based on the average value of the first degree of separation and the average value of the second degree of separation weighted by the average value of the first degree of separation. The image processing apparatus according to claim 6 .

The specifying means, any one of the claims 6 to 8, wherein the identifying the region candidate said first separation and evaluation values obtained from the second separation is equal to or higher than the threshold as a region of interest The image processing apparatus according to item.

When there are a plurality of region candidates input by the input unit, the specifying unit sets a predetermined number of the region candidates as a region of interest based on the evaluation values obtained from the first degree of separation and the second degree of separation. the image processing apparatus according to any one of claims 6 to 8, wherein the identifying.

An input means for inputting a region composed of a plurality of line segments as a region candidate;
First information indicating the relationship between image feature amounts on both sides of the line segment as a boundary in a plurality of regions having a predetermined size or less in each line segment constituting the region candidate, and a predetermined size at each corner of the region candidate and output means for outputting and a third information indicating the image feature amount of the outside of the region candidate in the following areas are,
Using the first information and the third information corresponding to each of the plurality of regions, the specifying means for specifying the region candidate as a region of interest ;
An image processing apparatus comprising:

The image processing apparatus according to claim 11 , wherein the third information is a variance value of pixel values outside the area candidate in the area at each corner.

The input means includes
Creating means for creating an edge image based on edge pixels extracted from a captured image obtained by imaging a document;
Straight line generating means for generating a straight line from the edge pixels;
Line segment output means for outputting a line segment indicating a boundary of the document based on the straight line generated from the straight line generation means;
Generating means for generating the region candidates from the line segments generated by the line segment output means;
The image processing apparatus according to any one of claims 1 to 1 2, characterized in that it further comprises a.

An input step of inputting a region composed of a plurality of line segments as a region candidate;
First information indicating the relationship between image feature amounts on both sides of the line segment as a boundary in a plurality of regions having a predetermined size or less in each line segment constituting the region candidate, and a predetermined size at each corner of the region candidate An output step for outputting second information indicating the relationship between the inner and outer image feature amounts of the region candidates in the region below
Using the first information and the second information corresponding to each of the plurality of regions, a specifying step of specifying the region candidate as a region of interest;
An image processing method comprising:

An input step of inputting a region composed of a plurality of line segments as a region candidate;
First information indicating the relationship between image feature amounts on both sides of the line segment as a boundary in a plurality of regions having a predetermined size or less in each line segment constituting the region candidate, and a predetermined size at each corner of the region candidate An output step of outputting second information indicating a relationship between the inner and outer image feature amounts of the region candidates in the region below
Using the first information and the second information corresponding to each of the plurality of regions, a specifying step of specifying the region candidate as a region of interest;
A program that causes a computer to execute.

An input step of inputting a region composed of a plurality of line segments as a region candidate;
First information indicating the relationship between image feature amounts on both sides of the line segment as a boundary in a plurality of regions having a predetermined size or less in each line segment constituting the region candidate, and a predetermined size at each corner of the region candidate An output step of outputting third information indicating an image feature amount outside the area candidate in the area below
Using the first information and the third information corresponding to each of the plurality of regions, a specifying step of specifying the region candidate as a region of interest;
An image processing method comprising:

An input step of inputting a region composed of a plurality of line segments as a region candidate;
First information indicating the relationship between image feature amounts on both sides of the line segment as a boundary in a plurality of regions having a predetermined size or less in each line segment constituting the region candidate, and a predetermined size at each corner of the region candidate An output step of outputting third information indicating an image feature amount outside the region candidate in the region below
Using the first information and the third information corresponding to each of the plurality of regions, a specifying step of specifying the region candidate as a region of interest;
A program that causes a computer to execute.