JP2013235418A - Image processing device and manuscript reading system

Info

Publication number: JP2013235418A
Application number: JP2012107366A
Authority: JP
Inventors: Takahiro Shoji; 隆浩庄司
Original assignee: Panasonic Corp
Current assignee: Panasonic Corp
Priority date: 2012-05-09
Filing date: 2012-05-09
Publication date: 2013-11-21

Abstract

PROBLEM TO BE SOLVED: To prevent, with a simple configuration, an inappropriate tilt correction from being performed when it is difficult to detect effective straight line components in a captured image.

SOLUTION: A document reading system 1 comprises: a line segment extraction unit 33 that extracts a plurality of line segments based on edge pixels extracted from a captured image; a cross line segment ratio calculation unit 34 that determines the crossing state of the line segments and thereby calculates the cross line segment ratio of the plurality of line segments; a tilt correction angle setting unit 35 that estimates the tilt angle of the captured image by statistically processing the tilt angles of the plurality of line segments; and an image rotation unit 36 that corrects the tilt of the captured image by rotating it based on the estimated tilt angle. When the cross line segment ratio is equal to or greater than a threshold value, the image rotation unit does not perform the tilt correction.

Description

The present invention relates to an image processing apparatus that processes images obtained by reading a document such as a book, and to a document reading system including the image processing apparatus.

Document cameras (book scanners) that photograph a book from above while its pages lie naturally open, and thereby read the page images, have come into wide use (see Patent Document 1). With such a document camera, page images can be read one after another while the pages are turned, so that a book can be digitized efficiently. Further, in a document reading system using such a document camera, even when the document is set tilted with respect to the regular reading position (that is, with respect to a reference direction such as horizontal or vertical), the tilt of the image can be corrected automatically by estimating the tilt angle from the image of the read document (for example, from the positions of the binding and the document edges).

As a technique for estimating the tilt angle of an image, a data processing apparatus is known that includes, for example: Hough transform means that generates a parameter chart by applying a Hough transform (straight line component extraction) to image data binarized after edge processing that emphasizes the pixel density of edge portions; histogram generation means that generates a histogram by accumulating the frequencies of the coordinates in the parameter chart for each angle θ; and tilt angle detection means that estimates the tilt angle by identifying, from the histogram, the angle θ with the highest frequency (see Patent Document 2).

Patent Document 1: JP 2001-103240 A
Patent Document 2: JP H11-328408 A

To estimate the tilt angle of an image accurately, it is necessary to use effective straight line components detected from the outer contour, character strings, figure frames, and the like that run along a reference direction of the document, such as horizontal or vertical. However, if photographs, complex figures, or the like cover a wide area of the page, straight line components that act as noise in the tilt angle estimation are mixed in, and it becomes difficult to detect the effective straight line components. In such a case, a large error arises in the estimated tilt angle of the image.

The prior art described in Patent Document 2, however, does not take the content of the page into account, and therefore cannot readily cope with this problem. Consequently, when that prior art is used to tilt-correct, and continuously display or record, images in which effective straight line components are difficult to detect, the orientation of the successive tilt-corrected images changes unnaturally and the images become hard to view.

The present invention has been devised in view of these problems of the prior art. Its main object is to provide an image processing apparatus that can, with a simple configuration, prevent an inappropriate tilt correction from being performed when it is difficult to detect effective straight line components in a captured image, and a document reading system including the image processing apparatus.

The image processing apparatus of the present invention comprises: an edge extraction unit that extracts a plurality of edge pixels from captured images obtained by sequentially photographing the pages of a document; a straight line extraction unit that extracts a plurality of straight line components based on the edge pixels; a cross line segment ratio calculation unit that determines the crossing state of the straight line components and thereby calculates a cross line segment ratio, which is the proportion of the plurality of straight line components that cross one another; a tilt angle estimation unit that estimates the tilt angle of the captured image by statistically processing the tilt angles of the plurality of straight line components; and an image rotation unit that corrects the tilt of the captured image by rotating it based on the tilt angle estimated by the tilt angle estimation unit. The apparatus has a first operation mode in which the image rotation unit performs the tilt correction and a second operation mode in which the image rotation unit does not perform the tilt correction, and executes the second operation mode when the cross line segment ratio is equal to or greater than a predetermined threshold.

Thus, the present invention has the excellent effect of making it possible, with a simple configuration, to prevent an inappropriate tilt correction from being performed when it is difficult to detect effective straight line components in a captured image.

FIG. 1 is an overall configuration diagram showing the document reading system 1 according to the present invention.
FIG. 2 is a plan view showing the book B set at the regular reading position in the document reading system 1 shown in FIG. 1.
FIG. 3 is a block diagram showing a schematic configuration of the document camera 2 and the PC 3 in FIG. 1.
FIG. 4 is a flowchart showing the main part of the image display procedure performed by the document reading system 1 shown in FIG. 1.
FIG. 5 is an explanatory diagram showing examples of the cross line segment ratio acquired in the cross line segment ratio acquisition (ST105) in FIG. 4.
FIG. 6 is a flowchart showing details of the cross line segment ratio acquisition (ST105) in FIG. 4.
FIG. 7 is an explanatory diagram showing a specific example of the cross line segment ratio acquisition (ST105) and the deskew feasibility determination (ST106) in FIG. 4.
FIG. 8 is a state transition diagram outlining the deskew determination process in FIG. 7.
FIG. 9 is a flowchart showing details of the deskew feasibility determination (ST106) in FIG. 4.
FIG. 10 is a flowchart showing details of the line segment information statistical processing (ST107) in FIG. 4.

A first invention made to solve the above problem comprises: an edge extraction unit that extracts a plurality of edge pixels from captured images obtained by sequentially photographing the pages of a document; a straight line extraction unit that extracts a plurality of straight line components based on the edge pixels; a cross line segment ratio calculation unit that determines the crossing state of the straight line components and thereby calculates a cross line segment ratio, which is the proportion of the plurality of straight line components that cross one another; a tilt angle estimation unit that estimates the tilt angle of the captured image by statistically processing the tilt angles of the plurality of straight line components; and an image rotation unit that corrects the tilt of the captured image by rotating it based on the tilt angle estimated by the tilt angle estimation unit. The invention has a first operation mode in which the image rotation unit performs the tilt correction and a second operation mode in which the image rotation unit does not perform the tilt correction, and executes the second operation mode when the cross line segment ratio is equal to or greater than a predetermined threshold.

With this configuration, an inappropriate tilt correction can be prevented, with a simple configuration, when it is difficult to detect effective straight line components in a captured image. In particular, because the cross line segment ratio of the straight line components is used as an index of the proportion of effective straight line components (in other words, of the proportion of noise components), whether to perform the tilt correction can be decided quickly and accurately without complex processing such as determining the type of image (photograph, figure, text, and so on).

In a second invention, based on the first invention, the threshold consists of a first value and a second value larger than the first value. In a series of captured images, when the cross line segment ratio of the current captured image has decreased from the cross line segment ratio of the previous captured image, which was larger than the first value, and has fallen below the first value, the second operation mode is switched to the first operation mode. Conversely, when the cross line segment ratio of the current captured image has increased from the cross line segment ratio of the previous captured image, which was smaller than the second value, and has risen above the second value, the first operation mode is switched to the second operation mode.

With this configuration, the two operation modes are switched with hysteresis provided by the two thresholds. Therefore, when captured images are tilt-corrected one after another, an inappropriate tilt correction can be prevented even if the cross line segment ratio of the straight line components fluctuates, while the document is stationary, because of some destabilizing factor (a change in illumination, slight movement of the page, image sensor noise, the randomness of the probabilistic Hough transform, and so on).
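The two-threshold switching described above can be sketched as a small state machine. This is only an illustration under stated assumptions: the class name `DeskewModeSelector`, the threshold values 0.4 and 0.6, and the choice to start in the first operation mode are all hypothetical and do not come from the source.

```python
class DeskewModeSelector:
    """Hysteresis switch between the first mode (tilt correction on)
    and the second mode (tilt correction off).

    `low` and `high` play the roles of the first and second threshold
    values; their numeric values here are assumptions.
    """

    def __init__(self, low=0.4, high=0.6, deskew_enabled=True):
        if not low < high:
            raise ValueError("the first value must be smaller than the second")
        self.low = low
        self.high = high
        # Assumption: start in the first operation mode.
        self.deskew_enabled = deskew_enabled

    def update(self, cross_ratio):
        """Feed the ratio of the current frame; return True if deskew is on."""
        if self.deskew_enabled and cross_ratio > self.high:
            # Ratio rose above the second value: stop correcting.
            self.deskew_enabled = False
        elif not self.deskew_enabled and cross_ratio < self.low:
            # Ratio fell below the first value: resume correcting.
            self.deskew_enabled = True
        return self.deskew_enabled


selector = DeskewModeSelector()
history = [selector.update(r) for r in (0.30, 0.50, 0.65, 0.50, 0.45, 0.35, 0.50)]
```

Ratios that wander between the two values (0.50 and 0.45 in the sequence above) leave the current mode unchanged, which is the behavior the hysteresis is meant to provide.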

In a third invention, based on the first or second invention, when the crossing state determination finds that the straight line component under determination crosses one or more other line segments, the tilt angle estimation unit excludes that straight line component from the statistical processing.

With this configuration, a straight line component under determination that crosses one or more other straight line components is excluded from the statistical processing. Noise components (that is, inappropriate straight line components that do not match the reference direction of the document) can thus be removed effectively from the statistical processing, and as a result the tilt angle of the captured image can be estimated accurately.
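As a rough illustration of excluding crossing segments before the statistical processing, the sketch below estimates a tilt angle from the non-crossing segments only. The statistic itself is an assumption: this passage does not say which statistical processing is used, so a median of angles folded into the range [-45°, 45°) stands in for it. The function names are hypothetical.

```python
import math
from statistics import median


def folded_angle_deg(segment):
    """Angle of a segment in degrees, folded into [-45, 45).

    Folding by 90 degrees lets horizontal and vertical edges vote for
    the same tilt. The sign convention depends on the image coordinate
    system (y pointing down flips it).
    """
    (x1, y1), (x2, y2) = segment
    angle = math.degrees(math.atan2(y2 - y1, x2 - x1))
    return (angle + 45.0) % 90.0 - 45.0


def estimate_tilt_deg(segments, cross_flags):
    """Median folded angle of the segments whose crossing flag is False.

    Returns None when every segment was flagged as crossing, i.e. when
    no effective straight line component remains.
    """
    angles = [
        folded_angle_deg(seg)
        for seg, crossed in zip(segments, cross_flags)
        if not crossed
    ]
    return median(angles) if angles else None
```

For a page tilted by about 5°, a near-horizontal text baseline and a near-vertical page edge both fold to roughly 5°, while a flagged diagonal segment from a photograph contributes nothing to the estimate.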

A fourth invention is a document reading system comprising the image processing apparatus according to any one of the first to third inventions and an image input device having a camera unit that generates the captured images.

Embodiments of the present invention are described below with reference to the drawings.

FIG. 1 is an overall configuration diagram of the document reading system 1 according to the present invention, and FIG. 2 is a plan view showing the book B set at the regular reading position in the document reading system 1. The document reading system 1 reads images of the pages of a book (document) B to acquire image data of the pages. It consists of a document camera (image input device) 2 that photographs the pages and converts them into a video signal, and a PC 3 communicably connected to the document camera 2.

The document camera 2 includes a camera unit 4 having a photographing function and a stand unit 5 that holds the camera unit 4. The camera unit 4 incorporates an image sensor such as a CCD or CMOS sensor and an illumination light source such as an LED or a fluorescent lamp (neither is shown). The stand unit 5 has legs 7 that open in a substantially V shape and rest on a placement surface 6 such as a desk top, and an arm 8 supported by the legs 7. A pair of guide members 9 for defining the photographing position of the book B project from the upper surface of the legs 7. The arm 8 extends obliquely upward from the legs 7 and is telescopic, and it holds the camera unit 4 rotatably by means of a hinge 8a, so that the angle of view and the optical axis direction (photographing direction) of the camera unit 4 can be adjusted. The camera unit 4 also has a known zoom function that continuously varies the angle of view and the magnification.

The PC 3 functions as an input/output device with which the user sets the various operating conditions of the document camera 2 and checks the images captured by the document camera 2. It also functions as an image processing apparatus that processes and records the captured images.

The apparatus used together with the document camera 2 in the document reading system 1 is not limited to a PC (personal computer); any information processing apparatus capable of realizing the same functions can be used. Some of the functions of the PC 3 may be added to the document camera 2, or the PC 3 and the document camera 2 may be constructed as a single unit. Furthermore, the document camera 2 and the PC 3 need not be connected directly and may be connected, for example, via a network (not shown). In that case, the document camera 2 generates the trigger signal for imaging (specifically, for example, by means of a switch or button that instructs image capture) and transmits the data to the remote PC 3 (for example, a server) in a push manner.

To photograph with the document camera 2, the user places the opened book B on the placement surface 6 directly below the camera unit 4 (in the optical axis direction). This yields a captured image (moving image or still image) that includes the two facing pages of the book B and part of the placement surface 6 around the book B. As shown in FIG. 2, the user can set the book B accurately at the reading position by abutting the upper edge Ba of the book B against the guide members 9, which extend in the left-right direction, and thereby adjusting its front-rear position and tilt. The captured image is transmitted to the PC 3 as appropriate, undergoes the necessary image processing there, and is then saved to a predetermined recording medium and shown to the user on a display.

The document read by the document reading system 1 is not limited to a book and may be any information medium containing characters, drawings, photographs, or similar information. The medium may also be a transmissive original such as a positive or negative film. In that case, the system may be configured, for example, to illuminate the original from behind with a light source unit in which a light source is placed at the side of a transparent light guide plate, and to capture the transmitted light.

FIG. 3 is a block diagram showing a schematic configuration of the document camera 2 and the PC 3 in FIG. 1.

The document camera 2 has an imaging processing unit 11 including the camera unit 4, an operation instruction unit 12 that causes the imaging processing unit 11 to perform the required operations based on the operating conditions set by the user, and an external interface 13, compliant with the USB standard or the like, for connection to the PC 3.

The PC 3 has: an external interface 21, compliant with the USB standard or the like, for connection to the document camera 2; an image data input unit 22 that receives captured image data from the document camera 2; an image processing unit 23 that performs the image processing needed to record and display the captured images; an operation system control unit 25 that transmits to the document camera 2 the operating conditions set by the user through an input operation unit 24 such as a keyboard; a display data generation unit 27 that generates data for displaying the processed captured images on a display unit 26 such as an LCD or a projector; and a data storage unit 28 that saves the processed captured image data. The image processing unit 23 has an image tilt correction unit 30 that corrects the tilt of the captured images. The captured image data processed by the image tilt correction unit 30 is read from an image data storage unit 37. The various parameters used by the image tilt correction unit 30 for calculating the cross line segment ratio, for the statistical processing, and so on (described later) are read from a parameter storage unit 38.

The processing functions of the image data input unit 22, the image processing unit 23, the operation system control unit 25, the display data generation unit 27, and so on in the PC 3 can be realized in software by executing a program such as an image processing application on a CPU. Of course, the PC 3 may be regarded as an image processing apparatus and provided with hardware that executes specific processing at high speed. The image data storage unit 37 and the parameter storage unit 38 consist of general-purpose memory.

In the PC 3, the image data input unit 22 sequentially stores the captured image data transmitted from the document camera 2 in the image data storage unit 37 of the image processing unit 23 and, as needed, sequentially outputs the captured image data to the image tilt correction unit 30. During image processing of the captured image data, the image tilt correction unit 30 estimates the tilt angle of the captured image and corrects the tilt of the captured image based on the estimated tilt angle. The tilt-corrected captured image data is stored in the data storage unit 28 and is also sent to the display data generation unit 27 and displayed on the display unit 26.

The image tilt correction unit 30 has: a grayscale conversion unit 31 that converts the captured image to grayscale or binarizes it; an edge extraction unit 32 that extracts a plurality of edge pixels from the grayscale-converted captured image (performs edge detection); a line segment extraction unit (straight line extraction unit) 33 that extracts, based on the extracted edge pixels, a plurality of line segments (straight line components) connecting those edge pixels; a cross line segment ratio calculation unit 34 that determines the mutual crossing state of the extracted line segments and calculates the cross line segment ratio, which is the proportion of all line segments that cross another line segment; a tilt correction angle setting unit (tilt angle estimation unit) 35 that estimates the tilt angle of the captured image by statistically processing the tilt angles of the extracted line segments and sets the tilt correction angle of the captured image based on that tilt angle; and an image rotation unit 36 that rotates the captured image based on the tilt correction angle.

In the PC 3, the user can also operate the input operation unit 24 to enter operating conditions such as the resolution of the images captured by the document camera 2, the frame rate, the shutter speed, and the light output of the illumination light source. These operating conditions are transmitted from the operation system control unit 25 to the document camera 2 as control signals. In the document camera 2, the imaging processing unit 11 performs the photographing operation in accordance with processing commands that the operation instruction unit 12 issues based on the control signals from the PC 3.

FIG. 4 is a flowchart showing the main part of the image display procedure performed by the document reading system 1 shown in FIG. 1, and FIG. 5 is an explanatory diagram showing examples of the cross line segment ratio acquired in the cross line segment ratio acquisition (ST105) in FIG. 4.

A user of the document reading system 1 first sets the opened book B at the reading position (a position that the document camera 2 can photograph), starts the document camera 2, and launches the required application on the PC 3. The document camera 2 then begins capturing images, and the image data input unit 22 of the PC 3 detects the input of an image from the document camera 2 (ST101). Captured images are input at a predetermined frame rate, and the processing described below is executed on each captured image in turn.

Next, in the PC 3, the image data input unit 22 stores the captured image data received from the document camera 2 in the image data storage unit 37, and the grayscale conversion unit 31 then converts the captured image from an RGB color image to a monochrome image (ST102). This grayscale conversion can be performed with a well-known method such as the intermediate value method. If the document camera 2 is configured to output the signal after YC separation directly, the Y (luminance) signal may be used as it is.
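As a small illustration of the grayscale conversion in ST102, the sketch below uses the intermediate value method as it is commonly described, namely taking the mean of the largest and smallest of the R, G, and B values. That reading of the method, the function names, and the nested-list image layout are assumptions made for the example.

```python
def mid_value_gray(r, g, b):
    """Intermediate value method: average of the max and min channel (0-255)."""
    return (max(r, g, b) + min(r, g, b)) // 2


def to_grayscale(rgb_image):
    """Convert a nested list of (R, G, B) tuples, indexed [row][col],
    into a nested list of gray levels."""
    return [[mid_value_gray(*pixel) for pixel in row] for row in rgb_image]


gray = to_grayscale([[(0, 0, 0), (255, 255, 255)], [(255, 0, 0), (10, 20, 30)]])
```

A production pipeline would more likely operate on array data from an imaging library; plain lists are used here only to keep the example self-contained.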

The edge extraction unit 32 then extracts, as edge pixels, the locations in the grayscale-converted captured image where the luminance changes abruptly (ST103). This edge extraction can be performed with a well-known method such as the Canny method. The line segment extraction unit 33 then extracts line segments (straight line components) from the acquired edge pixels (ST104). This line segment extraction can be performed with a well-known method such as the probabilistic Hough transform. The probabilistic Hough transform is used to detect line segments with end points in an image, and it yields the coordinates of the start point and end point of each detected line segment.

The information on each line segment extracted here, such as the coordinates of its end points (hereinafter "line segment information"), is stored in the parameter storage unit 38. The line segment information includes not only data such as the coordinates of both end points of each line segment, but also markers for checking whether predetermined operating conditions hold and for checking the state of the data. One such marker is a flag indicating whether the segment is a crossing line segment, described later (hereinafter the "crossing line segment flag"), which takes the value "false" when the segment does not cross another line segment and "true" when it does. The straight line components extracted from the captured image are not limited to the line segments of the present embodiment; any components may be used as long as the degree of tilt of the book B can be estimated by statistically processing at least their tilt angles.
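One way to picture the line segment information is as a small record holding the two end points and the crossing line segment flag. The sketch below is purely illustrative: the class name, the field names, and the `length` helper are hypothetical and are not taken from the source.

```python
import math
from dataclasses import dataclass
from typing import Tuple

Point = Tuple[float, float]


@dataclass
class LineSegmentInfo:
    """End points of one extracted segment plus its crossing flag."""

    start: Point
    end: Point
    # False until the segment is found to cross another segment.
    is_crossing: bool = False

    @property
    def length(self) -> float:
        # Convenience helper added for illustration only.
        return math.hypot(self.end[0] - self.start[0], self.end[1] - self.start[1])


segment = LineSegmentInfo(start=(0, 0), end=(3, 4))
```

The flag defaults to false, so a freshly extracted segment is treated as non-crossing until the crossing determination says otherwise.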

Next, the cross line segment ratio calculation unit 34 determines the mutual crossing state of the line segments and calculates the cross line segment ratio of the input captured image (ST105). If a "crossing line segment" is defined as a line segment that has an intersection with another line segment, and a "non-crossing line segment" as a line segment that has no intersection with any other line segment, then cross line segment ratio = number of crossing line segments / (number of crossing line segments + number of non-crossing line segments). Here, number of crossing line segments + number of non-crossing line segments = total number of line segments. A "crossing line segment" may also include a segment whose end point coincides with an end point of another line segment, or whose end point lies on another line segment.
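The ratio defined above can be sketched with a standard orientation-based segment intersection test. This is an illustration under stated assumptions, not the procedure of FIG. 6: the function names are hypothetical, every pair of segments is compared, touching end points are counted as crossings (which the text permits but does not require), and an empty input is given a ratio of 0.0.

```python
from itertools import combinations


def _orient(p, q, r):
    """Sign of the cross product (q - p) x (r - p): 1, -1, or 0 if collinear."""
    v = (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])
    return (v > 0) - (v < 0)


def _within_box(p, q, r):
    """True if r (already collinear with p-q) lies inside the box spanned by p and q."""
    return (min(p[0], q[0]) <= r[0] <= max(p[0], q[0])
            and min(p[1], q[1]) <= r[1] <= max(p[1], q[1]))


def segments_intersect(a, b):
    """a and b are ((x1, y1), (x2, y2)); touching counts as intersecting."""
    p1, q1 = a
    p2, q2 = b
    o1, o2 = _orient(p1, q1, p2), _orient(p1, q1, q2)
    o3, o4 = _orient(p2, q2, p1), _orient(p2, q2, q1)
    if o1 != o2 and o3 != o4:
        return True
    # Collinear cases: an end point of one segment lies on the other.
    return ((o1 == 0 and _within_box(p1, q1, p2))
            or (o2 == 0 and _within_box(p1, q1, q2))
            or (o3 == 0 and _within_box(p2, q2, p1))
            or (o4 == 0 and _within_box(p2, q2, q1)))


def cross_segment_ratio(segments):
    """Return (ratio, flags): flags[i] is True if segment i crosses any other."""
    n = len(segments)
    if n == 0:
        return 0.0, []  # Assumption: no segments means a ratio of zero.
    flags = [False] * n
    for i, j in combinations(range(n), 2):
        if segments_intersect(segments[i], segments[j]):
            flags[i] = flags[j] = True
    return sum(flags) / n, flags


ratio, flags = cross_segment_ratio([
    ((0, 0), (10, 10)), ((0, 10), (10, 0)),   # an X: both cross
    ((20, 0), (30, 0)), ((20, 5), (30, 5)),   # two parallel lines: neither crosses
])
```

The pairwise comparison is quadratic in the number of segments, which is acceptable for a sketch; a real-time system processing many segments per frame might need a faster scheme.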

As shown in FIG. 5, the intersecting line segment ratio of the captured image 41 varies greatly with the content of the spread pages (that is, with the type of image). For example, in the textbook B1 shown in FIG. 5(A), a text area 43 containing characters, symbols, and the like occupies most of the paper surface, and a graphic area 44 containing tables, drawings, photographs, and the like occupies part of it. Many non-intersecting line segments (straight-line components effective for estimating the tilt angle of the image) are extracted from the outer contour of the textbook B1 and from the borders of the text area 43 and the graphic area 44, so the intersecting line segment ratio here is about 20 to 30%. In the animal picture book B2 shown in FIG. 5(B), graphic areas 44 such as photographs, from which many intersecting line segments (noise components) are extracted, occupy more of the paper surface than the text area 43, and the intersecting line segment ratio is about 60 to 70%. Furthermore, in the image of FIG. 5(C), taken by enlarging part (a photograph) of the animal picture book B2, the graphic area 44 occupies the entire captured image 41 and no outer contour of the book is present, so the intersecting line segment ratio is approximately 100%. FIG. 5(A) shows an example in which the reference direction of the document (textbook B1; see line C1) is tilted by α° relative to the reference direction of the captured image (see line C2). In FIG. 5(C), for convenience of illustration, the captured image 41 is shown reduced relative to the animal picture book B2 of FIG. 5(B).

Referring again to FIG. 4, the tilt correction angle setting unit 35 next determines, based on the intersecting line segment ratio calculated in ST105, whether deskew is to be performed on the current captured image (that is, whether the image rotation unit is to perform tilt correction) (ST106). Details are given later, but in this determination, when the intersecting line segment ratio is equal to or less than a predetermined threshold, deskew ON is determined (ST106: YES) and the PC 3 executes a first operation mode in which tilt correction is performed. When the intersecting line segment ratio is equal to or greater than a predetermined threshold, deskew OFF is determined (ST106: NO) and the PC 3 executes a second operation mode in which tilt correction is not performed.

When tilt correction is to be executed (ST106: YES), the tilt correction angle setting unit 35 performs statistical processing (hereinafter, "line segment information statistical processing") on information about the tilt angles of the plurality of line segments detected by the line segment extraction unit 33, thereby estimating the tilt angle of the captured image and setting the tilt correction angle (ST107). In the line segment information statistical processing, histogram information on the tilt angles of the line segments is generated.

Then, based on the tilt correction angle, the image rotation unit 36 rotates the rectangular captured image to correct its tilt (ST108). This produces a captured image equivalent to one obtained when the paper surface of the book B is read at the normal reading position (that is, without tilt). The tilt-corrected captured image is sent from the image tilt correction unit 30 to the display data generation unit 27 and the data storage unit 28. The display data generation unit 27 then processes the tilt-corrected captured image, for example by extracting from the whole page the area in which content appears and correcting the curvature caused by the height of the book B, to generate a flat image, which is displayed on the display unit 26 (ST109). When tilt correction is not executed (ST106: NO), the captured image without tilt correction is displayed on the display unit 26.

In the PC 3, the series of processes ST101 to ST109 is executed repeatedly, so that a series of tilt-corrected images (video) is displayed on the display unit 26 in sequence. In the configuration described, ST107 and ST108 are skipped when ST106 determines that deskew is not to be performed, but the configuration is not limited to this. For example, regardless of the tilt correction angle set in ST107, the image rotation unit 36 may perform a determination similar to that of ST106 in ST108.

By executing this processing, the document reading system 1 can, with a simple configuration, prevent inappropriate tilt correction from being executed when the proportion of effective line segments detected from the captured image is low, that is, when effective straight-line components are difficult to detect from the captured image. In particular, because the intersecting line segment ratio is used as an index of the proportion of effective line segments (in other words, of the proportion of noise components), whether to perform tilt correction can be decided quickly and accurately without complex processing such as determining the image type (photograph, figure, text, and so on).

FIG. 6 is a flowchart showing details of the intersecting line segment ratio acquisition (ST105) in FIG. 4. First, the intersecting line segment ratio calculation unit 34 initializes an intersecting line segment counter i, which indicates the number of line segments that intersect other line segments, to i = 0 (ST201). Then, to determine the mutual intersection state of the line segments, it extracts from all line segments acquired as line segment information one line segment whose intersection state has not yet been determined, as the determination target line segment (ST202).

When an undetermined line segment is extracted in ST202 (ST203: YES), the intersecting line segment flag indicating the intersection state of that line segment is initialized to "false" (no intersection) (ST204), and one line segment other than the determination target line segment is extracted from all line segments acquired as line segment information, as the comparison target line segment (the other line segment) (ST205). In ST205, a line segment in the line segment information that has not yet been compared with the current determination target line segment is selected as the comparison target line segment, one at a time.

When a comparison target line segment is extracted in ST205 (ST206: YES), the intersecting line segment ratio calculation unit 34 determines whether the determination target line segment extracted in ST202 intersects the comparison target line segment extracted in ST205 (ST207). The intersection state of the line segments can be determined by a known method based on the coordinates of their end points. If the determination target line segment intersects the comparison target line segment (ST208: YES), the intersecting line segment ratio calculation unit 34 changes the intersecting line segment flag to "true" (intersection present) (ST209) and increments the intersecting line segment counter so that i = i + 1 (ST210). It then excludes the current determination target line segment from the extraction candidates of ST202 (ST211) and resets the information on the comparison target line segments to be extracted in ST205 (ST212). Processing then returns to ST202 and the same steps are repeated.

If ST208 determines that the line segments do not intersect (NO), the intersecting line segment ratio calculation unit 34 excludes the current comparison target line segment from the extraction candidates of ST205 (ST213), returns to ST205, extracts the next comparison target line segment, and repeats the same steps. When the current determination target line segment has been compared with every comparison target line segment and no new comparison target line segment is extracted in ST205 (ST206: NO), the determination target line segment intersects none of the other line segments, so ST209 and ST210 are skipped and processing proceeds to ST211.

When the intersection state has finally been determined for all line segments (ST203: NO), the intersecting line segment ratio calculation unit 34 calculates the intersecting line segment ratio (%) by dividing the value of the intersecting line segment counter i by the total number of line segments acquired as line segment information (ST214).
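The flow of FIG. 6 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the text only says that intersection is determined by "a known method", so the orientation (cross product) test used here is one common choice, and the names `segments_intersect` and `intersecting_ratio` are hypothetical. Segments that merely touch at an end point are counted as intersecting, which the description allows as an option, and an empty input is assumed to yield a ratio of 0.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def _orientation(p: Point, q: Point, r: Point) -> int:
    """Sign of the cross product (q - p) x (r - p): 1, -1, or 0 if collinear."""
    cross = (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])
    return (cross > 0) - (cross < 0)


def _in_box(p: Point, q: Point, r: Point) -> bool:
    """True if r lies inside the bounding box of segment pq (r assumed collinear)."""
    return (min(p[0], q[0]) <= r[0] <= max(p[0], q[0])
            and min(p[1], q[1]) <= r[1] <= max(p[1], q[1]))


def segments_intersect(a: Segment, b: Segment) -> bool:
    """Orientation-based test; touching end points count as an intersection."""
    p1, q1 = a
    p2, q2 = b
    o1, o2 = _orientation(p1, q1, p2), _orientation(p1, q1, q2)
    o3, o4 = _orientation(p2, q2, p1), _orientation(p2, q2, q1)
    if o1 != o2 and o3 != o4:
        return True
    # Collinear special cases: an end point of one segment lies on the other.
    if o1 == 0 and _in_box(p1, q1, p2):
        return True
    if o2 == 0 and _in_box(p1, q1, q2):
        return True
    if o3 == 0 and _in_box(p2, q2, p1):
        return True
    if o4 == 0 and _in_box(p2, q2, q1):
        return True
    return False


def intersecting_ratio(segments: List[Segment]) -> Tuple[float, List[bool]]:
    """Return (ratio in percent, per-segment intersecting flags)."""
    flags = [False] * len(segments)              # ST204: flags start as false
    for i, target in enumerate(segments):        # ST202: next target segment
        for j, other in enumerate(segments):     # ST205: next comparison segment
            if i != j and segments_intersect(target, other):
                flags[i] = True                  # ST209: mark as intersecting
                break                            # one hit is enough for this target
    count = sum(flags)                           # equals counter i after ST210
    ratio = 100.0 * count / len(segments) if segments else 0.0  # ST214
    return ratio, flags
```

As in the flowchart, comparison for a given target stops at the first intersection found, so the cost is quadratic in the number of segments only in the worst case where few segments intersect.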

The value of the intersecting line segment flag (false or true) assigned to each line segment during the intersecting line segment ratio acquisition is referenced when counting frequencies for the histogram generated to estimate the tilt angle of the document.

FIG. 7 is an explanatory diagram showing a specific example of the intersecting line segment ratio acquisition (ST105) and the deskew feasibility determination (ST106) in FIG. 4, and FIG. 8 is a state transition diagram outlining the deskew feasibility determination of FIG. 7.

As shown in FIG. 7, a threshold TL1 (first value) and a threshold TL2 (second value) larger than TL1 are used here as thresholds of the intersecting line segment ratio for determining whether to perform deskew. Here TL1 = 70% and TL2 = 80%, but these values can be changed as appropriate according to the content of the document being photographed. In FIG. 7, the points at which the result of the deskew feasibility determination (ST106) switches (deskew ON → OFF or deskew OFF → ON) are shown as open circles to distinguish them from the other points (filled circles).

First, during time T1 to T2, a page with a relatively large text area 43, like that shown in FIG. 5(A), is being photographed. All intersecting line segment ratios calculated during this period are at or below the threshold TL1, so deskew ON is determined in ST106 of FIG. 4, and the state is "Non-Threshold Status" 51 in FIG. 8. Because the same page is photographed throughout T1 to T2, the same intersecting line segment ratio would ideally be calculated each time. In practice the ratio fluctuates (see fluctuation range D1 in FIG. 8) owing to changes in illumination, slight movement of the paper surface, image sensor noise, the randomness of the probabilistic Hough transform, and so on. The point P1 at time T1 does not represent the ratio immediately after photographing starts; it belongs to an intermediate stage of a series of shots.

Next, during time T2 to T3, the user turns the page (changes the paper surface being photographed). The intersecting line segment ratio gradually increases from point P7 at time T2 to point P10 at time T3. On moving (rising) from point P8 to point P9, the ratio exceeds the threshold TL1 but is still below the threshold TL2, so the state shifts to "Threshold1 Status" 52 in FIG. 8 and deskew remains ON. On moving further (rising) from point P9 to point P10, the ratio exceeds the threshold TL2. Deskew OFF is therefore determined in ST106 of FIG. 4 (see also deskew OFF 53 in FIG. 8), after which the state returns to "Non-Threshold Status" 51 in FIG. 8.

Next, during time T3 to T4, a page in which the graphic area 44 occupies more of the paper surface than the text area 43, like that shown in FIG. 5(B), is being photographed (the same page throughout). As in T1 to T2, the intersecting line segment ratio fluctuates during this period (see fluctuation range D2 in FIG. 8). On moving (falling) from point P13 to point P14, the ratio drops below the threshold TL2 but still exceeds the threshold TL1, so the state shifts to "Threshold2 Status" 54 in FIG. 8 and deskew remains OFF. Immediately afterwards, on moving (rising) from point P14 to point P15, the ratio becomes equal to or greater than TL2, and the state returns to "Non-Threshold Status" 51 in FIG. 8.

Next, during time T4 to T5, the user turns the page, as in T2 to T3. The intersecting line segment ratio gradually decreases from point P17 at time T4 to point P19 at time T5. On moving (falling) from point P17 to point P18, the ratio drops below the threshold TL2 but still exceeds the threshold TL1, so the state shifts to "Threshold2 Status" 54 in FIG. 8 and deskew remains OFF. On moving further (falling) from point P18 to point P19, the ratio becomes equal to or less than TL1. Deskew ON is therefore determined in ST106 of FIG. 4 (see also deskew ON 55 in FIG. 8), after which the state returns to "Non-Threshold Status" 51 in FIG. 8.

Next, during time T5 to T6, the intersecting line segment ratio fluctuates as in T1 to T2 (see fluctuation range D3 in FIG. 8). On moving (rising) from point P23 to point P24, the ratio exceeds the threshold TL1 but is still below the threshold TL2, so the state shifts to "Threshold1 Status" 52 in FIG. 8 and deskew remains ON. Immediately afterwards, on moving (falling) from point P24 to point P25, the ratio becomes equal to or less than TL1, and the state returns to "Non-Threshold Status" 51 in FIG. 8.

As described above, in the deskew feasibility determination (ST106), within a series of captured images, when the intersecting line segment ratio of the current captured image has decreased from that of the previous captured image, which was greater than the threshold TL1, and fallen below TL1, the second operation mode (deskew OFF) is switched to the first operation mode (deskew ON). Conversely, when the intersecting line segment ratio of the current captured image has increased from that of the previous captured image, which was smaller than the threshold TL2, and passed TL2, the first operation mode is switched to the second operation mode. In other words, the two operation modes (deskew ON/OFF) are switched with hysteresis provided by the two thresholds TL1 and TL2. Consequently, when captured images are tilt-corrected one after another, the operation mode is prevented from being switched inappropriately in cases where, even though the document is stationary, the intersecting line segment ratio of the straight-line components fluctuates because of some destabilizing factor.
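The net effect of this two-threshold hysteresis can be sketched as follows. This is a simplified, level-based illustration and not the state machine of FIG. 8 and FIG. 9: the class name `DeskewSwitch`, its method `update`, and the initial state are assumptions made for the example, as is the treatment of the boundaries (ON at or below TL1, OFF at or above TL2). The default values TL1 = 70 and TL2 = 80 are the ones given in the description.

```python
class DeskewSwitch:
    """Hypothetical two-threshold (hysteresis) switch for deskew ON/OFF."""

    def __init__(self, tl1: float = 70.0, tl2: float = 80.0,
                 enabled: bool = False) -> None:
        if not tl1 < tl2:
            raise ValueError("TL1 must be smaller than TL2")
        self.tl1 = tl1
        self.tl2 = tl2
        self.enabled = enabled  # True: first mode (deskew ON), False: second mode

    def update(self, ratio: float) -> bool:
        """Feed the ratio (percent) of the current frame; return the mode."""
        if ratio <= self.tl1:
            self.enabled = True    # few intersecting segments: correct the tilt
        elif ratio >= self.tl2:
            self.enabled = False   # mostly noise segments: skip correction
        # Between TL1 and TL2 the previous mode is kept unchanged.
        return self.enabled
```

With such a switch, a ratio that wanders between 70% and 80% on a stationary page leaves the mode as it was, which is the behaviour the hysteresis is meant to provide.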

Although two thresholds TL1 and TL2 are used here, the configuration is not limited to this. For example, only the threshold TL2 may be used, with the first operation mode executed when the intersecting line segment ratio is below TL2 and the second operation mode executed when the ratio is equal to or greater than TL2.

FIG. 9 is a flowchart showing details of the deskew feasibility determination (ST106) in FIG. 4. First, the tilt correction angle setting unit 35 sets the initial state to "Non-Threshold Status" 51 in FIG. 8 and sets deskew to OFF (ST301). Then, when the intersecting line segment ratio is acquired in ST105 (ST302: YES), it determines whether the current state is "Non-Threshold Status" (ST303).

If a threshold is crossed while in "Non-Threshold Status" (ST304: YES) and the crossed threshold is TL1 (ST305: TL1), the state is determined to be "Threshold1 Status" 52 in FIG. 8 (ST306). If the crossed threshold is TL2 (ST305: TL2), the state is determined to be "Threshold2 Status" 54 in FIG. 8 (ST307). Processing then returns to ST302.

If ST303 finds that the state is not "Non-Threshold Status", it is next determined whether the state is "Threshold1 Status" (ST308). If a threshold is crossed while in "Threshold1 Status" (ST309: YES) and the crossed threshold is TL1 (ST310: TL1), the state is determined to be "Non-Threshold Status" (ST311). If the crossed threshold is TL2 (ST310: TL2), deskew is set to OFF (see reference numeral 53 in FIG. 8) (ST312), and the state is then determined to be "Non-Threshold Status" (ST311). Processing then returns to ST302.

If ST308 finds that the state is not "Threshold1 Status", it is next determined whether the state is "Threshold2 Status" (ST313). If a threshold is crossed while in "Threshold2 Status" (ST314: YES) and the crossed threshold is TL2 (ST315: TL2), the state is determined to be "Non-Threshold Status" (ST316). If the crossed threshold is TL1 (ST315: TL1), deskew is set to ON (see reference numeral 55 in FIG. 8) (ST317), and the state is then determined to be "Non-Threshold Status" (ST316). Processing then returns to ST302.

FIG. 10 is a flowchart showing details of the line segment information statistical processing (ST107) in FIG. 4. First, the tilt correction angle setting unit 35 acquires from the parameter storage unit 38 the line segment information for the captured image to be processed (ST401). This line segment information includes the intersection determination result obtained for each line segment in ST207 of FIG. 6.

Next, to determine whether each line segment has an intersection, the tilt correction angle setting unit 35 extracts one line segment (an undetermined one) from all line segments in the line segment information as the determination target line segment (ST402), and determines whether it has an intersection by referring to its intersecting line segment flag (ST404). At least at the start of the line segment information statistical processing, there are determination target line segments that have not yet been examined, so the check of whether all line segments have been examined (ST403) results in No.

When the tilt correction angle setting unit 35 determines that the determination target line segment has no intersection (intersecting line segment flag = false) (ST404: NO), it calculates the tilt angle θ of that line segment from its coordinate data (ST405) and adds the result to the histogram information (that is, selects it as a target of the statistical processing) (ST406). In other words, with the tilt angle θ on the horizontal axis and the frequency on the vertical axis, the frequency corresponding to θ is incremented. The determination target line segment is then excluded from the extraction candidates of ST402 (ST407), and processing returns to ST402.

When the tilt correction angle setting unit 35 determines that the determination target line segment has an intersection (intersecting line segment flag = true) (ST404: YES), it skips ST405 and ST406 (the line segment is not added to the histogram information), excludes the determination target line segment from the extraction candidates of ST402 (ST407), and returns to ST402.

When all determination target line segments have finally been examined (ST403: Yes), the tilt correction angle setting unit 35 generates a histogram from the histogram information added in ST406 and searches the tilt angle distribution of the histogram for the maximum frequency (ST408). It then takes the value of the class with the maximum frequency (for example, the midpoint of the class interval) as the estimated tilt angle of the captured image (that is, the tilt correction angle) (ST409).
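The statistical processing of FIG. 10 can be sketched as below. This is an illustrative sketch under several assumptions that the description does not fix: the function name `estimate_tilt_angle` is hypothetical, the class (bin) width defaults to 1 degree, angles are folded into the range from -90 (inclusive) to 90 (exclusive) degrees so that the order of the two end points does not matter, and `None` is returned when every segment is flagged as intersecting.

```python
import math
from collections import Counter
from typing import List, Optional, Tuple

Point = Tuple[float, float]
Segment = Tuple[Point, Point]


def estimate_tilt_angle(segments: List[Segment], flags: List[bool],
                        bin_width: float = 1.0) -> Optional[float]:
    """Return the midpoint (degrees) of the most frequent angle class."""
    histogram: Counter = Counter()
    for ((x1, y1), (x2, y2)), intersecting in zip(segments, flags):
        if intersecting:
            continue                      # ST404: YES, skip ST405 and ST406
        theta = math.degrees(math.atan2(y2 - y1, x2 - x1))    # ST405
        theta = (theta + 90.0) % 180.0 - 90.0  # fold so direction is ignored
        histogram[math.floor(theta / bin_width)] += 1         # ST406
    if not histogram:
        return None                       # no usable segment (assumed behaviour)
    best_bin, _ = max(histogram.items(), key=lambda item: item[1])  # ST408
    return (best_bin + 0.5) * bin_width   # ST409: midpoint of the class
```

The `flags` list is the per-segment intersecting line segment flag produced during the ratio acquisition, so no intersection test is repeated here.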

In this way, when ST404 finds that the line segment under determination intersects another line segment, that line segment is excluded from the statistical processing. Noise components (that is, inappropriate line segments that do not match the reference direction of the document) can therefore be removed effectively from the statistical processing, and as a result the tilt angle of the captured image can be estimated accurately. In addition, the presence or absence of an intersection for each line segment is taken from the result of the intersecting line segment ratio acquisition in ST105, which simplifies the processing.

The present invention has been described based on specific embodiments, but these embodiments are merely examples and the present invention is not limited by them. For example, in the embodiment above a line segment that intersects one other line segment is excluded from the statistical processing, but a line segment may instead be excluded only when it intersects a predetermined number (two or more) of other line segments. Not all of the components of the image processing apparatus and of the document reading system including it, as shown in the embodiment above, are essential; they can be selected or omitted as appropriate, at least to the extent that this does not depart from the scope of the present invention.

The image processing apparatus according to the present invention and the document reading system including it can, with a simple configuration, prevent inappropriate tilt correction from being executed when effective straight-line components are difficult to detect from a captured image. They are useful as an image processing apparatus that processes images obtained by reading a document such as a book, and as a document reading system including such an apparatus.

1 Document reading system
2 Document camera (image input device)
3 PC (image processing device)
26 Display unit
27 Display data generation unit
28 Data storage unit
30 Image tilt correction unit
32 Edge extraction unit
33 Line segment extraction unit (straight line extraction unit)
34 Intersecting line segment ratio calculation unit
35 Tilt correction angle setting unit (tilt angle estimation unit)
36 Image rotation unit
41 Captured image
B Book (document)

Claims

1. An image processing apparatus comprising:
an edge extraction unit that extracts a plurality of edge pixels in captured images obtained by sequentially photographing the paper surface of a document;
a straight line extraction unit that extracts a plurality of straight-line components based on the edge pixels;
an intersecting line segment ratio calculation unit that, by determining the intersection state of the straight-line components, calculates an intersecting line segment ratio, which is the proportion of the plurality of straight-line components that intersect one another;
a tilt angle estimation unit that estimates the tilt angle of the captured image by performing statistical processing on the tilt angles of the plurality of straight-line components; and
an image rotation unit that performs tilt correction of the captured image by rotating the captured image based on the tilt angle estimated by the tilt angle estimation unit,
wherein the apparatus has a first operation mode in which the image rotation unit performs the tilt correction and a second operation mode in which the image rotation unit does not perform the tilt correction, and executes the second operation mode when the intersecting line segment ratio is equal to or greater than a predetermined threshold.

2. The image processing apparatus according to claim 1, wherein the threshold consists of a first value and a second value greater than the first value, and wherein, in a series of the captured images, the second operation mode is switched to the first operation mode when the intersecting line segment ratio of the current captured image has decreased from that of the previous captured image, which was greater than the first value, and fallen below the first value, whereas the first operation mode is switched to the second operation mode when the intersecting line segment ratio of the current captured image has increased from that of the previous captured image, which was smaller than the second value, and exceeded the second value.

3. The image processing apparatus according to claim 1, wherein, when a straight-line component under determination intersects one or more other line segments according to the determination of the intersection state, the tilt angle estimation unit excludes that straight-line component from the statistical processing.

4. A document reading system comprising: the image processing apparatus according to claim 1; and an image input apparatus having a camera unit that generates the captured image.