JP2015170907A

JP2015170907A - Scanner system, data processing method of scanner system, and program

Info

Publication number: JP2015170907A
Application number: JP2014042865A
Authority: JP
Inventors: 忠則中塚; Tadanori Nakatsuka
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2014-03-05
Filing date: 2014-03-05
Publication date: 2015-09-28

Abstract

PROBLEM TO BE SOLVED: To efficiently create electronic document data separating objects or electronic document data integrating objects in response to simultaneously imaging a plurality of mounted objects.SOLUTION: A scanner system includes: mounting means for mounting a plurality of objects; a camera which images the plurality of objects mounted in the mounting means and generates a camera image; and projection means for projecting a predetermined measurement pattern or a user interface on the mounting means. In accordance with an instruction to the user interface, a storage form is set which indicates whether the plurality of objects mounted in the mounting means are combined into one page and stored or the objects are separated and stored for the unit of a page. On the basis of the set storage form, processing for conversion into an electronic document in which the plurality of objects are combined into one page and processing for conversion into an electronic document in which the objects are separated are switched and controlled.

Description

本発明は、スキャナシステム、スキャナシステムのデータ処理方法、及びプログラムに関するものである。 The present invention relates to a scanner system, a data processing method for the scanner system, and a program.

従来、文書をスキャンして電子データとして保存する場合、撮像にラインセンサを用いるラインスキャナと、２次元の撮像センサを用いるカメラスキャナとがある。特に、書画台の上方にカメラを配置し、原稿を上向きに書画台に置いて撮像するカメラスキャナの場合には、１枚の原稿であれば置くだけで素早くスキャンすることができるとともに、本のように厚みのある原稿も容易に書画台に置いてスキャンすることができる。さらに、紙や本のような文書だけでなく、立体物を書画台上に置いて立体形状をスキャンするカメラスキャナが特許文献１で開示されている。特許文献１のカメラスキャナは、撮像するためのカメラとともに投光部を備え、投光部から投光する計測パターンをカメラで撮像し、三角測量の原理により立体形状を測定する。そして、書画台上の置かれた物体の立体形状を算出して平面原稿か書籍か立体物かを判別し、それぞれに応じて適切な撮影モードで撮影を行う。 Conventionally, when a document is scanned and stored as electronic data, there are a line scanner using a line sensor for imaging and a camera scanner using a two-dimensional imaging sensor. In particular, in the case of a camera scanner in which a camera is arranged above the document table and the document is placed on the document table with the document facing upward, a single document can be quickly scanned and the book can be scanned quickly. Thus, a thick original can be easily placed on the document table and scanned. Further, Patent Document 1 discloses a camera scanner that scans a three-dimensional shape by placing a three-dimensional object on a document table as well as a document such as paper or a book. The camera scanner of Patent Literature 1 includes a light projecting unit together with a camera for capturing an image, images a measurement pattern projected from the light projecting unit with the camera, and measures a three-dimensional shape based on the principle of triangulation. Then, the three-dimensional shape of the object placed on the document table is calculated to determine whether it is a flat manuscript, a book, or a three-dimensional object, and shooting is performed in an appropriate shooting mode according to each.

特開２００３−７８７２５号公報JP 2003-78725 A

しかしながら、特許文献１のカメラスキャナにおいては、書画台に複数の異なる種類の物体を置いた場合に適切な撮影モードで撮影することができず、１個ずつの撮影が必要となるため大量の物体の撮影を行う場合に手間と時間がかかった。また、複数の物体のデータをＰＤＦのようなドキュメントにまとめる際に同一ページ内に複数のデータを配置するためには一度パソコン等にデータを転送しＰＤＦの編集ソフトで個々のデータを配置することが必要で手間と時間がかかった。 However, in the camera scanner of Patent Document 1, when a plurality of different types of objects are placed on the document table, it is not possible to shoot in an appropriate shooting mode, and a large number of objects are required because each shooting is required. It took time and effort to shoot. In order to place a plurality of data in the same page when collecting data of a plurality of objects into a PDF-like document, the data is once transferred to a personal computer or the like and the individual data is arranged by a PDF editing software. It took time and effort.

本発明は、上記の課題を解決するためになされたもので、本発明の目的は、載置される複数のオブジェクトを同時に撮影することに応じて、それぞれのオブジェクトを分離した電子文書データ、あるいはそれぞれのオブジェクトを一体とする電子文書データを効率よく作成できる仕組みを提供することである。 The present invention has been made to solve the above-described problems, and an object of the present invention is to obtain electronic document data obtained by separating each object according to simultaneous shooting of a plurality of placed objects, or It is to provide a mechanism that can efficiently create electronic document data in which each object is integrated.

上記目的を達成する本発明のスキャナシステムは以下に示す構成を備える。
複数のオブジェクトを載置する載置手段と、前記載置手段に載置される複数のオブジェクトを撮影してカメラ画像を生成するカメラと、前記載置手段に載置される複数のオブジェクトに照射される赤外線パターンを撮影して得られる赤外線画像から距離画像を生成する第１の距離画像生成手段と、前記載置手段に所定の測定パターンまたはユーザインタフェースを投影する投影手段と、前記ユーザインタフェースに対する指示に従い前記載置手段に載置された複数のオブジェクトを１ページに合成して格納するか、それぞれのオブジェクトを分離して１ページずつ格納するかを示す格納形式を設定する設定手段と、前記設定手段により設定された格納形式に基づいて、複数のオブジェクトをページ合成した電子文書に変換する処理と、それぞれオブジェクトを分離した電子文書に変換する処理とを切り換え制御する制御手段と、を備えることを特徴とする。 The scanner system of the present invention that achieves the above object has the following configuration.
A mounting unit for mounting a plurality of objects, a camera for capturing a plurality of objects mounted on the mounting unit and generating a camera image, and irradiating the plurality of objects mounted on the mounting unit First distance image generation means for generating a distance image from an infrared image obtained by photographing the infrared pattern to be obtained, projection means for projecting a predetermined measurement pattern or user interface onto the placement means, and the user interface Setting means for setting a storage format indicating whether a plurality of objects placed on the placing means according to the instruction are combined and stored in one page, or each object is separated and stored one page; Based on the storage format set by the setting means, a process for converting a plurality of objects into a page-combined electronic document, and Characterized in that it comprises control means for controlling switching between processing for converting the electronic document to separate the objects, the.

本発明によれば、載置される複数のオブジェクトを同時に撮影することに応じて、それぞれのオブジェクトを分離した電子文書データ、あるいはそれぞれのオブジェクトを一体とする電子文書データを効率よく作成できる。 According to the present invention, electronic document data in which each object is separated or electronic document data in which each object is integrated can be efficiently created by simultaneously photographing a plurality of placed objects.

スキャナシステムの一例を示す図である。It is a figure which shows an example of a scanner system. 図１に示したカメラスキャナの構成例を示す図である。It is a figure which shows the structural example of the camera scanner shown in FIG. コントローラ部のハードウェア構成例を示すブロック図である。It is a block diagram which shows the hardware structural example of a controller part. カメラスキャナの制御用プログラムの機能構成を示す図である。It is a figure which shows the function structure of the program for control of a camera scanner. 距離画像取得部の処理を説明する図である。It is a figure explaining the process of a distance image acquisition part. ジェスチャー認識部の処理の詳細を説明するフローチャートである。It is a flowchart explaining the detail of the process of a gesture recognition part. 指先検出処理の方法を模式的に表した図である。It is the figure which represented typically the method of the fingertip detection process. 物体検知部の処理を説明するフローチャートである。It is a flowchart explaining the process of an object detection part. 平面原稿画像撮影部が実行する処理を示すフローチャートである。It is a flowchart which shows the process which a planar original image imaging part performs. 平面原稿画像撮影部の処理を説明するための模式図である。FIG. 6 is a schematic diagram for explaining processing of a flat document image photographing unit. 書籍画像撮影部が実行する処理を示すフローチャートである。It is a flowchart which shows the process which a book image imaging part performs. 書籍画像撮影部の処理を説明するための模式図である。It is a schematic diagram for demonstrating the process of a book image imaging | photography part. 立体形状測定部が実行する処理を説明するフローチャートである。It is a flowchart explaining the process which a solid shape measurement part performs. 立体形状測定部の処理を説明するための模式図である。It is a schematic diagram for demonstrating the process of a solid shape measurement part. スキャンアプリケーションの処理を示すフローチャートである。It is a flowchart which shows the process of a scan application. スキャンアプリケーションの処理を示すフローチャートである。It is a flowchart which shows the process of a scan application. スキャンアプリケーションの処理を示すフローチャートである。It is a flowchart which shows the process of a scan application. スキャナシステムで表示されるＵＩ画面例を示す図である。It is a figure which shows the example of UI screen displayed with a scanner system. スキャナシステムで表示されるＵＩ画面例を示す図である。It is a figure which shows the example of UI screen displayed with a scanner system. スキャナシステムで表示されるＵＩ画面例を示す図である。It is a figure which shows the example of UI screen displayed with a scanner system.

次に本発明を実施するための最良の形態について図面を参照して説明する。
＜システム構成の説明＞
〔第１実施形態〕 Next, the best mode for carrying out the present invention will be described with reference to the drawings.
<Description of system configuration>
[First Embodiment]

図１は、本実施形態を示すスキャナシステムの一例を示す図である。本例は、カメラスキャナ１０１はイーサネット（登録商標）等のネットワーク１０４にてホストコンピュータ１０２およびプリンタ１０３に接続されている例である。 FIG. 1 is a diagram illustrating an example of a scanner system according to the present embodiment. In this example, the camera scanner 101 is connected to the host computer 102 and the printer 103 via a network 104 such as Ethernet (registered trademark).

図１のネットワーク構成において、ホストコンピュータ１０２からの指示により、カメラスキャナ１０１から画像を読み取るスキャン機能や、スキャンデータをプリンタ１０３により出力するプリント機能の実行が可能である。また、ホストコンピュータ１０２を介さず、カメラスキャナ１０１への直接の指示により、スキャン機能、プリント機能の実行も可能である。 In the network configuration of FIG. 1, a scan function for reading an image from the camera scanner 101 and a print function for outputting scan data by the printer 103 can be executed by an instruction from the host computer 102. Further, it is possible to execute a scan function and a print function by direct instructions to the camera scanner 101 without using the host computer 102.

＜カメラスキャナの構成＞
図２は、図１に示したカメラスキャナ１０１の構成例を示す図である。
図２の（ａ）に示すように、カメラスキャナ１０１は、コントローラ部２０１、カメラ部２０２、腕部２０３、短焦点プロジェクタ部２０７、距離画像センサ部２０８を含む。カメラスキャナ１０１の本体であるコントローラ部２０１と、撮像を行うためのカメラ部２０２、短焦点プロジェクタ部２０７および距離画像センサ部２０８は、腕部２０３により連結されている。腕部２０３は関節を用いて曲げ伸ばしが可能である。 <Configuration of camera scanner>
FIG. 2 is a diagram showing a configuration example of the camera scanner 101 shown in FIG.
As shown in FIG. 2A, the camera scanner 101 includes a controller unit 201, a camera unit 202, an arm unit 203, a short focus projector unit 207, and a distance image sensor unit 208. The controller unit 201 that is the main body of the camera scanner 101, the camera unit 202 for performing imaging, the short focus projector unit 207, and the distance image sensor unit 208 are connected by an arm unit 203. The arm portion 203 can be bent and stretched using a joint.

図２の（ａ）には、カメラスキャナ１０１が設置されている書画台２０４も示している。カメラ部２０２および距離画像センサ部２０８のレンズは書画台２０４方向に向けられており、破線で囲まれた読み取り領域２０５内の画像を読み取り可能である。
図２の例では、原稿２０６は読み取り領域２０５内に置かれているので、カメラスキャナ１０１に読み取り可能となっている。また、書画台２０４内にはターンテーブル２０９が設けられている。ターンテーブル２０９はコントローラ部２０１からの指示によって回転することが可能であり、ターンテーブル２０９上に置かれた物体（オブジェクト）とカメラ部２０２との角度を変えることができる。ここで、オブジェクトとには、平面オブジェクト（２次元オブジェクト）と立体オブジェクト（３次元オブジェクト）が含まれる。
カメラ部２０２は単一解像度で画像を撮像するものとしてもよいが、高解像度画像撮像と低解像度画像撮像が可能なものとすることが好ましい。
なお、図２に示されていないが、カメラスキャナ１０１は、ＬＣＤタッチパネル３３０およびスピーカ３４０をさらに含むこともできる。 FIG. 2A also shows a document table 204 on which the camera scanner 101 is installed. The lenses of the camera unit 202 and the distance image sensor unit 208 are directed toward the document table 204, and can read an image in the reading region 205 surrounded by a broken line.
In the example of FIG. 2, the document 206 is placed in the reading area 205 and can be read by the camera scanner 101. A turntable 209 is provided in the document table 204. The turntable 209 can be rotated by an instruction from the controller unit 201, and the angle between an object (object) placed on the turntable 209 and the camera unit 202 can be changed. Here, the object includes a planar object (two-dimensional object) and a three-dimensional object (three-dimensional object).
The camera unit 202 may capture an image with a single resolution, but it is preferable that the camera unit 202 can capture a high-resolution image and a low-resolution image.
Although not shown in FIG. 2, the camera scanner 101 can further include an LCD touch panel 330 and a speaker 340.

図２の（ｂ）は、カメラスキャナ１０１における座標系について表している。
カメラスキャナ１０１において取得される撮影データは、距離画像センサ部２０８により得られる距離画像センサ座標系における３次元データと、カメラ部２０２と短焦点プロジェクタ部２０７によって得られるカメラ座標系における３次元データである。ここで、距離画像センサ座標系およびカメラ座標系では、距離画像センサ部２０８のＲＧＢカメラ部５０３およびカメラ部２０２が撮像する画像平面をＸＹ平面とし、画像平面に直交した方向をＺ方向として定義するものである。
すなわち、これらの３次元データは独立した座標系で得られるものであるため、両者を統一的に扱えるようにする必要がある。そこで、書画台２０４を含む平面をＸＹ平面とし、このＸＹ平面から上方に垂直な向きをＺ軸とする直交座標系を定義する。
そして、距離画像センサ座標系およびカメラ座標系で得られたそれぞれの３次元データを、所定の演算処理、具体的には直交座標系における３次元データに変換する。例えば、距離画像センサ座標系中の３次元点は下記（１）式によって直交座標系における３次元座標へと変換する。
数式１

FIG. 2B shows a coordinate system in the camera scanner 101.
Shooting data acquired by the camera scanner 101 is three-dimensional data in the distance image sensor coordinate system obtained by the distance image sensor unit 208, and three-dimensional data in the camera coordinate system obtained by the camera unit 202 and the short focus projector unit 207. is there. Here, in the distance image sensor coordinate system and the camera coordinate system, an image plane captured by the RGB camera unit 503 and the camera unit 202 of the distance image sensor unit 208 is defined as an XY plane, and a direction orthogonal to the image plane is defined as a Z direction. Is.
That is, since these three-dimensional data are obtained by independent coordinate systems, it is necessary to be able to handle both in a unified manner. Therefore, an orthogonal coordinate system is defined in which a plane including the document table 204 is an XY plane, and a direction perpendicular to the XY plane is a Z axis.
Then, the respective three-dimensional data obtained in the distance image sensor coordinate system and the camera coordinate system are converted into predetermined arithmetic processing, specifically, three-dimensional data in an orthogonal coordinate system. For example, a three-dimensional point in the distance image sensor coordinate system is converted into a three-dimensional coordinate in the orthogonal coordinate system by the following equation (1).
Formula 1

ここで、［Ｘｓ，Ｙｓ，Ｚｓ，１］は距離画像センサ座標系における３次元座標、［Ｘ，Ｙ，Ｚ］は座標変換後の直交座標系における３次元座標を表す。また、［Ｒｓ｜ｔｓ］は距離画像センサ座標系から直交座標系への変換行列（回転行列Ｒｓと並進ベクトルｔｓ）である。
なお、直交座標系に対する距離画像センサ座標系およびカメラ座標系の相対位置（直交座標系へ変換するための回転行列および並進ベクトル）は、公知のキャリブレーション手法によりあらかじめキャリブレーションされているものとする。以後、特に断りがなく３次元点群と表記された場合は、直交座標系における３次元データを表しているものとする。 Here, [Xs, Ys, Zs, 1] represents three-dimensional coordinates in the distance image sensor coordinate system, and [X, Y, Z] represents three-dimensional coordinates in the orthogonal coordinate system after coordinate conversion. [Rs | ts] is a transformation matrix (rotation matrix Rs and translation vector ts) from the distance image sensor coordinate system to the orthogonal coordinate system.
Note that the relative positions of the distance image sensor coordinate system and the camera coordinate system (rotation matrix and translation vector for conversion to the orthogonal coordinate system) with respect to the orthogonal coordinate system are calibrated in advance by a known calibration method. . Hereinafter, when there is no particular notice and it is expressed as a three-dimensional point group, it represents three-dimensional data in an orthogonal coordinate system.

＜カメラスキャナのコントローラのハードウェア構成＞
図３は、図２に示したコントローラ部２０１のハードウェア構成例を示すブロック図である。 <Hardware configuration of camera scanner controller>
FIG. 3 is a block diagram illustrating a hardware configuration example of the controller unit 201 illustrated in FIG. 2.

図３に示すように、コントローラ部２０１は、システムバス３０１に接続されたＣＰＵ３０２、ＲＡＭ３０３、ＲＯＭ３０４、ＨＤＤ３０５、ネットワークＩ／Ｆ３０６、画像処理プロセッサ３０７、カメラＩ／Ｆ３０８、ディスプレイコントローラ３０９、シリアルＩ／Ｆ３１０、オーディオコントローラ３１１およびＵＳＢコントローラ３１２を含む。 As shown in FIG. 3, the controller unit 201 includes a CPU 302, a RAM 303, a ROM 304, an HDD 305, a network I / F 306, an image processor 307, a camera I / F 308, a display controller 309, and a serial I / F 310 connected to the system bus 301. Audio controller 311 and USB controller 312.

ＣＰＵ３０２は、コントローラ部２０１全体の動作を制御する中央演算装置である。ＲＡＭ３０３は、揮発性メモリである。ＲＯＭ３０４は不揮発性メモリであり、ＣＰＵ３０２の起動用プログラムが格納されている。ＨＤＤ３０５はＲＡＭ３０３と比較して大容量なハードディスクドライブ（ＨＤＤ）である。ＨＤＤ３０５にはコントローラ部２０１の実行する、カメラスキャナ１０１の制御用プログラムが格納されている。 The CPU 302 is a central processing unit that controls the operation of the entire controller unit 201. The RAM 303 is a volatile memory. A ROM 304 is a non-volatile memory, and stores a startup program for the CPU 302. The HDD 305 is a hard disk drive (HDD) having a larger capacity than the RAM 303. The HDD 305 stores a control program for the camera scanner 101 executed by the controller unit 201.

ＣＰＵ３０２は、電源ＯＮ等の起動時、ＲＯＭ３０４に格納されている起動用プログラムを実行する。この起動用プログラムは、ＨＤＤ３０５に格納されている制御用プログラムを読み出し、ＲＡＭ３０３上に展開するためのものである。ＣＰＵ３０２は、起動用プログラムを実行すると、続けてＲＡＭ３０３上に展開した制御用プログラムを実行し、制御を行う。
また、ＣＰＵ３０２は制御用プログラムによる動作に用いるデータもＲＡＭ３０３上に格納して読み書きを行う。ＨＤＤ３０５上にはさらに、制御用プログラムによる動作に必要な各種設定や、また、カメラ入力によって生成した画像データを格納することができ、ＣＰＵ３０２によって読み書きされる。ＣＰＵ３０２はネットワークＩ／Ｆ３０６を介してネットワーク１０４上の他の機器との通信を行う。 The CPU 302 executes a startup program stored in the ROM 304 when the power is turned on or the like. This activation program is for reading a control program stored in the HDD 305 and developing it on the RAM 303. When executing the startup program, the CPU 302 executes the control program developed on the RAM 303 and performs control.
Further, the CPU 302 also stores data used for the operation by the control program on the RAM 303 to read / write. Further, various settings necessary for operation by the control program and image data generated by camera input can be stored on the HDD 305 and read / written by the CPU 302. The CPU 302 communicates with other devices on the network 104 via the network I / F 306.

画像処理プロセッサ３０７はＲＡＭ３０３に格納された画像データを読み出して処理し、またＲＡＭ３０３へ書き戻す。なお、画像処理プロセッサ３０７が実行する画像処理は、回転、変倍、色変換等である。 The image processor 307 reads and processes the image data stored in the RAM 303 and writes it back to the RAM 303. Note that image processing executed by the image processor 307 includes rotation, scaling, color conversion, and the like.

カメラＩ／Ｆ３０８は、カメラ部２０２および距離画像センサ部２０８と接続され、ＣＰＵ３０２からの指示に応じてカメラ部２０２で撮影された画像データを、距離画像センサ部２０８で撮影された距離画像データを取得してＲＡＭ３０３へ書き込む。また、ＣＰＵ３０２からの制御コマンドをカメラ部２０２および距離画像センサ部２０８へ送信し、カメラ部２０２および距離画像センサ部２０８の設定を行う。 The camera I / F 308 is connected to the camera unit 202 and the distance image sensor unit 208 and receives image data captured by the camera unit 202 in accordance with an instruction from the CPU 302 and distance image data captured by the distance image sensor unit 208. Obtain and write to RAM 303. In addition, a control command from the CPU 302 is transmitted to the camera unit 202 and the distance image sensor unit 208 to set the camera unit 202 and the distance image sensor unit 208.

また、コントローラ部２０１は、ディスプレイコントローラ３０９、シリアルＩ／Ｆ３１０、オーディオコントローラ３１１およびＵＳＢコントローラ３１２のうち少なくとも１つをさらに含むことができる。 The controller unit 201 can further include at least one of a display controller 309, a serial I / F 310, an audio controller 311, and a USB controller 312.

ディスプレイコントローラ３０９は、ＣＰＵ３０２の指示に応じてディスプレイへの画像データの表示を制御する。ここでは、ディスプレイコントローラ３０９は短焦点プロジェクタ部２０７およびＬＣＤタッチパネル３３０に接続されている。 A display controller 309 controls display of image data on the display in accordance with an instruction from the CPU 302. Here, the display controller 309 is connected to the short focus projector unit 207 and the LCD touch panel 330.

シリアルＩ／Ｆ３１０はシリアル信号の入出力を行う。ここでは、シリアルＩ／Ｆ３１０はターンテーブル２０９に接続され、ＣＰＵ３０２の回転開始・終了および回転角度の指示をターンテーブル２０９へ送信する。また、シリアルＩ／Ｆ３１０は、ＬＣＤタッチパネル３３０に接続され、ＣＰＵ３０２はＬＣＤタッチパネル３３０が押下されたときに、シリアルＩ／Ｆ３１０を介して押下された座標を取得する。
オーディオコントローラ３１１は、スピーカ３４０に接続され、ＣＰＵ３０２の指示に応じて音声データをアナログ音声信号に変換し、スピーカ３４０を通じて音声を出力する。 The serial I / F 310 inputs and outputs serial signals. Here, the serial I / F 310 is connected to the turntable 209, and transmits an instruction of the rotation start / end and rotation angle of the CPU 302 to the turntable 209. In addition, the serial I / F 310 is connected to the LCD touch panel 330, and the CPU 302 acquires the coordinates pressed via the serial I / F 310 when the LCD touch panel 330 is pressed.
The audio controller 311 is connected to the speaker 340, converts audio data into an analog audio signal in accordance with an instruction from the CPU 302, and outputs audio through the speaker 340.

ＵＳＢコントローラ３１２は、ＣＰＵ３０２の指示に応じて外付けのＵＳＢデバイスの制御を行う。ここでは、ＵＳＢコントローラ３１２はＵＳＢメモリやＳＤカードなどの外部メモリ３５０に接続され、外部メモリ３５０へのデータの読み書きを行う。なお、距離画像センサ部２０８の詳細構成については、後述する。
＜カメラスキャナの制御用プログラムの機能構成＞
図４は、図３に示したＣＰＵ３０２が実行するカメラスキャナ１０１の制御用プログラムの機能構成４０１を示す図である。
なお、カメラスキャナ１０１の制御用プログラムは、前述のようにＨＤＤ３０５に格納され、ＣＰＵ３０２が起動時にＲＡＭ３０３上に展開して実行する。
メイン制御部４０２は制御の中心であり、機能構成４０１内の他の各部を制御する。 The USB controller 312 controls an external USB device in accordance with an instruction from the CPU 302. Here, the USB controller 312 is connected to an external memory 350 such as a USB memory or an SD card, and reads / writes data from / to the external memory 350. The detailed configuration of the distance image sensor unit 208 will be described later.
<Functional structure of camera scanner control program>
FIG. 4 is a diagram showing a functional configuration 401 of a control program for the camera scanner 101 executed by the CPU 302 shown in FIG.
Note that the control program for the camera scanner 101 is stored in the HDD 305 as described above, and the CPU 302 develops and executes it on the RAM 303 at startup.
The main control unit 402 is the center of control and controls other units in the functional configuration 401.

カメラ画像取得部４０７、距離画像取得部４０８は、画像入力処理を行うモジュールである。カメラ画像取得部４０７は、カメラＩ／Ｆ３０８を介してカメラ部２０２が撮影した画像データを取得し、ＲＡＭ３０３へ格納する。距離画像取得部４０８は、カメラＩ／Ｆ３０８を介して距離センサ部２０８が出力する距離画像データを取得し、ＲＡＭ３０３へ格納する。距離画像取得部４０８の処理の詳細は後に説明する。 The camera image acquisition unit 407 and the distance image acquisition unit 408 are modules that perform image input processing. The camera image acquisition unit 407 acquires image data captured by the camera unit 202 via the camera I / F 308 and stores it in the RAM 303. The distance image acquisition unit 408 acquires the distance image data output from the distance sensor unit 208 via the camera I / F 308 and stores it in the RAM 303. Details of the processing of the distance image acquisition unit 408 will be described later.

ジェスチャー認識部４０９、物体検知部４１０はカメラ画像取得部４０７、距離画像取得部４０８が取得する画像データを解析して書画台２０４上の物体の動きを検知して認識するモジュールである。ジェスチャー認識部４０９、物体検知部４１０の処理の詳細は後述する。ここで、物体の動きには、投影されるユーザインタフェースを操作するユーザの手による指示動作が含まれる（詳細は後述する） A gesture recognition unit 409 and an object detection unit 410 are modules that analyze image data acquired by the camera image acquisition unit 407 and the distance image acquisition unit 408 to detect and recognize the movement of an object on the document table 204. Details of the processes of the gesture recognition unit 409 and the object detection unit 410 will be described later. Here, the movement of the object includes an instruction operation by a user's hand operating the projected user interface (details will be described later).

平面原稿画像撮影部４１１、書籍画像撮影部４１２、立体形状測定部４１３は実際に対象物のスキャンを行うモジュールである。平面原稿画像撮影部４１１は平面原稿、書籍画像撮影部４１２は書籍、立体形状測定部４１３は立体物に、それぞれ適した演算処理を実行し、それぞれに応じた形式のデータを出力する。これらのモジュールの処理の詳細は後述する。 The planar document image photographing unit 411, the book image photographing unit 412, and the three-dimensional shape measuring unit 413 are modules that actually scan an object. The flat original image photographing unit 411 executes suitable arithmetic processing for the flat original, the book image photographing unit 412 for the book, and the three-dimensional shape measuring unit 413 for the three-dimensional object, and outputs data in a format corresponding to each. Details of the processing of these modules will be described later.

ユーザインタフェース部４０３は、物体配置表示部４１４と物体配置修正部４１５から構成される。物体配置表示部４１４は、メイン制御部４０２からの要求を受け、メッセージやボタン、ドキュメントや対象物の大きさや位置を示す枠等のＧＵＩ部品を生成する。
そして、ユーザインタフェース部４０３は、表示部４０６へ生成したＧＵＩ部品の表示を要求する。なお、物体配置修正部４１５はドキュメントの用紙サイズや対象物の表示大きさや位置の変更要望を受け取りデータ管理部４０５にデータを送り保存する。表示部４０６は、ディスプレイコントローラ３０９を介して、短焦点プロジェクタ部２０７もしくはＬＣＤタッチパネル３３０へ要求されたＧＵＩ部品の表示を行う。
プロジェクタ部２０７は書画台２０４に向けて設置されているため、書画台２０４上にＧＵＩ部品を投射することが可能となっている。また、ユーザインタフェース部４０３は、ジェスチャー認識部４０９が認識したタッチ等のジェスチャー操作、あるいはシリアルＩ／Ｆ３１０を介したＬＣＤタッチパネル３３０からの入力操作、そしてさらにそれらの座標を受信する。そして、ユーザインタフェース部４０３は描画中の操作画面の内容と操作座標を対応させて操作内容（押下されたボタン等）を判定する。この操作内容をメイン制御部４０２へ通知することにより、操作者の操作を受け付ける。
ネットワーク通信部４０４は、ネットワークＩ／Ｆ３０６を介して、ネットワーク１０４上の他の機器とＴＣＰ／ＩＰによる通信を行う。 The user interface unit 403 includes an object arrangement display unit 414 and an object arrangement correction unit 415. In response to a request from the main control unit 402, the object arrangement display unit 414 generates a GUI component such as a message, a button, a frame indicating the size or position of a document or an object.
Then, the user interface unit 403 requests the display unit 406 to display the generated GUI component. The object arrangement correcting unit 415 receives a request for changing the paper size of the document or the display size or position of the object, and sends the data to the data management unit 405 for storage. The display unit 406 displays the requested GUI component on the short focus projector unit 207 or the LCD touch panel 330 via the display controller 309.
Since the projector unit 207 is installed toward the document table 204, it is possible to project GUI parts on the document table 204. In addition, the user interface unit 403 receives a gesture operation such as a touch recognized by the gesture recognition unit 409, an input operation from the LCD touch panel 330 via the serial I / F 310, and coordinates thereof. Then, the user interface unit 403 determines the operation content (such as a pressed button) by associating the content of the operation screen being drawn with the operation coordinates. By notifying the main control unit 402 of this operation content, the operator's operation is accepted.
The network communication unit 404 communicates with other devices on the network 104 by TCP / IP via the network I / F 306.

データ管理部４０５は、機能構成４０１に含まれるプログラムの実行において生成した作業データ、例えば平面原稿画像撮影部４１１、書籍画像撮影部４１２、立体形状測定部４１３が生成したスキャンデータのような様々なデータや物体配置修正部４１５が生成したドキュメントや対象物の大きさや位置情報などをＨＤＤ３０５上の所定の領域へ保存し、管理する。また、対象物のデータをＰＤＦのように一つのドキュメントにまとめて保存する。
＜距離画像センサおよび距離画像取得部の説明＞ The data management unit 405 includes various kinds of work data generated in executing the program included in the functional configuration 401, such as scan data generated by the flat document image photographing unit 411, the book image photographing unit 412, and the three-dimensional shape measuring unit 413. The data, the document generated by the object arrangement correcting unit 415, the size and position information of the object, and the like are stored in a predetermined area on the HDD 305 and managed. In addition, the object data is collectively stored in one document like PDF.
<Description of Distance Image Sensor and Distance Image Acquisition Unit>

図３に本実施形態の距離画像生成手段に対応する距離画像センサ部２０８の構成を示している。距離画像センサ部２０８は赤外線によるパターン投射方式の距離画像センサである。赤外線パターン投射部３６１は対象物に、人の目には不可視である赤外線によって３次元測定パターンを照射する。赤外線カメラ３６２は対象物に投射した３次元測定パターン（赤外線画像）を撮影するカメラである。ＲＧＢカメラ３６３は人の目に見える可視光をＲＧＢ信号で撮影するカメラである。 FIG. 3 shows the configuration of the distance image sensor unit 208 corresponding to the distance image generation means of the present embodiment. The distance image sensor unit 208 is a pattern image type distance image sensor using infrared rays. The infrared pattern projection unit 361 irradiates the object with a three-dimensional measurement pattern with infrared rays that are invisible to human eyes. The infrared camera 362 is a camera that captures a three-dimensional measurement pattern (infrared image) projected on an object. The RGB camera 363 is a camera that captures visible light visible to the human eye using RGB signals.

図５は、図４に示した距離画像取得部４０８の処理を説明する図である。特に、図５の（ａ）は図４に示した距離画像取得部４０８のデータ処理手順に対応するフローチャートである。以下、図５の（ｂ）〜（ｄ）はパターン投射方式による距離画像の計測原理を説明する。また、図５の（ａ）に示すフローチャートの各ステップは、ＣＰＵ３０２が制御プログラム（図４に示すモジュール）を実行することで実現される。以下、図４に示したモジュールを主体として説明する。
距離画像取得部４０８が処理を開始すると、Ｓ５０１では図５の（ｂ）に示すように赤外線パターン投射部３６１を用いて赤外線による３次元測定パターン５２２を対象物５２１に投射する。Ｓ５０２では赤外線カメラ３６２を用いてＳ５０１で投射した３次元測定パターン５２２を撮影し、赤外線カメラ画像５２４を取得する。 FIG. 5 is a diagram illustrating the processing of the distance image acquisition unit 408 illustrated in FIG. 5A is a flowchart corresponding to the data processing procedure of the distance image acquisition unit 408 shown in FIG. In the following, FIGS. 5B to 5D explain the distance image measurement principle by the pattern projection method. Each step of the flowchart shown in FIG. 5A is realized by the CPU 302 executing a control program (module shown in FIG. 4). Hereinafter, the module shown in FIG. 4 will be mainly described.
When the distance image acquisition unit 408 starts processing, in S501, the infrared pattern projection unit 361 is used to project a three-dimensional measurement pattern 522 using infrared rays onto the object 521 as shown in FIG. In S502, the infrared camera 362 is used to photograph the three-dimensional measurement pattern 522 projected in S501, and an infrared camera image 524 is acquired.

Ｓ５０３では、図５の（ｃ）に示すように、３次元測定パターン５２２と赤外線カメラ画像５２４間での対応点を抽出する。例えば、赤外線カメラ画像５２４上の１点を３次元測定パターン５２２上から探索し、同一の点が検出された場合に対応付けを行う。あるいは、赤外線カメラ画像５２４の画素の周辺のパターンを３次元測定パターン５２２上から探索し、一番類似度が高い部分と対応付けてもよい。 In step S503, as shown in FIG. 5C, corresponding points between the three-dimensional measurement pattern 522 and the infrared camera image 524 are extracted. For example, one point on the infrared camera image 524 is searched from the three-dimensional measurement pattern 522, and association is performed when the same point is detected. Alternatively, a pattern around the pixel of the infrared camera image 524 may be searched from the three-dimensional measurement pattern 522 and associated with a portion having the highest similarity.

Ｓ５０４では、赤外線パターン投射部３６１と赤外線カメラ３６２を結ぶ直線を基線５２３として三角測量の原理を用いて計算を行うことにより、赤外線カメラ３６２からの距離を算出する。
Ｓ５０３で対応付けが出来た画素については、赤外線カメラ３６２からの距離を算出して画素値として保存し、対応付けが出来なかった画素については、距離の計測が出来なかった部分として無効値を保存する。これを赤外線カメラ画像の全画素に対して行うことで、各画素に距離値が入った距離画像を生成する。 In S504, the distance from the infrared camera 362 is calculated by performing calculation using the principle of triangulation with the straight line connecting the infrared pattern projection unit 361 and the infrared camera 362 as the base line 523.
For pixels that can be correlated in S503, the distance from the infrared camera 362 is calculated and stored as a pixel value, and for pixels that cannot be correlated, an invalid value is stored as the portion where the distance could not be measured. To do. By performing this for all the pixels of the infrared camera image, a distance image in which each pixel has a distance value is generated.

Ｓ５０５では、ＲＧＢカメラ３６３を用いて対象物のＲＧＢ画像５２５を撮影する。赤外線カメラ３６２とＲＧＢカメラ３６３とでは設置位置が異なるため、図５の（ｄ）に示すようにそれぞれで撮影される２つの赤外線カメラ画像５２４および赤外線カメラ画像５２５の位置合わせを行う必要がある。
そこで、Ｓ５０６では、赤外線カメラ３６２の座標系からＲＧＢカメラ３６３の座標系への座標系変換を用いて赤外線カメラ画像５２４を変換し、距離画像をＲＧＢカメラ画像５２５の座標系に合わせる。 In step S 505, the RGB image 525 of the object is captured using the RGB camera 363. Since the installation positions of the infrared camera 362 and the RGB camera 363 are different, as shown in FIG. 5D, it is necessary to align the two infrared camera images 524 and the infrared camera image 525 that are respectively captured.
Therefore, in S506, the infrared camera image 524 is converted using coordinate system conversion from the coordinate system of the infrared camera 362 to the coordinate system of the RGB camera 363, and the distance image is matched with the coordinate system of the RGB camera image 525.

なお、赤外線カメラ３６２とＲＧＢカメラ３６３の相対位置や、それぞれの内部パラメータは事前のキャリブレーション処理により既知であるとする。Ｓ５０７では、Ｓ５０６で座標変換を行った距離画像の各画素にＲＧＢカメラ画像５２５のＲＧＢ値を保存することにより、１画素につきＲ、Ｇ、Ｂ、距離の４つの値を持つ距離画像を生成し、距離画像取得部４０８の処理を終了する。 It is assumed that the relative positions of the infrared camera 362 and the RGB camera 363 and the respective internal parameters are known by a prior calibration process. In S507, the RGB value of the RGB camera image 525 is stored in each pixel of the distance image subjected to coordinate conversion in S506, thereby generating a distance image having four values of R, G, B, and distance for each pixel. Then, the processing of the distance image acquisition unit 408 ends.

以上のように、ここで取得した距離画像は距離画像センサ部２０８のＲＧＢカメラ３６３で定義された距離画像センサ座標系が基準となっている。そこで、図２（ｂ）を用いて上述したように、距離画像センサ座標系として得られた距離データを直交座標系における点群の集合に変換する。（以後、この点の集合を３次元点群と呼ぶ） As described above, the distance image acquired here is based on the distance image sensor coordinate system defined by the RGB camera 363 of the distance image sensor unit 208. Therefore, as described above with reference to FIG. 2B, the distance data obtained as the distance image sensor coordinate system is converted into a set of point groups in the orthogonal coordinate system. (Hereafter, this set of points is called a three-dimensional point group)

なお、本実施形態では上述したように、距離画像センサ部２０８として赤外線パターン投射方式を採用しているが、他の方式の距離画像センサを用いることも可能である。例えば、２つのＲＧＢカメラでステレオ立体視を行うステレオ方式や、レーザー光の飛行時間を検出することで距離を測定するＴＯＦ（ＴｉｍｅｏｆＦｌｉｇｈｔ）方式を用いても良い。 In the present embodiment, as described above, an infrared pattern projection method is employed as the distance image sensor unit 208, but a distance image sensor of another method can also be used. For example, a stereo system that performs stereo stereoscopic vision with two RGB cameras, or a TOF (Time of Flight) system that measures distance by detecting the flight time of laser light may be used.

＜ジェスチャー認識部の説明＞
図６は、図４に示したジェスチャー認識部４０９のデータ処理の詳細を説明するフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
図６の（ａ）において、ジェスチャー認識部４０９が処理を開始すると、Ｓ６０１では初期化処理を行う。続いて、Ｓ６０２では、書画台２０４上に存在する物体の３次元点群を取得する。Ｓ６０３では取得した３次元点群からユーザの手の形状および指先の検出処理を行う。Ｓ６０４では検出した手の形状および指先からジェスチャーの判定を行う。Ｓ６０５では判定したジェスチャーをユーザインタフェース部４０３へ通知し、Ｓ６０２へ戻ってジェスチャー認識処理を繰り返す。 <Description of gesture recognition unit>
FIG. 6 is a flowchart illustrating details of data processing of the gesture recognition unit 409 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
In FIG. 6A, when the gesture recognition unit 409 starts processing, initialization processing is performed in S601. In step S602, a three-dimensional point group of an object existing on the document table 204 is acquired. In step S603, the user's hand shape and fingertip are detected from the acquired three-dimensional point group. In S604, a gesture is determined from the detected hand shape and fingertip. In S605, the determined gesture is notified to the user interface unit 403, and the process returns to S602 to repeat the gesture recognition process.

続いて、Ｓ６０１〜Ｓ６０４のデータ処理の詳細について説明する。
図６の（ｂ）は、図６の（ａ）に示したＳ６０１のジェスチャー認識部初期化処理のフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
Ｓ６１１では、ジェスチャー認識部４０９は距離画像取得部４０８から距離画像を１フレーム取得する。ここで、ジェスチャー認識部の開始時は書画台２０４上に対象物が置かれていない状態であるため、初期状態として書画台２０４の平面の認識を行う。つまり、Ｓ６１２において、Ｓ６１１で取得した距離画像から最も広い平面を抽出し、その位置と法線ベクトル（以降、書画台２０４の平面パラメータと呼ぶ）を算出する。 Next, details of the data processing in S601 to S604 will be described.
FIG. 6B is a flowchart of the gesture recognition unit initialization process in S601 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
In step S 611, the gesture recognition unit 409 acquires a distance image from the distance image acquisition unit 408. Here, since the object is not placed on the document table 204 at the start of the gesture recognition unit, the plane of the document table 204 is recognized as an initial state. That is, in S612, the widest plane is extracted from the distance image acquired in S611, and the position and normal vector (hereinafter referred to as plane parameters of the document stage 204) are calculated.

図６の（ｃ）は、図６の（ａ）に示したＳ６０２の３次元点群取得処理のフローチャートである。
Ｓ６２１では距離画像取得部４０８から距離画像を１フレーム取得し、所定の演算処理により３次元点群に変換する。Ｓ６２２では書画台２０４の平面パラメータを用いて、取得した３次元点群から書画台２０４を含む平面にある点群を除去する。このようにして書画台２０４の上に存在する物体の３次元点群を取得し、Ｓ６０２の３次元点群取得処理を終了する。 FIG. 6C is a flowchart of the three-dimensional point group acquisition process of S602 shown in FIG.
In S621, one frame of the distance image is acquired from the distance image acquisition unit 408, and is converted into a three-dimensional point group by a predetermined calculation process. In step S622, using the plane parameter of the document table 204, the point group on the plane including the document table 204 is removed from the acquired three-dimensional point group. In this way, the three-dimensional point group of the object existing on the document table 204 is acquired, and the three-dimensional point group acquisition process of S602 is ended.

図６の（ｄ）は、図６の（ａ）に示したＳ６０３の手形状および指先検出処理のフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。また、図７は、図６の（ｄ）での指先検出処理の方法を模式的に表した図である。
Ｓ６３１では、Ｓ６０２で取得した３次元点群から、書画台２０４を含む平面から所定の高さ以上にある、肌色の３次元点群を抽出することで、手の３次元点群を得る。
図７の（ａ）の７０１は、抽出した手の３次元点群を表している。Ｓ６３２では、抽出した手の３次元点群を、書画台２０４の平面に射影した２次元画像を生成して、その手の外形を検出する。
図７の（ａ）の７０２は、書画台２０４の平面に投影した３次元点群を表している。投影は、点群の各座標を、書画台２０４の平面パラメータを用いて投影すればよい。
また、図７の（ｂ）に示すように、投影した３次元点群から、ｘｙ座標の値だけを取り出せば、ｚ軸方向から見た２次元画像７０３として扱うことができる。この時、手の３次元点群の各点が、書画台２０４の平面に投影した２次元画像の各座標のどれに対応するかを、記憶しておくものとする。Ｓ６３３では検出した手の外形上の各点について、その点での外形の曲率を算出し、算出した曲率が所定値より小さい点を指先として検出する。
図７の（ｃ）は、外形の曲率から指先を検出する方法を模式的に表したものである。７０４は、書画台２０４の平面に投影された２次元画像７０３の外形を表す点の一部を表している。
ここで、７０４のような、外形を表す点のうち、隣り合う５個の点を含むように円を描くことを考える。円７０５、７０７が、その例である。この円を、全ての外形の点に対して順に描き、その直径（例えば７０６、７０８）が所定の値より小さい（曲率が小さい）ことを以て、指先とする。
この例では隣り合う５個の点としたが、その数は限定されるものではない。また、ここでは曲率を用いたが、外形に対して楕円フィッティングを行うことで、指先を検出してもよい。
Ｓ６３４では、検出した指先の個数および各指先の座標を算出して、手形状および指先の検出処理を終了する。この時、前述したように、書画台２０４に投影した２次元画像の各点と、手の３次元点群の各点の対応関係を記憶しているため、各指先の３次元座標を得ることができる。
今回は、３次元点群から２次元画像に投影した画像から指先を検出する方法を説明したが、指先検出の対象とする画像は、これに限定されるものではない。例えば、距離画像の背景差分や、ＲＧＢ画像の肌色領域から手の領域を抽出し、上に述べたのと同様の方法（外形の曲率計算等）で、手領域のうちの指先を検出してもよい。この場合、検出した指先の座標はＲＧＢ画像や距離画像といった、２次元画像上の座標であるため、その座標における距離画像の距離情報を用いて、直交座標系の３次元座標に変換する必要がある。この時、指先点となる外形上の点ではなく、指先を検出するときに用いた、曲率円の中心を指先点としてもよい。 FIG. 6D is a flowchart of the hand shape and fingertip detection process in S603 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4). FIG. 7 is a diagram schematically illustrating the fingertip detection processing method in FIG.
In S631, a three-dimensional point cloud of the hand is obtained by extracting a three-dimensional point cloud of skin color that is higher than a predetermined height from the plane including the document table 204 from the three-dimensional point cloud acquired in S602.
Reference numeral 701 in FIG. 7A represents a three-dimensional point group of the extracted hand. In S632, a two-dimensional image obtained by projecting the extracted three-dimensional point group of the hand onto the plane of the document table 204 is generated, and the outline of the hand is detected.
Reference numeral 702 in FIG. 7A represents a three-dimensional point group projected on the plane of the document table 204. The projection may be performed by projecting the coordinates of the point group using the plane parameters of the document table 204.
Further, as shown in FIG. 7B, if only the value of the xy coordinate is extracted from the projected three-dimensional point group, it can be handled as a two-dimensional image 703 viewed from the z-axis direction. At this time, it is assumed that each point of the three-dimensional point group of the hand corresponds to which coordinate of the two-dimensional image projected on the plane of the document table 204. In S633, for each point on the detected outer shape of the hand, the curvature of the outer shape at that point is calculated, and a point where the calculated curvature is smaller than a predetermined value is detected as a fingertip.
FIG. 7C schematically shows a method of detecting the fingertip from the curvature of the outer shape. Reference numeral 704 denotes a part of a point representing the outer shape of the two-dimensional image 703 projected onto the plane of the document table 204.
Here, it is considered to draw a circle so as to include five adjacent points among the points representing the outer shape such as 704. Circles 705 and 707 are examples thereof. This circle is drawn in order with respect to all the points of the outer shape, and the diameter (for example, 706, 708) is smaller than a predetermined value (the curvature is small), and is used as a fingertip.
In this example, five points are adjacent to each other, but the number is not limited. In addition, the curvature is used here, but the fingertip may be detected by performing elliptic fitting on the outer shape.
In step S634, the number of detected fingertips and the coordinates of each fingertip are calculated, and the hand shape and fingertip detection process ends. At this time, as described above, since the correspondence between each point of the two-dimensional image projected on the document table 204 and each point of the three-dimensional point group of the hand is stored, the three-dimensional coordinates of each fingertip can be obtained. Can do.
This time, a method of detecting a fingertip from an image projected from a three-dimensional point group onto a two-dimensional image has been described, but the image to be detected by the fingertip is not limited to this. For example, the hand region is extracted from the background difference of the distance image or the skin color region of the RGB image, and the fingertip in the hand region is detected by the same method (external curvature calculation, etc.) as described above. Also good. In this case, since the coordinates of the detected fingertip are coordinates on a two-dimensional image such as an RGB image or a distance image, it is necessary to convert the coordinate information into the three-dimensional coordinates of the orthogonal coordinate system using the distance information of the distance image at that coordinate. is there. At this time, the center of the curvature circle used when detecting the fingertip may be used as the fingertip point instead of the point on the outer shape that becomes the fingertip point.

図６の（ｅ）は、図６の（ａ）に示したＳ６０４のジェスチャー判定処理のフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
Ｓ６４１では、Ｓ６０３で検出した指先が１つかどうか判定する。指先が１つでなければＳ６４６へ進み、ジェスチャー無しと判定してジェスチャー判定処理を終了する。Ｓ６４１において検出した指先が１つであればＳ６４２へ進み、検出した指先と書画台２０４を含む平面との距離を算出する。Ｓ６４３ではＳ６４２で算出した距離が微小な所定値以下であるかどうかを判定し、Ｓ６４３がＹＥＳであればＳ６４４へ進んで指先が書画台２０４へタッチした、タッチジェスチャーありと判定する。
Ｓ６４３においてＳ６４２で算出した距離が所定値以下で無ければＳ６４５へ進み、指先が移動したジェスチャー（タッチはしていないが指先が書画台２０４上に存在するジェスチャー）と判定し、ジェスチャー判定処理を行う。上述したように、ここで判定したジェスチャーおよびその座標はＳ６０５でメイン制御部４０２へ通知する。 FIG. 6E is a flowchart of the gesture determination process in S604 illustrated in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
In S641, it is determined whether there is one fingertip detected in S603. If the number of fingertips is not one, the process proceeds to S646, determines that there is no gesture, and ends the gesture determination process. If there is one fingertip detected in S641, the process proceeds to S642, and the distance between the detected fingertip and the plane including the document table 204 is calculated. In S643, it is determined whether or not the distance calculated in S642 is equal to or smaller than a minute predetermined value. If S643 is YES, the process proceeds to S644 and it is determined that there is a touch gesture in which the fingertip touches the document table 204.
In S643, if the distance calculated in S642 is not equal to or smaller than the predetermined value, the process proceeds to S645, where it is determined that the fingertip has moved (the gesture is not touched but the fingertip is on the document table 204), and gesture determination processing is performed. . As described above, the gesture determined here and the coordinates thereof are notified to the main control unit 402 in S605.

＜物体検知部の処理＞
図８は、図４に示した物体検知部４１０のデータ処理を説明するフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
図８の（ａ）は、物体検知部４１０の処理の概要を示したフローチャートである。物体検知部４１０が処理を開始すると、Ｓ８０１では物体検知部初期化処理を行う。Ｓ８０２では物体が書画台２０４上に置かれたことの検知（物体載置検知処理）を行う。Ｓ８０３ではＳ９０２で検知した書画台２０４上の物体が除去されることの検知（物体除去検知処理）を行う。続いて、Ｓ８０１、Ｓ８０２、Ｓ８０３の処理の詳細を説明する。 <Processing of object detection unit>
FIG. 8 is a flowchart for explaining data processing of the object detection unit 410 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
FIG. 8A is a flowchart showing an outline of processing of the object detection unit 410. When the object detection unit 410 starts processing, in step S801, object detection unit initialization processing is performed. In step S802, detection (object placement detection processing) that an object has been placed on the document table 204 is performed. In step S803, the removal of the object on the document table 204 detected in step S902 is detected (object removal detection process). Next, details of the processing of S801, S802, and S803 will be described.

図８の（ｂ）はＳ８０１の物体検知部初期化処理の詳細手順を示すフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。物体検知部初期化処理を開始すると、Ｓ８１１ではカメラ画像取得部４０７からカメラ部２０２が撮影したカメラ画像を１フレーム取得する。Ｓ８１２では取得したカメラ画像を書画台背景カメラ画像として保存しておく（以降、「書画台背景カメラ画像」と記載した場合はここで取得したカメラ画像のことを指す）。Ｓ８１３では、同じカメラ画像を前フレームカメラ画像として保存しておく。Ｓ８１４では距離画像取得部４０８から距離画像を１フレーム取得する。Ｓ８１５では取得した距離画像を書画台背景距離画像とし保存しておき、物体検知部初期化処理を終了する（以降、「書画台背景距離画像」と記載した場合はここで取得した距離画像のことを指す）。 FIG. 8B is a flowchart showing a detailed procedure of the object detection unit initialization process in S801. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4). When the object detection unit initialization process is started, one frame of the camera image captured by the camera unit 202 is acquired from the camera image acquisition unit 407 in S811. In step S812, the acquired camera image is stored as a document table background camera image (hereinafter, “document table background camera image” refers to the acquired camera image). In S813, the same camera image is stored as the previous frame camera image. In S814, one frame of the distance image is acquired from the distance image acquisition unit 408. In step S815, the acquired distance image is stored as a document table background distance image, and the object detection unit initialization process is terminated (hereinafter referred to as the distance image acquired here when "document table background distance image" is described). ).

図８の（ｃ）は、図８の（ａ）に示したＳ８０２の物体載置検知処理の詳細手順を示すフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
物体載置検知処理を開始すると、Ｓ８２１ではカメラ画像取得部４０７からカメラ画像を１フレーム取得する。Ｓ８２２では取得したカメラ画像と前フレームカメラ画像との差分を計算してその絶対値を合計した差分値を算出する。Ｓ８２３では算出した差分値があらかじめ決めておいた所定値以上かどうかを判定する。算出した差分値が所定値未満であれば書画台２０４上には物体が無いと判断し、Ｓ８２８へ進んで今のカメラ画像を前フレームカメラ画像として保存してからＳ８２１へ戻って処理を続ける。
Ｓ８２３において差分値が所定値以上であればＳ８２４へ進み、Ｓ８２１で取得したカメラ画像と前フレームカメラ画像との差分値を、Ｓ８２２と同様に算出する。Ｓ８２５では算出した差分値があらかじめ決めておいた所定値以下であるかどうかを判定する。Ｓ８２５において算出した差分値が所定値よりも大きければ書画台２０４上の物体が動いていると判断し、Ｓ８２８へ進んで今のカメラ画像を前フレームカメラ画像として保存してから、Ｓ８２１へ戻り処理を続ける。Ｓ８２５において算出した差分値が所定値以下であればＳ８２６へ進む。
Ｓ８２６では、Ｓ８２５が連続してＹＥＳとなった回数から、差分値が所定値以下、つまり書画台２０４上の物体が静止した状態があらかじめ決めておいたフレーム数続いたかどうかを判定する。Ｓ８２６において書画台２０４上の物体が静止した状態があらかじめ決めておいたフレーム数続いていないと判定したら、Ｓ８２８へ進んで今のカメラ画像を前フレームカメラ画像として保存し、Ｓ８２１へ戻って処理を続ける。
Ｓ８２６において書画台２０４上の物体が静止した状態があらかじめ決めておいたフレーム数続いたと判定したら、Ｓ８２７へ進んで物体が置かれたことをメイン制御部４０２へ通知し、物体載置検知処理を終了する。 FIG. 8C is a flowchart showing a detailed procedure of the object placement detection process in S802 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
When the object placement detection process is started, one frame of camera image is acquired from the camera image acquisition unit 407 in S821. In step S822, the difference between the acquired camera image and the previous frame camera image is calculated, and a difference value obtained by summing the absolute values is calculated. In S823, it is determined whether or not the calculated difference value is equal to or larger than a predetermined value. If the calculated difference value is less than the predetermined value, it is determined that there is no object on the document table 204, the process proceeds to S828, the current camera image is stored as the previous frame camera image, and the process returns to S821 to continue the processing.
If the difference value is equal to or larger than the predetermined value in S823, the process proceeds to S824, and the difference value between the camera image acquired in S821 and the previous frame camera image is calculated in the same manner as in S822. In S825, it is determined whether or not the calculated difference value is equal to or less than a predetermined value. If the difference value calculated in S825 is larger than the predetermined value, it is determined that the object on the document table 204 is moving, the process proceeds to S828, the current camera image is stored as the previous frame camera image, and the process returns to S821. Continue. If the difference value calculated in S825 is less than or equal to the predetermined value, the process proceeds to S826.
In S826, it is determined from the number of times that S825 becomes YES continuously, whether or not the difference value is equal to or smaller than a predetermined value, that is, whether or not the object on the document table 204 has continued for a predetermined number of frames. If it is determined in S826 that the object on the document table 204 is not still in a predetermined number of frames, the process proceeds to S828, where the current camera image is stored as the previous frame camera image, and the process returns to S821 for processing. to continue.
If it is determined in S826 that the object on the document table 204 is stationary for a predetermined number of frames, the process proceeds to S827 to notify the main control unit 402 that the object has been placed, and the object placement detection process is performed. finish.

図８の（ｄ）は、図８の（ａ）に示したＳ８０３の物体除去検知処理の詳細フローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
物体除去検知処理を開始するとＳ８３１ではカメラ画像取得部４０７からカメラ画像を１フレーム取得する。Ｓ８３２では取得したカメラ画像と書画台背景カメラ画像との差分値を算出する。Ｓ８３３では算出した差分値が予め決めておいた所定値以下かどうかを判定する。Ｓ８３３において算出した差分値が予め決めておいた所定値よりも大きければ書画台２０４上にまだ物体が存在するため、Ｓ８３１へ戻って処理を続ける。Ｓ８３３において算出した差分値が予め決めておいた所定値以下であれば書画台２０４上の物体が無くなったため、物体除去をメイン制御部４０２へ通知し、物体除去検知処理を終了する。 FIG. 8D is a detailed flowchart of the object removal detection process of S803 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
When the object removal detection process is started, one frame of camera image is acquired from the camera image acquisition unit 407 in S831. In S832, a difference value between the acquired camera image and the document table background camera image is calculated. In step S833, it is determined whether the calculated difference value is equal to or less than a predetermined value. If the difference value calculated in S833 is larger than the predetermined value determined in advance, there is still an object on the document table 204, so the process returns to S831 to continue the processing. If the difference value calculated in S833 is equal to or smaller than a predetermined value determined in advance, the object on the document table 204 has disappeared, so the object removal is notified to the main control unit 402, and the object removal detection process ends.

＜平面原稿画像撮影部の説明＞
図９は、図４に示した平面原稿画像撮影部４１１が実行する処理を説明するフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。図１０は、図４に示した平面原稿画像撮影部４１１の処理を説明するための模式図である。
平面原稿画像撮影部４１１は処理を開始すると、Ｓ８０１ではカメラ画像取得部４０７を介してカメラ部２０２からの画像を１フレーム取得する。ここで、カメラ部２０２の座標系は図２の（ｂ）で示したように書画台２０４に正対していないため、このときの撮影画像は、図１０の（ａ）に示すように対象物１００１、書画台２０４ともに歪んでいる。
Ｓ９０２では、書画台背景カメラ画像とＳ９０１で取得したカメラ画像との画素毎の差分を算出し、差分画像を生成した上で、差分のある画素が黒、差分の無い画素が白となるように二値化する。したがって、ここで生成した差分画像は、図１０の（ｂ）の差分領域１００２のように、対象物１００１の領域が黒色である（差分がある）画像となる。Ｓ９０３では差分領域１００２を用いて、図１０の（ｃ）のように対象物１００１のみの画像を抽出する。
Ｓ９０４では、抽出した原稿領域画像に対して階調補正を行う。Ｓ９０５では、抽出した原稿領域画像に対してカメラ座標系から書画台２０４への射影変換を行い、図１０の（ｄ）のように書画台２０４の真上から見た画像１００３に変換する。ここで用いる射影変換パラメータは、ジェスチャー認識部４０９の処理において、前述した図６の（ｂ）のＳ６１２で算出した平面パラメータとカメラ座標系から求めることができる。なお、図１０の（ｄ）に示したように、書画台２０４上への原稿の置き方により、ここで得られる画像１００３は傾いていることがある。
そこで、Ｓ９０６では、画像１００３を矩形近似してからその矩形が水平になるように回転し、図１０の（ｅ）で示した画像１００４のように傾きの無い画像を得る。Ｓ９０７では抽出した画像１００４に対して、あらかじめ決めておいた画像フォーマット（例えばＪＰＥＧ、ＴＩＦＦ、ＰＤＦ等）に合わせて圧縮およびファイルフォーマット変換を行う。
Ｓ９０８では生成した画像データを、データ管理部４０５を介してＨＤＤ３０５の所定の領域へファイルとして保存し、平面原稿画像撮影部４１１の処理を終了する。 <Description of Flat Document Image Shooting Unit>
FIG. 9 is a flowchart illustrating processing executed by the flat document image photographing unit 411 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4). FIG. 10 is a schematic diagram for explaining the processing of the planar document image photographing unit 411 shown in FIG.
When the flat document image photographing unit 411 starts the processing, in S801, one frame of the image from the camera unit 202 is obtained via the camera image obtaining unit 407. Here, since the coordinate system of the camera unit 202 does not face the document table 204 as shown in FIG. 2B, the photographed image at this time is an object as shown in FIG. Both 1001 and the document table 204 are distorted.
In step S902, a pixel-by-pixel difference between the document table background camera image and the camera image acquired in step S901 is calculated, and a difference image is generated so that pixels having a difference are black and pixels having no difference are white. Binarize. Therefore, the difference image generated here is an image in which the region of the object 1001 is black (there is a difference), like the difference region 1002 in FIG. In S903, using the difference area 1002, an image of only the target object 1001 is extracted as shown in FIG.
In step S904, gradation correction is performed on the extracted document area image. In S905, the extracted document area image is subjected to projective transformation from the camera coordinate system to the document table 204, and converted to an image 1003 viewed from directly above the document table 204 as shown in FIG. The projective transformation parameters used here can be obtained from the plane parameters calculated in S612 of FIG. 6B and the camera coordinate system in the processing of the gesture recognition unit 409. As shown in FIG. 10D, the image 1003 obtained here may be tilted depending on how the document is placed on the document table 204.
Therefore, in S906, the image 1003 is approximated to a rectangle and then rotated so that the rectangle becomes horizontal, thereby obtaining an image with no inclination like the image 1004 shown in FIG. In step S907, compression and file format conversion are performed on the extracted image 1004 in accordance with a predetermined image format (for example, JPEG, TIFF, PDF, etc.).
In step S908, the generated image data is stored as a file in a predetermined area of the HDD 305 via the data management unit 405, and the processing of the flat original image photographing unit 411 is ended.

＜書籍画像撮影部の処理＞
図１１は、図４に示した書籍画像撮影部４１２が実行するデータ処理を説明するフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。図１２は、図４に示した書籍画像撮影部４１２の処理を説明するための模式図である。
書籍画像撮影部４１２は処理を開始すると、Ｓ１１０１ではカメラ画像取得部４０７を用いてカメラ部２０２からカメラ画像を、距離画像取得部４０８を用いて、距離画像センサ部２０８から距離画像を、それぞれ１フレームずつ取得する。ここで得られるカメラ画像の例を図１２の（ａ）に示す。
図１２の（ａ）では、書画台２０４と撮影対象書籍１２１１を含むカメラ画像１２０１が得られている。図１２の（ｂ）はここで得られた距離画像の例である。図１２の（ｂ）では、距離画像センサ部２０８に近い方が濃い色であらわされており、距離画像センサ部２０８から対象物体１２１２上の各画素への距離が含まれる距離画像１２０２が得られている。
また、図１２の（ｂ）において、距離画像センサ部２０８からの距離が書画台２０４よりも遠い画素については白であらわされており、対象物体１２１２の書画台２０４に接している部分（対象物体１２１２では右側のページ）も同じく白色となる。
Ｓ１１０２では取得したカメラ画像と距離画像から書画台２０４上に載置された書籍物体の３次元点群を算出する演算処理を行う。Ｓ１１０３では取得したカメラ画像とＳ１１０２で算出した３次元点群から、書籍画像のゆがみ補正処理を行い、２次元の書籍画像を生成する。続いて、Ｓ１１０２およびＳ１１０３の処理について説明する。 <Processing of book image photographing unit>
FIG. 11 is a flowchart for explaining data processing executed by the book image photographing unit 412 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4). FIG. 12 is a schematic diagram for explaining processing of the book image photographing unit 412 shown in FIG.
When the book image photographing unit 412 starts the processing, in S1101, the camera image obtaining unit 407 is used to obtain the camera image from the camera unit 202, and the distance image obtaining unit 408 is used to obtain the distance image from the distance image sensor unit 208. Get frame by frame. An example of the camera image obtained here is shown in FIG.
In FIG. 12A, a camera image 1201 including a document table 204 and a photographing target book 1211 is obtained. FIG. 12B is an example of the distance image obtained here. In FIG. 12B, a color closer to the distance image sensor unit 208 is represented by a darker color, and a distance image 1202 including the distance from the distance image sensor unit 208 to each pixel on the target object 1212 is obtained. ing.
In FIG. 12B, pixels farther from the distance image sensor unit 208 than the document table 204 are shown in white, and the portion of the target object 1212 that is in contact with the document table 204 (target object). In 1212, the right page) is also white.
In S1102, a calculation process for calculating a three-dimensional point group of the book object placed on the document table 204 from the acquired camera image and distance image is performed. In S1103, the book image is subjected to distortion correction processing from the acquired camera image and the three-dimensional point group calculated in S1102, and a two-dimensional book image is generated. Subsequently, the processing of S1102 and S1103 will be described.

図１１の（ｂ）は、図１１の（ａ）に示したＳ１１０２の書籍物体３次元点群算出処理を説明するフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
書籍物体に対する３次元点群算出処理を開始すると、Ｓ１１１１では距離画像１２０２と書画台背景カメラ画像との画素毎の差分を算出して二値化を行い、図１２の（ｃ）のように物体領域１２１３が黒で示されるカメラ差分画像１２０３を生成する。
Ｓ１１１２ではカメラ差分画像１２０３を、カメラ座標系から距離画像センサ座標系へ変換する演算処理を行い、図１２の（ｄ）のように距離画像センサ部２０８からみた物体領域１２１４を含むカメラ差分画像１２０４を生成する。Ｓ１１１３では距離画像と書画台背景距離画像との画素毎の差分を算出して二値化を行い、図１２の（ｅ）のように物体領域１２１５が黒で示される距離差分画像１２０５を生成する。ここで、対象物体である撮影対象書籍１２１１の書画台２０４と同じ色で有る部分については、画素値の差が小さくなるためカメラ差分画像１２０４中の物体領域１２１３に含まれなくなる場合がある。
また、対象物体１２１２の書画台２０４と高さが変わらない部分については距離センサ部２０８からの距離値が書画台２０４と差が小さいため、距離差分画像１２０５中の物体領域１２１５には含まれない場合がある。
そこで、Ｓ１１１４ではカメラ差分画像１２０４と距離差分画像１２０５の和をとって図１２の（ｆ）に示す物体領域画像１２０６を生成し、物体領域１２１６を得る。ここで物体領域１２１６は書画台２０４と比べて色が異なるかまたは高さが異なる領域となり、カメラ差分画像１２０３中の物体領域１２１３か距離差分画像１２０５中の物体領域１２１５のいずれか片方のみを使った場合よりも、より正確に物体領域を表している。
物体領域画像１２０６は距離画像センサ座標系であるため、Ｓ１１１５では距離画像１２０２から物体領域画像１２０６中の物体領域１２１６のみを抽出することが可能である。Ｓ１１１６では、Ｓ１１１５で抽出した距離画像を直交座標系に変換する演算処理を行うことにより図１２（ｇ）に示した３次元点群１２１７を生成する。この３次元点群１２１７が書籍物体の３次元点群であり、書籍物体３次元点群算出処理を終了する。 FIG. 11B is a flowchart for explaining the book object three-dimensional point group calculation process of S1102 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
When the three-dimensional point group calculation process for the book object is started, in S1111, the pixel-wise difference between the distance image 1202 and the document table background camera image is calculated, and binarization is performed. As shown in FIG. A camera difference image 1203 in which an area 1213 is shown in black is generated.
In S1112, a calculation process for converting the camera difference image 1203 from the camera coordinate system to the distance image sensor coordinate system is performed, and the camera difference image 1204 including the object region 1214 viewed from the distance image sensor unit 208 as illustrated in FIG. Is generated. In step S1113, a pixel-by-pixel difference between the distance image and the document table background distance image is calculated and binarized to generate a distance difference image 1205 in which the object region 1215 is shown in black as shown in FIG. . Here, a portion having the same color as the document table 204 of the photographing target book 1211 that is the target object may not be included in the object region 1213 in the camera difference image 1204 because the difference in pixel values is small.
Further, the portion of the target object 1212 whose height does not change from the document table 204 is not included in the object area 1215 in the distance difference image 1205 because the distance value from the distance sensor unit 208 is small from the document table 204. There is a case.
Therefore, in S1114, the sum of the camera difference image 1204 and the distance difference image 1205 is taken to generate an object area image 1206 shown in FIG. Here, the object area 1216 is an area having a different color or a different height as compared to the document table 204, and only one of the object area 1213 in the camera difference image 1203 and the object area 1215 in the distance difference image 1205 is used. The object region is represented more accurately than the case.
Since the object area image 1206 is a distance image sensor coordinate system, only the object area 1216 in the object area image 1206 can be extracted from the distance image 1202 in S1115. In S1116, the three-dimensional point group 1217 shown in FIG. 12G is generated by performing arithmetic processing for converting the distance image extracted in S1115 into an orthogonal coordinate system. This three-dimensional point group 1217 is a three-dimensional point group of the book object, and the book object three-dimensional point group calculation process is terminated.

図１１の（ｃ）は、図１１の（ａ）に示したＳ１１０３の書籍画像ゆがみ補正処理を説明するフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
書籍画像ゆがみ補正処理を開始すると、Ｓ１１２１では物体領域画像１２０６を距離センサ画像座標系からカメラ座標系に変換する。Ｓ１１２２ではカメラ画像１２０１から物体領域画像１２０６中の物体領域１２１６をカメラ座標系に変換したものを用いて物体領域を抽出する。
Ｓ１１２３では抽出した物体領域画像を書画台平面へ射影変換する。Ｓ１１２４では射影変換した物体領域画像を矩形近似し、その矩形が水平になるように回転することによって、図１２の（ｈ）の書籍画像１２０８を生成する。書籍画像１２０８は近似矩形の片方の編がＸ軸に平行となっているため、以降書籍画像１２０８に対してＸ軸方向へのゆがみ補正処理を行う。
Ｓ１１２５では書籍画像１２０８の最も左端の点をＰとする（図１２の（ｈ）の点Ｐ）。Ｓ１１２６では書籍物体の３次元点群１２１７から点Ｐの高さ（図１２の（ｈ）のｈ１）を取得する。Ｓ１１２７では書籍画像１２０８の点Ｐに対してＸ軸方向に所定の距離（図１２の（ｈ）のｘ１）離れた点をＱとする（図１２の（ｈ）の点Ｑ）。Ｓ１１２８では３次元点群１２１７から点Ｑの高さ（図１２（ｈ）のｈ２）を取得する。Ｓ１２１９では点Ｐと点Ｑの書籍物体上での距離（図１２（ｈ）のｌ１）を直線近似で算出する。
数式２

FIG. 11C is a flowchart for explaining the book image distortion correction processing in S1103 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
When the book image distortion correction process is started, in S1121, the object region image 1206 is converted from the distance sensor image coordinate system to the camera coordinate system. In step S1122, the object region is extracted from the camera image 1201 using the object region 1216 in the object region image 1206 converted into the camera coordinate system.
In step S1123, the extracted object region image is projectively converted to the document table plane. In S1124, the object region image obtained by projective transformation is approximated to a rectangle, and the book image 1208 shown in FIG. 12H is generated by rotating the rectangle so that the rectangle becomes horizontal. Since the book image 1208 has one of the approximate rectangles parallel to the X axis, the book image 1208 is subjected to distortion correction processing in the X axis direction thereafter.
In S1125, the leftmost point of the book image 1208 is set to P (point P in (h) of FIG. 12). In S1126, the height of the point P (h1 in FIG. 12H) is acquired from the three-dimensional point group 1217 of the book object. In S1127, a point separated by a predetermined distance (x1 in FIG. 12H) from the point P of the book image 1208 in the X-axis direction is defined as Q (point Q in FIG. 12H). In S1128, the height of the point Q (h2 in FIG. 12 (h)) is acquired from the three-dimensional point group 1217. In step S1219, the distance between the point P and the point Q on the book object (l1 in FIG. 12H) is calculated by linear approximation.
Formula 2

Ｓ１１３０では、算出した距離ｌ１でＰＱ間の距離を補正し、図１２の（ｈ）における画像１２１９上の点Ｐ'と点Ｑ'の位置に画素をコピーする。Ｓ１１３１では処理を行った点Ｑを点Ｐとし、Ｓ１１２８に戻って同じ処理を行うことによって図１２の（ｈ）の点Ｑと点Ｒの間の補正を実行することができ、画像１２１９上の点Ｑ'と点Ｒ'の画素とする。この処理を全画素について繰り返すことにより、画像１２１９はゆがみ補正後の画像となる。Ｓ１１３２ではゆがみ補正処理を全ての点について終えたかどうかを判断し、終えていれば書籍物体のゆがみ補正処理を終了する。以上のようにして、Ｓ１１０２、Ｓ１１０３の処理を行ってゆがみ補正を行った書籍画像を生成することができる。 In S1130, the distance between the PQs is corrected by the calculated distance l1, and the pixels are copied to the positions of the points P ′ and Q ′ on the image 1219 in (h) of FIG. In S1131, the processed point Q is set as the point P, and the process returns to S1128 and the same processing is performed, whereby the correction between the point Q and the point R in (h) of FIG. It is assumed that the pixel is a point Q ′ and a point R ′. By repeating this process for all pixels, the image 1219 becomes an image after distortion correction. In S1132, it is determined whether or not the distortion correction processing has been completed for all points. If completed, the book object distortion correction processing is terminated. As described above, it is possible to generate a book image subjected to the distortion correction by performing the processes of S1102 and S1103.

ゆがみ補正を行った書籍画像の生成後、Ｓ１１０４では生成した書籍画像に階調補正を行う。Ｓ１１０５では生成した書籍画像に対して、あらかじめ決めておいた画像フォーマット（例えばＪＰＥＧ、ＴＩＦＦ、ＰＤＦ等）に合わせて圧縮およびファイルフォーマット変換を行う。Ｓ１１０６では生成した画像データを、データ管理部４０５を介してＨＤＤ３０５の所定の領域へファイルとして保存し、書籍画像撮影部４１２の処理を終了する。 After generating the book image subjected to the distortion correction, in S1104, gradation correction is performed on the generated book image. In S1105, compression and file format conversion are performed on the generated book image in accordance with a predetermined image format (for example, JPEG, TIFF, PDF, etc.). In step S1106, the generated image data is stored as a file in a predetermined area of the HDD 305 via the data management unit 405, and the processing of the book image photographing unit 412 is ended.

＜立体形状測定部の説明＞
図１３は、図４に示した立体形状測定部４１３が実行するデータ処理を説明するフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。図１４は、図４に示した立体形状測定部４１３の処理を説明するための模式図である。
立体形状測定部４１３が処理を開始すると、Ｓ１３０１では書画台２０４内に設けられたターンテーブル２０９上の対象物に対して、カメラ部２０２とプロジェクタ部２０７を用いた３次元点群測定処理を行う。
図１３の（ｂ）は、図１３の（ａ）に示したＳ１３０１で実行する３次元点群測定処理のフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
３次元点群測定処理を開始すると、Ｓ１３１１では図１４の（ａ）に示したターンテーブル２０９上の対象物１４０１に対して、プロジェクタ部２０７から３次元測定パターン１４０２を投射する。Ｓ１３１２では、カメラ画像取得部４０７を介してカメラ部２０２からカメラ画像を１フレーム取得する。Ｓ１３１３では、取得したカメラ画像とカメラ部２０２およびプロジェクタ部２０７の位置関係から、３次元測定パターン１４０２上の各点の距離を測定する。
ここでの測定方法は、距離画像取得部４０８の処理において、図５のＳ５０３で説明した測定方法と同じである。
Ｓ１３１４では距離画像取得部４０８の処理と同様に、カメラ画像の各画素の距離値を算出し、距離画像を生成する。Ｓ１３１５では距離画像の各画素について直交座標系へ座標変換する所定の演算処理（座標変換処理）を行い、３次元点群を算出する。Ｓ１３１６では算出した３次元点群から書画台２０４の平面パラメータを用いて書画台平面に含まれる３次元点群を除去する。
そして、Ｓ１３１７では残った３次元点群の中から位置が大きく外れている点をノイズとして除去し、対象物１４０１の３次元点群１４０３を生成する。Ｓ１３１８ではプロジェクタ部２０７から投射している３次元測定パターン１４０２を消灯する。Ｓ１３１９ではカメラ画像取得部４０７を介してカメラ部２０２からカメラ画像を取得し、その角度から見たときのテクスチャ画像として保存し、３次元点群測定処理を終了する。 <Description of the three-dimensional shape measurement unit>
FIG. 13 is a flowchart for explaining data processing executed by the three-dimensional shape measurement unit 413 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4). FIG. 14 is a schematic diagram for explaining the processing of the three-dimensional shape measuring unit 413 shown in FIG.
When the three-dimensional shape measurement unit 413 starts processing, in step S1301, a three-dimensional point group measurement process using the camera unit 202 and the projector unit 207 is performed on an object on the turntable 209 provided in the document table 204. .
FIG. 13B is a flowchart of the three-dimensional point group measurement process executed in S1301 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
When the three-dimensional point group measurement process is started, a three-dimensional measurement pattern 1402 is projected from the projector unit 207 to the object 1401 on the turntable 209 shown in FIG. In S1312, one frame of camera image is acquired from the camera unit 202 via the camera image acquisition unit 407. In step S 1313, the distance of each point on the three-dimensional measurement pattern 1402 is measured from the acquired camera image and the positional relationship between the camera unit 202 and the projector unit 207.
The measurement method here is the same as the measurement method described in S503 of FIG. 5 in the processing of the distance image acquisition unit 408.
In S1314, as in the process of the distance image acquisition unit 408, the distance value of each pixel of the camera image is calculated to generate a distance image. In step S1315, a predetermined calculation process (coordinate conversion process) for converting the coordinates of each pixel of the distance image into the orthogonal coordinate system is performed to calculate a three-dimensional point group. In step S1316, the 3D point group included in the document table plane is removed from the calculated 3D point group using the plane parameters of the document table 204.
In step S 1317, a point whose position is greatly deviated from the remaining three-dimensional point group is removed as noise, and a three-dimensional point group 1403 of the object 1401 is generated. In step S1318, the three-dimensional measurement pattern 1402 projected from the projector unit 207 is turned off. In step S1319, a camera image is acquired from the camera unit 202 via the camera image acquisition unit 407, stored as a texture image when viewed from the angle, and the three-dimensional point cloud measurement process is terminated.

Ｓ１３０１の最初の３次元点群測定処理を行うと、Ｓ１３０２ではシリアルＩ／Ｆ３１０を介してターンテーブル２０９へ回転指示を行い、ターンテーブルを所定の角度、回転する。ここでの回転角度は小さければ小さいほど最終的な測定精度は高くなるが、その分測定回数が多くなり時間がかかるため、装置として適切な回転角度を予め決めておけば良い。Ｓ１３０３では、再び図１３（ｂ）の３次元点群測定処理を行う。
このとき、図１４の（ｃ）に示すようにターンテーブル２０９上の対象物１４０１と、プロジェクタ部２０７およびカメラ部２０２の角度が変わっている。そのため、図１４（ｄ）に示した３次元点群１４０４のように、Ｓ１３０１で得られた３次元点群１４０３とは異なる視点から見た３次元点群が得られる。ここで、３次元点群は、載置されたオブジェクトの立体形状を特性するデータである。
つまり、３次元点群１４０３ではカメラ部２０２およびプロジェクタ部２０７から死角領域となって一方のオブジェクトの３次元点群が算出できなかった部分の３次元点群が、３次元点群１４０４では含まれることになる（逆に、３次元点群１４０４には含まれない３次元点群が、３次元点群１４０３には含まれている）。
なお、死角領域撮影に際して、オブジェクトが撮影前の配置位置から他の配置位置へ移動されたかを判断して、カメラ部２０２に撮影させるように撮影制御を行う。
そこで、異なる視点から見た２つの３次元点群１４０３と１４０４を重ね合わせて合成する処理を行う。Ｓ１３０４ではＳ１３０３で測定した３次元点群１４０４を、ターンテーブルが初期位置から載置平面に対して回転した角度分逆回転移動させてカメラ部２０２により撮影させるように撮影制御することにより、３次元点群１４０３との位置を大まかに合わせた３次元点群１４０５を算出する。Ｓ１３０５では特徴点による３次元点群合成処理を行う。
図１３の（ｃ）は、図１３の（ａ）に示したＳ１３０５の３次元点群合成処理の詳細を示すフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
３次元点群合成処理を開始すると、Ｓ１３２１では合成対象の２つの３次元点群１４０３と１４０５から、それぞれコーナーとなる点を抽出し、３次元特徴点とする。Ｓ１３２２では３次元点群１４０３の特徴点と３次元点群１４０５の特徴点の対応をとって、すべての対応点同士の距離を算出して加算し、３次元点群１４０５の位置を動かしながら対応点同士の距離の和が最小となる位置を算出する。Ｓ１３２３では算出した位置に３次元点群１４０５を移動してから３次元点群１４０３と重ね合わせることにより、２つの３次元点群１４０３と１４０５を合成する。Ｓ１３２４では、合成により点の密度が大きくなっている部分があるため、点の密度が均一になるようにダウンサンプリングを行う。このようにして合成後の３次元点群１４０６を生成し、３次元点群合成処理を終了する。 When the first three-dimensional point group measurement process of S1301 is performed, in S1302, a rotation instruction is given to the turntable 209 via the serial I / F 310, and the turntable is rotated by a predetermined angle. The smaller the rotation angle is, the higher the final measurement accuracy is. However, the number of times of measurement increases and it takes time, and therefore, an appropriate rotation angle for the apparatus may be determined in advance. In S1303, the three-dimensional point group measurement process of FIG. 13B is performed again.
At this time, as shown in FIG. 14C, the angles of the object 1401 on the turntable 209, the projector unit 207, and the camera unit 202 are changed. Therefore, a three-dimensional point group viewed from a different viewpoint from the three-dimensional point group 1403 obtained in S1301 is obtained, such as a three-dimensional point group 1404 shown in FIG. Here, the three-dimensional point group is data that characterizes the three-dimensional shape of the placed object.
In other words, the 3D point group 1403 includes a 3D point group that is a blind spot region from the camera unit 202 and the projector unit 207 and for which the 3D point group of one object could not be calculated. (Conversely, a 3D point group not included in the 3D point group 1404 is included in the 3D point group 1403).
It should be noted that in blind spot area photographing, it is determined whether the object has been moved from the arrangement position before photographing to another arrangement position, and photographing control is performed so as to cause the camera unit 202 to photograph.
Therefore, a process of superimposing and synthesizing two three-dimensional point groups 1403 and 1404 viewed from different viewpoints is performed. In step S1304, the three-dimensional point group 1404 measured in step S1303 is subjected to photographing control so that the turntable is rotated reversely by an angle rotated with respect to the placement plane from the initial position and is photographed by the camera unit 202. A three-dimensional point group 1405 that roughly matches the position with the point group 1403 is calculated. In step S1305, a three-dimensional point group synthesis process using feature points is performed.
FIG. 13C is a flowchart showing details of the three-dimensional point group synthesis processing in S1305 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
When the three-dimensional point group synthesizing process is started, in S1321, points that become corners are extracted from the two three-dimensional point groups 1403 and 1405 to be synthesized and set as three-dimensional feature points. In S1322, the correspondence between the feature points of the three-dimensional point group 1403 and the feature points of the three-dimensional point group 1405 is calculated, the distances between all the corresponding points are calculated and added, and the position of the three-dimensional point group 1405 is moved. The position where the sum of the distances between the points is minimized is calculated. In step S1323, the three-dimensional point group 1405 is moved to the calculated position and then superimposed on the three-dimensional point group 1403, thereby synthesizing the two three-dimensional point groups 1403 and 1405. In S1324, since there is a portion where the density of the points is increased by the synthesis, downsampling is performed so that the density of the points becomes uniform. In this way, the combined three-dimensional point group 1406 is generated, and the three-dimensional point group combining process is terminated.

Ｓ１３０５の３次元点群合成処理が終了するとＳ１３０６ではターンテーブル２０９が１周回転したかを判断する。Ｓ１３０６でまだターンテーブル２０９が１周回転していなければ、Ｓ１３０２へ戻ってターンテーブル２０９をさらに回転してからＳ１３０３を実行して別の角度の３次元点群を測定する。そしてＳ１３０４、Ｓ１３０５で既に生成した３次元点群１４０５と新たに測定した３次元点群との合成処理を行う。
このようにＳ１３０２からＳ１３０５の処理をターンテーブル２０９が１周するまで繰り返すことにより、対象物１４０１の３次元点群を生成することができる。 When the three-dimensional point cloud composition processing in S1305 is completed, it is determined in S1306 whether the turntable 209 has rotated one turn. If the turntable 209 has not yet rotated once in S1306, the process returns to S1302, further rotates the turntable 209, and then executes S1303 to measure a three-dimensional point group at another angle. Then, the 3D point group 1405 already generated in S1304 and S1305 is combined with the newly measured 3D point group.
In this way, by repeating the processing from S1302 to S1305 until the turntable 209 makes one turn, a three-dimensional point group of the object 1401 can be generated.

Ｓ１３０６でターンテーブル２０９が１周したと判断するとＳ１３０７へ進み、生成した３次元点群から３次元モデルを算出する処理を行う。
図１３の（ｄ）は、図１３の（ａ）に示すＳ１３０７の３次元モデル算出処理の詳細手順を示すフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。
３次元モデル算出処理を開始すると、Ｓ１３３１では３次元点群からノイズ除去および平滑化を行う。Ｓ１３３２では３次元点群の点を平面でつなぎ、メッシュ化を行う。Ｓ１３３３ではメッシュ化によって得られた平面へＳ１３１０で保存したテクスチャをテクスチャマッピングする。Ｓ１３３３ではテクスチャマッピング後のデータをＶＲＭＬやＳＴＬ等の標準的な３次元モデルデータフォーマットへ変換し、３次元モデル算出処理を終了する。 If it is determined in S1306 that the turntable 209 has made one turn, the process proceeds to S1307, and a process of calculating a three-dimensional model from the generated three-dimensional point group is performed.
FIG. 13D is a flowchart showing a detailed procedure of the three-dimensional model calculation process in S1307 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4).
When the three-dimensional model calculation process is started, noise removal and smoothing are performed from the three-dimensional point group in S1331. In S1332, the points of the three-dimensional point group are connected by a plane and meshed. In S1333, the texture saved in S1310 is texture-mapped to the plane obtained by meshing. In step S1333, the texture-mapped data is converted into a standard three-dimensional model data format such as VRML or STL, and the three-dimensional model calculation process is terminated.

Ｓ１３０７の３次元モデル算出処理を終了すると、Ｓ１３０８では算出した３次元モデルを、データ管理部４０５を介してＨＤＤ３０５上の所定の領域に格納し、立体形状測定部４１３のデータ処理を終了する。 When the three-dimensional model calculation process in S1307 ends, the calculated three-dimensional model is stored in a predetermined area on the HDD 305 via the data management unit 405 in S1308, and the data processing of the three-dimensional shape measurement unit 413 ends.

＜メイン制御部の説明＞
図１５Ａ〜図１５Ｃは、図４に示したメイン制御部４０２が実行するスキャンアプリケーションの処理を説明するフローチャートである。特に、図１５Ａの（ａ）は、図４に示したメイン制御部４０２の処理を示すフローチャートである。なお、各ステップは、ＣＰＵ３０２が対応する制御プログラム（図４に示すモジュール）を実行することで実現される。また、特に断らない場合、各ステップの制御主体は、メイン制御部４０２とする。
メイン制御部４０２が処理を開始すると、Ｓ１５０１で、格納方法と用紙サイズを選択させる。図１５Ｂの（ｅ）は格納方法／用紙サイズ選択処理を示すフローチャートである。 <Description of main control unit>
15A to 15C are flowcharts for explaining the scan application process executed by the main control unit 402 shown in FIG. In particular, (a) of FIG. 15A is a flowchart showing processing of the main control unit 402 shown in FIG. Each step is realized by the CPU 302 executing a corresponding control program (module shown in FIG. 4). Unless otherwise specified, the main control unit of each step is the main control unit 402.
When the main control unit 402 starts processing, in S1501, the storage method and the paper size are selected. (E) of FIG. 15B is a flowchart showing a storage method / paper size selection process.

Ｓ１５４１で図１６Ａの（ｄ）に示した格納法選択開始画面を、ユーザインタフェース部４０３を介して書画台２０４に投射する。図１６Ａの（ｄ）において、格納形式（格納方法）の選択を促すメッセージ１６３１、「１ページ１個」ボタン１６３２、「１ページ１ステージ」ボタン１６３３である。「１ページ１個」ボタン１６３２は、ドキュメントの新規１ページに１個ずつ対象物のデータを格納することを選択する。「１ページ１ステージ」ボタン１６３３は、書画台２０４のターンテーブル２０９上においた対象物すべてをドキュメントの新規１ページ内に格納することを選択する。ステージは、書画台と同一の意味である。ユーザがいずれかのボタンを押下することでＳ１５４２に進む。ここで「１ページ１個」とは、複数のオブジェクトをそれぞれ分離して各ページに分けて格納することをいう。 In S1541, the storage method selection start screen shown in FIG. 16A (d) is projected onto the document stage 204 via the user interface unit 403. In FIG. 16A (d), there are a message 1631 for prompting selection of a storage format (storage method), a “one page one” button 1632, and a “one page one stage” button 1633. The “one page” button 1632 selects to store the object data one by one on a new page of the document. The “one page and one stage” button 1633 selects that all objects placed on the turntable 209 of the document table 204 are stored in one new page of the document. The stage has the same meaning as the document table. When the user presses any button, the process proceeds to S1542. Here, “one page” means that a plurality of objects are separated and stored on each page.

Ｓ１５４２では、「１ページ１個」ボタン１６３２が押下されたときはＳ１５４３に進み、「１ページ１ステージ」ボタン１６３３が押下されたときは、Ｓ１５４４に進む。Ｓ１５４３では、１ページ１個処理用の用紙サイズ選択画面である図１６Ａ（ｅ）を投射する。 In S1542, when the “one page” button 1632 is pressed, the process proceeds to S1543, and when the “one page and one stage” button 1633 is pressed, the process proceeds to S1544. In S1543, FIG. 16A (e), which is a paper size selection screen for processing one page per page, is projected.

図１６Ａの（ｅ）で、メッセージ１６４０は用紙サイズの選択を促すメッセージ、枠１６３４は選択された用紙サイズを示す枠である。ページ領域を特定するボタン１６３５〜１６３９は用紙サイズを選択するボタンである。 In FIG. 16A (e), a message 1640 is a message prompting the user to select a paper size, and a frame 1634 is a frame indicating the selected paper size. Buttons 1635 to 1639 for specifying the page area are buttons for selecting a paper size.

同様にＳ１５４４では、１ページ１ステージ処理用の用紙サイズ選択画面である図１６Ｂの（ｈ）を投射する。図１６Ｂの（ｈ）で、メッセージ１６４０は用紙サイズの選択を促すメッセージ、枠１６３４は選択された用紙サイズを示す枠である。ボタン１６３６〜１６３９は用紙サイズを選択するボタンである。
次にＳ１５４５で選択された格納方法「１ページ１個」または「１ページ１ステージ（複数のオブジェクトをページ合成して格納する）」をＲＡＭ３０３に記憶する。
Ｓ１５４６では、ユーザによりボタン１６３５〜１６３８を押下されたときはＳ１５５１に進む。「その他」ボタン１６３９が押下されたときはＳ１５４７に進む。 Similarly, in S1544, (h) in FIG. 16B, which is a paper size selection screen for one page and one stage processing, is projected. In FIG. 16B (h), a message 1640 is a message prompting the user to select a paper size, and a frame 1634 is a frame indicating the selected paper size. Buttons 1636 to 1639 are buttons for selecting a paper size.
Next, the storage method “one page / one” or “one page / one stage (multiple objects are combined and stored)” selected in S 1545 is stored in the RAM 303.
In S1546, when the buttons 1635 to 1638 are pressed by the user, the process proceeds to S1551. If the “other” button 1639 is pressed, the process advances to step S1547.

Ｓ１５４７では、「その他」用の処理画面を投射する。図１６Ａの（ｆ）で、ボタン１６４１〜１６４３は最初の画面の図１６Ａの（ｅ）になかった定型用紙サイズが選択可能となる。ボタン１６４４は、定形外の用紙サイズを選択する場合に押下するボタン。ユーザがボタン１６４１〜１６４３を押下した場合はＳ１５５１に進む。ボタン１６４４を押下した場合は、Ｓ１５４９に進む。 In S1547, a processing screen for “others” is projected. In (f) of FIG. 16A, the buttons 1641 to 1643 can select a standard paper size that was not in (e) of FIG. 16A of the first screen. A button 1644 is a button to be pressed when selecting a non-standard paper size. When the user presses the buttons 1641 to 1643, the process proceeds to S1551. If the button 1644 has been pressed, the process advances to step S1549.

Ｓ１５４９では図１６Ｂの（ｇ）に示す定形外用の用紙サイズを設定する画面を投射する。メッセージ１６４６は用紙サイズの設定を促すメッセージで、枠１６３４の４隅のマーク１６４５をタッチして移動することで用紙枠の大きさを設定する。
Ｓ１５５０で、確定ボタン１６４７が押下されたら、用紙枠の大きさが確定なのでＳ１５５１に進む。Ｓ１５５１では、選択された用紙サイズをＲＡＭ３０３に記憶して、格納方法／用紙サイズ選択処理を終了する。 In S1549, a screen for setting a non-standard-size paper size shown in (g) of FIG. 16B is projected. A message 1646 prompts the user to set the paper size. The size of the paper frame is set by touching and moving marks 1645 at the four corners of the frame 1634.
If the confirm button 1647 is pressed in S1550, the size of the paper frame is confirmed, and the process proceeds to S1551. In step S1551, the selected paper size is stored in the RAM 303, and the storage method / paper size selection process ends.

Ｓ１５０２では書画台２０４にスキャンの対象物が載置されるのを待つ。図１５Ａの（ｂ）はＳ１５０２の対象物載置待ち処理の詳細である。対象物載置待ち処理を開始すると、Ｓ１５１１ではユーザインタフェース部４０３を介して、書画台２０４にプロジェクタ部２０７によって図１６Ａの（ａ）の画面を投射する。
図１６Ａの（ａ）の画面では、書画台２０４上に対象物を置くことをユーザに促すメッセージ１６０１を投射する。Ｓ１５１２では物体検知部４１０の処理を起動する。物体検知部４１０は図８のフローチャートで説明した処理の実行を開始する。Ｓ１５１３では、物体検知部４１０からの物体載置通知を待つ。物体検知部４１０が図８のＳ８２７の処理を実行して物体載置をメイン制御部４０２へ通知すると、Ｓ１５１３において物体載置通知ありと判断し、物体載置待ち処理を終了する。 In step S1502, the process waits for an object to be scanned to be placed on the document table 204. (B) of FIG. 15A is the detail of the object mounting waiting process of S1502. When the object placement waiting process is started, in step S 1511, the projector unit 207 projects the screen illustrated in FIG. 16A on the document table 204 via the user interface unit 403.
On the screen of FIG. 16A (a), a message 1601 that prompts the user to place an object on the document table 204 is projected. In step S1512, the processing of the object detection unit 410 is activated. The object detection unit 410 starts executing the process described with reference to the flowchart of FIG. In step S1513, an object placement notification from the object detection unit 410 is awaited. When the object detection unit 410 executes the process of S827 in FIG. 8 and notifies the main control unit 402 of the object placement, in S1513 it is determined that there is an object placement notification, and the object placement waiting process ends.

Ｓ１５０２の物体載置待ち処理を終了すると、メイン制御部４０２は続いてＳ１５０３のスキャン実行処理を行う。図１５Ａの（ｄ）のフローチャートがＳ１５０３のスキャン実行処理の詳細である。 When the object placement waiting process in S1502 ends, the main control unit 402 subsequently performs a scan execution process in S1503. The flowchart of (d) of FIG. 15A shows the details of the scan execution process of S1503.

スキャン実行処理を開始すると、Ｓ１５３０で対象物の複数物体位置判別処理を行う。図１５Ｃの（ｇ）のフローチャートがＳ１５３０の対象物の複数物体位置判別処理の詳細である。
Ｓ１５８０では、距離画像を１フレーム取得し３次元点群に変換する。この処理はＳ６２１と同一である。 When the scan execution process is started, a multi-object position determination process for an object is performed in S1530. The flowchart of (g) of FIG. 15C is the detail of the multiple object position determination process of the target object in S1530.
In S1580, one frame of the distance image is acquired and converted into a three-dimensional point group. This process is the same as S621.

次にＳ１５８１で、３次元点群から高さが所定値以下の点とノイズ除去を行う。具体的には高さが０．１ｍｍ以下の点、孤立点をノイズとして除去する。残った点群から隣接する点を集めて一つにしたものを点群グループと呼ぶ。
図１６Ｂの（ｉ）は、ノイズ除去後の点群の状態を示している。点群グループ１６５１は、立体物である車の点群である。点群グループ１６５２は、紙の点群である。 Next, in step S1581, noise is removed from a point whose height is a predetermined value or less from the three-dimensional point group. Specifically, points with a height of 0.1 mm or less and isolated points are removed as noise. A group of adjacent points collected from the remaining point group is referred to as a point group.
(I) of FIG. 16B shows the state of the point group after noise removal. The point cloud group 1651 is a point cloud of a car that is a three-dimensional object. The point cloud group 1652 is a paper point cloud.

次にＳ１５８２で、連続する点群から所定値以上の面積を占める点群グループの位置を対象物の位置としてＲＡＭ３０３に記憶する。点群グループの面積は書画台（ＸＹ平面）に対する射影の外接矩形の面積であり、たとえば１ｃｍ^２以下の場合はノイズとして対象物としない。外接矩形の例としては図１６Ｂ（ｊ）の枠１６６０や枠１６６１である。個々の対象物の外接矩形の位置を記憶し、複数の対象物の位置判別処理を終了する。 In step S1582, the position of the point group that occupies an area of a predetermined value or more from the continuous point group is stored in the RAM 303 as the position of the object. The area of the point group is the area of the circumscribed rectangle projected onto the document table (XY plane). For example, when the area is 1 cm ² or less, it is not regarded as an object as noise. Examples of the circumscribed rectangle are the frame 1660 and the frame 1661 in FIG. 16B (j). The positions of circumscribed rectangles of the individual objects are stored, and the position determination process for the plurality of objects is completed.

Ｓ１５３１では、図１６Ａの（ｂ）に示したスキャン開始画面を、ユーザインタフェース部４０３を介して書画台２０４に投射する。図１６Ａの（ｂ）は、Ｓ１５３０の複数物体位置判別処理で書画台上の対象物がひとつと判定した場合の例である。図１６Ａの（ｂ）において、対象物１６１１がユーザによって載置されたスキャン対象物体である。撮影条件を設定する２Ｄスキャンボタン１６１２は平面原稿の撮影指示を受け付けるボタンである。撮影条件を設定するブックスキャンボタン１６１３は書籍原稿の撮影指示を受け付けるボタンである。
撮影条件を設定する３Ｄスキャンボタン１６１４は立体形状の測定指示を受け付けるボタンである。スキャン開始ボタン１６１５は選択したスキャンの実行開始指示を受け付けるボタンである。ユーザインタフェース部４０３は、前述したようにジェスチャー認識部４０９から通知されるタッチジェスチャーの座標とこれらのボタンを表示している座標から、いずれかのボタンがユーザによって押下されたことを検知する（以降、ユーザインタフェース部による検知の説明を省略して「ボタンへのタッチを検知する」と記載する）。また、ユーザインタフェース部４０３は、２Ｄスキャンボタン１６１２、ブックスキャンボタン１６１３、３Ｄスキャンボタン１６１４のそれぞれを排他的に選択できるようにしている。ユーザのいずれかのボタンへのタッチを検知すると、タッチされたボタンを選択状態とし、他のボタンの選択を解除する。 In step S1531, the scan start screen illustrated in FIG. 16A (b) is projected onto the document stage 204 via the user interface unit 403. (B) of FIG. 16A is an example when it is determined that the number of objects on the document table is one in the multiple object position determination process of S1530. In FIG. 16A (b), the object 1611 is a scan target object placed by the user. A 2D scan button 1612 for setting shooting conditions is a button for receiving a shooting instruction for a flat document. A book scan button 1613 for setting shooting conditions is a button for accepting a shooting instruction for a book document.
A 3D scan button 1614 for setting shooting conditions is a button for receiving a measurement instruction of a three-dimensional shape. A scan start button 1615 is a button for receiving an instruction to start execution of the selected scan. As described above, the user interface unit 403 detects that one of the buttons has been pressed by the user from the coordinates of the touch gesture notified from the gesture recognition unit 409 and the coordinates at which these buttons are displayed (hereinafter, referred to as the button). The description of detection by the user interface unit is omitted, and is described as “detecting a touch on a button”). The user interface unit 403 can exclusively select each of the 2D scan button 1612, the book scan button 1613, and the 3D scan button 1614. When a touch on any button of the user is detected, the touched button is set in a selected state, and the selection of other buttons is canceled.

図１６Ｂの（ｊ）Ｓ１５３０の複数物体位置判別処理で書画台上の対象物が複数と判定し、ユーザが選択した格納方法が「１ページ１ステージ」、用紙サイズがＢ４である場合の例である。書画台の上に対象物が立体物である車１６６２と印刷物１６６３の２つが置かれている。 FIG. 16B (j) is an example in which a plurality of objects on the document table are determined in the multiple object position determination process in S1530, the storage method selected by the user is “one page and one stage”, and the paper size is B4. is there. Two objects, a car 1662 whose object is a three-dimensional object, and a printed material 1663 are placed on the document table.

１ページ１ステージの格納方法の場合、ドキュメント作成時に各対象物をページ内のどこに配置するかの情報が必要となる。用紙枠１６６７および用紙枠に対する各対象物のＸＹ平面上の位置はＳ１５３０で取得しており、車１６６２の領域を示す枠１６６０と印刷物１６６３の領域を示す枠１６６１が投射されている。配置を変えたい場合は対象物を移動させることで変えることができる。物体配置修正部４１５は対象物の位置がユーザによって修正されていないかどうか監視する。確定するのはスキャン開始ボタン１６６６が押下されたときである。スキャンの種類を示す３種のボタン１６６４は、左側の車１６６２の枠１６６０に対するスキャンの種類をユーザに選択させるボタンである。この場合は、３Ｄスキャンをユーザは選択する。同様に３種のボタン１６６５は、右側の印刷物のスキャンの種類を選択させるボタンである。この場合は、２Ｄスキャンをユーザは選択する。 In the case of the storage method of one page and one stage, information on where to place each object in the page is required at the time of document creation. The position on the XY plane of the paper frame 1667 and each object with respect to the paper frame is acquired in S1530, and a frame 1660 indicating the area of the car 1662 and a frame 1661 indicating the area of the printed material 1663 are projected. If you want to change the arrangement, you can change it by moving the object. The object arrangement correcting unit 415 monitors whether or not the position of the object has been corrected by the user. The determination is made when the scan start button 1666 is pressed. Three types of buttons 1664 indicating scan types are buttons that allow the user to select a scan type for the frame 1660 of the left car 1662. In this case, the user selects 3D scanning. Similarly, the three types of buttons 1665 are buttons for selecting the type of scanning of the printed matter on the right side. In this case, the user selects 2D scanning.

Ｓ１５３２では、図１６Ａの（ｂ）の例ではスキャン開始ボタン１６１５、図１６Ｂの（ｊ）の例ではスキャン開始ボタン１６６６または位置修正ボタン１６６８へのタッチを検知するまで待つ。Ｓ１５３２でスキャン開始ボタン１６１５またはスキャン開始ボタン１６６６へのタッチを検知したらＳ１５３３へ進む。位置修正ボタン１６６８へのタッチを検知したら対象物の位置がユーザにより変えられたのでＳ１５３０に戻り処理をやり直す。 In S1532, the process waits until a touch on the scan start button 1615 is detected in the example of FIG. 16A (b) and the scan start button 1666 or the position correction button 1668 is detected in the example of FIG. 16B (j). If a touch on the scan start button 1615 or the scan start button 1666 is detected in S1532, the process proceeds to S1533. If the touch of the position correction button 1668 is detected, the position of the object has been changed by the user, so the process returns to S1530 and the process is repeated.

Ｓ１５３３では、ユーザの選択したスキャン種別を判定する。２Ｄスキャンボタン１６１２が選択状態であればＳ１５３４へ、ブックスキャンボタン１６１３が選択状態であれば、Ｓ１５３５へ、３Ｄスキャンボタン１６１４が選択状態であればＳ１５３６へ進む。Ｓ１５３４では平面原稿画像撮影部４１１の処理を実行する。Ｓ１５３５では書籍画像撮影部４１２の処理を実行する。Ｓ１５３６では立体形状測定部４１３の処理を実行する。 In S1533, the scan type selected by the user is determined. If the 2D scan button 1612 is selected, the process proceeds to S1534. If the book scan button 1613 is selected, the process proceeds to S1535. If the 3D scan button 1614 is selected, the process proceeds to S1536. In step S1534, the processing of the planar document image photographing unit 411 is executed. In step S1535, the book image photographing unit 412 is processed. In S1536, the processing of the three-dimensional shape measurement unit 413 is executed.

次にＳ１５３７で、書画台上の対象物のすべてのスキャンを実施したか判定し、すべてスキャンした場合は、スキャン実行処理を終了する。書画台上に複数の対象物を置いたケースでまだスキャンができていない対象物がある場合はＳ１５３３に戻り処理を継続する。 In step S1537, it is determined whether all the objects on the document table have been scanned. If all the objects have been scanned, the scan execution process ends. If there is an object that has not been scanned yet in the case where a plurality of objects are placed on the document table, the process returns to S1533 to continue the processing.

これらのスキャン実行処理により、すべての対象物の２次元データまたは３次元データをデータ管理部４０５がＲＡＭ３０３に格納する。また、格納方法が１ページ１ステージの場合は、原稿に対する相対的な位置も合わせて格納する。 By these scan execution processes, the data management unit 405 stores the two-dimensional data or three-dimensional data of all the objects in the RAM 303. When the storage method is one stage per page, the relative position with respect to the original is also stored.

Ｓ１５０３のスキャン実行処理を終了すると、メイン制御部４０２は続いてＳ１５０４の物体除去待ち処理を行う。図１５Ａの（ｃ）は物体除去待ち処理のフローチャートである。物体除去待ち処理を開始すると、Ｓ１５２１では図１６Ａの（ｃ）に示したスキャン終了画面を表示する。
図１６Ａの（ｃ）のスキャン終了画面では、スキャンが終了した旨をユーザに通知するメッセージ１６２１を投射する。Ｓ１５２２では、物体検知部４１０からの物体除去通知を受信するのを待つ。ここで、物体除去通知は、物体検知部４１０が図８のＳ８３４で通知するものである。Ｓ１５２２で物体除去通知があると、物体除去待ち処理を終了する。Ｓ１５０４の物体除去待ち処理を終了すると、メイン制御部４０２はＳ１５０１へ進む。 When the scan execution process in S1503 ends, the main control unit 402 subsequently performs an object removal waiting process in S1504. (C) of FIG. 15A is a flowchart of an object removal waiting process. When the object removal waiting process is started, the scan end screen shown in (c) of FIG. 16A is displayed in S1521.
In the scan end screen of (c) of FIG. 16A, a message 1621 for notifying the user that the scan has ended is projected. In step S1522, the process waits for reception of an object removal notification from the object detection unit 410. Here, the object removal notification is made by the object detection unit 410 in S834 in FIG. When there is an object removal notification in S1522, the object removal waiting process is terminated. When the object removal waiting process in S1504 ends, the main control unit 402 advances to S1501.

Ｓ１５０１では、ボタン１６２２が押下されたときは、Ｓ１５０６に進む。あらかじめ設定した待機時間を過ぎた場合は、スキャン継続とみなしＳ１５０２へ戻り、図１６Ａの（ａ）の初期画面を表示して書画台２０４への物体載置を待つ。このようにすることで、ユーザが複数の原稿をスキャンしたい場合に、書画台２０４上の原稿を取り換えたことを検知することができ、複数の原稿のスキャンを実行できる。
Ｓ１５０６では、ドキュメント作成処理を行う。図１５Ｃの（ｆ）は詳細を示すフローチャートである。
Ｓ１５６０でユーザが選択した格納方法を確認する。「１ページ１個」の場合はＳ１５６１に進み、「１ページ１ステージ」の場合はＳ１５６７に進む。
Ｓ１５６１では、ユーザが選択した用紙サイズを確認する。自動の場合はＳ１５６２に進む。それ以外の場合はＳ１５６３に進む。
Ｓ１５６２では、スキャンしたデータと同じ用紙サイズの新規ページ作成を行う。つまり対象物の大きさ（外接矩形）がＡ４であれば新規ページの用紙サイズをＡ４にする。 In S1501, when the button 1622 is pressed, the process proceeds to S1506. If the preset standby time has passed, it is regarded that the scan is continued, the process returns to S1502, the initial screen of FIG. 16A is displayed, and the object placement on the document table 204 is awaited. In this way, when the user wants to scan a plurality of documents, it can be detected that the document on the document table 204 has been replaced, and a plurality of documents can be scanned.
In step S1506, document creation processing is performed. FIG. 15C is a flowchart showing details.
In S1560, the storage method selected by the user is confirmed. In the case of “one page”, the process proceeds to S1561, and in the case of “one page and one stage”, the process proceeds to S1567.
In S1561, the paper size selected by the user is confirmed. If it is automatic, the process proceeds to S1562. Otherwise, the process proceeds to S1563.
In S1562, a new page is created with the same paper size as the scanned data. That is, if the size of the object (the circumscribed rectangle) is A4, the paper size of the new page is set to A4.

次にＳ１５６４で、データをページに配置する。ＰＤＦ（電子文書データ）であればＡ４サイズのページ辞書を作成して新規ページを作り、そこに対象物が紙や本であれば２次元データであるＪＰＥＧ画像をページ内に配置する。同様に対象物が立体物であれば３次元データであるＵ３Ｄデータをページ内に配置する。３次元データの最初に見える位置や大きさはデフォルトビューとしてＰＤＦ内の３次元注釈辞書に格納されている。
同様に用紙サイズが自動でなくひとつの定型サイズに収める場合はＳ１５６３で、対象物の大きさによらずユーザが選択した用紙サイズの新規ページを作成する。 In step S1564, data is arranged on the page. If it is PDF (electronic document data), a page dictionary of A4 size is created to create a new page. If the object is paper or a book, a JPEG image that is two-dimensional data is arranged in the page. Similarly, if the object is a three-dimensional object, U3D data that is three-dimensional data is arranged in the page. The position and size of the three-dimensional data that can be seen at the beginning are stored in the three-dimensional annotation dictionary in the PDF as a default view.
Similarly, if the paper size is not automatic but fits into one standard size, in S1563, a new page of the paper size selected by the user is created regardless of the size of the object.

次にＳ１５６５にて、データを用紙サイズに変倍してページに配置する。たとえば、対象物がＡ３の印刷物であり、ドキュメントに選択されている用紙サイズがＢ４であれば、Ａ４→Ｂ４の縮小を行いＰＤＦに格納する。 In step S1565, the data is scaled to the paper size and arranged on the page. For example, if the object is a printed matter of A3 and the paper size selected for the document is B4, the size is reduced from A4 to B4 and stored in the PDF.

Ｓ１５６６ではすべての対象物のデータを処理したか判定する。すべてを処理していない場合はＳ１５６１に戻り処理を継続する。すべて処理した場合はドキュメント作成処理を終了する。
Ｓ１５６０でユーザが選択した格納方法が１ページ１ステージである場合について説明する。Ｓ１５６７で、ユーザが選択した用紙サイズの新規ページを作成する。
次にＳ１５６８でステージを一つ選択する。 In S1566, it is determined whether data of all objects has been processed. If not all have been processed, the process returns to S1561 and continues. If all processing is completed, the document creation process is terminated.
A case where the storage method selected by the user in S1560 is one page and one stage will be described. In S1567, a new page of the paper size selected by the user is created.
In step S1568, one stage is selected.

Ｓ１５６９で、対象物のスキャン時に１ページ１ステージの格納方法の場合はステージごとに対象物のデータが記憶されているので、ステージ内の対象物のデータの情報を取り出す。ページ内の位置情報に従いＰＤＦの新規ページ内に対象物のデータを格納する。
たとえば図１６Ｂの（ｊ）のような場合は、Ｂ４用紙サイズとなる用紙枠１６６７の中に左側に車１６６２の３次元データ、右側に印刷物１６６３のＪＰＥＧデータがＰＤＦのページ内に格納される。 In S1569, in the case of the storage method of one page and one stage at the time of scanning the object, since the object data is stored for each stage, information on the object data in the stage is taken out. Data of the object is stored in a new PDF page according to the position information in the page.
For example, in the case of (j) in FIG. 16B, the three-dimensional data of the car 1662 on the left side and the JPEG data of the printed matter 1663 on the right side are stored in the PDF page in the paper frame 1667 having the B4 paper size.

次にＳ１５７０でステージ内のデータを処理したか判定する。まだ対象物のデータが残っている場合は、Ｓ１５６９に戻ってページ内にデータを配置する。すべての対象物を処理した場合は、Ｓ１５７１に進む。 In step S1570, it is determined whether the data in the stage has been processed. If there is still data on the object, the process returns to S1569 to arrange the data in the page. If all the objects have been processed, the process proceeds to S1571.

Ｓ１５７１では、すべてのステージを処理したか判定する。ステージが残っている場合は、Ｓ１５６８に戻って処理を継続する。残っていない場合は、ドキュメント作成処理を終了する。 In S1571, it is determined whether all stages have been processed. If the stage remains, the process returns to S1568 and continues. If not, the document creation process is terminated.

以上のように、第１実施形態ではユーザが平面原稿のスキャンを行うか、厚みのある書籍のスキャンを行うか、立体形状測定をおこなうかを選択できるようにした。なお、スキャンのモードが３種類すべて必要無い場合、例えば、ユーザの設定等により平面原稿のスキャンと厚みのある書籍のスキャンの２種類を実行すれば良い場合も考えられる。その場合、実行する２つのスキャンを選択できるように表示を行えばよい。
具体的には、図１６Ａの（ｂ）において２Ｄスキャンボタン１６１２、ブックスキャンボタン１６１３、スキャン開始ボタン１６１５のみを投射することにより、２種類のスキャンを選択するユーザの入力を受け付けることができる。また、スキャンのモードが１種類のみであればよい場合、例えば、ユーザの設定等により平面原稿のスキャンのみ、あるいは、書籍のスキャンのみを実行すれば良い場合も考えられる。
その場合、図１６Ａの（ｂ）においてはスキャン開始ボタン１６１５のみを投射し、ユーザのスキャン種類の選択を受け付けることなく、スキャン開始ボタン１７１５へのタッチを検知したときにスキャンを実行すれば良い。また、このようにスキャンのモードが１種類のみである場合、書画台２０４への物体の載置を検知したとき、図１６Ａの（ｂ）のようなスキャン操作画面を投射せず、すぐにスキャンを実行しても良い。 As described above, in the first embodiment, the user can select whether to scan a flat document, scan a thick book, or perform solid shape measurement. When all three types of scanning modes are not necessary, for example, it may be possible to execute two types of scanning of a flat document and a thick book according to user settings or the like. In that case, display may be performed so that two scans to be executed can be selected.
Specifically, by projecting only the 2D scan button 1612, the book scan button 1613, and the scan start button 1615 in (b) of FIG. 16A, it is possible to accept an input from the user who selects two types of scans. In addition, when only one type of scan mode is required, for example, only a flat document scan or only a book scan may be executed according to a user setting or the like.
In that case, in FIG. 16A (b), only the scan start button 1615 is projected, and the scan may be executed when a touch on the scan start button 1715 is detected without accepting the selection of the scan type by the user. Further, when there is only one type of scan mode as described above, when the placement of an object on the document table 204 is detected, the scan operation screen as shown in FIG. May be executed.

〔第２実施形態〕
第１実施形態の構成のカメラスキャナにおいて、物体配置表示部４１４は書画台２０４上にドキュメントの用紙サイズを示す枠もしくは対象物の占める領域を示す枠を表示する。
図１６Ｂの（ｊ）では用紙枠１６６７、対象物の領域を示す枠１６６０、枠１６６１である。
対象物が立体物である場合、プロジェクタ部２０７からの枠の投射が立体物にかかって正しく枠が投射できないことがある。
第１実施形態では、それらの枠を四角形により投射して示したが、領域を示せればどのような方法でも構わない。 [Second Embodiment]
In the camera scanner having the configuration of the first embodiment, the object arrangement display unit 414 displays a frame indicating the paper size of the document or a frame indicating the area occupied by the object on the document table 204.
In FIG. 16B (j), there are a paper frame 1667, a frame 1660 indicating the area of the object, and a frame 1661.
When the object is a three-dimensional object, the projection of the frame from the projector unit 207 may be applied to the three-dimensional object and the frame may not be projected correctly.
In the first embodiment, the frames are projected and shown as squares, but any method may be used as long as the area can be shown.

例えば、図１７の（ａ）の４本の直線１７００で囲まれた領域を用紙枠として示しても良い。立体物に投影がかかってしまっても、直線を書画台いっぱいに投射することによりユーザが用紙サイズや対象物の占める領域を容易に知ることができる。 For example, an area surrounded by four straight lines 1700 in FIG. 17A may be shown as a paper frame. Even if the projection is applied to the three-dimensional object, the user can easily know the paper size and the area occupied by the object by projecting the straight line to the full document table.

〔第３実施形態〕
第１実施形態の構成を備えるカメラスキャナにおいて、物体配置表示部４１４は書画台２０４上にドキュメントの用紙サイズを示す枠もしくは対象物の占める領域を示す枠を表示する。図１６Ｂの（ｊ）では用紙枠１６６７、対象物の領域を示す枠１６６０、枠１６６１である。
対象物が立体物である場合、プロジェクタ部２０７からの枠の投射が立体物にかかって正しく枠が投射できないことがある。
第１実施形態では、それらの枠を四角形により投射して示したが、領域を示せればどのような方法でも構わない。 [Third Embodiment]
In the camera scanner having the configuration of the first embodiment, the object arrangement display unit 414 displays a frame indicating the paper size of the document or a frame indicating the area occupied by the object on the document table 204. In FIG. 16B (j), there are a paper frame 1667, a frame 1660 indicating the area of the object, and a frame 1661.
When the object is a three-dimensional object, the projection of the frame from the projector unit 207 may be applied to the three-dimensional object and the frame may not be projected correctly.
In the first embodiment, the frames are projected and shown as squares, but any method may be used as long as the area can be shown.

たとえば、図１７の（ｂ）のように用紙枠の外側に領域を示すマーカー１７１０を示して補助しても良い。立体物に投影がかかってしまっても、マーカー１７１０を書画台の端の方に投射することによりユーザが用紙サイズや対象物の占める領域を容易に知ることができる。 For example, as shown in FIG. 17B, a marker 1710 indicating a region may be provided outside the paper frame to assist. Even if the projection is applied to the three-dimensional object, the user can easily know the paper size and the area occupied by the object by projecting the marker 1710 toward the end of the document table.

〔第４実施形態〕
第１実施形態の構成を備えるカメラスキャナにおいて、物体配置表示部４１４は書画台２０４上にドキュメントの用紙サイズを示す枠もしくは対象物の占める領域を示す枠を表示する。図１６Ｂの（ｊ）では用紙枠１６６７、対象物の領域を示す枠１６６０、枠１６６１である。
対象物が立体物である場合、プロジェクタ部２０７からの枠の投射が立体物にかかって正しく枠が投射できないことがある。
第１実施形態では、それらの枠を四角形により投射して示したが、領域を示せればどのような方法でも構わない。
たとえば、書画台２０４上にあらかじめ良く使う用紙サイズを印刷しておく、または印刷した紙などを敷いておいてもかまわない。
たとえば、Ａ４ポートレートの枠が書画台２０４に印刷されている場合で説明する。
ユーザが選択した用紙サイズがＡ４の場合は、書画台にあらかじめ図１７の（ｃ）のようにＡ４枠が印刷されているため、立体物やプロジェクタの位置によらず確認することが可能である。
ただし、その場合にユーザがＡ４サイズを選ばなかったときに問題となる。 [Fourth Embodiment]
In the camera scanner having the configuration of the first embodiment, the object arrangement display unit 414 displays a frame indicating the paper size of the document or a frame indicating the area occupied by the object on the document table 204. In FIG. 16B (j), there are a paper frame 1667, a frame 1660 indicating the area of the object, and a frame 1661.
When the object is a three-dimensional object, the projection of the frame from the projector unit 207 may be applied to the three-dimensional object and the frame may not be projected correctly.
In the first embodiment, the frames are projected and shown as squares, but any method may be used as long as the area can be shown.
For example, a frequently used paper size may be printed in advance on the document table 204 or a printed paper may be laid.
For example, a case where an A4 portrait frame is printed on the document table 204 will be described.
When the paper size selected by the user is A4, since the A4 frame is printed in advance on the document table as shown in FIG. 17C, it can be confirmed regardless of the position of the three-dimensional object or the projector. .
However, in this case, it becomes a problem when the user does not select the A4 size.

そこで、本実施形態では、ユーザが選択した用紙サイズが書画台に印刷されている用紙枠の大きさと異なる場合のみ第１実施形態の図１５Ｂの（ｅ）のＳ１５４６において用紙サイズを示す枠を投影することを特徴とする。 Therefore, in this embodiment, only when the paper size selected by the user is different from the size of the paper frame printed on the document table, a frame indicating the paper size is projected in S1546 of FIG. 15B (e) of the first embodiment. It is characterized by doing.

このように立体物に投影がかかってしまうような場合でも、Ａ４サイズなど良く使う用紙サイズの枠を書画台にあらかじめ印刷しておき、Ａ４と異なる用紙サイズがユーザに選択されたときのみ用紙枠を投影することで、立体物を置いても多くの場合にユーザが用紙サイズ領域を正しく知ることができる。 Even when a three-dimensional object is projected in this way, a frame of a frequently used paper size, such as A4 size, is printed in advance on the document table, and the paper frame only when a paper size different from A4 is selected by the user. In many cases, even if a three-dimensional object is placed, the user can correctly know the paper size area.

〔第５実施形態〕
第１実施形態では物体配置修正部４１５がスキャン実行処理Ｓ１５０３のＳ１５３２において、用紙枠に対する位置を読み取り、ＰＤＦなどのドキュメント作成時に同様の位置関係の配置で構成されたページを持つようにドキュメントを作成した。配置を決定する手段は、スキャン実行時でなくても同様の効果が得られる。 [Fifth Embodiment]
In the first embodiment, the object placement correcting unit 415 reads a position with respect to the paper frame in S1532 of the scan execution process S1503, and creates a document having pages configured in the same positional relationship when creating a document such as PDF. did. The means for determining the arrangement can obtain the same effect even when the scan is not executed.

Ｓ１５０６のドキュメント作成処理において図１７の（ｄ）に示すように対象物のデータのサムネール画像１７３０を書画台の上部に並べ、書画台の下部に用紙枠１７３４を投射する。指１７３１のようにサムネールにタッチして指１７３３の位置にドラッグすることにより用紙枠１７３４に対してサムネール画像１７３２の位置に対象物のデータを配置するように示しても良い。
また、ユーザによるマーカーの指示動作を認識して、当該移動指示を認識することに応じて、ページ領域を調整するように構成してもよい。 In the document creation processing of S1506, as shown in FIG. 17D, thumbnail images 1730 of the object data are arranged on the upper part of the document table, and a paper frame 1734 is projected on the lower part of the document table. It may be shown that the data of the object is arranged at the position of the thumbnail image 1732 with respect to the paper frame 1734 by touching the thumbnail like the finger 1731 and dragging it to the position of the finger 1733.
Further, the page area may be adjusted in accordance with recognizing the movement of the marker by the user and recognizing the movement instruction.

本発明の各工程は、ネットワーク又は各種記憶媒体を介して取得したソフトウエア（プログラム）をパソコン（コンピュータ）等の処理装置（ＣＰＵ、プロセッサ）にて実行することでも実現できる。 Each process of the present invention can also be realized by executing software (program) acquired via a network or various storage media by a processing device (CPU, processor) such as a personal computer (computer).

本発明は上記実施形態に限定されるものではなく、本発明の趣旨に基づき種々の変形（各実施形態の有機的な組合せを含む）が可能であり、それらを本発明の範囲から除外するものではない。 The present invention is not limited to the above embodiment, and various modifications (including organic combinations of the embodiments) are possible based on the spirit of the present invention, and these are excluded from the scope of the present invention. is not.

１０１カメラスキャナ
２０１コントローラ部
２０２カメラ部
２０４書画台
２０７プロジェクタ
２０８距離画像センサ部 DESCRIPTION OF SYMBOLS 101 Camera scanner 201 Controller part 202 Camera part 204 Document stand 207 Projector 208 Distance image sensor part

Claims

A mounting means for mounting a plurality of objects;
A camera that shoots a plurality of objects placed on the placing means and generates a camera image;
First distance image generation means for generating a distance image from an infrared image obtained by photographing an infrared pattern irradiated to a plurality of objects placed on the placement means;
Projection means for projecting a predetermined measurement pattern or user interface onto the placement means;
A setting for setting a storage format indicating whether a plurality of objects placed on the placing means are combined into one page and stored according to an instruction to the user interface, or each object is separated and stored one page at a time Means,
Control means for switching control between processing for converting a plurality of objects into electronic document data obtained by page composition based on the storage format set by the setting means, and processing for converting each object into electronic document data separated from each other;
A scanner system comprising:

2. The scanner system according to claim 1, further comprising a first acquisition unit configured to acquire a planar document image associated with the object placed on the placement unit from the camera image according to an instruction to the user interface. .

The apparatus according to claim 1, further comprising: a second acquisition unit configured to acquire a book image associated with the object placed on the placement unit from the camera image and the distance image according to an instruction to the user interface. Item 3. The scanner system according to Item 2.

Second distance image generation means for generating a distance image from a camera image obtained by photographing the predetermined pattern projected by the projection means on the object in accordance with an instruction to the user interface;
Third acquisition means for acquiring a three-dimensional point group for performing a predetermined coordinate conversion process on the distance image generated by the second distance image generation means and specifying the three-dimensional shape of the object placed on the placement means. The scanner system according to any one of claims 1 to 3, further comprising:

5. The scanner system according to claim 1, wherein the projecting unit projects a page area set in the electronic document data onto the placing unit.

5. The scanner system according to claim 1, wherein the projection unit projects a user interface for setting an imaging condition for imaging the placement unit.

5. The projector according to claim 1, wherein the projecting unit projects a marker for specifying a page region in which the planar object and the solid object placed on the placing unit are placed. Scanner system.

Recognizing means for analyzing a distance image generated by the first distance image generating means and recognizing an instruction operation by an operator for the user interface;
The scanner system according to claim 7, further comprising: an adjusting unit that adjusts the page area in response to the recognition unit recognizing a movement instruction for the marker projected by the projection unit.

Determining means for determining whether there is a blind spot area in which the entire one object cannot be imaged due to the positional relationship with each object placed on the placing means;
When it is determined by the determination means that there is a blind spot area, the imaging control means for photographing the object placed on the placement means by the camera by rotating the placement means relative to the placement plane;
The scanner system according to any one of claims 1 to 4, further comprising:

Determination means for determining whether an object is placed on the placement means based on a camera image of the placement means taken by the camera and a camera image stored in advance;
The scanner system according to claim 9, wherein when it is determined that no object is placed on the placement unit, the shooting control unit does not perform shooting of the blind spot area.

When it is determined by the determination means that there is a blind spot area, the determination means for determining whether the object has been moved from the placement position of the placement means to another placement position;
When it is determined that the three-dimensional object has been moved from the placement position of the placement means to another placement position, the shooting control means causes the camera to take a picture of the object placed on the placement means. The scanner system according to claim 9, wherein:

A placement unit for placing a plurality of objects, a camera for capturing a plurality of objects placed on the placement unit and generating a camera image, and projecting a predetermined measurement pattern or user interface on the placement unit A data processing method for a scanner system, comprising:
A first distance image generating step for generating a distance image from an infrared image obtained by photographing an infrared pattern irradiated on a plurality of objects placed on the placing means;
A projection step in which the projection means projects a predetermined measurement pattern or user interface on the placement means;
A setting for setting a storage format indicating whether a plurality of objects placed on the placing means are combined into one page and stored according to an instruction to the user interface, or each object is separated and stored one page at a time Process,
Based on the storage format set in the setting step, a control step for switching and controlling a process for converting a plurality of objects into electronic document data obtained by combining pages and a process for converting each object into electronic document data separated from each other;
A data processing method for a scanner system comprising:

A program for causing a computer to execute the data processing method of the scanner system according to claim 12.