JP2021140429A

JP2021140429A - Three-dimensional model generation method

Info

Publication number: JP2021140429A
Application number: JP2020037165A
Authority: JP
Inventors: 南司; Nan Si; 智紀盛合; Tomoki Moriai
Original assignee: NTT Data Corp
Current assignee: NTT Data Group Corp
Priority date: 2020-03-04
Filing date: 2020-03-04
Publication date: 2021-09-16

Abstract

To generate a three-dimensional model in which a region where the existence or absence of an object is unknown can be identified. SOLUTION: A point cloud generation unit generates point cloud data representing the three-dimensional shape of a monitored object on the basis of an image data set. A registration unit aligns the point cloud data with reference point cloud data. A voxelization unit generates voxel data on the basis of the point cloud data. A pixel position estimation unit estimates, for each piece of image data in the image data set, the world coordinate values of each pixel of the image. A label update unit calculates, for each pixel of each piece of image data in the image data set, a camera line-of-sight line passing through the pixel and the camera center, and updates the labels of those voxels indicated by the voxel data that the camera line-of-sight line crosses. A voxel comparison unit compares the voxel data with reference voxel data. SELECTED DRAWING: Figure 2

Description

The present invention relates to a method of generating a 3D model.

Conventionally, a method is known in which a moving image is captured with a camera mounted on a drone or a vehicle and 3D point cloud data is generated from the captured moving image. As one use of such 3D point cloud data, Patent Document 1, for example, describes a method of accurately generating a polygon model by analyzing a three-dimensional point cloud.

Japanese Patent No. 6080640

Another use of 3D point cloud data is the detection of abnormal locations. In this use, 3D point cloud data representing the monitored object at the time of inspection is compared with 3D point cloud data representing the monitored object in its normal state, and the differences are detected as abnormal locations. Similarity is generally judged on the basis of distances between points.

However, abnormality detection using 3D point cloud data suffers from the occlusion problem. The occlusion problem is that, for a region of the 3D point cloud that contains no points, it is impossible to distinguish whether no object exists there in the first place, or whether the region was hidden by a wall or the like and could not be photographed, so that the existence of an object there is unknown. If an object could not be photographed because it was hidden by a wall or the like, and the obstruction is later removed, the object will be detected as an abnormal location even though it had been there all along.

The present invention has been made in view of these circumstances, and its object is to generate a 3D model in which regions where the existence or absence of an object is unknown can be identified.

To solve the above problem, the 3D model generation method according to the present invention is a 3D model generation method executed by a computer, comprising: a first step of setting, based on an image group generated by continuously photographing a monitored object with a camera from many points, a point cloud that is represented in a world coordinate system and represents the three-dimensional shape of the monitored object; a second step of setting a voxel group that is represented in the world coordinate system and occupies the region of the three-dimensional shape represented by the point cloud; a third step of converting, for each pixel of a first image included in the image group, a coordinate value represented in an image coordinate system into a coordinate value represented in the world coordinate system; a fourth step of specifying, for each pixel whose coordinate value has been converted, a camera line-of-sight line passing through the pixel and the camera; and a fifth step of, for each specified camera line-of-sight line that intersects a plurality of voxels of the voxel group, classifying into a first category the voxel that, among those voxels, contains at least a predetermined number of points of the point cloud and is closest to the camera, while classifying the other voxels into a second category. The first category indicates that an object exists in the region occupied by a voxel classified into it, and the second category indicates that it is unknown whether an object exists in the region occupied by a voxel classified into it.

In a preferred aspect, in the fifth step, for each specified camera line-of-sight line, when the line intersects a voxel of the voxel group that does not contain at least the predetermined number of points of the point cloud, that voxel is classified into a third category. The third category indicates that no object exists in the region occupied by a voxel classified into it.
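The three categories can be illustrated with a minimal Python sketch for a single camera line-of-sight line. It rests on one reading of the steps above, which is an assumption rather than something the text states: the voxels crossed by the line are visited from nearest to farthest, under-populated voxels in front of the first sufficiently populated voxel become "empty", that voxel becomes "occupancy", and everything behind it keeps its previous label. The function name and the list-based representation are hypothetical.

```python
from typing import List

UNKNOWN, EMPTY, OCCUPANCY = "unknown", "empty", "occupancy"


def label_along_ray(point_counts: List[int], labels: List[str],
                    min_points: int) -> List[str]:
    """Update labels of the voxels crossed by one line of sight.

    point_counts and labels are ordered from the voxel nearest the camera
    to the farthest one (assumed ordering, not stated in the text).
    """
    updated = list(labels)
    for i, count in enumerate(point_counts):
        if count >= min_points:
            # First voxel holding enough points: the surface that was seen.
            updated[i] = OCCUPANCY
            break  # voxels behind it are hidden and keep their old label
        # The line passed through this voxel without hitting anything.
        updated[i] = EMPTY
    return updated
```

With point counts `[0, 0, 5, 0, 7]`, a threshold of 3, and all labels initially "unknown", the first two voxels become "empty", the third becomes "occupancy", and the last two stay "unknown".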

In a further preferred aspect, in the third step, for each pixel of a plurality of images included in the image group, the coordinate value represented in the image coordinate system is converted into a coordinate value represented in the world coordinate system.

In a further preferred aspect, the 3D model generation method further includes a sixth step of comparing, between the voxel group whose voxels have each been classified into a category and another voxel group whose voxels have each been classified into a category, the categories of voxels that share the same coordinate values, and outputting information indicating the result of the comparison.
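The comparison in the sixth step can be sketched as follows. This is a minimal illustration that assumes each voxel group is held as a dictionary from a coordinate tuple to a category string; the function name and the output format are hypothetical, and how a difference is interpreted afterwards is left open.

```python
from typing import Dict, List, Tuple

Index = Tuple[float, float, float]


def compare_voxel_labels(current: Dict[Index, str],
                         reference: Dict[Index, str]
                         ) -> List[Tuple[Index, str, str]]:
    """Return (index, reference_label, current_label) for every voxel that
    exists in both groups and whose category differs."""
    differences = []
    # Only voxels with a common coordinate value are compared.
    for index in sorted(current.keys() & reference.keys()):
        if current[index] != reference[index]:
            differences.append((index, reference[index], current[index]))
    return differences
```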

According to the present invention, it is possible to generate a 3D model in which regions where the existence or absence of an object is unknown can be identified.

Block diagram showing the functional configuration of the abnormality detection system 1
Flow diagram showing the abnormality detection process
Diagram showing an example of the point cloud data D2
Diagram showing an example of a 3D point cloud
Flow diagram showing the voxel data generation process
Diagram showing an example of a voxel group
Diagram showing an example of the voxel data D4
Diagram showing an example of an image
Flow diagram showing the coordinate conversion process
Diagram showing an example of the camera attribute data D5
Diagram showing an example of the pixel position data D6
Flow diagram showing the label update process
Diagram showing an example of a state in which the camera is not included in the rectangular parallelepiped
Diagram showing an example of a state in which the camera is included in the rectangular parallelepiped
Flow diagram showing the first update process
Plan view showing an example of a voxel group
Flow diagram showing the second update process
Diagram showing the label update process
Diagram showing an example of a screen

1. Embodiment
An abnormality detection system 1 according to an embodiment of the present invention will be described with reference to the drawings. The abnormality detection system 1 generates a 3D model of a monitored object and compares it with a 3D model of the monitored object in its normal state, thereby detecting abnormalities of the monitored object. The abnormality detection system 1 is composed of one or more information processing devices.

FIG. 1 is a block diagram showing the functional configuration of the abnormality detection system 1. The abnormality detection system 1 shown in the figure has the following functions: an image data storage unit 101, a point cloud generation unit 102, a point cloud data storage unit 103, a reference point cloud data storage unit 104, a registration unit 105, a voxelization unit 106, a voxel data storage unit 107, a camera attribute data storage unit 108, a pixel position estimation unit 109, a pixel position data storage unit 110, a label update unit 111, a reference voxel data storage unit 112, and a voxel comparison unit 113.

Of these functions, the storage units are realized by a storage device such as an HDD. The other functions are realized by an arithmetic processing unit such as a CPU executing an abnormality detection program stored in the storage device.

The abnormality detection process executed using these functions is described below. FIG. 2 is a flow diagram showing this abnormality detection process.

The point cloud generation unit 102 of the abnormality detection system 1 generates point cloud data D2 representing the three-dimensional shape of the monitored object based on the image data set D1 stored in the image data storage unit 101 (step Sa1). The image data set D1 stored in the image data storage unit 101 is a set of image data generated by continuously photographing the monitored object with a camera from many points at the time of inspection. In other words, it is video data. The image data set D1 is generated, for example, by a camera attached to a moving body such as an aircraft or a vehicle.

Specifically, in step Sa1 the point cloud generation unit 102 uses SfM (Structure from Motion) and MVS (Multi-view Stereo) techniques to reconstruct, from the image data set D1, a 3D point cloud representing the three-dimensional shape of the monitored object. It then generates the point cloud data D2 by converting the relative coordinate values of the reconstructed 3D point cloud into world coordinate values based on ground control points specified by the user.

FIG. 3 is a diagram showing an example of the point cloud data D2. The point cloud data D2 shown in the figure consists of a point ID, coordinate values, and color information for each point of the point cloud. The point ID identifies the point, the coordinate values are coordinate values in the world coordinate system, and the color information is an RGB value.

FIG. 4 is a diagram showing an example of the 3D point cloud represented by the point cloud data D2. The position of each point of the 3D point cloud PG shown in the figure is expressed by world coordinate values (X, Y, Z).

After generating the point cloud data D2, the point cloud generation unit 102 stores it in the point cloud data storage unit 103.

The point cloud generation unit 102 is realized, for example, by running Pix4D (registered trademark) mapper.

When the point cloud data D2 has been generated, the registration unit 105 uses the ICP (Iterative Closest Point) algorithm to align the generated point cloud data D2 with the reference point cloud data D3 stored in advance in the reference point cloud data storage unit 104 (step Sa2). The reference point cloud data D3 is data of a 3D point cloud representing the three-dimensional shape of the monitored object in its normal state. Its data structure is the same as that of the point cloud data D2 illustrated in FIG. 3.

Specifically, the registration unit 105 calculates the amount of deviation between the point cloud data D2 and the reference point cloud data D3, and corrects the coordinate values of each point of the point cloud data D2 so as to minimize the calculated deviation.
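As an illustration of this kind of alignment, the following is a minimal point-to-point ICP sketch in Python with NumPy. It is not the registration unit's actual implementation: the brute-force nearest-neighbour search, the SVD-based rigid fit, the fixed iteration count, and all function names are assumptions chosen for brevity, and the text does not say which ICP variant is used.

```python
import numpy as np


def best_rigid_transform(src: np.ndarray, dst: np.ndarray):
    """Least-squares rotation R and translation t such that R @ src_i + t
    approximates dst_i (SVD-based solution for paired points)."""
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    H = (src - src_c).T @ (dst - dst_c)          # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    # Guard against a reflection being returned instead of a rotation.
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t


def icp(points: np.ndarray, reference: np.ndarray,
        iterations: int = 20) -> np.ndarray:
    """Return a copy of points (N x 3) aligned to reference (M x 3)."""
    aligned = points.astype(float)
    for _ in range(iterations):
        # Pair every point with its closest reference point (brute force).
        d2 = ((aligned[:, None, :] - reference[None, :, :]) ** 2).sum(axis=2)
        nearest = reference[d2.argmin(axis=1)]
        # Move the points so that the paired deviation is minimized.
        R, t = best_rigid_transform(aligned, nearest)
        aligned = aligned @ R.T + t
    return aligned
```

The brute-force pairing costs O(N x M) memory and time per iteration, so a real point cloud would need a spatial index such as a k-d tree; the sketch only shows the correct-and-minimize loop described above.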

When the alignment is complete, the voxelization unit 106 generates voxel data D4 based on the point cloud data D2 (step Sa3). FIG. 5 is a flow diagram showing this voxel data generation process.

In the voxel data generation process shown in the figure, the voxelization unit 106 identifies the maximum and minimum of the coordinate value X, the maximum and minimum of the coordinate value Y, and the maximum and minimum of the coordinate value Z in the point cloud data D2 (step Sb1). In other words, it identifies the length, width, and height of the rectangular parallelepiped that circumscribes the three-dimensional shape represented by the point cloud data D2. It then sets a voxel group that partitions the world coordinate space and occupies at least the region of this rectangular parallelepiped (step Sb2). Each voxel of the voxel group has a predetermined size, and each face of the rectangular parallelepiped formed by the voxel group is parallel to one of the coordinate axes of the world coordinate system.
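Steps Sb1 and Sb2 can be sketched in a few lines of Python. This is a minimal illustration: the function names are hypothetical, and the choice to anchor the grid at the minimum corner and round the voxel count up on each axis is an assumption about how "at least the region of the rectangular parallelepiped" is covered.

```python
import math
from typing import Iterable, Tuple

Point = Tuple[float, float, float]


def bounding_box(points: Iterable[Point]) -> Tuple[Point, Point]:
    """Step Sb1: minimum and maximum of X, Y and Z over the point cloud."""
    xs, ys, zs = zip(*points)
    return (min(xs), min(ys), min(zs)), (max(xs), max(ys), max(zs))


def grid_shape(box_min: Point, box_max: Point,
               voxel_size: float) -> Tuple[int, ...]:
    """Step Sb2: number of fixed-size voxels needed along each axis so that
    the axis-aligned grid covers the whole bounding box (at least one)."""
    return tuple(max(1, math.ceil((hi - lo) / voxel_size))
                 for lo, hi in zip(box_min, box_max))
```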

FIG. 6 is a diagram showing an example of this voxel group. The voxel group shown in the figure consists of 27 voxels, which together form a cube CB. The cube CB occupies the region of the 3D point cloud PG represented by the point cloud data D2. The faces of the cube CB are each parallel to one of the coordinate axes of the world coordinate system. For example, the face PL1 shown in FIG. 6 is parallel to the X axis.

After setting the voxel group, the voxelization unit 106 generates voxel data D4 for it (step Sb3). FIG. 7 is a diagram showing an example of the voxel data D4. The voxel data D4 shown in the figure consists of an index, a label, and a point count for each voxel of the voxel group. The index is a coordinate value in the world coordinate system that identifies the voxel. More specifically, it is the coordinate value of the vertex that, among the eight vertices of the voxel, has the largest coordinate value Y and the smallest coordinate value Z. For example, the index of the voxel VX1 shown in FIG. 6 is the coordinate value of the vertex PT1.

The label is information indicating whether an object exists in the region occupied by the voxel, or whether the existence of an object there is unknown. There are three labels: "unknown", "empty", and "occupancy". The label "unknown" indicates that the region occupied by the voxel was hidden by a wall or the like and could not be photographed, so that it is unknown whether an object exists there. "unknown" is the initial value. The label "empty" indicates that no object exists in the region occupied by the voxel. The label "occupancy" indicates that an object exists in the region occupied by the voxel.

The point count is the number of points of the point cloud data D2 that are contained in the voxel.
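The voxel records described above (index, label, point count) can be sketched in Python as follows. To stay short, the sketch identifies a voxel by an integer grid cell `(i, j, k)` instead of the world-coordinate vertex used as the index in the text; that simplification, the function name, and the dictionary layout are all assumptions.

```python
import math
from collections import Counter
from typing import Dict, Iterable, Tuple

Point = Tuple[float, float, float]
Cell = Tuple[int, int, int]


def build_voxel_data(points: Iterable[Point], box_min: Point,
                     shape: Cell, voxel_size: float) -> Dict[Cell, dict]:
    """Create one record per voxel with the initial label "unknown" and the
    number of points that fall inside the voxel."""
    counts: Counter = Counter()
    for p in points:
        # Grid cell containing the point; points lying on the far boundary
        # of the grid are clamped into the last cell.
        cell = tuple(min(n - 1, max(0, math.floor((c - lo) / voxel_size)))
                     for c, lo, n in zip(p, box_min, shape))
        counts[cell] += 1
    return {
        (i, j, k): {"label": "unknown", "points": counts.get((i, j, k), 0)}
        for i in range(shape[0])
        for j in range(shape[1])
        for k in range(shape[2])
    }
```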

After generating the voxel data D4, the voxelization unit 106 stores it in the voxel data storage unit 107.

When the voxel data generation process is complete, the pixel position estimation unit 109 estimates, for each piece of image data in the image data set D1, the world coordinate values of each pixel of the image (step Sa4). More specifically, it converts the image coordinate values of each pixel of the image into world coordinate values. An image coordinate value is a coordinate value indicating a position on the plane expressed in the image coordinate system.

FIG. 8 is a diagram showing an example of an image represented by image data. The image IM shown in the figure was generated by photographing the monitored object with the camera CA. The monitored object shown in the image IM is located at the position of the cube CB in the figure. An image coordinate system is defined for the image IM. This image coordinate system has its origin at the upper left corner, a U axis extending horizontally, and a V axis extending vertically.

FIG. 9 is a flow diagram showing the coordinate conversion process executed by the pixel position estimation unit 109.

In the coordinate conversion process shown in the figure, the pixel position estimation unit 109 selects a piece of image data from the image data set D1 (step Sc1). It then refers to the camera attribute data D5 stored in the camera attribute data storage unit 108 and identifies the camera parameters corresponding to the selected image data (step Sc2). The camera attribute data D5 is data indicating the parameters of the camera that generated the image data set D1.

FIG. 10 is a diagram showing an example of the camera attribute data D5. The camera attribute data D5 shown in the figure consists of a plurality of pairs, each made up of the image ID of a piece of image data in the image data set D1 and the camera parameters at the time that image data was generated. The camera parameters include the focal length and the image center as internal parameters, and the rotation axis angles and the translation matrix as external parameters.

After identifying the parameters, the pixel position estimation unit 109 selects a pixel from the selected image data (step Sc3). It then substitutes the image coordinate values of the selected pixel and the identified parameters into the following equation (1) to calculate the world coordinate values of the pixel (step Sc4).

In equation (1), U and V are the image coordinate values. fx and fy are the focal lengths of the camera along the horizontal and vertical axes. Cx and Cy give the position, in the image coordinate system, of the intersection of the optical axis and the image plane (the image center). r11 to r33 are the components of the rotation matrix representing the rotation axis angles of the camera. t1 to t3 are the components of the matrix representing the translation of the camera. X, Y, and Z are the world coordinate values.
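Equation (1) itself is not reproduced in this text. A form consistent with the symbol definitions above is the standard pinhole projection model, given here as an assumption rather than as the original equation; the scale factor s is introduced for illustration and is not among the symbols defined in the text.

```latex
s \begin{pmatrix} U \\ V \\ 1 \end{pmatrix}
=
\begin{pmatrix} f_x & 0 & C_x \\ 0 & f_y & C_y \\ 0 & 0 & 1 \end{pmatrix}
\begin{pmatrix}
r_{11} & r_{12} & r_{13} & t_1 \\
r_{21} & r_{22} & r_{23} & t_2 \\
r_{31} & r_{32} & r_{33} & t_3
\end{pmatrix}
\begin{pmatrix} X \\ Y \\ Z \\ 1 \end{pmatrix}
```

In this form, recovering (X, Y, Z) from (U, V) requires one additional constraint, such as the depth of the pixel along the optical axis. How that constraint is supplied is not stated in this part of the description.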

After calculating the world coordinate values, the pixel position estimation unit 109 stores them in the pixel position data storage unit 110 (step Sc5). The calculated world coordinate values are stored in association with the image coordinate values of the pixel and the image ID of the selected image data. The pixel position data storage unit 110 stores pixel position data D6. FIG. 11 is a diagram showing an example of the pixel position data D6. The pixel position data D6 shown in the figure consists of an image ID and the image coordinate values and world coordinate values of a pixel.

When the world coordinate values have been stored, it is determined whether any unselected pixels remain (step Sc6). If unselected pixels remain (YES in step Sc6), the process returns to step Sc3. If no unselected pixels remain (NO in step Sc6), it is determined whether any unselected image data remains (step Sc7). If unselected image data remains (YES in step Sc7), the process returns to step Sc1. If no unselected image data remains (NO in step Sc7), the coordinate conversion process ends. The result of the coordinate conversion process is the pixel position data D6.

When the coordinate conversion process is complete, the label update unit 111 calculates, for each pixel of each piece of image data in the image data set D1, a camera line-of-sight line passing through the pixel and the camera center, and updates the labels of those voxels represented by the voxel data D4 that this line intersects (step Sa5). FIG. 12 is a flow diagram showing this label update process.

In the label update process shown in the figure, the label update unit 111 selects a piece of image data from the image data set D1 (step Sd1). It then refers to the camera attribute data D5 stored in the camera attribute data storage unit 108 and identifies the coordinate values of the camera corresponding to the selected image data (step Sd2). Next, it determines whether the identified coordinate values lie within the voxel group represented by the voxel data D4 (step Sd3). In other words, it determines whether the camera is inside the rectangular parallelepiped formed by the voxel group as a whole. To do so, the label update unit 111 determines whether the following conditional expression (2) is satisfied.

In conditional expression (2), Xvmax and Xvmin are the maximum and minimum of the coordinate value X in the voxel data D4. Yvmax and Yvmin are the maximum and minimum of the coordinate value Y in the voxel data D4. Zvmax and Zvmin are the maximum and minimum of the coordinate value Z in the voxel data D4. Xc, Yc, and Zc are the coordinate values of the camera.
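Conditional expression (2) is not reproduced in this text. Given that it tests whether the camera lies inside the rectangular parallelepiped, a form consistent with the symbols defined above would be the following; whether the original inequalities are strict or non-strict is an assumption.

```latex
X_{vmin} \le X_c \le X_{vmax}
\;\land\;
Y_{vmin} \le Y_c \le Y_{vmax}
\;\land\;
Z_{vmin} \le Z_c \le Z_{vmax}
```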

If conditional expression (2) is not satisfied, that is, if the camera is not inside the rectangular parallelepiped (NO in step Sd3), the first update process is executed (step Sd4). FIG. 13 is a diagram showing an example of a state in which the camera is not inside the rectangular parallelepiped. The camera CA shown in the figure is outside the cube CB. If conditional expression (2) is satisfied, that is, if the camera is inside the rectangular parallelepiped (YES in step Sd3), the second update process is executed (step Sd5). FIG. 14 is a diagram showing an example of a state in which the camera is inside the rectangular parallelepiped. The camera CA shown in the figure is inside the cube CB.

When the first or second update process is complete, it is determined whether any unselected image data remains (step Sd6). If unselected image data remains (YES in step Sd6), the process returns to step Sd1. If no unselected image data remains (NO in step Sd6), the label update process ends.

Next, the first update process will be described. FIG. 15 is a flow diagram showing the first update process.

In the first update process shown in the figure, the label update unit 111 selects a pixel from the image data selected in step Sd1 (step Se1). It then refers to the pixel position data D6 and identifies the coordinate values (specifically, the world coordinate values) of the selected pixel (step Se2). Next, it calculates the camera line-of-sight line passing through these coordinate values and the camera coordinate values identified in step Sd2 (step Se3). The label update unit 111 calculates the camera line-of-sight line by substituting the coordinate values of the pixel and the camera into the following equation (3).

In equation (3), Xc, Yc, and Zc are the coordinate values of the camera. Xp, Yp, and Zp are the coordinate values of the pixel. t is a parameter.
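Equation (3) is not reproduced in this text. A parametric form of a straight line through the camera and the pixel that uses exactly the symbols defined above is shown below. The choice of the camera as the point at t = 0 and the pixel as the point at t = 1 is an assumption.

```latex
\begin{pmatrix} X \\ Y \\ Z \end{pmatrix}
=
\begin{pmatrix} X_c \\ Y_c \\ Z_c \end{pmatrix}
+ t
\begin{pmatrix} X_p - X_c \\ Y_p - Y_c \\ Z_p - Z_c \end{pmatrix}
```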

The camera line-of-sight line calculated here corresponds, for example, to the camera line-of-sight line L1 shown in FIG. 8.

After calculating the camera line-of-sight line, the label update unit 111 calculates the faces of the rectangular parallelepiped that the line intersects (step Se4). The faces of the rectangular parallelepiped are expressed by the following equations (4) to (9).

In equations (4) to (9), Xvmax and Xvmin are the maximum and minimum of the coordinate value X in the voxel data D4. Yvmax and Yvmin are the maximum and minimum of the coordinate value Y in the voxel data D4. Zvmax and Zvmin are the maximum and minimum of the coordinate value Z in the voxel data D4.
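Equations (4) to (9) are not reproduced in this text. Since the rectangular parallelepiped is axis-aligned, its six faces lie on the planes below; which equation number belongs to which plane cannot be recovered from this text, so the planes are listed without numbers.

```latex
X = X_{vmax}, \quad X = X_{vmin}, \quad
Y = Y_{vmax}, \quad Y = Y_{vmin}, \quad
Z = Z_{vmax}, \quad Z = Z_{vmin}
```

As an illustration, assume the line is parametrized as camera plus t times (pixel minus camera), which is itself an assumption. Substituting that line into the first plane gives

```latex
t = \frac{X_{vmax} - X_c}{X_p - X_c}
```

whose denominator is zero exactly when the line is parallel to that face.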

The label update unit 111 calculates the parameter t for each face of the rectangular parallelepiped by solving, one face at a time, the simultaneous equations formed by the equation of the calculated camera line-of-sight line and one of equations (4) to (9). If the denominator of the calculated parameter t is "0" for every face, that is, if the camera line-of-sight line does not intersect the rectangular parallelepiped (NO in step Se5), the process proceeds to step Se19. On the other hand, if the denominator of the calculated parameter t is not "0" for at least one face, that is, if the camera line-of-sight line intersects the rectangular parallelepiped (YES in step Se5), it is determined whether the number of faces for which the parameter t is not "0" (in other words, faces that the camera line-of-sight line intersects) is "2" (step Se6). If the number of such faces is "2" (YES in step Se6), the face for which the smaller parameter t was calculated is taken as the entrance surface (step Se7). In other words, the face closer to the camera is taken as the entrance surface. The entrance surface identified here corresponds to, for example, the surface PL2 shown in FIG. 8.
On the other hand, if the number of faces for which the parameter t is not "0" is greater than "2" (NO in step Se6), that is, if the camera line-of-sight line passes through a vertex of the rectangular parallelepiped, any one of the faces that the camera line-of-sight line intersects is taken as the entrance surface (step Se8).
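The face test in steps Se4 to Se8 can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: the names `face_hits` and `entrance_face` are hypothetical, the line is assumed to be parametrized with the camera at t = 0 and the pixel at t = 1, and the check that the intersection point actually lies within the face's extent is an added assumption that the text does not spell out.

```python
def face_hits(cam, pix, box_min, box_max, eps=1e-12):
    """Return sorted (t, axis, plane_value) for each box face the line crosses.

    cam, pix, box_min, box_max are (x, y, z) tuples in world coordinates.
    """
    direction = [p - c for p, c in zip(pix, cam)]
    hits = []
    for axis in range(3):
        # Denominator of t is zero: the line is parallel to this pair of faces.
        if abs(direction[axis]) < eps:
            continue
        for plane in (box_min[axis], box_max[axis]):
            t = (plane - cam[axis]) / direction[axis]
            point = [c + t * d for c, d in zip(cam, direction)]
            # Assumed extra check: the point must lie within the face's extent.
            on_face = all(
                box_min[k] - 1e-9 <= point[k] <= box_max[k] + 1e-9
                for k in range(3)
                if k != axis
            )
            if on_face:
                hits.append((t, axis, plane))
    return sorted(hits)


def entrance_face(cam, pix, box_min, box_max):
    """Pick the crossed face with the smallest t (closest to the camera)."""
    hits = face_hits(cam, pix, box_min, box_max)
    return hits[0] if hits else None
```

When the line passes through an edge or vertex, more than two faces are reported; taking the smallest t then selects one of the nearest faces, which is one way of choosing "any" intersected face.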

Having identified the entrance surface, the label update unit 111 selects one surface from among the identified entrance surface, the surface opposite to it, and the plurality of cross sections lying between the entrance surface and the opposite surface (step Se9). The label update unit 111 selects these surfaces in order, starting from the one closest to the camera. Before moving on to the next step, the opposite surface and the cross sections are explained.

FIG. 16 is a plan view showing an example of a voxel group. The voxel group shown in the figure consists of nine voxels and forms a rectangular parallelepiped RP as a whole. The rectangular parallelepiped RP formed by this voxel group intersects the camera line-of-sight line L2. Of the surfaces that intersect the camera line-of-sight line L2, the surface PL3 corresponds to the entrance surface. The surface PL4, which is parallel to the entrance surface, corresponds to the opposite surface. The cross sections CS1 and CS2 lie between the entrance surface and the opposite surface. The cross sections CS1 and CS2 are parallel to the entrance surface and the opposite surface and are spaced at intervals of one voxel length.

Having selected one of the above surfaces, the label update unit 111 determines whether the selected surface is the entrance surface (step Se10). If the selected surface is the entrance surface (YES in step Se10), step Se11 is skipped and the process proceeds to step Se12. On the other hand, if the selected surface is not the entrance surface (NO in step Se10), it is determined whether the selected surface intersects the camera line-of-sight line (step Se11). Specifically, the parameter t is calculated for the surface by solving the simultaneous equations formed by the equation of the selected surface and the equation of the calculated camera line-of-sight line. If the denominator of the calculated parameter t is "0", that is, if the selected surface does not intersect the camera line-of-sight line (NO in step Se11), the process proceeds to step Se19. On the other hand, if the denominator of the calculated parameter t is not "0", that is, if the selected surface intersects the camera line-of-sight line (YES in step Se11), the intersection point is calculated (step Se12). Specifically, the intersection point is calculated by substituting the calculated parameter t into the equation of the calculated camera line-of-sight line.

Having calculated the intersection point, the label update unit 111 refers to the voxel data D4 and identifies the voxel that has the calculated intersection point on one of its faces (step Se13). With reference to FIG. 16, when the intersection point PT2 is calculated, the voxel VX2 is identified; when the intersection point PT3 is calculated, the voxel VX3 is identified; and when the intersection point PT4 is calculated, the voxel VX4 is identified. If two voxels have the calculated intersection point on their faces, the voxel farther from the camera is selected.

Having identified the voxel, the label update unit 111 refers to the voxel data D4 and identifies the number of points contained in the selected voxel (step Se14). It then determines whether the identified number of points is "0" (step Se15). If the identified number of points is "0" (YES in step Se15), the label of the selected voxel is updated from "unknown" to "empty" (step Se16). The label "empty" indicates that no object exists in the region occupied by the voxel. The process then proceeds to step Se18. On the other hand, if the identified number of points is not "0" (NO in step Se15), the label of the selected voxel is updated from "unknown" to "occupancy" (step Se17). The label "occupancy" indicates that an object exists in the region occupied by the voxel. Step Se18 is then skipped and the process proceeds to step Se19.

Because step Se18 is skipped here, no intersection with the camera line-of-sight line is identified for the surfaces lying behind the currently selected surface. Therefore, even if a surface behind it does intersect the camera line-of-sight line, the label of the voxel having that intersection point on its face remains "unknown". This is because the region behind a voxel whose label has been updated to "occupancy" is occluded by a wall or the like and cannot be photographed, and is therefore regarded as a region in which the presence or absence of an object is unknown.
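The loop over steps Se9 to Se18, including the early exit that leaves occluded voxels "unknown", can be sketched as below. This is a simplified illustration under stated assumptions, not the patent's implementation: the function name, the dictionary-based `counts` and `labels` containers, and the cubic voxel `size` are all hypothetical, and the entrance surface is assumed to be the grid's minimum-X face with the line travelling in the +X direction.

```python
import math


def update_labels_along_line(cam, pix, origin, size, shape, counts, labels):
    """Label one voxel per X-slab along the line, stopping at the first occupied one.

    counts maps (i, j, k) -> number of points in that voxel (missing means 0).
    labels maps (i, j, k) -> label; voxels absent from labels are "unknown".
    Assumes the entrance surface is the plane x = origin[0] and pix[0] > cam[0].
    """
    d = [p - c for p, c in zip(pix, cam)]
    for i in range(shape[0]):
        # Entrance surface (i == 0) and the cross sections behind it (i >= 1).
        plane_x = origin[0] + i * size
        t = (plane_x - cam[0]) / d[0]
        y = cam[1] + t * d[1]
        z = cam[2] + t * d[2]
        j = math.floor((y - origin[1]) / size)
        k = math.floor((z - origin[2]) / size)
        if not (0 <= j < shape[1] and 0 <= k < shape[2]):
            break  # the line has left the voxel group
        idx = (i, j, k)  # the voxel on the far side of the plane
        if counts.get(idx, 0) == 0:
            labels[idx] = "empty"
        else:
            labels[idx] = "occupancy"
            break  # voxels behind an occupied voxel stay "unknown"
    return labels
```

As in the described procedure, only the voxel at each plane crossing is updated; a voxel that the line merely clips between two planes is not visited by this sketch.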

Next, in step Se18, the label update unit 111 determines whether any unselected surface remains. If an unselected surface remains (YES in step Se18), the process returns to step Se9. On the other hand, if no unselected surface remains (NO in step Se18), it is determined whether any unselected pixel remains (step Se19). If an unselected pixel remains (YES in step Se19), the process returns to step Se1. On the other hand, if no unselected pixel remains (NO in step Se19), the first update process ends.
The above is the description of the first update process.

Next, the second update process executed in step Sd5 above will be described. The second update process is executed when the camera is located inside the rectangular parallelepiped formed by the voxel group as a whole.

FIG. 17 is a flow chart showing the second update process. The second update process shown in the figure differs from the first update process only in that it includes steps Sf1 to Sf4 in place of steps Se6 to Se8. Therefore, only steps Sf1 to Sf4 are described below.

In step Sf1, the label update unit 111 determines whether the number of faces for which the parameter t is not "0" (in other words, faces that the camera line-of-sight line intersects) is "1". If the number of such faces is "1" (YES in step Sf1), that face is taken as the exit surface (step Sf2). The exit surface identified here corresponds to, for example, the surface PL5 shown in FIG. 14. On the other hand, if the number of faces for which the parameter t is not "0" is greater than "1" (NO in step Sf1), that is, if the camera line-of-sight line passes through a vertex of the rectangular parallelepiped, any one of the faces that the camera line-of-sight line intersects is taken as the exit surface (step Sf3).

Having identified the exit surface, the label update unit 111 identifies the entrance surface based on the identified exit surface (step Sf4). Specifically, among the cross sections that are parallel to the identified exit surface and lie between the exit surface and the camera, the cross section closest to the camera is identified as the entrance surface. The entrance surface identified here corresponds to, for example, the surface PL6 shown in FIG. 14.

Once the entrance surface has been identified, the processing from step Se9 onward is executed in the same manner as in the first update process.
The above is the description of the second update process.

By executing the label update process described above, the label update unit 111 calculates, as illustrated in FIG. 18, the camera line-of-sight line passing through the pixel and the camera center for a plurality of pixels of the plurality of image data constituting the image data set D1. It then executes the process of updating the labels of those voxels, among the voxels represented by the voxel data D4, that intersect the camera line-of-sight line.

When the label update process is complete, the voxel comparison unit 113 compares the voxel data D4 stored in the voxel data storage unit 107 with the reference voxel data D7 stored in advance in the reference voxel data storage unit 112 (step Sa6). Here, the reference voxel data D7 stored in advance in the reference voxel data storage unit 112 is voxel data generated for the monitored object in its normal state. The data structure of the reference voxel data D7 is the same as that of the voxel data D4 illustrated in FIG. 7.

Specifically, the voxel comparison unit 113 compares the labels of voxels that share the same index in the voxel data D4 and the reference voxel data D7. It then identifies, among the voxels represented by the voxel data D4, those whose label differs from that of the corresponding voxel represented by the reference voxel data D7.

Here, in both the voxel data D4 and the reference voxel data D7, the label "empty" is assigned to voxels occupying a region in which no object exists. Consequently, the label of such a voxel can also be compared with that of the voxel sharing its index, so that even when a new object has been added to the monitored object, the presence of that object can be detected. In the prior art, by contrast, regions of the reference data that contain no points are not subject to comparison, so even if a new object is added to the monitored object, its presence cannot be detected.
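The index-wise comparison of step Sa6 can be illustrated with a short sketch. The representation of the voxel data as dictionaries mapping an index to a label, and the function name `differing_voxels`, are assumptions made for illustration only.

```python
def differing_voxels(current, reference):
    """Return the sorted indices whose labels differ between the two voxel sets.

    current and reference map a voxel index (i, j, k) to one of
    "unknown", "empty", or "occupancy". Only shared indices are compared.
    """
    shared = current.keys() & reference.keys()
    return sorted(idx for idx in shared if current[idx] != reference[idx])
```

For example, a voxel labeled "empty" in the reference data and "occupancy" in the inspection data is reported, which corresponds to the newly added object case described above.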

Next, once the voxels have been identified, a screen showing the voxel group represented by the voxel data D4 is generated in such a way that the identified voxels are distinguishable (step Sa7). FIG. 19 shows an example of this screen. On the screen shown in the figure, the voxel group represented by the reference voxel data D7 and the voxel group represented by the voxel data D4 are displayed side by side. Each voxel in these voxel groups is marked with a letter indicating its label type. The letter "u" stands for "unknown", the letter "e" stands for "empty", and the letter "o" stands for "occupancy". Among the letters attached to the voxels, those of the identified voxels are underlined. A user viewing this screen can therefore recognize the differences between the two voxel groups. In other words, the user can recognize the differences between the monitored object in its normal state and the monitored object at the time of inspection (that is, the locations where an abnormality has occurred).

In addition, the label types assigned to the voxels include "unknown". The label "unknown" indicates that the region occupied by the voxel is occluded by a wall or the like and cannot be photographed, so that it is unknown whether an object exists in that region. The user can therefore recognize which voxels in the voxel group have an unknown object presence.

2. Modifications
The above embodiment may be modified as described below. One or more of the modifications described below may be combined with each other.

2-1. Modification 1
The label update unit 111 does not necessarily have to execute the first or second update process for all of the images constituting the image data set D1. For example, the label update unit 111 may extract image data from the image data set D1 at a predetermined sampling interval and execute the first or second update process only for the extracted image data.

Similarly, the label update unit 111 does not necessarily have to execute the first or second update process for all of the pixels constituting the image data. For example, the label update unit 111 may extract pixels from the image data at a predetermined sampling interval and execute the first or second update process only for the extracted pixels.

2-2. Modification 2
In the first or second update process described above, the label update unit 111 updates the label of a voxel to "empty" when it determines that the number of points contained in the voxel is "0", and to "occupancy" when it determines that the number is not "0". Instead, the label of the voxel may be updated to "empty" when the number of points contained in the voxel is determined to be equal to or less than a predetermined number that is "1" or more, and to "occupancy" when the number is determined to be greater than the predetermined number. That is, the threshold used to determine the label type may be changed as desired.
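The thresholded rule of this modification can be sketched as a single function. The name `classify` and the parameter name `threshold` are hypothetical; a threshold of 0 reproduces the rule used in the embodiment.

```python
def classify(point_count, threshold=0):
    """Return "empty" if the voxel holds at most `threshold` points, else "occupancy"."""
    return "empty" if point_count <= threshold else "occupancy"
```

A threshold of 1 or more may help treat a few stray points as noise rather than as an object, at the cost of missing very sparsely sampled surfaces.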

2-3. Modification 3
The method by which the voxel comparison unit 113 outputs the comparison result is not limited to the above example. The voxel comparison unit 113 may output the comparison result by any other method that allows the differences between the monitored object in its normal state and at the time of inspection to be recognized.

1 ... abnormality detection system, 101 ... image data storage unit, 102 ... point cloud generation unit, 103 ... point cloud data storage unit, 104 ... reference point cloud data storage unit, 105 ... registration unit, 106 ... voxelization unit, 107 ... voxel data storage unit, 108 ... camera attribute data storage unit, 109 ... pixel position estimation unit, 110 ... pixel position data storage unit, 111 ... label update unit, 112 ... reference voxel data storage unit, 113 ... voxel comparison unit, CA ... camera, CB ... cube, CS1, CS2 ... cross section, IM ... image, L1, L2 ... camera line-of-sight line, PG ... 3D point cloud, PL1 to PL6 ... surface, PT1 ... vertex, PT2 to PT4 ... intersection point, RP ... rectangular parallelepiped, VX1 to VX4 ... voxel

Claims

A 3D model generation method executed by a computer, comprising:
a first step of setting, based on an image group generated by continuously photographing a monitored object from a large number of points with a camera, a point cloud that is represented in a world coordinate system and represents the three-dimensional shape of the monitored object;
a second step of setting a voxel group that is represented in the world coordinate system and occupies the region of the three-dimensional shape represented by the point cloud;
a third step of converting, for each pixel of a pixel group of a first image included in the image group, coordinate values represented in an image coordinate system into coordinate values represented in the world coordinate system;
a fourth step of identifying, for each pixel of the pixel group whose coordinate values have been converted, a camera line-of-sight line passing through the pixel and the camera; and
a fifth step of, for each of the identified camera line-of-sight lines, when the camera line-of-sight line intersects a plurality of voxels in the voxel group, classifying into a first category the voxel closest to the camera among those of the plurality of voxels that contain a predetermined number or more of the points constituting the point cloud, and classifying the other voxels into a second category,
wherein the first category indicates that an object exists in the region occupied by a voxel classified into the first category, and the second category indicates that it is unknown whether an object exists in the region occupied by a voxel classified into the second category.

The 3D model generation method according to claim 1, wherein, in the fifth step, for each of the identified camera line-of-sight lines, when the camera line-of-sight line intersects a voxel that is included in the voxel group and does not contain points constituting the point cloud in excess of the predetermined number, the voxel is classified into a third category, and
the third category indicates that no object exists in the region occupied by a voxel classified into the third category.

The 3D model generation method according to claim 1 or 2, wherein, in the third step, for each pixel of the pixel groups of a plurality of images included in the image group, the coordinate values represented in the image coordinate system are converted into coordinate values represented in the world coordinate system.

The 3D model generation method according to any one of claims 1 to 3, further comprising a sixth step of comparing the categories of voxels having common coordinate values between the voxel group classified into the categories and another voxel group classified into the categories, and outputting information indicating the result of the comparison.