JP2017146708A

JP2017146708A - Object recognition device and three-dimensional point group modeling device, method and program

Info

Publication number: JP2017146708A
Application number: JP2016026848A
Authority: JP
Inventors: 直之郡司; Naoyuki Gunji; 哲也杵渕; Tetsuya Kinebuchi; 隆行黒住; Takayuki Kurozumi; 仁新垣; Jin Niigaki
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2016-02-16
Filing date: 2016-02-16
Publication date: 2017-08-24
Anticipated expiration: 2036-02-16
Also published as: JP6499599B2

Abstract

PROBLEM TO BE SOLVED: To allow an object to be accurately recognized from a three-dimensional point group.SOLUTION: A histogram generation unit 46 calculates a histogram representing a recognition target object. A box size determination unit 48 calculates a box representing a size of a three-dimensional point group representing the recognition target object, from the three-dimensional point group. A box histogram calculation unit 50 calculates a histogram representing the box in an arrangement pattern. A histogram comparison unit 52 compares a histogram calculated for each arrangement pattern and the histogram calculated by the histogram generation unit 46 to calculate a score for each arrangement pattern. An object recognition processing unit 54 determines an arrangement pattern in accordance with scores for respective arrangement patterns of the box. A coordinate restoration unit 56 restores three-dimensional coordinates where the recognition target object exists, from the arrangement pattern.SELECTED DRAWING: Figure 1

Description

本発明は、物体認識装置、３次元点群モデル化装置、方法、及びプログラムに係り、特に、環境を表す３次元点群に含まれる認識対象物を認識するための物体認識装置、３次元点群モデル化装置、方法、及びプログラムに関する。 The present invention relates to an object recognition device, a three-dimensional point cloud modeling device, a method, and a program, and more particularly to an object recognition device and a three-dimensional point for recognizing a recognition object included in a three-dimensional point cloud representing an environment. The present invention relates to a group modeling apparatus, method, and program.

レーザーレンジファインダで計測した計測結果である３次元点群から物体を認識するとき、表面の反射率や面の向きによって３次元の計測結果に大きなノイズが生じることが多い。このように計測された３次元点群のデータで物体を認識する場合、ノイズに強いアルゴリズムが必要となる。 When an object is recognized from a three-dimensional point group that is a measurement result measured by a laser range finder, a large noise often occurs in the three-dimensional measurement result depending on the reflectance of the surface and the orientation of the surface. In order to recognize an object from the data of the three-dimensional point group measured in this way, an algorithm that is resistant to noise is required.

２次元の画像における物体検出のアルゴリズムのなかで、ノイズに強い特徴表現としては、Bag of Visual Words（以下、ＢｏＶＷと記載する）がある（非特許文献１参照）。 Among object detection algorithms in a two-dimensional image, there is Bag of Visual Words (hereinafter referred to as BoVW) as a characteristic expression that is resistant to noise (see Non-Patent Document 1).

一般的には、認識対象の物体の多数の２次元画像からＢｏＶＷ形式のベクトルを抽出して記憶部に記憶しておき、入力画像から抽出されたＢｏＶＷ形式のベクトルを記憶部のベクトルと比較することで、物体を認識する。 In general, a BoVW vector is extracted from a number of two-dimensional images of an object to be recognized and stored in a storage unit, and the BoVW vector extracted from the input image is compared with a vector in the storage unit. By recognizing the object.

J. Sivic et al.: "Video Google: A Text Retrieval Approach to Object Matching in Videos", in Proc. ICCV, 2003.J. Sivic et al .: "Video Google: A Text Retrieval Approach to Object Matching in Videos", in Proc. ICCV, 2003. F. Tombari, S. Salti, and L. Di Stefano,"SHOT: Unique signatures of histograms for surface and texture description",in 11th European Conference on Computer Vision (ECCV)F. Tombari, S. Salti, and L. Di Stefano, "SHOT: Unique signatures of histograms for surface and texture description", in 11th European Conference on Computer Vision (ECCV)

上記の２次元画像における物体認識の方法はロバストではあるが、そのままでは３次元点群による物体認識には用いることができない。 The object recognition method in the above two-dimensional image is robust, but cannot be used for object recognition by a three-dimensional point group as it is.

本発明は、上記問題点を解決するために成されたものであり、精度よく、３次元点群から物体を認識することができる物体認識装置、３次元点群モデル化装置、方法、及びプログラムを提供することを目的とする。 The present invention has been made to solve the above-described problems, and is an object recognition device that can accurately recognize an object from a three-dimensional point cloud, a three-dimensional point cloud modeling device, a method, and a program. The purpose is to provide.

上記目的を達成するために、第１の発明に係る物体認識装置は、認識対象物体を表す３次元点群と、認識対象物体が存在する環境を表す３次元点群とを用いて、前記環境を表す３次元点群に含まれる認識対象物体を認識する物体認識装置であって、前記認識対象物体の３次元点群内の各点に対して法線ベクトルを計算する法線ベクトル計算部と、前記認識対象物体の３次元点群内の各点に対して点周辺の情報を表現する局所特徴量を計算する局所特徴量計算部と、あらかじめ前記環境を表す３次元点群から作成された局所特徴量のクラスタ中心に対して、前記局所特徴量計算部によって認識対象物体を表す３次元点群から計算された局所特徴量を割り当てる局所特徴量割当部と、前記局所特徴量割当部によって局所特徴量を割り当てた結果から各クラスタに割り当てられた局所特徴量の数を計算し、認識対象物体を表現するヒストグラムを計算するヒストグラム作成部と、前記認識対象物体を表す３次元点群から当該３次元点群の大きさを表現するボックスを計算するボックスサイズ決定部と、前記環境を表す３次元点群内に、前記ボックスサイズ決定部によって計算されたボックスを配置する各配置パターンに対して、前記配置パターンにおけるボックスを表現するヒストグラムを計算するボックスヒストグラム計算部と、前記ボックスヒストグラム計算部によって各配置パターンに対して計算された前記ヒストグラムと、前記ヒストグラム作成部によって計算されたヒストグラムの比較を行い、各配置パターンに対してスコアを計算するヒストグラム比較部と、前記ヒストグラム比較部によって計算されたボックスの各配置パターンに対するスコアから、ボックス内に前記認識対象物体が含まれるようなボックスの配置パターンを決定する物体認識処理部と、前記物体認識処理部によって得られた配置パターンから認識対象物体が存在する三次元座標を復元する座標復元部と、を含んで構成されている。 In order to achieve the above object, an object recognition apparatus according to a first invention uses the three-dimensional point group representing a recognition target object and the three-dimensional point group representing an environment in which the recognition target object exists, to An object recognition apparatus for recognizing a recognition target object included in a three-dimensional point group representing a normal vector calculation unit for calculating a normal vector for each point in the three-dimensional point group of the recognition target object; , A local feature amount calculation unit for calculating a local feature amount representing information around a point for each point in the three-dimensional point group of the recognition target object, and a three-dimensional point group representing the environment in advance. A local feature amount assigning unit that assigns a local feature amount calculated from a three-dimensional point group representing a recognition target object by the local feature amount calculating unit to a cluster center of the local feature amount, and a local feature amount assigning unit Is it a result of assigning features? The number of local features assigned to each cluster is calculated, a histogram creation unit that calculates a histogram representing the recognition target object, and the size of the three-dimensional point group from the three-dimensional point group representing the recognition target object. A box size determination unit for calculating a box to be expressed, and a box in the arrangement pattern for each arrangement pattern in which the box calculated by the box size determination unit is arranged in a three-dimensional point group representing the environment. A box histogram calculation unit that calculates a histogram to be compared, the histogram calculated for each arrangement pattern by the box histogram calculation unit, and a histogram calculated by the histogram creation unit are compared, and for each arrangement pattern A histogram comparison unit for calculating a score, and the histogram An object recognition processing unit that determines a box arrangement pattern such that the recognition target object is included in the box from the score for each box arrangement pattern calculated by the comparison unit, and the arrangement obtained by the object recognition processing unit A coordinate restoration unit that restores the three-dimensional coordinates where the recognition target object exists from the pattern.

また、第１の発明に係る物体認識装置において、前記環境を表す点群が存在する３次元空間を予め定められた大きさのボクセルによって離散化し、あらかじめ前記環境を表す３次元点群内の各点に対して計算された局所特徴量と、前記局所特徴量に割り当てられたクラスタ中心とに基づいて、前記３次元空間内の離散化された前記ボクセルの各々に対して、前記ボクセルを含む所定の範囲に含まれる局所特徴量から計算されるヒストグラムを格納した３次元積分画像を計算する３次元積分画像作成部を更に含み、前記ボックスヒストグラム計算部は、ボックスの各配置パターンに対して、前記３次元積分画像の各ボクセルに保存されたヒストグラムを用いて高々８回の和または差の計算によって、前記配置パターンのボックスを表現するヒストグラムを計算するようにしてもよい。 In the object recognition device according to the first aspect of the present invention, the three-dimensional space in which the point cloud representing the environment exists is discretized by voxels having a predetermined size, and each of the three-dimensional point cloud representing the environment in advance is discretized. A predetermined containing voxel for each of the discretized voxels in the three-dimensional space based on a local feature calculated for a point and a cluster center assigned to the local feature A three-dimensional integral image creation unit that calculates a three-dimensional integral image storing a histogram calculated from local feature amounts included in the range of the box histogram calculation unit, the box histogram calculation unit for each arrangement pattern of the box Hiss representing the box of the arrangement pattern by calculating the sum or difference at most eight times using the histogram stored in each voxel of the three-dimensional integral image. It is also possible to calculate the grams.

また、第１の発明に係る物体認識装置において、前記局所特徴量計算部は、前記法線ベクトル計算部によって各点に対して計算された法線ベクトルを利用してＳＨＯＴ（Signature of Histograms of Orientations）を計算するようにしてもよい。 In the object recognition apparatus according to the first aspect of the invention, the local feature amount calculation unit may perform SHOT (Signature of Histograms of Orientations) using the normal vector calculated for each point by the normal vector calculation unit. ) May be calculated.

第２の発明に係る３次元点群モデル化装置は、認識対象物体の３次元点群内の各点に対して法線ベクトルを計算する法線ベクトル計算部と、前記認識対象物体の３次元点群内の各点に対して点周辺の情報を表現する局所特徴量を計算する局所特徴量計算部と、あらかじめ認識対象物体が存在する環境を表す３次元点群から作成された局所特徴量のクラスタ中心に対して、前記局所特徴量計算部によって認識対象物体を表す３次元点群から計算された局所特徴量を割り当てる局所特徴量割当部と、前記局所特徴量割当部によって局所特徴量を割り当てた結果から各クラスタに割り当てられた局所特徴量の数を計算し、認識対象物体を表現するヒストグラムを計算するヒストグラム作成部と、前記認識対象物体を表す３次元点群から当該３次元点群の大きさを表現するボックスを計算するボックスサイズ決定部と、前記環境を表す３次元点群内に、前記ボックスサイズ決定部によって計算されたボックスを配置する各配置パターンに対して、前記配置パターンにおけるボックスを表現するヒストグラムを計算するボックスヒストグラム計算部と、を含んで構成されている。 A three-dimensional point cloud modeling apparatus according to a second invention includes a normal vector calculation unit that calculates a normal vector for each point in the three-dimensional point group of the recognition target object, and the three-dimensional of the recognition target object. A local feature quantity calculation unit that calculates local feature quantities that express information around a point for each point in the point cloud, and a local feature quantity that is created in advance from a three-dimensional point cloud that represents the environment in which the recognition target object exists A local feature amount assigning unit for assigning a local feature amount calculated from a three-dimensional point group representing a recognition target object by the local feature amount calculating unit, and a local feature amount by the local feature amount assigning unit. The number of local features assigned to each cluster is calculated from the assigned result, a histogram creating unit for calculating a histogram representing the recognition target object, and the three-dimensional point from the three-dimensional point group representing the recognition target object A box size determining unit for calculating a box expressing the size of the box; and for each arrangement pattern for arranging the box calculated by the box size determining unit in a three-dimensional point group representing the environment, the arrangement pattern And a box histogram calculation unit for calculating a histogram representing a box in FIG.

第３の発明に係る物体認識方法は、法線ベクトル計算部、局所特徴量計算部、局所特徴量割当部、ヒストグラム作成部、ボックスサイズ決定部、ボックスヒストグラム計算部、ヒストグラム比較部、物体認識処理部、及び座標復元部を含み、認識対象物体を表す点群と、認識対象物体が存在する環境を表す点群とを用いて、環境を表す点群に含まれる認識対象物体を認識する物体認識方法であって、前記法線ベクトル計算部が、前記認識対象物体の３次元点群内の各点に対して法線ベクトルを計算するステップと、前記局所特徴量計算部が、前記認識対象物体の３次元点群内の各点に対して点周辺の情報を表現する局所特徴量を計算するステップと、前記局所特徴量割当部が、あらかじめ前記環境を表す３次元点群から作成された局所特徴量のクラスタ中心に対して、前記局所特徴量計算部によって認識対象物体を表す３次元点群から計算された局所特徴量を割り当てるステップと、前記ヒストグラム作成部が、前記局所特徴量割当部によって局所特徴量を割り当てた結果から各クラスタに割り当てられた局所特徴量の数を計算し、認識対象物体を表現するヒストグラムを計算するステップと、前記ボックスサイズ決定部が、前記認識対象物体を表す３次元点群から当該３次元点群の大きさを表現するボックスを計算するステップと、前記ボックスヒストグラム計算部が、前記環境を表す３次元点群内に、前記ボックスサイズ決定部によって計算されたボックスを配置する各配置パターンに対して、前記配置パターンにおけるボックスを表現するヒストグラムを計算するステップと、前記ヒストグラム比較部が、前記ボックスヒストグラム計算部によって各配置パターンに対して計算された前記ヒストグラムと、前記ヒストグラム作成部によって計算されたヒストグラムの比較を行い、各配置パターンに対してスコアを計算するステップと、前記物体認識処理部が、前記ヒストグラム比較部によって計算されたボックスの各配置パターンに対するスコアから、ボックス内に前記認識対象物体が含まれるようなボックスの配置パターンを決定するステップと、前記座標復元部が、前記物体認識処理部によって得られた配置パターンから認識対象物体が存在する三次元座標を復元するステップと、を含んで実行することを特徴とする。 An object recognition method according to a third invention includes a normal vector calculation unit, a local feature amount calculation unit, a local feature amount allocation unit, a histogram creation unit, a box size determination unit, a box histogram calculation unit, a histogram comparison unit, and an object recognition process. Recognition using the point cloud representing the recognition target object and the point cloud representing the environment where the recognition target object exists to recognize the recognition target object included in the point cloud representing the environment A method in which the normal vector calculation unit calculates a normal vector for each point in a three-dimensional point group of the recognition target object; and the local feature value calculation unit includes the recognition target object. Calculating a local feature amount expressing information around the point for each point in the three-dimensional point group, and the local feature amount assigning unit previously created from the three-dimensional point group representing the environment Feature value A step of assigning a local feature amount calculated from a three-dimensional point group representing a recognition target object by the local feature amount calculation unit to the cluster center; and the histogram generation unit includes a local feature amount by the local feature amount assignment unit Calculating the number of local feature values assigned to each cluster from the result of assigning, and calculating a histogram representing a recognition target object; and the box size determination unit includes a three-dimensional point group representing the recognition target object Calculating a box expressing the size of the three-dimensional point group from the box, and the box histogram calculating unit arranges the box calculated by the box size determining unit in the three-dimensional point group representing the environment. For each arrangement pattern, calculating a histogram representing a box in the arrangement pattern; The histogram comparison unit compares the histogram calculated for each arrangement pattern by the box histogram calculation unit with the histogram calculated by the histogram creation unit, and calculates a score for each arrangement pattern The object recognition processing unit determines a box arrangement pattern such that the recognition target object is included in the box from the score for each box arrangement pattern calculated by the histogram comparison unit; and the coordinates And a restoring unit that executes the three-dimensional coordinates in which the recognition target object exists from the arrangement pattern obtained by the object recognition processing unit.

また、第３の発明に係る物体認識方法において、３次元積分画像作成部が、前記環境を表す点群が存在する３次元空間を予め定められた大きさのボクセルによって離散化し、あらかじめ前記環境を表す３次元点群内の各点に対して計算された局所特徴量と、前記局所特徴量に割り当てられたクラスタ中心とに基づいて、前記３次元空間内の離散化された前記ボクセルの各々に対して、前記ボクセルを含む所定の範囲に含まれる局所特徴量から計算されるヒストグラムを格納した３次元積分画像を計算するステップを更に含み、前記ボックスヒストグラム計算部が計算するステップは、ボックスの各配置パターンに対して、前記３次元積分画像の各ボクセルに保存されたヒストグラムを用いて高々８回の和または差の計算によって、前記配置パターンのボックスを表現するヒストグラムを計算するようにしてもよい。 In the object recognition method according to the third aspect of the invention, the three-dimensional integral image creation unit discretizes a three-dimensional space in which a point cloud representing the environment exists with voxels having a predetermined size, Each of the discretized voxels in the three-dimensional space is based on the local feature amount calculated for each point in the three-dimensional point group to be represented and the cluster center assigned to the local feature amount. On the other hand, the method further includes a step of calculating a three-dimensional integral image storing a histogram calculated from a local feature included in a predetermined range including the voxel, and the step of calculating the box histogram calculation unit includes: For the arrangement pattern, the arrangement pattern is calculated by calculating the sum or difference at most eight times using the histogram stored in each voxel of the three-dimensional integral image. It may be calculated a histogram representing the over down box.

第４の発明に係るプログラムは、コンピュータを、上記第１の発明に係る物体認識装置を構成する各部として機能させるためのプログラムである。 A program according to a fourth invention is a program for causing a computer to function as each unit constituting the object recognition device according to the first invention.

本発明の物体認識装置、３次元点群モデル化装置、方法、及びプログラムによれば、精度よく、３次元点群から物体を認識することができる、という効果が得られる。 According to the object recognition device, the three-dimensional point cloud modeling device, the method, and the program of the present invention, it is possible to obtain an effect that an object can be recognized from the three-dimensional point cloud with high accuracy.

本発明の実施の形態に係る物体認識装置の構成を示すブロック図である。It is a block diagram which shows the structure of the object recognition apparatus which concerns on embodiment of this invention. 基礎データ記憶部２１の一例を示す図である。3 is a diagram illustrating an example of a basic data storage unit 21. FIG. 法線ベクトル記憶部３０の一例を示す図である。3 is a diagram illustrating an example of a normal vector storage unit 30. FIG. 局所特徴量記憶部３２の一例を示す図である。4 is a diagram illustrating an example of a local feature amount storage unit 32. FIG. クラスタ記憶部３４の一例を示す図である。3 is a diagram illustrating an example of a cluster storage unit 34. FIG. 積分画像記憶部３６の一例を示す図である。4 is a diagram illustrating an example of an integral image storage unit 36. FIG. 認識結果記憶部３８の一例を示す図である。4 is a diagram illustrating an example of a recognition result storage unit 38. FIG. 本発明の実施の形態に係る物体認識装置における前処理ルーチンを示すフローチャートである。It is a flowchart which shows the pre-processing routine in the object recognition apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係る物体認識装置における物体認識処理ルーチンを示すフローチャートである。It is a flowchart which shows the object recognition process routine in the object recognition apparatus which concerns on embodiment of this invention.

以下、図面を参照して本発明の実施の形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

＜本発明の実施の形態に係る概要＞ <Outline according to Embodiment of the Present Invention>

本実施の形態では、特徴量として、３次元点群でも利用可能な記述力の高い「ＳＨＯＴ（Signature of Histograms of Orientations）」を用いることで、多様な形状をカバーして物体を認識する。また、ＢｏＶＷを用いて物体を表現することで、計測誤差などのノイズの影響による形状の微小な変化に対して影響を受けやすいＳＨＯＴの不安定な面をカバーすることができる。 In the present embodiment, “SHOT (Signature of Histograms of Orientations)” that can be used even in a three-dimensional point cloud is used as a feature amount, thereby recognizing an object while covering various shapes. In addition, by representing an object using BoVW, it is possible to cover an unstable surface of SHOT that is easily affected by a minute change in shape due to the influence of noise such as a measurement error.

＜本発明の実施の形態に係る物体認識装置の構成＞ <Configuration of Object Recognition Device According to Embodiment of the Present Invention>

まず、本発明の実施の形態に係る物体認識装置の構成について説明する。図１に示すように、本発明の実施の形態に係る物体認識装置１００は、ＣＰＵと、ＲＡＭと、後述する物体認識処理ルーチンを実行するためのプログラムや各種データを記憶したＲＯＭと、を含むコンピュータで構成することが出来る。この物体認識装置１００は、機能的には図１に示すように入力部１０と、演算部２０と、出力部６０とを備えている。 First, the configuration of the object recognition device according to the embodiment of the present invention will be described. As shown in FIG. 1, an object recognition apparatus 100 according to an embodiment of the present invention includes a CPU, a RAM, and a ROM that stores a program for executing an object recognition processing routine to be described later and various data. Can be configured with a computer. Functionally, the object recognition apparatus 100 includes an input unit 10, a calculation unit 20, and an output unit 60 as shown in FIG.

入力部１０は、物体が存在する環境における複数の３次元点からなる３次元点群を受け付ける。３次元点群は、物体の表面上の位置を計測した計測結果である複数の位置を表す３次元点群であり、レーザーレンジファインダや、赤外線センサ、または超音波センサなどの、３次元点群を計測する装置によって得られたものである。 The input unit 10 receives a three-dimensional point group including a plurality of three-dimensional points in an environment where an object exists. The three-dimensional point group is a three-dimensional point group representing a plurality of positions that are measurement results obtained by measuring positions on the surface of an object, and is a three-dimensional point group such as a laser range finder, an infrared sensor, or an ultrasonic sensor. It was obtained by a device that measures

また、入力部１０は、認識対象物体を表す３次元点群を受け付ける。受け付けた３次元点群は基礎データ記憶部２１に格納するものとする。 Further, the input unit 10 receives a three-dimensional point group representing a recognition target object. The received three-dimensional point group is stored in the basic data storage unit 21.

演算部２０は、前処理部２２と、基礎データ記憶部２１と、法線ベクトル記憶部３０と、局所特徴量記憶部３２と、クラスタ記憶部３４と、積分画像記憶部３６と、認識結果記憶部３８と、法線ベクトル計算部４０と、局所特徴量計算部４２と、局所特徴量割当部４４と、ヒストグラム作成部４６と、ボックスサイズ決定部４８と、ボックスヒストグラム計算部５０と、ヒストグラム比較部５２と、物体認識処理部５４と、座標復元部５６とを含んで構成されている。 The calculation unit 20 includes a preprocessing unit 22, a basic data storage unit 21, a normal vector storage unit 30, a local feature storage unit 32, a cluster storage unit 34, an integrated image storage unit 36, and a recognition result storage. Unit 38, normal vector calculation unit 40, local feature quantity calculation unit 42, local feature quantity allocation unit 44, histogram creation unit 46, box size determination unit 48, box histogram calculation unit 50, and histogram comparison A unit 52, an object recognition processing unit 54, and a coordinate restoration unit 56 are configured.

前処理部２２は、以下に説明する各部の処理によって、物体が存在する環境における複数の３次元点群を受け付け、物体の認識に必要な法線ベクトル、局所特徴量、クラスタ、及び３次元積分画像を学習する。 The preprocessing unit 22 receives a plurality of three-dimensional point groups in the environment where the object exists by processing of each unit described below, and normal vectors, local feature amounts, clusters, and three-dimensional integration necessary for object recognition Learn images.

前処理部２２は、法線ベクトル計算部２３と、局所特徴量計算部２４と、クラスタリング部２５と、３次元積分画像作成部２６とを含んで構成されている。 The preprocessing unit 22 includes a normal vector calculation unit 23, a local feature amount calculation unit 24, a clustering unit 25, and a three-dimensional integral image creation unit 26.

基礎データ記憶部２１には、入力部１０で受け付けた、前処理に用いる、物体が存在する環境における３次元点群が格納されている。また、入力部１０で受け付けた、認識対象物体を表す３次元点群が格納されている。なお、基礎データ記憶部２１を設けずに、外部のレーザーレンジファインダから入力部１０を介してリアルタイムで受け取るようにしてもよい。基礎データ記憶部２１の一例を図２に示す。 The basic data storage unit 21 stores a three-dimensional point group in an environment where an object exists, which is received by the input unit 10 and used for preprocessing. In addition, a three-dimensional point group representing a recognition target object received by the input unit 10 is stored. In addition, you may make it receive in real time via the input part 10 from an external laser range finder, without providing the basic data memory | storage part 21. FIG. An example of the basic data storage unit 21 is shown in FIG.

法線ベクトル計算部２３は、基礎データ記憶部２１に格納された物体が存在する環境における３次元点群の各点について、当該点の周辺の３次元点を用いて、Ｘ，Ｙ，Ｚ座標における単位ベクトルとして、法線ベクトルを計算し、法線ベクトル記憶部３０に記憶する。法線ベクトルの計算には、非特許文献３に記載の手法を用いる。 The normal vector calculation unit 23 uses, for each point of the three-dimensional point group in the environment where the object stored in the basic data storage unit 21 exists, the X, Y, Z coordinates using the three-dimensional points around the point. A normal vector is calculated as the unit vector in, and stored in the normal vector storage unit 30. For the calculation of the normal vector, the method described in Non-Patent Document 3 is used.

［非特許文献３］：K. Klasing, D. Althoff, D. Wollherr, M. Buss,"Comparison of Surface Normal Estimation Methods for Range Sensing Applications", Robotics and Automation, 2009. [Non-Patent Document 3]: K. Klasing, D. Althoff, D. Wollherr, M. Buss, “Comparison of Surface Normal Estimation Methods for Range Sensing Applications”, Robotics and Automation, 2009.

法線ベクトル記憶部３０には、法線ベクトル計算部２３により物体が存在する環境における点群について計算されたＮ個の法線ベクトルが格納されている。また、法線ベクトル計算部４０により認識対象物体について計算されたＮ個の法線ベクトルが格納される。法線ベクトル記憶部３０に一例を図３に示す。 The normal vector storage unit 30 stores N normal vectors calculated by the normal vector calculation unit 23 for the point cloud in the environment where the object exists. Further, N normal vectors calculated for the recognition target object by the normal vector calculation unit 40 are stored. An example of the normal vector storage unit 30 is shown in FIG.

局所特徴量計算部２４は、物体が存在する環境における３次元点群の各点について、法線ベクトル記憶部３０に記憶された法線ベクトルに基づいて、当該点の局所特徴量を計算し、局所特徴量記憶部３２に格納する。ここでは、局所特徴量用に予め定めたサイズのＭ個のボクセルの各々について、当該ボクセルの中心位置に近い所定の個数の３次元点に対して計算された法線ベクトルを利用してＳＨＯＴという特徴量を算出する（上記非特許文献２参照）。なお、３次元点群からサンプリングしたＭ個の点の各々について、局所特徴量を計算してもよい。 The local feature amount calculation unit 24 calculates the local feature amount of the point based on the normal vector stored in the normal vector storage unit 30 for each point of the three-dimensional point group in the environment where the object exists, Stored in the local feature amount storage unit 32. Here, for each of the M voxels having a predetermined size for the local feature amount, SHOT is used by using a normal vector calculated for a predetermined number of three-dimensional points close to the center position of the voxel. A feature amount is calculated (see Non-Patent Document 2 above). The local feature amount may be calculated for each of M points sampled from the three-dimensional point group.

局所特徴量記憶部３２には、局所特徴量計算部２４によって計算されたＭ個の局所特徴量が格納されている。また、局所特徴量には、後述するクラスタリング部２５によって割り当てられたクラスタ中心のクラスタ番号が付与される。局所特徴量はＤ次元のベクトルで表現される。また、局所特徴量に対応付けて、ｘ、ｙ、ｚの座標値（局所特徴量用のボクセルの中心の座標値）も記憶しておく。局所特徴量記憶部３２の一例を図４に示す。 The local feature quantity storage unit 32 stores M local feature quantities calculated by the local feature quantity calculation unit 24. Further, the cluster number of the cluster center assigned by the clustering unit 25 described later is assigned to the local feature amount. The local feature amount is expressed by a D-dimensional vector. In addition, the coordinate values of x, y, and z (the coordinate value of the center of the voxel for the local feature amount) are also stored in association with the local feature amount. An example of the local feature amount storage unit 32 is shown in FIG.

クラスタリング部２５は、局所特徴量計算部２４によって計算されたＭ個の局所特徴量に基づいて、非特許文献１に記載のコードブック学習の手法を用いて、Ｋ個のクラスタについて、クラスタ中心を計算し、各クラスタに、Ｍ個の局所特徴量の各々を割り当てる。計算したクラスタ中心はクラスタ記憶部３４に格納する。また、各クラスタに各局所特徴量を割り当てた結果として、各局所特徴量に対するクラスタ番号を局所特徴量記憶部３２に格納する。クラスタ数Ｋはあらかじめ設定により定めておく。ここでは、大量の３次元点群のデータを用いてクラスタリングすることによって、有意なクラスタ中心を得ることができる。 Based on the M local feature amounts calculated by the local feature amount calculating unit 24, the clustering unit 25 uses the code book learning method described in Non-Patent Document 1 to determine the cluster centers for the K clusters. Calculate and assign each of the M local features to each cluster. The calculated cluster center is stored in the cluster storage unit 34. Further, as a result of assigning each local feature quantity to each cluster, the cluster number for each local feature quantity is stored in the local feature quantity storage unit 32. The number of clusters K is determined in advance by setting. Here, a significant cluster center can be obtained by clustering using a large amount of three-dimensional point cloud data.

クラスタ記憶部３４には、クラスタリング部２５によって計算されたＫ個のクラスタのクラスタ中心が格納されている。クラスタ記憶部３４の一例を図５に示す。 The cluster storage unit 34 stores the cluster centers of K clusters calculated by the clustering unit 25. An example of the cluster storage unit 34 is shown in FIG.

３次元積分画像作成部２６は、物体が存在する環境を表す３次元点群を含む３次元空間を、３次元積分画像用に予め定められた大きさのボクセルによって離散化する。離散化されたｘ、ｙ、ｚの座標値で、点群が含まれる最大の座標値をそれぞれＸｍａｘ、Ｙｍａｘ、Ｚｍａｘとする。次に、局所特徴量計算部２４によって計算されたＭ個の局所特徴量と、クラスタリング部２５によってＭ個の局所特徴量に割り当てられたクラスタとに基づいて、３次元空間内の離散化されたボクセルの各々に対して、当該ボクセルと３次元空間の原点とで規定される直方体の範囲に含まれる局所特徴量から計算される、各クラスタの頻度を表すヒストグラムを格納した３次元積分画像を計算する。２次元の積分画像の考えを３次元に拡張して、計算対象のボックス周辺のボクセルについて格納されたヒストグラムの和及び差を用いて、計算対象のボックスの中の各クラスタの頻度がわかるようにする。なお、積分する単位は設定によりあらかじめ定めておく。また、局所特徴量のボクセルのサイズとは独立して指定するようにしてよい。 The three-dimensional integral image creation unit 26 discretizes a three-dimensional space including a three-dimensional point group representing an environment in which an object exists with voxels having a predetermined size for the three-dimensional integral image. It is assumed that the maximum coordinate values including the point group among the discretized coordinate values of x, y, and z are Xmax, Ymax, and Zmax, respectively. Next, the discretization in the three-dimensional space is performed based on the M local feature amounts calculated by the local feature amount calculation unit 24 and the clusters assigned to the M local feature amounts by the clustering unit 25. For each voxel, calculate a three-dimensional integral image that contains a histogram representing the frequency of each cluster, calculated from local features included in the rectangular parallelepiped range defined by the voxel and the origin of the three-dimensional space. To do. Extend the idea of 2D integral images to 3D and use the sum and difference of histograms stored for voxels around the box to be calculated so that the frequency of each cluster in the box to be calculated can be determined. To do. The unit for integration is determined in advance by setting. The local feature quantity voxel size may be specified independently.

積分画像記憶部３６には、３次元積分画像作成部２６によって計算された３次元積分画像が格納されている。具体的には、図６に示すように、各ボクセルについて、ボクセルのインデックス番号と、各クラスタの頻度とが格納されている。 The integral image storage unit 36 stores a three-dimensional integral image calculated by the three-dimensional integral image creation unit 26. Specifically, as shown in FIG. 6, the voxel index number and the frequency of each cluster are stored for each voxel.

法線ベクトル計算部４０は、基礎データ記憶部２１に格納された認識対象物体の３次元点群内の各点に対して、法線ベクトル計算部２３と同様に、法線ベクトルを計算し、法線ベクトル記憶部３０に格納する。 The normal vector calculation unit 40 calculates a normal vector for each point in the three-dimensional point group of the recognition target object stored in the basic data storage unit 21 in the same manner as the normal vector calculation unit 23, It is stored in the normal vector storage unit 30.

局所特徴量計算部４２は、基礎データ記憶部２１に格納された認識対象物体の３次元点群に基づいて、認識対象物体の３次元点群からサンプリングされた各点について、局所特徴量を計算し、局所特徴量記憶部３２に格納する。ここでは、局所特徴量計算部２４と同様に、各点に対して計算された法線ベクトルを利用してＳＨＯＴという特徴量を算出する（上記非特許文献２参照）。 The local feature amount calculation unit 42 calculates a local feature amount for each point sampled from the three-dimensional point group of the recognition target object based on the three-dimensional point group of the recognition target object stored in the basic data storage unit 21. And stored in the local feature quantity storage unit 32. Here, as with the local feature value calculation unit 24, a feature value called SHOT is calculated using a normal vector calculated for each point (see Non-Patent Document 2 above).

局所特徴量割当部４４は、クラスタ記憶部３４に格納された各クラスタのクラスタ中心と、局所特徴量計算部４２によって認識対象物体を表す３次元点群の各点について計算された局所特徴量とに基づいて、各クラスタに、認識対象物体を表す３次元点群の各点について計算された局所特徴量の各々を割り当てる。 The local feature amount assigning unit 44 includes the cluster center of each cluster stored in the cluster storage unit 34 and the local feature amount calculated for each point of the three-dimensional point group representing the recognition target object by the local feature amount calculating unit 42. Based on the above, each of the local feature amounts calculated for each point of the three-dimensional point group representing the recognition target object is assigned to each cluster.

ヒストグラム作成部４６は、局所特徴量割当部４４によって各点についての局所特徴量
を各クラスタに割り当てた結果から、各クラスタに割り当てられた局所特徴量の数を計算し、認識対象物体を表現する、各クラスタの出現頻度を表すヒストグラムを計算する。 The histogram creation unit 46 expresses the recognition target object by calculating the number of local feature values assigned to each cluster from the result of assigning the local feature value for each point to each cluster by the local feature value assigning unit 44. Then, a histogram representing the appearance frequency of each cluster is calculated.

ボックスサイズ決定部４８は、認識対象物体を表す３次元点群から当該３次元点群の大きさを表現するボックスのサイズを計算する。ｘ方向の大きさは、３次元点群のｘ方向の計測データを用いて、以下の（１）式により計算できる。 The box size determination unit 48 calculates the size of a box expressing the size of the three-dimensional point group from the three-dimensional point group representing the recognition target object. The size in the x direction can be calculated by the following equation (1) using the measurement data in the x direction of the three-dimensional point group.

最大値、最小値は、３次元点群のｘ方向の最大値、最小値である。ｒは、３次元積分画像用のボクセルの一辺の長さとし、ボックスがボクセル何個分となるかのサイズが決定される。上記はｘ方向の場合と同様に、ｙ、ｚについても同様に計算できる。 The maximum value and the minimum value are the maximum value and the minimum value in the x direction of the three-dimensional point group. r is the length of one side of the voxel for the three-dimensional integral image, and the size of how many voxels the box will have is determined. The above can be similarly calculated for y and z as in the case of the x direction.

また、ボックスサイズ決定部４８は、基礎データ記憶部２１に格納された環境を表す３次元点群を含む３次元空間であって、かつ、ボクセルによって離散化された３次元空間をスキャニングするように、計算したサイズのボックスを配置し、各配置パターンを決定する。なお、ボックスの大きさを変えたものを３次元空間に配置した配置パターンを更に決定するようにしてもよい。 Further, the box size determination unit 48 scans a three-dimensional space including a three-dimensional point group representing the environment stored in the basic data storage unit 21 and discretized by voxels. The boxes of the calculated size are arranged, and each arrangement pattern is determined. In addition, you may make it determine further the arrangement pattern which arrange | positioned what changed the magnitude | size of the box in three-dimensional space.

ボックスヒストグラム計算部５０は、積分画像記憶部３６に記憶された３次元積分画像を用いて、ボックスサイズ決定部４８によって決定されたボックスを配置する各配置パターンに対して、当該配置パターンにおけるボックスを表現するヒストグラムを計算する。計算されるヒストグラムは、ボックス内に存在する局所特徴量に割り当てられた各クラスタの出現頻度を表す。当該配置パターンにおけるボックスを表現するヒストグラムは、３次元積分画像の各ボクセルに保存されたヒストグラムを用いて高々８回の和または差の計算によって計算される。 The box histogram calculation unit 50 uses the three-dimensional integrated image stored in the integrated image storage unit 36 to set a box in the arrangement pattern for each arrangement pattern in which the box determined by the box size determination unit 48 is arranged. Calculate the histogram to represent. The calculated histogram represents the appearance frequency of each cluster assigned to the local feature amount existing in the box. A histogram representing a box in the arrangement pattern is calculated by calculating eight sums or differences at most using the histogram stored in each voxel of the three-dimensional integral image.

ヒストグラム比較部５２は、ボックスヒストグラム計算部５０によって各配置パターンに対して計算されたヒストグラムと、ヒストグラム作成部４６によって計算されたヒストグラムの比較を行い、各配置パターンに対してスコアを計算する。例えば、ヒストグラムを表すベクトルのベクトル類似度を、スコアとして計算する。 The histogram comparison unit 52 compares the histogram calculated for each arrangement pattern by the box histogram calculation unit 50 with the histogram calculated by the histogram creation unit 46, and calculates a score for each arrangement pattern. For example, the vector similarity of a vector representing a histogram is calculated as a score.

認識結果記憶部３８には、ヒストグラム比較部５２によって各配置パターンに対して計算されたスコアが格納されている。認識結果記憶部３８の一例を図７に示す。配置パターンに対応するインデックスとスコアとが格納されている。 The recognition result storage unit 38 stores a score calculated for each arrangement pattern by the histogram comparison unit 52. An example of the recognition result storage unit 38 is shown in FIG. An index and a score corresponding to the arrangement pattern are stored.

物体認識処理部５４は、ヒストグラム比較部５２によって計算されたボックスの各配置パターンに対するスコアから、ボックス内に認識対象物体が含まれるようなボックスの配置パターンを決定する。ここでは、スコアの値が閾値以上であり、かつ、周囲の配置パターンのインデックスに対するスコアと比較して、着目する配置パターンのインデックスのスコアが極大（最も高いスコア）になっていれば、そのインデックスに対応する位置に、物体が存在すると認識する。そして、物体が存在すると認識されたインデックスに対して、認識結果を認識結果記憶部３８に追加する。 The object recognition processing unit 54 determines a box arrangement pattern in which the recognition target object is included in the box from the score for each box arrangement pattern calculated by the histogram comparison unit 52. Here, if the score value is equal to or greater than the threshold value and the score of the index of the arrangement pattern of interest is maximal (highest score) compared to the score for the index of the surrounding arrangement pattern, the index It is recognized that an object exists at a position corresponding to. Then, the recognition result is added to the recognition result storage unit 38 for the index recognized that the object exists.

座標復元部５６は、物体認識処理部５４によって得られた配置パターンのインデックスとボックスサイズ決定部４８によって得られたボックスサイズから、上記（１）式とは逆の計算を行って、離散化された座標値から認識対象物体が存在する三次元座標を復元し、出力部６０に出力する。 The coordinate restoration unit 56 is discretized by performing an inverse calculation to the above equation (1) from the arrangement pattern index obtained by the object recognition processing unit 54 and the box size obtained by the box size determination unit 48. The three-dimensional coordinates where the recognition target object exists are restored from the obtained coordinate values and are output to the output unit 60.

＜本発明の実施の形態に係る物体認識装置の作用＞ <Operation of Object Recognition Device According to Embodiment of Present Invention>

次に、本発明の実施の形態に係る物体認識装置１００の作用について説明する。入力部１０において物体が存在する環境における３次元点群を受け付けて基礎データ記憶部２１に格納すると、物体認識装置１００は、図８に示す前処理ルーチンを実行する。 Next, the operation of the object recognition apparatus 100 according to the embodiment of the present invention will be described. When the input unit 10 receives a three-dimensional point cloud in an environment where an object exists and stores it in the basic data storage unit 21, the object recognition apparatus 100 executes a preprocessing routine shown in FIG.

まず、ステップＳ１００では、基礎データ記憶部２１に格納された物体が存在する環境における３次元点群の各点について、Ｘ，Ｙ，Ｚ座標における単位ベクトルとして、法線ベクトルを計算し、法線ベクトル記憶部３０に記憶する。 First, in step S100, a normal vector is calculated as a unit vector in the X, Y, and Z coordinates for each point of the three-dimensional point group in the environment where the object stored in the basic data storage unit 21 exists, and the normal line is calculated. Store in the vector storage unit 30.

次に、ステップＳ１０２では、基礎データ記憶部２１に格納された物体が存在する環境における３次元点群と、法線ベクトル記憶部３０に記憶された各点の法線ベクトルとに基づいて、局所特徴量用に予め定めたサイズのＭ個のボクセルの各々について、局所特徴量を計算し、局所特徴量記憶部３２に格納する。 Next, in step S102, based on the three-dimensional point cloud in the environment where the object stored in the basic data storage unit 21 exists and the normal vector of each point stored in the normal vector storage unit 30, the local data A local feature amount is calculated for each of M voxels having a predetermined size for the feature amount, and stored in the local feature amount storage unit 32.

ステップＳ１０４では、局所特徴量計算部２４によって計算されたＭ個の局所特徴量に基づいて、非特許文献１に記載のコードブック学習の手法を用いて、Ｋ個のクラスタについて、クラスタ中心を計算し、各クラスタにＭ個の局所特徴量の各々を割り当て、Ｍ個の局所特徴量の各々に対するクラスタ番号を局所特徴量記憶部３２に格納し、各クラスタのクラスタ中心をクラスタ記憶部３４に格納する。 In step S104, the cluster center is calculated for K clusters using the codebook learning method described in Non-Patent Document 1 based on the M local feature amounts calculated by the local feature amount calculation unit 24. Then, each of the M local feature values is assigned to each cluster, the cluster number for each of the M local feature values is stored in the local feature value storage unit 32, and the cluster center of each cluster is stored in the cluster storage unit 34. To do.

ステップＳ１０６では、基礎データ記憶部２１に格納された環境を表す３次元点群を含む３次元空間を、予め定められた大きさのボクセルによって離散化する。 In step S106, the three-dimensional space including the three-dimensional point group representing the environment stored in the basic data storage unit 21 is discretized by voxels having a predetermined size.

ステップＳ１０８では、ステップＳ１０４で計算されたＭ個の局所特徴量と、ステップＳ１０４でＭ個の局所特徴量に割り当てられたクラスタとに基づいて、３次元空間内の離散化されたボクセルの各々に対して、当該ボクセルと３次元空間の原点とで規定される直方体の範囲に含まれる局所特徴量から計算されるヒストグラムを格納した３次元積分画像を計算して、積分画像記憶部３６に格納する。 In step S108, based on the M local feature values calculated in step S104 and the clusters assigned to the M local feature values in step S104, each of the discretized voxels in the three-dimensional space is assigned. On the other hand, a three-dimensional integrated image storing a histogram calculated from local feature amounts included in a rectangular parallelepiped range defined by the voxel and the origin of the three-dimensional space is calculated and stored in the integrated image storage unit 36. .

次に、物体認識処理ルーチンについて説明する。入力部１０において認識対象物体を表す３次元点群を受け付けて基礎データ記憶部２１に格納すると、物体認識装置１００は、図９に示す物体認識処理ルーチンを実行する。 Next, the object recognition processing routine will be described. When the input unit 10 receives a three-dimensional point cloud representing a recognition target object and stores it in the basic data storage unit 21, the object recognition device 100 executes an object recognition processing routine shown in FIG.

ステップＳ２００では、基礎データ記憶部２１に格納された認識対象物体の３次元点群内の各点に対して、法線ベクトルを計算し、法線ベクトル記憶部３０に格納する。 In step S <b> 200, a normal vector is calculated for each point in the three-dimensional point group of the recognition target object stored in the basic data storage unit 21 and stored in the normal vector storage unit 30.

ステップＳ２０２では、基礎データ記憶部２１に格納された認識対象物体の３次元点群に基づいて、３次元点群からサンプリングされた各点について、局所特徴量を計算し、局所特徴量記憶部３２に格納する。 In step S202, a local feature amount is calculated for each point sampled from the three-dimensional point group based on the three-dimensional point group of the recognition target object stored in the basic data storage unit 21, and the local feature amount storage unit 32 is calculated. To store.

ステップＳ２０４では、クラスタ記憶部３４に格納された各クラスタのクラスタ中心に基づいて、ステップＳ２０２で認識対象物体を表す３次元点群の各点について計算された局所特徴量を、各クラスタに割り当てる。 In step S204, the local feature amount calculated for each point of the three-dimensional point group representing the recognition target object in step S202 based on the cluster center of each cluster stored in the cluster storage unit 34 is assigned to each cluster.

ステップＳ２０６では、ステップＳ２０４で局所特徴量をクラスタに割り当てた結果から、各クラスタに割り当てられた局所特徴量の数を計算し、認識対象物体を表現するヒストグラムを計算する。 In step S206, the number of local feature values assigned to each cluster is calculated from the result of assigning the local feature values to the clusters in step S204, and a histogram representing the recognition target object is calculated.

ステップＳ２０８では、認識対象物体を表す３次元点群から当該３次元点群の大きさを表現するボックスのサイズを上記（１）式により計算する。 In step S208, the size of the box expressing the size of the three-dimensional point group is calculated from the three-dimensional point group representing the recognition target object by the above equation (1).

ステップＳ２１０では、基礎データ記憶部２１に格納された環境を表す３次元点群を含む３次元空間であって、ボクセルによって離散化された３次元空間に、ステップＳ２０８でサイズが計算されたボックスを配置し、ボックスの各配置パターンを決定する。 In step S210, the box whose size is calculated in step S208 is added to the three-dimensional space including the three-dimensional point cloud representing the environment stored in the basic data storage unit 21 and discretized by the voxels. Arrange and determine each placement pattern of the box.

ステップＳ２１１では、積分画像記憶部３６に記憶された３次元積分画像を用いて、上記ステップＳ２１０で決定されたボックスを配置する各配置パターンに対して、当該配置パターンにおけるボックスを表現するヒストグラムを計算する。当該配置パターンにおけるボックスを表現するヒストグラムは、３次元積分画像の各ボクセルに保存されたヒストグラムを用いて高々８回の和または差の計算によって計算される。 In step S211, using the three-dimensional integrated image stored in the integrated image storage unit 36, for each arrangement pattern in which the box determined in step S210 is arranged, a histogram expressing the box in the arrangement pattern is calculated. To do. A histogram representing a box in the arrangement pattern is calculated by calculating eight sums or differences at most using the histogram stored in each voxel of the three-dimensional integral image.

ステップＳ２１２では、ステップＳ２１０で各配置パターンに対して計算されたヒストグラムと、ステップＳ２０６で計算されたヒストグラムの比較を行い、各配置パターンに対してスコアを計算する。 In step S212, the histogram calculated for each arrangement pattern in step S210 is compared with the histogram calculated in step S206, and a score is calculated for each arrangement pattern.

ステップＳ２１４では、ステップＳ２１２によって計算されたボックスの各配置パターンに対するスコアから、スコアが最も高い配置パターンを、ボックス内に認識対象物体が含まれるようなボックスの配置パターンとして決定する。 In step S214, an arrangement pattern having the highest score is determined as a box arrangement pattern in which a recognition target object is included in the box from the scores for the respective arrangement patterns of boxes calculated in step S212.

ステップＳ２１６では、物体認識処理部５４によって得られた配置パターンから認識対象物体が存在する三次元座標を復元し、出力部６０に出力し処理を終了する。 In step S216, the three-dimensional coordinates where the recognition target object is present are restored from the arrangement pattern obtained by the object recognition processing unit 54, output to the output unit 60, and the process ends.

以上説明したように、本発明の実施の形態に係る物体認識装置によれば、認識対象物体を表現するヒストグラムを計算し、認識対象物体を表す３次元点群から当該３次元点群の大きさを表現するボックスを計算し、認識対象物体が存在する環境を表す３次元点群を含む３次元空間にボックスを配置した配置パターンを決定し、配置パターンにおけるボックスを表現するヒストグラムを計算し、各配置パターンに対して計算されたヒストグラムと、認識対象物体を表現するヒストグラムの比較を行い、各配置パターンに対してスコアを計算し、ボックスの各配置パターンに対するスコアから、配置パターンを決定し、配置パターンから認識対象物体が存在する三次元座標を復元することにより、精度よく、３次元点群から物体を認識することができる。 As described above, according to the object recognition apparatus according to the embodiment of the present invention, the histogram representing the recognition target object is calculated, and the size of the three-dimensional point group is calculated from the three-dimensional point group representing the recognition target object. Calculating a box representing a box in the arrangement pattern, determining a placement pattern in which the box is placed in a three-dimensional space including a three-dimensional point cloud representing an environment in which the recognition target object exists, Comparing the histogram calculated for the placement pattern with the histogram representing the recognition target object, calculating the score for each placement pattern, determining the placement pattern from the score for each placement pattern in the box, and placing By reconstructing the 3D coordinates where the recognition target object exists from the pattern, it is possible to recognize the object from the 3D point cloud with high accuracy. Kill.

なお、本発明は、上述した実施の形態に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 The present invention is not limited to the above-described embodiment, and various modifications and applications can be made without departing from the gist of the present invention.

例えば、上述した実施の形態では物体認識装置１００について説明したがこの装置及び構成に限定されるものではない。例えば、認識対象物体をモデル化するために、基礎データ記憶部２１と、法線ベクトル記憶部３０と、局所特徴量記憶部３２と、クラスタ記憶部３４と、積分画像記憶部３６と、法線ベクトル計算部４０と、局所特徴量計算部４２と、局所特徴量割当部４４と、ヒストグラム作成部４６と、ボックスサイズ決定部４８と、ボックスヒストグラム計算部５０と、を含む３次元点群モデル化装置として構成するようにしてもよい。 For example, although the object recognition apparatus 100 has been described in the above-described embodiment, the present invention is not limited to this apparatus and configuration. For example, in order to model a recognition target object, a basic data storage unit 21, a normal vector storage unit 30, a local feature storage unit 32, a cluster storage unit 34, an integrated image storage unit 36, a normal line Three-dimensional point cloud modeling including a vector calculation unit 40, a local feature value calculation unit 42, a local feature value allocation unit 44, a histogram creation unit 46, a box size determination unit 48, and a box histogram calculation unit 50 You may make it comprise as an apparatus.

１０入力部
２０演算部
２２前処理部
２１基礎データ記憶部
２３法線ベクトル計算部
２４局所特徴量計算部
２５クラスタリング部
２６３次元積分画像作成部
３０法線ベクトル記憶部
３２局所特徴量記憶部
３４クラスタ記憶部
３６積分画像記憶部
３８認識結果記憶部
４０法線ベクトル計算部
４２局所特徴量計算部
４４局所特徴量割当部
４６ヒストグラム作成部
４８ボックスサイズ決定部
５０ボックスヒストグラム計算部
５２ヒストグラム比較部
５４物体認識処理部
５６座標復元部
６０出力部
１００物体認識装置 DESCRIPTION OF SYMBOLS 10 Input part 20 Operation part 22 Pre-processing part 21 Basic data storage part 23 Normal vector calculation part 24 Local feature quantity calculation part 25 Clustering part 26 Three-dimensional integral image creation part 30 Normal vector storage part 32 Local feature quantity storage part 34 Cluster storage unit 36 Integrated image storage unit 38 Recognition result storage unit 40 Normal vector calculation unit 42 Local feature amount calculation unit 44 Local feature amount allocation unit 46 Histogram creation unit 48 Box size determination unit 50 Box histogram calculation unit 52 Histogram comparison unit 54 Object recognition processing unit 56 Coordinate restoration unit 60 Output unit 100 Object recognition device

Claims

An object recognition apparatus for recognizing a recognition target object included in a three-dimensional point group representing an environment using a three-dimensional point group representing a recognition target object and a three-dimensional point group representing an environment in which the recognition target object exists. There,
A normal vector calculator that calculates a normal vector for each point in the three-dimensional point cloud of the recognition target object;
A local feature amount calculation unit that calculates a local feature amount that represents information around a point for each point in the three-dimensional point group of the recognition target object;
A local feature in which a local feature amount calculated from a three-dimensional point group representing a recognition target object by the local feature amount calculation unit is assigned to a cluster center of the local feature amount previously created from the three-dimensional point group representing the environment. A quantity allocation unit;
Calculating the number of local feature values assigned to each cluster from the result of assigning local feature values by the local feature value assigning unit, and calculating a histogram expressing a recognition target object;
A box size determining unit for calculating a box expressing the size of the three-dimensional point group from the three-dimensional point group representing the recognition target object;
A box histogram calculation unit for calculating a histogram representing a box in the arrangement pattern for each arrangement pattern in which the box calculated by the box size determination unit is arranged in a three-dimensional point group representing the environment;
A histogram comparison unit that compares the histogram calculated for each arrangement pattern by the box histogram calculation unit and a histogram calculated by the histogram creation unit, and calculates a score for each arrangement pattern;
An object recognition processing unit that determines a layout pattern of a box such that the recognition target object is included in the box from a score for each layout pattern of the box calculated by the histogram comparison unit;
A coordinate restoration unit that restores the three-dimensional coordinates where the recognition target object exists from the arrangement pattern obtained by the object recognition processing unit;
An object recognition apparatus including:

A three-dimensional space in which a point cloud representing the environment exists is discretized by a voxel having a predetermined size, and a local feature amount calculated in advance for each point in the three-dimensional point cloud representing the environment; A histogram calculated from local features included in a predetermined range including the voxels for each of the discretized voxels in the three-dimensional space based on the cluster center assigned to the local features. A three-dimensional integral image creating unit for calculating a three-dimensional integral image storing
The box histogram calculation unit represents the box of the arrangement pattern by calculating the sum or difference at most eight times using the histogram stored in each voxel of the three-dimensional integral image for each arrangement pattern of the box. The object recognition apparatus according to claim 1, wherein a histogram to be calculated is calculated.

The object according to claim 1, wherein the local feature amount calculation unit calculates SHOT (Signature of Histograms of Orientations) using the normal vector calculated for each point by the normal vector calculation unit. Recognition device.

A normal vector calculator that calculates a normal vector for each point in the 3D point cloud of the recognition target object;
A local feature amount calculation unit that calculates a local feature amount that represents information around a point for each point in the three-dimensional point group of the recognition target object;
The local feature calculated from the three-dimensional point group representing the recognition target object by the local feature amount calculation unit with respect to the cluster center of the local feature amount previously created from the three-dimensional point group representing the environment where the recognition target object exists A local feature amount assigning unit for assigning an amount;
Calculating the number of local feature values assigned to each cluster from the result of assigning local feature values by the local feature value assigning unit, and calculating a histogram expressing a recognition target object;
A box size determining unit for calculating a box expressing the size of the three-dimensional point group from the three-dimensional point group representing the recognition target object;
A box histogram calculation unit for calculating a histogram representing a box in the arrangement pattern for each arrangement pattern in which the box calculated by the box size determination unit is arranged in a three-dimensional point group representing the environment;
3D point cloud modeling device.

It includes a normal vector calculation unit, a local feature amount calculation unit, a local feature amount allocation unit, a histogram creation unit, a box size determination unit, a box histogram calculation unit, a histogram comparison unit, an object recognition processing unit, and a coordinate restoration unit, and is a recognition target An object recognition method for recognizing a recognition target object included in a point group representing an environment using a point group representing an object and a point group representing an environment where the recognition target object exists,
The normal vector calculation unit calculating a normal vector for each point in the three-dimensional point group of the recognition target object;
The local feature amount calculation unit calculating a local feature amount expressing information around the point for each point in the three-dimensional point group of the recognition target object;
The local feature amount assigning unit is calculated from the three-dimensional point group representing the recognition target object by the local feature amount calculating unit with respect to the cluster center of the local feature amount previously created from the three-dimensional point group representing the environment. Assigning local feature quantities,
Calculating the number of local feature values assigned to each cluster from the result of assigning local feature values by the local feature value assigning unit, and calculating a histogram representing a recognition target object;
The box size determining unit calculating a box expressing the size of the three-dimensional point group from the three-dimensional point group representing the recognition target object;
The box histogram calculation unit calculates a histogram representing a box in the arrangement pattern for each arrangement pattern in which the box calculated by the box size determination unit is arranged in a three-dimensional point group representing the environment. Steps,
The histogram comparison unit compares the histogram calculated for each arrangement pattern by the box histogram calculation unit with the histogram calculated by the histogram creation unit, and calculates a score for each arrangement pattern When,
The object recognition processing unit determines a box arrangement pattern such that the recognition target object is included in a box from a score for each box arrangement pattern calculated by the histogram comparison unit;
The coordinate restoration unit restoring the three-dimensional coordinates where the recognition target object exists from the arrangement pattern obtained by the object recognition processing unit;
An object recognition method including:

The three-dimensional integral image creation unit discretizes a three-dimensional space in which the point cloud representing the environment exists with voxels having a predetermined size, and calculates in advance for each point in the three-dimensional point cloud representing the environment. And each of the discretized voxels in the three-dimensional space is included in a predetermined range including the voxel based on the determined local feature and the cluster center assigned to the local feature. A step of calculating a three-dimensional integral image storing a histogram calculated from the local features;
The step of calculating by the box histogram calculation unit includes calculating the arrangement pattern by performing at most eight sums or differences using the histogram stored in each voxel of the three-dimensional integral image for each arrangement pattern of the box. The object recognition method according to claim 5, wherein a histogram representing a box is calculated.

The program for functioning a computer as each part which comprises the object recognition apparatus of any one of Claims 1-3.