JP2021076948A

JP2021076948A - Person detection device, person detection system, and person detection method

Info

Publication number: JP2021076948A
Application number: JP2019201138A
Authority: JP
Inventors: 佳一三谷; Yoshikazu Mitani; 昭信渡邊; Akinobu Watanabe; 敦根尾; Atsushi Neo
Original assignee: Hitachi LG Data Storage Inc
Current assignee: Hitachi LG Data Storage Inc
Priority date: 2019-11-06
Filing date: 2019-11-06
Publication date: 2021-05-20

Abstract

To provide a person detection device that detects a person with high accuracy even when a plurality of persons overlap and a part of the person behind cannot be seen or when a plurality of persons are close to each other.SOLUTION: In a person detection system 1, a person detection device 10 includes: a first person detection unit 107 that detects a person included in an RGB image acquired from an RGB camera 101; a point cloud clustering unit 108 that acquires a point cloud group by spatially grouping point clouds included in point cloud data generated from a point cloud image acquired from a ToF sensor 102; a person point cloud cluster association unit 109 that associates a person detection area in which the first person detection unit 107 detected the person in the RGB image with the point cloud group grouped by the point cloud clustering unit 108; and a second person detection unit 112 that determines whether or not a non-corresponding point cloud group is a person according to a first feature quantity of the non-corresponding point cloud group of the point cloud group that is not associated with the person detection area by the person point cloud cluster association unit 109.SELECTED DRAWING: Figure 1

Description

本発明は、画像内に含まれる人を検出する技術に関する。 The present invention relates to a technique for detecting a person included in an image.

画像を用いた監視技術においては、画像内に含まれる人を検出する場合がある。本技術分野における従来技術として特許文献１、特許文献２、特許文献３がある。 In the surveillance technology using an image, a person included in the image may be detected. Patent Document 1, Patent Document 2, and Patent Document 3 are prior arts in the present technical field.

下記特許文献１は、『所定の範囲に対して距離の測定を行う距離測定部と、前記距離測定部によって測定された距離の分布に基づいて人を検出する人処理部と、前記人処理部によって検出された人毎に識別子を付与するトラッキング部と、人に対して挙手の開始と終了を促すことにより設定される第１期間において、測定された距離に含まれる高さ方向の値である人データ高さに基づいて、人の反応を推定する状況推定部と、を備え、前記状況推定部は、前記トラッキング部によって識別子を付与された人毎の前記人データ高さを前記第１期間に複数回測定し、複数の前記人データ高さが所定の範囲内にある識別子を抽出し、抽出された識別子に対応する人の反応を推定する動作推定装置』という技術を開示している（請求項１参照）。 The following Patent Document 1 describes, "A distance measuring unit that measures a distance with respect to a predetermined range, a human processing unit that detects a person based on the distribution of the distance measured by the distance measuring unit, and the human processing unit. It is a value in the height direction included in the measured distance in the tracking unit that assigns an identifier to each person detected by and the first period set by urging the person to start and end raising his / her hand. A situation estimation unit that estimates a person's reaction based on a person data height is provided, and the situation estimation unit sets the person data height for each person assigned an identifier by the tracking unit for the first period. Discloses a technique called "motion estimation device" that measures a plurality of times, extracts a plurality of identifiers whose data heights are within a predetermined range, and estimates the reaction of a person corresponding to the extracted identifiers. See claim 1).

下記特許文献２は、『通行人の足のレーザセンサからの距離を示す距離データを受信する距離データ受信手段と、複数の前記レーザセンサによって取得された前記距離データを複数用いて、同時刻における空間平面上のレーザセンサの検出位置を示すセンシング画像データを生成するセンシング画像データ生成手段と、前記センシング画像データにおける複数の前記検出位置をクラスタリングするクラスタリング手段と、前記クラスタリング結果から一人の通行人の足に対応するクラスタを特定する通行人特定手段と、前記特定した通行人のクラスタから、当該通行人の足のポイントの位置、歩行方向、歩幅、歩行周期、歩行位相とからなるパラメータを決定するパラメータ決定手段と、前記パラメータとParticle filterアルゴリズムを用いて通行人の軌跡を解析する軌跡解析処理手段と、を備えることを特徴とする通行人行動解析装置』という技術を開示している（請求項１参照)。 The following Patent Document 2 describes, "A distance data receiving means for receiving distance data indicating a distance from a passerby's foot from a laser sensor and a plurality of the distance data acquired by the plurality of laser sensors are used at the same time. Sensing image data generation means that generates sensing image data indicating the detection position of the laser sensor on the spatial plane, clustering means that clusters a plurality of the detection positions in the sensing image data, and one passerby from the clustering result. From the passerby identification means for specifying the cluster corresponding to the foot and the specified passerby cluster, parameters including the position of the point of the passerby's foot, walking direction, stride length, walking cycle, and walking phase are determined. A technique called a "passerby behavior analysis device" comprising a parameter determining means and a locus analysis processing means for analyzing the locus of a passerby using the parameters and a particle filter algorithm is disclosed (claim). 1).

下記特許文献３は、『画像の範囲ごとに群衆の移動ベクトルを取得する取得手段と、前記取得手段により取得された移動ベクトルに基づいて人流クラスタを生成する生成手段と、前記生成手段により生成された人流クラスタを正常な人流クラスタと異常な人流クラスタとに分類する分類手段と、前記正常な人流クラスタと前記異常な人流クラスタとを視覚的に判別可能な形で前記画像に重畳して表示する表示手段と、を有する画像処理装置』という技術を開示している（請求項１参照）。 The following Patent Document 3 describes, "A acquisition means for acquiring a movement vector of a crowd for each range of an image, a generation means for generating a human flow cluster based on the movement vector acquired by the acquisition means, and a generation means generated by the generation means. A classification means for classifying a normal human flow cluster into a normal human flow cluster and an abnormal human flow cluster, and displaying the normal human flow cluster and the abnormal human flow cluster superimposed on the image in a visually distinguishable form. It discloses a technique called "an image processing device having a display means" (see claim 1).

特開２０１６−２１８６１０号公報Japanese Unexamined Patent Publication No. 2016-218610 特開２００９−１１０１８５号公報Japanese Unexamined Patent Publication No. 2009-110185 特開２０１９−０６７２０８号公報JP-A-2019-067208

従来技術を用いることにより、被撮影領域内にいる人を検出できる。しかし人が多くなるに連れ、複数人が重なって後方の人の一部が見えなくなることにより後方の人が検出されなくなる場合や、複数人が近接することにより複数人が１人として検出されてしまう場合が増える。 By using the prior art, it is possible to detect a person in the area to be photographed. However, as the number of people increases, there are cases where multiple people overlap and some of the people behind are not visible, so the people behind are not detected, or when multiple people are close to each other, multiple people are detected as one person. There are more cases where it ends up.

本発明は、上記のような課題に鑑みてなされたものであり、複数人が重なって後方の人の一部が見えなくなる場合や、複数人が近接する場合においても、高精度に人を検出することができる人検出装置を提供することを目的とする。 The present invention has been made in view of the above problems, and can detect a person with high accuracy even when a plurality of people overlap and a part of the person behind cannot be seen or when a plurality of people are close to each other. It is an object of the present invention to provide a person detection device capable of performing.

本発明に係る人検出装置は、ＲＧＢ画像に含まれる人領域を検出した後、点群クラスタのなかで前記人領域に対応していない未対応点群グループの特徴量を算出し、その特徴量にしたがって、前記未対応点群グループが人であるか否かを判断する。 The person detection device according to the present invention detects a human area included in an RGB image, then calculates a feature amount of an uncorresponding point group group that does not correspond to the human area in the point group cluster, and calculates the feature amount. Therefore, it is determined whether or not the uncorresponding point group is a person.

本発明に係る人検出装置によれば、複数人が重なって後方の人の一部が見えなくなる場合や、複数人が近接する場合においても、高精度に人を検出できる。上記した以外の課題、構成、効果は、以下の実施形態の説明により明らかにされる。 According to the person detection device according to the present invention, a person can be detected with high accuracy even when a plurality of people overlap and a part of the person behind cannot be seen or when a plurality of people are close to each other. Issues, configurations, and effects other than those described above will be clarified by the following description of the embodiments.

実施形態１に係る人検出システム１の構成図である。It is a block diagram of the person detection system 1 which concerns on Embodiment 1. ＲＧＢカメラ１０１とＴｏＦセンサ１０２によって同じ１組のマーカを撮影する状況を示している。It shows the situation where the same set of markers is photographed by the RGB camera 101 and the ToF sensor 102. 人検出システム１が画像内の人を検出する処理を説明するフローチャートである。It is a flowchart explaining the process which the person detection system 1 detects a person in an image. 撮影環境の１例である。This is an example of a shooting environment. 図３のフローチャートにしたがって図４Ａにおける被写体４０５Ａと４０６Ａを検出する手順を示す処理フロー図である。It is a processing flow diagram which shows the procedure of detecting the subject 405A and 406A in FIG. 4A according to the flowchart of FIG. 実施形態２に係る人検出システム１の構成図である。It is a block diagram of the person detection system 1 which concerns on Embodiment 2. 人クラスタ形状特徴量抽出部５０２と、点群クラスタフィッティング部５０４と、第２人検出部１１２が実施する処理の詳細を説明する概念図である。It is a conceptual diagram explaining the details of the processing performed by the person cluster shape feature amount extraction unit 502, the point cloud cluster fitting unit 504, and the second person detection unit 112. 実施形態２における人検出システム１が画像内の人を検出する処理を説明するフローチャートである。It is a flowchart explaining the process which the person detection system 1 in Embodiment 2 detects a person in an image. ステップＳ７０３の詳細を説明するフローチャートである。It is a flowchart explaining the detail of step S703. 実施形態３に係る人検出システム１の構成図である。It is a block diagram of the person detection system 1 which concerns on Embodiment 3. 点群密度極大値検出部８０２と第２人検出部１１２による人検出方法の詳細を説明する図である。It is a figure explaining the detail of the person detection method by the point cloud density maximum value detection unit 802 and the second person detection unit 112. 実施形態３における人検出システム１が画像内の人を検出する処理を説明するフローチャートである。It is a flowchart explaining the process which the person detection system 1 in Embodiment 3 detects a person in an image.

実施の形態を説明するための全図において、同一の部材には原則として同一の符号を付し、その繰り返しの説明は省略する。また、以下の実施の形態において、その構成要素（要素ステップ等も含む）は、特に明示した場合および原理的に明らかに必須であると考えられる場合等を除き、必ずしも必須のものではないことは言うまでもない。また、「Ａからなる」、「Ａよりなる」、「Ａを有する」、「Ａを含む」と言うときは、特にその要素のみである旨明示した場合等を除き、それ以外の要素を排除するものでないことは言うまでもない。同様に、以下の実施の形態において、構成要素等の形状、位置関係等に言及するときは、特に明示した場合および原理的に明らかにそうでないと考えられる場合等を除き、実質的にその形状等に近似または類似するもの等を含むものとする。 In all the drawings for explaining the embodiment, the same members are, in principle, given the same reference numerals, and the repeated description thereof will be omitted. Further, in the following embodiments, the components (including element steps and the like) are not necessarily essential unless otherwise specified or clearly considered to be essential in principle. Needless to say. In addition, when saying "consisting of A", "consisting of A", "having A", and "including A", other elements are excluded unless it is clearly stated that it is only that element. It goes without saying that it is not something to do. Similarly, in the following embodiments, when referring to the shape, positional relationship, etc. of a component or the like, the shape is substantially the same unless otherwise specified or when it is considered that it is not apparent in principle. Etc., etc. shall be included.

本明細書等における「第１」、「第２」、「第３」などの表記は、構成要素を識別するために付するものであり、必ずしも、数、順序、もしくはその内容を限定するものではない。また、構成要素の識別のための番号は文脈毎に用いられ、一つの文脈で用いた番号が、他の文脈で必ずしも同一の構成を示すとは限らない。また、ある番号で識別された構成要素が、他の番号で識別された構成要素の機能を兼ねることを妨げるものではない。 The notations such as "first", "second", and "third" in the present specification and the like are attached to identify the components, and do not necessarily limit the number, order, or contents thereof. is not it. In addition, numbers for identifying components are used for each context, and numbers used in one context do not always indicate the same composition in other contexts. Further, it does not prevent the component identified by a certain number from having the function of the component identified by another number.

図面等において示す各構成の位置、大きさ、形状、範囲などは、発明の理解を容易にするため、実際の位置、大きさ、形状、範囲などを表していない場合がある。このため、本発明は、必ずしも、図面等に開示された位置、大きさ、形状、範囲などに限定されない。 The position, size, shape, range, etc. of each configuration shown in the drawings and the like may not represent the actual position, size, shape, range, etc. in order to facilitate understanding of the invention. Therefore, the present invention is not necessarily limited to the position, size, shape, range, etc. disclosed in the drawings and the like.

＜実施の形態１＞
図１は、本発明の実施形態１に係る人検出システム１の構成図である。人検出システム１は、画像内に含まれる人を検出するシステムである。人検出システム１は、撮影処理部１１、第１情報処理部１２、第２情報処理部１３を備える。第１情報処理部１２と第２情報処理部１３は、人検出装置１０を構成することができる（実施形態２以降においても同様）。 <Embodiment 1>
FIG. 1 is a configuration diagram of a person detection system 1 according to the first embodiment of the present invention. The person detection system 1 is a system for detecting a person included in an image. The human detection system 1 includes a photographing processing unit 11, a first information processing unit 12, and a second information processing unit 13. The first information processing unit 12 and the second information processing unit 13 can configure the person detection device 10 (the same applies to the second and subsequent embodiments).

撮影処理部１１は、ＲＧＢカメラ１０１とＴｏＦ（ＴｉｍｅｏｆＦｌｉｇｈｔ）センサ１０２を有する。ＲＧＢカメラ１０１は、各画素が一般的な階調情報（ＲＧＢなど）を持つ画像（以降、ＲＧＢ画像と呼ぶ）を取得しＲＧＢ画像取得部１０３へ出力する。ＴｏＦセンサ１０２は、各画素が距離情報を持つ距離画像を取得し、距離画像取得部１０４へ出力する。距離画像における各画素は、座標変換によって３次元座標に変換でき、それら変換後の３次元座標が示す点の集団を点群と呼ぶ。 The photographing processing unit 11 has an RGB camera 101 and a ToF (Time of Flight) sensor 102. The RGB camera 101 acquires an image (hereinafter referred to as an RGB image) in which each pixel has general gradation information (RGB or the like) and outputs the image to the RGB image acquisition unit 103. The ToF sensor 102 acquires a distance image in which each pixel has distance information and outputs the distance image to the distance image acquisition unit 104. Each pixel in the distance image can be converted into three-dimensional coordinates by coordinate conversion, and the group of points indicated by the three-dimensional coordinates after the conversion is called a point cloud.

第１情報処理部１２は、ＲＧＢ画像取得部１０３、距離画像取得部１０４、ピクセル同期部１０５、同期情報保持部１０６、第１人検出部１０７、点群クラスタリング部１０８、人点群クラスタ対応付け部１０９、人クラスタ除去部１１０を有する。 The first information processing unit 12 includes an RGB image acquisition unit 103, a distance image acquisition unit 104, a pixel synchronization unit 105, a synchronization information holding unit 106, a first person detection unit 107, a point cloud clustering unit 108, and a point cloud cluster association. It has a unit 109 and a person cluster removing unit 110.

ＲＧＢ画像取得部１０３は、ＲＧＢカメラ１０１からＲＧＢ画像を取得し、ピクセル同期部１０５と第１人検出部１０７へ出力する。距離画像取得部１０４は、ＴｏＦセンサ１０２から距離画像を取得し、ピクセル同期部１０５と点群クラスタリング部１０８へ出力する。 The RGB image acquisition unit 103 acquires an RGB image from the RGB camera 101 and outputs it to the pixel synchronization unit 105 and the first person detection unit 107. The distance image acquisition unit 104 acquires a distance image from the ToF sensor 102 and outputs it to the pixel synchronization unit 105 and the point cloud clustering unit 108.

ピクセル同期部１０５は、ＲＧＢ画像取得部１０３から取得したＲＧＢ画像と距離画像取得部１０４から取得した距離画像の各画素の位置を同期させ、両画像における各画素の対応関係（以下、ピクセル同期情報と呼ぶ）を同期情報保持部１０６へ出力する。この同期処理はＲＧＢ画像と距離画像を取得する度に実施してもよいし、ＲＧＢ画像と距離画像を初めに取得した時のみ実施してもよく、ピクセル同期部１０５の処理頻度は限定されない。同期処理の詳細は後述する。 The pixel synchronization unit 105 synchronizes the positions of each pixel of the RGB image acquired from the RGB image acquisition unit 103 and the distance image acquired from the distance image acquisition unit 104, and the correspondence relationship of each pixel in both images (hereinafter, pixel synchronization information). Is output to the synchronization information holding unit 106. This synchronization processing may be performed each time the RGB image and the distance image are acquired, or may be performed only when the RGB image and the distance image are first acquired, and the processing frequency of the pixel synchronization unit 105 is not limited. The details of the synchronization process will be described later.

同期情報保持部１０６は、ピクセル同期部１０５から取得したピクセル同期情報を保持し、人点群クラスタ対応付け部１０９へピクセル同期情報を出力する。 The synchronization information holding unit 106 holds the pixel synchronization information acquired from the pixel synchronization unit 105, and outputs the pixel synchronization information to the person point cloud cluster association unit 109.

第１人検出部１０７は、ＲＧＢ画像取得部１０３から取得したＲＧＢ画像に対して人検出を実施し、人検出数と人検出領域情報を人点群クラスタ対応付け部１０９と第２人検出部１１２へ出力する。例えば、ＣＮＮ（ＣｏｎｖｏｌｕｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ）を用いた画像認識アルゴリズムにより人の全身や顔部分を検出し、検出した人の数と位置をそれぞれ示す人検出数と人検出領域情報を人点群クラスタ対応付け部１０９と第２人検出部１１２へ出力する。人検出数が０の場合、検出数は０として人検出領域情報は出力しない。 The first person detection unit 107 performs human detection on the RGB image acquired from the RGB image acquisition unit 103, and transmits the number of people detected and the person detection area information to the person point cloud cluster association unit 109 and the second person detection unit. Output to 112. For example, an image recognition algorithm using CNN (Convolutional Neural Network) is used to detect the whole body and face of a person, and the number of people detected and the person detection area information indicating the number and position of the detected people are associated with a point cloud cluster. Output to unit 109 and second person detection unit 112. When the number of detected persons is 0, the number of detected persons is set to 0 and the person detection area information is not output.

点群クラスタリング部１０８は、距離画像取得部１０４から取得した距離画像を点群へ変換し、点群を幾つかの塊（以下、点群クラスタと呼ぶ）として一定密度以上の値で密集する点群を空間的にグルーピングする。点群をグルーピングするその他の手法としては、例えば、点群の分布をＧＭＭ（ＧａｕｓｓｉａｎＭｉｘｔｕｒｅＭｏｄｅｌ）によって再現したときの、各点群座標に対するＧａｕｓｓ分布モデルの尤度にしたがってグルーピングすることができる。その他適当なグルーピング手法を用いてもよい。点群クラスタリング部１０８は、グルーピングした点群クラスタを人点群クラスタ対応付け部１０９へ出力する。このグルーピング処理のことをクラスタリングとも呼ぶ。 The point cloud clustering unit 108 converts the distance image acquired from the distance image acquisition unit 104 into a point cloud, and the point cloud is densely packed with a value of a certain density or more as a number of agglomerates (hereinafter, referred to as a point cloud cluster). Group the groups spatially. As another method for grouping the point cloud, for example, the distribution of the point cloud can be grouped according to the likelihood of the Gauss distribution model with respect to the coordinates of each point cloud when the distribution of the point cloud is reproduced by GMM (Gaussian Mixture Model). Other suitable grouping methods may be used. The point cloud clustering unit 108 outputs the grouped point cloud clusters to the human point cloud cluster association unit 109. This grouping process is also called clustering.

人点群クラスタ対応付け部１０９は、同期情報保持部１０６から読み出したピクセル同期情報に基づいて、第１人検出部１０７から取得した人検出領域情報を点群クラスタリング部１０８から取得した点群クラスタに対応付け（この点群クラスタを領域対応点群クラスタと呼ぶ）、人クラスタ除去部１１０へ出力する。このとき領域対応点群クラスタには、人検出領域情報が対応付けられた人対応点群クラスタと人検出領域が対応付けられていない人未対応点群クラスタが混在する。第１人検出部１０７から取得した人検出数が０の場合は、点群クラスタに対して対応付け処理は実施しない。このとき、人クラスタ除去部１１０へ出力される点群クラスタは人未対応点群クラスタと等しい。 The point cloud cluster association unit 109 acquires the person detection area information acquired from the first person detection unit 107 from the point cloud clustering unit 108 based on the pixel synchronization information read from the synchronization information holding unit 106. (This point cloud cluster is called a region-corresponding point cloud cluster) and is output to the human cluster removal unit 110. At this time, the area-corresponding point cloud cluster includes a person-corresponding point cloud cluster to which the person detection area information is associated and a person-unsupported point cloud cluster to which the person detection area is not associated. When the number of detected persons acquired from the first person detection unit 107 is 0, the association processing is not performed on the point cloud cluster. At this time, the point cloud cluster output to the human cluster removal unit 110 is equal to the human unsupported point cloud cluster.

人クラスタ除去部１１０は、人点群クラスタ対応付け部１０９から取得した領域対応点群クラスタから、人対応点群クラスタを除去し、動体検出部１１１へ人未対応点群クラスタを出力する。 The human cluster removal unit 110 removes the human-corresponding point cloud from the area-corresponding point cloud cluster acquired from the human-point group cluster mapping unit 109, and outputs the non-human-corresponding point cloud to the moving object detection unit 111.

第２情報処理部１３は、動体検出部１１１と第２人検出部１１２を含む。 The second information processing unit 13 includes a moving object detection unit 111 and a second person detection unit 112.

動体検出部１１１は、人クラスタ除去部１１０から取得した人未対応点群クラスタの移動動作を表すパラメータを取得する。例えば人未対応点群クラスタの移動量（例えば、最初に除去した位置からの移動距離）や速度などをそのパラメータとして検出し、その情報を第２人検出部１１２へ出力する。 The moving object detection unit 111 acquires a parameter representing the movement operation of the human uncorresponding point cloud cluster acquired from the human cluster removal unit 110. For example, the movement amount (for example, the movement distance from the first removed position) and the speed of the unsupported point cloud cluster are detected as the parameters, and the information is output to the second person detection unit 112.

第２人検出部１１２は、動体検出部１１１から取得した人未対応点群クラスタの移動量や速度などの情報が人相当の値であるかどうかを閾値にしたがって判断し、その判断結果を第１人検出部１０７から取得した人検出数と人検出領域情報へ統合することにより、改めて人検出数と人検出領域情報を生成する。この判断基準としては、例えば、（ａ）移動距離が５０ｃｍを超えていれば人として判断する、（ｂ）速度が一般的な人の歩行速度（４ｋｍ／ｈ）程度であれば人として判断する、などが考えられる。一度、人未対応点群クラスタを人として検出すれば、ＲＧＢカメラ１０１とＴｏＦセンサ１０２の視野領域を外れない限り、止まっていても追従する。 The second person detection unit 112 determines whether or not the information such as the movement amount and the speed of the human uncorresponding point cloud cluster acquired from the moving object detection unit 111 is a value equivalent to a person, and determines the determination result according to the threshold value. By integrating the number of people detected and the person detection area information acquired from the one person detection unit 107, the number of people detected and the person detection area information are generated again. As the judgment criteria, for example, (a) if the moving distance exceeds 50 cm, it is judged as a person, and (b) if the speed is about a general walking speed (4 km / h), it is judged as a person. , Etc. can be considered. Once the non-human point cloud cluster is detected as a human, it follows even if it is stopped, as long as it does not deviate from the visual field region of the RGB camera 101 and the ToF sensor 102.

図２は、ＲＧＢカメラ１０１とＴｏＦセンサ１０２によって同じ１組のマーカを撮影する状況を示している。図２を用いて、ピクセル同期部１０５が実施する同期処理の詳細を述べる。ＲＧＢカメラ１０１とＴｏＦセンサ１０２がそれぞれ撮影する画像のスケールは等しく、互いに歪曲していないものとする。 FIG. 2 shows a situation in which the same set of markers is photographed by the RGB camera 101 and the ToF sensor 102. The details of the synchronization processing performed by the pixel synchronization unit 105 will be described with reference to FIG. It is assumed that the scales of the images captured by the RGB camera 101 and the ToF sensor 102 are the same and are not distorted from each other.

処理フロー２１は、ＲＧＢカメラ１０１が撮影したマーカについて２つの画素位置を検出している処理を示す。処理フロー２２は、ＴｏＦセンサ１０２が撮影したマーカついて２つの頂点の画素位置を検出している処理を示す。処理フロー２３は、同期に必要な情報を検出している処理を示す。 The processing flow 21 shows a process of detecting two pixel positions of the marker photographed by the RGB camera 101. The processing flow 22 shows a process of detecting the pixel positions of two vertices of the marker photographed by the ToF sensor 102. The processing flow 23 indicates a process of detecting information necessary for synchronization.

ＲＧＢ画像２０１は、ＲＧＢ画像取得部１０３がＲＧＢカメラ１０１から取得するＲＧＢ画像である。距離画像２０２は、距離画像取得部１０４がＴｏＦセンサ１０２から取得する距離画像である。マーカ２０３と２０４は、ＲＧＢ画像２０１上のマーカである。マーカ２０５と２０６は、距離画像２０２上のマーカである。 The RGB image 201 is an RGB image acquired by the RGB image acquisition unit 103 from the RGB camera 101. The distance image 202 is a distance image acquired by the distance image acquisition unit 104 from the ToF sensor 102. The markers 203 and 204 are markers on the RGB image 201. The markers 205 and 206 are markers on the distance image 202.

ＲＧＢ画像２０１上の２つのピクセル座標（ｘ１１，ｙ１１）において見えるマーカ２０３とピクセル座標（ｘ１２，ｙ１２）において見えるマーカ２０４は、距離画像２０２上の２つのピクセル座標（ｘ２１，ｙ２１）と（ｘ２２，ｙ２２）においても見える（マーカ２０５と２０６）。このときの、ＲＧＢ画像２０１と距離画像２０２の間でマーカの同じ部位を参照するための画素間の位置関係（ピクセル同期情報）を求めればよい。例えば、マーカ２０３と２０４がマーカ２０５と２０６に重なるようにオフセット（ｇｘ，ｇｙ）と回転角φを設定し、数１と数２を導出する。数１と数２を連立して解くことにより、オフセット（ｇｘ，ｇｙ）が数３と数４で求まり、回転角φは数５において逆正接関数から求まる。算出したオフセット（ｇｘ，ｇｙ）と回転角φを数６に適用することにより、ＲＧＢ画像２０１上の任意のピクセル座標（ｘＲＧＢ，ｙＲＧＢ）を距離画像２０２上のピクセル座標（ｘＴｏＦ，ｙＴｏＦ）に対応付けることができる。 The marker 203 visible at the two pixel coordinates (x11, y11) on the RGB image 201 and the marker 204 visible at the pixel coordinates (x12, y12) are the two pixel coordinates (x21, y21) and (x22, x22, on the distance image 202). It is also visible in y22) (markers 205 and 206). At this time, the positional relationship (pixel synchronization information) between the pixels for referencing the same portion of the marker between the RGB image 201 and the distance image 202 may be obtained. For example, the offset (gx, gy) and the rotation angle φ are set so that the markers 203 and 204 overlap the markers 205 and 206, and the numbers 1 and 2 are derived. By solving the equations 1 and 2 simultaneously, the offset (gx, gy) can be obtained by the equations 3 and 4, and the angle of rotation φ can be obtained by the inverse trigonometric function in the equation 5. By applying the calculated offset (gx, gy) and rotation angle φ to Equation 6, any pixel coordinates (xRGB, yRGB) on the RGB image 201 are associated with the pixel coordinates (xToF, yToF) on the distance image 202. be able to.

同期情報保持部１０６は、座標（ｇｘ，ｇｙ）と回転角φを保持しておく。人点群クラスタ対応付け部１０９は、数６にしたがって人検出領域の各画素の位置を点群クラスタへ対応付ける際に、座標（ｇｘ，ｇｙ）と回転角φを参照する。 The synchronization information holding unit 106 holds the coordinates (gx, gy) and the rotation angle φ. The point cloud cluster association unit 109 refers to the coordinates (gx, gy) and the angle of rotation φ when associating the position of each pixel in the person detection region with the point cloud cluster according to Equation 6.

同期処理は、人検出システム１を設置するとき１度だけ手動などで実施してもよいし、ＲＧＢ画像２０１と距離画像２０２を取得する度に、取得した画像内の任意の形状に関するテンプレートマッチング手法を用いて自動的に画素間の関係を算出し、得られた座標（ｇｘ，ｇｙ）と回転角φを任意のタイミング（例えば１時間に１度）で同期情報保持部１０６へ格納してもよい。 The synchronization process may be performed manually only once when the person detection system 1 is installed, or a template matching method for an arbitrary shape in the acquired image each time the RGB image 201 and the distance image 202 are acquired. Even if the relationship between the pixels is automatically calculated using the above and the obtained coordinates (gx, gy) and the rotation angle φ are stored in the synchronization information holding unit 106 at an arbitrary timing (for example, once an hour). Good.

ＲＧＢカメラ１０１とＴｏＦセンサ１０２が撮影した両画像のスケールが互いに異なる場合は、アフィン変換によってマーカ検出領域のサイズが等しくなるような等倍変換をした後に同期処理を実施すればよい。ＲＧＢカメラ１０１が撮影した画像が歪曲している場合は、あらかじめ様々な視点から撮影しておいたチェスボードを複数枚用いて歪み補正処理をした後に同期処理を実施すればよい。上記のどちらの場合も起こった場合は、各対処を組み合わせればよい。 When the scales of the two images captured by the RGB camera 101 and the ToF sensor 102 are different from each other, the synchronization process may be performed after performing the same magnification conversion so that the size of the marker detection area becomes the same by the affine transformation. When the image captured by the RGB camera 101 is distorted, the distortion correction process may be performed using a plurality of chess boards captured from various viewpoints in advance, and then the synchronization process may be performed. If either of the above cases occurs, each countermeasure may be combined.

図３は、人検出システム１が画像内の人を検出する処理を説明するフローチャートである。以下図３の各ステップについて説明する。 FIG. 3 is a flowchart illustrating a process in which the human detection system 1 detects a person in an image. Each step of FIG. 3 will be described below.

（図３：ステップＳ３０１〜Ｓ３０３）
ＲＧＢ画像取得部１０３は、ＲＧＢカメラ１０１からＲＧＢ画像を取得する（Ｓ３０１）。距離画像取得部１０４は、ＴｏＦセンサ１０２から距離画像を取得し、取得した距離画像から点群を生成する（Ｓ３０２）。ピクセル同期部１０５は、図２で例示した手法により、ＲＧＢ画像の画素位置と距離画像の画素位置を同期することにより、座標（ｇｘ，ｇｙ）と回転角φを算出する（Ｓ３０３）。 (FIG. 3: Steps S301 to S303)
The RGB image acquisition unit 103 acquires an RGB image from the RGB camera 101 (S301). The distance image acquisition unit 104 acquires a distance image from the ToF sensor 102 and generates a point cloud from the acquired distance image (S302). The pixel synchronization unit 105 calculates the coordinates (gx, gy) and the angle of rotation φ by synchronizing the pixel positions of the RGB image and the pixel positions of the distance image by the method illustrated in FIG. 2 (S303).

（図３：ステップＳ３０４〜Ｓ３０５）
第１人検出部１０７は、ＲＧＢ画像に対して人検出処理を実施することにより、人検出領域情報と人検出数を生成する（Ｓ３０４）。点群クラスタリング部１０８は、点群に対してクラスタリング処理を実施することにより、点群クラスタを生成する（Ｓ３０５）。 (FIG. 3: Steps S304 to S305)
The first person detection unit 107 generates the person detection area information and the number of people detected by performing the person detection process on the RGB image (S304). The point cloud clustering unit 108 generates a point cloud cluster by performing a clustering process on the point cloud (S305).

（図３：ステップＳ３０６）
ステップＳ３０４において生成した人検出数が０より大きければステップＳ３０７へ進み、ステップＳ３０４にて生成した人検出数が０の場合はステップＳ３１０へスキップする。 (FIG. 3: Step S306)
If the number of detected persons generated in step S304 is larger than 0, the process proceeds to step S307, and if the number of detected persons generated in step S304 is 0, the process skips to step S310.

（図３：ステップＳ３０７）
人点群クラスタ対応付け部１０９は、ステップＳ３０４において検出した人検出領域情報を、ステップＳ３０５においてクラスタリングした点群クラスタへ対応付けることにより、人対応点群クラスタと人未対応点群クラスタに分類する。 (FIG. 3: Step S307)
The person point cloud cluster association unit 109 classifies the person corresponding point cloud cluster and the person unsupported point cloud cluster by associating the person detection area information detected in step S304 with the point cloud cluster clustered in step S305.

（図３：ステップＳ３０８）
人クラスタ除去部１１０はステップＳ３０７にて人検出領域情報を対応付けた人対応点群クラスタを除去する。本ステップは、人未対応点群クラスタを明確に識別するための便宜上のものであり、人未対応点群クラスタを識別できるのであれば必ずしも実施しなくともよい。 (Fig. 3: Step S308)
In step S307, the person cluster removing unit 110 removes the person corresponding point cloud cluster associated with the person detection area information. This step is for convenience to clearly identify the unsupported point cloud cluster, and does not necessarily have to be performed as long as the unsupported point cloud cluster can be identified.

（図３：ステップＳ３０９〜Ｓ３１０）
動体検出部１１１は、ステップＳ３０８後に残った人未対応点群クラスタを検出する（Ｓ３０９）。動体検出部１１１は、ステップＳ３０９において検出した人未対応点群クラスタの移動動作パラメータ（例えば移動量、速度など）を検出する（Ｓ３１０）。 (FIG. 3: Steps S309 to S310)
The moving object detection unit 111 detects the human unsupported point cloud cluster remaining after step S308 (S309). The moving object detection unit 111 detects the movement operation parameters (for example, movement amount, speed, etc.) of the human unsupported point cloud cluster detected in step S309 (S310).

（図３：ステップＳ３１１）
第２人検出部１１２は、ステップＳ３１０において検出した移動動作パラメータが人相当であるかどうかを判断する。第２人検出部１１２は、その判断結果を、第１人検出部１０７から取得した人検出数と人検出領域情報へ統合することにより、改めて人検出数と人検出領域情報を生成する。 (Fig. 3: Step S311)
The second person detection unit 112 determines whether or not the movement operation parameter detected in step S310 is equivalent to a person. The second person detection unit 112 regenerates the number of people detected and the person detection area information by integrating the determination result into the number of people detected and the person detection area information acquired from the first person detection unit 107.

（図３：ステップＳ３１２）
人検出システム１は、終了コマンドが発生したか否かを確認する。終了コマンドの例としては、内部から例外処理などによる終了コードが発生した、外部から手動によって終了コマンドが入力された、などの例が挙げられる。終了コマンドがあれば本フローチャートを終了し、それ以外であればステップＳ３０１に戻って同じ処理を繰り返す。 (Fig. 3: Step S312)
The human detection system 1 confirms whether or not a termination command has occurred. Examples of the exit command include an exit code generated by exception handling from the inside, and an exit command manually input from the outside. If there is an end command, this flowchart is ended, otherwise the process returns to step S301 and the same process is repeated.

図４Ａは、撮影環境の１例である。図４Ａにおいて、２人が重なり後方の人の一部が前方の人に隠れている。したがってＲＧＢ画像に対する人検出処理のみを用いた場合は後方の人が検出できず、距離画像のみを用いた場合は２人が一塊としてクラスタリングされてしまう。このような状況において、図３のフローチャートにより２人を検出できることを説明する。 FIG. 4A is an example of a shooting environment. In FIG. 4A, two people overlap and a part of the person behind is hidden by the person in front. Therefore, when only the person detection process for the RGB image is used, the person behind cannot be detected, and when only the distance image is used, two people are clustered as a lump. In such a situation, it will be described that two people can be detected by the flowchart of FIG.

ＲＧＢ画像４０１Ａは、ＲＧＢ画像取得部１０３がＲＧＢカメラから取得するＲＧＢ画像である。距離画像４０２Ａは、距離画像取得部１０４がＴｏＦセンサ１０２から取得する距離画像である。視野領域４０３Ａは、ＲＧＢカメラ１０１の視野領域である。視野領域４０４Ａは、ＴｏＦセンサ１０２の視野領域である。被写体４０５Ａと被写体４０６Ａは、視野領域４０３Ａと視野領域４０４Ａの範囲内に存在する人物である。被写体４０７Ａと被写体４０８Ａは、各々ＲＧＢカメラ１０１で撮影し取得したＲＧＢ画像４０１Ａ上の被写体４０５Ａと４０６Ａである。被写体４０９Ａと４１０Ａは、各々ＴｏＦセンサ１０２で撮影し取得した距離画像４０２Ａ上の被写体４０５Ａと４０６Ａである。 The RGB image 401A is an RGB image acquired by the RGB image acquisition unit 103 from the RGB camera. The distance image 402A is a distance image acquired by the distance image acquisition unit 104 from the ToF sensor 102. The field of view area 403A is the field of view area of the RGB camera 101. The visual field region 404A is the visual field region of the ToF sensor 102. The subject 405A and the subject 406A are persons who exist within the range of the visual field area 403A and the visual field area 404A. The subject 407A and the subject 408A are the subjects 405A and 406A on the RGB image 401A captured and acquired by the RGB camera 101, respectively. The subjects 409A and 410A are the subjects 405A and 406A on the distance image 402A captured and acquired by the ToF sensor 102, respectively.

図４Ｂは、図３のフローチャートにしたがって図４Ａにおける被写体４０５Ａと４０６Ａを検出する手順を示す処理フロー図である。以下図４Ｂにしたがって、２人の被写体４０５Ａと４０６Ａを識別する処理を説明する。 FIG. 4B is a processing flow diagram showing a procedure for detecting the subjects 405A and 406A in FIG. 4A according to the flowchart of FIG. Hereinafter, a process of distinguishing two subjects 405A and 406A will be described with reference to FIG. 4B.

処理フロー４１Ｂは、図４ＡにおいてＲＧＢカメラ１０１とＴｏＦセンサ１０２から取得した画像に対する処理を示す。ＲＧＢカメラ１０１とＴｏＦセンサ１０２の視点においては、後ろの被写体４０８Ａと４１０Ａの一部が前の被写体４０７Ａと４０９Ａに隠れており、ステップＳ３０１とステップＳ３０２において取得したＲＧＢ画像と距離画像はこれを反映している。ＲＧＢ画像上においては、前方の被写体４０７Ａのみが検出され人検出領域情報４０１Ｂのみが出力される（ステップＳ３０４）。距離画像上においては、被写体４０９Ａと４１０Ａが一塊の点群クラスタ４０２Ｂとしてクラスタリングされる（ステップＳ３０５）。人検出領域情報４０１Ｂを点群クラスタ４０２Ｂへ対応付けると（ステップＳ３０７）、人対応点群クラスタ４０３Ｂと人未対応点群クラスタ４０４Ｂに分かれる。人対応点群クラスタ４０３Ｂを除去し（ステップＳ３０８）、人未対応点群クラスタ４０４Ｂを検出し（ステップＳ３０９）、検出点群領域４０５Ｂを出力する。 The processing flow 41B shows the processing for the image acquired from the RGB camera 101 and the ToF sensor 102 in FIG. 4A. From the viewpoint of the RGB camera 101 and the ToF sensor 102, a part of the rear subjects 408A and 410A is hidden by the front subjects 407A and 409A, and the RGB images and the distance images acquired in steps S301 and S302 reflect this. doing. On the RGB image, only the subject 407A in front is detected and only the person detection area information 401B is output (step S304). On the distance image, the subjects 409A and 410A are clustered as a single point cloud cluster 402B (step S305). When the person detection area information 401B is associated with the point cloud cluster 402B (step S307), it is divided into a person corresponding point group cluster 403B and a person unsupported point group cluster 404B. The human-corresponding point cloud cluster 403B is removed (step S308), the unhumanized point cloud cluster 404B is detected (step S309), and the detection point group region 405B is output.

処理フロー４２Ｂは、処理フロー４１Ｂから１フレーム分の時間ステップを経てＲＧＢカメラ１０１とＴｏＦセンサ１０２から取得した画像に対する処理を示す。人検出領域情報４０１Ｂは、処理フロー４１Ｂにおいて、ステップＳ３０４によって検出した被写体４０７Ａの検出情報である。点群クラスタ４０２Ｂは、処理フロー４１Ｂにおいて、ステップＳ３０５によって被写体４０９Ａと４１０Ａが一塊にクラスタリングされたものである。検出点群領域４０５Ｂは、処理フロー４１Ｂにおける、人未対応点群クラスタ４０４Ｂの検出結果を表す情報である。人未対応点群クラスタ４０６Ｂは、処理フロー４２Ｂにおいて、処理フロー４１Ｂの時点での人未対応点群クラスタ４０４Ｂが移動したものである。検出点群領域４０７Ｂは、処理フロー４２Ｂにおいて、人未対応点群クラスタ４０６Ｂの検出結果を表す情報である。 The processing flow 42B shows processing for an image acquired from the RGB camera 101 and the ToF sensor 102 after a time step of one frame from the processing flow 41B. The person detection area information 401B is the detection information of the subject 407A detected in step S304 in the processing flow 41B. In the point cloud cluster 402B, the subjects 409A and 410A are clustered together by step S305 in the processing flow 41B. The detection point group area 405B is information representing the detection result of the human unsupported point cloud cluster 404B in the processing flow 41B. The unhumanized point cloud cluster 406B is a movement of the unhumanized point cloud cluster 404B at the time of the processing flow 41B in the processing flow 42B. The detection point group region 407B is information representing the detection result of the human unsupported point cloud cluster 406B in the processing flow 42B.

処理フロー４２Ｂにおいて、処理フロー４１Ｂと同様の処理を経て、人未対応点群クラスタ４０４Ｂは１フレーム分の時間ステップを経たことにより、移動して人未対応点群クラスタ４０６Ｂとなる。これを検出することにより検出点群領域４０７Ｂを出力する。ステップＳ３１０において、検出点群領域４０５Ｂと４０７Ｂの間の移動動作パラメータ４０８Ｂ（移動量または速度）を検出する。 In the processing flow 42B, after undergoing the same processing as in the processing flow 41B, the unsupported point cloud cluster 404B moves to become the unsupported human point cloud cluster 406B after undergoing a time step for one frame. By detecting this, the detection point group area 407B is output. In step S310, the movement operation parameter 408B (movement amount or velocity) between the detection point group regions 405B and 407B is detected.

＜実施の形態１：まとめ＞
本実施形態１に係る人検出システム１は、ＲＧＢ画像において検出した人検出領域を点群クラスタと対応付けることにより、人未対応点群クラスタを識別し、人未対応点群クラスタの移動動作パラメータにしたがって、日と未対応点群クラスタが人であるか否かを判断する。これにより、複数の人が密接しており、ＲＧＢ画像に対する人検出処理のみでは後方の人を検出できず、点群に対する点群クラスタリング処理では複数の人を一塊としてクラスタリングしてしまう状況であっても、正確に複数の人を検出できる。 <Embodiment 1: Summary>
The person detection system 1 according to the first embodiment identifies the unsupported point cloud cluster by associating the detected human detection area in the RGB image with the point cloud cluster, and sets it as a movement operation parameter of the unsupported point cloud cluster. Therefore, it is determined whether or not the day and the unpaired point cloud cluster are humans. As a result, a plurality of people are in close contact with each other, and the person behind the RGB image cannot be detected only by the person detection process, and the point cloud clustering process for the point cloud clusters the plurality of people as a group. However, it can accurately detect multiple people.

＜実施の形態２＞
本発明の実施形態２では、人未対応点群クラスタの移動動作パラメータに代えて、点群クラスタの形状に関する特徴量にしたがって人を検出する構成例を説明する。その他構成は実施形態１と同様であるので、以下では主に差異点について説明する。 <Embodiment 2>
In the second embodiment of the present invention, a configuration example in which a person is detected according to a feature amount related to the shape of the point cloud cluster will be described instead of the movement operation parameter of the point cloud cluster that does not correspond to a person. Since other configurations are the same as those in the first embodiment, the differences will be mainly described below.

図５は、本実施形態２に係る人検出システム１の構成図である。本実施形態２において第２情報処理部１３は、動体検出部１１１に代えて点群クラスタフィッティング部５０４を備える。第２情報処理部１３はさらに、人クラスタ形状特徴量抽出部５０２と特徴量保持部５０３と点群クラスタフィッティング部５０４を備える。 FIG. 5 is a configuration diagram of the person detection system 1 according to the second embodiment. In the second embodiment, the second information processing unit 13 includes a point cloud cluster fitting unit 504 instead of the moving object detection unit 111. The second information processing unit 13 further includes a human cluster shape feature amount extraction unit 502, a feature amount holding unit 503, and a point cloud cluster fitting unit 504.

人点群クラスタ対応付け部１０９は、実施形態１で説明した処理に加え、人クラスタ形状特徴量抽出部５０２に対して人対応点群クラスタを出力する。人クラスタ形状特徴量抽出部５０２は、人点群クラスタ対応付け部１０９から取得した人対応点群クラスタの形状に関する特徴量を抽出し、特徴量保持部５０３へ出力する。 In addition to the processing described in the first embodiment, the person point cloud cluster association unit 109 outputs the person correspondence point group cluster to the person cluster shape feature amount extraction unit 502. The person cluster shape feature amount extraction unit 502 extracts the feature amount related to the shape of the person correspondence point group cluster acquired from the person point cloud cluster association unit 109 and outputs the feature amount to the feature amount holding unit 503.

特徴量保持部５０３は、人クラスタ形状特徴量抽出部５０２から特徴量を取得し、取得した特徴量や以前保持した特徴量に応じて、取得した特徴量を保持し、保持した特徴量を点群クラスタフィッティング部５０４へ出力する。一定期間の周期で、人クラスタ形状特徴量抽出部５０２から取得した特徴量を保持してもよいし、人クラスタ形状特徴量抽出部５０２から特徴量を取得する度に保持してもよい。あるいは、人クラスタ形状特徴量抽出部５０２から取得した特徴量が特徴量保持部５０３内に保持する特徴量のいずれとも大きく異なる場合に、人クラスタ形状特徴量抽出部５０２から取得した特徴量を逐次保持してもよいし、人クラスタ形状特徴量抽出部５０２から取得した特徴量をそのまま出力してもよい。保持されている特徴量をランダムに１つ以上出力してもよい。出力の規則はランダムに限るものではない。 The feature amount holding unit 503 acquires the feature amount from the human cluster shape feature amount extracting unit 502, holds the acquired feature amount according to the acquired feature amount and the previously held feature amount, and points the retained feature amount. Output to the group cluster fitting unit 504. The feature amount acquired from the human cluster shape feature amount extraction unit 502 may be retained in a cycle of a certain period, or may be retained every time the feature amount is acquired from the human cluster shape feature amount extraction unit 502. Alternatively, when the feature amount acquired from the human cluster shape feature amount extraction unit 502 is significantly different from any of the feature amounts held in the feature amount holding unit 503, the feature amount acquired from the human cluster shape feature amount extraction unit 502 is sequentially added. It may be retained, or the feature amount acquired from the human cluster shape feature amount extraction unit 502 may be output as it is. One or more retained features may be output at random. Output rules are not limited to random.

点群クラスタフィッティング部５０４は、特徴量保持部５０３から取得した特徴量にしたがって、人クラスタ除去部１１０から取得した人未対応点群クラスタに対して特徴量を計算してフィッティング処理を実施し、第２人検出部に対してフィッティング結果を出力する。 The point cloud cluster fitting unit 504 calculates the feature amount for the unsupported point cloud cluster acquired from the person cluster removal unit 110 according to the feature amount acquired from the feature amount holding unit 503, and performs the fitting process. The fitting result is output to the second person detection unit.

第２人検出部１１２は、点群クラスタフィッティング部５０４から取得したフィッティング結果に基づいて、フィッティングが十分であるかどうかを判断する。フィッティングが十分であれば、第１人検出部１０７から取得した人検出数と人検出領域情報をそのフィテッィング結果と統合することにより、改めて人検出数と人検出領域情報を生成する。 The second person detection unit 112 determines whether or not the fitting is sufficient based on the fitting result obtained from the point cloud cluster fitting unit 504. If the fitting is sufficient, the number of people detected and the person detection area information are generated again by integrating the number of people detected and the person detection area information acquired from the first person detection unit 107 with the fitting result.

図６は、人クラスタ形状特徴量抽出部５０２と、点群クラスタフィッティング部５０４と、第２人検出部１１２が実施する処理の詳細を説明する概念図である。ここでは人の顔部分を球体（図６の符号６０１と６０６）によって模式的に表した例を用いて、人対応点群クラスタと人未対応点群クラスタを対応付ける処理を説明する。 FIG. 6 is a conceptual diagram illustrating details of the processing performed by the human cluster shape feature amount extraction unit 502, the point cloud cluster fitting unit 504, and the second person detection unit 112. Here, a process of associating a human-corresponding point cloud cluster with a human-uncorresponding point group cluster will be described using an example in which a human face portion is schematically represented by spheres (reference numerals 601 and 606 in FIG. 6).

人対応点群クラスタ６０１は、人点群クラスタ対応付け部１０９から取得した人対応点群クラスタである。代表点６０２は、それぞれ人対応点群クラスタ６０１のなかのある一点である。例えば、人対応点群クラスタ６０１の点群を等間隔に間引いた後に残った点を代表点６０２として採用すればよい。参照球６０３は、代表点６０２を中心としたある半径の球である。法線ベクトル６０４は、代表点６０２から参照球６０３内の各点へのベクトルに基づき生成した共分散行列の固有ベクトルのうち、固有値が最小となるベクトルである。 The person-corresponding point cloud cluster 601 is a person-corresponding point cloud cluster acquired from the person-point cloud cluster association unit 109. Each of the representative points 602 is a certain point in the person-corresponding point cloud cluster 601. For example, the points remaining after thinning out the point clouds of the human-corresponding point cloud cluster 601 at equal intervals may be adopted as the representative points 602. The reference sphere 603 is a sphere having a certain radius centered on the representative point 602. The normal vector 604 is a vector having the smallest eigenvalue among the eigenvectors of the covariance matrix generated based on the vector from the representative point 602 to each point in the reference sphere 603.

人特徴量６０５は、代表点６０２から参照球６０３内の各点へのベクトルと、法線ベクトル６０４との間の内角についての度数分布である。この度数分布は、人対応点群クラスタ６０１（すなわち人の顔）の表面形状を表している。人未対応点群クラスタ内にも人の顔が存在するのであれば、人未対応点群クラスタも類似する形状の度数分布を有していると考えられる。 The human feature amount 605 is a frequency distribution for the internal angle between the vector from the representative point 602 to each point in the reference sphere 603 and the normal vector 604. This frequency distribution represents the surface shape of the human-corresponding point cloud cluster 601 (that is, the human face). If a human face also exists in the human uncorresponding point cloud cluster, it is considered that the human uncorresponding point cloud cluster also has a frequency distribution having a similar shape.

人特徴量６０５を求める手法として、この他にも例えば、ＦＰＦＨ（ＦａｓｔＰｏｉｎｔＦｅａｔｕｒｅＨｉｓｔｏｇｒａｍ）特徴量やＳＨＯＴ（ＳｉｇｎａｔｕｒｅｏｆＨｉｓｔｏｇｒａｍｓｏｆＯｒｉｅｎＴａｔｉｏｎｓ）特徴量などの抽出手法を用いることができる。 As a method for obtaining the human feature amount 605, for example, an extraction method such as an FPFH (Fast Point Histogram) feature amount or a SHOT (Signature of Histograms of Origin States) feature amount can be used.

人未対応点群クラスタ６０６は、人クラスタ除去部１１０から取得した人未対応点群クラスタである。代表点６０７、６１０、６１３は、人未対応点群クラスタ６０６のなかのある一点である。法線ベクトル６０９、６１２、６１５は、参照球６０８、６１１、６１４それぞれの法線ベクトルである。各法線ベクトルは、参照球６０３において法線ベクトル６０４を求める手法と同様に得ることができる。人未対応特徴量６１６、６１７、６１８は、代表点６０７、６１０、６１３について人特徴量６０５と同様にして得られる。 The human unsupported point cloud cluster 606 is a human unsupported point cloud cluster acquired from the human cluster removal unit 110. The representative points 607, 610, and 613 are one point in the unsupported point cloud cluster 606. The normal vectors 609, 612, and 615 are normal vectors of the reference spheres 608, 611, and 614, respectively. Each normal vector can be obtained in the same manner as the method for obtaining the normal vector 604 in the reference sphere 603. The human feature amounts 616, 617, and 618 are obtained for the representative points 607, 610, and 613 in the same manner as the human feature amount 605.

人クラスタ形状特徴量抽出部５０２は、人対応点群クラスタ６０１の代表点６０２において、参照球６０３を設定して法線ベクトル６０４を計算することにより、人特徴量６０５を求める。 The human cluster shape feature amount extraction unit 502 obtains the human feature amount 605 by setting the reference sphere 603 and calculating the normal vector 604 at the representative point 602 of the human correspondence point group cluster 601.

点群クラスタフィッティング部５０４は、人未対応点群クラスタ６０６に対して上記と同様の方法で人未対応点群クラスタ６０６の代表点全てについて人未対応特徴量を計算する。具体的には、代表点６０７、６１０、６１３について、参照球６０８、６１１、６１４を設定して法線ベクトル６０９、６１２、６１５を計算することにより、人未対応特徴量６１６〜６１８を求める。 The point cloud cluster fitting unit 504 calculates the unhumanized feature amount for all the representative points of the unhumanized point cloud cluster 606 with respect to the unhumanized point cloud cluster 606 in the same manner as described above. Specifically, for the representative points 607, 610, and 613, the reference spheres 608, 611, and 614 are set and the normal vectors 609, 612, and 615 are calculated to obtain the human-unsupported feature amounts 616 to 618.

点群クラスタフィッティング部５０４は、人未対応特徴量６１６〜６１８のうち人特徴量６０５と最も類似するものを探索する。探索によって得られた人未対応特徴量を対応特徴量とする。探索手法としては例えば、人未対応特徴量６１６〜６２０それぞれと人特徴量６０５との間の平均２乗誤差を求め、これが最小となる人未対応特徴量を探索すればよい。この他にも、例えば、ＲＡＮＳＡＣ（ＲＡＮｄｏｍＳＡｍｐｌｅＣｏｎｓｅｎｓｕｓ）によるフィッティング処理を用いてもよい。図６においては、人特徴量６０５の対応特徴量は人未対応特徴量６１６である。 The point cloud cluster fitting unit 504 searches for the one most similar to the human feature amount 605 among the unsupported feature amounts 616 to 618. Let the unsupported feature amount obtained by the search be the corresponding feature amount. As a search method, for example, the average square error between each of the human feature quantities 616 to 620 and the human feature quantity 605 may be obtained, and the human uncorresponding feature quantity that minimizes this may be searched. In addition to this, for example, a fitting process by RANSAC (RANdom Sample Consensus) may be used. In FIG. 6, the corresponding feature amount of the human feature amount 605 is the non-human feature amount 616.

以上の手順により、人未対応点群クラスタ６０６において代表点６０２に対応するのは代表点６０７であることが分かる。人未対応点群クラスタ６０６が人の顔であれば、人特徴量６０５と人未対応特徴量６１６との間の平均２乗誤差は十分小さく、人の顔でなければ大きいと想定される。 From the above procedure, it can be seen that the representative point 607 corresponds to the representative point 602 in the unsupported point cloud cluster 606. If the unhumanized point cloud cluster 606 is a human face, the average square error between the human feature amount 605 and the unhumanized feature amount 616 is assumed to be sufficiently small, and if it is not a human face, it is assumed to be large.

第２人検出部１１２は、点群クラスタフィッティング部５０４から取得したフィッティング結果に対して、平均２乗誤差が閾値未満であればフィッティングが十分である（すなわち人未対応点群クラスタ６０６は人である）と判断し、第１人検出部１０７から取得した人検出数と人検出領域情報に対してフィッティング結果を統合することにより、改めて人検出数と人検出領域情報を生成する。 The second person detection unit 112 has sufficient fitting if the average square error is less than the threshold value with respect to the fitting result acquired from the point group cluster fitting unit 504 (that is, the person unsupported point group cluster 606 is a person). By integrating the fitting result with the number of people detected and the person detection area information acquired from the first person detection unit 107, the number of people detected and the person detection area information are generated again.

図６においては、人対応点群クラスタ６０１の代表点として代表点６０２を挙げたが、代表点は１点に限らない。人対応点群クラスタ６０１内の代表点が複数ある場合、代表点ごとに人未対応点群クラスタ６０６内の対応する代表点を探索する。また、人未対応点群クラスタ６０６の代表点として代表点６０７、６１０、６１３を挙げたが、代表点は３点に限らない。人対応点群クラスタ内の代表点が複数ある場合、その代表点ごとに両クラスタの間の最小平均２乗誤差を取得し、それらの平均値をフィテッィング結果とすればよい。後述する図７Ａ〜図７Ｂにおいても同様である。 In FIG. 6, the representative point 602 is mentioned as the representative point of the human correspondence point cloud cluster 601 but the representative point is not limited to one point. When there are a plurality of representative points in the person correspondence point cloud cluster 601, the corresponding representative points in the person uncorresponding point group cluster 606 are searched for each representative point. In addition, the representative points 607, 610, and 613 are mentioned as the representative points of the human unsupported point cloud cluster 606, but the representative points are not limited to three points. When there are a plurality of representative points in the human-corresponding point cloud cluster, the minimum average squared error between the two clusters may be acquired for each representative point, and the average value thereof may be used as the fitting result. The same applies to FIGS. 7A to 7B described later.

図７Ａは、本実施形態２における人検出システム１が画像内の人を検出する処理を説明するフローチャートである。図３と比較すると、ステップＳ７０１からＳ７０５が新設され、ステップＳ３０９からＳ３１１が除去された。以下図３とは異なるステップについて主に説明する。 FIG. 7A is a flowchart illustrating a process in which the person detection system 1 in the second embodiment detects a person in an image. Compared with FIG. 3, steps S701 to S705 were newly established, and steps S309 to S311 were removed. Hereinafter, steps different from those in FIG. 3 will be mainly described.

（図７Ａ：ステップＳ７０１〜Ｓ７０２）
人クラスタ形状特徴量抽出部５０１は、人対応点群クラスタの人対応特徴量を抽出する（Ｓ７０１）。特徴量保持部５０３は、ステップＳ７０１において抽出した人対応特徴量を保持する（Ｓ７０２）。 (FIG. 7A: steps S701 to S702)
The human cluster shape feature amount extraction unit 501 extracts the human correspondence feature amount of the human correspondence point group cluster (S701). The feature amount holding unit 503 holds the feature amount corresponding to the person extracted in step S701 (S702).

（図７Ａ：ステップＳ７０３）
点群クラスタフィッティング部５０４は、図６で説明した手順にしたがって、人対応点群クラスタと人未対応点群クラスタを対応付けることにより、両クラスタの間の最小平均２乗誤差をフィテッィング結果として取得する。 (FIG. 7A: step S703)
The point cloud cluster fitting unit 504 acquires the minimum average squared error between both clusters as a fitting result by associating the point cloud clusters corresponding to humans with the point cloud clusters not corresponding to humans according to the procedure described with reference to FIG. ..

（図７Ａ：ステップＳ７０４）
第２人検出部１１２は、点群クラスタフィッティング部５０４から取得したフィッティング結果に基づいて、フィッティングが十分であるかどうかを判断する。フィッティングが十分であれば、そのフィッティング結果を、第１人検出部１０７から取得した人検出数と人検出領域情報へ統合することにより、改めて人検出数と人検出領域情報を生成する。 (FIG. 7A: step S704)
The second person detection unit 112 determines whether or not the fitting is sufficient based on the fitting result obtained from the point cloud cluster fitting unit 504. If the fitting is sufficient, the number of people detected and the person detection area information are generated again by integrating the fitting result into the number of people detected and the person detection area information acquired from the first person detection unit 107.

（図７Ａ：ステップＳ７０５）
点群クラスタフィッティング部５０４は、特徴量保持部５０３が保持する人対応特徴量を読み出し、ステップＳ７０３を実行する。 (FIG. 7A: step S705)
The point cloud cluster fitting unit 504 reads out the human-compatible feature amount held by the feature amount holding unit 503, and executes step S703.

図７Ｂは、ステップＳ７０３の詳細を説明するフローチャートである。ステップＳ７０３０１〜Ｓ７０３０５は、人対応点群クラスタ内の１つの代表点について、これに対応する人未対応点群クラスタ内の代表点を探索する処理である。ステップＳ７０３０６は、同様の処理を人対応点群クラスタ内の各代表点について実施するためのループ処理である。以下図７Ｂの各ステップを説明する。 FIG. 7B is a flowchart illustrating the details of step S703. Steps S70301 to S70305 are processes for searching for a representative point in the person-corresponding point cloud cluster corresponding to one representative point in the person-corresponding point cloud cluster. Step S70306 is a loop process for carrying out the same process for each representative point in the human correspondence point cloud cluster. Each step of FIG. 7B will be described below.

（図７Ｂ：ステップＳ７０３０１〜Ｓ７０３０３）
点群クラスタフィッティング部５０４は、特徴量保持部５０３から取得した人対応特徴量のなかからいずれか１つを選択する（Ｓ７０３０１）。点群クラスタフィッティング部５０４は、人未対応点群クラスタのなかのいずれかの代表点を選び、その代表点に基づき人未対応特徴量を計算する（Ｓ７０３０２）。点群クラスタフィッティング部５０４は、人特徴量と人未対応特徴量との間の平均２乗誤差を計算する（Ｓ７０３０３）。 (FIG. 7B: Steps S70301-S70303)
The point cloud cluster fitting unit 504 selects any one of the human-compatible feature quantities acquired from the feature quantity holding unit 503 (S7031). The point cloud cluster fitting unit 504 selects one of the representative points in the unsupported point cloud cluster, and calculates the unsupported feature amount for humans based on the representative points (S70302). The point cloud cluster fitting unit 504 calculates the average squared error between the human feature amount and the unhumanized feature amount (S70303).

（図７Ｂ：ステップＳ７０３０４）
点群クラスタフィッティング部５０４は、ステップＳ７０３０２とステップＳ７０３０３を人未対応点群クラスタの全ての人未対応特徴量（すなわち全ての代表点）について実施したかどうかを判断する。全ての人未対応特徴量について実施完了したならばステップＳ７０３０５へ進み、完了していなければ、まだ選択していない人未対応特徴量についてステップＳ７０３０２に戻って同様の処理を繰り返す。 (FIG. 7B: step S70304)
The point cloud cluster fitting unit 504 determines whether or not step S70302 and step S70303 have been performed for all the unhumanized feature quantities (that is, all the representative points) of the unhumanized point cloud cluster. If the implementation is completed for all the unsupported features, the process proceeds to step S70305. If not, the process returns to step S70302 for the unselected features that have not been selected, and the same process is repeated.

（図７Ｂ：ステップＳ７０３０５）
点群クラスタフィッティング部５０４は、Ｓ７０３０３において計算した各平均２乗誤差のなかで最小値を探索する。本ステップは、人未対応点群クラスタの代表点のなかで、Ｓ７０３０１において選択した代表点に対応するものを探索することに相当する。 (FIG. 7B: step S70305)
The point cloud cluster fitting unit 504 searches for the minimum value among the average squared errors calculated in S70303. This step corresponds to searching for a representative point corresponding to the representative point selected in S7031 among the representative points of the human uncorresponding point cloud cluster.

（図７Ｂ：ステップＳ７０３０６）
点群クラスタフィッティング部５０４は、ステップＳ７０３０２からステップＳ７０３０５を人対応点群クラスタの全ての人特徴量（すなわち全ての代表点）について実施したかどうかを判断する。全ての人特徴量について実施完了したならばステップＳ７０３０７へ進み、完了していなければ、まだ選択していない人特徴量（すなわち人対応点群クラスタのなかでまだ選択していない代表点）について、ステップＳ７０３０１に戻って同様の処理を繰り返す。 (FIG. 7B: step S70306)
The point cloud cluster fitting unit 504 determines whether or not steps S70302 to S70305 have been performed for all human features (that is, all representative points) of the human-corresponding point cloud cluster. If the implementation is completed for all human features, the process proceeds to step S70307, and if not completed, the human features that have not yet been selected (that is, the representative points that have not yet been selected in the human correspondence point cloud cluster) are The process returns to step S70301 and the same process is repeated.

（図７Ｂ：ステップＳ７０３０７）
点群クラスタフィッティング部５０４は、人対応点群クラスタにおける全ての人特徴量について最小平均２乗誤差を取得し、それらの平均値を計算する。第２人検出部１１２は、この平均値をフィッティング結果として用いることができる。 (FIG. 7B: step S70307)
The point cloud cluster fitting unit 504 acquires the minimum average squared error for all the human features in the point cloud cluster corresponding to humans, and calculates the average value thereof. The second person detection unit 112 can use this average value as the fitting result.

＜実施の形態２：まとめ＞
本実施形態２に係る人検出システム１は、人対応点群クラスタの形状特徴量に対応する部分を、人未対応点群クラスタ内において探索することにより、人未対応点群クラスタが人であるか否かを判断する。これにより実施形態１と同様に、複数の人が密接している場合であっても、正確に複数の人を検出できる。また移動動作パラメータを用いないので、人が移動していない場合であっても人を検出できる利点がある。 <Embodiment 2: Summary>
In the person detection system 1 according to the second embodiment, the human-unsupported point cloud cluster is a person by searching the part corresponding to the shape feature amount of the human-compatible point cloud cluster in the human-unsupported point cloud cluster. Judge whether or not. As a result, as in the first embodiment, even when a plurality of people are in close contact with each other, the plurality of people can be accurately detected. Moreover, since the movement operation parameter is not used, there is an advantage that a person can be detected even when the person is not moving.

＜実施の形態３＞
本発明の実施形態２では、人未対応点群クラスタの移動動作パラメータに代えて、点群クラスタの高さにしたがって人を検出する構成例を説明する。その他構成は実施形態１と同様であるので、以下では主に差異点について説明する。 <Embodiment 3>
In the second embodiment of the present invention, a configuration example in which a person is detected according to the height of the point cloud cluster will be described instead of the movement operation parameter of the point cloud cluster that does not correspond to a person. Since other configurations are the same as those in the first embodiment, the differences will be mainly described below.

図８は、本実施形態３に係る人検出システム１の構成図である。本実施形態３において第２情報処理部１３は、動体検出部１１１に代えて高さフィルタリング部８０１と点群密度極大値検出部８０２を備える。 FIG. 8 is a configuration diagram of the person detection system 1 according to the third embodiment. In the third embodiment, the second information processing unit 13 includes a height filtering unit 801 and a point cloud density maximum value detecting unit 802 instead of the moving object detecting unit 111.

高さフィルタリング部８０１は、人クラスタ除去部１１０から取得した人未対応点群クラスタに対して一定の高さ未満の領域を除去したものを、点群密度極大値検出部８０２へ出力する。点群密度極大値検出部８０２は、高さフィルタリング部８０１から取得した人未対応点群クラスタにおいて、点群の密度が極大値となる高さを探索し、その高さと密度を第２人検出部１１２へ出力する。 The height filtering unit 801 removes a region less than a certain height from the human uncorresponding point cloud cluster acquired from the human cluster removing unit 110, and outputs the removed region to the point cloud density maximum value detecting unit 802. The point cloud density maximum value detection unit 802 searches for the height at which the density of the point cloud becomes the maximum value in the unsupported point cloud cluster acquired from the height filtering unit 801 and detects the height and density of the second person. Output to unit 112.

第２人検出部１１２は、点群密度極大値検出部８０２から取得した高さと密度が人相当の値であるかどうか判断し、その判断結果を、第１人検出部１０７から取得した人検出数と人検出領域情報に統合することにより、改めて人検出数と人検出領域情報を生成する。 The second person detection unit 112 determines whether or not the height and density acquired from the point cloud density maximum value detection unit 802 are values equivalent to a person, and the determination result is obtained from the first person detection unit 107. By integrating the number and the person detection area information, the number of people detected and the person detection area information are generated again.

図９は、点群密度極大値検出部８０２と第２人検出部１１２による人検出方法の詳細を説明する図である。密度分布９１は、ある高さにおける人未対応点群クラスタの点群の密度についての度数分布である。閾値９０１は、高さフィルタリング部８０１が点群をフィルタリングする際の閾値である。検出点９０２は、密度分布９１において極大値を示す点であり、高さと密度の値を有している。 FIG. 9 is a diagram illustrating details of a person detection method by the point cloud density maximum value detection unit 802 and the second person detection unit 112. The density distribution 91 is a frequency distribution for the density of the point cloud of the unsupported point cloud cluster at a certain height. The threshold value 901 is a threshold value when the height filtering unit 801 filters the point cloud. The detection point 902 is a point showing a maximum value in the density distribution 91, and has height and density values.

点群密度極大値検出部８０２は、高さフィルタリング部８０１が閾値９０１にしたがって除去した人未対応点群クラスタを取得し、密度分布９１を計算する。点群密度極大値検出部８０２は、密度分布９１において極大値を探索することにより検出点９０２を検出する。点群密度極大値検出部８０２は、検出点９０２を第２人検出部１１２へ出力する。第２人検出部１１２は、点群密度極大値検出部８０２から取得した検出点９０２が、所定の高さ範囲内かつ所定密度以上であれば人と検出する。一般に、ＴｏＦセンサ１０２の設置位置によって点群の高さに対する密度は変化するので、高さを関数とした密度を判断基準にしてもよい。 The point cloud density maximum value detection unit 802 acquires the unsupported point cloud clusters removed by the height filtering unit 801 according to the threshold value 901, and calculates the density distribution 91. The point cloud density maximum value detection unit 802 detects the detection point 902 by searching for the maximum value in the density distribution 91. The point cloud density maximum value detection unit 802 outputs the detection point 902 to the second person detection unit 112. The second person detection unit 112 detects a person if the detection points 902 acquired from the point cloud density maximum value detection unit 802 are within a predetermined height range and equal to or higher than a predetermined density. In general, the density with respect to the height of the point cloud changes depending on the installation position of the ToF sensor 102, so the density using the height as a function may be used as a criterion.

図１０は、本実施形態３における人検出システム１が画像内の人を検出する処理を説明するフローチャートである。図３と比較すると、ステップＳ１００１〜Ｓ１００３が新設され、ステップＳ３０９〜Ｓ３１１が除去された。以下図３とは異なるステップについて主に説明する。 FIG. 10 is a flowchart illustrating a process in which the person detection system 1 in the third embodiment detects a person in an image. Compared with FIG. 3, steps S1001 to S1003 were newly established, and steps S309 to S311 were removed. Hereinafter, steps different from those in FIG. 3 will be mainly described.

（図１０：ステップＳ１００１）
高さフィルタリング部８０１は、ステップＳ３０７において人検出領域を対応付けた人未対応点群クラスタから閾値９０１未満の領域を除去することにより、人未対応点群クラスタを生成する。 (FIG. 10: Step S1001)
The height filtering unit 801 generates a human uncorresponding point cloud cluster by removing the region less than the threshold value 901 from the human uncorresponding point cloud cluster associated with the human detection region in step S307.

（図１０：ステップＳ１００２）
点群密度極大値検出部８０２は、ステップＳ１００１において生成した人未対応点群クラスタのなかで、点群の密度が極大値となる高さとその高さにおける密度を検出する。 (FIG. 10: Step S1002)
The point cloud density maximum value detection unit 802 detects the height at which the density of the point cloud becomes the maximum value and the density at that height in the unsupported point cloud cluster generated in step S1001.

（図１０：ステップＳ１００３）
第２人検出部１１２は、ステップＳ１００２において検出した高さと密度が人相当の値であるかどうか判断する。人であると判断した場合は、その判断結果を第１人検出部１０７から取得した人検出数と人検出領域情報へ統合することにより、改めて人検出数と人検出領域情報を生成する。 (FIG. 10: Step S1003)
The second person detection unit 112 determines whether or not the height and density detected in step S1002 are values equivalent to a person. When it is determined that the person is a person, the number of people detected and the person detection area information are generated again by integrating the determination result into the number of people detected and the person detection area information acquired from the first person detection unit 107.

（図１０：ステップＳ１００２〜Ｓ１００３：補足）
密度分布９１における極大値が複数ある場合、各極大値がそれぞれ人である可能性もある。したがって点群密度極大値検出部８０２と第２人検出部１１２は、極大値ごとに人であるか否かを判断してもよい。判断手順は図９で説明したものと同様である。 (FIG. 10: Steps S1002 to S1003: Supplement)
When there are a plurality of maximum values in the density distribution 91, each maximum value may be a person. Therefore, the point cloud density maximum value detection unit 802 and the second person detection unit 112 may determine whether or not the person is a person for each maximum value. The determination procedure is the same as that described with reference to FIG.

＜実施の形態３：まとめ＞
本実施形態３に係る人検出システム１は、人未対応点群クラスタの高さをフィルタリングした上で、高さ分布の極大値が所定範囲内の高さにあるか否かに基づき、人未対応点群クラスタが人であるか否かを判断する。これにより実施形態１と同様に、複数の人が密接している場合であっても、正確に複数の人を検出できる。また移動動作パラメータを用いないので、人が移動していない場合であっても人を検出できる利点がある。さらに、高さについての密度分布は点群を一度走査するだけで計算できるので、計算コストを抑制した上で正確に複数の人を検出できる。 <Embodiment 3: Summary>
The person detection system 1 according to the third embodiment filters the height of the unsupported point cloud cluster, and then determines whether or not the maximum value of the height distribution is within a predetermined range. Correspondence point group Determine if the cluster is a person. As a result, as in the first embodiment, even when a plurality of people are in close contact with each other, the plurality of people can be accurately detected. Moreover, since the movement operation parameter is not used, there is an advantage that a person can be detected even when the person is not moving. Furthermore, since the density distribution for height can be calculated by scanning the point cloud only once, it is possible to accurately detect a plurality of people while suppressing the calculation cost.

＜本発明の変形例について＞
本発明は、前述した実施形態に限定されるものではなく、様々な変形例が含まれる。例えば、上記した実施形態は本発明を分かりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施形態の構成の一部を他の実施形態の構成に置き換えることが可能であり、また、ある実施形態の構成に他の実施形態の構成を加えることも可能である。また、各実施形態の構成の一部について、他の構成の追加・削除・置換をすることが可能である。 <About a modified example of the present invention>
The present invention is not limited to the above-described embodiment, and includes various modifications. For example, the above-described embodiment has been described in detail in order to explain the present invention in an easy-to-understand manner, and is not necessarily limited to the one including all the described configurations. Further, it is possible to replace a part of the configuration of one embodiment with the configuration of another embodiment, and it is also possible to add the configuration of another embodiment to the configuration of one embodiment. Further, it is possible to add / delete / replace a part of the configuration of each embodiment with another configuration.

以上の実施形態において、第１人検出部１０７は、顔ＲＧＢ画像上で検出を実施することにより、人の顔領域を検出することができる。第２人検出部１１２は、点群クラスタのうち各顔領域の下方において人の身体が存在すると想定される領域を、人とみなすこともできる。例えば顔領域下方の所定サイズの円柱領域を、人とみなしてもよい。これにより複数の人が顔以外の領域において重なり合っている場合であれば、簡易的な処理で人を検出できる。 In the above embodiment, the first person detection unit 107 can detect the human face region by performing the detection on the face RGB image. The second person detection unit 112 can also consider the area of the point cloud cluster in which the human body is assumed to exist below each face area as a person. For example, a cylindrical region of a predetermined size below the face region may be regarded as a person. As a result, if a plurality of people overlap in an area other than the face, the person can be detected by a simple process.

以上の実施形態において、同期情報保持部１０６は、座標（ｇｘ，ｇｙ）と回転角φを記憶する記憶装置によって構成することができる。第１情報処理部１２と第２情報処理部１３は、これらの機能を実装した回路デバイスなどのハードウェアを用いて構成することもできるし、これらの機能を実装したソフトウェアをＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）などの演算装置が実行することにより構成することもできる。第１情報処理部１２と第２情報処理部１３が備える各機能部も同様である。 In the above embodiment, the synchronization information holding unit 106 can be configured by a storage device that stores the coordinates (gx, gy) and the rotation angle φ. The first information processing unit 12 and the second information processing unit 13 can be configured by using hardware such as a circuit device that implements these functions, and software that implements these functions is provided by a CPU (Central Processing Unit). ) And other arithmetic units can be executed. The same applies to each functional unit included in the first information processing unit 12 and the second information processing unit 13.

１：人検出システム
１０：人検出装置
１１：撮影処理部
１０１：ＲＧＢカメラ
１０２…ＴｏＦセンサ
１２：第１情報処理部
１０３：ＲＧＢ画像取得部
１０４：距離画像取得部
１０５：ピクセル同期部
１０６：同期情報保持部
１０７：第１人検出部
１０８：点群クラスタリング部
１０９：人点群クラスタ対応付け部
１１０：人クラスタ除去部
１３：第２情報処理部
１１１：動体検出部
１１２：第２人検出部
５０２：人クラスタ形状特徴量抽出部
５０３：特徴量保持部
５０４：点群クラスタフィッティング部
８０１：高さフィルタリング部
８０２：点群密度極大値検出部 1: Person detection system 10: Person detection device 11: Imaging processing unit 101: RGB camera 102 ... ToF sensor 12: First information processing unit 103: RGB image acquisition unit 104: Distance image acquisition unit 105: Pixel synchronization unit 106: Synchronization Information holding unit 107: First person detection unit 108: Point cloud clustering unit 109: Point cloud cluster association unit 110: Person cluster removal unit 13: Second information processing unit 111: Moving object detection unit 112: Second person detection unit 502: Human cluster shape feature amount extraction unit 503: Feature amount holding unit 504: Point cloud cluster fitting unit 801: Height filtering unit 802: Point cloud density maximum value detection unit

Claims

A person detection device that detects people included in an image.
First person detection unit that detects people included in the RGB image acquired from the RGB camera,
A point cloud grouping unit that acquires a point cloud group by spatially grouping the point clouds included in the point cloud data generated from the point cloud image acquired from the ToF sensor.
A mapping processing unit that associates a person detection region in which a person is detected in the RGB image by the first person detection unit with the point cloud group grouped by the point cloud grouping unit.
Among the point group groups, the association processing unit determines whether or not the uncorresponding point group group is a person according to the first feature amount of the uncorresponding point group group that is not associated with the person detection area. , Second person detector,
A person detection device comprising.

The person detection device further includes a synchronization processing unit that calculates synchronization data representing a difference between the pixel position of the RGB image and the pixel position of the point cloud image.
The point cloud grouping unit is characterized in that the RGB image and the point cloud image are aligned according to the synchronization data, and then the point cloud group is acquired from the point cloud image. Item 1. The person detection device according to item 1.

The synchronization processing unit periodically updates the difference between the pixel position of the RGB image and the pixel position of the point cloud image by periodically calculating the synchronization data. Item 2. The person detection device according to item 2.

The second person detection unit calculates a parameter describing the movement operation of the uncorresponding point group group as the first feature amount, and calculates the parameter.
The second person detection unit is characterized in that it determines whether or not the uncorresponding point group group is a person by comparing a parameter representing the movement movement with a threshold value representing the movement movement of a person. The person detection device according to claim 1.

The person detection device further includes a first feature amount calculation unit that calculates a feature amount representing the shape of the uncorresponding point group group as the first feature amount.
The person detection device further includes a second feature amount calculation unit in which the association processing unit calculates a parameter representing the shape of the point cloud group associated with the person detection region as a second feature amount.
The second person detection unit according to claim 1, wherein the second person detection unit determines whether or not the uncorresponding point group is a person by comparing the first feature amount with the second feature amount. Person detection device.

The first feature amount calculation unit calculates a first histogram of the angle formed by the normal vector of the surface of the uncorresponding point group group with the reference angle as the first feature amount.
The second feature amount calculation unit obtains a second histogram of the angle formed by the normal vector of the surface of the point cloud group associated with the person detection region by the association processing unit with respect to the reference angle. Created as the second feature amount,
The second person detection unit determines whether or not the uncorresponding point cloud group is a person according to whether or not the difference between the first histogram and the second histogram is less than the threshold value. 5. The person detection device according to claim 5.

The person detection device further includes height filtering for filtering point groups whose height is less than a threshold value among the point groups included in the uncorresponding point group group.
The person detection device further includes a height histogram creating unit that creates a height histogram representing the frequency distribution of the heights of the point groups included in the uncorresponding point group after the filtering.
The second person detection unit according to claim 1, wherein the second person detection unit determines whether or not the uncorresponding point cloud group is a person according to whether or not the height histogram satisfies the person detection condition. Human detector.

In the person detection condition, if the maximum value of the frequency value of the height histogram is within the predetermined range and the density of the point group is equal to or more than the predetermined value, the uncorresponding point group group is regarded as a person, otherwise. The person detection device according to claim 7, wherein the non-corresponding point group group is configured to be regarded as not a person.

The second person detection unit determines whether or not the person detection condition is satisfied for each of the maximum values, thereby determining whether or not the uncorresponding point group group is a person for each of the maximum values. 8. The person detection device according to claim 8.

The first person detection unit detects a person's face region included in the RGB image by performing face detection on the RGB image.
The second person detection unit is characterized in that, among the uncorresponding point group groups, a point cloud existing in a predetermined range below the face region is regarded as a person corresponding to the face region. 1. The person detection device according to 1.

It is a person detection system that detects people included in images.
RGB camera,
ToF sensor,
Human detector,
With
The person detection device is
A first person detection unit that detects a person included in an RGB image acquired from the RGB camera,
A point cloud grouping unit that acquires a point cloud group by spatially grouping the point clouds included in the point cloud data generated from the point cloud image acquired from the ToF sensor.
A mapping processing unit that associates a person detection region in which a person is detected in the RGB image by the first person detection unit with the point cloud group grouped by the point cloud grouping unit.
Among the point group groups, the association processing unit determines whether or not the uncorresponding point group group is a person according to the first feature amount of the uncorresponding point group group that is not associated with the person detection area. , Second person detector,
A person detection system characterized by being equipped with.

It is a person detection method that detects people included in the image.
Steps to detect people included in an RGB image acquired from an RGB camera,
A step of acquiring a point cloud group by spatially grouping the point clouds included in the point cloud data generated from the point cloud image acquired from the ToF sensor.
A step of associating a person detection region in which a person is detected in the RGB image with the grouped point cloud group.
A step of determining whether or not the uncorresponding point group is a person according to the first feature amount of the uncorresponding point group that is not associated with the person detection region in the associating step of the point group. ,
A person detection method characterized by having.