JP6348368B2

JP6348368B2 - Object detection device

Info

Publication number: JP6348368B2
Application number: JP2014164407A
Authority: JP
Inventors: 叶秋李; 正則小野塚; 佐藤　昌宏; 昌宏佐藤; 陽介村井; 秀紀氏家
Original assignee: Secom Co Ltd
Current assignee: Secom Co Ltd
Priority date: 2014-08-12
Filing date: 2014-08-12
Publication date: 2018-06-27
Anticipated expiration: 2034-08-12
Also published as: JP2016040674A

Description

本発明は入力画像から所定の対象が現れた対象領域を検出する対象検出装置に関する。 The present invention relates to a target detection device that detects a target region where a predetermined target appears from an input image.

監視カメラなどで撮影した入力画像から人物領域などを検出するために識別器等による探索処理が行われる。入力画像における対象の位置や大きさは一般に未知であるため、この探索処理では、入力画像内の各位置に窓領域を設定し、窓領域における画像を識別器等に入力する。そして、識別器等から出力されるスコアが閾値を超える窓領域を対象の候補領域として抽出する。 Search processing by a classifier or the like is performed in order to detect a person region or the like from an input image taken by a surveillance camera or the like. Since the position and size of the target in the input image are generally unknown, in this search process, a window area is set at each position in the input image, and the image in the window area is input to a discriminator or the like. Then, a window region whose score output from the discriminator or the like exceeds a threshold is extracted as a target candidate region.

このとき１つの対象に対して、真の対象の位置及び大きさを有する候補領域が抽出されるだけでなく、その近傍や内側においても位置及び／または大きさの異なる複数の候補領域が抽出される。そのため、重複を有する候補領域を、１つの対象に対して抽出された複数の候補領域であるとしてグループ化し、領域グループごとにスコアが最大の候補領域を対象の領域として選別することが行われている。 At this time, not only a candidate area having a true target position and size is extracted for one target, but also a plurality of candidate areas having different positions and / or sizes are extracted in the vicinity or inside thereof. The Therefore, the candidate areas having overlap are grouped as a plurality of candidate areas extracted for one target, and the candidate area having the highest score for each area group is selected as the target area. Yes.

ここで、領域グループは背景のみが写った領域（以下、背景領域）でも抽出されることがある。そこで従来、重複して抽出された候補領域の数を求め、求めた数が閾値を下回る候補領域を削除していた。これは、背景領域では重複して抽出される候補領域の数が少ない傾向を利用したものである。 Here, the area group may be extracted even in an area where only the background is shown (hereinafter, background area). Therefore, conventionally, the number of candidate areas extracted in duplicate is obtained, and candidate areas in which the obtained number falls below a threshold value are deleted. This uses the tendency that the number of candidate areas extracted in the background area is small.

特開２０１０−１６０６４０号公報JP 2010-160640 A

しかしながら、背景領域であっても比較的多くの候補領域が重複して抽出されることがあり、また、対象が写った領域であっても対象の姿勢変動などの影響によって重複して抽出される候補領域の数が比較的少なくなる場合がある。 However, even if it is a background area, a relatively large number of candidate areas may be extracted in an overlapping manner, and even an area in which an object is captured is redundantly extracted due to influences such as a change in posture of the object. The number of candidate areas may be relatively small.

そのため、従来技術のように重複数だけに着目し、重複数が閾値を下回る領域グループを削除すると、対象を含む領域グループを誤って削除してしまうおそれがあった。或いは、重複数の閾値を高めに設定してしまうと対象領域の領域グループを検出し損ねるおそれがあった。 Therefore, if attention is paid only to the overlap number as in the prior art and an area group in which the overlap number falls below the threshold value is deleted, the area group including the target may be erroneously deleted. Alternatively, if the overlap threshold value is set high, there is a risk of failing to detect the region group of the target region.

また、真の対象を表す候補領域であっても対象の姿勢変化などによってスコアが低めとなる場合があり、単純に候補領域を抽出する閾値を引き上げて背景領域での領域グループの抽出を抑制しようとすると対象を検出し損ねる問題があった。 Even if a candidate region represents a true target, the score may be lower due to changes in the posture of the target, etc., so simply increase the threshold for extracting the candidate region and suppress the extraction of region groups in the background region. Then, there was a problem of failing to detect the target.

このように従来技術は対象の領域グループと背景の領域グループとを好適に弁別できないことが比較的起こりやすいという問題を有していた。 As described above, the conventional technique has a problem that it is relatively easy to appropriately distinguish the target area group from the background area group.

本発明は上記問題を鑑みてなされたものであり、対象の検出し損ねを防止しつつ、背景領域にて対象の誤検出を防止し、入力画像から精度よく対象を検出可能な対象検出装置を提供することを目的とする。 The present invention has been made in view of the above problems, and an object detection device capable of detecting a target with high accuracy from an input image while preventing erroneous detection of the target while preventing erroneous detection of the target in the background region. The purpose is to provide.

本発明に係る対象検出装置は、入力画像において所定の対象が現れている対象領域を検出するものであって、前記入力画像内に設定される注目領域に前記対象が存在する尤もらしさを表す指標値を前記入力画像内の各所にて抽出される特徴量を用いて算出するための指標値算出関数を予め記憶している記憶部と、前記入力画像内の複数の位置に前記注目領域を設定し、当該注目領域における前記指標値を前記指標値算出関数により算出する指標値算出部と、前記注目領域のうち前記指標値が予め定められた第一閾値以上であるものを候補領域として抽出すると共に、当該候補領域相互についての予め定められた重複関係を満たす複数の前記候補領域からなる領域グループを生成する領域グループ生成部と、前記領域グループのうち、帰属する前記候補領域の個数が予め定められた個数閾値以下であり且つ帰属する前記候補領域の前記指標値が前記第一閾値よりも高く定めた第二閾値以下であるものを削除する領域グループ削除部と、前記領域グループ削除部によって削除されなかった前記領域グループそれぞれから前記対象領域を決定する対象領域決定部と、を備える。 An object detection device according to the present invention detects an object region in which a predetermined object appears in an input image, and is an index representing the likelihood that the object exists in an attention region set in the input image A storage unit that stores in advance an index value calculation function for calculating a value using a feature amount extracted at various points in the input image, and sets the attention area at a plurality of positions in the input image Then, an index value calculation unit that calculates the index value in the attention area by the index value calculation function, and the attention area that has the index value equal to or greater than a predetermined first threshold is extracted as a candidate area. In addition, a region group generation unit that generates a region group composed of a plurality of the candidate regions satisfying a predetermined overlapping relationship between the candidate regions, and belonging among the region groups An area group deletion unit that deletes the number of candidate areas that are equal to or less than a predetermined number threshold and the index value of the candidate area to which the candidate area belongs is equal to or less than a second threshold that is set higher than the first threshold; A target region determining unit that determines the target region from each of the region groups that have not been deleted by the region group deleting unit.

本発明に係る対象検出装置においては、前記領域グループに帰属する前記候補領域の前記個数が前記個数閾値以下である場合における前記第二閾値は当該個数に応じて設定され、当該第二閾値の各設定値は、当該設定値に対応する前記個数が多いほど小さい値とするのが好適である。 In the object detection device according to the present invention, the second threshold value when the number of the candidate regions belonging to the region group is equal to or less than the number threshold value is set according to the number, and each of the second threshold values The set value is preferably set to a smaller value as the number corresponding to the set value increases.

本発明によれば、対象の検出し損ねを防止しつつ背景の誤検出を的確に減じることができる。 According to the present invention, it is possible to accurately reduce background misdetection while preventing an object from being missed.

本発明の実施形態に係る人物検出装置の概略のブロック構成図である。1 is a schematic block configuration diagram of a person detection device according to an embodiment of the present invention. 入力画像及び縮小画像の例を示す模式図である。It is a schematic diagram which shows the example of an input image and a reduction image. 本発明の実施形態に係る人物検出装置の概略の動作を示すフロー図である。It is a flowchart which shows operation | movement of the outline of the person detection apparatus which concerns on embodiment of this invention. 領域グループ生成部の概略の処理フロー図である。FIG. 10 is a schematic process flow diagram of an area group generation unit. 領域グループ生成部により抽出された候補領域に対する後続処理を説明する模式的な画像である。It is a typical image explaining the subsequent process with respect to the candidate area | region extracted by the area | region group production | generation part. 領域グループ削除部の概略の処理フロー図である。FIG. 10 is a schematic process flow diagram of an area group deletion unit.

以下、本発明の実施の形態（以下実施形態という）について、図面に基づいて説明する。本実施形態に係る対象検出装置は、画像中に映った人物を検出の対象とする人物検出装置１である。 Hereinafter, embodiments of the present invention (hereinafter referred to as embodiments) will be described with reference to the drawings. The target detection apparatus according to the present embodiment is a person detection apparatus 1 that targets a person shown in an image as a detection target.

[構成例]
図１は、実施形態に係る人物検出装置１の概略のブロック構成図である。人物検出装置１は、画像入力部２、制御部３、記憶部４及び出力部５を含んで構成される。画像入力部２、記憶部４及び出力部５は制御部３と接続される。 [Configuration example]
FIG. 1 is a schematic block diagram of a person detection device 1 according to the embodiment. The person detection device 1 includes an image input unit 2, a control unit 3, a storage unit 4, and an output unit 5. The image input unit 2, the storage unit 4, and the output unit 5 are connected to the control unit 3.

画像入力部２は例えば、監視カメラなどの撮像装置、又は映像を記録したデジタルビデオレコーダーなどの記録装置であり、画像を制御部３へ出力する。以下、画像入力部２から制御部３に入力される画像を入力画像と称する。 The image input unit 2 is, for example, an imaging device such as a surveillance camera or a recording device such as a digital video recorder that records video, and outputs an image to the control unit 3. Hereinafter, an image input from the image input unit 2 to the control unit 3 is referred to as an input image.

制御部３はＣＰＵ（Central Processing Unit）、ＤＳＰ(Digital Signal Processor)等の演算装置を用いて構成される。制御部３は、画像入力部２からの入力画像を処理して人の存在有無を判定し、その判定結果等を出力部５へ出力する処理を行う。そのために、制御部３は、記憶部４からプログラムを読み出して実行し、画像縮小部３０、特徴量抽出部３１、指標値算出部３２、領域グループ生成部３３、領域グループ削除部３４及び対象領域決定部３５として機能する。 The control unit 3 is configured using an arithmetic device such as a CPU (Central Processing Unit) or a DSP (Digital Signal Processor). The control unit 3 processes the input image from the image input unit 2 to determine the presence / absence of a person and outputs the determination result to the output unit 5. For this purpose, the control unit 3 reads out and executes a program from the storage unit 4, and executes an image reduction unit 30, a feature amount extraction unit 31, an index value calculation unit 32, a region group generation unit 33, a region group deletion unit 34, and a target region. It functions as the determination unit 35.

画像縮小部３０は、入力画像に撮像されている人物のサイズが様々であることに対応して、予め設定された複数段階の倍率で入力画像を縮小する。これにより画像内にて人物を検出するために設定する窓領域の大きさは変えずに、様々なサイズの人物領域を検出することが可能となる。例えば、画像縮小部３０は入力画像を予め定めた最小幅または高さになるまで決まった間隔で順次縮小し、縮小画像を生成する。縮小倍率は、例えば縦横のサイズが半分になるまでの間に１０段階に設定される。例えば、図２（ａ）に示す画像１００が原サイズの入力画像であり、図２（ｂ），（ｃ）に示す画像１１０，１２０は画像１００を縮小した入力画像の例である。 The image reduction unit 30 reduces the input image at a plurality of preset magnifications in response to the various sizes of the person captured in the input image. This makes it possible to detect person areas of various sizes without changing the size of the window area set for detecting a person in the image. For example, the image reduction unit 30 sequentially reduces the input image at a predetermined interval until it reaches a predetermined minimum width or height, and generates a reduced image. For example, the reduction ratio is set to 10 levels until the vertical and horizontal sizes are halved. For example, the image 100 shown in FIG. 2A is an input image of the original size, and the images 110 and 120 shown in FIGS. 2B and 2C are examples of input images obtained by reducing the image 100.

特徴量抽出部３１は、原サイズの入力画像及び縮小した入力画像のそれぞれを予め定めたブロックサイズに区切り、各ブロックの画像から特徴量を抽出する。特徴量として、ヒストグラム・オブ・オリエンティッド・グラディエント（Histograms of Oriented Gradients：ＨＯＧ）特徴量、局所二値パターン（Local Binary Pattern：ＬＢＰ）特徴量、Haar-like特徴量などの従来知られた特徴量を単独で、又は複数を組み合わせて用いることができる。 The feature amount extraction unit 31 divides each of the original size input image and the reduced input image into predetermined block sizes, and extracts the feature amount from the image of each block. Conventionally known features such as Histograms of Oriented Gradients (HOG), Local Binary Pattern (LBP) features, Haar-like features, etc. It can be used alone or in combination.

指標値算出部３２は、原サイズの入力画像及び縮小した入力画像内の各位置に人物を検出するための枠として、予め定めた人の大きさの窓領域（注目領域）を設定し、当該窓領域に対象が存在する尤もらしさを表す多値の指標値であるスコアを、入力画像内の各所にて抽出された特徴量と予め学習した指標値算出関数により算出する。例えば、指標値算出部３２は、各窓領域内の特徴量を指標値算出関数に入力して当該窓領域に対するスコアを算出する、または、人物の腕部等が窓領域からはみ出す姿勢変動を考慮して窓領域内及び窓領域周辺の所定範囲の特徴量を指標値算出関数に入力して当該窓領域に対するスコアを算出する。 The index value calculation unit 32 sets a window area (attention area) of a predetermined person size as a frame for detecting a person at each position in the original size input image and the reduced input image, and A score, which is a multi-valued index value representing the likelihood that the target exists in the window area, is calculated using feature values extracted at various points in the input image and an index value calculation function learned in advance. For example, the index value calculation unit 32 calculates the score for the window area by inputting the feature amount in each window area to the index value calculation function, or takes into account the posture variation that the arm part of the person protrudes from the window area Then, a feature amount within a predetermined range in and around the window area is input to the index value calculation function to calculate a score for the window area.

なお、図２では画像１００，１１０，１２０に設定される矩形の窓領域１０１の例を点線で示している。指標値算出部３２は窓領域１０１を少しずつずらしながら繰り返し設定し、画像全体を走査する。例えば、窓領域１０１の走査は画像の左上から水平方向の走査が開始される。水平方向の走査は垂直方向の位置を少しずつずらしつつ繰り返される。 In FIG. 2, an example of the rectangular window region 101 set in the images 100, 110, and 120 is indicated by a dotted line. The index value calculation unit 32 repeatedly sets the window area 101 while gradually shifting it, and scans the entire image. For example, scanning of the window region 101 starts in the horizontal direction from the upper left of the image. The horizontal scanning is repeated while shifting the vertical position little by little.

指標値算出関数は本実施形態では、検出対象である「人」と「人」以外とを識別する識別器である。識別器は「人」が映っている多数の画像と、「人」が映っていない多数の画像とを用いて予め学習され、後述する指標値算出関数格納部４０に格納されている。指標値算出部３２は識別器に窓領域の位置に応じて特徴量を与えることでスコアを算出する。指標値算出部３２は、窓領域の矩形情報（入力画像における位置、幅及び高さ）とそのスコアを、後述する指標値格納部４１に格納する。例えば、入力画像における窓領域の位置として窓領域をなす矩形の左上の座標が格納される。 In this embodiment, the index value calculation function is a discriminator that discriminates between “persons” to be detected and those other than “persons”. The discriminator is learned in advance using a large number of images in which “people” are reflected and a large number of images in which “people” are not reflected, and is stored in an index value calculation function storage unit 40 described later. The index value calculation unit 32 calculates a score by giving a feature amount to the classifier according to the position of the window region. The index value calculation unit 32 stores rectangular information (position, width, and height in the input image) of the window area and its score in an index value storage unit 41 described later. For example, the upper left coordinates of the rectangle forming the window area are stored as the position of the window area in the input image.

領域グループ生成部３３は、指標値格納部４１から、スコアが予め定めた第一閾値Ｔ_１以上である窓領域を候補領域として抽出すると共に、当該候補領域相互についての予め定められた重複関係を満たす複数の領域からなる領域グループを生成する。具体的には、領域グループ生成部３３は、所定以上の重複を有する候補領域同士に同じラベル番号を割り当てることによって領域グループの情報を生成する。また、その際にスコアの高い候補領域を優先的にグループの核とする。詳細は動作の説明にて後述する。領域グループ生成部３３で割り当てた各候補領域のラベル番号は、矩形情報及びスコアと共に、後述する領域グループ格納部４２に格納される。 Region group generation unit 33, from the index value storage section 41, extracts the window region is first thresholds T ₁ or the score is predetermined as a candidate region, a predetermined overlapping relation on the candidate regions mutually A region group composed of a plurality of regions to be filled is generated. Specifically, the region group generation unit 33 generates region group information by assigning the same label number to candidate regions having a predetermined overlap or more. At that time, a candidate area having a high score is preferentially set as the core of the group. Details will be described later in the description of the operation. The label number of each candidate area assigned by the area group generation unit 33 is stored in the area group storage unit 42 described later together with the rectangle information and the score.

領域グループ削除部３４は、領域グループ格納部４２に格納されている領域グループのうち、削除条件に合致するものを領域グループ格納部４２から削除する。ここで、実験的に、人物の領域を含む領域グループは、人物の領域が含まれない領域グループよりも候補領域数が多く、且つ、姿勢変動、オクルージョン等が原因で仮に候補領域数が少なくなった場合でも、候補領域のスコアは人物の領域が含まれない領域グループの候補領域のスコアよりも高い傾向があることがわかった。そこで、削除条件は、領域グループが次の２つの要件（Ｒ１），（Ｒ２）をいずれも満たしていることとする。 The area group deletion unit 34 deletes the area groups stored in the area group storage unit 42 that match the deletion condition from the area group storage unit 42. Here, experimentally, an area group including a person area has a larger number of candidate areas than an area group not including a person area, and the number of candidate areas temporarily decreases due to posture variation, occlusion, and the like. Even in this case, it was found that the score of the candidate area tends to be higher than the score of the candidate area of the area group not including the person area. Therefore, the deletion condition is that the area group satisfies both of the following two requirements (R1) and (R2).

（Ｒ１）領域グループに帰属する候補領域の個数が予め定めた個数閾値Ｍ以下であること。 (R1) The number of candidate regions belonging to the region group is equal to or less than a predetermined number threshold M.

（Ｒ２）領域グループに帰属する全ての候補領域のスコアが第一閾値Ｔ_１よりも高く定めた第二閾値Ｔ_２以下であること。 (R2) The scores of all candidate regions belonging to the region group are equal to or lower than a second threshold T ₂ determined higher than the first threshold T ₁ .

個数閾値Ｍは、指標値算出関数の性能、第一閾値の設定、縮小率の設定などに依存して実験的・経験的に決定するものであって、例えば、実験データにおいて人物を含まない領域グループごとに候補領域の個数を求め、これらのグループから求めた個数の最大値を閾値Ｍに設定することができる。つまり、個数閾値Ｍは、人物を含まないグループのうち最も個数が多い領域グループに基づいて定めた閾値であるので、人物を含まない可能性がある領域グループと確実に人物を含む領域グループとの境界となる閾値と推定することができる。 The number threshold M is determined experimentally and empirically depending on the performance of the index value calculation function, the setting of the first threshold, the setting of the reduction ratio, and the like. The number of candidate regions can be obtained for each group, and the maximum value obtained from these groups can be set as the threshold value M. That is, the number threshold M is a threshold determined based on the region group having the largest number among the groups not including the person, and therefore, the region group that may not include the person and the region group that surely includes the person. It can be estimated that the threshold value is a boundary.

第一閾値Ｔ_１は、事前の実験に基づいて設定することができ、真の人物領域を検出し損ねない程度に低めの値に設定される。例えば、第一閾値Ｔ_１は実験データにおける真の人物領域に対して算出されたスコアの最小値とすることができる。第二閾値Ｔ_２も事前の実験を基づいて設定することができる。第二閾値Ｔ_２は第一閾値Ｔ_１よりも高い値であるが、領域グループ削除部３４による第二閾値Ｔ_２に関する要件（Ｒ２）の適用の有無が問題となるのは、領域グループが要件（Ｒ１）を満たす場合に限定される。そこで例えば、実験データにおいて帰属する候補領域の個数がＭ個以下であり真の人物領域を含む領域グループごとに最大スコアを求め、これらのグループから求めた最大スコアのうちの最小値を第二閾値Ｔ_２とすることができる。 First thresholds T ₁ may be set based on preliminary experiments, it is set to a lower value so as not fail to detect a true person area. For example, the first thresholds T ₁ may be the minimum value of the scores calculated for the true person region in the experimental data. Also the second threshold value T ₂ can be set based on preliminary experiments. The second threshold value T ₂ is higher than the first threshold value T ₁ , but whether or not the requirement (R 2) regarding the second threshold value T ₂ is applied by the region group deletion unit 34 is a problem for the region group. It is limited to satisfying (R1). Therefore, for example, the maximum score is obtained for each area group including the true person area and the number of candidate areas belonging to the experimental data is M or less, and the minimum value of the maximum scores obtained from these groups is set as the second threshold value. it can be a T _2.

第二閾値Ｔ_２は人物を含む領域グループのうち最もスコアが低い領域グループに基づいて定めた閾値であるので、これを人物を含む領域グループと人物を含まない領域グループとの境界となる閾値と推定することができる。 Since the second threshold value T ₂ is a threshold determined based on the most score lower area group in the region group including a person, a threshold at the boundary between the region groups that do not contain area groups and persons including the same person Can be estimated.

対象領域決定部３５は領域グループ格納部４２に格納されている領域グループから最終的な人物領域を求める。対象領域決定部３５は、領域グループ削除部３４で残された領域グループごとに１つの人物領域を定め、当該人物領域の領域情報をスコアと共に対象領域格納部４３に格納する。例えば、対象領域決定部３５は、最終的な人物領域として、各領域グループの中でスコアが最大になる候補領域を１つ選択する。或いは、対象領域決定部３５は、領域グループごとに当該領域グループを構成する候補領域を平均して最終的な人物領域を算出する。当該平均を求める際、スコアで重み付けをしてもよい。 The target area determination unit 35 obtains a final person area from the area group stored in the area group storage unit 42. The target region determination unit 35 determines one person region for each region group left by the region group deletion unit 34 and stores the region information of the person region in the target region storage unit 43 together with the score. For example, the target area determination unit 35 selects one candidate area having the maximum score in each area group as the final person area. Alternatively, the target area determination unit 35 calculates a final person area by averaging the candidate areas constituting the area group for each area group. When obtaining the average, the score may be weighted.

制御部３は，入力画像から最終的な人物領域が１つでも検出された場合は、その情報を出力部５に出力する。 When at least one final person region is detected from the input image, the control unit 3 outputs the information to the output unit 5.

記憶部４はＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）、ハードディスク等の記憶装置であり、制御部３で使用されるプログラムやデータを記憶する。記憶部４はこれらプログラム、データを制御部３との間で入出力する。記憶部４は指標値算出関数格納部４０、指標値格納部４１、領域グループ格納部４２及び対象領域格納部４３としての機能を有する。 The storage unit 4 is a storage device such as a ROM (Read Only Memory), a RAM (Random Access Memory), and a hard disk, and stores programs and data used by the control unit 3. The storage unit 4 inputs and outputs these programs and data to and from the control unit 3. The storage unit 4 has functions as an index value calculation function storage unit 40, an index value storage unit 41, an area group storage unit 42, and a target area storage unit 43.

指標値算出関数格納部４０は、入力画像内に設定される窓領域に対象が存在する尤もらしさを表す指標値であるスコアを、入力画像内の各ブロックにて抽出される特徴量を用いて算出するための指標値算出関数、及び第一閾値Ｔ_１を予め記憶している。指標値算出関数は既に述べたように識別器であり、具体的には予め収集した人の学習用画像と人以外の学習用画像にサポートベクターマシーン（Support Vector Machine：ＳＶＭ）を適用して求めた識別器のパラメータが指標値算出関数格納部４０に格納される。学習アルゴリズムとして線形ＳＶＭを用いた場合、識別器のパラメータは学習用画像から生成した重みベクトルである。この重みベクトルは、特徴量の各要素に対する重みである。重みベクトルは、当該重みベクトルと学習用画像から抽出された特徴量との内積が０より大きい場合は人、０以下の場合は人以外と識別されるように学習において調整され、入力画像の特徴量と重みベクトルとの内積の値がスコアを表す。よって、人と人以外のスコアを識別する閾値は原理上は０であり、通常、第一閾値Ｔ_１は０に設定される。しかし本実施形態では前述したように、人を人以外であると識別する誤りを減じるために、第一閾値Ｔ_１を０よりも小さな値に設定しておく。 The index value calculation function storage unit 40 uses a feature amount extracted by each block in the input image to obtain a score, which is an index value representing the likelihood that the target exists in the window area set in the input image. calculated for the index value calculating function for, and the first thresholds T ₁ are stored in advance. As described above, the index value calculation function is a discriminator. Specifically, the index value calculation function is obtained by applying a support vector machine (SVM) to learning images of human beings and learning images other than human beings. The parameters of the discriminator are stored in the index value calculation function storage unit 40. When linear SVM is used as the learning algorithm, the parameter of the discriminator is a weight vector generated from the learning image. This weight vector is a weight for each element of the feature amount. The weight vector is adjusted in learning so that the inner product of the weight vector and the feature amount extracted from the learning image is identified as a person when it is greater than 0, and when it is equal to or less than 0, it is identified as a person other than the person. The value of the inner product of the quantity and the weight vector represents the score. Therefore, the threshold for discriminating the score between a person and a person other than a person is 0 in principle, and the first threshold T ₁ is normally set to 0. However, as described above in the present embodiment, in order to reduce the error identified as other than human human, leave the first thresholds T ₁ is set to a value less than 0.

識別器の学習アルゴリズムにはＳＶＭの他、アダブースト（AdaBoost）法など、従来知られた各種のものを用いることができる。 As the learning algorithm of the discriminator, various conventionally known ones such as the AdaBoost method can be used in addition to the SVM.

また、識別器の代わりにパターンマッチング器を用いることもでき、その場合、スコアは人の学習用画像から抽出した特徴量の平均パターンと入力画像の特徴量との距離の逆数などとなり、指標値算出関数は当該スコアを出力値とし入力画像の特徴量を入力値とする関数とすることができる。 In addition, a pattern matching device can be used instead of the discriminator. In this case, the score is the reciprocal of the distance between the average pattern of feature values extracted from the human learning image and the feature value of the input image, and the index value. The calculation function can be a function having the score as an output value and the feature quantity of the input image as an input value.

指標値格納部４１は、指標値算出部３２で算出された各窓領域の情報を格納する。窓領域の情報は当該窓領域の矩形情報（入力画像における矩形の位置及び寸法）とスコアを対応付けた情報である。 The index value storage unit 41 stores information on each window area calculated by the index value calculation unit 32. The window area information is information in which rectangular information of the window area (position and size of the rectangle in the input image) is associated with the score.

領域グループ格納部４２は、領域グループ生成部３３で生成された各領域グループの情報を格納する。領域グループの情報は、領域グループのラベル番号（識別符号）と領域グループに帰属する候補領域の数と各候補領域の情報とを対応付けた情報である。候補領域の情報は、矩形情報（入力画像における矩形の位置及び寸法）とスコアである。格納された領域グループの情報のうち削除条件に合致したものは領域グループ削除部３４により削除される。 The area group storage unit 42 stores information on each area group generated by the area group generation unit 33. The area group information is information in which the label number (identification code) of the area group, the number of candidate areas belonging to the area group, and information on each candidate area are associated with each other. The candidate area information is rectangular information (rectangular position and size in the input image) and score. Of the stored area group information, the area group deletion unit 34 deletes the information that matches the deletion condition.

対象領域格納部４３は、対象領域決定部３５により最終的に人物がいると判定された人物領域の情報を格納する。人物領域の情報は、候補領域の情報と同様、入力画像における人物領域の矩形情報（矩形の位置、及び寸法）とスコアとを対応付けた情報である。 The target area storage unit 43 stores information on a person area that is finally determined to have a person by the target area determination unit 35. The person area information is information in which rectangular information (rectangle position and size) of the person area in the input image is associated with the score, as in the candidate area information.

出力部５は対象領域決定部３５の結果を受けて、ディスプレイなどの外部表示装置に入力画像と共に異常発生の旨を表示し、または、異常信号をセンタ装置へ送出するといった警報出力を行う。 The output unit 5 receives the result of the target area determination unit 35 and displays an alarm occurrence on the external display device such as a display together with the input image, or outputs an alarm signal such as sending an abnormality signal to the center device.

[動作例]
次に人物検出装置１の動作を説明する。図３は人物検出装置１の概略の動作を示すフロー図である。制御部３は画像入力部２から画像を入力されると（ステップＳ１０）、画像縮小部３０により、入力画像を複数の倍率それぞれで縮小して縮小画像を作成する（ステップＳ２０）。例えば、図２に示したように、入力画像１００から縮小画像１１０，１２０が生成される。 [Example of operation]
Next, the operation of the person detection device 1 will be described. FIG. 3 is a flowchart showing a schematic operation of the person detection apparatus 1. When an image is input from the image input unit 2 (step S10), the control unit 3 creates a reduced image by reducing the input image at a plurality of magnifications by the image reduction unit 30 (step S20). For example, as illustrated in FIG. 2, reduced images 110 and 120 are generated from the input image 100.

特徴量抽出部３１は入力画像及び複数の縮小画像それぞれについて、画像内の各所において特徴量を抽出する（ステップＳ３０）。 The feature amount extraction unit 31 extracts a feature amount at each place in the image for each of the input image and the plurality of reduced images (step S30).

指標値算出部３２は、特徴量抽出部３１で抽出された特徴量と指標値算出関数格納部４０に格納されている識別器とにより画像内の各所に設定する窓領域に対応したスコアを算出し指標値格納部４１に格納する（ステップＳ４０）。 The index value calculation unit 32 calculates a score corresponding to the window region set in each place in the image by using the feature amount extracted by the feature amount extraction unit 31 and the classifier stored in the index value calculation function storage unit 40. And stored in the index value storage unit 41 (step S40).

領域グループ生成部３３は、指標値算出部３２で算出されたスコアに基づき人物の候補領域を抽出し、重複する複数の候補領域からなる領域グループを生成して領域グループ格納部４２に格納する（ステップＳ５０）。 The area group generation unit 33 extracts a person candidate area based on the score calculated by the index value calculation unit 32, generates an area group including a plurality of overlapping candidate areas, and stores the area group in the area group storage unit 42 ( Step S50).

図４は領域グループ生成部３３の概略の処理フロー図である。図４を用いて領域グループ生成部３３の動作について説明する。 FIG. 4 is a schematic process flow diagram of the area group generation unit 33. The operation of the area group generation unit 33 will be described with reference to FIG.

領域グループ生成部３３は指標値格納部４１を参照し、スコアが第一閾値Ｔ_１以上である窓領域を候補領域として抽出する（ステップＳ５００）。 Region group generation unit 33 refers to the index value storage section 41, extracts window regions score is first thresholds T ₁ or more as a candidate region (step S500).

図２では、候補領域の例を窓領域１０１に応じた大きさの実線の矩形で示している。画像１００では左側の小さな（遠くの）人物像の辺りに候補領域１０２ａ，１０２ｂが抽出されている。また、画像１１０では中央のバス停標識の辺りに候補領域１１２ａ，１１２ｂが検出され、画像１２０では右側の大きな（近くの）人物像の辺りに候補領域１２２ａ〜１２２ｄが抽出されている。なお、図２に示すように、人物などの１つの像に対し、重複した複数の候補領域が抽出され得る。 In FIG. 2, examples of candidate areas are indicated by solid-line rectangles having a size corresponding to the window area 101. In the image 100, candidate regions 102a and 102b are extracted around a small (far) person image on the left side. In the image 110, candidate areas 112a and 112b are detected around the central bus stop sign, and in the image 120, candidate areas 122a to 122d are extracted around a large (near) human image on the right side. As shown in FIG. 2, a plurality of overlapping candidate regions can be extracted for one image such as a person.

図５は領域グループ生成部３３により抽出された候補領域に対する後続処理を説明する模式的な画像である。なお、図５の画像は図２に示したものと同じ内容が映っており、図５（ａ）の画像１３０は、画像１００，１１０，１２０の候補領域を１つの画像上にまとめて表示したものである。画像１３０は入力画像１００と等倍のサイズであり、画像１００の候補領域１０２ａ，１０２ｂはそのままの倍率で画像１３０上の候補領域１３１ａ，１３１ｂとなる。一方、縮小画像における候補領域１１２ａ，１１２ｂ，１２２ａ〜１２２ｄそれぞれは入力画像１００の倍率に正規化された候補領域１３２ａ，１３２ｂ，１３３ａ〜１３３ｄとなる。 FIG. 5 is a schematic image for explaining the subsequent process for the candidate area extracted by the area group generation unit 33. The image of FIG. 5 shows the same content as that shown in FIG. 2, and the image 130 of FIG. 5A displays the candidate areas of the images 100, 110, and 120 together on one image. Is. The image 130 is the same size as the input image 100, and the candidate areas 102a and 102b of the image 100 become candidate areas 131a and 131b on the image 130 at the same magnification. On the other hand, the candidate areas 112a, 112b, 122a to 122d in the reduced image are candidate areas 132a, 132b, 133a to 133d normalized to the magnification of the input image 100, respectively.

領域グループ生成部３３は抽出した候補領域をスコアの降順に並べ替え（ステップＳ５０１）、全候補領域についてラベル情報をラベル番号が未割当であることを示す状態に設定する（ステップＳ５０２）。 The area group generation unit 33 rearranges the extracted candidate areas in descending order of the scores (step S501), and sets the label information for all candidate areas to a state indicating that the label number is not assigned (step S502).

領域グループ生成部３３は、ラベル番号を０から順次、インクリメントして設定する。そこで、現在のラベル番号を初期値０に設定する（ステップＳ５０３）。 The area group generation unit 33 increments and sets the label number sequentially from 0. Therefore, the current label number is set to the initial value 0 (step S503).

領域グループ生成部３３はラベル番号が未割当の候補領域があるかどうかチェックする（ステップＳ５０４）。未割当の候補領域がある場合は（ステップＳ５０４にて「ＹＥＳ」の場合）、未割当の候補領域の中からスコアが最大になるもの（候補領域Ａとする）を選択し（ステップＳ５０５）、現在のラベル番号を付与する（ステップＳ５０６）。 The area group generation unit 33 checks whether there is a candidate area to which the label number is not assigned (step S504). If there is an unassigned candidate area (in the case of “YES” in step S504), the unassigned candidate area having the highest score (candidate area A) is selected (step S505), The current label number is assigned (step S506).

そして候補領域Ａを比較の基準として、ラベル未割当の候補領域を１つずつ比較相手として繰り返されるループ処理（Ｓ５０７〜Ｓ５１１）が行われる。当該ループ処理では比較相手として選択されていない候補領域を順次選択し（ステップＳ５０７）、比較相手として選択された候補領域Ｂと、候補領域Ａとの重複度を計算し（ステップＳ５０８）、重複度が予め定められたグループ判定閾値より大きいか否かを判定する（ステップＳ５０９）。 Then, a loop process (S507 to S511) is performed in which the candidate area A is used as a reference for comparison, and the candidate areas that have not been assigned labels are compared one by one. In the loop processing, candidate regions that are not selected as comparison partners are sequentially selected (step S507), the degree of overlap between the candidate region B selected as the comparison partner and the candidate region A is calculated (step S508), and the degree of overlap is calculated. Is larger than a predetermined group determination threshold value (step S509).

重複度は、例えば、(入力画像中での候補領域Ａと候補領域Ｂとの共通領域の面積) / (入力画像中での候補領域Ａ及び候補領域Ｂの面積のうち小さい方)で計算される。また、(入力画像中での候補領域Ａと候補領域Ｂとの共通領域の面積) / (入力画像中での候補領域Ａと候補領域Ｂとの和領域の面積)で重複度を計算することもできる。例えば、グループ判定閾値は０．５に設定することができる。 The degree of overlap is calculated by, for example, (the area of the common area between candidate area A and candidate area B in the input image) / (the smaller of the areas of candidate area A and candidate area B in the input image). The Also, the degree of overlap is calculated by (area of the common area between candidate area A and candidate area B in the input image) / (area of the sum area of candidate area A and candidate area B in the input image). You can also. For example, the group determination threshold can be set to 0.5.

重複度がグループ判定閾値より大きい場合は（ステップＳ５０９にて「ＹＥＳ」の場合）、候補領域Ｂに候補領域Ａと同じラベル番号を付与し（ステップＳ５１０）、当該候補領域Ｂについての処理を終えステップＳ５０７に戻る。一方、重複度がグループ判定閾値以下の場合は（ステップＳ５０９にて「ＮＯ」の場合）、候補領域Ｂはラベル番号を未割当の状態のままとして当該候補領域Ｂについての処理を終えステップＳ５０７に戻る。 If the degree of overlap is greater than the group determination threshold value (if “YES” in step S509), the same label number as the candidate area A is assigned to the candidate area B (step S510), and the process for the candidate area B is finished. The process returns to step S507. On the other hand, if the degree of overlap is equal to or less than the group determination threshold value (in the case of “NO” in step S509), the candidate area B is left unallocated with the label number, and the process for the candidate area B is finished and the process returns to step S507. Return.

或る候補領域Ａについて未割当の候補領域すべてとの比較が終了した場合、つまりステップＳ５０７で未処理の候補領域が存在せず選択できなかった場合（ステップＳ５１１にて「ＮＯ」の場合）、現在のラベル番号をインクリメントし（ステップＳ５１２）、ステップＳ５０４に戻り、新たな候補領域Ａを選択してステップＳ５０５〜Ｓ５１２の処理を繰り返す。 When the comparison with all the unallocated candidate areas for a certain candidate area A is completed, that is, when an unprocessed candidate area does not exist in step S507 and cannot be selected (in the case of “NO” in step S511), The current label number is incremented (step S512), the process returns to step S504, a new candidate area A is selected, and the processes of steps S505 to S512 are repeated.

一方、候補領域に対してラベル番号の付与がすべて終了した場合、つまり未割当の候補領域が無い場合は、（ステップＳ５０４で「ＮＯ」の場合）、同じラベルを付与された候補領域同士を領域グループとして領域グループ格納部４２に格納して（ステップＳ５１３）、グループ生成処理を終了し図３のステップＳ６０に処理を移行する。 On the other hand, when all label numbers are assigned to the candidate areas, that is, when there is no unallocated candidate area (in the case of “NO” in step S504), the candidate areas to which the same label is assigned are defined as areas. The group is stored as a group in the area group storage unit 42 (step S513), the group generation process is terminated, and the process proceeds to step S60 in FIG.

なお、上述のように、スコアが高い候補領域を優先してグループの核に設定することにより、近接する複数の人物に係る候補領域が１つのグループとなることを回避することが期待できる。 As described above, by setting a candidate area having a high score as the core of a group with priority, it can be expected that candidate areas relating to a plurality of adjacent persons are prevented from forming one group.

領域グループ生成部３３の処理の結果、例えば、図５（ａ）の画像１３０における候補領域１３３ａ〜１３３ｄがラベル番号“０”のグループとなり、候補領域１３１ａ，１３１ｂがラベル番号“１”のグループとなり、候補領域１３２ａ，１３２ｂがラベル番号“２”のグループとなる。 As a result of the processing of the area group generation unit 33, for example, the candidate areas 133a to 133d in the image 130 in FIG. 5A become a group with a label number “0”, and the candidate areas 131a and 131b become a group with a label number “1”. The candidate areas 132a and 132b form a group with the label number “2”.

なお、候補領域の重心と寸法をパラメータとするクラスタリングによっても重複度に基づくグループ生成を行うことができる。 Note that group generation based on the degree of overlap can also be performed by clustering using the centroid and dimensions of candidate regions as parameters.

領域グループ削除部３４は、領域グループ格納部４２に格納された領域グループのうち、上述した削除条件、すなわち要件（Ｒ１）及び（Ｒ２）に合致するものを領域グループ格納部４２から削除する（ステップＳ６０）。図６は領域グループ削除部３４の概略の処理フロー図である。図６を用いて領域グループ削除部３４の動作について説明する。 The area group deletion unit 34 deletes, from the area group storage unit 42, the area groups stored in the area group storage unit 42 that meet the above-described deletion conditions, that is, the requirements (R1) and (R2) (step S40). S60). FIG. 6 is a schematic process flow diagram of the area group deletion unit 34. The operation of the area group deletion unit 34 will be described with reference to FIG.

領域グループ削除部３４は領域グループ格納部４２に格納されている領域グループを１つずつ処理対象として選択してステップＳ６０１〜Ｓ６０３の処理を行い（ステップＳ６００にて「ＹＥＳ」の場合）、未処理の領域グループがなくなるとグループ削除処理を終了し図３のステップＳ７０に処理を移行する（ステップＳ６００にて「ＮＯ」の場合）。 The region group deletion unit 34 selects the region groups stored in the region group storage unit 42 as processing targets one by one and performs the processing of steps S601 to S603 (in the case of “YES” in step S600), unprocessed When there is no more area group, the group deletion process is terminated and the process proceeds to step S70 in FIG. 3 (in the case of “NO” in step S600).

領域グループ削除部３４は処理対象の領域グループについて、例えばまず要件（Ｒ１）を満たすか否か、つまり当該グループに帰属する候補領域の個数が予め定められた個数閾値Ｍ以下か否かを判定する（ステップＳ６０１）。個数閾値Ｍ以下の場合は（ステップＳ６０１にて「ＹＥＳ」の場合）、さらにスコアに関する要件（Ｒ２）を満たすか否か、すなわち領域グループに帰属する全ての候補領域のスコアが第二閾値Ｔ_２以下か否か判定する（ステップＳ６０２）。なお、第二閾値Ｔ_２は既に述べたように、候補領域を抽出するための第一閾値Ｔ_１よりも高く定められた閾値である。 For example, the region group deletion unit 34 first determines whether or not the region group to be processed satisfies the requirement (R1), that is, whether or not the number of candidate regions belonging to the group is equal to or less than a predetermined number threshold M. (Step S601). If the number is equal to or less than the number threshold M (in the case of “YES” in step S601), whether the score requirement (R2) is further satisfied, that is, the scores of all candidate regions belonging to the region group are the second threshold T _2. It is determined whether it is below (step S602). Note that as the second threshold value T ₂ has already been mentioned, it is a high-determined threshold than the first thresholds T ₁ for extracting a candidate region.

ステップＳ６０２にてスコアが第二閾値Ｔ_２以下の場合は（ステップＳ６０２にて「ＹＥＳ」の場合）、当該領域グループは削除条件である要件（Ｒ１）及び（Ｒ２）を満たすので、領域グループ削除部３４は当該領域グループを領域グループ格納部４２より削除する（ステップＳ６０３）。 If the score is the second threshold value _{T 2} or less at step S602 (if at step S602 is "YES"), because the area group meets the requirements a deletion condition (R1) and (R2), the area group deletion The unit 34 deletes the area group from the area group storage unit 42 (step S603).

一方、要件（Ｒ１）を満たさない場合（ステップＳ６０１にて「ＮＯ」の場合）、及び要件（Ｒ２）を満たさない場合（ステップＳ６０２にて「ＮＯ」の場合）は、領域グループ削除部３４は当該領域グループを削除せず、ステップＳ６００に戻り、次の領域グループを処理対象とする。 On the other hand, if the requirement (R1) is not satisfied (in the case of “NO” in step S601) and the requirement (R2) is not satisfied (in the case of “NO” in step S602), the region group deletion unit 34 The area group is not deleted, and the process returns to step S600 to set the next area group as a processing target.

図５（ｂ）の画像１４０は画像１３０に対する領域グループ削除部３４の処理結果を示しており、例えば、個数閾値Ｍを２として、以下にその処理と効果を説明する。 The image 140 in FIG. 5B shows the processing result of the area group deletion unit 34 for the image 130. For example, assuming that the number threshold M is 2, the processing and effects will be described below.

候補領域１３３ａ〜１３３ｄからなる領域グループ“０”は候補領域の数が４であるので要件（Ｒ１）を満たさず、よって削除されない。このように十分に多い候補領域が抽出される典型的な人物領域グループは、グループを構成する候補領域のスコアと第二閾値Ｔ_２との関係によらず適切に残すことができる。 The area group “0” composed of the candidate areas 133a to 133d does not satisfy the requirement (R1) because the number of candidate areas is 4, and thus is not deleted. The typical human region group sufficiently large candidate regions are extracted as may be appropriate to leave regardless of the relationship between the score and the second threshold value T ₂ of the candidate regions that constitute the group.

一方、人物に起因する候補領域１３１ａ，１３１ｂからなる領域グループ“１”は、候補領域の数が２であり要件（Ｒ１）を満たし、背景に起因する候補領域１３２ａ，１３２ｂからなる領域グループ“２”もまた候補領域の数が２であり要件（Ｒ１）を満たす。従来技術のように要件（Ｒ１）のみで削除条件を構成すると、背景に起因する領域グループ“２”は削除できるものの、人物に起因する領域グループ“１”までも削除されてしまう。 On the other hand, the area group “1” composed of the candidate areas 131a and 131b attributed to the person satisfies the requirement (R1) because the number of candidate areas is 2, and the area group “2” composed of the candidate areas 132a and 132b attributed to the background. "" Also satisfies the requirement (R1) because the number of candidate regions is two. If the deletion condition is configured only by the requirement (R1) as in the prior art, the area group “2” caused by the background can be deleted, but the area group “1” caused by the person is also deleted.

しかし、要件（Ｒ２）を削除条件に含んだ本発明を適用することにより、これらを適切に弁別できる。すなわち、人物に起因する領域グループ“１”においては好適に真の人物領域を捉える位置にある候補領域１３１ａのスコアが第二閾値Ｔ_２を超えるため要件（Ｒ２）を満たさず、よって削除されない。他方、領域グループ“２”においては各候補領域１３２ａ，１３２ｂが背景に起因するためスコアが第二閾値Ｔ_２以下となり要件（Ｒ２）を満たし、よって削除される。 However, by applying the present invention in which the requirement (R2) is included in the deletion condition, these can be appropriately distinguished. In other words, preferably not satisfy the requirement (R2) for the score of the candidate region 131a in a position to capture the true person area exceeds a second threshold value T ₂ are in the area group "1" due to the person, thus not deleted. On the other hand, the candidate region 132a in the area group "2", 132b meets the requirements score for due to the background becomes the second threshold value _{T 2} or less (R2), thus being removed.

上述した領域グループ削除部３４の処理が終わると、対象領域決定部３５は最終的な人物領域を求めて対象領域格納部４３に格納する（図３のステップＳ７０）。 When the processing of the region group deletion unit 34 described above is completed, the target region determination unit 35 obtains a final person region and stores it in the target region storage unit 43 (step S70 in FIG. 3).

図５（ｃ）の画像１５０は画像１４０に対する対象領域決定部３５の処理例を示しており、領域グループ“０”，“１”それぞれから、スコアが最大となる候補領域１３３ａ，１３１ａが人物領域として選択されている。 An image 150 in FIG. 5C shows a processing example of the target area determination unit 35 for the image 140. The candidate areas 133a and 131a having the maximum score from the area groups “0” and “1” are person areas. As selected.

ステップＳ７０にて人物領域の算出後、画像中に人物が１人でもいた場合（ステップＳ８０にて「ＹＥＳ」の場合）、例えば、出力部５は検出された人物領域の情報と当該人物領域が検出された入力画像とを含めた異常信号をセンタ装置に送出する（ステップＳ９０）。 If there is even one person in the image after the calculation of the person area in step S70 (in the case of “YES” in step S80), for example, the output unit 5 includes information on the detected person area and the person area. An abnormal signal including the detected input image is sent to the center device (step S90).

以上、実施形態を用いて説明した本発明では、領域グループに含まれる候補領域の個数と候補領域のスコアとを併用することで、人物領域の最終的な検出結果の精度を向上させる。具体的には、領域グループに含まれる候補領域の個数を第一の指標として、個数が閾値Ｍを下回ることを領域グループの削除条件の１つの要件（Ｒ１）とする。さらに、候補領域を見つけたときのスコアの閾値Ｔ_１よりも高く定めた閾値Ｔ_２を第二の指標として、候補領域のスコアがいずれも第二の指標を下回ることを領域グループの削除条件の他の１つの要件（Ｒ２）とする。これら要件（Ｒ１）及び（Ｒ２）を両方満たす領域グループを削除することで、対象の検出し損ねを防止しつつ背景の誤検出を的確に減じ、最終的な検出結果の精度を向上させることができる。 As described above, in the present invention described using the embodiment, the accuracy of the final detection result of the human region is improved by using the number of candidate regions included in the region group and the score of the candidate region together. Specifically, with the number of candidate regions included in the region group as a first index, one requirement (R1) of the region group deletion condition is that the number is below the threshold M. Further, as the higher second indicator threshold T ₂ which defines than the threshold T ₁ of the score when you find the candidate region, neither the score of the candidate region is deletion condition region group that below a second indicator Let it be another requirement (R2). By deleting a region group that satisfies both of these requirements (R1) and (R2), it is possible to accurately reduce background misdetection while preventing failure to detect the target and improve the accuracy of the final detection result. it can.

［他の実施形態］
以下、本発明の他の実施形態について上記実施形態との相違点のみを説明する。上記実施形態において領域グループ削除部３４は１つの第二閾値Ｔ_２を用いた削除条件に基づいて背景に起因する領域グループを削除したが、別の実施形態として、要件（Ｒ２）における第二閾値Ｔ_２を、領域グループに含まれる候補領域の個数ｍに応じて変化させる構成とすることができる。例えば、要件（Ｒ２）において、個数閾値Ｍ以下の候補領域の個数ｍに対して第二閾値Ｔ_２が複数段階に設定される。 [Other Embodiments]
Hereinafter, only different points of the other embodiments of the present invention from the above embodiments will be described. Area group deletion unit 34 in the above embodiment has been deleted region group due to background based on the deletion condition with one second threshold value T ₂ but, as another embodiment, the second threshold value in the requirement (R2) T ₂ may be configured to change according to the number m of candidate regions included in the region group. For example, in the requirement (R2), the second threshold value T ₂ is set in a plurality of steps with respect to the number m of the following candidate area number threshold M.

具体的には、個数閾値Ｍ以下の候補領域の個数ｍに対して設定される複数の第二閾値Ｔ_２の設定値は、個数ｍが多いほど小さく定められる。例えば、個数ｍについて個数閾値Ｍ以下の範囲を上から順に区間ｂ_Ｈ、区間ｂ_Ｍ、区間ｂ_Ｌの３つに区切り、区間ｂの第二閾値Ｔ_２の設定値をρ（ｂ）と表すと、Ｔ_１＜ρ（ｂ_Ｈ）＜ρ（ｂ_Ｍ）＜ρ（ｂ_Ｌ）を満たすように設定される。ちなみに、各区間に含まれる個数ｍの値は１つでも複数でもよい。 Specifically, the setting values of a plurality of second threshold T ₂ set for the number m of the following candidate area number threshold value M is determined smaller the larger the number m. For example, for the number m, the range below the number threshold M is divided into the section b _H , the section b _M , and the section b _L in order from the top, and the set value of the second threshold T _{2 in} the section b is expressed as ρ (b). And T ₁ <ρ (b _H ) <ρ (b _M ) <ρ (b _L ). Incidentally, the number m included in each section may be one or plural.

領域グループ削除部３４は要件（Ｒ１）を満たす領域グループに対して、当該領域グループに帰属する候補領域の個数ｍに応じた第二閾値Ｔ_２にて要件（Ｒ２）を満たすか否かを判定する。すなわち、第二閾値Ｔ_２は個数ｍが多いときほど低くなるように定められ、領域グループ削除部３４は領域グループについて、個数ｍが予め定められた閾値Ｍ以下である場合に、帰属する全ての候補領域のスコアが個数ｍに応じた定められた第二閾値Ｔ_２以下であるものを背景に起因する領域グループとして削除する。 Region group deletion unit 34 to the region group that meets the requirements (R1), determine whether they meet the requirements (R2) at the second threshold value T ₂ in accordance with the number m of candidate regions that belong to the area group To do. That is, the second threshold value T ₂ is determined to be lower as when the number m is large, the area group deletion unit 34 for the area groups, when the number m is not greater than the predetermined threshold value M, all attributable score of the candidate region is deleted as an area group due to the background what is the second threshold value T ₂ or less defined according to the number m.

帰属する候補領域の個数ｍが多いほど対象領域を含む領域グループである確度が高いため、帰属する候補領域の個数ｍが多い領域グループほど第二閾値Ｔ_２を低くでき、これによって対象領域の検出し損ねを防止する効果を高めることができる。他方、帰属する候補領域の個数ｍが少ない領域グループほど高い第二閾値Ｔ_２が適用されるので、背景領域の誤検出も適確に防止できる。 For a high probability of a region group including the target region larger the number m of attributable candidate area, as the area group number m is larger attribution candidate region can be lowered a second threshold value T _2, whereby the detection of the target region The effect of preventing failure can be enhanced. On the other hand, since the number m of attribution candidate region is small region groups higher second threshold value T ₂ is applied, erroneous detection of the background area can be prevented to apply probability.

なお、上記実施形態においては検出対象を人の全身としたが、人の顔や上半身など特定の部位を検出対象としてもよく、車輌や標識など各種の物体を検出対象としてもよく、また、表情や姿勢など各種の状態を検出対象としてもよい。 In the above embodiment, the detection target is the whole body of the person, but a specific part such as a person's face or upper body may be the detection target, and various objects such as a vehicle or a sign may be the detection target. Various states such as poses and postures may be detected.

１人物検出装置、２画像入力部、３制御部、４記憶部、５出力部、３０画像縮小部、３１特徴量抽出部、３２指標値算出部、３３領域グループ生成部、３４領域グループ削除部、３５対象領域決定部、４０指標値算出関数格納部、４１指標値格納部、４２領域グループ格納部、４３対象領域格納部。 DESCRIPTION OF SYMBOLS 1 Person detection apparatus, 2 Image input part, 3 Control part, 4 Storage part, 5 Output part, 30 Image reduction part, 31 Feature-value extraction part, 32 Index value calculation part, 33 Area group generation part, 34 Area group deletion part , 35 target area determination unit, 40 index value calculation function storage unit, 41 index value storage unit, 42 area group storage unit, 43 target area storage unit.

Claims

A target detection device for detecting a target region where a predetermined target appears in an input image,
An index value calculation function for calculating an index value representing the likelihood that the target exists in a region of interest set in the input image using feature amounts extracted at various points in the input image is stored in advance. Storage unit
An index value calculation unit that sets the attention area at a plurality of positions in the input image and calculates the index value in the attention area by the index value calculation function;
A region composed of a plurality of candidate regions satisfying a predetermined overlapping relationship between the candidate regions and extracting the region of interest whose index value is equal to or greater than a predetermined first threshold as a candidate region An area group generation unit for generating a group;
In the region group, the number of candidate regions to which the group belongs is equal to or less than a predetermined number threshold value, and the index value of all the candidate regions to which it belongs is equal to or less than a second threshold value set higher than the first threshold value. An area group deletion section for deleting a certain thing,
A target region determination unit that determines the target region from each of the region groups that have not been deleted by the region group deletion unit;
An object detection apparatus comprising:

When the number of the candidate regions belonging to the region group is equal to or less than the number threshold, the second threshold is set according to the number, and each setting value of the second threshold corresponds to the setting value The object detection apparatus according to claim 1, wherein the value is smaller as the number is larger.