JP2011039994A

JP2011039994A - Component detector, component detection method, program, and recording medium

Info

Publication number: JP2011039994A
Application number: JP2009189402A
Authority: JP
Inventors: Kazuya Ueki; 一也植木
Original assignee: NEC Solution Innovators Ltd
Current assignee: NEC Solution Innovators Ltd
Priority date: 2009-08-18
Filing date: 2009-08-18
Publication date: 2011-02-24
Anticipated expiration: 2029-08-18
Also published as: JP5431830B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a component detector for accurately and easily detecting a component even when there is a part hidden in an image to be identified. <P>SOLUTION: The component detector includes: a target image acquiring means 111 for acquiring an image to be identified, a target region detecting means 121 for detecting a region to be identified from the target image by referring to a region extracting model created from region teacher data 141 obtained in advance, and a component detecting means 122 for detecting a component included in the target region by referring to a component extracting model created from component teacher data 142 obtained in advance. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は、部品検出装置、部品検出方法、プログラムおよび記録媒体に関する。 The present invention relates to a component detection apparatus, a component detection method, a program, and a recording medium.

画像から特定の顔部品を検出するシステムは、テレビ会議やテレビ電話等の各種の画像通信、データベース検索、ビル監視等のセキュリティシステム、顔による照合システム等、多方面に亘り利用されている。例えば、人物の顔画像から目、眉、鼻孔、口等の顔部品を抽出する方法が提案されている（例えば、特許文献１参照）。しかしながら、前記顔部品が隠れていると抽出が困難となったり、また、類似の顔部品が画像中に存在すると誤検出がされたりするという問題があった。 A system for detecting a specific facial part from an image is used in various fields such as various image communication such as a video conference and a video phone, a database search, a security system such as building monitoring, and a face matching system. For example, a method for extracting facial parts such as eyes, eyebrows, nostrils, and mouth from a human face image has been proposed (see, for example, Patent Document 1). However, there is a problem that extraction is difficult when the face part is hidden, and erroneous detection is performed when a similar face part exists in the image.

近年、セキュリティ上の要請から、現金自動預け払い機（ＡＴＭ）や販売時点情報管理（ＰＯＳ）システムを搭載したセルフレジの操作において、サングラス、マスク、帽子等を外すよう、呼びかけや掲示がなされたりしているが、無人の場合や時間帯によっては、必ずしも効果は上がっていない。そこで、サングラス等を着用している操作者を検出し、音声による警告や、着用中の操作の制限を行うことのできるシステムが求められている。また、防犯カメラによって撮影された大量の映像アーカイブの中から、サングラス等を着用している犯罪者と思しき人物を探索することのできるシステムが求められている。 In recent years, due to security demands, calls and notices have been made to remove sunglasses, masks, hats, etc. in the operation of self-checkout machines equipped with automatic teller machines (ATMs) and point-of-sale information management (POS) systems. However, the effect is not necessarily improved depending on the unmanned case and time zone. Therefore, there is a need for a system that can detect an operator wearing sunglasses or the like, and can perform a warning by voice or limit an operation while wearing. There is also a need for a system that can search for a person who appears to be a criminal wearing sunglasses or the like from a large number of video archives taken by a security camera.

特開平１０−３０７９２３号公報Japanese Patent Laid-Open No. 10-307923

しかしながら、例えば、顔の検出を、目を抽出することによって行う場合、（１）識別対象画像中から画像処理により目を抽出する、（２）目が抽出できなかった場合は、サングラスを抽出する、（３）サングラスも抽出できなかった場合は、帽子を抽出する等、目が隠れている場合に、隠れの原因毎に段階的に可能性のあるものを探していくアルゴリズムを構築する必要があり、煩雑である。 However, for example, when face detection is performed by extracting eyes, (1) eyes are extracted from the identification target image by image processing, and (2) sunglasses are extracted when eyes cannot be extracted. (3) If sunglasses cannot be extracted, it is necessary to construct an algorithm that searches for possible items for each cause of hiding when the eyes are hidden, such as extracting a hat. Yes, it is complicated.

そこで、本発明は、識別対象画像中に隠れている部品がある場合でも、部品を高精度かつ容易に検出可能な部品検出装置、部品検出方法、プログラムおよび記録媒体を提供することを目的とする。 Therefore, an object of the present invention is to provide a component detection apparatus, a component detection method, a program, and a recording medium that can detect a component with high accuracy and easily even when there is a component hidden in the identification target image. .

前記目的を達成するために、本発明の部品検出装置は、識別対象画像を取得する識別対象画像取得手段と、
予め取得した識別対象領域教師データから作成される識別対象領域抽出モデルを参照して、前記識別対象画像から識別対象領域を検出する識別対象領域検出手段と、
予め取得した部品教師データから作成される部品抽出モデルを参照して、前記識別対象領域に含まれる部品を検出する部品検出手段とを含むことを特徴とする。 In order to achieve the above object, the component detection apparatus of the present invention includes an identification target image acquisition unit that acquires an identification target image;
An identification target area detecting means for detecting an identification target area from the identification target image with reference to an identification target area extraction model created from the identification target area teacher data acquired in advance;
And a component detection unit that detects a component included in the identification target region with reference to a component extraction model created from component teacher data acquired in advance.

本発明の部品検出方法は、識別対象画像を取得する識別対象画像取得工程と、
予め取得した識別対象領域教師データから作成される識別対象領域抽出モデルを参照して、前記識別対象画像から識別対象領域を検出する識別対象領域検出工程と、
予め取得した部品教師データから作成される部品抽出モデルを参照して、前記識別対象領域に含まれる部品を検出する部品検出工程とを含むことを特徴とする。 The component detection method of the present invention includes an identification target image acquisition step of acquiring an identification target image,
An identification target area detection step of detecting an identification target area from the identification target image with reference to an identification target area extraction model created from the identification target area teacher data acquired in advance;
A component detection step of detecting a component included in the identification target region with reference to a component extraction model created from component teacher data acquired in advance.

本発明のプログラムは、前記本発明の部品検出方法をコンピュータに実行させることを特徴とする。 The program of the present invention causes a computer to execute the component detection method of the present invention.

本発明の記録媒体は、前記本発明のプログラムを記録していることを特徴とする。 The recording medium of the present invention records the program of the present invention.

本発明によれば、識別対象画像中に隠れている部品がある場合でも、部品を高精度かつ容易に検出可能な部品検出装置、部品検出方法、プログラムおよび記録媒体を提供することができる。 According to the present invention, it is possible to provide a component detection apparatus, a component detection method, a program, and a recording medium that can detect a component with high accuracy and easily even when there is a component hidden in the identification target image.

図１（Ａ）は、本発明の部品検出方法の一例を示すフローチャートであり、図１（Ｂ）は、本発明の部品検出装置の一例を示すブロック図である。FIG. 1A is a flowchart showing an example of the component detection method of the present invention, and FIG. 1B is a block diagram showing an example of the component detection apparatus of the present invention. 図２（Ａ）は、本発明の部品検出方法のその他の例を示すフローチャートであり、図２（Ｂ）は、本発明の部品検出装置のその他の例を示すブロック図である。FIG. 2A is a flowchart showing another example of the component detection method of the present invention, and FIG. 2B is a block diagram showing another example of the component detection apparatus of the present invention. 図３（Ａ）は、本発明の部品検出方法のさらにその他の例を示すフローチャートであり、図３（Ｂ）は、本発明の部品検出装置のさらにその他の例を示すブロック図である。FIG. 3A is a flowchart showing still another example of the component detection method of the present invention, and FIG. 3B is a block diagram showing still another example of the component detection apparatus of the present invention. 図４は、本発明における識別対象領域教師データの取得の一例について説明するための図である。FIG. 4 is a diagram for explaining an example of acquisition of identification target area teacher data according to the present invention. 図５（Ａ）〜（Ｃ）は、本発明における部品教師データの取得例について説明するための図である。FIGS. 5A to 5C are diagrams for explaining an example of acquisition of component teacher data in the present invention. 図６（Ａ）および（Ｂ）は、本発明の識別対象領域検出工程の一例について説明するための図である。6A and 6B are diagrams for explaining an example of the identification target region detection step of the present invention. 図７（Ａ）〜（Ｃ）は、本発明の部品検出工程の例について説明するための図である。7A to 7C are diagrams for explaining an example of the component detection step of the present invention. 図８は、本発明の部品検出装置を用いた部品検出システムの一例の構成を示すブロック図である。FIG. 8 is a block diagram showing a configuration of an example of a component detection system using the component detection apparatus of the present invention.

つぎに、本発明の部品検出装置、部品検出方法、プログラムおよび記憶媒体について、例をあげて説明する。ただし、本発明は、下記の例に限定されない。 Next, the component detection apparatus, component detection method, program, and storage medium of the present invention will be described with examples. However, the present invention is not limited to the following examples.

（実施形態１）
図１に、本実施形態における部品検出方法のフローチャート（同図（Ａ））および部品検出装置のブロック図（同図（Ｂ））を示す。図１（Ｂ）に示すとおり、本実施形態の部品検出装置は、識別対象画像取得手段１１１、演算手段１２０、出力手段１３１およびデータ記憶手段１４０を主要な構成部材として含む。前記データ記憶手段１４０には、予め取得した識別対象領域教師データ１４１および部品教師データ１４２が格納されており、前記演算手段１２０は、識別対象領域検出手段１２１および部品検出手段１２２を含む。前記識別対象領域検出手段１２１は、前記識別対象領域教師データ１４１と接続している。前記部品検出手段１２２は、前記部品教師データ１４２と接続している。前記識別対象画像取得手段１１１としては、例えば、ＣＣＤ（ＣｈａｒｇｅＣｏｕｐｌｅｄＤｅｖｉｃｅ）カメラ、ＣＭＯＳ（ＣｏｍｐｌｅｍｅｎｔａｒｙＭｅｔａｌＯｘｉｄｅＳｅｍｉｃｏｎｄｕｃｔｏｒ）カメラ、イメージスキャナ等があげられる。前記演算手段１２０としては、例えば、中央処理装置（ＣＰＵ）等があげられる。前記出力手段１３１としては、例えば、映像により出力するモニター（例えば、液晶ディスプレイ（ＬＣＤ）、ブラウン管（ＣＲＴ）ディスプレイ等の各種画像表示装置等）、印刷により出力するプリンター、音声により出力するスピーカー等があげられる。前記出力手段１３１は、任意の構成部材であり、本発明の部品検出装置に含まれていなくてもよいが、含まれていることが好ましい。前記データ記憶手段１４０としては、例えば、ランダムアクセスメモリ（ＲＡＭ）、読み出し専用メモリ（ＲＯＭ）、ハードディスク（ＨＤ）、光ディスク、フロッピー（登録商標）ディスク（ＦＤ）等があげられる。前記データ記憶手段１４０は、装置内蔵型であってもよいし、外部記憶装置のような外付け型であってもよい。前記識別対象画像取得手段、前記演算手段、前記出力手段および前記データ記憶手段については、後述の実施形態２、３および５においても同様である。 (Embodiment 1)
FIG. 1 shows a flowchart of the component detection method according to the present embodiment (FIG. 1A) and a block diagram of the component detection apparatus (FIG. 1B). As shown in FIG. 1B, the component detection apparatus of this embodiment includes an identification target image acquisition unit 111, a calculation unit 120, an output unit 131, and a data storage unit 140 as main components. The data storage unit 140 stores identification target area teacher data 141 and component teacher data 142 acquired in advance, and the calculation unit 120 includes an identification target region detection unit 121 and a component detection unit 122. The identification target area detection unit 121 is connected to the identification target area teacher data 141. The component detection unit 122 is connected to the component teacher data 142. Examples of the identification target image acquisition unit 111 include a CCD (Charge Coupled Device) camera, a CMOS (Complementary Metal Oxide Semiconductor) camera, and an image scanner. Examples of the arithmetic unit 120 include a central processing unit (CPU). Examples of the output means 131 include a monitor that outputs video (for example, various image display devices such as a liquid crystal display (LCD) and a cathode ray tube (CRT) display), a printer that outputs by printing, a speaker that outputs by sound, and the like. can give. The output means 131 is an arbitrary component and may not be included in the component detection apparatus of the present invention, but is preferably included. Examples of the data storage means 140 include a random access memory (RAM), a read-only memory (ROM), a hard disk (HD), an optical disk, a floppy (registered trademark) disk (FD), and the like. The data storage unit 140 may be a device built-in type or an external type such as an external storage device. The identification target image acquisition unit, the calculation unit, the output unit, and the data storage unit are the same in the second, third, and fifth embodiments described later.

本発明において、識別対象領域は、特に限定されず、例えば、人物、人物の頭部、動物、乗用車等、いかなるものであってもよい。 In the present invention, the identification target area is not particularly limited, and may be any person such as a person, a person's head, an animal, and a passenger car.

本発明において、部品も、前記識別対象領域に含まれるものであれば、特に限定されず、いかなるものであってもよい。例えば、前記識別対象領域が人物である場合、前記部品としては、例えば、頭部、胴体、左手、右手、左脚、右脚等があげられる。例えば、前記識別対象領域が人物の頭部である場合、前記部品としては、例えば、左目、右目、鼻、口、左眉、右眉、左耳、右耳等があげられる。例えば、前記識別対象領域が動物である場合、前記部品としては、例えば、頭部、胴体、脚部、尻尾等があげられる。例えば、前記識別対象領域が乗用車である場合、前記部品としては、例えば、車体、タイヤ等があげられる。 In the present invention, the component is not particularly limited as long as it is included in the identification target region, and any component may be used. For example, when the identification target region is a person, examples of the component include a head, a torso, a left hand, a right hand, a left leg, and a right leg. For example, when the identification target region is a human head, examples of the component include a left eye, a right eye, a nose, a mouth, a left eyebrow, a right eyebrow, a left ear, and a right ear. For example, when the identification target region is an animal, examples of the component include a head, a trunk, a leg, and a tail. For example, when the identification target region is a passenger car, examples of the component include a vehicle body and a tire.

以下、前記識別対象領域が人物の頭部（以下、単に「頭部」ということがある）であり、前記部品が、左目、右目、鼻および口の４つである場合を例にとり、本実施形態の部品検出装置および部品検出方法について、さらに詳細に説明する。 Hereinafter, the present embodiment will be described by taking as an example the case where the identification target area is a human head (hereinafter, simply referred to as “head”) and the parts are four of a left eye, a right eye, a nose, and a mouth. The component detection device and the component detection method of the embodiment will be described in more detail.

この例の部品検出方法は、図１（Ｂ）の部品検出装置を用いて、例えば、つぎのようにして実施する。 The component detection method of this example is implemented as follows using the component detection apparatus of FIG. 1 (B), for example.

まず、前記部品検出方法の実施に先立ち、機械学習（パターン認識）の技術を用いて学習を行う。具体的には、まず、学習用画像の前記識別対象領域（頭部）に識別対象領域教師データ１４１を付与する。この例において、前記識別対象領域教師データ１４１としては、例えば、頭部の位置、大きさ、向き等の教師データがあげられる。具体的には、例えば、図４に示すように、まず、学習用画像１０の頭部にラベリング１１を付すことで、前記頭部の位置および大きさの教師データを付与する。さらに、例えば、１度単位での向きデータ（例えば、左向き３度）、８方向に準じた向きデータ（例えば、斜め左）等の前記頭部の向きの教師データを付与する。前記頭部の位置、大きさ、向き等の教師データは、例えば、キーボード、マウス等の従来公知の入力手段を用いて、人が入力することで付与する。検出精度を考慮すると、前記学習用画像１０の数は多いほど好ましく、多数用意した学習用画像１０に、多数の頭部の位置、大きさおよび向き等の識別対象領域教師データ１４１を付与することが好ましい。 First, prior to the implementation of the component detection method, learning is performed using a machine learning (pattern recognition) technique. Specifically, first, identification target area teacher data 141 is given to the identification target area (head) of the learning image. In this example, the identification target area teacher data 141 includes teacher data such as the position, size, and orientation of the head. Specifically, for example, as shown in FIG. 4, first, by attaching a labeling 11 to the head of the learning image 10, teacher data of the position and size of the head is given. Furthermore, teacher data of the head orientation such as orientation data in units of 1 degree (for example, 3 degrees to the left) and orientation data in accordance with 8 directions (for example, diagonally left) is given. Teacher data such as the position, size, and orientation of the head is given by a person using a conventionally known input means such as a keyboard and a mouse. In consideration of detection accuracy, the number of learning images 10 is preferably as large as possible, and identification target region teacher data 141 such as the position, size, and orientation of a large number of heads is assigned to a large number of learning images 10 prepared. Is preferred.

つぎに、識別対象領域教師データ１４１が付与された識別対象領域（頭部）に含まれる顔部品（左目、右目、鼻および口）に部品教師データ１４２を付与する。この例において、前記部品教師データ１４２としては、例えば、左目、右目、鼻および口のそれぞれの位置および隠れの有無等の教師データがあげられる。具体的には、例えば、図５（Ａ）に示すように、まず、図４における頭部のラベリング１１に含まれる左目、右目、鼻および口のそれぞれに○でラベリング１２ａ〜１５ａを付すことで、前記左目、前記右目、前記鼻および前記口のそれぞれの位置の教師データ（例えば、（ｘ，ｙ）座標）およびそれらに隠れが無いとの教師データを付与する。なお、例えば、図５（Ｂ）に示すように、前記左目および前記右目がサングラスで隠れている場合、前記左目および前記右目のそれぞれに×でラベリング１２ｂおよび１３ｂを付すことで、前記左目および前記右目のそれぞれの位置の教師データ（例えば、（ｘ，ｙ）座標）およびそれらに隠れが有るとの教師データを付与する。この場合において、前記鼻および前記口のラベリングおよび教師データの付与については、前述の図５（Ａ）に示した場合と同様である。また、例えば、図５（Ｃ）に示すように、前記鼻および前記口がマスクで隠れている場合、前記鼻および前記口のそれぞれに×でラベリング１４ｂおよび１５ｂを付すことで、前記鼻および前記口のそれぞれの位置の教師データ（例えば、（ｘ，ｙ）座標）およびそれらに隠れが有るとの教師データを付与する。この場合において、前記左目および前記右目のラベリングおよび教師データの付与については、前述の図５（Ａ）に示した場合と同様である。前記左目、前記右目、前記鼻および前記口のそれぞれの位置および隠れの有無等の教師データは、例えば、キーボード、マウス等の従来公知の入力手段を用いて、人が入力することで付与する。 Next, the part teacher data 142 is assigned to the face parts (left eye, right eye, nose and mouth) included in the identification target area (head) to which the identification target area teacher data 141 is assigned. In this example, examples of the component teacher data 142 include teacher data such as the positions of the left eye, the right eye, the nose and the mouth, and the presence / absence of hiding. Specifically, for example, as shown in FIG. 5 (A), first, labeling 12a to 15a is attached with ○ to each of the left eye, right eye, nose and mouth included in the head labeling 11 in FIG. , Teacher data (for example, (x, y) coordinates) of the positions of the left eye, the right eye, the nose and the mouth, and teacher data indicating that they are not hidden. For example, as shown in FIG. 5 (B), when the left eye and the right eye are hidden by sunglasses, the left eye and the right eye are labeled by adding labeling 12b and 13b to each of the left eye and the right eye. Teacher data (for example, (x, y) coordinates) at each position of the right eye and teacher data indicating that they are hidden are assigned. In this case, the labeling of the nose and the mouth and the provision of teacher data are the same as those shown in FIG. Also, for example, as shown in FIG. 5C, when the nose and the mouth are hidden by a mask, the nose and the mouth can be obtained by attaching labeling 14b and 15b with x to the nose and the mouth, respectively. Teacher data (for example, (x, y) coordinates) at each position of the mouth and teacher data indicating that they are hidden are given. In this case, the labeling of the left eye and the right eye and the assignment of teacher data are the same as those shown in FIG. Teacher data such as the position of each of the left eye, the right eye, the nose and the mouth and the presence / absence of hiding are given by a person using a conventionally known input means such as a keyboard and a mouse.

つぎに、図１（Ａ）および（Ｂ）に示すように、前記識別対象画像取得手段１１１により、識別対象画像を取得する（ステップＳ１１）。 Next, as shown in FIGS. 1A and 1B, an identification target image is acquired by the identification target image acquisition unit 111 (step S11).

つぎに、図１（Ａ）および（Ｂ）に示すように、前記識別対象領域検出手段１２１により、予め取得した前記識別対象領域教師データ１４１から作成される識別対象領域抽出モデルを参照して、前記識別対象画像から識別対象領域データを検出する（ステップＳ２１）。この例において、前記識別対象領域データとしては、例えば、頭部の位置、大きさ、向き等のデータがあげられる。前記頭部の位置、大きさ、向きのデータの詳細は、前記頭部の位置、大きさ、向きの教師データと同様である。具体的には、例えば、図６（Ａ）に示すように、予め取得した前記識別対象領域教師データ１４１から作成される識別対象領域抽出モデルを参照して、前記識別対象画像２０の左上端から水平方向に、それを順に下の行に向かって探索するいわゆるラスタスキャンで、頭部を探索する。頭部を検出できなかった場合には、図６（Ｂ）に示すように、前記識別対象画像２０の取得サイズを変えて、予め取得した前記識別対象領域教師データ１４１から作成される識別対象領域抽出モデルを参照して、前記ラスタスキャンで、頭部を探索する。前記取得サイズの変更を、頭部が検出されるまで繰り返すことで、前記頭部の位置、大きさ、向き等のデータを検出する。 Next, as shown in FIGS. 1 (A) and 1 (B), the identification target area extraction model created from the identification target area teacher data 141 acquired in advance by the identification target area detection means 121 is referred to. Identification target area data is detected from the identification target image (step S21). In this example, examples of the identification target area data include data such as the position, size, and orientation of the head. Details of the position, size, and orientation data of the head are the same as the teacher data for the position, size, and orientation of the head. Specifically, for example, as shown in FIG. 6A, with reference to the identification target region extraction model created from the identification target region teacher data 141 acquired in advance, from the upper left end of the identification target image 20 The head is searched by a so-called raster scan in which it is searched in the horizontal direction in order toward the lower row. If the head cannot be detected, as shown in FIG. 6B, the identification target area created from the identification target area teacher data 141 acquired in advance by changing the acquisition size of the identification target image 20 With reference to the extracted model, the head is searched by the raster scan. By repeating the change of the acquisition size until the head is detected, data such as the position, size, and orientation of the head is detected.

つぎに、図１（Ａ）および（Ｂ）に示すように、前記部品検出手段１２２により、予め取得した前記部品教師データ１４２から作成される部品抽出モデルを参照して、前記識別対象領域に含まれる顔部品のデータを検出する（ステップＳ２２）。この例において、前記顔部品データとしては、例えば、左目、右目、鼻および口のそれぞれの位置および隠れの有無等のデータがあげられる。前記左目、前記右目、前記鼻および前記口のそれぞれの位置および隠れの有無のデータの詳細は、前記左目、前記右目、前記鼻および前記口のそれぞれの位置および隠れの有無の教師データと同様である。具体的には、例えば、図７（Ａ）に示すように、予め取得した前記部品教師データ１４２から作成される部品抽出モデルを参照して、左目、右目、鼻および口をそれぞれ検出する。ついで、前記左目、前記右目、前記鼻および前記口のそれぞれに○でラベリング２２ａ〜２５ａを付し、前記左目、前記右目、前記鼻および前記口のそれぞれの位置のデータ（例えば、（ｘ，ｙ）座標）およびそれらに隠れが無いとのデータを検出する。なお、例えば、図７（Ｂ）に示すように、前記左目および前記右目がサングラスで隠れている場合、前記左目および前記右目のそれぞれに×でラベリング２２ｂおよび２３ｂを付し、前記左目および前記右目のそれぞれの位置のデータ（例えば、（ｘ，ｙ）座標）およびそれらに隠れが有るとのデータを検出する。この場合において、前記鼻および前記口のデータの検出については、前述の図７（Ａ）に示した場合と同様である。また、例えば、図７（Ｃ）に示すように、前記鼻および前記口がマスクで隠れている場合、前記鼻および前記口のそれぞれに×でラベリング２４ｂおよび２５ｂを付し、前記鼻および前記口のそれぞれの位置のデータ（例えば、（ｘ，ｙ）座標）およびそれらに隠れが有るとのデータを検出する。この場合において、前記左目および前記右目のデータの検出については、前述の図７（Ａ）に示した場合と同様である。 Next, as shown in FIGS. 1A and 1B, the component detection unit 122 refers to a component extraction model created from the component teacher data 142 acquired in advance, and is included in the identification target region. The detected face part data is detected (step S22). In this example, the face part data includes, for example, data of the positions of the left eye, right eye, nose and mouth, and the presence / absence of hiding. The details of the data on the positions of the left eye, the right eye, the nose and the mouth and the presence / absence of hiding are the same as the teacher data on the positions of the left eye, the right eye, the nose and the mouth and the presence / absence of the hiding. is there. Specifically, for example, as shown in FIG. 7A, the left eye, the right eye, the nose, and the mouth are detected with reference to the part extraction model created from the part teacher data 142 acquired in advance. Then, the left eye, the right eye, the nose and the mouth are labeled with a circle 22a to 25a, respectively, and the position data of the left eye, the right eye, the nose and the mouth (for example, (x, y ) Coordinates) and data that they are not hidden are detected. For example, as shown in FIG. 7B, when the left eye and the right eye are hidden by sunglasses, the left eye and the right eye are labeled with x and labeled 22b and 23b, respectively. Data of each position (for example, (x, y) coordinates) and data indicating that they are hidden are detected. In this case, the detection of the data of the nose and the mouth is the same as the case shown in FIG. Further, for example, as shown in FIG. 7C, when the nose and the mouth are hidden by a mask, labeling 24b and 25b are attached to the nose and the mouth with x, respectively, and the nose and the mouth Data of each position (for example, (x, y) coordinates) and data indicating that they are hidden are detected. In this case, the detection of the data for the left eye and the right eye is the same as that shown in FIG.

つぎに、図１（Ａ）および（Ｂ）に示すように、前記出力手段１３１により、前記識別対象領域データおよび前記顔部品データのうち、少なくとも前記顔部品データを出力する（ステップＳ３１）。出力される前記識別対象領域データとしては、例えば、頭部の位置（例えば、（ｘ，ｙ）座標）、大きさ、向きのデータ等があげられる。出力される前記顔部品データとしては、例えば、左目および右目に隠れが有るとのデータ等があげられる。前記出力工程Ｓ３１は、任意の工程であり、本発明の部品検出方法に含まれていなくてもよいが、含まれていることが好ましい。 Next, as shown in FIGS. 1A and 1B, the output unit 131 outputs at least the face part data among the identification target region data and the face part data (step S31). Examples of the identification target region data to be output include head position (for example, (x, y) coordinates), size, and orientation data. Examples of the face part data to be output include data indicating that the left eye and the right eye are hidden. The output step S31 is an optional step and may not be included in the component detection method of the present invention, but is preferably included.

前述のように、本実施形態の部品検出装置および部品検出方法では、その実施に先立ち、機械学習（パターン認識）の技術を用いて、識別対象領域教師データおよび部品教師データを予め取得しておき、それらから作成される識別対象領域抽出モデルおよび部品抽出モデルを参照して識別対象領域データおよび顔部品データの検出を行う。このため、本実施形態の部品検出装置および部品検出方法によれば、識別対象領域データおよび顔部品データを高精度かつ容易に検出可能である。前述のとおり、本実施形態では、前記部品教師データに左目等の顔部品が隠れているものを含ませているため、識別対象画像において、前記顔部品が隠れている場合においても、前記顔部品データを高精度かつ容易に検出可能である。これらについては、後述の実施形態２および３においても同様である。 As described above, in the component detection apparatus and component detection method of this embodiment, prior to the implementation, the identification target region teacher data and the component teacher data are acquired in advance using a machine learning (pattern recognition) technique. Then, the identification target area data and the face part data are detected with reference to the identification target area extraction model and the part extraction model created from them. For this reason, according to the component detection apparatus and component detection method of the present embodiment, it is possible to easily detect the identification target area data and the facial component data with high accuracy. As described above, in the present embodiment, since the part teacher data includes a face part such as a left eye that is hidden, the face part is hidden even when the face part is hidden in the identification target image. Data can be detected with high accuracy and ease. The same applies to later-described Embodiments 2 and 3.

本実施形態では、前記学習用画像および前記識別対象画像として、一人の人物の画像を用いたが、本発明はこれに限定されない。前記学習用画像に二人以上の人物の画像を含ませ、機械学習（パターン認識）の技術を用いて、識別対象領域教師データおよび部品教師データを予め取得しておくことで、前記識別対象画像が二人以上の人物の画像であっても、それぞれの人物の識別対象領域データおよび部品データを高精度かつ容易に検出可能である。これについては、後述の実施形態２および３においても同様である。 In the present embodiment, an image of one person is used as the learning image and the identification target image, but the present invention is not limited to this. By including images of two or more persons in the learning image and using the machine learning (pattern recognition) technique, the identification target region teacher data and the component teacher data are acquired in advance, thereby the identification target image. Even if it is an image of two or more persons, it is possible to easily detect the identification target area data and the component data of each person with high accuracy. The same applies to later-described Embodiments 2 and 3.

前述のとおり、本発明においては、前記学習用画像および前記識別対象画像として、例えば、人物の画像を用いてもよい。前記学習用画像に二人の人物が重なり、脚が三本に見える画像を含ませ、機械学習（パターン認識）の技術を用いて、識別対象領域教師データおよび部品教師データを予め取得しておくことで、前記識別対象画像が二人の人物が重なり、脚が三本に見える画像であっても、前記三本の脚のうちの一本を、動物の尻尾であると誤検出するようなことがない。これについては、後述の実施形態２および３においても同様である。 As described above, in the present invention, for example, a person image may be used as the learning image and the identification target image. The learning image includes an image in which two persons are overlapped and three legs are seen, and identification target region teacher data and component teacher data are acquired in advance using a machine learning (pattern recognition) technique. Thus, even if the identification target image is an image in which two persons overlap and the legs look three, one of the three legs is erroneously detected as the tail of an animal. There is nothing. The same applies to later-described Embodiments 2 and 3.

（実施形態２）
つぎに、図２に、属性認識を含む例のフローチャート（同図（Ａ））およびブロック図（同図（Ｂ））を示す。同図において、図１と同一部分には、同一符号を付している。同図（Ｂ）に示すように、本実施形態の部品検出装置は、さらに、前記データ記憶手段１４０に、属性認識用教師データ１４３が格納され、前記演算手段１２０が、属性認識手段１２３を含む点を除き、図１（Ｂ）に示す部品検出装置と同様の構成である。前記属性認識手段１２３は、前記属性認識用教師データ１４３と接続している。 (Embodiment 2)
Next, FIG. 2 shows a flowchart (FIG. 2A) and a block diagram (FIG. 2B) of an example including attribute recognition. In this figure, the same parts as those in FIG. As shown in FIG. 5B, in the component detection apparatus of the present embodiment, attribute recognition teacher data 143 is further stored in the data storage unit 140, and the calculation unit 120 includes an attribute recognition unit 123. Except for this point, the configuration is the same as that of the component detection apparatus shown in FIG. The attribute recognition unit 123 is connected to the attribute recognition teacher data 143.

以下、実施形態１と同様に、前記識別対象領域が頭部であり、前記部品が左目、右目、鼻および口の４つである場合を例にとり、本実施形態の部品検出装置および部品検出方法について、さらに詳細に説明する。 Hereinafter, in the same manner as in the first embodiment, the component detection apparatus and the component detection method according to the present embodiment are described by taking, as an example, the case where the identification target region is the head and the components are the left eye, right eye, nose, and mouth. Will be described in more detail.

この例の部品検出方法は、図２（Ｂ）の部品検出装置を用いて、例えば、つぎのようにして実施される。 The component detection method of this example is implemented as follows using the component detection apparatus of FIG. 2 (B), for example.

まず、前記部品検出方法の実施に先立ち、機械学習（パターン認識）の技術を用いて学習を行う。具体的には、まず、実施形態１と同様にして、学習用画像の識別対象領域（頭部）に識別対象領域教師データ１４１を付与する。 First, prior to the implementation of the component detection method, learning is performed using a machine learning (pattern recognition) technique. Specifically, first, similarly to the first embodiment, the identification target area teacher data 141 is given to the identification target area (head) of the learning image.

つぎに、実施形態１と同様にして、識別対象領域教師データ１４１が付与された識別対象領域（頭部）に含まれる顔部品（左目、右目、鼻および口）に部品教師データ１４２を付与する。 Next, in the same manner as in the first embodiment, the part teacher data 142 is assigned to the facial parts (left eye, right eye, nose and mouth) included in the identification target area (head) to which the identification target area teacher data 141 is assigned. .

つぎに、前記識別対象領域教師データ１４１に属性を付与することで、属性認識用教師データ１４３を取得する。この例において、前記属性としては、例えば、被り物（例えば、帽子、フード等）の有無、眼鏡の有無、サングラスの有無、マスクの有無、性別、推定年齢等があげられる。前記属性は、例えば、キーボード、マウス等の従来公知の入力手段を用いて、人が付与する。 Next, attribute recognition teacher data 143 is acquired by assigning an attribute to the identification target region teacher data 141. In this example, the attributes include, for example, the presence / absence of a cover (eg, hat, hood, etc.), the presence / absence of glasses, the presence / absence of sunglasses, the presence / absence of a mask, gender, estimated age, and the like. The attribute is given by a person using conventionally known input means such as a keyboard and a mouse.

つぎに、図２（Ａ）および（Ｂ）に示すように、実施形態１と同様にして、前記識別対象画像取得手段１１１により、識別対象画像を取得する（ステップＳ１１）。 Next, as shown in FIGS. 2A and 2B, the identification target image is acquired by the identification target image acquisition unit 111 in the same manner as in the first embodiment (step S11).

つぎに、図２（Ａ）および（Ｂ）に示すように、実施形態１と同様にして、前記識別対象領域検出手段１２１により、予め取得した前記識別対象領域教師データ１４１から作成される識別対象領域抽出モデルを参照して、前記識別対象画像から識別対象領域データを検出する（ステップＳ２１）。 Next, as shown in FIGS. 2A and 2B, as in the first embodiment, the identification target created from the identification target area teacher data 141 acquired in advance by the identification target area detection unit 121. With reference to the region extraction model, identification target region data is detected from the identification target image (step S21).

つぎに、図２（Ａ）および（Ｂ）に示すように、実施形態１と同様にして、前記部品検出手段１２２により、予め取得した前記部品教師データ１４２から作成される部品抽出モデルを参照して、前記識別対象領域に含まれる顔部品のデータを検出する（ステップＳ２２）。 Next, as shown in FIGS. 2A and 2B, the component extraction model created from the component teacher data 142 acquired in advance by the component detection unit 122 is referred to as in the first embodiment. Then, the data of the face part included in the identification target area is detected (step S22).

つぎに、図２（Ａ）および（Ｂ）に示すように、予め取得した前記属性認識用教師データ１４３から作成される属性抽出モデルを参照して、前記顔部品の属性を認識する（ステップＳ２３）。前記属性は、前記識別対象領域教師データ１４１に付与したものと同じである。 Next, as shown in FIGS. 2A and 2B, the attribute of the face part is recognized with reference to the attribute extraction model created from the attribute recognition teacher data 143 acquired in advance (step S23). ). The attribute is the same as that given to the identification target area teacher data 141.

つぎに、図２（Ａ）および（Ｂ）に示すように、前記出力手段１３１により、前記識別対象領域データ、前記顔部品データおよび前記属性のうち、少なくとも前記顔部品データを出力する（ステップＳ３１）。出力される前記識別対象領域データおよび顔部品データは、実施形態１と同様である。出力される前記属性としては、例えば、被り物が無い、眼鏡が無い、マスクが無い等の属性があげられる。前記出力工程Ｓ３１は、任意の工程であり、本発明の部品検出方法に含まれていなくてもよいが、含まれていることが好ましい。 Next, as shown in FIGS. 2A and 2B, the output unit 131 outputs at least the face part data among the identification target area data, the face part data, and the attribute (step S31). ). The output identification target area data and face part data are the same as those in the first embodiment. Examples of the output attribute include attributes such as no cover, no glasses, and no mask. The output step S31 is an optional step and may not be included in the component detection method of the present invention, but is preferably included.

実施形態１における各顔部品の隠れの有無は、前記教師データの取得および前記データの検出に加え、もしくは代えて、本実施形態における前記属性の付与および前記属性の認識で判定してもよい。これについては、後述の実施形態３においても同様である。 Whether or not each face part is hidden in the first embodiment may be determined by adding the attribute and recognizing the attribute in the present embodiment in addition to or instead of acquiring the teacher data and detecting the data. The same applies to the third embodiment described later.

前述のように、本実施形態の部品検出装置および部品検出方法では、その実施に先立ち、機械学習（パターン認識）の技術を用いて取得した識別対象領域教師データに属性を付与した属性認識用教師データを予め取得しておき、それから作成される属性抽出モデル参照して属性を認識する。このため、本実施形態の部品検出装置および部品検出方法によれば、属性を高精度かつ容易に認識可能である。これについては、後述の実施形態３においても同様である。 As described above, in the component detection apparatus and the component detection method of the present embodiment, the attribute recognition teacher in which attributes are added to the identification target region teacher data acquired using the machine learning (pattern recognition) technique prior to the implementation. Data is acquired in advance, and attributes are recognized with reference to an attribute extraction model created therefrom. For this reason, according to the component detection apparatus and component detection method of this embodiment, an attribute can be recognized with high accuracy and easily. The same applies to the third embodiment described later.

本実施形態の部品検出装置および部品検出方法を用いれば、例えば、左目および右目に隠れが有るとの顔部品データが検出され、サングラスが有るとの属性が認識された場合に、サングラスを外すよう音声により警告したり、着用中の操作の制限を行うことのできるシステムを構築できる。また、本実施形態の部品検出装置および部品検出方法を用いれば、例えば、鼻に隠れが無く、口に隠れが有るとの顔部品データが検出され、マスクが有るとの属性が認定された場合に、マスクで鼻まで覆うよう音声により警告することのできるシステムを構築できる。さらに、本実施形態の部品検出装置および部品検出方法を用いれば、例えば、左目、右目、鼻および口に隠れが有るとの顔部品データが検出され、サングラスおよびマスクが有るとの属性が認識される識別対象画像の抽出により、犯罪者と思しき人物を探索するシステムを構築できる。これらについては、後述の実施形態３においても同様である。 Using the component detection apparatus and the component detection method of this embodiment, for example, when face component data indicating that there is a hiding in the left eye and the right eye is detected, and an attribute indicating that sunglasses is present is recognized, the sunglasses are removed. It is possible to construct a system that can warn by voice and limit operations while wearing. Further, when the component detection device and the component detection method of the present embodiment are used, for example, face component data indicating that there is no hiding in the nose and hiding in the mouth is detected, and an attribute that there is a mask is recognized. In addition, it is possible to construct a system capable of warning by voice so as to cover the nose with a mask. Furthermore, using the component detection apparatus and the component detection method of the present embodiment, for example, facial component data indicating that the left eye, right eye, nose and mouth are hidden is detected, and the attribute indicating that sunglasses and a mask are present is recognized. A system for searching for a person who seems to be a criminal can be constructed by extracting an identification target image. The same applies to Embodiment 3 described later.

本実施形態では、属性認識用教師データの取得および属性の認識を、それぞれ、部品教師データの取得および顔部品データの検出の後に行ったが、本発明はこれに限定されず、属性認識用教師データの取得および属性の認識を、それぞれ、部品教師データの取得および顔部品データの検出と同時に行ってもよい。 In this embodiment, the acquisition of attribute recognition teacher data and the recognition of attributes are performed after the acquisition of component teacher data and the detection of face component data, respectively, but the present invention is not limited to this, and the teacher for attribute recognition Data acquisition and attribute recognition may be performed simultaneously with acquisition of part teacher data and detection of face part data, respectively.

（実施形態３）
つぎに、図３に、位置合わせ（アライメント）を含む例のフローチャート（同図（Ａ））およびブロック図（同図（Ｂ））を示す。同図において、図１および２と同一部分には、同一符号を付している。同図（Ｂ）に示すように、本実施形態の部品検出装置は、さらに、前記演算手段１２０が、アライメント手段１２４を含む点を除き、図２（Ｂ）に示す部品検出装置と同様の構成である。 (Embodiment 3)
Next, FIG. 3 shows a flowchart (FIG. 3A) and a block diagram (FIG. 3B) of an example including alignment (alignment). In this figure, the same parts as those in FIGS. 1 and 2 are denoted by the same reference numerals. As shown in FIG. 2B, the component detection apparatus of the present embodiment further has the same configuration as the component detection apparatus shown in FIG. 2B except that the calculation unit 120 includes an alignment unit 124. It is.

以下、実施形態１および２と同様に、前記識別対象領域が頭部であり、前記部品が左目、右目、鼻および口の４つである場合を例にとり、本実施形態の部品検出装置および部品検出方法について、さらに詳細に説明する。 Hereinafter, as in Embodiments 1 and 2, the case where the identification target region is the head, and there are four parts, ie, the left eye, the right eye, the nose, and the mouth, is taken as an example. The detection method will be described in further detail.

この例の部品検出方法は、図３（Ｂ）の部品検出装置を用いて、例えば、つぎのようにして実施される。 The component detection method of this example is implemented as follows using the component detection apparatus of FIG. 3 (B), for example.

まず、前記部品検出方法の実施に先立ち、機械学習（パターン認識）の技術を用いて学習を行う。具体的には、まず、実施形態１および２と同様にして、学習用画像の識別対象領域（頭部）に識別対象領域教師データ１４１を付与する。 First, prior to the implementation of the component detection method, learning is performed using a machine learning (pattern recognition) technique. Specifically, first, similarly to the first and second embodiments, the identification target area teacher data 141 is assigned to the identification target area (head) of the learning image.

つぎに、実施形態１および２と同様にして、識別対象領域教師データ１４１が付与された識別対象領域（頭部）に含まれる顔部品（左目、右目、鼻および口）に部品教師データ１４２を付与する。 Next, in the same manner as in the first and second embodiments, the part teacher data 142 is applied to the face parts (left eye, right eye, nose and mouth) included in the identification target area (head) to which the identification target area teacher data 141 is assigned. Give.

つぎに、前記顔部品から位置合わせの基準となる基準顔部品（例えば、左目および右目）を選択した後、前記基準顔部品を所定の位置に固定してアライメント教師データを取得する。前記基準顔部品は、前記顔部品の中から少なくとも一つを選択すればよく、例えば、鼻、口等であってもよい。 Next, after selecting a reference face part (for example, left eye and right eye) as a reference for alignment from the face parts, the reference face part is fixed at a predetermined position, and alignment teacher data is acquired. The reference face part may be at least one selected from the face parts, and may be, for example, a nose or a mouth.

つぎに、前記アライメント教師データに属性を付与することで、属性認識用教師データ１４３を取得する。前記属性およびその付与方法は、実施形態２と同様である。 Next, attribute recognition teacher data 143 is acquired by assigning an attribute to the alignment teacher data. The attributes and the method for giving them are the same as those in the second embodiment.

つぎに、図３（Ａ）および（Ｂ）に示すように、実施形態１および２と同様にして、前記識別対象画像取得手段１１１により、識別対象画像を取得する（ステップＳ１１）。 Next, as shown in FIGS. 3A and 3B, the identification target image is acquired by the identification target image acquisition unit 111 in the same manner as in the first and second embodiments (step S11).

つぎに、図３（Ａ）および（Ｂ）に示すように、実施形態１および２と同様にして、前記識別対象領域検出手段１２１により、予め取得した前記識別対象領域教師データ１４１から作成される識別対象領域抽出モデルを参照して、前記識別対象画像から識別対象領域データを検出する（ステップＳ２１）。 Next, as shown in FIGS. 3A and 3B, similarly to Embodiments 1 and 2, the identification target area detection unit 121 creates the identification target area teacher data 141 acquired in advance. With reference to the identification target region extraction model, identification target region data is detected from the identification target image (step S21).

つぎに、図３（Ａ）および（Ｂ）に示すように、実施形態１および２と同様にして、前記部品検出手段１２２により、予め取得した前記部品教師データ１４２から作成される部品抽出モデルを参照して、前記識別対象領域に含まれる顔部品のデータを検出する（ステップＳ２２）。 Next, as shown in FIGS. 3A and 3B, in the same manner as in the first and second embodiments, a component extraction model created from the component teacher data 142 acquired in advance by the component detector 122 is obtained. Referring to the facial part data included in the identification target area, the data is detected (step S22).

つぎに、図３（Ａ）および（Ｂ）に示すように、前記顔部品から位置合わせの基準となる基準顔部品（例えば、左目および右目）を選択した後、前記基準顔部品を所定の位置に固定してアライメント画像を作成する（ステップＳ２４）。前記基準顔部品は、前記顔部品の中から少なくとも一つを選択すればよく、例えば、鼻、口等であってもよい。検出精度および検出速度を考慮すると、予め取得した前記アライメント教師データと同じ条件で、前記アライメント画像を作成することが好ましい。 Next, as shown in FIGS. 3A and 3B, after selecting a reference face part (for example, left eye and right eye) as a reference for alignment from the face part, the reference face part is moved to a predetermined position. To create an alignment image (step S24). The reference face part may be at least one selected from the face parts, and may be, for example, a nose or a mouth. In consideration of detection accuracy and detection speed, it is preferable to create the alignment image under the same conditions as the alignment teacher data acquired in advance.

つぎに、図３（Ａ）および（Ｂ）に示すように、予め取得した前記属性認識用教師データ１４３から作成される属性抽出モデルを参照して、前記顔部品の属性を認識する（ステップＳ２３）。前記属性は、前記アライメント教師データに付与したものと同じである。 Next, as shown in FIGS. 3A and 3B, the attribute of the face part is recognized with reference to the attribute extraction model created from the attribute recognition teacher data 143 acquired in advance (step S23). ). The attribute is the same as that given to the alignment teacher data.

つぎに、図３（Ａ）および（Ｂ）に示すように、前記出力手段１３１により、前記識別対象領域データ、前記顔部品データおよび前記属性のうち、少なくとも前記顔部品データを出力する（ステップＳ３１）。出力される前記識別対象領域データ、顔部品データおよび属性は、実施形態１および２と同様である。前記出力工程Ｓ３１は、任意の工程であり、本発明の部品検出方法に含まれていなくてもよいが、含まれていることが好ましい。 Next, as shown in FIGS. 3A and 3B, the output unit 131 outputs at least the face part data among the identification target area data, the face part data, and the attribute (step S31). ). The output identification target area data, face part data, and attributes are the same as those in the first and second embodiments. The output step S31 is an optional step and may not be included in the component detection method of the present invention, but is preferably included.

前述のように、本実施形態の部品検出装置および部品検出方法では、その実施に先立ち、機械学習（パターン認識）の技術を用いて取得した識別対象領域教師データに位置合わせ（アライメント）を行った後、属性を付与した属性認識用教師データを予め取得しておき、それから作成される属性抽出モデルを参照して属性を認識する。このため、本実施形態の部品検出装置および部品検出方法によれば、属性をより高精度かつ迅速に認識可能である。 As described above, in the component detection apparatus and component detection method of the present embodiment, prior to the implementation, alignment (alignment) is performed on the identification target region teacher data acquired using the machine learning (pattern recognition) technique. After that, attribute recognition teacher data to which the attribute is given is acquired in advance, and the attribute is recognized with reference to the attribute extraction model created therefrom. For this reason, according to the component detection apparatus and component detection method of this embodiment, an attribute can be recognized more accurately and rapidly.

（実施形態４）
本実施形態のプログラムは、前述の部品検出方法を、コンピュータ上で実行可能なプログラムである。または、本実施形態のプログラムは、例えば、記録媒体に記録されてもよい。前記記録媒体としては、特に限定されず、例えば、読み出し専用メモリ（ＲＯＭ）、ハードディスク（ＨＤ）、光ディスク、フロッピー（登録商標）ディスク（ＦＤ）等があげられる。 (Embodiment 4)
The program of this embodiment is a program that can execute the above-described component detection method on a computer. Or the program of this embodiment may be recorded on a recording medium, for example. The recording medium is not particularly limited, and examples thereof include a read only memory (ROM), a hard disk (HD), an optical disk, a floppy (registered trademark) disk (FD), and the like.

（実施形態５）
図８に、本発明の部品検出装置を用いた部品検出システムの一例の構成を示す。図示のとおり、この部品検出システムは、識別対象画像取得手段１１１ａ、１１１ｂ、１１１ｃと、出力手段１３１ａ、１３１ｂ、１３１ｃと、通信インターフェイス１５０ａ、１５０ｂ、１５０ｃと、サーバ１７０とを備える。前記識別対象画像取得手段１１１ａおよび出力手段１３１ａは、通信インターフェイス１５０ａに接続されている。前記識別対象画像取得手段１１１ａ、出力手段１３１ａおよび通信インターフェイス１５０ａは、場所Ｘに設置されている。前記識別対象画像取得手段１１１ｂおよび出力手段１３１ｂは、通信インターフェイス１５０ｂに接続されている。前記識別対象画像取得手段１１１ｂ、出力手段１３１ｂおよび通信インターフェイス１５０ｂは、場所Ｙに設置されている。前記識別対象画像取得手段１１１ｃおよび出力手段１３１ｃは、通信インターフェイス１５０ｃに接続されている。前記識別対象画像取得手段１１１ｃ、出力手段１３１ｃおよび通信インターフェイス１５０ｃは、場所Ｚに設置されている。そして、通信インターフェイス１５０ａ、１５０ｂ、１５０ｃと、前記サーバ１７０とが、回線網１６０を介して接続されている。 (Embodiment 5)
FIG. 8 shows a configuration of an example of a component detection system using the component detection apparatus of the present invention. As illustrated, the component detection system includes identification target image acquisition units 111a, 111b, and 111c, output units 131a, 131b, and 131c, communication interfaces 150a, 150b, and 150c, and a server 170. The identification target image acquisition unit 111a and the output unit 131a are connected to a communication interface 150a. The identification target image acquisition unit 111a, the output unit 131a, and the communication interface 150a are installed in the place X. The identification target image acquisition unit 111b and the output unit 131b are connected to a communication interface 150b. The identification target image acquisition unit 111b, the output unit 131b, and the communication interface 150b are installed in the place Y. The identification target image obtaining unit 111c and the output unit 131c are connected to a communication interface 150c. The identification target image obtaining unit 111c, the output unit 131c, and the communication interface 150c are installed in the place Z. The communication interfaces 150a, 150b, and 150c are connected to the server 170 via the line network 160.

この部品検出システムでは、前記サーバ１７０側に、識別対象領域取得手段および部品検出手段を有し、前記サーバ１７０に識別対象領域教師データおよび部品教師データが格納される。例えば、場所Ｘで前記識別対象画像取得手段１１１ａを用いて取得された識別対象画像を、サーバ１７０に送信し、前記サーバ１７０側で、識別対象領域および部品を検出する。また、検出された識別対象領域および部品のうち、少なくとも部品を、前記出力手段１３１ａにより出力する。 In this component detection system, the server 170 side has an identification target region acquisition unit and a component detection unit, and the identification target region teacher data and the component teacher data are stored in the server 170. For example, the identification target image acquired by using the identification target image acquisition unit 111a at the location X is transmitted to the server 170, and the identification target region and parts are detected on the server 170 side. In addition, at least the component of the detected identification target area and component is output by the output unit 131a.

本実施形態の部品検出システムによれば、識別対象画像取得手段および出力手段を現場に設置し、サーバ等は他の場所に設置して、オンラインによる識別対象領域および部品の検出等が可能である。そのため、装置の設置に場所を取ることがなく、メンテナンスも容易である。また、各設置場所が離れている場合であっても、一箇所での集中管理や遠隔操作が可能となる。本実施形態の部分検出システムは、前述の実施形態２の属性認識および実施形態３の位置合わせ（アライメント）に対応したものであってもよい。 According to the component detection system of the present embodiment, the identification target image acquisition unit and the output unit are installed in the field, and the server or the like is installed in another place, so that the identification target region and the component can be detected online. . Therefore, the installation of the apparatus does not take a place, and maintenance is easy. Further, even when the installation locations are separated, centralized management and remote operation can be performed at one location. The partial detection system of the present embodiment may correspond to the attribute recognition of the second embodiment and the alignment (alignment) of the third embodiment.

１０学習用画像
１１、１２ａ、１３ａ、１４ａ、１５ａ、１２ｂ、１３ｂ、１４ｂ、１５ｂ、２２ａ、２３ａ、２４ａ、２５ａ、２２ｂ、２３ｂ、２４ｂ、２５ｂラベリング
２０識別対象画像
１１１識別対象画像取得手段
１２０演算手段
１２１識別対象領域検出手段
１２２部品検出手段
１２３属性認識手段
１２４アライメント手段
１３１出力手段
１４０データ記憶手段
１４１識別対象領域教師データ
１４２部品教師データ
１４３属性認識用教師データ 10 Learning images 11, 12a, 13a, 14a, 15a, 12b, 13b, 14b, 15b, 22a, 23a, 24a, 25a, 22b, 23b, 24b, 25b Labeling 20 Discrimination target image 111 Discrimination target image acquisition means 120 Calculation Means 121 Identification target area detection means 122 Component detection means 123 Attribute recognition means 124 Alignment means 131 Output means 140 Data storage means 141 Identification target area teacher data 142 Part teacher data 143 Attribute recognition teacher data

Claims

An identification target image acquisition means for acquiring an identification target image;
An identification target area detecting means for detecting an identification target area from the identification target image with reference to an identification target area extraction model created from the identification target area teacher data acquired in advance;
A component detection apparatus comprising: a component detection unit configured to detect a component included in the identification target region with reference to a component extraction model created from component teacher data acquired in advance.

further,
2. The component detection apparatus according to claim 1, further comprising attribute recognition means for recognizing an attribute of the component by referring to an attribute extraction model created from attribute recognition teacher data acquired in advance.

further,
After selecting a reference part as a reference for alignment from the parts, including an alignment means for creating an alignment image by fixing the reference part at a predetermined position,
The component detection apparatus according to claim 2, wherein the attribute of the component is recognized by the attribute recognition unit using the alignment image.

The component detection apparatus according to claim 3, wherein the alignment image is created under the same conditions as the alignment teacher data acquired in advance.

The component detection apparatus according to claim 1, wherein the identification target region is a person.

The component detection apparatus according to claim 1, wherein the identification target region is a human head.

The component detection apparatus according to claim 6, wherein the component includes at least one selected from the group consisting of a left eye, a right eye, a nose, a mouth, a left eyebrow, a right eyebrow, a left ear, and a right ear.

An identification target image acquisition step of acquiring an identification target image;
An identification target area detection step of detecting an identification target area from the identification target image with reference to an identification target area extraction model created from the identification target area teacher data acquired in advance;
A component detection method comprising: detecting a component included in the identification target region with reference to a component extraction model created from component teacher data acquired in advance.

further,
9. The component detection method according to claim 8, further comprising an attribute recognition step of recognizing the attribute of the component with reference to an attribute extraction model created from attribute recognition teacher data acquired in advance.

further,
After selecting a reference component that is a reference for alignment from the component, including an alignment step of fixing the reference component at a predetermined position and creating an alignment image,
The component detection method according to claim 9, wherein the attribute of the component is recognized in the attribute recognition step using the alignment image.

The component detection method according to claim 10, wherein the alignment image is created under the same conditions as the alignment teacher data acquired in advance.

The component detection method according to claim 8, wherein the identification target region is a person.

The component detection method according to claim 8, wherein the identification target region is a human head.

The component detection method according to claim 13, wherein the component includes at least one selected from the group consisting of a left eye, a right eye, a nose, a mouth, a left eyebrow, a right eyebrow, a left ear, and a right ear.

A program for causing a computer to execute the component detection method according to any one of claims 8 to 14.

A recording medium on which the program according to claim 15 is recorded.