JP2020087305A

JP2020087305A - Information processing apparatus, information processing method and program

Info

Publication number: JP2020087305A
Application number: JP2018225344A
Authority: JP
Inventors: 山本　貴久; Takahisa Yamamoto; 貴久山本; 俊亮中野; Toshiaki Nakano; 英生野呂; Hideo Noro; 敦夫野本; Atsuo Nomoto; 孝嗣牧田; Takatsugu Makita; 将由山▲崎▼; Masayoshi Yamazaki; 潔考高橋; Kiyotaka Takahashi
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2018-11-30
Filing date: 2018-11-30
Publication date: 2020-06-04

Abstract

To be able to suppress repetition of false recognition of a person.SOLUTION: An information processing apparatus 100 recognizes a person included in a captured image by using a list in which a face image of the person to be recognized is registered. The information processing apparatus 100 comprises: recognition means for recognizing a face image included in an image of interest as a specific person on the basis of the face image included in the image of interest and the list; and input means for inputting a recognition result for the face image by means of the recognition means is false. The recognition means controls, in recognition after receiving the input, such that the face image determined to be false is not recognized as the specific person.SELECTED DRAWING: Figure 1

Description

本発明は、画像から特定の人物の顔を認識する技術に関する。 The present invention relates to a technique of recognizing a face of a specific person from an image.

近年、撮像された画像内に写っているオブジェクトの画像を高度に処理して有用な情報を抽出する技術が多く提案されている。特にその中でも人間の顔画像を認識して、入力された顔画像（入力顔画像と呼ぶ）と、予め登録されている複数の人物の顔画像（登録顔画像と呼ぶ）とをそれぞれ照合して、入力された顔画像が誰であるか判定する顔認識に関して、盛んに研究開発されている。 In recent years, many techniques have been proposed in which an image of an object included in a captured image is highly processed to extract useful information. In particular, it recognizes human face images and compares the input face images (called input face images) with the face images of a plurality of people registered in advance (called registered face images). , Has been actively researched and developed for face recognition for determining who the input face image is.

店舗等における顔認識の用途には、事前登録されている重要顧客を認識するホワイトリスト認識、或いは要注意顧客を認識するブラックリスト認識がある。これらの認識では、認識したい人物の顔画像が予め登録されており、カメラに映った人物の顔画像と、登録済みの人物の顔画像それぞれとを照合する。カメラに映った人物の顔画像が登録済み人物のいずれかの顔画像と合致した場合には、店員や警備員等に通報される。 Face recognition in stores or the like includes whitelist recognition for recognizing pre-registered important customers or blacklist recognition for recognizing sensitive customers. In these recognitions, the face image of the person to be recognized is registered in advance, and the face image of the person reflected by the camera is compared with the face images of the registered persons. When the face image of the person reflected on the camera matches any of the face images of the registered persons, the clerk or the security guard is notified.

特開２０１２−１４１７０８号公報JP 2012-141708 A

顔認識は、顔画像が撮像された状態によっては、誤認識（他人と間違えて認識してしまうこと）が起こり得る。例えば、登録されている人物と似た顔の人物が現れた状態である。また、撮像された顔画像が小さかったり、ぶれていたりする状態である。それ以外にも、撮像された顔の向きや照明の状態によっては、誤認識が起きやすい。 In the face recognition, erroneous recognition (i.e., erroneously recognizing the face image as another person) may occur depending on the state in which the face image is captured. For example, this is a state in which a person with a face similar to the registered person appears. In addition, the captured face image is small or shaken. Other than that, erroneous recognition is likely to occur depending on the orientation of the imaged face and the state of illumination.

上記問題に対して、特許文献１では、誤認識であるとユーザが判断したフィードバックを用いて認識結果を修正する。しかしながら、上記で開示された手法では、誤認識が発生した人物が時系列的に連続して画像に現れる場合、繰り返し誤認識が発生する可能性がある。例えば、歩行している人物に対して、顔認識を行う場合、動画像の各フレーム画像に対して顔認識することが考えられる。この場合に、あるフレームでの顔認識が誤認識となると、その誤認識フレームと見た目があまり変化しない、以降のフレームでも繰り返し誤認識を起こす可能性がある。本発明は上記課題に鑑みてなされたものであり、時系列画像に写ったある人物について誤認識が繰り返し発生することを抑制する。 With respect to the above problem, in Patent Document 1, the recognition result is corrected by using the feedback that the user has determined to be erroneous recognition. However, in the method disclosed above, when a person who has been erroneously recognized appears in the image continuously in time series, erroneous recognition may repeatedly occur. For example, when face recognition is performed on a walking person, face recognition may be performed on each frame image of a moving image. In this case, if face recognition in a certain frame is erroneous recognition, there is a possibility that erroneous recognition may be repeatedly performed in subsequent frames in which appearance does not change much from that erroneously recognized frame. The present invention has been made in view of the above problems, and suppresses repeated occurrence of erroneous recognition of a person shown in a time-series image.

上記課題を解決する本発明にかかる情報処理装置は、認識対象である人物の顔画像を登録したリストを用いて、撮像画像に含まれる人物を認識する情報処理装置であって、注目画像に含まれる顔画像、および前記リストに基づいて、前記注目画像に含まれる顔画像を特定の人物として認識する認識手段と、前記認識手段による前記顔画像に対する認識結果が誤りであることを入力する入力手段とを有し、前記認識手段は、前記入力を受けた後の認識において、認識結果が誤りであると入力された前記顔画像を前記特定の人物として認識しないように制御することを特徴とする。 An information processing apparatus according to the present invention that solves the above problem is an information processing apparatus that recognizes a person included in a captured image using a list in which face images of a person who is a recognition target are registered, and is included in a target image. Recognition means for recognizing a face image included in the attention image as a specific person based on the face image and the list, and input means for inputting that the recognition result of the recognition means by the recognition means is incorrect. In the recognition after receiving the input, the recognizing unit controls the face image input that the recognition result is erroneous so as not to be recognized as the specific person. ..

時系列画像に写ったある人物について誤認識が繰り返し発生することを抑制できる。 It is possible to prevent repeated erroneous recognition of a person in a time-series image.

情報処理装置の機能構成例を示すブロック図Block diagram showing a functional configuration example of an information processing apparatus 登録人物リストについて説明する図Figure explaining the registered person list 情報処理装置が実行する処理を説明するフローチャートFlowchart explaining processing executed by the information processing apparatus 情報処理装置の機能構成例を示すブロック図Block diagram showing a functional configuration example of an information processing apparatus 登録人物リストについて説明する図Figure explaining the registered person list 情報処理装置の機能構成例を示すブロック図Block diagram showing a functional configuration example of an information processing apparatus 情報処理装置のハードウェア構成例を示す図Diagram showing an example of the hardware configuration of an information processing device ＧＵＩの一例を示す図Diagram showing an example of GUI ＧＵＩの一例を示す図Diagram showing an example of GUI 情報処理装置が実行する処理を説明するフローチャートFlowchart explaining processing executed by the information processing apparatus

＜実施形態１＞
店舗等のサービス施設において、重要顧客や要注意顧客の来店をいち早く確実に把握するために、施設内外に設置された監視カメラを使った顔認識技術が利用されている。重要顧客等の特定の人物についての顔画像と個人情報を示すＩＤを割り当てたリストをサービス施設が管理し、そのようなリストを利用して登録された人物と店内にいる顧客の顔画像との認識を行う。しかしながら、顧客は店内を自由に動き回るため、静止した状態での顧客の顔画像が取得できるとは限らない。このような状況が考えられるため、時系列で取得される画像から動き回る顧客を同一人物であることを特定したうえで、さらに特定された人物が登録された顧客であるか認識する必要がある。ここで、時系列画像から特定された特定の人物についての認識を一度失敗すると、その後に取得する画像において同様の誤認識を続けてしまう可能性がある。以下の実施形態における目的の１つは、顔認識技術において、一度誤認識してしまった場合に、その後の同一人物に対する認識処理においては、同じ誤認識を繰り返さないようにする、ということにある。そのために、一度誤認識してしまった場合、以後の同じ人物に対する顔認識では、先ほど誤認識してしまった登録人物を、もとの登録人物セットから除外したうえで、顔認識を行うようにする。 <Embodiment 1>
In service facilities such as stores, face recognition technology using surveillance cameras installed inside and outside the facility is used in order to quickly and surely know the visit of important customers or customers requiring attention. The service facility manages a list in which face images of specific persons such as important customers and IDs indicating personal information are assigned, and the person registered using such a list and the face images of customers in the store To recognize. However, since the customer freely moves around in the store, it is not always possible to acquire the face image of the customer in a stationary state. Since such a situation is possible, it is necessary to specify that the customer who moves around from the images acquired in time series is the same person, and then recognize whether the specified person is the registered customer. Here, if the recognition of the specific person specified from the time-series image fails once, there is a possibility that the same erroneous recognition may continue in the images acquired thereafter. One of the objects in the following embodiments is to prevent the same erroneous recognition from being repeated in the subsequent recognition process for the same person, once the erroneous recognition is performed in the face recognition technology. .. Therefore, if you make a mistake in recognizing once, in the face recognition for the same person after that, you should exclude the registered person who was erroneously recognized earlier from the original registered person set and then perform face recognition. To do.

これを実現するために、撮像画像中の登場人物が複数フレームに渡り同一人物であることを、顔追尾によって特定する。そして、特定した人物ごとに追尾識別子の割り当てを行う。それと同時に、追尾識別子ごとに、認識処理から除外すべき登録人物ＩＤを管理する、ということを行う。本発明にかかる実施形態を説明するのに先立ち、用語の定義について説明する。 In order to realize this, it is specified by face tracking that the persons appearing in the captured image are the same person over a plurality of frames. Then, a tracking identifier is assigned to each identified person. At the same time, the registered person ID that should be excluded from the recognition process is managed for each tracking identifier. Prior to describing the embodiments according to the present invention, the definition of terms will be described.

登録人物リストとは、不特定多数の人物から特定したい人物について、少なくとも顔の画像特徴（もしくは顔画像）とＩＤを紐づけた情報である。例えば、図２（ａ）のようなリストである。不特定多数の人物が出入りするサービス施設等において、施設管理者等のユーザにとって重要な顧客（または危険な人物）を見分ける為に用いる。登録人物リストは、予めユーザが認識したい人物についての顔画像、顔画像から抽出される所定の特徴、各顔画像を区別するためのＩＤを準備する。登録人物リストは登録された人物についてのカテゴリ情報（性別、年齢、身長等）を保持していてもよい。 The registered person list is information in which at least the facial image characteristics (or facial image) and the ID are associated with each other for a person to be identified from an unspecified number of persons. For example, the list is as shown in FIG. It is used to identify customers (or dangerous persons) who are important to users such as facility managers in service facilities where an unspecified number of people come and go. The registered person list prepares in advance a face image of a person the user wants to recognize, predetermined features extracted from the face image, and an ID for distinguishing each face image. The registered person list may hold category information (sex, age, height, etc.) about the registered persons.

追尾識別子リストとは、ある時刻での監視カメラ画像から検出された顔画像について割り当てられたＩＤと顔の画像特徴（顔画像）と、登録人物除外ＩＤとを紐づけた情報である。例えば、図２（ｂ）や図２（ｃ）のようなリストである。登録人物除外ＩＤとは、登録人物リストのうち、ある追尾識別子について、認識処理をスキップする人物と対応するＩＤである。なお、追尾識別子リストは、処理が開始される時点では情報を特に持たない。画像から顔検出がなされ、顔追尾によって新しい顔画像が検出された場合に新しい追尾識別子と除外ＩＤ（初期値はＮｏｎｅ）を更新する。 The tracking identifier list is information in which an ID assigned to a face image detected from a surveillance camera image at a certain time, a face image feature (face image), and a registered person exclusion ID are associated with each other. For example, the list is as shown in FIG. 2B or 2C. The registered person exclusion ID is an ID corresponding to a person who skips the recognition process for a certain tracking identifier in the registered person list. Note that the tracking identifier list has no particular information at the time when the processing is started. When a face is detected from the image and a new face image is detected by face tracking, a new tracking identifier and exclusion ID (initial value is None) are updated.

以下では本実施形態の顔認識を実行する情報処理装置について詳細に説明する。図１は、本発明を適用可能な実施形態を示す情報処理装置の機能構成例を示すブロック図である。情報処理装置１００は、認識対象である人物の顔を登録した登録人物リストを用いて、画像から人物を認識する。情報処理装置１００は、撮像装置２００と、表示装置３００に接続されている。情報処理装置１００は、画像入力部１０１、検出部１０２、追尾部１０３、個人特徴取得部１０４、認識部１０５、登録人物リスト記憶部１０６、追尾識別子リスト記憶部１０７、表示制御部１０８、フィードバック入力部１０９を有する。なお、ここに挙げたすべての機能構成を情報処理装置１００が有するとはかぎらない。例えば、表示制御部１０８やフィードバック入力部１０９は外部に接続された表示装置３００が有していてもよい。 The information processing apparatus that executes face recognition according to this embodiment will be described in detail below. FIG. 1 is a block diagram showing a functional configuration example of an information processing apparatus showing an embodiment to which the present invention is applicable. The information processing apparatus 100 recognizes a person from an image using the registered person list in which the face of the person to be recognized is registered. The information processing device 100 is connected to the imaging device 200 and the display device 300. The information processing apparatus 100 includes an image input unit 101, a detection unit 102, a tracking unit 103, a personal characteristic acquisition unit 104, a recognition unit 105, a registered person list storage unit 106, a tracking identifier list storage unit 107, a display control unit 108, and a feedback input. It has a part 109. It should be noted that the information processing apparatus 100 may not have all the functional configurations listed here. For example, the display control unit 108 and the feedback input unit 109 may be included in the display device 300 connected to the outside.

画像入力部１０１は、撮像装置２００によって撮像された時系列の撮像画像、つまり動画の一部である画像を入力する。本実施形態では、所定の空間にいる人物を撮像した画像（撮像画像）を入力する。所定の空間とは、店舗や公共施設等の不特定多数の人物が出入りする空間である。本実施形態では、撮像装置２００がリアルタイムに撮像中の画像が画像入力部１０１から入力されるとする。撮像装置２００はその他の構成部と物理的に離れた場所に置かれていてもよく、映像をネットワーク越しにその他の構成部に転送するような構成でもよい。また、撮像中の画像に限るものではなく、撮像済み記録画像の再生画像が画像入力部１０１から入力されてもよい。撮像された画像は、記憶装置によって保持される。 The image input unit 101 inputs a time-series captured image captured by the image capturing apparatus 200, that is, an image that is a part of a moving image. In the present embodiment, an image (captured image) obtained by capturing a person in a predetermined space is input. The predetermined space is a space where an unspecified large number of people such as shops and public facilities enter and leave. In the present embodiment, it is assumed that the image being captured by the image capturing apparatus 200 in real time is input from the image input unit 101. The imaging device 200 may be placed in a place physically separated from other components, or may be configured to transfer an image to other components via a network. Further, not limited to the image being picked up, the reproduced image of the picked-up recorded image may be input from the image input unit 101. The captured image is held by the storage device.

検出部１０２では、画像入力部１０１から入力された注目画像に対して、顔の構成要素である目や鼻の画像特徴を抽出することによって、注目画像に含まれる顔画像の検出を行う。注目画像に人物が含まれない場合は顔画像を検出しない。注目画像に複数の人物が含まれる場合は各人物の顔画像を検出する。顔の検出手法は既存の公知の手法を使えばよい。例えば、鼻、口や目などの顔画像の構成要素に相当する形状を示す画像特徴を抽出する。抽出された両目の大きさとそれらの距離から顔の大きさを推定し、鼻の中心に相当する位置を基準として、抽出された大きさの領域で囲んだ領域を顔画像とする。検出された顔画像は、後に説明する追尾処理、個人特徴抽出処理のために、その画像における画角内での位置と領域のサイズとともに追尾部１０３、個人特徴取得部１０４に入力される。 The detection unit 102 detects the face image included in the attention image by extracting the image features of the eyes and the nose, which are the constituent elements of the face, from the attention image input from the image input unit 101. If the target image does not include a person, no face image is detected. When the attention image includes a plurality of persons, the face image of each person is detected. An existing known method may be used as the face detection method. For example, the image feature indicating the shape corresponding to the constituent elements of the face image such as the nose, mouth and eyes is extracted. The size of the face is estimated from the extracted sizes of both eyes and their distances, and the region surrounded by the region of the extracted size is used as a face image with the position corresponding to the center of the nose as a reference. The detected face image is input to the tracking unit 103 and the individual feature acquisition unit 104 together with the position within the angle of view and the size of the region in the image for the tracking process and the individual feature extraction process described later.

追尾部１０３では、注目画像と前のフレームで撮像された第１の画像とに基づいて、注目画像に含まれる顔画像と第１の画像に含まれる顔画像とが、所定の時間内で人物が移動可能な範囲にある場合、顔画像に同一の識別子を付与することによって顔を追尾する。具体的には、現在の撮像画像から検出されたＮ枚の顔画像ごとに類似する顔画像を過去のフレーム画像から特定し、Ｎ枚の各顔画像の追尾識別子を特定する。検出部１０２で、Ｎ枚の顔画像が検出された場合は、各顔画像に識別子ｉ＝０〜Ｎ−１（ただしｉは自然数）を割り振る。ｉ＝０から順番に、顔画像ごとに、前フレームにおける各顔画像とマッチング処理を行う。ｉ番目の顔画像が前フレームの顔画像と類似していた場合は、前フレームの追尾顔画像から追尾識別子を特定し、同じ追尾識別子をｉ番目の顔画像に付与する。つまり、前フレームの顔画像と位置が近く、輝度データの近い現在のフレームの顔画像を、同一人物の顔画像として特定する。さらに、現在のフレームで対応する顔画像がなかった場合には、前フレームの顔画像の輝度データと相関の高い領域を、現在のフレームにおいてある一定の範囲でサーチする。該当する領域があった場合には、その領域を同一人物の顔画像として特定する。あるいは、前のフレーム画像にｊ番目の顔画像に類似した顔画像がなかった場合、新たな追尾識別子をｊ番目の画像に付与する。これは、人物が一瞬顔を横や上下に向けたために、検出部により顔として検出されないような場合に有効であり、検出部の有する検出能力以上の顔の検出を行うことができる。 In the tracking unit 103, based on the target image and the first image captured in the previous frame, the face image included in the target image and the face image included in the first image are detected within a predetermined period of time. When is in the movable range, the face is tracked by giving the same identifier to the face image. Specifically, a face image similar to each of the N face images detected from the current captured image is specified from the past frame images, and the tracking identifiers of the N face images are specified. When the detection unit 102 detects N face images, identifiers i=0 to N−1 (where i is a natural number) are assigned to each face image. The matching process is performed for each face image in order from i=0 with each face image in the previous frame. When the i-th face image is similar to the face image of the previous frame, the tracking identifier is specified from the tracking face image of the previous frame, and the same tracking identifier is given to the i-th face image. That is, the face image of the current frame, which is close in position to the face image of the previous frame and has similar brightness data, is specified as the face image of the same person. Furthermore, when there is no corresponding face image in the current frame, an area having a high correlation with the brightness data of the face image in the previous frame is searched in a certain range in the current frame. If there is a corresponding area, that area is specified as the face image of the same person. Alternatively, if there is no face image similar to the jth face image in the previous frame image, a new tracking identifier is added to the jth image. This is effective when the detection unit does not detect the face because the person faces the sideways or up and down for a moment, and it is possible to detect a face having a detection capability higher than that of the detection unit.

このようにして追尾部１０３では、複数フレームに渡る同一人物の顔を追尾する。さらに追尾部１０３では、同一人物であるとされた顔を特定するために、一連の時系列画像に対して共通の追尾識別子を特定する。追尾部１０３で同一人物の顔とされた、一連の顔画像群に対しては、同じ追尾識別子が設定される。つまり、ある時刻における追尾識別子が割り当てられた顔画像の画像特徴をテンプレートとして、各時系列画像でマッチングする領域を探索して、マッチングした領域には同じＩＤを割りあてる。この追尾識別子は、個人特徴取得部１０４で抽出された個人特徴とともに、個人認識部１０５に入力される。 In this way, the tracking unit 103 tracks the face of the same person over a plurality of frames. Further, the tracking unit 103 specifies a common tracking identifier for a series of time-series images in order to specify faces that are the same person. The same tracking identifier is set for a series of face image groups in which the face of the same person is made by the tracking unit 103. That is, by using the image feature of the face image to which the tracking identifier is assigned at a certain time as a template, a matching area is searched for in each time series image, and the same ID is assigned to the matched area. The tracking identifier is input to the personal recognition unit 105 together with the personal characteristic extracted by the personal characteristic acquisition unit 104.

個人特徴取得部１０４では、顔画像から、個人を特定可能な顔の部位の位置関係について示す個人特徴を取得する。ここでいう特徴とは、画像からエッジ検出等で得られるような画像特徴であることが想定されている。取得方法は任意で良いが、本実施形態では以下のように個人特徴を抽出する。まず顔検出部１０２で特定した目・鼻・口など代表的な器官の位置に基づいて、両目の幅が所定の距離になるように、両目を結ぶ線分が画像上で水平になるように画像を回転・拡大縮小する。そして顔画像に特徴を抽出する矩形領域を設定する。領域の大きさは任意であるが、個人の特徴をよく表す目や口などの器官がもれなく入るように、しかし背景などは入らないように、一辺が目幅のおおよそ１．５倍程度の正方形を顔の中央に配置するとよい。続いて矩形領域内の画素値を左上から右下に向かって順に取り出し、一列につなげてベクトルとする。これを個人特徴とする。ただし、本実施形態に用いる個人特徴の抽出は、上記に示した手法に限らない。例えば、ディープニューラルネットワークを用いて個人特徴を抽出してもよい。具体的には、ＣＮＮ（ＣｏｎｖｏｌｕｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ：畳み込みニューラルネットワーク）における、畳込み層から取得できる画像特徴などを取得する。この場合、個人特徴取得部１０４では、顔画像を、人物の識別を学習させたニューラルネットワーク（学習済みモデル）に入力することによって、個人の識別に必要な個人特徴を取得する。なお、学習済みモデルとは、撮像画像から撮像画像に対応する認識結果を出力するニューラルネットワークに基づくネットワーク構造とそのパラメータである。認識は、例えば画像から個人を特定するものでもよい。学習済みモデルを用いることで、より精度良く個人の認識を行うことができる。 The personal characteristic acquisition unit 104 acquires, from the face image, a personal characteristic indicating the positional relationship of face parts that can identify an individual. The feature here is assumed to be an image feature that can be obtained from an image by edge detection or the like. The acquisition method may be arbitrary, but in the present embodiment, the individual characteristics are extracted as follows. First, based on the positions of representative organs such as eyes, nose, and mouth specified by the face detection unit 102, the width of both eyes is set to a predetermined distance, and the line segment connecting both eyes is made horizontal on the image. Rotate/scale images. Then, a rectangular area for extracting features is set in the face image. The size of the area is arbitrary, but one side is about 1.5 times the width of the eye so that all the organs such as eyes and mouth that show the characteristics of an individual can be completely filled in, but not the background. Should be placed in the center of the face. Subsequently, the pixel values in the rectangular area are taken out in order from the upper left to the lower right, and are connected in a line to form a vector. This is an individual feature. However, the extraction of the personal feature used in this embodiment is not limited to the method described above. For example, a deep neural network may be used to extract individual features. Specifically, the image feature and the like that can be acquired from the convolutional layer in CNN (Convolutional Neural Network) are acquired. In this case, the personal characteristic acquisition unit 104 acquires the personal characteristic necessary for individual identification by inputting the face image into the neural network (learned model) in which the identification of the person has been learned. The learned model is a network structure based on a neural network that outputs a recognition result corresponding to a captured image from the captured image and its parameters. The recognition may be to identify an individual from an image, for example. By using the learned model, the individual can be recognized with higher accuracy.

個人認識部１０５では、登録人物リストに登録された人物の顔の特徴と、第１の顔画像の特徴とを比較することによって、第１の顔画像が登録人物リストに登録された人物のうち第１の人物であることを認識する。さらに詳しく説明すると、まず個人特徴取得部１０４から送られてくる撮像画像の個人特徴と、登録人物リスト記憶部１０６から送られてくる登録顔画像の個人特徴とを照合し、類似度を取得する。各顔画像から抽出された個人特徴を入力個人特徴と呼ぶ。登録人物リスト（登録人物リスト）において、登録人物と対応する顔画像に保持されている個人特徴を登録個人特徴と呼ぶ。ここでは、入力個人特徴と登録個人特徴との間のＬ２距離（ユークリッド距離）の逆数を類似度とする。個人認識部１０５では、取得された類似度と、あらかじめ設定された閾値とを比較して、認識結果を出力する。あるいは、類似度が所定の閾値より大きくなる登録人物すべてを認識結果として出力する。 The personal recognition unit 105 compares the facial feature of the person registered in the registered person list with the characteristic of the first face image to determine the first facial image among the persons registered in the registered person list. Recognize that you are the first person. More specifically, first, the personal feature of the captured image sent from the personal feature acquisition unit 104 and the personal feature of the registered face image sent from the registered person list storage unit 106 are compared to acquire the similarity. .. The personal feature extracted from each face image is called an input personal feature. In the registered person list (registered person list), the personal feature held in the face image corresponding to the registered person is called a registered personal feature. Here, the reciprocal of the L2 distance (Euclidean distance) between the input individual feature and the registered individual feature is set as the similarity. The personal recognition unit 105 compares the acquired degree of similarity with a preset threshold value and outputs the recognition result. Alternatively, all registered persons whose degree of similarity is larger than a predetermined threshold value are output as the recognition result.

登録人物リスト記憶部１０６に複数の登録個人特徴が格納されている場合には、撮像画像から取得された入力個人特徴と、複数それぞれの登録個人特徴との間の類似度が取得される。この場合、登録個人特徴の数と同数の類似度が取得される。個人認識部１０５では、この複数の類似度の最大の類似度に対して、閾値処理を行う。そのうえで、最大となる類似度を取得した登録個人特徴に関連付けられた登録人物ＩＤを認識結果として出力する。閾値を超える類似度がなければ、登録人物中に該当人物なし、との結果を出力する。 When a plurality of registered individual characteristics are stored in the registered person list storage unit 106, the degree of similarity between the input individual characteristic acquired from the captured image and each of the plurality of registered individual characteristics is acquired. In this case, the same degree of similarity as the number of registered personal characteristics is acquired. The personal recognition unit 105 performs threshold processing on the maximum similarity among the plurality of similarities. Then, the registered person ID associated with the registered individual characteristic that has obtained the maximum similarity is output as a recognition result. If there is no similarity exceeding the threshold value, the result that there is no corresponding person in the registered persons is output.

登録人物リスト記憶部１０６には、登録された人物の顔の特徴（顔画像）が、あらかじめ取得され、記憶されている。登録された人物の顔の特徴を登録個人特徴とよぶ。登録個人特徴は、例えば特徴ベクトルや数値によって表現される。登録画像に対する個人特徴の取得方法は、これまでに示した手順と同じように行えばよい。つまり、登録画像を撮像画像として入力し、顔検出を行い、検出した顔から個人特徴を取得すればよい。ここでは、画像に含まれる人物を識別する学習済みモデルに基づいて顔画像から個人を特定可能な個人特徴を取得する。登録人物リスト記憶部１０６には、図２（ａ）のように、登録人物の人物ＩＤとその人物の登録個人特徴とが関連付けられて保持されている。 In the registered person list storage unit 106, the facial features (face image) of the registered persons are acquired and stored in advance. The facial features of the registered person are called registered personal features. The registered individual characteristic is represented by, for example, a characteristic vector or a numerical value. The method of acquiring the personal characteristics of the registered image may be the same as the procedure shown so far. That is, the registered image may be input as a captured image, face detection may be performed, and the personal feature may be acquired from the detected face. Here, an individual feature that can identify an individual is acquired from a face image based on a learned model that identifies a person included in the image. In the registered person list storage unit 106, as shown in FIG. 2A, the registered person's person ID and the registered individual characteristic of the person are stored in association with each other.

識別子情報記憶部１０７では、今までのフィードバック情報が反映された追尾識別子リストに基づいて、追尾識別子ごとに、個人認識部１０５で行う認識処理を行わない登録人物の人物ＩＤ（これを除外ＩＤと呼ぶ）を保持しているか判断する。図２（ｂ）には、撮像画像から検出された顔画像に基づいて生成された初期の追尾識別子リストを示す。図２（ｃ）は、図２（ｂ）の状態から何度かフィードバックを受付したことによって更新された追尾識別子リストをします。図２（ｃ）において、追尾識別子がＴｒａｃｋＩＤ＿０とＴｒａｃｋＩＤ＿３に対しては、除外ＩＤがないということを示している。また、追尾識別子がＴｒａｃｋＩＤ＿１に対しては、ＲｅｇＩＤ＿３が除外ＩＤであるということを示している。同様に、追尾識別子がＴｒａｃｋＩＤ＿２に対しては、ＲｅｇＩＤ＿０とＲｅｇＩＤ＿２とが登録人物除外ＩＤであるということを示している。登録人物リスト管理部１０７では、以上のように追尾識別子に関連付けられて、登録人物除外ＩＤが管理されている。 In the identifier information storage unit 107, based on the tracking identifier list reflecting the feedback information up to now, the person ID of the registered person who does not perform the recognition process performed by the individual recognition unit 105 for each tracking identifier (this is referred to as an exclusion ID). Call) is held. FIG. 2B shows an initial tracking identifier list generated based on the face image detected from the captured image. Fig. 2(c) shows the tracking identifier list updated by receiving several feedbacks from the state of Fig. 2(b). In FIG. 2C, there is no exclusion ID for the tracking IDs TrackID_0 and TrackID_3. Further, for the tracking identifier TrackID_1, it is indicated that RegID_3 is an exclusion ID. Similarly, for the tracking identifier TrackID_2, it is indicated that RegID_0 and RegID_2 are registered person exclusion IDs. In the registered person list management unit 107, the registered person exclusion ID is managed in association with the tracking identifier as described above.

ここで、画像入力部１０１から画像が入力された時の、個人認識部１０５、登録人物リスト記憶部１０６、追尾識別子リスト記憶部１０７の間での情報のやり取りをもう少し詳細に説明しておく。 Here, the exchange of information between the individual recognition unit 105, the registered person list storage unit 106, and the tracking identifier list storage unit 107 when an image is input from the image input unit 101 will be described in a little more detail.

画像入力部１０１から入力された画像に対して、検出部１０２、追尾部１０３、個人特徴取得部１０４において、それぞれ顔検出、顔追尾、個人特徴抽出が実行される。その結果、今回認識処理を行う顔画像に対応する追尾識別子が特定されると同時に、その顔画像に対応する入力個人特徴が取得される。個人認識部１０５には、このようにして取得された入力個人特徴と、その追尾識別子とが入力される。 Face detection, face tracking, and individual feature extraction are performed on the image input from the image input unit 101 in the detection unit 102, the tracking unit 103, and the individual feature acquisition unit 104, respectively. As a result, the tracking identifier corresponding to the face image to be subjected to the recognition process this time is specified, and at the same time, the input personal feature corresponding to the face image is acquired. The input personal feature thus obtained and the tracking identifier are input to the personal recognition unit 105.

個人認識部１０５は、入力された追尾識別子と追尾識別子リスト記憶部１０７とを参照することで、追尾識別子ｉに対応する顔画像について認証する際に除外すべき登録人物ＩＤの有無を確認する。つまり、追尾識別子リスト記憶部１０７は、示された追尾識別子に対応する除外ＩＤがあれば、その登録人物ＩＤを登録人物リスト記憶部１０６に伝える。登録人物リスト記憶部１０６は、今回の認識処理で使用する登録個人特徴を個人認識部１０５に対して出力する。その際には、登録人物リスト管理部１０７から伝えられた登録人物ＩＤに対応する登録個人特徴を除外して出力する。つまり、判定結果によって第１の顔画像は第１の人物ではないと判定された場合、個人認識部は、第１の顔画像が登録人物リストに登録された人物のうち第１の人物以外であることを認証する。 The personal recognition unit 105 refers to the input tracking identifier and the tracking identifier list storage unit 107, and confirms whether or not there is a registered person ID to be excluded when authenticating the face image corresponding to the tracking identifier i. That is, the tracking identifier list storage unit 107 notifies the registered person list storage unit 106 of the registered person ID if there is an exclusion ID corresponding to the indicated tracking identifier. The registered person list storage unit 106 outputs the registered individual characteristics used in the recognition process this time to the individual recognition unit 105. At that time, the registered individual characteristics corresponding to the registered person ID transmitted from the registered person list management unit 107 are excluded and output. That is, when it is determined that the first face image is not the first person based on the determination result, the personal recognition unit determines that the first face image is not the first person among the persons registered in the registered person list. Certify that there is.

図２を用いて、上記の手順を具体的に説明する。例えば、図２（ａ）のように、６人の登録人物（それぞれの登録人部ＩＤをＲｅｇＩＤ＿０〜ＲｅｇＩＤ＿５とする）が登録人物リスト記憶部１０６に登録されているとする。このあらかじめ登録されているすべての登録人物ＩＤの登録個人特徴をデフォルト登録個人特徴セットと呼ぶ。 The above procedure will be specifically described with reference to FIG. For example, as shown in FIG. 2A, it is assumed that six registered persons (whose respective registered person section IDs are RegID_0 to RegID_5) are registered in the registered person list storage section 106. The registered personal characteristics of all registered personal IDs registered in advance are referred to as a default registered personal characteristic set.

図２（ｃ）は、検出部１０２で検出された顔画像について、追尾部１０３が類似した顔画像に共通の追尾識別子を付与することによって生成される追尾識別子リストの一例である。このリストは、フィードバックを行う前では、図２（ｂ）のように、特定された追尾識別子のみが保持されており、除外ＩＤはすべてＮｏｎｅとして保持される。このリストは、ユーザによるフィードバックが行われることによって更新される（図２（ｃ））。例えば、個人認識部１０５に入力されてくる追尾識別子がＴｒａｃｋＩＤ＿０（或いはＴｒａｃｋＩＤ＿３）であれば、個人認識部１０５では、すべての登録個人特徴（デフォルト登録個人特徴セット）と、入力個人特徴とから類似度を取得する。或いは、個人認識部１０５に入力されてくる追尾識別子がＴｒａｃｋＩＤ＿１であれば、個人認識部１０５では、ＲｅｇＩＤ＿３の登録個人特徴を除くデフォルト登録個人特徴セットと、入力個人特徴とから類似度を取得する。或いは、個人認識部１０５に入力されてくる追尾識別子がＴｒａｃｋＩＤ＿２であれば、個人認識部１０５では、ＲｅｇＩＤ＿０とＲｅｇＩＤ＿２の登録個人特徴を除くデフォルト登録個人特徴セットと、入力個人特徴とから類似度を取得する。このようにすることで、追尾識別子に応じて、個人認識部１０５において類似度を取得すべき登録人物ＩＤを制限することができる。 FIG. 2C is an example of a tracking identifier list generated by the tracking unit 103 assigning a common tracking identifier to similar face images detected by the detection unit 102. Prior to feedback, this list holds only the specified tracking identifiers and all the exclusion IDs are held as None, as shown in FIG. 2B. This list is updated by feedback from the user (FIG. 2(c)). For example, if the tracking identifier input to the personal recognition unit 105 is TrackID_0 (or TrackID_3), the personal recognition unit 105 calculates the similarity from all registered personal features (default registered personal feature set) and input personal features. To get. Alternatively, if the tracking identifier input to the personal recognition unit 105 is TrackID_1, the personal recognition unit 105 acquires the degree of similarity from the default registered personal characteristic set excluding the registered personal characteristic of RegID_3 and the input personal characteristic. Alternatively, if the tracking identifier input to the personal recognition unit 105 is TrackID_2, the personal recognition unit 105 acquires the similarity from the default registered personal feature set excluding the registered personal features of RegID_0 and RegID_2 and the input personal feature. To do. By doing so, it is possible to limit the registered person ID for which the degree of similarity should be acquired in the personal recognition unit 105, according to the tracking identifier.

表示制御部１０８には、ｉ番目の顔画像が登録人物であることをユーザに通知するための表示を行うよう表示装置を制御する。例えば、ユーザが所有するＧＵＩに、顔認識処理の対象となった撮像画像と、個人認識部１０５が判定した判定結果の登録人物画像と、登録人物ＩＤ（登録人物の名前）が表示する。表示装置３００は、情報処理装置１００の監督者が閲覧可能なモニタや、携帯端末等を想定している。ここで監督者とは、情報処理装置１００を用いて、所定のサービスを実行する人物を指している。例えば、店舗での重要顧客（或いは要注意顧客）を検出するホワイトリスト（或いはブラックリスト）検出を実現するために情報処理装置１００を用いる場合には、店員や警備員のことを指す。 The display control unit 108 controls the display device to perform a display for notifying the user that the i-th face image is a registered person. For example, the captured image that is the target of the face recognition processing, the registered person image of the determination result determined by the individual recognition unit 105, and the registered person ID (name of the registered person) are displayed on the GUI owned by the user. The display device 300 is assumed to be a monitor that can be viewed by a supervisor of the information processing device 100, a mobile terminal, or the like. Here, the supervisor refers to a person who uses the information processing apparatus 100 to execute a predetermined service. For example, when the information processing apparatus 100 is used to realize white list (or black list) detection for detecting an important customer (or caution customer) in a store, it refers to a store clerk or a security guard.

個人認識部１０５で判定した個人認識結果は、表示制御部１０８によって、監督者（店員や警備員）に通知される。店員や警備員は、表示制御部１０８に表示された撮像画像及び登録人物画像を見比べて、所定のサービスを提供することができる。例えば、表示制御部１０８に重要顧客の人物ＩＤが通知された場合には、その好みに応じた接客等を行うことが可能となる。さらに監督者は、表示制御部１０８に表示された撮像画像及び登録人物画像を見比べて、顔認識結果が誤っている（誤認識している）と判断した場合には、フィードバック入力部１０９を介して、その旨を通知する。 The display control unit 108 notifies the supervisor (store clerk or security guard) of the personal recognition result determined by the personal recognition unit 105. The store clerk or the guard can compare the captured image and the registered person image displayed on the display control unit 108 to provide a predetermined service. For example, when the display control unit 108 is notified of the person ID of an important customer, it is possible to provide customer service or the like according to the taste. Further, when the supervisor compares the captured image and the registered person image displayed on the display control unit 108 and determines that the face recognition result is erroneous (erroneous recognition), the supervisor inputs the feedback input unit 109. And notify that effect.

フィードバック入力部（受付部）１０９は、顔画像に対する認識結果が誤りであることを判定結果としてユーザから受け付ける。例えば、監督者（ユーザ）から、ｉ番目の顔画像は登録人物ＩＤであるか否かについてのフィードバックを受け付ける。監督者のフィードバックとは、上述したとおり、監督者が今回の顔認識結果が誤っていると判断した場合に、その旨（誤認識している）を通知することを指す。 The feedback input unit (reception unit) 109 receives from the user that the recognition result for the face image is incorrect as a determination result. For example, the supervisor (user) receives feedback as to whether or not the i-th face image is the registered person ID. As described above, the feedback from the supervisor means that when the supervisor determines that the face recognition result this time is incorrect, the fact is notified (misrecognized).

フィードバック入力部１０９に入力された監督者からのフィードバックは、登録人物リスト管理部１０７に通知される。登録人物リスト管理部１０７は、監督者から今回の認識結果が誤認識であるとのフィードバックを受けると、内部で管理している追尾識別子リストの除外ＩＤについて更新を行う。つまり、今回認識処理した顔画像が割り当てられている追尾識別子の除外ＩＤに、今回誤認識した登録人物ＩＤを追加する。このようにすることで、今回と同じ追尾識別子をもつ撮像画像に対しては、以降の認識処理において、同じ誤認識を繰り返すことを避けることが可能となる。 The feedback from the supervisor input to the feedback input unit 109 is notified to the registered person list management unit 107. When the registered person list management unit 107 receives feedback from the supervisor that the recognition result this time is an erroneous recognition, the registered person list management unit 107 updates the exclusion ID of the tracking identifier list managed internally. That is, the registered person ID that is erroneously recognized this time is added to the exclusion ID of the tracking identifier to which the face image that has been recognized this time is assigned. By doing so, it becomes possible to avoid repeating the same erroneous recognition in the subsequent recognition processing for the captured image having the same tracking identifier as this time.

図７は、情報処理装置１００のハードウェア構成を示す図である。Ｈ１１はＣＰＵであり、システムバスＨ１８に接続された各種デバイスの制御を行う。Ｈ１２はＲＯＭであり、ＢＩＯＳのプログラムやブートプログラムを記憶する。Ｈ１３はＲＡＭであり、ＣＰＵであるＨ１１の主記憶装置として使用される。Ｈ１４は外部メモリであり、情報処理装置１０が処理するプログラムを格納する。入力部Ｈ１５はキーボードやマウス、ロボットコントローラーであり、情報等の入力に係る処理を行う。表示部Ｈ１６はＨ１１からの指示に従って情報処理装置１００の演算結果を表示装置に出力する。なお、表示装置は液晶表示装置やプロジェクタ、ＬＥＤインジケーターなど、種類は問わない。Ｈ１７は通信インターフェイスであり、ネットワークを介して情報通信を行うものである。通信インターフェイスはイーサネット（登録商標）でもよく、ＵＳＢやシリアル通信等種類は問わない。 FIG. 7 is a diagram showing a hardware configuration of the information processing device 100. H11 is a CPU, which controls various devices connected to the system bus H18. H12 is a ROM that stores a BIOS program and a boot program. H13 is a RAM and is used as a main storage device of the CPU H11. H14 is an external memory that stores programs processed by the information processing apparatus 10. The input unit H15 is a keyboard, a mouse, and a robot controller, and performs processing related to input of information and the like. The display unit H16 outputs the calculation result of the information processing device 100 to the display device according to the instruction from H11. The display device may be of any type such as a liquid crystal display device, a projector, an LED indicator, or the like. H17 is a communication interface for performing information communication via a network. The communication interface may be Ethernet (registered trademark), and any type such as USB or serial communication may be used.

以上が情報処理装置１００の説明である。次に、本実施形態における処理手順について説明する。図３は、本実施形態における情報処理装置１００の処理手順の一例を示すフローチャートである。以下の説明では、各工程（ステップ）について先頭にＳを付けて表記することで、工程（ステップ）の表記を省略する。ただし、情報処理システム１は必ずしもこのフローチャートで説明するすべてのステップを行わなくても良い。以下、フローチャートは、コンピュータである図７のＣＰＵ（Ｈ１１）が外部メモリ（Ｈ１４）で格納されているコンピュータプログラムを実行することにより実現されるものとする。 The above is the description of the information processing apparatus 100. Next, a processing procedure in this embodiment will be described. FIG. 3 is a flowchart showing an example of the processing procedure of the information processing apparatus 100 according to this embodiment. In the following description, each process (step) will be described by adding S to the beginning, and the description of the process (step) will be omitted. However, the information processing system 1 does not necessarily have to perform all the steps described in this flowchart. Hereinafter, it is assumed that the flowchart is realized by the CPU (H11) of FIG. 7 which is a computer executing the computer program stored in the external memory (H14).

Ｓ１では、検出部１０２が、画像入力部１０１から入力された画像に対して、顔の構成要素である目や鼻の画像特徴を抽出することによって、撮像画像に含まれる顔画像の検出を行う。ここで、入力された画像にはＮ人（ＮはＮ≧０の整数）の人物が写っていたものとする。検出された顔画像には、顔の画像特徴、領域の大きさ、画角内における位置情報が含まれる。 In S1, the detection unit 102 detects the face image included in the captured image by extracting the image features of the eyes and the nose, which are the constituent elements of the face, from the image input from the image input unit 101. .. Here, it is assumed that N persons (N is an integer of N≧0) are included in the input image. The detected face image includes face image characteristics, area size, and position information within the angle of view.

Ｓ２では、追尾部１０３が、撮像画像の前に撮像された過去のフレーム画像から検出された顔画像と類似した第１の顔画像を撮像画像から特定する。前のフレームで入力された画像から、現在のフレームで検出されたＮ枚の顔画像ごとに類似する顔画像を特定し、それぞれの顔画像の追尾識別子を特定する。 In S2, the tracking unit 103 identifies, from the captured image, a first face image similar to the face image detected from the past frame image captured before the captured image. From the image input in the previous frame, a similar face image is specified for each of the N face images detected in the current frame, and the tracking identifier of each face image is specified.

Ｓ３では、追尾部１０３が、追尾識別子ｉについて、撮像画像の直前のフレーム画像に存在した顔画像であるか否かを判断する。なお、Ｓ３以降の処理はｉ＝０からｉ＝Ｎ−１まで順番に処理していく。追尾識別子ｉが前のフレーム画像でも検出されていた場合は、Ｓ４０に進む。追尾識別子ｉが前のフレーム画像では検出されていない場合、すなわち新しく設定された追尾識別子である場合は、Ｓ４１へ進む。 In S3, the tracking unit 103 determines whether or not the tracking identifier i is the face image existing in the frame image immediately before the captured image. It should be noted that the processes from S3 onward are sequentially processed from i=0 to i=N-1. If the tracking identifier i is also detected in the previous frame image, the process proceeds to S40. If the tracking identifier i is not detected in the previous frame image, that is, if it is a newly set tracking identifier, the process proceeds to S41.

Ｓ４０では、追尾識別子リスト記憶部１０７が、認識部１０５に前のフレーム画像で生成された追尾識別子リストを参照する。追尾識別子ｉが前のフレームにおいて検出している場合、追尾識別子リスト記憶部１０７は前のフレームで生成された追尾識別子リストを保持している。認識部１０５では、この追尾識別子リストを使って追尾識別子ｉについての顔認識を実行する。Ｓ４１では、追尾識別子リスト記憶部１０７が、追尾識別子ｉについてのリストを更新する。新しく追加された追尾識別子については、フィードバックが未だないため、除外ＩＤにはＮｏｎｅとしてリストを生成する。 In S40, the tracking identifier list storage unit 107 refers to the recognition unit 105 of the tracking identifier list generated in the previous frame image. When the tracking identifier i is detected in the previous frame, the tracking identifier list storage unit 107 holds the tracking identifier list generated in the previous frame. The recognition unit 105 uses this tracking identifier list to perform face recognition on the tracking identifier i. In S41, the tracking identifier list storage unit 107 updates the list for the tracking identifier i. With respect to the newly added tracking identifier, no feedback has been given yet, so a list is generated as None for the exclusion ID.

Ｓ５では、追尾識別子リスト記憶部１０７が、今までのフィードバック情報が反映された追尾識別子リストに基づいて、追尾識別子ごとに認識処理を行わない登録人物の人物ＩＤ（これを除外ＩＤと呼ぶ）を保持しているか判断する。ｉ番目の追尾識別子について、除外ＩＤがない場合はＳ６０に進む。除外ＩＤがある場合はＳ６１に進む。 In step S5, the tracking identifier list storage unit 107 determines the person IDs of registered persons who do not perform recognition processing for each tracking identifier (referred to as exclusion IDs), based on the tracking identifier list in which the feedback information up to now is reflected. Judge whether it holds. If there is no exclusion ID for the i-th tracking identifier, the process proceeds to S60. If there is an exclusion ID, the process proceeds to S61.

Ｓ６０では、個人認識部１０５が、リストに登録された人物の顔の特徴と第１の顔画像の特徴とを比較することによって、第１の顔画像がリストに登録された人物のうち特定の人物であることを認識する。ｉ番目の顔画像から取得される個人特徴と登録人物リストにおける各登録人物の個人特徴とを照合し、それぞれの登録人物との類似度を取得する。ｉ番目の顔画像との類似度が最大になる登録人物を取得する。さらに、類似度が所定の閾値より大きくなる場合、ｉ番目の顔画像を対応する登録人物として認識する。まず、個人特徴取得部１０４が、ｉ番目の顔画像から、個人の識別に必要な個人特徴を取得する。ここでは、画像に含まれる人物を識別する学習済みモデルに基づいて顔画像から個人を特定可能な個人特徴を取得する。登録人物リスト記憶部１０６には、登録人物の顔画像から学習済みモデルに基づいて各登録人物に対応する個人特徴が、あらかじめ取得され、記憶されている。認識部１０５は、ｉ番目の顔画像から取得された個人特徴と、各登録人物に対応する個人特徴とを照合することによって、ｉ番目の顔画像と各登録人物との類似度を取得する。ｉ番目の顔画像の認識結果は、類似度が最大でかつ所定の閾値より大きい個人特徴に対応する登録人物とする。閾値を超える類似度がなければ、登録人物中に該当人物なし、との結果を出力する。 In S60, the personal recognition unit 105 compares the facial features of the persons registered in the list with the features of the first facial image, so that the first facial image is identified among the persons registered in the list. Recognize that you are a person. The personal feature acquired from the i-th face image is compared with the personal feature of each registered person in the registered person list to obtain the degree of similarity with each registered person. The registered person having the highest similarity to the i-th face image is acquired. Further, when the degree of similarity is larger than a predetermined threshold value, the i-th face image is recognized as the corresponding registered person. First, the personal characteristic acquisition unit 104 acquires a personal characteristic necessary for identifying an individual from the i-th face image. Here, an individual feature that can identify an individual is acquired from a face image based on a learned model that identifies a person included in the image. In the registered person list storage unit 106, individual characteristics corresponding to each registered person are acquired in advance from the face image of the registered person based on the learned model and stored. The recognition unit 105 obtains the degree of similarity between the i-th face image and each registered person by collating the individual characteristic acquired from the i-th face image with the individual characteristic corresponding to each registered person. The recognition result of the i-th face image is the registered person corresponding to the personal feature having the maximum similarity and larger than the predetermined threshold. If there is no similarity exceeding the threshold value, the result that there is no corresponding person in the registered persons is output.

Ｓ６１では、個人認識部１０５が、リストに登録された人物のうち除外された人物以外の顔の特徴と第１の顔画像の特徴とを比較することによって、第１の顔画像がリストのうち除外ＩＤ以外の特定の人物であることを認識する。ｉ番目の顔画像から取得される個人特徴と、除外ＩＤ以外の登録人物リストにおける各登録人物の個人特徴とを照合し、それぞれの登録人物との類似度を取得する。ｉ番目の顔画像との類似度が最大になる登録人物を取得する。さらに、類似度が所定の閾値より大きくなる場合、ｉ番目の顔画像を対応する登録人物として認識する。閾値を超える類似度がなければ、登録人物中に該当人物なし、との結果を出力する。除外ＩＤを除外した登録人物情報をもちいることによって、同じ誤認識を繰り返さないようになる。 In S61, the personal recognition unit 105 compares the features of the faces other than the excluded persons among the persons registered in the list with the features of the first face image, so that the first face image is included in the list. Recognize that the person is a specific person other than the exclusion ID. The personal feature acquired from the i-th face image is compared with the personal feature of each registered person in the registered person list other than the exclusion ID, and the degree of similarity with each registered person is acquired. The registered person having the highest similarity to the i-th face image is acquired. Further, when the degree of similarity is larger than a predetermined threshold value, the i-th face image is recognized as the corresponding registered person. If there is no similarity exceeding the threshold value, the result that there is no corresponding person in the registered persons is output. By using the registered person information excluding the exclusion ID, the same misrecognition will not be repeated.

Ｓ７では、個人認識部１０５が、ｉ番目の顔画像が登録人物に該当したか否かを判断する。Ｓ６０あるいはＳ６１の認識結果から、ｉ番目の顔画像が登録人物のいずれかであることが認識された場合は、Ｓ８に進む。Ｓ６０あるいはＳ６１の認識結果から、ｉ番目の顔画像が登録人物の誰にも該当しないことが認識された場合は、Ｓ１２に進む。 In S7, the personal recognition unit 105 determines whether or not the i-th face image corresponds to the registered person. If it is recognized from the recognition result of S60 or S61 that the i-th face image is one of the registered persons, the process proceeds to S8. When it is recognized from the recognition result of S60 or S61 that the i-th face image does not correspond to any of the registered persons, the process proceeds to S12.

Ｓ８では、表示制御部１０８が、ｉ番目の顔画像が登録された人物のうち特定人物であることをユーザに通知するための表示をするように表示装置を制御する。例えば、表示装置３００の表示画面上に認識対象である特定人物が来たことを知らせる通知を表示する。ＧＵＩによる表示の例は後述する。表示の他、音声や光でユーザに通知するような制御をしてもよい。 In S8, the display control unit 108 controls the display device to perform a display for notifying the user that the i-th face image is the specific person among the registered persons. For example, a notification is displayed on the display screen of the display device 300 to notify that the specific person who is the recognition target has arrived. An example of GUI display will be described later. In addition to the display, control may be performed such that the user is notified by voice or light.

Ｓ９では、フィードバック入力部１０９が、監督者（ユーザ）からｉ番目の顔画像が認識された登録人物であるか否かについてのフィードバックを入力する。 In S9, the feedback input unit 109 inputs feedback regarding whether or not the i-th face image is recognized by the supervisor (user).

Ｓ１０では、フィードバック入力部１０９が、フィードバックに基づいて認識結果が誤っていたか否かを判断する。監督者が今回の顔認識結果が誤っていると判断した場合は、Ｓ１０に進む。監督者が今回の顔認識結果が正解であると判断した場合は、Ｓ１２に進む。 In S10, the feedback input unit 109 determines whether the recognition result is incorrect based on the feedback. If the supervisor determines that the face recognition result this time is incorrect, the process proceeds to S10. When the supervisor determines that the face recognition result this time is correct, the process proceeds to S12.

Ｓ１１では、追尾識別子リスト記憶部１０７が、認識結果あるいはフィードバックに基づいて、ｉ番目の顔画像に対応する追尾識別子リストについて、認識された登録人物に対応する登録人物ＩＤを除外ＩＤとして更新する。追尾識別子リスト記憶部１０７が追尾識別子リストを更新した後は、Ｓ６１に進み、再びｉ番目の顔画像についての認識を行う。このようにすることによって、これ以降に処理において、追尾識別子に応じて、個人認識部１０５が類似度を取得すべき登録人物ＩＤを制限することができる。その結果、追尾識別子リストが更新された顔画像について、早く正確に顔認識を行えるようになる。追尾識別子リストを更新した後はＳ６１に戻って、更新された追尾識別リストを参照して認識処理を行う。Ｓ６１に戻って同じ顔画像について認識処理を行う場合は、前の処理で取得した類似度に基づいて閾値処理のみ行ってもよい。または、Ｓ６１に戻らず、Ｓ１２に進んで次の顔画像についての認識処理をすすめてもよい。 In S11, the tracking identifier list storage unit 107 updates the tracking identifier list corresponding to the i-th face image with the registered person ID corresponding to the recognized registered person as the exclusion ID, based on the recognition result or the feedback. After the tracking identifier list storage unit 107 updates the tracking identifier list, the process proceeds to S61, and the i-th face image is recognized again. By doing so, in the subsequent processing, it is possible to limit the registered person ID for which the personal recognition unit 105 should acquire the degree of similarity in accordance with the tracking identifier. As a result, face recognition can be performed quickly and accurately for a face image whose tracking identifier list has been updated. After updating the tracking identifier list, the process returns to S61, and the recognition processing is performed with reference to the updated tracking identification list. When the process returns to S61 and the recognition process is performed on the same face image, only the threshold process may be performed based on the similarity acquired in the previous process. Alternatively, instead of returning to S61, the process may proceed to S12 to proceed with the recognition process for the next face image.

Ｓ１２では、個人認識部１０５が、次の顔画像があるか否かを判断する。ｉをインクリメントし、Ｎと比較することによって、判断する。次の顔画像がある場合は、Ｓ５に戻る。次の顔画像がない場合（現在のフレームにおける顔画像についてすべて認識を終えた場合）は、処理を終了する。 In S12, the personal recognition unit 105 determines whether or not there is a next face image. Judgment is made by incrementing i and comparing with N. If there is a next face image, the process returns to S5. When there is no next face image (when recognition is completed for all face images in the current frame), the process ends.

上記の手順に関して少し具体的に説明する。例えば、図２（ｃ）のように登録除外情報が管理されている時に、画像が入力され、顔検出、顔追尾の結果、画像中の人物の追尾識別子は、ＴｒａｃｋＩＤ＿３だと判定された場合を考える。この人物に対して、個人認識した結果、人物ＩＤがＲｅｇＩＤ＿０だと認識されたとする。この場合、認識結果表示制御部１０８には、撮像画像と人物ＩＤの登録画像と、ＲｅｇＩＤ＿０という人物ＩＤとが表示される。 The above procedure will be described in some detail. For example, when the registration exclusion information is managed as shown in FIG. 2C, an image is input, and as a result of face detection and face tracking, it is determined that the tracking identifier of the person in the image is TrackID_3. Think As a result of personal recognition of this person, it is assumed that the person ID is recognized as RegID_0. In this case, the recognition result display control unit 108 displays the captured image, the registered image of the person ID, and the person ID of RegID_0.

監督者の目視判断で、この認識結果が誤認識であると判断されると、フィードバック入力部１０９を通じて、登録人物リスト管理部１０７にその情報が通知される。登録人物リスト管理部１０７では、追尾識別子ＴｒａｃｋＩＤ＿３の人物に対して、ＲｅｇＩＤ＿０という認識結果が誤認識だったとわかるので、追尾識別子ＴｒａｃｋＩＤ＿３の登録除外人物ＩＤにＲｅｇＩＤ＿０を追加する。今の場合（図２（ｃ））、追尾識別子ＴｒａｃｋＩＤ＿３の登録除外人物ＩＤはＮｏｎｅ（登録除外人物はない）なので、ＮｏｎｅからＲｅｇＩＤ＿０に更新される。 If the supervisor visually judges that the recognition result is an erroneous recognition, the information is notified to the registered person list management unit 107 through the feedback input unit 109. The registered person list management unit 107 recognizes that the recognition result of RegID_0 is erroneous recognition for the person of the tracking identifier TrackID_3, and thus adds RegID_0 to the registration exclusion person ID of the tracking identifier TrackID_3. In this case (FIG. 2(c)), since the registration exclusion person ID of the tracking identifier TrackID_3 is None (there is no registration exclusion person), the None is updated to the RegID_0.

図８を使って、ＧＵＩを説明する。図８は、表示制御部１０８またはフィードバック入力部１０９に関するＧＵＩの一例である。図８（ａ）は、ユーザが持つタブレット等の表示装置Ｇ１０である。表示装置Ｇ１０は、パーソナルコンピュータや、スマートフォン等の画面を有する装置であれば何でもよい。Ｇ２００は、顔認識処理を終えた画像が表示される。Ｇ２０１は、顔検出処理によって検出された顔画像を示すバウンディングボックスである。バウンディングボックスの近傍には追尾識別子（ＴｒａｃｋＩＤ＿ＸＸ）が表示されるようになっている。ここではＧ２００から４人の人物の顔画像が検出されたとする。Ｇ３００は、ユーザによるフィードバック対象となる一人の顔画像である。追尾識別子が登録人物のいずれかに該当した場合に表示される。この表示によって、監督者であるユーザは監視対象空間に登録人物が現れたことを通知される。Ｇ３０１は、追尾識別子についての認識部１０５の認識結果に対応する顔画像である。予め登録人物リストで対応付けられている登録人物のＩＤ（名前）と顔画像を表示する。その他紐づけられているカテゴリ情報を同時に表示してもよい。Ｇ４００とＧ４０１は、ユーザが認識結果についてのフィードバックを行う入力部である。ユーザは、Ｇ３００とＧ３０１を見比べて、認識結果が合っている場合はＧ４００を、認識結果が間違っている場合はＧ４０１を選択する。このＧＵＩでは、認識結果の正否のみを入力できるようになっている。登録人物が１００人以上いる等、認識すべき対象が多い場合は、このようなＵＩによって誤認識を効率的に発見できる。 The GUI will be described with reference to FIG. FIG. 8 is an example of a GUI relating to the display control unit 108 or the feedback input unit 109. FIG. 8A shows a display device G10 such as a tablet held by the user. The display device G10 may be any device having a screen such as a personal computer or a smartphone. In G200, the image for which the face recognition processing has been completed is displayed. G201 is a bounding box indicating a face image detected by the face detection processing. A tracking identifier (TrackID_XX) is displayed near the bounding box. Here, it is assumed that the face images of four persons are detected from G200. G300 is a face image of one person who is a feedback target of the user. Displayed when the tracking identifier is one of the registered persons. By this display, the user who is the supervisor is notified that the registered person has appeared in the monitored space. G301 is a face image corresponding to the recognition result of the recognition unit 105 regarding the tracking identifier. The registered person's ID (name) and face image associated in advance in the registered person list are displayed. Other related category information may be displayed at the same time. G400 and G401 are input units through which the user gives feedback on the recognition result. The user compares G300 and G301 and selects G400 if the recognition results match, and selects G401 if the recognition results are incorrect. In this GUI, only the correctness of the recognition result can be input. When there are many objects to be recognized, such as 100 or more registered persons, such a UI can efficiently detect misrecognition.

一方で、登録人物が１０人程度と少ない場合は、図８（ｂ）のようなＧＵＩも考えられる。Ｇ３０２には、認識結果である登録人物の顔画像と、類似度が示されている。Ｇ５０１は、登録人物のリストを表示する。ユーザはリストに含まれる登録人物の顔画像を選択して、Ｇ３００とＧ３０２を見比べる。登録人物リストの人数が10人程度であれば、ユーザは認識対象の人物を記憶できる可能性が高いため、正解をフィードバックしてもよい。例えば、リストの中に該当人物がいる場合は、正しい認識結果が誰であるのかをフィードバックする。その結果、個人認識部１０５は、撮像画像の後に撮像された第２のフレーム画像から検出された第２の顔画像であって、第１の顔画像と同一の識別子が付与された第２の顔画像については、第１の人物とは異なる第２の人物であることを認識する。ユーザが正しい認識結果をフィードバックすることによって、ある追尾識別子についての認識結果が確定する。これによって、効率的に登録された人物の認識を行うことが出来る。 On the other hand, when the number of registered people is small, such as about 10, a GUI as shown in FIG. 8B can be considered. G302 shows the facial image of the registered person, which is the recognition result, and the degree of similarity. G501 displays a list of registered persons. The user selects the face image of the registered person included in the list and compares G300 and G302. If the number of people in the registered person list is about 10, the user is likely to be able to memorize the person to be recognized, and the correct answer may be fed back. For example, when the person in question is in the list, the person who gives the correct recognition result is fed back. As a result, the personal recognition unit 105 is the second face image detected from the second frame image captured after the captured image, and the second face image having the same identifier as the first face image is added. It is recognized that the face image is a second person different from the first person. When the user feeds back the correct recognition result, the recognition result for a certain tracking identifier is fixed. As a result, the registered person can be efficiently recognized.

以上詳細に説明したように、顔認識システムを本実施形態のように構成することで、一度誤認識してしまった場合に、その後の同一人物に対する認識処理においては、同じ誤認識を繰り返さないようにする、ということが実現できる。 As described in detail above, by configuring the face recognition system as in the present embodiment, even if an erroneous recognition is made once, the same erroneous recognition is not repeated in the subsequent recognition process for the same person. Can be realized.

＜実施形態２＞
本実施形態では、実施形態１で示した監督者からのフィードバックを拡張した場合の例を示す。実施形態１で示した監督者からのフィードバックは、認識結果が誤認識か否かだけであった。つまり、撮像画像の人物と、認識結果で示された人物とが同じ人物か否かを目視し判断して、その判断結果をフィードバックするというものであった。本実施形態では、上記に加えて、撮像画像の人物と、認識結果で示された人物との属性が同じか否かという情報もフィーバックする。ここでいう属性とは、性別や年齢、人種等、その人に備わっていて、見かけで判断できるような特徴を指す。属性も不一致であった場合には、除外ＩＤとして、その属性を持つＩＤすべてを登録できる。 <Embodiment 2>
In this embodiment, an example in which the feedback from the supervisor shown in the first embodiment is expanded is shown. The feedback from the supervisor shown in the first embodiment is only whether or not the recognition result is a false recognition. That is, the person in the captured image and the person indicated by the recognition result are visually judged to determine whether the person is the same person, and the determination result is fed back. In the present embodiment, in addition to the above, information regarding whether or not the attributes of the person in the captured image and the person indicated by the recognition result are the same is also fed back. The term "attribute" as used herein refers to a characteristic such as gender, age, race, etc. that a person has and that can be judged by appearance. If the attributes do not match, all IDs having the attribute can be registered as exclusion IDs.

本実施形態では誤認識を減らすために、フィードバックで得られた属性情報も利用する。そのために、実施形態１で説明した登録人物リスト記憶部１０６や追尾識別子リスト管理部１０７でも属性情報を管理するように変更する。以下では本実施形態の顔認識システムを詳細に説明する。 In this embodiment, attribute information obtained by feedback is also used in order to reduce erroneous recognition. Therefore, the registered person list storage unit 106 and the tracking identifier list management unit 107 described in the first embodiment are also changed to manage the attribute information. The face recognition system of this embodiment will be described in detail below.

図４は、本実施形態における顔認識システム４００の構成を示すブロック図である。図４において、図１と同じ意味を持つ部品には図１と同じ番号を付与し、その説明は省略する。 FIG. 4 is a block diagram showing the configuration of the face recognition system 400 in this embodiment. 4, parts having the same meanings as in FIG. 1 are given the same numbers as in FIG. 1, and description thereof will be omitted.

登録人物リスト記憶部４０６には、実施形態１の場合と同じように、登録人物の人物ＩＤとその人物の個人特徴とが関連付けられて保持されている。さらに本実施形態では、登録人物の属性も関連付けて管理されている。本実施形態では、登録人物属性として性別を用いた場合について説明を行う。なお、属性は、人が予め判定した結果を入力してもよい。または、顔画像から属性を推定するニューラルネットワーク等を用いた結果を属性として保持してもよい。 In the registered person list storage unit 406, as in the case of the first embodiment, the person ID of the registered person and the individual characteristic of the person are held in association with each other. Further, in the present embodiment, the attributes of registered persons are also managed in association with each other. In this embodiment, a case where sex is used as the registered person attribute will be described. The attribute may be the result of a person's determination in advance. Alternatively, the result obtained by using a neural network or the like for estimating the attribute from the face image may be held as the attribute.

図５（ａ）に登録人物リスト記憶部４０６における登録人物ＩＤと登録個人特徴と属性との管理の状態を表で図示している。図５（ａ）において、登録人物ＩＤがＲｅｇＩＤ＿０の人物の登録個人特徴はＲｅｇＦｅａｔｕｒｅ＿０で、その性別は男性であるということを示している。他の登録人物ＩＤに関しても同様である。 FIG. 5A is a table showing the management state of registered person IDs, registered individual characteristics, and attributes in the registered person list storage unit 406. In FIG. 5A, the registered individual characteristic of the person whose registered person ID is RegID_0 is RegFeature_0, and the sex is male. The same applies to other registered person IDs.

追尾識別子リスト記憶部４０７では、追尾識別子ごとに除外ＩＤが管理されている。さらに本実施形態では、追尾識別子ごとに、個人認識部１０５で行う認識処理に用いない人物属性（これを登録除外属性と呼ぶ）が管理されている。つまり本実施形態では、登録除外情報として、除外ＩＤと登録除外属性とが管理されている。 In the tracking identifier list storage unit 407, exclusion IDs are managed for each tracking identifier. Further, in the present embodiment, a person attribute (which is referred to as a registration exclusion attribute) that is not used in the recognition processing performed by the individual recognition unit 105 is managed for each tracking identifier. That is, in the present embodiment, the exclusion ID and the registration exclusion attribute are managed as the registration exclusion information.

図５（ｂ）には、追尾識別子リスト記憶部４０７における追尾識別子と除外ＩＤと除外属性の管理の状態を表で図示している。図５（ｂ）において、追尾識別子がＴｒａｃｋＩＤ＿０とＴｒａｃｋＩＤ＿２に対しては、登録除外属性がないということを示している。また、追尾識別子がＴｒａｃｋＩＤ＿１に対しては、女性という属性が登録除外属性であるということを示している。同様に、追尾識別子がＴｒａｃｋＩＤ＿３に対しては、男性という属性が登録除外属性であるということを示している。 FIG. 5B shows a table of the management states of the tracking identifier, the exclusion ID, and the exclusion attribute in the tracking identifier list storage unit 407. In FIG. 5B, it is shown that there is no registration exclusion attribute for the tracking IDs TrackID_0 and TrackID_2. Further, for the tracking identifier TrackID_1, it is indicated that the attribute of female is a registration exclusion attribute. Similarly, for the tracking identifier TrackID_3, it indicates that the attribute of male is a registration exclusion attribute.

追尾識別子リスト記憶部４０７では、以上のように追尾識別子に関連付けられて、登録人物除外ＩＤと登録除外属性が管理されている。 In the tracking identifier list storage unit 407, the registered person exclusion ID and the registration exclusion attribute are managed in association with the tracking identifier as described above.

表示制御部４０８は、顔認識処理の対象となった撮像画像と、個人認識部１０５が判定した判定結果の登録人物画像と、登録人物ＩＤ（登録人物の名前）とを表示装置に表示するよう制御する。さらに本実施形態では、個人認識部１０５が判定した判定結果の登録人物に対応した登録人物属性も表示するよう制御する。例えば、図９に示すようなＧＵＩを表示する。 The display control unit 408 displays the captured image subjected to the face recognition processing, the registered person image of the determination result determined by the individual recognition unit 105, and the registered person ID (name of the registered person) on the display device. Control. Further, in the present embodiment, control is performed so that the registered person attribute corresponding to the registered person of the determination result determined by the individual recognition unit 105 is also displayed. For example, a GUI as shown in FIG. 9 is displayed.

フィードバック入力部４０９には、実施形態１の場合と同じように、監督者が今回の顔認識結果が誤っていると判断した場合に、その旨（誤認識している）のフィードバックが入力される。さらに本実施形態では、監督者は、表示制御部４０８に表示された登録人物属性と撮像画像を見比べて、人物属性が一致しているか否かを判断し、一致していないと判断した場合には、フィードバック入力部４０９を介して、その旨（属性不一致）を通知する。 As in the case of the first embodiment, when the supervisor determines that the face recognition result of this time is incorrect, the feedback input unit 409 inputs feedback to that effect (misrecognizing). .. Further, in the present embodiment, the supervisor compares the registered person attribute displayed on the display control unit 408 with the captured image to determine whether or not the person attributes match, and when it determines that they do not match. Notifies that (attribute mismatch) via the feedback input unit 409.

人物属性の性質から、属性不一致の場合には必ず誤認識となるので、本実施形態におけるフィードバックは、「属性不一致でかつ誤認識」という場合と、「属性は一致しているが誤認識」という場合の二通りになる。フィードバック入力部４０９に入力された監督者からのフィードバックは、追尾識別子リスト記憶部４０７に通知される。 Because of the nature of the person attributes, if there is an attribute disagreement, there is always an erroneous recognition. Therefore, the feedback in this embodiment is that "attribute disagreement and erroneous recognition" and "attribute matching but erroneous recognition". There are two cases. Feedback from the supervisor input to the feedback input unit 409 is notified to the tracking identifier list storage unit 407.

追尾識別子リスト記憶部４０７は、監督者から今回の認識結果が「属性不一致でかつ誤認識」であるとのフィードバックを受けると、内部で管理している登録除外情報の更新を行う。この場合、今回認識処理した顔画像が割り当てられている追尾識別子の登録除外人物ＩＤに、今回誤認識した登録人物ＩＤを追加する。さらに同じ追尾識別子の登録除外属性に、今回誤認識した登録人物の人物属性を追加する。「属性は一致しているが誤認識」であるとのフィードバックを受けた場合は、登録除外人物ＩＤの更新のみを行う。或いは、本実施形態のように、人物属性のクラス分類に重複がなく漏れもないような場合には、一致している属性以外の属性を、同じ追尾識別子の登録除外属性に追加してもよい。 When the tracking identifier list storage unit 407 receives feedback from the supervisor that the current recognition result is “attribute mismatch and erroneous recognition”, the registration exclusion information managed internally is updated. In this case, the registered person ID that has been erroneously recognized this time is added to the registration excluded person ID of the tracking identifier to which the face image that has been recognized this time is assigned. Further, the personal attribute of the registered person who is erroneously recognized this time is added to the registration exclusion attribute of the same tracking identifier. When the feedback that "the attributes are the same but the recognition is incorrect" is received, only the registration exclusion person ID is updated. Alternatively, as in the present embodiment, when there is no overlap and no omission in the classification of person attributes, attributes other than the matching attributes may be added to the registration exclusion attributes of the same tracking identifier. ..

例えば、撮像画像の人物が女性であるのに、認識結果として人物属性が男性の登録人物と誤認識した場合、監督者からのフィードバックは「属性不一致でかつ誤認識」となる。その場合には、登録除外人物ＩＤに登録人物ＩＤを追加すると同時に、登録除外属性に、今回誤認識した登録人物の人物属性（男性）を追加する。 For example, when the person in the captured image is a woman, but the recognition result is that the person attribute is erroneously recognized as a registered person having a male attribute, the feedback from the supervisor is “attribute mismatch and erroneous recognition”. In that case, the registration person ID is added to the registration exclusion person ID, and at the same time, the person attribute (male) of the registration person who is erroneously recognized this time is added to the registration exclusion attribute.

また例えば、撮像画像の人物が女性である場合に、認識結果として人物属性が女性の登録人物と誤認識した場合、監督者からのフィードバックは「属性一致しているが誤認識」となる。その場合には、登録除外人物ＩＤに登録人物ＩＤを追加すると同時に、登録除外属性に、今回誤認識した登録人物の人物属性（女性）以外の人物属性（つまり男性）を追加する。 Further, for example, when the person in the captured image is a woman and the recognition result is erroneously recognized as a registered person having a woman attribute, the feedback from the supervisor is “mismatched attribute, but misrecognized”. In that case, at the same time as adding the registered person ID to the registered exclusion person ID, a person attribute (that is, male) other than the person attribute (female) of the registered person who is erroneously recognized this time is added to the registered exclusion attribute.

ここで、画像入力部１０１から画像が入力された時の、個人認識部１０５、登録人物リスト記憶部４０６、追尾識別子リスト記憶部４０７の間での情報のやり取りをもう少し詳細に説明しておく。 Here, the exchange of information between the individual recognition unit 105, the registered person list storage unit 406, and the tracking identifier list storage unit 407 when an image is input from the image input unit 101 will be described in a little more detail.

画像入力部１０１から入力された画像に対して、顔検出部１０２、追尾部１０３、個人特徴取得部１０４において、それぞれ顔検出、顔追尾、個人特徴抽出が実行される。その結果、今回認識処理を行う顔画像に対応する追尾識別子が特定されると同時に、その顔画像に対応する入力個人特徴が取得される。個人認識部１０５には、このようにして取得された入力個人特徴と、その追尾識別子とが入力される。 Face detection, face tracking, and personal characteristic acquisition unit 104 perform face detection, face tracking, and personal characteristic extraction on the image input from image input unit 101, respectively. As a result, the tracking identifier corresponding to the face image to be subjected to the recognition process this time is specified, and at the same time, the input personal feature corresponding to the face image is acquired. The input personal feature thus obtained and the tracking identifier are input to the personal recognition unit 105.

個人認識部１０５は、入力された追尾識別子を追尾識別子リスト記憶部４０７に示すことで、今回の認識処理から除外すべき登録人物ＩＤの有無、および登録除外属性の有無を確認する。つまり、追尾識別子リスト記憶部４０７は、示された追尾識別子に対応する登録除外人物ＩＤがあれば、その登録人物ＩＤを登録人物リスト記憶部１０６に伝える。同様に、示された追尾識別子に対応する登録除外属性があれば、その人物属性を登録人物リスト記憶部４０６に伝える。 The personal recognition unit 105 confirms the presence/absence of the registered person ID to be excluded from the recognition process this time and the presence/absence of the registration exclusion attribute by displaying the input tracking identifier in the tracking identifier list storage unit 407. That is, if there is a registration exclusion person ID corresponding to the indicated tracking identifier, the tracking identifier list storage unit 407 notifies the registration person list storage unit 106 of the registration person ID. Similarly, if there is a registration exclusion attribute corresponding to the indicated tracking identifier, that person attribute is transmitted to the registered person list storage unit 406.

登録人物リスト記憶部４０６は、今回の認識処理で使用する登録個人特徴を個人認識部１０５に対して出力する。その際には、追尾識別子リスト記憶部４０７から伝えられた登録人物ＩＤに対応する登録個人特徴と、同じく追尾識別子リスト記憶部４０７から伝えられた登録人物属性に対応する登録個人特徴とを除外して出力する。 The registered person list storage unit 406 outputs the registered individual characteristics used in the recognition process this time to the individual recognition unit 105. In that case, the registered individual characteristic corresponding to the registered person ID transmitted from the tracking identifier list storage unit 407 and the registered individual characteristic corresponding to the registered person attribute similarly transmitted from the tracking identifier list storage unit 407 are excluded. Output.

図５、図６を用いて、上記の手順を具体的に説明する。例えば図５（ａ）に示すように、６人の登録人物（それぞれの登録人部ＩＤをＲｅｇＩＤ＿０〜ＲｅｇＩＤ＿５とする）が登録人物リスト記憶部４０６に登録されているとする。このあらかじめ登録されているすべての登録人物ＩＤの登録個人特徴をデフォルト登録個人特徴セットと呼ぶ。 The above procedure will be specifically described with reference to FIGS. 5 and 6. For example, as shown in FIG. 5A, it is assumed that six registered persons (whose respective registered person section IDs are RegID_0 to RegID_5) are registered in the registered person list storage section 406. The registered personal characteristics of all registered personal IDs registered in advance are referred to as a default registered personal characteristic set.

このとき、個人認識部１０５に入力されてくる追尾識別子がＴｒａｃｋＩＤ＿０であれば、図５（ｂ）から登録人物除外ＩＤも登録除外属性もＮｏｎｅであるので、除外する登録個人特徴はない。従って、個人認識部１０５では、すべての登録個人特徴（デフォルト登録個人特徴セット）と、入力個人特徴とから類似度を取得する。 At this time, if the tracking identifier input to the individual recognition unit 105 is TrackID_0, both the registered person exclusion ID and the registration exclusion attribute are None from FIG. 5B, so there is no registered individual characteristic to be excluded. Therefore, the individual recognizing unit 105 acquires the degree of similarity from all the registered individual characteristics (default registered individual characteristic set) and the input individual characteristics.

個人認識部１０５に入力されてくる追尾識別子がＴｒａｃｋＩＤ＿１であれば、図５（ｂ）から登録人物除外ＩＤはＲｅｇＩＤ＿３で、登録除外属性は女性であることがわかる。その場合、図５（ｂ）からＲｅｇＩＤ＿３、ＲｅｇＩＤ＿１、ＲｅｇＩＤ＿２、ＲｅｇＩＤ＿４に対応する登録個人特徴を除外することがわかる。したがって、個人認識部１０５では、ＲｅｇＩＤ＿３、ＲｅｇＩＤ＿１、ＲｅｇＩＤ＿２、ＲｅｇＩＤ＿４の登録個人特徴を除くデフォルト登録個人特徴セットと、入力個人特徴とから類似度を取得する。 If the tracking identifier input to the personal identification unit 105 is TrackID_1, it can be seen from FIG. 5B that the registered person exclusion ID is RegID_3 and the registration exclusion attribute is female. In that case, it can be seen from FIG. 5B that registered personal characteristics corresponding to RegID_3, RegID_1, RegID_2, and RegID_4 are excluded. Therefore, the individual recognizing unit 105 acquires the degree of similarity from the default registered personal feature set excluding the registered personal features of RegID_3, RegID_1, RegID_2, and RegID_4, and the input personal feature.

個人認識部１０５に入力されてくる追尾識別子がＴｒａｃｋＩＤ＿２であれば、図５（ｂ）から登録人物除外ＩＤはＲｅｇＩＤ＿０、ＲｅｇＩＤ＿２で、登録除外属性はないことがわかる。したがって、個人認識部１０５では、ＲｅｇＩＤ＿０、ＲｅｇＩＤ＿２の登録個人特徴を除くデフォルト登録個人特徴セットと、入力個人特徴とから類似度を取得する。 If the tracking identifier input to the personal identification unit 105 is TrackID_2, it can be seen from FIG. 5B that the registered person exclusion IDs are RegID_0 and RegID_2, and there is no registration exclusion attribute. Therefore, the individual recognizing unit 105 obtains the degree of similarity from the input personal feature and the default registered personal feature set excluding the registered personal features of RegID_0 and RegID_2.

個人認識部１０５に入力されてくる追尾識別子がＴｒａｃｋＩＤ＿３であれば、図５（ｂ）から登録人物除外ＩＤはなしで、登録除外属性は男性であることがわかる。その場合、図５（ｂ）からＲｅｇＩＤ＿０、ＲｅｇＩＤ＿３、ＲｅｇＩＤ＿５に対応する登録個人特徴を除外することがわかる。したがって、個人認識部１０５では、ＲｅｇＩＤ＿０、ＲｅｇＩＤ＿３、ＲｅｇＩＤ＿５の登録個人特徴を除くデフォルト登録個人特徴セットと、入力個人特徴とから類似度を取得する。このようにすることで、追尾識別子に応じて、個人認識部１０５において類似度を取得すべき登録人物ＩＤを制限することができる。 If the tracking identifier input to the personal identification unit 105 is TrackID_3, it can be seen from FIG. 5B that there is no registered person exclusion ID and the registration exclusion attribute is male. In that case, it can be seen from FIG. 5B that registered personal characteristics corresponding to RegID_0, RegID_3, and RegID_5 are excluded. Therefore, the individual recognizing unit 105 acquires the degree of similarity from the default registered individual characteristic set excluding the registered individual characteristics of RegID_0, RegID_3, and RegID_5 and the input individual characteristic. By doing so, it is possible to limit the registered person ID for which the degree of similarity should be acquired in the personal recognition unit 105, according to the tracking identifier.

図１０を使って、本実施形態における処理の流れを説明する。図１０は情報処理装置が実行する処理を説明するフローチャートである。なお、実施形態１と同様の処理については説明を省略し、差異がある部分を中心に説明する。個人識別部１０５は、撮像画像から得た顔画像における人物の属性情報を更に認識する。フィードバック入力部１０９は、属性情報が合っているか否かについての判定結果を更に受け付ける。判定結果によって顔画像の属性情報が誤りであると判定された場合、個人認識部１０５は、顔画像の属性情報と異なる属性情報を持つ登録人物と比較することによって顔画像を認証する。 The flow of processing in this embodiment will be described with reference to FIG. FIG. 10 is a flowchart illustrating the processing executed by the information processing device. The description of the same processing as that of the first embodiment will be omitted, and the description will focus on the differences. The personal identification unit 105 further recognizes the attribute information of the person in the face image obtained from the captured image. The feedback input unit 109 further receives a determination result as to whether or not the attribute information matches. When it is determined from the determination result that the attribute information of the face image is incorrect, the personal recognition unit 105 authenticates the face image by comparing it with a registered person having attribute information different from the attribute information of the face image.

Ｓ１では、検出部１０２が、注目画像から顔の特徴を抽出することによって顔画像を検出する。Ｓ２では、追尾部１０３が、注目画像の前に撮像された過去のフレーム画像から検出された顔画像と類似した顔を追尾する。Ｓ３では、追尾部１０３が、追尾識別子ｉについて、前のフレームに存在した顔画像であるか否かを判断する。なお、Ｓ３以降の処理はｉ＝０からｉ＝Ｎ−１まで順番に処理していく。追尾識別子ｉが前フレームでも検出されていた場合は、Ｓ４０に進む。追尾識別子ｉが前フレームでは検出されていない場合、すなわち新しく設定された追尾識別子である場合は、Ｓ４１へ進む。Ｓ４０では、追尾識別子リスト記憶部１０７が、個人認識部１０５に前のフレーム画像で生成された追尾識別子リストを参照する。追尾識別子ｉが前のフレームにおいて検出している場合、追尾識別子リスト記憶部１０７は前のフレームで生成された追尾識別子リストを保持している。本実施形態においては、除外属性情報についても保持している。そのため、前のフレーム画像におけるフィードバックによって顔画像ｉの属性情報が限定されていた場合、認識対象となる登録人物ＩＤが限定されたリストになっている。個人認識部１０５では、この追尾識別子リストを使って追尾識別子ｉについての顔認識を実行する。Ｓ４１では、追尾識別子リスト記憶部１０７が、追尾識別子ｉについてのリストを更新する。新しく追加された追尾識別子については、フィードバックが未だないため、除外ＩＤにはＮｏｎｅとしてリストを生成する。 In S1, the detection unit 102 detects the face image by extracting the features of the face from the target image. In S2, the tracking unit 103 tracks a face similar to the face image detected from the past frame image captured before the target image. In S3, the tracking unit 103 determines whether or not the tracking identifier i is a face image existing in the previous frame. It should be noted that the processes from S3 onward are sequentially processed from i=0 to i=N-1. If the tracking identifier i is also detected in the previous frame, the process proceeds to S40. If the tracking identifier i is not detected in the previous frame, that is, if it is a newly set tracking identifier, the process proceeds to S41. In S40, the tracking identifier list storage unit 107 refers to the individual recognition unit 105 to the tracking identifier list generated in the previous frame image. When the tracking identifier i is detected in the previous frame, the tracking identifier list storage unit 107 holds the tracking identifier list generated in the previous frame. In this embodiment, the exclusion attribute information is also held. Therefore, when the attribute information of the face image i is limited by the feedback in the previous frame image, the registered person ID to be recognized is a limited list. The personal recognition unit 105 uses this tracking identifier list to perform face recognition on the tracking identifier i. In S41, the tracking identifier list storage unit 107 updates the list for the tracking identifier i. With respect to the newly added tracking identifier, no feedback has been given yet, so a list is generated as None for the exclusion ID.

Ｓ５では、追尾識別子リスト記憶部１０７が、今までのフィードバック情報が反映された追尾識別子リストに基づいて、追尾識別子ごとに認識処理を行わない登録人物の人物ＩＤ（これを除外ＩＤと呼ぶ）を保持しているか判断する。ｉ番目の追尾識別子について、除外ＩＤがない場合はＳ６０に進む。除外ＩＤがある場合はＳ６１に進む。Ｓ６０では、個人認識部１０５が、リストに登録された人物の顔の特徴と第１の顔画像の特徴とを比較することによって、第１の顔画像がリストに登録された人物のうち第１の人物であることを認識する。Ｓ６１では、個人認識部１０５が、リストに登録された人物のうち除外された人物以外の顔の特徴と第１の顔画像の特徴とを比較することによって、第１の顔画像がリストのうち除外ＩＤ以外の人物であることを認識する。ここでは、追尾識別子リスト記憶部４０７から伝えられた登録人物ＩＤに対応する登録個人特徴と、同じく追尾識別子リスト記憶部４０７から伝えられた登録人物属性に対応する登録個人特徴とを除外したリストを参照する。 In step S5, the tracking identifier list storage unit 107 determines the person IDs of registered persons who do not perform recognition processing for each tracking identifier (referred to as exclusion IDs), based on the tracking identifier list in which the feedback information up to now is reflected. Judge whether it holds. If there is no exclusion ID for the i-th tracking identifier, the process proceeds to S60. If there is an exclusion ID, the process proceeds to S61. In S60, the personal recognition unit 105 compares the facial feature of the person registered in the list with the characteristic of the first facial image to determine whether the first facial image is the first person among the persons registered in the list. Recognize that the person is In S61, the personal recognition unit 105 compares the features of the faces other than the excluded persons among the persons registered in the list with the features of the first face image, so that the first face image is included in the list. Recognize that the person is a person other than the exclusion ID. Here, a list excluding the registered individual characteristic corresponding to the registered person ID transmitted from the tracking identifier list storage unit 407 and the registered individual characteristic corresponding to the registered person attribute also transmitted from the tracking identifier list storage unit 407 is set. refer.

Ｓ８では、表示制御部１０８が、ｉ番目の顔画像が登録人物であることをユーザに通知するための表示を行うよう制御する。Ｓ９では、フィードバック入力部１０９が、監督者（ユーザ）からｉ番目の顔画像が認識された登録人物であるか否かについてのフィードバックを受け付ける。Ｓ１０では、フィードバック入力部１０９が、フィードバックに基づいて認識結果が誤っていたか否かを判断する。監督者が今回の顔認識結果が誤っていると判断した場合は、Ｓ２０に進む。監督者が今回の顔認識結果が正解であると判断した場合は、Ｓ１２に進む。 In S8, the display control unit 108 controls to perform a display for notifying the user that the i-th face image is a registered person. In S9, the feedback input unit 109 receives feedback from the supervisor (user) as to whether or not the i-th face image is the registered person recognized. In S10, the feedback input unit 109 determines whether the recognition result is incorrect based on the feedback. If the supervisor determines that the face recognition result this time is incorrect, the process proceeds to S20. When the supervisor determines that the face recognition result this time is correct, the process proceeds to S12.

Ｓ２０では、フィードバック入力部１０９が、監督者（ユーザ）からｉ番目の顔画像の属性情報に関する認識結果が整合しているかフィードバックを受け付ける。属性情報が整合している場合は、Ｓ２１に進む。属性情報が整合しておらず、誤りである場合は、Ｓ２２に進む。Ｓ２１では、追尾識別子リスト記憶部４０７が、認識結果あるいはフィードバックに基づいて、ｉ番目の顔画像に対応する追尾識別子リストについて、認識された登録人物に対応する登録人物ＩＤを除外ＩＤとして更新する。Ｓ２２では、追尾識別子リスト記憶部４０７が、顔画像ｉの登録除外人物ＩＤに、今回誤認識した登録人物ＩＤを追加し、さらに同じ追尾識別子の登録除外属性に、今回誤認識した登録人物の人物属性を追加する。Ｓ１２では、個人認識部１０５が、次の顔画像があるか否かを判断する。ｉをインクリメントし、Ｎと比較することによって、判断する。次の顔画像がある場合は、Ｓ５に戻る。次の顔画像がない場合（現在のフレームにおける顔画像についてすべて認識を終えた場合）は、処理を終了する。 In S20, the feedback input unit 109 receives feedback from the supervisor (user) whether the recognition result regarding the attribute information of the i-th face image is consistent. If the attribute information matches, the process proceeds to S21. If the attribute information does not match and is incorrect, the process proceeds to S22. In S21, the tracking identifier list storage unit 407 updates the tracking identifier list corresponding to the i-th face image with the registered person ID corresponding to the recognized registered person as the exclusion ID, based on the recognition result or the feedback. In S22, the tracking identifier list storage unit 407 adds the registered person ID that is erroneously recognized this time to the registration excluded person ID of the face image i, and further adds the registered person ID that is erroneously recognized this time to the registration exclusion attribute of the same tracking identifier. Add attributes. In S12, the personal recognition unit 105 determines whether or not there is a next face image. Judgment is made by incrementing i and comparing with N. If there is a next face image, the process returns to S5. When there is no next face image (when recognition is completed for all face images in the current frame), the process ends.

図９を使って、本実施形態におけるＧＵＩについて説明する。図８と符号が共通する構成は説明を省略する。Ｇ３０３は、Ｇ２００に表示されている追尾識別子のうち、登録人物であると認証された追尾識別子に対応する顔画像である。ここでは、Ｇ２００から特定された顔画像とＩＤに加えて、属性情報である性別も表示される。Ｇ３０４は、Ｇ３０３に表示されている追尾識別子についての認証結果を示す画像である。ここでは、登録人物リストから顔画像とＩＤに加えて、属性情報である性別も表示される。Ｇ５０１は、追尾識別子リストである。ここでは、ＴｒａｃｋＩＤ＿０が選択されており、ＴｒａｃｋＩＤ＿０に対応する顔画像の認識結果である属性情報を表示する。ここでは、ＴｒａｃｋＩＤ＿０は女性であることが推定されている。Ｇ４００、Ｇ４０４とＧ４０５は、ユーザによって判定されたフィードバックを入力するＵＩである。Ｇ４００は、ユーザによって認識結果が正しいと判断された場合に認識結果が正解であるというフィードバックを受け付ける。Ｇ４０４は、属性情報（ここでは性別）が誤っており、なおかつ登録人物ではなかった場合に、「属性不一致でかつ誤認識」であるというフィードバックを受け付ける。Ｇ４０５は、属性情報（ここでは性別）はあっているが、登録人物ではなかった場合に、「属性は一致しているが誤認識」であるというフィードバックを受け付ける。 The GUI in this embodiment will be described with reference to FIG. The description of the components having the same reference numerals as those in FIG. G303 is a face image corresponding to the tracking identifier authenticated as the registered person among the tracking identifiers displayed in G200. Here, in addition to the face image and ID specified from G200, the gender as attribute information is also displayed. G304 is an image showing the authentication result for the tracking identifier displayed in G303. Here, in addition to the face image and the ID from the registered person list, the gender which is the attribute information is also displayed. G501 is a tracking identifier list. Here, TrackID_0 is selected, and the attribute information which is the recognition result of the face image corresponding to TrackID_0 is displayed. Here, it is estimated that TrackID_0 is a female. G400, G404 and G405 are UIs for inputting feedback determined by the user. The G400 receives feedback that the recognition result is correct when the user determines that the recognition result is correct. When the attribute information (here, gender) is incorrect and the person is not a registered person, G404 receives feedback that the attribute is “mismatched and misrecognized”. When the attribute information (here, gender) is present but the person is not a registered person, G405 receives feedback that "attributes match but is erroneously recognized".

実施形態１では、一度誤認識してしまった場合に、先ほど誤認識してしまった登録人物を、もとの登録人物セットから除外したうえで、その後の同一人物に対する認識処理を行うようにすることで同じ誤認識を繰り返さないようにした。 In the first embodiment, when the misrecognized person is once erroneously recognized, the registered person who is erroneously recognized earlier is excluded from the original registered person set, and thereafter, the recognition process for the same person is performed. This prevents the same misrecognition from being repeated.

実施形態２では、誤認識に加えて属性も不一致だった場合には、その属性情報も利用して、もとの登録人物セットから除外する。上記の効果に加えて、その後の同一人物に対する認識処理において、別の誤認識が発生する可能性も低減させることが可能となっている。 In the second embodiment, when the attributes do not match in addition to the erroneous recognition, the attribute information is also used to exclude them from the original registered person set. In addition to the above effects, it is possible to reduce the possibility that another erroneous recognition will occur in the subsequent recognition process for the same person.

＜実施形態３＞
実施形態１、２では、単一のカメラで顔の追尾を行う場合の例について記したが、本発明は、複数のカメラに渡って顔の追尾を行う場合でも適用可能である。本実施形態では、複数のカメラに渡って顔の追尾を行う場合の例を示す。 <Embodiment 3>
In the first and second embodiments, an example in which the face tracking is performed by a single camera has been described, but the present invention is also applicable when the face tracking is performed over a plurality of cameras. In the present embodiment, an example in which face tracking is performed across a plurality of cameras is shown.

図６は、情報処理システム７００（情報処理システム１００及び情報処理システム７０４から成る）及び、カメラ間人物統合部７０１から構成されるシステムの構成を示す図である。図６において、図１と同じ意味を持つ部品には図１と同じ番号を付与し、その説明は省略する。 FIG. 6 is a diagram showing a configuration of a system including an information processing system 700 (including the information processing system 100 and the information processing system 704) and an inter-camera person integration unit 701. 6, parts having the same meanings as in FIG. 1 are given the same numbers as in FIG. 1 and their description is omitted.

図６は、情報処理システム７００の構成を示すブロック図である。また、符号７０４は同じ機能構成を有する別の情報処理システムを示している。 FIG. 6 is a block diagram showing the configuration of the information processing system 700. Reference numeral 704 indicates another information processing system having the same functional configuration.

符号７０１は、カメラ間人物統合部である。カメラ間人物統合部７０１では、複数の情報処理システム７００で撮像された顔画像の対応関係を調査し、同一人物の判定を行う。つまり、ある情報処理システム７００で撮像された顔と、別の情報処理システム７００で撮像された顔が同一人物のものか否かを判断する。さらに判断結果を、情報処理システム７００から送られてくる追尾識別子と関連付けて管理する。また、情報処理システム７００から送られてくる登録除外情報も併せて管理を行う。カメラ間人物統合部７０１の詳細を以下に説明する。 Reference numeral 701 is an inter-camera person integration unit. The inter-camera person integration unit 701 investigates the correspondence between face images captured by a plurality of information processing systems 700 and determines the same person. That is, it is determined whether the face imaged by a certain information processing system 700 and the face imaged by another information processing system 700 belong to the same person. Further, the judgment result is managed in association with the tracking identifier sent from the information processing system 700. The registration exclusion information sent from the information processing system 700 is also managed. Details of the inter-camera person integration unit 701 will be described below.

カメラ間追尾部７０２では、複数の情報処理システム７００から送られてくる顔検出画像を用いて、複数カメラ間での顔画像との対応関係をとる機能を有する。複数カメラ間での顔画像の対応をとる手法は特に問わない。実施形態１で説明したように、顔画像の輝度データや色情報を用いて対応をとる手法を使用すればよい。また、顔画像だけでは対応関係が十分判定できない場合には、情報処理システム７００から全身画像を送ってもらうようにし、服の色や持ち物の有無から対応を決定してもよい。情報処理システム７００では、顔検出結果を基に全身領域を推定すればよい。 The inter-camera tracking unit 702 has a function of using face detection images sent from a plurality of information processing systems 700 to establish a correspondence relationship with face images between a plurality of cameras. The method of associating face images between a plurality of cameras is not particularly limited. As described in the first embodiment, the method of taking correspondence using the brightness data and color information of the face image may be used. If the correspondence cannot be sufficiently determined only by the face image, the information processing system 700 may send a full-body image, and the correspondence may be determined based on the color of the clothes and the presence or absence of belongings. In the information processing system 700, the whole body area may be estimated based on the face detection result.

複数の情報処理システム７００から送られてきた複数の顔が同一人物のものであるとされた場合、その人物を特定するために、追尾識別子（カメラ間追尾で設定される追尾識別子をカメラ間追尾識別子と呼ぶ）を設定する。 When it is determined that a plurality of faces sent from a plurality of information processing systems 700 belong to the same person, in order to identify the person, a tracking identifier (a tracking identifier set in the tracking between cameras is used as a tracking identifier between cameras). (Called an identifier).

カメラ間追尾部７０２は、情報処理システム７００から送られてきた追尾識別子（同一と判定された両者の追尾識別子）と、カメラ間追尾識別子とをカメラ間記憶部７０３に送る。 The inter-camera tracking unit 702 sends the inter-camera storage unit 703 the inter-camera tracking identifier and the tracking identifier (both tracking identifiers determined to be the same) sent from the information processing system 700.

カメラ間記憶部７０３では、情報処理システム７００から送られてくる、追尾識別子と登録除外人物ＩＤの情報を、さらにカメラ間追尾識別子と関連させて管理する。 The inter-camera storage unit 703 manages the information on the tracking identifier and the registration-excluded person ID sent from the information processing system 700 in association with the inter-camera tracking identifier.

図６には、カメラ間記憶部７０３におけるカメラ間追尾識別子、追尾識別子（情報処理システム７００のものと７０４のもの）、登録人物除外ＩＤの管理の状態を表で図示している。図６において、カメラ間追尾識別子がｉｎｔｅｇＴｒａｃｋＩＤ＿０は、情報処理システム７００では追尾識別子ＴｒａｃｋＩＤ＿０として管理されている人物であり、登録人物除外ＩＤがないということを示している。また、カメラ間追尾識別子がｉｎｔｅｇＴｒａｃｋＩＤ＿１は、情報処理システム７００では追尾識別子ＴｒａｃｋＩＤ＿１として管理されている人物であり、登録人物除外ＩＤがＲｅｇＩＤ＿３であるということを示している。また、カメラ間追尾識別子がｉｎｔｅｇＴｒａｃｋＩＤ＿２は、情報処理システム７００では追尾識別子ＴｒａｃｋＩＤ＿２として管理され、顔認識システム７０４では追尾識別子ＴｒａｃｋＩＤ＿１として管理されている人物である。登録人物除外ＩＤがＲｅｇＩＤ＿０であるということを示している。また、カメラ間追尾識別子がｉｎｔｅｇＴｒａｃｋＩＤ＿３は、顔認識システム７０４では追尾識別子ＴｒａｃｋＩＤ＿２として管理されている人物であり、登録人物除外ＩＤがないということを示している。 FIG. 6 is a table showing a management state of inter-camera tracking identifiers, tracking identifiers (of the information processing systems 700 and 704), and registered person exclusion IDs in the inter-camera storage unit 703. In FIG. 6, the inter-camera tracking identifier integTrackID_0 indicates that the person is managed as the tracking identifier TrackID_0 in the information processing system 700, and there is no registered person exclusion ID. Further, the inter-camera tracking identifier integTrackID_1 indicates that the information processing system 700 is a person managed as the tracking identifier TrackID_1, and the registered person exclusion ID is RegID_3. Further, the inter-camera tracking identifier integTrackID_2 is a person who is managed as the tracking identifier TrackID_2 in the information processing system 700 and as the tracking identifier TrackID_1 in the face recognition system 704. This indicates that the registered person exclusion ID is RegID_0. In addition, the inter-camera tracking identifier intTrackID_3 indicates that the person is managed as the tracking identifier TrackID_2 in the face recognition system 704 and that there is no registered person exclusion ID.

カメラ間記憶部７０３では、以上のようにカメラ間追尾識別子及び追尾識別子に関連付けられて、登録人物除外ＩＤが管理されている。カメラ間記憶部７０３では、情報処理システム７００（或いは７０４）の追尾識別子リスト記憶部７０７で行われる登録除外情報の更新に連動して、登録除外ＩＤが更新される。さらに、更新した情報を顔認識システム７０４（或いは７００）の登録除外情報管理部７０７に通知する。通知を受けた登録除外情報管理部７０７では、その情報に基づいて、登録除外情報を更新する。 In the inter-camera storage unit 703, the registered person exclusion ID is managed in association with the inter-camera tracking identifier and the tracking identifier as described above. In the inter-camera storage unit 703, the registration exclusion ID is updated in association with the update of the registration exclusion information performed in the tracking identifier list storage unit 707 of the information processing system 700 (or 704). Further, the updated information is notified to the registration exclusion information management unit 707 of the face recognition system 704 (or 700). Upon receipt of the notification, the registration exclusion information management unit 707 updates the registration exclusion information based on the information.

追尾識別子リスト記憶部７０７は符号１０７で説明した動作に加えて、上述したようにカメラ間記憶部７０３と登録除外情報をやり取りし、自分が管理している登録除外情報の更新を行う。 In addition to the operation described with reference numeral 107, the tracking identifier list storage unit 707 exchanges the registration exclusion information with the inter-camera storage unit 703 as described above, and updates the registration exclusion information managed by itself.

以上のように構成することで、複数のカメラに渡って顔の追尾を行う場合でも本発明を適用可能となる。 With the above configuration, the present invention can be applied even when face tracking is performed across a plurality of cameras.

＜その他の実施形態＞
これまでの実施形態では、顔認識が正認識であった場合に関しては言及していなかった。正認識であった場合には、その情報を利用して、監督者がサービス（例えば顧客の好みに応じた接客等）を開始することになるが、そのサービスの形態によっては、同じ追尾識別子で表現される同一人物に対しては、以降の顔認識は必要ない場合もありうる。従って、ある追尾識別子に対して、正認識が行われれば、以降の顔認識をスキップすることも可能である。 <Other embodiments>
In the above embodiments, no reference was made to the case where face recognition was correct recognition. In the case of correct recognition, the supervisor uses the information to start a service (for example, customer service according to the taste of the customer), but depending on the form of the service, the same tracking identifier may be used. Subsequent face recognition may not be necessary for the same person represented. Therefore, if the correct recognition is performed on a certain tracking identifier, it is possible to skip the subsequent face recognition.

これまで説明した実施形態では、監督者からのフィードバックは誤認識した場合に限定されていたが、正認識の場合にもフィードバックをしてもらえば、正認識の判断が可能となる。或いは、誤認識とのフィードバックがなければ、それをもって正認識と判断してもよい。 In the embodiments described so far, the feedback from the supervisor is limited to the case of erroneous recognition. However, in the case of correct recognition as well, if feedback is given, it is possible to determine correct recognition. Alternatively, if there is no feedback as false recognition, it may be determined as correct recognition.

本発明は、以下の処理を実行することによっても実現される。即ち、上述した実施形態の機能を実現するソフトウェア（プログラム）を、データ通信用のネットワーク又は各種記憶媒体を介してシステム或いは装置に供給する。そして、そのシステム或いは装置のコンピュータ（またはＣＰＵやＭＰＵ等）がプログラムを読み出して実行する処理である。また、そのプログラムをコンピュータが読み取り可能な記録媒体に記録して提供してもよい。 The present invention is also realized by executing the following processing. That is, the software (program) that realizes the functions of the above-described embodiments is supplied to the system or device via the network for data communication or various storage media. Then, the computer (or CPU, MPU, etc.) of the system or apparatus reads and executes the program. Alternatively, the program may be recorded in a computer-readable recording medium and provided.

１００情報処理装置
１０１画像入力部
１０２検出部
１０３追尾部
１０４個人特徴取得部
１０５個人認識部
１０６登録人物リスト記憶部
１０７追尾識別子リスト記憶部
１０８表示制御部
１０９フィードバック入力部 100 Information processing device 101 Image input unit 102 Detection unit 103 Tracking unit 104 Personal feature acquisition unit 105 Personal recognition unit 106 Registered person list storage unit 107 Tracking identifier list storage unit 108 Display control unit 109 Feedback input unit

Claims

An information processing apparatus for recognizing a person included in a captured image using a list in which face images of a person to be recognized are registered,
A face image included in the image of interest, and a recognition unit that recognizes the face image as a specific person based on the list,
An input unit for inputting that the recognition result of the face image by the recognition unit is incorrect,
The information processing apparatus, wherein the recognition means controls, in the recognition after receiving the input, not to recognize the input face image as an error in the recognition result as the specific person.

The captured image is a part of a moving image,
When the image captured after the target image includes a face image corresponding to the face image included in the target image, the recognizing unit captures the target image after the target image based on the input. The information processing apparatus according to claim 1, wherein a face image included in an image is not recognized as the specific person.

The recognition means recognizes the face image as the specific person based on the similarity between the characteristics of the face image of the person who is the recognition target and the characteristics of the face image included in the attention image. The information processing device according to claim 1 or 2.

The information processing apparatus according to claim 3, wherein the recognition unit recognizes the face image included in the image of interest as the specific person when the similarity is included in a predetermined range.

The information processing apparatus according to claim 3, wherein the features include a positional relationship between parts of a face and a size of the face image.

The information processing apparatus according to claim 1, wherein the input unit is input by a user that the recognition result of the face image by the recognition unit is incorrect.

Based on at least the attention image and the first image captured in a frame prior to the attention image, the face image included in the attention image and the face image included in the first image have a predetermined value. 7. The tracking device according to claim 2, further comprising a tracking unit for tracking the face by assigning the same identifier to the face image when the person is within a movable range within a time period. The information processing device described.

8. The tracking unit tracks a face by assigning the identifier to each of the face images when there are a plurality of face images included in the target image, according to any one of claims 2 to 7. The information processing device described.

The recognizing unit is a person registered in the list acquired from the learned model and individual features that can identify an individual for the face included in the image of interest based on a learned model that recognizes a face included in the image. 9. The personal feature of each of the images is compared to recognize that the face image included in the image of interest is a person registered in the list, according to any one of claims 1 to 8. The information processing device described.

10. The display control means for controlling the display means to display a recognition result indicating that the face is the specific person by the recognition means, further comprising: Information processing equipment.

The display control means controls to display a face to be recognized included in the target image and a face image of a specific person recognized by the recognition means side by side on the display means. 10. The information processing device according to 10.

The display control unit controls the display unit that further displays the degree of similarity between the face image of the recognition target included in the target image and the face image of the specific person recognized by the recognition unit. The information processing device according to claim 10 or 11.

The input means accepts that the face image is registered as a determination result by the user and is a second person different from the specific person.
The recognizing unit is the second face image included in the second image captured after the target image, and the second face image similar to the face image included in the target image is The information processing apparatus according to any one of claims 2 to 12, characterized in that it is recognized as a second person.

The identification means further recognizes attribute information of a person corresponding to a face image included in the attention image,
The input means further receives a determination result as to whether or not the attribute information matches,
When the determination result determines that the attribute information of the face image included in the attention image is incorrect, the recognition unit does not recognize the face image as a person having the error attribute information. The information processing apparatus according to any one of claims 1 to 13, wherein the information processing apparatus is configured as described above.

The method according to any one of claims 1 to 14, further comprising update means for updating the list so as to exclude the specific person from the recognition target for the face image included in the attention image based on the input. Information processing equipment.

A program for causing a computer to function as each unit included in the information processing apparatus according to claim 1.

An information processing method for recognizing a person included in a captured image using a list in which face images of a person to be recognized are registered,
A face image included in the image of interest, and a recognition step of recognizing the face image included in the image of interest as a specific person based on the list;
An input step of inputting that the recognition result of the face image by the recognition means is incorrect,
In the recognition step, in the recognition after receiving the input, the information processing method is controlled so as not to recognize the input face image as an error in the recognition result as the specific person.