JP4380308B2

JP4380308B2 - Face area detection method and apparatus, program, and computer-readable storage medium storing face area detection program

Info

Publication number: JP4380308B2
Application number: JP2003398025A
Authority: JP
Inventors: 西尾　　卓; 行信谷口
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2003-11-27
Filing date: 2003-11-27
Publication date: 2009-12-09
Anticipated expiration: 2023-11-27
Also published as: JP2005157911A

Description

本発明は、顔領域検出方法及び装置及びプログラム及び顔領域検出プログラムを格納したコンピュータ読み取り可能な記憶媒体に係り、特に、画像中から存在数やサイズが未知の顔領域を検出するための顔領域検出方法及び装置及びプログラム及び顔領域検出プログラムを格納したコンピュータ読み取り可能な記憶媒体に関する。 The present invention relates to a face area detection method and apparatus, a program, and a computer-readable storage medium storing the face area detection program, and in particular, a face area for detecting a face area whose number or size is unknown from an image. The present invention relates to a detection method and apparatus, a program, and a computer-readable storage medium storing a face area detection program.

従来、顔領域のサイズが未知の場合、画像中から顔領域を検出する方法のひとつとして、画像の解像度を変化させながら処理を行う方法が用いられている。例えば、パターン検出によく用いられるテンプレートマッチングがある（例えば、非特許文献１参照）。 Conventionally, when the size of the face area is unknown, a method of performing processing while changing the resolution of the image is used as one of methods for detecting the face area from the image. For example, there is template matching often used for pattern detection (see, for example, Non-Patent Document 1).

テンプレートマッチングとは、図１１のように、検出しようとする対象を表すテンプレートｔ（ｘ，ｙ）を画像ｆ（ｘ，ｙ）中の点（ｉ，ｊ）にその中心が重なるようにし、ｔ（ｘ，ｙ）とそれと重なる画像の部分パターンとの類似度を測り、その値を点（ｉ，ｊ）に対象が存在する確からしさとする方法である。 In template matching, as shown in FIG. 11, a template t (x, y) representing an object to be detected is overlapped at a point (i, j) in an image f (x, y), and t This is a method of measuring the degree of similarity between (x, y) and the partial pattern of the overlapping image, and using the value as the probability that the target exists at the point (i, j).

顔検出の場合にはテンプレートに顔画像を用いて、上記の操作を画像の全ての点に対して施し、類似度が最大となる点を求めたり、類似度がある閾値を超える点を求めたりすればよい。 In the case of face detection, using the face image as a template, the above operation is performed on all the points of the image to find the point where the similarity is the maximum, or the point where the similarity exceeds a certain threshold do it.

点（ｕ，ｖ）における類似度ｍ（ｕ，ｖ）の算出式としては、例えば式１のようなものがある。 As a formula for calculating the similarity m (u, v) at the point (u, v), for example, there is a formula like Formula 1.

上記の式１では、（ｕ，ｖ）はテンプレートの中央ではなく、左上端の位置を表している。但し、この方法では、テンプレートと同サイズの顔領域しか検出できないため、顔領域のサイズが未知の場合には、図１２のように画像の解像度を変化させながら、各解像度におけるテンプレートとの類似度を求める必要がある。

In Equation 1 above, (u, v) represents the position of the upper left corner, not the center of the template. However, in this method, only a face area of the same size as the template can be detected. Therefore, when the face area size is unknown, the degree of similarity with the template at each resolution is changed while changing the resolution of the image as shown in FIG. It is necessary to ask.

動画像から顔領域を検出する際には、上記手順をフレームごとに適用し領域を求める。
コンピュータ画像処理入門ｐｐ．１４９−１５２田村秀行総研出版 When a face area is detected from a moving image, the above procedure is applied for each frame to obtain the area.
Introduction to computer image processing pp. 149-152 Hideyuki Tamura Research Institute Publishing

しかしながら、上記の従来の方法を用いると、顔領域のサイズが未知の場合、画像の解像度を逐次変化させながら顔領域の検出を行うため、処理に膨大な時間がかかってしまう。 However, when the above-described conventional method is used, if the size of the face area is unknown, the face area is detected while sequentially changing the resolution of the image, so that the process takes a long time.

また、動画像の場合には、処理対象とする画像の枚数が激増するため、処理時間が大幅に増えてしまうという問題がある。 In the case of a moving image, the number of images to be processed increases drastically, and there is a problem that the processing time increases significantly.

この問題の解決方法として、顔領域の存在数が既知の場合である場合は、全ての顔領域が見つかった時点で処理を打ち切ることで高速化が可能であるが、顔領域が複数あり、存在数が未知の場合にはこの方法は利用できない。 As a solution to this problem, if the number of existing face areas is known, processing can be speeded up by stopping processing when all face areas are found, but there are multiple face areas. This method cannot be used when the number is unknown.

本発明は、上記の点に鑑みなされたもので、複数の異なるサイズの顔領域が存在する画像中から顔領域の検出時間短縮を図ることが可能な顔領域検出方法及び装置及びプログラム及び顔領域検出プログラムを格納したコンピュータ読み取り可能な記憶媒体を提供することを目的とする。 The present invention has been made in view of the above points, and a face area detection method, apparatus, program, and face area capable of reducing a face area detection time from an image in which a plurality of face areas of different sizes exist. It is an object to provide a computer-readable storage medium storing a detection program.

図１は、本発明の原理を説明するための図である。 FIG. 1 is a diagram for explaining the principle of the present invention.

本発明（請求項１）は、静止画像及び動画像を含む画像から顔領域を検出する装置における顔領域検出方法において、
入力画像を読み込み、記憶手段に格納する画像読込過程（ステップ１）と、
記憶手段から１つの入力画像を読み出して、該入力画像から縦横それぞれの辺の画素数を減らした変換画像で、顔が判別できる程度で可能な限り低い解像度から該入力画像の解像度までの変換画像を、該入力画像の縦横比を変化させないで該入力画像を変換することにより予め定めた枚数を作成し、記憶手段に格納する解像度変換過程（ステップ２）と、
解像度変換過程により変換済みの変換画像のうち、解像度が一番低い変換画像から順に解像度を上げた変換画像を用いて顔領域検出処理を行う顔領域検出過程（ステップ３）と、
顔領域検出過程により初めて顔領域を見つけてから解像度を順に上げて顔領域検出処理を行うときの処理回数が所定の回数に達した場合に、顔画像検出過程の処理を終了させる終了判定過程（ステップ４）と、を行う。 The present invention (Claim 1) is a face area detection method in an apparatus for detecting a face area from an image including a still image and a moving image.
An image reading process (step 1) for reading an input image and storing it in a storage means;
A converted image obtained by reading one input image from the storage unit and reducing the number of pixels on each side in the vertical and horizontal directions from the input image and converting the image from the lowest possible resolution to the resolution of the input image to the extent that a face can be identified. A resolution conversion step (step 2) in which a predetermined number is created by converting the input image without changing the aspect ratio of the input image and stored in the storage means;
A face area detection process (step 3) in which face area detection processing is performed using a converted image in which the resolution is increased in order from the converted image having the lowest resolution among the converted images converted by the resolution conversion process;
The resolution from the finding for the first time face region if the number of times of processing reaches a predetermined number of times when performing the face area detecting process by raising the order by the face area detection process, end determining process to terminate the process of the face image detection process (Step 4).

本発明（請求項２）は、動画像から顔領域を検出する装置における顔領域検出方法において、
入力動画像を読み込み、記憶手段に格納する画像読込過程と、
記憶手段から前記入力動画像の最初のフレームを読み出して基準フレームとして記憶手段に格納する基準フレーム選択過程と、
基準フレームについて顔領域検出を行う顔検出過程と、
基準フレームに続くフレームを比較対象フレームとする比較対象フレームとする比較対象フレーム選択過程と、
基準フレームと比較対象フレーム間の差分値を求める差分算出過程と、
差分値算出過程で求められた前記差分値が一定の閾値以下であれば、前記基準フレームで検出された顔領域を比較対象フレームにおける顔領域と決定し、新たな比較対象フレームを次フレームとして差分算出過程を繰り返し、該差分値が一定の閾値より大きければ、該比較対象フレームを新たな基準フレームとして顔領域検出過程以降を繰り返す終了過程と、を行い、
顔検出過程は、
基準フレームの画像から縦横それぞれの辺の画素数を減らした変換画像で、顔が判別できる程度で可能な限り低い解像度から該基準フレームの画像の解像度までの変換画像を、該基準フレームの画像の縦横比を変化させないで基準フレームの画像を変換することにより予め定めた枚数を作成し、記憶手段に格納する解像度変換過程と、
解像度変換過程により変換済みの変換画像のうち、解像度が一番低い変換画像から順に解像度を上げた変換画像を用いて顔領域検出処理を行う顔領域検出過程と、
顔領域検出過程により初めて顔領域を見つけてから解像度を順に上げて顔領域検出処理を行うときの処理回数が所定の回数に達した場合に、該顔領域検出過程の処理を終了させる終了判定過程と、を行う。 The present invention (Claim 2) is a face area detection method in an apparatus for detecting a face area from a moving image.
An image reading process of reading an input moving image and storing it in a storage means;
A reference frame selection process of reading the first frame of the input moving image from the storage means and storing it in the storage means as a reference frame;
A face detection process for detecting a face area for a reference frame;
A comparison target frame selection process in which a frame subsequent to the reference frame is set as a comparison target frame;
A difference calculation process for obtaining a difference value between the reference frame and the comparison target frame;
If the difference value obtained in the difference value calculation process is equal to or smaller than a certain threshold value, the face area detected in the reference frame is determined as the face area in the comparison target frame, and the new comparison target frame is used as the next frame. Repeating the calculation process, and if the difference value is greater than a certain threshold value, performing a face area detection process and subsequent end processes using the comparison target frame as a new reference frame, and
The face detection process
A converted image obtained by reducing the number of pixels on each side in the vertical and horizontal directions from the reference frame image, and converting the converted image from the lowest possible resolution to the resolution of the reference frame image to the extent that the face can be identified. A resolution conversion process of creating a predetermined number of images by converting the image of the reference frame without changing the aspect ratio, and storing it in the storage means;
A face area detection process in which face area detection processing is performed using a converted image whose resolution is increased in order from the converted image having the lowest resolution among the converted images converted by the resolution conversion process;
In cases where processing count has reached the predetermined number of times when the finding for the first time the face region by the face region detection process by increasing the resolution in order perform the face area detecting process, end determination to end the processing of said pigment region detection process Process.

また、本発明（請求項３）は、差分値算出過程において、差分値として、各画素値の差分値を求める。 Further, according to the present invention (claim 3) , a difference value of each pixel value is obtained as a difference value in the difference value calculation process.

また、本発明（請求項４）は、差分値算出過程において、差分値を、画像全体の色ヒストグラムの差分を用いて算出する。 According to the present invention (Claim 4), in the difference value calculation process, the difference value is calculated using the difference of the color histogram of the entire image.

図２は、本発明の原理構成図である。 FIG. 2 is a principle configuration diagram of the present invention.

本発明（請求項５）は、静止画像及び動画像を含む画像から顔領域を検出する顔領域検出装置であって、
入力画像を読み込み、記憶手段１２に格納する画像読込手段１１と、
記憶手段１２から１つの入力画像を読み出して、該入力画像から縦横それぞれの辺の画素数を減らした変換画像で、顔が判別できる程度で可能な限り低い解像度から該入力画像の解像度までの変換画像を、該入力画像の縦横比を変化させないで該入力画像を変換することにより予め定めた枚数を作成し、記憶手段１２に格納する解像度変換手段１３と、
解像度変換手段１３により変換済みの変換画像のうち、解像度が一番低い変換画像から順に解像度を上げた変換画像を用いて顔領域検出処理を行う顔領域検出手段１４と、
顔領域検出手段１４により初めて顔領域を見つけてから解像度を順に上げて顔領域検出処理を行うときの処理回数が所定の回数に達した場合に、顔画像検出過程の処理を終了させる終了判定手段１６と、を有する。 The present invention (Claim 5) is a face area detection device for detecting a face area from an image including a still image and a moving image,
An image reading means 11 for reading an input image and storing it in the storage means 12;
A conversion image obtained by reading one input image from the storage unit 12 and reducing the number of pixels on each side in the vertical and horizontal directions from the input image, and converting from the lowest possible resolution to the resolution of the input image so that a face can be identified. A resolution conversion unit 13 for creating a predetermined number of images by converting the input image without changing the aspect ratio of the input image and storing the image in the storage unit 12;
A face area detection means 14 for performing face area detection processing using a converted image whose resolution is increased in order from the converted image having the lowest resolution among the converted images converted by the resolution conversion means 13;
In cases where processing count has reached the predetermined number of times when the finding for the first time the face region by the face region detection unit 14 to increase the resolution in order perform the face area detecting process, end determination to end the process of the face image detection process Means 16.

本発明（請求項６）は、動画像から顔領域を検出する顔領域検出装置であって、
入力動画像を読み込み、記憶手段に格納する画像読込手段と、
記憶手段から入力動画像の最初のフレームを読み出して基準フレームとして記憶手段に格納する基準フレーム選択手段と、
基準フレームについて顔領域検出を行う顔検出手段と、
基準フレームに続くフレーム比較対象フレームとする比較対象フレーム選択手段と、
基準フレームと比較対象フレーム間の差分値を求める差分値算出手段と、
差分値算出手段で求められた差分値が一定の閾値以下であれば、基準フレームで検出された顔領域を比較対象フレームにおける顔領域と決定する顔領域決定手段と、
顔領域決定手段の後に新たな比較対象フレームを次フレームとして、比較対象フレーム選択手段、差分値算出手段、顔領域決定手段の処理を繰り返す第１の繰り返し手段と、
差分値算出手段で求められた差分値が一定の閾値より大きければ、基準フレーム選択手段、顔検出手段、差分値算出手段、顔領域決定手段の処理を繰り返す第２の繰り返し手段と、を有し、
顔検出手段は、
基準フレームの画像から縦横それぞれの辺の画素数を減らした変換画像で、顔が判別できる程度で可能な限り低い解像度から該基準フレームの画像の解像度までの変換画像を、該基準フレームの画像の縦横比を変化させないで該基準フレームの画像を変換することにより予め定めた枚数を作成し、記憶手段に格納する解像度変換手段と、
解像度変換手段により変換済みの変換画像のうち、解像度が一番低い変換画像から順に解像度を上げた変換画像を用いて顔領域検出処理を行う顔領域検出手段と、
顔領域検出手段により初めて顔領域を見つけてから解像度を順に上げて顔領域検出処理を行うときの処理回数が所定の回数に達した場合に、該顔領域検出手段の処理を終了させる終了判定手段と、を有する。

The present invention (Claim 6) is a face area detection device for detecting a face area from a moving image,
Image reading means for reading an input moving image and storing it in a storage means;
Reference frame selection means for reading out the first frame of the input moving image from the storage means and storing it in the storage means as a reference frame;
Face detection means for performing face area detection on the reference frame;
A comparison target frame selecting means to be a frame comparison target frame following the reference frame;
A difference value calculating means for obtaining a difference value between the reference frame and the comparison target frame;
If the difference value obtained by the difference value calculation means is equal to or less than a certain threshold value, a face area determination means for determining the face area detected in the reference frame as the face area in the comparison target frame;
A first repeating unit that repeats the processing of the comparison target frame selection unit, the difference value calculation unit, and the face region determination unit, with the new comparison target frame as the next frame after the face region determination unit;
A second repeater that repeats the process of the reference frame selector, the face detector, the difference value calculator, and the face area determiner if the difference value obtained by the difference value calculator is greater than a certain threshold value. ,
Face detection means
A converted image obtained by reducing the number of pixels on each side in the vertical and horizontal directions from the reference frame image, and converting the converted image from the lowest possible resolution to the resolution of the reference frame image to the extent that the face can be identified. A resolution conversion unit that creates a predetermined number of images by converting the image of the reference frame without changing the aspect ratio, and stores the resolution in the storage unit;
A face area detection means for performing face area detection processing using a converted image in which the resolution is increased in order from the converted image having the lowest resolution among the converted images converted by the resolution conversion means;
The first time if the number of times of processing reaches a predetermined number of times when the find a face region by increasing the resolution in order perform the face area detecting process, end determination to end the processing of said pigment region detection means by the face region detecting means Means.

また、本発明（請求項７）の差分値算出手段は、差分値として、各画素値の差分値を求める。 Further, the difference value calculation means of the present invention (claim 7) obtains a difference value of each pixel value as the difference value.

また、本発明（請求項８）の差分値算出手段は、差分値を、画像全体の色ヒストグラムの差分を用いて算出する。 Further, the difference value calculation means of the present invention (claim 8) calculates the difference value using the difference of the color histogram of the entire image.

本発明（請求項９）は、請求項５乃至８のいずれか１項に記載の顔領域検出装置を構成する各手段としてコンピュータを機能させるための顔領域検出プログラムである。 The present invention (Claim 9) is a face area detection program for causing a computer to function as each means constituting the face area detection apparatus according to any one of Claims 5 to 8 .

本発明（請求項１０）は、請求項９に記載の顔領域検出プログラムを格納した、コンピュータ読み取り可能な記憶媒体である。 The present invention (Claim 10) is a computer-readable storage medium storing the face area detection program according to Claim 9 .

上記のように、本発明では、静止画像または動画像における撮影上の特徴に着目した。撮影上の特徴とは、通常、人物を撮影する場合、顔がよく見えるように撮影するケースが多いということである。 As described above, in the present invention, attention is paid to the shooting characteristics of a still image or a moving image. The feature on photography is that, in general, when photographing a person, there are many cases where the face is well photographed.

また、画像中で着目している人物、何らかの意味を持つ人物を中心に大きく写るように撮影するケースが多い。 Also, there are many cases where a photograph is taken so that a person who is paying attention in the image or a person having some meaning is photographed largely.

そのため大きく写っている顔の重要性が高く、それに比べて小さな顔を無視してもよい場合がある。例えば、主要な登場人物の一覧を作る場合である。 For this reason, a face that is large is highly important, and a small face may be ignored in comparison. For example, when making a list of main characters.

そこで、本発明では、低解像度の画像から高解像度の順に顔領域検出処理を行い、処理結果が予め定めた閾値以下であれば顔領域であると決定し、初めて顔領域を見つけた時点での画像の解像度より予め定めた割合だけ高解像度の画像について処理を行った時点で処理を打ち切るものである。 Therefore, in the present invention, face area detection processing is performed in the order of high resolution from low resolution images, and if the processing result is equal to or smaller than a predetermined threshold, it is determined that the face area is detected, The processing is terminated when processing is performed on an image having a higher resolution than the resolution of the image by a predetermined ratio.

また、顔領域の検出を行う際に、その画像中に人物が存在するか否かを調べることが目的で、正確な位置は必要でない場合がある。例えば、動画像中の特定の人物登場シーンを探すような場合である。 Further, when detecting a face area, there is a case where an accurate position is not necessary for the purpose of checking whether or not a person is present in the image. For example, it is a case where a specific person appearance scene is searched for in a moving image.

そこで、別の方法として、動画像において、最初のフレームを基準フレームとし、基準フレームと基準フレームに続く核フレーム間で、同位置にある各画素値の差分を求め、画像全体での差分の絶対値の総和が一定の閾値以下であれば、フレーム間での変化が殆どないため、基準フレームで検出された顔領域を、そのフレームにおける顔領域と決定する。 Therefore, as another method, in the moving image, the first frame is used as a reference frame, and the difference between the pixel values at the same position is obtained between the reference frame and the core frame following the reference frame. If the sum of the values is equal to or less than a certain threshold value, there is almost no change between frames, so the face area detected in the reference frame is determined as the face area in that frame.

差分の絶対値の総和が一定の閾値より大きければ顔領域を求め、新たな基準フレームとする。これを繰り返すことにより動画像全体で顔領域を決定する。 If the sum of the absolute values of the differences is larger than a certain threshold value, the face area is obtained and set as a new reference frame. By repeating this, the face area is determined for the entire moving image.

つまり、フレーム間の差分値が少ないフレームの検出処理を飛ばし、検出処理を行うフレーム間隔の適応的な調節を行うものである。 In other words, the detection process of frames with a small difference value between frames is skipped, and the frame interval for performing the detection process is adaptively adjusted.

上記の発明は、各画素値の差分の代わりに画像全体の色ヒストグラムの差分を用いることや、差分の絶対値総和の代わりに差分の自乗和を求めることも可能である。 In the above invention, it is also possible to use the difference of the color histogram of the entire image instead of the difference of each pixel value, or to obtain the square sum of the difference instead of the absolute value sum of the differences.

上記のように、本発明によれば、小さな顔領域の検出処理を省くことで処理時間の短縮が可能となる。 As described above, according to the present invention, it is possible to shorten the processing time by omitting the detection process of a small face area.

また、動画像中に存在する顔領域のうち、基準フレームから基準フレーム間の顔領域検出処理を除き、前基準フレーム同位置に顔領域が存在するとみなすことで処理時間の短縮が可能となる。 Further, the processing time can be shortened by assuming that a face area exists in the same position as the previous reference frame, except for the face area detection process between the reference frames from the reference frame among the face areas existing in the moving image.

以下、図面と共に本発明の実施の形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

［第１の実施の形態］
図３は、本発明の第１の実施の形態における顔領域検出装置の構成を示す。 [First Embodiment]
FIG. 3 shows the configuration of the face area detection apparatus according to the first embodiment of the present invention.

同図に示す顔領域検出装置１０は、入力画像を読み込む画像読込部１１、読み込まれた画像を一時的に格納するメモリ１２、メモリ１２から入力画像を読み出して解像度を変換してメモリ１２に格納する解像度変換部１３、変換済みの画像のうち、低解像度の画像から高解像度の順に顔領域検出処理を行う顔領域検出部１４、顔領域検出部１４から参照されるテンプレート１５、及び処理の終了判定を行う処理終了判定部１６から構成される。なお、同図の構成では、テンプレート１５が含まれているが、本実施の形態の例として顔検出処理において、テンプレートマッチングを用いているためであり、テンプレートマッチング以外の処理の場合は不要である。 The face area detection device 10 shown in FIG. 1 includes an image reading unit 11 that reads an input image, a memory 12 that temporarily stores the read image, and reads the input image from the memory 12, converts the resolution, and stores it in the memory 12. The resolution conversion unit 13 that performs conversion, the face region detection unit 14 that performs face region detection processing from the low-resolution image to the high-resolution image, the template 15 that is referred to by the face region detection unit 14, and the end of the processing It is comprised from the process end determination part 16 which performs determination. Although the template 15 is included in the configuration shown in the figure, it is because template matching is used in face detection processing as an example of the present embodiment, and is not necessary for processing other than template matching. .

図４は、本発明の第１の実施の形態における顔領域検出処理のフローチャートであり、入力画像読み込みステップ（ステップ３０１）、入力画像解像度変換ステップ（ステップ３０２）、顔領域検出ステップ（ステップ３０３）、繰り返し判定ステップ（ステップ３０４）を示している。 FIG. 4 is a flowchart of face area detection processing according to the first embodiment of the present invention, in which an input image reading step (step 301), an input image resolution conversion step (step 302), and a face area detection step (step 303). , A repeat determination step (step 304) is shown.

以下、図４に基づいて顔領域検出の動作を詳細に説明する。 Hereinafter, the face area detection operation will be described in detail with reference to FIG.

まず、ステップ３０１において画像読込部１１に入力されるのは、ディジタル化された静止画像、または、動画像の任意の単一フレームとする。なお、画像の解像度や、保存形式、内容、記録媒体については特に制限しないが、図３では例として、読み込まれたフレームをメモリ１２に一時的に保存するものとする。 First, in step 301, the input to the image reading unit 11 is a digitized still image or an arbitrary single frame of a moving image. The image resolution, storage format, content, and recording medium are not particularly limited, but the read frame is temporarily stored in the memory 12 as an example in FIG.

ステップ３０１では、顔領域検出処理を行う画像を選択し、装置外部の記録媒体からメモリ１２に格納する。画像の選択はオペレータの判断で自由に行ってよい。 In step 301, an image to be subjected to face area detection processing is selected and stored in the memory 12 from a recording medium external to the apparatus. The selection of the image may be freely made at the operator's discretion.

ここで得られた原画像の解像度を（Ｐ×Ｑ）とする。 The resolution of the original image obtained here is (P × Q).

次に、ステップ３０２では、解像度変換部１３において、ステップ３０１でメモリ１２に読み込まれた原画像（Ｐ×Ｑ）を低解像度の画像（Ｋ×Ｌ）に変換する。 Next, in step 302, the resolution converter 13 converts the original image (P × Q) read into the memory 12 in step 301 into a low resolution image (K × L).

初回のループでは顔が判別できる程度で可能な限り低解像度の画像に変換する（例えば、１６×１６程度）。 In the first loop, the image is converted to an image with as low a resolution as possible so that the face can be identified (for example, about 16 × 16).

次回のループからは前回のループで用いた画像より一回り高解像度の画像に変換する。なお、変換の際には画像の縦横の比率は変化させない。 From the next loop, the image is converted to a higher resolution image than the image used in the previous loop. In the conversion, the aspect ratio of the image is not changed.

ここで、図５の原画像を図６で表す低解像度の画像へ変換する例について説明する。但し、各画素は各格子点に対応しているものとする。 Here, an example of converting the original image of FIG. 5 into the low-resolution image shown in FIG. 6 will be described. However, each pixel is assumed to correspond to each grid point.

まず、図６の低解像度画像における画素ａ’は、図５の原画像において同位置の点ａ’に対応する。ここで、図５の点ａ’の周辺は図７のようになっている。 First, the pixel a ′ in the low-resolution image in FIG. 6 corresponds to the point a ′ at the same position in the original image in FIG. 5. Here, the periphery of the point a 'in FIG. 5 is as shown in FIG.

このとき、各点の画素値をｆ（ｘ，ｙ）とすると、点ａ’の画素値ｆは式２で表現される。 At this time, assuming that the pixel value at each point is f (x, y), the pixel value f at the point a 'is expressed by Equation 2.

ｆ＝ｆ（ｕ，ｖ）（１−α）（１−β）＋ｆ（ｕ＋１，ｖ）α（１−β）
＋ｆ（ｕ，ｖ＋１）（１―α）β＋ｆ（ｕ＋１，ｖ＋１）αβ （２）
同様にして図６の低解像度画像における他の格子点の画素値は、原画像の同位置にある点の周囲４点の画素値から求めることができる。 f = f (u, v) (1-α) (1-β) + f (u + 1, v) α (1-β)
+ F (u, v + 1) (1-α) β + f (u + 1, v + 1) αβ (2)
Similarly, the pixel values of other lattice points in the low-resolution image of FIG. 6 can be obtained from the pixel values of four points around the point at the same position in the original image.

次に、ステップ３０３では顔領域検出部１４において、ステップ３０２で得られた低解像度に対して顔検出処理を行う。 Next, in step 303, the face area detection unit 14 performs face detection processing on the low resolution obtained in step 302.

顔検出処理の方法は、テンプレートマッチングを利用した方法、ニューラルネットワークを利用した方法、サポートベクターマシンを利用した方法などがあるが、ここではテンプレートマッチングを利用した方法について説明する。 The face detection processing method includes a method using template matching, a method using a neural network, a method using a support vector machine, and the like. Here, a method using template matching will be described.

まず、テンプレート１５となる顔画像（Ｍ×Ｎ）を記憶媒体（メモリ）上に用意する。 First, a face image (M × N) serving as the template 15 is prepared on a storage medium (memory).

このテンプレートは複数人物の顔画像を（Ｍ×Ｎ）のサイズに切り出し、複数の顔画像の各画素の平均値をテンプレートの対応する画素の値とする。図８のように、テンプレートｔ（ｘ，ｙ）をステップ３０２で得られた低解像度画像ｇ（ｘ，ｙ）中の点（ｉ，ｊ）にその中心が重なるようにし、ｔ（ｘ，ｙ）とそれと重なる画像の部分パターンとの類似度ｍ（ｉ，ｊ）を式３により算出する。 In this template, face images of a plurality of persons are cut into a size of (M × N), and an average value of each pixel of the plurality of face images is set as a value of a corresponding pixel of the template. As shown in FIG. 8, the center of the template t (x, y) overlaps with the point (i, j) in the low resolution image g (x, y) obtained in step 302, and t (x, y ) And the similarity m (i, j) between the overlapping partial pattern of the image is calculated by Equation 3.

同様に、低解像度画像ｇ（ｘ，ｙ）中の全ての点について類似度を算出し、その類似度ｍ（ｘ，ｙ）が予め定めた閾値以下であれば、点（ｘ，ｙ）に顔領域が存在すると見做す。また、全ての点の類似度が予め定めた閾値より大きければ何もしない。

Similarly, the similarity is calculated for all points in the low-resolution image g (x, y), and if the similarity m (x, y) is less than or equal to a predetermined threshold, the point (x, y) is calculated. It is assumed that a face area exists. If the similarity of all points is greater than a predetermined threshold value, nothing is done.

このステップ３０３により、顔領域の座標を得ることができる。 By this step 303, the coordinates of the face area can be obtained.

ステップ３０４では、顔検出処理の繰り返しを行うか否かの判定を行う。以下の条件に当てはまらない場合は、前回のループで用いた変換画像の解像度を徐々に上げながらステップ３０２〜３０４を繰り返す。この場合も画像の縦横の比率は変更しない。 In step 304, it is determined whether or not to repeat the face detection process. If the following conditions are not satisfied, steps 302 to 304 are repeated while gradually increasing the resolution of the converted image used in the previous loop. In this case as well, the aspect ratio of the image is not changed.

ステップ３０３により、初めて顔領域が検出されたループから予め定めた回数ループ処理を繰り返した場合、または、ループ毎に徐々に解像度を上げた変換画像が原画像と一致する場合、ループを終了する。 In step 303, when the loop processing is repeated a predetermined number of times from the loop in which the face area is detected for the first time, or when the converted image whose resolution is gradually increased for each loop matches the original image, the loop is terminated.

本実施の形態によって、画像中に存在する顔領域のうち、小さな領域の検出処理を省くことで処理時間の短縮が可能となる。 According to the present embodiment, it is possible to shorten the processing time by omitting the detection process of a small area among the face areas existing in the image.

［第２の実施の形態］
図９は、本発明の第２の実施の形態における顔領域検出装置の構成を示す。 [Second Embodiment]
FIG. 9 shows a configuration of a face area detection apparatus according to the second embodiment of the present invention.

同図に示す顔領域検出装置２０は、入力画像を読み込む画像読込部２１、読み込まれた画像を格納するメモリ２２、メモリ２２に格納された最初のフレームを基準フレームとして選択してメモリ２２に格納する基準フレーム選択部２３、メモリ２２に格納されている基準フレームと当該基準フレームに続くフレームを比較対象フレームとし、基準フレームと比較対象フレームとの差分値を求め、差分値が一定の閾値以下であれば基準フレームで検出された顔領域を比較対象フレームにおける顔領域として決定する顔領域検出部２４、次の比較対象フレームをメモリ２２から取得し、顔領域検出部２４に渡す比較対象フレーム選択部２６、顔領域検出部２４から参照されるテンプレート２５、及び、各部の処理の終了を判定する処理終了判定部２７から構成される。 The face area detection apparatus 20 shown in FIG. 1 includes an image reading unit 21 that reads an input image, a memory 22 that stores the read image, and the first frame stored in the memory 22 is selected as a reference frame and stored in the memory 22. The reference frame selection unit 23 and the reference frame stored in the memory 22 and the frame following the reference frame are set as comparison target frames, a difference value between the reference frame and the comparison target frame is obtained, and the difference value is equal to or less than a certain threshold value. If there is a face area detection unit 24 that determines a face area detected in the reference frame as a face area in the comparison target frame, a comparison target frame selection unit that acquires the next comparison target frame from the memory 22 and passes it to the face area detection unit 24 26, a template 25 referred to by the face area detection unit 24, and a process end determination unit that determines the end of the process of each unit 7 consists of.

図１０は、本発明の第２の実施の形態における顔領域検出処理のフローチャートである
同図に示すフローチャートは、入力動画図読み込みステップ（ステップ８０１）、基準フレーム選択ステップ（ステップ８０２）、顔領域検出処理ステップ（ステップ８０３）、比較対象フレーム選択ステップ（ステップ８０４）、繰り返し判定ステップ（ステップ８０５）、フレーム間差分値算出ステップ（ステップ８０６）、処理方法判定ステップ（ステップ８０７）を示す。 FIG. 10 is a flowchart of face area detection processing according to the second embodiment of the present invention. The flowchart shown in FIG. 10 includes an input moving image diagram reading step (step 801), a reference frame selection step (step 802), and a face region. A detection processing step (step 803), a comparison target frame selection step (step 804), a repetition determination step (step 805), an inter-frame difference value calculation step (step 806), and a processing method determination step (step 80 7 ) are shown.

以下、図１０に基づいて、顔領域検出の動作を詳細に説明する。 The face area detection operation will be described in detail below with reference to FIG.

まず、ステップ８０１において、画像読込部２１において、ディジタル化された動画像が入力されるものとする。ここで、動画像の解像度や、保存形式、内容、記録媒体については特に制限しない。読み込まれた顔領域検出処理対象の動画像を記録媒体からメモリ２２に格納する。 First, in step 801, it is assumed that the image reading unit 21 receives a digitized moving image. Here, the resolution of the moving image, the storage format, the contents, and the recording medium are not particularly limited. The read moving image subject to face area detection processing is stored in the memory 22 from the recording medium.

ステップ８０２では、基準フレーム選択部２３において、ステップ８０１で得られた動画像から基準となるフレームを選択してメモリ２２に格納する。初回のループでは、動画像の最初のフレームをメモリ２２から読み出して基準フレームとして選択する。 In step 802, the reference frame selection unit 23 selects a reference frame from the moving image obtained in step 801 and stores it in the memory 22. In the first loop, the first frame of the moving image is read from the memory 22 and selected as a reference frame.

ステップ８０３では、顔検出処理部２４において、ステップ８０２で得られた基準フレームについて顔検出処理を行う。顔検出処理の方法はテンプレートマッチングを利用した方法、ニューラルネットワークを利用した方法、サポートベクターマシンを利用した方法などがあるが、ここでは、テンプレートマッチングを利用した方法について説明する。 In step 803, the face detection processing unit 24 performs face detection processing on the reference frame obtained in step 802. The face detection processing method includes a method using template matching, a method using a neural network, a method using a support vector machine, and the like. Here, a method using template matching will be described.

まず、テンプレートとなる顔画像（Ｍ×Ｎ）を記憶媒体（メモリ）上に用意する。 First, a face image (M × N) serving as a template is prepared on a storage medium (memory).

このテンプレート２５は、複数人物の顔画像を（Ｍ×Ｎ）のサイズに切り出し、複数顔画像の各画素の平均値をテンプレートの対応する画素の値とする。 The template 25 cuts face images of a plurality of persons into a size of (M × N), and uses the average value of each pixel of the plurality of face images as the value of the corresponding pixel of the template.

図８のように、顔領域検出部２４は、テンプレートｔ（ｘ，ｙ）をステップ８０３で得られた基準フレーム画像ｇ（ｘ，ｙ）中の点（ｉ，ｊ）にその中心が重なるようにし、ｔ（ｘ，ｙ）とそれと重なる画像の部分パターンとの類似度ｍ（ｉ，ｊ）を前述の式３により算出する。 As shown in FIG. 8, the face area detection unit 24 makes the center of the template t (x, y) overlap the point (i, j) in the reference frame image g (x, y) obtained in step 803. Then, the similarity m (i, j) between t (x, y) and the partial pattern of the overlapping image is calculated by the above-described equation 3.

同様に、基準フレーム画像ｇ（ｘ，ｙ）中の全ての点について類似度を算出し、その類似度ｍ（ｘ，ｙ）が予め定めた閾値以下であれば、点（ｘ，ｙ）に顔領域が存在するとみなす。 Similarly, the similarity is calculated for all points in the reference frame image g (x, y), and if the similarity m (x, y) is equal to or less than a predetermined threshold, the point (x, y) is calculated. It is assumed that a face area exists.

類似度が閾値以下になる点が複数あれば、複数の点すべてに顔領域が存在するとみなす。全ての点の類似度を予め定めた閾値により大きければ何もしない。 If there are a plurality of points whose similarity is equal to or less than the threshold value, it is considered that a face region exists at all of the plurality of points. Nothing is done if the similarity of all points is greater than a predetermined threshold.

ステップ８０３により、顔領域の座標を得ることができる。 In step 803, the coordinates of the face area can be obtained.

ステップ８０４では、比較対象フレーム選択部２６によりメモリ２２から比較対象とするフレームを選択する。比較対象フレームは、前回の比較対象フレームの次フレームとする。初回は基準フレームの次フレームを比較対象フレームとする。 In step 804, the comparison target frame selection unit 26 selects a frame to be compared from the memory 22. The comparison target frame is the next frame of the previous comparison target frame. For the first time, the frame next to the reference frame is set as a comparison target frame.

ステップ８０５では、処理終了判定部２７において、処理の繰り返し判定を行う。ステップ８０４において、比較対象フレームの次フレームが存在しない場合、つまり動画像の最後のフレームまで処理を行った場合、ループを終了する。それ以外の場合は、継続してループ処理を行う。 In step 805, the process end determination unit 27 determines whether to repeat the process. In step 804, if the next frame of the comparison target frame does not exist, that is, if the processing is performed up to the last frame of the moving image, the loop is terminated. In other cases, loop processing is continued.

ステップ８０６では、顔領域検出部２４において、基準フレームと比較対象フレームの間で差分値を求める。差分値とは、画像全体の色ヒストグラムで各色毎の差分値の総和や対応する同位置の各画素値の差分自乗和を用いることもできるが、ここでは、対応する同位置の各画素値の差分絶対値総和を用いた例について説明する。 In step 806, the face area detection unit 24 obtains a difference value between the reference frame and the comparison target frame. The difference value may be the sum of the difference values for each color in the color histogram of the entire image or the difference square sum of the corresponding pixel values at the same position. An example using the sum of absolute differences will be described.

基準フレーム画像ｆ（ｘ，ｙ）と比較対象フレームｇ（ｘ，ｙ）の差分値ｄは、式４により算出される。 The difference value d between the reference frame image f (x, y) and the comparison target frame g (x, y) is calculated by Equation 4.

ステップ８０７では、顔領域検出部２４において、ステップ８０６で得られた差分値ｄに基づき、処理の分岐判定を行う。ステップ８０６で得られた差分値ｄが予め定めた閾値以下であれば、基準フレームと比較対象フレーム間で画像の変化が少ないため、基準フレームと同様の位置に顔領域が存在するとみなし、比較対象フレームの次フレームを新たな比較対象フレームとし、ステップ８０４〜８０７を繰り返す。

In step 807, the face area detection unit 24 determines whether to branch the process based on the difference value d obtained in step 806. If the difference value d obtained in step 806 is less than or equal to a predetermined threshold value, there is little change in the image between the reference frame and the comparison target frame, so it is considered that a face region exists at the same position as the reference frame, and the comparison target Steps 804 to 807 are repeated with the next frame as a new frame to be compared.

ステップ８０６で得られた差分値ｄが予め定めた閾値より大きければ、基準フレームと比較対象フレーム間で画像の変化が大きいため顔領域位置の更新を行う必要がある。 If the difference value d obtained in step 806 is larger than a predetermined threshold value, it is necessary to update the face area position because the image changes greatly between the reference frame and the comparison target frame.

そこで、顔領域検出部２４は、基準フレーム選択部２３に対して、顔領域位置の更新のために、当該比較対象フレームを渡し、基準フレーム選択部２３において、比較対象フレームを新たな基準フレームとして選択し、ステップ８０２〜８０７を繰り返す。 Therefore, the face area detection unit 24 passes the comparison target frame to the reference frame selection unit 23 for updating the face area position, and the reference frame selection unit 23 sets the comparison target frame as a new reference frame. Select and repeat steps 802-807.

本実施の形態によって、動画像中に存在する顔領域のうち、基準フレームから基準フレーム間の顔領域検出処理を省き、前基準フレームと同位置に顔領域が存在すると見做すことで処理時間の短縮が可能となる。 According to the present embodiment, the processing time is eliminated by omitting the face area detection process from the reference frame to the reference frame among the face areas existing in the moving image and assuming that the face area exists at the same position as the previous reference frame. Can be shortened.

また、上記の第１の実施の形態における図４及び第２の実施の形態における図１０のフローチャートに示す動作を、顔領域検出装置として利用されるコンピュータにインストールし、ＣＰＵ等の制御手段により実行する、または、ネットワークを介して流通させることも可能である。 Further, the operations shown in the flowcharts of FIG. 4 in the first embodiment and FIG. 10 in the second embodiment are installed in a computer used as a face area detection device and executed by a control means such as a CPU. Or distributed via a network.

また、構築されたプログラムを、顔領域検出装置として利用されるコンピュータに接続されるハードディスク装置や、フレキシブルディスク、ＣＤ−ＲＯＭ等の可搬記憶媒体に格納しておき、実施する際に、コンピュータにインストールすることも可能である。 In addition, the constructed program is stored in a hard disk device connected to a computer used as a face area detection device, a portable storage medium such as a flexible disk, a CD-ROM, and the like. It is also possible to install.

なお、本発明は、上記の実施の形態に限定されることなく、特許請求の範囲内において、種々変更・応用が可能である。 The present invention is not limited to the above-described embodiment, and various modifications and applications can be made within the scope of the claims.

本発明は、画像処理技術、顔認証システムや防犯システム等における顔領域を検出する機能を有するシステムに適用可能である。 The present invention can be applied to a system having a function of detecting a face area in an image processing technique, a face authentication system, a security system, or the like.

本発明の原理を説明するための図である。It is a figure for demonstrating the principle of this invention. 本発明の原理構成図である。It is a principle block diagram of this invention. 本発明の第１の実施の形態における顔領域検出装置の構成図である。It is a block diagram of the face area detection apparatus in the 1st Embodiment of this invention. 本発明の第１の実施の形態における顔領域検出処理のフローチャートである。It is a flowchart of the face area detection process in the 1st Embodiment of this invention. 本発明の第１の実施の形態における高解像度画像の例である。It is an example of the high resolution image in the 1st Embodiment of this invention. 本発明の第１の実施の形態における低解像度画像の例である。It is an example of the low-resolution image in the 1st Embodiment of this invention. 本発明の第１の実施の形態における解像度変換算出イメージを示す図である。It is a figure which shows the resolution conversion calculation image in the 1st Embodiment of this invention. 本発明の第１の実施の形態におけるテンプレートマッチングの例である。It is an example of the template matching in the 1st Embodiment of this invention. 本発明の第２の実施の形態における顔領域検出装置の構成図である。It is a block diagram of the face area detection apparatus in the 2nd Embodiment of this invention. 本発明の第２の実施の形態における顔領域検出処理のフローチャートである。It is a flowchart of the face area detection process in the 2nd Embodiment of this invention. 従来のテンプレートマッチングの概要図である。It is a schematic diagram of conventional template matching. 従来の画像解像度変換を示す図である。It is a figure which shows the conventional image resolution conversion.

Explanation of symbols

１０顔領域検出装置
１１画像読込手段、画像読込部
１２記憶手段、メモリ
１３解像度変換手段、解像度変換部
１４顔領域検出手段、顔領域検出部
１５テンプレート
１６終了判定手段、処理終了判定部
２０顔領域検出装置
２１画像読込部
２２メモリ
２３基準フレーム選択部
２４顔領域検出部
２５テンプレート
２６比較対象フレーム選択部
２７処理終了判定部 DESCRIPTION OF SYMBOLS 10 Face area detection apparatus 11 Image reading means, Image reading part 12 Memory | storage means, Memory 13 Resolution conversion means, Resolution conversion part 14 Face area detection means, Face area detection part 15 Template 16 End determination means, Process end determination part 20 Face area Detection device 21 Image reading unit 22 Memory 23 Reference frame selection unit 24 Face area detection unit 25 Template 26 Comparison target frame selection unit 27 Processing end determination unit

Claims

In a face area detection method in an apparatus for detecting a face area from an image including a still image and a moving image,
An image reading process of reading an input image and storing it in a storage means;
A conversion image obtained by reading out one input image from the storage means and reducing the number of pixels on each side in the vertical and horizontal directions from the input image, and converting from the lowest possible resolution to the resolution of the input image so that a face can be identified. A resolution conversion process of creating a predetermined number of images by converting the input images without changing the aspect ratio of the input images, and storing the images in a storage unit;
A face area detection process in which face area detection processing is performed using a converted image in which the resolution is increased in order from the converted image having the lowest resolution among the converted images converted by the resolution conversion process;
In cases where processing count has reached the predetermined number of times when performing the face area detecting process by increasing the resolution in order from finding a first face region by the face region detecting step, the end to end process of the face image detection process Judgment process,
A face area detection method characterized by:

In a face area detection method in an apparatus for detecting a face area from a moving image,
An image reading process of reading an input moving image and storing it in a storage means;
A reference frame selection process of reading the first frame of the input moving image from the storage means and storing it in the storage means as a reference frame;
A face detection process for performing face area detection on the reference frame;
A comparison target frame selection process in which a frame subsequent to the reference frame is a comparison target frame.
A difference calculation process for obtaining a difference value between the reference frame and the comparison target frame;
If the difference value obtained in the difference value calculation process is equal to or smaller than a certain threshold value, the face area detected in the reference frame is determined as the face area in the comparison target frame, and a new comparison target frame is set as the next frame. The difference calculation process is repeated, and if the difference value is larger than a certain threshold value, an end process of repeating the face area detection process and the subsequent steps using the comparison target frame as a new reference frame is performed.
The face detection process includes:
A converted image obtained by reducing the number of pixels on each side in the vertical and horizontal directions from the image of the reference frame, and a converted image from the lowest possible resolution to the resolution of the image of the reference frame to the extent that a face can be identified. A resolution conversion process in which a predetermined number is created by converting the image of the reference frame without changing the aspect ratio of the image, and stored in the storage means;
A face area detection process in which face area detection processing is performed using a converted image in which the resolution is increased in order from the converted image having the lowest resolution among the converted images converted by the resolution conversion process;
In cases where processing count has reached the predetermined number of times when performing the face area detecting process by increasing the resolution in order from finding a first face region by the face region detecting step, the end to end the processing of said pigment region detection process Judgment process,
A face area detection method characterized by:

In the difference value calculation process,
The face area detection method according to claim 2, wherein a difference value of each pixel value is obtained as the difference value.

In the difference value calculation process,
The face area detection method according to claim 2, wherein the difference value is calculated using a difference of a color histogram of the entire image.

A face area detection device for detecting a face area from an image including a still image and a moving image,
Image reading means for reading an input image and storing it in a storage means;
A conversion image obtained by reading out one input image from the storage means and reducing the number of pixels on each side in the vertical and horizontal directions from the input image, and converting from the lowest possible resolution to the resolution of the input image so that a face can be identified. A resolution conversion unit that creates a predetermined number of images by converting the input image without changing the aspect ratio of the input image, and stores the image in a storage unit;
A face area detecting means for performing a face area detecting process using a converted image whose resolution is increased in order from the converted image having the lowest resolution among the converted images converted by the resolution converting means;
After finding the first face area if the number of times of processing reaches a predetermined number of times when performing the face area detecting process by increasing the resolution in the order by the face region detecting means, end determining unit for terminating said pigment image detection means And a face area detecting device.

A face area detection device for detecting a face area from a moving image,
Image reading means for reading an input moving image and storing it in a storage means;
Reference frame selection means for reading the first frame of the input moving image from the storage means and storing it in the storage means as a reference frame;
Face detection means for performing face area detection for the reference frame;
Comparison target frame selection means to be a frame comparison target frame following the reference frame;
Difference value calculating means for obtaining a difference value between the reference frame and the comparison target frame;
If the difference value obtained by the difference value calculation means is equal to or less than a certain threshold value, a face area determination means for determining a face area detected in the reference frame as a face area in the comparison target frame;
A first repeating unit that repeats the processing of the comparison target frame selecting unit, the difference value calculating unit, and the face region determining unit with a new comparison target frame as the next frame after the face region determining unit;
If the difference value obtained by the difference value calculation means is larger than a certain threshold value, a second repetition means for repeating the processes of the reference frame selection means, the face detection means, the difference value calculation means, and the face area determination means And having
The face detection means includes
A converted image obtained by reducing the number of pixels on each side in the vertical and horizontal directions from the image of the reference frame, and a converted image from the lowest possible resolution to the resolution of the image of the reference frame to the extent that a face can be identified. A resolution conversion unit that creates a predetermined number of images by converting the image of the reference frame without changing the aspect ratio of the image, and stores the number in a storage unit;
A face area detecting means for performing a face area detecting process using a converted image whose resolution is increased in order from the converted image having the lowest resolution among the converted images converted by the resolution converting means;
After finding the first face area if the number of times of processing reaches a predetermined number of times when performing the face area detecting process by increasing the resolution in the order by the face region detecting means, the end to end process of said pigment region detection means A determination means;
A face area detecting apparatus characterized by comprising:

The difference value calculating means includes:
The face area detection device according to claim 6, wherein a difference value of each pixel value is obtained as the difference value.

The difference value calculating means includes:
The face area detection apparatus according to claim 6, wherein the difference value is calculated using a difference of a color histogram of the entire image.

A face area detection program for causing a computer to function as each means constituting the face area detection device according to claim 5.

A computer-readable storage medium storing the face area detection program according to claim 9.