JP3401511B2

JP3401511B2 - Computer-readable recording medium and image characteristic point extracting apparatus, in which a program for causing a computer to execute the method for extracting characteristic points of an image and the method for extracting the characteristic points of the image are recorded.

Info

Publication number: JP3401511B2
Application number: JP22557499A
Authority: JP
Inventors: 慎二郎川戸
Original assignee: ATR Advanced Telecommunications Research Institute International
Current assignee: ATR Advanced Telecommunications Research Institute International
Priority date: 1999-08-09
Filing date: 1999-08-09
Publication date: 2003-04-28
Anticipated expiration: 2019-08-09
Also published as: JP2001052176A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】この発明はカメラ等からの画
像を処理する画像処理に関し、特に、画像中の人物の顔
等の特徴点を抽出するための画像認識の分野に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to image processing for processing an image from a camera or the like, and more particularly to the field of image recognition for extracting feature points such as a person's face in an image.

【０００２】[0002]

【従来の技術】通信により、遠隔地にいる複数の人間で
会議を行うＴＶ会議システムが実用化されている。しか
しこれらシステムにおいて、映像そのものを送ると通信
データ量が増大するという問題点がある。そのために、
たとえば対象となる人物の視線、顔の向き、表情等に関
する特徴データを各地で抽出し、抽出したデータのみを
互いに送信する技術が研究されている。受信側では、こ
のデータに基づいて仮想的な人物の顔面の画像を生成し
て表示する。これにより、通信データ量を削減しなが
ら、ＴＶ会議を効率良く行える。2. Description of the Related Art A video conferencing system has been put into practical use in which a plurality of people in a remote place conduct a conference by communication. However, in these systems, there is a problem that the amount of communication data increases when the image itself is sent. for that reason,
For example, a technique has been studied in which feature data relating to the line of sight, face orientation, facial expression, etc. of a target person is extracted at various locations and only the extracted data is transmitted to each other. On the receiving side, an image of the face of a virtual person is generated and displayed based on this data. As a result, the video conference can be efficiently performed while reducing the communication data amount.

【０００３】また、たとえば放送を用いた教育システム
では、講師が各地にいる受講者の反応を見ながら講議を
すすめて行くことが望ましい。この場合にも各地の映像
を講師の講議している場所に送信すると通信データ量が
多くなる。そもそも、受講者が多数になると全受講者の
映像を送ることは実際的でなく、なんらかの方法で各受
講者の反応を各地で抽出し、その反応を示す情報のみを
講師に送信し、講師に対しては抽象的な「受講者の集合
の反応」という形で提示する方が好ましい。Further, for example, in an educational system using broadcasting, it is desirable that the lecturer proceed with the lecture while watching the reactions of the students who are in various places. In this case, too, the amount of communication data increases when the images from various places are sent to the place where the lecturer is giving a lecture. In the first place, it is impractical to send the video of all the students when the number of students is large, and somehow the reaction of each student is extracted in each region, and only the information showing the reaction is sent to the instructor, and the instructor is sent. On the other hand, it is preferable to present it in the form of an abstract “reaction of the set of students”.

【０００４】こうした処理を実現するためには、人物の
顔画像からその表情、姿勢、視線方向等を認識すること
が必要である。そのためには、顔の位置を特定し、さら
に人物の表情の変化が顕著にあらわれる目、鼻、口等の
顔部品、特に目の位置を検出する必要がある。In order to realize such processing, it is necessary to recognize the facial expression, posture, line-of-sight direction and the like from the face image of the person. For that purpose, it is necessary to specify the position of the face and detect face parts such as eyes, nose, and mouth, in particular, the positions of the eyes, where the facial expression of the person is remarkably changed.

【０００５】現在のところ、人物の顔全体の位置を映像
から特定し追跡する技術としては、映像のカラー情報を
利用して肌色を検出・追跡する方法が提案されている。
またより簡単な方法として、映像の背景の動きが少なく
人物のみが動くものと仮定して、映像のフレーム間差分
により顔の領域を検出する方法がある。At present, as a technique for identifying and tracking the position of the entire face of a person from a video, a method of detecting and tracking a skin color using color information of the video has been proposed.
Further, as a simpler method, there is a method of detecting a face area by a difference between frames of a video, assuming that the background of the video has little movement and only a person moves.

【０００６】こうして顔全体の概略位置が検出された後
に、目を検出するための技術としては、顔の領域内の画
像の明暗の分布とあらかじめ準備されたテンプレートと
のマッチングを用いるもの、顔領域の画像の縦方向、横
方向への投影処理により顔部品の位置を見つけるものが
提案されている。[0006] As a technique for detecting the eyes after the approximate position of the entire face is detected in this way, a technique of using the matching of the light and dark distribution of the image in the face region with a template prepared in advance, the face region It has been proposed to find the position of the facial part by projecting the image in the vertical and horizontal directions.

【０００７】[0007]

【発明が解決しようとする課題】しかし、テンプレート
マッチングを用いるものでは、精度よい検出を行おうと
すると多数のテンプレートを準備する必要がある。その
ために多くの記憶容量を必要とするのみでなく、マッチ
ングのための処理時間も長くかかり、実時間での処理に
は向かないという問題点がある。また、テンプレートマ
ッチングではたとえば顔を傾けた場合、すなわち水平軸
周りにやや回転した場合の顔部品を精度よく検出するた
めには、その回転に対応したテンプレートを用意する必
要がある。回転角度が異なれば別のテンプレートを用意
する必要があるので、これによってますますテンプレー
トの数が増大し処理時間が遅くなる。こうした問題はや
や横を向いた顔の顔部品の位置の検出においても同様に
生ずる。また画像の全体的な明るさが変動したときの検
出に対しても対応しようとすれば、そのための新たなテ
ンプレートを準備する必要がある。However, in the case of using template matching, it is necessary to prepare a large number of templates in order to perform accurate detection. Therefore, not only a large storage capacity is required, but also the processing time for matching is long, which is not suitable for real-time processing. Further, in template matching, for example, in order to accurately detect a face part when the face is tilted, that is, when the face is slightly rotated around the horizontal axis, it is necessary to prepare a template corresponding to the rotation. This will increase the number of templates and slow down the processing time, because different templates need to be prepared for different rotation angles. Such a problem similarly occurs in the detection of the position of the face part of the face that faces a little sideways. In addition, in order to deal with detection when the overall brightness of the image changes, it is necessary to prepare a new template for that.

【０００８】また、こうした問題は顔部品の位置の検出
のみに固有の問題ではなく、たとえば製造、流通段階に
おける管理のための各種工業製品の種類の、外観画像か
らの認識、移動する物体に付された特定のマークの認識
等においても同様に生じ得る。Further, such a problem is not unique to the detection of the position of the face part, and is associated with the recognition of the type of various industrial products for the management at the manufacturing and distribution stage from the appearance image and the moving object. The same can occur in the recognition of the specific mark that is created.

【０００９】それゆえに本発明の目的は、画像全体の傾
向にあまり影響を受けずに画像認識に必要な特徴情報が
得られる画像特徴点抽出装置、そのための方法および当
該方法をコンピュータを用いて実現するためのプログラ
ムを記録した記録媒体を提供することである。Therefore, an object of the present invention is to realize an image feature point extracting apparatus which can obtain the feature information necessary for image recognition without being largely influenced by the tendency of the entire image, a method therefor, and a method using the computer. It is to provide a recording medium in which a program for performing the above is recorded.

【００１０】この発明の他の目的は、画像全体の傾向お
よび画像の回転にあまり影響を受けずに表情認識に必要
な顔部品の位置の検出が行える画像特徴点抽出装置、そ
のための方法および当該方法をコンピュータを用いて実
現するためのプログラムを記録した記録媒体を提供する
ことである。Another object of the present invention is to provide an image feature point extraction apparatus capable of detecting the positions of face parts necessary for facial expression recognition without being significantly affected by the tendency of the entire image and the rotation of the image, a method therefor, and a method therefor. It is to provide a recording medium recording a program for realizing the method using a computer.

【００１１】この発明のさらに他の目的は、画像全体の
傾向および画像の回転にあまり影響を受けずに、顔画像
から、目を特定するための情報の抽出が行える画像特徴
点抽出装置、そのための方法および当該方法をコンピュ
ータを用いて実現するためのプログラムを記録した記録
媒体を提供することである。Still another object of the present invention is to provide an image feature point extraction device capable of extracting information for identifying an eye from a face image without being significantly affected by the tendency of the entire image and the rotation of the image. And a recording medium on which a program for implementing the method using a computer is recorded.

【００１２】[0012]

【課題を解決するための手段】請求項１に記載の発明に
かかる画像の特徴点の抽出方法は、人間の顔領域であっ
て対象となる画像領域内の各画素の値のデジタルデータ
を準備するステップと、対象となる画像領域内の画素の
各々に対して、当該画素を囲むあらかじめ定められた閉
曲線上の画素の値の分布曲線が示す特定の波数成分の相
対的な量を求めるステップと、顔領域内で、特定の波数
成分の相対的な量が局所的に最大を示す画素を顔領域内
の特徴点たる眉間を表す点として抽出するステップとを
含む。According to a first aspect of the present invention, there is provided a method of extracting a feature point of an image which is a human face area.
Te comprising: providing a digital data value of each pixel in the image region of interest, for each pixel in the image region of interest, the value of the pixels on the predetermined closed curve surrounding the pixel The step of obtaining the relative amount of the specific wave number component indicated by the distribution curve and the specific wave number within the face area
The pixel in which the relative amount of the component locally shows the maximum is set in the face area.
And a step of extracting the feature points as points representing the eyebrows .

【００１３】閉曲線上の画素の値の分布曲線が示す特定
の波数成分の相対的な量にしたがって特徴点を抽出する
ので、画素の値の全体的な値の傾向にかかわらず、特徴
点を抽出できる。Since the characteristic points are extracted according to the relative amount of the specific wave number component indicated by the distribution curve of the pixel values on the closed curve, the characteristic points are extracted regardless of the tendency of the overall pixel value. it can.

【００１４】請求項２に記載の発明にかかる画像の特徴
点の抽出方法は、請求項１に記載の発明の構成に加え
て、閉曲線は、画素を中心とし、画像領域の大きさに応
じて定められる半径を持つ円である。According to a second aspect of the present invention, in addition to the structure of the first aspect of the present invention, the closed curve has a pixel at the center, and the closed curve corresponds to the size of the image area. A circle with a defined radius.

【００１５】円周上の画素の値の分布の波数成分は回転
不変な量である。したがって、請求項１に記載の発明の
効果に加えて、対象となる画像が回転していても特徴点
の抽出を行うことができる。The wave number component of the distribution of pixel values on the circumference is a rotation invariant quantity. Therefore, in addition to the effect of the invention described in claim 1, the feature points can be extracted even if the target image is rotated.

【００１６】請求項３に記載の発明にかかる画像の特徴
点の抽出方法は、請求項１または請求項２に記載の発明
の構成に加えて、特定の波数成分の相対的な量を求める
ステップは、対象となる画像領域内の画素の各々に対し
て、当該画素を囲む閉曲線上の各画素の値をサンプリン
グ値として離散フーリエ変換を行うことにより特定の波
数成分の相対的な量を求めるステップを含む。According to a third aspect of the present invention, in addition to the configuration of the first or second aspect of the present invention, the method of extracting feature points of an image further comprises the step of obtaining a relative amount of a specific wave number component. Is a step of obtaining a relative amount of a specific wave number component by performing a discrete Fourier transform for each of the pixels in the target image area, using the value of each pixel on the closed curve surrounding the pixel as a sampling value. including.

【００１７】離散フーリエ変換という、効率的な演算を
行うことにより波数成分の相対的な量を求めることがで
きる。したがって、請求項１または請求項２に記載の発
明の効果に加えて、高速かつ効率的に特徴点を抽出でき
る。The relative amount of wave number components can be obtained by performing an efficient operation called discrete Fourier transform. Therefore, in addition to the effect of the invention described in claim 1 or claim 2, feature points can be extracted quickly and efficiently.

【００１８】請求項４に記載の発明にかかる画像の特徴
点の抽出方法は、請求項１〜請求項３のいずれかに記載
の発明の構成に加えて、特定の波数成分は波数２の成分
である。According to a fourth aspect of the present invention, there is provided a method of extracting a feature point of an image, wherein the specific wave number component is a component having a wave number of 2 in addition to the configuration of the first aspect of the invention. Is.

【００１９】ある特徴点の周囲に、たとえば明度でいう
と「明、暗、明、暗」のように「明と暗」の繰返しが２
回出現するような画素の値の分布がある場合には、波数
２の成分の相対的な量が多くなり、周囲との比較ではそ
の特徴点は確実に局所最大値を示す。したがって、請求
項１〜請求項３のいずれかに記載の発明の効果に加え
て、周囲に２回の繰返しがある画素の値の分布を有する
特徴点を確実に抽出することができる。Around a certain characteristic point, for example, in terms of lightness, there are two repetitions of "bright and dark" such as "bright, dark, bright, dark".
When there is a distribution of pixel values that appear repeatedly, the relative amount of the wave number 2 component increases, and the feature point surely exhibits the local maximum value in comparison with the surroundings. Therefore, in addition to the effect of the invention described in any one of claims 1 to 3, it is possible to reliably extract a feature point having a distribution of pixel values having two repetitions in the surroundings.

【００２０】請求項５に記載の発明にかかる画像の特徴
点の抽出方法は、請求項４に記載の発明の構成に加えて
さらに、特徴点として抽出するステップによって抽出さ
れた複数個の特徴点の候補のうち、当該候補点を囲む閉
曲線上の各画素の値の分布曲線の、波数１の成分のスペ
クトルパワーと波数２の成分のスペクトルパワーとの比
が所定のしきい値より小さいもののみを特徴点として選
択するステップを含む。According to a fifth aspect of the present invention, there is provided a method of extracting a feature point of an image, wherein in addition to the configuration of the fourth aspect of the invention, a plurality of feature points extracted by the step of extracting the feature point Among the candidates of the above, only those in which the ratio of the spectral power of the component of wave number 1 to the spectral power of the component of wave number 2 of the distribution curve of the value of each pixel on the closed curve surrounding the candidate point is smaller than a predetermined threshold value. Is selected as a feature point.

【００２１】たとえば明度でいうと「明、暗、明、暗」
のように「明と暗」の繰返しが２回出現するような画素
の値の分布がある場合には、その周上での画素の値の分
布が理想的に正弦カーブに近ければ近いほど、波数１の
成分の相対的な量は小さくなり、波数２の成分の相対的
な量は大きくなる。したがってその比が所定のしいき値
よりも小さいほど、その点が上記した特徴を持つ特徴点
である可能性が高い。したがって、請求項４に記載の発
明の効果に加えて、周囲に２回の繰返しがある画素の値
の分布を有する特徴点をより確実に抽出することができ
る。For example, in terms of brightness, "bright, dark, bright, dark"
When there is a distribution of pixel values such that the repetition of “bright and dark” appears twice, the closer the distribution of pixel values on that circumference is to a sine curve, the closer The relative amount of the wave number 1 component becomes small, and the relative amount of the wave number 2 component becomes large. Therefore, the smaller the ratio is than the predetermined threshold value, the higher the possibility that the point is the feature point having the above-mentioned features. Therefore, in addition to the effect of the invention described in claim 4, it is possible to more reliably extract a feature point having a distribution of pixel values having two repetitions in the surroundings.

【００２２】請求項６に記載の発明にかかる画像の特徴
点の抽出方法は、請求項１に記載の発明の構成に加え
て、各画素の値は各画素の明度である。According to a sixth aspect of the present invention, there is provided a method of extracting feature points of an image, in addition to the configuration of the first aspect of the invention.
Te, the value of each pixel is Ru lightness der of each pixel.

【００２３】人間の顔領域において眉間は、周囲に明、
暗の繰返しが所定回数表れる。したがって、請求項１に
記載の発明の効果に加えて、人間の顔面の特徴点である
眉間を、その周囲の画素の明度の値の分布の波数成分に
基づいて抽出できる。In the human face area , the space between the eyebrows is bright in the surroundings,
Dark iteration Ru predetermined number appears. Therefore, in addition to the effect of the invention described in claim 1, it is a feature point of the human face.
The eyebrow can be extracted based on the wave number component of the distribution of the brightness values of the pixels around it.

【００２４】請求項７に記載の発明にかかるコンピュー
タ読取可能な記録媒体は、コンピュータに、対象となる
画像領域内の特徴点を抽出する方法を実行させるための
プログラムを記録した、コンピュータ読取可能な記録媒
体であって、プログラムは、人間の顔領域であって対象
となる画像領域内の各画素の値のデジタルデータを準備
する処理を行うプログラム部分と、対象となる画像領域
内の画素の各々に対して、当該画素を囲むあらかじめ定
められた閉曲線上の画素の値の分布曲線が示す特定の波
数成分の相対的な量を求める処理を行うプログラム部分
と、顔領域内で、特定の波数成分の相対的な量が局所的
に最大を示す画素を顔領域内の特徴点たる眉間を表す点
として抽出する処理を行うプログラム部分とを含む。A computer-readable recording medium according to a seventh aspect of the present invention is a computer-readable recording medium in which a program for causing a computer to execute a method of extracting a feature point in a target image area is recorded. The program is a recording medium, and the program includes a program portion that performs processing for preparing digital data of the value of each pixel in the target image area that is a human face area, and each of the pixels in the target image area. , The program part that performs the process of obtaining the relative amount of the specific wave number component indicated by the distribution curve of the pixel values on the predetermined closed curve surrounding the pixel , and the specific wave number component within the face area. The relative amount of is local
The pixel that shows the maximum is the point that represents the space between the eyebrows that is the feature point in the face area.
And a program part for performing a process of extracting as .

【００２５】閉曲線上の画素の値の分布曲線が示す特定
の波数成分の相対的な量にしたがって特徴点を抽出する
ので、画素の値の全体的な値の傾向にかかわらず、特徴
点を抽出できる。Since the feature points are extracted according to the relative amount of the specific wave number component indicated by the distribution curve of the pixel values on the closed curve, the feature points are extracted regardless of the tendency of the overall pixel value. it can.

【００２６】請求項８に記載の発明にかかるコンピュー
タ読取可能な記録媒体は、請求項７に記載の発明の構成
に加えて、閉曲線は、画素を中心とし、画像領域の大き
さに応じて定められる半径を持つ円である。In the computer-readable recording medium according to the invention described in claim 8, in addition to the configuration of the invention described in claim 7, the closed curve is determined according to the size of the image area with the pixel at the center. It is a circle with a radius.

【００２７】円周上の画素の値の分布の波数成分は回転
不変な量である。したがって、請求項７に記載の発明の
効果に加えて、対象となる画像が回転していても特徴点
の抽出を行うことができる。The wave number component of the distribution of pixel values on the circumference is a rotation invariant quantity. Therefore, in addition to the effect of the invention described in claim 7, the feature points can be extracted even when the target image is rotated.

【００２８】請求項９に記載の発明にかかるコンピュー
タ読取可能な記録媒体は、請求項７または請求項８に記
載の発明の構成に加えて、特定の波数成分の相対的な量
を求める処理を行うプログラム部分は、対象となる画像
領域内の画素の各々に対して、当該画素を囲む閉曲線上
の各画素の値をサンプリング値として離散フーリエ変換
を行うことにより特定の波数成分の相対的な量を求める
処理を行うプログラム部分を含む。According to a ninth aspect of the present invention, there is provided a computer-readable recording medium, in addition to the configuration of the seventh or eighth aspect of the invention, a process for obtaining a relative amount of a specific wave number component. The program part to perform is the relative amount of a specific wave number component by performing a discrete Fourier transform for each pixel in the target image area by using the value of each pixel on the closed curve surrounding the pixel as a sampling value. Includes a program part for performing the process for obtaining

【００２９】離散フーリエ変換という、効率的な演算を
行うことにより波数成分の相対的な量を求めることがで
きる。したがって、請求項７または請求項８に記載の発
明の効果に加えて、高速かつ効率的に特徴点を抽出でき
る。The relative amount of wave number components can be obtained by performing an efficient operation called discrete Fourier transform. Therefore, in addition to the effect of the invention described in claim 7 or claim 8, feature points can be extracted quickly and efficiently.

【００３０】請求項１０に記載の発明にかかるコンピュ
ータ読取可能な記録媒体は、請求項７〜請求項９のいず
れかに記載の発明の構成に加えて、特定の波数成分は波
数２の成分である。According to a tenth aspect of the present invention, there is provided a computer-readable recording medium, wherein the specific wave number component is a component having a wave number of 2 in addition to the configuration of the invention according to any of the seventh to ninth aspects. is there.

【００３１】ある特徴点の周囲に、たとえば明度でいう
と「明、暗、明、暗」のように「明と暗」の繰返しが２
回出現するような画素の値の分布がある場合には、波数
２の成分の相対的な量が多くなり、周囲との比較ではそ
の特徴点は確実に局所最大値を示す。したがって、請求
項７〜請求項９のいずれかに記載の発明の効果に加え
て、周囲に２回の繰返しがある画素の値の分布を有する
特徴点を確実に抽出することができる。Around a certain feature point, for example, in terms of lightness, there are two repetitions of "bright and dark" such as "bright, dark, bright, dark".
When there is a distribution of pixel values that appear repeatedly, the relative amount of the wave number 2 component increases, and the feature point surely exhibits the local maximum value in comparison with the surroundings. Therefore, in addition to the effect of the invention described in any one of claims 7 to 9, it is possible to reliably extract feature points having a distribution of pixel values having two repetitions in the surroundings.

【００３２】請求項１１に記載の発明にかかるコンピュ
ータ読取可能な記録媒体のプログラムは、請求項１０に
記載の発明の構成に加えてさらに、特徴点として抽出す
る処理を行うプログラム部分によって抽出された複数個
の特徴点の候補のうち、当該候補点を囲む閉曲線上の各
画素の値の分布曲線の、波数１の成分のスペクトルパワ
ーと波数２の成分のスペクトルパワーとの比が所定のし
きい値より小さいもののみを特徴点として選択する処理
を行うプログラム部分を含む。The program of a computer-readable recording medium according to an eleventh aspect of the present invention is extracted by a program portion that performs a process of extracting a feature point in addition to the configuration of the tenth aspect of the invention. Of the plurality of feature point candidates, the ratio of the spectral power of the wave number 1 component and the spectral power of the wave number 2 component of the distribution curve of the values of each pixel on the closed curve surrounding the candidate point is a predetermined threshold. It includes a program part for performing a process of selecting only those smaller than the value as feature points.

【００３３】たとえば明度でいうと「明、暗、明、暗」
のように「明と暗」の繰返しが２回出現するような画素
の値の分布がある場合には、その周上での画素の値の分
布が理想的に正弦カーブに近ければ近いほど、波数１の
成分の相対的な量は小さくなり、波数２の成分の相対的
な量は大きくなる。したがってその比が所定のしいき値
よりも小さいほど、その点が上記した特徴を持つ特徴点
である可能性が高い。したがって、請求項１０に記載の
発明の効果に加えて、周囲に２回の繰返しがある画素の
値の分布を有する特徴点をより確実に抽出することがで
きる。For example, in terms of brightness, "bright, dark, bright, dark"
When there is a distribution of pixel values such that the repetition of “bright and dark” appears twice, the closer the distribution of pixel values on that circumference is to a sine curve, the closer The relative amount of the wave number 1 component becomes small, and the relative amount of the wave number 2 component becomes large. Therefore, the smaller the ratio is than the predetermined threshold value, the higher the possibility that the point is the feature point having the above-mentioned features. Therefore, in addition to the effect of the invention described in the tenth aspect, it is possible to more reliably extract the feature points having the distribution of the values of the pixels having the repetition twice around.

【００３４】請求項１２に記載の発明にかかるコンピュ
ータ読取可能な記録媒体は、請求項７に記載の発明の構
成に加えて、各画素の値は各画素の明度である。[0034] Such computer-readable recording medium in the invention of claim 12, in addition to the structure of the invention according to claim 7, the value of each pixel is Ru lightness der of each pixel.

【００３５】人間の顔領域において眉間は、周囲に明、
暗の繰返しが所定回数表れる。したがって、請求項７に
記載の発明の効果に加えて、人間の顔面の特徴点である
眉間を、その周囲の画素の明度の値の分布の波数成分に
基づいて抽出できる。In the human face area , the space between the eyebrows is bright in the surroundings,
Dark iteration Ru predetermined number appears. Therefore, in addition to the effect of the invention described in claim 7, it is a feature point of the human face.
The eyebrow can be extracted based on the wave number component of the distribution of the brightness values of the pixels around it.

【００３６】請求項１３に記載の発明にかかる画像特徴
点抽出装置は、人間の顔領域であって対象となる画像領
域内の各画素の値のデジタルデータを準備するための手
段と、対象となる画像領域内の画素の各々に対して、当
該画素を囲むあらかじめ定められた閉曲線上の値の分布
曲線が示す特定の波数成分の相対的な量を求めるための
手段と、顔領域内で、特定の波数成分の相対的な量が局
所的に最大を示す画素を顔領域内の特徴点たる眉間を表
す点として抽出するための手段とを含む。The image feature point extraction device according to the thirteenth aspect of the present invention is a device for preparing digital data of the value of each pixel in the target image region which is the human face region, and the target and For each of the pixels in the image area, the means for obtaining the relative amount of the specific wave number component indicated by the distribution curve of the values on the predetermined closed curve surrounding the pixel, and in the face area, The relative amount of a particular wavenumber component
The pixel showing the maximum locally is displayed as the feature point in the face area between the eyebrows.
Means for extracting the points .

【００３７】閉曲線上の画素の値の分布曲線が示す特定
の波数成分の相対的な量にしたがって特徴点を抽出する
ので、画素の値の全体的な値の傾向にかかわらず、特徴
点を抽出できる。Since the characteristic points are extracted according to the relative amount of the specific wave number component indicated by the distribution curve of the pixel values on the closed curve, the characteristic points are extracted regardless of the tendency of the overall pixel value. it can.

【００３８】請求項１４に記載の発明にかかる画像特徴
点抽出装置は、請求項１３に記載の発明の構成に加え
て、閉曲線は、画素を中心とし、画像領域の大きさに応
じて定められる半径を持つ円である。According to the image feature point extracting apparatus of the fourteenth aspect of the invention, in addition to the configuration of the thirteenth aspect of the invention, the closed curve is centered on the pixel and is determined according to the size of the image area. It is a circle with a radius.

【００３９】円周上の画素の値の分布の波数成分は回転
不変な量である。したがって、請求項１３に記載の発明
の効果に加えて、対象となる画像が回転していても特徴
点の抽出を行うことができる。The wave number component of the pixel value distribution on the circumference is a rotation invariant quantity. Therefore, in addition to the effect of the invention described in claim 13, the feature points can be extracted even when the target image is rotated.

【００４０】請求項１５に記載の発明にかかる画像特徴
点抽出装置は、請求項１３または請求項１４に記載の発
明の構成に加えて、特定の波数成分の相対的な量を求め
るための手段は、対象となる画像領域内の画素の各々に
対して、当該画素を囲む閉曲線上の各画素の値をサンプ
リング値として離散フーリエ変換を行うことにより特定
の波数成分の相対的な量を求めるための手段を含む。The image feature point extracting device according to the invention of claim 15 is, in addition to the configuration of the invention of claim 13 or 14, a means for obtaining a relative amount of a specific wave number component. Is to obtain the relative amount of a specific wave number component by performing a discrete Fourier transform for each pixel in the target image area using the value of each pixel on the closed curve surrounding the pixel as a sampling value. Including means.

【００４１】離散フーリエ変換という、効率的な演算を
行うことにより波数成分の相対的な量を求めることがで
きる。したがって、請求項１３または請求項１４に記載
の発明の効果に加えて、高速かつ効率的に特徴点を抽出
できる。The relative amount of wave number components can be obtained by performing an efficient operation called discrete Fourier transform. Therefore, in addition to the effect of the invention described in claim 13 or 14, feature points can be extracted quickly and efficiently.

【００４２】請求項１６に記載の発明にかかる画像特徴
点抽出装置は、請求項１３〜請求項１５のいずれかに記
載の発明の構成に加えて、特定の波数成分は波数２の成
分である。In the image feature point extraction device according to the invention of claim 16, in addition to the configuration of the invention of any one of claims 13 to 15, the specific wave number component is a component of wave number 2. .

【００４３】たとえば明度でいうと「明、暗、明、暗」
のように「明と暗」の繰返しが２回出現するような画素
の値の分布がある場合には、その周上での画素の値の分
布が理想的に正弦カーブに近ければ近いほど、波数１の
成分の相対的な量は小さくなり、波数２の成分の相対的
な量は大きくなる。したがってその比が所定のしいき値
よりも小さいほど、その点が上記した特徴を持つ特徴点
である可能性が高い。したがって、請求項１３〜請求項
１５に記載の発明の効果に加えて、周囲に２回の繰返し
がある画素の値の分布を有する特徴点をより確実に抽出
することができる。For example, in terms of brightness, "bright, dark, bright, dark"
When there is a distribution of pixel values such that the repetition of “bright and dark” appears twice, the closer the distribution of pixel values on that circumference is to a sine curve, the closer The relative amount of the wave number 1 component becomes small, and the relative amount of the wave number 2 component becomes large. Therefore, the smaller the ratio is than the predetermined threshold value, the higher the possibility that the point is the feature point having the above-mentioned features. Therefore, in addition to the effects of the invention described in claims 13 to 15, it is possible to more reliably extract a feature point having a distribution of pixel values having two repetitions in the surroundings.

【００４４】請求項１７に記載の発明にかかる画像特徴
点抽出装置は、請求項１６に記載の発明の構成に加えて
さらに、特徴点として抽出するための手段によって抽出
された複数個の特徴点の候補のうち、当該候補点を囲む
閉曲線上の各画素の値の分布曲線の、波数１の成分のス
ペクトルパワーと波数２の成分のスペクトルパワーとの
比が所定のしきい値より小さいもののみを特徴点として
選択するための手段を含む。An image feature point extracting device according to the invention of claim 17 is the image feature point extracting device according to claim 16, further comprising a plurality of feature points extracted by means for extracting feature points. Among the candidates of the above, only those in which the ratio of the spectral power of the component of wave number 1 to the spectral power of the component of wave number 2 of the distribution curve of the value of each pixel on the closed curve surrounding the candidate point is smaller than a predetermined threshold Means for selecting as a feature point.

【００４５】たとえば明度でいうと「明、暗、明、暗」
のように「明と暗」の繰返しが２回出現するような画素
の値の分布がある場合には、その周上での画素の値の分
布が理想的に正弦カーブに近ければ近いほど、波数１の
成分の相対的な量は小さくなり、波数２の成分の相対的
な量は大きくなる。したがってその比が所定のしいき値
よりも小さいほど、その点が上記した特徴を持つ特徴点
である可能性が高い。したがって、請求項１６に記載の
発明の効果に加えて、周囲に２回の繰返しがある画素の
値の分布を有する特徴点をより確実に抽出することがで
きる。For example, in terms of brightness, "bright, dark, bright, dark"
When there is a distribution of pixel values such that the repetition of “bright and dark” appears twice, the closer the distribution of pixel values on that circumference is to a sine curve, the closer The relative amount of the wave number 1 component becomes small, and the relative amount of the wave number 2 component becomes large. Therefore, the smaller the ratio is than the predetermined threshold value, the higher the possibility that the point is the feature point having the above-mentioned features. Therefore, in addition to the effect of the invention described in claim 16, it is possible to more reliably extract a feature point having a distribution of pixel values having two repetitions in the periphery.

【００４６】請求項１８に記載の発明にかかる画像特徴
点抽出装置は、請求項１３に記載の発明の構成に加え
て、各画素の値は各画素の明度である。 The image feature point extraction device according to the invention of claim 18, in addition to the configuration of the invention according to claim 1 3, the value of each pixel Ru lightness der of each pixel.

【００４７】人間の顔領域において眉間は、周囲に明、
暗の繰返しが所定回数表れる。したがって、請求項１３
に記載の発明の効果に加えて、人間の顔面の特徴点であ
る眉間を、その周囲の画素の明度の値の分布の波数成分
に基づいて抽出できる。In the human face area , the space between the eyebrows is bright in the surroundings,
Dark iteration Ru predetermined number appears. Accordingly, claims 1 to 3
In addition to the effect, the feature point der human face in the embodiment described in
The eyebrow can be extracted based on the wave number component of the distribution of the brightness values of the surrounding pixels.

【００４８】[0048]

【発明の実施の形態】［ハードウェア構成］以下、本発
明の実施の形態１にかかる画像特徴点抽出装置について
説明する。この画像特徴点抽出装置は、パーソナルコン
ピュータまたはワークステーション等、コンピュータ上
で実行されるソフトウェアにより実現されるものであっ
て、人物の顔の映像から、目の位置を検出するためのも
のである。図１に、この画像特徴点抽出装置の外観を示
す。BEST MODE FOR CARRYING OUT THE INVENTION [Hardware Configuration] An image feature point extracting apparatus according to a first embodiment of the present invention will be described below. This image feature point extraction device is realized by software executed on a computer such as a personal computer or a workstation, and is for detecting the position of eyes from the image of a person's face. FIG. 1 shows the appearance of this image feature point extraction device.

【００４９】図１を参照してこのシステム２０は、ＣＤ
−ＲＯＭ（Compact Disc Read-OnlyMemory ）ドライブ
５０およびＦＤ（Flexible Disk ）ドライブ５２を備え
たコンピュータ本体４０と、コンピュータ本体４０に接
続された表示装置としてのディスプレイ４２と、同じく
コンピュータ本体４０に接続された入力装置としてのキ
ーボード４６およびマウス４８と、コンピュータ本体４
０に接続された、画像を取込むためのカメラ３０とを含
む。この実施の形態の装置では、カメラ３０としてはＣ
ＣＤ（固体撮像素子）を含むビデオカメラを用い、カメ
ラ３０の前にいてこのシステム２０を操作する人物の目
の位置を検出する処理を行うものとする。Referring to FIG. 1, this system 20 includes a CD
A computer main body 40 having a ROM (Compact Disc Read-Only Memory) drive 50 and an FD (Flexible Disk) drive 52; a display 42 as a display device connected to the computer main body 40; A keyboard 46 and a mouse 48 as input devices, and a computer main body 4
Camera 30 for capturing images, which is connected to 0. In the apparatus of this embodiment, the camera 30 is C
It is assumed that a video camera including a CD (solid-state image sensor) is used to perform processing for detecting the position of the eyes of a person who operates the system 20 in front of the camera 30.

【００５０】図２に、このシステム２０の構成をブロッ
ク図形式で示す。図３に示されるようにこのシステム２
０を構成するコンピュータ本体４０は、ＣＤ−ＲＯＭド
ライブ５０およびＦＤドライブ５２に加えて、それぞれ
バス６６に接続されたＣＰＵ（Central Processing Uni
t ）５６と、ＲＯＭ（Read Only Memory) ５８と、RAM
（Random Access Memory）６０と、ハードディスク５４
と、カメラ３０からの画像を取込むための画像取込装置
６８とを含んでいる。ＣＤ−ＲＯＭドライブ５０にはＣ
Ｄ−ＲＯＭ６２が装着される。ＦＤドライブ５２にはＦ
Ｄ６４が装着される。FIG. 2 shows the configuration of the system 20 in a block diagram form. This system 2 as shown in FIG.
In addition to the CD-ROM drive 50 and the FD drive 52, the computer main body 40 constituting the CPU 0 has a CPU (Central Processing Uni) connected to the bus 66.
t) 56, ROM (Read Only Memory) 58, RAM
(Random Access Memory) 60 and hard disk 54
And an image capture device 68 for capturing an image from the camera 30. CD-ROM drive 50 has C
The D-ROM 62 is mounted. F for FD drive 52
D64 is installed.

【００５１】既に述べたようにこの画像特徴点抽出装置
の主要部は、コンピュータハードウェアと、ＣＰＵ５６
により実行されるソフトウェアとにより実現される。一
般的にこうしたソフトウェアはＣＤ−ＲＯＭ６２、ＦＤ
６４等の記憶媒体に格納されて流通し、ＣＤ−ＲＯＭド
ライブ５０またはＦＤドライブ５２等により記憶媒体か
ら読取られてハードディスク５４に一旦格納される。ま
たは、当該装置がネットワークに接続されている場合に
は、ネットワーク上のサーバから一旦ハードディスク５
４にコピーされる。そうしてさらにハードディスク５４
からＲＡＭ６０に読出されてＣＰＵ５６により実行され
る。なお、ネットワーク接続されている場合には、ハー
ドディスク５４に格納することなくＲＡＭ６０に直接ロ
ードして実行するようにしてもよい。As described above, the main part of the image feature point extracting apparatus is the computer hardware and the CPU 56.
It is realized by software executed by. Generally, such software is CD-ROM 62, FD
It is stored in a storage medium such as 64 and distributed, is read from the storage medium by the CD-ROM drive 50 or the FD drive 52, and is temporarily stored in the hard disk 54. Alternatively, when the device is connected to the network, the hard disk 5 is temporarily sent from the server on the network.
4 is copied. Then further hard disk 54
Is read from the RAM 60 to the RAM 60 and executed by the CPU 56. When connected to the network, it may be directly loaded into the RAM 60 and executed without being stored in the hard disk 54.

【００５２】図１および図２に示したコンピュータのハ
ードウェア自体およびその動作原理は一般的なものであ
る。したがって、本発明の最も本質的な部分はＦＤドラ
イブ５２、ＦＤ６４、ハードディスク５４等の記憶媒体
に記憶されたソフトウェアである。The hardware itself and the operating principle of the computer shown in FIGS. 1 and 2 are general. Therefore, the most essential part of the present invention is the software stored in the storage medium such as the FD drive 52, the FD 64, and the hard disk 54.

【００５３】なお、最近の一般的傾向として、コンピュ
ータのオペレーティングシステムの一部として様々なプ
ログラムモジュールを用意しておき、アプリケーション
プログラムはこれらモジュールを所定の配列で必要な時
に呼び出して処理を進める方式が一般的である。そうし
た場合、当該画像特徴点抽出装置を実現するためのソフ
トウェア自体にはそうしたモジュールは含まれず、当該
コンピュータでオペレーティングシステムと協働しては
じめて画像特徴点抽出装置が実現することになる。しか
し、一般的なプラットフォームを使用する限り、そうし
たモジュールを含ませたソフトウェアを流通させる必要
はなく、それらモジュールを含まないソフトウェア自体
およびそれらソフトウェアを記録した記録媒体（および
それらソフトウェアがネットワーク上を流通する場合の
データ信号）が実施の形態を構成すると考えることがで
きる。［基本的原理］図３を参照して、本実施の形態の装置で
は、人物の顔の中で、両目の間に位置する眉間（以下の
説明では両眼の中心を結んだ線分の中央の位置を「眉
間」と呼ぶこととする。）に着目する。眉間を以下「Ｂ
ＥＰ」（Ｂｅｔｗｅｅｎ−Ｅｙｅｓ−Ｐｏｉｎｔ）と呼
ぶ。As a recent general tendency, various program modules are prepared as a part of the operating system of a computer, and the application program calls these modules in a predetermined arrangement when necessary and advances the processing. It is common. In such a case, such a module is not included in the software itself for realizing the image feature point extraction device, and the image feature point extraction device is realized only when the computer cooperates with the operating system. However, as long as a general platform is used, it is not necessary to distribute software that includes such modules, and the software itself that does not include those modules and a recording medium that records these software (and those software are distributed on the network). The data signal in the case) can be considered to constitute an embodiment. [Basic Principle] With reference to FIG. 3, in the device of the present embodiment, in the face of a person, between the eyebrows located between the eyes (in the following description, the center of a line segment connecting the centers of both eyes). The position of is called "between the eyebrows".) Below the eyebrows, "B
EP "(Between-Eyes-Point).

【００５４】図３（ａ）に示すように、人物の顔画像に
おいて、眉間を中心としてある半径の円を描き、その円
周に沿った各画素の明度を調べる。するとその結果は概
略図３（ｂ）に示したようになる。図３（ｂ）におい
て、横軸は円周方向の各画素の位置、縦軸は各画素の明
度をそれぞれ示す。なお、図３（ａ）に示される円周の
最も上部を図３（ｂ）の横軸の原点とし、図３（ａ）の
円周を逆時計回り方向に回る方向に沿って図３（ｂ）の
グラフでは横軸に各画素を並べるものとする。As shown in FIG. 3A, a circle with a certain radius is drawn around the eyebrows in the face image of a person, and the brightness of each pixel along the circumference is checked. Then, the result is as schematically shown in FIG. 3 (b). In FIG. 3B, the horizontal axis represents the position of each pixel in the circumferential direction, and the vertical axis represents the brightness of each pixel. It should be noted that the uppermost part of the circumference shown in FIG. 3A is the origin of the horizontal axis in FIG. 3B, and the circumference of FIG. In the graph of b), each pixel is arranged on the horizontal axis.

【００５５】図３（ｂ）を参照すると、このグラフは、
頂点、谷、頂点、谷という「頂点と谷」の２回の繰り返
しを形成していることが分かる。この意味は、図３
（ａ）に示される顔画像を参照すると明らかである。す
なわち、人間の顔画像では、眉間を中心として上記した
円周上をたどると、最初は額（明度が高い）、次に右目
（明度が低い）、次に鼻（明度が高い）、次に左目（明
度が低い）、最後に最初の額（明度が高い）というよう
に、明るい部分と暗い部分とが交互に２回繰返されてい
る。顔画像の中では、こうした特徴を最も示すのは眉間
であり、他の部分はこうした特徴はあまり示さず、あっ
ても眉間より小さい。Referring to FIG. 3B, this graph is
It can be seen that the “vertices and valleys” are repeated twice, that is, the apex, the valley, the apex, and the valley. This meaning is shown in Figure 3.
It is obvious with reference to the face image shown in (a). In other words, in a human face image, if the human eye image is traced on the above-mentioned circumference centering on the eyebrow, first the forehead (high brightness), then the right eye (low brightness), then the nose (high brightness), then the A bright part and a dark part are alternately repeated twice, such as the left eye (low brightness) and finally the first frame (high brightness). In the face image, these features are most shown in the eyebrows, and other parts do not show such features much, and even if they are, they are smaller than the eyebrows.

【００５６】そこで本実施の形態の装置では、眉間の周
囲にはこうした明度の分布が存在すると想定して以下に
「リングＤＦＴ（離散フーリエ変換）フィルタ」と呼ぶ
フィルタによるフィルタリングを行うことによって最初
にＢＥＰを検出し、しかる後にそのＢＥＰを基準として
眉間の両側の目の位置を検出する。なお、実施の形態１
で用いられるリングＤＦＴ変換については後述する。Therefore, in the apparatus according to the present embodiment, it is assumed that such a lightness distribution exists around the eyebrows, and filtering is first performed by a filter called "ring DFT (discrete Fourier transform) filter" below. The BEP is detected, and thereafter, the positions of the eyes on both sides of the brow are detected based on the BEP. The first embodiment
The ring DFT transform used in the above will be described later.

【００５７】本実施の形態の装置は、以下のような制御
構造を有するソフトウェアを用いて目の位置を検出しト
ラッキングする。The apparatus according to the present embodiment detects and tracks the eye position using software having the following control structure.

【００５８】図４を参照して、まずイメージを取得する
（ステップ８０）。ここでは、図１および図２に示した
カメラ３０から得た１フレームの画像を画像取込装置６
８でデジタル変換して画像取込装置６８内の画像メモリ
に格納し、この画像に対して以下のような処理をするも
のとする。連続的に処理する場合、カメラ３０から得た
画像の各フレームに対して以下の処理を繰返す。Referring to FIG. 4, an image is first acquired (step 80). Here, one frame image obtained from the camera 30 shown in FIG. 1 and FIG.
It is assumed that the image is digitally converted at 8 and stored in the image memory in the image capturing device 68, and the following processing is performed on this image. In the case of continuous processing, the following processing is repeated for each frame of the image obtained from the camera 30.

【００５９】すなわち、ステップ８２で、１フレーム分
の画像データのうちから、前述したリングＤＦＴフィル
タを用いて眉間の候補点を抽出する。この処理について
は図５を参照して後述する。That is, at step 82, candidate points between the eyebrows are extracted from the image data for one frame by using the ring DFT filter described above. This processing will be described later with reference to FIG.

【００６０】次に、装置の動作モードが探索モードとト
ラッキングモードとのいずれであるかを判定する（ステ
ップ８４）。探索モードとは、目の位置をトラッキング
（追跡）する処理の前提として、目の位置を最初に決定
するためのモードである。トラッキングモードとは、こ
うして一旦決定された目の位置の近傍のみ処理すること
によって目の位置の変動に追従して目の位置を常に把握
しておくための動作モードである。この処理の最初には
まだ目の位置が定まっていないので、動作モードは探索
モードに設定される。したがって以下ではまず探索モー
ドでの処理について説明し、その後にトラッキングモー
ドでの処理について説明する。Next, it is determined whether the operation mode of the apparatus is the search mode or the tracking mode (step 84). The search mode is a mode for first determining the eye position as a premise of the processing for tracking the eye position. The tracking mode is an operation mode for always grasping the eye position by following changes in the eye position by processing only the vicinity of the eye position once determined in this way. At the beginning of this process, the eye position has not been determined yet, so the operation mode is set to the search mode. Therefore, hereinafter, the processing in the search mode will be described first, and then the processing in the tracking mode will be described.

【００６１】探索モードでは、ステップ８２で抽出され
た眉間の候補点（一般的には複数個である）のうちで、
その両側の対称な位置に２ケ所だけ暗い領域（目に対
応）があるという条件を満足するものを探す。眉間の候
補点のうち、この条件を充足しないものはここで不採用
となる。In the search mode, among the candidate points (generally a plurality) between the eyebrows extracted in step 82,
Look for one that satisfies the condition that there are two dark areas (corresponding to the eyes) at symmetrical positions on both sides. Among the eyebrow candidate points, those that do not satisfy this condition are not adopted here.

【００６２】ステップ８８で、ステップ８６の処理の結
果得られた目が一対のみか否か（すなわち眉間がただ１
点のみ検出されたか否か）が判定される。一対のみ得ら
れた場合には、以後の動作モードをトラッキングモード
にする処理が行われ（ステップ９０）、その目の近傍の
領域をテンプレートとして保存する（ステップ９８）。
そして、決定された眉間（または目）の位置を出力し
（ステップ１００）処理はステップ８０に戻る。以後新
たなフレームデータに対して上述した処理を繰返す。た
だし以後の動作モードはトラッキングモードとなる。In step 88, it is determined whether or not the eyes obtained as a result of the processing in step 86 are only one pair (that is, the eyebrows are only 1).
It is determined whether or not only points are detected. When only one pair has been obtained, the process for changing the operation mode thereafter to the tracking mode is performed (step 90), and the region near the eyes is saved as a template (step 98).
Then, the position of the determined eyebrow (or eye) is output (step 100), and the process returns to step 80. After that, the above processing is repeated for new frame data. However, the operation mode thereafter is the tracking mode.

【００６３】一方、ステップ８８の処理で一対を越える
数の目が検出された場合には処理はステップ８０にもど
り、以後新たなフレームに対して上述した処理を繰返
す。ただしこの場合には動作モードは依然として探索モ
ードである。On the other hand, if more than one pair of eyes are detected in the process of step 88, the process returns to step 80, and the above process is repeated for a new frame. However, in this case, the operation mode is still the search mode.

【００６４】一旦処理がトラッキングモードとなると、
ステップ８４の判定の結果制御はステップ９２に進む。
なおこれに先立つステップ８２の処理では、その直前に
得られたＢＥＰの近傍のみを処理対象とすることで、候
補点の抽出処理と、以後の処理とをいずれも高速化でき
る。ステップ９２では、ステップ９８で保存されたテン
プレート（前フレームでのＢＥＰ）と、次のフレームで
のＢＥＰの候補点の全てとの間でマッチングがとられ
る。そして、最もよい一致を与えたものが唯一のＢＥＰ
の候補点となる。Once the processing is in the tracking mode,
As a result of the determination in step 84, the control proceeds to step 92.
In the process of step 82 preceding this, only the vicinity of the BEP obtained immediately before that is targeted for processing, whereby both the extraction process of candidate points and the subsequent processes can be speeded up. In step 92, the template (BEP in the previous frame) stored in step 98 is matched with all the BEP candidate points in the next frame. And the one that gave the best match is the only BEP
Will be the candidate point.

【００６５】ステップ９４では、ステップ９２で最もよ
い一致を与えたＢＥＰの候補点とテンプレートとの相違
が所定のしきい値以上であるか否かが判定される。仮に
最もよい一致を与えたとしても、直前フレームにおける
ＢＥＰとの相違があまりに大きい場合にはその候補点を
ＢＥＰとして決定することは不適当なことが多い。この
ステップでの処理はそうした不適当なＢＥＰを排除する
ための処理である。In step 94, it is judged whether or not the difference between the BEP candidate point that gave the best match in step 92 and the template is equal to or more than a predetermined threshold value. Even if the best match is given, it is often inappropriate to determine the candidate point as the BEP when the difference from the BEP in the immediately preceding frame is too large. The process in this step is a process for eliminating such an inappropriate BEP.

【００６６】ステップ９４で相違が所定のしきい値を越
える（または所定のしきい値以上である）と判定されれ
ば、ステップ９６で動作モードを探索モードに設定し処
理はステップ８６に進む。以後の処理については探索モ
ードに関して既に説明した通りである。If it is determined in step 94 that the difference exceeds the predetermined threshold value (or is greater than or equal to the predetermined threshold value), the operation mode is set to the search mode in step 96, and the process proceeds to step 86. The subsequent processing is as already described for the search mode.

【００６７】ステップ９４で相違が所定のしきい値以下
である（または所定のしきい値未満である）と判定され
れば、この候補点が前フレームから引続いてトラッキン
グされているＢＥＰであると決定され、このＢＥＰに応
じた新たなテンプレートが保存される（ステップ９
８）。そしてこのＢＥＰまたはその両側の目の位置を出
力し（ステップ１００）制御はステップ８０に戻る。以
後の処理については既に説明したとおりである。［リン
グＤＦＴフィルタを用いた候補点の抽出］さて、ステッ
プ８２で行われる眉間の候補点の抽出において、前述し
たリングＤＦＴフィルタが用いられている。以後ステッ
プ８２の処理について図５を参照して説明する。If it is determined in step 94 that the difference is less than or equal to the predetermined threshold value (or less than the predetermined threshold value), this candidate point is the BEP continuously tracked from the previous frame. Then, a new template corresponding to this BEP is saved (step 9).
8). Then, this BEP or the positions of the eyes on both sides thereof are output (step 100), and the control returns to step 80. The subsequent processing is as already described. [Extraction of Candidate Points Using Ring DFT Filter] In the extraction of the eyebrow candidate points performed in step 82, the ring DFT filter described above is used. Hereinafter, the process of step 82 will be described with reference to FIG.

【００６８】まず、ステップ１１０で、図６（ａ）に示
されるような処理対象の画像の平滑化および縦横方向の
１／２縮小処理が行われ、図６（ｂ）に示されるような
画像を得る。実験ではたとえば対象点となる画素の周囲
の５×５個の画素の明度を平均化してその画素の明度と
し、このとき対象点の選択により画像の縮小も同時に行
った。平滑化は、画像に含まれているノイズ（比較的高
周波成分が多い）を除去するためのものである。特に、
人間のＢＥＰの検出では、後述するように波数２のスペ
クトルパワー成分を計算するので、この平滑化によっ
て、以後の処理で必要とされる情報が削除されるおそれ
はない。またこの処理で画素数を１／４に削減すること
により、処理の高速化を図ることができる。ただし、十
分高速なプロセッサを用いた場合にはあえて画素数を削
減する必要はないかも知れない。また、より低速なプロ
セッサを用いる場合には、より小さな画像に縮小する
（画素数を少なくする）ことが必要となろう。ただしあ
まりに画像を縮小すると解像度が下がる結果、ＢＥＰの
検出の精度が低下するおそれがあるので、実験により適
当な解像度を選択することも有用である。First, at step 110, the image to be processed as shown in FIG. 6A is subjected to smoothing and vertical / horizontal ½ reduction processing, and the image as shown in FIG. 6B is obtained. To get In the experiment, for example, the brightness of 5 × 5 pixels around the target pixel was averaged to obtain the brightness of the pixel, and the image was reduced at the same time by selecting the target point. The smoothing is for removing noise (relatively high frequency components are included) included in the image. In particular,
In the detection of human BEP, since the spectrum power component of wave number 2 is calculated as described later, there is no possibility that this smoothing will delete information necessary for the subsequent processing. Further, by reducing the number of pixels to 1/4 in this process, it is possible to speed up the process. However, it may not be necessary to reduce the number of pixels if a sufficiently fast processor is used. Also, if a slower processor is used, it may be necessary to reduce (reduce the number of pixels) to a smaller image. However, if the image is reduced too much, the resolution will decrease, and as a result, the accuracy of BEP detection may decrease, so it is also useful to select an appropriate resolution by experiments.

【００６９】続いて、こうして得られた画像から、対象
となる人物の頭部領域を推定する処理が行われる（ステ
ップ１１２）。この処理には、前述したとおりカラー情
報を用い、肌色の領域を追跡するアルゴリズムを用いた
り、前フレームと現フレームとの差分から、２フレーム
間で移動したと思われる領域を抽出しこれを頭部領域と
推定するアルゴリズムを用いたりすることができる。本
実施の形態では、フレーム間の差分を用いる。またここ
で推定される領域はどのような形状の領域でもよいが、
領域計算の簡便さを考えると矩形領域が適切である。た
だし、条件によっては別の形状を用いた方が効率のよい
場合もあるであろう。なお、頭部がほとんど移動してい
ない場合にはフレーム間差分が得られない。その場合に
は、頭部が移動していないと想定して直前の処理で推定
された頭部領域を使用する。Subsequently, a process of estimating the head region of the target person from the image thus obtained is performed (step 112). For this processing, color information is used as described above, an algorithm for tracking the skin color area is used, or the area that seems to have moved between two frames is extracted from the difference between the previous frame and the current frame, and this is extracted. It is also possible to use an algorithm for estimating a partial area. In the present embodiment, the difference between frames is used. Moreover, the region estimated here may be a region of any shape,
Considering the simplicity of area calculation, a rectangular area is suitable. However, depending on the conditions, it may be more efficient to use another shape. In addition, when the head is hardly moved, the inter-frame difference cannot be obtained. In that case, the head region estimated in the immediately preceding process is used assuming that the head has not moved.

【００７０】次に、こうして得られた頭部領域の範囲内
で、リングＤＦＴフィルタを用いたフィルタリングを行
う。具体的には、たとえば頭部領域の左上の画素から順
に、その画素を中心とする、図７に示されるような所定
の大きさの円上の周上の画素に対して以下の計算を行
う。Next, filtering using a ring DFT filter is performed within the range of the head region thus obtained. Specifically, for example, the following calculation is performed in order from a pixel on the upper left of the head region, with respect to a pixel on the circumference of a circle having a predetermined size and having the pixel at the center as a center. .

【００７１】[0071]

【数１】 [Equation 1]

【００７２】ただしこの式で、Ｎは円周上の点の数であ
り、ｋはこの円周上の点のうち最も上にある（「北極」
に相当する位置）点を０として、逆時計周りに順に各点
にふられた番号である。またｆｋ（ｋ＝０、…、Ｎ−
１）は円周上のｋ番目の画素の明度で、ｉは虚数単位で
ある。この式（１）は、次に示す一般的な離散フーリエ
変換によって求められるＤＦＴ係数のｎ＝２の場合であ
る。In this equation, N is the number of points on the circumference, and k is the highest point on the circumference ("North Pole").
The position is a number corresponding to each point in a counterclockwise direction, where 0 is the point corresponding to. Also, fk (k = 0, ..., N−
1) is the lightness of the k-th pixel on the circumference, and i is the imaginary unit. This formula (1) is for the case where n = 2 of the DFT coefficient obtained by the following general discrete Fourier transform.

【００７３】[0073]

【数２】 [Equation 2]

【００７４】式（１）に示す変換により、上記した円周
上の明度の変動波形（図３（ｂ）参照）に含まれる波数
２のスペクトルパワー成分が計算される。本実施の形態
では円の半径を７画素、Ｎ＝３６として計算した。な
お、人物とカメラとの間の距離に応じて顔領域の大きさ
は変化するから、そうした距離の変化量が多いと考えら
れる場合には、既に得られた顔領域の概略の大きさに合
わせて円の半径を変化させることでより精度が高くな
る。ただし、人物がほとんどそうした移動を行わないこ
とがわかっていれば、半径をあらかじめ固定しておいて
もよい。By the conversion shown in the equation (1), the spectrum power component of the wave number 2 included in the above-mentioned fluctuation waveform of the lightness on the circumference (see FIG. 3B) is calculated. In the present embodiment, the radius of the circle is 7 pixels and N = 36. Since the size of the face area changes according to the distance between the person and the camera, if it is considered that there is a large amount of change in such distance, the face area size should be adjusted to the approximate size of the face area already obtained. The accuracy is improved by changing the radius of the circle. However, the radius may be fixed in advance if it is known that the person hardly makes such a movement.

【００７５】この計算により、頭部領域のすべての画素
について、その画素を中心とする円周上の波数２のスペ
クトルパワーの値が計算される。その値に応じた明度を
各画素に割り振って新たな画像としたものの例を図８
（ｂ）に示す。図８（ｂ）を参照すればわかるように、
各画素に対して上記した計算を行った結果得られる値の
分布中には、特に値の高い部分が存在する。それら部分
は、その周囲の円周上に上記したような波数２の波数成
分が多く存在するものと考えられる。したがってそれら
はＢＥＰとしての候補点としての資格を備えている。こ
のように、画像上の各対象点を中心とする、典型的には
円となる閉曲線上をたどり、その上での画素の値（明度
に限らず色相、彩度等をも含みうる）に対してＤＦＴを
行うことにより図８（ｂ）のような情報を得ることを本
願発明では「リングＤＦＴフィルタによるフィルタリン
グ」と呼んでいる。By this calculation, the value of the spectral power of wave number 2 on the circumference of the center of the pixel is calculated for all the pixels in the head region. An example of a new image in which the brightness according to the value is assigned to each pixel is shown in FIG.
It shows in (b). As can be seen by referring to FIG.
In the distribution of the values obtained as a result of performing the above-mentioned calculation for each pixel, there is a particularly high value portion. It is considered that those portions have many wave number components of wave number 2 as described above on the circumference of the circumference. Therefore, they qualify as candidate points for BEP. In this way, a closed curve, which is typically a circle, centered on each target point on the image is traced, and the pixel value (including not only the lightness but also the hue, saturation, etc.) can be calculated on the closed curve. Obtaining information as shown in FIG. 8B by performing DFT on the other hand is called “filtering by a ring DFT filter” in the present invention.

【００７６】こうして得られた図８（ｂ）に示されるよ
うな値の分布から、局所的な最大値を示す点を選んでＢ
ＥＰの候補点とする（ステップ１１６）。検出された候
補点の例を図８（ａ）に示す。この例では２７個の候補
点が検出されている。ここで注意すべきなのは、この２
７個の中に真のＢＥＰが含まれているということであ
る。前述したとおり真のＢＥＰの周囲にはほぼまちがい
なく明、暗、明、暗という領域の分布がある。したがっ
てステップ１１４の処理の結果、ほぼ間違いなく真のＢ
ＥＰは局所的な最大値を示し、その結果ステップ１１６
でほぼ例外なく候補点として抽出される。このようにロ
バストに、ほぼ確実に真のＢＥＰが抽出されるのがこの
方式の特徴である。なお、選択のためのしきい値は、対
象となる画像の持つべき特徴に応じて主として経験的に
定められる。From the distribution of values thus obtained as shown in FIG. 8B, the point showing the local maximum value is selected and B
It is set as a candidate point for EP (step 116). An example of the detected candidate points is shown in FIG. In this example, 27 candidate points are detected. The important thing to note here is this 2
That is, the true BEP is included in the seven. As described above, there is almost no difference around the true BEP, and there is a distribution of regions of bright, dark, bright, and dark. Therefore, as a result of the processing in step 114, it is almost certain that the true B
EP indicates a local maximum and, as a result, step 116.
Is extracted as a candidate point almost without exception. As described above, it is a characteristic of this method that the true BEP is almost certainly extracted. It should be noted that the threshold value for selection is empirically determined mainly according to the characteristics that the target image should have.

【００７７】続いてステップ１１８で、複数個の局所最
大値のうちで、ＢＥＰに特徴的な局所特徴を考慮して、
ＢＥＰの候補を絞る処理が行われる。Then, in step 118, among the plurality of local maximum values, the local feature characteristic of BEP is considered,
A process of narrowing down BEP candidates is performed.

【００７８】たとえば実際のＢＥＰでは、その上（額）
と下（鼻）とに明るい領域があり、その左右（両眼）に
暗い領域が存在するはずである。したがって式（１）の
計算結果は必ず実部が正となるはずである。正でない実
部を生ずるものはＢＥＰではなく、候補から除外され
る。For example, in the actual BEP, in addition (amount)
There should be a bright area at the bottom and the bottom (nose), and dark areas at the left and right (both eyes). Therefore, the real part of the calculation result of equation (1) must be positive. Those that produce a non-positive real part are not BEPs and are excluded from the candidates.

【００７９】また、同じ理由から、真のＢＥＰを中心と
した画像を縦方向および横方向に投影した画像を考える
と、次のようなことがいえる。図９を参照して、明暗の
分布は、（ａ）に示すように上下方向では中央が最も暗
く、左右方向では中央は最も明るい。またその分布は中
央を中心としてほぼ対称となるべきである。そこで、複
数個の候補点が存在する場合、これと同様の縦方向およ
び横方向の投影を作成し、上記した条件に合致しないも
のを不採用とする。For the same reason, considering an image obtained by projecting an image centered on the true BEP in the vertical and horizontal directions, the following can be said. Referring to FIG. 9, the distribution of light and dark is darkest in the center in the vertical direction and brightest in the center in the horizontal direction as shown in (a). Also, its distribution should be almost symmetrical about the center. Therefore, when there are a plurality of candidate points, the same projections in the vertical direction and the horizontal direction are created, and the ones that do not meet the above conditions are not adopted.

【００８０】また別の基準として、ＢＥＰ候補点を中心
とする小領域の明るさの重心を計算し、その重心とＢＥ
Ｐ候補点との距離がしきい値を越えていればそのＢＥＰ
候補点を除外する。As another criterion, the center of gravity of the brightness of a small area centered on the BEP candidate point is calculated, and the center of gravity and BE are calculated.
If the distance to the P candidate point exceeds the threshold value, the BEP
Exclude candidate points.

【００８１】さらに、リングＤＦＴフィルタの以下のよ
うな特徴を用いて候補点を絞ることができる。すなわ
ち、上記したＦｎの一般式（式（２））において、ｎ＝
１として各画素でＦ１を計算する。そして、各画素で得
られたＦ２との比（Ｆ１／Ｆ２）を計算し、この値が小
さいほど真のＢＥＰである確率が高い、という基準を用
いる。この値は、次の理由により、ある画素を中心とす
る円上の明暗の分布が理想的な制限カーブにどの程度合
致しているか（どの程度離反しているか）を示す基準と
なりうると考えられる。Further, the candidate points can be narrowed down by using the following features of the ring DFT filter. That is, in the above general formula of Fn (formula (2)), n =
F1 is calculated for each pixel as 1. Then, the ratio (F1 / F2) with F2 obtained for each pixel is calculated, and the standard that the smaller this value is, the higher the probability of true BEP is used. It is considered that this value can be used as a reference to show how much the distribution of light and darkness on a circle centered on a pixel matches the ideal limit curve (how far they are separated) for the following reasons. .

【００８２】式（２）でｎ＝１、２、…として計算され
た値はそれぞれ、円周上において波数が１、２、…であ
る波数成分のスペクトルパワーを示す。もし円周上の明
暗の分布が理想的にｎ＝２のときの正弦カーブと一致し
ているときには、２以外のｎに対してＦｎ＝０となる筈
である。もちろん、実際には明暗の分布が正弦カーブと
一致することはないが、それでも理想的な正弦カーブに
近ければＦ１は小さな値となり、Ｆ２は相対的に大きな
値となるであろう。そこで、上記したＦ１／Ｆ２が小さ
ければ、対象画素の周囲の明暗分布は実際のＢＥＰの周
りの明暗分布に近く、大きければ遠い、と考えることが
できる。これがＦ１／Ｆ２が尺度として利用できる理由
である。The values calculated as n = 1, 2, ... In equation (2) indicate the spectral powers of the wave number components whose wave numbers are 1, 2 ,. If the distribution of light and darkness on the circumference ideally coincides with the sine curve when n = 2, Fn = 0 should be obtained for n other than 2. Of course, the distribution of light and darkness does not actually coincide with the sine curve, but if it is close to an ideal sine curve, F1 will have a small value and F2 will have a relatively large value. Therefore, it can be considered that if F1 / F2 is small, the light-dark distribution around the target pixel is close to the actual light-dark distribution around BEP, and if it is large, it is far away. This is why F1 / F2 can be used as a measure.

【００８３】なお、Ｆ３、Ｆ４等についてもＦ１と同様
に理想的な明暗の分布では０となるはずである。そこで
Ｆ３／Ｆ２、Ｆ４／Ｆ２等を基準とすることも考えられ
る。しかしこれらはより高い波数成分の量を示し、その
ためにノイズの影響を受けやすいので、Ｆ１／Ｆ２を用
いた場合よりも結果の信頼性は低い。It should be noted that F3, F4, etc. should be 0 in the ideal distribution of light and dark like F1. Therefore, it is also possible to use F3 / F2, F4 / F2, or the like as a reference. However, they show a higher amount of wavenumber components and are therefore more susceptible to noise, so the results are less reliable than with F1 / F2.

【００８４】以下、図４のステップ８６、８８によって
このＢＥＰが真のＢＥＰであるか否かが検定される。図
１０に、その結果を示す。この例では、ＢＥＰの候補と
して複数個が残った（「＊」の印）が、それらのうち目
と考えられる領域（「＋」の印）が両側に対称的に存在
しているのは中央の「×」印の点、すなわち真のＢＥＰ
のみである。こうして真のＢＥＰが決定される。Thereafter, steps 86 and 88 in FIG. 4 test whether this BEP is a true BEP or not. The result is shown in FIG. In this example, a plurality of BEP candidates remained (marked with “*”), but among them, regions that are considered to be eyes (marked with “+”) exist symmetrically on both sides in the center. "X" mark of the true BEP
Only. Thus, the true BEP is determined.

【００８５】以上のようにこの実施の形態のシステムで
は、リングＤＦＴフィルタを用いてＢＥＰの検出を行っ
た。リングＤＦＴフィルタは、画像のうちの明暗の分布
内に存在する波数成分のみからＢＥＰ等の特徴点の抽出
を行う。そのため、画像の全体的な明るさの変動による
影響を受けにくいという特徴がある。また、顔がやや傾
けられている場合にも、ある点の周囲の明るさの分布内
の波数成分は画像の回転に対し不変である。そのため上
記した手法を用いると、回転に強い特徴点の抽出を実現
することができる。これは顔をやや横に向けた場合も同
様である。両眼がギリギリで見える程度の顔を横に向け
たとしても、両眼が画像中に存在している限りは依然と
して上記した明暗の配置が眉間の周囲に存在しているの
で、上で説明した手法を用いてほぼ確実にＢＥＰを抽出
することができる。また、対象となる人物が目を閉じて
いても、依然としてその領域は額、鼻と比較して暗いた
め、上記した手法でほぼ確実にＢＥＰを検出することが
できる。したがって、高い信頼性でＢＥＰを、さらには
その両側の目の位置を検出することができる。As described above, in the system of this embodiment, BEP detection is performed using the ring DFT filter. The ring DFT filter extracts feature points such as BEP from only wave number components existing in the light and dark distribution of the image. Therefore, there is a feature that it is less likely to be affected by the fluctuation of the overall brightness of the image. Further, even when the face is slightly tilted, the wave number component in the brightness distribution around a certain point is invariant to the rotation of the image. Therefore, using the above-described method, it is possible to realize the extraction of the feature points that are resistant to rotation. This is also the case when the face is turned slightly sideways. Even if the eyes are turned sideways so that they are barely visible, as long as both eyes are present in the image, the light-dark arrangement described above still exists around the eyebrows, so it is explained above. BEP can be almost certainly extracted using the method. Further, even if the target person has his eyes closed, the area is still darker than the forehead and nose, so that the BEP can be detected almost certainly by the above method. Therefore, it is possible to detect the BEP with high reliability and further the positions of the eyes on both sides of the BEP.

【００８６】さらに、上記した特徴点の抽出方法および
装置が顔の特徴点の抽出のみに利用可能である、という
わけではないことは当業者には明白であろう。すでに説
明したように、上記したリングＤＦＴフィルタは、ある
画素を中心とする円上の明暗の分布の波数成分を計算す
ることにより、特定の明暗の分布と合致する点を特徴点
として抽出する。したがって、たとえば移動する物体
（商品、自動車、建築資材）等を自動的に認識し管理す
る場合において、その姿勢を認識するために、あらかじ
めそれら物体の特定箇所に特定のマークを付しておき、
上記した特徴点抽出を行うことにより、それら物体の位
置および姿勢等を認識することができる。このとき使用
されるマークは、ある点を中心としてその周囲に特定の
明暗（または色彩）の分布があるものを使用すればよ
い。また、このとき検出すべき波数成分は２には限定さ
れず、マークの形状（模様）内に存在する明暗の繰返し
数に応じた波数成分を検出するようにすればよい。Furthermore, it will be apparent to those skilled in the art that the above-described feature point extraction method and apparatus are not limited to use only for facial feature point extraction. As described above, the ring DFT filter described above extracts the wave number components of the distribution of light and darkness on a circle centered on a certain pixel, and extracts the points that match the particular distribution of light and darkness as feature points. Therefore, for example, in the case of automatically recognizing and managing moving objects (products, automobiles, building materials) and the like, in order to recognize the posture, specific marks are previously attached to specific positions of those objects,
By performing the above-described feature point extraction, it is possible to recognize the positions and orientations of those objects. As a mark used at this time, a mark having a specific light and dark (or color) distribution around a certain point may be used. Further, the wave number component to be detected at this time is not limited to 2, and the wave number component corresponding to the number of repetitions of light and dark existing in the shape (pattern) of the mark may be detected.

【００８７】またさらに、上記した例では各画素を中心
とする円周上の点についてＤＦＴ係数を計算した。しか
し本発明は、円周上の点についてのみ適用可能であると
いうわけではない。あらかじめ特徴点として抽出される
べき点と所定の位置関係にある閉曲線であって、その周
上の明暗のあるべき分布が分かっているのであれば、他
のどのような閉曲線上で上記した計算を行ってもよい。
もっとも、回転に対してロバストな結果を与えるのは円
のときであるから、円が最適となることが多いであろ
う。Furthermore, in the above example, the DFT coefficient was calculated for points on the circumference centered on each pixel. However, the invention is not only applicable to points on the circumference. If it is a closed curve that has a predetermined positional relationship with the points to be extracted as feature points in advance, and if the distribution of light and dark on its circumference is known, use the above calculation on any other closed curve. You can go.
However, since it is the case of a circle that gives a robust result to rotation, the circle will often be optimal.

【００８８】さらにまた、上記した実施の形態では各画
素を中心とする一つの円周上での明暗分布中の波数成分
を利用したが、使用される円の数が１に限定されるわけ
ではないことも当業者には明らかであろう。たとえば抽
出すべき特徴点の周囲で、中心から異なった距離の領域
では異なった明暗の分布が存在すべきことがあらかじめ
判明しているのであれば、それに応じて複数個の円周
（または閉曲線）上でそれぞれ上記した計算を行い、両
者の計算の結果を総合して特徴点を抽出してもよい。Furthermore, in the above-described embodiment, the wave number component in the light-dark distribution on one circumference centered on each pixel is used, but the number of circles used is not limited to one. It will also be apparent to those skilled in the art that none. For example, if it is known in advance that there should be different light and dark distributions around the feature point to be extracted in the regions at different distances from the center, then a plurality of circles (or closed curves) can be generated accordingly. The above-mentioned calculations may be performed respectively, and the feature points may be extracted by integrating the results of both calculations.

【００８９】また、上記した例では波数成分の計算のた
めにＤＦＴを使用した。ＤＦＴを用いることが最も効率
的だとは考えられるが、上記した例で必要な関数は、周
上の明暗分布内の波数成分を抽出することさえできれば
よい。したがって、使用できる手法はＤＦＴに限定され
るわけではなく、一般的なフーリエ変換を含め、波数成
分の抽出のための関数のいずれもが使用可能であること
もまた当業者には明白であろう。In the above example, the DFT is used to calculate the wave number component. Although it is considered that the use of DFT is the most efficient, the function required in the above example only needs to be able to extract the wave number component in the light-dark distribution on the circumference. Therefore, it will also be apparent to those skilled in the art that the technique that can be used is not limited to DFT, and that any function for extracting a wave number component can be used, including a general Fourier transform. .

【００９０】さらに、上記した実施の形態では、リング
ＤＦＴフィルタの処理対象は画素の明度であった。しか
し本発明の適用可能な対象はこれには限定されない。た
とえば各画素の色相、彩度等の値に対してリングＤＦＴ
フィルタによるフィルタリングを行ってもよい。また、
検出すべき特徴点のもつべき性質に応じて、各画素の明
度、色相、彩度などの値に対して所定の演算を施した値
をフィルタリングの対象とすることも考えられる。Further, in the above-mentioned embodiment, the processing target of the ring DFT filter is the brightness of the pixel. However, the applicable object of the present invention is not limited to this. For example, a ring DFT is applied to values such as hue and saturation of each pixel.
You may perform filtering by a filter. Also,
Depending on the property that the characteristic point to be detected should have, it is possible to use a value obtained by performing a predetermined operation on the value of each pixel such as lightness, hue, and saturation as a filtering target.

【００９１】今回開示された実施の形態はすべての点で
例示であって制限的なものではないと考えられるべきで
ある。本発明の範囲は上記した説明ではなくて特許請求
の範囲によって示され、特許請求の範囲と均等の意味お
よび範囲内でのすべての変更が含まれることが意図され
る。The embodiments disclosed this time are to be considered as illustrative in all points and not restrictive. The scope of the present invention is shown not by the above description but by the claims, and is intended to include meanings equivalent to the claims and all modifications within the scope.

[Brief description of drawings]

【図１】本発明の１実施の形態にかかるシステムの外
観図である。FIG. 1 is an external view of a system according to an embodiment of the present invention.

【図２】本発明の１実施の形態にかかるシステムのハ
ードウェア的構成を示すブロック図である。FIG. 2 is a block diagram showing a hardware configuration of a system according to an embodiment of the present invention.

【図３】本発明の原理を説明するための図である。FIG. 3 is a diagram for explaining the principle of the present invention.

【図４】本発明の実施の形態１にかかるシステムで実
行される処理のフローチャートである。FIG. 4 is a flowchart of processing executed by the system according to the first embodiment of the present invention.

【図５】画像データから眉間の候補点を抽出する処理
のフローチャートである。FIG. 5 is a flowchart of a process of extracting a candidate point between eyebrows from image data.

【図６】平滑化／縮小前後の画像を示す図である。FIG. 6 is a diagram showing images before and after smoothing / reduction.

【図７】リングＤＦＴフィルタの計算経路を示す図で
ある。FIG. 7 is a diagram showing a calculation path of a ring DFT filter.

【図８】リングＤＦＴフィルタによる眉間の候補点の
抽出処理の結果を示す図である。FIG. 8 is a diagram showing a result of extraction processing of a candidate point between eyebrows by a ring DFT filter.

【図９】眉間の局部的特徴を説明するための模式図で
ある。FIG. 9 is a schematic diagram for explaining local features between the eyebrows.

【図１０】眉間の検定結果を示す図である。FIG. 10 is a diagram showing a result of a test between the eyebrows.

[Explanation of symbols]

２０画像特徴点抽出装置３０カメラ４０コンピュータ本体４２モニタ 20 Image feature point extraction device 30 cameras 40 computer body 42 monitors

フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06T 7/40 G06T 7/00 300 ＪＩＣＳＴファイル（ＪＯＩＳ)Front page continuation (58) Fields surveyed (Int.Cl. ⁷ , DB name) G06T 7/40 G06T 7/00 300 JISST file (JOIS)

Claims

(57) [Claims]

1. A step of preparing digital data of a value of each pixel in a target image area which is a human face area, and for each of the pixels in the target image area,
Determining a relative amount of a specific wave number component indicated by a distribution curve of the value of the pixel on a predetermined closed curve surrounding the pixel, and a relative amount of the specific wave number component within the face region. Is a station
The pixel showing the maximum locally is the feature point in the face area between the eyebrows
A method for extracting feature points of an image, the method including:

2. The method of extracting feature points of an image according to claim 1, wherein the closed curve is a circle centered on the pixel and having a radius determined according to a size of the image region.

3. The step of obtaining the relative amount of the specific wave number component, for each of the pixels in the image area of interest, sets the value of each of the pixels on the closed curve surrounding the pixel. The method for extracting feature points of an image according to claim 1 or 2, further comprising the step of determining a relative amount of the specific wave number component by performing a discrete Fourier transform as a sampling value.

4. The method of extracting feature points of an image according to claim 1, wherein the specific wave number component is a wave number 2 component.

5. A component having a wave number of 1 in a distribution curve of the value of each pixel on the closed curve surrounding the candidate points among a plurality of candidate feature points extracted by the step of extracting the characteristic points. 5. The method for extracting a feature point of an image according to claim 4, further comprising the step of selecting only a feature point having a ratio between the spectrum power of 1) and the spectrum power of the component of wave number 2 smaller than a predetermined threshold value as the feature point.

6. Before SL value of each pixel brightness der of each pixel
That, method of extracting feature points of the image according to 請 Motomeko 1.

7. A computer-readable recording medium having recorded thereon a program for causing a computer to execute a method of extracting a feature point in a target image area, the program being a human face area. There is a program portion that performs a process of preparing digital data of the value of each pixel in the target image area, and for each of the pixels in the target image area,
A program part for performing a process of obtaining a relative amount of a specific wave number component indicated by a distribution curve of the value of the pixel on a predetermined closed curve surrounding the pixel , Relative quantity is bureau
The pixel showing the maximum locally is the feature point in the face area between the eyebrows
A computer-readable recording medium that includes a program portion that performs a process of extracting as a point that represents .

8. The computer-readable recording medium according to claim 7, wherein the closed curve is a circle centered on the pixel and having a radius determined according to a size of the image area.

9. A program portion for performing a process of obtaining a relative amount of the specific wave number component, for each of the pixels in the target image area, the program on the closed curve surrounding the pixel. 9. The computer-readable recording medium according to claim 7, further comprising a program portion for performing a process of obtaining a relative amount of the specific wave number component by performing a discrete Fourier transform using a pixel value as a sampling value. .

10. The computer-readable recording medium according to claim 7, wherein the specific wave number component is a wave number 2 component.

11. The program further includes a distribution of the values of the respective pixels on the closed curve surrounding the candidate points among a plurality of characteristic point candidates extracted by a program part that performs processing for extracting the characteristic points. 11. The program part according to claim 10, further comprising a program portion for performing a process of selecting, as a characteristic point, only a curve having a ratio of the spectral power of the wave number 1 component and the spectral power of the wave number 2 component smaller than a predetermined threshold value. Computer readable recording medium.

12. Before SL value of each pixel is Ru Oh <br/> in brightness of each pixel, computer-readable recording medium according to claim 7
Body .

13. A means for preparing digital data of a value of each pixel in a target image area which is a human face area, and for each of the pixels in the target image area,
A means for obtaining a relative amount of a specific wave number component indicated by a distribution curve of values on a predetermined closed curve surrounding the pixel, and a relative amount of the specific wave number component in the face region are Station
The pixel showing the maximum locally is the feature point in the face area between the eyebrows
An image feature point extraction device including means for extracting as a point representing the image feature point.

14. The closed curve is centered on the pixel,
The image feature point extraction device according to claim 13, wherein the image feature point extraction device is a circle having a radius determined according to the size of the image region.

15. The means for determining the relative amount of the specific wave number component includes, for each of the pixels in the target image area, a pixel of each of the pixels on the closed curve that surrounds the pixel. The image feature point extraction apparatus according to claim 13 or 14, further comprising means for obtaining a relative amount of the specific wave number component by performing a discrete Fourier transform using the value as a sampling value.

16. The image feature point extraction apparatus according to claim 13, wherein the specific wave number component is a wave number 2 component.

17. Further, among a plurality of feature point candidates extracted by means for extracting as a feature point,
Only the characteristic points of which the ratio of the spectral power of the wave number 1 component and the spectral power of the wave number 2 component of the distribution curve of the value of each pixel on the closed curve surrounding the candidate point is smaller than a predetermined threshold value Including means for choosing as
The image feature point extraction device according to claim 16.

18. Before SL value of each pixel is Ru Oh <br/> in brightness of each pixel, image feature point extraction device according to 請 Motomeko 13.