JP6991401B2

JP6991401B2 - Information processing equipment, programs and information processing methods

Info

Publication number: JP6991401B2
Application number: JP2021541019A
Authority: JP
Inventors: 隼也大澤; 大貴樋口
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2019-09-20
Filing date: 2019-09-20
Publication date: 2022-01-12
Anticipated expiration: 2039-09-20
Also published as: DE112019007642T5; WO2021053806A1; JPWO2021053806A1

Description

本発明は、情報処理装置、プログラム及び情報処理方法に関する。 The present invention relates to an information processing apparatus, a program and an information processing method.

車内又は工場等において、事故防止のために人の眠気状態を推定する技術開発が進んでいる。例えば、心拍、脳波又は瞬き等から眠気状態に応じて変化する特徴量を抽出し、その特徴量を閾値と比較することによって眠気状態を推定する技術がある。 Technological development for estimating the drowsiness of a person is progressing in a car or in a factory to prevent accidents. For example, there is a technique for estimating a drowsiness state by extracting a feature amount that changes according to a drowsiness state from a heartbeat, an electroencephalogram, a blink, or the like, and comparing the feature amount with a threshold value.

心拍又は脳波といった生体信号は人の状態を直接計測でき、可能性のある手法であるが、センサを装着する必要があり、煩わしく感じる場合がある。また、センサを装着し忘れた場合には機能自体が損なわれてしまい、コストが高いという問題点がある。 Biological signals such as heartbeats or brain waves can directly measure a person's condition and are a possible technique, but they require a sensor to be attached and may be annoying. Further, if the sensor is forgotten to be attached, the function itself is impaired, and there is a problem that the cost is high.

一方、カメラで撮像された動画像から人の眠気状態を推定する手法は、低コスト、センサ寿命が長い、ユーザに非接触及びシステムが簡易という利点がある。 On the other hand, the method of estimating the drowsiness state of a person from a moving image captured by a camera has the advantages of low cost, long sensor life, non-contact with the user, and simple system.

動画像から人の眠気状態を推定する手法として例えば、瞬目頻度、瞬目速度、及び瞬目持続時間の各々に閾値を設定して、眠気状態を推定する技術がある。しかし、サングラスを装着している場合は瞬目情報を正確に取得できないため、眠気状態を正確に推定できないという問題がある。 As a method for estimating a person's drowsiness state from a moving image, for example, there is a technique for estimating a drowsiness state by setting threshold values for each of blink frequency, blink speed, and blink duration. However, when wearing sunglasses, there is a problem that the drowsiness state cannot be estimated accurately because the blink information cannot be acquired accurately.

瞬目情報を用いず、動画像から人の眠気状態を推定する手法として、例えば、欠伸発生頻度を用いて眠気状態を推定する技術が特許文献１に開示されている。特許文献１に記載された技術は、開口の変化のパターンマッチングで欠伸を判定する（段落００８２等）。 As a method for estimating a person's drowsiness state from a moving image without using blink information, for example, a technique for estimating a drowsiness state using the frequency of yawning is disclosed in Patent Document 1. The technique described in Patent Document 1 determines yawning by pattern matching of changes in openings (paragraph 882, etc.).

特開２００５－１９９０７８号公報Japanese Unexamined Patent Publication No. 2005-199078

しかしながら、上述の特許文献１に記載された技術は、開口の変化のパターンマッチングで欠伸を判定するため、パターンの設定の仕方によっては誤判定率が高くなるという問題点がある。 However, the technique described in the above-mentioned Patent Document 1 has a problem that the erroneous determination rate becomes high depending on the method of setting the pattern because the yawning is determined by the pattern matching of the change of the opening.

そこで、本発明の一又は複数の態様は、精度良く欠伸状態を判定できるようにすることを目的とする。 Therefore, one or more aspects of the present invention are intended to enable accurate determination of the yawning state.

本発明の一態様に係る情報処理装置は、動画像に含まれている複数のフレームの各々から、人物の顔の領域である顔領域を抽出する顔領域抽出部と、前記顔領域から、予め定められた複数の特徴点を抽出する顔特徴点抽出部と、前記複数の特徴点から、前記顔の特徴を示す顔特徴量を算出する顔特徴量算出部と、前記顔特徴量から、前記顔において口が開いている度合いである開口度を特定する開口度特定部と、前記複数のフレームから特定された複数の前記開口度により、前記口が継続して開いている度合いである開口継続度を算出する開口継続度算出部と、前記人物が欠伸以外の予め定められた要因による開口を行っているか否かを判定し、前記予め定められた要因による開口を行っていると判定されたフレームに、予め定められた第１の閾値よりも低い値を欠伸開口継続度として対応付け、前記予め定められた要因による開口を行っていないと判定されたフレームに、前記開口継続度を欠伸開口継続度として対応付け、最新のフレームを含む予め定められた数の連続したフレームの内、前記第１の閾値以上となっている欠伸開口継続度が対応付けられているフレームの数が、予め定められた第２の閾値以上の場合に、前記人物が欠伸をしたと判定する判定処理部と、を備えることを特徴とする。 The information processing apparatus according to one aspect of the present invention has a face area extraction unit that extracts a face area, which is a face area of a person, from each of a plurality of frames included in a moving image, and a face area extraction unit that extracts the face area from the face area in advance. From the face feature point extraction unit that extracts a plurality of defined feature points, the face feature amount calculation unit that calculates the face feature amount indicating the facial feature from the plurality of feature points, and the face feature amount. The opening continuation, which is the degree to which the mouth is continuously opened, by the opening degree specifying portion that specifies the opening degree, which is the degree to which the mouth is open on the face, and the plurality of the opening degrees specified from the plurality of frames. The opening continuity calculation unit that calculates the degree determines whether or not the person is opening due to a predetermined factor other than the missing stretch, and it is determined that the person is opening due to the predetermined factor. A value lower than a predetermined first threshold is associated with the frame as the extension opening continuity, and the opening continuity is defined as the extension opening in the frame determined that the opening is not performed due to the predetermined factor. The number of frames to which the stretch opening continuity, which is equal to or higher than the first threshold value, is associated with the predetermined number of consecutive frames including the latest frame, which is associated with the continuity, is predetermined. It is characterized by comprising a determination processing unit for determining that the person has a defect when the value is equal to or higher than the second threshold value.

本発明の一態様に係るプログラムは、コンピュータを、動画像に含まれている複数のフレームの各々から、人物の顔の領域である顔領域を抽出する顔領域抽出部、前記顔領域から、予め定められた複数の特徴点を抽出する顔特徴点抽出部、前記複数の特徴点から、前記顔の特徴を示す顔特徴量を算出する顔特徴量算出部、前記顔特徴量から、前記顔において口が開いている度合いである開口度を特定する開口度特定部、前記複数のフレームから特定された複数の前記開口度により、前記口が継続して開いている度合いである開口継続度を算出する開口継続度算出部、及び、前記人物が欠伸以外の予め定められた要因による開口を行っているか否かを判定し、前記予め定められた要因による開口を行っていると判定されたフレームに、予め定められた第１の閾値よりも低い値を欠伸開口継続度として対応付け、前記予め定められた要因による開口を行っていないと判定されたフレームに、前記開口継続度を欠伸開口継続度として対応付け、最新のフレームを含む予め定められた数の連続したフレームの内、前記第１の閾値以上となっている欠伸開口継続度が対応付けられているフレームの数が、予め定められた第２の閾値以上の場合に、前記人物が欠伸をしたと判定する判定処理部、として機能させることを特徴とする。 The program according to one aspect of the present invention uses a computer in advance from a face area extraction unit that extracts a face area, which is a face area of a person, from each of a plurality of frames included in a moving image, and from the face area. A face feature point extraction unit that extracts a plurality of defined feature points, a face feature amount calculation unit that calculates a face feature amount indicating the face feature from the plurality of feature points, and a face feature amount that calculates the face feature amount from the face feature amount. The opening continuity, which is the degree to which the mouth is continuously open, is calculated from the opening degree specifying portion that specifies the opening degree, which is the degree to which the mouth is open, and the plurality of the opening degrees specified from the plurality of frames. The opening continuity calculation unit and the frame determined to determine whether or not the person has opened due to a predetermined factor other than yawning, and the frame determined to have opened due to the predetermined factor. , A value lower than the predetermined first threshold value is associated as the yawning opening continuity, and the opening continuity is set as the yawning opening continuity to the frame determined that the opening is not performed due to the predetermined factor. Of the predetermined number of consecutive frames including the latest frame, the number of frames to which the yawning opening continuity, which is equal to or higher than the first threshold value, is associated is predetermined. It is characterized in that it functions as a determination processing unit for determining that the person has yawned when the value is equal to or higher than the second threshold value.

本発明の一態様に係る情報処理方法は、動画像に含まれている複数のフレームの各々から、人物の顔の領域である顔領域を抽出し、前記顔領域から、予め定められた複数の特徴点を抽出し、前記複数の特徴点から、前記顔の特徴を示す顔特徴量を算出し、前記顔特徴量から、前記顔において口が開いている度合いである開口度を特定し、前記複数のフレームから特定された複数の前記開口度により、前記口が継続して開いている度合いである開口継続度を算出し、前記人物が欠伸以外の予め定められた要因による開口を行っているか否かを判定し、前記予め定められた要因による開口を行っていると判定されたフレームに、予め定められた第１の閾値よりも低い値を欠伸開口継続度として対応付け、前記予め定められた要因による開口を行っていないと判定されたフレームに、前記開口継続度を欠伸開口継続度として対応付け、最新のフレームを含む予め定められた数の連続したフレームの内、前記第１の閾値以上となっている欠伸開口継続度が対応付けられているフレームの数が、予め定められた第２の閾値以上の場合に、前記人物が欠伸をしたと判定することを特徴とする。 In the information processing method according to one aspect of the present invention, a face region, which is a region of a person's face, is extracted from each of a plurality of frames included in a moving image, and a plurality of predetermined faces are extracted from the face region. The feature points are extracted, the facial feature amount indicating the facial feature is calculated from the plurality of feature points, and the degree of opening, which is the degree to which the mouth is open in the face, is specified from the facial feature amount. Whether the person opens by a predetermined factor other than yawning by calculating the opening continuity, which is the degree to which the mouth is continuously opened, from the plurality of openings specified from the plurality of frames. It is determined whether or not the frame is opened due to the predetermined factor, and a value lower than the predetermined first threshold value is associated with the frame as the yawning opening continuity, which is predetermined. The opening continuity is associated with the frame determined not to be opened due to the above factor as the yawning opening continuity, and the first threshold value among the predetermined number of consecutive frames including the latest frame. When the number of frames associated with the above-mentioned yawning opening continuity is equal to or greater than a predetermined second threshold value, it is determined that the person has yawned.

本発明の一又は複数の態様によれば、精度良く欠伸状態を判定することができる。 According to one or more aspects of the present invention, the yawning state can be accurately determined.

実施の形態１に係る欠伸判定装置の構成を概略的に示すブロック図である。It is a block diagram which shows schematic structure of the yawning determination apparatus which concerns on Embodiment 1. FIG. 実施の形態１における開口区別判定条件の一例を示す概略図である。It is a schematic diagram which shows an example of the opening distinction determination condition in Embodiment 1. FIG. 欠伸判定条件の一例を示す概略図である。It is a schematic diagram which shows an example of a yawning determination condition. 実施の形態１における処理部の構成を概略的に示すブロック図である。It is a block diagram which shows schematic structure of the processing part in Embodiment 1. FIG. 顔特徴点情報の一例を示す概略図である。It is a schematic diagram which shows an example of the face feature point information. 顔特徴量を説明するための概略図である。It is a schematic diagram for demonstrating the amount of facial features. 顔特徴量情報を示す概略図である。It is a schematic diagram which shows the facial feature amount information. （Ａ）及び（Ｂ）は、開口速度を説明するためのグラフである。(A) and (B) are graphs for explaining the opening speed. 欠伸判定装置のハードウェア構成を概略的に示すブロック図である。It is a block diagram which shows the hardware composition of the yawning determination device schematicly. 実施の形態１における欠伸判定部での処理を示すフローチャートである。It is a flowchart which shows the process in the yawning determination part in Embodiment 1. 実施の形態２及び３に係る欠伸判定装置の構成を概略的に示すブロック図である。It is a block diagram which shows schematic structure of the yawning determination apparatus which concerns on Embodiments 2 and 3. 実施の形態２における開口区別判定条件の一例を示す概略図である。It is a schematic diagram which shows an example of the opening distinction determination condition in Embodiment 2. 実施の形態２における顔表情特徴モデルの一例を示す概略図である。It is a schematic diagram which shows an example of the facial expression characteristic model in Embodiment 2. 実施の形態２における処理部の構成を概略的に示すブロック図である。It is a block diagram which shows schematic structure of the processing part in Embodiment 2. FIG. 実施の形態２における欠伸判定部での処理を示すフローチャートである。It is a flowchart which shows the process in the yawning determination part in Embodiment 2. 実施の形態３における開口区別判定条件の一例を示す概略図である。It is a schematic diagram which shows an example of the opening distinction determination condition in Embodiment 3. FIG. 実施の形態３における処理部の構成を概略的に示すブロック図である。It is a block diagram which shows schematic structure of the processing part in Embodiment 3. FIG. 実施の形態３における欠伸判定部での処理を示すフローチャートである。It is a flowchart which shows the process in the yawning determination part in Embodiment 3. FIG.

実施の形態１．
図１は、実施の形態１に係る情報処理装置としての欠伸判定装置１００の構成を概略的に示すブロック図である。
欠伸判定装置１００は、撮像部１１０と、データベース部１２０と、処理部１３０と、表示部１５０とを備える。Embodiment 1.
FIG. 1 is a block diagram schematically showing a configuration of a yawning determination device 100 as an information processing device according to the first embodiment.
The yawn determination device 100 includes an image pickup unit 110, a database unit 120, a processing unit 130, and a display unit 150.

撮像部１１０は、動画像を取得して、取得された動画像を処理部１３０に与える。
データベース部１２０は、欠伸判定条件モデル１２１を記憶する記憶部である。
欠伸判定条件モデル１２１は、開口区別判定条件及び欠伸判定条件を含む。The image pickup unit 110 acquires a moving image and gives the acquired moving image to the processing unit 130.
The database unit 120 is a storage unit that stores the yawning determination condition model 121.
The yawning determination condition model 121 includes an opening distinction determination condition and a yawning determination condition.

開口区別判定条件は、予め定められた要因による開口と、予め定められた要因ではない要因による開口とを区別するために使用される判定条件である。
図２は、開口区別判定条件の一例を示す概略図である。
実施の形態１では、開口区別判定条件は、後述する開口速度評価値が予め定められた閾値（例えば、２０）以上である場合には、予め定められた要因ではない要因による開口と判定し、その開口速度評価値が予め定められた閾値未満である場合には、予め定められた要因による開口と判定する条件になっている。開口速度評価値の詳細は、後述する。なお、予め定められた要因ではない要因による開口が行われている場合は、予め定められた要因による開口が行われていないことになる。The opening distinction determination condition is a determination condition used to distinguish between an opening due to a predetermined factor and an opening due to a factor other than the predetermined factor.
FIG. 2 is a schematic view showing an example of an opening distinction determination condition.
In the first embodiment, when the opening speed evaluation value described later is equal to or higher than a predetermined threshold value (for example, 20), the opening distinction determination condition is determined to be an opening due to a factor other than the predetermined factor. When the opening speed evaluation value is less than a predetermined threshold value, it is a condition for determining the opening due to a predetermined factor. The details of the opening speed evaluation value will be described later. If the opening is performed by a factor other than the predetermined factor, it means that the opening is not performed by the predetermined factor.

欠伸判定条件は、欠伸と判定するための条件である。
図３は、欠伸判定条件の一例を示す概略図である。
実施の形態１では、欠伸判定条件は、過去の予め定められた数のフレーム（例えば、１００フレーム）の内、欠伸開口継続度が第１の閾値としての予め定められた値（例えば、４０）以上となっているフレームの数が第２の閾値としての予め定められた数（例えば、２０）以上の場合に、欠伸と判定する条件となっている。なお、欠伸開口継続度の詳細は、後述する。ここで、過去の予め定められた数のフレームは、最新のフレームを含む予め定められた数の連続したフレームである。The yawning determination condition is a condition for determining yawning.
FIG. 3 is a schematic view showing an example of yawning determination conditions.
In the first embodiment, the yawning determination condition is a predetermined value (for example, 40) in which the yawning opening continuity is the first threshold value among the predetermined number of frames (for example, 100 frames) in the past. When the number of frames as described above is equal to or greater than a predetermined number (for example, 20) as the second threshold value, it is a condition for determining yawning. The details of the yawning opening continuity will be described later. Here, the predetermined number of frames in the past is a predetermined number of consecutive frames including the latest frame.

図１に戻り、処理部１３０は、欠伸判定装置１００での処理を実行する。
図４は、実施の形態１における処理部１３０の構成を概略的に示すブロック図である。
処理部１３０は、入力部１３１と、顔領域抽出部１３２と、顔特徴点抽出部１３３と、顔特徴点記憶部１３４と、顔特徴量算出部１３５と、顔特徴量記憶部１３６と、開口度特定部１３７と、開口度記憶部１３８と、開口継続度算出部１３９と、開口継続度記憶部１４０と、判定処理部１４１と、欠伸判定結果記憶部１４６と、出力部１４７とを備える。Returning to FIG. 1, the processing unit 130 executes processing by the yawning determination device 100.
FIG. 4 is a block diagram schematically showing the configuration of the processing unit 130 according to the first embodiment.
The processing unit 130 includes an input unit 131, a face area extraction unit 132, a face feature point extraction unit 133, a face feature point storage unit 134, a face feature amount calculation unit 135, a face feature amount storage unit 136, and an opening. It includes a degree specifying unit 137, an opening degree storage unit 138, an opening continuity calculation unit 139, an opening continuity storage unit 140, a determination processing unit 141, a stretch determination result storage unit 146, and an output unit 147.

判定処理部１４１は、開口速度評価値算出部１４２と、開口速度評価値記憶部１４３と、欠伸判定部１４４と、一時記憶部１４５とを備える。 The determination processing unit 141 includes an opening speed evaluation value calculation unit 142, an opening speed evaluation value storage unit 143, a yawning determination unit 144, and a temporary storage unit 145.

入力部１３１は、撮像部１１０で取得された動画像の入力を受ける。入力された動画像は、顔領域抽出部１３２に与えられる。 The input unit 131 receives the input of the moving image acquired by the image pickup unit 110. The input moving image is given to the face area extraction unit 132.

顔領域抽出部１３２は、動画像に含まれている複数のフレームの各々から、人物の顔の領域である顔領域を抽出する。
例えば、顔領域抽出部１３２は、Ａｄａｂｏｏｓｔ学習によるＨａａｒ－ｌｉｋｅ特徴を用いた識別器を用いて、入力された動画像から人物の顔領域を抽出する。
これについては、例えば、下記の文献に記載されている。
ＰａｕｌＶｉｏｌａ，ＭｉｃｈａｅｌＪ．Ｊｏｎｅｓ、“ＲｏｂｕｓｔＲｅａｌ-ＴｉｍｅＦａｃｅＤｅｔｅｃｔｉｏｎ”、ＩｎｔｅｒｎａｔｉｏｎａｌＪｏｕｒｎａｌｏｆＣｏｍｐｕｔｅｒＶｉｓｉｏｎ．Ｖｏｌ．５７、Ｎｏ．２、ｐｐ１３７－１５４、２００４年The face region extraction unit 132 extracts a face region, which is a region of a person's face, from each of the plurality of frames included in the moving image.
For example, the face area extraction unit 132 extracts a person's face area from the input moving image by using a discriminator using the Haar-like feature by AdaBoost learning.
This is described, for example, in the following literature.
Paul Viola, Michael J. Jones, "Robust Real-Time Face Detection", International Journal of Computer Vision. Vol. 57, No. 2, pp137-154, 2004

顔特徴点抽出部１３３は、抽出された顔領域から、輪郭、眉毛、目、鼻又は口等の予め定められた複数の特徴点である複数の顔特徴点を抽出する。
抽出された顔領域画像において輪郭、眉毛、目、鼻又は口等の顔特徴点を抽出する抽出方法としては、例えば、下記の文献に記載されている公知の方法が用いられればよい。
ＷｉｓｋｏｔｔＬ．，Ｆｅｌｌｏｕｓｊ．－Ｍ．，ＫｒｕｇｅｒＮ．，ｖｏｎｄｅｒＭｌｓｂｕｒｇＣ．、 “ＦａｃｅＲｅｃｏｇｎｉｔｉｏｎｂｙＥｌａｓｔｉｃＢｕｎｃｈＧｒａｐｈＭａｔｃｈｉｎｇ”、ＩＥＥＥＴｒａｎｓａｃｔｉｏｎｓｏｎＰａｔｔｅｒｎＡｎａｌｙｓｉｓａｎｄＭａｃｈｉｎｅＩｎｔｅｌｌｉｇｅｎｃｅ、Ｖｏｌ．１９、Ｉｓｓｕｅ７、ｐｐ．７７５－７７９、１９９７年The facial feature point extraction unit 133 extracts a plurality of facial feature points, which are a plurality of predetermined feature points such as contours, eyebrows, eyes, nose, and mouth, from the extracted face area.
As an extraction method for extracting facial feature points such as contours, eyebrows, eyes, nose or mouth in the extracted facial region image, for example, a known method described in the following documents may be used.
Wiskott L. , Fellous j. -M. , Kruger N. et al. , Von der Mlsburg C.I. , "Face Recognition by Elastic Bunch Graph Matching", IEEE Transitions on Pattern Analysis and Machine Integrity, Vol. 19, Issue 7, pp. 775-779, 1997

また、顔特徴点抽出部１３３は、抽出された複数の顔特徴点の配置から顔の向きも算出する。顔の向きは、例えば、Ｙａｗ、Ｐｉｔｃｈ及びＲｏｌｌの角度で特定される。 In addition, the face feature point extraction unit 133 also calculates the orientation of the face from the arrangement of the extracted plurality of face feature points. The orientation of the face is specified, for example, by the angles of Yaw, Pitch and Roll.

顔特徴点記憶部１３４は、顔特徴点抽出部１３３で抽出された複数の顔特徴点及び算出された顔の向きを示す顔特徴点情報を格納する。
図５は、顔特徴点記憶部１３４に記憶されている顔特徴点情報の一例を示す概略図である。
図５に示されているように、顔特徴点情報は、顔特徴点の座標及び顔の向きの角度を含む。The face feature point storage unit 134 stores a plurality of face feature points extracted by the face feature point extraction unit 133 and face feature point information indicating the calculated face orientation.
FIG. 5 is a schematic diagram showing an example of face feature point information stored in the face feature point storage unit 134.
As shown in FIG. 5, the facial feature point information includes the coordinates of the facial feature points and the angle of the face orientation.

図４に戻り、顔特徴量算出部１３５は、抽出された複数の顔特徴点から、顔の特徴を示す顔特徴量を算出する。
例えば、顔特徴量算出部１３５は、２点の顔特徴点間の距離又は３点の顔特徴点からなる角度等により顔の特徴を表す顔特徴量を算出する。
具体的には、顔特徴量算出部１３５は、図６に示されているように、人物の顔の正面から見て左目の目頭と目尻との２点間の距離Ｖ１、右目の目頭と目尻との２点間の距離Ｖ２、上唇と下唇との２点間の距離Ｖ３、基準点（例えば、鼻の先端）Ｐと両眉の内端とを結んだ直線Ｌ１、Ｌ２間の角度（Ｖ４）、基準点Ｐと鼻の下の両端とを結んだ直線Ｌ３、Ｌ４間の角度Ｖ５、及び、基準点Ｐと左右両口角とを結んだ直線Ｌ５、Ｌ６間の角度Ｖ６を顔特徴量として算出する。これらの顔特徴量は、口角の上がり、小鼻の開き、眉のしかめ等表情変化において特徴が表れるとされている特徴量である。Returning to FIG. 4, the facial feature amount calculation unit 135 calculates the facial feature amount indicating the facial feature from the extracted plurality of facial feature points.
For example, the facial feature amount calculation unit 135 calculates a facial feature amount representing a facial feature based on a distance between two facial feature points, an angle consisting of three facial feature points, and the like.
Specifically, as shown in FIG. 6, the facial feature amount calculation unit 135 has a distance V1 between two points between the inner and outer corners of the left eye and the inner and outer corners of the right eye when viewed from the front of the person's face. The distance V2 between the two points, the distance V3 between the upper and lower lips, the angle between the straight lines L1 and L2 connecting the reference point (for example, the tip of the nose) P and the inner ends of both eyebrows (for example). V4), the angle V5 between the straight lines L3 and L4 connecting the reference point P and both ends under the nose, and the angle V6 between the straight lines L5 and L6 connecting the reference point P and the left and right mouth angles are facial features. Calculated as. These facial features are features that are said to appear in facial expression changes such as rising corners of the mouth, opening of the nose, and frowning of the eyebrows.

顔特徴量算出部１３５が算出する顔特徴量は、以上の例に限定されない。例えば、顔特徴量は、顔の眉間領域、口領域又は頬領域における画像特徴量であってもよい。
このとき画像特徴量は、例えば、ＨＯＧ（ＨｉｓｔｏｇｒａｍｓｏｆＯｒｉｅｎｔｅｄＧｒａｄｉｅｎｔｓ）特徴量を用いるものとする。
これについては、例えば、下記の文献に記載されている。
Ｎ．Ｄａｌａｌ，Ｂ．Ｔｒｉｇｇｓ、 “ＨｉｓｔｏｇｒａｍｓｏｆＯｒｉｅｎｔｅｄＧｒａｄｉｅｎｔｓｆｏｒｈｕｍａｎＤｅｔｅｃｔｉｏｎ”、Ｐｒｏｃ．ＩＥＥＥＣｏｎｆｅｒｅｎｃｅｏｎＣｏｍｐｕｔｅｒＶｉｓｉｏｎａｎｄＰａｔｔｅｒｎＲｅｃｏｇｎｉｔｉｏｎ（ＣＶＰＲ）、ｐｐ．８８６－８９３、２００５年The facial feature amount calculated by the facial feature amount calculation unit 135 is not limited to the above examples. For example, the facial feature amount may be an image feature amount in the glabellar region, mouth region, or cheek region of the face.
At this time, for the image feature amount, for example, the HOG (Histograms of Oriented Gradients) feature amount is used.
This is described, for example, in the following literature.
N. Dalal, B. Triggs, "Histograms of Oriented Gradients for human Detection", Proc. IEEE Computer Vision and Pattern Recognition (CVPR), pp. 886-893, 2005

なお、画像特徴量としては、ＨＯＧ特徴量以外でも、ＳＩＦＴ（ＳｃａｌｅｄＩｎｖａｒｉａｎｃｅＦｅａｔｕｒｅＴｒａｎｓｆｏｒｍ）特徴量、ＳＵＲＦ（Ｓｐｅｅｄｅ-ｕｐＲｏｂｕｓｔＦｅａｔｕｒｅｓ）又はＨａａｒ－ｌｉｋｅ特徴量が用いられてもよい。 In addition to the HOG feature amount, SIFT (Scaled Invariant Feature Features) feature amount, SURF (Speed-up Robot Features), or Haar-like feature amount may be used as the image feature amount.

図４に戻り、顔特徴量記憶部１３６は、顔特徴量算出部１３５で算出された顔特徴量を示す顔特徴量情報を記憶する。
図７は、顔特徴量記憶部１３６に記憶されている顔特徴量情報を示す概略図である。
図７に示されているように、顔特徴量情報は、顔特徴量として算出された値を格納する。Returning to FIG. 4, the face feature amount storage unit 136 stores the face feature amount information indicating the face feature amount calculated by the face feature amount calculation unit 135.
FIG. 7 is a schematic diagram showing facial feature amount information stored in the face feature amount storage unit 136.
As shown in FIG. 7, the face feature amount information stores a value calculated as a face feature amount.

図４に戻り、開口度特定部１３７は、顔特徴量記憶部１３６に記憶されている顔特徴量から、人物の顔において口が開いている度合いである開口度を特定する。
例えば、開口度特定部１３７は、上唇と下唇との２点間の距離Ｖ３を開口度として特定する。Returning to FIG. 4, the opening degree specifying unit 137 specifies the opening degree, which is the degree to which the mouth is open in the face of a person, from the facial feature amount stored in the facial feature amount storage unit 136.
For example, the opening degree specifying portion 137 specifies the distance V3 between the two points of the upper lip and the lower lip as the opening degree.

また、開口度特定部１３７は、距離Ｖ３を正規化して０～１００までのパーセントの値に変換して用いてもよい。
具体的には、２次元の画像において、人物が正面を向いているときの開口度１００％における距離Ｖ３と、人物が上又は下を見ているときの開口度１００％における距離Ｖ３とは異なる。このため、開口度特定部１３７は、人物が正面を向いているときは、距離Ｖ３＝５ｃｍで開口度１００％となるようにし、人物が上を向いているときは、距離Ｖ３＝３ｃｍで開口度１００％となるように、顔特徴点記憶部１３４に記憶されている顔の向きに応じて、開口度１００％となる距離Ｖ３の値を変化させる正規化処理を行ってもよい。Further, the opening degree specifying unit 137 may be used by normalizing the distance V3 and converting it into a percentage value from 0 to 100.
Specifically, in a two-dimensional image, the distance V3 at 100% opening when the person is facing the front is different from the distance V3 at 100% opening when the person is looking up or down. .. Therefore, the opening degree specifying portion 137 is set so that the opening degree is 100% at a distance V3 = 5 cm when the person is facing the front, and the opening degree is opened at a distance V3 = 3 cm when the person is facing upward. A normalization process may be performed in which the value of the distance V3 at which the opening degree is 100% is changed according to the direction of the face stored in the face feature point storage unit 134 so that the degree becomes 100%.

開口度記憶部１３８は、開口度特定部１３７により特定された開口度を記憶する。 The opening degree storage unit 138 stores the opening degree specified by the opening degree specifying unit 137.

開口継続度算出部１３９は、開口度記憶部１３８に記憶されている複数の開口度により、口が継続して開いている度合いである開口継続度を算出する。複数の開口度は、複数のフレームから特定される。
例えば、開口継続度算出部１３９は、動画像におけるｔ番目（ｔは１以上の整数）のフレームであるｔフレームの開口度と、過去のフレームの開口度とを重み付けして加算することで、開口継続度を算出する。ｔ＋１番目のフレームであるｔ＋１フレームにおける開口継続度ｘ_ｔ＋１の算出式は、下記の（１）式である。

The opening continuity calculation unit 139 calculates the opening continuity degree, which is the degree to which the mouth is continuously opened, from the plurality of opening degrees stored in the opening degree storage unit 138. Multiple apertures are specified from multiple frames.
For example, the aperture continuity calculation unit 139 weights and adds the aperture degree of the t-frame, which is the t-th frame (t is an integer of 1 or more) in the moving image, and the aperture degree of the past frame. Calculate the opening continuity. The formula for calculating the opening continuity x _{t + 1} in the t + 1 frame, which is the t + 1st frame, is the following formula (1).

ここで、ｘ_ｔは、ｔフレームにおける開口継続度、ｙ_ｔ＋１は、ｔ＋１フレームにおける開口度、αは、パラメータの調整値であり、０＜α＜１を満たす。
この開口継続度は、継続して開口する場合に数値が高くなる時間フィルタをかけた値である。Here, x _t is the opening continuity in the t frame, y _{t + 1} is the opening degree in the t + 1 frame, α is the adjustment value of the parameter, and 0 <α <1 is satisfied.
This opening continuity is a time-filtered value in which the numerical value becomes higher when the opening is continued.

開口継続度記憶部１４０は、開口継続度算出部１３９で算出された開口継続度を記憶する。 The opening continuity storage unit 140 stores the opening continuity calculated by the opening continuity calculation unit 139.

判定処理部１４１は、フレーム内の人物が欠伸以外の予め定められた要因による開口を行っているか否かを判定する。判定処理部１４１は、予め定められた要因による開口を行っていると判定されたフレームに、予め定められた第１の閾値よりも低い値を欠伸開口継続度として対応付け、予め定められた要因による開口を行っていないと判定されたフレームに、開口継続度算出部１３９で算出された開口継続度を欠伸開口継続度として対応付ける。
そして、判定処理部１４１は、最新のフレームを含む予め定められた数の連続したフレームの内、第１の閾値以上となっている欠伸開口継続度が対応付けられているフレームの数が、予め定められた第２の閾値以上の場合に、その人物が欠伸をしたと判定する。The determination processing unit 141 determines whether or not the person in the frame has opened due to a predetermined factor other than yawning. The determination processing unit 141 associates a value lower than the predetermined first threshold value with the frame determined to be opened by a predetermined factor as the yawning opening continuity, and determines the predetermined factor. The frame determined not to be opened by the above is associated with the opening continuity calculated by the opening continuity calculation unit 139 as the yawning opening continuity.
Then, in the determination processing unit 141, among a predetermined number of consecutive frames including the latest frame, the number of frames associated with the yawning opening continuity that is equal to or higher than the first threshold value is determined in advance. If it is equal to or higher than the specified second threshold value, it is determined that the person has yawned.

実施の形態１では、判定処理部１４１は、フレーム内の人物が会話を行っている場合に、予め定められた要因による開口を行っていると判定する。
具体的には、判定処理部１４１は、人物の口が開く速度である開口速度が予め定められた第３の閾値未満の場合に、その人物が会話を行っていると判定する。In the first embodiment, the determination processing unit 141 determines that when a person in the frame is having a conversation, an opening is performed due to a predetermined factor.
Specifically, the determination processing unit 141 determines that the person is having a conversation when the opening speed, which is the speed at which the person's mouth opens, is less than a predetermined third threshold value.

判定処理部１４１での処理は、開口速度評価値算出部１４２、開口速度評価値記憶部１４３、欠伸判定部１４４及び一時記憶部１４５で実現される。以下、説明する。 The processing in the determination processing unit 141 is realized by the opening speed evaluation value calculation unit 142, the opening speed evaluation value storage unit 143, the yawning determination unit 144, and the temporary storage unit 145. This will be described below.

開口速度評価値算出部１４２は、開口継続度記憶部１４０に記憶されている開口継続度から、開口速度が速くなるほど大きくなる値である開口速度評価値を算出する。
ｔフレームにおける開口速度ｚ_ｔは、下記の（２）式で算出することができる。

ここで、Ｖ３_ｔは、ｔフレームにおける距離Ｖ３、Ｖ３_ｔ－１は、ｔ－１フレームにおける距離Ｖ３である。The opening speed evaluation value calculation unit 142 calculates an opening speed evaluation value, which is a value that increases as the opening speed increases, from the opening continuity stored in the opening continuity storage unit 140.
The opening speed zt in the _t -frame can be calculated by the following equation (2).

Here, V3 _t is the distance V3 in the t frame, and V3 _t-1 is the distance V3 in the t-1 frame.

開口速度評価値算出部１４２は、（２）式を用いて、開口速度ｚ_ｔを算出してもよいが、実施の形態１では、下記の（３）式～（５）式に示されている開口速度評価値ｚ＃１_ｔ～ｚ＃３_ｔの内の何れか一つを算出している。

The opening speed evaluation value calculation unit 142 may calculate the opening speed _zt using the formula (2), but in the first embodiment, it is shown by the following formulas (3) to (5). Any one of the existing opening speed evaluation values z # 1 _t to z # 3 _t is calculated.

開口速度評価値ｚ＃１_ｔ～ｚ＃３_ｔは、何れも開口速度ｚ_ｔが速くなるほど、大きくなる値である。このため、開口速度評価値ｚ＃１_ｔ～ｚ＃３_ｔが第４の閾値未満であるか否かを判定することで、開口速度が予め定められた第３の閾値未満であるか否かを判定することができる。
実施の形態１では、開口速度評価値算出部１４２は、開口速度評価値ｚ＃３_ｔを算出するものとする。ここで、（５）式では、ｘ_ｔの勾配に着目しており、上記の（１）式のαの値を小さく設定した場合、開口速度が速くなると、ｘ_ｔとｙ_ｔとの差分が大きくなるため、開口速度評価値ｚ＃３_ｔは、開口速度の評価値として使用することができる。The opening speed evaluation values z # 1 _t to z # 3 _t are all values that increase as the opening speed z _t increases. Therefore, whether or not the opening speed is less than a predetermined third threshold value by determining whether or not the opening speed evaluation values z # 1 _t to z # 3 _t are less than the fourth threshold value. Can be determined.
In the first embodiment, the opening speed evaluation value calculation unit 142 calculates the opening speed evaluation value z # 3 _t . Here, in equation (5), attention is paid to the gradient of x _t , and when the value of α in equation (1) above is set small, the difference between x _t and y _t increases as the opening speed increases. Since it becomes large, the opening speed evaluation value z # 3 _t can be used as an evaluation value of the opening speed.

ここで、開口速度は、開口継続度の傾斜が大きい場合に数値が高くなる。このため、図８（Ａ）及び（Ｂ）に示されているように、欠伸時は開口速度が速く、会話時は開口速度が遅くなる。 Here, the opening speed becomes higher when the inclination of the opening continuity is large. Therefore, as shown in FIGS. 8A and 8B, the opening speed is high during yawning and slow during conversation.

図４に戻り、開口速度評価値記憶部１４３は、開口速度評価値算出部１４２で算出された開口速度評価値を記憶する。 Returning to FIG. 4, the opening speed evaluation value storage unit 143 stores the opening speed evaluation value calculated by the opening speed evaluation value calculation unit 142.

欠伸判定部１４４は、開口継続度記憶部１４０に記憶されている開口継続度と、開口速度評価値記憶部１４３に記憶されている開口速度評価値とを用いて、予め定められた要因（ここでは、会話）による開口と、予め定められた要因ではない要因による開口とを判別し、欠伸を判定する。 The yawning determination unit 144 uses a predetermined factor (here) using the opening continuity stored in the opening continuity storage unit 140 and the opening speed evaluation value stored in the opening speed evaluation value storage unit 143. Then, the opening due to the conversation) and the opening due to a factor other than the predetermined factor are discriminated, and the yawning is determined.

具体的には、開口速度評価値記憶部１４３に記憶されている開口速度評価値が欠伸判定条件モデル１２１に含まれている開口区別判定条件において、開口速度評価値が予め定められた閾値（例えば、２０）未満である場合には、予め定められた要因による開口と判定する。ここでの閾値は、第４の閾値ともいう。 Specifically, in the opening distinction determination condition in which the opening speed evaluation value stored in the opening speed evaluation value storage unit 143 is included in the yawning determination condition model 121, the opening speed evaluation value is a predetermined threshold value (for example). , 20), it is determined that the opening is due to a predetermined factor. The threshold value here is also referred to as a fourth threshold value.

次に、欠伸判定部１４４は、予め定められた要因による開口と判定した場合には、無効値を欠伸開口継続値とし、予め定められた要因ではない要因による開口と判定した場合には、そのフレームの開口継続度を欠伸開口継続度として、そのフレームを識別するための識別情報であるフレーム識別番号に対応付けて、一時記憶部１４５に記憶する。無効値は、第１の閾値よりも小さい値であれば、どのような値でもよい。 Next, when the yawning determination unit 144 determines that the opening is due to a predetermined factor, the invalid value is set as the continuation value of the yawning opening, and when it is determined that the opening is due to a factor other than the predetermined factor, the invalid value is used. The degree of opening continuity of the frame is set as the degree of continuation of yawning, and is stored in the temporary storage unit 145 in association with the frame identification number which is the identification information for identifying the frame. The invalid value may be any value as long as it is smaller than the first threshold value.

そして、欠伸判定部１４４は、欠伸判定条件モデル１２１に含まれている欠伸判定条件が満たされている場合に、欠伸と判定する。
ここでは、欠伸判定部１４４は、過去の予め定められた数のフレーム（例えば、１００フレーム）の内、欠伸開口継続度が予め定められた値（例えば、４０）以上となっているフレームの数が予め定められた数（例えば、２０）以上のときに、欠伸と判定する。Then, the yawning determination unit 144 determines that yawning is achieved when the yawning determination condition included in the yawning determination condition model 121 is satisfied.
Here, the yawning determination unit 144 is the number of frames in which the degree of continuation of the yawning opening is a predetermined value (for example, 40) or more among the predetermined number of frames in the past (for example, 100 frames). When is a predetermined number (for example, 20) or more, it is determined to be yawning.

欠伸判定結果記憶部１４６は、欠伸判定部１４４での欠伸判定結果を記憶する。ここでの欠伸判定結果は、例えば、「欠伸」又は「欠伸ではない」である。 The yawning determination result storage unit 146 stores the yawning determination result in the yawning determination unit 144. The yawning determination result here is, for example, "yawning" or "not yawning".

出力部１４７は、欠伸判定結果記憶部１４６に記憶されている欠伸判定結果を図１に示されている表示部１５０に送り、表示部１５０にその欠伸判定結果又はその欠伸判定結果に対応する情報を表示させる。 The output unit 147 sends the yawning determination result stored in the yawning determination result storage unit 146 to the display unit 150 shown in FIG. 1, and the display unit 150 receives the yawning determination result or the information corresponding to the yawning determination result. Is displayed.

図９は、実施の形態１に係る欠伸判定装置１００のハードウェア構成を概略的に示すブロック図である。
実施の形態１に係る欠伸判定装置１００は、カメラ１６１と、補助記憶装置１６２と、プロセッサ１６３と、メモリ１６４と、表示装置１６５とを備えるコンピュータ１６０により構成することができる。FIG. 9 is a block diagram schematically showing a hardware configuration of the yawning determination device 100 according to the first embodiment.
The yawning determination device 100 according to the first embodiment can be configured by a computer 160 including a camera 161, an auxiliary storage device 162, a processor 163, a memory 164, and a display device 165.

具体的には、撮像部１１０は、撮像装置としてのカメラ１６１により実現することができる。
データベース部１２０は、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）等の補助記憶装置１６２により実現することができる。Specifically, the image pickup unit 110 can be realized by a camera 161 as an image pickup device.
The database unit 120 can be realized by an auxiliary storage device 162 such as an HDD (Hard Disk Drive).

処理部１３０は、プロセッサ１６３及びメモリ１６４により実現することができる。
プロセッサ１６３は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）等により実現することができ、メモリ１６４は、不揮発メモリにより実現することができる。プロセッサ１６３は、補助記憶装置１６２に記憶されているプログラムをメモリ１６４に読み出して、そのプログラムを実行することで各種処理を実行する。このようなプログラムは、ネットワークを通じて提供されてもよく、また、記録媒体に記録されて提供されてもよい。即ち、このようなプログラムは、例えば、プログラムプロダクトとして提供されてもよい。
表示部１５０は、表示装置１６５により実現することができる。The processing unit 130 can be realized by the processor 163 and the memory 164.
The processor 163 can be realized by a CPU (Central Processing Unit) or the like, and the memory 164 can be realized by a non-volatile memory. The processor 163 reads the program stored in the auxiliary storage device 162 into the memory 164 and executes the program to execute various processes. Such a program may be provided through a network, or may be recorded and provided on a recording medium. That is, such a program may be provided, for example, as a program product.
The display unit 150 can be realized by the display device 165.

図１０は、実施の形態１における欠伸判定部１４４での処理を示すフローチャートである。
まず、欠伸判定部１４４は、開口継続度記憶部１４０に記憶されている開口継続度と、開口速度評価値記憶部１４３に記憶されている開口速度評価値とを取得する（Ｓ１０）。FIG. 10 is a flowchart showing the processing in the yawning determination unit 144 in the first embodiment.
First, the stretchout determination unit 144 acquires the opening continuity stored in the opening continuity storage unit 140 and the opening speed evaluation value stored in the opening speed evaluation value storage unit 143 (S10).

次に、欠伸判定部１４４は、取得された開口速度評価値が、開口区別判定条件における閾値ａ以上であるか否かを判定する（Ｓ１１）。開口速度評価値が閾値ａ未満である場合（Ｓ１１でＮｏ）には、処理はステップＳ１２に進み、開口速度評価値が閾値ａ以上である場合（Ｓ１１でＹｅｓ）には、処理はステップＳ１３に進む。 Next, the yawning determination unit 144 determines whether or not the acquired opening speed evaluation value is equal to or greater than the threshold value a in the opening discrimination determination condition (S11). When the opening speed evaluation value is less than the threshold value a (No in S11), the process proceeds to step S12, and when the opening speed evaluation value is equal to or more than the threshold value a (Yes in S11), the process proceeds to step S13. move on.

ステップＳ１２では、開口速度評価値が閾値ａ未満であり、動画像に映っている人物の開口が欠伸以外の会話であると考えられるため、欠伸判定部１４４は、開口継続度を予め定められた値である無効値（例えば、「－１」）とする。そして、処理はステップＳ１３に進む。 In step S12, the opening speed evaluation value is less than the threshold value a, and it is considered that the opening of the person shown in the moving image is a conversation other than yawning. It is an invalid value (for example, "-1") which is a value. Then, the process proceeds to step S13.

ステップＳ１３では、欠伸判定部１４４は、現フレームの開口継続度を、欠伸開口継続度として、そのフレーム番号とともに一時記憶部１４５に記憶させる。ここで、欠伸開口継続度は、開口速度評価値が閾値ａ以上である場合には、現フレームに対応して開口継続度算出部１３９で算出された開口継続度であり、開口速度評価値が閾値ａ未満である場合には、ステップＳ１２で設定された無効値となる。 In step S13, the yawning determination unit 144 stores the opening continuity of the current frame as the yawning opening continuity in the temporary storage unit 145 together with the frame number. Here, the yawning opening continuity is the opening continuity calculated by the opening continuity calculation unit 139 corresponding to the current frame when the opening speed evaluation value is equal to or higher than the threshold value a, and the opening speed evaluation value is If it is less than the threshold value a, it becomes an invalid value set in step S12.

次に、欠伸判定部１４４は、欠伸判定条件モデル１２１に含まれている欠伸判定条件を満たすか否かを判定する（Ｓ１４）。ここでは、欠伸判定部１４４は、過去の予め定められた数のフレーム（例えば、１００フレーム）の内、欠伸開口継続度が予め定められた値（例えば、４０）以上となっているフレームの数が予め定められた数（例えば、２０）以上のときに、欠伸と判定する。欠伸判定条件が満たされている場合（Ｓ１４でＹｅｓ）には、処理はステップＳ１５に進み、欠伸判定条件が満たされていない場合（Ｓ１４でＮｏ）には、処理はステップＳ１６に進む。 Next, the yawning determination unit 144 determines whether or not the yawning determination condition condition included in the yawning determination condition model 121 is satisfied (S14). Here, the yawning determination unit 144 is the number of frames in which the degree of continuation of the yawning opening is a predetermined value (for example, 40) or more among the predetermined number of frames in the past (for example, 100 frames). When is a predetermined number (for example, 20) or more, it is determined to be yawning. If the yawning determination condition is satisfied (Yes in S14), the process proceeds to step S15, and if the yawning determination condition is not satisfied (No in S14), the process proceeds to step S16.

ステップＳ１５では、欠伸判定部１４４は、欠伸と判定し、その判定結果を欠伸判定結果記憶部１４６に記憶する。なお、欠伸判定部１４４は、欠伸は続けて発生しないと想定し、一度欠伸と判定したら一定フレーム（例えば、１００フレーム）は、欠伸と判定しない。
一方、ステップＳ１６では、欠伸判定部１４４は、欠伸ではないと判定し、その判定結果を欠伸判定結果記憶部１４６に記憶する。In step S15, the yawning determination unit 144 determines that the yawning is due, and stores the determination result in the yawning determination result storage unit 146. The yawning determination unit 144 assumes that yawning does not occur continuously, and once yawning is determined, a certain frame (for example, 100 frames) is not determined to be yawning.
On the other hand, in step S16, the yawning determination unit 144 determines that the yawning is not, and stores the determination result in the yawning determination result storage unit 146.

以上のように、実施の形態１によれば、動画像に映っている人物が会話している状態では、欠伸と判定しないため、人物の口が開いている状態から、精度よく欠伸を検出することができる。 As described above, according to the first embodiment, when the person shown in the moving image is talking, it is not determined to be yawning, so that the yawning is detected accurately from the state where the person's mouth is open. be able to.

実施の形態２．
図１１は、実施の形態２に係る情報処理装置としての欠伸判定装置２００の構成を概略的に示すブロック図である。
欠伸判定装置２００は、撮像部１１０と、データベース部２２０と、処理部２３０と、表示部１５０とを備える。
実施の形態２に係る欠伸判定装置２００の撮像部１１０及び表示部１５０は、実施の形態１に係る欠伸判定装置１００の撮像部１１０及び表示部１５０と同様である。Embodiment 2.
FIG. 11 is a block diagram schematically showing the configuration of the yawning determination device 200 as the information processing device according to the second embodiment.
The yawn determination device 200 includes an image pickup unit 110, a database unit 220, a processing unit 230, and a display unit 150.
The image pickup unit 110 and the display unit 150 of the yawn determination device 200 according to the second embodiment are the same as the image pickup unit 110 and the display unit 150 of the yawn determination device 100 according to the first embodiment.

データベース部２２０は、欠伸判定条件モデル２２１と、顔表情特徴モデル２２２とを記憶する記憶部である。
欠伸判定条件モデル２２１は、開口区別判定条件及び欠伸判定条件を含む。
実施の形態２における欠伸判定条件モデル２２１の欠伸判定条件は、実施の形態１における欠伸判定条件モデル１２１の欠伸判定条件と同様である。The database unit 220 is a storage unit that stores the yawning determination condition model 221 and the facial expression feature model 222.
The yawn determination condition model 221 includes an opening distinction determination condition and a yawn determination condition.
The yawning determination condition of the yawning determination condition model 221 in the second embodiment is the same as the yawning determination condition of the yawning determination condition model 121 in the first embodiment.

開口区別判定条件は、予め定められた要因による開口と、予め定められた要因ではない要因による開口とを区別するために使用される判定条件である。
図１２は、開口区別判定条件の一例を示す概略図である。
実施の形態２では、開口区別判定条件は、後述する顔表情特徴比較結果に応じて、予め定められた要因による開口か、予め定められた要因ではない要因による開口かを判定する条件になっている。実施の形態２では、予め定められた要因は、人物の表情が、笑顔又は怒った顔であることである。The opening distinction determination condition is a determination condition used to distinguish between an opening due to a predetermined factor and an opening due to a factor other than the predetermined factor.
FIG. 12 is a schematic view showing an example of the opening distinction determination condition.
In the second embodiment, the opening distinction determination condition is a condition for determining whether the opening is due to a predetermined factor or a factor other than the predetermined factor according to the facial expression feature comparison result described later. There is. In the second embodiment, the predetermined factor is that the facial expression of the person is a smiling face or an angry face.

顔表情特徴モデル２２２は、笑顔判定条件及び驚き判定条件を含む。
笑顔判定条件は、顔特徴量に基づいて、動画像に含まれている人物の表情が笑顔であることを判定するための条件である。
驚き判定条件は、顔特徴量に基づいて、動画像に含まれている人物の表情が驚いた顔であることを判定するための条件である。The facial expression feature model 222 includes a smile determination condition and a surprise determination condition.
The smile determination condition is a condition for determining that the facial expression of the person included in the moving image is a smile based on the facial feature amount.
The surprise determination condition is a condition for determining that the facial expression of the person included in the moving image is a surprised face based on the facial feature amount.

図１３は、顔表情特徴モデル２２２の一例を示す概略図である。
顔特徴量が、図１３に示されている何れかの条件を満たす場合には、人物の表情が笑顔又は驚いた顔であると判定される。FIG. 13 is a schematic view showing an example of the facial expression feature model 222.
When the facial feature amount satisfies any of the conditions shown in FIG. 13, it is determined that the facial expression of the person is a smiling face or a surprised face.

図１１に戻り、処理部２３０は、欠伸判定装置２００での処理を実行する。
図１４は、実施の形態２における処理部２３０の構成を概略的に示すブロック図である。
処理部２３０は、入力部１３１と、顔領域抽出部１３２と、顔特徴点抽出部１３３と、顔特徴点記憶部１３４と、顔特徴量算出部１３５と、顔特徴量記憶部１３６と、開口度特定部１３７と、開口度記憶部１３８と、開口継続度算出部１３９と、開口継続度記憶部１４０と、判定処理部２４１と、欠伸判定結果記憶部１４６と、出力部１４７とを備える。Returning to FIG. 11, the processing unit 230 executes the processing by the yawning determination device 200.
FIG. 14 is a block diagram schematically showing the configuration of the processing unit 230 according to the second embodiment.
The processing unit 230 includes an input unit 131, a face area extraction unit 132, a face feature point extraction unit 133, a face feature point storage unit 134, a face feature amount calculation unit 135, a face feature amount storage unit 136, and an opening. It includes a degree specifying unit 137, an opening degree storage unit 138, an opening continuity calculation unit 139, an opening continuity storage unit 140, a determination processing unit 241 and a stretch determination result storage unit 146, and an output unit 147.

判定処理部２４１は、欠伸判定部２４４と、一時記憶部１４５と、顔表情特徴比較部２４８と、顔表情特徴比較結果記憶部２４９とを備える。 The determination processing unit 241 includes a yawn determination unit 244, a temporary storage unit 145, a facial expression feature comparison unit 248, and a facial expression feature comparison result storage unit 249.

実施の形態２における処理部２３０の入力部１３１、顔領域抽出部１３２、顔特徴点抽出部１３３、顔特徴点記憶部１３４、顔特徴量算出部１３５、顔特徴量記憶部１３６、開口度特定部１３７、開口度記憶部１３８、開口継続度算出部１３９、開口継続度記憶部１４０、一時記憶部１４５、欠伸判定結果記憶部１４６及び出力部１４７は、実施の形態１における処理部１３０の入力部１３１、顔領域抽出部１３２、顔特徴点抽出部１３３、顔特徴点記憶部１３４、顔特徴量算出部１３５、顔特徴量記憶部１３６、開口度特定部１３７、開口度記憶部１３８、開口継続度算出部１３９、開口継続度記憶部１４０、一時記憶部１４５、欠伸判定結果記憶部１４６及び出力部１４７と同様である。 Input unit 131, face area extraction unit 132, face feature point extraction unit 133, face feature point storage unit 134, face feature amount calculation unit 135, face feature amount storage unit 136, opening degree specification of the processing unit 230 in the second embodiment. Unit 137, opening degree storage unit 138, opening continuity calculation unit 139, opening continuity storage unit 140, temporary storage unit 145, stretch determination result storage unit 146, and output unit 147 are inputs of the processing unit 130 in the first embodiment. Unit 131, face area extraction unit 132, face feature point extraction unit 133, face feature point storage unit 134, face feature amount calculation unit 135, face feature amount storage unit 136, opening degree identification unit 137, opening degree storage unit 138, opening. This is the same as the continuity calculation unit 139, the opening continuity storage unit 140, the temporary storage unit 145, the stretchout determination result storage unit 146, and the output unit 147.

実施の形態２でも、判定処理部２４１は、フレーム内の人物が欠伸以外の予め定められた要因による開口を行っているか否かを判定する。
但し、実施の形態２では、判定処理部２４１は、人物の表情が欠伸とは異なる予め定められた表情である場合に、予め定められた要因による開口を行っていると判定する。
実施の形態２における判定処理部２４１での処理は、欠伸判定部２４４、一時記憶部１４５、顔表情特徴比較部２４８及び顔表情特徴比較結果記憶部２４９により実現される。以下説明する。Also in the second embodiment, the determination processing unit 241 determines whether or not the person in the frame makes an opening due to a predetermined factor other than yawning.
However, in the second embodiment, the determination processing unit 241 determines that the opening is performed by a predetermined factor when the facial expression of the person is a predetermined facial expression different from the yawning.
The processing in the determination processing unit 241 in the second embodiment is realized by the yawn determination unit 244, the temporary storage unit 145, the facial expression feature comparison unit 248, and the facial expression feature comparison result storage unit 249. This will be described below.

顔表情特徴比較部２４８は、顔特徴量から、フレーム内の人物が予め定められた表情であるか否かを判定する顔表情判定部である。ここでは、顔表情特徴比較部２４８は、現在の顔特徴と、顔表情特徴（例えば、笑顔又は驚き時に表出する顔の特徴）とを比較することで、人物の表情を判定する。 The facial expression feature comparison unit 248 is a facial expression determination unit that determines whether or not a person in the frame has a predetermined facial expression based on the facial feature amount. Here, the facial expression feature comparison unit 248 determines the facial expression of a person by comparing the current facial feature with the facial expression feature (for example, the facial feature that appears at the time of a smile or surprise).

例えば、顔表情特徴比較部２４８は、顔特徴量記憶部１３６に記憶されている顔特徴量を用いて欠伸時に発生しない顔特徴が見られるかを判定する。一般に、人物の顔が笑顔又は驚いた顔になっている場合には、欠伸時には発生しない顔特徴が表れる。従って、顔表情特徴比較部２４８は、人物の顔の特徴量が、欠伸時には発生しない顔特徴を示す顔特徴量となっている場合には、欠伸を要因とする開口ではないと判定する。 For example, the facial expression feature comparison unit 248 uses the facial feature amount stored in the facial feature amount storage unit 136 to determine whether facial features that do not occur during yawning can be seen. In general, when a person's face is smiling or surprised, facial features that do not occur during yawning appear. Therefore, when the facial expression feature comparison unit 248 is a facial feature that indicates a facial feature that does not occur during yawning, the facial expression feature comparison unit 248 determines that the opening is not due to yawning.

具体的には、顔表情特徴比較部２４８は、顔特徴量記憶部１３６に記憶されている顔表情の何れかが、図１３の顔表情特徴モデル２２２に示されている判定条件を満たしている場合には、笑顔による開口又は驚きによる開口であると判定する。 Specifically, in the facial expression feature comparison unit 248, any of the facial expressions stored in the facial feature amount storage unit 136 satisfies the determination condition shown in the facial expression feature model 222 of FIG. In some cases, it is determined that the opening is due to a smile or an opening due to surprise.

なお、顔表情特徴比較部２４８は、顔特徴量記憶部１３６に記憶されている顔特徴量を用いて、ＲａｎｄｏｍＦｏｒｅｓｔ、ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）、Ａｄａｂｏｏｓｔ、ＣＮＮ（ＣｏｎｖｏｌｕｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ）等の機械学習技術を用いて、予め定められた要因による開口か否かを判定してもよい。 The facial feature comparison unit 248 uses the facial feature amount stored in the facial feature amount storage unit 136 to perform machine learning such as Random Forest, SVM (Support Vector Machine), AdaBoost, and CNN (Convolutional Neural Network). Techniques may be used to determine if the opening is due to a predetermined factor.

顔表情特徴比較結果記憶部２４９は、顔表情特徴比較部２４８での比較結果（判定結果ともいう）を記憶する。
顔表情特徴比較部２４８での比較結果は、例えば、「笑顔による開口」、「驚きによる開口」又は「開口」であるものとする。「開口」は、「笑顔による開口」又は「驚きによる開口」であると判定されなかった場合の比較結果である。ここで、「笑顔による開口」は、フレーム内の人物の表情が笑顔であることを示し、「驚きによる開口」は、その表情が驚いた顔であることを示す。The facial expression feature comparison result storage unit 249 stores the comparison result (also referred to as a determination result) in the facial expression feature comparison unit 248.
The comparison result in the facial expression feature comparison unit 248 is, for example, "aperture due to a smile", "opening due to surprise", or "opening". The "opening" is a comparison result when it is not determined to be "opening due to a smile" or "opening due to surprise". Here, "aperture by smile" indicates that the facial expression of the person in the frame is a smile, and "aperture by surprise" indicates that the facial expression is a surprised face.

欠伸判定部２４４は、開口継続度記憶部１４０に記憶されている開口継続度と、顔表情特徴比較結果記憶部２４９に記憶されている比較結果とを用いて、予め定められた要因による開口であるか否かを判別し、欠伸を判定する。例えば、欠伸判定部２４４は、顔表情特徴比較部２４８での判定結果が、人物の表情が予め定められた表情である場合に、予め定められた要因による開口を行っていると判定し、その判定結果が、その人物の表情が予め定められた表情ではない場合に、予め定められた要因ではない要因による開口であると判定する。 The yawning determination unit 244 uses an opening continuity stored in the opening continuity storage unit 140 and a comparison result stored in the facial expression feature comparison result storage unit 249 to determine an opening due to a predetermined factor. It is determined whether or not it is present, and yawning is determined. For example, the yawning determination unit 244 determines that when the determination result of the facial expression feature comparison unit 248 is a predetermined facial expression, the opening is performed by a predetermined factor. When the determination result is that the facial expression of the person is not a predetermined facial expression, it is determined that the opening is due to a factor other than the predetermined factor.

具体的には、欠伸判定部２４４は、顔表情特徴比較結果記憶部２４９に記憶されている比較結果が「笑顔による開口」又は「驚きによる開口」である場合には、予め定められた要因による開口と判定し、その比較結果が「開口」である場合には、予め定められた要因ではない要因による開口と判定する。 Specifically, the yawning determination unit 244 is based on a predetermined factor when the comparison result stored in the facial expression feature comparison result storage unit 249 is "opening due to smile" or "opening due to surprise". If it is determined to be an opening and the comparison result is "opening", it is determined to be an opening due to a factor other than a predetermined factor.

次に、欠伸判定部２４４は、予め定められた要因による開口と判定した場合には、無効値を欠伸開口継続値とし、予め定められた要因ではない要因による開口と判定した場合には、そのフレームの開口継続度を欠伸開口継続度として、そのフレームを識別するための識別情報であるフレーム識別番号に対応付けて、一時記憶部１４５に記憶する。
そして、欠伸判定部２４４は、欠伸判定条件モデル２２１に含まれている欠伸判定条件が満たされている場合に、欠伸と判定する。Next, when the yawning determination unit 244 determines that the opening is due to a predetermined factor, the invalid value is set as the continuation value of the yawning opening, and when it is determined that the opening is due to a factor other than the predetermined factor, the invalid value is used. The degree of opening continuity of the frame is set as the degree of continuation of yawning, and is stored in the temporary storage unit 145 in association with the frame identification number which is the identification information for identifying the frame.
Then, the yawning determination unit 244 determines that yawning is achieved when the yawning determination condition included in the yawning determination condition model 221 is satisfied.

図１５は、実施の形態２における欠伸判定部２４４での処理を示すフローチャートである。
まず、欠伸判定部２４４は、開口継続度記憶部１４０に記憶されている開口継続度と、顔表情特徴比較結果記憶部２４９に記憶されている比較結果とを取得する（Ｓ２０）。FIG. 15 is a flowchart showing the processing in the yawning determination unit 244 in the second embodiment.
First, the yawning determination unit 244 acquires the opening continuity stored in the opening continuity storage unit 140 and the comparison result stored in the facial expression feature comparison result storage unit 249 (S20).

次に、欠伸判定部２４４は、取得された比較結果が予め定められた要因による開口であるか否かを判定する（Ｓ２１）。例えば、欠伸判定部２４４は、比較結果が「笑顔による開口」又は「驚きによる開口」である場合には、予め定められた要因による開口であると判定し、比較結果が「開口」である場合には、予め定められた要因ではない要因による開口であると判定する。予め定められた要因による開口である場合（Ｓ２１でＹｅｓ）には、処理はステップＳ２２に進み、予め定められた要因ではない要因による開口である場合（Ｓ２１でＮｏ）には、処理はステップＳ２３に進む。 Next, the yawning determination unit 244 determines whether or not the acquired comparison result is an opening due to a predetermined factor (S21). For example, when the comparison result is "opening due to smile" or "opening due to surprise", the yawning determination unit 244 determines that the opening is due to a predetermined factor, and when the comparison result is "opening". Is determined to be an opening due to a factor other than a predetermined factor. If the opening is due to a predetermined factor (Yes in S21), the process proceeds to step S22, and if the opening is due to a factor other than the predetermined factor (No in S21), the process proceeds to step S23. Proceed to.

ステップＳ２２では、欠伸判定部２４４は、開口継続度を予め定められた値である無効値（例えば、「－１」）とする。そして、処理はステップＳ２３に進む。 In step S22, the yawning determination unit 244 sets the opening continuity to an invalid value (for example, “-1”) which is a predetermined value. Then, the process proceeds to step S23.

ステップＳ２３では、欠伸判定部２４４は、現フレームの開口継続度を、欠伸開口継続度として、そのフレーム番号とともに一時記憶部１４５に記憶させる。ここで、欠伸開口継続度は、予め定められた要因ではない要因による開口である場合には、現フレームに対応して開口継続度算出部１３９で算出された開口継続度であり、予め定められた要因による開口である場合には、ステップＳ２２で設定された無効値となる。 In step S23, the yawning determination unit 244 stores the opening continuity of the current frame as the yawning opening continuity in the temporary storage unit 145 together with the frame number. Here, the yawning opening continuity is the opening continuity calculated by the opening continuity calculation unit 139 corresponding to the current frame when the opening is due to a factor other than a predetermined factor, and is predetermined. If the opening is due to a factor, the invalid value set in step S22 is used.

次に、欠伸判定部２４４は、欠伸判定条件モデル２２１に含まれている欠伸判定条件を満たすか否かを判定する（Ｓ２４）。ここでは、欠伸判定部２４４は、過去の予め定められた数のフレーム（例えば、１００フレーム）の内、欠伸開口継続度が予め定められた値（例えば、４０）以上となっているフレームの数が予め定められた数（例えば、２０）以上のときに、欠伸と判定する。欠伸判定条件が満たされている場合（Ｓ２４でＹｅｓ）には、処理はステップＳ２５に進み、欠伸判定条件が満たされていない場合（Ｓ２４でＮｏ）には、処理はステップＳ２６に進む。 Next, the yawning determination unit 244 determines whether or not the yawning determination condition condition included in the yawning determination condition model 221 is satisfied (S24). Here, the yawning determination unit 244 is the number of frames in which the degree of continuation of the yawning opening is a predetermined value (for example, 40) or more among the predetermined number of frames in the past (for example, 100 frames). When is a predetermined number (for example, 20) or more, it is determined to be yawning. If the yawning determination condition is satisfied (Yes in S24), the process proceeds to step S25, and if the yawning determination condition is not satisfied (No in S24), the process proceeds to step S26.

ステップＳ２５では、欠伸判定部２４４は、欠伸と判定し、その判定結果を欠伸判定結果記憶部１４６に記憶する。なお、欠伸判定部２４４は、欠伸は続けて発生しないと想定し、一度欠伸と判定したら一定フレーム（例えば、１００フレーム）は、欠伸と判定しない。
一方、ステップＳ２６では、欠伸判定部１４４は、欠伸ではないと判定し、その判定結果を欠伸判定結果記憶部１４６に記憶する。In step S25, the yawning determination unit 244 determines that the yawning occurs, and stores the determination result in the yawning determination result storage unit 146. The yawning determination unit 244 assumes that yawning does not occur continuously, and once yawning is determined, a certain frame (for example, 100 frames) is not determined to be yawning.
On the other hand, in step S26, the yawning determination unit 144 determines that the yawning is not, and stores the determination result in the yawning determination result storage unit 146.

以上のように、実施の形態２によれば、動画像に映っている人物の表情が笑顔又は驚いた顔となっている状態では、欠伸と判定しないため、人物の口が開いている状態から、精度よく欠伸を検出することができる。 As described above, according to the second embodiment, when the facial expression of the person in the moving image is a smile or a surprised face, it is not determined to be yawning, so that the person's mouth is open. , Yawning can be detected accurately.

実施の形態３．
図１１に示されているように、実施の形態３に係る情報処理装置としての欠伸判定装置３００は、撮像部１１０と、データベース部３２０と、処理部３３０と、表示部１５０とを備える。
実施の形態３に係る欠伸判定装置３００の撮像部１１０及び表示部１５０は、実施の形態１に係る欠伸判定装置１００の撮像部１１０及び表示部１５０と同様である。Embodiment 3.
As shown in FIG. 11, the yawning determination device 300 as the information processing device according to the third embodiment includes an imaging unit 110, a database unit 320, a processing unit 330, and a display unit 150.
The image pickup unit 110 and the display unit 150 of the yawn determination device 300 according to the third embodiment are the same as the image pickup unit 110 and the display unit 150 of the yawn determination device 100 according to the first embodiment.

データベース部３２０は、欠伸判定条件モデル３２１と、顔表情特徴モデル２２２とを記憶する記憶部である。
実施の形態３における顔表情特徴モデル２２２は、実施の形態２における顔表情特徴モデル２２２と同様である。The database unit 320 is a storage unit that stores the yawning determination condition model 321 and the facial expression feature model 222.
The facial expression feature model 222 in the third embodiment is the same as the facial expression feature model 222 in the second embodiment.

欠伸判定条件モデル３２１は、開口区別判定条件及び欠伸判定条件を含む。
実施の形態３における欠伸判定条件モデル３２１の欠伸判定条件は、実施の形態１における欠伸判定条件モデル１２１の欠伸判定条件と同様である。The yawn determination condition model 321 includes an opening distinction determination condition and a yawn determination condition.
The yawning determination condition of the yawning determination condition model 321 in the third embodiment is the same as the yawning determination condition of the yawning determination condition model 121 in the first embodiment.

開口区別判定条件は、予め定められた要因による開口と、予め定められた要因ではない要因による開口とを区別するために使用される判定条件である。
図１６は、開口区別判定条件の一例を示す概略図である。
実施の形態３では、開口区別判定条件は、開口速度評価値が予め定められた閾値（例えば、２０）未満である場合には、予め定められた要因による開口と判定し、その開口速度評価値が予め定められた閾値以上である場合には、予め定められた要因ではない要因による開口であると判定する条件と、顔表情特徴比較結果に応じて、予め定められた要因による開口か、予め定められた要因ではない要因による開口かを判定する条件とになっている。実施の形態３でも、予め定められた要因は、人物の表情が、笑顔又は怒った顔であることである。The opening distinction determination condition is a determination condition used to distinguish between an opening due to a predetermined factor and an opening due to a factor other than the predetermined factor.
FIG. 16 is a schematic view showing an example of the opening distinction determination condition.
In the third embodiment, when the opening speed evaluation value is less than a predetermined threshold value (for example, 20), the opening discrimination determination condition determines that the opening is due to a predetermined factor, and the opening speed evaluation value is determined. If is equal to or greater than a predetermined threshold value, the opening is due to a factor other than the predetermined factor, and depending on the facial expression feature comparison result, the opening is due to the predetermined factor. It is a condition to judge whether the opening is due to a factor other than the specified factor. Also in the third embodiment, the predetermined factor is that the facial expression of the person is a smiling face or an angry face.

図１１に戻り、処理部３３０は、欠伸判定装置３００での処理を実行する。
図１７は、実施の形態３における処理部３３０の構成を概略的に示すブロック図である。
処理部３３０は、入力部１３１と、顔領域抽出部１３２と、顔特徴点抽出部１３３と、顔特徴点記憶部１３４と、顔特徴量算出部１３５と、顔特徴量記憶部１３６と、開口度特定部１３７と、開口度記憶部１３８と、開口継続度算出部１３９と、開口継続度記憶部１４０と、判定処理部３４１と、欠伸判定結果記憶部１４６と、出力部１４７とを備える。Returning to FIG. 11, the processing unit 330 executes processing by the yawning determination device 300.
FIG. 17 is a block diagram schematically showing the configuration of the processing unit 330 in the third embodiment.
The processing unit 330 includes an input unit 131, a face area extraction unit 132, a face feature point extraction unit 133, a face feature point storage unit 134, a face feature amount calculation unit 135, a face feature amount storage unit 136, and an opening. It includes a degree specifying unit 137, an opening degree storage unit 138, an opening continuity degree calculation unit 139, an opening continuity degree storage unit 140, a determination processing unit 341, a missing extension determination result storage unit 146, and an output unit 147.

判定処理部３４１は、開口速度評価値算出部１４２と、開口速度評価値記憶部１４３と、欠伸判定部３４４と、一時記憶部１４５と、顔表情特徴比較部２４８と、顔表情特徴比較結果記憶部２４９とを備える。 The determination processing unit 341 includes an opening speed evaluation value calculation unit 142, an opening speed evaluation value storage unit 143, a stretch determination unit 344, a temporary storage unit 145, a facial expression feature comparison unit 248, and a facial expression feature comparison result storage. A unit 249 is provided.

実施の形態３における処理部３３０の入力部１３１、顔領域抽出部１３２、顔特徴点抽出部１３３、顔特徴点記憶部１３４、顔特徴量算出部１３５、顔特徴量記憶部１３６、開口度特定部１３７、開口度記憶部１３８、開口継続度算出部１３９、開口継続度記憶部１４０、一時記憶部１４５、欠伸判定結果記憶部１４６及び出力部１４７は、実施の形態１における処理部１３０の入力部１３１、顔領域抽出部１３２、顔特徴点抽出部１３３、顔特徴点記憶部１３４、顔特徴量算出部１３５、顔特徴量記憶部１３６、開口度特定部１３７、開口度記憶部１３８、開口継続度算出部１３９、開口継続度記憶部１４０、一時記憶部１４５、欠伸判定結果記憶部１４６及び出力部１４７と同様である。 Input unit 131, face area extraction unit 132, face feature point extraction unit 133, face feature point storage unit 134, face feature amount calculation unit 135, face feature amount storage unit 136, opening degree specification of the processing unit 330 in the third embodiment. Unit 137, opening degree storage unit 138, opening continuity calculation unit 139, opening continuity storage unit 140, temporary storage unit 145, stretch determination result storage unit 146, and output unit 147 are inputs of the processing unit 130 in the first embodiment. Unit 131, face area extraction unit 132, face feature point extraction unit 133, face feature point storage unit 134, face feature amount calculation unit 135, face feature amount storage unit 136, opening degree identification unit 137, opening degree storage unit 138, opening. This is the same as the continuity calculation unit 139, the opening continuity storage unit 140, the temporary storage unit 145, the stretchout determination result storage unit 146, and the output unit 147.

また、実施の形態３における処理部３３０の顔表情特徴比較部２４８及び顔表情特徴比較結果記憶部２４９は、実施の形態２における処理部２３０の顔表情特徴比較部２４８及び顔表情特徴比較結果記憶部２４９と同様である。 Further, the facial expression feature comparison unit 248 and the facial expression feature comparison result storage unit 249 of the processing unit 330 in the third embodiment store the facial expression feature comparison unit 248 and the facial expression feature comparison result storage of the processing unit 230 in the second embodiment. It is the same as the part 249.

実施の形態３でも、判定処理部３４１は、フレーム内の人物が予め定められた要因による開口を行っているか否かを判定する。
但し、実施の形態３では、判定処理部３４１は、その人物が会話を行っている場合、又は、その人物の表情が欠伸とは異なる予め定められた表情である場合に、予め定められた要因による開口を行っていると判定する。
実施の形態３における判定処理部３４１での処理は、開口速度評価値算出部１４２、開口速度評価値記憶部１４３、欠伸判定部３４４、一時記憶部１４５、顔表情特徴比較部２４８及び顔表情特徴比較結果記憶部２４９により実現される。以下説明する。Also in the third embodiment, the determination processing unit 341 determines whether or not the person in the frame makes an opening due to a predetermined factor.
However, in the third embodiment, the determination processing unit 341 determines a predetermined factor when the person is having a conversation or when the facial expression of the person is a predetermined facial expression different from the yawning. It is determined that the opening is performed by.
The processing in the determination processing unit 341 in the third embodiment includes the opening speed evaluation value calculation unit 142, the opening speed evaluation value storage unit 143, the stretch determination unit 344, the temporary storage unit 145, the facial expression feature comparison unit 248, and the facial expression feature. It is realized by the comparison result storage unit 249. This will be described below.

欠伸判定部３４４は、開口継続度記憶部１４０に記憶されている開口継続度と、開口速度評価値記憶部１４３に記憶されている開口速度と、顔表情特徴比較結果記憶部２４９に記憶されている比較結果とを用いて、予め定められた要因による開口と、予め定められた要因ではない要因による開口とを区別し、欠伸を判定する。 The missing stretch determination unit 344 is stored in the opening continuity storage unit 140, the opening speed stored in the opening speed evaluation value storage unit 143, and the facial expression feature comparison result storage unit 249. Using the comparison results, the opening due to a predetermined factor and the opening due to a factor other than the predetermined factor are distinguished, and the loss is determined.

具体的には、欠伸判定部３４４は、開口速度評価値記憶部１４３に記憶されている開口速度評価値が欠伸判定条件モデル１２１に含まれている開口区別判定条件において、開口速度評価値が予め定められた閾値（例えば、２０）未満である場合には、予め定められた要因による開口と判定する。 Specifically, the yawning determination unit 344 sets the opening speed evaluation value in advance under the opening distinction determination condition in which the opening speed evaluation value stored in the opening speed evaluation value storage unit 143 is included in the yawning determination condition model 121. If it is less than a predetermined threshold (for example, 20), it is determined that the opening is due to a predetermined factor.

また、欠伸判定部３４４は、顔表情特徴比較結果記憶部２４９に記憶されている比較結果が「笑顔による開口」又は「驚きによる開口」である場合には、予め定められた要因による開口であると判定し、その比較結果が「開口」である場合には、予め定められた要因ではない要因による開口と判定する。 Further, the yawning determination unit 344 is an opening due to a predetermined factor when the comparison result stored in the facial expression feature comparison result storage unit 249 is "opening due to smile" or "opening due to surprise". If the comparison result is "opening", it is determined that the opening is due to a factor other than a predetermined factor.

次に、欠伸判定部３４４は、予め定められた要因による開口と判定した場合には、無効値を欠伸開口継続値とし、予め定められた要因ではない要因による開口と判定した場合には、そのフレームの開口継続度を欠伸開口継続度として、そのフレームを識別するための識別情報であるフレーム識別番号に対応付けて、一時記憶部１４５に記憶する。
そして、欠伸判定部３４４は、欠伸判定条件モデル３２１に含まれている欠伸判定条件が満たされている場合に、欠伸と判定する。
ここでは、欠伸判定部３４４は、過去の予め定められた数のフレーム（例えば、１００フレーム）の内、欠伸開口継続度が予め定められた値（例えば、４０）以上となっているフレームの数が予め定められた数（例えば、２０）以上のときに、欠伸と判定する。Next, when the yawning determination unit 344 determines that the opening is due to a predetermined factor, the invalid value is set as the continuation value of the yawning opening, and when it is determined that the opening is due to a factor other than the predetermined factor, the invalid value is used. The degree of opening continuity of the frame is set as the degree of continuation of yawning, and is stored in the temporary storage unit 145 in association with the frame identification number which is the identification information for identifying the frame.
Then, the yawning determination unit 344 determines that yawning is achieved when the yawning determination condition included in the yawning determination condition model 321 is satisfied.
Here, the yawning determination unit 344 is the number of frames in which the degree of continuation of the yawning opening is a predetermined value (for example, 40) or more among the predetermined number of frames in the past (for example, 100 frames). When is a predetermined number (for example, 20) or more, it is determined to be yawning.

図１８は、実施の形態３における欠伸判定部３４４での処理を示すフローチャートである。
まず、欠伸判定部３４４は、開口継続度記憶部１４０に記憶されている開口継続度と、開口速度評価値記憶部１４３に記憶されている開口速度評価値と、顔表情特徴比較結果記憶部２４９に記憶されている比較結果とを取得する（Ｓ３０）。FIG. 18 is a flowchart showing the process in the yawning determination unit 344 in the third embodiment.
First, the stretchlessness determination unit 344 has an opening continuity stored in the opening continuity storage unit 140, an opening speed evaluation value stored in the opening speed evaluation value storage unit 143, and a facial expression feature comparison result storage unit 249. The comparison result stored in is acquired (S30).

次に、欠伸判定部３４４は、取得された開口速度評価値が、開口区別判定条件における閾値ａ以上であるか否かを判定する（Ｓ３１）。開口速度評価値が閾値ａ未満である場合（Ｓ３１でＮｏ）には、処理はステップＳ３２に進み、開口速度評価値が閾値ａ以上である場合（Ｓ３１でＹｅｓ）には、処理はステップＳ３３に進む。 Next, the yawning determination unit 344 determines whether or not the acquired opening speed evaluation value is equal to or greater than the threshold value a in the opening discrimination determination condition (S31). When the opening speed evaluation value is less than the threshold value a (No in S31), the process proceeds to step S32, and when the opening speed evaluation value is equal to or more than the threshold value a (Yes in S31), the process proceeds to step S33. move on.

ステップＳ３２では、開口速度評価値が閾値ａ未満であり、動画像に映っている人物の開口が欠伸以外の会話であると考えられるため、欠伸判定部３４４は、開口継続度を予め定められた値である無効値（例えば、「－１」）とする。そして、処理はステップＳ３３に進む。 In step S32, the opening speed evaluation value is less than the threshold value a, and it is considered that the opening of the person shown in the moving image is a conversation other than yawning. Therefore, the yawning determination unit 344 determines the opening continuity degree in advance. It is an invalid value (for example, "-1") which is a value. Then, the process proceeds to step S33.

ステップＳ３３では、欠伸判定部３４４は、取得された比較結果が予め定められた要因による開口であるか否かを判定する。例えば、欠伸判定部３４４は、比較結果が「笑顔による開口」又は「驚きによる開口」である場合には、予め定められた要因による開口であると判定し、比較結果が「開口」である場合には、予め定められた要因ではない要因による開口であると判定する。予め定められた要因による開口である場合（Ｓ３３でＹｅｓ）には、処理はステップＳ３４に進み、予め定められた要因ではない要因による開口である場合（Ｓ３４でＮｏ）には、処理はステップＳ３５に進む。 In step S33, the yawning determination unit 344 determines whether or not the acquired comparison result is an opening due to a predetermined factor. For example, when the comparison result is "opening due to smile" or "opening due to surprise", the yawning determination unit 344 determines that the opening is due to a predetermined factor, and when the comparison result is "opening". Is determined to be an opening due to a factor other than a predetermined factor. If the opening is due to a predetermined factor (Yes in S33), the process proceeds to step S34, and if the opening is due to a factor other than the predetermined factor (No in S34), the process proceeds to step S35. Proceed to.

ステップＳ３４では、欠伸判定部３４４は、開口継続度を予め定められた値である無効値（例えば、「－１」）とする。そして、処理はステップＳ３５に進む。 In step S34, the yawning determination unit 344 sets the opening continuity to an invalid value (for example, “-1”) which is a predetermined value. Then, the process proceeds to step S35.

ステップＳ３５では、欠伸判定部３４４は、現フレームの開口継続度を、欠伸開口継続度として、そのフレーム番号とともに一時記憶部１４５に記憶させる。 In step S35, the yawning determination unit 344 stores the opening continuity of the current frame as the yawning opening continuity in the temporary storage unit 145 together with the frame number.

次に、欠伸判定部３４４は、欠伸判定条件モデル３２１に含まれている欠伸判定条件を満たすか否かを判定する（Ｓ３６）。ここでは、欠伸判定部３４４は、過去の予め定められた数のフレーム（例えば、１００フレーム）の内、欠伸開口継続度が予め定められた値（例えば、４０）以上となっているフレームの数が予め定められた数（例えば、２０）以上のときに、欠伸と判定する。欠伸判定条件が満たされている場合（Ｓ３６でＹｅｓ）には、処理はステップＳ３７に進み、欠伸判定条件が満たされていない場合（Ｓ３６でＮｏ）には、処理はステップＳ３８に進む。 Next, the yawning determination unit 344 determines whether or not the yawning determination condition condition included in the yawning determination condition model 321 is satisfied (S36). Here, the yawning determination unit 344 is the number of frames in which the degree of continuation of the yawning opening is a predetermined value (for example, 40) or more among the predetermined number of frames in the past (for example, 100 frames). When is a predetermined number (for example, 20) or more, it is determined to be yawning. If the yawning determination condition is satisfied (Yes in S36), the process proceeds to step S37, and if the yawning determination condition is not satisfied (No in S36), the process proceeds to step S38.

ステップＳ３７では、欠伸判定部３４４は、欠伸と判定し、その判定結果を欠伸判定結果記憶部１４６に記憶する。なお、欠伸判定部３４４は、欠伸は続けて発生しないと想定し、一度欠伸と判定したら一定フレーム（例えば、１００フレーム）は、欠伸と判定しない。
一方、ステップＳ３６では、欠伸判定部３４４は、欠伸ではないと判定し、その判定結果を欠伸判定結果記憶部１４６に記憶する。In step S37, the yawning determination unit 344 determines that the yawning occurs, and stores the determination result in the yawning determination result storage unit 146. The yawning determination unit 344 assumes that yawning does not occur continuously, and once yawning is determined, a certain frame (for example, 100 frames) is not determined to be yawning.
On the other hand, in step S36, the yawning determination unit 344 determines that the yawning is not, and stores the determination result in the yawning determination result storage unit 146.

以上のように、実施の形態３によれば、動画像に映っている人物が会話している状態、笑顔である状態、及び、驚いている状態では、欠伸と判定しないため、人物の口が開いている状態から、精度よく欠伸を検出することができる。 As described above, according to the third embodiment, when the person shown in the moving image is talking, smiling, or surprised, it is not determined to be yawning, so that the person's mouth is open. Yawning can be detected accurately from the open state.

以上に記載された実施の形態１～３では、欠伸判定装置１００、２００、３００が、撮像部１１０、データベース部１２０、処理部１３０、２３０、３３０及び表示部１５０の全てを備えているが、実施の形態１～３は、このような例に限定されない。例えば、撮像部１１０、データベース部１２０及び表示部１５０の少なくとも一つは、欠伸判定装置１００、２００、３００とネットワークで接続された別の装置であってもよい。 In the above-described embodiments 1 to 3, the yawning determination devices 100, 200, and 300 include all of the imaging unit 110, the database unit 120, the processing units 130, 230, 330, and the display unit 150. Embodiments 1 to 3 are not limited to such examples. For example, at least one of the image pickup unit 110, the database unit 120, and the display unit 150 may be another device connected to the yawn determination devices 100, 200, and 300 via a network.

１００，２００，３００欠伸判定装置、１１０撮像部、１２０，２２０，３２０データベース部、１２１，２２１，３２１欠伸判定条件モデル、２２２顔表情特徴モデル、１３０，２３０，３３０処理部、１３１入力部、１３２顔領域抽出部、１３３顔特徴点抽出部、１３４顔特徴点記憶部、１３５顔特徴量算出部、１３６顔特徴量記憶部、１３７開口度特定部、１３８開口度記憶部、１３９開口継続度算出部、１４０開口継続度記憶部、１４１，２４１，３４１判定処理部、１４２開口速度評価値算出部、１４３開口速度評価値記憶部、１４４，２４４，３４４欠伸判定部、１４５一時記憶部、１４６欠伸判定結果記憶部、１４７出力部、２４８顔表情特徴比較部、２４９顔表情特徴比較結果記憶部、１５０表示部。 100,200,300 deficiency determination device, 110 imaging unit, 120, 220, 320 database unit, 121,221,321 deficiency determination condition model, 222 facial expression feature model, 130, 230, 330 processing unit, 131 input unit, 132 Face area extraction unit, 133 Face feature point extraction unit, 134 Face feature point storage unit, 135 Face feature amount calculation unit, 136 Face feature amount storage unit, 137 Openness specification unit, 138 Openness storage unit, 139 Opening continuity calculation unit Unit, 140 Opening continuity storage unit, 141,241,341 Judgment processing unit, 142 Opening speed evaluation value calculation unit, 143 Opening speed evaluation value storage unit, 144,244,344 Missing extension judgment unit, 145 Temporary storage unit, 146 Missing extension Judgment result storage unit, 147 output unit, 248 facial expression feature comparison unit, 249 facial expression feature comparison result storage unit, 150 display unit.

Claims

A face area extraction unit that extracts a face area, which is a face area of a person, from each of a plurality of frames included in a moving image, and a face area extraction unit.
A face feature point extraction unit that extracts a plurality of predetermined feature points from the face area, and a face feature point extraction unit.
A face feature amount calculation unit that calculates a face feature amount indicating the face feature from the plurality of feature points, and a face feature amount calculation unit.
An opening degree specifying portion that specifies the opening degree, which is the degree to which the mouth is open in the face, from the facial feature amount.
An opening continuity calculation unit that calculates an opening continuity, which is the degree to which the mouth is continuously opened, by a plurality of opening degrees specified from the plurality of frames.
It is determined whether or not the person is opening due to a predetermined factor other than yawning, and a predetermined first threshold value is set in the frame determined to be opening due to the predetermined factor. A value lower than is associated with the yawning opening continuity, and the frame determined that the opening is not performed due to the predetermined factor is associated with the opening continuation as the yawning opening continuity, and the latest frame is included. When the number of frames associated with the yawning opening continuity that is equal to or greater than the first threshold value among the predetermined number of consecutive frames is equal to or greater than the predetermined second threshold value. An information processing apparatus including a determination processing unit for determining that the person has yawned.

The information processing apparatus according to claim 1, wherein the determination processing unit determines that the person is making an opening due to a predetermined factor when the person is having a conversation.

The second aspect of the present invention is characterized in that the determination processing unit determines that the person is having a conversation when the opening speed, which is the opening speed of the mouth, is less than a predetermined third threshold value. Information processing equipment.

The determination processing unit
An opening speed evaluation value calculation unit that calculates an opening speed evaluation value, which is a value that increases as the opening speed increases from the opening continuity.
3. The third aspect of the present invention is provided with a yawning determination unit for determining that the opening speed is less than the third threshold value when the opening speed evaluation value is less than a predetermined fourth threshold value. The information processing device described in.

The first aspect of claim 1, wherein the determination processing unit determines that the opening is performed by the predetermined factor when the facial expression of the person is a predetermined facial expression different from the yawning. Information processing equipment.

The determination processing unit
A facial expression determination unit that determines whether or not the facial expression of the person is a predetermined facial expression from the facial feature amount.
When the determination result in the facial expression determination unit is that the facial expression of the person is the predetermined facial expression, it is determined that the opening is performed by the predetermined factor, and the facial expression determination unit determines that the opening is performed. 5. The fifth aspect of the present invention is characterized in that the determination result includes a stretch determination unit for determining that the opening due to the predetermined factor is not performed when the facial expression of the person is not the predetermined facial expression. The information processing device described in.

The information processing apparatus according to claim 5 or 6, wherein the predetermined facial expression is a smiling face or a surprised face.

When the person is having a conversation or the facial expression of the person is a predetermined facial expression different from the yawning, the determination processing unit is said to perform the opening due to the predetermined factor. The information processing apparatus according to claim 1, wherein the determination is made.

The eighth aspect of the present invention is characterized in that the determination processing unit determines that the person is having a conversation when the opening speed, which is the opening speed of the mouth, is less than a predetermined third threshold value. Information processing equipment.

The determination processing unit
An opening speed evaluation value calculation unit that calculates an opening speed evaluation value, which is a value that increases as the opening speed increases from the opening continuity.
9. The aspect 9 is provided with a yawning determination unit for determining that the opening speed is less than the third threshold value when the opening speed evaluation value is less than a predetermined fourth threshold value. The information processing device described in.

The determination processing unit
A facial expression determination unit that determines whether or not the facial expression of the person is a predetermined facial expression from the facial feature amount.
When the determination result in the facial expression determination unit is that the facial expression of the person is the predetermined facial expression, it is determined that the opening is performed by the predetermined factor, and the determination in the facial expression determination unit. The eighth aspect of the present invention is characterized by comprising a stretchout determination unit for determining that the opening is not performed due to the predetermined factor when the facial expression of the person is not the predetermined facial expression. The information processing device described.

The information processing apparatus according to claim 11, wherein the predetermined facial expression is a smiling face or a surprised face.

Computer,
A face area extraction unit that extracts a face area, which is a face area of a person, from each of a plurality of frames included in a moving image.
A face feature point extraction unit that extracts a plurality of predetermined feature points from the face area,
A facial feature amount calculation unit that calculates a facial feature amount indicating the facial feature from the plurality of feature points.
An opening degree specifying portion that specifies the opening degree, which is the degree to which the mouth is open in the face, from the facial feature amount.
An opening continuity calculation unit that calculates an opening continuity, which is the degree to which the mouth is continuously open, based on the plurality of openings specified from the plurality of frames, and an opening continuity calculation unit.
It is determined whether or not the person is opening due to a predetermined factor other than yawning, and a predetermined first threshold value is set in the frame determined to be opening due to the predetermined factor. A value lower than is associated with the yawning opening continuity, and the frame determined that the opening is not performed due to the predetermined factor is associated with the opening continuation as the yawning opening continuity, and the latest frame is included. When the number of frames associated with the yawning opening continuity that is equal to or greater than the first threshold value among the predetermined number of consecutive frames is equal to or greater than the predetermined second threshold value. A program characterized by functioning as a determination processing unit for determining that the person has yawned.

The face area, which is the area of the human face, is extracted from each of the multiple frames included in the moving image.
A plurality of predetermined feature points are extracted from the face area, and a plurality of predetermined feature points are extracted.
From the plurality of feature points, a facial feature amount indicating the facial feature is calculated.
From the facial features, the degree of opening, which is the degree to which the mouth is open in the face, is specified.
The opening continuity, which is the degree to which the mouth is continuously opened, is calculated from the plurality of openings specified from the plurality of frames.
It is determined whether or not the person is opening due to a predetermined factor other than yawning, and a predetermined first threshold value is set in the frame determined to be opening due to the predetermined factor. Corresponding to a value lower than the yawning opening continuity,
The opening continuity is associated with the frame determined not to be opened due to the predetermined factor as the yawning opening continuity.
Of the predetermined number of consecutive frames including the latest frame, the number of frames associated with the yawning opening continuity that is equal to or higher than the first threshold value is the predetermined second threshold value. An information processing method characterized in that it is determined that the person has yawned in the above cases.