WO2015027289A1 - Method and apparatus for eye detection from glints - Google Patents

Method and apparatus for eye detection from glints Download PDF

Info

Publication number
WO2015027289A1
WO2015027289A1 PCT/AU2014/000868 AU2014000868W WO2015027289A1 WO 2015027289 A1 WO2015027289 A1 WO 2015027289A1 AU 2014000868 W AU2014000868 W AU 2014000868W WO 2015027289 A1 WO2015027289 A1 WO 2015027289A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
frames
reflections
specular
time series
Prior art date
Application number
PCT/AU2014/000868
Other languages
French (fr)
Inventor
Sebastian Rougeaux
Original Assignee
Seeing Machines Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2013903337A external-priority patent/AU2013903337A0/en
Application filed by Seeing Machines Limited filed Critical Seeing Machines Limited
Priority to EP14840504.6A priority Critical patent/EP3042341A4/en
Priority to CN201480059776.9A priority patent/CN105765608A/en
Priority to US14/916,082 priority patent/US10552675B2/en
Priority to JP2016539360A priority patent/JP2016532217A/en
Publication of WO2015027289A1 publication Critical patent/WO2015027289A1/en

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B3/00Apparatus for testing the eyes; Instruments for examining the eyes
    • A61B3/10Objective types, i.e. instruments for examining the eyes independent of the patients' perceptions or reactions
    • A61B3/113Objective types, i.e. instruments for examining the eyes independent of the patients' perceptions or reactions for determining or recording eye movement
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/18Eye characteristics, e.g. of the iris
    • G06V40/193Preprocessing; Feature extraction

Definitions

  • the present invention relates to the field of object detection and monitoring, and, in particular discloses a method and system for eye detection based on reflection structure.
  • embodiments of the invention are applicable in tracking the eye location of a user of a computer or mobile device (such as a smartphone or tablet), or a driver of a vehicle.
  • a method of determining the position of at least one eyeball within an image including the steps of: (a) capturing a time series of image frames illuminated in a predetermined temporal manner by at least two spaced apart light sources, by at least one imaging sensor; (b) processing the image frames to determine specular reflection locations in the image frames; and (c) utilising the time series evolution of the location of the specular reflections to isolate corneal reflections from the determined specular reflection locations.
  • the step (c) preferably can include utilising either a velocity or acceleration model of position evolution to model the location of the specular reflections corresponding to corneal reflections.
  • the isolate step preferably can include utilising an error measure between the model and the actual locations of the specular reflections in the image frames.
  • the model preferably can include maximum velocity or accelerations.
  • first and second light sources are included, wherein the first light source is actuated to illuminate one or both of the eyeballs during capture of even frames of the time series and the second light source is actuated to illuminate one or both of the eyeballs during capture of odd frames of the time series.
  • a plurality of light sources is included, each light source being actuated to illuminate one or both of the eyeballs during capture of predetermined frames of the time series.
  • an image processing system for detecting the position of an eyeball within an image, the system including: at least two image illumination sources for illuminating the image area in a predetermined temporal manner; an image sensor for capturing a sequence of temporal frames of the image area; a processor configured to process the temporal frames to determine specular reflection locations in the temporal frames; and second processing means for isolating likely corneal reflections from the specular reflection locations of a series of temporal frames.
  • a method of tracking one or more objects within a series of images including the steps of:
  • the step of applying one or more constraints preferably includes applying a motion model of the one or more objects based on the position of the specular reflections in a plurality of images.
  • a computer system configured to perform a method according to the third aspect.
  • a device configured to perform a method according to the third aspect.
  • Fig. 1 illustrates a first example complex image having a series of specular reflections
  • Fig. 2 illustrates a second example image having specular reflections
  • Fig. 3 illustrates schematically the geometry of creation of corneal reflections
  • Fig. 4 illustrates a flow chart of the steps of the preferred embodiment
  • Fig. 5 illustrates an example processing system suitable for implementation of the preferred embodiment
  • Fig. 6 illustrates the processing arrangement of the preferred embodiment.
  • the preferred embodiment provides a robust form of eye detection through the utilisation of the corneal reflection in a captured image. As the corneal reflection from the eye is usually still present, even in the presence of other strong reflections and noise, the detection and processing of corneal reflection location can provide a strong indicator of eye position and gaze.
  • Fig. 1 illustrates an example noisy image 1 of a human head including hat 2, safety glasses 4 and air mask 3. From close examination of the image 1, it can be seen that two corneal reflections 5, 6 are also present in the image.
  • Fig. 2 illustrates a second example image of an imaging device recording a view of a single eye having glasses 20.
  • the light source produces a number of specular reflections 21, in addition to a targeted corneal reflection 22.
  • the presence of corneal specular reflections is utilised to advantage.
  • the preferred embodiment uses at least one imaging device and at least two active light sources to determine the location of the corneal reflections.
  • the light sources are synchronised with the imaging devices. A greater number of light sources gives higher accuracy glint detection and less detection errors. Where there is more than one imaging device, their integration periods are also synchronised.
  • Exemplary imaging devices include digital cameras and CCD cameras.
  • the light sources can also be synchronized with the imaging device(s) integration period and can be actively controlled so that any combination of light sources can be ON or OFF for a given frame.
  • Exemplary light sources include LEDs or other electronically controllable lights that can emit light for a predetermined time period in response to a control signal.
  • a light source When a light source is ON, it produces a reflection (also called glint) on the surface of the cornea.
  • Fig. 3 illustrates the process schematically 30, wherein light sources 31 and 32 are projected towards the eyeball 33, a corneal reflection 34 is detected by camera 35.
  • Light sources 31 and 32 are spaced apart light so as to direct light at the cornea from different angles. This aids in better detection of glints, especially when one or both eyes are partially occluded.
  • the cornea surface can be modelled as any parametric surface.
  • the cornea is modelled as a sphere of centre C and radius .
  • the light sources 31 and 32 can also produce many other specular reflections, as illustrated in Fig. 1 and Fig. 2.
  • the proposed method of the preferred embodiment detects all the specular reflections in a sequence of images, and then using a constant motion model of the cornea (e.g., the cornea centre C is considered to move at constant velocity or constant acceleration in a 3D space), to evaluate which of the detected specular reflections are corresponding to corneal reflections.
  • Fig 4 illustrates a flow chart of the steps involved in a method 40 of determining the position of eyeballs within an image or a time series of images.
  • method 40 will be described with reference to the exemplary hardware illustrated in arrangement 50 of Fig. 5 having the exemplary configuration of Fig. 6.
  • a monitored subject 51 is subjected to sequenced infra red light sequencing from lights 52, 53 controlled by light sequencing microcontroller 55.
  • Video is captured by a video capture unit 54.
  • Unit 54 includes one or more digital cameras and optionally an internal processor.
  • the video capture is processed by processor 56 in accordance with method 40 described below.
  • a time series of images of subject 51 is captured using unit 54.
  • a subset of the time series is frames n to n+3 (57-60), as illustrated in Fig. 6.
  • the subject's eyeballs are illuminated by light sources 51 and 53.
  • illumination of sequential frames is preferably provided by a different light source in an alternating fashion.
  • light source 0 (52) is ON for the even numbered frames
  • light source 1 (53) is ON for the odd numbered frames.
  • the illumination profile varies by at least one of the light sources each frame.
  • consecutive image frames in the time series may be illuminated using the following illumination sequence:
  • sequencing microcontroller 55 in conjunction with processor 56 and capture unit 54.
  • the timing of the illumination is synchronised with the capture of image frames in the time series.
  • the general preference is that there is some variation in illumination profile (different actuated light sources or combinations of actuated light sources) between consecutive frames of the time series to better differentiate the specular reflections from noise.
  • the specular reflections or glints within the image are detected.
  • a triplet of frames Fn, Fn+1 and Fn+2 (54-56)
  • a set of 2D glints Gn, Gn+1 and Gn+2 is extracted as two-dimensional coordinates of pixels within the image.
  • Glint extraction can be done using well known computer vision methods, such as the maximum of Laplacian operators.
  • Those glints are either corresponding to a corneal reflection or any other specular reflection in the image.
  • the number of glints detected within an image can range from a few to several hundred depending on the environment imaged and the lighting.
  • the glint extraction process can be performed in parallel. Due to the small size of glints with an image, overlap of pixels between the separate modules can be significantly reduced.
  • a motion model is used to determine which specular reflections correspond to corneal reflections (as opposed to other specular reflections such as from a person's glasses).
  • An exemplary motion model is a constant velocity model of an eye.
  • Another exemplary motion model is an acceleration model of an eye. Ideally, a minimum of 3 frames for constant velocity assumption are used, or 4 frames for constant acceleration assumptions. The preferred embodiment focuses on the constant velocity model, but extension to the constant acceleration or other motion models can be used.
  • the model is applied by passing the captured image data through an algorithm run by processor 56. Each model applies constraints which relate to the typical motion of an eye. Corresponding motion models of other objects can be applied when tracking other objects within images.
  • the threshold distance may be based on a distance derived by a maximum velocity of the cornea in 3D space.
  • a minimization process can then occur to determine the best cornea trajectory in 3D (6 degrees of freedom using a constant velocity model) that fit the triplet of glints (6 observations from 3 x 2D locations).
  • Any iterative optimization process can be used at this stage (e.g. Levenberg-Marquardt) using the geometry of Fig. 3.
  • a specific fast solution to the optimization problem can be used.
  • the trajectory of the cornea can be computed from a sequence of 2D glints locations captured by a system as illustrated in Fig. 3, with the following considerations:
  • a camera / with known intrinsic projection parameters ⁇ A reference frame F aligned with the camera axis (X,Y parallel to the image plane, Z collinear with the optical axis of the camera) and centred on the camera centre of projection.
  • An infrared illuminator located at a known 3D location L in the camera reference frame F.
  • a motion model Q g ⁇ , i) where a are the motion parameters (e.g. constant velocity or constant acceleration) describing the trajectory C.
  • a sequence of 2D glints locations G ⁇ G G n ⁇ corresponding to the reflections of the light emanating from the infrared illuminator on the surface of the cornea as imaged by the camera.
  • the minimum of this function can be found using well-known optimization techniques. Once the parameter a min is found the trajectory T of the cornea can be computed using the known motion model.
  • the cornea is assumed to be a sphere of known radius R.
  • the method remains valid for any other parametric shape of the cornea (e.g. ellipsoid) as long as the theoretical location G L of the glint can be computed from the known position (and optionally orientation) of the cornea.
  • the above culling process will often reduce the number of candidate glints down to about 3 or 4.
  • the triplet of glints can then be rejected or accepted based on other predetermined criteria. For example, a maximum threshold on the residuals from the optimization (the error between the observed 2D positions of the glints and their optimized 2D positions computed from the optimized 3D cornea trajectory) can be set. Other thresholds on the optimized cornea trajectory can also be set, like the minimum and maximum depth or velocity.
  • the triplets that pass all the acceptance criteria are considered to be from actual corneal reflections and therefore both the 2D position of the eye and the 3D location of the cornea have been computed.
  • 2 consecutive glint triplets can then be assessed as a quadruplet using another motion model (e.g. constant velocity or constant acceleration) to further check for false positive detections.
  • the proposed method detects any reflective object with a curvature similar to that of a cornea. It can also occasionally produce false positives in the presence of noise (high number of specular reflections) in the images. In such cases, further image analysis, like machine learning based classifiers or appearance based criteria, can be employed to eliminate unwanted false positives.
  • the eye position determined from the corneal reflections is output.
  • the output data is in the form of either a three-dimensional coordinate of the cornea position in the camera reference frame or a two-dimensional projection in the image. These coordinates may be subsequently used to project the eye positions back onto the image or another image in the time series. Further, the coordinates of the detected eyes may be used to determine a gaze direction through further analysis of the images.
  • the embodiments described herein provide various useful method of determining the position of eyeballs within an image.
  • the invention has applications for any computer vision based face or eye tracking systems that require the detection of eye(s) and/or face(s). It is particularly useful where the face is partially occluded (for example, where the user is wearing a dust or hygienic mask), not entirely visible (for example, a portion of the face is out of the field of view of the camera), or the eye texture is partially occluded by glasses rims and reflections on the lenses.
  • Exemplary applications include vehicle operator monitoring systems for detecting signs of fatigue or distraction, gaze tracking systems that computing gaze direction (on 2D screens or in 3D environments) for ergonomic or human behavioural studies, face tracking systems for virtual glasses try-out, and face tracking systems for avatar animation.
  • the present invention is able to be performed in systems having a single glint detection module or a plurality of glint detection modules running in parallel.
  • the abovementioned overlap problem associated with prior art techniques is significantly reduced because the glint is a very small feature in the image even at close range (in some embodiments, typically 3 or 4 pixels in diameter).
  • close range in some embodiments, typically 3 or 4 pixels in diameter.
  • the system and method of the invention is still able to fit a trajectory to the detected glints from the plurality of glint detectors (removing many false eye candidates) and thereby creating a single candidate solution for the eye validation phase to operate over. This makes the process of validating any region containing an eye much more likely to return positive results with less processing time, when the eye is moving.
  • any one of the terms comprising, comprised of or which comprises is an open term that means including at least the elements/features that follow, but not excluding others.
  • the term comprising, when used in the claims should not be interpreted as being limitative to the means or elements or steps listed thereafter.
  • the scope of the expression a device comprising A and B should not be limited to devices consisting only of elements A and B.
  • Any one of the terms including or which includes or that includes as used herein is also an open term that also means including at least the elements/features that follow the term, but not excluding others.
  • including is synonymous with and means comprising.
  • the term "exemplary" is used in the sense of providing examples, as opposed to indicating quality. That is, an "exemplary embodiment" is an embodiment provided as an example, as opposed to necessarily being an embodiment of exemplary quality.
  • Coupled when used in the claims, should not be interpreted as being limited to direct connections only.
  • the terms “coupled” and “connected,” along with their derivatives, may be used. It should be understood that these terms are not intended as synonyms for each other.
  • the scope of the expression a device A coupled to a device B should not be limited to devices or systems wherein an output of device A is directly connected to an input of device B. It means that there exists a path between an output of A and an input of B which may be a path including other devices or means.
  • Coupled may mean that two or more elements are either in direct physical or electrical contact, or that two or more elements are not in direct contact with each other but yet still co-operate or interact with each other.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Ophthalmology & Optometry (AREA)
  • Molecular Biology (AREA)
  • Public Health (AREA)
  • Medical Informatics (AREA)
  • Biomedical Technology (AREA)
  • Surgery (AREA)
  • Animal Behavior & Ethology (AREA)
  • Biophysics (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Veterinary Medicine (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)
  • Eye Examination Apparatus (AREA)
  • Image Processing (AREA)

Abstract

A method of determining the position of eyeballs within an image, the method including the steps of: (a) capturing a time series of image frames illuminated in a predetermined temporal manner by at least two spaced apart light sources, by at least one imaging sensor; (b) processing the image frames to determine specular reflection locations in the image frames; and (c) utilising the time series evolution of the location of the specular reflections to isolate corneal reflections from the determined specular reflection locations.

Description

Method and Apparatus for Eye Detection from Glints
FIELD OF THE INVENTION
[0001 ] The present invention relates to the field of object detection and monitoring, and, in particular discloses a method and system for eye detection based on reflection structure. By way of example, embodiments of the invention are applicable in tracking the eye location of a user of a computer or mobile device (such as a smartphone or tablet), or a driver of a vehicle.
BACKGROUND
[0002] Any discussion of the background art throughout the specification should in no way be considered as an admission that such art is widely known or forms part of common general knowledge in the field.
[0003] Proper detection of eyes in sensed noisy images can be difficult, especially in the presence of glasses with the occlusion generated by their frame and the reflections occurring on the lenses.
[0004] Traditional computer vision algorithms for eye detection often rely on appearance (e.g. US Patent 7,020,337 to Viola and Jones entitled "System and method for detecting objects in images"). This method relies on training a model based on the appearance of the object to be detected and its robustness will degrade significantly in the presence of noise such as strong reflections and/or occlusions. Further, this method is relatively computationally intensive.
[0005] Another example is the method described in US Patent 7,460,693 to Loy and Thomsen entitled "Method and apparatus for the automatic detection of facial features". In this document the eyes are detected using a fast symmetry transform (using the circular symmetry of the iris) and then refined using a Hough transform (which detects circles in images). This method relies on the texture of the eyes and its performance will degrade significantly if the iris is partially occluded by specular reflections on the lenses of glasses for example.
[0006] In addition, certain prior art systems comprise whole eye detection modules running in parallel. Such systems have inherent disadvantages. For example, in some situations, the eye feature may occupy a significant portion of the image (e.g. for a phone camera held close to the face, it may be that 20% of the pixels will fall on the eye). In these circumstances, the eye detectors will have to operate over areas of the image that overlap by this amount, otherwise the eye will not be detectable (referred to herein as "the overlap problem"). The overlap creates additional redundant processing on the same pixel data and can create multiple detections of the same eye from different detectors which require further processing to disambiguate. [0007] In addition, where the eye detector operates on multiple frames and the eye is moving (creating a trajectory), such prior art systems will not be able to resolve a trajectory that moves across the multiple eye detection regions. Instead it will report multiple trajectories with a discontinuity between them.
[0008] Therefore, there is a general need for a more robust form of eye detection in noisy or occluded images.
SUMMARY OF THE INVENTION
[0009] It is an object of the invention, in its preferred form to provide an improved form of image object detection, including the detection of eyes within an image.
[0010] In accordance with a first aspect of the present invention, there is provided a method of determining the position of at least one eyeball within an image, the method including the steps of: (a) capturing a time series of image frames illuminated in a predetermined temporal manner by at least two spaced apart light sources, by at least one imaging sensor; (b) processing the image frames to determine specular reflection locations in the image frames; and (c) utilising the time series evolution of the location of the specular reflections to isolate corneal reflections from the determined specular reflection locations.
[001 1 ] The step (c) preferably can include utilising either a velocity or acceleration model of position evolution to model the location of the specular reflections corresponding to corneal reflections.
[0012] The isolate step preferably can include utilising an error measure between the model and the actual locations of the specular reflections in the image frames. The model preferably can include maximum velocity or accelerations.
[0013] In one embodiment, first and second light sources are included, wherein the first light source is actuated to illuminate one or both of the eyeballs during capture of even frames of the time series and the second light source is actuated to illuminate one or both of the eyeballs during capture of odd frames of the time series.
[0014] In another embodiment, a plurality of light sources is included, each light source being actuated to illuminate one or both of the eyeballs during capture of predetermined frames of the time series.
[0015] In accordance with a second aspect of the present invention, there is provided an image processing system for detecting the position of an eyeball within an image, the system including: at least two image illumination sources for illuminating the image area in a predetermined temporal manner; an image sensor for capturing a sequence of temporal frames of the image area; a processor configured to process the temporal frames to determine specular reflection locations in the temporal frames; and second processing means for isolating likely corneal reflections from the specular reflection locations of a series of temporal frames.
[0016] In accordance with a third aspect of the present invention, there is provided a method of tracking one or more objects within a series of images, the method including the steps of:
(a) controlling at least two spaced apart light sources to illuminate the one or more objects during respective predetermined time periods;
(b) during one of the predetermined time periods, controlling a camera to capture an image including the one or more objects, the image forming part of the image stream;
(c) identifying specular reflections present in images;
(d) applying one or more constraints to determine which of the specular reflections correspond to reflections from the one or more objects; and
(e) outputting two-dimensional coordinates of the position of the one or more objects in at least a subset of the image frames.
[0017] The step of applying one or more constraints preferably includes applying a motion model of the one or more objects based on the position of the specular reflections in a plurality of images.
[0018] In accordance with a fourth aspect of the present invention, there is provided a computer program configured to perform a method according to the third aspect.
[0019] In accordance with a fifth aspect of the present invention, there is provided a computer system configured to perform a method according to the third aspect.
[0020] In accordance with a sixth aspect of the present invention, there is provided a device configured to perform a method according to the third aspect.
BRIEF DESCRIPTION OF THE DRAWINGS
[0021 ] Embodiments of the invention will now be described, by way of example only, with reference to the accompanying drawings in which:
Fig. 1 illustrates a first example complex image having a series of specular reflections;
Fig. 2 illustrates a second example image having specular reflections;
Fig. 3 illustrates schematically the geometry of creation of corneal reflections;
Fig. 4 illustrates a flow chart of the steps of the preferred embodiment; Fig. 5 illustrates an example processing system suitable for implementation of the preferred embodiment; and
Fig. 6 illustrates the processing arrangement of the preferred embodiment.
DETAILED DESCRIPTION
[0022] The preferred embodiment provides a robust form of eye detection through the utilisation of the corneal reflection in a captured image. As the corneal reflection from the eye is usually still present, even in the presence of other strong reflections and noise, the detection and processing of corneal reflection location can provide a strong indicator of eye position and gaze.
[0023] Fig. 1 illustrates an example noisy image 1 of a human head including hat 2, safety glasses 4 and air mask 3. From close examination of the image 1, it can be seen that two corneal reflections 5, 6 are also present in the image.
[0024] Fig. 2 illustrates a second example image of an imaging device recording a view of a single eye having glasses 20. In this example, the light source produces a number of specular reflections 21, in addition to a targeted corneal reflection 22.
[0025] In the preferred embodiment, the presence of corneal specular reflections is utilised to advantage. The preferred embodiment uses at least one imaging device and at least two active light sources to determine the location of the corneal reflections. The light sources are synchronised with the imaging devices. A greater number of light sources gives higher accuracy glint detection and less detection errors. Where there is more than one imaging device, their integration periods are also synchronised. Exemplary imaging devices include digital cameras and CCD cameras.
[0026] The light sources can also be synchronized with the imaging device(s) integration period and can be actively controlled so that any combination of light sources can be ON or OFF for a given frame. Exemplary light sources include LEDs or other electronically controllable lights that can emit light for a predetermined time period in response to a control signal.
[0027] When a light source is ON, it produces a reflection (also called glint) on the surface of the cornea. Fig. 3 illustrates the process schematically 30, wherein light sources 31 and 32 are projected towards the eyeball 33, a corneal reflection 34 is detected by camera 35. Light sources 31 and 32 are spaced apart light so as to direct light at the cornea from different angles. This aids in better detection of glints, especially when one or both eyes are partially occluded.
[0028] The cornea surface can be modelled as any parametric surface. In a first example embodiment, the cornea is modelled as a sphere of centre C and radius . The light sources 31 and 32 can also produce many other specular reflections, as illustrated in Fig. 1 and Fig. 2. [0029] The proposed method of the preferred embodiment detects all the specular reflections in a sequence of images, and then using a constant motion model of the cornea (e.g., the cornea centre C is considered to move at constant velocity or constant acceleration in a 3D space), to evaluate which of the detected specular reflections are corresponding to corneal reflections.
[0030] This procedure will now be described in detail with reference to Fig 4, which illustrates a flow chart of the steps involved in a method 40 of determining the position of eyeballs within an image or a time series of images. Whilst the various embodiments of the invention can be implemented on many different hardware platforms (stand alone or mobile, PDA, Smart Phone etc), method 40 will be described with reference to the exemplary hardware illustrated in arrangement 50 of Fig. 5 having the exemplary configuration of Fig. 6.
[0031 ] In arrangement 50, a monitored subject 51 is subjected to sequenced infra red light sequencing from lights 52, 53 controlled by light sequencing microcontroller 55. Video is captured by a video capture unit 54. Unit 54 includes one or more digital cameras and optionally an internal processor. The video capture is processed by processor 56 in accordance with method 40 described below.
[0032] First, at step 41, a time series of images of subject 51 is captured using unit 54. A subset of the time series is frames n to n+3 (57-60), as illustrated in Fig. 6. During the capture, the subject's eyeballs are illuminated by light sources 51 and 53. In a system using two light sources, illumination of sequential frames is preferably provided by a different light source in an alternating fashion. As shown in Fig. 6, light source 0 (52) is ON for the even numbered frames and light source 1 (53) is ON for the odd numbered frames. In systems using more than two light sources, it is preferable that the illumination profile varies by at least one of the light sources each frame. By way of example, in a system including three light sources (LI, L2 and L3), consecutive image frames in the time series may be illuminated using the following illumination sequence:
Frame 1 : LI + L2
Frame 2: LI + L3
Frame 3 : L2 + L3
Frame 4: L1 + L2 + L3
Frame 5: LI + L2...
[0033] This sequencing and is determined by sequencing microcontroller 55 in conjunction with processor 56 and capture unit 54. The timing of the illumination is synchronised with the capture of image frames in the time series. The general preference is that there is some variation in illumination profile (different actuated light sources or combinations of actuated light sources) between consecutive frames of the time series to better differentiate the specular reflections from noise.
[0034] At step 42, from the captured time series of images, the specular reflections or glints within the image are detected. Given a triplet of frames Fn, Fn+1 and Fn+2 (54-56), a set of 2D glints Gn, Gn+1 and Gn+2 is extracted as two-dimensional coordinates of pixels within the image. Glint extraction can be done using well known computer vision methods, such as the maximum of Laplacian operators. Those glints are either corresponding to a corneal reflection or any other specular reflection in the image. The number of glints detected within an image can range from a few to several hundred depending on the environment imaged and the lighting. In systems implementing multiple glint detection modules, the glint extraction process can be performed in parallel. Due to the small size of glints with an image, overlap of pixels between the separate modules can be significantly reduced.
[0035] At step 43, a motion model is used to determine which specular reflections correspond to corneal reflections (as opposed to other specular reflections such as from a person's glasses). An exemplary motion model is a constant velocity model of an eye. Another exemplary motion model is an acceleration model of an eye. Ideally, a minimum of 3 frames for constant velocity assumption are used, or 4 frames for constant acceleration assumptions. The preferred embodiment focuses on the constant velocity model, but extension to the constant acceleration or other motion models can be used. The model is applied by passing the captured image data through an algorithm run by processor 56. Each model applies constraints which relate to the typical motion of an eye. Corresponding motion models of other objects can be applied when tracking other objects within images.
[0036] It is necessary to consider whether any triplet of glints in consecutive frames is relevant. Where only one glint is picked per set Gn, Gn+1 and Gn+2, this involves trying to identify triplets corresponding to 3 consecutive corneal reflections on the same cornea. A first cull can occur at this stage to reject triplets where the glint position on two consecutive frames is greater than a predetermined threshold distance. For example, the threshold distance may be based on a distance derived by a maximum velocity of the cornea in 3D space. Assuming a known corneal radius (which is very similar across the human population), a minimization process can then occur to determine the best cornea trajectory in 3D (6 degrees of freedom using a constant velocity model) that fit the triplet of glints (6 observations from 3 x 2D locations). Any iterative optimization process can be used at this stage (e.g. Levenberg-Marquardt) using the geometry of Fig. 3. For the embodiment shown in Fig 4, a specific fast solution to the optimization problem can be used. [0037] From a mathematical perspective, the trajectory of the cornea can be computed from a sequence of 2D glints locations captured by a system as illustrated in Fig. 3, with the following considerations:
A camera / with known intrinsic projection parameters Θ. A reference frame F aligned with the camera axis (X,Y parallel to the image plane, Z collinear with the optical axis of the camera) and centred on the camera centre of projection.
An infrared illuminator located at a known 3D location L in the camera reference frame F. A spherical cornea of known radius R, whose center is following a trajectory C = {Cx Cn) in the reference frame F in a sequence of images. A motion model Q = g{ , i) where a are the motion parameters (e.g. constant velocity or constant acceleration) describing the trajectory C. A sequence of 2D glints locations G = {G Gn} corresponding to the reflections of the light emanating from the infrared illuminator on the surface of the cornea as imaged by the camera.
[0038] Using well known reflective geometry of spherical mirrors and projective geometry of cameras, there is a known function GL = f{L , R, a, 9 , i) where G(is the theoretical location of the specular reflection G^ The parameters of the cornea trajectory can then be computed by minimizing the error function:
Figure imgf000008_0001
[0039] The minimum of this function can be found using well-known optimization techniques. Once the parameter amin is found the trajectory T of the cornea can be computed using the known motion model.
[0040] Note that for simplification the cornea is assumed to be a sphere of known radius R. However, as mentioned above, the method remains valid for any other parametric shape of the cornea (e.g. ellipsoid) as long as the theoretical location GL of the glint can be computed from the known position (and optionally orientation) of the cornea.
[0041 ] The above culling process will often reduce the number of candidate glints down to about 3 or 4. For glints that pass the distance or trajectory assessment described above, the triplet of glints can then be rejected or accepted based on other predetermined criteria. For example, a maximum threshold on the residuals from the optimization (the error between the observed 2D positions of the glints and their optimized 2D positions computed from the optimized 3D cornea trajectory) can be set. Other thresholds on the optimized cornea trajectory can also be set, like the minimum and maximum depth or velocity.
[0042] The triplets that pass all the acceptance criteria are considered to be from actual corneal reflections and therefore both the 2D position of the eye and the 3D location of the cornea have been computed. In one embodiment, 2 consecutive glint triplets can then be assessed as a quadruplet using another motion model (e.g. constant velocity or constant acceleration) to further check for false positive detections.
[0043] The proposed method detects any reflective object with a curvature similar to that of a cornea. It can also occasionally produce false positives in the presence of noise (high number of specular reflections) in the images. In such cases, further image analysis, like machine learning based classifiers or appearance based criteria, can be employed to eliminate unwanted false positives.
[0044] Finally, at step 44 the eye position determined from the corneal reflections is output. The output data is in the form of either a three-dimensional coordinate of the cornea position in the camera reference frame or a two-dimensional projection in the image. These coordinates may be subsequently used to project the eye positions back onto the image or another image in the time series. Further, the coordinates of the detected eyes may be used to determine a gaze direction through further analysis of the images.
[0045] It will be appreciated that the embodiments described herein provide various useful method of determining the position of eyeballs within an image. The invention has applications for any computer vision based face or eye tracking systems that require the detection of eye(s) and/or face(s). It is particularly useful where the face is partially occluded (for example, where the user is wearing a dust or hygienic mask), not entirely visible (for example, a portion of the face is out of the field of view of the camera), or the eye texture is partially occluded by glasses rims and reflections on the lenses. Exemplary applications include vehicle operator monitoring systems for detecting signs of fatigue or distraction, gaze tracking systems that computing gaze direction (on 2D screens or in 3D environments) for ergonomic or human behavioural studies, face tracking systems for virtual glasses try-out, and face tracking systems for avatar animation.
[0046] The present invention is able to be performed in systems having a single glint detection module or a plurality of glint detection modules running in parallel. In parallel embodiments, the abovementioned overlap problem associated with prior art techniques is significantly reduced because the glint is a very small feature in the image even at close range (in some embodiments, typically 3 or 4 pixels in diameter). As such, it is possible to allow the detector region overlap to be very small. If the same glint is detected by multiple glint detectors, then any ambiguity is resolved in the cornea trajectory fitting process.
[0047] In addition, in contrast to certain prior art systems, where the eye detector operates on multiple frames and the eye is moving through regions of multiple detectors, the system and method of the invention is still able to fit a trajectory to the detected glints from the plurality of glint detectors (removing many false eye candidates) and thereby creating a single candidate solution for the eye validation phase to operate over. This makes the process of validating any region containing an eye much more likely to return positive results with less processing time, when the eye is moving.
Interpretation
[0048] Reference throughout this specification to "one embodiment", "some embodiments" or "an embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases "in one embodiment", "in some embodiments" or "in an embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment, but may. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to one of ordinary skill in the art from this disclosure, in one or more embodiments.
[0049] As used herein, unless otherwise specified the use of the ordinal adjectives "first", "second", "third", etc., to describe a common object, merely indicate that different instances of like objects are being referred to, and are not intended to imply that the objects so described must be in a given sequence, either temporally, spatially, in ranking, or in any other manner.
[0050] In the claims below and the description herein, any one of the terms comprising, comprised of or which comprises is an open term that means including at least the elements/features that follow, but not excluding others. Thus, the term comprising, when used in the claims, should not be interpreted as being limitative to the means or elements or steps listed thereafter. For example, the scope of the expression a device comprising A and B should not be limited to devices consisting only of elements A and B. Any one of the terms including or which includes or that includes as used herein is also an open term that also means including at least the elements/features that follow the term, but not excluding others. Thus, including is synonymous with and means comprising. [0051 ] As used herein, the term "exemplary" is used in the sense of providing examples, as opposed to indicating quality. That is, an "exemplary embodiment" is an embodiment provided as an example, as opposed to necessarily being an embodiment of exemplary quality.
[0052] It should be appreciated that in the above description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, FIG., or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the Detailed Description are hereby expressly incorporated into this Detailed Description, with each claim standing on its own as a separate embodiment of this invention.
[0053] Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention, and form different embodiments, as would be understood by those skilled in the art. For example, in the following claims, any of the claimed embodiments can be used in any combination.
[0054] Furthermore, some of the embodiments are described herein as a method or combination of elements of a method that can be implemented by a processor of a computer system or by other means of carrying out the function. Thus, a processor with the necessary instructions for carrying out such a method or element of a method forms a means for carrying out the method or element of a method. Furthermore, an element described herein of an apparatus embodiment is an example of a means for carrying out the function performed by the element for the purpose of carrying out the invention.
[0055] In the description provided herein, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In other instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
[0056] Similarly, it is to be noticed that the term coupled, when used in the claims, should not be interpreted as being limited to direct connections only. The terms "coupled" and "connected," along with their derivatives, may be used. It should be understood that these terms are not intended as synonyms for each other. Thus, the scope of the expression a device A coupled to a device B should not be limited to devices or systems wherein an output of device A is directly connected to an input of device B. It means that there exists a path between an output of A and an input of B which may be a path including other devices or means. "Coupled" may mean that two or more elements are either in direct physical or electrical contact, or that two or more elements are not in direct contact with each other but yet still co-operate or interact with each other.
[0057] Thus, while there has been described what are believed to be the preferred embodiments of the invention, those skilled in the art will recognize that other and further modifications may be made thereto without departing from the spirit of the invention, and it is intended to claim all such changes and modifications as falling within the scope of the invention. For example, any formulas given above are merely representative of procedures that may be used. Functionality may be added or deleted from the block diagrams and operations may be interchanged among functional blocks. Steps may be added or deleted to methods described within the scope of the present invention.

Claims

CLAIMS:
1. A method of determining the position of eyeballs within an image, the method including the steps of:
(a) capturing a time series of image frames illuminated in a predetermined temporal manner by at least two spaced apart light sources, by at least one imaging sensor;
(b) processing the image frames to determine specular reflection locations in the image frames; and
(c) utilising the time series evolution of the location of the specular reflections to isolate corneal reflections from the determined specular reflection locations.
2. A method as claimed in claim 1 wherein the step (c) includes utilising either a velocity or acceleration model of position evolution to model the location of the specular reflections corresponding to corneal reflections.
3. A method as claimed in claim 2 wherein said isolate step includes utilising an error measure between said model and the actual locations of the specular reflections in the image frames.
4. A method as claimed in claim 2 wherein said model includes maximum velocity or accelerations.
5. A method as claimed in any one of the preceding claims including first and second light sources, wherein the first light source is actuated to illuminate one or both of the eyeballs during capture of even frames of the time series and the second light source is actuated to illuminate one or both of the eyeballs during capture of odd frames of the time series.
6. A method as claimed in any one of claims 1 to 4 including a plurality of light sources, each light source being actuated to illuminate one or both of the eyeballs during capture of
predetermined frames of the time series.
7. An image processing system for detecting the position of an eyeball within an image, the system including:
at least two image illumination sources for illuminating the image area in a predetermined temporal manner;
an image sensor for capturing a sequence of temporal frames of the image area;
a processor configured to process the temporal frames to determine specular reflection locations in the temporal frames; and second processing means for isolating likely corneal reflections from the specular reflection locations of a series of temporal frames.
8. A method of tracking one or more objects within a series of images, the method including the steps of:
(a) controlling at least two spaced apart light sources to illuminate the one or more objects during respective predetermined time periods;
(b) during one of the predetermined time periods, controlling a camera to capture an image including the one or more objects, the image forming part of the image stream;
(c) identifying specular reflections present in images;
(d) applying one or more constraints to determine which of the specular reflections correspond to reflections from the one or more objects; and
(e) outputting two-dimensional coordinates of the position of the one or more objects in at least a subset of the image frames.
9. A method according to claim 8 wherein the step of applying one or more constraints includes applying a motion model of the one or more objects based on the position of the specular reflections in a plurality of images.
10. A computer program configured to perform a method according to claim 8 or claim 9.
11. A computer system configured to perform a method according to claim 8 or claim 9.
12. A device configured to perform a method according to claim 8 or claim 9.
PCT/AU2014/000868 2013-09-02 2014-09-01 Method and apparatus for eye detection from glints WO2015027289A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP14840504.6A EP3042341A4 (en) 2013-09-02 2014-09-01 Method and apparatus for eye detection from glints
CN201480059776.9A CN105765608A (en) 2013-09-02 2014-09-01 Method and apparatus for eye detection from glints
US14/916,082 US10552675B2 (en) 2014-03-12 2014-09-01 Method and apparatus for eye detection from glints
JP2016539360A JP2016532217A (en) 2013-09-02 2014-09-01 Method and apparatus for detecting eyes with glint

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
AU2013903337A AU2013903337A0 (en) 2013-09-02 Method and apparatus for eye detection (Glints)
AU2013903337 2013-09-02
AU2014900842A AU2014900842A0 (en) 2014-03-12 Improvements to Methods and Apparatus for Eye Detection (Glints)
AU2014900842 2014-03-12

Publications (1)

Publication Number Publication Date
WO2015027289A1 true WO2015027289A1 (en) 2015-03-05

Family

ID=52585306

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/AU2014/000868 WO2015027289A1 (en) 2013-09-02 2014-09-01 Method and apparatus for eye detection from glints

Country Status (4)

Country Link
EP (1) EP3042341A4 (en)
JP (1) JP2016532217A (en)
CN (1) CN105765608A (en)
WO (1) WO2015027289A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017160356A1 (en) * 2016-03-16 2017-09-21 Google Inc. Systems and methods for enhancing object visibility for overhead imaging
WO2018154271A1 (en) * 2017-02-22 2018-08-30 Fuel 3D Technologies Limited Systems and methods for obtaining eyewear information
CN109690553A (en) * 2016-06-29 2019-04-26 醒眸行有限公司 The system and method for executing eye gaze tracking
CN110168563A (en) * 2016-11-11 2019-08-23 3E株式会社 Flash detection method
EP3671541A1 (en) * 2018-12-21 2020-06-24 Tobii AB Classification of glints using an eye tracking system

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023120892A1 (en) * 2021-12-20 2023-06-29 삼성전자주식회사 Device and method for controlling light sources in gaze tracking using glint

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7747068B1 (en) 2006-01-20 2010-06-29 Andrew Paul Smyth Systems and methods for tracking the eye

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7280678B2 (en) * 2003-02-28 2007-10-09 Avago Technologies General Ip Pte Ltd Apparatus and method for detecting pupils
CA2634033C (en) * 2005-12-14 2015-11-17 Digital Signal Corporation System and method for tracking eyeball motion
JP5621456B2 (en) * 2010-09-21 2014-11-12 富士通株式会社 Gaze detection device, gaze detection method, computer program for gaze detection, and portable terminal
JP5776323B2 (en) * 2011-05-17 2015-09-09 富士通株式会社 Corneal reflection determination program, corneal reflection determination device, and corneal reflection determination method

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7747068B1 (en) 2006-01-20 2010-06-29 Andrew Paul Smyth Systems and methods for tracking the eye

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
HARO ET AL.: "Detecting and Tracking eyes by using their physiological properties, dynamcics, and appearance", PROCEEDINGS 2000 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2000, 13 June 2000 (2000-06-13), pages 163 - 168, XP001035597
HARO, A. ET AL.: "Detecting and Tracking Eyes By Using Their Physiological Properties, Dynamics. and Appearance", COMPUTER VISION AND PATTERN RECOGNITION, vol. 1, 2000, pages 163 - 168, XP001035597 *
MORIMOTO, C. H. ET AL.: "Pupil detection and tracking using multiple light sources", IMAGE AND VISION COMPUTING, vol. 18, no. ISSUE, 2000, pages 331 - 335, XP008126446 *
See also references of EP3042341A4

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017160356A1 (en) * 2016-03-16 2017-09-21 Google Inc. Systems and methods for enhancing object visibility for overhead imaging
US9996905B2 (en) 2016-03-16 2018-06-12 Planet Labs, Inc. Systems and methods for enhancing object visibility for overhead imaging
EP3663714A1 (en) * 2016-03-16 2020-06-10 Planet Labs Inc. Systems and methods for enhancing object visibility for overhead imaging
US10249024B2 (en) 2016-03-16 2019-04-02 Plant Labs, Inc. Systems and methods for enhancing object visibility for overhead imaging
CN109690553A (en) * 2016-06-29 2019-04-26 醒眸行有限公司 The system and method for executing eye gaze tracking
JP2019519859A (en) * 2016-06-29 2019-07-11 シーイング マシーンズ リミテッド System and method for performing gaze tracking
EP3479293A4 (en) * 2016-06-29 2020-03-04 Seeing Machines Limited Systems and methods for performing eye gaze tracking
US10878237B2 (en) 2016-06-29 2020-12-29 Seeing Machines Limited Systems and methods for performing eye gaze tracking
CN110168563A (en) * 2016-11-11 2019-08-23 3E株式会社 Flash detection method
WO2018154271A1 (en) * 2017-02-22 2018-08-30 Fuel 3D Technologies Limited Systems and methods for obtaining eyewear information
US10775647B2 (en) 2017-02-22 2020-09-15 Fuel 3D Technologies Limited Systems and methods for obtaining eyewear information
EP3671541A1 (en) * 2018-12-21 2020-06-24 Tobii AB Classification of glints using an eye tracking system
CN111522431A (en) * 2018-12-21 2020-08-11 托比股份公司 Classifying glints using an eye tracking system
CN111522431B (en) * 2018-12-21 2021-08-20 托比股份公司 Classifying glints using an eye tracking system
CN113608620A (en) * 2018-12-21 2021-11-05 托比股份公司 Classifying glints using an eye tracking system
US11619990B2 (en) 2018-12-21 2023-04-04 Tobii Ab Classification of glints using an eye tracking system

Also Published As

Publication number Publication date
CN105765608A (en) 2016-07-13
EP3042341A1 (en) 2016-07-13
EP3042341A4 (en) 2017-04-19
JP2016532217A (en) 2016-10-13

Similar Documents

Publication Publication Date Title
US10552675B2 (en) Method and apparatus for eye detection from glints
US10878237B2 (en) Systems and methods for performing eye gaze tracking
JP5529660B2 (en) Pupil detection device and pupil detection method
US7682026B2 (en) Eye location and gaze detection system and method
US10318831B2 (en) Method and system for monitoring the status of the driver of a vehicle
EP2748797B1 (en) Object distance determination from image
EP3453316B1 (en) Eye tracking using eyeball center position
US7819525B2 (en) Automatic direct gaze detection based on pupil symmetry
US9411417B2 (en) Eye gaze tracking system and method
JP5467303B1 (en) Gaze point detection device, gaze point detection method, personal parameter calculation device, personal parameter calculation method, program, and computer-readable recording medium
WO2015027289A1 (en) Method and apparatus for eye detection from glints
CN106547341B (en) Gaze tracker and method for tracking gaze thereof
US9002053B2 (en) Iris recognition systems
EP3542308B1 (en) Method and device for eye metric acquisition
WO2016027627A1 (en) Corneal reflection position estimation system, corneal reflection position estimation method, corneal reflection position estimation program, pupil detection system, pupil detection method, pupil detection program, gaze detection system, gaze detection method, gaze detection program, face orientation detection system, face orientation detection method, and face orientation detection program
KR20130107981A (en) Device and method for tracking sight line
WO2016142489A1 (en) Eye tracking using a depth sensor
JP6870474B2 (en) Gaze detection computer program, gaze detection device and gaze detection method
JP2010123019A (en) Device and method for recognizing motion
CN113260299A (en) System and method for eye tracking
JP2016045707A (en) Feature point detection system, feature point detection method, and feature point detection program
US20240153136A1 (en) Eye tracking
JP2023125652A (en) Pupil detection device and pupil detection method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14840504

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2016539360

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 14916082

Country of ref document: US

REEP Request for entry into the european phase

Ref document number: 2014840504

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2014840504

Country of ref document: EP