WO2016142933A1 - System and method for enabling communication through eye feedback - Google Patents

System and method for enabling communication through eye feedback Download PDF

Info

Publication number
WO2016142933A1
WO2016142933A1 PCT/IL2016/050235 IL2016050235W WO2016142933A1 WO 2016142933 A1 WO2016142933 A1 WO 2016142933A1 IL 2016050235 W IL2016050235 W IL 2016050235W WO 2016142933 A1 WO2016142933 A1 WO 2016142933A1
Authority
WO
WIPO (PCT)
Prior art keywords
eye
user
relative
orientation
pupil
Prior art date
Application number
PCT/IL2016/050235
Other languages
French (fr)
Inventor
Shay RISHONI
Tal Kellner
Itai KORNBERG
Or RETZKIN
Original Assignee
Eyefree Assisting Communication Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Eyefree Assisting Communication Ltd. filed Critical Eyefree Assisting Communication Ltd.
Priority to CN201680026221.3A priority Critical patent/CN107850939A/en
Priority to EP16761197.9A priority patent/EP3268847A4/en
Priority to IL303037A priority patent/IL303037A/en
Priority to US15/557,031 priority patent/US11883101B2/en
Priority to EP20212667.8A priority patent/EP3809241B1/en
Priority to JP2017566228A priority patent/JP7016263B2/en
Publication of WO2016142933A1 publication Critical patent/WO2016142933A1/en
Priority to IL254315A priority patent/IL254315A0/en

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B3/00Apparatus for testing the eyes; Instruments for examining the eyes
    • A61B3/10Objective types, i.e. instruments for examining the eyes independent of the patients' perceptions or reactions
    • A61B3/113Objective types, i.e. instruments for examining the eyes independent of the patients' perceptions or reactions for determining or recording eye movement
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G06F3/013Eye tracking input arrangements
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B3/00Apparatus for testing the eyes; Instruments for examining the eyes
    • A61B3/0008Apparatus for testing the eyes; Instruments for examining the eyes provided with illuminating means
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B3/00Apparatus for testing the eyes; Instruments for examining the eyes
    • A61B3/02Subjective types, i.e. testing apparatus requiring the active assistance of the patient
    • A61B3/028Subjective types, i.e. testing apparatus requiring the active assistance of the patient for testing visual acuity; for determination of refraction, e.g. phoropters
    • A61B3/032Devices for presenting test symbols or characters, e.g. test chart projectors
    • GPHYSICS
    • G02OPTICS
    • G02BOPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B27/00Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B27/0093Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00 with means for monitoring data relating to the user, e.g. head-tracking, eye-tracking
    • GPHYSICS
    • G02OPTICS
    • G02BOPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B27/00Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B27/01Head-up displays
    • G02B27/017Head mounted
    • G02B27/0172Head mounted characterised by optical features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017Gesture based interaction, e.g. based on a set of recognized hand gestures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233Character input methods
    • G06F3/0236Character input methods using selection techniques to select from displayed items
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/0482Interaction with lists of selectable items, e.g. menus
    • GPHYSICS
    • G02OPTICS
    • G02BOPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B27/00Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B27/01Head-up displays
    • G02B27/0101Head-up displays characterised by optical features
    • G02B2027/0138Head-up displays characterised by optical features comprising image capture systems, e.g. camera

Definitions

  • the present invention relates to eye tracking and to interfaces and user communication platforms.
  • a degenerative disease is characterized by the deterioration of structure or function of body tissue.
  • Degenerative diseases often lead to a loss of control over muscles and a decline of motor function that impair a person's movement capabilities. As a result, the individual may experience severe difficulties or even a complete inability to communicate verbally and/or via hand motions.
  • Degenerative diseases do not necessarily diminish the person's mind and cognitive function at all or at the same rate as the decline in motor function. In this way, a person with full cognitive activity can lose his ability to interact physically and verbally with his environment. The person's mind can be considered to be essentially trapped or locked into his body.
  • ALS Amyotrophic lateral sclerosis
  • DMD Duchenne muscular dystrophy
  • MJD Machado-Joseph disease
  • MS Multiple sclerosis
  • MD Muscular dystrophy
  • ALS Amyotrophic lateral sclerosis
  • ALS also known as Lou Gehrig's disease and Charcot disease
  • ALS is a disorder that involves the death of neurons and is characterized by stiff muscles, muscle twitching, and gradually worsening weakness due to muscle wasting.
  • ALS typically causes difficulty speaking, swallowing, and breathing.
  • ALS patients represent a subgroup of people with fully functional cognitive abilities that gradually lose their speech and the use of their upper limbs.
  • This lack of communication can trigger feelings of detachment and isolation, as well as lead to situations where the patient is distressed and unable to effectively relay essential needs or obtain critical assistance (which in some cases may be life-threatening).
  • One possible approach is to enable an alternative form of communication for such individuals based on characteristics relating to their eyes. For example, the direction or focus of the person's line of sight may be used to provide instructions or otherwise communicate information to their surroundings.
  • There are tools for aiding such alternative non-verbal forms of communication however they are typically relatively expensive, cumbersome and/or immobile (fixed in place). These tools often require the use of a dedicated computer, and are contingent on the stable posture of the person using the tool.
  • These tools can track eye location relative to a display on a screen and translate eye movements to mimic the operation of a mouse cursor on the screen.
  • the focus of the patient is translated to a geographical location of a pointer (or cursor) and the person can control a computer through his/her eyes similar to the manipulation of a computing mouse.
  • the pupil is identified as a point in space and allows the person to control the computer using their eyes similar to the way they would control a computing mouse.
  • Mouse control is not always accurate and locked-in patients typically do not have complete freedom of movement of the head (or other body parts), for example due to attached medical equipment (e.g., a respirator).
  • US 2013/0293488 to Na et al. entitled “Mobile Terminal and Control Method Thereof” discloses a mobile terminal including a camera configured to capture eye gestures of a user and a controller configured to display an eye tracking mode guide based on the captured eye gestures on a display and to select a particular function included in the eye tracking mode guide based on the captured second eye gesture.
  • US 8,593,375 to Maltz entitled “Eye Gaze User” discloses a software controlled user interface for an eye gaze controlled device, designed to accommodate angular accuracy versus time averaging tradeoffs for eye gaze direction sensors.
  • US 6,456,262 to Bell entitled “Microdisplay with Eye Gaze Detection” discloses a microdisplay in which a displayed image element may be selected by gazing upon it.
  • US 2006/0061544 to Min et al. entitled “Apparatus and Method for Inputting Keys Using Biological Signals in Head Mounted Display Information Terminal” discloses a Head Mounted Display where a user inputs a key selected according to the user's biological signals sensed through a biological signal sensing unit having an EOG (Electrooculogram) input unit and an EMG (Electromyogram) input unit for sensing and receiving the biological signals as key inputs.
  • EOG Electrooculogram
  • EMG Electromyogram
  • EyeWriterTM An example of a system that is operated using eye tracking is the "EyeWriterTM” (notimpossiblelabs.com or http://www.evewriter.org/).
  • the EyeWriterTM was developed for the benefit of graffiti artists suffering from ALS.
  • a system for enabling a user to communicate with eye- based feedback includes a relative eye orientation sensor, a selection interface and a processor.
  • the selection interface is configured to selectively present a series of communication options to the user.
  • the relative eye orientation sensor is configured to detect light reflecting from the eye of the user and provide a correlative signal.
  • the processor is communicatively coupled with the relative eye orientation sensor and with the selection interface.
  • the processor is configured to receive and process the correlative signal to determine a relative eye orientation of the user, the relative eye orientation including the relative orientation of at least the pupil of the eye with respect to the head orientation of the user.
  • the processor is further configured to determine a selected communication option based on the determined relative eye orientation, and to provide instructions to implement the selected communication option.
  • the system may further include at least one light source for illuminating the eye of the user.
  • the light source may be an infrared (IR) light source, and the relative eye orientation sensor may detect IR light reflected from the eye.
  • the selection interface may include a visual interface, an audible interface, and/or a tactile interface.
  • the light source and/or the relative orientation sensor may be coupled with a wearable head gear worn by the user.
  • the selection interface may operate on a mobile computing device.
  • the relative eye orientation sensor may include at least one image sensor configured to capture images of the eye of the user, where the processor determines the relative eye orientation by image processing of the captured images.
  • the processor may determine the relative eye orientation by detecting an instantaneous relative orientation of the pupil, or by detecting a transient change of the pupil.
  • the processor may be communicatively coupled wirelessly with the relative eye orientation sensor and/or the selection interface.
  • the communication option may include: an audible alarm, a menu selection, a language selection, a confirmation message, a sentence, a phrase, a word, a syllable, a letter, a mobile computing device activation selection, a visual interface deactivation selection, instruction to send an email/SMS/MMS, and/or instructions to implement a computing application.
  • the visual interface may display at least one communication option by: color- coordination, shape-coordination, and/or symbolic images.
  • a method for enabling a user to communicate with eye-based feedback includes the procedures of: selectively presenting a series of communication options to the user with a selection interface, and detecting light reflected from the eye of the user and providing a correlative signal with a relative eye orientation sensor.
  • the method further includes the procedure of processing the correlative signal to determine a relative eye orientation of the user, the relative eye orientation including the relative orientation of at least the pupil of the eye with respect to the head orientation of the user.
  • the method further includes the procedures of determining a selected communication option based on the determined relative eye orientation, and providing instructions to implement the selected communication option.
  • the method may further include the procedure of illuminating the eye of the user with at least one light source.
  • the eye may be illuminated with IR light, and the relative eye orientation sensor is configured to detect IR light reflected from the eye.
  • a communication option may be presented by the selection interface visually, audibly, and/or in a tactile manner.
  • the procedure of detecting light reflected from the eye of the user may include capturing images of the eye with at least one image sensor, where the relative eye orientation is determined by image processing of the captured images.
  • the relative eye orientation may be determined by detected an instantaneous relative orientation of the pupil, or by detecting a transient change of the pupil.
  • the communication option may include: an audible alarm, a menu selection, a language selection, a confirmation message, a sentence, a phrase, a word, a syllable, a letter, a mobile computing device activation selection, a visual interface deactivation selection, instruction to send an email/SMS/MMS, and/or instructions to implement a computing application.
  • the method may further include the procedure of performing calibration by: comparing consecutive captured images to detect relative changes of the pupil in the eye; marking the pupil in the captured images to track an area of interest signifying the location of the pupil; setting a reference point as the center of a 2D grid and translating eye movements to coordinates on the grid, when determining that the pupil is at the center of the grid, selecting a reference point for building a reference point coordinate system; and determining the range of motion of the user.
  • Figure 1 is a schematic illustration of a system for enabling a user to communicate with eye-based feedback, constructed and operative in accordance with an embodiment of the present invention
  • Figure 2 is an illustration of a settings menu and a words menu displayed on the display of Figure 1 ;
  • Figure 3A is an illustration of a letter group menu displayed on the display of Figure 1 ;
  • Figure 3B is an illustration of the letter group menu of Figure 3A where the image to be displayed on the display has been divided into different sections, and a selection is being made;
  • Figure 4 is an illustration of a letter sub-menu displayed on the display of Figure 1 ;
  • Figure 5A is an illustration of an image of the user's eye with the pupil in an upward quadrant captured by the camera of Figure 1 ;
  • Figure 5B is an illustration of an image of the user's eye with the pupil in a downward quadrant captured by the camera of Figure 1 ;
  • Figure 5C is an illustration of an image of the user's eye with the pupil in a left quadrant captured by the camera of Figure 1
  • Figure 5D is an illustration of an image of the user's eye with the pupil in a right quadrant captured by the camera of Figure 1 ;
  • Figure 5E is an illustration of an image of the user's eye without the pupil captured by the camera of Figure 1 ;
  • Figure 5F is an illustration of an image of the user's eye with the pupil in the center captured by the camera of Figure 1 ;
  • Figure 6 is a flow diagram of a method for enabling a user to communicate with eye-based feedback, operative in accordance with an embodiment of the present invention.
  • the present invention overcomes the disadvantages of the prior art by providing a system and method for allowing a user to communicate by means of eye-based feedback.
  • the user can select a communication option presented to him/her via a visual/audio/tactile interface, by using very basic eye gestures in the form of manipulating the relative orientation of his/her eyes to indicate a general direction that he/she is looking.
  • the system determines the relative eye orientation of the user, representing the orientation of the pupil relative to the eye socket or head orientation, by detecting reflections of infrared light (or other light) from the eye, such as by using an IR light source and an IR camera or other sensor.
  • the present invention thus assists in improving the quality of life of individuals characterized by a difficulty or inability to communicate verbally and/or manually, such as sufferers of locked-in syndrome (LIS), such as ALS patients, while maintaining their cognitive awareness, by providing an elegant, accessible, relatively non-expensive, mobile, alternative form of communication. More generally, the present invention may be used by any individual, including persons who do not exhibit limitations in communicating otherwise. In addition, the system allows for communication and operation of devices while the user is otherwise engaged.
  • LIS locked-in syndrome
  • System 100 includes a primary display 102, an image processor 104, a light source 106, a camera 108, an audio unit 1 12, and a computing device 1 14 with an auxiliary display 124.
  • Processor 104 is communicatively coupled with primary display 102, with camera 108, with audio unit 1 12, and with computing device 1 14.
  • Computing device 1 14 may be communicatively coupled with processor 104 through a wired or wireless communication link 1 16 (e.g., a two-way Bluetooth connection).
  • Light source 106 is configured to emit light 1 18 to illuminate at least one eye of user 150.
  • Light source 106 may be configured to emit non-visible light, such as light in the infrared (IR) spectrum. IR light is particularly desirable because it can be used effectively even in dark rooms with poor lighting conditions, and allows for effective illumination of the eye. Alternatively, other light wavelengths (non-IR light) may be used to illuminate the eye of user 150.
  • Light source 106 typically emits eye-safe light (e.g., in the visible or infrared wavelengths). Eye-safe light can pass through the various optical elements of system 100 in the desired optical path (i.e., transmitted through or reflected from the optical elements in the desired directions), without interfering with the visible light path of the displayed image.
  • light source 106 can be embodied by four IR LED lights that are arranged in an array (e.g., in a square pattern with each one of the LEDs in a respective corner of the square).
  • light source 106 may be embodied by ambient light, such as solar radiation or indoor lighting.
  • Camera 108 is configured to capture images of at least one eye of user 150.
  • the line of sight (LOS) of camera 108 is aligned toward the general direction that user 150 is facing.
  • the field of view (FOV) of camera 108 is related to the possible range of head movements by user 150, while maintaining the eyes within the image frame.
  • System 100 may include multiple cameras 108 with different FOVs, allowing for imaging of a wider overall FOV than would be possible with a single camera 108.
  • Camera 108 may be any type of device capable of acquiring an image representation of a real-world scene, including the acquisition of any form of electromagnetic radiation at any range of wavelengths (e.g., light in the visible or non-visible spectrum, ultraviolet, infrared, radar, microwave, RF, and the like).
  • camera 108 may be a complementary metal-oxide-semiconductor (CMOS) or charge-coupled device (CCD) camera operating in the visible to near infrared (NIR) spectrum.
  • CMOS complementary metal-oxide-semiconductor
  • CCD charge-coupled device
  • camera 108 is operative in a spectral region that corresponds to or at least partially overlaps with, the operational wavelength range of light source 106.
  • the main components of camera 108 are: a filter 1 1 1 , an image sensor 1 10, a lens (not shown), and electronic circuitry (not shown).
  • Image sensor 1 10 may be configured to detect light at selected wavelengths only (i.e., a narrow-band sensor).
  • filter 1 1 1 may be configured to block light outside the desired wavelengths from reaching image sensor 1 10 (e.g., using a bandpass filter).
  • Sensor 1 10 may be, for example, a photodiode, a CCD, or another type of light detection element.
  • Camera 108 is generally operative to acquire at least one image frame, such as a sequence of consecutive image frames representing a video image, which may be converted into an electronic signal for subsequent processing and/or transmission.
  • camera 108 may be embodied by a modified web camera, a digital camera and/or a cellular phone camera.
  • image refers to any form of output from an aforementioned image sensor, including any optical or digital representation of a scene acquired at any spectral region, and encompasses both a single image frame and a sequence of image frames (i.e., a "video image").
  • system 100 may alternatively include a different type of sensor instead of camera 108.
  • system 100 may alternatively include at least one discrete light sensor, arranged individually or in a suitable configuration (such as a sensor array) in relation to user 150. Accordingly, a determination of the relative eye orientation of user 150 may be made based on an analysis of the reflected light detected by such discrete sensors, rather than based on image analysis of camera images, as will be discussed further hereinbelow.
  • Primary display 102 and auxiliary display 124 are configured to display visual information that is viewable by user 150.
  • the displayed information may include graphical control elements with selectable options, such as in the form of graphical menus or tables containing alphanumeric characters and/or symbols (as will be elaborated upon further hereinbelow).
  • Primary display 102 may be any device or mechanism capable of presenting visual information using any suitable format or underlying technology, including but not limited to: a computer monitor, a television monitor, a tablet or mobile phone display, a plasma or LCD screen, a light emitting diode (LED) display, a three-dimensional representation display (e.g., a holographic display), and the like.
  • LED light emitting diode
  • Primary display 102 may be embodied by a head-mounted display (HMD) embedded within a wearable apparatus worn by user 150, such as a helmet, a headband, a visor, spectacles, goggles, and the like.
  • System 100 may alternatively include an audible interface and/or a tactile interface, in addition to or instead of primary display 102 and auxiliary display 124, for presenting information in an audible and/or tactile format.
  • Processor 104 receives instructions and data from the various system components. Processor 104 is also configured to perform image processing and analysis on the image frames captured by camera 108, and to extract information from the image characteristics. Processor 104 may be situated at a remote location from the other components of system 100. For example, processor 104 may be part of a server, such as a remote computer or remote computing system or machine, which is accessible over a communications medium or network. Alternatively, processor 104 may be situated adjacent to user 150 and/or integrated within other components of system 100. Processor 104 may be embodied by a low cost processor with sufficient computing power (e.g., Odroid U3 or Raspberry pi2), and may include a text-to-speech engine and Bluetooth capabilities.
  • a low cost processor with sufficient computing power (e.g., Odroid U3 or Raspberry pi2), and may include a text-to-speech engine and Bluetooth capabilities.
  • Audio unit 1 12 is configured to generate an audible output.
  • the audible output may include different types of sounds of varying tones or intensities, ranging from simple beeps or alarm type audio notifications to more complex speech patterns.
  • System 100 may generally include multiple audio units 1 12, which may be configured to produce multiple forms of audible output.
  • Audio unit 1 12 may be embodied by at least one speaker, such as a pair of headphones.
  • Computing device 1 14 may be embodied by a mobile computing device, such as a smartphone, a tablet computer, and/or a digital music player, or a stationary computing device, such as a desktop computer.
  • a mobile computing device such as a smartphone, a tablet computer, and/or a digital music player
  • a stationary computing device such as a desktop computer.
  • System 100 may include or be used with a support frame (not shown) for supporting elements of system 100 in the vicinity of user 150.
  • the support frame may be embodied by a spectacle or eye-glasses.
  • the support frame may support light source 106 and camera 108 using a strong and flexible moveable arm.
  • the flexible movable arm can be made of a material that is resilient and rigid enough to support camera 108 and yet flexible enough to be bent to adjust the imaging range, location and/or viewing angle of camera 108.
  • a light-weight material may be used.
  • camera 108 may be attached via a wire to a head gear worn by and directed toward the face of user 150, such that camera 108 images at least one eye of user 150.
  • This alignment may be readjusted automatically or manually to maintain alignment of the eyes during shifting head movements of user 150 or to accommodate other physical requirements of user 150.
  • LIS patients may be affixed to ventilators or other medical equipment
  • the maneuverability of the support frame can allow for such patients to utilize system 100 conveniently, while minimizing the interference or constraints resulting if system 100 were to be completely fixed in place.
  • the support frame may also be adjustable and/or adapted to meet the requirements of particular users 150, such as to be accessible by people with different types of respirators.
  • An alternative technique for providing stability and support is by attaching elements of system 100 to an adjacent counter, a vehicle or a wheelchair.
  • the components and devices of system 100 may be based in hardware, software, or combinations thereof. It is appreciated that the functionality associated with each of the devices or components of system 100 may be distributed among multiple devices or components, which may reside at a single location or at multiple locations. For example, the functionality associated with processor 104 may be distributed between multiple processing units (such as a dedicated image processor for the image processing functions).
  • System 100 may optionally include and/or be associated with additional components not shown in Figure 1 , for enabling the implementation of the disclosed subject matter.
  • system 100 may include a power supply (not shown) for providing power to the various components, and may further include a memory or storage unit (not shown) for temporary storage of image frames or other types of data.
  • system 100 will now be discussed, for exemplary purposes, in the context of a user 150 characterized by a difficulty or inability to verbally communicate, such as an LIS patient.
  • User 150 views visual information presented on primary display 102 and/or auxiliary display 124, such as multiple command selections.
  • Camera 108 captures images of the eyes of user 150 which are illuminated by light source 106, and processor 104 receives and processes the captured images to determine the relative eye orientation of user 150.
  • the relative eye orientation represents a general direction that user 150 is looking (e.g., "up”, “down”, “left”, “right”, “straight ahead") relative to the direction that he/she is facing which generally corresponds to the orientation of the head.
  • the relative eye orientation can be considered as the orientation of the pupil within the eye socket or the pupil orientation relative to the head orientation of the user.
  • processor 104 compares the captured image of the eye with at least one previous captured image, to determine the instantaneous relative orientation of the pupil (with respect to the eye socket or head orientation of user 150) and/or to detect transient changes in the pupil orientation.
  • processor 104 processes the reflection signals from the eye of user 150 detected by discrete light sensors, such as discrete IR sensors, rather than images captured by camera 108.
  • processor 104 may compare the respective detections of multiple discrete sensors arranged at different positions around the eye of user (e.g., four sensors evenly distributed around the perimeter of the eye), and determine a relative eye orientation of user 150 based on characteristics of the sensor detections (e.g., which sensor(s) detected light reflected from the eye; the intensity and/or angle of arrival of the detected reflections), as well as the relative proximity of the pupil to the respective sensors.
  • characteristics of the sensor detections e.g., which sensor(s) detected light reflected from the eye; the intensity and/or angle of arrival of the detected reflections
  • Display 102 presents a plurality of options via an option selection table or menu.
  • User 150 indicates an option to select by performing an eye gesture such that his/her relative eye orientation correlates to the relative direction of that option as presented on display 102.
  • processor 104 determines the relative eye orientation of user 150, and determines the option selected by user 150 based on the determined relative eye orientation. For example, if the relative eye orientation is determined to be "right" (i.e., the pupil orientation is at the right side within the eye socket of user 150), then the option displayed at the right side of the menu is considered to be selected.
  • the user selection is determined based on only a relative eye orientation that corresponds to a very general (imprecise) direction that user 150 is looking (e.g., "right”, “left”, “up”, “down”, “straight ahead"), without requiring the determination of a precise gaze direction or line-of-sight of the user (i.e., without determining a particular point or spatial coordinates at which user 150 is looking at in relation to a particular frame of reference, such as determining a unique coordinate position on display 102).
  • a relative eye orientation that corresponds to a very general (imprecise) direction that user 150 is looking (e.g., "right”, “left”, “up”, “down”, “straight ahead)
  • a precise gaze direction or line-of-sight of the user i.e., without determining a particular point or spatial coordinates at which user 150 is looking at in relation to a particular frame of reference, such as determining a unique coordinate position on display 102).
  • Processor 104 provides instructions to display 102, audio unit 1 12, and/or computing device 1 14 to perform at least one action based on the determined selection of user 150. These actions can be, for example: to audibly output a word, letter, phrase, sentence or alarm warning siren; to display a different menu screen or different menu option(s); to display text; to generate and send a text or e-mail message; and/or to execute a particular application on computing device 1 14.
  • the selection process of system 100 will now be further described with reference to Figures 2 to 4.
  • the upper menu portion 200 is a "settings menu”, which is arranged to allow user 150 to select or alter a setting parameter of system 100.
  • the lower menu portion 202 is a "word menu”, which is arranged to allow user 150 to select a specific word or series of words (a phrase) from a predetermined set of key words and phrases.
  • One or more setting options 200A, 200B, 200C, 200D, 200E and one or more word options 202A, 202B, 202C 202D, 202E are displayed on display 102, 124 at a given instant. Selection of an option may prompt a sub-menu with additional options to be displayed.
  • option 200A represents a "language option” that leads to a sub-menu of different languages to choose from.
  • Option 200B represents a "standby option” that switches system 100 into a standby (i.e., rest or low power) mode of operation until reactivated by user 150, such as by performing a specific eye gesture or indication.
  • Option 200C represents a "help option” that leads to a sub-menu of different options designed to help user 150 (e.g., explanations of how to use certain aspects or elements of system 100).
  • Option 200D represents an "alert option” that triggers audio unit 1 12 to emit a warning or alert signal upon selection.
  • Option 200D is especially useful when user 150 is in distress and/or a life-threatening situation.
  • the warning or alert siren may also be activated by user 150 performing a specific eye gesture or indication, such as by winking four times in rapid succession.
  • Option 200E represents a "connect to mobile option” that establishes a communication link 1 16 between processor 104 and computing device 1 14, allowing user 150 to execute program applications on computing device 1 14.
  • option 202A represents the word: "hello”
  • option 202B represents the word: “goodbye”
  • option 202C represents the word: “thanks”
  • option 202D represents the word: "help”.
  • selection of option 202D may cause audio unit 1 12 to emit a warning signal or alert siren, similar to settings option 200D.
  • Option 202E represents the word: "application”.
  • selection of option 202E may direct processor 104 to establish a communication link 1 16 with computing device 1 14 to allow the user to execute program applications on computing device 1 14, similar to setting option 200E.
  • only a single option from either "settings" menu portion 200 or "words" menu portion 202 is displayed at a given time on display 102, 124, allowing user 150 to either select the currently displayed menu option or to continue viewing other available menu options, by implementing suitable eye movements or gestures.
  • individual options from each menu portion 200, 202 may be presented sequentially in a cyclic manner, in accordance with the relative eye orientation of user 150.
  • user 150 may initially select "settings menu” 200 by looking in an upwards direction (U).
  • L1 leftwards
  • R1 rightwards
  • display 102, 124 initially presents only settings options 200A and 202A.
  • user 150 can direct system 100 to display settings option 200E (e.g., in place of settings option 200A).
  • display option 200B e.g., in place of option 200A.
  • user 150 can select "words menu” 202. Subsequently, user 150 may sequentially cycle through the "words options" on display, such as by looking left (L2) to cycle through the words options in a first sequential order or by looking right (R2) to cycle through the words options in a second sequential order. For example, by looking down (D) and then toward the left (L2), user 150 can direct system 100 to display word option 202E (e.g., in place of option 202A). Alternatively, by looking down (D) and then toward the right (R2), user 150 can direct system 100 to display option 202B (e.g., instead of option 202A). It is noted that alternative directional configurations are equally applicable.
  • user 150 may alternatively sequentially cycle through menu options by directing his eye gaze upwards and downwards rather than leftwards and rightwards, and display 102, 124 may present different menu portions 200, 202 side-by-side rather than in a top-and-bottom configuration.
  • User 150 can use a predetermined eye-based gesture or indication, such as closing their eye (winking/blinking), or gazing in a specified direction, such as at diagonal or slanted angle (e.g., a 45° angle) relative to a forward eye orientation, to select an option from one of the menus 200, 202.
  • User 150 may select a suitable eye-based gesture to be used during an initialization process of system 100, so as to ensure that he/she can successfully implement the selection process (for example, some users may have difficulty blinking and may thus select an alternative gesture).
  • the selected gesture may be a preselected default option so as to preclude the need for preliminary setup or initialization of system 100 prior to use.
  • user 150 can look down (D) and then toward the left (L2) and/or right (R2) to cycle through the different options of word menu 202 until he/she arrives at the desired option, e.g., option 202C, and then look down (D) and gesture (e.g., winking) to select option 202C (the word "thanks").
  • Selection of a word in the word menu 202 will direct audio unit 1 12 to provide an audible indication of the selected word.
  • selecting option 202C will cause those in the vicinity of user 150 to hear the word "thanks" audibly emitted from audio unit 1 12.
  • a temporary visual indication such as highlighted or emboldened text or a colored frame, may be applied to the selected option, such as to indicate that an option has been initially or tentatively selected but not yet validated by user 150.
  • a confirmation eye-related gesture or indication (such as blinking twice) may then be required in order to validate the user selection, before implementing the relevant functionality associated with the selected option.
  • shapes or colors may be used to assist with differentiating among the various options in the presented menus.
  • a tentative user selection may be indicated audibly, such as by directing partial audio output to user 150.
  • the menu option that is initially (tentatively) selected is broadcast only through a single channel of audio headphones being worn by user 150, providing a monophonic sound reproduction (e.g., by broadcasting only to the right ear / right channel of the headphones).
  • the selected word is validated by an additional eye-related indication or gesture (e.g., blinking), which causes an appropriate sound (or sounds) associated with the selected option to be broadcast over audio unit 1 12, as well as optionally over the other audio channel of the headphones worn by user 150 (e.g., the left ear / left channel of the headphones).
  • an additional eye-related indication or gesture e.g., blinking
  • Audio unit 1 12 may alternatively present the available selection options to user 150 in an audible format, instead of (or in addition to) the visual presentation by display 102, 124.
  • the list of settings options of settings menu 200 and/or word options of words menu 202 may be broadcast to user 150 through headphones, such that user 150 hears the available options, in conjunction with an audible indication of the different eye-related gestures associated with each option.
  • the user 150 may hear the following audio sequence to prompt a user selection: "left - hello, right - goodbye, up - yes, down - no, blink - repeat menu".
  • user 150 may simply hear a single option, e.g., "language”.
  • the option(s) may be broadcast only to user 150.
  • User 150 may move through different option menus or select an option by implementing designated eye-related gestures. For example, user 150 may cycle between different word options using glances to either side (e.g., right or left, up or down), and indicate a selection using a blinking or winking gesture. Following the user selection, the associated audio output may be publically broadcast to the surroundings.
  • side e.g., right or left, up or down
  • the associated audio output may be publically broadcast to the surroundings.
  • the selection interface of system 100 can be a visual interface (a visual menu presented on display 102, 124), an audible interface (an audible menu provided by processor 104 and audio unit 1 12) and/or a tactile interface (not shown).
  • system 100 can present selection menus corresponding to settings menu 200 and words menu 202 (Fig. 2) even without displays 102, 124. It is appreciated that the elimination of a display may be particularly beneficial in the context of a medical institution, by allowing a recovering patient (e.g., after a surgery) to request a nurse or to communicate with family members, similar to a distress device that can be placed next to a patient's bed.
  • display 102, 124 may alternatively selectively display only a single menu portion 200, 202 at a given instant, while presenting a selectable option on each of the menus 200, 202 that would allow for displaying the other menu 200, 202 instead (or an alternative available menu that is not currently being displayed).
  • settings menu 200 may include a "switch to words menu” option (not shown), and correspondingly words menu 202 may include a "switch to settings menu” option (not shown), allowing for toggling between the two respective menus 200, 202.
  • Such a display configuration may serve to simplify the selection process by limiting the eye movements of user 150 to only the left or right directions (and obviating the need for upward and downward eye movements). It will be further appreciated that the ability to sequentially cycle through the displayed menus 200, 202 by merely shifting his/her gaze toward a general direction (e.g., left or right) allows user 150 to access all of the available menu options relatively easily and quickly. For example, if a particular menu consists of ten (10) possible options, any one of the options can be reached within five (5) "cycles" of the menu (e.g., in the example of Figure 2 where words menu 202 includes five options, then option 202D can be reached starting from option 202A by cycling either twice to the left or three times to the right).
  • the vocabulary provided in words menu 202 can be limited to a certain number of pre-determined words that were found to have high importance and high incidence to the target audience or user in question.
  • the available word options of a given words menu 202 may also be preselected and/or edited by user 150, partially or entirely.
  • the respective audio outputs associated with the words menu 202 vocabulary may be previously generated prior to the operation of system 100 (e.g., by pre-recording various audio files corresponding to spoken words and/or phrases), or may be generated in real-time (e.g., via text-to-speech or phonetic transcription speech synthesizers).
  • a given menu option may represent a different sub-menu that can be displayed, such as words menu 202 including a menu option representing a letter group menu.
  • selection of a connect to mobile option 200E, 202E on words menu 202 may prompt processor 104 to form a communication link 1 16 with computing device 1 14 which is executing an application that includes a different menu or sub-menu, e.g., a letter group menu.
  • Figure 3A is an illustration of the exemplary display screen 300A on display 102, 124 presenting an exemplary letter option menu, generally referenced 300, that includes different letter group options 302, 304, 306, 308, 310 for selection.
  • Letter group menu 300 may be used for composing a written text or message, or for spelling out a word to be communicated or added to words menu 202.
  • option 302 represents the group of letters: "ABCDE”
  • option 304 represents the letters: "FGHIJ”
  • option 306 represents the letters: "KLMNO”
  • option 308 represents the letters: "PQRST”
  • option 310 represents the letters: "UVWXY”.
  • Letter group options 302, 304, 306, 308, 310 are presented in a cross configuration, with each letter group located on a separate region of display 102, 124 (i.e., top, bottom, left, right, or middle).
  • user 150 may select an available option of letter group menu 300 using only relative eye orientation and/or eye-related gestures. For example, while viewing letter group menu 300 on display 102, 124, user 150 may gaze upward (U) to select option 302; may gaze leftwards (L) to select option 304; may gaze rightwards (R) to select option 308; or may gaze downwards (D) to select option 310.
  • User 150 may select the middle option 306 by performing a predetermined eye-related gesture or indication, such as closing one or both eyes (winking/blinking), or gazing in a diagonal or slanted angle relative to a forward relative eye orientation (e.g., a 45° angle).
  • a predetermined eye-related gesture or indication such as closing one or both eyes (winking/blinking), or gazing in a diagonal or slanted angle relative to a forward relative eye orientation (e.g., a 45° angle).
  • the options could also be communicated via audio unit 1 12 announcing the aforementioned letters.
  • Figure 3B is an illustration of the exemplary display screen 300B on display 102, 124 presenting the exemplary letter group menu 300 of Figure 3A divided into different sections, where a selection of the letter "G" is being made.
  • a selection of a letter group option (e.g., 302 or 304) of letter group menu 300 may then result in displaying another sub-menu representing the individual letters of the selected letter group (i.e., 400).
  • a letter group option e.g., 302 or 304
  • display 102, 124 displays the associated ABCDE letter submenu.
  • Figure 4 is an illustration of the exemplary display screen 400A on display 102, 124 presenting the ABCDE letter sub-menu, generally referenced 400, that includes different individual letter options 402, 404, 406, 408, 410 for selection.
  • Letter submenu 400 corresponds to the selected letter group option 302 ( Figure 3A).
  • option 402 represents the letter "A”
  • option 404 represents the letter “B”
  • option 406 represents the letter “C”
  • option 408 represents the letter “D”
  • option 410 represents the letter "E”.
  • individual letter options are also presented in a cross configuration, with each letter group located on a separate region of display 102, 124 (i.e., top, bottom, left, right, or middle).
  • User 150 may select an available option of individual letter submenu 400 using relative eye orientation and/or eye-related gestures.
  • user 150 may gaze upwards (U) to select option 402; may gaze leftwards (L) to select option 404; may gaze rightwards (R) to select option 408; and may gaze downwards (D) to select option 410.
  • User 150 may select the middle option 406 by performing a predetermined eye-related gesture or indication, such as closing one or both eyes (winking/blinking).
  • a selection of an individual letter of sub-menu 400 may then result in a visual indication of the selected letter displayed on display 102, 124 (e.g., by highlighting or presenting a visual border around the selected letter), which may require confirmation by user 150.
  • an audio indication of the selected letter may be generated by audio unit 1 12.
  • the selected letter may further be added to a text message or text-based instruction being composed by user 150. For example, looking up and selecting option 402 directs audio unit 1 12 to emit the sound of the letter "A", while providing a visual indication of the displayed letter "A” on display 102, and entering the letter "A" into the text of a message being composed by user 150.
  • a completed word may be added and saved to words menu 202.
  • letter group menu 300 or individual letter menu 400 may be alternatively configured to display any number of options (and not necessarily five options). For example, if user 150 experiences difficulties in implementing a suitable eye gesture to indicate the selection of the "middle" option, only four options may be displayed. In this manner, the visually presented menu can be configured to obtain the user selection based solely on a more standard selection of relative eye orientation (e.g., up, down, left or right), while precluding the need for any additional eye- related gestures.
  • the middle region of a menu that includes only 4 options may be left empty, or may display the title of the menu (i.e., not an option that can be selected).
  • system 100 may designate four diagonal directions (up-right, up-left, down-right, down-left) and an optional straight direction (e.g., straight down) to indicate a selection, such as for individuals who have difficulty controlling their blinking or experience involuntary blinking.
  • the different menu options may also be displayed with unique visual effects or characteristics, so as to provide additional visual differentiation to each displayed menu option, in addition to their different relative locations on display 102, 124.
  • each menu option can be displayed in a different color shape or style of border, so that the different options can be more easily recognized, identified and distinguished from one another by user 150.
  • a menu may include a "return option" to enable user 150 to go back to a previous selected option, to a previously displayed menu or a preset determined action.
  • system 100 may be configured such that user 150 can be automatically returned to a prior menu or a default menu following the final selection of a menu option.
  • an indication may be provided to the selected option to indicate that the option has been initially selected.
  • the indication may be, for example, a highlighted or emboldened text, a colored frame, and/or an audible indication emitted in at least one ear of the headphones worn by user 150.
  • System 100 may then wait for a designated confirmation gesture (such as blinking twice) to validate or confirm the selection of the selected option, thereby providing a visual and/or audio indication informing user 150 that an option has been selected and/or which option has been selected.
  • a designated confirmation gesture such as blinking twice
  • An eye-related gesture for implementing a selection of an option by user 150 can include a sequence of individual actions or gestures. Such a sequence may also allow for the direct selection of an available option in a sub-menu associated with the presented menu while bypassing the presentation of the sub-menu.
  • user 150 can select an individual letter by performing two consecutive predetermined eye-related gestures while viewing "letter group menu" 300 of Figure 3A, without being displayed the "individual letter sub-menu” 400 of Figure 4.
  • the first gesture would select a particular letter group (e.g., letter group 308), and then the second gesture, which may be based on the color or location of an individual letter within the letter group, selects a specific letter (e.g., the letter "S”), thereby bypassing the displaying of individual letter sub-menu 400.
  • the individual letters inside each displayed letter-group 302, 304, 306, 308, 310 may be color coded to correspond with a respective direction, which may be indicated by a color indicator displayed in the relevant direction.
  • the first letters of each letter-group: "A”, “F”, “K”, “P”, and “U”, may all be depicted in one color (e.g., orange), and an orange color indicator may be displayed on display 102, 124 represented by an arrow facing a certain direction (e.g., up).
  • an orange color indicator may be displayed on display 102, 124 represented by an arrow facing a certain direction (e.g., up).
  • user 150 selects option 304 (i.e., the entire letter group: "FGHIJ"), and by subsequently looking upward (UP) (i.e., towards the orange indicator direction), user 150 selects a specific letter (e.g., the letter "F”) from among letter group 304.
  • the colors used to display selected menu options can be specific colors that are easily discernible and highly conspicuous, to ensure their perception even by users 150 characterized by impaired vision or perception capabilities.
  • the graphical elements of menu items displayed by display 102, 124 is not limited to text or alphanumeric characters, but can also include, for example, various types of symbols, icons or images representing various actions or selectable options, such as: a "spacebar icon”, a "backspace/delete icon”, a "back or return to the main menu icon”, and the like.
  • user 150 can be provided with an option menu with a plurality of images, symbols or pictorial drawings, e.g., a plate of food, a glass of water, a car, a wheelchair, and the like.
  • the selected drawing can be displayed to the user's surroundings on an external screen, such as auxiliary display 124, or sent in a message to another device (e.g., to a mobile phone of a friend or a nurse).
  • selection of a drawing can result in an appropriate audio output, e.g., selection of the drawing of a plate of food would result in the audible output of the word "food”.
  • System 100 can include word prediction applications to help forecast and suggest complete words based on the first few letters selected by a user. For example, if user 150 selects the following letter sequence: "H-E-L", system 100 can suggest the words: "help” or "hello", by presenting the suggested words on display 102, 124 or via audio unit 1 12. System 100 may even utilize entire word or sentence completion based on context determined by processor 104. User 150 can cycle through different suggested complete words on display, such as by looking towards the left or right. Selection of a suggested word by user 150 can be performed with a suitable eye-related gesture or indication.
  • the selection of a word by user 150 may be hastened and the accuracy improved (by precluding the need for user 150 to continue selecting the additional letter and avoiding subsequent inadvertent incorrect letter selections by the user).
  • the completed words can be indicated audibly or visually, and stored or transferred as an audio file and/or text file, as applicable.
  • the word prediction feature may be based on language modeling and vocabulary libraries, and may be adaptive such as by learning the language habits of a user to improve future suggested word completions associated with that user.
  • a word menu can include different options representing actual words or categories of words classified by topic or subject matter, (e.g., "food”, “clothing”, “colors”, and the like).
  • the selection of an initially displayed word option can result in the displaying of a sub-menu with logical word options that follow from the initially selected word.
  • a word menu may initially present five different categories. When a particular category is selected, a sub-menu of five new words within that category is then presented. Each of these new words may lead to further subcategories, and so forth (e.g., "clothing"-"shirt”-”dress shirt”).
  • a substantially large vocabulary of potential words is made available to user 150. For example, assuming each word menu screen presents a total of 5 options, user 150 can potentially access over three thousand (3000) different words using only five (5) eye-related gestures (i.e., when each option on the first four menu screens represents a subcategory of word options).
  • System 100 can also adaptively learn preferences and habits of user 150, and amend or update the available words or options of different menu screens accordingly. By prioritizing preferred or frequent words or options previously selected by the user, the operation of system 100 may become more efficient and more intuitive.
  • System 100 may further include an interface (not shown), such as via the internet, an input device coupled with processor, or an application of computing device 1 14, to allow a particular user (or an administrator), to customize a particular menu for that user. Such menu customization allows for individual users, or a particular group of users, to select from words that are more relevant or more likely to be frequently accessed by such users/groups.
  • FIGS 5A to 5F are schematic illustrations of exemplary images of an eye of user 150 captured by camera 108 of system 100 of Figure 1 .
  • Each image (500A, 500B, 500C, 500D, 500E, 500F) is divided into four quadrants: upper quadrant 502 which corresponds to user 150 looking up; lower quadrant 504 which corresponds to user 150 looking down; left quadrant 506 which corresponds to user 150 looking right; and right quadrant 508 which corresponds to user 150 looking left.
  • Processor 104 determines the coarse or general location of the eye pupil in the captured image.
  • processor 104 may determine in which region or quadrant the pupil is most likely located (i.e., in which the majority of the pupil is located), or if the pupil is in or near the center of the image. If no pupil is located, then processor 104 may determine that the eye is blinking or winking.
  • the processor 104 determines the corresponding location on display 102, 124 (up, down, left, right, or center) and hence the selected/desired option of a menu (200, 300, 400) being displayed.
  • the pupil P is located in upper quadrant 502, which corresponds to option 402 of individual letters menu 400 ( Figure 4).
  • the pupil P is located, respectively, in lower quadrant 504, left quadrant 506, and right quadrant 508, corresponding to respective letter options 410 ⁇ ", 404 "B", and 408 "B" of individual letters menu 400 (pupil P being located in left quadrant 506 corresponds to the user looking toward the right, and pupil P being located in right quadrant 508 corresponds to the user looking toward the left (i.e., the captured image mirrors the user's gesture).
  • processor 104 may determine that user 150 is winking (or blinking), which may correspond to a selection of the centrally displayed option on the menu screen (e.g., option 406 of menu 400) or the selection or confirmation of an option (e.g., the letter 410 ⁇ ").
  • processor 104 may determine that user 150 is looking forward (not toward any of the above-mentioned directions), which may correspond to a standby mode where system 100 waits until user 150 looks in a direction or performs an eye gesture.
  • the actual number of divisions may vary and is not limited to quadrants.
  • the image can be divided into additional or fewer sections during the image processing for identifying the pupil location (and relative eye orientation).
  • processor 104 may divide a captured eye image into only two sections (e.g., a left section and a right section, divided by a central vertical divider), to determine only left and right locations of the pupil.
  • processor 104 may divide a captured eye image into upper and lower section (i.e., via a central horizontal boundary), to determine only up and down locations of the pupil.
  • the eye image can be divided into eight (8) sections (resembling a hashtag or number sign), to requiring more accurate determination of the angular positioning and/or movement of the pupil and corresponding relative eye orientation.
  • Dividing the image into a large number of sections when processing to identify the pupil location may improve the determination of pupil location/relative eye orientation at various diagonal or slanted angles relative to a forward gaze of the user (e.g., at a 45° angle relative to a forward gaze direction of user 150).
  • processor 104 may divide an eye image into different sections, portions, or segments correlating to the particular sections or portions of display 102, 124 on which particular menu options are being presented.
  • letter group menu 300 and individual letter menu 400 present the respective menu options in a "cross" shaped configuration, and so processor 104 may divide each processed image in a similar cross-shaped manner, so as to facilitate determination of the pupil location representing which menu option is being viewed by user 150.
  • Processor 104 may also track or locate other relevant eye parameters (e.g., the cornea), in addition to or instead of the eye pupil P, for determining the relative eye orientation of user 150.
  • Processor 104 may also track the location of the pupil P over a sequence of captured images.
  • processor 108 may utilize a reference image (500REF), and then compare a subsequent image (or a sequence of subsequent images) with the reference image 500REF to determine if the pupil P location has shifted relative to its location in the reference image 500REF, as well as the measured amount of movement.
  • Such processing may be based on a calculated image filter to detect and measure differences from the original reference image and the subsequent images. It is further noted that determining only the relative eye-orientation direction of user 150, as well as tracking the eye pupil location over a sequence of captured images, serves to reduce the need to stabilize or compensate for movements of camera 108 (e.g., camera motions relative to the user eye between images) or compensating for changes in background. Thus, a calibration process may not be required.
  • An exemplary general calibration process is as follows.
  • the first step is comparing consecutive images to detect relative changes by performing screen motion detection on the geographic area of the eye (including the pupil region). According to heuristics derived from the shape of the image, it can be assumed that the areas that change in the image represent areas of interest.
  • the second step is marking the pupil circle as a point in space in the captured images, to follow the area of interest that signifies the location of the pupil P.
  • the third step is setting a reference point as the center of a two-dimensional (x-y) axis grid and translating the eye movements to (x,y) coordinates on the grid.
  • the calibration process determines, with a high enough probability, that pupil P is in the middle (i.e., close to the center [0,0] of the grid), a reference point is selected according to which the reference point coordinate system (grid) is built.
  • the (x,y) coordinates represent the movement of the eye pupil P in each of the four directions (up, down, left and right).
  • the fourth step may determine the range of motion of the user.
  • the calibration process may adaptively learn the particular range of motion associated with different users. Different users are characterized by different physical features, including different eye and facial features.
  • the calibration process of system 100 may adaptively learn the range of movement of pupil P on the x-y axis (grid) for each user, and adjust the range set threshold (beyond which is sent a signal for a certain direction of movement of the pupil), as necessary.
  • Non-detection of pupil P for a minimum duration e.g., a few milliseconds
  • the amount of time can be adjusted so as to prevent inadvertent identification of a selection, such as one based on the natural blinking of a user.
  • the automatic calibration process (without prompting the user) of system 100 obviates the need for external (third party) assistance for each calibration.
  • the system (algorithm) identifies the pupil P and understands the relative eye orientation.
  • Pupil detection capability includes identifying and comparing the location and range of motion and reliable detection of the presence/non-presence of the pupil P in an image 500.
  • multiple object detection algorithms and their combinations may be used to deal with clarity issues and relatively heavy computation requirements.
  • a dedicated calibration algorithm allows the system to calibrate itself automatically.
  • a non-automatic calibration algorithm can be used that does not use any object recognition tools (feature detection). That is, the algorithm does not identify the exact location of the eye (pupil P), but compares images captured during system operation to those captured during a manual calibration.
  • System 100 may operate a generic factory calibration, performing a single initialization process prior to the operational session of system 100.
  • the calibration (or initialization) may be implemented automatically using different filters or predetermined photographs designed for each of the potential relative eye orientations (e.g., up, down, left, right and center) that need to be identified during regular usage throughout an operational session.
  • Processor 104 may apply the filters to the images 500, and determines the likelihood that a current processed image 500 is comparable to reference images 500REF.
  • Reference images 500REF corresponding to each base situation or state.
  • the system may grade the captured images based on the reference images 500REF and choose the option with the highest grade.
  • System 100 may also operate with self-calibration or "automatic calibration", i.e., that does not require prompting the user to initiate, but rather is performed automatically during the course of an operational session without the conscious knowledge of the user.
  • System 100 may determine when and how often to perform a calibration process automatically, in accordance with relevant criteria. Automatic calibration can be performed when a predetermined condition is met, such as when the system is able to identify the pupil in an image.
  • System 100 may also operate with user or manual calibration with user 150 by providing an audio or a visual instruction, e.g., "Please Look Right", before capturing a reference image 500REF.
  • An audio indication such as resembling the click of a camera shutter, may be provided after the image has been captured, to notify user 150 that the image has been captured.
  • the reference image 500REF similarly may be analyzed to locate the pupil and determine relative borders of the eye of user 150 or a determined boundary area that the pupil will approach when moving from one position to another (i.e., from "looking left” to “looking right”). This boundary area can further be used to trigger relative eye- orientation responses by user 150 as the pupil is observed or measured to cross the boundaries, resulting in processor 104 providing an appropriate response accordingly.
  • the captured image can be fed to a noise filter (e.g., a Markov Signal Noise filter), to reduce noise and enhance the quality of the image processing.
  • a noise filter e.g., a Markov Signal Noise filter
  • the output for each image 500 is determined as: right, left, up, down, closed eye, center, or no identification.
  • the final output (in this case one of five different options) can be based on additional criteria, such as, timing rules. For example, if no pupil is identified for longer than a threshold duration, such as more than approximately 0.2 seconds, then the determination is 'blink' or closed eye, and this determination could be maintained until a following image frame in which the pupil is again located or identified (i.e., movement of the pupil is detected) or to be registered as a unique action as previously mentioned.
  • the user instructions are determined solely from his/her relative eye orientation corresponding to an imprecise general eye direction (e.g., "right”, “left”, “up”, “down”, “straight ahead"), rather than a precise gaze direction or line-of-sight.
  • an imprecise general eye direction e.g., "right”, “left”, “up”, “down”, “straight ahead”
  • system 100 can be considered to operate in a manner resembling that of a joystick, which applies an imprecise general directional determination to provide an indication of the user selection, as opposed to a screen-dependent mouse tracking mechanism where pinpoint location is required.
  • FIG. 6 is a flow diagram of a method for enabling a user to communicate with eye-based feedback.
  • calibration is performed. Calibration of the system components may be performed only once (e.g., before the initial operation of system 100), or may be performed at regular intervals (during the operation of system 100), as necessary. Calibration may be performed using only a single eye-reflection image captured by camera 108, or by comparing and analyzing a large number of such images. Furthermore, the calibration may be adapted uniquely for specific users (i.e., for the particular user currently using system 100), or alternatively a generic calibration applicable for a large number of different users may be implemented (e.g., during an early initialization stage of system 100).
  • a series of communication options is selectively presented to a user with a selection interface.
  • display 102 (and/or display 124) displays a menu 300 with a variety of visual menu options 302, 304, 306, 308 and 310 for the user to choose from.
  • the communications options may alternatively be presented via an audio or a tactile interface.
  • at least one eye of the user is illuminated with a light source. Referring to Figure 1 , light source 106 emits IR light 1 18 toward the eyes of user 150.
  • a correlative signal is provided by a relative eye orientation sensor.
  • image sensor 1 10 receives IR light 1 18 reflects from the eye of user 150 and relays a correlative signal to processor 104.
  • the eye orientation sensor may alternatively be embodied by a non-image sensor, such as by one or more discrete sensors or a sensor array.
  • image sensor 1 10 generates image 500A of the region around the eye of user 150, where pupil P is located in an upper quadrant region of image 500A.
  • the correlative signal is processed to determine a relative eye orientation that includes the relative orientation of the pupil with respect to the head orientation of the user.
  • processor 104 determines a relative eye orientation of user 150 based on the images captured by image sensor 1 10 (or by using other eye reflection data captured by a non-image sensor).
  • the determined relative eye orientation represents the relative orientation of the pupil with respect to the head orientation of user 150 (where the head orientation represents the general direction that user 150 is facing).
  • processor 104 determines from image 500A that pupil P is located in an upper quadrant region 502, and therefore the relative eye orientation of user 150 is an "up" position.
  • Processor 104 may determine the relative eye orientation of user 150 by detecting an instantaneous relative orientation of pupil P or by detecting a transient change of pupil P.
  • a selected communication option is determined based on the determined relative eye orientation of the user.
  • processor 104 selects an option presented on display 102 (and/or by audio unit 1 12), in accordance with the determined relative eye orientation of user 150. For example, if processor 104 determined from image 500A that the relative eye orientation of user 150 is an "up" position, processor 104 then selects communication option 302 ("ABCDE") presented on menu 300.
  • processor 104 instructs audio unit 1 12 to provide an audible indication associated with the selected communication option.
  • audio unit 1 12 broadcasts a word selected by user 150, such as the word "hello" associated with word option 202A of words menu 202.
  • processor 104 may instruct display 102 to present a visual indication associated with the selected communication option. For example, display 102 highlights the selected menu option, such as the "hello" word option 202A.
  • processor 104 may instruct computing device 1 14 to implement a selected application or perform an action associated with the selected communication option. For example, computing device 1 14 sends a message or e-mail, such as following the selection of the "email application" option 202E of words menu 202.
  • the processor 104 may translate the eye movements and broadcast operations through a wireless link (e.g., Bluetooth) to a computing device 1 14 (e.g., a smartphone) present in the area, and able to operate different applications which may be used by user 150 to communicate with his/her environment.
  • Computing device 1 14 can be used to send text messages, execute different applications and/or control a computer (using programs designed for control using selection tables or menus).
  • processor 104 may continue to transmit and receive data to/from computing device 1 14 over communication link 1 16.
  • system 100 can be used to exert control on an external device or apparatus, such as, a motorized wheelchair or computerized home appliances.
  • a user may direct or navigate a motorized wheelchair by indicating a selected direction (e.g., forward, back, left, right) using his/her relative eye orientation (e.g., up, down, left, right).
  • the user may provide the indication by looking toward a particular menu option of a "motorized wheelchair navigation" menu presented on display 102, 124.
  • System 100 determines the indicated selection (e.g., turn wheelchair left), and triggers the motorized wheelchair to perform the selected option.
  • System 100 may be communicatively coupled with such external devices via a wireless communication link (e.g., communication link 1 16).
  • user 150 may alternatively by an individual who is vision impaired or who lacks full eye sight capabilities, as system 100 provides eye-based communication by determining only the relative eye orientation of user 150, which does not require the full use of vision or even a satisfactory level of eye sight to be attributed to user 150.
  • the present invention thus provides a relatively affordable and accessible solution with easy and intuitive operation that helps improve the quality of life of individuals with special needs, by providing means for effective communication with their environment.
  • the system is mobile, portable and suitable for use in many situations and positions: lying, sitting, head hanging down, and the like. Furthermore, the system of the present invention does not require third party assistance for calibration.
  • the system and method of the present invention is not limited to individuals characterized by locked-in syndrome (LIS), but is equally applicable for use by other types of people as well, including those who are vision impaired.
  • the system and method of the present invention may be integrated with an application on a computing device, such as a smartphone, tablet computer, or a digital music player, which includes at least a sensor for detecting light reflections from the eye (such as an embedded camera).
  • the present invention may thus facilitate and initiate various communication actions through the computing device, such as composing and sending a message or initiating a phone conversation with a third party, based solely on instructions provided by the user by means of eye gestures correlating to his/her relative eye orientation.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Medical Informatics (AREA)
  • Public Health (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Surgery (AREA)
  • Animal Behavior & Ethology (AREA)
  • General Health & Medical Sciences (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Veterinary Medicine (AREA)
  • Ophthalmology & Optometry (AREA)
  • Biophysics (AREA)
  • Optics & Photonics (AREA)
  • User Interface Of Digital Computer (AREA)
  • Eye Examination Apparatus (AREA)
  • Ultra Sonic Daignosis Equipment (AREA)

Abstract

System and method for enabling user communication with eye- based feedback. A selection interface selectively presents a series of communication options to the user. A sensor detects light reflected from the eye of the user and provides a correlative signal, which is processed to determine a relative eye orientation that includes the relative orientation of the pupil with respect to the head orientation of the user. The eye may be illuminated by a light source or ambient lighting. The sensor may detect infrared light reflected from the eye illuminated by an infrared light source. The relative eye orientation may be determined by processing images captured by an image sensor. The relative eye orientation may be determined by detecting an instantaneous relative orientation of the pupil or by detecting a transient change of the pupil. Based on the determined relative eye orientation, a selected communication option is determined and implemented.

Description

SYSTEM AND METHOD FOR ENABLING COMMUNICATION
THROUGH EYE FEEDBACK
FIELD OF THE INVENTION
The present invention relates to eye tracking and to interfaces and user communication platforms.
BACKGROUND OF THE INVENTION
A degenerative disease is characterized by the deterioration of structure or function of body tissue. Degenerative diseases often lead to a loss of control over muscles and a decline of motor function that impair a person's movement capabilities. As a result, the individual may experience severe difficulties or even a complete inability to communicate verbally and/or via hand motions. Degenerative diseases do not necessarily diminish the person's mind and cognitive function at all or at the same rate as the decline in motor function. In this way, a person with full cognitive activity can lose his ability to interact physically and verbally with his environment. The person's mind can be considered to be essentially trapped or locked into his body.
Some examples of degenerative diseases include: Duchenne muscular dystrophy (DMD), Machado-Joseph disease (MJD), Multiple sclerosis (MS), Muscular dystrophy (MD), Parkinson's and Amyotrophic lateral sclerosis (ALS). Amyotrophic lateral sclerosis (ALS), also known as Lou Gehrig's disease and Charcot disease, is a disorder that involves the death of neurons and is characterized by stiff muscles, muscle twitching, and gradually worsening weakness due to muscle wasting. ALS typically causes difficulty speaking, swallowing, and breathing. ALS patients represent a subgroup of people with fully functional cognitive abilities that gradually lose their speech and the use of their upper limbs.
Aside from those suffering from degenerative diseases, other groups of people may also experience a loss of physical capabilities while retaining their mental capabilities (i.e., have difficulty verbally communicating while maintaining full cognitive capabilities). For example, those suffering from locked-in syndrome (LIS) or neurodegenerative diseases like Parkinson's or Huntington's, stroke and other accident victims (e.g., traffic accidents), patients connected to a respirator (mechanically ventilated patients) who cannot speak due to having a breathing tube inserted in their throat, those too weak or unable to move their limbs, those who are paralyzed, or others who have reached a state that even though their brains are functioning properly they cannot interact with their environment due to technical restrictions. Such individuals are thus severely limited in their ability to communicate with other people in their environment. This lack of communication can trigger feelings of detachment and isolation, as well as lead to situations where the patient is distressed and unable to effectively relay essential needs or obtain critical assistance (which in some cases may be life-threatening). One possible approach is to enable an alternative form of communication for such individuals based on characteristics relating to their eyes. For example, the direction or focus of the person's line of sight may be used to provide instructions or otherwise communicate information to their surroundings. There are tools for aiding such alternative non-verbal forms of communication, however they are typically relatively expensive, cumbersome and/or immobile (fixed in place). These tools often require the use of a dedicated computer, and are contingent on the stable posture of the person using the tool. These tools can track eye location relative to a display on a screen and translate eye movements to mimic the operation of a mouse cursor on the screen. Hence, the focus of the patient is translated to a geographical location of a pointer (or cursor) and the person can control a computer through his/her eyes similar to the manipulation of a computing mouse. For example, the pupil is identified as a point in space and allows the person to control the computer using their eyes similar to the way they would control a computing mouse. Mouse control is not always accurate and locked-in patients typically do not have complete freedom of movement of the head (or other body parts), for example due to attached medical equipment (e.g., a respirator). Such tools which rely on determining an exact eye gaze location of the user also typically necessitate relatively frequent recalibration, requiring active user participation. The calibration process usually requires an accurate fixation of the user in front of the screen in multiple positions, and is thus relatively burdensome and time-consuming. The calibration process can also often require third party assistance, which can be costly or inconvenient.
US 8, 120, 577 to Bouvin et al. entitled "Eye Tracker with Visual Feedback" discloses entry of control commands into a computer in response to eye-tracker detected movement sequences of a point of regard over a graphical display.
US 2013/0293488 to Na et al. entitled "Mobile Terminal and Control Method Thereof" discloses a mobile terminal including a camera configured to capture eye gestures of a user and a controller configured to display an eye tracking mode guide based on the captured eye gestures on a display and to select a particular function included in the eye tracking mode guide based on the captured second eye gesture.
US 8,593,375 to Maltz entitled "Eye Gaze User" discloses a software controlled user interface for an eye gaze controlled device, designed to accommodate angular accuracy versus time averaging tradeoffs for eye gaze direction sensors.
US 8,593,375 to Maltz entitled "Eye Gaze User" discloses a software controlled user interface for an eye gaze controlled device, designed to accommodate angular accuracy versus time averaging tradeoffs for eye gaze direction sensors. US 6,456,262 to Bell entitled "Microdisplay with Eye Gaze Detection" discloses a microdisplay in which a displayed image element may be selected by gazing upon it.
US 2006/0061544 to Min et al. entitled "Apparatus and Method for Inputting Keys Using Biological Signals in Head Mounted Display Information Terminal" discloses a Head Mounted Display where a user inputs a key selected according to the user's biological signals sensed through a biological signal sensing unit having an EOG (Electrooculogram) input unit and an EMG (Electromyogram) input unit for sensing and receiving the biological signals as key inputs.
An example of a system that is operated using eye tracking is the "EyeWriter™" (notimpossiblelabs.com or http://www.evewriter.org/). The EyeWriter™ was developed for the benefit of graffiti artists suffering from ALS.
SUMMARY OF THE INVENTION
In accordance with one aspect of the present invention, there is thus provided a system for enabling a user to communicate with eye- based feedback. The system includes a relative eye orientation sensor, a selection interface and a processor. The selection interface is configured to selectively present a series of communication options to the user. The relative eye orientation sensor is configured to detect light reflecting from the eye of the user and provide a correlative signal. The processor is communicatively coupled with the relative eye orientation sensor and with the selection interface. The processor is configured to receive and process the correlative signal to determine a relative eye orientation of the user, the relative eye orientation including the relative orientation of at least the pupil of the eye with respect to the head orientation of the user. The processor is further configured to determine a selected communication option based on the determined relative eye orientation, and to provide instructions to implement the selected communication option. The system may further include at least one light source for illuminating the eye of the user. The light source may be an infrared (IR) light source, and the relative eye orientation sensor may detect IR light reflected from the eye. The selection interface may include a visual interface, an audible interface, and/or a tactile interface. The light source and/or the relative orientation sensor may be coupled with a wearable head gear worn by the user. The selection interface may operate on a mobile computing device. The relative eye orientation sensor may include at least one image sensor configured to capture images of the eye of the user, where the processor determines the relative eye orientation by image processing of the captured images. The processor may determine the relative eye orientation by detecting an instantaneous relative orientation of the pupil, or by detecting a transient change of the pupil. The processor may be communicatively coupled wirelessly with the relative eye orientation sensor and/or the selection interface. The communication option may include: an audible alarm, a menu selection, a language selection, a confirmation message, a sentence, a phrase, a word, a syllable, a letter, a mobile computing device activation selection, a visual interface deactivation selection, instruction to send an email/SMS/MMS, and/or instructions to implement a computing application. The visual interface may display at least one communication option by: color- coordination, shape-coordination, and/or symbolic images.
In accordance with another aspect of the present invention, there is thus provided a method for enabling a user to communicate with eye-based feedback. The method includes the procedures of: selectively presenting a series of communication options to the user with a selection interface, and detecting light reflected from the eye of the user and providing a correlative signal with a relative eye orientation sensor. The method further includes the procedure of processing the correlative signal to determine a relative eye orientation of the user, the relative eye orientation including the relative orientation of at least the pupil of the eye with respect to the head orientation of the user. The method further includes the procedures of determining a selected communication option based on the determined relative eye orientation, and providing instructions to implement the selected communication option. The method may further include the procedure of illuminating the eye of the user with at least one light source. The eye may be illuminated with IR light, and the relative eye orientation sensor is configured to detect IR light reflected from the eye. A communication option may be presented by the selection interface visually, audibly, and/or in a tactile manner. The procedure of detecting light reflected from the eye of the user may include capturing images of the eye with at least one image sensor, where the relative eye orientation is determined by image processing of the captured images. The relative eye orientation may be determined by detected an instantaneous relative orientation of the pupil, or by detecting a transient change of the pupil. The communication option may include: an audible alarm, a menu selection, a language selection, a confirmation message, a sentence, a phrase, a word, a syllable, a letter, a mobile computing device activation selection, a visual interface deactivation selection, instruction to send an email/SMS/MMS, and/or instructions to implement a computing application. The method may further include the procedure of performing calibration by: comparing consecutive captured images to detect relative changes of the pupil in the eye; marking the pupil in the captured images to track an area of interest signifying the location of the pupil; setting a reference point as the center of a 2D grid and translating eye movements to coordinates on the grid, when determining that the pupil is at the center of the grid, selecting a reference point for building a reference point coordinate system; and determining the range of motion of the user.
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will be understood and appreciated more fully from the following detailed description taken in conjunction with the drawings in which:
Figure 1 is a schematic illustration of a system for enabling a user to communicate with eye-based feedback, constructed and operative in accordance with an embodiment of the present invention;
Figure 2 is an illustration of a settings menu and a words menu displayed on the display of Figure 1 ;
Figure 3A is an illustration of a letter group menu displayed on the display of Figure 1 ;
Figure 3B is an illustration of the letter group menu of Figure 3A where the image to be displayed on the display has been divided into different sections, and a selection is being made;
Figure 4 is an illustration of a letter sub-menu displayed on the display of Figure 1 ;
Figure 5A is an illustration of an image of the user's eye with the pupil in an upward quadrant captured by the camera of Figure 1 ;
Figure 5B is an illustration of an image of the user's eye with the pupil in a downward quadrant captured by the camera of Figure 1 ;
Figure 5C is an illustration of an image of the user's eye with the pupil in a left quadrant captured by the camera of Figure 1 ; Figure 5D is an illustration of an image of the user's eye with the pupil in a right quadrant captured by the camera of Figure 1 ;
Figure 5E is an illustration of an image of the user's eye without the pupil captured by the camera of Figure 1 ;
Figure 5F is an illustration of an image of the user's eye with the pupil in the center captured by the camera of Figure 1 ; and
Figure 6 is a flow diagram of a method for enabling a user to communicate with eye-based feedback, operative in accordance with an embodiment of the present invention.
DETAILED DESCRIPTION OF THE EMBODIMENTS
The present invention overcomes the disadvantages of the prior art by providing a system and method for allowing a user to communicate by means of eye-based feedback. In particular, the user can select a communication option presented to him/her via a visual/audio/tactile interface, by using very basic eye gestures in the form of manipulating the relative orientation of his/her eyes to indicate a general direction that he/she is looking. The system determines the relative eye orientation of the user, representing the orientation of the pupil relative to the eye socket or head orientation, by detecting reflections of infrared light (or other light) from the eye, such as by using an IR light source and an IR camera or other sensor. The present invention thus assists in improving the quality of life of individuals characterized by a difficulty or inability to communicate verbally and/or manually, such as sufferers of locked-in syndrome (LIS), such as ALS patients, while maintaining their cognitive awareness, by providing an elegant, accessible, relatively non-expensive, mobile, alternative form of communication. More generally, the present invention may be used by any individual, including persons who do not exhibit limitations in communicating otherwise. In addition, the system allows for communication and operation of devices while the user is otherwise engaged.
Reference is now made to Figure 1 , which is a schematic illustration of a system, generally referenced 100, for enabling a user to communicate with eye-based feedback, constructed and operative in accordance with an embodiment of the present invention. System 100 includes a primary display 102, an image processor 104, a light source 106, a camera 108, an audio unit 1 12, and a computing device 1 14 with an auxiliary display 124. Processor 104 is communicatively coupled with primary display 102, with camera 108, with audio unit 1 12, and with computing device 1 14. Computing device 1 14 may be communicatively coupled with processor 104 through a wired or wireless communication link 1 16 (e.g., a two-way Bluetooth connection).
Light source 106 is configured to emit light 1 18 to illuminate at least one eye of user 150. Light source 106 may be configured to emit non-visible light, such as light in the infrared (IR) spectrum. IR light is particularly desirable because it can be used effectively even in dark rooms with poor lighting conditions, and allows for effective illumination of the eye. Alternatively, other light wavelengths (non-IR light) may be used to illuminate the eye of user 150. Light source 106 typically emits eye-safe light (e.g., in the visible or infrared wavelengths). Eye-safe light can pass through the various optical elements of system 100 in the desired optical path (i.e., transmitted through or reflected from the optical elements in the desired directions), without interfering with the visible light path of the displayed image. As an example, light source 106 can be embodied by four IR LED lights that are arranged in an array (e.g., in a square pattern with each one of the LEDs in a respective corner of the square). Alternatively, light source 106 may be embodied by ambient light, such as solar radiation or indoor lighting.
Camera 108 is configured to capture images of at least one eye of user 150. The line of sight (LOS) of camera 108 is aligned toward the general direction that user 150 is facing. The field of view (FOV) of camera 108 is related to the possible range of head movements by user 150, while maintaining the eyes within the image frame. System 100 may include multiple cameras 108 with different FOVs, allowing for imaging of a wider overall FOV than would be possible with a single camera 108.
Camera 108 may be any type of device capable of acquiring an image representation of a real-world scene, including the acquisition of any form of electromagnetic radiation at any range of wavelengths (e.g., light in the visible or non-visible spectrum, ultraviolet, infrared, radar, microwave, RF, and the like). For example, camera 108 may be a complementary metal-oxide-semiconductor (CMOS) or charge-coupled device (CCD) camera operating in the visible to near infrared (NIR) spectrum. In general, camera 108 is operative in a spectral region that corresponds to or at least partially overlaps with, the operational wavelength range of light source 106. The main components of camera 108 are: a filter 1 1 1 , an image sensor 1 10, a lens (not shown), and electronic circuitry (not shown). Image sensor 1 10 may be configured to detect light at selected wavelengths only (i.e., a narrow-band sensor). Alternatively, filter 1 1 1 may be configured to block light outside the desired wavelengths from reaching image sensor 1 10 (e.g., using a bandpass filter).
Sensor 1 10 may be, for example, a photodiode, a CCD, or another type of light detection element. Camera 108 is generally operative to acquire at least one image frame, such as a sequence of consecutive image frames representing a video image, which may be converted into an electronic signal for subsequent processing and/or transmission. As an example, camera 108 may be embodied by a modified web camera, a digital camera and/or a cellular phone camera. Accordingly, the term "image" as used herein refers to any form of output from an aforementioned image sensor, including any optical or digital representation of a scene acquired at any spectral region, and encompasses both a single image frame and a sequence of image frames (i.e., a "video image").
While camera 108 is provided as an exemplary type of sensor used for determining the relative eye orientation of user 150, system 100 may alternatively include a different type of sensor instead of camera 108. For example, system 100 may alternatively include at least one discrete light sensor, arranged individually or in a suitable configuration (such as a sensor array) in relation to user 150. Accordingly, a determination of the relative eye orientation of user 150 may be made based on an analysis of the reflected light detected by such discrete sensors, rather than based on image analysis of camera images, as will be discussed further hereinbelow.
Primary display 102 and auxiliary display 124 are configured to display visual information that is viewable by user 150. The displayed information may include graphical control elements with selectable options, such as in the form of graphical menus or tables containing alphanumeric characters and/or symbols (as will be elaborated upon further hereinbelow). Primary display 102 may be any device or mechanism capable of presenting visual information using any suitable format or underlying technology, including but not limited to: a computer monitor, a television monitor, a tablet or mobile phone display, a plasma or LCD screen, a light emitting diode (LED) display, a three-dimensional representation display (e.g., a holographic display), and the like. Primary display 102 may be embodied by a head-mounted display (HMD) embedded within a wearable apparatus worn by user 150, such as a helmet, a headband, a visor, spectacles, goggles, and the like. System 100 may alternatively include an audible interface and/or a tactile interface, in addition to or instead of primary display 102 and auxiliary display 124, for presenting information in an audible and/or tactile format.
Processor 104 receives instructions and data from the various system components. Processor 104 is also configured to perform image processing and analysis on the image frames captured by camera 108, and to extract information from the image characteristics. Processor 104 may be situated at a remote location from the other components of system 100. For example, processor 104 may be part of a server, such as a remote computer or remote computing system or machine, which is accessible over a communications medium or network. Alternatively, processor 104 may be situated adjacent to user 150 and/or integrated within other components of system 100. Processor 104 may be embodied by a low cost processor with sufficient computing power (e.g., Odroid U3 or Raspberry pi2), and may include a text-to-speech engine and Bluetooth capabilities.
Audio unit 1 12 is configured to generate an audible output. The audible output may include different types of sounds of varying tones or intensities, ranging from simple beeps or alarm type audio notifications to more complex speech patterns. System 100 may generally include multiple audio units 1 12, which may be configured to produce multiple forms of audible output. Audio unit 1 12 may be embodied by at least one speaker, such as a pair of headphones.
Computing device 1 14 may be embodied by a mobile computing device, such as a smartphone, a tablet computer, and/or a digital music player, or a stationary computing device, such as a desktop computer.
System 100 may include or be used with a support frame (not shown) for supporting elements of system 100 in the vicinity of user 150. The support frame may be embodied by a spectacle or eye-glasses. The support frame may support light source 106 and camera 108 using a strong and flexible moveable arm. The flexible movable arm can be made of a material that is resilient and rigid enough to support camera 108 and yet flexible enough to be bent to adjust the imaging range, location and/or viewing angle of camera 108. In order to reduce weight and pressure on user 150, a light-weight material may be used. For example, camera 108 may be attached via a wire to a head gear worn by and directed toward the face of user 150, such that camera 108 images at least one eye of user 150. This alignment may be readjusted automatically or manually to maintain alignment of the eyes during shifting head movements of user 150 or to accommodate other physical requirements of user 150. As LIS patients may be affixed to ventilators or other medical equipment, the maneuverability of the support frame can allow for such patients to utilize system 100 conveniently, while minimizing the interference or constraints resulting if system 100 were to be completely fixed in place. The support frame may also be adjustable and/or adapted to meet the requirements of particular users 150, such as to be accessible by people with different types of respirators. An alternative technique for providing stability and support is by attaching elements of system 100 to an adjacent counter, a vehicle or a wheelchair.
The components and devices of system 100 may be based in hardware, software, or combinations thereof. It is appreciated that the functionality associated with each of the devices or components of system 100 may be distributed among multiple devices or components, which may reside at a single location or at multiple locations. For example, the functionality associated with processor 104 may be distributed between multiple processing units (such as a dedicated image processor for the image processing functions). System 100 may optionally include and/or be associated with additional components not shown in Figure 1 , for enabling the implementation of the disclosed subject matter. For example, system 100 may include a power supply (not shown) for providing power to the various components, and may further include a memory or storage unit (not shown) for temporary storage of image frames or other types of data.
The operation of system 100 will now be discussed, for exemplary purposes, in the context of a user 150 characterized by a difficulty or inability to verbally communicate, such as an LIS patient. User 150 views visual information presented on primary display 102 and/or auxiliary display 124, such as multiple command selections. Camera 108 captures images of the eyes of user 150 which are illuminated by light source 106, and processor 104 receives and processes the captured images to determine the relative eye orientation of user 150. The relative eye orientation represents a general direction that user 150 is looking (e.g., "up", "down", "left", "right", "straight ahead") relative to the direction that he/she is facing which generally corresponds to the orientation of the head. Accordingly, the relative eye orientation can be considered as the orientation of the pupil within the eye socket or the pupil orientation relative to the head orientation of the user. For example, processor 104 compares the captured image of the eye with at least one previous captured image, to determine the instantaneous relative orientation of the pupil (with respect to the eye socket or head orientation of user 150) and/or to detect transient changes in the pupil orientation. Alternatively, processor 104 processes the reflection signals from the eye of user 150 detected by discrete light sensors, such as discrete IR sensors, rather than images captured by camera 108. For example, processor 104 may compare the respective detections of multiple discrete sensors arranged at different positions around the eye of user (e.g., four sensors evenly distributed around the perimeter of the eye), and determine a relative eye orientation of user 150 based on characteristics of the sensor detections (e.g., which sensor(s) detected light reflected from the eye; the intensity and/or angle of arrival of the detected reflections), as well as the relative proximity of the pupil to the respective sensors.
Display 102 presents a plurality of options via an option selection table or menu. User 150 indicates an option to select by performing an eye gesture such that his/her relative eye orientation correlates to the relative direction of that option as presented on display 102. Accordingly, processor 104 determines the relative eye orientation of user 150, and determines the option selected by user 150 based on the determined relative eye orientation. For example, if the relative eye orientation is determined to be "right" (i.e., the pupil orientation is at the right side within the eye socket of user 150), then the option displayed at the right side of the menu is considered to be selected. It is noted that the user selection is determined based on only a relative eye orientation that corresponds to a very general (imprecise) direction that user 150 is looking (e.g., "right", "left", "up", "down", "straight ahead"), without requiring the determination of a precise gaze direction or line-of-sight of the user (i.e., without determining a particular point or spatial coordinates at which user 150 is looking at in relation to a particular frame of reference, such as determining a unique coordinate position on display 102).
Processor 104 provides instructions to display 102, audio unit 1 12, and/or computing device 1 14 to perform at least one action based on the determined selection of user 150. These actions can be, for example: to audibly output a word, letter, phrase, sentence or alarm warning siren; to display a different menu screen or different menu option(s); to display text; to generate and send a text or e-mail message; and/or to execute a particular application on computing device 1 14.
The selection process of system 100 will now be further described with reference to Figures 2 to 4. In the exemplary display screen 200S depicted in Figure 2, two separate menu portions are shown. The upper menu portion 200 is a "settings menu", which is arranged to allow user 150 to select or alter a setting parameter of system 100. The lower menu portion 202 is a "word menu", which is arranged to allow user 150 to select a specific word or series of words (a phrase) from a predetermined set of key words and phrases. One or more setting options 200A, 200B, 200C, 200D, 200E and one or more word options 202A, 202B, 202C 202D, 202E are displayed on display 102, 124 at a given instant. Selection of an option may prompt a sub-menu with additional options to be displayed.
In the illustrated example, option 200A represents a "language option" that leads to a sub-menu of different languages to choose from. Option 200B represents a "standby option" that switches system 100 into a standby (i.e., rest or low power) mode of operation until reactivated by user 150, such as by performing a specific eye gesture or indication. Option 200C represents a "help option" that leads to a sub-menu of different options designed to help user 150 (e.g., explanations of how to use certain aspects or elements of system 100). Option 200D represents an "alert option" that triggers audio unit 1 12 to emit a warning or alert signal upon selection. Option 200D is especially useful when user 150 is in distress and/or a life-threatening situation. The warning or alert siren may also be activated by user 150 performing a specific eye gesture or indication, such as by winking four times in rapid succession. Option 200E represents a "connect to mobile option" that establishes a communication link 1 16 between processor 104 and computing device 1 14, allowing user 150 to execute program applications on computing device 1 14.
Referring to the word menu portion 202, option 202A represents the word: "hello"; option 202B represents the word: "goodbye"; option 202C represents the word: "thanks"; and option 202D represents the word: "help". For example, selection of option 202D may cause audio unit 1 12 to emit a warning signal or alert siren, similar to settings option 200D. Option 202E represents the word: "application". For example, selection of option 202E may direct processor 104 to establish a communication link 1 16 with computing device 1 14 to allow the user to execute program applications on computing device 1 14, similar to setting option 200E.
In an exemplary embodiment, only a single option from either "settings" menu portion 200 or "words" menu portion 202 is displayed at a given time on display 102, 124, allowing user 150 to either select the currently displayed menu option or to continue viewing other available menu options, by implementing suitable eye movements or gestures. For example, individual options from each menu portion 200, 202 may be presented sequentially in a cyclic manner, in accordance with the relative eye orientation of user 150. In particular, user 150 may initially select "settings menu" 200 by looking in an upwards direction (U). Subsequently, by looking leftwards (L1 ) user 150 may cycle through the "settings options" presented on display 102, 124 in a first sequential order, whereas by looking rightwards (R1 ) user 150 can cycle through the "settings options" in a reverse sequential order. For example, display 102, 124 initially presents only settings options 200A and 202A. By looking up (U) and then toward the left (L1 ), user 150 can direct system 100 to display settings option 200E (e.g., in place of settings option 200A). Alternatively, by looking up (U) and then toward the right (R1 ), user 150 can direct system 100 to display option 200B (e.g., in place of option 200A). Similarly, by looking down (D) initially, user 150 can select "words menu" 202. Subsequently, user 150 may sequentially cycle through the "words options" on display, such as by looking left (L2) to cycle through the words options in a first sequential order or by looking right (R2) to cycle through the words options in a second sequential order. For example, by looking down (D) and then toward the left (L2), user 150 can direct system 100 to display word option 202E (e.g., in place of option 202A). Alternatively, by looking down (D) and then toward the right (R2), user 150 can direct system 100 to display option 202B (e.g., instead of option 202A). It is noted that alternative directional configurations are equally applicable. For example, user 150 may alternatively sequentially cycle through menu options by directing his eye gaze upwards and downwards rather than leftwards and rightwards, and display 102, 124 may present different menu portions 200, 202 side-by-side rather than in a top-and-bottom configuration.
User 150 can use a predetermined eye-based gesture or indication, such as closing their eye (winking/blinking), or gazing in a specified direction, such as at diagonal or slanted angle (e.g., a 45° angle) relative to a forward eye orientation, to select an option from one of the menus 200, 202. User 150 may select a suitable eye-based gesture to be used during an initialization process of system 100, so as to ensure that he/she can successfully implement the selection process (for example, some users may have difficulty blinking and may thus select an alternative gesture). The selected gesture may be a preselected default option so as to preclude the need for preliminary setup or initialization of system 100 prior to use. For example, user 150 can look down (D) and then toward the left (L2) and/or right (R2) to cycle through the different options of word menu 202 until he/she arrives at the desired option, e.g., option 202C, and then look down (D) and gesture (e.g., winking) to select option 202C (the word "thanks"). Selection of a word in the word menu 202 will direct audio unit 1 12 to provide an audible indication of the selected word. For example, selecting option 202C will cause those in the vicinity of user 150 to hear the word "thanks" audibly emitted from audio unit 1 12.
A temporary visual indication, such as highlighted or emboldened text or a colored frame, may be applied to the selected option, such as to indicate that an option has been initially or tentatively selected but not yet validated by user 150. A confirmation eye-related gesture or indication (such as blinking twice) may then be required in order to validate the user selection, before implementing the relevant functionality associated with the selected option. In addition, shapes or colors may be used to assist with differentiating among the various options in the presented menus. Alternatively, a tentative user selection may be indicated audibly, such as by directing partial audio output to user 150. For example, the menu option that is initially (tentatively) selected is broadcast only through a single channel of audio headphones being worn by user 150, providing a monophonic sound reproduction (e.g., by broadcasting only to the right ear / right channel of the headphones). When user 150 wants to make the word heard by his surrounding environment, the selected word is validated by an additional eye-related indication or gesture (e.g., blinking), which causes an appropriate sound (or sounds) associated with the selected option to be broadcast over audio unit 1 12, as well as optionally over the other audio channel of the headphones worn by user 150 (e.g., the left ear / left channel of the headphones). By selectively directing partial audible indications between different audio units 1 12 (e.g., between headphones listenable by the user only and speakers listenable by the entire surroundings), and between different audio channels of a single audio unit (i.e., mono broadcasting), user 150 can recognize and control the audible feedback broadcast by system 100.
Audio unit 1 12 may alternatively present the available selection options to user 150 in an audible format, instead of (or in addition to) the visual presentation by display 102, 124. For example, the list of settings options of settings menu 200 and/or word options of words menu 202 may be broadcast to user 150 through headphones, such that user 150 hears the available options, in conjunction with an audible indication of the different eye-related gestures associated with each option. For example, the user 150 may hear the following audio sequence to prompt a user selection: "left - hello, right - goodbye, up - yes, down - no, blink - repeat menu". Alternatively, user 150 may simply hear a single option, e.g., "language". The option(s) may be broadcast only to user 150. User 150 may move through different option menus or select an option by implementing designated eye-related gestures. For example, user 150 may cycle between different word options using glances to either side (e.g., right or left, up or down), and indicate a selection using a blinking or winking gesture. Following the user selection, the associated audio output may be publically broadcast to the surroundings.
As such, the selection interface of system 100 can be a visual interface (a visual menu presented on display 102, 124), an audible interface (an audible menu provided by processor 104 and audio unit 1 12) and/or a tactile interface (not shown). Thus, system 100 can present selection menus corresponding to settings menu 200 and words menu 202 (Fig. 2) even without displays 102, 124. It is appreciated that the elimination of a display may be particularly beneficial in the context of a medical institution, by allowing a recovering patient (e.g., after a surgery) to request a nurse or to communicate with family members, similar to a distress device that can be placed next to a patient's bed. Furthermore, the elimination of display 102, 124 serves to decrease the number of components of system 100, which may lower the overall cost and provide greater mobility and accessibility. Referring to the exemplary display screen 200S (Fig. 2), display 102, 124 may alternatively selectively display only a single menu portion 200, 202 at a given instant, while presenting a selectable option on each of the menus 200, 202 that would allow for displaying the other menu 200, 202 instead (or an alternative available menu that is not currently being displayed). For example, settings menu 200 may include a "switch to words menu" option (not shown), and correspondingly words menu 202 may include a "switch to settings menu" option (not shown), allowing for toggling between the two respective menus 200, 202. Such a display configuration may serve to simplify the selection process by limiting the eye movements of user 150 to only the left or right directions (and obviating the need for upward and downward eye movements). It will be further appreciated that the ability to sequentially cycle through the displayed menus 200, 202 by merely shifting his/her gaze toward a general direction (e.g., left or right) allows user 150 to access all of the available menu options relatively easily and quickly. For example, if a particular menu consists of ten (10) possible options, any one of the options can be reached within five (5) "cycles" of the menu (e.g., in the example of Figure 2 where words menu 202 includes five options, then option 202D can be reached starting from option 202A by cycling either twice to the left or three times to the right). The vocabulary provided in words menu 202 can be limited to a certain number of pre-determined words that were found to have high importance and high incidence to the target audience or user in question. The available word options of a given words menu 202 may also be preselected and/or edited by user 150, partially or entirely. The respective audio outputs associated with the words menu 202 vocabulary may be previously generated prior to the operation of system 100 (e.g., by pre-recording various audio files corresponding to spoken words and/or phrases), or may be generated in real-time (e.g., via text-to-speech or phonetic transcription speech synthesizers).
As mentioned above, a given menu option may represent a different sub-menu that can be displayed, such as words menu 202 including a menu option representing a letter group menu. Alternatively, selection of a connect to mobile option 200E, 202E on words menu 202 may prompt processor 104 to form a communication link 1 16 with computing device 1 14 which is executing an application that includes a different menu or sub-menu, e.g., a letter group menu.
Reference is now made to Figure 3A, which is an illustration of the exemplary display screen 300A on display 102, 124 presenting an exemplary letter option menu, generally referenced 300, that includes different letter group options 302, 304, 306, 308, 310 for selection. Letter group menu 300 may be used for composing a written text or message, or for spelling out a word to be communicated or added to words menu 202. In the illustrated example of Figure 3A, option 302 represents the group of letters: "ABCDE"; option 304 represents the letters: "FGHIJ"; option 306 represents the letters: "KLMNO"; option 308 represents the letters: "PQRST"; and option 310 represents the letters: "UVWXY". Letter group options 302, 304, 306, 308, 310 are presented in a cross configuration, with each letter group located on a separate region of display 102, 124 (i.e., top, bottom, left, right, or middle). Analogous to the selection of menu options previously described with reference to Figure 2, user 150 may select an available option of letter group menu 300 using only relative eye orientation and/or eye-related gestures. For example, while viewing letter group menu 300 on display 102, 124, user 150 may gaze upward (U) to select option 302; may gaze leftwards (L) to select option 304; may gaze rightwards (R) to select option 308; or may gaze downwards (D) to select option 310. User 150 may select the middle option 306 by performing a predetermined eye-related gesture or indication, such as closing one or both eyes (winking/blinking), or gazing in a diagonal or slanted angle relative to a forward relative eye orientation (e.g., a 45° angle). The options could also be communicated via audio unit 1 12 announcing the aforementioned letters.
Reference is made to Figure 3B, which is an illustration of the exemplary display screen 300B on display 102, 124 presenting the exemplary letter group menu 300 of Figure 3A divided into different sections, where a selection of the letter "G" is being made.
A selection of a letter group option (e.g., 302 or 304) of letter group menu 300 may then result in displaying another sub-menu representing the individual letters of the selected letter group (i.e., 400). For example, if user 150 gazes up to select letter group option 302 (letters ABCDE), display 102, 124 then displays the associated ABCDE letter submenu. Reference is now made to Figure 4, which is an illustration of the exemplary display screen 400A on display 102, 124 presenting the ABCDE letter sub-menu, generally referenced 400, that includes different individual letter options 402, 404, 406, 408, 410 for selection. Letter submenu 400 corresponds to the selected letter group option 302 (Figure 3A). In the illustrated example of menu 400, option 402 represents the letter "A"; option 404 represents the letter "B"; option 406 represents the letter "C"; option 408 represents the letter "D"; and option 410 represents the letter "E". Similar to letter group menu 300, individual letter options are also presented in a cross configuration, with each letter group located on a separate region of display 102, 124 (i.e., top, bottom, left, right, or middle). User 150 may select an available option of individual letter submenu 400 using relative eye orientation and/or eye-related gestures. For example, user 150 may gaze upwards (U) to select option 402; may gaze leftwards (L) to select option 404; may gaze rightwards (R) to select option 408; and may gaze downwards (D) to select option 410. User 150 may select the middle option 406 by performing a predetermined eye-related gesture or indication, such as closing one or both eyes (winking/blinking).
A selection of an individual letter of sub-menu 400 may then result in a visual indication of the selected letter displayed on display 102, 124 (e.g., by highlighting or presenting a visual border around the selected letter), which may require confirmation by user 150. Alternatively or additionally, an audio indication of the selected letter may be generated by audio unit 1 12. The selected letter may further be added to a text message or text-based instruction being composed by user 150. For example, looking up and selecting option 402 directs audio unit 1 12 to emit the sound of the letter "A", while providing a visual indication of the displayed letter "A" on display 102, and entering the letter "A" into the text of a message being composed by user 150. A completed word may be added and saved to words menu 202.
It is noted that letter group menu 300 or individual letter menu 400 may be alternatively configured to display any number of options (and not necessarily five options). For example, if user 150 experiences difficulties in implementing a suitable eye gesture to indicate the selection of the "middle" option, only four options may be displayed. In this manner, the visually presented menu can be configured to obtain the user selection based solely on a more standard selection of relative eye orientation (e.g., up, down, left or right), while precluding the need for any additional eye- related gestures. The middle region of a menu that includes only 4 options may be left empty, or may display the title of the menu (i.e., not an option that can be selected). Alternatively, system 100 may designate four diagonal directions (up-right, up-left, down-right, down-left) and an optional straight direction (e.g., straight down) to indicate a selection, such as for individuals who have difficulty controlling their blinking or experience involuntary blinking.
The different menu options may also be displayed with unique visual effects or characteristics, so as to provide additional visual differentiation to each displayed menu option, in addition to their different relative locations on display 102, 124. For example, each menu option can be displayed in a different color shape or style of border, so that the different options can be more easily recognized, identified and distinguished from one another by user 150. A menu may include a "return option" to enable user 150 to go back to a previous selected option, to a previously displayed menu or a preset determined action. Alternatively, system 100 may be configured such that user 150 can be automatically returned to a prior menu or a default menu following the final selection of a menu option. As mentioned above, an indication may be provided to the selected option to indicate that the option has been initially selected. The indication may be, for example, a highlighted or emboldened text, a colored frame, and/or an audible indication emitted in at least one ear of the headphones worn by user 150. System 100 may then wait for a designated confirmation gesture (such as blinking twice) to validate or confirm the selection of the selected option, thereby providing a visual and/or audio indication informing user 150 that an option has been selected and/or which option has been selected. An eye-related gesture for implementing a selection of an option by user 150 can include a sequence of individual actions or gestures. Such a sequence may also allow for the direct selection of an available option in a sub-menu associated with the presented menu while bypassing the presentation of the sub-menu. For example, user 150 can select an individual letter by performing two consecutive predetermined eye-related gestures while viewing "letter group menu" 300 of Figure 3A, without being displayed the "individual letter sub-menu" 400 of Figure 4. The first gesture would select a particular letter group (e.g., letter group 308), and then the second gesture, which may be based on the color or location of an individual letter within the letter group, selects a specific letter (e.g., the letter "S"), thereby bypassing the displaying of individual letter sub-menu 400. For example, the individual letters inside each displayed letter-group 302, 304, 306, 308, 310 may be color coded to correspond with a respective direction, which may be indicated by a color indicator displayed in the relevant direction. For example, the first letters of each letter-group: "A", "F", "K", "P", and "U", may all be depicted in one color (e.g., orange), and an orange color indicator may be displayed on display 102, 124 represented by an arrow facing a certain direction (e.g., up). Accordingly, by first looking left (L), user 150 selects option 304 (i.e., the entire letter group: "FGHIJ"), and by subsequently looking upward (UP) (i.e., towards the orange indicator direction), user 150 selects a specific letter (e.g., the letter "F") from among letter group 304. The colors used to display selected menu options can be specific colors that are easily discernible and highly conspicuous, to ensure their perception even by users 150 characterized by impaired vision or perception capabilities. It is noted that the graphical elements of menu items displayed by display 102, 124 is not limited to text or alphanumeric characters, but can also include, for example, various types of symbols, icons or images representing various actions or selectable options, such as: a "spacebar icon", a "backspace/delete icon", a "back or return to the main menu icon", and the like. For a visual output, user 150 can be provided with an option menu with a plurality of images, symbols or pictorial drawings, e.g., a plate of food, a glass of water, a car, a wheelchair, and the like. The selected drawing can be displayed to the user's surroundings on an external screen, such as auxiliary display 124, or sent in a message to another device (e.g., to a mobile phone of a friend or a nurse). Alternatively, selection of a drawing can result in an appropriate audio output, e.g., selection of the drawing of a plate of food would result in the audible output of the word "food".
System 100 can include word prediction applications to help forecast and suggest complete words based on the first few letters selected by a user. For example, if user 150 selects the following letter sequence: "H-E-L", system 100 can suggest the words: "help" or "hello", by presenting the suggested words on display 102, 124 or via audio unit 1 12. System 100 may even utilize entire word or sentence completion based on context determined by processor 104. User 150 can cycle through different suggested complete words on display, such as by looking towards the left or right. Selection of a suggested word by user 150 can be performed with a suitable eye-related gesture or indication. In this manner, the selection of a word by user 150 may be hastened and the accuracy improved (by precluding the need for user 150 to continue selecting the additional letter and avoiding subsequent inadvertent incorrect letter selections by the user). The completed words can be indicated audibly or visually, and stored or transferred as an audio file and/or text file, as applicable. The word prediction feature may be based on language modeling and vocabulary libraries, and may be adaptive such as by learning the language habits of a user to improve future suggested word completions associated with that user.
Rather than indicating a word by spelling out the individual letters, user 150 may locate the desired word directly on a displayed "word menu". Such a menu can include different options representing actual words or categories of words classified by topic or subject matter, (e.g., "food", "clothing", "colors", and the like). The selection of an initially displayed word option can result in the displaying of a sub-menu with logical word options that follow from the initially selected word. For example, a word menu may initially present five different categories. When a particular category is selected, a sub-menu of five new words within that category is then presented. Each of these new words may lead to further subcategories, and so forth (e.g., "clothing"-"shirt"-"dress shirt"). In this manner, a substantially large vocabulary of potential words is made available to user 150. For example, assuming each word menu screen presents a total of 5 options, user 150 can potentially access over three thousand (3000) different words using only five (5) eye-related gestures (i.e., when each option on the first four menu screens represents a subcategory of word options).
User 150 can save preferred frequent words or sentences, allowing direct access to such words/sentences in future operational sessions. System 100 can also adaptively learn preferences and habits of user 150, and amend or update the available words or options of different menu screens accordingly. By prioritizing preferred or frequent words or options previously selected by the user, the operation of system 100 may become more efficient and more intuitive. System 100 may further include an interface (not shown), such as via the internet, an input device coupled with processor, or an application of computing device 1 14, to allow a particular user (or an administrator), to customize a particular menu for that user. Such menu customization allows for individual users, or a particular group of users, to select from words that are more relevant or more likely to be frequently accessed by such users/groups. This customization also serves to accommodate a range of different users with special needs of various types. Reference is now made to Figures 5A to 5F. Figures 5A to 5F are schematic illustrations of exemplary images of an eye of user 150 captured by camera 108 of system 100 of Figure 1 . Each image (500A, 500B, 500C, 500D, 500E, 500F) is divided into four quadrants: upper quadrant 502 which corresponds to user 150 looking up; lower quadrant 504 which corresponds to user 150 looking down; left quadrant 506 which corresponds to user 150 looking right; and right quadrant 508 which corresponds to user 150 looking left. Processor 104 determines the coarse or general location of the eye pupil in the captured image. For example, processor 104 may determine in which region or quadrant the pupil is most likely located (i.e., in which the majority of the pupil is located), or if the pupil is in or near the center of the image. If no pupil is located, then processor 104 may determine that the eye is blinking or winking.
Based on the general location of the pupil and/or the change in direction of the pupil (e.g., movement towards the left, right, up or down) the processor 104 determines the corresponding location on display 102, 124 (up, down, left, right, or center) and hence the selected/desired option of a menu (200, 300, 400) being displayed. For example, in image 500A (Figure 5A), the pupil P is located in upper quadrant 502, which corresponds to option 402 of individual letters menu 400 (Figure 4). In respective images 500B, 500C, 500D (Figures 5B, 5C, 5D), the pupil P is located, respectively, in lower quadrant 504, left quadrant 506, and right quadrant 508, corresponding to respective letter options 410 Έ", 404 "B", and 408 "B" of individual letters menu 400 (pupil P being located in left quadrant 506 corresponds to the user looking toward the right, and pupil P being located in right quadrant 508 corresponds to the user looking toward the left (i.e., the captured image mirrors the user's gesture). If processor 104 fails to detect (at least a minimal portion of) pupil P in an image, or throughout a minimal sequence of successive images, as shown in image 500E (Figure 5E), then processor 104 may determine that user 150 is winking (or blinking), which may correspond to a selection of the centrally displayed option on the menu screen (e.g., option 406 of menu 400) or the selection or confirmation of an option (e.g., the letter 410 Έ"). If processor 104 detects pupil P to be in or near the center of an image, or throughout a minimal sequence of successive images, as shown in image 500F (Figure 5F), then processor 104 may determine that user 150 is looking forward (not toward any of the above-mentioned directions), which may correspond to a standby mode where system 100 waits until user 150 looks in a direction or performs an eye gesture. The actual number of divisions may vary and is not limited to quadrants.
Alternatively, instead of dividing the image 500 into quadrants, the image can be divided into additional or fewer sections during the image processing for identifying the pupil location (and relative eye orientation). For example, processor 104 may divide a captured eye image into only two sections (e.g., a left section and a right section, divided by a central vertical divider), to determine only left and right locations of the pupil. Alternatively, processor 104 may divide a captured eye image into upper and lower section (i.e., via a central horizontal boundary), to determine only up and down locations of the pupil. Further alternatively, the eye image can be divided into eight (8) sections (resembling a hashtag or number sign), to requiring more accurate determination of the angular positioning and/or movement of the pupil and corresponding relative eye orientation. Dividing the image into a large number of sections when processing to identify the pupil location may improve the determination of pupil location/relative eye orientation at various diagonal or slanted angles relative to a forward gaze of the user (e.g., at a 45° angle relative to a forward gaze direction of user 150).
More generally, processor 104 may divide an eye image into different sections, portions, or segments correlating to the particular sections or portions of display 102, 124 on which particular menu options are being presented. For example, letter group menu 300 and individual letter menu 400 present the respective menu options in a "cross" shaped configuration, and so processor 104 may divide each processed image in a similar cross-shaped manner, so as to facilitate determination of the pupil location representing which menu option is being viewed by user 150. Processor 104 may also track or locate other relevant eye parameters (e.g., the cornea), in addition to or instead of the eye pupil P, for determining the relative eye orientation of user 150. Processor 104 may also track the location of the pupil P over a sequence of captured images. For example, processor 108 may utilize a reference image (500REF), and then compare a subsequent image (or a sequence of subsequent images) with the reference image 500REF to determine if the pupil P location has shifted relative to its location in the reference image 500REF, as well as the measured amount of movement. Such processing may be based on a calculated image filter to detect and measure differences from the original reference image and the subsequent images. It is further noted that determining only the relative eye-orientation direction of user 150, as well as tracking the eye pupil location over a sequence of captured images, serves to reduce the need to stabilize or compensate for movements of camera 108 (e.g., camera motions relative to the user eye between images) or compensating for changes in background. Thus, a calibration process may not be required.
While calibration of system 100 is not required, several methods of calibration may be available, including: generic or factory calibration, automatic calibration, and manual calibration.
An exemplary general calibration process is as follows. The first step is comparing consecutive images to detect relative changes by performing screen motion detection on the geographic area of the eye (including the pupil region). According to heuristics derived from the shape of the image, it can be assumed that the areas that change in the image represent areas of interest. The second step is marking the pupil circle as a point in space in the captured images, to follow the area of interest that signifies the location of the pupil P. The third step is setting a reference point as the center of a two-dimensional (x-y) axis grid and translating the eye movements to (x,y) coordinates on the grid. Once the calibration process determines, with a high enough probability, that pupil P is in the middle (i.e., close to the center [0,0] of the grid), a reference point is selected according to which the reference point coordinate system (grid) is built. The (x,y) coordinates represent the movement of the eye pupil P in each of the four directions (up, down, left and right). The fourth step may determine the range of motion of the user. The calibration process may adaptively learn the particular range of motion associated with different users. Different users are characterized by different physical features, including different eye and facial features.
Furthermore, the position and orientation of the camera 108 relative to the face and eye of each user will likely vary during different operational sessions of system 100. As a result, the calibration process of system 100 may adaptively learn the range of movement of pupil P on the x-y axis (grid) for each user, and adjust the range set threshold (beyond which is sent a signal for a certain direction of movement of the pupil), as necessary. Non-detection of pupil P for a minimum duration (e.g., a few milliseconds) is considered to represent a wink or blink by the user. The amount of time can be adjusted so as to prevent inadvertent identification of a selection, such as one based on the natural blinking of a user. It is further noted that the automatic calibration process (without prompting the user) of system 100 obviates the need for external (third party) assistance for each calibration. Further, the system (algorithm) identifies the pupil P and understands the relative eye orientation. Pupil detection capability includes identifying and comparing the location and range of motion and reliable detection of the presence/non-presence of the pupil P in an image 500. To do this, multiple object detection algorithms and their combinations may be used to deal with clarity issues and relatively heavy computation requirements. A dedicated calibration algorithm allows the system to calibrate itself automatically. Alternatively, a non-automatic calibration algorithm can be used that does not use any object recognition tools (feature detection). That is, the algorithm does not identify the exact location of the eye (pupil P), but compares images captured during system operation to those captured during a manual calibration.
System 100 may operate a generic factory calibration, performing a single initialization process prior to the operational session of system 100. The calibration (or initialization) may be implemented automatically using different filters or predetermined photographs designed for each of the potential relative eye orientations (e.g., up, down, left, right and center) that need to be identified during regular usage throughout an operational session. Processor 104 may apply the filters to the images 500, and determines the likelihood that a current processed image 500 is comparable to reference images 500REF. Reference images 500REF corresponding to each base situation or state. The system may grade the captured images based on the reference images 500REF and choose the option with the highest grade.
System 100 may also operate with self-calibration or "automatic calibration", i.e., that does not require prompting the user to initiate, but rather is performed automatically during the course of an operational session without the conscious knowledge of the user. System 100 may determine when and how often to perform a calibration process automatically, in accordance with relevant criteria. Automatic calibration can be performed when a predetermined condition is met, such as when the system is able to identify the pupil in an image.
System 100 may also operate with user or manual calibration with user 150 by providing an audio or a visual instruction, e.g., "Please Look Right", before capturing a reference image 500REF. An audio indication, such as resembling the click of a camera shutter, may be provided after the image has been captured, to notify user 150 that the image has been captured. The reference image 500REF similarly may be analyzed to locate the pupil and determine relative borders of the eye of user 150 or a determined boundary area that the pupil will approach when moving from one position to another (i.e., from "looking left" to "looking right"). This boundary area can further be used to trigger relative eye- orientation responses by user 150 as the pupil is observed or measured to cross the boundaries, resulting in processor 104 providing an appropriate response accordingly.
The captured image (signal) can be fed to a noise filter (e.g., a Markov Signal Noise filter), to reduce noise and enhance the quality of the image processing. The output for each image 500 is determined as: right, left, up, down, closed eye, center, or no identification. The final output (in this case one of five different options) can be based on additional criteria, such as, timing rules. For example, if no pupil is identified for longer than a threshold duration, such as more than approximately 0.2 seconds, then the determination is 'blink' or closed eye, and this determination could be maintained until a following image frame in which the pupil is again located or identified (i.e., movement of the pupil is detected) or to be registered as a unique action as previously mentioned.
In accordance with the present invention, the user instructions are determined solely from his/her relative eye orientation corresponding to an imprecise general eye direction (e.g., "right", "left", "up", "down", "straight ahead"), rather than a precise gaze direction or line-of-sight. Such a determination generally involves relatively basic and rapid processing, as compared to the more extensive and time-consuming processing required to identify an exact line-of-sight of the user by determining precise coordinates in relation to a particular frame of reference. Accordingly, system 100 can be considered to operate in a manner resembling that of a joystick, which applies an imprecise general directional determination to provide an indication of the user selection, as opposed to a screen-dependent mouse tracking mechanism where pinpoint location is required.
Reference is now made to Figure 6, which is a flow diagram of a method for enabling a user to communicate with eye-based feedback. In an optional procedure 601 , calibration is performed. Calibration of the system components may be performed only once (e.g., before the initial operation of system 100), or may be performed at regular intervals (during the operation of system 100), as necessary. Calibration may be performed using only a single eye-reflection image captured by camera 108, or by comparing and analyzing a large number of such images. Furthermore, the calibration may be adapted uniquely for specific users (i.e., for the particular user currently using system 100), or alternatively a generic calibration applicable for a large number of different users may be implemented (e.g., during an early initialization stage of system 100).
In procedure 602, a series of communication options is selectively presented to a user with a selection interface. Referring to Figures 1 and 3, display 102 (and/or display 124) displays a menu 300 with a variety of visual menu options 302, 304, 306, 308 and 310 for the user to choose from. The communications options may alternatively be presented via an audio or a tactile interface. In optional procedure 604, at least one eye of the user is illuminated with a light source. Referring to Figure 1 , light source 106 emits IR light 1 18 toward the eyes of user 150.
In procedure 606, light reflected from the eye is detected and a correlative signal is provided by a relative eye orientation sensor. Referring to Figure 1 , image sensor 1 10 receives IR light 1 18 reflects from the eye of user 150 and relays a correlative signal to processor 104. The eye orientation sensor may alternatively be embodied by a non-image sensor, such as by one or more discrete sensors or a sensor array. Referring to Figure 5A, image sensor 1 10 generates image 500A of the region around the eye of user 150, where pupil P is located in an upper quadrant region of image 500A.
In procedure 608, the correlative signal is processed to determine a relative eye orientation that includes the relative orientation of the pupil with respect to the head orientation of the user. Referring to Figures 1 and 5A, processor 104 determines a relative eye orientation of user 150 based on the images captured by image sensor 1 10 (or by using other eye reflection data captured by a non-image sensor). The determined relative eye orientation represents the relative orientation of the pupil with respect to the head orientation of user 150 (where the head orientation represents the general direction that user 150 is facing). For example, referring to Figure 5A, processor 104 determines from image 500A that pupil P is located in an upper quadrant region 502, and therefore the relative eye orientation of user 150 is an "up" position. Processor 104 may determine the relative eye orientation of user 150 by detecting an instantaneous relative orientation of pupil P or by detecting a transient change of pupil P.
In procedure 610, a selected communication option is determined based on the determined relative eye orientation of the user. Referring to Figures 1 , 5A and 3A, processor 104 selects an option presented on display 102 (and/or by audio unit 1 12), in accordance with the determined relative eye orientation of user 150. For example, if processor 104 determined from image 500A that the relative eye orientation of user 150 is an "up" position, processor 104 then selects communication option 302 ("ABCDE") presented on menu 300.
In procedure 612, instructions to implement the selected communication option are provided. Referring to Figure 1 , processor 104 instructs audio unit 1 12 to provide an audible indication associated with the selected communication option. For example, audio unit 1 12 broadcasts a word selected by user 150, such as the word "hello" associated with word option 202A of words menu 202. Additionally, processor 104 may instruct display 102 to present a visual indication associated with the selected communication option. For example, display 102 highlights the selected menu option, such as the "hello" word option 202A. Alternatively, processor 104 may instruct computing device 1 14 to implement a selected application or perform an action associated with the selected communication option. For example, computing device 1 14 sends a message or e-mail, such as following the selection of the "email application" option 202E of words menu 202.
As mentioned above, the processor 104 may translate the eye movements and broadcast operations through a wireless link (e.g., Bluetooth) to a computing device 1 14 (e.g., a smartphone) present in the area, and able to operate different applications which may be used by user 150 to communicate with his/her environment. Computing device 1 14 can be used to send text messages, execute different applications and/or control a computer (using programs designed for control using selection tables or menus). After the user chooses to operate an application on computing device 1 14, processor 104 may continue to transmit and receive data to/from computing device 1 14 over communication link 1 16.
In accordance with an embodiment of the present invention, system 100 can be used to exert control on an external device or apparatus, such as, a motorized wheelchair or computerized home appliances. For example, a user may direct or navigate a motorized wheelchair by indicating a selected direction (e.g., forward, back, left, right) using his/her relative eye orientation (e.g., up, down, left, right). The user may provide the indication by looking toward a particular menu option of a "motorized wheelchair navigation" menu presented on display 102, 124. System 100 determines the indicated selection (e.g., turn wheelchair left), and triggers the motorized wheelchair to perform the selected option. System 100 may be communicatively coupled with such external devices via a wireless communication link (e.g., communication link 1 16).
It is further noted that user 150 may alternatively by an individual who is vision impaired or who lacks full eye sight capabilities, as system 100 provides eye-based communication by determining only the relative eye orientation of user 150, which does not require the full use of vision or even a satisfactory level of eye sight to be attributed to user 150.
The present invention thus provides a relatively affordable and accessible solution with easy and intuitive operation that helps improve the quality of life of individuals with special needs, by providing means for effective communication with their environment. The system is mobile, portable and suitable for use in many situations and positions: lying, sitting, head hanging down, and the like. Furthermore, the system of the present invention does not require third party assistance for calibration.
It will be appreciated that the system and method of the present invention is not limited to individuals characterized by locked-in syndrome (LIS), but is equally applicable for use by other types of people as well, including those who are vision impaired. The system and method of the present invention may be integrated with an application on a computing device, such as a smartphone, tablet computer, or a digital music player, which includes at least a sensor for detecting light reflections from the eye (such as an embedded camera). The present invention may thus facilitate and initiate various communication actions through the computing device, such as composing and sending a message or initiating a phone conversation with a third party, based solely on instructions provided by the user by means of eye gestures correlating to his/her relative eye orientation.
While certain embodiments of the disclosed subject matter have been described, so as to enable one of skill in the art to practice the present invention, the preceding description is intended to be exemplary only. It should not be used to limit the scope of the disclosed subject matter, which should be determined by reference to the following claims.

Claims

1 . A system for enabling a user to communicate with eye-based feedback, the system comprising:
a selection interface, configured to selectively present a series of communication options to said user;
at least one relative eye orientation sensor, configured to detect light reflected from the eye of said user and to provide a correlative signal;
a processor, communicatively coupled with said relative eye orientation sensor and with said selection interface, said processor configured to receive and process said correlative signal to determine a relative eye orientation of said user, said relative eye orientation comprising the relative orientation of at least the pupil of said eye with respect to the head orientation of said user, and further configured to determine a selected communication option based on the determined relative eye orientation, and to provide instructions to implement said selected communication option.
2. The system of claim 1 , further comprising at least one light source, configured to illuminate the eye of said user.
3. The system of claim 2, wherein said light source comprises a light emitting diode (LED).
4. The system of claim 2, wherein said light source comprises an infrared (IR) light source, and wherein said relative eye orientation sensor is configured to detect IR light reflected from said eye.
5. The system of claim 1 , wherein said selection interface is selected from the list consisting of:
a visual interface;
an audible interface; and
a tactile interface.
6. The system of claim 1 , wherein at least one of: said light source; and said relative eye orientation sensor, is coupled with a wearable head gear worn by said user.
7. The system of claim 1 , wherein said selection interface operates on a mobile computing device.
8. The system of claim 1 , wherein said relative eye orientation sensor comprises at least one image sensor configured to capture images of the eye of said user, wherein said processor is configured to determine said relative eye orientation by image processing of the captured images.
9. The system of claim 1 , wherein said processor is configured to determine said relative eye orientation by detecting an instantaneous relative orientation of said pupil.
10. The system of claim 1 , wherein said processor is configured to determine said relative eye orientation by detecting a transient change of said pupil.
1 1 . The system of claim 1 , wherein said processor is communicatively coupled wirelessly with at least one of: said relative eye orientation sensor; and said selection interface.
12. The system of claim 1 , wherein at least one of said communication options is selected from the list consisting of:
an audible alarm;
a menu selection;
a language selection;
a confirmation message;
a sentence;
a phrase;
a word;
a syllable; a letter;
a mobile computing device activation selection;
a visual interface deactivation selection;
instructions to send an email/SMS/MMS; and
instructions to implement a computing application.
13. The system of claim 5, wherein said visual interface is configured to display at least one communication option in a manner selected from the list consisting of:
color-coordination;
shape-coordination; and
symbolic images.
14. A method for enabling a user to communicate with eye-based feedback, the method comprising the procedures of:
selectively presenting a series of communication options to said user with a selection interface;
detecting light reflected from the eye of said user and providing a correlative signal with at least one relative eye orientation sensor; processing the correlative signal to determine a relative eye orientation of said user, said relative eye orientation comprising the relative orientation of at least the pupil of said eye with respect to the head orientation of said user; determining a selected communication option based on the determined relative eye orientation; and
providing instructions to implement said selected communication option.
15. The method of claim 14, further comprising the procedure of illuminating the eye of said user with at least one light source.
16. The method of claim 15, wherein the eye of said user is illuminated with IR light, and wherein said relative eye orientation sensor is configured to detect IR light reflected from said eye.
17. The method of claim 14, wherein at least one of said communication options is presented by selection interface in a manner selected from the list consisting of:
visually;
audibly; and
tactile.
18. The method of claim 14, wherein said procedure of detecting light reflected from the eye of said user comprises capturing images of the eye with at least one image sensor, wherein said relative eye orientation is determined by image processing of the captured images.
19. The method of claim 14, wherein said relative eye orientation is determined by detecting an instantaneous relative orientation of said pupil.
20. The method of claim 14, wherein said relative eye orientation is determined by detecting a transient change of said pupil.
21 . The method of claim 14, wherein at least one of said communication options is selected from the list consisting of:
an audible alarm;
a menu selection;
a language selection;
a confirmation message;
a sentence;
a phrase;
a word;
a syllable;
a letter;
an mobile computing device activation selection;
a visual interface deactivation selection; instructions to send an email;
instructions to send an SMS/MMS; and
instructions to implement a computing application.
The method of claim 18, further comprises the procedure of performing calibration by:
comparing consecutive captured images to detect relative changes of said pupil in said eye;
marking said pupil in said captured images to track an area of interest signifying the location of said pupil;
setting a reference point as the center of a 2D grid and translating eye movements to coordinates on said grid;
when determining that said pupil is at the center of said grid, selecting a reference point for building a reference point coordinate system; and
determining the range of motion of said user.
PCT/IL2016/050235 2015-03-10 2016-03-02 System and method for enabling communication through eye feedback WO2016142933A1 (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
CN201680026221.3A CN107850939A (en) 2015-03-10 2016-03-02 For feeding back the system and method for realizing communication by eyes
EP16761197.9A EP3268847A4 (en) 2015-03-10 2016-03-02 System and method for enabling communication through eye feedback
IL303037A IL303037A (en) 2015-03-10 2016-03-02 System and method for enabling communication through eye feedback
US15/557,031 US11883101B2 (en) 2015-03-10 2016-03-02 System and method for enabling communication through eye feedback
EP20212667.8A EP3809241B1 (en) 2015-03-10 2016-03-02 System and method for enabling communication through eye feedback
JP2017566228A JP7016263B2 (en) 2015-03-10 2016-03-02 Systems and methods that enable communication through eye feedback
IL254315A IL254315A0 (en) 2015-03-10 2017-09-04 System and method for enabling communication through eye feedback

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201562130637P 2015-03-10 2015-03-10
US62/130,637 2015-03-10

Publications (1)

Publication Number Publication Date
WO2016142933A1 true WO2016142933A1 (en) 2016-09-15

Family

ID=56879968

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IL2016/050235 WO2016142933A1 (en) 2015-03-10 2016-03-02 System and method for enabling communication through eye feedback

Country Status (6)

Country Link
US (1) US11883101B2 (en)
EP (2) EP3268847A4 (en)
JP (2) JP7016263B2 (en)
CN (1) CN107850939A (en)
IL (2) IL303037A (en)
WO (1) WO2016142933A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108245748A (en) * 2017-12-21 2018-07-06 北京怡和嘉业医疗科技股份有限公司 The method that mask, respirator system and the respirator system perform
WO2018063134A3 (en) * 2016-09-28 2018-07-12 Donmez Mehmet Electronic and motorized wheelchair system that can be controlled with eye movements
WO2019111257A1 (en) 2017-12-07 2019-06-13 Eyefree Assisting Communication Ltd. Communication methods and systems
EP3506055A1 (en) * 2017-12-28 2019-07-03 Vestel Elektronik Sanayi ve Ticaret A.S. Method for eye-tracking calibration with splash screen
US10860098B1 (en) 2019-12-30 2020-12-08 Hulu, LLC Gesture-based eye tracking
SE1950971A1 (en) * 2019-08-23 2021-02-24 Your Speech Factory Ab Electronic device and method for conducting a users gaze direction
WO2023002488A1 (en) 2021-07-22 2023-01-26 Eyefree Assisting Communication Ltd. A system for communication with a patient based on the determined sedation state
EP4043999A4 (en) * 2019-10-10 2023-11-08 Medithinq Co., Ltd. Eye tracking system for smart glasses and method therefor
US12097030B2 (en) 2019-08-07 2024-09-24 Eyefree Assisting Communication, Ltd. System and method for patient monitoring

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11402909B2 (en) * 2017-04-26 2022-08-02 Cognixion Brain computer interface for augmented reality
JP6460425B2 (en) * 2017-06-02 2019-01-30 本田技研工業株式会社 Vehicle control system, vehicle control method, and program
CN109002186B (en) * 2018-06-28 2020-12-25 北京金山安全软件有限公司 Input prediction method and device
US11393318B2 (en) * 2018-10-05 2022-07-19 Cleveland State University Systems and methods for privacy-aware motion tracking with automatic authentication
EP3891585A1 (en) * 2019-02-01 2021-10-13 Apple Inc. Biofeedback method of modulating digital content to invoke greater pupil radius response
US10809637B1 (en) * 2019-05-30 2020-10-20 Applied Materials, Inc. Learning based digital corrections to compensate variations on lithography systems with multiple imaging units
CN110619303A (en) * 2019-09-16 2019-12-27 Oppo广东移动通信有限公司 Method, device and terminal for tracking point of regard and computer readable storage medium
US20220104959A1 (en) * 2020-10-07 2022-04-07 Jay Curtis Beavers Systems, methods, and techniques for eye gaze control of seat and bed positioning
JP7507482B2 (en) 2020-10-29 2024-06-28 パナソニックオートモーティブシステムズ株式会社 Eye-gaze input device
CN113589958A (en) * 2021-08-05 2021-11-02 西交利物浦大学 Text input method, device, equipment and storage medium
WO2023076573A1 (en) * 2021-10-28 2023-05-04 Lifedrive Llc Drive manager for power wheelchair and related methods
JP7568301B2 (en) * 2022-03-01 2024-10-16 日新電機工作株式会社 Information processing system, information processing method, and information processing program
US12032156B2 (en) * 2022-04-08 2024-07-09 Mirza Faizan Apparatus to enable differently abled users to communicate and a method thereof
US11995231B2 (en) * 2022-07-21 2024-05-28 Dell Products Lp System and method for headset haptic or audio confirmation feedback for gaze activation
US20240111361A1 (en) * 2022-09-27 2024-04-04 Tobii Dynavox Ab Method, System, and Computer Program Product for Drawing and Fine-Tuned Motor Controls
CN116421135A (en) * 2023-04-27 2023-07-14 北京京东拓先科技有限公司 Vision testing method, device, electronic equipment and storage medium
CN116708974B (en) * 2023-08-01 2023-10-17 清华大学 Universal camera interference method and system for head-mounted active vision camera

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100125816A1 (en) * 2008-11-20 2010-05-20 Bezos Jeffrey P Movement recognition as input mechanism
US20130307771A1 (en) * 2012-05-18 2013-11-21 Microsoft Corporation Interaction and management of devices using gaze detection
US20140300535A1 (en) * 2013-04-03 2014-10-09 Samsung Electronics Co., Ltd. Method and electronic device for improving performance of non-contact type recognition function
US20140375541A1 (en) * 2013-06-25 2014-12-25 David Nister Eye tracking via depth camera

Family Cites Families (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8120A (en) 1851-05-27 Cabkiaorb
US577A (en) 1838-01-20 Uvtachine for threshing grain and shelling corn
US4836670A (en) * 1987-08-19 1989-06-06 Center For Innovative Technology Eye movement detector
US4973149A (en) * 1987-08-19 1990-11-27 Center For Innovative Technology Eye movement detector
US4950069A (en) * 1988-11-04 1990-08-21 University Of Virginia Eye movement detector with improved calibration and speed
JPH0749744A (en) * 1993-08-04 1995-02-21 Pioneer Electron Corp Head mounting type display input device
US5912721A (en) * 1996-03-13 1999-06-15 Kabushiki Kaisha Toshiba Gaze detection apparatus and its method as well as information display apparatus
US6351273B1 (en) * 1997-04-30 2002-02-26 Jerome H. Lemelson System and methods for controlling automatic scrolling of information on a display or screen
JP2000137789A (en) 1998-10-29 2000-05-16 Sharp Corp Image processor
US6456262B1 (en) * 2000-05-09 2002-09-24 Intel Corporation Microdisplay with eye gaze detection
US6943754B2 (en) * 2002-09-27 2005-09-13 The Boeing Company Gaze tracking system, eye-tracking assembly and an associated method of calibration
US20050045373A1 (en) 2003-05-27 2005-03-03 Joseph Born Portable media device with audio prompt menu
JP3673834B2 (en) 2003-08-18 2005-07-20 国立大学法人山口大学 Gaze input communication method using eye movement
EP1755441B1 (en) 2004-04-01 2015-11-04 Eyefluence, Inc. Biosensors, communicators, and controllers monitoring eye movement and methods for using them
KR100594117B1 (en) 2004-09-20 2006-06-28 삼성전자주식회사 Apparatus and method for inputting key using biosignal in HMD information terminal
JP4654434B2 (en) 2004-11-24 2011-03-23 国立大学法人佐賀大学 Gaze direction identification system
EP1943583B1 (en) * 2005-10-28 2019-04-10 Tobii AB Eye tracker with visual feedback
ES2605367T3 (en) * 2006-01-26 2017-03-14 Nokia Technologies Oy Eye tracking device
WO2007113975A1 (en) * 2006-03-31 2007-10-11 National University Corporation Shizuoka University View point detecting device
JP4685708B2 (en) 2006-05-22 2011-05-18 富士通株式会社 Mobile terminal device
RU2410026C2 (en) * 2006-11-15 2011-01-27 Панасоник Корпорэйшн Setup system for method of brain wave identification, setup procedure and computer program
US20100149073A1 (en) * 2008-11-02 2010-06-17 David Chaum Near to Eye Display System and Appliance
CN101893934A (en) 2010-06-25 2010-11-24 宇龙计算机通信科技(深圳)有限公司 Method and device for intelligently adjusting screen display
US8593375B2 (en) 2010-07-23 2013-11-26 Gregory A Maltz Eye gaze user interface and method
JP2012048358A (en) 2010-08-25 2012-03-08 Sony Corp Browsing device, information processing method and program
WO2012083415A1 (en) * 2010-11-15 2012-06-28 Tandemlaunch Technologies Inc. System and method for interacting with and analyzing media on a display using eye gaze tracking
US8888287B2 (en) * 2010-12-13 2014-11-18 Microsoft Corporation Human-computer interface system having a 3D gaze tracker
AU2011204946C1 (en) 2011-07-22 2012-07-26 Microsoft Technology Licensing, Llc Automatic text scrolling on a head-mounted display
KR101355895B1 (en) * 2011-10-20 2014-02-05 경북대학교 산학협력단 Apparatus for deciding user's intention and method therof
US8955973B2 (en) * 2012-01-06 2015-02-17 Google Inc. Method and system for input detection using structured light projection
US9171198B1 (en) * 2012-04-02 2015-10-27 Google Inc. Image capture technique
KR101850035B1 (en) * 2012-05-02 2018-04-20 엘지전자 주식회사 Mobile terminal and control method thereof
US9261961B2 (en) * 2012-06-07 2016-02-16 Nook Digital, Llc Accessibility aids for users of electronic devices
US9658746B2 (en) * 2012-07-20 2017-05-23 Nook Digital, Llc Accessible reading mode techniques for electronic devices
KR102181897B1 (en) * 2013-05-09 2020-11-23 에스케이플래닛 주식회사 Device and method for mobile tooltip using eye detecting
US20140368442A1 (en) * 2013-06-13 2014-12-18 Nokia Corporation Apparatus and associated methods for touch user input
WO2017216118A1 (en) * 2016-06-13 2017-12-21 SensoMotoric Instruments Gesellschaft für innovative Sensorik mbH Method and eye tracking system for performing a calibration procedure for calibrating an eye tracking device
US9779478B1 (en) * 2016-10-04 2017-10-03 Oculus Vr, Llc Rendering composite content on a head-mounted display including a high resolution inset

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100125816A1 (en) * 2008-11-20 2010-05-20 Bezos Jeffrey P Movement recognition as input mechanism
US20130307771A1 (en) * 2012-05-18 2013-11-21 Microsoft Corporation Interaction and management of devices using gaze detection
US20140300535A1 (en) * 2013-04-03 2014-10-09 Samsung Electronics Co., Ltd. Method and electronic device for improving performance of non-contact type recognition function
US20140375541A1 (en) * 2013-06-25 2014-12-25 David Nister Eye tracking via depth camera

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3268847A4 *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018063134A3 (en) * 2016-09-28 2018-07-12 Donmez Mehmet Electronic and motorized wheelchair system that can be controlled with eye movements
US11612342B2 (en) 2017-12-07 2023-03-28 Eyefree Assisting Communication Ltd. Eye-tracking communication methods and systems
WO2019111257A1 (en) 2017-12-07 2019-06-13 Eyefree Assisting Communication Ltd. Communication methods and systems
JP2021506052A (en) * 2017-12-07 2021-02-18 アイフリー アシスティング コミュニケ−ション リミテッドEyeFree Assisting Communication Ltd. Communication methods and systems
EP3721320B1 (en) * 2017-12-07 2022-02-23 Eyefree Assisting Communication Ltd. Communication methods and systems
CN108245748A (en) * 2017-12-21 2018-07-06 北京怡和嘉业医疗科技股份有限公司 The method that mask, respirator system and the respirator system perform
US11833298B2 (en) 2017-12-21 2023-12-05 Bmc Medical Co., Ltd Face mask, ventilation therapy device system and method executed by ventilation therapy device system
EP3506055A1 (en) * 2017-12-28 2019-07-03 Vestel Elektronik Sanayi ve Ticaret A.S. Method for eye-tracking calibration with splash screen
US12097030B2 (en) 2019-08-07 2024-09-24 Eyefree Assisting Communication, Ltd. System and method for patient monitoring
SE1950971A1 (en) * 2019-08-23 2021-02-24 Your Speech Factory Ab Electronic device and method for conducting a users gaze direction
WO2021040602A1 (en) * 2019-08-23 2021-03-04 Your Speech Factory Ab Electronic device and method for eye-contact training
US12039879B2 (en) 2019-08-23 2024-07-16 Your Speech Factory Ab Electronic device and method for eye-contact training
EP4043999A4 (en) * 2019-10-10 2023-11-08 Medithinq Co., Ltd. Eye tracking system for smart glasses and method therefor
US10860098B1 (en) 2019-12-30 2020-12-08 Hulu, LLC Gesture-based eye tracking
WO2023002488A1 (en) 2021-07-22 2023-01-26 Eyefree Assisting Communication Ltd. A system for communication with a patient based on the determined sedation state

Also Published As

Publication number Publication date
US11883101B2 (en) 2024-01-30
IL254315A0 (en) 2017-11-30
IL303037A (en) 2023-07-01
JP2018519601A (en) 2018-07-19
EP3809241A1 (en) 2021-04-21
JP2021073589A (en) 2021-05-13
EP3809241B1 (en) 2023-12-13
JP7153096B2 (en) 2022-10-13
EP3268847A1 (en) 2018-01-17
JP7016263B2 (en) 2022-02-04
CN107850939A (en) 2018-03-27
US20200022577A1 (en) 2020-01-23
EP3268847A4 (en) 2018-10-24
EP3809241C0 (en) 2023-12-13

Similar Documents

Publication Publication Date Title
EP3809241B1 (en) System and method for enabling communication through eye feedback
US11989930B2 (en) UI for head mounted display system
US10747315B2 (en) Communication and control system and method
US20200333913A1 (en) Interactive computer display and control systems for the home
CN106471419B (en) Management information is shown
JP2000250699A (en) Visual line input device
US20240289007A1 (en) Devices, methods, and graphical user interfaces for adjusting device settings
US20240104859A1 (en) User interfaces for managing live communication sessions
US20240103616A1 (en) User interfaces for gaze tracking enrollment
US20240118746A1 (en) User interfaces for gaze tracking enrollment
US20240203066A1 (en) Methods for improving user environmental awareness
US20240103678A1 (en) Devices, methods, and graphical user interfaces for interacting with extended reality experiences
JP2018124457A (en) Face image processing device
CA3117533A1 (en) Ui for head mounted display system
US20240103679A1 (en) Devices, Methods, and Graphical User Interfaces for Interacting with Three-Dimensional Environments
US20240152244A1 (en) Devices, Methods, and Graphical User Interfaces for Interacting with Three-Dimensional Environments
JP7367041B2 (en) UI for head-mounted display systems
US20240265796A1 (en) Devices, methods, and graphical user interfaces for device position adjustment
US20240103608A1 (en) Devices, Methods, and Graphical User Interfaces for Providing Computer-Generated Experiences
US20240153205A1 (en) Devices, Methods, and Graphical User Interfaces for Providing Computer-Generated Experiences
US20240233288A1 (en) Methods for controlling and interacting with a three-dimensional environment
WO2024064230A1 (en) Devices, methods, and graphical user interfaces for interacting with three-dimensional environments
WO2024064372A1 (en) Devices, methods, and graphical user interfaces for providing computer-generated experiences

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16761197

Country of ref document: EP

Kind code of ref document: A1

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 254315

Country of ref document: IL

ENP Entry into the national phase

Ref document number: 2017566228

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2016761197

Country of ref document: EP