US20140372944A1 - User focus controlled directional user input - Google Patents
User focus controlled directional user input Download PDFInfo
- Publication number
- US20140372944A1 US20140372944A1 US13/916,193 US201313916193A US2014372944A1 US 20140372944 A1 US20140372944 A1 US 20140372944A1 US 201313916193 A US201313916193 A US 201313916193A US 2014372944 A1 US2014372944 A1 US 2014372944A1
- Authority
- US
- United States
- Prior art keywords
- user
- user interface
- see
- spoke
- menu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/04815—Interaction with a metaphor-based environment or interaction object displayed as three-dimensional [3D], e.g. changing the user viewpoint with respect to the environment or object
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
- G06F3/012—Head tracking input arrangements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
- G06F3/013—Eye tracking input arrangements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/0482—Interaction with lists of selectable items, e.g. menus
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
Definitions
- Augmented reality relates to providing an augmented real-world environment where the perception of a real-world environment (or data representing a real-world environment) is augmented or modified with computer-generated virtual data.
- data representing a real-world environment may be captured in real-time using sensory input devices such as a camera or microphone and augmented with computer-generated virtual data including virtual images and virtual sounds.
- the virtual data may also include information related to the real-world environment such as a text description associated with a real-world object in the real-world environment.
- An AR implementation may be used to enhance numerous applications including video game, mapping, navigation, and mobile device applications.
- a near-eye display such as a head mounted display (HMD) may be worn by a user to view the mixed imagery of virtual and real objects.
- a near-eye display uses a combination of optics and stereopsis to focus virtual imagery in the user's field of view.
- Accessing system menus in an augmented-reality environment in a hands-free way is non-trivial.
- Voice has the disadvantage of feeling unnatural, creating awkward social situations where people are speaking commands in public, and with current technology does not always correctly interpret the command on the first try.
- Mouse and glove-type controllers have the disadvantage of requiring extra hardware and cannot be easily used while one's hands are occupied (e.g. carrying things, eating. etc.).
- a system and method are provided that provide a user interface in a see-through, near-eye display, such as an HMD.
- the user interface is controlled by head orientation, in one embodiment. Eye gaze could also be used.
- a method comprising the following is provided.
- a user interface symbol is presented in a see-through, near-eye display. Focus of a user that is wearing the see-through, near-eye display is determined.
- a handle is presented in the see-through, near-eye display that extends away from the user interface symbol in response to determining that the user is focused on the user interface symbol.
- a determination is made that the user's focus has shifted along the handle away from the user interface symbol.
- a user interface element is activated in response to determining that the user's focus has shifted away from the user interface symbol along the handle to meet a selection criterion.
- One embodiment includes a display system comprising a see-through, near-eye display device and logic in communication with the display device.
- the logic is configured to present a user interface symbol in the see-through, near-eye display device.
- the logic is configured to track eye gaze or head orientation of a user wearing the see-through, near-eye display device.
- the logic is configured to display a handle in the see-through, near-eye display device that extends away from the user interface symbol in response to determining that either the eye gaze or the head orientation indicates that the user is looking at the user interface symbol.
- the logic is configured to activate a user interface element in response to determining that the eye gaze or head orientation indicates that the user is looking away from the user interface symbol along the handle to meet a selection criterion.
- One embodiment includes a method of providing a user interface in a see-through, near-eye display device.
- the method comprising the following.
- a user interface symbol is presented in the see-through, near-eye display device.
- Head orientation of a user wearing the see-through, near-eye display device is tracked.
- a handle that extends away from the user interface symbol is presented in the see-through, near-eye display device in response to determining, based on the head orientation, that the user is looking in the general direction of the user interface symbol.
- a spoke menu is spawned in the see-through, near-eye display device in response to determining that the user's head orientation indicates that the user has shifted from looking at the user interface symbol along the handle.
- the spoke menu includes menu items around a central hub, with each of the menu items having a spoke between it and the central hub.
- FIG. 1 is a block diagram depicting example components of one embodiment of a see-through, mixed reality display device in a system environment in which the device may operate.
- FIG. 2 illustrates a user wearing head mounted display device in an environment.
- FIG. 3 is a top view of a portion of one embodiment of a head mounted display unit.
- FIG. 4A is a block diagram of one embodiment of the components of a head mounted display unit.
- FIG. 4B is a block diagram of one embodiment of the components of a processing unit associated with a head mounted display unit.
- FIG. 5 is a block diagram of one embodiment of the components of a hub computing system used with head mounted display unit.
- FIG. 6 is a block diagram of one embodiment of a computing system that can be used to implement the hub computing system described herein.
- FIG. 7 is a flow chart depicting a process according to an embodiment of the present technology.
- FIG. 8A shows an example of a user interface symbol presented on an object in accordance with one embodiment.
- FIG. 8B depicts one example in which the symbol is highlighted.
- FIG. 8C depicts on example in which a handle extends away from the symbol.
- FIG. 8D is an example of one embodiment of a spokes menu.
- FIGS. 9A , 9 B, and 9 C show an example of one embodiment of a spokes menu.
- FIG. 10 is a flowchart of one embodiment of a process providing for a selection of a menu item on a spokes menu.
- FIGS. 11A , 11 B and 11 C show an example of a spoke being progressively filled.
- FIG. 12 is a flowchart of one embodiment of a process providing for a selection of a menu item on a spokes menu.
- FIG. 13 shows one embodiment in which the menu items are at a diagonal from the hub.
- FIGS. 14A and 14B depict another embodiment of a spokes menu.
- FIG. 15 shows an example in which there is a first user selection symbol and a second user selection symbol associated with object.
- FIG. 16 depicts one embodiment in which user selection symbol points in the direction in which the handle has been extended.
- FIG. 17 shows one example of nested spoke menus.
- FIG. 18 shows one embodiment that allows user input based on focusing on the handle.
- Hands-free user input in an augmented reality environment is provided for, in accordance with one embodiment.
- Access to system menus is provided for, in accordance with one embodiment.
- a user can provide input by moving the orientation of their head. For example, the user could rotate their head.
- a user can provide input by moving a direction of their eye gaze.
- a user interface symbol is presented in the near eye display device.
- the user interface symbol could be presented such that it appears to be on or near a real-world object or on or near a virtual object being rendered in the near eye display device.
- the user interface symbol may serve as an indicator to the user that the user has a chance to enter input to somehow interact with the object. If the user directs their attention at the user interface symbol, a handle extends away from the user interface symbol, in one embodiment.
- the system could determine that the user has focused their attention at the user interface symbol by determining the user's head orientation and/or their eye gaze.
- the user interface symbol may also be referred to as a “gleam,” as it can, in some embodiments, be made to gleam, glow, etc. in response to the user looking at it.
- the handle may serve as a type of selection device such that if the user directs their attention along the handle, away from the user interface symbol, a selection can be made.
- the user can cause a menu to appear by rotating their head such that system determines the user is looking along the handle away from the user interface symbol. In one embodiment, this is a spoke menu that has menu items surrounding a hub. The last point at which the user was looking to activate the spoke menu may serve as a hub for the spoke menu.
- the user can then select the menu items by directing their focus to one of the menu items.
- the user moves their head towards a menu item to select it. For example, the user might rotate their head left, right, up, or down to select the menu item.
- the user moves their eyes towards a menu item as part of a selection process.
- FIG. 1 is a block diagram depicting example components of one embodiment of a see-through, mixed reality display device 2 in a system environment 100 in which the device may operate.
- the see-through display device is a near-eye, head mounted display device 2 in communication with processing unit 4 via wire 6 .
- head mounted display device 2 communicates with processing unit 4 via wireless communication.
- Processing unit 4 may take various embodiments.
- processing unit 4 is a separate unit which may be worn on the user's body, e.g., the wrist in the illustrated example or in a pocket, and includes much of the computing power used to operate near-eye display device 2 .
- Processing unit 4 may communicate wirelessly (e.g., WiFi, Bluetooth, infra-red, or other wireless communication means) to one or more hub computing systems 12 , hot spots, cellular data networks, etc.
- the functionality of the processing unit 4 may be integrated in software and hardware components of the display device 2 .
- head mounted display device 2 which in one embodiment is in the shape of eyeglasses in a frame 115 , is worn on the head of a user so that the user can see through a display, embodied in this example as a display optical system 14 for each eye, and thereby have an actual direct view of the space in front of the user.
- a display embodied in this example as a display optical system 14 for each eye
- actual direct view refers to the ability to see real world objects directly with the human eye, rather than seeing created image representations of the objects. For example, looking through glass at a room allows a user to have an actual direct view of the room, while viewing a video of a room on a television is not an actual direct view of the room.
- the system can project images of virtual objects, sometimes referred to as virtual images, on the display that are viewable by the person wearing the see-through display device while that person is also viewing real world objects through the display.
- images of virtual objects sometimes referred to as virtual images
- Frame 115 provides a support for holding elements of the HMD 2 in place as well as a conduit for electrical connections.
- frame 115 provides a convenient eyeglass frame as support for the elements of the system discussed further below.
- other support structures can be used.
- An example of such a structure is a visor, hat, helmet or goggles.
- the frame 115 includes a temple or side arm for resting on each of a user's ears.
- Temple 102 is representative of an embodiment of the right temple and includes control circuitry 136 for the display device 2 .
- Nose bridge 104 of the frame includes a microphone 110 for recording sounds and transmitting audio data to processing unit 4 .
- Hub computing system 12 may be a computer, a gaming system or console, or the like. According to an example embodiment, the hub computing system 12 may include hardware components and/or software components such that hub computing system 12 may be used to execute applications such as gaming applications, non-gaming applications, or the like. An application may be executing on hub computing system 12 , the display device 2 , as discussed below on a mobile device 5 or a combination of these.
- the hub computing system 12 further includes one or more capture devices, such as capture devices 20 A and 20 B.
- the two capture devices can be used to capture the room or other physical environment of the user but are not necessary for use with see through head mounted display device 2 in all embodiments.
- Capture devices 20 A and 20 B may be, for example, cameras that visually monitor one or more users and the surrounding space such that gestures and/or movements performed by the one or more users, as well as the structure of the surrounding space, may be captured, analyzed, and tracked to perform one or more controls or actions within an application and/or animate an avatar or on-screen character.
- Hub computing system 12 may be connected to an audiovisual device 16 such as a television, a monitor, a high-definition television (HDTV), or the like that may provide game or application visuals.
- the audiovisual device 16 may be a three-dimensional display device.
- audiovisual device 16 includes internal speakers.
- audiovisual device 16 , a separate stereo or hub computing system 12 is connected to external speakers 22 .
- HMD 2 and processing unit 4 can be used without hub computing system 12 , in which case processing unit 4 will communicate with a WiFi network, a cellular network or other communication means.
- the processes described herein are performed in whole or in part by head mounted display device 2 , processing unit 4 , hub computing system 12 , or a combination thereof.
- various sensor technologies embedded in head mounted display device 2 including inertial sensing using inertial measurements from accelerometers and gyroscopes, global positioning system (GPS), eye tracking elements determine a focus of the user.
- focus it is meant where the user is looking or what has the user's attention.
- the user's focus can be determined by head orientation and/or eye gaze.
- the head orientation and eye gaze are focused at different points. For example, the user might not be looking straight ahead. In such as case, head orientation may be used to define the focus.
- the eye gaze could be used to determine the user's focus when the two differ.
- head mounted display device 2 and processing unit 4 will be described below with respect to FIGS. 3 , 4 A and 4 B.
- hub computing device 12 may be used to track the user and head mounted display device 2 to provide a preliminary determination of location and orientation of head mounted display device 2 .
- Various sensor technologies may be implemented in hub computing device 12 including RGB camera, depth sensor, and/or other technologies to determine location and orientation of head mounted display device 2 . More details of hub computing device 12 will be described below with respect to FIG. 5 .
- Additional information such as information retrieved from the cloud, information detected and/or gathered by one or more external devices, and other relevant information may also be used to identify and continuously track the user's head position and rotation.
- Techniques such as Simultaneous Localization and Mapping (SLAM) using RGB and/or depth sensor data may be employed to provide a real-time position of the user's head relative to the mapped environment.
- SLAM Simultaneous Localization and Mapping
- Environmental typography may be identified using data from the cloud and/or depth sensor data. Regions of the user's body can be identified (e.g., hand, arm, torso, legs) using the depth sensor when the user is looking at him or herself.
- One or more sensors may be used as redundancies to further refine the measurement of the total field of view of the user.
- FIG. 2 illustrates a user 2222 wearing head mounted display device 2 in an environment 2220 .
- the user's head position and orientation are continuously tracked.
- the user's eye gaze may also be tracked.
- time user's 2222 head position and orientation are identified using the various sensor technologies as described above, and user 2222 is shown with their focus directed towards element 2234 , which may be a picture, television screen, etc.
- the user's focus could be defined as a vector that originates at a point between the eyes and “shoots straight out.”
- the user's focus could also be defined using eye tracking.
- technology in the HMD 2 or elsewhere, can track the direction in which one or both of the user's eyes are gazing.
- eye gaze is considered to be a point at which the user's eyes are momentarily fixed.
- the environment includes a lamp 2224 , clock 2232 , a portion of wall 2230 , a portion of wall 2226 , a table 2240 , processing device 2238 , capture devices 20 A, 20 B, and display 2234 , floor 2231 , and a flower 2242 .
- the environment 2220 may be defined relative to a coordinate system 2250 , and the user's head position and orientation defined relative to a second coordinate system 2252 .
- the direction of focus can be ascertained. Also, eye tracking can be used instead, or in addition to head tracking, to ascertain the direction of focus.
- FIG. 3 depicts a top view of a portion of head mounted display device 2 , including a portion of the frame that includes temple 102 and nose bridge 104 . Only the right side of head mounted display device 2 is depicted. At the front of head mounted display device 2 is a physical environment facing camera 113 that can capture video and still images. Those images are transmitted to processing unit 4 , as described below.
- physical environment facing camera 113 is a depth camera that may capture a depth image of a scene.
- the depth image may include a two-dimensional (2-D) pixel area of the captured scene where each pixel in the 2-D pixel area may represent a depth value such as a distance in, for example, centimeters, millimeters, or the like of an object in the captured scene from the camera.
- the depth camera may include an IR illuminator transmitter and a hot reflecting surface like a hot mirror in front of the visible image sensor which lets the visible light pass and directs reflected IR radiation within a wavelength range transmitted by the illuminator to a CCD or other type of depth sensor.
- the data from the sensors may be sent to the processing unit 4 or hub computing system 12 for processing.
- the processing identifies and maps the user's real world field of view.
- physical environment facing camera 113 may also include a light meter for measuring ambient light.
- a portion of the frame of head mounted display device 2 will surround a display (that includes one or more lenses). In order to show the components of head mounted display device 2 , a portion of the frame surrounding the display is not depicted.
- the display includes a light guide optical element 112 , opacity filter 114 , see-through lens 116 and see-through lens 118 .
- opacity filter 114 is behind and aligned with see-through lens 116
- light-guide optical element 112 is behind and aligned with opacity filter 114
- see-through lens 118 is behind and aligned with light-guide optical element 112 .
- See-through lenses 116 and 118 are standard lenses used in eye glasses and can be made to any prescription (including no prescription).
- see-through lenses 116 and 118 can be replaced by a variable prescription lens.
- head mounted display device 2 will include only one see-through lens or no see-through lenses.
- a prescription lens can go inside light guide optical element 112 .
- Opacity filter 114 filters out natural light (either on a per pixel basis or uniformly) to enhance the contrast of the virtual imagery.
- Light guide optical element 112 channels artificial light to the eye. More details of the opacity filter 114 and light guide optical element 112 is provided below.
- an image source which (in one embodiment) includes micro display assembly 120 for projecting a virtual image and lens 122 for directing images from micro display 120 into light guide optical element 112 .
- lens 122 is a collimating lens.
- Control circuits 136 provide various electronics that support the other components of head mounted display device 2 . More details of control circuits 136 are provided below with respect to FIGS. 4A and 4B .
- ear phones 130 Inside, or mounted to temple 102 , are ear phones 130 , inertial and/or magnetic sensors 132 and temperature sensor 138 .
- inertial and magnetic sensors 132 include a three axis magnetometer 132 A, three axis gyro 132 B and three axis accelerometer 132 C (See FIG. 4A ).
- the inertial and/or magnetic sensors are for sensing position, orientation, and sudden accelerations of head mounted display device 2 .
- Micro display 120 projects an image through lens 122 .
- image generation technologies can be used to implement micro display 120 .
- micro display 120 can be implemented in using a transmissive projection technology where the light source is modulated by optically active material, backlit with white light. These technologies are usually implemented using LCD type displays with powerful backlights and high optical energy densities.
- Micro display 120 can also be implemented using a reflective technology for which external light is reflected and modulated by an optically active material. The illumination is forward lit by either a white source or RGB source, depending on the technology.
- DLP digital light processing
- LCOS liquid crystal on silicon
- Mirasol® display technology from Qualcomm, Inc.
- micro display 120 can be implemented using an emissive technology where light is generated by the display.
- a PicoPTM display engine from Microvision, Inc. emits a laser signal with a micro mirror steering either onto a tiny screen that acts as a transmissive element or beamed directly into the eye (e.g., laser).
- Light guide optical element 112 transmits light from micro display 120 to the eye 140 of the user wearing head mounted display device 2 .
- Light guide optical element 112 also allows light from in front of the head mounted display device 2 to be transmitted through light guide optical element 112 to eye 140 , as depicted by arrow 142 , thereby allowing the user to have an actual direct view of the space in front of head mounted display device 2 in addition to receiving a virtual image from micro display 120 .
- the walls of light guide optical element 112 are see-through.
- Light guide optical element 112 includes a first reflecting surface 124 (e.g., a mirror or other surface). Light from micro display 120 passes through lens 122 and becomes incident on reflecting surface 124 .
- the reflecting surface 124 reflects the incident light from the micro display 120 such that light is trapped inside a planar, substrate comprising light guide optical element 112 by internal reflection. After several reflections off the surfaces of the substrate, the trapped light waves reach an array of selectively reflecting surfaces 126 . Note that only one of the five surfaces is labeled 126 to prevent over-crowding of the drawing. Reflecting surfaces 126 couple the light waves incident upon those reflecting surfaces out of the substrate into the eye 140 of the user. As different light rays will travel and bounce off the inside of the substrate at different angles, the different rays will hit the various reflecting surface 126 at different angles. Therefore, different light rays will be reflected out of the substrate by different ones of the reflecting surfaces.
- each eye will have its own light guide optical element 112 .
- each eye can have its own micro display 120 that can display the same image in both eyes or different images in the two eyes.
- Opacity filter 114 which is aligned with light guide optical element 112 , selectively blocks natural light, either uniformly or on a per-pixel basis, from passing through light guide optical element 112 .
- the opacity filter can be a see-through LCD panel, electro chromic film, PDLC (Polymer dispersed Liquid Crystal) or similar device which is capable of serving as an opacity filter.
- PDLC Polymer dispersed Liquid Crystal
- Such a see-through LCD panel can be obtained by removing various layers of substrate, backlight and diffusers from a conventional LCD.
- the LCD panel can include one or more light-transmissive LCD chips which allow light to pass through the liquid crystal. Such chips are used in LCD projectors, for instance.
- Opacity filter 114 can include a dense grid of pixels, where the light transmissivity of each pixel is individually controllable between minimum and maximum transmissivities. While a transmissivity range of 0-100% is ideal, more limited ranges are also acceptable. As an example, a monochrome LCD panel with no more than two polarizing filters is sufficient to provide an opacity range of about 50% to 99% per pixel, up to the resolution of the LCD. At the minimum of 50%, the lens will have a slightly tinted appearance, which is tolerable. 100% transmissivity represents a perfectly clear lens.
- An “alpha” scale can be defined from 0-100%, where 0% allows no light to pass and 100% allows all light to pass. The value of alpha can be set for each pixel by the opacity filter control circuit 224 described below.
- a mask of alpha values can be used from a rendering pipeline, after z-buffering with proxies for real-world objects.
- the system When the system renders a scene for the augmented reality display, it takes note of which real-world objects are in front of which virtual objects. If a virtual object is in front of a real-world object, then the opacity should be on for the coverage area of the virtual object. If the virtual is (virtually) behind a real-world object, then the opacity should be off, as well as any color for that pixel, so the user will only see the real-world object for that corresponding area (a pixel or more in size) of real light.
- opacity filter can be rendered in color, such as with a color LCD or with other displays such as organic LEDs, to provide a wide field of view. More details of an opacity filter are provided in U.S. patent application Ser. No. 12/887,426, “Opacity Filter For See-Through Mounted Display,” filed on Sep. 21, 2010, incorporated herein by reference in its entirety.
- Head mounted display device 2 may also include a system for tracking the position of the user's eyes.
- head mounted display device 2 includes eye tracking assembly 134 which will include an eye tracking illumination device 134 A (see FIG. 4A ) and eye tracking camera 134 B (see FIG. 4A ).
- eye tracking illumination source 134 A includes one or more infrared (IR) emitters, which emit IR light toward the eye.
- Eye tracking camera 134 B includes one or more cameras that sense the reflected IR light.
- the position of the pupil can be identified by known imaging techniques which detects the reflection of the cornea. For example, see U.S. Pat. No. 7,401,920, entitled “Head Mounted Eye Tracking and Display System”, issued Jul.
- eye tracking involves obtaining an image of the eye and using computer vision techniques to determine the location of the pupil within the eye socket. In one embodiment, it is sufficient to track the location of one eye since the eye usually moves in unison. However, it is possible to track each eye separately.
- the eye tracking illumination device 134 A will use four IR LEDs and the eye tracking camera 134 B users four IR photo detectors (not illustrated) in rectangular arrangement so that there is one IR LED and IR photo detector at each corner of the lens of head mounted display device 2 .
- Light from the LEDs reflect off the eyes.
- the amount of infrared light detected at each of the four IR photo detectors determines the pupil direction. That is, the amount of white versus black in the eye will determine the amount of light reflected off the eye for that particular photo detector.
- the photo detector will have a measure of the amount of white or black in the eye. From the four samples, the system can determine the direction of the eye.
- Another alternative is to use four infrared LEDs as discussed above, but only one infrared CCD on the side of the lens of head mounted display device 2 .
- the CCD will use a small mirror and/or lens (fish eye) such that the CCD can image up to 75% of the visible eye from the glasses frame.
- the CCD will then sense an image and use computer vision to find the eye position, much like as discussed above. More or less than four IR transmitters and/or four IR sensors can also be used.
- Another embodiment for tracking the direction of the eyes is based on charge tracking. This concept is based on the observation that a retina carries a measurable positive charge and the cornea has a negative charge. Sensors are mounted by the user's ears (near earphones 130 ) to detect the electrical potential while the eyes move around and effectively read out what the eyes are doing in real time. Other embodiments for tracking eyes can also be used. Suitable tracking technologies is disclosed in U.S. Patent Application Publication 2012/0154277, “Optimized Focal Area for Augmented Reality Displays,” filed on Dec. 17, 2010, which is incorporated herein by reference in their entirety.
- FIG. 3 only shows half of head mounted display device 2 .
- a full head mounted display device 2 would include (where applicable) another set lenses, another opacity filter another light guide optical element, another micro display, another lens, room facing camera, eye tracking assembly, micro display, earphones, and temperature sensor.
- FIG. 4A is a block diagram depicting the various components of head mounted display device 2 . It will be understood with reference to FIG. 3 that some components illustrated in FIG. 4A may not be present in the embodiment shown in FIG. 3 .
- FIG. 4B is a block diagram describing the various components of processing unit 4 of FIG. 1 .
- Head mounted display device 2 the components of which are depicted in FIG. 4A , is used to display an optimized image to the user (user 2222 ). Additionally, the head mounted display device components of FIG. 4A may include many sensors that track various conditions including the head position and rotation of user 2222 . Head mounted display device 2 will receive instructions about the virtual image from processing unit 4 and will provide the sensor information back to processing unit 4 .
- Processing unit 4 the components of which are depicted in FIG. 4B will receive the sensory information from head mounted display device 2 and also from hub computing device 12 (See FIG. 1 ). Based on that information, processing unit 4 will determine where and when to provide a virtual image to the user and send instructions accordingly to the head mounted display device of FIG. 4A .
- FIG. 4A shows the control circuit 200 in communication with the power management circuit 202 .
- Control circuit 200 includes processor 210 , memory controller 212 in communication with memory 214 (e.g., D-RAM), camera interface 216 , camera buffer 218 , display driver 220 , display formatter 2222 , timing generator 226 , display out interface 228 , and display in interface 230 .
- memory 214 e.g., D-RAM
- all of components of control circuit 200 are in communication with each other via dedicated lines or one or more buses.
- each of the components of control circuit 200 is in communication with processor 210 .
- Camera interface 216 provides an interface to the two room facing cameras 113 and stores images received from the room facing cameras in camera buffer 218 .
- Display driver 220 will drive micro display 120 or 153 .
- Display formatter 2222 provides information, about the virtual image being displayed on micro display 120 or 153 , to opacity control circuit 224 , which controls opacity filter 114 .
- Timing generator 226 is used to provide timing data for the system.
- Display out interface 228 is a buffer for providing images from room facing cameras 113 to processing unit 4 .
- Display in 230 is a buffer for receiving images such as a virtual image to be displayed on micro display 120 .
- Display out 228 and display in 230 communicate with band interface 232 which is an interface to processing unit 4 .
- Display driver 220 may also drive mirror controller 162 to position the mirror 166 to display a focused image in accordance with the above embodiment of FIGS. 3C and 3D .
- Power management circuit 202 includes voltage regulator 234 , eye tracking illumination driver 236 , audio DAC and amplifier 238 , microphone preamplifier audio ADC 240 , temperature sensor interface 242 and clock generator 244 .
- Voltage regulator 234 receives power from processing unit 4 via band interface 232 and provides that power to the other components of head mounted display device 2 .
- Eye tracking illumination driver 236 provides the IR light source for eye tracking illumination 134 A, as described above.
- Audio DAC and amplifier 238 receive the audio information from earphones 130 .
- Microphone preamplifier and audio ADC 240 provide an interface for microphone 110 .
- Temperature sensor interface 242 is an interface for temperature sensor 138 .
- Power management unit 202 also provides power and receives data back from three axis magnetometer 132 A, three axis gyro 132 B and three axis accelerometer 132 C.
- FIG. 4B is a block diagram describing the various components of processing unit 4 .
- Control circuit 304 includes a central processing unit (CPU) 320 , graphics processing unit (GPU) 322 , cache 324 , RAM 326 , memory control 328 in communication with memory 330 (e.g., D-RAM), flash memory controller 332 in communication with flash memory 334 (or other type of non-volatile storage), display out buffer 336 in communication with head mounted display device 2 via band interface 302 and band interface 232 , display in buffer 338 in communication with head mounted display device 2 via band interface 302 and band interface 232 , microphone interface 340 in communication with an external microphone connector 342 for connecting to a microphone, PCI express interface for connecting to a wireless communication device 346 , and USB port(s) 348 .
- CPU central processing unit
- GPU graphics processing unit
- RAM random access memory
- memory control 328 in communication with memory 330 (e.g., D-RAM)
- flash memory controller 332 in communication with flash memory 3
- wireless communication device 346 can include a Wi-Fi enabled communication device, BlueTooth communication device, infrared communication device, etc.
- the USB port can be used to dock the processing unit 4 to hub computing device 12 in order to load data or software onto processing unit 4 , as well as charge processing unit 4 .
- CPU 320 and GPU 322 are the main workhorses for determining where, when and how to insert virtual images into the view of the user. More details are provided below.
- Power management circuit 306 includes clock generator 360 , analog to digital converter 362 , battery charger 364 , voltage regulator 366 , head mounted display power source 376 , and temperature sensor interface 372 in communication with temperature sensor 374 (located on the wrist band of processing unit 4 ).
- Analog to digital converter 362 is connected to a charging jack 370 for receiving an AC supply and creating a DC supply for the system.
- Voltage regulator 366 is in communication with battery 368 for supplying power to the system.
- Battery charger 364 is used to charge battery 368 (via voltage regulator 366 ) upon receiving power from charging jack 370 .
- HMD power source 376 provides power to the head mounted display device 2 .
- the above-described system will be configured to insert a virtual image into the field of view of a user so that the virtual image replaces the view of a real world object.
- the virtual image can be inserted without replacing the image of a real world object.
- the virtual image will be adjusted to match the appropriate orientation, size and shape based on the object being replaced or the environment for which the image is being inserted into.
- the virtual image can be adjusted to include reflectivity and shadows.
- head mounted display device 2 , processing unit 4 and hub computing device 12 work together as each of the devices includes a subset of sensors that are used to obtain the data for determining where, when and how to insert the virtual images.
- the calculations that determine where, how and when to insert a virtual image are performed by hub computing device 12 . In another embodiment, those calculations are performed by processing unit 4 . In another embodiment some of the calculations are performed by hub computing device 12 while other calculations are performed by processing unit 4 . In other embodiments, the calculations can be performed by head mounted display device 2 .
- hub computing device 12 will create a model of the environment that the user is in and track various moving objects in that environment.
- hub computing device 12 tracks the position and orientation of head mounted display device 2 .
- the model and the tracking information are provided from hub computing device 12 to processing unit 4 .
- Sensor information may also be obtained from head mounted display device 2 .
- Processing unit 4 uses the additional sensor information it receives from head mounted display device 2 to refine the field of view of the user and provide instructions to head mounted display device 2 on how, where and when to insert the virtual image.
- FIG. 5 illustrates an example embodiment of hub computing system 12 with a capture device.
- capture devices 20 A and 20 B are the same structure, therefore, FIG. 5 only shows capture device 20 A.
- capture device 20 A may be configured to capture video with depth information including a depth image that may include depth values via any suitable technique including, for example, time-of-flight, structured light, stereo image, or the like.
- the capture device 20 A may organize the depth information into “Z layers,” or layers that may be perpendicular to a Z axis extending from the depth camera along its line of sight.
- capture device 20 A may include a camera component 423 .
- camera component 423 may be or may include a depth camera that may capture a depth image of a scene.
- the depth image may include a two-dimensional (2-D) pixel area of the captured scene where each pixel in the 2-D pixel area may represent a depth value such as a distance in, for example, centimeters, millimeters, or the like of an object in the captured scene from the camera.
- Camera component 23 may include an infra-red (IR) light component 425 , a three-dimensional (3-D) camera 426 , and an RGB (visual image) camera 428 that may be used to capture the depth image of a scene.
- IR infra-red
- 3-D three-dimensional
- RGB visual image
- the IR light component 425 of the capture device 20 A may emit an infrared light onto the scene and may then use sensors (in some embodiments, including sensors not shown) to detect the backscattered light from the surface of one or more targets and objects in the scene using, for example, the 3-D camera 426 and/or the RGB camera 428 .
- pulsed infrared light may be used such that the time between an outgoing light pulse and a corresponding incoming light pulse may be measured and used to determine a physical distance from the capture device 20 A to a particular location on the targets or objects in the scene. Additionally, in other example embodiments, the phase of the outgoing light wave may be compared to the phase of the incoming light wave to determine a phase shift. The phase shift may then be used to determine a physical distance from the capture device to a particular location on the targets or objects.
- time-of-flight analysis may be used to indirectly determine a physical distance from the capture device 20 A to a particular location on the targets or objects by analyzing the intensity of the reflected beam of light over time via various techniques including, for example, shuttered light pulse imaging.
- capture device 20 A may use a structured light to capture depth information.
- patterned light i.e., light displayed as a known pattern such as grid pattern, a stripe pattern, or different pattern
- the pattern may become deformed in response.
- Such a deformation of the pattern may be captured by, for example, the 3-D camera 426 and/or the RGB camera 428 (and/or other sensor) and may then be analyzed to determine a physical distance from the capture device to a particular location on the targets or objects.
- the IR Light component 425 is displaced from the cameras 425 and 426 so triangulation can be used to determined distance from cameras 425 and 426 .
- the capture device 20 A will include a dedicated IR sensor to sense the IR light, or a sensor with an IR filter.
- the capture device 20 A may include two or more physically separated cameras that may view a scene from different angles to obtain visual stereo data that may be resolved to generate depth information.
- Other types of depth image sensors can also be used to create a depth image.
- the capture device 20 A may further include a microphone 430 , which includes a transducer or sensor that may receive and convert sound into an electrical signal. Microphone 430 may be used to receive audio signals that may also be provided by hub computing system 12 .
- the capture device 20 A may further include a processor 432 that may be in communication with the image camera component 423 .
- Processor 432 may include a standardized processor, a specialized processor, a microprocessor, or the like that may execute instructions including, for example, instructions for receiving a depth image, generating the appropriate data format (e.g., frame) and transmitting the data to hub computing system 12 .
- Capture device 20 A may further include a memory 434 that may store the instructions that are executed by processor 432 , images or frames of images captured by the 3-D camera and/or RGB camera, or any other suitable information, images, or the like.
- memory 434 may include random access memory (RAM), read only memory (ROM), cache, flash memory, a hard disk, or any other suitable storage component.
- RAM random access memory
- ROM read only memory
- cache flash memory
- hard disk or any other suitable storage component.
- memory 434 may be a separate component in communication with the image capture component 423 and processor 432 .
- the memory 434 may be integrated into processor 432 and/or the image capture component 422 .
- Capture devices 20 A and 20 B are in communication with hub computing system 12 via a communication link 436 .
- the communication link 436 may be a wired connection including, for example, a USB connection, a Firewire connection, an Ethernet cable connection, or the like and/or a wireless connection such as a wireless 802.11b, g, a, or n connection.
- hub computing system 12 may provide a clock to capture device 20 A that may be used to determine when to capture, for example, a scene via the communication link 436 .
- the capture device 20 A provides the depth information and visual (e.g., RGB) images captured by, for example, the 3-D camera 426 and/or the RGB camera 428 to hub computing system 12 via the communication link 436 .
- RGB visual
- the depth images and visual images are transmitted at 30 frames per second; however, other frame rates can be used.
- Hub computing system 12 may then create and use a model, depth information, and captured images to, for example, control an application such as a game or word processor and/or animate an avatar or on-screen character.
- Hub computing system 12 includes depth image processing and skeletal tracking module 450 , which uses the depth images to track one or more persons detectable by the depth camera function of capture device 20 A.
- Depth image processing and skeletal tracking module 450 provides the tracking information to application 453 , which can be a video game, productivity application, communications application or other software application etc.
- the audio data and visual image data is also provided to application 452 and depth image processing and skeletal tracking module 450 .
- Application 452 provides the tracking information, audio data and visual image data to recognizer engine 454 .
- recognizer engine 454 receives the tracking information directly from depth image processing and skeletal tracking module 450 and receives the audio data and visual image data directly from capture devices 20 A and 20 B.
- Recognizer engine 454 is associated with a collection of filters 460 , 462 , 464 , . . . , 466 each comprising information concerning a gesture, action or condition that may be performed by any person or object detectable by capture device 20 A or 20 B.
- the data from capture device 20 A may be processed by filters 460 , 462 , 464 , . . . , 466 to identify when a user or group of users has performed one or more gestures or other actions.
- Those gestures may be associated with various controls, objects or conditions of application 452 .
- hub computing system 12 may use the recognizer engine 454 , with the filters, to interpret and track movement of objects (including people).
- Capture devices 20 A and 20 B provide RGB images (or visual images in other formats or color spaces) and depth images to hub computing system 12 .
- the depth image may be a plurality of observed pixels where each observed pixel has an observed depth value.
- the depth image may include a two-dimensional (2-D) pixel area of the captured scene where each pixel in the 2-D pixel area may have a depth value such as distance of an object in the captured scene from the capture device.
- Hub computing system 12 will use the RGB images and depth images to track a user's or object's movements. For example, the system will track a skeleton of a person using the depth images. There are many methods that can be used to track the skeleton of a person using depth images.
- Recognizer engine 454 includes multiple filters 460 , 462 , 464 , . . . , 466 to determine a gesture or action.
- a filter comprises information defining a gesture, action or condition along with parameters, or metadata, for that gesture, action or condition. For instance, a throw, which comprises motion of one of the hands from behind the rear of the body to past the front of the body, may be implemented as a gesture comprising information representing the movement of one of the hands of the user from behind the rear of the body to past the front of the body, as that movement would be captured by the depth camera. Parameters may then be set for that gesture.
- a parameter may be a threshold velocity that the hand has to reach, a distance the hand travels (either absolute, or relative to the size of the user as a whole), and a confidence rating by the recognizer engine that the gesture occurred.
- These parameters for the gesture may vary between applications, between contexts of a single application, or within one context of one application over time.
- Filters may be modular or interchangeable.
- a filter has a number of inputs (each of those inputs having a type) and a number of outputs (each of those outputs having a type).
- a first filter may be replaced with a second filter that has the same number and types of inputs and outputs as the first filter without altering any other aspect of the recognizer engine architecture. For instance, there may be a first filter for driving that takes as input skeletal data and outputs a confidence that the gesture associated with the filter is occurring and an angle of steering.
- a filter need not have a parameter. For instance, a “user height” filter that returns the user's height may not allow for any parameters that may be tuned.
- An alternate “user height” filter may have tunable parameters—such as to whether to account for a user's footwear, hairstyle, headwear and posture in determining the user's height.
- Inputs to a filter may comprise things such as joint data about a user's joint position, angles formed by the bones that meet at the joint, RGB color data from the scene, and the rate of change of an aspect of the user.
- Outputs from a filter may comprise things such as the confidence that a given gesture is being made, the speed at which a gesture motion is made, and a time at which a gesture motion is made.
- Recognizer engine 454 may have a base recognizer engine that provides functionality to the filters.
- the functionality that recognizer engine 454 implements includes an input-over-time archive that tracks recognized gestures and other input, a Hidden Markov Model implementation (where the modeled system is assumed to be a Markov process—one where a present state encapsulates any past state information used to determine a future state, so no other past state information must be maintained for this purpose—with unknown parameters, and hidden parameters are determined from the observable data), as well as other functionality used to solve particular instances of gesture recognition.
- Filters 460 , 462 , 464 , . . . , 466 are loaded and implemented on top of the recognizer engine 454 and can utilize services provided by recognizer engine 454 to all filters 460 , 462 , 464 , . . . , 466 .
- recognizer engine 454 receives data to determine whether it meets the requirements of any filter 460 , 462 , 464 , . . . , 466 . Since these provided services, such as parsing the input, are provided once by recognizer engine 454 rather than by each filter 460 , 462 , 464 , . . . , 466 , such a service need only be processed once in a period of time as opposed to once per filter for that period, so the processing used to determine gestures is reduced.
- Application 452 may use the filters 460 , 462 , 464 , . . . , 466 provided with the recognizer engine 454 , or it may provide its own filter, which plugs in to recognizer engine 454 .
- all filters have a common interface to enable this plug-in characteristic. Further, all filters may utilize parameters, so a single gesture tool below may be used to debug and tune the entire filter system.
- recognizer engine 454 More information about recognizer engine 454 can be found in U.S. patent application Ser. No. 12/422,661, “Gesture Recognizer System Architecture,” filed on Apr. 13, 2009, incorporated herein by reference in its entirety. More information about recognizing gestures can be found in U.S. patent application Ser. No. 12/391,150, “Standard Gestures,” filed on Feb. 23, 2009; and U.S. patent application Ser. No. 12/474,655, “Gesture Tool” filed on May 29, 2009, both of which are incorporated herein by reference in their entirety.
- hub computing system 12 includes a user profile database 470 that includes user-specific information related to one or more users interacting with hub computing system 12 .
- the user-specific information includes information related to a user such as the user's expressed preferences, the user's friends' list, the user's preferred activities, a list of the user's reminders, the user's social groups, the user's current location, the user's past intents to interact with objects in the user's environment and other user created content, such as the user's photos, images and recorded videos.
- the user-specific information may be obtained from one or more data sources such as the user's social networking sites, address book, email data, Instant Messaging data, user profiles or other sources on the Internet. In one approach, and as will be discussed in detail below, the user-specific information is utilized to automatically determine the user's intent to interact with one or more objects in the user's environment.
- FIG. 6 illustrates an example embodiment of a computing system that may be used to implement hub computing system 12 .
- the multimedia console 500 has a central processing unit (CPU) 501 having a level 1 cache 502 , a level 2 cache 504 , and a flash ROM (Read Only Memory) 506 .
- the level 1 cache 502 and a level 2 cache 504 temporarily store data and hence reduce the number of memory access cycles, thereby improving processing speed and throughput.
- CPU 501 may be provided having more than one core, and thus, additional level 1 and level 2 caches 502 and 504 .
- the flash ROM 506 may store executable code that is loaded during an initial phase of a boot process when the multimedia console 500 is powered on.
- a graphics processing unit (GPU) 508 and a video encoder/video codec (coder/decoder) 514 form a video processing pipeline for high speed and high resolution graphics processing. Data is carried from the graphics processing unit 508 to the video encoder/video codec 514 via a bus. The video processing pipeline outputs data to an A/V (audio/video) port 540 for transmission to a television or other display.
- a memory controller 510 is connected to the GPU 508 to facilitate processor access to various types of memory 512 , such as, but not limited to, a RAM (Random Access Memory).
- the multimedia console 500 includes an I/O controller 520 , a system management controller 522 , an audio processing unit 523 , a network interface 524 , a first USB host controller 526 , a second USB controller 528 and a front panel I/O subassembly 530 that are preferably implemented on a module 518 .
- the USB controllers 526 and 528 serve as hosts for peripheral controllers 542 ( 1 )- 542 ( 2 ), a wireless adapter 548 , and an external memory device 546 (e.g., flash memory, external CD/DVD ROM drive, removable media, etc.).
- the network interface 524 and/or wireless adapter 548 provide access to a network (e.g., the Internet, home network, etc.) and may be any of a wide variety of various wired or wireless adapter components including an Ethernet card, a modem, a Bluetooth module, a cable modem, and the like.
- a network e.g., the Internet, home network, etc.
- wired or wireless adapter components including an Ethernet card, a modem, a Bluetooth module, a cable modem, and the like.
- System memory 543 is provided to store application data that is loaded during the boot process.
- a media drive 544 is provided and may comprise a DVD/CD drive, Blu-Ray drive, hard disk drive, or other removable media drive, etc.
- the media drive 144 may be internal or external to the multimedia console 500 .
- Application data may be accessed via the media drive 544 for execution, playback, etc. by the multimedia console 500 .
- the media drive 544 is connected to the I/O controller 520 via a bus, such as a Serial ATA bus or other high speed connection (e.g., IEEE 1394).
- the system management controller 522 provides a variety of service functions related to assuring availability of the multimedia console 500 .
- the audio processing unit 523 and an audio codec 532 form a corresponding audio processing pipeline with high fidelity and stereo processing. Audio data is carried between the audio processing unit 523 and the audio codec 532 via a communication link.
- the audio processing pipeline outputs data to the A/V port 540 for reproduction by an external audio user or device having audio capabilities.
- the front panel I/O subassembly 530 supports the functionality of the power button 550 and the eject button 552 , as well as any LEDs (light emitting diodes) or other indicators exposed on the outer surface of the multimedia console 500 .
- a system power supply module 536 provides power to the components of the multimedia console 500 .
- a fan 538 cools the circuitry within the multimedia console 500 .
- the CPU 501 , GPU 508 , memory controller 510 , and various other components within the multimedia console 500 are interconnected via one or more buses, including serial and parallel buses, a memory bus, a peripheral bus, and a processor or local bus using any of a variety of bus architectures.
- bus architectures can include a Peripheral Component Interconnects (PCI) bus, PCI-Express bus, etc.
- application data may be loaded from the system memory 543 into memory 512 and/or caches 502 , 504 and executed on the CPU 501 .
- the application may present a graphical user interface that provides a consistent user experience when navigating to different media types available on the multimedia console 500 .
- applications and/or other media contained within the media drive 544 may be launched or played from the media drive 544 to provide additional functionalities to the multimedia console 500 .
- the multimedia console 500 may be operated as a standalone system by simply connecting the system to a television or other display. In this standalone mode, the multimedia console 500 allows one or more users to interact with the system, watch movies, or listen to music. However, with the integration of broadband connectivity made available through the network interface 524 or the wireless adapter 548 , the multimedia console 500 may further be operated as a participant in a larger network community. Additionally, multimedia console 500 can communicate with processing unit 4 via wireless adaptor 548 .
- a set amount of hardware resources are reserved for system use by the multimedia console operating system. These resources may include a reservation of memory, CPU and GPU cycle, networking bandwidth, etc. Because these resources are reserved at system boot time, the reserved resources do not exist from the application's view.
- the memory reservation preferably is large enough to contain the launch kernel, concurrent system applications and drivers.
- the CPU reservation is preferably constant such that if the reserved CPU usage is not used by the system applications, an idle thread will consume any unused cycles.
- lightweight messages generated by the system applications are displayed by using a GPU interrupt to schedule code to render popup into an overlay.
- the amount of memory required for an overlay depends on the overlay area size and the overlay preferably scales with screen resolution. Where a full user interface is used by the concurrent system application, it is preferable to use a resolution independent of application resolution. A scaler may be used to set this resolution such that the need to change frequency and cause a TV resync is eliminated.
- multimedia console 500 boots and system resources are reserved, concurrent system applications execute to provide system functionalities.
- the system functionalities are encapsulated in a set of system applications that execute within the reserved system resources described above.
- the operating system kernel identifies threads that are system application threads versus gaming application threads.
- the system applications are preferably scheduled to run on the CPU 501 at predetermined times and intervals in order to provide a consistent system resource view to the application. The scheduling is to minimize cache disruption for the gaming application running on the console.
- a multimedia console application manager controls the gaming application audio level (e.g., mute, attenuate) when system applications are active.
- Optional input devices are shared by gaming applications and system applications.
- the input devices are not reserved resources, but are to be switched between system applications and the gaming application such that each will have a focus of the device.
- the application manager preferably controls the switching of input stream, without knowing the gaming application's knowledge and a driver maintains state information regarding focus switches.
- Capture devices 20 A and 20 B may define additional input devices for the console 500 via USB controller 526 or other interface.
- hub computing system 12 can be implemented using other hardware architectures. The implementation of hub computing system 12 does not require any particular hardware architecture.
- FIG. 7 is a simplified flow chart depicting a process according to an embodiment of the present technology. Other steps could be added and many variations are possible. Not all steps are required. In one embodiment, the processing depicted in FIG. 7 may be performed by one or more components of head mounted device 2 as depicted in FIG. 1 . Various steps of the process could be performed by software, hardware, or some combination of hardware and software. Herein, the term “logic” refers to software, hardware, or some combination of hardware and software. Thus, the HMD 2 may have logic that is configured to perform operations described in FIG. 7 . The logic that performs some or all of the operations could be off the HMD 2 at hub computing system 12 , or some other processor.
- steps illustrated in FIG. 7 may be performed in parallel or simultaneously, and steps may be performed continuously.
- step 702 could be performed continuously such that the user's focus is continuously tracked.
- a focus of a user that is wearing a HMD 2 is determined.
- the focus refers to where the user is directing their attention (e.g., looking).
- the focus is determined by tracking head orientation and/or position. This may include tracking head translation, rotation, etc.
- the system 100 determines what they are focused on.
- the focus is determined based on eye gaze tracking.
- the focus is determined based on a combination of eye gaze tracking and head tracking.
- information identifying the position and orientation of the user and the head mounted display device 2 associated with the user is transmitted from the hub computing device 12 and received by processing unit 4 .
- FIG. 2 shows one example of a 3D coordinate system 2252 in which the position and orientation of the user, their head, and the head mounted display device 2 might be defined.
- hub computing device 12 will use one or more depth images and one or more visual images to track a user (e.g., using skeleton tracking).
- One or more depth images and one or more visual images can be used to determine the position and orientation of the user and the head mounted display device associated with the user.
- information sensed locally e.g., from the head mounted device 2 ) by one or more sensors embedded in the head mounted display device 2 is used in step 702 .
- Various sensor technologies embedded in the head mounted display including the accelerometer, magnetometer and gyroscope, or other sensor technologies may be used to identify head position and orientation. Other sensor technologies may include time of flight, spatial scan, mechanical linkages, phase-difference sensing, and/or direct field sensing.
- processing unit 4 will access data from three axis gyro 132 B, three axis accelerometer 132 C, and three axis magnetometer 132 A.
- information identifying a position of the user's eye in the environment is used in step 702 .
- This is also known as the “eye tracking” as discussed above.
- the user's eye can be illuminated using infrared light from eye tracking illumination 134 A.
- the reflection from the eye is detected using one or more eye tracking cameras 134 B.
- the reflection data is sent from head mounted display device 2 to processing unit 4 .
- Processing unit 4 will determine the position of the eye based on the reflection data, as discussed above.
- the information identifying a position of the user's eye in the environment is received or accessed by processing unit 4 .
- Suitable tracking technologies are disclosed in U.S. Patent Application Publication 2012/0154277, “Optimized Focal Area for Augmented Reality Displays,” filed on Dec. 17, 2010, which is incorporated herein by reference in its entirety.
- Additional information such as information retrieved from the cloud, information detected and/or gathered by one or more external devices, and other relevant information may also be used in step 702 .
- Techniques such as Simultaneous Localization and Mapping (SLAM) using RGB and/or depth sensor data may be employed to provide a real-time position of the user's head relative to the mapped environment.
- SLAM Simultaneous Localization and Mapping
- Environmental typography may be identified using data from the cloud and/or depth sensor data. Regions of the user's body can be identified (e.g., hand, arm, torso, legs) using the depth sensor when the user is looking at him or herself.
- the system 100 determines whether the user's focus is in a general direction of an object associated with the user interface symbol.
- the user interface symbol may or may not be displayed in the HMD 2 .
- the user interface symbol could be associated with a real world object or a virtual object.
- the symbol could be associated with an actual real world book or with a virtual dog.
- FIG. 8A shows an example of a user interface symbol presented on an object in accordance with one embodiment.
- a user interface symbol 804 is presented in the HMD 2 such that it appears as being associated with the object 802 .
- the object 802 could be a real world object or a virtual object being rendering in the HMD 2 .
- the symbol 804 is presented on one edge of the object 802 , and appears to touch the object 802 .
- the symbol 804 is not required to appear to be in contact with the object 802 . However, being close by helps the user to understand that the symbol 804 may serve as a way to initiate a user interaction, user input, etc. in connection with the object 802 .
- step 706 the system 100 could present more than one symbol 804 on the object 802 . This is because an object 802 could have more than one symbol 804 associated with it.
- one option is to present the symbol 804 in the HMD 2 even if the user is not looking at the object 802 .
- the system 100 might make the symbol 804 somewhat more visible when the user looks at the object 802 , or in its general direction.
- step 704 determines that the user is not focused in the direction of the object 802 . If the system determined in step 704 that the user is not focused in the direction of the object 802 , then control passes to step 702 to continue to monitor the user's focus. Note that the symbol 804 need not be displayed if the user is not focused in the direction of the object 802 , although symbol 804 optionally is displayed in some format. This format could be different from when the user is looking in the direction of the object 802 . For example, the symbol 804 might be less perceptible.
- the system 100 determines whether the user is focused on the symbol 804 , in step 708 . This can be achieved by monitoring the user's head orientation and/or position and/or the user's eye gaze. In one embodiment, the system 100 determines whether the user's attention is focused at the symbol 804 for some minimum time.
- the object 802 can have more than one symbol 804 associated with it.
- the system 100 determines which symbol 804 the user is most directly focused on and highlights that symbol 804 .
- the system 100 displays a handle in the HMD 2 extending away from the symbol 804 , in step 710 .
- a handle in the HMD 2 extending away from the symbol 804 , in step 710 .
- FIG. 8B depicts one example in which the symbol 804 is highlighted.
- FIG. 8C depicts one example in which a handle 806 extends away from the symbol 804 .
- the handle 806 may or may not have a text label associated with it.
- the handle 806 may or may not have an icon associated with it.
- the system 100 determines whether the user moves their focus away from the symbol 804 along the handle 806 , in step 712 . If not, the system 100 returns to step 708 . The user might choose to look away from the symbol 804 after initially looking at it. In this case, the test of step 708 would be “no” and control passes to step 704 .
- step 712 includes the system 100 determining whether the user moves their head in a way that indicates that their focus is moving along the handle 806 . This could be a rotation of the head, as one example.
- the system 100 determines whether the user moves their eyes in a way that indicates that their eye gaze is moving along the handle 806 .
- eye gaze determined by eye tracking is considered to be a point at which the user's eye (or eyes) are momentarily are fixed.
- the system 100 can track the different points at which the user is gazing and determine that these points are moving progressively along the handle 806 .
- the shift in user focus need not be exactly along the handle 806 .
- the system 100 can still consider this to be focus along the handle 806 .
- the degree of freedom that the system 100 provides here may be an adjustable parameter. Note that this system 100 can determine the user's focus along the handle 806 by tracking head orientation and/or eye gaze.
- the system 100 determines whether a selection criterion has been met in step 714 .
- the selection criterion could be that the user's focus is at or near the end of the handle 806 , as one example.
- a user interface element is activated, in step 716 .
- a spokes menu in the HMD 2 .
- Example spokes menus are discussed herein.
- the user interface element is activated at or near the end of the handle 806 that is away from the symbol 804 , in one embodiment.
- step 714 in some cases the user will not move their focus along the handle 806 after looking at the symbol 804 . Furthermore, the user may look away from the symbol 804 .
- step 720 is to discontinue presenting the symbol 804 and the handle 806 in the HMD 2 .
- the system 100 may return to monitoring the focus of the user in step 702 .
- the user continues to focus on the handle 806 , but the selection criterion is not met. For example, the user does not look far enough to the end of the handle 806 .
- the user looks back at the symbol 804 after looking at the handle 806 (but not meeting the selection criterion). In either case, the system 100 continues to present the handle 806 . This is represented by the “handle or symbol” condition of step 718 passing control to step 710 .
- step 706 passes to step 706 in this case.
- the system 100 may provide feedback to the user.
- This feedback could be provided in connection with a variety of events. For example, when the user looks at the symbol 804 audio feedback could be provided. When the user looks along the handle 806 , audio feedback could be provided to assist the user in knowing that progress is being made. Visual feedback could also be provided as the user moves their focus along the handle 806 .
- the result of step 714 is selection of an input.
- the object 802 might be a virtual dog that wags its tail as a result of the selection in step 714 . In this example, this might be the only selection that is associated with the symbol 804 .
- FIG. 8D is an example of one embodiment of a spokes menu 808 .
- the object 802 that is associated with the spokes menu 808 is also shown in FIG. 8D .
- the spokes menu 808 has a central hub 816 , which is this example is represented as a black dot.
- the central hub 816 is not necessarily shown to the user in the HMD 2 .
- the central hub 816 is located where the end of the handle 806 that is opposite the symbol 804 was, in one embodiment. However, the handle 806 and symbol 804 are no longer presented in the display when the spokes menu 808 is displayed, in one embodiment.
- the spokes menu 808 appears to grow out of the end of the handle 806 ; then, the handle 806 and symbol 804 disappear.
- the menu 808 has a number of menu items 810 at the exterior, around the central hub 816 . There is a spoke 812 between each menu item 810 and the hub 816 . The spokes 812 are not necessarily shown to the user at all times. Initially, it is expected that the user's focus will be at the hub 816 . This is due to the hub 816 being located at the end of the handle 806 . As previously noted, the user may shift their focus towards the end of the handle 806 to activate the spoke menu 808 . By activating this means causing the HMD to display the spoke menu 808 .
- the user is able to select a menu item 810 by shifting their focus from the hub 816 towards one of the menu items 810 .
- the spoke 812 between the hub 816 and that menu item 810 might not be shown at all.
- the spoke 812 grows towards the menu item 810 , in one embodiment. Further details are discussed below.
- FIG. 9A is another example of one embodiment of a spokes menu 808 .
- the object 802 with which the spokes menu 808 is associated is not depicted.
- the spokes 812 and hub 816 need not be presented in the HMD 2 .
- the menu items 810 are actions associated with the object 802 , although this is not a requirement.
- the menu items 810 include various controls (e.g., volume, play/pause, FF/Rev) for operating a video or the like.
- the menu items 810 have text labels, but that is not required.
- the spokes 812 may or may not have text labels associated with them.
- the spokes 812 may or may not have icons associated with them.
- spoke menus 808 can be nested. This means that selection of a menu item 810 can spawn another spoke menu 808 . For example, if the user selects the menu item 810 labeled volume, this could spawn the spoke menu 808 depicted in FIG. 9B . For example, if the user selects the menu item 810 labeled FF/Rev, this could spawn the spoke menu 808 depicted in FIG. 9C . Spawning a menu means that the menu is presented in the HMD 2 .
- FIG. 10 is a flowchart of one embodiment of a process 1000 providing for a selection of a menu item 810 on a spokes menu 808 .
- Process 1000 may be performed after step 714 from FIG. 7 . Recall the step 714 results in a spokes menu 808 being activated, in one embodiment. For example, a spokes menu 808 such as depicted in FIG. 8D or 9 A is activated.
- head tracking is used. Eye tracking need not be used in process 1000 .
- Process 1000 may be performed by hardware, software, or some combination of hardware and software. In one embodiment, process 1000 is performed, at least in part, by an HMD 2 . A processor coupled to the HMD 2 can assist in some of the steps.
- the system 100 may assume that the user's attention is focused on the hub 816 of the spokes menu 808 . This is described in step 1002 . This assumption may be made given that the hub 816 may be located at or near the end of the handle 806 . As noted above, the hub 816 does not need to be visually presented in the HMD 2 . Also, it is not necessary that the user is actually looking directly at the location of the hub 816 .
- Step 1004 is tracking the user's head orientation and/or position. This has already been discussed. The head tracking may be performed continuously throughout process 1000 .
- step 1006 the system 100 determines whether the user's head is moving such that the user's focus is moving away from the hub 816 . There may be some “safe zone” near the hub 816 such that the system 100 does not consider this to be a significant head movement unless the head moves some defined amount (e.g., beyond a threshold).
- the manner in which the head movement is measured can vary. One technique is to track head rotation in terms of degrees. The head may need to move some minimum amount in terms of degree of rotation to consider this to be a significant movement.
- the system 100 highlights the relevant spoke 812 in step 1008 .
- a portion of the spoke 812 is highlighted. This portion may be the portion that is closest to the hub 816 .
- the amount by which the spoke 812 is filled depends on how far the user has moved (e.g., rotated) their head, in one embodiment.
- FIGS. 11A-11C show an example of a spoke 812 being progressively filled. This may be in response to the user moving their head further away from the hub 816 in a direction towards the menu item labeled “louder.” For example, as the user rotates their head such that a vector that “shoots straight out” from between their eyes moves from the hub 816 to the right towards the “louder” menu item, the spoke 812 is progressively filled. Progressively filling the spoke 812 means to progressively or gradually present more of the spoke 812 in the HMD 2 , in one embodiment. Progressively filling the spoke 812 means to progressively or gradually “fill in” the spoke 812 by, for example, making it more saturated, brighter, filled in, etc., in one embodiment. For example, the entire spoke 812 may initially be faded out. Then, progressively or gradually more of the spoke 812 is made brighter, filled in, saturated, etc. Note that spokes 812 can be “unfilled,” as well, depending on the focus of the user.
- the filling the spoke 812 could have different sensitivities with respect to the head movement.
- the degree to which the spoke 812 fills in response to a certain amount of head movement can be an adjustable parameter. What this implies is that the aforementioned vector from the user's head need not point exactly to where the spoke 812 has been filled.
- the spoke 812 has been filled to point A.
- the vector from the user's head may or may not be pointed at point A.
- the vector could be pointing closer to the hub 816 or further from the hub 816 than point A.
- the sensitivity can help to prevent false positives while still making it easy for the user to provide input.
- spokes 812 are filled in what may be referred to as a “growth model”.
- the spoke 812 can continue to fill so long as the user maintains their head position outside of the “safe zone” near the central hub 816 . This assumes the head position is also generally along the region of the spoke 812 . Thus, even with the user's head not moving, the spoke 812 can continue to progressively fill as depicted in the sequence from FIG. 11A-11C .
- the system 100 determines whether a selection criterion has been met.
- the selection criterion is met by the head moving a certain amount.
- the system 100 determines whether the user's head position reaches a selection point.
- This selection point refers to a certain amount of movement (e.g., head rotation), in one embodiment.
- a selection point is one example of a selection criterion.
- the system selects the menu item in step 1012 .
- the “louder” menu item 810 is selected when the user's head has moved (e.g., rotated) a defined amount. This amount could be such that the aforementioned head vector is pointing at the louder menu item 810 , to the right of it, or to the left of it (but right of the hub).
- spokes menu 808 could go away.
- the spokes menu 808 in FIG. 9B could go away.
- a different spokes menu to replace the existing one, which is referred to herein as “spawning” a spokes menu 808 .
- the menu item of FIG. 9B could be spawned upon selection of the volume menu item of FIG. 9A .
- Other techniques can be used to make a spokes menu 808 disappear, such as the user nodding their head.
- the degree to which the user needs to move their head to make the selection can be an adjustable parameter. This sensitivity could depend on the configuration of the spoke menu 808 . For example, some menu items 810 could be located further away from the hub 816 than others (where further away may be measured in terms of degrees of head rotation, as one example).
- the selection point is reached when the spoke 812 has filled to a certain degree. As one example, this can be when the spoke 812 reaches the menu item 810 , as depicted in FIG. 11C .
- This selection technique might be used in connection to the “growth model” technique described above. However, it can also be used in connection with other techniques such as ones that progressively fill the spoke 812 as the user continues to further move (e.g., rotate) their head.
- step 1010 determines whether the user is still focused on the same spoke. If so, then control passes to step 1010 to determine whether the selection criterion is met. Note that while the user continues to focus on the same spoke it may gradually fill in more or may gradually “unfill,” depending on the user's focus.
- step 1014 it is possible that the user begins to move their head along a spoke 812 , but then changes their mind and returns their focus to the central hub 816 or to a different spoke 812 .
- One option is to return to step 1008 to highlight the spoke that the user is presently focused on (e.g., head is oriented towards).
- step 1014 Another option stemming from step 1014 is to return to step 1004 upon determining that the user's head is oriented at the safe zone near the hub 816 . This is represented by the “hub” condition of step 1014 .
- step 1016 Still another possibility is to abort in step 1016 if it is determined that the user is no longer looking anywhere near the spoke menu 808 . This is represented by the “away from spoke menu” condition of step 1014 passing control to step 1016 .
- the system 100 may provide feedback to the user.
- This feedback could be provided in connection with a variety of events. For example, when the spoke 812 is being progressively filled, audio feedback could be provided to assist the user in knowing that progress is being made.
- the user is allowed to move their eyes without triggering a response from the system 100 . This is because it is head tracking that is used in this technique. This allows the user to keep their head relatively still while moving their eyes to scan the different menu items 810 in the spokes menu 808 . The user does not need to keep their head perfectly still if there is a “safe zone” or the like, as previously discussed.
- FIG. 12 is a flowchart of one embodiment of a process 1200 providing for a selection of a menu item 810 on a spokes menu 808 .
- Process 1200 may be performed after step 714 from FIG. 7 . Recall the step 714 results in a spokes menu 808 being activated, in one embodiment. For example, a spokes menu 808 such as depicted in FIG. 8D or 9 A is activated.
- eye tracking is used.
- the system 100 may assume that the user's attention is focused on the hub 816 of the spokes menu 808 . This is described in step 1202 . This assumption may be made given that the hub 816 may be located at or near the end of the handle 806 . As noted above, the hub 816 , as well as the spokes 812 do not need to be presented in the HMD 2 . Also, it is not necessary that the user is actually looking directly at the location of the hub 816 .
- Step 1204 is tracking the user's eye gaze. This has already been discussed. The eye tracking may be performed continuously throughout process 1200 .
- the system 100 may use a variety of techniques to prevent a false positive.
- the system 100 allows the user to survey the spoke menu items 810 without making an inadvertent selection by requiring that the user provide a signal that they are now using their eyes for selection purposes.
- a condition for performing step 1206 is that a mode has been entered to analyze the eye gaze.
- step 1206 the system determines whether the user's eye gaze has shifted away from the hub 816 . There may be some “safe zone” near the hub 816 such that the system does not consider this to be a significant eye movement unless the eye gaze moves some defined amount. The manner in which the eye movement is measured can vary.
- the system 100 highlights the relevant spoke 812 in step 1208 .
- a portion of the spoke 812 is highlighted. This portion may be the portion that is closest to the hub 816 .
- the amount by which the spoke 812 is progressively filled depends on how far the user has moved (e.g., rotated) their eyes, in one embodiment.
- FIGS. 11A-11C are applicable to progressively filling the spoke based on the eye gaze. As with head movement, filling the spoke 812 could have different sensitivities with respect to the eye movement.
- spokes 812 are filled in what may be referred to as a “growth model”.
- the spoke 812 can continue to progressively fill so long as the user maintains their eye position outside of the “safe zone” and near the spoke 812 .
- the spoke 812 can continue to progressively fill as depicted in the sequence from FIG. 11A-11C .
- the system 100 determines whether the selection has been triggered.
- the selection point is triggered by the eyes moving a certain amount.
- the system 100 determines whether the eyes “hover” near the menu item 810 (or other selection spot) for some defined time period.
- the system 100 determines whether an additional select event occurs while the user's eye gaze is near the menu item 810 (or other selection spot). For example, the user could use a voice command to select, use a hand motion, touch a region of the HMD 2 (e.g., temple), etc.
- the foregoing are all examples of selection criterion that are based, at least in part, on eye gaze.
- the degree to which the user needs to move their eye to make the selection can be an adjustable parameter. This sensitivity could depend on the configuration of the spoke menu 808 .
- the selection point is reached when the spoke 812 has filled to a certain degree. As one example, this can be when the spoke 812 reaches the menu item 810 , as depicted in FIG. 11C .
- This selection technique might be used in connection to the “growth model” technique described above. However, it can also be used in connection with other techniques such as ones that progressively fill the spoke 812 as the user continues to further move their eyes.
- step 1210 determines whether the user is still focused on the same spoke. If so, then control passes to step 1210 to determine whether the selection criterion is met. Note that while the user continues to focus on the same spoke it may gradually fill in more or may gradually “unfill,” depending on the user's focus.
- step 1211 no
- step 1214 determines an action to take based on eye gaze.
- One option is to return to step 1208 to highlight the spoke that the user has shifted their focus to (e.g., eye gaze is oriented towards).
- step 1204 Another option is to return to step 1204 upon determining that the user's eye gaze is oriented at the safe zone near the hub 816 . Still another possibility is to abort in step 1216 if it is determined that the user is no longer looking anywhere near the spoke menu 808 .
- the spokes 812 are filled progressively. However, this is not an absolute requirement. In one embodiment, the spoke 812 fill all at once in response to the user's focus meeting some criterion, such as being focused near the menu item 810 .
- FIG. 13 shows one embodiment in which the menu items 810 are at a diagonal from the hub 816 .
- the spokes 812 need not be presented in the HMD 2 at all times.
- a given spoke 812 can be progressively filled or grown in response to the user's focus.
- the hub 816 might never be displayed.
- FIGS. 14A and 14B depict another embodiment of a spokes menu 808 .
- FIG. 14A shows four menu items 810 in a radial configuration. The hub is in the center of the menu, although it is not depicted in FIG. 14A . The spokes are also not depicted in FIG. 14A . Three of the menu items 810 allow the user to select a movie, the other is for going back one menu.
- FIG. 14B show this embodiment when one spoke 812 that has been filled (at least partially) in response to the system 100 determining that the user's focus has shifted from the hub towards the menu item 810 for Movie B.
- the system 100 determined that the user's head rotated downwards some pre-defined amount.
- the spoke 812 can fill progressively or fill all at once.
- the spoke 812 could also unfill progressively or all at once, depending on the user's focus.
- Other techniques discussed in connection with FIG. 10 or 12 can be used.
- the spoke 812 is pie-shaped. Many other shapes can be used for the spoke 812 .
- FIG. 15 shows an example in which there is a first user selection symbol 804 a and a second user selection symbol 804 b associated with object 802 .
- Second user selection symbol 804 b has a handle 806 extended from it in response to the user focusing their attention at or near it. As discussed above, if the user focuses their attention away from the second user selection symbol 804 b , along the handle 806 , this will activate some menu element associated with the object 802 . The user can also select the first user selection symbol 804 a by focusing their attention on it.
- the system 100 presents both symbols 804 a , 804 b on the object 802 in response to the user looking in the general direction of object 802 .
- the system 100 highlights symbol 804 a in response to the user looking at that symbol 804 a , and highlights symbol 804 b in response to the user looking at that symbol 804 b .
- focus can be determined by head tracking and/or eye tracking.
- the user selection symbol 804 is shaped in a manner that identifies the direction in which the handle 806 will extend.
- FIG. 16 depicts one embodiment in which user selection symbol 804 c points in the direction in which the handle 806 has been extended.
- User selection symbol 804 d also points in a particular direction to indicate the direction in which the handle 806 will extend from it in response to the user focusing on that symbol 804 d.
- the symbol 804 there is feedback on the symbol 804 that indicates where the user's eyes or head vector is with respect to the symbol 804 .
- the symbol 804 looks like a comet having stardust that is magnetically drawn to where the user's eyes are focused. This helps the user know where they are focused when the symbol 804 is very small. It also helps when the system is using an invisible cursor. For example, this can assist the user by providing feedback of whether they need to shift their focus a little right or a little up/down/left in order to activate the user interface.
- spoke menus 808 can be nested. This means that selection of a menu item 810 can spawn another spoke menu 808 .
- FIG. 17 shows one example of nested spoke menus 808 .
- Selection of the control video menu item 810 a spawns the play controls spoke menu 808 b .
- Selection of the adjust slate menu item 810 b spawns the slate controls spoke menu 808 c .
- typically only one of these spoke menus 808 needs to be presented at a time.
- the main controls spokes menu 808 a may disappear.
- the main controls spokes menu 808 a may disappear.
- the main controls spokes menu 808 a may disappear
- volume control spoke menu 808 d shows another way to provide feedback to the user.
- the user focuses on either the decrease volume menu item 810 e or the increase volume menu item 810 d to change the volume.
- the user focuses their attention along the spoke associated with a menu item 810 to provide input.
- the user can control the volume by shifting their focus along the spoke 812 associated with the volume menu item 810 e .
- the volume might be controlled by the position of the user's focus along the spoke. For example, the closer to the volume menu item 810 e , the louder the volume is.
- the speed at which the volume changes might be controlled by the location of the user's focus along the spoke associated with the volume spoke menu item 810 e .
- the volume can increase more rapidly when the user's focus is near the increase volume menu item 810 d .
- the volume bar 1712 provides feedback to the user. Similar control over user input may be provided by spokes associated with other menu items 810 .
- spokes 812 there may be two (or more) types of spokes 812 .
- One type of spoke 812 may be referred to as a “selection spoke.”
- a selection spoke may result in a selection associated with the menu item. For example, the selection could spawn another spokes menu, or be an input (e.g., select play, select fast forward, etc.).
- Another type of spoke 812 may be referred to as an “amplitude” spoke.
- the amplitude spoke may allow the user to change a value, as one example. For example, the user can change a volume level, change the rate of fast forward, etc.
- a resize menu item 810 f in the slate controls 808 c which can spawn the resize spokes menu 808 e , upon its selection.
- step 714 is for input to be provided by the user shifting their focus to the end of the handle 806 .
- One example of this is to enter letters on a keyboard.
- FIG. 18 shows one embodiment that allows user input based on sliding focus along the handle 806 .
- FIG. 18 shows a number of user interface symbols 804 , each of which is associated with a different letter of the English alphabet. These symbols 804 are presented to the user as virtual images in the HMD 2 . There could be some object 802 associated with the symbols 804 , although that object is not depicted in FIG. 18 .
- the system 110 When a user looks at one of the symbols 804 , the system 110 highlights that symbol 804 and grows the handle 806 out of the symbol 806 . The user can then select that letter by shifting their focus from the symbol 804 to a point at or near the end of the handle 806 that is away from the symbol 804 . Receiving the selection of the letter is one example of activating a user interface element in response to determining that the user's focus has shifted away from the user interface symbol 804 along the handle 806 to meet a selection criterion, in accordance with step 714 of FIG. 7 .
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
- Augmented reality (AR) relates to providing an augmented real-world environment where the perception of a real-world environment (or data representing a real-world environment) is augmented or modified with computer-generated virtual data. For example, data representing a real-world environment may be captured in real-time using sensory input devices such as a camera or microphone and augmented with computer-generated virtual data including virtual images and virtual sounds. The virtual data may also include information related to the real-world environment such as a text description associated with a real-world object in the real-world environment. An AR implementation may be used to enhance numerous applications including video game, mapping, navigation, and mobile device applications.
- A near-eye display such as a head mounted display (HMD) may be worn by a user to view the mixed imagery of virtual and real objects. A near-eye display uses a combination of optics and stereopsis to focus virtual imagery in the user's field of view.
- Accessing system menus in an augmented-reality environment in a hands-free way is non-trivial. Voice has the disadvantage of feeling unnatural, creating awkward social situations where people are speaking commands in public, and with current technology does not always correctly interpret the command on the first try. Mouse and glove-type controllers have the disadvantage of requiring extra hardware and cannot be easily used while one's hands are occupied (e.g. carrying things, eating. etc.).
- A system and method are provided that provide a user interface in a see-through, near-eye display, such as an HMD. The user interface is controlled by head orientation, in one embodiment. Eye gaze could also be used.
- According to one embodiment, a method comprising the following is provided. A user interface symbol is presented in a see-through, near-eye display. Focus of a user that is wearing the see-through, near-eye display is determined. A handle is presented in the see-through, near-eye display that extends away from the user interface symbol in response to determining that the user is focused on the user interface symbol. A determination is made that the user's focus has shifted along the handle away from the user interface symbol. A user interface element is activated in response to determining that the user's focus has shifted away from the user interface symbol along the handle to meet a selection criterion.
- One embodiment includes a display system comprising a see-through, near-eye display device and logic in communication with the display device. The logic is configured to present a user interface symbol in the see-through, near-eye display device. The logic is configured to track eye gaze or head orientation of a user wearing the see-through, near-eye display device. The logic is configured to display a handle in the see-through, near-eye display device that extends away from the user interface symbol in response to determining that either the eye gaze or the head orientation indicates that the user is looking at the user interface symbol. The logic is configured to activate a user interface element in response to determining that the eye gaze or head orientation indicates that the user is looking away from the user interface symbol along the handle to meet a selection criterion.
- One embodiment includes a method of providing a user interface in a see-through, near-eye display device. The method comprising the following. A user interface symbol is presented in the see-through, near-eye display device. Head orientation of a user wearing the see-through, near-eye display device is tracked. A handle that extends away from the user interface symbol is presented in the see-through, near-eye display device in response to determining, based on the head orientation, that the user is looking in the general direction of the user interface symbol. A spoke menu is spawned in the see-through, near-eye display device in response to determining that the user's head orientation indicates that the user has shifted from looking at the user interface symbol along the handle. The spoke menu includes menu items around a central hub, with each of the menu items having a spoke between it and the central hub.
- This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the description. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
-
FIG. 1 is a block diagram depicting example components of one embodiment of a see-through, mixed reality display device in a system environment in which the device may operate. -
FIG. 2 illustrates a user wearing head mounted display device in an environment. -
FIG. 3 is a top view of a portion of one embodiment of a head mounted display unit. -
FIG. 4A is a block diagram of one embodiment of the components of a head mounted display unit. -
FIG. 4B is a block diagram of one embodiment of the components of a processing unit associated with a head mounted display unit. -
FIG. 5 is a block diagram of one embodiment of the components of a hub computing system used with head mounted display unit. -
FIG. 6 is a block diagram of one embodiment of a computing system that can be used to implement the hub computing system described herein. -
FIG. 7 is a flow chart depicting a process according to an embodiment of the present technology. -
FIG. 8A shows an example of a user interface symbol presented on an object in accordance with one embodiment. -
FIG. 8B depicts one example in which the symbol is highlighted. -
FIG. 8C depicts on example in which a handle extends away from the symbol. -
FIG. 8D is an example of one embodiment of a spokes menu. -
FIGS. 9A , 9B, and 9C show an example of one embodiment of a spokes menu. -
FIG. 10 is a flowchart of one embodiment of a process providing for a selection of a menu item on a spokes menu. -
FIGS. 11A , 11B and 11C show an example of a spoke being progressively filled. -
FIG. 12 is a flowchart of one embodiment of a process providing for a selection of a menu item on a spokes menu. -
FIG. 13 shows one embodiment in which the menu items are at a diagonal from the hub. -
FIGS. 14A and 14B depict another embodiment of a spokes menu. -
FIG. 15 shows an example in which there is a first user selection symbol and a second user selection symbol associated with object. -
FIG. 16 depicts one embodiment in which user selection symbol points in the direction in which the handle has been extended. -
FIG. 17 shows one example of nested spoke menus. -
FIG. 18 shows one embodiment that allows user input based on focusing on the handle. - Technology is disclosed herein by which a user's experience when using a near-eye, see-through display device is enhanced. Hands-free user input in an augmented reality environment is provided for, in accordance with one embodiment. Access to system menus is provided for, in accordance with one embodiment. In one embodiment, a user can provide input by moving the orientation of their head. For example, the user could rotate their head. In one embodiment, a user can provide input by moving a direction of their eye gaze.
- In one embodiment, a user interface symbol is presented in the near eye display device. The user interface symbol could be presented such that it appears to be on or near a real-world object or on or near a virtual object being rendered in the near eye display device. The user interface symbol may serve as an indicator to the user that the user has a chance to enter input to somehow interact with the object. If the user directs their attention at the user interface symbol, a handle extends away from the user interface symbol, in one embodiment. The system could determine that the user has focused their attention at the user interface symbol by determining the user's head orientation and/or their eye gaze. The user interface symbol may also be referred to as a “gleam,” as it can, in some embodiments, be made to gleam, glow, etc. in response to the user looking at it.
- The handle may serve as a type of selection device such that if the user directs their attention along the handle, away from the user interface symbol, a selection can be made. As one example, the user can cause a menu to appear by rotating their head such that system determines the user is looking along the handle away from the user interface symbol. In one embodiment, this is a spoke menu that has menu items surrounding a hub. The last point at which the user was looking to activate the spoke menu may serve as a hub for the spoke menu. The user can then select the menu items by directing their focus to one of the menu items. In one embodiment, the user moves their head towards a menu item to select it. For example, the user might rotate their head left, right, up, or down to select the menu item. In one embodiment, the user moves their eyes towards a menu item as part of a selection process.
-
FIG. 1 is a block diagram depicting example components of one embodiment of a see-through, mixedreality display device 2 in asystem environment 100 in which the device may operate. The see-through display device is a near-eye, head mounteddisplay device 2 in communication with processing unit 4 viawire 6. In other embodiments, head mounteddisplay device 2 communicates with processing unit 4 via wireless communication. Processing unit 4 may take various embodiments. In some embodiments, processing unit 4 is a separate unit which may be worn on the user's body, e.g., the wrist in the illustrated example or in a pocket, and includes much of the computing power used to operate near-eye display device 2. Processing unit 4 may communicate wirelessly (e.g., WiFi, Bluetooth, infra-red, or other wireless communication means) to one or morehub computing systems 12, hot spots, cellular data networks, etc. In other embodiments, the functionality of the processing unit 4 may be integrated in software and hardware components of thedisplay device 2. - See through head mounted
display device 2, which in one embodiment is in the shape of eyeglasses in aframe 115, is worn on the head of a user so that the user can see through a display, embodied in this example as a displayoptical system 14 for each eye, and thereby have an actual direct view of the space in front of the user. The use of the term “actual direct view” refers to the ability to see real world objects directly with the human eye, rather than seeing created image representations of the objects. For example, looking through glass at a room allows a user to have an actual direct view of the room, while viewing a video of a room on a television is not an actual direct view of the room. Based on the context of executing software, for example, a gaming application, the system can project images of virtual objects, sometimes referred to as virtual images, on the display that are viewable by the person wearing the see-through display device while that person is also viewing real world objects through the display. -
Frame 115 provides a support for holding elements of theHMD 2 in place as well as a conduit for electrical connections. In this embodiment,frame 115 provides a convenient eyeglass frame as support for the elements of the system discussed further below. In other embodiments, other support structures can be used. An example of such a structure is a visor, hat, helmet or goggles. Theframe 115 includes a temple or side arm for resting on each of a user's ears.Temple 102 is representative of an embodiment of the right temple and includescontrol circuitry 136 for thedisplay device 2.Nose bridge 104 of the frame includes amicrophone 110 for recording sounds and transmitting audio data to processing unit 4. -
Hub computing system 12 may be a computer, a gaming system or console, or the like. According to an example embodiment, thehub computing system 12 may include hardware components and/or software components such thathub computing system 12 may be used to execute applications such as gaming applications, non-gaming applications, or the like. An application may be executing onhub computing system 12, thedisplay device 2, as discussed below on a mobile device 5 or a combination of these. - In one embodiment, the
hub computing system 12 further includes one or more capture devices, such as 20A and 20B. The two capture devices can be used to capture the room or other physical environment of the user but are not necessary for use with see through head mountedcapture devices display device 2 in all embodiments. -
20A and 20B may be, for example, cameras that visually monitor one or more users and the surrounding space such that gestures and/or movements performed by the one or more users, as well as the structure of the surrounding space, may be captured, analyzed, and tracked to perform one or more controls or actions within an application and/or animate an avatar or on-screen character.Capture devices -
Hub computing system 12 may be connected to anaudiovisual device 16 such as a television, a monitor, a high-definition television (HDTV), or the like that may provide game or application visuals. In some instances, theaudiovisual device 16 may be a three-dimensional display device. In one example,audiovisual device 16 includes internal speakers. In other embodiments,audiovisual device 16, a separate stereo orhub computing system 12 is connected toexternal speakers 22. - Note that
HMD 2 and processing unit 4 can be used withouthub computing system 12, in which case processing unit 4 will communicate with a WiFi network, a cellular network or other communication means. - In various embodiments, the processes described herein are performed in whole or in part by head mounted
display device 2, processing unit 4,hub computing system 12, or a combination thereof. - In one example implementation, various sensor technologies embedded in head mounted
display device 2 including inertial sensing using inertial measurements from accelerometers and gyroscopes, global positioning system (GPS), eye tracking elements determine a focus of the user. By focus, it is meant where the user is looking or what has the user's attention. The user's focus can be determined by head orientation and/or eye gaze. In some cases, the head orientation and eye gaze are focused at different points. For example, the user might not be looking straight ahead. In such as case, head orientation may be used to define the focus. Alternatively, the eye gaze could be used to determine the user's focus when the two differ. - Other techniques and sensors may be used to determine the user's focus. These could include time of flight, spatial scan, mechanical linkages, phase-difference sensing, and/or direct field sensing. In such cases, additional hardware may be needed in the head mounted display. More details of head mounted
display device 2 and processing unit 4 will be described below with respect toFIGS. 3 , 4A and 4B. - In one embodiment,
hub computing device 12 may be used to track the user and head mounteddisplay device 2 to provide a preliminary determination of location and orientation of head mounteddisplay device 2. Various sensor technologies may be implemented inhub computing device 12 including RGB camera, depth sensor, and/or other technologies to determine location and orientation of head mounteddisplay device 2. More details ofhub computing device 12 will be described below with respect toFIG. 5 . - Additional information such as information retrieved from the cloud, information detected and/or gathered by one or more external devices, and other relevant information may also be used to identify and continuously track the user's head position and rotation. Techniques such as Simultaneous Localization and Mapping (SLAM) using RGB and/or depth sensor data may be employed to provide a real-time position of the user's head relative to the mapped environment. Environmental typography may be identified using data from the cloud and/or depth sensor data. Regions of the user's body can be identified (e.g., hand, arm, torso, legs) using the depth sensor when the user is looking at him or herself.
- It should be recognized that not all sensor information and/or sensing technologies as discussed above are required at all times. One or more sensors may be used as redundancies to further refine the measurement of the total field of view of the user.
-
FIG. 2 illustrates auser 2222 wearing head mounteddisplay device 2 in anenvironment 2220. In this example, the user's head position and orientation are continuously tracked. Optionally, the user's eye gaze may also be tracked. For example, at a given instant of time such as time user's 2222 head position and orientation are identified using the various sensor technologies as described above, anduser 2222 is shown with their focus directed towardselement 2234, which may be a picture, television screen, etc. The user's focus could be defined as a vector that originates at a point between the eyes and “shoots straight out.” - The user's focus could also be defined using eye tracking. In this case, technology in the
HMD 2, or elsewhere, can track the direction in which one or both of the user's eyes are gazing. Typically, eye gaze is considered to be a point at which the user's eyes are momentarily fixed. - The environment includes a
lamp 2224,clock 2232, a portion ofwall 2230, a portion ofwall 2226, a table 2240,processing device 2238, 20A, 20B, andcapture devices display 2234,floor 2231, and aflower 2242. Theenvironment 2220 may be defined relative to a coordinatesystem 2250, and the user's head position and orientation defined relative to a second coordinatesystem 2252. - By identifying and continuously tracking the user's head position and orientation relative to
environment 2220 at various time instances, the direction of focus can be ascertained. Also, eye tracking can be used instead, or in addition to head tracking, to ascertain the direction of focus. -
FIG. 3 depicts a top view of a portion of head mounteddisplay device 2, including a portion of the frame that includestemple 102 andnose bridge 104. Only the right side of head mounteddisplay device 2 is depicted. At the front of head mounteddisplay device 2 is a physicalenvironment facing camera 113 that can capture video and still images. Those images are transmitted to processing unit 4, as described below. - In one embodiment, physical
environment facing camera 113 is a depth camera that may capture a depth image of a scene. The depth image may include a two-dimensional (2-D) pixel area of the captured scene where each pixel in the 2-D pixel area may represent a depth value such as a distance in, for example, centimeters, millimeters, or the like of an object in the captured scene from the camera. For example, the depth camera may include an IR illuminator transmitter and a hot reflecting surface like a hot mirror in front of the visible image sensor which lets the visible light pass and directs reflected IR radiation within a wavelength range transmitted by the illuminator to a CCD or other type of depth sensor. The data from the sensors may be sent to the processing unit 4 orhub computing system 12 for processing. The processing identifies and maps the user's real world field of view. Additionally, physicalenvironment facing camera 113 may also include a light meter for measuring ambient light. - A portion of the frame of head mounted
display device 2 will surround a display (that includes one or more lenses). In order to show the components of head mounteddisplay device 2, a portion of the frame surrounding the display is not depicted. The display includes a light guideoptical element 112,opacity filter 114, see-throughlens 116 and see-throughlens 118. In one embodiment,opacity filter 114 is behind and aligned with see-throughlens 116, light-guideoptical element 112 is behind and aligned withopacity filter 114, and see-throughlens 118 is behind and aligned with light-guideoptical element 112. See-through 116 and 118 are standard lenses used in eye glasses and can be made to any prescription (including no prescription). In one embodiment, see-throughlenses 116 and 118 can be replaced by a variable prescription lens. In some embodiments, head mountedlenses display device 2 will include only one see-through lens or no see-through lenses. In another alternative, a prescription lens can go inside light guideoptical element 112.Opacity filter 114 filters out natural light (either on a per pixel basis or uniformly) to enhance the contrast of the virtual imagery. Light guideoptical element 112 channels artificial light to the eye. More details of theopacity filter 114 and light guideoptical element 112 is provided below. - Mounted to or inside
temple 102 is an image source, which (in one embodiment) includesmicro display assembly 120 for projecting a virtual image andlens 122 for directing images frommicro display 120 into light guideoptical element 112. In one embodiment,lens 122 is a collimating lens. -
Control circuits 136 provide various electronics that support the other components of head mounteddisplay device 2. More details ofcontrol circuits 136 are provided below with respect toFIGS. 4A and 4B . Inside, or mounted totemple 102, areear phones 130, inertial and/ormagnetic sensors 132 andtemperature sensor 138. In one embodiment inertial andmagnetic sensors 132 include a threeaxis magnetometer 132A, three axis gyro 132B and threeaxis accelerometer 132C (SeeFIG. 4A ). The inertial and/or magnetic sensors are for sensing position, orientation, and sudden accelerations of head mounteddisplay device 2. -
Micro display 120 projects an image throughlens 122. There are different image generation technologies that can be used to implementmicro display 120. For example,micro display 120 can be implemented in using a transmissive projection technology where the light source is modulated by optically active material, backlit with white light. These technologies are usually implemented using LCD type displays with powerful backlights and high optical energy densities.Micro display 120 can also be implemented using a reflective technology for which external light is reflected and modulated by an optically active material. The illumination is forward lit by either a white source or RGB source, depending on the technology. Digital light processing (DLP), liquid crystal on silicon (LCOS) and Mirasol® display technology from Qualcomm, Inc. are all examples of reflective technologies which are efficient as most energy is reflected away from the modulated structure and may be used in the system described herein. Additionally,micro display 120 can be implemented using an emissive technology where light is generated by the display. For example, a PicoP™ display engine from Microvision, Inc. emits a laser signal with a micro mirror steering either onto a tiny screen that acts as a transmissive element or beamed directly into the eye (e.g., laser). - Light guide
optical element 112 transmits light frommicro display 120 to theeye 140 of the user wearing head mounteddisplay device 2. Light guideoptical element 112 also allows light from in front of the head mounteddisplay device 2 to be transmitted through light guideoptical element 112 toeye 140, as depicted byarrow 142, thereby allowing the user to have an actual direct view of the space in front of head mounteddisplay device 2 in addition to receiving a virtual image frommicro display 120. Thus, the walls of light guideoptical element 112 are see-through. Light guideoptical element 112 includes a first reflecting surface 124 (e.g., a mirror or other surface). Light frommicro display 120 passes throughlens 122 and becomes incident on reflectingsurface 124. The reflectingsurface 124 reflects the incident light from themicro display 120 such that light is trapped inside a planar, substrate comprising light guideoptical element 112 by internal reflection. After several reflections off the surfaces of the substrate, the trapped light waves reach an array of selectively reflecting surfaces 126. Note that only one of the five surfaces is labeled 126 to prevent over-crowding of the drawing. Reflectingsurfaces 126 couple the light waves incident upon those reflecting surfaces out of the substrate into theeye 140 of the user. As different light rays will travel and bounce off the inside of the substrate at different angles, the different rays will hit the various reflectingsurface 126 at different angles. Therefore, different light rays will be reflected out of the substrate by different ones of the reflecting surfaces. The selection of which light rays will be reflected out of the substrate by which surface 126 is engineered by selecting an appropriate angle of thesurfaces 126. More details of a light guide optical element can be found in United States Patent Application Publication 2008/0285140, Ser. No. 12/214,366, published on Nov. 20, 2008, “Substrate-Guided Optical Devices” incorporated herein by reference in its entirety. In one embodiment, each eye will have its own light guideoptical element 112. When the head mounted display device has two light guide optical elements, each eye can have its ownmicro display 120 that can display the same image in both eyes or different images in the two eyes. In another embodiment, there can be one light guide optical element which reflects light into both eyes. -
Opacity filter 114, which is aligned with light guideoptical element 112, selectively blocks natural light, either uniformly or on a per-pixel basis, from passing through light guideoptical element 112. In one embodiment, the opacity filter can be a see-through LCD panel, electro chromic film, PDLC (Polymer dispersed Liquid Crystal) or similar device which is capable of serving as an opacity filter. Such a see-through LCD panel can be obtained by removing various layers of substrate, backlight and diffusers from a conventional LCD. The LCD panel can include one or more light-transmissive LCD chips which allow light to pass through the liquid crystal. Such chips are used in LCD projectors, for instance. -
Opacity filter 114 can include a dense grid of pixels, where the light transmissivity of each pixel is individually controllable between minimum and maximum transmissivities. While a transmissivity range of 0-100% is ideal, more limited ranges are also acceptable. As an example, a monochrome LCD panel with no more than two polarizing filters is sufficient to provide an opacity range of about 50% to 99% per pixel, up to the resolution of the LCD. At the minimum of 50%, the lens will have a slightly tinted appearance, which is tolerable. 100% transmissivity represents a perfectly clear lens. An “alpha” scale can be defined from 0-100%, where 0% allows no light to pass and 100% allows all light to pass. The value of alpha can be set for each pixel by the opacityfilter control circuit 224 described below. - A mask of alpha values can be used from a rendering pipeline, after z-buffering with proxies for real-world objects. When the system renders a scene for the augmented reality display, it takes note of which real-world objects are in front of which virtual objects. If a virtual object is in front of a real-world object, then the opacity should be on for the coverage area of the virtual object. If the virtual is (virtually) behind a real-world object, then the opacity should be off, as well as any color for that pixel, so the user will only see the real-world object for that corresponding area (a pixel or more in size) of real light. Coverage would be on a pixel-by-pixel basis, so the system could handle the case of part of a virtual object being in front of a real-world object, part of the virtual object being behind the real-world object, and part of the virtual object being coincident with the real-world object. Displays capable of going from 0% to 100% opacity at low cost, power, and weight are the most desirable for this use. Moreover, the opacity filter can be rendered in color, such as with a color LCD or with other displays such as organic LEDs, to provide a wide field of view. More details of an opacity filter are provided in U.S. patent application Ser. No. 12/887,426, “Opacity Filter For See-Through Mounted Display,” filed on Sep. 21, 2010, incorporated herein by reference in its entirety.
- Head mounted
display device 2 may also include a system for tracking the position of the user's eyes. For example, head mounteddisplay device 2 includeseye tracking assembly 134 which will include an eye trackingillumination device 134A (seeFIG. 4A ) andeye tracking camera 134B (seeFIG. 4A ). In one embodiment, eye trackingillumination source 134A includes one or more infrared (IR) emitters, which emit IR light toward the eye.Eye tracking camera 134B includes one or more cameras that sense the reflected IR light. The position of the pupil can be identified by known imaging techniques which detects the reflection of the cornea. For example, see U.S. Pat. No. 7,401,920, entitled “Head Mounted Eye Tracking and Display System”, issued Jul. 22, 2008 to Kranz et al., incorporated herein by reference. Such a technique can locate a position of the center of the eye relative to the tracking camera. Generally, eye tracking involves obtaining an image of the eye and using computer vision techniques to determine the location of the pupil within the eye socket. In one embodiment, it is sufficient to track the location of one eye since the eye usually moves in unison. However, it is possible to track each eye separately. - In one embodiment, the eye tracking
illumination device 134A will use four IR LEDs and theeye tracking camera 134B users four IR photo detectors (not illustrated) in rectangular arrangement so that there is one IR LED and IR photo detector at each corner of the lens of head mounteddisplay device 2. Light from the LEDs reflect off the eyes. The amount of infrared light detected at each of the four IR photo detectors determines the pupil direction. That is, the amount of white versus black in the eye will determine the amount of light reflected off the eye for that particular photo detector. Thus, the photo detector will have a measure of the amount of white or black in the eye. From the four samples, the system can determine the direction of the eye. - Another alternative is to use four infrared LEDs as discussed above, but only one infrared CCD on the side of the lens of head mounted
display device 2. The CCD will use a small mirror and/or lens (fish eye) such that the CCD can image up to 75% of the visible eye from the glasses frame. The CCD will then sense an image and use computer vision to find the eye position, much like as discussed above. More or less than four IR transmitters and/or four IR sensors can also be used. - Another embodiment for tracking the direction of the eyes is based on charge tracking. This concept is based on the observation that a retina carries a measurable positive charge and the cornea has a negative charge. Sensors are mounted by the user's ears (near earphones 130) to detect the electrical potential while the eyes move around and effectively read out what the eyes are doing in real time. Other embodiments for tracking eyes can also be used. Suitable tracking technologies is disclosed in U.S. Patent Application Publication 2012/0154277, “Optimized Focal Area for Augmented Reality Displays,” filed on Dec. 17, 2010, which is incorporated herein by reference in their entirety.
-
FIG. 3 only shows half of head mounteddisplay device 2. A full head mounteddisplay device 2 would include (where applicable) another set lenses, another opacity filter another light guide optical element, another micro display, another lens, room facing camera, eye tracking assembly, micro display, earphones, and temperature sensor. -
FIG. 4A is a block diagram depicting the various components of head mounteddisplay device 2. It will be understood with reference toFIG. 3 that some components illustrated inFIG. 4A may not be present in the embodiment shown inFIG. 3 .FIG. 4B is a block diagram describing the various components of processing unit 4 ofFIG. 1 . Head mounteddisplay device 2, the components of which are depicted inFIG. 4A , is used to display an optimized image to the user (user 2222). Additionally, the head mounted display device components ofFIG. 4A may include many sensors that track various conditions including the head position and rotation ofuser 2222. Head mounteddisplay device 2 will receive instructions about the virtual image from processing unit 4 and will provide the sensor information back to processing unit 4. Processing unit 4, the components of which are depicted inFIG. 4B will receive the sensory information from head mounteddisplay device 2 and also from hub computing device 12 (SeeFIG. 1 ). Based on that information, processing unit 4 will determine where and when to provide a virtual image to the user and send instructions accordingly to the head mounted display device ofFIG. 4A . - Note that some of the components of
FIG. 4A (e.g., rear facingcamera 113,eye tracking camera 134B, 120 or 153,micro display opacity filter 114,eye tracking illumination 134A, and earphones 130) are shown in shadow to indicate that there may be two of each of those devices, one for the left side and one for the right side of head mounteddisplay device 2.FIG. 4A shows thecontrol circuit 200 in communication with thepower management circuit 202.Control circuit 200 includesprocessor 210,memory controller 212 in communication with memory 214 (e.g., D-RAM),camera interface 216,camera buffer 218,display driver 220,display formatter 2222,timing generator 226, display outinterface 228, and display ininterface 230. In one embodiment, all of components ofcontrol circuit 200 are in communication with each other via dedicated lines or one or more buses. In another embodiment, each of the components ofcontrol circuit 200 is in communication withprocessor 210.Camera interface 216 provides an interface to the tworoom facing cameras 113 and stores images received from the room facing cameras incamera buffer 218.Display driver 220 will drive 120 or 153.micro display Display formatter 2222 provides information, about the virtual image being displayed on 120 or 153, tomicro display opacity control circuit 224, which controlsopacity filter 114.Timing generator 226 is used to provide timing data for the system. Display outinterface 228 is a buffer for providing images fromroom facing cameras 113 to processing unit 4. Display in 230 is a buffer for receiving images such as a virtual image to be displayed onmicro display 120. Display out 228 and display in 230 communicate withband interface 232 which is an interface to processing unit 4.Display driver 220 may also drive mirror controller 162 to position the mirror 166 to display a focused image in accordance with the above embodiment ofFIGS. 3C and 3D . -
Power management circuit 202 includesvoltage regulator 234, eye trackingillumination driver 236, audio DAC andamplifier 238, microphonepreamplifier audio ADC 240,temperature sensor interface 242 andclock generator 244.Voltage regulator 234 receives power from processing unit 4 viaband interface 232 and provides that power to the other components of head mounteddisplay device 2. Eyetracking illumination driver 236 provides the IR light source foreye tracking illumination 134A, as described above. Audio DAC andamplifier 238 receive the audio information fromearphones 130. Microphone preamplifier andaudio ADC 240 provide an interface formicrophone 110.Temperature sensor interface 242 is an interface fortemperature sensor 138.Power management unit 202 also provides power and receives data back from threeaxis magnetometer 132A, three axis gyro 132B and threeaxis accelerometer 132C. -
FIG. 4B is a block diagram describing the various components of processing unit 4.FIG. 4B showscontrols circuit 304 in communication withpower management circuit 306.Control circuit 304 includes a central processing unit (CPU) 320, graphics processing unit (GPU) 322,cache 324,RAM 326,memory control 328 in communication with memory 330 (e.g., D-RAM),flash memory controller 332 in communication with flash memory 334 (or other type of non-volatile storage), display outbuffer 336 in communication with head mounteddisplay device 2 viaband interface 302 andband interface 232, display inbuffer 338 in communication with head mounteddisplay device 2 viaband interface 302 andband interface 232,microphone interface 340 in communication with anexternal microphone connector 342 for connecting to a microphone, PCI express interface for connecting to awireless communication device 346, and USB port(s) 348. In one embodiment,wireless communication device 346 can include a Wi-Fi enabled communication device, BlueTooth communication device, infrared communication device, etc. The USB port can be used to dock the processing unit 4 tohub computing device 12 in order to load data or software onto processing unit 4, as well as charge processing unit 4. In one embodiment,CPU 320 andGPU 322 are the main workhorses for determining where, when and how to insert virtual images into the view of the user. More details are provided below. -
Power management circuit 306 includesclock generator 360, analog todigital converter 362,battery charger 364,voltage regulator 366, head mounteddisplay power source 376, andtemperature sensor interface 372 in communication with temperature sensor 374 (located on the wrist band of processing unit 4). Analog todigital converter 362 is connected to a chargingjack 370 for receiving an AC supply and creating a DC supply for the system.Voltage regulator 366 is in communication withbattery 368 for supplying power to the system.Battery charger 364 is used to charge battery 368 (via voltage regulator 366) upon receiving power from chargingjack 370.HMD power source 376 provides power to the head mounteddisplay device 2. - The above-described system will be configured to insert a virtual image into the field of view of a user so that the virtual image replaces the view of a real world object. Alternatively, the virtual image can be inserted without replacing the image of a real world object. In various embodiments, the virtual image will be adjusted to match the appropriate orientation, size and shape based on the object being replaced or the environment for which the image is being inserted into. In addition, the virtual image can be adjusted to include reflectivity and shadows. In one embodiment, head mounted
display device 2, processing unit 4 andhub computing device 12 work together as each of the devices includes a subset of sensors that are used to obtain the data for determining where, when and how to insert the virtual images. In one embodiment, the calculations that determine where, how and when to insert a virtual image are performed byhub computing device 12. In another embodiment, those calculations are performed by processing unit 4. In another embodiment some of the calculations are performed byhub computing device 12 while other calculations are performed by processing unit 4. In other embodiments, the calculations can be performed by head mounteddisplay device 2. - In one example embodiment,
hub computing device 12 will create a model of the environment that the user is in and track various moving objects in that environment. In addition,hub computing device 12 tracks the position and orientation of head mounteddisplay device 2. The model and the tracking information are provided fromhub computing device 12 to processing unit 4. Sensor information may also be obtained from head mounteddisplay device 2. Processing unit 4 then uses the additional sensor information it receives from head mounteddisplay device 2 to refine the field of view of the user and provide instructions to head mounteddisplay device 2 on how, where and when to insert the virtual image. -
FIG. 5 illustrates an example embodiment ofhub computing system 12 with a capture device. In one embodiment, 20A and 20B are the same structure, therefore,capture devices FIG. 5 only showscapture device 20A. According to an example embodiment,capture device 20A may be configured to capture video with depth information including a depth image that may include depth values via any suitable technique including, for example, time-of-flight, structured light, stereo image, or the like. According to one embodiment, thecapture device 20A may organize the depth information into “Z layers,” or layers that may be perpendicular to a Z axis extending from the depth camera along its line of sight. - As shown in
FIG. 5 ,capture device 20A may include acamera component 423. According to an example embodiment,camera component 423 may be or may include a depth camera that may capture a depth image of a scene. The depth image may include a two-dimensional (2-D) pixel area of the captured scene where each pixel in the 2-D pixel area may represent a depth value such as a distance in, for example, centimeters, millimeters, or the like of an object in the captured scene from the camera. - Camera component 23 may include an infra-red (IR)
light component 425, a three-dimensional (3-D)camera 426, and an RGB (visual image)camera 428 that may be used to capture the depth image of a scene. For example, in time-of-flight analysis, theIR light component 425 of thecapture device 20A may emit an infrared light onto the scene and may then use sensors (in some embodiments, including sensors not shown) to detect the backscattered light from the surface of one or more targets and objects in the scene using, for example, the 3-D camera 426 and/or theRGB camera 428. In some embodiments, pulsed infrared light may be used such that the time between an outgoing light pulse and a corresponding incoming light pulse may be measured and used to determine a physical distance from thecapture device 20A to a particular location on the targets or objects in the scene. Additionally, in other example embodiments, the phase of the outgoing light wave may be compared to the phase of the incoming light wave to determine a phase shift. The phase shift may then be used to determine a physical distance from the capture device to a particular location on the targets or objects. - According to another example embodiment, time-of-flight analysis may be used to indirectly determine a physical distance from the
capture device 20A to a particular location on the targets or objects by analyzing the intensity of the reflected beam of light over time via various techniques including, for example, shuttered light pulse imaging. - In another example embodiment,
capture device 20A may use a structured light to capture depth information. In such an analysis, patterned light (i.e., light displayed as a known pattern such as grid pattern, a stripe pattern, or different pattern) may be projected onto the scene via, for example, the IR light component 424. Upon striking the surface of one or more targets or objects in the scene, the pattern may become deformed in response. Such a deformation of the pattern may be captured by, for example, the 3-D camera 426 and/or the RGB camera 428 (and/or other sensor) and may then be analyzed to determine a physical distance from the capture device to a particular location on the targets or objects. In some implementations, theIR Light component 425 is displaced from the 425 and 426 so triangulation can be used to determined distance fromcameras 425 and 426. In some implementations, thecameras capture device 20A will include a dedicated IR sensor to sense the IR light, or a sensor with an IR filter. - According to another embodiment, the
capture device 20A may include two or more physically separated cameras that may view a scene from different angles to obtain visual stereo data that may be resolved to generate depth information. Other types of depth image sensors can also be used to create a depth image. - The
capture device 20A may further include amicrophone 430, which includes a transducer or sensor that may receive and convert sound into an electrical signal.Microphone 430 may be used to receive audio signals that may also be provided byhub computing system 12. - In an example embodiment, the
capture device 20A may further include aprocessor 432 that may be in communication with theimage camera component 423.Processor 432 may include a standardized processor, a specialized processor, a microprocessor, or the like that may execute instructions including, for example, instructions for receiving a depth image, generating the appropriate data format (e.g., frame) and transmitting the data tohub computing system 12. -
Capture device 20A may further include amemory 434 that may store the instructions that are executed byprocessor 432, images or frames of images captured by the 3-D camera and/or RGB camera, or any other suitable information, images, or the like. According to an example embodiment,memory 434 may include random access memory (RAM), read only memory (ROM), cache, flash memory, a hard disk, or any other suitable storage component. As shown inFIG. 5 , in one embodiment,memory 434 may be a separate component in communication with theimage capture component 423 andprocessor 432. According to another embodiment, thememory 434 may be integrated intoprocessor 432 and/or the image capture component 422. -
20A and 20B are in communication withCapture devices hub computing system 12 via acommunication link 436. Thecommunication link 436 may be a wired connection including, for example, a USB connection, a Firewire connection, an Ethernet cable connection, or the like and/or a wireless connection such as a wireless 802.11b, g, a, or n connection. According to one embodiment,hub computing system 12 may provide a clock to capturedevice 20A that may be used to determine when to capture, for example, a scene via thecommunication link 436. Additionally, thecapture device 20A provides the depth information and visual (e.g., RGB) images captured by, for example, the 3-D camera 426 and/or theRGB camera 428 tohub computing system 12 via thecommunication link 436. In one embodiment, the depth images and visual images are transmitted at 30 frames per second; however, other frame rates can be used.Hub computing system 12 may then create and use a model, depth information, and captured images to, for example, control an application such as a game or word processor and/or animate an avatar or on-screen character. -
Hub computing system 12 includes depth image processing andskeletal tracking module 450, which uses the depth images to track one or more persons detectable by the depth camera function ofcapture device 20A. Depth image processing andskeletal tracking module 450 provides the tracking information to application 453, which can be a video game, productivity application, communications application or other software application etc. The audio data and visual image data is also provided toapplication 452 and depth image processing andskeletal tracking module 450.Application 452 provides the tracking information, audio data and visual image data torecognizer engine 454. In another embodiment,recognizer engine 454 receives the tracking information directly from depth image processing andskeletal tracking module 450 and receives the audio data and visual image data directly from 20A and 20B.capture devices -
Recognizer engine 454 is associated with a collection of 460, 462, 464, . . . , 466 each comprising information concerning a gesture, action or condition that may be performed by any person or object detectable byfilters 20A or 20B. For example, the data fromcapture device capture device 20A may be processed by 460, 462, 464, . . . , 466 to identify when a user or group of users has performed one or more gestures or other actions. Those gestures may be associated with various controls, objects or conditions offilters application 452. Thus,hub computing system 12 may use therecognizer engine 454, with the filters, to interpret and track movement of objects (including people). -
20A and 20B provide RGB images (or visual images in other formats or color spaces) and depth images toCapture devices hub computing system 12. The depth image may be a plurality of observed pixels where each observed pixel has an observed depth value. For example, the depth image may include a two-dimensional (2-D) pixel area of the captured scene where each pixel in the 2-D pixel area may have a depth value such as distance of an object in the captured scene from the capture device.Hub computing system 12 will use the RGB images and depth images to track a user's or object's movements. For example, the system will track a skeleton of a person using the depth images. There are many methods that can be used to track the skeleton of a person using depth images. One suitable example of tracking a skeleton using depth image is provided in U.S. Pat. No. 8,295,546, “Pose Tracking Pipeline” filed on Oct. 21, 2009, Craig, et al. (hereinafter referred to as the '546 Application), incorporated herein by reference in its entirety. The process of the '546 Application includes acquiring a depth image, down sampling the data, removing and/or smoothing high variance noisy data, identifying and removing the background, and assigning each of the foreground pixels to different parts of the body. Based on those steps, the system will fit a model to the data and create a skeleton. The skeleton will include a set of joints and connections between the joints. Other methods for tracking can also be used. Suitable tracking technologies are also disclosed in the following four U.S. patent applications, all of which are incorporated herein by reference in their entirety: U.S. patent application Ser. No. 12/475,308, “Device for Identifying and Tracking Multiple Humans Over Time,” filed on May 29, 2009; U.S. patent application Ser. No. 12/696,282, “Visual Based Identity Tracking,” filed on Jan. 29, 2010; U.S. patent application Ser. No. 12/641,788, “Motion Detection Using Depth Images,” filed on Dec. 18, 2009; and U.S. patent application Ser. No. 12/575,388, “Human Tracking System,” filed on Oct. 7, 2009. -
Recognizer engine 454 includes 460, 462, 464, . . . , 466 to determine a gesture or action. A filter comprises information defining a gesture, action or condition along with parameters, or metadata, for that gesture, action or condition. For instance, a throw, which comprises motion of one of the hands from behind the rear of the body to past the front of the body, may be implemented as a gesture comprising information representing the movement of one of the hands of the user from behind the rear of the body to past the front of the body, as that movement would be captured by the depth camera. Parameters may then be set for that gesture. Where the gesture is a throw, a parameter may be a threshold velocity that the hand has to reach, a distance the hand travels (either absolute, or relative to the size of the user as a whole), and a confidence rating by the recognizer engine that the gesture occurred. These parameters for the gesture may vary between applications, between contexts of a single application, or within one context of one application over time.multiple filters - Filters may be modular or interchangeable. In one embodiment, a filter has a number of inputs (each of those inputs having a type) and a number of outputs (each of those outputs having a type). A first filter may be replaced with a second filter that has the same number and types of inputs and outputs as the first filter without altering any other aspect of the recognizer engine architecture. For instance, there may be a first filter for driving that takes as input skeletal data and outputs a confidence that the gesture associated with the filter is occurring and an angle of steering. Where one wishes to substitute this first driving filter with a second driving filter—perhaps because the second driving filter is more efficient and requires fewer processing resources—one may do so by simply replacing the first filter with the second filter so long as the second filter has those same inputs and outputs—one input of skeletal data type, and two outputs of confidence type and angle type.
- A filter need not have a parameter. For instance, a “user height” filter that returns the user's height may not allow for any parameters that may be tuned. An alternate “user height” filter may have tunable parameters—such as to whether to account for a user's footwear, hairstyle, headwear and posture in determining the user's height.
- Inputs to a filter may comprise things such as joint data about a user's joint position, angles formed by the bones that meet at the joint, RGB color data from the scene, and the rate of change of an aspect of the user. Outputs from a filter may comprise things such as the confidence that a given gesture is being made, the speed at which a gesture motion is made, and a time at which a gesture motion is made.
-
Recognizer engine 454 may have a base recognizer engine that provides functionality to the filters. In one embodiment, the functionality that recognizerengine 454 implements includes an input-over-time archive that tracks recognized gestures and other input, a Hidden Markov Model implementation (where the modeled system is assumed to be a Markov process—one where a present state encapsulates any past state information used to determine a future state, so no other past state information must be maintained for this purpose—with unknown parameters, and hidden parameters are determined from the observable data), as well as other functionality used to solve particular instances of gesture recognition. -
460, 462, 464, . . . , 466 are loaded and implemented on top of theFilters recognizer engine 454 and can utilize services provided byrecognizer engine 454 to all 460, 462, 464, . . . , 466. In one embodiment,filters recognizer engine 454 receives data to determine whether it meets the requirements of any 460, 462, 464, . . . , 466. Since these provided services, such as parsing the input, are provided once byfilter recognizer engine 454 rather than by each 460, 462, 464, . . . , 466, such a service need only be processed once in a period of time as opposed to once per filter for that period, so the processing used to determine gestures is reduced.filter -
Application 452 may use the 460, 462, 464, . . . , 466 provided with thefilters recognizer engine 454, or it may provide its own filter, which plugs in torecognizer engine 454. In one embodiment, all filters have a common interface to enable this plug-in characteristic. Further, all filters may utilize parameters, so a single gesture tool below may be used to debug and tune the entire filter system. - More information about
recognizer engine 454 can be found in U.S. patent application Ser. No. 12/422,661, “Gesture Recognizer System Architecture,” filed on Apr. 13, 2009, incorporated herein by reference in its entirety. More information about recognizing gestures can be found in U.S. patent application Ser. No. 12/391,150, “Standard Gestures,” filed on Feb. 23, 2009; and U.S. patent application Ser. No. 12/474,655, “Gesture Tool” filed on May 29, 2009, both of which are incorporated herein by reference in their entirety. - In one embodiment,
hub computing system 12 includes a user profile database 470 that includes user-specific information related to one or more users interacting withhub computing system 12. In one example, the user-specific information includes information related to a user such as the user's expressed preferences, the user's friends' list, the user's preferred activities, a list of the user's reminders, the user's social groups, the user's current location, the user's past intents to interact with objects in the user's environment and other user created content, such as the user's photos, images and recorded videos. In one embodiment, the user-specific information may be obtained from one or more data sources such as the user's social networking sites, address book, email data, Instant Messaging data, user profiles or other sources on the Internet. In one approach, and as will be discussed in detail below, the user-specific information is utilized to automatically determine the user's intent to interact with one or more objects in the user's environment. -
FIG. 6 illustrates an example embodiment of a computing system that may be used to implementhub computing system 12. As shown inFIG. 6 , themultimedia console 500 has a central processing unit (CPU) 501 having alevel 1cache 502, alevel 2cache 504, and a flash ROM (Read Only Memory) 506. Thelevel 1cache 502 and alevel 2cache 504 temporarily store data and hence reduce the number of memory access cycles, thereby improving processing speed and throughput.CPU 501 may be provided having more than one core, and thus,additional level 1 andlevel 2 502 and 504. Thecaches flash ROM 506 may store executable code that is loaded during an initial phase of a boot process when themultimedia console 500 is powered on. - A graphics processing unit (GPU) 508 and a video encoder/video codec (coder/decoder) 514 form a video processing pipeline for high speed and high resolution graphics processing. Data is carried from the
graphics processing unit 508 to the video encoder/video codec 514 via a bus. The video processing pipeline outputs data to an A/V (audio/video)port 540 for transmission to a television or other display. Amemory controller 510 is connected to theGPU 508 to facilitate processor access to various types ofmemory 512, such as, but not limited to, a RAM (Random Access Memory). - The
multimedia console 500 includes an I/O controller 520, asystem management controller 522, anaudio processing unit 523, anetwork interface 524, a first USB host controller 526, asecond USB controller 528 and a front panel I/O subassembly 530 that are preferably implemented on amodule 518. TheUSB controllers 526 and 528 serve as hosts for peripheral controllers 542(1)-542(2), awireless adapter 548, and an external memory device 546 (e.g., flash memory, external CD/DVD ROM drive, removable media, etc.). Thenetwork interface 524 and/orwireless adapter 548 provide access to a network (e.g., the Internet, home network, etc.) and may be any of a wide variety of various wired or wireless adapter components including an Ethernet card, a modem, a Bluetooth module, a cable modem, and the like. -
System memory 543 is provided to store application data that is loaded during the boot process. A media drive 544 is provided and may comprise a DVD/CD drive, Blu-Ray drive, hard disk drive, or other removable media drive, etc. The media drive 144 may be internal or external to themultimedia console 500. Application data may be accessed via the media drive 544 for execution, playback, etc. by themultimedia console 500. The media drive 544 is connected to the I/O controller 520 via a bus, such as a Serial ATA bus or other high speed connection (e.g., IEEE 1394). - The
system management controller 522 provides a variety of service functions related to assuring availability of themultimedia console 500. Theaudio processing unit 523 and anaudio codec 532 form a corresponding audio processing pipeline with high fidelity and stereo processing. Audio data is carried between theaudio processing unit 523 and theaudio codec 532 via a communication link. The audio processing pipeline outputs data to the A/V port 540 for reproduction by an external audio user or device having audio capabilities. - The front panel I/
O subassembly 530 supports the functionality of thepower button 550 and theeject button 552, as well as any LEDs (light emitting diodes) or other indicators exposed on the outer surface of themultimedia console 500. A systempower supply module 536 provides power to the components of themultimedia console 500. Afan 538 cools the circuitry within themultimedia console 500. - The
CPU 501,GPU 508,memory controller 510, and various other components within themultimedia console 500 are interconnected via one or more buses, including serial and parallel buses, a memory bus, a peripheral bus, and a processor or local bus using any of a variety of bus architectures. By way of example, such architectures can include a Peripheral Component Interconnects (PCI) bus, PCI-Express bus, etc. - When the
multimedia console 500 is powered on, application data may be loaded from thesystem memory 543 intomemory 512 and/or 502, 504 and executed on thecaches CPU 501. The application may present a graphical user interface that provides a consistent user experience when navigating to different media types available on themultimedia console 500. In operation, applications and/or other media contained within the media drive 544 may be launched or played from the media drive 544 to provide additional functionalities to themultimedia console 500. - The
multimedia console 500 may be operated as a standalone system by simply connecting the system to a television or other display. In this standalone mode, themultimedia console 500 allows one or more users to interact with the system, watch movies, or listen to music. However, with the integration of broadband connectivity made available through thenetwork interface 524 or thewireless adapter 548, themultimedia console 500 may further be operated as a participant in a larger network community. Additionally,multimedia console 500 can communicate with processing unit 4 viawireless adaptor 548. - When the
multimedia console 500 is powered ON, a set amount of hardware resources are reserved for system use by the multimedia console operating system. These resources may include a reservation of memory, CPU and GPU cycle, networking bandwidth, etc. Because these resources are reserved at system boot time, the reserved resources do not exist from the application's view. In particular, the memory reservation preferably is large enough to contain the launch kernel, concurrent system applications and drivers. The CPU reservation is preferably constant such that if the reserved CPU usage is not used by the system applications, an idle thread will consume any unused cycles. - With regard to the GPU reservation, lightweight messages generated by the system applications (e.g., pop ups) are displayed by using a GPU interrupt to schedule code to render popup into an overlay. The amount of memory required for an overlay depends on the overlay area size and the overlay preferably scales with screen resolution. Where a full user interface is used by the concurrent system application, it is preferable to use a resolution independent of application resolution. A scaler may be used to set this resolution such that the need to change frequency and cause a TV resync is eliminated.
- After
multimedia console 500 boots and system resources are reserved, concurrent system applications execute to provide system functionalities. The system functionalities are encapsulated in a set of system applications that execute within the reserved system resources described above. The operating system kernel identifies threads that are system application threads versus gaming application threads. The system applications are preferably scheduled to run on theCPU 501 at predetermined times and intervals in order to provide a consistent system resource view to the application. The scheduling is to minimize cache disruption for the gaming application running on the console. - When a concurrent system application requires audio, audio processing is scheduled asynchronously to the gaming application due to time sensitivity. A multimedia console application manager (described below) controls the gaming application audio level (e.g., mute, attenuate) when system applications are active.
- Optional input devices (e.g., controllers 542(1) and 542(2)) are shared by gaming applications and system applications. The input devices are not reserved resources, but are to be switched between system applications and the gaming application such that each will have a focus of the device. The application manager preferably controls the switching of input stream, without knowing the gaming application's knowledge and a driver maintains state information regarding focus switches.
20A and 20B may define additional input devices for theCapture devices console 500 via USB controller 526 or other interface. In other embodiments,hub computing system 12 can be implemented using other hardware architectures. The implementation ofhub computing system 12 does not require any particular hardware architecture. -
FIG. 7 is a simplified flow chart depicting a process according to an embodiment of the present technology. Other steps could be added and many variations are possible. Not all steps are required. In one embodiment, the processing depicted inFIG. 7 may be performed by one or more components of head mounteddevice 2 as depicted inFIG. 1 . Various steps of the process could be performed by software, hardware, or some combination of hardware and software. Herein, the term “logic” refers to software, hardware, or some combination of hardware and software. Thus, theHMD 2 may have logic that is configured to perform operations described inFIG. 7 . The logic that performs some or all of the operations could be off theHMD 2 athub computing system 12, or some other processor. - It should be noted that steps illustrated in
FIG. 7 may be performed in parallel or simultaneously, and steps may be performed continuously. For example, step 702 could be performed continuously such that the user's focus is continuously tracked. - In
step 702, a focus of a user that is wearing aHMD 2 is determined. The focus refers to where the user is directing their attention (e.g., looking). In one embodiment, the focus is determined by tracking head orientation and/or position. This may include tracking head translation, rotation, etc. Thus, when the user rotates their head to look at on object, thesystem 100 determines what they are focused on. In one embodiment, the focus is determined based on eye gaze tracking. In one embodiment, the focus is determined based on a combination of eye gaze tracking and head tracking. - In one embodiment, information identifying the position and orientation of the user and the head mounted
display device 2 associated with the user is transmitted from thehub computing device 12 and received by processing unit 4. This could be defined in a 3D coordinate system.FIG. 2 shows one example of a 3D coordinatesystem 2252 in which the position and orientation of the user, their head, and the head mounteddisplay device 2 might be defined. For example,hub computing device 12 will use one or more depth images and one or more visual images to track a user (e.g., using skeleton tracking). One or more depth images and one or more visual images can be used to determine the position and orientation of the user and the head mounted display device associated with the user. - In one embodiment, information sensed locally (e.g., from the head mounted device 2) by one or more sensors embedded in the head mounted
display device 2 is used instep 702. Various sensor technologies embedded in the head mounted display including the accelerometer, magnetometer and gyroscope, or other sensor technologies may be used to identify head position and orientation. Other sensor technologies may include time of flight, spatial scan, mechanical linkages, phase-difference sensing, and/or direct field sensing. In one embodiment, processing unit 4 will access data from threeaxis gyro 132B, threeaxis accelerometer 132C, and threeaxis magnetometer 132A. - In one embodiment, information identifying a position of the user's eye in the environment is used in
step 702. This is also known as the “eye tracking” as discussed above. For example, in one embodiment for tracking a position of the user's eye in the environment, the user's eye can be illuminated using infrared light fromeye tracking illumination 134A. The reflection from the eye is detected using one or moreeye tracking cameras 134B. The reflection data is sent from head mounteddisplay device 2 to processing unit 4. Processing unit 4 will determine the position of the eye based on the reflection data, as discussed above. In one embodiment, the information identifying a position of the user's eye in the environment is received or accessed by processing unit 4. Suitable tracking technologies are disclosed in U.S. Patent Application Publication 2012/0154277, “Optimized Focal Area for Augmented Reality Displays,” filed on Dec. 17, 2010, which is incorporated herein by reference in its entirety. - Additional information such as information retrieved from the cloud, information detected and/or gathered by one or more external devices, and other relevant information may also be used in
step 702. Techniques such as Simultaneous Localization and Mapping (SLAM) using RGB and/or depth sensor data may be employed to provide a real-time position of the user's head relative to the mapped environment. Environmental typography may be identified using data from the cloud and/or depth sensor data. Regions of the user's body can be identified (e.g., hand, arm, torso, legs) using the depth sensor when the user is looking at him or herself. - In
step 704, thesystem 100 determines whether the user's focus is in a general direction of an object associated with the user interface symbol. At this point in time, the user interface symbol may or may not be displayed in theHMD 2. The user interface symbol could be associated with a real world object or a virtual object. For example, the symbol could be associated with an actual real world book or with a virtual dog. - If the
system 100 determines that the user is focused in the general direction of the object, then thesystem 100 presents a symbol, instep 706.FIG. 8A shows an example of a user interface symbol presented on an object in accordance with one embodiment. Auser interface symbol 804 is presented in theHMD 2 such that it appears as being associated with theobject 802. Theobject 802 could be a real world object or a virtual object being rendering in theHMD 2. In this example, thesymbol 804 is presented on one edge of theobject 802, and appears to touch theobject 802. Many other possibilities exist. Thesymbol 804 is not required to appear to be in contact with theobject 802. However, being close by helps the user to understand that thesymbol 804 may serve as a way to initiate a user interaction, user input, etc. in connection with theobject 802. - In
step 706, thesystem 100 could present more than onesymbol 804 on theobject 802. This is because anobject 802 could have more than onesymbol 804 associated with it. - Note that one option is to present the
symbol 804 in theHMD 2 even if the user is not looking at theobject 802. In this case, thesystem 100 might make thesymbol 804 somewhat more visible when the user looks at theobject 802, or in its general direction. - If the system determined in
step 704 that the user is not focused in the direction of theobject 802, then control passes to step 702 to continue to monitor the user's focus. Note that thesymbol 804 need not be displayed if the user is not focused in the direction of theobject 802, althoughsymbol 804 optionally is displayed in some format. This format could be different from when the user is looking in the direction of theobject 802. For example, thesymbol 804 might be less perceptible. - After presenting the
symbol 804 instep 706, thesystem 100 determines whether the user is focused on thesymbol 804, instep 708. This can be achieved by monitoring the user's head orientation and/or position and/or the user's eye gaze. In one embodiment, thesystem 100 determines whether the user's attention is focused at thesymbol 804 for some minimum time. - As noted, the
object 802 can have more than onesymbol 804 associated with it. In one embodiment, thesystem 100 determines whichsymbol 804 the user is most directly focused on and highlights thatsymbol 804. - If the user's attention is focused on the
symbol 804, then thesystem 100 displays a handle in theHMD 2 extending away from thesymbol 804, instep 710. In one embodiment, when the user first focuses on thesymbol 804 it is highlighted. An example of highlighting is to make thesymbol 804 gleam, glow, shine, grow brighter, etc. Thus, thesymbol 804 may be referred to as a “gleam.” If the user continues to look at thesymbol 804, then the handle extends outward.FIG. 8B depicts one example in which thesymbol 804 is highlighted.FIG. 8C depicts one example in which ahandle 806 extends away from thesymbol 804. Thehandle 806 may or may not have a text label associated with it. Thehandle 806 may or may not have an icon associated with it. - After the
handle 806 is presented, thesystem 100 determines whether the user moves their focus away from thesymbol 804 along thehandle 806, instep 712. If not, thesystem 100 returns to step 708. The user might choose to look away from thesymbol 804 after initially looking at it. In this case, the test ofstep 708 would be “no” and control passes to step 704. - In one embodiment,
step 712 includes thesystem 100 determining whether the user moves their head in a way that indicates that their focus is moving along thehandle 806. This could be a rotation of the head, as one example. - In one embodiment, the
system 100 determines whether the user moves their eyes in a way that indicates that their eye gaze is moving along thehandle 806. Typically, eye gaze determined by eye tracking is considered to be a point at which the user's eye (or eyes) are momentarily are fixed. In this case, thesystem 100 can track the different points at which the user is gazing and determine that these points are moving progressively along thehandle 806. - The shift in user focus need not be exactly along the
handle 806. For example, when tracking head rotation for the example ofFIG. 8C , even if when the user rotates their head to the right they also rotate the head up or down such that the focus is somewhat above or below thehandle 806, thesystem 100 can still consider this to be focus along thehandle 806. However, if the user looks far above (or below) thehandle 806 this can indicate that the user is not attempting to look along thehandle 806 but has shifted their attention to something else. The degree of freedom that thesystem 100 provides here may be an adjustable parameter. Note that thissystem 100 can determine the user's focus along thehandle 806 by tracking head orientation and/or eye gaze. - If the
system 100 determines that the user has shifted their focus along thehandle 806, thesystem 100 determines whether a selection criterion has been met instep 714. The selection criterion could be that the user's focus is at or near the end of thehandle 806, as one example. - If the selection criterion is met, then a user interface element is activated, in
step 716. One example of this is to present a spokes menu in theHMD 2. Example spokes menus are discussed herein. The user interface element is activated at or near the end of thehandle 806 that is away from thesymbol 804, in one embodiment. - Returning to the discussion of
step 714, in some cases the user will not move their focus along thehandle 806 after looking at thesymbol 804. Furthermore, the user may look away from thesymbol 804. - In one case, after looking at the
symbol 804, the user looks away from both thesymbol 804 and theobject 802. In this case, thesystem 100 may discontinue presenting thehandle 806 and thesymbol 804. This is represented bystep 718 resulting in the “away” condition. In this case,step 720 is to discontinue presenting thesymbol 804 and thehandle 806 in theHMD 2. In this case, thesystem 100 may return to monitoring the focus of the user instep 702. - In another case, the user continues to focus on the
handle 806, but the selection criterion is not met. For example, the user does not look far enough to the end of thehandle 806. In another case, the user looks back at thesymbol 804 after looking at the handle 806 (but not meeting the selection criterion). In either case, thesystem 100 continues to present thehandle 806. This is represented by the “handle or symbol” condition ofstep 718 passing control to step 710. - Still another possibility is that the user looks away from the
symbol 804, but is still looking at theobject 802. In this case, thesystem 100 might discontinue displaying thehandle 806. However, thesymbol 804 is still displayed. This is represented by the “object” condition ofstep 718 passing control to step 722. Control then passes to step 706 in this case. - During the process of
FIG. 7 , thesystem 100 may provide feedback to the user. This could be audio feedback as one example. This feedback could be provided in connection with a variety of events. For example, when the user looks at thesymbol 804 audio feedback could be provided. When the user looks along thehandle 806, audio feedback could be provided to assist the user in knowing that progress is being made. Visual feedback could also be provided as the user moves their focus along thehandle 806. - In some cases, the result of
step 714 is selection of an input. For example, theobject 802 might be a virtual dog that wags its tail as a result of the selection instep 714. In this example, this might be the only selection that is associated with thesymbol 804. - In other cases, the result of
step 714 is to bring up a menu. One type of menu is referred to herein as a “spokes menu.”FIG. 8D is an example of one embodiment of aspokes menu 808. Theobject 802 that is associated with thespokes menu 808 is also shown inFIG. 8D . Thespokes menu 808 has acentral hub 816, which is this example is represented as a black dot. Thecentral hub 816 is not necessarily shown to the user in theHMD 2. Thecentral hub 816 is located where the end of thehandle 806 that is opposite thesymbol 804 was, in one embodiment. However, thehandle 806 andsymbol 804 are no longer presented in the display when thespokes menu 808 is displayed, in one embodiment. In one embodiment, thespokes menu 808 appears to grow out of the end of thehandle 806; then, thehandle 806 andsymbol 804 disappear. - The
menu 808 has a number ofmenu items 810 at the exterior, around thecentral hub 816. There is aspoke 812 between eachmenu item 810 and thehub 816. Thespokes 812 are not necessarily shown to the user at all times. Initially, it is expected that the user's focus will be at thehub 816. This is due to thehub 816 being located at the end of thehandle 806. As previously noted, the user may shift their focus towards the end of thehandle 806 to activate thespoke menu 808. By activating this means causing the HMD to display thespoke menu 808. - The user is able to select a
menu item 810 by shifting their focus from thehub 816 towards one of themenu items 810. Initially, thespoke 812 between thehub 816 and thatmenu item 810 might not be shown at all. As the user shifts their focus towards themenu item 810, thespoke 812 grows towards themenu item 810, in one embodiment. Further details are discussed below. -
FIG. 9A is another example of one embodiment of aspokes menu 808. In this example, theobject 802 with which thespokes menu 808 is associated is not depicted. Also, thespokes 812 andhub 816 and are depicted for purposes of discussion. Thespokes 812 andhub 816 need not be presented in theHMD 2. Typically, themenu items 810 are actions associated with theobject 802, although this is not a requirement. In this example, themenu items 810 include various controls (e.g., volume, play/pause, FF/Rev) for operating a video or the like. In this example, themenu items 810 have text labels, but that is not required. Thespokes 812 may or may not have text labels associated with them. Thespokes 812 may or may not have icons associated with them. - In one embodiment, spoke
menus 808 can be nested. This means that selection of amenu item 810 can spawn another spokemenu 808. For example, if the user selects themenu item 810 labeled volume, this could spawn thespoke menu 808 depicted inFIG. 9B . For example, if the user selects themenu item 810 labeled FF/Rev, this could spawn thespoke menu 808 depicted inFIG. 9C . Spawning a menu means that the menu is presented in theHMD 2. -
FIG. 10 is a flowchart of one embodiment of aprocess 1000 providing for a selection of amenu item 810 on aspokes menu 808.Process 1000 may be performed afterstep 714 fromFIG. 7 . Recall thestep 714 results in aspokes menu 808 being activated, in one embodiment. For example, aspokes menu 808 such as depicted inFIG. 8D or 9A is activated. Inprocess 1000, head tracking is used. Eye tracking need not be used inprocess 1000.Process 1000 may be performed by hardware, software, or some combination of hardware and software. In one embodiment,process 1000 is performed, at least in part, by anHMD 2. A processor coupled to theHMD 2 can assist in some of the steps. - At the beginning of
process 1000, thesystem 100 may assume that the user's attention is focused on thehub 816 of thespokes menu 808. This is described instep 1002. This assumption may be made given that thehub 816 may be located at or near the end of thehandle 806. As noted above, thehub 816 does not need to be visually presented in theHMD 2. Also, it is not necessary that the user is actually looking directly at the location of thehub 816. -
Step 1004 is tracking the user's head orientation and/or position. This has already been discussed. The head tracking may be performed continuously throughoutprocess 1000. - In
step 1006, thesystem 100 determines whether the user's head is moving such that the user's focus is moving away from thehub 816. There may be some “safe zone” near thehub 816 such that thesystem 100 does not consider this to be a significant head movement unless the head moves some defined amount (e.g., beyond a threshold). The manner in which the head movement is measured can vary. One technique is to track head rotation in terms of degrees. The head may need to move some minimum amount in terms of degree of rotation to consider this to be a significant movement. - If there is head movement, then the
system 100 highlights the relevant spoke 812 instep 1008. In one embodiment, only a portion of thespoke 812 is highlighted. This portion may be the portion that is closest to thehub 816. The amount by which thespoke 812 is filled depends on how far the user has moved (e.g., rotated) their head, in one embodiment. -
FIGS. 11A-11C show an example of aspoke 812 being progressively filled. This may be in response to the user moving their head further away from thehub 816 in a direction towards the menu item labeled “louder.” For example, as the user rotates their head such that a vector that “shoots straight out” from between their eyes moves from thehub 816 to the right towards the “louder” menu item, thespoke 812 is progressively filled. Progressively filling thespoke 812 means to progressively or gradually present more of thespoke 812 in theHMD 2, in one embodiment. Progressively filling thespoke 812 means to progressively or gradually “fill in” thespoke 812 by, for example, making it more saturated, brighter, filled in, etc., in one embodiment. For example, the entire spoke 812 may initially be faded out. Then, progressively or gradually more of thespoke 812 is made brighter, filled in, saturated, etc. Note thatspokes 812 can be “unfilled,” as well, depending on the focus of the user. - Note that the filling the
spoke 812 could have different sensitivities with respect to the head movement. In other words, the degree to which thespoke 812 fills in response to a certain amount of head movement can be an adjustable parameter. What this implies is that the aforementioned vector from the user's head need not point exactly to where thespoke 812 has been filled. For example, inFIG. 11B thespoke 812 has been filled to point A. The vector from the user's head may or may not be pointed at point A. The vector could be pointing closer to thehub 816 or further from thehub 816 than point A. The sensitivity can help to prevent false positives while still making it easy for the user to provide input. - Some
spokes 812 are filled in what may be referred to as a “growth model”. In this case, when thesystem 100 has determined instep 1006 that the user has moved their head a certain amount thespoke 812 begins to fill. Thespoke 812 can continue to fill so long as the user maintains their head position outside of the “safe zone” near thecentral hub 816. This assumes the head position is also generally along the region of thespoke 812. Thus, even with the user's head not moving, thespoke 812 can continue to progressively fill as depicted in the sequence fromFIG. 11A-11C . - Returning again to the discussion on
FIG. 10 , instep 1010, thesystem 100 determines whether a selection criterion has been met. In one embodiment, the selection criterion is met by the head moving a certain amount. Thus, thesystem 100 determines whether the user's head position reaches a selection point. This selection point refers to a certain amount of movement (e.g., head rotation), in one embodiment. A selection point is one example of a selection criterion. When this selection criterion is met, the system selects the menu item instep 1012. Referring toFIG. 11C , the “louder”menu item 810 is selected when the user's head has moved (e.g., rotated) a defined amount. This amount could be such that the aforementioned head vector is pointing at thelouder menu item 810, to the right of it, or to the left of it (but right of the hub). - Note that after the selection of the
menu item 810, thespokes menu 808 could go away. For example, after selection of the “done”menu item 810 inFIG. 9B , thespokes menu 808 inFIG. 9B could go away. It is also possible for a different spokes menu to replace the existing one, which is referred to herein as “spawning” aspokes menu 808. For example, the menu item ofFIG. 9B could be spawned upon selection of the volume menu item ofFIG. 9A . Other techniques can be used to make aspokes menu 808 disappear, such as the user nodding their head. - The degree to which the user needs to move their head to make the selection (i.e., to achieve the selection criterion) can be an adjustable parameter. This sensitivity could depend on the configuration of the
spoke menu 808. For example, somemenu items 810 could be located further away from thehub 816 than others (where further away may be measured in terms of degrees of head rotation, as one example). - In one embodiment, the selection point is reached when the
spoke 812 has filled to a certain degree. As one example, this can be when thespoke 812 reaches themenu item 810, as depicted inFIG. 11C . This selection technique might be used in connection to the “growth model” technique described above. However, it can also be used in connection with other techniques such as ones that progressively fill thespoke 812 as the user continues to further move (e.g., rotate) their head. - If the selection criterion is not met (
step 1010=no), then control passes to step 1011. Instep 1011, the system determines whether the user is still focused on the same spoke. If so, then control passes to step 1010 to determine whether the selection criterion is met. Note that while the user continues to focus on the same spoke it may gradually fill in more or may gradually “unfill,” depending on the user's focus. - It is possible that the user begins to move their head along a
spoke 812, but then changes their mind and returns their focus to thecentral hub 816 or to adifferent spoke 812. This is tested for instep 1014 in the event that the user is no longer focused on the same spoke (step 1011=no). One option is to return to step 1008 to highlight the spoke that the user is presently focused on (e.g., head is oriented towards). Another option stemming fromstep 1014 is to return to step 1004 upon determining that the user's head is oriented at the safe zone near thehub 816. This is represented by the “hub” condition ofstep 1014. - Still another possibility is to abort in
step 1016 if it is determined that the user is no longer looking anywhere near thespoke menu 808. This is represented by the “away from spoke menu” condition ofstep 1014 passing control to step 1016. - During the process of
FIG. 10 , thesystem 100 may provide feedback to the user. This could be audio feedback as one example. This feedback could be provided in connection with a variety of events. For example, when thespoke 812 is being progressively filled, audio feedback could be provided to assist the user in knowing that progress is being made. - In the process of
FIG. 10 , the user is allowed to move their eyes without triggering a response from thesystem 100. This is because it is head tracking that is used in this technique. This allows the user to keep their head relatively still while moving their eyes to scan thedifferent menu items 810 in thespokes menu 808. The user does not need to keep their head perfectly still if there is a “safe zone” or the like, as previously discussed. -
FIG. 12 is a flowchart of one embodiment of aprocess 1200 providing for a selection of amenu item 810 on aspokes menu 808.Process 1200 may be performed afterstep 714 fromFIG. 7 . Recall thestep 714 results in aspokes menu 808 being activated, in one embodiment. For example, aspokes menu 808 such as depicted inFIG. 8D or 9A is activated. Inprocess 1200, eye tracking is used. - At the beginning of
process 1200, thesystem 100 may assume that the user's attention is focused on thehub 816 of thespokes menu 808. This is described instep 1202. This assumption may be made given that thehub 816 may be located at or near the end of thehandle 806. As noted above, thehub 816, as well as thespokes 812 do not need to be presented in theHMD 2. Also, it is not necessary that the user is actually looking directly at the location of thehub 816. -
Step 1204 is tracking the user's eye gaze. This has already been discussed. The eye tracking may be performed continuously throughoutprocess 1200. - In
process 1200 thesystem 100 may use a variety of techniques to prevent a false positive. In one embodiment, thesystem 100 allows the user to survey thespoke menu items 810 without making an inadvertent selection by requiring that the user provide a signal that they are now using their eyes for selection purposes. Thus, a condition for performingstep 1206 is that a mode has been entered to analyze the eye gaze. - In
step 1206, the system determines whether the user's eye gaze has shifted away from thehub 816. There may be some “safe zone” near thehub 816 such that the system does not consider this to be a significant eye movement unless the eye gaze moves some defined amount. The manner in which the eye movement is measured can vary. - If there is eye movement, then the
system 100 highlights the relevant spoke 812 instep 1208. In one embodiment, only a portion of thespoke 812 is highlighted. This portion may be the portion that is closest to thehub 816. The amount by which thespoke 812 is progressively filled depends on how far the user has moved (e.g., rotated) their eyes, in one embodiment. The previously discussed examples ofFIGS. 11A-11C are applicable to progressively filling the spoke based on the eye gaze. As with head movement, filling thespoke 812 could have different sensitivities with respect to the eye movement. - As with the example of head movement, some
spokes 812 are filled in what may be referred to as a “growth model”. In this case, when thesystem 100 has determined instep 1206 that the user has moved their eye a certain amount thespoke 812 begins to progressively fill. Thespoke 812 can continue to progressively fill so long as the user maintains their eye position outside of the “safe zone” and near thespoke 812. Thus, even with the user's eyes are not moving, thespoke 812 can continue to progressively fill as depicted in the sequence fromFIG. 11A-11C . - Returning again to the discussion on
FIG. 12 , instep 1210, thesystem 100 determines whether the selection has been triggered. In one embodiment, the selection point is triggered by the eyes moving a certain amount. In one embodiment, thesystem 100 determines whether the eyes “hover” near the menu item 810 (or other selection spot) for some defined time period. In one embodiment, thesystem 100 determines whether an additional select event occurs while the user's eye gaze is near the menu item 810 (or other selection spot). For example, the user could use a voice command to select, use a hand motion, touch a region of the HMD 2 (e.g., temple), etc. The foregoing are all examples of selection criterion that are based, at least in part, on eye gaze. - The degree to which the user needs to move their eye to make the selection can be an adjustable parameter. This sensitivity could depend on the configuration of the
spoke menu 808. - In one embodiment, the selection point is reached when the
spoke 812 has filled to a certain degree. As one example, this can be when thespoke 812 reaches themenu item 810, as depicted inFIG. 11C . This selection technique might be used in connection to the “growth model” technique described above. However, it can also be used in connection with other techniques such as ones that progressively fill thespoke 812 as the user continues to further move their eyes. - If the selection criterion is not met (
step 1210=no), then control passes to step 1211. Instep 1211, the system determines whether the user is still focused on the same spoke. If so, then control passes to step 1210 to determine whether the selection criterion is met. Note that while the user continues to focus on the same spoke it may gradually fill in more or may gradually “unfill,” depending on the user's focus. - It is possible that the user begins to move their eye gaze along a
spoke 812, but then changes their mind and returns their focus to thecentral hub 816 or to adifferent spoke 812. If the system determines that the user is no longer focused on the same spoke (step 1211=no), then control passes to step 1214 to determine an action to take based on eye gaze. One option is to return to step 1208 to highlight the spoke that the user has shifted their focus to (e.g., eye gaze is oriented towards). - Another option is to return to step 1204 upon determining that the user's eye gaze is oriented at the safe zone near the
hub 816. Still another possibility is to abort instep 1216 if it is determined that the user is no longer looking anywhere near thespoke menu 808. - In the discussion of
FIGS. 10 and 12 , thespokes 812 are filled progressively. However, this is not an absolute requirement. In one embodiment, thespoke 812 fill all at once in response to the user's focus meeting some criterion, such as being focused near themenu item 810. - A wide variety of configurations may be used for the
spoke menu 808.FIG. 13 shows one embodiment in which themenu items 810 are at a diagonal from thehub 816. Again, thespokes 812 need not be presented in theHMD 2 at all times. As noted, a given spoke 812 can be progressively filled or grown in response to the user's focus. Thehub 816 might never be displayed. -
FIGS. 14A and 14B depict another embodiment of aspokes menu 808.FIG. 14A shows fourmenu items 810 in a radial configuration. The hub is in the center of the menu, although it is not depicted inFIG. 14A . The spokes are also not depicted inFIG. 14A . Three of themenu items 810 allow the user to select a movie, the other is for going back one menu. -
FIG. 14B show this embodiment when one spoke 812 that has been filled (at least partially) in response to thesystem 100 determining that the user's focus has shifted from the hub towards themenu item 810 for Movie B. For example, thesystem 100 determined that the user's head rotated downwards some pre-defined amount. As with the discussion ofFIG. 10 , there can be a safe zone around the central hub to allow some movement prior to beginning to fill thespoke 812. Thespoke 812 can fill progressively or fill all at once. Thespoke 812 could also unfill progressively or all at once, depending on the user's focus. Other techniques discussed in connection withFIG. 10 or 12 can be used. In this example, thespoke 812 is pie-shaped. Many other shapes can be used for thespoke 812. - There can be more than one
user selection symbol 804 associated with anobject 802.FIG. 15 shows an example in which there is a firstuser selection symbol 804 a and a seconduser selection symbol 804 b associated withobject 802. Seconduser selection symbol 804 b has ahandle 806 extended from it in response to the user focusing their attention at or near it. As discussed above, if the user focuses their attention away from the seconduser selection symbol 804 b, along thehandle 806, this will activate some menu element associated with theobject 802. The user can also select the firstuser selection symbol 804 a by focusing their attention on it. - In one embodiment, the
system 100 presents both 804 a, 804 b on thesymbols object 802 in response to the user looking in the general direction ofobject 802. Thesystem 100 highlightssymbol 804 a in response to the user looking at thatsymbol 804 a, and highlightssymbol 804 b in response to the user looking at thatsymbol 804 b. As discussed previously focus can be determined by head tracking and/or eye tracking. - In one embodiment, the
user selection symbol 804 is shaped in a manner that identifies the direction in which thehandle 806 will extend.FIG. 16 depicts one embodiment in whichuser selection symbol 804 c points in the direction in which thehandle 806 has been extended.User selection symbol 804 d also points in a particular direction to indicate the direction in which thehandle 806 will extend from it in response to the user focusing on thatsymbol 804 d. - In one embodiment, there is feedback on the
symbol 804 that indicates where the user's eyes or head vector is with respect to thesymbol 804. For example, thesymbol 804 looks like a comet having stardust that is magnetically drawn to where the user's eyes are focused. This helps the user know where they are focused when thesymbol 804 is very small. It also helps when the system is using an invisible cursor. For example, this can assist the user by providing feedback of whether they need to shift their focus a little right or a little up/down/left in order to activate the user interface. - In one embodiment, spoke
menus 808 can be nested. This means that selection of amenu item 810 can spawn another spokemenu 808.FIG. 17 shows one example of nested spokemenus 808. In this example, there is a main control spokemenu 808 a. Selection of the controlvideo menu item 810 a, spawns the play controls spokemenu 808 b. Selection of the adjustslate menu item 810 b, spawns the slate controls spokemenu 808 c. Note that typically only one of these spokemenus 808 needs to be presented at a time. Thus, upon selection of thevideo menu item 810 a, the maincontrols spokes menu 808 a may disappear. In a similar manner, upon selection of the adjustslate menu item 810 a, the maincontrols spokes menu 808 a may disappear - Going further down, selection of the
volume menu item 810 c in the play controls spokemenu 808 b spawns the volume control spokemenu 808 d. The volume control spokemenu 808 d shows another way to provide feedback to the user. In one embodiment, the user focuses on either the decreasevolume menu item 810 e or the increasevolume menu item 810 d to change the volume. In one embodiment, the user focuses their attention along the spoke associated with amenu item 810 to provide input. For example, the user can control the volume by shifting their focus along thespoke 812 associated with thevolume menu item 810 e. The volume might be controlled by the position of the user's focus along the spoke. For example, the closer to thevolume menu item 810 e, the louder the volume is. Alternatively, the speed at which the volume changes might be controlled by the location of the user's focus along the spoke associated with the volume spokemenu item 810 e. For example, the volume can increase more rapidly when the user's focus is near the increasevolume menu item 810 d. Thevolume bar 1712 provides feedback to the user. Similar control over user input may be provided by spokes associated withother menu items 810. - The foregoing indicates that there may be two (or more) types of
spokes 812. One type ofspoke 812 may be referred to as a “selection spoke.” A selection spoke may result in a selection associated with the menu item. For example, the selection could spawn another spokes menu, or be an input (e.g., select play, select fast forward, etc.). Another type ofspoke 812 may be referred to as an “amplitude” spoke. The amplitude spoke may allow the user to change a value, as one example. For example, the user can change a volume level, change the rate of fast forward, etc. - Also depicted in
FIG. 17 is aresize menu item 810 f in the slate controls 808 c, which can spawn theresize spokes menu 808 e, upon its selection. - As noted in the discussion of
FIG. 7 , one possibility ofstep 714 is for input to be provided by the user shifting their focus to the end of thehandle 806. One example of this is to enter letters on a keyboard.FIG. 18 shows one embodiment that allows user input based on sliding focus along thehandle 806.FIG. 18 shows a number ofuser interface symbols 804, each of which is associated with a different letter of the English alphabet. Thesesymbols 804 are presented to the user as virtual images in theHMD 2. There could be someobject 802 associated with thesymbols 804, although that object is not depicted inFIG. 18 . - When a user looks at one of the
symbols 804, thesystem 110 highlights thatsymbol 804 and grows thehandle 806 out of thesymbol 806. The user can then select that letter by shifting their focus from thesymbol 804 to a point at or near the end of thehandle 806 that is away from thesymbol 804. Receiving the selection of the letter is one example of activating a user interface element in response to determining that the user's focus has shifted away from theuser interface symbol 804 along thehandle 806 to meet a selection criterion, in accordance withstep 714 ofFIG. 7 . - The foregoing detailed description of the technology herein has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the technology to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. The described embodiments were chosen in order to best explain the principles of the technology and its practical application to thereby enable others skilled in the art to best utilize the technology in various embodiments and with various modifications as are suited to the particular use contemplated. It is intended that the scope of the technology be defined by the claims appended hereto.
- Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.
Claims (20)
Priority Applications (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/916,193 US9710130B2 (en) | 2013-06-12 | 2013-06-12 | User focus controlled directional user input |
| KR1020167000651A KR102219246B1 (en) | 2013-06-12 | 2014-06-05 | User focus controlled graphical user interface using a head mounted device |
| EP14733485.8A EP3008567B1 (en) | 2013-06-12 | 2014-06-05 | User focus controlled graphical user interface using an head mounted device |
| PCT/US2014/041121 WO2014200811A1 (en) | 2013-06-12 | 2014-06-05 | User focus controlled graphical user interface using a head mounted device |
| CN201480033895.7A CN105378632B (en) | 2013-06-12 | 2014-06-05 | The oriented user input of user focus control |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/916,193 US9710130B2 (en) | 2013-06-12 | 2013-06-12 | User focus controlled directional user input |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20140372944A1 true US20140372944A1 (en) | 2014-12-18 |
| US9710130B2 US9710130B2 (en) | 2017-07-18 |
Family
ID=51022486
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US13/916,193 Expired - Fee Related US9710130B2 (en) | 2013-06-12 | 2013-06-12 | User focus controlled directional user input |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US9710130B2 (en) |
| EP (1) | EP3008567B1 (en) |
| KR (1) | KR102219246B1 (en) |
| CN (1) | CN105378632B (en) |
| WO (1) | WO2014200811A1 (en) |
Cited By (49)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20150015489A1 (en) * | 2010-06-01 | 2015-01-15 | Vladimir Vaganov | System and method for digital recording of handpainted, handdrawn and handwritten information |
| US20150120824A1 (en) * | 2013-08-22 | 2015-04-30 | Huawei Technologies Co., Ltd. | Communications Method, Client, and Terminal |
| US20150153912A1 (en) * | 2013-12-01 | 2015-06-04 | Apx Labs, Llc | Systems and methods for accessing a nested menu |
| US20150323791A1 (en) * | 2014-05-07 | 2015-11-12 | Verizon Patent And Licensing Inc. | Methods and Systems for Facilitating Remote Control by a Wearable Computer System of an Application Being Executed by a Media Content Processing Device |
| US20150362990A1 (en) * | 2014-06-11 | 2015-12-17 | Lenovo (Singapore) Pte. Ltd. | Displaying a user input modality |
| US9285872B1 (en) * | 2013-12-12 | 2016-03-15 | Google Inc. | Using head gesture and eye position to wake a head mounted device |
| US20160103437A1 (en) * | 2013-06-27 | 2016-04-14 | Abb Technology Ltd | Method and data presenting device for assisting a remote user to provide instructions |
| US9377852B1 (en) * | 2013-08-29 | 2016-06-28 | Rockwell Collins, Inc. | Eye tracking as a method to improve the user interface |
| WO2016109127A1 (en) * | 2014-12-29 | 2016-07-07 | Sony Computer Entertainment America Llc | Methods and systems for user interaction within virtual or augmented reality scene using head mounted display |
| US9442631B1 (en) * | 2014-01-27 | 2016-09-13 | Google Inc. | Methods and systems for hands-free browsing in a wearable computing device |
| US20160371886A1 (en) * | 2015-06-22 | 2016-12-22 | Joe Thompson | System and method for spawning drawing surfaces |
| US20170000342A1 (en) | 2015-03-16 | 2017-01-05 | Magic Leap, Inc. | Methods and systems for detecting health conditions by imaging portions of the eye, including the fundus |
| EP3115871A1 (en) * | 2015-07-09 | 2017-01-11 | Cogisen SRL | Graphic user interface for selection by gaze or other input methods |
| USD788158S1 (en) * | 2015-09-02 | 2017-05-30 | Samsung Electronics Co., Ltd. | Display screen or portion thereof with animated graphical user interface |
| US20170153699A1 (en) * | 2014-07-08 | 2017-06-01 | Denso Corporation | Sight line input parameter correction apparatus, and sight line input apparatus |
| US20170220134A1 (en) * | 2016-02-02 | 2017-08-03 | Aaron Burns | Volatility Based Cursor Tethering |
| US9769585B1 (en) * | 2013-08-30 | 2017-09-19 | Sprint Communications Company L.P. | Positioning surround sound for virtual acoustic presence |
| US9817474B2 (en) * | 2014-01-24 | 2017-11-14 | Tobii Ab | Gaze driven interaction for a vehicle |
| USD804513S1 (en) * | 2015-09-02 | 2017-12-05 | Samsung Electronics Co., Ltd | Display screen or portion thereof with graphical user interface |
| US20180032132A1 (en) * | 2015-02-25 | 2018-02-01 | Kyocera Corporation | Wearable device, control method, and control program |
| US20180046352A1 (en) * | 2016-08-09 | 2018-02-15 | Matthew Johnson | Virtual cursor movement |
| WO2018101995A1 (en) * | 2016-11-30 | 2018-06-07 | Google Llc | Switching of active objects in an augmented and/or virtual reality environment |
| USD824417S1 (en) | 2014-08-22 | 2018-07-31 | Adp, Llc | Display screen or portion thereof with graphical user interface |
| EP3367213A4 (en) * | 2015-10-22 | 2019-04-17 | LG Electronics Inc. | MOBILE TERMINAL AND METHOD OF CONTROLLING THE SAME |
| EP3533047A4 (en) * | 2016-10-27 | 2019-10-02 | Alibaba Group Holding Limited | USER INTERFACE FOR INPUTTING INFORMATION IN A VIRTUAL REALITY ENVIRONMENT |
| US20190324708A1 (en) * | 2017-01-06 | 2019-10-24 | Sony Interactive Entertainment Inc. | Sound outputting apparatus, head-mounted display, sound outputting method, and program |
| US10459231B2 (en) | 2016-04-08 | 2019-10-29 | Magic Leap, Inc. | Augmented reality systems and methods with variable focus lens elements |
| US10565725B2 (en) | 2016-10-17 | 2020-02-18 | Samsung Electronics Co., Ltd. | Method and device for displaying virtual object |
| US10586514B2 (en) | 2017-05-01 | 2020-03-10 | Elbit Systems Ltd | Head mounted display device, system and method |
| US10788791B2 (en) | 2016-02-22 | 2020-09-29 | Real View Imaging Ltd. | Method and system for displaying holographic images within a real object |
| US10795316B2 (en) | 2016-02-22 | 2020-10-06 | Real View Imaging Ltd. | Wide field of view hybrid holographic display |
| US10877643B2 (en) | 2018-03-15 | 2020-12-29 | Google Llc | Systems and methods to increase discoverability in user interfaces |
| US10877437B2 (en) | 2016-02-22 | 2020-12-29 | Real View Imaging Ltd. | Zero order blocking and diverging for holographic imaging |
| US10943395B1 (en) * | 2014-10-03 | 2021-03-09 | Virtex Apps, Llc | Dynamic integration of a virtual environment with a physical environment |
| US10962855B2 (en) | 2017-02-23 | 2021-03-30 | Magic Leap, Inc. | Display system with variable power reflector |
| US11188147B2 (en) * | 2015-06-12 | 2021-11-30 | Panasonic Intellectual Property Corporation Of America | Display control method for highlighting display element focused by user |
| US20210382311A1 (en) * | 2015-03-17 | 2021-12-09 | Raytrx, Llc | Wearable image manipulation and control system with high resolution micro-displays and dynamic opacity augmentation in augmented reality glasses |
| US20220012931A1 (en) * | 2015-02-26 | 2022-01-13 | Rovi Guides, Inc. | Methods and systems for generating holographic animations |
| WO2022155113A1 (en) * | 2021-01-14 | 2022-07-21 | Sterling Labs Llc | Method and device for visualizing multi-modal inputs |
| WO2023020155A1 (en) * | 2021-08-20 | 2023-02-23 | Huawei Technologies Co., Ltd. | Methods, devices and media for input/output space mapping in head-based human-computer interactions |
| US11609675B2 (en) * | 2018-12-19 | 2023-03-21 | Google Llc | Placement of objects in an augmented reality environment |
| US11663937B2 (en) | 2016-02-22 | 2023-05-30 | Real View Imaging Ltd. | Pupil tracking in an image display system |
| US20230229234A1 (en) * | 2019-11-05 | 2023-07-20 | Micron Technology, Inc. | Rendering enhancement based in part on eye tracking |
| US20230418371A1 (en) * | 2022-06-24 | 2023-12-28 | Apple Inc. | Controlling a Device Setting Using Head Pose |
| US12235445B2 (en) | 2015-03-17 | 2025-02-25 | Raytrx, Llc | Wearable image manipulation and control system with high resolution micro-displays and dynamic opacity augmentation in augmented reality glasses |
| USD1081680S1 (en) * | 2022-08-26 | 2025-07-01 | Igt | Display screen or a portion thereof with a graphical user interface |
| US12474788B2 (en) | 2015-10-20 | 2025-11-18 | Magic Leap, Inc. | Selecting virtual objects in a three-dimensional space |
| US12560814B2 (en) | 2016-02-22 | 2026-02-24 | Real View Imaging Ltd. | Holographic display |
| US12591297B2 (en) | 2017-04-19 | 2026-03-31 | Magic Leap, Inc. | Multimodal task execution and text editing for a wearable system |
Families Citing this family (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2015142228A1 (en) * | 2014-03-18 | 2015-09-24 | Telefonaktiebolaget L M Ericsson (Publ) | Controlling a target device |
| CN106303652B (en) * | 2015-05-27 | 2019-09-06 | 阿里巴巴集团控股有限公司 | Method and device for drawing interface elements |
| US10409443B2 (en) * | 2015-06-24 | 2019-09-10 | Microsoft Technology Licensing, Llc | Contextual cursor display based on hand tracking |
| US20170092002A1 (en) * | 2015-09-30 | 2017-03-30 | Daqri, Llc | User interface for augmented reality system |
| EP3249497A1 (en) * | 2016-05-24 | 2017-11-29 | Harman Becker Automotive Systems GmbH | Eye tracking |
| US10788888B2 (en) * | 2016-06-07 | 2020-09-29 | Koninklijke Kpn N.V. | Capturing and rendering information involving a virtual environment |
| CN106648047A (en) * | 2016-09-14 | 2017-05-10 | 歌尔科技有限公司 | Focus locating method and apparatus used for virtual reality device, and virtual reality device |
| CN108958590A (en) * | 2017-05-26 | 2018-12-07 | 成都理想境界科技有限公司 | Menu operation method applied to head-mounted display device and head-mounted display device |
| US10957069B2 (en) * | 2017-09-29 | 2021-03-23 | Tobii Ab | Head pose estimation from local eye region |
| WO2019204161A1 (en) | 2018-04-20 | 2019-10-24 | Pcms Holdings, Inc. | Method and system for gaze-based control of mixed reality content |
| US10831030B2 (en) | 2018-04-20 | 2020-11-10 | Microsoft Technology Licensing, Llc | Systems and methods for visually guided gaze-based targeting |
| CN109613982A (en) * | 2018-12-13 | 2019-04-12 | 叶成环 | Wear-type AR shows the display exchange method of equipment |
| WO2021247312A1 (en) | 2020-06-03 | 2021-12-09 | Limonox Projects Llc | Eye-gaze based biofeedback |
| CN111722716B (en) * | 2020-06-18 | 2022-02-08 | 清华大学 | Eye movement interaction method, head-mounted device and computer readable medium |
| US12099654B1 (en) | 2021-06-21 | 2024-09-24 | Apple Inc. | Adaptation of electronic content |
| US12061738B2 (en) | 2022-06-02 | 2024-08-13 | Google Llc | Attention redirection of a user of a wearable device |
| KR102829285B1 (en) * | 2022-06-17 | 2025-07-03 | 한국과학기술원 | Electronic device for providing gaze-based menu interface utilizing lattice of visual anchors, and method of the same |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20100083172A1 (en) * | 2008-10-01 | 2010-04-01 | International Business Machines Corporation | Method and system for generating and displaying an interactive dynamic list view of multiply connected objects |
| US20110004849A1 (en) * | 2008-02-01 | 2011-01-06 | Oh Eui Jin | Inputting method by predicting a word and electronic device for practicing the method |
| US20110078560A1 (en) * | 2009-09-25 | 2011-03-31 | Christopher Douglas Weeldreyer | Device, Method, and Graphical User Interface for Displaying Emphasis Animations for an Electronic Document in a Presentation Mode |
| US20110140994A1 (en) * | 2009-12-15 | 2011-06-16 | Noma Tatsuyoshi | Information Presenting Apparatus, Method, and Computer Program Product |
| US20130073932A1 (en) * | 2011-08-19 | 2013-03-21 | Apple Inc. | Interactive Content for Digital Books |
| US8910078B2 (en) * | 2009-06-05 | 2014-12-09 | Dassault Systemes Solidworks Corporation | Predictive target enlargement |
| US20150070477A1 (en) * | 2012-03-30 | 2015-03-12 | Sony Corporation | Image processing device, image processing method, and program |
| US9035878B1 (en) * | 2012-02-29 | 2015-05-19 | Google Inc. | Input system |
Family Cites Families (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6009210A (en) | 1997-03-05 | 1999-12-28 | Digital Equipment Corporation | Hands-free interface to a virtual reality environment using head tracking |
| CA2310114A1 (en) | 1998-02-02 | 1999-08-02 | Steve Mann | Wearable camera system with viewfinder means |
| US6152563A (en) | 1998-02-20 | 2000-11-28 | Hutchinson; Thomas E. | Eye gaze direction tracker |
| US7121946B2 (en) | 1998-08-10 | 2006-10-17 | Cybernet Systems Corporation | Real-time head tracking system for computer games and other applications |
| US7883415B2 (en) | 2003-09-15 | 2011-02-08 | Sony Computer Entertainment Inc. | Method and apparatus for adjusting a view of a scene being displayed according to tracked head motion |
| US7412077B2 (en) | 2006-12-29 | 2008-08-12 | Motorola, Inc. | Apparatus and methods for head pose estimation and head gesture detection |
| WO2009110769A2 (en) * | 2008-03-06 | 2009-09-11 | Oh Eui-Jin | Data input device |
| DE102009049073A1 (en) * | 2009-10-12 | 2011-04-21 | Metaio Gmbh | Method for presenting virtual information in a view of a real environment |
| US8976086B2 (en) | 2010-12-03 | 2015-03-10 | Esight Corp. | Apparatus and method for a bioptic real time video system |
| US9213405B2 (en) | 2010-12-16 | 2015-12-15 | Microsoft Technology Licensing, Llc | Comprehension and intent-based content for augmented reality displays |
| US9727132B2 (en) | 2011-07-01 | 2017-08-08 | Microsoft Technology Licensing, Llc | Multi-visor: managing applications in augmented reality environments |
| US20130021374A1 (en) * | 2011-07-20 | 2013-01-24 | Google Inc. | Manipulating And Displaying An Image On A Wearable Computing System |
| US8866852B2 (en) | 2011-11-28 | 2014-10-21 | Google Inc. | Method and system for input detection |
-
2013
- 2013-06-12 US US13/916,193 patent/US9710130B2/en not_active Expired - Fee Related
-
2014
- 2014-06-05 EP EP14733485.8A patent/EP3008567B1/en not_active Not-in-force
- 2014-06-05 WO PCT/US2014/041121 patent/WO2014200811A1/en not_active Ceased
- 2014-06-05 KR KR1020167000651A patent/KR102219246B1/en not_active Expired - Fee Related
- 2014-06-05 CN CN201480033895.7A patent/CN105378632B/en active Active
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110004849A1 (en) * | 2008-02-01 | 2011-01-06 | Oh Eui Jin | Inputting method by predicting a word and electronic device for practicing the method |
| US20100083172A1 (en) * | 2008-10-01 | 2010-04-01 | International Business Machines Corporation | Method and system for generating and displaying an interactive dynamic list view of multiply connected objects |
| US8910078B2 (en) * | 2009-06-05 | 2014-12-09 | Dassault Systemes Solidworks Corporation | Predictive target enlargement |
| US20110078560A1 (en) * | 2009-09-25 | 2011-03-31 | Christopher Douglas Weeldreyer | Device, Method, and Graphical User Interface for Displaying Emphasis Animations for an Electronic Document in a Presentation Mode |
| US20110140994A1 (en) * | 2009-12-15 | 2011-06-16 | Noma Tatsuyoshi | Information Presenting Apparatus, Method, and Computer Program Product |
| US20130073932A1 (en) * | 2011-08-19 | 2013-03-21 | Apple Inc. | Interactive Content for Digital Books |
| US9035878B1 (en) * | 2012-02-29 | 2015-05-19 | Google Inc. | Input system |
| US20150070477A1 (en) * | 2012-03-30 | 2015-03-12 | Sony Corporation | Image processing device, image processing method, and program |
Cited By (124)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20150015489A1 (en) * | 2010-06-01 | 2015-01-15 | Vladimir Vaganov | System and method for digital recording of handpainted, handdrawn and handwritten information |
| US9829873B2 (en) * | 2013-06-27 | 2017-11-28 | Abb Schweiz Ag | Method and data presenting device for assisting a remote user to provide instructions |
| US20160103437A1 (en) * | 2013-06-27 | 2016-04-14 | Abb Technology Ltd | Method and data presenting device for assisting a remote user to provide instructions |
| US9531841B2 (en) * | 2013-08-22 | 2016-12-27 | Huawei Technologies Co., Ltd. | Communications method, client, and terminal |
| US20150120824A1 (en) * | 2013-08-22 | 2015-04-30 | Huawei Technologies Co., Ltd. | Communications Method, Client, and Terminal |
| US9377852B1 (en) * | 2013-08-29 | 2016-06-28 | Rockwell Collins, Inc. | Eye tracking as a method to improve the user interface |
| US9769585B1 (en) * | 2013-08-30 | 2017-09-19 | Sprint Communications Company L.P. | Positioning surround sound for virtual acoustic presence |
| US20150153826A1 (en) * | 2013-12-01 | 2015-06-04 | Apx Labs, Llc | Systems and methods for providing a virtual menu |
| US20150153913A1 (en) * | 2013-12-01 | 2015-06-04 | Apx Labs, Llc | Systems and methods for interacting with a virtual menu |
| US10466858B2 (en) * | 2013-12-01 | 2019-11-05 | Upskill, Inc. | Systems and methods for interacting with a virtual menu |
| US10558325B2 (en) * | 2013-12-01 | 2020-02-11 | Upskill, Inc. | Systems and methods for controlling operation of an on-board component |
| US20150153912A1 (en) * | 2013-12-01 | 2015-06-04 | Apx Labs, Llc | Systems and methods for accessing a nested menu |
| US10254920B2 (en) * | 2013-12-01 | 2019-04-09 | Upskill, Inc. | Systems and methods for accessing a nested menu |
| US9285872B1 (en) * | 2013-12-12 | 2016-03-15 | Google Inc. | Using head gesture and eye position to wake a head mounted device |
| US10035518B2 (en) | 2014-01-24 | 2018-07-31 | Tobii Ab | Gaze driven interaction for a vehicle |
| US20180224933A1 (en) * | 2014-01-24 | 2018-08-09 | Tobii Ab | Gaze driven interaction for a vehicle |
| US9817474B2 (en) * | 2014-01-24 | 2017-11-14 | Tobii Ab | Gaze driven interaction for a vehicle |
| US10324527B2 (en) | 2014-01-24 | 2019-06-18 | Tobii Ab | Gaze driven interaction for a vehicle |
| US9442631B1 (en) * | 2014-01-27 | 2016-09-13 | Google Inc. | Methods and systems for hands-free browsing in a wearable computing device |
| US10114466B2 (en) | 2014-01-27 | 2018-10-30 | Google Llc | Methods and systems for hands-free browsing in a wearable computing device |
| US9958935B2 (en) * | 2014-05-07 | 2018-05-01 | Verizon Patent And Licensing Inc. | Methods and systems for facilitating remote control by a wearable computer system of an application being executed by a media content processing device |
| US20150323791A1 (en) * | 2014-05-07 | 2015-11-12 | Verizon Patent And Licensing Inc. | Methods and Systems for Facilitating Remote Control by a Wearable Computer System of an Application Being Executed by a Media Content Processing Device |
| US20150362990A1 (en) * | 2014-06-11 | 2015-12-17 | Lenovo (Singapore) Pte. Ltd. | Displaying a user input modality |
| US20170153699A1 (en) * | 2014-07-08 | 2017-06-01 | Denso Corporation | Sight line input parameter correction apparatus, and sight line input apparatus |
| USD824417S1 (en) | 2014-08-22 | 2018-07-31 | Adp, Llc | Display screen or portion thereof with graphical user interface |
| US10943395B1 (en) * | 2014-10-03 | 2021-03-09 | Virtex Apps, Llc | Dynamic integration of a virtual environment with a physical environment |
| US11887258B2 (en) | 2014-10-03 | 2024-01-30 | Virtex Apps, Llc | Dynamic integration of a virtual environment with a physical environment |
| WO2016109127A1 (en) * | 2014-12-29 | 2016-07-07 | Sony Computer Entertainment America Llc | Methods and systems for user interaction within virtual or augmented reality scene using head mounted display |
| US10073516B2 (en) | 2014-12-29 | 2018-09-11 | Sony Interactive Entertainment Inc. | Methods and systems for user interaction within virtual reality scene using head mounted display |
| US20180032132A1 (en) * | 2015-02-25 | 2018-02-01 | Kyocera Corporation | Wearable device, control method, and control program |
| US10540009B2 (en) * | 2015-02-25 | 2020-01-21 | Kyocera Corporation | Wearable device, control method, and control program |
| US20220012931A1 (en) * | 2015-02-26 | 2022-01-13 | Rovi Guides, Inc. | Methods and systems for generating holographic animations |
| US11663766B2 (en) * | 2015-02-26 | 2023-05-30 | Rovi Guides, Inc. | Methods and systems for generating holographic animations |
| US12217348B2 (en) | 2015-02-26 | 2025-02-04 | Adeia Guides Inc. | Methods and systems for generating holographic animations |
| US10459229B2 (en) | 2015-03-16 | 2019-10-29 | Magic Leap, Inc. | Methods and systems for performing two-photon microscopy |
| US10788675B2 (en) | 2015-03-16 | 2020-09-29 | Magic Leap, Inc. | Methods and systems for diagnosing and treating eyes using light therapy |
| US12345892B2 (en) | 2015-03-16 | 2025-07-01 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for diagnosing health conditions based on visual fields |
| US20170000342A1 (en) | 2015-03-16 | 2017-01-05 | Magic Leap, Inc. | Methods and systems for detecting health conditions by imaging portions of the eye, including the fundus |
| US11747627B2 (en) | 2015-03-16 | 2023-09-05 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for diagnosing health conditions based on visual fields |
| US11474359B2 (en) | 2015-03-16 | 2022-10-18 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for diagnosing health conditions based on visual fields |
| US10345592B2 (en) | 2015-03-16 | 2019-07-09 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for diagnosing a user using electrical potentials |
| US10345593B2 (en) | 2015-03-16 | 2019-07-09 | Magic Leap, Inc. | Methods and systems for providing augmented reality content for treating color blindness |
| US10345590B2 (en) | 2015-03-16 | 2019-07-09 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for determining optical prescriptions |
| US10345591B2 (en) | 2015-03-16 | 2019-07-09 | Magic Leap, Inc. | Methods and systems for performing retinoscopy |
| US10359631B2 (en) | 2015-03-16 | 2019-07-23 | Magic Leap, Inc. | Augmented reality display systems and methods for re-rendering the world |
| US10365488B2 (en) | 2015-03-16 | 2019-07-30 | Magic Leap, Inc. | Methods and systems for diagnosing eyes using aberrometer |
| US10371949B2 (en) | 2015-03-16 | 2019-08-06 | Magic Leap, Inc. | Methods and systems for performing confocal microscopy |
| US10371948B2 (en) | 2015-03-16 | 2019-08-06 | Magic Leap, Inc. | Methods and systems for diagnosing color blindness |
| US10371946B2 (en) | 2015-03-16 | 2019-08-06 | Magic Leap, Inc. | Methods and systems for diagnosing binocular vision conditions |
| US10371945B2 (en) | 2015-03-16 | 2019-08-06 | Magic Leap, Inc. | Methods and systems for diagnosing and treating higher order refractive aberrations of an eye |
| US10371947B2 (en) | 2015-03-16 | 2019-08-06 | Magic Leap, Inc. | Methods and systems for modifying eye convergence for diagnosing and treating conditions including strabismus and/or amblyopia |
| US10379354B2 (en) | 2015-03-16 | 2019-08-13 | Magic Leap, Inc. | Methods and systems for diagnosing contrast sensitivity |
| US10379350B2 (en) | 2015-03-16 | 2019-08-13 | Magic Leap, Inc. | Methods and systems for diagnosing eyes using ultrasound |
| US10379351B2 (en) | 2015-03-16 | 2019-08-13 | Magic Leap, Inc. | Methods and systems for diagnosing and treating eyes using light therapy |
| US10386639B2 (en) | 2015-03-16 | 2019-08-20 | Magic Leap, Inc. | Methods and systems for diagnosing eye conditions such as red reflex using light reflected from the eyes |
| US10386640B2 (en) | 2015-03-16 | 2019-08-20 | Magic Leap, Inc. | Methods and systems for determining intraocular pressure |
| US10386641B2 (en) | 2015-03-16 | 2019-08-20 | Magic Leap, Inc. | Methods and systems for providing augmented reality content for treatment of macular degeneration |
| US10429649B2 (en) | 2015-03-16 | 2019-10-01 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for diagnosing using occluder |
| US11256096B2 (en) | 2015-03-16 | 2022-02-22 | Magic Leap, Inc. | Methods and systems for diagnosing and treating presbyopia |
| US10437062B2 (en) | 2015-03-16 | 2019-10-08 | Magic Leap, Inc. | Augmented and virtual reality display platforms and methods for delivering health treatments to a user |
| US10444504B2 (en) | 2015-03-16 | 2019-10-15 | Magic Leap, Inc. | Methods and systems for performing optical coherence tomography |
| US10451877B2 (en) | 2015-03-16 | 2019-10-22 | Magic Leap, Inc. | Methods and systems for diagnosing and treating presbyopia |
| US20170007111A1 (en) * | 2015-03-16 | 2017-01-12 | Magic Leap, Inc. | Methods and systems for diagnosing eye conditions, including macular degeneration |
| US11156835B2 (en) | 2015-03-16 | 2021-10-26 | Magic Leap, Inc. | Methods and systems for diagnosing and treating health ailments |
| US10983351B2 (en) | 2015-03-16 | 2021-04-20 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for diagnosing health conditions based on visual fields |
| US10969588B2 (en) | 2015-03-16 | 2021-04-06 | Magic Leap, Inc. | Methods and systems for diagnosing contrast sensitivity |
| US10466477B2 (en) | 2015-03-16 | 2019-11-05 | Magic Leap, Inc. | Methods and systems for providing wavefront corrections for treating conditions including myopia, hyperopia, and/or astigmatism |
| US10473934B2 (en) | 2015-03-16 | 2019-11-12 | Magic Leap, Inc. | Methods and systems for performing slit lamp examination |
| US10527850B2 (en) | 2015-03-16 | 2020-01-07 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for determining optical prescriptions by imaging retina |
| US20170007843A1 (en) | 2015-03-16 | 2017-01-12 | Magic Leap, Inc. | Methods and systems for diagnosing and treating eyes using laser therapy |
| US10775628B2 (en) | 2015-03-16 | 2020-09-15 | Magic Leap, Inc. | Methods and systems for diagnosing and treating presbyopia |
| US10539795B2 (en) | 2015-03-16 | 2020-01-21 | Magic Leap, Inc. | Methods and systems for diagnosing and treating eyes using laser therapy |
| US10539794B2 (en) | 2015-03-16 | 2020-01-21 | Magic Leap, Inc. | Methods and systems for detecting health conditions by imaging portions of the eye, including the fundus |
| US10545341B2 (en) * | 2015-03-16 | 2020-01-28 | Magic Leap, Inc. | Methods and systems for diagnosing eye conditions, including macular degeneration |
| US20170007450A1 (en) | 2015-03-16 | 2017-01-12 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for delivery of medication to eyes |
| US10564423B2 (en) | 2015-03-16 | 2020-02-18 | Magic Leap, Inc. | Augmented and virtual reality display systems and methods for delivery of medication to eyes |
| US12210156B2 (en) | 2015-03-17 | 2025-01-28 | Raytrx, Llc | Wearable image manipulation and control system with high resolution micro-displays and dynamic opacity augmentation in augmented reality glasses |
| US12235445B2 (en) | 2015-03-17 | 2025-02-25 | Raytrx, Llc | Wearable image manipulation and control system with high resolution micro-displays and dynamic opacity augmentation in augmented reality glasses |
| US12235446B2 (en) * | 2015-03-17 | 2025-02-25 | Raytrx, Llc | Wearable image manipulation and control system with high resolution micro-displays and dynamic opacity augmentation in augmented reality glasses |
| US20210382311A1 (en) * | 2015-03-17 | 2021-12-09 | Raytrx, Llc | Wearable image manipulation and control system with high resolution micro-displays and dynamic opacity augmentation in augmented reality glasses |
| US11188147B2 (en) * | 2015-06-12 | 2021-11-30 | Panasonic Intellectual Property Corporation Of America | Display control method for highlighting display element focused by user |
| US20160371886A1 (en) * | 2015-06-22 | 2016-12-22 | Joe Thompson | System and method for spawning drawing surfaces |
| US9898865B2 (en) * | 2015-06-22 | 2018-02-20 | Microsoft Technology Licensing, Llc | System and method for spawning drawing surfaces |
| EP3115871A1 (en) * | 2015-07-09 | 2017-01-11 | Cogisen SRL | Graphic user interface for selection by gaze or other input methods |
| USD804513S1 (en) * | 2015-09-02 | 2017-12-05 | Samsung Electronics Co., Ltd | Display screen or portion thereof with graphical user interface |
| USD788158S1 (en) * | 2015-09-02 | 2017-05-30 | Samsung Electronics Co., Ltd. | Display screen or portion thereof with animated graphical user interface |
| US12474788B2 (en) | 2015-10-20 | 2025-11-18 | Magic Leap, Inc. | Selecting virtual objects in a three-dimensional space |
| US10540005B2 (en) | 2015-10-22 | 2020-01-21 | Lg Electronics Inc. | Mobile terminal and control method therefor |
| EP3367213A4 (en) * | 2015-10-22 | 2019-04-17 | LG Electronics Inc. | MOBILE TERMINAL AND METHOD OF CONTROLLING THE SAME |
| US20170220134A1 (en) * | 2016-02-02 | 2017-08-03 | Aaron Burns | Volatility Based Cursor Tethering |
| US10209785B2 (en) * | 2016-02-02 | 2019-02-19 | Microsoft Technology Licensing, Llc | Volatility based cursor tethering |
| US10877437B2 (en) | 2016-02-22 | 2020-12-29 | Real View Imaging Ltd. | Zero order blocking and diverging for holographic imaging |
| US10788791B2 (en) | 2016-02-22 | 2020-09-29 | Real View Imaging Ltd. | Method and system for displaying holographic images within a real object |
| US11663937B2 (en) | 2016-02-22 | 2023-05-30 | Real View Imaging Ltd. | Pupil tracking in an image display system |
| US10795316B2 (en) | 2016-02-22 | 2020-10-06 | Real View Imaging Ltd. | Wide field of view hybrid holographic display |
| US12481243B2 (en) | 2016-02-22 | 2025-11-25 | Real View Imaging Ltd. | Method and system for displaying holographic images within a real object |
| US11543773B2 (en) | 2016-02-22 | 2023-01-03 | Real View Imaging Ltd. | Wide field of view hybrid holographic display |
| US12560814B2 (en) | 2016-02-22 | 2026-02-24 | Real View Imaging Ltd. | Holographic display |
| US11754971B2 (en) | 2016-02-22 | 2023-09-12 | Real View Imaging Ltd. | Method and system for displaying holographic images within a real object |
| US11614626B2 (en) | 2016-04-08 | 2023-03-28 | Magic Leap, Inc. | Augmented reality systems and methods with variable focus lens elements |
| US10459231B2 (en) | 2016-04-08 | 2019-10-29 | Magic Leap, Inc. | Augmented reality systems and methods with variable focus lens elements |
| US11106041B2 (en) | 2016-04-08 | 2021-08-31 | Magic Leap, Inc. | Augmented reality systems and methods with variable focus lens elements |
| US20180046352A1 (en) * | 2016-08-09 | 2018-02-15 | Matthew Johnson | Virtual cursor movement |
| US10565725B2 (en) | 2016-10-17 | 2020-02-18 | Samsung Electronics Co., Ltd. | Method and device for displaying virtual object |
| EP3533047A4 (en) * | 2016-10-27 | 2019-10-02 | Alibaba Group Holding Limited | USER INTERFACE FOR INPUTTING INFORMATION IN A VIRTUAL REALITY ENVIRONMENT |
| WO2018101995A1 (en) * | 2016-11-30 | 2018-06-07 | Google Llc | Switching of active objects in an augmented and/or virtual reality environment |
| US11086587B2 (en) * | 2017-01-06 | 2021-08-10 | Sony Interactive Entertainment Inc. | Sound outputting apparatus and method for head-mounted display to enhance realistic feeling of augmented or mixed reality space |
| US20190324708A1 (en) * | 2017-01-06 | 2019-10-24 | Sony Interactive Entertainment Inc. | Sound outputting apparatus, head-mounted display, sound outputting method, and program |
| US10962855B2 (en) | 2017-02-23 | 2021-03-30 | Magic Leap, Inc. | Display system with variable power reflector |
| US11774823B2 (en) | 2017-02-23 | 2023-10-03 | Magic Leap, Inc. | Display system with variable power reflector |
| US11300844B2 (en) | 2017-02-23 | 2022-04-12 | Magic Leap, Inc. | Display system with variable power reflector |
| US12591297B2 (en) | 2017-04-19 | 2026-03-31 | Magic Leap, Inc. | Multimodal task execution and text editing for a wearable system |
| US11004425B2 (en) | 2017-05-01 | 2021-05-11 | Elbit Systems Ltd. | Head mounted display device, system and method |
| US10586514B2 (en) | 2017-05-01 | 2020-03-10 | Elbit Systems Ltd | Head mounted display device, system and method |
| US10877643B2 (en) | 2018-03-15 | 2020-12-29 | Google Llc | Systems and methods to increase discoverability in user interfaces |
| US11609675B2 (en) * | 2018-12-19 | 2023-03-21 | Google Llc | Placement of objects in an augmented reality environment |
| US20230229234A1 (en) * | 2019-11-05 | 2023-07-20 | Micron Technology, Inc. | Rendering enhancement based in part on eye tracking |
| WO2022155113A1 (en) * | 2021-01-14 | 2022-07-21 | Sterling Labs Llc | Method and device for visualizing multi-modal inputs |
| WO2023020155A1 (en) * | 2021-08-20 | 2023-02-23 | Huawei Technologies Co., Ltd. | Methods, devices and media for input/output space mapping in head-based human-computer interactions |
| US11797081B2 (en) | 2021-08-20 | 2023-10-24 | Huawei Technologies Co., Ltd. | Methods, devices and media for input/output space mapping in head-based human-computer interactions |
| US12026302B2 (en) * | 2022-06-24 | 2024-07-02 | Apple Inc. | Controlling a device setting using head pose |
| US20230418371A1 (en) * | 2022-06-24 | 2023-12-28 | Apple Inc. | Controlling a Device Setting Using Head Pose |
| USD1081680S1 (en) * | 2022-08-26 | 2025-07-01 | Igt | Display screen or a portion thereof with a graphical user interface |
| USD1117272S1 (en) * | 2022-08-26 | 2026-03-10 | Igt | Display screen or a portion thereof with a graphical user interface |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2014200811A1 (en) | 2014-12-18 |
| EP3008567A1 (en) | 2016-04-20 |
| EP3008567B1 (en) | 2018-07-25 |
| KR20160018792A (en) | 2016-02-17 |
| CN105378632A (en) | 2016-03-02 |
| US9710130B2 (en) | 2017-07-18 |
| KR102219246B1 (en) | 2021-02-22 |
| CN105378632B (en) | 2019-01-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9710130B2 (en) | User focus controlled directional user input | |
| US9727132B2 (en) | Multi-visor: managing applications in augmented reality environments | |
| EP2652940B1 (en) | Comprehension and intent-based content for augmented reality displays | |
| US9690099B2 (en) | Optimized focal area for augmented reality displays | |
| US9116666B2 (en) | Gesture based region identification for holograms | |
| US10175483B2 (en) | Hybrid world/body locked HUD on an HMD | |
| US8884984B2 (en) | Fusing virtual content into real content | |
| US20160131902A1 (en) | System for automatic eye tracking calibration of head mounted display device | |
| US20130342572A1 (en) | Control of displayed content in virtual environments | |
| US20130257748A1 (en) | Touch sensitive user interface | |
| US20130328925A1 (en) | Object focus in a mixed reality environment | |
| HK1171103B (en) | Comprehension and intent-based content for augmented reality display | |
| HK1171515B (en) | Optimized focal area for augmented reality display |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MICROSOFT CORPORATION;REEL/FRAME:034747/0417 Effective date: 20141014 Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MICROSOFT CORPORATION;REEL/FRAME:039025/0454 Effective date: 20141014 |
|
| AS | Assignment |
Owner name: MICROSOFT CORPORATION, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MULCAHY, KATHLEEN;BURNS, AARON;OMOTANI, TODD;AND OTHERS;SIGNING DATES FROM 20130603 TO 20130610;REEL/FRAME:035835/0589 |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |
|
| FEPP | Fee payment procedure |
Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| LAPS | Lapse for failure to pay maintenance fees |
Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
| FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20250718 |