WO2014085768A1 - Virtual and augmented reality instruction system - Google Patents

Virtual and augmented reality instruction system

Info

Publication number
WO2014085768A1
WO2014085768A1 (application PCT/US2013/072493)
Authority
WO
WIPO (PCT)
Prior art keywords
virtual
augmented reality
board
instruction system
tracking
Prior art date
Application number
PCT/US2013/072493
Other languages
English (en)
Inventor
Imran HADDISH
Original Assignee
Haddish Imran
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Haddish Imran filed Critical Haddish Imran
Priority to AU2013351959A priority Critical patent/AU2013351959B2/en
Priority to US17/269,969 priority patent/US11694565B2/en
Priority to ES13858451T priority patent/ES2893410T3/es
Priority to EP21189799.6A priority patent/EP3968135A1/fr
Priority to CN201380062587.2A priority patent/CN105247453A/zh
Priority to CA2892958A priority patent/CA2892958C/fr
Priority to DK13858451.1T priority patent/DK2926224T3/da
Priority to EP13858451.1A priority patent/EP2926224B1/fr
Publication of WO2014085768A1 publication Critical patent/WO2014085768A1/fr
Priority to AU2021261950A priority patent/AU2021261950B2/en

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/033Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor
    • G06F3/0346Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor with detection of the device orientation or free movement in a 3D space, e.g. 3D mice, 6-DOF [six degrees of freedom] pointers using gyroscopes, accelerometers or tilt-sensors
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00Electrically-operated educational appliances
    • G09B5/06Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
    • G09B5/065Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems
    • GPHYSICS
    • G02OPTICS
    • G02BOPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B27/00Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B27/01Head-up displays
    • G02B27/017Head mounted
    • G02B27/0172Head mounted characterised by optical features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/16Constructional details or arrangements
    • G06F1/1613Constructional details or arrangements for portable computers
    • G06F1/163Wearable computers, e.g. on a belt
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017Gesture based interaction, e.g. based on a set of recognized hand gestures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/033Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor
    • G06F3/038Control and interface arrangements therefor, e.g. drivers or device-embedded control circuitry
    • G06F3/0383Signal control means within the pointing device
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/041Digitisers, e.g. for touch screens or touch pads, characterised by the transducing means
    • G06F3/042Digitisers, e.g. for touch screens or touch pads, characterised by the transducing means by opto-electronic means
    • G06F3/0428Digitisers, e.g. for touch screens or touch pads, characterised by the transducing means by opto-electronic means by sensing at the edges of the touch surface the interruption of optical paths, e.g. an illumination plane, parallel to the touch surface which may be virtual
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/041Digitisers, e.g. for touch screens or touch pads, characterised by the transducing means
    • G06F3/043Digitisers, e.g. for touch screens or touch pads, characterised by the transducing means using propagating acoustic waves
    • GPHYSICS
    • G02OPTICS
    • G02BOPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B27/00Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B27/01Head-up displays
    • G02B27/0101Head-up displays characterised by optical features
    • G02B2027/0138Head-up displays characterised by optical features comprising image capture systems, e.g. camera
    • GPHYSICS
    • G02OPTICS
    • G02BOPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B27/00Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B27/01Head-up displays
    • G02B27/0101Head-up displays characterised by optical features
    • G02B2027/014Head-up displays characterised by optical features comprising information/image processing systems
    • GPHYSICS
    • G02OPTICS
    • G02BOPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B27/00Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B27/01Head-up displays
    • G02B27/0101Head-up displays characterised by optical features
    • G02B2027/0141Head-up displays characterised by optical features characterised by the informative content of the display
    • GPHYSICS
    • G02OPTICS
    • G02BOPTICAL ELEMENTS, SYSTEMS OR APPARATUS
    • G02B27/00Optical systems or apparatus not provided for by any of the groups G02B1/00 - G02B26/00, G02B30/00
    • G02B27/01Head-up displays
    • G02B27/017Head mounted
    • G02B2027/0178Eyeglass type
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2203/00Indexing scheme relating to G06F3/00 - G06F3/048
    • G06F2203/038Indexing scheme relating to G06F3/038
    • G06F2203/0381Multimodal input, i.e. interface arrangements enabling the user to issue commands by simultaneous use of input devices of different nature, e.g. voice plus gesture on digitizer
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/033Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor
    • G06F3/0354Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor with detection of 2D relative movements between the device, or an operating part thereof, and a plane or surface, e.g. 2D mice, trackballs, pens or pucks
    • G06F3/03545Pens or stylus
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/041Digitisers, e.g. for touch screens or touch pads, characterised by the transducing means
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B7/00Electrically-operated teaching apparatus or devices working with questions and answers
    • G09B7/02Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student

Definitions

  • the present invention relates to a virtual re-creation of an instructional session through a virtual and augmented reality instruction system, and more particularly to a system designed to capture the activities of an individual's speech, movement, and handwriting.
  • In order to overcome the challenges of static media, an event-driven system is needed. Such a system records an environment through the events that take place within it. This results in the generation of quantifiable and manipulable data at a small, fixed bandwidth cost. The environment is then rendered on the user's computer from the events transmitted online.
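
As an illustration of the event-driven capture described above, the sketch below models a session as a stream of small time-stamped event records that a client can replay by re-rendering the environment, rather than as encoded video. The event names and fields are hypothetical, chosen only to make the idea concrete.

```python
import json
from dataclasses import dataclass, asdict

@dataclass
class CaptureEvent:
    """One time-stamped event; the end user's computer re-renders the
    environment from these records instead of decoding a video stream."""
    timestamp: float   # seconds since the start of the session
    source: str        # e.g. "board", "tracking", "microphone" (illustrative names)
    kind: str          # e.g. "stroke", "skeleton", "utterance"
    payload: dict      # small, quantifiable, manipulable data

def serialize(events):
    """Encode events as newline-delimited JSON; the cost per event is small
    and roughly fixed, independent of scene complexity."""
    return "\n".join(json.dumps(asdict(e)) for e in events)

if __name__ == "__main__":
    events = [
        CaptureEvent(0.033, "board", "stroke", {"points": [[0.10, 0.42], [0.11, 0.43]]}),
        CaptureEvent(0.040, "tracking", "skeleton", {"hand_r": [1.2, 1.5, 0.3]}),
    ]
    print(serialize(events))
```
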
  • a complete interactive information input and retrieval system may include a board system to capture all movements on the board's surface, and a tracking system to capture all physical movements.
  • the board system and tracking system can be used either individually or together.
  • the board system and the tracking system can communicate with each other through a network, and a control device (such as a laptop, desktop, mobile phone and tablet) can be used to control the board system and tracking system through the network.
  • the board system may include one or more board units, and a tracking region may be defined in a predetermined distance from the board unit(s). More particularly, the tracking region is the total area where the tracking system can track individual(s) or objects in front of the board unit(s).
  • each tracking unit of the tracking system is equipped with at least one 3D sensor, each of which communicates with each other through the network and is used to track the movement and speech of each individual in the tracking region.
  • the sensors are configured to track the skeletons of each individual and map the environment of the tracking region.
  • the tracking system can also track motion that is not part of the individual's skeleton.
  • the following configurations can be attached to the board unit to detect movement (including writing and erasing) on the board's surface:
  • the tracking system can be operated with see-through augmented reality glasses to enable the user to manipulate virtual objects using physical movement. More specifically, the tracking system can fix the virtual objects to the user's location to allow easy manipulation, or allow the object to be fixed or move independently of the user's location in virtual space. In a further embodiment, the user can supplement the tracking system with wearable motion controllers or markers to enable finer motion control.
  • a portable interactive information input and retrieval system may include a touch-screen display, as part of a computing device, or a digital pen to capture all movements on the device's surface.
  • the digital pen or touch-screen display are the only components and are not accompanied by a tracking system or board system. Therefore the physical environment can be generated dynamically based on user preference and user tracking is interpolated based on the movements on the device's surface.
  • the content player on the end user's terminal contains a button that allows users to ask questions in real-time during asynchronous or live online lectures, wherein the question button on the content player allows users to ask a question that is time-stamped to a specific point in the lecture. Furthermore, the content player keeps a time-synced stream of questions available to the user, as well as providing visual cues within the virtual environment.
  • FIG. 1 illustrates one aspect [complete] of a virtual and augmented reality instruction system in the present invention.
  • FIGs. 2a-2b illustrate the tracking system having one or more tracking units with overlapping fields of view (FOV) in the present invention.
  • FIGs. 3a-3b illustrate one further aspect [portable] of a virtual and augmented reality instruction system in the present invention.
  • FIGs. 4a-4c illustrate an ultrasonic configuration attached to the board unit to detect movement (including writing and erasing) on the board's surface in the present invention.
  • FIGs. 5a-5c illustrate a laser configuration attached to the board unit to detect movement (including writing and erasing) on the board's surface in the present invention.
  • FIGs. 6a-6c illustrate an infrared configuration attached to the board unit to detect movement (including writing and erasing) on the board's surface in the present invention.
  • FIGs. 7a-7b illustrate a 3D sensor configuration attached to the board unit to detect movement (including writing and erasing) on the board's surface in the present invention.
  • FIG. 8 illustrates the use of augmented reality within the tracking system in the present invention.
  • FIG. 9 illustrates a question button on the content player that allows users to ask questions in real-time during asynchronous or live online lectures, wherein the question button on the content player allows users to ask a question that is time-stamped to a specific point in the lecture.
  • FIG. 10 illustrates the flow of data for all aspects of a virtual and augmented reality instruction system in the present invention.
  • 3D sensor refers to devices which capture depth data with or without accompanying image data. Devices are also commonly referred to as depth or RGBD sensors. The use of 3D sensors within the invention is independent of the method used to obtain either the depth or image data. These methods include but are not limited to structured light, time-of-flight (TOF) and stereo.
  • a virtual and augmented reality instruction system may come in a complete format 100 and a portable format 200/300. Both formats of the system achieve the same results, with the exception that the portable format 200/300 computer-generates the data it does not capture.
  • data includes, but is not limited to, movement of users, objects (stationary and moving), environment geometry, and environment/background noise.
  • a virtual and augmented reality instruction system in a portable format 200/300 may come in the following embodiments, which follow a data processing method, as outlined in FIG. 10, once data has been captured.
  • the handwriting can be tracked via any touch-enabled device 210 such as touch-screen displays or tablet devices.
  • the user may use his finger or a soft-tipped (mechanical) pen 220 to interact with the touch-enabled device 210.
  • the audio may either be captured by the use of an external microphone 230 or a built-in microphone if available on the touch-enabled device 210.
  • the handwriting can be tracked via a digital pen 320.
  • the user may be required to use paper 310 specific to the digital pen 320.
  • the audio may either be captured by the use of an external microphone 330 or a built-in microphone if available on the digital pen 320.
  • the data processing method for a virtual and augmented reality instruction system in a portable format 200/300 involves two types of data: writing 11 and audio 12.
  • the touch-enabled device 210 or digital pen 320 captures the writing 21 and the microphone 230/330 captures the audio 22.
  • the writing is captured at an approximate minimum frequency of 30Hz, so the writing data captured is a stroke within a character or drawing.
  • the handwriting strokes and audio are time-stamped (30).
  • the following step involves running the raw data through their respective engines.
  • the time-stamped, handwriting strokes are processed through both a Natural Movement Engine (NME) 41a and a Handwriting Recognition Engine (HRE) 41b, and the time-stamped audio is processed through both a Facial Animation Engine (FAE) 42a and an Audio Transcription Engine (ATE) 42b.
  • NME 41a uses the position of the current handwriting stroke in relation to prior captured writing and the absolute boundaries of the device (or medium) used to capture the writing to interpolate the movement of the avatar within virtual space.
  • the NME 41a also determines minute movements, both involuntary and voluntary, to supplement the interpolated movements; increasing the level of realism of the computer generated motion.
  • Such minute movements include, but are not limited to, shifting weight between legs, directing the avatar's focus towards written text, tapping, and breathing.
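
As a rough sketch of the interpolation the Natural Movement Engine 41a is described as performing, the function below maps the current stroke position, normalized against the capture surface's absolute boundaries, to a plausible standing position and hand target for the avatar in front of a virtual board. The board dimensions, writable height band, and reach are assumed values, not figures from the disclosure.

```python
def interpolate_avatar(stroke_x, stroke_y, surface_w, surface_h,
                       board_width_m=3.0, reach_m=0.6):
    """Interpolate avatar placement from the latest handwriting stroke.

    stroke_x, stroke_y   -- stroke position on the capture device
    surface_w, surface_h -- absolute boundaries of the capture surface
    board_width_m        -- assumed width of the virtual board (metres)
    reach_m              -- assumed comfortable horizontal reach of the avatar
    """
    # Normalize the stroke into [0, 1] within the surface boundaries.
    u = min(max(stroke_x / surface_w, 0.0), 1.0)
    v = min(max(stroke_y / surface_h, 0.0), 1.0)

    # Hand target on the virtual board (x across the board, y up from the floor).
    hand_x = u * board_width_m
    hand_y = 2.0 - v * 1.0          # assumed writable band between 1.0 m and 2.0 m

    # Stand slightly to the side of the hand target so the body
    # does not occlude the writing, clamped to the board's width.
    body_x = min(max(hand_x - reach_m, 0.0), board_width_m)
    return {"body_x": body_x, "hand_target": (hand_x, hand_y)}

# Example: a stroke near the top-right of a 1920x1080 touch surface.
print(interpolate_avatar(1800, 120, 1920, 1080))
```
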
  • the HRE 41b combines the handwriting strokes over time and converts them into text and/or drawings.
  • the HRE 41b outputs the current handwriting stroke and any new text or drawings it was able to decipher from prior handwriting strokes.
  • the FAE 42a uses the audio captured by the microphone 230/330 to generate facial animations corresponding to the words spoken by the user, and the ATE 42b transcribes the user's speech from the microphone 230/330 into text. After the raw data has been processed by the various engines mentioned above, the data from each engine is synced (50) to its respective time-stamp.
  • the virtual and augmented reality instruction system in a complete format 100 may include a board system 110 to capture all movements on the board's surface, and a tracking system 120 to capture all physical movements.
  • the board system 110 and tracking system 120 can be used either individually or jointly.
  • the board system 110 and the tracking system 120 can communicate with each other through a network, and control devices 130 (such as laptops, desktops, remote servers, mobile phones or tablets) can be used to control the board system 110 and tracking system 120 through the network as well.
  • the board system 110 may include one or more board unit(s) 111, and a tracking region 122 may be defined in a predetermined distance from the board unit(s) 111. More particularly, the tracking region 122 is the total area where the tracking system 120 can track individual(s) in front of the board unit(s) 111. Within each board unit 111 is a computing device which determines the movements based on sensor outputs, and those movements are then transmitted to the controlling devices 130. Each board, such as a whiteboard or chalkboard, may have one or more units attached to its surface depending on the size of the board and configuration of the board unit 111.
  • the tracking system 120 may include one or more tracking units 121 as well. Each tracking unit 121 is used to create a continuous field-of-view (FOV) or tracking region 122 among the sensors. This is achieved through the registration of the overlap regions 123 between the individual FOVs of each tracking unit 121, as can be seen in FIG. 2.
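
One generic way to register the overlap regions 123 between the FOVs of two tracking units is to estimate the rigid transform that maps 3D points observed by one unit onto the same points observed by its neighbour; the Kabsch/Procrustes solution below is a standard sketch of that step and is not asserted to be the specific registration method used.

```python
import numpy as np

def register_overlap(points_a, points_b):
    """Estimate rotation R and translation t such that R @ p_a + t ~= p_b
    for corresponding 3D points seen by two tracking units in their
    overlap region (Kabsch / orthogonal Procrustes)."""
    A = np.asarray(points_a, float)
    B = np.asarray(points_b, float)
    ca, cb = A.mean(axis=0), B.mean(axis=0)
    H = (A - ca).T @ (B - cb)
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))        # guard against a reflection
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = cb - R @ ca
    return R, t

# Three shared points: unit B is offset 0.5 m along x relative to unit A.
a = [[0.0, 0.0, 2.0], [1.0, 0.0, 2.0], [0.0, 1.0, 2.5]]
b = [[0.5, 0.0, 2.0], [1.5, 0.0, 2.0], [0.5, 1.0, 2.5]]
R, t = register_overlap(a, b)
print(np.round(R, 3), np.round(t, 3))   # ~identity rotation, t ~ [0.5, 0, 0]
```
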
  • each tracking unit 121 of the tracking system 120 is equipped with at least one 3D sensor, communicating with each other through the network, and is used to track the movement and speech of each individual in the tracking region 122.
  • the sensors are configured to track the skeletons of each individual and are used to map the environment of the tracking region.
  • the tracking system 120 can also track motion that is not part of the individual's skeleton.
  • the tracking system 120 can also track the movement of a moving object like a ball travelling in the air.
  • Each tracking unit 121 can be equipped with a microphone to conduct speech capturing, motion tracking, and environment noise capturing.
  • the process can be assisted by using additional microphones.
  • the user would use a personal microphone 131 attached to a mobile computer (mobile phone or tablet) acting as a control device 130, as can be seen in FIG. 1.
  • the personal microphone 131 would therefore act as the primary audio channel while the microphone within each tracking unit 121 would act as supplementary audio channels for the audio of a specific user.
  • mapping the environment of the tracking region 122 may include analyzing the image and depth data produced by the sensor to determine what objects, besides the individuals, are present. These objects may include desks, chairs, trash cans, podiums, etc., which will then be re-created in the virtual environment displayed on the end user's computer.
  • an ultrasonic configuration is shown.
  • the ultrasonic configuration may include two or more pairs of ultrasonic receivers 410 attached to the board unit 111.
  • the ultrasonic receivers 410 are in pairs as at least three points are needed for triangulation.
  • Each pair of ultrasonic receivers 410 receive transmissions from a chalk, pen, or eraser holder transmitting at the same respective frequency.
  • the triangulation determines the position of the chalk, pen, or eraser holder through the strength of the signal in relation to the location of the board unit 111.
  • FIG. 4b shows a chalk/pen holder 420 having a trigger 421, ultrasonic transmitter 422, and pressure sensor 423.
  • the trigger 421 is used to load/unload the pen/chalk from the holder, while the ultrasonic transmitter 422 is configured to send out a signal at the corresponding frequency of its receiver.
  • the pressure sensor 423 determines when the holder is being used (pressure between board, chalk/pen, and sensor) and activates the signal transmission.
  • FIG. 4c shows an eraser holder 430 having an accelerometer and gyroscope 431, ultrasonic transmitter 432, and pressure sensor 433.
  • the accelerometer and gyroscope 431 are used to determine the eraser's orientation, because the signal transmission is orientation independent, while the ultrasonic transmitter 432 sends out a signal at the corresponding frequency of its receiver.
  • the pressure sensor 433 determines when the holder is being used (pressure between the board, eraser, and sensor) and activates the signal transmission.
  • pairs of ultrasonic receivers 410 can be attached to a board unit 111 to detect movements (writing/erasing) on the board's surface, and the pen/chalk holder 420 and eraser 430 are used to transmit movements (writing/erasing).
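
The triangulation described for this configuration can be illustrated by a least-squares intersection of distance circles on the board plane, where each receiver's signal strength has already been converted to an approximate distance by some calibration (that conversion is assumed here, not specified by the disclosure).

```python
import numpy as np

def locate_holder(receivers, distances):
    """Least-squares 2D position of a transmitting holder on the board
    plane, given >= 3 receiver positions and the distances estimated
    from their received signal strengths."""
    R = np.asarray(receivers, float)
    d = np.asarray(distances, float)
    # Subtract the first circle equation from the rest to obtain a linear system.
    A = 2.0 * (R[1:] - R[0])
    b = (d[0] ** 2 - d[1:] ** 2
         + (R[1:] ** 2).sum(axis=1) - (R[0] ** 2).sum())
    pos, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pos

# Two pairs of receivers at the corners of a 2 m x 1 m board (positions in metres).
rx = [(0.0, 0.0), (2.0, 0.0), (0.0, 1.0), (2.0, 1.0)]
true_pos = np.array([1.3, 0.4])
est_dist = [np.linalg.norm(true_pos - np.array(r)) for r in rx]
print(np.round(locate_holder(rx, est_dist), 3))   # ~[1.3, 0.4]
```
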
  • a scanning range finder laser 510 can be attached to the board unit 111 to detect movement (including writing and erasing) on the board's surface.
  • the scanning range finder laser 510 can make multiple scans of the board unit 111 per second. Each scan will provide a 180-degree range pattern that is used to determine the presence and movement of chalk, pens (520), or erasers (531, 532), as shown in FIGs. 5b and 5c. Because this configuration is based on patterns, not triangulation, no additional holders are required since we are testing for the geometry of chalk, pens, and erasers in the range pattern from each scan, from which the detection patterns are analyzed to detect movements (writing/erasing).
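
A minimal sketch of that pattern test: each 180-degree scan is grouped into contiguous returns, and each group's approximate physical width decides whether it matches pen/chalk or eraser geometry. The angular resolution and width thresholds below are assumptions for illustration only.

```python
import math

def classify_scan(ranges, angle_step_deg=0.5,
                  pen_width=(0.005, 0.02), eraser_width=(0.03, 0.15)):
    """Group consecutive returns of a 180-degree scan into segments and
    classify each segment by its approximate width on the board plane."""
    segments, current = [], []
    for i, r in enumerate(ranges):
        if r is not None:                 # None means no return at this angle
            current.append((i, r))
        elif current:
            segments.append(current)
            current = []
    if current:
        segments.append(current)

    detections = []
    for seg in segments:
        (i0, r0), (i1, r1) = seg[0], seg[-1]
        mean_r = (r0 + r1) / 2.0
        width = math.radians((i1 - i0 + 1) * angle_step_deg) * mean_r
        angle = math.radians((i0 + i1) / 2.0 * angle_step_deg)
        if pen_width[0] <= width <= pen_width[1]:
            detections.append(("pen/chalk", angle, mean_r))
        elif eraser_width[0] <= width <= eraser_width[1]:
            detections.append(("eraser", angle, mean_r))
    return detections

# 360 samples over 180 degrees; a pen-sized return around 45 degrees at 0.8 m.
scan = [None] * 360
scan[89] = scan[90] = 0.8
print(classify_scan(scan))
```
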
  • an infrared configuration is shown in FIG. 6a, which includes an array of infrared cameras 610 attached to the board unit 111.
  • the infrared cameras 610 are arranged with their FOVs overlapping to prevent gaps in detection on the board's surface.
  • the triangulation determines the position of the chalk, pen, or eraser holder through the strength of the infrared light in relation to the location of the infrared camera 610 that detects it.
  • FIG. 6b shows a chalk/pen holder 620 having a trigger 621, infrared emitter 622, and pressure sensor 623.
  • the trigger 621 is used to load/unload the pen/chalk from the holder, while the infrared emitter 622 sends out infrared light.
  • the pressure sensor 623 determines when the holder is being used (pressure between board, chalk/pen, and sensor) and activates the infrared emitter 622 of the chalk/pen holder 620.
  • an eraser holder 630 may include an array of infrared emitters 631 and a pressure sensor 632.
  • the array of infrared emitters 631 is positioned around the eraser holder 630, so the board unit 111 will be able to distinguish the pen/chalk from the eraser because the eraser's infrared light will be captured in a linear shape in contrast to the single point generated by the pen/chalk.
  • the pressure sensor 632 determines when the holder is being used (pressure between board, eraser, and sensor) and activates the infrared emitter.
  • the array of infrared cameras 611 can be attached to the board unit 111 to detect movements (writing/erasing) on the board's surface, and the pen/chalk holder 620 and eraser 630 are used to transmit movements (writing/erasing).
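
A sketch of how the point-versus-line distinction mentioned above might be made: the pixels of a detected infrared blob are tested for elongation, a compact blob being read as the single point of a pen/chalk holder and an elongated one as the eraser holder's emitter array. The elongation threshold is an assumption.

```python
import numpy as np

def classify_ir_blob(pixels, elongation_threshold=4.0):
    """Classify detected infrared pixels as a pen/chalk point or an
    eraser line by the elongation of their spatial distribution."""
    pts = np.asarray(pixels, float)
    if len(pts) < 3:
        return "pen/chalk"                      # too few pixels to form a line
    eigvals = np.sort(np.linalg.eigvalsh(np.cov(pts.T)))[::-1]
    elongation = eigvals[0] / max(eigvals[1], 1e-9)
    return "eraser" if elongation >= elongation_threshold else "pen/chalk"

print(classify_ir_blob([(10, 10), (11, 10), (10, 11), (11, 11)]))       # compact point
print(classify_ir_blob([(x, 20 + 0.05 * x) for x in range(30)]))        # linear shape
```
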
  • a 3D (three-dimensional) sensor 710 can be attached to the board unit 111 to detect movement (including writing and erasing) on the board's surface.
  • the 3D sensors 710 are arranged with their FOVs overlapping to prevent gaps in detection on the board's surface.
  • the captured depth/image data is processed through a hand tracking algorithm, which is a subset of the Skeleton Tracking Engine 43a shown in FIG. 10.
  • the depth/image data is further processed to determine if the user is holding an eraser or chalk/pen.
  • the distance between the eraser or chalk/pen and the board's surface is tested to determine if it is in use. Because this configuration is based on hand tracking, no additional holders are required since we are analyzing the depth/image data produced by the 3D sensor 710 to detect movements (writing/erasing).
  • a group of 3D sensors 710 can be attached to the board unit 111.
  • the 3D sensors 710 are arranged in an arc of 180 degrees. For example, three 3D sensors 710 would be required if each 3D sensor 710 had a horizontal field-of-view of 60 degrees.
  • the board unit 111 analyzes the volume of depth data immediately above the board's surface. The resulting data will provide a 180-degree range pattern identical to those in FIGs. 5b and 5c. The range pattern is used to determine the presence and movement of chalk, pens, or erasers. Because this configuration is based on patterns, not triangulation, no additional holders are required since we are testing for the geometry of chalk, pens, and erasers in the range pattern from each analysis, from which the detection patterns are analyzed to detect movements (writing/erasing).
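
The reduction of the depth volume immediately above the board's surface into a range pattern can be pictured as below: points within an assumed slab thickness of the board plane are collapsed to polar coordinates around the board unit, giving the same kind of 180-degree pattern the laser configuration produces. The plane model and slab thickness are assumptions.

```python
import numpy as np

def depth_to_range_pattern(points, board_z=0.0, slab_m=0.02, bins=360):
    """Collapse 3D points lying within `slab_m` of the board plane
    (z ~ board_z) into a 180-degree range pattern around the board
    unit at the origin, analogous to a single laser scan."""
    pts = np.asarray(points, float)
    near = pts[np.abs(pts[:, 2] - board_z) <= slab_m]
    pattern = [None] * bins
    for x, y, _ in near:
        angle = np.degrees(np.arctan2(y, x))      # 0..180 degrees in front of the unit
        if not 0.0 <= angle <= 180.0:
            continue
        i = min(int(angle / 180.0 * (bins - 1)), bins - 1)
        r = float(np.hypot(x, y))
        if pattern[i] is None or r < pattern[i]:  # keep the closest return per bin
            pattern[i] = r
    return pattern

# One point touching the board plane, one hand hovering well above it.
cloud = [(0.3, 0.4, 0.005), (0.1, 0.9, 0.200)]
pattern = depth_to_range_pattern(cloud)
print([(i, round(r, 3)) for i, r in enumerate(pattern) if r is not None])
```
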
  • the aforementioned embodiments of the board unit 111 and their corresponding figures are single representations for the use of ultrasonic, laser, infrared, and 3D sensors.
  • the position, rotation, and combination of sensors may differ according to the size and shape of the board's (whiteboard or chalkboard) surface, as well as lighting conditions of the environment.
  • the augmented reality glasses 810 need to be transparent with an integrated display in order to allow the user to continue navigating in real space without obstructed vision. More importantly, the augmented reality glasses will not require an additional camera, which is usually a requirement, due to the existence of the tracking system 120.
  • the augmented reality glasses 810, which are worn by the user, will display the virtual representation of an object 840 within the tracking region 122. This is possible because the tracking system 120 is constantly aware of the user's location and orientation and of the geometry of the environment.
  • the virtual object can be free to be positioned or moved anywhere within the tracking region 122.
  • the virtual object 840's position can be fixed in front of the user, regardless of the user's movements, allowing the user quick and easy access to the object throughout the tracking region 122.
  • the tracking system 120 will then allow the instructor to interact with the virtual object 840 by tracking the user's movements and translating them onto the virtual object 840.
  • the virtual object 840 can also be bound by the laws of physics, so the forces translated onto it will be proportional to the user's movement.
  • the interaction between the user and the virtual object 840 will then be displayed on the end user's computer.
  • the user can supplement the tracking system 120 with wearable motion controllers or markers to enable finer motion control.
  • the 3D sensors within each tracking unit 121 have a finite degree of accuracy and certain scenarios may require increased accuracy. Such an example includes interacting with virtual objects via augmented reality. Therefore, FIG. 8 shows the user wearing four additional motion sensors on his arms 820 and legs 830. These sensors will supply additional data to the Skeleton Tracking Engine 43a allowing more subtle and accurate movements to be captured.
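
A simplified sketch of fixing a virtual object in front of the user from the tracking system's pose estimate: each time the user's position and orientation update, the object is re-placed a fixed distance along the user's facing direction. The offset distance and the yaw convention are assumptions.

```python
import math

def follow_user(user_position, user_yaw_deg, offset_m=0.75):
    """Place a virtual object a fixed distance in front of the user,
    using the head position (x, y, z) and yaw reported by the tracking
    system; the object then follows the user through the tracking region."""
    yaw = math.radians(user_yaw_deg)
    forward = (math.sin(yaw), math.cos(yaw))      # assumed convention: yaw 0 faces +y
    return (user_position[0] + offset_m * forward[0],
            user_position[1] + offset_m * forward[1],
            user_position[2])                     # keep the object at head height

print(follow_user((1.0, 2.0, 1.6), 0.0))    # object straight ahead along +y
print(follow_user((1.0, 2.0, 1.6), 90.0))   # object ahead along +x after turning
```
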
  • the data processing method for a virtual and augmented reality instruction system in a complete format 100 may include five types of data: movement 13, writing 14, user audio 15, background audio 16, and video 17. Each type of data is captured in the following manners:
  • the tracking system 120 captures the depth data (movement). (23)
  • the microphone 131 connected to a mobile control device 130 captures the user audio and is supplemented by the microphone within each tracking unit 121.
  • the microphone within each tracking unit 121 captures the background audio.
  • the writing is captured at an approximate minimum frequency of 30Hz. Therefore the writing data captured is a stroke within a character or drawing.
  • the movement captured by the tracking system 120 is in the form of depth data.
  • Each frame of data consists of a map of depth values.
  • the depth frames, handwriting strokes, user audio, background audio, and video frames are time-stamped (30). The following step involves running the raw data through their respective engines.
  • the time-stamped depth frames are processed through a Skeleton Tracking Engine (STE) 43a and Object Tracking Engine (OTE) 43b.
  • the STE identifies the skeletons of users within each frame of data.
  • the skeleton data is then shared with the OTE 43b, which captures the movements of non-skeleton objects and calculates the position, rotation, and velocity of virtual objects.
  • the time-stamped, handwriting strokes are processed through a Handwriting Recognition Engine (HRE) 44.
  • the HRE 44 combines the handwriting strokes over time and converts them into text and/or drawings.
  • the HRE 44 outputs the current handwriting stroke and any new text or drawings it was able to decipher from prior handwriting strokes.
  • the time-stamped user audio is processed through a Facial Animation Engine (FAE) 45a and Audio Transcription Engine (ATE) 45b.
  • the FAE uses the audio captured by the microphone to generate facial animations corresponding to the words spoken by the user, and the ATE transcribes the user's speech into text.
  • the time-stamped video frames are processed through a Video Processing Engine (VPE) 47.
  • VPE 47 registers the frames from each tracking unit 121 together and compresses the resulting data.
  • the data from each engine and background audio are synced (50) to their respective time-stamps.
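
The syncing step (50) can be pictured as aligning each engine's time-stamped output onto a common playback clock, taking for every tick the latest value at or before it. This is a generic sketch of timestamp alignment, not the specific data layout used by the system.

```python
from bisect import bisect_left

def sync_streams(streams, tick=1.0 / 30.0, duration=None):
    """Merge per-engine outputs, each a list of (timestamp, value) sorted by
    timestamp, into frames on a common clock by taking the latest value at
    or before each tick."""
    if duration is None:
        duration = max(s[-1][0] for s in streams.values() if s)
    frames, t = [], 0.0
    while t <= duration + 1e-9:
        frame = {"t": round(t, 4)}
        for name, samples in streams.items():
            times = [ts for ts, _ in samples]
            i = bisect_left(times, t + 1e-9) - 1
            frame[name] = samples[i][1] if i >= 0 else None
        frames.append(frame)
        t += tick
    return frames

streams = {
    "skeleton": [(0.000, "pose0"), (0.033, "pose1")],
    "writing":  [(0.030, "stroke0")],
    "audio":    [(0.000, "chunk0"), (0.020, "chunk1")],
}
for frame in sync_streams(streams, duration=0.04):
    print(frame)
```
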
  • both the portable 200/300 and complete 100 formats of a virtual and augmented reality instruction system will have identical data formatting and will therefore be treated the same.
  • the processed data can be stored for asynchronous playback 61, streamed to the end-user 62, and/or outputted to a course building software 63.
  • Outputting the processed data to a course building software 63 allows the user to preview the re-created virtual environment and make changes. For example, the user can re-enact certain portions of the session or re-write some of the writing using only the portable format 200/300 or the board system 110. Furthermore, the user can also make changes to the environment and his avatar to his preference.
  • Outputting the processed data to either a live stream 62 or storing for asynchronous playback 61 may include sending the data to a content player 70 on the end-user's computer.
  • the content player 70 serves the data in the following formats:
  • Text 81: The end-user will have access to a stream of both the handwritten and text data for use when only the text is necessary.
  • Audio 82: The end-user will have access to a stream of the audio data of the entire session for use when visual playback is not necessary.
  • Partial Rendering 84: The end-user will have access to a stream of the handwritten data in the representation of writing on a board or paper. This will be accompanied by the rendering of a 3D avatar. The avatar will be controlled by the audio, motion, and facial animation data generated. The 3D avatar may either be rendered completely or partially, such as a talking head.
  • Video 85: The end-user will have access to a stream of the video data generated.
  • the content player 70 can include a question button 910 that allows end-users to ask questions in real-time during asynchronous or live online lectures.
  • FIG. 9 is one embodiment of the content player 70 being used to display a text stream 81.
  • end-users will also be able to help other end-users by answering questions to which they know the answer while watching the lecture, resulting in reinforced learning.
  • the question button 910 allows end-users to ask a question that is time-stamped to a specific point in the lecture. Clicking the button results in the appearance of a text-field 911 with a time-stamp 912 corresponding to that point in the lecture and a submission button 913.
  • the content player 70 can also display prior questions in three ways.
  • ticks 941 are shown on the timeline to indicate when a question was asked. Furthermore, if multiple questions were asked at specific points, the ticks are expanded above the timeline (zoom above timeline) 940.
  • the content player 70 can also display prior questions by marking the text or objects in the virtual environment. Such markings may include highlighting 920, ticks 921, or outlining 922.
  • a time-synced stream 930 can display questions on the side of the content player 70. The area corresponding to each question 931 will be clickable allowing the browser to focus the page to a new window or section of the current window where an expanded view of the question and its answers are displayed.
  • Each question within the time-synced stream 930 contains an "Answer" button 932 allowing the end-user to answer the question in the same manner in which questions are asked 911, 912, 913. Furthermore, hovering over or selecting markings or ticks in the first two embodiments results in the time-synced stream 930 scrolling to the respective question(s). In addition, hovering over or selecting the clickable area of a question 931 results in the markings 920, 921, 922 (of the second embodiment) of the corresponding question becoming highlighted and/or focused; allowing the end-user to easily focus on the source of the question.
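
A small data-model sketch for the time-stamped question stream: questions are stored sorted by lecture time-stamp, the set of time-stamps backs the timeline ticks, and a window query backs the stream's scrolling behaviour. Field and method names are illustrative only.

```python
from bisect import bisect_left
from dataclasses import dataclass, field

@dataclass
class Question:
    timestamp: float                   # seconds into the lecture
    text: str
    answers: list = field(default_factory=list)

class QuestionStream:
    """Holds questions sorted by lecture time-stamp, backing both the
    timeline ticks and the time-synced side stream."""
    def __init__(self):
        self._times, self._questions = [], []

    def ask(self, timestamp, text):
        q = Question(timestamp, text)
        i = bisect_left(self._times, timestamp)
        self._times.insert(i, timestamp)
        self._questions.insert(i, q)
        return q

    def ticks(self):
        """Time-stamps to mark on the timeline (duplicates collapse into one tick)."""
        return sorted(set(self._times))

    def near(self, playback_time, window=5.0):
        """Questions the stream should scroll to around the current playback time."""
        return [q for q in self._questions
                if abs(q.timestamp - playback_time) <= window]

stream = QuestionStream()
stream.ask(754.0, "Why is the force proportional to the user's movement?")
stream.ask(754.0, "Which engine generates the facial animation?")
print(stream.ticks(), [q.text for q in stream.near(756.0)])
```
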

Abstract

A virtual and augmented reality instruction system may come in a complete format and a portable format. The complete format may include a board system to capture all movement (including writing and erasing) on the board's surface, and a tracking system to capture all physical movement. The portable format, which may include a touch-enabled device or a digital pen and a microphone, is designed to capture a subset of the data captured by the complete format. In one embodiment of the complete format, the board system and the tracking system can communicate with each other through a network, and control devices (such as a laptop, desktop, mobile phone and tablet) can be used to control the board system and the tracking system through the network. In other embodiments of the complete format, augmented reality can be achieved within the tracking system by combining 3D sensors with transparent augmented reality glasses.
PCT/US2013/072493 2012-11-29 2013-11-29 Système d'instruction à réalité virtuelle et augmentée WO2014085768A1 (fr)

Priority Applications (9)

Application Number Priority Date Filing Date Title
AU2013351959A AU2013351959B2 (en) 2012-11-29 2013-11-29 Virtual and augmented reality instruction system
US17/269,969 US11694565B2 (en) 2012-11-29 2013-11-29 Virtual and augmented reality instruction system
ES13858451T ES2893410T3 (es) 2012-11-29 2013-11-29 Sistema de instrucción de realidad virtual y aumentada
EP21189799.6A EP3968135A1 (fr) 2012-11-29 2013-11-29 Système d'instruction à réalité virtuelle et augmentée
CN201380062587.2A CN105247453A (zh) 2012-11-29 2013-11-29 虚拟和增强现实教学系统
CA2892958A CA2892958C (fr) 2012-11-29 2013-11-29 Systeme d'instruction a realite virtuelle et augmentee
DK13858451.1T DK2926224T3 (da) 2012-11-29 2013-11-29 Instruktionssystem til virtuel og augmented reality
EP13858451.1A EP2926224B1 (fr) 2012-11-29 2013-11-29 Système d'instruction à réalité virtuelle et augmentée
AU2021261950A AU2021261950B2 (en) 2012-11-29 2021-11-05 Virtual and augmented reality instruction system

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201261731111P 2012-11-29 2012-11-29
US61/731,111 2012-11-29
US201361792771P 2013-03-15 2013-03-15
US61/792,771 2013-03-15

Publications (1)

Publication Number Publication Date
WO2014085768A1 true WO2014085768A1 (fr) 2014-06-05

Family

ID=50828517

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2013/072493 WO2014085768A1 (fr) 2012-11-29 2013-11-29 Système d'instruction à réalité virtuelle et augmentée

Country Status (8)

Country Link
US (1) US11694565B2 (fr)
EP (2) EP3968135A1 (fr)
CN (1) CN105247453A (fr)
AU (2) AU2013351959B2 (fr)
CA (1) CA2892958C (fr)
DK (1) DK2926224T3 (fr)
ES (1) ES2893410T3 (fr)
WO (1) WO2014085768A1 (fr)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107437343A (zh) * 2016-05-25 2017-12-05 中央大学 交互式教学系统以及方法
CN107635057A (zh) * 2017-07-31 2018-01-26 努比亚技术有限公司 一种虚拟现实终端控制方法、终端和计算机可读存储介质
CN109308132A (zh) * 2018-08-31 2019-02-05 青岛小鸟看看科技有限公司 虚拟现实的手写输入的实现方法、装置、设备及系统
CN109542252B (zh) * 2018-12-06 2021-05-18 中国科学院长春光学精密机械与物理研究所 一种遥控轨迹笔及其使用方法及电缆虚拟预装系统
CN109901714A (zh) * 2019-02-28 2019-06-18 淮北幻境智能科技有限公司 一种电子纸笔系统及其控制方法
CN109979269A (zh) * 2019-03-28 2019-07-05 王雍天 一种基于人工智能的在线教育交互系统
US11694380B2 (en) 2020-11-13 2023-07-04 Zoltan GELENCSER System and method for immersive telecommunications
CN115933868B (zh) * 2022-10-24 2023-08-04 华中师范大学 翻转讲台的立体综合教学场系统及其工作方法

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1059970A2 (fr) * 1998-03-03 2000-12-20 Arena, Inc, Dispositif et technique de suivi et d'estimation de la dexterite de mouvements dans un espace pluridimensionnel
US9229540B2 (en) * 2004-01-30 2016-01-05 Electronic Scripting Products, Inc. Deriving input from six degrees of freedom interfaces
SG155167A1 (en) * 2004-08-03 2009-09-30 Silverbrook Res Pty Ltd Walk-up printing
US9940589B2 (en) * 2006-12-30 2018-04-10 Red Dot Square Solutions Limited Virtual reality system including viewer responsiveness to smart objects
US20090058850A1 (en) * 2007-09-04 2009-03-05 Wey Fun System and method for intuitive interactive navigational control in virtual environments
JP2009145883A (ja) * 2007-11-20 2009-07-02 Rissho Univ 学習システム、記憶媒体及び学習方法
WO2009155483A1 (fr) * 2008-06-20 2009-12-23 Invensys Systems, Inc. Systèmes et procédés pour une interaction immersive avec des équipements réels et/ou simulés pour un contrôle de processus, environnemental et industriel
KR20090132914A (ko) * 2008-06-23 2009-12-31 주식회사 히씽크 페이셜애니메이션제어방법 및 3차원 게임 엔진 기반의실시간 대화형 원격강의시스템
US20130249947A1 (en) * 2011-08-26 2013-09-26 Reincloud Corporation Communication using augmented reality
US9563265B2 (en) * 2012-01-12 2017-02-07 Qualcomm Incorporated Augmented reality with sound and geometric analysis
US20140214629A1 (en) * 2013-01-31 2014-07-31 Hewlett-Packard Development Company, L.P. Interaction in a virtual reality environment
WO2015066037A1 (fr) * 2013-10-28 2015-05-07 Brown University Procédés et systèmes de réalité virtuelle

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100302142A1 (en) * 1995-11-06 2010-12-02 French Barry J System and method for tracking and assessing movement skills in multidimensional space
US7676372B1 (en) * 1999-02-16 2010-03-09 Yugen Kaisha Gm&M Prosthetic hearing device that transforms a detected speech into a speech of a speech form assistive in understanding the semantic meaning in the detected speech
US20090153526A1 (en) * 2003-02-14 2009-06-18 Microsoft Corporation Determining the location of the tip of an electronic stylus
US20110096042A1 (en) * 2005-03-23 2011-04-28 Epos Development Ltd. Method and system for digital pen assembly
CA2595167A1 (fr) * 2006-07-31 2008-01-31 University Of New Brunswick Methode d'etalonnage de positions de capteur dans un systeme de mesure et d'analyse du mouvement humain
US20080149401A1 (en) * 2006-12-20 2008-06-26 3M Innovative Properties Company Untethered stylus employing separate communication channels
US20090058988A1 (en) * 2007-03-16 2009-03-05 Kollmorgen Corporation System for Panoramic Image Processing
US20120181934A1 (en) * 2009-08-05 2012-07-19 Koninklijke Philips Electronics N.V. Light guiding system and a method for controlling the same
US20120113092A1 (en) * 2010-11-08 2012-05-10 Avi Bar-Zeev Automatic variable virtual focus for augmented reality displays
US20120154511A1 (en) * 2010-12-20 2012-06-21 Shi-Ping Hsu Systems and methods for providing geographically distributed creative design
US20120229282A1 (en) * 2011-03-10 2012-09-13 Security Identification Systems Corporation a Florida Maritime Overboard Detection and Tracking System

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2926224A4 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9805511B2 (en) 2015-10-21 2017-10-31 International Business Machines Corporation Interacting with data fields on a page using augmented reality
CN106713896A (zh) * 2016-11-30 2017-05-24 世优(北京)科技有限公司 静态图像的多媒体呈现方法、装置和系统
CN109255990A (zh) * 2018-09-30 2019-01-22 杭州乔智科技有限公司 一种基于ar增强现实的教学系统
JP2020184172A (ja) * 2019-05-07 2020-11-12 株式会社インフォマティクス トラッキングデバイス、プロットシステム、プロット方法及びプログラム
CN111312012A (zh) * 2020-02-27 2020-06-19 广东工业大学 一种书法练习指引方法及装置
CN111312012B (zh) * 2020-02-27 2022-05-06 广东工业大学 一种书法练习指引方法及装置
WO2023141660A1 (fr) * 2022-01-24 2023-07-27 Freedom Trail Realty School, Inc. Systèmes et techniques pour sessions hybrides en direct et à distance à la demande

Also Published As

Publication number Publication date
CA2892958C (fr) 2022-04-19
AU2013351959A1 (en) 2015-07-09
CA2892958A1 (fr) 2014-06-05
CN105247453A (zh) 2016-01-13
AU2013351959B2 (en) 2021-08-19
US20220415197A1 (en) 2022-12-29
ES2893410T3 (es) 2022-02-09
AU2021261950A1 (en) 2021-12-02
EP2926224A4 (fr) 2016-10-12
DK2926224T3 (da) 2021-11-15
EP3968135A1 (fr) 2022-03-16
EP2926224B1 (fr) 2021-09-01
EP2926224A1 (fr) 2015-10-07
AU2021261950B2 (en) 2023-12-14
US11694565B2 (en) 2023-07-04

Similar Documents

Publication Publication Date Title
AU2021261950B2 (en) Virtual and augmented reality instruction system
EP2498237B1 (fr) Fourniture d'informations de position dans un environnement collaboratif
US20060092178A1 (en) Method and system for communicating through shared media
US20150277699A1 (en) Interaction method for optical head-mounted display
US20160117142A1 (en) Multiple-user collaboration with a smart pen system
WO2022120255A1 (fr) Carte d'informations virtuelles pour le partage d'informations collaboratives
JP6683864B1 (ja) コンテンツ制御システム、コンテンツ制御方法、およびコンテンツ制御プログラム
US11630633B1 (en) Collaborative system between a streamer and a remote collaborator
KR101757420B1 (ko) 투명 디스플레이를 이용한 상호작용 기반의 원격 화상 통신 및 강의 시스템
US20230043422A1 (en) Viewing terminal, viewing method, viewing system, and program
Cho et al. RealityReplay: Detecting and Replaying Temporal Changes In Situ Using Mixed Reality
Lui et al. Gesture-based interaction for seamless coordination of presentation aides in lecture streaming
US20230334790A1 (en) Interactive reality computing experience using optical lenticular multi-perspective simulation
US20230334792A1 (en) Interactive reality computing experience using optical lenticular multi-perspective simulation
Chu et al. A Study on AR Authoring using Mobile Devices for Educators.
He Enhancing Collaboration and Productivity for Virtual and Augmented Reality
KR20180033738A (ko) 디스플레이 장치에 설치되는 착탈식 터치 패널 장치
Eslami et al. SignCol: Open-Source Software for Collecting Sign Language Gestures
Stearns Handsight: A Touch-Based Wearable System to Increase Information Accessibility for People with Visual Impairments
Lala et al. Enhancing communication through distributed mixed reality
KR101165375B1 (ko) 가상현실을 이용한 학습정보 제공방법
Milekic Using eye-and gaze-tracking to interact with a visual display
WO2023215637A1 (fr) Expérience informatique de réalité interactive à l'aide d'une simulation multi-perspective lenticulaire optique
KR20230108398A (ko) 혼합현실을 기반 원격 강의 시스템
Champion et al. 3D in-world Telepresence With Camera-Tracked Gestural Interaction.

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13858451

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2892958

Country of ref document: CA

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2013858451

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2013351959

Country of ref document: AU

Date of ref document: 20131129

Kind code of ref document: A