WO2014037876A2 - Augmented reality for video system - Google Patents
Augmented reality for video system Download PDFInfo
- Publication number
- WO2014037876A2 WO2014037876A2 PCT/IB2013/058254 IB2013058254W WO2014037876A2 WO 2014037876 A2 WO2014037876 A2 WO 2014037876A2 IB 2013058254 W IB2013058254 W IB 2013058254W WO 2014037876 A2 WO2014037876 A2 WO 2014037876A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- electronic device
- relevant portion
- display device
- augmented reality
- video
- Prior art date
Links
- 230000003190 augmentative effect Effects 0.000 title claims abstract description 76
- 238000009877 rendering Methods 0.000 claims abstract description 46
- 238000000034 method Methods 0.000 claims abstract description 43
- 230000008859 change Effects 0.000 claims description 9
- 238000010586 diagram Methods 0.000 description 6
- 238000004364 calculation method Methods 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 230000000007 visual effect Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 230000003416 augmentation Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T19/00—Manipulating 3D models or images for computer graphics
- G06T19/006—Mixed reality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/03—Arrangements for converting the position or the displacement of a member into a coded form
- G06F3/033—Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor
- G06F3/0346—Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor with detection of the device orientation or free movement in a 3D space, e.g. 3D mice, 6-DOF [six degrees of freedom] pointers using gyroscopes, accelerometers or tilt-sensors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/431—Generation of visual interfaces for content selection or interaction; Content or additional data rendering
- H04N21/4312—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
- H04N21/4316—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations for displaying supplemental content in a region of the screen, e.g. an advertisement in a separate window
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/14—Digital output to display device ; Cooperation and interconnection of the display device with other functional units
- G06F3/1423—Digital output to display device ; Cooperation and interconnection of the display device with other functional units controlling a plurality of local displays, e.g. CRT and flat panel display
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/4104—Peripherals receiving signals from specially adapted client devices
- H04N21/4126—The peripheral being portable, e.g. PDAs or mobile phones
- H04N21/41265—The peripheral being portable, e.g. PDAs or mobile phones having a remote control device for bidirectional communication between the remote control device and client device
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44008—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
- H04N21/4722—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting additional data associated with the content
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
- H04N21/8126—Monomedia components thereof involving additional data, e.g. news, sports, stocks, weather forecasts
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
- H04N21/8146—Monomedia components thereof involving graphical data, e.g. 3D object, 2D graphics
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
- H04N21/816—Monomedia components thereof involving special video data, e.g 3D video
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/265—Mixing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2200/00—Indexing scheme relating to G06F1/04 - G06F1/32
- G06F2200/16—Indexing scheme relating to G06F1/16 - G06F1/18
- G06F2200/163—Indexing scheme relating to constructional details of the computer
- G06F2200/1637—Sensing arrangement for detection of housing movement or orientation, e.g. for controlling scrolling or cursor movement on the display of an handheld computer
Definitions
- the present invention relates to methods and apparatus for providing augmented reality content to a video content item.
- Augmented reality is a live, direct or indirect, view of a physical, real- world environment whose elements are augmented (or supplemented) by computer- generated sensory input such as sound, video, graphics or GPS data. Augmentation is typically in real-time and in semantic context with environmental elements.
- Augmented Reality applications are now available to users via smartphones, tablet computers, or other handheld devices.
- these applications use cameras embedded in the handheld device and display contextual information overlaid onto real images captured by the camera.
- Fig. 1 is a simplified block diagram illustration of a system constructed and operative in accordance with an embodiment of the present invention
- Fig. 2 is a simplified block diagram illustration of a video database constructed and operative in accordance with an embodiment of the present invention
- Fig. 3 is a flow chart diagram of a method to view augmented reality content according to an embodiment of the present invention
- FIG. 4C are simplified pictorial illustrations showing augmented reality content obtained by translating an electronic device along an axis perpendicular to a plane of its rendering screen in accordance with an embodiment of the present invention
- Figs. 5 and 6 are pictorial illustrations of the calculations used in the automatic calibration procedure in accordance with an embodiment of the present invention.
- Fig. 7 is a pictorial illustration showing a relevant portion of the augmented reality panel to render on the rendering screen of an electronic device in accordance to an embodiment of the present invention.
- a method includes: displaying an event on a first display device associated with a client device, the event comprising at least one video scene; enabling an electronic device to communicate with the client device; receiving a user input requesting augmented reality content associated with the at least one video scene; identifying a relevant portion of the augmented reality content to render on a second display associated with the electronic device, wherein the relevant portion comprises video frames corresponding to the at least one video scene and additional content surrounding the video frames; and rendering the relevant portion on the second display device.
- a headend 110 typically communicates with a plurality of client devices via a communication network 130. Additionally or alternatively, a plurality of headends communicate with a single client device or with a plurality of client devices via the communication network 130. For simplicity of depiction and description, and without limiting the generality of the present invention, one client device 120 is depicted in Fig. 1 as being in operative communication (via communication network 130) with one headend 110. Headend 110 also comprises a video database 111 that will be described in greater details hereinafter in relation to Fig. 2.
- the communication network 130 is a one-way or two-way communication network that includes at least one of the following: a satellite based communication network; a cable based communication network; a conventional terrestrial broadcast television network; a telephony based communication network; a telephony based television broadcast network; a mobile-telephony based television broadcast network; an Internet Protocol (IP) television broadcast network; and a computer based communication network.
- IP Internet Protocol
- the communication network 130 may, for example, be implemented by a one-way or two-way hybrid communication network, such as a combination cable -telephone network, a combination satellite-telephone network, a combination satellite-computer based communication network, or by any other appropriate network.
- the system of Fig. 1 comprises a client device 120 disposed between a headend 110 and a display device 140.
- Client device 120 is coupled to a display device 140.
- Client device 120 typically comprises a tuner, a decoder, a receiver and a descrambler. It is appreciated that the client device 120 comprises standard hardware components and software components, as is well known in the art.
- Client device 120 is typically connected in operation to display device 140 via a digital AV interface (e.g. HDMI, DVI, etc.) or via an analogue AV interface (e.g. component (RGB, YPbPr), composite (NTSC, PAL, SECAM), S -video, SCART, RF coaxial, D-Terminal (D-tanshi) etc.). While shown as separate entities in Fig. 1, the client device 120 may be integral with the display device 140 in other embodiments of the present invention.
- a digital AV interface e.g. HDMI, DVI, etc.
- an analogue AV interface e.g. component (RGB, YPbPr), composite (NTSC, PAL, SECAM), S -video, SCART, RF coaxial, D-Terminal (D-tanshi) etc.
- the client device 120 may be integral with the display device 140 in other embodiments of the present invention.
- Client device 120 typically receives audio visual (AV) signals from e.g. a broadcast stream, a VOD library, etc., as well as related digital data.
- the client device typically decodes and decrypts/descrambles as necessary the AV signals received from a headend 110 optionally under control of a conditional access device such as removable security element as is well known in the art.
- the removable security element typically includes a smart card as is well known in the art.
- the output from client device 120 comprises a decoded and decrypted/de scrambled as necessary audio video (AV) signals ready to be displayed on the display device 140.
- the client device 120 typically runs onboard software stored within a memory of the client device 120, the onboard software comprising applications (e.g.
- EPG electronic program guide
- PV pay-per- view
- VOD video-on-demand
- middleware software layers for controlling the device layer interface that provides an interface between the middleware and the decoder hardware controllers, and all other layers used to operate the client device 120.
- Client device 120 is typically controlled by a user, for example via a remote control unit (RCU) 150.
- Client device 120 can receive control commands from the RCU 150 (e.g. via infra-red communication).
- RCU 150 is used by a user to select AV content to view, such as a live event broadcast, a Video -On-Demand (VOD) asset, a recorded event, etc., or to select a particular application to run, such as an EPG, a PPV application, a VOD application, etc., which is then displayed on the display device 140.
- the onboard software within client device 120 monitors user interaction with the client device 120 so that the relevant information is displayed on the display device 140 in response to a user command entered using the RCU 150.
- an electronic device 160 is provided that is operable to communicate with the client device 120 using any suitable connectivity link (e.g. wired, WiFi, Bluetooth, RF, etc.).
- the electronic device 160 is further operable to communicate with the headend 110 through the communication network 130.
- Figure 1 shows only one communication network 130. However, those skilled in the art will appreciate that the electronic device 160 may communicate directly with the headend 110 using the same communication network 130 or a different communication network.
- the electronic device 160 may be for example, but without limiting the generality of the invention, a companion device such as a tablet computer, an iPadTM, or a mobile computing device, such as a Personal Digital Assistant (PDA), mobile phone, or any handheld device.
- PDA Personal Digital Assistant
- the electronic device 160 typically comprises a processor, a video player, a rendering screen for displaying content and data information to a user, a camera operative to take pictures and/or videos and an orientation module 170 comprising a plurality of sensors such as, for example, but without limiting the generality of the present invention, an accelerometer sensor, a gyroscope, a manual switch, etc. operative to detect the orientation of the electronic device 160.
- a user is typically able to select an event or a service to be displayed on the display device 140. Further, the user is typically able to select an application to be run on the electronic device 160 in order to get augmented reality content related to the event currently being displayed on the display device 140.
- the application may be locally executed on the electronic device 160 and launched as soon as an event is selected for display by the user, or upon reception of a notification transmitted by a video database 111 notifying the user that augmented reality content is available for the event.
- the notification indicating that augmented reality content is available may be, for example, but without limiting the generality of the present invention, in a form of a visual indication such as an On-Screen Display (OSD) or pop-up message appearing on the display device 140.
- OSD On-Screen Display
- FIG. 2 is a simplified block diagram illustration of a video database 211 constructed and operative in accordance with an embodiment of the present invention.
- the video database 211 is typically located at the headend 210 and comprises metadata related to a plurality of video content items 212. Those skilled in the art will appreciate that this video database 211 may be in any suitable form and located in any suitable part of the system of the present invention. Other ways of implementing the database 211, such as using a plurality of databases, will be apparent to someone skilled in the art.
- a video content item 212(i) typically comprises a sequence of video frames representing one or more video scenes.
- a video content item 212(i) is identified by a video content identifier and metadata is provided for each video scene of the video content item 212(i).
- Metadata provided for each video scene typically comprise at least:
- augmented reality content that may be accessed and viewed by a user while the video scene of the video content item 212(i) is being played on the display device 140.
- This augmented reality content is typically in a form of an augmented reality panel comprising the video frames of the relevant video scene of the video content item 212(i) and additional content surrounding the video frames.
- the additional content may be for example, but without limiting the generality of the present invention, pictures, videos, etc. retrieved from any suitable source (e.g. web sources, local contents, etc.). Once retrieved, the additional content is aggregated and arranged around the video frames to form the augmented reality panel relevant to the video scene of the video content item 212(i).
- the surrounding content may already be available in situations where the video scene currently being displayed is a subset of the original filmed image.
- the dimensions of the augmented reality panel as well as the position of the video frames corresponding to the video scene currently being displayed in the augmented reality panel are defined in advance.
- the video content items 212 which populate the video database 211 may be selected and provided by a TV broadcaster, a platform operator, a content provider or any third party entity.
- a wide selection of regular TV broadcast events, VOD assets, and other video content items may be available to be used by the application and/or the end user.
- the video database 211 may also be updated regularly by a TV broadcaster, a platform operator, a content provider or any third party entity in order to make available new video content items 212 and their associated metadata to users of the application.
- each video content item 212 supporting this augmented reality content system may be defined in advance by the TV broadcaster, platform operator, content provider or any third party entity providing this service. Indeed, it will be apparent to someone skilled in the art that displaying meaningful augmented reality contents to users depends on the type of event and video scene. An event may have several different video scenes, real or imaginary, taking place in many different places. Therefore, it may be complicated or even sometimes impossible to provide such augmented reality content for some video scenes of the event.
- the present invention is well-suited for situations in which, for example, but without limiting the generality of the present invention, the action takes place in a same room or when the video scene is filmed with a same fixed camera angle.
- the position of the video frames in the augmented reality panel may vary from one video scene to another.
- the video frames may be located in any suitable position so that relevant surrounding content is available to the user.
- the video frames shown in Figs. 4A and 4B show two actors sitting on a bench with a garden and Notre-Dame Cathedral in the background. Therefore, the video frames showing the actors may be located at the bottom center of the augmented reality panel so that surrounding content relevant to the garden and Ul-Dame Cathedral is available on the left, right and top of the video frames.
- the position depends on many factors such as the TV broadcaster, platform operator, content provider or any third party entity providing this service, the relevant video scene, the available surrounding content, etc.
- the user may select an event or a service to be displayed on the display device 140 by operating a RCU 150 e.g. a remote control.
- the client device 120 tunes to the selected event or service and demultiplexes, decodes and decrypts/descrambles as necessary the broadcast television stream received from the headend 110 in order to display the selected event or service on the display device 140.
- the event may be any suitable type of event such as a recorded event, a VOD asset, etc.
- the user is notified - via a sound or visual indication as explained hereinabove - that augmented reality content may be available for the video scene of the event currently being displayed.
- the user may decide at step 303 to run an application on his electronic device 160 to obtain and view augmented reality content relevant to the event or service currently being displayed on the display device 140.
- the user may run the application as soon as the event selected is displayed on the display device 140 or upon reception of a notification from a video database 111 that augmented reality content is available for the video scene currently being displayed.
- the application may be a distributed computing application and therefore, that the application may be executed in any or in a plurality of suitable computing devices of a distributed computing system.
- the electronic device 160 is operable to communicate with the video database 111 of the headend 110.
- the communication between the electronic device 160 and the video database 111 is established via the client device 120.
- the augmented reality panel related to the video scene currently being displayed is retrieved from the video database 111 and stored on the client device 120 therefore becoming accessible and/or available to be used by the electronic device 160. It will be apparent to someone skilled in the art that the augmented reality panel related to the video scene currently being displayed may also be streamed in real time by the electronic device 160 from the video database 111.
- the user Prior to viewing the augmented reality content relevant to the video scene currently being displayed, the user is requested to hold his electronic device 160 in front of the display device 140 (as illustrated in Fig. 4A) for an automatic calibration procedure (step 304).
- the purpose of this calibration procedure is to determine how and which portion of the augmented reality panel (which depends on the distance and direction between the display device 140 and the electronic device 160) is to be displayed on the rendering screen of the electronic device 160. To do so, the position where the video frames of the augmented reality panel are to be displayed on the rendering screen of the electronic device 160, and the dimensions of the video frames, are automatically computed using the camera of the electronic device 160.
- the computed dimensions typically corresponds to a homothetic transformation of the dimensions and position of the video displayed on the display device 140 as viewed by the user in the plane of the rendering screen of the electronic device 160.
- the position and the dimensions of the video frames are calculated to correspond to the dimensions of the video displayed on the display device 140 as they would be seen by the user in the plane of the rendering screen of the electronic device 160 and not as they are rendered on the rendering screen by the camera of the electronic device 160.
- the absolute direction of the display device 140 is determined using the orientation module 170 e.g. the gyroscope sensor of the electronic device 160. This absolute direction may be corrected as the camera is usually not centered on the middle of the electronic device 160 to give the initial direction. This initial direction obtained during the calibration procedure is stored for later use.
- the relevant portion of the augmented reality panel related to the video scene of the event currently being displayed on the display device 140 is identified and rendered on the rendering screen of the electronic device 160 with:
- the user is able to move the electronic device 160 and see augmented reality content surrounding the video frames of the video scene currently being displayed.
- the orientation module 170 e.g. gyroscope and accelerometer sensors
- the camera of the electronic device 160 are used to continuously communicate the current position and orientation of the electronic device 160 to the client device 120.
- the new position and orientation of the electronic device 160 are determined and the new relevant portion of augmented reality panel (comprising the video frames of the video scene and surrounding additional content) is displayed on the rendering screen of the electronic device 160.
- FIG. 4B shows the user rotating the electronic device 160 around the X-axis parallel to a plane of the rendering screen in order to see augmented reality content located above the video frame of the video scene currently being displayed.
- Fig. 4C shows different portions of the augmented reality panel displayed on the rendering screen of the electronic device 160 when the user rotates the electronic device 160 around an axis parallel to the plane of the rendering screen:
- a zoom-in (or zoom-out) effect may be achieved.
- This type of user's motion involves real time calculations in order to determine and update the position where the video frames of the video scene currently being displayed are to be displayed on the rendering screen of the electronic device 160, and the dimensions of the video frames (and in turn, the dimensions of the augmented reality panel). Those skilled in the art will appreciate that these calculations are similar to the ones made during the calibration procedure. Further details regarding the motion detection and related effects rendered on the rendering screen of the electronic device 160 will be given hereinafter in relation to the description of Fig. 7.
- the purpose of the calibration procedure is to determine how and which portion of the augmented reality panel relevant to the video scene of the event currently being displayed on the display device 140 is to be displayed on the rendering screen of the electronic device 160.
- the camera of the electronic device 160 is held and oriented by the user towards the direction of the display device 140.
- the shape of the video, corresponding to the event currently being displayed on the display device 140 is detected on the rendering screen of the electronic device 160 and analyzed using well-known real time image recognition and analysis technics.
- the position and dimensions of the display device 140 as viewed by the camera of the electronic device 160 are obtained as output of this analysis.
- the dimensions do not correspond to the dimensions at which the display device 140 is viewed by the user in the plane of the rendering screen of the electronic device 160. This is due to the camera's properties and more particularly to the camera's angle of view also referred to as the "Field of View” (“FoV”) angle. Depending on the electronic device 160 used, this "FoV" angle varies. Therefore, the calibration procedure enables users of any electronic devices having their own properties to compute the position where the video frames of the augmented reality panel are to be displayed on the rendering screen of the electronic device 160 and the dimensions of the video frames.
- FoV Field of View
- Figs. 5 and 6 are pictorial illustrations of the calculations used in the automatic calibration procedure according to an embodiment of the present invention.
- the computed dimensions typically correspond to a homothetic transformation of the dimensions and position of the video displayed on the display device 140 as viewed by the user in the plane of the rendering screen of the electronic device 160.
- Fig. 5 shows two planes:
- o H REAL is the height of the real image
- o II REAL is the height of the detected video shape
- o H CAMERA is the height corresponding to H REAL as viewed by the camera of the electronic device 160.
- o h CAMERA is the height corresponding to mAL as viewed by the camera of the electronic device 160.
- Futhermore Fig. 5 shows two additional parameters: a corresponding to the vertical "FoV" angle of the camera; and D TV - CAMERA corresponding to the distance between the camera of the electronic device 160 and the display device 140.
- the vertical "FoV” angle is a constant defined as an intrinsic property of the electronic device 160 and is given by:
- DTV -CAMERA The camera image and recognized shape of the video provide H CAMERA and hcAMERA where:
- Fig. 6 shows a display device plane and a rendering screen plane of the electronic device 160 for which:
- H is the height of the portion of the augmented reality panel that is to be rendered on the rendering screen of the electronic device 160 and typically corresponds to the height of the rendering screen.
- ⁇ corresponds to the vertical "FoV” angle of the user and may be given by:
- a is an intrinsic property of the camera
- 1ICAMJ3 ⁇ 4A and HCAMJ3 ⁇ 4A are given by the camera
- D USER _ DEVICE is fixed as a constant corresponding to the distance between the user and the electronic device 160. All these equations show that the height h depends on h CAMERA and H CAMERA . It is then possible to derive the width w of the video frames. This width w typically depends on the ratio of the video R e.g. 16/9 and is given by:
- the user typically holds the electronic device 160 in front of the display device 140 to perform the calibration procedure.
- the user is also able to perform the calibration procedure even if he is not located in front of the display device 140.
- h CAMERA and H CAMERA will be computed as an average between the left and right corresponding heights of the camera image.
- the user is requested to orientate the electronic device 160 towards the direction of the display device 140 so that the display device 140 appears entirely in the camera image.
- Fig. 7 is a pictorial illustration showing a relevant portion of the augmented reality panel to render on the rendering screen of the electronic device 160 according to an embodiment of the present invention.
- the orientation module 170 e.g. gyroscope and accelerometer sensors of the electronic device 160 are used to determine which portion of the augmented reality panel to render on the rendering screen of the electronic device 160.
- This orientation module 170 is typically able to detect an orientation change of the electronic device 160 i.e. when the electronic device 160 is moved and/or turned around.
- the user holding the electronic device 160 can move in a plurality of directions. However, each user movement can be modeled as a movement around a sphere as shown in Fig. 7. This sphere is centered on the user and has a radius corresponding to the distance between the user and the electronic device 160. Therefore, the following user movements may be considered:
- rotating the electronic device 160 around the Y-axis parallel to the plane of the rendering screen corresponds to a left or right movement around the sphere and therefore left or right surrounding content is rendered on the rendering screen of the electronic device 160.
- This motion detection may be done efficiently in order to support a minimal frame rate of for example, but without limiting the generality of the invention, 15 images per second;
- rotating the electronic device around the X-axis parallel to the plane of the rendering screen corresponds to an up or down movement around the sphere to the top or bottom of the sphere and therefore top or bottom surrounding content is rendered on the rendering screen of the electronic device 160.
- This motion detection may be done efficiently in order to support a minimal frame rate of for example, but without limiting the generality of the invention, 15 images per second; and
- the motion of the electronic device 160 is detected by the orientation module 170.
- the orientation module 170 sends to the client device 120 its change of position (delta position) from the initial position (i.e. the initial position indicating the direction of the display device 140 at the end of the calibration procedure represented by the central portion of the augmented reality panel in Fig. 7) both in terms of distance to the display device 140 (e.g. using the accelerometer sensor) and in terms of direction to the display device 140 (e.g. using the gyroscope sensor).
- the output from the motion detection may be:
- an update of the calibration procedure comprising updated dimensions and position of the video frames (and in turn, of the augmented reality panel) as well as an updated initial position. This typically corresponds to a movement along the Z-axis perpendicular to the plane of the rendering screen of the electronic device 160 for which a new calibration procedure is performed;
- the relevant portion of the augmented reality panel is defined by the following parameters:
- width WPORTION is given by:
- ⁇ a position in the augmented reality panel defined by the coordinates XPORTION and ypoRTTON ( ⁇ PORTION, YPORTION)- These coordinates are set to (0, 0) according to the initial direction obtained during the calibration procedure. Then, when a rotation of the electronic device 160 around an axis X and/or Y is detected, the orientation module 170 e.g. the gyroscope sensor determines the angular difference ( ⁇ ⁇ , ⁇ ⁇ ) with the initial direction.
- the new coordinates corresponding to the new relevant portion of the augmented reality panel to render are given by:
- xPORTION ⁇ USER-DEVICE ' ⁇
- PORTION ⁇ USER-DEVICE ' ⁇
- the coordinates are expressed with respect to a reference point set to correspond to the initial direction.
- the reference point may be any suitable point of the portion of the augmented reality panel such as for example, but without limiting the generality of the invention, the center of the portion, the top left corner, etc.
- the dimensions of the relevant portion of the augmented reality panel to display (h PO RTioN 5 WPORTION) and the initial direction are updated.
- the coordinates (XPORTION ⁇ YPORTON) are computed when the electronic device 160 is moved around the sphere, i.e. rotated around the X-axis and/or Y-axis parallel to the plane of the rendering screen of the electronic device 160. In this latter case, the dimensions (h PO RTioN, WPORTION) of the augmented reality panel do not change.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Computer Graphics (AREA)
- Business, Economics & Management (AREA)
- Marketing (AREA)
- Databases & Information Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Software Systems (AREA)
- Computer Hardware Design (AREA)
- Computing Systems (AREA)
- Geometry (AREA)
- User Interface Of Digital Computer (AREA)
- Processing Or Creating Images (AREA)
Abstract
In one embodiment, a method includes: displaying an event on a first display device associated with a client device, the event comprising at least one video scene; enabling an electronic device to communicate with the client device; receiving a user input requesting augmented reality content associated with the at least one video scene; identifying a relevant portion of the augmented reality content to render on a second display associated with the electronic device, wherein the relevant portion comprises video frames corresponding to the at least one video scene and additional content surrounding the video frames; and rendering the relevant portion on the second display device.
Description
AUGMENTED REALITY FOR VIDEO SYSTEM TECHNICAL FIELD The present invention relates to methods and apparatus for providing augmented reality content to a video content item.
BACKGROUND Augmented reality (AR) is a live, direct or indirect, view of a physical, real- world environment whose elements are augmented (or supplemented) by computer- generated sensory input such as sound, video, graphics or GPS data. Augmentation is typically in real-time and in semantic context with environmental elements.
An increased number of different Augmented Reality applications are now available to users via smartphones, tablet computers, or other handheld devices. Typically, these applications use cameras embedded in the handheld device and display contextual information overlaid onto real images captured by the camera.
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will be understood and appreciated more fully from the detailed description, taken in conjunction with the drawings in which:
Fig. 1 is a simplified block diagram illustration of a system constructed and operative in accordance with an embodiment of the present invention;
Fig. 2 is a simplified block diagram illustration of a video database constructed and operative in accordance with an embodiment of the present invention;
Fig. 3 is a flow chart diagram of a method to view augmented reality content according to an embodiment of the present invention;
Fig. 4 A is a simplified pictorial illustration of a calibration procedure in accordance with an embodiment of the present invention;
Fig. 4B is a simplified pictorial illustration showing augmented reality content obtained after rotating an electronic device around an axis parallel to a plane of its rendering screen in accordance with an embodiment of the present invention;
Fig. 4C are simplified pictorial illustrations showing augmented reality content obtained by translating an electronic device along an axis perpendicular to a plane of its rendering screen in accordance with an embodiment of the present invention;
Figs. 5 and 6 are pictorial illustrations of the calculations used in the automatic calibration procedure in accordance with an embodiment of the present invention.
Fig. 7 is a pictorial illustration showing a relevant portion of the augmented reality panel to render on the rendering screen of an electronic device in accordance to an embodiment of the present invention.
DESCRIPTION OF EXAMPLE EMBODIMENTS
Overview
In one embodiment, a method includes: displaying an event on a first display device associated with a client device, the event comprising at least one video scene; enabling an electronic device to communicate with the client device; receiving a user input requesting augmented reality content associated with the at least one video scene; identifying a relevant portion of the augmented reality content to render on a second display associated with the electronic device, wherein the relevant portion comprises video frames corresponding to the at least one video scene and additional content surrounding the video frames; and rendering the relevant portion on the second display device.
Example Embodiments
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the various principles of the present invention. However, those skilled in the art will appreciate that not all these details are necessarily
always required for practicing the present invention. In this instance, well-known circuits, control logic, and the details of computer program instructions for conventional algorithms and processes have not been shown in detail in order not to obscure the general concepts unnecessarily.
Although the principles of the present invention are largely described herein in relation to a television broadcast environment and related television applications, this is an example selected for convenience of presentation, and is not limiting.
Reference is now made to Fig. 1, which is a simplified block diagram illustration of a system constructed and operative in accordance with an embodiment of the present invention.
A headend 110 typically communicates with a plurality of client devices via a communication network 130. Additionally or alternatively, a plurality of headends communicate with a single client device or with a plurality of client devices via the communication network 130. For simplicity of depiction and description, and without limiting the generality of the present invention, one client device 120 is depicted in Fig. 1 as being in operative communication (via communication network 130) with one headend 110. Headend 110 also comprises a video database 111 that will be described in greater details hereinafter in relation to Fig. 2.
The communication network 130 is a one-way or two-way communication network that includes at least one of the following: a satellite based communication network; a cable based communication network; a conventional terrestrial broadcast television network; a telephony based communication network; a telephony based television broadcast network; a mobile-telephony based television broadcast network; an Internet Protocol (IP) television broadcast network; and a computer based communication network. It is appreciated that in other embodiments of the present invention, the communication network 130 may, for example, be implemented by a one-way or two-way hybrid communication network, such as a combination cable -telephone network, a combination satellite-telephone network, a combination satellite-computer based communication network, or by any other appropriate network. Other ways of implementing the communication network 130 will be apparent to someone skilled in the art.
The system of Fig. 1 comprises a client device 120 disposed between a headend 110 and a display device 140. Client device 120 is coupled to a display device 140. Client device 120 typically comprises a tuner, a decoder, a receiver and a descrambler. It is appreciated that the client device 120 comprises standard hardware components and software components, as is well known in the art.
Client device 120 is typically connected in operation to display device 140 via a digital AV interface (e.g. HDMI, DVI, etc.) or via an analogue AV interface (e.g. component (RGB, YPbPr), composite (NTSC, PAL, SECAM), S -video, SCART, RF coaxial, D-Terminal (D-tanshi) etc.). While shown as separate entities in Fig. 1, the client device 120 may be integral with the display device 140 in other embodiments of the present invention.
Client device 120 typically receives audio visual (AV) signals from e.g. a broadcast stream, a VOD library, etc., as well as related digital data. The client device typically decodes and decrypts/descrambles as necessary the AV signals received from a headend 110 optionally under control of a conditional access device such as removable security element as is well known in the art. The removable security element typically includes a smart card as is well known in the art. The output from client device 120 comprises a decoded and decrypted/de scrambled as necessary audio video (AV) signals ready to be displayed on the display device 140. Furthermore, the client device 120 typically runs onboard software stored within a memory of the client device 120, the onboard software comprising applications (e.g. electronic program guide (EPG), pay-per- view (PPV) resident application, video-on-demand (VOD) resident application, etc.), an intermediate software layer (middleware), software layers for controlling the device layer interface that provides an interface between the middleware and the decoder hardware controllers, and all other layers used to operate the client device 120.
Client device 120 is typically controlled by a user, for example via a remote control unit (RCU) 150. Client device 120 can receive control commands from the RCU 150 (e.g. via infra-red communication). RCU 150 is used by a user to select AV content to view, such as a live event broadcast, a Video -On-Demand (VOD) asset, a recorded event, etc., or to select a particular application to run, such as an EPG, a PPV application, a VOD application, etc., which is then displayed on the display device 140. The onboard software within client device 120 monitors user interaction with the client device 120 so that the
relevant information is displayed on the display device 140 in response to a user command entered using the RCU 150.
Furthermore, an electronic device 160 is provided that is operable to communicate with the client device 120 using any suitable connectivity link (e.g. wired, WiFi, Bluetooth, RF, etc.). The electronic device 160 is further operable to communicate with the headend 110 through the communication network 130. Figure 1 shows only one communication network 130. However, those skilled in the art will appreciate that the electronic device 160 may communicate directly with the headend 110 using the same communication network 130 or a different communication network. The electronic device 160 may be for example, but without limiting the generality of the invention, a companion device such as a tablet computer, an iPad™, or a mobile computing device, such as a Personal Digital Assistant (PDA), mobile phone, or any handheld device. The electronic device 160 typically comprises a processor, a video player, a rendering screen for displaying content and data information to a user, a camera operative to take pictures and/or videos and an orientation module 170 comprising a plurality of sensors such as, for example, but without limiting the generality of the present invention, an accelerometer sensor, a gyroscope, a manual switch, etc. operative to detect the orientation of the electronic device 160.
A user is typically able to select an event or a service to be displayed on the display device 140. Further, the user is typically able to select an application to be run on the electronic device 160 in order to get augmented reality content related to the event currently being displayed on the display device 140. The application may be locally executed on the electronic device 160 and launched as soon as an event is selected for display by the user, or upon reception of a notification transmitted by a video database 111 notifying the user that augmented reality content is available for the event. In the latter case, the notification indicating that augmented reality content is available may be, for example, but without limiting the generality of the present invention, in a form of a visual indication such as an On-Screen Display (OSD) or pop-up message appearing on the display device 140. It will be apparent to someone skilled in the art that any suitable form of visual and/or sound indication may be used to notify the user that augmented reality content is available. Once the application is launched, communication between the electronic device 160 and client device 120 and/or headend 110 is established. The user
may decide to view this augmented reality content and to do so is requested to hold the electronic device 160 in front of the display device 140 for a calibration procedure, which is typically automatic. Once the calibration procedure is successfully achieved, the current video scene of the event or service currently being displayed on the display device 140 is also displayed on the rendering screen of the electronic device 160 and the user is further able to view augmented reality content (i.e. surrounding content received from the video database 111) related to the video scene currently being displayed on the display device 140.
Reference is now made to Fig. 2, which is a simplified block diagram illustration of a video database 211 constructed and operative in accordance with an embodiment of the present invention.
The video database 211 is typically located at the headend 210 and comprises metadata related to a plurality of video content items 212. Those skilled in the art will appreciate that this video database 211 may be in any suitable form and located in any suitable part of the system of the present invention. Other ways of implementing the database 211, such as using a plurality of databases, will be apparent to someone skilled in the art.
A video content item 212(i) typically comprises a sequence of video frames representing one or more video scenes. In an embodiment of the present invention, a video content item 212(i) is identified by a video content identifier and metadata is provided for each video scene of the video content item 212(i). Metadata provided for each video scene typically comprise at least:
• a timeline position identifying when the video scene occurs in the video content item. This timeline position may be for example, but without limiting the generality of the invention, a time elapsed since the beginning of the video content item 212(i), start and end times of the video scene, or a duration of the video scene, etc.; and
• augmented reality content that may be accessed and viewed by a user while the video scene of the video content item 212(i) is being played on the display device 140. This augmented reality content is typically in a form of an augmented reality panel comprising the video frames of the relevant video scene of the video content
item 212(i) and additional content surrounding the video frames. The additional content may be for example, but without limiting the generality of the present invention, pictures, videos, etc. retrieved from any suitable source (e.g. web sources, local contents, etc.). Once retrieved, the additional content is aggregated and arranged around the video frames to form the augmented reality panel relevant to the video scene of the video content item 212(i). In another embodiment of the present invention, the surrounding content may already be available in situations where the video scene currently being displayed is a subset of the original filmed image. Also, the dimensions of the augmented reality panel as well as the position of the video frames corresponding to the video scene currently being displayed in the augmented reality panel are defined in advance.
Those skilled in the art will appreciate that the video content items 212 which populate the video database 211 may be selected and provided by a TV broadcaster, a platform operator, a content provider or any third party entity. Hence, a wide selection of regular TV broadcast events, VOD assets, and other video content items may be available to be used by the application and/or the end user. Furthermore, the video database 211 may also be updated regularly by a TV broadcaster, a platform operator, a content provider or any third party entity in order to make available new video content items 212 and their associated metadata to users of the application.
Similarly, the video scenes of each video content item 212 supporting this augmented reality content system may be defined in advance by the TV broadcaster, platform operator, content provider or any third party entity providing this service. Indeed, it will be apparent to someone skilled in the art that displaying meaningful augmented reality contents to users depends on the type of event and video scene. An event may have several different video scenes, real or imaginary, taking place in many different places. Therefore, it may be complicated or even sometimes impossible to provide such augmented reality content for some video scenes of the event. The present invention is well-suited for situations in which, for example, but without limiting the generality of the present invention, the action takes place in a same room or when the video scene is filmed with a same fixed camera angle.
Finally, the position of the video frames in the augmented reality panel may vary from one video scene to another. Depending on the video scene, the video frames
may be located in any suitable position so that relevant surrounding content is available to the user. The video frames shown in Figs. 4A and 4B show two actors sitting on a bench with a garden and Notre-Dame Cathedral in the background. Therefore, the video frames showing the actors may be located at the bottom center of the augmented reality panel so that surrounding content relevant to the garden and Notre-Dame Cathedral is available on the left, right and top of the video frames. Those skilled in the art will appreciate that the position depends on many factors such as the TV broadcaster, platform operator, content provider or any third party entity providing this service, the relevant video scene, the available surrounding content, etc.
Reference is now made to Fig. 3, which is a flow chart diagram of a method to view augmented reality content according to an embodiment of the present invention.
At step 301, the user may select an event or a service to be displayed on the display device 140 by operating a RCU 150 e.g. a remote control. Upon reception of the user's choice, the client device 120 tunes to the selected event or service and demultiplexes, decodes and decrypts/descrambles as necessary the broadcast television stream received from the headend 110 in order to display the selected event or service on the display device 140. Although described in relation to a live TV event broadcast, those skilled in the art will appreciate that the event may be any suitable type of event such as a recorded event, a VOD asset, etc.
At step 302, the event currently being displayed on the display device is identified and it is determined if that event supports augmented reality video content. To do so, an event identifier is compared to the video content item identifiers present in the video database 111. If there is no match, no augmented reality content is available for the event currently being displayed. On the contrary, if there is a match, then augmented reality content may be available for the event currently being displayed on the display device 140. In the latter case, the client device 120 continuously monitors the time elapsed since the beginning of the event and compares it to the timeline position of the different video scenes of the identified video content item 212(i). When it is determined that the video scene of event currently being displayed on the display device 140 corresponds to a video scene of the video content item 212(i), the user is notified - via a sound or visual
indication as explained hereinabove - that augmented reality content may be available for the video scene of the event currently being displayed.
In an embodiment of the present invention, the user may decide at step 303 to run an application on his electronic device 160 to obtain and view augmented reality content relevant to the event or service currently being displayed on the display device 140. As explained hereinabove, the user may run the application as soon as the event selected is displayed on the display device 140 or upon reception of a notification from a video database 111 that augmented reality content is available for the video scene currently being displayed. Those skilled in the art will appreciate that the application may be a distributed computing application and therefore, that the application may be executed in any or in a plurality of suitable computing devices of a distributed computing system. Once launched, the electronic device 160 is operable to communicate with the video database 111 of the headend 110. Additionally and/or alternatively, the communication between the electronic device 160 and the video database 111 is established via the client device 120. The augmented reality panel related to the video scene currently being displayed is retrieved from the video database 111 and stored on the client device 120 therefore becoming accessible and/or available to be used by the electronic device 160. It will be apparent to someone skilled in the art that the augmented reality panel related to the video scene currently being displayed may also be streamed in real time by the electronic device 160 from the video database 111.
Prior to viewing the augmented reality content relevant to the video scene currently being displayed, the user is requested to hold his electronic device 160 in front of the display device 140 (as illustrated in Fig. 4A) for an automatic calibration procedure (step 304). The purpose of this calibration procedure is to determine how and which portion of the augmented reality panel (which depends on the distance and direction between the display device 140 and the electronic device 160) is to be displayed on the rendering screen of the electronic device 160. To do so, the position where the video frames of the augmented reality panel are to be displayed on the rendering screen of the electronic device 160, and the dimensions of the video frames, are automatically computed using the camera of the electronic device 160. The computed dimensions typically corresponds to a homothetic transformation of the dimensions and position of the video displayed on the display device 140 as viewed by the user in the plane of the
rendering screen of the electronic device 160. In other words, the position and the dimensions of the video frames are calculated to correspond to the dimensions of the video displayed on the display device 140 as they would be seen by the user in the plane of the rendering screen of the electronic device 160 and not as they are rendered on the rendering screen by the camera of the electronic device 160. Also, the absolute direction of the display device 140 is determined using the orientation module 170 e.g. the gyroscope sensor of the electronic device 160. This absolute direction may be corrected as the camera is usually not centered on the middle of the electronic device 160 to give the initial direction. This initial direction obtained during the calibration procedure is stored for later use. At the end of the automatic calibration procedure, the relevant portion of the augmented reality panel related to the video scene of the event currently being displayed on the display device 140 is identified and rendered on the rendering screen of the electronic device 160 with:
• the video frames (and in turn, the augmented reality panel) located and resized according to the computed dimensions; and
• the initial direction.
Therefore, as sizes of display device 140 and electronic device 160 as they are seen by the user do not necessarily correspond, the first portion of the augmented reality panel relevant to the current video scene is rendered on the rendering screen of the electronic device 160 already with surrounding additional content. For avoiding any parallax issues, the rendering on the electronic device 160 is done as if the user is standing right in front of the display device 140 regardless of his actual position (or position at the time when the calibration procedure was performed) with respect to the display device 140. The calibration procedure and related calculations will be described in greater details hereinafter in relation to the description of Figs. 5, 6 and 7.
At step 305, the user is able to move the electronic device 160 and see augmented reality content surrounding the video frames of the video scene currently being displayed. The orientation module 170 (e.g. gyroscope and accelerometer sensors) as well as the camera of the electronic device 160 are used to continuously communicate the current position and orientation of the electronic device 160 to the client device 120. When the user rotates the electronic device 160 around an axis parallel to a plane of the rendering screen, the new position and orientation of the electronic device 160 are
determined and the new relevant portion of augmented reality panel (comprising the video frames of the video scene and surrounding additional content) is displayed on the rendering screen of the electronic device 160. Fig. 4B shows the user rotating the electronic device 160 around the X-axis parallel to a plane of the rendering screen in order to see augmented reality content located above the video frame of the video scene currently being displayed. Similarly, Fig. 4C shows different portions of the augmented reality panel displayed on the rendering screen of the electronic device 160 when the user rotates the electronic device 160 around an axis parallel to the plane of the rendering screen:
· rotation around the Y-axis for left (view 1) and right (view 2) views; and
• rotation around the X-axis for top (view 3) and bottom (view 4) views.
Also, when the user is moving along the Z-axis perpendicular to the plane of the rendering screen of the electronic device 160 to get closer (or farther) to (from) the display device 140, a zoom-in (or zoom-out) effect may be achieved. This type of user's motion involves real time calculations in order to determine and update the position where the video frames of the video scene currently being displayed are to be displayed on the rendering screen of the electronic device 160, and the dimensions of the video frames (and in turn, the dimensions of the augmented reality panel). Those skilled in the art will appreciate that these calculations are similar to the ones made during the calibration procedure. Further details regarding the motion detection and related effects rendered on the rendering screen of the electronic device 160 will be given hereinafter in relation to the description of Fig. 7.
As explained hereinabove in relation to step 304 of Fig. 3, the purpose of the calibration procedure is to determine how and which portion of the augmented reality panel relevant to the video scene of the event currently being displayed on the display device 140 is to be displayed on the rendering screen of the electronic device 160. To do so, the camera of the electronic device 160 is held and oriented by the user towards the direction of the display device 140. Then, the shape of the video, corresponding to the event currently being displayed on the display device 140, is detected on the rendering screen of the electronic device 160 and analyzed using well-known real time image recognition and analysis technics. The position and dimensions of the display device 140
as viewed by the camera of the electronic device 160 are obtained as output of this analysis. However, the dimensions do not correspond to the dimensions at which the display device 140 is viewed by the user in the plane of the rendering screen of the electronic device 160. This is due to the camera's properties and more particularly to the camera's angle of view also referred to as the "Field of View" ("FoV") angle. Depending on the electronic device 160 used, this "FoV" angle varies. Therefore, the calibration procedure enables users of any electronic devices having their own properties to compute the position where the video frames of the augmented reality panel are to be displayed on the rendering screen of the electronic device 160 and the dimensions of the video frames.
Reference is now made to Figs. 5 and 6, which are pictorial illustrations of the calculations used in the automatic calibration procedure according to an embodiment of the present invention. The computed dimensions typically correspond to a homothetic transformation of the dimensions and position of the video displayed on the display device 140 as viewed by the user in the plane of the rendering screen of the electronic device 160. Fig. 5 shows two planes:
• a display device plane corresponding to the real image viewed by the camera in which:
o HREAL is the height of the real image; and
o IIREAL is the height of the detected video shape; and
· a camera plane corresponding to the camera image as viewed by the camera according to its own "FoV" angle for which:
o HCAMERA is the height corresponding to HREAL as viewed by the camera of the electronic device 160; and
o hCAMERA is the height corresponding to mAL as viewed by the camera of the electronic device 160.
Futhermore, Fig. 5 shows two additional parameters: a corresponding to the vertical "FoV" angle of the camera; and DTV-CAMERA corresponding to the distance between the camera of the electronic device 160 and the display device 140.
The vertical "FoV" angle is a constant defined as an intrinsic property of the electronic device 160 and is given by:
HREAL
a =—
DTV -CAMERA
The camera image and recognized shape of the video provide HCAMERA and hcAMERA where:
HcAMERA _ HREAL
^CAMERA h-REAL
By combining the two equations given hereinabove, it is possible to compute the angle under which the display device 140 is viewed from the electronic device:
He AM ERA DTV -CAMERA
=O
^CAMERA h-REAL
Assuming that the distance between the user and the electronic device 160 DusER-DEvicE is negligible compared to
then the user's angle of view (corresponding to how the display device 140 is viewed by the user) denoted by ω may be computed as follows:
^■CAMERA
ω =o -
HcAMERA
Fig. 6 shows a display device plane and a rendering screen plane of the electronic device 160 for which:
• h is the height at which the video frames of the augmented panel are to be rendered on the rendering screen of the electronic device 160; and
• H is the height of the portion of the augmented reality panel that is to be rendered on the rendering screen of the electronic device 160 and typically corresponds to the height of the rendering screen.
Similarly to the vertical "FoV" angle a for the camera, ω corresponds to the vertical "FoV" angle of the user and may be given by:
H
ω
'USER -DEVICE
Therefore, the height h at which the video frames of the augmented reality panel are to be rendered on the rendering screen of the electronic device 160 is given by:
, _ ^-CAMERA N
— 0 77 ^USER -DEVICE
N CAM ERA
where, a is an intrinsic property of the camera; 1ICAMJ¾A and HCAMJ¾A are given by the camera; and DUSER_DEVICE is fixed as a constant corresponding to the distance between the user and the electronic device 160.
All these equations show that the height h depends on hCAMERA and HCAMERA. It is then possible to derive the width w of the video frames. This width w typically depends on the ratio of the video R e.g. 16/9 and is given by:
w = h - R
In an embodiment of the present invention, the user typically holds the electronic device 160 in front of the display device 140 to perform the calibration procedure. However, in another embodiment of the present invention, the user is also able to perform the calibration procedure even if he is not located in front of the display device 140. In such a situation, hCAMERA and HCAMERA will be computed as an average between the left and right corresponding heights of the camera image. Also, in order to ease the image recognition and analysis process, the user is requested to orientate the electronic device 160 towards the direction of the display device 140 so that the display device 140 appears entirely in the camera image.
Reference is now made to Fig. 7, which is a pictorial illustration showing a relevant portion of the augmented reality panel to render on the rendering screen of the electronic device 160 according to an embodiment of the present invention.
The orientation module 170 e.g. gyroscope and accelerometer sensors of the electronic device 160 are used to determine which portion of the augmented reality panel to render on the rendering screen of the electronic device 160. This orientation module 170 is typically able to detect an orientation change of the electronic device 160 i.e. when the electronic device 160 is moved and/or turned around. The user holding the electronic device 160 can move in a plurality of directions. However, each user movement can be modeled as a movement around a sphere as shown in Fig. 7. This sphere is centered on the user and has a radius corresponding to the distance between the user and the electronic device 160. Therefore, the following user movements may be considered:
• rotating the electronic device 160 around the Y-axis parallel to the plane of the rendering screen corresponds to a left or right movement around the sphere and therefore left or right surrounding content is rendered on the rendering screen of the electronic device 160. This motion detection may be done efficiently in order to support a minimal frame rate of for example, but without limiting the generality of the invention, 15 images per second;
• rotating the electronic device around the X-axis parallel to the plane of the rendering screen corresponds to an up or down movement around the sphere to the top or bottom of the sphere and therefore top or bottom surrounding content is rendered on the rendering screen of the electronic device 160. This motion detection may be done efficiently in order to support a minimal frame rate of for example, but without limiting the generality of the invention, 15 images per second; and
• moving the electronic device 160 along the Z-axis perpendicular to the plane of the rendering screen typically corresponds to a zoom-in or zoom-out effect and is similar to a new calibration procedure.
For all these user movements, the motion of the electronic device 160 is detected by the orientation module 170. At each motion detection, the orientation module 170 sends to the client device 120 its change of position (delta position) from the initial position (i.e. the initial position indicating the direction of the display device 140 at the end of the calibration procedure represented by the central portion of the augmented reality panel in Fig. 7) both in terms of distance to the display device 140 (e.g. using the accelerometer sensor) and in terms of direction to the display device 140 (e.g. using the gyroscope sensor). The output from the motion detection may be:
• an update of the calibration procedure comprising updated dimensions and position of the video frames (and in turn, of the augmented reality panel) as well as an updated initial position. This typically corresponds to a movement along the Z-axis perpendicular to the plane of the rendering screen of the electronic device 160 for which a new calibration procedure is performed; and/or
• a difference of angle between the initial position and the new position. This typically corresponds to a movement around the X-axis and/or the Y-axis parallel to the plane of the rendering screen of the electronic device 160.
For both outputs, the relevant portion of the augmented reality panel is defined by the following parameters:
• height hpoRTioN and width WPORTION ^PORTION, WPORTION) dimensions. These dimensions are determined at the end of each calibration procedure (i.e. initial calibration procedure and/or when a movement of the electronic device 160 along the Z-axis is detected by the orientation module 170 e.g. by the accelerometer
sensor) using the computed dimensions (h, w) of the video frames and mAL. The relationship between these parameters is given below:
H _ h
^PORTION hREAL
and so,
_ H
^PORTION ' h-REAL
Also, the width WPORTION is given by:
wPORTION = ^PORTION ' R
· a position in the augmented reality panel defined by the coordinates XPORTION and ypoRTTON (^PORTION, YPORTION)- These coordinates are set to (0, 0) according to the initial direction obtained during the calibration procedure. Then, when a rotation of the electronic device 160 around an axis X and/or Y is detected, the orientation module 170 e.g. the gyroscope sensor determines the angular difference (βχ, βγ) with the initial direction. The new coordinates corresponding to the new relevant portion of the augmented reality panel to render are given by:
xPORTION = ^USER-DEVICE ' βχ
and,
PORTION = ^USER-DEVICE ' βγ
Those skilled in the art will appreciate that the coordinates are expressed with respect to a reference point set to correspond to the initial direction. The reference point may be any suitable point of the portion of the augmented reality panel such as for example, but without limiting the generality of the invention, the center of the portion, the top left corner, etc.
Therefore, when a user moves the electronic device 160 along the Z-axis perpendicular to the plane of the rendering screen of the electronic device 160, the dimensions of the relevant portion of the augmented reality panel to display (hPORTioN5 WPORTION) and the initial direction are updated. The coordinates (XPORTION^ YPORTON) are computed when the electronic device 160 is moved around the sphere, i.e. rotated around the X-axis and/or Y-axis parallel to the plane of the rendering screen of the electronic device 160. In this latter case, the dimensions (hPORTioN, WPORTION) of the augmented reality panel do not change.
Although the above embodiments have been described in the context of DVB implementation, someone skilled in the art will realize that other implementations are possible.
It is appreciated that various features of the invention which are, for clarity, described in the contexts of separate embodiments may also be provided in combination in a single embodiment. Conversely, various features of the invention which are, for brevity, described in the context of a single embodiment may also be provided separately or in any suitable subcombination.
It will be appreciated by persons skilled in the art that the present invention is not limited to what has been particularly shown and described hereinabove. Rather the scope of the invention is defined by the appended claims and equivalents thereof.
Claims
1. A method comprising:
displaying an event on a first display device associated with a client device, said event comprising at least one video scene;
enabling an electronic device to communicate with said client device;
receiving a user input requesting augmented reality content associated with said at least one video scene;
identifying a relevant portion of said augmented reality content to render on a second display associated with said electronic device, wherein said relevant portion comprises video frames corresponding to said at least one video scene and additional content surrounding said video frames; and
rendering said relevant portion on said second display device.
2. The method of claim 1, wherein said identifying comprises:
retrieving an augmented reality content panel corresponding to said augmented reality content associated with said at least one video scene;
determining a position and dimensions at which said relevant portion is to be rendered on said second display device; and
identifying said relevant portion according to said determined position and dimensions.
3. The method of claim 2, wherein said determining further comprises determining an initial direction corresponding to a direction of said first display device.
4. The method of claim 2, wherein said determining comprises determining a position and dimensions at which said video frames are to be rendered on said second display device.
5. The method of claim 2, wherein said determining is performed using a camera of said electronic device.
6. The method of claim 1, said method further comprising:
detecting an orientation change of said electronic device;
identifying a new relevant portion of said augmented reality content to render on the second display device associated with said electronic device, said new relevant portion being identified according to said detected orientation change; and
rendering said new relevant portion on said second display device.
7. The method of claim 6, wherein said detecting an orientation change comprises detecting a rotation of said electronic device around an axis parallel to a plane of said second display device.
8. The method of claim 7, wherein said rotation is detected by a gyroscope sensor of said electronic device.
9. The method of claim 6, wherein said detecting an orientation change comprises detecting a movement of said electronic device along an axis perpendicular to a plane of said second display device.
10. The method of claim 9, wherein said movement is detected by an accelerometer sensor of said electronic device.
11. The method of claim 6, wherein said identifying a new relevant portion comprises:
calculating coordinates of said new relevant portion of said augmented reality content with respect to an initial position of said relevant portion when said detected orientation change is a rotation of said electronic device around an axis parallel to a plane of said second display device; and
identifying said new relevant portion using said calculated coordinates.
12. The method of claim 6, wherein said identifying a new relevant portion comprises:
determining a position and dimensions at which said new relevant portion is to be rendered on said second display device when said detected orientation change is a movement of said electronic device along an axis perpendicular to a plane of said second display device; and
identifying said new relevant portion according to said determined position and dimensions.
13. A system comprising:
a first display device operable to display an event comprising at least one video scene;
a client device associated with said first display device and operable to: communicate with an electronic device; and to receive a user input requesting augmented reality content associated with said at least one video scene; and
an electronic device operable to identify a relevant portion of said augmented reality content, wherein said relevant portion comprises video frames corresponding to said at least one video scene and additional content surrounding said video frames; said electronic device further comprising a second display device operable to render said relevant portion.
14. One or more computer readable storage media encoded with software comprising computer executable instructions and when software is executed operable to:
display an event on a first display device associated with a client device, said event comprising at least one video scene;
enable an electronic device to communicate with said client device;
receive a user input requesting augmented reality content associated with said at least one video scene;
identify a relevant portion of said augmented reality content to render on a second display associated with said electronic device, wherein said relevant portion comprises video frames corresponding to said at least one video scene and additional content surrounding said video frames; and
render said relevant portion on said second display device.
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP13782846.3A EP2893706B1 (en) | 2012-09-04 | 2013-09-03 | Augmented reality for video system |
US14/425,327 US9747723B2 (en) | 2012-09-04 | 2013-09-03 | Augmented reality for video system |
CN201380046203.8A CN104604239B (en) | 2012-09-04 | 2013-09-03 | Augmented reality for video system |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261743431P | 2012-09-04 | 2012-09-04 | |
US61/743,431 | 2012-09-04 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2014037876A2 true WO2014037876A2 (en) | 2014-03-13 |
WO2014037876A3 WO2014037876A3 (en) | 2014-07-03 |
Family
ID=49485766
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2013/058254 WO2014037876A2 (en) | 2012-09-04 | 2013-09-03 | Augmented reality for video system |
Country Status (4)
Country | Link |
---|---|
US (1) | US9747723B2 (en) |
EP (1) | EP2893706B1 (en) |
CN (1) | CN104604239B (en) |
WO (1) | WO2014037876A2 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2533637A (en) * | 2014-12-24 | 2016-06-29 | Fountain Digital Labs Ltd | An interactive video system and a method of controlling an interactive video system |
US10467917B2 (en) | 2016-06-28 | 2019-11-05 | Fountain Digital Labs Limited | Interactive video system and a method of controlling an interactive video system based on a motion and a sound sensors |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9940009B2 (en) * | 2013-05-15 | 2018-04-10 | Sony Corporation | Display control device for scrolling of content based on sensor data |
WO2015012835A1 (en) * | 2013-07-25 | 2015-01-29 | Empire Technology Development, Llc | Composite display with mutliple imaging properties |
US10514773B2 (en) * | 2014-09-16 | 2019-12-24 | 3Drudder | Foot-operated controller, device and furniture comprising it, method of operating it |
US10043319B2 (en) | 2014-11-16 | 2018-08-07 | Eonite Perception Inc. | Optimizing head mounted displays for augmented reality |
US10055895B2 (en) * | 2016-01-29 | 2018-08-21 | Snap Inc. | Local augmented reality persistent sticker objects |
US10788791B2 (en) | 2016-02-22 | 2020-09-29 | Real View Imaging Ltd. | Method and system for displaying holographic images within a real object |
US10795316B2 (en) | 2016-02-22 | 2020-10-06 | Real View Imaging Ltd. | Wide field of view hybrid holographic display |
WO2017145158A1 (en) | 2016-02-22 | 2017-08-31 | Real View Imaging Ltd. | Zero order blocking and diverging for holographic imaging |
US11663937B2 (en) | 2016-02-22 | 2023-05-30 | Real View Imaging Ltd. | Pupil tracking in an image display system |
US10650591B1 (en) | 2016-05-24 | 2020-05-12 | Out of Sight Vision Systems LLC | Collision avoidance system for head mounted display utilized in room scale virtual reality system |
US10433196B2 (en) * | 2016-06-08 | 2019-10-01 | Bank Of America Corporation | System for tracking resource allocation/usage |
US10037077B2 (en) * | 2016-06-21 | 2018-07-31 | Disney Enterprises, Inc. | Systems and methods of generating augmented reality experiences |
US11017712B2 (en) | 2016-08-12 | 2021-05-25 | Intel Corporation | Optimized display image rendering |
US9928660B1 (en) | 2016-09-12 | 2018-03-27 | Intel Corporation | Hybrid rendering for a wearable display attached to a tethered computer |
US11164378B1 (en) * | 2016-12-08 | 2021-11-02 | Out of Sight Vision Systems LLC | Virtual reality detection and projection system for use with a head mounted display |
WO2018211494A1 (en) * | 2017-05-15 | 2018-11-22 | Real View Imaging Ltd. | System with multiple displays and methods of use |
US10565158B2 (en) * | 2017-07-31 | 2020-02-18 | Amazon Technologies, Inc. | Multi-device synchronization for immersive experiences |
CN107657990A (en) * | 2017-09-22 | 2018-02-02 | 中国科学院重庆绿色智能技术研究院 | A kind of auxiliary of operation record typing supports system and method |
CN107592520B (en) * | 2017-09-29 | 2020-07-10 | 京东方科技集团股份有限公司 | Imaging device and imaging method of AR equipment |
US10852836B2 (en) | 2017-11-30 | 2020-12-01 | Microsoft Technology Licensing, Llc | Visual transformation using a motion profile |
US11210826B2 (en) | 2018-02-02 | 2021-12-28 | Disney Enterprises, Inc. | Systems and methods to provide artificial intelligence experiences |
CN110852143B (en) * | 2018-08-21 | 2024-04-09 | 元平台公司 | Interactive text effects in an augmented reality environment |
KR102139331B1 (en) * | 2018-11-20 | 2020-07-29 | 주식회사 엘지유플러스 | Apparatus, server, and method for playing moving picture contents |
AU2020421629B2 (en) * | 2020-01-06 | 2024-03-14 | Pixovr, Corp. | Methods and systems configured for distribution of VR/MR/AR/XR content to end user devices via logical relationships and related management of end user devices |
KR20220078298A (en) * | 2020-12-03 | 2022-06-10 | 삼성전자주식회사 | Method for providing adaptive augmented reality streaming and apparatus for performing the same |
CN115942022B (en) * | 2021-08-27 | 2024-06-11 | 中移(苏州)软件技术有限公司 | Information preview method, related equipment and storage medium |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8547401B2 (en) | 2004-08-19 | 2013-10-01 | Sony Computer Entertainment Inc. | Portable augmented reality device and method |
GB0608939D0 (en) * | 2006-05-05 | 2006-06-14 | Sony Comp Entertainment Europe | Display apparatus and method |
US8913171B2 (en) * | 2010-11-17 | 2014-12-16 | Verizon Patent And Licensing Inc. | Methods and systems for dynamically presenting enhanced content during a presentation of a media content instance |
EP2485119A3 (en) * | 2011-02-02 | 2012-12-12 | Nintendo Co., Ltd. | Spatially-correlated multi-display human-machine interface |
JP5745340B2 (en) * | 2011-06-02 | 2015-07-08 | 任天堂株式会社 | GAME SYSTEM, GAME DEVICE, GAME PROGRAM, AND IMAGE GENERATION METHOD |
-
2013
- 2013-09-03 CN CN201380046203.8A patent/CN104604239B/en not_active Expired - Fee Related
- 2013-09-03 WO PCT/IB2013/058254 patent/WO2014037876A2/en active Application Filing
- 2013-09-03 EP EP13782846.3A patent/EP2893706B1/en active Active
- 2013-09-03 US US14/425,327 patent/US9747723B2/en active Active
Non-Patent Citations (1)
Title |
---|
None |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2533637A (en) * | 2014-12-24 | 2016-06-29 | Fountain Digital Labs Ltd | An interactive video system and a method of controlling an interactive video system |
GB2533637B (en) * | 2014-12-24 | 2021-05-19 | Fountain Digital Labs Ltd | An Interactive Video System and a Method of Controlling an Interactive Video System |
US10467917B2 (en) | 2016-06-28 | 2019-11-05 | Fountain Digital Labs Limited | Interactive video system and a method of controlling an interactive video system based on a motion and a sound sensors |
Also Published As
Publication number | Publication date |
---|---|
EP2893706B1 (en) | 2020-01-08 |
EP2893706A2 (en) | 2015-07-15 |
US9747723B2 (en) | 2017-08-29 |
US20150206350A1 (en) | 2015-07-23 |
CN104604239B (en) | 2019-03-22 |
WO2014037876A3 (en) | 2014-07-03 |
CN104604239A (en) | 2015-05-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9747723B2 (en) | Augmented reality for video system | |
US11308583B2 (en) | Systems, methods, and media for adjusting one or more images displayed to a viewer | |
US10499115B2 (en) | Video content switching and synchronization system and method for switching between multiple video formats | |
US10334162B2 (en) | Video processing apparatus for generating panoramic video and method thereof | |
US9271048B2 (en) | Systems and methods for immersive viewing experience | |
CN104010225B (en) | The method and system of display panoramic video | |
US8730354B2 (en) | Overlay video content on a mobile device | |
US20240205494A1 (en) | Gaze-responsive advertisement | |
US20150082187A1 (en) | Methods and systems for presenting direction-specific media assets | |
US9538246B2 (en) | Map your movie | |
US9930416B1 (en) | Playback device that requests and receives a user tailored video stream using content of a primary stream and an enhanced stream | |
EP3065413B1 (en) | Media streaming system and control method thereof | |
US20180096202A1 (en) | Systems and methods for recreating a reference image from a media asset | |
US10893206B1 (en) | User experience with digital zoom in video from a camera | |
US10887653B2 (en) | Systems and methods for performing distributed playback of 360-degree video in a plurality of viewing windows | |
KR20150073573A (en) | Method and apparatus for displaying contents related in mirroring picture | |
US9197844B2 (en) | User interface | |
US10440266B2 (en) | Display apparatus and method for generating capture image | |
KR20150090314A (en) | Advertising method using smart display having division area changing module | |
Kochale et al. | Gesture controlled interactive rendering in a panoramic scene | |
KR20150124425A (en) | Advertising method using smart display having division area changing module |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13782846 Country of ref document: EP Kind code of ref document: A2 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14425327 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2013782846 Country of ref document: EP |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13782846 Country of ref document: EP Kind code of ref document: A2 |