WO2023012745A1 - Telepresence through vr broadcast streams - Google Patents

Telepresence through vr broadcast streams Download PDF

Info

Publication number
WO2023012745A1
WO2023012745A1 PCT/IB2022/057318 IB2022057318W WO2023012745A1 WO 2023012745 A1 WO2023012745 A1 WO 2023012745A1 IB 2022057318 W IB2022057318 W IB 2022057318W WO 2023012745 A1 WO2023012745 A1 WO 2023012745A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
content
broadcast
display
flat
Prior art date
Application number
PCT/IB2022/057318
Other languages
French (fr)
Inventor
Brant Candelore
Adam Goldberg
Fred Ansfield
Graham Clift
Loren Pineda
Original Assignee
Sony Group Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US17/569,969 external-priority patent/US11818414B2/en
Application filed by Sony Group Corporation filed Critical Sony Group Corporation
Priority to EP22773025.6A priority Critical patent/EP4360326A1/en
Priority to JP2024506971A priority patent/JP2024532717A/en
Priority to CN202280010265.2A priority patent/CN116762347A/en
Priority to KR1020237030496A priority patent/KR20230145115A/en
Publication of WO2023012745A1 publication Critical patent/WO2023012745A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/816Monomedia components thereof involving special video data, e.g 3D video
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/4104Peripherals receiving signals from specially adapted client devices
    • H04N21/4122Peripherals receiving signals from specially adapted client devices additional display device, e.g. video projector
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/414Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
    • H04N21/41407Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213Monitoring of end-user related data
    • H04N21/44218Detecting physical presence or behaviour of the user, e.g. using sensors to detect if the user is leaving the room or changes his face expression during a TV program
    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F2300/00Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game
    • A63F2300/80Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game specially adapted for executing a specific type of game
    • A63F2300/8082Virtual reality

Definitions

  • This application relates to technical advances necessarily rooted in computer technology and directed to digital television, and more particularly to Advanced Television Systems Committee (ATSC) 3.0.
  • ATSC Advanced Television Systems Committee
  • ATSC 3.0 The Advanced Television Systems Committee (ATSC) 3.0 suite of standards is a set of over a dozen industry technical standards as indicated in A/300 for delivering the next generation of broadcast television.
  • ATSC 3.0 supports delivery of a wide range of television services including televised video, interactive services, non-real time delivery of data, and tailored advertising to a large number of receiving devices, from ultra-high-definition televisions to wireless telephones.
  • ATSC 3.0 also orchestrates coordination between broadcast content (referred to as “over the air” or OTA) and related broadband delivered content and services (referred to as “over the top” or OTT).
  • OTA broadcast content
  • OTT broadband delivered content and services
  • ATSC 3.0 is designed to be flexible so that as technology evolves, advances can be readily incorporated without requiring a complete overhaul of any related technical standard.
  • One hundred eighty/three hundred sixty-degree (180°/360°) field of view, 4K or 8K or higher resolution video can be panned on a PC, tablet or smart phone using a mouse or touchscreen.
  • the video can be panned with a hand controller or head motion.
  • VR goggles can sense the natural turning or tilting of the head in order to pan the image shown within the googles to make the wearer feel present at the location as if being “teleported”.
  • 180/360 video can be monoscopic or stereoscopic.
  • a flat display as used herein may present monoscopic or stereoscopic video, with stereoscopic video being used to render 3D video and monoscopic video typically being considered to be 2D video.
  • “Flat display” or “flat video” as used herein means a video in which the entire image that is processed is limited to a display, typically with an aspect ratio of 16x9, at whatever the received resolution is, with no ability to pan.
  • “flat” video may be considered to be the “main view” video of a 180/360 video. It might be where the director or content creator wants to focus the user’s attention.
  • the “surround video” is not shown.
  • the “flat video” plus the “surround video” make up the entirely of a 180/360 video.
  • Examples can include providing, along with a normal flat feed, a 180°/360° surround feed of live news events.
  • Live news events such as at a location of an accident, tornado, demonstration, political rally, etc. can be shown as a 360° feed for 180°/360°spherical representation on the goggles. And it is not just live news events, but live “human interest” stories where people and organizations are showcased.
  • a 360° feed can provide the viewer a better idea as to they are about because they are able to “look around” and see things that a flat feed cannot show.
  • people without goggles watch the normal flat video on a TV, people with goggles can pan around the entire scene by means of panning the 180°/360° surround feed.
  • panning may be accomplished using the remote control by pressing the up, down, left, and right buttons on the remote control.
  • the video can be monoscopic or stereoscopic. Stereoscopic video can deliver separate images to each eye, and this called “3D” video. 3D video can be supported with 180°/360° video, but not all 180°/360° video is 3D.
  • Present principles also apply to travel tourist content and to any show or content filmed in 180°/360° VR such as concerts, motor bike racing, snowboarding, or hiking.
  • Many VR videos are filmed monoscopically, e.g., delivering the same content to each eye, and often with the focus set to infinity which minimizes the nausea that users experienced with 3D. Monoscopic video can deliver a very satisfactory experience of “being there” and telepresence.
  • a digital television apparatus includes at least one receiver with at least one processor programmed with instructions to configure the processor to present a flat video of an event on a main display and present a one hundred eighty or three hundred sixty-degree (180° or 360°) video of the event on a virtual reality (VR) headset.
  • VR virtual reality
  • a method in digital television in which at least one receiver can receive broadcast signals, includes receiving at least a first broadcast video signal. The method also includes presenting content in the first broadcast video signal on at least one normal flat display and presenting 180°/360° video of the content on at least one display. The method includes, responsive to at least one command, panning the 180°/360° video on the at least one display.
  • the at least one display includes a virtual reality (VR) headset.
  • VR virtual reality
  • the at least one display includes the flat display.
  • the 180°/360° video is received in the first broadcast video stream, the first broadcast stream includes no other video of the content, and the panning includes panning video on the flat display.
  • the 180°/360° video is received in the first broadcast video stream, the first broadcast stream includes no other video of the content, and the 180°/360° video is presented as flat video on the flat display as a backwardly compatible content view.
  • the 180°/360° video is received in the first broadcast video stream and content therein is presented on the flat display, and the method includes sending the content from the flat display to the VR headset.
  • the 180°/360° video is received in the first broadcast video stream and content therein is presented on the flat display, and the method includes sending the content to the VR headset without passing through the flat display.
  • the 180°/360° video may be received in the first broadcast video stream along with a flat feed of content in the 180°/360° video.
  • the 180°/360° video may be received from an over-the-top (OTT) source.
  • OTT over-the-top
  • an apparatus in another aspect, includes at least one receiver assembly configured to receive at least a first broadcast video signal and present content in the first broadcast video signal on at least one flat display.
  • the receiver assembly also is configured to present 180°/360° video of the content on at least one display. Responsive to at least one command, the assembly is configured to pan the 180°/360° video on the at least one display.
  • FIG 1 illustrates an Advanced Television Systems Committee (ATSC) 3.0 system
  • FIG. 1 illustrates components of the devices shown in Figure 1;
  • Figure 3 illustrates a first technique for providing telepresence video
  • Figure 4 illustrates a second technique for providing telepresence video
  • FIG. 5 illustrates an example specific system consistent with present principles
  • Figure 6 illustrates example logic in example flow chart format consistent with present principles
  • Figure 7 illustrates a receiver system with two displays, one for presenting a flat view of an event and the other for providing a 360-degree VR telepresence view of the event;
  • Figure 8 illustrates a first technique for providing 360° telepresence video as flat video on a TV that can be panned
  • Figure 9 illustrates a second technique for providing 360° telepresence video as flat backwardly compatible video on a TV
  • Figure 10 illustrates a third technique for providing 360° telepresence video on a TV and on VR goggles.
  • Figure 11 illustrates a fourth technique for providing 360° telepresence video directly to VR goggles.
  • An example system herein may include ATSC 3.0 source components and client components, connected via broadcast and/or over a network such that data may be exchanged between the client and ATSC 3.0 source components.
  • the client components may include one or more computing devices including portable televisions (e.g., smart TVs, Internet-enabled TVs), portable computers such as laptops and tablet computers, and other mobile devices including smart phones and additional examples discussed below.
  • portable televisions e.g., smart TVs, Internet-enabled TVs
  • portable computers such as laptops and tablet computers
  • other mobile devices including smart phones and additional examples discussed below.
  • These client devices may operate with a variety of operating environments.
  • some of the client computers may employ, as examples, operating systems from Microsoft, or a Unix operating system, or operating systems produced by Apple Computer or Google, such as Android®.
  • These operating environments may be used to execute one or more browsing programs, such as a browser made by Microsoft or Google or Mozilla or other browser program that can access websites hosted by the Internet servers discussed below.
  • ATSC 3.0 source components may include broadcast transmission components and servers and/or gateways that may include one or more processors executing instructions that configure the source components to broadcast data and/or to transmit data over a network such as the Internet.
  • a client component and/or a local ATSC 3.0 source component may be instantiated by a game console such as a Sony PlayStation®, a personal computer, etc. Information may be exchanged over a network between the clients and servers.
  • servers and/or clients can include firewalls, load balancers, temporary storages, and proxies, and other network infrastructure for reliability and security.
  • instructions refer to computer-implemented steps for processing information in the system. Instructions can be implemented in software, firmware or hardware and include any type of programmed step undertaken by components of the system.
  • a processor may be a single- or multi-chip processor that can execute logic by means of various lines such as address lines, data lines, and control lines and registers and shift registers.
  • Software modules described by way of the flow charts and user interfaces herein can include various sub-routines, procedures, etc. Without limiting the disclosure, logic stated to be executed by a particular module can be redistributed to other software modules and/or combined together in a single module and/or made available in a shareable library. While flow chart format may be used, it is to be understood that software may be implemented as a state machine or other logical method.
  • logical blocks, modules, and circuits can be implemented or performed with a general-purpose processor, a digital signal processor (DSP), a field programmable gate array (FPGA) or other programmable logic device such as an application specific integrated circuit (ASIC), discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein.
  • DSP digital signal processor
  • FPGA field programmable gate array
  • ASIC application specific integrated circuit
  • a processor can be implemented by a controller or state machine or a combination of computing devices.
  • HTTP hypertext markup language
  • ROM read-only memory
  • EEPROM electrically erasable programmable read-only memory
  • CD-ROM compact disk read-only memory
  • DVD digital versatile disc
  • USB universal serial bus
  • a connection may establish a computer-readable medium.
  • Such connections can include, as examples, hard-wired cables including fiber optics and coaxial wires and digital subscriber line (DSL) and twisted pair wires.
  • a recitation of “having at least one of A, B, and C” includes A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.
  • Machine learning models consistent with present principles may use various algorithms trained in ways that include supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, feature learning, self-learning, and other forms of learning. Examples of such algorithms, which can be implemented by computer circuitry, include one or more neural networks, such as a convolutional neural network (CNN), a recurrent neural network (RNN), and a type of RNN known as a long short-term memory (LSTM) network. Support vector machines (SVM) and Bayesian networks also may be considered to be examples of machine learning models.
  • CNN convolutional neural network
  • RNN recurrent neural network
  • LSTM long short-term memory
  • SVM Support vector machines
  • Bayesian networks also may be considered to be examples of machine learning models.
  • performing machine learning may therefore involve accessing and then training a model on training data to enable the model to process further data to make inferences.
  • An artificial neural network/artificial intelligence model trained through machine learning may thus include an input layer, an output layer, and multiple hidden layers in between that that are configured and weighted to make inferences about an appropriate output.
  • an example of an ATSC 3.0 source component is labeled “broadcaster equipment” 10 and may include over-the-air (OTA) equipment 12 for wirelessly broadcasting, typically via orthogonal frequency division multiplexing (OFDM) in a one-to-many relationship, television data to plural receivers 14 such as ATSC 3.0 televisions.
  • OTA over-the-air
  • OFDM orthogonal frequency division multiplexing
  • One or more receivers 14 may communicate with one or more companion devices 16 such as remote controls, tablet computers, mobile telephones, virtual reality (VR) headsets such as computer simulation 180°/360° headsets such as computer game headsets such as a Sony PlayStation® headset and the like over a short range, typically wireless link 18 that may be implemented by Bluetooth®, low energy Bluetooth, Wireless HDMI, Amimon wireless HD video link, other near field communication (NFC) protocol, infrared (IR), etc.
  • companion devices 16 such as remote controls, tablet computers, mobile telephones, virtual reality (VR) headsets such as computer simulation 180°/360° headsets such as computer game headsets such as a Sony PlayStation® headset and the like over a short range
  • typically wireless link 18 that may be implemented by Bluetooth®, low energy Bluetooth, Wireless HDMI, Amimon wireless HD video link, other near field communication (NFC) protocol, infrared (IR), etc.
  • one or more of the receivers 14 may communicate, via a wired and/or wireless network link 20 such as the Internet, with over-the-top (OTT) equipment 22 of the broadcaster equipment 10 typically in a one-to-one relationship.
  • the OTA equipment 12 may be co-located with the OTT equipment 22 or the two sides 12, 22 of the broadcaster equipment 10 may be remote from each other and may communicate with each other through appropriate means.
  • a receiver 14 may receive ATSC 3.0 television signals OTA over a tuned-to ATSC 3.0 television channel and may also receive related content, including television, OTT (broadband).
  • computerized devices described in all of the figures herein may include some or all of the components set forth for various devices in Figures 1 and 2.
  • Figure 2 illustrates an example protocol stack that may be implemented by a combination of hardware and software.
  • broadcasters can send hybrid service delivery in which one or more program elements are delivered via a computer network (referred to herein as “broadband” and “over-the-top” (OTT)) as well as via a wireless broadcast (referred to herein as “broadcast” and “over-the-air” (OTA)).
  • Figure 2 also illustrates an example stack with hardware that may be embodied by a receiver.
  • the application layer 204 can include one or more software applications written in, e.g., HTML5/Javascript running in a runtime environment.
  • the applications in the application stack 204 may include linear TV applications, interactive service applications, companion screen applications, personalization applications, emergency alert applications, and usage reporting applications.
  • the applications typically are embodied in software that represents the elements that the viewer experiences, including video coding, audio coding and the run-time environment.
  • an application may be provided that enables a user to control dialog, use alternate audio tracks, control audio parameters such as normalization and dynamic range, and so on.
  • the presentation layer 206 includes, on the broadcast (OTA) side, broadcast audio-video playback devices referred to as Media Processing Units (MPU) 208 that, when implemented in a receiver, decode and playback, on one or more displays and speakers, wirelessly broadcast audio video content.
  • the MPU 208 is configured to present International Organization for Standardization (ISO) base media file format (BMFF) data representations 210 and video in high efficiency video coding (HEVC) with audio in, e.g., Dolby audio compression (AC-4) format.
  • ISO BMFF is a general file structure for time-based media files broken into “segments” and presentation metadata. Each of the files is essentially a collection of nested objects each with a type and a length.
  • the MPU 208 may access a broadcast side encrypted media extension (EME)/common encryption (CENC) module 212.
  • EME broadcast side encrypted media extension
  • CENC common encryption
  • Figure 2 further illustrates that on the broadcast side the presentation layer 206 may include signaling modules, including either motion pictures expert group (MPEG) media transport protocol (MMTP) signaling module 214 or real-time object delivery over unidirectional transport (ROUTE) signaling module 216 for delivering non-real time (NRT) content 218 that is accessible to the application layer 204.
  • NRT content may include but is not limited to stored replacement advertisements.
  • the presentation layer 206 can include one or more dynamic adaptive streaming over hypertext transfer protocol (HTTP) (DASH) player/decoders 220 for decoding and playing audio-video content from the Internet.
  • DASH dynamic adaptive streaming over hypertext transfer protocol
  • the DASH player 220 may access a broadband side EME/CENC module 222.
  • the DASH content may be provided as DASH segments 224 in ISO/BMFF format.
  • DASH segments encode various media content components types such as audio, video, or text. It is anticipated that there could be more than one video type - one encompassing the legacy main view - and new video type encompassing the surround video.
  • Legacy players/decoders decode the legacy main view based legacy video type, while enhanced players pan and decode the additional video type that includes the surround video.
  • the broadband side of the presentation layer 206 may include NRT content in files 226 and may also include signaling objects 228 for providing play back signaling.
  • the session layer 230 includes, on the broadcast side, either MMTP protocol 232 or ROUTE protocol 234. Note that the ATSC standard provides an option to use MPEG MMT for transport, though it is not shown here.
  • the session layer 230 includes HTTP protocol 236 which may be implemented as HTTP-secure (HTTP(S)).
  • HTTP HTTP-secure
  • the broadcast side of the session layer 230 also may employ a HTTP proxy module 238 and a service list table (SLT) 240.
  • SLT 240 includes a table of signaling information which is used to build a basic service listing and provide bootstrap discovery of the broadcast content.
  • Media presentation descriptions (MPD) are included in the “ROUTE Signaling” tables delivered over user datagram protocol (UDP) by the ROUTE transport protocol.
  • a transport layer 242 is below the session layer 230 in the protocol stack for establishing low-latency and loss-tolerating connections.
  • the transport layer 242 uses (UDP 244 and on the broadband side transmission control protocol (TCP) 246.
  • the example non-limiting protocol stack shown in Figure 2 also includes a network layer 248 below the transport layer 242.
  • the network layer 248 uses Internet protocol (IP) on both sides for IP packet communication, with multicast delivery being typical on the broadcast side and unicast being typical on the broadband side.
  • IP Internet protocol
  • the physical layer 250 which includes broadcast transmission/receive equipment 252 and computer network interface(s) 254 for communicating on the respective physical media associated with the two sides.
  • the physical layer 250 converts Internet Protocol (IP) packets to be suitable to be transported over the relevant medium and may add forward error correction functionality to enable error correction at the receiver as well as contain modulation and demodulation modules to incorporate modulation and demodulation functionalities. This converts bits into symbols for long distance transmission as well as to increase bandwidth efficiency.
  • IP Internet Protocol
  • the physical layer 250 typically includes a wireless broadcast transmitter to broadcast data wirelessly using orthogonal frequency division multiplexing (OFDM) while on the OTT side the physical layer 250 includes computer transmission components to send data over the Internet.
  • OFDM orthogonal frequency division multiplexing
  • DASH-IF DASH Industry Forum
  • HTTP/TCP/IP DASH Industry Forum
  • Media files in the DASH-IF profile based on the ISO BMFF may be used as the delivery, media encapsulation and synchronization format for both broadcast and broadband delivery.
  • Each receiver 14 typically includes a protocol stack that is complementary to that of the broadcaster equipment.
  • a receiver 14 in Figure 1 may include, as shown in Figure 2, an Internet-enabled TV with an ATSC 3.0 TV tuner (equivalently, set top box controlling a TV) 256.
  • the receiver 14 may be an Android®-based system.
  • the receiver 14 alternatively may be implemented by a computerized Internet enabled (“smart”) telephone, a tablet computer, a notebook computer, a wearable computerized device, and so on. Regardless, it is to be understood that the receiver 14 and/or other computers described herein is configured to undertake present principles (e.g., communicate with other devices to undertake present principles, execute the logic described herein, and perform any other functions and/or operations described herein).
  • the receiver 14 can be established by some or all of the components shown in Figure 1.
  • the receiver 14 can include one or more displays 258 that may be implemented by a high definition or ultra-high definition “4K” or higher flat screen and that may or may not be touch-enabled for receiving user input signals via touches on the display.
  • the receiver 14 may also include one or more speakers 260 for outputting audio in accordance with present principles, and at least one additional input device 262 such as, e.g., an audio receiver/micr ophone for, e.g., entering audible commands to the receiver 14 to control the receiver 14.
  • the example receiver 14 may further include one or more network interfaces 264 for communication over at least one network such as the Internet, a WAN, a LAN, a PAN etc. under control of one or more processors 266.
  • the interface 264 may be, without limitation, a Wi-Fi transceiver, which is an example of a wireless computer network interface, such as but not limited to a mesh network transceiver.
  • the interface 264 may be, without limitation, a Bluetooth® transceiver, Zigbee® transceiver, Infrared Data Association (IrDA) transceiver, Wireless USB transceiver, wired USB, wired LAN, Powerline or Multimedia over Coax Alliance (MoCA).
  • IrDA Infrared Data Association
  • the processor 266 controls the receiver 14 to undertake present principles, including the other elements of the receiver 14 described herein such as, for instance, controlling the display 258 to present images thereon and receiving input therefrom.
  • the network interface 264 may be, e.g., a wired or wireless modem or router, or other appropriate interface such as, e.g., a wireless telephony transceiver, or Wi-Fi transceiver as mentioned above, etc.
  • the receiver 14 may also include one or more input ports 268 such as a high-definition multimedia interface (HDMI) port or a USB port to physically connect (using a wired connection) to another CE device and/or a headphone port to connect headphones to the receiver 14 for presentation of audio from the receiver 14 to a user through the headphones.
  • the input port 268 may be connected via wire or wirelessly to a cable or satellite source of audio video content.
  • the source may be a separate or integrated set top box, or a satellite receiver.
  • the source may be a game console or disk player.
  • the receiver 14 may further include one or more computer memories 270 such as disk-based or solid-state storage that are not transitory signals, in some cases embodied in the chassis of the receiver as standalone devices or as a personal video recording device (PVR) or video disk player either internal or external to the chassis of the receiver for playing back audio video (AV) programs or as removable memory media.
  • the receiver 14 can include a position or location receiver 272 such as but not limited to a cellphone receiver, global positioning satellite (GPS) receiver, and/or altimeter that is configured to e.g. receive geographic position information from at least one satellite or cellphone tower and provide the information to the processor 266 and/or determine an altitude at which the receiver 14 is disposed in conjunction with the processor 266.
  • GPS global positioning satellite
  • altimeter that is configured to e.g. receive geographic position information from at least one satellite or cellphone tower and provide the information to the processor 266 and/or determine an altitude at which the receiver 14 is disposed in conjunction with the processor 266.
  • another suitable position receiver other than
  • the receiver 14 may include one or more cameras 274 that may include one or more of a thermal imaging camera, a digital camera such as a webcam, and/or a camera integrated into the receiver 14 and controllable by the processor 266 to gather pictures/images and/or video in accordance with present principles.
  • a Bluetooth® transceiver 276 or other Near Field Communication (NFC) element for communication with other devices using Bluetooth® and/or NFC technology, respectively.
  • NFC element can be a radio frequency identification (RFID) element.
  • the receiver 14 may include one or more auxiliary sensors 278 (such as a motion sensor such as an accelerometer, gyroscope, cyclometer, or a magnetic sensor or inertial measurement unit (IMU) and combinations thereof), an infrared (IR) sensor for receiving IR commands from a remote control, an optical sensor, a speed and/or cadence sensor, a gesture sensor (for sensing gesture commands) and so on providing input to the processor 266.
  • An IR sensor 280 may be provided to receive commands from a wireless remote control.
  • a battery (not shown) may be provided for powering the receiver 14.
  • the companion device 16 may incorporate some or all of the elements shown in relation to the receiver 14 described above.
  • a live event 300 may be filmed for broadcast by an event camera 302 that transmits real, normal flat images via an OTA broadcast transmitter 304 to one or more receivers 306.
  • a movable one hundred eighty/three hundred sixty-degree (180°/360°) 4K or 8K or higher camera 308 films the event 300 in every direction and sends its output to the receiver 306 via an OTT source 310 such as an Internet server.
  • the (180°/360°) 4K or 8K or higher camera 308 may provide a surround view of the event and may also provide a greater elevation view of the event than provided by the flat camera 302.
  • the (180°/360°) 4K or 8K or higher camera 308 may be mounted on a drone to film the event 300.
  • the (180°/360°) 4K or 8K or higher camera 308 may output 180°/360° video if desired.
  • AC-4 audio for example may accompany the (180°/360°) 4K or 8K or higher video.
  • the (180°/360°) 4K or 8K or higher camera 308 may capture the event 300 in a large angle format.
  • Figure 4 illustrates an embodiment in which both outputs of the flat camera 302 and (180°/360°) 4K or 8K or higher camera 308 are sent via OTA broadcast 304 to the receiver 306.
  • the two video streams produced by the cameras 302, 308 may be sent in a single physical layer pipe (PLP) or related PLPs and may be accompanied by signaling that both video streams are available, identifying each one for what it is, e.g., flat or (180°/360°) 4K or 8K.
  • PLP physical layer pipe
  • the flat camera 302 may be omitted from Figure 4 and only the video stream from the (180°/360°) 4K or 8K or higher camera 308 may be sent OTA to receivers for processing as described below.
  • the stream can have a main view using the main video content type.
  • Figure 5 illustrates that a receiver assembly may include an OTA receiver 500, and OTT receiver 502, and a flat display 504 such as a TV.
  • the receiver assembly may further include virtual reality (VR) headset 506 such as goggles and one or more devices for panning video such as 180°/360° video presented on the VR headset 506.
  • VR virtual reality
  • These devices may include a point and click mechanism such as a joystick or buttons on a remote control (RC) 508 for the TV, a touchscreen on a smart phone 510, or a touchscreen or touchpad 512 on the headset 506.
  • the pan devices may also include one or more motion sensors 514 in the headset 506 or RC 508.
  • the headset may have a RC 508 in lieu of a touchscreen or touchpad 512 to accomplish panning. Combinations of the above maybe used for panning.
  • Each of the components shown in Figure 5 may include appropriate components from Figure 2 including processors, storage devices, and network interfaces.
  • Figure 6 illustrates example logic that may be executed by one or more processors in the receiver assembly of Figure 5.
  • a flat video stream is received, e.g., OTA or derived from a single (180°/360°) 4K or 8K or higher video stream when no flat event camera 302 is provided.
  • the (180°/360°) 4K or 8K or higher video stream can have a main view which is essentially the flat view stream.
  • This flat stream may be presented on, e.g., the TV 504 in figure 5 at block 602.
  • the (180°/360°) 4K or 8K or higher video stream is received OTA or OTT as described above and any pan signals received at block 606.
  • the (180°/360°) 4K or 8K or higher 180°/360° video may be presented at block 308 on the VR headset 506 shown in Figure 5.
  • Figure 7 illustrates that the flat stream of the event 300 in Figure 3 can be presented on the TV 504 while simultaneously a (180°/360°) 4K or 8K or higher 180°/360° version of the same event 300 can be presented on the headset 506 with a panning feature indicated by the arrow 700 to allow a wearer of the headset 506 to be virtually present at the event 300 and move his head or manipulate another panning device, e.g. RC 508, to “look around” the event as if he was actually there.
  • panning includes both azimuthal panning and elevational panning.
  • Figure 8 illustrates a first technique in which (180°/360°) 4K or 8K or higher video 800 is received OTA by the TV 504 for presentation as flat video on the TV.
  • a viewer can pan the video on the TV 504 using any of the input devices described above, e.g., joystick or buttons on the RC 508, touchscreen on the smart phone 510, etc.
  • Figure 9 illustrates the (180°/360°) 4K or 8K or higher video 800 is received OTA by the TV 504 and rendered on the TV as flat to establish a backward compatible view without panning.
  • Figure 10 illustrates (180°/360°) 4K or 8K or higher video 800 may be received OTA by the TV 504 and rendered into flat for presentation on the TV, with the (180°/360°) 4K or 8K or higher video 800 being relayed from the TV to the headset 506, e.g., via Bluetooth or Wi-Fi, for presentation of 180°/360° on the headset that can pan the view as described.
  • the headset 506 e.g., via Bluetooth or Wi-Fi
  • Figure 11 illustrates (180°/360°) 4K or 8K or higher video 800 may be received OTA or OTT directly by the headset 506 for presentation of 180°/360° on the headset that can pan the view as described, without the need for the TV 504 to intervene to relay the 180°/360° video.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Social Psychology (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Techniques are described for expanding and/or improving the Advanced Television Systems Committee (ATSC) 3.0 television protocol in robustly delivering the next generation broadcast television services. Telepresence is provided through over-the-air (OTA) virtual reality (VR) broadcast streams.

Description

TELEPRESENCE THROUGH VR BROADCAST STREAMS
FIELD
This application relates to technical advances necessarily rooted in computer technology and directed to digital television, and more particularly to Advanced Television Systems Committee (ATSC) 3.0.
BACKGROUND
The Advanced Television Systems Committee (ATSC) 3.0 suite of standards is a set of over a dozen industry technical standards as indicated in A/300 for delivering the next generation of broadcast television. ATSC 3.0 supports delivery of a wide range of television services including televised video, interactive services, non-real time delivery of data, and tailored advertising to a large number of receiving devices, from ultra-high-definition televisions to wireless telephones. ATSC 3.0 also orchestrates coordination between broadcast content (referred to as “over the air” or OTA) and related broadband delivered content and services (referred to as “over the top” or OTT). ATSC 3.0 is designed to be flexible so that as technology evolves, advances can be readily incorporated without requiring a complete overhaul of any related technical standard.
SUMMARY
One hundred eighty/three hundred sixty-degree (180°/360°) field of view, 4K or 8K or higher resolution video can be panned on a PC, tablet or smart phone using a mouse or touchscreen. Using goggles, the video can be panned with a hand controller or head motion. VR goggles can sense the natural turning or tilting of the head in order to pan the image shown within the googles to make the wearer feel present at the location as if being “teleported”.
As envisioned herein, 180/360 video can be monoscopic or stereoscopic. A flat display as used herein may present monoscopic or stereoscopic video, with stereoscopic video being used to render 3D video and monoscopic video typically being considered to be 2D video. “Flat display” or “flat video” as used herein means a video in which the entire image that is processed is limited to a display, typically with an aspect ratio of 16x9, at whatever the received resolution is, with no ability to pan. Thus, “flat” video may be considered to be the “main view” video of a 180/360 video. It might be where the director or content creator wants to focus the user’s attention. The “surround video” is not shown. The “flat video” plus the “surround video” make up the entirely of a 180/360 video.
Examples can include providing, along with a normal flat feed, a 180°/360° surround feed of live news events. Live news events such as at a location of an accident, tornado, demonstration, political rally, etc. can be shown as a 360° feed for 180°/360°spherical representation on the goggles. And it is not just live news events, but live “human interest” stories where people and organizations are showcased. A 360° feed can provide the viewer a better idea as to they are about because they are able to “look around” and see things that a flat feed cannot show. As people without goggles watch the normal flat video on a TV, people with goggles can pan around the entire scene by means of panning the 180°/360° surround feed. For TVs that support the display of 180°/360° video, panning may be accomplished using the remote control by pressing the up, down, left, and right buttons on the remote control. The video can be monoscopic or stereoscopic. Stereoscopic video can deliver separate images to each eye, and this called “3D” video. 3D video can be supported with 180°/360° video, but not all 180°/360° video is 3D. Present principles also apply to travel tourist content and to any show or content filmed in 180°/360° VR such as concerts, motor bike racing, snowboarding, or hiking. Many VR videos are filmed monoscopically, e.g., delivering the same content to each eye, and often with the focus set to infinity which minimizes the nausea that users experienced with 3D. Monoscopic video can deliver a very satisfactory experience of “being there” and telepresence.
Accordingly, a digital television apparatus includes at least one receiver with at least one processor programmed with instructions to configure the processor to present a flat video of an event on a main display and present a one hundred eighty or three hundred sixty-degree (180° or 360°) video of the event on a virtual reality (VR) headset.
In another aspect, in digital television in which at least one receiver can receive broadcast signals, a method includes receiving at least a first broadcast video signal. The method also includes presenting content in the first broadcast video signal on at least one normal flat display and presenting 180°/360° video of the content on at least one display. The method includes, responsive to at least one command, panning the 180°/360° video on the at least one display.
In some embodiments the at least one display includes a virtual reality (VR) headset.
In some embodiments the at least one display includes the flat display. In some implementations the 180°/360° video is received in the first broadcast video stream, the first broadcast stream includes no other video of the content, and the panning includes panning video on the flat display.
In other embodiments the 180°/360° video is received in the first broadcast video stream, the first broadcast stream includes no other video of the content, and the 180°/360° video is presented as flat video on the flat display as a backwardly compatible content view.
In some implementations the 180°/360° video is received in the first broadcast video stream and content therein is presented on the flat display, and the method includes sending the content from the flat display to the VR headset. In other implementations the 180°/360° video is received in the first broadcast video stream and content therein is presented on the flat display, and the method includes sending the content to the VR headset without passing through the flat display.
The 180°/360° video may be received in the first broadcast video stream along with a flat feed of content in the 180°/360° video. Or, the 180°/360° video may be received from an over-the-top (OTT) source.
In another aspect, an apparatus includes at least one receiver assembly configured to receive at least a first broadcast video signal and present content in the first broadcast video signal on at least one flat display. The receiver assembly also is configured to present 180°/360° video of the content on at least one display. Responsive to at least one command, the assembly is configured to pan the 180°/360° video on the at least one display. The details of the present application, both as to its structure and operation, can best be understood in reference to the accompanying drawings, in which like reference numerals refer to like parts, and in which:
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 illustrates an Advanced Television Systems Committee (ATSC) 3.0 system;
Figure 2 illustrates components of the devices shown in Figure 1;
Figure 3 illustrates a first technique for providing telepresence video;
Figure 4 illustrates a second technique for providing telepresence video;
Figure 5 illustrates an example specific system consistent with present principles;
Figure 6 illustrates example logic in example flow chart format consistent with present principles;
Figure 7 illustrates a receiver system with two displays, one for presenting a flat view of an event and the other for providing a 360-degree VR telepresence view of the event;
Figure 8 illustrates a first technique for providing 360° telepresence video as flat video on a TV that can be panned;
Figure 9 illustrates a second technique for providing 360° telepresence video as flat backwardly compatible video on a TV;
Figure 10 illustrates a third technique for providing 360° telepresence video on a TV and on VR goggles; and
Figure 11 illustrates a fourth technique for providing 360° telepresence video directly to VR goggles.
DETAILED DESCRIPTION
This disclosure relates to technical advances in digital television such as in Advanced Television Systems Committee (ATSC) 3.0 television. An example system herein may include ATSC 3.0 source components and client components, connected via broadcast and/or over a network such that data may be exchanged between the client and ATSC 3.0 source components. The client components may include one or more computing devices including portable televisions (e.g., smart TVs, Internet-enabled TVs), portable computers such as laptops and tablet computers, and other mobile devices including smart phones and additional examples discussed below. These client devices may operate with a variety of operating environments. For example, some of the client computers may employ, as examples, operating systems from Microsoft, or a Unix operating system, or operating systems produced by Apple Computer or Google, such as Android®. These operating environments may be used to execute one or more browsing programs, such as a browser made by Microsoft or Google or Mozilla or other browser program that can access websites hosted by the Internet servers discussed below.
ATSC 3.0 source components may include broadcast transmission components and servers and/or gateways that may include one or more processors executing instructions that configure the source components to broadcast data and/or to transmit data over a network such as the Internet. A client component and/or a local ATSC 3.0 source component may be instantiated by a game console such as a Sony PlayStation®, a personal computer, etc. Information may be exchanged over a network between the clients and servers. To this end and for security, servers and/or clients can include firewalls, load balancers, temporary storages, and proxies, and other network infrastructure for reliability and security.
As used herein, instructions refer to computer-implemented steps for processing information in the system. Instructions can be implemented in software, firmware or hardware and include any type of programmed step undertaken by components of the system.
A processor may be a single- or multi-chip processor that can execute logic by means of various lines such as address lines, data lines, and control lines and registers and shift registers.
Software modules described by way of the flow charts and user interfaces herein can include various sub-routines, procedures, etc. Without limiting the disclosure, logic stated to be executed by a particular module can be redistributed to other software modules and/or combined together in a single module and/or made available in a shareable library. While flow chart format may be used, it is to be understood that software may be implemented as a state machine or other logical method.
Present principles described herein can be implemented as hardware, software, firmware, or combinations thereof; hence, illustrative components, blocks, modules, circuits, and steps are set forth in terms of their functionality.
Further to what has been alluded to above, logical blocks, modules, and circuits can be implemented or performed with a general-purpose processor, a digital signal processor (DSP), a field programmable gate array (FPGA) or other programmable logic device such as an application specific integrated circuit (ASIC), discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A processor can be implemented by a controller or state machine or a combination of computing devices.
The functions and methods described below, when implemented in software, can be written in an appropriate language such as but not limited to hypertext markup language (HTML)-5, Java®/Javascript, C# or C++, and can be stored on or transmitted through a computer-readable storage medium such as a random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), compact disk read-only memory (CD-ROM) or other optical disk storage such as digital versatile disc (DVD), magnetic disk storage or other magnetic storage devices including removable universal serial bus (USB) thumb drives, etc. A connection may establish a computer-readable medium. Such connections can include, as examples, hard-wired cables including fiber optics and coaxial wires and digital subscriber line (DSL) and twisted pair wires.
Components included in one embodiment can be used in other embodiments in any appropriate combination. For example, any of the various components described herein and/or depicted in the Figures may be combined, interchanged, or excluded from other embodiments.
A recitation of “having at least one of A, B, and C" (likewise "having at least one of A, B, or C" and "having at least one of A, B, C") includes A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.
Present principles may employ various machine learning models, including deep learning models. Machine learning models consistent with present principles may use various algorithms trained in ways that include supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, feature learning, self-learning, and other forms of learning. Examples of such algorithms, which can be implemented by computer circuitry, include one or more neural networks, such as a convolutional neural network (CNN), a recurrent neural network (RNN), and a type of RNN known as a long short-term memory (LSTM) network. Support vector machines (SVM) and Bayesian networks also may be considered to be examples of machine learning models.
As understood herein, performing machine learning may therefore involve accessing and then training a model on training data to enable the model to process further data to make inferences. An artificial neural network/artificial intelligence model trained through machine learning may thus include an input layer, an output layer, and multiple hidden layers in between that that are configured and weighted to make inferences about an appropriate output.
Turning to Figure 1, an example of an ATSC 3.0 source component is labeled “broadcaster equipment” 10 and may include over-the-air (OTA) equipment 12 for wirelessly broadcasting, typically via orthogonal frequency division multiplexing (OFDM) in a one-to-many relationship, television data to plural receivers 14 such as ATSC 3.0 televisions. One or more receivers 14 may communicate with one or more companion devices 16 such as remote controls, tablet computers, mobile telephones, virtual reality (VR) headsets such as computer simulation 180°/360° headsets such as computer game headsets such as a Sony PlayStation® headset and the like over a short range, typically wireless link 18 that may be implemented by Bluetooth®, low energy Bluetooth, Wireless HDMI, Amimon wireless HD video link, other near field communication (NFC) protocol, infrared (IR), etc.
Also, one or more of the receivers 14 may communicate, via a wired and/or wireless network link 20 such as the Internet, with over-the-top (OTT) equipment 22 of the broadcaster equipment 10 typically in a one-to-one relationship. The OTA equipment 12 may be co-located with the OTT equipment 22 or the two sides 12, 22 of the broadcaster equipment 10 may be remote from each other and may communicate with each other through appropriate means. In any case, a receiver 14 may receive ATSC 3.0 television signals OTA over a tuned-to ATSC 3.0 television channel and may also receive related content, including television, OTT (broadband). Note that computerized devices described in all of the figures herein may include some or all of the components set forth for various devices in Figures 1 and 2.
Referring now to Figure 2, details of examples of components shown in Figure 1 may be seen. Figure 2 illustrates an example protocol stack that may be implemented by a combination of hardware and software. Using the ATSC 3.0 protocol stack shown in Figure 2 and modified as appropriate for the broadcaster side, broadcasters can send hybrid service delivery in which one or more program elements are delivered via a computer network (referred to herein as “broadband” and “over-the-top” (OTT)) as well as via a wireless broadcast (referred to herein as “broadcast” and “over-the-air” (OTA)). Figure 2 also illustrates an example stack with hardware that may be embodied by a receiver.
Disclosing Figure 2 in terms of broadcaster equipment 10, one or more processors 200 accessing one or more computer storage media 202 such as any memories or storages described herein may be implemented to provide one or more software applications in a top-level application layer 204. The application layer 204 can include one or more software applications written in, e.g., HTML5/Javascript running in a runtime environment. Without limitation, the applications in the application stack 204 may include linear TV applications, interactive service applications, companion screen applications, personalization applications, emergency alert applications, and usage reporting applications. The applications typically are embodied in software that represents the elements that the viewer experiences, including video coding, audio coding and the run-time environment. As an example, an application may be provided that enables a user to control dialog, use alternate audio tracks, control audio parameters such as normalization and dynamic range, and so on.
Below the application layer 204 is a presentation layer 206. The presentation layer 206 includes, on the broadcast (OTA) side, broadcast audio-video playback devices referred to as Media Processing Units (MPU) 208 that, when implemented in a receiver, decode and playback, on one or more displays and speakers, wirelessly broadcast audio video content. The MPU 208 is configured to present International Organization for Standardization (ISO) base media file format (BMFF) data representations 210 and video in high efficiency video coding (HEVC) with audio in, e.g., Dolby audio compression (AC-4) format. ISO BMFF is a general file structure for time-based media files broken into “segments” and presentation metadata. Each of the files is essentially a collection of nested objects each with a type and a length. To facilitate decryption, the MPU 208 may access a broadcast side encrypted media extension (EME)/common encryption (CENC) module 212.
Figure 2 further illustrates that on the broadcast side the presentation layer 206 may include signaling modules, including either motion pictures expert group (MPEG) media transport protocol (MMTP) signaling module 214 or real-time object delivery over unidirectional transport (ROUTE) signaling module 216 for delivering non-real time (NRT) content 218 that is accessible to the application layer 204. NRT content may include but is not limited to stored replacement advertisements.
On the broadband (OTT or computer network) side, when implemented by a receiver the presentation layer 206 can include one or more dynamic adaptive streaming over hypertext transfer protocol (HTTP) (DASH) player/decoders 220 for decoding and playing audio-video content from the Internet. To this end the DASH player 220 may access a broadband side EME/CENC module 222. The DASH content may be provided as DASH segments 224 in ISO/BMFF format. To support various embodiments of the present invention, DASH segments encode various media content components types such as audio, video, or text. It is anticipated that there could be more than one video type - one encompassing the legacy main view - and new video type encompassing the surround video. Legacy players/decoders decode the legacy main view based legacy video type, while enhanced players pan and decode the additional video type that includes the surround video.
As was the case for the broadcast side, the broadband side of the presentation layer 206 may include NRT content in files 226 and may also include signaling objects 228 for providing play back signaling.
Below the presentation layer 206 in the protocol stack is a session layer 230. The session layer 230 includes, on the broadcast side, either MMTP protocol 232 or ROUTE protocol 234. Note that the ATSC standard provides an option to use MPEG MMT for transport, though it is not shown here.
On the broadband side the session layer 230 includes HTTP protocol 236 which may be implemented as HTTP-secure (HTTP(S)). The broadcast side of the session layer 230 also may employ a HTTP proxy module 238 and a service list table (SLT) 240. The SLT 240 includes a table of signaling information which is used to build a basic service listing and provide bootstrap discovery of the broadcast content. Media presentation descriptions (MPD) are included in the “ROUTE Signaling” tables delivered over user datagram protocol (UDP) by the ROUTE transport protocol.
A transport layer 242 is below the session layer 230 in the protocol stack for establishing low-latency and loss-tolerating connections. On the broadcast side the transport layer 242 uses (UDP 244 and on the broadband side transmission control protocol (TCP) 246.
The example non-limiting protocol stack shown in Figure 2 also includes a network layer 248 below the transport layer 242. The network layer 248 uses Internet protocol (IP) on both sides for IP packet communication, with multicast delivery being typical on the broadcast side and unicast being typical on the broadband side.
Below the network layer 248 is the physical layer 250 which includes broadcast transmission/receive equipment 252 and computer network interface(s) 254 for communicating on the respective physical media associated with the two sides. The physical layer 250 converts Internet Protocol (IP) packets to be suitable to be transported over the relevant medium and may add forward error correction functionality to enable error correction at the receiver as well as contain modulation and demodulation modules to incorporate modulation and demodulation functionalities. This converts bits into symbols for long distance transmission as well as to increase bandwidth efficiency. On the OTA side the physical layer 250 typically includes a wireless broadcast transmitter to broadcast data wirelessly using orthogonal frequency division multiplexing (OFDM) while on the OTT side the physical layer 250 includes computer transmission components to send data over the Internet.
A DASH Industry Forum (DASH-IF) profile sent through the various protocols (HTTP/TCP/IP) in the protocol stack may be used on the broadband side. Media files in the DASH-IF profile based on the ISO BMFF may be used as the delivery, media encapsulation and synchronization format for both broadcast and broadband delivery.
Each receiver 14 typically includes a protocol stack that is complementary to that of the broadcaster equipment.
A receiver 14 in Figure 1 may include, as shown in Figure 2, an Internet-enabled TV with an ATSC 3.0 TV tuner (equivalently, set top box controlling a TV) 256. The receiver 14 may be an Android®-based system. The receiver 14 alternatively may be implemented by a computerized Internet enabled (“smart”) telephone, a tablet computer, a notebook computer, a wearable computerized device, and so on. Regardless, it is to be understood that the receiver 14 and/or other computers described herein is configured to undertake present principles (e.g., communicate with other devices to undertake present principles, execute the logic described herein, and perform any other functions and/or operations described herein).
Accordingly, to undertake such principles the receiver 14 can be established by some or all of the components shown in Figure 1. For example, the receiver 14 can include one or more displays 258 that may be implemented by a high definition or ultra-high definition “4K” or higher flat screen and that may or may not be touch-enabled for receiving user input signals via touches on the display. The receiver 14 may also include one or more speakers 260 for outputting audio in accordance with present principles, and at least one additional input device 262 such as, e.g., an audio receiver/micr ophone for, e.g., entering audible commands to the receiver 14 to control the receiver 14. The example receiver 14 may further include one or more network interfaces 264 for communication over at least one network such as the Internet, a WAN, a LAN, a PAN etc. under control of one or more processors 266. Thus, the interface 264 may be, without limitation, a Wi-Fi transceiver, which is an example of a wireless computer network interface, such as but not limited to a mesh network transceiver. The interface 264 may be, without limitation, a Bluetooth® transceiver, Zigbee® transceiver, Infrared Data Association (IrDA) transceiver, Wireless USB transceiver, wired USB, wired LAN, Powerline or Multimedia over Coax Alliance (MoCA). It is to be understood that the processor 266 controls the receiver 14 to undertake present principles, including the other elements of the receiver 14 described herein such as, for instance, controlling the display 258 to present images thereon and receiving input therefrom. Furthermore, note the network interface 264 may be, e.g., a wired or wireless modem or router, or other appropriate interface such as, e.g., a wireless telephony transceiver, or Wi-Fi transceiver as mentioned above, etc.
In addition to the foregoing, the receiver 14 may also include one or more input ports 268 such as a high-definition multimedia interface (HDMI) port or a USB port to physically connect (using a wired connection) to another CE device and/or a headphone port to connect headphones to the receiver 14 for presentation of audio from the receiver 14 to a user through the headphones. For example, the input port 268 may be connected via wire or wirelessly to a cable or satellite source of audio video content. Thus, the source may be a separate or integrated set top box, or a satellite receiver. Or, the source may be a game console or disk player.
The receiver 14 may further include one or more computer memories 270 such as disk-based or solid-state storage that are not transitory signals, in some cases embodied in the chassis of the receiver as standalone devices or as a personal video recording device (PVR) or video disk player either internal or external to the chassis of the receiver for playing back audio video (AV) programs or as removable memory media. Also, in some embodiments, the receiver 14 can include a position or location receiver 272 such as but not limited to a cellphone receiver, global positioning satellite (GPS) receiver, and/or altimeter that is configured to e.g. receive geographic position information from at least one satellite or cellphone tower and provide the information to the processor 266 and/or determine an altitude at which the receiver 14 is disposed in conjunction with the processor 266. However, it is to be understood that that another suitable position receiver other than a cellphone receiver, GPS receiver and/or altimeter may be used in accordance with present principles to determine the location of the receiver 14 in e.g., all three dimensions.
Continuing the description of the receiver 14, in some embodiments the receiver 14 may include one or more cameras 274 that may include one or more of a thermal imaging camera, a digital camera such as a webcam, and/or a camera integrated into the receiver 14 and controllable by the processor 266 to gather pictures/images and/or video in accordance with present principles. Also included on the receiver 14 may be a Bluetooth® transceiver 276 or other Near Field Communication (NFC) element for communication with other devices using Bluetooth® and/or NFC technology, respectively. An example NFC element can be a radio frequency identification (RFID) element.
Further still, the receiver 14 may include one or more auxiliary sensors 278 (such as a motion sensor such as an accelerometer, gyroscope, cyclometer, or a magnetic sensor or inertial measurement unit (IMU) and combinations thereof), an infrared (IR) sensor for receiving IR commands from a remote control, an optical sensor, a speed and/or cadence sensor, a gesture sensor (for sensing gesture commands) and so on providing input to the processor 266. An IR sensor 280 may be provided to receive commands from a wireless remote control. A battery (not shown) may be provided for powering the receiver 14.
The companion device 16 may incorporate some or all of the elements shown in relation to the receiver 14 described above.
The methods described herein may be implemented as software instructions executed by a processor, suitably configured application specific integrated circuits (ASIC) or field programmable gate array (FPGA) modules, or any other convenient manner as would be appreciated by those skilled in those art. Where employed, the software instructions may be embodied in a non-transitory device such as a CD ROM or Flash drive. The software code instructions may alternatively be embodied in a transitory arrangement such as a radio or optical signal, or via a download over the Internet. Refer now to Figure 3. A live event 300 may be filmed for broadcast by an event camera 302 that transmits real, normal flat images via an OTA broadcast transmitter 304 to one or more receivers 306. Simultaneously, a movable one hundred eighty/three hundred sixty-degree (180°/360°) 4K or 8K or higher camera 308 films the event 300 in every direction and sends its output to the receiver 306 via an OTT source 310 such as an Internet server. The (180°/360°) 4K or 8K or higher camera 308 may provide a surround view of the event and may also provide a greater elevation view of the event than provided by the flat camera 302. In an example, the (180°/360°) 4K or 8K or higher camera 308 may be mounted on a drone to film the event 300. The (180°/360°) 4K or 8K or higher camera 308 may output 180°/360° video if desired. AC-4 audio for example may accompany the (180°/360°) 4K or 8K or higher video. The (180°/360°) 4K or 8K or higher camera 308 may capture the event 300 in a large angle format.
Figure 4 on the other hand illustrates an embodiment in which both outputs of the flat camera 302 and (180°/360°) 4K or 8K or higher camera 308 are sent via OTA broadcast 304 to the receiver 306. The two video streams produced by the cameras 302, 308 may be sent in a single physical layer pipe (PLP) or related PLPs and may be accompanied by signaling that both video streams are available, identifying each one for what it is, e.g., flat or (180°/360°) 4K or 8K.
In another embodiment the flat camera 302 may be omitted from Figure 4 and only the video stream from the (180°/360°) 4K or 8K or higher camera 308 may be sent OTA to receivers for processing as described below. In this embodiment, the stream can have a main view using the main video content type.
Figure 5 illustrates that a receiver assembly may include an OTA receiver 500, and OTT receiver 502, and a flat display 504 such as a TV. The receiver assembly may further include virtual reality (VR) headset 506 such as goggles and one or more devices for panning video such as 180°/360° video presented on the VR headset 506. These devices may include a point and click mechanism such as a joystick or buttons on a remote control (RC) 508 for the TV, a touchscreen on a smart phone 510, or a touchscreen or touchpad 512 on the headset 506. The pan devices may also include one or more motion sensors 514 in the headset 506 or RC 508. The headset may have a RC 508 in lieu of a touchscreen or touchpad 512 to accomplish panning. Combinations of the above maybe used for panning. Each of the components shown in Figure 5 may include appropriate components from Figure 2 including processors, storage devices, and network interfaces.
Figure 6 illustrates example logic that may be executed by one or more processors in the receiver assembly of Figure 5. Commencing at block 600, a flat video stream is received, e.g., OTA or derived from a single (180°/360°) 4K or 8K or higher video stream when no flat event camera 302 is provided. The (180°/360°) 4K or 8K or higher video stream can have a main view which is essentially the flat view stream. This flat stream may be presented on, e.g., the TV 504 in figure 5 at block 602.
Moving to block 604, the (180°/360°) 4K or 8K or higher video stream is received OTA or OTT as described above and any pan signals received at block 606. The (180°/360°) 4K or 8K or higher 180°/360° video may be presented at block 308 on the VR headset 506 shown in Figure 5.
Figure 7 illustrates that the flat stream of the event 300 in Figure 3 can be presented on the TV 504 while simultaneously a (180°/360°) 4K or 8K or higher 180°/360° version of the same event 300 can be presented on the headset 506 with a panning feature indicated by the arrow 700 to allow a wearer of the headset 506 to be virtually present at the event 300 and move his head or manipulate another panning device, e.g. RC 508, to “look around” the event as if he was actually there. Note that panning includes both azimuthal panning and elevational panning.
Figure 8 illustrates a first technique in which (180°/360°) 4K or 8K or higher video 800 is received OTA by the TV 504 for presentation as flat video on the TV. A viewer can pan the video on the TV 504 using any of the input devices described above, e.g., joystick or buttons on the RC 508, touchscreen on the smart phone 510, etc.
Figure 9 illustrates the (180°/360°) 4K or 8K or higher video 800 is received OTA by the TV 504 and rendered on the TV as flat to establish a backward compatible view without panning.
Figure 10 illustrates (180°/360°) 4K or 8K or higher video 800 may be received OTA by the TV 504 and rendered into flat for presentation on the TV, with the (180°/360°) 4K or 8K or higher video 800 being relayed from the TV to the headset 506, e.g., via Bluetooth or Wi-Fi, for presentation of 180°/360° on the headset that can pan the view as described.
Figure 11 illustrates (180°/360°) 4K or 8K or higher video 800 may be received OTA or OTT directly by the headset 506 for presentation of 180°/360° on the headset that can pan the view as described, without the need for the TV 504 to intervene to relay the 180°/360° video. It will be appreciated that whilst present principals have been described with reference to some example embodiments, these are not intended to be limiting, and that various alternative arrangements may be used to implement the subject matter claimed herein.

Claims

WHAT IS CLAIMED IS:
1. In digital television in which at least one receiver can receive broadcast signals, a method, comprising: receiving at least a first broadcast video signal; presenting content in the first broadcast video signal on at least one flat display; presenting 180°/360° video of the content on at least one display; and responsive to at least one command, panning the 180°/360° video on the at least one display.
2. The method of Claim 1, wherein the at least one display comprises a virtual reality (VR) headset.
3. The method of Claim 1, wherein the at least one display comprises the flat display.
4. The method of Claim 1, wherein the 180°/360° video is received in the first broadcast video stream, the first broadcast stream comprising no other video of the content, and the panning comprises panning video on the flat display.
5. The method of Claim 1, wherein the 180°/360° video is received in the first broadcast video stream, the first broadcast stream comprising no other video of the content, and
22 the 180°/360° video is presented as flat video on the flat display as a backwardly-compatible content view.
6. The method of Claim 2, wherein the 180°/360° video is received in the first broadcast video stream and content therein is presented on the flat display, and the method comprises sending the content from the flat display to the VR headset.
7. The method of Claim 2, wherein the 180°/360° video is received in the first broadcast video stream and content therein is presented on the flat display, and the method comprises sending the content to the VR headset without passing through the flat display.
8. The method of Claim 1, wherein the 180°/360° video is received in the first broadcast video stream along with a flat feed of content in the 180°/360° video.
9. The method of Claim 1, wherein the 180°/360° video is received from an over-the-top (OTT) source.
10. The method of Claim 1, wherein the digital television receiver comprises an advanced television systems committee (ATSC) 3.0 receiver.
11. An apparatus comprising: at least one receiver assembly configured to: receive at least a first broadcast video signal; present content in the first broadcast video signal on at least one flat display; present 180°/360° video of the content on at least one display; and responsive to at least one command, pan the 180°/360° video on the at least one display.
12. The apparatus of Claim 11, wherein the at least one display comprises a virtual reality (VR) headset.
13. The apparatus of Claim 11, wherein the at least one display comprises the flat display.
14. The apparatus of Claim 11, wherein the 180°/360° video is received in the first broadcast video stream, the first broadcast stream comprising no other video of the content, and the pan comprises panning video on the flat display.
15. The apparatus of Claim 11, wherein the 180°/360° video is received in the first broadcast video stream, the first broadcast stream comprising no other video of the content, and the 180°/360° video is presented as flat video on the flat display as a backwardly-compatible content view.
16. The apparatus of Claim 12, wherein the 180°/360° video is received in the first broadcast video stream and content therein is presented on the flat display, and the instructions are executable to send the content from the flat display to the VR headset.
17. The apparatus of Claim 12, wherein the 180°/360° video is received in the first broadcast video stream and content therein is presented on the flat display, and the instructions are executable to send the content to the VR headset without passing through the flat display.
18. The apparatus of Claim 11, wherein the 180°/360° video is received in the first broadcast video stream along with a flat feed of content in the 180°/360° video.
19. The apparatus of Claim 11, wherein the 180°/360° video is received from an over-the-top (OTT) source.
20. A digital television apparatus comprising: at least one receiver comprising at least one processor programmed with instructions to configure the processor to: present a flat video of an event on a main display; and present a one hundred eighty or three hundred sixty-degree (180° or 360°) video of the event on a virtual reality (VR) headset.
25
PCT/IB2022/057318 2021-08-06 2022-08-05 Telepresence through vr broadcast streams WO2023012745A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP22773025.6A EP4360326A1 (en) 2021-08-06 2022-08-05 Telepresence through vr broadcast streams
JP2024506971A JP2024532717A (en) 2021-08-06 2022-08-05 Telepresence through VR broadcast stream
CN202280010265.2A CN116762347A (en) 2021-08-06 2022-08-05 Remote presentation of broadcast streams through VR
KR1020237030496A KR20230145115A (en) 2021-08-06 2022-08-05 Telepresence through VR broadcast streams

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US202163260023P 2021-08-06 2021-08-06
US63/260,023 2021-08-06
US17/569,969 US11818414B2 (en) 2021-08-06 2022-01-06 Telepresence through OTA VR broadcast streams
US17/569,969 2022-01-06

Publications (1)

Publication Number Publication Date
WO2023012745A1 true WO2023012745A1 (en) 2023-02-09

Family

ID=83361225

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2022/057318 WO2023012745A1 (en) 2021-08-06 2022-08-05 Telepresence through vr broadcast streams

Country Status (1)

Country Link
WO (1) WO2023012745A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180316948A1 (en) * 2012-04-24 2018-11-01 Skreens Entertainment Technologies, Inc. Video processing systems, methods and a user profile for describing the combination and display of heterogeneous sources
US20200245017A1 (en) * 2018-12-21 2020-07-30 Streamlayer Inc. Method and System for Providing Interactive Content Delivery and Audience Engagement
WO2020264088A1 (en) * 2019-06-25 2020-12-30 Apple Inc. Immersive teleconferencing and telepresence
US20200412565A1 (en) * 2019-06-30 2020-12-31 Charter Communications Operating, Llc Wireless enabled distributed data apparatus and methods
US20210243418A1 (en) * 2018-04-27 2021-08-05 Pcms Holdings, Inc. 360 degree multi-viewport system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180316948A1 (en) * 2012-04-24 2018-11-01 Skreens Entertainment Technologies, Inc. Video processing systems, methods and a user profile for describing the combination and display of heterogeneous sources
US20210243418A1 (en) * 2018-04-27 2021-08-05 Pcms Holdings, Inc. 360 degree multi-viewport system
US20200245017A1 (en) * 2018-12-21 2020-07-30 Streamlayer Inc. Method and System for Providing Interactive Content Delivery and Audience Engagement
WO2020264088A1 (en) * 2019-06-25 2020-12-30 Apple Inc. Immersive teleconferencing and telepresence
US20200412565A1 (en) * 2019-06-30 2020-12-31 Charter Communications Operating, Llc Wireless enabled distributed data apparatus and methods

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Live VR end-to-end workflows: real-life deployments and advances in VR and network technology", no. Virtual, 31 March 2021 (2021-03-31), XP051991692, Retrieved from the Internet <URL:https://ftp.3gpp.org/tsg_sa/WG4_CODEC/TSGS4_113-e/Docs/S4-210408_vrif2020.115.01.pdf> [retrieved on 20210331] *

Similar Documents

Publication Publication Date Title
EP2628306B1 (en) Streaming digital video between video devices using a cable television system
EP2884751A1 (en) A multimedia platform for generating and streaming content based on data provided by capturing devices corresponding to multiple viewpoints including subjective viewpoints
US11350138B2 (en) Managing a multi-view event comprising several streams, stream buffers, and rendering onto a single canvas
KR20220116162A (en) Techniques for Alternative Content Signaling in ATSC 3.0 TV
KR20240093629A (en) ATSC boundary conditions transitioning to the Internet
US11818414B2 (en) Telepresence through OTA VR broadcast streams
US11451854B2 (en) Dongle to convert formats to ATSC 3.0 low power wireless
US11363310B2 (en) ATSC 3.0 hospitality TV system
WO2023012745A1 (en) Telepresence through vr broadcast streams
US11553230B2 (en) Platform-independent USB driver communicating I2C commands to USB dongle through JAVA application
US11606528B2 (en) Advanced television systems committee (ATSC) 3.0 latency-free display of content attribute
US11695488B2 (en) ATSC over-the-air (OTA) broadcast of public volumetric augmented reality (AR)
US11611792B2 (en) ATSC 3 reception across boundary conditions using location data
US11812083B2 (en) Techniques for pushing personalized storefront ads using digital TV
US11895345B2 (en) Obfuscating replaceable content in advanced television systems committee (ATSC) 3.0 system
US11153640B2 (en) Memory management of replacement content in digital TV system
US20210329348A1 (en) Memory management in advanced television systems committee (atsc) 3.0 system
WO2021116839A1 (en) Advanced television systems committee (atsc) 3.0 latency-free display of content attribute
US20230179835A1 (en) Personalized emergency alert
US20240281164A1 (en) Method for signaling memory requirements in atsc3.0 when out-of-order data is being utilized
WO2023242663A1 (en) Obfuscating replaceable content in advanced television systems committee (atsc) 3.0 system
WO2023012750A1 (en) Improving atsc 3 reception across boundary conditions using location data
CN118339839A (en) Blurring alternative content in Advanced Television Systems Committee (ATSC) 3.0 system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22773025

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 202280010265.2

Country of ref document: CN

ENP Entry into the national phase

Ref document number: 20237030496

Country of ref document: KR

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 1020237030496

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 2022773025

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2024506971

Country of ref document: JP

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2022773025

Country of ref document: EP

Effective date: 20240125

NENP Non-entry into the national phase

Ref country code: DE