WO2018004936A1 - Apparatus and method for providing and displaying content - Google Patents

Apparatus and method for providing and displaying content

Info

Publication number
WO2018004936A1
Authority
WO
WIPO (PCT)
Prior art keywords
bit rate
content
content item
high bit rate version
Prior art date
Application number
PCT/US2017/035060
Other languages
English (en)
Inventor
Dennis D. CASTLEMAN
Original Assignee
Sony Interactive Entertainment Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US 15/280,933 (US11089280B2)
Application filed by Sony Interactive Entertainment Inc. filed Critical Sony Interactive Entertainment Inc.
Priority to JP2018568224A (JP6859372B2)
Priority to KR1020197003058A (KR20190022851A)
Priority to EP17820807.0A (EP3479574A4)
Priority to CN201780039760.5A (CN109417624B)
Priority to KR1020207037655A (KR102294098B1)
Publication of WO2018004936A1


Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N 21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N 21/845 Structuring of content, e.g. decomposing content into time segments
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N 21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N 21/234 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N 21/2343 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N 21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N 21/442 Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N 21/44213 Monitoring of end-user related data
    • H04N 21/44218 Detecting physical presence or behaviour of the user, e.g. using sensors to detect if the user is leaving the room or changes his face expression during a TV program
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N 21/47 End-user applications
    • H04N 21/472 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N 21/4728 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for selecting a Region Of Interest [ROI], e.g. for requesting a higher resolution version of a selected region
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N 21/65 Transmission of management data between client and server
    • H04N 21/658 Transmission by the client directed to the server
    • H04N 21/6587 Control parameters, e.g. trick play commands, viewpoint selection
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N 21/81 Monomedia components thereof
    • H04N 21/816 Monomedia components thereof involving special video data, e.g. 3D video
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N 23/60 Control of cameras or camera modules
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N 23/90 Arrangement of cameras or camera modules, e.g. multiple cameras in TV studios or sports stadiums
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N 23/60 Control of cameras or camera modules
    • H04N 23/698 Control of cameras or camera modules for achieving an enlarged field of view, e.g. panoramic image capture
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 5/00 Details of television systems
    • H04N 5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N 5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • H04N 5/265 Mixing

Definitions

  • the present invention relates generally to video processing and display.
  • Video streaming is increasingly becoming one of the main ways that media contents are delivered and accessed. Video streaming traffic also accounts for a large portion of Internet bandwidth consumption.
  • One embodiment provides a method for displaying content, comprising:
  • determining a focal area of a viewer of a content item displayed on a display device, retrieving a low bit rate version of the content item, retrieving a portion of a high bit rate version of the content item corresponding to the focal area, combining the portion of the high bit rate version of the content item with the low bit rate version of the content item to generate a combined image, and causing the combined image to be displayed to the viewer via the display device.
  • Another embodiment provides a system for displaying content, comprising: a display device, a sensor device, and a processor coupled to the display device and the sensor device.
  • the processor is configured to: determine, with the sensor device, a focal area of a viewer of a content item displayed on the display device, retrieve a low bit rate version of the content item, retrieve a portion of a high bit rate version of the content item corresponding to the focal area, combine the portion of the high bit rate version of the content item with the low bit rate version of the content item to generate a combined image, and cause the combined image to be displayed to the viewer via the display device.
  • Another embodiment provides a non-transitory computer readable storage medium storing one or more computer programs configured to cause a processor based system to execute steps comprising: determining a focal area of a viewer of a content item displayed on a display device, retrieving a low bit rate version of the content item, retrieving a portion of a high bit rate version of the content item corresponding to the focal area, combining the portion of the high bit rate version of the content with the low bit rate version of the content item to generate a combined image; and causing the combined image to be displayed to the viewer via the display device.
  • Another embodiment provides a method for providing content, comprising: receiving a content item, generating a low bit rate version of the content item, receiving a content request from a playback device, the content request comprising an indication of a viewer focal area, selecting a portion of a high bit rate version of the content item based on the viewer focal area, and providing the low bit rate version of the content item and the portion of the high bit rate version of the content item to the playback device in response to the content request.
  • Another embodiment provides a system for providing content comprising: a memory device, a communication device; and a processor coupled to the memory device and the communication device.
  • the processor is configured to: receive a content item, generate a low bit rate version of the content item, store a high bit rate version of the content item and the low bit rate version of the content item on the memory device, receive, via the communication device, a content request from a playback device, the content request comprising an indication of a viewer focal area, select a portion of the high bit rate version of the content item based on the viewer focal area, and provide the low bit rate version of the content item and the portion of the high bit rate version of the content item to the playback device in response to the content request.
  • FIG. 1 is a process diagram illustrating a process for providing content in accordance with some embodiments of the present invention.
  • FIG. 2 is a flow diagram illustrating a method for providing content in accordance with some embodiments of the present invention.
  • FIG. 3 is a flow diagram illustrating a method for displaying content in accordance with some embodiments of the present invention.
  • FIGS. 4A and 4B are illustrations of a content display area in accordance with some embodiments of the present invention.
  • FIG. 5 is an illustration of image blending in accordance with some embodiments of the present invention.
  • FIGS. 6A and 6B are illustrations of image cells in accordance with some embodiments.
  • FIGS. 7A and 7B are illustrations of focal areas in accordance with some embodiments.
  • FIG. 8 is a block diagram illustrating a system in accordance with some embodiments of the present invention.
  • Digital video content may be stored and transmitted in a variety of formats. Factors such as the video's resolution, frame rate, coding format, compression scheme, and compression factor can affect the total size and bit rate of the video file.
  • bit rate generally refers to the number of bits used per unit of playback time to represent a continuous medium such as audio or video.
  • the encoding bit rate of a multimedia file may refer to the size of a multimedia file divided by the playback time of the recording (e.g. in seconds).
  • the bit rate of a video content file affects whether the video can be streamed without interruptions under network bandwidth constraints between a streaming server and a playback device.
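  • As a worked illustration of the definition above (a minimal sketch in Python; the function name and figures are illustrative, not from the disclosure), the encoding bit rate of a 45 MB file with a 90-second playback time works out to 4 Mbps:

        def average_bit_rate(file_size_bytes: int, duration_seconds: float) -> float:
            """Encoding bit rate: total bits divided by playback time (bits per second)."""
            return file_size_bytes * 8 / duration_seconds

        print(average_bit_rate(45_000_000, 90.0))  # 4000000.0 bits per second, i.e. 4 Mbps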
  • In step 111, video content is captured by a camera system.
  • the camera system may comprise one or more of a conventional camera system, a stereoscopic camera system, a panoramic camera system, a surround view camera system, a 360-degree camera system, an omnidirectional camera system, and the like.
  • In step 112, the captured video is encoded and transmitted to a server.
  • the encoding performed in step 112 may comprise lossy or lossless video encoding.
  • the video may comprise live-streamed or prerecorded video content.
  • the camera may communicate with the server via wireless or wired means by way of a network, such as for example the Internet.
  • the camera performing steps 111 and 112 may comprise a segmented video capture device such as those described in United States Provisional Patent Application No. 62/357,259, filed on June 30, 2016, entitled "APPARATUS AND METHOD FOR CAPTURING AND DISPLAYING SEGMENTED CONTENT", the entire disclosure of which is hereby fully incorporated by reference herein in its entirety.
  • each captured video stream may be provided as a separate video stream to the server or may be combined into a single video stream prior to step 112.
  • the server decodes the video content received from the camera.
  • the decoded video may comprise a video in the originally captured resolution, frame rate, and/or bit rate.
  • In step 122, the server reduces the bit rate of the decoded video stream.
  • the bit rate of the video content may be reduced by one or more of: reducing the resolution of the video, reducing the frame rate of the video, and compressing the video with a compression algorithm.
  • In step 123, the reduced bit rate video is encoded and prepared for streaming to a playback device.
  • steps 122 and 123 may comprise a single step. For example, an encoding algorithm may be used to reduce the bit rate of the received content, as sketched below.
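  • Combining the bit rate reduction and encoding of steps 122 and 123 into a single encode pass could look like the following sketch, assuming the ffmpeg command-line tool is available; the file names and target parameters are illustrative only:

        import subprocess

        def reduce_bit_rate(src: str, dst: str, width: int, height: int,
                            fps: int, video_bitrate: str) -> None:
            """Reduce resolution, frame rate, and target bit rate in one encode pass."""
            subprocess.run([
                "ffmpeg", "-y", "-i", src,
                "-vf", f"scale={width}:{height}",  # reduce the resolution
                "-r", str(fps),                    # reduce the frame rate
                "-b:v", video_bitrate,             # target video bit rate
                dst,
            ], check=True)

        # e.g. reduce_bit_rate("received.mp4", "low_bit_rate.mp4", 1280, 720, 24, "1500k")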
  • In step 125, one or more portions of the received video are extracted.
  • a portion of a content item may generally refer to a spatial section of the video content display area.
  • a portion of the content may comprise an area of the content display area spanning one or more frames.
  • the extraction in step 125 may be performed by partially decoding the received content.
  • step 125 may be performed in response to receiving a viewer focal area from a playback device and the extracted portion may correspond to the location of the viewer's focal area in the content.
  • step 125 may be performed on the content preliminarily and one or more portions may be extracted and stored for later retrieval by playback devices.
  • the extracted portion is encoded and prepared for streaming to the playback device.
  • high and low bit rates are relative terms referring to the relative bit rates of the at least two versions of a video content item provided from the server to a playback device.
  • the server may generate at least one low bit rate version of the received video and extract at least a portion of a version of the content item having a higher bit rate as compared to the low bit rate version.
  • multiple versions of a video content item having different bit rates may be created by the server.
  • bit rate reduction may also be performed on the received video prior to extracting portions of the content in step 125 and/or performed on the portion extracted in step 125.
  • a high bit rate version of the content item has a higher average bit rate than the low bit rate version of the content item over the duration of the video content.
  • the bit rate of the high bit rate version of the content item may be higher than that of the low bit rate version of the content item for some or all temporal segments of the video content.
  • the video stream containing the extracted portion of the high bit rate version of the content item may have a lower bit rate as compared to the video stream comprising the low bit rate version of the content item.
  • the portion of the high bit rate version of the content item may cover a significantly smaller display area of the content as compared to the low bit rate version, resulting in the lower bit rate of the extracted portion.
  • the low bit rate version of the content item may comprise a lower resolution, frame rate, and/or compression quality as compared to the high bit rate version of the content item. In some embodiments, the low bit rate version of the content item may comprise a lower video quality and/or definition as compared to the high bit rate version of the content item. In some embodiments, the low and high bit rate versions of the content may comprise constant bit rate (CBR) or variable bit rate (VBR) video streams.
  • the server may communicate with the playback device by way of a network, such as for example the Internet.
  • the playback device receives and decodes the low bit rate version of the video content and a portion of the high bit rate version of the video content.
  • the portion of the high bit rate version of the video content may be selected based on the focal area of a viewer viewing the content via the playback device.
  • the focal area of a viewer refers to an area of the viewer's field of vision that is or is likely to be in focus while the viewer views the content.
  • the focal area may correspond to one or more of the central, paracentral, macular, near peripheral, and mid peripheral areas of the viewer's field of vision.
  • the focal area of the viewer may be detected by a sensor device coupled to the playback device.
  • Inertial Measurement Unit (IMU) data recorded by a capture device of the content item may be compared to the viewer's eye and/or head direction to determine the portion of the high bit rate video content to extract for the playback device.
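  • One way such a comparison could locate the portion to extract for equirectangular (360-degree) content is sketched below; the linear yaw/pitch-to-pixel mapping and the frame dimensions are assumptions for illustration, not taken from the disclosure:

        def focal_point_pixels(yaw_deg: float, pitch_deg: float,
                               frame_w: int, frame_h: int) -> tuple:
            """Map a viewing direction (relative to the capture device's IMU
            reference) to a pixel location in an equirectangular frame."""
            x = int((yaw_deg % 360.0) / 360.0 * frame_w) % frame_w
            y = int((90.0 - pitch_deg) / 180.0 * frame_h)  # pitch +90 (up) to -90 (down)
            return x, min(max(y, 0), frame_h - 1)

        # e.g. focal_point_pixels(viewer_yaw - capture_yaw, viewer_pitch, 3840, 1920)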
  • the low bit rate version of the video content and the portion of the high bit rate version of the video content may be transmitted as separate video streams from the server to the playback device.
  • In step 132, the low bit rate version of the video content and the portion of the high bit rate version of the video content are combined.
  • combining the video streams comprises combining the low bit rate version of the content item with the portion of the high bit rate version at the location of the content display area from which the high bit rate portion was extracted.
  • step 132 comprises blending the two video streams by including a transition area between the high and low bit rate areas of the image to reduce the noticeability of the border between the two versions of the video content. In some embodiments, step 132 further comprises scaling the low bit rate version of the video content to the resolution and/or frame rate of the high bit rate version of the content prior to combining the images.
  • the combined image is displayed to the viewer.
  • the combined image may be displayed via one or more of a flat screen display, a curved display, a dome display device, a head-mounted display device, an augmented reality display device, a virtual reality display device, and the like.
  • the combined image may be viewed by a head mounted display such as the systems and devices described in United States Patent Application No. 15/085,887, filed on March 30, 2016, entitled “Head-Mounted Display Tracking," the entire disclosure of which is hereby fully incorporated by reference herein in its entirety.
  • the high bit rate portion of the video content may be combined with the low bit rate version of the content at the server and encoded as a single video stream for transmission. While the resolution and the frame rate of such video streams may not be reduced as compared to a full high bit rate version, the overall size of the transmitted video stream may still be reduced by processing the area of the content outside of the focal area with a more lossy video compression algorithm before recombining the images.
  • the portion of the content item corresponding to the user's focal area is provided in a relatively high bit rate and the remaining area of the content is provided in a relatively low bit rate.
  • the network bandwidth demand for achieving interruption-free video streaming may be reduced by decreasing the overall bit rate of the streaming video content while maintaining the video quality in the focal area of the viewer's field of vision.
  • Referring to FIG. 2, a method for providing content is shown.
  • the steps in FIG. 2 may generally be performed by a processor-based device such as one or more of a computer system, a server, a cloud-based server, a content host, a streaming service host, a media server, and the like.
  • the steps in FIG. 2 may be performed by one or more of the content server 810 and the playback device 820 described with reference to FIG. 8, the server described with reference to FIG. 1, and/or other similar devices.
  • In step 210, the system receives a content item.
  • the content item may comprise one or more of a movie, a TV show, a video clip, prerecorded video content, streaming video content, live-streamed video content, and the like.
  • the video content may comprise a single video stream or a plurality of video streams captured by one or more of a stereoscopic camera system, a panoramic camera system, a surround view camera system, a 360-degree camera system, an omnidirectional camera system, and the like.
  • the content item may be encoded via any encoding scheme such as MPEG, WMV, VP8, and the like.
  • the system may further be configured to decode the received content item according to various encoding schemes in step 210.
  • In step 220, the system generates a low bit rate version of the content item.
  • the bit rate of the received content may be reduced by one or more of: reducing the resolution of the video, reducing the frame rate of the video, and compressing the video with a lossy compression algorithm.
  • a lossy compression generally means that the compressed video lacks some information present in the original video.
  • multiple low bit rate versions of the content item may be generated in step 220 and stored for retrieval by playback devices.
  • In step 230, the system receives a content request.
  • the content request may be received from a playback device such as a game console, a personal computer, a tablet computer, a television, a head mounted display ("HMD"), an augmented reality device, a virtual reality device, a wearable device, a portable user device, a smartphone, etc.
  • the content request may identify one or more of the content item being requested, the requested temporal segment, an indication of the viewer's focal point and/or area, and/or authentication information.
  • the content request may be similar to a conventional streaming content request.
  • the content request may comprise an indication of the viewer's focal area which may correspond to a point or an area in the content display area.
  • the indication of the viewer's focal area may comprise a coordinate or a set of coordinates within the dimension of a frame of the content.
  • the indication of the viewer's focal area may be represented by a viewing angle.
  • the focal area may be determined based on a sensor device associated with the playback device comprising one or more of an eye tracking sensor and a head tracking sensor.
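  • A content request carrying such a focal area indication might be structured as in the sketch below; the field names are hypothetical, chosen only to mirror the items listed above:

        from dataclasses import dataclass
        from typing import Optional, Tuple

        @dataclass
        class ContentRequest:
            content_id: str                                    # content item being requested
            segment_start: float                               # requested temporal segment (seconds)
            segment_end: float
            focal_point: Optional[Tuple[float, float]] = None  # (x, y) in frame coordinates
            view_angles: Optional[Tuple[float, float]] = None  # viewing angle (azimuth, polar) in degrees
            auth_token: Optional[str] = None                   # authentication information

        # req = ContentRequest("item-42", 30.0, 32.0, focal_point=(1024.0, 512.0))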
  • In step 240, the low bit rate version of the content item is provided to the playback device in response to the content request received in step 230.
  • multiple low bit rate versions of the content item may be generated in step 220.
  • the system may select from among the multiple low bit rate versions of the content item based on one or more of: the current or estimated network throughput between the playback device and the server, the available bandwidth at the server and/or the playback device, the requested video quality specified in the content request, the playback device's processing capacity, user settings, etc.
  • the selection of the low bit rate version of the content item from a plurality of versions may be similar to conventional adaptive bit rate streaming methods.
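  • In the spirit of conventional adaptive bit rate streaming, that selection could be sketched as follows (the safety factor and bit rate ladder are illustrative assumptions):

        def select_version(available_bitrates: list, estimated_throughput_bps: float,
                           safety_factor: float = 0.8) -> int:
            """Pick the highest bit rate version that fits within the estimated
            network throughput, with headroom; fall back to the lowest version."""
            budget = estimated_throughput_bps * safety_factor
            candidates = [b for b in sorted(available_bitrates) if b <= budget]
            return candidates[-1] if candidates else min(available_bitrates)

        # select_version([500_000, 1_500_000, 4_000_000], 2_000_000) returns 1_500_000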
  • In step 250, the system selects a portion of the high bit rate version of the content item based on the content request.
  • the high bit rate version of a content item generally refers to a version of the content with a higher bit rate as compared to the low bit rate version provided in step 240.
  • the high bit rate version of the content item may comprise a higher average bit rate than the low bit rate version of the content over the duration of the video content.
  • the bit rate of the high bit rate version of the content item may be higher than that of the low bit rate version of the content.
  • the high bit rate version of the content may comprise the original content received in step 210.
  • the high bit rate version of the content item may also comprise a reduced bit rate version of the originally received content item.
  • the portion of the content selected in step 250 may be selected based on the viewer's focal area comprising one or more of a detected focal point and a predicted future focal point.
  • the predicted future focal point may be predicted by the server and/or the playback device.
  • the future focal point may be predicted based on one or more of the viewer's gaze path history, a gaze path profile associated with the viewer, gaze path data collected from a plurality of viewers, and a content provider provided standard gaze path. Examples of predicting the viewer's future focal point are described in United States Patent Application No. 15/280,962, filed on September 29, 2016, entitled "APPARATUS AND METHOD FOR GAZE TRACKING", by inventor Dennis D. Castleman, and identified by Attorney Docket No. 138627, the entire disclosure of which is hereby fully incorporated by reference herein in its entirety.
  • a portion of the content may generally refer to a spatial portion of the display content area such as a set of pixels within a frame. In some embodiments, a portion may comprise the same part of the display content area spanning a plurality of frames. In some embodiments, the portion selected in step 250 may generally correspond to the location of a viewer's focal area in the content display area. In some embodiments, the displayed area of the content may be divided into a plurality of sections. For example, the displayed area of the content may be divided into quadrants, 3x3 grids, 5x5 grids, etc. In some embodiments, one or more sections of the content display area that overlap the focal area of the viewer may be selected to comprise the portion of the high bit rate version of the content item provided to the playback device. In some embodiments, the focal area and/or the extracted portion of the content may comprise any shape and size. Examples of focal areas and portions extracted from content items are described in more detail with reference to FIGS. 4A-4B and FIGS. 7A-7B herein.
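  • The grid-based selection described above could be sketched as follows, assuming a rectangular focal box centered on the focal point (focal areas of other shapes would need a different overlap test):

        def overlapping_sections(grid_rows: int, grid_cols: int,
                                 frame_w: int, frame_h: int,
                                 focal_x: float, focal_y: float,
                                 focal_w: float, focal_h: float) -> list:
            """Return the (row, col) index of every grid section the focal box touches."""
            cell_w, cell_h = frame_w / grid_cols, frame_h / grid_rows
            left = max(int((focal_x - focal_w / 2) // cell_w), 0)
            right = min(int((focal_x + focal_w / 2) // cell_w), grid_cols - 1)
            top = max(int((focal_y - focal_h / 2) // cell_h), 0)
            bottom = min(int((focal_y + focal_h / 2) // cell_h), grid_rows - 1)
            return [(r, c) for r in range(top, bottom + 1)
                           for c in range(left, right + 1)]

        # 3x3 grid on a 1920x1080 frame, focal box centered mid-frame:
        # overlapping_sections(3, 3, 1920, 1080, 960, 540, 400, 300) returns [(1, 1)]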
  • the system may further select from a plurality of original and/or reduced bit rate versions of the content to extract the selected portion based on one or more of: the current or estimated network throughput between the playback device and the server, the available bandwidth at the server and/or the playback device, a requested video quality specified in the content request, the playback device's processing capacity, and user settings.
  • the portion of the high bit rate version may be extracted from one of the reduced bit rate versions generated in step 220.
  • the high bit rate version of the content item may generally be selected from versions of the content item with higher bit rate as compared to the low bit rate version of the content item selected in step 240.
  • the system may be configured to provide two or more portions of the high bit rate version of the content item in step 270.
  • the system and/or the playback device may predict two or more likely future focal areas of the viewer.
  • the system may then select two or more portions of the high bit rate version of the content item based on the two or more likely future focal areas of the viewer in step 250.
  • the playback device may be configured to select from among the provided portions shortly before playback based on the detected focal area.
  • the system determines whether the selected portion has been previously cached in the system. In some embodiments, when a portion of the high bit rate version of the content is extracted, the system may cache the portion for later use. In some embodiments, the system may preliminarily generate a plurality of extracted portions of the high bit rate version of the content item based on predicting the locations that viewers are likely to focus on in the displayed content. For example, preliminarily extracted portions may correspond to high activity areas and/or foreground areas of the displayed content. In some embodiments, the cached portions may each comprise an encoded video stream. In some embodiments, the system may be configured to automatically purge extracted portions that have not been used for a set period of time (e.g. hours, days, etc.). In some embodiments, each cached portion of the high bit rate version may be identified and retrieved with an area identifier and a time stamp identifier (e.g. section 3B, time 00:30:20-00:30:22). In some embodiments, portions of the high bit rate version of the content may be stored in an encoded form in the cache and be made directly available for streaming to playback devices. If the selected portion has been previously cached, the system may provide the cached portion to the playback device in step 270.
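  • A cache of extracted portions keyed by area identifier and time stamp, with purging of stale entries, could be sketched as follows; the key format mirrors the "section 3B, time 00:30:20-00:30:22" example above, and the purge age is an illustrative assumption:

        import time

        class PortionCache:
            """Cache encoded high bit rate portions keyed by (area_id, time_range)."""

            def __init__(self, max_age_seconds: float = 24 * 3600):
                self.max_age = max_age_seconds
                self._entries = {}  # (area_id, time_range) -> (stream, last_used)

            def put(self, area_id: str, time_range: str, stream: bytes) -> None:
                self._entries[(area_id, time_range)] = (stream, time.monotonic())

            def get(self, area_id: str, time_range: str):
                entry = self._entries.get((area_id, time_range))
                if entry is None:
                    return None
                stream, _ = entry
                self.put(area_id, time_range, stream)  # refresh the last-used time
                return stream

            def purge(self) -> None:
                """Drop portions that have not been used for max_age seconds."""
                now = time.monotonic()
                self._entries = {k: v for k, v in self._entries.items()
                                 if now - v[1] < self.max_age}

        # cache.put("3B", "00:30:20-00:30:22", encoded_stream_bytes)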
  • If the selected portion has not been previously cached, the system extracts a portion of the high bit rate version of the content in step 280.
  • the portion may be extracted from the content received in step 210.
  • the portion may be extracted from one of the reduced bit rate versions of the originally received content.
  • the portion may be extracted by first decoding the received content.
  • the system may be configured to partially decode and extract a portion of the content from an encoded version of the content item.
  • step 280 may further comprise processing the extracted portion to include a plurality of empty/transparent pixels or cells around the edge of the extracted portion. The density of the empty/transparent pixels or cells may increase toward the outer edge of the extracted portion to soften the transition between the high and low bit rate areas.
  • step 280 may further comprise separately encoding the extracted portion for streaming.
  • the encoded portion of the high bit rate version of the content item may then be provided to the playback device in step 270.
  • the portion of the high bit rate version of the content item may be provided in a plurality of encoded video streams each corresponding to a predefined area (e.g. a cell in a grid) of the content display area.
  • steps 270 and 240 may occur at substantially the same time to provide corresponding temporal segments of the same content item to the playback device.
  • the low bit rate version of the content may be provided and buffered at the playback device prior to the corresponding high bit rate portion of the content item being provided in step 270.
  • the portion of the high bit rate version of the content item and the low bit rate version of the content item may be provided as two separately encoded and transmitted video streams.
  • portions of the high bit rate version of the content item and the low bit rate version of the content item may be provided from different parts of a server system.
  • a central server may be configured to stream low bit rate versions of content items to playback devices while a plurality of geographically dispersed server devices may be configured to extract and/or provide portions of the high bit rate versions of the same content item to nearby playback devices.
  • steps 210 through 270 may be repeated for multiple content items.
  • steps 250-270 may be repeated periodically as a viewer views a content item at the playback device.
  • the playback device may periodically (e.g. every few milliseconds, seconds, frames, etc.) update the focal area of the viewer at the server, and the system may select a different portion of the high bit rate version of the content item based on the updated focal area of the viewer.
  • the playback device may be configured to detect a change in the focal area and only notify the server when the location of the focal area changes. In some embodiments, if no focal area is detected (e.g. when the viewer is not looking at the display), the system may skip steps 250-270 and only provide the low bit rate version of the content item to the playback device.
  • the system may further select the lowest bit rate version of the content item to provide to the playback device in step 240 to reduce network bandwidth usage.
  • the system may adjust the bit rate of the low and/or high bit rate versions of the content provided to reduce interruptions.
  • Referring to FIG. 3, a method for displaying content is shown.
  • the steps in FIG. 3 may generally be performed by a processor-based device such as one or more of a game console, a personal computer, a tablet computer, a television, a head mounted display ("HMD"), an augmented reality device, a virtual reality device, a wearable device, a portable user device, a smartphone, a mobile device, and the like.
  • the steps in FIG. 3 may be performed by one or more of the content server 810 and the playback device 820 described with reference to FIG. 8, the playback device described with reference to FIG. 1, or other similar devices.
  • In step 310, the system determines a focal area of a viewer.
  • the focal area may be determined based on a sensor device comprising one or more of an eye tracking sensor and a head tracking sensor.
  • the head direction of the user may be determined by a head tracker device comprising one or more of an Inertial Measurement Unit (IMU), an accelerometer, a gyroscope, an image sensor, and a range sensor.
  • an IMU may comprise an electronic device that measures and reports a body's specific force, angular rate, and/or magnetic field surrounding the body, using a combination of accelerometers and gyroscopes, sometimes also magnetometers.
  • the head tracker device may be coupled to a head mounted display (HMD) worn by the user.
  • the gaze location of the user may be determined by an eye tracker device comprising one or more of an image sensor, an optical reflector sensor, a range sensor, an electromyography (EMG) sensor, and an optical flow sensor.
  • the focal area may be determined based on one or more of a detected focal point and a predicted future focal point. In some embodiments, the future focal point may be predicted based on one or more of the viewer's gaze point history, a gaze path profile associated with the viewer, gaze path data collected from a plurality of viewers, and a content provider provided standard gaze path. In some embodiments, the focal area may be represented by a point of focus in a 2D or 3D space. In some embodiments, the focal area may be represented as a 3D angle such as a direction represented by a spherical azimuthal angle (φ) and polar angle (θ). In some embodiments, the focal area may be represented by a 2D polar angle (θ).
  • the focal area may correspond to the pitch, yaw, and roll of the viewer's head, eyes, and/or the display device.
  • the system may compare the IMU data of the recorded content and the IMU data of the display device to determine the focal area of the viewer relative to the content.
  • the size of the focal area may further be determined based on the viewer's distance from the display device. For example, for a television display, a smaller focal area may be associated with a viewer sitting 5 feet away from the screen while a larger focal area may be associated with a viewer sitting 10 feet away.
  • the focal area may be approximated to an area of fixed size and shape around the user's focal point.
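  • The distance dependence described above follows from visual angle: a fixed angular radius subtends a larger display area at a greater viewing distance. A sketch, where the 9-degree angular radius is an assumption roughly covering the central through macular regions, not a figure from the disclosure:

        import math

        def focal_radius(viewing_distance: float, angular_radius_deg: float = 9.0) -> float:
            """On-screen radius (same unit as the distance) subtended by a fixed visual angle."""
            return viewing_distance * math.tan(math.radians(angular_radius_deg))

        # A viewer 5 feet from a television: focal_radius(5.0)  -> about 0.79 ft
        # A viewer 10 feet away:             focal_radius(10.0) -> about 1.58 ft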
  • In step 320, the playback device retrieves a low bit rate version of a content item.
  • a playback device sends a content request to a server hosting the content item in step 320 to retrieve the content item.
  • the low bit rate version of the content item may comprise a reduced bit rate version of the content item generated by a content provider and/or the hosting service.
  • step 320 may occur prior to step 310 and the low bit rate version of the content item may begin to be downloaded, buffered, and/or viewed prior to the focal area of the viewer being determined.
  • step 320 may correspond to step 240 described with reference to FIG. 2 herein.
  • In step 330, the playback device retrieves a portion of a high bit rate version of the content item.
  • the playback device sends a content request identifying the focal area of the viewer determined in step 310 to a server to retrieve the portion of the high bit rate version of the content item.
  • the retrieved portion may comprise a spatial portion of the content selected based on the focal area of the viewer.
  • the retrieved portion may comprise a short temporal segment of an area of the content item (e.g. milliseconds, seconds, frames, etc.).
  • the portion of the high bit rate version of the content item may be retrieved in a video stream separately encoded from the low bit rate version of the content item retrieved in step 320.
  • the low bit rate version of the content item may buffer ahead of the retrieval of the high bit rate version of the content item.
  • step 330 may correspond to step 270 described with reference to FIG. 2 herein.
  • In step 340, the system combines the portion of the high bit rate version of the content item with the low bit rate version of the content item to generate a combined image.
  • the system first decodes the portion of the high bit rate version of the content item retrieved in step 330 and the low bit rate version of the content item retrieved in step 320.
  • the system may first adjust the resolution and/or frame rate of at least one of the versions prior to combining the images.
  • the system may increase the resolution and/or frame rate of the low bit rate version of the content item to match the resolution and/or frame rate of the high bit rate portion by up-sampling and/or interpolating the decoded low bit rate version of the content item.
  • the system may combine the two versions of the content item by replacing the pixels in the frames of the low bit rate version of the content item with pixels from the corresponding frames of the portion of the high bit rate version of the content item.
  • the frames may be identified and matched by time stamps.
  • the image may further be blended to reduce the appearance of a border between the two versions of the content item.
  • the system blends the versions of the content item by generating a transition area between the portion of the high bit rate version of the content and the low bit rate version of the content. In the transition area, the pixels containing information from the high bit rate version may gradually decrease from the high bit rate area towards the low bit rate area of the displayed content.
  • blending the portion of the high bit rate version of the content item with the low bit rate version of the content item may comprise grouping pixels into triangular cells for blending. Examples of the transition areas and blending are described with reference to FIGS. 5 and 6A-6B herein.
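  • A sketch of the pixel replacement and blending of step 340 with NumPy follows; a simple linear alpha ramp at the patch edges stands in for the gradual decrease described above (the triangular-cell scheme of FIGS. 6A-6B is a separate approach), and the feather width is an illustrative assumption:

        import numpy as np

        def combine_frames(low: np.ndarray, high: np.ndarray,
                           x0: int, y0: int, feather: int = 16) -> np.ndarray:
            """Overlay a high bit rate patch onto the (already upscaled) low bit rate
            frame at (x0, y0), ramping the patch weight to zero at its edges."""
            out = low.astype(np.float32).copy()
            h, w = high.shape[:2]
            # Per-pixel weight: 1.0 in the patch core, falling to 0.0 at the border.
            ramp_y = np.minimum(np.arange(h), np.arange(h)[::-1]) / feather
            ramp_x = np.minimum(np.arange(w), np.arange(w)[::-1]) / feather
            alpha = np.clip(np.minimum.outer(ramp_y, ramp_x), 0.0, 1.0)[..., None]
            region = out[y0:y0 + h, x0:x0 + w]
            out[y0:y0 + h, x0:x0 + w] = alpha * high + (1.0 - alpha) * region
            return out.astype(low.dtype)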
  • the high bit rate portion may be provided in a pre-blended form from the server. For example, the edges of the high bit rate portion may comprise a plurality of empty/transparent pixels. The playback device may then overlay the high bit rate portion with the transparent pixels onto the low bit rate version of the content item without further processing the images and achieve the blended effect.
  • In step 350, the combined image is displayed on a display device.
  • the display device may comprise one or more of a monitor, a television set, a projector, a head mounted display (HMD), a virtual reality display device, a wearable device, a display screen, a mobile device, and the like.
  • the system may further adjust the combined image based on the display device's specifications. For example, for virtual reality display devices, the system may adjust for the warp and distortions associated with the device.
  • steps 310 to 350 may be repeated continuously as a viewer views a content item.
  • different portions of the high bit rate version of the content item may be retrieved in step 330 and combined with the low bit rate version in step 340 over time.
  • step 320 may occur independently of steps 310 and 330.
  • if no focal area is detected, the system may only retrieve the low bit rate version of the content item to display and skip steps 330-350 until a focal point is detected again.
  • the system may further be configured to determine a view area of the viewer and retrieve only a portion of the low bit rate content based on a view area of the viewer in step 320.
  • the view area of the viewer may be determined based on one or more of eye tracking and head tracking similar to the determination of the focal area in step 310.
  • the view area of the viewer may generally refer to the area of the content that is visible to the user but may or may not be in focus in the viewer's field of vision.
  • the view area may comprise an area surrounding the focal area.
  • the portion of the low bit rate version of the content item retrieved may exclude areas of the content area not within the view area.
  • the portion of the low bit rate version of the content item retrieved may further exclude the focal area and only include the area that is assumed to be visible to the viewer but not in focus.
  • the retrieved portion of the low bit rate version of the content item may correspond to one or more of the near, mid, and far peripheral vision area of the viewer's field of vision.
  • the content area 400 represents the entire image area of a content item. While the content area 400 is shown to be a rectangle, in some embodiments, the content area 400 may correspond to a cylinder, a sphere, a semi-sphere, etc. for immersive content and/or omnidirectional video content.
  • the content area 400 may generally comprise any shape, aspect ratio, and size without departing from the spirit of the present disclosure.
  • the focal point 410 represents the viewer's point of focus within the content. In some embodiments, the focal point 410 may correspond to a detected focal point and/or a predicted focal point.
  • the focal area 412 represents an area around the focal point 410 that is likely to be in focus within the viewer's field of vision.
  • the focal area may comprise one or more of the central, paracentral, macular, near peripheral, and mid peripheral areas of the viewer's field of vision.
  • the size and shape of the focal area 412 are shown as examples only. The relative sizes of the focal area 412 and the content area 400 may also vary.
  • the shape and size of the focal area 412 may be calibrated for each individual user and/or be estimated based on the viewer's profile containing one or more of viewer demographic information, viewing habits, user feedback, user settings, etc.
  • the size of the focal area 412 may further be determined based on the viewer's distance from the display screen. In some embodiments, for display device types with a fixed distance between the eyes of the viewer and the display screen (e.g. HMDs), the size of the focal area 412 may generally be assumed to remain the same.
  • the playback device may be configured to retrieve a portion of the high bit rate version of the content item corresponding to the focal area 412.
  • the content area 400 may be divided into a grid comprising a plurality of sections.
  • sections of the content area 400 overlapping the focal area 412 may comprise the portion of the high bit rate version of the content item retrieved by the playback device.
  • the high bit rate version of the content item may be displayed in the portion of the content area corresponding to the focal area 412 and the low bit rate version of the content item may be displayed in the remaining portion of the content area 400.
  • the high bit rate area may not be an exact match to the size and shape of the focal area 412 but may generally substantially cover the focal area 412.
  • the portion of the high bit rate version of the content item may be extracted to closely match the shape and size of the focal area 412.
  • the content area 400, the focal point 410, and the focal area 412 in FIG. 4B may generally be similar to the corresponding elements in FIG. 4A.
  • the system may further determine a view area 414 surrounding the focal area 412 as shown in FIG. 4B.
  • the view area 414 may generally refer to the area of the content that is visible to the user but may or may not be in focus in the viewer's field of vision.
  • the portion of the low bit rate version of the content item retrieved may exclude areas of the content area 400 outside of the view area 414.
  • the portion of the low bit rate version of the content item retrieved may further exclude the focal area 412 and only include the area that is assumed to be visible to the viewer but not in focus.
  • the view area may correspond to one or more of the near, mid, and far peripheral vision area of the viewer's field of vision.
  • the content area 400 may correspond to an immersive video content and/or an omnidirectional video content captured by a plurality of image sensors.
  • the view area 414 may be used to select and stitch a plurality of separately encoded video streams as described in United States Provisional Patent Application No. 62/357,259, filed on June 30, 2016, entitled "APPARATUS AND METHOD FOR CAPTURING AND DISPLAYING SEGMENTED CONTENT" the entire disclosure of which is hereby fully incorporated by reference herein in its entirety.
  • the focal area 412 may also comprise data from a plurality of separately encoded video streams that are stitched at the playback device.
  • FIG. 5 may represent a combined image displayed in step 350 of FIG. 3.
  • the displayed image comprises a low bit rate area 510, a high bit rate area 512, and a transition area 511.
  • pixels containing information from the high bit rate area 512 may gradually decrease from the high bit rate area 512 toward the low bit rate area 510.
  • blending the portion of the high bit rate version of the content with the low bit rate version of the content item comprises grouping pixels in the transition area 511 into cells for blending.
  • each set of grouped pixels may contain data from one of the versions of the content item or the other.
  • the size and shape of the transition area 511 is shown as an example only and the transition area 511 may be of any size, shape, and thickness.
  • the transition area 511 surrounds the high bit rate area and includes interleaved data from both the high bit rate area 512 and the low bit rate area 510 to reduce the appearance of a border between the two areas.
  • FIG. 6A shows a sphere divided into a plurality of triangular cells.
  • the sphere may correspond to the content area of an omnidirectional and/or immersive video content.
  • each cell may comprise a unit for blending images.
  • triangular cells better adapt to the curvature of a sphere and are less noticeable to human eyes as compared to square or rectangular cells.
  • the triangular cells may further be subdivided into smaller triangular cells to provide for adjustable granularity in blending.
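  • Such subdivision can be done with standard midpoint (one-into-four) splitting; in the sketch below each cell is a triangle of 3D unit vectors on the sphere, and re-normalizing the edge midpoints keeps new vertices on the surface (an assumption consistent with FIG. 6A, not a prescribed method):

        import numpy as np

        def subdivide(tri: np.ndarray) -> list:
            """Split one spherical triangle (3x3 array of unit vectors) into four."""
            a, b, c = tri

            def mid(p, q):
                m = (p + q) / 2.0
                return m / np.linalg.norm(m)  # project the midpoint back onto the sphere

            ab, bc, ca = mid(a, b), mid(b, c), mid(c, a)
            return [np.array(t) for t in
                    ((a, ab, ca), (ab, b, bc), (ca, bc, c), (ab, bc, ca))]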
  • FIG. 6B illustrates blending using triangular cells.
  • the cells in FIG. 6B may represent a section of a transition area between two versions of a content item. In FIG. 6B, cells labeled with "1" may contain data from one version of a content item and cells labeled with "2" may contain data from a different version of the content item.
  • each cell in FIG. 6B may be subdivided into smaller triangular cells for more granular blending.
  • a transition area may have any number of rows or columns of triangular cells.
  • each cell shown in FIGS. 6A and 6B may be merged or subdivided to form triangular cells of different sizes for blending images.
  • the focal area of a viewer may be determined based on the area of the content that is likely to be in focus in a viewer's field of vision.
  • the focal area is approximated to an oval.
  • the focal area may be approximated to a circle, a square, etc. by the system.
  • FIGS. 7A and 7B illustrate other shapes that may represent the shape of the focal area used by the system.
  • the shape shown in FIG. 7A approximates the shape of the human field of vision with two merged ovals having aligned major axes.
  • the shape shown in FIG. 7B comprises two ovals having major axes that are perpendicular to each other.
  • the shape shown in FIG. 7B may be used to create a buffer area around the point of focus.
  • vertical or horizontal movements are generally more common than diagonal movements. Therefore, using the shape shown in FIG. 7B to approximate the focal area may allow a viewer to have some vertical or horizontal eye movements without having their focal area leave the high bit rate content area.
  • the retrieved portion of the high bit rate content item discussed here may correspond to one or more of the shapes shown in FIGS. 4A-4B, 7A-7B, a circle, a square, a rectangle, and the like.
  • Referring to FIG. 8, the system includes a content server 810 and a playback device 820 communicating over a data connection such as a network.
  • the content server 810 includes a processor 812, a memory 813, and a communication device 814.
  • the content server 810 may generally comprise one or more processor-based devices accessible by the playback device via a network such as the Internet.
  • the content server may comprise one or more of a cloud-based server, a content host, a streaming service host, a media server, a streaming video server, a broadcast content server, a social networking server, and the like.
  • the processor 812 may comprise one or more of a control circuit, a central processor unit, a graphical processor unit (GPU), a microprocessor, a video decoder, a video encoder and the like.
  • the memory 813 may include one or more of a volatile and/or non-volatile computer readable memory devices. In some embodiments, the memory 813 stores computer executable code that causes the processor 812 to provide content to the playback device 820.
  • the communication device 814 may comprise one or more of a network adapter, a data port, a router, a modem, and the like. Generally, the communication device 814 may be configured to allow the processor 812 to communicate with the playback device 820. In some embodiments, the processor 812 may be configured to provide a low bit rate version of a content item and a portion of a high bit rate version of the content item to the playback device 820 based on a request from the playback device 820.
  • the request may comprise an identification of the requested content item and/or an indication of a focal area of the viewer of the content item.
  • the processor 812 may be configured to generate and/or store at least one of the low bit rate version of the content item and one or more portions of the high bit rate version of the content item based on a received content item.
  • the memory 813 and/or a separate content library may store one or more content items each comprising at least two versions of the content item having different bit rates.
  • the content server 810 may be configured to stream the content recorded by a capture device to the playback device 820 in substantially real-time.
  • the content server 810 may be configured to host a plurality of prerecorded content items for streaming and/or downloading to the playback devices 820 on-demand. While only one playback device 820 is shown in FIG. 8, the content server 810 may be configured to simultaneously receive content from a plurality of capture devices and/or provide content to a plurality of playback devices 820 via the communication device 814.
  • the content server 810 may be configured to facilitate peer-to- peer transfer of video streams between capture devices and playback devices 820.
  • the low bit rate version of the content item may be transferred via a peer-to- peer network while portions of the high bit rate content item may be transferred via the content server 810.
  • the content server 810 may be configured to provide the low bit rate version of the content item and the portion of the high bit rate version of the content item in separately encoded video streams.
  • the content server 810 may further be configured to pre-process the content item before providing the content item to the playback device 820.
  • the content server 810 may soften the edges of the extracted portion of the high bit rate version of the content item by including empty/transparent pixels at the edges prior to providing the portion of the high bit rate content to the playback device 820.
  • the playback device 820 may blend the video streams by simply combining the pixel data from the two versions without performing further image processing.
  • the content server 810 may be configured to combine a low bit rate version of a content item with a portion of the high bit rate version of the content prior to providing the combined content to the playback device 820.
  • While one content server 810 is shown, in some embodiments, functionalities of the content server 810 may be implemented on one or more processor-based devices. In some embodiments, the content servers 810 for providing low bit rate versions of contents and for providing high bit rate versions of contents may be separately implemented. For example, a central content server may be configured to provide low bit rate versions of contents while a plurality of geographically distributed content servers may be configured to provide portions of the high bit rate versions of contents to playback devices.
  • the playback device 820 comprises a processor 821, a memory 823, a display device 825, and a sensor device 827.
  • the playback device 820 may generally comprise a processor-based device such as one or more of a game console, a media console, a set-top box, a personal computer, a tablet computer, a television, a head mounted display ("HMD"), an augmented reality device, a virtual reality device, a wearable device, a portable user device, a smartphone, etc.
  • the processor 821 may comprise one or more of a control circuit, a central processor unit (CPU), a graphical processor unit (GPU), a microprocessor, a video decoder and the like.
  • the memory 823 may include one or more of a volatile and/or non-volatile computer readable memory devices.
  • the memory 823 stores computer executable code that causes the processor 821 to determine a focal area of a user and retrieve a content item from the content server 810.
  • the playback device 820 may be configured to retrieve a low bit rate version and a portion of a high bit rate version of the content item from the content server 810 and/or from a local storage and combine the two versions to generate a combined image to display to the user via the display device 825.
  • the memory 823 may comprise a buffer for buffering one or more versions of the content item retrieved from the content server 810.
  • the computer executable code stored in the memory 823 may comprise one or more of a computer program, a software program, a playback device firmware, a mobile application, a game and/or media console application, etc.
  • the display device 825 may comprise a device for displaying content to a viewer.
  • the display device 825 may comprise one or more of a monitor, a television, a head mounted display (HMD), a virtual reality display device, a wearable device, a display screen, a mobile device, and the like.
  • the display device 825 may comprise a stereoscopic display having one or more screens.
  • the sensor device 827 may comprise one or more sensors configured to determine a focal point and/or focal area of a viewer of the display device 825 (see the gaze-mapping sketch after this list).
  • the sensor device 827 may comprise one or more of an image sensor, an optical reflector sensor, a range sensor, an electromyography (EMG) sensor, and an optical flow sensor for detecting eye and/or head movement.
  • the sensor device 827 may comprise an inertial measurement unit (IMU) that measures and reports a body's specific force, angular rate, and/or the magnetic field surrounding the body, using a combination of accelerometers and gyroscopes, and in some cases magnetometers.
  • the sensor device 827 may be coupled to an HMD and/or a wearable device that allows the sensor to detect the motion of the user's head or eyes via the motion of the HMD and/or wearable device.
  • the sensor device 827 may comprise an optical sensor for detecting one or more of a head motion and eye-motion of the user.
  • the sensor may be coupled to an HMD and/or a wearable device and/or be a relatively stationary device that captures data from the viewer from a distance.
  • the display device 825 may comprise a separate device with or without a separate processor.
  • the display device 825 may be coupled to the playback device 820 via a wired or wireless communication channel.
  • the playback device 820 may comprise a PC or a game console and the display device 825 may comprise an HMD configured to display content from the playback device 820.
  • the sensor device 827 may be part of the playback device 820, the display device 825, and/or may be a physically separate device communicating with one or more of the playback device 820 and the display device 825.
  • one or more of the display device 825 and the sensor device 827 may be integrated with the playback device 820.
  • the display device 825 may further comprise a processor and/or a memory for at least partially storing the retrieved content and/or the viewer's eye or head movement detected by the sensor device 827.
  • the playback device 820 may further include a communication device such as a network adapter, a Wi-Fi transceiver, a mobile data network transceiver, etc. for requesting and downloading content items from the content server 810 and/or a capture device.
  • the playback device 820 may further include one or more user input/output devices such as buttons, a controller, a keyboard, a display screen, a touch screen and the like for the user to control the selection and playback of content items.
  • one or more of the embodiments, methods, approaches, and/or techniques described above may be implemented in one or more computer programs or software applications executable by a processor based apparatus or system.
  • the processor based apparatus or systems may comprise one or more of a computer, an entertainment system, a game console, a workstation, a graphics workstation, a server, a client, a portable device, a pad-like device, etc.
  • Such computer program(s) may be used for executing various steps and/or features of the above-described methods and/or techniques. That is, the computer program(s) may be adapted to cause or configure a processor based apparatus or system to execute and achieve the functions described above.
  • such computer program(s) may be used for implementing any embodiment of the above-described methods, steps, techniques, or features.
  • such computer program(s) may be used for implementing any type of tool or similar utility that uses any one or more of the above described embodiments, methods, approaches, and/or techniques.
  • program code macros, modules, loops, subroutines, calls, etc., within or without the computer program(s) may be used for executing various steps and/or features of the above-described methods and/or techniques.
  • the computer program(s) may be stored or embodied on a computer readable storage or recording medium or media, such as any of the computer readable storage or recording medium or media described herein.
  • the present invention provides a computer program product comprising a medium for embodying a computer program for input to a computer and a computer program embodied in the medium for causing the computer to perform or execute steps comprising any one or more of the steps involved in any one or more of the embodiments, methods, approaches, and/or techniques described herein.
  • the present invention provides one or more non-transitory computer readable storage mediums storing one or more computer programs adapted or configured to cause a processor based apparatus or system to execute steps comprising: determining a focal area of a viewer of a content item displayed on a display device, retrieving a low bit rate version of the content item, retrieving a portion of a high bit rate version of the content item corresponding to the focal area, combining the portion of the high bit rate version of the content item with the low bit rate version of the content item to generate a combined image, and causing the combined image to be displayed to the viewer via the display device (an end-to-end sketch of these steps follows this list).
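
The edge-softening and per-pixel blending items above can be illustrated with a short sketch. The following Python/numpy code is a minimal sketch, assuming a linear feather of width `margin` pixels at the borders of the extracted high bit rate region and a simple alpha composite at the playback device; the function names, the feather shape, and the 8-pixel default are illustrative assumptions rather than details taken from the disclosure.

```python
import numpy as np

def feather_alpha(h, w, margin=8):
    """Alpha mask that fades linearly from 0 at each border to 1 over
    `margin` pixels, approximating the 'softened edges' the content
    server may add around the extracted high bit rate region."""
    ramp_y = np.arange(h)
    ramp_y = np.minimum(ramp_y, ramp_y[::-1]) / max(margin, 1)
    ramp_x = np.arange(w)
    ramp_x = np.minimum(ramp_x, ramp_x[::-1]) / max(margin, 1)
    return np.clip(np.minimum(ramp_y[:, None], ramp_x[None, :]), 0.0, 1.0)

def composite(low_frame, high_region, top, left, margin=8):
    """Blend the high bit rate region into the (already upscaled) low
    bit rate frame: out = alpha * high + (1 - alpha) * low."""
    h, w, _ = high_region.shape
    alpha = feather_alpha(h, w, margin)[..., None]
    out = low_frame.astype(np.float32)
    patch = out[top:top + h, left:left + w]
    out[top:top + h, left:left + w] = alpha * high_region + (1.0 - alpha) * patch
    return out.astype(np.uint8)
```

Because the mask falls to zero exactly at the region border, the high bit rate pixels dissolve into the surrounding low bit rate image rather than producing a visible seam, which is what lets the playback device combine pixel data without performing further image processing.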
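For the focal-area determination performed with the sensor device 827, one minimal sketch, assuming the sensor yields a normalized gaze point in [0, 1] on each axis, is to map that point to pixel coordinates and clamp a fixed-size focal region inside the frame. `gaze_to_region` is an invented helper name, and the 256x256 region size is an arbitrary illustration.

```python
def gaze_to_region(gaze_x, gaze_y, frame_w, frame_h, region_w=256, region_h=256):
    """Map a normalized gaze point (0..1 per axis) to the top-left corner
    of a region_w x region_h focal region, clamped to stay in the frame."""
    cx = gaze_x * frame_w
    cy = gaze_y * frame_h
    left = int(min(max(cx - region_w / 2, 0), frame_w - region_w))
    top = int(min(max(cy - region_h / 2, 0), frame_h - region_h))
    return top, left

# A viewer looking slightly right of center in a 1920x1080 frame:
top, left = gaze_to_region(0.6, 0.5, 1920, 1080)  # -> (412, 1024)
```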
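Finally, the steps recited in the last item of the list above can be strung together into an end-to-end playback loop. The sketch below reuses `composite` and `gaze_to_region` from the previous sketches and fakes the transport and sensor layers with stub classes, since the disclosure deliberately leaves those mechanisms open (content server, peer-to-peer network, or local storage); every name here is hypothetical.

```python
import numpy as np

class StubSensor:
    """Stands in for the sensor device 827; always reports a centered gaze."""
    def read_gaze(self):
        return 0.5, 0.5

class StubServer:
    """Stands in for the content server 810 and/or local storage."""
    def fetch_low_version(self, content_id):
        for _ in range(3):                         # three dummy 1080p frames
            yield np.zeros((1080, 1920, 3), np.uint8)

    def fetch_high_region(self, content_id, frame_idx, top, left, h, w):
        return np.full((h, w, 3), 255, np.uint8)   # dummy bright focal tile

def play(content_id, sensor, server, frame_w=1920, frame_h=1080):
    """Per frame: read gaze, fetch only the focal region at high bit rate,
    composite it over the low bit rate frame, and collect the result."""
    frames = []
    for frame_idx, low_frame in enumerate(server.fetch_low_version(content_id)):
        gaze_x, gaze_y = sensor.read_gaze()
        top, left = gaze_to_region(gaze_x, gaze_y, frame_w, frame_h)
        high = server.fetch_high_region(content_id, frame_idx, top, left, 256, 256)
        frames.append(composite(low_frame, high, top, left))
    return frames  # a real player would hand each frame to the display device 825

combined_frames = play("demo-content", StubSensor(), StubServer())
```

The bandwidth benefit follows from the loop requesting full frames only at the low bit rate while the high bit rate request covers just the focal tile.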

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Social Psychology (AREA)
  • Databases & Information Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Human Computer Interaction (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

A method for displaying content is provided. One embodiment of the method comprises: determining a focal area of a viewer of a content item displayed on a display device; retrieving a low bit rate version of the content item; retrieving a portion of a high bit rate version of the content item corresponding to the focal area; combining the portion of the high bit rate version of the content item with the low bit rate version of the content item to generate a combined image; and causing the combined image to be displayed to the viewer via the display device. Systems that perform similar steps, and non-transitory computer readable storage mediums each storing one or more computer programs, are also described.
PCT/US2017/035060 2016-06-30 2017-05-30 Appareil et procédé de fourniture et d'affichage de contenu WO2018004936A1 (fr)

Priority Applications (5)

Application Number Priority Date Filing Date Title
JP2018568224A JP6859372B2 (ja) 2016-06-30 2017-05-30 コンテンツを表示するための方法及びシステム、並びにコンテンツを提供するための方法及びシステム
KR1020197003058A KR20190022851A (ko) 2016-06-30 2017-05-30 콘텐츠를 제공 및 디스플레이하기 위한 장치 및 방법
EP17820807.0A EP3479574A4 (fr) 2016-06-30 2017-05-30 Appareil et procédé de fourniture et d'affichage de contenu
CN201780039760.5A CN109417624B (zh) 2016-06-30 2017-05-30 用于提供和显示内容的装置和方法
KR1020207037655A KR102294098B1 (ko) 2016-06-30 2017-05-30 콘텐츠를 제공 및 디스플레이하기 위한 장치 및 방법

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
US201662357259P 2016-06-30 2016-06-30
US62/357,259 2016-06-30
US201662374687P 2016-08-12 2016-08-12
US62/374,687 2016-08-12
US15/280,933 US11089280B2 (en) 2016-06-30 2016-09-29 Apparatus and method for capturing and displaying segmented content
US15/280,947 US20180007422A1 (en) 2016-06-30 2016-09-29 Apparatus and method for providing and displaying content
US15/280,947 2016-09-29
US15/280,933 2016-09-29
US15/280,962 2016-09-29
US15/280,962 US10805592B2 (en) 2016-06-30 2016-09-29 Apparatus and method for gaze tracking

Publications (1)

Publication Number Publication Date
WO2018004936A1 true WO2018004936A1 (fr) 2018-01-04

Family

ID=60786538

Family Applications (3)

Application Number Title Priority Date Filing Date
PCT/US2017/035058 WO2018004934A1 (fr) 2016-06-30 2017-05-30 Appareil et procédé de capture et d'affichage de contenu segmenté
PCT/US2017/035057 WO2018004933A1 (fr) 2016-06-30 2017-05-30 Appareil et procédé destinés au suivi du regard
PCT/US2017/035060 WO2018004936A1 (fr) 2016-06-30 2017-05-30 Appareil et procédé de fourniture et d'affichage de contenu

Family Applications Before (2)

Application Number Title Priority Date Filing Date
PCT/US2017/035058 WO2018004934A1 (fr) 2016-06-30 2017-05-30 Appareil et procédé de capture et d'affichage de contenu segmenté
PCT/US2017/035057 WO2018004933A1 (fr) 2016-06-30 2017-05-30 Appareil et procédé destinés au suivi du regard

Country Status (1)

Country Link
WO (3) WO2018004934A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2020120254A (ja) * 2019-01-23 2020-08-06 株式会社近江デジタルファブリケーションズ 配信画像生成方法

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10826964B2 (en) 2018-09-05 2020-11-03 At&T Intellectual Property I, L.P. Priority-based tile transmission system and method for panoramic video streaming
US10931979B2 (en) 2018-10-18 2021-02-23 At&T Intellectual Property I, L.P. Methods, devices, and systems for decoding portions of video content according to a schedule based on user viewpoint

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4208811A (en) 1978-11-30 1980-06-24 Enrique Junowicz Display with overlapping picture elements
US6762789B1 (en) * 1999-06-25 2004-07-13 Matsushita Electric Industrial Co., Ltd. Omnidirectional video output method and apparatus
US8184069B1 (en) * 2011-06-20 2012-05-22 Google Inc. Systems and methods for adaptive transmission of data
US20140282750A1 (en) * 2013-03-15 2014-09-18 Cox Communications, Inc. Systems, methods, and apparatus for accessing recordings of content items on multiple customer devices
US20150002529A1 (en) 2013-06-27 2015-01-01 Canon Kabushiki Kaisha Method, system and apparatus for rendering
IN2015CH02866A (fr) 2015-06-09 2015-07-17 Wipro Ltd

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1602322A1 (fr) * 2004-06-02 2005-12-07 SensoMotoric Instruments GmbH Méthode et appareil pour réduction de temps d'inactivité d'un appareil de poursuite oculaire
US20090278921A1 (en) * 2008-05-12 2009-11-12 Capso Vision, Inc. Image Stabilization of Video Play Back
US9313481B2 (en) 2014-02-19 2016-04-12 Microsoft Technology Licensing, Llc Stereoscopic display responsive to focal-point shift
US9462230B1 (en) * 2014-03-31 2016-10-04 Amazon Technologies Catch-up video buffering
KR102611448B1 (ko) * 2014-05-29 2023-12-07 네버마인드 캐피탈 엘엘씨 콘텐트를 전달 및/또는 콘텐트를 재생하기 위한 방법들 및 장치
US10204658B2 (en) * 2014-07-14 2019-02-12 Sony Interactive Entertainment Inc. System and method for use in playing back panorama video content

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4208811A (en) 1978-11-30 1980-06-24 Enrique Junowicz Display with overlapping picture elements
US6762789B1 (en) * 1999-06-25 2004-07-13 Matsushita Electric Industrial Co., Ltd. Omnidirectional video output method and apparatus
US8184069B1 (en) * 2011-06-20 2012-05-22 Google Inc. Systems and methods for adaptive transmission of data
US20120319928A1 (en) 2011-06-20 2012-12-20 Google Inc. Systems and Methods for Adaptive Transmission of Data
US20140282750A1 (en) * 2013-03-15 2014-09-18 Cox Communications, Inc. Systems, methods, and apparatus for accessing recordings of content items on multiple customer devices
US20150002529A1 (en) 2013-06-27 2015-01-01 Canon Kabushiki Kaisha Method, system and apparatus for rendering
IN2015CH02866A (fr) 2015-06-09 2015-07-17 Wipro Ltd

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3479574A4

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2020120254A (ja) * 2019-01-23 2020-08-06 株式会社近江デジタルファブリケーションズ 配信画像生成方法
JP7219620B2 (ja) 2019-01-23 2023-02-08 株式会社近江デジタルファブリケーションズ 配信画像生成方法

Also Published As

Publication number Publication date
WO2018004934A1 (fr) 2018-01-04
WO2018004933A1 (fr) 2018-01-04

Similar Documents

Publication Publication Date Title
KR102294098B1 (ko) 콘텐츠를 제공 및 디스플레이하기 위한 장치 및 방법
US10536693B2 (en) Analytic reprocessing for data stream system and method
Fan et al. A survey on 360 video streaming: Acquisition, transmission, and display
US20160277772A1 (en) Reduced bit rate immersive video
US11653065B2 (en) Content based stream splitting of video data
US11290699B2 (en) View direction based multilevel low bandwidth techniques to support individual user experiences of omnidirectional video
US9832450B2 (en) Methods and apparatus for generating and using reduced resolution images and/or communicating such images to a playback or content distribution device
CN112204993B (zh) 使用重叠的被分区的分段的自适应全景视频流式传输
CN110419224B (zh) 消费视频内容的方法、电子设备和服务器
US10939139B2 (en) Adaptive coding and streaming of multi-directional video
US10769754B2 (en) Virtual reality cinema-immersive movie watching for headmounted displays
US20190104330A1 (en) Method, apparatus and stream of formatting an immersive video for legacy and immersive rendering devices
Podborski et al. Virtual reality and DASH
WO2018004936A1 (fr) Appareil et procédé de fourniture et d'affichage de contenu
Reznik User-adaptive mobile video streaming using MPEG-DASH
KR102183895B1 (ko) 가상 현실 비디오 스트리밍에서의 관심영역 타일 인덱싱
CN116848840A (zh) 多视图视频流式传输

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17820807

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2018568224

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20197003058

Country of ref document: KR

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2017820807

Country of ref document: EP

Effective date: 20190130