WO2021142036A1 - Media content presentation - Google Patents

Media content presentation

Info

Publication number
WO2021142036A1
Authority
WO
WIPO (PCT)
Prior art keywords
video asset
aspect ratio
display
video
location
Prior art date
Application number
PCT/US2021/012376
Other languages
French (fr)
Inventor
Robert A. POST
Blake Barnes
Joseph BURFITT
Eric Buehl
Clifton Smith
Nicholas Allen
Original Assignee
QBI Holdings, LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US16/735,511 external-priority patent/US20200296316A1/en
Application filed by QBI Holdings, LLC filed Critical QBI Holdings, LLC
Publication of WO2021142036A1 publication Critical patent/WO2021142036A1/en

Classifications

    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION (within H: ELECTRICITY; H04: ELECTRIC COMMUNICATION TECHNIQUE)
    • H04N 21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/23439: Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs, involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements, for generating different versions
    • H04N 21/2365: Multiplexing of several video streams
    • H04N 21/41407: Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance, embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
    • H04N 21/42202: Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]; environmental sensors, e.g. for detecting temperature, luminosity, pressure, earthquakes
    • H04N 21/42222: Additional components integrated in the remote control device, e.g. timer, speaker, sensors for detecting position, direction or movement of the remote control, microphone or battery charging device
    • H04N 21/4347: Demultiplexing of several video streams
    • H04N 21/440272: Processing of video elementary streams involving reformatting operations of video signals for household redistribution, storage or real-time display, by altering the spatial resolution, e.g. for performing aspect ratio conversion
    • H04N 21/4424: Monitoring of the internal components or processes of the client device, e.g. CPU or memory load, processing speed, timer, counter or percentage of the hard disk space used
    • H04N 21/458: Scheduling content for creating a personalised stream, e.g. by combining a locally stored advertisement with an incoming stream; updating operations, e.g. for OS modules; time-related management operations
    • H04N 7/00: Television systems
    • H04N 7/0122: Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level, involving conversion of the spatial resolution of the incoming video signal, the input and the output signals having different aspect ratios

Definitions

  • Examples of the disclosure relate generally to systems and methods for presenting media content to a user of a computing device, and more specifically, to systems and methods for presenting media content including video to a user of a mobile computing device.
  • Such mobile devices place new demands on video content.
  • One such demand relates to the aspect ratio (e.g., the ratio of a display width to a display height) of the video content.
  • a native aspect ratio of a video asset (e.g., a source file containing video content) may differ from the aspect ratio of the display on which the asset is to be presented.
  • one of two conventional solutions can be used to format the video asset for the display: either the video asset can be cropped to fit the display (e.g., via “pan and scan” techniques); or the video asset can be “letterboxed” by adding dummy content (e.g., black bars) to fill the regions of the display unoccupied by the video asset.
  • cropping the video asset results in the cropped content being unviewable on the display (which can affect the viewer’s understanding or appreciation of the video asset); and letterboxing the video asset results in regions of the display being effectively unused (which can impair visibility, especially on mobile devices with limited display space).
  • a preferred solution is to anticipate the aspect ratio of the display on which video content will be viewed, and to provide to the display a video asset that matches that aspect ratio. But this approach is frustrated by mobile device displays that change aspect ratios as the user changes the orientation of the device. For instance, a display may be in a “portrait” mode (e.g., in which the aspect ratio is less than unity) when a device is held upright, but may shift to a “landscape” mode (e.g., in which the aspect ratio is greater than unity) when the device is rotated 90 degrees to the left or the right.
  • a solution is needed for seamlessly switching between aspect ratios of video content without resorting to cropping or letterboxing techniques.
  • it is further desirable that video content be data-efficient: that is, that the video content respect the limited data storage capacity of many mobile devices, and the cost and overhead of downloading large files on consumer data plans; and that it accommodate the high-latency, low-bandwidth network conditions in which mobile devices may operate.
  • the present disclosure describes one or more such solutions, which improve on conventional approaches by providing a data-efficient mechanism for quickly and seamlessly changing an aspect ratio of video content on a mobile device display.
  • a plurality of assets is received at a device comprising a display and an orientation sensor.
  • the plurality of assets comprises a first video asset associated with a first aspect ratio, and a second video asset associated with a second aspect ratio, different from the first aspect ratio.
  • a desired aspect ratio is determined based on an output of the orientation sensor.
  • if the desired aspect ratio corresponds more closely to the first aspect ratio, the first video asset is selected.
  • if the desired aspect ratio corresponds more closely to the second aspect ratio, the second video asset is selected.
  • the selected video asset is presented at the desired aspect ratio via the display.
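As a rough illustration of the flow just described, the sketch below derives a desired aspect ratio from the display's current dimensions and picks the closer of two assets. The class, function names, and the 9:16/16:9 ratios are illustrative assumptions, not taken from the disclosure.

```python
from dataclasses import dataclass

@dataclass
class VideoAsset:
    name: str
    aspect_ratio: float  # width / height

# Hypothetical assets: one portrait (9:16) and one landscape (16:9).
PORTRAIT = VideoAsset("P", 9 / 16)
LANDSCAPE = VideoAsset("L", 16 / 9)

def desired_aspect_ratio(display_w: int, display_h: int) -> float:
    """Derive the desired aspect ratio from the display's current
    orientation (approximated here by its pixel dimensions)."""
    return display_w / display_h

def select_asset(desired: float) -> VideoAsset:
    """Pick whichever asset's aspect ratio is closest to the desired one."""
    return min((PORTRAIT, LANDSCAPE),
               key=lambda a: abs(a.aspect_ratio - desired))

# Portrait-held phone (1080x1920) -> portrait asset selected.
print(select_asset(desired_aspect_ratio(1080, 1920)).name)  # "P"
# Rotated to landscape (1920x1080) -> landscape asset selected.
print(select_asset(desired_aspect_ratio(1920, 1080)).name)  # "L"
```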
  • FIGs. 1A-1D illustrate an example smartphone, an example tablet, an example wearable device, and an example head-mounted device that can each include a display according to examples of the disclosure.
  • FIG. 2 illustrates a display having an aspect ratio according to examples of the disclosure.
  • FIG. 3A illustrates an example of presenting video content in a portrait aspect ratio, according to examples of the disclosure.
  • FIG. 3B illustrates an example of presenting video content in a landscape aspect ratio, according to examples of the disclosure.
  • FIGs. 4A-4B illustrate examples of determining an orientation of a display according to examples of the disclosure.
  • FIG. 5 illustrates an example process for encoding and presenting assets to a user via a display, according to examples of the disclosure.
  • FIGs. 6A-6D illustrate examples of data streams comprising video and/or audio according to examples of the disclosure.
  • FIGs. 7A-7G illustrate examples of encoding assets comprising video according to examples of the disclosure.
  • FIG. 8 illustrates an example computer system for implementing various examples of the disclosure.
  • FIGs. 9A-9C illustrate an example layout of a user interface including a scrubber according to examples of the disclosure.
  • FIGs. 10A-10B illustrate examples of a user interface including a scrubber according to examples of the disclosure.
  • FIGs. 11A-11B illustrate examples of a user interface including a scrubber according to examples of the disclosure.
  • FIGs. 12A-12D illustrate examples of a user interface including a scrubber according to examples of the disclosure.
  • FIGs. 13A-13D illustrate examples of a user interface including a scrubber according to examples of the disclosure.
  • FIGs. 14A-14B illustrate examples of a user interface including a scrubber according to examples of the disclosure.
  • FIGs. 15A-15D illustrate examples of a user interface including a scrubber according to examples of the disclosure.
  • FIGs. 1A-1D illustrate examples of mobile devices including displays that can be used to present video content (which may comprise one or more video assets, as well as, in some examples, corresponding audio assets, or other assets such as assets describing haptic effects).
  • mobile devices may include, but are not limited to, smartphones, tablets, and wearable devices. Although described with respect to mobile devices, examples of this disclosure may also be used with a television display or a computer monitor.
  • video can include still images, motion video (e.g., sequences of image frames), GIF files, or any other suitable visual media content.
  • FIG. 1A illustrates an example smartphone 110 with a display 112.
  • FIG. 1B illustrates an example tablet device 120 with a display 122.
  • FIG. 1C illustrates an example wearable device 130 (such as a smart watch) with a display 132.
  • FIG. 1D illustrates an example wearable head-mounted device 140 with a display 142 configured to be positioned in front of a user’s eyes.
  • a display can comprise a transmissive display, such as for augmented reality or mixed reality applications.
  • the head-mounted device can include a non-transmissive display, such as for virtual reality applications or conventional computing applications.
  • Each of these example devices can include a respective one or more processors; one or more speakers; one or more actuators; one or more sensors (e.g., orientation sensors such as accelerometers, gyroscopes, and inertial measurement units (IMUs); position sensors such as GPS; cameras; microphones; or other suitable sensors); storage capabilities (e.g., internal volatile or non-volatile memory, or interfaces to external storage such as optical storage, solid-state storage, or magnetic storage); input or output devices; and networking capabilities, such as to send and receive data (including video data) via a network.
  • Displays, such as those that can be included in the example devices described above with respect to FIGs. 1A-1D, can be characterized by an aspect ratio — conventionally, the ratio of the width of the display to the height of the display, although other conventions (e.g., the ratio of the height to the width, or an angle of a diagonal) can be used.
  • FIG. 2 is illustrative.
  • an example display 200 has a width w1 and a height h1; the aspect ratio can be expressed as the inverse of the slope of the diagonal line 202 (i.e., the width w1 divided by the height h1). Equivalently, the aspect ratio can be expressed in terms of the angle θ1 (e.g., as the tangent of θ1).
  • if the aspect ratio is less than unity (e.g., the inverse slope of 202 is less than 1, and θ1 is less than 45 degrees), the display can be described as having a “portrait” orientation. Conversely, if the aspect ratio is greater than unity (e.g., the inverse slope of 202 is greater than 1, and θ1 is greater than 45 degrees), the display can be described as having a “landscape” orientation. In some examples, the angle θ1 may be any angle from 1 to 89 degrees.
  • a width and height of a display can refer to horizontal and vertical dimensions, respectively, of the display with respect to a viewer (which may differ from a width and height of the device itself, if the device is rotated with respect to the viewer).
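To make the geometry concrete, here is a small sketch of the convention described above; the function names are illustrative.

```python
import math

def aspect_ratio(width: float, height: float) -> float:
    """Aspect ratio as width / height, the inverse slope of the diagonal."""
    return width / height

def diagonal_angle_deg(width: float, height: float) -> float:
    """Angle theta_1 between the diagonal and the vertical edge, chosen so
    that tan(theta_1) equals the aspect ratio."""
    return math.degrees(math.atan2(width, height))

def classify(width: float, height: float) -> str:
    """Portrait if the ratio is below unity (theta_1 < 45 deg), else landscape."""
    return "portrait" if aspect_ratio(width, height) < 1.0 else "landscape"

print(classify(9, 16), round(diagonal_angle_deg(9, 16)))   # portrait 29
print(classify(16, 9), round(diagonal_angle_deg(16, 9)))   # landscape 61
```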
  • FIGs. 3A-3B illustrate examples of video content being presented on a display of a mobile device.
  • device 300 is oriented with its display in a portrait orientation, and is presenting video content 310A via its display.
  • device 300 is rotated such that its display is in a landscape orientation, and is presenting video content 310B via its display.
  • Device 300 can be freely rotated between the portrait orientation and the landscape orientation, and the corresponding video content (310A or 310B, respectively) will be presented accordingly.
  • Video content can be presented by a media player application executing on device 300. One skilled in the art will understand that the device may be rotated both clockwise and counterclockwise to switch between the video content 310A and 310B.
  • device 300 is shown presenting video content 310A/310B that includes a television news-style program comprising various visual elements: a host 322A/322B, which may be video footage of the host; and visual elements 320A/320B, 324A/324B, and 326A/326B, all overlaid on background 328A/328B.
  • video content 310A/310B may be presented concurrently with a corresponding audio track (e.g., presented via speakers of device 300), or with some other type of output (e.g., haptic output).
  • Video content 310A includes one or more video assets associated with a portrait aspect ratio to match a portrait aspect ratio of the display of device 300 in FIG. 3A.
  • video content 310A can comprise a single video asset that includes elements 320A, 322A, 324A, 326A, and 328A, and is formatted to a native portrait aspect ratio.
  • video content 310A can comprise two or more video assets.
  • a first video asset of video content 310A could include a composite of host 322A and background 328A, with the first video asset formatted to native portrait aspect ratio; a second video asset of video content 310A could include element 320A; a third video asset of video content 310A could include element 324A; and a fourth video asset of video content 310A could include element 326A.
  • the second, third, and fourth video assets could be considered layers, and the layers could be combined (e.g., procedurally) with the first video asset (e.g., the background and host) to generate a composite video featuring the layers arranged on top of the first video asset, with the composite video having a native portrait aspect ratio.
  • video content 310B includes one or more video assets associated with a landscape aspect ratio to match a landscape aspect ratio of the display of device 300 in FIG. 3B.
  • Video content 310B can in some examples comprise a single video asset including elements 320B, 322B, 324B, 326B, and 328B, and formatted to a native landscape aspect ratio; and can in some examples include two or more video assets, such as described above with respect to video content 310A, with the video assets used to generate a composite video having a native landscape aspect ratio.
  • Device 300 can select whether to display video content 310A (and its corresponding assets) or video content 310B (and its corresponding assets), based on an orientation of device 300, such as described below.
  • video content (and corresponding video assets) that are unselected may remain resident in the memory of device 300, such that switching between video content 310A and video content 310B can occur quickly and seamlessly.
  • in some examples, transition effects (e.g., dissolves, fades, screen wipes) or animations can be used when switching between video content 310A and video content 310B to further ease the transition between the two.
  • FIGs. 4A and 4B illustrate examples of determining an orientation of a mobile device 400, in order to select which video assets of a plurality of video assets should be presented on the display of the device.
  • the determined orientation could correspond to a portrait orientation, in which case a portrait mode video asset (e.g., for video content 310A in FIG. 3 A) could be selected; or the determined orientation could correspond to a landscape orientation, in which case a landscape mode video asset (e.g., for video content 310B in FIG. 3B) could be selected.
  • Orientation of a device can be determined by detecting an output of one or more orientation sensors of the device, such as an IMU, an accelerometer, and/or a gyroscope.
  • the orientation can be determined by measuring, from an orientation sensor, an angle of the display with respect to a horizontal plane in an inertial frame.
  • an orientation sensor can indicate that an edge of the display of device 400 lies at an angle α (412) with respect to horizon 410.
  • angle α can be compared to the diagonal of the display; for instance, if angle α exceeds an angle between the diagonal and an edge of the display, the device can be considered to be in landscape mode; and conversely, if angle α is less than the angle between the diagonal and the edge, the device can be considered to be in portrait mode.
  • angle α can be compared to one or more threshold values, and the device can be considered to be in landscape mode if the angle exceeds the threshold, and in portrait mode if it does not.
  • Example threshold values can include 30 degrees, 45 degrees, 60 degrees, or any other suitable threshold.
  • hysteresis effects can be implemented via asymmetric threshold values (e.g., a threshold value of 30 degrees for transitioning from portrait mode to landscape mode, and a threshold value of 60 degrees for transitioning from landscape mode to portrait mode).
  • threshold values can be specified by a user of the device, or by an author of video content to be presented on the device. Other suitable methods of determining an orientation of the device will be apparent.
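One way to realize the asymmetric thresholds described above is sketched below. The class name and the assignment of the 30/60-degree values to the two transitions are assumptions, chosen so that the band between the thresholds is a stable dead zone.

```python
class OrientationClassifier:
    """Tilt-angle classifier with hysteresis: the device must rotate past
    `enter_landscape_deg` to switch to landscape, and back below
    `exit_landscape_deg` to return to portrait. The dead band between the
    two thresholds prevents flickering when the device hovers near a
    single cutoff angle."""

    def __init__(self, enter_landscape_deg=60.0, exit_landscape_deg=30.0):
        self.enter = enter_landscape_deg
        self.exit = exit_landscape_deg
        self.mode = "portrait"

    def update(self, angle_deg: float) -> str:
        # angle_deg: angle between a display edge and the horizon,
        # as reported by an orientation sensor (e.g., an IMU).
        if self.mode == "portrait" and angle_deg >= self.enter:
            self.mode = "landscape"
        elif self.mode == "landscape" and angle_deg <= self.exit:
            self.mode = "portrait"
        return self.mode

clf = OrientationClassifier()
for a in (10, 45, 61, 45, 29):
    print(a, clf.update(a))
# 45 degrees leaves the mode unchanged in either direction (dead band).
```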
  • the orientation of the display can be determined with respect to an orientation of a user.
  • user 450 views device 400 while his or her head is oriented vertically along a vector 460 (i.e., vector 460 points upwards in the user’s field of view).
  • Vector 460 can be determined using one or more sensors of device 400, such as cameras used to track the eyes of user 450.
  • An orientation of device 400 can be determined by comparing an inertial orientation 414 of the device (which can be determined using orientation sensors, such as described above) to vector 460.
  • an angle between vector 460 and an edge of the device 400 can be detected, and used to determine a portrait orientation or a landscape orientation using the methods described above.
  • An advantage of this approach is that if the eyes of user 450 are positioned at an angle with respect to an inertial frame — for example, if user 450 is reclined, or lying on his or her side — the determined orientation of device 400 can take that into account. This can be desirable where, for instance, a user wishes to watch video content in portrait mode while fully reclined, even though the device may be oriented in a landscape mode with respect to an inertial frame (as might be detected by an orientation sensor such as an accelerometer).
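A sketch of the comparison between a device edge vector and the user's head-up vector 460 follows; only the vector arithmetic is shown, and the sensor fusion that would produce these vectors (e.g., from eye-tracking cameras and an IMU) is assumed to exist elsewhere.

```python
import math

def angle_between_deg(v1, v2):
    """Unsigned angle between two 2D vectors, in degrees."""
    dot = v1[0] * v2[0] + v1[1] * v2[1]
    return math.degrees(math.acos(dot / (math.hypot(*v1) * math.hypot(*v2))))

def user_relative_mode(device_long_edge, head_up, threshold_deg=45.0):
    """Portrait if the device's long edge is roughly aligned with the
    user's head-up vector 460, landscape otherwise."""
    a = angle_between_deg(device_long_edge, head_up)
    a = min(a, 180.0 - a)  # an edge direction is sign-agnostic
    return "portrait" if a < threshold_deg else "landscape"

# Device upright, user upright: portrait.
print(user_relative_mode((0, 1), (0, 1)))   # portrait
# Device upright in the inertial frame, but user lying on their side:
# relative to the user's eyes, the display reads as landscape.
print(user_relative_mode((0, 1), (1, 0)))   # landscape
```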
  • FIG. 5 illustrates an example process 500 for presenting video content to a user according to embodiments of the invention.
  • 502 represents a first video asset (e.g., video asset “L,” as in landscape), and 512 represents a second video asset (e.g., video asset “P,” as in portrait).
  • the first video asset can be associated with a first orientation (e.g., a landscape orientation), and the second video asset can be associated with a second orientation (e.g., a portrait orientation).
  • the first and second video assets may include the same general content; for example, the first and second video assets may be different versions of a single episode of a scripted television show. While example process 500 depicts two video assets, it will be understood that the example can be extended to any suitable number of video assets.
  • the first and second video assets 502 and 512 can be provided by a creative entity with creative control over the video assets.
  • the creative entity can author (e.g., produce and edit) the first video asset 502 such that it is creatively suited for presentation in the first orientation (e.g., landscape orientation); for example, the creative entity can select camera shots, control scene placement, and position graphical elements such that the video content is understandable and engaging in a landscape orientation.
  • the creative entity can similarly author the second video asset 512 such that it is creatively suited for presentation in the second orientation (e.g., portrait orientation). Viewability differences between the first orientation and the second orientation may result in significantly different creative demands of the first video asset 502 and the second video asset 512.
  • a full-body camera shot of a standing actor may be well suited for a portrait orientation, because the proportions of an actor standing upright may resemble the proportions of a portrait display. But the same full-body shot may be inappropriate for a landscape display, whose proportions vary significantly from those of the actor.
  • a wide-angle camera shot of a basketball court may present well on a landscape display, but may be entirely unsuited for a portrait display. Such differences may be especially pronounced on mobile devices, which may have small screens that make it difficult for a viewer to resolve small visual details (such as facial features).
  • the creative entity may elect to produce a first video asset 502 that differs (even significantly) from the second video asset 512, even though the two video assets may relate to the same general content.
  • the first and second video assets may comprise entire separate camera shots and sequences, transitions, focal points, post-processing effects, overlays, or other video elements, as appropriate.
  • Providing separate first and second video assets, where those assets may differ creatively, offers an advantage over processes in which a single video asset is manipulated (e.g., via cropping or letterboxing) for presentation at multiple different aspect ratios.
  • the creative entity can make human authorship decisions based on what the entity decides would look best when presented at a particular aspect ratio.
  • 504 represents a first audio asset corresponding to the first video asset 502, and 514 represents a second audio asset corresponding to the second video asset 512.
  • audio asset 504 is thus associated with the first aspect ratio, and audio asset 514 with the second aspect ratio.
  • Audio assets 504 and 514 may represent audio tracks to be presented concurrently with their respective video asset.
  • while example process 500 depicts two audio assets, it will be understood that the example can be extended to any suitable number of audio assets (which may, but need not, equal the number of video assets).
  • audio assets 504 and 514 may be identical assets, such as where identical audio tracks are to be presented regardless of whether a device is in a portrait orientation or a landscape orientation.
  • audio assets 504 and 514 may have different audio characteristics, such as where it is desirable to present different audio tracks based on whether the device is in a portrait orientation or a landscape orientation. For instance, during a scene of a video, first video asset 502 (e.g., in landscape orientation) may feature a distant camera shot of an actor’s face, while a corresponding scene in second video asset 512 (e.g., in portrait orientation) may feature a close-up camera shot of the same actor’s face.
  • the actor’s dialogue may be louder in the second audio asset 514 than in the first audio asset 504, to correspond with the close-up shot in portrait orientation; and for consistency with the user’s expectation that sounds originating closer to the camera will be relatively louder in an audio mix than those originating farther from the camera.
  • a creative entity can exercise creative control over the first audio asset and the second audio asset, such that the audio assets reflect human judgment of what will sound best to the listener.
  • stage 530 represents an encoder, which comprises or executes one or more encoding processes that can encode the data of one or more assets, such that the encoded data can be presented to a device or a process (e.g., a software application) that can decode the encoded data and present it to a user.
  • Encoder 530 can be implemented using one or more processors, which in some cases may be located on a server remote to the presentation device, or which may be implemented across a distributed computing system.
  • encoder 530 can perform encoding processes on an entire asset in advance of that asset’s presentation to the user; in some cases, encoder 530 can perform encoding processes in real-time, while the asset is being presented to the user (as in live television). In such examples, an asset may be divided into individual units (e.g., groups of frames or samples), with encoding performed separately on each unit.
  • the disclosure is not limited to any particular system or method for implementing encoder 530.
  • Encoder 530 can accept as input any suitable number or type of assets.
  • encoder 530 accepts as input the first video asset 502, the second video asset 512, the first audio asset 504, and the second audio asset 514, such as described above.
  • encoder 530 can accept as input one or more layer assets 520, which may describe assets to be composited with other assets.
  • a first video asset 502 could include host 322B and background 328B; and layer assets 520 could include overlay elements 320B (e.g., a graphic overlay); 324B (e.g., a brand logo); and 326B (e.g., a clock).
  • the layer assets and the first video asset could be provided as input to the encoding process 530, with the layer assets to be presented as a composite with the first video asset.
  • the composition could occur as part of an encoding process of the encoder 530; in some examples, layer assets 520 can be procedurally composited on a video asset by a media player application (e.g., application 560 described below). Layer assets 520 can, but need not, be associated with a particular aspect ratio.
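The disclosure does not prescribe a compositing algorithm; as one plausible realization, the sketch below alpha-blends an RGBA overlay layer onto an RGB base frame. The NumPy representation and function name are assumptions.

```python
import numpy as np

def composite(base: np.ndarray, layer_rgba: np.ndarray, x: int, y: int) -> np.ndarray:
    """Alpha-blend an RGBA layer (e.g., a logo or clock overlay) onto an
    RGB base frame at position (x, y), returning a new frame."""
    out = base.astype(np.float32).copy()
    h, w = layer_rgba.shape[:2]
    rgb = layer_rgba[..., :3].astype(np.float32)
    alpha = layer_rgba[..., 3:4].astype(np.float32) / 255.0
    region = out[y:y + h, x:x + w]
    out[y:y + h, x:x + w] = alpha * rgb + (1.0 - alpha) * region
    return out.astype(np.uint8)

# 1920x1080 background with a semi-transparent 64x64 overlay in the corner.
frame = np.zeros((1080, 1920, 3), dtype=np.uint8)
logo = np.full((64, 64, 4), 255, dtype=np.uint8)
logo[..., 3] = 128  # 50% opacity
frame = composite(frame, logo, x=16, y=16)
```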
  • Encoder 530 can encode its input assets according to one or more suitable processes, which may be selected depending on criteria such as network conditions (e.g., latency, available bandwidth), content type, user preferences, or display type (including display aspect ratios), such as described below.
  • encoder 530 can output one or more streams 540 of encoded data.
  • data streams 540 can include a first encoded data stream 542, a second encoded data stream 544, and a third encoded data stream 546 (and potentially other data streams).
  • a data stream may correspond to any suitable combination of video data, audio data, or data associated with any other suitable type of asset (e.g., haptic data).
  • the disclosure is not limited to any particular correlation of data streams to assets (such as assets 502, 504, 512, 514, and 520 described above); a data stream can include data for any suitable number or type of assets.
  • Data streams 540 can be delivered to device 550, which may correspond to the example devices in FIGs. 1A-1D.
  • data streams 540 can be downloaded by device 550 via a computer network, such as a content delivery network (e.g., via a streaming download, or by downloading individual files).
  • data streams 540 can be stored on storage media (e.g., optical storage such as a DVD-ROM, solid-state storage such as a USB memory stick, or magnetic storage such as a hard drive), and read by device 550 via an interface to the storage media.
  • a media player application 560 (e.g., a software application executing on one or more processors of device 550) can accept the encoded data streams 540 as input, and process that data (e.g., by decompressing it and setting rendering parameters) to present the underlying assets (e.g., video assets 502 and 512 and audio assets 504 and 514) to a user.
  • media player application 560 can present the video assets to the user via a display 572 of device 550; and can concurrently present audio assets to the user via speaker 574 of device 550.
  • media player application 560 (alone or in conjunction with one or more additional applications) can present other asset types, such as haptic assets, via device 550 (e.g., via a motor or actuator 576 of device 550).
  • process 500 can incorporate various interactive behaviors; for example, media player application 560 can accept user input (e.g., via an input device of device 550) relating to process 500 and respond accordingly.
  • media player application 560 can select between two or more presentations of video content, such as described above with respect to FIGs. 3A and 3B and video content 310A/310B.
  • Media player application 560 can select one or more of a plurality of assets to be presented, decode the encoded data, and present the decoded data corresponding to the selected assets.
  • media player application 560 can identify an orientation of device 550, such as described above with respect to FIGs. 4A-4B; identify a desired aspect ratio based on that orientation; and select a video asset associated with an aspect ratio closest to that desired aspect ratio.
  • media player application 560 can identify which of the two video assets has the aspect ratio closest to the desired aspect ratio, and select that video asset for presentation to the user. In selecting a video asset for presentation, media player application 560 can decode the video data of data streams 540, identify the decoded video data that corresponds to the selected video asset, and present that decoded video data while disregarding data that corresponds to the unselected video asset. Media player application 560 can similarly identify a corresponding audio asset, or other type of asset, and concurrently present that asset along with the presentation of the selected video asset. This process can be extended to any suitable number and type of assets.
  • the process can be performed multiple times by media player application 560 during playback, for example on a frame-by-frame basis, as the user may continue to reorient device 550.
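A sketch of that per-frame selection loop follows; the frame containers and names are hypothetical. The point is that both decoded assets stay resident, so re-checking the desired aspect ratio on every frame is cheap.

```python
# Both decoded assets stay resident in memory; on each rendered frame the
# player re-checks the desired aspect ratio and presents the closer match,
# discarding the other asset's frame rather than re-fetching or re-encoding.
decoded = {
    16 / 9: ["L-frame-0", "L-frame-1", "L-frame-2"],   # landscape frames
    9 / 16: ["P-frame-0", "P-frame-1", "P-frame-2"],   # portrait frames
}

def present(frame_index: int, desired_ratio: float) -> str:
    closest = min(decoded, key=lambda r: abs(r - desired_ratio))
    return decoded[closest][frame_index]

print(present(0, 9 / 16))   # P-frame-0 (device held upright)
print(present(1, 16 / 9))   # L-frame-1 (device rotated mid-playback)
```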
  • This process, wherein media player application 560 selects one or more desired assets from a plurality of assets resident in memory, can carry a speed advantage over other solutions, such as selectively delivering assets to device 550; selecting a different asset for presentation need not require re-encoding an asset, or fetching the asset from a remote location, which can introduce unwanted delay and overhead. Instead, the newly selected asset is preloaded in memory and ready for immediate presentation.
  • FIGs. 6A-6D illustrate examples of data streams 540.
  • in the example shown in FIG. 6A, data streams 540 comprise three separate streams: a first stream 602 comprising a first video asset (e.g., a video asset in a portrait orientation); a second stream 604 comprising a second video asset (e.g., a video asset in a landscape orientation); and a third stream 606 comprising an audio asset (which can be associated with both the first video asset and the second video asset).
  • in the example shown in FIG. 6B, data streams 540 comprise two separate streams: a first stream 612 comprising a first video asset (e.g., a video asset in a portrait orientation) and a corresponding first audio asset (e.g., an audio asset for playback on a device in a portrait orientation); and a second stream 614 comprising a second video asset (e.g., a video asset in a landscape orientation) and a corresponding second audio asset (e.g., an audio asset for playback on a device in a landscape orientation).
  • in the example shown in FIG. 6C, data streams 540 comprise just a single stream 622, where stream 622 comprises a first video asset (e.g., a video asset in a portrait orientation), a second video asset (e.g., a video asset in a landscape orientation), and an audio asset (which can be associated with both the first video asset and the second video asset).
  • in the example shown in FIG. 6D, data streams 540 comprise just a single stream 632, where stream 632 comprises a first video asset (e.g., a video asset in a portrait orientation) and a corresponding first audio asset (e.g., an audio asset for playback on a device in a portrait orientation), and a second video asset (e.g., a video asset in a landscape orientation) and a corresponding second audio asset (e.g., an audio asset for playback on a device in a landscape orientation).
  • data streams 540 can be delivered to device 550, which can decode the underlying data and present it to a user as video content (e.g., video content 310A/310B described above with respect to FIGs. 3A-3B).
  • Data streams 540 can include one or more manifest files that can include metadata relating to the contents of the data stream, and that can include various instructions or parameters for decoding and presenting those contents.
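The disclosure does not specify a manifest format; purely as a hypothetical illustration, with all field names invented, such metadata might resemble:

```python
# Hypothetical manifest contents; every field name here is invented for
# illustration and is not taken from the disclosure.
manifest = {
    "assets": [
        {"id": "video_portrait",  "type": "video", "aspect_ratio": "9:16"},
        {"id": "video_landscape", "type": "video", "aspect_ratio": "16:9"},
        {"id": "audio_main",      "type": "audio",
         "applies_to": ["video_portrait", "video_landscape"]},
    ],
    # How the assets are packed in the stream (cf. FIGs. 6A-6D, 7A-7G).
    "packing": {"layout": "side_by_side",
                "rotation_deg": {"video_landscape": 90}},
}
```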
  • multiple assets are encoded (e.g., by encoder 530) in a single data stream.
  • This encoding can comprise interlacing the multiple assets; concatenating the multiple assets; and/or employing any other suitable technique.
  • encoding multiple video assets may comprise composing a video from respective time-matched frames of two or more input video assets.
  • a first frame of the composed video can comprise video data from a first frame of a first video asset alongside video data from a first frame of a second video asset.
  • Corresponding input frames can be scaled and positioned in the composed video, such as described further below; the composed video can be encoded (e.g., on a frame-by-frame basis) by an encoder, and the encoded data delivered to device 550 as described above.
  • Other suitable implementations are contemplated, and specific implementations of encoding multiple assets in a single data stream can vary depending on a codec used.
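A frame-level sketch of the composition step, assuming NumPy image arrays and inputs already scaled (or rotated) to equal heights, in the manner of FIG. 7A:

```python
import numpy as np

def compose_frame(frame_a: np.ndarray, frame_b: np.ndarray) -> np.ndarray:
    """Place time-matched frames from two assets side by side in one
    composed frame; both inputs must already share the same height."""
    assert frame_a.shape[0] == frame_b.shape[0], "scale frames to equal height first"
    return np.hstack([frame_a, frame_b])

portrait = np.zeros((1920, 1080, 3), dtype=np.uint8)        # 9:16 frame
landscape_rot = np.zeros((1920, 1080, 3), dtype=np.uint8)   # 16:9 frame rotated 90 degrees
composed = compose_frame(portrait, landscape_rot)
print(composed.shape)  # (1920, 2160, 3): height h1, width w1 + h2
```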
  • Data efficiencies can be realized by encoding multiple assets in a single data stream, such as data stream 622 shown in FIG. 6C or data stream 632 shown in FIG. 6D.
  • single stream delivery can incur less network overhead.
  • certain video codecs (e.g., H.264) can take advantage of data redundancies, such as regions of similar or identical video content in a data stream (e.g., in composed videos such as described above), to reduce file size.
  • combining data for multiple video assets in a single data stream can improve the likelihood of such redundancies being present in the data stream — particularly where, for example, two video assets present substantially similar versions of common video content — and result in greater compression ratios and overall reduced data size.
  • certain encoding schemes may result in lower overall data usage or may otherwise be preferable to certain other encoding schemes.
  • multiple encoding schemes may be used, for example to produce multiple data streams from which only a subset are delivered to device 550.
  • FIGs. 7A-7G illustrate various example encodings that can be used by encoder 530 to generate a data stream, such as data stream 622 or 632 described above, that includes both a first video asset and a second video asset.
  • media player application 560 can decode the data stream, select the desired video asset from the decoded data stream, and render that video asset to a display while disregarding the unselected video asset.
  • audio assets or other assets may be encoded with the first and second video assets in the data streams shown in the figures.
  • the examples shown in FIGs. 7A-7F can include composed videos, such as described above, where the composed videos comprise time-matched frames from two or more video assets; the frames are scaled and positioned in the composed video, which is encoded by encoder 530.
  • in the example shown in FIG. 7A, a composed video 710 comprises a first video asset 712 and a second video asset 714, where the height h1 of the first video asset 712 (e.g., portrait orientation) equals the width w2 of the second video asset 714 (e.g., landscape orientation).
  • the first and second video assets may have inverse aspect ratios, such as 16:9 and 9:16.
  • encoder 530 can generate data stream 622 by encoding a version of the second video asset 714 that is rotated 90 degrees with respect to the first video asset 712, such that the rotated asset 714 can be positioned horizontally adjacent to asset 712 as shown in the figure.
  • in the example shown in FIG. 7B, a composed video 720 comprises a first video asset 722 and a second video asset 724, where the width w1 of the first video asset 722 (e.g., portrait orientation) equals the width w2 of the second video asset 724 (e.g., landscape orientation).
  • the second video asset 724 may be scaled down from an original size, such that its width w2 equals w1; the overall data size of the data stream may be reduced in this configuration.
  • encoder 530 can generate data stream 622 or 632 by positioning the first video asset 722 vertically adjacent to the second video asset 724 as shown in the figure and encoding the composed video.
  • in the example shown in FIG. 7C, a composed video 730 comprises a first video asset 732 and a second video asset 734, where the height h1 of the first video asset 732 (e.g., portrait orientation) equals the height h2 of the second video asset 734 (e.g., landscape orientation).
  • the first video asset 732 may be scaled down from an original size, such that its height h1 equals h2; the overall data size of the data stream may be reduced in this configuration.
  • encoder 530 can generate data stream 622 or 632 by positioning the first video asset 732 horizontally adjacent to the second video asset 734 as shown in the figure and encoding the composed video.
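The arithmetic behind these three packing choices can be compared directly. The sketch below uses assumed 1080x1920 portrait and 1920x1080 landscape inputs to show why the scaled layouts of FIGs. 7B and 7C can yield a smaller composed frame than the rotate-and-abut layout of FIG. 7A.

```python
# Composed-frame dimensions for three packing schemes, using 1080x1920
# portrait (w1 x h1) and 1920x1080 landscape (w2 x h2) assets as inputs.
w1, h1 = 1080, 1920   # portrait asset
w2, h2 = 1920, 1080   # landscape asset

# FIG. 7A style: rotate the landscape asset 90 degrees (so h1 == w2) and
# place it beside the portrait asset -> (w1 + h2) x h1.
fig_7a = (w1 + h2, h1)

# FIG. 7B style: scale the landscape asset down so its width equals w1 and
# stack vertically -> the scaled landscape frame is w1 x round(h2 * w1 / w2).
fig_7b = (w1, h1 + round(h2 * w1 / w2))

# FIG. 7C style: scale the portrait asset down so its height equals h2 and
# place side by side -> the scaled portrait frame is round(w1 * h2 / h1) x h2.
fig_7c = (round(w1 * h2 / h1) + w2, h2)

for name, (w, h) in [("7A", fig_7a), ("7B", fig_7b), ("7C", fig_7c)]:
    print(name, f"{w}x{h}", w * h, "pixels")
# 7A 2160x1920 4147200 pixels
# 7B 1080x2528 2730240 pixels   (landscape scaled to 1080x608)
# 7C 2528x1080 2730240 pixels   (portrait scaled to 608x1080)
```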
  • in the example shown in FIG. 7D, frames of a first video asset (e.g., frames 741, 743, 745 in portrait orientation) are interlaced with frames of a second video asset (e.g., frames 742, 744, 746 in landscape orientation), such that media player application 560 can present either the first video asset or the second video asset by de-interlacing the frames and presenting those frames corresponding to the desired video asset.
  • frames of the second video asset can be rotated 90 degrees with respect to frames of the first video asset, for spatial consistency with frames of the first video asset.
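A sketch of the interlace/de-interlace round trip; the even/odd frame assignment is an assumption, and the frame labels echo FIG. 7D.

```python
def interlace(frames_a, frames_b):
    """Alternate time-matched frames from two assets into one sequence:
    a0, b0, a1, b1, ... (cf. frames 741-746 in FIG. 7D)."""
    out = []
    for fa, fb in zip(frames_a, frames_b):
        out.extend([fa, fb])
    return out

def deinterlace(frames, want_first: bool):
    """Recover one asset by taking every other frame."""
    return frames[0::2] if want_first else frames[1::2]

stream = interlace(["741", "743", "745"], ["742", "744", "746"])
print(deinterlace(stream, want_first=True))    # ['741', '743', '745'] portrait
print(deinterlace(stream, want_first=False))   # ['742', '744', '746'] landscape
```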
  • in the example shown in FIG. 7E, a composed video 750 comprises a first video asset 752 (e.g., in portrait orientation) and a second video asset (e.g., 754, 756, 758).
  • the second video asset can be scaled to various reduced sizes.
  • video assets 756 and 758 illustrate video assets of increasingly reduced size with respect to 754.
  • FIG. 7F shows an example data stream featuring a composed video 760, in which the first video asset 762 is a full-size video asset in a landscape orientation, and reduced-size video assets 764, 766, and 768 are in a portrait orientation.
  • video assets may be rotated or oriented as desired to obtain network efficiencies or meet other requirements.
  • the portrait orientation video asset 712 may be oriented at a first angle (e.g., zero degrees), while the landscape orientation video asset 714 may be oriented at a second angle (e.g., 90 degrees with respect to the first angle).
  • Orienting a first video asset at a non-zero angle with respect to a second video asset can permit the first video asset and the second video asset to be encoded in a single frame that meets desired dimensional requirements.
  • the second video asset 714 (with width w2 and height h2) can be oriented at a 90 degree angle with respect to the first video asset 712 (with width w1 and height h1), such that the combined frame can be encoded as a rectangle with a height of h1 and a width of w1 + h2.
  • the video assets may be oriented at any suitable angle (including a zero angle) with respect to each other.
  • a first video asset may be oriented at a same angle as a second video asset. This may be necessary when, for example, technical requirements demand that a video asset be encoded with a particular orientation. For instance, a video playback application may require that a video asset be oriented at a particular angle in order to present that video asset to a viewer. In some cases, DRM or other content protection mechanisms may require encoding of a video asset in a particular orientation.
  • FIG. 7G shows an example data stream in which a composed video 770 comprises a first video asset 772 and a second video asset 774.
  • second video asset 774 may be a landscape orientation video asset that is oriented at a zero angle (i.e., not rotated) with respect to first video asset 772, a portrait orientation video asset.
  • the first video asset 772 has a width w1 and a height h1; and the second video asset 774 has a width w3 and a height h2.
  • the first video asset 772 and the second video asset 774 are positioned in an “L” shape, in which the first video asset 772 and the second video asset 774 share a single common boundary (in the example, the width w1 of first video asset 772, which is collinear with the width w3 of the second video asset 774).
  • the first video asset 772 and the second video asset 774 need not share any common linear dimensions. For example, as shown in the figure, width w1 of the first video asset 772 is shorter than width w3 of the second video asset 774; and height h1 of the first video asset 772 is longer than height h2 of the second video asset 774.
  • the first video asset 772 and the second video asset 774 are encoded together in a rectangular frame having height h3 (i.e., h1 + h2) and width w3.
  • the lack of a common linear dimension differs from some examples shown above: for instance, in example data stream 710 in FIG. 7A, landscape orientation video asset 714 can be rotated 90 degrees to share a boundary (i.e., w2 and h1) coextensive with portrait orientation video asset 712. And in example data stream 720 in FIG. 7B, a resolution of landscape orientation video asset 724 can be reduced with respect to portrait orientation video asset 722 such that landscape orientation video asset 724 shares a boundary (i.e., w1 and w2) coextensive with portrait orientation video asset 722.
  • the example encoding shown in FIG. 7G results in a region 776 of composed video 770 that does not belong to either the first video asset 772 or the second video asset 774.
  • region 776 can be used to encode additional data (e.g., one or more additional video assets); but in some examples, region 776 may be a region of unutilized space.
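The FIG. 7G arithmetic can be made concrete as follows; the specific dimensions are illustrative, chosen only to satisfy the stated constraints (w1 shorter than w3, h1 longer than h2).

```python
# Example dimensions (illustrative only): portrait asset w1 x h1 and
# landscape asset w3 x h2, packed in an "L" shape per FIG. 7G.
w1, h1 = 608, 1080    # portrait asset
w3, h2 = 1920, 1080   # landscape asset

# The composed rectangle is w3 wide and h3 = h1 + h2 tall.
h3 = h1 + h2
frame_pixels = w3 * h3

# Region 776: the part of the rectangle covered by neither asset.
used = w1 * h1 + w3 * h2
unused = frame_pixels - used
print(f"frame {w3}x{h3}, unused region {w3 - w1}x{h1} = {unused} pixels")
# frame 1920x2160, unused region 1312x1080 = 1416960 pixels
```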
  • Certain encoding schemes may be more desirable than others depending on variables such as network conditions; user habits; or the type of content to be presented.
  • in some examples, machine learning techniques (e.g., neural network techniques) or probabilistic models can be used to identify and predict which encoding scheme is preferable in a particular circumstance.
  • video assets are encoded adjacent to one another, e.g., in a data stream.
  • adjacent video assets may share a common pixel boundary.
  • adjacent video assets may be separated by an insubstantial region of space.
  • adjacent video assets may be separated by a row or column of 1 or 2 pixels.
  • aspects of the disclosure can be applied to audio-only assets (e.g., music, podcasts) that may not have a video component. Further, the disclosure can be applied to assets comprising still images, GIF files, or other suitable types of media.
  • a user of a display device may switch between a first video asset, e.g., a video in a portrait orientation, and a second video asset, e.g., a video in a landscape orientation, by rotating the display device.
  • a data stream may include both a first video asset and a second video asset, such as described above.
  • Playback of a video asset can comprise a user interface with a “scrubber” element for graphically representing temporal locations of the video asset (e.g., via a position of a playback bead).
  • a scrubber element can depict a video asset as a line, with a first time (e.g., the beginning) of the video asset at one end of the line, and a second time (e.g., the end) of the video asset at the other end of the line.
  • a user can change the frame of a video, or a time corresponding to a playback position, by dragging a slider or playhead across a scrubber bar or timeline. (The timeline may, but need not, be represented as a straight line.)
  • on touch-sensitive devices, a user can manipulate the scrubber using touch controls; for example, dragging a playhead along a timeline with one’s finger on a touch screen can move the playback position accordingly.
  • the user can manipulate the scrubber using a mouse, a remote control, a gamepad, or another suitable input device.
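At its core, a scrubber is a linear mapping between a position along the bar and a playback time. A sketch, with illustrative names; the worked numbers also show why bar length matters for fine control, a point elaborated below.

```python
def position_to_time(pos_px: float, bar_len_px: float, duration_s: float) -> float:
    """Map a playhead position along the scrubber bar to a playback time."""
    frac = min(max(pos_px / bar_len_px, 0.0), 1.0)  # clamp drags past the ends
    return frac * duration_s

def time_to_position(t_s: float, duration_s: float, bar_len_px: float) -> float:
    """Inverse mapping: where to draw the slider for the current time."""
    return (t_s / duration_s) * bar_len_px

# A 300 px bar over a 20-minute video: each pixel of drag covers 4 s of
# video, which is why a too-short bar frustrates fine adjustments.
print(position_to_time(150, 300, 1200))   # 600.0 (halfway)
print(time_to_position(600, 1200, 300))   # 150.0
```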
  • the usability of a scrubber can depend on various factors. For example, it is desirable that a scrubber be easily accessible (e.g., via one’s dominant thumb while holding a touch screen device with the dominant hand). It is further desirable that the scrubber not overlap or obscure important information on a display screen. However, it is also desirable that the scrubber be sufficiently large to allow for fine and coarse inputs; a scrubber that is too small may frustrate a user’s ability to make fine adjustments to the playhead, particularly when viewing lengthy video assets.
  • a one-size-fits-all scrubber design may be unsuitable.
  • a scrubber that is sufficiently large, easily accessible to the user, and does not overlap important video content when viewing a portrait orientation video may suddenly become unsuitable when the user switches to a landscape orientation video; for example, the landscape orientation video may place important information in a different location on the display, and if the user holds a mobile device in a landscape orientation, the scrubber may no longer be within easy reach of the user.
  • it is therefore desirable that a scrubber tool comprise an adaptive scrubber; that is, a scrubber whose appearance or functionality may adapt to the video asset being played, and/or to the orientation of the display device. This can be particularly desirable when a user (e.g., of a mobile display device) wishes to switch between two video modes or video assets (e.g., portrait and landscape), such as described above, and control them with a common scrubber tool.
  • an adaptive scrubber may adopt one or more first characteristics (e.g., appearance, display location, dimensions, functionality) when viewing a first video asset or a first video mode, and may adopt one or more second characteristics when viewing a second video asset or a second video mode.
  • a video asset corresponding to a portrait orientation may be associated with a playback interface or an adaptive scrubber having a vertically oriented layout.
  • a video asset corresponding to a landscape orientation may be associated with a playback interface or an adaptive scrubber having a horizontally oriented layout.
  • the adaptive scrubber can adopt an appearance or functionality suited to the video at hand. This may enhance ease of use of the scrubber and enhance the overall viewing experience for the user.
  • other aspects of video playback may adapt to the video asset being played; one example is adaptive closed captioning, where the location and/or the content of the closed captioning may differ between a first video asset and a second video asset.
  • FIGs. 9A-9C illustrate examples of a user interface including an adaptive scrubber being presented on a display of a mobile device.
  • a device 900 may display an adaptive scrubber 910, which may include a scrubber bar 902 having a start position 906 and an end position 908.
  • the scrubber bar 902 may include a slider or playhead 904.
  • the slider 904 may move along the scrubber bar 902 to indicate the playback time that has elapsed. For example, at the beginning of a video, the slider may be positioned along the scrubber bar 902 at the start position 906. As the video plays, the slider will move along the scrubber bar 902 corresponding to the amount of time the video has played.
  • the slider 904 can be positioned at the end position 908.
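As a sketch, the slider’s position can be computed as the elapsed fraction of the asset’s duration, clamped to the bar’s start and end positions; the method and units below are illustrative assumptions.

```java
// Illustrative: map elapsed playback time to a slider position along the
// scrubber bar, where 0.0 is the start position and 1.0 is the end position.
class SliderPosition {
    static double fraction(double elapsedSeconds, double durationSeconds) {
        if (durationSeconds <= 0) return 0.0;
        // Clamp so the slider never overshoots the start or end position.
        return Math.max(0.0, Math.min(1.0, elapsedSeconds / durationSeconds));
    }

    public static void main(String[] args) {
        // Halfway through a 10-minute video, the slider sits at the midpoint.
        System.out.println(fraction(300, 600)); // 0.5
    }
}
```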
  • the slider 904 may have a timecode 918 positioned next to the slider 904.
  • the timecode 918 may be positioned to the left of the slider 904.
  • the timecode may be positioned to the left of, below, or above the slider 904.
  • the timecode 918 may move with the slider to maintain the same relative position to the slider 904.
  • the adaptive scrubber may also include playback controls 912.
  • the playback controls 912 may include a fast-forward, a rewind, and a play/pause icon. Selecting an icon can have a corresponding effect on the video as well as the position of the slider 904 on the scrubber bar 902. For example, selecting (e.g., clicking or tapping) the play/pause icon on the playback controls 912 may cause a playing video to pause.
  • the slider 904 can remain in place along the scrubber bar 902 at the time the selection was made, i.e., playback of the currently displayed video asset will stop. If the play command is selected, the presented video asset will resume playing at real-time speed.
  • Selecting the fast-forward icon on the playback controls 912 may cause the video to fast-forward, i.e., advance the current video asset at a rate quicker than real-time.
  • the slider 904 can similarly advance along the scrubber bar 902 at a rate quicker than real-time corresponding to the video.
  • the rewind icon of the playback controls 912 may be used to rewind the video, i.e., present frames of the selected video asset in a reverse order starting from the current frame.
  • the rewind icon may have multiple speeds as described with respect to the fast-forward icon.
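The play/pause, fast-forward, and rewind behavior described above can be modeled as a signed playback rate that also drives the slider along the scrubber bar. The sketch below is illustrative; the specific 2x speeds are assumptions, not values from the disclosure.

```java
// Illustrative playback-rate model: play advances at real time, fast-forward
// quicker than real time, rewind presents frames in reverse order.
class PlaybackControls {
    private double rate = 0.0; // 0 = paused, 1 = real-time play

    void playPause() { rate = (rate == 0.0) ? 1.0 : 0.0; }
    void fastForward() { rate = 2.0; }  // example speed; could step through 2x/4x/8x
    void rewind() { rate = -2.0; }      // negative rate = frames in reverse order

    // The slider advances (or retreats) along the scrubber bar accordingly.
    double nextTime(double currentSeconds, double dtSeconds) {
        return currentSeconds + rate * dtSeconds;
    }
}
```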
  • the user interface elements associated with the adaptive scrubber may be positioned based on the video asset currently being played by the device, e.g., may depend on the orientation of the device.
  • the device 900 is oriented with its display in a portrait orientation, which may correspond to a first video asset presenting video content on the display in a vertical layout.
  • the adaptive scrubber 910 may have a corresponding vertical layout on the display such that the length of the scrubber bar 902A is parallel to the longest side of the device 900.
  • a user may rotate the device such that the device 900 is oriented with its display in a landscape orientation, which may trigger the device to play a second video asset presenting video content with a horizontal layout, as shown in FIG. 9B.
  • the scrubber bar 902B is similarly rotated to have a corresponding horizontal layout such that the scrubber bar 902B is parallel to the longest side of the device 900.
  • Device 900 can be freely rotated between the portrait orientation and the landscape orientation, and the adaptive scrubber 910 will be presented accordingly.
  • rotating the device between the first video asset, e.g., portrait orientation, and the second video asset, e.g., landscape orientation, may affect the position or orientation of user interface elements of the adaptive scrubber.
  • These can include the timecode 918 and playback controls 912, and additional user interface elements described below (e.g., FIGs. 12A-12D, and 13A-13D).
  • the device 900 is playing the first video asset in portrait orientation such that the scrubber bar 902 is on the right side of the display and the time code 918 is positioned to the left of the slider 904 on the scrubber bar 902.
  • when the device 900 is rotated to the orientation shown in FIG. 9B, the device may play the second video asset in landscape orientation such that the scrubber bar 902 is positioned at the bottom of the display and the time code 918 is positioned below the slider 904 on the scrubber bar 902.
  • the time code may be positioned above the slider.
  • the playback controls 912 may similarly change position depending on an orientation of the display device (e.g., portrait or landscape), which may correspond to the video asset being presented, such as described above.
  • the device 900 is playing a second video asset in landscape orientation with the playback controls 912 positioned at the bottom of the display.
  • the device may play the first video asset in portrait orientation, such that the playback controls 912 are positioned to the left of the scrubber bar 902 in a vertical configuration.
  • the adaptive scrubber 910A may have a vertical layout on the display such that the scrubber bar 902A is located on a right side of the display. This layout may be preferable if the user is right-hand dominant.
  • while the device 900 is showing the first video asset in a portrait orientation, the adaptive scrubber 910C may have a vertical layout on the display such that the scrubber bar 902C is located on a left side of the display. This layout may be preferable if the user is left-hand dominant.
  • a user may indicate a preference for having the scrubber bar located on either the right or left side of the display.
  • a handheld device may detect a location of a hand of the user grasping the device to automatically determine which side of the screen to position the scrubber bar.
  • the device 1000 may detect that a left hand 1016A of the user is grasping the device and position the scrubber bar 1002 on the left side of the display of the device 1000. This may allow a user to manipulate the position of the slider 1004 and playback controls 1012 with the thumb of the left hand 1016A.
  • the angle or orientation of the device 1000 may be used to determine which hand is grasping the phone.
  • the scrubber bar may be positioned on the right side of the display of the device. For example, a user may prefer to grasp the device with their left hand and manipulate the adaptive scrubber with the right hand.
  • the device 1000 may detect that a right hand 1016B of the user is grasping the device and position the scrubber bar 1002 on the right side of the display of the device 1000. This may allow a user to manipulate the position of the slider 1004 and playback controls 1012 with the thumb of the right hand 1016B.
  • the scrubber bar may be positioned on the left side of the display of the device. For example, a user may prefer to grasp the device with their right hand and manipulate the adaptive scrubber with the left hand.
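A hedged sketch of this placement decision follows. It assumes some external grip detector (e.g., derived from the device angle, as noted above) and treats both the explicit user preference and the same-side-versus-opposite-side choice as configurable, since the examples above describe both behaviors.

```java
// Illustrative: choose which side of the display hosts the scrubber bar,
// from an explicit user preference or a detected grasping hand.
enum Side { LEFT, RIGHT }
enum Hand { LEFT, RIGHT, UNKNOWN }

class ScrubberSidePolicy {
    static Side choose(Side userPreference, Hand graspingHand, boolean sameSideAsGrip) {
        if (userPreference != null) return userPreference; // explicit preference wins
        if (graspingHand == Hand.UNKNOWN) return Side.RIGHT; // assumed default
        boolean leftGrip = (graspingHand == Hand.LEFT);
        // Same side lets the grasping thumb reach the bar; the opposite side
        // suits users who scrub with the free hand.
        return (leftGrip == sameSideAsGrip) ? Side.LEFT : Side.RIGHT;
    }
}
```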
  • locations of the adaptive scrubber user interface elements may depend on the content of the video asset being played.
  • the adaptive scrubber user interface elements may be presented in areas of the display that do not correspond to, or overlap with, a focal region of the video asset.
  • a focal region may be a region of interest, such as an area of the screen to which a creative entity would expect a user’s gaze to be directed.
  • the focal region for a video may correspond to an area where a person is delivering dialogue.
  • the video content 310A includes a television news-style program with the host 322A positioned near the right side of the display.
  • the focal region may correspond to the location of the host 322A.
  • An adaptive scrubber according to this example may be presented on the left side of the display to avoid obstructing the focal region; visual elements 320A and 324A could be repositioned accordingly.
  • Another example of a focal region may include a region where there is relative movement of the same object between frames of the video asset, e.g., an explosion or water running in a river. This concept of positioning the adaptive scrubber in regions other than the focal region may also apply to adaptive closed captioning. In some examples, focal regions may be determined autonomously, such as by a playback application; for instance, face detection techniques could be used to identify an actor’s face as a focal region.
  • one or more focal regions can be manually inserted, such as by a film director, or by an end user.
  • Focal region data can be included in metadata associated with a corresponding video asset.
  • focal regions may change location over the length of a video, with the adaptive scrubber being continually repositioned throughout the video to avoid overlapping with the focal regions.
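One way to realize focal-region avoidance, assuming focal regions are delivered as timed rectangles in the asset’s metadata, is to test candidate scrubber placements against the focal regions active at the current playback time. The sketch below is a simplification; the names and the candidate-list approach are assumptions, not from the disclosure.

```java
// Illustrative: reposition the scrubber so it does not overlap a focal
// region. Focal regions are assumed to arrive as rectangles (active at the
// current playback time) in the video asset's metadata.
import java.awt.Rectangle;
import java.util.List;

class FocalRegionAvoidance {
    // Candidate scrubber placements, tried in order of preference.
    static Rectangle place(List<Rectangle> focalRegionsNow, List<Rectangle> candidates) {
        for (Rectangle candidate : candidates) {
            boolean clear = focalRegionsNow.stream().noneMatch(candidate::intersects);
            if (clear) return candidate; // first placement clear of all focal regions
        }
        return candidates.get(candidates.size() - 1); // fall back to the last option
    }
}
```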
  • FIGs. 11A-11B illustrate an example adaptive scrubber layout that includes chapter heads 1114A, 1114B, and 1114C, positioned along the scrubber bar 1102.
  • the device 1100 is in a portrait orientation and displaying a first video asset (e.g., video in a portrait orientation) and a corresponding adaptive scrubber 1110 with a vertical layout.
  • the location of the chapter heads, 1114A, 1114B, and 1114C, along the scrubber bar 1102 corresponds to the time at the start of the respective chapter.
  • The example shown in FIG. 11A includes four chapters: a chapter starting at the start position 1106, a second chapter starting at chapter head 1114A, a third chapter starting at a chapter head 1114B, and a fourth chapter starting at 1114C. In other examples, there may be more or fewer than four chapters in a video.
  • a title of the chapter may display next to the chapter head when hovering over the chapter head with a finger, stylus, or other pointer device.
  • the chapter head icon, e.g., 1114A-1114C, may be rectangular, oval, circular, triangular, or any shape that a user may easily recognize without confusing the chapter head for the slider.
  • the chapter head may be the same shape as the slider 1104, but a different color. In some examples, the start position 1106 may include a chapter head icon.
  • Rotating the device 1100 from a portrait orientation in FIG. 11A to a landscape orientation in FIG. 11B may result in the video asset switching from a first video asset to the second video asset, such as described above.
  • the device 1100 is in landscape orientation displaying a second video asset (e.g., landscape orientation) with the adaptive scrubber 1110 oriented horizontally.
  • the adaptive scrubber 1110 includes chapters 1114A, 1114B, and 1114C located along a horizontally oriented scrubber bar 1102.
  • the chapters may indicate different sections of the video asset. According to one example, if the video is a news show, the chapters may correspond to different news segments. According to another example, if the video is a movie, the chapters may correspond to shorter narrative arcs within the movie and/or suggest a recommended location for a viewer to pause while watching the video. In some examples, a creative entity and/or distributor of the content may determine the placement of the chapters. In some examples, a user may create their own chapters, as described below with respect to bookmarking and FIGs. 13A-13D.
  • the adaptive scrubber corresponding to a first video asset (e.g., having a portrait orientation) may be different than the adaptive scrubber corresponding to a second video asset (e.g., having a landscape orientation, as described above, e.g., FIGs. 3A-3B).
  • the chapter locations may differ between the first and second video assets, corresponding to different times along the scrubber bar to complement differences between the first and second video assets.
  • the adaptive scrubber corresponding to a first video asset may have a different number of chapters than the adaptive scrubber corresponding to a second video asset.
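A sketch of how chapter heads could be laid out along the scrubber bar: each head sits at the fraction of the asset’s duration where its chapter starts, so the same chapter data can be drawn on a vertical or a horizontal bar. The numbers in the example are illustrative only.

```java
// Illustrative: chapter heads are drawn along the scrubber bar at the
// fraction of the asset's duration where each chapter starts.
class ChapterHeads {
    static double[] positions(double[] chapterStartSeconds, double durationSeconds) {
        double[] fractions = new double[chapterStartSeconds.length];
        for (int i = 0; i < fractions.length; i++) {
            fractions[i] = chapterStartSeconds[i] / durationSeconds;
        }
        return fractions; // 0.0 = start position, 1.0 = end position
    }

    public static void main(String[] args) {
        // Four chapters in a 40-minute news show, e.g., one per news segment.
        double[] f = positions(new double[] {0, 600, 1200, 1800}, 2400);
        for (double v : f) System.out.println(v); // 0.0, 0.25, 0.5, 0.75
    }
}
```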
  • FIGs. 12A-12D illustrate examples of an adaptive scrubber 1210 layout that includes social media and sharing capabilities.
  • a device 1200 displays a first video asset in a portrait orientation and a corresponding adaptive scrubber 1210 oriented vertically, including a scrubber bar 1202 having a start position 1206 and an end position 1208.
  • the scrubber bar 1202 may include a slider or playhead 1204.
  • the slider 1204 may move along the scrubber bar 1202 to indicate the playback time that has elapsed.
  • the adaptive scrubber may also include playback controls 1212.
  • the playback controls 1212 may include a fast-forward, a rewind, and a play/pause icon.
  • the adaptive scrubber 1210 may also include a plurality of social media icons or virtual buttons 1216 that may be associated with, for example, a social networking platform (e.g., Facebook, Pinterest, Twitter, Tumblr, etc.), or a messaging service (e.g., SMS text, WhatsApp).
  • the plurality of social media icons 1216 may be positioned near the playback controls 1212 at the bottom of the display.
  • the social media icons may be arranged vertically to the right or left of the vertically oriented adaptive scrubber.
  • the adaptive scrubber 1210 may similarly transition (e.g., rotate) to be oriented horizontally.
  • the scrubber bar 1202 is located near the bottom of the display with the playback controls 1212 and social media icons 1216 positioned beneath the scrubber bar 1202.
  • the social media icons may be positioned above the scrubber bar 1202.
  • a user that desires to share a link to the video asset playing on a device 1200 may select one of the social media icons 1216.
  • a user may select a share icon.
  • a pop-up 1230 may appear on the screen corresponding to options associated with the share icon 1216C.
  • the pop-up 1230C may include one or more messaging or social media icons 1232 that correspond to a messaging or social networking app or service (e.g., SMS text, WhatsApp, Instagram, etc.) that can be used to share the video asset link.
  • the pop-up 1230 may also include a dialog box 1234 where a user can identify a recipient (e.g., a user of a social networking platform) and/or type a message to the recipient.
  • the pop-up 1230 may also include a timer 1236 corresponding to the current playback time of the video. In this manner, a user may send a link to an exact location in the playback of the video. The user may use the playback controls 1212 to change the location in the video to send to the intended recipient. In some examples the user may prefer to send a link to the video from the beginning, without specifying a time, and may de-select the timer to do so.
  • FIG. 12D shows an alternate layout for the pop-up 1230D.
  • Pop-up 1230D may, for example, include the playback controls 1212D located in the pop-up 1230D near the messaging icons 1232 for ease of use.
  • the content of the pop-up can be tailored to the social media icon selected. For example, if a user selects the Facebook icon, the user may be given the option to post on their personal wall, a friend’s wall, or send the link over Facebook Messenger.
  • the link sent through the social media capabilities may require the intended recipient to watch the same video asset that the user was watching before sending the link. For instance, if the user is watching a first video asset in portrait orientation and sends a link, the intended recipient can receive a link to the same video asset, e.g., to watch the video in portrait orientation. In this manner, the user can ensure that the intended recipient will have a similar experience with the video content by watching the video in the same orientation.
  • the intended recipient may have an option to watch the same or a different video asset.
  • an intended recipient may receive a link to both video assets, and depending on the detected orientation of the device, the appropriate video asset will play, e.g., if the device is in a portrait orientation, the video asset corresponding to the portrait orientation will play.
  • a user may rotate the device during playback and the video asset should switch accordingly, e.g., from a first video asset in portrait orientation to a second video asset in landscape orientation.
  • the message accompanying the link may suggest a preferred orientation.
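A minimal sketch of link construction consistent with the sharing behavior described above: the link can name both video assets so the recipient’s device can pick the one matching its orientation, and can optionally carry the current playback time (omitted when the timer is de-selected). The URL scheme and parameter names are hypothetical.

```java
// Illustrative: build a share link that names both video assets and,
// optionally, the current playback time. The URL scheme is hypothetical.
class ShareLink {
    static String build(String portraitAssetId, String landscapeAssetId,
                        Double playbackSeconds) {
        String url = "https://example.com/watch?p=" + portraitAssetId
                   + "&l=" + landscapeAssetId;
        if (playbackSeconds != null) {
            url += "&t=" + playbackSeconds.intValue(); // de-selected timer omits this
        }
        return url;
    }
    // On the recipient's device, the player would pick the asset matching
    // the detected display orientation, and switch if the device is rotated.
}
```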
  • FIGs. 13A-13D illustrate an example adaptive scrubber 1310 layout that includes bookmarking capabilities.
  • a device 1300 displays a first video asset in portrait orientation and a corresponding adaptive scrubber 1310 oriented vertically, including a scrubber bar 1302 having a start position 1306 and an end position 1308.
  • the scrubber bar 1302 may include a slider or playhead 1304.
  • the slider 1304 may move along the scrubber bar 1302 to indicate the playback time that has elapsed.
  • the adaptive scrubber may also include playback controls 1312.
  • the playback controls 1312 may include a fast-forward, a rewind, and a play/pause icon.
  • the playback controls may be arranged as described with respect to FIGs. 9A or 9C.
  • the adaptive scrubber 1310 may also include a plurality of social media icons or virtual buttons 1316.
  • the social media icons may include, for example, Facebook, Pinterest, Twitter, Tumblr, and the like.
  • the adaptive scrubber 1310 may also include a bookmarking icon 1348. As shown in FIG. 13A, the bookmarking icon 1348 may be positioned near the plurality of social media icons 1316 with the playback controls 1312 at the bottom of the display. The bookmarking icon may be located in other areas, for example near the start position 1306, the end position 1308, or the slider 1304.
  • the adaptive scrubber user interface elements including, but not limited to, the scrubber bar 1302, slider 1304, playback controls 1312, social media icons 1316, and bookmarking icon 1348 may change position as described above (e.g., FIGs. 12A and 12B).
  • a user of the device 1300 may select the bookmarking icon 1348.
  • selecting the bookmarking icon 1348 may cause a pop-up 1330B to appear.
  • the pop-up 1330B may be positioned adjacent to the slider 1304.
  • the pop-up 1330B may include a dialog box 1334B where a user can type a note or comment about the bookmark.
  • the pop-up 1330B may include a timer 1336B corresponding to the current playback time of the video.
  • the user may use the playback controls 1312 to change a location along the scrubber bar 1302, so that the user may bookmark the precise time in the video.
  • FIG. 13C shows an alternate layout for the pop-up 1330C.
  • Pop-up 1330C may, for example, include the playback controls 1312C located in the pop-up 1330C for ease of use.
  • once a bookmark is saved, a bookmark icon 1346D may appear, corresponding to the location of the bookmark along the scrubber bar 1302D.
  • bookmarks may be saved with respect to a particular video asset. For example, if a user is watching a first video asset and saves a bookmark while in the first video asset, then the bookmark may be saved to the first video asset and not the second. In this example, the bookmark would not appear on the scrubber bar corresponding to the second video asset.
  • the bookmark may be saved to the second video asset.
  • the bookmark would not appear on the scrubber bar corresponding to the first video asset.
  • bookmarks saved by the user will be saved to both the first video asset and the second video asset, e.g., horizontal and vertical orientations of the adaptive scrubber.
  • the bookmark icon 1346 may be rectangular, oval, circular, triangular, or any shape that a user may easily recognize without confusing the bookmark icon 1346 for a chapter head or the slider 1304.
  • the bookmark icon may be the same shape as the chapter head or slider, but be a different color.
  • the bookmarks may have a similar function as chapters and use the same icon as a chapter head.
  • a user may have an option to designate a selected time location as a bookmark or a chapter.
  • the chapter icon may work as described above with respect to the bookmarking icon.
  • One skilled in the art would understand that a user may save multiple bookmarks for a video asset.
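The bookmark behavior described above can be sketched as a small store that keys each bookmark (a time plus an optional note) to one or both video assets, so a bookmark appears only on the scrubber bar(s) of the asset(s) it was saved to. Names are illustrative, and the sketch assumes Java 16+ for records.

```java
// Illustrative bookmark store: a bookmark records a time and a note, and is
// keyed to the asset(s) it belongs to, so it can appear on one scrubber bar,
// the other, or both.
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class BookmarkStore {
    record Bookmark(double timeSeconds, String note) {}
    private final Map<String, List<Bookmark>> byAsset = new HashMap<>();

    void save(Bookmark b, String... assetIds) { // pass one asset id, or both
        for (String id : assetIds) {
            byAsset.computeIfAbsent(id, k -> new ArrayList<>()).add(b);
        }
    }

    List<Bookmark> forAsset(String assetId) {
        return byAsset.getOrDefault(assetId, List.of());
    }
}
```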
  • the adaptive scrubber may include a movable scrubber bar.
  • FIGs. 14A-14B illustrate examples of a movable scrubber bar 1402.
  • FIG. 14A illustrates a device 1400 displaying a first video asset in a portrait orientation with the scrubber bar 1402 of adaptive scrubber 1410 positioned vertically at a first position on the right side of the display.
  • the device 1400 may detect a first location of touch 1420A by the user along the scrubber bar 1402.
  • the user may drag the scrubber bar 1402 to a second position on the display by moving the first location of touch 1420A to a second location of touch 1420B, as shown in FIG. 14B.
  • the device may require that the first location of touch 1420A be maintained for a predetermined amount of time, e.g., two seconds, and/or that the user apply a predetermined amount of pressure before the scrubber bar 1402 can be dragged.
  • the scrubber bar 1402 may stay put once the scrubber bar has been dragged to a new location. For example, referring to FIG. 14B, a user may drag the scrubber bar 1402 to a second position corresponding to the second location of touch 1420B. Once the user removes a finger or stylus from the device, the scrubber bar 1402 can remain in the second position.
  • the playback controls may automatically move to avoid overlap between the scrubber bar 1402 and the location of the playback controls 1412. For example, referring to FIG. 14B, the playback controls 1412 may be off-centered to the left side of the device 1400 instead of centered on the centerline 1422 of the device.
  • a button of the playback controls may be located on a right side of the scrubber bar while the remaining buttons may be located on a left side of the scrubber bar.
  • the rewind and play icons may be to the left of the scrubber bar while the fast-forward icon may be to the right of the scrubber bar.
  • the playback controls may remain in the original location despite the overlap.
  • the scrubber bar 1402 may snap to a location depending on the final location of touch (e.g., the location of touch preceding the user removing a finger or stylus from the screen).
  • the second location of touch is to the right of the centerline 1422 of the device 1400, thus, the scrubber bar 1402 may snap back to the right side of the display of the device 1400.
  • the adaptive scrubber 1410 will have the same position that it started with in FIG. 14A.
  • the scrubber bar 1402 would snap to the left side of the display of the device 1400, e.g., as illustrated in FIG. 9C.
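The snap-to-side rule reduces to comparing the final touch location against a threshold line (the centerline 1422 in the example above, though as the examples note, another line could serve as the threshold). A minimal sketch, with hypothetical names:

```java
// Illustrative snap rule: after a drag ends, the bar settles on whichever
// side of the threshold line the final touch location fell.
class SnapToSide {
    static String snap(double finalTouchX, double displayWidth) {
        double centerline = displayWidth / 2.0; // threshold; another line could be used
        return (finalTouchX >= centerline) ? "right" : "left";
    }
}
```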
  • FIGs. 15A-15D illustrate examples of a draggable scrubber that may snap to a location with a rubber band effect.
  • device 1500 is displaying a first video asset in a portrait orientation with the scrubber bar 1502 of adaptive scrubber 1510 oriented vertically at a first position on the right side of the display.
  • the device 1500 may detect a first location of touch 1520A by the user along the scrubber bar 1502 in the same manner described above with respect to FIGs. 14A-14B.
  • the user may drag the scrubber bar to a second location of touch 1520B.
  • the scrubber bar 1502 is exhibiting the rubber band effect, where a portion of the scrubber bar 1502 in FIG. 15B is stretched toward the second location of touch 1520B.
  • the scrubber bar 1502 may snap back to the right side of the display of the device 1500 if the user were to remove pressure, e.g., no longer contact the display of the device 1500 at the second location of touch 1520B.
  • a line other than centerline 1522 could be used as the threshold to determine the side of the display the scrubber bar 1502 will snap back.
  • the second location of touch may be located to the left of the centerline of the device.
  • the user may drag the scrubber bar 1502 to a second location of touch 1520C to the left of the centerline 1522.
  • the start position 1506 and end position 1508 of the scrubber bar 1502 may snap into position on the left side of the display of the device 1500.
  • the scrubber bar 1502 may snap fully into place as illustrated in FIG. 15D.
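The rubber band effect can be sketched as damped tracking: while the user drags, the bar’s displacement grows more slowly than the finger’s and approaches a maximum stretch, and on release the bar animates to its snap position. The damping constants below are arbitrary illustrative choices, not values from the disclosure.

```java
// Illustrative rubber band effect: the displacement actually applied to the
// scrubber bar lags the drag distance, with resistance growing as the drag
// gets longer (asymptotically capped at 200 units with these constants).
class RubberBand {
    static double applied(double dragDistance) {
        return dragDistance * 0.5 / (1.0 + Math.abs(dragDistance) / 400.0);
    }
}
```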
  • the media content application may prompt a user to select a preferred amount of mobility for the scrubber bar, e.g., no movement, free movement, snap to location, rubber band effect.
  • a user may be able to configure the mobility of the scrubber bar in the settings of the video application.
  • a computer-readable recording medium can be any medium that can contain or store programming for use by or in connection with an instruction execution system, apparatus, or device.
  • Such computer readable media may be stored on a memory, where a memory is any device capable of storing a computer readable medium and capable of being accessed by a computer.
  • a memory may include additional features.
  • a computer can comprise a conventional computer or one or more mobile devices.
  • a computer may include a processor.
  • a processor can be any device suitable to access a memory and execute a program stored thereon.
  • Communications may be transmitted between nodes over a communications network, such as the Internet.
  • Other communications technology may include, but is not limited to, any combination of wired or wireless digital or analog communications channels, such as instant messaging (IM), short message service (SMS), multimedia messaging service (MMS) or a phone system (e.g., cellular, landline, or IP-based).
  • These communications technologies can include Wi-Fi, Bluetooth, or other wireless radio technologies.
  • Examples of the disclosure may be implemented in any suitable form, including hardware, software, firmware, or any combination of these. Examples of the disclosure may optionally be implemented partly as computer software running on one or more data processors and/or digital signal processors.
  • the elements and components of an example of the disclosure may be physically, functionally, and logically implemented in any suitable way. Indeed, the functionality may be implemented in a single unit, in multiple units, or as part of other functional units. As such, examples of the disclosure may be implemented in a single unit or may be physically and functionally distributed between different units and processors.
  • FIG. 8 illustrates an example computer 800 (which may comprise a mobile device) capable of implementing the disclosed examples.
  • Example computer 800 includes a memory 802, a processor 804, an input interface 806, an output interface 808, and a communications interface 810.
  • Memory 802 may include volatile and non-volatile storage.
  • memory storage may include read-only memory (ROM), a hard disk device (HDD), random access memory (RAM), flash memory, and the like.
  • Processor 804 may include any device suitable to access a memory and execute a program stored thereon.
  • Input interface 806 may include a keyboard or mouse, for example.
  • Output interface 808 may include a conventional color monitor and printer, such as a conventional laser printer. Output interface 808 may provide requisite circuitry to electrically connect and interface the display and printer to the computer system.
  • Communications interface 810 may allow the network and nodes to connect directly, or over another network, to other nodes or networks.
  • the network can include, for example, a local area network (LAN), a wide area network (WAN), or the Internet.
  • the network, modules, and nodes can be connected to another client, server, or device via a wireless interface.
  • the input interface, processor, memory, communications interface, output interface, or combinations thereof, are interconnected by a bus.
  • the disclosed examples could be embodied as a JAVA tool, which means they can run on any platform that is JAVA enabled. Examples can run on a web server that provides a website for administrators to monitor the system results remotely. Users with administrative access to the web server can connect to and use visualization tools to take actions within a visualization. The examples can run on any type of server, including virtual servers or an actual machine. While JAVA is provided as an example, any suitable programming language or technology can be used to implement the examples of the disclosure.

Abstract

A method of presenting media content is disclosed. A plurality of assets is received at a device comprising a display and an orientation sensor. The plurality of assets comprises a first video asset associated with a first aspect ratio, and a second video asset associated with a second aspect ratio, different from the first aspect ratio. A desired aspect ratio is determined based on an output of the orientation sensor. In accordance with a determination that the desired aspect ratio is closer to the first aspect ratio than to the second aspect ratio, the first video asset is selected. In accordance with a determination that the desired aspect ratio is closer to the second aspect ratio than to the first aspect ratio, the second video asset is selected. The selected video is presented at the desired aspect ratio via the display.

Description

MEDIA CONTENT PRESENTATION
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. Patent Application No. 16/735,511, filed
January 6, 2020, which is a continuation-in-part of U.S. Patent Application No. 16/405,648, filed May 7, 2019, which claims priority to U.S. Provisional Application Serial No. 62/816,884, filed March 11, 2019, both of which applications are hereby incorporated by reference in their entirety.
FIELD
[0002] Examples of the disclosure relate generally to systems and methods for presenting media content to a user of a computing device, and more specifically, to systems and methods for presenting media content including video to a user of a mobile computing device.
BACKGROUND
[0003] With the growth of video-capable mobile devices, such as smartphones, tablets, and wearable devices, users’ media viewing habits have gradually shifted out of the living room, and into the outside world — into every corner and crevice where these devices can be used. Similarly, this shift has displaced the traditional television set — a bulky screen designed to be mounted semi-permanently in a single place, such as on a wall or on a flat surface — in favor of small, portable screens that can be viewed in virtually any position, and in virtually any orientation.
[0004] Such mobile devices place new demands on video content. One such demand relates to the aspect ratio (e.g., the ratio of a display width to a display height) of the video content. Under desired viewing conditions, a native aspect ratio of a video asset (e.g., a source file containing video content) matches the aspect ratio of the display on which the video asset is presented. For example, when viewing a video asset on a display having a 16:9 aspect ratio, it is desirable that the video asset itself have a 16:9 aspect ratio. If the video asset has an aspect ratio that differs from the aspect ratio of the display, one of two conventional solutions can be used to format the video asset for the display: either the video asset can be cropped to fit the display (e.g., via “pan and scan” techniques); or the video asset can be “letterboxed” by adding dummy content (e.g., black bars) to fill the regions of the display unoccupied by the video asset. Neither solution is desirable: cropping the video asset results in the cropped content being unviewable on the display (which can affect the viewer’s understanding or appreciation of the video asset); and letterboxing the video asset results in regions of the display being effectively unused (which can impair visibility, especially on mobile devices with limited display space).
[0005] A preferred solution is to anticipate the aspect ratio of the display on which video content will be viewed, and to provide to the display a video asset that matches that aspect ratio. But this approach is frustrated by mobile device displays that change aspect ratios as the user changes the orientation of the device. For instance, a display may be in a “portrait” mode (e.g., in which the aspect ratio is less than unity) when a device is held upright, but may shift to a “landscape” mode (e.g., in which the aspect ratio is greater than unity) when the device is rotated 90 degrees to the left or the right. A solution is needed for seamlessly switching between aspect ratios of video content without resorting to cropping or letterboxing techniques.
[0006] Further, users of mobile devices demand that video content be data-efficient: that is, that the video content respect the limited data storage capacity of many mobile devices, and the cost and overhead of downloading large files on consumer data plans; and that it accommodate the high latency, low bandwidth network conditions in which mobile devices may operate. The present disclosure describes one or more such solutions, which improve on conventional approaches by providing a data-efficient mechanism for quickly and seamlessly changing an aspect ratio of video content on a mobile device display.
BRIEF SUMMARY
[0007] Examples of the disclosure describe systems and methods of presenting media content. According to examples of the disclosure, a plurality of assets is received at a device comprising a display and an orientation sensor. The plurality of assets comprises a first video asset associated with a first aspect ratio, and a second video asset associated with a second aspect ratio, different from the first aspect ratio. A desired aspect ratio is determined based on an output of the orientation sensor. In accordance with a determination that the desired aspect ratio is closer to the first aspect ratio than to the second aspect ratio, the first video asset is selected. In accordance with a determination that the desired aspect ratio is closer to the second aspect ratio than to the first aspect ratio, the second video asset is selected. The selected video is presented at the desired aspect ratio via the display.
BRIEF DESCRIPTION OF THE DRAWINGS
[0008] FIGs. 1A-1D illustrate an example smartphone, an example tablet, an example wearable device, and an example head-mounted device that can each include a display according to examples of the disclosure.
[0009] FIG. 2 illustrates a display having an aspect ratio according to examples of the disclosure.
[0010] FIG. 3A illustrates an example of presenting video content in a portrait aspect ratio, according to examples of the disclosure.
[0011] FIG. 3B illustrates an example of presenting video content in a landscape aspect ratio, according to examples of the disclosure.
[0012] FIGs. 4A-4B illustrate examples of determining an orientation of a display according to examples of the disclosure.
[0013] FIG. 5 illustrates an example process for encoding and presenting assets to a user via a display, according to examples of the disclosure.
[0014] FIGs. 6A-6D illustrate examples of data streams comprising video and/or audio according to examples of the disclosure.
[0015] FIGs. 7A-7G illustrate examples of encoding assets comprising video according to examples of the disclosure.
[0016] FIG. 8 illustrates an example computer system for implementing various examples of the disclosure.
[0017] FIGs. 9A-9C illustrate examples of a user interface including a scrubber according to examples of the disclosure.
[0018] FIGs. 10A-10B illustrate examples of a user interface including a scrubber according to examples of the disclosure.
[0019] FIGs. 11A-11B illustrate examples of a user interface including a scrubber according to examples of the disclosure.
[0020] FIGs. 12A-12D illustrate examples of a user interface including a scrubber according to examples of the disclosure.
[0021] FIGs. 13A-13D illustrate examples of a user interface including a scrubber according to examples of the disclosure.
[0022] FIGs. 14A-14B illustrate examples of a user interface including a scrubber according to examples of the disclosure.
[0023] FIGs. 15A-15D illustrate examples of a user interface including a scrubber according to examples of the disclosure.
DETAILED DESCRIPTION
[0024] In the following description of examples, reference is made to the accompanying drawings which form a part hereof, and in which it is shown by way of illustration specific examples that can be practiced. It is to be understood that other examples can be used and structural changes can be made without departing from the scope of the disclosed examples.
[0025] FIGs. 1A-1D illustrate examples of mobile devices including displays that can be used to present video content (which may comprise one or more video assets, as well as, in some examples, corresponding audio assets, or other assets such as assets describing haptic effects). A mobile device may include, but is not limited to, smartphones, tablets, and wearable devices. Although described with respect to mobile devices, examples of this disclosure may also be used with a television display or a computer monitor. As used herein, video can include still images, motion video (e.g., sequences of image frames), GIF files, or any other suitable visual media content. FIG. 1A illustrates an example smartphone 110 with a display 112. FIG. 1B illustrates an example tablet device 120 with a display 122. FIG. 1C illustrates an example wearable device 130 (such as a smart watch) with a display 132. FIG. 1D illustrates an example wearable head-mounted device 140 with a display 142 configured to be positioned in front of a user’s eyes. In some examples, such a display can comprise a transmissive display, such as for augmented reality or mixed reality applications. In some examples, the head-mounted device can include a non-transmissive display, such as for virtual reality applications or conventional computing applications. Each of these example devices can include a respective one or more processors; one or more speakers; one or more actuators; one or more sensors, such as orientation sensors (e.g., accelerometers, gyroscopes, inertial measurement units (IMUs)), position sensors (e.g., GPS), cameras, microphones, or other suitable sensors; storage capabilities (e.g., internal volatile or non-volatile memory, or interfaces to external storage such as optical storage, solid-state storage, or magnetic storage); input or output devices; and networking capabilities, such as to send and receive data (including video data) via a network. The example devices shown in FIGs. 1A-1D can be used to implement embodiments of the disclosure.
[0026] Displays, such as those that can be included in the example devices described above with respect to FIGs. 1A-1D, can be characterized by an aspect ratio — conventionally, the ratio of the width of the display to the height of the display, although other conventions (e.g., the ratio of the height to the width, or an angle of a diagonal) can be used. FIG. 2 is illustrative. In FIG. 2, an example display 200 has a width w1 and a height h1; the aspect ratio can be expressed as the inverse of the slope of the diagonal line 202 (i.e., the width w1 divided by the height h1). Equivalently, the aspect ratio can be expressed in terms of the angle θ1 (e.g., the tangent of θ1). If the aspect ratio is less than unity (e.g., the inverse slope of 202 is less than 1, and θ1 is less than 45 degrees), the display can be described as having a “portrait” orientation. Conversely, if the aspect ratio is greater than unity (e.g., the inverse slope of 202 is greater than 1, and θ1 is greater than 45 degrees), the display can be described as having a “landscape” orientation. In some examples, the angle θ1 may be selected from an angle of 1-89 degrees. As described herein, a width and height of a display can refer to horizontal and vertical dimensions, respectively, of the display with respect to a viewer (which may differ from a width and height of the device itself, if the device is rotated with respect to the viewer).
[0027] FIGs. 3A-3B illustrate examples of video content being presented on a display of a mobile device. In FIG. 3A, device 300 is oriented with its display in a portrait orientation, and is presenting video content 310A via its display. In FIG. 3B, device 300 is rotated such that its display is in a landscape orientation, and is presenting video content 310B via its display. Device 300 can be freely rotated between the portrait orientation and the landscape orientation, and the corresponding video content (310A or 310B, respectively) will be presented accordingly. Video content can be presented by a media player application executing on device 300. One skilled in the art will understand that the device may be rotated both clockwise and counterclockwise to switch between the video content 310A and 310B.
[0028] In FIGs. 3A-3B, device 300 is shown presenting video content 310A/310B that includes a television news-style program comprising various visual elements: a host 322A/322B, which may be video footage of the host; and visual elements 320A/320B, 324A/324B, and 326A/326B, all overlaid on background 328A/328B. In addition, video content 310A/310B may be presented concurrently with a corresponding audio track (e.g., presented via speakers of device 300), or with some other type of output (e.g., haptic output). (In some cases, such audio tracks or other content may be presented without any corresponding video.) Video content 310A includes one or more video assets associated with a portrait aspect ratio to match a portrait aspect ratio of the display of device 300 in FIG. 3A. In some examples, video content 310A can comprise a single video asset that includes elements 320A, 322A, 324A, 326A, and 328A, and is formatted to a native portrait aspect ratio. In some examples, video content 310A can comprise two or more video assets. For instance, a first video asset of video content 310A could include a composite of host 322A and background 328A, with the first video asset formatted to a native portrait aspect ratio; a second video asset of video content 310A could include element 320A; a third video asset of video content 310A could include element 324A; and a fourth video asset of video content 310A could include element 326A. The second, third, and fourth video assets could be considered layers, and the layers could be combined (e.g., procedurally) with the first video asset (e.g., the background and host) to generate a composite video featuring the layers arranged on top of the first video asset, with the composite video having a native portrait aspect ratio. Similarly, video content 310B includes one or more video assets associated with a landscape aspect ratio to match a landscape aspect ratio of the display of device 300 in FIG. 3B. Video content 310B can in some examples comprise a single video asset including elements 320B, 322B, 324B, 326B, and 328B, and formatted to a native landscape aspect ratio; and can in some examples include two or more video assets, such as described above with respect to video content 310A, with the video assets used to generate a composite video having a native landscape aspect ratio. Device 300 can select whether to display video content 310A (and its corresponding assets) or video content 310B (and its corresponding assets), based on an orientation of device 300, such as described below. However, video content (and corresponding video assets) that are unselected may remain resident in the memory of device 300, such that switching between video content 310A and video content 310B can occur quickly and seamlessly. Further, transition effects (e.g., dissolves, fades, screen wipes) or animations can be used when switching between video content 310A and video content 310B to further ease the transition between the two.
[0029] FIGs. 4A and 4B illustrate examples of determining an orientation of a mobile device 400, in order to select which video assets of a plurality of video assets should be presented on the display of the device. For instance, the determined orientation could correspond to a portrait orientation, in which case a portrait mode video asset (e.g., for video content 310A in FIG. 3A) could be selected; or the determined orientation could correspond to a landscape orientation, in which case a landscape mode video asset (e.g., for video content 310B in FIG. 3B) could be selected. Orientation of a device can be determined by detecting an output of one or more orientation sensors of the device, such as an IMU, an accelerometer, and/or a gyroscope. In some examples, such as shown in FIG. 4A, the orientation can be determined by measuring, from an orientation sensor, an angle of the display with respect to a horizontal plane in an inertial frame. For instance, in the figure, an orientation sensor can indicate that an edge of the display of device 400 lies at an angle α (412) with respect to horizon 410. In some cases, angle α can be compared to the diagonal of the display; for instance, if angle α exceeds an angle between the diagonal and an edge of the display, the device can be considered to be in landscape mode; and conversely, if angle α is less than the angle between the diagonal and the edge, the device can be considered to be in portrait mode. According to another method, angle α can be compared to one or more threshold values, and the device can be considered to be in landscape mode if the angle exceeds the threshold, and in portrait mode if it does not. Example threshold values can include 30 degrees, 45 degrees, 60 degrees, or any other suitable threshold. Moreover, hysteresis effects can be implemented via asymmetric threshold values (e.g., a threshold value of 30 degrees for transitioning from portrait mode to landscape mode, and a threshold value of 60 degrees for transitioning from landscape mode to portrait mode). In some examples, threshold values can be specified by a user of the device, or by an author of video content to be presented on the device. Other suitable methods of determining an orientation of the device will be apparent.
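A sketch of the threshold method with hysteresis follows. Because the paragraph above leaves the direction of the comparison open, the sketch assumes that the measured angle grows as the device turns toward landscape, and that the mode flips only after the angle fully crosses the opposing threshold, which yields a stable dead band between the two asymmetric values; the assignment of 30 and 60 degrees to the two transitions is therefore an interpretive assumption.

```java
// Illustrative orientation detector with asymmetric (hysteresis) thresholds.
// Between the two thresholds, the current mode is retained, so small wobbles
// of the device do not cause the displayed video asset to flip-flop.
class OrientationDetector {
    private static final double TO_LANDSCAPE = 60.0; // must exceed to enter landscape
    private static final double TO_PORTRAIT = 30.0;  // must fall below to re-enter portrait
    private boolean landscape = false;

    boolean isLandscape(double alphaDegrees) {
        if (!landscape && alphaDegrees > TO_LANDSCAPE) landscape = true;
        else if (landscape && alphaDegrees < TO_PORTRAIT) landscape = false;
        return landscape;
    }

    public static void main(String[] args) {
        OrientationDetector d = new OrientationDetector();
        System.out.println(d.isLandscape(45)); // false: still portrait
        System.out.println(d.isLandscape(70)); // true: crossed into landscape
        System.out.println(d.isLandscape(45)); // true: within the dead band
        System.out.println(d.isLandscape(20)); // false: back to portrait
    }
}
```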
[0030] In some examples, such as shown in FIG. 4B, the orientation of the display can be determined with respect to an orientation of a user. For instance, in FIG. 4B, user 450 views device 400 while his or her head is oriented vertically along a vector 460 (i.e., vector 460 points upwards in the user’s field of view). Vector 460 can be determined using one or more sensors of device 400, such as cameras used to track the eyes of user 450. An orientation of device 400 can be determined by comparing an inertial orientation 414 of the device (which can be determined using orientation sensors, such as described above) to vector 460. For example, an angle between vector 460 and an edge of the device 400 can be detected, and used to determine a portrait orientation or a landscape orientation using the methods described above. An advantage of this approach is that if the eyes of user 450 are positioned at an angle with respect to an inertial frame — for example, if user 450 is reclined, or lying on his or her side — the determined orientation of device 400 can take that into account. This can be desirable where, for instance, a user wishes to watch video content in portrait mode while fully reclined, even though the device may be oriented in a landscape mode with respect to an inertial frame (as might be detected by an orientation sensor such as an accelerometer).
[0031] FIG. 5 illustrates an example process 500 for presenting video content to a user according to embodiments of the invention. In the figure, 502 represents a first video asset (e.g., video asset “L,” as in landscape), and 512 represents a second video asset (e.g., video asset “P,” as in portrait). The first video asset can be associated with a first orientation (e.g., a landscape orientation), and the second video asset can be associated with a second orientation (e.g., a portrait orientation). The first and second video assets may include the same general content; for example, the first and second video assets may be different versions of a single episode of a scripted television show. While example process 500 depicts two video assets, it will be understood that the example can be extended to any suitable number of video assets.
[0032] The first and second video assets 502 and 512 can be provided by a creative entity with creative control over the video assets. The creative entity can author (e.g., produce and edit) the first video asset 502 such that it is creatively suited for presentation in the first orientation (e.g., landscape orientation); for example, the creative entity can select camera shots, control scene placement, and position graphical elements such that the video content is understandable and engaging in a landscape orientation. The creative entity can similarly author the second video asset 512 such that it is creatively suited for presentation in the second orientation (e.g., portrait orientation). Viewability differences between the first orientation and the second orientation may result in significantly different creative demands of the first video asset 502 and the second video asset 512. For example, a full-body camera shot of a standing actor may be well suited for a portrait orientation, because the proportions of an actor standing upright may resemble the proportions of a portrait display. But the same full-body shot may be inappropriate for a landscape display, whose proportions vary significantly from those of the actor. Conversely, a wide-angle camera shot of a basketball court may present well on a landscape display, but may be entirely unsuited for a portrait display. Such differences may be especially pronounced on mobile devices, which may have small screens that make it difficult for a viewer to resolve small visual details (such as facial features). Accordingly, the creative entity may elect to produce a first video asset 502 that differs (even significantly) from the second video asset 512, even though the two video assets may relate to the same general content. For example, the first and second video assets may comprise entirely separate camera shots and sequences, transitions, focal points, post-processing effects, overlays, or other video elements, as appropriate. Providing separate first and second video assets, where those assets may differ creatively, offers an advantage over processes in which a single video asset is manipulated (e.g., via cropping or letterboxing) for presentation at multiple different aspect ratios. The creative entity can make human authorship decisions based on what the entity decides would look best when presented at a particular aspect ratio.
[0033] With respect to FIG. 5, 504 represents a first audio asset corresponding to the first video asset 502, and 514 represents a second audio asset corresponding to the second video asset 512. Audio asset 504 is thus associated with the first aspect ratio, and audio asset 514 is associated with the second aspect ratio. Audio assets 504 and 514 may represent audio tracks to be presented concurrently with their respective video asset. As above, while example process 500 depicts two audio assets, it will be understood that the example can be extended to any suitable number of audio assets (which may, but need not, equal the number of video assets).
[0034] In some examples, audio assets 504 and 514 may be identical assets, such as where identical audio tracks are to be presented regardless of whether a device is in a portrait orientation or a landscape orientation. In other examples, audio assets 504 and 514 may have different audio characteristics, such as where it is desirable to present different audio tracks based on whether the device is in a portrait orientation or a landscape orientation. For instance, during a scene of a video, first video asset 502 (e.g., in landscape orientation) may feature a distant camera shot on an actor’s face, while a corresponding scene in second video asset 512 (e.g., in portrait orientation) may feature a close-up camera shot on the same actor’s face. It may be desirable for the actor’s dialogue to be louder in the second audio asset 514 than in the first audio asset 504, to correspond with the close-up shot in portrait orientation; and for consistency with the user’s expectation that sounds originating closer to the camera will be relatively louder in an audio mix than those originating farther from the camera. As with the first and second video assets described above, a creative entity can exercise creative control over the first audio asset and the second audio asset, such that the audio assets reflect human judgment of what will sound best to the listener.
[0035] With respect to FIG. 5, stage 530 represents an encoder, which comprises or executes one or more encoding processes that can encode the data of one or more assets, such that the encoded data can be presented to a device or a process (e.g., a software application) that can decode the encoded data and present it to a user. Encoder 530 can be implemented using one or more processors, which in some cases may be located on a server remote to the presentation device, or which may be implemented across a distributed computing system. In some cases, encoder 530 can perform encoding processes on an entire asset in advance of that asset’s presentation to the user; in some cases, encoder 530 can perform encoding processes in real-time, while the asset is being presented to the user (as in live television). In such examples, an asset may be divided into individual units (e.g., groups of frames or samples), with encoding performed separately on each unit. The disclosure is not limited to any particular system or method for implementing encoder 530.
[0036] Encoder 530 can accept as input any suitable number and type of assets. In the example process 500 shown in FIG. 5, encoder 530 accepts as input the first video asset 502, the second video asset 512, the first audio asset 504, and the second audio asset 514, such as described above. Additionally, in some examples, encoder 530 can accept as input one or more layer assets 520, which may describe assets to be composited with other assets. For example, with respect to video content 310B in FIG. 3B described above, a first video asset 502 could include host 322B and background 328B; and layer assets 520 could include overlay elements 320B (e.g., a graphic overlay); 324B (e.g., a brand logo); and 326B (e.g., a clock). The layer assets and the first video asset could be provided as input to the encoding process 530, with the layer assets to be presented as a composite with the first video asset. In some examples, the composition could occur as part of an encoding process of the encoder 530; in some examples, layer assets 520 can be procedurally composited on a video asset by a media player application (e.g., application 560 described below). Layer assets 520 can, but need not, be associated with a particular aspect ratio.
[0037] Encoder 530 can encode its input assets according to one or more suitable processes, which may be selected depending on criteria such as network conditions (e.g., latency, available bandwidth), content type, user preferences, or display type (including display aspect ratios), such as described below. Depending on which encoding processes are used, encoder 530 can output one or more streams 540 of encoded data. For example, data streams 540 can include a first encoded data stream 542, a second encoded data stream 544, and a third encoded data stream 546 (and potentially other data streams). A data stream may correspond to any suitable combination of video data, audio data, or data associated with any other suitable type of asset (e.g., haptic data). Further, the disclosure is not limited to any particular correlation of data streams to assets (such as assets 502, 504, 512, 514, and 520 described above); a data stream can include data for any suitable number or type of assets.
[0038] Data streams 540 can be delivered to device 550, which may correspond to the example devices in FIGs. 1A-1D. In some cases, data streams 540 can be downloaded by device 550 via a computer network, such as a content delivery network (e.g., via a streaming download, or by downloading individual files). In some cases, data streams 540 can be stored on storage media (e.g., optical storage such as a DVD-ROM, solid-state storage such as a USB memory stick, or magnetic storage such as a hard drive), and read by device 550 via an interface to the storage media. A media player application 560 (e.g., a software application executing on one or more processors of device 550) can accept the encoded data streams 540 as input, and process that data (e.g., by decompressing it and setting rendering parameters) to present the underlying assets (e.g., video assets 502 and 512 and audio assets 504 and 514) to a user. For example, media player application 560 can present the video assets to the user via a display 572 of device 550; and can concurrently present audio assets to the user via speaker 574 of device 550. In some examples, media player application 560 (alone or in conjunction with one or more additional applications) can present other asset types, such as haptic assets, via device 550 (e.g., via a motor or actuator 576 of device 550). In some examples, process 500 can incorporate various interactive behaviors; for example, media player application 560 can accept user input (e.g., via an input device of device 550) relating to process 500 and respond accordingly.
[0039] In presenting the assets, media player application 560 can select between two or more presentations of video content, such as described above with respect to FIGs. 3A and 3B and video content 310A/310B. Media player application 560 can select one or more of a plurality of assets to be presented, decode the encoded data, and present the decoded data corresponding to the selected assets. In some cases, media player application 560 can identify an orientation of device 550, such as described above with respect to FIGs. 4A-4B; identify a desired aspect ratio based on that orientation; and select a video asset associated with an aspect ratio closest to that desired aspect ratio. For example, if encoded data streams 540 encode two video assets each having a different aspect ratio, media player application 560 can identify which of the two video assets has the aspect ratio closest to the desired aspect ratio, and select that video asset for presentation to the user. In selecting a video asset for presentation, media player application 560 can decode the video data of data streams 540, identify the decoded video data that corresponds to the selected video asset, and present that decoded video data while disregarding data that corresponds to the unselected video asset. Media player application 560 can similarly identify a corresponding audio asset, or other type of asset, and concurrently present that asset along with the presentation of the selected video asset. This process can be extended to any suitable number and type of assets. The process can be performed multiple times by media player application 560 during playback, for example on a frame-by-frame basis, as the user may continue to reorient device 550. This process, wherein media player application 560 selects one or more desired assets from a plurality of assets resident in memory, can carry a speed advantage over other solutions, such as selectively delivering assets to device 550; selecting a different asset for presentation need not require re-encoding an asset, or fetching the asset from a remote location, which can introduce unwanted delay and overhead. Instead, the newly selected asset is preloaded in memory and ready for immediate presentation.
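The asset-selection logic of paragraph [0039] might be sketched as follows. The VideoAsset type is a hypothetical placeholder, and the absolute difference between ratios is merely one possible measure of which aspect ratio is "closest"; any suitable metric may be used.

```java
import java.util.List;

public class AssetSelector {

    /** Hypothetical representation of a decoded video asset and its aspect ratio. */
    public record VideoAsset(String name, double aspectRatio) {}

    /** Returns the asset whose aspect ratio is closest to the desired ratio. */
    public static VideoAsset selectAsset(List<VideoAsset> assets, double desiredAspectRatio) {
        VideoAsset best = null;
        double bestDelta = Double.MAX_VALUE;
        for (VideoAsset asset : assets) {
            double delta = Math.abs(asset.aspectRatio() - desiredAspectRatio);
            if (delta < bestDelta) {
                bestDelta = delta;
                best = asset;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<VideoAsset> assets = List.of(
                new VideoAsset("portrait", 9.0 / 16.0),
                new VideoAsset("landscape", 16.0 / 9.0));
        // Desired aspect ratio derived from the display's current dimensions,
        // e.g. 1080x1920 when the device is held in portrait orientation.
        double desired = 1080.0 / 1920.0;
        System.out.println(selectAsset(assets, desired).name()); // prints "portrait"
    }
}
```

Because both assets remain decoded in memory, re-running this selection on reorientation (e.g., frame by frame) only changes which decoded data is rendered, consistent with the speed advantage noted above.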
[0040] FIGs. 6A-6D illustrate examples of data streams 540. In the example shown in FIG. 6A, data streams 540 can comprise three separate streams: a first stream 602 comprising a first video asset (e.g., a video asset in a portrait orientation); a second stream 604 comprising a second video asset (e.g., a video asset in a landscape orientation); and a third stream 606 comprising an audio asset (which can be associated with both the first video asset and the second video asset). In the example shown in FIG. 6B, data streams 540 can comprise two separate streams: a first stream 612 comprising a first video asset (e.g., a video asset in a portrait orientation) and a corresponding first audio asset (e.g., an audio asset for playback on a device in a portrait orientation); and a second stream 614 comprising a second video asset (e.g., a video asset in a landscape orientation) and a corresponding second audio asset (e.g., an audio asset for playback on a device in a landscape orientation). Encoding in multiple parallel data streams may be useful, for example, to take advantage of multiple transmission paths and various streaming protocols.
[0041] In the example shown in FIG. 6C, data streams 540 comprise just a single stream 622, where stream 622 comprises a first video asset (e.g., a video asset in a portrait orientation), a second video asset (e.g., a video asset in a landscape orientation), and an audio asset (which can be associated with both the first video asset and the second video asset). In the example shown in FIG. 6D, data streams 540 comprise just a single stream 632, where stream 632 comprises a first video asset (e.g., a video asset in a portrait orientation) and a corresponding first audio asset (e.g., an audio asset for playback on a device in a portrait orientation); and a second video asset (e.g., a video asset in a landscape orientation) and a corresponding second audio asset (e.g., an audio asset for playback on a device in a landscape orientation). In each of the examples in FIGs. 6A-6D, data streams 540 can be delivered to device 550, which can decode the underlying data and present it to a user as video content (e.g., video content 310A/310B described above with respect to FIGs. 3A-3B). Data streams 540 can include one or more manifest files that can include metadata relating to the contents of the data stream, and that can include various instructions or parameters for decoding and presenting those contents.
[0042] In the examples shown in FIGs. 6C and 6D, multiple assets are encoded (e.g., by encoder 530) in a single data stream. This encoding can comprise interlacing the multiple assets; concatenating the multiple assets; and/or employing any other suitable technique. In some examples, encoding multiple video assets may comprise composing a video from respective time-matched frames of two or more input video assets. For example, a first frame of the composed video can comprise video data from a first frame of a first video asset alongside video data from a first frame of a second video asset. Corresponding input frames can be scaled and positioned in the composed video, such as described further below; the composed video can be encoded (e.g., on a frame-by-frame basis) by an encoder, and the encoded data delivered to device 550 as described above. Other suitable implementations are contemplated, and specific implementations of encoding multiple assets in a single data stream can vary depending on a codec used.
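As one non-limiting sketch of composing time-matched frames, the following example places a portrait frame and a landscape frame side by side in a single composed frame, scaling the portrait frame so the two inputs share a common height (compare FIG. 7C below). The scaling policy is an assumption for illustration, and uses standard java.awt imaging.

```java
import java.awt.Graphics2D;
import java.awt.image.BufferedImage;

public class FrameComposer {

    /** Composes one output frame from time-matched portrait and landscape input frames. */
    public static BufferedImage composeFrame(BufferedImage portrait, BufferedImage landscape) {
        // Scale the portrait frame so the two inputs share a common height (h1 == h2).
        int h = landscape.getHeight();
        int scaledPortraitWidth = portrait.getWidth() * h / portrait.getHeight();

        BufferedImage composed = new BufferedImage(
                scaledPortraitWidth + landscape.getWidth(), h, BufferedImage.TYPE_INT_RGB);
        Graphics2D g = composed.createGraphics();
        g.drawImage(portrait, 0, 0, scaledPortraitWidth, h, null); // left region: portrait asset
        g.drawImage(landscape, scaledPortraitWidth, 0, null);      // right region: landscape asset
        g.dispose();
        return composed;
    }
}
```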
[0043] Data efficiencies can be realized by encoding multiple assets in a single data stream, such as data stream 622 shown in FIG. 6C or data stream 632 shown in FIG. 6D. For example, compared to delivering data in multiple streams, single stream delivery can incur less network overhead. In addition, certain video codecs (e.g., H.264) can take advantage of data redundancies, such as regions of similar or identical video content in a data stream (e.g., in composed videos such as described above), to reduce file size. Accordingly, combining data for multiple video assets in a single data stream can improve the likelihood of such redundancies being present in the data stream (particularly where, for example, two video assets present substantially similar versions of common video content) and result in greater compression ratios and overall reduced data size. Depending on the nature of the assets, the nature of device 550, user preferences, or other factors, certain encoding schemes may result in lower overall data usage or may otherwise be preferable to certain other encoding schemes. Further, in some examples, multiple encoding schemes may be used, for example to produce multiple data streams from which only a subset are delivered to device 550.
[0044] FIGs. 7A-7G illustrate various example encodings that can be used by encoder 530 to generate a data stream, such as data stream 622 or 632 described above, that includes both a first video asset and a second video asset. As described above, media player application 560 can decode the data stream, select the desired video asset from the decoded data stream, and render that video asset to a display while disregarding the unselected video asset. While not shown in FIGs. 7A-7F, audio assets or other assets may be encoded with the first and second video assets in the data streams shown in the figures. The examples shown in FIGs. 7A-7F can include composed videos, such as described above, where the composed videos comprise time-matched frames from two or more video assets; the frames are scaled and positioned in the composed video, which is encoded by encoder 530.
[0045] In the example data stream shown in FIG. 7A, a composed video 710 comprises a first video asset 712 and a second video asset 714, where the height h1 of the first video asset 712 (e.g., portrait orientation) equals the width w2 of the second video asset 714 (e.g., landscape orientation). For example, the first and second video assets may have inverse aspect ratios, such as 16:9 and 9:16. In this example, encoder 530 can generate data stream 622 by encoding a version of the second video asset 714 that is rotated 90 degrees with respect to the first video asset 712, such that the rotated asset 714 can be positioned horizontally adjacent to asset 712 as shown in the figure.
[0046] In the example data stream shown in FIG. 7B, a composed video 720 comprises a first video asset 722 and a second video asset 724, where the width w1 of the first video asset 722 (e.g., portrait orientation) equals the width w2 of the second video asset 724 (e.g., landscape orientation). The second video asset 724 may be scaled down from an original size, such that its width w2 equals w1; the overall data size of the data stream may be reduced in this configuration. In this example, encoder 530 can generate data stream 622 or 632 by positioning the first video asset 722 vertically adjacent to the second video asset 724 as shown in the figure and encoding the composed video.
[0047] In the example data stream shown in FIG. 7C, a composed video 730 comprises a first video asset 732 and a second video asset 734, where the height h1 of the first video asset 732 (e.g., portrait orientation) equals the height h2 of the second video asset 734 (e.g., landscape orientation). The first video asset 732 may be scaled down from an original size, such that its height h1 equals h2; the overall data size of the data stream may be reduced in this configuration. In this example, encoder 530 can generate data stream 622 or 632 by positioning the first video asset 732 horizontally adjacent to the second video asset 734 as shown in the figure and encoding the composed video.
[0048] In the example data stream 740 shown in FIG. 7D, frames of a first video asset
(e.g., frames 741, 743, 745 in portrait orientation) are interlaced with frames of a second video asset (e.g., frames 742, 744, 746 in landscape orientation), such that media player application 560 can present either the first video asset or the second video asset by de-interlacing the frames and presenting those frames corresponding to the desired video asset. In the example shown, frames of the second video asset can be rotated 90 degrees with respect to frames of the first video asset, for spatial consistency with frames of the first video asset.
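A minimal sketch of the de-interlacing selection of paragraph [0048] follows, assuming frames of the two assets simply alternate in the decoded stream (even indices for the first asset, odd indices for the second); the actual interlacing pattern may vary by implementation.

```java
import java.util.ArrayList;
import java.util.List;

public class Deinterlacer {

    /**
     * Selects the frames of one asset from an interlaced sequence.
     * Frame is a hypothetical type; pass firstAsset = true for the
     * even-indexed frames, false for the odd-indexed frames.
     */
    public static <Frame> List<Frame> selectAsset(List<Frame> interlaced, boolean firstAsset) {
        List<Frame> selected = new ArrayList<>();
        for (int i = 0; i < interlaced.size(); i++) {
            // Keep even-indexed frames for the first asset, odd-indexed for the second.
            if ((i % 2 == 0) == firstAsset) {
                selected.add(interlaced.get(i));
            }
        }
        return selected;
    }
}
```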
[0049] In the example data stream shown in FIG. 7E, a composed video 750 comprises a first video asset 752 and a second video asset (e.g., 754, 756, 758). The first video asset 752 (e.g., portrait orientation) can be encoded at a full size, while the second video asset can be scaled to various reduced sizes. For example, video assets 756 and 758 illustrate video assets of increasingly reduced size with respect to 754. This may be desirable if device 550 is not expected to operate in an orientation (e.g., landscape orientation) that corresponds to the second video asset; the data size of the data stream can be reduced accordingly, while still making reduced-size versions of the second video asset available to media player application 560 in the unexpected event that the device rotates such that the second video asset should be played. In some examples, such an event could be followed by providing the second video asset to device 550 at a full size (e.g., as shown in FIG. 7A). Similarly, FIG. 7F shows an example data stream featuring a composed video 760, in which the first video asset 762 is a full-size video asset in a landscape orientation, and reduced-size video assets 764, 766, and 768 are in a portrait orientation.
[0050] In the examples shown in FIGs. 6A-6D and FIGs. 7A-7F, video assets may be rotated or oriented as desired to obtain network efficiencies or meet other requirements. For example, with respect to FIG. 7A, the portrait orientation video asset 712 may be oriented at a first angle (e.g., zero degrees), while the landscape orientation video asset 714 may be oriented at a second angle (e.g., 90 degrees with respect to the first angle). Orienting a first video asset at a non-zero angle with respect to a second video asset can permit the first video asset and the second video asset to be encoded in a single frame that meets desired dimensional requirements. For example, with respect to FIG. 7A, the second video asset 714 (with width w2 and height h2) can be oriented at a 90 degree angle with respect to the first video asset 712 (with width w1 and height h1), such that the combined frame can be encoded as a rectangle with a height of h1 and a width of w1 + h2. In all examples described herein and shown in the figures, it is within the scope of the disclosure that the video assets may be oriented at any suitable angle (including a zero angle) with respect to each other.
[0051] In some examples, a first video asset may be oriented at a same angle as a second video asset. This may be necessary when, for example, technical requirements demand that a video asset be encoded with a particular orientation. For instance, a video playback application may require that a video asset be oriented at a particular angle in order to present that video asset to a viewer. In some cases, DRM or other content protection mechanisms may require encoding of a video asset in a particular orientation.
[0052] FIG. 7G shows an example data stream in which a composed video 770 comprises a first video asset 772 and a second video asset 774. In the example, second video asset 774 may be a landscape orientation video asset that is not oriented at a non-zero angle with respect to first video asset 772, a portrait orientation video asset. In the example shown, the first video asset 772 has a width w1 and a height h1; and the second video asset 774 has a width w3 and a height h2. In the example shown, the first video asset 772 and the second video asset 774 are positioned in an “L” shape, in which the first video asset 772 and the second video asset 774 share a single common boundary (in the example, the width w1 of first video asset 772, which is collinear with the width w3 of the second video asset 774). The first video asset 772 and the second video asset 774 need not share any common linear dimensions. For example, as shown in the figure, width w1 of the first video asset 772 is shorter than width w3 of the second video asset 774; and height h1 of the first video asset 772 is longer than height h2 of the second video asset 774. In the example, the first video asset 772 and the second video asset 774 are encoded together in a rectangular frame having height h3 (i.e., h1 + h2) and width w3. The lack of a common linear dimension differs from some examples shown above: for instance, in example data stream 710 in FIG. 7A, landscape orientation video asset 714 can be rotated 90 degrees to share a boundary (i.e., w2 and h1) coextensive with portrait orientation video asset 712. And in example data stream 720 in FIG. 7B, a resolution of landscape orientation video asset 724 can be reduced with respect to portrait orientation video asset 722 such that landscape orientation video asset 724 shares a boundary (i.e., w1 and w2) coextensive with portrait orientation video asset 722. Unlike these examples, the example encoding shown in FIG. 7G results in a region 776 of composed video 770 that does not belong to either the first video asset 772 or the second video asset 774. In some examples, region 776 can be used to encode additional data (e.g., one or more additional video assets); but in some examples, region 776 may be a region of unutilized space.
[0053] Certain encoding schemes, such as described above, may be more desirable than others depending on variables such as network conditions; user habits; or the type of content to be presented. In some examples, machine learning techniques (e.g., neural network techniques) or probabilistic models can be used to identify and predict which encoding scheme is preferable in a particular circumstance.
[0054] In some examples described and shown above, video assets are encoded adjacent to one another, e.g., in a data stream. In some examples, adjacent video assets may share a common pixel boundary. However, it is within the scope of this disclosure that adjacent video assets may be separated by an insubstantial region of space. For example, it is contemplated that adjacent video assets may be separated by a row or column of 1 or 2 pixels.
[0055] While the above examples are described with respect to video assets, it will be understood that aspects of the disclosure can be applied to audio-only assets (e.g., music, podcasts) that may not have a video component. Further, the disclosure can be applied to assets comprising still images, GIF files, or other suitable types of media.
Adaptive Scrubber
[0056] As described above, a user of a display device may switch between a first video asset, e.g., a video in a portrait orientation, and a second video asset, e.g., a video in a landscape orientation, by rotating the display device. In some examples, a data stream may include both a first video asset and a second video asset, such as described above. Playback of a video asset can comprise a user interface with a “scrubber” element for graphically representing temporal locations of the video asset (e.g., via a position of a playhead). For example, a scrubber element can depict a video asset as a line, with a first time (e.g., the beginning) of the video asset at one end of the line, and a second time (e.g., the end) of the video asset at the other end of the line. During video playback or editing, a user can change the frame of a video, or a time corresponding to a playback position, by dragging a slider or playhead across a scrubber bar or timeline. (The timeline may, but need not, be represented as a straight line.) On touch-sensitive devices, a user can manipulate the scrubber using touch controls; for example, dragging a playhead along a timeline with one’s finger on a touch screen can move the playback position accordingly. On non-touch-sensitive devices, the user can manipulate the scrubber using a mouse, a remote control, a gamepad, or another suitable input device.
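The basic time-to-position mapping a scrubber performs might be sketched as follows, assuming a straight-line timeline measured in pixels; real implementations may differ in units and clamping behavior.

```java
public class Scrubber {

    /** Playhead pixel offset along the bar for a given playback time. */
    public static double positionForTime(double timeSec, double durationSec, double barLengthPx) {
        return (timeSec / durationSec) * barLengthPx;
    }

    /** Playback time corresponding to a playhead dragged to a given pixel offset. */
    public static double timeForPosition(double offsetPx, double barLengthPx, double durationSec) {
        // Clamp the drag position to the ends of the bar before converting to time.
        double clamped = Math.max(0.0, Math.min(offsetPx, barLengthPx));
        return (clamped / barLengthPx) * durationSec;
    }
}
```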
[0057] The usability of a scrubber can depend on various factors. For example, it is desirable that a scrubber be easily accessible (e.g., via one’s dominant thumb while holding a touch screen device with the dominant hand). It is further desirable that the scrubber not overlap or obscure important information on a display screen. However, it is also desirable that the scrubber be sufficiently large to allow for fine and coarse inputs; a scrubber that is too small may frustrate a user’s ability to make fine adjustments to the playhead, particularly when viewing lengthy video assets. When a scrubber is used to manipulate one or multiple video assets (for example, either a portrait orientation video asset or a landscape orientation video asset, such as in the examples described above), a one-size-fits-all scrubber design may be unsuitable. For instance, a scrubber that is sufficiently large, easily accessible to the user, and does not overlap important video content when viewing a portrait orientation video may suddenly become unsuitable when the user switches to a landscape orientation video; for example, the landscape orientation video may place important information in a different location on the display, and if the user holds a mobile device in a landscape orientation, the scrubber may no longer be within easy reach of the user.
[0058] It can be desirable that a scrubber tool comprise an adaptive scrubber; that is, a scrubber whose appearance or functionality may adapt to the video asset being played, and/or to the orientation of the display device. This can be particularly desirable when a user (e.g., of a mobile display device) wishes to switch between two video modes or video assets (e.g., portrait and landscape), such as described above, and control them with a common scrubber tool. In embodiments described herein, an adaptive scrubber may adopt one or more first characteristics (e.g., appearance, display location, dimensions, functionality) when viewing a first video asset or a first video mode, and may adopt one or more second characteristics when viewing a second video asset or a second video mode. For example, a video asset corresponding to a portrait orientation may be associated with a playback interface or an adaptive scrubber having a vertically oriented layout, while a video asset corresponding to a landscape orientation may be associated with a playback interface or an adaptive scrubber having a horizontally oriented layout. In this manner, the adaptive scrubber can adopt an appearance or functionality suited to the video at hand. This may enhance ease of use of the scrubber and enhance the overall viewing experience for the user. In some examples, other aspects of video playback may adapt to the video asset being played; one example is adaptive closed captioning, where the location and/or the content of the closed captioning may differ between a first video asset and a second video asset.
[0059] FIGs. 9A-9C illustrate examples of a user interface including an adaptive scrubber being presented on a display of a mobile device in a portrait orientation. In FIG. 9A, a device 900 displaying an adaptive scrubber 910 on a display may include a scrubber bar 902 having a start position 906 and an end position 908. The scrubber bar 902 may include a slider or playhead 904. The slider 904 may move along the scrubber bar 902 to indicate the playback time that has elapsed. For example, at the beginning of a video, the slider may be positioned along the scrubber bar 902 at the start position 906. As the video plays, the slider will move along the scrubber bar 902 corresponding to the amount of time the video has played. At the end of the video, the slider 904 can be positioned at the end position 908. In some embodiments, the slider 904 may have a timecode 918 positioned next to the slider 904. For example, as seen in FIG. 9A, the timecode 918 may be positioned to the left of the slider 904. In other embodiments, the timecode may be positioned to the right of, below, or above the slider 904. As the slider 904 moves along the scrubber bar 902, the timecode 918 may move with the slider to maintain the same relative position to the slider 904.
[0060] The adaptive scrubber may also include playback controls 912. The playback controls 912 may include a fast-forward, a rewind, and a play/pause icon. Selecting an icon can have a corresponding effect on the video as well as the position of the slider 904 on the scrubber bar 902. For example, selecting (e.g., clicking or tapping) the play/pause icon on the playback controls 912 may cause a playing video to pause. The slider 904 can remain in place along the scrubber bar 902 at the time the selection was made, i.e., playback of the currently displayed video asset will stop. If the play command is selected, the presented video asset will resume playing at a real-time speed. Selecting the fast-forward icon on the playback controls 912 may cause the video to fast-forward, i.e., advance the current video asset at a rate quicker than real-time. The slider 904 can similarly advance along the scrubber bar 902 at a rate quicker than real-time corresponding to the video. In some embodiments, there may be more than one fast-forward speed. For example, selecting the fast-forward icon once may result in a fast-forward rate corresponding to 1.5x the speed of the original video. Selecting the fast-forward icon a second time may result in a fast-forward rate of 2.0x the speed of the original video. Selecting the fast-forward icon a third time may result in a fast-forward rate of 3.0x the speed of the original video. The rewind icon of the playback controls 912 may be used to rewind the video, i.e., present frames of the selected video asset in a reverse order starting from the current frame. The rewind icon may have multiple speeds as described with respect to the fast-forward icon.
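The multi-speed fast-forward behavior of paragraph [0060] might be sketched as follows; the 1.5x/2.0x/3.0x rate table mirrors the example speeds above, and clamping at the highest rate is an assumption for illustration.

```java
public class FastForward {
    // Example rate table from the description above; real players may differ.
    private static final double[] RATES = {1.5, 2.0, 3.0};

    private int taps = 0;

    /** Returns the playback rate after one more tap of the fast-forward icon. */
    public double tap() {
        double rate = RATES[Math.min(taps, RATES.length - 1)]; // clamp at the top rate
        taps++;
        return rate;
    }

    /** Resets to normal playback, e.g., when play/pause is selected. */
    public void reset() {
        taps = 0;
    }
}
```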
[0061] The user interface elements associated with the adaptive scrubber may be positioned based on the video asset currently being played by the device, e.g., may depend on the orientation of the device. In FIG. 9A, the device 900 is oriented with its display in portrait orientation, which may correspond to a first video asset presenting video content on the display in a vertical layout. While the device has a portrait orientation and is playing the first video asset, the adaptive scrubber 910 may have a corresponding vertical layout on the display such that the length of the scrubber bar 902A is parallel to the longest side of the device 900. A user may rotate the device such that the device 900 is oriented with its display in a landscape orientation, which may trigger the device to play a second video asset presenting video content with a horizontal layout, as shown in FIG. 9B. The scrubber bar 902B is similarly rotated to have a corresponding horizontal layout such that the scrubber bar 902B is parallel to the longest side of the device 900. Device 900 can be freely rotated between the portrait orientation and the landscape orientation, and the adaptive scrubber 910 will be presented accordingly.
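A minimal sketch of the orientation-driven layout switch of paragraph [0061] follows; the Orientation and ScrubberLayout types are hypothetical placeholders for whatever the playback application actually uses.

```java
public class AdaptiveScrubberLayout {

    enum Orientation { PORTRAIT, LANDSCAPE }

    enum ScrubberLayout { VERTICAL_RIGHT_EDGE, HORIZONTAL_BOTTOM_EDGE }

    /** Chooses a layout so the bar runs parallel to the display's longest side. */
    public static ScrubberLayout layoutFor(Orientation orientation) {
        return orientation == Orientation.PORTRAIT
                ? ScrubberLayout.VERTICAL_RIGHT_EDGE
                : ScrubberLayout.HORIZONTAL_BOTTOM_EDGE;
    }
}
```

In practice this selection would be re-evaluated whenever the orientation sensor reports a change, so the scrubber transitions together with the selected video asset.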
[0062] Referring to both FIGs. 9A and 9B, rotating the device between the first video asset, e.g., portrait orientation, and the second video asset, e.g., landscape orientation, may affect the position or orientation of user interface elements of the adaptive scrubber. These can include the timecode 918 and playback controls 912, and additional user interface elements described below (e.g., FIGs. 12A-12D and 13A-13D). For example, in FIG. 9A, the device 900 is playing the first video asset in portrait orientation such that the scrubber bar 902 is on the right side of the display and the time code 918 is positioned to the left of the slider 904 on the scrubber bar 902. When the device 900 is rotated to the orientation shown in FIG. 9B, the device may play the second video asset in landscape orientation such that the scrubber bar 902 is positioned at the bottom of the display and the time code 918 is positioned below the slider 904 on the scrubber bar 902. In some examples, when the second video asset (e.g., landscape orientation) is playing, the time code may be positioned above the slider.
[0063] Referring to FIGs. 9B-9C, the playback controls 912 may similarly change position depending on an orientation of the display device (e.g., portrait or landscape), which may correspond to the video asset being presented, such as described above. For example, in FIG. 9B, the device 900 is playing a second video asset in landscape orientation with the playback controls 912 positioned at the bottom of the display. According to some examples, when the device 900 is rotated to the orientation shown in FIG. 9C, the device may play the first video asset in portrait orientation, such that the playback controls 912 are positioned to the left of the scrubber bar 902 in a vertical configuration.
[0064] With respect to FIG. 9A, while the device 900 is showing the first video asset in a portrait orientation, the adaptive scrubber 910A may have a vertical layout on the display such that the scrubber bar 902A is located on a right side of the display. This layout may be preferable if the user is right-hand dominant. Referring to FIG. 9C, while the device 900 is showing the first video asset in a portrait orientation, the adaptive scrubber 910C may have a vertical layout on the display such that the scrubber bar 902C is located on a left side of the display. This layout may be preferable if the user is left-hand dominant. According to some examples, a user may indicate a preference for having the scrubber bar located on either the right or left side of the display.
[0065] According to some examples, a handheld device may detect a location of a hand of the user grasping the device to automatically determine which side of the screen to position the scrubber bar. In FIG. 10A, the device 1000 may detect that a left hand 1016A of the user is grasping the device and position the scrubber bar 1002 on the left side of the display of the device 1000. This may allow a user to manipulate the position of the slider 1004 and playback controls 1012 with the thumb of the left hand 1016A. For example, there may be sensors in the device 1000 that can detect that the left hand 1016A is holding the phone. In some examples, the angle or orientation of the device 1000 may be used to determine which hand is grasping the phone. According to some examples, if the device 1000 detects that a left hand of the user is grasping the device, the scrubber bar may instead be positioned on the right side of the display of the device. For example, a user may prefer to grasp the device with the left hand and manipulate the adaptive scrubber with the right hand.
[0066] In FIG. 10B, the device 1000 may detect that a right hand 1016B of the user is grasping the device and position the scrubber bar 1002 on the right side of the display of the device 1000. This may allow a user to manipulate the position of the slider 1004 and playback controls 1012 with the thumb of the right hand 1016B. According to some examples, if the device detects that a right hand of the user is grasping the device, the scrubber bar may instead be positioned on the left side of the display of the device. For example, a user may prefer to grasp the device with the right hand and manipulate the adaptive scrubber with the left hand.
[0067] In some examples, locations of the adaptive scrubber user interface elements, e.g., scrubber bar, slider, and playback controls, may depend on the content of the video asset being played. For example, the adaptive scrubber user interface elements may be presented in areas of the display that do not correspond to, or overlap with, a focal region of the video asset. A focal region may be a region of interest, such as an area of the screen to which a creative entity would expect a user’s gaze to be directed. For example, the focal region for a video may correspond to an area where a person is delivering dialogue. Referring back to FIG. 3A, the video content 310A includes a television news-style program with the host 322A positioned near the right side of the display. In this example, the focal region may correspond to the location of the host 322A. An adaptive scrubber according to this example may be presented on the left side of the display to avoid obstructing the focal region, and visual elements 320A and 324A could be repositioned accordingly. Another example of a focal region may include a region where there is relative movement of the same object between frames of the video asset, e.g., an explosion or water running in a river. This concept of positioning the adaptive scrubber in regions other than the focal region may also apply to adaptive closed captioning. In some examples, focal regions may be determined autonomously, such as by a playback application; for instance, face detection techniques could be used to identify an actor’s face as a focal region. In some examples, one or more focal regions can be manually inserted, such as by a film director, or by an end user. Focal region data can be included in metadata associated with a corresponding video asset. In some cases, focal regions may change location over the length of a video, with the adaptive scrubber being continually repositioned throughout the video to avoid overlapping with the focal regions.
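Focal-region avoidance such as described in paragraph [0067] might be sketched as follows for a vertical scrubber, assuming the focal region is supplied as a rectangle in display coordinates (e.g., from asset metadata or face detection); the side-selection rule is one simple possibility.

```java
import java.awt.Rectangle;

public class ScrubberPlacement {

    /**
     * Returns true if a vertical scrubber should sit on the left edge of the
     * display, i.e., on the side farther from the focal region's center.
     */
    public static boolean placeOnLeft(Rectangle focalRegion, int displayWidth) {
        int focalCenterX = focalRegion.x + focalRegion.width / 2;
        // Focal region on the right half => scrubber goes left, and vice versa.
        return focalCenterX > displayWidth / 2;
    }
}
```

Re-evaluating this placement as focal-region metadata changes over the length of the video would yield the continual repositioning described above.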
[0068] FIGs. 11A-11B illustrate an example adaptive scrubber layout that includes chapter heads 1114A, 1114B, and 1114C positioned along the scrubber bar 1102. The device 1100 is in a portrait orientation and displaying a first video asset (e.g., video in a portrait orientation) and a corresponding adaptive scrubber 1110 with a vertical layout. The location of the chapter heads 1114A, 1114B, and 1114C along the scrubber bar 1102 corresponds to the time at the start of the respective chapter. For example, the adaptive scrubber illustrated in FIG. 11A includes four chapters: a chapter starting at the start position 1106, a second chapter starting at chapter head 1114A, a third chapter starting at chapter head 1114B, and a fourth chapter starting at chapter head 1114C. In other examples, there may be more or fewer than four chapters in a video. In some examples, a title of the chapter may be displayed next to the chapter head when hovering over the chapter head with a finger, stylus, or other pointer device. The chapter head icon, e.g., 1114A-1114C, may be rectangular, oval, circular, triangular, or any shape that a user may easily recognize without confusing the chapter head for the slider. In some examples, the chapter head may be the same shape as the slider 1104, but be a different color. In some examples, the start position 1106 may include a chapter head icon.
[0069] Rotating the device 1100 from a portrait orientation in FIG. 11A to a landscape orientation in FIG. 11B may result in the presented video asset switching from the first video asset to the second video asset, such as described above. For example, in FIG. 11B, the device 1100 is in landscape orientation displaying a second video asset (e.g., landscape orientation) with the adaptive scrubber 1110 oriented horizontally. The adaptive scrubber 1110 includes chapter heads 1114A, 1114B, and 1114C located along a horizontally oriented scrubber bar 1102.
[0070] The chapters may indicate different sections of the video asset. According to one example, if the video is a news show, the chapters may correspond to different news segments. According to another example, if the video is a movie, the chapters may correspond to shorter narrative arcs within the movie and/or suggest a recommended location for a viewer to pause while watching the video. In some examples, a creative entity and/or distributor of the content may determine the placement of the chapters. In some examples, a user may create their own chapters, as described below with respect to bookmarking and FIGs. 13A-13D.
[0071] In some examples, a first video asset, e.g., having a portrait orientation, may be different from a second video asset, e.g., having a landscape orientation, as described above (e.g., FIGs. 3A-3B). Likewise, in some examples, the adaptive scrubber corresponding to the first video asset may be different from the adaptive scrubber corresponding to the second video asset. For example, the chapter locations for the first and second video assets may correspond to different times along the scrubber bar, to complement differences between the first and second video assets. In some examples, the adaptive scrubber corresponding to a first video asset may have a different number of chapters than the adaptive scrubber corresponding to a second video asset.
[0072] FIGs. 12A-12D illustrate examples of an adaptive scrubber 1210 layout that includes social media and sharing capabilities. In FIG. 12A, a device 1200 displays a first video asset in a portrait orientation and a corresponding adaptive scrubber 1210 oriented vertically, including a scrubber bar 1202 having a start position 1206 and an end position 1208. The scrubber bar 1202 may include a slider or playhead 1204. The slider 1204 may move along the scrubber bar 1202 to indicate the playback time that has elapsed. The adaptive scrubber may also include playback controls 1212. The playback controls 1212 may include a fast-forward, a rewind, and a play/pause icon. The adaptive scrubber 1210 may also include a plurality of social media icons or virtual buttons 1216 that may be associated with, for example, a social networking platform (e.g., Facebook, Pinterest, Twitter, Tumblr, etc.), or a messaging service (e.g., SMS text, WhatsApp). As shown in FIG. 12A, the plurality of social media icons 1216 may be positioned near the playback controls 1212 at the bottom of the display. In some examples, the social media icons may be arranged vertically to the right or left of the vertically oriented adaptive scrubber. When the device 1200 is rotated, for example from the portrait orientation illustrated in FIG. 12A to the landscape orientation illustrated in FIG. 12B, such as described above, the adaptive scrubber 1210 may similarly transition (e.g., rotate) to be oriented horizontally. For example, in FIG. 12B, the scrubber bar 1202 is located near the bottom of the display with the playback controls 1212 and social media icons 1216 positioned beneath the scrubber bar 1202. In some examples, the social media icons may be positioned above the scrubber bar 1202.
[0073] A user that desires to share a link to the video asset playing on a device 1200 may select one of the social media icons 1216. Referring to FIG. 12C, in one example, a user may select a share icon 1216C. A pop-up 1230 may appear on the screen corresponding to options associated with the share icon 1216C. For example, the pop-up 1230C may include one or more messaging or social media icons 1232 that correspond to a messaging or social networking app or service (e.g., SMS text, WhatsApp, Instagram, etc.) that can be used to share the video asset link. The pop-up 1230 may also include a dialog box 1234 where a user can identify a recipient (e.g., a user of a social networking platform) and/or type a message to the recipient. The pop-up 1230 may also include a timer 1236 corresponding to the current playback time of the video. In this manner, a user may send a link to an exact location in the playback of the video. The user may use the playback controls 1212 to change the location in the video to send to the intended recipient. In some examples, the user may prefer to send a link to the video from the beginning, without specifying a time, and may de-select the timer to do so. FIG. 12D shows an alternate layout for the pop-up 1230D. Pop-up 1230D may, for example, include the playback controls 1212D located in the pop-up 1230D near the messaging icons 1232 for ease of use.
[0074] The content of the pop-up can be tailored to the social media icon selected. For example, if a user selects the Facebook icon, the user may be given the option to post on their personal wall, a friend’s wall, or send the link over Facebook Messenger. In some examples, the link sent through the social media capabilities may require the intended recipient to watch the same video asset that the user was watching before sending the link. For instance, if the user is watching a first video asset in portrait orientation and sends a link, the intended recipient can receive a link to the same video asset, e.g., to watch the video in portrait orientation. In this manner, the user can ensure that the intended recipient will have a similar experience with the video content by watching the video in the same orientation. In other examples, the intended recipient may have an option to watch the same or a different video asset. For example, an intended recipient may receive a link to both video assets, and depending on the detected orientation of the device, the appropriate video asset will play, e.g., if the device is in a portrait orientation, the video asset corresponding to the portrait orientation will play. In this example, a user may rotate the device during playback and the video asset should switch accordingly, e.g., from a first video asset in portrait orientation to a second video asset in landscape orientation. In some examples where an intended recipient can choose to watch the first or second video asset, the message accompanying the link may suggest a preferred orientation.
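A sketch of building a timestamped share link, per paragraphs [0073]-[0074], follows. The URL scheme and parameter names are assumptions for illustration only; an actual service would define its own.

```java
public class ShareLink {

    /**
     * Builds a hypothetical share URL. assetVariant (e.g., "portrait" or
     * "landscape") lets the link pin the recipient to the sender's video asset;
     * includeTime controls whether the current playback position is deep-linked.
     */
    public static String buildLink(String videoId, String assetVariant,
                                   long playbackSec, boolean includeTime) {
        StringBuilder url = new StringBuilder("https://example.com/watch?v=").append(videoId);
        url.append("&variant=").append(assetVariant);
        if (includeTime) {
            url.append("&t=").append(playbackSec); // link to the exact playback location
        }
        return url.toString();
    }

    public static void main(String[] args) {
        // e.g., sharing the portrait asset at 2 minutes 5 seconds into playback.
        System.out.println(buildLink("abc123", "portrait", 125, true));
    }
}
```

Omitting the variant parameter (or having the receiving application ignore it) would correspond to the alternative described above, in which the recipient's device selects whichever asset matches its own orientation.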
[0075] FIGs. 13A-13D illustrate an example adaptive scrubber 1310 layout that includes bookmarking capabilities. In FIG. 13A, a device 1300 displays a first video asset in portrait orientation and a corresponding adaptive scrubber 1310 oriented vertically, including a scrubber bar 1302 having a start position 1306 and an end position 1308. The scrubber bar 1302 may include a slider or playhead 1304. The slider 1304 may move along the scrubber bar 1302 to indicate the playback time that has elapsed. The adaptive scrubber may also include playback controls 1312. The playback controls 1312 may include a fast-forward, a rewind, and a play/pause icon. The playback controls may be arranged as described with respect to FIGs. 9A or 9C. The adaptive scrubber 1310 may also include a plurality of social media icons or virtual buttons 1316. The social media icons may include, for example, Facebook, Pinterest, Twitter, Tumblr, and the like. The adaptive scrubber 1310 may also include a bookmarking icon 1348. As shown in FIG. 13A, the bookmarking icon 1348 may be positioned near the plurality of social media icons 1316 with the playback controls 1312 at the bottom of the display. The bookmarking icon may be located in other areas, for example near the start position 1306, the end position 1308, or the slider 1304. As a user rotates the device 1300 from portrait orientation to landscape orientation, the adaptive scrubber user interface elements including, but not limited to, the scrubber bar 1302, slider 1304, playback controls 1312, social media icons 1316, and bookmarking icon 1348 may change position as described above (e.g., FIGs. 12A and 12B).
[0076] A user that desires to bookmark a start of a video segment along the scrubber bar 1302 may select the bookmarking icon 1348. Referring to FIG. 13B, in one example, selecting the bookmarking icon 1348 may cause a pop-up 1330B to appear. As seen in FIG. 13B, the pop-up 1330B may be positioned adjacent to the slider 1304. The pop-up 1330B may include a dialog box 1334B where a user can type a note or comment about the bookmark. The pop-up 1330B may include a timer 1336B corresponding to the current playback time of the video. The user may use the playback controls 1312 to change a location along the scrubber bar 1302, so that the user may bookmark the precise time in the video. FIG. 13C shows an alternate layout for the pop-up 1330C. Pop-up 1330C may, for example, include the playback controls 1312C located in the pop-up 1330C for ease of use.
[0077] Referring to FIG. 13D, once a bookmark has been saved, bookmark icon 1346D may appear, corresponding to the location of the bookmark along the scrubber bar 1302D. In some examples, bookmarks may be saved with respect to a particular video asset. For example, if a user is watching a first video asset and saves a bookmark while in the first video asset, then the bookmark may be saved to the first video asset and not the second. In this example, the bookmark would not appear on the scrubber bar corresponding to the second video asset. In some examples, if the user is watching the second video asset (e.g., in landscape orientation) and changes the orientation of the device (e.g., to portrait orientation) for ease of typing or other reasons while saving the bookmark, the bookmark may be saved to the second video asset. In this example, the bookmark would not appear on the scrubber bar corresponding to the first video asset. In some examples, bookmarks saved by the user will save to both the first video asset and the second video asset, e.g., horizontal and vertical orientations of the adaptive scrubber.
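Per-asset bookmark storage such as described in paragraph [0077] might be sketched as follows; the types shown are hypothetical placeholders, and keeping a set of asset identifiers with each bookmark allows a bookmark to be scoped to one asset or shared by both.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

public class Bookmarks {

    /** A saved bookmark: its time, an optional note, and the assets it belongs to. */
    public record Bookmark(double timeSec, String note, Set<String> assetIds) {}

    private final List<Bookmark> saved = new ArrayList<>();

    public void save(double timeSec, String note, Set<String> assetIds) {
        saved.add(new Bookmark(timeSec, note, assetIds));
    }

    /** Bookmarks to render on the scrubber bar for the asset currently playing. */
    public List<Bookmark> forAsset(String assetId) {
        return saved.stream()
                .filter(b -> b.assetIds().contains(assetId))
                .toList();
    }
}
```

Saving with assetIds = Set.of("portrait") reproduces the single-asset behavior above, while Set.of("portrait", "landscape") makes the bookmark appear on both scrubber layouts.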
[0078] In some examples, the bookmark icon 1346 may be rectangular, oval, circular, triangular, or any shape that a user may easily recognize without confusing the bookmark icon 1346 for a chapter head or the slider 1304. In some examples, the bookmark icon may be the same shape as the chapter head or slider, but be a different color. In some examples, the bookmarks may have a similar function as chapters and use the same icon as a chapter head. In some examples, a user may have an option to designate a selected time location as a bookmark or a chapter. In some examples, there may be a bookmarking icon as well as a separate chapter icon. In such examples, the chapter icon may work as described above with respect to the bookmarking icon. One skilled in the art would understand that a user may save multiple bookmarks for a video asset.
[0079] According to some examples, the adaptive scrubber may include a movable scrubber bar. FIGs. 14A-14B illustrate examples of a movable scrubber bar 1402. For example, FIG. 14A illustrates a device 1400 displaying a first video asset in a portrait orientation with the scrubber bar 1402 of adaptive scrubber 1410 positioned vertically at a first position on the right side of the display. The device 1400 may detect a first location of touch 1420A by the user along the scrubber bar 1402. The user may drag the scrubber bar 1402 to a second position on the display by moving the first location of touch 1420A to a second location of touch 1420B, as shown in FIG. 14B. In order to avoid treating errant touches and contact with the scrubber bar as drag commands, the device may require that the first location of touch 1420A be maintained for a predetermined amount of time, e.g., two seconds, and/or that the user apply a predetermined amount of pressure before the scrubber bar 1402 can be dragged.
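The drag-arming rule of paragraph [0079] might be sketched as follows, using the two-second hold mentioned above; the pressure threshold is optional and device-dependent.

```java
public class DragArming {
    // Example hold duration from the description above; real devices may tune this.
    private static final long HOLD_MILLIS = 2000;

    /**
     * Returns true once a touch has been held long enough (and, optionally,
     * firmly enough) to begin dragging the scrubber bar, so that errant
     * touches are not treated as drag commands.
     */
    public static boolean dragArmed(long touchDownMillis, long nowMillis,
                                    float pressure, float minPressure) {
        return (nowMillis - touchDownMillis) >= HOLD_MILLIS && pressure >= minPressure;
    }
}
```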
[0080] While the user can move the location of touch in both a horizontal and vertical direction, the movement of the scrubber bar will track the horizontal position of the location of touch. In other words, according to this example, moving the scrubber bar will not change its vertical position.
[0081] In some examples, the scrubber bar 1402 may stay put once the scrubber bar has been dragged to a new location. For example, referring to FIG. 14B, a user may drag the scrubber bar 1402 to a second position corresponding to the second location of touch 1420B. Once the user removes a finger or stylus from the device, the scrubber bar 1402 can remain in the second position. According to some examples, the playback controls may automatically move to avoid overlap between the scrubber bar 1402 and the location of the playback controls 1412. For example, referring to FIG. 14B, the playback controls 1412 may be off-centered to the left side of the device 1400 instead of centered on the centerline 1422 of the device. In some examples, a button of the playback controls may be located on a right side of the scrubber bar while the remaining buttons may be located on a left side of the scrubber bar. For example, the rewind and play icons may be to the left of the scrubber bar while the fast-forward icon may be to the right of the scrubber bar. In some examples, the playback controls may remain in the original location despite the overlap.
[0082] In some examples, the scrubber bar 1402 may snap to a location depending on the final location of touch (e.g., the location of touch preceding the user removing a finger or stylus from the screen). For example, still referring to FIG. 14B, the second location of touch is to the right of the centerline 1422 of the device 1400; thus, the scrubber bar 1402 may snap back to the right side of the display of the device 1400. In other words, the adaptive scrubber 1410 will have the same position that it started with in FIG. 14A. According to another example, if the second location of touch was to the left of the centerline 1422, the scrubber bar 1402 would snap to the left side of the display of device 1400, e.g., as illustrated in FIG. 9C.
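The snap-to-side behavior of paragraph [0082] might be sketched as follows; the centerline test mirrors the description above, and a different threshold line could be substituted, as noted in the rubber-band example below.

```java
public class SnapLogic {

    enum Side { LEFT, RIGHT }

    /**
     * Decides which display edge a vertical scrubber bar snaps to when the
     * drag ends, based on the final touch position relative to the centerline.
     */
    public static Side snapSide(int finalTouchX, int displayWidth) {
        return finalTouchX < displayWidth / 2 ? Side.LEFT : Side.RIGHT;
    }
}
```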
[0083] FIGs. 15A-15D illustrate examples of a draggable scrubber that may snap to a location with a rubber band effect. In FIG. 15A, device 1500 is displaying a first video asset in a portrait orientation with the scrubber bar 1502 of adaptive scrubber 1510 oriented vertically at a first position on the right side of the display. The device 1500 may detect a first location of touch 1520A by the user along the scrubber bar 1502 in the same manner described above with respect to FIGs. 14A-14B. Referring also to FIG. 15B, the user may drag the scrubber bar to a second location of touch 1520B. The scrubber bar 1502 showcases the rubber band effect, where a portion of the scrubber bar 1502 in FIG. 15B “stretches” to move with the location of touch from the first position 1520A to the second position 1520B. Still referring to FIG. 15B, the second location of touch is to the right of the centerline 1522 of the device 1500; thus, the scrubber bar 1502 may snap back to the right side of the display of the device 1500 if the user were to remove pressure, e.g., no longer contact the display of the device 1500 at the second location of touch 1520B. In some examples, a line other than centerline 1522 could be used as the threshold to determine the side of the display to which the scrubber bar 1502 will snap back.
[0084] According to some examples, the second location of touch may be located to the left of the centerline of the device. For example, in FIG. 15C, the user may drag the scrubber bar 1502 to a second location of touch 1520C to the left of the centerline 1522. Once the second location of touch 1520C moves to the left of the centerline 1522, the start position 1506 and end position 1508 of the scrubber bar 1502 may snap into position on the left side of the display of the device 1500. Once the user lifts the finger or stylus corresponding to the second location of touch 1520C, the scrubber bar 1502 may snap fully into place as illustrated in FIG. 15D.
[0085] In some examples, the media content application may prompt a user to select a preferred amount of mobility for the scrubber bar, e.g., no movement, free movement, snap to location, rubber band effect. In some examples, a user may be able to configure the mobility of the scrubber bar in the settings of the video application.
[0086] The examples described above may operate on one or more computers (e.g., one or more servers), including non-transitory computer readable recording media on a computer. This readable media contains the program instructions for accomplishing various steps described above. In the context of this disclosure, a computer-readable recording medium can be any medium that can contain or store programming for use by or in connection with an instruction execution system, apparatus, or device. Such computer readable media may be stored on a memory, where a memory is any device capable of storing a computer readable medium and capable of being accessed by a computer. A memory may include additional features. As used herein, a computer can comprise a conventional computer or one or more mobile devices. A computer may include a processor. A processor can be any device suitable to access a memory and execute a program stored thereon.
[0087] Communications may be transmitted between nodes over a communications network, such as the Internet. Other communications technology may include, but is not limited to, any combination of wired or wireless digital or analog communications channels, such as instant messaging (IM), short message service (SMS), multimedia messaging service (MMS) or a phone system (e.g., cellular, landline, or IP-based). These communications technologies can include Wi-Fi, Bluetooth, or other wireless radio technologies.
[0088] Examples of the disclosure may be implemented in any suitable form, including hardware, software, firmware, or any combination of these. Examples of the disclosure may optionally be implemented partly as computer software running on one or more data processors and/or digital signal processors. The elements and components of an example of the disclosure may be physically, functionally, and logically implemented in any suitable way. Indeed, the functionality may be implemented in a single unit, in multiple units, or as part of other functional units. As such, examples of the disclosure may be implemented in a single unit or may be physically and functionally distributed between different units and processors.
[0089] FIG. 8 illustrates an example computer 800 (which may comprise a mobile device) capable of implementing the disclosed examples. Example computer 800 includes a memory 802, a processor 804, an input interface 806, an output interface 808, and a communications interface 810.
[0090] Memory 802 may include volatile and non-volatile storage. For example, memory storage may include read only memory (ROM) in a hard disk device (HDD), random access memory (RAM), flash memory, and the like. The Operating System (OS) and application programs may be stored in ROM.
[0091] Specific software modules that implement embodiments of the described systems and methods may be incorporated in application programs on a server. The software may execute under control of an OS.
[0092] Processor 804 may include any device suitable to access a memory and execute a program stored thereon.
[0093] Input interface 806 may include a keyboard or mouse, for example. Output interface 808 may include a conventional color monitor and printer, such as a conventional laser printer. Output interface 808 may provide requisite circuitry to electrically connect and interface the display and printer to the computer system.
[0094] Communications interface 810 may allow the network and nodes to connect directly, or over another network, to other nodes or networks. The network can include, for example, a local area network (LAN), a wide area network (WAN), or the Internet. In some examples, the network, modules, and nodes can be connected to another client, server, or device via a wireless interface.
[0095] In some examples, the input interface, processor, memory, communications interface, output interface, or combinations thereof, are interconnected by a bus.
[0096] The disclosed examples could be embodied as a JAVA tool, which means it can run on any platform that is JAVA enabled. Examples can run on a web server that provides a website for administrators to monitor the system results remotely. Anyone with administrative access to the web server can connect to and use visualization tools to take actions within a visualization. The examples can run on any type of server, including virtual servers or an actual machine. While JAVA is provided as an example, any suitable programming language or technology can be used to implement the examples of the disclosure.
[0097] The disclosed examples may be embodied on a distributed processing system to break processing apart into smaller jobs that can be executed by different processors in parallel. The results of the parallel processing could then be combined once completed.

[0098] Although the present invention has been fully described in connection with examples thereof with reference to the accompanying drawings, it is to be noted that various changes and modifications will become apparent to those skilled in the art. Such changes and modifications are to be understood as being included within the scope of the claimed subject matter. The various examples of the invention should be understood to have been presented by way of example only, and not by way of limitation. Although the invention is described above in terms of various examples and implementations, it should be understood that the various features and functionality described in one or more of the individual examples are not limited in their applicability to the particular example with which they are described. They instead can be applied, alone or in some combination, to one or more of the other examples of the invention, whether or not such examples are described, and whether or not such features are presented as being a part of a described example. Thus the breadth and scope of the claimed subject matter should not be limited by any of the above-described examples.
[0099] Terms and phrases used in this document, and variations thereof, unless otherwise expressly stated, should be construed as open ended as opposed to limiting. As examples of the foregoing, the term “including” should be read as meaning “including, without limitation” or the like; the term “example” is used to provide exemplary instances of the item in discussion, not an exhaustive or limiting list thereof; and adjectives such as “conventional,” “traditional,” “normal,” “standard,” “known,” and terms of similar meaning should not be construed as limiting the item described to a given time period, or to an item available as of a given time. These terms should instead be read to encompass conventional, traditional, normal, or standard technologies that may be available or known now, or at any time in the future. Likewise, a group of items linked with the conjunction “and” should not be read as requiring that each and every one of those items be present in the grouping, but rather should be read as “and/or” unless expressly stated otherwise. Similarly, a group of items linked with the conjunction “or” should not be read as requiring mutual exclusivity among that group, but rather should also be read as “and/or” unless expressly stated otherwise. Furthermore, although items, elements, or components of the invention may be described or claimed in the singular, the plural is contemplated to be within the scope thereof unless limitation to the singular is explicitly stated. For example, “at least one” may refer to a single item or to plural items and is not limited to either. The presence of broadening words and phrases such as “one or more,” “at least,” “but not limited to,” or other like phrases in some instances shall not be read to mean that the narrower case is intended or required in instances where such broadening phrases may be absent. The word “exemplary” is used herein to mean “serving as an example or illustration.” Any aspect or design described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects or designs.
[00100] It will be appreciated that, for clarity purposes, the above description has described examples of the invention with reference to different functional units and modules. However, it will be apparent that any suitable distribution of functionality between different functional units, processing logic elements, or domains may be used without detracting from the invention. For example, functionality illustrated to be performed by separate processing logic elements, or controllers, may be performed by the same processing logic element, or controller. Hence, references to specific functional units are only to be seen as references to suitable means for providing the described functionality, rather than indicative of a strict logical or physical structure or organization. It should be understood that the specific order or hierarchy of steps in the processes disclosed herein is an illustration of exemplary approaches. Based upon design preferences, it is understood that the specific order or hierarchy of steps in the processes may be rearranged while remaining within the scope of the claimed subject matter. Further, in some examples, some steps in the processes disclosed herein may be forgone altogether while remaining within the scope of the claimed subject matter.
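Before turning to the claims, the following non-limiting Java sketch illustrates the orientation-based selection logic the claims recite: a desired aspect ratio is derived from the orientation sensor output, the received video asset whose aspect ratio is closer to the desired ratio is selected, and the playback interface associated with that ratio is selected with it. The VideoAsset and PlaybackInterface types, the 16:9 and 9:16 ratios, and the tie-breaking rule are assumptions for illustration only, not a limiting implementation.

    // Illustrative sketch only; not a limiting implementation of the claims.
    public class AssetSelection {
        record VideoAsset(String id, double aspectRatio) {}
        record PlaybackInterface(String name, double aspectRatio) {}
        record Selection(VideoAsset asset, PlaybackInterface playbackInterface) {}

        // Select whichever asset's aspect ratio is closer to the desired ratio,
        // together with the playback interface associated with that ratio.
        static Selection select(double desiredRatio,
                                VideoAsset first, PlaybackInterface firstUi,
                                VideoAsset second, PlaybackInterface secondUi) {
            double dFirst = Math.abs(desiredRatio - first.aspectRatio());
            double dSecond = Math.abs(desiredRatio - second.aspectRatio());
            return dFirst <= dSecond ? new Selection(first, firstUi)
                                     : new Selection(second, secondUi);
        }

        public static void main(String[] args) {
            VideoAsset landscape = new VideoAsset("asset-16x9", 16.0 / 9.0);
            VideoAsset portrait = new VideoAsset("asset-9x16", 9.0 / 16.0);
            PlaybackInterface landscapeUi = new PlaybackInterface("landscape-ui", 16.0 / 9.0);
            PlaybackInterface portraitUi = new PlaybackInterface("portrait-ui", 9.0 / 16.0);

            // Assume the orientation sensor indicates the device is held upright,
            // so the desired aspect ratio corresponds to a portrait display.
            double desiredRatio = 9.0 / 16.0;
            Selection s = select(desiredRatio, landscape, landscapeUi, portrait, portraitUi);
            System.out.println(s.asset().id() + " with " + s.playbackInterface().name());
        }
    }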

Claims

What is claimed is:
1. A method comprising: receiving, at a device comprising a display and an orientation sensor, a plurality of assets comprising: a first video asset associated with a first aspect ratio; and a second video asset associated with a second aspect ratio, different from the first aspect ratio; determining, based on an output of the orientation sensor, a desired aspect ratio; in accordance with a determination that the desired aspect ratio is closer to the first aspect ratio than to the second aspect ratio, selecting the first video asset and further selecting a first playback interface associated with the first aspect ratio; in accordance with a determination that the desired aspect ratio is closer to the second aspect ratio than to the first aspect ratio, selecting the second video asset and further selecting a second playback interface associated with the second aspect ratio; presenting, via the display, the selected video asset at the desired aspect ratio; and presenting, via the display, concurrently with presenting the selected video asset, the selected playback interface.
2. The method of claim 1, wherein the first playback interface comprises: a scrubber bar having a start point and an end point; a slider of the scrubber bar; and a plurality of playback controls.
3. The method of claim 2, further comprising: receiving input corresponding to a playback control of the plurality of playback controls; and in response to receiving the input: determining, based on the playback control, a command corresponding to one or more of a rewind command, a pause command, a play command, and a fast-forward command; and executing the command.
4. The method of claim 1, further comprising: receiving a touch input; and in response to receiving the touch input, determining, based on the touch input, a desired display location, wherein presenting the selected playback interface comprises presenting the selected playback interface on the display at the desired display location.
5. The method of claim 1, wherein the first playback interface comprises a first scrubber bar having a first length, and the second playback interface comprises a second scrubber bar having a second length the same as the first length.
6. The method of claim 1, further comprising: receiving a first input corresponding to an icon associated with a social networking platform; receiving a second input identifying a target user of the social networking platform; and in response to receiving the second input, sending a link to the target user, wherein the link is associated with a first location of the first video asset and is further associated with a second location of the second video asset, the second location corresponding to the first location.
7. The method of claim 6, wherein the link is further associated with the selected video asset.
8. The method of claim 1, further comprising: determining a first focus region associated with the selected video asset; wherein presenting the selected playback interface comprises presenting the selected playback interface in a second region of the display that does not overlap with the first focus region.
9. The method of claim 8, wherein determining the first focus region comprises determining the first focus region based on metadata associated with the selected video asset.
10. The method of claim 8, wherein determining the first focus region comprises determining the first focus region based on video display data associated with the selected video asset.
11. The method of claim 1, further comprising: determining a location of a user’s hand with respect to the display; and determining, based on the location of the user’s hand, a desired display location, wherein presenting the selected playback interface comprises presenting the selected playback interface on the display at the desired display location.
12. A device comprising: a display; an orientation sensor; and one or more processors configured to execute instructions which, when executed, cause the device to perform a method comprising: receiving a plurality of assets comprising: a first video asset associated with a first aspect ratio; and a second video asset associated with a second aspect ratio, different from the first aspect ratio; determining, based on an output of the orientation sensor, a desired aspect ratio; in accordance with a determination that the desired aspect ratio is closer to the first aspect ratio than to the second aspect ratio, selecting the first video asset and further selecting a first playback interface associated with the first aspect ratio; in accordance with a determination that the desired aspect ratio is closer to the second aspect ratio than to the first aspect ratio, selecting the second video asset and further selecting a second playback interface associated with the second aspect ratio; presenting, via the display, the selected video asset at the desired aspect ratio; and presenting, via the display, concurrently with presenting the selected video asset, the selected playback interface.
13. The device of claim 12, wherein the first playback interface comprises: a scrubber bar having a start point and an end point; a slider of the scrubber bar; and a plurality of playback controls.
14. The device of claim 13, wherein the method further comprises: receiving input corresponding to a playback control of the plurality of playback controls; and in response to receiving the input: determining, based on the playback control, a command corresponding to one or more of a rewind command, a pause command, a play command, and a fast-forward command; and executing the command.
15. The device of claim 12, wherein the method further comprises: receiving a touch input; and in response to receiving the touch input, determining, based on the touch input, a desired display location, and wherein presenting the selected playback interface comprises presenting the selected playback interface on the display at the desired display location.
16. The device of claim 12, wherein the first playback interface comprises a first scrubber bar having a first length, and the second playback interface comprises a second scrubber bar having a second length the same as the first length.
17. The device of claim 12, wherein the method further comprises: receiving a first input corresponding to an icon associated with a social networking platform; receiving a second input identifying a target user of the social networking platform; and in response to receiving the second input, sending a link to the target user, wherein the link is associated with a first location of the first video asset and is further associated with a second location of the second video asset, the second location corresponding to the first location.
18. The device of claim 17, wherein the link is further associated with the selected video asset.
19. The device of claim 12, wherein the method further comprises: determining a first focus region associated with the selected video asset, and wherein presenting the selected playback interface comprises presenting the selected playback interface in a second region of the display that does not overlap with the first focus region.
20. The device of claim 19, wherein determining the first focus region comprises determining the first focus region based on metadata associated with the selected video asset.
21. The device of claim 19, wherein determining the first focus region comprises determining the first focus region based on video display data associated with the selected video asset.
22. The device of claim 12, wherein the method further comprises: determining a location of a user’s hand with respect to the display; and determining, based on the location of the user’s hand, a desired display location, and wherein presenting the selected playback interface comprises presenting the selected playback interface on the display at the desired display location.
23. A non-transitory computer-readable medium storing instructions which, when executed by one or more processors, cause the one or more processors to execute a method comprising: receiving, at a device comprising a display and an orientation sensor, a plurality of assets comprising: a first video asset associated with a first aspect ratio; and a second video asset associated with a second aspect ratio, different from the first aspect ratio; determining, based on an output of the orientation sensor, a desired aspect ratio; in accordance with a determination that the desired aspect ratio is closer to the first aspect ratio than to the second aspect ratio, selecting the first video asset and further selecting a first playback interface associated with the first aspect ratio; in accordance with a determination that the desired aspect ratio is closer to the second aspect ratio than to the first aspect ratio, selecting the second video asset and further selecting a second playback interface associated with the second aspect ratio; presenting, via the display, the selected video asset at the desired aspect ratio; and presenting, via the display, concurrently with presenting the selected video asset, the selected playback interface.
24. The non-transitory computer-readable medium of claim 23, wherein the first playback interface comprises: a scrubber bar having a start point and an end point; a slider of the scrubber bar; and a plurality of playback controls.
25. The non-transitory computer-readable medium of claim 23, wherein the method further comprises: receiving a touch input; and in response to receiving the touch input, determining, based on the touch input, a desired display location, and wherein presenting the selected playback interface comprises presenting the selected playback interface on the display at the desired display location.
26. The non-transitory computer-readable medium of claim 23, wherein the first playback interface comprises a first scrubber bar having a first length, and the second playback interface comprises a second scrubber bar having a second length the same as the first length.
27. The non-transitory computer-readable medium of claim 23, wherein the method further comprises: receiving a first input corresponding to an icon associated with a social networking platform; receiving a second input identifying a target user of the social networking platform; and in response to receiving the second input, sending a link to the target user, wherein the link is associated with a first location of the first video asset and is further associated with a second location of the second video asset, the second location corresponding to the first location.
28. The non-transitory computer-readable medium of claim 27, wherein the link is further associated with the selected video asset.
29. The non-transitory computer-readable medium of claim 23, wherein the method further comprises: determining a first focus region associated with the selected video asset, and wherein presenting the selected playback interface comprises presenting the selected playback interface in a second region of the display that does not overlap with the first focus region.
30. The non-transitory computer-readable medium of claim 23, wherein the method further comprises: determining a location of a user’s hand with respect to the display; and determining, based on the location of the user’s hand, a desired display location, and wherein presenting the selected playback interface comprises presenting the selected playback interface on the display at the desired display location.
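For illustration only, and forming no part of the claims, the following Java sketch shows one way the focus-region behavior of claims 8-10, 19-21, and 29 might be realized: the playback interface is placed in a display region that does not overlap a focus region determined for the selected video asset. The java.awt.Rectangle geometry and the bottom-then-top candidate placement are assumptions.

    import java.awt.Rectangle;

    // Illustrative sketch only: place the playback interface so that it does
    // not overlap a focus region determined for the selected video asset.
    public class InterfacePlacement {
        static Rectangle placeInterface(Rectangle display, Rectangle focusRegion,
                                        int interfaceHeight) {
            // Candidate 1: dock the interface along the bottom edge of the display.
            Rectangle bottom = new Rectangle(display.x,
                    display.y + display.height - interfaceHeight,
                    display.width, interfaceHeight);
            if (!bottom.intersects(focusRegion)) return bottom;

            // Candidate 2: fall back to the top edge if the bottom edge would
            // overlap the focus region.
            return new Rectangle(display.x, display.y, display.width, interfaceHeight);
        }

        public static void main(String[] args) {
            Rectangle display = new Rectangle(0, 0, 1080, 1920);
            // Assume metadata marks a region of interest in the lower half of the frame.
            Rectangle focus = new Rectangle(200, 1400, 600, 400);
            Rectangle ui = placeInterface(display, focus, 160);
            System.out.println("interface placed at y=" + ui.y);
        }
    }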
PCT/US2021/012376 2020-01-06 2021-01-06 Media content presentation WO2021142036A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16/735,511 US20200296316A1 (en) 2019-03-11 2020-01-06 Media content presentation
US16/735,511 2020-01-06

Publications (1)

Publication Number Publication Date
WO2021142036A1 true WO2021142036A1 (en) 2021-07-15

Family

ID=76787567

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2021/012376 WO2021142036A1 (en) 2020-01-06 2021-01-06 Media content presentation

Country Status (1)

Country Link
WO (1) WO2021142036A1 (en)

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180014049A1 (en) * 2016-07-08 2018-01-11 Tastemade, Inc. Orientation Based, Aspect Ratio Switching Video Playback System

Similar Documents

Publication Publication Date Title
US20200296316A1 (en) Media content presentation
US11563915B2 (en) Media content presentation
US20230012007A1 (en) Media seek mechanisms
CN110554818B (en) Apparatus, method and graphical user interface for navigating media content
US10405009B2 (en) Generating videos with multiple viewpoints
KR101919475B1 (en) Device and methodfor providing user interface
Burtnyk et al. Stylecam: interactive stylized 3d navigation using integrated spatial & temporal controls
US7589732B2 (en) System and method of integrated spatial and temporal navigation
EP3247110B1 (en) Method and device for handling multiple video streams using metadata
US9841883B2 (en) User interfaces for media application
EP1806666A2 (en) Method for presenting set of graphic images on television system and television system for presenting set of graphic images
US20120321280A1 (en) Picture Selection for Video Skimming
US8811797B2 (en) Switching between time order and popularity order sending of video segments
EP3459262B1 (en) Electronic device and method for rendering 360-degree multimedia content
JP2008141746A (en) System and method for playing moving images
CN110574379A (en) System and method for generating customized views of video
EP2472519A1 (en) Multi-video rendering for enhancing user interface usability and user experience
CN105122826B (en) System and method for displaying annotated video content by a mobile computing device
Hürst Interactive audio-visual video browsing
CN113286181A (en) Data display method and device
Bassbouss et al. Interactive 360° video and storytelling tool
WO2021142036A1 (en) Media content presentation
Schoeffmann et al. Video navigation on tablets with multi-touch gestures
US11659219B2 (en) Video performance rendering modification based on device rotation metric
US11765333B1 (en) Systems and methods for improved transitions in immersive media

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21738521

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 20/10/2022)

122 Ep: pct application non-entry in european phase

Ref document number: 21738521

Country of ref document: EP

Kind code of ref document: A1