WO2016127440A1 - 基于超文本传输协议媒体流的媒体呈现导览方法和相关装置 - Google Patents

基于超文本传输协议媒体流的媒体呈现导览方法和相关装置 Download PDF

Info

Publication number
WO2016127440A1
WO2016127440A1 PCT/CN2015/073148 CN2015073148W WO2016127440A1 WO 2016127440 A1 WO2016127440 A1 WO 2016127440A1 CN 2015073148 W CN2015073148 W CN 2015073148W WO 2016127440 A1 WO2016127440 A1 WO 2016127440A1
Authority
WO
WIPO (PCT)
Prior art keywords
navigation
media presentation
media
adaptation set
video adaptation
Prior art date
Application number
PCT/CN2015/073148
Other languages
English (en)
French (fr)
Inventor
张少波
王新
唐廷芳
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to KR1020177025344A priority Critical patent/KR101919726B1/ko
Priority to CN201580038222.5A priority patent/CN106664299B/zh
Priority to JP2017542417A priority patent/JP6478357B2/ja
Priority to PCT/CN2015/073148 priority patent/WO2016127440A1/zh
Priority to EP15881602.5A priority patent/EP3249873B1/en
Publication of WO2016127440A1 publication Critical patent/WO2016127440A1/zh
Priority to US15/677,436 priority patent/US20170374122A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/764Media network packet handling at the destination 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/40Network security protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/231Content storage operation, e.g. caching movies for short term storage, replicating data over plural servers, prioritizing data for deletion
    • H04N21/23109Content storage operation, e.g. caching movies for short term storage, replicating data over plural servers, prioritizing data for deletion by placing content in organized collections, e.g. EPG data repository
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/262Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists
    • H04N21/26283Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists for associating distribution time parameters to content, e.g. to generate electronic program guide data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details
    • H04N21/42208Display device provided on the remote control
    • H04N21/42209Display device provided on the remote control for displaying non-command information, e.g. electronic program guide [EPG], e-mail, messages or a second television channel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/482End-user interface for program selection
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/482End-user interface for program selection
    • H04N21/4825End-user interface for program selection using a list of items to be played back in a given order, e.g. playlists
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6106Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6125Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via Internet
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/64322IP
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/8549Creating video summaries, e.g. movie trailer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/858Linking data to content, e.g. by linking an URL to a video object, by creating a hotspot
    • H04N21/8586Linking data to content, e.g. by linking an URL to a video object, by creating a hotspot by using a URL

Definitions

  • the present invention relates to the field of data transmission, and in particular to a media presentation navigation method and related apparatus based on a hypertext transfer protocol media stream.
  • HTTP Hyper Text Transfer Protocol
  • the present invention provides a method and related apparatus for providing navigation media presentation based on a hypertext transfer protocol media stream, in order to support video navigation in an HTTP-based media streaming service scenario, thereby improving user experience.
  • a first aspect of the embodiments of the present invention provides a media presentation navigation method based on a hypertext transfer protocol media stream, which may include:
  • the client obtains a media presentation description of the navigation media presentation, wherein the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, and the N is an integer greater than one;
  • the client acquires K navigation units of the N navigation units according to the media presentation description presented by the navigation media;
  • the client presents the K navigation units, each navigation unit of the K navigation units points to a main media presentation, wherein the navigation unit i pointed to by the navigation unit i in the navigation unit
  • the presentation quality of the media presentation is higher than the presentation quality of the navigation unit i.
  • the media presentation description of the navigation media presentation is different from the primary media presentation pointed to by each navigation unit of the K navigation units The media presentation description.
  • each of the K navigation units points to the primary media presentation described by the media presentation description in a manner that points to the media presentation description.
  • the media presentation description of the navigation media presentation and the primary media presentation by the navigation unit of the K navigation units are aggregated to form an aggregated media presentation description.
  • each of the K navigation units is configured to reference the aggregated media presentation The way the element is rendered to point to a primary media presentation.
  • the N navigation Each of the navigation units in the unit includes a video component, or each of the N navigation units includes an audio component and a video component.
  • the video component included in the different navigation units of the K navigation units is K video suitable And media representation in different video adaptation sets in the set, wherein media expressions in any one of the K video adaptation sets have selective mutual exclusion, and the K video adaptation sets are different There is selectivity compatibility between video adaptation sets.
  • the K navigation unit includes an audio component that is a media expression in an audio adaptation set, the audio The adaptation set is different from any one of the K adaptation sets, and the audio component adaptation set has selection compatibility with the K video adaptation sets;
  • the audio components included in the different navigation units of the K navigation units are media representations in different audio adaptation sets in the K audio adaptation sets, wherein different audio adaptations in the K audio adaptation sets There is selective mutual exclusion between sets.
  • the media expression element in the audio adaptation set element includes the media expression described in the navigation A description of the area of the associated area in the media presentation.
  • the media expressions described by the media expression elements including the same area description have an association relationship, or include the same
  • the adaptation set described by the adaptation set element of the region description has an association relationship.
  • the region description is an SRD spatial relationship description.
  • the media presentation of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets; wherein the K video adaptation set elements include a descriptor Element Ci, having a selection compatibility between the video adaptation sets described in the K video adaptation set elements that satisfy the set common condition video adaptation set element, the set commonality condition being a video adaptation set
  • the element name and the method identification schemeIdUri attribute of the description sub-element Ci included in the element are the same.
  • the description sub-element Ci describes a video adaptation set element including the description sub-element Ci
  • the media representations in the described video adaptation set are part of the navigation media presentation.
  • the description sub-element Ci describes a video adaptation set element corresponding to the description sub-element Ci
  • the video adaptation of the centralized media expresses the role presented in the navigation media.
  • the descriptive sub-element Ci acts Explain the Role element or the basic property EssentialProptery element or the supplementary property SupplementalProptery element.
  • the set common condition is video suitable
  • the description element element Ci included in the matching element has the same element name, the method identification schemeIdUri attribute is the same, and the parameter value attribute is the same.
  • the media presentation description of the navigation media presentation includes the K video adaptation set elements, and the K video adaptation set elements and One-to-one correspondence between K video adaptation sets,
  • the video adaptation set element VI corresponding to the video adaptation set I of the K video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I is the K Any video adaptation set in the video adaptation set.
  • the pointer is carried by the attributes of the video adaptation set element VI.
  • the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
  • the pointer is carried by a basic attribute EssentialProptery element or a supplementary attribute SupplementalProperty element in the video adaptation set element VI.
  • the pointer is a child element in an EssentialProptery element of the video adaptation set element VI Carrying, or the pointer is carried by an attribute of an EssentialProptery element in the video adaptation set element VI; or the pointer is carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer It is carried by the attribute of the SupplementalProperty element in the video adaptation set element VI.
  • the pointer is carried by a value attribute of an EssentialProptery element in the video adaptation set element VI
  • the pointer is carried by the value attribute of the SupplementalProperty element of the video adaptation set element VI.
  • the pointer is represented by a virtual medium in the video adaptation set element VI to represent a Representation element Attribute bearing, or the pointer is carried by a child element in a virtual Representation element in the video adaptation set element VI, wherein the virtual Representation element does not include a media segment template element, a media segment list element, and a basic unification Resource locator BaseURL element.
  • the pointer is directed to the ReferencedMediaPresentation element by the media presentation in the video adaptation set element VI Hosted.
  • the time structure of the navigation media presentation does not depend on the temporal structure of the primary media presentation pointed to by the K navigation units in the navigation media presentation.
  • the method further comprises, in the case where the focus of attention stays in the navigation unit i of the K navigation units, the client presents the audio component of the navigation unit i.
  • the method further includes, in the case where the navigation unit i of the K navigation units is selected, the client acquires a main media presentation pointed to by the navigation unit i.
  • a second aspect of the embodiments of the present invention provides a media presentation navigation method based on a hypertext transfer protocol media stream, including:
  • the media presentation description of the navigation media presentation describing N navigation units included in the navigation media presentation, the N being an integer greater than 1; the N guides Each navigation unit in the browsing unit points to a main media presentation, wherein the presentation quality of the main media presentation pointed by the navigation unit i in the N navigation units is higher than the presentation quality of the navigation unit i .
  • the media presentation description of the navigation media presentation is different from the primary media pointed to by each navigation unit of the K navigation units The rendered media presentation description.
  • each of the N navigation units is directed to the media presentation description
  • the media presentation describes the primary media presentation described.
  • the media presentation description presented by the navigation media and the primary media presented by each navigation unit of the N navigation units are aggregated to form an aggregated media presentation description.
  • each of the N navigation units is configured to reference the aggregated media presentation The way the element is rendered to point to a primary media presentation.
  • the N navigation Each of the navigation units in the unit includes a video component, or each of the N navigation units includes an audio component and a video component.
  • the video component included in the different navigation units of the N navigation units is N video suitable And media representation in different video adaptation sets in the set, wherein media expressions in any one of the N video adaptation sets have selective mutual exclusion, and the N video adaptation sets are different There is selectivity compatibility between video adaptation sets.
  • the N navigation units include an audio component that is a media representation in an audio adaptation set, the audio The adaptation set is different from any one of the N adaptation sets, and the audio component adaptation set has a selection compatibility with the N video adaptation sets;
  • the audio components included in the different navigation units of the N navigation units are media representations in different audio adaptation sets in the N audio adaptation sets, wherein different audio adaptations in the N audio adaptation sets There is selective mutual exclusion between sets.
  • the media expression element in the audio adaptation set element includes an area description of the associated area in which the media expression is expressed in the navigation media presentation.
  • the media expressions described by the media expression elements including the same area description have an association relationship, or include the same
  • the adaptation set described by the adaptation set element of the region description has an association relationship.
  • the area description is an SRD spatial relationship description.
  • the media presentation of the navigation media presentation includes N video adaptation set elements, the N video adaptation set elements and the N video adaptation sets are in one-to-one correspondence; wherein the N video adaptation set elements include a descriptor Element Ci, having a selection compatibility between video adaptation sets described by the set of common condition video adaptation set elements in the N video adaptation set elements, the set common condition being a video adaptation set
  • the element name and the method identification schemeIdUri attribute of the description sub-element Ci included in the element are the same.
  • the description sub-element Ci describes a video adaptation set element including the description sub-element Ci
  • the media representations in the described video adaptation set are part of the navigation media presentation.
  • the description sub-element Ci describes a video adaptation set element corresponding to the description sub-element Ci
  • the video adaptation of the centralized media expresses the role presented in the navigation media.
  • the descriptive sub-element Ci acts Explain the Role element or the basic property EssentialProptery element or the supplementary property SupplementalProptery element.
  • the description sub-element Ci is an action description Role element
  • the setting common condition is video suitable
  • the description element element Ci included in the matching element has the same element name and method identification.
  • the schemeIdUri attribute is the same, and the parameter value attribute is the same.
  • the media that the navigation media presents The N video adaptation set elements are included in the presentation description, and the N video adaptation set elements are in one-to-one correspondence with the N video adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the N video adaptation set elements includes a pointer for pointing to a primary media presentation, where the video adaptation set I is the N Any video adaptation set in the video adaptation set.
  • the pointer is carried by the attributes of the video adaptation set element VI.
  • the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
  • the pointer is carried by a basic attribute EssentialProptery element or a supplementary attribute SupplementalProperty element in the video adaptation set element VI.
  • the pointer is a child element in an EssentialProptery element of the video adaptation set element VI Carrying, or the pointer is carried by an attribute of an EssentialProptery element in the video adaptation set element VI; or the pointer is carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer It is carried by the attribute of the SupplementalProperty element in the video adaptation set element VI.
  • the pointer is carried by a value attribute of an EssentialProptery element in the video adaptation set element VI
  • the pointer is carried by the value attribute of the SupplementalProperty element of the video adaptation set element VI.
  • the pointer is carried by an attribute of a virtual media expression Representation element in the video adaptation set element VI, or the pointer is carried by a child element in a virtual Representation element in the video adaptation set element VI, where
  • the virtual Representation element does not include a media segment template element, a media segment list element, and a base uniform resource locator BaseURL element.
  • the pointer is directed by the media presentation in the video adaptation set element VI to the ReferencedMediaPresentation element Hosted.
  • the time structure of the navigation media presentation does not depend on the temporal structure of the primary media presentation pointed to by the N navigation units in the navigation media presentation.
  • a third aspect of the present invention provides a client, including:
  • a first obtaining unit configured to acquire a media presentation description of the navigation media presentation, where the media presentation description presented by the navigation media describes N navigation units included in the navigation media presentation, where N is greater than An integer of 1;
  • a second acquiring unit configured to acquire K navigation units of the N navigation units according to the media presentation description presented by the navigation media
  • a presentation unit for presenting the K navigation units, each navigation unit of the K navigation units pointing to a main media presentation, wherein the navigation unit i in the K navigation units points The presentation quality of the main media presentation is higher than the presentation quality of the navigation unit i.
  • the media presentation description of the navigation media presentation is different from the primary media presentation pointed to by each navigation unit of the K navigation units The media presentation description.
  • each of the K navigation units is directed in a manner that points to a media presentation description
  • the media presentation describes the primary media presentation described.
  • the navigation medium is The current media presentation description and the media presentation descriptions of the primary media presentations pointed to by each of the K navigation units are aggregated to form an aggregated media presentation description.
  • each of the K navigation units is configured to reference the aggregated media presentation The way the element is rendered to point to a primary media presentation.
  • the N navigation Each of the navigation units in the unit includes a video component, or each of the N navigation units includes an audio component and a video component.
  • the video component included in the different navigation units of the K navigation units is K video suitable And media representation in different video adaptation sets in the set, wherein media expressions in any one of the K video adaptation sets have selective mutual exclusion, and the K video adaptation sets are different There is selectivity compatibility between video adaptation sets.
  • the K navigation unit includes an audio component that is a media expression in an audio adaptation set, the audio The adaptation set is different from any one of the K adaptation sets, and the audio component adaptation set has selection compatibility with the K video adaptation sets;
  • the audio components included in the different navigation units of the K navigation units are media representations in different audio adaptation sets in the K audio adaptation sets, wherein different audio adaptations in the K audio adaptation sets There is selective mutual exclusion between sets.
  • the media expression element in the audio adaptation set element includes the media expression described in the navigation A description of the area of the associated area in the media presentation.
  • the media expressions described by the media expression elements including the same area description have an association relationship, or include the same
  • the adaptation set described by the adaptation set element of the region description has an association relationship.
  • the area description is an SRD spatial relationship description.
  • the media presentation of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets; wherein the K video adaptation set elements include a descriptor Element Ci, having a selection compatibility between the video adaptation sets described in the K video adaptation set elements that satisfy the set common condition video adaptation set element, the set commonality condition being a video adaptation set
  • the element name and the method identification schemeIdUri attribute of the description sub-element Ci included in the element are the same.
  • the description sub-element Ci describes a video adaptation set element including the description sub-element Ci
  • the media representations in the described video adaptation set are part of the navigation media presentation.
  • the description sub-element Ci describes a video adaptation set element corresponding to the description sub-element Ci
  • the video adaptation of the centralized media expresses the role presented in the navigation media.
  • the description sub element Ci is a function Explain the Role element or the basic property EssentialProptery element or the supplementary property SupplementalProptery element.
  • the description sub-element Ci is an action description Role element
  • the setting common condition is video suitable
  • the description element element Ci included in the matching element has the same element name, the method identification schemeIdUri attribute is the same, and the parameter value attribute is the same.
  • the medium displayed by the navigation medium The K video adaptation set elements are included in the presentation description, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the K video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I is the K Any video adaptation set in the video adaptation set.
  • the pointer is carried by the attributes of the video adaptation set element VI.
  • the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
  • the pointer is carried by a basic attribute EssentialProptery element or a supplementary attribute SupplementalProperty element in the video adaptation set element VI.
  • the pointer is a child element in an EssentialProptery element in the video adaptation set element VI Carrying, or the pointer is carried by an attribute of an EssentialProptery element in the video adaptation set element VI; or the pointer is carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer It is carried by the attribute of the SupplementalProperty element in the video adaptation set element VI.
  • the pointer is carried by a value attribute of an EssentialProptery element in the video adaptation set element VI
  • the pointer is carried by the value attribute of the SupplementalProperty element of the video adaptation set element VI.
  • the pointer is carried by an attribute of a virtual media expression Representation element in the video adaptation set element VI, or the pointer is carried by a child element in a virtual Representation element in the video adaptation set element VI, where
  • the virtual Representation element does not include a media segment template element, a media segment list element, and a base uniform resource locator BaseURL element.
  • the pointer is pointed by the media presentation in the video adaptation set element VI to the ReferencedMediaPresentation element Hosted.
  • the time structure of the navigation media presentation does not depend on the temporal structure of the primary media presentation pointed to by the K navigation units in the navigation media presentation.
  • the presentation unit is further configured to present the audio component of the navigation unit i in the case where the focus of attention stays in the navigation unit i of the K navigation units.
  • the presentation unit is further configured to acquire, when the navigation unit i of the K navigation units is selected, the main media presentation pointed to by the navigation unit i.
  • a fourth aspect of the present invention provides a media presentation navigation apparatus, including:
  • a determining unit configured to determine N navigation units included in the navigation media presentation
  • a generating unit configured to generate a media presentation description of the navigation media presentation, where the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, and the N is an integer greater than 1;
  • Each of the N navigation units points to a main media presentation, and the presentation quality of the main media presentation pointed by the navigation unit i in the N navigation units is higher than that of the navigation unit i The quality of the presentation.
  • the media presentation description of the navigation media presentation is different from the primary media presentation pointed to by each navigation unit of the K navigation units The media presentation description.
  • each of the N navigation units is directed to the media presentation description
  • the media presentation describes the primary media presentation described.
  • the media presentation description presented by the navigation media and the primary media presented by each navigation unit of the N navigation units are aggregated to form an aggregated media presentation description.
  • each of the N navigation units is configured to reference the aggregate media presentation The way the element is rendered to point to a primary media presentation.
  • the N navigation Each of the navigation units in the unit includes a video component, or each of the N navigation units includes an audio component and a video component.
  • the video component included in the different navigation units of the N navigation units is N video suitable And media representation in different video adaptation sets in the set, wherein media expressions in any one of the N video adaptation sets have selective mutual exclusion, and the N video adaptation sets are different There is selectivity compatibility between video adaptation sets.
  • the N navigation unit includes an audio component that is a media expression in an audio adaptation set, the audio The adaptation set is different from any one of the N adaptation sets, and the audio component adaptation set has a selection compatibility with the N video adaptation sets;
  • the audio components included in the different navigation units of the N navigation units are media representations in different audio adaptation sets in the N audio adaptation sets, wherein different audio adaptations in the N audio adaptation sets There is selective mutual exclusion between sets.
  • the media expression element in the audio adaptation set element includes the media expression described in the navigation A description of the area of the associated area in the media presentation.
  • the media expressions described by the media presentation elements including the same region description are related to each other
  • the adaptation set described by the adaptation set element containing the same area description has an association relationship.
  • the area description is an SRD spatial relationship description.
  • the media presentation of the navigation media presentation includes N video adaptation set elements, the N video adaptation set elements and the N video adaptation sets are in one-to-one correspondence; wherein the N video adaptation set elements include a descriptor Element Ci, having a selection compatibility between video adaptation sets described by the set of common condition video adaptation set elements in the N video adaptation set elements, the set common condition being a video adaptation set
  • the element name and the method identification schemeIdUri attribute of the description sub-element Ci included in the element are the same.
  • the description sub-element Ci describes a video adaptation set element including the description sub-element Ci
  • the media representations in the described video adaptation set are part of the navigation media presentation.
  • the description sub-element Ci describes a video adaptation set element corresponding to the description sub-element Ci
  • the video adaptation of the centralized media expresses the role presented in the navigation media.
  • the description sub element Ci is a function Explain the Role element or the basic property EssentialProptery element or the supplementary property SupplementalProptery element.
  • the description sub-element Ci is an action description Role element
  • the setting common condition is video suitable
  • the description element element Ci included in the matching element has the same element name, the method identification schemeIdUri attribute is the same, and the parameter value attribute is the same.
  • the medium displayed by the navigation medium The N video adaptation set elements are included in the presentation description, and the N video adaptation set elements and N One-to-one correspondence between video adaptation sets,
  • the video adaptation set element VI corresponding to the video adaptation set I of the N video adaptation set elements includes a pointer for pointing to a primary media presentation, where the video adaptation set I is the N Any video adaptation set in the video adaptation set.
  • the pointer is carried by the attributes of the video adaptation set element VI.
  • the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
  • the pointer is carried by a basic attribute EssentialProptery element or a supplementary attribute SupplementalProperty element in the video adaptation set element VI.
  • the pointer is a child element in an EssentialProptery element in the video adaptation set element VI Carrying, or the pointer is carried by an attribute of an EssentialProptery element in the video adaptation set element VI; or the pointer is carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer It is carried by the attribute of the SupplementalProperty element in the video adaptation set element VI.
  • the pointer is carried by a value attribute of an EssentialProptery element in the video adaptation set element VI
  • the pointer is carried by the value attribute of the SupplementalProperty element of the video adaptation set element VI.
  • the pointer is carried by an attribute of a virtual media expression Representation element in the video adaptation set element VI, or the pointer is carried by a child element in a virtual Representation element in the video adaptation set element VI, where The virtual Representation element does not include a media segment module The version element, the media fragment list element, and the base uniform resource locator BaseURL element.
  • the pointer is pointed by the media presentation in the video adaptation set element VI to the ReferencedMediaPresentation element Hosted.
  • the time structure of the navigation media presentation does not depend on the temporal structure of the primary media presentation pointed to by the N navigation units in the navigation media presentation.
  • a fifth aspect of the present invention provides a client, including:
  • the processor by calling code or instructions in the memory, for obtaining a media presentation description of the navigation media presentation, wherein the media presentation description of the navigation media presentation describes the navigation media presentation N navigation units included, the N is an integer greater than 1; acquiring K navigation units of the N navigation units according to the media presentation description presented by the navigation media; presenting the K guides a navigation unit, each navigation unit of the K navigation unit points to a main media presentation, wherein a presentation quality of the main media presentation pointed by the navigation unit i in the K navigation units is higher than the guide View the presentation quality of unit i.
  • the media presentation description of the navigation media presentation is different from the primary media presentation pointed to by each navigation unit of the K navigation units The media presentation description.
  • each of the K navigation units is directed to the media presentation description
  • the media presentation describes the primary media presentation described.
  • the media presentation description presented by the navigation media and the primary media presented by each navigation unit of the K navigation units are aggregated to form an aggregated media presentation description.
  • each of the K navigation units is configured to reference the aggregated media presentation The way the element is rendered to point to a primary media presentation.
  • the N navigation Each of the navigation units in the unit includes a video component, or each of the N navigation units includes an audio component and a video component.
  • the video component included in the different navigation units of the K navigation units is K video suitable And media representation in different video adaptation sets in the set, wherein media expressions in any one of the K video adaptation sets have selective mutual exclusion, and the K video adaptation sets are different There is selectivity compatibility between video adaptation sets.
  • the K navigation unit includes an audio component that is a media expression in an audio adaptation set, the audio The adaptation set is different from any one of the K adaptation sets, and the audio component adaptation set has selection compatibility with the K video adaptation sets;
  • the audio components included in the different navigation units of the K navigation units are media representations in different audio adaptation sets in the K audio adaptation sets, wherein different audio adaptations in the K audio adaptation sets There is selective mutual exclusion between sets.
  • the media expression element in the audio adaptation set element includes the media expression described in the navigation A description of the area of the associated area in the media presentation.
  • the media expressions described by the media expression elements including the same area description have an association relationship, or include the same
  • the adaptation set described by the adaptation set element of the region description has an association relationship.
  • the area description is a description of the SRD spatial relationship.
  • the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements and the One-to-one correspondence between the K video adaptation sets; wherein the K video adaptation set elements include a description sub-element Ci, and the K common video adaptation set elements satisfy a set common condition video adaptation set
  • the set commonality condition is that the element name and the method identification schemeIdUri attribute of the description sub-element Ci included in the video adaptation set element are the same.
  • the description sub-element Ci describes a video adaptation set element including the description sub-element Ci
  • the media representations in the described video adaptation set are part of the navigation media presentation.
  • the description sub-element Ci describes a video adaptation set element corresponding to the description sub-element Ci
  • the video adaptation of the centralized media expresses the role presented in the navigation media.
  • the description sub element Ci is a function Explain the Role element or the basic property EssentialProptery element or the supplementary property SupplementalProptery element.
  • the description sub-element Ci is an action description Role element
  • the setting common condition is video suitable
  • the description element element Ci included in the matching element has the same element name, the method identification schemeIdUri attribute is the same, and the parameter value attribute is the same.
  • the media displayed by the navigation medium The K video adaptation set elements are included in the presentation description, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the K video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I is the K Any video adaptation set in the video adaptation set.
  • the seventeenth possible aspect of the fifth aspect In an embodiment,
  • the pointer is carried by the attributes of the video adaptation set element VI.
  • the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
  • the pointer is carried by a basic attribute EssentialProptery element or a supplementary attribute SupplementalProperty element in the video adaptation set element VI.
  • the pointer is a child element in an EssentialProptery element in the video adaptation set element VI Carrying, or the pointer is carried by an attribute of an EssentialProptery element in the video adaptation set element VI; or the pointer is carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer It is carried by the attribute of the SupplementalProperty element in the video adaptation set element VI.
  • the pointer is carried by a value attribute of an EssentialProptery element in the video adaptation set element VI
  • the pointer is carried by the value attribute of the SupplementalProperty element of the video adaptation set element VI.
  • the pointer is carried by an attribute of a virtual media expression Representation element in the video adaptation set element VI, or the pointer is carried by a child element in a virtual Representation element in the video adaptation set element VI, where
  • the virtual Representation element does not include a media segment template element, a media segment list element, and a base uniform resource locator BaseURL element.
  • the pointer is pointed by the media presentation in the video adaptation set element VI to the ReferencedMediaPresentation element Hosted.
  • the time structure of the navigation media presentation does not depend on the temporal structure of the primary media presentation pointed to by the K navigation units in the navigation media presentation.
  • the method processor is further configured to present the audio component of the navigation unit i in the case where the focus of attention stays in the navigation unit i of the K navigation units.
  • the method processor is further configured to: in the case where the navigation unit i in the K navigation units is selected, the client acquires a main media presentation pointed by the navigation unit i.
  • a sixth aspect of the embodiments of the present invention provides a media presentation navigation apparatus, including:
  • the processor invokes code or instructions in the memory for determining N navigation units included in the navigation media presentation; generating a media presentation description of the navigation media presentation, the media presented by the navigation media
  • the presentation description describes N navigation units included in the navigation media presentation, the N being an integer greater than 1; each of the N navigation units pointing to a primary media presentation, the N
  • the presentation quality of the primary media presentation pointed to by the navigation unit i in the navigation unit is higher than the presentation quality of the navigation unit i.
  • the media presentation description of the navigation media presentation is different from the primary media presentation pointed to by each navigation unit of the K navigation units The media presentation description.
  • each of the N navigation units is directed to the media presentation description
  • the media presentation describes the primary media presentation described.
  • the media presentation description presented by the navigation media and the primary media presented by each navigation unit of the N navigation units are aggregated to form an aggregated media presentation description.
  • each of the N navigation units is configured to reference the aggregate media The way the element is rendered to point to a primary media presentation.
  • the N navigation Each of the navigation units in the unit includes a video component, or each of the N navigation units includes an audio component and a video component.
  • the video component included in the different navigation units of the N navigation units is N video suitable And media representation in different video adaptation sets in the set, wherein media expressions in any one of the N video adaptation sets have selective mutual exclusion, and the N video adaptation sets are different There is selectivity compatibility between video adaptation sets.
  • the N navigation unit includes an audio component that is a media expression in an audio adaptation set, the audio The adaptation set is different from any one of the N adaptation sets, and the audio component adaptation set has a selection compatibility with the N video adaptation sets;
  • the audio components included in the different navigation units of the N navigation units are media representations in different audio adaptation sets in the N audio adaptation sets, wherein different audio adaptations in the N audio adaptation sets There is selective mutual exclusion between sets.
  • the media expression element in the audio adaptation set element includes the media expression described in the navigation A description of the area of the associated area in the media presentation.
  • the media expressions described by the media expression elements including the same area description have an association relationship, or include the same
  • the adaptation set described by the adaptation set element of the region description has an association relationship.
  • the area description is SRD space Department description.
  • the media presentation of the navigation media presentation includes N video adaptation set elements, the N video adaptation set elements and the N video adaptation sets are in one-to-one correspondence; wherein the N video adaptation set elements include a descriptor Element Ci, having a selection compatibility between video adaptation sets described by the set of common condition video adaptation set elements in the N video adaptation set elements, the set common condition being a video adaptation set
  • the element name and the method identification schemeIdUri attribute of the description sub-element Ci included in the element are the same.
  • the description sub-element Ci describes a video adaptation set element including the description sub-element Ci
  • the media representations in the described video adaptation set are part of the navigation media presentation.
  • the description sub-element Ci describes a video adaptation set element corresponding to the description sub-element Ci
  • the video adaptation of the centralized media expresses the role presented in the navigation media.
  • the description sub element Ci is a function Explain the Role element or the basic property EssentialProptery element or the supplementary property SupplementalProptery element.
  • the description sub-element Ci is an action description Role element
  • the setting common condition is video suitable
  • the description element element Ci included in the matching element has the same element name, the method identification schemeIdUri attribute is the same, and the parameter value attribute is the same.
  • the media displayed by the navigation medium The N video adaptation set elements are included in the presentation description, and the N video adaptation set elements are in one-to-one correspondence with the N video adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the N video adaptation set elements includes a pointer for pointing to a primary media presentation, where the video adaptation set I is the N Vision
  • the frequency adaptation focuses on any one of the video adaptation sets.
  • the pointer is carried by the attributes of the video adaptation set element VI.
  • the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
  • the pointer is carried by a basic attribute EssentialProptery element or a supplementary attribute SupplementalProperty element in the video adaptation set element VI.
  • the pointer is a child element in an EssentialProptery element in the video adaptation set element VI Carrying, or the pointer is carried by an attribute of an EssentialProptery element in the video adaptation set element VI; or the pointer is carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer It is carried by the attribute of the SupplementalProperty element in the video adaptation set element VI.
  • the pointer is carried by a value attribute of an EssentialProptery element in the video adaptation set element VI
  • the pointer is carried by the value attribute of the SupplementalProperty element of the video adaptation set element VI.
  • the pointer is carried by an attribute of a virtual media expression Representation element in the video adaptation set element VI, or the pointer is carried by a child element in a virtual Representation element in the video adaptation set element VI, where
  • the virtual Representation element does not include a media segment template element, a media segment list element, and a base uniform resource locator BaseURL element.
  • the pointer is pointed by the media presentation in the video adaptation set element VI
  • the ReferencedMediaPresentation element is hosted.
  • the time structure of the navigation media presentation does not depend on the temporal structure of the primary media presentation pointed to by the N navigation units in the navigation media presentation.
  • a seventh aspect of the present invention provides a communication system, including:
  • the client is configured to obtain a media presentation description of the navigation media presentation from the content server, where the media presentation description presented by the navigation media describes the N navigation units included in the navigation media presentation.
  • the N is an integer greater than 1; the media presentation description presented according to the navigation medium acquires K navigation units of the N navigation units from the content server; the K navigation units are presented, Each navigation unit of the K navigation units points to a main media presentation, wherein the presentation quality of the main media presentation pointed to by the navigation unit i in the K navigation units is higher than the presentation of the navigation unit i quality.
  • the media presentation description of the navigation media presentation is different from the primary media presentation pointed to by each navigation unit of the K navigation units The media presentation description.
  • each of the K navigation units is directed to the media presentation description
  • the media presentation describes the primary media presentation described.
  • the media presentation description presented by the navigation media and the primary media presented by each navigation unit of the K navigation units are aggregated to form an aggregated media presentation description.
  • each of the K navigation units is configured to reference the aggregated media presentation The way the element is rendered to point to a primary media presentation.
  • the N navigation Each of the navigation units in the unit includes a video component, or each of the N navigation units includes an audio component and a video component.
  • the video component included in the different navigation units of the K navigation units is K video suitable And media representation in different video adaptation sets in the set, wherein media expressions in any one of the K video adaptation sets have selective mutual exclusion, and the K video adaptation sets are different There is selectivity compatibility between video adaptation sets.
  • the K navigation unit includes an audio component that is a media expression in an audio adaptation set, the audio The adaptation set is different from any one of the K adaptation sets, and the audio component adaptation set has selection compatibility with the K video adaptation sets;
  • the audio components included in the different navigation units of the K navigation units are media representations in different audio adaptation sets in the K audio adaptation sets, wherein different audio adaptations in the K audio adaptation sets There is selective mutual exclusion between sets.
  • the media expression element in the audio adaptation set element includes the media expression described in the navigation A description of the area of the associated area in the media presentation.
  • the media expressions described by the media expression elements including the same area description have an association relationship, or include the same
  • the adaptation set described by the adaptation set element of the region description has an association relationship.
  • the area description is an SRD spatial relationship description.
  • the media presentation of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets; wherein the K video adaptation set elements include a descriptor Yuan a Ci, a selection compatibility between the video adaptation sets described in the K video adaptation set elements that satisfy the set common condition video adaptation set element, the set common condition being a video adaptation set
  • the element name and the method identification schemeIdUri attribute of the description sub-element Ci included in the element are the same.
  • the description sub-element Ci describes a video adaptation set element including the description sub-element Ci
  • the media representations in the described video adaptation set are part of the navigation media presentation.
  • the description sub-element Ci describes a video adaptation set element corresponding to the description sub-element Ci
  • the video adaptation of the centralized media expresses the role presented in the navigation media.
  • the description sub element Ci is a function Explain the Role element or the basic property EssentialProptery element or the supplementary property SupplementalProptery element.
  • the description sub-element Ci is an action description Role element
  • the setting common condition is video suitable
  • the description element element Ci included in the matching element has the same element name, the method identification schemeIdUri attribute is the same, and the parameter value attribute is the same.
  • the media presented by the navigation medium The K video adaptation set elements are included in the presentation description, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the K video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I is the K Any video adaptation set in the video adaptation set.
  • the pointer is carried by the attributes of the video adaptation set element VI.
  • the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
  • the pointer is carried by a basic attribute EssentialProptery element or a supplementary attribute SupplementalProperty element in the video adaptation set element VI.
  • the pointer is a child element in an EssentialProptery element in the video adaptation set element VI Carrying, or the pointer is carried by an attribute of an EssentialProptery element in the video adaptation set element VI; or the pointer is carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer It is carried by the attribute of the SupplementalProperty element in the video adaptation set element VI.
  • the pointer is carried by a value attribute of an EssentialProptery element in the video adaptation set element VI
  • the pointer is carried by the value attribute of the SupplementalProperty element of the video adaptation set element VI.
  • the pointer is carried by an attribute of a virtual media expression Representation element in the video adaptation set element VI, or the pointer is carried by a child element in a virtual Representation element in the video adaptation set element VI, where
  • the virtual Representation element does not include a media segment template element, a media segment list element, and a base uniform resource locator BaseURL element.
  • the pointer is pointed by the media presentation in the video adaptation set element VI to the ReferencedMediaPresentation element Hosted.
  • the time structure of the navigation media presentation does not depend on the temporal structure of the primary media presentation pointed to by the K navigation units in the navigation media presentation.
  • each navigation unit of the K navigation units can respectively point to one main media presentation, this is equivalent to the introduction between the navigation unit and the main media presentation.
  • Correlation which enables the client to obtain a media presentation description of the main media presentation j pointed to by the navigation unit i in the case where the navigation unit i of the K navigation units is selected, and thus The media presentation description of the primary media presentation j obtains the primary media presentation j for presentation, which is beneficial to facilitate flexible switching between the navigation media presentation and the primary media presentation, thereby implementing the HTTP-based media streaming service scenario. Support for video navigation, which will help improve the user's high quality experience.
  • FIG. 1 is a schematic structural diagram of a media presentation description according to an embodiment of the present disclosure
  • FIG. 1 is a schematic flowchart of a media presentation and navigation method based on an HTTP media stream according to an embodiment of the present invention
  • FIG. 1 is a schematic diagram of a time structure of a single media presentation according to an embodiment of the present invention
  • FIG. 1 is a schematic diagram of a time structure of multiple media presentations according to an embodiment of the present invention
  • 1 - e and 1 - f are schematic diagrams of media representations of a coded navigation unit according to an embodiment of the present invention.
  • FIG. 1 is a schematic diagram of a time structure of another multiple media presentation according to an embodiment of the present invention
  • FIG. 1 is a schematic diagram of a time structure of another multiple media presentation according to an embodiment of the present invention
  • FIG. 1 is a schematic diagram of a synthesized navigation media presentation according to an embodiment of the present invention
  • FIG. 1 is a schematic diagram of a video component of a client decoding output navigation unit according to an embodiment of the present invention
  • FIG. 1 is a schematic diagram of an audio component of a client decoding output navigation unit according to an embodiment of the present invention
  • FIG. 2 is a schematic flowchart of another media presentation and navigation method based on HTTP media stream according to an embodiment of the present invention
  • FIG. 3 is a schematic flowchart of another media presentation and navigation method based on an HTTP media stream according to an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of a network architecture according to an embodiment of the present disclosure.
  • FIG. 4 is a schematic diagram of a client according to an embodiment of the present invention.
  • FIG. 5 is a schematic diagram of another client according to an embodiment of the present disclosure.
  • FIG. 6 is a schematic diagram of a server according to an embodiment of the present disclosure.
  • FIG. 7 is a schematic diagram of another server according to an embodiment of the present disclosure.
  • FIG. 8 is a schematic diagram of a communication system according to an embodiment of the present invention.
  • Embodiments of the present invention provide a media presentation navigation method and related device based on a hypertext transfer protocol media stream, so as to support video navigation in an HTTP-based media streaming service scenario, thereby improving user experience.
  • EPG Electronic Program Guide
  • the EPG is actually a list.
  • the EPG contains information such as programs and times of different channels, and the like, and the user can find the electric power of interest through the EPG.
  • Video channel then switch from the EPG channel to the channel.
  • the navigation service provided in a graphical manner is more user-friendly and user-friendly.
  • a navigation channel is represented by a navigation unit.
  • the navigation unit like the TV channel it represents, can have different media components, such as video, audio, and so on.
  • the graphical navigation service presents a video of a set of navigation units in the form of multiple small-format images (dynamic image sequences or static images). The user can browse through the images of multiple small frames, change the navigation unit of interest, and the user can even hear the audio of the navigation unit currently focused on. The user can select a navigation unit to switch to the channel corresponding to the navigation unit.
  • HTTP-based adaptive streaming services have become the mainstream technology for multimedia streaming services, representing the latest developments in this field.
  • Apple's HTTP streaming service HLS, HTTP Live Streaming
  • Microsoft's Smooth Streaming SS
  • HTTP-based dynamic image expert group MPEG, Moving Picture Experts Group
  • DASH Dynamic Adaptative Streaming Over HTTP
  • the MPEG DASH standard is a standardized technology developed by MPEG and is expected to be widely adopted, thus changing the fragmented market structure.
  • the existing HTTP-based media streaming service is only applicable to one media presentation (media presentation is a term used in the DASH standard, conceptually equivalent to a TV channel), while the navigation service serves multiple media presentations, which is a cross-multiple Media presentation of the business.
  • the present invention aims to solve the support of the navigation service by the HTTP-based media streaming service.
  • the present invention refers to terms in the DASH standard as a basis for the description and embodiments, the method of the present invention is not limited to the DASH standard, but is applicable to a variety of HTTP-based media streaming services.
  • the technical solutions of some embodiments of the present invention may be, for example, according to some DASH specifications and supplementary revisions thereof as follows:
  • ISO/IEC 23009-1 Part 1: Media presentation description and segment formats, 2nd Edition, 2014.
  • Part 1 Media presentation description and segment formats.
  • AMENDMENT 1 High Profile and Availability Time Synchronization Extended profiles and time synchronization
  • ISO/IEC 23009-1 2014/FDAM 1Part 1: Media presentation description and segment formats.
  • Part 1 Media presentation description and segment formats.
  • AMENDMENT 2 Spatial Relationship Description, Generalized URL parameters and other extensions.
  • a media content is encoded into multiple versions, each version has different characteristics, such as code rate. These versions are called Representation in DASH, and they represent the same media content, from content presentation ( The viewing/playing angles are alternative to each other.
  • a media representation is divided into accessible units in time - usually a few seconds in length, called a media segment or a media sub-segment (a media segment can be logically divided into media sub-segments).
  • the media segment the initialization segment is referred to as a segment.
  • the media expression is stored on the content server - the HTTP server for the client to obtain, and the fragment is the smallest unit that the client can access through the Uniform Resource Locator (URL).
  • the Media Presentation Description is an extensible Markup Language (XML) file that contains the metadata required by the client, describes the characteristics of the media representation, and how to obtain media expressions from the server. Including: the code rate of the media expression, the resolution, the aspect ratio of the video image, the URL of the clip included in the media expression, and the like.
  • the client can construct an HTTP URL to request media segments in the media presentation from the content server, and can switch to other media representations at the media segment boundaries to accommodate changes in available bandwidth.
  • the HTTP-based adaptive media streaming service allows for changes in the characteristics of the content in a media presentation, such as changes in the way the media is encoded.
  • this is achieved through the concept of the so-called "Period", which is used for splicing of content, such as the previous content paragraph is a news program, and the next content paragraph is an advertisement.
  • a media presentation includes one or more content paragraphs (Period), These content passages are sequential in time, and the beginning of a content passage means that there are some changes compared to the previous content passage, such as changes in content, such as from news programs to sports programs, sports programs to movie programs, and from Movie programs to advertisements, advertisements, variety shows, etc.; changes in the way the content is encoded, for example, can be changed from H.264 encoding scheme to H.265 encoding scheme; changes in the number of media expressions, for example, can increase or decrease media expression; The change of the content component, for example, can increase the audio expression of Chinese and the like.
  • the client's working conditions have changed and may have to be reinitialized.
  • a collection of media expressions containing the same media content and media components is referred to as an adaptation set, an adaptation set containing at least one media representation, and media representations in an adaptation set having mutual substitution.
  • Different adaptation sets may be compatible or repulsive.
  • the media presentation can include one or more temporally sequential content paragraphs, each content paragraph containing one or more Adaptation Sets.
  • Each of the Adaptation Sets contains one or more media representations (Representations).
  • One of the media expressions contains one or more segments.
  • the media presentation description has a hierarchical structure similar to the media presentation, as shown in Figure 1-a.
  • the concept of media presentation described above may be represented by an XML element in the media presentation description, the media presentation element includes one or more content paragraph elements, and each content paragraph element contains one or more adaptation sets ( AdaptationSet) element. Each AdaptationSet element contains one or more Representation elements.
  • the media presentation corresponds to a media presentation description element in the media presentation description
  • a content paragraph in the media presentation corresponds to a content paragraph element in the media presentation description
  • an adaptation set in the media presentation corresponds to an appropriate one in the media presentation description
  • the following describes the media presentation navigation method based on HTTP media stream.
  • the navigation service serves multiple media presentations, and is convenient for selecting a group of media presentations, and is a service presented across multiple media.
  • the plurality of media services served by the navigation service present a member media presentation called the navigation service, referred to as member media presentation or main media presentation.
  • the navigation service can be implemented as a media presentation (ie, navigation) Media presentation), the navigation media presents a presentation of the media independent of its members.
  • the navigation business and its member media presentations are each illustrated by their respective media presentation descriptions. Wherein, if the navigation service is served by N media presentations, then there are N+1 media presentations and corresponding N+1 media presentation descriptions.
  • each member media presentation corresponds to the navigation media presentation.
  • a navigation unit that represents the member's media presentation.
  • the navigation business and its member media presentations are each described by their respective media presentations.
  • a navigation unit represents a media presentation that may include multiple media components, such as video components (also referred to as video media representations) and audio components (also referred to as audio media representations).
  • the video of a navigation unit is a small format image representing a media presentation.
  • the video of the navigation unit is usually cropped from the video component represented by the media it represents, that is, a part of the picture, the navigation unit presentation quality (such as resolution and/or frame rate, etc.) is lower than the main media presentation.
  • the audio of the viewing unit comes from the audio presented by the main media.
  • the video of one navigation unit is implemented as one or more media representations (one in some examples).
  • FIG. 1-b is a schematic flowchart of a media presentation navigation method based on HTTP media stream according to an embodiment of the present invention.
  • a media presentation navigation method based on an HTTP media stream provided by an embodiment of the present invention may include:
  • the client obtains a media presentation description of the navigation media presentation.
  • the media presentation description presented by the navigation media describes N navigation units included in the navigation media presentation.
  • the client may obtain a media presentation description of the navigation media presentation from a content server or other device.
  • N is an integer greater than 1.
  • the N may be equal to 7, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
  • the client may be a DASH client or other client with DASH client logic function or other client of HTTP-based media streaming service.
  • the client may be, for example, a personal computer, a mobile phone, a tablet, a television, or a set top box.
  • the navigation media presentation can be seen as a special media presentation.
  • the client acquires K navigation units of the N navigation units according to the media presentation description presented by the navigation media.
  • K is a positive integer less than or equal to the N.
  • the K may be equal to 1, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
  • the K navigation units may be in one-to-one correspondence with the K logical presentation units (the logical presentation units may be, for example, a navigation window), that is, each navigation unit of the K navigation units may be presented by different logical presentation units. .
  • the client presents the K navigation units.
  • Each of the K navigation units points to a primary media presentation.
  • the presentation quality of the main media presentation pointed to by the navigation unit i in the K navigation units is higher than the presentation quality of the navigation unit i.
  • each of the K navigation units can point to a main media presentation.
  • the presentation quality of the main media presentation pointed by the navigation unit i in the K navigation units is higher than the presentation quality of the navigation unit i. That is to say, the presentation quality of the media expression of the navigation unit is lower than the presentation quality of the main media presentation represented by the navigation unit.
  • the media presentation description presented by the navigation media may be different from the media presentation of the primary media presentation pointed to by each navigation unit of the K navigation units.
  • the navigation media presentation may have an independent media presentation description, and each of the K navigation units may be directed to the primary media presentation or may be independent of the media presentation presented by the navigation media.
  • K navigation units point to K main media presentations, and K main media presentations respectively have corresponding media presentation descriptions, ie, K media presentation descriptions, and the media presentation descriptions presented by the navigation media are different from the K media. Any one of the presentation descriptions, that is, the navigation media presentation can be described by the K+1th media presentation.
  • the media presentation description presented by the navigation media and the media presentation description of the primary media presentation pointed to by each navigation unit of the K navigation units may be aggregated.
  • An aggregated media presentation description (also referred to as a supermedia presentation description) is formed.
  • An aggregated media presentation description (which can be called a supermedia presentation description) can be used to describe the navigation media presentation and The navigation media presents the primary media presentation that it points to.
  • the introduction of the supermedia presentation description facilitates enhancing the association between the navigation media presentation and the guided master media presentation.
  • the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
  • each of the K navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
  • the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
  • the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the K navigation units may be aggregated to form an aggregated media presentation description.
  • each of the K navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
  • each of the N navigation units includes a video component, or each of the N navigation units includes audio Component and video components, further, the navigation unit may also include subtitle components or other types of media components.
  • the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
  • the media presentation description can inform the client which navigation unit consists of a navigation unit, the components of the navigation unit, the relationship between the navigation unit and the media presentation of the members of the navigation service, and the relationship between the video components of the navigation unit, The relationship between the audio components of the navigation unit, the relationship between the audio component and the video component of the navigation unit, and the like.
  • the video components included in different navigation units of the K navigation units are media expressions in different video adaptation sets in the K video adaptation sets, where The media representations in any one of the K video adaptation sets have selective mutual exclusion, and the different video adaptation sets in the K video adaptation sets have selection compatibility.
  • the video components included in the navigation unit i in the K navigation units may be attributed to the video adaptation set Ci in the K video adaptation sets, which are included in the navigation unit j of the K navigation units.
  • the video component may be attributed to a video adaptation set Cj in the K video adaptation sets, wherein the video adaptation set Cj and the video adaptation set Ci are two different video adaptation sets in the K video adaptation sets.
  • Guide The element j and the navigation unit i can be any two navigation units of the K navigation units.
  • selection compatibility means that these objects can be selected at the same time. For example, if there is selection compatibility between different video adaptation sets in the K video adaptation sets, it means that K video adaptation sets can be selected at the same time. Media representation in multiple video adaptation sets.
  • the so-called mutual exclusion means that these objects are not supported at the same time. For example, if there is selective mutual exclusion between media expressions in any one of the K video adaptation sets, it means that one is not supported at the same time.
  • Multiple media representations in the video adaptation set for example, assuming that the video adaptation set I in the K video adaptation sets includes more than 10 media representations, if there is selective mutual exclusion between media representations in the video adaptation set, then each Only one of the 10 media expressions can be selected at a time, and multiple of the 10 media expressions cannot be selected at the same time.
  • the K navigation unit includes an audio component that is a media representation in an audio adaptation set, where the audio adaptation set is different from the K video adaptation sets.
  • the audio component adaptation set has a selection compatibility with the K video adaptation sets. For example, suppose the audio adaptation set includes 20 multiple media representations, and if there is selective mutual exclusion between media expressions in the audio adaptation set, then only one of 20 multiple media expressions can be selected at a time, and Multiple of the 30 media expressions cannot be selected at the same time.
  • the audio components included in the different navigation units of the K navigation units are media expressions in different audio adaptation sets in the K audio adaptation sets.
  • the different audio adaptation sets in the K audio adaptation sets have selective mutual exclusion.
  • the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
  • media representations including media presentation elements of the same region description have an association relationship, or an adaptation set element including the same region description There is an association between the sets.
  • the media expression described by the media expression element i is expressed as a media expression ri
  • the media expression described by the media expression element j is expressed as a media expression rj. If the media expression element i and the media expression element j contain the same region description, then the representation may be There is a relationship between the media expression ri and the media expression rj.
  • the media expression element i and the adaptation set element ci include the same area description, and then may also describe the media expression and the adaptation set element ci described by the media expression element i.
  • the media expressions in the adaptation set have an association relationship, for example, the media expression element i can be an audio media expression, and the media expression in the adaptation set described by the adaptation set element ci can be a video media expression.
  • the area description may be a spatial relationship description (SRD).
  • SRD spatial relationship description
  • the area description can also be other types of descriptive information that can be used to describe the location area.
  • the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements and the K video One-to-one correspondence between adaptation sets.
  • the K video adaptation set elements include a description sub-element Ci
  • the video adaptation sets described in the K video adaptation set elements satisfying the set common condition video adaptation set element have a selection
  • the set commonality condition may be the same for both the element name and the method identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set element.
  • the description sub-element Ci may describe that the media representation in the video adaptation set described by the video adaptation set element including the description sub-element Ci is the navigation medium.
  • the components of the presentation may describe a role that the media expression in the video adaptation set corresponding to the video adaptation set element corresponding to the description sub-element Ci is presented in the navigation medium, and the role may be, for example, primary, supplementary, subtitle or Translation dubbing, etc.
  • the description sub-element Ci may be, for example, an EssentialProptery element or a Supplemental Proptery element or a Role element or other elements.
  • the description sub-element Ci is an action description Role element
  • the set commonality condition may be an element name of the descriptive sub-element Ci included in the video adaptation set element.
  • the same method and method identification schemeIdUri attributes can be the same, and the parameter attributes can be the same.
  • the media presentation description of the navigation media presentation includes the K video adaptation set elements, the K video adaptation set elements and K videos One-to-one correspondence between adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the K video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I may be the K Any video adaptation set in a video adaptation set.
  • the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
  • the pointer can be carried by an attribute of the video adaptation set element VI.
  • the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
  • the pointer can be carried by an EssentialProptery element or a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a child element in an EssentialProptery element of the video adaptation set element VI, or the pointer may be carried by an attribute of an EssentialProptery element in the video adaptation set element VI;
  • the pointer may be carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer may be carried by an attribute of a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a value attribute or other attribute of an EssentialProptery element in the video adaptation set element VI, or the pointer may be a value attribute of a SupplementalProperty element in the video adaptation set element VI or other Property bearer.
  • the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, where The virtual Representation element does not include a media segment template element, a media segment list element, and a base Uniform Resource Locator (BaseURL) element.
  • BaseURL base Uniform Resource Locator
  • the pointer can also be carried by a Mediad Presentation (ReferencedMediaPresentation) element in the Video Adaptation Set Element VI.
  • the ReferencedMediaPresentation element is an element of the new extension, that is, the pointer can be carried by the newly extended element in the Video Adaptation Set Element VI, which is newly expanded in the Video Adaptation Set Element VI.
  • Bearer The name of the element of the pointer is not limited to ReferencedMediaPresentation, but may be other element names.
  • the time structure of the navigation media presentation may be independent of the time of presentation of the primary media pointed by the K navigation units in the navigation media presentation.
  • the audio of the navigation unit may be obtained by encoding the audio presented by the main media
  • the video of the navigation unit may be obtained by encoding the video presented by the main media, which may make the time structure of the navigation unit and There is no correlation between the time structures presented by the main media.
  • Figure 1-c illustrates, by way of example, the temporal structure of a media presentation that includes a number of consecutive Periods.
  • Figure 1-d illustrates, by way of example, the temporal structure of multiple media presentations, each media presentation including a contiguous number of Periods.
  • the time structure between multiple media presentations is different, for example, the boundaries of Period are not aligned.
  • the media presentation is sequential in time
  • the media presentation description also describes the sequential time structure
  • the non-sequential time structure describing multiple concurrent media presentations exceeds the ability of the traditional media presentation description.
  • the media expression (audio and video, etc.) of the main media presentation pointed by each navigation unit can be re-encoded to obtain the media expression of the navigation unit, that is, each navigation unit points
  • the media expression of the main media presentation and the media expression of the navigation unit are independent.
  • the media representation of each navigation unit is independent, and the audio component and video component of the same navigation unit are also independent.
  • the media representation that may not be presented by the media of the navigation unit is not affected by the Period arrangement of the media representation presented by the corresponding primary media.
  • Figures 1-e and 1-f illustrate examples of the manner in which the content server encodes the video media representation and audio media representation of the primary media presentation pointed to by the navigation unit.
  • Figure 1-g shows an example of a Period arrangement for media presentation of N navigation units presented by the navigation media.
  • the Period arrangement of the media presentation of the N navigation units presented by the navigation media is aligned.
  • Figure 1-h shows that when a navigation unit is added, the newly arranged navigation unit is aligned with the periodic arrangement of the media presentations of the other navigation units.
  • Figure 1-i shows the media presentation of the main media presentation by the content server using the navigation units.
  • An example of the manner in which the media presentation description presented by the navigation media is obtained.
  • the content server can also obtain the media presentation description of the navigation media presentation by other means.
  • Figures 1-j and 1-k show an example of a client selecting K navigation units for rendering.
  • the video media representations of the K navigation units in the N navigation units will be decoded and rendered, and the audio media representations of the highlighted navigation units in the audio media representations of the K navigation units will be decoded for presentation.
  • the client can select the specific manner in which the K navigation units are presented based on the media presentation description and user instructions presented by the navigation media.
  • the method further includes: if the focus of attention stays in the navigation unit i in the K navigation units, the client presents the The audio component of navigation unit i.
  • the method further includes: if the navigation unit i in the K navigation units is selected, the client acquires the navigation The primary media pointed to by unit i is presented. Further, the client may also present a primary media presentation pointed by the navigation unit i.
  • each navigation unit of the K navigation units can respectively point to one main media presentation, this is equivalent to the introduction between the navigation unit and the main media presentation.
  • Correlation which enables the client to obtain a media presentation description of the main media presentation j pointed to by the navigation unit i in the case where the navigation unit i of the K navigation units is selected, and thus The media presentation description of the primary media presentation j obtains the primary media presentation j for presentation, which is beneficial to facilitate flexible switching between the navigation media presentation and the primary media presentation, thereby implementing the HTTP-based media streaming service scenario. Support for video navigation, which will help improve the user's high quality experience.
  • the technical solution of the embodiment of the present invention is advantageous for making the navigation service more flexible.
  • the present invention can implement a personalized navigation service.
  • the navigation service can be configured on the client, such as: displayed in a navigation page/window.
  • the number of navigation units, the combination of navigation units, the presentation position and order of the navigation units, etc. can all be configured on the client side, which facilitates greatly facilitating the use of the navigation service on a variety of different devices.
  • mobile phone terminals, tablets, their capabilities are different - display device size, resolution, computing power.
  • FIG. 2 is a schematic flowchart diagram of another media presentation and navigation method based on HTTP media stream according to another embodiment of the present invention.
  • a media presentation navigation method based on an HTTP media stream provided by another embodiment of the present invention may include:
  • the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, where N is an integer greater than 1;
  • N is an integer greater than 1;
  • Each of the navigation units of the navigation unit points to a main media presentation, wherein the presentation quality of the main media presentation pointed by the navigation unit i of the N navigation units is higher than that of the navigation unit i Presentation quality.
  • the execution body of the embodiment of the present invention may be a content server or other device.
  • the content server can store a media presentation description of the navigation media presentation and can provide it to the client.
  • the media presentation description presented by the navigation media describes N navigation units included in the navigation media presentation.
  • the client may obtain a media presentation description of the navigation media presentation from a content server or other device.
  • N is an integer greater than 1.
  • the N may be equal to 7, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
  • the client may be a DASH client or other client with DASH client logic function or other client of HTTP-based media streaming service.
  • the client may be, for example, a personal computer, a mobile phone, a tablet, a television, or a set top box.
  • the navigation media presentation can be seen as a special media presentation.
  • the navigation media presentation of the navigation media presentation describes the N navigation units included in the navigation media presentation, because each navigation unit of the N navigation units can respectively Pointing to a master media presentation, which is equivalent to a certain association relationship introduced between the navigation unit and the main media presentation, which enables the client to select the navigation unit i of the N navigation units.
  • the client may obtain a media presentation description of the primary media presentation j pointed to by the navigation unit i, and then obtain the primary media presentation j according to the media presentation description of the primary media presentation j, and the scheme is
  • the foundation of the flexible switching between the navigation media presentation and the main media presentation is laid, which lays a foundation for supporting video navigation in the HTTP-based media streaming service scenario.
  • the presentation quality of the main media presentation pointed by the navigation unit i in the N navigation units is higher than the presentation quality of the navigation unit i. That is to say, the presentation quality of the media expression of the navigation unit is lower than the presentation quality of the main media presentation represented by the navigation unit.
  • the media presentation description of the navigation media presentation may be different from the media presentation of the primary media presentation pointed to by each of the N navigation units.
  • description that is, the navigation media presentation may have an independent media presentation description, and each of the N navigation units may also have a media presentation that is independent of the presentation of the navigation media.
  • N navigation units point to N main media presentations, and N main media presentations respectively have corresponding media presentation descriptions, ie, N media presentation descriptions, and the media presentation descriptions presented by the navigation media are different from the N media.
  • Any one of the presentation descriptions, that is, the navigation media presentation can be described by the K+1th media presentation.
  • the media presentation description presented by the navigation media and the media presentation description of the primary media presentation pointed to by each navigation unit of the N navigation units may be aggregated.
  • An aggregated media presentation description (also referred to as a supermedia presentation description) is formed.
  • An aggregated media presentation description (which may be referred to as a supermedia presentation description) may be utilized to describe the primary media presentation to which the navigation media presentation and navigation media presentations are directed.
  • the introduction of a super-media presentation description facilitates enhanced navigation The relationship between the presentation of the media and the presentation of the master media presented.
  • the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
  • each of the N navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
  • the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
  • the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the N navigation units may be aggregated to form an aggregated media presentation description.
  • each of the N navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
  • each of the N navigation units includes a video component, or each of the N navigation units includes audio Component and video components, further, the navigation unit may also include subtitle components or other types of media components.
  • the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
  • the media presentation description can inform the client which navigation unit consists of a navigation unit, the components of the navigation unit, the relationship between the navigation unit and the media presentation of the members of the navigation service, and the relationship between the video components of the navigation unit, The relationship between the audio components of the navigation unit, the relationship between the audio component and the video component of the navigation unit, and the like.
  • the video components included in the different navigation units of the N navigation units are media expressions in different video adaptation sets in the N video adaptation sets, where The media representations in any one of the N video adaptation sets have selective mutual exclusion, and the different video adaptation sets in the N video adaptation sets have selection compatibility.
  • the video component included in the navigation unit i in the N navigation units may be attributed to the video adaptation set Ci in the N video adaptation sets, where the navigation unit j included in the N navigation units
  • the video component may be attributed to a video adaptation set Cj in the N video adaptation sets, wherein the video adaptation set Cj and the video adaptation set Ci are two different video adaptation sets in the N video adaptation sets.
  • the navigation unit j and the navigation unit i can be any two navigation units of the N navigation units.
  • selection compatibility means that these objects can be selected at the same time. For example, if there is selection compatibility between different video adaptation sets in the N video adaptation sets, it means that N video adaptation sets can be selected at the same time. Media representation in multiple video adaptation sets.
  • mutual exclusion indicating that these objects do not support simultaneous selection, for example, if there is selective mutual exclusion between media expressions in any one of the N video adaptation sets, indicating that one is not supported at the same time.
  • Multiple media representations in the video adaptation set for example, assuming that the video adaptation set I in the N video adaptation sets includes 10 multiple media representations, if there is selective mutual exclusion between media expressions in the video adaptation set, then each Only one of the 10 media expressions can be selected at a time, and multiple of the 10 media expressions cannot be selected at the same time.
  • the N navigation unit includes an audio component that is a media expression in an audio adaptation set, where the audio adaptation set is different from the N video adaptation sets.
  • the audio component adaptation set has a selection compatibility with the N video adaptation sets. For example, suppose the audio adaptation set includes 20 multiple media representations, and if there is selective mutual exclusion between media expressions in the audio adaptation set, then only one of 20 multiple media expressions can be selected at a time, and Multiple of the 30 media expressions cannot be selected at the same time.
  • the audio components included in the different navigation units of the N navigation units are media expressions in different audio adaptation sets in the N audio adaptation sets.
  • the different audio adaptation sets in the N audio adaptation sets have selective mutual exclusion.
  • the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
  • media representations including media presentation elements of the same region description have an association relationship, or an adaptation set element including the same region description There is an association between the sets.
  • the media expression described by the media expression element i is expressed as a media expression ri
  • the media expression described by the media expression element j is expressed as a media expression rj. If the media expression element i and the media expression element j contain the same regional description, the media expression ri may be represented. There is a relationship with the media expression rj.
  • the media expression element i and the adaptation set element ci include the same area description, and then may also describe the media expression and the adaptation set element ci described by the media expression element i.
  • the media expressions in the adaptation set have an association relationship, for example, the media expression element i can be an audio media expression, and the media expression in the adaptation set described by the adaptation set element ci can be a video media expression.
  • the area description may be a spatial relationship description (SRD).
  • SRD spatial relationship description
  • the area description can also be other types of descriptive information that can be used to describe the location area.
  • the media presentation description of the navigation media presentation includes N video adaptation set elements, and the N video adaptation set elements and the N video One-to-one correspondence between adaptation sets.
  • the N video adaptation set elements include a description sub-element Ci
  • the video adaptation sets described in the N video adaptation set elements satisfying the set common condition video adaptation set element have a selection
  • the set commonality condition may be the same for both the element name and the method identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set element.
  • the description sub-element Ci may describe that the media representation in the video adaptation set described by the video adaptation set element including the description sub-element Ci is the navigation medium.
  • the components of the presentation may describe a role that the media expression in the video adaptation set corresponding to the video adaptation set element corresponding to the description sub-element Ci is presented in the navigation medium, and the role may be, for example, primary, supplementary, subtitle or Translation dubbing, etc.
  • the description sub-element Ci may be, for example, an EssentialProptery element or a Supplemental Proptery element or a Role element or other elements.
  • the description sub-element Ci is an action description Role element
  • the set commonality condition may be an element name of the descriptive sub-element Ci included in the video adaptation set element.
  • the same method and method identification schemeIdUri attributes can be the same, and the parameter attributes can be the same.
  • the media presented by the navigation media is The N video adaptation set elements are included in the description, and the N video adaptation set elements are in one-to-one correspondence with the N video adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the N video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I may be the N Any video adaptation set in a video adaptation set.
  • the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
  • the pointer can be carried by an attribute of the video adaptation set element VI.
  • the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
  • the pointer can be carried by an EssentialProptery element or a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a child element in an EssentialProptery element of the video adaptation set element VI, or the pointer may be carried by an attribute of an EssentialProptery element in the video adaptation set element VI;
  • the pointer may be carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer may be carried by an attribute of a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a value attribute or other attribute of an EssentialProptery element in the video adaptation set element VI, or the pointer may be a value attribute of a SupplementalProperty element in the video adaptation set element VI or other Property bearer.
  • the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, where The virtual Representation element does not include a media segment template element, a media segment list element, and a base Uniform Resource Locator (BaseURL) element.
  • BaseURL base Uniform Resource Locator
  • the pointer can also be carried by a Mediad Presentation (ReferencedMediaPresentation) element in the Video Adaptation Set Element VI.
  • the ReferencedMediaPresentation element is an element of the new extension, that is, the pointer can be carried by the newly extended element in the Video Adaptation Set Element VI, which is newly expanded in the Video Adaptation Set Element VI.
  • the name of the element carrying the pointer is not limited to ReferencedMediaPresentation, but may be other The name of the element.
  • the time structure of the navigation media presentation may be independent of the time of presentation of the primary media pointed by the N navigation units in the navigation media presentation.
  • the audio of the navigation unit may be obtained by encoding the audio presented by the main media
  • the video of the navigation unit may be obtained by encoding the video presented by the main media, which may make the time structure of the navigation unit and There is no correlation between the time structures presented by the main media.
  • FIG. 3-a is a schematic flowchart diagram of a method for providing navigation media presentation based on HTTP streaming media according to another embodiment of the present invention.
  • the method for providing navigation media presentation based on HTTP streaming as shown in FIG. 3-a can be implemented based on the network architecture shown in FIG. 3-b.
  • the network architecture shown in Figure 3-b mainly includes the DASH client and the content server.
  • a method for providing a navigation media presentation based on HTTP streaming media may include:
  • the DASH client obtains a media presentation description of the navigation media presentation from the content server.
  • the media presentation description presented by the navigation media describes N navigation units included in the navigation media presentation.
  • N is an integer greater than 1.
  • the N may be equal to 7, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
  • the DASH client can be, for example, a personal computer, a mobile phone, a tablet, a television, or a set top box.
  • the DASH client acquires, from the content server, the K navigation units of the N navigation units according to the media presentation description presented by the navigation media.
  • K is a positive integer less than or equal to the N.
  • the K may be equal to 1, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
  • K navigation units can be in one-to-one correspondence with K logical presentation units, that is, K navigation units Each of the navigation units can be presented by a different logical presentation unit.
  • the DASH client presents the K navigation units.
  • each of the K navigation units can point to a main media presentation.
  • the presentation quality of the main media presentation pointed by the navigation unit i in the K navigation units is higher than the presentation quality of the navigation unit i. That is to say, the presentation quality of the media expression of the navigation unit is lower than the presentation quality of the main media presentation represented by the navigation unit.
  • the DASH client acquires, from the content server, a media presentation description of the main media presentation pointed by the navigation unit i.
  • the DASH client obtains the primary media presentation from the content server based on the media presentation description of the primary media presentation.
  • the DASH client presents a primary media presentation pointed by the navigation unit i.
  • the presentation quality of the main media presentation pointed by the navigation unit i in the K navigation units is higher than the presentation quality of the navigation unit i. That is to say, the presentation quality of the media expression of the navigation unit is lower than the presentation quality of the main media presentation represented by the navigation unit.
  • the media presentation description presented by the navigation media may be different from the media presentation of the primary media presentation pointed to by each navigation unit of the K navigation units.
  • the navigation media presentation may have an independent media presentation description, and each of the K navigation units may be directed to the primary media presentation or may be independent of the media presentation presented by the navigation media.
  • K navigation units point to K main media presentations, and K main media presentations respectively have corresponding media presentation descriptions, ie, K media presentation descriptions, and the media presentation descriptions presented by the navigation media are different from the K media. Any one of the presentation descriptions, that is, the navigation media presentation can be described by the K+1th media presentation.
  • the media presentation description presented by the navigation media and the media presentation description of the primary media presentation pointed to by each navigation unit of the K navigation units may be aggregated.
  • An aggregated media presentation description (also referred to as a supermedia presentation description) is formed.
  • An aggregated media presentation description (which may be referred to as a supermedia presentation description) may be utilized to describe the primary media presentation to which the navigation media presentation and navigation media presentations are directed. The introduction of the supermedia presentation description facilitates enhancing the association between the navigation media presentation and the guided master media presentation.
  • the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
  • each of the K navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
  • the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
  • the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the K navigation units may be aggregated to form an aggregated media presentation description.
  • each of the K navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
  • each of the N navigation units includes a video component, or each of the N navigation units includes audio Component and video components, further, the navigation unit may also include subtitle components or other types of media components.
  • the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
  • the media presentation description can inform the client which navigation unit consists of a navigation unit, the components of the navigation unit, the relationship between the navigation unit and the media presentation of the members of the navigation service, and the relationship between the video components of the navigation unit, The relationship between the audio components of the navigation unit, the relationship between the audio component and the video component of the navigation unit, and the like.
  • the video components included in different navigation units of the K navigation units are media expressions in different video adaptation sets in the K video adaptation sets, where The media representations in any one of the K video adaptation sets have selective mutual exclusion, and the different video adaptation sets in the K video adaptation sets have selection compatibility.
  • the video components included in the navigation unit i in the K navigation units may be attributed to the video adaptation set Ci in the K video adaptation sets, which are included in the navigation unit j of the K navigation units.
  • the video component may be attributed to a video adaptation set Cj in the K video adaptation sets, wherein the video adaptation set Cj and the video adaptation set Ci are two different video adaptation sets in the K video adaptation sets.
  • the navigation unit j and the navigation unit i can be any two navigation units of the K navigation units.
  • the so-called selection compatibility means that these objects can be selected at the same time, for example, if K videos Selective compatibility between different sets of video adaptations in the adaptation set means that media representations in multiple video adaptation sets in the K video adaptation sets can be selected simultaneously.
  • the so-called mutual exclusion means that these objects are not supported at the same time. For example, if there is selective mutual exclusion between media expressions in any one of the K video adaptation sets, it means that one is not supported at the same time.
  • Multiple media representations in the video adaptation set for example, assuming that the video adaptation set I in the K video adaptation sets includes more than 10 media representations, if there is selective mutual exclusion between media representations in the video adaptation set, then each Only one of the 10 media expressions can be selected at a time, and multiple of the 10 media expressions cannot be selected at the same time.
  • the K navigation unit includes an audio component that is a media representation in an audio adaptation set, where the audio adaptation set is different from the K video adaptation sets.
  • the audio component adaptation set has a selection compatibility with the K video adaptation sets. For example, suppose the audio adaptation set includes 20 multiple media representations, and if there is selective mutual exclusion between media expressions in the audio adaptation set, then only one of 20 multiple media expressions can be selected at a time, and Multiple of the 30 media expressions cannot be selected at the same time.
  • the audio components included in the different navigation units of the K navigation units are media expressions in different audio adaptation sets in the K audio adaptation sets.
  • the different audio adaptation sets in the K audio adaptation sets have selective mutual exclusion.
  • the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
  • media representations including media presentation elements of the same region description have an association relationship, or an adaptation set element including the same region description There is an association between the sets.
  • the media expression described by the media expression element i is expressed as a media expression ri
  • the media expression described by the media expression element j is expressed as a media expression rj. If the media expression element i and the media expression element j contain the same regional description, the media expression ri may be represented. There is a relationship with the media expression rj.
  • the media expression element i and the adaptation set element ci may also indicate that the media expression described by the media expression element i has an association relationship with each media expression in the adaptation set described by the adaptation set element ci, for example, the media expression element i may be an audio media expression.
  • the media representation in the adaptation set described by the adaptation set element ci may be a video media representation.
  • the area description may be a spatial relationship description (SRD).
  • SRD spatial relationship description
  • the area description can also be other types of descriptive information that can be used to describe the location area.
  • the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements and the K video One-to-one correspondence between adaptation sets.
  • the K video adaptation set elements include a description sub-element Ci
  • the video adaptation sets described in the K video adaptation set elements satisfying the set common condition video adaptation set element have a selection
  • the set commonality condition may be the same for both the element name and the method identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set element.
  • the description sub-element Ci may describe that the media representation in the video adaptation set described by the video adaptation set element including the description sub-element Ci is the navigation medium.
  • the components of the presentation may describe a role that the media expression in the video adaptation set corresponding to the video adaptation set element corresponding to the description sub-element Ci is presented in the navigation medium, and the role may be, for example, primary, supplementary, subtitle or Translation dubbing, etc.
  • the description sub-element Ci may be, for example, an EssentialProptery element or a Supplemental Proptery element or a Role element or other elements.
  • the description sub-element Ci is an action description Role element
  • the set commonality condition may be an element name of the descriptive sub-element Ci included in the video adaptation set element.
  • the same method and method identification schemeIdUri attributes can be the same, and the parameter attributes can be the same.
  • the media presentation description of the navigation media presentation includes the K video adaptation set elements, the K video adaptation set elements and K views One-to-one correspondence between frequency adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the K video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I may be the K Any video adaptation set in a video adaptation set.
  • the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
  • the pointer can be carried by an attribute of the video adaptation set element VI.
  • the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
  • the pointer can be carried by an EssentialProptery element or a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a child element in an EssentialProptery element of the video adaptation set element VI, or the pointer may be carried by an attribute of an EssentialProptery element in the video adaptation set element VI;
  • the pointer may be carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer may be carried by an attribute of a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a value attribute or other attribute of an EssentialProptery element in the video adaptation set element VI, or the pointer may be a value attribute of a SupplementalProperty element in the video adaptation set element VI or other Property bearer.
  • the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, where The virtual Representation element does not include a media segment template element, a media segment list element, and a base Uniform Resource Locator (BaseURL) element.
  • BaseURL base Uniform Resource Locator
  • the pointer can also be carried by a Mediad Presentation (ReferencedMediaPresentation) element in the Video Adaptation Set Element VI.
  • the ReferencedMediaPresentation element is an element of the new extension, that is, the pointer can be carried by the newly extended element in the Video Adaptation Set Element VI, which is newly expanded in the Video Adaptation Set Element VI.
  • the name of the element carrying the pointer is not limited to ReferencedMediaPresentation, but may be other element names.
  • the time structure of the navigation media presentation may be independent of the time of presentation of the primary media pointed by the K navigation units in the navigation media presentation.
  • the audio of the navigation unit may be obtained by encoding the audio presented by the main media
  • the video of the navigation unit may be obtained by encoding the video presented by the main media, which may make the time structure of the navigation unit and There is no correlation between the time structures presented by the main media.
  • each navigation unit of the K navigation units can respectively point to one main media presentation, this is equivalent to the introduction between the navigation unit and the main media presentation.
  • the association relationship which enables the DASH client to obtain the media presentation description of the main media presentation j pointed to by the navigation unit i in the case where the navigation unit i of the K navigation units is selected, and thus The media presentation description of the main media presentation j obtains the main media presentation j for presentation, which is beneficial to realize a more flexible switching between the navigation media presentation and the main media presentation, thereby implementing the HTTP-based media streaming service scenario. Support for video navigation, which in turn helps to enhance the user's high quality experience.
  • the videos of the various navigation units are parallel and juxtaposed, the video of the plurality of navigation units is presented on the display screen of the user equipment or a window, and the audio is mutually exclusive, and there can only be one at any time.
  • the audio of the navigation unit is selected and played, and the video screen of the navigation unit is the focus of the user.
  • the navigation service needs to be supported by the corresponding signaling mechanism.
  • the signaling informs the client which navigation unit consists of a navigation unit, the components of the navigation unit, the relationship between the navigation unit and the media presentation of the members of the navigation service, and the relationship between the video components of the navigation unit.
  • the relationship between the audio components of the unit the relationship between the audio component and the video component of the navigation unit.
  • the signaling of the navigation service is represented by a description file presented by the navigation media, implemented as some elements in the description file, expressing various relationships between the media expressions of the media components.
  • the navigation services in the example serve 16 member media presentations.
  • These MPD examples can be based on some of the following DASH specifications and their additions:
  • ISO/IEC 23009-1 Part 1: Media presentation description and segment formats, 2nd Edition, 2014.
  • Part 1 Media presentation description and segment formats.
  • AMENDMENT 1 High Profile and Availability Time Synchronization Extended profiles and time synchronization
  • ISO/IEC 23009-1 2014/FDAM 1Part 1: Media presentation description and segment formats.
  • Part 1 Media presentation description and segment formats.
  • AMENDMENT 2 Spatial Relationship Description, Generalized URL parameters and other extensions.
  • each example is not a complete MPD, but an MPD segment taken to illustrate the features of the present invention.
  • the example scenario S1 illustrates a signaling mechanism for navigating a service in the example scenario S1, informing the client which navigation units are comprised of a navigation unit, the components of the navigation unit, the navigation unit and the members of the navigation service.
  • a Role element is used for each adaptation set element, including a video adaptation set element and an audio adaptation set element.
  • the adaptation set element contains a use Role element, and the adaptation set of the description sub-element whose parameter is "main" is compatible and can be selected together by the client.
  • media representations in multiple video adaptation sets - video media representations of different navigation units can be selected together and presented on the client.
  • the video of the navigation unit and the main media presentation it represents are expressed by the attributes of the video adaptation set element of the navigation unit, specifically the attribute @xlink:href, which is essentially a pointer. Use it to point to a media presentation description of a remote primary media presentation. Because the pointing element is not an adaptation set element, the pointed element is not embedded in the navigation media presentation description (MPD's data model is hierarchical, and an element contains only elements of a lower-level type, not including Its more advanced type of element), which can be expressed in @xlink:show.
  • the element pointed to by @xlink:href is the same as the type of the element in which the attribute is located, ie, if the attribute is at the level of the adaptation set element, the element it points to is adapted.
  • Set element type In the present invention, the attribute is extended to the type of the element, which is used to point to a media presentation.
  • the adaptation set element has both the far-end element (which points to a remote element) and the local media representation, which is not true in the existing DASH specification.
  • the association relationship between the video media expression and the same navigation unit is established through association signaling.
  • the @associationId attribute refers to the identifier of the associated video media expression
  • the value of @id, @associationType may not appear. , indicating an unknown relationship, or adding a definition of an association, such as "accompany”.
  • the semantic differences between the elements of the media presentation description are reflected in the behavior of the client.
  • the client selects multiple media expressions in the same position in the navigation service.
  • the status is a description of the role element in the adaptation set element to which the media expression belongs.
  • the parameters of the usage description sub-element are main. It indicates that the media expression in the adaptation set is the main component in the media presentation.
  • the client selects the video media representations of the plurality of navigation units, requests the segments of the media expressions from the content server, processes them, and presents them to the user together. Things like these: selecting several video adaptation sets (video media representations), presenting them in what order, the location layout of the presentation, the presentation mode (moving image sequence), etc., can all be determined by the client. The decision can be made according to the user's instructions, the user's configuration of the client, the capabilities of the client, and the like.
  • the client selects the audio media representation of the navigation unit, acquires the segment expressed by the audio media, and plays the audio.
  • the client switches to the primary media presentation.
  • the switching process may include the following steps: the client first obtains a media presentation description of the main media presentation according to the pointer in the navigation unit, the second step analyzes the media presentation description of the main media presentation, and selects an appropriate media expression; Joining the main media presentation at a certain time, this is actually a seeking operation. If the navigation service is for a live media presentation service, then this time location is the time location of the media content in which the handover occurred, ie, interrupting the navigation service time location.
  • Example scenario S2 A signaling mechanism for navigating traffic is illustrated in the example scenario S2, and scenario S2 illustrates the composition of the MPD used to represent the navigation service.
  • the navigation description method takes a Universal Resource Identifier as a parameter, wherein the universal resource identifier is used to point to a media presentation, which is actually directed to the media presentation by a media presentation description directed to the media presentation.
  • the method identifier for the method, such as: urn:mpeg:dash:mosaic:2011.
  • the @smemeId of the Supplemental Property descriptor is the method identifier, which can represent the element containing the descriptor: an adaptation set or a media expression, which is an integral part of the navigation service.
  • the attribute @value of the descriptor is a parameter of the navigation service description method, and a universal resource identifier that points to the media presentation description of the main media presentation.
  • one video adaptation set (corresponding to one navigation unit) has two media representations.
  • One of them is a virtual media expression that does not contain any fragments, but refers to the main media presentation represented by the navigation unit, and actually points to the media presentation by pointing to the media presentation description presented by the media.
  • the template of the fragment does not appear at the level of the adaptation set element, but appears in the actual media expression element.
  • a referenced remote unit may only know its type after being parsed because of a remote unit It's just an XML object whose type may be a media presentation description type, or it may be a time period or an adaptation set. If you relax the compatibility restrictions, introducing a new element description in the media presentation description means citing a media expression so that ambiguity can be avoided. This element can be attributed to parent elements of different levels, such as adaptation sets, media expressions.
  • the Mediad Presentation (ReferencedMediaPresentation) in the example of the example scenario S4 is a specific implementation.
  • An example of an aggregated media presentation description is given in example scenario S5.
  • the aggregated media presentation description is MPD, which is a superset of MPD. It describes multiple parallel media presentations, including member media rendering and navigation media rendering.
  • the presentation element is introduced in the aggregated media presentation description, which can be a remote element, pointing to a media presentation description, or an embedded media presentation description.
  • the media presentation description of the member media presentation is a remote element, while the navigation media presentation is local and is an embedded media presentation description.
  • Embodiments of the present invention also provide related apparatus for implementing the above solution.
  • an embodiment of the present invention provides a client 400, which may include:
  • the first obtaining unit 410 is configured to obtain a media presentation description of the navigation media presentation, where the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, where the N is An integer greater than one;
  • the second obtaining unit 420 is configured to acquire K navigation units of the N navigation units according to the media presentation description presented by the navigation media;
  • a presentation unit 430 configured to present the K navigation units, each navigation unit of the K navigation units pointing to a main media presentation, wherein the navigation unit i in the K navigation units is pointed
  • the presentation quality of the primary media presentation is higher than the presentation quality of the navigation unit i.
  • the media presentation description presented by the navigation media may be different from the media presentation of the primary media presentation pointed to by each navigation unit of the K navigation units.
  • the navigation media presentation may have an independent media presentation description, and each of the K navigation units may be directed to the primary media presentation or may be independent of the media presentation presented by the navigation media.
  • K navigation units point to K main media presentations, and K main media presentations respectively have corresponding media presentation descriptions, ie, K media presentation descriptions, and the media presentation descriptions presented by the navigation media are different from the K media. Any one of the presentation descriptions, that is, the navigation media presentation can be described by the K+1th media presentation.
  • the media presentation description of the navigation media presentation and the media presentation of the primary media presentation pointed to by each navigation unit of the K navigation units can be aggregated to form an aggregated media presentation description (also referred to as a supermedia presentation description).
  • An aggregated media presentation description (which may be referred to as a supermedia presentation description) may be utilized to describe the primary media presentation to which the navigation media presentation and navigation media presentations are directed.
  • the introduction of the supermedia presentation description facilitates enhancing the association between the navigation media presentation and the guided master media presentation.
  • the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
  • each of the K navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
  • the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
  • the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the K navigation units may be aggregated to form an aggregated media presentation description.
  • each of the K navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
  • each of the N navigation units includes a video component, or each of the N navigation units includes audio Component and video components, further, the navigation unit may also include subtitle components or other types of media components.
  • the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
  • the media presentation description can inform the client which navigation unit consists of a navigation unit, the components of the navigation unit, the relationship between the navigation unit and the media presentation of the members of the navigation service, and the relationship between the video components of the navigation unit, The relationship between the audio components of the navigation unit, the relationship between the audio component and the video component of the navigation unit, and the like.
  • the video components included in different navigation units of the K navigation units are media expressions in different video adaptation sets in the K video adaptation sets, where The media representations in any one of the K video adaptation sets have selective mutual exclusion, and the different video adaptation sets in the K video adaptation sets have selection compatibility.
  • the video components included in the navigation unit i in the K navigation units may be attributed to the video adaptation set Ci in the K video adaptation sets, which are included in the navigation unit j of the K navigation units.
  • the video component may be attributed to a video adaptation set Cj in the K video adaptation sets, wherein the video adaptation set Cj and the video adaptation set Ci are two different video adaptation sets in the K video adaptation sets.
  • the navigation unit j and the navigation unit i can be any two navigation units of the K navigation units.
  • selection compatibility means that these objects can be selected at the same time. For example, if there is selection compatibility between different video adaptation sets in the K video adaptation sets, it means that K video adaptation sets can be selected at the same time. Media representation in multiple video adaptation sets.
  • the so-called mutual exclusion means that these objects are not supported at the same time. For example, if there is selective mutual exclusion between media expressions in any one of the K video adaptation sets, it means that one is not supported at the same time.
  • Multiple media representations in the video adaptation set for example, assuming that the video adaptation set I in the K video adaptation sets includes more than 10 media representations, if there is selective mutual exclusion between media representations in the video adaptation set, then each Only one of the 10 media expressions can be selected at a time, and multiple of the 10 media expressions cannot be selected at the same time.
  • the K navigation unit includes an audio component that is a media representation in an audio adaptation set, where the audio adaptation set is different from the K video adaptation sets.
  • the audio component adaptation set has a selection compatibility with the K video adaptation sets. For example, suppose the audio adaptation set includes 20 multiple media representations, and if there is selective mutual exclusion between media expressions in the audio adaptation set, then only one of 20 multiple media expressions can be selected at a time, and Multiple of the 30 media expressions cannot be selected at the same time.
  • the audio components included in the different navigation units of the K navigation units are media expressions in different audio adaptation sets in the K audio adaptation sets.
  • the different audio adaptation sets in the K audio adaptation sets have selective mutual exclusion.
  • the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
  • media representations including media presentation elements of the same region description have an association relationship, or an adaptation set element including the same region description There is an association between the sets.
  • media expression element i The media expression is expressed as a media expression ri
  • the media expression described by the media expression element j is expressed as a media expression rj. If the media expression element i and the media expression element j contain the same region description, the media expression ri and the media expression rj may be represented. There is an association between them.
  • the media expression element i and the adaptation set element ci include the same area description, and then may also describe the media expression and the adaptation set element ci described by the media expression element i.
  • the media expressions in the adaptation set have an association relationship, for example, the media expression element i can be an audio media expression, and the media expression in the adaptation set described by the adaptation set element ci can be a video media expression.
  • the area description may be a spatial relationship description (SRD).
  • SRD spatial relationship description
  • the area description can also be other types of descriptive information that can be used to describe the location area.
  • the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements and the K video One-to-one correspondence between adaptation sets.
  • the K video adaptation set elements include a description sub-element Ci
  • the video adaptation sets described in the K video adaptation set elements satisfying the set common condition video adaptation set element have a selection
  • the set commonality condition may be the same for both the element name and the method identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set element.
  • the description sub-element Ci may describe that the media representation in the video adaptation set described by the video adaptation set element including the description sub-element Ci is the navigation medium.
  • the components of the presentation may describe a role that the media expression in the video adaptation set corresponding to the video adaptation set element corresponding to the description sub-element Ci is presented in the navigation medium, and the role may be, for example, primary, supplementary, subtitle or Translation dubbing, etc.
  • the description sub-element Ci may be, for example, an EssentialProptery element or a Supplemental Proptery element or a Role element or other elements.
  • the description sub-element Ci is an action description Role element
  • the set commonality condition may be a descriptive sub-element included in the video adaptation set element.
  • the element names of Ci can be the same, the method identification schemeIdUri attribute can be the same, and the parameter attribute can be the same.
  • the media presentation description of the navigation media presentation includes the K video adaptation set elements, the K video adaptation set elements and K videos One-to-one correspondence between adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the K video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I may be the K Any video adaptation set in a video adaptation set.
  • the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
  • the pointer can be carried by an attribute of the video adaptation set element VI.
  • the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
  • the pointer can be carried by an EssentialProptery element or a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a child element in an EssentialProptery element of the video adaptation set element VI, or the pointer may be carried by an attribute of an EssentialProptery element in the video adaptation set element VI;
  • the pointer may be carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer may be carried by an attribute of a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a value attribute or other attribute of an EssentialProptery element in the video adaptation set element VI, or the pointer may be a value attribute of a SupplementalProperty element in the video adaptation set element VI or other Property bearer.
  • the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, where The virtual Representation element does not include a media segment template element, a media segment list element, and a base Uniform Resource Locator (BaseURL) element.
  • BaseURL base Uniform Resource Locator
  • the pointer can also be carried by a Mediad Presentation (ReferencedMediaPresentation) element in the Video Adaptation Set Element VI.
  • ReferencedMediaPresentation element Is a new extension of the element, that is, the pointer can be carried by the newly extended element in the video adaptation set element VI, the newly extended bearer in the video adaptation set element VI.
  • the name of the element of the pointer is not limited to the ReferencedMediaPresentation, and may be other element names.
  • the time structure of the navigation media presentation may be independent of the time of presentation of the primary media pointed by the K navigation units in the navigation media presentation.
  • the audio of the navigation unit may be obtained by encoding the audio presented by the main media
  • the video of the navigation unit may be obtained by encoding the video presented by the main media, which may make the time structure of the navigation unit and There is no correlation between the time structures presented by the main media.
  • the presenting unit is further configured to present the navigation unit if the focus of attention stays in the navigation unit i in the K navigation units.
  • the presentation unit is further configured to acquire, by using the navigation unit i in the K navigation units, the navigation unit i Point to the main media presentation. Further, the client may also present a primary media presentation pointed by the navigation unit i.
  • the client 400 can be, for example, a personal computer, a mobile phone, a tablet computer, a television set or a set top box.
  • the client 400 can be used to implement any of the media presentation navigation methods based on the hypertext transfer protocol media stream provided by the foregoing embodiments.
  • each navigation unit of the K navigation units can respectively point to one main media presentation, this is equivalent to the introduction between the navigation unit and the main media presentation.
  • the association relationship which enables the client 400 to obtain a media presentation description of the main media presentation j pointed to by the navigation unit i in the case where the navigation unit i of the K navigation units is selected, and thus Obtaining the main media presentation j according to the media presentation description of the primary media presentation j, which is visible to facilitate a more flexible switching between the navigation media presentation and the primary media presentation.
  • the video navigation is supported in the HTTP-based media streaming service scenario, which is beneficial to enhance the user's high-quality experience.
  • a client 500 may include:
  • the processor 502 and the memory 503 are coupled by a bus 501.
  • the processor 502 by calling a code or instruction in the memory 503, for obtaining a media presentation description of the navigation media presentation, wherein the media presentation description of the navigation media presentation describes the navigation media presentation N navigation units included, the N is an integer greater than 1; acquiring K navigation units of the N navigation units according to the media presentation description presented by the navigation media; presenting the K guides a navigation unit, each navigation unit of the K navigation unit points to a main media presentation, wherein a presentation quality of the main media presentation pointed by the navigation unit i in the K navigation units is higher than the guide View the presentation quality of unit i.
  • the media presentation description presented by the navigation media may be different from the media presentation of the primary media presentation pointed to by each navigation unit of the K navigation units.
  • the navigation media presentation may have an independent media presentation description, and each of the K navigation units may be directed to the primary media presentation or may be independent of the media presentation presented by the navigation media.
  • K navigation units point to K main media presentations, and K main media presentations respectively have corresponding media presentation descriptions, ie, K media presentation descriptions, and the media presentation descriptions presented by the navigation media are different from the K media. Any one of the presentation descriptions, that is, the navigation media presentation can be described by the K+1th media presentation.
  • the media presentation description presented by the navigation media and the media presentation description of the primary media presentation pointed to by each navigation unit of the K navigation units may be aggregated.
  • An aggregated media presentation description (also referred to as a supermedia presentation description) is formed.
  • An aggregated media presentation description (which may be referred to as a supermedia presentation description) may be utilized to describe the primary media presentation to which the navigation media presentation and navigation media presentations are directed. The introduction of the supermedia presentation description facilitates enhancing the association between the navigation media presentation and the guided master media presentation.
  • the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
  • each of the K navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
  • the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
  • the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the K navigation units may be aggregated to form an aggregated media presentation description.
  • each of the K navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
  • each of the N navigation units includes a video component, or each of the N navigation units includes audio Component and video components, further, the navigation unit may also include subtitle components or other types of media components.
  • the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
  • the media presentation description can inform the client which navigation unit consists of a navigation unit, the components of the navigation unit, the relationship between the navigation unit and the media presentation of the members of the navigation service, and the relationship between the video components of the navigation unit, The relationship between the audio components of the navigation unit, the relationship between the audio component and the video component of the navigation unit, and the like.
  • the video components included in different navigation units of the K navigation units are media expressions in different video adaptation sets in the K video adaptation sets, where The media representations in any one of the K video adaptation sets have selective mutual exclusion, and the different video adaptation sets in the K video adaptation sets have selection compatibility.
  • the video components included in the navigation unit i in the K navigation units may be attributed to the video adaptation set Ci in the K video adaptation sets, which are included in the navigation unit j of the K navigation units.
  • the video component may be attributed to a video adaptation set Cj in the K video adaptation sets, wherein the video adaptation set Cj and the video adaptation set Ci are two different video adaptation sets in the K video adaptation sets.
  • the navigation unit j and the navigation unit i can be any two navigation units of the K navigation units.
  • selection compatibility means that these objects can be selected at the same time. For example, if there is selection compatibility between different video adaptation sets in the K video adaptation sets, it means that K video adaptation sets can be selected at the same time. Media representation in multiple video adaptation sets.
  • the so-called mutual exclusion means that these objects are not supported at the same time. For example, if there is selective mutual exclusion between media expressions in any one of the K video adaptation sets, it means that one is not supported at the same time.
  • Multiple media representations in the video adaptation set for example, assuming that the video adaptation set I in the K video adaptation sets includes more than 10 media representations, if there is selective mutual exclusion between media representations in the video adaptation set, then each Only one of the 10 media expressions can be selected at a time, and multiple of the 10 media expressions cannot be selected at the same time.
  • the K navigation unit includes an audio component that is a media representation in an audio adaptation set, where the audio adaptation set is different from the K video adaptation sets.
  • the audio component adaptation set has a selection compatibility with the K video adaptation sets. For example, suppose the audio adaptation set includes 20 multiple media representations, and if there is selective mutual exclusion between media expressions in the audio adaptation set, then only one of 20 multiple media expressions can be selected at a time, and Multiple of the 30 media expressions cannot be selected at the same time.
  • the audio components included in the different navigation units of the K navigation units are media expressions in different audio adaptation sets in the K audio adaptation sets.
  • the different audio adaptation sets in the K audio adaptation sets have selective mutual exclusion.
  • the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
  • media representations including media presentation elements of the same region description have an association relationship, or an adaptation set element including the same region description There is an association between the sets.
  • the media expression described by the media expression element i is expressed as a media expression ri
  • the media expression described by the media expression element j is expressed as a media expression rj. If the media expression element i and the media expression element j contain the same regional description, the media expression ri may be represented. There is a relationship with the media expression rj.
  • the media expression element i and the adaptation set element ci include the same area description, and then may also describe the media expression and the adaptation set element ci described by the media expression element i.
  • the element i may be an audio media representation
  • the media representation in the adaptation set described by the adaptation set element ci may be a video media representation.
  • the area description may be a spatial relationship description (SRD).
  • SRD spatial relationship description
  • the area description can also be other types of descriptive information that can be used to describe the location area.
  • the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements and the K video One-to-one correspondence between adaptation sets.
  • the K video adaptation set elements include a description sub-element Ci
  • the video adaptation sets described in the K video adaptation set elements satisfying the set common condition video adaptation set element have a selection
  • the set commonality condition may be the same for both the element name and the method identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set element.
  • the description sub-element Ci may describe that the media representation in the video adaptation set described by the video adaptation set element including the description sub-element Ci is the navigation medium.
  • the components of the presentation may describe a role that the media expression in the video adaptation set corresponding to the video adaptation set element corresponding to the description sub-element Ci is presented in the navigation medium, and the role may be, for example, primary, supplementary, subtitle or Translation dubbing, etc.
  • the description sub-element Ci may be, for example, an EssentialProptery element or a Supplemental Proptery element or a Role element or other elements.
  • the description sub-element Ci is an action description Role element
  • the set commonality condition may be an element name of the descriptive sub-element Ci included in the video adaptation set element.
  • the same method and method identification schemeIdUri attributes can be the same, and the parameter attributes can be the same.
  • the media presentation description of the navigation media presentation includes the K video adaptation set elements, the K video adaptation set elements and K videos One-to-one correspondence between adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the K video adaptation set elements includes a pointer for pointing to a primary media presentation, where the video is suitable
  • the set I can be any one of the K video adaptation sets.
  • the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
  • the pointer can be carried by an attribute of the video adaptation set element VI.
  • the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
  • the pointer can be carried by an EssentialProptery element or a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a child element in an EssentialProptery element of the video adaptation set element VI, or the pointer may be carried by an attribute of an EssentialProptery element in the video adaptation set element VI;
  • the pointer may be carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer may be carried by an attribute of a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a value attribute or other attribute of an EssentialProptery element in the video adaptation set element VI, or the pointer may be a value attribute of a SupplementalProperty element in the video adaptation set element VI or other Property bearer.
  • the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, where The virtual Representation element does not include a media segment template element, a media segment list element, and a base Uniform Resource Locator (BaseURL) element.
  • BaseURL base Uniform Resource Locator
  • the pointer can also be carried by a Mediad Presentation (ReferencedMediaPresentation) element in the Video Adaptation Set Element VI.
  • the ReferencedMediaPresentation element is an element of the new extension, that is, the pointer can be carried by the newly extended element in the Video Adaptation Set Element VI, which is newly expanded in the Video Adaptation Set Element VI.
  • the name of the element carrying the pointer is not limited to ReferencedMediaPresentation, but may be other element names.
  • the time structure of the navigation media presentation may be independent of the primary media presentation pointed by the K navigation units in the navigation media presentation.
  • the audio of the navigation unit may be obtained by encoding the audio presented by the main media
  • the video of the navigation unit may be obtained by encoding the video presented by the main media, which may make the time structure of the navigation unit and There is no correlation between the time structures presented by the main media.
  • the processor is further configured to present the navigation unit if the focus of attention stays in the navigation unit i in the K navigation units.
  • the processor is further configured to acquire, by using the navigation unit i in the K navigation units, the navigation unit i Point to the main media presentation. Further, the client may also present a primary media presentation pointed by the navigation unit i.
  • the client 500 can be, for example, a personal computer, a mobile phone, a tablet computer, a television set or a set top box.
  • the client 500 in this embodiment may be specifically implemented according to the method in the foregoing method embodiment.
  • the client 500 can be used to implement any of the media presentation navigation methods based on the hypertext transfer protocol media stream provided by the foregoing embodiments.
  • each navigation unit of the K navigation units can respectively point to one main media presentation, this is equivalent to the introduction between the navigation unit and the main media presentation.
  • Correlation relationship which enables the client 500 to obtain a media presentation description of the main media presentation j pointed to by the navigation unit i in the case where the navigation unit i of the K navigation units is selected, and thus Obtaining the main media presentation j according to the media presentation description of the primary media presentation j, which is convenient to implement a flexible switch between the navigation media presentation and the primary media presentation, thereby implementing the HTTP-based media streaming service.
  • Video navigation is supported in the scenario, which in turn helps to enhance the user's high-quality experience.
  • an embodiment of the present invention provides a server 600, which may include:
  • the determining unit 610 is configured to determine N navigation units included in the navigation media presentation.
  • a generating unit 620 configured to generate a media presentation description of the navigation media presentation, where the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, where N is An integer greater than 1; each of the N navigation units points to a primary media presentation, and the presentation quality of the primary media presented by the navigation unit i in the N navigation units is higher than The presentation quality of the navigation unit i is described.
  • the presentation quality of the main media presentation pointed by the navigation unit i in the N navigation units is higher than the presentation quality of the navigation unit i. That is to say, the presentation quality of the media expression of the navigation unit is lower than the presentation quality of the main media presentation represented by the navigation unit.
  • the media presentation description of the navigation media presentation may be different from the media presentation of the primary media presentation pointed to by each of the N navigation units.
  • description that is, the navigation media presentation may have an independent media presentation description, and each of the N navigation units may also have a media presentation that is independent of the presentation of the navigation media.
  • N navigation units point to N main media presentations, and N main media presentations respectively have corresponding media presentation descriptions, ie, N media presentation descriptions, and the media presentation descriptions presented by the navigation media are different from the N media.
  • Any one of the presentation descriptions, that is, the navigation media presentation can be described by the K+1th media presentation.
  • the media presentation description presented by the navigation media and the media presentation description of the primary media presentation pointed to by each navigation unit of the N navigation units may be aggregated.
  • An aggregated media presentation description (also referred to as a supermedia presentation description) is formed.
  • An aggregated media presentation description (which may be referred to as a supermedia presentation description) may be utilized to describe the primary media presentation to which the navigation media presentation and navigation media presentations are directed. The introduction of the supermedia presentation description facilitates enhancing the association between the navigation media presentation and the guided master media presentation.
  • the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
  • each of the N navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
  • the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
  • the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the N navigation units may be aggregated to form an aggregated media presentation description.
  • each of the N navigation units may present the presentation in the aggregated media description by reference. The way the element points to a main media presentation.
  • each of the N navigation units includes a video component, or each of the N navigation units includes audio Component and video components, further, the navigation unit may also include subtitle components or other types of media components.
  • the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
  • the media presentation description can inform the client which navigation unit consists of a navigation unit, the components of the navigation unit, the relationship between the navigation unit and the media presentation of the members of the navigation service, and the relationship between the video components of the navigation unit, The relationship between the audio components of the navigation unit, the relationship between the audio component and the video component of the navigation unit, and the like.
  • the video components included in the different navigation units of the N navigation units are media expressions in different video adaptation sets in the N video adaptation sets, where The media representations in any one of the N video adaptation sets have selective mutual exclusion, and the different video adaptation sets in the N video adaptation sets have selection compatibility.
  • the video component included in the navigation unit i in the N navigation units may be attributed to the video adaptation set Ci in the N video adaptation sets, where the navigation unit j included in the N navigation units
  • the video component may be attributed to a video adaptation set Cj in the N video adaptation sets, wherein the video adaptation set Cj and the video adaptation set Ci are two different video adaptation sets in the N video adaptation sets.
  • the navigation unit j and the navigation unit i can be any two navigation units of the N navigation units.
  • selection compatibility means that these objects can be selected at the same time. For example, if there is selection compatibility between different video adaptation sets in the N video adaptation sets, it means that N video adaptation sets can be selected at the same time. Media representation in multiple video adaptation sets.
  • mutual exclusion indicating that these objects do not support simultaneous selection, for example, if there is selective mutual exclusion between media expressions in any one of the N video adaptation sets, indicating that one is not supported at the same time.
  • Multiple media representations in the video adaptation set for example, assuming that the video adaptation set I in the N video adaptation sets includes 10 multiple media representations, if there is selective mutual exclusion between media expressions in the video adaptation set, then each Only one of the 10 media expressions can be selected at a time, and multiple of the 10 media expressions cannot be selected at the same time.
  • the N navigation unit includes an audio component that is a media expression in an audio adaptation set, where the audio adaptation set is different from the N video adaptation sets.
  • the audio component adaptation set has a selection compatibility with the N video adaptation sets. For example, suppose the audio adaptation set includes 20 multiple media representations, and if there is selective mutual exclusion between media expressions in the audio adaptation set, then only one of 20 multiple media expressions can be selected at a time, and Multiple of the 30 media expressions cannot be selected at the same time.
  • the audio components included in the different navigation units of the N navigation units are media expressions in different audio adaptation sets in the N audio adaptation sets.
  • the different audio adaptation sets in the N audio adaptation sets have selective mutual exclusion.
  • the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
  • media representations including media presentation elements of the same region description have an association relationship, or an adaptation set element including the same region description There is an association between the sets.
  • the media expression described by the media expression element i is expressed as a media expression ri
  • the media expression described by the media expression element j is expressed as a media expression rj. If the media expression element i and the media expression element j contain the same regional description, the media expression ri may be represented. There is a relationship with the media expression rj.
  • the media expression element i and the adaptation set element ci include the same area description, and then may also describe the media expression and the adaptation set element ci described by the media expression element i.
  • the media expressions in the adaptation set have an association relationship, for example, the media expression element i can be an audio media expression, and the media expression in the adaptation set described by the adaptation set element ci can be a video media expression.
  • the area description may be a spatial relationship description (SRD).
  • SRD spatial relationship description
  • the area description can also be other types of descriptive information that can be used to describe the location area.
  • the media presented by the navigation media is The N video adaptation set elements are included in the description, and the N video adaptation set elements are in one-to-one correspondence with the N video adaptation sets.
  • the N video adaptation set elements include a description sub-element Ci
  • the video adaptation sets described in the N video adaptation set elements satisfying the set common condition video adaptation set element have a selection
  • the set commonality condition may be the same for both the element name and the method identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set element.
  • the description sub-element Ci may describe that the media representation in the video adaptation set described by the video adaptation set element including the description sub-element Ci is the navigation medium.
  • the components of the presentation may describe a role that the media expression in the video adaptation set corresponding to the video adaptation set element corresponding to the description sub-element Ci is presented in the navigation medium, and the role may be, for example, primary, supplementary, subtitle or Translation dubbing, etc.
  • the description sub-element Ci may be, for example, an EssentialProptery element or a Supplemental Proptery element or a Role element or other elements.
  • the description sub-element Ci is an action description Role element
  • the set commonality condition may be an element name of the descriptive sub-element Ci included in the video adaptation set element.
  • the same method and method identification schemeIdUri attributes can be the same, and the parameter attributes can be the same.
  • the media presentation description of the navigation media presentation includes the N video adaptation set elements, the N video adaptation set elements and N videos One-to-one correspondence between adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the N video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I may be the N Any video adaptation set in a video adaptation set.
  • the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
  • the pointer can be carried by an attribute of the video adaptation set element VI.
  • the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
  • the pointer may be an EssentialProptery element in the video adaptation set element VI Or the SupplementalProperty element is hosted.
  • the pointer may be carried by a child element in an EssentialProptery element of the video adaptation set element VI, or the pointer may be carried by an attribute of an EssentialProptery element in the video adaptation set element VI;
  • the pointer may be carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer may be carried by an attribute of a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a value attribute or other attribute of an EssentialProptery element in the video adaptation set element VI, or the pointer may be a value attribute of a SupplementalProperty element in the video adaptation set element VI or other Property bearer.
  • the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, where The virtual Representation element does not include a media segment template element, a media segment list element, and a base Uniform Resource Locator (BaseURL) element.
  • BaseURL base Uniform Resource Locator
  • the pointer can also be carried by a Mediad Presentation (ReferencedMediaPresentation) element in the Video Adaptation Set Element VI.
  • the ReferencedMediaPresentation element is an element of the new extension, that is, the pointer can be carried by the newly extended element in the Video Adaptation Set Element VI, which is newly expanded in the Video Adaptation Set Element VI.
  • the name of the element carrying the pointer is not limited to ReferencedMediaPresentation, but may be other element names.
  • the time structure of the navigation media presentation may be independent of the time of presentation of the primary media pointed by the N navigation units in the navigation media presentation.
  • the audio of the navigation unit may be obtained by encoding the audio presented by the main media
  • the video of the navigation unit may be obtained by encoding the video presented by the main media, which may make the time structure of the navigation unit and There is no correlation between the time structures presented by the main media.
  • server 600 can be used to implement any of the foregoing embodiments.
  • the server 600 can be a content server or other server.
  • the navigation media presentation generated by the navigation media generated by the server 600 presents the N navigation units included in the navigation media presentation, because each of the N navigation units The viewing unit can respectively point to a main media presentation, which is equivalent to a certain association relationship introduced between the navigation unit and the main media presentation, which causes the client to be selected in the navigation unit i of the N navigation units.
  • the client can obtain the media presentation description of the main media presentation j pointed to by the navigation unit i, and then the main media presentation j can be obtained according to the media presentation description of the primary media presentation j, and visible.
  • This solution lays the foundation for the flexible switching between the navigation media presentation and the main media presentation, and lays a foundation for supporting video navigation in the HTTP-based media streaming service scenario.
  • a server 700 may include:
  • the processor 702 and the memory 703 are coupled by a bus 701.
  • the processor 702 by calling code or instructions in the memory 703, for determining that the navigation media presentation includes N navigation units; generating a media presentation description of the navigation media presentation, the navigation media presentation
  • the media presentation description describes N navigation units included in the navigation media presentation, the N being an integer greater than one; each of the N navigation units pointing to a primary media presentation,
  • the presentation quality of the primary media presentation pointed to by the navigation unit i in the N navigation units is higher than the presentation quality of the navigation unit i.
  • the presentation quality of the main media presentation pointed by the navigation unit i in the N navigation units is higher than the presentation quality of the navigation unit i. That is to say, the presentation quality of the media expression of the navigation unit is lower than the presentation quality of the main media presentation represented by the navigation unit.
  • the media presentation description of the navigation media presentation may be different from the media presentation of the primary media presentation pointed to by each of the N navigation units. description. That is, the navigation media presentation may have an independent media presentation description, and each of the N navigation units may be independent of the navigation.
  • the media presentation of the media presentation presented by the media presentation For example, N navigation units point to N main media presentations, and N main media presentations respectively have corresponding media presentation descriptions, ie, N media presentation descriptions, and the media presentation descriptions presented by the navigation media are different from the N media. Any one of the presentation descriptions, that is, the navigation media presentation can be described by the K+1th media presentation.
  • the media presentation description presented by the navigation media and the media presentation description of the primary media presentation pointed to by each navigation unit of the N navigation units may be aggregated.
  • An aggregated media presentation description (also referred to as a supermedia presentation description) is formed.
  • An aggregated media presentation description (which may be referred to as a supermedia presentation description) may be utilized to describe the primary media presentation to which the navigation media presentation and navigation media presentations are directed. The introduction of the supermedia presentation description facilitates enhancing the association between the navigation media presentation and the guided master media presentation.
  • the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
  • each of the N navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
  • the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
  • the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the N navigation units may be aggregated to form an aggregated media presentation description.
  • each of the N navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
  • each of the N navigation units includes a video component, or each of the N navigation units includes audio Component and video components, further, the navigation unit may also include subtitle components or other types of media components.
  • the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
  • the media presentation description can inform the client which navigation unit consists of a navigation unit, the components of the navigation unit, the relationship between the navigation unit and the media presentation of the members of the navigation service, and the relationship between the video components of the navigation unit, The relationship between the audio components of the navigation unit, the relationship between the audio component and the video component of the navigation unit, and the like.
  • the video components included in the different navigation units of the N navigation units are media expressions in different video adaptation sets in the N video adaptation sets, where The media representations in any one of the N video adaptation sets have selective mutual exclusion, and the different video adaptation sets in the N video adaptation sets have selection compatibility.
  • the video component included in the navigation unit i in the N navigation units may be attributed to the video adaptation set Ci in the N video adaptation sets, where the navigation unit j included in the N navigation units
  • the video component may be attributed to a video adaptation set Cj in the N video adaptation sets, wherein the video adaptation set Cj and the video adaptation set Ci are two different video adaptation sets in the N video adaptation sets.
  • the navigation unit j and the navigation unit i can be any two navigation units of the N navigation units.
  • selection compatibility means that these objects can be selected at the same time. For example, if there is selection compatibility between different video adaptation sets in the N video adaptation sets, it means that N video adaptation sets can be selected at the same time. Media representation in multiple video adaptation sets.
  • mutual exclusion indicating that these objects do not support simultaneous selection, for example, if there is selective mutual exclusion between media expressions in any one of the N video adaptation sets, indicating that one is not supported at the same time.
  • Multiple media representations in the video adaptation set for example, assuming that the video adaptation set I in the N video adaptation sets includes 10 multiple media representations, if there is selective mutual exclusion between media expressions in the video adaptation set, then each Only one of the 10 media expressions can be selected at a time, and multiple of the 10 media expressions cannot be selected at the same time.
  • the N navigation unit includes an audio component that is a media expression in an audio adaptation set, where the audio adaptation set is different from the N video adaptation sets.
  • the audio component adaptation set has a selection compatibility with the N video adaptation sets. For example, suppose the audio adaptation set includes 20 multiple media representations, and if there is selective mutual exclusion between media expressions in the audio adaptation set, then only one of 20 multiple media expressions can be selected at a time, and Multiple of the 30 media expressions cannot be selected at the same time.
  • the audio components included in the different navigation units of the N navigation units are media expressions in different audio adaptation sets in the N audio adaptation sets.
  • the different audio adaptation sets in the N audio adaptation sets have selective mutual exclusion.
  • the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
  • media representations including media presentation elements of the same region description have an association relationship, or an adaptation set element including the same region description There is an association between the sets.
  • the media expression described by the media expression element i is expressed as a media expression ri
  • the media expression described by the media expression element j is expressed as a media expression rj. If the media expression element i and the media expression element j contain the same regional description, the media expression ri may be represented. There is a relationship with the media expression rj.
  • the media expression element i and the adaptation set element ci include the same area description, and then may also describe the media expression and the adaptation set element ci described by the media expression element i.
  • the media expressions in the adaptation set have an association relationship, for example, the media expression element i can be an audio media expression, and the media expression in the adaptation set described by the adaptation set element ci can be a video media expression.
  • the area description may be a spatial relationship description (SRD).
  • SRD spatial relationship description
  • the area description can also be other types of descriptive information that can be used to describe the location area.
  • the media presentation description of the navigation media presentation includes N video adaptation set elements, and the N video adaptation set elements and the N video One-to-one correspondence between adaptation sets.
  • the N video adaptation set elements include a description sub-element Ci
  • the video adaptation sets described in the N video adaptation set elements satisfying the set common condition video adaptation set element have a selection
  • the set commonality condition may be the same for both the element name and the method identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set element.
  • the description sub-element Ci may describe that the media representation in the video adaptation set described by the video adaptation set element including the description sub-element Ci is the navigation medium.
  • the components of the presentation may describe that the description sub-element Ci may describe that the media expression in the video adaptation set corresponding to the video adaptation set element including the description sub-element Ci is presented in the navigation media.
  • Roles, such as roles may be primary, supplementary, subtitle or translation dubbing.
  • the description sub-element Ci may be, for example, an EssentialProptery element or a Supplemental Proptery element or a Role element or other elements.
  • the description sub-element Ci is an action description Role element
  • the set commonality condition may be an element name of the descriptive sub-element Ci included in the video adaptation set element.
  • the same method and method identification schemeIdUri attributes can be the same, and the parameter attributes can be the same.
  • the media presentation description of the navigation media presentation includes the N video adaptation set elements, the N video adaptation set elements and N videos One-to-one correspondence between adaptation sets.
  • the video adaptation set element VI corresponding to the video adaptation set I of the N video adaptation set elements includes a pointer for pointing to a primary media presentation, and the video adaptation set I may be the N Any video adaptation set in a video adaptation set.
  • the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
  • the pointer can be carried by an attribute of the video adaptation set element VI.
  • the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
  • the pointer can be carried by an EssentialProptery element or a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a child element in an EssentialProptery element of the video adaptation set element VI, or the pointer may be carried by an attribute of an EssentialProptery element in the video adaptation set element VI;
  • the pointer may be carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer may be carried by an attribute of a SupplementalProperty element in the video adaptation set element VI.
  • the pointer may be carried by a value attribute or other attribute of an EssentialProptery element in the video adaptation set element VI, or the pointer may be a value attribute of a SupplementalProperty element in the video adaptation set element VI or other Property bearer.
  • the pointer can be virtual Representation in the video adaptation set element VI
  • the attribute of the element is carried, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, wherein the virtual Representation element does not include a media segment template element, a media segment list element, and a basic unification Resource Locator (BaseURL) element.
  • BaseURL basic unification Resource Locator
  • the pointer can also be carried by a Mediad Presentation (ReferencedMediaPresentation) element in the Video Adaptation Set Element VI.
  • the ReferencedMediaPresentation element is an element of the new extension, that is, the pointer can be carried by the newly extended element in the Video Adaptation Set Element VI, which is newly expanded in the Video Adaptation Set Element VI.
  • the name of the element carrying the pointer is not limited to ReferencedMediaPresentation, but may be other element names.
  • the time structure of the navigation media presentation may be independent of the time of presentation of the primary media pointed by the N navigation units in the navigation media presentation.
  • the audio of the navigation unit may be obtained by encoding the audio presented by the main media
  • the video of the navigation unit may be obtained by encoding the video presented by the main media, which may make the time structure of the navigation unit and There is no correlation between the time structures presented by the main media.
  • server 700 can be used to implement any of the media presentation navigation methods based on the hypertext transfer protocol media stream provided by the foregoing embodiments.
  • the server 700 can be a content server or other server.
  • the navigation media presentation generated by the navigation media generated by the server 700 presents N navigation units included in the navigation media presentation, because each of the N navigation units
  • the viewing unit can respectively point to a main media presentation, which is equivalent to a certain association relationship introduced between the navigation unit and the main media presentation, which causes the client to be selected in the navigation unit i of the N navigation units.
  • the client can obtain the media presentation description of the main media presentation j pointed to by the navigation unit i, and then the main media presentation j can be obtained according to the media presentation description of the primary media presentation j, and visible.
  • This solution lays the foundation for the flexible switching between the navigation media presentation and the main media presentation, and is implemented in the HTTP-based media streaming service scenario. Support for video navigation has laid the foundation.
  • an embodiment of the present invention further provides a communication system, which may include:
  • the client 810 is configured to obtain, from the content server 820, a media presentation description of the navigation media presentation, where the media presentation description presented by the navigation media describes the N navigations included in the navigation media presentation. a unit, the N being an integer greater than 1; acquiring, according to the media presentation description of the navigation media, K navigation units from the content navigation server 820; presenting the K navigation units Each of the K navigation units points to a primary media presentation, wherein the presentation quality of the primary media presentation pointed to by the navigation unit i in the K navigation units is higher than the navigation unit The presentation quality of i.
  • the client 810 can be any client provided by the foregoing embodiment, for example.
  • the content is based on the same concept as the method embodiment of the present invention.
  • the description in the method embodiment of the present invention and details are not described herein again.
  • the embodiment of the present invention further provides a computer storage medium, wherein the computer storage medium can store a program, and the program includes some or all of the steps of any one of the methods described in the foregoing method embodiments.
  • the disclosed apparatus may be implemented in other ways.
  • the device embodiments described above are merely illustrative.
  • the division of the above units is only a logical function division. In actual implementation, there may be another division manner.
  • multiple units or components may be combined or may be integrated into Another system, or some features can be ignored Or not.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be electrical or otherwise.
  • the units described above as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the above-described integrated unit if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium.
  • the instructions include a plurality of instructions for causing a computer device (which may be a personal computer, server or network device, etc., and in particular a processor in a computer device) to perform all or part of the steps of the above-described methods of various embodiments of the present invention.
  • the foregoing storage medium may include: a U disk, a mobile hard disk, a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM), and the like. The medium of the code.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Security & Cryptography (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

基于超文本传输协议媒体流的提供导览媒体呈现的方法和相关装置。一种基于超文本传输协议媒体流的媒体呈现导览方法,可包括:客户端获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;所述客户端根据所述导览媒体呈现的媒体呈现描述获取所述N个导览单元中的K个导览单元;所述客户端呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。本发明实施例的方案有利于在基于HTTP的媒体流服务场景下支持视频导览,进而提高用户体验。

Description

基于超文本传输协议媒体流的媒体呈现导览方法和相关装置 技术领域
本发明涉及数据传输领域,具体涉及基于超文本传输协议媒体流的媒体呈现导览方法和相关装置。
背景技术
基于超文本传输协议(HTTP,Hyper Text Transfer Protocol)媒体流的多媒体业务正日益发展,甚至挑战了传统的广播电视的地位。不过传统电视中一些业务,基于HTTP的媒体流服务还不支持,视频导览就是其中一项,这不能不说是一个缺憾。
发明内容
本发明提供了基于超文本传输协议媒体流的提供导览媒体呈现的方法和相关装置,以期能在基于HTTP的媒体流服务场景下支持视频导览,进而提高用户体验。
本发明实施例第一方面提供一种基于超文本传输协议媒体流的媒体呈现导览方法,可包括:
客户端获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;
所述客户端根据所述导览媒体呈现的媒体呈现描述获取所述N个导览单元中的K个导览单元;
所述客户端呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
结合第一方面,在第一方面的第一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
结合第一方面的第一种可能的实施方式,在第一方面的第二种可能的实施 方式中,所述K个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
结合第一方面,在第一方面的第三种可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
结合第一方面的第三种可能的实施方式,在第一方面的第四种可能的实施方式中,所述K个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
结合第一方面或第一方面的第一种至第四种可能的实施方式中的任意一种可能的实施方式,在第一方面的第五种可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
结合第一方面的第五种可能的实施方式,在第一方面的第六种可能的实施方式中,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。
结合第一方面的第六种可能的实施方式,在第一方面的第七种可能的实施方式中,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性;
或者,
所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
结合第一方面的第七种可能的实施方式,在第一方面的第八种可能的实施方式中,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
结合第一方面的第八种可能的实施方式,在第一方面的第九种可能的实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
结合第一方面的第八种可能的实施方式或第一方面的第九种可能的实施方式,在第一方面的第十种可能的实施方式中,所述区域说明为SRD空间关系描述。
结合第一方面的第七种至第十种可能的实施方式中的任意一种可能的实施方式,在第一方面的第十一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应;其中,所述K个视频适配集元素中包括描述子元素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别schemeIdUri属性均相同。
结合第一方面的第十一种可能的实施方式,在第一方面的第十二种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
结合第一方面的第十一种可能的实施方式,在第一方面的第十三种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
结合第一方面的第十二种可能的实施方式或第一方面的第十三种可能的实施方式,在第一方面的第十四种可能的实施方式中,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
结合第一方面的第十四种可能的实施方式,在第一方面的第十五种可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、方法识别schemeIdUri属性相同,且参数value属性相同。
结合第一方面的第四种至第十五种可能的实施方式中的任意一种可能的 实施方式,在第一方面的第十六种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视频适配集之间一一对应,
其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述K个视频适配集中任意一个视频适配集。
结合第一方面的第十六种可能的实施方式,在第一方面的第十七种可能的实施方式中,
所述指针由所述视频适配集元素VI的属性承载。
结合第一方面的第十七种可能的实施方式,在第一方面的第十八种可能的实施方式中,所述指针由所述视频适配集元素VI的xlink:href属性承载。
结合第一方面的第十六种可能的实施方式,在第一方面的第十九种可能的实施方式中,
所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
结合第一方面的第十六种可能的实施方式,在第一方面的第二十种可能的实施方式中,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
结合第一方面的第二十种可能的实施方式,在第一方面的第二十一种可能的实施方式中,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
结合第一方面的第十六种可能的实施方式,在第一方面的第二十二种可能的实施方式中,
所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素 的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
结合第一方面的第十六种可能的实施方式,在第一方面的第二十三种可能的实施方式中,所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
结合第一方面或第一方面的第一种至第二十三种可能的实施方式中的任意一种可能的实施方式,在第一方面的第二十四种可能的实施方式中,
所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现的时间结构。
结合第一方面或第一方面的第一种至第二十四种可能的实施方式中的任意一种可能的实施方式,在第一方面的第二十五种可能的实施方式中,所述方法还包括:在关注焦点停留在所述K个导览单元中的导览单元i的情况下,所述客户端呈现所述导览单元i的音频分量。
结合第一方面或第一方面的第一种至第二十五种可能的实施方式中的任意一种可能的实施方式,在第一方面的第二十六种可能的实施方式中,所述方法还包括:在所述K个导览单元中的导览单元i被选择的情况下,所述客户端获取所述导览单元i所指向的主媒体呈现。
本发明实施例第二方面提供一种基于超文本传输协议媒体流的媒体呈现导览方法,包括:
确定导览媒体呈现包括的N个导览单元;
生成导览媒体呈现的媒体呈现描述,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;所述N个导览单元中的每个导览单元指向一个主媒体呈现,其中,所述N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
结合第二方面,在第二方面的第一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述不同于所述K个导览单元中的每个导览单元所指向主媒体 呈现的媒体呈现描述。
结合第二方面的第一种可能的实施方式,在第二方面的第二种可能的实施方式中,所述N个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
结合第二方面,在第二方面的第三种可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
结合第二方面的第三种可能的实施方式,在第二方面的第四种可能的实施方式中,所述N个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
结合第二方面或第二方面的第一种至第四种可能的实施方式中的任意一种可能的实施方式,在第二方面的第五种可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
结合第二方面的第五种可能的实施方式,在第二方面的第六种可能的实施方式中,所述N个导览单元中的不同导览单元所包括的视频分量为N个视频适配集中的不同视频适配集中的媒体表达,其中,所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述N个视频适配集中的不同视频适配集之间具有选择相容性。
结合第二方面的第六种可能的实施方式,在第二方面的第七种可能的实施方式中,所述N个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述N个适配集中的任意一个适配集,所述音频分量适配集与所述N个视频适配集之间具有选择相容性;
或者,
所述N个导览单元中的不同导览单元所包括的音频分量为N个音频适配集中的不同音频适配集中的媒体表达,其中,所述N个音频适配集中的不同音频适配集之间具有选择互斥性。
结合第二方面的第七种可能的实施方式,在第二方面的第八种可能的实施 方式中,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
结合第二方面的第八种可能的实施方式,在第二方面的第九种可能的实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
结合第二方面的第八种可能的实施方式或第二方面的第九种可能的实施方式,在第二方面的第十种可能的实施方式中,所述区域说明为SRD空间关系描述。
结合第二方面的第七种至第十种可能的实施方式中的任意一种可能的实施方式,在第二方面的第十一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括N个视频适配集元素,所述N个视频适配集元素与所述N个视频适配集之间一一对应;其中,所述N个视频适配集元素中包括描述子元素Ci,所述N个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别schemeIdUri属性均相同。
结合第二方面的第十一种可能的实施方式,在第二方面的第十二种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
结合第二方面的第十一种可能的实施方式,在第二方面的第十三种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
结合第二方面的第十二种可能的实施方式或第二方面的第十三种可能的实施方式,在第二方面的第十四种可能的实施方式中,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
结合第二方面的第十四种可能的实施方式,在第二方面的第十五种可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、方法识别 schemeIdUri属性相同,且参数value属性相同。
结合第二方面的第四种至第十五种可能的实施方式中的任意一种可能的实施方式,在第二方面的第十六种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述N个视频适配集元素,所述N个视频适配集元素与N个视频适配集之间一一对应,
其中,所述N个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述N个视频适配集中任意一个视频适配集。
结合第二方面的第十六种可能的实施方式,在第二方面的第十七种可能的实施方式中,
所述指针由所述视频适配集元素VI的属性承载。
结合第二方面的第十七种可能的实施方式,在第二方面的第十八种可能的实施方式中,所述指针由所述视频适配集元素VI的xlink:href属性承载。
结合第二方面的第十六种可能的实施方式,在第二方面的第十九种可能的实施方式中,
所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
结合第二方面的第十六种可能的实施方式,在第二方面的第二十种可能的实施方式中,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
结合第二方面的第二十种可能的实施方式,在第二方面的第二十一种可能的实施方式中,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
结合第二方面的第十六种可能的实施方式,在第二方面的第二十二种可能 的实施方式中,
所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
结合第二方面的第十六种可能的实施方式,在第二方面的第二十三种可能的实施方式中,所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
结合第二方面或第二方面的第一种至第二十三种可能的实施方式中的任意一种可能的实施方式,在第二方面的第二十四种可能的实施方式中,
所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述N个导览单元所指向的主媒体呈现的时间结构。
本发明第三方面提供一种客户端,包括:
第一获取单元,用于获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;
第二获取单元,用于根据所述导览媒体呈现的媒体呈现描述获取所述N个导览单元中的K个导览单元;
呈现单元,用于呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
结合第三方面,在第三方面的第一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
结合第三方面的第一种可能的实施方式,在第三方面的第二种可能的实施方式中,所述K个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
结合第三方面,在第三方面的第三种可能的实施方式中,所述导览媒体呈 现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
结合第三方面的第三种可能的实施方式,在第三方面的第四种可能的实施方式中,所述K个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
结合第三方面或第三方面的第一种至第四种可能的实施方式中的任意一种可能的实施方式,在第三方面的第五种可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
结合第三方面的第五种可能的实施方式,在第三方面的第六种可能的实施方式中,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。
结合第三方面的第六种可能的实施方式,在第三方面的第七种可能的实施方式中,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性;
或者,
所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
结合第三方面的第七种可能的实施方式,在第三方面的第八种可能的实施方式中,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
结合第三方面的第八种可能的实施方式,在第三方面的第九种可能的实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
结合第三方面的第八种可能的实施方式或第三方面的第九种可能的实施方式,在第三方面的第十种可能的实施方式中,所述区域说明为SRD空间关系描述。
结合第三方面的第七种至第十种可能的实施方式中的任意一种可能的实施方式,在第三方面的第十一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应;其中,所述K个视频适配集元素中包括描述子元素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别schemeIdUri属性均相同。
结合第三方面的第十一种可能的实施方式,在第三方面的第十二种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
结合第三方面的第十一种可能的实施方式,在第三方面的第十三种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
结合第三方面的第十二种可能的实施方式或第三方面的第十三种可能的实施方式,在第三方面的第十四种可能的实施方式中,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
结合第三方面的第十四种可能的实施方式,在第三方面的第十五种可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、方法识别schemeIdUri属性相同,且参数value属性相同。
结合第三方面的第四种至第十五种可能的实施方式中的任意一种可能的实施方式,在第三方面的第十六种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视频适配集之间一一对应,
其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述K个视频适配集中任意一个视频适配集。
结合第三方面的第十六种可能的实施方式,在第三方面的第十七种可能的实施方式中,
所述指针由所述视频适配集元素VI的属性承载。
结合第三方面的第十七种可能的实施方式,在第三方面的第十八种可能的实施方式中,所述指针由所述视频适配集元素VI的xlink:href属性承载。
结合第三方面的第十六种可能的实施方式,在第三方面的第十九种可能的实施方式中,
所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
结合第三方面的第十六种可能的实施方式,在第三方面的第二十种可能的实施方式中,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
结合第三方面的第二十种可能的实施方式,在第三方面的第二十一种可能的实施方式中,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
结合第三方面的第十六种可能的实施方式,在第三方面的第二十二种可能的实施方式中,
所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
结合第三方面的第十六种可能的实施方式,在第三方面的第二十三种可能的实施方式中,所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
结合第三方面或第三方面的第一种至第二十三种可能的实施方式中的任意一种可能的实施方式,在第三方面的第二十四种可能的实施方式中,
所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现的时间结构。
结合第三方面或第三方面的第一种至第二十四种可能的实施方式中的任意一种可能的实施方式,在第三方面的第二十五种可能的实施方式中,所述呈现单元还用于,在关注焦点停留在所述K个导览单元中的导览单元i的情况下,呈现所述导览单元i的音频分量。
结合第三方面或第三方面的第一种至第二十五种可能的实施方式中的任意一种可能的实施方式,在第三方面的第二十六种可能的实施方式中,所述呈现单元还用于,在所述K个导览单元中的导览单元i被选择的情况下,获取所述导览单元i所指向的主媒体呈现。
本发明第四方面提供一种媒体呈现导览装置,包括:
确定单元,用于确定导览媒体呈现包括的N个导览单元;
生成单元,用于生成导览媒体呈现的媒体呈现描述,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;所述N个导览单元中的每个导览单元指向一个主媒体呈现,所述N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
结合第四方面,在第四方面的第一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
结合第四方面的第一种可能的实施方式,在第四方面的第二种可能的实施方式中,所述N个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
结合第四方面,在第四方面的第三种可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
结合第四方面的第三种可能的实施方式,在第四方面的第四种可能的实施方式中,所述N个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
结合第四方面或第四方面的第一种至第四种可能的实施方式中的任意一种可能的实施方式,在第四方面的第五种可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
结合第四方面的第五种可能的实施方式,在第四方面的第六种可能的实施方式中,所述N个导览单元中的不同导览单元所包括的视频分量为N个视频适配集中的不同视频适配集中的媒体表达,其中,所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述N个视频适配集中的不同视频适配集之间具有选择相容性。
结合第四方面的第六种可能的实施方式,在第四方面的第七种可能的实施方式中,所述N个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述N个适配集中的任意一个适配集,所述音频分量适配集与所述N个视频适配集之间具有选择相容性;
或者,
所述N个导览单元中的不同导览单元所包括的音频分量为N个音频适配集中的不同音频适配集中的媒体表达,其中,所述N个音频适配集中的不同音频适配集之间具有选择互斥性。
结合第四方面的第七种可能的实施方式,在第四方面的第八种可能的实施方式中,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
结合第四方面的第八种可能的实施方式,在第四方面的第九种可能的实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关 系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
结合第四方面的第八种可能的实施方式或第四方面的第九种可能的实施方式,在第四方面的第十种可能的实施方式中,所述区域说明为SRD空间关系描述。
结合第四方面的第七种至第十种可能的实施方式中的任意一种可能的实施方式,在第四方面的第十一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括N个视频适配集元素,所述N个视频适配集元素与所述N个视频适配集之间一一对应;其中,所述N个视频适配集元素中包括描述子元素Ci,所述N个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别schemeIdUri属性均相同。
结合第四方面的第十一种可能的实施方式,在第四方面的第十二种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
结合第四方面的第十一种可能的实施方式,在第四方面的第十三种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
结合第四方面的第十二种可能的实施方式或第四方面的第十三种可能的实施方式,在第四方面的第十四种可能的实施方式中,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
结合第四方面的第十四种可能的实施方式,在第四方面的第十五种可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、方法识别schemeIdUri属性相同,且参数value属性相同。
结合第四方面的第四种至第十五种可能的实施方式中的任意一种可能的实施方式,在第四方面的第十六种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述N个视频适配集元素,所述N个视频适配集元素与N 个视频适配集之间一一对应,
其中,所述N个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述N个视频适配集中任意一个视频适配集。
结合第四方面的第十六种可能的实施方式,在第四方面的第十七种可能的实施方式中,
所述指针由所述视频适配集元素VI的属性承载。
结合第四方面的第十七种可能的实施方式,在第四方面的第十八种可能的实施方式中,所述指针由所述视频适配集元素VI的xlink:href属性承载。
结合第四方面的第十六种可能的实施方式,在第四方面的第十九种可能的实施方式中,
所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
结合第四方面的第十六种可能的实施方式,在第四方面的第二十种可能的实施方式中,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
结合第四方面的第二十种可能的实施方式,在第四方面的第二十一种可能的实施方式中,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
结合第四方面的第十六种可能的实施方式,在第四方面的第二十二种可能的实施方式中,
所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模 版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
结合第四方面的第十六种可能的实施方式,在第四方面的第二十三种可能的实施方式中,所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
结合第四方面或第四方面的第一种至第二十三种可能的实施方式中的任意一种可能的实施方式,在第四方面的第二十四种可能的实施方式中,
所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述N个导览单元所指向的主媒体呈现的时间结构。
本发明第五方面提供一种客户端,包括:
处理器和存储器;
其中,所述处理器通过调用所述存储器中的代码或指令以用于,获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;根据所述导览媒体呈现的媒体呈现描述获取所述N个导览单元中的K个导览单元;呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
结合第五方面,在第五方面的第一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
结合第五方面的第一种可能的实施方式,在第五方面的第二种可能的实施方式中,所述K个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
结合第五方面,在第五方面的第三种可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
结合第五方面的第三种可能的实施方式,在第五方面的第四种可能的实施方式中,所述K个导览单元中的每个导览单元以引用所述聚合媒体呈现描述 中的呈现元素的方式来指向一个主媒体呈现。
结合第五方面或第五方面的第一种至第四种可能的实施方式中的任意一种可能的实施方式,在第五方面的第五种可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
结合第五方面的第五种可能的实施方式,在第五方面的第六种可能的实施方式中,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。
结合第五方面的第六种可能的实施方式,在第五方面的第七种可能的实施方式中,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性;
或者,
所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
结合第五方面的第七种可能的实施方式,在第五方面的第八种可能的实施方式中,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
结合第五方面的第八种可能的实施方式,在第五方面的第九种可能的实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
结合第五方面的第八种可能的实施方式或第五方面的第九种可能的实施方式,在第五方面的第十种可能的实施方式中,所述区域说明为SRD空间关系描述。
结合第五方面的第七种至第十种可能的实施方式中的任意一种可能的实 施方式,在第五方面的第十一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应;其中,所述K个视频适配集元素中包括描述子元素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别schemeIdUri属性均相同。
结合第五方面的第十一种可能的实施方式,在第五方面的第十二种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
结合第五方面的第十一种可能的实施方式,在第五方面的第十三种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
结合第五方面的第十二种可能的实施方式或第五方面的第十三种可能的实施方式,在第五方面的第十四种可能的实施方式中,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
结合第五方面的第十四种可能的实施方式,在第五方面的第十五种可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、方法识别schemeIdUri属性相同,且参数value属性相同。
结合第五方面的第四种至第十五种可能的实施方式中的任意一种可能的实施方式,在第五方面的第十六种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视频适配集之间一一对应,
其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述K个视频适配集中任意一个视频适配集。
结合第五方面的第十六种可能的实施方式,在第五方面的第十七种可能的 实施方式中,
所述指针由所述视频适配集元素VI的属性承载。
结合第五方面的第十七种可能的实施方式,在第五方面的第十八种可能的实施方式中,所述指针由所述视频适配集元素VI的xlink:href属性承载。
结合第五方面的第十六种可能的实施方式,在第五方面的第十九种可能的实施方式中,
所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
结合第五方面的第十六种可能的实施方式,在第五方面的第二十种可能的实施方式中,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
结合第五方面的第二十种可能的实施方式,在第五方面的第二十一种可能的实施方式中,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
结合第五方面的第十六种可能的实施方式,在第五方面的第二十二种可能的实施方式中,
所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
结合第五方面的第十六种可能的实施方式,在第五方面的第二十三种可能的实施方式中,所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
结合第五方面或第五方面的第一种至第二十三种可能的实施方式中的任 意一种可能的实施方式,在第五方面的第二十四种可能的实施方式中,
所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现的时间结构。
结合第五方面或第五方面的第一种至第二十四种可能的实施方式中的任意一种可能的实施方式,在第五方面的第二十五种可能的实施方式中,所述方法处理器还用于,在关注焦点停留在所述K个导览单元中的导览单元i的情况下,所述客户端呈现所述导览单元i的音频分量。
结合第五方面或第五方面的第一种至第二十五种可能的实施方式中的任意一种可能的实施方式,在第五方面的第二十六种可能的实施方式中,所述方法处理器还用于,在所述K个导览单元中的导览单元i被选择的情况下,所述客户端获取所述导览单元i所指向的主媒体呈现。
本发明实施例第六方面提供一种媒体呈现导览装置,包括:
处理器和存储器;
其中,所述处理器通过调用所述存储器中的代码或指令以用于确定导览媒体呈现包括的N个导览单元;生成导览媒体呈现的媒体呈现描述,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;所述N个导览单元中的每个导览单元指向一个主媒体呈现,所述N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
结合第六方面,在第六方面的第一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
结合第六方面的第一种可能的实施方式,在第六方面的第二种可能的实施方式中,所述N个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
结合第六方面,在第六方面的第三种可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
结合第六方面的第三种可能的实施方式,在第六方面的第四种可能的实施方式中,所述N个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
结合第六方面或第六方面的第一种至第四种可能的实施方式中的任意一种可能的实施方式,在第六方面的第五种可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
结合第六方面的第五种可能的实施方式,在第六方面的第六种可能的实施方式中,所述N个导览单元中的不同导览单元所包括的视频分量为N个视频适配集中的不同视频适配集中的媒体表达,其中,所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述N个视频适配集中的不同视频适配集之间具有选择相容性。
结合第六方面的第六种可能的实施方式,在第六方面的第七种可能的实施方式中,所述N个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述N个适配集中的任意一个适配集,所述音频分量适配集与所述N个视频适配集之间具有选择相容性;
或者,
所述N个导览单元中的不同导览单元所包括的音频分量为N个音频适配集中的不同音频适配集中的媒体表达,其中,所述N个音频适配集中的不同音频适配集之间具有选择互斥性。
结合第六方面的第七种可能的实施方式,在第六方面的第八种可能的实施方式中,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
结合第六方面的第八种可能的实施方式,在第六方面的第九种可能的实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
结合第六方面的第八种可能的实施方式或第六方面的第九种可能的实施方式,在第六方面的第十种可能的实施方式中,所述区域说明为SRD空间关 系描述。
结合第六方面的第七种至第十种可能的实施方式中的任意一种可能的实施方式,在第六方面的第十一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括N个视频适配集元素,所述N个视频适配集元素与所述N个视频适配集之间一一对应;其中,所述N个视频适配集元素中包括描述子元素Ci,所述N个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别schemeIdUri属性均相同。
结合第六方面的第十一种可能的实施方式,在第六方面的第十二种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
结合第六方面的第十一种可能的实施方式,在第六方面的第十三种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
结合第六方面的第十二种可能的实施方式或第六方面的第十三种可能的实施方式,在第六方面的第十四种可能的实施方式中,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
结合第六方面的第十四种可能的实施方式,在第六方面的第十五种可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、方法识别schemeIdUri属性相同,且参数value属性相同。
结合第六方面的第四种至第十五种可能的实施方式中的任意一种可能的实施方式,在第六方面的第十六种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述N个视频适配集元素,所述N个视频适配集元素与N个视频适配集之间一一对应,
其中,所述N个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述N个视 频适配集中任意一个视频适配集。
结合第六方面的第十六种可能的实施方式,在第六方面的第十七种可能的实施方式中,
所述指针由所述视频适配集元素VI的属性承载。
结合第六方面的第十七种可能的实施方式,在第六方面的第十八种可能的实施方式中,所述指针由所述视频适配集元素VI的xlink:href属性承载。
结合第六方面的第十六种可能的实施方式,在第六方面的第十九种可能的实施方式中,
所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
结合第六方面的第十六种可能的实施方式,在第六方面的第二十种可能的实施方式中,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
结合第六方面的第二十种可能的实施方式,在第六方面的第二十一种可能的实施方式中,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
结合第六方面的第十六种可能的实施方式,在第六方面的第二十二种可能的实施方式中,
所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
结合第六方面的第十六种可能的实施方式,在第六方面的第二十三种可能的实施方式中,所述指针由所述视频适配集元素VI中的媒体呈现指向 ReferencedMediaPresentation元素来承载。
结合第六方面或第六方面的第一种至第二十三种可能的实施方式中的任意一种可能的实施方式,在第六方面的第二十四种可能的实施方式中,
所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述N个导览单元所指向的主媒体呈现的时间结构。
本发明第七方面提供一种通信系统,包括:
客户端和与所述客户端通信连接的内容服务器;
其中,所述客户端,用于从内容服务器获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;根据所述导览媒体呈现的媒体呈现描述从内容服务器获取所述N个导览单元中的K个导览单元;呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
结合第七方面,在第七方面的第一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
结合第七方面的第一种可能的实施方式,在第七方面的第二种可能的实施方式中,所述K个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
结合第七方面,在第七方面的第三种可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
结合第七方面的第三种可能的实施方式,在第七方面的第四种可能的实施方式中,所述K个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
结合第七方面或第七方面的第一种至第四种可能的实施方式中的任意一种可能的实施方式,在第七方面的第五种可能的实施方式中,所述N个导览 单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
结合第七方面的第五种可能的实施方式,在第七方面的第六种可能的实施方式中,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。
结合第七方面的第六种可能的实施方式,在第七方面的第七种可能的实施方式中,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性;
或者,
所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
结合第七方面的第七种可能的实施方式,在第七方面的第八种可能的实施方式中,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
结合第七方面的第八种可能的实施方式,在第七方面的第九种可能的实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
结合第七方面的第八种可能的实施方式或第七方面的第九种可能的实施方式,在第七方面的第十种可能的实施方式中,所述区域说明为SRD空间关系描述。
结合第七方面的第七种至第十种可能的实施方式中的任意一种可能的实施方式,在第七方面的第十一种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应;其中,所述K个视频适配集元素中包括描述子元 素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别schemeIdUri属性均相同。
结合第七方面的第十一种可能的实施方式,在第七方面的第十二种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
结合第七方面的第十一种可能的实施方式,在第七方面的第十三种可能的实施方式中,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
结合第七方面的第十二种可能的实施方式或第七方面的第十三种可能的实施方式,在第七方面的第十四种可能的实施方式中,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
结合第七方面的第十四种可能的实施方式,在第七方面的第十五种可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、方法识别schemeIdUri属性相同,且参数value属性相同。
结合第七方面的第四种至第十五种可能的实施方式中的任意一种可能的实施方式,在第七方面的第十六种可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视频适配集之间一一对应,
其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述K个视频适配集中任意一个视频适配集。
结合第七方面的第十六种可能的实施方式,在第七方面的第十七种可能的实施方式中,
所述指针由所述视频适配集元素VI的属性承载。
结合第七方面的第十七种可能的实施方式,在第七方面的第十八种可能的 实施方式中,所述指针由所述视频适配集元素VI的xlink:href属性承载。
结合第七方面的第十六种可能的实施方式,在第七方面的第十九种可能的实施方式中,
所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
结合第七方面的第十六种可能的实施方式,在第七方面的第二十种可能的实施方式中,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
结合第七方面的第二十种可能的实施方式,在第七方面的第二十一种可能的实施方式中,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
结合第七方面的第十六种可能的实施方式,在第七方面的第二十二种可能的实施方式中,
所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
结合第七方面的第十六种可能的实施方式,在第七方面的第二十三种可能的实施方式中,所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
结合第七方面或第七方面的第一种至第二十三种可能的实施方式中的任意一种可能的实施方式,在第七方面的第二十四种可能的实施方式中,
所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现的时间结构。
可以看出,本实施例的技术方案中,由于K个导览单元中的每个导览单元可以分别指向一个主媒体呈现,这样就相当于在导览单元和主媒体呈现之间引入的一定的关联关系,这使得在所述K个导览单元的导览单元i被选择的情况下,所述客户端可获取与导览单元i指向的主媒体呈现j的媒体呈现描述,进而可以根据所述主媒体呈现j的媒体呈现描述获取所述主媒体呈现j进行呈现,可见这有利于实现导览媒体呈现和主媒体呈现之间的较灵活切换,进而实现在基于HTTP的媒体流服务场景下支持视频导览,进而有利于提升用户的高品质体验。
附图说明
为了更清楚地说明本发明实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。
图1-a为本发明实施例提供的一种媒体呈现描述的架构示意图;
图1-b为本发明实施例提供的一种基于HTTP媒体流的媒体呈现导览方法的流程示意图;
图1-c为本发明实施例提供的一种单个媒体呈现的时间结构的示意图;
图1-d为本发明实施例提供的一种多个媒体呈现的时间结构的示意图;
图1-e和1-f为本发明实施例提供的一种编码得到导览单元的媒体表达的示意图;
图1-g为本发明实施例提供的另一种多个媒体呈现的时间结构的示意图;
图1-h为本发明实施例提供的另一种多个媒体呈现的时间结构的示意图;
图1-i为本发明实施例提供的一种合成得到导览媒体呈现的示意图;
图1-j为本发明实施例提供的一种客户端解码输出导览单元的视频分量的示意图;
图1-k为本发明实施例提供的一种客户端解码输出导览单元的音频分量的示意图;
图2为本发明实施例提供的另一种基于HTTP媒体流的媒体呈现导览方法的流程示意图;
图3-a为本发明实施例提供的另一种基于HTTP媒体流的媒体呈现导览方法的流程示意图;
图3-b为本发明实施例提供的一种网络架构的示意图;
图4为本发明实施例提供的一种客户端的示意图;
图5为本发明实施例提供的另一种客户端的示意图;
图6为本发明实施例提供的一种服务器的示意图;
图7为本发明实施例提供的另一种服务器的示意图;
图8为本发明实施例提供的一种通信系统的示意图。
具体实施方式
本发明实施例提供了基于超文本传输协议媒体流的媒体呈现导览方法和相关装置,以期能在基于HTTP的媒体流服务场景下支持视频导览,进而提高用户体验。
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。
本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”、“第三”和“第四”等是用于区别不同对象,而不是用于描述特定顺序。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包括。例如包括了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。
为便于更好的理解本发明实施例的技术方案,下面先进行一些可能相关技术的介绍。
在传统的模拟电视服务中,用户可以通过在不同频道之间切换来寻找其感兴趣频道,然后停留在感兴趣频道上。在数字电视服务中,可以提供电子节目导航(EPG,Electronic Program Guide),EPG实际上是一个列表,EPG包含不同频道的节目和时间等诸如此类的信息,用户可通过EPG寻找感兴趣的电 视频道,然后从EPG频道切换到该频道。实践发现,以图形化方式提供的导览业务对用户更友好,易于用户使用。
在导览业务以导览单元代表一个电视频道。导览单元和它所代表的电视频道一样,可以有不同的媒体分量,如:视频,音频等。图形化的导览业务以多个小画幅的图像(动态的图像序列或者静态的图片)的形式呈现一组导览单元的视频。用户可在多个小画幅的图像中进行浏览,改变关注的导览单元,用户甚至可以听到当前关注的导览单元的音频。用户选择某个导览单元就可以切换到该导览单元对应的频道。
随着技术的发展,特别是宽带通信和微处理器,个人设备的通信能力和功能越来越强大,通过互联网的在线流服务传送多媒体的应用越来越广泛。基于HTTP的自适应流服务成为多媒体流服务的主流技术,代表性了这一领域的最新发展。苹果(Apple)公司的HTTP流服务(HLS,HTTP Live Streaming)、微软(Microsoft)公司的平滑流服务(SS,Smooth Streaming),动态图像专家组(MPEG,Moving Picture Experts Group)的基于HTTP的动态自适应媒体流(DASH,Dynamic Adaptative Streaming Over HTTP)都是这一技术的不同形式。MPEG的DASH标准是由MPEG制订的标准化技术,有望得到广泛的采用,从而改变割裂的市场格局。
遗憾的是,现在的基于HTTP的媒体流服务不能支持导览业务。现有基于HTTP的媒体流服务只适用于一个媒体呈现(媒体呈现是DASH标准中使用的术语,概念上大致相当于一个电视频道),而导览业务服务于多个媒体呈现,是一个跨多个媒体呈现的业务。本发明旨在解决基于HTTP的媒体流服务对导览业务的支持。虽然本发明引用DASH标准中的术语作为叙述和实施例的基础,但本发明的方法并不限于DASH标准,而可适用于多种基于HTTP的媒体流服务。
可选的,本发明的一些实施例的技术方案例如可以是根据如下的一些DASH规范及其增补修订:
ISO/IEC 23009-1:Part 1:Media presentation description and segment formats,2nd Edition,2014。
ISO/IEC 23009-1:2014/FDAM 1。
Part 1:Media presentation description and segment formats。
AMENDMENT 1:High Profile and Availability Time Synchronization Extended profiles and time synchronization,ISO/IEC 23009-1:2014/FDAM 1Part 1:Media presentation description and segment formats。
ISO/IEC 23009-1:2014/DAM 2。
Part 1:Media presentation description and segment formats。
AMENDMENT 2:Spatial Relationship Description,Generalized URL parameters and other extensions。
在DASH标准中,一项媒体内容编码为多个版本,各个版本有不同的特性,如码率,这些版本在DASH中称为媒体表达(Representation),它们代表相同的媒体内容,从内容呈现(观看/播放)的角度彼此具有替代性。一个媒体表达在时间上分割为可访问的单位——通常长度为若干秒,称为媒体片段或者媒体子片段(一个媒体片段可以在逻辑上划分为媒体子片段)。另外还有一个初始化片段,它只包含有元数据而没有媒体数据。下文中,媒体片段,初始化片段都称为片段(Segment)。媒体表达存储在内容服务器——HTTP服务器上供客户端获取,而片段是客户端能够通过统一资源定位符(URL,Uniform Resource Locator)访问的最小单位。媒体呈现描述(MPD,Media Presentation Description)是一个扩展标记语言(XML,extensible Markup Language)文件,它包含了客户端所需要的元数据,描述了媒体表达的特性以及如何从服务器上获取媒体表达,包括:媒体表达的码率,分辨率,视频图像的长宽比,媒体表达包含的片段的URL等。基于MPD中的信息,客户端可构造HTTP URL以从内容服务器请求媒体表达中的媒体片段,在媒体片段边界可以切换到其他的媒体表达以适应可用带宽的变化。
基于HTTP的自适应媒体流服务允许一个媒体呈现中内容特性的变化,例如媒体编码方式的改变。在DASH标准中,这是通过所谓“内容段落(Period)”这一概念来实现的,Period用于内容的拼接,比如前一个内容段落是新闻节目,下一个内容段落是广告。一个媒体呈现包括一个或者多个内容段落(Period), 这些内容段落在时间上是顺序的,一个内容段落的开始意味着相比前一个内容段落有某些变化,例如内容的变化,例如可从新闻节目到体育节目,从体育节目到电影节目、从电影节目到广告、从广告到综艺节目等等;内容的编码方式的变化,例如可从H.264编码方案转变为H.265编码方案;媒体表达数量的变化,例如可增加或者减少媒体表达;内容分量的变化,例如可增加中文的音频表达等等。当客户端遇到一个新的内容段落的开始,客户端工作条件发生了变化,可能要重新初始化。
在一个内容段落中,包含相同媒体内容和媒体分量的媒体表达的集合称为适配集,一个适配集至少包含一个媒体表达,一个适配集中的媒体表达具有相互替代性。不同的适配集之间可能是相容或者相斥的。
总结以上所述,媒体呈现可包含一个或多个时间上顺序的内容段落,每个内容段落包含一个或者多个适配集(Adaptation Set)。其中每个适配集(Adaptation Set)包含一个或者多个媒体表达(Representation)。其中一个媒体表达包含一个或者多个片段(Segment)。
媒体呈现描述具有和媒体呈现相似的层次化结构,如图1-a所示。以上介绍的媒体呈现的概念在媒体呈现描述中可用一个XML元素表示,媒体呈现元素包括一个或多个内容段落(Period)元素,每个内容段落(Period)元素包含一个或多个适配集(AdaptationSet)元素。每个适配集(AdaptationSet)元素包含一个或多个媒体表达(Representation)元素。
媒体呈现对应于媒体呈现描述中的媒体呈现描述元素,媒体呈现中的一个内容段落对应于媒体呈现描述中的一个内容段落元素,媒体呈现中的一个适配集对应于媒体呈现描述中的一个适配集元素,媒体呈现中的一个媒体表达对应于媒体呈现描述中的一个媒体表达元素,以此类推。
下面介绍基于HTTP媒体流的媒体呈现导览方法。
其中,导览业务服务于多个媒体呈现,为在一组媒体呈现中进行选择提供方便,是一个跨多个媒体呈现的业务。导览业务所服务的多个媒体呈现称为该导览业务的成员媒体呈现,简称成员媒体呈现或者主媒体呈现。
在本发明实施例的技术方案中,导览业务可实现为一个媒体呈现(即导览 媒体呈现),导览媒体呈现独立于它的成员媒体呈现。导览业务和它的成员媒体呈现分别由各自的媒体呈现描述来说明。其中,如果导览业务服务于N个媒体呈现,那么有N+1个媒体呈现和相应的N+1个媒体呈现描述,在导览业务中,每个成员媒体呈现对应于导览媒体呈现的一个导览单元,代表该成员媒体呈现。导览业务和它的成员媒体呈现分别由各自的媒体呈现描述说明。一个导览单元代表一个媒体呈现,它可能包括多个媒体分量,典型的例如:视频分量(也可称视频媒体表达),音频分量(也可称音频媒体表达)。一个导览单元的视频是一个小画幅的图像,代表一个媒体呈现。导览单元的视频通常是从它所代表的媒体呈现的视频分量裁剪而来的,即画面的一部分,导览单元呈现质量(例如分辨率和/或帧率等)低于主媒体呈现,导览单元的音频来自主媒体呈现的音频。在本发明中,一个导览单元的视频实现为一个或多个媒体表达(在一些示例中以一个为例)。
参见图1-b,图1-b为本发明的一个实施例提供的一种基于HTTP媒体流的媒体呈现导览方法的流程示意图。如图1-b所示,本发明的一个实施例提供的一种基于HTTP媒体流的媒体呈现导览方法可以包括:
101、客户端(Client)获取导览媒体呈现的媒体呈现描述。
其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元。
其中,客户端(Client)可从内容服务器或其它设备获取导览媒体呈现的媒体呈现描述。
其中,所述N为大于1的整数。
其中,所述N例如可等于7、2、3、4、5、8、11、15、20、25、30或者其他值。
其中,所述客户端可为DASH客户端或具有DASH客户端逻辑功能的其他客户端或基于HTTP的媒体流服务的其他客户端。
其中,所述客户端例如可以为个人电脑,手机,平板电脑,电视机或机顶盒等。
其中,导览媒体呈现可看成是一种特殊的媒体呈现。
102、所述客户端根据所述导览媒体呈现的媒体呈现描述获取所述N个导览单元中的K个导览单元。
其中,所述K为小于或等于所述N的正整数。
其中,所述K例如可等于1、2、3、4、5、8、11、15、20、25、30或者其他值。
其中,K个导览单元可与K个逻辑呈现单元(逻辑呈现单元例如可为导览窗口)一一对应,即K个导览单元中的每个导览单元可由不同的逻辑呈现单元来呈现。
103、所述客户端呈现所述K个导览单元。
所述K个导览单元中的每个导览单元指向一个主媒体呈现。K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
其中,所述K个导览单元中的每个导览单元可指向一个主媒体呈现。
其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。也就是说,导览单元的媒体表达的呈现质量低于导览单元所表示的主媒体呈现的呈现质量。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述可以不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。即,所述导览媒体呈现可具有独立的媒体呈现描述,K个导览单元中的每个导览单元所指向主媒体呈现亦可具有独立的且不同于所述导览媒体呈现的媒体呈现描述的媒体呈现描述。例如K个导览单元指向了K个主媒体呈现,而K个主媒体呈现分别具有对应的媒体呈现描述,即K个媒体呈现描述,而导览媒体呈现的媒体呈现描述不同于这K个媒体呈现描述中的任意一个,即导览媒体呈现可由第K+1个媒体呈现描述。
此外,在本发明另一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述(也可称超级媒体呈现描述)。即可利用一个聚合媒体呈现描述(可称超级媒体呈现描述)来描述导览媒体呈现和 导览媒体呈现所指向的主媒体呈现。超级媒体呈现描述的引入有利于增强导览媒体呈现和所导览的主媒体呈现之间的关联关系。
在实际应用中,导览单元指向主媒体呈现的方式可以很灵活,导览单元可以直接指向主媒体呈现,也可以间接的指向主媒体呈现。
举例来说,所述K个导览单元中的每个导览单元可以以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。当然,导览单元亦可通过其他的直接指向或间接指向的方式来指向主媒体呈现。例如,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述。这种情况下,所述K个导览单元中的每个导览单元可以以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
可选的,在本发明的一些可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量,进一步的,导览单元还可包括字幕分量或其他类型的媒体分量。
本发明通过媒体呈现描述(如DASH标准中的MPD),提供了导览业务的信令机制。媒体呈现描述可告知客户端一个导览业务由哪些导览单元组成,导览单元的分量,导览单元和导览业务的成员媒体呈现之间的关系,导览单元视频分量之间的关系,导览单元的音频分量之间的关系,导览单元的音频分量和视频分量之间的关系等。
可选的,在本发明一些可能实施方式中,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。例如,所述K个导览单元中导览单元i所包括的视频分量可归属于K个视频适配集中的视频适配集Ci,所述K个导览单元中导览单元j所包括的视频分量可归属于K个视频适配集中的视频适配集Cj,其中,视频适配集Cj和视频适配集Ci为所述K个视频适配集中的两个不同的视频适配集。导览单 元j和导览单元i可为K个导览单元中的任意两个导览单元。
其中,所谓选择相容性,表示这些对象可同时被选择,例如若K个视频适配集中的不同视频适配集之间具有选择相容性,则表示可同时选择K个视频适配集中的多个视频适配集中的媒体表达。
所谓选择互斥性,表示这些对象不支持同时被选择,例如若所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,表示不支持同时选择1个视频适配集中的多个媒体表达,例如假设K个视频适配集中的视频适配集I包括10个多个媒体表达,若视频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择10个媒体表达中的其中1个,而不能同时选择该10个媒体表达中的多个。
可选的,在本发明的一些可能实施方式中,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个视频适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性。例如,假设音频适配集包括20个多个媒体表达,若音频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择20个多个媒体表达中的其中1个,而不能同时选择该30个媒体表达中的多个。
可选的,在本发明的另一些可能的实施方式中,所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
可选的,在本发明的一些可能的实施方式中,所述音频适配集元素中的媒体表达元素,可以包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
可选的,在本发明一些可能实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达(representation)之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。例如媒体表达元素i所描述的媒体表达为媒体表达ri,媒体表达元素j所描述的媒体表达为媒体表达rj,若媒体表达元素i与媒体表达元素j包含相同区域说明,那么可以表示 媒体表达ri与媒体表达rj之间具有关联关系。
可选的,在本发明一些可能实施方式中,媒体表达元素i和适配集元素ci包含相同区域说明,那么也可能说明媒体表达元素i所描述的媒体表达与适配集元素ci所描述的适配集中的各媒体表达之间具有关联关系,例如媒体表达元素i可为音频媒体表达,而适配集元素ci所描述的适配集中的媒体表达可为视频媒体表达。
可选的,在本发明的一些可能的实施方式中,所述区域说明可为空间关系描述(SRD)。当然,所述区域说明亦可为其他类型的可用于描述位置区域的说明信息。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应。
其中,所述K个视频适配集元素中包括描述子元素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件例如可为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别(schemeIdUri)属性均相同。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。或者,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色,角色例如可能是主要、补充、字幕或翻译配音等。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci例如可为基本属性(EssentialProptery)元素或者补充属性(SupplementalProptery)元素或作用说明(Role)元素或者其他元素。
可选的,在本发明的一些可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件可为视频适配集元素所包括的描述子元素Ci的元素名称可相同、方法识别schemeIdUri属性可相同,且参数(value)属性可相同。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视频适配集之间一一对应。其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I可为所述K个视频适配集中任意一个视频适配集。
其中,可根据场景需要来确定所述指针在视频适配集元素VI中承载位置。
例如,所述指针可由所述视频适配集元素VI的属性承载。
具体例如,所述指针可由所述视频适配集元素VI的xlink:href属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素或SupplementalProperty元素承载。
具体例如,所述指针可由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者,所述指针可由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
具体例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素的value属性或其它属性承载,或所述指针可由所述视频适配集元素VI之中的SupplementalProperty元素的value属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的虚拟Representation元素的属性承载,或所述指针可由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符(BaseURL)元素。
又举例来说,所述指针也可由所述视频适配集元素VI中的媒体呈现指向(ReferencedMediaPresentation)元素来承载。ReferencedMediaPresentation元素是新扩展的一种元素,也就是说,可以利用所述视频适配集元素VI中的新扩展出的元素来承载所述指针,所述视频适配集元素VI中新扩展出的承载所 述指针的元素的名称并不限于ReferencedMediaPresentation,也可以为其它的元素名称。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的时间结构可不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现的时间结构。其中,导览单元的音频可以是通过对主媒体呈现的音频进行编码而得到,导览单元的视频可以是通过对主媒体呈现的视频进行编码而得到,这可使得导览单元的时间结构和主媒体呈现的时间结构之间没有相关性。
下面结合附图来举例媒体呈现的时间结构。
下面结合附图1-c和图1-d来举例媒体呈现的时间结构。
附图1-c举例示出了一个媒体呈现的时间结构,媒体呈现包括连续的若干个Period。
附图1-d举例示出了多个媒体呈现的时间结构,每个媒体呈现包括连续的若干个Period。但是多个媒体呈现之间的时间结构不同,例如Period的边界未对齐等。其中,媒体呈现在时间上是顺序的,媒体呈现描述也描述顺序的时间结构,而描述多个并发的媒体呈现的非顺序的时间结构超过了传统媒体呈现描述的能力。
本发明实施例中可通过对每个导览单元指向的主媒体呈现的媒体表达(音频和视频等)重新进行编码处理来得到导览单元的媒体表达,也就是说,每个导览单元指向的主媒体呈现的媒体表达和导览单元的媒体表达是独立的。并且,各每个导览单元的媒体表达是独立的,同一导览单元的音频分量和视频分量也是独立的。因此可不受到导览单元的媒体呈现的媒体表达不受到对应主媒体呈现的媒体表达的Period安排的影响。图1-e和图1-f示出了内容服务器对导览单元指向的主媒体呈现的视频媒体表达和音频媒体表达进行编码的方式示例。
图1-g示出了导览媒体呈现的N个导览单元的媒体呈现的Period安排的一种示例。其中,导览媒体呈现的N个导览单元的媒体呈现的Period安排是对齐的。图1-h示出了当新增一个导览单元时,新增的导览单元的与其他导览单元的媒体呈现的Period安排是对齐的。
图1-i示出了内容服务器利用各导览单元指向的主媒体呈现的媒体呈现描 述来得到导览媒体呈现的媒体呈现描述的方式示例。当然,内容服务器亦可通过其他方式来获得导览媒体呈现的媒体呈现描述。
图1-j和图1-k示出了客户端选择K个导览单元进行呈现的示例。N个导览单元中的K个导览单元的视频媒体表达将被解码呈现,而K个导览单元的音频媒体表达中的高亮导览单元的音频媒体表达将被解码呈现。当然,客户端可基于导览媒体呈现的媒体呈现描述和用户指令来选择K个导览单元进行呈现的具体方式。
可选的,在本发明的一些可能的实施方式中,所述方法还包括:在关注焦点停留在所述K个导览单元中的导览单元i的情况下,所述客户端呈现所述导览单元i的音频分量。
可选的,在本发明的一些可能的实施方式中,所述方法还包括:在所述K个导览单元中的导览单元i被选择的情况下,所述客户端获取所述导览单元i所指向的主媒体呈现。进一步的,所述客户端还可呈现所述导览单元i所指向的主媒体呈现。
可以看出,本实施例的技术方案中,由于K个导览单元中的每个导览单元可以分别指向一个主媒体呈现,这样就相当于在导览单元和主媒体呈现之间引入的一定的关联关系,这使得在所述K个导览单元的导览单元i被选择的情况下,所述客户端可获取与导览单元i指向的主媒体呈现j的媒体呈现描述,进而可以根据所述主媒体呈现j的媒体呈现描述获取所述主媒体呈现j进行呈现,可见这有利于实现导览媒体呈现和主媒体呈现之间的较灵活切换,进而实现在基于HTTP的媒体流服务场景下支持视频导览,进而有利于提升用户的高品质体验。
本发明实施例的技术方案有利于使得导览业务更具有灵活性,本发明可以实现个性化的导览业务,例如可以在客户端配置导览业务,如:一个导览页面/窗口中显示的导览单元的数目,导览单元的组合,导览单元的呈现位置和顺序等等均可在客户端配置,这有利于极大地方便了导览业务在多样化的不同的设备上的使用。如:移动电话终端,平板电脑,他们的的能力各异——显示器件尺寸,分辨率,计算能力等。
另一方面,是提高了通信带宽使用的有效性。在以往的电视服务中,所有的媒体流,包括导览单元流和主媒体流以广播方式一起传送到终端(电视机或机顶盒),传送所有的媒体流对于媒体流服务是不可能的,因为一个客户端能够使用的带宽是有限的,比广播系统中少得多。另外,用户往往只会用到一部分导览单元,或者,因为用户的兴趣,比如用户只对体育类节目感兴趣,或者终端的通信能力,或者用户找到了要看的节目频道,不在继续使用导览,很多导览单元是不需要传送的。本发明中,导览单元可只在客户端需要时发生传送,这样也有利于避免不必要的带宽占用。
参见图2,图2为本发明的另一个实施例提供的另一种基于HTTP媒体流的媒体呈现导览方法的流程示意图。如图2所示,本发明的另一个实施例提供的一种基于HTTP媒体流的媒体呈现导览方法可以包括:
201、确定导览媒体呈现包括的N个导览单元。
202、生成导览媒体呈现的媒体呈现描述,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;所述N个导览单元中的每个导览单元指向一个主媒体呈现,其中,所述N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
其中,本发明实施例的执行主体可以是内容服务器或其他设备。内容服务器可存储导览媒体呈现的媒体呈现描述,并可将其提供给客户端。
其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元。
其中,客户端(Client)可从内容服务器或其它设备获取导览媒体呈现的媒体呈现描述。
其中,所述N为大于1的整数。
其中,所述N例如可等于7、2、3、4、5、8、11、15、20、25、30或者其他值。
其中,所述客户端可为DASH客户端或具有DASH客户端逻辑功能的其他客户端或基于HTTP的媒体流服务的其他客户端。
其中,所述客户端例如可以为个人电脑,手机,平板电脑,电视机或机顶盒等。
其中,导览媒体呈现可看成是一种特殊的媒体呈现。
可以看出,本实施例的技术方案中,导览媒体呈现的媒体呈现描述所描述的导览媒体呈现包括的N个导览单元,由于N个导览单元中的每个导览单元可以分别指向一个主媒体呈现,这样就相当于在导览单元和主媒体呈现之间引入的一定的关联关系,这使得客户端在所述N个导览单元的导览单元i被选择的情况下,所述客户端可获取与导览单元i指向的主媒体呈现j的媒体呈现描述,进而可以根据所述主媒体呈现j的媒体呈现描述获取所述主媒体呈现j进行呈现,可见这中方案为实现导览媒体呈现和主媒体呈现之间的较灵活切换奠定了基础,进而为实现在基于HTTP的媒体流服务场景下支持视频导览奠定了基础。
其中,N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。也就是说,导览单元的媒体表达的呈现质量低于导览单元所表示的主媒体呈现的呈现质量。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述可以不同于所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。即,所述导览媒体呈现可具有独立的媒体呈现描述,N个导览单元中的每个导览单元所指向主媒体呈现亦可具有独立的且不同于所述导览媒体呈现的媒体呈现描述的媒体呈现描述。例如N个导览单元指向了N个主媒体呈现,而N个主媒体呈现分别具有对应的媒体呈现描述,即N个媒体呈现描述,而导览媒体呈现的媒体呈现描述不同于这N个媒体呈现描述中的任意一个,即导览媒体呈现可由第K+1个媒体呈现描述。
此外,在本发明另一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述(也可称超级媒体呈现描述)。即可利用一个聚合媒体呈现描述(可称超级媒体呈现描述)来描述导览媒体呈现和导览媒体呈现所指向的主媒体呈现。超级媒体呈现描述的引入有利于增强导览 媒体呈现和所导览的主媒体呈现之间的关联关系。
在实际应用中,导览单元指向主媒体呈现的方式可以很灵活,导览单元可以直接指向主媒体呈现,也可以间接的指向主媒体呈现。
举例来说,所述N个导览单元中的每个导览单元可以以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。当然,导览单元亦可通过其他的直接指向或间接指向的方式来指向主媒体呈现。例如,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述。这种情况下,所述N个导览单元中的每个导览单元可以以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
可选的,在本发明的一些可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量,进一步的,导览单元还可包括字幕分量或其他类型的媒体分量。
本发明通过媒体呈现描述(如DASH标准中的MPD),提供了导览业务的信令机制。媒体呈现描述可告知客户端一个导览业务由哪些导览单元组成,导览单元的分量,导览单元和导览业务的成员媒体呈现之间的关系,导览单元视频分量之间的关系,导览单元的音频分量之间的关系,导览单元的音频分量和视频分量之间的关系等。
可选的,在本发明一些可能实施方式中,所述N个导览单元中的不同导览单元所包括的视频分量为N个视频适配集中的不同视频适配集中的媒体表达,其中,所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述N个视频适配集中的不同视频适配集之间具有选择相容性。例如,所述N个导览单元中导览单元i所包括的视频分量可归属于N个视频适配集中的视频适配集Ci,所述N个导览单元中导览单元j所包括的视频分量可归属于N个视频适配集中的视频适配集Cj,其中,视频适配集Cj和视频适配集Ci为所述N个视频适配集中的两个不同的视频适配集。导览单元j和导览单元i可为N个导览单元中的任意两个导览单元。
其中,所谓选择相容性,表示这些对象可同时被选择,例如若N个视频适配集中的不同视频适配集之间具有选择相容性,则表示可同时选择N个视频适配集中的多个视频适配集中的媒体表达。
所谓选择互斥性,表示这些对象不支持同时被选择,例如若所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,表示不支持同时选择1个视频适配集中的多个媒体表达,例如假设N个视频适配集中的视频适配集I包括10个多个媒体表达,若视频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择10个媒体表达中的其中1个,而不能同时选择该10个媒体表达中的多个。
可选的,在本发明的一些可能实施方式中,所述N个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述N个视频适配集中的任意一个适配集,所述音频分量适配集与所述N个视频适配集之间具有选择相容性。例如,假设音频适配集包括20个多个媒体表达,若音频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择20个多个媒体表达中的其中1个,而不能同时选择该30个媒体表达中的多个。
可选的,在本发明的另一些可能的实施方式中,所述N个导览单元中的不同导览单元所包括的音频分量为N个音频适配集中的不同音频适配集中的媒体表达,其中,所述N个音频适配集中的不同音频适配集之间具有选择互斥性。
可选的,在本发明的一些可能的实施方式中,所述音频适配集元素中的媒体表达元素,可以包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
可选的,在本发明一些可能实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达(representation)之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。例如媒体表达元素i所描述的媒体表达为媒体表达ri,媒体表达元素j所描述的媒体表达为媒体表达rj,若媒体表达元素i与媒体表达元素j包含相同区域说明,那么可以表示媒体表达ri与媒体表达rj之间具有关联关系。
可选的,在本发明一些可能实施方式中,媒体表达元素i和适配集元素ci包含相同区域说明,那么也可能说明媒体表达元素i所描述的媒体表达与适配集元素ci所描述的适配集中的各媒体表达之间具有关联关系,例如媒体表达元素i可为音频媒体表达,而适配集元素ci所描述的适配集中的媒体表达可为视频媒体表达。
可选的,在本发明的一些可能的实施方式中,所述区域说明可为空间关系描述(SRD)。当然,所述区域说明亦可为其他类型的可用于描述位置区域的说明信息。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括N个视频适配集元素,所述N个视频适配集元素与所述N个视频适配集之间一一对应。
其中,所述N个视频适配集元素中包括描述子元素Ci,所述N个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件例如可为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别(schemeIdUri)属性均相同。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。或者,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色,角色例如可能是主要、补充、字幕或翻译配音等。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci例如可为基本属性(EssentialProptery)元素或者补充属性(SupplementalProptery)元素或作用说明(Role)元素或者其他元素。
可选的,在本发明的一些可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件可为视频适配集元素所包括的描述子元素Ci的元素名称可相同、方法识别schemeIdUri属性可相同,且参数(value)属性可相同。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈 现描述中包括所述N个视频适配集元素,所述N个视频适配集元素与N个视频适配集之间一一对应。其中,所述N个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I可为所述N个视频适配集中任意一个视频适配集。
其中,可根据场景需要来确定所述指针在视频适配集元素VI中承载位置。
例如,所述指针可由所述视频适配集元素VI的属性承载。
具体例如,所述指针可由所述视频适配集元素VI的xlink:href属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素或SupplementalProperty元素承载。
具体例如,所述指针可由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者,所述指针可由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
具体例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素的value属性或其它属性承载,或所述指针可由所述视频适配集元素VI之中的SupplementalProperty元素的value属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的虚拟Representation元素的属性承载,或所述指针可由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符(BaseURL)元素。
又举例来说,所述指针也可由所述视频适配集元素VI中的媒体呈现指向(ReferencedMediaPresentation)元素来承载。ReferencedMediaPresentation元素是新扩展的一种元素,也就是说,可以利用所述视频适配集元素VI中的新扩展出的元素来承载所述指针,所述视频适配集元素VI中新扩展出的承载所述指针的元素的名称并不限于ReferencedMediaPresentation,也可以为其它的 元素名称。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的时间结构可不依赖于所述导览媒体呈现中的所述N个导览单元所指向的主媒体呈现的时间结构。其中,导览单元的音频可以是通过对主媒体呈现的音频进行编码而得到,导览单元的视频可以是通过对主媒体呈现的视频进行编码而得到,这可使得导览单元的时间结构和主媒体呈现的时间结构之间没有相关性。
为便于更好的理解和实施本发明实施例的上述方案,下面结合一些具体的应用场景进行举例说明。
参见图3-a和图3-b,图3-a为本发明的另一实施例提供的一种基于HTTP流媒体的提供导览媒体呈现的方法的流程示意图。图3-a所示基于HTTP流媒体的提供导览媒体呈现的方法可基于图3-b所示网络架构来具体实施。图3-b所示网络架构中主要包括DASH Client和内容服务器等。
如图3-a所示,本发明的另一个实施例提供的一种基于HTTP流媒体的提供导览媒体呈现的方法可以包括:
301、DASH客户端从内容服务器获取导览媒体呈现的媒体呈现描述。
其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元。
其中,所述N为大于1的整数。
其中,所述N例如可等于7、2、3、4、5、8、11、15、20、25、30或者其他值。
其中,所述DASH客户端例如可以为个人电脑,手机,平板电脑,电视机或机顶盒等。
302、DASH客户端根据所述导览媒体呈现的媒体呈现描述从内容服务器获取所述N个导览单元中的K个导览单元。
其中,所述K为小于或等于所述N的正整数。
其中,所述K例如可等于1、2、3、4、5、8、11、15、20、25、30或者其他值。
其中,K个导览单元可与K个逻辑呈现单元一一对应,即K个导览单元 中的每个导览单元可由不同的逻辑呈现单元来呈现。
303、DASH客户端呈现所述K个导览单元。
其中,所述K个导览单元中的每个导览单元可指向一个主媒体呈现。
其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。也就是说,导览单元的媒体表达的呈现质量低于导览单元所表示的主媒体呈现的呈现质量。
304、在所述K个导览单元中的导览单元i被选择的情况下,DASH客户端从内容服务器获取所述导览单元i所指向的主媒体呈现的媒体呈现描述。
305、DASH客户端基于所述主媒体呈现的媒体呈现描述,从内容服务器获取所述主媒体呈现。
306、DASH客户端呈现所述导览单元i所指向的主媒体呈现。
其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。也就是说,导览单元的媒体表达的呈现质量低于导览单元所表示的主媒体呈现的呈现质量。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述可以不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。即,所述导览媒体呈现可具有独立的媒体呈现描述,K个导览单元中的每个导览单元所指向主媒体呈现亦可具有独立的且不同于所述导览媒体呈现的媒体呈现描述的媒体呈现描述。例如K个导览单元指向了K个主媒体呈现,而K个主媒体呈现分别具有对应的媒体呈现描述,即K个媒体呈现描述,而导览媒体呈现的媒体呈现描述不同于这K个媒体呈现描述中的任意一个,即导览媒体呈现可由第K+1个媒体呈现描述。
此外,在本发明另一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述(也可称超级媒体呈现描述)。即可利用一个聚合媒体呈现描述(可称超级媒体呈现描述)来描述导览媒体呈现和导览媒体呈现所指向的主媒体呈现。超级媒体呈现描述的引入有利于增强导览媒体呈现和所导览的主媒体呈现之间的关联关系。
在实际应用中,导览单元指向主媒体呈现的方式可以很灵活,导览单元可以直接指向主媒体呈现,也可以间接的指向主媒体呈现。
举例来说,所述K个导览单元中的每个导览单元可以以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。当然,导览单元亦可通过其他的直接指向或间接指向的方式来指向主媒体呈现。例如,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述。这种情况下,所述K个导览单元中的每个导览单元可以以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
可选的,在本发明的一些可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量,进一步的,导览单元还可包括字幕分量或其他类型的媒体分量。
本发明通过媒体呈现描述(如DASH标准中的MPD),提供了导览业务的信令机制。媒体呈现描述可告知客户端一个导览业务由哪些导览单元组成,导览单元的分量,导览单元和导览业务的成员媒体呈现之间的关系,导览单元视频分量之间的关系,导览单元的音频分量之间的关系,导览单元的音频分量和视频分量之间的关系等。
可选的,在本发明一些可能实施方式中,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。例如,所述K个导览单元中导览单元i所包括的视频分量可归属于K个视频适配集中的视频适配集Ci,所述K个导览单元中导览单元j所包括的视频分量可归属于K个视频适配集中的视频适配集Cj,其中,视频适配集Cj和视频适配集Ci为所述K个视频适配集中的两个不同的视频适配集。导览单元j和导览单元i可为K个导览单元中的任意两个导览单元。
其中,所谓选择相容性,表示这些对象可同时被选择,例如若K个视频 适配集中的不同视频适配集之间具有选择相容性,则表示可同时选择K个视频适配集中的多个视频适配集中的媒体表达。
所谓选择互斥性,表示这些对象不支持同时被选择,例如若所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,表示不支持同时选择1个视频适配集中的多个媒体表达,例如假设K个视频适配集中的视频适配集I包括10个多个媒体表达,若视频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择10个媒体表达中的其中1个,而不能同时选择该10个媒体表达中的多个。
可选的,在本发明的一些可能实施方式中,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个视频适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性。例如,假设音频适配集包括20个多个媒体表达,若音频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择20个多个媒体表达中的其中1个,而不能同时选择该30个媒体表达中的多个。
可选的,在本发明的另一些可能的实施方式中,所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
可选的,在本发明的一些可能的实施方式中,所述音频适配集元素中的媒体表达元素,可以包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
可选的,在本发明一些可能实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达(representation)之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。例如媒体表达元素i所描述的媒体表达为媒体表达ri,媒体表达元素j所描述的媒体表达为媒体表达rj,若媒体表达元素i与媒体表达元素j包含相同区域说明,那么可以表示媒体表达ri与媒体表达rj之间具有关联关系。
可选的,在本发明一些可能实施方式中,媒体表达元素i和适配集元素ci 包含相同区域说明,那么也可能说明媒体表达元素i所描述的媒体表达与适配集元素ci所描述的适配集中的各媒体表达之间具有关联关系,例如媒体表达元素i可为音频媒体表达,而适配集元素ci所描述的适配集中的媒体表达可为视频媒体表达。
可选的,在本发明的一些可能的实施方式中,所述区域说明可为空间关系描述(SRD)。当然,所述区域说明亦可为其他类型的可用于描述位置区域的说明信息。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应。
其中,所述K个视频适配集元素中包括描述子元素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件例如可为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别(schemeIdUri)属性均相同。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。或者,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色,角色例如可能是主要、补充、字幕或翻译配音等。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci例如可为基本属性(EssentialProptery)元素或者补充属性(SupplementalProptery)元素或作用说明(Role)元素或者其他元素。
可选的,在本发明的一些可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件可为视频适配集元素所包括的描述子元素Ci的元素名称可相同、方法识别schemeIdUri属性可相同,且参数(value)属性可相同。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视 频适配集之间一一对应。其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I可为所述K个视频适配集中任意一个视频适配集。
其中,可根据场景需要来确定所述指针在视频适配集元素VI中承载位置。
例如,所述指针可由所述视频适配集元素VI的属性承载。
具体例如,所述指针可由所述视频适配集元素VI的xlink:href属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素或SupplementalProperty元素承载。
具体例如,所述指针可由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者,所述指针可由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
具体例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素的value属性或其它属性承载,或所述指针可由所述视频适配集元素VI之中的SupplementalProperty元素的value属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的虚拟Representation元素的属性承载,或所述指针可由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符(BaseURL)元素。
又举例来说,所述指针也可由所述视频适配集元素VI中的媒体呈现指向(ReferencedMediaPresentation)元素来承载。ReferencedMediaPresentation元素是新扩展的一种元素,也就是说,可以利用所述视频适配集元素VI中的新扩展出的元素来承载所述指针,所述视频适配集元素VI中新扩展出的承载所述指针的元素的名称并不限于ReferencedMediaPresentation,也可以为其它的元素名称。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的时间结构可不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现的时间结构。其中,导览单元的音频可以是通过对主媒体呈现的音频进行编码而得到,导览单元的视频可以是通过对主媒体呈现的视频进行编码而得到,这可使得导览单元的时间结构和主媒体呈现的时间结构之间没有相关性。
可以看出,本实施例的技术方案中,由于K个导览单元中的每个导览单元可以分别指向一个主媒体呈现,这样就相当于在导览单元和主媒体呈现之间引入的一定的关联关系,这使得在所述K个导览单元的导览单元i被选择的情况下,DASH客户端可获取与导览单元i指向的主媒体呈现j的媒体呈现描述,进而可以根据所述主媒体呈现j的媒体呈现描述获取所述主媒体呈现j进行呈现,可见这有利于实现导览媒体呈现和主媒体呈现之间的较灵活切换,进而实现在基于HTTP的媒体流服务场景下支持视频导览,进而有利于提升用户的高品质体验。
在导览服务中,各个导览单元的视频是平行和并列的,多个导览单元的视频呈现在用户设备的显示屏或者一个窗口,而音频则是互斥的,任何时间只能有一个导览单元的音频被选择和播放,该导览单元的视频画面正是用户的关注焦点所在。导览业务需要相应的信令机制支持。信令告知客户端一个导览业务由哪些导览单元组成,导览单元的分量,导览单元和导览业务的成员媒体呈现之间的关系,导览单元视频分量之间的关系,导览单元的音频分量之间的关系,导览单元的音频分量和视频分量之间的关系。导览业务的信令通过导览媒体呈现的描述文件来表示,实现为描述文件中的一些元素,表达上述媒体分量的媒体表达之间的各种关系。
以下提供了多种使用不同的工具实现导览业务的信令的实施例,例子中的导览业务服务于16个成员媒体呈现。这些MPD示例可以是根据如下的一些DASH规范及其增补修订:
ISO/IEC 23009-1:Part 1:Media presentation description and segment formats,2nd Edition,2014。
ISO/IEC 23009-1:2014/FDAM 1。
Part 1:Media presentation description and segment formats。
AMENDMENT 1:High Profile and Availability Time Synchronization Extended profiles and time synchronization,ISO/IEC 23009-1:2014/FDAM 1Part 1:Media presentation description and segment formats。
ISO/IEC 23009-1:2014/DAM 2。
Part 1:Media presentation description and segment formats。
AMENDMENT 2:Spatial Relationship Description,Generalized URL parameters and other extensions。
为方便起见,每个示例并不是完整的MPD,而是为了说明本发明相关的特点而截取的MPD片段。
示例场景S1、在示例场景S1中示例了导览业务的一种信令机制,告知客户端一个导览业务由哪些导览单元组成,导览单元的分量,导览单元和导览业务的成员媒体呈现之间的关系,导览单元视频分量之间的关系,导览单元的音频分量之间的关系,导览单元的音频分量和视频分量之间的关系。
在这个例子中,用途描述子(Role)元素用于各个适配集元素,包括视频适配集元素和音频适配集元素。这样,适配集元素包含用途描述子(Role)元素,且该描述子元素的参数为“主要成分”(main)的适配集是相容的,可以一起被客户端选择。对于视频,多个视频适配集中的媒体表达——不同的导览单元的视频媒体表达可以一起被选择,在客户端上呈现。对于音频,只有一个音频媒体表达被选择,对应于一个导览单元。
导览单元(的视频)和它所代表的主媒体呈现是通过该导览单元的视频适配集元素的属性表达的,具体地,是属性@xlink:href,该属性本质上是一个指针,用它指向一个远端的主媒体呈现的媒体呈现描述。因为该指向元素不是适配集元素,所以该指向的元素未被嵌入到导览媒体呈现描述中(MPD的数据模型是层次化的,一个元素只包含比其更低级类型的元素,不包含比其更高级类型的元素),这可以用@xlink:show来表述。
现在的DASH标准规范中,@xlink:href指向的元素是和该属性所在的元素的类型一致的,即,如果该属性在适配集元素层次上,它指向的元素是适配 集元素类型。本发明中,扩展了该属性指向元素的类型,用它指向一个媒体呈现。不同于现有规范的另外一点在于,适配集元素既有远端的元素(该属性指向一个远端元素)又包含本地的媒体表达,这在现有DASH规范中是不成立的。
音频媒体表达中通过关联信令建立和同一导览单元的视频媒体表达的关联关系,具体的,通过@associationId属性引用所关联的视频媒体表达的标识——@id的值,@associationType可以不出现,表示未知的关联关系,或者增加一种关联关系的定义,如“伴随(accompany)”。
媒体呈现描述的元素在语义上的差异反映在客户端的行为上。客户端选择多个在导览业务中相同地位的媒体表达,地位是有媒体表达所属的适配集元素中的用途描述子(Role)元素说明,如:用途描述子元素的参数都是main,表明适配集中的媒体表达是媒体呈现中的主要成分。客户端选择多个导览单元的视频媒体表达,从内容服务器请求这些媒体表达的片段,经过处理,一起呈现给用户。诸如这些事情:选择几个视频适配集(视频媒体表达),以什么顺序呈现它们,呈现的位置布局,呈现方式(动态图像序列)等,都是可以由客户端决定的。决定可以根据用户的指令,用户对客户端的配置,客户端的能力等作出。
当用户的关注焦点停留在一个导览单元的视频画面,客户端选择该导览单元的音频媒体表达,获取该音频媒体表达的片段,播放音频。
当用户选择一个导览单元的视频画面,表示要观看对应的主媒体呈现时,客户端切换到主媒体呈现。切换过程可包括以下步骤:客户端首先根据导览单元中的指针,获取主媒体呈现的媒体呈现描述,第二步解析主媒体呈现的媒体呈现描述,选择合适的媒体表达;第三步,从某一时间位置加入主媒体呈现,这实际是定位操作(seeking)。如果导览业务是为直播的媒体呈现服务,那么这一时间位置是发生切换的媒体内容的时间位置,即中断导览业务时间位置。
下面给出示例场景S1中的一个可能的MPD示例。
Figure PCTCN2015073148-appb-000001
Figure PCTCN2015073148-appb-000002
Figure PCTCN2015073148-appb-000003
示例场景S2。在示例场景S2中示例了导览业务的一种信令机制,场景S2示例出了MPD用于表示导览业务的组成。导览说明方法带有一个通用资源识别符(Universal Resource Identifier)作为参数,其中,该通用资源识别符用于指向一个媒体呈现,实际上通过指向这个媒体呈现的媒体呈现描述来指向这个媒体呈现。
为该方法定义一个方法标识,如:urn:mpeg:dash:mosaic:2011。如果基本属性描述子(EssentialProperty),补充属性描述子(SupplementalProperty)的@schemeId取值为该方法标识,可表示包含该描述子的元素:适配集或者媒体表达,是导览业务的组成部分,该描述子的属性@value就是导览业务说明方法的参数,指向主媒体呈现的媒体呈现描述的通用资源识别符。
下面给出示例场景S2中的一个可能的MPD示例。
Figure PCTCN2015073148-appb-000004
Figure PCTCN2015073148-appb-000005
示例场景S3
示例场景S3中,一个视频适配集(对应于一个导览单元)有两个媒体表达。其中一个是虚拟的媒体表达,不含有任何的片段,而指向导览单元所代表的主媒体呈现,实际上通过指向这个媒体呈现的媒体呈现描述来指向这个媒体呈现。这种情况下,片段的模板不出现在适配集元素层次上,而出现在实际的媒体表达元素中。
下面给出示例场景S3中的一个可能的MPD示例。
Figure PCTCN2015073148-appb-000006
Figure PCTCN2015073148-appb-000007
Figure PCTCN2015073148-appb-000008
示例场景S4
示例场景S4中考虑到严格地与现有DASH中的媒体呈现描述保持兼容可能导致模糊和歧义,如一个被引用的远端单元只在被解析之后才可能知道它的类型,因为一个远端单元只是一个XML对象,它的类型可能是一个媒体呈现描述类型,也可能是一个时间段或者一个适配集。如果放松兼容性限制,在媒体呈现描述中引入一个新的元素说明表示引用一个媒体表达,这样就可以避免歧义。该元素可以归属于不同层级的父元素,如适配集,媒体表达。示例场景S4的例子中媒体呈现引用(ReferencedMediaPresentation)就是一种具体的实现方式。
下面给出示例场景S4中的一个可能的MPD示例。
Figure PCTCN2015073148-appb-000009
Figure PCTCN2015073148-appb-000010
示例场景S5
示例场景S5中给出了聚合媒体呈现描述的例子。聚合媒体呈现描述是MPD,是MPD的超集。它描述了多个并行的媒体呈现,包括成员媒体呈现和导览媒体呈现。聚合媒体呈现描述中引入了呈现元素,它可以是一个远端的元素,指向一个媒体呈现描述,或者是一个嵌入的媒体呈现描述。
下面的举例中,成员媒体呈现的媒体呈现描述是远端元素,而导览媒体呈现是本地的,是嵌入的媒体呈现描述。
下面给出示例场景S5中的一个可能的MPD示例。
Figure PCTCN2015073148-appb-000011
Figure PCTCN2015073148-appb-000012
可以理解,上述示例的MPD只为举例说明,本发明实施例的技术方案并不受上述举例的限制。
本发明实施例还提供用于实施上述方案的相关装置。
参见图4,本发明实施例提供一种客户端400,可包括:
第一获取单元410,用于获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;
第二获取单元420,用于根据所述导览媒体呈现的媒体呈现描述获取所述N个导览单元中的K个导览单元;
呈现单元430,用于呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述可以不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。即,所述导览媒体呈现可具有独立的媒体呈现描述,K个导览单元中的每个导览单元所指向主媒体呈现亦可具有独立的且不同于所述导览媒体呈现的媒体呈现描述的媒体呈现描述。例如K个导览单元指向了K个主媒体呈现,而K个主媒体呈现分别具有对应的媒体呈现描述,即K个媒体呈现描述,而导览媒体呈现的媒体呈现描述不同于这K个媒体呈现描述中的任意一个,即导览媒体呈现可由第K+1个媒体呈现描述。
此外,在本发明另一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描 述可被聚合形成了一个聚合媒体呈现描述(也可称超级媒体呈现描述)。即可利用一个聚合媒体呈现描述(可称超级媒体呈现描述)来描述导览媒体呈现和导览媒体呈现所指向的主媒体呈现。超级媒体呈现描述的引入有利于增强导览媒体呈现和所导览的主媒体呈现之间的关联关系。
在实际应用中,导览单元指向主媒体呈现的方式可以很灵活,导览单元可以直接指向主媒体呈现,也可以间接的指向主媒体呈现。
举例来说,所述K个导览单元中的每个导览单元可以以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。当然,导览单元亦可通过其他的直接指向或间接指向的方式来指向主媒体呈现。例如,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述。这种情况下,所述K个导览单元中的每个导览单元可以以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
可选的,在本发明的一些可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量,进一步的,导览单元还可包括字幕分量或其他类型的媒体分量。
本发明通过媒体呈现描述(如DASH标准中的MPD),提供了导览业务的信令机制。媒体呈现描述可告知客户端一个导览业务由哪些导览单元组成,导览单元的分量,导览单元和导览业务的成员媒体呈现之间的关系,导览单元视频分量之间的关系,导览单元的音频分量之间的关系,导览单元的音频分量和视频分量之间的关系等。
可选的,在本发明一些可能实施方式中,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。例如,所述K个导览单元中导览单元i所包括的视频分量可归属于K个视频适配集中的视频适配集Ci,所述K个导览单元中导览单元j所包括的 视频分量可归属于K个视频适配集中的视频适配集Cj,其中,视频适配集Cj和视频适配集Ci为所述K个视频适配集中的两个不同的视频适配集。导览单元j和导览单元i可为K个导览单元中的任意两个导览单元。
其中,所谓选择相容性,表示这些对象可同时被选择,例如若K个视频适配集中的不同视频适配集之间具有选择相容性,则表示可同时选择K个视频适配集中的多个视频适配集中的媒体表达。
所谓选择互斥性,表示这些对象不支持同时被选择,例如若所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,表示不支持同时选择1个视频适配集中的多个媒体表达,例如假设K个视频适配集中的视频适配集I包括10个多个媒体表达,若视频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择10个媒体表达中的其中1个,而不能同时选择该10个媒体表达中的多个。
可选的,在本发明的一些可能实施方式中,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个视频适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性。例如,假设音频适配集包括20个多个媒体表达,若音频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择20个多个媒体表达中的其中1个,而不能同时选择该30个媒体表达中的多个。
可选的,在本发明的另一些可能的实施方式中,所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
可选的,在本发明的一些可能的实施方式中,所述音频适配集元素中的媒体表达元素,可以包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
可选的,在本发明一些可能实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达(representation)之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。例如媒体表达元素i 所描述的媒体表达为媒体表达ri,媒体表达元素j所描述的媒体表达为媒体表达rj,若媒体表达元素i与媒体表达元素j包含相同区域说明,那么可以表示媒体表达ri与媒体表达rj之间具有关联关系。
可选的,在本发明一些可能实施方式中,媒体表达元素i和适配集元素ci包含相同区域说明,那么也可能说明媒体表达元素i所描述的媒体表达与适配集元素ci所描述的适配集中的各媒体表达之间具有关联关系,例如媒体表达元素i可为音频媒体表达,而适配集元素ci所描述的适配集中的媒体表达可为视频媒体表达。
可选的,在本发明的一些可能的实施方式中,所述区域说明可为空间关系描述(SRD)。当然,所述区域说明亦可为其他类型的可用于描述位置区域的说明信息。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应。
其中,所述K个视频适配集元素中包括描述子元素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件例如可为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别(schemeIdUri)属性均相同。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。或者,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色,角色例如可能是主要、补充、字幕或翻译配音等。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci例如可为基本属性(EssentialProptery)元素或者补充属性(SupplementalProptery)元素或作用说明(Role)元素或者其他元素。
可选的,在本发明的一些可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件可为视频适配集元素所包括的描述子元素 Ci的元素名称可相同、方法识别schemeIdUri属性可相同,且参数(value)属性可相同。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视频适配集之间一一对应。其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I可为所述K个视频适配集中任意一个视频适配集。
其中,可根据场景需要来确定所述指针在视频适配集元素VI中承载位置。
例如,所述指针可由所述视频适配集元素VI的属性承载。
具体例如,所述指针可由所述视频适配集元素VI的xlink:href属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素或SupplementalProperty元素承载。
具体例如,所述指针可由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者,所述指针可由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
具体例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素的value属性或其它属性承载,或所述指针可由所述视频适配集元素VI之中的SupplementalProperty元素的value属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的虚拟Representation元素的属性承载,或所述指针可由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符(BaseURL)元素。
又举例来说,所述指针也可由所述视频适配集元素VI中的媒体呈现指向(ReferencedMediaPresentation)元素来承载。ReferencedMediaPresentation元 素是新扩展的一种元素,也就是说,可以利用所述视频适配集元素VI中的新扩展出的元素来承载所述指针,所述视频适配集元素VI中新扩展出的承载所述指针的元素的名称并不限于ReferencedMediaPresentation,也可以为其它的元素名称。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的时间结构可不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现的时间结构。其中,导览单元的音频可以是通过对主媒体呈现的音频进行编码而得到,导览单元的视频可以是通过对主媒体呈现的视频进行编码而得到,这可使得导览单元的时间结构和主媒体呈现的时间结构之间没有相关性。
可选的,在本发明的一些可能的实施方式中,所述呈现单元还用于在关注焦点停留在所述K个导览单元中的导览单元i的情况下,呈现所述导览单元i的音频分量。
可选的,在本发明的一些可能的实施方式中,所述呈现单元还用于在所述K个导览单元中的导览单元i被选择的情况下,获取所述导览单元i所指向的主媒体呈现。进一步的,所述客户端还可呈现所述导览单元i所指向的主媒体呈现。
其中,所述客户端400例如可以为个人电脑,手机,平板电脑,电视机或机顶盒等。
可以理解的是,本实施例的客户端400的各功能模块的功能可根据上述方法实施例中的方法具体实现,其具体实现过程可以参照上述方法实施例的相关描述,此处不再赘述。其中,客户端400可用于实施上述实施例提供的任意一种基于超文本传输协议媒体流的媒体呈现导览方法。
可以看出,本实施例的技术方案中,由于K个导览单元中的每个导览单元可以分别指向一个主媒体呈现,这样就相当于在导览单元和主媒体呈现之间引入的一定的关联关系,这使得在所述K个导览单元的导览单元i被选择的情况下,所述客户端400可获取与导览单元i指向的主媒体呈现j的媒体呈现描述,进而可以根据所述主媒体呈现j的媒体呈现描述获取所述主媒体呈现j进行呈现,可见这有利于实现导览媒体呈现和主媒体呈现之间的较灵活切换,进 而实现在基于HTTP的媒体流服务场景下支持视频导览,进而有利于提升用户的高品质体验。
参见图5,本发明实施例提供的一种客户端500,可包括:
处理器502和存储器503。其中,处理器502和存储器503通过总线501耦合连接。
所述处理器502通过调用所述存储器503中的代码或指令以用于,获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;根据所述导览媒体呈现的媒体呈现描述获取所述N个导览单元中的K个导览单元;呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述可以不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。即,所述导览媒体呈现可具有独立的媒体呈现描述,K个导览单元中的每个导览单元所指向主媒体呈现亦可具有独立的且不同于所述导览媒体呈现的媒体呈现描述的媒体呈现描述。例如K个导览单元指向了K个主媒体呈现,而K个主媒体呈现分别具有对应的媒体呈现描述,即K个媒体呈现描述,而导览媒体呈现的媒体呈现描述不同于这K个媒体呈现描述中的任意一个,即导览媒体呈现可由第K+1个媒体呈现描述。
此外,在本发明另一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述(也可称超级媒体呈现描述)。即可利用一个聚合媒体呈现描述(可称超级媒体呈现描述)来描述导览媒体呈现和导览媒体呈现所指向的主媒体呈现。超级媒体呈现描述的引入有利于增强导览媒体呈现和所导览的主媒体呈现之间的关联关系。
在实际应用中,导览单元指向主媒体呈现的方式可以很灵活,导览单元可以直接指向主媒体呈现,也可以间接的指向主媒体呈现。
举例来说,所述K个导览单元中的每个导览单元可以以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。当然,导览单元亦可通过其他的直接指向或间接指向的方式来指向主媒体呈现。例如,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述。这种情况下,所述K个导览单元中的每个导览单元可以以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
可选的,在本发明的一些可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量,进一步的,导览单元还可包括字幕分量或其他类型的媒体分量。
本发明通过媒体呈现描述(如DASH标准中的MPD),提供了导览业务的信令机制。媒体呈现描述可告知客户端一个导览业务由哪些导览单元组成,导览单元的分量,导览单元和导览业务的成员媒体呈现之间的关系,导览单元视频分量之间的关系,导览单元的音频分量之间的关系,导览单元的音频分量和视频分量之间的关系等。
可选的,在本发明一些可能实施方式中,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。例如,所述K个导览单元中导览单元i所包括的视频分量可归属于K个视频适配集中的视频适配集Ci,所述K个导览单元中导览单元j所包括的视频分量可归属于K个视频适配集中的视频适配集Cj,其中,视频适配集Cj和视频适配集Ci为所述K个视频适配集中的两个不同的视频适配集。导览单元j和导览单元i可为K个导览单元中的任意两个导览单元。
其中,所谓选择相容性,表示这些对象可同时被选择,例如若K个视频适配集中的不同视频适配集之间具有选择相容性,则表示可同时选择K个视频适配集中的多个视频适配集中的媒体表达。
所谓选择互斥性,表示这些对象不支持同时被选择,例如若所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,表示不支持同时选择1个视频适配集中的多个媒体表达,例如假设K个视频适配集中的视频适配集I包括10个多个媒体表达,若视频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择10个媒体表达中的其中1个,而不能同时选择该10个媒体表达中的多个。
可选的,在本发明的一些可能实施方式中,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个视频适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性。例如,假设音频适配集包括20个多个媒体表达,若音频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择20个多个媒体表达中的其中1个,而不能同时选择该30个媒体表达中的多个。
可选的,在本发明的另一些可能的实施方式中,所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
可选的,在本发明的一些可能的实施方式中,所述音频适配集元素中的媒体表达元素,可以包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
可选的,在本发明一些可能实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达(representation)之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。例如媒体表达元素i所描述的媒体表达为媒体表达ri,媒体表达元素j所描述的媒体表达为媒体表达rj,若媒体表达元素i与媒体表达元素j包含相同区域说明,那么可以表示媒体表达ri与媒体表达rj之间具有关联关系。
可选的,在本发明一些可能实施方式中,媒体表达元素i和适配集元素ci包含相同区域说明,那么也可能说明媒体表达元素i所描述的媒体表达与适配集元素ci所描述的适配集中的各媒体表达之间具有关联关系,例如媒体表达 元素i可为音频媒体表达,而适配集元素ci所描述的适配集中的媒体表达可为视频媒体表达。
可选的,在本发明的一些可能的实施方式中,所述区域说明可为空间关系描述(SRD)。当然,所述区域说明亦可为其他类型的可用于描述位置区域的说明信息。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应。
其中,所述K个视频适配集元素中包括描述子元素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件例如可为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别(schemeIdUri)属性均相同。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。或者,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色,角色例如可能是主要、补充、字幕或翻译配音等。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci例如可为基本属性(EssentialProptery)元素或者补充属性(SupplementalProptery)元素或作用说明(Role)元素或者其他元素。
可选的,在本发明的一些可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件可为视频适配集元素所包括的描述子元素Ci的元素名称可相同、方法识别schemeIdUri属性可相同,且参数(value)属性可相同。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视频适配集之间一一对应。其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适 配集I可为所述K个视频适配集中任意一个视频适配集。
其中,可根据场景需要来确定所述指针在视频适配集元素VI中承载位置。
例如,所述指针可由所述视频适配集元素VI的属性承载。
具体例如,所述指针可由所述视频适配集元素VI的xlink:href属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素或SupplementalProperty元素承载。
具体例如,所述指针可由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者,所述指针可由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
具体例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素的value属性或其它属性承载,或所述指针可由所述视频适配集元素VI之中的SupplementalProperty元素的value属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的虚拟Representation元素的属性承载,或所述指针可由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符(BaseURL)元素。
又举例来说,所述指针也可由所述视频适配集元素VI中的媒体呈现指向(ReferencedMediaPresentation)元素来承载。ReferencedMediaPresentation元素是新扩展的一种元素,也就是说,可以利用所述视频适配集元素VI中的新扩展出的元素来承载所述指针,所述视频适配集元素VI中新扩展出的承载所述指针的元素的名称并不限于ReferencedMediaPresentation,也可以为其它的元素名称。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的时间结构可不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现 的时间结构。其中,导览单元的音频可以是通过对主媒体呈现的音频进行编码而得到,导览单元的视频可以是通过对主媒体呈现的视频进行编码而得到,这可使得导览单元的时间结构和主媒体呈现的时间结构之间没有相关性。
可选的,在本发明的一些可能的实施方式中,所述处理器还用于在关注焦点停留在所述K个导览单元中的导览单元i的情况下,呈现所述导览单元i的音频分量。
可选的,在本发明的一些可能的实施方式中,所述处理器还用于在所述K个导览单元中的导览单元i被选择的情况下,获取所述导览单元i所指向的主媒体呈现。进一步的,所述客户端还可呈现所述导览单元i所指向的主媒体呈现。
其中,所述客户端500例如可以为个人电脑,手机,平板电脑,电视机或机顶盒等。
可以理解的是,本实施例的客户端500的的功能可根据上述方法实施例中的方法具体实现,其具体实现过程可以参照上述方法实施例的相关描述,此处不再赘述。其中,客户端500可用于实施上述实施例提供的任意一种基于超文本传输协议媒体流的媒体呈现导览方法。
可以看出,本实施例的技术方案中,由于K个导览单元中的每个导览单元可以分别指向一个主媒体呈现,这样就相当于在导览单元和主媒体呈现之间引入的一定的关联关系,这使得在所述K个导览单元的导览单元i被选择的情况下,所述客户端500可获取与导览单元i指向的主媒体呈现j的媒体呈现描述,进而可以根据所述主媒体呈现j的媒体呈现描述获取所述主媒体呈现j进行呈现,可见这有利于实现导览媒体呈现和主媒体呈现之间的较灵活切换,进而实现在基于HTTP的媒体流服务场景下支持视频导览,进而有利于提升用户的高品质体验。
参见图6,本发明实施例提供一种服务器600,可包括:
确定单元610,用于确定导览媒体呈现包括的N个导览单元。
生成单元620,用于生成导览媒体呈现的媒体呈现描述,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为 大于1的整数;所述N个导览单元中的每个导览单元指向一个主媒体呈现,所述N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
其中,N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。也就是说,导览单元的媒体表达的呈现质量低于导览单元所表示的主媒体呈现的呈现质量。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述可以不同于所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。即,所述导览媒体呈现可具有独立的媒体呈现描述,N个导览单元中的每个导览单元所指向主媒体呈现亦可具有独立的且不同于所述导览媒体呈现的媒体呈现描述的媒体呈现描述。例如N个导览单元指向了N个主媒体呈现,而N个主媒体呈现分别具有对应的媒体呈现描述,即N个媒体呈现描述,而导览媒体呈现的媒体呈现描述不同于这N个媒体呈现描述中的任意一个,即导览媒体呈现可由第K+1个媒体呈现描述。
此外,在本发明另一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述(也可称超级媒体呈现描述)。即可利用一个聚合媒体呈现描述(可称超级媒体呈现描述)来描述导览媒体呈现和导览媒体呈现所指向的主媒体呈现。超级媒体呈现描述的引入有利于增强导览媒体呈现和所导览的主媒体呈现之间的关联关系。
在实际应用中,导览单元指向主媒体呈现的方式可以很灵活,导览单元可以直接指向主媒体呈现,也可以间接的指向主媒体呈现。
举例来说,所述N个导览单元中的每个导览单元可以以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。当然,导览单元亦可通过其他的直接指向或间接指向的方式来指向主媒体呈现。例如,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述。这种情况下,所述N个导览单元中的每个导览单元可以以引用所述聚合媒体呈现描述中的呈现 元素的方式来指向一个主媒体呈现。
可选的,在本发明的一些可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量,进一步的,导览单元还可包括字幕分量或其他类型的媒体分量。
本发明通过媒体呈现描述(如DASH标准中的MPD),提供了导览业务的信令机制。媒体呈现描述可告知客户端一个导览业务由哪些导览单元组成,导览单元的分量,导览单元和导览业务的成员媒体呈现之间的关系,导览单元视频分量之间的关系,导览单元的音频分量之间的关系,导览单元的音频分量和视频分量之间的关系等。
可选的,在本发明一些可能实施方式中,所述N个导览单元中的不同导览单元所包括的视频分量为N个视频适配集中的不同视频适配集中的媒体表达,其中,所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述N个视频适配集中的不同视频适配集之间具有选择相容性。例如,所述N个导览单元中导览单元i所包括的视频分量可归属于N个视频适配集中的视频适配集Ci,所述N个导览单元中导览单元j所包括的视频分量可归属于N个视频适配集中的视频适配集Cj,其中,视频适配集Cj和视频适配集Ci为所述N个视频适配集中的两个不同的视频适配集。导览单元j和导览单元i可为N个导览单元中的任意两个导览单元。
其中,所谓选择相容性,表示这些对象可同时被选择,例如若N个视频适配集中的不同视频适配集之间具有选择相容性,则表示可同时选择N个视频适配集中的多个视频适配集中的媒体表达。
所谓选择互斥性,表示这些对象不支持同时被选择,例如若所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,表示不支持同时选择1个视频适配集中的多个媒体表达,例如假设N个视频适配集中的视频适配集I包括10个多个媒体表达,若视频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择10个媒体表达中的其中1个,而不能同时选择该10个媒体表达中的多个。
可选的,在本发明的一些可能实施方式中,所述N个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述N个视频适配集中的任意一个适配集,所述音频分量适配集与所述N个视频适配集之间具有选择相容性。例如,假设音频适配集包括20个多个媒体表达,若音频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择20个多个媒体表达中的其中1个,而不能同时选择该30个媒体表达中的多个。
可选的,在本发明的另一些可能的实施方式中,所述N个导览单元中的不同导览单元所包括的音频分量为N个音频适配集中的不同音频适配集中的媒体表达,其中,所述N个音频适配集中的不同音频适配集之间具有选择互斥性。
可选的,在本发明的一些可能的实施方式中,所述音频适配集元素中的媒体表达元素,可以包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
可选的,在本发明一些可能实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达(representation)之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。例如媒体表达元素i所描述的媒体表达为媒体表达ri,媒体表达元素j所描述的媒体表达为媒体表达rj,若媒体表达元素i与媒体表达元素j包含相同区域说明,那么可以表示媒体表达ri与媒体表达rj之间具有关联关系。
可选的,在本发明一些可能实施方式中,媒体表达元素i和适配集元素ci包含相同区域说明,那么也可能说明媒体表达元素i所描述的媒体表达与适配集元素ci所描述的适配集中的各媒体表达之间具有关联关系,例如媒体表达元素i可为音频媒体表达,而适配集元素ci所描述的适配集中的媒体表达可为视频媒体表达。
可选的,在本发明的一些可能的实施方式中,所述区域说明可为空间关系描述(SRD)。当然,所述区域说明亦可为其他类型的可用于描述位置区域的说明信息。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈 现描述中包括N个视频适配集元素,所述N个视频适配集元素与所述N个视频适配集之间一一对应。
其中,所述N个视频适配集元素中包括描述子元素Ci,所述N个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件例如可为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别(schemeIdUri)属性均相同。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。或者,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色,角色例如可能是主要、补充、字幕或翻译配音等。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci例如可为基本属性(EssentialProptery)元素或者补充属性(SupplementalProptery)元素或作用说明(Role)元素或者其他元素。
可选的,在本发明的一些可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件可为视频适配集元素所包括的描述子元素Ci的元素名称可相同、方法识别schemeIdUri属性可相同,且参数(value)属性可相同。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述N个视频适配集元素,所述N个视频适配集元素与N个视频适配集之间一一对应。其中,所述N个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I可为所述N个视频适配集中任意一个视频适配集。
其中,可根据场景需要来确定所述指针在视频适配集元素VI中承载位置。
例如,所述指针可由所述视频适配集元素VI的属性承载。
具体例如,所述指针可由所述视频适配集元素VI的xlink:href属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素 或SupplementalProperty元素承载。
具体例如,所述指针可由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者,所述指针可由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
具体例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素的value属性或其它属性承载,或所述指针可由所述视频适配集元素VI之中的SupplementalProperty元素的value属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的虚拟Representation元素的属性承载,或所述指针可由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符(BaseURL)元素。
又举例来说,所述指针也可由所述视频适配集元素VI中的媒体呈现指向(ReferencedMediaPresentation)元素来承载。ReferencedMediaPresentation元素是新扩展的一种元素,也就是说,可以利用所述视频适配集元素VI中的新扩展出的元素来承载所述指针,所述视频适配集元素VI中新扩展出的承载所述指针的元素的名称并不限于ReferencedMediaPresentation,也可以为其它的元素名称。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的时间结构可不依赖于所述导览媒体呈现中的所述N个导览单元所指向的主媒体呈现的时间结构。其中,导览单元的音频可以是通过对主媒体呈现的音频进行编码而得到,导览单元的视频可以是通过对主媒体呈现的视频进行编码而得到,这可使得导览单元的时间结构和主媒体呈现的时间结构之间没有相关性。
可以理解的是,本实施例的服务器600的各功能模块的功能可根据上述方法实施例中的方法具体实现,其具体实现过程可以参照上述方法实施例的相关描述,此处不再赘述。其中,服务器600可用于实施上述实施例提供的任意一 种基于超文本传输协议媒体流的媒体呈现导览方法。
其中,服务器600可为内容服务器或其他服务器。
可以看出,本实施例的技术方案中,服务器600生成的导览媒体呈现的媒体呈现描述所描述的导览媒体呈现包括的N个导览单元,由于N个导览单元中的每个导览单元可以分别指向一个主媒体呈现,这样就相当于在导览单元和主媒体呈现之间引入的一定的关联关系,这使得客户端在所述N个导览单元的导览单元i被选择的情况下,所述客户端可获取与导览单元i指向的主媒体呈现j的媒体呈现描述,进而可以根据所述主媒体呈现j的媒体呈现描述获取所述主媒体呈现j进行呈现,可见这中方案为实现导览媒体呈现和主媒体呈现之间的较灵活切换奠定了基础,进而为实现在基于HTTP的媒体流服务场景下支持视频导览奠定了基础。
参见图7,本发明实施例提供的一种服务器700,可包括:
处理器702和存储器703。其中,处理器702和存储器703通过总线701耦合连接。
所述处理器702通过调用所述存储器703中的代码或指令以用于,确定导览媒体呈现包括的N个导览单元;生成导览媒体呈现的媒体呈现描述,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;所述N个导览单元中的每个导览单元指向一个主媒体呈现,所述N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
其中,N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。也就是说,导览单元的媒体表达的呈现质量低于导览单元所表示的主媒体呈现的呈现质量。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述可以不同于所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。即,所述导览媒体呈现可具有独立的媒体呈现描述,N个导览单元中的每个导览单元所指向主媒体呈现亦可具有独立的且不同于所述导览 媒体呈现的媒体呈现描述的媒体呈现描述。例如N个导览单元指向了N个主媒体呈现,而N个主媒体呈现分别具有对应的媒体呈现描述,即N个媒体呈现描述,而导览媒体呈现的媒体呈现描述不同于这N个媒体呈现描述中的任意一个,即导览媒体呈现可由第K+1个媒体呈现描述。
此外,在本发明另一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述(也可称超级媒体呈现描述)。即可利用一个聚合媒体呈现描述(可称超级媒体呈现描述)来描述导览媒体呈现和导览媒体呈现所指向的主媒体呈现。超级媒体呈现描述的引入有利于增强导览媒体呈现和所导览的主媒体呈现之间的关联关系。
在实际应用中,导览单元指向主媒体呈现的方式可以很灵活,导览单元可以直接指向主媒体呈现,也可以间接的指向主媒体呈现。
举例来说,所述N个导览单元中的每个导览单元可以以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。当然,导览单元亦可通过其他的直接指向或间接指向的方式来指向主媒体呈现。例如,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述可被聚合形成了一个聚合媒体呈现描述。这种情况下,所述N个导览单元中的每个导览单元可以以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
可选的,在本发明的一些可能的实施方式中,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量,进一步的,导览单元还可包括字幕分量或其他类型的媒体分量。
本发明通过媒体呈现描述(如DASH标准中的MPD),提供了导览业务的信令机制。媒体呈现描述可告知客户端一个导览业务由哪些导览单元组成,导览单元的分量,导览单元和导览业务的成员媒体呈现之间的关系,导览单元视频分量之间的关系,导览单元的音频分量之间的关系,导览单元的音频分量和视频分量之间的关系等。
可选的,在本发明一些可能实施方式中,所述N个导览单元中的不同导览单元所包括的视频分量为N个视频适配集中的不同视频适配集中的媒体表达,其中,所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述N个视频适配集中的不同视频适配集之间具有选择相容性。例如,所述N个导览单元中导览单元i所包括的视频分量可归属于N个视频适配集中的视频适配集Ci,所述N个导览单元中导览单元j所包括的视频分量可归属于N个视频适配集中的视频适配集Cj,其中,视频适配集Cj和视频适配集Ci为所述N个视频适配集中的两个不同的视频适配集。导览单元j和导览单元i可为N个导览单元中的任意两个导览单元。
其中,所谓选择相容性,表示这些对象可同时被选择,例如若N个视频适配集中的不同视频适配集之间具有选择相容性,则表示可同时选择N个视频适配集中的多个视频适配集中的媒体表达。
所谓选择互斥性,表示这些对象不支持同时被选择,例如若所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,表示不支持同时选择1个视频适配集中的多个媒体表达,例如假设N个视频适配集中的视频适配集I包括10个多个媒体表达,若视频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择10个媒体表达中的其中1个,而不能同时选择该10个媒体表达中的多个。
可选的,在本发明的一些可能实施方式中,所述N个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述N个视频适配集中的任意一个适配集,所述音频分量适配集与所述N个视频适配集之间具有选择相容性。例如,假设音频适配集包括20个多个媒体表达,若音频适配集中的媒体表达之间具有选择互斥性,那么每次只能选择20个多个媒体表达中的其中1个,而不能同时选择该30个媒体表达中的多个。
可选的,在本发明的另一些可能的实施方式中,所述N个导览单元中的不同导览单元所包括的音频分量为N个音频适配集中的不同音频适配集中的媒体表达,其中,所述N个音频适配集中的不同音频适配集之间具有选择互斥性。
可选的,在本发明的一些可能的实施方式中,所述音频适配集元素中的媒体表达元素,可以包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
可选的,在本发明一些可能实施方式中,包含相同区域说明的媒体表达元素所描述的媒体表达(representation)之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。例如媒体表达元素i所描述的媒体表达为媒体表达ri,媒体表达元素j所描述的媒体表达为媒体表达rj,若媒体表达元素i与媒体表达元素j包含相同区域说明,那么可以表示媒体表达ri与媒体表达rj之间具有关联关系。
可选的,在本发明一些可能实施方式中,媒体表达元素i和适配集元素ci包含相同区域说明,那么也可能说明媒体表达元素i所描述的媒体表达与适配集元素ci所描述的适配集中的各媒体表达之间具有关联关系,例如媒体表达元素i可为音频媒体表达,而适配集元素ci所描述的适配集中的媒体表达可为视频媒体表达。
可选的,在本发明的一些可能的实施方式中,所述区域说明可为空间关系描述(SRD)。当然,所述区域说明亦可为其他类型的可用于描述位置区域的说明信息。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括N个视频适配集元素,所述N个视频适配集元素与所述N个视频适配集之间一一对应。
其中,所述N个视频适配集元素中包括描述子元素Ci,所述N个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件例如可为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别(schemeIdUri)属性均相同。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。或者,所述描述子元素Ci可描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现 的角色,角色例如可能是主要、补充、字幕或翻译配音等。
可选的,在本发明的一些可能的实施方式中,所述描述子元素Ci例如可为基本属性(EssentialProptery)元素或者补充属性(SupplementalProptery)元素或作用说明(Role)元素或者其他元素。
可选的,在本发明的一些可能的实施方式中,若描述子元素Ci为作用说明Role元素,则所述设定共性条件可为视频适配集元素所包括的描述子元素Ci的元素名称可相同、方法识别schemeIdUri属性可相同,且参数(value)属性可相同。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的媒体呈现描述中包括所述N个视频适配集元素,所述N个视频适配集元素与N个视频适配集之间一一对应。其中,所述N个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I可为所述N个视频适配集中任意一个视频适配集。
其中,可根据场景需要来确定所述指针在视频适配集元素VI中承载位置。
例如,所述指针可由所述视频适配集元素VI的属性承载。
具体例如,所述指针可由所述视频适配集元素VI的xlink:href属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素或SupplementalProperty元素承载。
具体例如,所述指针可由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者,所述指针可由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针可由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
具体例如,所述指针可由所述视频适配集元素VI中的EssentialProptery元素的value属性或其它属性承载,或所述指针可由所述视频适配集元素VI之中的SupplementalProperty元素的value属性或其它属性承载。
又例如,所述指针可由所述视频适配集元素VI中的虚拟Representation 元素的属性承载,或所述指针可由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符(BaseURL)元素。
又举例来说,所述指针也可由所述视频适配集元素VI中的媒体呈现指向(ReferencedMediaPresentation)元素来承载。ReferencedMediaPresentation元素是新扩展的一种元素,也就是说,可以利用所述视频适配集元素VI中的新扩展出的元素来承载所述指针,所述视频适配集元素VI中新扩展出的承载所述指针的元素的名称并不限于ReferencedMediaPresentation,也可以为其它的元素名称。
可选的,在本发明的一些可能的实施方式中,所述导览媒体呈现的时间结构可不依赖于所述导览媒体呈现中的所述N个导览单元所指向的主媒体呈现的时间结构。其中,导览单元的音频可以是通过对主媒体呈现的音频进行编码而得到,导览单元的视频可以是通过对主媒体呈现的视频进行编码而得到,这可使得导览单元的时间结构和主媒体呈现的时间结构之间没有相关性。
可以理解的是,本实施例的服务器700的各功能模块的功能可根据上述方法实施例中的方法具体实现,其具体实现过程可以参照上述方法实施例的相关描述,此处不再赘述。其中,服务器700可用于实施上述实施例提供的任意一种基于超文本传输协议媒体流的媒体呈现导览方法。
其中,服务器700可为内容服务器或其他服务器。
可以看出,本实施例的技术方案中,服务器700生成的导览媒体呈现的媒体呈现描述所描述的导览媒体呈现包括的N个导览单元,由于N个导览单元中的每个导览单元可以分别指向一个主媒体呈现,这样就相当于在导览单元和主媒体呈现之间引入的一定的关联关系,这使得客户端在所述N个导览单元的导览单元i被选择的情况下,所述客户端可获取与导览单元i指向的主媒体呈现j的媒体呈现描述,进而可以根据所述主媒体呈现j的媒体呈现描述获取所述主媒体呈现j进行呈现,可见这中方案为实现导览媒体呈现和主媒体呈现之间的较灵活切换奠定了基础,进而为实现在基于HTTP的媒体流服务场景下 支持视频导览奠定了基础。
参见图8,本发明实施例还提供一种通信系统,可包括:
客户端810和与所述客户端通信连接的内容服务器820;
其中,所述客户端810,用于从内容服务器820获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;根据所述导览媒体呈现的媒体呈现描述从内容服务器820获取所述N个导览单元中的K个导览单元;呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
其中,所述客户端810例如可为上述实施例提供的任意一种客户端。
上述装置和系统内的各模块之间的信息交互、执行过程等内容,由于与本发明方法实施例基于同一构思,具体内容可参见本发明方法实施例中的叙述,此处不再赘述。
本发明实施例还提供一种计算机存储介质,其中,该计算机存储介质可存储有程序,该程序执行时包括上述方法实施例中记载的任意一种方法的部分或全部步骤。
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,因为依据本发明,某些步骤可能可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本发明所必须的。
在本申请所提供的几个实施例中,应该理解到,所揭露的装置,可通过其它的方式实现。例如以上所描述的装置实施例仅仅是示意性的,例如上述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或者一些特征可以忽略 或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性或其它的形式。
上述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。
另外,在本发明各实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。
上述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以为个人计算机、服务器或者网络设备等,具体可以是计算机设备中的处理器)执行本发明各个实施例上述方法的全部或部分步骤。其中,而前述的存储介质可包括:U盘、移动硬盘、磁碟、光盘、只读存储器(ROM,Read-Only Memory)或者随机存取存储器(RAM,Random Access Memory)等各种可以存储程序代码的介质。
以上所述,以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。

Claims (104)

  1. 一种基于超文本传输协议媒体流的媒体呈现导览方法,其特征在于,包括:
    客户端获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;
    所述客户端根据所述导览媒体呈现的媒体呈现描述获取所述N个导览单元中的K个导览单元;
    所述客户端呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
  2. 根据权利要求1所述的方法,其特征在于,所述导览媒体呈现的媒体呈现描述不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
  3. 根据权利要求2所述的方法,其特征在于,所述K个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
  4. 根据权利要求1所述的方法,其特征在于,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
  5. 根据权利要求4所述的方法,其特征在于,
    所述K个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
  6. 根据权利要求1至5任一项所述的方法,其特征在于,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
  7. 根据权利要求6所述的方法,其特征在于,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒 体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。
  8. 根据权利要求7所述的方法,其特征在于,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性;
    或者,
    所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
  9. 根据权利要求8所述的方法,其特征在于,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
  10. 根据权利要求9所述的方法,其特征在于,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
  11. 根据权利要求9或10所述的方法,其特征在于,所述区域说明为SRD空间关系描述。
  12. 根据权利要求7至11任一项所述的方法,其特征在于,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应;其中,所述K个视频适配集元素中包括描述子元素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别schemeIdUri属性均相同。
  13. 根据权利要求12所述的方法,其特征在于,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达 为导览媒体呈现的组成部分。
  14. 根据权利要求12所述的方法,其特征在于,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
  15. 根据权利要求13或14所述的方法,其特征在于,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
  16. 根据权利要求15所述的方法,其特征在于,
    若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、方法识别schemeIdUri属性相同,且参数value属性相同。
  17. 根据权利要求5至16任一项所述的方法,其特征在于,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视频适配集之间一一对应,
    其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述K个视频适配集中任意一个视频适配集。
  18. 根据权利要求17所述的方法,其特征在于,
    所述指针由所述视频适配集元素VI的属性承载。
  19. 根据权利要求18所述的方法,其特征在于,所述指针由所述视频适配集元素VI的xlink:href属性承载。
  20. 根据权利要求17所述的方法,其特征在于,
    所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
  21. 根据权利要求17所述的方法,其特征在于,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所 述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
  22. 根据权利要求21所述的方法,其特征在于,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
  23. 根据权利要求17所述的方法,其特征在于,
    所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
  24. 根据权利要求17所述的方法,其特征在于,
    所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
  25. 根据权利要求1至24任意一项所述的方法,其特征在于,
    所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现的时间结构。
  26. 根据权利要求1至25任意一项所述的方法,其特征在于,
    在关注焦点停留在所述K个导览单元中的导览单元i的情况下,所述客户端呈现所述导览单元i的音频分量。
  27. 根据权利要求1至26任意一项所述的方法,其特征在于,
    所述方法还包括:
    在所述K个导览单元中的导览单元i被选择的情况下,所述客户端获取所述导览单元i所指向的主媒体呈现。
  28. 一种基于超文本传输协议媒体流的媒体呈现导览方法,其特征在于,包括:
    确定导览媒体呈现包括的N个导览单元;
    生成导览媒体呈现的媒体呈现描述,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;所述N个导览单元中的每个导览单元指向一个主媒体呈现,其中,所述N个导览 单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
  29. 根据权利要求28所述的方法,其特征在于,所述导览媒体呈现的媒体呈现描述不同于所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
  30. 根据权利要求29所述的方法,其特征在于,所述N个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
  31. 根据权利要求28所述的方法,其特征在于,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
  32. 根据权利要求31所述的方法,其特征在于,
    所述N个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
  33. 根据权利要求28至32任一项所述的方法,其特征在于,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
  34. 根据权利要求33所述的方法,其特征在于,所述N个导览单元中的不同导览单元所包括的视频分量为N个视频适配集中的不同视频适配集中的媒体表达,其中,所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述N个视频适配集中的不同视频适配集之间具有选择相容性。
  35. 根据权利要求34所述的方法,其特征在于,所述N个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述N个适配集中的任意一个适配集,所述音频分量适配集与所述N个视频适配集之间具有选择相容性;
    或者,
    所述N个导览单元中的不同导览单元所包括的音频分量为N个音频适配 集中的不同音频适配集中的媒体表达,其中,所述N个音频适配集中的不同音频适配集之间具有选择互斥性。
  36. 根据权利要求35所述的方法,其特征在于,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
  37. 根据权利要求36所述的方法,其特征在于,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
  38. 根据权利要求36或37所述的方法,其特征在于,所述区域说明为SRD空间关系描述。
  39. 根据权利要求34至38任一项所述的方法,其特征在于,所述导览媒体呈现的媒体呈现描述中包括N个视频适配集元素,所述N个视频适配集元素与所述N个视频适配集之间一一对应;其中,所述N个视频适配集元素中包括描述子元素Ci,所述N个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和方法识别schemeIdUri属性均相同。
  40. 根据权利要求39所述的方法,其特征在于,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
  41. 根据权利要求39所述的方法,其特征在于,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
  42. 根据权利要求40或41所述的方法,其特征在于,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
  43. 根据权利要求42所述的方法,其特征在于,
    若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配 集元素所包括的描述子元素Ci的元素名称相同、方法识别schemeIdUri属性相同,且参数value属性相同。
  44. 根据权利要求32至43任一项所述的方法,其特征在于,所述导览媒体呈现的媒体呈现描述中包括所述N个视频适配集元素,所述N个视频适配集元素与N个视频适配集之间一一对应,
    其中,所述N个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述N个视频适配集中任意一个视频适配集。
  45. 根据权利要求44所述的方法,其特征在于,
    所述指针由所述视频适配集元素VI的属性承载。
  46. 根据权利要求45所述的方法,其特征在于,所述指针由所述视频适配集元素VI的xlink:href属性承载。
  47. 根据权利要求44所述的方法,其特征在于,
    所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
  48. 根据权利要求44所述的方法,其特征在于,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
  49. 根据权利要求48所述的方法,其特征在于,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
  50. 根据权利要求44所述的方法,其特征在于,
    所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
  51. 根据权利要求44所述的方法,其特征在于,所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
  52. 根据权利要求28至51任意一项所述的方法,其特征在于,
    所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述N个导览单元所指向的主媒体呈现的时间结构。
  53. 一种客户端,其特征在于,包括:
    第一获取单元,用于获取导览媒体呈现的媒体呈现描述,其中,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于1的整数;
    第二获取单元,用于根据所述导览媒体呈现的媒体呈现描述获取所述N个导览单元中的K个导览单元;
    呈现单元,用于呈现所述K个导览单元,所述K个导览单元中的每个导览单元指向一个主媒体呈现,其中,K个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
  54. 根据权利要求53所述的客户端,其特征在于,所述导览媒体呈现的媒体呈现描述不同于所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
  55. 根据权利要求54所述的客户端,其特征在于,所述K个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
  56. 根据权利要求53所述的客户端,其特征在于,所述导览媒体呈现的媒体呈现描述与所述K个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
  57. 根据权利要求56所述的客户端,其特征在于,
    所述K个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
  58. 根据权利要求53至57任一项所述的客户端,其特征在于,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每 个导览单元包括音频分量和视频分量。
  59. 根据权利要求58所述的客户端,其特征在于,所述K个导览单元中的不同导览单元所包括的视频分量为K个视频适配集中的不同视频适配集中的媒体表达,其中,所述K个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述K个视频适配集中的不同视频适配集之间具有选择相容性。
  60. 根据权利要求59所述的客户端,其特征在于,所述K个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述K个适配集中的任意一个适配集,所述音频分量适配集与所述K个视频适配集之间具有选择相容性;
    或者,
    所述K个导览单元中的不同导览单元所包括的音频分量为K个音频适配集中的不同音频适配集中的媒体表达,其中,所述K个音频适配集中的不同音频适配集之间具有选择互斥性。
  61. 根据权利要求60所述的客户端,其特征在于,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
  62. 根据权利要求61所述的客户端,其特征在于,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
  63. 根据权利要求61或62所述的客户端,其特征在于,所述区域说明为SRD空间关系描述。
  64. 根据权利要求58至63任一项所述的客户端,其特征在于,所述导览媒体呈现的媒体呈现描述中包括K个视频适配集元素,所述K个视频适配集元素与所述K个视频适配集之间一一对应;其中,所述K个视频适配集元素中包括描述子元素Ci,所述K个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和客户端识别schemeIdUri 属性均相同。
  65. 根据权利要求64所述的客户端,其特征在于,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
  66. 根据权利要求64所述的客户端,其特征在于,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
  67. 根据权利要求65或66所述的客户端,其特征在于,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
  68. 根据权利要求67所述的客户端,其特征在于,
    若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、客户端识别schemeIdUri属性相同,且参数value属性相同。
  69. 根据权利要求57至68任一项所述的客户端,其特征在于,所述导览媒体呈现的媒体呈现描述中包括所述K个视频适配集元素,所述K个视频适配集元素与K个视频适配集之间一一对应,
    其中,所述K个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述K个视频适配集中任意一个视频适配集。
  70. 根据权利要求69所述的客户端,其特征在于,
    所述指针由所述视频适配集元素VI的属性承载。
  71. 根据权利要求70所述的客户端,其特征在于,所述指针由所述视频适配集元素VI的xlink:href属性承载。
  72. 根据权利要求70所述的客户端,其特征在于,
    所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
  73. 根据权利要求70所述的客户端,其特征在于,所述指针由所述视频 适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
  74. 根据权利要求73所述的客户端,其特征在于,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
  75. 根据权利要求70所述的客户端,其特征在于,
    所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
  76. 根据权利要求70所述的客户端,其特征在于,
    所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
  77. 根据权利要求53至76任意一项所述的客户端,其特征在于,
    所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述K个导览单元所指向的主媒体呈现的时间结构。
  78. 根据权利要求53至77任意一项所述的客户端,其特征在于,
    所述呈现单元还用于,在关注焦点停留在所述K个导览单元中的导览单元i的情况下,呈现所述导览单元i的音频分量。
  79. 根据权利要求53至78任意一项所述的客户端,其特征在于,所述呈现单元还用于,在所述K个导览单元中的导览单元i被选择的情况下,获取所述导览单元i所指向的主媒体呈现。
  80. 一种服务器,其特征在于,包括:
    确定单元,用于确定导览媒体呈现包括的N个导览单元;
    生成单元,用于生成导览媒体呈现的媒体呈现描述,所述导览媒体呈现的媒体呈现描述描述了所述导览媒体呈现包括的N个导览单元,所述N为大于 1的整数;所述N个导览单元中的每个导览单元指向一个主媒体呈现,所述N个导览单元中的导览单元i所指向的主媒体呈现的呈现质量高于所述导览单元i的呈现质量。
  81. 根据权利要求80所述的服务器,其特征在于,所述导览媒体呈现的媒体呈现描述不同于所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述。
  82. 根据权利要求81所述的服务器,其特征在于,所述N个导览单元中的每个导览单元以指向媒体呈现描述的方式来指向该媒体呈现描述所描述的主媒体呈现。
  83. 根据权利要求80所述的服务器,其特征在于,所述导览媒体呈现的媒体呈现描述与所述N个导览单元中的每个导览单元所指向主媒体呈现的媒体呈现描述被聚合形成了一个聚合媒体呈现描述。
  84. 根据权利要求82所述的服务器,其特征在于,
    所述N个导览单元中的每个导览单元以引用所述聚合媒体呈现描述中的呈现元素的方式来指向一个主媒体呈现。
  85. 根据权利要求80至84任一项所述的服务器,其特征在于,所述N个导览单元中的每个导览单元包括视频分量,或者所述N个导览单元中的每个导览单元包括音频分量和视频分量。
  86. 根据权利要求85所述的服务器,其特征在于,所述N个导览单元中的不同导览单元所包括的视频分量为N个视频适配集中的不同视频适配集中的媒体表达,其中,所述N个视频适配集中的任意一个视频适配集中的媒体表达之间具有选择互斥性,所述N个视频适配集中的不同视频适配集之间具有选择相容性。
  87. 根据权利要求86所述的服务器,其特征在于,所述N个导览单元包括的音频分量为音频适配集中的媒体表达,所述音频适配集不同于所述N个适配集中的任意一个适配集,所述音频分量适配集与所述N个视频适配集之间具有选择相容性;
    或者,
    所述N个导览单元中的不同导览单元所包括的音频分量为N个音频适配集中的不同音频适配集中的媒体表达,其中,所述N个音频适配集中的不同音频适配集之间具有选择互斥性。
  88. 根据权利要求87所述的服务器,其特征在于,所述音频适配集元素中的媒体表达元素,包含其所描述的媒体表达在导览媒体呈现中的关联区域的区域说明。
  89. 根据权利要求88所述的服务器,其特征在于,包含相同区域说明的媒体表达元素所描述的媒体表达之间具有关联关系,或者,包含相同区域说明的适配集元素所描述的适配集之间具有关联关系。
  90. 根据权利要求88或89所述的服务器,其特征在于,所述区域说明为SRD空间关系描述。
  91. 根据权利要求86至90任一项所述的服务器,其特征在于,所述导览媒体呈现的媒体呈现描述中包括N个视频适配集元素,所述N个视频适配集元素与所述N个视频适配集之间一一对应;其中,所述N个视频适配集元素中包括描述子元素Ci,所述N个视频适配集元素中的满足设定共性条件视频适配集元素所描述的视频适配集之间具有选择相容性,所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称和服务器识别schemeIdUri属性均相同。
  92. 根据权利要求91所述的服务器,其特征在于,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素所描述的视频适配集中的媒体表达为导览媒体呈现的组成部分。
  93. 根据权利要求91所述的服务器,其特征在于,所述描述子元素Ci描述了包括该描述子元素Ci的视频适配集元素对应的视频适配集中的媒体表达在导览媒体呈现的角色。
  94. 根据权利要求92或93所述的服务器,其特征在于,所述描述子元素Ci为作用说明Role元素或者基本属性EssentialProptery元素或者补充属性SupplementalProptery元素。
  95. 根据权利要求94所述的服务器,其特征在于,
    若描述子元素Ci为作用说明Role元素,则所述设定共性条件为视频适配集元素所包括的描述子元素Ci的元素名称相同、服务器识别schemeIdUri属性相同,且参数value属性相同。
  96. 根据权利要求84至95任一项所述的服务器,其特征在于,所述导览媒体呈现的媒体呈现描述中包括所述N个视频适配集元素,所述N个视频适配集元素与N个视频适配集之间一一对应,
    其中,所述N个视频适配集元素中的与视频适配集I对应的视频适配集元素VI包括用于指向一个主媒体呈现的指针,所述视频适配集I为所述N个视频适配集中任意一个视频适配集。
  97. 根据权利要求96所述的服务器,其特征在于,
    所述指针由所述视频适配集元素VI的属性承载。
  98. 根据权利要求95所述的服务器,其特征在于,所述指针由所述视频适配集元素VI的xlink:href属性承载。
  99. 根据权利要求96所述的服务器,其特征在于,
    所述指针由所述视频适配集元素VI中的基本属性EssentialProptery元素或补充属性SupplementalProperty元素承载。
  100. 根据权利要求96所述的服务器,其特征在于,所述指针由所述视频适配集元素VI之中的EssentialProptery元素中的子元素承载,或所述指针由所述视频适配集元素VI中的EssentialProptery元素的属性承载;或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素中的子元素承载,或者所述指针由所述视频适配集元素VI中的SupplementalProperty元素的属性承载。
  101. 根据权利要求100所述的服务器,其特征在于,所述指针由所述视频适配集元素VI中的EssentialProptery元素的value属性承载,或者所述指针由所述视频适配集元素VI之中的SupplementalProperty元素的value属性承载。
  102. 根据权利要求96所述的服务器,其特征在于,
    所述指针由所述视频适配集元素VI中的虚拟媒体表达Representation元素的属性承载,或,所述指针由所述视频适配集元素VI中的虚拟Representation元素中的子元素承载,其中,所述虚拟Representation元素不包括媒体片段模 版元素、媒体片段列表元素和基础统一资源定位符BaseURL元素。
  103. 根据权利要求96所述的服务器,其特征在于,所述指针由所述视频适配集元素VI中的媒体呈现指向ReferencedMediaPresentation元素来承载。
  104. 根据权利要求80至103任意一项所述的服务器,其特征在于,
    所述导览媒体呈现的时间结构不依赖于所述导览媒体呈现中的所述N个导览单元所指向的主媒体呈现的时间结构。
PCT/CN2015/073148 2015-02-15 2015-02-15 基于超文本传输协议媒体流的媒体呈现导览方法和相关装置 WO2016127440A1 (zh)

Priority Applications (6)

Application Number Priority Date Filing Date Title
KR1020177025344A KR101919726B1 (ko) 2015-02-15 2015-02-15 하이퍼텍스트 전송 프로토콜 미디어 스트림에 기초한 미디어 프레젠테이션 가이드 방법 및 관련 장치
CN201580038222.5A CN106664299B (zh) 2015-02-15 2015-02-15 基于超文本传输协议媒体流的媒体呈现导览方法和相关装置
JP2017542417A JP6478357B2 (ja) 2015-02-15 2015-02-15 メディアストリーミング・オーバー・ハイパーテキストトランスファープロトコルにおけるメディアプレゼンテーションガイドを提供するための方法及び関連する装置
PCT/CN2015/073148 WO2016127440A1 (zh) 2015-02-15 2015-02-15 基于超文本传输协议媒体流的媒体呈现导览方法和相关装置
EP15881602.5A EP3249873B1 (en) 2015-02-15 2015-02-15 Media presentation guide method based on hyper text transport protocol media stream and related device
US15/677,436 US20170374122A1 (en) 2015-02-15 2017-08-15 Method and Related Apparatus for Providing Media Presentation Guide in Media Streaming Over Hypertext Transfer Protocol

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2015/073148 WO2016127440A1 (zh) 2015-02-15 2015-02-15 基于超文本传输协议媒体流的媒体呈现导览方法和相关装置

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/677,436 Continuation US20170374122A1 (en) 2015-02-15 2017-08-15 Method and Related Apparatus for Providing Media Presentation Guide in Media Streaming Over Hypertext Transfer Protocol

Publications (1)

Publication Number Publication Date
WO2016127440A1 true WO2016127440A1 (zh) 2016-08-18

Family

ID=56615026

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/073148 WO2016127440A1 (zh) 2015-02-15 2015-02-15 基于超文本传输协议媒体流的媒体呈现导览方法和相关装置

Country Status (6)

Country Link
US (1) US20170374122A1 (zh)
EP (1) EP3249873B1 (zh)
JP (1) JP6478357B2 (zh)
KR (1) KR101919726B1 (zh)
CN (1) CN106664299B (zh)
WO (1) WO2016127440A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110431848A (zh) * 2017-03-24 2019-11-08 索尼公司 内容提供系统、内容提供方法和程序

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016133296A1 (ko) * 2015-02-16 2016-08-25 엘지전자 주식회사 방송 신호 송신 장치, 방송 신호 수신 장치, 방송 신호 송신 방법, 및 방송 신호 수신 방법
WO2020050550A1 (en) * 2018-09-03 2020-03-12 Samsung Electronics Co., Ltd. Methods and systems for performing editing operations on media
US11895173B2 (en) * 2022-01-07 2024-02-06 Avago Technologies International Sales Pte. Limited Gapped and/or subsegmented adaptive bitrate streams

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102137137A (zh) * 2010-09-17 2011-07-27 华为技术有限公司 基于http流的媒体内容动态插播方法、装置及系统
US20140013003A1 (en) * 2012-07-09 2014-01-09 Futurewei Technologies, Inc. Content-Specific Identification and Timing Behavior in Dynamic Adaptive Streaming over Hypertext Transfer Protocol
CN103974147A (zh) * 2014-03-07 2014-08-06 北京邮电大学 一种基于mpeg-dash协议的带有码率切换控制和静态摘要技术的在线视频播控系统

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8677005B2 (en) * 2009-11-04 2014-03-18 Futurewei Technologies, Inc. System and method for media content streaming
CN102055789B (zh) * 2009-11-09 2013-10-09 华为技术有限公司 实现基于http的流媒体业务的方法、系统和网络设备
CN102055773B (zh) * 2009-11-09 2013-10-09 华为技术有限公司 实现基于http的流媒体业务的方法、系统和网络设备
KR101709903B1 (ko) * 2010-02-19 2017-02-23 텔레폰악티에볼라겟엘엠에릭슨(펍) 에이치티티피 스트리밍에서 적응화를 위한 방법 및 장치
US8468262B2 (en) * 2010-11-01 2013-06-18 Research In Motion Limited Method and apparatus for updating http content descriptions
CN109600632B (zh) * 2011-10-13 2020-12-25 三星电子株式会社 用于发送和接收多媒体服务的方法和装置
US9712874B2 (en) * 2011-12-12 2017-07-18 Lg Electronics Inc. Device and method for receiving media content
EP3018912B1 (en) * 2013-07-02 2018-09-12 Sony Corporation Content provision device, content provision method, program, terminal device, and content provision system
US20160373496A1 (en) * 2013-07-02 2016-12-22 Sony Corporation Content supply device, content supply method, program, terminal device, and content supply system
EP3020208B1 (en) * 2013-07-12 2022-03-09 Canon Kabushiki Kaisha Adaptive data streaming with push messages control
JP6493765B2 (ja) * 2013-07-19 2019-04-03 ソニー株式会社 情報処理装置および方法
US20150026358A1 (en) * 2013-07-19 2015-01-22 Futurewei Technologies, Inc. Metadata Information Signaling And Carriage In Dynamic Adaptive Streaming Over Hypertext Transfer Protocol
KR20160077067A (ko) * 2013-10-30 2016-07-01 소니 주식회사 송신 장치, 송신 방법, 수신 장치, 및 수신 방법

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102137137A (zh) * 2010-09-17 2011-07-27 华为技术有限公司 基于http流的媒体内容动态插播方法、装置及系统
US20140013003A1 (en) * 2012-07-09 2014-01-09 Futurewei Technologies, Inc. Content-Specific Identification and Timing Behavior in Dynamic Adaptive Streaming over Hypertext Transfer Protocol
CN103974147A (zh) * 2014-03-07 2014-08-06 北京邮电大学 一种基于mpeg-dash协议的带有码率切换控制和静态摘要技术的在线视频播控系统

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"ISO/IEC 23009-1", PART1: MEDIA PRESENTATION DESCRIPTION AND SEGMENT FORMATS, 15 May 2014 (2014-05-15), pages 16 - 82, XP055214031 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110431848A (zh) * 2017-03-24 2019-11-08 索尼公司 内容提供系统、内容提供方法和程序
CN110431848B (zh) * 2017-03-24 2021-12-21 索尼公司 内容提供系统、内容提供方法和程序

Also Published As

Publication number Publication date
KR101919726B1 (ko) 2018-11-16
EP3249873B1 (en) 2018-09-12
EP3249873A1 (en) 2017-11-29
US20170374122A1 (en) 2017-12-28
KR20170116116A (ko) 2017-10-18
CN106664299B (zh) 2020-01-17
EP3249873A4 (en) 2017-11-29
CN106664299A (zh) 2017-05-10
JP2018510552A (ja) 2018-04-12
JP6478357B2 (ja) 2019-03-06

Similar Documents

Publication Publication Date Title
US10187668B2 (en) Method, system and server for live streaming audio-video file
US9294728B2 (en) System and method for routing content
WO2019024919A1 (zh) 视频转码方法及其装置、服务器、可读存储介质
CN105681912A (zh) 一种视频播放方法和装置
JP2020519094A (ja) ビデオ再生方法、デバイス、およびシステム
WO2018014691A1 (zh) 一种媒体数据的获取方法和装置
CN107888993B (zh) 一种视频数据的处理方法及装置
IL230273A (en) Transmission of reconstruction data in a layered signal quality hierarchy
WO2016127440A1 (zh) 基于超文本传输协议媒体流的媒体呈现导览方法和相关装置
WO2021143360A1 (zh) 资源传输方法及计算机设备
WO2016202225A1 (zh) 内容项聚合方法和相关装置及通信系统
CN105142012A (zh) 智能电视直播频道列表获取、频道切换及同屏观看的方法
CN109068169A (zh) 一种视频播放方法及装置
CN109587478A (zh) 一种媒体信息的处理方法及装置
US20110200093A1 (en) Method and apparatus for transmitting and receiving video and video links
WO2008103364A1 (en) Systems and methods for sending, receiving and processing multimedia bookmarks
US10637904B2 (en) Multimedia streaming service presentation method, related apparatus, and related system
CN104185033A (zh) 一种电视多画面的处理方法、装置及系统
Kaiser et al. MPEG-DASH enabling adaptive streaming with personalized commercial breaks and second screen scenarios
WO2019188485A1 (ja) 情報処理装置、情報処理装置およびプログラム
Marfil et al. Enhancing the broadcasted TV consumption experience with broadband omnidirectional video content
Cheong et al. Interactive terrestrial digital multimedia broadcasting (T-DMB) player
WO2019176590A1 (ja) 情報処理装置、情報処理装置およびプログラム
EP2744215A1 (en) Method for streaming AV content and method for presenting AV content
JP2016533673A (ja) 隠し広告のための方法、装置、およびシステム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15881602

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2017542417

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2015881602

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 20177025344

Country of ref document: KR

Kind code of ref document: A