WO2016127440A1 - Media presentation navigation method and related apparatus based on hypertext transfer protocol media streams
- Publication number: WO2016127440A1 (PCT/CN2015/073148)
- Authority: WO, WIPO (PCT)
- Prior art keywords: navigation, media presentation, media, adaptation set, video adaptation
Classifications
- H04L65/764: Media network packet handling at the destination
- H04L67/02: Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
- H04L9/40: Network security protocols
- H04N21/23109: Content storage operation, e.g. caching movies for short term storage, replicating data over plural servers, prioritizing data for deletion by placing content in organized collections, e.g. EPG data repository
- H04N21/233: Processing of audio elementary streams
- H04N21/234: Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/26283: Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists for associating distribution time parameters to content, e.g. to generate electronic program guide data
- H04N21/42209: Display device provided on the remote control for displaying non-command information, e.g. electronic program guide [EPG], e-mail, messages or a second television channel
- H04N21/439: Processing of audio elementary streams
- H04N21/44: Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/482: End-user interface for program selection
- H04N21/4825: End-user interface for program selection using a list of items to be played back in a given order, e.g. playlists
- H04N21/6125: Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via Internet
- H04N21/643: Communication protocols
- H04N21/64322: IP
- H04N21/8456: Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
- H04N21/8549: Creating video summaries, e.g. movie trailer
- H04N21/8586: Linking data to content, e.g. by linking an URL to a video object, by creating a hotspot by using a URL
Definitions
- the present invention relates to the field of data transmission, and in particular to a media presentation navigation method and related apparatus based on a hypertext transfer protocol media stream.
- HTTP: Hypertext Transfer Protocol
- the present invention provides a method and related apparatus for providing navigation media presentation based on a hypertext transfer protocol media stream, in order to support video navigation in an HTTP-based media streaming service scenario, thereby improving user experience.
- a first aspect of the embodiments of the present invention provides a media presentation navigation method based on a hypertext transfer protocol media stream, which may include:
- the client obtains a media presentation description of the navigation media presentation, wherein the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, and the N is an integer greater than one;
- the client acquires K navigation units of the N navigation units according to the media presentation description of the navigation media presentation;
- the client presents the K navigation units, where each of the K navigation units points to a primary media presentation, and the presentation quality of the primary media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i.
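The three client steps above (obtain the description, acquire K of the N units, present them) can be illustrated with a minimal Python sketch. The class name `NavigationUnit`, its fields, the integer quality scale, and the example URLs are all hypothetical; the source does not define a data model or say how K is chosen, so the sketch simply takes the first K units.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class NavigationUnit:
    """One navigation unit as described by the navigation MPD (hypothetical model)."""
    unit_id: str
    quality: int            # presentation quality of the unit itself, e.g. picture height
    primary_mpd_url: str    # pointer to the primary media presentation
    primary_quality: int    # presentation quality of the primary media presentation


def choose_units(units: List[NavigationUnit], k: int) -> List[NavigationUnit]:
    """Acquire K of the N described units; assumption: the first K in description order."""
    if not 1 <= k <= len(units):
        raise ValueError("K must be between 1 and N")
    return units[:k]


def check_quality_rule(units: List[NavigationUnit]) -> bool:
    """True when every primary presentation has higher quality than its navigation unit."""
    return all(u.primary_quality > u.quality for u in units)


# N = 3 units described by the navigation MPD (illustrative values only).
described = [
    NavigationUnit("nu1", 180, "https://example.com/ch1.mpd", 1080),
    NavigationUnit("nu2", 180, "https://example.com/ch2.mpd", 720),
    NavigationUnit("nu3", 180, "https://example.com/ch3.mpd", 1080),
]
selected = choose_units(described, k=2)
```

A real client would fetch and render the media segments of each selected unit; the sketch only captures the selection step and the quality relation between a unit and the presentation it points to.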
- the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the K navigation units.
- each of the K navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units are aggregated to form an aggregated media presentation description.
- each of the K navigation units points to a primary media presentation by referencing an element in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- the video components included in different navigation units of the K navigation units are media representations in different video adaptation sets among K video adaptation sets, where the media representations within any one of the K video adaptation sets are mutually exclusive in selection, and different video adaptation sets among the K video adaptation sets are compatible in selection.
- the audio components included in the K navigation units are media representations in one audio adaptation set, the audio adaptation set is different from any of the K video adaptation sets, and the audio adaptation set is compatible in selection with the K video adaptation sets; or
- the audio components included in different navigation units of the K navigation units are media representations in different audio adaptation sets among K audio adaptation sets, where different audio adaptation sets among the K audio adaptation sets are mutually exclusive in selection.
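The two selection rules for video (at most one representation per adaptation set, while different adaptation sets may be selected together) can be checked with a short Python sketch. The function name, the dictionary layout, and the identifiers are hypothetical and only illustrate the rule as stated above.

```python
from typing import Dict, List


def valid_selection(adaptation_sets: Dict[str, List[str]], chosen: List[str]) -> bool:
    """Check chosen representation ids against the two selection rules.

    - Representations inside one adaptation set are mutually exclusive.
    - Representations from different adaptation sets may be chosen together.
    """
    # Map every representation id to the adaptation set that owns it.
    owner = {rep: aset for aset, reps in adaptation_sets.items() for rep in reps}
    used_sets = set()
    for rep in chosen:
        aset = owner.get(rep)
        if aset is None or aset in used_sets:
            return False  # unknown id, or a second pick from the same set
        used_sets.add(aset)
    return True


# Illustrative layout: two video adaptation sets and one audio adaptation set.
sets = {
    "video-1": ["v1-low", "v1-mid"],
    "video-2": ["v2-low"],
    "audio": ["a-main"],
}
ok = valid_selection(sets, ["v1-low", "v2-low", "a-main"])
clash = valid_selection(sets, ["v1-low", "v1-mid"])
```

Here `ok` is true because each pick comes from a different adaptation set, and `clash` is false because both picks come from `video-1`.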
- a media representation element in the audio adaptation set element includes a region description of the region with which the described media representation is associated in the navigation media presentation.
- media representations described by media representation elements that include the same region description are associated with one another, or adaptation sets described by adaptation set elements that include the same region description are associated with one another.
- the region description is a spatial relationship description (SRD).
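As a rough illustration of associating elements through identical region descriptions, the Python sketch below groups element ids by their region. It assumes the region description is carried as a comma-separated SRD-style value string whose first five fields are `source_id, object_x, object_y, object_width, object_height`; that layout, the function names, and the sample values are assumptions for illustration and are not taken from the source.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Region = Tuple[int, ...]


def parse_region(srd_value: str) -> Region:
    """Parse the leading five integer fields of an SRD-style value string (assumed layout)."""
    fields = [int(part) for part in srd_value.split(",")]
    if len(fields) < 5:
        raise ValueError("expected at least five comma-separated integers")
    return tuple(fields[:5])


def associate(descriptions: Dict[str, str]) -> List[List[str]]:
    """Group element ids whose region descriptions are identical."""
    groups: Dict[Region, List[str]] = defaultdict(list)
    for element_id, value in descriptions.items():
        groups[parse_region(value)].append(element_id)
    # Only groups with two or more members express an association.
    return [ids for ids in groups.values() if len(ids) > 1]


associated = associate({
    "video-1": "0,0,0,640,360,1280,720",
    "audio-1": "0,0,0,640,360,1280,720",
    "video-2": "0,640,0,640,360,1280,720",
})
```

In this example `video-1` and `audio-1` share the same region and are therefore treated as associated, which is how an audio component could be matched to the video tile it accompanies.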
- the media presentation description of the navigation media presentation includes K video adaptation set elements in one-to-one correspondence with the K video adaptation sets; each of the K video adaptation set elements includes a descriptor element Ci, and the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition are compatible in selection, the set commonality condition being that the descriptor elements Ci included in the video adaptation set elements have the same element name and the same scheme identifier schemeIdUri attribute.
- the descriptor element Ci indicates that the media representations in the video adaptation set described by the video adaptation set element that includes the descriptor element Ci are part of the navigation media presentation.
- the descriptor element Ci describes the role, in the navigation media presentation, of the media representations in the video adaptation set described by the video adaptation set element that includes the descriptor element Ci.
- the descriptor element Ci is a Role element, an essential property EssentialProperty element, or a supplemental property SupplementalProperty element.
- the set commonality condition is that the descriptor elements Ci included in the video adaptation set elements have the same element name, the same scheme identifier schemeIdUri attribute, and the same parameter value attribute.
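The set commonality condition lends itself to a small Python sketch using the standard `xml.etree.ElementTree` module: adaptation sets are grouped by the descriptor's element name and `schemeIdUri`, optionally also by `value`. The sample MPD, the scheme URI `urn:example:nav`, and the function name are hypothetical; only the grouping keys come from the text above.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict
from typing import List

MPD_NS = "urn:mpeg:dash:schema:mpd:2011"
DESCRIPTOR_NAMES = ("Role", "EssentialProperty", "SupplementalProperty")

# Hypothetical navigation MPD fragment; the scheme URI is made up for illustration.
SAMPLE = f"""<MPD xmlns="{MPD_NS}">
  <Period>
    <AdaptationSet id="1"><Role schemeIdUri="urn:example:nav" value="tile"/></AdaptationSet>
    <AdaptationSet id="2"><Role schemeIdUri="urn:example:nav" value="tile"/></AdaptationSet>
    <AdaptationSet id="3"><Role schemeIdUri="urn:example:nav" value="banner"/></AdaptationSet>
  </Period>
</MPD>"""


def compatible_groups(mpd_xml: str, match_value: bool = True) -> List[List[str]]:
    """Group adaptation set ids whose descriptors satisfy the commonality condition."""
    root = ET.fromstring(mpd_xml)
    groups = defaultdict(list)
    for aset in root.iter(f"{{{MPD_NS}}}AdaptationSet"):
        for child in aset:
            name = child.tag.split("}")[-1]  # strip the namespace from the tag
            if name not in DESCRIPTOR_NAMES:
                continue
            key = (name, child.get("schemeIdUri"))
            if match_value:
                key += (child.get("value"),)  # stricter variant: value must match too
            groups[key].append(aset.get("id"))
    return [ids for ids in groups.values() if len(ids) > 1]


strict = compatible_groups(SAMPLE, match_value=True)
loose = compatible_groups(SAMPLE, match_value=False)
```

With the stricter condition only sets 1 and 2 are grouped; when `value` is ignored, all three sets share the same element name and `schemeIdUri` and fall into one group.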
- the media presentation description of the navigation media presentation includes K video adaptation set elements in one-to-one correspondence with the K video adaptation sets, where the video adaptation set element VI that corresponds to video adaptation set I includes a pointer to a primary media presentation, and video adaptation set I is any one of the K video adaptation sets.
- the pointer is carried by an attribute of the video adaptation set element VI.
- the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
- the pointer is carried by an essential property EssentialProperty element or a supplemental property SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by a value attribute of an EssentialProperty element in the video adaptation set element VI, or
- the pointer is carried by a value attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by an attribute of a virtual media representation Representation element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, where the virtual Representation element includes none of a media segment template element, a media segment list element, and a base uniform resource locator BaseURL element.
- the pointer is carried by a referenced media presentation ReferencedMediaPresentation element in the video adaptation set element VI.
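Several of the pointer carriers listed above can be probed in sequence by a client. The Python sketch below checks the `xlink:href` attribute, then the `value` attribute of an EssentialProperty or SupplementalProperty element, then a ReferencedMediaPresentation child. The scheme URI `urn:example:navigation:pointer`, the probing order, and the choice to store the URL as the text of ReferencedMediaPresentation are assumptions for illustration; the virtual Representation variant is left out for brevity.

```python
import xml.etree.ElementTree as ET
from typing import Optional

MPD_NS = "urn:mpeg:dash:schema:mpd:2011"
XLINK_HREF = "{http://www.w3.org/1999/xlink}href"
POINTER_SCHEME = "urn:example:navigation:pointer"  # hypothetical scheme identifier

# Hypothetical navigation MPD fragment showing three carrier variants.
SAMPLE = f"""<MPD xmlns="{MPD_NS}" xmlns:xlink="http://www.w3.org/1999/xlink">
  <Period>
    <AdaptationSet id="1" xlink:href="https://example.com/ch1.mpd"/>
    <AdaptationSet id="2">
      <SupplementalProperty schemeIdUri="{POINTER_SCHEME}" value="https://example.com/ch2.mpd"/>
    </AdaptationSet>
    <AdaptationSet id="3">
      <ReferencedMediaPresentation>https://example.com/ch3.mpd</ReferencedMediaPresentation>
    </AdaptationSet>
  </Period>
</MPD>"""


def find_pointer(aset: ET.Element) -> Optional[str]:
    """Return the primary-presentation pointer of one adaptation set element, if any."""
    href = aset.get(XLINK_HREF)  # variant: attribute of the adaptation set element
    if href:
        return href
    for name in ("EssentialProperty", "SupplementalProperty"):
        for prop in aset.findall(f"{{{MPD_NS}}}{name}"):
            if prop.get("schemeIdUri") == POINTER_SCHEME:
                return prop.get("value")  # variant: value attribute of a property element
    ref = aset.find(f"{{{MPD_NS}}}ReferencedMediaPresentation")
    if ref is not None and ref.text:
        return ref.text.strip()  # variant: dedicated child element (text content assumed)
    return None


root = ET.fromstring(SAMPLE)
pointers = {a.get("id"): find_pointer(a) for a in root.iter(f"{{{MPD_NS}}}AdaptationSet")}
```

After running the sketch, `pointers` maps each adaptation set id to the URL of the media presentation description it points to.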
- the temporal structure of the navigation media presentation does not depend on the temporal structure of the primary media presentations pointed to by the K navigation units in the navigation media presentation.
- the method further includes: when the focus stays on navigation unit i of the K navigation units, the client presents the audio component of navigation unit i.
- the method further includes: when navigation unit i of the K navigation units is selected, the client acquires the primary media presentation pointed to by navigation unit i.
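The focus and selection behaviour can be sketched as two event handlers in Python. The class `NavigationSession`, its method names, and the idea of recording requested URLs instead of performing HTTP requests are hypothetical simplifications.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class NavigationSession:
    """Tracks user interaction with the K presented navigation units (hypothetical)."""
    pointers: Dict[str, str]               # unit id -> URL of the primary MPD
    audio_playing: Optional[str] = None    # unit whose audio component is presented
    requested: List[str] = field(default_factory=list)

    def on_focus(self, unit_id: str) -> None:
        """Focus rests on a unit: present that unit's audio component."""
        if unit_id not in self.pointers:
            raise KeyError(unit_id)
        self.audio_playing = unit_id

    def on_select(self, unit_id: str) -> str:
        """A unit is selected: acquire the primary media presentation it points to."""
        url = self.pointers[unit_id]
        self.requested.append(url)  # a real client would issue an HTTP GET here
        return url


session = NavigationSession({"nu1": "https://example.com/ch1.mpd",
                             "nu2": "https://example.com/ch2.mpd"})
session.on_focus("nu2")
chosen_url = session.on_select("nu2")
```

Keeping focus and selection as separate handlers mirrors the text: focus only switches which audio is heard, while selection triggers retrieval of the higher-quality primary presentation.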
- a second aspect of the embodiments of the present invention provides a media presentation navigation method based on a hypertext transfer protocol media stream, including:
- the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, N being an integer greater than 1; each of the N navigation units points to a primary media presentation, where the presentation quality of the primary media presentation pointed to by navigation unit i of the N navigation units is higher than the presentation quality of navigation unit i.
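Assuming the second aspect concerns the side that produces the description, the Python sketch below builds a skeletal navigation media presentation description with one adaptation set per navigation unit, each carrying its pointer in a SupplementalProperty element. The function name, the scheme URI, and the carrier choice are hypothetical, and the output omits Representation elements and other parts a complete description would need.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple

MPD_NS = "urn:mpeg:dash:schema:mpd:2011"
POINTER_SCHEME = "urn:example:navigation:pointer"  # hypothetical scheme identifier


def build_navigation_mpd(units: List[Tuple[str, str]]) -> str:
    """Serialize a skeletal navigation MPD.

    `units` holds (adaptation set id, primary MPD URL) pairs, one per navigation unit.
    """
    if len(units) < 2:
        raise ValueError("N must be an integer greater than 1")
    ET.register_namespace("", MPD_NS)  # emit the MPD namespace as the default namespace
    mpd = ET.Element(f"{{{MPD_NS}}}MPD")
    period = ET.SubElement(mpd, f"{{{MPD_NS}}}Period")
    for set_id, primary_url in units:
        aset = ET.SubElement(period, f"{{{MPD_NS}}}AdaptationSet", {"id": set_id})
        # Carry the pointer in the value attribute of a SupplementalProperty element.
        ET.SubElement(aset, f"{{{MPD_NS}}}SupplementalProperty",
                      {"schemeIdUri": POINTER_SCHEME, "value": primary_url})
    return ET.tostring(mpd, encoding="unicode")


xml_text = build_navigation_mpd([
    ("1", "https://example.com/ch1.mpd"),
    ("2", "https://example.com/ch2.mpd"),
])
```

The N > 1 check reflects the requirement stated above; everything else about the layout is a design choice made for the sketch.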
- the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the N navigation units.
- each of the N navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the N navigation units are aggregated to form an aggregated media presentation description.
- each of the N navigation units points to a primary media presentation by referencing an element in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- the video components included in different navigation units of the N navigation units are media representations in different video adaptation sets among N video adaptation sets, where the media representations within any one of the N video adaptation sets are mutually exclusive in selection, and different video adaptation sets among the N video adaptation sets are compatible in selection.
- the audio components included in the N navigation units are media representations in one audio adaptation set, the audio adaptation set is different from any of the N video adaptation sets, and the audio adaptation set is compatible in selection with the N video adaptation sets; or
- the audio components included in different navigation units of the N navigation units are media representations in different audio adaptation sets among N audio adaptation sets, where different audio adaptation sets among the N audio adaptation sets are mutually exclusive in selection.
- a media representation element in the audio adaptation set element includes a region description of the region with which the described media representation is associated in the navigation media presentation.
- media representations described by media representation elements that include the same region description are associated with one another, or adaptation sets described by adaptation set elements that include the same region description are associated with one another.
- the region description is a spatial relationship description (SRD).
- the media presentation description of the navigation media presentation includes N video adaptation set elements in one-to-one correspondence with the N video adaptation sets; each of the N video adaptation set elements includes a descriptor element Ci, and the video adaptation sets described by those of the N video adaptation set elements that satisfy a set commonality condition are compatible in selection, the set commonality condition being that the descriptor elements Ci included in the video adaptation set elements have the same element name and the same scheme identifier schemeIdUri attribute.
- the descriptor element Ci indicates that the media representations in the video adaptation set described by the video adaptation set element that includes the descriptor element Ci are part of the navigation media presentation.
- the descriptor element Ci describes the role, in the navigation media presentation, of the media representations in the video adaptation set described by the video adaptation set element that includes the descriptor element Ci.
- the descriptor element Ci is a Role element, an essential property EssentialProperty element, or a supplemental property SupplementalProperty element.
- the descriptor element Ci is a Role element.
- the set commonality condition is that the descriptor elements Ci included in the video adaptation set elements have the same element name, the same scheme identifier schemeIdUri attribute, and the same parameter value attribute.
- the media presentation description of the navigation media presentation includes N video adaptation set elements in one-to-one correspondence with the N video adaptation sets, where the video adaptation set element VI that corresponds to video adaptation set I includes a pointer to a primary media presentation, and video adaptation set I is any one of the N video adaptation sets.
- the pointer is carried by an attribute of the video adaptation set element VI.
- the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
- the pointer is carried by an essential property EssentialProperty element or a supplemental property SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by a value attribute of an EssentialProperty element in the video adaptation set element VI, or
- the pointer is carried by a value attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by an attribute of a virtual media representation Representation element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, where the virtual Representation element includes none of a media segment template element, a media segment list element, and a base uniform resource locator BaseURL element.
- the pointer is carried by a referenced media presentation ReferencedMediaPresentation element in the video adaptation set element VI.
- the temporal structure of the navigation media presentation does not depend on the temporal structure of the primary media presentations pointed to by the N navigation units in the navigation media presentation.
- a third aspect of the present invention provides a client, including:
- a first obtaining unit configured to acquire a media presentation description of a navigation media presentation, where the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, N being an integer greater than 1;
- a second acquiring unit configured to acquire K navigation units of the N navigation units according to the media presentation description of the navigation media presentation;
- a presentation unit configured to present the K navigation units, where each of the K navigation units points to a primary media presentation, and the presentation quality of the primary media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i.
- the media presentation description of the navigation media presentation is different from the primary media presentation pointed to by each navigation unit of the K navigation units The media presentation description.
- each of the K navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units are aggregated into one aggregated media presentation description.
- each of the K navigation units points to a primary media presentation by referencing an element in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- the video components included in different navigation units of the K navigation units are media representations in different video adaptation sets among K video adaptation sets, wherein the media representations within any one of the K video adaptation sets are mutually exclusive in selection, and different video adaptation sets among the K video adaptation sets are compatible in selection.
- the audio components included in the K navigation units are media representations in one audio adaptation set, where the audio adaptation set is different from any of the K video adaptation sets and is compatible in selection with the K video adaptation sets; or
- the audio components included in different navigation units of the K navigation units are media representations in different audio adaptation sets among K audio adaptation sets, wherein different audio adaptation sets among the K audio adaptation sets are mutually exclusive in selection.
- a media representation element in an audio adaptation set element includes a region description of the region of the navigation media presentation with which the described media representation is associated.
- media representations described by media representation elements that include the same region description are associated with each other, or adaptation sets described by adaptation set elements that include the same region description are associated with each other.
- the region description is a spatial relationship description (SRD).
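As an illustration of associating components through a shared SRD, the sketch below groups adaptation sets by the rectangle in their SRD descriptor. It assumes the SRD is signalled at adaptation set level with the MPEG-DASH scheme `urn:mpeg:dash:srd:2014`, whose value begins with `source_id, object_x, object_y, object_width, object_height`; the sample MPD and the helper names are made up for the example.

```python
from collections import defaultdict
import xml.etree.ElementTree as ET

DASH_NS = "urn:mpeg:dash:schema:mpd:2011"
SRD_SCHEME = "urn:mpeg:dash:srd:2014"


def q(tag):
    """Qualify a tag name with the DASH MPD namespace."""
    return f"{{{DASH_NS}}}{tag}"


def region_of(elem):
    """Return (x, y, width, height) from an SRD descriptor on elem, or None."""
    for name in ("SupplementalProperty", "EssentialProperty"):
        for prop in elem.findall(q(name)):
            if prop.get("schemeIdUri") == SRD_SCHEME:
                fields = [f.strip() for f in prop.get("value", "").split(",")]
                if len(fields) >= 5:
                    # fields[0] is source_id; the next four describe the rectangle.
                    return tuple(int(f) for f in fields[1:5])
    return None


def group_by_region(mpd_root):
    """Map each region to the ids of the adaptation sets that declare it."""
    groups = defaultdict(list)
    for aset in mpd_root.iter(q("AdaptationSet")):
        region = region_of(aset)
        if region is not None:
            groups[region].append(aset.get("id"))
    return dict(groups)


MPD = f"""<MPD xmlns="{DASH_NS}"><Period>
  <AdaptationSet id="1" contentType="video">
    <SupplementalProperty schemeIdUri="{SRD_SCHEME}" value="0,0,0,960,540,1920,1080"/>
  </AdaptationSet>
  <AdaptationSet id="2" contentType="audio">
    <SupplementalProperty schemeIdUri="{SRD_SCHEME}" value="0,0,0,960,540,1920,1080"/>
  </AdaptationSet>
  <AdaptationSet id="3" contentType="video">
    <SupplementalProperty schemeIdUri="{SRD_SCHEME}" value="0,960,0,960,540,1920,1080"/>
  </AdaptationSet>
</Period></MPD>"""

groups = group_by_region(ET.fromstring(MPD))
```

Here video set 1 and audio set 2 share a region and would be treated as belonging to the same navigation unit, while video set 3 occupies a different region.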
- the media presentation description of the navigation media presentation includes K video adaptation set elements in one-to-one correspondence with the K video adaptation sets, wherein each of the K video adaptation set elements includes a descriptor element Ci, and the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition are compatible in selection, the set commonality condition being that the descriptor elements Ci included in the video adaptation set elements have the same element name and the same scheme identifier (schemeIdUri) attribute.
- the descriptor element Ci indicates that the media representations in the video adaptation set described by the video adaptation set element that includes Ci are part of the navigation media presentation.
- the descriptor element Ci describes the role, within the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes Ci.
- the descriptor element Ci is a Role element, an EssentialProperty element, or a SupplementalProperty element.
- where the descriptor element Ci is a Role element, the set commonality condition is that the descriptor elements Ci included in the video adaptation set elements have the same element name, the same schemeIdUri attribute, and the same value attribute.
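The commonality condition on the Ci elements can be read as a grouping key. The sketch below groups adaptation sets by (element name, schemeIdUri), optionally extended with the value attribute for the Role variant. The scheme URI `urn:example:navigation:role` and its values are hypothetical; the text does not specify them.

```python
from collections import defaultdict
import xml.etree.ElementTree as ET

DASH_NS = "urn:mpeg:dash:schema:mpd:2011"
DESCRIPTOR_NAMES = ("Role", "EssentialProperty", "SupplementalProperty")
ROLE_SCHEME = "urn:example:navigation:role"  # hypothetical scheme identifier


def local_name(elem):
    """Strip the '{namespace}' prefix from an element tag."""
    return elem.tag.split("}", 1)[-1]


def commonality_key(descriptor, include_value):
    """Key under which adaptation sets count as compatible in selection."""
    key = (local_name(descriptor), descriptor.get("schemeIdUri"))
    if include_value:
        key += (descriptor.get("value"),)
    return key


def compatible_groups(mpd_root, include_value=False):
    """Group adaptation set ids by the commonality key of their descriptors."""
    groups = defaultdict(list)
    for aset in mpd_root.iter(f"{{{DASH_NS}}}AdaptationSet"):
        for child in aset:
            if local_name(child) in DESCRIPTOR_NAMES:
                groups[commonality_key(child, include_value)].append(aset.get("id"))
    return dict(groups)


MPD = f"""<MPD xmlns="{DASH_NS}"><Period>
  <AdaptationSet id="1"><Role schemeIdUri="{ROLE_SCHEME}" value="tile"/></AdaptationSet>
  <AdaptationSet id="2"><Role schemeIdUri="{ROLE_SCHEME}" value="tile"/></AdaptationSet>
  <AdaptationSet id="3"><Role schemeIdUri="{ROLE_SCHEME}" value="overview"/></AdaptationSet>
</Period></MPD>"""

root = ET.fromstring(MPD)
by_name_and_scheme = compatible_groups(root)
by_name_scheme_value = compatible_groups(root, include_value=True)
```

With the looser key all three sets fall into one compatible group; adding the value attribute splits them into a "tile" group and an "overview" group.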
- the media presentation description of the navigation media presentation includes K video adaptation set elements in one-to-one correspondence with the K video adaptation sets, and the video adaptation set element VI that corresponds to video adaptation set I includes a pointer to a primary media presentation, video adaptation set I being any one of the K video adaptation sets.
- the pointer is carried by the attributes of the video adaptation set element VI.
- the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
- the pointer is carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by the value attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by an attribute of a virtual media representation (Representation) element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- the pointer is carried by a ReferencedMediaPresentation element, which points to a media presentation, in the video adaptation set element VI.
- the temporal structure of the navigation media presentation does not depend on the temporal structure of the primary media presentations pointed to by the K navigation units in the navigation media presentation.
- the presentation unit is further configured to present the audio component of the navigation unit i in the case where the focus of attention stays in the navigation unit i of the K navigation units.
- the presentation unit is further configured to acquire, when the navigation unit i of the K navigation units is selected, the main media presentation pointed to by the navigation unit i.
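The two presentation-unit behaviours just described (play a unit's audio while it has focus, acquire its primary media presentation when it is selected) can be sketched as event handlers. The class names, the `log` list, and the injected `fetch` callable are illustrative assumptions; no real player API is implied.

```python
from dataclasses import dataclass, field


@dataclass
class NavigationUnit:
    unit_id: str
    audio: str    # label of the unit's audio component
    pointer: str  # location of the primary media presentation's description


@dataclass
class PresentationUnit:
    units: dict                              # unit_id -> NavigationUnit
    log: list = field(default_factory=list)  # records actions for inspection

    def on_focus(self, unit_id):
        """While focus rests on a unit, present that unit's audio component."""
        unit = self.units[unit_id]
        self.log.append(("play_audio", unit.audio))

    def on_select(self, unit_id, fetch):
        """When a unit is selected, acquire the primary media presentation it points to."""
        unit = self.units[unit_id]
        self.log.append(("acquire_primary", unit.pointer))
        return fetch(unit.pointer)


units = {
    "u1": NavigationUnit("u1", "audio-1", "http://example.com/main1.mpd"),
    "u2": NavigationUnit("u2", "audio-2", "http://example.com/main2.mpd"),
}
ui = PresentationUnit(units)
ui.on_focus("u2")
# A stand-in fetcher keeps the example offline; a real client would issue an HTTP GET.
result = ui.on_select("u2", fetch=lambda url: f"MPD from {url}")
```

Passing the fetcher in as a parameter keeps the sketch free of network access and makes the selection path easy to exercise in isolation.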
- a fourth aspect of the present invention provides a media presentation navigation apparatus, including:
- a determining unit configured to determine N navigation units included in a navigation media presentation;
- a generating unit configured to generate a media presentation description of the navigation media presentation, where the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, and N is an integer greater than 1;
- each of the N navigation units points to a primary media presentation, and the presentation quality of the primary media presentation pointed to by navigation unit i of the N navigation units is higher than the presentation quality of navigation unit i.
- the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the N navigation units.
- each of the N navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the N navigation units are aggregated into one aggregated media presentation description.
- each of the N navigation units points to a primary media presentation by referencing an element in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- the video components included in different navigation units of the N navigation units are media representations in different video adaptation sets among N video adaptation sets, wherein the media representations within any one of the N video adaptation sets are mutually exclusive in selection, and different video adaptation sets among the N video adaptation sets are compatible in selection.
- the audio components included in the N navigation units are media representations in one audio adaptation set, where the audio adaptation set is different from any of the N video adaptation sets and is compatible in selection with the N video adaptation sets; or
- the audio components included in different navigation units of the N navigation units are media representations in different audio adaptation sets among N audio adaptation sets, wherein different audio adaptation sets among the N audio adaptation sets are mutually exclusive in selection.
- a media representation element in an audio adaptation set element includes a region description of the region of the navigation media presentation with which the described media representation is associated.
- media representations described by media representation elements that include the same region description are associated with each other, or adaptation sets described by adaptation set elements that include the same region description are associated with each other.
- the region description is a spatial relationship description (SRD).
- the media presentation description of the navigation media presentation includes N video adaptation set elements in one-to-one correspondence with the N video adaptation sets, wherein each of the N video adaptation set elements includes a descriptor element Ci, and the video adaptation sets described by those of the N video adaptation set elements that satisfy a set commonality condition are compatible in selection, the set commonality condition being that the descriptor elements Ci included in the video adaptation set elements have the same element name and the same scheme identifier (schemeIdUri) attribute.
- the descriptor element Ci indicates that the media representations in the video adaptation set described by the video adaptation set element that includes Ci are part of the navigation media presentation.
- the descriptor element Ci describes the role, within the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes Ci.
- the descriptor element Ci is a Role element, an EssentialProperty element, or a SupplementalProperty element.
- where the descriptor element Ci is a Role element, the set commonality condition is that the descriptor elements Ci included in the video adaptation set elements have the same element name, the same schemeIdUri attribute, and the same value attribute.
- the media presentation description of the navigation media presentation includes N video adaptation set elements in one-to-one correspondence with the N video adaptation sets, and the video adaptation set element VI that corresponds to video adaptation set I includes a pointer to a primary media presentation, video adaptation set I being any one of the N video adaptation sets.
- the pointer is carried by the attributes of the video adaptation set element VI.
- the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
- the pointer is carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by the value attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by an attribute of a virtual media representation (Representation) element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- the pointer is carried by a ReferencedMediaPresentation element, which points to a media presentation, in the video adaptation set element VI.
- the temporal structure of the navigation media presentation does not depend on the temporal structure of the primary media presentations pointed to by the N navigation units in the navigation media presentation.
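On the generating side, a determining step followed by a generating step could look roughly like the sketch below, which emits one video adaptation set element per navigation unit and attaches a pointer as a SupplementalProperty value. The pointer scheme URI, the input dictionary keys, and the example URLs are assumptions made for illustration; only the DASH element names are standard.

```python
import xml.etree.ElementTree as ET

DASH_NS = "urn:mpeg:dash:schema:mpd:2011"
POINTER_SCHEME = "urn:example:navigation:pointer"  # hypothetical scheme identifier


def q(tag):
    """Qualify a tag name with the DASH MPD namespace."""
    return f"{{{DASH_NS}}}{tag}"


def build_navigation_mpd(units):
    """Build an MPD tree describing N navigation units (N > 1).

    Each unit is a dict with the assumed keys 'media_url', 'bandwidth',
    and 'primary_mpd' (the pointer to the primary media presentation).
    """
    if len(units) < 2:
        raise ValueError("N must be an integer greater than 1")
    mpd = ET.Element(q("MPD"), {"type": "static"})
    period = ET.SubElement(mpd, q("Period"))
    for index, unit in enumerate(units, start=1):
        aset = ET.SubElement(period, q("AdaptationSet"),
                             {"id": str(index), "contentType": "video"})
        # Pointer from this navigation unit to its primary media presentation.
        ET.SubElement(aset, q("SupplementalProperty"),
                      {"schemeIdUri": POINTER_SCHEME, "value": unit["primary_mpd"]})
        rep = ET.SubElement(aset, q("Representation"),
                            {"id": f"nav{index}", "bandwidth": str(unit["bandwidth"])})
        ET.SubElement(rep, q("BaseURL")).text = unit["media_url"]
    return mpd


nav_mpd = build_navigation_mpd([
    {"media_url": "nav1.mp4", "bandwidth": 300000,
     "primary_mpd": "http://example.com/main1.mpd"},
    {"media_url": "nav2.mp4", "bandwidth": 300000,
     "primary_mpd": "http://example.com/main2.mpd"},
])
xml_text = ET.tostring(nav_mpd, encoding="unicode")
```

The low per-unit bandwidth reflects the stated relationship that a navigation unit has lower presentation quality than the primary media presentation it points to; the figure itself is arbitrary.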
- a fifth aspect of the present invention provides a client, including:
- the processor, by invoking code or instructions in the memory, is configured to: obtain a media presentation description of a navigation media presentation, where the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, and N is an integer greater than 1; obtain K navigation units of the N navigation units according to the media presentation description of the navigation media presentation; and present the K navigation units, each of the K navigation units pointing to a primary media presentation, wherein the presentation quality of the primary media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i.
- the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the K navigation units.
- each of the K navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units are aggregated into one aggregated media presentation description.
- each of the K navigation units points to a primary media presentation by referencing an element in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- the video components included in different navigation units of the K navigation units are media representations in different video adaptation sets among K video adaptation sets, wherein the media representations within any one of the K video adaptation sets are mutually exclusive in selection, and different video adaptation sets among the K video adaptation sets are compatible in selection.
- the audio components included in the K navigation units are media representations in one audio adaptation set, where the audio adaptation set is different from any of the K video adaptation sets and is compatible in selection with the K video adaptation sets; or
- the audio components included in different navigation units of the K navigation units are media representations in different audio adaptation sets among K audio adaptation sets, wherein different audio adaptation sets among the K audio adaptation sets are mutually exclusive in selection.
- a media representation element in an audio adaptation set element includes a region description of the region of the navigation media presentation with which the described media representation is associated.
- media representations described by media representation elements that include the same region description are associated with each other, or adaptation sets described by adaptation set elements that include the same region description are associated with each other.
- the region description is a spatial relationship description (SRD).
- the media presentation description of the navigation media presentation includes K video adaptation set elements in one-to-one correspondence with the K video adaptation sets, wherein each of the K video adaptation set elements includes a descriptor element Ci, and the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition are compatible in selection, the set commonality condition being that the descriptor elements Ci included in the video adaptation set elements have the same element name and the same scheme identifier (schemeIdUri) attribute.
- the descriptor element Ci indicates that the media representations in the video adaptation set described by the video adaptation set element that includes Ci are part of the navigation media presentation.
- the descriptor element Ci describes the role, within the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes Ci.
- the descriptor element Ci is a Role element, an EssentialProperty element, or a SupplementalProperty element.
- where the descriptor element Ci is a Role element, the set commonality condition is that the descriptor elements Ci included in the video adaptation set elements have the same element name, the same schemeIdUri attribute, and the same value attribute.
- the media presentation description of the navigation media presentation includes K video adaptation set elements in one-to-one correspondence with the K video adaptation sets, and the video adaptation set element VI that corresponds to video adaptation set I includes a pointer to a primary media presentation, video adaptation set I being any one of the K video adaptation sets.
- in a seventeenth possible implementation of the fifth aspect,
- the pointer is carried by the attributes of the video adaptation set element VI.
- the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
- the pointer is carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by the value attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by an attribute of a virtual media representation (Representation) element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- the pointer is carried by a ReferencedMediaPresentation element, which points to a media presentation, in the video adaptation set element VI.
- the temporal structure of the navigation media presentation does not depend on the temporal structure of the primary media presentations pointed to by the K navigation units in the navigation media presentation.
- the processor is further configured to present the audio component of navigation unit i while the focus of attention rests on navigation unit i of the K navigation units.
- the processor is further configured to acquire, when navigation unit i of the K navigation units is selected, the primary media presentation pointed to by navigation unit i.
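The selection rules for the fifth aspect (K of N units; one media representation per video adaptation set; different adaptation sets presented together) can be sketched as below. The input shape, the "first K sets" policy, and the per-unit bandwidth budget are assumptions for the example; the text does not say how a client chooses K or which representation it picks.

```python
def choose_units(adaptation_sets, k, max_bandwidth):
    """Pick K navigation units and exactly one representation for each.

    adaptation_sets maps a video adaptation set id to a list of
    (representation_id, bandwidth) pairs, one set per navigation unit.
    """
    chosen = {}
    # Assumed policy: take the first K adaptation sets in document order.
    for set_id, reps in list(adaptation_sets.items())[:k]:
        affordable = [r for r in reps if r[1] <= max_bandwidth]
        if affordable:
            # Mutual exclusion within a set: keep only the best affordable one.
            chosen[set_id] = max(affordable, key=lambda r: r[1])
        else:
            # Nothing fits the budget, so fall back to the cheapest representation.
            chosen[set_id] = min(reps, key=lambda r: r[1])
    # Compatibility across sets: all chosen entries are presented together.
    return chosen


video_sets = {
    "1": [("1a", 300_000), ("1b", 800_000)],
    "2": [("2a", 400_000)],
    "3": [("3a", 250_000)],
}
selection = choose_units(video_sets, k=2, max_bandwidth=500_000)
```

Each key of the result is one presented navigation unit and each value is the single representation picked from that unit's adaptation set.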
- a sixth aspect of the embodiments of the present invention provides a media presentation navigation apparatus, including:
- the processor, by invoking code or instructions in the memory, is configured to: determine N navigation units included in a navigation media presentation; and generate a media presentation description of the navigation media presentation, where the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, and N is an integer greater than 1; each of the N navigation units points to a primary media presentation, and the presentation quality of the primary media presentation pointed to by navigation unit i of the N navigation units is higher than the presentation quality of navigation unit i.
- the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the N navigation units.
- each of the N navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the N navigation units are aggregated into one aggregated media presentation description.
- each of the N navigation units points to a primary media presentation by referencing an element in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- the video components included in different navigation units of the N navigation units are media representations in different video adaptation sets among N video adaptation sets, wherein the media representations within any one of the N video adaptation sets are mutually exclusive in selection, and different video adaptation sets among the N video adaptation sets are compatible in selection.
- the audio components included in the N navigation units are media representations in one audio adaptation set, where the audio adaptation set is different from any of the N video adaptation sets and is compatible in selection with the N video adaptation sets; or
- the audio components included in different navigation units of the N navigation units are media representations in different audio adaptation sets among N audio adaptation sets, wherein different audio adaptation sets among the N audio adaptation sets are mutually exclusive in selection.
- a media representation element in an audio adaptation set element includes a region description of the region of the navigation media presentation with which the described media representation is associated.
- media representations described by media representation elements that include the same region description are associated with each other, or adaptation sets described by adaptation set elements that include the same region description are associated with each other.
- the region description is a spatial relationship description (SRD).
- the media presentation description of the navigation media presentation includes N video adaptation set elements in one-to-one correspondence with the N video adaptation sets, wherein each of the N video adaptation set elements includes a descriptor element Ci, and the video adaptation sets described by those of the N video adaptation set elements that satisfy a set commonality condition are compatible in selection, the set commonality condition being that the descriptor elements Ci included in the video adaptation set elements have the same element name and the same scheme identifier (schemeIdUri) attribute.
- the descriptor element Ci indicates that the media representations in the video adaptation set described by the video adaptation set element that includes Ci are part of the navigation media presentation.
- the descriptor element Ci describes the role, within the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes Ci.
- the descriptor element Ci is a Role element, an EssentialProperty element, or a SupplementalProperty element.
- where the descriptor element Ci is a Role element, the set commonality condition is that the descriptor elements Ci included in the video adaptation set elements have the same element name, the same schemeIdUri attribute, and the same value attribute.
- the media presentation description of the navigation media presentation includes N video adaptation set elements in one-to-one correspondence with the N video adaptation sets, and the video adaptation set element VI that corresponds to video adaptation set I includes a pointer to a primary media presentation, video adaptation set I being any one of the N video adaptation sets.
- the pointer is carried by the attributes of the video adaptation set element VI.
- the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
- the pointer is carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by the value attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by an attribute of a virtual media representation (Representation) element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- the pointer is carried by a ReferencedMediaPresentation element, which points to a media presentation, in the video adaptation set element VI.
- the temporal structure of the navigation media presentation does not depend on the temporal structure of the primary media presentations pointed to by the N navigation units in the navigation media presentation.
- a seventh aspect of the present invention provides a communication system, including:
- the client is configured to: obtain a media presentation description of a navigation media presentation from the content server, where the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, and N is an integer greater than 1; obtain K navigation units of the N navigation units from the content server according to the media presentation description of the navigation media presentation; and present the K navigation units, each of the K navigation units pointing to a primary media presentation, wherein the presentation quality of the primary media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i.
- the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the K navigation units.
- each of the K navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units are aggregated into one aggregated media presentation description.
- each of the K navigation units points to a primary media presentation by referencing an element in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- the video components included in different navigation units of the K navigation units are media representations in different video adaptation sets among K video adaptation sets, wherein the media representations within any one of the K video adaptation sets are mutually exclusive in selection, and different video adaptation sets among the K video adaptation sets are compatible in selection.
- the audio components included in the K navigation units are media representations in one audio adaptation set, where the audio adaptation set is different from any of the K video adaptation sets and is compatible in selection with the K video adaptation sets; or
- the audio components included in different navigation units of the K navigation units are media representations in different audio adaptation sets among K audio adaptation sets, wherein different audio adaptation sets among the K audio adaptation sets are mutually exclusive in selection.
- a media representation element in an audio adaptation set element includes a region description of the region of the navigation media presentation with which the described media representation is associated.
- media representations described by media representation elements that include the same region description are associated with each other, or adaptation sets described by adaptation set elements that include the same region description are associated with each other.
- the region description is a spatial relationship description (SRD).
- the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets; wherein the K video adaptation set elements include a description sub-element Ci, and there is selection compatibility between the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition, the set commonality condition being that the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are the same.
- the description sub-element Ci describes that the media representations in the video adaptation set described by the video adaptation set element including the description sub-element Ci are components of the navigation media presentation.
- the description sub-element Ci describes the role, in the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes the description sub-element Ci.
- the description sub-element Ci is a role description (Role) element, an essential property (EssentialProperty) element, or a supplemental property (SupplementalProperty) element.
- the description sub-element Ci is a role description (Role) element, and the set commonality condition is that the description sub-elements Ci included in the video adaptation set elements have the same element name, the same scheme identifier (schemeIdUri) attribute, and the same value attribute.
- the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
- the video adaptation set element VI, among the K video adaptation set elements, that corresponds to the video adaptation set I includes a pointer for pointing to a primary media presentation, the video adaptation set I being any one of the K video adaptation sets.
- the pointer is carried by the attributes of the video adaptation set element VI.
- the pointer is carried by an xlink:href attribute of the video adaptation set element VI.
- the pointer is carried by an essential property (EssentialProperty) element or a supplemental property (SupplementalProperty) element in the video adaptation set element VI.
- the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by the value attribute of an EssentialProperty element in the video adaptation set element VI, or
- the pointer is carried by the value attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer is carried by an attribute of a virtual media representation (Representation) element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- the pointer is carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- the time structure of the navigation media presentation does not depend on the temporal structure of the primary media presentation pointed to by the K navigation units in the navigation media presentation.
- each of the K navigation units can point to a main media presentation, which is equivalent to introducing an association between the navigation unit and the main media presentation. When navigation unit i of the K navigation units is selected, this association enables the client to obtain the media presentation description of the main media presentation j pointed to by navigation unit i, and then to obtain main media presentation j for presentation according to that description. This facilitates flexible switching between the navigation media presentation and the main media presentation, thereby supporting video navigation in HTTP-based media streaming service scenarios and helping to improve the user experience.
- FIG. 1-a is a schematic structural diagram of a media presentation description according to an embodiment of the present disclosure
- FIG. 1-b is a schematic flowchart of a media presentation navigation method based on an HTTP media stream according to an embodiment of the present invention
- FIG. 1-c is a schematic diagram of a time structure of a single media presentation according to an embodiment of the present invention
- FIG. 1-d is a schematic diagram of a time structure of multiple media presentations according to an embodiment of the present invention
- FIGS. 1-e and 1-f are schematic diagrams of encoding the media representations of a navigation unit according to an embodiment of the present invention.
- FIG. 1-g is a schematic diagram of a time structure of another multiple media presentation according to an embodiment of the present invention
- FIG. 1-h is a schematic diagram of a time structure of another multiple media presentation according to an embodiment of the present invention
- FIG. 1 is a schematic diagram of a synthesized navigation media presentation according to an embodiment of the present invention
- FIG. 1 is a schematic diagram of a video component of a client decoding output navigation unit according to an embodiment of the present invention
- FIG. 1 is a schematic diagram of an audio component of a client decoding output navigation unit according to an embodiment of the present invention
- FIG. 2 is a schematic flowchart of another media presentation and navigation method based on HTTP media stream according to an embodiment of the present invention
- FIG. 3 is a schematic flowchart of another media presentation and navigation method based on an HTTP media stream according to an embodiment of the present disclosure
- FIG. 3 is a schematic diagram of a network architecture according to an embodiment of the present disclosure.
- FIG. 4 is a schematic diagram of a client according to an embodiment of the present invention.
- FIG. 5 is a schematic diagram of another client according to an embodiment of the present disclosure.
- FIG. 6 is a schematic diagram of a server according to an embodiment of the present disclosure.
- FIG. 7 is a schematic diagram of another server according to an embodiment of the present disclosure.
- FIG. 8 is a schematic diagram of a communication system according to an embodiment of the present invention.
- Embodiments of the present invention provide a media presentation navigation method and related device based on a hypertext transfer protocol media stream, so as to support video navigation in an HTTP-based media streaming service scenario, thereby improving user experience.
- an Electronic Program Guide (EPG) is essentially a list.
- the EPG contains information such as the programs and times of different channels; the user can find a television channel of interest through the EPG and then switch from the EPG channel to that channel.
- a navigation service provided in a graphical manner is more intuitive and user-friendly.
- a navigation channel is represented by a navigation unit.
- the navigation unit like the TV channel it represents, can have different media components, such as video, audio, and so on.
- the graphical navigation service presents the video of a group of navigation units as multiple small-format images (dynamic image sequences or static images). The user can browse the small-format images and change the navigation unit in focus, and can even hear the audio of the navigation unit currently in focus. The user can select a navigation unit to switch to the channel corresponding to that navigation unit.
- HTTP-based adaptive streaming services have become the mainstream technology for multimedia streaming services, representing the latest developments in this field.
- Apple's HTTP Live Streaming (HLS)
- Microsoft's Smooth Streaming (SS)
- Dynamic Adaptive Streaming over HTTP (DASH) from the Moving Picture Experts Group (MPEG)
- the MPEG DASH standard is a standardized technology developed by MPEG and is expected to be widely adopted, thus changing the fragmented market structure.
- the existing HTTP-based media streaming service applies only to a single media presentation (media presentation is a term used in the DASH standard, conceptually equivalent to a TV channel), whereas the navigation service serves multiple media presentations and is therefore a service that spans multiple media presentations.
- the present invention aims to enable HTTP-based media streaming services to support the navigation service.
- although the present invention uses terms from the DASH standard as the basis for the description and embodiments, the method of the present invention is not limited to the DASH standard and is applicable to a variety of HTTP-based media streaming services.
- the technical solutions of some embodiments of the present invention may, for example, be based on the following DASH specifications and amendments:
- ISO/IEC 23009-1, Part 1: Media presentation description and segment formats, 2nd Edition, 2014.
- Part 1: Media presentation description and segment formats, AMENDMENT 1: High Profile and Availability Time Synchronization.
- ISO/IEC 23009-1:2014/FDAM 1, Part 1: Media presentation description and segment formats, AMENDMENT 2: Spatial Relationship Description, Generalized URL parameters and other extensions.
- a piece of media content is encoded into multiple versions, each with different characteristics such as bit rate. These versions are called Representations in DASH; they represent the same media content and, from the perspective of content presentation (viewing/playing), are alternatives to one another.
- a media representation is divided into accessible units in time - usually a few seconds in length, called a media segment or a media sub-segment (a media segment can be logically divided into media sub-segments).
- media segments and initialization segments are collectively referred to as segments.
- the media representation is stored on the content server (an HTTP server) for the client to obtain, and the segment is the smallest unit that the client can access through a Uniform Resource Locator (URL).
- the Media Presentation Description (MPD) is an Extensible Markup Language (XML) file that contains the metadata required by the client. It describes the characteristics of the media representations and how to obtain them from the server, including the bit rate and resolution of a media representation, the aspect ratio of the video image, the URLs of the segments included in the media representation, and the like.
- the client can construct an HTTP URL to request media segments in the media presentation from the content server, and can switch to other media representations at the media segment boundaries to accommodate changes in available bandwidth.
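The following is a minimal sketch of how a client might pick a media representation for the currently available bandwidth and then build the URL of a media segment. The base URL, the template string, and the representation list are hypothetical values chosen for illustration; only the `$RepresentationID$` and `$Number$` identifiers follow the DASH segment template convention.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Representation:
    rep_id: str
    bandwidth: int  # bits per second


def pick_representation(reps: List[Representation], available_bps: int) -> Representation:
    """Return the highest-bandwidth representation that fits, else the lowest one."""
    fitting = [r for r in reps if r.bandwidth <= available_bps]
    if fitting:
        return max(fitting, key=lambda r: r.bandwidth)
    return min(reps, key=lambda r: r.bandwidth)


def segment_url(base_url: str, template: str, rep: Representation, number: int) -> str:
    """Expand a segment template into a full HTTP URL."""
    path = template.replace("$RepresentationID$", rep.rep_id)
    path = path.replace("$Number$", str(number))
    return base_url.rstrip("/") + "/" + path


# Hypothetical representations of one adaptation set.
REPS = [
    Representation("v-low", 500_000),
    Representation("v-mid", 1_500_000),
    Representation("v-high", 4_000_000),
]
```

Calling `pick_representation` again at each segment boundary with a fresh bandwidth estimate gives the switching behaviour described above.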
- the HTTP-based adaptive media streaming service allows for changes in the characteristics of the content in a media presentation, such as changes in the way the media is encoded.
- this is achieved through the concept of the "Period", which is used for splicing content, for example when one content paragraph is a news program and the next content paragraph is an advertisement.
- a media presentation includes one or more content paragraphs (Periods) that are sequential in time. The beginning of a content paragraph means that something has changed compared with the previous content paragraph, for example: a change in content, such as from a news program to a sports program, from a sports program to a movie, from a movie to an advertisement, or from an advertisement to a variety show; a change in the way the content is encoded, such as from the H.264 encoding scheme to the H.265 encoding scheme; a change in the number of media representations, such as adding or removing media representations; or a change in content components, such as adding a Chinese audio representation.
- at that point the client's working conditions have changed, and the client may have to be reinitialized.
- a collection of media representations containing the same media content and media components is referred to as an adaptation set; an adaptation set contains at least one media representation, and the media representations in an adaptation set can substitute for one another.
- different adaptation sets may be compatible or mutually exclusive.
- the media presentation can include one or more temporally sequential content paragraphs, each content paragraph containing one or more Adaptation Sets.
- each Adaptation Set contains one or more media representations (Representations).
- each media representation contains one or more segments.
- the media presentation description has a hierarchical structure similar to the media presentation, as shown in Figure 1-a.
- the concept of media presentation described above may be represented by an XML element in the media presentation description: the media presentation element includes one or more content paragraph elements, and each content paragraph element contains one or more adaptation set (AdaptationSet) elements. Each AdaptationSet element contains one or more Representation elements.
- the media presentation corresponds to a media presentation description element in the media presentation description
- a content paragraph in the media presentation corresponds to a content paragraph element in the media presentation description
- an adaptation set in the media presentation corresponds to an adaptation set element in the media presentation description
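The hierarchy described above (media presentation, Period, adaptation set, media representation, segment) can be sketched as nested data classes. This is only an illustrative model; the class and field names are assumptions and do not come from the DASH schema.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Representation:
    rep_id: str
    segment_urls: List[str] = field(default_factory=list)


@dataclass
class AdaptationSet:
    set_id: str
    representations: List[Representation] = field(default_factory=list)


@dataclass
class Period:
    start_seconds: float
    adaptation_sets: List[AdaptationSet] = field(default_factory=list)


@dataclass
class MediaPresentation:
    periods: List[Period] = field(default_factory=list)

    def count_representations(self) -> int:
        """Walk Period -> AdaptationSet -> Representation and count the leaves."""
        return sum(
            len(aset.representations)
            for period in self.periods
            for aset in period.adaptation_sets
        )
```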
- the following describes the media presentation navigation method based on HTTP media stream.
- the navigation service serves multiple media presentations, makes it convenient to select among a group of media presentations, and is a service that spans multiple media presentations.
- the multiple media presentations served by the navigation service are called member media presentations of the navigation service, referred to simply as member media presentations or main media presentations.
- the navigation service can be implemented as a media presentation (i.e., a navigation media presentation), and the navigation media presentation is independent of its member media presentations.
- the navigation service and its member media presentations are each described by their own media presentation descriptions. If the navigation service serves N media presentations, there are N+1 media presentations and N+1 corresponding media presentation descriptions.
- each member media presentation corresponds to a navigation unit in the navigation media presentation, and the navigation unit represents that member media presentation.
- a navigation unit represents a media presentation and may include multiple media components, such as a video component (also referred to as a video media representation) and an audio component (also referred to as an audio media representation).
- the video of a navigation unit is a small format image representing a media presentation.
- the video of the navigation unit is usually cropped from the video component of the media presentation it represents, that is, it is a part of the picture, and the presentation quality of the navigation unit (such as resolution and/or frame rate) is lower than that of the main media presentation.
- the audio of the navigation unit comes from the audio of the main media presentation.
- the video of one navigation unit is implemented as one or more media representations (one in some examples).
- FIG. 1-b is a schematic flowchart of a media presentation navigation method based on HTTP media stream according to an embodiment of the present invention.
- a media presentation navigation method based on an HTTP media stream provided by an embodiment of the present invention may include:
- the client obtains a media presentation description of the navigation media presentation.
- the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation.
- the client may obtain a media presentation description of the navigation media presentation from a content server or other device.
- N is an integer greater than 1.
- the N may be equal to 7, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
- the client may be a DASH client or other client with DASH client logic function or other client of HTTP-based media streaming service.
- the client may be, for example, a personal computer, a mobile phone, a tablet, a television, or a set top box.
- the navigation media presentation can be seen as a special media presentation.
- the client acquires K navigation units of the N navigation units according to the media presentation description of the navigation media presentation.
- K is a positive integer less than or equal to the N.
- the K may be equal to 1, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
- the K navigation units may be in one-to-one correspondence with K logical presentation units (a logical presentation unit may be, for example, a navigation window), that is, each of the K navigation units may be presented by a different logical presentation unit.
- the client presents the K navigation units.
- each of the K navigation units points to a primary media presentation.
- the presentation quality of the main media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i. That is, the presentation quality of the media representation of a navigation unit is lower than the presentation quality of the main media presentation that the navigation unit represents.
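The client-side steps above (obtain the description of the navigation media presentation, acquire K of the N navigation units, present them, and follow a selected unit to its main media presentation) might be sketched as follows. The `NavigationUnit` class, the URLs, and the `FAKE_SERVER` dictionary standing in for HTTP requests are all hypothetical and serve only to illustrate the flow.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class NavigationUnit:
    unit_id: str
    main_mpd_url: str  # hypothetical pointer to the main media presentation's description


def choose_units(units: List[NavigationUnit], k: int) -> List[NavigationUnit]:
    """Acquire K of the N described navigation units (here simply the first K)."""
    if not 1 <= k <= len(units):
        raise ValueError("K must satisfy 1 <= K <= N")
    return units[:k]


def switch_to_main(selected: NavigationUnit, fetch: Callable[[str], str]) -> str:
    """Follow the selected unit's pointer and return the main presentation's description."""
    return fetch(selected.main_mpd_url)


# Stand-in for HTTP GET requests against a content server.
FAKE_SERVER = {
    "http://example.com/ch1.mpd": "<MPD>channel 1</MPD>",
    "http://example.com/ch2.mpd": "<MPD>channel 2</MPD>",
    "http://example.com/ch3.mpd": "<MPD>channel 3</MPD>",
}

# N = 3 navigation units described by the navigation media presentation.
UNITS = [NavigationUnit(f"unit{i}", f"http://example.com/ch{i}.mpd") for i in (1, 2, 3)]
```

In a real client, `fetch` would issue an HTTP request, and the choice of K would depend on how many navigation windows the display can show.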
- the media presentation description of the navigation media presentation may be different from the media presentation description of the primary media presentation pointed to by each of the K navigation units.
- the navigation media presentation may have an independent media presentation description, and the media presentation description of the primary media presentation pointed to by each of the K navigation units may likewise be independent of the media presentation description of the navigation media presentation.
- for example, the K navigation units point to K main media presentations, each of which has its own media presentation description, giving K media presentation descriptions; the media presentation description of the navigation media presentation is different from any one of these K media presentation descriptions, that is, the navigation media presentation is described by a (K+1)th media presentation description.
- alternatively, the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units may be aggregated to form an aggregated media presentation description (also referred to as a supermedia presentation description).
- the aggregated media presentation description can be used to describe both the navigation media presentation and the primary media presentations it points to.
- introducing the supermedia presentation description helps strengthen the association between the navigation media presentation and the primary media presentations it points to.
- the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
- each of the K navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
- the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
- the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the K navigation units may be aggregated to form an aggregated media presentation description.
- each of the K navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component; further, a navigation unit may also include subtitle components or other types of media components.
- the present invention provides a signaling mechanism for navigation services through the media presentation description (such as the MPD in the DASH standard).
- the media presentation description can inform the client of which navigation units make up the navigation media presentation, the components of each navigation unit, the relationship between a navigation unit and the member media presentations of the navigation service, the relationship between the video components of the navigation units, the relationship between the audio components of the navigation units, the relationship between the audio component and the video component of a navigation unit, and the like.
- the video components included in different navigation units of the K navigation units are media representations in different video adaptation sets among K video adaptation sets, where the media representations in any one of the K video adaptation sets have selective mutual exclusion, and different video adaptation sets among the K video adaptation sets have selection compatibility.
- for example, the video component included in navigation unit i of the K navigation units may belong to video adaptation set Ci of the K video adaptation sets, and the video component included in navigation unit j of the K navigation units may belong to video adaptation set Cj of the K video adaptation sets, where video adaptation set Cj and video adaptation set Ci are two different video adaptation sets among the K video adaptation sets.
- navigation unit j and navigation unit i can be any two of the K navigation units.
- selection compatibility means that the objects can be selected at the same time. For example, if there is selection compatibility between different video adaptation sets among the K video adaptation sets, media representations in multiple of the K video adaptation sets can be selected at the same time.
- selective mutual exclusion means that the objects cannot be selected at the same time. For example, if there is selective mutual exclusion between the media representations in any one of the K video adaptation sets, multiple media representations in the same video adaptation set cannot be selected at the same time. Suppose video adaptation set I of the K video adaptation sets includes 10 media representations; if there is selective mutual exclusion between the media representations in video adaptation set I, only one of the 10 media representations can be selected at a time, and multiple of the 10 media representations cannot be selected at the same time.
- the K navigation units include an audio component that is a media representation in one audio adaptation set, and this audio adaptation set is different from any one of the K video adaptation sets.
- the audio adaptation set has selection compatibility with the K video adaptation sets. For example, suppose the audio adaptation set includes 20 media representations; if there is selective mutual exclusion between the media representations in the audio adaptation set, only one of the 20 media representations can be selected at a time, and multiple of the 20 media representations cannot be selected at the same time.
- alternatively, the audio components included in different navigation units of the K navigation units are media representations in different audio adaptation sets among K audio adaptation sets.
- the different audio adaptation sets among the K audio adaptation sets have selective mutual exclusion.
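The selection rules above can be checked with a short sketch: at most one media representation may be chosen from any one adaptation set (selective mutual exclusion), representations from different adaptation sets may be combined (selection compatibility), and, where K audio adaptation sets are used, at most one of those sets may contribute. The function name, the identifiers, and the `exclusive_sets` parameter are assumptions made for illustration.

```python
from typing import Dict, FrozenSet, Iterable


def is_valid_selection(
    selected: Iterable[str],
    rep_to_set: Dict[str, str],
    exclusive_sets: FrozenSet[str] = frozenset(),
) -> bool:
    """Check a set of chosen representation ids against the selection rules."""
    sets_used = [rep_to_set[rep_id] for rep_id in selected]
    # Two representations from the same adaptation set are mutually exclusive.
    if len(sets_used) != len(set(sets_used)):
        return False
    # Adaptation sets listed in exclusive_sets exclude one another as well.
    return sum(1 for s in set(sets_used) if s in exclusive_sets) <= 1


# Hypothetical layout: two video adaptation sets and two audio adaptation sets.
REP_TO_SET = {
    "v1-low": "video1",
    "v1-high": "video1",
    "v2-low": "video2",
    "a1": "audio1",
    "a2": "audio2",
}
AUDIO_SETS = frozenset({"audio1", "audio2"})
```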
- a media representation element in the audio adaptation set element may include a region description of the region associated with the described media representation in the navigation media presentation.
- the media representations described by media representation elements that include the same region description are associated with each other, or the adaptation sets described by adaptation set elements that include the same region description are associated with each other.
- for example, denote the media representation described by media representation element i as media representation ri, and the media representation described by media representation element j as media representation rj. If media representation element i and media representation element j contain the same region description, this indicates that media representation ri and media representation rj are associated.
- similarly, if media representation element i and adaptation set element ci include the same region description, this indicates that the media representation described by media representation element i is associated with the media representations in the adaptation set described by adaptation set element ci. For example, the media representation described by media representation element i may be an audio media representation, and the media representations in the adaptation set described by adaptation set element ci may be video media representations.
- the area description may be a spatial relationship description (SRD).
- the area description can also be other types of descriptive information that can be used to describe the location area.
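As a minimal sketch of the association rule, elements that carry an identical region description can be grouped together. The element identifiers are hypothetical, and the comma-separated strings are only assumed to resemble SRD-style values; any string that describes a location area would work the same way here.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def associate_by_region(elements: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Group element ids by region description; keep only groups that link 2+ elements."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for element_id, region in elements:
        groups[region].append(element_id)
    return {region: ids for region, ids in groups.items() if len(ids) > 1}
```

For instance, an audio media representation element and a video adaptation set element that both carry the same region string end up in the same group, which marks them as belonging to the same navigation window.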
- the media presentation description of the navigation media presentation includes K video adaptation set elements, which are in one-to-one correspondence with the K video adaptation sets.
- the K video adaptation set elements include a description sub-element Ci, and the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition have selection compatibility.
- the set commonality condition may be that the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are both the same.
- the description sub-element Ci may describe that the media representations in the video adaptation set described by the video adaptation set element including the description sub-element Ci are components of the navigation media presentation, or may describe the role that the media representations in the video adaptation set corresponding to that video adaptation set element play in the navigation media presentation; the role may be, for example, primary, supplementary, subtitle, or dubbing.
- the description sub-element Ci may be, for example, an EssentialProperty element, a SupplementalProperty element, a Role element, or another element.
- when the description sub-element Ci is a role description (Role) element, the set commonality condition may be that the description sub-elements Ci included in the video adaptation set elements have the same element name, the same scheme identifier (schemeIdUri) attribute, and the same value attribute.
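A minimal sketch of evaluating the set commonality condition: adaptation set elements are grouped by the element name and schemeIdUri of their description sub-element, optionally also by the value attribute (as in the Role case). The sample description and the scheme URI `urn:example:navigation:2015` are hypothetical; `urn:mpeg:dash:role:2011` is the DASH role scheme.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

NS = "{urn:mpeg:dash:schema:mpd:2011}"
DESCRIPTOR_TAGS = ("Role", "EssentialProperty", "SupplementalProperty")

# Hypothetical description; the navigation scheme URI is made up for this sketch.
MPD_XML = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011">
  <Period>
    <AdaptationSet id="1"><SupplementalProperty schemeIdUri="urn:example:navigation:2015" value="unit"/></AdaptationSet>
    <AdaptationSet id="2"><SupplementalProperty schemeIdUri="urn:example:navigation:2015" value="unit"/></AdaptationSet>
    <AdaptationSet id="3"><Role schemeIdUri="urn:mpeg:dash:role:2011" value="main"/></AdaptationSet>
  </Period>
</MPD>"""


def compatible_groups(mpd_xml, match_value=False):
    """Group AdaptationSet ids by (element name, schemeIdUri[, value]) of their descriptors."""
    groups = defaultdict(list)
    root = ET.fromstring(mpd_xml)
    for aset in root.iter(NS + "AdaptationSet"):
        for child in aset:
            name = child.tag[len(NS):] if child.tag.startswith(NS) else child.tag
            if name in DESCRIPTOR_TAGS:
                key = (name, child.get("schemeIdUri"))
                if match_value:
                    key += (child.get("value"),)
                groups[key].append(aset.get("id"))
    return dict(groups)
```

Adaptation sets that land in the same group satisfy the commonality condition and can therefore be treated as selection compatible.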
- the media presentation description of the navigation media presentation includes the K video adaptation set elements, which are in one-to-one correspondence with the K video adaptation sets.
- the video adaptation set element VI, among the K video adaptation set elements, that corresponds to the video adaptation set I includes a pointer for pointing to a primary media presentation, and the video adaptation set I may be any one of the K video adaptation sets.
- the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
- the pointer can be carried by an attribute of the video adaptation set element VI.
- the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
- the pointer can be carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI;
- the pointer may be carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by the value attribute or another attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute or another attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base Uniform Resource Locator (BaseURL) element.
- the pointer can also be carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- the ReferencedMediaPresentation element is a newly extended element; that is, the pointer can be carried by a newly extended element in the video adaptation set element VI. The name of the newly extended element that carries the pointer is not limited to ReferencedMediaPresentation and may be another element name.
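The carriers listed above could be probed in turn by a client, as in the following sketch. It covers three of the options: the xlink:href attribute, the value attribute of an EssentialProperty or SupplementalProperty element, and a ReferencedMediaPresentation child element. The scheme URI `urn:example:navigation:pointer:2015`, the sample URLs, and the assumption that ReferencedMediaPresentation carries the pointer as element text are all hypothetical.

```python
import xml.etree.ElementTree as ET

MPD_NS = "{urn:mpeg:dash:schema:mpd:2011}"
XLINK_HREF = "{http://www.w3.org/1999/xlink}href"
POINTER_SCHEME = "urn:example:navigation:pointer:2015"  # hypothetical scheme identifier

SAMPLE = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011" xmlns:xlink="http://www.w3.org/1999/xlink">
  <Period>
    <AdaptationSet id="1" xlink:href="http://example.com/ch1.mpd"/>
    <AdaptationSet id="2">
      <SupplementalProperty schemeIdUri="urn:example:navigation:pointer:2015" value="http://example.com/ch2.mpd"/>
    </AdaptationSet>
    <AdaptationSet id="3">
      <ReferencedMediaPresentation>http://example.com/ch3.mpd</ReferencedMediaPresentation>
    </AdaptationSet>
  </Period>
</MPD>"""


def find_pointer(adaptation_set):
    """Return the pointer carried by an AdaptationSet element, or None if absent."""
    href = adaptation_set.get(XLINK_HREF)
    if href:
        return href
    for tag in ("EssentialProperty", "SupplementalProperty"):
        for prop in adaptation_set.findall(MPD_NS + tag):
            if prop.get("schemeIdUri") == POINTER_SCHEME and prop.get("value"):
                return prop.get("value")
    ref = adaptation_set.find(MPD_NS + "ReferencedMediaPresentation")
    if ref is not None and ref.text:
        return ref.text.strip()
    return None


def pointers(mpd_xml):
    """Map each AdaptationSet id to the main media presentation it points to."""
    root = ET.fromstring(mpd_xml)
    return {a.get("id"): find_pointer(a) for a in root.iter(MPD_NS + "AdaptationSet")}
```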
- the time structure of the navigation media presentation may be independent of the time structures of the primary media presentations pointed to by the K navigation units in the navigation media presentation.
- the audio of a navigation unit may be obtained by encoding the audio of the main media presentation, and the video of a navigation unit may be obtained by encoding the video of the main media presentation, so that there need be no correlation between the time structure of the navigation unit and the time structure of the main media presentation.
- Figure 1-c illustrates, by way of example, the temporal structure of a media presentation that includes a number of consecutive Periods.
- Figure 1-d illustrates, by way of example, the temporal structure of multiple media presentations, each media presentation including a contiguous number of Periods.
- the time structures of the multiple media presentations differ; for example, their Period boundaries are not aligned.
- a media presentation is sequential in time, and a media presentation description likewise describes a sequential time structure; describing the non-sequential time structure of multiple concurrent media presentations is beyond the capability of a traditional media presentation description.
- the media expressions (audio, video, and so on) of the main media presentation pointed to by each navigation unit can be re-encoded to obtain the media expressions of that navigation unit, that is, the media expressions of the main media presentation pointed to by each navigation unit and the media expressions of the navigation unit are independent.
- the media expressions of each navigation unit are independent, and the audio component and video component of the same navigation unit are also independent.
- the Period arrangement of the media expressions of a navigation unit need not be affected by the Period arrangement of the media expressions of the corresponding primary media presentation.
- Figures 1-e and 1-f illustrate examples of the manner in which the content server encodes the video media representation and audio media representation of the primary media presentation pointed to by the navigation unit.
- Figure 1-g shows an example of a Period arrangement for the media expressions of the N navigation units of the navigation media presentation.
- the Period arrangements of the media expressions of the N navigation units of the navigation media presentation are aligned.
- Figure 1-h shows that, when a navigation unit is added, the Period arrangement of the newly added navigation unit is aligned with the Period arrangements of the media expressions of the other navigation units.
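- as a minimal sketch of the alignment property described for Figures 1-g and 1-h, the following checks whether the Period boundaries of several navigation units coincide. Representing each navigation unit as a list of Period durations in seconds, and the names `period_boundaries` and `periods_aligned`, are assumptions made for illustration only.

```python
def period_boundaries(durations):
    """Turn a list of Period durations (seconds) into cumulative boundaries."""
    boundaries, t = [], 0
    for d in durations:
        t += d
        boundaries.append(t)
    return boundaries

def periods_aligned(units):
    """Return True if every navigation unit has the same Period boundaries."""
    all_boundaries = [period_boundaries(d) for d in units.values()]
    return all(b == all_boundaries[0] for b in all_boundaries)

# Two navigation units whose Periods end at 30 s, 60 s and 120 s.
units = {"unit1": [30, 30, 60], "unit2": [30, 30, 60]}
print(periods_aligned(units))  # True

# A newly added unit whose first Period ends at 45 s breaks the alignment.
units["unit3"] = [45, 15, 60]
print(periods_aligned(units))  # False
```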
- Figure 1-i shows an example of the manner in which the content server uses the media presentation descriptions of the main media presentations pointed to by the navigation units to obtain the media presentation description of the navigation media presentation.
- the content server can also obtain the media presentation description of the navigation media presentation by other means.
- Figures 1-j and 1-k show an example of a client selecting K navigation units for rendering.
- the video media representations of the K navigation units among the N navigation units will be decoded and rendered, and, of the audio media representations of the K navigation units, the audio media representation of the highlighted navigation unit will be decoded for presentation.
- the client can select the specific manner in which the K navigation units are presented based on the media presentation description of the navigation media presentation and on user instructions.
- the method further includes: if the focus of attention stays on navigation unit i of the K navigation units, the client presents the audio component of navigation unit i.
- the method further includes: if navigation unit i of the K navigation units is selected, the client acquires the primary media presentation pointed to by navigation unit i. Further, the client may also present the primary media presentation pointed to by navigation unit i.
- because each of the K navigation units can point to one main media presentation, an association is in effect introduced between the navigation unit and the main media presentation. This enables the client, when navigation unit i of the K navigation units is selected, to obtain the media presentation description of the main media presentation j pointed to by navigation unit i, and then to obtain the main media presentation j for presentation according to that media presentation description. This facilitates flexible switching between the navigation media presentation and the main media presentation, thereby supporting video navigation in the HTTP-based media streaming service scenario and helping to improve the user experience.
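- as a minimal sketch of the two client reactions described above (focus on navigation unit i presents its audio component; selection of navigation unit i follows its pointer to the main media presentation), the following models only the client state. The class name `NavigationClient`, the unit identifiers, and the URLs are hypothetical, and stopping the navigation audio on selection is an assumption rather than something the text specifies.

```python
class NavigationClient:
    """Tracks what a client would present in response to user actions."""

    def __init__(self, pointers):
        # pointers: navigation unit id -> URL of the media presentation
        # description of the main media presentation the unit points to.
        self.pointers = pointers
        self.audio_unit = None     # unit whose audio component is presented
        self.requested_mpd = None  # description requested after a selection

    def on_focus(self, unit_id):
        # Focus stays on unit i: present the audio component of unit i only.
        if unit_id not in self.pointers:
            raise KeyError(unit_id)
        self.audio_unit = unit_id

    def on_select(self, unit_id):
        # Unit i is selected: follow its pointer to the main media presentation.
        self.requested_mpd = self.pointers[unit_id]
        self.audio_unit = None  # assumption: navigation audio stops on switch
        return self.requested_mpd

client = NavigationClient({"u1": "main1.mpd", "u2": "main2.mpd"})
client.on_focus("u2")
print(client.audio_unit)       # u2
print(client.on_select("u1"))  # main1.mpd
```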
- the technical solution of the embodiment of the present invention is advantageous for making the navigation service more flexible.
- the present invention can implement a personalized navigation service.
- the navigation service can be configured on the client, such as: displayed in a navigation page/window.
- the number of navigation units, the combination of navigation units, the presentation position and order of the navigation units, and so on can all be configured on the client side, which greatly facilitates the use of the navigation service on a variety of different devices.
- for example, mobile phone terminals and tablets differ in capabilities such as display size, resolution, and computing power.
- FIG. 2 is a schematic flowchart diagram of another media presentation and navigation method based on HTTP media stream according to another embodiment of the present invention.
- a media presentation navigation method based on an HTTP media stream provided by another embodiment of the present invention may include:
- the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, where N is an integer greater than 1;
- each of the N navigation units points to a main media presentation, wherein the presentation quality of the main media presentation pointed to by navigation unit i of the N navigation units is higher than the presentation quality of navigation unit i.
- the execution body of the embodiment of the present invention may be a content server or other device.
- the content server can store a media presentation description of the navigation media presentation and can provide it to the client.
- the media presentation description presented by the navigation media describes N navigation units included in the navigation media presentation.
- the client may obtain a media presentation description of the navigation media presentation from a content server or other device.
- N is an integer greater than 1.
- the N may be equal to 7, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
- the client may be a DASH client or other client with DASH client logic function or other client of HTTP-based media streaming service.
- the client may be, for example, a personal computer, a mobile phone, a tablet, a television, or a set top box.
- the navigation media presentation can be seen as a special media presentation.
- the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation. Because each of the N navigation units can point to a main media presentation, an association is in effect introduced between the navigation unit and the main media presentation. When navigation unit i of the N navigation units is selected, this enables the client to obtain the media presentation description of the primary media presentation j pointed to by navigation unit i, and then to obtain the primary media presentation j according to that media presentation description. This lays a foundation for flexible switching between the navigation media presentation and the main media presentation, and thus for supporting video navigation in the HTTP-based media streaming service scenario.
- the presentation quality of the main media presentation pointed by the navigation unit i in the N navigation units is higher than the presentation quality of the navigation unit i. That is to say, the presentation quality of the media expression of the navigation unit is lower than the presentation quality of the main media presentation represented by the navigation unit.
- the media presentation description of the navigation media presentation may be different from the media presentation description of the primary media presentation pointed to by each of the N navigation units. That is, the navigation media presentation may have an independent media presentation description, and the primary media presentation pointed to by each of the N navigation units may also have a media presentation description that is independent of that of the navigation media presentation.
- for example, N navigation units point to N main media presentations, and the N main media presentations each have a corresponding media presentation description, that is, N media presentation descriptions; the media presentation description of the navigation media presentation is different from any of the N media presentation descriptions, that is, the navigation media presentation can be described by an (N+1)th media presentation description.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the N navigation units may be aggregated to form an aggregated media presentation description (also referred to as a supermedia presentation description).
- the aggregated media presentation description may be used to describe both the navigation media presentation and the primary media presentations that the navigation media presentation guides.
- the introduction of the supermedia presentation description helps strengthen the association between the navigation media presentation and the guided primary media presentations.
- the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
- each of the N navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
- the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
- the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the N navigation units may be aggregated to form an aggregated media presentation description.
- each of the N navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component; further, a navigation unit may also include subtitle components or other types of media components.
- the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
- the media presentation description can inform the client which navigation units the navigation service consists of, the components of each navigation unit, the relationship between a navigation unit and the media presentation of a member of the navigation service, the relationship between the video components of the navigation units, the relationship between the audio components of the navigation units, the relationship between the audio component and the video component of a navigation unit, and the like.
- the video components included in different navigation units of the N navigation units are media expressions in different video adaptation sets of N video adaptation sets, where the media expressions in any one of the N video adaptation sets have selective mutual exclusion, and different video adaptation sets of the N video adaptation sets have selection compatibility.
- for example, the video component included in navigation unit i of the N navigation units may belong to the video adaptation set Ci of the N video adaptation sets, and the video component included in navigation unit j of the N navigation units may belong to the video adaptation set Cj of the N video adaptation sets, where the video adaptation set Cj and the video adaptation set Ci are two different video adaptation sets of the N video adaptation sets.
- the navigation unit j and the navigation unit i can be any two navigation units of the N navigation units.
- selection compatibility means that the objects concerned can be selected at the same time. For example, if there is selection compatibility between different video adaptation sets of the N video adaptation sets, media expressions in multiple video adaptation sets of the N video adaptation sets can be selected at the same time.
- selective mutual exclusion means that the objects concerned cannot be selected at the same time. For example, if there is selective mutual exclusion between the media expressions in any one of the N video adaptation sets, multiple media expressions in the same video adaptation set cannot be selected at the same time. For example, assuming that the video adaptation set I of the N video adaptation sets includes 10 media expressions and that there is selective mutual exclusion between the media expressions in that video adaptation set, only one of the 10 media expressions can be selected at a time, and multiple of the 10 media expressions cannot be selected at the same time.
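- as a minimal sketch of these two rules, the following validates a client's selection: media expressions from different adaptation sets may be selected together (selection compatibility), while at most one media expression may be selected from any single adaptation set (selective mutual exclusion). The function name `is_valid_selection` and the identifiers are hypothetical.

```python
from collections import Counter

def is_valid_selection(selection):
    """selection: list of (adaptation_set_id, media_expression_id) pairs."""
    per_set = Counter(set_id for set_id, _ in selection)
    # Mutual exclusion: no adaptation set may contribute more than one entry.
    return all(count == 1 for count in per_set.values())

# One media expression from each of two video adaptation sets: compatible.
print(is_valid_selection([("C1", "r1"), ("C2", "r3")]))  # True

# Two media expressions from the same video adaptation set: mutually exclusive.
print(is_valid_selection([("C1", "r1"), ("C1", "r2")]))  # False
```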
- the audio components included in the N navigation units are media expressions in one audio adaptation set, where the audio adaptation set is different from the N video adaptation sets.
- the audio adaptation set has selection compatibility with the N video adaptation sets. For example, suppose the audio adaptation set includes 20 media expressions; if there is selective mutual exclusion between the media expressions in the audio adaptation set, only one of the 20 media expressions can be selected at a time, and multiple of the 20 media expressions cannot be selected at the same time.
- the audio components included in the different navigation units of the N navigation units are media expressions in different audio adaptation sets in the N audio adaptation sets.
- the different audio adaptation sets in the N audio adaptation sets have selective mutual exclusion.
- the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
- media expressions described by media expression elements that include the same area description have an association relationship, or the adaptation sets described by adaptation set elements that include the same area description have an association relationship.
- for example, denote the media expression described by media expression element i as media expression ri, and the media expression described by media expression element j as media expression rj. If media expression element i and media expression element j contain the same area description, this may indicate that media expression ri and media expression rj have an association relationship.
- for another example, if media expression element i and adaptation set element ci include the same area description, this may also indicate that the media expression described by media expression element i has an association relationship with the media expressions in the adaptation set described by adaptation set element ci. For example, the media expression described by media expression element i may be an audio media expression, and the media expressions in the adaptation set described by adaptation set element ci may be video media expressions.
- the area description may be a spatial relationship description (SRD).
- the area description can also be other types of descriptive information that can be used to describe the location area.
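- as a minimal sketch of association by area description, the following groups elements that carry an identical area description string; elements in the same group are treated as associated. The element identifiers and the comma-separated region strings are illustrative values only, and the helper name `associate_by_region` is hypothetical.

```python
from collections import defaultdict

# (element id, element kind, area description); all values are illustrative.
elements = [
    ("audio-rep-1", "Representation", "0,0,0,960,540,1920,1080"),
    ("video-set-1", "AdaptationSet", "0,0,0,960,540,1920,1080"),
    ("video-set-2", "AdaptationSet", "0,960,0,960,540,1920,1080"),
]

def associate_by_region(items):
    """Return lists of element ids that share the same area description."""
    groups = defaultdict(list)
    for element_id, _kind, region in items:
        groups[region].append(element_id)
    # Only groups with at least two members express an association.
    return [ids for ids in groups.values() if len(ids) > 1]

print(associate_by_region(elements))  # [['audio-rep-1', 'video-set-1']]
```

Here an audio media expression and a video adaptation set share one area description, which corresponds to the case of associating the audio of a navigation unit with its video.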
- the media presentation description of the navigation media presentation includes N video adaptation set elements, and the N video adaptation set elements are in one-to-one correspondence with the N video adaptation sets.
- each of the N video adaptation set elements includes a description sub-element Ci, and the video adaptation sets described by those of the N video adaptation set elements that satisfy a set commonality condition have selection compatibility.
- the set commonality condition may be that both the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are the same.
- the description sub-element Ci may describe that the media expressions in the video adaptation set described by the video adaptation set element including the description sub-element Ci are components of the navigation media presentation, or may describe the role that those media expressions play in the navigation media presentation; the role may be, for example, primary, supplementary, subtitle, or translation dubbing.
- the description sub-element Ci may be, for example, an EssentialProperty element, a SupplementalProperty element, a Role element, or another element.
- when the description sub-element Ci is a role description (Role) element, the set commonality condition may be that the element name of the description sub-element Ci included in the video adaptation set elements is the same, that the scheme identifier (schemeIdUri) attribute is the same, and that the parameter attribute is the same.
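- as a minimal sketch of the set commonality condition, the following groups adaptation set elements by the element name and schemeIdUri attribute of their description sub-element; adaptation sets in the same group would be treated as having selection compatibility. The scheme URI `urn:example:navigation` and the helper name `group_by_commonality` are hypothetical, and the XML namespace of a real media presentation description is omitted.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Illustrative Period fragment; "urn:example:navigation" is a made-up scheme.
PERIOD_XML = """
<Period>
  <AdaptationSet id="1">
    <SupplementalProperty schemeIdUri="urn:example:navigation" value="main"/>
  </AdaptationSet>
  <AdaptationSet id="2">
    <SupplementalProperty schemeIdUri="urn:example:navigation" value="main"/>
  </AdaptationSet>
  <AdaptationSet id="3">
    <Role schemeIdUri="urn:mpeg:dash:role:2011" value="subtitle"/>
  </AdaptationSet>
</Period>
"""

DESCRIPTOR_TAGS = ("EssentialProperty", "SupplementalProperty", "Role")

def group_by_commonality(period):
    """Map (element name, schemeIdUri) to the ids of matching adaptation sets."""
    groups = defaultdict(list)
    for aset in period.findall("AdaptationSet"):
        for child in aset:
            if child.tag in DESCRIPTOR_TAGS:
                key = (child.tag, child.get("schemeIdUri"))
                groups[key].append(aset.get("id"))
    return dict(groups)

groups = group_by_commonality(ET.fromstring(PERIOD_XML))
print(groups[("SupplementalProperty", "urn:example:navigation")])  # ['1', '2']
```

The stricter variant for Role elements could extend the key with the value attribute; that is left out here to keep the sketch short.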
- the media presentation description of the navigation media presentation includes the N video adaptation set elements, and the N video adaptation set elements are in one-to-one correspondence with the N video adaptation sets.
- the video adaptation set element VI, which corresponds to the video adaptation set I among the N video adaptation set elements, includes a pointer for pointing to a primary media presentation, and the video adaptation set I may be any video adaptation set of the N video adaptation sets.
- the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
- the pointer can be carried by an attribute of the video adaptation set element VI.
- the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
- the pointer can be carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by a child element in an EssentialProperty element of the video adaptation set element VI, or the pointer may be carried by an attribute of an EssentialProperty element in the video adaptation set element VI;
- the pointer may be carried by a child element in a SupplementalProperty element in the video adaptation set element VI, or the pointer may be carried by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by a value attribute or other attribute of an EssentialProperty element in the video adaptation set element VI, or the pointer may be carried by a value attribute or other attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base Uniform Resource Locator (BaseURL) element.
- the pointer may also be carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- the ReferencedMediaPresentation element is a newly extended element, that is, the pointer may be carried by an element that is newly added to the video adaptation set element VI.
- the name of the element carrying the pointer is not limited to ReferencedMediaPresentation; other element names may be used.
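- as a minimal sketch of the virtual Representation option, the following treats a Representation element as virtual when it has no SegmentTemplate, SegmentList, or BaseURL child, and reads the pointer from an attribute of that element. The attribute name `mainPresentation`, the URLs, and the helper names are hypothetical; the text does not fix which attribute carries the pointer.

```python
import xml.etree.ElementTree as ET

# Child elements whose absence marks a Representation as virtual.
SEGMENT_INFO_TAGS = {"SegmentTemplate", "SegmentList", "BaseURL"}

# Illustrative adaptation set: one playable and one virtual Representation.
ADAPTATION_SET_XML = """
<AdaptationSet id="1">
  <Representation id="nav1" bandwidth="250000">
    <BaseURL>nav1.mp4</BaseURL>
  </Representation>
  <Representation id="ptr"
                  mainPresentation="http://example.com/main/channel1.mpd"/>
</AdaptationSet>
"""

def is_virtual(representation):
    """A virtual Representation carries no segment addressing information."""
    return not any(child.tag in SEGMENT_INFO_TAGS for child in representation)

def pointer_from_virtual_representation(adaptation_set):
    """Return the pointer carried by a virtual Representation, or None."""
    for rep in adaptation_set.findall("Representation"):
        if is_virtual(rep) and rep.get("mainPresentation"):
            return rep.get("mainPresentation")
    return None

aset = ET.fromstring(ADAPTATION_SET_XML)
print(pointer_from_virtual_representation(aset))
```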
- the time structure of the navigation media presentation may be independent of the time structures of the primary media presentations pointed to by the N navigation units in the navigation media presentation.
- the audio of the navigation unit may be obtained by encoding the audio of the main media presentation, and the video of the navigation unit may be obtained by encoding the video of the main media presentation, so that there need be no correlation between the time structure of the navigation unit and the time structure of the main media presentation.
- FIG. 3-a is a schematic flowchart diagram of a method for providing navigation media presentation based on HTTP streaming media according to another embodiment of the present invention.
- the method for providing navigation media presentation based on HTTP streaming as shown in FIG. 3-a can be implemented based on the network architecture shown in FIG. 3-b.
- the network architecture shown in Figure 3-b mainly includes the DASH client and the content server.
- a method for providing a navigation media presentation based on HTTP streaming media may include:
- the DASH client obtains a media presentation description of the navigation media presentation from the content server.
- the media presentation description presented by the navigation media describes N navigation units included in the navigation media presentation.
- N is an integer greater than 1.
- the N may be equal to 7, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
- the DASH client can be, for example, a personal computer, a mobile phone, a tablet, a television, or a set top box.
- the DASH client acquires, from the content server, K navigation units of the N navigation units according to the media presentation description of the navigation media presentation.
- K is a positive integer less than or equal to N.
- the K may be equal to 1, 2, 3, 4, 5, 8, 11, 15, 20, 25, 30 or other values, for example.
- the K navigation units can be in one-to-one correspondence with K logical presentation units, that is, each of the K navigation units can be presented by a different logical presentation unit.
- the DASH client presents the K navigation units.
- each of the K navigation units can point to a main media presentation.
- the presentation quality of the main media presentation pointed by the navigation unit i in the K navigation units is higher than the presentation quality of the navigation unit i. That is to say, the presentation quality of the media expression of the navigation unit is lower than the presentation quality of the main media presentation represented by the navigation unit.
- the DASH client acquires, from the content server, a media presentation description of the main media presentation pointed by the navigation unit i.
- the DASH client obtains the primary media presentation from the content server based on the media presentation description of the primary media presentation.
- the DASH client presents a primary media presentation pointed by the navigation unit i.
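- as a minimal sketch of the sequence of steps above, the following walks through them against an in-memory stand-in for the content server: obtain the description of the navigation media presentation, take K of the N navigation units, and, when navigation unit i is selected, obtain the description of the main media presentation it points to. The dictionary layout, the URLs, and the function names `fetch` and `navigate` are hypothetical; a real DASH client would issue HTTP requests and parse media presentation descriptions instead.

```python
# In-memory stand-in for the content server; every URL and field is made up.
CONTENT_SERVER = {
    "nav.mpd": {"units": {"u1": "main1.mpd", "u2": "main2.mpd", "u3": "main3.mpd"}},
    "main1.mpd": {"title": "Main media presentation 1"},
    "main2.mpd": {"title": "Main media presentation 2"},
    "main3.mpd": {"title": "Main media presentation 3"},
}

def fetch(url):
    """Stand-in for an HTTP GET against the content server."""
    return CONTENT_SERVER[url]

def navigate(k, selected_unit):
    nav_mpd = fetch("nav.mpd")               # description of the navigation presentation
    presented = list(nav_mpd["units"])[:k]    # K of the N navigation units
    if selected_unit not in presented:
        raise ValueError("selected unit is not among the presented units")
    main_mpd_url = nav_mpd["units"][selected_unit]  # pointer carried by unit i
    main_mpd = fetch(main_mpd_url)            # description of the main presentation
    return presented, main_mpd

presented, main_mpd = navigate(k=2, selected_unit="u2")
print(presented)          # ['u1', 'u2']
print(main_mpd["title"])  # Main media presentation 2
```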
- the media presentation description of the navigation media presentation may be different from the media presentation description of the primary media presentation pointed to by each of the K navigation units.
- that is, the navigation media presentation may have an independent media presentation description, and the primary media presentation pointed to by each of the K navigation units may also have a media presentation description that is independent of that of the navigation media presentation.
- for example, K navigation units point to K main media presentations, and the K main media presentations each have a corresponding media presentation description, that is, K media presentation descriptions; the media presentation description of the navigation media presentation is different from any of the K media presentation descriptions, that is, the navigation media presentation can be described by a (K+1)th media presentation description.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units may be aggregated to form an aggregated media presentation description (also referred to as a supermedia presentation description).
- the aggregated media presentation description may be used to describe both the navigation media presentation and the primary media presentations that the navigation media presentation guides. The introduction of the supermedia presentation description helps strengthen the association between the navigation media presentation and the guided primary media presentations.
- the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
- each of the K navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
- the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
- the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the K navigation units may be aggregated to form an aggregated media presentation description.
- each of the K navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component; further, a navigation unit may also include subtitle components or other types of media components.
- the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
- the media presentation description can inform the client which navigation units the navigation service consists of, the components of each navigation unit, the relationship between a navigation unit and the media presentation of a member of the navigation service, the relationship between the video components of the navigation units, the relationship between the audio components of the navigation units, the relationship between the audio component and the video component of a navigation unit, and the like.
- the video components included in different navigation units of the K navigation units are media expressions in different video adaptation sets of K video adaptation sets, where the media expressions in any one of the K video adaptation sets have selective mutual exclusion, and different video adaptation sets of the K video adaptation sets have selection compatibility.
- for example, the video component included in navigation unit i of the K navigation units may belong to the video adaptation set Ci of the K video adaptation sets, and the video component included in navigation unit j of the K navigation units may belong to the video adaptation set Cj of the K video adaptation sets, where the video adaptation set Cj and the video adaptation set Ci are two different video adaptation sets of the K video adaptation sets.
- the navigation unit j and the navigation unit i can be any two navigation units of the K navigation units.
- selection compatibility means that the objects concerned can be selected at the same time. For example, if there is selection compatibility between different video adaptation sets of the K video adaptation sets, media representations in multiple video adaptation sets of the K video adaptation sets can be selected simultaneously.
- selective mutual exclusion means that the objects concerned cannot be selected at the same time. For example, if there is selective mutual exclusion between the media representations in any one of the K video adaptation sets, multiple media representations in the same video adaptation set cannot be selected at the same time. For example, assuming that the video adaptation set I of the K video adaptation sets includes 10 media representations and that there is selective mutual exclusion between the media representations in that video adaptation set, only one of the 10 media representations can be selected at a time, and multiple of the 10 media representations cannot be selected at the same time.
- the audio components included in the K navigation units are media representations in one audio adaptation set, where the audio adaptation set is different from the K video adaptation sets.
- the audio adaptation set has selection compatibility with the K video adaptation sets. For example, suppose the audio adaptation set includes 20 media expressions; if there is selective mutual exclusion between the media expressions in the audio adaptation set, only one of the 20 media expressions can be selected at a time, and multiple of the 20 media expressions cannot be selected at the same time.
- the audio components included in the different navigation units of the K navigation units are media expressions in different audio adaptation sets in the K audio adaptation sets.
- the different audio adaptation sets in the K audio adaptation sets have selective mutual exclusion.
- the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
- media expressions described by media expression elements that include the same area description have an association relationship, or the adaptation sets described by adaptation set elements that include the same area description have an association relationship.
- for example, denote the media expression described by media expression element i as media expression ri, and the media expression described by media expression element j as media expression rj. If media expression element i and media expression element j contain the same area description, this may indicate that media expression ri and media expression rj have an association relationship.
- for another example, if media expression element i and adaptation set element ci include the same area description, this may also indicate that the media expression described by media expression element i has an association relationship with each media expression in the adaptation set described by adaptation set element ci. For example, the media expression described by media expression element i may be an audio media expression, and the media expressions in the adaptation set described by adaptation set element ci may be video media expressions.
- the area description may be a spatial relationship description (SRD).
- the area description can also be other types of descriptive information that can be used to describe the location area.
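The association by shared region description can be sketched as follows. This is a minimal illustration, not the signaling defined by the invention: the MPD fragment, the element ids and the SRD parameter values are hypothetical, and it assumes the region description is carried as a SupplementalProperty or EssentialProperty descriptor with the SRD scheme identifier `urn:mpeg:dash:srd:2014`.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

NS = {"mpd": "urn:mpeg:dash:schema:mpd:2011"}
SRD_SCHEME = "urn:mpeg:dash:srd:2014"

# Hypothetical fragment: two video adaptation sets and two audio
# representations, each tagged with an illustrative region description.
MPD_FRAGMENT = """
<Period xmlns="urn:mpeg:dash:schema:mpd:2011">
  <AdaptationSet id="v1" contentType="video">
    <SupplementalProperty schemeIdUri="urn:mpeg:dash:srd:2014" value="0,0,0,1,1,4,4"/>
    <Representation id="v1-low" bandwidth="200000"/>
  </AdaptationSet>
  <AdaptationSet id="v2" contentType="video">
    <SupplementalProperty schemeIdUri="urn:mpeg:dash:srd:2014" value="0,1,0,1,1,4,4"/>
    <Representation id="v2-low" bandwidth="200000"/>
  </AdaptationSet>
  <AdaptationSet id="a" contentType="audio">
    <Representation id="a1" bandwidth="64000">
      <SupplementalProperty schemeIdUri="urn:mpeg:dash:srd:2014" value="0,0,0,1,1,4,4"/>
    </Representation>
    <Representation id="a2" bandwidth="64000">
      <SupplementalProperty schemeIdUri="urn:mpeg:dash:srd:2014" value="0,1,0,1,1,4,4"/>
    </Representation>
  </AdaptationSet>
</Period>
"""


def region_of(element):
    """Return the region description carried directly by this element, or None."""
    for tag in ("mpd:SupplementalProperty", "mpd:EssentialProperty"):
        for prop in element.findall(tag, NS):
            if prop.get("schemeIdUri") == SRD_SCHEME:
                return prop.get("value")
    return None


def associate_by_region(period):
    """Group adaptation sets and representations that share a region description."""
    groups = defaultdict(list)
    for aset in period.findall("mpd:AdaptationSet", NS):
        region = region_of(aset)
        if region is not None:
            groups[region].append(("AdaptationSet", aset.get("id")))
        for rep in aset.findall("mpd:Representation", NS):
            region = region_of(rep)
            if region is not None:
                groups[region].append(("Representation", rep.get("id")))
    return dict(groups)


associations = associate_by_region(ET.fromstring(MPD_FRAGMENT))
```

In this sketch the audio representation `a1` ends up in the same group as the video adaptation set `v1`, which mirrors the case where a media representation element and an adaptation set element carry the same region description.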
- the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
- each of the K video adaptation set elements includes a description sub-element Ci, and the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition have selection compatibility.
- the set commonality condition may be that the element name and the method identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are both the same.
- the description sub-element Ci may describe that the media representations in the video adaptation set described by the video adaptation set element that includes it are components of the navigation media presentation, or may describe the role that those media representations play in the navigation media presentation; the role may be, for example, main, supplementary, subtitle, or translated dubbing.
- the description sub-element Ci may be, for example, an EssentialProperty element, a SupplementalProperty element, a Role element, or another element.
- when the description sub-element Ci is a role description (Role) element, the set commonality condition may be that the element name, the method identifier (schemeIdUri) attribute, and the parameter attribute of the description sub-element Ci included in the video adaptation set elements are all the same.
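The commonality condition can be illustrated with a short sketch that groups adaptation sets by the element name, schemeIdUri and parameter of their description sub-element. The fragment and ids below are hypothetical, and the sketch assumes the parameter is carried in the descriptor's `value` attribute, as it is for the DASH Role descriptor (`urn:mpeg:dash:role:2011`).

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

MPD_NS = "{urn:mpeg:dash:schema:mpd:2011}"

# Hypothetical fragment, not taken from the patent's examples.
FRAGMENT = """
<Period xmlns="urn:mpeg:dash:schema:mpd:2011">
  <AdaptationSet id="nav1" contentType="video">
    <Role schemeIdUri="urn:mpeg:dash:role:2011" value="main"/>
  </AdaptationSet>
  <AdaptationSet id="nav2" contentType="video">
    <Role schemeIdUri="urn:mpeg:dash:role:2011" value="main"/>
  </AdaptationSet>
  <AdaptationSet id="extra" contentType="video">
    <Role schemeIdUri="urn:mpeg:dash:role:2011" value="supplementary"/>
  </AdaptationSet>
</Period>
"""


def compatible_groups(period, descriptor_name="Role"):
    """Group adaptation set ids whose descriptor has the same
    (element name, schemeIdUri, value) triple."""
    groups = defaultdict(list)
    for aset in period.findall(MPD_NS + "AdaptationSet"):
        for desc in aset.findall(MPD_NS + descriptor_name):
            key = (descriptor_name, desc.get("schemeIdUri"), desc.get("value"))
            groups[key].append(aset.get("id"))
    return dict(groups)


groups = compatible_groups(ET.fromstring(FRAGMENT))
```

Adaptation sets that land in the same group are the ones a client could treat as selectable together under this reading of the condition.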
- the media presentation description of the navigation media presentation includes the K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
- the video adaptation set element VI, which among the K video adaptation set elements corresponds to video adaptation set I, includes a pointer to a primary media presentation; video adaptation set I may be any one of the K video adaptation sets.
- the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
- the pointer can be carried by an attribute of the video adaptation set element VI.
- the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
- the pointer can be carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of that EssentialProperty element;
- the pointer may be carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of that SupplementalProperty element.
- the pointer may be carried by the value attribute or another attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute or another attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or by a child element of that virtual Representation element, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base Uniform Resource Locator (BaseURL) element.
- the pointer can also be carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- the ReferencedMediaPresentation element is a newly extended element; that is, the pointer can be carried by an element newly added to the video adaptation set element VI.
- the name of the newly added element that carries the pointer is not limited to ReferencedMediaPresentation and may be another element name.
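The alternative carriers for the pointer can be sketched as a lookup that tries each one in turn. This is only an illustration under stated assumptions: the fragment and URLs are hypothetical, the descriptor is matched on the scheme identifier `urn:mpeg:dash:mosaic:2011` that the description gives as an example, and the `href` attribute on ReferencedMediaPresentation is an assumed way for that element to carry the pointer (the description does not say which attribute or child it uses).

```python
import xml.etree.ElementTree as ET

MPD_NS = "{urn:mpeg:dash:schema:mpd:2011}"
XLINK_HREF = "{http://www.w3.org/1999/xlink}href"
NAV_SCHEME = "urn:mpeg:dash:mosaic:2011"

# Hypothetical fragment: each adaptation set uses a different carrier.
FRAGMENT = """
<Period xmlns="urn:mpeg:dash:schema:mpd:2011"
        xmlns:xlink="http://www.w3.org/1999/xlink">
  <AdaptationSet id="nav1" xlink:href="http://example.com/main1.mpd"/>
  <AdaptationSet id="nav2">
    <SupplementalProperty schemeIdUri="urn:mpeg:dash:mosaic:2011"
                          value="http://example.com/main2.mpd"/>
  </AdaptationSet>
  <AdaptationSet id="nav3">
    <ReferencedMediaPresentation href="http://example.com/main3.mpd"/>
  </AdaptationSet>
  <AdaptationSet id="nav4"/>
</Period>
"""


def find_pointer(aset):
    """Return (carrier, uri) for the first pointer found, or None."""
    # Carrier 1: an attribute of the adaptation set element itself.
    href = aset.get(XLINK_HREF)
    if href:
        return ("xlink:href", href)
    # Carrier 2: the value attribute of a property descriptor.
    for name in ("EssentialProperty", "SupplementalProperty"):
        for prop in aset.findall(MPD_NS + name):
            if prop.get("schemeIdUri") == NAV_SCHEME and prop.get("value"):
                return (name, prop.get("value"))
    # Carrier 3: a newly added element (the "href" attribute is an assumption).
    ref = aset.find(MPD_NS + "ReferencedMediaPresentation")
    if ref is not None and ref.get("href"):
        return ("ReferencedMediaPresentation", ref.get("href"))
    return None


pointers = {aset.get("id"): find_pointer(aset) for aset in ET.fromstring(FRAGMENT)}
```

The order in which carriers are tried is a design choice of this sketch; the description leaves the location of the pointer to the needs of the scenario.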
- the time structure of the navigation media presentation may be independent of the time structure of the primary media presentations pointed to by the K navigation units in the navigation media presentation.
- the audio of a navigation unit may be obtained by encoding the audio of the main media presentation, and the video of a navigation unit may be obtained by encoding the video of the main media presentation, so there may be no correlation between the time structure of the navigation unit and the time structure of the main media presentation.
- because each of the K navigation units points to a main media presentation, an association is in effect introduced between the navigation unit and the main media presentation. When navigation unit i of the K navigation units is selected, this association enables the DASH client to obtain the media presentation description of the main media presentation j pointed to by navigation unit i, and then to obtain main media presentation j for presentation according to that description. This facilitates more flexible switching between the navigation media presentation and the main media presentation, supports video navigation in HTTP-based media streaming scenarios, and in turn helps improve the user experience.
- the videos of the navigation units are parallel: the videos of a plurality of navigation units are presented together on the display screen of the user equipment or in a window. Their audio is mutually exclusive: at any time the audio of only one navigation unit is selected and played, namely the navigation unit whose video picture is the focus of the user's attention.
- the navigation service needs to be supported by the corresponding signaling mechanism.
- the signaling informs the client which navigation units the navigation service consists of, the components of each navigation unit, the relationship between a navigation unit and the member media presentations of the navigation service, the relationship between the video components of the navigation units, the relationship between the audio components of the navigation units, and the relationship between the audio component and the video component of a navigation unit.
- the signaling of the navigation service is carried by the description file of the navigation media presentation and is implemented as elements in that file that express the various relationships between the media representations of the media components.
- the navigation service in the examples serves 16 member media presentations.
- These MPD examples can be based on some of the following DASH specifications and their additions:
- ISO/IEC 23009-1 Part 1: Media presentation description and segment formats, 2nd Edition, 2014.
- AMENDMENT 1: High Profile and Availability Time Synchronization.
- ISO/IEC 23009-1 2014/FDAM 1 Part 1: Media presentation description and segment formats.
- AMENDMENT 2: Spatial Relationship Description, Generalized URL parameters and other extensions.
- each example is not a complete MPD but an MPD fragment chosen to illustrate the features of the present invention.
- example scenario S1 illustrates a signaling mechanism for the navigation service that informs the client which navigation units the navigation service consists of, the components of each navigation unit, and the relationship between a navigation unit and the member media presentations of the navigation service.
- a Role element is used in each adaptation set element, including the video adaptation set elements and the audio adaptation set elements.
- adaptation sets whose adaptation set elements contain a Role description sub-element with the parameter "main" are compatible with one another and can be selected together by the client.
- media representations in multiple video adaptation sets, that is, the video media representations of different navigation units, can be selected together and presented on the client.
- the relationship between the video of a navigation unit and the main media presentation it represents is expressed by an attribute of the navigation unit's video adaptation set element, specifically the attribute @xlink:href, which is essentially a pointer. It is used to point to the media presentation description of a remote main media presentation. Because the element pointed to is not an adaptation set element, it is not embedded in the navigation media presentation description (the MPD data model is hierarchical, and an element contains only elements of lower-level types, not elements of higher-level types); this can be expressed with @xlink:show.
- ordinarily, the element pointed to by @xlink:href has the same type as the element in which the attribute is located; that is, if the attribute is at the level of the adaptation set element, the element it points to is of the adaptation set element type. In the present invention, the attribute is extended with respect to the type of the element pointed to, so that it can be used to point to a media presentation.
- as a result, the adaptation set element both points to a remote element and contains local media representations, which is not the case in the existing DASH specification.
- the association between the audio media representation and the video media representation of the same navigation unit is established through association signaling.
- the @associationId attribute refers to the identifier of the associated video media representation, that is, the value of its @id. The @associationType attribute may be absent, indicating an unknown relationship, or a new association type may be defined, such as "accompany".
- the semantic differences between the elements of the media presentation description are reflected in the behavior of the client.
- the client selects multiple media representations that have the same status in the navigation service.
- this status is described by the Role description sub-element in the adaptation set element to which a media representation belongs.
- the parameter of the Role description sub-element is "main", which indicates that the media representations in the adaptation set are main components of the media presentation.
- the client selects the video media representations of a plurality of navigation units, requests the segments of those media representations from the content server, processes them, and presents them to the user together. Matters such as how many video adaptation sets (video media representations) to select, the order in which to present them, the layout of the presentation, and the presentation mode (for example, a moving image sequence) can all be determined by the client, according to the user's instructions, the user's configuration of the client, the capabilities of the client, and the like.
- the client selects the audio media representation of a navigation unit, acquires the segments of that audio media representation, and plays the audio.
- the client switches to the primary media presentation.
- the switching process may include the following steps: first, the client obtains the media presentation description of the main media presentation according to the pointer in the navigation unit; second, it parses that media presentation description and selects appropriate media representations; third, it joins the main media presentation at a certain time position, which is in effect a seek operation. If the navigation service serves live media presentations, this time position is the time position of the media content at which the switch occurred, that is, the time position at which the navigation service was interrupted.
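The three switching steps can be sketched as follows. This is a simplified illustration, not the client behaviour specified by the invention: `plan_switch`, `SwitchPlan`, `fake_fetch` and the MPD content are hypothetical names and data, the fetch function is injected so that the sketch runs without a network, and the rule "highest bandwidth that fits, otherwise the lowest" is an assumed selection policy.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass

MPD_NS = "{urn:mpeg:dash:schema:mpd:2011}"


@dataclass
class SwitchPlan:
    mpd_url: str
    representation_id: str
    seek_to_s: float


def plan_switch(pointer_url, fetch_mpd, available_bps, switch_time_s):
    # Step 1: obtain the main MPD through the pointer of the navigation unit.
    root = ET.fromstring(fetch_mpd(pointer_url))

    # Step 2: parse the MPD and select a suitable video representation.
    video_reps = [
        rep
        for aset in root.iter(MPD_NS + "AdaptationSet")
        if aset.get("contentType") == "video"
        for rep in aset.findall(MPD_NS + "Representation")
    ]
    if not video_reps:
        raise ValueError("no video representation in the main MPD")

    def bandwidth(rep):
        return int(rep.get("bandwidth"))

    affordable = [rep for rep in video_reps if bandwidth(rep) <= available_bps]
    chosen = max(affordable, key=bandwidth) if affordable else min(video_reps, key=bandwidth)

    # Step 3: join the main presentation at the time position of the switch (a seek).
    return SwitchPlan(pointer_url, chosen.get("id"), switch_time_s)


def fake_fetch(url):
    """Stand-in for an HTTP GET; returns a hypothetical main MPD."""
    return """
    <MPD xmlns="urn:mpeg:dash:schema:mpd:2011">
      <Period>
        <AdaptationSet contentType="video">
          <Representation id="low" bandwidth="500000"/>
          <Representation id="mid" bandwidth="2000000"/>
          <Representation id="high" bandwidth="6000000"/>
        </AdaptationSet>
      </Period>
    </MPD>
    """


plan = plan_switch("http://example.com/main1.mpd", fake_fetch, 3_000_000, 42.5)
```

A real client would replace `fake_fetch` with an HTTP request and would also choose audio and other components; the sketch keeps only the structure of the three steps.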
- example scenario S2 illustrates a signaling mechanism for the navigation service and the composition of the MPD used to represent the navigation service.
- the navigation description method takes a Uniform Resource Identifier (URI) as a parameter. The URI points to a media presentation; in practice it does so by pointing to the media presentation description of that media presentation.
- the method has a method identifier, for example urn:mpeg:dash:mosaic:2011.
- the @schemeIdUri of the SupplementalProperty descriptor is the method identifier; it indicates that the element containing the descriptor, an adaptation set or a media representation, is a component of the navigation service.
- the attribute @value of the descriptor is the parameter of the navigation description method, namely a URI that points to the media presentation description of the main media presentation.
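A short sketch of reading such descriptors is given below. The fragment, the ids and the URLs are hypothetical, and resolving a relative @value against the URL of the navigation MPD is an assumption of this sketch; the description only states that @value is a URI pointing to the main media presentation description.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urljoin

MPD_NS = "{urn:mpeg:dash:schema:mpd:2011}"
NAV_SCHEME = "urn:mpeg:dash:mosaic:2011"
NAV_MPD_URL = "http://example.com/nav/nav.mpd"  # hypothetical location

# Hypothetical fragment of a navigation MPD.
FRAGMENT = """
<Period xmlns="urn:mpeg:dash:schema:mpd:2011">
  <AdaptationSet id="nav1" contentType="video">
    <SupplementalProperty schemeIdUri="urn:mpeg:dash:mosaic:2011"
                          value="../main/ch1.mpd"/>
  </AdaptationSet>
  <AdaptationSet id="nav2" contentType="video">
    <SupplementalProperty schemeIdUri="urn:mpeg:dash:mosaic:2011"
                          value="http://cdn.example.org/ch2.mpd"/>
  </AdaptationSet>
</Period>
"""


def main_mpd_urls(period, nav_mpd_url):
    """Map each navigation adaptation set id to the absolute URL of its main MPD."""
    urls = {}
    for aset in period.findall(MPD_NS + "AdaptationSet"):
        for prop in aset.findall(MPD_NS + "SupplementalProperty"):
            if prop.get("schemeIdUri") == NAV_SCHEME:
                # Relative values are resolved against the navigation MPD URL.
                urls[aset.get("id")] = urljoin(nav_mpd_url, prop.get("value"))
    return urls


urls = main_mpd_urls(ET.fromstring(FRAGMENT), NAV_MPD_URL)
```

Because the descriptor is a SupplementalProperty, a client that does not recognise the scheme identifier can ignore it and still process the adaptation set, which is one reason a deployment might prefer it over EssentialProperty.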
- one video adaptation set (corresponding to one navigation unit) has two media representations.
- one of them is a virtual media representation that contains no segments but refers to the main media presentation represented by the navigation unit; in practice it points to that media presentation by pointing to its media presentation description.
- the segment template does not appear at the level of the adaptation set element but in the actual media representation element.
- the type of a referenced remote element may be known only after it has been parsed, because a remote element is just an XML object whose type may be a media presentation description, a period, or an adaptation set. If the compatibility restrictions are relaxed, a new element can be introduced in the media presentation description to indicate explicitly that a media presentation is being referenced, which avoids this ambiguity. This element can belong to parent elements at different levels, such as adaptation sets or media representations.
- the referenced media presentation (ReferencedMediaPresentation) element in example scenario S4 is one specific implementation.
- An example of an aggregated media presentation description is given in example scenario S5.
- the aggregated media presentation description is a superset of the MPD. It describes multiple parallel media presentations, including the member media presentations and the navigation media presentation.
- a presentation element is introduced in the aggregated media presentation description; it can be a remote element that points to a media presentation description, or it can embed a media presentation description.
- in the example, the media presentation descriptions of the member media presentations are remote elements, while that of the navigation media presentation is local, that is, an embedded media presentation description.
- Embodiments of the present invention also provide related apparatus for implementing the above solution.
- an embodiment of the present invention provides a client 400, which may include:
- the first obtaining unit 410 is configured to obtain a media presentation description of the navigation media presentation, where the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, and N is an integer greater than 1;
- the second obtaining unit 420 is configured to acquire K navigation units of the N navigation units according to the media presentation description of the navigation media presentation;
- a presentation unit 430 is configured to present the K navigation units, each of which points to a main media presentation, where the presentation quality of the main media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i.
- the media presentation description of the navigation media presentation may be different from the media presentation description of the primary media presentation pointed to by each of the K navigation units.
- the navigation media presentation may have an independent media presentation description; that is, the media presentation description of the primary media presentation pointed to by each of the K navigation units may be independent of the media presentation description of the navigation media presentation.
- for example, the K navigation units point to K main media presentations, each of which has its own media presentation description, giving K media presentation descriptions; the media presentation description of the navigation media presentation is different from every one of these K media presentation descriptions, that is, the navigation media presentation is described by a (K+1)-th media presentation description.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units can be aggregated to form an aggregated media presentation description (also referred to as a super media presentation description).
- an aggregated media presentation description (which may be referred to as a super media presentation description) may be used to describe the navigation media presentation and the primary media presentations to which it points.
- the introduction of the super media presentation description helps strengthen the association between the navigation media presentation and the main media presentations being navigated.
- the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
- each of the K navigation units can point to the primary media presentation described by the media presentation description in a manner that points to a media presentation description.
- the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
- the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the K navigation units may be aggregated to form an aggregated media presentation description.
- each of the K navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component; further, a navigation unit may also include a subtitle component or other types of media components.
- the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
- the media presentation description can inform the client which navigation units the navigation service consists of, the components of each navigation unit, the relationship between a navigation unit and the member media presentations of the navigation service, the relationship between the video components of the navigation units, the relationship between the audio components of the navigation units, the relationship between the audio component and the video component of a navigation unit, and the like.
- the video components included in different navigation units of the K navigation units are media representations in different video adaptation sets among the K video adaptation sets, where the media representations in any one of the K video adaptation sets are selectively mutually exclusive, and different video adaptation sets among the K video adaptation sets have selection compatibility.
- the video component included in navigation unit i of the K navigation units may belong to video adaptation set Ci of the K video adaptation sets, and the video component included in navigation unit j of the K navigation units may belong to video adaptation set Cj of the K video adaptation sets, where video adaptation set Cj and video adaptation set Ci are two different video adaptation sets among the K video adaptation sets.
- the navigation unit j and the navigation unit i can be any two navigation units of the K navigation units.
- selection compatibility means that the objects concerned can be selected at the same time. For example, if different video adaptation sets among the K video adaptation sets have selection compatibility, media representations in several of the K video adaptation sets can be selected at the same time.
- selection mutual exclusion means that the objects concerned cannot be selected at the same time. For example, if the media representations in any one of the K video adaptation sets are selectively mutually exclusive, several media representations in the same video adaptation set cannot be selected at the same time. Assuming, for example, that video adaptation set I of the K video adaptation sets includes 10 media representations that are selectively mutually exclusive, only one of the 10 media representations can be selected at a time, and several of them cannot be selected simultaneously.
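The two selection rules can be checked with a few lines of code. The sketch below is illustrative only: the representation ids, the adaptation set names and the function `exclusion_violations` are hypothetical. It treats a selection as valid when it takes at most one representation from each adaptation set, while placing no limit on how many different adaptation sets are used.

```python
from collections import Counter

# Hypothetical mapping from representation id to its adaptation set.
SET_OF = {
    "c1-low": "C1", "c1-high": "C1",
    "c2-low": "C2", "c2-high": "C2",
    "c3-low": "C3",
}


def exclusion_violations(selection, set_of=SET_OF):
    """Return the adaptation sets from which more than one representation
    was selected, i.e. the sets whose mutual exclusion is violated."""
    counts = Counter(set_of[rep_id] for rep_id in selection)
    return sorted(name for name, n in counts.items() if n > 1)


# One representation from each of three sets: compatible, no violation.
ok = exclusion_violations(["c1-low", "c2-low", "c3-low"])
# Two representations from C1: violates mutual exclusion within C1.
bad = exclusion_violations(["c1-low", "c1-high", "c2-low"])
```

Here `ok` is empty and `bad` names the adaptation set `C1`.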
- the audio components included in the K navigation units are media representations in one audio adaptation set, where the audio adaptation set is different from the K video adaptation sets.
- the audio adaptation set has selection compatibility with the K video adaptation sets. For example, suppose the audio adaptation set includes 20 media representations; if the media representations in the audio adaptation set are selectively mutually exclusive, then only one of the 20 media representations can be selected at a time, and several of the 20 cannot be selected simultaneously.
- the audio components included in the different navigation units of the K navigation units are media expressions in different audio adaptation sets in the K audio adaptation sets.
- the different audio adaptation sets in the K audio adaptation sets have selective mutual exclusion.
- the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
- the media representations described by media representation elements that include the same region description are associated with one another, as are the adaptation sets described by adaptation set elements that include the same region description.
- for example, let the media representation described by media representation element i be media representation ri, and let the media representation described by media representation element j be media representation rj. If media representation element i and media representation element j contain the same region description, this may indicate that media representation ri and media representation rj are associated.
- likewise, if media representation element i and adaptation set element ci include the same region description, this may indicate that the media representation described by media representation element i is associated with the media representations in the adaptation set described by adaptation set element ci. For example, the media representation described by media representation element i may be an audio media representation, and the media representations in the adaptation set described by adaptation set element ci may be video media representations.
- the area description may be a spatial relationship description (SRD).
- the area description can also be other types of descriptive information that can be used to describe the location area.
- the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
- each of the K video adaptation set elements includes a description sub-element Ci, and the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition have selection compatibility.
- the set commonality condition may be that the element name and the method identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are both the same.
- the description sub-element Ci may describe that the media representations in the video adaptation set described by the video adaptation set element that includes it are components of the navigation media presentation, or may describe the role that those media representations play in the navigation media presentation; the role may be, for example, main, supplementary, subtitle, or translated dubbing.
- the description sub-element Ci may be, for example, an EssentialProperty element, a SupplementalProperty element, a Role element, or another element.
- when the description sub-element Ci is a role description (Role) element, the set commonality condition may be that the element name, the method identifier (schemeIdUri) attribute, and the parameter attribute of the description sub-element Ci included in the video adaptation set elements are all the same.
- the media presentation description of the navigation media presentation includes the K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
- the video adaptation set element VI, which among the K video adaptation set elements corresponds to video adaptation set I, includes a pointer to a primary media presentation; video adaptation set I may be any one of the K video adaptation sets.
- the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
- the pointer can be carried by an attribute of the video adaptation set element VI.
- the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
- the pointer can be carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of that EssentialProperty element;
- the pointer may be carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of that SupplementalProperty element.
- the pointer may be carried by the value attribute or another attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute or another attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or by a child element of that virtual Representation element, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base Uniform Resource Locator (BaseURL) element.
- the pointer can also be carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- the ReferencedMediaPresentation element is a newly extended element; that is, the pointer can be carried by an element newly added to the video adaptation set element VI.
- the name of the newly added element that carries the pointer is not limited to ReferencedMediaPresentation and may be another element name.
- the time structure of the navigation media presentation may be independent of the time structure of the primary media presentations pointed to by the K navigation units in the navigation media presentation.
- the audio of a navigation unit may be obtained by encoding the audio of the main media presentation, and the video of a navigation unit may be obtained by encoding the video of the main media presentation, so there may be no correlation between the time structure of the navigation unit and the time structure of the main media presentation.
- the presentation unit is further configured to present navigation unit i if the focus of attention stays on navigation unit i of the K navigation units.
- the presentation unit is further configured to acquire, by means of navigation unit i of the K navigation units, the main media presentation pointed to by navigation unit i. Further, the client may also present the main media presentation pointed to by navigation unit i.
- the client 400 can be, for example, a personal computer, a mobile phone, a tablet computer, a television set or a set top box.
- the client 400 can be used to implement any of the media presentation navigation methods based on the hypertext transfer protocol media stream provided by the foregoing embodiments.
- because each of the K navigation units points to a main media presentation, an association is in effect introduced between the navigation unit and the main media presentation. When navigation unit i of the K navigation units is selected, this association enables the client 400 to obtain the media presentation description of the main media presentation j pointed to by navigation unit i, and then to obtain main media presentation j according to that description. This facilitates more flexible switching between the navigation media presentation and the primary media presentation.
- video navigation is thereby supported in HTTP-based media streaming scenarios, which helps improve the user experience.
- a client 500 may include:
- the processor 502 and the memory 503 are coupled by a bus 501.
- the processor 502, by calling code or instructions in the memory 503, is configured to: obtain a media presentation description of the navigation media presentation, where the media presentation description describes the N navigation units included in the navigation media presentation and N is an integer greater than 1; acquire K navigation units of the N navigation units according to the media presentation description of the navigation media presentation; and present the K navigation units, each of which points to a main media presentation, where the presentation quality of the main media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i.
- the media presentation description of the navigation media presentation may be different from the media presentation description of the primary media presentation pointed to by each of the K navigation units.
- the navigation media presentation may have an independent media presentation description; that is, the media presentation description of the primary media presentation pointed to by each of the K navigation units may be independent of the media presentation description of the navigation media presentation.
- for example, the K navigation units point to K main media presentations, each of which has its own media presentation description, giving K media presentation descriptions; the media presentation description of the navigation media presentation is different from every one of these K media presentation descriptions, that is, the navigation media presentation is described by a (K+1)-th media presentation description.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units may be aggregated to form an aggregated media presentation description (also referred to as a super media presentation description).
- an aggregated media presentation description (which may be referred to as a super media presentation description) may be used to describe the navigation media presentation and the primary media presentations to which it points. The introduction of the super media presentation description helps strengthen the association between the navigation media presentation and the main media presentations being navigated.
- the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
- each of the K navigation units can point to a main media presentation by pointing to the media presentation description that describes that main media presentation.
- the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
- the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the K navigation units may be aggregated to form an aggregated media presentation description.
- each of the K navigation units may point to a primary media presentation in a manner that references the presentation elements in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component. Further, a navigation unit may also include a subtitle component or other types of media components.
- the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
- the media presentation description can inform the client of which navigation units the navigation service consists of, the components of each navigation unit, the relationship between a navigation unit and the member media presentations of the navigation service, the relationship between the video components of the navigation units, the relationship between the audio components of the navigation units, the relationship between the audio component and the video component of a navigation unit, and the like.
- the video components included in different navigation units of the K navigation units are media representations in different video adaptation sets of K video adaptation sets, where the media representations within any one of the K video adaptation sets have selection mutual exclusion, and different video adaptation sets of the K video adaptation sets have selection compatibility.
- for example, the video component included in navigation unit i of the K navigation units may belong to video adaptation set Ci of the K video adaptation sets, and the video component included in navigation unit j of the K navigation units may belong to video adaptation set Cj of the K video adaptation sets, where video adaptation set Cj and video adaptation set Ci are two different video adaptation sets of the K video adaptation sets.
- navigation unit j and navigation unit i can be any two navigation units of the K navigation units.
- selection compatibility means that the objects can be selected at the same time. For example, if different video adaptation sets of the K video adaptation sets have selection compatibility, media representations in multiple of the K video adaptation sets can be selected at the same time.
- selection mutual exclusion means that the objects cannot be selected at the same time. For example, if the media representations in any one of the K video adaptation sets have selection mutual exclusion, multiple media representations in the same video adaptation set cannot be selected at the same time. Assuming that video adaptation set I of the K video adaptation sets includes more than 10 media representations and these media representations have selection mutual exclusion, only one of them can be selected at a time, and several of them cannot be selected at the same time.
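- as a minimal sketch of the two selection rules above, the following Python function checks whether a set of selected media representations respects them. The identifiers (`v1_low`, `C1`, and so on) and the function name are hypothetical and only serve the illustration; they are not defined by this description or by the DASH standard.

```python
from collections import Counter

def is_valid_selection(selected, adaptation_set_of):
    """Return True if no adaptation set contributes more than one selected representation.

    selected: iterable of representation ids (hypothetical identifiers).
    adaptation_set_of: dict mapping representation id -> adaptation set id.
    """
    # Count how many selected representations fall into each adaptation set.
    counts = Counter(adaptation_set_of[rep_id] for rep_id in selected)
    # Mutual exclusion inside a set: at most one representation per adaptation set.
    # Compatibility across sets: any number of distinct sets may appear.
    return all(n <= 1 for n in counts.values())

# Hypothetical layout: two video adaptation sets with two representations each.
adaptation_set_of = {
    "v1_low": "C1", "v1_high": "C1",
    "v2_low": "C2", "v2_high": "C2",
}

ok_across_sets = is_valid_selection(["v1_low", "v2_high"], adaptation_set_of)
ok_within_set = is_valid_selection(["v1_low", "v1_high"], adaptation_set_of)
```

Here the first selection draws one representation from each of two adaptation sets and is accepted, while the second draws two representations from the same adaptation set and is rejected.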
- the audio components included in the K navigation units are media representations in one audio adaptation set, where the audio adaptation set is different from the K video adaptation sets.
- the audio adaptation set has selection compatibility with the K video adaptation sets. For example, assuming that the audio adaptation set includes more than 20 media representations and these media representations have selection mutual exclusion, only one of them can be selected at a time, and several of them cannot be selected at the same time.
- the audio components included in the different navigation units of the K navigation units are media expressions in different audio adaptation sets in the K audio adaptation sets.
- the different audio adaptation sets in the K audio adaptation sets have selective mutual exclusion.
- the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
- the media representations described by media representation elements that include the same area description are associated with each other, or the adaptation sets described by adaptation set elements that include the same area description are associated with each other.
- for example, denote the media representation described by media representation element i as ri, and the media representation described by media representation element j as rj. If media representation element i and media representation element j contain the same area description, this indicates that media representation ri and media representation rj are associated.
- for another example, if media representation element i and adaptation set element ci include the same area description, this indicates that the media representation described by media representation element i is associated with the media representations in the adaptation set described by adaptation set element ci.
- the media representation described by media representation element i may be an audio media representation, and the media representations in the adaptation set described by adaptation set element ci may be video media representations.
- the area description may be a spatial relationship description (SRD).
- the area description can also be other types of descriptive information that can be used to describe the location area.
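- the association by shared area description can be sketched as a simple grouping step. In the Python example below, the element ids and the area strings are hypothetical placeholders; the strings are opaque keys for the purpose of the sketch and are not meant to reproduce the exact SRD value syntax.

```python
from collections import defaultdict

def group_by_area(elements):
    """Group element ids by their area description.

    elements: list of (element_id, area_description) tuples;
    area_description is None for elements that carry no area description.
    Returns only the areas shared by at least two elements.
    """
    groups = defaultdict(list)
    for element_id, area in elements:
        if area is not None:
            groups[area].append(element_id)
    # An association exists only where two or more elements share the same description.
    return {area: ids for area, ids in groups.items() if len(ids) > 1}

# Hypothetical elements: audio representations and video adaptation sets per screen area.
elements = [
    ("audio_rep_1", "0,0,0,960,540"),
    ("video_set_1", "0,0,0,960,540"),
    ("audio_rep_2", "0,960,0,960,540"),
    ("video_set_2", "0,960,0,960,540"),
    ("subtitle_rep", None),
]
associations = group_by_area(elements)
```

Each resulting group pairs an audio media representation with the video adaptation set that carries the same area description, which is the audio/video association described above.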
- the media presentation description of the navigation media presentation includes K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
- each of the K video adaptation set elements includes a description sub-element Ci, and the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition have selection compatibility.
- the set commonality condition may be that both the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are the same.
- the description sub-element Ci may describe that the media representations in the video adaptation set described by the video adaptation set element including the description sub-element Ci are components of the navigation media presentation, or may describe the role that these media representations play in the navigation media presentation. The role may be, for example, main, supplementary, subtitle, or dubbed translation.
- the description sub-element Ci may be, for example, an EssentialProperty element, a SupplementalProperty element, a Role element, or another element.
- for example, when the description sub-element Ci is a role description (Role) element, the set commonality condition may be that the element name, the scheme identifier (schemeIdUri) attribute, and the parameter attribute of the description sub-element Ci included in the video adaptation set elements are all the same.
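- the commonality condition can be illustrated with a small grouping routine. The sketch below assumes a simplified MPD fragment without XML namespaces, uses the hypothetical scheme URIs `urn:example:navigation` and `urn:example:other`, and assumes the parameter attribute is named `value`; none of these are prescribed by this description.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Simplified, namespace-free fragment; the scheme URIs are hypothetical.
MPD_XML = """
<Period>
  <AdaptationSet id="1"><Role schemeIdUri="urn:example:navigation" value="main"/></AdaptationSet>
  <AdaptationSet id="2"><Role schemeIdUri="urn:example:navigation" value="main"/></AdaptationSet>
  <AdaptationSet id="3"><Role schemeIdUri="urn:example:other" value="main"/></AdaptationSet>
</Period>
"""

def compatible_groups(period, child_tag="Role", attrs=("schemeIdUri", "value")):
    """Group adaptation set ids whose description sub-element matches on name and attributes."""
    groups = defaultdict(list)
    for aset in period.findall("AdaptationSet"):
        child = aset.find(child_tag)
        if child is None:
            continue  # no description sub-element, so the condition cannot be evaluated
        # The key is the commonality condition: element name plus the compared attributes.
        key = (child.tag,) + tuple(child.get(a) for a in attrs)
        groups[key].append(aset.get("id"))
    return dict(groups)

groups = compatible_groups(ET.fromstring(MPD_XML))
```

Adaptation sets 1 and 2 land in the same group and would therefore be treated as selection compatible, while adaptation set 3 uses a different scheme identifier and falls into its own group. Passing `attrs=("schemeIdUri",)` would apply the looser condition that compares only the element name and schemeIdUri.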
- the media presentation description of the navigation media presentation includes the K video adaptation set elements, and the K video adaptation set elements are in one-to-one correspondence with the K video adaptation sets.
- the video adaptation set element VI, which corresponds to video adaptation set I among the K video adaptation set elements, includes a pointer for pointing to a main media presentation, where video adaptation set I can be any one of the K video adaptation sets.
- the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
- the pointer can be carried by an attribute of the video adaptation set element VI.
- the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
- the pointer can be carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI;
- the pointer may be carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by the value attribute or another attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute or another attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, where The virtual Representation element does not include a media segment template element, a media segment list element, and a base Uniform Resource Locator (BaseURL) element.
- the pointer can also be carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- the ReferencedMediaPresentation element is a newly extended element; that is, the pointer can be carried by an element newly added to the video adaptation set element VI.
- the name of the element carrying the pointer is not limited to ReferencedMediaPresentation, but may be other element names.
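- a client could look for the pointer in several of the carriers listed above. The following Python sketch checks the xlink:href attribute, then the value attribute of an EssentialProperty or SupplementalProperty element, then a ReferencedMediaPresentation child element. The scheme URI `urn:example:navigation:pointer`, the example URLs, the lookup order, and the assumption that ReferencedMediaPresentation carries the pointer as text content are all illustrative choices, not something this description fixes; the fragment is also namespace-free apart from xlink.

```python
import xml.etree.ElementTree as ET

XLINK_HREF = "{http://www.w3.org/1999/xlink}href"
NAV_SCHEME = "urn:example:navigation:pointer"  # hypothetical scheme identifier

def find_pointer(aset):
    """Return the pointer carried by a video adaptation set element, or None."""
    # Carrier 1: the xlink:href attribute of the adaptation set element itself.
    href = aset.get(XLINK_HREF)
    if href:
        return href
    # Carrier 2: the value attribute of an EssentialProperty or SupplementalProperty element.
    for tag in ("EssentialProperty", "SupplementalProperty"):
        for prop in aset.findall(tag):
            if prop.get("schemeIdUri") == NAV_SCHEME and prop.get("value"):
                return prop.get("value")
    # Carrier 3: a newly extended ReferencedMediaPresentation child element (text content assumed).
    ref = aset.find("ReferencedMediaPresentation")
    if ref is not None and ref.text:
        return ref.text.strip()
    return None

SAMPLE = """
<Period xmlns:xlink="http://www.w3.org/1999/xlink">
  <AdaptationSet id="1" xlink:href="http://example.com/main1.mpd"/>
  <AdaptationSet id="2">
    <SupplementalProperty schemeIdUri="urn:example:navigation:pointer"
                          value="http://example.com/main2.mpd"/>
  </AdaptationSet>
  <AdaptationSet id="3">
    <ReferencedMediaPresentation>http://example.com/main3.mpd</ReferencedMediaPresentation>
  </AdaptationSet>
  <AdaptationSet id="4"/>
</Period>
"""

pointers = {
    aset.get("id"): find_pointer(aset)
    for aset in ET.fromstring(SAMPLE).findall("AdaptationSet")
}
```

In this sample, adaptation sets 1 to 3 each yield a pointer through a different carrier, and adaptation set 4 yields `None` because it carries no pointer.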
- the time structure of the navigation media presentation may be independent of the time structure of the main media presentations pointed to by the K navigation units in the navigation media presentation.
- the audio of a navigation unit may be obtained by encoding the audio of the main media presentation, and the video of a navigation unit may be obtained by encoding the video of the main media presentation, so that there may be no correlation between the time structure of the navigation unit and the time structure of the main media presentation.
- the processor is further configured to present navigation unit i if the focus of attention stays on navigation unit i of the K navigation units.
- the processor is further configured to acquire, when navigation unit i of the K navigation units is selected, the main media presentation pointed to by navigation unit i. Further, the client may also present the main media presentation pointed to by navigation unit i.
- the client 500 can be, for example, a personal computer, a mobile phone, a tablet computer, a television set or a set top box.
- the client 500 in this embodiment may be specifically implemented according to the method in the foregoing method embodiment.
- the client 500 can be used to implement any of the media presentation navigation methods based on the hypertext transfer protocol media stream provided by the foregoing embodiments.
- because each of the K navigation units can point to one main media presentation, an association relationship is in effect introduced between the navigation unit and the main media presentation.
- this association enables the client 500, when navigation unit i of the K navigation units is selected, to obtain the media presentation description of the main media presentation j pointed to by navigation unit i, and then to obtain the main media presentation j according to that media presentation description. This facilitates flexible switching between the navigation media presentation and the main media presentation.
- video navigation is thereby supported in the HTTP-based media streaming service scenario, which in turn helps improve the user experience.
- an embodiment of the present invention provides a server 600, which may include:
- a determining unit 610, configured to determine N navigation units included in a navigation media presentation.
- a generating unit 620, configured to generate a media presentation description of the navigation media presentation, where the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, and N is an integer greater than 1; each of the N navigation units points to a main media presentation, and the presentation quality of the main media presentation pointed to by navigation unit i of the N navigation units is higher than the presentation quality of navigation unit i.
- the presentation quality of the main media presentation pointed to by navigation unit i of the N navigation units is higher than the presentation quality of navigation unit i. That is, the presentation quality of the media representations of a navigation unit is lower than the presentation quality of the main media presentation that the navigation unit points to.
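- the generating step can be pictured with a short Python sketch that emits one video adaptation set element per navigation unit, each carrying a pointer to its main media presentation. The dictionary keys (`id`, `bandwidth`, `main_mpd_url`), the scheme URI `urn:example:navigation:pointer`, the choice of SupplementalProperty as the pointer carrier, and the example URLs are assumptions made for illustration; the output is a simplified, namespace-free skeleton rather than a complete DASH MPD.

```python
import xml.etree.ElementTree as ET

def build_navigation_mpd(units):
    """Build a simplified media presentation description for a navigation media presentation.

    units: list of dicts with hypothetical keys 'id', 'bandwidth' and 'main_mpd_url'.
    """
    if len(units) < 2:
        raise ValueError("N must be an integer greater than 1")
    mpd = ET.Element("MPD")
    period = ET.SubElement(mpd, "Period")
    for unit in units:
        # One video adaptation set per navigation unit.
        aset = ET.SubElement(
            period, "AdaptationSet",
            {"id": str(unit["id"]), "contentType": "video"},
        )
        # Pointer to the main media presentation (the carrier is an illustrative choice).
        ET.SubElement(
            aset, "SupplementalProperty",
            {"schemeIdUri": "urn:example:navigation:pointer",
             "value": unit["main_mpd_url"]},
        )
        # Low-quality preview representation of the navigation unit.
        ET.SubElement(
            aset, "Representation",
            {"id": f"nav-{unit['id']}", "bandwidth": str(unit["bandwidth"])},
        )
    return ET.tostring(mpd, encoding="unicode")

units = [
    {"id": 1, "bandwidth": 200000, "main_mpd_url": "http://example.com/main1.mpd"},
    {"id": 2, "bandwidth": 200000, "main_mpd_url": "http://example.com/main2.mpd"},
]
xml_text = build_navigation_mpd(units)
```

The sketch enforces only the N > 1 constraint; checking that each main media presentation really has a higher presentation quality than its navigation unit would require information about the main media presentations that is outside this fragment.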
- the media presentation description of the navigation media presentation may be different from the media presentation description of the main media presentation pointed to by each of the N navigation units.
- that is, the navigation media presentation may have an independent media presentation description, and the main media presentation pointed to by each of the N navigation units may also have a media presentation description that is independent of that of the navigation media presentation.
- for example, the N navigation units point to N main media presentations, and each of the N main media presentations has its own media presentation description, giving N media presentation descriptions. The media presentation description of the navigation media presentation is different from any one of these N media presentation descriptions; that is, the navigation media presentation is described by an (N+1)th media presentation description.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the main media presentations pointed to by the N navigation units may be aggregated to form an aggregated media presentation description (also referred to as a super media presentation description).
- the aggregated media presentation description may be used to describe both the navigation media presentation and the main media presentations to which the navigation media presentation points. Introducing the super media presentation description helps strengthen the association between the navigation media presentation and the main media presentations it points to.
- the way in which the navigation unit points to the main media presentation can be flexible, and the navigation unit can directly point to the main media presentation or indirectly to the main media presentation.
- each of the N navigation units can point to a main media presentation by pointing to the media presentation description that describes that main media presentation.
- the navigation unit can also point to the main media presentation by other direct or indirect pointing methods.
- the media presentation description of the navigation media presentation and the media presentation description of the primary media presentation pointed to by each of the N navigation units may be aggregated to form an aggregated media presentation description.
- each of the N navigation units may point to a main media presentation by referencing a presentation element in the aggregated media presentation description.
- each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component. Further, a navigation unit may also include a subtitle component or other types of media components.
- the present invention provides a signaling mechanism for navigation services by media presentation descriptions (such as MPD in the DASH standard).
- the media presentation description can inform the client of which navigation units the navigation service consists of, the components of each navigation unit, the relationship between a navigation unit and the member media presentations of the navigation service, the relationship between the video components of the navigation units, the relationship between the audio components of the navigation units, the relationship between the audio component and the video component of a navigation unit, and the like.
- the video components included in different navigation units of the N navigation units are media representations in different video adaptation sets of N video adaptation sets, where the media representations within any one of the N video adaptation sets have selection mutual exclusion, and different video adaptation sets of the N video adaptation sets have selection compatibility.
- for example, the video component included in navigation unit i of the N navigation units may belong to video adaptation set Ci of the N video adaptation sets, and the video component included in navigation unit j of the N navigation units may belong to video adaptation set Cj of the N video adaptation sets, where video adaptation set Cj and video adaptation set Ci are two different video adaptation sets of the N video adaptation sets.
- navigation unit j and navigation unit i can be any two navigation units of the N navigation units.
- selection compatibility means that the objects can be selected at the same time. For example, if different video adaptation sets of the N video adaptation sets have selection compatibility, media representations in multiple of the N video adaptation sets can be selected at the same time.
- selection mutual exclusion means that the objects cannot be selected at the same time. For example, if the media representations in any one of the N video adaptation sets have selection mutual exclusion, multiple media representations in the same video adaptation set cannot be selected at the same time. Assuming that video adaptation set I of the N video adaptation sets includes more than 10 media representations and these media representations have selection mutual exclusion, only one of them can be selected at a time, and several of them cannot be selected at the same time.
- the audio components included in the N navigation units are media representations in one audio adaptation set, where the audio adaptation set is different from the N video adaptation sets.
- the audio adaptation set has selection compatibility with the N video adaptation sets. For example, assuming that the audio adaptation set includes more than 20 media representations and these media representations have selection mutual exclusion, only one of them can be selected at a time, and several of them cannot be selected at the same time.
- the audio components included in the different navigation units of the N navigation units are media expressions in different audio adaptation sets in the N audio adaptation sets.
- the different audio adaptation sets in the N audio adaptation sets have selective mutual exclusion.
- the media expression element in the audio adaptation set element may include an area description of an associated area of the media expression in the navigation media presentation.
- the media representations described by media representation elements that include the same area description are associated with each other, or the adaptation sets described by adaptation set elements that include the same area description are associated with each other.
- for example, denote the media representation described by media representation element i as ri, and the media representation described by media representation element j as rj. If media representation element i and media representation element j contain the same area description, this indicates that media representation ri and media representation rj are associated.
- for another example, if media representation element i and adaptation set element ci include the same area description, this indicates that the media representation described by media representation element i is associated with the media representations in the adaptation set described by adaptation set element ci. For example, the media representation described by media representation element i can be an audio media representation, and the media representations in the adaptation set described by adaptation set element ci can be video media representations.
- the area description may be a spatial relationship description (SRD).
- the area description can also be other types of descriptive information that can be used to describe the location area.
- the media presentation description of the navigation media presentation includes N video adaptation set elements, and the N video adaptation set elements are in one-to-one correspondence with the N video adaptation sets.
- each of the N video adaptation set elements includes a description sub-element Ci, and the video adaptation sets described by those of the N video adaptation set elements that satisfy a set commonality condition have selection compatibility.
- the set commonality condition may be that both the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are the same.
- the description sub-element Ci may describe that the media representations in the video adaptation set described by the video adaptation set element including the description sub-element Ci are components of the navigation media presentation, or may describe the role that these media representations play in the navigation media presentation. The role may be, for example, main, supplementary, subtitle, or dubbed translation.
- the description sub-element Ci may be, for example, an EssentialProperty element, a SupplementalProperty element, a Role element, or another element.
- for example, when the description sub-element Ci is a role description (Role) element, the set commonality condition may be that the element name, the scheme identifier (schemeIdUri) attribute, and the parameter attribute of the description sub-element Ci included in the video adaptation set elements are all the same.
- the media presentation description of the navigation media presentation includes the N video adaptation set elements, and the N video adaptation set elements are in one-to-one correspondence with the N video adaptation sets.
- the video adaptation set element VI, which corresponds to video adaptation set I among the N video adaptation set elements, includes a pointer for pointing to a main media presentation, where video adaptation set I can be any one of the N video adaptation sets.
- the location of the pointer in the video adaptation set element VI may be determined according to the needs of the scene.
- the pointer can be carried by an attribute of the video adaptation set element VI.
- the pointer may be carried by an xlink:href attribute or other attribute of the video adaptation set element VI.
- the pointer may be carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI;
- the pointer may be carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by the value attribute or another attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute or another attribute of a SupplementalProperty element in the video adaptation set element VI.
- the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or the pointer may be carried by a child element in a virtual Representation element in the video adaptation set element VI, where The virtual Representation element does not include a media segment template element, a media segment list element, and a base Uniform Resource Locator (BaseURL) element.
- the pointer can also be carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- the ReferencedMediaPresentation element is a newly extended element; that is, the pointer can be carried by an element newly added to the video adaptation set element VI.
- the name of the element carrying the pointer is not limited to ReferencedMediaPresentation, but may be other element names.
- the time structure of the navigation media presentation may be independent of the time structure of the main media presentations pointed to by the N navigation units in the navigation media presentation.
- the audio of a navigation unit may be obtained by encoding the audio of the main media presentation, and the video of a navigation unit may be obtained by encoding the video of the main media presentation, so that there may be no correlation between the time structure of the navigation unit and the time structure of the main media presentation.
- server 600 can be used to implement any of the foregoing embodiments.
- the server 600 can be a content server or other server.
- the media presentation description of the navigation media presentation generated by the server 600 describes the N navigation units included in the navigation media presentation. Because each of the N navigation units can point to a main media presentation, an association relationship is in effect introduced between the navigation unit and the main media presentation.
- as a result, when navigation unit i of the N navigation units is selected, the client can obtain the media presentation description of the main media presentation j pointed to by navigation unit i, and then obtain the main media presentation j according to that media presentation description.
- this solution lays a foundation for flexible switching between the navigation media presentation and the main media presentation, and for supporting video navigation in the HTTP-based media streaming service scenario.
- a server 700 may include a processor 702 and a memory 703, where the processor 702 and the memory 703 are coupled by a bus 701.
- the processor 702, by calling code or instructions in the memory 703, is configured to: determine N navigation units included in a navigation media presentation; and generate a media presentation description of the navigation media presentation, where the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, and N is an integer greater than 1; each of the N navigation units points to a main media presentation, and the presentation quality of the main media presentation pointed to by navigation unit i of the N navigation units is higher than the presentation quality of navigation unit i.
- that is, the presentation quality of the media representations of a navigation unit is lower than the presentation quality of the main media presentation that the navigation unit points to.
- the media presentation description of the navigation media presentation may be different from the media presentation description of the main media presentation pointed to by each of the N navigation units. That is, the navigation media presentation may have an independent media presentation description, and the main media presentation pointed to by each of the N navigation units may also have a media presentation description that is independent of that of the navigation media presentation.
- for example, the N navigation units point to N main media presentations, and each of the N main media presentations has its own media presentation description, giving N media presentation descriptions. The media presentation description of the navigation media presentation is different from any one of these N media presentation descriptions; that is, the navigation media presentation is described by an (N+1)th media presentation description.
- the media presentation description of the navigation media presentation and the media presentation descriptions of the main media presentations pointed to by the N navigation units may be aggregated to form an aggregated media presentation description (also referred to as a super media presentation description).
- the aggregated media presentation description may be used to describe both the navigation media presentation and the main media presentations to which the navigation media presentation points. Introducing the super media presentation description helps strengthen the association between the navigation media presentation and the main media presentations it points to.
- The way in which a navigation unit points to a primary media presentation is flexible: a navigation unit may point to the primary media presentation directly or indirectly.
- For example, each of the N navigation units may point to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- A navigation unit may also point to a primary media presentation in other direct or indirect ways.
- For another example, where the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the N navigation units are aggregated to form an aggregated media presentation description, each of the N navigation units may point to a primary media presentation by referencing a presentation element in the aggregated media presentation description.
- Each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component. A navigation unit may further include a subtitle component or other types of media components.
- The present invention provides a signaling mechanism for navigation services, carried in the media presentation description (such as the MPD in the DASH standard).
- The media presentation description can inform the client of which navigation units make up the navigation media presentation, the components of each navigation unit, the relationship between a navigation unit and the member media presentations of the navigation service, the relationship among the video components of the navigation units, the relationship among the audio components of the navigation units, the relationship between the audio component and the video component of a navigation unit, and the like.
- The video components included in different navigation units of the N navigation units are media representations in different video adaptation sets of N video adaptation sets, where the media representations within any one of the N video adaptation sets are mutually exclusive in selection, and different video adaptation sets of the N video adaptation sets are compatible in selection.
- For example, the video component included in navigation unit i of the N navigation units may belong to video adaptation set Ci of the N video adaptation sets, and the video component included in navigation unit j of the N navigation units may belong to video adaptation set Cj of the N video adaptation sets, where video adaptation set Cj and video adaptation set Ci are two different video adaptation sets.
- Navigation unit j and navigation unit i may be any two of the N navigation units.
- Selection compatibility means that the objects concerned can be selected at the same time. For example, if different video adaptation sets of the N video adaptation sets are compatible in selection, media representations from several of the N video adaptation sets can be selected simultaneously.
- Selection mutual exclusion means that the objects concerned cannot be selected at the same time. For example, if the media representations within any one of the N video adaptation sets are mutually exclusive in selection, multiple media representations from the same video adaptation set cannot be selected simultaneously.
- For example, assuming that video adaptation set I of the N video adaptation sets includes 10 media representations that are mutually exclusive in selection, only one of the 10 media representations can be selected at a time; several of them cannot be selected at the same time.
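The two selection rules above can be sketched as a small validity check. This is only an illustration under stated assumptions: the function name `is_valid_selection` and the identifiers `C1`, `C2`, `r1`, `r2`, `r7` are hypothetical and do not come from the patent text.

```python
from collections import Counter


def is_valid_selection(selected):
    """Check a selection of (adaptation_set_id, representation_id) pairs.

    Mutual exclusion: at most one representation per adaptation set.
    Compatibility: representations from different adaptation sets may coexist.
    """
    per_set = Counter(set_id for set_id, _ in selected)
    return all(count <= 1 for count in per_set.values())


# Hypothetical identifiers, for illustration only.
ok = is_valid_selection([("C1", "r1"), ("C2", "r7")])      # different sets
clash = is_valid_selection([("C1", "r1"), ("C1", "r2")])   # same set twice
```

A client that presents several navigation units at once would pass one representation per video adaptation set, which this check accepts.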
- The audio components included in the N navigation units may be media representations in a single audio adaptation set, where the audio adaptation set is different from any of the N video adaptation sets.
- The audio adaptation set is compatible in selection with the N video adaptation sets. For example, suppose the audio adaptation set includes 20 media representations; if the media representations in the audio adaptation set are mutually exclusive in selection, only one of the 20 media representations can be selected at a time, and several of the 20 cannot be selected at the same time.
- Alternatively, the audio components included in different navigation units of the N navigation units are media representations in different audio adaptation sets of N audio adaptation sets.
- Different audio adaptation sets of the N audio adaptation sets are mutually exclusive in selection.
- A media representation element in the audio adaptation set element may include a region description of the region, in the navigation media presentation, with which the media representation it describes is associated.
- Media representations described by media representation elements that include the same region description are associated with one another, or adaptation sets described by adaptation set elements that include the same region description are associated with one another.
- For example, let the media representation described by media representation element i be media representation ri, and let the media representation described by media representation element j be media representation rj. If media representation element i and media representation element j include the same region description, this indicates that media representation ri and media representation rj are associated.
- If media representation element i and adaptation set element ci include the same region description, this may likewise indicate that the media representation described by media representation element i is associated with the media representations in the adaptation set described by adaptation set element ci. For example, the media representation described by media representation element i may be an audio media representation, and the media representations in the adaptation set described by adaptation set element ci may be video media representations.
- The region description may be, for example, a spatial relationship description (SRD).
- The region description may also be another type of descriptive information that can be used to describe a location region.
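The association rule (elements that carry the same region description are associated) amounts to grouping elements by their region description. The following is a minimal sketch in which region descriptions are treated as opaque strings; the function name, the element identifiers, and the placeholder region strings are all assumptions made for illustration.

```python
from collections import defaultdict


def group_by_region(elements):
    """Group element ids by region description.

    elements: iterable of (element_id, region_description) pairs.
    Returns only the regions shared by at least two elements, since a
    single element has nothing to be associated with.
    """
    groups = defaultdict(list)
    for element_id, region in elements:
        groups[region].append(element_id)
    return {region: ids for region, ids in groups.items() if len(ids) > 1}


# Placeholder region strings; their internal format is not interpreted here.
elements = [
    ("video_set_c1", "region-A"),
    ("audio_rep_1", "region-A"),
    ("video_set_c2", "region-B"),
]
associated = group_by_region(elements)
```

Here the audio representation `audio_rep_1` ends up associated with the video adaptation set `video_set_c1`, matching the audio/video example in the text.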
- The media presentation description of the navigation media presentation includes N video adaptation set elements, in one-to-one correspondence with the N video adaptation sets.
- The N video adaptation set elements include a description sub-element Ci.
- The video adaptation sets described by those of the N video adaptation set elements that satisfy a set commonality condition are compatible in selection.
- The set commonality condition may be that both the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are the same.
- The description sub-element Ci may describe that the media representations in the video adaptation set described by the video adaptation set element that includes Ci are components of the navigation media presentation.
- Alternatively, the description sub-element Ci may describe the role, in the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes Ci. The role may be, for example, primary, supplementary, subtitle, or translated dubbing.
- The description sub-element Ci may be, for example, an EssentialProperty element, a SupplementalProperty element, a Role element, or another element.
- If the description sub-element Ci is a Role element, the set commonality condition may be that the element name, the scheme identifier (schemeIdUri) attribute, and the value attribute of the description sub-element Ci included in the video adaptation set elements are all the same.
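The commonality condition can be read as a grouping key: element name plus schemeIdUri, extended with the value attribute when the description sub-element is a Role element. The sketch below assumes each description sub-element has already been parsed into a dictionary; the helper names and the `urn:example:...` scheme identifiers are hypothetical.

```python
from collections import defaultdict


def compatibility_key(descriptor):
    """Build the commonality key for one description sub-element Ci.

    descriptor: dict with 'name', 'schemeIdUri', and optionally 'value'.
    """
    key = (descriptor["name"], descriptor["schemeIdUri"])
    if descriptor["name"] == "Role":
        # For Role elements the value attribute must match as well.
        key += (descriptor.get("value"),)
    return key


def compatible_groups(adaptation_sets):
    """Return lists of adaptation set ids that satisfy the condition together.

    adaptation_sets: dict mapping adaptation set id -> description sub-element.
    """
    groups = defaultdict(list)
    for set_id, descriptor in adaptation_sets.items():
        groups[compatibility_key(descriptor)].append(set_id)
    return [ids for ids in groups.values() if len(ids) > 1]


# Hypothetical scheme identifiers, for illustration only.
sets = {
    "C1": {"name": "EssentialProperty", "schemeIdUri": "urn:example:navigation"},
    "C2": {"name": "EssentialProperty", "schemeIdUri": "urn:example:navigation"},
    "C3": {"name": "Role", "schemeIdUri": "urn:example:role", "value": "main"},
    "C4": {"name": "Role", "schemeIdUri": "urn:example:role", "value": "supplementary"},
}
groups = compatible_groups(sets)
```

In this example only `C1` and `C2` form a compatible group; `C3` and `C4` share a name and scheme but differ in value, so the Role variant of the condition keeps them apart.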
- The media presentation description of the navigation media presentation includes the N video adaptation set elements, in one-to-one correspondence with N video adaptation sets.
- The video adaptation set element VI, of the N video adaptation set elements, that corresponds to video adaptation set I includes a pointer for pointing to a primary media presentation, where video adaptation set I may be any one of the N video adaptation sets.
- The location of the pointer within the video adaptation set element VI may be chosen according to the needs of the scenario.
- For example, the pointer may be carried by an attribute of the video adaptation set element VI.
- For example, the pointer may be carried by the xlink:href attribute or another attribute of the video adaptation set element VI.
- For another example, the pointer may be carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- Specifically, the pointer may be carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI.
- Likewise, the pointer may be carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- For example, the pointer may be carried by the value attribute or another attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute or another attribute of a SupplementalProperty element in the video adaptation set element VI.
- For another example, the pointer may be carried by an attribute of a virtual Representation element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, where the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- For another example, the pointer may be carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- The ReferencedMediaPresentation element is a newly extended element; that is, the pointer may be carried by an element newly added to the video adaptation set element VI.
- The name of the element carrying the pointer is not limited to ReferencedMediaPresentation; other element names may be used.
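Several of the carriage options listed above can be illustrated with a small lookup routine over a parsed media presentation description. This is a sketch under assumptions, not the patent's definitive syntax: the scheme identifier `urn:example:navigation:pointer`, the sample URLs, the lookup order, and the choice to read the ReferencedMediaPresentation pointer from element text are all hypothetical. Only the MPD and XLink namespace URIs are standard.

```python
import xml.etree.ElementTree as ET

MPD_NS = "urn:mpeg:dash:schema:mpd:2011"
XLINK_NS = "http://www.w3.org/1999/xlink"
POINTER_SCHEME = "urn:example:navigation:pointer"  # hypothetical scheme identifier


def find_pointer(adaptation_set):
    """Return the pointer carried by a video adaptation set element, or None."""
    # Option 1: an attribute of the adaptation set element (xlink:href).
    href = adaptation_set.get(f"{{{XLINK_NS}}}href")
    if href:
        return href
    # Option 2: the value attribute of a property descriptor child.
    for name in ("EssentialProperty", "SupplementalProperty"):
        for prop in adaptation_set.findall(f"{{{MPD_NS}}}{name}"):
            if prop.get("schemeIdUri") == POINTER_SCHEME and prop.get("value"):
                return prop.get("value")
    # Option 3: a newly extended child element (text content is an assumption).
    ref = adaptation_set.find(f"{{{MPD_NS}}}ReferencedMediaPresentation")
    if ref is not None and ref.text:
        return ref.text.strip()
    return None


SAMPLE = """<MPD xmlns="urn:mpeg:dash:schema:mpd:2011"
     xmlns:xlink="http://www.w3.org/1999/xlink">
  <Period>
    <AdaptationSet id="1" xlink:href="http://example.com/main1.mpd"/>
    <AdaptationSet id="2">
      <SupplementalProperty schemeIdUri="urn:example:navigation:pointer"
                            value="http://example.com/main2.mpd"/>
    </AdaptationSet>
    <AdaptationSet id="3">
      <ReferencedMediaPresentation>http://example.com/main3.mpd</ReferencedMediaPresentation>
    </AdaptationSet>
    <AdaptationSet id="4"/>
  </Period>
</MPD>"""

root = ET.fromstring(SAMPLE)
pointers = {
    a.get("id"): find_pointer(a)
    for a in root.iter(f"{{{MPD_NS}}}AdaptationSet")
}
```

The virtual Representation option is omitted here because the text does not say which attribute or child element would carry the pointer.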
- The time structure of the navigation media presentation may be independent of the time structure of the primary media presentations pointed to by the N navigation units in the navigation media presentation.
- For example, the audio of a navigation unit may be obtained by encoding the audio of the primary media presentation, and the video of a navigation unit may be obtained by encoding the video of the primary media presentation, so that there need be no correlation between the time structure of the navigation unit and the time structure of the primary media presentation.
- The server 700 can be used to implement any of the media presentation navigation methods based on hypertext transfer protocol media streaming provided by the foregoing embodiments.
- The server 700 may be a content server or another server.
- It can be seen that, in this embodiment, the media presentation description of the navigation media presentation generated by the server 700 describes the N navigation units included in the navigation media presentation. Because each of the N navigation units points to a primary media presentation, an association is introduced between the navigation units and the primary media presentations.
- As a result, when navigation unit i of the N navigation units is selected, the client can obtain the media presentation description of the primary media presentation j pointed to by navigation unit i, and can then obtain primary media presentation j according to that media presentation description.
- This solution lays a foundation for flexible switching between the navigation media presentation and the primary media presentations, and for supporting video navigation in HTTP-based media streaming service scenarios.
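The switch from a selected navigation unit to its primary media presentation can be sketched as follows. The class and field names are hypothetical, and the network fetch is replaced by a dictionary lookup so that the example stays self-contained; a real client would retrieve the description over HTTP.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable


@dataclass(frozen=True)
class NavigationUnit:
    unit_id: str
    pointer: str  # locator of the primary media presentation's description


class NavigationClient:
    """Holds the navigation units and follows a unit's pointer on selection."""

    def __init__(self, units: Iterable[NavigationUnit],
                 fetch_description: Callable[[str], str]):
        self._units: Dict[str, NavigationUnit] = {u.unit_id: u for u in units}
        self._fetch = fetch_description

    def select(self, unit_id: str) -> str:
        """Return the description of the primary presentation the unit points to."""
        unit = self._units[unit_id]
        return self._fetch(unit.pointer)


# Stand-in for an HTTP fetch: a lookup table of hypothetical descriptions.
descriptions = {
    "main1.mpd": "<MPD id='main1'/>",
    "main2.mpd": "<MPD id='main2'/>",
}
client = NavigationClient(
    [NavigationUnit("u1", "main1.mpd"), NavigationUnit("u2", "main2.mpd")],
    descriptions.__getitem__,
)
selected = client.select("u2")
```

Passing the fetch function in as a parameter keeps the pointer-following logic separate from the transport, which is one possible design choice rather than anything the text prescribes.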
- An embodiment of the present invention further provides a communication system, which may include a client 810 and a content server 820.
- The client 810 is configured to: obtain, from the content server 820, a media presentation description of a navigation media presentation, where the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, N being an integer greater than 1; obtain K navigation units from the content server 820 according to the media presentation description of the navigation media presentation; and present the K navigation units. Each of the K navigation units points to a primary media presentation, and the presentation quality of the primary media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i.
- The client 810 may be, for example, any client provided by the foregoing embodiments.
- Because this content is based on the same concept as the method embodiments of the present invention, refer to the description in the method embodiments; details are not described herein again.
- An embodiment of the present invention further provides a computer storage medium. The computer storage medium can store a program that, when executed, performs some or all of the steps of any one of the methods described in the foregoing method embodiments.
- The disclosed apparatus may be implemented in other ways.
- For example, the device embodiments described above are merely illustrative.
- The division into the units described above is only a division by logical function; in actual implementation, other divisions are possible.
- For example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not performed.
- The mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device, or unit, and may be electrical or in another form.
- The units described above as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
- The functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit.
- The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
- If implemented in the form of a software functional unit and sold or used as a stand-alone product, the integrated unit may be stored in a computer-readable storage medium.
- The stored software includes a number of instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like, and in particular a processor in a computer device) to perform all or some of the steps of the methods of the embodiments of the present invention.
- The foregoing storage medium may include a USB flash drive, a removable hard disk, a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or any other medium capable of storing program code.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Databases & Information Systems (AREA)
- Human Computer Interaction (AREA)
- Computer Networks & Wireless Communication (AREA)
- Computer Security & Cryptography (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Information Transfer Between Computers (AREA)
Claims (104)
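- A media presentation navigation method based on hypertext transfer protocol media streaming, comprising: obtaining, by a client, a media presentation description of a navigation media presentation, wherein the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, N being an integer greater than 1; obtaining, by the client according to the media presentation description of the navigation media presentation, K navigation units of the N navigation units; and presenting, by the client, the K navigation units, each of the K navigation units pointing to a primary media presentation, wherein the presentation quality of the primary media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i.
- The method according to claim 1, wherein the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the K navigation units.
- The method according to claim 2, wherein each of the K navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- The method according to claim 1, wherein the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units are aggregated to form one aggregated media presentation description.
- The method according to claim 4, wherein each of the K navigation units points to a primary media presentation by referencing a presentation element in the aggregated media presentation description.
- The method according to any one of claims 1 to 5, wherein each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- The method according to claim 6, wherein the video components included in different navigation units of the K navigation units are media representations in different video adaptation sets of K video adaptation sets, wherein the media representations within any one of the K video adaptation sets are mutually exclusive in selection, and different video adaptation sets of the K video adaptation sets are compatible in selection.
- The method according to claim 7, wherein the audio components included in the K navigation units are media representations in an audio adaptation set, the audio adaptation set is different from any one of the K adaptation sets, and the audio adaptation set is compatible in selection with the K video adaptation sets; or the audio components included in different navigation units of the K navigation units are media representations in different audio adaptation sets of K audio adaptation sets, wherein different audio adaptation sets of the K audio adaptation sets are mutually exclusive in selection.
- The method according to claim 8, wherein a media representation element in the audio adaptation set element includes a region description of the region, in the navigation media presentation, with which the media representation it describes is associated.
- The method according to claim 9, wherein media representations described by media representation elements that include the same region description are associated with one another, or adaptation sets described by adaptation set elements that include the same region description are associated with one another.
- The method according to claim 9 or 10, wherein the region description is a spatial relationship description (SRD).
- The method according to any one of claims 7 to 11, wherein the media presentation description of the navigation media presentation includes K video adaptation set elements in one-to-one correspondence with the K video adaptation sets; wherein the K video adaptation set elements include a description sub-element Ci, the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition are compatible in selection, and the set commonality condition is that the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are both the same.
- The method according to claim 12, wherein the description sub-element Ci describes that the media representations in the video adaptation set described by the video adaptation set element that includes the description sub-element Ci are components of the navigation media presentation.
- The method according to claim 12, wherein the description sub-element Ci describes the role, in the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes the description sub-element Ci.
- The method according to claim 13 or 14, wherein the description sub-element Ci is a Role element, an EssentialProperty element, or a SupplementalProperty element.
- The method according to claim 15, wherein, if the description sub-element Ci is a Role element, the set commonality condition is that the element name, the scheme identifier (schemeIdUri) attribute, and the value attribute of the description sub-element Ci included in the video adaptation set elements are all the same.
- The method according to any one of claims 5 to 16, wherein the media presentation description of the navigation media presentation includes the K video adaptation set elements in one-to-one correspondence with K video adaptation sets, wherein the video adaptation set element VI, of the K video adaptation set elements, that corresponds to video adaptation set I includes a pointer for pointing to a primary media presentation, and video adaptation set I is any one of the K video adaptation sets.
- The method according to claim 17, wherein the pointer is carried by an attribute of the video adaptation set element VI.
- The method according to claim 18, wherein the pointer is carried by the xlink:href attribute of the video adaptation set element VI.
- The method according to claim 17, wherein the pointer is carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- The method according to claim 17, wherein the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- The method according to claim 21, wherein the pointer is carried by the value attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute of a SupplementalProperty element in the video adaptation set element VI.
- The method according to claim 17, wherein the pointer is carried by an attribute of a virtual Representation element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, wherein the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- The method according to claim 17, wherein the pointer is carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- The method according to any one of claims 1 to 24, wherein the time structure of the navigation media presentation does not depend on the time structure of the primary media presentations pointed to by the K navigation units in the navigation media presentation.
- The method according to any one of claims 1 to 25, wherein, when the focus of attention rests on navigation unit i of the K navigation units, the client presents the audio component of navigation unit i.
- The method according to any one of claims 1 to 26, further comprising: when navigation unit i of the K navigation units is selected, obtaining, by the client, the primary media presentation pointed to by navigation unit i.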
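- A media presentation navigation method based on hypertext transfer protocol media streaming, comprising: determining N navigation units included in a navigation media presentation; and generating a media presentation description of the navigation media presentation, wherein the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, N being an integer greater than 1; wherein each of the N navigation units points to a primary media presentation, and the presentation quality of the primary media presentation pointed to by navigation unit i of the N navigation units is higher than the presentation quality of navigation unit i.
- The method according to claim 28, wherein the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the N navigation units.
- The method according to claim 29, wherein each of the N navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- The method according to claim 28, wherein the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the N navigation units are aggregated to form one aggregated media presentation description.
- The method according to claim 31, wherein each of the N navigation units points to a primary media presentation by referencing a presentation element in the aggregated media presentation description.
- The method according to any one of claims 28 to 32, wherein each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- The method according to claim 33, wherein the video components included in different navigation units of the N navigation units are media representations in different video adaptation sets of N video adaptation sets, wherein the media representations within any one of the N video adaptation sets are mutually exclusive in selection, and different video adaptation sets of the N video adaptation sets are compatible in selection.
- The method according to claim 34, wherein the audio components included in the N navigation units are media representations in an audio adaptation set, the audio adaptation set is different from any one of the N adaptation sets, and the audio adaptation set is compatible in selection with the N video adaptation sets; or the audio components included in different navigation units of the N navigation units are media representations in different audio adaptation sets of N audio adaptation sets, wherein different audio adaptation sets of the N audio adaptation sets are mutually exclusive in selection.
- The method according to claim 35, wherein a media representation element in the audio adaptation set element includes a region description of the region, in the navigation media presentation, with which the media representation it describes is associated.
- The method according to claim 36, wherein media representations described by media representation elements that include the same region description are associated with one another, or adaptation sets described by adaptation set elements that include the same region description are associated with one another.
- The method according to claim 36 or 37, wherein the region description is a spatial relationship description (SRD).
- The method according to any one of claims 34 to 38, wherein the media presentation description of the navigation media presentation includes N video adaptation set elements in one-to-one correspondence with the N video adaptation sets; wherein the N video adaptation set elements include a description sub-element Ci, the video adaptation sets described by those of the N video adaptation set elements that satisfy a set commonality condition are compatible in selection, and the set commonality condition is that the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are both the same.
- The method according to claim 39, wherein the description sub-element Ci describes that the media representations in the video adaptation set described by the video adaptation set element that includes the description sub-element Ci are components of the navigation media presentation.
- The method according to claim 39, wherein the description sub-element Ci describes the role, in the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes the description sub-element Ci.
- The method according to claim 40 or 41, wherein the description sub-element Ci is a Role element, an EssentialProperty element, or a SupplementalProperty element.
- The method according to claim 42, wherein, if the description sub-element Ci is a Role element, the set commonality condition is that the element name, the scheme identifier (schemeIdUri) attribute, and the value attribute of the description sub-element Ci included in the video adaptation set elements are all the same.
- The method according to any one of claims 32 to 43, wherein the media presentation description of the navigation media presentation includes the N video adaptation set elements in one-to-one correspondence with N video adaptation sets, wherein the video adaptation set element VI, of the N video adaptation set elements, that corresponds to video adaptation set I includes a pointer for pointing to a primary media presentation, and video adaptation set I is any one of the N video adaptation sets.
- The method according to claim 44, wherein the pointer is carried by an attribute of the video adaptation set element VI.
- The method according to claim 45, wherein the pointer is carried by the xlink:href attribute of the video adaptation set element VI.
- The method according to claim 44, wherein the pointer is carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- The method according to claim 44, wherein the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- The method according to claim 48, wherein the pointer is carried by the value attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute of a SupplementalProperty element in the video adaptation set element VI.
- The method according to claim 44, wherein the pointer is carried by an attribute of a virtual Representation element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, wherein the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- The method according to claim 44, wherein the pointer is carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- The method according to any one of claims 28 to 51, wherein the time structure of the navigation media presentation does not depend on the time structure of the primary media presentations pointed to by the N navigation units in the navigation media presentation.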
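- A client, comprising: a first obtaining unit, configured to obtain a media presentation description of a navigation media presentation, wherein the media presentation description of the navigation media presentation describes N navigation units included in the navigation media presentation, N being an integer greater than 1; a second obtaining unit, configured to obtain K navigation units of the N navigation units according to the media presentation description of the navigation media presentation; and a presenting unit, configured to present the K navigation units, each of the K navigation units pointing to a primary media presentation, wherein the presentation quality of the primary media presentation pointed to by navigation unit i of the K navigation units is higher than the presentation quality of navigation unit i.
- The client according to claim 53, wherein the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the K navigation units.
- The client according to claim 54, wherein each of the K navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- The client according to claim 53, wherein the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the K navigation units are aggregated to form one aggregated media presentation description.
- The client according to claim 56, wherein each of the K navigation units points to a primary media presentation by referencing a presentation element in the aggregated media presentation description.
- The client according to any one of claims 53 to 57, wherein each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- The client according to claim 58, wherein the video components included in different navigation units of the K navigation units are media representations in different video adaptation sets of K video adaptation sets, wherein the media representations within any one of the K video adaptation sets are mutually exclusive in selection, and different video adaptation sets of the K video adaptation sets are compatible in selection.
- The client according to claim 59, wherein the audio components included in the K navigation units are media representations in an audio adaptation set, the audio adaptation set is different from any one of the K adaptation sets, and the audio adaptation set is compatible in selection with the K video adaptation sets; or the audio components included in different navigation units of the K navigation units are media representations in different audio adaptation sets of K audio adaptation sets, wherein different audio adaptation sets of the K audio adaptation sets are mutually exclusive in selection.
- The client according to claim 60, wherein a media representation element in the audio adaptation set element includes a region description of the region, in the navigation media presentation, with which the media representation it describes is associated.
- The client according to claim 61, wherein media representations described by media representation elements that include the same region description are associated with one another, or adaptation sets described by adaptation set elements that include the same region description are associated with one another.
- The client according to claim 61 or 62, wherein the region description is a spatial relationship description (SRD).
- The client according to any one of claims 58 to 63, wherein the media presentation description of the navigation media presentation includes K video adaptation set elements in one-to-one correspondence with the K video adaptation sets; wherein the K video adaptation set elements include a description sub-element Ci, the video adaptation sets described by those of the K video adaptation set elements that satisfy a set commonality condition are compatible in selection, and the set commonality condition is that the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are both the same.
- The client according to claim 64, wherein the description sub-element Ci describes that the media representations in the video adaptation set described by the video adaptation set element that includes the description sub-element Ci are components of the navigation media presentation.
- The client according to claim 64, wherein the description sub-element Ci describes the role, in the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes the description sub-element Ci.
- The client according to claim 65 or 66, wherein the description sub-element Ci is a Role element, an EssentialProperty element, or a SupplementalProperty element.
- The client according to claim 67, wherein, if the description sub-element Ci is a Role element, the set commonality condition is that the element name, the scheme identifier (schemeIdUri) attribute, and the value attribute of the description sub-element Ci included in the video adaptation set elements are all the same.
- The client according to any one of claims 57 to 68, wherein the media presentation description of the navigation media presentation includes the K video adaptation set elements in one-to-one correspondence with K video adaptation sets, wherein the video adaptation set element VI, of the K video adaptation set elements, that corresponds to video adaptation set I includes a pointer for pointing to a primary media presentation, and video adaptation set I is any one of the K video adaptation sets.
- The client according to claim 69, wherein the pointer is carried by an attribute of the video adaptation set element VI.
- The client according to claim 70, wherein the pointer is carried by the xlink:href attribute of the video adaptation set element VI.
- The client according to claim 70, wherein the pointer is carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- The client according to claim 70, wherein the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- The client according to claim 73, wherein the pointer is carried by the value attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute of a SupplementalProperty element in the video adaptation set element VI.
- The client according to claim 70, wherein the pointer is carried by an attribute of a virtual Representation element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, wherein the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- The client according to claim 70, wherein the pointer is carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- The client according to any one of claims 53 to 76, wherein the time structure of the navigation media presentation does not depend on the time structure of the primary media presentations pointed to by the K navigation units in the navigation media presentation.
- The client according to any one of claims 53 to 77, wherein the presenting unit is further configured to present the audio component of navigation unit i when the focus of attention rests on navigation unit i of the K navigation units.
- The client according to any one of claims 53 to 78, wherein the presenting unit is further configured to obtain the primary media presentation pointed to by navigation unit i when navigation unit i of the K navigation units is selected.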
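- A server, comprising: a determining unit, configured to determine N navigation units included in a navigation media presentation; and a generating unit, configured to generate a media presentation description of the navigation media presentation, wherein the media presentation description of the navigation media presentation describes the N navigation units included in the navigation media presentation, N being an integer greater than 1; wherein each of the N navigation units points to a primary media presentation, and the presentation quality of the primary media presentation pointed to by navigation unit i of the N navigation units is higher than the presentation quality of navigation unit i.
- The server according to claim 80, wherein the media presentation description of the navigation media presentation is different from the media presentation description of the primary media presentation pointed to by each of the N navigation units.
- The server according to claim 81, wherein each of the N navigation units points to a primary media presentation by pointing to the media presentation description that describes that primary media presentation.
- The server according to claim 80, wherein the media presentation description of the navigation media presentation and the media presentation descriptions of the primary media presentations pointed to by the N navigation units are aggregated to form one aggregated media presentation description.
- The server according to claim 82, wherein each of the N navigation units points to a primary media presentation by referencing a presentation element in the aggregated media presentation description.
- The server according to any one of claims 80 to 84, wherein each of the N navigation units includes a video component, or each of the N navigation units includes an audio component and a video component.
- The server according to claim 85, wherein the video components included in different navigation units of the N navigation units are media representations in different video adaptation sets of N video adaptation sets, wherein the media representations within any one of the N video adaptation sets are mutually exclusive in selection, and different video adaptation sets of the N video adaptation sets are compatible in selection.
- The server according to claim 86, wherein the audio components included in the N navigation units are media representations in an audio adaptation set, the audio adaptation set is different from any one of the N adaptation sets, and the audio adaptation set is compatible in selection with the N video adaptation sets; or the audio components included in different navigation units of the N navigation units are media representations in different audio adaptation sets of N audio adaptation sets, wherein different audio adaptation sets of the N audio adaptation sets are mutually exclusive in selection.
- The server according to claim 87, wherein a media representation element in the audio adaptation set element includes a region description of the region, in the navigation media presentation, with which the media representation it describes is associated.
- The server according to claim 88, wherein media representations described by media representation elements that include the same region description are associated with one another, or adaptation sets described by adaptation set elements that include the same region description are associated with one another.
- The server according to claim 88 or 89, wherein the region description is a spatial relationship description (SRD).
- The server according to any one of claims 86 to 90, wherein the media presentation description of the navigation media presentation includes N video adaptation set elements in one-to-one correspondence with the N video adaptation sets; wherein the N video adaptation set elements include a description sub-element Ci, the video adaptation sets described by those of the N video adaptation set elements that satisfy a set commonality condition are compatible in selection, and the set commonality condition is that the element name and the scheme identifier (schemeIdUri) attribute of the description sub-element Ci included in the video adaptation set elements are both the same.
- The server according to claim 91, wherein the description sub-element Ci describes that the media representations in the video adaptation set described by the video adaptation set element that includes the description sub-element Ci are components of the navigation media presentation.
- The server according to claim 91, wherein the description sub-element Ci describes the role, in the navigation media presentation, of the media representations in the video adaptation set corresponding to the video adaptation set element that includes the description sub-element Ci.
- The server according to claim 92 or 93, wherein the description sub-element Ci is a Role element, an EssentialProperty element, or a SupplementalProperty element.
- The server according to claim 94, wherein, if the description sub-element Ci is a Role element, the set commonality condition is that the element name, the scheme identifier (schemeIdUri) attribute, and the value attribute of the description sub-element Ci included in the video adaptation set elements are all the same.
- The server according to any one of claims 84 to 95, wherein the media presentation description of the navigation media presentation includes the N video adaptation set elements in one-to-one correspondence with N video adaptation sets, wherein the video adaptation set element VI, of the N video adaptation set elements, that corresponds to video adaptation set I includes a pointer for pointing to a primary media presentation, and video adaptation set I is any one of the N video adaptation sets.
- The server according to claim 96, wherein the pointer is carried by an attribute of the video adaptation set element VI.
- The server according to claim 95, wherein the pointer is carried by the xlink:href attribute of the video adaptation set element VI.
- The server according to claim 96, wherein the pointer is carried by an EssentialProperty element or a SupplementalProperty element in the video adaptation set element VI.
- The server according to claim 96, wherein the pointer is carried by a child element of an EssentialProperty element in the video adaptation set element VI, or by an attribute of an EssentialProperty element in the video adaptation set element VI; or the pointer is carried by a child element of a SupplementalProperty element in the video adaptation set element VI, or by an attribute of a SupplementalProperty element in the video adaptation set element VI.
- The server according to claim 100, wherein the pointer is carried by the value attribute of an EssentialProperty element in the video adaptation set element VI, or by the value attribute of a SupplementalProperty element in the video adaptation set element VI.
- The server according to claim 96, wherein the pointer is carried by an attribute of a virtual Representation element in the video adaptation set element VI, or by a child element of a virtual Representation element in the video adaptation set element VI, wherein the virtual Representation element does not include a media segment template element, a media segment list element, or a base uniform resource locator (BaseURL) element.
- The server according to claim 96, wherein the pointer is carried by a referenced media presentation (ReferencedMediaPresentation) element in the video adaptation set element VI.
- The server according to any one of claims 80 to 103, wherein the time structure of the navigation media presentation does not depend on the time structure of the primary media presentations pointed to by the N navigation units in the navigation media presentation.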
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
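KR1020177025344A KR101919726B1 (ko) | 2015-02-15 | 2015-02-15 | Media presentation guide method based on hypertext transfer protocol media stream and related apparatus |
CN201580038222.5A CN106664299B (zh) | 2015-02-15 | 2015-02-15 | Media presentation navigation method based on hypertext transfer protocol media stream and related apparatus |
JP2017542417A JP6478357B2 (ja) | 2015-02-15 | 2015-02-15 | Method and related apparatus for providing a media presentation guide in media streaming over hypertext transfer protocol |
PCT/CN2015/073148 WO2016127440A1 (zh) | 2015-02-15 | 2015-02-15 | Media presentation navigation method based on hypertext transfer protocol media stream and related apparatus |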
EP15881602.5A EP3249873B1 (en) | 2015-02-15 | 2015-02-15 | Media presentation guide method based on hyper text transport protocol media stream and related device |
US15/677,436 US20170374122A1 (en) | 2015-02-15 | 2017-08-15 | Method and Related Apparatus for Providing Media Presentation Guide in Media Streaming Over Hypertext Transfer Protocol |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2015/073148 WO2016127440A1 (zh) | 2015-02-15 | 2015-02-15 | Media presentation navigation method based on hypertext transfer protocol media stream and related apparatus |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/677,436 Continuation US20170374122A1 (en) | 2015-02-15 | 2017-08-15 | Method and Related Apparatus for Providing Media Presentation Guide in Media Streaming Over Hypertext Transfer Protocol |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016127440A1 (zh) | 2016-08-18 |
Family
ID=56615026
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2015/073148 WO2016127440A1 (zh) | 2015-02-15 | 2015-02-15 | Media presentation navigation method based on hypertext transfer protocol media stream and related apparatus |
Country Status (6)
Country | Link |
---|---|
US (1) | US20170374122A1 (zh) |
EP (1) | EP3249873B1 (zh) |
JP (1) | JP6478357B2 (zh) |
KR (1) | KR101919726B1 (zh) |
CN (1) | CN106664299B (zh) |
WO (1) | WO2016127440A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110431848A (zh) * | 2017-03-24 | 2019-11-08 | Sony Corporation | Content providing system, content providing method, and program |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
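WO2016133296A1 (ko) * | 2015-02-16 | 2016-08-25 | LG Electronics Inc. | Broadcast signal transmitting apparatus, broadcast signal receiving apparatus, broadcast signal transmitting method, and broadcast signal receiving method |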
WO2020050550A1 (en) * | 2018-09-03 | 2020-03-12 | Samsung Electronics Co., Ltd. | Methods and systems for performing editing operations on media |
US11895173B2 (en) * | 2022-01-07 | 2024-02-06 | Avago Technologies International Sales Pte. Limited | Gapped and/or subsegmented adaptive bitrate streams |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
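CN102137137A (zh) * | 2010-09-17 | 2011-07-27 | Huawei Technologies Co., Ltd. | Method, apparatus, and system for dynamic insertion of media content based on HTTP streaming |
US20140013003A1 (en) * | 2012-07-09 | 2014-01-09 | Futurewei Technologies, Inc. | Content-Specific Identification and Timing Behavior in Dynamic Adaptive Streaming over Hypertext Transfer Protocol |
CN103974147A (zh) * | 2014-03-07 | 2014-08-06 | Beijing University of Posts and Telecommunications | Online video playback control system based on the MPEG-DASH protocol with bitrate switching control and static summarization |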
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8677005B2 (en) * | 2009-11-04 | 2014-03-18 | Futurewei Technologies, Inc. | System and method for media content streaming |
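CN102055789B (zh) * | 2009-11-09 | 2013-10-09 | Huawei Technologies Co., Ltd. | Method, system, and network device for implementing HTTP-based streaming media service |
CN102055773B (zh) * | 2009-11-09 | 2013-10-09 | Huawei Technologies Co., Ltd. | Method, system, and network device for implementing HTTP-based streaming media service |
KR101709903B1 (ko) * | 2010-02-19 | 2017-02-23 | Telefonaktiebolaget LM Ericsson (publ) | Method and apparatus for adaptation in HTTP streaming |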
US8468262B2 (en) * | 2010-11-01 | 2013-06-18 | Research In Motion Limited | Method and apparatus for updating http content descriptions |
CN109600632B (zh) * | 2011-10-13 | 2020-12-25 | Samsung Electronics Co., Ltd. | Method and apparatus for transmitting and receiving multimedia service |
US9712874B2 (en) * | 2011-12-12 | 2017-07-18 | Lg Electronics Inc. | Device and method for receiving media content |
EP3018912B1 (en) * | 2013-07-02 | 2018-09-12 | Sony Corporation | Content provision device, content provision method, program, terminal device, and content provision system |
US20160373496A1 (en) * | 2013-07-02 | 2016-12-22 | Sony Corporation | Content supply device, content supply method, program, terminal device, and content supply system |
EP3020208B1 (en) * | 2013-07-12 | 2022-03-09 | Canon Kabushiki Kaisha | Adaptive data streaming with push messages control |
JP6493765B2 (ja) * | 2013-07-19 | 2019-04-03 | Sony Corporation | Information processing apparatus and method |
US20150026358A1 (en) * | 2013-07-19 | 2015-01-22 | Futurewei Technologies, Inc. | Metadata Information Signaling And Carriage In Dynamic Adaptive Streaming Over Hypertext Transfer Protocol |
KR20160077067A (ko) * | 2013-10-30 | 2016-07-01 | Sony Corporation | Transmission device, transmission method, reception device, and reception method |
2015
- 2015-02-15 WO PCT/CN2015/073148 (WO2016127440A1, zh): active, Application Filing
- 2015-02-15 KR KR1020177025344A (KR101919726B1, ko): active, IP Right Grant
- 2015-02-15 EP EP15881602.5A (EP3249873B1, en): active, Active
- 2015-02-15 JP JP2017542417A (JP6478357B2, ja): active, Active
- 2015-02-15 CN CN201580038222.5A (CN106664299B, zh): active, Active
2017
- 2017-08-15 US US15/677,436 (US20170374122A1, en): not active, Abandoned
Non-Patent Citations (1)
Title |
---|
"ISO/IEC 23009-1", Part 1: Media Presentation Description and Segment Formats, 15 May 2014 (2014-05-15), pages 16-82, XP055214031 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
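CN110431848A (zh) * | 2017-03-24 | 2019-11-08 | Sony Corporation | Content providing system, content providing method, and program |
CN110431848B (zh) * | 2017-03-24 | 2021-12-21 | Sony Corporation | Content providing system, content providing method, and program |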
Also Published As
Publication number | Publication date |
---|---|
KR101919726B1 (ko) | 2018-11-16 |
EP3249873B1 (en) | 2018-09-12 |
EP3249873A1 (en) | 2017-11-29 |
US20170374122A1 (en) | 2017-12-28 |
KR20170116116A (ko) | 2017-10-18 |
CN106664299B (zh) | 2020-01-17 |
EP3249873A4 (en) | 2017-11-29 |
CN106664299A (zh) | 2017-05-10 |
JP2018510552A (ja) | 2018-04-12 |
JP6478357B2 (ja) | 2019-03-06 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US10187668B2 (en) | Method, system and server for live streaming audio-video file | |
US9294728B2 (en) | System and method for routing content | |
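WO2019024919A1 (zh) | Video transcoding method and apparatus, server, and readable storage medium | |
CN105681912A (zh) | Video playback method and apparatus | |
JP2020519094A (ja) | Video playback method, device, and system | |
WO2018014691A1 (zh) | Method and apparatus for acquiring media data | |
CN107888993B (zh) | Method and apparatus for processing video data | |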
IL230273A (en) | Transmission of reconstruction data in a layered signal quality hierarchy | |
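WO2016127440A1 (zh) | Media presentation navigation method and related apparatus based on Hypertext Transfer Protocol media streaming | |
WO2021143360A1 (zh) | Resource transmission method and computer device | |
WO2016202225A1 (zh) | Content item aggregation method, related apparatus, and communication system | |
CN105142012A (zh) | Method for smart TV live channel list acquisition, channel switching, and same-screen viewing | |
CN109068169A (zh) | Video playback method and apparatus | |
CN109587478A (zh) | Method and apparatus for processing media information | |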
US20110200093A1 (en) | Method and apparatus for transmitting and receiving video and video links | |
WO2008103364A1 (en) | Systems and methods for sending, receiving and processing multimedia bookmarks | |
US10637904B2 (en) | Multimedia streaming service presentation method, related apparatus, and related system | |
CN104185033A (zh) | Method, apparatus, and system for processing multiple television pictures | |
Kaiser et al. | MPEG-DASH enabling adaptive streaming with personalized commercial breaks and second screen scenarios | |
WO2019188485A1 (ja) | Information processing apparatus, information processing apparatus, and program | |
Marfil et al. | Enhancing the broadcasted TV consumption experience with broadband omnidirectional video content | |
Cheong et al. | Interactive terrestrial digital multimedia broadcasting (T-DMB) player | |
WO2019176590A1 (ja) | Information processing apparatus, information processing apparatus, and program | |
EP2744215A1 (en) | Method for streaming AV content and method for presenting AV content | |
JP2016533673A (ja) | Method, apparatus, and system for hidden advertising |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 15881602; Country of ref document: EP; Kind code of ref document: A1 |
 | ENP | Entry into the national phase | Ref document number: 2017542417; Country of ref document: JP; Kind code of ref document: A |
 | NENP | Non-entry into the national phase | Ref country code: DE |
 | REEP | Request for entry into the European phase | Ref document number: 2015881602; Country of ref document: EP |
 | ENP | Entry into the national phase | Ref document number: 20177025344; Country of ref document: KR; Kind code of ref document: A |