WO2016202225A1 - 内容项聚合方法和相关装置及通信系统 - Google Patents

内容项聚合方法和相关装置及通信系统 Download PDF

Info

Publication number
WO2016202225A1
WO2016202225A1 PCT/CN2016/085590 CN2016085590W WO2016202225A1 WO 2016202225 A1 WO2016202225 A1 WO 2016202225A1 CN 2016085590 W CN2016085590 W CN 2016085590W WO 2016202225 A1 WO2016202225 A1 WO 2016202225A1
Authority
WO
WIPO (PCT)
Prior art keywords
media presentation
content item
description
content
offset
Prior art date
Application number
PCT/CN2016/085590
Other languages
English (en)
French (fr)
Inventor
张少波
王新
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to EP16810966.8A priority Critical patent/EP3285455B1/en
Publication of WO2016202225A1 publication Critical patent/WO2016202225A1/zh
Priority to US15/830,516 priority patent/US20180146230A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/26603Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel for automatically generating descriptors from content, e.g. when it is not made available by its provider, using content analysis techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/61Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
    • H04L65/613Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio for the control of the source by the destination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/764Media network packet handling at the destination 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/23424Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving splicing one content stream with another content stream, e.g. for inserting or substituting an advertisement
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/262Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists
    • H04N21/26258Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists for generating a list of items to be played back in a given order, e.g. playlist, or scheduling item distribution according to such list
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/84Generation or processing of descriptive data, e.g. content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/8543Content authoring using a description language, e.g. Multimedia and Hypermedia information coding Expert Group [MHEG], eXtensible Markup Language [XML]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/858Linking data to content, e.g. by linking an URL to a video object, by creating a hotspot
    • H04N21/8586Linking data to content, e.g. by linking an URL to a video object, by creating a hotspot by using a URL
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/61Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
    • H04L65/612Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio for unicast

Definitions

  • the present invention relates to the field of network communication technologies, and in particular, to a content item aggregation method and related apparatus and communication system.
  • HTTP-based media streaming services do not support the aggregation of media content (such as cross-channel broadcast services, etc.), which is a big drawback.
  • Embodiments of the present invention provide a content item aggregation method, a related device, and a communication system, in order to enable flexible aggregation of media content.
  • An embodiment of the present invention provides a content item aggregation method, including:
  • the server generates a media presentation description of the first media presentation, wherein the first media presentation includes a content item, the media presentation description includes a description of the content item or the media presentation description includes a description of the content item Pointing to the information, wherein the description of the content item is for indicating that the content item is from a second media presentation, wherein the first media presentation and the second media presentation are different media presentations;
  • the media presentation description is stored or transmitted.
  • An embodiment of the present invention provides a content item aggregation method, including:
  • the client obtains a media presentation description of the first media presentation, wherein the first media presentation includes a content item, the media presentation description includes a description of the content item or the media presentation description includes a description of the content item Pointing to the information, wherein the description of the content item is for indicating that the content item is from a second media presentation, wherein the first media presentation and the second media presentation are different media presentations;
  • the client acquires the content item according to the description of the content item; the client plays the content item.
  • the embodiment of the invention further provides a server, which may include:
  • a generating unit configured to generate a media presentation description of the first media presentation, wherein the first media presentation includes a content item, the media presentation description includes a description of the content item, or the media presentation description includes the content item Descriptive information of the description, the description of the content item is used to indicate that the content item is from a second media presentation, wherein the first media presentation and the second media presentation are different media presentations;
  • a processing unit configured to store or send the media presentation description.
  • the embodiment of the invention further provides a client, which may include:
  • a first acquiring unit configured to acquire a media presentation description of the first media presentation, where the first media presentation includes a content item, the media presentation description includes a description of the content item, or the media presentation description includes the a description of the content item, the description of the content item is used to indicate that the content item is from a second media presentation, wherein the first media presentation and the second media presentation are different media presentations;
  • a second acquiring unit configured to acquire the content item according to the description of the content item
  • a playing unit for playing the content item.
  • the embodiment of the invention further provides a server, which may include: a processor and a memory.
  • the client can also include a network interface.
  • the memory is used to store instructions
  • the processor is configured to execute the instructions
  • the network interface is configured to communicate with other devices under the control of the processor.
  • a processor for generating a media presentation description of the first media presentation wherein the first media presentation includes a content item, the media presentation description including a description of the content item or the media presentation description includes the content
  • the description of the item points to the information, the description of the content item is used to indicate that the content item is from a second media presentation, wherein the first media presentation and the second media presentation are different media presentations; Sending the media presentation description.
  • the embodiment of the invention further provides a client, which may include: a processor and a memory.
  • the client can also include a network interface.
  • the memory is used to store instructions
  • the processor is configured to execute the instructions
  • the network interface is configured to communicate with other devices under the control of the processor.
  • a processor for obtaining a media presentation description of the first media presentation wherein the first media presentation includes a content item, the media presentation description including a description of the content item or the media presentation description includes the content
  • the description of the item is directed to the information, the description of the content item is used to indicate that the content item is from a second media presentation, wherein the first media presentation and the second media presentation are different media presentations; Describe the content item to obtain the content item; play the content item.
  • the description of the content item is further used to indicate that the second media presentation is a real-time media presentation or a non-real-time media presentation.
  • the description of the content item is further used to indicate a temporal location of the content item embedded in the first media presentation.
  • the description of the content item is further for indicating that part or all of the content item is embedded in the first media presentation.
  • the description of the content item when the description of the content item is further used to indicate that a portion of the content item is embedded in the first media presentation, the description of the content item is further used to indicate The initial play time position and/or the end play time position of the portion of the content item.
  • the description of the content item includes an offset indication fz for indicating an offset between a start play time position and a start content time position of the content item The amount is offset.
  • the offset indication fz indicates an offset offset equal to 0, indicating that the content item will be a content location corresponding to the current time Starting to play; or when the second medium is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the content item will be offset from the current time by offset offset Corresponding content location starts playing; or when the second media is presented as real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the content item will be from the content item The starting content position is shifted backward by the content position corresponding to the offset offset to start playing.
  • the offset indication fz indicates an offset offset equal to 0, indicating that the content item is to be from the content item Starting content position starts playing; or when the second medium is presented as non-real time media presentation, and the offset indication fz indicates that the offset amount offset is not equal to 0, indicating that the content item will be from the Content item
  • the starting content position is shifted backward by the content position corresponding to the offset offset to start playing.
  • the description of the content item is included in an aggregation method descriptor of the media presentation description, or the orientation information of the description of the content item is included in an aggregation method descriptor of the media presentation description.
  • the first media presentation is an aggregated media presentation
  • the media presentation is described as an aggregated media presentation description
  • the aggregated media presentation description includes N media presentation description elements, the N being greater than 1 or An integer equal to 1, the first media presentation description element being one of the N media presentation description elements included in the aggregate media presentation description, wherein the description of the content item is included in the Pointing information in a media presentation description element or a description of the content item is included in the first media presentation description element.
  • the aggregate media presentation description further includes a time window indication corresponding to the first media presentation description element, wherein the time window indication is used to indicate that the client indicates the indicated time window in the time window And acquiring, from the server, the updated content of the aggregated media presentation description, where the updated content includes the first media presentation description element.
  • the content item is a content paragraph or a media expression or an adaptive set.
  • the present invention further provides a communication system, including any of the clients provided by the embodiments of the present invention and any server provided by the embodiments of the present invention.
  • an embodiment of the present invention further provides a computer readable storage medium storing program code for content item aggregation performed by a server.
  • the program code includes instructions for executing a method executed by a server.
  • an embodiment of the present invention further provides a computer readable storage medium storing program code for content item aggregation performed by a client.
  • the program code includes instructions for executing a method performed by a client.
  • the content item included in the media presentation may be from other media presentations different from the media presentation, that is, some or all of the content items of other media presentations may be performed.
  • the technical solution of the embodiment facilitates flexible aggregation of media content.
  • FIG. 1 is a schematic diagram of a structure of a DASH according to an embodiment of the present invention.
  • FIG. 2 is a schematic flowchart of a content item aggregation method according to an embodiment of the present invention
  • FIG. 3 is a schematic flowchart of another content item aggregation method according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of a network architecture according to an embodiment of the present invention.
  • FIG. 4 is a schematic flowchart of another content item aggregation method according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of a content item that is aggregated in different media presentations according to an embodiment of the present invention.
  • FIG. 5-a is a schematic diagram of a timing arrangement of an aggregated content item according to an embodiment of the present invention.
  • FIG. 5-b is a schematic diagram of a data structure of an AMPD described by using an XML data rule according to an embodiment of the present invention
  • FIG. 5-c is a schematic diagram showing another data structure of an MPD described by using an XML data rule according to an embodiment of the present invention.
  • FIG. 5-d is a schematic diagram of another data structure of an MPD described by an XML data rule according to an embodiment of the present invention.
  • FIG. 5-e is a schematic diagram showing a time relationship between another content items according to an embodiment of the present invention.
  • FIG. 5-f is a schematic diagram showing another data structure of an AMPD described by an XML data rule according to an embodiment of the present invention.
  • FIG. 5-g is a schematic diagram showing another data structure of an AMPD described by an XML data rule according to an embodiment of the present invention.
  • FIG. 6 is a schematic diagram of a server provided by an embodiment of the present invention.
  • FIG. 7 is a schematic diagram of a client according to an embodiment of the present invention.
  • FIG. 8 is a schematic diagram of another client according to an embodiment of the present invention.
  • FIG. 9 is a schematic diagram of another client provided by an embodiment of the present invention.
  • Embodiments of the present invention provide a content item aggregation method, a related device, and a communication system, in order to enable flexible aggregation of media content.
  • Broadcasting is a traditional way of transmitting media content. Both radio and television stations transmit audio and video through wireless broadcasting, while cable television carries broadcast signals over wired cables.
  • the former has improved the level of communication services, and the latter has enhanced the capabilities of personal devices.
  • the transmission of multimedia through online media streaming services over the Internet is becoming more and more common. .
  • online media streaming services better meet the different needs of people for media content. Users can make on-demand selection of acquired media content when needed, which changes the user's one-way. And passive reception.
  • the HTTP-based Dynamic Adaptative Streaming Over HTTP (DASH) service is a mainstream technology for multimedia streaming services and represents a recent development in this field.
  • Microsoft's Smooth Streaming (SS) and the Moving Picture Experts Group (MPEG)
  • Dynamic Adaptative Streaming Over HTTP (DASH)
  • Apple's HTTP service HLS, HTTP Live Streaming
  • MPEG DASH The standard is a standardized technology developed by MPEG and is expected to be widely adopted to change the fragmented market landscape.
  • the current DASH specification defines the format of the media segment and media presentation description, and the media presentation description can also be referred to as the format of the media presentation description file.
  • a media segment is a packaged form of media presentation for storage and access of media expressions, and a media presentation description is used to describe a media presentation.
  • the so-called media presentation refers to a piece of media content that is sequential in time.
  • a media presentation can be equivalent to a TV show or a TV show channel.
  • DASH can only describe one media presentation, and can not simultaneously describe multiple parallel media presentations for users to select, such as multiple TV channels simultaneously displayed as a program channel guide in a television service. .
  • the timing of different media presentations is different and interlaced, and such a time structure cannot be described in DASH. Therefore, conventional DASH cannot conveniently implement time-parallel content aggregation.
  • DASH For chronological content aggregation, DASH is also insufficient. To generate a new media presentation description, DASH has n+1 media presentation descriptions. In addition, if there is other manifestation in content aggregation: re-arrange the different media presentations in time to form a new media presentation - in a TV program channel, different programs are arranged in chronological order. The channel provider should splicing the program content.
  • DASH can describe temporally sequential media content, if the source of the media content is different, in the process of content aggregation, each media presentation description needs to be processed to generate a description file for the merged media presentation through the content aggregation. In the process of merging, it is necessary to process the time presented by each media, and adopt a consistent time description, which is prone to errors.
  • a media presentation can consist of one or more content paragraphs (Period).
  • a content paragraph includes a media content, which is continuous in time, and all aspects of the content of the media are consistent, such as coding, language and content protection.
  • the media content exists in the form of a media code representation, and the coded representation is divided into adaptation sets according to attributes, such as media components.
  • the media code representations within the adaptation set are different coded versions of the same media component of the same media content and are interchangeable.
  • the content passages are sequential in time, and different media content can be stitched together in time through the content passages. For example, the previous content paragraph is a news program, and the next content paragraph is an advertisement.
  • the beginning of a content paragraph means that the content paragraph is compared to the previous content segment
  • Some aspects of the change such as: content from news programs to sports programs; video coding from H.264 to H.265; increased subtitles as a media component; increased English audio and so on.
  • the client encounters the beginning of a new content paragraph, the client is reconfigured - the choice of media components, the range of adaptation (the code rate of the encoded representation of the media), the initialization of the decoder, and so on.
  • the content paragraphs are sequential in time, one content paragraph ends, the next content paragraph begins, and the two paragraphs do not coincide in time.
  • DASH has no way to describe multiple media presentations that are parallel in time.
  • a media content is encoded into multiple versions, each version having different characteristics, such as code rate. These versions are called media expressions in DASH, and they represent the same media content, from content presentation (viewing/playing). Angles are alternative to each other.
  • a media expression is divided into accessible units in time, usually a number of seconds, called a media segment or a media sub-segment (a media segment can be logically divided into media sub-segments).
  • media clips initialization clips are called clips.
  • the media expression is stored on the content server (such as HTTP server) for client to obtain, and the fragment is the smallest unit that the client can access through the URL.
  • the Media Presentation Description is an extensible Markup Language (XML) file.
  • the MPD includes the metadata required by the client, describes the characteristics of the media expression, and how to obtain the media from the server.
  • the expression includes a code rate, a resolution, an aspect ratio of the video image, a Uniform Resource Locator (URL) of the segment included in the media expression, and the like.
  • the client constructs an HTTP URL to request media segments in the media presentation from the content server, and can switch to other media representations at the media segment boundaries to accommodate changes in available bandwidth.
  • FIG. 1 illustrates an example of a DASH structure.
  • HTTP-based adaptive streaming service allows one Changes in the characteristics of the content in the media presentation, such as changes in the way the media is encoded. In DASH, this is achieved through the concept of the so-called "Period", which is used for content stitching, such as the previous content paragraph is a news program, and the next content paragraph is an advertisement.
  • the HTTP-based adaptive media streaming service allows for changes in content characteristics in a media presentation, such as changes in media encoding.
  • Period is used for splicing of content.
  • the previous content paragraph is a news program
  • the next content paragraph is an advertisement.
  • a media presentation includes one or more content paragraphs, which are sequential in time.
  • the beginning of a content paragraph means that there are some changes compared to the previous content paragraph, such as changes in content, such as News programs to sports programs, from sports programs to movie programs, from movie programs to advertisements, from advertisements to variety shows, etc.; changes in the way content is encoded, for example, from H.264 encoding schemes to H.265 encoding schemes; Changes in the amount of media expression, for example, can increase or decrease media expression; changes in content components, such as increased audio expression in Chinese, and the like. Among them, when the client encounters the beginning of a new content paragraph, the client working conditions have changed and may have to be reinitialized.
  • an adaptation set includes at least one media representation
  • media representations in an adaptation set have mutual substitution.
  • Different adaptation sets may be compatible or repulsive.
  • the media presentation may include one or more temporally sequential content paragraphs, each content paragraph including one or more Adaptation Sets.
  • each adaptation set includes one or more media representations (Representations).
  • One of the media expressions includes one or more segments.
  • the media presentation description can have a hierarchical structure similar to the media presentation.
  • the concept of media presentation described above may be represented by an XML element in the media presentation description, the media presentation element includes one or more content paragraph elements, and each content paragraph element includes one or more adaptation sets ( AdaptationSet) element.
  • Each AdaptationSet element includes one or more Representation elements.
  • the media presentation corresponds to a media presentation description element in the media presentation description
  • a content paragraph in the media presentation corresponds to a content paragraph element in the media presentation description
  • an adaptation set in the media presentation corresponds to an appropriate one in the media presentation description Matching element, a media expression in the media presentation A media expression element in the media presentation description, and so on.
  • the embodiment of the present invention provides a content item aggregation method, which may include: the server generates a media presentation description of the first media presentation, where the first media presentation includes a content item, where the media presentation description includes the content a description of the item, or the media presentation description includes pointing information of the description of the content item, the description of the content item is used to indicate that the content item is from a second media presentation, the first media presentation and the The second medium is presented as a different media presentation; the media presentation description is stored or transmitted.
  • FIG. 2 is a schematic flowchart diagram of a content item aggregation method according to an embodiment of the present invention. As shown in FIG. 2, an embodiment of the present invention provides a content item aggregation method. include:
  • the server generates a media presentation description of the first media presentation.
  • the first media presentation includes a content item (for convenience of reference, the content item may be referred to as a first content item below).
  • the media presentation description includes a description of the first content item or the media presentation description includes pointing information of the description of the first content item.
  • the description of the first content item is for indicating that the first content item is from a second media presentation, wherein the first media presentation and the second media presentation are different media presentations.
  • the first content item may be one of N content items included in the first media presentation, and the N is an integer greater than 1 or equal to 1.
  • the first media presentation further includes a second content item (the first content item and the second content item are different content items), the media presentation description including the description of the second content item or the media presentation description includes The pointing information of the description of the second content item, wherein the description of the second content item is used to indicate that the second content item is from a second media presentation or media presentation X.
  • the N can be equal to 1, 2, 3, 4, 5, 6, 8, 10, 15, 19, 21, 30, 500 or other values.
  • the pointing information of the description of the first content item is used to point to a description of the first content item, for example, the pointing information of the description of the first content item may include a description of the first content item.
  • the description of the first content item may be obtained by using the pointing information of the description of the first content item.
  • the first content item may be, for example, a content paragraph or a media representation (Representation) or an adaptation set (AdaptationSet) or other forms of media content.
  • the server generates a media presentation description of the first media presentation after receiving the program play request from the client, and of course, the server may also generate the first trigger under other conditions.
  • the media presentation of the media presentation may be generated.
  • the server stores or sends a media presentation description of the first media presentation.
  • the server may send the media presentation description to the client, for example, the client may further acquire the first content item according to the description of the first content item; the client may further play the first A content item.
  • the first content item included in the first media presentation may be from the second media presentation, that is, the content items presented by other media may be re-aggregated to form A new media presentation that satisfies a specific orchestration needs, and the media presentation description of the new media presentation includes a description of the content items presented by other media that are aggregated, so that the client can perform acquisition and playback of the corresponding content items, and the like.
  • the technical solution in this embodiment facilitates flexible aggregation of media content.
  • the description of the first content item further includes a time indication Sd for indicating a play start time of the first content item.
  • the time indication Sd can be the @Start attribute or the @Start element.
  • the description of the second content item further includes a time indication Se for indicating a play start time of the second content item, where the time indication Se indicates The play start time of the second content item is equal to the play end time of the first content item, or the play start time of the second content item indicated by the time indication Se is later than The play end time of the first content item, and the time difference ⁇ t between the play start time of the second content item and the play end time of the first content item is less than a threshold.
  • the description of the first content item is further used to
  • the second media is indicated as being presented as a real-time media presentation or a non-real-time media presentation.
  • the real-time media presentation is, for example, a live broadcast media presentation, such as a live sports game or a live show variety show. Instead of real-time media presentation, this media presentation has been previously recorded or otherwise made available.
  • the non-real time media presentation can be, for example, a previously recorded television show, movie, sports game, variety show, and the like.
  • the description of the first content item is further used to indicate a time position in which the first content item is embedded in the first media presentation.
  • the time position in which the first content item is embedded in the first media presentation, that is, the first content item is arranged at the time position of the first media presentation.
  • the description of the first content item is further used to indicate that part or all of the first content item is embedded in the first media presentation. That is, the description of the first content item may be further used to indicate that all of the first content item is embedded in the first media presentation, and the description of the first content item may also be used to indicate the first content A portion of the item is embedded in the first media presentation, and the "part" of the first content item can be viewed from different dimensions such as time, content, etc., for example, if the first content item is an AdaptationSet, then the first content item is The description indicates that a portion of the first content item is embedded in the first media presentation, and a partial version of the AdaptationSet and/or a partially intercepted media representation is embedded in the first media presentation, such as an AdaptationSet A media expression including 15 versions of a duration of 15 minutes, for example, in one case, the description of the first content item may indicate that 2 of the 5 versions of the media expression have a duration of 15
  • the description of the first content item when the description of the first content item is further used to indicate that a part of the first content item is embedded in the first media presentation, Said The description of the first content item can also be used to indicate a starting play time position and/or an end play time position of the portion of the first content item.
  • the description of the first content item may be further used to indicate that a starting play time position of the portion of the first content item is a start content time position of the first content item, or the first The content time position of the content item is offset by the content position after five minutes, and the like.
  • the description of the first content item includes an offset indication fz, where the offset indication fz is used to indicate a starting play time position of the first content item. Offset offset from the start content time position.
  • the offset indication fz indicates an offset offset equal to 0, indicating the first content
  • the item starts playing from the content position corresponding to the current time; or when the second medium is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating the first content
  • the item starts playing from the content position corresponding to the current time back offset offset; or when the second medium is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, It is indicated that the first content item starts to play back from the content content position of the first content item and the content position corresponding to the offset offset.
  • the offset indication fz indicates an offset offset equal to 0, indicating the first The content item will start playing from the starting content location of the first content item; or when the second medium is presented as a non-real time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0 And indicating that the first content item starts to play back from the content content position of the first content item and the content position corresponding to the offset offset.
  • the description of the first content item is included in an aggregation method descriptor of the media presentation description, or the orientation information of the description of the first content item is included in The media presentation is described in the description of the aggregation method.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1
  • the first media presentation may be an aggregate media presentation or a normal media presentation.
  • the media presentation description can be an aggregate media presentation description or a generic media presentation description.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1
  • the first media presentation description element is one of the N media presentation description elements included in the aggregate media presentation description, where the A description of a content item included in the first media presentation description element or a pointing information of a description of the first content item is included in the first media presentation description element.
  • the aggregated media presentation description further includes a time window indication corresponding to the first media presentation description element (the time window indication may include, for example, an attribute @expriy and an attribute @timeAdvance), where The time window indication is used to instruct the client to obtain the updated content of the aggregated media presentation description from the server within the time window indicated by the time window indication, wherein the update content includes the first A media presentation descriptor element. Since the time window indication is introduced to limit the period during which the client updates the aggregated media presentation description, it is advantageous to better control the content playback of the client.
  • An embodiment of the present invention provides a content item aggregation method, which may include: a client acquiring a media presentation description of a first media presentation, where the first media presentation includes a first content item, where the media presentation description includes the The description of the first content item or the media presentation description includes pointing information of the description of the first content item, the description of the first content item is used to indicate that the first content item is from a second media presentation; The client obtains the first content item according to the description of the first content item; the client plays the first content item.
  • FIG. 3 is a schematic flowchart of a content item aggregation method according to an embodiment of the present invention, wherein, as exemplified in FIG. 3, an embodiment of the present invention provides a content item aggregation.
  • Methods can include:
  • the client obtains a media presentation description of the first media presentation.
  • the first media presentation includes a content item (which may be referred to as a first content item below for convenience of reference), and the media presentation description includes a description of the first content item or the media presentation description. Included in the description of the first content item, the description of the first content item is used to indicate that the first content item is from a second media presentation. The first media presentation and the second media are presented as different media presentations.
  • the first content item is one of N content items included in the first media presentation, and the N is an integer greater than 1 or equal to 1.
  • the first media presentation further includes a second content item (the second content item is different from the first content item), the media presentation description including the description of the second content item or the media presentation description including the second The pointing information of the description of the content item, wherein the description of the second content item is used to indicate that the second content item is from the second media presentation or media presentation X.
  • the pointing information of the description of the first content item is used to point to a description of the first content item, for example, the pointing information of the description of the first content item may include a description of the first content item. Pointer or URL etc.
  • the description of the first content item may be obtained by using the pointing information of the description of the first content item.
  • the content item (for example, the first content item or the second content item may be, for example, a content paragraph or a media representation (Representation) or an adaptation set (AdaptationSet) or Other forms of media content.
  • the N can be equal to 1, 2, 3, 4, 5, 6, 8, 10, 15, 19, 21, 30, 500 or other values.
  • the server generates a media presentation description of the first media presentation after receiving the program play request from the client, and of course, the server may also generate the first trigger under other conditions.
  • the media presentation of the media presentation may be generated.
  • the client acquires the first content item according to the description of the first content item.
  • the client plays the first content item.
  • the first content item included in the first media presentation may be from a second media presentation, that is, the content items presented by other media may be re-re Aggregating the arrangement to form a new media presentation that meets the specific programming needs, and the media presentation description of the new media presentation includes a description of the aggregated other media presented content items, such that the client can retrieve and play the corresponding content item accordingly. Wait.
  • the technical solution in this embodiment facilitates flexible aggregation of media content.
  • the description of the first content item is further used to indicate that the second media presentation is a real-time media presentation or a non-real-time media presentation.
  • the real-time media presentation is, for example, a live broadcast media presentation, such as a live sports game or a live show variety show. Instead of real-time media presentation, this media presentation has been previously recorded or otherwise made available.
  • the non-real time media presentation can be, for example, a previously recorded television show, movie, sports game, variety show, and the like.
  • the description of the second content item further includes a time indication Se for indicating a play start time of the second content item, where the time indication Se indicates The play start time of the second content item is equal to the play end time of the first content item, or the play start time of the second content item indicated by the time indication Se is later than The play end time of the first content item, and the time difference ⁇ t between the play start time of the second content item and the play end time of the first content item is less than a threshold.
  • the description of the first content item is further used to indicate a time position in which the first content item is embedded in the first media presentation.
  • the time position in which the first content item is embedded in the first media presentation, that is, the first content item is arranged at the time position of the first media presentation.
  • the description of the first content item is further used to indicate that part or all of the first content item is embedded in the first media presentation. That is, the description of the first content item may be further used to indicate that all of the first content item is embedded in the first media presentation, and the description of the first content item may also be used to indicate the first content A portion of the item is embedded in the first media presentation, and the "part" of the first content item can be viewed from different dimensions such as time, content, etc., for example, if the first content item is an AdaptationSet, then the first content item is The description indicates that a portion of the first content item is embedded in the first media presentation, and a partial version of the AdaptationSet and/or a partially intercepted media representation is embedded in the first media presentation Now, for example, the AdaptationSet includes 5 versions of the media expression of a duration of 15 minutes, for example, in one case, the description of the first content item may indicate 2 of the 5 versions of the media expression.
  • Media expressions each having a duration of 15 minutes are embedded in the first media presentation, for example, in another case, the description of the first content item may indicate 3 of the media expressions of the 5 versions.
  • the media expression of the version having a duration of 12 minutes ie, the 12-minute media expression intercepted from the 15-minute media expression
  • the description of the first content item may indicate that the mediation of the five versions of the five versions of the media expression is 12 minutes (ie, the media expression of 12 minutes is intercepted from the 15-minute media expression) Embedded in the first media presentation.
  • the description of the first content item when the description of the first content item is further used to indicate that a part of the first content item is embedded in the first media presentation,
  • the description of the first content item may also be used to indicate a starting play time position and/or an end play time position of the portion of the first content item.
  • the description of the first content item may be further used to indicate that a starting play time position of the portion of the first content item is a start content time position of the first content item, or the first The content time position of the content item is offset by the content position after five minutes, and the like.
  • the description of the first content item includes an offset indication fz, where the offset indication fz is used to indicate a starting play time position of the first content item. Offset offset from the start content time position.
  • the offset indication fz indicates an offset offset equal to 0, indicating the first content
  • the item starts playing from the content position corresponding to the current time; or when the second medium is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating the first content
  • the item starts playing from the content position corresponding to the current time back offset offset; or when the second medium is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, It is indicated that the first content item starts to play back from the content content position of the first content item and the content position corresponding to the offset offset.
  • the offset indication fz indicates that the offset offset is equal to 0, indicating that the first content item will start playing from the starting content position of the first content item; or when the second When the media presentation is a non-real time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the first content item is to be backward from the initial content location of the first content item Offset the content position corresponding to the offset offset to start playing.
  • the description of the first content item is included in an aggregation method descriptor of the media presentation description, or the orientation information of the description of the first content item is included in The media presentation is described in the description of the aggregation method.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1
  • the first media presentation description element is one of the N media presentation description elements included in the aggregate media presentation description, where the A description of a content item included in the first media presentation description element or a pointing information of a description of the first content item is included in the first media presentation description element.
  • the first media presentation may be an aggregate media presentation or a normal media presentation.
  • the media presentation description can be an aggregate media presentation description or a generic media presentation description.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1
  • the first media presentation description element is one of the N media presentation description elements included in the aggregate media presentation description, where the A description of a content item included in the first media presentation description element or a pointing information of a description of the first content item is included in the first media presentation description element.
  • the aggregate media presentation description further includes a time window indication corresponding to the first media presentation description element (where the time window indication may include, for example, an attribute @expriy and an attribute @timeAdvance, That is, the attribute @expriy and the attribute @timeAdvance may indicate a time window), wherein the time window indication is used to indicate a customer Ending, in the time window indicated by the time window indication, obtaining, from the server, the updated content of the aggregated media presentation description, the update content including the first media presentation description element. Since the time window indication is introduced to limit the period during which the client updates the aggregated media presentation description, it is advantageous to better control the content playback of the client.
  • FIG. 4-b is a schematic flowchart of another content item aggregation method according to another embodiment of the present invention, where the content item aggregation method shown in FIG. 4-b is shown. It can be implemented under the network architecture as shown in Figure 4-a. As shown in the example of Figure 4-a, another content item aggregation method provided by another embodiment of the present invention may include:
  • the client sends a play request to the server.
  • the server receives the play request from the client.
  • the server generates a media presentation description of the first media presentation.
  • the server refers to a device that provides services on the network side, including but not limited to a server, a CDN node, or a login server.
  • the server may be a device, and the server may be a plurality of different devices. They are considered as a whole in the present invention.
  • the server sends the media presentation description for responding to the program request to the client.
  • the first media presentation includes a first content item, wherein the media presentation description includes a description of the first content item or the media presentation description includes pointing information of a description of the first content item, where The description of the first content item is used to indicate that the first content item is from a second media presentation.
  • the first media presentation and the second media are presented as different media presentations.
  • the client receives a media presentation description of the first media presentation from the server, and the client acquires the first content item according to the description of the first content item.
  • the client plays the first content item.
  • the first content item may be one of N content items included in the first media presentation, and the N is an integer greater than 1 or equal to 1.
  • the first media presentation further includes a second content item, the media presentation description including a description of the second content item or the media presentation description including the second internal item
  • the pointing information of the description of the content wherein the description of the second content item is used to indicate that the second content item is from the second media presentation or media presentation X.
  • Figure 4-c illustrates one possible source of each content item in the first media presentation, wherein some of the content items are from real-time media presentations, and some of the content items may be from non-real-time media.
  • another source of each content item in the first media presentation may be that all content items are from real-time media presentation.
  • another source of each content item in the first media presentation may be that all content items are from non-real time media presentation.
  • the pointing information of the description of the first content item is used to point to a description of the first content item, for example, the pointing information of the description of the first content item may include a description of the first content item. Pointer or URL etc.
  • the description of the first content item may be obtained by using the pointing information of the description of the first content item.
  • the first content item may be, for example, a content paragraph or a media representation (Representation) or an adaptation set (AdaptationSet) or other forms of media content.
  • the content can be played in a manner similar to the acquisition and playback of the first content item.
  • the description of the first content item is further used to indicate that the second media presentation is a real-time media presentation or a non-real-time media presentation.
  • the real-time media presentation is, for example, a live broadcast media presentation, such as a live sports game or a live show variety show. Instead of real-time media presentation, this media presentation has been previously recorded or otherwise made available.
  • the non-real time media presentation can be, for example, a previously recorded television show, movie, sports game, variety show, and the like.
  • the description of the first content item is further used to indicate a time position in which the first content item is embedded in the first media presentation.
  • the time position in which the first content item is embedded in the first media presentation, that is, the first content item is arranged at the time position of the first media presentation.
  • the description of the first content item is further used to indicate that part or all of the first content item is embedded in the first media presentation. That is the first A description of a content item can also be used to indicate that all of the first content item is embedded in the first media presentation, and the description of the first content item can also be used to indicate that a portion of the first content item is Embedded in the first media presentation, the "part" of the first content item may be viewed from different dimensions such as time, content, etc., for example, if the first content item is an AdaptationSet, then if the description of the first content item indicates A portion of the first content item is embedded in the first media presentation, a partial version of the AdaptationSet and/or a partially intercepted media representation is embedded in the first media presentation, eg, the AdaptationSet includes 5 versions The duration is 15 minutes of media expression.
  • the description of the first content item may indicate that the mediation of the two versions of the media expressions of the five versions is 15 minutes.
  • the description of the first content item may indicate that the duration of three of the five versions of the media expression is A 12-minute media expression (ie, a 12-minute media expression intercepted from a 15-minute media presentation) is embedded in the first media presentation, such as in another case, a description of the first content item Media expressions indicating that 5 of the 5 versions of the media expressions are 12 minutes in length (ie, 12 minutes of media expressions intercepted from the 15-minute media expression) are embedded in the first Media presentation.
  • the description of the first content item when the description of the first content item is further used to indicate that a part of the first content item is embedded in the first media presentation,
  • the description of the first content item may also be used to indicate a starting play time position and/or an end play time position of the portion of the first content item.
  • the description of the first content item may be further used to indicate that a starting play time position of the portion of the first content item is a start content time position of the first content item, or the first The content time position of the content item is offset by the content position after five minutes, and the like.
  • the description of the first content item includes an offset indication fz, where the offset indication fz is used to indicate a starting play time position of the first content item. Offset offset from the start content time position.
  • the offset indication fz indicates an offset offset equal to 0, indicating the first content The item will start playing from the content location corresponding to the current time; or when the second media is presented as a real-time medium
  • the offset indication fz indicates that the offset offset is not equal to 0, indicating that the first content item starts to play from the content position corresponding to the current time back offset offset; or when When the second medium is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the first content item will be backward from the initial content position of the first content item Offset the content position corresponding to the offset offset to start playing.
  • the offset indication fz indicates an offset offset equal to 0, indicating the first The content item will start playing from the starting content location of the first content item; or when the second medium is presented as a non-real time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0 And indicating that the first content item starts to play back from the content content position of the first content item and the content position corresponding to the offset offset.
  • the description of the first content item is included in an aggregation method descriptor of the media presentation description, or the orientation information of the description of the first content item is included in The media presentation is described in the description of the aggregation method.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1
  • the first media presentation description element is one of the N media presentation description elements included in the aggregate media presentation description, where the A description of a content item included in the first media presentation description element or a pointing information of a description of the first content item is included in the first media presentation description element.
  • the first media presentation may be an aggregate media presentation or a normal media presentation.
  • the media presentation description can be an aggregate media presentation description or a generic media presentation description.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1
  • the first media presentation description element is one of the N media presentation description elements included in the aggregate media presentation description, where the a description of a content item included in the first media presentation description
  • the pointing information in the element or the description of the first content item is included in the first media presentation description element.
  • the aggregate media presentation description further includes a time window indication corresponding to the first media presentation description element (where the time window indication may include, for example, an attribute @expriy and an attribute @timeAdvance, That is, the attribute @expriy and the attribute @timeAdvance may indicate a time window), wherein the time window indication is used to instruct the client to acquire from the server within the time window indicated by the time window indication
  • the aggregated media presents the updated content of the description, the updated content including the first media presentation description element. Since the time window indication is introduced to limit the period during which the client updates the aggregated media presentation description, it is advantageous to better control the content playback of the client.
  • the content item included in the media presentation may be from another media presentation, that is, the content items presented by other media may be re-aggregated to form a content that meets specific programming requirements.
  • the new media presentation, and the media presentation description of the new media presentation includes descriptions of the content items presented by other media that are aggregated, so that the client can perform acquisition and playback of the corresponding content items according to the content.
  • the technical solution in this embodiment facilitates flexible aggregation of media content.
  • the media presentation unit is a different media content, which constitutes a media component of the media presentation, a coding of the media component, a storage location, a media presentation description, and the like. They are parallel or sequential in time.
  • the aggregated media presentation description is a metadata file that describes the media presentation units in the aggregated media presentation and the relationships between them. It is an extension of the media presentation description (file).
  • the root element of the aggregated media presentation description is the Aggregate Media Presentation Description Element (AMPD), whose two attributes @expiry and @timeAdvance are updates for the aggregated media presentation description, usually at the time Over time, the composite media presentation description is updated to describe the changes in the aggregated media presentation, particularly the temporal extension of the aggregated media presentation.
  • @expiry indicates the validity period of the aggregated media presentation, which is expressed in terms of wall clock time, and the content of the AMPD aggregated media presentation description is valid before the validity period.
  • @timeAdvance indicates the amount of time advancement of the update described in the aggregated media presentation, ie, the aggregated media presentation describes the earliest update time.
  • These two attributes combine to define a time window, that is, the time period from texp-tadv to texp, where texp represents the value of @expiry and tadv is the value of @timeAdvance.
  • a syntax element MediaPresentation is introduced, and the media presentation unit element represents a media presentation unit.
  • Aggregated media presentations describe a set of media presentation units and the temporal relationship between them.
  • the source media presentation can be local, where the MediaPresentation element contains an MPD element and the MPD element contains at least one Period element.
  • a pointer can be used to point to the referenced media presentation description, such as the @xlink:href attribute.
  • the reference may be all or part of, that is, one of the pointed media presentations or a plurality of consecutive content paragraphs, and the referenced content paragraph may be described by the attribute @periodId.
  • a piece of media content is contiguous in time, with a time frame in which the time is the media time to the media content, regardless of the wall clock time.
  • the media time can be used to locate the (time) location in the media content.
  • the media content time can be mapped to absolute time during playback.
  • the temporal position and absolute time of the media content are fixedly corresponding; but once the time passes, the media time and the absolute time no longer have a fixed correspondence.
  • Media content can move in time.
  • the user can join at any time at the current time location of the live media content or at a time location prior to the current location. If the media content can be stored, then the user obtains the media content of the past time (on the absolute time axis).
  • the movement of the media content in absolute time can be represented by two attributes: the start time @startTime represents a moment in absolute time, ie a media content is started from this moment.
  • the offset @timeOffset represents the temporal position of the media content. For live broadcast, it is relative to the current (absolute time axis on the time) time position of the media content. Since only the past content can be accessed, the offset value is smaller than Equal to 0; for on-demand, @startTime is the relative time position relative to the beginning of this media content, the offset value is less than or equal to 0. This way the client behaves differently during live and on-demand.
  • the client joins the live media content at @startTime.
  • the time of the accessed media content is the media content of @startTime+@timeOffset at this moment; when the client is on demand, the client adds the on-demand media content at @startTime.
  • the time position of the media content starts with @timeOffset.
  • Content aggregation is essentially the movement of media content over absolute time (axis) plus the temporal offset of media content.
  • An example of the above relationship is illustrated in Figure 5-a.
  • the following example is an expression of an aggregated media presentation description, expressed by a hierarchically subordinated data structure, an element containing several attributes and low-level elements, each layer being such that one layer is nested.
  • the aggregated media presentation describes the meaning of an expression element and attribute of AMPD as follows:
  • @timeAdvance used to indicate the time advance of the update of the aggregated media presentation description, ie the aggregated media update describes the earliest time update time, which is the time indicated relative to @expiry, which may be present only when the @expiry attribute is present.
  • Presentation used to describe a media presentation.
  • @type used to indicate whether the media presentation is live (real-time generated) or on-demand (existing, non-real-time).
  • @startTime used to indicate the start time of the media rendering unit. This property will appear if it is a sequential merge.
  • live media presentation it is a (forward) time offset relative to the media time position of the media presentation unit at @startTime.
  • time offset relative to the starting position of the media presentation unit.
  • @periodId if there is more than one Period in the MPD pointed to, @periodId indicates the selected period.
  • @xlink:actuate used to indicate the processing of the media presentation description pointed to by @xlink:xref.
  • MPD used to indicate local media rendering.
  • Figure 5-b illustrates a data structure of an AMPD described using XML data rules.
  • the aggregated media presentation description can be implemented by other methods.
  • This method uses an existing media presentation description, in which multiple media content items are sequentially aggregated by hooking (in time) between content paragraphs, and (a) media content item refers to one content in (one) media presentation. paragraph. Note that the sources of these media content items can be different, and are the content paragraphs presented by different media.
  • the "hook" mechanism uses a descriptor to describe the temporal relationship between the content segment that is hooked (aggregated) and the current content paragraph (the content paragraph to which the descriptor belongs).
  • the mechanism has a method identifier and a corresponding set of parameters.
  • the client interprets the set of parameters that accompany it based on this method identifier. If the client does not recognize this method identifier, it cannot understand/interpret the parameter set, parameters, order of parameters, values, and so on.
  • the method identifier is "urn:mpeg:dash:mpd-linking:2015”.
  • the parameters of this method are as follows:
  • Direction used to indicate the direction of the link, the time relationship of the linked content paragraph and the current content paragraph.
  • a pre-roll indicates that the content paragraph of the link is inserted before the current content paragraph.
  • a post-roll indicates that the content paragraph of the link is inserted before the current content paragraph.
  • the current content paragraph local means that the content paragraph of the link is used as the paragraph in the current content.
  • Type used to describe the nature of the content being referenced (real-time or non-real-time media rendering).
  • mpdUrl the URL used to indicate the media presentation description of the referenced content.
  • periodId used to indicate the content paragraph of the reference.
  • timeOffset used to indicate the time offset relative to the beginning of the program segment. If the target content If it is non-real-time, it already exists, then the start time of the program paragraph can be 0. If the target content is real-time, such as live content, the program paragraph starts at a certain moment of absolute time.
  • Duration used to indicate the length of time of the linked content item.
  • a plurality of content items each of which is a content paragraph described in a different media presentation description. Some of the content items are non-real time, while others are real-time.
  • a temporally continuous media presentation description is generated.
  • client behavior control is introduced in addition to content aggregation.
  • content item B is the recorded ad, which has a corresponding media presentation description.
  • Content item A is a real-time badminton game that begins at time t0.
  • the content server wants the user to watch the advertisement B before watching the game.
  • the media service description of the media content A is published on the content service. Note that the EssentialProperty descriptor is added to the content paragraph element corresponding to the content item A, and the client must process the descriptor, otherwise it cannot identify the method identifier of the descriptor. The processing of this content item should be abandoned.
  • the descriptor of this descriptor tells the client that this is a link method for a content paragraph.
  • the meaning of the parameter is: a content paragraph is prepended, the content of the preceding content paragraph is non-real-time, and the content segment of the preceding content is a reference.
  • the URL is the content paragraph ad1 in the media presentation description of http://example.com/ad/ad1.mpd, and the pre-content paragraph begins from the start time position of the referenced content paragraph.
  • Figure 5-e illustrates the temporal relationship between content items.
  • the content item B must be viewed before the content item A can be viewed.
  • the real-time content paragraph starts at time t0, the user starts watching the program at time t1, the user first watches the pre-content item B, and starts watching the content item A after the content ends, when the time is already t2, the user does not see the content paragraph
  • the part of t0 to t1 is indicated by a dashed box in the figure.
  • T1 to t2 are the lengths of time of the content item B.
  • An example of this is an example of a live broadcast of a pre-advertisement.
  • the server provides a live content of the project. No matter when the user (client) joins the live broadcast, it will look at a pre-roll and then join the live broadcast.
  • the service process begins when the client sends a request for a live program, and after the server receives the request, Generating an aggregate media description, the aggregated media description uses the current time t0 as a time reference point, and the aggregated media description can indicate that the aggregated media presentation description is dynamically updated by the presence of the @expiry attribute, which will be indicated by @expiry
  • the t1 is expired (expired), and the next version of the aggregated media description is available at time t1-tw1 (a time window is formed at time t1-tw1).
  • the first version of the aggregated media presentation description contains a MediaPresentation element, which describes the start time tp1 of the media presentation as indicated by the attribute @start.
  • the MediaPresentation element contains a pointer to the media rendering description MPD1 of the inserted pre-roll.
  • the second version of the aggregated media presentation description replaces the first version of the aggregated media presentation description
  • the second version of the aggregated media presentation description adds a second MediaPresentation element, which provides a description of the live program. information. It is a live program, and the time tp2 to start access is given by the element's attribute @start, which is also the end time of the first item.
  • the presence of the attribute @offset tells the client that it is not the current time tp2, but joins the live program in a delayed manner according to the time offset - ⁇ t, that is, joins the live program with tp2- ⁇ t.
  • the value of the time offset is non-positive, ie the delay time is greater than or equal to 0, as it is generally not possible to join the live program in advance.
  • the client After the client sends the request, it receives the aggregated media presentation returned by the server, and the client parses the aggregated media presentation.
  • the first item (media presentation) is processed according to MPD1 from time tp1 until tp2.
  • the client requests an updated aggregated media presentation description at time tc1 (t1-tw1 ⁇ tc1 ⁇ t1) according to the instructions of @expriy and @timeAdvance, and acquires MPD2 according to the second MediaPresentation element in the aggregated media presentation description.
  • the first item is still playing, and the first media presentation ends to tp2 to start processing the second media presentation until the end.
  • the red line segment in the figure represents the processing time of the first media presentation
  • the green line segment represents the processing time of the first media presentation.
  • the second media presentation is live content
  • MPD2 may be a dynamic update
  • the customer obtains the updated MPD2
  • the process is carried out by the client according to the information in MPD2
  • MPD2 may have multiple updates, but this process and aggregate media Presenting a description of AMPD has nothing to do with it.
  • MPD1, MPD2, and AMPD may be from different servers, respectively, and the server name or IP address reflected in the URL is different.
  • the aggregated media presentation is aggregated by three different media presentations.
  • the first part of aggregated media rendering (also known as composite media rendering) Is a local media presentation.
  • MediaPresentation is an MPD element that describes a media presentation, including a Period.
  • Period element is reserved under the MPD element and other elements and attributes are omitted.
  • Figure 5-f illustrates an AMPD.
  • the media presentation is of a live broadcast type, and the live media presentation is accessed at 2015-3-2510:00, and the location where the media presentation is added is the current location of the live broadcast media on the absolute time axis.
  • the second part of the aggregated media presentation is also a remote media presentation.
  • This media presentation is of the on-demand type, which is an inserted advertisement that is accessed from the starting location.
  • the third part of the composite media presentation is a remote media presentation, and the @xlink:herf attribute points to the universal resource locator URL of its media presentation description. As can be seen from the URL, its source is different from the media presentation of the first part. Among them, this media presentation is of the live broadcast type, and the content paragraph m1 in this media presentation is cited.
  • the live media presentation Join the live media presentation at 2015-3-25 10:22, but not at the current location (media time) of the media presentation, where the current media content is at the media corresponding to the absolute time 10:22
  • the location of the content, but added 10 minutes before the current location of the live media presentation, that is, the location of the media content corresponding to the absolute time, the live media presentation is delayed by 10 minutes, and the delay time is indicated by @timeOffset.
  • the unit can be seconds.
  • FIG. 5-g Another scenario is exemplified below, as shown in the example of FIG. 5-g.
  • this application scenario this is an example of parallelizing content in time. Multiple media presentations are parallel in time, where they are in one The description file is described. Among them, the media presentations that are aggregated in parallel in time are the same in nature, either live broadcast or on-demand. In fact, it provides a way to implement a client-based navigation.
  • This method is primarily client-based and does not require any processing of individual media presentations during the distribution process.
  • a media presentation is composed of temporally sequential content segments. Multiple media presentations, content paragraphs are arranged independently of each other, are interlaced in time, and such a time structure is not handled by DASH.
  • the media presentation can of course be re-encoded to eliminate the boundaries of the time segments so that multiple media presentations can be included in one content paragraph.
  • the advantage of this is that you only need to introduce a small extension in the DASH specification, the client The processing is simple, but at the cost of processing (re-encoding) the media presentation, adding complexity to a certain extent.
  • the aggregated media presentation description has multiple MediaPresentation elements, each MediaPresentation element corresponds to a media presentation, and the MediaPresentation element may be local, including the MPD and its subordinate elements, or may be non-local, referencing one Remote media presentation description. Each maintains its own content paragraph and time structure without making changes.
  • each Presentation element does not have the @startTime attribute, or each Presentation element has the @startTime attribute, and the value of @startTime is the same.
  • the former means that each media presentation is available when the composite media presentation description is available, the latter indicating that each Presentation is available at the time indicated by @startTime.
  • the client may establish a DASH client instance for each media presentation, perform media segment acquisition of the media presentation, decode and play media data, and the like.
  • @schemeIdUri in the EssentialProperty element specifies the rule referenced by the descriptor, where @value is the parameter of the referenced rule.
  • the referenced rule is identified (identified) by the generic resource name urn:mpeg:dash:srd:2013, which is used to identify the spatial relationship, where the value of @value is required by the rule.
  • Parameters, such as the second, third, represent the coordinates of the upper left corner of the object (here, Presentation), and the fourth and fifth values represent the width and height of the object.
  • an embodiment of the present invention provides a server 600, which may include:
  • a generating unit 610 configured to generate a media presentation description of the first media presentation, where the first media presentation includes a content item, the media presentation description includes a description of the content item, or the media presentation description includes the content
  • the description of the item points to the information, the description of the content item is used to indicate that the content item is from a second media presentation, wherein the first media presentation and the second media presentation are not The same media presentation.
  • the processing unit 620 is configured to store or send the media presentation description.
  • the description of the content item is further used to indicate that the second media presentation is a real-time media presentation or a non-real-time media presentation.
  • the description of the content item is further used to indicate a time position in which the content item is embedded in the first media presentation.
  • the description of the content item is further used to indicate that part or all of the content item is embedded in the first media presentation.
  • the description of the content item when the description of the content item is further used to indicate that a portion of the content item is embedded in the first media presentation, the content The description of the item is also used to indicate the initial play time position and/or the end play time position of the portion of the content item.
  • the description of the content item includes an offset indication fz
  • the offset indication fz is used to indicate a starting play time position and start of the content item.
  • the offset between content time positions is offset.
  • the content item when the second media is presented as a real-time media presentation, and the offset indication fz indicates an offset offset equal to 0, the content item is represented Starting from the content location corresponding to the current time; or when the second media is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the content item will be The content position corresponding to the current time back offset offset starts to play; or when the second medium is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, The content item starts to play back from the content content position of the content item and the content position corresponding to the offset offset.
  • the content when the second media is presented as a non-real time media presentation, and the offset indication fz indicates an offset offset equal to 0, the content is represented The item will start playing from the starting content position of the content item; or when the second medium is presented as a non-real time media presentation, and the offset indication fz indicates that the offset amount offset is not equal to 0, The content item of the content item is offset from the starting content position of the content item by the content bit corresponding to the offset offset Start playing.
  • the description of the content item is included in an aggregation method descriptor of the media presentation description, or the orientation information of the description of the content item is included in the media Present the description in the description of the aggregation method.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1, the first media presentation description element being one of the N media presentation description elements included in the aggregate media presentation description, wherein the content The description of the item included in the first media presentation description element or the pointing information of the description of the content item is included in the first media presentation description element.
  • the aggregate media presentation description further includes a time window indication corresponding to the first media presentation description element, wherein the time window indication is used to indicate that the client is in the The time window indicates, within the indicated time window, the updated content of the aggregated media presentation description is obtained from the server, wherein the updated content includes the first media presentation description element.
  • the content item is a content paragraph or a media expression or an adaptive set.
  • the content item included in the first media presentation may be from the second media presentation, that is, some or all of the content items of the other plurality of media presentations may be re-aggregated.
  • a new media presentation that satisfies a specific programming requirement is formed, and the media presentation description of the new media presentation includes a description of the content items presented by other media that are aggregated, so that the client can perform acquisition and playback of the corresponding content item, and the like.
  • the technical solution in this embodiment facilitates flexible aggregation of media content.
  • an embodiment of the present invention provides a client 700, which may include:
  • the first obtaining unit 710 is configured to acquire a media presentation description of the first media presentation, where the The first media presentation includes a content item, the media presentation description including a description of the content item or the media presentation description including pointing information of the description of the content item, the description of the content item being used to indicate the content
  • the item is from a second media presentation, wherein the first media presentation and the second media are presented as different media presentations;
  • a second obtaining unit 720 configured to acquire the content item according to the description of the content item
  • the playing unit 730 is configured to play the content item.
  • the description of the content item is further used to indicate that the second media presentation is a real-time media presentation or a non-real-time media presentation.
  • the description of the content item is further used to indicate a time position in which the content item is embedded in the first media presentation.
  • the description of the content item is further used to indicate that part or all of the content item is embedded in the first media presentation.
  • the description of the content item when the description of the content item is further used to indicate that a portion of the content item is embedded in the first media presentation, the content The description of the item is also used to indicate the initial play time position and/or the end play time position of the portion of the content item.
  • the description of the content item includes an offset indication fz
  • the offset indication fz is used to indicate a starting play time position and start of the content item.
  • the offset between content time positions is offset.
  • the content item when the second media is presented as a real-time media presentation, and the offset indication fz indicates an offset offset equal to 0, the content item is represented Starting from the content location corresponding to the current time; or when the second media is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the content item will be The content position corresponding to the current time back offset offset starts to play; or when the second medium is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, The content item starts to play back from the content content position of the content item and the content position corresponding to the offset offset.
  • the offset indication fz indicates that the offset offset is equal to 0, indicating that the content item will start playing from the starting content position of the content item; or when the second medium is presented as When the non-real time media is presented, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the content item is offset backward from the starting content position of the content item by the offset The content position corresponding to offset starts playing.
  • the description of the content item is included in an aggregation method descriptor of the media presentation description, or the orientation information of the description of the content item is included in the media Present the description in the description of the aggregation method.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1, the first media presentation description element being one of the N media presentation description elements included in the aggregate media presentation description, wherein the content The description of the item included in the first media presentation description element or the pointing information of the description of the content item is included in the first media presentation description element.
  • the aggregate media presentation description further includes a time window indication corresponding to the first media presentation description element, wherein the time window indication is used to indicate that the client is in the The time window indicates that the updated content of the aggregated media presentation description is obtained from the server within the indicated time window, wherein the updated content may include the first media presentation description element.
  • the content item is a content paragraph or a media expression or an adaptive set.
  • the content item included in the first media presentation may be from the second media presentation, that is, some or all of the content items of the other plurality of media presentations may be re-aggregated.
  • the technical solution in this embodiment facilitates flexible aggregation of media content.
  • FIG. 8 is a structural block diagram of a server 800 according to another embodiment of the present invention.
  • the server 800 may include: at least one processor 801, a memory 805, and at least one communication bus 802. Among them, the communication bus 802 is used to implement connection communication between these components.
  • the server 800 may optionally include at least one network interface 804 and/or a user interface 803, and the user interface 803 may include a display (eg, a touch screen, an LCD, a Holographic, a CRT, or a Projector). Click on a device (such as a mouse or trackball touchpad or touch screen, etc.), a camera and/or a pickup device, and the like.
  • a display eg, a touch screen, an LCD, a Holographic, a CRT, or a Projector.
  • Click on a device such as a mouse or trackball touchpad or touch screen, etc.
  • a camera and/or a pickup device and the like.
  • the memory 805 can include a read only memory and a random access memory and provides instructions and data to the processor 801. A portion of the memory 805 may also include a non-volatile random access memory.
  • memory 805 stores elements, executable modules or data structures, or a subset thereof, or their extension set:
  • the operating system 8051 includes various system programs for implementing various basic services and processing hardware-based tasks.
  • the application module 8052 includes various applications for implementing various application services.
  • the processor 801 generates a media presentation description of the first media presentation by invoking a program or instruction stored in the memory 805, wherein the first media presentation includes a content item, the media presentation description including the The description of the content item or the media presentation description includes pointing information of the description of the content item, the description of the content item is used to indicate that the content item is from a second media presentation, wherein the first media presentation And presenting the second media as a different media presentation; storing or transmitting the media presentation description.
  • the description of the content item is further used to indicate that the second media presentation is a real-time media presentation or a non-real-time media presentation.
  • the description of the content item is further used to indicate a time position in which the content item is embedded in the first media presentation.
  • the description of the content item is further used to indicate that part or all of the content item is embedded in the first media presentation.
  • the description of the content item when the description of the content item is further used to indicate that a portion of the content item is embedded in the first media presentation, the content The description of the item is also used to indicate the initial play time position and/or the end play time position of the portion of the content item.
  • the description of the content item includes an offset indication fz
  • the offset indication fz is used to indicate a starting play time position and start of the content item.
  • the offset between content time positions is offset.
  • the content item when the second media is presented as a real-time media presentation, and the offset indication fz indicates an offset offset equal to 0, the content item is represented Starting from the content location corresponding to the current time; or when the second media is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the content item will be The content position corresponding to the current time back offset offset starts to play; or when the second medium is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, The content item starts to play back from the content content position of the content item and the content position corresponding to the offset offset.
  • the content when the second media is presented as a non-real time media presentation, and the offset indication fz indicates an offset offset equal to 0, the content is represented The item will start playing from the starting content position of the content item; or when the second medium is presented as a non-real time media presentation, and the offset indication fz indicates that the offset amount offset is not equal to 0, The content item starts to play back from the content content position of the content item and the content position corresponding to the offset offset.
  • the description of the content item is included in an aggregation method descriptor of the media presentation description, or the orientation information of the description of the content item is included in the media Present the description in the description of the aggregation method.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1
  • the first media presentation description element is one of the N media presentation description elements included in the aggregate media presentation description
  • the body presentation descriptor element wherein the description of the content item is included in the first media presentation description element or the pointing information of the description of the content item is included in the first media presentation description element.
  • the aggregate media presentation description further includes a time window indication corresponding to the first media presentation description element, wherein the time window indication is used to indicate that the client is in the The time window indicates, within the indicated time window, the updated content of the aggregated media presentation description is obtained from the server, wherein the updated content includes the first media presentation description element.
  • the content item is a content paragraph or a media expression or an adaptive set.
  • the content item included in the first media presentation may be from the second media presentation, that is, some or all of the content items of the other plurality of media presentations may be re-aggregated.
  • a new media presentation that satisfies a specific programming requirement is formed, and the media presentation description of the new media presentation includes a description of the content items presented by other media that are aggregated, so that the client can perform acquisition and playback of the corresponding content item, and the like.
  • the technical solution in this embodiment facilitates flexible aggregation of media content.
  • FIG. 9 is a structural block diagram of a client 900 according to another embodiment of the present invention.
  • the client 900 can include at least one processor 901, a memory 905, and at least one communication bus 902.
  • the communication bus 902 is used to implement connection communication between these components.
  • the client 900 may optionally include at least one network interface 904 and/or a user interface 903, and the user interface 903 may include a display (eg, a touch screen, an LCD, a holographic image, a CRT, or a Projector). Click on a device (such as a mouse or trackball touchpad or touch screen, etc.), a camera and/or a pickup device, and the like.
  • a display eg, a touch screen, an LCD, a holographic image, a CRT, or a Projector.
  • Click on a device such as a mouse or trackball touchpad or touch screen, etc.
  • a camera and/or a pickup device and the like.
  • the memory 905 can include a read only memory and a random access memory, and provides instructions and data to the processor 901. A portion of the memory 905 may also include a non-volatile random access memory.
  • the memory 905 stores the following elements, executable modules or data. Structures, or their subsets, or their extension set:
  • the operating system 9051 includes various system programs for implementing various basic services and processing hardware-based tasks.
  • the application module 9052 includes various applications for implementing various application services.
  • the processor 901 acquires a media presentation description of the first media presentation by calling a program or instruction stored in the memory 905, wherein the first media presentation includes a content item, and the media presentation description includes the The description of the content item or the media presentation description includes pointing information of the description of the content item, the description of the content item is used to indicate that the content item is from a second media presentation, wherein the first media presentation And presenting, by the second media, a different media presentation; acquiring the content item according to the description of the content item; playing the content item.
  • the description of the content item is further used to indicate that the second media presentation is a real-time media presentation or a non-real-time media presentation.
  • the description of the content item is further used to indicate a time position in which the content item is embedded in the first media presentation.
  • the description of the content item is further used to indicate that part or all of the content item is embedded in the first media presentation.
  • the description of the content item when the description of the content item is further used to indicate that a portion of the content item is embedded in the first media presentation, the content The description of the item is also used to indicate the initial play time position and/or the end play time position of the portion of the content item.
  • the description of the content item includes an offset indication fz
  • the offset indication fz is used to indicate a starting play time position and start of the content item.
  • the offset between content time positions is offset.
  • the content item when the second media is presented as a real-time media presentation, and the offset indication fz indicates an offset offset equal to 0, the content item is represented Starting from the content location corresponding to the current time; or when the second media is presented as a real-time media presentation, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the content item will be The content position corresponding to the current time back offset offset starts playing; or when the second media is presented When the real-time media is present, and the offset indication fz indicates that the offset offset is not equal to 0, indicating that the content item will be offset backward from the starting content position of the content item by the offset The content position corresponding to offset starts playing.
  • the content when the second media is presented as a non-real time media presentation, and the offset indication fz indicates an offset offset equal to 0, the content is represented The item will start playing from the starting content position of the content item; or when the second medium is presented as a non-real time media presentation, and the offset indication fz indicates that the offset amount offset is not equal to 0, The content item starts to play back from the content content position of the content item and the content position corresponding to the offset offset.
  • the description of the content item is included in an aggregation method descriptor of the media presentation description, or the orientation information of the description of the content item is included in the media Present the description in the description of the aggregation method.
  • the first media presentation is an aggregate media presentation
  • the media presentation description is an aggregate media presentation description
  • the aggregate media presentation description includes N media presentation description elements.
  • the N is an integer greater than 1 or equal to 1, the first media presentation description element being one of the N media presentation description elements included in the aggregate media presentation description, wherein the content The description of the item included in the first media presentation description element or the pointing information of the description of the content item is included in the first media presentation description element.
  • the aggregate media presentation description further includes a time window indication corresponding to the first media presentation description element, wherein the time window indication is used to indicate that the client is in the The time window indicates, within the indicated time window, the updated content of the aggregated media presentation description is obtained from the server, wherein the updated content includes the first media presentation description element.
  • the content item is a content paragraph or a media expression or an adaptive set.
  • the content item included in the first media presentation may be from the second media presentation, that is, some or all of the content items of the other plurality of media presentations may be re-aggregated.
  • a new media presentation that satisfies a specific programming requirement is formed, and the media presentation description of the new media presentation includes a description of the content items presented by other media that are aggregated, so that the client can perform acquisition and playback of the corresponding content item, and the like.
  • the technical solution in this embodiment facilitates flexible aggregation of media content.
  • the embodiment of the present invention provides a communication system, which includes any client provided by the embodiment of the present invention and any server provided by the embodiment of the present invention.
  • the embodiment of the present invention further provides a computer storage medium, wherein the computer storage medium can store a program, and the program includes some or all of the steps of any one of the content item aggregation methods described in the foregoing method embodiments.
  • the disclosed apparatus may be implemented in other ways.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be electrical or otherwise.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. You can choose some or all of them according to actual needs.
  • the unit is to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • the technical solution of the present invention which is essential or contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product stored in a storage medium.
  • a number of instructions are included to cause a computer device (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, and the like. .

Abstract

内容项聚合方法和相关装置及通信系统。一种内容项聚合方法包括服务端生成第一媒体呈现的媒体呈现描述,第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,第一媒体呈现不同于第二媒体呈现;存储或发送所述媒体呈现描述。本发明实施例提供技术方案有利于实现媒体内容的灵活聚合。

Description

内容项聚合方法和相关装置及通信系统 技术领域
本发明涉及网络通信技术领域,具体涉及内容项聚合方法和相关的装置及通信系统。
背景技术
基于超文本传输协议(HTTP,Hyper Text Transfer Protocol)媒体流的多媒体业务正日益发展,甚至挑战了传统的广播电视的地位。基于HTTP的媒体流服务还不支持媒体内容的聚合(例如跨频道连播业务等),这不能不说是一个较大缺憾。
发明内容
本发明实施例提供内容项聚合方法和相关装置及通信系统,以期能够实现媒体内容的灵活聚合。
本发明实施例提供一种内容项聚合方法,包括:
服务端生成第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,其中,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;
存储或发送所述媒体呈现描述。
本发明实施例提供还一种内容项聚合方法,包括:
客户端获取第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,其中,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;
所述客户端根据所述内容项的描述获取所述内容项;所述客户端播放所述内容项。
本发明实施例还提供一种服务端,可包括:
生成单元,用于生成第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;
处理单元,用于存储或发送所述媒体呈现描述。
本发明实施例还提供一种客户端,可包括:
第一获取单元,用于获取第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;
第二获取单元,用于根据所述内容项的描述获取所述内容项;
播放单元,用于播放所述内容项。
本发明实施例还提供一种服务端,可包括:处理器和存储器。客户端还可包括网络接口。
其中,所述存储器用于存储指令,所述处理器用于执行所述指令,所述网络接口用于在所述处理器的控制下与其他设备进行通信。
例如处理器,用于生成第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;存储或发送所述媒体呈现描述。
本发明实施例还提供一种客户端,可包括:处理器和存储器。客户端还可包括网络接口。
其中,所述存储器用于存储指令,所述处理器用于执行所述指令,所述网络接口用于在所述处理器的控制下与其他设备进行通信。
例如处理器,用于获取第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;根据所述内容项的描述获取所述内容项;播放所述内容项。
在一些可能实施方式中,所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
在一些可能实施方式中,在第四方面的第二种可能的实施方式中,所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
在一些可能实施方式中,所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
在一些可能实施方式中,当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
在一些可能实施方式中,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
在一些可能实施方式中,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
在一些可能实施方式中,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项 的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
在一些可能实施方式中,所述内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
在一些可能实施方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
在一些可能实施方式中,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,所述更新内容包括所述第一媒体呈现描述元素。
在一些可能实施方式中,所述内容项为内容段落或媒体表达或自适应集。
本发明还提供一种通信系统,包括本发明实施例提供的任意一种客户端和本发明实施例提供的任意一种服务端。
此外,本发明实施例还提供了一种计算机可读存储介质,所述计算机可读存储介质存储了服务端所执行的用于内容项聚合的程序代码。所述程序代码包括用于执行服务端所执行方法的指令。
此外,本发明实施例还提供了一种计算机可读存储介质,所述计算机可读存储介质存储了客户端所执行的用于内容项聚合的程序代码。所述程序代码包括用于执行客户端所执行方法的指令。
可以看出,在本实施例的技术方案中,媒体呈现包括的某内容项可以来自不同于上述媒体呈现的其它媒体呈现,也就是说,可以将其它若干个媒体呈现的部分或者全部内容项进行重新聚合安排以形成满足特定编排需要的新媒体呈现,并且新媒体呈现的媒体呈现描述包括了聚合来的其它媒体呈现的内容项的描述,使得客户端可据此进行相应内容项的获取和播放等。总得来说,在本 实施例的技术方案有利于实现媒体内容的灵活聚合。
附图说明
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1是本发明实施例提供的一种DASH的结构的示意图;
图2是本发明实施例提供的一种内容项聚合方法的流程示意图;
图3是本发明实施例提供的另一种内容项聚合方法的流程示意图;
图4-a是本发明实施例提供的一种网络架构的示意图;
图4-b是本发明实施例提供的另一种内容项聚合方法的流程示意图;
图4-c是本发明实施例提供的聚合不同媒体呈现的内容项的示意图;
图5-a是本发明实施例提供的聚合内容项的一种时间安排的示意图;
图5-b是本发明实施例提供一种采用XML数据规则描述的AMPD的数据结构的示意图;
图5-c是本发明实施例提供另一种采用XML数据规则描述的MPD的数据结构的示意图;
图5-d是本发明实施例提供另一种采用XML数据规则描述的MPD的数据结构的示意图;
图5-e是本发明实施例提供另一种内容项之间的时间关系的示意图;
图5-f是本发明实施例提供另一种采用XML数据规则描述的AMPD的数据结构的示意图;
图5-g是本发明实施例提供另一种采用XML数据规则描述的AMPD的数据结构的示意图;
图6是本发明实施例提供的一种服务端的示意图;
图7是本发明实施例提供的一种客户端的示意图;
图8是本发明实施例提供的另一种客户端的示意图;
图9是本发明实施例提供的另一种客户端的示意图。
具体实施方式
本发明实施例提供内容项聚合方法和相关装置及通信系统,以期能够实现媒体内容的灵活聚合。
为了使本技术领域的人员更好地理解本发明方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分的实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围。
以下分别进行详细说明。
本发明说明书和权利要求书及附图中的术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包括,例如,可包括了一系列步骤或单元的过程、方法、系统、产品或设备未限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。术语“第一”、“第二”和“第三”等是用于区别不同对象,而不是用于描述特定顺序。
广播是传统的媒体内容传输方式,广播电台和电视台都是通过无线广播实现音视频的传输,有线电视则是以有线电缆了承载广播信号。然而,随着技术的发展,特别是宽带技术和微处理器技术,前者提高了通信服务的水平,后者增强了个人设备的能力,目前,通过互联网的在线媒体流服务传送多媒体越来越普遍。比之传统的媒体广播服务,在线媒体流服务更好地满足了人们对媒体内容的不同需求,用户可在需要时候对获取的媒体内容作出点播(on demand)选择,这就改变了用户单向和被动的接收方式。
基于HTTP的自适应流(DASH,Dynamic Adaptative Streaming Over HTTP)服务是多媒体流服务的一种主流技术,代表了这一领域的一个最新发展。例如微软(Microsoft)公司的平滑流服务(SS,Smooth Streaming)、又如动态图像专家组(MPEG,Moving Picture Experts Group)的基于HTTP的动态自适应媒体流(DASH,Dynamic Adaptative Streaming Over HTTP)、苹果公司的HTTP服务(HLS,HTTP Live Streaming)都是这一技术的不同形式。MPEG的DASH 标准是由MPEG制订的标准化技术,有望得到广泛的采用,从而改变割裂的市场格局。
现在的DASH规范定义了媒体片段和媒体呈现描述的格式,媒体呈现描述也可称媒体呈现描述文件的格式。媒体片段是媒体呈现的封装形式,用于媒体表达的存储和访问,媒体呈现描述用于描述一个媒体呈现。所谓的媒体呈现是指时间上顺序的一段媒体内容。一个媒体呈现可相当于一个电视节目或者一个电视节目频道。其中,和多个节目频道电视服务相比,DASH只能描述一个媒体呈现,不能同时描述多个并行的媒体呈现供用户选择,如像电视服务中的节目频道导览那样同时呈现多个电视频道。不同媒体呈现的时间安排不同,相互交错,这样的时间结构无法在DASH描述。因此常规DASH不能方便地实现时间并行的内容聚合。
对于时间顺序的内容聚合,DASH也是不足的。DASH要生成一个新的媒体呈现描述,就有n+1个媒体呈现描述。另外,如果在进行内容聚合还有其他的表现形式:把不同的媒体呈现重新在时间上顺序安排,形成一个新的媒体呈现——在一个电视节目频道中,不同的节目是按照时间顺序安排的,频道的提供者要把节目内容拼接起来。DASH虽能够描述时间上顺序的媒体内容,如果媒体内容的来源不同,在进行内容聚合的过程中,需要对各个媒体呈现描述进行处理,生成一个对经过内容聚合的合并的媒体呈现的描述文件,合并的过程中需要对各个媒体呈现的时间进行处理,采用一致的时间描述,这个过程容易发生错误。
DASH的基本概念是媒体呈现,一个媒体呈现可以包括一个或者多个内容段落(Period)组成。其中,一个内容段落包括一项媒体内容,它在时间上是连续的,媒体的内容的各个方面是一致的,如:编码、语言和内容保护。媒体内容以媒体编码表达的形式存在,编码表达按照属性,例如媒体分量划分为适配集。适配集内的媒体编码表达是同一媒体内容的相同的媒体分量的不同编码版本,是可以相互替代的。内容段落在时间上是顺序的,通过内容段落可以把不同的媒体内容在时间上拼接起来。比如:前一个内容段落是新闻节目,下一个内容段落是广告。一个内容段落的开始意味着该内容段落相比前一个内容段 落的某些方面的变化,例如:内容从新闻节目到体育节目;视频编码从H.264转变为H.265;增加了作为一种媒体分量的字幕;增加英文伴音等等。当客户端遇到一个新的内容段落的开始,那么客户端要进行重新配置——媒体分量的选择,自适应的范围(媒体的编码表达的码率),解码器的初始化等等。内容段落在时间上是顺序的,一个内容段落结束,下一个内容段落开始,两个段落之间在时间上是不重合的。这样DASH就没有办法描述多个在时间上并行的媒体呈现。
另外,在现有的DASH中,有对于空间对象描述的支持,这是为了适应终端设备不同的显示能力或者进行显示的缩放。但是由于DASH的限制,空间对象是同一媒体内容的不同空间部分,这样这一能力无法用于实现不同的媒体呈现在空间的聚合。特别的,现有DASH仅适用于一个媒体呈现,它无法对时间上并行的多个媒体呈现进行描述。
在DASH中,一项媒体内容编码为多个版本,各个版本有不同特性,例如码率,这些版本在DASH中称为媒体表达,它们代表相同的媒体内容,从内容呈现(观看/播放)的角度彼此具有替代性。其中,一个媒体表达在时间上分割为可访问的单位,通常长度为若干秒,称为媒体片段或者媒体子片段(一个媒体片段可以在逻辑上划分为媒体子片段)。另外还有一个初始化片段,初始化片段只包括有元数据而没有媒体编码数据。下文中,媒体片段,初始化片段都称为片段。其中,媒体表达存储在内容服务器(例如HTTP服务器)上供客户端获取,而片段是客户端能够通过URL访问的最小单位。
其中,媒体呈现描述(MPD,Media Presentation Description)是一个扩展标记语言(XML,extensible Markup Language)文件,MPD包括了客户端所需要的元数据,描述了媒体表达的特性以及如何从服务器上获取媒体表达,包括媒体表达的码率、分辨率、视频图像的长宽比,媒体表达包括的片段的统一资源定位符(URL,Universal Resource Locator)等。基于MPD中的信息,客户端构造HTTP URL以从内容服务器请求媒体表达中的媒体片段,在媒体片段边界可以切换到其他的媒体表达以适应可用带宽的变化。
图1举例示意了一种DASH结构。基于HTTP的自适应流媒体服务允许一个 媒体呈现中内容特性的变化,例如媒体编码方式的改变。在DASH中,这是通过所谓的“内容段落(Period)”这一概念来实现的,它用于内容拼接,比如前一个内容段落是新闻节目,下一个内容段落是广告。
基于HTTP的自适应媒体流服务允许一个媒体呈现中内容特性变化,例如媒体编码方式的改变。在DASH标准中,Period用于内容的拼接,比如前一个内容段落是新闻节目,下一个内容段落是广告。一个媒体呈现包括一个或者多个内容段落(Period),这些内容段落在时间上是顺序的,一个内容段落的开始意味着相比前一个内容段落有某些变化,例如内容的变化,例如可从新闻节目到体育节目,从体育节目到电影节目、从电影节目到广告、从广告到综艺节目等等;内容的编码方式的变化,例如可从H.264编码方案转变为H.265编码方案;媒体表达数量的变化,例如,可以增加或者减少媒体表达;内容分量的变化,例如可增加中文的音频表达等等。其中,当客户端遇到一个新的内容段落的开始,客户端工作条件发生了变化,可能要重新初始化。
在一个内容段落中,包括相同媒体内容和媒体分量的媒体表达的集合称为适配集,一个适配集至少包括一个媒体表达,一个适配集中的媒体表达具有相互替代性。不同的适配集之间可能是相容或者相斥的。
总结以上所述,媒体呈现可包括一个或多个时间上顺序的内容段落,每个内容段落包括一个或者多个适配集(Adaptation Set)。其中,每个适配集包括一个或者多个媒体表达(Representation)。其中一个媒体表达包括一个或者多个片段(Segment)。
媒体呈现描述可具有和媒体呈现相似的层次化结构。以上介绍的媒体呈现的概念在媒体呈现描述中可用一个XML元素表示,媒体呈现元素包括一个或多个内容段落(Period)元素,每个内容段落(Period)元素包括一个或多个适配集(AdaptationSet)元素。每个适配集(AdaptationSet)元素包括一个或多个媒体表达(Representation)元素。
媒体呈现对应于媒体呈现描述中的媒体呈现描述元素,媒体呈现中的一个内容段落对应于媒体呈现描述中的一个内容段落元素,媒体呈现中的一个适配集对应于媒体呈现描述中的一个适配集元素,媒体呈现中的一个媒体表达对应 于媒体呈现描述中的一个媒体表达元素,以此类推。
上述简单介绍了DASH的一些基本概念,下面具体介绍本发明实施例的具体实现方案。
本发明实施例提供一种内容项聚合方法,可包括:服务端生成第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,其中,所述媒体呈现描述包括所述内容项的描述,或者所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;存储或发送所述媒体呈现描述。
请参见图2,图2为本发明的一个实施例提供的一种内容项聚合方法的流程示意图,其中,如图2举例所示,本发明的一个实施例提供的一种内容项聚合方法可包括:
201、服务端生成第一媒体呈现的媒体呈现描述。
其中,所述第一媒体呈现包括内容项(为便于引述,该内容项下面可以称之为第一内容项)。所述媒体呈现描述包括所述第一内容项的描述或所述媒体呈现描述包括所述第一内容项的描述的指向信息。所述第一内容项的描述用于指示出所述第一内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现。
其中,第一内容项可为第一媒体呈现包括的N个内容项中的其中一个,所述N为大于1或等于1的整数。例如,第一媒体呈现还包括第二内容项(第一内容项和第二内容项是不同的内容项),所述媒体呈现描述包括所述第二内容项的描述或所述媒体呈现描述包括所述第二内容项的描述的指向信息,其中,所述第二内容项的描述用于指示出所述第二内容项来自第二媒体呈现或媒体呈现X。
例如所述N可等于1、2、3、4、5、6、8、10、15、19、21、30、500或其他值。
可以理解的是,所述第一内容项的描述的指向信息用于指向所述第一内容项的描述,例如所述第一内容项的描述的指向信息可包括所述第一内容项的描 述的指针或URL等。其中,利用所述第一内容项的描述的指向信息,可以获取到所述第一内容项的描述。
可选的,在本发明的一些可能的实施方式中,所述第一内容项例如可为内容段落(Period)或媒体表达(Representation)或自适应集(AdaptationSet)或其它形式的媒体内容。
可选的,在本发明一些可能的实施方式中,服务端在接收到来自客户端的节目播放请求之后生成第一媒体呈现的媒体呈现描述,当然服务端也可能的其它条件的触发下生成第一媒体呈现的媒体呈现描述。
202、服务端存储或发送所述第一媒体呈现的媒体呈现描述。
其中,服务端例如可向客户端发送所述媒体呈现描述,所述客户端进而可根据所述第一内容项的描述获取所述第一内容项;所述客户端还可进一步播放所述第一内容项。
可以看出,在本实施例的技术方案中,所述第一媒体呈现包括的第一内容项可以来自第二媒体呈现,也就是说,可以将其它媒体呈现的内容项进行重新聚合安排以形成满足特定编排需要的新媒体呈现,并且,新媒体呈现的媒体呈现描述包括了聚合来的其它媒体呈现的内容项的描述,使得客户端可据此进行相应内容项的获取和播放等。总得来说,在本实施例的技术方案有利于实现媒体内容的灵活聚合。
可选的,在本发明的一些可能的实施方式中,所述第一内容项的描述还包括用于指示出所述第一内容项的播放开始时间的时间指示Sd。例如时间指示Sd可为@Start属性或@Start元素。
可选的,在本发明一些可能的实施方式中,所述第二内容项的描述还包括用于指示出所述第二内容项的播放开始时间的时间指示Se,所述时间指示Se所指示出的所述第二内容项的播放开始时间等于为所述第一内容项的播放结束时间,或者所述时间指示Se所指示出的所述第二内容项的播放开始时间晚于为所述第一内容项的播放结束时间,且所述第二内容项的播放开始时间于所述第一内容项的播放结束时刻之间的时间差Δt小于阈值。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述还用于 指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
其中,实时媒体呈现例如指的直播媒体呈现,例如直播的体育比赛或直播的综艺节目等。而非实时媒体呈现表示这个媒体呈现事先已经通过录制或其它方式使之存在了。非实时媒体呈现例如可为事先已录制的电视剧、电影、体育比赛或综艺节目等。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述还用于指示出所述第一内容项在所述第一媒体呈现中嵌入的时间位置。所述第一内容项在所述第一媒体呈现中嵌入的时间位置,也就是所述第一内容项被安排在了所述第一媒体呈现的那个时间位置。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述还用于指示出所述第一内容项的部分或全部被嵌入到所述第一媒体呈现中。即所述第一内容项的描述还可用于指示出所述第一内容项的全部被嵌入到所述第一媒体呈现,所述第一内容项的描述也可用于指示出所述第一内容项的部分被嵌入到第一媒体呈现,所述第一内容项的“部分”,可以从时间、内容等不同维度来看,例如假设第一内容项为AdaptationSet,那么若所述第一内容项的描述指示出所述第一内容项的部分被嵌入到所述第一媒体呈现中,可表示该AdaptationSet的部分版本和/或部分截取的媒体表达被嵌入到所述第一媒体呈现,例如AdaptationSet包括5个版本的时长均为15分钟的媒体表达,例如在一种情况下,所述第一内容项的描述可指示出这5个版本的媒体表达中的其中2个版本的时长均为15分钟的媒体表达被嵌入到所述第一媒体呈现中,例如在另一种情况下,所述第一内容项的描述可指示出这5个版本的媒体表达中的其中3个版本的时长均为12分钟的媒体表达(即从15分钟的媒体表达中截取了其中的12分钟的媒体表达)被嵌入到所述第一媒体呈现中,例如在又一种情况下,所述第一内容项的描述可指示出这5个版本的媒体表达中的其中5个版本的时长均为12分钟的媒体表达(即从15分钟的媒体表达中截取了其中的12分钟的媒体表达)被嵌入到所述第一媒体呈现中。
可选的,在本发明的一些可能实施方式中,当所述第一内容项的描述还用于指示出所述第一内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述 第一内容项的描述还可用于指示出所述第一内容项的所述部分的起始播放时间位置和/或结束播放时间位置。例如,所述第一内容项的描述还可用于指示出所述第一内容项的所述部分的起始播放时间位置为所述第一内容项的起始内容时间位置,或者所述第一内容项的起始内容时间位置偏移五分钟后的内容位置等。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述第一内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
可选的,在本发明的一些可能实施方式中,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述第一内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述第一内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述第一内容项将从所述第一内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能实施方式中,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述第一内容项将从所述第一内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述第一内容项的将从所述第一内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述第一内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
可选的,在本发明的一些可能的实施方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或者等于1的整数,第一媒体呈现描述 元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述第一内容项的描述包括在所述第一媒体呈现描述元素中或者所述第一内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
其中,所述第一媒体呈现可以为聚合媒体呈现或普通媒体呈现。所述媒体呈现描述可为聚合媒体呈现描述或普通媒体呈现描述。
可选的,在本发明的一些可能的实施方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述第一内容项的描述包括在所述第一媒体呈现描述元素中或者所述第一内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
可选的,在本发明的一些可能实施方式中,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示(时间窗口指示例如可包括属性@expriy和属性@timeAdvance),其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,其中,所述更新内容包括所述第一媒体呈现描述元素。由于引入了时间窗口指示来限制客户端更新聚合媒体呈现描述的时段,这样有利于更好的控制客户端的内容播放。
本发明实施例提供一种内容项聚合方法可包括:客户端获取第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括第一内容项,其中,所述媒体呈现描述包括所述第一内容项的描述或所述媒体呈现描述包括所述第一内容项的描述的指向信息,所述第一内容项的描述用于指示出所述第一内容项来自第二媒体呈现;所述客户端根据所述第一内容项的描述获取所述第一内容项;所述客户端播放所述第一内容项。
请参见图3,图3为本发明的一个实施例提供的一种内容项聚合方法的流程示意图,其中,如图3举例所示,本发明的一个实施例提供的一种内容项聚合 方法可包括:
301、客户端获取第一媒体呈现的媒体呈现描述。
其中,所述第一媒体呈现包括内容项(为便于引述,该内容项下面可以称之为第一内容项),所述媒体呈现描述包括所述第一内容项的描述或所述媒体呈现描述包括所述第一内容项的描述的指向信息,所述第一内容项的描述用于指示出所述第一内容项来自第二媒体呈现。其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现。
其中,第一内容项为第一媒体呈现包括的N个内容项中的其中一个,所述N为大于1或等于1的整数。例如,第一媒体呈现还包括第二内容项(第二内容项不同于第一内容项),所述媒体呈现描述包括所述第二内容项的描述或所述媒体呈现描述包括所述第二内容项的描述的指向信息,其中,所述第二内容项的描述用于指示出所述第二内容项来自第二媒体呈现或媒体呈现X。
可以理解的是,所述第一内容项的描述的指向信息用于指向所述第一内容项的描述,例如所述第一内容项的描述的指向信息可包括所述第一内容项的描述的指针或URL等。其中,利用所述第一内容项的描述的指向信息,可以获取到所述第一内容项的描述。
可选的,在本发明的一些可能的实施方式中,内容项(例如第一内容项或第二内容项例如可为内容段落(Period)或媒体表达(Representation)或自适应集(AdaptationSet)或其它形式的媒体内容。
例如所述N可等于1、2、3、4、5、6、8、10、15、19、21、30、500或其他值。
可选的,在本发明一些可能的实施方式中,服务端在接收到来自客户端的节目播放请求之后生成第一媒体呈现的媒体呈现描述,当然服务端也可能的其它条件的触发下生成第一媒体呈现的媒体呈现描述。
302、所述客户端根据所述第一内容项的描述获取所述第一内容项。
303、所述客户端播放所述第一内容项。
可以看出,在本实施例的技术方案中,所述第一媒体呈现包括的第一内容项可以来自第二媒体呈现,也就是说,可以将其它媒体呈现的内容项进行重新 聚合安排以形成满足特定编排需要的新媒体呈现,并且,新媒体呈现的媒体呈现描述包括了聚合来的其它媒体呈现的内容项的描述,使得客户端可据此进行相应内容项的获取和播放等。总得来说,在本实施例的技术方案有利于实现媒体内容的灵活聚合。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
其中,实时媒体呈现例如指的直播媒体呈现,例如直播的体育比赛或直播的综艺节目等。而非实时媒体呈现表示这个媒体呈现事先已经通过录制或其它方式使之存在了。非实时媒体呈现例如可为事先已录制的电视剧、电影、体育比赛或综艺节目等。
可选的,在本发明一些可能的实施方式中,所述第二内容项的描述还包括用于指示出所述第二内容项的播放开始时间的时间指示Se,所述时间指示Se所指示出的所述第二内容项的播放开始时间等于为所述第一内容项的播放结束时间,或者所述时间指示Se所指示出的所述第二内容项的播放开始时间晚于为所述第一内容项的播放结束时间,且所述第二内容项的播放开始时间于所述第一内容项的播放结束时刻之间的时间差Δt小于阈值。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述还用于指示出所述第一内容项在所述第一媒体呈现中嵌入的时间位置。所述第一内容项在所述第一媒体呈现中嵌入的时间位置,也就是所述第一内容项被安排在了所述第一媒体呈现的那个时间位置。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述还用于指示出所述第一内容项的部分或全部被嵌入到所述第一媒体呈现中。即所述第一内容项的描述还可用于指示出所述第一内容项的全部被嵌入到所述第一媒体呈现,所述第一内容项的描述也可用于指示出所述第一内容项的部分被嵌入到第一媒体呈现,所述第一内容项的“部分”,可以从时间、内容等不同维度来看,例如假设第一内容项为AdaptationSet,那么若所述第一内容项的描述指示出所述第一内容项的部分被嵌入到所述第一媒体呈现中,可表示该AdaptationSet的部分版本和/或部分截取的媒体表达被嵌入到所述第一媒体呈 现,例如AdaptationSet包括5个版本的时长均为15分钟的媒体表达,例如在一种情况下,所述第一内容项的描述可指示出这5个版本的媒体表达中的其中2个版本的时长均为15分钟的媒体表达被嵌入到所述第一媒体呈现中,例如在另一种情况下,所述第一内容项的描述可指示出这5个版本的媒体表达中的其中3个版本的时长均为12分钟的媒体表达(即从15分钟的媒体表达中截取了其中的12分钟的媒体表达)被嵌入到所述第一媒体呈现中,例如在又一种情况下,所述第一内容项的描述可指示出这5个版本的媒体表达中的其中5个版本的时长均为12分钟的媒体表达(即从15分钟的媒体表达中截取了其中的12分钟的媒体表达)被嵌入到所述第一媒体呈现中。
可选的,在本发明的一些可能实施方式中,当所述第一内容项的描述还用于指示出所述第一内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述第一内容项的描述还可用于指示出所述第一内容项的所述部分的起始播放时间位置和/或结束播放时间位置。例如,所述第一内容项的描述还可用于指示出所述第一内容项的所述部分的起始播放时间位置为所述第一内容项的起始内容时间位置,或者所述第一内容项的起始内容时间位置偏移五分钟后的内容位置等。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述第一内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
可选的,在本发明的一些可能实施方式中,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述第一内容项将从当前时间对应的内容位置开始播放;或者当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述第一内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述第一内容项将从所述第一内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能实施方式中,当所述第二媒体呈现为非实时 媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述第一内容项将从所述第一内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述第一内容项的将从所述第一内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述第一内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
可选的,在本发明的一些可能的实施方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述第一内容项的描述包括在所述第一媒体呈现描述元素中或者所述第一内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
其中,所述第一媒体呈现可以为聚合媒体呈现或普通媒体呈现。所述媒体呈现描述可为聚合媒体呈现描述或普通媒体呈现描述。
可选的,在本发明的一些可能的实施方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述第一内容项的描述包括在所述第一媒体呈现描述元素中或者所述第一内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
可选的,在本发明的一些可能实施方式中,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示(其中,时间窗口指示例如可包括属性@expriy和属性@timeAdvance,也就是说,属性@expriy和属性@timeAdvance可指示出时间窗口),其中,所述时间窗口指示用于指示客户 端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,所述更新内容包括所述第一媒体呈现描述元素。由于引入了时间窗口指示来限制客户端更新聚合媒体呈现描述的时段,这样有利于更好的控制客户端的内容播放。
为便于更好的理解本发明实施例提供的上述技术方案,下面结合一些具体的应用场景进行举例描述。
请参见图4-a和图4-b,图4-b为本发明的另一个实施例提供的另一种内容项聚合方法的流程示意图,其中,图4-b所示的内容项聚合方法可在如图图4-a所示的网络架构下具体实施。如图4-a举例所示,本发明另一个实施例提供的另一种内容项聚合方法可包括:
401、客户端向服务端发送播放请求;所述服务端接收来自所述客户端的所述播放请求。
402、所述服务端生成第一媒体呈现的媒体呈现描述。
服务端是指运行在网络侧的提供服务的设备,包括但不限于服务器、CDN节点或登陆服务器等等,服务端可能是一个设备,服务端也可能是多个不同的设备,为描述方便,在本发明中它们被视为一个整体。
403、所述服务端向所述客户端发送用于响应所述节目请求的所述媒体呈现描述。
其中,所述第一媒体呈现包括第一内容项,其中,所述媒体呈现描述包括所述第一内容项的描述或所述媒体呈现描述包括所述第一内容项的描述的指向信息,其中,所述第一内容项的描述用于指示出所述第一内容项来自第二媒体呈现。所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现。
404、客户端接收来自所述服务端的第一媒体呈现的媒体呈现描述,所述客户端根据所述第一内容项的描述获取所述第一内容项。
405、所述客户端播放所述第一内容项。
其中,第一内容项可为第一媒体呈现包括的N个内容项中的其中一个,所述N为大于1或等于1的整数。例如,第一媒体呈现还包括第二内容项,所述媒体呈现描述包括所述第二内容项的描述或所述媒体呈现描述包括所述第二内 容项的描述的指向信息,其中,所述第二内容项的描述用于指示出所述第二内容项来自第二媒体呈现或媒体呈现X。
参见图4-c,图4-c举例示出了第一媒体呈现中的各内容项的一种可能的来源方式,其中,部分内容项来自实时媒体呈现,另部分内容项可来自非实时媒体呈现。当然,第一媒体呈现中的各内容项的另一种来源方式可以是所有内容项来自实时媒体呈现。其中,当然第一媒体呈现中的各内容项的另一种来源方式可以是所有内容项来自非实时媒体呈现。
可以理解的是,所述第一内容项的描述的指向信息用于指向所述第一内容项的描述,例如所述第一内容项的描述的指向信息可包括所述第一内容项的描述的指针或URL等。其中,利用所述第一内容项的描述的指向信息,可以获取到所述第一内容项的描述。
可选的,在本发明的一些可能的实施方式中,所述第一内容项例如可为内容段落(Period)或媒体表达(Representation)或自适应集(AdaptationSet)或其它形式的媒体内容。
可以理解,对于第一媒体呈现包括的其它内容项,均可按照类似于所述第一内容项的获取和播放方式进行播放。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
其中,实时媒体呈现例如指的直播媒体呈现,例如直播的体育比赛或直播的综艺节目等。而非实时媒体呈现表示这个媒体呈现事先已经通过录制或其它方式使之存在了。非实时媒体呈现例如可为事先已录制的电视剧、电影、体育比赛或综艺节目等。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述还用于指示出所述第一内容项在所述第一媒体呈现中嵌入的时间位置。所述第一内容项在所述第一媒体呈现中嵌入的时间位置,也就是所述第一内容项被安排在了所述第一媒体呈现的那个时间位置。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述还用于指示出所述第一内容项的部分或全部被嵌入到所述第一媒体呈现中。即所述第 一内容项的描述还可用于指示出所述第一内容项的全部被嵌入到所述第一媒体呈现,所述第一内容项的描述也可用于指示出所述第一内容项的部分被嵌入到第一媒体呈现,所述第一内容项的“部分”,可以从时间、内容等不同维度来看,例如假设第一内容项为AdaptationSet,那么若所述第一内容项的描述指示出所述第一内容项的部分被嵌入到所述第一媒体呈现中,可表示该AdaptationSet的部分版本和/或部分截取的媒体表达被嵌入到所述第一媒体呈现,例如AdaptationSet包括5个版本的时长均为15分钟的媒体表达,例如在一种情况下,所述第一内容项的描述可指示出这5个版本的媒体表达中的其中2个版本的时长均为15分钟的媒体表达被嵌入到所述第一媒体呈现中,例如在另一种情况下,所述第一内容项的描述可指示出这5个版本的媒体表达中的其中3个版本的时长均为12分钟的媒体表达(即从15分钟的媒体表达中截取了其中的12分钟的媒体表达)被嵌入到第一媒体呈现中,例如在又一种情况下,所述第一内容项的描述可指示出这5个版本的媒体表达中的其中5个版本的时长均为12分钟的媒体表达(即从15分钟的媒体表达中截取其中的12分钟的媒体表达)被嵌入到所述第一媒体呈现中。
可选的,在本发明的一些可能实施方式中,当所述第一内容项的描述还用于指示出所述第一内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述第一内容项的描述还可用于指示出所述第一内容项的所述部分的起始播放时间位置和/或结束播放时间位置。例如,所述第一内容项的描述还可用于指示出所述第一内容项的所述部分的起始播放时间位置为所述第一内容项的起始内容时间位置,或者所述第一内容项的起始内容时间位置偏移五分钟后的内容位置等。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述第一内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
可选的,在本发明的一些可能实施方式中,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述第一内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒 体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述第一内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述第一内容项将从所述第一内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能实施方式中,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述第一内容项将从所述第一内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述第一内容项的将从所述第一内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能实施方式中,所述第一内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述第一内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
可选的,在本发明的一些可能的实施方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述第一内容项的描述包括在所述第一媒体呈现描述元素中或者所述第一内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
其中,所述第一媒体呈现可以为聚合媒体呈现或普通媒体呈现。所述媒体呈现描述可为聚合媒体呈现描述或普通媒体呈现描述。
可选的,在本发明的一些可能的实施方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述第一内容项的描述包括在所述第一媒体呈现描述 元素中或者所述第一内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
可选的,在本发明的一些可能实施方式中,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示(其中,时间窗口指示例如可包括属性@expriy和属性@timeAdvance,也就是说,属性@expriy和属性@timeAdvance可指示出时间窗口),其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,所述更新内容包括所述第一媒体呈现描述元素。由于引入了时间窗口指示来限制客户端更新聚合媒体呈现描述的时段,这样有利于更好的控制客户端的内容播放。
可以看出,在本实施例的技术方案中,媒体呈现包括的某内容项可以来自另一媒体呈现,也就是说,可以将其它媒体呈现的内容项进行重新聚合安排以形成满足特定编排需要的新媒体呈现,并且,新媒体呈现的媒体呈现描述包括了聚合来的其它媒体呈现的内容项的描述,使得客户端可据此进行相应内容项的获取和播放等。总得来说,在本实施例的技术方案有利于实现媒体内容的灵活聚合。
下面结合一些更为具体的应用场景进行举例描述。
在一些应用场景中,聚合媒体呈现中有多个媒体呈现单元,其中,一个媒体呈现单元是一个媒体呈现或者一个媒体呈现(以下简称源媒体呈现)中的一个或者多个时间连续的内容项(如内容段落)。媒体呈现单元为各不相同的媒体内容,组成媒体呈现的媒体分量,媒体分量的编码,存储位置,媒体呈现描述等。它们在时间上是并行的或者顺序的。其中,聚合媒体呈现描述是一个元数据文件,描述了聚合媒体呈现中的媒体呈现单元和它们之间的关系。它是对媒体呈现描述(文件)的扩展。
以下举例说明中元素或属性的命名是示意性的,可采用其他名称,重要的是它所表达的含义。
聚合媒体呈现描述的根元素是聚合媒体呈现描述元素(AMPD),它的两个属性@expiry和@timeAdvance是用于聚合媒体呈现描述的更新,通常随着时 间推移,复合媒体呈现描述发生更新以描述聚合媒体呈现的变化,特别是聚合媒体呈现在时间上的延展。@expiry指示聚合媒体呈现的有效期,它是以绝对时间(wall clock time)表示的,在有效期之前,AMPD聚合媒体呈现描述的内容是有效的。其中,@timeAdvance指示在聚合媒体呈现描述的更新的时间提前量,即聚合媒体呈现描述最早的更新时间。这两个属性结合在一起定义了一个时间窗口,即从texp-tadv到texp的时间段,其中texp表示@expiry的取值,tadv是@timeAdvance的取值。
在聚合媒体呈现描述中,引入了语法元素MediaPresentation,媒体呈现单元元素表示一个媒体呈现单元。聚合媒体呈现描述了一组媒体呈现单元和它们之间的时间关系。
在聚合媒体呈现描述中,源媒体呈现可以是本地的,这时MediaPresentation元素包含一个MPD元素,而MPD元素包含至少一个Period元素。如果媒体呈现的引用是远端的,可以通过一个指针指向被引用的媒体呈现描述,如@xlink:href属性。引用可以是全部的或部分的,即指向的媒体呈现中的一个或者连续的多个内容段落,可以用属性@periodId说明被引用的内容段落。
在很多场景下要区别两个时间:媒体内容时间和绝对时间。一项媒体内容在时间上是连续的,有一个时间范围,这个范围内的时间是这向媒体内容的媒体时间,它和绝对时间(wall clock time)无关。通过媒体内容时间可以定位到媒体内容中的(时间)位置。
在播放时,媒体内容时间可以映射到绝对时间上。对于直播,媒体内容的时间位置和绝对时间是固定对应的;但是时间一旦过去,媒体时间和绝对时间就不再有固定的对应关系了。媒体内容可以在时间上移动。用户可以在任何时间在直播媒体内容的当前时间位置或者当前位置之前的时间位置加入,如果媒体内容可以被存储,这时,用户获得的是(在绝对时间轴上)过去时间的媒体内容。不能在直播媒体内容的当前时间位置之后的某一时间位置加入,因为提前获取的未来的媒体内容是不可能的;对于点播,媒体内容已经存在,媒体内容的一个时间位置可以映射到绝对时间上的任何一个时刻,用户可以在任意时间从任何的媒体时间位置访问媒体内容。内容聚合是多项媒体内容在时间上是 移动和组合。
媒体内容在绝对时间上的移动可用两个属性表示:开始时间@startTime表示绝对时间上的一个时刻,即从这个时刻起开始一项媒体内容。偏移量@timeOffset表示媒体内容的时间位置,对于直播,它是相对于媒体内容的当前(绝对时间轴上的现在)时间位置,因为只能访问过去的内容,所以偏移量的取值小于等于0;而对于点播,@startTime是相对于这项媒体内容开始的相对时间位置,偏移量的取值小于等于0。这样客户端在直播和点播时的行为是不一样的。直播时,客户端在@startTime时刻加入直播的媒体内容,访问的媒体内容的时间位置是@startTime+@timeOffset这一时刻的媒体内容;点播时客户端在@startTime时刻加入点播的媒体内容,访问的媒体内容的时间位置是从@timeOffset开始的。
内容聚合本质上是媒体内容在绝对时间(轴)上的移动加上媒体内容的时间位置偏移。图5-a示意了上述关系的一种举例。
以下的举例是一种聚合媒体呈现描述的表达方式,通过层级次化的数据结构来表达,一个元素包含若干属性和低级元素,每一层都是如此,一层层嵌套起来。
聚合媒体呈现描述AMPD的一种表达方式元素和属性的含义可如下:
@expiry,用于指示该聚合媒体呈现的有效期。聚合媒体呈现的描述会在有效期到达之前更新。
@timeAdvance,用于指示聚合媒体呈现描述的更新的时间提前量,即聚合媒体呈现描述最早的时间更新时间,它是相对于@expiry指示的时间,可以是在仅当@expiry属性存在时出现。
Presentation,用于描述一个媒体呈现。
@type,用于指示媒体呈现是直播(实时生成的)还是点播(既有的,非实时的)。
@startTime,用于指示媒体呈现单元的开始时间。如果是顺序合并则该属性会出现。
@timeOffset,用于指示媒体时间的偏移量。
其中,对于直播媒体呈现,它是相对于该媒体呈现单元在@startTime时刻的媒体时间位置的(向前的)时间偏移。对于点播媒体呈现,它是相对于媒体呈现单元开始位置的时间偏移。
@periodId,若指向的MPD中有多个Period,@periodId指出所选择的period。
@xlink:xref,用于指向一个媒体呈现描述。
@xlink:actuate,用于指示对@xlink:xref所指向的媒体呈现描述的处理。
MPD,用于指示本地的媒体呈现。
图5-b举例了一种采用XML数据规则描述的AMPD的数据结构。
聚合媒体呈现描述可以通过其他的方法来实现。这个方法采用现有的媒体呈现描述,多个媒体内容项通过内容段落之间的钩连(在时间上)顺序聚合在一起,(一个)媒体内容项是指(一个)媒体呈现中的一个内容段落。注意这些媒体内容项的来源可以各不相同,是不同的媒体呈现的内容段落。“钩”连机制采用描述子说明所钩连(被聚合)的内容段落和当前内容段落(该描述子所属的内容段落)之间的时间关系。该机制有一个方法识别符以及对应的参数集合。客户端根据这个方法识别符来解释伴随它的参数集。如果客户端不认识这个方法识别符,它就无法理解/解释参数集,参数,参数的顺序,取值等。
下面定义一个内容段落的链接方法.
其中,方法识别符号为”urn:mpeg:dash:mpd-linking:2015”,这个方法的参数如下:
Direction,用于指示链接的方向,所链接的内容段落和当前内容段落的时间关系。其中,前向链接(pre-roll),表示在当前内容段落之前插入链接的内容段落。其中,后向链接(post-roll),表示在当前内容段落之前插入链接的内容段落。其中,当前内容段落(local),表示把链接的内容段落作为在当前内容段落。
type,用于说明所引用的内容性质(实时或非实时媒体呈现)。
mpdUrl,用于指示引用的内容的媒体呈现描述的URL。
periodId,用于指示引用的内容段落。
timeOffset,用于指示相对于节目段落开始的时间偏移量。如果目标内容 是非实时的,已经存在的,那么节目段落的开始时间可为0,如果目标内容是实时的如直播内容,节目段落开始是绝对时间的某一时刻。
当type=1(表示实时媒体呈现),如果timeOffset不出现,则表示在直播内容的当前时间位置加入链接的媒体呈现。
duration,用于说明链接的内容项的时间长度。参见图5-c,图5-c所示的例子中,多个内容项,每一个内容项都是不同的媒体呈现描述中说明的一个内容段落。其中一些内容项是非实时的,而另外的为实时的。通过内容聚合,生成一个在时间上连续的媒体呈现描述。
参见图5-d,其中,图5-d所示的例子中,在内容聚合之外,引入了客户端行为控制。有两个内容项,内容项B是录制的广告,它有一个对应的媒体呈现描述。内容项A是实时的羽毛球比赛,开始于时间t0。内容服务者希望用户在观看比赛前收看广告B。内容服务上发布了媒体内容A的媒体呈现描述,注意内容项A对应的内容段落元素中加入了EssentialProperty描述子,客户端须处理这个描述子,否则它不能识别该描述子的方法识别符,就应该放弃对该内容项的处理。这个描述子的方法符告诉客户端这是1个内容段落的链接方法,参数的含义是:前置一个内容段落,该前置的内容段落的内容是非实时的,该前置的内容段落系引用URL为http://example.com/ad/ad1.mpd的媒体呈现描述中的内容段落ad1,该前置的内容段落从引用的内容段落的起始时间位置开始。
图5-e示意了内容项之间的时间关系。如图5-e所示,无论用户在何时开始接收节目,都要先收看内容项B,然后才能收看内容项A。实时内容段落开始于时间t0,用户在时间t1开始收看节目,用户首先收看前置的内容项B,在该内容结束后开始收看内容项A,这时时间已经是t2,用户没有看到内容段落t0~t1的那一部分,图中以虚线框示意。t1~t2是内容项B的时间长度。
下面的一个例子是前置广告的直播节目的例子。服务端提供的是一项目直播内容,无论用户(客户端)从什么时间加入直播,都会先看一段前置广告而后加入到直播中。
下面分别从服务端(网络设备端)和客户端来看这个例子。
服务过程是从客户端发出对直播节目的请求开始,服务端收到该请求之后 生成一个聚合媒体描述,该聚合媒体描述以当前时间t0作为时间参考点,聚合媒体描述可以通过@expiry属性的存在,来说明该聚合媒体呈现描述是动态更新的,将在@expiry所指示的时间t1之后失效(过期),而下一个版本的聚合媒体描述在t1-tw1时刻(t1-tw1时刻形成一个时间窗口)可获得。第一版本的聚合媒体呈现描述中包含一个MediaPresentation元素,它所描述的媒体呈现的开始时间tp1由属性@start指示。MediaPresentation元素包含一个指针,指向插入的前置广告的媒体呈现描述MPD1。从t1-tw1开始,第二版本的聚合媒体呈现描述取代第一版本的聚合媒体呈现描述,第二版本的聚合媒体呈现描述中增加了第二个MediaPresentation元素,该MediaPresentation元素提供了直播节目的描述信息。它是一个直播节目,开始接入的时间tp2由该元素的属性@start给出,这个时间也是第一项内容的结束时间。其中,属性@offset的存在告知客户端不是当前时间tp2,而是以按照时间偏移-□t以延迟的方式来加入到直播节目中的,即以tp2-□t加入到直播节目中。对于直播媒体呈现,时间偏移量的数值是非正的,即延迟的时间大于等于0,因为通常是不可能提前加入直播节目中的。
客户端发出请求后收到服务端返回的聚合媒体呈现,客户端解析聚合媒体呈现。从时间tp1开始根据MPD1处理第一项内容(媒体呈现)直到tp2。在这期间,客户端根据@expriy和@timeAdvance的指示,在时间tc1(t1-tw1<tc1<t1)请求更新的聚合媒体呈现描述,根据聚合媒体呈现描述中的第二个MediaPresentation元素获取了MPD2,这时第一项内容还在播放,到tp2结束第一媒体呈现开始处理第二媒体呈现,直到结束。图中的红色线段表示第一媒体呈现的处理时间,绿色线段表示第一媒体呈现的处理时间。特别注意,第二项媒体呈现是直播内容,MPD2可能是动态更新,客户获取更新的MPD2,这个过程是客户端根据MPD2中的信息进行的,MPD2可能有多次更新,不过这个过程和聚合媒体呈现描述AMPD没有关系。另外,MPD1,MPD2和AMPD可能分别来自不同的服务器,反映在URL中的服务器名称或者IP地址不同。
下面再举例另一场景,在这个应用场景中,聚合媒体呈现是由三个不同的媒体呈现聚合而成的。聚合媒体呈现(也可称之为复合媒体呈现)的第一部分 是一个本地的媒体呈现。在复合媒体呈现描述中,媒体呈现元素MediaPresentation之下是MPD元素,描述了一个媒体呈现,其中包括一个Period。在作为示例的复合媒体呈现描述中,简明起见,MPD元素下只保留了一个Period元素而省略了其他的元素和属性。
图5-f举例了一种AMPD。其中,这个媒体呈现是直播类型的,在2015-3-2510:00接入这个直播的媒体呈现,加入媒体呈现的位置是该直播媒体呈现在绝对时间轴上的当前位置。
其中,聚合媒体呈现的第二部分也是一个远端的媒体呈现,这个媒体呈现是点播类型的,它是插入的一个广告,从开始位置接入。复合媒体呈现的第三部分是一个远端的媒体呈现,@xlink:herf属性指向了它的媒体呈现描述的通用资源定位符URL。从URL可以看出来,它的来源和第一部分的媒体呈现是不相同的。其中,这个媒体呈现是直播类型的,被引用的是这个媒体呈现中的内容段落m1。在2015-3-25 10:22加入这个直播的媒体呈现,不过不是在该媒体呈现的当前的位置(媒体时间)加入,其中,当前媒体内容的位置是在绝对时间10:22所对应的媒体内容的位置,而是在该直播媒体呈现的当前位置的10分钟之前加入的,即在绝对时间所对应的媒体内容的位置,相当把直播的媒体呈现延迟10分钟,延迟时间由@timeOffset指示,单位可以是秒。
下面再举例另一场景,如图5-g举例所示,在这个应用场景中,这是一个在时间上并行聚合内容的例子,多个媒体呈现在时间上是并行的,其中,它们在一个描述文件中被描述。其中,时间上并行聚合在一起的媒体呈现性质上是相同的,或者是直播,或者是点播。事实上,它提供了一种实现基于客户端的导览的方法。
这个方法是主要是基于客户端的,在分发环节可不需要对各个媒体呈现进行任何的处理。其中,根据DASH规范,一个媒体呈现是时间上顺序的内容段落构成的。多个媒体呈现,内容段落的安排是相互独立的,在时间上是交错的,这样的时间结构不是DASH能够处理的。为了适应DASH要求,当然可以对媒体呈现进行重新编码,消除时间段落的边界,这样多个媒体呈现可以包含在一个内容段落中。这样做的好处是只需要在DASH规范中引入小的扩展,客户端 的处理简单,但代价是需要对媒体呈现进行加工(重新编码),在一定程度上增加了复杂性。
在本实施例中,聚合媒体呈现描述中有多个MediaPresentation元素,每个MediaPresentation元素对应于一个媒体呈现,MediaPresentation元素可以是本地的,包含MPD及其下属的元素,也可以是非本地的,引用一个远端的媒体呈现描述。每个它们保持各自的内容段落和时间结构,不作改变。
其中,为了更好表示媒体呈现并发性,要么任何一个Presentation元素都不带有@startTime属性,要么每个Presentation元素都带有@startTime属性,并且@startTime的取值是相同的。前者,表示在复合媒体呈现描述可用的时候每个媒体呈现都是可用的,后者表示在@startTime指示的时间,每个Presentation是可用的。
其中,客户端收到复合媒体呈现之后可为每一个媒体呈现建立一个DASH客户端实例,进行该媒体呈现的媒体片段的获取,媒体数据的译码和播放等等处理。
在Presentation元素中引入空间位置关系描述符元素。EssentialProperty元素中的@schemeIdUri说明该描述符所引用的规则,其中,@value是所引用的规则的参数。
其中,在这个例子中,引用的规则用通用资源名称urn:mpeg:dash:srd:2013来识别(标识),它是用于标识空间关系的,其中,@value的取值是该规则要求的参数,如其中的第二,第三个数值表示对象(此处为Presentation)左上角的坐标,第四个数值和第五个数值表示对象的宽度和高度。
可以立即,上述举例都是示例性的,在实际应用中,可根据具体需要来进行适应性的调整。
参见图6,本发明实施例提供一种服务端600,可包括:
生成单元610,用于生成第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不 同的媒体呈现。
处理单元620,用于存储或发送所述媒体呈现描述。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
可选的,在本发明的一些可能的实时方式中,当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
可选的,在本发明的一些可能的实时方式中,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能的实时方式中,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位 置开始播放。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
可选的,在本发明的一些可能的实时方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
可选的,在本发明的一些可能的实时方式中,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,其中,所述更新内容包括所述第一媒体呈现描述元素。
可选的,在本发明的一些可能的实时方式中,所述内容项为内容段落或媒体表达或自适应集。
可以理解的是,本实施例的服务端600的各功能模块的功能可根据上述方法实施例中的方法具体实现,其具体实现过程可以参照上述方法实施例的相关描述,此处不再赘述。
可以看出,在本实施例的技术方案中,第一媒体呈现包括的内容项可以来自第二媒体呈现,也就是说,可以将其它若干个媒体呈现的部分或全部内容项进行重新聚合安排以形成满足特定编排需要的新媒体呈现,并且,新媒体呈现的媒体呈现描述包括了聚合来的其它媒体呈现的内容项的描述,使得客户端可据此进行相应内容项的获取和播放等。总得来说,在本实施例的技术方案有利于实现媒体内容的灵活聚合。
参见图7,本发明实施例提供一种客户端700,可包括:
第一获取单元710,用于获取第一媒体呈现的媒体呈现描述,其中,所述 第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;
第二获取单元720,用于根据所述内容项的描述获取所述内容项;
播放单元730,用于播放所述内容项。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
可选的,在本发明的一些可能的实时方式中,当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
可选的,在本发明的一些可能的实时方式中,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能的实时方式中,当所述第二媒体呈现为非实 时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
可选的,在本发明的一些可能的实时方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
可选的,在本发明的一些可能的实时方式中,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,其中,所述更新内容可以包括所述第一媒体呈现描述元素。
可选的,在本发明的一些可能的实时方式中,所述内容项为内容段落或媒体表达或自适应集。
可以理解的是,本实施例的客户端700的各功能模块的功能可根据上述方法实施例中的方法具体实现,其具体实现过程可以参照上述方法实施例的相关描述,此处不再赘述。
可以看出,在本实施例的技术方案中,第一媒体呈现包括的内容项可以来自第二媒体呈现,也就是说,可以将其它若干个媒体呈现的部分或全部内容项进行重新聚合安排以形成满足特定编排需要的新媒体呈现,并且,新媒体呈现的媒体呈现描述包括了聚合来的其它媒体呈现的内容项的描述,使得客户端可 据此进行相应内容项的获取和播放等。总得来说,在本实施例的技术方案有利于实现媒体内容的灵活聚合。
参见图8,图8是本发明的另一实施例提供的服务端800的结构框图。其中,服务端800可包括:至少1个处理器801,存储器805和至少1个通信总线802。其中,通信总线802用于实现这些组件之间的连接通信。
其中,该服务端800可选的可以包含至少1个网络接口804和/或用户接口803,用户接口803可以包括显示器(例如触摸屏、LCD、全息成像(Holographic)、CRT或者投影(Projector)等)、点击设备(例如鼠标或轨迹球(trackball)触感板或触摸屏等)、摄像头和/或拾音装置等。
其中,存储器805可以包括只读存储器和随机存取存储器,并向处理器801提供指令和数据。存储器805中的一部分还可以包括非易失性随机存取存储器。
在一些实施方式中,存储器805存储了如下的元素,可执行模块或者数据结构,或者他们的子集,或者他们的扩展集:
操作系统8051,包含各种系统程序,用于实现各种基础业务以及处理基于硬件的任务。
应用程序模块8052,包含各种应用程序,用于实现各种应用业务。
在本发明的实施例中,通过调用存储器805存储的程序或指令,处理器801生成第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;存储或发送所述媒体呈现描述。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
可选的,在本发明的一些可能的实时方式中,当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
可选的,在本发明的一些可能的实时方式中,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能的实时方式中,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
可选的,在本发明的一些可能的实时方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒 体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
可选的,在本发明的一些可能的实时方式中,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,其中,所述更新内容包括所述第一媒体呈现描述元素。
可选的,在本发明的一些可能的实时方式中,所述内容项为内容段落或媒体表达或自适应集。
可以理解的是,本实施例的服务端800的各功能模块的功能可根据上述方法实施例中的方法具体实现,其具体实现过程可以参照上述方法实施例的相关描述,此处不再赘述。
可以看出,在本实施例的技术方案中,第一媒体呈现包括的内容项可以来自第二媒体呈现,也就是说,可以将其它若干个媒体呈现的部分或全部内容项进行重新聚合安排以形成满足特定编排需要的新媒体呈现,并且,新媒体呈现的媒体呈现描述包括了聚合来的其它媒体呈现的内容项的描述,使得客户端可据此进行相应内容项的获取和播放等。总得来说,在本实施例的技术方案有利于实现媒体内容的灵活聚合。
参见图9,图9是本发明的另一实施例提供的客户端900的结构框图。其中,客户端900可包括:至少1个处理器901,存储器905和至少1个通信总线902。其中,通信总线902用于实现这些组件之间的连接通信。
其中,该客户端900可选的可以包含至少1个网络接口904和/或用户接口903,用户接口903可以包括显示器(例如触摸屏、LCD、全息成像(Holographic)、CRT或者投影(Projector)等)、点击设备(例如鼠标或轨迹球(trackball)触感板或触摸屏等)、摄像头和/或拾音装置等。
其中,存储器905可包括只读存储器和随机存取存储器,并向处理器901提供指令和数据。存储器905中的一部分还可以包括非易失性随机存取存储器。
在一些实施方式中,存储器905存储了如下的元素,可执行模块或者数据 结构,或者他们的子集,或者他们的扩展集:
操作系统9051,包含各种系统程序,用于实现各种基础业务以及处理基于硬件的任务。
应用程序模块9052,包含各种应用程序,用于实现各种应用业务。
在本发明的实施例中,通过调用存储器905存储的程序或指令,处理器901获取第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;根据所述内容项的描述获取所述内容项;播放所述内容项。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
可选的,在本发明的一些可能的实时方式中,当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
可选的,在本发明的一些可能的实时方式中,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈 现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能的实时方式中,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
可选的,在本发明的一些可能的实时方式中,所述内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
可选的,在本发明的一些可能的实时方式中,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
可选的,在本发明的一些可能的实时方式中,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,其中,所述更新内容包括所述第一媒体呈现描述元素。
可选的,在本发明的一些可能的实时方式中,所述内容项为内容段落或媒体表达或自适应集。
可以理解的是,本实施例的服务端900的各功能模块的功能可根据上述方法实施例中的方法具体实现,其具体实现过程可以参照上述方法实施例的相关描述,此处不再赘述。
可以看出,在本实施例的技术方案中,第一媒体呈现包括的内容项可以来自第二媒体呈现,也就是说,可以将其它若干个媒体呈现的部分或全部内容项进行重新聚合安排以形成满足特定编排需要的新媒体呈现,并且,新媒体呈现的媒体呈现描述包括了聚合来的其它媒体呈现的内容项的描述,使得客户端可据此进行相应内容项的获取和播放等。总得来说,在本实施例的技术方案有利于实现媒体内容的灵活聚合。
本发明实施例提供一种通信系统,包括本发明实施例提供的任意一种客户端和本发明实施例提供的任意一种服务端。
本发明实施例还提供一种计算机存储介质,其中,该计算机存储介质可存储有程序,该程序执行时包括上述方法实施例中记载的任何一种内容项聚合方法的部分或全部步骤。
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,因为依据本发明,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本发明所必须的。
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。
在本申请所提供的几个实施例中,应该理解到,所揭露的装置,可通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性或其它的形式。
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部 单元来实现本实施例方案的目的。
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。
以上所述,以上实施例仅用以说明本发明技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,其中,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。

Claims (66)

  1. 一种内容项聚合方法,其特征在于,包括:
    服务端生成第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,其中,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;
    存储或发送所述媒体呈现描述。
  2. 根据权利要求1所述的方法,其特征在于,
    所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
  3. 根据权利要求1或2所述的方法,其特征在于,
    所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
  4. 根据权利要求1至3任意一项所述的方法,其特征在于,
    所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
  5. 根据权利要求4所述的方法,其特征在于,
    当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
  6. 根据权利要求5所述的方法,其特征在于,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
  7. 根据权利要求6所述的方法,其特征在于,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将 从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  8. 根据权利要求6所述的方法,其特征在于,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  9. 根据权利要求1至8任意一项所述的方法,其特征在于,所述内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
  10. 根据权利要求1至8任意一项所述的方法,其特征在于,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
  11. 根据权利要求10所述的方法,其特征在于,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,其中,所述更新内容包括所述第一媒体呈现描述元素。
  12. 根据权利要求1至11任意一项所述的方法,其特征在于,所述内容项为内容段落或媒体表达或自适应集。
  13. 一种内容项聚合方法,其特征在于,包括:
    客户端获取第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,其中,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;
    所述客户端根据所述内容项的描述获取所述内容项;所述客户端播放所述内容项。
  14. 根据权利要求13所述的方法,其特征在于,
    所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
  15. 根据权利要求13或14所述的方法,其特征在于,
    所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
  16. 根据权利要求13至15任意一项所述的方法,其特征在于,
    所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
  17. 根据权利要求16所述的方法,其特征在于,
    当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
  18. 根据权利要求17所述的方法,其特征在于,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
  19. 根据权利要求18所述的方法,其特征在于,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体 呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  20. 根据权利要求18所述的方法,其特征在于,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  21. 根据权利要求13至20任意一项所述的方法,其特征在于,所述内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
  22. 根据权利要求13至20任意一项所述的方法,其特征在于,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
  23. 根据权利要求22所述的方法,其特征在于,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从服务端获取所述聚合媒体呈现描述的更新内容,所述更新内容包括所述第一媒体呈现描述元素。
  24. 根据权利要求12至23任意一项所述的方法,其特征在于,所述内容项为内容段落或媒体表达或自适应集。
  25. 一种服务端,其特征在于,包括:
    生成单元,用于生成第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描 述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;
    处理单元,用于存储或发送所述媒体呈现描述。
  26. 根据权利要求25所述的服务端,其特征在于,
    所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
  27. 根据权利要求25或26所述的服务端,其特征在于,
    所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
  28. 根据权利要求25至27任意一项所述的服务端,其特征在于,
    所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
  29. 根据权利要求28所述的服务端,其特征在于,
    当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
  30. 根据权利要求29所述的服务端,其特征在于,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
  31. 根据权利要求30所述的服务端,其特征在于,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  32. 根据权利要求30所述的服务端,其特征在于,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  33. 根据权利要求25至32任意一项所述的服务端,其特征在于,所述内容项的描述包括在所述媒体呈现描述的聚合方法描述子中,或所述内容项的描述的指向信息包括在所述媒体呈现描述的聚合方法描述子中。
  34. 根据权利要求25至32任一项所述的服务端,其特征在于,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
  35. 根据权利要求34所述的服务端,其特征在于,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,其中,所述更新内容包括所述第一媒体呈现描述元素。
  36. 根据权利要求25至35任意一项所述的服务端,其特征在于,所述内容项为内容段落或媒体表达或自适应集。
  37. 一种客户端,其特征在于,包括:
    第一获取单元,用于获取第一媒体呈现的媒体呈现描述,其中,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不 同的媒体呈现;
    第二获取单元,用于根据所述内容项的描述获取所述内容项;
    播放单元,用于播放所述内容项。
  38. 根据权利要求37所述的客户端,其特征在于,
    所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
  39. 根据权利要求37或38所述的客户端,其特征在于,
    所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
  40. 根据权利要求37至39任意一项所述的客户端,其特征在于,
    所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
  41. 根据权利要求40所述的客户端,其特征在于,
    当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
  42. 根据权利要求37至41任意一项所述的客户端,其特征在于,所述内容项为内容段落或媒体表达或自适应集。
  43. 一种服务端,其特征在于,包括:存储器和处理器;
    其中,所述处理器用于,生成第一媒体呈现的媒体呈现描述,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,其中,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,其中,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;存储或发送所述媒体呈现描述。
  44. 根据权利要求43所述的服务端,其特征在于,
    所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
  45. 根据权利要求43或44所述的服务器,其特征在于,
    所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
  46. 根据权利要求43至45任意一项所述的所述处理器,其特征在于,
    所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
  47. 根据权利要求46所述的服务器,其特征在于,
    当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
  48. 根据权利要求47所述的服务器,其特征在于,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
  49. 根据权利要求48所述的服务器,其特征在于,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  50. 根据权利要求48所述的服务器,其特征在于,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  51. 根据权利要求43至50任意一项所述的服务器,其特征在于,所述内容项的描述包括在所述媒体呈现描述的聚合服务器描述子中,或所述内容项的描 述的指向信息包括在所述媒体呈现描述的聚合服务器描述子中。
  52. 根据权利要求43至50任意一项所述的服务器,其特征在于,所述第一媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
  53. 根据权利要求52所述的服务器,其特征在于,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从所述服务端获取所述聚合媒体呈现描述的更新内容,其中,所述更新内容包括所述第一媒体呈现描述元素。
  54. 根据权利要求43至53任意一项所述的服务器,其特征在于,所述内容项为内容段落或媒体表达或自适应集。
  55. 一种客户端,其特征在于,包括:
    存储器和处理器;
    其中,所述处理器用于,获取第一媒体呈现的媒体呈现描述,所述第一媒体呈现包括内容项,所述媒体呈现描述包括所述内容项的描述或所述媒体呈现描述包括所述内容项的描述的指向信息,所述内容项的描述用于指示出所述内容项来自第二媒体呈现,所述第一媒体呈现和所述第二媒体呈现为不同的媒体呈现;根据所述内容项的描述获取所述内容项;播放所述内容项。
  56. 根据权利要求55所述的客户端,其特征在于,
    所述内容项的描述还用于指示出所述第二媒体呈现为实时媒体呈现或非实时媒体呈现。
  57. 根据权利要求55或56所述的客户端,其特征在于,
    所述内容项的描述还用于指示出所述内容项在所述第一媒体呈现中嵌入的时间位置。
  58. 根据权利要求55至57任意一项所述的客户端,其特征在于,
    所述内容项的描述还用于指示出所述内容项的部分或全部被嵌入到所述第一媒体呈现中。
  59. 根据权利要求58所述的客户端,其特征在于,
    当所述内容项的描述还用于指示出所述内容项的部分被嵌入到所述第一媒体呈现中的情况下,所述内容项的描述还用于指示出所述内容项的所述部分的起始播放时间位置和/或结束播放时间位置。
  60. 根据权利要求59所述的客户端,其特征在于,所述内容项的描述包括偏移指示fz,所述偏移指示fz用于指示出所述内容项的起始播放时间位置和起始内容时间位置之间的偏移量offset。
  61. 根据权利要求60所述的客户端,其特征在于,当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从当前时间对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从当前时间回退偏移量offset对应的内容位置开始播放;或当所述第二媒体呈现为实时媒体呈现时,并且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  62. 根据权利要求60所述的客户端,其特征在于,当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset等于0时,表示所述内容项将从所述内容项的起始内容位置开始播放;或当所述第二媒体呈现为非实时媒体呈现时,且所述偏移指示fz指示的偏移量offset不等于0时,表示所述内容项的将从所述内容项的起始内容位置向后偏移所述偏移量offset对应的内容位置开始播放。
  63. 根据权利要求55至62任意一项所述的客户端,其特征在于,所述内容项的描述包括在所述媒体呈现描述的聚合客户端描述子中,或所述内容项的描述的指向信息包括在所述媒体呈现描述的聚合客户端描述子中。
  64. 根据权利要求55至62任意一项所述的客户端,其特征在于,所述第一 媒体呈现为聚合媒体呈现,所述媒体呈现描述为聚合媒体呈现描述,所述聚合媒体呈现描述包括N个媒体呈现描述元素,所述N为大于1或等于1的整数,第一媒体呈现描述元素为所述聚合媒体呈现描述包括的所述N个媒体呈现描述元素之中的一个媒体呈现描述元素,其中,所述内容项的描述包括在所述第一媒体呈现描述元素中或者所述内容项的描述的指向信息包括在所述第一媒体呈现描述元素中。
  65. 根据权利要求64所述的客户端,其特征在于,所述聚合媒体呈现描述还包括第一媒体呈现描述元素对应的时间窗口指示,其中,所述时间窗口指示用于指示客户端在所述时间窗口指示所指示的时间窗口内,从服务端获取所述聚合媒体呈现描述的更新内容,所述更新内容包括所述第一媒体呈现描述元素。
  66. 根据权利要求55至65任意一项所述的客户端,其特征在于,所述内容项为内容段落或媒体表达或自适应集。
PCT/CN2016/085590 2015-06-16 2016-06-13 内容项聚合方法和相关装置及通信系统 WO2016202225A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP16810966.8A EP3285455B1 (en) 2015-06-16 2016-06-13 Content item aggregation method and related device and communication system
US15/830,516 US20180146230A1 (en) 2015-06-16 2017-12-04 Content item aggregation method, related apparatus, and communications system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510334315.4A CN104935595B (zh) 2015-06-16 2015-06-16 内容项聚合方法和相关装置及通信系统
CN201510334315.4 2015-06-16

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/830,516 Continuation US20180146230A1 (en) 2015-06-16 2017-12-04 Content item aggregation method, related apparatus, and communications system

Publications (1)

Publication Number Publication Date
WO2016202225A1 true WO2016202225A1 (zh) 2016-12-22

Family

ID=54122567

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/085590 WO2016202225A1 (zh) 2015-06-16 2016-06-13 内容项聚合方法和相关装置及通信系统

Country Status (4)

Country Link
US (1) US20180146230A1 (zh)
EP (1) EP3285455B1 (zh)
CN (1) CN104935595B (zh)
WO (1) WO2016202225A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104935595B (zh) * 2015-06-16 2019-10-15 华为技术有限公司 内容项聚合方法和相关装置及通信系统
CN107566854B (zh) * 2016-06-30 2020-08-07 华为技术有限公司 一种媒体内容的获取和发送方法及装置
US11356715B2 (en) * 2018-12-28 2022-06-07 Tencent America LLC Dynamic shortening of advertisement duration during live streaming
CN111479164A (zh) * 2019-01-23 2020-07-31 上海哔哩哔哩科技有限公司 硬件解码动态分辨率无缝切换方法、装置及存储介质
CN110650366B (zh) * 2019-10-29 2021-09-24 成都超有爱科技有限公司 互动配音方法、装置、电子设备及可读存储介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120023253A1 (en) * 2010-07-20 2012-01-26 Samsung Electronics Co., Ltd. Method and apparatus for transmitting and receiving adaptive streaming mechanism-based content
CN102714662A (zh) * 2010-01-18 2012-10-03 瑞典爱立信有限公司 用于http媒体流分发的方法和装置
CN104935595A (zh) * 2015-06-16 2015-09-23 华为技术有限公司 内容项聚合方法和相关装置及通信系统

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101355664B (zh) * 2008-09-23 2010-08-04 华为终端有限公司 一种节目的播放方法、装置和系统
KR101737084B1 (ko) * 2009-12-07 2017-05-17 삼성전자주식회사 메인 콘텐트에 다른 콘텐트를 삽입하여 스트리밍하는 방법 및 장치
CN102291373B (zh) * 2010-06-15 2016-08-31 华为技术有限公司 元数据文件的更新方法、装置和系统
CN102130936B (zh) * 2010-08-17 2013-10-09 华为技术有限公司 一种在动态http流传输方案中支持时移回看的方法和装置
CN103747365B (zh) * 2010-09-17 2017-04-26 华为技术有限公司 基于http流的媒体内容动态插播方法、装置及系统
US9591361B2 (en) * 2011-09-07 2017-03-07 Qualcomm Incorporated Streaming of multimedia data from multiple sources
US9954717B2 (en) * 2012-07-11 2018-04-24 Futurewei Technologies, Inc. Dynamic adaptive streaming over hypertext transfer protocol as hybrid multirate media description, delivery, and storage format
US9203811B2 (en) * 2012-10-09 2015-12-01 Futurewei Technologies, Inc. Authenticated encryption support in ISO/IEC 23009-4
EP2962469A1 (en) * 2013-07-15 2016-01-06 Huawei Technologies Co., Ltd. Just-in-time dereferencing of remote elements in dynamic adaptive streaming over hypertext transfer protocol
US9258747B2 (en) * 2013-09-17 2016-02-09 Intel IP Corporation User equipment and methods for fast handover failure recovery in 3GPP LTE network

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102714662A (zh) * 2010-01-18 2012-10-03 瑞典爱立信有限公司 用于http媒体流分发的方法和装置
US20120023253A1 (en) * 2010-07-20 2012-01-26 Samsung Electronics Co., Ltd. Method and apparatus for transmitting and receiving adaptive streaming mechanism-based content
CN104935595A (zh) * 2015-06-16 2015-09-23 华为技术有限公司 内容项聚合方法和相关装置及通信系统

Also Published As

Publication number Publication date
CN104935595A (zh) 2015-09-23
EP3285455A1 (en) 2018-02-21
EP3285455B1 (en) 2019-12-04
CN104935595B (zh) 2019-10-15
US20180146230A1 (en) 2018-05-24
EP3285455A4 (en) 2018-05-02

Similar Documents

Publication Publication Date Title
US11962835B2 (en) Synchronizing internet (over the top) video streams for simultaneous feedback
US9992537B2 (en) Real-time tracking collection for video experiences
US9426543B1 (en) Server-based video stitching
WO2016202225A1 (zh) 内容项聚合方法和相关装置及通信系统
US20090106357A1 (en) Synchronized Media Playback Using Autonomous Clients Over Standard Internet Protocols
CN108605153A (zh) 同步媒体内容标签数据
US10515476B2 (en) Image fetching for timeline scrubbing of digital media
US20090193466A1 (en) Distributed network-based video content for television
CN109348251A (zh) 用于视频播放的方法、装置、计算机可读介质及电子设备
CN113141522B (zh) 资源传输方法、装置、计算机设备及存储介质
US20170374122A1 (en) Method and Related Apparatus for Providing Media Presentation Guide in Media Streaming Over Hypertext Transfer Protocol
US20180324480A1 (en) Client and Method for Playing a Sequence of Video Streams, and Corresponding Server and Computer Program Product
US20120151538A1 (en) Method for interactive delivery of multimedia content, content production entity and server entity for realizing such a method
CN106537930A (zh) 多媒体流业务呈现方法和相关装置及相关系统
CA2938484C (en) In-band trick mode control
US11856242B1 (en) Synchronization of content during live video stream
Bassbouss Concepts and models for creating distributed multimedia applications and content in a multiscreen environment
KR20210052345A (ko) 이종 네트워크를 통해 수신한 콘텐츠의 삽입 방법 및 장치
CN109429109A (zh) 一种共享信息的方法及机顶盒
LÓPEZ et al. Enhancing the Broadcasted TV Consumption Experience With Broadband Omnidirectional Video Content

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16810966

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE