WO2011022432A1 - Encoding video streams for adaptive video streaming - Google Patents

Encoding video streams for adaptive video streaming Download PDF

Info

Publication number
WO2011022432A1
WO2011022432A1 PCT/US2010/045805 US2010045805W WO2011022432A1 WO 2011022432 A1 WO2011022432 A1 WO 2011022432A1 US 2010045805 W US2010045805 W US 2010045805W WO 2011022432 A1 WO2011022432 A1 WO 2011022432A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
gops
gop
video stream
key frame
Prior art date
Application number
PCT/US2010/045805
Other languages
French (fr)
Inventor
Anthony Neal Park
Yung-Hsiao Lai
David Randall Ronca
Original Assignee
Netflix, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Netflix, Inc. filed Critical Netflix, Inc.
Priority to MX2012002087A priority Critical patent/MX2012002087A/en
Priority to EP10810513.1A priority patent/EP2467956B1/en
Priority to IN2232DEN2012 priority patent/IN2012DN02232A/en
Priority to JP2012525650A priority patent/JP5499314B2/en
Priority to CA2771187A priority patent/CA2771187C/en
Priority to BR112012003843-5A priority patent/BR112012003843B1/en
Publication of WO2011022432A1 publication Critical patent/WO2011022432A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/23439Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements for generating different versions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/23611Insertion of stuffing data into a multiplex stream, e.g. to obtain a constant bitrate
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2362Generation or processing of Service Information [SI]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • H04N21/2389Multiplex stream processing, e.g. multiplex stream encrypting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • H04N21/2389Multiplex stream processing, e.g. multiplex stream encrypting
    • H04N21/23895Multiplex stream processing, e.g. multiplex stream encrypting involving multiplex stream encryption

Definitions

  • the present invention relates generally to digital media and, more
  • Conventional digital content distribution systems usually include a content server, a content player, and a communications network connecting the content server to the content player.
  • the content server is configured to store digital content files corresponding to different content titles that can be downloaded from the content server to the content player.
  • Each digital content file typically includes a video stream encoded to a particular playback bit rate as well as an audio stream. As is well- understood, a video stream encoded to a high playback bit rate is larger in size than a video stream encoded to a lower playback bit rate.
  • the content player is configured to download and play a digital content file corresponding to a specific content title in response to a user selecting the content title for playback.
  • Downloading the digital content file typically involves a technique known in the art as "streaming," whereby the content server sequentially transmits the digital content file corresponding to the selected content title to the content player.
  • the content player then plays the video stream and the audio stream included in the digital content file as portions of those streams become available.
  • the content player may measure available bandwidth from the content server and select a digital content file having a video stream encoded to a bit rate that can be supported by the measured available bandwidth.
  • the content player When switching from downloading a current video stream to downloading a new video stream, the content player needs to match the video frame in the new video stream corresponding to the video frame in the current video stream being played at the time of the switch. To match video frames, the content player typically sequentially searches the new video stream to locate the video frame that matches the relevant video frame in the current video stream.
  • One drawback to this approach is that the searching operation may be very time consuming, thereby causing an interruption in downloading the video stream that disrupts the viewing experience for the user.
  • One embodiment of the present invention sets forth a method for encoding a video stream associated with a content title for adaptive video streaming.
  • the method includes the steps of applying a video codec to the video stream at a specific playback bit rate to generate a sequence of groups of pictures (GOPs), wherein each GOP is associated with a playback time interval and a different playback offset and includes a key frame and one or more frames of video data, applying an advanced system format to the sequence of GOPs to generate one or more data packets that include the sequence of GOPs, generating a sequence header index for the sequence of GOPs that includes a first switch point corresponding to a first GOP in the sequence of GOPs, wherein the first switch point specifies the playback offset associated with the first GOP and a first data packet that includes a first key frame included in the first GOP, and combining the sequence header index with the one or more data packets to generate an encoded video stream.
  • GOPs groups of pictures
  • One advantage of the disclosed method is that a content player can efficiently switch from one encoded video stream associated with a specific content title and having a specific playback bit rate to another encoded video stream associated with the same content title and having different playback bit rate by identifying the appropriate switch point in the sequence header index associated with the new encoded video stream.
  • Figure 1 illustrates a content distribution system configured to implement one or more aspects of the invention
  • Figure 2 is a more detailed illustration of the encoding server of Figure 1 , according to one embodiment of the invention.
  • Figure 3 is a conceptual diagram illustrating the different encoded stages of a video stream processed by the encoding server of Figure 2, according to one embodiment of the invention
  • Figure 4 is more detailed illustration of the sequence header index of Figure 3, according to one embodiment of the invention.
  • Figure 5 is a flow diagram of method steps for encoding a video stream for adaptive video streaming, according to one embodiment of the invention.
  • Figure 6 is a flow diagram of method steps for encoding and encrypting a video stream for adaptive video streaming, according to another embodiment of the invention.
  • Figure 1 illustrates a content distribution system 100 configured to implement one or more aspects of the invention.
  • the content distribution system 100 includes an encoding server 102, a communications network 104, a content distribution network (CDN) 106 and a content player 108.
  • the communications network 104 includes a plurality of network
  • communications systems such as routers and switches, configured to facilitate data communication between the encoding server 102, the CDN 106 and the content player 108.
  • Persons skilled in the art will recognize that many technically feasible techniques exist for building the communications network 104, including technologies practiced in deploying the well-known internet communications network.
  • the encoding server 102 is a computer system configured to encode video streams associated with digital content files for adaptive streaming.
  • the encoding workflow for encoding the video streams for adaptive streaming is described in greater detail below with respect to Figures 2 and 3.
  • the content distribution system 100 maybe include one or more encoding servers 102, where each encoding server 102 is configured to perform all the functions needed to encode the video streams or where each encoding server 102 is configured to perform a particular function needed to encode the video streams.
  • the digital content files including the encoded video streams are retrieved by the CDN 106 via the communications network 104 for distribution to the content player 108.
  • the CDN 106 comprises one or more computer systems configured to serve download requests for digital content files from the content player 108.
  • the digital content files may reside on a mass storage system accessible to the computer system.
  • the mass storage system may include, without limitation, direct attached storage, network attached file storage, or network attached block-level storage.
  • the digital content files may be formatted and stored on the mass storage system using any technically feasible technique.
  • a data transfer protocol such as the well-known hyper-text transfer protocol (HTTP), may be used to download digital content files from the content server 106 to the content player 108.
  • HTTP hyper-text transfer protocol
  • the content player 108 may comprise a computer system, a set top box, a mobile device such as a mobile phone, or any other technically feasible computing platform that has network connectivity and is coupled to or includes a display device and speaker device for presenting video frames, and generating acoustic output, respectively.
  • the content player 108 is configured for adaptive streaming, i.e., to download units of a video stream encoded to a specific playback bit rate, and switch to downloading subsequent units of a video stream encoded to a different playback bit rate based on prevailing bandwidth conditions within the communications network 104. As available bandwidth within the communications network 104 becomes limited, the content player 108 may select a video stream encoded to a lower playback bit rate.
  • Figure 1 is in no way intended to limit the scope of the present invention in any way.
  • FIG. 2 is a more detailed illustration of the encoding server 102 of Figure 1 , according to one embodiment of the invention.
  • the encoding server 102 includes a central processing unit (CPU) 202, a system disk 204, an input/output (I/O) devices interface 206, a network interface 208, an interconnect 210 and a system memory 212.
  • CPU central processing unit
  • I/O input/output
  • the CPU 202 is configured to retrieve and execute programming
  • the CPU 202 is configured to store application data and retrieve application data from the system memory 212.
  • the interconnect 210 is configured to facilitate transmission of data, such as programming instructions and application data, between the CPU 202, the system disk 204, I/O devices interface 206, the network interface 208, and the system memory 212.
  • the I/O devices interface 206 is configured to receive input data from I/O devices 222 and transmit the input data to the CPU 202 via the interconnect 210.
  • I/O devices 222 may comprise one or more buttons, a keyboard, and a mouse or other pointing device.
  • the I/O devices interface 206 is also configured to receive output data from the CPU 202 via the interconnect 210 and transmit the output data to the I/O devices 222.
  • the system disk 204 such as a hard disk drive or flash memory storage drive or the like, is configured to store non-volatile data such as encoded video streams. The encoded video streams can then be retrieved by the CDN 106 via the communications network 104.
  • the network interface 218 is coupled to the CPU 202 via the interconnect 210 and is configured to transmit and receive packets of data via the communications network 104.
  • the network interface 208 is configured to operate in compliance with the well-known Ethernet standard.
  • the system memory 212 includes software components that include instructions for encoding one or more video streams associated with a specific content title for adaptive streaming. As shown, these software components include a VC1 encoder 214, an advanced systems format (ASF) packaging tool 216, a padding tool 218 and a sequence header index (SHI) generator 220.
  • ASF advanced systems format
  • SHI sequence header index
  • the VC1 encoder 214 executes encoding operations for encoding a video stream to a specific playback bit rate such that the encoded video stream complies with the VC1 video codec standard and is configured for adaptive streaming.
  • the video stream can be encoded to comply with a different video codec standard such as MPEG or H.264.
  • An encoded video stream generated by the VC1 encoder 214 includes a sequence of groups of pictures (GOPs), each GOP comprising multiple frames of video data.
  • GOPs groups of pictures
  • the VC1 encoder 214 encodes the video stream according to three settings included in the VC1 video codec standard.
  • the closed entry point setting is enabled to ensure that each GOP in the encoded video stream is independent of the other GOPs in the encoded video stream.
  • the sequence header output mode setting is enabled so that a key frame that includes a sequence header is inserted at the beginning of each GOP.
  • the sequence header included in the key frame of a GOP specifies, among other information, a sequence header start code that can be used to locate the key frame within the encoded video stream and the resolution and aspect ratio of the frames of video data in the GOP.
  • the adaptive GOP setting is disabled to ensure that each GOP is associated with the same playback time interval and a different playback offset. The playback offset associated with a GOP is determined based on the location of the GOP in the sequence of GOPs included in the encoded video stream.
  • the VC1 encoder 214 transmits the encoded video stream to the ASF packaging tool 216 for further processing.
  • the ASF packaging tool 216 packages the encoded video stream received from the VC1 encoder 214 into an advanced systems format (ASF) compliant encoded video stream, which can be downloaded and processed for playback by multiple types of standards-compliant content players, including content player 108.
  • the ASF compliant encoded video stream includes a data object and an ASF header.
  • the data object stores the GOPs in one or more data packets of the same size.
  • a specific data packet may include frames of video data associated with two or more GOPs.
  • the ASF header includes information associated with the ASF compliant encoded video stream, such as the size and the number of data packets, needed by a content player, such as the content player 108, to process the ASF compliant encoded video stream for playback.
  • the ASF compliant encoded video stream is then processed by the padding tool 218.
  • the padding tool 218 inserts padding into the data object of the ASF compliant encoded video stream to ensure that the key frame associated with each GOP is located at the start of a different data packet within the data object. As described below, aligning key frames with different data packets allows the SHI generator 220 to define switch points for the ASF compliant encoded video stream, thus enabling content players to switch between multiple ASF compliant encoded video streams efficiently.
  • the padding tool 218 then transmits the ASF compliant encoded video stream to the SHI generator 220.
  • the SHI generator 220 generates a sequence header index associated with the ASF compliant encoded video stream.
  • the SHI generator 220 first searches the data object of the ASF compliant encoded video stream for the key frames associated with the different GOPs included in the data object.
  • the key frames can be located by the SHI generator 220 based on the sequence start codes specified in the sequence headers included in the key frames.
  • the SHI generator 220 defines a switch point within the sequence header index that stores (i) a data packet number that indentifies the data packet that includes the key frame associated with the GOP and (ii) the playback offset associated with the GOP. Again, the playback offset associated with the GOP is determined based on the location of the GOP in the sequence of GOPs included in the encoded video stream.
  • the encoding server 102 may generate multiple ASF compliant encoded video streams associated with the same content title and encoded to different playback bit rates in the manner described above.
  • the encoding process described herein ensures that, across the different ASF compliant encoded video streams the GOPs are associated with the same playback time interval and that corresponding GOPs across the different ASF compliant encoded video streams are associated with the same playback offsets. Therefore, each switch point defined in a sequence header included in one of the ASF compliant encoded video stream associated with a specific content title has a corresponding switch point defined in a sequence header included in each of the other ASF compliant encoded video stream associated with the same content title.
  • a content player can efficiently switch between the ASF compliant encoded video streams by identifying the appropriate switch points in the sequence header indices.
  • a content player such as the content player 108, searches the sequence header index included in the new ASF compliant encoded video stream to locate the particular switch point specifying the playback offset associated with the next GOP to be played. The content player can then switch to the new ASF compliant encoded video stream and download the GOP stored in the data packet specified at the particular switch point for playback.
  • the content player searches the sequence header associated with the new encoded stream for the particular switch point specifying a playback offset of three seconds. Once locating the particular switch point, the content player would download the GOP stored in the data packet specified in the switch point for playback.
  • padding is not inserted into the data object of the encoded video stream, and therefore, the key frames of the different GOPs are not necessarily aligned with new data packets.
  • the sequence header index specifies the data packet including a specific key frame, and the content player searches through the data packet for the key frame. Without padding, the size of the encoded video stream is reduced and, therefore, the encoded video stream can be downloaded faster by a content player.
  • FIG. 3 is a conceptual diagram illustrating the different encoded stages of a video stream processed by the encoding server 102 of Figure 2, according to one embodiment of the invention.
  • the video stream 302 is a mezzanine video stream associated with a specific content title as distributed by a video stream distributor once the rights to the specific content title are acquired.
  • the video stream 302 comprises a series of sequential frames of video data, such as frame 304 and frame 306.
  • the video stream 302 is encoded by the VC1 encoder 214 to generate the encoded video stream 308.
  • the VC1 encoder 214 encodes the mezzanine video stream to a specific playback bit rate.
  • the encoded video stream 308 is divided into multiple GOPs, such as GOP 318 and GOP 320.
  • Each GOP includes a key frame including a sequence header, such as key frame 310 in GOP 318 and key frame 314 in GOP 320. Further, each GOP within the encoded video stream 308 is associated with the same playback time interval and a different playback offset. For example, if the playback time interval is three seconds, then GOP 318 is associated with a playback offset of zero seconds, while GOP 320 is associated with a playback offset of six seconds.
  • the encoded video stream 308 is then processed by the ASF packaging tool 216 to generate an ASF compliant encoded video stream 322.
  • the ASF compliant encoded video stream 322 includes an ASF header 324, a data object including same-sized data packets, such as data packet 1 and data packet 7, and an ASF index 326.
  • the ASF header 324 includes information associated with the ASF compliant encoded video stream 322, such as the size and the number of data packets.
  • the ASF index 326 includes index information associated with the ASF compliant encoded video stream 322, and the data packets within the data object store the GOPs.
  • one GOP may be stored across different data packets. For example, as shown, GOP 318 is stored in data packet 1 , data packet 2 and partially in data packet 3.
  • the ASF compliant encoded video stream 322 is then processed by the padding tool 218.
  • the padding tool 218 inserts padding into the data object of the ASF compliant encoded video stream 322 to ensure that the key frame associated with each GOP is located at the start of a different data packet within the data object.
  • the padding tool 218 inserts padding 334 into data packet 3 after GOP 318 such that the key frame 316 of GOP 323 is aligned with a new data packet, i.e., data packet 4.
  • the padding tool 218 inserts padding 336 into data packet 5 after GOP 323 such that key frame 314 of GOP 320 is aligned with a new data packet, i.e., data packet 6.
  • the SHI generator 220 generates a sequence header index 338 associated with the ASF compliant encoded video stream 322. For the GOP associated with each of the identified key frames, the SHI generator 220 defines a switch point within the sequence header index 338 that stores (i) a data packet number that indentifies the data packet that includes the key frame associated with the GOP and (ii) the playback offset associated with the GOP.
  • the sequence header index 338 is described in greater detail below in conjunction with Figure 4. Once generated, the SHI generator 220 inserts the sequence header index 338 into the ASF header 324 of the ASF compliant encoded video stream 322.
  • FIG 4 is more detailed illustration of the sequence header index of Figure 3, according to one embodiment of the invention.
  • the sequence header index includes one or more switch points, such as switch point 408 and switch point 410, and each switch point includes an index portion 402, an offset portion 404, and a data packet portion 406.
  • Each switch point is associated with a specific GOP starting at a particular playback offset specified in the offset portion 404 of the switch point, where the key frame of that GOP is located within a particular data packet specified in the data packet portion 406 of the switch point.
  • switch point 408 is associated with GOP 318, and the offset portion 404 of the switch point 408 indicates that the playback offset for the GOP 318 is zero seconds (i.e., the first GOP in the video stream) and the data packet portion 406 indicates that the key frame 310 of GOP 318 is located in data packet 1.
  • FIG. 5 is a flow diagram of method steps for encoding a video stream for adaptive video streaming, according to one embodiment of the invention. Although the method steps are described in conjunction with the systems for Figures 1 -4, persons skilled in the art will understand that any system configured to perform the method steps, in any order, is within the scope of the invention.
  • the method 500 begins at step 502 where the VC1 encoder 214 executes encoding operations on a mezzanine video stream to generate an encoded video stream encoded to a specific play back bit rate.
  • An encoded video stream generated by the VC1 encoder 214 includes a sequence of groups of pictures (GOPs), each GOP comprising multiple frames of video data and a key frame that includes a sequence header.
  • Each GOP is associated with the same playback time interval and a different playback offset. Again, the playback offset associated with a GOP is determined based on the location of the GOP in the sequence of GOPs included in the encoded video stream.
  • the ASF packaging tool 216 processes the encoded video stream to generate an ASF compliant encoded video stream.
  • the ASF compliant encoded video stream includes an ASF header, a data object including same-sized data packets and, optionally, an ASF index.
  • the ASF header and ASF index store information related to the ASF compliant encoded video stream such as the size of the data packets and the indices of the data packets.
  • the data object stores the GOPs of the encoded video stream in the data packets.
  • the padding tool 218 inserts padding into the data object of the ASF compliant encoded video stream to ensure that the key frame associated with each GOP is located at the start of a different data packet within the data object.
  • the padding tool 218 then transmits the ASF compliant encoded video stream to the SHI generator 220.
  • the SHI generator 220 searches the data object of the ASF compliant encoded video stream for the key frames associated with the different GOPs included in the data object.
  • the key frames can be located by the SHI generator 220 based on the sequence start codes specified in the sequence headers included in the key frames.
  • the SHI generator 220 generates a sequence header index associated with the ASF compliant encoded video stream based on the locations of the key frames. For the GOP associated with each of the identified key frames, the SHI generator 220 defines a switch point within a sequence header index that stores (i) a data packet number that indentifies the data packet that includes the key frame associated with the GOP and (ii) the playback offset associated with the GOP.
  • the SHI generator 220 inserts the sequence header index into the ASF header of the ASF compliant encoded video stream.
  • a video stream being processed by the encoding server 102 may be encrypted using a digital rights management (DRM) encryption technique during the encoding process.
  • DRM digital rights management
  • the SHI generator 220 would end up searching for the key frames based on the sequence header start codes post-encryption and, thus, not be able to generate a sequence header index associated with the encoded video stream.
  • the technique described below in conjunction with Figure 6 can be used as an alternative to the technique described above.
  • Figure 6 is a flow diagram of method steps for encoding and encrypting a video stream for adaptive video streaming, according to another embodiment of the invention.
  • the method steps are described in conjunction with the systems for Figures 1 -4, persons skilled in the art will understand that any system configured to perform the method steps, in any order, is within the scope of the invention.
  • the method 600 begins at step 602, where the VC1 encoder 214 executes encoding operations on a mezzanine video stream to generate an encoded video stream encoded to a specific play back bit rate.
  • An encoded video stream generated by the VC1 encoder 214 includes a sequence of groups of pictures (GOPs), each GOP comprising multiple frames of video data and a key frame that includes a sequence header. Each GOP is associated with the same playback time interval and a different playback offset.
  • GOPs groups of pictures
  • the ASF packaging tool 216 processes the encoded video stream to generate an ASF compliant encoded video stream.
  • the ASF compliant encoded video stream includes an ASF header, a data object including same-sized data packets and, optionally, an ASF index.
  • the data object stores the GOPs of the encoded video stream in the data packets.
  • the SHI generator 220 searches the data object of the ASF compliant encoded video stream for the key frames associated with the different GOPs included in the data object.
  • the key frames can be located by the SHI generator 220 based on the sequence start codes specified in the sequence headers included in the key frames.
  • the SHI generator 220 generates a sequence header index associated with the ASF compliant encoded video stream based on the locations of the key frames. For the GOP associated with each of the identified key frames, the SHI generator 220 defines a switch point within a sequence header index that stores (i) a data packet number that indentifies the data packet that includes the key frame associated with the GOP and (ii) the playback offset associated with the GOP.
  • the SHI generator 220 inserts the sequence header index into the ASF header of the ASF compliant encoded video stream.
  • the encoding server 102 encrypts the ASF compliant encoded video stream using a DRM encryption technique, such as PlayReady DRM or Windows Media DRM (WMDRM).
  • a DRM encryption technique such as PlayReady DRM or Windows Media DRM (WMDRM).
  • encrypting a video stream using a DRM encryption technique may change the size of the frames of video data stored in the each GOP.
  • the locations of the key frames within the ASF compliant encoded video stream may change post-encryption.
  • the SHI generator 220 locates each key frame in the ASF compliant encoded video stream based on the corresponding playback offset stored in the sequence header index. Again, during encryption, the location of a key frame may change, but the playback offset associated with the GOP including the key frame does not change, thereby allowing the SHI generator 220 to locate accurately the key frame based on the playback offset.
  • the padding tool 218 inserts padding into the data object of the ASF compliant encoded video stream to ensure that the key frame associated with each GOP is located at the start of a different data packet within the data object.
  • the SHI generator 220 modifies the sequence header index stored in the ASF header of the ASF compliant encoded video stream based on the padding inserted into the data object of the encrypted ASF compliant encoded video stream. Specifically, the SHI generator 220 modifies the data packet identifiers stored in the sequence header index to specify the data packet storing the key frame.
  • the SHI generator 220 is able to generate the sequence header index associated with the ASF compliant encoded video stream before DRM encryption. Because the playback offsets associated with the GOPs remain the same during encryption, the SHI generator 220 is able to modify the sequence header index based on the new locations of the key frames included in the GOPs post-encryption.
  • a content player can efficiently switch between encrypted ASF compliant encoded video streams associated with the same content title by identifying the appropriate switch points in the sequence header indices included in encrypted ASF compliant encoded video streams.
  • the ASF compliant encoded video stream can be encrypted using WMDRM encryption. Because the WMDRM encryption technique does not change the locations of the key frames in the encrypted video stream, the sequence header index does not need to be re-adjusted after WMDRM encryption. As persons skilled in the art will recognize, the technique of Figure 6 may also be used in WMDRM implementations.
  • an encoding server encodes a video stream associated with a content title to identify switch points that are specified in a sequence header index included in the encoded video stream.
  • the switch points of two or more video streams corresponding to the same content title and encoded to different playback bit rates occur at the same playback time intervals across each of the two or more video streams.
  • the VC1 encoder within the encoding server first processes the video stream to generate an encoded video stream that is divided into one or more groups of pictures (GOPs) of video data.
  • Each GOP includes a sequence header followed by multiple frames of video data.
  • the sequence header specifies the resolution and the aspect ratio of the frames of video data, and the frames of video data within the GOP are associated with a particular playback time interval starting at a specific playback offset.
  • the ASF packaging tool within the encoding server packages the encoded video stream into an ASF compliant encoded video stream.
  • the ASF compliant encoded video stream includes an ASF header and a data object.
  • the ASF header includes information associated with the encoded video stream, such as the size and the number of data packets, needed by a content player to process the encoded video stream for playback.
  • the data object stores the GOPs in one or more data packets.
  • the ASF packaging tool transmits the ASF compliant encoded video stream to the padding tool within the encoding server.
  • the padding tool inserts padding into the data object of the ASF compliant encoded video stream to align the sequence header of each GOP with a new data packet within the data object.
  • the sequence header index (SHI) generator within the encoding server generates an SHI associated with the ASF compliant encoded video stream. For each GOP in the ASF compliant encoded video stream, the SHI specifies the data packet including the sequence header of the GOP and the playback offset corresponding to the GOP. The SHI generator then inserts the SHI into the ASF header of the ASF compliant encoded video stream. [0059] When encoding two or more video streams associated with the same content title, encoding server 102 generates two or more ASF compliant encoded video streams encoded to different playback bit rates in the manner described above.
  • each switch point defined in a sequence header included in one ASF compliant encoded video stream associated with a specific content title has a corresponding switch point defined in a sequence header included in a different ASF compliant encoded video stream associated with the same content title.
  • Another advantage of the disclosed technique is that the encoded video streams generated by the encoding server are ASF compliant and, therefore, can be downloaded and processed for playback by any standards- compliant content player.
  • Non-writable storage media e.g., read-only memory devices within a computer such as CD-ROM disks readable by a CD-ROM drive, flash memory, ROM chips or any type of solid-state non-volatile semiconductor memory
  • writable storage media e.g., floppy disks within a diskette drive or hard-disk drive or any type of solid-state random-access semiconductor memory

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

One embodiment of the invention sets forth an encoding server including components configured to encode a video stream associated with a content title for adaptive streaming. The video stream is first processed by a VC1 encoder to generate an encoded video stream comprising a multiple GOPs, each GOP including a key frame and having a different playback offset. The encoded video stream is then packaged such that the GOPs are stored in data packets of the packaged encoded stream. An SHI generator generates an SHI associated with the packaged encoded stream that includes a switch point associated with each GOP. Each switch point includes the playback offset associated with the corresponding GOP and the data packet storing the key frame of the corresponding GOP. The SHI associated with multiple packaged encoded video streams associated with the same content title and encoded to different playback bit rates have corresponding switch points.

Description

ENCODING VIDEO STREAMS FOR ADAPTIVE VIDEO STREAMING
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims benefit of United States patent application serial number 12/543,328, filed August 18, 2009, which is herein incorporated by reference. BACKGROUND OF THE INVENTION
Field of the Invention
[0002] The present invention relates generally to digital media and, more
specifically, to encoding video streams for adaptive video streaming.
Description of the Related Art
[0003] Conventional digital content distribution systems usually include a content server, a content player, and a communications network connecting the content server to the content player. The content server is configured to store digital content files corresponding to different content titles that can be downloaded from the content server to the content player. Each digital content file typically includes a video stream encoded to a particular playback bit rate as well as an audio stream. As is well- understood, a video stream encoded to a high playback bit rate is larger in size than a video stream encoded to a lower playback bit rate.
[0004] The content player is configured to download and play a digital content file corresponding to a specific content title in response to a user selecting the content title for playback. Downloading the digital content file typically involves a technique known in the art as "streaming," whereby the content server sequentially transmits the digital content file corresponding to the selected content title to the content player. The content player then plays the video stream and the audio stream included in the digital content file as portions of those streams become available. Prior to initiating the download of the digital content file, the content player may measure available bandwidth from the content server and select a digital content file having a video stream encoded to a bit rate that can be supported by the measured available bandwidth. To the extent the communications network can provide adequate bandwidth to download the selected digital content file, while satisfying bit rate requirements, playback of the downloaded digital content file proceeds satisfactorily. [0005] In practice, however, available bandwidth in the communications network constantly changes as different devices connected to the communications network perform independent tasks. To maximize playback quality in the face of changing bandwidth availability, an adaptive streaming technique may be implemented. In adaptive streaming, if the available bandwidth in the communications network increases, then the content player downloads a different content file corresponding to the selected content title that includes a video stream encoded to a higher playback bit rate. Similarly, if the available bandwidth in the communications network decreases, then the content player may download a different content file
corresponding to the selected content title that includes a video stream encoded to a lower playback bit rate.
[0006] When switching from downloading a current video stream to downloading a new video stream, the content player needs to match the video frame in the new video stream corresponding to the video frame in the current video stream being played at the time of the switch. To match video frames, the content player typically sequentially searches the new video stream to locate the video frame that matches the relevant video frame in the current video stream. One drawback to this approach is that the searching operation may be very time consuming, thereby causing an interruption in downloading the video stream that disrupts the viewing experience for the user.
[0007] As the foregoing illustrates, what is needed in the art is a video stream encoding mechanism that allows for switching between video streams that reduces the incidence of playback interruption relative to prior art techniques.
SUMMARY OF THE INVENTION
[0008] One embodiment of the present invention sets forth a method for encoding a video stream associated with a content title for adaptive video streaming. The method includes the steps of applying a video codec to the video stream at a specific playback bit rate to generate a sequence of groups of pictures (GOPs), wherein each GOP is associated with a playback time interval and a different playback offset and includes a key frame and one or more frames of video data, applying an advanced system format to the sequence of GOPs to generate one or more data packets that include the sequence of GOPs, generating a sequence header index for the sequence of GOPs that includes a first switch point corresponding to a first GOP in the sequence of GOPs, wherein the first switch point specifies the playback offset associated with the first GOP and a first data packet that includes a first key frame included in the first GOP, and combining the sequence header index with the one or more data packets to generate an encoded video stream.
[0009] One advantage of the disclosed method is that a content player can efficiently switch from one encoded video stream associated with a specific content title and having a specific playback bit rate to another encoded video stream associated with the same content title and having different playback bit rate by identifying the appropriate switch point in the sequence header index associated with the new encoded video stream.
BRIEF DESCRIPTION OF THE DRAWINGS
[0010] Figure 1 illustrates a content distribution system configured to implement one or more aspects of the invention;
[0011] Figure 2 is a more detailed illustration of the encoding server of Figure 1 , according to one embodiment of the invention;
[0012] Figure 3 is a conceptual diagram illustrating the different encoded stages of a video stream processed by the encoding server of Figure 2, according to one embodiment of the invention;
[0013] Figure 4 is more detailed illustration of the sequence header index of Figure 3, according to one embodiment of the invention;
[0014] Figure 5 is a flow diagram of method steps for encoding a video stream for adaptive video streaming, according to one embodiment of the invention; and
[0015] Figure 6 is a flow diagram of method steps for encoding and encrypting a video stream for adaptive video streaming, according to another embodiment of the invention.
DETAILED DESCRIPTION
[0016] In the following description, numerous specific details are set forth to provide a more thorough understanding of the present invention. However, it will be apparent to one of skill in the art that the present invention may be practiced without one or more of these specific details. In other instances, well-known features have not been described in order to avoid obscuring the present invention. [0017] Figure 1 illustrates a content distribution system 100 configured to implement one or more aspects of the invention. As shown, the content distribution system 100 includes an encoding server 102, a communications network 104, a content distribution network (CDN) 106 and a content player 108. [0018] The communications network 104 includes a plurality of network
communications systems, such as routers and switches, configured to facilitate data communication between the encoding server 102, the CDN 106 and the content player 108. Persons skilled in the art will recognize that many technically feasible techniques exist for building the communications network 104, including technologies practiced in deploying the well-known internet communications network.
[0019] The encoding server 102 is a computer system configured to encode video streams associated with digital content files for adaptive streaming. The encoding workflow for encoding the video streams for adaptive streaming is described in greater detail below with respect to Figures 2 and 3. The content distribution system 100 maybe include one or more encoding servers 102, where each encoding server 102 is configured to perform all the functions needed to encode the video streams or where each encoding server 102 is configured to perform a particular function needed to encode the video streams. The digital content files including the encoded video streams are retrieved by the CDN 106 via the communications network 104 for distribution to the content player 108.
[0020] The CDN 106 comprises one or more computer systems configured to serve download requests for digital content files from the content player 108. The digital content files may reside on a mass storage system accessible to the computer system. The mass storage system may include, without limitation, direct attached storage, network attached file storage, or network attached block-level storage. The digital content files may be formatted and stored on the mass storage system using any technically feasible technique. A data transfer protocol, such as the well-known hyper-text transfer protocol (HTTP), may be used to download digital content files from the content server 106 to the content player 108. [0021] The content player 108 may comprise a computer system, a set top box, a mobile device such as a mobile phone, or any other technically feasible computing platform that has network connectivity and is coupled to or includes a display device and speaker device for presenting video frames, and generating acoustic output, respectively. The content player 108 is configured for adaptive streaming, i.e., to download units of a video stream encoded to a specific playback bit rate, and switch to downloading subsequent units of a video stream encoded to a different playback bit rate based on prevailing bandwidth conditions within the communications network 104. As available bandwidth within the communications network 104 becomes limited, the content player 108 may select a video stream encoded to a lower playback bit rate. As available bandwidth increases, a video stream encoded to a higher playback bit rate may be selected. [0022] Although, in the above description, the content distribution system 100 is shown with one content player 108 and one CDNs 106, persons skilled in the art will recognize that the architecture of Figure 1 contemplates only an exemplary
embodiment of the invention. Other embodiments may include any number of content players 108 and/or CDNs 106. Thus, Figure 1 is in no way intended to limit the scope of the present invention in any way.
[0023] Figure 2 is a more detailed illustration of the encoding server 102 of Figure 1 , according to one embodiment of the invention. As shown, the encoding server 102 includes a central processing unit (CPU) 202, a system disk 204, an input/output (I/O) devices interface 206, a network interface 208, an interconnect 210 and a system memory 212.
[0024] The CPU 202 is configured to retrieve and execute programming
instructions stored in the system memory 212. Similarly, the CPU 202 is configured to store application data and retrieve application data from the system memory 212. The interconnect 210 is configured to facilitate transmission of data, such as programming instructions and application data, between the CPU 202, the system disk 204, I/O devices interface 206, the network interface 208, and the system memory 212. The I/O devices interface 206 is configured to receive input data from I/O devices 222 and transmit the input data to the CPU 202 via the interconnect 210. For example, I/O devices 222 may comprise one or more buttons, a keyboard, and a mouse or other pointing device. The I/O devices interface 206 is also configured to receive output data from the CPU 202 via the interconnect 210 and transmit the output data to the I/O devices 222. The system disk 204, such as a hard disk drive or flash memory storage drive or the like, is configured to store non-volatile data such as encoded video streams. The encoded video streams can then be retrieved by the CDN 106 via the communications network 104. The network interface 218 is coupled to the CPU 202 via the interconnect 210 and is configured to transmit and receive packets of data via the communications network 104. In one embodiment, the network interface 208 is configured to operate in compliance with the well-known Ethernet standard.
[0025] The system memory 212 includes software components that include instructions for encoding one or more video streams associated with a specific content title for adaptive streaming. As shown, these software components include a VC1 encoder 214, an advanced systems format (ASF) packaging tool 216, a padding tool 218 and a sequence header index (SHI) generator 220.
[0026] The VC1 encoder 214 executes encoding operations for encoding a video stream to a specific playback bit rate such that the encoded video stream complies with the VC1 video codec standard and is configured for adaptive streaming. In an alternative embodiment, the video stream can be encoded to comply with a different video codec standard such as MPEG or H.264. An encoded video stream generated by the VC1 encoder 214 includes a sequence of groups of pictures (GOPs), each GOP comprising multiple frames of video data. When encoding the video stream, the VC1 encoder 214 encodes the video stream according to three settings included in the VC1 video codec standard. First, the closed entry point setting is enabled to ensure that each GOP in the encoded video stream is independent of the other GOPs in the encoded video stream. Second, the sequence header output mode setting is enabled so that a key frame that includes a sequence header is inserted at the beginning of each GOP. The sequence header included in the key frame of a GOP specifies, among other information, a sequence header start code that can be used to locate the key frame within the encoded video stream and the resolution and aspect ratio of the frames of video data in the GOP. Third, the adaptive GOP setting is disabled to ensure that each GOP is associated with the same playback time interval and a different playback offset. The playback offset associated with a GOP is determined based on the location of the GOP in the sequence of GOPs included in the encoded video stream. For example, in an encoded video stream where each GOP has a playback time interval of three seconds, a first GOP in the encoded video stream would have a playback offset of zero seconds, a second GOP in the encoded video stream would have a playback offset of three seconds and so forth. Once encoded, the VC1 encoder 214 transmits the encoded video stream to the ASF packaging tool 216 for further processing.
[0027] The ASF packaging tool 216 packages the encoded video stream received from the VC1 encoder 214 into an advanced systems format (ASF) compliant encoded video stream, which can be downloaded and processed for playback by multiple types of standards-compliant content players, including content player 108. The ASF compliant encoded video stream includes a data object and an ASF header. The data object stores the GOPs in one or more data packets of the same size.
Since the size of the data packets may not match the size of the GOPs, a specific data packet may include frames of video data associated with two or more GOPs. The ASF header includes information associated with the ASF compliant encoded video stream, such as the size and the number of data packets, needed by a content player, such as the content player 108, to process the ASF compliant encoded video stream for playback.
[0028] The ASF compliant encoded video stream is then processed by the padding tool 218. The padding tool 218 inserts padding into the data object of the ASF compliant encoded video stream to ensure that the key frame associated with each GOP is located at the start of a different data packet within the data object. As described below, aligning key frames with different data packets allows the SHI generator 220 to define switch points for the ASF compliant encoded video stream, thus enabling content players to switch between multiple ASF compliant encoded video streams efficiently. The padding tool 218 then transmits the ASF compliant encoded video stream to the SHI generator 220. [0029] The SHI generator 220 generates a sequence header index associated with the ASF compliant encoded video stream. To generate the sequence header index, the SHI generator 220 first searches the data object of the ASF compliant encoded video stream for the key frames associated with the different GOPs included in the data object. The key frames can be located by the SHI generator 220 based on the sequence start codes specified in the sequence headers included in the key frames. For the GOP associated with each of the identified key frames, the SHI generator 220 defines a switch point within the sequence header index that stores (i) a data packet number that indentifies the data packet that includes the key frame associated with the GOP and (ii) the playback offset associated with the GOP. Again, the playback offset associated with the GOP is determined based on the location of the GOP in the sequence of GOPs included in the encoded video stream.
[0030] The encoding server 102 may generate multiple ASF compliant encoded video streams associated with the same content title and encoded to different playback bit rates in the manner described above. The encoding process described herein ensures that, across the different ASF compliant encoded video streams the GOPs are associated with the same playback time interval and that corresponding GOPs across the different ASF compliant encoded video streams are associated with the same playback offsets. Therefore, each switch point defined in a sequence header included in one of the ASF compliant encoded video stream associated with a specific content title has a corresponding switch point defined in a sequence header included in each of the other ASF compliant encoded video stream associated with the same content title. [0031] Based on the sequence header indices included in two ASF compliant encoded video streams associated with the same content title, a content player can efficiently switch between the ASF compliant encoded video streams by identifying the appropriate switch points in the sequence header indices. When switching between a currently playing ASF compliant encoded video stream and a new ASF compliant encoded video stream, a content player, such as the content player 108, searches the sequence header index included in the new ASF compliant encoded video stream to locate the particular switch point specifying the playback offset associated with the next GOP to be played. The content player can then switch to the new ASF compliant encoded video stream and download the GOP stored in the data packet specified at the particular switch point for playback. For example, for ASF compliant encoded video streams where each GOP were associated with a playback time interval of three seconds, if the first GOP associated with the playback offset of zero seconds were currently being played, then the next GOP to be played would be associated with the playback offset of three seconds. In such a scenario, the content player searches the sequence header associated with the new encoded stream for the particular switch point specifying a playback offset of three seconds. Once locating the particular switch point, the content player would download the GOP stored in the data packet specified in the switch point for playback. [0032] In one alternative embodiment, padding is not inserted into the data object of the encoded video stream, and therefore, the key frames of the different GOPs are not necessarily aligned with new data packets. In such an embodiment, the sequence header index specifies the data packet including a specific key frame, and the content player searches through the data packet for the key frame. Without padding, the size of the encoded video stream is reduced and, therefore, the encoded video stream can be downloaded faster by a content player.
[0033] In another alternative embodiment, the ASF packaging tool 216 ensures that the data packet size across multiple encoded video streams associated with the same content title are the same size. Because the ASF standard requires that the size of the data packets in a single encoded video stream are the same, ensuring that data packets across multiple encoded video stream have the same size allows content players to splice data packets of multiple encoded video streams into a single encoded video stream. [0034] Figure 3 is a conceptual diagram illustrating the different encoded stages of a video stream processed by the encoding server 102 of Figure 2, according to one embodiment of the invention. The video stream 302 is a mezzanine video stream associated with a specific content title as distributed by a video stream distributor once the rights to the specific content title are acquired. The video stream 302 comprises a series of sequential frames of video data, such as frame 304 and frame 306.
[0035] The video stream 302 is encoded by the VC1 encoder 214 to generate the encoded video stream 308. As previously described herein, the VC1 encoder 214 encodes the mezzanine video stream to a specific playback bit rate. The encoded video stream 308 is divided into multiple GOPs, such as GOP 318 and GOP 320.
Each GOP includes a key frame including a sequence header, such as key frame 310 in GOP 318 and key frame 314 in GOP 320. Further, each GOP within the encoded video stream 308 is associated with the same playback time interval and a different playback offset. For example, if the playback time interval is three seconds, then GOP 318 is associated with a playback offset of zero seconds, while GOP 320 is associated with a playback offset of six seconds.
[0036] The encoded video stream 308 is then processed by the ASF packaging tool 216 to generate an ASF compliant encoded video stream 322. As shown, the ASF compliant encoded video stream 322 includes an ASF header 324, a data object including same-sized data packets, such as data packet 1 and data packet 7, and an ASF index 326. Again, the ASF header 324 includes information associated with the ASF compliant encoded video stream 322, such as the size and the number of data packets. The ASF index 326 includes index information associated with the ASF compliant encoded video stream 322, and the data packets within the data object store the GOPs. As previously described herein, because the size of the data packets does not necessarily match the size of the GOPs, one GOP may be stored across different data packets. For example, as shown, GOP 318 is stored in data packet 1 , data packet 2 and partially in data packet 3.
[0037] The ASF compliant encoded video stream 322 is then processed by the padding tool 218. Again, the padding tool 218 inserts padding into the data object of the ASF compliant encoded video stream 322 to ensure that the key frame associated with each GOP is located at the start of a different data packet within the data object. For example, the padding tool 218 inserts padding 334 into data packet 3 after GOP 318 such that the key frame 316 of GOP 323 is aligned with a new data packet, i.e., data packet 4. Similarly, the padding tool 218 inserts padding 336 into data packet 5 after GOP 323 such that key frame 314 of GOP 320 is aligned with a new data packet, i.e., data packet 6.
[0038] Once the data object of the ASF compliant encoded video stream 322 is padded, the SHI generator 220 generates a sequence header index 338 associated with the ASF compliant encoded video stream 322. For the GOP associated with each of the identified key frames, the SHI generator 220 defines a switch point within the sequence header index 338 that stores (i) a data packet number that indentifies the data packet that includes the key frame associated with the GOP and (ii) the playback offset associated with the GOP. The sequence header index 338 is described in greater detail below in conjunction with Figure 4. Once generated, the SHI generator 220 inserts the sequence header index 338 into the ASF header 324 of the ASF compliant encoded video stream 322.
[0039] Figure 4 is more detailed illustration of the sequence header index of Figure 3, according to one embodiment of the invention. As shown, the sequence header index includes one or more switch points, such as switch point 408 and switch point 410, and each switch point includes an index portion 402, an offset portion 404, and a data packet portion 406. Each switch point is associated with a specific GOP starting at a particular playback offset specified in the offset portion 404 of the switch point, where the key frame of that GOP is located within a particular data packet specified in the data packet portion 406 of the switch point. For example, switch point 408 is associated with GOP 318, and the offset portion 404 of the switch point 408 indicates that the playback offset for the GOP 318 is zero seconds (i.e., the first GOP in the video stream) and the data packet portion 406 indicates that the key frame 310 of GOP 318 is located in data packet 1.
[0040] Figure 5 is a flow diagram of method steps for encoding a video stream for adaptive video streaming, according to one embodiment of the invention. Although the method steps are described in conjunction with the systems for Figures 1 -4, persons skilled in the art will understand that any system configured to perform the method steps, in any order, is within the scope of the invention.
[0041] The method 500 begins at step 502 where the VC1 encoder 214 executes encoding operations on a mezzanine video stream to generate an encoded video stream encoded to a specific play back bit rate. An encoded video stream generated by the VC1 encoder 214 includes a sequence of groups of pictures (GOPs), each GOP comprising multiple frames of video data and a key frame that includes a sequence header. Each GOP is associated with the same playback time interval and a different playback offset. Again, the playback offset associated with a GOP is determined based on the location of the GOP in the sequence of GOPs included in the encoded video stream.
[0042] At step 504, the ASF packaging tool 216 processes the encoded video stream to generate an ASF compliant encoded video stream. As previously described herein, the ASF compliant encoded video stream includes an ASF header, a data object including same-sized data packets and, optionally, an ASF index. The ASF header and ASF index store information related to the ASF compliant encoded video stream such as the size of the data packets and the indices of the data packets. The data object stores the GOPs of the encoded video stream in the data packets.
[0043] At step 506, the padding tool 218 inserts padding into the data object of the ASF compliant encoded video stream to ensure that the key frame associated with each GOP is located at the start of a different data packet within the data object. The padding tool 218 then transmits the ASF compliant encoded video stream to the SHI generator 220.
[0044] At step 508, the SHI generator 220 searches the data object of the ASF compliant encoded video stream for the key frames associated with the different GOPs included in the data object. The key frames can be located by the SHI generator 220 based on the sequence start codes specified in the sequence headers included in the key frames. At step 510, the SHI generator 220 generates a sequence header index associated with the ASF compliant encoded video stream based on the locations of the key frames. For the GOP associated with each of the identified key frames, the SHI generator 220 defines a switch point within a sequence header index that stores (i) a data packet number that indentifies the data packet that includes the key frame associated with the GOP and (ii) the playback offset associated with the GOP. At step 512, the SHI generator 220 inserts the sequence header index into the ASF header of the ASF compliant encoded video stream. [0045] In an alternative embodiment, a video stream being processed by the encoding server 102 may be encrypted using a digital rights management (DRM) encryption technique during the encoding process. In DRM implementations, because the sequence header start codes identifying the key frames of the GOPs in an encoded video stream are also encrypted, if the technique described above were employed, the SHI generator 220 would end up searching for the key frames based on the sequence header start codes post-encryption and, thus, not be able to generate a sequence header index associated with the encoded video stream. To address this nuance of DRM implementations, the technique described below in conjunction with Figure 6 can be used as an alternative to the technique described above.
[0046] Figure 6 is a flow diagram of method steps for encoding and encrypting a video stream for adaptive video streaming, according to another embodiment of the invention. Although the method steps are described in conjunction with the systems for Figures 1 -4, persons skilled in the art will understand that any system configured to perform the method steps, in any order, is within the scope of the invention.
[0047] The method 600 begins at step 602, where the VC1 encoder 214 executes encoding operations on a mezzanine video stream to generate an encoded video stream encoded to a specific play back bit rate. An encoded video stream generated by the VC1 encoder 214 includes a sequence of groups of pictures (GOPs), each GOP comprising multiple frames of video data and a key frame that includes a sequence header. Each GOP is associated with the same playback time interval and a different playback offset.
[0048] At step 604, the ASF packaging tool 216 processes the encoded video stream to generate an ASF compliant encoded video stream. As previously described herein, the ASF compliant encoded video stream includes an ASF header, a data object including same-sized data packets and, optionally, an ASF index. The data object stores the GOPs of the encoded video stream in the data packets.
[0049] At step 606, the SHI generator 220 searches the data object of the ASF compliant encoded video stream for the key frames associated with the different GOPs included in the data object. The key frames can be located by the SHI generator 220 based on the sequence start codes specified in the sequence headers included in the key frames. At step 608, the SHI generator 220 generates a sequence header index associated with the ASF compliant encoded video stream based on the locations of the key frames. For the GOP associated with each of the identified key frames, the SHI generator 220 defines a switch point within a sequence header index that stores (i) a data packet number that indentifies the data packet that includes the key frame associated with the GOP and (ii) the playback offset associated with the GOP. At step 610, the SHI generator 220 inserts the sequence header index into the ASF header of the ASF compliant encoded video stream.
[0050] At step 612, the encoding server 102 encrypts the ASF compliant encoded video stream using a DRM encryption technique, such as PlayReady DRM or Windows Media DRM (WMDRM). As is well-known, encrypting a video stream using a DRM encryption technique may change the size of the frames of video data stored in the each GOP. Thus, the locations of the key frames within the ASF compliant encoded video stream may change post-encryption.
[0051] At step 614, the SHI generator 220 locates each key frame in the ASF compliant encoded video stream based on the corresponding playback offset stored in the sequence header index. Again, during encryption, the location of a key frame may change, but the playback offset associated with the GOP including the key frame does not change, thereby allowing the SHI generator 220 to locate accurately the key frame based on the playback offset. [0052] At step 616, the padding tool 218 inserts padding into the data object of the ASF compliant encoded video stream to ensure that the key frame associated with each GOP is located at the start of a different data packet within the data object. At step 616, the SHI generator 220 modifies the sequence header index stored in the ASF header of the ASF compliant encoded video stream based on the padding inserted into the data object of the encrypted ASF compliant encoded video stream. Specifically, the SHI generator 220 modifies the data packet identifiers stored in the sequence header index to specify the data packet storing the key frame.
[0053] In this fashion, the SHI generator 220 is able to generate the sequence header index associated with the ASF compliant encoded video stream before DRM encryption. Because the playback offsets associated with the GOPs remain the same during encryption, the SHI generator 220 is able to modify the sequence header index based on the new locations of the key frames included in the GOPs post-encryption.
As a result, a content player can efficiently switch between encrypted ASF compliant encoded video streams associated with the same content title by identifying the appropriate switch points in the sequence header indices included in encrypted ASF compliant encoded video streams.
[0054] In another alternative embodiment, when encrypting a video stream using WMDRM encryption, the encoding technique set forth in Figure 5 may be
implemented. Once the sequence header index associated with the ASF compliant encoded video stream is generated, the ASF compliant encoded video stream can be encrypted using WMDRM encryption. Because the WMDRM encryption technique does not change the locations of the key frames in the encrypted video stream, the sequence header index does not need to be re-adjusted after WMDRM encryption. As persons skilled in the art will recognize, the technique of Figure 6 may also be used in WMDRM implementations.
[0055] In sum, an encoding server encodes a video stream associated with a content title to identify switch points that are specified in a sequence header index included in the encoded video stream. The switch points of two or more video streams corresponding to the same content title and encoded to different playback bit rates occur at the same playback time intervals across each of the two or more video streams. [0056] When encoding a particular video stream, the VC1 encoder within the encoding server first processes the video stream to generate an encoded video stream that is divided into one or more groups of pictures (GOPs) of video data. Each GOP includes a sequence header followed by multiple frames of video data. The sequence header specifies the resolution and the aspect ratio of the frames of video data, and the frames of video data within the GOP are associated with a particular playback time interval starting at a specific playback offset.
[0057] Once the encoded video stream is generated, the ASF packaging tool within the encoding server packages the encoded video stream into an ASF compliant encoded video stream. The ASF compliant encoded video stream includes an ASF header and a data object. The ASF header includes information associated with the encoded video stream, such as the size and the number of data packets, needed by a content player to process the encoded video stream for playback. The data object stores the GOPs in one or more data packets. [0058] The ASF packaging tool transmits the ASF compliant encoded video stream to the padding tool within the encoding server. The padding tool inserts padding into the data object of the ASF compliant encoded video stream to align the sequence header of each GOP with a new data packet within the data object. Once the padding is inserted into the data object, the sequence header index (SHI) generator within the encoding server generates an SHI associated with the ASF compliant encoded video stream. For each GOP in the ASF compliant encoded video stream, the SHI specifies the data packet including the sequence header of the GOP and the playback offset corresponding to the GOP. The SHI generator then inserts the SHI into the ASF header of the ASF compliant encoded video stream. [0059] When encoding two or more video streams associated with the same content title, encoding server 102 generates two or more ASF compliant encoded video streams encoded to different playback bit rates in the manner described above. Importantly, across the two or more ASF compliant encoded video streams, corresponding GOPs are associated with the same time interval and the same playback offsets. Therefore, each switch point defined in a sequence header included in one ASF compliant encoded video stream associated with a specific content title has a corresponding switch point defined in a sequence header included in a different ASF compliant encoded video stream associated with the same content title. [0060] One advantage of the disclosed technique is that a content player can efficiently switch from one encoded video stream associated with a specific content title and having a specific playback bit rate to another encoded video stream associated with the same content title and having different playback bit rate by identifying the appropriate switch point in the sequence header index associated with the new encoded video stream. Because the content player does not have to search for the appropriate frame of video data included in the encoded video stream for playback, the incidence of playback interruption when switching between encoded video streams is reduced. Another advantage of the disclosed technique is that the encoded video streams generated by the encoding server are ASF compliant and, therefore, can be downloaded and processed for playback by any standards- compliant content player.
[0061] While the foregoing is directed to embodiments of the present invention, other and further embodiments of the present invention may be devised without departing from the basic scope thereof. For example, aspects of the present invention may be implemented in hardware or software or in a combination of hardware and software. One embodiment of the present invention may be
implemented as a program product for use with a computer system. The program(s) of the program product define functions of the embodiments (including the methods described herein) and can be contained on a variety of computer-readable storage media. Illustrative computer-readable storage media include, but are not limited to: (i) non-writable storage media (e.g., read-only memory devices within a computer such as CD-ROM disks readable by a CD-ROM drive, flash memory, ROM chips or any type of solid-state non-volatile semiconductor memory) on which information is permanently stored; and (ii) writable storage media (e.g., floppy disks within a diskette drive or hard-disk drive or any type of solid-state random-access semiconductor memory) on which alterable information is stored. Such computer-readable storage media, when carrying computer-readable instructions that direct the functions of the present invention, are embodiments of the present invention. [0062] In view of the foregoing, the scope of the present invention is determined by the claims that follow.

Claims

WHAT IS CLAIMED IS:
1. A computer-implemented method for encoding a video stream associated with a content title for adaptive video streaming, the method comprising:
applying a video codec to the video stream at a specific playback bit rate to generate a sequence of groups of pictures (GOPs), wherein each GOP is associated with a playback time interval and a different playback offset and includes a key frame and one or more frames of video data; applying an advanced system format to the sequence of GOPs to generate one or more data packets that include the sequence of GOPs;
generating a sequence header index for the sequence of GOPs that includes a first switch point corresponding to a first GOP in the sequence of GOPs, wherein the first switch point specifies the playback offset associated with the first GOP and a first data packet that includes a first key frame included in the first GOP; and
combining the sequence header index with the one or more data packets to generate an encoded video stream.
2. The method of claim 1 , wherein the first key frame includes a sequence header start code and a sequence header that stores information associated with the first GOP.
3. The method of claim 2, wherein the step of generating the sequence header index comprises searching the one or more data packets for the sequence header start code included in the first key frame to identify the first data packet.
4. The method of claim 1 , further comprising the step of padding the one or more data packets to align each key frame included in the sequence of GOPs with a different data packet.
5. The method of claim 1 , further comprising the step of encrypting the one or more data packets based on a digital rights management (DRM) encryption technique to generate one or more encrypted data packets.
6. The method of claim 5, wherein the DRM encryption technique comprises a Windows Media DRM encryption technique.
7. The method of claim 5, further comprising the steps of:
based on the playback offset specified in the first switch point, determining that a first encrypted data packet stores the first key frame, wherein the first data packet and the first encrypted data packet are different data packets; and
modifying the first switch point included in the sequence header index to
specify that the first encrypted data packet stores the first key frame.
8. The method of claim 7, further comprising the step of padding the one or more encrypted data packets to align each key frame included in the sequence of GOPs with a different data packet.
9. The method of claim 1 , wherein each GOP in the sequence of GOPs is independent of the other GOPs in the sequence of GOPs.
10. The method of claim 1 , further comprising the steps of:
applying the video codec to a second video stream at a second playback bit rate to generate a second sequence of groups of pictures (GOPs), wherein each GOP is associated with the playback time interval and a different playback offset and includes a key frame and one or more frames of video data;
applying the advanced system format to the second sequence of GOPs to
generate one or more other data packets that include the second sequence of GOPs;
generating a second sequence header index for the second sequence of
GOPs that includes a second switch point corresponding to a second
GOP in the second sequence of GOPs, wherein the second switch point specifies the playback offset associated with the second GOP and a second data packet included in the one or more other data packets, and wherein the second data packet includes a second key frame included in the second GOP; and
combining the second sequence header index with the one or more other data packets to generate a second encoded video stream, wherein the playback offset associated with the second GOP is equal to the playback offset associated with the first GOP, and wherein the second switch point corresponds to the first switch point.
11. A computer-readable medium for storing instructions that, when executed by a processor, cause the processor to encode a video stream associated with a content title for adaptive video streaming, by performing the steps of:
applying a video codec to the video stream at a specific playback bit rate to generate a sequence of groups of pictures (GOPs), wherein each GOP is associated with a playback time interval and a different playback offset and includes a key frame and one or more frames of video data; applying an advanced system format to the sequence of GOPs to generate one or more data packets that include the sequence of GOPs;
generating a sequence header index for the sequence of GOPs that includes a first switch point corresponding to a first GOP in the sequence of GOPs, wherein the first switch point specifies the playback offset associated with the first GOP and a first data packet that includes a first key frame included in the first GOP; and
combining the sequence header index with the one or more data packets to generate an encoded video stream.
12. The computer-readable medium of claim 11 , wherein the first key frame includes a sequence header start code and a sequence header that stores
information associated with the first GOP.
13. The computer-readable medium of claim 12, wherein the step of generating the sequence header index comprises searching the one or more data packets for the sequence header start code included in the first key frame to identify the first data packet.
14. The computer-readable medium of claim 11 , further comprising the step of padding the one or more data packets to align each key frame included in the sequence of GOPs with a different data packet.
15. The computer-readable medium of claim 11 , further comprising the step of encrypting the one or more data packets based on a digital rights management (DRM) encryption technique to generate one or more encrypted data packets.
16. The computer-readable medium of claim 15, wherein the DRM encryption technique comprises a Windows Media DRM encryption technique.
17. The computer-readable medium of claim 15, further comprising the steps of: based on the playback offset specified in the first switch point, determining that a first encrypted data packet stores the first key frame, wherein the first data packet and the first encrypted data packet are different data packets; and
modifying the first switch point included in the sequence header index to
specify that the first encrypted data packet stores the first key frame.
18. The computer-readable medium of claim 17, further comprising the step of padding the one or more encrypted data packets to align each key frame included in the sequence of GOPs with a different data packet.
19. The computer-readable medium of claim 11 , wherein each GOP in the sequence of GOPs is independent of the other GOPs in the sequence of GOPs.
20. The computer-readable medium of claim 11 , further comprising the steps of: applying the video codec to a second video stream at a second playback bit rate to generate a second sequence of groups of pictures (GOPs), wherein each GOP is associated with the playback time interval and a different playback offset and includes a key frame and one or more frames of video data;
applying the advanced system format to the second sequence of GOPs to
generate one or more other data packets that include the second sequence of GOPs;
generating a second sequence header index for the second sequence of
GOPs that includes a second switch point corresponding to a second
GOP in the second sequence of GOPs, wherein the second switch point specifies the playback offset associated with the second GOP and a second data packet included in the one or more other data packets, and wherein the second data packet includes a second key frame included in the second GOP; and
combining the second sequence header index with the one or more other data packets to generate a second encoded video stream,
wherein the playback offset associated with the second GOP is equal to the playback offset associated with the first GOP, and wherein the second switch point corresponds to the first switch point.
21. A computer system, comprising:
a processor; and
a memory storing instructions that when executed by the processor are
configured to:
apply a video codec to a video stream at a specific playback bit rate to generate a sequence of groups of pictures (GOPs), wherein each GOP is associated with a playback time interval and a different playback offset and includes a key frame and one or more frames of video data,
apply an advanced system format to the sequence of GOPs to generate one or more data packets that include the sequence of GOPs, generate a sequence header index for the sequence of GOPs that
includes a first switch point corresponding to a first GOP in the sequence of GOPs, wherein the first switch point specifies the playback offset associated with the first GOP and a first data packet that includes a first key frame included in the first GOP, and
combine the sequence header index with the one or more data packets to generate an encoded video stream.
PCT/US2010/045805 2009-08-18 2010-08-17 Encoding video streams for adaptive video streaming WO2011022432A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
MX2012002087A MX2012002087A (en) 2009-08-18 2010-08-17 Encoding video streams for adaptive video streaming.
EP10810513.1A EP2467956B1 (en) 2009-08-18 2010-08-17 Encoding video streams for adaptive video streaming
IN2232DEN2012 IN2012DN02232A (en) 2009-08-18 2010-08-17
JP2012525650A JP5499314B2 (en) 2009-08-18 2010-08-17 Video stream encoding for adaptive video streaming
CA2771187A CA2771187C (en) 2009-08-18 2010-08-17 Encoding video streams for adaptive video streaming
BR112012003843-5A BR112012003843B1 (en) 2009-08-18 2010-08-17 COMPUTER IMPLEMENTED METHOD TO ENCODE A VIDEO FLOW, NON TRANSIENT, COMPUTER-READABLE MEDIUM AND COMPUTER SYSTEM

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/543,328 2009-08-18
US12/543,328 US8355433B2 (en) 2009-08-18 2009-08-18 Encoding video streams for adaptive video streaming

Publications (1)

Publication Number Publication Date
WO2011022432A1 true WO2011022432A1 (en) 2011-02-24

Family

ID=43607311

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2010/045805 WO2011022432A1 (en) 2009-08-18 2010-08-17 Encoding video streams for adaptive video streaming

Country Status (10)

Country Link
US (1) US8355433B2 (en)
EP (1) EP2467956B1 (en)
JP (1) JP5499314B2 (en)
BR (1) BR112012003843B1 (en)
CA (1) CA2771187C (en)
CL (1) CL2012000416A1 (en)
CO (1) CO6612207A2 (en)
IN (1) IN2012DN02232A (en)
MX (1) MX2012002087A (en)
WO (1) WO2011022432A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013033335A1 (en) * 2011-08-30 2013-03-07 Divx, Llc Selection of resolutions for seamless resolution switching of multimedia content
CN103516731A (en) * 2012-06-15 2014-01-15 华为技术有限公司 Cache server service method, cache server, and system
US8935425B2 (en) 2011-10-05 2015-01-13 Qualcomm Incorporated Switching between representations during network streaming of coded multimedia data

Families Citing this family (96)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6307487B1 (en) 1998-09-23 2001-10-23 Digital Fountain, Inc. Information additive code generator and decoder for communication systems
US7068729B2 (en) 2001-12-21 2006-06-27 Digital Fountain, Inc. Multi-stage code generator and decoder for communication systems
US9240810B2 (en) 2002-06-11 2016-01-19 Digital Fountain, Inc. Systems and processes for decoding chain reaction codes through inactivation
JP4546246B2 (en) 2002-10-05 2010-09-15 デジタル ファウンテン, インコーポレイテッド Systematic encoding and decryption of chained encryption reactions
CN1954501B (en) 2003-10-06 2010-06-16 数字方敦股份有限公司 Method for receving data transmitted from a source by communication channel
US7519274B2 (en) 2003-12-08 2009-04-14 Divx, Inc. File format for multiple track digital data
US8472792B2 (en) 2003-12-08 2013-06-25 Divx, Llc Multimedia distribution system
CN103124182B (en) 2004-05-07 2017-05-10 数字方敦股份有限公司 File download and streaming system
US7721184B2 (en) * 2004-08-11 2010-05-18 Digital Fountain, Inc. Method and apparatus for fast encoding of data symbols according to half-weight codes
CN101686107B (en) * 2006-02-13 2014-08-13 数字方敦股份有限公司 Streaming and buffering using variable FEC overhead and protection periods
US9270414B2 (en) 2006-02-21 2016-02-23 Digital Fountain, Inc. Multiple-field based code generator and decoder for communications systems
US7515710B2 (en) 2006-03-14 2009-04-07 Divx, Inc. Federated digital rights management scheme including trusted systems
WO2007134196A2 (en) 2006-05-10 2007-11-22 Digital Fountain, Inc. Code generator and decoder using hybrid codes
US9178535B2 (en) 2006-06-09 2015-11-03 Digital Fountain, Inc. Dynamic stream interleaving and sub-stream based delivery
US9209934B2 (en) 2006-06-09 2015-12-08 Qualcomm Incorporated Enhanced block-request streaming using cooperative parallel HTTP and forward error correction
US9419749B2 (en) 2009-08-19 2016-08-16 Qualcomm Incorporated Methods and apparatus employing FEC codes with permanent inactivation of symbols for encoding and decoding processes
US9386064B2 (en) 2006-06-09 2016-07-05 Qualcomm Incorporated Enhanced block-request streaming using URL templates and construction rules
US9432433B2 (en) 2006-06-09 2016-08-30 Qualcomm Incorporated Enhanced block-request streaming system using signaling or block creation
US9380096B2 (en) 2006-06-09 2016-06-28 Qualcomm Incorporated Enhanced block-request streaming system for handling low-latency streaming
EP2203836A4 (en) 2007-09-12 2014-11-05 Digital Fountain Inc Generating and communicating source identification information to enable reliable communications
US8268107B2 (en) 2007-09-21 2012-09-18 The Boeing Company Fly away caul plate
KR20100106327A (en) 2007-11-16 2010-10-01 디브이엑스, 인크. Hierarchical and reduced index structures for multimedia files
US8997161B2 (en) 2008-01-02 2015-03-31 Sonic Ip, Inc. Application enhancement tracks
WO2010080911A1 (en) 2009-01-07 2010-07-15 Divx, Inc. Singular, collective and automated creation of a media guide for online content
US9281847B2 (en) 2009-02-27 2016-03-08 Qualcomm Incorporated Mobile reception of digital video broadcasting—terrestrial services
US9288010B2 (en) 2009-08-19 2016-03-15 Qualcomm Incorporated Universal file delivery methods for providing unequal error protection and bundled file delivery services
US9917874B2 (en) 2009-09-22 2018-03-13 Qualcomm Incorporated Enhanced block-request streaming using block partitioning or request controls for improved client-side handling
WO2011068668A1 (en) 2009-12-04 2011-06-09 Divx, Llc Elementary bitstream cryptographic material transport systems and methods
US9237178B2 (en) * 2010-02-03 2016-01-12 Futurewei Technologies, Inc. Combined binary string for signaling byte range of media fragments in adaptive streaming
JP5824465B2 (en) * 2010-02-19 2015-11-25 テレフオンアクチーボラゲット エル エム エリクソン(パブル) Method and apparatus for adaptation in HTTP streaming
US9485546B2 (en) 2010-06-29 2016-11-01 Qualcomm Incorporated Signaling video samples for trick mode video representations
US8918533B2 (en) 2010-07-13 2014-12-23 Qualcomm Incorporated Video switching for streaming video data
US9185439B2 (en) 2010-07-15 2015-11-10 Qualcomm Incorporated Signaling data for multiplexing video components
US9596447B2 (en) 2010-07-21 2017-03-14 Qualcomm Incorporated Providing frame packing type information for video coding
US8806050B2 (en) 2010-08-10 2014-08-12 Qualcomm Incorporated Manifest file updates for network streaming of coded multimedia data
US8925026B2 (en) * 2010-09-29 2014-12-30 Verizon Patent And Licensing Inc. Back office support for a video provisioning system
US10298897B2 (en) * 2010-12-30 2019-05-21 Interdigital Madison Patent Holdings Method of processing a video content allowing the adaptation to several types of display devices
US8914534B2 (en) 2011-01-05 2014-12-16 Sonic Ip, Inc. Systems and methods for adaptive bitrate streaming of media stored in matroska container files using hypertext transfer protocol
US8958375B2 (en) 2011-02-11 2015-02-17 Qualcomm Incorporated Framing for an improved radio link protocol including FEC
US9270299B2 (en) 2011-02-11 2016-02-23 Qualcomm Incorporated Encoding and decoding using elastic codes with flexible source block mapping
US9363522B2 (en) * 2011-04-28 2016-06-07 Warner Bros. Entertainment, Inc. Region-of-interest encoding enhancements for variable-bitrate mezzanine compression
KR101840008B1 (en) * 2011-06-24 2018-05-04 에스케이플래닛 주식회사 High quality video streaming service system and method
WO2013028565A1 (en) * 2011-08-19 2013-02-28 General Instrument Corporation Encoder-aided segmentation for adaptive streaming
US8818171B2 (en) 2011-08-30 2014-08-26 Kourosh Soroushian Systems and methods for encoding alternative streams of video for playback on playback devices having predetermined display aspect ratios and network connection maximum data rates
US9955195B2 (en) 2011-08-30 2018-04-24 Divx, Llc Systems and methods for encoding and streaming video encoded using a plurality of maximum bitrate levels
US9253233B2 (en) * 2011-08-31 2016-02-02 Qualcomm Incorporated Switch signaling methods providing improved switching between representations for adaptive HTTP streaming
US8964977B2 (en) 2011-09-01 2015-02-24 Sonic Ip, Inc. Systems and methods for saving encoded media streamed using adaptive bitrate streaming
US8909922B2 (en) 2011-09-01 2014-12-09 Sonic Ip, Inc. Systems and methods for playing back alternative streams of protected content protected using common cryptographic information
US9843844B2 (en) 2011-10-05 2017-12-12 Qualcomm Incorporated Network streaming of media data
WO2013070802A1 (en) * 2011-11-07 2013-05-16 Finitiv Corporation System and method for indexing and annotation of video content
US20130179199A1 (en) 2012-01-06 2013-07-11 Rovi Corp. Systems and methods for granting access to digital content using electronic tickets and ticket tokens
US9166864B1 (en) * 2012-01-18 2015-10-20 Google Inc. Adaptive streaming for legacy media frameworks
US20150026711A1 (en) * 2012-02-27 2015-01-22 Telefonaktiebolaget L M Ericsson (Publ) Method and apparatus for video content distribution
US9294226B2 (en) 2012-03-26 2016-03-22 Qualcomm Incorporated Universal object delivery and template-based file delivery
US8826429B2 (en) 2012-04-02 2014-09-02 The Boeing Company Information security management
US9532080B2 (en) 2012-05-31 2016-12-27 Sonic Ip, Inc. Systems and methods for the reuse of encoding information in encoding alternative streams of video data
US9354799B2 (en) * 2012-06-13 2016-05-31 Sonic Ip, Inc. Systems and methods for adaptive streaming systems with interactive video timelines
US9197685B2 (en) 2012-06-28 2015-11-24 Sonic Ip, Inc. Systems and methods for fast video startup using trick play streams
US9143812B2 (en) 2012-06-29 2015-09-22 Sonic Ip, Inc. Adaptive streaming of multimedia
US10452715B2 (en) 2012-06-30 2019-10-22 Divx, Llc Systems and methods for compressing geotagged video
EP2875417B1 (en) 2012-07-18 2020-01-01 Verimatrix, Inc. Systems and methods for rapid content switching to provide a linear tv experience using streaming content distribution
US8997254B2 (en) 2012-09-28 2015-03-31 Sonic Ip, Inc. Systems and methods for fast startup streaming of encrypted multimedia content
US8914836B2 (en) 2012-09-28 2014-12-16 Sonic Ip, Inc. Systems, methods, and computer program products for load adaptive streaming
US9727321B2 (en) * 2012-10-11 2017-08-08 Netflix, Inc. System and method for managing playback of streaming digital content
US10708335B2 (en) * 2012-11-16 2020-07-07 Time Warner Cable Enterprises Llc Situation-dependent dynamic bit rate encoding and distribution of content
US9813325B2 (en) 2012-12-27 2017-11-07 Comcast Cable Communications, Llc Information stream management
US9313510B2 (en) 2012-12-31 2016-04-12 Sonic Ip, Inc. Use of objective quality measures of streamed content to reduce streaming bandwidth
US9264475B2 (en) 2012-12-31 2016-02-16 Sonic Ip, Inc. Use of objective quality measures of streamed content to reduce streaming bandwidth
US9191457B2 (en) 2012-12-31 2015-11-17 Sonic Ip, Inc. Systems, methods, and media for controlling delivery of content
WO2014113710A1 (en) * 2013-01-18 2014-07-24 Huawei Technologies. Co., Ltd Method and apparatus for performing adaptive streaming on media contents
US9350990B2 (en) 2013-02-28 2016-05-24 Sonic Ip, Inc. Systems and methods of encoding multiple video streams with adaptive quantization for adaptive bitrate streaming
US9357210B2 (en) 2013-02-28 2016-05-31 Sonic Ip, Inc. Systems and methods of encoding multiple video streams for adaptive bitrate streaming
US9906785B2 (en) 2013-03-15 2018-02-27 Sonic Ip, Inc. Systems, methods, and media for transcoding video data according to encoding parameters indicated by received metadata
US10397292B2 (en) 2013-03-15 2019-08-27 Divx, Llc Systems, methods, and media for delivery of content
US9344517B2 (en) 2013-03-28 2016-05-17 Sonic Ip, Inc. Downloading and adaptive streaming of multimedia content to a device with cache assist
US9094737B2 (en) 2013-05-30 2015-07-28 Sonic Ip, Inc. Network video streaming with trick play based on separate trick play files
US9247317B2 (en) 2013-05-30 2016-01-26 Sonic Ip, Inc. Content streaming with client device trick play index
EP3697100A1 (en) * 2013-06-05 2020-08-19 Sun Patent Trust Data decoding method, data decoding apparatus, and data transmitting method
US9967305B2 (en) 2013-06-28 2018-05-08 Divx, Llc Systems, methods, and media for streaming media content
US9538171B2 (en) * 2013-07-23 2017-01-03 Intel Corporation Techniques for streaming video quality analysis
ITMI20131710A1 (en) * 2013-10-15 2015-04-16 Sky Italia S R L "ENCODING CLOUD SYSTEM"
US9343112B2 (en) 2013-10-31 2016-05-17 Sonic Ip, Inc. Systems and methods for supplementing content from a server
US20150189365A1 (en) * 2013-12-26 2015-07-02 Thomson Licensing Method and apparatus for generating a recording index
CN104869103B (en) * 2014-02-24 2018-05-18 华为终端(东莞)有限公司 Search method, terminal device and the server of multimedia file
US9866878B2 (en) 2014-04-05 2018-01-09 Sonic Ip, Inc. Systems and methods for encoding and playing back video at different frame rates using enhancement layers
US10804958B2 (en) * 2015-02-24 2020-10-13 Comcast Cable Communications, Llc Multi-bitrate video with dynamic blocks
US10499070B2 (en) * 2015-09-11 2019-12-03 Facebook, Inc. Key frame placement for distributed video encoding
US9426543B1 (en) * 2015-12-18 2016-08-23 Vuclip (Singapore) Pte. Ltd. Server-based video stitching
US10075292B2 (en) 2016-03-30 2018-09-11 Divx, Llc Systems and methods for quick start-up of playback
US10148989B2 (en) 2016-06-15 2018-12-04 Divx, Llc Systems and methods for encoding video content
US10498795B2 (en) 2017-02-17 2019-12-03 Divx, Llc Systems and methods for adaptive switching between multiple content delivery networks during adaptive bitrate streaming
US10873775B2 (en) * 2017-06-12 2020-12-22 Netflix, Inc. Staggered key frame video encoding
CN109788372B (en) * 2019-01-24 2021-06-08 维沃移动通信有限公司 Streaming media playing method and related device
US11509949B2 (en) * 2019-09-13 2022-11-22 Disney Enterprises, Inc. Packager for segmenter fluidity
US11128688B2 (en) * 2019-10-16 2021-09-21 Disney Enterprises, Inc. Transcoder conditioning for segment fluidity
CN115119009B (en) * 2022-06-29 2023-09-01 北京奇艺世纪科技有限公司 Video alignment method, video encoding device and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6453114B2 (en) * 1997-02-18 2002-09-17 Thomson Licensing Sa Random picture decoding
US20030067872A1 (en) * 2001-09-17 2003-04-10 Pulsent Corporation Flow control method for quality streaming of audio/video/media over packet networks
US20050123274A1 (en) * 2003-09-07 2005-06-09 Microsoft Corporation Signaling coding and display options in entry point headers
US7263129B2 (en) * 2002-08-29 2007-08-28 Sony Corporation Predictive encoding and data decoding control
US20080259799A1 (en) * 2007-04-20 2008-10-23 Van Beek Petrus J L Packet Scheduling with Quality-Aware Frame Dropping for Video Streaming
US20090083279A1 (en) * 2007-09-26 2009-03-26 Hasek Charles A Methods and apparatus for content caching in a video network

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5623424A (en) * 1995-05-08 1997-04-22 Kabushiki Kaisha Toshiba Rate-controlled digital video editing method and system which controls bit allocation of a video encoder by varying quantization levels
JP3060919B2 (en) * 1995-11-16 2000-07-10 松下電器産業株式会社 Compressed video decoding / display device and simplified compressed video editing device
US7302490B1 (en) * 2000-05-03 2007-11-27 Microsoft Corporation Media file format to support switching between multiple timeline-altered media streams
US7519274B2 (en) * 2003-12-08 2009-04-14 Divx, Inc. File format for multiple track digital data
US7818444B2 (en) 2004-04-30 2010-10-19 Move Networks, Inc. Apparatus, system, and method for multi-bitrate content streaming
US8543720B2 (en) * 2007-12-05 2013-09-24 Google Inc. Dynamic bit rate scaling

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6453114B2 (en) * 1997-02-18 2002-09-17 Thomson Licensing Sa Random picture decoding
US20030067872A1 (en) * 2001-09-17 2003-04-10 Pulsent Corporation Flow control method for quality streaming of audio/video/media over packet networks
US7263129B2 (en) * 2002-08-29 2007-08-28 Sony Corporation Predictive encoding and data decoding control
US20050123274A1 (en) * 2003-09-07 2005-06-09 Microsoft Corporation Signaling coding and display options in entry point headers
US20080259799A1 (en) * 2007-04-20 2008-10-23 Van Beek Petrus J L Packet Scheduling with Quality-Aware Frame Dropping for Video Streaming
US20090083279A1 (en) * 2007-09-26 2009-03-26 Hasek Charles A Methods and apparatus for content caching in a video network

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013033335A1 (en) * 2011-08-30 2013-03-07 Divx, Llc Selection of resolutions for seamless resolution switching of multimedia content
US8935425B2 (en) 2011-10-05 2015-01-13 Qualcomm Incorporated Switching between representations during network streaming of coded multimedia data
JP2016154348A (en) * 2011-10-05 2016-08-25 クゥアルコム・インコーポレイテッドQualcomm Incorporated Switching between representations during network streaming of coded multimedia data
CN103516731A (en) * 2012-06-15 2014-01-15 华为技术有限公司 Cache server service method, cache server, and system
CN103516731B (en) * 2012-06-15 2017-04-19 华为技术有限公司 Cache server service method, cache server, and system

Also Published As

Publication number Publication date
EP2467956A1 (en) 2012-06-27
JP2013502836A (en) 2013-01-24
CL2012000416A1 (en) 2012-08-24
BR112012003843B1 (en) 2021-07-27
US20110268178A1 (en) 2011-11-03
EP2467956B1 (en) 2015-09-30
CA2771187C (en) 2017-02-28
US8355433B2 (en) 2013-01-15
JP5499314B2 (en) 2014-05-21
EP2467956A4 (en) 2013-02-13
CA2771187A1 (en) 2011-02-24
BR112012003843A2 (en) 2016-03-22
IN2012DN02232A (en) 2015-08-21
MX2012002087A (en) 2012-06-12
CO6612207A2 (en) 2013-02-01

Similar Documents

Publication Publication Date Title
CA2771187C (en) Encoding video streams for adaptive video streaming
US10123059B2 (en) Fast start of streaming digital media playback with deferred license retrieval
JP7100175B2 (en) Transmission method and transmission device
US8954596B2 (en) Dynamic virtual chunking of streaming media content
US8682139B2 (en) L-cut stream startup
EP3639516B1 (en) Staggered key frame video encoding
WO2009149364A2 (en) Methods and systems for use in providing playback of variable length content in a fixed length framework
CN103491430A (en) Streaming media data processing method and electronic device
US10284529B2 (en) Information processing apparatus and information processing method
CN110636368B (en) Media playing method, system, device and storage medium
US10484725B2 (en) Information processing apparatus and information processing method for reproducing media based on edit file
CN106941630A (en) A kind of method and apparatus for the code check for obtaining video slicing
KR101327638B1 (en) Media transmitting apparatus and packet scheduling method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10810513

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2771187

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 2012525650

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2012000416

Country of ref document: CL

Ref document number: MX/A/2012/002087

Country of ref document: MX

Ref document number: 2010810513

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 12029435

Country of ref document: CO

WWE Wipo information: entry into national phase

Ref document number: 2232/DELNP/2012

Country of ref document: IN

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112012003843

Country of ref document: BR

ENP Entry into the national phase

Ref document number: 112012003843

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20120222