WO2013074160A1 - Association of mvc stereoscopic views to left or right eye display for 3dtv - Google Patents

Association of mvc stereoscopic views to left or right eye display for 3dtv Download PDF

Info

Publication number
WO2013074160A1
WO2013074160A1 PCT/US2012/049064 US2012049064W WO2013074160A1 WO 2013074160 A1 WO2013074160 A1 WO 2013074160A1 US 2012049064 W US2012049064 W US 2012049064W WO 2013074160 A1 WO2013074160 A1 WO 2013074160A1
Authority
WO
WIPO (PCT)
Prior art keywords
stream
mpeg
view
systems standard
video
Prior art date
Application number
PCT/US2012/049064
Other languages
French (fr)
Inventor
Mandayam A. Narasimhan
Original Assignee
General Instrument Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by General Instrument Corporation filed Critical General Instrument Corporation
Priority to BR112014011618A priority Critical patent/BR112014011618A2/en
Priority to EP12753616.7A priority patent/EP2781103A1/en
Priority to CN201280055877.XA priority patent/CN103931200A/en
Publication of WO2013074160A1 publication Critical patent/WO2013074160A1/en
Priority to IN3560CHN2014 priority patent/IN2014CN03560A/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2362Generation or processing of Service Information [SI]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • H04N13/161Encoding, multiplexing or demultiplexing different image signal components
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • H04N13/172Processing image signals image signals comprising non-image signal components, e.g. headers or format information
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/194Transmission of image signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2365Multiplexing of several video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4345Extraction or processing of SI, e.g. extracting service information from an MPEG stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4347Demultiplexing of several video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/436Interfacing a local distribution network, e.g. communicating with another STB or one or more peripheral devices inside the home
    • H04N21/4363Adapting the video stream to a specific local network, e.g. a Bluetooth® network
    • H04N21/43632Adapting the video stream to a specific local network, e.g. a Bluetooth® network involving a wired protocol, e.g. IEEE 1394
    • H04N21/43635HDMI
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/816Monomedia components thereof involving special video data, e.g 3D video
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/597Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding

Definitions

  • the subject matter of this application relates to association of MVC stereoscopic views to left or right eye display for 3DTV.
  • a video encoder 10 receives raw video data, typically in the HD-SDI format defined in SMPTE 292M, from a source (not shown) such as a camera.
  • the video encoder utilizes the HD-SDI data to generate a video elementary stream and supplies the video elementary stream to a video packetizer 14, which produces a video packetized elementary stream (PES) composed of variable length packets of up to 64 kbytes.
  • PES video packetized elementary stream
  • an audio encoder receives raw audio data from, for example, a microphone and supplies an audio elementary stream to an audio packetizer, which creates an audio PES composed of variable length packets.
  • the video encoder compresses the raw video data utilizing well-known prediction techniques based on correlations between successive pictures represented by the raw video data.
  • the video elementary stream represents a sequence of I (intra-coded) pictures, P (predictively coded) pictures and B (bidirectional predictively coded) pictures organized as a succession of groups of pictures (GOPs) starting with an I picture.
  • each packet of the video PES contains one or more encoded pictures.
  • the video and audio packetizers may supply the video and audio PESs to a transport stream multiplexer 18, which assigns respective program identifiers (PIDs) to the video PES and the audio PES and organizes the variable-length packets of the video and audio PESs as fixed-length MPEG-2 transport stream (TS) packets each having a header that includes the PID of the PES and a payload containing PES video (or audio) data.
  • PIDs program identifiers
  • TS MPEG-2 transport stream
  • the video and audio elementary streams conveyed by the video and audio PESs are considered to be services included in the single program transport stream (SPTS) that is output by the transport stream multiplexer.
  • SPTS single program transport stream
  • the SPTS may include other services also, such as second language audio and program guide data, conveyed by respective packetized elementary streams having respective PIDs assigned thereto.
  • the TS packets of the SPTS also include a program map table (PMT), which contains the PIDs of the elementary streams conveyed by the SPTS and may contain other signaling mechanisms.
  • PMT program map table
  • the SPTS that is output by the transport stream multiplexer 18 may be supplied to a program multiplexer 22 that combines that SPTS with other transport streams, conveying other programs, to produce a multi-program transport stream (MPTS).
  • MPTS multi-program transport stream
  • the MPTS is transmitted over a channel to a receiver 24 at which a program demultiplexer 26 separates a selected SPTS from the MPTS and supplies it to a transport stream demultiplexer 30.
  • the receiver 24 may be implemented in a set-top box (STB) connected to a digital TV appliance 25.
  • STB set-top box
  • the SPTS that is output by the transport stream multiplexer 18 may be transmitted directly to the transport stream demultiplexer 30 without first being combined with other transport streams to create the MPTS but in either case the transport stream demultiplexer receives the fixed-length transport stream packets of the selected SPTS and separates them on the basis of PID, depacketizes the transport stream packets to recreate the PES packets, and directs the video PES to a so-called video transport system target decoder (T-STD) 34 and the audio PES to an audio T- STD 38.
  • T-STD video transport system target decoder
  • the video T-STD 34 comprises a video depacketizer 40 and a video decoder 42.
  • the video depacketizer 40 receives the video PES from the transport stream demultiplexer and provides an encoded bitstream to the video decoder, which decodes the bitstream and outputs a stream of pictures in display order to the TV appliance 25.
  • the decoder 42 is connected to the TV appliance through an HDMI (High Definition Multimedia Interface) cable.
  • An HDMI encoder in the STB creates an HDMI compliant digital signal, which passes to an HDMI decoder in the TV appliance through the HDMI cable.
  • the HDMI decoder generates appropriate signals for driving the display circuits of the TV appliance.
  • the MPEG-2 transport stream which is defined in the MPEG-2 systems standard (ISO/IEC 13818-1), is widely used for delivery of encoded video over an error prone channel.
  • the MPEG-2 systems standard also defines the MPEG-2 program stream, which may be used for transmission of encoded video in an error free environment.
  • FIG. 1 illustrates transmission of the video PESs as a program stream to a video program system target decoder (P-STD) 50 as an alternative to delivery as a transport stream to the video T-STD 34.
  • P-STD video program system target decoder
  • MPEG-2 systems standard stream is used herein to refer to both the MPEG-2 transport stream and the MPEG-2 program stream. It will be appreciated that regardless of whether the video content is delivered over a program stream or a transport stream, other functional blocks than those described and illustrated above might be required in a practical implementation of the method described.
  • the bitstream produced by the video encoder 10 may comply with the video compression standard that is specified in ISO/IEC 14496-10 (MPEG-4 part 10) Advanced Video Coding (AVC), commonly referred to as H.264/ AVC.
  • AVC Advanced Video Coding
  • Those skilled in the art will understand that H.264/AVC allows for a picture to be segmented into slices and that a picture may be composed of one or more I slices, P slices and B slices. For simplicity and clarity, however, we will confine the following discussion to pictures.
  • the first picture, or access unit, of a coded video sequence (which corresponds to the GOP of previous video compression standards) must be an IDR (instantaneous decoder refresh) access unit, which is defined as an access unit that contains only I slices or SI (switched intra-coded) slices.
  • IDR instantaneous decoder refresh
  • Annex H of H.264/AVC prescribes an extension of H.264/AVC known as multiview video coding or MVC, which provides efficient compression of multiple views of the same scene.
  • an MVC video encoder 110 receives raw video data, typically as HD-SDI data streams, from respective sources representing views of the same scene. Three sources and three data streams suffice to illustrate the principles of Annex H but it will be appreciated by those skilled in the art that there could be two sources or four or more sources.
  • the MVC encoder 110 generates a single compressed video elementary stream conveying a succession of coded video sequences each starting with an IDR access unit and containing access units derived from the three views respectively, using the temporal and inter-view correlations among the three streams.
  • the IDR access unit at the start of a coded video sequence may be derived from any one of the three streams. In general, the IDR access unit at the start of a coded video sequence will be derived from a different stream from the IDR access unit at the start of the immediately preceding coded video sequence.
  • the compressed video elementary stream is packetized by the video packetizer 114 and the video PES passes to a transport stream multiplexer 118.
  • the MPEG-2 systems standard does not allow a single PID to be assigned both to an IDR access unit and to access units derived from other views. Accordingly, the transport stream multiplexer 118 separates the coded video sequence into a primary sequence of access units containing the IDR access unit and other access units derived from the same view (referred to as the base view) and a secondary sequence of access units containing access units derived from additional views (two additional views in the case of the example) and assigns respective PIDs to the two sequences.
  • the transport stream multiplexer 18 shown in FIG. 1 the transport stream multiplexer 118 shown in FIG.
  • the transport stream demultiplexer 130 recovers the succession of coded video sequences and passes the video PES packets to a video depacketizer 140, which depacketizes the video PES and reconstructs the video elementary stream.
  • An MVC video decoder 142 decodes the access units and supplies three streams of presentation units, corresponding to the three streams of pictures received by the MVC video encoder 110, to an HDMI encoder 146.
  • the HDMI encoder receives control signals from a display appliance 125, such as a video game console, via an HDMI decoder 148 and in response to the control signals the HDMI encoder creates an HDMI compliant digital signal conveying a selected one of the three views to the HDMI decoder through an HDMI cable.
  • the HDMI decoder generates appropriate signals for driving the display circuits of the display appliance for displaying the selected view.
  • Multiview video coding has been applied to the case in which the multiple views are left and right eye stereoscopic views, in order to support delivery of 3D video content to a 3D capable TV appliance.
  • left and right cameras 208L, 208R provide raw video data representing left and right eye views of a scene and the MVC encoder 110 produces an encoded bitstream conveying access units derived from the left and right eye views that enable the MVC decoder 142, which decodes the access units and recovers the corresponding presentation units, to supply a sequence of presentation units to the HDMI encoder 146.
  • a signaling mechanism outside the TS bitstream enables the HDMI encoder to determine whether a given presentation unit represents a left eye view or a right eye view.
  • the HDMI decoder 148 sends control signals to the HDMI encoder calling for a sequence of left and right eye view presentation units, as required by the display circuits of the TV appliance 225 in order to provide a 3D display, and the HDMI encoder responds to the control signals by supplying the proper sequence of left and right eye view presentation units to the HDMI decoder 148.
  • These extensions of MVC are not applicable to a signal that is to be transmitted over an MPEG-2 system standard stream because the MPEG-2 systems standard does not currently provide a mechanism for preserving the left and right eye view information.
  • Neither the H.264/AVC video compression standard nor the MPEG-2 systems standard provides a signaling mechanism for providing information that associates presentation units or access units with the hand (left or right) of the view represented by the presentation unit or access unit. Absence of such a standardized mechanism limits development of applications of MVC to delivery of 3D content.
  • a method of delivering video data representing left and right eye views of a scene encoded in accordance with multiview video coding (MVC) from a transmitter to a receiver over an MPEG-2 systems standard stream comprising receiving at the transmitter an input video elementary stream conveying data encoded as a base view and an enhancement view, wherein the base view represents one of the left and right eye views and the enhancement view represents the other of the left and right eye views, employing an MPEG-2 systems standard multiplexer at the transmitter to generate an MPEG-2 systems standard stream that is derived from the input video elementary stream and conveys association information associating the base view with said one of the left and right eye views, transmitting the MPEG-2 systems standard stream from the transmitter, receiving the MPEG-2 systems standard stream at the receiver, and employing an MPEG-2 systems standard demultiplexer at the receiver to generate an output video elementary stream derived from the MPEG-2 systems standard stream, wherein the output video elementary stream conveys data encoded as a base
  • a method of delivering video data representing left and right eye views of a scene encoded in accordance with multiview video coding (MVC) from a transmitter to a receiver over an MPEG-2 systems standard stream comprising receiving at a transmitter an input video elementary stream conveying data encoded as a base view and an enhancement view, wherein the base view represents one of the left and right eye views and the enhancement view represents the other of the left and right eye views, employing an MPEG-2 systems standard multiplexer at the transmitter to generate an MPEG-2 systems standard stream that is derived from the input video elementary stream and conveys association information associating the base view with said one of the left and right eye views, and transmitting the MPEG-2 systems standard stream from the transmitter.
  • MVC multiview video coding
  • a method of processing an MPEG-2 systems standard stream that conveys video data encoded in accordance with multiview video coding (MVC) and representing a base view and an enhancement view of a scene, and also conveys association information associating the base view with one of a left eye view and a right eye view of a scene, comprising employing an MPEG-2 systems standard demultiplexer to generate an output video elementary stream derived from the MPEG- 2 systems standard stream, wherein the output video elementary stream conveys data encoded as a base view and an enhancement view, and recover the association information from the MPEG-2 systems standard stream.
  • MVC multiview video coding
  • a non-transitory computer readable medium containing software that, when executed by a computer having an input for receiving a signal conveying video data representing left and right eye views of a scene encoded in accordance with multiview video coding (MVC) from a transmitter to a receiver over an MPEG-2 systems standard stream, delivers the data to a receiver by a method that includes receiving at a transmitter an input video elementary stream conveying data encoded as a base view and an enhancement view, wherein the base view represents one of the left and right eye views and the enhancement view represents the other of the left and right eye views, employing an MPEG-2 systems standard multiplexer at the transmitter to generate an MPEG-2 systems standard stream that is derived from the input video elementary stream and conveys association information associating the base view with said one of the left and right eye views, and transmitting the MPEG-2 systems standard stream from the transmitter.
  • MVC multiview video coding
  • a non-transitory computer readable medium containing software that, when executed by a computer having an input for receiving an MPEG-2 systems standard stream that conveys video data encoded in accordance with multiview video coding (MVC) and representing a base view and an enhancement view of a scene, and also conveys association information associating the base view with one of a left eye view and a right eye view of a scene, processes the MPEG-2 systems standard stream by a method that comprises employing an MPEG-2 systems standard demultiplexer to generate an output video elementary stream derived from the MPEG-2 systems standard stream, wherein the output video elementary stream conveys data encoded as a base view and an enhancement view, and recover the association information from the MPEG-2 systems standard stream.
  • MVC multiview video coding
  • FIG. 1 is a block schematic illustration of the architecture of a first system for supplying compressed video material for presentation
  • FIG. 2 is a block schematic illustration of the architecture of a second system for supplying compressed video material for presentation
  • FIG. 3 is a block schematic illustration of the architecture of a third system for supplying compressed video material for presentation
  • FIG. 4 is a block schematic illustration of the architecture of a fourth system for supplying compressed video material for presentation.
  • FIG. 5 is a block schematic diagram of a computing machine that may be used to implement parts of the processes described with reference to FIG. 4.
  • Each coded video sequence produced by the MVC encoder 110 shown in FIG. 4 starts with an IDR picture, which may be derived from the left eye view or the right eye view.
  • the view (left or right eye) that provides the IDR picture is the base view for the coded video sequence; the other view is the enhancement view.
  • the base view for a given coded video sequence is not the same (left eye or right eye) as the base view for the immediately preceding coded video sequence.
  • the transport stream multiplexer 218 shown in FIG. 4 organizes the variable-length PES packets for a coded video sequence received from the video packetizer in two component sequences of fixed-length TS packets.
  • One component sequence (referred to herein as the primary sequence) contains the PES packets that convey the base view pictures and is assigned a primary PID.
  • the other component sequence (the secondary sequence) contains the PES packets that convey the enhancement view pictures and is assigned a secondary PID.
  • the PES packets of the primary sequence convey the IDR picture with which the coded video sequence starts and that the secondary sequence does not convey an IDR picture.
  • the secondary sequence starts with a so-called anchor picture. It will also be appreciated that if the primary sequence derived from the first of two consecutive coded video sequences conveys right eye view pictures, in general the primary sequence derived from the second of the two coded video sequence will convey left eye view pictures.
  • the transport stream multiplexer 218 receives information that explicitly associates the base view of the current coded video sequence with the appropriate (left or right) eye view. This association information is ultimately derived from the association of the cameras with the left and right eye views respectively and may be included in a supplemental enhancement information (SEI) message produced by the MVC encoder or it may be provided to the TS multiplexer by a signaling mechanism separate from the video data provided by the cameras.
  • SEI Supplemental Enhancement information
  • the program map table (PMT) of the MPEG-2 transport stream produced by the transport stream multiplexer 218 may contain program specific information (PSI), i.e. information about the program conveyed by the transport stream.
  • PSI may include descriptors containing standards-defined or user-defined data elements.
  • the program map table may include a video stream descriptor containing coding parameters of a video elementary stream.
  • the descriptors may be used to signal information about the program to the T-STD. Similar descriptors may be conveyed in a program stream map of an MPEG-2 program stream to signal information about the program to the P-STD.
  • the TS multiplexer 218 includes a descriptor processor that receives the association information and generates a view association descriptor that contains a parameter having a value left or right depending on whether the association information indicates that the base view of the current coded video sequence is the left eye view or the right eye view.
  • the transport stream multiplexer 218 includes the view association descriptor in the PMT of the transport stream that is output by the transport stream multiplexer.
  • the single program transport stream is delivered over a signal propagation medium to a transport stream demultiplexer 230.
  • the signal propagation medium may include any suitable medium or combination of media, such as cable TV distribution network, the Internet, and wireless transmitters and receivers.
  • the SPTS may be incorporated in an MPTS for delivery over the signal propagation medium.
  • the transport stream demultiplexer 230 receives the SPTS conveying the primary and secondary component sequences, derived from the coded video sequences provided to the transport stream multiplexer 218, separates the primary and secondary component sequences based on the respective PIDs and recombines the primary and secondary component sequences to recreate PES packets containing the succession of coded video sequences.
  • the video PES packets pass to the video depacketizer, which depacketizes the video PES packets and reconstructs the video elementary stream and supplies it to the MVC decoder 142.
  • the MVC decoder decompresses the access units and generates two sequences of presentation units, corresponding to the base view and the enhancement view respectively, and supplies the two sequences of presentation units to the HDMI encoder 146.
  • the transport stream demultiplexer 230 also recovers the association information from the view association descriptor and passes the association information to the HDMI encoder.
  • the association information may be passed from the TS demultiplexer to the HDMI encoder via the MVC decoder, as schematically indicated in FIG. 4, for example in an SEI message that the TS demultiplexer includes in the bitstream supplied to the video depacketizer, or it may be passed directly from the TS demultiplexer to the MVC decoder.
  • the association information is synchronized with the coded video sequences by virtue of the structure of the video PES; in the latter case it is important to ensure that the association information is synchronized with the coded video sequences.
  • the HMDI decoder 148 sends control signals to the HDMI encoder 146 calling for an alternating sequence of left eye view pictures and right eye view pictures.
  • the HDMI encoder uses the synchronized association information to determine whether the base view picture is a left eye view picture or a right eye view picture and uses the two sequences of presentation units and the association information to create an HDMI compliant digital signal conveying the proper sequence of left eye view pictures and right eye view pictures, at the times required by the control signals provided by the HDMI decoder, and supplies the HDMI signal to the HDMI decoder in the TV appliance through an HDMI cable.
  • the 3D capable TV appliance is then able to provide a proper 3D display.
  • the video PES that is output by the video packetizer 114 may also be conveyed to a video depacketizer 240 by an MPEG-2 program stream.
  • the video PES packets are delivered to a program stream multiplexer for forming the program stream, which passes to a program stream demultiplexer.
  • the program stream demultiplexer outputs the video PES packets to the depacketizer 240.
  • the program stream multiplexer receives the association information either in an SEI message or separately from any SEI messages received from the MVC encoder.
  • the PS multiplexer includes the association information in a descriptor that is conveyed by the program stream map of the program stream.
  • the PS demultiplexer recovers the association information and supplies it to the video depacketizer.
  • the HDMI decoder 248 does not call for an alternating sequence of left eye view pictures and right eye view pictures.
  • the HDMI encoder 146 provides a sequence of single eye view pictures.
  • the single eye view pictures will generally alternate between the base view and the enhancement view.
  • the HDMI decoder 248 generates appropriate signals for driving the display circuits 250 of the TV appliance for displaying the single eye view.
  • the delivery system shown in FIG. 4 is backwardly compatible with a 2D TV appliance.
  • a computer comprising at least one processor 161, random access memory 162, read only memory 163, I/O devices 164 (including suitable adaptors for receiving and transmitting bitstreams), a user interface 165, a CD ROM drive 166 and a hard disk drive 167, configured in a generally conventional architecture.
  • the computer operates in accordance with a program that is stored in a non-transitory computer readable medium, such as the hard disk drive 167 or a CD-ROM 168, and is loaded into the random access memory 162 for execution.
  • the program is composed of instructions such that when the computer receives a signal representing the input of the functional block or blocks, by way of a suitable interface included in the I/O devices 164, the computer allocates memory to appropriate buffers and utilizes other suitable resources and functions to perform the various operations that are described above as being performed by the functional block or blocks.
  • the program might not be loadable directly from the CD-ROM 168 into the random access memory utilizing the CD-ROM drive 166 and that generally the program will be stored on the CD- ROM or other distribution medium in a form that requires the program to be installed on the hard disk drive 167 from the CD-ROM 168.
  • FIG. 4 refers to the view association descriptor as containing information that indicates whether the base view is the left eye view or the right eye view
  • the descriptor could contain information that indicates whether the enhancement view is the left eye view or the right eye view, or information that indicates explicitly both the hand (left or right) of the base view and the hand of the enhancement view.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

A method of delivering video data representing left (208L) and right (208R) eye views of a scene encoded (110) in accordance with mul- tiview video coding (MVC) from a transmitter to a receiver over an MPEG-2 systems standard stream, includes receiving at a transmitter (114, 218) an input video elementary stream conveying data encoded as a base view and an enhancement view, wherein the base view rep- resents a specific one of the left and right eye views and the enhance- ment view represents the other of the left and right eye views. An MPEG-2 systems standard multiplexer (218) at the transmitter gener- ates an MPEG-2 systems standard stream that is derived from the in- put video elementary stream and conveys association information as- sociating the base view with the specific one of the left and right eye views. The MPEG-2 systems standard stream is transmitted from the transmitter to the receiver.

Description

ASSOCIATION OF MVC STEREOSCOPIC VIEWS TO LEFT OR RIGHT EYE
DISPLAY FOR 3DTV
CROSS-REFERENCE TO RELATED APPLICATION
[0001] This application claims benefit under 35 USC 120 of U.S. Provisional Application No. 61/559,149 filed November 14, 2011, the entire disclosure of which is hereby incorporated by reference herein for all purposes.
BACKGROUND OF THE INVENTION
[0002] The subject matter of this application relates to association of MVC stereoscopic views to left or right eye display for 3DTV.
[0003] Referring to FIG. 1 of the drawings, in a conventional method of distributing 2D video content a video encoder 10 receives raw video data, typically in the HD-SDI format defined in SMPTE 292M, from a source (not shown) such as a camera. The video encoder utilizes the HD-SDI data to generate a video elementary stream and supplies the video elementary stream to a video packetizer 14, which produces a video packetized elementary stream (PES) composed of variable length packets of up to 64 kbytes. Similarly, an audio encoder (not shown) receives raw audio data from, for example, a microphone and supplies an audio elementary stream to an audio packetizer, which creates an audio PES composed of variable length packets.
[0004] In generating the video elementary stream, the video encoder compresses the raw video data utilizing well-known prediction techniques based on correlations between successive pictures represented by the raw video data. The video elementary stream represents a sequence of I (intra-coded) pictures, P (predictively coded) pictures and B (bidirectional predictively coded) pictures organized as a succession of groups of pictures (GOPs) starting with an I picture. Typically, each packet of the video PES contains one or more encoded pictures. [0005] The video and audio packetizers may supply the video and audio PESs to a transport stream multiplexer 18, which assigns respective program identifiers (PIDs) to the video PES and the audio PES and organizes the variable-length packets of the video and audio PESs as fixed-length MPEG-2 transport stream (TS) packets each having a header that includes the PID of the PES and a payload containing PES video (or audio) data.
[0006] The video and audio elementary streams conveyed by the video and audio PESs are considered to be services included in the single program transport stream (SPTS) that is output by the transport stream multiplexer. The SPTS may include other services also, such as second language audio and program guide data, conveyed by respective packetized elementary streams having respective PIDs assigned thereto.
[0007] The TS packets of the SPTS also include a program map table (PMT), which contains the PIDs of the elementary streams conveyed by the SPTS and may contain other signaling mechanisms. [0008] The SPTS that is output by the transport stream multiplexer 18 may be supplied to a program multiplexer 22 that combines that SPTS with other transport streams, conveying other programs, to produce a multi-program transport stream (MPTS). The MPTS is transmitted over a channel to a receiver 24 at which a program demultiplexer 26 separates a selected SPTS from the MPTS and supplies it to a transport stream demultiplexer 30. The receiver 24 may be implemented in a set-top box (STB) connected to a digital TV appliance 25. It will be appreciated by those skilled in the art that the SPTS that is output by the transport stream multiplexer 18 may be transmitted directly to the transport stream demultiplexer 30 without first being combined with other transport streams to create the MPTS but in either case the transport stream demultiplexer receives the fixed-length transport stream packets of the selected SPTS and separates them on the basis of PID, depacketizes the transport stream packets to recreate the PES packets, and directs the video PES to a so-called video transport system target decoder (T-STD) 34 and the audio PES to an audio T- STD 38. The subject matter of this application is concerned with processing video data and accordingly we will not discuss the audio decoder further. [0009] The video T-STD 34 comprises a video depacketizer 40 and a video decoder 42. The video depacketizer 40 receives the video PES from the transport stream demultiplexer and provides an encoded bitstream to the video decoder, which decodes the bitstream and outputs a stream of pictures in display order to the TV appliance 25. Typically, the decoder 42 is connected to the TV appliance through an HDMI (High Definition Multimedia Interface) cable. An HDMI encoder in the STB creates an HDMI compliant digital signal, which passes to an HDMI decoder in the TV appliance through the HDMI cable. The HDMI decoder generates appropriate signals for driving the display circuits of the TV appliance. [0010] The MPEG-2 transport stream, which is defined in the MPEG-2 systems standard (ISO/IEC 13818-1), is widely used for delivery of encoded video over an error prone channel. The MPEG-2 systems standard also defines the MPEG-2 program stream, which may be used for transmission of encoded video in an error free environment. FIG. 1 illustrates transmission of the video PESs as a program stream to a video program system target decoder (P-STD) 50 as an alternative to delivery as a transport stream to the video T-STD 34. The term "MPEG-2 systems standard stream" is used herein to refer to both the MPEG-2 transport stream and the MPEG-2 program stream. It will be appreciated that regardless of whether the video content is delivered over a program stream or a transport stream, other functional blocks than those described and illustrated above might be required in a practical implementation of the method described.
[0011] The bitstream produced by the video encoder 10 may comply with the video compression standard that is specified in ISO/IEC 14496-10 (MPEG-4 part 10) Advanced Video Coding (AVC), commonly referred to as H.264/ AVC. Those skilled in the art will understand that H.264/AVC allows for a picture to be segmented into slices and that a picture may be composed of one or more I slices, P slices and B slices. For simplicity and clarity, however, we will confine the following discussion to pictures. The first picture, or access unit, of a coded video sequence (which corresponds to the GOP of previous video compression standards) must be an IDR (instantaneous decoder refresh) access unit, which is defined as an access unit that contains only I slices or SI (switched intra-coded) slices.
[0012] Annex H of H.264/AVC prescribes an extension of H.264/AVC known as multiview video coding or MVC, which provides efficient compression of multiple views of the same scene.
[0013] Referring to FIG. 2, an MVC video encoder 110 receives raw video data, typically as HD-SDI data streams, from respective sources representing views of the same scene. Three sources and three data streams suffice to illustrate the principles of Annex H but it will be appreciated by those skilled in the art that there could be two sources or four or more sources. The MVC encoder 110 generates a single compressed video elementary stream conveying a succession of coded video sequences each starting with an IDR access unit and containing access units derived from the three views respectively, using the temporal and inter-view correlations among the three streams. The IDR access unit at the start of a coded video sequence may be derived from any one of the three streams. In general, the IDR access unit at the start of a coded video sequence will be derived from a different stream from the IDR access unit at the start of the immediately preceding coded video sequence.
[0014] The compressed video elementary stream is packetized by the video packetizer 114 and the video PES passes to a transport stream multiplexer 118. The MPEG-2 systems standard does not allow a single PID to be assigned both to an IDR access unit and to access units derived from other views. Accordingly, the transport stream multiplexer 118 separates the coded video sequence into a primary sequence of access units containing the IDR access unit and other access units derived from the same view (referred to as the base view) and a secondary sequence of access units containing access units derived from additional views (two additional views in the case of the example) and assigns respective PIDs to the two sequences. Similarly to the transport stream multiplexer 18 shown in FIG. 1, the transport stream multiplexer 118 shown in FIG. 2 organizes the variable-length packets of the video PES and the corresponding audio PES as fixed-length TS packets and transmits the TS packets to a transport stream demultiplexer 130, either directly or as part of a multi-program transport stream through a program multiplexer/demultiplexer 122, 126. The transport stream demultiplexer 130 recovers the succession of coded video sequences and passes the video PES packets to a video depacketizer 140, which depacketizes the video PES and reconstructs the video elementary stream. An MVC video decoder 142 decodes the access units and supplies three streams of presentation units, corresponding to the three streams of pictures received by the MVC video encoder 110, to an HDMI encoder 146. The HDMI encoder receives control signals from a display appliance 125, such as a video game console, via an HDMI decoder 148 and in response to the control signals the HDMI encoder creates an HDMI compliant digital signal conveying a selected one of the three views to the HDMI decoder through an HDMI cable. The HDMI decoder generates appropriate signals for driving the display circuits of the display appliance for displaying the selected view.
[0015] Multiview video coding has been applied to the case in which the multiple views are left and right eye stereoscopic views, in order to support delivery of 3D video content to a 3D capable TV appliance. Referring to FIG. 3, left and right cameras 208L, 208R provide raw video data representing left and right eye views of a scene and the MVC encoder 110 produces an encoded bitstream conveying access units derived from the left and right eye views that enable the MVC decoder 142, which decodes the access units and recovers the corresponding presentation units, to supply a sequence of presentation units to the HDMI encoder 146. A signaling mechanism outside the TS bitstream enables the HDMI encoder to determine whether a given presentation unit represents a left eye view or a right eye view. The HDMI decoder 148 sends control signals to the HDMI encoder calling for a sequence of left and right eye view presentation units, as required by the display circuits of the TV appliance 225 in order to provide a 3D display, and the HDMI encoder responds to the control signals by supplying the proper sequence of left and right eye view presentation units to the HDMI decoder 148. These extensions of MVC are not applicable to a signal that is to be transmitted over an MPEG-2 system standard stream because the MPEG-2 systems standard does not currently provide a mechanism for preserving the left and right eye view information. [0016] Neither the H.264/AVC video compression standard nor the MPEG-2 systems standard provides a signaling mechanism for providing information that associates presentation units or access units with the hand (left or right) of the view represented by the presentation unit or access unit. Absence of such a standardized mechanism limits development of applications of MVC to delivery of 3D content.
SUMMARY OF THE INVENTION
[0017] According to a first aspect of the subject matter of this application there is provided a method of delivering video data representing left and right eye views of a scene encoded in accordance with multiview video coding (MVC) from a transmitter to a receiver over an MPEG-2 systems standard stream, comprising receiving at the transmitter an input video elementary stream conveying data encoded as a base view and an enhancement view, wherein the base view represents one of the left and right eye views and the enhancement view represents the other of the left and right eye views, employing an MPEG-2 systems standard multiplexer at the transmitter to generate an MPEG-2 systems standard stream that is derived from the input video elementary stream and conveys association information associating the base view with said one of the left and right eye views, transmitting the MPEG-2 systems standard stream from the transmitter, receiving the MPEG-2 systems standard stream at the receiver, and employing an MPEG-2 systems standard demultiplexer at the receiver to generate an output video elementary stream derived from the MPEG-2 systems standard stream, wherein the output video elementary stream conveys data encoded as a base view and an enhancement view, and recover the association information from the MPEG-2 systems standard stream.
[0018] According to a second aspect of the subject matter of this application there is provided a method of delivering video data representing left and right eye views of a scene encoded in accordance with multiview video coding (MVC) from a transmitter to a receiver over an MPEG-2 systems standard stream, comprising receiving at a transmitter an input video elementary stream conveying data encoded as a base view and an enhancement view, wherein the base view represents one of the left and right eye views and the enhancement view represents the other of the left and right eye views, employing an MPEG-2 systems standard multiplexer at the transmitter to generate an MPEG-2 systems standard stream that is derived from the input video elementary stream and conveys association information associating the base view with said one of the left and right eye views, and transmitting the MPEG-2 systems standard stream from the transmitter.
[0019] According to a third aspect of the subject matter of this application there is provided a method of processing an MPEG-2 systems standard stream that conveys video data encoded in accordance with multiview video coding (MVC) and representing a base view and an enhancement view of a scene, and also conveys association information associating the base view with one of a left eye view and a right eye view of a scene, comprising employing an MPEG-2 systems standard demultiplexer to generate an output video elementary stream derived from the MPEG- 2 systems standard stream, wherein the output video elementary stream conveys data encoded as a base view and an enhancement view, and recover the association information from the MPEG-2 systems standard stream.
[0020] According to a fourth aspect of the subject matter of this application there is provided a non-transitory computer readable medium containing software that, when executed by a computer having an input for receiving a signal conveying video data representing left and right eye views of a scene encoded in accordance with multiview video coding (MVC) from a transmitter to a receiver over an MPEG-2 systems standard stream, delivers the data to a receiver by a method that includes receiving at a transmitter an input video elementary stream conveying data encoded as a base view and an enhancement view, wherein the base view represents one of the left and right eye views and the enhancement view represents the other of the left and right eye views, employing an MPEG-2 systems standard multiplexer at the transmitter to generate an MPEG-2 systems standard stream that is derived from the input video elementary stream and conveys association information associating the base view with said one of the left and right eye views, and transmitting the MPEG-2 systems standard stream from the transmitter. [0021] According to a fifth aspect of the subject matter of this application there is provided a non-transitory computer readable medium containing software that, when executed by a computer having an input for receiving an MPEG-2 systems standard stream that conveys video data encoded in accordance with multiview video coding (MVC) and representing a base view and an enhancement view of a scene, and also conveys association information associating the base view with one of a left eye view and a right eye view of a scene, processes the MPEG-2 systems standard stream by a method that comprises employing an MPEG-2 systems standard demultiplexer to generate an output video elementary stream derived from the MPEG-2 systems standard stream, wherein the output video elementary stream conveys data encoded as a base view and an enhancement view, and recover the association information from the MPEG-2 systems standard stream.
BRIEF DESCRIPTION OF THE DRAWINGS
[0022] For a better understanding of the invention, and to show how the same may be carried into effect, reference will now be made, by way of example, to the accompanying drawings, in which:
[0023] FIG. 1 is a block schematic illustration of the architecture of a first system for supplying compressed video material for presentation,
[0024] FIG. 2 is a block schematic illustration of the architecture of a second system for supplying compressed video material for presentation,
[0025] FIG. 3 is a block schematic illustration of the architecture of a third system for supplying compressed video material for presentation,
[0026] FIG. 4 is a block schematic illustration of the architecture of a fourth system for supplying compressed video material for presentation, and
[0027] FIG. 5 is a block schematic diagram of a computing machine that may be used to implement parts of the processes described with reference to FIG. 4. DETAILED DESCRIPTION
[0028] Each coded video sequence produced by the MVC encoder 110 shown in FIG. 4 starts with an IDR picture, which may be derived from the left eye view or the right eye view. The view (left or right eye) that provides the IDR picture is the base view for the coded video sequence; the other view is the enhancement view. In general, the base view for a given coded video sequence is not the same (left eye or right eye) as the base view for the immediately preceding coded video sequence.
[0029] The transport stream multiplexer 218 shown in FIG. 4 organizes the variable-length PES packets for a coded video sequence received from the video packetizer in two component sequences of fixed-length TS packets. One component sequence (referred to herein as the primary sequence) contains the PES packets that convey the base view pictures and is assigned a primary PID. The other component sequence (the secondary sequence) contains the PES packets that convey the enhancement view pictures and is assigned a secondary PID. It will be appreciated that the PES packets of the primary sequence convey the IDR picture with which the coded video sequence starts and that the secondary sequence does not convey an IDR picture. The secondary sequence starts with a so-called anchor picture. It will also be appreciated that if the primary sequence derived from the first of two consecutive coded video sequences conveys right eye view pictures, in general the primary sequence derived from the second of the two coded video sequence will convey left eye view pictures.
[0030] The transport stream multiplexer 218 receives information that explicitly associates the base view of the current coded video sequence with the appropriate (left or right) eye view. This association information is ultimately derived from the association of the cameras with the left and right eye views respectively and may be included in a supplemental enhancement information (SEI) message produced by the MVC encoder or it may be provided to the TS multiplexer by a signaling mechanism separate from the video data provided by the cameras. [0031] The program map table (PMT) of the MPEG-2 transport stream produced by the transport stream multiplexer 218 may contain program specific information (PSI), i.e. information about the program conveyed by the transport stream. The PSI may include descriptors containing standards-defined or user-defined data elements. For example, the program map table may include a video stream descriptor containing coding parameters of a video elementary stream. The descriptors may be used to signal information about the program to the T-STD. Similar descriptors may be conveyed in a program stream map of an MPEG-2 program stream to signal information about the program to the P-STD. [0032] The TS multiplexer 218 includes a descriptor processor that receives the association information and generates a view association descriptor that contains a parameter having a value left or right depending on whether the association information indicates that the base view of the current coded video sequence is the left eye view or the right eye view. The transport stream multiplexer 218 includes the view association descriptor in the PMT of the transport stream that is output by the transport stream multiplexer.
[0033] The single program transport stream is delivered over a signal propagation medium to a transport stream demultiplexer 230. The signal propagation medium may include any suitable medium or combination of media, such as cable TV distribution network, the Internet, and wireless transmitters and receivers. The SPTS may be incorporated in an MPTS for delivery over the signal propagation medium.
[0034] The transport stream demultiplexer 230 receives the SPTS conveying the primary and secondary component sequences, derived from the coded video sequences provided to the transport stream multiplexer 218, separates the primary and secondary component sequences based on the respective PIDs and recombines the primary and secondary component sequences to recreate PES packets containing the succession of coded video sequences. The video PES packets pass to the video depacketizer, which depacketizes the video PES packets and reconstructs the video elementary stream and supplies it to the MVC decoder 142. For each coded video sequence, the MVC decoder decompresses the access units and generates two sequences of presentation units, corresponding to the base view and the enhancement view respectively, and supplies the two sequences of presentation units to the HDMI encoder 146.
[0035] The transport stream demultiplexer 230 also recovers the association information from the view association descriptor and passes the association information to the HDMI encoder. The association information may be passed from the TS demultiplexer to the HDMI encoder via the MVC decoder, as schematically indicated in FIG. 4, for example in an SEI message that the TS demultiplexer includes in the bitstream supplied to the video depacketizer, or it may be passed directly from the TS demultiplexer to the MVC decoder. In the former case, the association information is synchronized with the coded video sequences by virtue of the structure of the video PES; in the latter case it is important to ensure that the association information is synchronized with the coded video sequences.
[0036] In order to provide a proper 3D display, the HMDI decoder 148 sends control signals to the HDMI encoder 146 calling for an alternating sequence of left eye view pictures and right eye view pictures. The HDMI encoder uses the synchronized association information to determine whether the base view picture is a left eye view picture or a right eye view picture and uses the two sequences of presentation units and the association information to create an HDMI compliant digital signal conveying the proper sequence of left eye view pictures and right eye view pictures, at the times required by the control signals provided by the HDMI decoder, and supplies the HDMI signal to the HDMI decoder in the TV appliance through an HDMI cable. The 3D capable TV appliance is then able to provide a proper 3D display. [0037] The video PES that is output by the video packetizer 114 may also be conveyed to a video depacketizer 240 by an MPEG-2 program stream. As shown in FIG. 4, the video PES packets are delivered to a program stream multiplexer for forming the program stream, which passes to a program stream demultiplexer. The program stream demultiplexer outputs the video PES packets to the depacketizer 240. [0038] Similarly to the transport stream case, the program stream multiplexer receives the association information either in an SEI message or separately from any SEI messages received from the MVC encoder. The PS multiplexer includes the association information in a descriptor that is conveyed by the program stream map of the program stream. The PS demultiplexer recovers the association information and supplies it to the video depacketizer.
[0039] In the event that the HDMI encoder is connected to a 2D TV appliance 325, the HDMI decoder 248 does not call for an alternating sequence of left eye view pictures and right eye view pictures. The HDMI encoder 146 provides a sequence of single eye view pictures. The single eye view pictures will generally alternate between the base view and the enhancement view. The HDMI decoder 248 generates appropriate signals for driving the display circuits 250 of the TV appliance for displaying the single eye view. Thus, the delivery system shown in FIG. 4 is backwardly compatible with a 2D TV appliance. [0040] Referring to FIG. 5, one or more of the functional blocks shown in FIG. 4 for producing the transport stream or the program stream, or one or more of the functional blocks shown in FIG. 4 for receiving the transport stream or the program stream and providing the HDMI compliant signal to the appliance 225 or 325, may be implemented using a computer comprising at least one processor 161, random access memory 162, read only memory 163, I/O devices 164 (including suitable adaptors for receiving and transmitting bitstreams), a user interface 165, a CD ROM drive 166 and a hard disk drive 167, configured in a generally conventional architecture. The computer operates in accordance with a program that is stored in a non-transitory computer readable medium, such as the hard disk drive 167 or a CD-ROM 168, and is loaded into the random access memory 162 for execution. The program is composed of instructions such that when the computer receives a signal representing the input of the functional block or blocks, by way of a suitable interface included in the I/O devices 164, the computer allocates memory to appropriate buffers and utilizes other suitable resources and functions to perform the various operations that are described above as being performed by the functional block or blocks. [0041] It will be appreciated by those skilled in the art that the program might not be loadable directly from the CD-ROM 168 into the random access memory utilizing the CD-ROM drive 166 and that generally the program will be stored on the CD- ROM or other distribution medium in a form that requires the program to be installed on the hard disk drive 167 from the CD-ROM 168.
[0042] It will be appreciated that the invention is not restricted to the particular embodiment that has been described, and that variations may be made therein without departing from the scope of the invention as defined in the appended claims, as interpreted in accordance with principles of prevailing law, including the doctrine of equivalents or any other principle that enlarges the enforceable scope of a claim beyond its literal scope. For example, although the description of FIG. 4 refers to the view association descriptor as containing information that indicates whether the base view is the left eye view or the right eye view, the descriptor could contain information that indicates whether the enhancement view is the left eye view or the right eye view, or information that indicates explicitly both the hand (left or right) of the base view and the hand of the enhancement view. Unless the context indicates otherwise, a reference in a claim to the number of instances of an element, be it a reference to one instance or more than one instance, requires at least the stated number of instances of the element but is not intended to exclude from the scope of the claim a structure or method having more instances of that element than stated. The word "comprise" or a derivative thereof, when used in a claim, is used in a nonexclusive sense that is not intended to exclude the presence of other elements or steps in a claimed structure or method.

Claims

1. A method of delivering video data representing left and right eye views of a scene encoded in accordance with multiview video coding (MVC) from a transmitter to a receiver over an MPEG-2 systems standard stream, comprising: receiving at a transmitter an input video elementary stream conveying data encoded as a base view and an enhancement view, wherein the base view represents one of the left and right eye views and the enhancement view represents the other of the left and right eye views,
employing an MPEG-2 systems standard multiplexer at the transmitter to generate an MPEG-2 systems standard stream that is derived from the input video elementary stream and conveys association information associating the base view with said one of the left and right eye views, and
transmitting the MPEG-2 systems standard stream from the transmitter.
2. A method according to claim 1, wherein the MPEG-2 systems standard stream is a transport stream and the method comprises conveying the association information by a descriptor of the transport stream.
3. A method according to claim 1, wherein the MPEG-2 systems standard stream is a program stream and the method comprises conveying the association information by a descriptor of the program stream.
4. A method of processing an MPEG-2 systems standard stream that conveys video data encoded in accordance with multiview video coding (MVC) and representing a base view and an enhancement view of a scene, and also conveys association information associating the base view with one of a left eye view and a right eye view of a scene, comprising:
employing an MPEG-2 systems standard demultiplexer to generate an output video elementary stream derived from the MPEG-2 systems standard stream, wherein the output video elementary stream conveys data encoded as a base view and an enhancement view, and recover the association information from the MPEG-2 systems standard stream.
5. A method according to claim 4, comprising delivering the output video elementary stream to a 3D capable display device via an interface that receives the association information and provides a succession of left eye view pictures and right eye view pictures to the display device in accordance with the association
information.
6. A method according to claim 4, wherein the MPEG-2 systems standard stream is a transport stream that conveys the association information by a descriptor of the transport stream.
7. A method according to claim 4, wherein the MPEG-2 systems standard stream is a program stream that conveys the association information by a descriptor of the program stream.
8. A non-transitory computer readable medium containing software that, when executed by a computer having an input for receiving an MPEG-2 systems standard stream that conveys video data encoded in accordance with multiview video coding (MVC) and representing a base view and an enhancement view of a scene, and also conveys association information associating the base view with one of a left eye view and a right eye view of a scene, processes the MPEG-2 systems standard stream by a method that comprises:
employing an MPEG-2 systems standard demultiplexer to generate an output video elementary stream derived from the MPEG-2 systems standard stream, wherein the output video elementary stream conveys data encoded as a base view and an enhancement view, and recover the association information from the MPEG-2 systems standard stream.
PCT/US2012/049064 2011-11-14 2012-07-31 Association of mvc stereoscopic views to left or right eye display for 3dtv WO2013074160A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
BR112014011618A BR112014011618A2 (en) 2011-11-14 2012-07-31 association of mcc stereoscopic views for left or right eye view for 3dtv
EP12753616.7A EP2781103A1 (en) 2011-11-14 2012-07-31 Association of mvc stereoscopic views to left or right eye display for 3dtv
CN201280055877.XA CN103931200A (en) 2011-11-14 2012-07-31 Association of MVC stereoscopic views to left or right eye display for 3DTV
IN3560CHN2014 IN2014CN03560A (en) 2011-11-14 2014-05-12

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201161559149P 2011-11-14 2011-11-14
US61/559,149 2011-11-14
US13/563,277 2012-07-31
US13/563,277 US20130120526A1 (en) 2011-11-14 2012-07-31 Association of mvc stereoscopic views to left or right eye display for 3dtv

Publications (1)

Publication Number Publication Date
WO2013074160A1 true WO2013074160A1 (en) 2013-05-23

Family

ID=48280242

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2012/049064 WO2013074160A1 (en) 2011-11-14 2012-07-31 Association of mvc stereoscopic views to left or right eye display for 3dtv

Country Status (6)

Country Link
US (1) US20130120526A1 (en)
EP (1) EP2781103A1 (en)
CN (1) CN103931200A (en)
BR (1) BR112014011618A2 (en)
IN (1) IN2014CN03560A (en)
WO (1) WO2013074160A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015016913A1 (en) * 2013-07-31 2015-02-05 Empire Technology Development Llc Encoding scheme

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100208750A1 (en) * 2009-02-13 2010-08-19 Samsung Electronics Co., Ltd. Method and appartus for generating three (3)-dimensional image data stream, and method and apparatus for receiving three (3)-dimensional image data stream
EP2343907A2 (en) * 2008-10-10 2011-07-13 LG Electronics Inc. Reception system and data processing method

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2384585A4 (en) * 2009-02-01 2017-03-15 LG Electronics Inc. Broadcast receiver and 3d video data processing method
KR20110126518A (en) * 2009-02-19 2011-11-23 파나소닉 주식회사 Recording medium, reproduction device, and integrated circuit
JP4993224B2 (en) * 2009-04-08 2012-08-08 ソニー株式会社 Playback apparatus and playback method
WO2010143439A1 (en) * 2009-06-12 2010-12-16 パナソニック株式会社 Reproduction device, integrated circuit, and recording medium
JP5521486B2 (en) * 2009-06-29 2014-06-11 ソニー株式会社 Stereoscopic image data transmitting apparatus and stereoscopic image data transmitting method
JP5407968B2 (en) * 2009-06-29 2014-02-05 ソニー株式会社 Stereoscopic image data transmitting apparatus and stereoscopic image data receiving apparatus
KR101372376B1 (en) * 2009-07-07 2014-03-14 경희대학교 산학협력단 Method for receiving stereoscopic video in digital broadcasting system
US8948241B2 (en) * 2009-08-07 2015-02-03 Qualcomm Incorporated Signaling characteristics of an MVC operation point
EP2489197A4 (en) * 2009-10-13 2014-03-05 Lg Electronics Inc Broadcast receiver and 3d video data processing method thereof
KR20110088334A (en) * 2010-01-28 2011-08-03 삼성전자주식회사 Method and apparatus for generating datastream to provide 3-dimensional multimedia service, method and apparatus for receiving the same
KR20110101099A (en) * 2010-03-05 2011-09-15 한국전자통신연구원 Method and appatus for providing 3 dimension tv service relating plural transmitting layer
US8683514B2 (en) * 2010-06-22 2014-03-25 Verizon Patent And Licensing Inc. Enhanced media content transport stream for media content delivery systems and methods

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2343907A2 (en) * 2008-10-10 2011-07-13 LG Electronics Inc. Reception system and data processing method
US20100208750A1 (en) * 2009-02-13 2010-08-19 Samsung Electronics Co., Ltd. Method and appartus for generating three (3)-dimensional image data stream, and method and apparatus for receiving three (3)-dimensional image data stream

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
THOMAS SCHIERL ET AL: "Transport and Storage Systems for 3-D Video Using MPEG-2 Systems, RTP, and ISO File Format", PROCEEDINGS OF THE IEEE, IEEE. NEW YORK, US, vol. 99, no. 4, 1 April 2011 (2011-04-01), pages 671 - 683, XP011363622, ISSN: 0018-9219, DOI: 10.1109/JPROC.2010.2091370 *

Also Published As

Publication number Publication date
EP2781103A1 (en) 2014-09-24
IN2014CN03560A (en) 2015-10-09
BR112014011618A2 (en) 2017-05-02
US20130120526A1 (en) 2013-05-16
CN103931200A (en) 2014-07-16

Similar Documents

Publication Publication Date Title
KR101303480B1 (en) Method and apparatus for decoding an enhanced video stream
US11146822B2 (en) Method and apparatus for decoding an enhanced video stream
KR102181994B1 (en) Transmission device, transmission method, reception device, reception method, and reception display method
KR20120036848A (en) Broadcast transmitter, broadcast receiver and 3d video processing method thereof
WO2012081874A2 (en) Signaling method for a stereoscopic video service and apparatus using the method
US10911736B2 (en) MMT apparatus and MMT method for processing stereoscopic video data
US20140078256A1 (en) Playback device, transmission device, playback method and transmission method
US20140125762A1 (en) Transmission device, transmission method, reception apparatus, and reception method
KR101977260B1 (en) Digital broadcasting reception method capable of displaying stereoscopic image, and digital broadcasting reception apparatus using same
US9270972B2 (en) Method for 3DTV multiplexing and apparatus thereof
US20140354770A1 (en) Digital broadcast receiving method for displaying three-dimensional image, and receiving device thereof
WO2013054775A1 (en) Transmission device, transmission method, receiving device and receiving method
US20130120526A1 (en) Association of mvc stereoscopic views to left or right eye display for 3dtv
KR20120036255A (en) Method for transmitting information relating 3d video service in digital broadcast
EP2752018A1 (en) Method of delivering media data from a transmitter to a receiver using extension descriptors
KR20150133620A (en) Method and apparatus for providing three-dimensional territorial brordcasting based on non real time service

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12753616

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2012753616

Country of ref document: EP

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112014011618

Country of ref document: BR

ENP Entry into the national phase

Ref document number: 112014011618

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20140514