EP0783823A4 - Synchronisationsmessung und -regelung von gemultiplexten video- und audiodaten - Google Patents

Synchronisationsmessung und -regelung von gemultiplexten video- und audiodaten

Info

Publication number
EP0783823A4
Authority
EP
European Patent Office
Prior art keywords
data stream
video
system data
audio
bitstream
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP94931269A
Other languages
English (en)
French (fr)
Other versions
EP0783823A1 (de)
Inventor
Stephen G Haigh
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
FutureTel Inc
Original Assignee
FutureTel Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by FutureTel Inc filed Critical FutureTel Inc
Publication of EP0783823A1 publication Critical patent/EP0783823A1/de
Publication of EP0783823A4 publication Critical patent/EP0783823A4/de
Ceased legal-status Critical Current

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4341Demultiplexing of audio and video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43072Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4305Synchronising client clock from received content stream, e.g. locking decoder clock with encoder clock, extraction of the PCR packets

Definitions

  • the present invention relates generally to the technical field of recorded and/or transmitted compressed digital data, and, more particularly, to enabling a subsequent synchronized presentation of video and audio data as they are combined into a single compressed digital data stream.
  • Proper reproduction of a recorded and/or transmitted multimedia program consisting of compressed digitized video data accompanied by associated compressed digitized audio data, requires combining two independent digital data bitstreams into a single, synchronized, serial system data stream that includes both video and audio data.
  • Lack of, or improper, synchronization of the video and audio data in assembling the data into the system data stream, or in decoding and presenting an assembled system data stream, frequently causes a visible image to appear out of synchronization with accompanying sound. For example, a presentation of images showing lip movements of an individual speaking words may not be synchronized with the audible sound of those words.
  • the ISO/IEC 11172 standard defining MPEG compression specified that packets of data, extracted from the compressed video bitstream and from the compressed audio bitstream, are to be interleaved in assembling the system data stream. Furthermore, in accordance with the ISO/IEC 11172 standard a system data stream may include private, reserved and padding streams in addition to compressed video and compressed audio bitstreams. While properties of the system data stream as defined by the MPEG standard impose functional and performance requirements on MPEG encoders and decoders, the system data stream specified in the MPEG standard does not define an architecture for or an implementation of MPEG encoders or decoders. In fact, considerable degrees of freedom exist for possible designs and implementations of encoders and decoders that operate in accordance with the ISO/IEC 11172 standard.
  • a system data stream in accordance with Part 1 of the ISO/IEC 11172 standard includes two layers of data: a system layer which envelops the digital data of a compression layer.
  • the ISO/IEC 11172 system layer is itself divided into two sub-layers, one layer for multiplex-wide operation identified as the "pack layer,” and one for stream-specific operations identified as the "packet layer.”
  • Packs belonging to the pack layer of a system data stream in accordance with the ISO/IEC 11172 standard include headers which specify a system clock reference ("SCR").
  • SCR system clock reference
  • the SCR fixes intended times for commencing decompression of digitized video and audio data included in the compression layer, expressed in units of a 90 kilohertz ("kHz") clock.
  • the ISO/IEC 11172 standard defining the packet layer provides for "presentation time-stamps" ("PTS") and also optional decoding time-stamps ("DTS").
  • PTS presentation time-stamps
  • DTS decoding time-stamps
  • the PTS and DTS specify synchronization for the video and audio data with respect to the SCR specified in the pack layer.
  • the packet layer, which optionally contains both the PTS and DTS, is independent of the data contained in the compression layer defined by the ISO/IEC 11172 standard. For example, a video packet may start at any byte in the video stream. However, the PTS and optional DTS, if encoded into a packet's header, apply to the first "access unit" ("AU") that begins within that packet.
  • AU access unit
  • the MPEG standard ISO/IEC 11172 defines an AU to be the coded representation of a "presentation unit" ("PU").
  • the ISO/IEC 11172 standard further defines a PU as a decoded audio AU or a decoded picture.
  • the standard also defines three (3) different methods, called "Layers" in the standard, for compressing and decompressing an audio signal. For two of these methods, the standard defines an audio AU as the smallest part of the encoded audio bitstream which can be decoded by itself. For the third method, the standard defines an audio AU as the smallest part of the encoded audio bitstream that is decodable with the use of previously acquired side and main information.
  • Part 1 of the ISO/IEC 11172 standard suggests that during synchronized presentation of compressed video and audio data, the reproduction of the video images and audio sounds may be synchronized by adjusting the playback of both compressed digital data streams to a master time base called the system time-clock ("STC") rather than by adjusting the playback of one stream, e.g. the video data stream, to match the playback of another stream, e.g. the audio data stream.
  • STC system time-clock
  • the ISO/IEC 11172 standard suggests that an MPEG decoder's STC may be one of the decoder's clocks (e.g. the SCR, the video PTS, or the audio PTS), the digital storage media (“DSM”) or channel clock, or it may be some external clock.
  • End-to-end synchronization of a multimedia program encoded into an MPEG system data stream occurs: a. if an encoder embeds time-stamps during assembly of the system data stream; b. if video and audio decoders receive the embedded time-stamps together with the compressed data; and c. if the decoders use the time-stamps in scheduling presentation of the multimedia program.
  • a "system header" (“SH"), which occurs at the beginning of a system data stream and which may be repeated within the stream, includes a "s--stem_audio_lock_flag” and a "system_video_lock_flag.”
  • SH system header
  • Setting the system_audio_lock_flag to one (1) indicates that a specified, constant relationship exists between the audio sampling rate and the SCR.
  • Setting the system_video_lock_flag to one (1) indicates that a specified, constant relationship exists between the video picture rate and the SCR. Setting either of these flags to zero (0) indicates that the corresponding relationship does not exist.
  • the ISO/IEC 11172 standard specifically provides that the system data stream may include a padding stream. Packets assembled into the system data stream from the padding stream may be used to maintain a constant total data rate, to achieve sector alignment, or to prevent decoder buffer underflow. Since the padding stream is not associated with decoding and presentation, padding stream packets lack both PTS and DTS values. In addition to the padding stream, "stuffing" of up to 16 bytes is allowed within each packet. Stuffing is used for purposes similar to that of the padding stream, and is particularly useful for providing word (16-bit) or long word (32-bit) alignment in applications where byte (8-bit) alignment is insufficient. Stuffing is the only method of filling a packet when the number of bytes required is less than the minimum size of a padding stream packet.
  • a bitstream of video data compressed in accordance with Part 2 of the ISO/IEC 11172 standard consists of a succession of frames of compressed video data.
  • a succession of frames in an MPEG compressed video data bitstream includes intra ("I") frames, predicted ("P") frames, and bidirectional ("B") frames.
  • Decoding the data of an MPEG I frame without reference to any other data reproduces an entire uncompressed frame of video data.
  • An MPEG P frame may be decoded to obtain an entire uncompressed frame of video data only by reference to a prior decoded frame of video data, either reference to a prior decoded I frame or reference to a prior decoded P frame.
  • An MPEG B frame may be decoded to obtain an entire uncompressed frame of video data only by reference both to a prior and to a successive reference frame, i.e. reference to decoded I or P frames.
  • the ISO/IEC 11172 specification defines as a group of pictures ("GOP") one or more I frames together with all of the P frames and B frames for which the I frame(s) is(are) a reference.
  • In assembling a system data stream, a real-time MPEG encoder must include a system header at the beginning of each system data stream, and that system header must set the system_audio_lock_flag and the system_video_lock_flag to either zero (0) or one (1). If a real-time MPEG encoder specifies that either or both of these flags are to be set, then it must appropriately insure that throughout the entire system data stream the specified, constant relationship exists respectively between the audio sampling rate and the SCR, and between the video picture rate and the SCR. If a compressed audio bitstream encoder operates independently of the rate at which frames of video occur, there can be no assurance that these constant relationships will exist in the encoded data that is to be interleaved into the system data stream.
  • An object of the present invention is to provide a method for assembling a system data stream which permits synchronized presentation of visible images and accompanying sound.
  • the present invention is a method for real-time assembly of an encoded system data stream that may be decoded by a decoder into decoded video pictures and into a decoded audio signal.
  • a system data stream assembled in accordance with the present invention permits a decoder to present the decoded audio signal substantially in synchronism with the decoded video pictures.
  • This system data stream is assembled by interleaving packets of data selected from a compressed audio bitstream with packets of data selected from a compressed video bitstream.
  • the compressed audio bitstream interleaved into the system data stream is generated by compressing an audio signal that is sampled at a pre-specified audio sampling rate.
  • the compressed video bitstream interleaved into the system data stream is generated by compressing a sequence of frames of a video signal having a pre-specified video frame rate.
  • an expected encoded audio-video ratio is computed which equals the pre-specified audio sampling rate divided by the pre-specified video frame rate.
  • a system header ("SH") is then embedded into the system data stream which includes both a system_audio_lock_flag and a system_video_lock_flag that are set to indicate respectively that a specified, constant relationship exists between an audio sampling rate and a system clock reference ("SCR"), and a specified, constant relationship exists between a video picture rate and the SCR.
  • Packets of data are then respectively selected from either the compressed audio bitstream or from the compressed video bitstream for assembly into the system data stream.
  • PTS presentation time-stamp
  • DTS decoding time-stamp
  • an actual encoded audio-video ratio is computed which equals a number that represents a count of all the samples of the audio signal that have been received for compression divided by the total number of frames of the video signal that have been received for compression.
  • an encoded frame error value is then computed by first subtracting the expected encoded audio-video ratio from the actual encoded audio-video ratio to obtain a difference of the ratios. This difference of the ratios is then multiplied by the total number of frames of the video signal that have been received for compression. (The first C sketch at the end of this Definitions section illustrates this arithmetic.)
  • both the pre-specified positive error value and the pre-specified negative error value represent an interval of time which is approximately equal to the time interval required for presenting one and one-half frames of the decoded video pictures.
  • An advantage of the present invention is that it produces a system data stream which may be decoded more easily.
  • Another advantage of the present invention is that it produces a system data stream which may be decoded by a variety of different decoders.
  • Another advantage of the present invention is that it produces a system data stream which may be decoded by comparatively simple decoders.
  • FIG. 1 is a diagram graphically depicting interleaving packets selected from a compressed audio bitstream with packets selected from a compressed video bitstream to assemble a system data stream;
  • FIG. 2 is a block diagram illustrating a video encoder for compressing a sequence of frames of a video signal into a compressed video bitstream, an audio encoder for compressing an audio signal into a compressed audio bitstream, and a multiplexer for interleaving packets selected from the compressed video bitstream with packets selected from the compressed audio bitstream to assemble a system data stream;
  • FIG. 3 is a diagram illustrating a system data stream assembled by interleaving packets selected from a compressed audio bitstream with packets selected from a compressed video bitstream;
  • FIG. 4 is a computer program written in the C programming language which implements the process for determining if all data for an entire frame of the video signal is to be omitted from the system data stream, or if all the data for a second copy of an entire frame of the video signal is to be assembled into the system data stream.
  • Arrows 12a and 12b in FIG. 1 depict interleaving packets selected from a compressed audio bitstream 16 with packets selected from a compressed video bitstream 18 to assemble a serial system data stream 22 consisting of concatenated packs 24.
  • An audio encoder 32, illustrated in the block diagram of FIG. 2, generates the compressed audio bitstream 16 by processing an audio signal, illustrated in FIG. 2 by an arrow 34.
  • the audio encoder 32 generates the compressed audio bitstream 16 by first digitizing the audio signal 34 at a pre-specified audio sampling rate ("PSASR"), and then compressing the digitized audio signal.
  • a video encoder 36 generates the compressed video bitstream 18 by compressing into MPEG GOPs a sequence of frames of a video signal, illustrated in FIG. 2 by an arrow 38, that occur at a pre-specified video frame rate ("PSVFR").
  • the audio encoder 32 is preferably an audio compression engine model no. 96-0003-0002 marketed by FutureTel, Inc. of 1092 E. Arques Avenue, Sunnyvale, California 94086.
  • the video encoder 36 is preferably a video compression engine model no. 96-0002-002 also marketed by FutureTel, Inc.
  • the preferred audio encoder 32 and video encoder 36 are capable of real-time compression respectively of the audio signal 34 into the compressed audio bitstream 16, and the video signal 38 into the compressed video bitstream 18.
  • a system data stream multiplexer 44 repetitively selects a packet of compressed audio data or compressed video data respectively from the compressed audio bitstream 16 or from the compressed video bitstream 18 for interleaved assembly into packs 24 of the system data stream 22 illustrated in FIG. 1.
  • the system data stream multiplexer 44 is preferably a computer program executed by a host microprocessor included in a personal computer (not illustrated in any of the FIGs.) in which the audio encoder 32 and the video encoder 36 are located.
  • the computer program executed by the host microprocessor transfers commands and data to the audio encoder 32 and to the video encoder 36 to produce at pre-specified bitrates the compressed audio bitstream 16 and the compressed video bitstream 18.
  • the sum of the bitrates specified by the computer program for the compressed audio bitstream 16 and the compressed video bitstream 18 is slightly less than the bitrate specified for the system data stream 22.
  • the host microprocessor transfers additional control data to the audio encoder 32 which directs the audio encoder 32 to digitize the audio signal 34 at the PSASR.
  • In addition to transferring control data to the audio encoder 32 and to the video encoder 36 to prepare them for respectively producing the compressed audio bitstream 16 and the compressed video bitstream 18, the computer program executed by the host microprocessor also prepares certain data used in assembling the system data stream 22. In particular with respect to the present invention, the computer program executed by the host microprocessor computes an expected encoded audio-video ratio ("EEAVR") for the system data stream 22 by dividing the PSASR by the PSVFR.
  • EEAVR expected encoded audio-video ratio
  • the system data stream multiplexer 44 repetitively selects a packet of data respectively from the compressed audio bitstream 16 or from the compressed video bitstream 18 for assembly into the packs 24 of the system data stream 22.
  • every pack 24 of the assembled system data stream 22 in accordance with the ISO/IEC 11172 specification has a pre-specified length L.
  • Each pack 24 may have a length L as long as 65,536 bytes.
  • Each pack 24 begins with a pack header 52, designated PH in FIG. 3, which includes the system clock reference (“SCR") value for that particular pack 24.
  • SCR system clock reference
  • SH system header
  • the system header 54 may also be repeated in each pack 24 in the system data stream 22.
  • the system header 54 includes both a system_audio_lock_flag and a system_video_lock_flag.
  • the computer program executed by the host microprocessor sets the system_audio_lock_flag and the system_video_lock_flag to one (1) to indicate respectively that a specified, constant relationship exists between an audio sampling rate and the SCR, and a specified, constant relationship exists between a video picture rate and the SCR.
  • each pack 24 illustrated in FIG. 3 contains a packet 56 of data selected by the system data stream multiplexer 44 either from the compressed audio bitstream 16 or from the compressed video bitstream 18.
  • Each packet 56 includes a packet header, not illustrated in any of the FIGs., which may contain a presentation time stamp ("PTS") and may also include the optional decoding time stamp ("DTS") in accordance with the ISO/IEC 11172 specification.
  • PTS presentation time stamp
  • DTS decoding time stamp
  • the system data stream 22 in accordance with the present invention may also include packs of a padding stream.
  • the system data stream multiplexer 44 will assemble packs from the padding stream into the system data stream 22 to maintain a constant total data rate, to achieve sector alignment, or to prevent decoder buffer underflow.
  • Because the preferred audio encoder 32 generates the compressed audio bitstream 16 by digitizing the audio signal 34 at a pre-specified sampling rate, and then compressing the digitized audio signal to produce the compressed audio bitstream 16 at a pre-specified bitrate, the compressed audio bitstream 16 produced by the preferred audio encoder 32 inherently provides a stable timing reference for assigning the SCR, the PTS and the DTS to the packs 24 of the system data stream 22.
  • the frame rate of the video signal 38 does not provide a stable timing reference for assigning the SCR, the PTS and the DTS.
  • the computer program executed by the host microprocessor fetches data from the audio encoder 32 and the video encoder 36 in addition to packets 56 selected from the compressed audio bitstream 16 or from the compressed video bitstream 18.
  • system data stream multiplexer 44 fetches from a location 62 in the audio encoder 32 a number that represents a running count of all the samples (“NOS") of the audio signal 34 that the audio encoder 32 has received for compression.
  • system data stream multiplexer 44 also fetches from a location 64 in the video encoder 36 a running count of the total number of frames (“NOF") of the video signal 38 that the video encoder 36 has received for compression.
  • the computer program executed by the host microprocessor fetches these two values as close together in time as possible.
  • the system data stream multiplexer 44 then divides NOS by NOF to obtain an actual encoded audio-video ratio ("AEAVR").
  • AEAVR actual encoded audio-video ratio
  • the system data stream multiplexer 44 then first subtracts the previously computed EEAVR from the AEAVR to obtain a difference of ratios ("DOR"). Then the DOR is multiplied by NOF to obtain an encoded frame error value ("EFEV").
  • EFEV represents a difference in time, based upon the pre-specified audio sampling rate, between the actual time for the NOF that have been assembled into the system data stream 22, and the expected time for the NOF that have been assembled into the system data stream 22.
  • PSNEV pre-specified negative error value
  • the system data stream multiplexer 44 assembles into the system data stream 22 a second copy of all the data for an entire B frame in the compressed video bitstream 18.
  • the preferred values for the PSNEV and for the PSPEV represent an interval in time required for the presentation of one and one-half frames of the decoded video pictures. Thus, only if the magnitude of the EFEV represents an interval of time which exceeds the time interval required for the presentation of one and one-half frames of the decoded video pictures will an entire B frame in the compressed video bitstream 18 be omitted from the system data stream 22, or will a second copy of an entire B frame in the compressed video bitstream 18 be assembled into the system data stream 22. (The second C sketch at the end of this Definitions section illustrates this threshold test.)
  • Because each frame in the system data stream 22 in accordance with Part 2 of the ISO/IEC 11172 standard is numbered, if the system data stream multiplexer 44 omits from the system data stream 22 all data for an entire B frame in the compressed video bitstream 18, then the system data stream multiplexer 44 must renumber all subsequent frames in the present GOP accordingly before assembling them into the system data stream 22.
  • If the system data stream multiplexer 44 assembles into the system data stream 22 a second copy of all the data for an entire B frame in the compressed video bitstream 18, then the system data stream multiplexer 44 must number that frame and renumber all subsequent frames from the present GOP accordingly.
  • FIG. 4 is a computer program written in the C programming language which implements the process for determining if all data for an entire frame of the video signal 38 is to be omitted from the system data stream 22, or if all the data for a second copy of an entire frame of the video signal 38 is to be assembled into the system data stream 22.
  • Line numbers 1-8 in FIG. 4 fetch counts from the location 62 in the audio encoder 32, and from the location 64 in the video encoder 36, to establish values for NOF and NOS.
  • Line numbers 13-16 in FIG. 4 implement the computation of EFEV.
  • Line numbers 21-22 in FIG. 4 apply the low pass filter to EFEV.
  • Line numbers 26-36 in FIG. 4 determine whether all data for an entire frame of the video signal 38 is to be omitted from the system data stream 22, or if all the data for a second copy of an entire frame of the video signal 38 is to be assembled into the system data stream 22.
  • In establishing the bitrate for the compressed video bitstream 18, the computer program executed by the host microprocessor sets that bitrate approximately one percent (1%) below a desired nominal bitrate for the system data stream 22 minus the pre-specified bitrate for the compressed audio bitstream 16. Setting the bitrate for the compressed video bitstream 18 one percent (1%) below the desired nominal bitrate provides a sufficient safety margin that the sum of the bitrates for the compressed audio bitstream 16 and the compressed video bitstream 18 plus the overhead of the system data stream 22 should never exceed the maximum bitrate for the system data stream 22 even though occasionally a second copy of all the data for an entire B frame in the compressed video bitstream 18 is assembled into the system data stream 22. (The third C sketch at the end of this Definitions section works one illustrative example of this computation.)
  • the system data stream multiplexer 44 only begins omitting B frames from or adding B frames to the system data stream 22 after the system data stream multiplexer 44 has been assembling the system data stream 22 for several minutes.
  • the system data stream multiplexer 44 inhibits omission or addition of B frames for a short interval of time to avoid erratic operation. Such erratic omission or addition of B frames during the first few minutes of the system data stream 22 is a consequence of dividing one comparatively small number for NOS by another comparatively small number for NOF.
  • a low pass filter is applied to EFEV to further inhibit erratic omission or addition of B frames.
  • Applying a low pass filter to EFEV insures that B frames are omitted from or added to the system data stream 22 only in response to a long term trend in the difference between the EEAVR and the AEAVR, and not due to fluctuations in the values of NOS and NOF, perhaps due to reading one value of either NOS or NOF during one GOP and reading the corresponding value either of NOF or NOS during the immediately preceding or immediately succeeding GOP.
  • the preferred low pass filter applied to EFEV has an asymmetric response. That is, characteristics of the low pass filter cause the filter's output value to return to zero (0) more quickly in response to a zero (0) value for EFEV than the filter's output value departs from zero in response to a non-zero value for EFEV. (One possible form of such a filter appears in the second C sketch at the end of this Definitions section.)
  • the actual response times employed in the low pass filter are determined empirically.
  • the system data stream multiplexer 44 omits from or adds to the system data stream 22 a frame of the compressed video bitstream 18, then the low pass filter's output value is arbitrarily set to zero (0). Setting the low pass filter's output value to zero (0) tends to inhibit the omission of an entire frame of the compressed video bitstream 18 or the addition of a second copy of an entire frame of the compressed video bitstream 18 during processing of immediately succeeding MPEG GOPs.
  • the combination of the preferred audio encoder 32, the preferred video encoder 36, and the system data stream multiplexer 44 in accordance with the present invention permits assembly of virtually any desired system data stream 22 directly and without any intervening processing operations.
  • Phillips Consumer Electronics B.V. Coordination Office Optical and Magnetic Media Systems, Building SA-1, P.O. Box 80002, 5600 JB Eindhoven, The Netherlands has established a specification for Video CD that is colloquially referred to as the "White Book" standard.
  • Phillips' White Book standard specifies a maximum bitrate for the compressed video bitstream 18 of 1,151,929.1 bits per second, an audio sampling frequency of 44.1 kHz, and an audio bitrate of 224 kbits per second.
  • Phillips' White Book standard also specifies that an audio packet is to be 2279 bytes long while a video packet has a length of 2296 bytes, and the system data stream 22 has a pack rate of 75 packs per second.
  • the system data stream multiplexer 44 in accordance with the present invention operating in conjunction with the preferred audio encoder 32 and the preferred video encoder 36, can directly assemble a system data stream 22 in accordance with Phillips' White Book standard from a suitably specified compressed audio bitstream 16 and compressed video bitstream 18 without any intervening operations.
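
The ratio arithmetic described above can be restated compactly. The C sketch below is an illustrative reconstruction from the text, not the program of FIG. 4 (which is not reproduced in this record): EEAVR is the pre-specified audio sampling rate divided by the pre-specified video frame rate, AEAVR is NOS divided by NOF, and EFEV is their difference multiplied by NOF, i.e. the count of audio samples actually received minus the count expected for NOF video frames. The function names and the example figures (44.1 kHz audio, 25 frames per second) are assumptions for illustration.

```c
#include <stdio.h>

/*
 * Illustrative reconstruction of the ratio arithmetic described in the
 * text; the program of FIG. 4 itself is not reproduced in this record.
 *
 * EEAVR = PSASR / PSVFR          (expected audio samples per video frame)
 * AEAVR = NOS / NOF              (actual audio samples per video frame)
 * EFEV  = (AEAVR - EEAVR) * NOF  (error, expressed in audio samples)
 */
static double compute_eeavr(double psasr, double psvfr)
{
    return psasr / psvfr;
}

static double compute_efev(unsigned long nos, unsigned long nof, double eeavr)
{
    double aeavr = (double)nos / (double)nof;   /* actual ratio          */
    double dor   = aeavr - eeavr;               /* difference of ratios  */
    return dor * (double)nof;                   /* encoded frame error   */
}

int main(void)
{
    /* Example figures only: 44.1 kHz audio and 25 frames per second are
     * plausible parameters, not values taken from the patent's figures. */
    double eeavr = compute_eeavr(44100.0, 25.0);   /* 1764 samples per frame */

    unsigned long nos = 4410500UL;   /* hypothetical count from the audio encoder */
    unsigned long nof = 2500UL;      /* hypothetical count from the video encoder */
    double efev = compute_efev(nos, nof, eeavr);

    printf("EEAVR = %.1f samples/frame, EFEV = %.1f samples (%.1f ms)\n",
           eeavr, efev, 1000.0 * efev / 44100.0);
    return 0;
}
```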
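
The asymmetric low-pass filter and the omit/duplicate decision are described above only qualitatively: the filter's response times were determined empirically, PSPEV and PSNEV correspond to roughly one and one-half frame times, and the filter output is reset to zero after an adjustment. The sketch below shows one way these pieces could fit together. The filter coefficients, the sign convention (a positive filtered error triggering duplication of a B frame, a negative one triggering omission), and all identifiers are assumptions, not the contents of FIG. 4.

```c
#include <math.h>
#include <stdio.h>

/* Illustrative control loop for the B-frame omit/duplicate decision.
 * All coefficients, thresholds and the sign convention are assumptions;
 * the patent text gives only the qualitative behaviour. */

#define PSASR 44100.0            /* pre-specified audio sampling rate (Hz)    */
#define PSVFR 25.0               /* pre-specified video frame rate (assumed)  */
#define ALPHA_GROW  0.05         /* slow response while the error is growing  */
#define ALPHA_DECAY 0.25         /* faster return toward zero                 */

static double filtered_efev = 0.0;   /* low-pass filter state, in audio samples */

/* Asymmetric first-order smoother: the output returns toward zero more
 * quickly than it departs from zero, as the text describes. */
static double filter_efev(double efev)
{
    double alpha = (fabs(efev) < fabs(filtered_efev)) ? ALPHA_DECAY : ALPHA_GROW;
    filtered_efev += alpha * (efev - filtered_efev);
    return filtered_efev;
}

/* Returns +1 to duplicate a B frame, -1 to omit one, 0 to leave the GOP
 * unchanged.  PSPEV and PSNEV are one and one-half frame times expressed
 * in audio samples, the unit in which EFEV is computed. */
static int decide_adjustment(double efev)
{
    const double pspev =  1.5 * PSASR / PSVFR;   /* about 2646 samples, 60 ms */
    const double psnev = -1.5 * PSASR / PSVFR;
    double f = filter_efev(efev);

    if (f > pspev) {
        filtered_efev = 0.0;     /* reset after an adjustment, per the text   */
        return +1;               /* duplicate a B frame and renumber the GOP  */
    }
    if (f < psnev) {
        filtered_efev = 0.0;
        return -1;               /* omit a B frame and renumber the GOP       */
    }
    return 0;
}

int main(void)
{
    /* Feed the loop a constant error of half a frame time: the filtered
     * value never crosses the 1.5-frame threshold, so no frame is omitted
     * or duplicated. */
    double half_frame = 0.5 * PSASR / PSVFR;
    for (int i = 0; i < 10; i++)
        printf("iteration %d: decision %d\n", i, decide_adjustment(half_frame));
    return 0;
}
```

With these example parameters the thresholds work out to about ±2646 audio samples, i.e. roughly ±60 ms at 44.1 kHz.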
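
The one-percent safety margin on the video bitrate is a single subtraction and multiplication; the snippet below works one example. Only the 224 kbit/s audio rate and the one-percent figure come from the text; the nominal system bitrate of 1,411,200 bits per second is an assumed placeholder, and the description is read here as video = 0.99 * (nominal system rate - audio rate).

```c
#include <stdio.h>

int main(void)
{
    /* Illustrative only: the nominal system bitrate is an assumed
     * placeholder; the 224,000 bit/s audio rate and the one-percent
     * margin come from the text. */
    const double nominal_system_bps = 1411200.0;
    const double audio_bps          = 224000.0;
    const double video_bps = 0.99 * (nominal_system_bps - audio_bps);

    printf("video bitrate target: %.0f bits per second\n", video_bps);   /* 1175328 */
    return 0;
}
```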

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
EP94931269A 1994-08-29 1994-08-29 Synchronisationsmessung und -regelung von gemultiplexten video- und audiodaten Ceased EP0783823A4 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US1994/009565 WO1996007274A1 (en) 1994-08-29 1994-08-29 Measuring and regulating synchronization of merged video and audio data

Publications (2)

Publication Number Publication Date
EP0783823A1 EP0783823A1 (de) 1997-07-16
EP0783823A4 true EP0783823A4 (de) 1998-12-02

Family

ID=22242901

Family Applications (1)

Application Number Title Priority Date Filing Date
EP94931269A Ceased EP0783823A4 (de) 1994-08-29 1994-08-29 Synchronisationsmessung und -regelung von gemultiplexten video- und audiodaten

Country Status (4)

Country Link
EP (1) EP0783823A4 (de)
JP (1) JPH10507042A (de)
AU (1) AU8009894A (de)
WO (1) WO1996007274A1 (de)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0797197B1 (de) * 1996-03-21 2002-11-06 Kabushiki Kaisha Toshiba Verfahren zur Bildung von Paketen, Aufnahmeträger und Gerät zum Aufzeichnen von Daten variabler Länge
JPH09282849A (ja) * 1996-04-08 1997-10-31 Pioneer Electron Corp 情報記録媒体並びにその記録装置及び再生装置
CA2204828C (en) * 1996-05-10 2004-11-23 Ray Nuber Error detection and recovery for high rate isochronous data in mpeg-2 data streams
US6249319B1 (en) * 1998-03-30 2001-06-19 International Business Machines Corporation Method and apparatus for finding a correct synchronization point within a data stream
JP5483081B2 (ja) * 2010-01-06 2014-05-07 ソニー株式会社 受信装置及び方法、プログラム、並びに受信システム

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA1284211C (en) * 1985-04-29 1991-05-14 Terrence Henry Pocock Cable television system selectively distributing pre-recorder video and audio messages
US4847690A (en) * 1987-02-19 1989-07-11 Isix, Inc. Interleaved video system, method and apparatus
US4849817A (en) * 1987-02-19 1989-07-18 Isix, Inc. Video system, method and apparatus for incorporating audio or data in video scan intervals
US5053860A (en) * 1988-10-03 1991-10-01 North American Philips Corp. Method and apparatus for the transmission and reception multicarrier high definition television signal
DE3942957C2 (de) * 1989-12-23 1994-06-01 Ziegler Hans Peter Vorrichtung zum Einbringen eines dosierten Gasvolumens in einen mit einer Kunststoffschmelze gefüllten Formhohlraum einer Spritzform

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
No further relevant documents disclosed *

Also Published As

Publication number Publication date
EP0783823A1 (de) 1997-07-16
JPH10507042A (ja) 1998-07-07
AU8009894A (en) 1996-03-22
WO1996007274A1 (en) 1996-03-07

Similar Documents

Publication Publication Date Title
US5874997A (en) Measuring and regulating synchronization of merged video and audio data
CA2278376C (en) Method and apparatus for adaptive synchronization of digital video and audio playback in a multimedia playback system
US6873629B2 (en) Method and apparatus for converting data streams
TW580810B (en) Method and apparatus for converting data streams
US6339760B1 (en) Method and system for synchronization of decoded audio and video by adding dummy data to compressed audio data
KR100694164B1 (ko) 재생 방법 및 그 기록매체
JPH08168042A (ja) データ復号化装置およびデータ復号化方法
US8045836B2 (en) System and method for recording high frame rate video, replaying slow-motion and replaying normal speed with audio-video synchronization
US7359621B2 (en) Recording apparatus
JPH08237650A (ja) データバッファの同期システム
JP3429652B2 (ja) ディジタル符号化多重化装置
JP2008123693A (ja) 再生装置、再生方法及びその記録媒体
US6754273B1 (en) Method for compressing an audio-visual signal
CN113490047A (zh) 一种Android音视频播放方法
EP0783823A1 (de) Synchronisationsmessung und -regelung von gemultiplexten video- und audiodaten
JPH1118051A (ja) Iフレーム抽出方法
JP4534168B2 (ja) 情報処理装置および方法、記録媒体、並びにプログラム
US20130287361A1 (en) Methods for storage and access of video data while recording
JP2004040579A (ja) デジタル放送受信装置、およびデジタル放送同期再生方法
CA2197559A1 (en) Measuring and regulating synchronization of merged video and audio data
JP2008176918A (ja) 再生装置、再生方法及びその記録媒体
Lu et al. Mechanisms of MPEG stream synchronization
Kanai et al. MPEG2 3D player system
JP2004248104A (ja) 情報処理装置及び情報処理方法
US20090110364A1 (en) Reproduction apparatus and reproduction method

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 19970221

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): DE FR GB IT NL

A4 Supplementary search report drawn up and despatched

Effective date: 19981014

AK Designated contracting states

Kind code of ref document: A4

Designated state(s): DE FR GB IT NL

RTI1 Title (correction)

Free format text: METHOD FOR REAL-TIME ASSEMBLY OF OF AN ENCODED SYSTEM DATA STREAM

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

RTI1 Title (correction)

Free format text: METHOD FOR REAL-TIME ASSEMBLY OF AN ENCODED SYSTEM DATA STREAM

17Q First examination report despatched

Effective date: 20001005

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20010408