US7139316B2 - System method and apparatus for seamlessly splicing data - Google Patents
System method and apparatus for seamlessly splicing data Download PDFInfo
- Publication number
- US7139316B2 US7139316B2 US10/282,784 US28278402A US7139316B2 US 7139316 B2 US7139316 B2 US 7139316B2 US 28278402 A US28278402 A US 28278402A US 7139316 B2 US7139316 B2 US 7139316B2
- Authority
- US
- United States
- Prior art keywords
- picture
- stream
- encoding
- encoded
- spliced
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime, expires
Links
- 238000000034 method Methods 0.000 title claims description 96
- 239000013598 vector Substances 0.000 claims abstract description 65
- 230000000694 effects Effects 0.000 claims abstract description 4
- 230000008569 process Effects 0.000 claims description 80
- 230000006866 deterioration Effects 0.000 claims description 11
- 238000001514 detection method Methods 0.000 claims description 8
- 238000013139 quantization Methods 0.000 abstract description 11
- 230000015556 catabolic process Effects 0.000 abstract description 4
- 230000015654 memory Effects 0.000 description 23
- HKSZLNNOFSGOKW-HMWZOHBLSA-N staurosporine Chemical compound C12=C3N4C5=CC=CC=C5C3=C3CNC(=O)C3=C2C2=CC=CC=C2N1[C@@H]1C[C@H](NC)[C@H](OC)[C@@]4(C)O1 HKSZLNNOFSGOKW-HMWZOHBLSA-N 0.000 description 11
- 230000008859 change Effects 0.000 description 10
- 238000007906 compression Methods 0.000 description 8
- 230000006835 compression Effects 0.000 description 7
- 238000010586 diagram Methods 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 4
- 230000002452 interceptive effect Effects 0.000 description 4
- 230000003044 adaptive effect Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000002457 bidirectional effect Effects 0.000 description 2
- 230000006837 decompression Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/23424—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for inserting or substituting an advertisement
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
- H04N5/91—Television signal processing therefor
- H04N5/92—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
- G11B27/034—Electronic editing of digitised analogue information signals, e.g. audio or video signals on discs
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
- G11B27/036—Insert-editing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/114—Adapting the group of pictures [GOP] structure, e.g. number of B-frames between two anchor frames
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/142—Detection of scene cut or scene change
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/177—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/40—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/577—Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/23406—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving management of server-side video buffer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44004—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving video buffer management, e.g. video decoder buffer or video display buffer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44016—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
- H04N21/440254—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering signal-to-noise parameters, e.g. requantization
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/24—Systems for the transmission of television signals using pulse code modulation
- H04N7/52—Systems for transmission of a pulse code modulated video signal with one or more other pulse code modulated signals, e.g. an audio signal or a synchronizing signal
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B2220/00—Record carriers by type
- G11B2220/20—Disc-shaped record carriers
- G11B2220/25—Disc-shaped record carriers characterised in that the disc is based on a specific recording technology
- G11B2220/2537—Optical discs
- G11B2220/2562—DVDs [digital versatile discs]; Digital video discs; MMCDs; HDCDs
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/149—Data rate or code amount at the encoder output by estimating the code amount by means of a model, e.g. mathematical model or statistical model
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/15—Data rate or code amount at the encoder output by monitoring actual compressed data size at the memory before deciding storage at the transmission buffer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
Definitions
- the present invention relates to an editing system, method and apparatus for editing images and, more particularly, an editing system, method and apparatus for seamlessly splicing a plurality of bit streams of video data.
- High quality recording/reproducing systems compression-encode/decode the audio/video data utilizing the MPEG (Moving Picture Experts Group) standard.
- MPEG Motion Picture Experts Group
- One example of such a system is the DVD (Digital Versatile Disk or Digital Video Disk), which provides a powerful means by which unprecedented quantities of high quality audio/video are compressed on an optical disk.
- FIG. 1 illustrates the general recording/reproducing system.
- the video encoder ill of the encoding-side apparatus 110 encodes input video data D V in accordance with the MPEG standard to thereby produce a video elementary stream (video ES).
- the packetizer 112 packetizes the video elementary stream into a video packetized elementary stream (video PES) comprising access units; each access unit representing a picture in a group of pictures making up a portion of the video program.
- the audio encoder 113 of the encoding-side apparatus encodes input audio data D A to thereby produce an audio elementary stream (audio ES)
- the packetizer 114 formats the audio elementary stream into an audio packetized elementary stream (audio PES) comprising access units; each access unit represent decodable segment of an audio bit stream.
- the transport stream multiplexer 115 multiplexes the audio and video packetized elementary streams to thereby produce a transport stream packet.
- a Video Buffer Verifier (VBV) buffer (not shown) stores/retrieves the multiplexed streams at a variable target rate which is controlled in accordance with the number of bits to be encoded and the capacity of the VBV buffer. An illustration of the Video Buffer Verifier is provided with reference to FIG. 2 .
- the decoding-side apparatus 120 of FIG. 1 stores in a decoding-side Video Buffer Verifier (VBV) buffer (not shown) the received transport stream which is transmitted via the transmission medium 116 .
- the transport stream demultiplexer 121 demultiplexes the received transport stream fetched from the decoding buffer at a timing determined by a decoding time stamp (DTS) to thereby reproduce the video packetized elementary stream (video PES) and the audio packetized elementary stream (audio PES).
- DTS decoding time stamp
- the video packetized elementary stream is depacketized by depacketizer 122 and decoded by video decoder 123 thereby reproducing the video data D V .
- the audio packetized elementary stream is depacketized by depacketizer 124 and decoded by audio decoder 125 thereby reproducing the audio data D A .
- the transport stream multiplexer 115 and the transport stream demultiplexer 121 are respectively replaced with a program stream multiplexer and demultiplexer which DVD format/unformat the encoded bit streams.
- the recording/reproducing system of FIG. 1 it is desirable to seamlessly splice a plurality of bit streams by concentrating at the transport level two or more different elementary streams representing the merger of different video programs.
- editors at a broadcasting station splice a plurality of bit streams from different video sources such as, for example, live video feeds received from local stations for generating a spliced broadcast video program.
- the director splices movie scenes to be recorded on the DVD optical disk.
- the DVD decoder splices multiple bit streams reproduced from the DVD optical disk in response to user-entered actions which is particularly useful for generating alternate scenes for interactive movies and video games.
- the MPEG standard implements a compression process which includes motion-compensated predictive coding in conjunction with adaptive Discrete Cosine Transform (DCT) quantization.
- the motion-compensated predictive coding predicts motion in each image frame/field using both unidirectional and bidirectional motion prediction.
- the DCT quantization adaptively compresses each frame/field in accordance with the motion-compensated prediction.
- frames hereinafter refers to pictures in general including frames as well as fields.
- motion-compensated prediction of the MPEG compression standard classifies the frames into one of three types: intracoded-frames (I-frames), predictively coded frames (P-frames) and bi-directionally coded frames (B-frames).
- MPEG establishes the I-frames as the reference by which the B- and P-frames are encoded and, thus, preserves the I-frames as complete frames.
- the I-frames are considered “intra-coded” since they proceed as complete frames, having bypassed the motion-compensated prediction, to the DCT quantization whereupon each I-frame is compression encoded with reference only to itself.
- FIG. 7 illustrates an example of the direction of prediction for each I, B and P-frame in a group of pictures (GOP) as indicated by the arrows in the figure.
- frames are arranged in ordered groups of pictures (GOP), each group of pictures comprising a closed set of I-, B- and P-frames which are encoded with reference to only those frames within that group.
- FIG. 3( a ) illustrates the natural presentation order ( 1 to 15 ) of the GOP in which the pictures are naturally presented to the viewer. Since the B- and P-frames within the GOP are encoded with reference to other frames, the MPEG standard dictates that the natural presentation order shown in FIG. 3( a ) be rearranged into the decoding order shown in FIG. 3( b ) in which the frames are to be decoded and transmitted in the coded order shown in FIG. 3( c ).
- the frames necessary for decoding other frames are first decoded to provide the basis upon which the following inter-coded frames are decoded.
- an I-frame which forms the reference by which the following frames in the GOP are motion-compensation predicted is positioned first in the decoding order.
- the pictures are rearranged in their natural presentation order for display to the viewer.
- Motion-compensated predictive coding divides each I-, B- and P-frame into 8 ⁇ 8 pel macroblocks.
- the motion vectors for a present frame are motion-compensation predicted with reference to the motion vectors of another frame which is selected in accordance with the direction of prediction of the type of frame (e.g., I-, B- or P-frame)
- P-frame macroblocks are motion-predicted with reference to the macroblocks in a previous I or P-frame
- B-frame macroblocks are motion-predicted with reference to the previous/successive I- and/or P-frames.
- the I-frames which are not inter-coded, bypass motion compensation and are directly DCT quantized.
- FIGS. 4( a )–( e ) The process for motion-predicting a current picture in a GOP is illustrated in FIGS. 4( a )–( e ).
- the GOP are input in the natural presentation order shown in FIG. 4( a ), rearranged in accordance with the decoding order shown in FIG. 4( b ), motion-predicted utilizing two frame memories (FM 1 , FM 2 ) as shown in FIGS. 4( c ) and ( d ) and output in the form of the encoding stream (ES) shown in FIG. 4( e ).
- the I-frame ( 13 ) of FIG. 4( b ) is intra-coded and, therefore, output directly to the encoding stream (ES); the B-frame (B 1 ) of FIG.
- FIG. 4( b ) is motion predicted with reference to the I-frame (I 3 ) stored in the first frame memory (FM 1 ) of FIG. 4( c ) and the P-frame (P) stored in the second frame memory (FM 2 ) of FIG. 4( d ); the P-frame (P 6 ) of FIG. 4( b ) is motion predicted with reference to the I-frame (I 3 ) stored in the first frame memory (FM 1 ) of FIG. 4( c ). From the foregoing illustration, it is apparent that a minimum of two frame memories are needed for bi-directional motion prediction.
- each macroblock is Discrete Cosine Transform (DCT) encoded. More particularly, the macroblocks are transformed from pixel domain to the DCT coefficient domain.
- DCT Discrete Cosine Transform
- adaptive quantization is performed on each block of DCT coefficients in accordance with a variable quantization step size. After adaptive quantization is applied to the DCT coefficients, the coefficients undergo further compression involving such techniques as differential coding, run-length coding or variable length coding.
- the encoded data is stored/retrieved to/from the Video Buffer Verifier (VBV) buffer at a controlled target bit rate in the form of a serial bit stream.
- VBV Video Buffer Verifier
- FIG. 2 illustrates a locus of the data occupancy of the VBV buffer wherein the bits (oordinate) of the I-, B- and P-frames are stored in the VBV buffer along a time axis (presentation time T p -abscissa) at a transmission bit rate (inclination 131 ) and output from the VBV buffer as indicted by the vertical lines.
- the VBV buffer is considered a “virtual” buffer because it emulates the buffer on the decoding side.
- the I-frames in FIG. 2 require four times the amount of storage time (VBV buffer delay) as the P-frames and twice the B-frames. For that matter, care must be taken that the varied amount of bits in a GOP does not cause an overflow when the number of bits exceeds the buffer capacity (upper-hatched line) or an underflow when the number of bits drops below a predetermined minimum number (lower-hatched line) which will sustain an efficient encoding/decoding process.
- VBV buffer delay amount of storage time
- the decoding process for decoding the transmitted group of pictures is explained.
- the coded order shown in FIG. 5( a ) is received by the decoding side apparatus 120 ( FIG. 1) and stored in the decoding-side VBV buffer.
- the transport stream demultiplexer 121 demultiplexes the stream into the packetized elementary stream illustrated in FIG. 5( b ).
- the GOP are decoded by fetching the compressed picture data from the decoding-side buffer at a timing determined by the decoding time stamp (DTS), de-compressing the fetched picture data and reconstructing each I-, B- and P-frame from the decompressed picture data. It will be appreciated that the I-frames are complete upon decompression.
- DTS decoding time stamp
- the B- and P-frames are reconstructed by motion estimating the previously decoded frames based on the decompressed motion vectors of the current B- or P-frame. Afterwards, the decoded frames are rearranged in their original presentation order for display as shown in FIG. 5( c ).
- the decoding-side apparatus requires relatively less hardware complexity than the encoding-side, the wisdom of the MPEG encoding/decoding scheme will be immediately recognized.
- the complex hardware necessary to perform motion prediction is not a part of the decoding-side apparatus since the decoder need only apply the motion vectors to the encoded pictures.
- the high quality audio/video is, thus, generated by a high-end encoder for distribution enmasse to numerous, considerably less-complex (and less-expensive) decoders.
- FIGS. 6( a )–( d ) The motion decoding process is illustrated in FIGS. 6( a )–( d ) wherein FIG. 6( a ) shows the coded video elementary stream (ES) which is supplied to the decoder.
- a first frame memory (FM 1 ) as illustrated in FIG. 6( b ) stores a first previously-decoded picture for decoding the current picture.
- a second frame memory (FM 2 ) as illustrated in FIG. 6( c ) stores a second previously-decoded picture for decoding the current picture.
- the decoded I-frame (I 3 ) (first picture in the ES of FIG.
- the B-frame (B 1 ) is decoded by motion estimating the frames in the frame memories (FM 1 , FM 2 ) based on the motion vectors of B 1 .
- the decoded GOP are output in the presentation order illustrated in FIG. 6( d )
- the difficulties confronted when splicing coded streams will be better appreciated.
- the bit streams must be decoded. This is because the prediction direction of the first stream may be inconsistent with that of the second.
- the selected direction of prediction (forward/backward) for the B-frames mutually effects the prediction direction of other B-frames and, for that matter, defines which frames are selected for the motion prediction throughout the GOP.
- the prediction direction for a frame in the first coded bit stream may be decoded with reference to a frame with an inconsistent prediction direction in the second coded bit stream.
- discontinuity migrates to other frames in motion estimation, consequently effecting the motion estimation decoding of the GOP as a whole.
- This discontinuity manifests as visible macroblocks on the display when, for example, the channel of a digital television is changed.
- bit streams In order to prevent discontinuity, it is suggested to decode the bit streams before splicing.
- the bit streams When the bit streams are decoded, the frames thereof are not motion predicted, i.e., not encoded with reference to other frames and thus are not subject to the discontinuity of the foregoing method.
- the spliced bit stream must be re-encoded. Since MPEG coding is not a 100% reversible process, the signal quality is deteriorated when re-encoding is performed. The problem is compounded because the re-encoding process encodes a decoded signal, i.e., a degraded version of the original audio/video signal.
- FIGS. 8( a )–( d ) illustrate the ideal case where no problems arise in the presentation order of the spliced stream ST SP .
- stream ST A of FIG. 8( a ) is spliced at the splicing point SPA with stream ST B of FIG. 8( b ) at the splicing point SP B .
- the spliced bit stream ST SP of FIG. 8( c ) presents the pictures of stream ST A followed by the pictures of stream ST B without problem.
- FIGS. 9( a ) to ( d ) illustrate the problem where the decoder rearranges the presentation order of the spliced bit stream.
- Stream ST A of FIG. 9( a ) is bit-spliced with stream ST B of FIG. 9( b ) at respective splicing positions (SP A , SP B ).
- the decoder on the decoding-side rearranges the order of presentation of the frames of the spliced bit stream ST SP ( FIG. 9( c )) such that, in this example, the last frame (P-frame) in bit stream ST A is inserted at the third-picture position of stream ST B . This appears visually as an arbitrary picture inserted in the video program.
- the second problem arises in motion estimation upon decoding of the spliced bit stream.
- the motion estimation reconstructs the pictures of stream ST A of the spliced bit stream ST SP of FIG. 10( a ) with reference to only those frames from that stream. This is indicated by the arrows in FIG. 10( b ) which represent the motion estimation direction.
- stream ST B is motion estimated with reference to only those pictures in that stream.
- FIGS. 11( a ) and ( b ) illustrate the problem of crossover motion estimation.
- the P-frame in stream ST A is based on frames in stream ST B as illustrated by the hatched arrows labeled “NG” in FIG. 11( b ).
- the P-frame in stream ST B is reconstructed from the wrong picture which appears visually as a distorted image.
- This problem is propagated through the GOP as shown in FIGS. 12( a ), ( b ) when the incorrectly-estimated P-frame of stream ST B is utilized by the decoder to motion estimate other frames. This results in a number of distorted pictures which are quite noticeable.
- FIGS. 13( a ) to 18 ( b ) illustrate the third problem of underflow/overflow related to splicing bit streams.
- the ideal case is illustrated in FIGS. 13( a )–( d ) wherein three streams (ST A , ST B , ST C ) are spliced at splicing points SP V and a buffer occupancy V OC .
- FIG. 13( a ) illustrates the locus of the data occupancy of the video buffer verifier (VBV) buffer on the decoding side wherein I-, B- and P-frames are stored in the VBV buffer.
- FIG. 13( b ) illustrates the spliced stream ST SP , FIG.
- FIG. 13( c ) the timing at which each of the pictures is generated after rearrangement and FIG. 13( c ) the order of the pictures after the decoding operation.
- FIG. 13( a ) the instant case does not present a problem of overflow (upper-hatched line) or underflow (lower-hatched line).
- bit streams ST A , ST B do not pose an overflow/underflow problem as will be appreciated from FIGS. 14( a ), 15 ( a ).
- FIGS. 16( a ), ( b ) when the bit streams ST A , ST B are spliced as illustrated in FIGS. 16( a ), ( b ) at a splicing point SP V an overflow/underflow condition occurs.
- the overflow condition which is illustrated in FIGS. 17( a ), ( b ) occurs when bit stream ST B continues to fill the VBV buffer to a point where the VBV buffer overflows as indicated at 141 in FIG. 17( a ).
- the underflow case which is illustrated in FIGS.
- the present invention there is provided a system, method and apparatus for splicing a plurality of bit streams.
- the present invention inhibits a picture in the spliced bit stream which, upon decoding, would be out of sequence. In this manner, the present invention prevents an improper reordering of the spliced bit stream pictures on the decoding side.
- the present invention selectively reuses motion vector information fetched from the source coded streams for use in the re-encoding process.
- the new motion vectors are supplied to the motion compensation portion of the re-encoder in place of the original motion vectors.
- the present invention sets the direction of prediction to a picture which is positioned adjacent the splicing point thereby preventing degradation in image quality.
- the present invention has a capability of changing the picture type of a picture in the vicinity of the splicing point in order to prevent erroneous motion prediction from pictures from another bit stream source.
- the overflow/underflow condition occurs owing to an improper selection of the target bit rate for the spliced bit stream.
- the target amount of bits is calculated anew for the spliced bit stream.
- the target amount of bits is calculated by reference to a quantizing characteristic produced in a previous coding process which may be retrieved from the source coded streams. In the alternative, the target amount is approximated.
- the plural bit streams are decoded in the region of the splicing point(s) and re-encoded in accordance with the new target bit rate.
- seamlessly-spliced bit streams are provided without signal deterioration arising from improper reordering of the frames, picture distortion due to improper motion estimation or a breakdown in the video verifier (VBV) buffer due to improper selection of the target bit rate.
- the present invention is applicable to a wide range of applications including, for example, an editing system for generating seamless bit streams on the fly from video feeds of various sources for broadcast by a broadcasting station, a DVD system, a system for providing interactive movies, a video game system for generating alternative user-directed scenes of a video game or a system for encoding/decoding audio/video feeds for on-line transmission.
- FIG. 1 is a block diagram of a recording/reproducing system
- FIG. 2 illustrates the operation of a VBV buffer
- FIGS. 3( a )–( c ) illustrate the operation of an encoder
- FIGS. 4( a )–( e ) illustrate the operation of the frame memories of the encoder
- FIGS. 5( a )–( c ) illustrate the operation of a decoder
- FIGS. 6( a )–( d ) illustrate the operation of the frame memories of the decoder
- FIG. 7 illustrates the prediction direction for encoding/decoding
- FIGS. 8( a )–( d ) illustrate the bit splicing operation
- FIGS. 9( a )–( d ) illustrate the reordering of the spliced bit stream
- FIGS. 10( a ), ( b ) illustrate motion estimation of the spliced bit stream
- FIGS. 11( a )– 12 ( d ) illustrate motion compensation crossover in the spliced bit stream
- FIGS. 13( a )–( d ) illustrate the operation of the video buffer verifier
- FIGS. 14( a ), ( b ) illustrate the operation of the video buffer verifier storing stream ST A ;
- FIGS. 15( a ), ( b ) illustrate the operation of the video buffer verifier storing stream ST B ;
- FIGS. 16( a ), ( b ) illustrate the video buffer verifier storing the spliced bit stream
- FIGS. 17( a ), ( b ) illustrate an overflow of the video buffer verifier
- FIGS. 18( a ), ( b ) illustrate an underflow of the video buffer verifier
- FIG. 19 illustrates the present invention
- FIG. 20 illustrates the block diagram of the encoder and decoder of FIG. 19 ;
- FIGS. 21( a ), ( b ) illustrate the re-encoding operation of the present invention
- FIGS. 22( a )–( d ) illustrate the operation of decoding the bit streams according to the present invention
- FIGS. 23( a ), ( b ) illustrate the splicing operation of the present invention
- FIGS. 24( a ), ( b ) illustrate streams ST A , ST B for splicing in accordance with the present invention
- FIGS. 25( a )–( d ) illustrate the decoding operation in accordance with the present invention
- FIGS. 26( a ), ( b ) illustrate the spliced bit stream in accordance with the present invention
- FIGS. 27( a ), ( b ) illustrate an underflow of the video buffer verifier
- FIGS. 28( a ), ( b ) illustrate the prevention of underflow in accordance with the present invention
- FIGS. 29( a ), ( b ) illustrate an overflow of the video buffer verifier
- FIGS. 30( a ), ( b ) illustrate the prevention of overflow in accordance with the present invention
- FIG. 31 presents a flow diagram of the present invention
- FIG. 32 illustrates a continuation of the flow diagram of FIG. 31 .
- FIG. 19 illustrates the present invention. It will be appreciated from the figure that the present invention receives a plurality of bit streams, in this case streams A and B (ST A and ST B ), which are selectively spliced in accordance with the bit splicing technique hereinafter described.
- the present invention is applicable to both the encoding and decoding sides and, as such, may optionally include encoders 1 A and 1 B for respectively encoding video data VD A and VD B which produce the bit streams ST A and ST B .
- the output spliced bit stream ST SP of the present invention complies with the MPEG standard and is of course acceptable for any MPEG encoding/decoding system.
- the present invention is transparent to the end-viewer and, for that reason, is marketably attractive since the decoder may not need to be upgraded to receive the spliced bit streams of the present invention.
- the present invention is not limited to splicing one particular type of bit stream, but may of course be applied to any type of bit stream including, for example, the elementary stream, the packetized elementary stream and the transport stream.
- FIG. 19 illustrates that the streams ST A and ST B are input to a buffer memory 10 , a stream counter 11 and a stream analyzing portion 12 .
- the stream counter 11 counts the number of bits in each of the streams ST A and ST B whilst the stream analyzing portion 12 analyzes the syntax of each of the streams.
- a splice controller 13 controls the bit splicing operation of the present invention as will be described in more detail.
- MPEG decoders 14 A and 14 B decode the streams ST A and ST B retrieved from the buffer memory 10 which output respective base-band video data to a switch 15 .
- the switch 15 outputs either stream ST A or ST B to an MPEG encoder 16 .
- the MPEG encoder 16 at the control of the splice controller 13 , encodes the video base-band data selected by the switch 15 to thereby output a re-encoded bit stream ST RE .
- a switch 17 as controlled by the splice controller 13 , selectively outputs either the bit streams ST A , ST B retrieved from the buffer memory 10 or the re-encoded bit stream ST RE to thereby output the spliced bit stream ST SP .
- the stream counter 11 counts the number of bits of each of the received streams ST A and ST B and supplies the count value to the splice controller 13 .
- the number of bits of the streams is counted because the locus of the data occupancy of the video buffer verifier needs to be controlled to prevent overflow/underflow.
- the stream analyzing portion 12 analyzes the syntax of each of the streams to fetch appropriate information from the layers of the bit streams including the sequence layer, the GOP layer, the picture layer and the macroblock layer. For example, encoded information such as the picture type (I, B or P), motion vectors, quantizing steps and quantizing matrices are retrieved by the stream analyzing portion.
- the splice controller 13 based on the count value from the stream counter 11 and the information from the stream analyzing portion 12 , sets a re-encoding range for each bit stream in accordance with the range parameters n 0 and m 0 . Likewise, the splicing point(s) are set in accordance with the splice point parameter p 0 (s).
- the splice controller 13 controls the timing of the switch 15 to select the appropriate bit stream ST A or ST B to be sent to the MPEG encoder 16 in accordance with the splicing point parameter p 0 and the range parameters n 0 , m 0 .
- the phase and timing of the bit streams are controlled by the splice controller to coincide at the predetermined splicing point(s).
- the splice controller 13 controls the switch 17 to select the bit streams ST A and ST B normally.
- the re-encoded bit stream ST RE produced by the MPEG encoder 16 is selected during the re-encoding range in accordance with the parameters n 0 , m 0 and p 0 .
- FIG. 20 illustrates in more detail the MPEG decoding/encoding section of the present invention wherein reference numeral 14 generally indicates the MPEG decoders 14 A, B and reference numeral 16 generally indicates the MPEG encoder 16 shown in the previous figure.
- the decoding section 14 of the figure essentially performs MPEG decoding utilizing a decompression section for decompressing the input stream ST including a variable-length decoding circuit (VLD) 21 , an inverse quantization circuit (IQ) 22 and an inverse discrete cosine transform circuit (IDCT) 23 .
- the motion estimation section of the decoding section 14 includes an addition circuit 24 for adding the decompressed bit-stream motion prediction coefficients to the motion estimation coefficients produced in the motion estimation section of the decoder.
- a switch 25 alternates between selecting the decompressed data corresponding to the I-frames, which bypass motion estimation, and the motion estimated data output from the addition circuit 24 .
- the motion estimation section performs motion estimation utilizing frame memories (FM 1 , FM 2 ) 26 , 27 and a motion compensation section (MC) in accordance with the operation described with reference to FIGS. 6( a )–( d ).
- the encoding section 16 shown in FIG. 20 encodes the decoded video data output from the MPEG decoders 14 A and 14 B in accordance with the operations of the splice controller 13 .
- An encoder's previous processing circuit 30 preprocesses the decoded video data by rearranging the pictures of the decoded-video date in accordance with the bidirectional predictive coding process, forms pixel macroblocks and calculates the difficulty in coding each picture.
- the encoder's previous processing circuit 30 forms 16 ⁇ 16 pixel macroblocks.
- the encoding section 16 further incorporates a subtraction circuit 31 for subtracting a motion prediction error from the input decoded video data, a switch 32 for bypassing the motion-compensated prediction process in the case of I-frames and a compression/motion-compensated prediction section.
- the compression portion includes a discrete cosine transform circuit (DCT) 33 , a quantizing circuit (Q) 34 and a variable-length coding circuit (VLC) 35 .
- DCT discrete cosine transform circuit
- Q quantizing circuit
- VLC variable-length coding circuit
- the motion-compensated prediction portion predicts the motion within the B- and P-frames of the input decoded video data.
- the compressed bit stream is decompressed by application to an inverse quantizing circuit (IQ) 36 followed by an inverse discrete cosine transform circuit (IDCT) 37 .
- the decompressed bit stream is added by the addition circuit 38 to the motion-compensated version of the picture in order to reconstitute the current frame.
- the frame memories FM 1 , FM 2 ( 39 , 40 ) store the appropriate reconstructed frames at the control of the motion detection circuit 42 in accordance with the type of predictive coding (B- or P-frame encoding).
- the motion compensation circuit 41 performs motion compensation in accordance with the frame(s) stored in the frame memories (FM 1 , FM 2 ) 39 , 40 based on the motion vectors provided by the motion detection circuit 42 .
- the motion compensated picture which is essentially a prediction of the current frame, is subtracted from the actual current frame by the subtraction circuit 31 . It will be appreciated that the output of the subtraction circuit 31 is essentially an error result representing the difference between the actual frame and the prediction.
- An encode controller 43 provides substitute motion vectors and controls a switch 44 in order to select between the motion vectors determined by the motion detection circuit 42 and the substitute motion vectors.
- the decoding section 14 decodes the input stream ST preferably in accordance with the MPEG standard.
- the encoder's previous processing circuit 30 rearranges the pictures for encoding in accordance with the picture type information extracted by the stream analyzing circuit 12 and forms picture data into macroblocks.
- the rearranged pictures are forwarded to the encoding section 16 of the figure for encoding.
- the splice controller 13 forwards the encoded information, more particularly the motion vectors, which are extracted by the stream analyzing circuit to the encode controller 43 .
- the encode controller 43 causes the switch 44 to select the motion vectors supplied thereto.
- the encode controller 43 causes the switch 44 to select the motion vectors produced by the motion detection circuit 42 .
- the encode controller 43 controls the frame memories (FM 1 , FM 2 ) 39 , 40 to store the appropriate pictures required to produce the predictive image data based on the substitute motion vectors and in accordance with the picture type of the current picture to be encoded.
- the encode controller 43 controls the quantization step size of the quantizing circuit 34 and the inverse quantization circuit 36 to accommodate the motion vectors in accordance with the target bit rate supplied by the splice controller 13 .
- the encode controller 43 moreover, controls the variable-length coding circuit 35 .
- the encode controller 43 adds dummy data to the variable-length coding circuit 35 in order to account for the shortage with respect to the target amount of bits.
- the encode controller 43 performs a skipped macroblock process (ISO/IEC 13818-27.6.6) which interrupts the coding process in terms of macroblock units when it is determined that the variable-length coding circuit 35 generates an amount of bits that is relatively larger than the target amount of bits which warns of an overflow.
- a skipped macroblock process ISO/IEC 13818-27.6.6
- FIGS. 21( a ), ( b ) illustrate the process of selecting the video data to be re-encoded (also referred to as “presentation video data”) representing those portions of the bit streams ST A ( FIG. 21( a )) and ST B ( FIG. 21( b )) decoded respectively by the decoders 14 A and 14 B.
- the pictures comprising the presentation video data are selected to include those pictures within the re-encoding ranges as defined by the parameters n 0 and m 0 .
- a picture at the splicing point corresponding to stream ST A is expressed as A n ⁇ P0 , wherein n is an integer and p 0 is the splicing point parameter. Following this convention, pictures which are future to the picture at the splicing point are expressed as A (n ⁇ P0)+1 , A (n ⁇ P0)+2 , A (n ⁇ P0)+3 , A (n ⁇ P0)+4 . . . A (n ⁇ P0)+n0 , wherein n o is the range parameter defining the range of the presentation video data corresponding to bit stream ST A .
- pictures more previous than the picture A n ⁇ P0 at the splicing point are expressed as A (n ⁇ P0) ⁇ 1 , A (n ⁇ P0) ⁇ 2 , A (n ⁇ P0) ⁇ 3 , A (n ⁇ P0) ⁇ 4 , and so on.
- the presentation video data corresponding to the stream ST B at the splicing point is expressed as B (m ⁇ P0) and the pictures in the re-encoding range defined by the parameter m 0 are expressed as B (m ⁇ P0)+1 , B (m ⁇ P0)+2 , B (m ⁇ P0)+3 , B (m ⁇ P0)+4 . . .
- the range of pictures in each respective bit stream ST A , ST B are indicated by the ranges for re-encoding (n 0 , m 0 ).
- the re-encoding ranges include the pictures from picture A (n ⁇ P0)+n0 to picture A (n ⁇ P0) and pictures from picture B (m ⁇ P0) to picture B (m ⁇ P0) ⁇ m0 .
- Each decoder 14 A, B respectively decodes stream ST A , ST B thereby providing the decoded pictures A and B shown in FIGS. 22( b ), ( c ).
- the splice controller 13 selects the re-encoding pictures REP A , REP B from pictures A, B by operation of switch 15 . Since the streams ST A , ST B are decoded by two separate decoders, each set of pictures A, B are not cross-referenced and, therefore, not reordered upon decoding. In other words, the pictures which are incorrectly inserted into the wrong stream upon decoding are excluded by the splice controller 13 .
- the present invention provides a seamlessly-spliced stream which, upon decoding by the decoding-side decoder, arranges the pictures in the correct order of presentation as shown in FIG. 23( a ).
- FIGS. 23( a ), ( b ) illustrate the solution to the problem of crossover motion compensation.
- the splice controller 13 controls the encoder 16 to change a direction of prediction of those pictures which improperly reference pictures in another stream. This occurs, as discussed with reference to FIGS. 11( a ), ( b ), because a B-picture in stream ST B , for example, is originally encoded by the encoder 1 B with reference to the P-picture in the same stream. Since the P-picture occurs before the splicing point, however, the B-picture of stream ST B now improperly refers to a picture in stream ST A . In order to resolve this problem, the splice controller 13 according to the present invention changes the prediction direction.
- FIGS. 24( a )–( b ) illustrate an example of changing the picture type in accordance with the present invention to prevent an incorrect motion estimation of a particular picture in the region of the splicing point.
- FIG. 24( a ) shows the presentation video data corresponding to stream ST A
- FIG. 24( b ) shows the presentation video data corresponding to stream ST B .
- the encoded stream ST A is decoded by the decoder 14 A ( FIG. 19) resulting in the decoded pictures shown in FIG. 25( b ).
- the encoded stream ST B of FIG. 25( d ) is decoded by the decoder 14 B ( FIG. 19) resulting in the decoded pictures of FIG. 25( c ).
- FIG. 24( a ) shows the presentation video data corresponding to stream ST A
- FIG. 24( b ) shows the presentation video data corresponding to stream ST B .
- the encoded stream ST A is decoded by the decoder 14 A ( FIG.
- the B-picture at the splicing point is motion predicted with reference to the following B-frame which occurs after the re-encoding region REP A . If this situation is left uncorrected, the B-picture at the splicing point will be motion estimated upon decoding with reference to the B-picture in the wrong bit stream, i.e., bit stream ST B . Similarly, the P-picture at the splicing point of the bit stream ST B shown in FIG. 25( c ) is motion encoded on the basis of a P-picture occurring outside the re-encoding region REP B . This motion estimation error causes the macroblocks in the frame to be seen and, when compounded by propagation of the error throughout the group of pictures, becomes quite noticeable.
- the splice controller 13 in accordance with the present invention changes the picture type of the problematic pictures of the foregoing example.
- the B-picture of stream ST A at the splicing point A n ⁇ P0 is changed to a P-picture which is motion estimated on the basis of the previous P-picture which is within the re-encoding range of that stream ST A .
- the P-frame of the stream ST B is changed to an I-picture which is not motion estimated. It will be appreciated that the B-pictures (B (m ⁇ PO)+1 ) and B (m ⁇ P0)+1 ) are discarded as shown in FIG.
- the new picture type may require new prediction direction data and motion vectors.
- the splice controller 13 provides the encode controller 43 with the encoding information such as the prediction direction and the motion vectors of a previously-coded picture.
- the present invention may also provide new motion prediction data using other techniques such as reconstructing the new picture entirely.
- T RE represents the re-encode control time
- OST A represents the original stream A
- ST RE ′ represents the stream which is re-encoded resulting in an underflow condition
- OST B represents original stream B
- SP VBV represents a splicing point in the VBV buffer
- SP represents a splicing point of the streams.
- FIGS. 27( a ), ( b ), illustrate the underflow condition.
- the locus of the VBV buffer for the stream ST RE ′ before the splicing point SP corresponds to stream A (ST A ).
- the locus corresponds to stream B (ST B ). Since the level of data occupancy of the VBV buffer for stream ST A at the splicing point is different from the level of data occupancy of the VBV buffer for stream ST B , the data occupancy of the VBV buffer at the splicing point is discontinuous. In actuality, since the streams of ST A , ST B are seamlessly spliced, the VBV buffer continuously stores the streams without discontinuity.
- VBV buffer occupancy is lower at the splicing point SP VBV by VBV_gap than in the case where the stream ST B is stored by itself. Because of this artificially-low occupancy level, the VBV buffer suffers an underflow VBV_under when the following I-frame, which typically occupies four times the VBV buffer space as the B- or P-frames, is retrieved from the VBV buffer.
- FIG. 29( a ) is a diagram showing a locus of data occupancy in the VBV buffer for the spliced stream ST SP shown in FIG. 29( b ).
- the level of the data occupancy at the splicing point is artificially-higher as compared with an original locus of the data occupancy in the VBV buffer for stream ST B .
- the VBV buffer suffers an overflow when an I-frame is stored in the VBV buffer as shown in the figure.
- Overflow occurs because the target bit rate for each picture is too small for the spliced bit stream.
- the reason for this is that the target bit rate is set for the smaller bit stream ST B including VBV OST — B which, as will be seen from FIGS. 27( a ), 29 ( a ), is not included in the spliced bit stream ST RE ′.
- the underflow condition is the opposite case where the target bit rate is too large for the spliced bit stream ST B .
- the locus of the data occupancy of the VBV buffer becomes discontinuous at a point where the stream ST RE ′ to be re-encoded is switched back to the original stream OST B which presents an additional overflow/underflow situation.
- VBV OST — B is an optimum locus determined to prevent overflow or underflow of the original stream OST B . If the level of the optimum locus is controlled, there is a possibility that overflow or underflow occurs.
- the splice controller 13 operation for setting the new target bit rate will be discussed with reference to FIGS. 19 , 28 ( a ), ( b ) and 30 ( a ), ( b ).
- the splice controller 13 calculates a locus of the data occupancy of the VBV buffer for the original stream OST A , a locus of a data occupancy of the VBV buffer for the original stream OST B and a locus of a data occupancy of the VBV buffer for the stream ST RE ′ to be re-encoded in a case where stream ST A and stream ST B are spliced.
- the locus of the data occupancy of the VBV buffer in each case can be calculated by subtracting an amount of bits output from the VBV buffer corresponding to the presentation times from the bit count value supplied from the stream counter 11 . Therefore, the splice controller 13 is able to virtually recognize the locus of the data occupancy of the VBV buffer for the original stream OST A , the locus of the data occupancy of the VBV buffer for the original stream OST B and the locus of the data occupancy of the VBV buffer for the stream ST RE ′ to be re-encoded in a case where stream ST A and stream ST B are spliced.
- the splice controller 13 references the locus of the data occupancy of stream ST RE ′, to calculate an amount of overflow/underflow (vbv_over)/(vbv_under) of the stream ST RE ′ to be re-encoded. Moreover, the splice controller 13 makes reference to the data occupancy of the stream ST RE ′ and the locus (VBV OST — B ) of the data occupancy of the original stream OST B in the VBV buffer. The splice controller 13 calculates the gap value (vbv_gap) in the VBV buffer at the switching point between the stream ST RE ′ to be re-encoded and the original stream OST B .
- Equation (1) is used to calculate the offset amount vbv_off. If the VBV buffer overflows as in the case shown in FIG. 29( a ), Equation (2) is used to calculate the offset amount vbv_off.
- the splice controller 13 uses the offset amount vbv_off obtained in accordance with Equations (1) or (2) to calculate a target amount of codes (a target amount of bits) TB P0 in accordance with the following Equation (3):
- the target amount of bits TB P0 is a value assigned to the picture which is subjected to the re-encoding process.
- GB_A is a value indicating an amount of generated bits of a picture which is any one of pictures A n ⁇ P0 to A (n ⁇ P0)+n0 in stream ST A and ⁇ GB_A (n ⁇ P0)+i is a sum of the amount of generated bits of the pictures A n ⁇ P0 to A (n ⁇ P0)+n0 .
- GB_B is a value indicating an amount of generated bits of a picture which is any one of pictures B m ⁇ P0 to B (m ⁇ P0) ⁇ m0 in stream ST B and ⁇ GB_B (m ⁇ P0)+i is a sum of the amount of generated bits of the pictures B m ⁇ P0 to B (m ⁇ P0) ⁇ m0 .
- the target amount of bits TB P0 expressed by Equation (3) is a value obtained by adding the offset amount vbv_off of the VBV buffer to the total amount of generated bits of the pictures A (n ⁇ P0)+n0 to B (m ⁇ P0) ⁇ m0 .
- the offset amount vbv_off is added to correct the target amount of bits TB P0 such that the gap of the locus of the data occupancy at the switching point between the stream ST SP , which is to be re-encoded, and the original stream OST B is minimized (preferably zero).
- the splice controller 13 assigns the target amount of bits TB P0 obtained in accordance with Equation (3) to the pictures A (n ⁇ p0)+n0 to B (m ⁇ P0) ⁇ m0 .
- the splicing apparatus according to at least one embodiment of the present invention is not so rigid but makes reference to the quantizing characteristics including the previous quantizing steps and the quantizing matrices of the pictures A (n ⁇ P0)+n0 to B (m ⁇ P0) ⁇ m0 so as to determine a new quantizing characteristic.
- the encode controller 43 makes reference to the quantizing steps and the quantizing matrices included in streams ST A and ST B . To prevent an excessive deviation from the quantizing characteristic realized in the previous encoder process of the encoders 1 A, 1 B the encode controller. 43 determines the quantizing characteristic when the re-encoding process is performed.
- FIGS. 28( a ), ( b ) illustrate a data occupancy of the VBV buffer when a re-encoding process is performed using the target amount of bits TB P0 calculated by the splice controller 13 which resolves the problem of underflow described with reference to FIGS. 27( a ), ( b ).
- FIGS. 30( a ), ( b ) similarly illustrate a data occupancy of the VBV buffer when a re-encoding process is performed using the target amount of bits TB P0 calculated by the splice controller 13 which resolves the problem of overflow described with reference to FIGS. 29( a ), ( b ).
- the operations of the splicing and editing process according to the present invention will be described with reference to FIGS. 31 and 32 .
- the present invention preferably meets regulations of Annex C of ISO13818-2 and ISO11172-2 and Annex L of ISO13818-1 and of course may conform to their encoding/decoding standards.
- step S 10 the splice controller 13 receives the splicing point parameter p 0 for splicing the streams ST A and ST B and re-encoding ranges n 0 and m 0 . It is possible that an operator inputs these parameters.
- the re-encoding ranges n 0 and m 0 may be automatically set in accordance with the configuration of the GOP of the stream or the like.
- step S 11 the splice controller 13 temporarily stores the streams ST A and ST B in the buffer memory 10 .
- the phases of the splicing point of each of the streams ST A and ST B are synchronized with reference to the presentation time by controlling a reading operation of the buffer memory 10 .
- step S 12 the splice controller 13 selects a picture to be output for re-encoding while inhibiting a picture in stream ST A appearing after the picture A n ⁇ P0 . Moreover, the splice controller 13 selects a picture to be output for re-encoding while inhibiting a picture appearing before the picture B m ⁇ P0 of stream ST B at the splicing point.
- FIGS. 25( a ),( b ) illustrate the situation where a P picture A (n ⁇ P0) ⁇ 2 of stream ST A appears after the picture A n ⁇ P0 at the splicing point. In an order of presentation, picture A (n ⁇ P0) ⁇ 2 is a picture in the future as compared with picture A n ⁇ P0 .
- the P picture A (n ⁇ P0) ⁇ 2 is not output in the present invention.
- the B pictures B (m ⁇ P0)+2 and B (m ⁇ P0)+1 are before the picture B m ⁇ P0 at the splicing point.
- pictures B (m ⁇ P0)+2 and B (m ⁇ P0)+1 are previous to picture B m ⁇ P0 . Therefore, the B pictures B (m ⁇ P0)+2 and B (m ⁇ P0)+1 are not output in the present invention.
- pictures to be transmitted are selected with reference to the order of presentation, thereby preventing the problem of the presentation order described with reference to FIGS. 9( a )–( d ).
- step S 13 the splice controller 13 initiates a process for setting the coding parameters required to reconstruct the pictures for re-encoding in accordance with steps S 14 to S 30 .
- the parameters which are set in this process include the picture type, a direction of prediction and the motion vectors for example.
- step S 14 the splice controller 13 determines whether the picture to be subjected to the picture reconstruction process is the picture A n ⁇ P0 at the splicing point. If so, the operation proceeds to step S 15 . Otherwise, the operation proceeds to step S 20 .
- step S 15 the splice controller 13 determines whether the picture to be subjected to the picture reconstruction is a B picture, a P picture or an I picture. If the picture to be subjected to the picture reconstruction is a B picture, the operation proceeds to step S 17 . If the picture to be subjected to the picture reconstruction is a P picture or an I picture, the operation proceeds to step S 18 .
- step S 16 the splice controller 13 determines whether two or more B pictures exist in front of picture A n ⁇ P0 in the spliced stream ST SP . For example, and as shown is FIG. 26( b ), if two B pictures (A (n ⁇ P0)+2 , A (n ⁇ P0)+3 ) exist in front of picture A n ⁇ P0 , the operation proceeds to step S 18 . Otherwise, the operation proceeds to step S 17 . In step S 17 , the splice controller 13 determines that the change of the picture type of the picture A n ⁇ P0 is unnecessary.
- the splice controller 13 sets a picture type for use in the process for re-encoding the picture A n ⁇ P0 to the same picture type (the B picture) used previously by the encoder 1 A. Therefore, in the re-encoding process in this case the picture A n ⁇ P0 is re-encoded as the B picture
- step S 18 the splice controller 13 changes the picture type of the picture A n ⁇ P0 from the B picture to the P picture.
- the present invention changes the picture A n ⁇ P0 type from the B picture to the P picture type as described with reference to FIGS. 26( a ), ( b ).
- the picture A n ⁇ P0 is reliably decoded as a P picture.
- step S 19 the splice controller 13 determines that the change in the picture type of the picture A n ⁇ P0 is unnecessary. At this time, the splice controller 13 sets the picture type for use when the picture A n ⁇ P0 is re-encoded to the picture type (the I picture or the P picture) set previously by the encoder 1 A.
- step S 20 the splice controller 13 determines that the change in the picture type of the picture A n ⁇ P0 is unnecessary. At this time, the splice controller 13 sets the picture type for use when the picture A n ⁇ P0 is re-encoded to the picture type (the I picture, the P picture or the B picture) set previously by the encoder 1 A.
- step S 21 the splice controller 13 sets a direction of prediction and the motion vectors for each picture.
- the picture A n ⁇ P0 to be subjected to the picture reconstruction process is a B picture in the original stream OST A .
- the B picture A n ⁇ P0 is bi-directionally predicted from the P pictures A (n ⁇ P0)+1 and A (n ⁇ P0) 2 .
- step S 12 the P picture A (n ⁇ P0) ⁇ 2 is inhibited from being output as the spliced stream and, thus, is prevented from becoming an inversely predicted picture of the picture A n ⁇ P0 specified in the picture reconstruction process. Therefore, when the picture A n ⁇ P0 is a B picture, its picture type is unchanged in step S 17 and, as such, is subjected to a forward and one-sided prediction in which only the P picture of A (n ⁇ P0)+1 is employed for prediction. This is similar to the case in step S 18 where the B picture is changed to the P picture such that the one-sided prediction parameter for predicting the picture A n ⁇ P0 is based only on the P picture A (n ⁇ P0)+ 1.
- the direction of prediction when the picture A n ⁇ P0 is a P picture in step S 19 is unchanged. That is, the splice controller 13 sets a forward and one-sided prediction for the picture A n ⁇ P0 as in the previous encode process performed by the encoder 1 A.
- a change in the direction of prediction of the pictures A (n ⁇ P0)+n0 to A (n ⁇ P0)+1 as determined in step S 20 is unnecessary. That is, the splice controller 13 sets a direction of prediction for the pictures A (n ⁇ P0)+n0 to A (n ⁇ P0)+1 as set previously by the encoder 1 A. If the two pictures A (n ⁇ P0)+1 and A n ⁇ P0 are B pictures predicted from two directions from the forward-directional P picture or I picture and the inverse-directional I picture or the P picture, the prediction for the picture A (n ⁇ p0)+1 as well as the picture A n ⁇ P0 must be changed to one-sided prediction such that prediction is performed from only the forward-directional picture.
- step S 21 the splice controller 13 determines whether the motion vectors for each picture in the previous encode process performed by the encoder 1 A is reused when the re-encoding process is performed in accordance with the newly set direction of prediction.
- the motion vectors used in a previous encode process performed by the encoder 1 A are the same as in the re-encoding process, i.e., employed for the P picture and the B picture when the direction of prediction of each has not changed. In the examples shown in FIGS.
- the motion vectors used in the previous encode process performed by the encoder 1 A are reused when the pictures A (n ⁇ P0)+n0 to A (n ⁇ P0)+1 are re-encoded.
- the picture A (n ⁇ P0)+1 and the picture A n ⁇ P0 are B pictures predicted from both directions from a P picture or an I picture in the forward direction and an I picture or a P picture in the reverse direction, the prediction is changed to one-sided prediction in which prediction is performed in only a forward-directional picture. Therefore, only motion vectors corresponding to the forward-directional picture are used.
- the splice controller 13 sets the prediction direction such that the motion vector for the forward-directional picture is used and the inverse-directional motion vector is not used in step S 21 .
- the motion vectors produced in the previous encoder process performed by the encoder 1 A are not used.
- new motion vectors corresponding to A (n ⁇ p0)+1 are produced. That is, the splice controller 13 sets the direction of prediction in step S 21 such that any previous motion vectors are not used.
- step S 22 the splice controller 13 determines whether all parameters of the picture type, the direction of prediction and previous motion vectors of the pictures from pictures A (n ⁇ P0)+n0 to A n ⁇ P0 are set. If so, control proceeds to step S 23 .
- step S 23 the splice controller 13 determines whether the picture to be subjected to the picture reconstruction process is a picture B m ⁇ P0 at the splicing point. If so, the operation proceeds to step S 24 . Otherwise, if the picture to be subjected to the picture reconstruction is any one of pictures B (m ⁇ P0) ⁇ 1 to B (m ⁇ p0)+m0 , the operation proceeds to step S 28 .
- step S 24 the splice controller 13 determines whether the picture to be subjected to the picture reconstruction process is a B picture, a P picture or an I picture. If the picture to be subjected to the picture reconstruction process is a B picture, the operation proceeds to step S 25 . If the picture to be subjected to the picture reconstruction process is a P picture, the operation proceeds to step S 26 . If the picture to be subjected to the picture reconstruction process is an I picture, the operation proceeds to step S 27 .
- step S 25 the splice controller 13 determines that a change in the picture type of the picture B m ⁇ P0 in the re-encoding process is unnecessary as in the example shown in FIGS. 22( a )–( d ) and 23 ( a ), ( b ). Thus, the splice controller 13 sets the picture type for use in a re-encoding process of the picture B m ⁇ P0 to the same picture type (the B picture) as set previously by the encoder 1 B.
- step S 26 the splice controller 13 changes the picture type of the picture B m ⁇ P0 from the P picture to the I picture as in the examples shown in FIGS. 25( a )–( d ) and 26 ( a ), ( b ).
- the P picture is a one-sided prediction picture which is predicted from the forward-directional I- or P-picture, the P picture is always positioned behind the pictures used for prediction on the stream. If the first picture B m ⁇ P0 at the splicing point in the stream ST B is a P picture, prediction must be performed from a forward-directional picture of the stream ST A which exists in front of the picture B m ⁇ P0 .
- the splice controller 13 changes the picture type of the B picture to the I picture.
- step S 27 the splice controller 13 determines that a change in the picture type of the picture B m ⁇ P0 is unnecessary. Thus, the splice controller 13 sets the picture for use in the re-encoding process of the picture B m ⁇ P0 to the same picture type (I picture) set previously by the encoder 1 B.
- step S 28 the splice controller 13 determines that a change in the picture type of the pictures B (m ⁇ P0) ⁇ 1 to B (m ⁇ P0) ⁇ m0 is unnecessary.
- the splice controller 13 sets the picture for use in the re-encoding process of each of the foregoing pictures to the same picture type (the I picture, the P picture or the B picture) set previously by the encoder 1 B.
- step S 29 the splice controller 13 sets a direction of prediction and motion vectors for each picture. If the picture B m ⁇ P0 to be subjected to the picture reconstruction process is, in the original stream OST B , a B picture as in the example shown in FIGS. 22( a )–( d ) and 23 ( a ), ( b ), the picture B m ⁇ P0 is a picture predicted from two directions, i.e., from the P picture B (m ⁇ P0)+1 and the I picture B (m ⁇ P0) ⁇ 2 .
- the P picture of B (mPO)+1 is not output as a splicing stream and, therefore, is not specified as a forward-directional prediction picture for the picture B m ⁇ P0 to be subjected to the picture reconstruction process. Therefore, the picture B m ⁇ P0 which is set such that a change in its picture type is unnecessary in step S 25 must be set to perform an inverse and one-sided prediction such that only the I picture B (m ⁇ P0) ⁇ 2 is predicted. Therefore, the splice controller 13 sets a direction of prediction for the picture B m ⁇ P0 to perform the inverse and one-side prediction such that only the I picture B (m ⁇ P0) ⁇ 2 is used in the prediction.
- a change in the direction of prediction of the pictures B (m ⁇ P0)+m0 to B (m ⁇ P0)+1 in step S 28 is deemed unnecessary.
- the splice controller 13 sets a direction of prediction for the pictures B (m ⁇ P0)+m0 to B (m ⁇ P0)+1 to the same picture previously set by the encoder 1 B. If the picture B (m ⁇ P0) ⁇ 1 is a B picture, a direction of prediction for the B (m ⁇ P0) ⁇ 1 is set such that inverse and one-sided prediction is performed so that only the I picture of B (m ⁇ P0) ⁇ 2 is predicted. This is similar to the foregoing case in which the picture B m ⁇ P0 is predicted.
- the splice controller 13 determines in step S 29 whether the motion vectors set previously are reused for each picture when the re-encoding process is performed.
- the re-encoding process is performed such that the motion vectors used in a previous encode process of the encoder 1 B are reused for the P pictures and the B pictures when the prediction direction has not been changed.
- the motion vectors used in a previous encode process are used for the pictures from the I picture B (m ⁇ P0) ⁇ 2 to the P picture B (m ⁇ P0) ⁇ m0 .
- step S 30 the splice controller 13 determines whether the parameters relating to the picture type, the direction of prediction and the motion vectors for all of the pictures from the picture B m ⁇ P0 to the picture B (m ⁇ P0) ⁇ m0 are set. If so, the splice controller 13 in step S 31 calculates a target amount of bits (TB P0 ) to be generated in the re-encoding period in accordance with Equation (3).
- the splice controller 13 initially calculates a locus of the data occupancy of the VBV buffer for the original stream OST A , a locus of the data occupancy of the VBV buffer for the original stream OST B and a locus of the data occupancy of the VBV buffer for the stream ST RE ′ to be encoded in a case where streams ST A , ST B are spliced in accordance with a bit count value of stream ST A and the bit count value of stream ST B supplied from the stream counter 11 . Then, the splice controller 13 analyzes the virtually-obtained locus of the data occupancy of the VBV buffer for the stream ST RE ′ to be re-encoded.
- the splice controller 13 calculates an amount of underflow (vbv_under) or an amount of overflow (vbv_over) of the stream ST RE ′ to be re-encoded. Moreover, the splice controller 13 compares the virtually-obtained locus of the data occupancy of the VBV buffer for stream ST RE ′ to be re-encoded and a locus (VBV OST — B ) of the data occupancy in the VBV buffer for the original stream OST B . Thus, the splice controller 13 calculates a gap value (vbv_gap) of the VBV buffer at a switching point between stream ST RE ′ to be re-encoded and the original stream OST B .
- the splice controller 13 calculates an offset amount vbv_off of the target amount of codes in accordance with Equations (1) and (2). Then, the splice controller 13 uses the offset amount vbv_off calculated in accordance with Equation (1) or (2) to calculate a target amount of codes (target amount of bits) TB P0 in accordance with Equation (3).
- step S 32 the splice controller 13 determines a quantizing characteristic to be set for each picture.
- the quantizing characteristic is determined in accordance with an assignment to the pictures A (n ⁇ P0)+n0 to B (m ⁇ P0) ⁇ m0 of the target amount of bits TB P0 calculated in accordance with Equation (3).
- the splicing apparatus according to the present invention makes reference to quantizing characteristics including the previous quantizing steps and the quantizing matrices of each of the pictures A (n ⁇ P0)+n0 to B (m ⁇ P0) ⁇ m0 used by the encoders 1 A and 1 B so as to determine new quantizing characteristics.
- the splice controller 13 initially receives from the stream analyzing portion 12 information about the coding parameters, quantizing steps and quantizing matrices produced in a previous coding process performed by the encoders 1 A and 1 B and included in the streams ST A , ST B .
- the splice controller 13 makes reference to the amounts of codes (bits) assigned to the target amount of bits TB P0 and information of the previous coding parameters.
- the splice controller 13 determines the quantizing characteristics when the re-encoding process is performed so as to prevent excessive deviation from the quantizing characteristics in the encoding processes performed by the encoders 1 A and 1 B.
- the quantizing characteristics of the pictures, the picture type of each of which has been changed by the picture reconstruction process are newly calculated when the re-encoding process is performed without reference to the information of the quantizing steps and the quantizing matrices.
- step S 33 the splice controller 13 decodes the pictures A (n ⁇ P0)+n0 to B (m ⁇ P0) ⁇ m0 included in the re-encoding range.
- step S 34 the splice controller 13 uses the quantizing characteristics set to pictures A (n ⁇ P0)+n0 to B (m ⁇ P0) ⁇ m0 while controlling the amount of generated bits. If the splice controller 13 reuses the previous motion vectors, the encode controller 43 , at the control of the splice controller 13 , causes switch 44 to channel the previous motion vectors to the motion compensation portion 41 . When the previous motion vectors are not used, the encode controller 43 controls the switch 44 to channel the motion vectors newly produced by the motion detection circuit 42 to the motion compensation portion 41 .
- the encode controller 43 controls the frame memories 39 and 40 in accordance with information about the picture type supplied from the splice controller 13 to store the pictures required to produce predicted image data.
- the encode controller 43 sets, to the quantizing circuit 34 and the inverse quantization circuit 36 , the quantizing characteristics in the re-encoding range supplied from the splice controller 13 .
- step S 35 the splice controller 13 controls the switch 17 to selectively output stream ST A from the buffer 10 , stream ST B from the buffer 10 or the re-encoded stream ST RE from the MPEG encoder 16 .
- the splice controller 13 seamlessly-splices stream ST A which appears before the re-encoding range, re-encoded stream ST RE in the re-encoding range and stream ST B which appears after the re-encoding range to provide seamlessly-spliced bit stream ST SP .
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Business, Economics & Management (AREA)
- Marketing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Television Signal Processing For Recording (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Management Or Editing Of Information On Record Carriers (AREA)
Abstract
A plurality of bit streams are seamlessly spliced. Separate decoders decode each bit stream. A controller selects the decoded pictures according to a re-encoding range in the vicinity of a splicing point of the bit streams. Pictures presenting a reordering of the streams are excluded in the selection of the decoded pictures. An encoder re-encodes the pictures within the re-encoding range. When it is determined that crossover motion compensation exists between pictures of different streams, the controller changes the motion prediction direction of the problematic picture. The controller changes a motion prediction picture type of a picture which is improperly motion predicted with reference to another stream. A quantization characteristic or motion vectors for the new picture type are generated by the controller. The controller effects the encoding in accordance with a target amount of bits to prevent a breakdown of a buffer and a discontinuation of an amount of data occupancy thereof. A multiplexer multiplexes the original streams with the re-encoded stream to produce a seamless bit stream.
Description
This is a continuation of co-pending International Application PCT/JP98/03332 having an international filing date of Jul. 27, 1998.
1. Field of the Invention
The present invention relates to an editing system, method and apparatus for editing images and, more particularly, an editing system, method and apparatus for seamlessly splicing a plurality of bit streams of video data.
2. Related Art
Recording/reproducing systems have recently been introduced which record/reproduce high quality audio/video data utilizing compression schemes. High quality recording/reproducing systems compression-encode/decode the audio/video data utilizing the MPEG (Moving Picture Experts Group) standard. One example of such a system is the DVD (Digital Versatile Disk or Digital Video Disk), which provides a powerful means by which unprecedented quantities of high quality audio/video are compressed on an optical disk.
The decoding-side apparatus 120 of FIG. 1 stores in a decoding-side Video Buffer Verifier (VBV) buffer (not shown) the received transport stream which is transmitted via the transmission medium 116. The transport stream demultiplexer 121 demultiplexes the received transport stream fetched from the decoding buffer at a timing determined by a decoding time stamp (DTS) to thereby reproduce the video packetized elementary stream (video PES) and the audio packetized elementary stream (audio PES). The video packetized elementary stream is depacketized by depacketizer 122 and decoded by video decoder 123 thereby reproducing the video data DV. The audio packetized elementary stream is depacketized by depacketizer 124 and decoded by audio decoder 125 thereby reproducing the audio data DA. For DVD applications, the transport stream multiplexer 115 and the transport stream demultiplexer 121 are respectively replaced with a program stream multiplexer and demultiplexer which DVD format/unformat the encoded bit streams.
In the recording/reproducing system of FIG. 1 , it is desirable to seamlessly splice a plurality of bit streams by concentrating at the transport level two or more different elementary streams representing the merger of different video programs. In digital broadcasting, for example, editors at a broadcasting station splice a plurality of bit streams from different video sources such as, for example, live video feeds received from local stations for generating a spliced broadcast video program. In DVD applications, the director splices movie scenes to be recorded on the DVD optical disk. In another DVD application, the DVD decoder splices multiple bit streams reproduced from the DVD optical disk in response to user-entered actions which is particularly useful for generating alternate scenes for interactive movies and video games.
There are, however, unforeseen difficulties to splicing a plurality of bit streams using the MPEG compression standard. In order to illuminate the problem, a closer look at MPEG is warranted. In summary, the MPEG standard implements a compression process which includes motion-compensated predictive coding in conjunction with adaptive Discrete Cosine Transform (DCT) quantization. The motion-compensated predictive coding predicts motion in each image frame/field using both unidirectional and bidirectional motion prediction. The DCT quantization adaptively compresses each frame/field in accordance with the motion-compensated prediction. The term “frames” hereinafter refers to pictures in general including frames as well as fields.
As illustrated in FIG. 3( a), motion-compensated prediction of the MPEG compression standard classifies the frames into one of three types: intracoded-frames (I-frames), predictively coded frames (P-frames) and bi-directionally coded frames (B-frames). MPEG establishes the I-frames as the reference by which the B- and P-frames are encoded and, thus, preserves the I-frames as complete frames. The I-frames are considered “intra-coded” since they proceed as complete frames, having bypassed the motion-compensated prediction, to the DCT quantization whereupon each I-frame is compression encoded with reference only to itself. P-frames, which rely on forward temporal prediction, are coded using the previous I- or P-frame. B-frames are coded using bi-directional (forward and/or backward) motion compensated predictive encoding using the two adjacent I- and/or P-frames. B- and P-frames are considered “inter-coded” since they are motion-prediction encoded with reference to other frames FIG. 7 illustrates an example of the direction of prediction for each I, B and P-frame in a group of pictures (GOP) as indicated by the arrows in the figure.
In accordance with the MPEG standard, frames are arranged in ordered groups of pictures (GOP), each group of pictures comprising a closed set of I-, B- and P-frames which are encoded with reference to only those frames within that group. FIG. 3( a) illustrates the natural presentation order (1 to 15) of the GOP in which the pictures are naturally presented to the viewer. Since the B- and P-frames within the GOP are encoded with reference to other frames, the MPEG standard dictates that the natural presentation order shown in FIG. 3( a) be rearranged into the decoding order shown in FIG. 3( b) in which the frames are to be decoded and transmitted in the coded order shown in FIG. 3( c). With this arrangement, the frames necessary for decoding other frames are first decoded to provide the basis upon which the following inter-coded frames are decoded. For example, an I-frame which forms the reference by which the following frames in the GOP are motion-compensation predicted is positioned first in the decoding order. Once decoded, the pictures are rearranged in their natural presentation order for display to the viewer.
Motion-compensated predictive coding divides each I-, B- and P-frame into 8×8 pel macroblocks. The motion vectors for a present frame are motion-compensation predicted with reference to the motion vectors of another frame which is selected in accordance with the direction of prediction of the type of frame (e.g., I-, B- or P-frame) For example, P-frame macroblocks are motion-predicted with reference to the macroblocks in a previous I or P-frame; B-frame macroblocks are motion-predicted with reference to the previous/successive I- and/or P-frames. The I-frames, which are not inter-coded, bypass motion compensation and are directly DCT quantized.
The process for motion-predicting a current picture in a GOP is illustrated in FIGS. 4( a)–(e). The GOP are input in the natural presentation order shown in FIG. 4( a), rearranged in accordance with the decoding order shown in FIG. 4( b), motion-predicted utilizing two frame memories (FM1, FM2) as shown in FIGS. 4( c) and (d) and output in the form of the encoding stream (ES) shown in FIG. 4( e). For example, the I-frame (13) of FIG. 4( b) is intra-coded and, therefore, output directly to the encoding stream (ES); the B-frame (B1) of FIG. 4( b) is motion predicted with reference to the I-frame (I3) stored in the first frame memory (FM1) of FIG. 4( c) and the P-frame (P) stored in the second frame memory (FM2) of FIG. 4( d); the P-frame (P6) of FIG. 4( b) is motion predicted with reference to the I-frame (I3) stored in the first frame memory (FM1) of FIG. 4( c). From the foregoing illustration, it is apparent that a minimum of two frame memories are needed for bi-directional motion prediction.
After the motion vectors are calculated, each macroblock is Discrete Cosine Transform (DCT) encoded. More particularly, the macroblocks are transformed from pixel domain to the DCT coefficient domain. Next, adaptive quantization is performed on each block of DCT coefficients in accordance with a variable quantization step size. After adaptive quantization is applied to the DCT coefficients, the coefficients undergo further compression involving such techniques as differential coding, run-length coding or variable length coding. The encoded data is stored/retrieved to/from the Video Buffer Verifier (VBV) buffer at a controlled target bit rate in the form of a serial bit stream.
Referring to FIGS. 5A–C , the decoding process for decoding the transmitted group of pictures (GOP) is explained. The coded order shown in FIG. 5( a) is received by the decoding side apparatus 120 (FIG. 1) and stored in the decoding-side VBV buffer. The transport stream demultiplexer 121 demultiplexes the stream into the packetized elementary stream illustrated in FIG. 5( b). The GOP are decoded by fetching the compressed picture data from the decoding-side buffer at a timing determined by the decoding time stamp (DTS), de-compressing the fetched picture data and reconstructing each I-, B- and P-frame from the decompressed picture data. It will be appreciated that the I-frames are complete upon decompression. The B- and P-frames are reconstructed by motion estimating the previously decoded frames based on the decompressed motion vectors of the current B- or P-frame. Afterwards, the decoded frames are rearranged in their original presentation order for display as shown in FIG. 5( c).
When it is considered that the decoding-side apparatus requires relatively less hardware complexity than the encoding-side, the wisdom of the MPEG encoding/decoding scheme will be immediately recognized. To explain, the complex hardware necessary to perform motion prediction is not a part of the decoding-side apparatus since the decoder need only apply the motion vectors to the encoded pictures. The high quality audio/video is, thus, generated by a high-end encoder for distribution enmasse to numerous, considerably less-complex (and less-expensive) decoders.
The motion decoding process is illustrated in FIGS. 6( a)–(d) wherein FIG. 6( a) shows the coded video elementary stream (ES) which is supplied to the decoder. A first frame memory (FM1) as illustrated in FIG. 6( b) stores a first previously-decoded picture for decoding the current picture. A second frame memory (FM2) as illustrated in FIG. 6( c) stores a second previously-decoded picture for decoding the current picture. For example, the decoded I-frame (I3) (first picture in the ES of FIG. 6( a)) is stored in the first frame memory (FM1) and the P-frame (previous ES) is stored in the second frame memory (FM2). In this example, the B-frame (B1) is decoded by motion estimating the frames in the frame memories (FM1, FM2) based on the motion vectors of B1. The decoded GOP are output in the presentation order illustrated in FIG. 6( d)
With the rudiments of the MPEG standard explained, the difficulties confronted when splicing coded streams will be better appreciated. In the conventional editing system for splicing bit streams, it is recognized that the bit streams must be decoded. This is because the prediction direction of the first stream may be inconsistent with that of the second. To explain, the selected direction of prediction (forward/backward) for the B-frames mutually effects the prediction direction of other B-frames and, for that matter, defines which frames are selected for the motion prediction throughout the GOP. When two coded bit streams are spliced arbitrarily, for example, the prediction direction for a frame in the first coded bit stream may be decoded with reference to a frame with an inconsistent prediction direction in the second coded bit stream. For this reason, motion estimation upon decoding in the area of the splicing point will result in reconstructing an incorrect picture. The error, referred to as a discontinuity, migrates to other frames in motion estimation, consequently effecting the motion estimation decoding of the GOP as a whole. This discontinuity manifests as visible macroblocks on the display when, for example, the channel of a digital television is changed.
In order to prevent discontinuity, it is suggested to decode the bit streams before splicing. When the bit streams are decoded, the frames thereof are not motion predicted, i.e., not encoded with reference to other frames and thus are not subject to the discontinuity of the foregoing method. However, the spliced bit stream must be re-encoded. Since MPEG coding is not a 100% reversible process, the signal quality is deteriorated when re-encoding is performed. The problem is compounded because the re-encoding process encodes a decoded signal, i.e., a degraded version of the original audio/video signal.
A splicing technique which addresses signal deterioration selectively decodes the bit streams at a splicing point. However, such a splicing technique produces unsatisfactory results. The first problem arises in the presentation order of the spliced stream which may be understood with reference to FIGS. 8( a)–(d) to 9(a)–(d). FIGS. 8( a)–(d) illustrate the ideal case where no problems arise in the presentation order of the spliced stream STSP. In this case, stream STA of FIG. 8( a) is spliced at the splicing point SPA with stream STB of FIG. 8( b) at the splicing point SPB. Thus, the spliced bit stream STSP of FIG. 8( c) presents the pictures of stream STA followed by the pictures of stream STB without problem.
The second problem, hereinafter termed “crossover”, arises in motion estimation upon decoding of the spliced bit stream. In the ideal case illustrated in FIGS. 10( a), (b) the motion estimation reconstructs the pictures of stream STA of the spliced bit stream STSP of FIG. 10( a) with reference to only those frames from that stream. This is indicated by the arrows in FIG. 10( b) which represent the motion estimation direction. Likewise, stream STB is motion estimated with reference to only those pictures in that stream.
The problematic case is illustrated in FIGS. 14( a)–16(b). By themselves, bit streams STA, STB do not pose an overflow/underflow problem as will be appreciated from FIGS. 14( a), 15(a). However, when the bit streams STA, STB are spliced as illustrated in FIGS. 16( a), (b) at a splicing point SPV an overflow/underflow condition occurs. The overflow condition which is illustrated in FIGS. 17( a), (b) occurs when bit stream STB continues to fill the VBV buffer to a point where the VBV buffer overflows as indicated at 141 in FIG. 17( a). The underflow case which is illustrated in FIGS. 18( a) and (b) occurs when stream STB does not thereafter fill the VBV buffer by a sufficient amount thereby resulting in an underflow 142 shown in FIG. 18( b). In the decoding-side apparatus (IRD), either an overflow or underflow of the VBV buffer consequently results in a failure in decoding pictures on the decoding-side. It is not a typical to see the effects of overflow/underflow manifesting as the skipping, freezing or interruption of the images.
Heretofore, there has been no solution for providing a seamlessly-spliced bit stream from a plurality of bit streams without the serious defects illustrated in the foregoing examples.
It is therefore an object of the present invention to provide a system for splicing bit streams;
It is another object of the present invention to provide a system for seamlessly splicing bit streams;
It is another object of the present invention to prevent signal deterioration in a system for splicing bit streams;
It is another object of the present invention to prevent degradation of image quality due to improper reordering of the pictures in the spliced bit stream;
It is another object of the present invention to prevent picture distortion due to improper motion estimation and the propagation thereof;
It is another object of the present invention to prevent overflow/underflow in the video verifier buffer (VBV) buffer;
It is another object of the present invention to provide an editing system to generate seamless bit streams on the fly from video feeds of various sources for broadcast by a broadcasting station;
It is another object of the present invention to provide a system for splicing bit streams in a DVD system;
It is another object of the present invention to provide a system for generating interactive movies by splicing a plurality of bit streams representing various portions of the movie;
It is another object of the present invention to provide a video game system for generating interactive video game scenes selected in accordance with user commands by splicing a plurality of bit streams representing alternative user-directed scenes of a video game; and
It is another object of the present invention to provide a system for encoding/decoding audio/video feeds spliced from a plurality of bit streams for on-line transmission.
According to the present invention, there is provided a system, method and apparatus for splicing a plurality of bit streams. The present invention inhibits a picture in the spliced bit stream which, upon decoding, would be out of sequence. In this manner, the present invention prevents an improper reordering of the spliced bit stream pictures on the decoding side.
In order to prevent deterioration in the image quality of the spliced bit stream, the present invention selectively reuses motion vector information fetched from the source coded streams for use in the re-encoding process. The new motion vectors are supplied to the motion compensation portion of the re-encoder in place of the original motion vectors. In order to prevent the improper prediction of a picture from an incorrect bit stream source, the present invention sets the direction of prediction to a picture which is positioned adjacent the splicing point thereby preventing degradation in image quality. In addition, the present invention has a capability of changing the picture type of a picture in the vicinity of the splicing point in order to prevent erroneous motion prediction from pictures from another bit stream source.
It is recognized in the present invention that the overflow/underflow condition occurs owing to an improper selection of the target bit rate for the spliced bit stream. So as to prevent overflow/underflow of the video buffer verifier (VBV) buffer, the target amount of bits is calculated anew for the spliced bit stream. The target amount of bits is calculated by reference to a quantizing characteristic produced in a previous coding process which may be retrieved from the source coded streams. In the alternative, the target amount is approximated. The plural bit streams are decoded in the region of the splicing point(s) and re-encoded in accordance with the new target bit rate.
With the present invention, seamlessly-spliced bit streams are provided without signal deterioration arising from improper reordering of the frames, picture distortion due to improper motion estimation or a breakdown in the video verifier (VBV) buffer due to improper selection of the target bit rate. It will be appreciated that the present invention is applicable to a wide range of applications including, for example, an editing system for generating seamless bit streams on the fly from video feeds of various sources for broadcast by a broadcasting station, a DVD system, a system for providing interactive movies, a video game system for generating alternative user-directed scenes of a video game or a system for encoding/decoding audio/video feeds for on-line transmission.
In more detail, FIG. 19 illustrates that the streams STA and STB are input to a buffer memory 10, a stream counter 11 and a stream analyzing portion 12. The stream counter 11 counts the number of bits in each of the streams STA and STB whilst the stream analyzing portion 12 analyzes the syntax of each of the streams. A splice controller 13 controls the bit splicing operation of the present invention as will be described in more detail. MPEG decoders 14A and 14B decode the streams STA and STB retrieved from the buffer memory 10 which output respective base-band video data to a switch 15. At the control of the splice controller 13, the switch 15 outputs either stream STA or STB to an MPEG encoder 16. The MPEG encoder 16, at the control of the splice controller 13, encodes the video base-band data selected by the switch 15 to thereby output a re-encoded bit stream STRE. A switch 17, as controlled by the splice controller 13, selectively outputs either the bit streams STA, STB retrieved from the buffer memory 10 or the re-encoded bit stream STRE to thereby output the spliced bit stream STSP.
The operation of the present invention shown in FIG. 19 will now be described. The stream counter 11 counts the number of bits of each of the received streams STA and STB and supplies the count value to the splice controller 13. The number of bits of the streams is counted because the locus of the data occupancy of the video buffer verifier needs to be controlled to prevent overflow/underflow. The stream analyzing portion 12 analyzes the syntax of each of the streams to fetch appropriate information from the layers of the bit streams including the sequence layer, the GOP layer, the picture layer and the macroblock layer. For example, encoded information such as the picture type (I, B or P), motion vectors, quantizing steps and quantizing matrices are retrieved by the stream analyzing portion.
The splice controller 13, based on the count value from the stream counter 11 and the information from the stream analyzing portion 12, sets a re-encoding range for each bit stream in accordance with the range parameters n0 and m0. Likewise, the splicing point(s) are set in accordance with the splice point parameter p0(s). The splice controller 13 controls the timing of the switch 15 to select the appropriate bit stream STA or STB to be sent to the MPEG encoder 16 in accordance with the splicing point parameter p0 and the range parameters n0, m0. The phase and timing of the bit streams are controlled by the splice controller to coincide at the predetermined splicing point(s). The splice controller 13 controls the switch 17 to select the bit streams STA and STB normally. The re-encoded bit stream STRE produced by the MPEG encoder 16 is selected during the re-encoding range in accordance with the parameters n0, m0 and p0.
The encoding section 16 shown in FIG. 20 encodes the decoded video data output from the MPEG decoders 14A and 14B in accordance with the operations of the splice controller 13. An encoder's previous processing circuit 30 preprocesses the decoded video data by rearranging the pictures of the decoded-video date in accordance with the bidirectional predictive coding process, forms pixel macroblocks and calculates the difficulty in coding each picture. In the preferred embodiment, the encoder's previous processing circuit 30 forms 16×16 pixel macroblocks. The encoding section 16 further incorporates a subtraction circuit 31 for subtracting a motion prediction error from the input decoded video data, a switch 32 for bypassing the motion-compensated prediction process in the case of I-frames and a compression/motion-compensated prediction section. The compression portion includes a discrete cosine transform circuit (DCT) 33, a quantizing circuit (Q) 34 and a variable-length coding circuit (VLC) 35. The quantizing circuit 34 of the present invention is controlled by the splice controller 13.
The motion-compensated prediction portion predicts the motion within the B- and P-frames of the input decoded video data. In more detail, the compressed bit stream is decompressed by application to an inverse quantizing circuit (IQ) 36 followed by an inverse discrete cosine transform circuit (IDCT) 37. The decompressed bit stream is added by the addition circuit 38 to the motion-compensated version of the picture in order to reconstitute the current frame. The frame memories FM1, FM2 (39, 40) store the appropriate reconstructed frames at the control of the motion detection circuit 42 in accordance with the type of predictive coding (B- or P-frame encoding). The motion compensation circuit 41 performs motion compensation in accordance with the frame(s) stored in the frame memories (FM1, FM2) 39, 40 based on the motion vectors provided by the motion detection circuit 42. The motion compensated picture, which is essentially a prediction of the current frame, is subtracted from the actual current frame by the subtraction circuit 31. It will be appreciated that the output of the subtraction circuit 31 is essentially an error result representing the difference between the actual frame and the prediction. An encode controller 43 provides substitute motion vectors and controls a switch 44 in order to select between the motion vectors determined by the motion detection circuit 42 and the substitute motion vectors.
The operation of the decoding/encoding section shown in FIG. 20 will now be described. The decoding section 14 decodes the input stream ST preferably in accordance with the MPEG standard. The encoder's previous processing circuit 30 rearranges the pictures for encoding in accordance with the picture type information extracted by the stream analyzing circuit 12 and forms picture data into macroblocks. The rearranged pictures are forwarded to the encoding section 16 of the figure for encoding.
The splice controller 13 forwards the encoded information, more particularly the motion vectors, which are extracted by the stream analyzing circuit to the encode controller 43. When it is determined to reuse the substitute motion vectors, the encode controller 43 causes the switch 44 to select the motion vectors supplied thereto. At other times, the encode controller 43 causes the switch 44 to select the motion vectors produced by the motion detection circuit 42. The encode controller 43 controls the frame memories (FM1, FM2) 39, 40 to store the appropriate pictures required to produce the predictive image data based on the substitute motion vectors and in accordance with the picture type of the current picture to be encoded. In addition, the encode controller 43 controls the quantization step size of the quantizing circuit 34 and the inverse quantization circuit 36 to accommodate the motion vectors in accordance with the target bit rate supplied by the splice controller 13.
The encode controller 43, moreover, controls the variable-length coding circuit 35. When it is determined that an amount of generated bits of the variable-length coding circuit 35 is insufficiently large with respect to the target amount of bits supplied by the splice controller 13, which forewarns of an underflow in the VBV buffer, the encode controller 43 adds dummy data to the variable-length coding circuit 35 in order to account for the shortage with respect to the target amount of bits. Conversely, the encode controller 43 performs a skipped macroblock process (ISO/IEC 13818-27.6.6) which interrupts the coding process in terms of macroblock units when it is determined that the variable-length coding circuit 35 generates an amount of bits that is relatively larger than the target amount of bits which warns of an overflow.
An example of the control of the decoding/encoding section according to the present invention will now be described with reference to FIGS. 21( a) to 26(b). FIGS. 21( a), (b) illustrate the process of selecting the video data to be re-encoded (also referred to as “presentation video data”) representing those portions of the bit streams STA (FIG. 21( a)) and STB (FIG. 21( b)) decoded respectively by the decoders 14A and 14B. In summary, when the splicing point as determined by the parameter p0 is set, the pictures comprising the presentation video data are selected to include those pictures within the re-encoding ranges as defined by the parameters n0 and m0.
A picture at the splicing point corresponding to stream STA is expressed as An−P0, wherein n is an integer and p0 is the splicing point parameter. Following this convention, pictures which are future to the picture at the splicing point are expressed as A(n−P0)+1, A(n−P0)+2, A(n−P0)+3, A(n−P0)+4 . . . A(n−P0)+n0, wherein no is the range parameter defining the range of the presentation video data corresponding to bit stream STA. Conversely, pictures more previous than the picture An−P0 at the splicing point are expressed as A(n−P0)−1, A(n−P0)−2, A(n−P0)−3, A(n−P0)−4, and so on. Likewise, the presentation video data corresponding to the stream STB at the splicing point is expressed as B(m−P0) and the pictures in the re-encoding range defined by the parameter m0 are expressed as B(m−P0)+1, B(m−P0)+2, B(m−P0)+3, B(m−P0)+4 . . . B(m−P0)−1, BP0)−3, B(m−P0)−4 . . . B(m−P0)−m0. As illustrated in FIGS. 21( a) and (b), the range of pictures in each respective bit stream STA, STB are indicated by the ranges for re-encoding (n0, m0). In other words, the re-encoding ranges include the pictures from picture A(n−P0)+n0 to picture A(n−P0) and pictures from picture B(m−P0) to picture B(m−P0)−m0.
With the present invention, the problem that the decoder on the decoding-side presents the pictures in the improper order is prevented. Each decoder 14A, B respectively decodes stream STA, STB thereby providing the decoded pictures A and B shown in FIGS. 22( b), (c). The splice controller 13 selects the re-encoding pictures REPA, REPB from pictures A, B by operation of switch 15. Since the streams STA, STB are decoded by two separate decoders, each set of pictures A, B are not cross-referenced and, therefore, not reordered upon decoding. In other words, the pictures which are incorrectly inserted into the wrong stream upon decoding are excluded by the splice controller 13. As shown in FIGS. 25( c), (d), for example, the B-pictures B(m−P0)+2, B(m−P0)+1 are excluded from the decoded pictures. Thus, the present invention provides a seamlessly-spliced stream which, upon decoding by the decoding-side decoder, arranges the pictures in the correct order of presentation as shown in FIG. 23( a).
The splice controller 13 in accordance with the present invention changes the picture type of the problematic pictures of the foregoing example. As illustrated in FIGS. 26( a), (b), the B-picture of stream STA at the splicing point An−P0 is changed to a P-picture which is motion estimated on the basis of the previous P-picture which is within the re-encoding range of that stream STA. The P-frame of the stream STB is changed to an I-picture which is not motion estimated. It will be appreciated that the B-pictures (B(m−PO)+1) and B(m−P0)+1) are discarded as shown in FIG. 25( c) when the P-picture is changed to an I-picture and thus do not exist in the picture stream after the re-encoding process is performed. The new picture type may require new prediction direction data and motion vectors. In at least one embodiment, the splice controller 13 provides the encode controller 43 with the encoding information such as the prediction direction and the motion vectors of a previously-coded picture. However, the present invention may also provide new motion prediction data using other techniques such as reconstructing the new picture entirely.
Referring to FIGS. 27( a) to 30(b), a method for calculating a new target amount of bits for image data in a re-encoding range to prevent underflow/overflow in the VBV buffer according to the present invention will now be described. In the figures, TRE represents the re-encode control time, OSTA represents the original stream A and STRE′ represents the stream which is re-encoded resulting in an underflow condition. OSTB represents original stream B, SPVBV represents a splicing point in the VBV buffer and SP represents a splicing point of the streams.
The problem of overflow of the VBV buffer for the spliced streams will now be described with reference to FIGS. 29( a), (b). FIG. 29( a) is a diagram showing a locus of data occupancy in the VBV buffer for the spliced stream STSP shown in FIG. 29( b). In this case, the level of the data occupancy at the splicing point is artificially-higher as compared with an original locus of the data occupancy in the VBV buffer for stream STB. As a result, the VBV buffer suffers an overflow when an I-frame is stored in the VBV buffer as shown in the figure.
Overflow occurs because the target bit rate for each picture is too small for the spliced bit stream. The reason for this is that the target bit rate is set for the smaller bit stream STB including VBVOST — B which, as will be seen from FIGS. 27( a), 29(a), is not included in the spliced bit stream STRE′. The underflow condition is the opposite case where the target bit rate is too large for the spliced bit stream STB. To compound the problem, the locus of the data occupancy of the VBV buffer becomes discontinuous at a point where the stream STRE′ to be re-encoded is switched back to the original stream OSTB which presents an additional overflow/underflow situation.
It is possible to resolve the overflow/underflow problem by controlling the locus VBVOST — B of the amount of data occupancy of the VBV buffer corresponding to the original stream OSTB. However, VBVOST — B is an optimum locus determined to prevent overflow or underflow of the original stream OSTB. If the level of the optimum locus is controlled, there is a possibility that overflow or underflow occurs.
The splice controller 13 operation for setting the new target bit rate will be discussed with reference to FIGS. 19 , 28(a), (b) and 30(a), (b). Initially, with reference to FIG. 19 , in accordance with a bit count value of stream STA and a bit count value of stream STB supplied from the stream counter 11, the splice controller 13 calculates a locus of the data occupancy of the VBV buffer for the original stream OSTA, a locus of a data occupancy of the VBV buffer for the original stream OSTB and a locus of a data occupancy of the VBV buffer for the stream STRE′ to be re-encoded in a case where stream STA and stream STB are spliced. The locus of the data occupancy of the VBV buffer in each case can be calculated by subtracting an amount of bits output from the VBV buffer corresponding to the presentation times from the bit count value supplied from the stream counter 11. Therefore, the splice controller 13 is able to virtually recognize the locus of the data occupancy of the VBV buffer for the original stream OSTA, the locus of the data occupancy of the VBV buffer for the original stream OSTB and the locus of the data occupancy of the VBV buffer for the stream STRE′ to be re-encoded in a case where stream STA and stream STB are spliced.
The splice controller 13 references the locus of the data occupancy of stream STRE′, to calculate an amount of overflow/underflow (vbv_over)/(vbv_under) of the stream STRE′ to be re-encoded. Moreover, the splice controller 13 makes reference to the data occupancy of the stream STRE′ and the locus (VBVOST — B) of the data occupancy of the original stream OSTB in the VBV buffer. The splice controller 13 calculates the gap value (vbv_gap) in the VBV buffer at the switching point between the stream STRE′ to be re-encoded and the original stream OSTB. The splice controller 13 calculates an offset amount vbv_off of a target amount of codes in accordance with the following Equations (1) and (2):
vbv — off=−(vbv — under−vbv — gap) (1)
vbv — off=+(vbv — over−vbv — gap) (2)
vbv — off=−(vbv — under−vbv — gap) (1)
vbv — off=+(vbv — over−vbv — gap) (2)
If the VBV buffer underflows as in the case shown in FIG. 27( a), Equation (1) is used to calculate the offset amount vbv_off. If the VBV buffer overflows as in the case shown in FIG. 29( a), Equation (2) is used to calculate the offset amount vbv_off.
Then, the splice controller 13 uses the offset amount vbv_off obtained in accordance with Equations (1) or (2) to calculate a target amount of codes (a target amount of bits) TBP0 in accordance with the following Equation (3):
The target amount of bits TBP0 is a value assigned to the picture which is subjected to the re-encoding process. In Equation (3), GB_A is a value indicating an amount of generated bits of a picture which is any one of pictures An−P0 to A(n−P0)+n0 in stream STA and Σ GB_A(n−P0)+i is a sum of the amount of generated bits of the pictures An−P0 to A(n−P0)+n0. Similarly, GB_B is a value indicating an amount of generated bits of a picture which is any one of pictures Bm−P0 to B(m−P0)−m0 in stream STB and Σ GB_B(m−P0)+i is a sum of the amount of generated bits of the pictures Bm−P0 to B(m−P0)−m0.
That is, the target amount of bits TBP0 expressed by Equation (3) is a value obtained by adding the offset amount vbv_off of the VBV buffer to the total amount of generated bits of the pictures A(n−P0)+n0 to B(m−P0)−m0. The offset amount vbv_off is added to correct the target amount of bits TBP0 such that the gap of the locus of the data occupancy at the switching point between the stream STSP, which is to be re-encoded, and the original stream OSTB is minimized (preferably zero). With the present invention, seamless splicing is realized.
The splice controller 13 assigns the target amount of bits TBP0 obtained in accordance with Equation (3) to the pictures A(n−p0)+n0 to B(m−P0)−m0. Usually, the quantizing characteristic of each picture is determined in such a manner that the target amount of bits TBP0 is distributed at a ratio of I picture:P picture:B picture=4:2:1. The splicing apparatus according to at least one embodiment of the present invention is not so rigid but makes reference to the quantizing characteristics including the previous quantizing steps and the quantizing matrices of the pictures A(n−P0)+n0 to B(m−P0)−m0 so as to determine a new quantizing characteristic. Specifically, the encode controller 43 makes reference to the quantizing steps and the quantizing matrices included in streams STA and STB. To prevent an excessive deviation from the quantizing characteristic realized in the previous encoder process of the encoders 1A, 1B the encode controller. 43 determines the quantizing characteristic when the re-encoding process is performed.
The present invention in accordance with the foregoing prevents underflow/overflow in the VBV buffer. FIGS. 28( a), (b) illustrate a data occupancy of the VBV buffer when a re-encoding process is performed using the target amount of bits TBP0 calculated by the splice controller 13 which resolves the problem of underflow described with reference to FIGS. 27( a), (b). FIGS. 30( a), (b) similarly illustrate a data occupancy of the VBV buffer when a re-encoding process is performed using the target amount of bits TBP0 calculated by the splice controller 13 which resolves the problem of overflow described with reference to FIGS. 29( a), (b).
The operations of the splicing and editing process according to the present invention will be described with reference to FIGS. 31 and 32 . The present invention preferably meets regulations of Annex C of ISO13818-2 and ISO11172-2 and Annex L of ISO13818-1 and of course may conform to their encoding/decoding standards.
In step S10, the splice controller 13 receives the splicing point parameter p0 for splicing the streams STA and STB and re-encoding ranges n0 and m0. It is possible that an operator inputs these parameters. The re-encoding ranges n0 and m0 may be automatically set in accordance with the configuration of the GOP of the stream or the like. In step S11, the splice controller 13 temporarily stores the streams STA and STB in the buffer memory 10. The phases of the splicing point of each of the streams STA and STB are synchronized with reference to the presentation time by controlling a reading operation of the buffer memory 10.
In step S12, the splice controller 13 selects a picture to be output for re-encoding while inhibiting a picture in stream STA appearing after the picture An−P0. Moreover, the splice controller 13 selects a picture to be output for re-encoding while inhibiting a picture appearing before the picture Bm−P0 of stream STB at the splicing point. FIGS. 25( a),(b) illustrate the situation where a P picture A(n−P0)−2 of stream STA appears after the picture An−P0 at the splicing point. In an order of presentation, picture A(n−P0)−2 is a picture in the future as compared with picture An−P0. Therefore, the P picture A(n−P0)−2 is not output in the present invention. Similarly, as shown in FIGS. 25( c) and (d), the B pictures B(m−P0)+2 and B(m−P0)+1 are before the picture Bm−P0 at the splicing point. In an order of presentation, pictures B(m−P0)+2 and B(m−P0)+1 are previous to picture Bm−P0. Therefore, the B pictures B(m−P0)+2 and B(m−P0)+1 are not output in the present invention. As described, pictures to be transmitted are selected with reference to the order of presentation, thereby preventing the problem of the presentation order described with reference to FIGS. 9( a)–(d).
In step S13, the splice controller 13 initiates a process for setting the coding parameters required to reconstruct the pictures for re-encoding in accordance with steps S14 to S30. The parameters which are set in this process include the picture type, a direction of prediction and the motion vectors for example.
In step S14, the splice controller 13 determines whether the picture to be subjected to the picture reconstruction process is the picture An−P0 at the splicing point. If so, the operation proceeds to step S15. Otherwise, the operation proceeds to step S20.
In step S15, the splice controller 13 determines whether the picture to be subjected to the picture reconstruction is a B picture, a P picture or an I picture. If the picture to be subjected to the picture reconstruction is a B picture, the operation proceeds to step S17. If the picture to be subjected to the picture reconstruction is a P picture or an I picture, the operation proceeds to step S18.
In step S16, the splice controller 13 determines whether two or more B pictures exist in front of picture An−P0 in the spliced stream STSP. For example, and as shown is FIG. 26( b), if two B pictures (A(n−P0)+2, A(n−P0)+3) exist in front of picture An−P0, the operation proceeds to step S18. Otherwise, the operation proceeds to step S17. In step S17, the splice controller 13 determines that the change of the picture type of the picture An−P0 is unnecessary. At this time, the splice controller 13 sets a picture type for use in the process for re-encoding the picture An−P0 to the same picture type (the B picture) used previously by the encoder 1A. Therefore, in the re-encoding process in this case the picture An−P0 is re-encoded as the B picture
In step S18, the splice controller 13 changes the picture type of the picture An−P0 from the B picture to the P picture. To explain, when two B pictures (A(n−P0)+2, A(n−P0)+3) exist in front of the B picture (An−P0), there are three B pictures to be re-encoded which are arranged sequentially in the stream STRE′. Since a typical MPEG decoder has only two frame memories for temporarily storing predicted pictures, the third B picture cannot be decoded. Therefore, the present invention changes the picture An−P0 type from the B picture to the P picture type as described with reference to FIGS. 26( a), (b). Thus, the picture An−P0 is reliably decoded as a P picture.
In step S19, the splice controller 13 determines that the change in the picture type of the picture An−P0 is unnecessary. At this time, the splice controller 13 sets the picture type for use when the picture An−P0 is re-encoded to the picture type (the I picture or the P picture) set previously by the encoder 1A.
In step S20, the splice controller 13 determines that the change in the picture type of the picture An−P0 is unnecessary. At this time, the splice controller 13 sets the picture type for use when the picture An−P0 is re-encoded to the picture type (the I picture, the P picture or the B picture) set previously by the encoder 1A.
In step S21, the splice controller 13 sets a direction of prediction and the motion vectors for each picture. In the example shown in FIGS. 25( a)–(d) and 26(a), (b), the picture An−P0 to be subjected to the picture reconstruction process is a B picture in the original stream OSTA. In this case, the B picture An−P0 is bi-directionally predicted from the P pictures A(n−P0)+1 and A(n−P0) 2. According to step S12, the P picture A(n−P0)−2 is inhibited from being output as the spliced stream and, thus, is prevented from becoming an inversely predicted picture of the picture An−P0 specified in the picture reconstruction process. Therefore, when the picture An−P0 is a B picture, its picture type is unchanged in step S17 and, as such, is subjected to a forward and one-sided prediction in which only the P picture of A(n−P0)+1 is employed for prediction. This is similar to the case in step S18 where the B picture is changed to the P picture such that the one-sided prediction parameter for predicting the picture An−P0 is based only on the P picture A (n−P0)+1.
The direction of prediction when the picture An−P0 is a P picture in step S19 is unchanged. That is, the splice controller 13 sets a forward and one-sided prediction for the picture An−P0 as in the previous encode process performed by the encoder 1A.
A change in the direction of prediction of the pictures A(n−P0)+n0 to A(n−P0)+1 as determined in step S20 is unnecessary. That is, the splice controller 13 sets a direction of prediction for the pictures A(n−P0)+n0 to A(n−P0)+1 as set previously by the encoder 1A. If the two pictures A(n−P0)+1 and An−P0 are B pictures predicted from two directions from the forward-directional P picture or I picture and the inverse-directional I picture or the P picture, the prediction for the picture A(n−p0)+1 as well as the picture An−P0 must be changed to one-sided prediction such that prediction is performed from only the forward-directional picture.
In step S21, the splice controller 13 determines whether the motion vectors for each picture in the previous encode process performed by the encoder 1A is reused when the re-encoding process is performed in accordance with the newly set direction of prediction. As described above, the motion vectors used in a previous encode process performed by the encoder 1A are the same as in the re-encoding process, i.e., employed for the P picture and the B picture when the direction of prediction of each has not changed. In the examples shown in FIGS. 23( a), (b) and 26(a), (b), the motion vectors used in the previous encode process performed by the encoder 1A are reused when the pictures A(n−P0)+n0 to A(n−P0)+1 are re-encoded. When the picture A(n−P0)+1 and the picture An−P0 are B pictures predicted from both directions from a P picture or an I picture in the forward direction and an I picture or a P picture in the reverse direction, the prediction is changed to one-sided prediction in which prediction is performed in only a forward-directional picture. Therefore, only motion vectors corresponding to the forward-directional picture are used. That is, when the picture A(n−P0)+1 and the picture An−P0 are B pictures, the splice controller 13 sets the prediction direction such that the motion vector for the forward-directional picture is used and the inverse-directional motion vector is not used in step S21.
If the picture An−P0 is a picture predicted in one direction, e.g., the inverse direction from only a future picture such as A(n−P0)−2, the motion vectors produced in the previous encoder process performed by the encoder 1A are not used. In this case, new motion vectors corresponding to A(n−p0)+1 are produced. That is, the splice controller 13 sets the direction of prediction in step S21 such that any previous motion vectors are not used.
In step S22, the splice controller 13 determines whether all parameters of the picture type, the direction of prediction and previous motion vectors of the pictures from pictures A(n−P0)+n0 to An−P0 are set. If so, control proceeds to step S23.
In step S23, the splice controller 13 determines whether the picture to be subjected to the picture reconstruction process is a picture Bm−P0 at the splicing point. If so, the operation proceeds to step S24. Otherwise, if the picture to be subjected to the picture reconstruction is any one of pictures B(m−P0)−1 to B(m−p0)+m0, the operation proceeds to step S28. In step S24, the splice controller 13 determines whether the picture to be subjected to the picture reconstruction process is a B picture, a P picture or an I picture. If the picture to be subjected to the picture reconstruction process is a B picture, the operation proceeds to step S25. If the picture to be subjected to the picture reconstruction process is a P picture, the operation proceeds to step S26. If the picture to be subjected to the picture reconstruction process is an I picture, the operation proceeds to step S27.
In step S25, the splice controller 13 determines that a change in the picture type of the picture Bm−P0 in the re-encoding process is unnecessary as in the example shown in FIGS. 22( a)–(d) and 23(a), (b). Thus, the splice controller 13 sets the picture type for use in a re-encoding process of the picture Bm−P0 to the same picture type (the B picture) as set previously by the encoder 1B.
In step S26, the splice controller 13 changes the picture type of the picture Bm−P0 from the P picture to the I picture as in the examples shown in FIGS. 25( a)–(d) and 26(a), (b). The reason will now be described. Since the P picture is a one-sided prediction picture which is predicted from the forward-directional I- or P-picture, the P picture is always positioned behind the pictures used for prediction on the stream. If the first picture Bm−P0 at the splicing point in the stream STB is a P picture, prediction must be performed from a forward-directional picture of the stream STA which exists in front of the picture Bm−P0. Since the streams STA, STB are different, it is apparent that the quality of the image obtained by a decoding process deteriorates considerably if the picture type of the first picture Bm−P0 is set to the P picture. In this case, the splice controller 13 changes the picture type of the B picture to the I picture.
In step S27, the splice controller 13 determines that a change in the picture type of the picture Bm−P0 is unnecessary. Thus, the splice controller 13 sets the picture for use in the re-encoding process of the picture Bm−P0 to the same picture type (I picture) set previously by the encoder 1B.
In step S28, the splice controller 13 determines that a change in the picture type of the pictures B(m−P0)−1 to B(m−P0)−m0 is unnecessary. The splice controller 13 sets the picture for use in the re-encoding process of each of the foregoing pictures to the same picture type (the I picture, the P picture or the B picture) set previously by the encoder 1B.
In step S29, the splice controller 13 sets a direction of prediction and motion vectors for each picture. If the picture Bm−P0 to be subjected to the picture reconstruction process is, in the original stream OSTB, a B picture as in the example shown in FIGS. 22( a)–(d) and 23(a), (b), the picture Bm−P0 is a picture predicted from two directions, i.e., from the P picture B(m−P0)+1 and the I picture B(m−P0)−2. As described in step S12, the P picture of B(mPO)+1 is not output as a splicing stream and, therefore, is not specified as a forward-directional prediction picture for the picture Bm−P0 to be subjected to the picture reconstruction process. Therefore, the picture Bm−P0 which is set such that a change in its picture type is unnecessary in step S25 must be set to perform an inverse and one-sided prediction such that only the I picture B(m−P0)−2 is predicted. Therefore, the splice controller 13 sets a direction of prediction for the picture Bm−P0 to perform the inverse and one-side prediction such that only the I picture B(m−P0)−2 is used in the prediction.
A change in the direction of prediction of the pictures B(m−P0)+m0 to B(m−P0)+1 in step S28 is deemed unnecessary. In this case, the splice controller 13 sets a direction of prediction for the pictures B(m−P0)+m0 to B(m−P0)+1 to the same picture previously set by the encoder 1B. If the picture B(m−P0)−1 is a B picture, a direction of prediction for the B(m−P0)−1 is set such that inverse and one-sided prediction is performed so that only the I picture of B(m−P0)−2 is predicted. This is similar to the foregoing case in which the picture Bm−P0 is predicted.
In accordance with the newly set direction of prediction, the splice controller 13 determines in step S29 whether the motion vectors set previously are reused for each picture when the re-encoding process is performed. As described above, the re-encoding process is performed such that the motion vectors used in a previous encode process of the encoder 1B are reused for the P pictures and the B pictures when the prediction direction has not been changed. For example, in FIGS. 22( a)–(d) and 23(a), (b), the motion vectors used in a previous encode process are used for the pictures from the I picture B(m−P0)−2 to the P picture B(m−P0)−m0. The direction of prediction for each of the pictures Bm−P0 and B(m−P0)−1 predicted from the two directions, e.g., from the P picture B(m−P0)+1 and the I picture of B(m−P0)−2, in a previous encoder process performed by the encoder 1B is changed to one-sided prediction such that only the I picture B(m−P0)−2 is used for prediction. Therefore, the motion vectors corresponding to the picture B(m−P0)+1 are not used. That is, in step S29, the splice controller 13 reuses the previous motion vectors for only one direction for the pictures Bm−P0 and B(m−P0)−1. That is, the motion vectors for the inverse direction are not used.
Next, in step S30, the splice controller 13 determines whether the parameters relating to the picture type, the direction of prediction and the motion vectors for all of the pictures from the picture Bm−P0 to the picture B(m−P0)−m0 are set. If so, the splice controller 13 in step S31 calculates a target amount of bits (TBP0) to be generated in the re-encoding period in accordance with Equation (3). Specifically, the splice controller 13 initially calculates a locus of the data occupancy of the VBV buffer for the original stream OSTA, a locus of the data occupancy of the VBV buffer for the original stream OSTB and a locus of the data occupancy of the VBV buffer for the stream STRE′ to be encoded in a case where streams STA, STB are spliced in accordance with a bit count value of stream STA and the bit count value of stream STB supplied from the stream counter 11. Then, the splice controller 13 analyzes the virtually-obtained locus of the data occupancy of the VBV buffer for the stream STRE′ to be re-encoded.
Thus, the splice controller 13 calculates an amount of underflow (vbv_under) or an amount of overflow (vbv_over) of the stream STRE′ to be re-encoded. Moreover, the splice controller 13 compares the virtually-obtained locus of the data occupancy of the VBV buffer for stream STRE′ to be re-encoded and a locus (VBVOST — B) of the data occupancy in the VBV buffer for the original stream OSTB. Thus, the splice controller 13 calculates a gap value (vbv_gap) of the VBV buffer at a switching point between stream STRE′ to be re-encoded and the original stream OSTB. Then, the splice controller 13 calculates an offset amount vbv_off of the target amount of codes in accordance with Equations (1) and (2). Then, the splice controller 13 uses the offset amount vbv_off calculated in accordance with Equation (1) or (2) to calculate a target amount of codes (target amount of bits) TBP0 in accordance with Equation (3).
In step S32, the splice controller 13 determines a quantizing characteristic to be set for each picture. The quantizing characteristic is determined in accordance with an assignment to the pictures A(n−P0)+n0 to B(m−P0)−m0 of the target amount of bits TBP0 calculated in accordance with Equation (3). The splicing apparatus according to the present invention makes reference to quantizing characteristics including the previous quantizing steps and the quantizing matrices of each of the pictures A(n−P0)+n0 to B(m−P0)−m0 used by the encoders 1A and 1B so as to determine new quantizing characteristics. Specifically, the splice controller 13 initially receives from the stream analyzing portion 12 information about the coding parameters, quantizing steps and quantizing matrices produced in a previous coding process performed by the encoders 1A and 1B and included in the streams STA, STB.
Further, the splice controller 13 makes reference to the amounts of codes (bits) assigned to the target amount of bits TBP0 and information of the previous coding parameters. The splice controller 13 determines the quantizing characteristics when the re-encoding process is performed so as to prevent excessive deviation from the quantizing characteristics in the encoding processes performed by the encoders 1A and 1B. As described in steps S18 and S26, the quantizing characteristics of the pictures, the picture type of each of which has been changed by the picture reconstruction process, are newly calculated when the re-encoding process is performed without reference to the information of the quantizing steps and the quantizing matrices.
In step S33, the splice controller 13 decodes the pictures A(n−P0)+n0 to B(m−P0)−m0 included in the re-encoding range. In step S34, the splice controller 13 uses the quantizing characteristics set to pictures A(n−P0)+n0 to B(m−P0)−m0 while controlling the amount of generated bits. If the splice controller 13 reuses the previous motion vectors, the encode controller 43, at the control of the splice controller 13, causes switch 44 to channel the previous motion vectors to the motion compensation portion 41. When the previous motion vectors are not used, the encode controller 43 controls the switch 44 to channel the motion vectors newly produced by the motion detection circuit 42 to the motion compensation portion 41. At this time, the encode controller 43 controls the frame memories 39 and 40 in accordance with information about the picture type supplied from the splice controller 13 to store the pictures required to produce predicted image data. The encode controller 43 sets, to the quantizing circuit 34 and the inverse quantization circuit 36, the quantizing characteristics in the re-encoding range supplied from the splice controller 13.
In step S35, the splice controller 13 controls the switch 17 to selectively output stream STA from the buffer 10, stream STB from the buffer 10 or the re-encoded stream STRE from the MPEG encoder 16. Thus, the splice controller 13 seamlessly-splices stream STA which appears before the re-encoding range, re-encoded stream STRE in the re-encoding range and stream STB which appears after the re-encoding range to provide seamlessly-spliced bit stream STSP.
Although preferred embodiments of the present invention and modifications thereof have been described in detail herein, it is to be understood that this invention is not limited to those precise embodiments and modifications, and that other modifications and variations may be affected by one skilled in the art without departing from the spirit and scope of the invention as defined by the appended claims.
Claims (35)
1. A splicing apparatus for splicing a plurality of source encoded bit streams, comprising:
splicing-point setting means for setting an intended splicing point for said plurality of source encoded bit streams;
determining means for determining an amount of each of said encoded bit streams to be decoded in a region preceding and succeeding said intended splicing point;
decoding means for decoding pictures included in a plurality of each of said source encoded bit streams in said region of said intended splicing point to generate decoded video data;
splicing means for splicing decoded bit streams of said intended splicing point;
re-encoding means for re-encoding said decoded video data to generate a re-encoded stream;
spliced-stream producing means for switching between said source encoded bit streams and said re-encoded stream to produce a spliced stream; and
splice control means for controlling said re-encoding means and said spliced-stream producing means so as to prevent a discontinuity of said spliced stream when said spliced stream is decoded.
2. A splicing apparatus according to claim 1 , wherein a decoding-side for decoding said spliced stream includes a video buffer verifier (VBV); wherein the splice control means calculates a target amount of bits upon re-encoding said decoded video data by the re-encoding means so as to prevent overflow and underflow of said VBV buffer; and
the re-encoding means encodes the decoded video data in accordance with the target amount of bits generated by the splice control means.
3. The splicing apparatus according to claim 1 wherein the splice control means fetches coding parameters included in the source encoded streams and selectively reuses the fetched coding parameters when re-encoding is performed by the re-encoding means so as to prevent deterioration in image quality of the spliced stream.
4. The splicing apparatus according to claim 1 , wherein the splice control means fetches information about a quantizing characteristic included in the source encoded streams and controls the re-encoding means to perform the re-encoding process in accordance with the fetched quantizing characteristic.
5. The splicing apparatus according to claim 1 , wherein a decoding-side decodes said spliced stream includes a video buffer verifier (VBV) characterized by a data occupancy;
wherein the splice control means fetches for each picture quantizer information included in the source encoded streams,
calculates a target amount of bits for each picture re-encoded in re-encoding said decoded video data by the re-encoding means so as to prevent overflow and underflow of the VBV buffer,
fetches, for each picture, quantizer information included in the source encoded streams and calculates new quantizer information in accordance with the quantizer information fetched from the source encoded stream and the calculated target amount of bits, and
controls the re-encoding means to cause the re-encoding means to perform the re-encoding process in accordance with the new quantizer information.
6. The splicing apparatus according to claim 1 , wherein a decoding-side decodes said spliced stream includes a video buffer verifier (VBV) characterized by a data occupancy;
wherein the splice control means calculates a target amount of bits in re-encoding said decoded video data by the re-encoding means so as to prevent overflow and underflow of the VBV buffer,
assigns the target amount of bits to each picture to be re-encoded by making reference to a quantizer information which had been generated in a previous coding process and included in the source encoded streams, and
controls the re-encoding means to perform the re-encoding process for each picture in accordance with the target amount of bits assigned to each picture.
7. The splicing apparatus according to claim 1 , wherein a decoding-side decodes said spliced stream includes a video buffer verifier (VBV) characterized by a data occupancy;
wherein the splice control means calculates a target amount of bits in re-encoding the decoded video data by the re-encoding means so as to prevent overflow and underflow of the VBV buffer,
assigns the target amount of bits to each picture to be re-encoded so as to approximate the target amount of bits to an amount of generated bits for each picture in a previous coding process of the source encoded streams, and
controls the re-encoding means to perform re-encoding for each picture in accordance with the target amount of bits assigned.
8. The splicing apparatus according to claim 1 , wherein the splice control means selectively reuses motion vector information fetched from the source coded streams in the re-encoding process which is performed by the re-encoding means so as to prevent deterioration in the image quality of the spliced stream.
9. The splicing apparatus according to claim 1 , wherein the re-encoding means incorporates motion detection means for detecting motion of each picture to produce a motion vector, and
motion compensation means for performing motion compensation in accordance with the motion vector detected by the motion detection means, and
the splice control means determines whether motion vector information fetched from the source coded streams is reused in re-encoding the decoded video data by the re-encoding means, and
controls the re-encoding means to supply the motion vector fetched from the stream to a motion compensation circuit of the re-encoding means in place of the motion vector detected by the motion detection means when a determination has been made that the motion vector information is reused.
10. The splicing apparatus according to claim 1 , wherein the splice control means set a direction of prediction of a picture which is positioned adjacent to the spliced point and which is subjected to the re-encoding process by the re-encoding means so as to prevent prediction from pictures in different source coded streams positioned opposite with respect to the spliced point.
11. The splicing apparatus according to claim 1 , wherein the splice control means selectively changes a picture type of a picture which is positioned adjacent the spliced point and which is re-encoded by the re-encoding means so as to prevent deterioration in image quality of the pictures included in the spliced stream and positioned adjacent to the spliced points.
12. The splicing apparatus according to claim 1 , wherein the splice control means selectively changes the picture type of a picture which is re-encoded by the re-encoding means and which is positioned adjacent to the spliced point so as to prevent prediction from pictures in different source coded streams positioned opposite with respect to the spliced point.
13. The splicing apparatus according to claim 1 , wherein the plurality of source coded streams include at least a first encoded stream and a second encoded stream, and
the splice control means selects output pictures from the plurality of pictures which constitute the first encoded stream so as to prevent an output of a future picture in a time axis of presentation as the spliced stream to a first spliced point set to the first encoded stream, and
selects output pictures from the plurality of pictures which constitute the second encoded stream so as to prevent an output of a previous picture in a time axis of presentation to a second spliced point set to the second coded stream.
14. The splicing apparatus according to claim 1 , wherein a decoding-side decodes said spliced stream includes a video buffer verifier (VBV) characterized by a data occupancy;
wherein the plurality of source encoded streams include at least a first encoded stream and a second encoded stream, and
the splice control means selects output pictures from the plurality of pictures which constitute the first encoded stream so as to prevent an output of a future picture in a time axis of presentation as the spliced stream to a first spliced point set to the first encoded stream,
selects output pictures from the plurality of pictures which constitute the second encoded stream so as to prevent an output of a previous picture in a time axis of presentation to a second spliced point set to the second coded stream,
sets a picture type and a direction of prediction of a picture which is re-encoded by the re-encoding means and which is positioned adjacent to the spliced point, sets motion vector information fetched from the source encoded streams to a picture which is reused in the re-encoding process which is performed by the re-encoding means,
calculates a target amount of bits in the re-encoding process which is performed by the re-encoding means so as to prevent overflow and underflow of the VBV buffer,
assigns the target amount of bits to each of pictures to be re-encoded so as to approximate the target amount of bits to an amount of generated bits for each picture in a previous coding process of the source coded stream, and
controls the re-encoding means to cause the re-encoding means to perform the re-encoding process for each picture in accordance with the target amount of bits assigned to each picture, the direction of prediction, the picture type and the motion vector.
15. The splicing apparatus according to claim 1 , wherein said splice control means is an editing means for editing said source encoded streams.
16. A splicing method for splicing a plurality of source encoded bit streams to produce a spliced stream, comprising the steps of:
setting intended splicing points for said plurality of source encoded bit streams;
determining an amount of each of said encoded bit streams to be decoded in a region preceding and succeeding said intended splicing point;
decoding pictures in said region of said splicing points of a plurality of each of said source encoded bit streams and generating decoded video data;
splicing decoded bit streams of said intended splicing point;
re-encoding said decoded video data to generate a re-encoded stream;
performing switching between said source encoded bit streams and said re-encoded stream to effect output so as to produce said spliced stream; and
controlling the re-encoding step and said spliced stream producing step so as to prevent a discontinuity of said spliced stream when said spliced stream is decoded.
17. The splicing method according to claim 16 , wherein a decoding-side for decoding the spliced stream includes a video buffer verifier (VBV);
wherein the splice control step calculates a target amount of bits upon re-encoding said decoded video data in the re-encoding step so as to prevent overflow and underflow of a VBV buffer; and
the re-encoding step encodes the decoded video data in accordance with the target amount of bits generated in the splice control step.
18. The splicing method according to claim 16 , wherein the splice control step fetches a coding parameter included in the source encoded streams and selectively reuses the fetched coding parameter when the re-encoding process is performed in the re-encoding step so as to prevent deterioration in image quality of the spliced stream.
19. The splicing method according to claim 16 , wherein the splice control step fetches quantizer information included in the source encoded streams and controls the re-encoding step to perform the re-encoding process in accordance with the fetched quantizer information.
20. The splicing method according to claim 16 , wherein said spliced control step edits the source encoded streams.
21. A splicing apparatus for splicing a plurality of source encoded bit streams, comprising:
splicing-pointer for selling an intended splicing point for said plurality of source encoded bit streams;
a calculator for determining an amount of each of said encoded bit streams to be decoded in a region preceding and succeeding said intended splicing point;
a decoder for decoding pictures included in a plurality of each of said source encoded bit streams in said region of said splicing point to generate decoded video data;
a splicer for splicing decoded bit streams of said intended splicing point;
a re-encoder for re-encoding said decoded video data to generate a re-encoded stream;
a spliced-stream switcher for switching between said source encoded bit streams and said re-encoded stream to produce a spliced stream; and
a splice controller for controlling said re-encoder and said spliced-stream switcher so as to prevent a discontinuity of said spliced stream when said spliced stream is decoded.
22. The splicing apparatus according to claim 21 , wherein a decoding-side for decoding said spliced stream includes a video buffer verifier (VBV); wherein the splice controller calculates a target amount of bits upon re-encoding said decoded video data by the re-encoder so as to prevent overflow and underflow of said VBV buffer; and
the re-encoder encodes the decoded video data in accordance with the target amount of bits generated by the splice controller.
23. The splicing apparatus according to claim 21 , wherein the splice controller fetches coding parameters included in the source encoded streams and selectively reuses the fetched coding parameters when re-encoding is performed by the re-encoder so as to prevent deterioration in image quality of the spliced stream.
24. The splicing apparatus according to claim 21 , wherein the splice controller fetches information about a quantizing characteristic included in the source encoded streams and controls the re-encoder to perform the re-encoding process in accordance with the fetched quantizing characteristic.
25. The splicing apparatus according to claim 21 , wherein a decoding-side decodes said spliced stream includes a video buffer verifier (VBV) characterized by a data occupancy;
wherein the splice controller fetches for each picture quantizer information included in the source encoded streams,
calculates a target amount of bits for each picture re-encoded in re-encoding said decoded video data by the re-encoder so as to prevent overflow and underflow of the VBV buffer,
fetches, for each picture, quantizer information included in the source encoded streams and calculates new quantizer information in accordance with the quantizer information fetched from the source encoded stream and the calculated target amount of bits, and
controls the re-encoder to cause the re-encoder to perform the re-encoding process in accordance with the new quantizer information.
26. The splicing apparatus according to claim 21 , wherein a decoding-side decodes said spliced stream includes a video buffer verifier (VBV) characterized by a data occupancy;
wherein the splice controller calculates a target amount of bits in re-encoding said decoded video data by the re-encoder so as to prevent overflow and underflow of the VBV buffer,
assigns the target amount of bits to each picture to be re-encoded by making reference to a quantizer information which had been generated in a previous coding process and included in the source encoded streams, and
controls the re-encoder to perform the re-encoding process for each picture in accordance with the target amount of bits assigned to each picture.
27. The splicing apparatus according to claim 21 , wherein a decoding-side decodes said spliced stream includes a video buffer verifier (VBV) characterized by a data occupancy;
wherein the splice controller calculates a target amount of bits in re-encoding the decoded video data by the re-encoder so as to prevent overflow and underflow of the VBV buffer,
assigns the target amount of bits to each picture to be re-encoded so as to approximate the target amount of bits to an amount of generated bits for each picture in a previous coding process of the source encoded streams, and
controls the re-encoder to perform re-encoding for each picture in accordance with the target amount of bits assigned.
28. The splicing apparatus according to claim 21 , wherein the splice controller selectively reuses motion vector information fetched from the source coded streams in the re-encoding process which is performed by the re-encoder so as to prevent deterioration in the image quality of the spliced stream.
29. The splicing apparatus according to claim 21 , wherein the re-encoder incorporates a motion detector for detecting motion of each picture to produce a motion vector, and
a motion compensator for performing motion compensation in accordance with the motion vector detected by the motion detector, and
the splice controller determines whether motion vector information fetched from the source coded streams is reused in re-encoding the decoded video data by the re-encoder, and
controls the re-encoder to supply the motion vector fetched from the stream to a motion compensation circuit of the re-encoder in place of the motion vector detected by the motion detector when a determination has been made that the motion vector information is reused.
30. The splicing apparatus according to claim 21 , wherein the splice controller set a direction of prediction of a picture which is positioned adjacent to the spliced point and which is subjected to the re-encoding process by the re-encoder so as to prevent prediction from pictures in different source coded streams positioned opposite with respect to the spliced point.
31. The splicing apparatus according to claim 21 , wherein the splice controller selectively changes a picture type of a picture which is positioned adjacent the spliced point and which is re-encoded by the re-encoder so as to prevent deterioration in image quality of the pictures included in the spliced stream and positioned adjacent to the spliced points.
32. The splicing apparatus according to claim 21 , wherein the splice controller selectively changes the picture type of a picture which is re-encoded by the re-encoder and which is positioned adjacent to the spliced point so as to prevent prediction from pictures in different source coded streams positioned opposite with respect to the spliced point.
33. The splicing apparatus according to claim 21 , wherein the plurality of source coded streams include at least a first encoded stream and a second encoded stream, and
the splice controller selects output pictures from the plurality of pictures which constitute the first encoded stream so as to prevent an output of a future picture in a time axis of presentation as the spliced stream to a first spliced point set to the first encoded stream, and
selects output pictures from the plurality of pictures which constitute the second encoded stream so as to prevent an output of a previous picture in a time axis of presentation to a second spliced point set to the second coded stream.
34. The splicing apparatus according to claim 21 , wherein a decoding-side decodes said spliced stream includes a video buffer verifier (VBV) characterized by a data occupancy;
wherein the plurality of source encoded streams include at least a first encoded stream and a second encoded stream, and
the splice controller selects output pictures from the plurality of pictures which constitute the first encoded stream so as to prevent an output of a future picture in a time axis of presentation as the spliced stream to a first spliced point set to the first encoded stream,
selects output pictures from the plurality of pictures which constitute the second encoded stream so as to prevent an output of a previous picture in a time axis of presentation to a second spliced point set to the second coded stream,
sets a picture type and a direction of prediction of a picture which is re-encoded by the re-encoder and which is positioned adjacent to the spliced point,
sets motion vector information fetched from the source encoded streams to a picture which is reused in the re-encoding process which is performed by the re-encoder,
calculates a target amount of bits in the re-encoding process which is performed by the re-encoder so as to prevent overflow and underflow of the VBV buffer,
assigns the target amount of bits to each of pictures to be re-encoded so as to approximate the target amount of bits to an amount of generated bits for each picture in a previous coding process of the source coded stream, and
controls the re-encoder to cause the re-encoder to perform the re-encoding process for each picture in accordance with the target amount of bits assigned to each picture, the direction of prediction, the picture type and the motion vector.
35. The splicing apparatus according to claim 21 , wherein said splice controller is an editor for editing said source encoded streams.
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/282,784 US7139316B2 (en) | 1997-07-25 | 2002-10-29 | System method and apparatus for seamlessly splicing data |
US11/586,245 US7711051B2 (en) | 1997-07-25 | 2006-10-25 | System method and apparatus for seamlessly splicing data |
US11/591,063 US8923409B2 (en) | 1997-07-25 | 2006-11-01 | System method and apparatus for seamlessly splicing data |
US11/591,073 US8798143B2 (en) | 1997-07-25 | 2006-11-01 | System method and apparatus for seamlessly splicing data |
US11/642,369 US8223847B2 (en) | 1997-07-25 | 2006-12-19 | Editing device, editing method, splicing device, splicing method, encoding device, and encoding method |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP19992397 | 1997-07-25 | ||
JP9-199923 | 1997-07-25 | ||
PCT/JP1998/003332 WO1999005864A1 (en) | 1997-07-25 | 1998-07-27 | Editing device, editing method, splicing device, splicing method, encoding device, and encoding method |
US09/275,999 US6567471B1 (en) | 1997-07-25 | 1999-03-25 | System method and apparatus for seamlessly splicing data |
US10/282,784 US7139316B2 (en) | 1997-07-25 | 2002-10-29 | System method and apparatus for seamlessly splicing data |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/275,999 Continuation US6567471B1 (en) | 1997-07-25 | 1999-03-25 | System method and apparatus for seamlessly splicing data |
Related Child Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/586,245 Continuation US7711051B2 (en) | 1997-07-25 | 2006-10-25 | System method and apparatus for seamlessly splicing data |
US11/591,073 Continuation US8798143B2 (en) | 1997-07-25 | 2006-11-01 | System method and apparatus for seamlessly splicing data |
US11/591,063 Continuation US8923409B2 (en) | 1997-07-25 | 2006-11-01 | System method and apparatus for seamlessly splicing data |
Publications (2)
Publication Number | Publication Date |
---|---|
US20030067989A1 US20030067989A1 (en) | 2003-04-10 |
US7139316B2 true US7139316B2 (en) | 2006-11-21 |
Family
ID=16415853
Family Applications (6)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/275,999 Expired - Lifetime US6567471B1 (en) | 1997-07-25 | 1999-03-25 | System method and apparatus for seamlessly splicing data |
US10/282,784 Expired - Lifetime US7139316B2 (en) | 1997-07-25 | 2002-10-29 | System method and apparatus for seamlessly splicing data |
US11/586,245 Expired - Fee Related US7711051B2 (en) | 1997-07-25 | 2006-10-25 | System method and apparatus for seamlessly splicing data |
US11/591,063 Expired - Fee Related US8923409B2 (en) | 1997-07-25 | 2006-11-01 | System method and apparatus for seamlessly splicing data |
US11/591,073 Expired - Fee Related US8798143B2 (en) | 1997-07-25 | 2006-11-01 | System method and apparatus for seamlessly splicing data |
US11/642,369 Expired - Fee Related US8223847B2 (en) | 1997-07-25 | 2006-12-19 | Editing device, editing method, splicing device, splicing method, encoding device, and encoding method |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/275,999 Expired - Lifetime US6567471B1 (en) | 1997-07-25 | 1999-03-25 | System method and apparatus for seamlessly splicing data |
Family Applications After (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/586,245 Expired - Fee Related US7711051B2 (en) | 1997-07-25 | 2006-10-25 | System method and apparatus for seamlessly splicing data |
US11/591,063 Expired - Fee Related US8923409B2 (en) | 1997-07-25 | 2006-11-01 | System method and apparatus for seamlessly splicing data |
US11/591,073 Expired - Fee Related US8798143B2 (en) | 1997-07-25 | 2006-11-01 | System method and apparatus for seamlessly splicing data |
US11/642,369 Expired - Fee Related US8223847B2 (en) | 1997-07-25 | 2006-12-19 | Editing device, editing method, splicing device, splicing method, encoding device, and encoding method |
Country Status (7)
Country | Link |
---|---|
US (6) | US6567471B1 (en) |
EP (3) | EP1467563A1 (en) |
JP (5) | JP3736808B2 (en) |
KR (2) | KR100555164B1 (en) |
CN (1) | CN1161989C (en) |
DE (1) | DE69841897D1 (en) |
WO (1) | WO1999005864A1 (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040179553A1 (en) * | 2001-04-20 | 2004-09-16 | Marcus Wiklund | Method and apparatus for localizing data |
US20040247188A1 (en) * | 2003-06-03 | 2004-12-09 | Ajay Luthra | Method for restructuring a group of pictures to provide for random access into the group of pictures |
US20060007958A1 (en) * | 2004-07-12 | 2006-01-12 | Samsung Electronics Co., Ltd. | Multiplexing method and apparatus to generate transport stream |
US20060044163A1 (en) * | 2004-08-24 | 2006-03-02 | Canon Kabushiki Kaisha | Image reproduction apparatus, control method thereof, program and storage medium |
US7305040B1 (en) * | 1998-01-19 | 2007-12-04 | Sony Corporation | Edit system, edit control device, and edit control method |
US20070286244A1 (en) * | 2006-06-13 | 2007-12-13 | Sony Corporation | Information processing apparatus and information processing method |
US20080317125A1 (en) * | 2004-08-11 | 2008-12-25 | Tomokazu Murakami | Bit Stream Recording Medium, Video Encoder, and Video Decoder |
US20090180758A1 (en) * | 2004-06-02 | 2009-07-16 | Tadamasa Toma | Multiplexing apparatus and demultiplexing apparatus |
US20100104022A1 (en) * | 2008-10-24 | 2010-04-29 | Chanchal Chatterjee | Method and apparatus for video processing using macroblock mode refinement |
WO2010057027A1 (en) * | 2008-11-14 | 2010-05-20 | Transvideo, Inc. | Method and apparatus for splicing in a compressed video bitstream |
Families Citing this family (101)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0725399U (en) * | 1993-10-19 | 1995-05-12 | 廉正 赤澤 | Mission oil cleaner |
GB2327548B (en) * | 1997-07-18 | 2002-05-01 | British Broadcasting Corp | Switching compressed video bitstreams |
KR100555164B1 (en) * | 1997-07-25 | 2006-03-03 | 소니 가부시끼 가이샤 | Editing device, editing method, re-encoding device, re-encoding method, splicing device, and splicing method |
JP2000138896A (en) * | 1998-10-30 | 2000-05-16 | Hitachi Ltd | Image audio recorder |
JP2000165802A (en) | 1998-11-25 | 2000-06-16 | Matsushita Electric Ind Co Ltd | Stream edit system and edit method |
WO2000062551A1 (en) * | 1999-04-14 | 2000-10-19 | Sarnoff Corporation | Frame-accurate seamless splicing of information streams |
JP2000354249A (en) * | 1999-04-16 | 2000-12-19 | Sony United Kingdom Ltd | Video signal processor, video signal processing method and computer program product |
JP4689001B2 (en) * | 1999-04-16 | 2011-05-25 | ソニー ヨーロッパ リミテッド | Video signal processing apparatus, computer program, and video signal processing method |
GB9908809D0 (en) | 1999-04-16 | 1999-06-09 | Sony Uk Ltd | Signal processor |
JP4487374B2 (en) * | 1999-06-01 | 2010-06-23 | ソニー株式会社 | Encoding apparatus, encoding method, multiplexing apparatus, and multiplexing method |
US7254175B2 (en) | 1999-07-02 | 2007-08-07 | Crystalmedia Technology, Inc. | Frame-accurate seamless splicing of information streams |
GB2353653B (en) * | 1999-08-26 | 2003-12-31 | Sony Uk Ltd | Signal processor |
US6985188B1 (en) * | 1999-11-30 | 2006-01-10 | Thomson Licensing | Video decoding and channel acquisition system |
US6778533B1 (en) | 2000-01-24 | 2004-08-17 | Ati Technologies, Inc. | Method and system for accessing packetized elementary stream data |
US6885680B1 (en) | 2000-01-24 | 2005-04-26 | Ati International Srl | Method for synchronizing to a data stream |
US6785336B1 (en) | 2000-01-24 | 2004-08-31 | Ati Technologies, Inc. | Method and system for retrieving adaptation field data associated with a transport packet |
US8284845B1 (en) | 2000-01-24 | 2012-10-09 | Ati Technologies Ulc | Method and system for handling data |
US6763390B1 (en) * | 2000-01-24 | 2004-07-13 | Ati Technologies, Inc. | Method and system for receiving and framing packetized data |
US6988238B1 (en) | 2000-01-24 | 2006-01-17 | Ati Technologies, Inc. | Method and system for handling errors and a system for receiving packet stream data |
JP2001218213A (en) * | 2000-01-31 | 2001-08-10 | Mitsubishi Electric Corp | Image signal conversion coder |
JP4170555B2 (en) * | 2000-02-28 | 2008-10-22 | 株式会社東芝 | Video encoding apparatus and video encoding method |
GB0007868D0 (en) | 2000-03-31 | 2000-05-17 | Koninkl Philips Electronics Nv | Methods and apparatus for editing digital video recordings and recordings made by such methods |
JP2002010259A (en) * | 2000-06-21 | 2002-01-11 | Mitsubishi Electric Corp | Image encoding apparatus and its method and recording medium recording image encoding program |
US7490344B2 (en) | 2000-09-29 | 2009-02-10 | Visible World, Inc. | System and method for seamless switching |
US7095945B1 (en) * | 2000-11-06 | 2006-08-22 | Ati Technologies, Inc. | System for digital time shifting and method thereof |
JP2002281433A (en) * | 2001-03-15 | 2002-09-27 | Kddi Corp | Device for retrieving and reading editing moving image and recording medium |
US20020133486A1 (en) * | 2001-03-15 | 2002-09-19 | Kddi Corporation | Video retrieval and browsing apparatus, video retrieval, browsing and editing apparatus, and recording medium |
US7349691B2 (en) * | 2001-07-03 | 2008-03-25 | Microsoft Corporation | System and apparatus for performing broadcast and localcast communications |
US6965597B1 (en) * | 2001-10-05 | 2005-11-15 | Verizon Laboratories Inc. | Systems and methods for automatic evaluation of subjective quality of packetized telecommunication signals while varying implementation parameters |
KR100454501B1 (en) * | 2001-12-26 | 2004-10-28 | 브이케이 주식회사 | Apparatus for prediction to code or decode image signal and method therefor |
KR100475412B1 (en) * | 2002-03-11 | 2005-03-10 | 주식회사 럭스퍼트 | Top-pumped optical device and its array |
DE10212656A1 (en) * | 2002-03-21 | 2003-10-02 | Scm Microsystems Gmbh | Selective encryption of multimedia data |
US7151856B2 (en) * | 2002-04-25 | 2006-12-19 | Matsushita Electric Industrial Co., Ltd. | Picture coding apparatus and picture coding method |
US9948977B2 (en) * | 2003-01-09 | 2018-04-17 | Avago Technologies General Ip (Singapore) Pte. Ltd. | System, method, and apparatus for determining presentation time for picture without presentation time stamp |
US7426306B1 (en) * | 2002-10-24 | 2008-09-16 | Altera Corporation | Efficient use of keyframes in video compression |
FR2848766B1 (en) * | 2002-12-13 | 2005-03-11 | Thales Sa | METHOD FOR SWITCHING DIGITAL SIGNALS BEFORE TRANSMITTING, SWITCH AND SIGNAL RESULTING |
US8213779B2 (en) | 2003-09-07 | 2012-07-03 | Microsoft Corporation | Trick mode elementary stream and receiver system |
US7852919B2 (en) * | 2003-09-07 | 2010-12-14 | Microsoft Corporation | Field start code for entry point frames with predicted first field |
US7839930B2 (en) * | 2003-11-13 | 2010-11-23 | Microsoft Corporation | Signaling valid entry points in a video stream |
US7609762B2 (en) * | 2003-09-07 | 2009-10-27 | Microsoft Corporation | Signaling for entry point frames with predicted first field |
US7924921B2 (en) | 2003-09-07 | 2011-04-12 | Microsoft Corporation | Signaling coding and display options in entry point headers |
US20050060420A1 (en) * | 2003-09-11 | 2005-03-17 | Kovacevic Branko D. | System for decoding multimedia data and method thereof |
JP3675464B2 (en) * | 2003-10-29 | 2005-07-27 | ソニー株式会社 | Moving picture coding apparatus and moving picture coding control method |
US9715898B2 (en) * | 2003-12-16 | 2017-07-25 | Core Wireless Licensing S.A.R.L. | Method and device for compressed-domain video editing |
US7391809B2 (en) * | 2003-12-30 | 2008-06-24 | Microsoft Corporation | Scalable video transcoding |
CN1713727B (en) * | 2004-06-14 | 2010-11-10 | 松下电器产业株式会社 | Method and device for editing data stream |
JP4174728B2 (en) * | 2004-08-25 | 2008-11-05 | ソニー株式会社 | Information processing apparatus, information processing method, recording medium, and program |
JP4221667B2 (en) * | 2004-08-25 | 2009-02-12 | ソニー株式会社 | Information processing apparatus, information processing method, recording medium, and program |
JP4743119B2 (en) * | 2004-08-25 | 2011-08-10 | ソニー株式会社 | Information processing apparatus, information processing method, recording medium, and program |
BRPI0613969A2 (en) * | 2005-07-28 | 2011-02-22 | Thompson Licensing | method and apparatus for transmitting multiple video streams over one video channel |
JP4528694B2 (en) * | 2005-08-12 | 2010-08-18 | 株式会社東芝 | Video encoding device |
JP4492484B2 (en) * | 2005-08-22 | 2010-06-30 | ソニー株式会社 | Information processing apparatus, information processing method, recording medium, and program |
JP4791129B2 (en) * | 2005-10-03 | 2011-10-12 | ルネサスエレクトロニクス株式会社 | Image coding apparatus, image coding method, and image editing apparatus |
US20070116117A1 (en) * | 2005-11-18 | 2007-05-24 | Apple Computer, Inc. | Controlling buffer states in video compression coding to enable editing and distributed encoding |
JP4828925B2 (en) * | 2005-11-30 | 2011-11-30 | パナソニック株式会社 | Encoder |
JP4932242B2 (en) * | 2005-12-13 | 2012-05-16 | 三菱電機株式会社 | Stream switching device and stream switching method |
JP4207072B2 (en) | 2006-04-07 | 2009-01-14 | ソニー株式会社 | Information processing apparatus, information processing method, recording medium, and program |
JP4229149B2 (en) * | 2006-07-13 | 2009-02-25 | ソニー株式会社 | Video signal processing device, video signal processing method, video signal encoding device, video signal encoding method, and program |
JP2008066851A (en) * | 2006-09-05 | 2008-03-21 | Sony Corp | Unit and method for information processing, recording medium, and program |
JP4221676B2 (en) | 2006-09-05 | 2009-02-12 | ソニー株式会社 | Information processing apparatus, information processing method, recording medium, and program |
JP4369948B2 (en) * | 2006-09-20 | 2009-11-25 | シャープ株式会社 | Image display apparatus and method, image processing apparatus and method |
JP4303743B2 (en) * | 2006-10-04 | 2009-07-29 | シャープ株式会社 | Image display apparatus and method, image processing apparatus and method |
JP4241839B2 (en) | 2007-02-02 | 2009-03-18 | ソニー株式会社 | Data and file system information recording apparatus and recording method |
JP2009077105A (en) * | 2007-09-20 | 2009-04-09 | Sony Corp | Editing device, editing method, program, and recording medium |
US20090083811A1 (en) * | 2007-09-26 | 2009-03-26 | Verivue, Inc. | Unicast Delivery of Multimedia Content |
US8457958B2 (en) | 2007-11-09 | 2013-06-04 | Microsoft Corporation | Audio transcoder using encoder-generated side information to transcode to target bit-rate |
US8432804B2 (en) * | 2007-11-29 | 2013-04-30 | Hewlett-Packard Development Company, L.P. | Transmitting video streams |
US8543667B2 (en) | 2008-01-14 | 2013-09-24 | Akamai Technologies, Inc. | Policy-based content insertion |
US8335262B2 (en) * | 2008-01-16 | 2012-12-18 | Verivue, Inc. | Dynamic rate adjustment to splice compressed video streams |
WO2009151789A2 (en) * | 2008-04-17 | 2009-12-17 | Sony Corporation | Dual-type of playback for multimedia content |
DE102008002005A1 (en) * | 2008-05-27 | 2009-12-03 | Robert Bosch Gmbh | Eccentric planetary drive |
JP2010004142A (en) * | 2008-06-18 | 2010-01-07 | Hitachi Kokusai Electric Inc | Moving picture encoder, decoder, encoding method, and decoding method |
US20090327334A1 (en) * | 2008-06-30 | 2009-12-31 | Rodriguez Arturo A | Generating Measures of Video Sequences to Detect Unauthorized Use |
US8259177B2 (en) * | 2008-06-30 | 2012-09-04 | Cisco Technology, Inc. | Video fingerprint systems and methods |
US8347408B2 (en) * | 2008-06-30 | 2013-01-01 | Cisco Technology, Inc. | Matching of unknown video content to protected video content |
US8904426B2 (en) * | 2008-06-30 | 2014-12-02 | Rgb Networks, Inc. | Preconditioning ad content for digital program insertion |
CN102150432A (en) * | 2008-09-17 | 2011-08-10 | 夏普株式会社 | Scalable video stream decoding apparatus and scalable video stream generating apparatus |
US8743906B2 (en) * | 2009-01-23 | 2014-06-03 | Akamai Technologies, Inc. | Scalable seamless digital video stream splicing |
US8396114B2 (en) * | 2009-01-29 | 2013-03-12 | Microsoft Corporation | Multiple bit rate video encoding using variable bit rate and dynamic resolution for adaptive video streaming |
US8311115B2 (en) * | 2009-01-29 | 2012-11-13 | Microsoft Corporation | Video encoding using previously calculated motion information |
WO2010093430A1 (en) * | 2009-02-11 | 2010-08-19 | Packetvideo Corp. | System and method for frame interpolation for a compressed video bitstream |
US9906757B2 (en) * | 2009-02-26 | 2018-02-27 | Akamai Technologies, Inc. | Deterministically skewing synchronized events for content streams |
US9565397B2 (en) * | 2009-02-26 | 2017-02-07 | Akamai Technologies, Inc. | Deterministically skewing transmission of content streams |
US8650602B2 (en) * | 2009-02-27 | 2014-02-11 | Akamai Technologies, Inc. | Input queued content switching using a playlist |
JP5152402B2 (en) * | 2009-02-27 | 2013-02-27 | 富士通株式会社 | Moving picture coding apparatus, moving picture coding method, and moving picture coding computer program |
US8270473B2 (en) * | 2009-06-12 | 2012-09-18 | Microsoft Corporation | Motion based dynamic resolution multiple bit rate video encoding |
US8724710B2 (en) * | 2010-02-24 | 2014-05-13 | Thomson Licensing | Method and apparatus for video encoding with hypothetical reference decoder compliant bit allocation |
JP2011211691A (en) * | 2010-03-11 | 2011-10-20 | Sony Corp | Information processing apparatus, information processing method and program |
CN102194502A (en) * | 2010-03-11 | 2011-09-21 | 索尼公司 | Information processing apparatus, information processing method and program |
US8705616B2 (en) | 2010-06-11 | 2014-04-22 | Microsoft Corporation | Parallel multiple bitrate video encoding to reduce latency and dependences between groups of pictures |
WO2011160113A2 (en) | 2010-06-18 | 2011-12-22 | Akamai Technologies, Inc. | Extending a content delivery network (cdn) into a mobile or wireline network |
JP6056122B2 (en) * | 2011-01-24 | 2017-01-11 | ソニー株式会社 | Image encoding apparatus, image decoding apparatus, method and program thereof |
EP2547062B1 (en) * | 2011-07-14 | 2016-03-16 | Nxp B.V. | Media streaming with adaptation |
US9591318B2 (en) * | 2011-09-16 | 2017-03-07 | Microsoft Technology Licensing, Llc | Multi-layer encoding and decoding |
US11089343B2 (en) | 2012-01-11 | 2021-08-10 | Microsoft Technology Licensing, Llc | Capability advertisement, configuration and control for video coding and decoding |
JP5891975B2 (en) * | 2012-07-02 | 2016-03-23 | 富士通株式会社 | Moving picture encoding apparatus, moving picture decoding apparatus, moving picture encoding method, and moving picture decoding method |
ITMI20131710A1 (en) * | 2013-10-15 | 2015-04-16 | Sky Italia S R L | "ENCODING CLOUD SYSTEM" |
EP3185564A1 (en) * | 2015-12-22 | 2017-06-28 | Harmonic Inc. | Video stream splicing of groups of pictures (gop) |
CN105657547B (en) * | 2015-12-31 | 2019-05-10 | 北京奇艺世纪科技有限公司 | A kind of detection method and device of similar video and pirate video |
US11936712B1 (en) * | 2023-04-06 | 2024-03-19 | Synamedia Limited | Packet-accurate targeted content substitution |
CN118674618A (en) * | 2024-08-21 | 2024-09-20 | 苏州东方克洛托光电技术有限公司 | Method for realizing rapid splicing of aerial images by using image transmission video coding information |
Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH06253331A (en) | 1993-03-01 | 1994-09-09 | Toshiba Corp | Editing device corresponding to variable-length encoded signal |
EP0694921A2 (en) | 1994-07-22 | 1996-01-31 | Victor Company Of Japan, Ltd. | Video data editor |
WO1996017492A2 (en) | 1994-12-02 | 1996-06-06 | Philips Electronics N.V. | Encoder system level buffer management |
JPH08149408A (en) | 1994-11-17 | 1996-06-07 | Matsushita Electric Ind Co Ltd | Digital animation editing method and device therefor |
EP0742674A2 (en) | 1995-05-08 | 1996-11-13 | Kabushiki Kaisha Toshiba | Video encoding method and system using a rate-quantizer model |
US5602592A (en) | 1994-01-18 | 1997-02-11 | Matsushita Electric Industrial Co., Ltd. | Moving picture compressed signal changeover apparatus |
WO1997008898A1 (en) | 1995-08-31 | 1997-03-06 | British Broadcasting Corporation | Switching between bit-rate reduced signals |
JPH10112840A (en) | 1996-10-07 | 1998-04-28 | Sony Corp | Editing device |
US5982436A (en) | 1997-03-28 | 1999-11-09 | Philips Electronics North America Corp. | Method for seamless splicing in a video encoder |
US6025878A (en) | 1994-10-11 | 2000-02-15 | Hitachi America Ltd. | Method and apparatus for decoding both high and standard definition video signals using a single video decoder |
US6137834A (en) | 1996-05-29 | 2000-10-24 | Sarnoff Corporation | Method and apparatus for splicing compressed information streams |
US6529555B1 (en) * | 1999-08-26 | 2003-03-04 | Sony United Kingdom Limited | Signal processor |
US6611624B1 (en) * | 1998-03-13 | 2003-08-26 | Cisco Systems, Inc. | System and method for frame accurate splicing of compressed bitstreams |
US6760377B1 (en) * | 1999-04-16 | 2004-07-06 | Sony United Kingdom Limited | Signal processing |
US6983015B1 (en) * | 1999-08-26 | 2006-01-03 | Sony United Kingdom Limited | Signal processor |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69422960T2 (en) * | 1993-12-01 | 2000-06-15 | Matsushita Electric Industrial Co., Ltd. | Method and device for editing or mixing compressed images |
JP3493872B2 (en) * | 1996-02-29 | 2004-02-03 | ソニー株式会社 | Image data processing method and apparatus |
CA2208950A1 (en) * | 1996-07-03 | 1998-01-03 | Xuemin Chen | Rate control for stereoscopic digital video encoding |
US6151443A (en) * | 1997-05-16 | 2000-11-21 | Indigita Corporation | Digital video and data recorder |
US6101195A (en) * | 1997-05-28 | 2000-08-08 | Sarnoff Corporation | Timing correction method and apparatus |
US6298088B1 (en) * | 1997-05-28 | 2001-10-02 | Sarnoff Corporation | Method and apparatus for splicing compressed information signals |
GB2327548B (en) * | 1997-07-18 | 2002-05-01 | British Broadcasting Corp | Switching compressed video bitstreams |
KR100555164B1 (en) * | 1997-07-25 | 2006-03-03 | 소니 가부시끼 가이샤 | Editing device, editing method, re-encoding device, re-encoding method, splicing device, and splicing method |
-
1998
- 1998-07-27 KR KR1019997002513A patent/KR100555164B1/en not_active IP Right Cessation
- 1998-07-27 JP JP50967099A patent/JP3736808B2/en not_active Expired - Lifetime
- 1998-07-27 CN CNB98801159XA patent/CN1161989C/en not_active Expired - Lifetime
- 1998-07-27 EP EP20040076010 patent/EP1467563A1/en not_active Ceased
- 1998-07-27 WO PCT/JP1998/003332 patent/WO1999005864A1/en active IP Right Grant
- 1998-07-27 EP EP19980933940 patent/EP0923243B1/en not_active Expired - Lifetime
- 1998-07-27 DE DE69841897T patent/DE69841897D1/en not_active Expired - Lifetime
- 1998-07-27 EP EP20040076022 patent/EP1445773A1/en not_active Ceased
- 1998-07-27 KR KR1020057018094A patent/KR100604631B1/en not_active IP Right Cessation
-
1999
- 1999-03-25 US US09/275,999 patent/US6567471B1/en not_active Expired - Lifetime
-
2002
- 2002-10-29 US US10/282,784 patent/US7139316B2/en not_active Expired - Lifetime
-
2005
- 2005-05-16 JP JP2005143131A patent/JP4088799B2/en not_active Expired - Lifetime
- 2005-05-16 JP JP2005143129A patent/JP4045553B2/en not_active Expired - Lifetime
- 2005-05-16 JP JP2005143132A patent/JP4088800B2/en not_active Expired - Lifetime
- 2005-05-16 JP JP2005143130A patent/JP2005295587A/en active Pending
-
2006
- 2006-10-25 US US11/586,245 patent/US7711051B2/en not_active Expired - Fee Related
- 2006-11-01 US US11/591,063 patent/US8923409B2/en not_active Expired - Fee Related
- 2006-11-01 US US11/591,073 patent/US8798143B2/en not_active Expired - Fee Related
- 2006-12-19 US US11/642,369 patent/US8223847B2/en not_active Expired - Fee Related
Patent Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH06253331A (en) | 1993-03-01 | 1994-09-09 | Toshiba Corp | Editing device corresponding to variable-length encoded signal |
US5602592A (en) | 1994-01-18 | 1997-02-11 | Matsushita Electric Industrial Co., Ltd. | Moving picture compressed signal changeover apparatus |
EP0694921A2 (en) | 1994-07-22 | 1996-01-31 | Victor Company Of Japan, Ltd. | Video data editor |
JPH0837640A (en) | 1994-07-22 | 1996-02-06 | Victor Co Of Japan Ltd | Image data editing device |
US6025878A (en) | 1994-10-11 | 2000-02-15 | Hitachi America Ltd. | Method and apparatus for decoding both high and standard definition video signals using a single video decoder |
JPH08149408A (en) | 1994-11-17 | 1996-06-07 | Matsushita Electric Ind Co Ltd | Digital animation editing method and device therefor |
WO1996017492A2 (en) | 1994-12-02 | 1996-06-06 | Philips Electronics N.V. | Encoder system level buffer management |
EP0742674A2 (en) | 1995-05-08 | 1996-11-13 | Kabushiki Kaisha Toshiba | Video encoding method and system using a rate-quantizer model |
WO1997008898A1 (en) | 1995-08-31 | 1997-03-06 | British Broadcasting Corporation | Switching between bit-rate reduced signals |
US6137834A (en) | 1996-05-29 | 2000-10-24 | Sarnoff Corporation | Method and apparatus for splicing compressed information streams |
JPH10112840A (en) | 1996-10-07 | 1998-04-28 | Sony Corp | Editing device |
US5982436A (en) | 1997-03-28 | 1999-11-09 | Philips Electronics North America Corp. | Method for seamless splicing in a video encoder |
US6611624B1 (en) * | 1998-03-13 | 2003-08-26 | Cisco Systems, Inc. | System and method for frame accurate splicing of compressed bitstreams |
US6760377B1 (en) * | 1999-04-16 | 2004-07-06 | Sony United Kingdom Limited | Signal processing |
US6529555B1 (en) * | 1999-08-26 | 2003-03-04 | Sony United Kingdom Limited | Signal processor |
US6983015B1 (en) * | 1999-08-26 | 2006-01-03 | Sony United Kingdom Limited | Signal processor |
Non-Patent Citations (1)
Title |
---|
Wee S J et al: "Splicing MPEG Video Streams in the Compressed Domain" IEEE Workshop on Multimedia Signal Processing. Proceedings of Signal Processing Society Workshop on Multimedia Signal Processing, XX, XX Jun. 23, 1997, pp. 225-230, XP000957700. |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7305040B1 (en) * | 1998-01-19 | 2007-12-04 | Sony Corporation | Edit system, edit control device, and edit control method |
US20040179553A1 (en) * | 2001-04-20 | 2004-09-16 | Marcus Wiklund | Method and apparatus for localizing data |
US20040247188A1 (en) * | 2003-06-03 | 2004-12-09 | Ajay Luthra | Method for restructuring a group of pictures to provide for random access into the group of pictures |
US8175154B2 (en) * | 2003-06-03 | 2012-05-08 | General Instrument Corporation | Method for restructuring a group of pictures to provide for random access into the group of pictures |
US8411759B2 (en) | 2004-06-02 | 2013-04-02 | Panasonic Corporation | Multiplexing apparatus and demultiplexing apparatus |
US20090180758A1 (en) * | 2004-06-02 | 2009-07-16 | Tadamasa Toma | Multiplexing apparatus and demultiplexing apparatus |
US20100046638A1 (en) * | 2004-06-02 | 2010-02-25 | Tadamasa Toma | Multiplexing apparatus and demultiplexing apparatus |
US20060007958A1 (en) * | 2004-07-12 | 2006-01-12 | Samsung Electronics Co., Ltd. | Multiplexing method and apparatus to generate transport stream |
US8155186B2 (en) * | 2004-08-11 | 2012-04-10 | Hitachi, Ltd. | Bit stream recording medium, video encoder, and video decoder |
US20080317125A1 (en) * | 2004-08-11 | 2008-12-25 | Tomokazu Murakami | Bit Stream Recording Medium, Video Encoder, and Video Decoder |
US7613819B2 (en) * | 2004-08-24 | 2009-11-03 | Canon Kabushiki Kaisha | Image reproduction apparatus, control method thereof, program and storage medium |
US20060044163A1 (en) * | 2004-08-24 | 2006-03-02 | Canon Kabushiki Kaisha | Image reproduction apparatus, control method thereof, program and storage medium |
US20070286244A1 (en) * | 2006-06-13 | 2007-12-13 | Sony Corporation | Information processing apparatus and information processing method |
US20100104022A1 (en) * | 2008-10-24 | 2010-04-29 | Chanchal Chatterjee | Method and apparatus for video processing using macroblock mode refinement |
WO2010057027A1 (en) * | 2008-11-14 | 2010-05-20 | Transvideo, Inc. | Method and apparatus for splicing in a compressed video bitstream |
US20100128779A1 (en) * | 2008-11-14 | 2010-05-27 | Chanchal Chatterjee | Method and apparatus for splicing in a compressed video bitstream |
Also Published As
Publication number | Publication date |
---|---|
EP0923243A1 (en) | 1999-06-16 |
KR100555164B1 (en) | 2006-03-03 |
KR100604631B1 (en) | 2006-07-28 |
JP3736808B2 (en) | 2006-01-18 |
US8798143B2 (en) | 2014-08-05 |
JP2005253116A (en) | 2005-09-15 |
EP1445773A1 (en) | 2004-08-11 |
DE69841897D1 (en) | 2010-10-28 |
JP4088799B2 (en) | 2008-05-21 |
US8923409B2 (en) | 2014-12-30 |
JP2005295587A (en) | 2005-10-20 |
WO1999005864A1 (en) | 1999-02-04 |
CN1236522A (en) | 1999-11-24 |
US20070047661A1 (en) | 2007-03-01 |
US20070047662A1 (en) | 2007-03-01 |
KR20000068626A (en) | 2000-11-25 |
JP4088800B2 (en) | 2008-05-21 |
EP0923243A4 (en) | 2002-12-04 |
EP0923243B1 (en) | 2010-09-15 |
US20070165715A1 (en) | 2007-07-19 |
US20030067989A1 (en) | 2003-04-10 |
CN1161989C (en) | 2004-08-11 |
US7711051B2 (en) | 2010-05-04 |
EP1467563A1 (en) | 2004-10-13 |
US8223847B2 (en) | 2012-07-17 |
KR20050103248A (en) | 2005-10-27 |
US6567471B1 (en) | 2003-05-20 |
US20070058729A1 (en) | 2007-03-15 |
JP4045553B2 (en) | 2008-02-13 |
JP2005328548A (en) | 2005-11-24 |
JP2005323386A (en) | 2005-11-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7139316B2 (en) | System method and apparatus for seamlessly splicing data | |
US7046910B2 (en) | Methods and apparatus for transcoding progressive I-slice refreshed MPEG data streams to enable trick play mode features on a television appliance | |
EP1005232B1 (en) | Rate control for an MPEG transcoder without a priori knowledge of picture type | |
EP1145558B1 (en) | System for editing compressed image sequences | |
US6980594B2 (en) | Generation of MPEG slow motion playout | |
JP4769717B2 (en) | Image decoding method | |
JPH09238347A (en) | Image data processing method and device therefor | |
US7636482B1 (en) | Efficient use of keyframes in video compression | |
US20070154185A1 (en) | Method and system for transcoding video information to enable digital video recording (DVR) trick modes | |
JPH08251582A (en) | Encoded data editing device | |
WO2004112397A1 (en) | Image processing device, image processing method, information processing device, information processing method, information recording device, information recording method, information reproduction device, information reproduction method, recording medium, and program | |
Arachchi et al. | An intelligent rate control algorithm to improve the video quality at scene transitions for off-line MPEG-1/2 encoders | |
JPH09238353A (en) | Image coding method and device, image transmission method, and image recording medium | |
GB2353654A (en) | Processing GOPs to be stored as all I-frames | |
JP2000295567A (en) | Coded data editor | |
GB2353652A (en) | Video coding employing a predetermined percentage of stuffing bits |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
FEPP | Fee payment procedure |
Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
FPAY | Fee payment |
Year of fee payment: 4 |
|
FPAY | Fee payment |
Year of fee payment: 8 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 12TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1553) Year of fee payment: 12 |