WO1999033274A1 - Scalable predictive coding method and apparatus - Google Patents

Publication number
WO1999033274A1
Authority
WO
WIPO (PCT)
Prior art keywords
layer
signal
enhancement
enhancement layer
base
Prior art date
Application number
PCT/US1998/026984
Other languages
French (fr)
Inventor
Kenneth Rose
Original Assignee
Kenneth Rose
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kenneth Rose
Priority to AU19289/99A (AU1928999A)
Publication of WO1999033274A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/39 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability involving multiple description coding [MDC], i.e. with separate layers being structured as independently decodable descriptions of input picture data

Definitions

  • This invention pertains generally to data compression methods and systems, and more particularly to an efficient scalable predictive coding method and system where most or all of the information available to the enhancement-layer is exploited to improve the quality of the prediction.
  • Many applications require data, such as video, to be simultaneously decodable at a variety of rates. Examples include applications involving broadcast over differing channels, multicast in a complex network where the channels/links dictate the feasible bit rate for each user, the co-existence of receivers of different complexity (and cost), and time-varying channels.
  • An associated compression technique is "scalable" if it offers a variety of decoding rates using the same basic algorithm, and where the lower rate information streams are embedded in the higher rate bit-streams in a manner that minimizes redundancy.
  • a predictive coding system for encoding and decoding a signal without scalability is well-known in the literature of signal compression. (See for example: predictive vector quantization [6], and motion-compensated predictive transform coding of video [3]).
  • the encoder includes a decoder and memory so that what is actually encoded is the difference between the input signal and a predicted version of the reproduced signal, this difference signal being called the residual.
  • the decoder contains a prediction loop whereby the current residual frame is decoded and then it is added to a prediction of the current frame obtained from the previous reproduced frame. In some cases, the predictor uses several prior frames to predict the current frame.
  • FIG. 1 depicts a two-layer scalable coding system 10 where it is assumed that the original input signal (e.g., an audio or video signal) is segmented into frames that are sequentially encoded. Typical examples are video frames, and speech frames, but "frame" here will also cover the degenerate case of a single sample as in differential pulse coded modulation (DPCM).
  • frame refers either to a group of contiguous samples of an original input signal or a set of parameters extracted from the original group of samples (such as a set of transform coefficients obtained by a discrete-cosine transform (DCT) operation on the original group of samples) and in each case the terminology "frame" or "signal" will be used to refer to this entity that is representative of the original group of samples or is itself the original group of samples.
  • the input frame 12, $x(n)$, is compressed by the base encoder (BE) 14 which produces the base bit-stream 16.
  • the enhancement-layer encoder (EE) 18 has access to the input frame 12 and to any information produced by or available to BE 14. EE 18 uses this data to generate the enhancement-layer bit-stream 20.
  • a base decoder (BD) 22 receives the base bit-stream 16 and produces a reconstruction 24, $\hat{x}_b(n)$, while the enhancement-layer decoder (ED) 26 has access to both bit-streams and produces an enhanced reconstruction 28, $\hat{x}_e(n)$.
  • the reconstruction frames that are available at the decoder are used to predict or estimate the current frame.
  • ED 26 has access to both bit streams and hence it effectively has access to both the reconstruction frame at the base layer, $\hat{x}_b(n)$, and the previous reconstructed frame at the enhancement layer, $\hat{x}_e(n-1)$.
  • BD 22 has access only to the previous reconstructed frame at the base layer, $\hat{x}_b(n-1)$, which is stored in the memory within BD.
  • an enhancement layer decoder may have access to the reconstruction frames from lower enhancement layers as well as from the base layer.
  • the prediction loop (internal to the operation of BD as in any predictive coding system but not shown in the figure) in this configuration causes severe difficulties in the design of scalable coding. Accordingly, a number of approaches to scalable coding have been developed. These include,
  • a deficiency of this approach is that no advantage is taken of the potentially superior prediction due to the availability of $\hat{x}_e(n-1)$ at the ED 26.
  • a deficiency of this approach is that, while the approach takes advantage of information available only to the enhancement-layer, it does not exploit the knowledge of $\hat{r}_b(n)$ which is also available at the enhancement-layer.
  • the two layers are, in fact, separately encoded except for savings on overhead information which need not be repeated (such as motion vectors in video coding) [2].
  • the encoders BE 14 and EE 18 use the same prediction [3], and the options are:
  • the present invention addresses the prediction loop deficiencies in conventional scalable coding methods and systems in a way that achieves efficient scalability of predictive coding.
  • the approach is generally applicable and may, in particular, be applied to standard video and audio compression.
  • most or all of the information available at an enhancement-layer may be exploited to improve the quality of the prediction.
  • the current frame is predicted at the enhancement-layer by processing and combining the reconstructed signal representing: (i) the current base-layer (or lower layers) frame; and (ii) the previous enhancement-layer frame.
  • the combining rule takes into account the compressed prediction error of the base-layer, and the parameters used for its compression.
  • the invention optionally includes a special enhancement-layer synchronization mode for the case where the communication rate for a given receiver is time varying (e.g., in mobile communications). This mode may be applied periodically to allow the receiver to upgrade to enhancement-layer performance even though it does not have prior enhancement-layer reconstructed frames.
  • An object of the invention is to achieve efficient scalability of predictive coding.
  • Another object of the invention is to provide a method and system for scalable predictive coding that is applicable to typical or standard video and audio compression.
  • Another object of the invention is to provide a scalable predictive coding method and system in which all or most of the information available at an enhancement-layer is exploited to improve the quality of the prediction.
  • FIG. 1 is a functional block diagram of a conventional two-layer scalable predictive coding system.
  • FIG. 2 is a functional block diagram of an enhancement layer encoder of a scalable predictive coding system in accordance with the present invention.
  • FIG. 3 is a functional block diagram of a base layer reconstruction module according to the present invention.
  • FIG. 4 is a functional block diagram of an enhancement layer reconstruction module according to the present invention.
  • FIG. 5 is a functional block diagram of a three-layer scalable encoder employing the enhancement encoder of the present invention.
  • FIG. 6 is a functional block diagram of a three-layer scalable decoder corresponding to the encoder shown in FIG. 5.
  • FIG. 7 is a functional block diagram of a two-layer scalable video encoder employing the enhancement encoder of the present invention.
  • FIG. 8 is a functional block diagram of a two-layer decoder corresponding to the encoder shown in FIG. 7.
  • FIG. 9 is a functional block diagram of the spatial motion compensator blocks shown in FIG. 7 and FIG. 8.
  • the method of the present invention generally comprises upgrading the prediction used at each enhancement-layer by combining, with minimal conflict, the information provided from both sources, namely, information available at, and used by, the base-layer (or lower layers), and information that is available only at the enhancement-layer.
  • the prediction at an enhancement layer may combine information provided from all lower enhancement layers as well.
  • the invention provides for prediction or estimation of the signal frame itself in any representation, or any subset of signal representation coefficients such as transform coefficients (e.g., in video, audio), line spectral frequencies (e.g., in speech or audio), etc.
  • an enhancement layer estimator (ELE) 102 computes a new predicted frame 104, $\tilde{x}_e(n)$, by combining information from the reconstruction frame 106 at the base layer, $\hat{x}_b(n)$, and from the previous reconstructed frame 108 at the enhancement layer, $\hat{x}_e(n-1)$. Note that first order prediction is described for notational simplicity but several previous frames may be used.
  • the combining rule depends on any or all of, but not limited to, the following parameters: the compression parameters 110 of the base layer (such as quantization step and threshold, and the quantized base-layer residual 112, $\hat{r}_b(n)$ (see FIG. 3)), and the statistical parameters 114 of the time evolution of the frames (such as inter-frame correlation coefficients and variance).
  • the statistical parameters may be either estimated off-line from training data, or estimated on-line by an adaptive estimator which tracks variation in the signal statistics based on either the original signal (in which case the parameters need to be transmitted to the decoder) or based on reconstructed signals which are available to the receiver.
  • the exact definition of the combination rule depends on the level of complexity allowed for the module.
  • the enhancement layer residual 116, $r_e(n)$, which is the difference between the input frame 118, $x(n)$, and the predicted frame 104, $\tilde{x}_e(n)$, is then compressed by a compressor 120 to produce the enhancement bits 122.
  • FIG. 3 shows a base layer reconstruction module 124 which receives the quantized base layer residual 112, $\hat{r}_b(n)$, and adds it to the base predicted frame 126, $\tilde{x}_b(n)$, to produce the base layer reconstruction frame 106, $\hat{x}_b(n)$.
  • a delay 128 produces a delayed base reconstructed frame 130, $\hat{x}_b(n-1)$, which is input to the base predictor 132 which computes the base predicted frame 126, $\tilde{x}_b(n)$, which is needed to produce the reconstructed frame as explained above.
  • the enhancement layer reconstruction module 134 shown in FIG. 4 receives the quantized enhancement layer residual 136, $\hat{r}_e(n)$, and adds it to the enhancement layer predicted frame 104, $\tilde{x}_e(n)$, to produce the enhancement layer reconstruction frame 138, $\hat{x}_e(n)$.
  • a delay 140 produces a delayed enhancement layer reconstructed frame 108, $\hat{x}_e(n-1)$, which is input to the enhancement layer estimator 102, which in turn computes the enhancement layer predicted frame 104, $\tilde{x}_e(n)$, as explained with reference to FIG. 2.
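The data flow of FIG. 2 through FIG. 4 may be sketched for a single scalar coefficient as follows. This is a minimal illustration only: the `BaseInfo` container, the uniform rounding quantizer, and the clipping rule used as the estimator are all assumptions introduced here, and the clipping rule is merely a stand-in for whatever combining rule an implementation selects.

```python
from dataclasses import dataclass


@dataclass
class BaseInfo:
    """What the base layer exposes to the enhancement layer (hypothetical names)."""
    predicted: float      # base predicted frame
    residual_hat: float   # quantized base-layer residual
    step: float           # compression parameter: quantizer step size

    @property
    def reconstruction(self):
        # Base layer reconstruction module: prediction plus quantized residual.
        return self.predicted + self.residual_hat


def quantize(value, step):
    """Assumed compressor: uniform scalar quantizer, round to nearest step."""
    return step * round(value / step)


def clip_estimator(base, prev_enh_recon):
    """Illustrative combining rule (an assumption, not the disclosed rule):
    clip the previous enhancement reconstruction to the base quantizer cell."""
    low = base.reconstruction - base.step / 2
    high = base.reconstruction + base.step / 2
    return min(max(prev_enh_recon, low), high)


def enhancement_encode_frame(x, base, prev_enh_recon, enh_step,
                             estimator=clip_estimator):
    predicted = estimator(base, prev_enh_recon)        # estimator -> predicted frame
    residual_hat = quantize(x - predicted, enh_step)   # compressor output
    reconstruction = predicted + residual_hat          # enhancement reconstruction
    return residual_hat, reconstruction


base = BaseInfo(predicted=1.0, residual_hat=0.0, step=0.5)
x, enh_step = 1.2, 0.1
residual_hat, reconstruction = enhancement_encode_frame(
    x, base, prev_enh_recon=1.18, enh_step=enh_step)
```

The estimator is passed in as a parameter so that a different combining rule can replace the clipping rule without touching the encoder or the reconstruction step.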
  • FIG. 5 shows how the modules described in FIG. 2 through FIG. 4 may be combined to obtain a complete scalable predictive encoder.
  • the signal frame to be compressed (which may be the original raw signal, or any set of coefficients extracted from it for the purpose of compression) denoted $x(n)$ is fed to all layers in parallel.
  • the predicted frame ($\tilde{x}_b(n)$ in the base layer, (EL1) $\tilde{x}_e(n)$ in the first enhancement layer, and (EL2) $\tilde{x}_e(n)$ at the second enhancement layer) is subtracted from $x(n)$ to obtain the prediction error (or residual) at the layer ($r_b(n)$, (EL1) $r_e(n)$, and (EL2) $r_e(n)$, for the base, first enhancement and second enhancement layers, respectively).
  • the residual is compressed by the layer's Compressor/Quantizer which outputs: the layer's bits for transmission to the decoder, the reconstructed (quantized) residual ($\hat{r}_b(n)$, (EL1) $\hat{r}_e(n)$, and (EL2) $\hat{r}_e(n)$, for the base, first enhancement and second enhancement layers, respectively), as input to the layer's reconstruction module, and the set of compression parameters for use by a higher layer.
  • the enhancement layer compressor/quantizer subsumes the compressor 120 of FIG. 2 as, besides the bit stream, it also outputs the quantized residual.
  • the reconstruction module of each layer processes its input signals as per FIG. 3 and FIG. 4, and outputs the reconstructed frame for the layer ($\hat{x}_b(n)$, (EL1) $\hat{x}_e(n)$, and (EL2) $\hat{x}_e(n)$, for the base, first enhancement and second enhancement layers, respectively), and the layer's predicted frame ($\tilde{x}_b(n)$, (EL1) $\tilde{x}_e(n)$, and (EL2) $\tilde{x}_e(n)$, for the base, first enhancement and second enhancement layers, respectively).
  • Each layer's inverse compressor/quantizer receives as input the layer's bit stream from which it reproduces the layer's quantized residual. It also extracts the layer's compression parameters for use by a higher layer reconstruction module. The rest of the diagram is identical to the encoder of FIG. 2 and similarly produces the reconstructed frame at each layer.
  • the invention is generally applicable to predictive coding and, in particular, may be applied to known vector quantizer-based compression techniques, and known transform-based techniques. Further, it is applicable to compression of speech, audio, and video signals.
  • a combining rule employing optimal estimation for scalable compression is described next as an implementation example of the invention. In typical predictive coding, a number of signal representation coefficients
  • a specific low complexity implementation of the invention consists of optimally combining the information available for predicting the coefficient at an enhancement-layer.
  • the reconstructed coefficient at the base-layer, $\hat{x}_b(n)$, and the quantization interval (or partition region in the case of vector quantization) of the corresponding reconstructed residual $\hat{r}_b(n)$ determine an interval/cell $I(n)$ within which the original coefficient $x(n)$ must lie.
  • From the corresponding reconstructed coefficient at the previous enhancement-layer frame, $\hat{x}_e(n-1)$, and a statistical model on time evolution of the coefficients, one may construct a probability density function for $x(n)$ conditional on $\hat{x}_e(n-1)$, denoted by $p[x(n) \mid \hat{x}_e(n-1)]$.
  • This predictor incorporates the information provided by the base-layer (interval within which $x(n)$ lies), and by the enhancement-layer (probability distribution of $x(n)$ conditional on $\hat{x}_e(n-1)$).
  • Referring to FIG. 7 and FIG. 8, a system for scalable predictive transform coding which is designed for the compression of video signals is shown.
  • the system uses motion compensation for basic frame prediction, applies the discrete cosine transform (DCT) to the prediction error (residual), and quantizes the transform coefficients one at a time.
  • A block diagram of a two-layer scalable video encoder is shown in FIG. 7, and the corresponding decoder is shown in FIG. 8.
  • FIG. 9 shows a functional block diagram corresponding to the spatial motion compensator blocks shown in the base layer and the enhancement layer.
  • the symbols $x$, $r$, $\hat{x}$, $\hat{r}$, $\tilde{x}$ for the video and residual signals at the base and enhancement layers in the diagram are in the transform domain, even though motion compensation is performed in the spatial domain (FIG. 9).
  • additional enhancement layers may be added where an enhancement layer k builds on and relates to layer k-1 below it exactly as shown for the first two enhancement layers.
  • the correlation coefficient $\rho$ is assumed to be nearly one.
  • x(n) has a Laplacian density
  • the driving process, z(n) is zero-mean, white, stationary, and has the density
  • let $[a,b]$ be the quantization interval, hence $\hat{r}_b(n) \in [a,b]$.
  • the information the base layer provides on $x(n)$ is captured in the statement: $x(n) \in [\tilde{x}_b(n) + a,\ \tilde{x}_b(n) + b]$.
  • the prediction exploits the information available from both layers.
  • the optimal predictor is given therefore by the expectation:
  • the integral may be evaluated analytically, and its closed-form solution, given explicitly in terms of the integral limits and the model parameters, is normally used for simple implementation.
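As an illustration of such a closed form, the sketch below computes the mean of a Laplacian density restricted to the base-layer interval. It rests on stated assumptions: the correlation coefficient is taken as exactly one, and the conditional density of $x(n)$ is assumed to be a plain Laplacian centered on the previous enhancement reconstruction with a hypothetical `scale` parameter. The density actually used in the embodiment is not reproduced in this text, so this is a simplified stand-in rather than the disclosed estimator.

```python
import math


def _right_piece(l, h, c, s):
    """Mass and first moment of exp(-(x - c)/s) on [l, h], where c <= l <= h."""
    el, eh = math.exp(-(l - c) / s), math.exp(-(h - c) / s)
    return s * (el - eh), s * ((l + s) * el - (h + s) * eh)


def _left_piece(l, h, c, s):
    """Mass and first moment of exp((x - c)/s) on [l, h], where l <= h <= c."""
    el, eh = math.exp((l - c) / s), math.exp((h - c) / s)
    return s * (eh - el), s * ((h - s) * eh - (l - s) * el)


def truncated_laplacian_mean(low, high, center, scale):
    """E[x | low <= x <= high] for a Laplacian centered at `center`.

    The interval is split at the center so that each piece is a pure
    exponential whose mass and first moment have elementary antiderivatives.
    """
    mass = moment = 0.0
    if low < center:
        m, f = _left_piece(low, min(high, center), center, scale)
        mass, moment = mass + m, moment + f
    if high > center:
        m, f = _right_piece(max(low, center), high, center, scale)
        mass, moment = mass + m, moment + f
    return moment / mass


# Base layer says x lies in [0.75, 1.25]; vary the previous enhancement value.
centered = truncated_laplacian_mean(0.75, 1.25, center=1.0, scale=0.1)
above = truncated_laplacian_mean(0.75, 1.25, center=2.0, scale=0.1)
below = truncated_laplacian_mean(0.75, 1.25, center=0.0, scale=0.1)
```

When the previous enhancement value falls outside the interval, the estimate moves toward the nearer endpoint but stays inside the interval, so the base-layer constraint is never violated. For a center very far from the interval relative to `scale`, the exponentials can underflow and the division would fail; a practical version would need to guard against that case.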
  • This embodiment of the invention is of low complexity, uses standard video compression for its base layer, and provides substantial performance gains which build up and increase with the number of layers implemented. Its absence in all leading standards in spite of its gains and low complexity strongly suggests that the invention is not obvious to the leading researchers and developers in the field of video compression.
  • the scalable predictive coding method of the invention is repeatedly applicable to further layers of enhancement in a straightforward manner. For example, at layer k we combine signal information from the current reconstructed frame at layer k-1 , and from the previous reconstruction frame at layer k.
  • a higher complexity version allows for the combining rule to take into account data from all lower layers. In the special implementation described, information from all lower layers contributes to restricting the final interval within which the coefficient must lie.
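The narrowing of the interval by successive layers may be sketched as an intersection of per-layer quantizer cells. The uniform rounding quantizer and the helper names below are assumptions made for illustration.

```python
def quantizer_cell(predicted, residual_hat, step):
    """Interval that must contain x(n), given one layer's predicted value and
    its quantized residual (assumed quantizer: round to the nearest step)."""
    center = predicted + residual_hat
    return center - step / 2, center + step / 2


def intersect(cells):
    """Intersection of the intervals contributed by all lower layers."""
    low = max(cell[0] for cell in cells)
    high = min(cell[1] for cell in cells)
    if low > high:
        raise ValueError("layers are inconsistent: empty intersection")
    return low, high


base_cell = quantizer_cell(predicted=1.0, residual_hat=0.0, step=0.5)
first_enh_cell = quantizer_cell(predicted=1.1, residual_hat=0.1, step=0.2)
final_cell = intersect([base_cell, first_enh_cell])
```

In this example the base layer alone allows values between 0.75 and 1.25, and the first enhancement layer raises the lower bound to about 1.1, so a second enhancement layer would estimate within the narrower interval.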
  • Another higher complexity version uses higher order prediction (based on multiple past frames).
  • Another application of the invention pertains to time-varying channels, such as mobile communications, and most common network communications.
  • when the receiver experiences an improvement in channel conditions, it attempts to decode higher enhancement bits and improve the quality of the reconstruction.
  • however, it cannot compute the enhancement layer prediction, as past enhancement layer reconstruction frames were not decoded and are not available.
  • the present invention includes a solution to this problem, which comprises periodically (e.g., once per fixed number of frames) constraining the enhancement encoder to exclusively use lower layer information for the prediction.
  • This periodic constrained prediction synchronizes the enhancement decoder with the enhancement encoder and allows the receiver to decode the enhancement-layer signals.
  • the frequency of application of this constrained mode may be different for each layer and may be optimized for the time-varying channel statistics. The trade-off is between some temporary degradation in prediction (when the prediction is constrained) and the receiver's capability to upgrade to enhancement layer performance as the channel conditions improve.
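The constrained mode may be sketched as follows for a scalar signal. The sketch assumes a fixed `sync_period`, a uniform rounding quantizer, a base predictor that reuses the previous base reconstruction, and a clipping rule as the unconstrained enhancement prediction. All of these, and every function name, are illustrative assumptions.

```python
import math


def quantize(value, step):
    return step * round(value / step)


def is_sync_frame(n, sync_period):
    return n % sync_period == 0


def enhancement_prediction(n, base_recon, prev_enh_recon, sync_period, base_step):
    if is_sync_frame(n, sync_period):
        return base_recon                        # constrained: lower-layer data only
    low, high = base_recon - base_step / 2, base_recon + base_step / 2
    return min(max(prev_enh_recon, low), high)   # assumed combining rule


def encode(signal, base_step, enh_step, sync_period):
    xb = xe = 0.0
    stream = []
    for n, x in enumerate(signal):
        rb_hat = quantize(x - xb, base_step)
        xb = xb + rb_hat
        pred = enhancement_prediction(n, xb, xe, sync_period, base_step)
        re_hat = quantize(x - pred, enh_step)
        xe = pred + re_hat
        stream.append((rb_hat, re_hat))
    return stream


def decode(stream, upgrade_at, base_step, sync_period):
    """Receiver that always gets base bits and gets enhancement bits only from
    frame `upgrade_at` onward (for example after the channel improves)."""
    xb, xe = 0.0, None
    output = []
    for n, (rb_hat, re_hat) in enumerate(stream):
        xb = xb + rb_hat
        if n >= upgrade_at and (xe is not None or is_sync_frame(n, sync_period)):
            pred = enhancement_prediction(n, xb, xe, sync_period, base_step)
            xe = pred + re_hat
            output.append(xe)
        else:
            output.append(xb)   # not yet synchronized: base-layer quality
    return output


base_step, enh_step, sync_period = 0.5, 0.1, 4
signal = [math.sin(0.3 * n) for n in range(12)]
stream = encode(signal, base_step, enh_step, sync_period)
full = decode(stream, 0, base_step, sync_period)   # enhancement from the start
late = decode(stream, 5, base_step, sync_period)   # upgrades at frame 5
```

Here the late receiver outputs base-layer quality for frames 5 through 7 and matches the always-on receiver from frame 8, the first constrained frame after its upgrade. A shorter `sync_period` shortens that wait at the cost of more frequently constrained predictions.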
  • this invention provides for efficient scalability of predictive coding that is applicable to standard video and audio compression.
  • the invention uses most or all of the information available at an enhancement-layer to improve the quality of the prediction.
  • the invention provides for enhancement-layer synchronization to accommodate situations where the communication rate for a given receiver is time varying (e.g., in mobile communications).

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A scalable predictive coder in which the current frame of data is predicted at the enhancement-layer by processing and combining the reconstructed signal at: (i) the current base-layer (or lower layers) frame; and (ii) the previous enhancement-layer frame. The combining rule takes into account the compressed prediction error of the base-layer, and the parameters used for its compression.

Description

SCALABLE PREDICTIVE CODING METHOD AND APPARATUS
BACKGROUND OF THE INVENTION
1. Field of the Invention
This invention pertains generally to data compression methods and systems, and more particularly to an efficient scalable predictive coding method and system where most or all of the information available to the enhancement-layer is exploited to improve the quality of the prediction.
2. Description of the Background Art
The following publications, which are referenced herein using numbers in square brackets (e.g., [1]), are incorporated herein by reference:
[1] D. Wilson and M. Ghanbari, "Transmission of SNR scalable two layer MPEG-2 coded video through ATM networks," Proc. 7th International Workshop on Packet Video, pp. 185-189, Brisbane Australia, March 1996.
[2] B. Girod, U. Horn, and B. Belzer, "Scalable video coding with multiscale motion compensation and unequal error protection," In Y. Wang, S. Panwar, S.-P. Kim, and H. L. Bertoni, editors, Multimedia Communications and Video Coding, pp. 475-482, New York: Plenum Press, 1996.
[3] B. G. Haskell, A. Puri, and A. N. Netravali, Digital video: an introduction to MPEG-2. New York: Chapman and Hall, International Thomson Pub., 1997.
[4] Draft text of H.263, Version 2 (H.263+).
[5] T. K. Tan, K. K. Pang, and K. N. Ngan, "A frequency scalable coding scheme employing pyramid and subband techniques," IEEE Transactions on Circuits and Systems for Video Technology, pp. 203-207, April 1994.
[6] A. Gersho and R. M. Gray, Vector Quantization and Signal Compression. Kluwer Academic Press, 1992.
Many applications require data, such as video, to be simultaneously decodable at a variety of rates. Examples include applications involving broadcast over differing channels, multicast in a complex network where the channels/links dictate the feasible bit rate for each user, the co-existence of receivers of different complexity (and cost), and time-varying channels. An associated compression technique is "scalable" if it offers a variety of decoding rates using the same basic algorithm, and where the lower rate information streams are embedded in the higher rate bit-streams in a manner that minimizes redundancy.
A predictive coding system for encoding and decoding a signal without scalability is well-known in the literature of signal compression. (See for example: predictive vector quantization [6], and motion-compensated predictive transform coding of video [3]). In such predictive coding systems the encoder includes a decoder and memory so that what is actually encoded is the difference between the input signal and a predicted version of the reproduced signal, this difference signal being called the residual. The decoder contains a prediction loop whereby the current residual frame is decoded and then it is added to a prediction of the current frame obtained from the previous reproduced frame. In some cases, the predictor uses several prior frames to predict the current frame.
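The closed prediction loop described above can be illustrated with a minimal single-sample (DPCM-style) sketch. The uniform quantizer, the scaling factor `rho` in the predictor, and all function names are assumptions chosen for illustration; they are not taken from the cited references.

```python
def quantize(value, step):
    """Uniform scalar quantizer: round to the nearest multiple of `step`."""
    return step * round(value / step)


def predict(prev_reconstruction, rho=0.95):
    """First-order predictor P[.]; the factor rho is an assumed choice."""
    return rho * prev_reconstruction


def encode(signal, step):
    """Encoder with an embedded decoder, so both sides share the same memory."""
    residuals = []
    prev_recon = 0.0                             # decoder memory, mirrored here
    for x in signal:
        prediction = predict(prev_recon)
        r_hat = quantize(x - prediction, step)   # quantized residual that is sent
        residuals.append(r_hat)
        prev_recon = prediction + r_hat          # what the decoder will reproduce
    return residuals


def decode(residuals):
    """Prediction loop: add each decoded residual to the predicted sample."""
    recon = []
    prev_recon = 0.0
    for r_hat in residuals:
        prev_recon = predict(prev_recon) + r_hat
        recon.append(prev_recon)
    return recon


signal = [1.0, 1.2, 1.5, 1.4, 1.1, 0.7]
step = 0.1
decoded = decode(encode(signal, step))
max_error = max(abs(x - y) for x, y in zip(signal, decoded))
```

Because the encoder predicts from the reproduced signal rather than from the original, quantization error does not accumulate: each reconstructed sample differs from the input by at most half a quantizer step.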
A major difficulty encountered in scalable predictive coding is how to take advantage of the additional information, available to the enhancement-layer decoder for improved prediction, without causing undesired conflicts with the information obtained from the base layer. FIG. 1 depicts a two-layer scalable coding system 10 where it is assumed that the original input signal (e.g., an audio or video signal) is segmented into frames that are sequentially encoded. Typical examples are video frames, and speech frames, but "frame" here will also cover the degenerate case of a single sample as in differential pulse coded modulation (DPCM). The term "frame" as used herein refers either to a group of contiguous samples of an original input signal or a set of parameters extracted from the original group of samples (such as a set of transform coefficients obtained by a discrete-cosine transform (DCT) operation on the original group of samples) and in each case the terminology "frame" or "signal" will be used to refer to this entity that is representative of the original group of samples or is itself the original group of samples.
The input frame 12, $x(n)$, is compressed by the base encoder (BE) 14 which produces the base bit-stream 16. The enhancement-layer encoder (EE) 18 has access to the input frame 12 and to any information produced by or available to BE 14. EE 18 uses this data to generate the enhancement-layer bit-stream 20. A base decoder (BD) 22 receives the base bit-stream 16 and produces a reconstruction 24, $\hat{x}_b(n)$, while the enhancement-layer decoder (ED) 26 has access to both bit-streams and produces an enhanced reconstruction 28, $\hat{x}_e(n)$. The reconstruction frames that are available at the decoder are used to predict or estimate the current frame. Note that ED 26 has access to both bit streams and hence it effectively has access to both the reconstruction frame at the base layer, $\hat{x}_b(n)$, and the previous reconstructed frame at the enhancement layer, $\hat{x}_e(n-1)$, while BD 22 has access only to the previous reconstructed frame at the base layer, $\hat{x}_b(n-1)$, which is stored in the memory within BD. In the case of a scalable coding system with multiple enhancement layers, an enhancement layer decoder may have access to the reconstruction frames from lower enhancement layers as well as from the base layer. The prediction loop (internal to the operation of BD as in any predictive coding system but not shown in the figure) in this configuration causes severe difficulties in the design of scalable coding. Accordingly, a number of approaches to scalable coding have been developed. These include:
(1) The standard approach: At the base layer, BE 14 compresses the residual $r_b(n) = x(n) - P[\hat{x}_b(n-1)]$, where $P$ denotes the predictor (e.g., motion compensator in the case of video coding). Note that for notational simplicity we assume first-order prediction, but in general several previous frames may be used. BD 22 produces the reconstruction $\hat{x}_b(n) = P[\hat{x}_b(n-1)] + \hat{r}_b(n)$, where $\hat{r}_b(n)$ is the compressed-reconstructed residual. At the enhancement-layer, EE 18 compresses the base layer's reconstruction error $r_e^{(1)}(n) = x(n) - \hat{x}_b(n) = x(n) - P[\hat{x}_b(n-1)] - \hat{r}_b(n)$. The enhancement-layer reconstruction is $\hat{x}_e(n) = \hat{x}_b(n) + \hat{r}_e^{(1)}(n) = P[\hat{x}_b(n-1)] + \hat{r}_b(n) + \hat{r}_e^{(1)}(n)$. See, e.g., [1]. A deficiency of this approach is that no advantage is taken of the potentially superior prediction due to the availability of $\hat{x}_e(n-1)$ at the ED 26.
(2) The separate coding approach: BE 14 compresses $r_b(n)$ as above, but EE 18 compresses the "enhancement-only" prediction error $r_e^{(2)}(n) = x(n) - P[\hat{x}_e(n-1)]$ directly. The enhancement-layer reconstruction is $\hat{x}_e(n) = P[\hat{x}_e(n-1)] + \hat{r}_e^{(2)}(n)$. A deficiency of this approach is that, while the approach takes advantage of information available only to the enhancement-layer, it does not exploit the knowledge of $\hat{r}_b(n)$ which is also available at the enhancement-layer. The two layers are, in fact, separately encoded except for savings on overhead information which need not be repeated (such as motion vectors in video coding) [2].
(3) Layer-specific prediction at the decoder approach: BD 22 reconstructs the frame as $\hat{x}_b(n) = P[\hat{x}_b(n-1)] + \hat{r}_b(n)$, and ED 26 reconstructs as $\hat{x}_e(n) = P[\hat{x}_e(n-1)] + \hat{r}_b(n) + \hat{r}_e(n)$. However, the encoders BE 14 and EE 18 use the same prediction [3], and the options are:
(a) Both encoders use base-layer prediction $P[\hat{x}_b(n-1)]$. This results in drift of the enhancement-layer decoder. (The term "drift" refers to a form of mismatch where the decoder uses a different prediction than the one assumed by the encoder. This mismatch tends to grow as the "corrections" provided by the encoder are misguiding, hence, the decoder "drifts away").
(b) Both encoders use enhancement-layer prediction $P[\hat{x}_e(n-1)]$. This results in drift of the base-layer decoder.
(4) Switch between approaches (1) and (2) on a per frame or per block basis [4], or per sample [5]. This approach has the deficiencies of either approach (1) or (2) as described above, at each time depending on the switching decision.
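The behavior of approaches (1), (2), and 3(a) may be compared with a toy simulation on a scalar signal. The sketch below is illustrative only: it assumes a uniform rounding quantizer, a predictor that simply reuses the previous reconstruction, and arbitrarily chosen step sizes, and none of its names come from the cited references.

```python
def quantize(value, step):
    """Uniform scalar quantizer (an assumed stand-in for any compressor)."""
    return step * round(value / step)


def P(prev):
    """Assumed predictor: reuse the previous reconstruction unchanged."""
    return prev


def simulate(signal, base_step=0.5, enh_step=0.1):
    xb = xe2 = xe3 = 0.0   # decoder memories, all starting from zero
    errors = {"approach_1": [], "approach_2": [], "approach_3a": []}
    for x in signal:
        # Base layer, shared by all approaches.
        pred_b = P(xb)
        rb_hat = quantize(x - pred_b, base_step)
        xb_new = pred_b + rb_hat

        # (1) Standard: the enhancement layer codes the base reconstruction error.
        xe1 = xb_new + quantize(x - xb_new, enh_step)

        # (2) Separate coding: the enhancement layer predicts from its own past.
        pred_e = P(xe2)
        xe2 = pred_e + quantize(x - pred_e, enh_step)

        # (3a) The encoder forms the residual with the base-layer prediction,
        # but the enhancement decoder adds it to P[xe(n-1)], so they disagree.
        re_hat = quantize(x - pred_b - rb_hat, enh_step)
        xe3 = P(xe3) + rb_hat + re_hat

        xb = xb_new
        for name, value in (("approach_1", xe1),
                            ("approach_2", xe2),
                            ("approach_3a", xe3)):
            errors[name].append(abs(x - value))
    return errors


errors = simulate([1.2] * 6)
```

With these assumed settings, approaches (1) and (2) stay within half an enhancement quantizer step of the input, while the enhancement-layer error of option 3(a) starts at zero and keeps growing from frame to frame, which is the drift described above.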
BRIEF SUMMARY OF THE INVENTION
The present invention addresses the prediction loop deficiencies in conventional scalable coding methods and systems in a way that achieves efficient scalability of predictive coding. The approach is generally applicable and may, in particular, be applied to standard video and audio compression. In the present invention, most or all of the information available at an enhancement-layer may be exploited to improve the quality of the prediction.
By way of example, and not of limitation, in the present invention the current frame is predicted at the enhancement-layer by processing and combining the reconstructed signal representing: (i) the current base-layer (or lower layers) frame; and (ii) the previous enhancement-layer frame. The combining rule takes into account the compressed prediction error of the base-layer, and the parameters used for its compression. The main difficulty overcome by this invention is in the apparent conflicts between these two sources of information and their impact as described in the Background of the Invention. This difficulty may explain why existing known methods exclusively use one of these information sources at any given time. These methods will be generally referred to here as switching techniques (which include as a special case the exclusive use of one of the information sources at all times). Additionally, the invention optionally includes a special enhancement-layer synchronization mode for the case where the communication rate for a given receiver is time varying (e.g., in mobile communications). This mode may be applied periodically to allow the receiver to upgrade to enhancement-layer performance even though it does not have prior enhancement-layer reconstructed frames.
An object of the invention is to achieve efficient scalability of predictive coding.
Another object of the invention is to provide a method and system for scalable predictive coding that is applicable to typical or standard video and audio compression.
Another object of the invention is to provide a scalable predictive coding method and system in which all or most of the information available at an enhancement-layer is exploited to improve the quality of the prediction.
Further objects and advantages of the invention will be brought out in the following portions of the specification, wherein the detailed description is for the purpose of fully disclosing preferred embodiments of the invention without placing limitations thereon.
BRIEF DESCRIPTION OF THE DRAWINGS The invention will be more fully understood by reference to the following drawings which are for illustrative purposes only:
FIG. 1 is a functional block diagram of a conventional two-layer scalable predictive coding system.
FIG. 2 is a functional block diagram of an enhancement layer encoder of a scalable predictive coding system in accordance with the present invention.
FIG. 3 is a functional block diagram of a base layer reconstruction module according to the present invention. FIG. 4 is a functional block diagram of an enhancement layer reconstruction module according to the present invention.
FIG. 5 is a functional block diagram of a three-layer scalable encoder employing the enhancement encoder of the present invention. FIG. 6 is a functional block diagram of a three-layer scalable decoder corresponding to the encoder shown in FIG. 5.
FIG. 7 is a functional block diagram of a two-layer scalable video encoder employing the enhancement encoder of the present invention.
FIG. 8 is a functional block diagram of a two-layer decoder corresponding to the encoder shown in FIG. 7.
FIG. 9 is a functional block diagram of the spatial motion compensator blocks shown in FIG. 7 and FIG. 8.
DETAILED DESCRIPTION OF THE INVENTION Referring more specifically to the drawings, where like reference numbers, labels and symbols denote like parts, for illustrative purposes the present invention will be described with reference to the encoder generally shown in FIG. 2, as well as the encoding system shown in FIG. 2 through FIG. 6, and the scalable predictive coding method described in connection therewith. Various embodiments of encoders and decoders employing the present invention, and details therefor, are shown and described in FIG. 7 through FIG. 9.
The method of the present invention generally comprises upgrading the prediction used at each enhancement-layer by combining, with minimal conflict, the information provided from both sources, namely, information available at, and used by, the base-layer (or lower layers), and information that is available only at the enhancement-layer. In the case of a scalable predictive coding system with multiple enhancement layers, the prediction at an enhancement layer may combine information provided from all lower enhancement layers as well. The invention provides for prediction or estimation of the signal frame itself in any representation, or any subset of signal representation coefficients such as transform coefficients (e.g., in video, audio), line spectral frequencies (e.g., in speech or audio), etc. The term "frame" and the corresponding mathematical notation will be used generally to refer to the relevant set of frame coefficients being estimated or predicted by the method in each particular application.
Referring first to FIG. 2, a functional block diagram of an enhancement layer encoder of a scalable predictive coding system in accordance with the present invention is shown. In the enhancement layer encoder 100 of the present invention, an enhancement layer estimator (ELE) 102 computes a new predicted frame 104, $\tilde{x}_e(n)$, by combining information from the reconstruction frame 106 at the base layer, $\hat{x}_b(n)$, and from the previous reconstructed frame 108 at the enhancement layer, $\hat{x}_e(n-1)$. Note that first order prediction is described for notational simplicity but several previous frames may be used. The combining rule depends on any or all of, but not limited to, the following parameters: the compression parameters 110 of the base layer (such as quantization step and threshold, and the quantized base-layer residual 112, $\hat{r}_b(n)$ (see FIG. 3)), and the statistical parameters 114 of the time evolution of the frames (such as inter-frame correlation coefficients and variance). The statistical parameters may be either estimated off-line from training data, or estimated on-line by an adaptive estimator which tracks variation in the signal statistics based on either the original signal (in which case the parameters need to be transmitted to the decoder) or based on reconstructed signals which are available to the receiver. The exact definition of the combination rule depends on the level of complexity allowed for the module. At the high end, one may compute a possibly complex, optimal predicted frame given all the available information. The enhancement layer residual 116, $r_e(n)$, which is the difference between the input frame 118, $x(n)$, and the predicted frame 104, $\tilde{x}_e(n)$, is then compressed by a compressor 120 to produce the enhancement bits 122. Referring to FIG. 3 through FIG. 6, a complete scalable predictive coding system for use with this invention is shown.
While only three layers are shown, it will be appreciated that additional layers can be added and are contemplated within the scope of the invention. FIG. 3 shows a base layer reconstruction module 124 which receives the quantized base layer residual 112, $\hat{r}_b(n)$, and adds it to the base predicted frame 126, $\tilde{x}_b(n)$, to produce the base layer reconstruction frame 106, $\hat{x}_b(n)$. A delay 128 produces a delayed base reconstructed frame 130, $\hat{x}_b(n-1)$, which is input to the base predictor 132, which computes the base predicted frame 126, $\tilde{x}_b(n)$, which is needed to produce the reconstructed frame as explained above.
The enhancement layer reconstruction module 134 shown in FIG. 4 receives the quantized enhancement layer residual 136, $\hat{r}_e(n)$, and adds it to the enhancement layer predicted frame 104, $\tilde{x}_e(n)$, to produce the enhancement layer reconstruction frame 138, $\hat{x}_e(n)$. A delay 140 produces a delayed enhancement layer reconstructed frame 108, $\hat{x}_e(n-1)$, which is input to the enhancement layer estimator 102, which in turn computes the enhancement layer predicted frame 104, $\tilde{x}_e(n)$, as explained with reference to FIG. 2. FIG. 5 shows how the modules described in FIG. 2 through FIG. 4 may be combined to obtain a complete scalable predictive encoder. Only three layers are shown without implying any limitation, as the extension to further layers is obvious and straightforward. Most inputs and outputs were explained in the context of the previous figures, and to distinguish between the notation for the first and second enhancement layer signals, the prefix EL1 or EL2 was added, respectively.
The signal frame to be compressed (which may be the original raw signal, or any set of coefficients extracted from it for the purpose of compression), denoted $x(n)$, is fed to all layers in parallel. In each layer the predicted frame ($\tilde{x}_b(n)$ in the base layer, (EL1) $\tilde{x}_e(n)$ in the first enhancement layer, and (EL2) $\tilde{x}_e(n)$ in the second enhancement layer) is subtracted from $x(n)$ to obtain the prediction error (or residual) at the layer ($r_b(n)$, (EL1) $r_e(n)$, and (EL2) $r_e(n)$, for the base, first enhancement and second enhancement layers, respectively). The residual is compressed by the layer's Compressor/Quantizer, which outputs: the layer's bits for transmission to the decoder; the reconstructed (quantized) residual ($\hat{r}_b(n)$, (EL1) $\hat{r}_e(n)$, and (EL2) $\hat{r}_e(n)$, for the base, first enhancement and second enhancement layers, respectively), as input to the layer's reconstruction module; and the set of compression parameters for use by a higher layer. Note that the enhancement layer compressor/quantizer subsumes the compressor 120 of FIG. 2 as, besides the bit stream, it also outputs the quantized residual. The reconstruction module of each layer processes its input signals as per FIG. 3 and FIG. 4, and outputs the reconstructed frame for the layer ($\hat{x}_b(n)$, (EL1) $\hat{x}_e(n)$, and (EL2) $\hat{x}_e(n)$, for the base, first enhancement and second enhancement layers, respectively), and the layer's predicted frame ($\tilde{x}_b(n)$, (EL1) $\tilde{x}_e(n)$, and (EL2) $\tilde{x}_e(n)$, for the base, first enhancement and second enhancement layers, respectively).
The corresponding three-layer scalable predictive decoder is shown in FIG. 6. Each layer's inverse compressor/quantizer receives as input the layer's bit stream, from which it reproduces the layer's quantized residual. It also extracts the layer's compression parameters for use by a higher layer reconstruction module. The rest of the diagram is identical to the encoder of FIG. 5 and similarly produces the reconstructed frame at each layer.
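The encoder and decoder structure described above can be illustrated with a minimal two-layer sketch operating on a single scalar coefficient per frame. Everything specific in it is an assumption made for illustration and is not taken from the specification: the uniform scalar quantizer, the base predictor that simply reuses the previous base reconstruction (no motion compensation), the function names, and in particular the placeholder combining rule in `estimate_enhancement`, which merely clips the previous enhancement reconstruction to the interval implied by the base layer.

```python
import math

def quantize(residual, step):
    """Uniform scalar quantizer (assumed): index, reconstructed residual,
    and the quantization interval [a, b] that contains the residual."""
    index = math.floor(residual / step + 0.5)
    recon = index * step
    return index, recon, (recon - step / 2, recon + step / 2)

def dequantize(index, step):
    """Inverse quantizer: reconstructed residual and its interval."""
    recon = index * step
    return recon, (recon - step / 2, recon + step / 2)

def estimate_enhancement(pred_b, interval, prev_recon_e):
    """Placeholder combining rule (an assumption, not the optimal estimator):
    clip the previous enhancement reconstruction to the interval
    [pred_b + a, pred_b + b] in which the base layer says x(n) lies."""
    lo, hi = pred_b + interval[0], pred_b + interval[1]
    return min(max(prev_recon_e, lo), hi)

def encode(frames, step_b, step_e):
    """Produce one (base index, enhancement index) pair per frame."""
    recon_b = recon_e = 0.0
    bits = []
    for x in frames:
        pred_b = recon_b                      # base predictor (assumed identity)
        idx_b, res_b, interval = quantize(x - pred_b, step_b)
        recon_b = pred_b + res_b              # base reconstruction module
        pred_e = estimate_enhancement(pred_b, interval, recon_e)
        idx_e, res_e, _ = quantize(x - pred_e, step_e)
        recon_e = pred_e + res_e              # enhancement reconstruction module
        bits.append((idx_b, idx_e))
    return bits

def decode(bits, step_b, step_e):
    """Mirror the encoder's prediction loops using only transmitted indices."""
    recon_b = recon_e = 0.0
    out = []
    for idx_b, idx_e in bits:
        pred_b = recon_b
        res_b, interval = dequantize(idx_b, step_b)
        recon_b = pred_b + res_b
        pred_e = estimate_enhancement(pred_b, interval, recon_e)
        res_e, _ = dequantize(idx_e, step_e)
        recon_e = pred_e + res_e
        out.append((recon_b, recon_e))
    return out
```

Because the decoder repeats exactly the same prediction steps from the same quantized quantities, its reconstructions track the encoder's without drift. A receiver that only gets the base indices can run the base loop alone.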
It will be appreciated that the invention is generally applicable to predictive coding and, in particular, may be applied to known vector quantizer-based compression techniques, and known transform-based techniques. Further, it is applicable to compression of speech, audio, and video signals. A combining rule employing optimal estimation for scalable compression is described next as an implementation example of the invention.
In typical predictive coding, a number of signal representation coefficients (e.g., vectors of transform coefficients, line spectral frequencies, or vectors of raw signal samples) are extracted per frame and quantized independently. A specific low complexity implementation of the invention consists of optimally combining the information available for predicting the coefficient at an enhancement-layer. The reconstructed coefficient at the base-layer, $\hat{x}_b(n)$, and the quantization interval (or partition region in the case of vector quantization) of the corresponding reconstructed residual $\hat{r}_b(n)$, determine an interval/cell $I(n)$ within which the original coefficient $x(n)$ must lie. From the corresponding reconstructed coefficient at the previous enhancement-layer frame, $\hat{x}_e(n-1)$, and a statistical model on time evolution of the coefficients, one may construct a probability density function for $x(n)$ conditional on $\hat{x}_e(n-1)$, denoted by $p[x(n) \mid \hat{x}_e(n-1)]$. The optimal estimate of $x(n)$ is obtained by expectation:
$$\tilde{x}_e(n) = \frac{\int_{I(n)} x \, p[x(n) \mid \hat{x}_e(n-1)] \, dx}{\int_{I(n)} p[x(n) \mid \hat{x}_e(n-1)] \, dx}.$$
This predictor incorporates the information provided by the base-layer (interval within which $x(n)$ lies), and by the enhancement-layer (probability distribution of $x(n)$ conditional on $\hat{x}_e(n-1)$).
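As a rough numerical illustration of this estimator, the conditional expectation over the interval can be approximated with a simple midpoint rule. The sketch below is only an assumption-laden example: the function names, the step count, and the choice of a Laplacian density centred on the previous enhancement reconstruction are hypothetical and stand in for whatever statistical model an implementation actually adopts.

```python
import math

def conditional_mean(pdf, lo, hi, steps=10000):
    """Midpoint-rule estimate of E[x | lo <= x <= hi] under the density pdf.
    The normalisation of pdf cancels in the ratio."""
    width = (hi - lo) / steps
    num = den = 0.0
    for k in range(steps):
        x = lo + (k + 0.5) * width
        w = pdf(x)
        num += x * w
        den += w
    # Fall back to the interval midpoint if the density underflows to zero.
    return num / den if den > 0 else 0.5 * (lo + hi)

def laplacian_about(center, alpha):
    """Assumed model: Laplacian density centred on the previous
    enhancement-layer reconstruction, with decay parameter alpha."""
    return lambda x: 0.5 * alpha * math.exp(-alpha * abs(x - center))
```

For instance, `conditional_mean(laplacian_about(prev_recon_e, alpha), lo, hi)` would give the predicted coefficient when the base layer confines the coefficient to `[lo, hi]`. When the interval does not contain the previous reconstruction, the estimate is pulled toward the interval edge nearest to it.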
Referring now to FIG. 7 and FIG. 8, a system for scalable predictive transform coding which is designed for the compression of video signals is shown. In current practice and standards (e.g., [4]), the system uses motion compensation for basic frame prediction, applies the discrete cosine transform (DCT) to the prediction error (residual), and quantizes the transform coefficients one at a time. A block diagram of a two-layer scalable video encoder is shown in FIG. 7, and the corresponding decoder is shown in FIG. 8. FIG. 9 shows a functional block diagram corresponding to the spatial motion compensator blocks shown in the base layer and the enhancement layer.
Note that, for simplicity, the symbols $x$, $r$, $\hat{x}$, $\hat{r}$, $\tilde{x}$ for the video and residual signals at the base and enhancement layers in the diagram are in the transform domain, even though motion compensation is performed in the spatial domain (FIG. 9). Note further that additional enhancement layers may be added, where an enhancement layer k builds on and relates to layer k-1 below it exactly as shown for the first two enhancement layers.
The first-order Laplace-Markov process was chosen for modeling the time evolution statistics of the video signal:
$$x(n) = \rho \, MC[x(n-1)] + z(n),$$
where $x(n)$ is the DCT coefficient in the current frame and $MC[x(n-1)]$ is the corresponding (after motion compensation) coefficient in the previous frame. The correlation coefficient $\rho$ is assumed to be nearly one. As $x(n)$ has a Laplacian density, the driving process, $z(n)$, is zero-mean, white, stationary, and has the density
$$p(z) = \rho^2 \delta(z) + (1-\rho^2) \frac{\alpha}{2} e^{-\alpha |z|}.$$
(Both $\alpha$ and $\rho$ may in practice be estimated "offline" from training data, or via an adaptive estimator that tracks variations in local statistics of the signal.) The base layer performs standard video compression: its predictor consists only of motion compensation, $\tilde{x}_b(n) = MC[\hat{x}_b(n-1)]$, the residual $r_b(n) = x(n) - \tilde{x}_b(n)$ is quantized, and the corresponding index is transmitted. Let $[a,b]$ be the quantization interval, hence $r_b(n) \in [a,b]$. Thus the information the base layer provides on $x(n)$ is captured in the statement: $x(n) \in [\tilde{x}_b(n)+a, \tilde{x}_b(n)+b]$. At the enhancement layer, the prediction exploits the information available from both layers. The optimal predictor is given therefore by the expectation:
$$\tilde{x}_e(n) = E\{x(n) \mid \hat{x}_e(n-1),\ x(n) \in [\tilde{x}_b(n)+a,\ \tilde{x}_b(n)+b]\},$$
which is conveniently rewritten as
$$\tilde{x}_e(n) = \bar{x}_e(n-1) + E\{z(n) \mid z(n) \in I_z(n)\},$$
where $\bar{x}_e(n-1) = MC[\hat{x}_e(n-1)]$ and the expectation interval is
$$I_z(n) = [\tilde{x}_b(n) + a - \bar{x}_e(n-1),\ \tilde{x}_b(n) + b - \bar{x}_e(n-1)].$$
This prediction is directly implemented using the model for $p(z)$ given above:
$$E\{z(n) \mid z(n) \in I_z(n)\} = \frac{\int_{I_z(n)} z \, p(z) \, dz}{\int_{I_z(n)} p(z) \, dz}.$$
The integral may be analytically evaluated, and its closed-form solution, given explicitly in terms of the integral limits and the parameters $\alpha$, $\rho$, is normally used for simple implementation.
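One way the closed-form evaluation might look is sketched below. It assumes the innovation density is the mixture of a point mass of weight ρ² at zero and a Laplacian of weight 1 − ρ² with parameter α, and it uses the antiderivative −½(|z| + 1/α)e^{−α|z|} of z·(α/2)e^{−α|z|}. The function names, the argument names, and the fallback used when the interval carries no numerically representable probability mass are hypothetical choices for illustration, not part of the specification.

```python
import math

def _laplace_cdf(z, alpha):
    """CDF of the Laplacian density (alpha/2) * exp(-alpha * |z|)."""
    if z < 0:
        return 0.5 * math.exp(alpha * z)
    return 1.0 - 0.5 * math.exp(-alpha * z)

def _laplace_first_moment(z, alpha):
    """Antiderivative of z * (alpha/2) * exp(-alpha * |z|)."""
    return -0.5 * (abs(z) + 1.0 / alpha) * math.exp(-alpha * abs(z))

def innovation_mean(lo, hi, alpha, rho):
    """E[z | lo <= z <= hi] for p(z) = rho^2 * delta(z)
    + (1 - rho^2) * (alpha/2) * exp(-alpha * |z|)."""
    w = 1.0 - rho * rho
    mass = w * (_laplace_cdf(hi, alpha) - _laplace_cdf(lo, alpha))
    if lo <= 0.0 <= hi:
        mass += rho * rho          # the point mass at zero lies in the interval
    if mass <= 0.0:
        # Assumed fallback: pick the interval end closest to zero.
        return lo if abs(lo) < abs(hi) else hi
    moment = w * (_laplace_first_moment(hi, alpha) - _laplace_first_moment(lo, alpha))
    return moment / mass           # the point mass adds nothing to the numerator

def predict_enhancement(pred_b, a, b, mc_prev_e, alpha, rho):
    """Enhancement prediction: motion-compensated previous enhancement
    reconstruction plus the conditional mean of the innovation."""
    lo = pred_b + a - mc_prev_e
    hi = pred_b + b - mc_prev_e
    return mc_prev_e + innovation_mean(lo, hi, alpha, rho)
```

When the interval contains zero, the point mass pulls the estimate strongly toward the motion-compensated enhancement reconstruction, which matches the intuition that with ρ near one the coefficient most likely did not change. When the interval excludes zero, the estimate falls inside the interval, biased toward the end nearer to zero.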
This embodiment of the invention is of low complexity, uses standard video compression for its base layer, and provides substantial performance gains which build up and increase with the number of layers implemented. Its absence in all leading standards in spite of its gains and low complexity strongly suggests that the invention is not obvious to the leading researchers and developers in the field of video compression.
The scalable predictive coding method of the invention, although illustrated herein on a two or three-layer scalable system, is repeatedly applicable to further layers of enhancement in a straightforward manner. For example, at layer k we combine signal information from the current reconstructed frame at layer k-1 , and from the previous reconstruction frame at layer k. A higher complexity version allows for the combining rule to take into account data from all lower layers. In the special implementation described, information from all lower layers contributes to restricting the final interval within which the coefficient must lie. Another higher complexity version uses higher order prediction (based on multiple past frames).
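The remark that all lower layers help restrict the final interval can be sketched as a plain interval intersection. The helper names and the representation of each layer's constraint as a `(low, high)` pair are assumptions made for this illustration only.

```python
def layer_interval(pred, a, b):
    """Interval in which the coefficient must lie according to one layer:
    that layer's prediction plus its quantization interval [a, b]."""
    return pred + a, pred + b

def intersect_intervals(intervals):
    """Tightest interval consistent with every lower layer's constraint."""
    lo = max(interval[0] for interval in intervals)
    hi = min(interval[1] for interval in intervals)
    if lo > hi:
        raise ValueError("layer intervals are inconsistent")
    return lo, hi
```

For example, a base-layer constraint of [8, 12] combined with a first-enhancement-layer constraint of [10, 11] leaves [10, 11] as the interval over which a second enhancement layer would take its conditional expectation.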
Another application of the invention pertains to time-varying channels, such as mobile communications, and most common network communications. When the receiver experiences an improvement in channel conditions, it attempts to decode higher enhancement bits and improve the quality of the reconstruction. However, it cannot compute the enhancement layer prediction, as past enhancement layer reconstruction frames were not decoded and are not available. The present invention includes a solution to this problem, which comprises periodically (e.g., once per fixed number of frames) constraining the enhancement encoder to exclusively use lower layer information for the prediction. This periodic constrained prediction synchronizes the enhancement decoder with the enhancement encoder and allows the receiver to decode the enhancement-layer signals. The frequency of application of this constrained mode may be different for each layer and may be optimized for the time-varying channel statistics. The trade-off is between some temporary degradation in prediction (when the prediction is constrained) and the receiver's capability to upgrade to enhancement layer performance as the channel conditions improve.
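A minimal sketch of how such a periodic synchronization schedule might be expressed follows. The fixed period, the convention that frame 0 is a synchronization frame, and all function names are assumptions for illustration. The specification only requires that the constrained mode be applied periodically.

```python
def is_sync_frame(n, period):
    """True when frame n must be predicted from lower-layer information only
    (assumed convention: frames 0, period, 2*period, ... are sync frames)."""
    return n % period == 0

def select_prediction(n, period, lower_layer_pred, combined_pred):
    """Choose the enhancement-layer prediction for frame n."""
    return lower_layer_pred if is_sync_frame(n, period) else combined_pred

def first_enhancement_frame(join_frame, period):
    """First frame at or after join_frame where a receiver with no past
    enhancement reconstructions can start decoding the enhancement layer."""
    return -(-join_frame // period) * period   # ceiling to a multiple of period
```

With a period of 5, a receiver whose channel improves at frame 7 would keep decoding the base layer through frame 9 and switch to enhancement-layer decoding at frame 10. A shorter period shortens that wait at the cost of more frequent constrained predictions.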
Finally, it will be appreciated that the scalability advantages of the invention may be easily combined with known methods for temporal and spatial scalability. Accordingly, it will be seen that this invention provides for efficient scalability of predictive coding that is applicable to standard video and audio compression. The invention uses most or all of the information available at an enhancement-layer to improve the quality of the prediction. In addition, the invention provides for enhancement-layer synchronization to accommodate situations where the communication rate for a given receiver is time varying (e.g., in mobile communications). Although the description above contains many specificities, these should not be construed as limiting the scope of the invention but as merely providing illustrations of some of the presently preferred embodiments of this invention. Thus the scope of this invention should be determined by the appended claims and their legal equivalents.

Claims

CLAIMS What is claimed is:
1. A method for predicting the current frame of data in a digital coding system wherein a signal is segmented into frames of data that are sequentially encoded, said system including a base layer and an enhancement layer, said base layer including a base encoder and a base decoder, said enhancement layer including an enhancement encoder and an enhancement decoder, said base decoder producing a reconstructed signal, said enhancement decoder producing an enhanced reconstructed signal, said method comprising the steps of: predicting the current frame of data at the enhancement-layer by processing and combining the reconstructed data representing the current base layer frame and the reconstructed data representing the previous enhancement layer frame.
2. A method for scalable predictive coding of a signal, comprising the steps of:
(a) encoding data representing said signal with a base layer predictive coding system that provides a first prediction of said signal and information indicative of a decoded base layer approximation to said signal; (b) encoding data representing said signal by a first enhancement layer which performs predictive coding with a second prediction of said signal derived from a combination of information from the base layer and information indicative of the past decoded signal approximation generated in said first enhancement layer.
3. A method as recited in claim 2, wherein the step of encoding said signal data with said enhancement layer comprises the steps of providing to said first enhancement layer compression parameters from the base layer to aid in the computation of said second prediction.
4. A method as recited in claim 2, wherein the step of encoding said signal data with said first enhancement layer comprises the steps of providing to said first enhancement layer time evolution statistics derived either by off-line computation or by computations using quantized parameters of said signal.
5. A method as recited in claim 2, wherein said coding system includes a second enhancement layer and wherein said second enhancement layer performs predictive coding with a third prediction of said signal derived from a combination of information from said first enhancement layer and information indicative of the past decoded signal approximation generated in said second enhancement layer.
6. A method as recited in claim 2, wherein said second prediction at predetermined intervals is derived exclusively from information from the base layer and at all other times is derived by combining information from the base layer and information indicative of the past decoded signal approximation generated in said first enhancement layer.
7. An apparatus for predicting the current frame of data in a digital coding system wherein a signal is segmented into frames of data that are sequentially encoded, said system including a base layer and an enhancement layer, said base layer including a base encoder and a base decoder, said enhancement layer including an enhancement encoder and an enhancement decoder, said base decoder producing a reconstructed signal, said enhancement decoder producing an enhanced reconstructed signal, comprising: means for predicting the current frame of data at the enhancement-layer by processing and combining the reconstructed data representing the current base layer frame and the reconstructed data representing the previous enhancement layer frame.
8. An apparatus for scalable predictive coding of a signal, comprising: (a) means for encoding data representing said signal with a base layer predictive coding system that provides a first prediction of said signal and information indicative of a decoded base layer approximation to said signal;
(b) means for encoding data representing said signal by a first enhancement layer which performs predictive coding with a second prediction of said signal derived from a combination of information from the base layer and information indicative of the past decoded signal approximation generated in said first enhancement layer.
9. An apparatus as recited in claim 8, wherein said means for encoding said signal data with said enhancement layer comprises means for providing to said first enhancement layer compression parameters from the base layer to aid in the computation of said second prediction.
10. An apparatus as recited in claim 8, wherein said means for encoding said signal data with said first enhancement layer comprises means for providing to said first enhancement layer time evolution statistics derived either by off-line computation or by computations using quantized parameters of said signal.
11. An apparatus as recited in claim 8, further comprising a second enhancement layer, wherein said second enhancement layer performs predictive coding with a third prediction of said signal derived from a combination of information from said first enhancement layer and information indicative of the past decoded signal approximation generated in said second enhancement layer.
12. An apparatus as recited in claim 8, wherein said second prediction at predetermined intervals is derived exclusively from information from the base layer and at all other times is derived by combining information from the base layer and information indicative of the past decoded signal approximation generated in said first enhancement layer.
13. A scalable predictive coding system for compressing a signal, comprising at least one enhancement layer and at least one lower layer, wherein prediction in an enhancement layer combines information from a lower layer with information from the enhancement layer.
14. A scalable predictive coding method for compressing a signal in a system comprising at least one enhancement layer and at least one lower layer, the method comprising the steps of performing prediction in an enhancement layer by combining information from a lower layer with information from the enhancement layer.
PCT/US1998/026984 1997-12-19 1998-12-18 Scalable predictive coding method and apparatus WO1999033274A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU19289/99A AU1928999A (en) 1997-12-19 1998-12-18 Scalable predictive coding method and apparatus

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US6833197P 1997-12-19 1997-12-19
US60/068,331 1997-12-19

Publications (1)

Publication Number Publication Date
WO1999033274A1 true WO1999033274A1 (en) 1999-07-01

Family

ID=22081882

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1998/026984 WO1999033274A1 (en) 1997-12-19 1998-12-18 Scalable predictive coding method and apparatus

Country Status (2)

Country Link
AU (1) AU1928999A (en)
WO (1) WO1999033274A1 (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001003441A1 (en) * 1999-07-06 2001-01-11 Koninklijke Philips Electronics N.V. System and method for improved fine granular scalable video using base layer coding information
WO2001062010A1 (en) * 2000-02-15 2001-08-23 Microsoft Corporation System and method with advance predicted bit-plane coding for progressive fine-granularity scalable (pfgs) video coding
WO2001065848A1 (en) * 2000-03-02 2001-09-07 Koninklijke Philips Electronics N.V. System and method for improving video transmission over a wireless network.
WO2002005563A1 (en) * 2000-07-11 2002-01-17 Motorola Inc. Improving spatial quality of coded pictures using layered scalable video bit streams
GB2364841A (en) * 2000-07-11 2002-02-06 Motorola Inc Method and apparatus for video encoding
WO2002043398A1 (en) * 2000-11-23 2002-05-30 Koninklijke Philips Electronics N.V. Video decoding method and corresponding decoder
WO2002043394A2 (en) * 2000-11-23 2002-05-30 Koninklijke Philips Electronics N.V. Video coding method and corresponding encoder
DE10102155A1 (en) * 2001-01-18 2002-08-08 Fraunhofer Ges Forschung Method and device for generating a scalable data stream and method and device for decoding a scalable data stream
DE10102154A1 (en) * 2001-01-18 2002-08-08 Fraunhofer Ges Forschung Method and device for generating a scalable data stream and method and device for decoding a scalable data stream taking into account a bit savings bank function
DE10102159A1 (en) * 2001-01-18 2002-08-08 Fraunhofer Ges Forschung Method and device for generating or decoding a scalable data stream taking into account a bit savings bank, encoder and scalable encoder
US6480547B1 (en) 1999-10-15 2002-11-12 Koninklijke Philips Electronics N.V. System and method for encoding and decoding the residual signal for fine granular scalable video
WO2003036980A1 (en) * 2001-10-26 2003-05-01 Koninklijke Philips Electronics N.V. Spatial scalable compression scheme using spatial sharpness enhancement techniques
US6639943B1 (en) 1999-11-23 2003-10-28 Koninklijke Philips Electronics N.V. Hybrid temporal-SNR fine granular scalability video coding
WO2005094081A1 (en) * 2004-03-09 2005-10-06 Nokia Corporation Method,device and software product for binarization of video data
US6956972B2 (en) 1999-12-03 2005-10-18 Microsoft Corporation System and method for robust video coding using progressive fine-granularity scalable (PFGS) coding
JP2006514455A (en) * 2003-01-30 2006-04-27 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Video encoding
WO2007080491A1 (en) * 2006-01-09 2007-07-19 Nokia Corporation System and apparatus for low-complexity fine granularity scalable video coding with motion compensation
WO2007109938A1 (en) * 2006-03-27 2007-10-04 Huawei Technologies Co., Ltd. A video layered coding method and a coding and decoding device in interlaced mode
WO2008006806A2 (en) * 2006-07-10 2008-01-17 Nokia Siemens Networks Gmbh & Co. Kg Method and encoding device for encoding an image area of an image of an image sequence in at least two quality levels, and method and decoding device for decoding a first encoded data flow and a second encoded data flow
US20080056356A1 (en) * 2006-07-11 2008-03-06 Nokia Corporation Scalable video coding
US7583730B2 (en) 2000-12-15 2009-09-01 Microsoft Corporation Drifting reduction and macroblock-based control in progressive fine granularity scalable video coding
US7643559B2 (en) 2001-09-14 2010-01-05 Ntt Docomo, Inc. Coding method, decoding method, coding apparatus, decoding apparatus, image processing system, coding program, and decoding program
US8437561B2 (en) 1997-12-19 2013-05-07 Wasinoski Procter, Llc Scalable predictive coding method and apparatus

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0634871A2 (en) * 1993-07-13 1995-01-18 AT&T Corp. Scalable encoding and decoding of high-resolution progressive video
EP0644695A2 (en) * 1993-09-21 1995-03-22 AT&T Corp. Spatially scalable video encoding and decoding
EP0771119A2 (en) * 1995-10-27 1997-05-02 Kabushiki Kaisha Toshiba Video encoding and decoding apparatus

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
NAKAMURA M ET AL: "SCALABLE CODING SCHEMES BASED ON DCT AND MC PREDICTION", PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), WASHINGTON, OCT. 23 - 26, 1995, vol. 2, 23 October 1995 (1995-10-23), INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS, pages 575 - 578, XP000624034 *

Cited By (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9654787B2 (en) 1997-12-19 2017-05-16 Callahan Cellular L.L.C. Scalable predictive coding method and apparatus
US8437561B2 (en) 1997-12-19 2013-05-07 Wasinoski Procter, Llc Scalable predictive coding method and apparatus
US6501797B1 (en) 1999-07-06 2002-12-31 Koninklijke Phillips Electronics N.V. System and method for improved fine granular scalable video using base layer coding information
WO2001003441A1 (en) * 1999-07-06 2001-01-11 Koninklijke Philips Electronics N.V. System and method for improved fine granular scalable video using base layer coding information
JP2003533067A (en) * 1999-07-06 2003-11-05 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ System and method for improved definition scalable video by using reference layer coded information
US6480547B1 (en) 1999-10-15 2002-11-12 Koninklijke Philips Electronics N.V. System and method for encoding and decoding the residual signal for fine granular scalable video
CN1303817C (en) * 1999-10-15 2007-03-07 皇家菲利浦电子有限公司 System and method for encoding and decoding residual signal for fine granular scalable video
US6639943B1 (en) 1999-11-23 2003-10-28 Koninklijke Philips Electronics N.V. Hybrid temporal-SNR fine granular scalability video coding
US6956972B2 (en) 1999-12-03 2005-10-18 Microsoft Corporation System and method for robust video coding using progressive fine-granularity scalable (PFGS) coding
US7130473B2 (en) 1999-12-03 2006-10-31 Microsoft Corporation System and method for robust video coding using progressive fine-granularity scalable (PFGS) coding
US7269289B2 (en) 1999-12-03 2007-09-11 Microsoft Corporation System and method for robust video coding using progressive fine-granularity scalable (PFGS) coding
US6700933B1 (en) 2000-02-15 2004-03-02 Microsoft Corporation System and method with advance predicted bit-plane coding for progressive fine-granularity scalable (PFGS) video coding
EP1511324A1 (en) * 2000-02-15 2005-03-02 Microsoft Corporation Predictive bit-plane coding for progressive fine-granularity scalable (PFGS) video coding
WO2001062010A1 (en) * 2000-02-15 2001-08-23 Microsoft Corporation System and method with advance predicted bit-plane coding for progressive fine-granularity scalable (pfgs) video coding
WO2001065848A1 (en) * 2000-03-02 2001-09-07 Koninklijke Philips Electronics N.V. System and method for improving video transmission over a wireless network.
GB2364841B (en) * 2000-07-11 2002-09-11 Motorola Inc Method and apparatus for video encoding
US7844000B2 (en) 2000-07-11 2010-11-30 Motorola, Inc. Method and apparatus for video encoding
GB2364841A (en) * 2000-07-11 2002-02-06 Motorola Inc Method and apparatus for video encoding
GB2364842A (en) * 2000-07-11 2002-02-06 Motorola Inc Method and system for improving video quality
WO2002005563A1 (en) * 2000-07-11 2002-01-17 Motorola Inc. Improving spatial quality of coded pictures using layered scalable video bit streams
KR100895725B1 (en) * 2000-11-23 2009-04-30 엔엑스피 비 브이 Video decoding method and corresponding decoder
WO2002043394A3 (en) * 2000-11-23 2002-08-29 Koninkl Philips Electronics Nv Video coding method and corresponding encoder
US7944966B2 (en) 2000-11-23 2011-05-17 Trident Microsystems (Far East) Ltd. Video decoding method and corresponding decoder
WO2002043394A2 (en) * 2000-11-23 2002-05-30 Koninklijke Philips Electronics N.V. Video coding method and corresponding encoder
WO2002043398A1 (en) * 2000-11-23 2002-05-30 Koninklijke Philips Electronics N.V. Video decoding method and corresponding decoder
US7583730B2 (en) 2000-12-15 2009-09-01 Microsoft Corporation Drifting reduction and macroblock-based control in progressive fine granularity scalable video coding
AU2002249122B2 (en) * 2001-01-18 2005-06-23 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and device for the generation or decoding of a scalable data stream with provision for a bit-store, encoder and scalable encoder
DE10102154C2 (en) * 2001-01-18 2003-02-13 Fraunhofer Ges Forschung Method and device for generating a scalable data stream and method and device for decoding a scalable data stream taking into account a bit savings bank function
DE10102155A1 (en) * 2001-01-18 2002-08-08 Fraunhofer Ges Forschung Method and device for generating a scalable data stream and method and device for decoding a scalable data stream
DE10102154A1 (en) * 2001-01-18 2002-08-08 Fraunhofer Ges Forschung Method and device for generating a scalable data stream and method and device for decoding a scalable data stream taking into account a bit savings bank function
DE10102159A1 (en) * 2001-01-18 2002-08-08 Fraunhofer Ges Forschung Method and device for generating or decoding a scalable data stream taking into account a bit savings bank, encoder and scalable encoder
DE10102159C2 (en) * 2001-01-18 2002-12-12 Fraunhofer Ges Forschung Method and device for generating or decoding a scalable data stream taking into account a bit savings bank, encoder and scalable encoder
DE10102155C2 (en) * 2001-01-18 2003-01-09 Fraunhofer Ges Forschung Method and device for generating a scalable data stream and method and device for decoding a scalable data stream
US7454353B2 (en) 2001-01-18 2008-11-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and device for the generation of a scalable data stream and method and device for decoding a scalable data stream
US7496517B2 (en) 2001-01-18 2009-02-24 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and device for generating a scalable data stream and method and device for decoding a scalable data stream with provision for a bit saving bank function
US7516230B2 (en) 2001-01-18 2009-04-07 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and device for the generation or decoding of a scalable data stream with provision for a bit-store, encoder and scalable encoder
US7643559B2 (en) 2001-09-14 2010-01-05 Ntt Docomo, Inc. Coding method, decoding method, coding apparatus, decoding apparatus, image processing system, coding program, and decoding program
WO2003036980A1 (en) * 2001-10-26 2003-05-01 Koninklijke Philips Electronics N.V. Spatial scalable compression scheme using spatial sharpness enhancement techniques
KR100927967B1 (en) * 2001-10-26 2009-11-24 Koninklijke Philips Electronics N.V. Spatial scalable compression scheme using spatial sharpness enhancement techniques
US8005148B2 (en) 2003-01-30 2011-08-23 Koninklijke Philips Electronics N.V. Video coding
JP2006514455A (en) * 2003-01-30 2006-04-27 Koninklijke Philips Electronics N.V. Video encoding
JP4860156B2 (en) * 2003-01-30 2012-01-25 Koninklijke Philips Electronics N.V. Video encoding
US9036715B2 (en) 2003-01-30 2015-05-19 Koninklijke Philips N.V. Video coding
WO2005094081A1 (en) * 2004-03-09 2005-10-06 Nokia Corporation Method, device and software product for binarization of video data
WO2007080491A1 (en) * 2006-01-09 2007-07-19 Nokia Corporation System and apparatus for low-complexity fine granularity scalable video coding with motion compensation
WO2007109938A1 (en) * 2006-03-27 2007-10-04 Huawei Technologies Co., Ltd. A video layered coding method and a coding and decoding device in interlaced mode
WO2008006806A3 (en) * 2006-07-10 2008-05-29 Nokia Siemens Networks Gmbh Method and encoding device for encoding an image area of an image of an image sequence in at least two quality levels, and method and decoding device for decoding a first encoded data flow and a second encoded data flow
WO2008006806A2 (en) * 2006-07-10 2008-01-17 Nokia Siemens Networks Gmbh & Co. Kg Method and encoding device for encoding an image area of an image of an image sequence in at least two quality levels, and method and decoding device for decoding a first encoded data flow and a second encoded data flow
WO2008007342A3 (en) * 2006-07-11 2008-06-19 Nokia Corp Scalable video coding
US20080056356A1 (en) * 2006-07-11 2008-03-06 Nokia Corporation Scalable video coding
US8422555B2 (en) 2006-07-11 2013-04-16 Nokia Corporation Scalable video coding

Also Published As

Publication number Publication date
AU1928999A (en) 1999-07-12

Similar Documents

Publication Publication Date Title
US7289675B2 (en) Scalable predictive coding method and apparatus
WO1999033274A1 (en) Scalable predictive coding method and apparatus
KR100714696B1 (en) Method and apparatus for coding video using weighted prediction based on multi-layer
KR100703760B1 (en) Video encoding/decoding method using motion prediction between temporal levels and apparatus thereof
EP0691054B1 (en) Efficient transcoding device and method
US7627040B2 (en) Method for processing I-blocks used with motion compensated temporal filtering
US20050157793A1 (en) Video coding/decoding method and apparatus
US20020037046A1 (en) Totally embedded FGS video coding with motion compensation
KR20060006711A (en) Method for temporal decomposition and inverse temporal decomposition for video coding and decoding, and video encoder and video decoder
US20050157794A1 (en) Scalable video encoding method and apparatus supporting closed-loop optimization
KR20060135992A (en) Method and apparatus for coding video using weighted prediction based on multi-layer
WO2006006777A1 (en) Method and apparatus for predecoding and decoding bitstream including base layer
US7245662B2 (en) DCT-based scalable video compression
US20060250520A1 (en) Video coding method and apparatus for reducing mismatch between encoder and decoder
US20060008002A1 (en) Scalable video encoding
JP2004511978A (en) Motion vector compression
KR20040068973A (en) Method and apparatus for motion compensated temporal interpolation of video sequences
US20070031052A1 (en) Morphological significance map coding using joint spatio-temporal prediction for 3-d overcomplete wavelet video coding framework
Kossentini et al. Rate-distortion-constrained subband video coding
Kim et al. Error resilient video coding using optimal multiple description of DCT coefficients
KR100664930B1 (en) Video coding method supporting temporal scalability and apparatus thereof
WO2003101115A1 (en) System and method for enhancing videos from drift-free scalable bitstream
Kim et al. Adaptive multiple reference frame based scalable video coding algorithm
Bosveld et al. Compatible video compression using subband and motion compensation techniques
WO2006109989A1 (en) Video coding method and apparatus for reducing mismatch between encoder and decoder

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 EP: PCT application non-entry in European phase