US6665646B1 - Predictive balanced multiple description coder for data compression - Google Patents

Predictive balanced multiple description coder for data compression Download PDF

Info

Publication number
US6665646B1
US6665646B1 US09/458,659 US45865999A US6665646B1 US 6665646 B1 US6665646 B1 US 6665646B1 US 45865999 A US45865999 A US 45865999A US 6665646 B1 US6665646 B1 US 6665646B1
Authority
US
United States
Prior art keywords
coupled
mapping unit
scalar
adder
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
US09/458,659
Inventor
Sam John
Vinay A. Vaishampayan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
AT&T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AT&T Corp filed Critical AT&T Corp
Priority to US09/458,659 priority Critical patent/US6665646B1/en
Assigned to AT&T CORPORATION reassignment AT&T CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: JOHN, SAM, VAISHAMPAYAN, VINAY A.
Application granted granted Critical
Publication of US6665646B1 publication Critical patent/US6665646B1/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes

Definitions

  • Data compression applications are used for various types of data including audio data, video data and executable content.
  • data compression occurs at an encoder.
  • the compressed data may be delivered to a decoder via a channel and decompressed into a replica (or sometimes an exact duplicate) of the source data.
  • a layered coder attempts to harmonize these competing interests.
  • a layered coder generates a compressed data signal that consists of two or more layers of coded data. These layers are often referred to as “streams.”
  • a first “base” layer of coded data represents basic information about the source data signal. If decompression were performed solely upon the base layer, it would be possible to obtain an acceptable and usable representation of the source data signal.
  • Layered coder typically provides additional information in one or more enhancement layers that, when decoded together with the base layer, refine the estimate that is obtained from the base layer.
  • the enhancement layers typically do not completely represent the source data.
  • the base layer is critical because a usable replica of the source signal cannot be obtained without it.
  • Such coders are disadvantageous because it may not always be possible to ensure that the base layer is available at the decoder. Accordingly, there is a need in the art for a coding system that generates multiple streams with the property that a usable replica can be decoded from any single stream. Further, there is a need in the art for a coding system that generates such streams with sufficient data compression.
  • the present invention provides a balanced multiple descriptive coder, one that codes data in streams so that an acceptable replica of source data can be generated if either stream is lost.
  • FIG. 1 is a block diagram of a balanced multiple descriptive coder.
  • FIG. 2 illustrates a first exemplary index assignment scheme
  • FIG. 3 illustrates a second exemplary index assignment scheme.
  • FIG. 4 is a block diagram of a balanced multiple descriptive coder according to an embodiment of the present invention.
  • FIGS. 5 ( a )-( c ) illustrate mutual refinability under shifts for index assignment schemes according to an embodiment of the present invention.
  • FIG. 6 is a block diagram of a balanced multiple descriptive coder according to a further embodiment of the present invention.
  • FIG. 7 is a block diagram of a balanced multiple descriptive coder according to an embodiment of the present invention.
  • FIG. 8 is a block diagram of a decoder according to another embodiment of the present invention.
  • FIG. 9 is a block diagram of a decoding branch according to an embodiment of the present invention.
  • the present invention provides a predictive balanced multiple description coding system.
  • An encoder generates two streams of compressed data signals from source data and outputs the streams to a channel, typically a packet switched network.
  • the decoder can generate a usable replica of the source data signal from any one of the streams. If all of the layers are received, the decoder can generate an improved replica of the source data signal.
  • FIG. 1 is a block diagram of a conventional balanced multiple description coder 100 .
  • Two channels connect the coder 100 to a decoder (not shown). Each channel supports a rate of R bits/source sample. When both channels carry data, a decoder is able to reconstruct a high-quality representation of the source signal s(t). When only one channel carries data, the decoder is able to reconstruct a usable representation of the source signal s(t).
  • the coder 100 may include the input terminal 110 for source data s(t) and a plurality of output terminals 120 , 130 for output data.
  • the coder 100 is populated by a quantizer 140 and a plurality of mapping units 150 , 160 .
  • the quantizer 140 may operate as conventional quantizers do.
  • q( ⁇ ) may be a uniform threshold quantization function.
  • q( ⁇ ) may represent division by a quantizer step size Q followed by rounding.
  • the output signal l(t) is called the “central index.”
  • Each of the mapping units 150 , 160 perform an index assignment that generates a pair of bin indices (i(t), j(t)) from the central index l(t) according to predetermined mapping functions a 1 ( ⁇ ), a 2 ( ⁇ ).
  • the mapping functions reflect a predetermined index assignment.
  • Bin index i(t) may be further coded and placed in a first channel.
  • Bin index j(t) also may be further coded and placed in a second channel.
  • the mapping units 150 , 160 may operate in accordance with predetermined index assignment schemes.
  • FIGS. 2 and 3 illustrate index assignment tables reflecting two different index assignment schemes.
  • Index assignment schemes are characterized by their reuse index (N) and spread.
  • the reuse index refers to the number of central indices l(t) that can be accessed by a single bin index i(t) or a single bin index j(t).
  • the reuse index N is 2.
  • the reuse index N is 3.
  • Spread refers to a difference between the largest and smallest central index in each row or column.
  • the spread in each row and column is 1.
  • the spread will be the identical among columns and rows of its index assignment table.
  • the spread determines the distortion imposed upon the decoding of the central index when one of the channels fail.
  • the decoder could estimate l according to some predetermined estimation process.
  • the index assignment table of FIG. 3 where the spread is 3, a different degree of distortion is introduced in the event of channel failure than would be introduced by the index assignment scheme of FIG. 2 .
  • the central index is reconstructed as one of several values.
  • FIG. 4 is a block diagram of a balanced multiple description coder 200 according to an embodiment of the present invention.
  • the embodiment of FIG. 4 provides a balanced multiple description coder 200 that employs predictive techniques.
  • the balanced multiple description coder 200 includes a pair of branch coders 210 , 220 .
  • Each of the branch coders 210 , 220 may include a coding chain 230 , 240 and a prediction circuit 250 , 260 .
  • source data s(t) is input to a subtractor 270 .
  • a predicted value of the source data ⁇ tilde over (s) ⁇ 1 (t) is input to a second input of the subtractor 270 .
  • the subtractor 270 generates a differential signal ⁇ s 1 (t) representing a difference between the source signal s(t) and the predicted source signal ⁇ tilde over (s) ⁇ 1 (t).
  • the differential signal ⁇ s 1 (t) is input to the first coding chain 230 .
  • The, first coding chain 230 may include a quantizer 280 and a mapping unit 290 .
  • the quantizer 280 requantizes the differential signal ⁇ s 1 (t) and generates an output that is used as a first central index signal l 1 (t).
  • the mapping unit 290 generates a first bin index signal i(t) from the central index signal l 1 (t). It outputs the bin index signal i(t) to the first channel and to the prediction circuit 250 .
  • the prediction circuit 250 may include an inverse mapping unit 300 , a multiplier 310 , an adder 320 , a predictor 330 , and a shifting circuit 410 .
  • the inverse mapping circuit 270 reconstructs a central index signal l 1 ′(t) based upon the bin index signal i(t). This reconstructed central index signal l 1 ′(t) has a value that would be obtained by decoding the first bin index signal i(t) only—as if the channel carrying the j(t) signal had failed.
  • the multiplier 310 scales the reconstructed central index signal l 1 ′(t) in a manner that inverts the quantization of the quantizer 280 .
  • the adder 320 reintroduces the value of the predicted source signal ⁇ tilde over (s) ⁇ 1 (t) that had been subtracted by the subtractor 270 .
  • the subtractor 270 generates a signal ⁇ dot over (s) ⁇ 1 (t) that represents a reconstruction of the source signal s(t).
  • the reconstructed source signal ⁇ dot over (s) ⁇ 1 (t) is input to the predictor 330 .
  • An output of the predictor 330 is input to a shifting circuit 410 .
  • the predictor 330 selects data from a reconstructed source signal ⁇ dot over (s) ⁇ 1 (t) at one time instant as a basis for prediction of the source signal s(t) at another time instant.
  • the shifting circuit 410 may scale the signal according to a mutual refinability property for the index assignment scheme.
  • the shifting circuit outputs a predicted source signal ⁇ tilde over (s) ⁇ 1 (t) to the adder 320 and the subtractor 270 .
  • the second coding branch 220 possesses a similar structure to the first coding branch 210 .
  • the second coding branch 220 may be populated by a coding 240 chain that includes a quantizer 340 and a mapping unit 350 .
  • a subtractor 360 receives the source signal s(t) on a first input and a second predicted source signal ⁇ tilde over (s) ⁇ 2 (t) on a second input. It outputs a differential signal ⁇ s 2 (t) to the quantizer 340 .
  • the quantizer 340 generates a second central index signal l 2 (t) from the differential signal ⁇ s 2 (t).
  • the mapping unit 350 generates a second bin index signal j(t) based upon the second central index l 2 (t).
  • the second bin index signal j(t) is output to the second channel and to the second prediction circuit 260 .
  • the second prediction circuit 260 may include an inverse mapping unit 370 , a multiplier 380 , an adder 390 , a predictor 400 and a shifting circuit 420 .
  • the inverse mapping unit 370 reconstructs the second central index signal l 2 ′(t) based solely upon the bin index signal j(t).
  • the multiplier 380 scales the second reconstructed central index signal l 2 ′(t) to invert the processing of the quantizer 340 .
  • the adder 390 adds the predicted value of the source data ⁇ tilde over (s) ⁇ 2 (t) that had been subtracted by the subtractor 360 ; it outputs a reconstructed source data signal ⁇ dot over (s) ⁇ 2 (t).
  • the predictor 400 selects data from the reconstructed source signal ⁇ dot over (s) ⁇ 2 (t) at one time instant as a basis for prediction of the source signal s(t) at another time instant.
  • the second shifting circuit 420 may scale the signal according to the mutual refinability property.
  • the second shifting circuit 420 outputs a predicted source signal ⁇ tilde over (s) ⁇ 2 (t) to the adder 390 and the subtractor 360 .
  • the prediction circuits 260 , 330 from each coding branch 210 , 220 each perform an inverse map based solely upon the branch's respective bin index i(t) or j(t).
  • inverse mapping unit 300 of the first coding branch 210 generates a first reconstructed central index signal l 1 ′(t) based solely on the first bin index i(t);
  • the inverse mapping unit 370 of the second coding branch 220 generates a second reconstructed central index signal l 2 ′(t) based on solely on the second bin index j(t).
  • these reconstructed signals are created independently from each other and may have different values as the encoder 200 operates.
  • Differences in the reconstructed signals l 1 ′(t) and l 2 ′(t) may lead to differences in the reconstructed source signals ⁇ dot over (s) ⁇ 1 (t) and ⁇ dot over (s) ⁇ 2 (t) and in the predicted source signals ⁇ tilde over (s) ⁇ 1 (t) and ⁇ tilde over (s) ⁇ 2 (t).
  • FIG. 4 exploits a property of the balanced multiple descriptive coder called “mutual refinability”.
  • the index assignment scheme is one-to-one: For every bin index combination, the bin indices intersect at most one non-empty central index position.
  • the resulting assignment is obtained by shifting i relative to j.
  • FIG. 5 ( b ) illustrates, however, that indiscriminate shifts may cause the index assignment scheme to lose the single intersection property.
  • a six-position shift maintains the one-to-one mapping property of the index assignment scheme of FIG. 5 ( a ).
  • the six-position shift is shown in FIG. 5 ( c ). Accordingly, an index assignment scheme is said to be “mutually refinable under shifts” having period M if M is a finite integer representing the smallest number of shift positions that maintains the singular intersection property.
  • the mutual refinement property is advantageous because, when both channels work, it ensures that the quantizer error is bounded by half of the quantizer step size.
  • the prediction branches include shifters 410 , 420 that exploit the periodicity of the index assignment scheme.
  • the second shifter 420 also operates according to this principle.
  • the present invention provides a predictive balanced multiple description encoder.
  • the encoder possesses an advantageous property in that the encoder generates coded data streams on two independent properties in such a way that, if one stream were lost entirely, a useful replica of a source signal could be recovered based on the other stream.
  • the balanced multiple description coder may be further enhanced according to any of the following:
  • FIG. 6 illustrates a balanced multiple descriptive coder 500 according to an embodiment of the present invention.
  • FIG. 5 illustrates a pair of coding branches 510 and 520 , each being populated by a coding chain 530 , 540 , and a prediction circuit 550 , 560 .
  • the encoding circuit 530 may be populated by a source transformation 570 , a quantizer 580 and a mapping unit 590 .
  • An input source signal s(t) is input first to a subtractor 600 .
  • the subtractor 600 outputs a differential signal ⁇ s 1 (t) based upon a difference between the source signal s(t) and a first predicted signal ⁇ tilde over (s) ⁇ 1 (t).
  • the differential signal ⁇ s 1 (t) is input to the encoding circuit 530 .
  • the source transformation 570 typically tailors the encoder 500 for use with a predetermined type of data signal such as video.
  • the source transformation 570 codes the differential signal according to an algorithm that takes advantage of certain redundancies that characterize the source data s(t).
  • Exemplary transformations for video include: discrete cosine transform (DCT), fast Fourier transform (FFT), and wavelet transform (WT).
  • DCT discrete cosine transform
  • FFT fast Fourier transform
  • WT wavelet transform
  • the source transformation 570 outputs a coded signal to the quantizer 580 .
  • the quantizer 580 and mapping unit 590 operate according to the techniques described above with respect to the previous embodiments.
  • the quantizer 580 may perform data truncation upon the coded signal from the source encoder 570 , outputting a signal that is used as the central index signal l 1 (t).
  • the mapping unit 590 generates a bin index signal i(t) from the central index signal l 1 (t). The bin index signal is output to the first channel and to the prediction circuit 550 .
  • the prediction circuit 550 may be populated by an inverse mapping unit 610 , a source decoder 620 , a predictor 630 , a multiplier 640 , an adder 650 and a shifting circuit 670 .
  • the inverse mapping unit 610 generates a reconstructed central index l 1 ′(t) from the bin index signal i(t).
  • the multiplier 640 scales the reconstructed index signal l 1 ′(t) by a multiplying factor Q that inverts the quantization applied by the quantizer 580 .
  • the source decoder 620 inverts the transformation that had been applied by the source transformation 570 .
  • the adder 650 reintroduces the predicted signal ⁇ tilde over (s) ⁇ 2 (t) that had been subtracted by subtractor 600 .
  • the prediction circuit 630 selects data from the reconstructed source data signal at one time instant as a basis for prediction of the source signal at another time instant.
  • the shifting circuit 670 exploits the mutual refinability property as discussed above.
  • the prediction chain also may include a second source transformation 660 and inverse source transform 680 coupled to inputs and outputs of the shifting circuit 670 , respectively.
  • a second source transformation 660 and inverse source transform 680 coupled to inputs and outputs of the shifting circuit 670 , respectively.
  • Such an embodiment permits the shifting circuit 670 to operate in a domain of source transformed data.
  • the second coding branch 520 may possess the same structure as the first coding branch 510 . It may be populated by a second encoder chain 540 and a second prediction circuit 560 .
  • the encoder chain 540 may include a source transform 690 , a quantizer 700 and a mapping unit 710 .
  • the mapping unit 710 outputs a second bin index signal j(t) to a channel and to the prediction circuit 560 .
  • the structure of the second prediction circuit 560 parallels that of the first prediction circuit 550 .
  • the second prediction circuit 560 may be populated by an inverse mapping unit 730 , an inverse source transform 740 , a prediction circuit 750 , a multiplier 760 , an adder 770 , and a shifter 800 .
  • source transformations 790 and inverse source transformations 810 may be provided on inputs and outputs of the shifter 800 , respectively.
  • an embodiment permits the advantages of source coding transforms to be integrated with a balanced multiple descriptive coder.
  • the encoder 500 benefit from motion compensated temporal prediction.
  • the predictors 630 and 750 may accept motion vectors on inputs 820 , 830 , respectively, and predict source data using both temporal and spatial prediction. This embodiment finds application in video coding applications.
  • source data for video includes both spatial and temporal references.
  • video data may be organized into frames, each frame being represented as a two-dimensional array of display data.
  • Motion vectors generally, identify display data from a spatial region in another frame that can be used as a basis for prediction of source data in a current position of a current frame.
  • source data s(t) and reconstructed source data ⁇ dot over (s) ⁇ (t) may possess both temporal and spatial references.
  • the prediction circuits 630 and 750 may store an entire frame of reconstructed source data ⁇ dot over (s) ⁇ (t).
  • the predictors may output a portion of the stored reconstructed source data identified by the motion vectors as a predicted source signal.
  • the motion vectors may be generated at the encoder according to any of a number of conventional techniques.
  • FIG. 7 illustrates a balanced multiple description coder 900 according to another embodiment of the present invention.
  • the coder 900 includes a pair of coding branches 910 , 920 , each coding branch including a coding chain 930 , 940 and a prediction circuit 950 , 960 .
  • the first coding chain 930 is populated by a quantizer 970 , a mapping unit 980 and a selector 990 .
  • the quantizer 970 receives a differential signal ⁇ s 1 (t) from a subtractor 1000 , requantifies it according to a quantizer scalar and outputs the requantified signal as the central index signal l 1 (t).
  • the mapping unit 990 generates two bin index signals i 1 (t), j 1 (t) from the central index signal l 1 (t). Both bin index signals i 1 (t), j 1 (t) are input the selector 990 .
  • the selector 999 outputs one of the bin index signals (say, j 1 (t)) and outputs it both to the first channel and to the prediction circuit 950 .
  • the second coding chain 940 possesses a structure that parallels the structure of the first coding chain 930 . It may include a quantizer 1010 , a mapping unit 1020 and a selector 1030 .
  • the quantizer 1010 receives a second differential signal ⁇ s 2 (t) from a subtractor 1040 , requantifies it and outputs a second central index signal l 2 (t).
  • the mapping unit 1020 generates a second pair of bin index signals i 2 (t), j 2 (t) from the second central index signal l 2 (t). Both bin index signals i 2 (t), j 2 (t) are input the selector 1030 .
  • the selector 1030 outputs one of the bin index signals (say, i 2 (t)) and outputs it both to the second channel and to the second prediction circuit 960 .
  • the first prediction circuit 950 may be populated by an inverse mapping unit 1040 , a multiplier 1050 , an adder 1060 , a predictor 1070 , and a shifting circuit 1140 operating in accordance with the previous embodiments.
  • the second prediction circuit 960 may be populated by an inverse mapping unit 1090 , a multiplier 1100 , an adder 1110 , a predictor 1120 , and a shifting circuit 1150 that operates according to previous embodiments.
  • FIG. 7 maintains mutual refinability for shifts that are smaller than the minimum period M defined with respect to FIG. 6 .
  • This embodiment maintains refinability for shift of 3 for the index assignment shown in FIG. 6 .
  • the enhancements introduced with respect to FIGS. 6 and 7 may be employed independently from one another.
  • the principles of the present invention are not so limited.
  • the principles of the present invention permit a predictive balanced multiple descriptive coder having source transformation per se, motion compensated prediction per se or multiple index assignment predictions per branch per se. Indeed, embodiments of the present invention may omit all three of these enhancements and simply provide a predictive balanced multiple descriptive coder.
  • FIG. 8 illustrates a decoder 1200 according to an embodiment of the present invention.
  • the decoder 1200 includes a pair of input terminals 1210 , 1220 for respective bin index signals i(t) and j(t) and several output terminals 1230 - 1250 .
  • the decoder 1200 may include a pair of decoding chains 1260 , 1270 , one for each bin index signal.
  • a first decoding chain 1260 may include an inverse mapping unit 1280 , a multiplier 1290 , an adder 1300 a prediction chain that includes a predictor 1310 and a shifting circuit 1430 .
  • the inverse mapping unit 1280 recreates a central bin index signal l 1 (t) from the bin index signal i(t).
  • the multiplier 1290 scales the recreated bin index signal l 1 (t) by the quantizer scalar Q and the adder 1300 adds a predicted source signal to the scaled signal.
  • the adder 1300 outputs a reconstructed source signal s 1 (t) based solely upon the bin index signal i(t).
  • the reconstructed source signal s 1 (t) may be output on terminal 1230 .
  • the predictor 1310 uses the reconstructed source signal s 1 (t) at one time instant as a basis for prediction of the source signal at another time instant.
  • the shifting circuit 1430 exploits the mutual refinement property discussed previously.
  • the shifting circuit 1430 outputs a predicted source signal to the adder 1300 .
  • the second prediction chain 1270 may possess a similar structure as the first prediction chain 1260 . It may include an inverse mapping unit 1320 , a multiplier 1330 , an adder 1340 and a predictor 1350 .
  • the inverse mapping unit 1320 generates a recreated central bin index signal l 2 (t) from the bin index signal j(t).
  • the multiplier 1330 scales the recreated bin index signal l 2 (t) by the quantizer scalar Q and the adder 1340 adds a predicted source signal to the scaled signal.
  • the adder 1340 outputs a reconstructed source signal s 2 (t) based solely upon the bin index signal j(t).
  • the reconstructed source signal s 2 (t) may be output on terminal 1240 .
  • the predictor 1350 generates a second predicted source signal from the reconstructed source signal and outputs the second predicted source signal to the adder 1340 .
  • the predictor 1350 and the shifting circuit 1420 may operate in manners similar to those of the first prediction chain, predictor 1310 and shifting circuit 1430 .
  • the decoder 1200 also may include an inverse mapping unit 1360 , an additional pair of adders 1370 , 1380 and three additional multipliers 1390 , 1400 , 1410 .
  • the multipliers 1390 , 1400 scale respective predicted source signals by a scalar ⁇ .
  • the scalar a represents a scale that is appropriate to render the predicted source signals to a level that is appropriate for the bin index signals.
  • the scalar ⁇ simply may be 1/M where M is the periodicity of the index assignment scheme.
  • the adders 1370 , 1380 add respective bin index signals i(t), j(t) to respective scaled predicted source signals from the multipliers 1390 or 1400 .
  • the summed signals both are input the inverse mapping unit 1360 .
  • the inverse mapping unit retrieves a central index value l(t) based upon the input signals.
  • the third multiplier 1410 scales the central index value by the quantization scalar Q, thereby generating a third source signal s 3 (t).
  • the third source signal s 3 (t) may be selected to be the decoded signal.
  • an associated output s 1 (t) or s 2 (t) may be taken to be the decoded source signal.
  • FIG. 9 is a partial block diagram illustrating processing of an enhanced decoder 1500 according to an embodiment of the present invention.
  • the decoder 1500 may include an inverse mapping unit 1510 , a multiplier 1520 and an adder 1530 that operates in accordance with the embodiment above. They generate a first decoded source signal s 1 (t) in response to a single bin index signal i(t).
  • the decoding branch may include a predictor 1560 and a shift circuit 1570 that operate in accordance with those of FIG. 8.
  • a decoder 1500 will include a corresponding set of inverse mapping units, adders and multipliers (not shown) to generate a second decoded source signal s 2 (t) based solely upon the other bin index signal j(t).
  • the decoding branches also may include a source inverse transform 1540 that operates upon a signal output from the first adder 1530 .
  • the embodiment also may include a source transform 1550 in a prediction branch in addition to the traditional predictor 1560 and shifter 1570 . This embodiment is appropriate to complement those embodiments that provide a source transform during an encoding process.
  • the prediction circuit 1560 may be augmented to perform prediction based on spatial and temporal references received in the data stream.
  • the prediction circuit 1560 may respond to motion vectors generated in the encoder (not shown), by using a predetermined spatial region of reconstructed source data from another time instant as a basis of prediction for source data in a current spatial region at a current time instant.
  • the decoder 1500 may include additional multipliers 1580 , 1600 , adders 1590 and a third inverse mapping unit 1610 that generates the third reconstructed source signal s 3 (t). These elements may operate in accordance with the embodiment of FIG. 8

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A balanced multiple descriptive coder codes data in streams so that an acceptable replica of source data can be generated if either stream is lost. The balanced multiple descriptive coder may include a pair of coding branches, each branch including a coding chain and a prediction chain.

Description

RELATED APPLICATIONS
This application may benefit from the priority of U.S. patent application Ser. No. 60/111,889 filed Dec. 11, 1998 and U.S. patent application Ser. No. 60/117,407 filed Jan. 27, 1999, the disclosures of which are incorporated herein by reference.
BACKGROUND
Data compression applications are used for various types of data including audio data, video data and executable content. Typically, data compression occurs at an encoder. The compressed data may be delivered to a decoder via a channel and decompressed into a replica (or sometimes an exact duplicate) of the source data.
Various data compression designs are available in the art. They typically reflect two competing interests. On one hand, it is desirable to eliminate all redundancies from a source data signal so that the compressed data signal occupies as little bandwidth as possible when it is placed in a channel. One the other hand, it becomes necessary to ensure that some amount of redundancy remains in the compressed data signal to make it possible to perform data decompression even in the face of data corruption that may be caused by channel imperfections.
Known “layered” coders (or “hierarchical” or “embedded” coders) attempt to harmonize these competing interests. A layered coder generates a compressed data signal that consists of two or more layers of coded data. These layers are often referred to as “streams.” A first “base” layer of coded data represents basic information about the source data signal. If decompression were performed solely upon the base layer, it would be possible to obtain an acceptable and usable representation of the source data signal. Layered coder typically provides additional information in one or more enhancement layers that, when decoded together with the base layer, refine the estimate that is obtained from the base layer. The enhancement layers typically do not completely represent the source data.
In a layered coder, the base layer is critical because a usable replica of the source signal cannot be obtained without it. Such coders are disadvantageous because it may not always be possible to ensure that the base layer is available at the decoder. Accordingly, there is a need in the art for a coding system that generates multiple streams with the property that a usable replica can be decoded from any single stream. Further, there is a need in the art for a coding system that generates such streams with sufficient data compression.
SUMMARY OF THE INVENTION
The present invention provides a balanced multiple descriptive coder, one that codes data in streams so that an acceptable replica of source data can be generated if either stream is lost.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a block diagram of a balanced multiple descriptive coder.
FIG. 2 illustrates a first exemplary index assignment scheme.
FIG. 3 illustrates a second exemplary index assignment scheme.
FIG. 4 is a block diagram of a balanced multiple descriptive coder according to an embodiment of the present invention.
FIGS. 5(a)-(c) illustrate mutual refinability under shifts for index assignment schemes according to an embodiment of the present invention.
FIG. 6 is a block diagram of a balanced multiple descriptive coder according to a further embodiment of the present invention.
FIG. 7 is a block diagram of a balanced multiple descriptive coder according to an embodiment of the present invention.
FIG. 8 is a block diagram of a decoder according to another embodiment of the present invention.
FIG. 9 is a block diagram of a decoding branch according to an embodiment of the present invention.
DETAILED DESCRIPTION
The present invention provides a predictive balanced multiple description coding system. An encoder generates two streams of compressed data signals from source data and outputs the streams to a channel, typically a packet switched network. The decoder can generate a usable replica of the source data signal from any one of the streams. If all of the layers are received, the decoder can generate an improved replica of the source data signal.
A balanced multiple descriptive coder is known per se. FIG. 1 is a block diagram of a conventional balanced multiple description coder 100. Two channels connect the coder 100 to a decoder (not shown). Each channel supports a rate of R bits/source sample. When both channels carry data, a decoder is able to reconstruct a high-quality representation of the source signal s(t). When only one channel carries data, the decoder is able to reconstruct a usable representation of the source signal s(t).
As shown in FIG. 1, the coder 100 may include the input terminal 110 for source data s(t) and a plurality of output terminals 120, 130 for output data. The coder 100 is populated by a quantizer 140 and a plurality of mapping units 150, 160.
The quantizer 140 may operate as conventional quantizers do. The quantizer 140 requantizes a source sample s(t) to yield an integer index according to l(t)=q(s(t)) where the q(·) function represents a quantization function. In one example, q(·) may be a uniform threshold quantization function. In a simple case, q(·) may represent division by a quantizer step size Q followed by rounding. Herein, the output signal l(t) is called the “central index.”
Each of the mapping units 150, 160 perform an index assignment that generates a pair of bin indices (i(t), j(t)) from the central index l(t) according to predetermined mapping functions a1(·), a2(·). The mapping functions reflect a predetermined index assignment. Bin index i(t) may be further coded and placed in a first channel. Bin index j(t) also may be further coded and placed in a second channel.
The mapping units 150, 160 may operate in accordance with predetermined index assignment schemes. FIGS. 2 and 3 illustrate index assignment tables reflecting two different index assignment schemes. Consider the index assignment table of FIG. 2 first. There, the table includes a plurality of cells, most of which are empty. Values of l(t) are provided in the non-empty cells of the table. For each value of l(t) there is only one cell in the table occupied by that value. Thus, in the exemplary table of FIG. 2, l=−1 is found in the cell at (i=0, j=−1) and no other.
Index assignment schemes are characterized by their reuse index (N) and spread. The reuse index refers to the number of central indices l(t) that can be accessed by a single bin index i(t) or a single bin index j(t). In the example of FIG. 2, the reuse index N is 2. In the example of FIG. 3, the reuse index N is 3.
Spread refers to a difference between the largest and smallest central index in each row or column. In the index assignment table of FIG. 2, the spread in each row and column is 1. For a perfectly balanced multiple descriptive coder, the spread will be the identical among columns and rows of its index assignment table.
The spread determines the distortion imposed upon the decoding of the central index when one of the channels fail. Consider an example where the quantizer 140 generates a central index l=2. According to the index assignment table of FIG. 2, bin index i=1 and bin index j=1. At a decoder, if the channel that includes bin index j were lost, a decoder could not determine l with precision. Using the surviving bin index, bin index i, the decoder would determine that l=1 or 2. In the absence of a precise description of l, the decoder could estimate l according to some predetermined estimation process. According to the index assignment table of FIG. 3 where the spread is 3, a different degree of distortion is introduced in the event of channel failure than would be introduced by the index assignment scheme of FIG. 2. Again, in the absence of one of the bin indices, the central index is reconstructed as one of several values.
FIG. 4 is a block diagram of a balanced multiple description coder 200 according to an embodiment of the present invention. The embodiment of FIG. 4 provides a balanced multiple description coder 200 that employs predictive techniques. The balanced multiple description coder 200 includes a pair of branch coders 210, 220. Each of the branch coders 210, 220 may include a coding chain 230, 240 and a prediction circuit 250, 260.
Within the first branch coder 210, source data s(t) is input to a subtractor 270. A predicted value of the source data {tilde over (s)}1(t) is input to a second input of the subtractor 270. The subtractor 270 generates a differential signal Δs1(t) representing a difference between the source signal s(t) and the predicted source signal {tilde over (s)}1(t). The differential signal Δs1(t) is input to the first coding chain 230.
The, first coding chain 230 may include a quantizer 280 and a mapping unit 290. The quantizer 280 requantizes the differential signal Δs1(t) and generates an output that is used as a first central index signal l1(t). The mapping unit 290 generates a first bin index signal i(t) from the central index signal l1(t). It outputs the bin index signal i(t) to the first channel and to the prediction circuit 250.
The prediction circuit 250 may include an inverse mapping unit 300, a multiplier 310, an adder 320, a predictor 330, and a shifting circuit 410. The inverse mapping circuit 270 reconstructs a central index signal l1′(t) based upon the bin index signal i(t). This reconstructed central index signal l1′(t) has a value that would be obtained by decoding the first bin index signal i(t) only—as if the channel carrying the j(t) signal had failed. The multiplier 310 scales the reconstructed central index signal l1′(t) in a manner that inverts the quantization of the quantizer 280. The adder 320 reintroduces the value of the predicted source signal {tilde over (s)}1(t) that had been subtracted by the subtractor 270. The subtractor 270 generates a signal {dot over (s)}1(t) that represents a reconstruction of the source signal s(t).
The reconstructed source signal {dot over (s)}1(t) is input to the predictor 330. An output of the predictor 330 is input to a shifting circuit 410. The predictor 330 selects data from a reconstructed source signal {dot over (s)}1(t) at one time instant as a basis for prediction of the source signal s(t) at another time instant. The shifting circuit 410 may scale the signal according to a mutual refinability property for the index assignment scheme. The shifting circuit outputs a predicted source signal {tilde over (s)}1(t) to the adder 320 and the subtractor 270.
The second coding branch 220 possesses a similar structure to the first coding branch 210. The second coding branch 220 may be populated by a coding 240 chain that includes a quantizer 340 and a mapping unit 350. A subtractor 360 receives the source signal s(t) on a first input and a second predicted source signal {tilde over (s)}2(t) on a second input. It outputs a differential signal Δs2(t) to the quantizer 340. The quantizer 340 generates a second central index signal l2(t) from the differential signal Δs2(t). The mapping unit 350 generates a second bin index signal j(t) based upon the second central index l2(t). The second bin index signal j(t) is output to the second channel and to the second prediction circuit 260.
The second prediction circuit 260 may include an inverse mapping unit 370, a multiplier 380, an adder 390, a predictor 400 and a shifting circuit 420. The inverse mapping unit 370 reconstructs the second central index signal l2′(t) based solely upon the bin index signal j(t). The multiplier 380 scales the second reconstructed central index signal l2′(t) to invert the processing of the quantizer 340. The adder 390 adds the predicted value of the source data {tilde over (s)}2(t) that had been subtracted by the subtractor 360; it outputs a reconstructed source data signal {dot over (s)}2(t).
The predictor 400 selects data from the reconstructed source signal {dot over (s)}2(t) at one time instant as a basis for prediction of the source signal s(t) at another time instant. The second shifting circuit 420 may scale the signal according to the mutual refinability property. The second shifting circuit 420 outputs a predicted source signal {tilde over (s)}2(t) to the adder 390 and the subtractor 360.
According to an embodiment of the present invention, the prediction circuits 260, 330 from each coding branch 210, 220 each perform an inverse map based solely upon the branch's respective bin index i(t) or j(t). Thus, inverse mapping unit 300 of the first coding branch 210 generates a first reconstructed central index signal l1′(t) based solely on the first bin index i(t); the inverse mapping unit 370 of the second coding branch 220 generates a second reconstructed central index signal l2′(t) based on solely on the second bin index j(t). Thus, these reconstructed signals are created independently from each other and may have different values as the encoder 200 operates. Differences in the reconstructed signals l1′(t) and l2′(t) may lead to differences in the reconstructed source signals {dot over (s)}1(t) and {dot over (s)}2(t) and in the predicted source signals {tilde over (s)}1(t) and {tilde over (s)}2(t).
According to an embodiment of the present invention, the first and second predictors 330, 400 may be general finite impulse response filters with transfer function H(z)=h1z−1+h2z−2+ . . . hkz−k. For example, the predictors may be single sample delay elements H(z)=z−1.
The embodiment of FIG. 4 exploits a property of the balanced multiple descriptive coder called “mutual refinability”. As described with respect to FIGS. 2 and 3, the index assignment scheme is one-to-one: For every bin index combination, the bin indices intersect at most one non-empty central index position. FIG. 5(a) illustrates the index assignment scheme of FIG. 3, demonstrating that any given pair of bin indices (e.g. i=0, j=0) intersects over a single central index (l=0). In the design of a predictive multiple description system, the resulting assignment is obtained by shifting i relative to j. FIG. 5(b) illustrates, however, that indiscriminate shifts may cause the index assignment scheme to lose the single intersection property. FIG. 5(b) illustrates a one-position shift which causes i=0, j=0 to intersect at a pair of non-empty central indices, not just one central index.
A six-position shift, however, maintains the one-to-one mapping property of the index assignment scheme of FIG. 5(a). The six-position shift is shown in FIG. 5(c). Accordingly, an index assignment scheme is said to be “mutually refinable under shifts” having period M if M is a finite integer representing the smallest number of shift positions that maintains the singular intersection property.
The mutual refinement property is advantageous because, when both channels work, it ensures that the quantizer error is bounded by half of the quantizer step size.
Returning to FIG. 4, the prediction branches include shifters 410, 420 that exploit the periodicity of the index assignment scheme. Consider shifter 410 first. Given a locally decoded value of the source signal ŝ1(t), the shifter 410 may quantize the locally decoded source signal as: s ~ 1 ( t ) = MQ i h i s ^ 1 ( t - i ) MQ + 1 2 ( 1 )
Figure US06665646-20031216-M00001
where M is the mutually refinable shift period of the index assignment scheme, Q is the quantizer scalar. This shift has an effect of rounding the estimate of the predicted source signal {tilde over (s)}1(t) to the to the closest available shifted position in the index assignment scheme that satisfies mutual refinability. The second shifter 420 also operates according to this principle.
Thus, the present invention provides a predictive balanced multiple description encoder. The encoder possesses an advantageous property in that the encoder generates coded data streams on two independent properties in such a way that, if one stream were lost entirely, a useful replica of a source signal could be recovered based on the other stream.
Several enhancements are available to further improve the operation of the balanced multiple description encoder. According to the present invention, the balanced multiple description coder may be further enhanced according to any of the following:
source transform encoding prior to index assignment;
motion compensated temporal prediction; and
multiple index assignments in a coding branch.
Each of these embodiments may be applied independently from the other. They are described with reference to FIGS. 6 and 7.
FIG. 6 illustrates a balanced multiple descriptive coder 500 according to an embodiment of the present invention. FIG. 5 illustrates a pair of coding branches 510 and 520, each being populated by a coding chain 530, 540, and a prediction circuit 550, 560. Considering the first coding branch 510 first, the encoding circuit 530 may be populated by a source transformation 570, a quantizer 580 and a mapping unit 590. An input source signal s(t) is input first to a subtractor 600. The subtractor 600 outputs a differential signal Δs1(t) based upon a difference between the source signal s(t) and a first predicted signal {tilde over (s)}1(t). The differential signal Δs1(t) is input to the encoding circuit 530.
The source transformation 570 typically tailors the encoder 500 for use with a predetermined type of data signal such as video. Thus, the source transformation 570 codes the differential signal according to an algorithm that takes advantage of certain redundancies that characterize the source data s(t). Exemplary transformations for video include: discrete cosine transform (DCT), fast Fourier transform (FFT), and wavelet transform (WT). The source transformation 570 outputs a coded signal to the quantizer 580.
The quantizer 580 and mapping unit 590 operate according to the techniques described above with respect to the previous embodiments. The quantizer 580 may perform data truncation upon the coded signal from the source encoder 570, outputting a signal that is used as the central index signal l1(t). The mapping unit 590 generates a bin index signal i(t) from the central index signal l1(t). The bin index signal is output to the first channel and to the prediction circuit 550.
The prediction circuit 550 may be populated by an inverse mapping unit 610, a source decoder 620, a predictor 630, a multiplier 640, an adder 650 and a shifting circuit 670. The inverse mapping unit 610 generates a reconstructed central index l1′(t) from the bin index signal i(t). The multiplier 640 scales the reconstructed index signal l1′(t) by a multiplying factor Q that inverts the quantization applied by the quantizer 580. The source decoder 620 inverts the transformation that had been applied by the source transformation 570. The adder 650 reintroduces the predicted signal {tilde over (s)}2(t) that had been subtracted by subtractor 600.
The prediction circuit 630 selects data from the reconstructed source data signal at one time instant as a basis for prediction of the source signal at another time instant. The shifting circuit 670 exploits the mutual refinability property as discussed above.
In an embodiment employing source transformations in the coding chain 530, the prediction chain also may include a second source transformation 660 and inverse source transform 680 coupled to inputs and outputs of the shifting circuit 670, respectively. Such an embodiment permits the shifting circuit 670 to operate in a domain of source transformed data.
The second coding branch 520 may possess the same structure as the first coding branch 510. It may be populated by a second encoder chain 540 and a second prediction circuit 560. The encoder chain 540 may include a source transform 690, a quantizer 700 and a mapping unit 710. The mapping unit 710 outputs a second bin index signal j(t) to a channel and to the prediction circuit 560.
The structure of the second prediction circuit 560 parallels that of the first prediction circuit 550. The second prediction circuit 560 may be populated by an inverse mapping unit 730, an inverse source transform 740, a prediction circuit 750, a multiplier 760, an adder 770, and a shifter 800. Optionally, source transformations 790 and inverse source transformations 810 may be provided on inputs and outputs of the shifter 800, respectively.
Thus, an embodiment permits the advantages of source coding transforms to be integrated with a balanced multiple descriptive coder.
In another embodiment, the encoder 500 benefit from motion compensated temporal prediction. According to such an embodiment, the predictors 630 and 750 may accept motion vectors on inputs 820, 830, respectively, and predict source data using both temporal and spatial prediction. This embodiment finds application in video coding applications.
As is known, source data for video includes both spatial and temporal references. For example, video data may be organized into frames, each frame being represented as a two-dimensional array of display data. Motion vectors, generally, identify display data from a spatial region in another frame that can be used as a basis for prediction of source data in a current position of a current frame.
In this embodiment, source data s(t) and reconstructed source data {dot over (s)}(t) may possess both temporal and spatial references. The prediction circuits 630 and 750 may store an entire frame of reconstructed source data {dot over (s)}(t). In response to motion vectors generated in the encoder (not shown), the predictors may output a portion of the stored reconstructed source data identified by the motion vectors as a predicted source signal. The motion vectors may be generated at the encoder according to any of a number of conventional techniques.
FIG. 7 illustrates a balanced multiple description coder 900 according to another embodiment of the present invention. As with prior embodiments, the coder 900 includes a pair of coding branches 910, 920, each coding branch including a coding chain 930, 940 and a prediction circuit 950, 960.
The first coding chain 930 is populated by a quantizer 970, a mapping unit 980 and a selector 990. The quantizer 970 receives a differential signal Δs1(t) from a subtractor 1000, requantifies it according to a quantizer scalar and outputs the requantified signal as the central index signal l1(t). The mapping unit 990 generates two bin index signals i1(t), j1(t) from the central index signal l1(t). Both bin index signals i1(t), j1(t) are input the selector 990. The selector 999 outputs one of the bin index signals (say, j1(t)) and outputs it both to the first channel and to the prediction circuit 950.
The second coding chain 940 possesses a structure that parallels the structure of the first coding chain 930. It may include a quantizer 1010, a mapping unit 1020 and a selector 1030. The quantizer 1010 receives a second differential signal Δs2(t) from a subtractor 1040, requantifies it and outputs a second central index signal l2(t). The mapping unit 1020 generates a second pair of bin index signals i2(t), j2(t) from the second central index signal l2(t). Both bin index signals i2(t), j2(t) are input the selector 1030. The selector 1030 outputs one of the bin index signals (say, i2(t)) and outputs it both to the second channel and to the second prediction circuit 960.
The first prediction circuit 950 may be populated by an inverse mapping unit 1040, a multiplier 1050, an adder 1060, a predictor 1070, and a shifting circuit 1140 operating in accordance with the previous embodiments. The first prediction circuit 950 also may include prediction analyzer 1080 that controls operation of the selector 990 and the inverse mapping unit 1040 based on the predicted source data signal. For example, in an embodiment having the index assignment of FIG. 5, the prediction logic 1080 causes the selector 990 to output i1(t) to the channel if the predicted source signal {tilde over (s)}1(t) is an even multiple of 3Q. Otherwise the prediction logic 1080 causes the selector 990 to output the second bin index j1(t). For N=2, the logic unit always outputs i1(t).
Similarly, the second prediction circuit 960 may be populated by an inverse mapping unit 1090, a multiplier 1100, an adder 1110, a predictor 1120, and a shifting circuit 1150 that operates according to previous embodiments. The second prediction circuit 960 also may include prediction logic 1130 that controls operation of the selector 1030 and the inverse mapping unit 1090 in the second coding branch 920 based on the second predicted source data signal. For example, in an embodiment having the index assignment of FIG. 5, the prediction logic 1130 causes the selector 1030 to output i2(t) to the channel if {tilde over (s)}2(t) is an odd multiple of 3Q. Otherwise, the prediction logic 1130 causes the selector 1030 to output the second bin index signal j2(t) to the second channel. For N=2, the logic unit always outputs j2(t).
The embodiment illustrated in FIG. 7 maintains mutual refinability for shifts that are smaller than the minimum period M defined with respect to FIG. 6. This embodiment maintains refinability for shift of 3 for the index assignment shown in FIG. 6.
As noted, the enhancements introduced with respect to FIGS. 6 and 7 may be employed independently from one another. Thus, while it may be advantageous to provide a predictive balanced multiple descriptive coder that includes source transformation, motion compensated prediction and multiple index assignment predictions per branch, the principles of the present invention are not so limited. The principles of the present invention permit a predictive balanced multiple descriptive coder having source transformation per se, motion compensated prediction per se or multiple index assignment predictions per branch per se. Indeed, embodiments of the present invention may omit all three of these enhancements and simply provide a predictive balanced multiple descriptive coder.
FIG. 8 illustrates a decoder 1200 according to an embodiment of the present invention. As shown, the decoder 1200 includes a pair of input terminals 1210, 1220 for respective bin index signals i(t) and j(t) and several output terminals 1230-1250. The decoder 1200 may include a pair of decoding chains 1260, 1270, one for each bin index signal.
A first decoding chain 1260 may include an inverse mapping unit 1280, a multiplier 1290, an adder 1300 a prediction chain that includes a predictor 1310 and a shifting circuit 1430. The inverse mapping unit 1280 recreates a central bin index signal l1(t) from the bin index signal i(t). The multiplier 1290 scales the recreated bin index signal l1(t) by the quantizer scalar Q and the adder 1300 adds a predicted source signal to the scaled signal. The adder 1300 outputs a reconstructed source signal s1(t) based solely upon the bin index signal i(t). The reconstructed source signal s1(t) may be output on terminal 1230.
In the prediction chain, the predictor 1310 uses the reconstructed source signal s1(t) at one time instant as a basis for prediction of the source signal at another time instant. The shifting circuit 1430 exploits the mutual refinement property discussed previously. The shifting circuit 1430 outputs a predicted source signal to the adder 1300.
The second prediction chain 1270 may possess a similar structure as the first prediction chain 1260. It may include an inverse mapping unit 1320, a multiplier 1330, an adder 1340 and a predictor 1350. The inverse mapping unit 1320 generates a recreated central bin index signal l2(t) from the bin index signal j(t). The multiplier 1330 scales the recreated bin index signal l2(t) by the quantizer scalar Q and the adder 1340 adds a predicted source signal to the scaled signal. The adder 1340 outputs a reconstructed source signal s2(t) based solely upon the bin index signal j(t). The reconstructed source signal s2(t) may be output on terminal 1240. The predictor 1350 generates a second predicted source signal from the reconstructed source signal and outputs the second predicted source signal to the adder 1340.
In the second prediction chain, the predictor 1350 and the shifting circuit 1420 may operate in manners similar to those of the first prediction chain, predictor 1310 and shifting circuit 1430.
The decoder 1200 also may include an inverse mapping unit 1360, an additional pair of adders 1370, 1380 and three additional multipliers 1390, 1400, 1410. The multipliers 1390, 1400 scale respective predicted source signals by a scalar α. The scalar a represents a scale that is appropriate to render the predicted source signals to a level that is appropriate for the bin index signals. In an embodiment, the scalar α simply may be 1/M where M is the periodicity of the index assignment scheme.
The adders 1370, 1380 add respective bin index signals i(t), j(t) to respective scaled predicted source signals from the multipliers 1390 or 1400. The summed signals both are input the inverse mapping unit 1360. The inverse mapping unit retrieves a central index value l(t) based upon the input signals. The third multiplier 1410 scales the central index value by the quantization scalar Q, thereby generating a third source signal s3(t).
According to an embodiment of the present invention, when both bin index signals are available to the decoder 1200, the third source signal s3(t) may be selected to be the decoded signal. When only one of the bin index signals i(t), or j(t) are available to the decoder 1200, an associated output s1(t) or s2(t) may be taken to be the decoded source signal.
The enhancements described herein above with respect to the encoder (FIG. 5) also are available to the decoder. FIG. 9 is a partial block diagram illustrating processing of an enhanced decoder 1500 according to an embodiment of the present invention. According to the embodiment, the decoder 1500 may include an inverse mapping unit 1510, a multiplier 1520 and an adder 1530 that operates in accordance with the embodiment above. They generate a first decoded source signal s1(t) in response to a single bin index signal i(t). The decoding branch may include a predictor 1560 and a shift circuit 1570 that operate in accordance with those of FIG. 8. A decoder 1500 will include a corresponding set of inverse mapping units, adders and multipliers (not shown) to generate a second decoded source signal s2(t) based solely upon the other bin index signal j(t).
In an embodiment, the decoding branches also may include a source inverse transform 1540 that operates upon a signal output from the first adder 1530. The embodiment also may include a source transform 1550 in a prediction branch in addition to the traditional predictor 1560 and shifter 1570. This embodiment is appropriate to complement those embodiments that provide a source transform during an encoding process.
According to another embodiment, the prediction circuit 1560 may be augmented to perform prediction based on spatial and temporal references received in the data stream. For example, the prediction circuit 1560 may respond to motion vectors generated in the encoder (not shown), by using a predetermined spatial region of reconstructed source data from another time instant as a basis of prediction for source data in a current spatial region at a current time instant.
The decoder 1500 may include additional multipliers 1580,1600, adders 1590 and a third inverse mapping unit 1610 that generates the third reconstructed source signal s3(t). These elements may operate in accordance with the embodiment of FIG. 8
Several embodiments of the present invention are specifically illustrated and described herein. However, it will be appreciated that modifications and variations of the present invention are covered by the above teachings and within the purview of the appended claims without departing from the spirit and intended scope of the invention.

Claims (34)

We claim:
1. A balanced multiple descriptive coder, comprising:
a plurality of coding branches, each coding branch coupled to a common data sources each branch comprising:
1) an encoding circuit comprising:
a) a quantizer, and
b) a mapping unit coupled to the quantizer and responsive to a data value from the common data source, the mapping unit generating a respective one of a plurality of index values representative of the data value with reference to an index assignment scheme having uniform spread;
2) a prediction circuit coupled to an output of the encoding circuit, and
3) a subtractor having an input for source data and a second input coupled to the prediction circuit, an output of the subtractor being input to the encoding circuit.
2. The balanced multiple descriptive coder of claim 1, wherein the encoding circuits further comprise a source transform coupled to the subtractor, an output of which is input to the quantizer.
3. The balanced multiple descriptive coder of claim 2, wherein the prediction circuit further comprises an inverse source transform.
4. The balanced multiple descriptive coder of claim 1, wherein:
the prediction circuit includes a prediction analyzer,
the encoding circuits further comprise a selection switch controlled by the prediction analyzer and
the mapping unit includes a pair of outputs, both coupled to the selection switch.
5. The balanced multiple descriptive coder of claim 1, wherein the prediction circuit comprises:
an inverse mapping unit,
a scalar coupled to the inverse mapping unit,
an adder having inputs coupled to the scalar and an output of the prediction circuit, and
a predictor coupled to the adder.
6. The balanced multiple descriptive coder of claim 5, wherein the predictor is a single sample delay element.
7. The balanced multiple descriptive coder of claim 5, wherein the predictor is a motion compensated predictor.
8. The balanced multiple descriptive coder of claim 5, wherein the prediction circuit further comprises an inverse source transform interposed between the scalar and the adder.
9. The balanced multiple descriptive coder of claim 5, wherein the prediction circuit further comprises a mutual refinability shifter coupled to the predictor.
10. The balanced multiple descriptive coder of claim 5, wherein the prediction circuit further comprises:
a source transform coupled to the predictor,
a mutual refinability shifter coupled to the source transform, and
an inverse source transform coupled to the shifter.
11. A method of coding a source data signal comprising in each of a pair of coding branches:
generating a respective differential signal representing a difference between the source data signal and a respective predicted source signal,
quantizing the respective differential source signal,
generating a respective bin index signal accordingly to a mutually refinable index assignment scheme,
predicting the respective source signal from the respective bin index signals, and
outputting each bin index signal to respective channels.
12. The method of claim 11, further comprising, prior to the quantizing step, source encoding of the respective differential signal in each encoding branch.
13. A balanced multiple descriptive coder, comprising:
a plurality of coding branches, each coding branch comprising:
1) an encoding circuit, comprising:
a) a source transform,
b) a quantizer coupled to the source transform, and
c) a mapping unit coupled toe quantizer and operating in accordance with an index assignment scheme having uniform spread;
2) a prediction circuit, comprising:
a) an inverse mapping unit,
b) a scalar coupled to the inverse mapping unit,
c) an adder having inputs coupled to an input of the encoding circuit and to the scalar,
d) a predictor coupled to the scalar, and
e) an inverse source transform coupled to the predictor and
3) a subtractor having an input for source data and a second input coupled to the prediction circuit, an output of the subtractor being coupled to the encoder circuit.
14. A balanced multiple descriptive coder, comprising:
a plurality of coding branches, each coding branch comprising:
1) an encoding circuit comprising;
a) a quantizer,
b) a mapping unit coupled to the quantizer and operating in accordance with an index assignment scheme having uniform spread, the mapping unit having a pair of outputs, and
c) a selection switch coupled the two outputs of the mapping unit,
2) a prediction circuit, comprising:
a) an inverse mapping unit,
b) a scalar coupled to the inverse mapping unit,
c) an adder having inputs coupled to an input of the encoding circuit and to the scalar,
d) a predictor coupled to the scalar, and
e) a prediction analyzer coupled to a control input of the encoding circuit; and
3) a subtractor having an input for source data and a second input coupled to the prediction circuit, an output of the subtractor being coupled to the encoding circuit.
15. A balanced multiple descriptive coder, comprising:
a plurality of coding branches, each coding branch comprising:
1) an encoding circuits comprising:
a) a quantizer, and
b) a mapping unit coupled to the quantizer and operating in accordance with an index assignment scheme having uniform spread;
2) a prediction circuit, comprising:
a) an inverse mapping unit,
b) a scalar coupled to the inverse mapping unit,
c) an adder having inputs coupled to an input of the encoding circuit and to the scalar, and
d) a motion compensated predictor coupled to the adder; and a subtractor having an input for source data and a second input coupled to the prediction circuit, an output of the subtractor being coupled to the encoding circuit.
16. A balanced multiple descriptive coder, comprising:
a plurality of coding branches, each coding branch comprising:
1an encoding circuit, comprising:
a) a quantizer, and
b) a mapping unit coupled to the quantizer and operating in accordance with an index assignment scheme having uniform spread;
2) a prediction circuit, comprising:
a) an inverse mapping unit,
b) a scalar coupled to the inverse mapping unit,
c) an adder having inputs coupled to an input of the encoding circuit and to the scalar,
d) a predictor coupled to the scalar, and
e) a mutual refinability shifter, coupled to the predictor; and
3) a subtractor having an input for source data and a second input coupled to the prediction circuit, an output of the subtractor being coupled to the encoding circuit.
17. A balanced multiple descriptive decoder, comprising:
1) a pair of decoding chains, each comprising:
a) an inverse mapping unit,
b) a flat scalar coupled to the inverse mapping unit,
c) a first adder coupled to the scalar,
d) a prediction circuit having an input coupled to an output of the adder and an output coupled to an input of the adder,
e) a second scalar coupled to the output of the prediction circuit, and
f) a second adder having inputs coupled to an input of the inverse mapping unit and to the second scalar; and
2) an inverse mapping unit coupled to outputs of the second adders from each decoding chain.
18. The balanced multiple descriptive decoder of claim 16, further comprising an inverse source transform coupled to the first adder.
19. The balanced multiple descriptive decoder of claim 17, wherein the prediction circuit is a single sample delay element.
20. The balanced multiple descriptive decoder of claim 17, wherein the predictor is a motion compensated predictor.
21. The balanced multiple descriptive decode of claim 16, wherein the prediction circuit comprises:
a predictor, and
an inverse source transform interposed between an output of the predictor and an input of the first adder.
22. The balanced multiple descriptive decoder of claim 16, wherein the prediction circuit comprises:
a predictor, and
a mutual refinability shifter coupled to the predictor.
23. The balance multiple descriptive decoder of claim 16, wherein the prediction circuit comprises:
a predictor,
a source transform coupled to the predictor,
a mutual refinability shifter coupled to the source transform, and
an inverse source transform coupled to the shifter.
24. A balanced multiple descriptive decoder, comprising:
1) a pair of decoding chains, each comprising:
a) an inverse mapping unit,
b) a first scalar coupled to the inverse mapping unit,
c) a first adder coupled to the scalar,
d) a inverse source transform coupled to the first adder,
e) a prediction circuit having an input coupled to an output of the inverse source transform;
f) an inverse source transform coupled to an input of the prediction circuit and having an output coupled to an input of the first adder,
g) a second scalar coupled to the output of the prediction circuit, and
h) a second adder having inputs coupled to an input of the inverse mapping unit and to the second scalar; and
2) an inverse mapping unit coupled to outputs of the second adders from each decoding chain.
25. A balanced multiple descriptive decoder, comprising:
1) a pair of decoding chains, each comprising:
a) an inverse mapping unit,
b) a first scalar coupled to the inverse mapping unit,
c) a first adder coupled to the scalar,
d) a motion compensated prediction circuit having an input coupled to an output of the first adder and an output coupled to an input of the first adder,
e) a second scalar coupled to the output of the prediction circuit, and
f) a second adder having inputs coupled to an input of the inverse mapping unit and to the second scalar; and
2) an inverse mapping unit coupled to outputs of the second adders from each decoding chain.
26. A balanced multiple descriptive decoder, comprising:
1) a pair of decoding chains, each comprising:
a) an inverse mapping unit,
b) a first scalar coupled to the inverse mapping unit,
c) a first adder coupled to the scalar, and
d) a predication circuit, comprising:
i) a predictor, and
ii) a mutual refinability shifter coupled to the predictor,
e) a second scalar coupled to the output of the prediction circuit, and
f) a second adder having inputs coupled to an input of the inverse mapping unit and to the second scalar; and
2) an inverse mapping unit coupled to outputs of the second adders from each decoding chain.
27. A balanced multiple descriptive decoder, comprising:
1) a pair of decoding chains, each comprising:
a) an inverse mapping unit,
b) a first scalar coupled to the inverse mapping unit,
c) a first adder coupled to the scalar,
d) a inverse source transform coupled to the adder,
e) a prediction circuit, comprising;
i) a predictor,
ii) a source transform coupled to the predictor,
iii) a mutual refinability shifter coupled to the source transform, and
iv) an inverse source transform coupled to the shifter,
f) a second scalar coupled to the output of the prediction circuit, and
g) a second adder having inputs coupled to an input of the inverse mapping unit and to the second scalar; and
2) an inverse mapping unit coupled to outputs of the second adders from each decoding chain.
28. The balanced multiple descriptive coder of claim 1, wherein the encoding circuit associated with each of the plurality of coding branches generates a respective index value from the data value, the collection of index values uniquely representing the data value according to the index assignment scheme.
29. The balanced multiple descriptive coder of claim 13, wherein the encoding circuit associated with each of the plurality of coding branches generates a respective index value from the data value, the collection of index values uniquely representing the data value according to the index assignment scheme.
30. The balanced multiple descriptive coder of claim 14, wherein the encoding circuit associated with each of the plurality of coding branches generates a respective index value from the data value, the collection of index values uniquely representing the data value according to the index assignment scheme.
31. The balanced multiple descriptive coder of claim 15, wherein the encoding circuit associated with each of the plurality of coding branches generates a respective index value from the data value, the collection of index values uniquely representing the data value according to the index assignment scheme.
32. The balanced multiple descriptive coder of claim 16, wherein the encoding circuit associated with each of the plurality of coding branches generates a respective index value from the data value, the collection of the index values uniquely representing the data value according to the index assignment scheme.
33. The balanced multiple descriptive coder of 1, wherein the prediction circuit of each coding branch operates independently of the prediction circuit of every other coding branch.
34. The balanced multiple descriptive coder of claim 1, wherein the prediction circuit of each coding branch generates a prediction of a source data value based on a predicted index value of the coding branch to which the prediction circuit belongs but no other index values.
US09/458,659 1998-12-11 1999-12-10 Predictive balanced multiple description coder for data compression Expired - Fee Related US6665646B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/458,659 US6665646B1 (en) 1998-12-11 1999-12-10 Predictive balanced multiple description coder for data compression

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US11188998P 1998-12-11 1998-12-11
US11740799P 1999-01-27 1999-01-27
US09/458,659 US6665646B1 (en) 1998-12-11 1999-12-10 Predictive balanced multiple description coder for data compression

Publications (1)

Publication Number Publication Date
US6665646B1 true US6665646B1 (en) 2003-12-16

Family

ID=29715991

Family Applications (1)

Application Number Title Priority Date Filing Date
US09/458,659 Expired - Fee Related US6665646B1 (en) 1998-12-11 1999-12-10 Predictive balanced multiple description coder for data compression

Country Status (1)

Country Link
US (1) US6665646B1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050229072A1 (en) * 2004-03-31 2005-10-13 Setton Ezra J E Multimedia content delivery using pre-stored multiple description coded video with restart
US20060074643A1 (en) * 2004-09-22 2006-04-06 Samsung Electronics Co., Ltd. Apparatus and method of encoding/decoding voice for selecting quantization/dequantization using characteristics of synthesized voice
US20080015856A1 (en) * 2000-09-14 2008-01-17 Cheng-Chieh Lee Method and apparatus for diversity control in mutiple description voice communication
US7412381B1 (en) * 2000-09-14 2008-08-12 Lucent Technologies Inc. Method and apparatus for diversity control in multiple description voice communication
US20100027678A1 (en) * 2008-07-30 2010-02-04 Stmicroelectronics S.R.I. Encoding and decoding methods and apparatus, signal and computer program product therefor
US11030524B2 (en) * 2017-04-28 2021-06-08 Sony Corporation Information processing device and information processing method

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5576765A (en) * 1994-03-17 1996-11-19 International Business Machines, Corporation Video decoder
US5651090A (en) * 1994-05-06 1997-07-22 Nippon Telegraph And Telephone Corporation Coding method and coder for coding input signals of plural channels using vector quantization, and decoding method and decoder therefor
US5740283A (en) * 1995-07-06 1998-04-14 Rubin, Bednarek & Associates, Inc. Digital video compression utilizing mixed vector and scalar outputs
US5777679A (en) * 1996-03-15 1998-07-07 International Business Machines Corporation Video decoder including polyphase fir horizontal filter
US5867602A (en) * 1994-09-21 1999-02-02 Ricoh Corporation Reversible wavelet transform and embedded codestream manipulation
US6195465B1 (en) * 1994-09-21 2001-02-27 Ricoh Company, Ltd. Method and apparatus for compression using reversible wavelet transforms and an embedded codestream
US6292589B1 (en) * 1996-06-21 2001-09-18 Compaq Computer Corporation Method for choosing rate control parameters in motion-compensated transform-based picture coding scheme using non-parametric technique
US6377914B1 (en) * 1999-03-12 2002-04-23 Comsat Corporation Efficient quantization of speech spectral amplitudes based on optimal interpolation technique

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5576765A (en) * 1994-03-17 1996-11-19 International Business Machines, Corporation Video decoder
US5651090A (en) * 1994-05-06 1997-07-22 Nippon Telegraph And Telephone Corporation Coding method and coder for coding input signals of plural channels using vector quantization, and decoding method and decoder therefor
US5867602A (en) * 1994-09-21 1999-02-02 Ricoh Corporation Reversible wavelet transform and embedded codestream manipulation
US6195465B1 (en) * 1994-09-21 2001-02-27 Ricoh Company, Ltd. Method and apparatus for compression using reversible wavelet transforms and an embedded codestream
US5740283A (en) * 1995-07-06 1998-04-14 Rubin, Bednarek & Associates, Inc. Digital video compression utilizing mixed vector and scalar outputs
US5777679A (en) * 1996-03-15 1998-07-07 International Business Machines Corporation Video decoder including polyphase fir horizontal filter
US6292589B1 (en) * 1996-06-21 2001-09-18 Compaq Computer Corporation Method for choosing rate control parameters in motion-compensated transform-based picture coding scheme using non-parametric technique
US6377914B1 (en) * 1999-03-12 2002-04-23 Comsat Corporation Efficient quantization of speech spectral amplitudes based on optimal interpolation technique

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Ingle et al, "DPCM System Design for Diversity Systems with Applications to Packetized Speech", IEEE Transactions on Speech and Audio Processing, vol. 3, No. 1, pp 48-58, publication of the IEEE Signal Processing Society.
Vaishampayan et al, "Speech Predictor Design for Diversity Communication Systems", Proceedings 1995 IEEE Workshop on Speech Coding for Telecommunications, Speech Coding for Interoperable Global Communications, Annapolis, Maryland, Sep. 20-22, 1995.
Vaishampayan, V.A., "Design of Multiple Description Scalar Quantizers", IEEE Transactions on Speech and Audio Processing, vol. 39, No. 3, pp 821-835, The Institute of Electrical and Electronics Engineers, Inc.New York, New York.

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080015856A1 (en) * 2000-09-14 2008-01-17 Cheng-Chieh Lee Method and apparatus for diversity control in mutiple description voice communication
US7412381B1 (en) * 2000-09-14 2008-08-12 Lucent Technologies Inc. Method and apparatus for diversity control in multiple description voice communication
US7756705B2 (en) * 2000-09-14 2010-07-13 Alcatel-Lucent Usa Inc. Method and apparatus for diversity control in multiple description voice communication
US20050229072A1 (en) * 2004-03-31 2005-10-13 Setton Ezra J E Multimedia content delivery using pre-stored multiple description coded video with restart
US7676722B2 (en) * 2004-03-31 2010-03-09 Sony Corporation Multimedia content delivery using pre-stored multiple description coded video with restart
US20060074643A1 (en) * 2004-09-22 2006-04-06 Samsung Electronics Co., Ltd. Apparatus and method of encoding/decoding voice for selecting quantization/dequantization using characteristics of synthesized voice
US8473284B2 (en) * 2004-09-22 2013-06-25 Samsung Electronics Co., Ltd. Apparatus and method of encoding/decoding voice for selecting quantization/dequantization using characteristics of synthesized voice
US20100027678A1 (en) * 2008-07-30 2010-02-04 Stmicroelectronics S.R.I. Encoding and decoding methods and apparatus, signal and computer program product therefor
US8488680B2 (en) 2008-07-30 2013-07-16 Stmicroelectronics S.R.L. Encoding and decoding methods and apparatus, signal and computer program product therefor
US11030524B2 (en) * 2017-04-28 2021-06-08 Sony Corporation Information processing device and information processing method

Similar Documents

Publication Publication Date Title
US5999656A (en) Overlapped reversible transforms for unified lossless/lossy compression
CN109451308B (en) Video compression processing method and device, electronic equipment and storage medium
US5909518A (en) System and method for performing wavelet-like and inverse wavelet-like transformations of digital data
US5845243A (en) Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of audio information
US5467086A (en) Apparatus and method of coding/decoding video data
Ho et al. Classified transform coding of images using vector quantization
US4281344A (en) Video interframe transform coding technique
JP4374448B2 (en) Multi-channel signal encoding method, decoding method thereof, apparatus, program and recording medium thereof
EP0369682B1 (en) Efficient coding method and its decoding method
JPH09507347A (en) Quad-tree WALSH conversion video / image coding
CN102007489A (en) Reduction of errors during computation of inverse discrete cosine transform
KR20060063533A (en) Method and apparatus for encoding/decoding multi-layer video using dct upsampling
JPH07284108A (en) Method for coding and decoding video signal
JP4163680B2 (en) Adaptive method and system for mapping parameter values to codeword indexes
Kuroki et al. Lossless image compression by two-dimensional linear prediction with variable coefficients
US6665646B1 (en) Predictive balanced multiple description coder for data compression
JP2509039B2 (en) Vector codebook quantizer
Brower Low-bit-rate image compression evaluations
CN102239640B (en) Encoding method and decoding method, and devices using methods
CN101156318A (en) Predictor
JP2856439B2 (en) Image signal encoding / decoding method
US5534930A (en) Method for constructing a quantization pattern codebook
CN100552669C (en) Be used for changed digital sound signal and the inverse transformation method and apparatus of changed digital sound signal
KR100216600B1 (en) Method of transmitting multi-code for vector quantizer of image signal
EP1359762A1 (en) Quantizer for scalable video coding

Legal Events

Date Code Title Description
AS Assignment

Owner name: AT&T CORPORATION, NEW YORK

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:JOHN, SAM;VAISHAMPAYAN, VINAY A.;REEL/FRAME:010480/0268

Effective date: 19991209

FPAY Fee payment

Year of fee payment: 4

REMI Maintenance fee reminder mailed
LAPS Lapse for failure to pay maintenance fees
STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20111216