WO1996026612A2 - Device and method for coding video pictures - Google Patents

Device and method for coding video pictures Download PDF

Info

Publication number
WO1996026612A2
WO1996026612A2 PCT/IB1996/000129 IB9600129W WO9626612A2 WO 1996026612 A2 WO1996026612 A2 WO 1996026612A2 IB 9600129 W IB9600129 W IB 9600129W WO 9626612 A2 WO9626612 A2 WO 9626612A2
Authority
WO
WIPO (PCT)
Prior art keywords
series
sub
coefficients
picture
step size
Prior art date
Application number
PCT/IB1996/000129
Other languages
French (fr)
Other versions
WO1996026612A3 (en
Inventor
Marcel Breeuwer
Reinier Bernardus Maria Klein Gunnewiek
Original Assignee
Philips Electronics N.V.
Philips Norden Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Philips Electronics N.V., Philips Norden Ab filed Critical Philips Electronics N.V.
Priority to JP8525528A priority Critical patent/JPH10500551A/en
Priority to EP96901475A priority patent/EP0758513B1/en
Priority to DE69603924T priority patent/DE69603924T2/en
Publication of WO1996026612A2 publication Critical patent/WO1996026612A2/en
Publication of WO1996026612A3 publication Critical patent/WO1996026612A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/18Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/115Selection of the code volume for a coding unit prior to coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • H04N19/126Details of normalisation or weighting functions, e.g. normalisation matrices or variable uniform quantisers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/149Data rate or code amount at the encoder output by estimating the code amount by means of a model, e.g. mathematical model or statistical model
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output

Definitions

  • the invention relates to a device and a method for coding and decoding video pictures.
  • the known device comprises a picture transformer for obtaining a series of coefficients which is representative of a picture block, a quantizer for quantizing the series of coefficients with a step size, and an encoder for coding the series of coefficients. Moreover, the known device comprises control means for controlling the step size in conformity with a target value for the number of bits per series of coefficients.
  • the device supplies a bit stream with a variable bitrate.
  • the known device comprises a buffer in which the bit stream is written at a variable bitrate and is read at the fixed bitrate.
  • the quantization step size is controlled with the aid of the control means in such a way that the buffer maintains a desired fullness.
  • the control means of the known device update the quantization step size per macroblock, i.e. one or more contiguous picture blocks.
  • a picture transform mode which is often used is Discrete Cosine Transform (DCT). This transform is performed with relatively small, contiguous picture blocks of, for example 8*8 pixels.
  • the Lapped Orthogonal Transform (LOT) is currently in the limelight due to the absence of block artefacts.
  • the picture blocks partly overlap each other, for example by 50% in both the horizontal and the vertical direction.
  • the LOT has appeared to be interesting.
  • the X-ray picture characteristics are different from conventional video pictures. The dimensions of the picture blocks are therefore considerably larger. X-ray pictures having a dimension of 512*512 pixels appear to be optimally coded at a block size of 64*64 pixels with an overlap of 50%. After transform, 2 this yields 256 blocks of 1024 coefficients each for each picture.
  • a problem of the known device is that with its step size control a fixed bitrate cannot be satisfactorily obtained per picture if the number of blocks is too small.
  • the average bitrate is constant over a large number of pictures, but the bitrate may fluctuate to an unwanted large extent per picture.
  • a normal video picture comprises several thousand picture blocks. With such a number, a fixed bitrate per picture can be obtained within a margin of, for example 0.5%.
  • the device according to the invention is characterized in that the device is provided with dividing means for dividing the series of coefficients into sub- series and for determining a sub-target value for the number of bits per sub-series, the control means being adapted to control the step size per sub-series in conformity with the corresponding sub-target value.
  • control means control the step size per sub- series, as if they were smaller picture blocks.
  • the actual picture block dimensions are, however, unmodified and may be optimally adapted to the desired transform.
  • the number of sub-series per picture is N times the number of picture blocks and can be freely chosen.
  • the value of N can be chosen to be such that a desired bitrate per picture can be obtained within a very small margin.
  • the variation of the step size within a picture is very small. Notably for medical picture sequences, this is an extremely important aspect because a large variation leads to an inhomogeneous picture quality.
  • the sub-target values within a picture block are preferably equally large.
  • the corresponding sub-series can be derived from a predetermined destination of the cumulative distribution of the bits among the coefficients of a picture block.
  • a sensible estimation of the cumulative distribution is obtained from a statistical analysis of a large number of pictures. Such an estimation is eminently suitable for coding the first picture of a picture sequence. For the further pictures of a sequence, the estimation can be derived from the coding results of already coded pictures from the picture sequence.
  • Fig. 1 shows a device for coding video pictures according to the invention.
  • Figs. 2 and 3 show embodiments of dividing the series of coefficients of a picture block into sub-series.
  • Fig. 4 shows a possible embodiment of a control circuit shown in Fig. 1.
  • Figs. 5 and 6 show embodiments of a circuit for the division into sub- series.
  • Fig. 7 shows a diagram to explain the circuit shown in Fig. 5.
  • Fig. 8 shows a possible division of the digital picture format of the coded signal.
  • Fig. 9 shows a device for decoding video pictures according to the invention.
  • Fig. 1 shows a device for coding video pictures in accordance with the invention.
  • the device comprises block formation means 1 in which each video picture is divided into picture blocks having a dimension of, for example 64 x 64 pixels overlapping each other by 50%.
  • the picture blocks are applied to a picture transformer 2.
  • the picture blocks are subjected to a Lapped Orthogonal Transform (LOT).
  • LOT Lapped Orthogonal Transform
  • An embodiment of such a picture transformer is described in United States Patent US 4,442,454. Non-overlapping picture transforms such as the Discrete Cosine Transform are, however, alternatively possible.
  • each picture block is transformed to a 2-dimensional block of 32*32 coefficients.
  • the 1024 coefficients of a block are read in known manner in a zigzag sequence and applied in the form of a 1 -dimensional series to a quantizer 3 which quantizes the coefficients.
  • the quantized coefficients are encoded to digital codewords in a variable-length coder 4 and applied to a transmission channel.
  • the quantizer 3 quantizes the series of coefficients with a given step size in dependence upon an applied quantization parameter s.
  • the step size may be the same for all coefficients of a series. In that case, s is the relevant step size.
  • the quantization step size may be dependent in a predetermined manner on the spatial picture frequency, hence on the location of the coefficient in the series. In that case, s is a scale factor. For the sake of simplicity, s will hereinafter be referred to as "step size".
  • the step size s is applied to the quantizer 3 by a bitrate control circuit 5.
  • This control circuit receives for each picture block k a predetermined target value R,(k) for the number of bits with which this picture block must be coded. Moreover, the control circuit receives the actually produced bits R(k) for each picture block k from the encoder 4. Dependent on R,(k) and R(k), the control circuit adapts the step size s per picture block. If more bits than the target value are produced, then the control circuit enlarges the step size. The coefficients are then quantized in a coarser way and the number of bits decreases. If fewer bits than the target value are produced, then the control circuit reduces the step size.
  • the target value R,(k) for the number of bits per picture block may be the same for all picture blocks of the picture. However, the target value preferably varies from block to block within a picture. In fact, some picture blocks have a substantially equal brightness, whereas other picture blocks have a high degree of complexity. Algorithms for determining the target value R,(k) per picture block are known per se. In the known device, said target value is obtained from a preanalysis of the current picture.
  • the series is divided into sub-series of equal length and the sub-target value is equal for each sub- series.
  • embodiments in which the sub-target value per sub-series is dependent on the spatial picture frequencies represented by a sub-series are more efficient.
  • Fig. 2 shows a diagram to explain a possible embodiment for dividing the series of coefficients into sub-series.
  • the reference numeral 20 denotes an estimation of the cumulative distribution of the number of bits which is produced per picture block by a variable-length encoder in practice.
  • the x axis indicates the series of 1024 coefficients, starting with the DC coefficient and ending with the least significant AC coefficient.
  • the value 1 along the y axis represents the total number of bits R(k) per picture block.
  • a sensible choice is to divide the target value into N equal sub-target values R (k)/N.
  • the sub-series thus obtained then have a distinctive series length L(n).
  • Fig. 3 shows a further embodiment.
  • the N sub-series have a predetermined (for example, equal) length L, and the sub-target value R,(k,n) for each sub-series is different.
  • a sub-series of coefficients will hereinafter be referred to as a segment.
  • a full image consisting of K picture blocks thus comprises K*N segments.
  • the control circuit 5 shown in Fig. 1 is arranged to adapt the step size per segment.
  • Fig. 4 shows the diagrammatical structure of an embodiment of the control circuit 5. It comprises a counter/latch 51 for counting the number of bits R(k,n) produced per segment n of picture block k by the encoder 4 (see Fig. 1). To this end, the counter/latch receives a latch signal L(n) which is indicative of the length of each segment. The counted number of bits is compared with a target value R,(k,n) by means of a subtracter circuit 52. The difference D(k,n) formed thereby is accumulated in an accumulator 53 consisting of an adder 531 and a register 532.
  • the accumulated difference B(k,n) is indicative of the extent to which the desired bitrate is achieved with the actual quantization step size. A positive value indicates that too many bits are produced on average and that the step size must be enlarged. A negative value indicates that too few bits are produced on average and that the step size must be reduced.
  • the step size is controlled in dependence upon the accumulated difference B(k,n) by means of a proportionally integrating (PI) control member 54.
  • This member has a proportional branch (constituted by a multiplier 541) and an integrating branch (constituted by an adder 542, a register 543 and a multiplier 544).
  • the PI control member comprises a summing device 545 in which the output of the proportional branch and the integrating branch are added to an initial estimation ⁇ y for the quantization step size.
  • the PI control member determines the step size s for the next segment as follows:
  • the initial estimation s e for the step size may be a fixed value which is obtained from a statistical analysis of a large number of pictures. It may also be the average step size with which the previous picture has been coded.
  • the sub-target value R,(k,n) for the number of bits per segment n of picture block k, as well as the latch signal L(n) for the counter/latch 51 and the registers 532 and 543 are generated by a segmentation circuit 50.
  • Fig. 5 shows a first embodiment of the segmentation circuit 50.
  • This embodiment is adapted to divide a series of coefficients into segments in accordance with Fig. 2.
  • the segmentation circuit receives a clock signal cl for each coefficient applied by picture transformer 2 to quantizer 3 (see Fig. 1).
  • the clock signal is applied to a coefficient counter 500 which counts down the number of coefficients (1..1024) per picture block.
  • the 6 count of this counter is applied to a memory 501 in which an estimation of the cumulative distribution of the bit cost per picture block (see Fig. 2) is stored.
  • the applied count represents a value of said cumulative distribution on the x axis.
  • memory 501 supplies the corresponding y value to a comparator 502.
  • the segmentation circuit receives the target value R ⁇ (k) for the number of bits per picture block. As already elucidated hereinbefore, this value may vary from block to block.
  • Fig. 6 shows a second embodiment of the segmentation circuit 50. This embodiment is adapted to divide a series of coefficients into segments in accordance with Fig. 3.
  • the segmentation circuit comprises the same coefficient counter 500 and memory 501 as the previous embodiment.
  • the clock signal cl is now also applied to a divider 504 which generates the latch signal L(n) always after 256 coefficients (N times per picture block).
  • the y value supplied by memory 501 is applied by means of the latch signal to a computing circuit 505. With reference to the actual and the previous y value, this computing circuit determines the fraction F(n) of the number of bits R,(k) which can be spent on coding of the actual segment. Said fraction F(n) is multiplied in a multiplier 506 by the target value R,(k) so as to obtain the sub-target value R,(k,n).
  • the memory 501 in Figs.
  • the memory is a RAM which is loaded by an analysis circuit, for example, at the start of a picture.
  • an analysis circuit is shown in Fig. 5 only and is denoted by 507.
  • Fig. 7 shows a diagram to illustrate an algorithm which is performed by the analysis circuit. In this Figure, the current cumulative bit cost distribution is denoted by 20.
  • the analysis circuit performs the following steps:
  • the device according to the invention controls the quantization step size per segment and the number of segments is a factor of N times larger than the number of picture blocks, an accurate bitrate control is possible.
  • the latch signal L(n) which is representative of the segment boundaries, and the step size s are combined in a multiplexer 6 with the bit stream produced by encoder 4.
  • Fig. 8 shows a possible division of the digital picture format thus obtained.
  • the picture signal further has a header 91 and a series of coded picture blocks 92.
  • the segment lengths L(n) applying to the picture are accommodated in the header.
  • Each coded picture block subsequently comprises, for each segment, the relevant step sizes s(n), the coded coefficients and an end-of-block code EOB.
  • Fig. 9 shows a device for decoding the picture signal.
  • the device comprises a demultiplexer 101 for splitting, on the one hand, the picture signal into variable- length codewords representing coefficients, and the step sizes s and segment lengths L(n) on the other hand.
  • the variable-length codewords are applied to a variable-length decoder 102 which applies the quantized coefficients to an inverse quantizer 103.
  • the inverse quantizer reproduces the series of coefficients in dependence upon the step size s in the segment defined by L(n).
  • the series of coefficients is subsequently retransformed by an inverse picture transformer 104 to a picture block in the pixel domain.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Algebra (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)

Abstract

The bitrate control of known video encoders is based on a control of the quantization step size per picture block. In transform coding of relatively large picture blocks, the video picture comprises a too small number of picture blocks to obtain a desired number of bits per picture. In conformity with the invention, the target value for the number of bits per picture block is divided into N sub-targets (Rt(k,n)). The series of coefficients of a picture block is divided into N segments (L(n)) with reference to an estimation for the cumulative distribution (20) of the bit cost per picture block. The quantization step size is subsequently adapted per segment to the corresponding sub-target. Since the number of segments is N times larger than the number of picture blocks, an accurate bitrate control is possible. If desired, the estimation for the cumulative distribution is adapted from picture to picture.

Description

Device and method for coding video pictures.
FIELD OF THE INVENTION
The invention relates to a device and a method for coding and decoding video pictures.
BACKGROUND OF THE INVENTION A known device for coding video pictures is described in "Hardware
Implementation of the Framestore and Data Rate Control for a Digital HDTV- VCR" presented at the HDTV Symposium in Japan, November 1992, The known device comprises a picture transformer for obtaining a series of coefficients which is representative of a picture block, a quantizer for quantizing the series of coefficients with a step size, and an encoder for coding the series of coefficients. Moreover, the known device comprises control means for controlling the step size in conformity with a target value for the number of bits per series of coefficients.
The device supplies a bit stream with a variable bitrate. To obtain a fixed bitrate per picture (or group of pictures), the known device comprises a buffer in which the bit stream is written at a variable bitrate and is read at the fixed bitrate. The quantization step size is controlled with the aid of the control means in such a way that the buffer maintains a desired fullness. The control means of the known device update the quantization step size per macroblock, i.e. one or more contiguous picture blocks.
A picture transform mode which is often used is Discrete Cosine Transform (DCT). This transform is performed with relatively small, contiguous picture blocks of, for example 8*8 pixels. The Lapped Orthogonal Transform (LOT) is currently in the limelight due to the absence of block artefacts. In this transform mode, the picture blocks partly overlap each other, for example by 50% in both the horizontal and the vertical direction. Notably for storage of X-ray angiographic pictures for medical applications, the LOT has appeared to be interesting. However, the X-ray picture characteristics are different from conventional video pictures. The dimensions of the picture blocks are therefore considerably larger. X-ray pictures having a dimension of 512*512 pixels appear to be optimally coded at a block size of 64*64 pixels with an overlap of 50%. After transform, 2 this yields 256 blocks of 1024 coefficients each for each picture.
A problem of the known device is that with its step size control a fixed bitrate cannot be satisfactorily obtained per picture if the number of blocks is too small. The average bitrate is constant over a large number of pictures, but the bitrate may fluctuate to an unwanted large extent per picture. For comparison: a normal video picture comprises several thousand picture blocks. With such a number, a fixed bitrate per picture can be obtained within a margin of, for example 0.5%.
OBJECT AND SUMMARY OF THE INVENTION It is an object of the invention to obviate the above-mentioned drawbacks.
To this end, the device according to the invention is characterized in that the device is provided with dividing means for dividing the series of coefficients into sub- series and for determining a sub-target value for the number of bits per sub-series, the control means being adapted to control the step size per sub-series in conformity with the corresponding sub-target value.
It is thereby achieved that the control means control the step size per sub- series, as if they were smaller picture blocks. The actual picture block dimensions are, however, unmodified and may be optimally adapted to the desired transform. The number of sub-series per picture is N times the number of picture blocks and can be freely chosen. The value of N can be chosen to be such that a desired bitrate per picture can be obtained within a very small margin. Moreover, it is achieved that the variation of the step size within a picture is very small. Notably for medical picture sequences, this is an extremely important aspect because a large variation leads to an inhomogeneous picture quality. The sub-target values within a picture block are preferably equally large.
The corresponding sub-series can be derived from a predetermined destination of the cumulative distribution of the bits among the coefficients of a picture block. A sensible estimation of the cumulative distribution is obtained from a statistical analysis of a large number of pictures. Such an estimation is eminently suitable for coding the first picture of a picture sequence. For the further pictures of a sequence, the estimation can be derived from the coding results of already coded pictures from the picture sequence.
BRIEF DESCRIPTION OF THE FIGURES
Fig. 1 shows a device for coding video pictures according to the invention.
Figs. 2 and 3 show embodiments of dividing the series of coefficients of a picture block into sub-series.
Fig. 4 shows a possible embodiment of a control circuit shown in Fig. 1. Figs. 5 and 6 show embodiments of a circuit for the division into sub- series.
Fig. 7 shows a diagram to explain the circuit shown in Fig. 5.
Fig. 8 shows a possible division of the digital picture format of the coded signal. Fig. 9 shows a device for decoding video pictures according to the invention.
DESCRIPTION OF EMBODIMENTS
Fig. 1 shows a device for coding video pictures in accordance with the invention. The device comprises block formation means 1 in which each video picture is divided into picture blocks having a dimension of, for example 64 x 64 pixels overlapping each other by 50%. The picture blocks are applied to a picture transformer 2. In this transformer the picture blocks are subjected to a Lapped Orthogonal Transform (LOT). An embodiment of such a picture transformer is described in United States Patent US 4,442,454. Non-overlapping picture transforms such as the Discrete Cosine Transform are, however, alternatively possible. In the LOT, each picture block is transformed to a 2-dimensional block of 32*32 coefficients. The 1024 coefficients of a block are read in known manner in a zigzag sequence and applied in the form of a 1 -dimensional series to a quantizer 3 which quantizes the coefficients. The quantized coefficients are encoded to digital codewords in a variable-length coder 4 and applied to a transmission channel.
The quantizer 3 quantizes the series of coefficients with a given step size in dependence upon an applied quantization parameter s. The step size may be the same for all coefficients of a series. In that case, s is the relevant step size. Alternatively, the quantization step size may be dependent in a predetermined manner on the spatial picture frequency, hence on the location of the coefficient in the series. In that case, s is a scale factor. For the sake of simplicity, s will hereinafter be referred to as "step size".
The step size s is applied to the quantizer 3 by a bitrate control circuit 5. This control circuit receives for each picture block k a predetermined target value R,(k) for the number of bits with which this picture block must be coded. Moreover, the control circuit receives the actually produced bits R(k) for each picture block k from the encoder 4. Dependent on R,(k) and R(k), the control circuit adapts the step size s per picture block. If more bits than the target value are produced, then the control circuit enlarges the step size. The coefficients are then quantized in a coarser way and the number of bits decreases. If fewer bits than the target value are produced, then the control circuit reduces the step size.
The target value R,(k) for the number of bits per picture block may be the same for all picture blocks of the picture. However, the target value preferably varies from block to block within a picture. In fact, some picture blocks have a substantially equal brightness, whereas other picture blocks have a high degree of complexity. Algorithms for determining the target value R,(k) per picture block are known per se. In the known device, said target value is obtained from a preanalysis of the current picture.
In conformity with the invention, the series of 1024 coefficients of a picture block k is split up into N (for example, 4) sub-series with a sub-target value R,(k,n), n = 1 ... N for the number of bits per sub-series. In an extremely simple embodiment, the series is divided into sub-series of equal length and the sub-target value is equal for each sub- series. However, embodiments in which the sub-target value per sub-series is dependent on the spatial picture frequencies represented by a sub-series are more efficient. Practice has proved that the need for bits decreases as the coefficients represent a higher spatial picture frequency. Fig. 2 shows a diagram to explain a possible embodiment for dividing the series of coefficients into sub-series. In this Figure, the reference numeral 20 denotes an estimation of the cumulative distribution of the number of bits which is produced per picture block by a variable-length encoder in practice. The x axis indicates the series of 1024 coefficients, starting with the DC coefficient and ending with the least significant AC coefficient. The value 1 along the y axis represents the total number of bits R(k) per picture block. In the example shown the target value R,(k) of the number of bits per picture block is divided into N predetermined sub-target values R,(k,n) (n = 1 ... N). A sensible choice is to divide the target value into N equal sub-target values R (k)/N. The sub-series thus obtained then have a distinctive series length L(n). Fig. 3 shows a further embodiment. In this embodiment, the N sub-series have a predetermined (for example, equal) length L, and the sub-target value R,(k,n) for each sub-series is different. A sub-series of coefficients will hereinafter be referred to as a segment. A full image consisting of K picture blocks thus comprises K*N segments.
The control circuit 5 shown in Fig. 1 is arranged to adapt the step size per segment. Fig. 4 shows the diagrammatical structure of an embodiment of the control circuit 5. It comprises a counter/latch 51 for counting the number of bits R(k,n) produced per segment n of picture block k by the encoder 4 (see Fig. 1). To this end, the counter/latch receives a latch signal L(n) which is indicative of the length of each segment. The counted number of bits is compared with a target value R,(k,n) by means of a subtracter circuit 52. The difference D(k,n) formed thereby is accumulated in an accumulator 53 consisting of an adder 531 and a register 532. The accumulated difference B(k,n) is indicative of the extent to which the desired bitrate is achieved with the actual quantization step size. A positive value indicates that too many bits are produced on average and that the step size must be enlarged. A negative value indicates that too few bits are produced on average and that the step size must be reduced.
The step size is controlled in dependence upon the accumulated difference B(k,n) by means of a proportionally integrating (PI) control member 54. This member has a proportional branch (constituted by a multiplier 541) and an integrating branch (constituted by an adder 542, a register 543 and a multiplier 544). Moreover, the PI control member comprises a summing device 545 in which the output of the proportional branch and the integrating branch are added to an initial estimation ϊy for the quantization step size. As is apparent from the Figure, the PI control member determines the step size s for the next segment as follows:
s = kι .B (k,n) + k . B i ,j ) + se
in which the summation ΣB(iJ) takes place for all already coded segments of the picture and in which k, and k2 are control constants. The initial estimation se for the step size may be a fixed value which is obtained from a statistical analysis of a large number of pictures. It may also be the average step size with which the previous picture has been coded. As is shown in Fig. 4, the sub-target value R,(k,n) for the number of bits per segment n of picture block k, as well as the latch signal L(n) for the counter/latch 51 and the registers 532 and 543 are generated by a segmentation circuit 50.
Fig. 5 shows a first embodiment of the segmentation circuit 50. This embodiment is adapted to divide a series of coefficients into segments in accordance with Fig. 2. The segmentation circuit receives a clock signal cl for each coefficient applied by picture transformer 2 to quantizer 3 (see Fig. 1). The clock signal is applied to a coefficient counter 500 which counts down the number of coefficients (1..1024) per picture block. The 6 count of this counter is applied to a memory 501 in which an estimation of the cumulative distribution of the bit cost per picture block (see Fig. 2) is stored. The applied count represents a value of said cumulative distribution on the x axis. In response to the applied value x, memory 501 supplies the corresponding y value to a comparator 502. As soon as the y value has predetermined values (0.25, 0.50, 0.75 and 1 for N = 4), said comparator supplies the latch signal L(n). Moreover, the segmentation circuit receives the target value R<(k) for the number of bits per picture block. As already elucidated hereinbefore, this value may vary from block to block. The target value R,(k) is divided by the number of segments N in a multiplier 503 so as to obtain a sub-target value R,(k,n) = R,(k)/N per segment. Fig. 6 shows a second embodiment of the segmentation circuit 50. This embodiment is adapted to divide a series of coefficients into segments in accordance with Fig. 3. The segmentation circuit comprises the same coefficient counter 500 and memory 501 as the previous embodiment. The clock signal cl is now also applied to a divider 504 which generates the latch signal L(n) always after 256 coefficients (N times per picture block). The y value supplied by memory 501 is applied by means of the latch signal to a computing circuit 505. With reference to the actual and the previous y value, this computing circuit determines the fraction F(n) of the number of bits R,(k) which can be spent on coding of the actual segment. Said fraction F(n) is multiplied in a multiplier 506 by the target value R,(k) so as to obtain the sub-target value R,(k,n). The memory 501 in Figs. 5 and 6 may be a ROM in which an estimation once determined for the cumulative distribution of the bit cost per picture block is stored. However, it is alternatively possible and sensible to regularly adapt the cumulative distribution of the bit cost per picture block to the statistics of the picture signal. In this case, the memory is a RAM which is loaded by an analysis circuit, for example, at the start of a picture. For the sake of simplicity, such an analysis circuit is shown in Fig. 5 only and is denoted by 507. Fig. 7 shows a diagram to illustrate an algorithm which is performed by the analysis circuit. In this Figure, the current cumulative bit cost distribution is denoted by 20. The analysis circuit performs the following steps:
1. Storing the produced number of bits R(k,n) for each segment n of each picture block, as well as the step size s(k,n) then used.
2. Determining, per segment, its complexity. The complexity of a segment is defined as: c (k ,n) = s (k ,n) .R (k ,n) 3. Determining the average complexity of the nth segments of the picture blocks of a picture:
c ( ) = ∑ C (*,JI)
*«1
4. Determining the average relative complexity of the segments within a picture block:
cr(n) = _£££l_ xioo%
N
∑ e(-n) n=l
5. Fixing four points PI ... P4 of a new cumulative bit cost distribution for the next picture with reference to the percentages cr(n) and the current segment boundaries Nl ... N4 (see Fig. 7). 6. Estimating the new cumulative distribution 21 by computing an interpolation curve through the points PI ... P4.
7. Loading the new division into memory 501 (see Figs. 5 and 6) at the start of the next picture.
Since the device according to the invention controls the quantization step size per segment and the number of segments is a factor of N times larger than the number of picture blocks, an accurate bitrate control is possible. Practical experiments have proved that an X-ray picture of 256*256 overlapping picture blocks of 64*64 pixels each can be coded sufficiently accurately in the desired number of bits with a division into N = 4 segments. Reverting to Fig. 1, it appears that the latch signal L(n), which is representative of the segment boundaries, and the step size s are combined in a multiplexer 6 with the bit stream produced by encoder 4. Fig. 8 shows a possible division of the digital picture format thus obtained. The picture signal further has a header 91 and a series of coded picture blocks 92. The segment lengths L(n) applying to the picture are accommodated in the header. Each coded picture block subsequently comprises, for each segment, the relevant step sizes s(n), the coded coefficients and an end-of-block code EOB.
Fig. 9 shows a device for decoding the picture signal. The device comprises a demultiplexer 101 for splitting, on the one hand, the picture signal into variable- length codewords representing coefficients, and the step sizes s and segment lengths L(n) on the other hand. The variable-length codewords are applied to a variable-length decoder 102 which applies the quantized coefficients to an inverse quantizer 103. The inverse quantizer reproduces the series of coefficients in dependence upon the step size s in the segment defined by L(n). The series of coefficients is subsequently retransformed by an inverse picture transformer 104 to a picture block in the pixel domain.

Claims

CLAIMS;
1. A device for coding video pictures, comprising a picture transformer (2) for obtaining a series of coefficients which is representative of a picture block, a quantizer (3) for quantizing the series of coefficients with a step size (s), an encoder (4) for coding the series of coefficients, and control means (5) for controlling the step size in conformity with a target value (R,(k)) for the number of bits per series of coefficients, characterized in that the device is provided with dividing means (50) for dividing the series of coefficients into sub- series and for determining a sub-target value (Rt(k,n)) for the number of bits per sub-series, the control means (5) being adapted to control the step size per sub-series in conformity with the corresponding sub- target value.
2. A device as claimed in Claim 1, characterized in that the dividing means are adapted to derive the relation between said sub-series and corresponding sub-target values from a predetermined estimation (501) of the cumulative distribution of the bits among the series of coefficients.
3. A device as claimed in Claim 1 or 2, characterized in that the dividing means are adapted (503) to divide the target value (R,(k)) into N predetermined sub-target values (R,(k)/N) and to determine the corresponding sub-series (L(n)).
4. A device as claimed in Claim 1 or 2, characterized in that the dividing means are adapted (504) to divide the series into N predetermined sub-series (L) and to determine the corresponding sub-target values (R,(k,n)).
5. A device as claimed in Claim 2, characterized in that the device comprises means (507) for adapting the estimation of the cumulative distribution of the bits among the series of coefficients to previous coding results.
6. A device for decoding a coded picture signal, comprising a decoder (102) for obtaining a series of quantized coefficients, an inverse quantizer (103) for reproducing transform coefficients in response to an applied step size (s), and a picture transformer (104) for transforming the reproduced coefficients into a picture block, characterized in that the device comprises means (101) for receiving a parameter (L(n)) indicating how the series of quantized coefficients is divided into sub-series, the inverse quantizer being adapted to reproduce the transform coefficients per sub-series with the step size for this sub-series.
7. A picture signal comprising a series of coded picture blocks (92) in the form of quantized coefficients and a step size with which the coefficients have been coded, characterized in that a signal accommodates a parameter L(n)) indicating how the series of quantized coefficients of a picture block is divided into sub-series, and in that the step size (s(n)) is accommodated in the signal for each sub-series.
8. A method of coding video pictures, comprising the steps of transforming a picture block into a series of coefficients, quantizing the series of coefficients with a step size, coding the series of coefficients and controlling the step size in conformity with a target value (R«(k)) for the number of bits per series of coefficients, characterized in that the series of coefficients is divided into sub-series with a sub-target value (R,(k,n) for the number of bits per sub-series, the step size per sub-series being controlled in conformity with the corresponding sub-target value.
9. A method as claimed in Claim 9, characterized in that the relation between said sub-series and corresponding sub-target values is derived from a predetermined destination of the cumulative distribution of the bits among the series of coefficients.
10. A method as claimed in Claim 9 or 10, characterized in that the target value (R,(k)) is divided into N predetermined sub-target values (R1(k)/N) and in that corresponding sub-series (L(n)) are computed.
11. A method as claimed in Claim 9 or 10, characterized in that the series is divided into N predetermined sub-series (L) and in that the corresponding sub-target values
(R,(k,n)) are computed.
12. A method as claimed in Claim 10, characterized in that the estimation of the cumulative distribution of the bits among the series of coefficients is adapted to previous coding results.
13. A method of decoding a coded picture signal, comprising the steps of obtaining a series of quantized coefficients, inverse quantizing for reproducing transform coefficients in response to an applied step size (s) and transforming the reproduced coefficients into a picture block, characterized in that a parameter (L(n)) is received which indicates how the series of quantized coefficients is divided into sub-series, and in that the transform coefficients per sub-series are reproduced with the step size for this sub-series.
PCT/IB1996/000129 1995-02-24 1996-02-15 Device and method for coding video pictures WO1996026612A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP8525528A JPH10500551A (en) 1995-02-24 1996-02-15 Video image coding apparatus and method
EP96901475A EP0758513B1 (en) 1995-02-24 1996-02-15 Device and method for coding video pictures
DE69603924T DE69603924T2 (en) 1995-02-24 1996-02-15 DEVICE AND METHOD FOR ENCODING VIDEO IMAGES

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP95200449.7 1995-02-24
EP95200449 1995-02-24

Publications (2)

Publication Number Publication Date
WO1996026612A2 true WO1996026612A2 (en) 1996-08-29
WO1996026612A3 WO1996026612A3 (en) 1996-11-14

Family

ID=8220046

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB1996/000129 WO1996026612A2 (en) 1995-02-24 1996-02-15 Device and method for coding video pictures

Country Status (5)

Country Link
US (1) US7257159B1 (en)
EP (1) EP0758513B1 (en)
JP (1) JPH10500551A (en)
DE (1) DE69603924T2 (en)
WO (1) WO1996026612A2 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0876060A1 (en) * 1996-11-08 1998-11-04 Matsushita Electric Industrial Co., Ltd. Image coder and image coding method, and image decoder and image decoding method, and quantization control method and inverse quantization control method, and data recording medium
WO1999007159A3 (en) * 1997-07-29 1999-05-14 Koninkl Philips Electronics Nv Variable bitrate video coding method and corresponding video coder
WO2006041994A1 (en) 2004-10-08 2006-04-20 Nvidia Corporation Methods and systems for rate control in image compression

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102249819B1 (en) * 2014-05-02 2021-05-10 삼성전자주식회사 System on chip and data processing system including the same

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0362789A (en) * 1989-07-31 1991-03-18 Matsushita Electric Ind Co Ltd Picture coder
JPH04111591A (en) * 1990-08-30 1992-04-13 Mitsubishi Electric Corp Picture orthogonal conversion vector quantization device
GB2251756A (en) * 1991-01-11 1992-07-15 Sony Broadcast & Communication Compression of composite colour video signals
EP0495501A2 (en) * 1991-01-17 1992-07-22 Sharp Kabushiki Kaisha Image coding system using an orthogonal transform and bit allocation method suitable therefor
EP0554871A2 (en) * 1992-02-04 1993-08-11 Sony Corporation Method and apparatus for encoding a digital image signal
EP0634866A1 (en) * 1993-06-30 1995-01-18 Victor Company Of Japan, Limited Digital video signal processing with separation of data sequences

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4442454A (en) 1982-11-15 1984-04-10 Eastman Kodak Company Image processing method using a block overlap transformation procedure
US5640208A (en) * 1991-06-27 1997-06-17 Sony Corporation Video signal encoding in accordance with stored parameters
US5144424A (en) * 1991-10-15 1992-09-01 Thomson Consumer Electronics, Inc. Apparatus for video data quantization control
US5283646A (en) * 1992-04-09 1994-02-01 Picturetel Corporation Quantizer control method and apparatus
US5426463A (en) * 1993-02-22 1995-06-20 Rca Thomson Licensing Corporation Apparatus for controlling quantizing in a video signal compressor
BE1007807A3 (en) * 1993-11-30 1995-10-24 Philips Electronics Nv Apparatus for encoding a video signal.

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0362789A (en) * 1989-07-31 1991-03-18 Matsushita Electric Ind Co Ltd Picture coder
JPH04111591A (en) * 1990-08-30 1992-04-13 Mitsubishi Electric Corp Picture orthogonal conversion vector quantization device
GB2251756A (en) * 1991-01-11 1992-07-15 Sony Broadcast & Communication Compression of composite colour video signals
EP0495501A2 (en) * 1991-01-17 1992-07-22 Sharp Kabushiki Kaisha Image coding system using an orthogonal transform and bit allocation method suitable therefor
EP0554871A2 (en) * 1992-02-04 1993-08-11 Sony Corporation Method and apparatus for encoding a digital image signal
EP0634866A1 (en) * 1993-06-30 1995-01-18 Victor Company Of Japan, Limited Digital video signal processing with separation of data sequences

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
PATENT ABSTRACTS OF JAPAN, Vol. 15, No. 219, E-1074; & JP,A,03 062 789, (MATSUSHITA ELECTRIC IND CO LTD), 18 March 1991. *
PATENT ABSTRACTS OF JAPAN, Vol. 16, No. 354, E-1242; & JP,A,04 111 591, (MITSUBISHI ELECTRIC CORP), 13 April 1992. *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0876060A1 (en) * 1996-11-08 1998-11-04 Matsushita Electric Industrial Co., Ltd. Image coder and image coding method, and image decoder and image decoding method, and quantization control method and inverse quantization control method, and data recording medium
EP0876060A4 (en) * 1996-11-08 2000-09-20 Matsushita Electric Ind Co Ltd Image coder and image coding method, and image decoder and image decoding method, and quantization control method and inverse quantization control method, and data recording medium
US6275530B1 (en) 1996-11-08 2001-08-14 Matsushita Electric Industrial Co., Ltd. Image coder and image coding method, and image decoder and image decoding method, and quantization control method and inverse quantization control method, and data recording medium
WO1999007159A3 (en) * 1997-07-29 1999-05-14 Koninkl Philips Electronics Nv Variable bitrate video coding method and corresponding video coder
US6205174B1 (en) * 1997-07-29 2001-03-20 U.S. Philips Corporation Variable bitrate video coding method and corresponding video coder
WO2006041994A1 (en) 2004-10-08 2006-04-20 Nvidia Corporation Methods and systems for rate control in image compression
EP1797520A2 (en) * 2004-10-08 2007-06-20 Nvidia Corporation Methods and systems for rate control in image compression
EP1797520A4 (en) * 2004-10-08 2009-05-27 Nvidia Corp Methods and systems for rate control in image compression
US7747095B2 (en) 2004-10-08 2010-06-29 Nvidia Corporation Methods and systems for rate control in image compression

Also Published As

Publication number Publication date
JPH10500551A (en) 1998-01-13
EP0758513B1 (en) 1999-08-25
WO1996026612A3 (en) 1996-11-14
US7257159B1 (en) 2007-08-14
EP0758513A1 (en) 1997-02-19
DE69603924D1 (en) 1999-09-30
DE69603924T2 (en) 2000-08-31

Similar Documents

Publication Publication Date Title
US5517583A (en) Image encoding method and apparatus
US6408027B2 (en) Apparatus and method for coding moving picture
KR0150955B1 (en) Compressive and extensive method of an image for bit-fixing and the apparatus thereof
JP4111351B2 (en) Apparatus and method for optimizing rate control in a coding system
EP1563688B1 (en) Method and apparatus for control of rate-distortion tradeoff by using lagrange multiplier and visual mask
EP0689360B1 (en) Variable transfer rate video coding apparatus and method
EP0519962B1 (en) Digital image coding using a random scanning of image frames
JP3864461B2 (en) Video data compression apparatus and method
US7065138B2 (en) Video signal quantizing apparatus and method thereof
JP2921358B2 (en) Image coding device
JP2003018603A (en) Method and device for encoding moving image
WO2005079077A1 (en) Encoding and decoding of video images based on a non-linear quantization
EP1377071A1 (en) Image processing device, image processing method, image processing program, and recording medium
EP0758513B1 (en) Device and method for coding video pictures
WO1991014339A1 (en) Digital image coding with quantization level computation
JPWO2002080575A1 (en) Image processing apparatus, image processing method, image processing program, and recording medium
EP1933569A2 (en) Method and apparatus for control of rate-distortion tradeoff by using lagrange multiplier and/or quantizer value
JPH07236139A (en) Data compressing device
JP3262114B2 (en) Video signal encoding device and video signal encoding method
JP4582710B2 (en) Video encoding device
KR100225348B1 (en) A quantizing coefficient count apparatus for decision of quantizing number
JPH06292180A (en) Image compression encoder
JP3282526B2 (en) Image coding device
JPH04208775A (en) Bit distribution encoding system
JPH07212757A (en) Picture compression coder

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): JP

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 1996901475

Country of ref document: EP

AK Designated states

Kind code of ref document: A3

Designated state(s): JP

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 1996901475

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 1996901475

Country of ref document: EP