WO2004025964A2

WO2004025964A2 - Manipulation of video compression

Info

Publication number: WO2004025964A2
Application number: PCT/GB2003/003991
Authority: WO
Inventors: Violet Snell; Michael James Knee
Original assignee: Snell & Wilcox Limited
Priority date: 2002-09-12
Filing date: 2003-09-12
Publication date: 2004-03-25
Also published as: GB0504586D0; GB2410852B; AU2003269142A8; DE10393211T5; WO2004025964A3; GB2393060B; GB2410852A; GB2393060A; GB0221146D0; AU2003269142A1

Abstract

A method of compressing a picture sequence in which a first set of pictures in the sequence, and a second set of pictures interleaved with the first set of pictures in the sequence are identified, and in which the first set of pictures are coded to a high coding quality and the second set of pictures are coded to a quality determined by the remaining bitrate resources. Downstream the first set of pictures can be extracted to produce a separate high quality picture sequence. The compression coding is preferably MPEG based, and control of the coding quality may be achieved by varying quantizer weighting matrix values in a linear combination of a flat matrix and a default non-flat matrix.

Description

MANIPULATION OF VIDEO COMPRESSION

This invention is directed to the compression of picture sequences, and in particular aspects to the manipulation of compression techniques in controlling the relative quality of pictures of an image sequence.

Methods for video sequence compression are well known to the art, and amongst these, techniques exist for varying the coding quality.

It is an object of the present invention to provide novel and flexible techniques for manipulating the compression of image sequences.

Accordingly, the invention consists in one aspect in a system for compressing a picture sequence, comprising: means for compressing the picture sequence to give a compressed sequence wherein a first set of pictures have a high coding quality, and wherein a second set of pictures, constituent pictures of which are intermediate pictures of the first set, have a lower coding quality; and downstream, means for separating the first set of pictures from the sequence, and playing the first set of pictures as a separate high quality picture sequence. In another aspect, the invention consists in a method of compressing a picture sequence in which a first set of pictures occurs in a regular pattern in the sequence, and pictures of a second set of pictures occur intermediate pictures of the first set of pictures, comprising coding the first set of pictures to a high coding quality, and coding the second set of pictures to quality determined by remaining bit-rate resources.

Suitably, the first set of pictures are coded independently of said second intermediate set of pictures, with all pictures preferably being intra coded. In this way the sequence is compressed with very little overall loss of quality. Advantageously, the compression algorithm is MPEG based. In one form of the invention, the bit-rate of the second intermediate set of pictures is controlled by varying quantizer weighting matrix values. Suitably, said quantizer weighting matrix values are a linear combination of the values of a flat weighting matrix and a default, non-flat matrix. Advantageously, a fixed number of bits are generated for the period covered by one picture of the first set of pictures and the associated intermediate pictures.

In yet another aspect, the invention provides a method of video signal compression, comprising the steps of receiving a video signal, performing a transform on the video signal, and quantizing the transformed signal, wherein the step of quantizing comprises applying a variable quantization matrix, the variation of which matrix controlled by a parameter relating to the compression bit rate.

Preferably, the parameter is dependent upon the number of bits required for a current picture. Suitably, the method comprises determining the number of bits required for a current picture according to a regular pattern.

In still another aspect, the invention provides a compressed signal comprising a first sequence of high quality pictures interleaved with a second sequence of lower quality pictures, the first sequence being separable from the signal to form a video signal, the second sequence of pictures being for use in editing or further processing of the video signal.

The invention will now be described by way of example with reference to the accompanying drawings, in which:

Figure 1 is a diagram illustrating apparatus according to an embodiment of one aspect of the invention;

Figure 2 to 6 illustrate picture sequences which can be derived from an input sequence according to aspects of the invention;

Figure 7 is a diagram illustrating the function of a method according to an embodiment of another aspect of the invention.

Figure 1 shows a high definition 72 Hz image capture device 102 which provides an input video signal to a compression stage 104. Here the video signal is compressed according to a control signal 106 which identifies key frames that are to be coded to a higher quality than that of other frames. In Figure 1 , key frames are identified every third frame. The compressed output is then transmitted to be received by a decompression stage 108, and subsequently a further processing stage 110. These act to select and decompress only the key frames and output these frames as a 24 Hz high definition video signal. Stages 108 and 110 can optionally output an additional 72Hz signal as will be described in relation to Figures 2 to 6.

The particular example of use of the invention described hereafter concerns the capture of high-frame-rate, high-definition image sequences in an environment where there are limitations in sampling, storage or transmission bandwidth or in processing complexity. It describes a method of compressing high-frame-rate, high-definition image sequences with very little loss of quality. The method ensures that the highest possible quality is retained for a temporally subsampled version of the input sequence, while allowing some loss of quality for intermediate frames according to the available coded bit-rate.

The invention also concerns the use of picture signals having a high quality at a low frame rate together with those having a lower quality at a higher frame rate, as a means for reducing the overall sampling, storage or transmission bandwidth of the picture information, as will be described. In the prior art, the different quality of the two kinds of image is expressed in terms of resolution. Thus, for example in German patent DE-A-41 32 359, a high-definition 25 Hz sequence is multiplexed with a standard-definition 100 Hz sequence. Motion information may be derived from both sequences and used for motion- compensated conversion of the high-definition sequence to the higher frame rate or indeed to any other frame rate.

The prior art does not necessarily deliver the required combination of the highest possible quality at the low frame rate together with an optimum trade-off between quality, bit-rate and processing complexity for the higher-frame-rate version of the sequence. For example, the method described in DE-A-41 32 359 relies on motion compensated up-conversion to restore the full resolution of the image at the higher frame rate. Figure 2 illustrates one embodiment of the invention in use, in which we begin with a high-definition (e.g. 1920 pixels by 1080 lines) picture signal with a frame rate of 72 Hz and with a 4:2:2 luminance and colour-difference sampling structure (represented by input sequence of frames 202) as used in the most common digital television sampling standards such as SMPTE 274M. With 8-bit sampling, the bit rate of such a signal is about 2.4 Gbit s. Let us suppose that at some point in the subsequent processing chain there is a bit-rate limit of, say, 500 Mbit/s. We require a method of compressing the signal into the output bit- rate with the following properties:

-A low-frame-rate (say 24 Hz) version of the input signal may be independently decoded with zero or negligible loss of quality

-The remaining input frames are transmitted with some quality loss, which should nevertheless be minimized. The decoding of these frames may be allowed to depend upon that of the 24 Hz frames, because subsequent processing of those frames may also depend on the 24 Hz frames.

Input sequence 202 is therefore coded to produce coded sequence 204. "Key" frames (every third) are coded to a high quality as indicated by solid arrows 206 (and by the resulting coded frames having darker shading) whereas intermediate frames are coded to a lesser quality as indicated by dashed arrows 208 (and resulting lighter shaded coded frames). Advantageously, the frames which are in between the "key" frames need not be sub-sampled - they are simply coded at a lower quality. The method also preserves the key frames to that degree of quality required at the output. For example, here the output requires a 24Hz high definition signal. Typically, the total bit rate assigned to the remaining, intermediate frames will depend on that required for the high quality frames, and the remaining bit rate allowed by the transmission channel used. Preferably, these intermediate frames are not be coded in dependence, in terms of picture content, upon the key frames, in the sense that normal P and B frames of MPEG are dependent upon prediction from nearby I frames. In this way the coded sequence preferably consists entirely of Intra coded frames (ie frames coded without reference to any other frames in the sequence), however the quality of these Intra coded frames differs. In certain cases though, it may be necessary to use prediction coding for the intermediate frames, for example where a particular bit rate constraint demands.

Any of a number of known compression techniques could be used to perform the actual compression, once the key frames are chosen and the quality level set, for example an established compression scheme such as MPEG-2 or JPEG could be used. In this case, there would be a small but possibly significant loss of quality to all input frames, not just the intermediates.

A particular embodiment makes use of MPEG-2 for the compression of the source picture sequence using a novel method for ensuring that the highest possible quality is retained for the low-frame-rate signal, which in this example is every third frame. This can be done by encoding this "key frame" as an l-frame with a very fine quantizer and a flat quantizer weighting matrix. The remaining bits are allocated to the other two "intermediate" frames, which are also coded as l-frames. As mentioned above however, in certain cases it may be necessary to code these frames as B-frames with or without motion compensation. Returning to Figure 2, downstream of the coding process, the key frames are separated from the intermediate frames, to produce high quality 24Hz sequence 210.

Turning now to figures 3 and 4, different methods of producing a 72 Hz browse track to accompany the key frame sequence are described. In Figure 3, intermediate frames are decoded from sequence 204 in a standard fashion as indicated by the solid arrows 212, while the key frames are decoded and downconverted in an appropriate fashion, as indicated by dashed arrows 214, to provide every third frame in the browse track 216. Alternatively Figure 4 illustrates an alternative way of creating a 72 Hz browse track. Here, only intermediate frames need be extracted from sequence 204, as shown by sequence 218. This essentially provides the desired browse track but with "holes" at each third frame. In this particular example the "holes" are filled by interpolating (possibly using motion compensation) from neighbouring intermediate frames, as illustrated by arrows 220, to produce a compete browse track 222. Figures 5 and 6 illustrate further advantages of the present invention.

Figure 5 shows a first method for producing an output sequence at a frequency which is not an integer multiple of the capture rate. Here the output sequence 504 is derived by interpolating the input sequence 502 which has been coded according to the invention. Output frames are interpolated from the frames which are temporally closest to the respective output frame as indicated by solid arrows 514. It can be seen that for some output frames this will involve one key frame and one intermediate frame (for example at frame 506) while in other cases this will be two intermediate frames (eg frame 508). Output frames such as 506 will still obtain a quality benefit from one key frame (in this case frame 510). While the same is not automatically true for frames such as 508, use of the key frames can still be made, by adding texture to the output frame from key frames which are temporally close to the output frame. This is indicated by dashed arrows 512.

Figure 6 shows an alternative method for producing an output sequence at a frequency which is not an integer multiple of the capture rate. Here only the key frames 610 of the input (which has been coded according to the invention) are used and the output is interpolated from these key frames using conventional methods as illustrated by arrows 612. In a variation on the method illustrated in Figure 6, the intermediate frames are employed, for the purpose of determining motion vectors to be used in the interpolation of the key frames. In many circumstances it will be necessary to reduce the coding quality of these intermediate frames in order to meet the bit-rate target. The conventional way to control the bit-rate in MPEG-2 coding is to vary the quantizer scale parameter. Unfortunately, at high quality levels the control available is very coarse. For example, changing the quantizer scale from 1 to 2 will quadruple the quantizing noise power and will typically reduce the number of bits generated for a given area of the picture by 40%. In the present invention, finer control of the bit rate and coding quality is achieved by modifying the quantizer weighting matrix. There now follows an example of how such a modified matrix may be derived, in the case of intra-frame coding:

Let f be the value of the elements of a flat quantizer weighting matrix. Let Wjj be the elements of a typical quantizer weighting matrix designed to balance the visibility of quantizing noise across the spectrum of the picture (an example being the MPEG-2 default intra matrix). Let a be a bit-rate control parameter.

Then the values ι/_/7 of the matrix actually used for the frame being coded are given by ^ui_j = { - )f + αw_ij

Thus, the variation of the control parameter implements a cross-fade between a flat matrix and a subjectively lossless one. A value α - 0 corresponds to the flat matrix and α - 1 to the default matrix. A value of α between 0 and 1 gives a matrix which leads to subjectively lossless coding in normal viewing conditions, while a value α > 1 may lead to some subjective loss. Figure 7 illustrates how the weighting matrix may vary according to the control parameter. In the graph, a one-dimensional section of the matrix is plotted for simplicity.

For a given intermediate frame, the value of the control parameter α can be calculated using an existing model of the relationship between α and the number of bits generated. The number of bits actually used can serve in calculating a correction to the model for subsequent intermediate frames. The bit-rate control may be run using a first-in first-out buffer as is well known. Alternatively, the control may aim to generate a fixed number of bits for each group of three frames, so that the 24 Hz picture may be extracted more easily. For both the key frames and the intermediate frames, well-known fallback methods of bit-rate control may be employed to ensure that the bit-rate target is met and that the coding quality is acceptable both for the key frames and for the intermediate frames.

It will be appreciated by those skilled in the art that the invention has been described by way of example only, and that a wide variety of alternative approaches may be adopted.

Claims

1. A system for compressing a picture sequence, comprising: means for compressing the picture sequence to give a compressed sequence wherein a first set of pictures have a high coding quality, and wherein a second set of pictures, constituent pictures of which are intermediate pictures of the first set, have a lower coding quality; and downstream, means for separating the first set of pictures from the sequence, and playing the first set of pictures as a separate high quality picture sequence.

2. A system according to Claim 1 , in which the first set of pictures is coded independently of said second intermediate set of pictures.

3. A system according to Claim 1 or Claim 2, in which the compression algorithm used is MPEG based.

4. A system according to any of the Claims 1 to 3, in which the quality of coding of the second intermediate set of pictures is determined by remaining bit-rate resources.

5. A system according to Claim 4, in which the bit-rate of the second intermediate set of pictures is controlled by varying quantizer weighting matrix values.

6. A system according to Claim 5, in which said quantizer weighting matrix values are a linear combination of the values of a flat weighting matrix and a default, non-flat matrix.

7. A system according to any previous claim, in which a fixed number of bits are generated for the period covered by one picture of the first set of pictures and the associated intermediate pictures.

8. A system according to any previous claim, further including means for outputting a picture sequence, having a higher frame rate but of a lower quality than said separate high quality picture sequence.

9. A method of compressing a picture sequence in which a first set of pictures occurs in a regular pattern in the sequence, and pictures of a second set of pictures occur intermediate pictures of the first set of pictures, comprising coding the first set of pictures to a high coding quality, and coding the second set of pictures to a quality determined by remaining bit-rate resources.

10. A method according to Claim 9, in which the first set of pictures is coded independently of said second intermediate set of pictures.

11. A method according to Claim 9 or Claim 10, in which all of said pictures are intra coded.

12. A method according to any one of Claims 9 to 11 in which the compression algorithm is MPEG based.

13. A method according to any one of Claims 9 to 12, in which the bit- rate of the second intermediate set of pictures is controlled by varying quantizer weighting matrix values.

14. A method according to Claim 13, in which said quantizer weighting matrix values are a linear combination of the values of a flat weighting matrix and a default, non-flat matrix.

15. A method according to any one of Claims 9 to 14, in which a fixed number of bits are generated for the period covered by one picture of the first set of pictures and the associated intermediate pictures.

16. Video processing apparatus for compressing a picture sequence comprising: means for dividing the picture sequence into sets, each set containing a key picture and one or more intermediate pictures; a first coder for compressing said key pictures at a first coding quality; means for determining bit rate resources available for the intermediate pictures; and a second coder for compressing the intermediate pictures at a second coding quality dependent on the available bitrate resources, said second coding quality being lower than the first coding quality.

17. Video processing apparatus according to Claim 16, wherein said first and second coders operate independently.

18. Video processing apparatus according to Claims 16 or 17, in which all pictures are compressed using intra coding.

19. Video processing apparatus according to any one of Claims 16 to 18, wherein the remaining bit rate resources are determined such that a constant number of bits are generated for each set of pictures.

20. A method of video signal compression, comprising the steps of receiving a video signal, performing a transform on the video signal, and quantizing the transformed signal, wherein the step of quantizing comprises applying a variable quantization matrix, the variation of which matrix controlled by a parameter relating to the compression bit rate.

21. A method according to Claim 20, wherein the parameter is dependent upon the number of bits required for a current picture.

22. A method according to Claim 20 or Claim 21 , comprising determining the number of bits required for a current picture according to a regular pattern.

23. A method according to any one of claims 20 to 22, wherein said quantization matrix is a linear combination of a flat weighting matrix and a default, non-flat matrix.

24. A compressed signal comprising a first sequence of high quality pictures interleaved with a second sequence of lower quality pictures, the first sequence being separable from the signal to form a video signal, the second sequence of pictures being for use in editing or further processing of the video signal.