WO2005117449A1 - Method and device for encoding digital video data - Google Patents

Method and device for encoding digital video data Download PDF

Info

Publication number
WO2005117449A1
WO2005117449A1 PCT/IB2005/051651 IB2005051651W WO2005117449A1 WO 2005117449 A1 WO2005117449 A1 WO 2005117449A1 IB 2005051651 W IB2005051651 W IB 2005051651W WO 2005117449 A1 WO2005117449 A1 WO 2005117449A1
Authority
WO
WIPO (PCT)
Prior art keywords
pictures
picture
intra
encoded
called
Prior art date
Application number
PCT/IB2005/051651
Other languages
French (fr)
Inventor
Stéphane VALENTE
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V. filed Critical Koninklijke Philips Electronics N.V.
Priority to EP05748193A priority Critical patent/EP1754378A1/en
Priority to US11/597,534 priority patent/US20070230574A1/en
Priority to KR1020067027023A priority patent/KR20070033364A/en
Priority to JP2007514255A priority patent/JP2008500760A/en
Publication of WO2005117449A1 publication Critical patent/WO2005117449A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/86Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/107Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/142Detection of scene cut or scene change
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding

Definitions

  • the present invention generally relates to the field of digital video compression and, for instance, to the video coding standards of the ISO/MPEG family (MPEG-1, MPEG-2, MPEG-4) and to the video recommendations of the ITU-H.26X family (H.261, H.263 and extensions, H.264).
  • the invention relates to a method of encoding digital video data corresponding to an original sequence of images and available in the form of a video stream consisting of successive pictures which are either INTRA pictures, called I- pictures and encoded by means of a so-called INTRA mode without any reference to any past or future picture, or INTER pictures that are themselves either monodirectionally predicted pictures, called P-pictures and encoded with reference to a past or future reference picture which is an INTRA or INTER picture, or bidirectionally predicted pictures, called B-pictures and encoded with reference to one or more reference picture(s), said INTRA pictures themselves comprising either I-pictures placed at the beginning of a new group of pictures corresponding to a scene change, where no temporal redundancy is available, and called scene change I-pictures, or I-pictures placed in other locations, where some temporal redundancy is available, and called refresh pictures.
  • the invention also relates to a corresponding encoding device.
  • the INTRA mode the luminance and chrominance channels are encoded by exploiting the spatial redundancy of the pixels in a given channel of a single image via transform coding.
  • the INTER mode exploiting the temporal redundancy between separate images, relies on a motion-compensation technique that predicts an image from one (or more) previously decoded image(s) by encoding the motion of pixels from one image to the other.
  • an image to be encoded is partitioned into independent blocks, each of them being assigned one or several motion vectors.
  • a prediction of the image is constructed by displacing pixel blocks from the reference image(s) according to the set of motion vectors (luminance and chrominance channels share the same motion description).
  • the difference, called the residual signal, between the image to be encoded and its motion-compensated prediction is encoded like in the INTRA mode by transform coding to further refine the decoded image.
  • the INTRA mode corresponds to I pictures or slices (a slice is a group of consecutive macroblocks), while the INTER mode corresponds to P and B pictures or slices.
  • the coding efficiency of the INTER mode is much higher than the one of the INTRA mode, because it takes advantage of temporal prediction : much of the signal is contained in the prediction formed by motion compensation, and the residual signal has a smaller energy than the original signal. Because their encoding relies only on their own spatial redundancy, INTRA pictures can be decoded independently from any other pictures (which is not the case for INTER pictures). In spite of their lower coding efficiency, INTRA pictures are therefore inserted periodically in a bitstream to make random access points, begin new GOPs (Group of Pictures), or erase drifts between encoders/decoders (decoding errors due to channel losses or encoder/decoder implementation mismatches).
  • GOPs Group of Pictures
  • the INTRA pictures that are placed at locations where an INTER picture would have been more efficient will be called "refresh" pictures.
  • INTRA pictures can also be advantageously placed at scene-cuts, where no temporal redundancy will help the encoding.
  • this invention specifically focuses on refresh INTRA pictures, not scene-cuts.
  • the problem overcome below by the invention is the following one : INTRA and INTER pictures exhibit different coding artefacts, since the underlying encoding method is different. Throughout an homogeneous video sequence, the quality and artefacts of subsequent INTER pictures tend to stabilize. However, if an INTRA picture would have been more efficient (in other words, the INTRA pictures at locations where a lot of temporal redundancy is available, not at scene changes) will be called "refresh" pictures.
  • INTRA pictures can also be advantageously placed at scene-cuts, where no temporal redundancy will help the encoding.
  • this invention specifically focuses on refresh INTRA pictures, not scene-cuts.
  • INTRA refresh frame is encoded, all preceding artefacts, due to INTER coding, are erased, and new ones, due to INTRA coding, are introduced abruptly.
  • Video quality is therefore disruptive at refresh frames, resulting in what is here called a flashing effect, especially visible in low motion sequences and at moderate or low bitrates (when coding artefacts become quite noticeable).
  • This flashing effect exists for all the MPEG family, but it is amplified by the latest standard, MPEG-4 part 10 (H.264), which uses a deblocking filter. In homogeneous and stable regions of INTER pictures, the deblocking filter has a very low impact, which results in almost unfiltered reconstruction, because little residual signal has to be encoded.
  • the deblocking is activated again, because the residual signal has a much larger energy. Hence activating the deblocking filter increases even more the visual gap between INTRA (filtered) and INTER (unfiltered) pictures. The flashing effect is therefore made worse by the deblocking filter adaptive action.
  • the invention relates to an encoding method such as described in the introductory part of the description and which is moreover characterized in that, before being quantized and encoded in INTRA mode, said INTRA refresh pictures arc replaced by an INTER picture having quality and artefacts substantially similar to those of the last encoded INTER picture(s).
  • This technical solution is efficient in reducing or cancelling the flashing effect, since INTRA refresh frames are not directly encoded for original pictures, as it is generally the case, but from so-called "fake" pictures generated for replacing the refresh pictures.
  • the invention relates to an encoding device provided for encoding digital video data corresponding to an original sequence of images and available in the form of a video stream consisting of successive pictures which are either INTRA pictures, called I-pictures and encoded by means of a so-called INTRA mode without any reference to any past or future picture, or INTER pictures, that are themselves either monodirectionally predicted pictures, called P-pictures and encoded with reference to a past or future reference picture which is an INTRA or INTER picture, or bidirectionally predicted pictures, called B-pictures and encoded with reference to one or more reference picture(s), said INTRA pictures themselves comprising either I-pictures placed at the beginning of a new group of pictures corresponding to a scene change, where no temporal redundancy is available, and called scene change I-pictures, or I-pictures placed in other locations, where some temporal redundancy is available, and called refresh pictures, said encoding device, intended to generate an output coded bitstream, comprising at least a quantizing and coding branch, which
  • said controlling branch controlling, when the I picture to be encoded is a refresh picture, the implementation of the following steps : (a) the concerned INTRA refresh picture is encoded as an INTER picture, similarly to the encoding step of the previous INTER picture(s) of the sequence, no corresponding output bits being however sent into the output coded bitstream ; (b) the temporally predicted picture corresponding to the encoded INTRA refresh picture thus obtained is reconstructed ; (c) the reconstructed picture thus obtained is encoded in INTRA mode, the corresponding output bits being now sent into said output coded bitstream.
  • FIG.l An example of conventional coding system is illustrated in Fig.l.
  • said system the input of which receives an input video sequence 10, comprises a motion estimation module 11, a motion compensation module 12, a rate control module 13, a DCT module 14, a quantization (Q) module 15, a variable length coding (VLC) module 16, a buffer 17 (at the output of which a coded bitstream 22 that has to be stored or transmitted to the decoding side is available), an inverse quantization (Q "1 ) module 18, an inverse DCT (DCT 1 ) transform module 19, a subtractor 20, an adder 21 and a frame memory 23 (although this system comprises a plurality of modules, those skilled in the art will realize that the functions performed by the various modules are not required to be isolated into separate modules as shown in FIG.l, and that, for example, the set of modules comprising the motion compensation module 12, the inverse quantization module 18 and the inverse DCT module 19 can be implemented as an "embedded decoder").
  • a deblocking filter (shown in Fig 1, with the reference 24) may be provided between the output of the adder 21 and the input of the frame memory 23.
  • the input pictures are, as previously said, of I, P or B type.
  • the first step of the coding method according to the invention will now be described.
  • an INTRA refresh picture that should be encoded in INTRA mode
  • said picture is in fact not coded in INTRA mode, but as a P picture, similarly to the last real picture, in order to obtain a fake reconstructed picture having substantially the same quality and artefacts as other temporally predicted pictures (or very similar ones).
  • the decoder would expect an INTRA picture.
  • the following steps of the coding method are then the following one.
  • the temporally predicted picture corresponding to the encoded refresh picture thus obtained is reconstructed in the prediction branch comprising the modules 12, 18 and 19.
  • the reconstructed picture thus obtained which now includes temporal defects similar to those of the previous P-pictures, takes the place of the INTRA refresh picture and is encoded in the INTRA mode. This time, the encoded bits thus generated are output to the coded bitstream.
  • the advantage of the invention may be observed at the decoding side.
  • the decoder does not know that it is decoding a fake picture when it begins decoding a new GOP, but the visual quality of the refresh picture remains equal to other pictures and does not look like an INTRA picture, as if it had not been refreshed.
  • the method and device according to the present invention are not limited to the above- indicated implementation, and that other embodiments may be proposed.
  • the encoder will reproduce (as planned in accordance with the principle of the invention) the temporal artefacts, but it is preferable not to introduce visible spatial artefacts. It is consequently proposed to use significantly lower quantization levels than for the other INTER pictures. It is thus possible to minimize spatial artefacts.
  • the method described above is applied only to a limited category of INTRA pictures (the INTRA refresh pictures). It can also be applied to all INTRA pictures, but, in this case, if lower quantization levels have been moreover used, it will then be advantageous to disable the method at scene cuts, since the encoding of the fake picture costs more bits (due to the lower quantization levels). Also in order not to introduce visible spatial artefacts, when the encoder is an H.264 encoder including a deblocking filter in its decoding loop (it is a normative part of H.264, since the encoding and decoding devices then perform the same filtering to avoid drift effects), said deblocking filter is disabled for the INTRA refresh pictures.
  • the prediction step is more accurate and the residual signal obtained at the output of the subtractor 20 (by difference between the input signal and the predicted one) is smaller, i.e. the compression factor is improved.
  • the image reconstruction at the decoding side is then performed with a higher quality.
  • the deblocking filter 24 may be present, or not, in the prediction branch. It can be added here that there are numerous ways of implementing functions by means of items of hardware or software, or both.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention relates to a method and device for encoding digital video data corresponding to an original sequence of images and available in the form of a video stream consisting of successive pictures which are either INTRA pictures or INTER pictures. The INTRA pictures are called I-pictures and encoded by means of a so-called INTRA mode without any reference to any past or future picture. The INTER pictures are either monodirectionally predicted pictures, called P-pictures and encoded with reference to a past or future reference picture which is an INTRA or INTER picture, or bidirectionally predicted pictures, called B-pictures and encoded with reference to one or more reference picture(s). The INTRA pictures themselves comprise either I-pictures placed at the beginning of a new group of pictures corresponding to a scene change, where no temporal redundancy is available, and called scene change I-pictures, or I-pictures placed in other locations, where some temporal redundancy is available, and called refresh pictures. According to the invention, the method is characterized in that, before being quantized and encoded in INTRA mode, said INTRA refresh pictures are replaced by an INTER picture having quality and artefacts substantially similar to those of the last encoded INTER picture(s).

Description

"METHOD AND DEVICE FOR ENCODING DIGITAL VIDEO DATA"
FIELD OF THE INVENTION The present invention generally relates to the field of digital video compression and, for instance, to the video coding standards of the ISO/MPEG family (MPEG-1, MPEG-2, MPEG-4) and to the video recommendations of the ITU-H.26X family (H.261, H.263 and extensions, H.264). More precisely, it relates to a method of encoding digital video data corresponding to an original sequence of images and available in the form of a video stream consisting of successive pictures which are either INTRA pictures, called I- pictures and encoded by means of a so-called INTRA mode without any reference to any past or future picture, or INTER pictures that are themselves either monodirectionally predicted pictures, called P-pictures and encoded with reference to a past or future reference picture which is an INTRA or INTER picture, or bidirectionally predicted pictures, called B-pictures and encoded with reference to one or more reference picture(s), said INTRA pictures themselves comprising either I-pictures placed at the beginning of a new group of pictures corresponding to a scene change, where no temporal redundancy is available, and called scene change I-pictures, or I-pictures placed in other locations, where some temporal redundancy is available, and called refresh pictures. The invention also relates to a corresponding encoding device.
BACKGROUND OF THE INVENTION In modern digital video coding systems, two main modes are used to compress video signals : the INTRA mode and the INTER mode. In the INTRA mode, the luminance and chrominance channels are encoded by exploiting the spatial redundancy of the pixels in a given channel of a single image via transform coding. The INTER mode, exploiting the temporal redundancy between separate images, relies on a motion-compensation technique that predicts an image from one (or more) previously decoded image(s) by encoding the motion of pixels from one image to the other. Usually, an image to be encoded is partitioned into independent blocks, each of them being assigned one or several motion vectors. A prediction of the image is constructed by displacing pixel blocks from the reference image(s) according to the set of motion vectors (luminance and chrominance channels share the same motion description). Finally, the difference, called the residual signal, between the image to be encoded and its motion-compensated prediction is encoded like in the INTRA mode by transform coding to further refine the decoded image. In MPEG terminology, recalled for example in a document such as "MPEG video coding : a basic tutorial introduction", by S.R. Ely, Report BBC RD 1996/3, the INTRA mode corresponds to I pictures or slices (a slice is a group of consecutive macroblocks), while the INTER mode corresponds to P and B pictures or slices. The coding efficiency of the INTER mode is much higher than the one of the INTRA mode, because it takes advantage of temporal prediction : much of the signal is contained in the prediction formed by motion compensation, and the residual signal has a smaller energy than the original signal. Because their encoding relies only on their own spatial redundancy, INTRA pictures can be decoded independently from any other pictures (which is not the case for INTER pictures). In spite of their lower coding efficiency, INTRA pictures are therefore inserted periodically in a bitstream to make random access points, begin new GOPs (Group of Pictures), or erase drifts between encoders/decoders (decoding errors due to channel losses or encoder/decoder implementation mismatches). In the present patent application, the INTRA pictures that are placed at locations where an INTER picture would have been more efficient (in other words, the INTRA pictures at locations where a lot of temporal redundancy is available, not at scene changes) will be called "refresh" pictures. INTRA pictures can also be advantageously placed at scene-cuts, where no temporal redundancy will help the encoding. However, this invention specifically focuses on refresh INTRA pictures, not scene-cuts. The problem overcome below by the invention is the following one : INTRA and INTER pictures exhibit different coding artefacts, since the underlying encoding method is different. Throughout an homogeneous video sequence, the quality and artefacts of subsequent INTER pictures tend to stabilize. However, if an
INTRA refresh frame is encoded, all preceding artefacts, due to INTER coding, are erased, and new ones, due to INTRA coding, are introduced abruptly. Video quality is therefore disruptive at refresh frames, resulting in what is here called a flashing effect, especially visible in low motion sequences and at moderate or low bitrates (when coding artefacts become quite noticeable). This flashing effect exists for all the MPEG family, but it is amplified by the latest standard, MPEG-4 part 10 (H.264), which uses a deblocking filter. In homogeneous and stable regions of INTER pictures, the deblocking filter has a very low impact, which results in almost unfiltered reconstruction, because little residual signal has to be encoded. At INTRA frames, the deblocking is activated again, because the residual signal has a much larger energy. Suddenly activating the deblocking filter increases even more the visual gap between INTRA (filtered) and INTER (unfiltered) pictures. The flashing effect is therefore made worse by the deblocking filter adaptive action.
SUMMARY OF THE INVENTION It is an object of the invention to propose a technical solution for reducing or cancelling this flashing effect. To this end, the invention relates to an encoding method such as described in the introductory part of the description and which is moreover characterized in that, before being quantized and encoded in INTRA mode, said INTRA refresh pictures arc replaced by an INTER picture having quality and artefacts substantially similar to those of the last encoded INTER picture(s). This technical solution is efficient in reducing or cancelling the flashing effect, since INTRA refresh frames are not directly encoded for original pictures, as it is generally the case, but from so-called "fake" pictures generated for replacing the refresh pictures. These fake pictures do no exist in the original sequence of pictures but have the same quality and artefacts as other temporally predicted pictures of said sequence. The encoder takes care to encode this different version of the pictures, and the visual quality of the decoded picture then remains equal to that of other pictures : said decoded picture does not look like an INTRA picture, as if it had not been refreshed and encoded only in the INTER mode. It is another object of the invention to propose an encoding device allowing to carry out said encoding method. To this end, the invention relates to an encoding device provided for encoding digital video data corresponding to an original sequence of images and available in the form of a video stream consisting of successive pictures which are either INTRA pictures, called I-pictures and encoded by means of a so-called INTRA mode without any reference to any past or future picture, or INTER pictures, that are themselves either monodirectionally predicted pictures, called P-pictures and encoded with reference to a past or future reference picture which is an INTRA or INTER picture, or bidirectionally predicted pictures, called B-pictures and encoded with reference to one or more reference picture(s), said INTRA pictures themselves comprising either I-pictures placed at the beginning of a new group of pictures corresponding to a scene change, where no temporal redundancy is available, and called scene change I-pictures, or I-pictures placed in other locations, where some temporal redundancy is available, and called refresh pictures, said encoding device, intended to generate an output coded bitstream, comprising at least a quantizing and coding branch, which receives and encodes the sequence of I, P, B pictures to be encoded, a prediction branch, which reconstructs predicted pictures corresponding to the received pictures that are respectively encoded, and a controlling branch which controls the successive encoding operations applied to said
I, P, B pictures, said controlling branch controlling, when the I picture to be encoded is a refresh picture, the implementation of the following steps : (a) the concerned INTRA refresh picture is encoded as an INTER picture, similarly to the encoding step of the previous INTER picture(s) of the sequence, no corresponding output bits being however sent into the output coded bitstream ; (b) the temporally predicted picture corresponding to the encoded INTRA refresh picture thus obtained is reconstructed ; (c) the reconstructed picture thus obtained is encoded in INTRA mode, the corresponding output bits being now sent into said output coded bitstream.
BRIEF DESCRIPTION OF THE DRAWINGS The present invention will now be described, by way of example, with reference to the accompanying drawings, in which Figs 1 and 2 illustrate two embodiments of a coding system.
BRIEF DESCRIPTION OF THE INVENTION An example of conventional coding system is illustrated in Fig.l. As illustrated, said system, the input of which receives an input video sequence 10, comprises a motion estimation module 11, a motion compensation module 12, a rate control module 13, a DCT module 14, a quantization (Q) module 15, a variable length coding (VLC) module 16, a buffer 17 (at the output of which a coded bitstream 22 that has to be stored or transmitted to the decoding side is available), an inverse quantization (Q"1) module 18, an inverse DCT (DCT 1) transform module 19, a subtractor 20, an adder 21 and a frame memory 23 (although this system comprises a plurality of modules, those skilled in the art will realize that the functions performed by the various modules are not required to be isolated into separate modules as shown in FIG.l, and that, for example, the set of modules comprising the motion compensation module 12, the inverse quantization module 18 and the inverse DCT module 19 can be implemented as an "embedded decoder"). A deblocking filter (shown in Fig 1, with the reference 24) may be provided between the output of the adder 21 and the input of the frame memory 23. In accordance with the MPEG standard and all digital block-based video encoding systems, the input pictures are, as previously said, of I, P or B type. The first step of the coding method according to the invention will now be described. When an INTRA refresh picture, that should be encoded in INTRA mode, is present at the input 10 of the coding system, said picture is in fact not coded in INTRA mode, but as a P picture, similarly to the last real picture, in order to obtain a fake reconstructed picture having substantially the same quality and artefacts as other temporally predicted pictures (or very similar ones). However, no bits are output to the coded bitstream (at the decoding side, the decoder would expect an INTRA picture). The following steps of the coding method are then the following one. First, the temporally predicted picture corresponding to the encoded refresh picture thus obtained is reconstructed in the prediction branch comprising the modules 12, 18 and 19. Then the reconstructed picture thus obtained, which now includes temporal defects similar to those of the previous P-pictures, takes the place of the INTRA refresh picture and is encoded in the INTRA mode. This time, the encoded bits thus generated are output to the coded bitstream. The advantage of the invention may be observed at the decoding side. The decoder does not know that it is decoding a fake picture when it begins decoding a new GOP, but the visual quality of the refresh picture remains equal to other pictures and does not look like an INTRA picture, as if it had not been refreshed. It can be mentioned that the method and device according to the present invention are not limited to the above- indicated implementation, and that other embodiments may be proposed. For example, when the fake picture obtained according to the invention is encoded in the INTRA mode, the encoder will reproduce (as planned in accordance with the principle of the invention) the temporal artefacts, but it is preferable not to introduce visible spatial artefacts. It is consequently proposed to use significantly lower quantization levels than for the other INTER pictures. It is thus possible to minimize spatial artefacts. In the proposed embodiment of the invention, the method described above is applied only to a limited category of INTRA pictures (the INTRA refresh pictures). It can also be applied to all INTRA pictures, but, in this case, if lower quantization levels have been moreover used, it will then be advantageous to disable the method at scene cuts, since the encoding of the fake picture costs more bits (due to the lower quantization levels). Also in order not to introduce visible spatial artefacts, when the encoder is an H.264 encoder including a deblocking filter in its decoding loop (it is a normative part of H.264, since the encoding and decoding devices then perform the same filtering to avoid drift effects), said deblocking filter is disabled for the INTRA refresh pictures. It is thus possible to avoid filtering the fake picture reconstruction, which already takes into account the action of the deblocking filter on INTER pictures. This disabling operation must be signaled in the coded bitstream. A disabling operation may also be proposed for refresh pictures in scenes undergoing large motions, in order to save bits. It can be justified by the fact that the flashing effect is not visible in such scenes. It may also be proposed, as shown in Fig.2, to add in the prediction branch (with or without the deblocking filter 24), between the output of the adder 21 and the input of the frame memory 23, a temporal filtering circuit 25. Different implementations may be proposed for such a circuit. For example, it could keep in memory (in a memory having the size of an image) the previous (or a previous) image or the following (or a following) image, or keep in memory a lot of past and/or next images and filter corresponding pixels using median filters or filters of a similar nature. With such a structure, the prediction step is more accurate and the residual signal obtained at the output of the subtractor 20 (by difference between the input signal and the predicted one) is smaller, i.e. the compression factor is improved. The image reconstruction at the decoding side is then performed with a higher quality. It can be noted that, the deblocking filter 24 may be present, or not, in the prediction branch. It can be added here that there are numerous ways of implementing functions by means of items of hardware or software, or both. In this respect, the drawings are very diagrammatic. Thus, although a drawing shows different functions as different blocks, this by no means excludes that a single item of hardware or software carries out several functions. Nor does it exclude that an assembly of items of hardware or software or both carry out a function. The remarks made herein before demonstrate that the detailed description," with reference to the drawings, illustrates rather than limits the invention, and that there are numerous alternatives falling which fall within the scope of the appended claims. The word "comprising" does not exclude the presence of other elements or steps than those listed in a claim. The word "a" or "an" preceding an element or step does not exclude the presence of a plurality of such elements or steps.

Claims

1. A method of encoding digital video data corresponding to an original sequence of images and available in the form of a video stream consisting of successive pictures which are either INTRA pictures, called I-pictures and encoded by means of a so-called INTRA mode without any reference to any past or future picture, or
INTER pictures, that are themselves either monodirectionally predicted pictures, called P-pictures and encoded with reference to a past or future reference picture which is an INTRA or INTER picture, or bidirectionally predicted pictures, called B-pictures and encoded with reference to one or more reference picture(s), said INTRA pictures themselves comprising either I-pictures placed at the beginning of a new group of pictures corresponding to a scene change, where no temporal redundancy is available, and called scene change I-pictures, or I-pictures placed in other locations, where some temporal redundancy is available, and called refresh pictures, said method, intended to generate an output coded bitstream, being characterized in that, before being quantized and encoded in INTRA mode, said
INTRA refresh pictures are replaced by an INTER picture having quality and artefacts substantially similar to those of the last encoded INTER picture(s).
2. An encoding method according to claim 1, in which each INTRA refresh picture is encoded by implementation of the following steps : (a) the concerned INTRA refresh picture is encoded as an INTER picture, similarly to the encoding step of the previous INTER picture(s) of the sequence, no corresponding output bits being however sent into the output coded bitstream ; (b) the temporally predicted picture corresponding to the encoded INTRA refresh picture thus obtained is reconstructed ; (c) the reconstructed picture thus obtained is encoded in INTRA mode, the corresponding output bits being now sent into said output coded bitstream.
3. An encoding method according to claim 1, in which each INTRA refresh picture is encoded by implementation of the following steps : (a) the concerned INTRA refresh picture is encoded as a P-picture, similarly to the encoding step of the previous P-picture of the sequence, no corresponding output bits being however sent into the output coded bitstream ; (b) the temporally predicted picture corresponding to the encoded INTRA refresh picture thus obtained is reconstructed ; y (c) the reconstructed picture thus obtained is encoded in INTRA mode, the corresponding output bits being now sent into said output coded bitstream.
4. An encoding method according to anyone of claims 2 and 3, in which, before the final encoding step, the quantizing step of the concerned INTRA refresh pictures is performed with a quantization level significantly lower than for the other
INTER pictures.
5. An encoding method according to claim 4, wherein it is applied to all INTRA pictures, except those located just after the scene cuts.
6. An encoding method according to claim 4, wherein it is applied to all INTRA pictures, except to the INTRA refresh pictures located in scenes undergoing large motions.
7. An encoding method according to claim 1, in which, when said sequence is processed by an H.264-like encoder comprising a deblocking filter in its decoding loop, said deblocking filter is disabled for INTRA refresh pictures.
8. An encoding device provided for encoding digital video data corresponding to an original sequence of images and available in the form of a video stream consisting of successive pictures which are either INTRA pictures, called I-pictures and encoded by means of a so-called INTRA mode without any reference to any past or future picture, or INTER pictures, that are themselves either monodirectionally predicted pictures, called P-pictures and encoded with reference to a past or future reference picture which is an INTRA or INTER picture, or bidirectionally predicted pictures, called B-pictures and encoded with reference to one or more reference picture(s), said INTRA pictures themselves comprising either I-pictures placed at the beginning of a new group of pictures corresponding to a scene change, where no temporal redundancy is available, and called scene change
I-pictures, or I-pictures placed in other locations, where some temporal redundancy is available, and called refresh pictures, said encoding device, intended to generate an output coded bitstream, comprising at least a quantizing and coding branch, which receives and encodes the sequence of I, P, B pictures to be encoded, a prediction branch, which reconstructs predicted pictures corresponding to the received pictures that are respectively encoded, and a controlling branch which controls the successive encoding operations applied to said I, P, B pictures, said controlling branch controlling, when the I picture to be encoded is a refresh picture, the implementation of the following steps : (a) the concerned INTRA refresh picture is encoded as an INTER picture, similarly to the encoding step of the previous INTER picture(s) of the sequence, no corresponding output bits being however sent into the output coded bitstream ; (b) the temporally predicted picture corresponding to the encoded INTRA refresh picture thus obtained is reconstructed ; (c) the reconstructed picture thus obtained is encoded in INTRA mode, the corresponding output bits being now sent into said output coded bitstream.
9. An encoding device acording to claim 8, in which, in the step (a), said
INTER picture is a P-picture.
10. An encoding device according to anyone of claims 8 and 9, in which the concerned INTRA refresh pictures are, before they are encoded in the final encoding step, quantized with a quantization level significantly lower than for the other INTER pictures.
11. An encoding device according to claim 10, wherein it is applied to all INTRA pictures, except those located just after the scene cuts.
12. An encoding device according to claim 10, wherein it is applied to all INTRA pictures, except to the INTRA refresh pictures located in scenes undergoing large motions.
13. An encoding device according to claim 10, in which, when said sequence is processed by an H.264-like encoder comprising a deblocking filter in its decoding loop, said deblocking filter is disabled for INTRA refresh pictures.
PCT/IB2005/051651 2004-05-25 2005-05-20 Method and device for encoding digital video data WO2005117449A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP05748193A EP1754378A1 (en) 2004-05-25 2005-05-20 Method and device for encoding digital video data
US11/597,534 US20070230574A1 (en) 2004-05-25 2005-05-20 Method and Device for Encoding Digital Video Data
KR1020067027023A KR20070033364A (en) 2004-05-25 2005-05-20 Method and apparatus for encoding digital video data
JP2007514255A JP2008500760A (en) 2004-05-25 2005-05-20 Method and apparatus for encoding digital video data

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP04300302.9 2004-05-25
EP04300302 2004-05-25

Publications (1)

Publication Number Publication Date
WO2005117449A1 true WO2005117449A1 (en) 2005-12-08

Family

ID=34970916

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2005/051651 WO2005117449A1 (en) 2004-05-25 2005-05-20 Method and device for encoding digital video data

Country Status (6)

Country Link
US (1) US20070230574A1 (en)
EP (1) EP1754378A1 (en)
JP (1) JP2008500760A (en)
KR (1) KR20070033364A (en)
CN (1) CN100534196C (en)
WO (1) WO2005117449A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1845735A1 (en) * 2004-12-28 2007-10-17 NEC Corporation Moving picture encoding method, and apparatus and computer program using the same
WO2009044475A1 (en) * 2007-10-04 2009-04-09 Fujitsu Limited Coding device, coding method, coding program, and coding circuit
EP2057846A4 (en) * 2006-09-01 2016-06-29 Thomson Licensing Method and apparatus for encoding a flash picture occurring in a video sequence, and for decoding corresponding data for a flash picture

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101321284B (en) * 2007-06-10 2012-01-04 华为技术有限公司 Encoding/decoding method, equipment and system
CN101459840B (en) * 2007-12-13 2010-04-21 华为技术有限公司 Encoding and decoding method, apparatus and system for video image
CN101686391A (en) * 2008-09-22 2010-03-31 华为技术有限公司 Video coding/decoding method and device as well as video playing method, device and system
US20110080948A1 (en) * 2009-10-05 2011-04-07 Xuemin Chen Method and system for 3d video decoding using a tier system framework
CN102215396A (en) 2010-04-09 2011-10-12 华为技术有限公司 Video coding and decoding methods and systems
CN106454370B (en) 2010-04-13 2019-11-01 Ge视频压缩有限责任公司 Decoder, method, encoder, coding method and the data flow for rebuilding array
JP5718453B2 (en) 2010-04-13 2015-05-13 フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン Decryption method
CN102939750B (en) 2010-04-13 2016-07-06 Ge视频压缩有限责任公司 Across planar prediction
ES2907203T3 (en) 2010-04-13 2022-04-22 Ge Video Compression Llc Video encoding using multitree image subdivisions
EP2536143B1 (en) 2011-06-16 2015-01-14 Axis AB Method and a digital video encoder system for encoding digital video data
EP3174295B1 (en) 2012-04-13 2018-12-12 GE Video Compression, LLC Low delay picture coding
CN115442626A (en) 2012-06-29 2022-12-06 Ge视频压缩有限责任公司 Video data stream, encoder, method of encoding video content and decoder
EP3673654A4 (en) * 2018-01-30 2020-07-01 SZ DJI Technology Co., Ltd. Video data encoding

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6317518B1 (en) 1994-04-28 2001-11-13 Canon Kabushiki Kaisha Image coding apparatus

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0714209B2 (en) * 1989-12-20 1995-02-15 松下電器産業株式会社 Video coding device
US5774593A (en) * 1995-07-24 1998-06-30 University Of Washington Automatic scene decomposition and optimization of MPEG compressed video
DK172429B1 (en) * 1996-04-25 1998-06-08 Peter Mikkelsen Method of training an image analysis system for use in analyzing a subject, as well as using the method
US6014689A (en) * 1997-06-03 2000-01-11 Smith Micro Software Inc. E-mail system with a video e-mail player
US6731684B1 (en) * 1998-09-29 2004-05-04 General Instrument Corporation Method and apparatus for detecting scene changes and adjusting picture coding type in a high definition television encoder
US20020122491A1 (en) * 2001-01-03 2002-09-05 Marta Karczewicz Video decoder architecture and method for using same
CN100551009C (en) * 2001-04-24 2009-10-14 皇家菲利浦电子有限公司 Be used to produce the method and apparatus of vision signal
FI114679B (en) * 2002-04-29 2004-11-30 Nokia Corp Random start points in video encoding
US7068722B2 (en) * 2002-09-25 2006-06-27 Lsi Logic Corporation Content adaptive video processor using motion compensation
EP1496707A1 (en) * 2003-07-09 2005-01-12 Matsushita Electric Industrial Co., Ltd. Encoding and decoding of video images with delayed reference picture refresh
KR101050828B1 (en) * 2003-08-26 2011-07-21 톰슨 라이센싱 Method and apparatus for decoding hybrid intra-inter coded block
US7609762B2 (en) * 2003-09-07 2009-10-27 Microsoft Corporation Signaling for entry point frames with predicted first field
US8401069B2 (en) * 2004-04-05 2013-03-19 Snell Limited Apparatus and process for re-timing video cuts
US8879856B2 (en) * 2005-09-27 2014-11-04 Qualcomm Incorporated Content driven transcoder that orchestrates multimedia transcoding using content information
US8582663B2 (en) * 2006-08-08 2013-11-12 Core Wireless Licensing S.A.R.L. Method, device, and system for multiplexing of video streams

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6317518B1 (en) 1994-04-28 2001-11-13 Canon Kabushiki Kaisha Image coding apparatus

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
CRUSTY: "The Unofficial XviD FAQ", 4 March 2004 (2004-03-04), XP002338686, Retrieved from the Internet <URL:http://ronald.vslcatena.nl/docs/Xvid-FAQ-Crusty-04-03-2004.pdf> [retrieved on 20050728] *
ELY S R: "MPEG VIDEO CODING: A basic tutorial introduction", BBC RD, no. 1996/3, 1996, pages 1 - 10, XP002338690 *
FAN X ET AL: "Flicking Reduction in All Intra Frame Coding", JOINT VIDEO TEAM (JVT) OF ISO/IEC MPEG & ITU-T VCEG, 5TH MEETING: GENEVA SWITZERLAND, DOCUMENT JVT-E070, 9 October 2002 (2002-10-09), XP002338688 *
KURCEREN R ET AL: "New Macroblock Modes for SP-frames", ITU-TELECOMMUNICATIONS STANDARDIZATION SECTOR, STUDY GROUP 16 QUESTION 6 VIDEO CODING EXPERTS GROUP (VCEG), DOCUMENT VCEG-O47, 4 December 2001 (2001-12-04), pages 1 - 6, XP002338687 *
S.R. ELY: "MPEG video coding: a basic tutorial introduction", REPORT BBC RD 1996/3
SEGALL C A ET ALL: "Pre-processing of compressed digital video", PROC. SPIE INT. SOC. OPT. ENG, vol. 4310, 2001, pages 163 - 174, XP002338689 *
SONG H ET AL: "RATE CONTROL FOR LOW-BIT-RATE VIDEO VIA VARIALBE-ENCODING FRAME RATES", IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, IEEE INC. NEW YORK, US, vol. 11, no. 4, April 2001 (2001-04-01), pages 512 - 521, XP001093570, ISSN: 1051-8215 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1845735A1 (en) * 2004-12-28 2007-10-17 NEC Corporation Moving picture encoding method, and apparatus and computer program using the same
EP1845735A4 (en) * 2004-12-28 2009-07-15 Nec Corp Moving picture encoding method, and apparatus and computer program using the same
US8325799B2 (en) 2004-12-28 2012-12-04 Nec Corporation Moving picture encoding method, device using the same, and computer program
EP2057846A4 (en) * 2006-09-01 2016-06-29 Thomson Licensing Method and apparatus for encoding a flash picture occurring in a video sequence, and for decoding corresponding data for a flash picture
WO2009044475A1 (en) * 2007-10-04 2009-04-09 Fujitsu Limited Coding device, coding method, coding program, and coding circuit
JP5152190B2 (en) * 2007-10-04 2013-02-27 富士通株式会社 Encoding device, encoding method, encoding program, and encoding circuit
US8605792B2 (en) 2007-10-04 2013-12-10 Fujitsu Limited Slice coding allocation method, apparatus and circuit

Also Published As

Publication number Publication date
CN1957617A (en) 2007-05-02
US20070230574A1 (en) 2007-10-04
EP1754378A1 (en) 2007-02-21
CN100534196C (en) 2009-08-26
KR20070033364A (en) 2007-03-26
JP2008500760A (en) 2008-01-10

Similar Documents

Publication Publication Date Title
US20070230574A1 (en) Method and Device for Encoding Digital Video Data
US10225549B1 (en) System and method for video coding
US7310371B2 (en) Method and/or apparatus for reducing the complexity of H.264 B-frame encoding using selective reconstruction
US8897591B2 (en) Method and apparatus for video coding using adaptive loop filter
US6931063B2 (en) Method and apparatus for controlling loop filtering or post filtering in block based motion compensationed video coding
US7324595B2 (en) Method and/or apparatus for reducing the complexity of non-reference frame encoding using selective reconstruction
EP2735149B1 (en) Adaptation parameter sets for video coding
CA2467496C (en) Global motion compensation for video pictures
US20020118755A1 (en) Video coding architecture and methods for using same
WO2020185959A1 (en) Gradual decoding refresh in video coding
US20180332281A1 (en) Adaptive anchor frame and quantization parameter decision in video coding
US20010026587A1 (en) Image encoding apparatus and method of same, video camera, image recording apparatus, and image transmission apparatus
US6907071B2 (en) Selective prediction for intra-coding video data block
US7277487B2 (en) Picture segment coding with motion prediction
US20160080752A1 (en) Method and apparatus for processing video signal
US9131233B1 (en) Methods for intra beating reduction in video compression

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2005748193

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 11597534

Country of ref document: US

Ref document number: 2007230574

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 200580016869.4

Country of ref document: CN

Ref document number: 2007514255

Country of ref document: JP

Ref document number: 4334/CHENP/2006

Country of ref document: IN

NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Ref document number: DE

WWE Wipo information: entry into national phase

Ref document number: 1020067027023

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2005748193

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1020067027023

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 11597534

Country of ref document: US