WO2004102971A1 - Video processing device with low memory bandwidth requirements - Google Patents

Video processing device with low memory bandwidth requirements Download PDF

Info

Publication number
WO2004102971A1
WO2004102971A1 PCT/IB2004/001608 IB2004001608W WO2004102971A1 WO 2004102971 A1 WO2004102971 A1 WO 2004102971A1 IB 2004001608 W IB2004001608 W IB 2004001608W WO 2004102971 A1 WO2004102971 A1 WO 2004102971A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
memory
pictures
cache memory
processing device
Prior art date
Application number
PCT/IB2004/001608
Other languages
French (fr)
Inventor
Stéphane Mutz
Hugues De Perthuis
Eric Desmicht
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=33442888&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=WO2004102971(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Koninklijke Philips Electronics N.V. filed Critical Koninklijke Philips Electronics N.V.
Priority to JP2006530662A priority Critical patent/JP2007503787A/en
Priority to US10/556,616 priority patent/US8155459B2/en
Priority to EP04731432A priority patent/EP1629674A1/en
Publication of WO2004102971A1 publication Critical patent/WO2004102971A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/43Hardware specially adapted for motion estimation or compensation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/43Hardware specially adapted for motion estimation or compensation
    • H04N19/433Hardware specially adapted for motion estimation or compensation characterised by techniques for memory access
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding

Definitions

  • the present invention relates to a video processing device for processing data corresponding to a sequence of pictures according to a predictive block-based encoding technique.
  • This invention is particularly relevant to video encoder, decoder and transcoder based on MPEG or an equivalent video standard.
  • Video decoders or encoders based on predictive block-based encoding techniques are based on a recursive use of motion estimation/compensation in order to reduce the amount of information to be transmitted.
  • Fig. 1 shows a conventional video decoder according to these encoding techniques.
  • Such a conventional video decoder is described for example in "MPEG video encoding: a basic tutorial introduction", BBC Research and Development Report, by S.R. Ely 1996/3.
  • Said video decoder (100) comprises a decoding unit (10) for decoding an encoded data stream ES corresponding to a sequence of encoded pictures.
  • a decoding unit (10) for decoding an encoded data stream ES corresponding to a sequence of encoded pictures.
  • three types of pictures are considered: I (or infra) pictures, encoded without any reference to other pictures, P (or predicted) pictures, encoded with reference to a past picture (I or P), and B (or bidirectionally predicted) pictures, encoded with reference to a past and a future picture (I or P) in a display order.
  • I and P pictures will be hereinafter referred to as reference pictures.
  • each picture of an MPEG sequence is subdivided into motion compensation areas called macroblocks.
  • the decoding unit includes: a parser (12), for analysing the encoded data stream, a macroblock processing unit MBPU (13), for computing motion vectors V(n) and variable length decoded data, an inverse quantizing and inverse discrete cosine transform IQ/IDCT circuit (15) for delivering a residual error data R'(n) from the variable length decoded data, a motion compensation circuit MC (14) for delivering motion compensated data using the motion vector V(n), a reconstruction circuit REC (16) for reconstructing pictures from a sum of motion compensated data and residual error data.
  • the known video decoder comprises an external memory EMEM (1) for storing reconstructed pictures delivered by the reconstruction circuit.
  • the pictures to be stored are reference pictures F0 and FI of the infra or predictive type.
  • the decoding unit further comprises a memory controller MMI (11) for controlling data exchange between said decoding unit and the external memory via a data bus (2).
  • Said data exchange is, for example, the storage of reference pictures from the reconstruction circuit into the external memory, or the read-out from the external memory of the motion compensated data in a reference picture in order to fetch them to the motion compensation circuit.
  • a first drawback of the prior art is that the motion compensation is performed on a macroblock basis, so that the motion compensated data are generally read out from different zones of the external memory for successive macroblocks.
  • the data readout from the external memory is achieved in an irregular manner and a video decoder according to the prior art needs an important memory bandwidth due to the amount of data to be read and to the difficulty of optimizing the access to the external memory with the memory controller.
  • the data to be read are not necessary aligned in the memory data banks. This drawback is strengthened by the fact that the bandwidth resources do not increase as fast as processor frequency does according to Moore's law.
  • the following example illustrates this point in the case of an MPEG-2 decoding.
  • an external memory organized in words of 64 bits.
  • a word can then contain 8 values (luminance or chrominance) of pixels.
  • the motion compensation circuit has to read areas of at least 16x8 pixels.
  • the motion compensation has a half-pixel accuracy.
  • the motion compensation unit has to read an area of 17x9 pixels in order to compute the interpolated pixel values.
  • the motion compensation circuit reads in fact 3 words of 9 lines or in other words 24x9 bytes, corresponding to a loss of bandwidth of 30% (17x9 corresponds to a bandwidth of approximately 180 Mbytes/s and 24x9 corresponds to a bandwidth of approximately 270 Mbytes/s for a MPEG-2 High Definition HD picture).
  • Another problem relates to the optimization of the memory controller. This is due to the fact that external memory, such as SDRAM for example, operates in a burst mode, which is not adapted to an irregular read-out of data. Bursts are generated for each lines of the memory. A burst comprises at least 7 or 8 cycles, whereas 3 cycles, in our example, would have been enough to read out the 3 words of a line. As a consequence, the needed bandwidth required for a video decoder according to the prior art is more than twice the bandwidth that would have theoretically been necessary for the decoding process.
  • references pictures cannot be stored easily in embedded memories instead of the external memory, as said memories are still very expensive.
  • an embedded memory of 6 Mbytes would be necessary in a high definition HD format, such a memory corresponding to a circuit of approximately 50 mm 2 size in a CMOS 0.12 micron technology, which represents a too important circuit surface.
  • the video processing device in accordance with the invention comprises: a processing unit including a reconstruction circuit for reconstructing pictures from decoded data, - an external memory for storing the reconstructed pictures delivered by the reconstruction circuit, the processing unit further comprising: a memory controller for controlling data exchange between the processing unit and the external memory, - a cache memory for temporarily storing data corresponding to a prediction area, said data being read out from the external memory via the memory controller, and a motion compensation circuit for delivering motion compensated data to the reconstruction circuit on the basis of the prediction area read out from the cache memory.
  • the present invention is based on the fact that, during the decompression process, the processing unit needs to read recursively a predetermined zone of the external memory corresponding to a predetermined area of a reference picture, said predetermination area being hereinafter referred to as prediction area.
  • Said prediction area serves as a reference for reconstructing a current picture block per block.
  • Such a prediction area can be loaded into an embedded memory, i.e. a cache memory, without requiring prohibitive cost or circuit surface, as said area is much smaller than the whole picture.
  • the memory bandwidth required by a processing device in accordance with the invention is decreased compared to a solution without cache memory. Moreover, there is no loss of bandwidth at the memory controller level, as the readout of data from the external memory into the cache memory is achieved on a regular basis.
  • Fig. 1 is a schematic view of a conventional video decoder
  • Fig. 2 is a schematic view of a video decoder in accordance with the invention.
  • Fig. 3 is a schematic view of a video decoder in accordance with the invention
  • the present invention is here described by way of examples of a video decoder and a video encoder but it will obvious to a person skilled in the art that said invention is applicable to any video processing device for processing data corresponding to a sequence of pictures according to a predictive block-based encoding technique, such as a transcoder for transcoding a first encoded data stream corresponding to a sequence of encoded pictures into a second encoded data stream, or a device for performing video scaling.
  • the present invention is also based on the fact that the size of the prediction area in which the 17x9 pixel area (as it has been hereinabove defined) has to be found is predetermined.
  • the prediction area is limited to 256 lines for decoding.
  • Fig. 2 describes a video decoder in accordance with the invention.
  • Said video decoder (200) comprises a decoding unit (20) for decoding an encoded data stream ES corresponding to a sequence of encoded pictures.
  • Said decoding unit includes: a parser (12), for analyzing the encoded data stream, - a macroblock processing unit MBPU (13), for computing motion vectors V(n) and variable length decoded data, an inverse quantizing and inverse discrete cosine transform IQ/IDCT circuit (15) for delivering a residual error data R'(n) from the variable length decoded data, a motion compensation circuit MC (14) for delivering motion compensated data using the motion vector V(n), a reconstruction circuit REC (16) for reconstructing pictures from a sum of motion compensated data and residual error data.
  • the video decoder comprises an external memory EMEM (1) for storing reference pictures F0 and FI delivered by the reconstruction circuit.
  • the decoding unit also comprises a memory controller MMI (11) for controlling data exchange between said decoding unit and the external memory via a data bus (2).
  • the video decoder according to the invention further comprises a cache memory CM (17) for temporarily storing data read out from the external memory via the memory controller.
  • Said cache memory comprises, in the MPEG2 case, 256 lines and is adapted to receive the prediction area.
  • the content of the cache memory can be updated in different ways.
  • the data corresponding to the prediction area are read out from the external memory in a regular manner during the decoding process.
  • the content of the cache memory is changed row by row, each time a row of macroblocks has been processed.
  • Motion compensation is then performed directly using the content of said cache memory, the irregular read-out of data being done at the level of the cache memory and no more at the level of the external memory, thus without requiring additional memory bandwidth.
  • the bandwidth required by a decoding device according to the invention is fixed and is equal to about 180 Mbytes/s.
  • the 256 lines of the cache memory are divided into equal zones. If the decoding unit needs to access a specific pixel in a zone, then a request, e.g. a cache miss, is generated by the cache memory, and it is only in that case that the corresponding zone is fetched from the external memory to the cache memory thanks to the memory controller. So, if during decoding, no pixel from a zone is needed, the bandwidth to fetch the corresponding part of the picture is saved. As a result, the bandwidth required by the decoding device according to the invention is variable and is comprised between 0 and 180 MByte/s, depending on the decoded stream. According to a first embodiment of the invention, the prediction areas of 2 reference pictures are stored in the cache memory.
  • the size of the embedded memory is thus divided by more than 4 in HD format compared to a solution where the whole frames would have been embedded.
  • a second embodiment of the invention only the prediction area of the past reference picture is stored in the cache memory, whereas the future reference picture is read out from the external memory.
  • the embedded memory size is decreased but the memory bandwidth required by a video decoder in accordance with the invention is slightly increased compared to the first embodiment.
  • the prediction areas of the luminance component of the reference pictures are stored in the cache memory, whereas the prediction areas of the chrominance component of said reference pictures is read out directly from the external memory.
  • the embedded memory size is decreased but the bandwidth required by the video decoder is slightly increased compared to the first embodiment.
  • Fig. 3 describes a video encoder according to the invention.
  • Said video encoder (300) comprises an encoding unit (30) for encoding an input data sfream corresponding to a sequence of pictures.
  • Said encoding unit includes: a subfractor SUB (32) for delivering first residual error data R(n), a discrete cosine transform and quantizing DCT/Q circuit (33) for transforming and quantizing successively the first residual error data R(n), - a variable length coder VLC (34) for delivering variable length coded data from the quantized data, an inverse quantizing and inverse discrete cosine transform IQ/IDCT circuit (35) for delivering second residual error data R'(n) from the quantized data, a motion compensation circuit MC (37) for delivering motion compensated data P(F(n-l);V(n)) to a reconstruction circuit REC (36) and to the subfractor using a motion vector V(n), the subfractor being adapted to subtract the motion compensated data from the input data I(n), a reconstruction circuit REC (36) for reconstructing pictures from a sum of the motion compensated data and the second residual error data R', - a motion estimation circuit ME (38) for finding, in a
  • the motion estimation circuit is based, for example, on the computing of the sum of absolute differences SAD, the expression of the SAD being: k k-1
  • the video decoder comprises an external memory EMEM (1) for storing reference pictures F0 and FI delivered by the reconstruction circuit, as well as the current picture to be encoded.
  • the encoding unit comprises a memory controller MMI (31) for controlling data exchange between said encoding unit and the external memory via a data bus (2).
  • the video decoder further comprises a cache memory CM (39) for temporarily storing data corresponding to the prediction area and read out from the external memory via the memory controller. Motion estimation and motion compensation are then performed directly using said cache memory
  • CM cache memory
  • the gain in terms of bandwidth can even be increased compared to a video decoder, as the size of the prediction area is not normative for encoding and thus can be decreased to 128 lines or even 64 lines but, of course, at the cost of a decreased video quality.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention relates to a video processing device for processing data corresponding to a sequence of pictures according to a predictive block-based encoding technique. Said device comprises a processing unit (20) including a reconstruction circuit (16) for reconstructing pictures from decoded data and an external memory (1) for storing reference pictures delivered by the reconstruction circuit. The processing unit further comprises a memory controller (11) for controlling data exchange between the processing unit and the external memory, a cache memory (17) for temporarily storing data corresponding to a prediction area, said data being read out from the external memory via the memory controller, and a motion compensation circuit (14) for delivering motion compensated data to the reconstruction circuit on the basis of the prediction area read out from the cache memory.

Description

VIDEO PROCESSING DEVICE WITH LOW MEMORY BANDWIDTH REQUIREMENTS
FIELD OF THE INVENTION
The present invention relates to a video processing device for processing data corresponding to a sequence of pictures according to a predictive block-based encoding technique.
This invention is particularly relevant to video encoder, decoder and transcoder based on MPEG or an equivalent video standard.
BACKGROUND OF THE INVENTION
Video decoders or encoders based on predictive block-based encoding techniques, such as MPEG-2 or H.264, for example, are based on a recursive use of motion estimation/compensation in order to reduce the amount of information to be transmitted.
Fig. 1 shows a conventional video decoder according to these encoding techniques. Such a conventional video decoder is described for example in "MPEG video encoding: a basic tutorial introduction", BBC Research and Development Report, by S.R. Ely 1996/3.
Said video decoder (100) comprises a decoding unit (10) for decoding an encoded data stream ES corresponding to a sequence of encoded pictures. In the MPEG standard, three types of pictures are considered: I (or infra) pictures, encoded without any reference to other pictures, P (or predicted) pictures, encoded with reference to a past picture (I or P), and B (or bidirectionally predicted) pictures, encoded with reference to a past and a future picture (I or P) in a display order. These I and P pictures will be hereinafter referred to as reference pictures. Moreover, each picture of an MPEG sequence is subdivided into motion compensation areas called macroblocks. The decoding unit according to the prior art includes: a parser (12), for analysing the encoded data stream, a macroblock processing unit MBPU (13), for computing motion vectors V(n) and variable length decoded data, an inverse quantizing and inverse discrete cosine transform IQ/IDCT circuit (15) for delivering a residual error data R'(n) from the variable length decoded data, a motion compensation circuit MC (14) for delivering motion compensated data using the motion vector V(n), a reconstruction circuit REC (16) for reconstructing pictures from a sum of motion compensated data and residual error data. The known video decoder comprises an external memory EMEM (1) for storing reconstructed pictures delivered by the reconstruction circuit. The pictures to be stored are reference pictures F0 and FI of the infra or predictive type.
The decoding unit further comprises a memory controller MMI (11) for controlling data exchange between said decoding unit and the external memory via a data bus (2). Said data exchange is, for example, the storage of reference pictures from the reconstruction circuit into the external memory, or the read-out from the external memory of the motion compensated data in a reference picture in order to fetch them to the motion compensation circuit. A first drawback of the prior art is that the motion compensation is performed on a macroblock basis, so that the motion compensated data are generally read out from different zones of the external memory for successive macroblocks. As a consequence, the data readout from the external memory is achieved in an irregular manner and a video decoder according to the prior art needs an important memory bandwidth due to the amount of data to be read and to the difficulty of optimizing the access to the external memory with the memory controller. In effect, the data to be read are not necessary aligned in the memory data banks. This drawback is strengthened by the fact that the bandwidth resources do not increase as fast as processor frequency does according to Moore's law.
The following example illustrates this point in the case of an MPEG-2 decoding. Let us assume an external memory organized in words of 64 bits. A word can then contain 8 values (luminance or chrominance) of pixels. The motion compensation circuit has to read areas of at least 16x8 pixels. In MPEG2 standard, the motion compensation has a half-pixel accuracy. As a consequence, the motion compensation unit has to read an area of 17x9 pixels in order to compute the interpolated pixel values. Due to the memory organization in words, the motion compensation circuit reads in fact 3 words of 9 lines or in other words 24x9 bytes, corresponding to a loss of bandwidth of 30% (17x9 corresponds to a bandwidth of approximately 180 Mbytes/s and 24x9 corresponds to a bandwidth of approximately 270 Mbytes/s for a MPEG-2 High Definition HD picture).
Another problem relates to the optimization of the memory controller. This is due to the fact that external memory, such as SDRAM for example, operates in a burst mode, which is not adapted to an irregular read-out of data. Bursts are generated for each lines of the memory. A burst comprises at least 7 or 8 cycles, whereas 3 cycles, in our example, would have been enough to read out the 3 words of a line. As a consequence, the needed bandwidth required for a video decoder according to the prior art is more than twice the bandwidth that would have theoretically been necessary for the decoding process.
Moreover, reference pictures cannot be stored easily in embedded memories instead of the external memory, as said memories are still very expensive. In our example, an embedded memory of 6 Mbytes would be necessary in a high definition HD format, such a memory corresponding to a circuit of approximately 50 mm2 size in a CMOS 0.12 micron technology, which represents a too important circuit surface.
SUMMARY OF THE INVENTION It is an object of the invention to propose a video processing device that requires a lower memory bandwidth than those of the prior art.
To this end, the video processing device in accordance with the invention comprises: a processing unit including a reconstruction circuit for reconstructing pictures from decoded data, - an external memory for storing the reconstructed pictures delivered by the reconstruction circuit, the processing unit further comprising: a memory controller for controlling data exchange between the processing unit and the external memory, - a cache memory for temporarily storing data corresponding to a prediction area, said data being read out from the external memory via the memory controller, and a motion compensation circuit for delivering motion compensated data to the reconstruction circuit on the basis of the prediction area read out from the cache memory.
The present invention is based on the fact that, during the decompression process, the processing unit needs to read recursively a predetermined zone of the external memory corresponding to a predetermined area of a reference picture, said predetermination area being hereinafter referred to as prediction area. Said prediction area serves as a reference for reconstructing a current picture block per block.
Such a prediction area can be loaded into an embedded memory, i.e. a cache memory, without requiring prohibitive cost or circuit surface, as said area is much smaller than the whole picture.
As a result, the memory bandwidth required by a processing device in accordance with the invention is decreased compared to a solution without cache memory. Moreover, there is no loss of bandwidth at the memory controller level, as the readout of data from the external memory into the cache memory is achieved on a regular basis.
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will now be described in more detail, by way of example, with reference to the accompanying drawings, wherein:
Fig. 1 is a schematic view of a conventional video decoder,
Fig. 2 is a schematic view of a video decoder in accordance with the invention, and
Fig. 3 is a schematic view of a video decoder in accordance with the invention
DETAILED DESCRIPTION OF THE INVENTION
The present invention is here described by way of examples of a video decoder and a video encoder but it will obvious to a person skilled in the art that said invention is applicable to any video processing device for processing data corresponding to a sequence of pictures according to a predictive block-based encoding technique, such as a transcoder for transcoding a first encoded data stream corresponding to a sequence of encoded pictures into a second encoded data stream, or a device for performing video scaling.
It is described in the case of the MPEG2 standard but is also applicable to other encoding format in which the prediction area has a limited format such as, for example, H.264.
The present invention is also based on the fact that the size of the prediction area in which the 17x9 pixel area (as it has been hereinabove defined) has to be found is predetermined. In the example of the MPEG2 standard, the prediction area is limited to 256 lines for decoding.
Fig. 2 describes a video decoder in accordance with the invention. Said video decoder (200) comprises a decoding unit (20) for decoding an encoded data stream ES corresponding to a sequence of encoded pictures. Said decoding unit includes: a parser (12), for analyzing the encoded data stream, - a macroblock processing unit MBPU (13), for computing motion vectors V(n) and variable length decoded data, an inverse quantizing and inverse discrete cosine transform IQ/IDCT circuit (15) for delivering a residual error data R'(n) from the variable length decoded data, a motion compensation circuit MC (14) for delivering motion compensated data using the motion vector V(n), a reconstruction circuit REC (16) for reconstructing pictures from a sum of motion compensated data and residual error data. The video decoder comprises an external memory EMEM (1) for storing reference pictures F0 and FI delivered by the reconstruction circuit.
The decoding unit also comprises a memory controller MMI (11) for controlling data exchange between said decoding unit and the external memory via a data bus (2).
The video decoder according to the invention further comprises a cache memory CM (17) for temporarily storing data read out from the external memory via the memory controller. Said cache memory comprises, in the MPEG2 case, 256 lines and is adapted to receive the prediction area. The content of the cache memory can be updated in different ways.
According to a first way, the data corresponding to the prediction area are read out from the external memory in a regular manner during the decoding process. The content of the cache memory is changed row by row, each time a row of macroblocks has been processed. Motion compensation is then performed directly using the content of said cache memory, the irregular read-out of data being done at the level of the cache memory and no more at the level of the external memory, thus without requiring additional memory bandwidth. As a result, the bandwidth required by a decoding device according to the invention is fixed and is equal to about 180 Mbytes/s.
According to another way, the 256 lines of the cache memory are divided into equal zones. If the decoding unit needs to access a specific pixel in a zone, then a request, e.g. a cache miss, is generated by the cache memory, and it is only in that case that the corresponding zone is fetched from the external memory to the cache memory thanks to the memory controller. So, if during decoding, no pixel from a zone is needed, the bandwidth to fetch the corresponding part of the picture is saved. As a result, the bandwidth required by the decoding device according to the invention is variable and is comprised between 0 and 180 MByte/s, depending on the decoded stream. According to a first embodiment of the invention, the prediction areas of 2 reference pictures are stored in the cache memory. The size of the embedded memory is thus divided by more than 4 in HD format compared to a solution where the whole frames would have been embedded. According to a second embodiment of the invention, only the prediction area of the past reference picture is stored in the cache memory, whereas the future reference picture is read out from the external memory. In this case, the embedded memory size is decreased but the memory bandwidth required by a video decoder in accordance with the invention is slightly increased compared to the first embodiment.
According to a third embodiment of the invention, the prediction areas of the luminance component of the reference pictures are stored in the cache memory, whereas the prediction areas of the chrominance component of said reference pictures is read out directly from the external memory. In the same manner, the embedded memory size is decreased but the bandwidth required by the video decoder is slightly increased compared to the first embodiment.
The present invention is also applicable to a video encoder. Fig. 3 describes a video encoder according to the invention. Said video encoder (300) comprises an encoding unit (30) for encoding an input data sfream corresponding to a sequence of pictures. Said encoding unit includes: a subfractor SUB (32) for delivering first residual error data R(n), a discrete cosine transform and quantizing DCT/Q circuit (33) for transforming and quantizing successively the first residual error data R(n), - a variable length coder VLC (34) for delivering variable length coded data from the quantized data, an inverse quantizing and inverse discrete cosine transform IQ/IDCT circuit (35) for delivering second residual error data R'(n) from the quantized data, a motion compensation circuit MC (37) for delivering motion compensated data P(F(n-l);V(n)) to a reconstruction circuit REC (36) and to the subfractor using a motion vector V(n), the subfractor being adapted to subtract the motion compensated data from the input data I(n), a reconstruction circuit REC (36) for reconstructing pictures from a sum of the motion compensated data and the second residual error data R', - a motion estimation circuit ME (38) for finding, in a reference picture, a reference macroblock associated to the current macroblock to be encoded, as well as its corresponding motion vector V(n).
The motion estimation circuit is based, for example, on the computing of the sum of absolute differences SAD, the expression of the SAD being: k k-1
SAD = ]|A(i) -B(i)| i=0 where B(i) and A(i) respectively designate the current macroblock of size k x k (16x16 pixels for example in the MPEG-2 standard) and the reference macroblock in the reference picture. The reference macroblock that minimizes the SAD is considered as the best matching macroblock and the corresponding data and motion vector are derived.
The video decoder comprises an external memory EMEM (1) for storing reference pictures F0 and FI delivered by the reconstruction circuit, as well as the current picture to be encoded.
The encoding unit comprises a memory controller MMI (31) for controlling data exchange between said encoding unit and the external memory via a data bus (2).
The video decoder according to the invention further comprises a cache memory CM (39) for temporarily storing data corresponding to the prediction area and read out from the external memory via the memory controller. Motion estimation and motion compensation are then performed directly using said cache memory In the case of a video encoder, the gain in terms of bandwidth can even be increased compared to a video decoder, as the size of the prediction area is not normative for encoding and thus can be decreased to 128 lines or even 64 lines but, of course, at the cost of a decreased video quality.
The drawings and their description hereinbefore illustrate rather than limit the invention. It will be evident that there are numerous alternatives, which fall within the scope of the appended claims. In this respect, the following closing remarks are made.
There are numerous ways of implementing functions by means of items of hardware.
In this respect, the drawings are very diagrammatic, each representing only one possible embodiment of the invention. Thus, although a drawing shows different functions as different blocks, this by no means excludes that a single item of hardware carries out several functions.
Nor does it exclude that an assembly of items of hardware carries out a function.
Any reference sign in the following claims should not be construed as limiting the claim. It will be obvious that the use of the verb "to comprise" and its conjugations do not exclude the presence of any other steps or elements besides those defined in any claim. The word "a" or "an" preceding an element or step does not exclude the presence of a plurality of such elements or steps.

Claims

1 A video processing device for processing data corresponding to a sequence of pictures according to a predictive block-based encoding technique, said device comprising: - a processing unit (20;30) including a reconstruction circuit (16;36) for reconstructing pictures from decoded data, an external memory (1) for storing reference pictures delivered by the reconstruction circuit, the processing unit further comprising: - a memory controller ( 11 ;31 ) for controlling data exchange between the processing unit and the external memory, a cache memory (17;39) for temporarily storing data corresponding to a prediction area, said data being read out from the external memory via the memory controller, and a motion compensation circuit (14;37) for delivering motion compensated data to the reconstruction circuit on the basis of the prediction area read out from the cache memory.
2 A video processing device as claimed in Claim 1, wherein the processing unit is a decoding unit (20) for decoding an encoded data stream corresponding to a sequence of encoded pictures.
3 A video processing device as claimed in Claim 1, wherein the processing unit is an encoding unit (30) for encoding an input data stream corresponding to a sequence of pictures.
4 A video processing device as claimed in Claim 1, wherein the processing unit is a transcoding unit for transcoding a first encoded data stream corresponding to a sequence of encoded pictures into a second encoded data stream.
5 A video processing device as claimed in Claim 1, wherein the memory controller (11 ;31) is able to fetch automatically the data corresponding to a complete prediction area from the external memory (1) to the cache memory (17;39).
6 A video processing device as claimed in Claim 1, wherein the cache memory (17;39) is divided into equal zones, and the memory controller (11;31) is able to fetch data corresponding to a zone from the external memory (1) to the cache memory (17;39) upon request of the processing unit.
7 A video processing device as claimed in Claim 1 , wherein the cache memory (17;39) is adapted to receive the prediction areas of two reference pictures.
8 A video processing device as claimed in Claim 1, wherein the cache memory (17;39) is adapted to receive the prediction area of a past reference picture, the prediction area of a future reference picture being read out from the external memory (1).
9 A video processing device as claimed in Claim 1, wherein the cache memory (17;39) is adapted to receive luminance components of the prediction area of at least one reference picture.
10 A video processing method for processing data corresponding to a sequence of pictures according to a predictive block-based encoding technique, said method comprising the steps of: reconstructing pictures from decoded data, storing reference pictures delivered by the reconstruction step in an external memory (1), temporarily storing data corresponding to a prediction area in a cache memory (17;39), said data being read out from the external memory via a memory controller, and motion compensation, able to deliver motion compensated data to the reconstruction step on the basis of the prediction area read out from the cache memory.
PCT/IB2004/001608 2003-05-19 2004-05-06 Video processing device with low memory bandwidth requirements WO2004102971A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2006530662A JP2007503787A (en) 2003-05-19 2004-05-06 Video processing device with low memory bandwidth requirements
US10/556,616 US8155459B2 (en) 2003-05-19 2004-05-06 Video processing device with low memory bandwidth requirements
EP04731432A EP1629674A1 (en) 2003-05-19 2004-05-06 Video processing device with low memory bandwidth requirements

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP03300015.9 2003-05-19
EP03300015 2003-05-19

Publications (1)

Publication Number Publication Date
WO2004102971A1 true WO2004102971A1 (en) 2004-11-25

Family

ID=33442888

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2004/001608 WO2004102971A1 (en) 2003-05-19 2004-05-06 Video processing device with low memory bandwidth requirements

Country Status (6)

Country Link
US (1) US8155459B2 (en)
EP (1) EP1629674A1 (en)
JP (1) JP2007503787A (en)
KR (1) KR20060012626A (en)
CN (1) CN1792097A (en)
WO (1) WO2004102971A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007112132A2 (en) * 2006-03-29 2007-10-04 Nvidia Corporation Video processing method and system
EP1890495A2 (en) * 2006-08-18 2008-02-20 NEC Electronics Corporation Bit-plane decoding device and bit-plane decoding method
WO2009109891A1 (en) * 2008-03-03 2009-09-11 Nxp B.V. Processor comprising a cache memory
EP2252064A1 (en) * 2008-03-31 2010-11-17 Panasonic Corporation Image decoding device, image decoding method, integrated circuit, and reception device
US8577165B2 (en) 2008-06-30 2013-11-05 Samsung Electronics Co., Ltd. Method and apparatus for bandwidth-reduced image encoding and decoding
US8599841B1 (en) 2006-03-28 2013-12-03 Nvidia Corporation Multi-format bitstream decoding engine
DE102007005866B4 (en) 2007-02-06 2021-11-04 Intel Deutschland Gmbh Arrangement, method and computer program product for displaying a sequence of digital images

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100944995B1 (en) * 2007-12-12 2010-03-05 재단법인서울대학교산학협력재단 Apparatus for motion compensation
US8411749B1 (en) * 2008-10-07 2013-04-02 Zenverge, Inc. Optimized motion compensation and motion estimation for video coding
US8732384B1 (en) 2009-08-04 2014-05-20 Csr Technology Inc. Method and apparatus for memory access
JP2012209914A (en) * 2010-12-08 2012-10-25 Sony Corp Image processor, image processing method and program
KR20120066305A (en) * 2010-12-14 2012-06-22 한국전자통신연구원 Caching apparatus and method for video motion estimation and motion compensation
KR101898464B1 (en) * 2011-03-17 2018-09-13 삼성전자주식회사 Motion estimation apparatus and method for estimating motion thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0602642A2 (en) * 1992-12-16 1994-06-22 Nec Corporation Moving picture decoding system
EP0639032A2 (en) * 1993-08-09 1995-02-15 C-Cube Microsystems, Inc. Structure and method for a multistandard video encoder/decoder
EP0750429A1 (en) * 1995-06-21 1996-12-27 STMicroelectronics Limited Video signal processor apparatus and method
FR2761499A1 (en) * 1997-03-25 1998-10-02 Sgs Thomson Microelectronics Coder and decoder for animated images, with cache memory
EP0971546A2 (en) * 1998-04-01 2000-01-12 Nec Corporation Moving pictures decompression device and method

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6370386A (en) 1986-09-11 1988-03-30 Nec Corp Graphics display and control system
US5444489A (en) * 1993-02-11 1995-08-22 Georgia Tech Research Corporation Vector quantization video encoder using hierarchical cache memory scheme
US5638531A (en) * 1995-06-07 1997-06-10 International Business Machines Corporation Multiprocessor integrated circuit with video refresh logic employing instruction/data caching and associated timing synchronization
KR100280285B1 (en) 1996-08-19 2001-02-01 윤종용 Multimedia processor suitable for multimedia signals
US6178203B1 (en) * 1997-04-03 2001-01-23 Lsi Logic Corporation Method and apparatus for two-row decoding of MPEG video
US6163576A (en) * 1998-04-13 2000-12-19 Lsi Logic Corporation Video encoder having reduced memory bandwidth requirements
FR2820846B1 (en) 2001-02-12 2003-05-30 Thomson Multimedia Sa DEVICE AND METHOD FOR MANAGING ACCESS TO A RECORDING MEDIUM
US20030014596A1 (en) * 2001-07-10 2003-01-16 Naohiko Irie Streaming data cache for multimedia processor

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0602642A2 (en) * 1992-12-16 1994-06-22 Nec Corporation Moving picture decoding system
EP0639032A2 (en) * 1993-08-09 1995-02-15 C-Cube Microsystems, Inc. Structure and method for a multistandard video encoder/decoder
EP0750429A1 (en) * 1995-06-21 1996-12-27 STMicroelectronics Limited Video signal processor apparatus and method
FR2761499A1 (en) * 1997-03-25 1998-10-02 Sgs Thomson Microelectronics Coder and decoder for animated images, with cache memory
EP0971546A2 (en) * 1998-04-01 2000-01-12 Nec Corporation Moving pictures decompression device and method

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8599841B1 (en) 2006-03-28 2013-12-03 Nvidia Corporation Multi-format bitstream decoding engine
WO2007112132A2 (en) * 2006-03-29 2007-10-04 Nvidia Corporation Video processing method and system
WO2007112132A3 (en) * 2006-03-29 2007-11-29 Nvidia Corp Video processing method and system
US8593469B2 (en) 2006-03-29 2013-11-26 Nvidia Corporation Method and circuit for efficient caching of reference video data
TWI471011B (en) * 2006-03-29 2015-01-21 Nvidia Corp Video processing method and system
EP1890495A2 (en) * 2006-08-18 2008-02-20 NEC Electronics Corporation Bit-plane decoding device and bit-plane decoding method
EP1890495A3 (en) * 2006-08-18 2008-12-03 NEC Electronics Corporation Bit-plane video decoding
DE102007005866B4 (en) 2007-02-06 2021-11-04 Intel Deutschland Gmbh Arrangement, method and computer program product for displaying a sequence of digital images
WO2009109891A1 (en) * 2008-03-03 2009-09-11 Nxp B.V. Processor comprising a cache memory
EP2252064A1 (en) * 2008-03-31 2010-11-17 Panasonic Corporation Image decoding device, image decoding method, integrated circuit, and reception device
EP2252064A4 (en) * 2008-03-31 2011-04-20 Panasonic Corp Image decoding device, image decoding method, integrated circuit, and reception device
US8577165B2 (en) 2008-06-30 2013-11-05 Samsung Electronics Co., Ltd. Method and apparatus for bandwidth-reduced image encoding and decoding

Also Published As

Publication number Publication date
US20070086522A1 (en) 2007-04-19
KR20060012626A (en) 2006-02-08
EP1629674A1 (en) 2006-03-01
CN1792097A (en) 2006-06-21
US8155459B2 (en) 2012-04-10
JP2007503787A (en) 2007-02-22

Similar Documents

Publication Publication Date Title
US6067322A (en) Half pixel motion estimation in motion video signal encoding
KR100322056B1 (en) Method for reducing processing power requirements of a video decoder
KR100566826B1 (en) System for processing a data stream of compressed image representative pixel data blocks
US7079692B2 (en) Reduced complexity video decoding by reducing the IDCT computation in B-frames
US5974185A (en) Methods and apparatus for encoding video data using motion vectors for decoding by regular or downconverting decoders
US20090175343A1 (en) Hybrid memory compression scheme for decoder bandwidth reduction
JPH0851631A (en) Trans-coding method and device
KR101147744B1 (en) Method and Apparatus of video transcoding and PVR of using the same
US20070171979A1 (en) Method of video decoding
KR19990036188A (en) Method and apparatus for decoding encoded digital video signal
US8155459B2 (en) Video processing device with low memory bandwidth requirements
US6148032A (en) Methods and apparatus for reducing the cost of video decoders
KR20030020419A (en) Method and device for video transcoding
US20030016745A1 (en) Multi-channel image encoding apparatus and encoding method thereof
KR100364748B1 (en) Apparatus for transcoding video
JP2001045491A (en) Decoding.display device for coded image
US6144323A (en) Method and apparatus for decoding video data
US20070153909A1 (en) Apparatus for image encoding and method thereof
US11622106B2 (en) Supporting multiple partition sizes using a unified pixel input data interface for fetching reference pixels in video encoders
JP2003520511A (en) Transcoding method and apparatus
KR100493476B1 (en) Image encoding circuit
US7813569B2 (en) MPEG video decoding method and MPEG video decoder using results from analysis of motion-vector data and DCT coefficients
JP2002374531A (en) Decoder
JP2005507620A (en) compression
Xu et al. Pixel-Copy Prediction Based Lossless Reference Frame Compression

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2004731432

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2007086522

Country of ref document: US

Ref document number: 10556616

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 1020057021969

Country of ref document: KR

Ref document number: 2006530662

Country of ref document: JP

Ref document number: 20048135840

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 1020057021969

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2004731432

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 10556616

Country of ref document: US