US9363514B2 - Spatial prediction technique for video coding
- Publication number
- US9363514B2 (application US13/575,105, US201113575105A)
- Authority
- US
- United States
- Prior art keywords
- pixel
- current block
- prediction
- pixels
- threshold value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related, expires
Classifications
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
- H04N19/593—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
- H04N19/00763
- H04N19/00696
- H04N19/11—Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
- H04N19/18—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
- H04N19/182—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
- H04N19/192—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
- H04N19/48—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Definitions
- the invention relates to the general domain of image coding.
- the invention relates more specifically to a method for coding and a method for decoding a block of an image.
- the coding of the current block generally comprises the transformation into a block of coefficients, for example by a DCT (Discrete Cosine Transform), of the difference between the pixels of the current block and the pixels of the prediction block.
- the coding further comprises the quantization of coefficients, then the entropy coding of quantized coefficients.
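For orientation, a minimal sketch of this classical residue-coding chain, assuming an orthonormal 2-D DCT from SciPy and a uniform quantizer (the entropy-coding stage is omitted and all names are illustrative, not the patent's own):

```python
import numpy as np
from scipy.fft import dctn

def code_residue(current_block, prediction_block, q_step=16.0):
    """Transform the prediction difference with a 2-D DCT, then quantize the coefficients."""
    residue = current_block.astype(float) - prediction_block.astype(float)
    coefficients = dctn(residue, norm='ortho')   # block of transform coefficients
    quantized = np.round(coefficients / q_step)  # uniform quantization, before entropy coding
    return quantized
```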
- the purpose of the invention is to overcome at least one of the disadvantages of the prior art.
- the invention relates to a method for coding a block of pixels, called the current block, by spatial prediction.
- the coding method according to the invention comprises the following steps:
- the current threshold value is determined or coded from neighbouring reconstructed pixels of the current block.
- the coding efficiency is improved.
- the step of determination, for each pixel of the current block, of a prediction pixel is reiterated with each threshold value of a plurality of threshold values.
- the method also comprises the selection as current threshold value, from among the plurality of threshold values, of the threshold value for which a prediction error calculated between the current block and the prediction block is smallest.
- the current threshold value is coded by difference with a prediction threshold value, which prediction value depends on neighbouring reconstructed pixels of the current block.
- the prediction threshold value is equal to the average of threshold values used in the neighbouring blocks of the current block.
- the prediction threshold value is equal to the median value of threshold values used in the neighbouring blocks of the current block.
- the prediction threshold value is determined according to the following steps:
- the current threshold value is determined according to the following steps:
- the prediction error calculated between the reconstructed pixels of the neighbouring area of the current block and the corresponding prediction pixels takes into account, for each of these reconstructed pixels, its distance with respect to the edge of the current block.
- when the current threshold value is selected for a current block of size 8×8, a current threshold value is calculated for each of the 4×4 blocks of the current block by multiplying the selected current threshold value by a coefficient strictly less than 1.
- the size of the window depends on the position in the current block of the pixel to be predicted.
- the invention also relates to a method for decoding a current block of pixels by spatial prediction comprising the following steps:
- the current threshold value is determined from neighbouring reconstructed pixels of the current block.
- the current threshold value is decoded from a stream of coded data.
- the method for decoding according to the invention also comprises the following steps:
- the prediction threshold value is determined according to the following steps:
- FIG. 1 shows a coding method according to the invention
- FIG. 2 shows an image part comprising a block to be predicted and a window used for the prediction of this block
- FIGS. 3 and 4 show in detail a step of the coding method according to the invention
- FIGS. 5 and 6 show an image part comprising a block to be predicted and different windows used for the prediction of this block
- FIG. 7 shows an image part comprising a block to be predicted, a causal zone Zc neighbouring this block and a window used for the prediction of pixels of this causal zone,
- FIG. 8 shows a decoding method according to the invention
- FIG. 9 illustrates a coding device according to the invention.
- FIG. 10 shows a decoding device according to the invention.
- An image comprises pixels or image points with each of which is associated at least one item of image data.
- An item of image data is for example an item of luminance data or an item of chrominance data.
- the term “residue” designates the data obtained after extraction of other data.
- the extraction is generally a subtraction of prediction pixels from source pixels. However, the extraction is more general and comprises notably a weighted subtraction.
- the term “reconstructed” designates data (for example pixels, blocks) obtained after merging of residues with prediction data.
- the merging is generally a sum of residue pixels with prediction pixels. However, the merging is more general and comprises notably the weighted sum.
- a reconstructed block is a block of reconstructed pixels.
- the terms “reconstruction” and “decoding” are very often used as synonyms.
- a reconstructed block is also designated under the terminology of “decoded block”.
- the invention relates to a method for coding a block of pixels, called the current block, by spatial prediction. It applies to the coding of an image or to image sequences.
- the method for coding according to the invention is based on a signal extrapolation method described in the document by Guleryuz, O. G. entitled “Nonlinear approximation based image recovery using adaptive sparse reconstructions and iterated denoising”, IEEE Transactions on Image Processing, Volume 15, Issue 3, March 2006, pages 539-571. Originally this extrapolation method was used for error concealment (masking of errors).
- a prediction pixel is determined for each pixel of a current block.
- the prediction pixels form a prediction block of the current block.
- the prediction pixels are obtained by a thresholding, with a current threshold value, of coefficients from a transformation applied on a window covering at least the pixel of the current block to be predicted.
- the window corresponds to the support of the transform.
- the transform used is for example a DCT. However, the invention is not limited to this latter. Other transforms such as the Discrete Fourier Transform can be applied.
- the current threshold value is determined or coded from neighbouring reconstructed pixels of the current block. In determining or in coding the current threshold value from neighbouring reconstructed pixels of the current block, the coding efficiency is improved.
- the prediction block formed from prediction pixels is extracted from the current block to generate a residue block.
- the residue block is coded in a stream S.
- the residue block is transformed, for example by a DCT or a wavelet transform into a block of coefficients that are quantized then coded by entropy coding.
- the residue block is only quantized then coded by entropy coding.
- Step 10 of determination of prediction pixels is described more precisely in reference to FIGS. 2 and 3.
- It is first sought to determine the prediction pixel p 0,0 corresponding to the pixel at the top left of the current block B to be predicted.
- the pixels identified in FIG. 2 by crosses are known pixels, i.e. reconstructed pixels.
- the pixel p 0,0 is predicted in order to give it a value representative of its environment.
- the window F, in its initial position F 0,0, covers at least the pixel p 0,0 to be predicted. It is to this window that a transformation is applied in step 10.
- an initial value is assigned to the pixel p 0,0 .
- the average value from the neighbouring pixels is assigned to the pixel p 0,0 .
- the median value of pixels a, b and c is assigned to the pixel p 0,0 .
- one of the values a, b or c is assigned to the pixel p 0,0 .
- other pixels situated in the causal neighbouring area of the pixel p 0,0 are taken into account to determine an initial value for the pixel p 0,0 .
- the causal neighbouring area of a current pixel comprises the set of pixels of the current image already reconstructed during the coding (respectively the decoding) of the current pixel.
- a transformation is applied to the pixels of window F. These pixels are then transformed into coefficients.
- the coefficients are thresholded in the transform domain using a threshold value th_opt.
- This thresholding has the effect of eliminating the noise to retain only the significant coefficients.
- an inverse transform to that applied in step 110 is applied to return to the pixel domain so as to recover a new prediction pixel value noted p_{tr.0,0}^{0,0}; the superscript indexes of p_{tr.0,0}^{0,0}, here of null value, correspond to a null offset of the window F in line and in column with respect to the initial position of the window.
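As an illustration only, a minimal sketch of steps 100 to 130 for one pixel, assuming a 2-D DCT as the transform and hard thresholding; the function and variable names, and the step labels in the comments, are assumptions rather than the patent's exact implementation:

```python
import numpy as np
from scipy.fft import dctn, idctn

def predict_pixel(window, pixel_pos, th_opt):
    """window: 2-D array covering the pixel to be predicted, already initialized in step 100
    (e.g. the average (a + b + c) / 3 of the causal neighbours for the pixel itself);
    pixel_pos: (row, col) of that pixel inside the window; th_opt: current threshold value."""
    coefficients = dctn(window, norm='ortho')           # step 110: transform the window
    coefficients[np.abs(coefficients) < th_opt] = 0.0   # step 120: keep only significant coefficients
    denoised = idctn(coefficients, norm='ortho')        # step 130: inverse transform
    return denoised[pixel_pos]                          # new prediction value for the pixel
```

For the pixel p 0,0 of FIG. 2, pixel_pos would designate the position of p 0,0 inside the window F 0,0.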
- the same method as that described in reference to FIG. 3 is applied with a window F offset to determine prediction pixels for the other pixels of the current block B.
- the pixels p 0,0 to p n-1,m-1 are predicted iteratively.
- the window is offset by 1 pixel to the right for the determination of the prediction pixel p 0,1 corresponding to the pixel of the current block just to the right of the pixel p 0,0 .
- a, b and c are neighbouring reconstructed pixels of the pixel p 0,1, situated respectively to the left, above and on the diagonal of the pixel to be predicted p 0,1, where a is, in the present case, equal to the value p_{tr.0,0}^{0,0} predicted previously.
- the window in position F 0,0 is offset by 2 pixels to the right, i.e. into the position F 0,2
- the pixel p 0,1 and the pixel p 0,2 are predicted.
- more generally, the window F can be offset in steps of m pixels. In order that the prediction is not too degraded, it is advantageous for the value of m to remain small.
- in step 100, the average value of the neighbouring pixels is assigned to p 0,1, i.e. the value (p_{tr.0,0}^{0,0} + b + c)/3. This value is noted p_{av.0,1}^{0,1}.
- the variants described in step 100 in reference to the pixel p 0,0 can also apply.
- a transformation is applied to the pixels of window F 0,1 . These pixels are then transformed into coefficients.
- the coefficients are thresholded in the transform domain with the threshold value th_opt. This thresholding has the effect of eliminating the noise to retain only the significant coefficients.
- an inverse transform to that applied in step 110 is applied to return to the pixel domain so as to recover a new prediction pixel value noted p_{tr.0,1}^{0,1}; the superscript indexes of p_{tr.0,1}^{0,1} correspond to an offset of the window F of 0 in lines and 1 in columns.
- the pixel p 0,0 is comprised in the window in position F 0,1.
- a new value is also calculated for the pixel p 0,0.
- a value p_{tr.0,0}^{0,1} is assigned to the pixel p 0,0.
- This value p_{tr.0,0}^{0,1} can be different from the value p_{tr.0,0}^{0,0} calculated in the preceding iteration (with the window in F 0,0), when the window F was not yet offset.
- the new value retained for the pixel p 0,0 is then, for example, a weighted sum of the value p_{tr.0,0}^{0,0} obtained in the preceding iteration for a null offset of the window F and of the value p_{tr.0,0}^{0,1} obtained in the current iteration for an offset of 0 in lines and of 1 in columns.
- the method is reiterated until all the pixels of the block B are predicted. For this purpose, during a step 140, it is verified whether the current pixel is the last pixel of the block to be predicted. If this is the case, the step of determination of the prediction block is terminated. In the contrary case, the window F is offset by one column to the right if there are still pixels to be predicted in the current line, or otherwise moved to the line below and placed again at the start of the line.
- the way in which the window F is offset at each iteration is not fixed: it depends on the scanning order defined for the block to be predicted. In reference to the preceding figures, the scanning of pixels is carried out pixel by pixel, from left to right, then line by line. This scanning is not exclusive: a zigzag type scanning is possible, as are other types, such as for example a scanning of the first line then of the first column, then of the second line then of the second column, and so on.
- a value p_{tr.sk,sl}^{sk,sl} is determined for the new pixel p sk,sl to be predicted.
- a new prediction value is also calculated for the pixels of the current block comprised in the window F for which a prediction value or prediction values has/have already been calculated during the preceding iterations. For these pixels a new prediction value is determined as described above in reference to the pixel p 0,0, in the following way:
- p_{tr.k,l}^{sk,sl} is the pixel predicted in line k and column l of the block to be predicted during the iteration corresponding to the position F sk,sl of the window F,
- sk and sl are respectively the line and column offsets of the window F,
- p_{Int.k,l}^{sk,sl} is the value of the prediction pixel in position (k, l) recursively predicted by successive offsets of the window F up to the position F sk,sl.
- the weighted sum is replaced by the median function
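Purely as an illustrative sketch of this recursion: a raster scan, a window placed so that the pixel to be predicted lies at the bottom-right of a causal support, and a fixed blending weight of 0.5 between the previous and the new value. These choices and all names are assumptions, not the patent's exact implementation.

```python
import numpy as np
from scipy.fft import dctn, idctn

def predict_block(rec, top, left, n, m, th_opt, win=8, w_new=0.5):
    """rec: image containing the reconstructed causal area (the block is assumed far enough
    from the image border); (top, left): position of the n x m current block."""
    pred = np.full((n, m), np.nan)
    for k in range(n):                     # raster scan of the block, line by line
        for l in range(m):
            r0, c0 = top + k - (win - 1), left + l - (win - 1)
            window = rec[r0:r0 + win, c0:c0 + win].astype(float)
            # step 100: reuse prediction values already computed for block pixels in the window
            for i in range(win):
                for j in range(win):
                    bk, bl = r0 + i - top, c0 + j - left
                    if 0 <= bk < n and 0 <= bl < m and not np.isnan(pred[bk, bl]):
                        window[i, j] = pred[bk, bl]
            # initial value of the pixel to be predicted: average of left, top and diagonal neighbours
            window[-1, -1] = (window[-1, -2] + window[-2, -1] + window[-2, -2]) / 3.0
            coefficients = dctn(window, norm='ortho')            # step 110
            coefficients[np.abs(coefficients) < th_opt] = 0.0    # step 120
            patch = idctn(coefficients, norm='ortho')            # step 130
            # recursive update of every block pixel covered by this window position
            for i in range(win):
                for j in range(win):
                    bk, bl = r0 + i - top, c0 + j - left
                    if 0 <= bk < n and 0 <= bl < m:
                        old = pred[bk, bl]
                        pred[bk, bl] = patch[i, j] if np.isnan(old) else (1.0 - w_new) * old + w_new * patch[i, j]
    return pred
```

Replacing the blend by a median over the accumulated values would correspond to the median variant mentioned above.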
- the threshold value th opt is determined from a causal zone Zc, i.e. comprising reconstructed pixels in a neighbouring area of the current block B, not necessarily adjacent to this block.
- the crosses designate reconstructed pixels.
- the crosses on a grey background designate pixels that belong to the causal zone Zc.
- This zone Zc is used to determine a threshold value for the current block to be predicted.
- the method described in reference to FIGS. 4 and 5 is applied to the pixels of this zone Zc in order to determine, for each of them, a prediction pixel, and this for several threshold values th_i.
- an energy level is calculated on the zone Zc. As a simple illustration, this energy will be calculated according to the following formula:
- p represents the position of a pixel contained in the block
- Y is the value of an item of image data (for example luminance and/or chrominance) of the pixel in the current block to be predicted
- p Int is the prediction value determined for a threshold value th i .
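- The formula itself is not reproduced in this text; as an assumption consistent with the definitions above, it has the form of a sum of squared errors, the sum running over the pixels p of the considered area (here the zone Zc):

$$\mathrm{SSE}_i \;=\; \sum_{p} \bigl(Y(p) - p_{Int}(p)\bigr)^{2}$$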
- a threshold value th_zc is determined for the zone Zc as the one generating the minimal prediction energy SSE_i.
- the energy is calculated as follows:
- the energy is calculated as follows:
- a weighting function is introduced that relativizes the prediction error of the pixels of Zc according to their distance from the boundary of the block to be predicted.
- the values of this weighting function will thus vary, for example, according to the distance of pixels relative to the centre of the block to be predicted such that:
- the threshold value th_opt for the current block to be predicted is then equal to th_zc.
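A schematic illustration of this first embodiment: several candidate thresholds are tried on the causal zone Zc and the one giving the smallest (optionally weighted) prediction energy is kept. The candidate list, the helper names and the per-pixel prediction callback (assumed to be the process sketched earlier) are all assumptions.

```python
import numpy as np

def select_threshold_from_zc(rec, zc_pixels, predict_fn, candidates=(4.0, 8.0, 16.0, 32.0, 64.0), weights=None):
    """rec: image of reconstructed pixels; zc_pixels: (row, col) positions of the causal zone Zc;
    predict_fn(rec, r, c, th): predicts the pixel (r, c) with the threshold th, using the same
    window / thresholding / inverse-transform process as for the block."""
    best_th, best_sse = None, np.inf
    for th in candidates:
        errors = np.array([(rec[r, c] - predict_fn(rec, r, c, th)) ** 2 for (r, c) in zc_pixels])
        if weights is not None:                    # variant: weight the errors according to the
            errors = errors * np.asarray(weights)  # distance of the Zc pixels to the block edge
        sse = errors.sum()
        if sse < best_sse:
            best_th, best_sse = th, sse
    return best_th   # th_zc, used as th_opt for the current block
```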
- the zone Zc can have different forms, depending in particular on the availability of neighbouring pixels. Likewise, the thickness of the zone can exceed 1 pixel.
- according to a second embodiment, the threshold value th_opt is determined for a current block by reiterating the method described in reference to FIGS. 4 and 5 with different threshold values th_i and by retaining the threshold value for which a prediction error calculated between the prediction block and the current block is smallest.
- p represents the position of a pixel contained in the block
- Y is the value of an item of image data (for example luminance and/or chrominance) of the pixel in the current block to be predicted
- p Int is the prediction value determined for a threshold value th i .
- the threshold value th_opt is the one generating the minimal prediction energy SSE_i. According to a variant, the energy is calculated as follows:
- the energy is calculated as follows:
- the value of the current threshold th opt determined according to this second embodiment is either coded directly in the stream S or advantageously is coded in the stream S by difference with a prediction threshold value th pred in order to reduce its coding cost.
- the prediction threshold value th pred is equal to the average of threshold values th opt determined for blocks neighbouring the current block and that have already been coded.
- the block to the left, the block above and the top left block are taken into account.
- the top right block is also taken into account.
- the prediction threshold value th pred is equal to the median of threshold values th opt determined for blocks neighbouring the current block and that have already been coded.
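A hedged sketch of this signalling, in which only the difference between the retained threshold and a threshold predicted from already-coded neighbouring blocks is written to the stream (names and the fallback value are assumptions):

```python
import numpy as np

def code_threshold(th_opt, neighbour_thresholds, use_median=False):
    """Return the difference to be written to the stream and the prediction threshold th_pred,
    computed as the average (or median) of the thresholds of already coded neighbouring blocks."""
    if len(neighbour_thresholds) > 0:
        th_pred = float(np.median(neighbour_thresholds)) if use_median else float(np.mean(neighbour_thresholds))
    else:
        th_pred = 0.0  # assumed fallback; the case of no coded neighbour is not detailed above
    return th_opt - th_pred, th_pred
```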
- the invention also relates to a method for decoding described in reference to FIG. 8 .
- a residue block is decoded for the current block. For example, a part of the stream S is decoded into coefficients. The coefficients are dequantized then, if necessary, transformed by the inverse of the transform used on the coder side. The residue block is thus obtained.
- a prediction pixel is determined for each pixel of a current block.
- the prediction pixels form a prediction block of the current block.
- the prediction pixels are obtained by a thresholding, with a current threshold value, of coefficients from a transformation applied on a window covering at least the pixel of the current block to be predicted.
- the window corresponds to the support of the transform.
- the transform used is for example a DCT.
- the invention is not limited to this latter.
- Other transforms such as the Discrete Fourier Transform can be applied.
- the current block is reconstructed by merging the prediction block with the decoded residue block.
- the step 22 of determination of prediction pixels is identical to step 10 of the coding method. However, on the decoder side, the threshold value th_opt is either decoded from the stream S, in the case where it was coded in this same stream on the coder side, or determined directly from reconstructed pixels in a neighbouring area of the current block.
- th_opt is decoded from the stream either directly or, in the case of differential coding of this value, by adding the decoded value to a prediction value th_pred.
- the prediction threshold value th pred is equal to the average of threshold values th opt determined for blocks neighbouring the current block and that have already been coded.
- the block to the left, the block above and the top left block are taken into account.
- the top right block is also taken into account.
- the prediction threshold value th pred is equal to the median of threshold values th opt determined for blocks neighbouring the current block and that have already been coded.
- the prediction threshold value th pred is equal to th zc , where th zc is determined as described on the coder side in reference to FIG. 7 .
- the threshold value th opt is directly determined from reconstructed pixels in the zone Zc in the same way as described on the coder side in reference to FIG. 7 .
- th opt is equal to th zc .
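As a compact sketch of the decoder-side alternatives just listed (helper and argument names are illustrative):

```python
def recover_threshold(decoded_value=None, th_pred=None, th_zc=None):
    """th_opt is either read directly from the stream, decoded differentially against th_pred,
    or derived from the causal zone Zc exactly as on the coder side (then th_opt = th_zc)."""
    if decoded_value is not None:
        return decoded_value if th_pred is None else th_pred + decoded_value
    return th_zc

def reconstruct_block(prediction_block, residue_block):
    """Step 24: merging, here a plain pixel-by-pixel addition, of prediction and decoded residue."""
    return prediction_block + residue_block
```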
- the size of the window F depends on the position in the current block of the pixel to be predicted as shown in FIG. 6 .
- the window in position F 0,0 is of a smaller size than the window in position F n-1,m-1 .
- This has the advantage of improving the relevance of the prediction block.
- the size of the window is 4×4 for the pixels situated on the top and left edges of the current block, i.e. the first lines and first columns, and the window is of size 8×8 for the other pixels of the current block.
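For example, following the rule just stated (the exact edge condition is an assumption based on the wording above):

```python
def window_size(k, l):
    """k, l: line and column of the pixel inside the current block."""
    return 4 if k == 0 or l == 0 else 8   # 4x4 support on the top/left edges, 8x8 elsewhere
```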
- the dimensions of the window or windows used are not limited to powers of 2.
- the invention is not limited to the use of so-called “fast” transformations, which apply to a number of samples that is a multiple of 2^N.
- the transformation used is not necessarily separable.
- the invention also relates to a coding device 12 described in reference to FIG. 9 and a decoding device 13 described in reference to FIG. 10 .
- the modules shown are functional units that may or may not correspond to physically distinguishable units. For example, these modules or some of them can be grouped together in a single component, or constitute functions of the same software. Conversely, some modules may be composed of separate physical entities.
- the coding device 12 receives at input an image or images.
- the coding device 12 is able to implement the coding method according to the invention described in reference to FIG. 1 .
- Each image is divided into blocks of pixels with each of which is associated at least one item of image data.
- the coding device 12 notably implements a coding with spatial prediction. Only the modules of the coding device 12 relating to coding by spatial prediction or INTRA coding are shown in FIG. 9. Other modules, not shown and known to those skilled in the art of video coders, implement the coding with temporal prediction (for example motion estimation, motion compensation).
- the coding device 12 notably comprises a calculation module 1200 able to extract, for example by pixel-by-pixel subtraction, a prediction block Pr from a current block B to generate a residue block Bres.
- the calculation module 1200 is able to implement step 12 of the coding method according to the invention. The coding device further comprises a module 1202 able to transform then quantize the residue block Bres into quantized data.
- the transform T is for example a discrete cosine transform (or DCT).
- the coding device 12 further comprises an entropy coding module 1204 able to code the quantized data into a stream S of coded data. It also comprises a module 1206 performing the inverse operation of the module 1202 .
- the module 1206 carries out an inverse quantization Q^{-1} followed by an inverse transformation T^{-1}.
- the module 1206 is connected to a calculation module 1208 capable of merging, for example by addition pixel by pixel, the block of data from the module 1206 and the prediction block Pr to generate a reconstructed block that is stored in a memory 1210 .
- a prediction module 1216 determines the prediction block Pr.
- the prediction module 1216 is able to implement step 10 of the coding method according to the invention.
- Step 14 of the coding method is implemented in the modules 1202 and 1204 .
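Schematically, and only as an illustration of how these modules chain together for one INTRA block (the quantization step and all names are assumptions; entropy coding by module 1204 is omitted):

```python
import numpy as np
from scipy.fft import dctn, idctn

def encode_block(current_block, prediction_block, q_step=16.0):
    residue = current_block.astype(float) - prediction_block     # module 1200: extraction
    quantized = np.round(dctn(residue, norm='ortho') / q_step)   # module 1202: transform T then quantization Q
    rec_residue = idctn(quantized * q_step, norm='ortho')        # module 1206: Q^-1 then T^-1
    reconstructed = prediction_block + rec_residue               # module 1208: merge, stored in memory 1210
    return quantized, reconstructed                              # quantized data go to entropy coder 1204
```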
- the decoding device 13 receives at input a stream S of coded data representative of an image.
- the stream S is for example transmitted by a coding device 12 via a channel.
- the decoding device 13 is able to implement the decoding method according to the invention described in reference to FIG. 8 .
- the decoding device 13 comprises an entropy decoding module 1300 able to generate decoded data.
- the decoded data are then transmitted to a module 1302 able to carry out an inverse quantization followed by an inverse transform.
- the module 1302 is identical to the module 1206 of the coding device 12 having generated the stream S.
- the module 1302 is connected to a calculation module 1304 able to merge, for example by addition pixel by pixel, the block from the module 1302 and a prediction block Pr to generate a reconstructed current block Bc that is stored in a memory 1306 .
- the calculation module 1304 is able to implement step 24 of the decoding method.
- the decoding device 13 also comprises a prediction module 1308 .
- the prediction module 1308 determines the prediction block Pr.
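The mirrored chain on the decoder side, under the same assumptions (the quantized coefficients are assumed to come from the entropy decoding module 1300):

```python
from scipy.fft import idctn

def decode_block(quantized, prediction_block, q_step=16.0):
    residue = idctn(quantized * q_step, norm='ortho')  # module 1302: inverse quantization and inverse transform
    return prediction_block + residue                  # module 1304: merge; result stored in memory 1306
```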
- the invention is not limited to the embodiment examples mentioned above.
- those skilled in the art may apply any variant to the stated embodiments and combine them to benefit from their various advantages.
- the invention is not limited to the transform type used (for example DCT, wavelet, Discrete Fourier Transform, etc).
- the scanning order of pixels can vary (for example raster scan, zigzag, etc.).
- the invention is in no way limited by the way an energy level is calculated (for example SSE, SAD, Max, etc.).
- the invention applies to the coding of fixed images or of image sequences.
Abstract
- A method for coding a block of pixels, called the current block, by spatial prediction is described, the method comprising:
- determining, for each pixel of the current block, a prediction pixel by thresholding, with a current threshold value, of coefficients resulting from a transformation applied on a window covering at least the pixel of the current block and by inverse transformation applied to the thresholded coefficients,
- extracting from the current block a prediction block formed of prediction pixels to generate a residue block, and
- coding said residue block, wherein the current threshold value is determined or coded from neighbouring reconstructed pixels of the current block.
Description
(The mathematical formulas accompanying the description above are not reproduced in this text; only the following elements attached to them are recoverable.)
- ... or a histogram peak type function.
- For the weighting function:
- c: standardisation coefficient,
- ρ = 0.8,
- i and j correspond to the coordinates of the weighting coefficients in the frame of the weighting window, where the centre of the block to be predicted is at (5.5, 5.5) and (11.5, 11.5) respectively for blocks of dimensions 4×4 and 8×8,
- the origin (0, 0) being at the top left.
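- A weighting consistent with these parameters would, purely as an assumption, take the form of an exponential decay with the distance to the centre of the block to be predicted:

$$w(i,j) \;=\; c\,\rho^{\sqrt{(i-i_c)^2+(j-j_c)^2}},\qquad \rho = 0.8,$$

where (i_c, j_c) is the centre of the block to be predicted, i.e. (5.5, 5.5) for 4×4 blocks and (11.5, 11.5) for 8×8 blocks, and c normalises the weights.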
Claims (12)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FR1050466 | 2010-01-25 | ||
| FR1050466A FR2955730A1 (en) | 2010-01-25 | 2010-01-25 | CODING AND DECODING METHODS |
| PCT/EP2011/050693 WO2011089158A1 (en) | 2010-01-25 | 2011-01-19 | Spatial prediction technique for video coding |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20130195181A1 (en) | 2013-08-01 |
| US9363514B2 (en) | 2016-06-07 |
Family
ID=43415367
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US13/575,105 Expired - Fee Related US9363514B2 (en) | 2010-01-25 | 2011-01-19 | Spatial prediction technique for video coding |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US9363514B2 (en) |
| EP (1) | EP2529552A1 (en) |
| JP (1) | JP5715647B2 (en) |
| KR (1) | KR101819762B1 (en) |
| CN (1) | CN102726045B (en) |
| BR (1) | BR112012017865A2 (en) |
| FR (1) | FR2955730A1 (en) |
| WO (1) | WO2011089158A1 (en) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| FR2955730A1 (en) * | 2010-01-25 | 2011-07-29 | Thomson Licensing | CODING AND DECODING METHODS |
| CN105144720B (en) | 2013-01-04 | 2018-12-28 | Ge视频压缩有限责任公司 | Efficient Scalable Coding Concept |
| KR102698537B1 (en) * | 2013-04-08 | 2024-08-23 | 지이 비디오 컴프레션, 엘엘씨 | Coding concept allowing efficient multi-view/layer coding |
| US11172215B2 (en) * | 2018-10-08 | 2021-11-09 | Qualcomm Incorporated | Quantization artifact suppression and signal recovery by the transform domain filtering |
Citations (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1998019461A1 (en) | 1996-10-30 | 1998-05-07 | Matsushita Electric Industrial Co., Ltd. | Image encoder, image encoding method, image decoder, image decoding method, and data recording medium |
| US6680974B1 (en) * | 1999-12-02 | 2004-01-20 | Lucent Technologies Inc. | Methods and apparatus for context selection of block transform coefficients |
| WO2008012918A1 (en) | 2006-07-28 | 2008-01-31 | Kabushiki Kaisha Toshiba | Image encoding and decoding method and apparatus |
| WO2008048489A2 (en) | 2006-10-18 | 2008-04-24 | Thomson Licensing | Method and apparatus for video coding using prediction data refinement |
| JP2009081489A (en) | 2007-09-25 | 2009-04-16 | Kddi Corp | Moving picture encoding apparatus and moving picture decoding apparatus |
| US20110090969A1 (en) * | 2008-05-07 | 2011-04-21 | Lg Electronics Inc. | Method and apparatus for decoding video signal |
| US20130101040A1 (en) * | 2009-10-20 | 2013-04-25 | Thomson Licensing | Method for coding a block of a sequence of images and method for reconstructing said block |
| US8446949B2 (en) * | 2008-06-23 | 2013-05-21 | Sungkyunkwan University Foundation For Corporate Collaboration | Distributed coded video decoding apparatus and method capable of successively improving side information on the basis of reliability of reconstructed data |
| US20130195181A1 (en) * | 2010-01-25 | 2013-08-01 | Thomson Licensing | Methods for coding and decoding |
| US8670488B2 (en) * | 2007-12-21 | 2014-03-11 | Telefonaktiebolaget Lm Ericsson (Publ) | Adaptive intra mode selection |
| US8699562B2 (en) * | 2008-10-06 | 2014-04-15 | Lg Electronics Inc. | Method and an apparatus for processing a video signal with blocks in direct or skip mode |
| US8818114B2 (en) * | 2008-10-01 | 2014-08-26 | Sk Telecom Co., Ltd. | Method and apparatus for image encoding/decoding |
- 2010
- 2010-01-25 FR FR1050466A patent/FR2955730A1/en not_active Withdrawn
- 2011
- 2011-01-19 US US13/575,105 patent/US9363514B2/en not_active Expired - Fee Related
- 2011-01-19 WO PCT/EP2011/050693 patent/WO2011089158A1/en active Application Filing
- 2011-01-19 CN CN201180007027.8A patent/CN102726045B/en not_active Expired - Fee Related
- 2011-01-19 BR BR112012017865A patent/BR112012017865A2/en not_active Application Discontinuation
- 2011-01-19 EP EP11700373A patent/EP2529552A1/en not_active Withdrawn
- 2011-01-19 KR KR1020127019600A patent/KR101819762B1/en not_active Expired - Fee Related
- 2011-01-19 JP JP2012549347A patent/JP5715647B2/en not_active Expired - Fee Related
Patent Citations (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0873015A1 (en) | 1996-10-30 | 1998-10-21 | Matsushita Electric Industrial Co., Ltd. | Image encoder, image encoding method, image decoder, image decoding method, and data recording medium |
| WO1998019461A1 (en) | 1996-10-30 | 1998-05-07 | Matsushita Electric Industrial Co., Ltd. | Image encoder, image encoding method, image decoder, image decoding method, and data recording medium |
| US6680974B1 (en) * | 1999-12-02 | 2004-01-20 | Lucent Technologies Inc. | Methods and apparatus for context selection of block transform coefficients |
| EP2056606A1 (en) | 2006-07-28 | 2009-05-06 | Kabushiki Kaisha Toshiba | Image encoding and decoding method and apparatus |
| WO2008012918A1 (en) | 2006-07-28 | 2008-01-31 | Kabushiki Kaisha Toshiba | Image encoding and decoding method and apparatus |
| US8542736B2 (en) * | 2006-10-18 | 2013-09-24 | Thomson Licensing | Method and apparatus for video coding using prediction data refinement |
| US20090238276A1 (en) | 2006-10-18 | 2009-09-24 | Shay Har-Noy | Method and apparatus for video coding using prediction data refinement |
| WO2008048489A2 (en) | 2006-10-18 | 2008-04-24 | Thomson Licensing | Method and apparatus for video coding using prediction data refinement |
| JP2009081489A (en) | 2007-09-25 | 2009-04-16 | Kddi Corp | Moving picture encoding apparatus and moving picture decoding apparatus |
| US8670488B2 (en) * | 2007-12-21 | 2014-03-11 | Telefonaktiebolaget Lm Ericsson (Publ) | Adaptive intra mode selection |
| US20110090969A1 (en) * | 2008-05-07 | 2011-04-21 | Lg Electronics Inc. | Method and apparatus for decoding video signal |
| US8446949B2 (en) * | 2008-06-23 | 2013-05-21 | Sungkyunkwan University Foundation For Corporate Collaboration | Distributed coded video decoding apparatus and method capable of successively improving side information on the basis of reliability of reconstructed data |
| US8818114B2 (en) * | 2008-10-01 | 2014-08-26 | Sk Telecom Co., Ltd. | Method and apparatus for image encoding/decoding |
| US8699562B2 (en) * | 2008-10-06 | 2014-04-15 | Lg Electronics Inc. | Method and an apparatus for processing a video signal with blocks in direct or skip mode |
| US20130101040A1 (en) * | 2009-10-20 | 2013-04-25 | Thomson Licensing | Method for coding a block of a sequence of images and method for reconstructing said block |
| US20130195181A1 (en) * | 2010-01-25 | 2013-08-01 | Thomson Licensing | Methods for coding and decoding |
Non-Patent Citations (18)
| Title |
|---|
| Bjontegaard, "Calculation of average PSNR differences between RD curves", Document VCEG-M33, ITU-T VCEG Meeting, Study Group 16, Austin, Texas, USA, Apr. 2, 2001, pp. 1-4. |
| Guleryuz, "Nonlinear Approximation Based Image Recovery Using Adaptive Sparse Reconstructions and Iterated Denoising-Part I: Theory," IEEE Transactions on Image Processing, US. vol. 15, No. 3, Mar. 2006, pp. 539-554. |
| Guleryuz, "Nonlinear Approximation Based Image Recovery Using Adaptive Sparse Reconstructions and iterated Denoising-Part II: Adaptive Algorithms," IEEE Transactions on Image Processing, US, vol. 15, No. 3, Mar. 2006, pp. 555-571. |
| Hua et al., "Spatial Sparsity Induced Temporal Prediction for Hybrid Video Compression", Proceedings of 2007 Data Compression Conference, Snowbird, Utah, USA, Jan. 10, 2007, pp. 1-10. |
| Ichigaya et al., "Description of Video Coding Technology Proposal by NHK and Mitsubishi", Joint Collaborative Team on Video Coding of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, 1st Meeting, Dresden, Germany, Apr. 15, 2010, pp. 1-17. |
| Kim et al., "Description of Video Coding Technology Proposal by ETRI", Joint Collaborative Team on Video Coding of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, 1st Meeting, Dresden, Germany, Apr. 15, 2010, pp. 1-10. |
| Martin et al., "Atomic decomposition dedicated to AVC and spatial SVC prediction," Proceedings of the 15th International Conference on Image Processing (ICIP 2008). IEEE, US, Oct. 12, 2008, pp. 2492-2495. |
| Martin et al., "Phase refinement for image prediction based on sparse representation," Proceedings of the SPIE Bellingham. VA, US, vol. 7543, Jan. 19, 2010, pp. 1-8. |
| Search Report Dated Jul. 1, 2011. |
| Suhring, "KTA Software", Fraunhofer Heinrich Hertz Institute (HHI), http://iphome.hhi.de/suehring/tml/download/KTA/, Jan. 6, 2011, pp. 1. |
| Tan et al., "Intra Prediction by Template Matching", 2006 International Conference on Image Processing, Atlanta, Georgia, USA, Oct. 8, 2006, pp. 1693-1696. |
| Thoreau et al., "Sparse Shift-DCT Spatial Prediction", Proceedings of 2010 IEEE 17th International Conference on Image Processing, Hong Kong, China, Sep. 26, 2010, pp. 3385-3388. |
| Turkan et al., "Sparse approximation with adaptive dictionary for image prediction", 2009 International Conference on Image Processing, Cairo, Egypt, Nov. 7, 2009, pp. 25-28. |
| Wang et al., "Video Quality Assessment Based on Structural Distortion Measurement", Signal Processing: Image Communication, vol. 19, No. 1, Jan. 2004, pp. 1-9. |
| Wiegand et al., "Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification (ITU-T Rec. H.264 ISO/IEC 14496-10 AVC)", Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6), 7th Meeting, Pattaya, Thailand, Mar. 7, 2003, pp. 1-264. |
| Wiegand et al., "Overview of the H.264/AVC Video Coding Standard", IEEE Transactions on Circuits and Systems for Video Technology, July 2003, vol. 13, No. 7, pp. 560-576. |
| Xu et al., "Sparsity-based deartifacting filtering in video compression", 2009 International Conference on Image Processing, Cairo, Egypt, Nov. 7, 2009, pp. 3933-3936. |
| Ye et al., "Improved H.264 Intra Coding Based on Bi-Directional Intra Prediction, Directional Transform, and Adaptive Coefficient Scanning", 2008 International Conference on Image Processing, San Diego, California, USA, Oct. 12, 2008, pp. 2116-2119. |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2011089158A1 (en) | 2011-07-28 |
| BR112012017865A2 (en) | 2016-04-19 |
| US20130195181A1 (en) | 2013-08-01 |
| FR2955730A1 (en) | 2011-07-29 |
| CN102726045B (en) | 2016-05-04 |
| EP2529552A1 (en) | 2012-12-05 |
| KR20120118466A (en) | 2012-10-26 |
| KR101819762B1 (en) | 2018-01-17 |
| JP2013518456A (en) | 2013-05-20 |
| JP5715647B2 (en) | 2015-05-13 |
| CN102726045A (en) | 2012-10-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US7756348B2 (en) | Method for decomposing a video sequence frame | |
| EP2063644A2 (en) | Image encoding device and encoding method, and image decoding device and decoding method | |
| US8520960B2 (en) | Method for decoding a stream of coded data representative of a sequence of images and method for coding a sequence of images | |
| KR101379255B1 (en) | Method and apparatus for encoding and decoding based on intra prediction using differential equation | |
| US7471726B2 (en) | Spatial-domain lapped transform in digital media compression | |
| KR101354151B1 (en) | Method and apparatus for transforming and inverse-transforming image | |
| US8285064B2 (en) | Method for processing images and the corresponding electronic device | |
| US8369417B2 (en) | Optimal denoising for video coding | |
| US11134242B2 (en) | Adaptive prediction of coefficients of a video block | |
| US9877025B2 (en) | Video encoding and decoding with prediction at higher precision | |
| KR20090095014A (en) | Method and apparatus for encoding and decoding image usging filtered prediction block | |
| US8594189B1 (en) | Apparatus and method for coding video using consistent regions and resolution scaling | |
| US9363514B2 (en) | Spatial prediction technique for video coding | |
| US20070076964A1 (en) | Method of and an apparatus for predicting DC coefficient in transform domain | |
| KR20090098214A (en) | Image encoding and decoding method and apparatus | |
| US20250150595A1 (en) | Apparatuses and Methods for Encoding or Decoding a Picture of a Video | |
| EP2587803A1 (en) | Methods for coding and reconstructing a pixel block and corresponding devices. | |
| US8792549B2 (en) | Decoder-derived geometric transformations for motion compensated inter prediction | |
| US9781446B2 (en) | Method for coding and method for decoding a block of an image and corresponding coding and decoding devices | |
| WO2017220164A1 (en) | Devices and methods for video coding using segmentation based partitioning of video coding blocks | |
| US20130195371A1 (en) | Method for coding and for reconstruction of a block of an image sequence and corresponding devices | |
| Hussien et al. | Multi-Frame Video Compression Scheme Using Three Step Search (TSS) Matching Algorithm | |
| Hussien et al. | DWT based-video compression using (4SS) matching algorithm | |
| Younis et al. | Dept. of Computer Science, College of Science, University of Basrah, Basrah, IRAQ |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | AS | Assignment | Owner name: THOMSON LICENSING, FRANCE. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: THOREAU, DOMINIQUE; MARTIN, AURELIE; FRANCOIS, EDOUARD; AND OTHERS; SIGNING DATES FROM 20120919 TO 20121002; REEL/FRAME: 031284/0525 and 031284/0517 |
| | AS | Assignment | Owner name: THOMSON LICENSING, FRANCE. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: THOREAU, DOMINIQUE; MARTIN, AURELIE; FRANCOIS, EDOUARD; AND OTHERS; SIGNING DATES FROM 20120919 TO 20121002; REEL/FRAME: 031506/0838 |
| | STCF | Information on status: patent grant | Free format text: PATENTED CASE |
| | AS | Assignment | Owner name: THOMSON LICENSING DTV, FRANCE. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: THOMSON LICENSING; REEL/FRAME: 041370/0433. Effective date: 20170113 |
| | AS | Assignment | Owner name: THOMSON LICENSING DTV, FRANCE. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: THOMSON LICENSING; REEL/FRAME: 041378/0630. Effective date: 20170113 |
| | AS | Assignment | Owner name: INTERDIGITAL MADISON PATENT HOLDINGS, FRANCE. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: THOMSON LICENSING DTV; REEL/FRAME: 046763/0001. Effective date: 20180723 |
| | FEPP | Fee payment procedure | Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
| | LAPS | Lapse for failure to pay maintenance fees | Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
| | STCH | Information on status: patent discontinuation | Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
| | FP | Expired due to failure to pay maintenance fee | Effective date: 20200607 |