WO2016055484A1 - Method and apparatus for vector encoding in video coding and decoding - Google Patents

Method and apparatus for vector encoding in video coding and decoding Download PDF

Info

Publication number
WO2016055484A1
WO2016055484A1 PCT/EP2015/073060 EP2015073060W WO2016055484A1 WO 2016055484 A1 WO2016055484 A1 WO 2016055484A1 EP 2015073060 W EP2015073060 W EP 2015073060W WO 2016055484 A1 WO2016055484 A1 WO 2016055484A1
Authority
WO
WIPO (PCT)
Prior art keywords
block
coding tree
image
mode
blocks
Prior art date
Application number
PCT/EP2015/073060
Other languages
French (fr)
Inventor
Guillaume Laroche
Christophe Gisquet
Patrice Onno
Original Assignee
Canon Kabushiki Kaisha
Canon Europe Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Kabushiki Kaisha, Canon Europe Ltd filed Critical Canon Kabushiki Kaisha
Priority to KR1020177011152A priority Critical patent/KR102076398B1/en
Priority to CN201580053419.6A priority patent/CN106797464B/en
Priority to EP15781605.9A priority patent/EP3205091B1/en
Priority to RU2017115409A priority patent/RU2663348C1/en
Priority to US15/516,856 priority patent/US11051037B2/en
Priority to JP2017517079A priority patent/JP6590918B2/en
Publication of WO2016055484A1 publication Critical patent/WO2016055484A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • G06T9/40Tree coding, e.g. quadtree, octree
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/156Availability of hardware or computational resources, e.g. encoding based on power-saving criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/174Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/182Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/436Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96Tree coding, e.g. quad-tree coding

Definitions

  • the present invention concerns a method and a device for encoding or decoding blocks of pixels in the process of encoding or decoding a video. It concerns more particularly methods to handle parallelization when using INTRA Block Copy mode of HEVC Screen Content extension. It is based on the control of the area available for providing predictor blocks in INTRA Block Copy mode. It applies more particularly to a mode of coding where a block of pixel is predictively encoded based on a predictor block pertaining to the same image. This mode of encoding a block of pixel is generally referred to as INTRA Block Copy mode. It is considered as a tool candidate for the Screen content Extension of the High Efficiency Video Coding (HEVC: ISO/IEC 23008-2 MPEG-H Part 21 ITU-T H.265) international standard and now in the Screen Content extension of the same.
  • HEVC High Efficiency Video Coding
  • Coding Tree Block When encoding an image in a video sequence, the image is first divided into coding elements that are entities of pixels of equal size referred to as Coding Tree Block (Coding Tree Block).
  • the size of a Coding Tree Block is typically 64 by 64 pixels.
  • Each Coding Tree Block may then be decomposed in a hierarchical tree of smaller blocks which size may vary and which are the actual blocks to encode. These smaller blocks to encode are referred to as Coding Unit (CU).
  • CU Coding Unit
  • the encoding of a particular Coding Unit is typically predictive. This means that a predictor block is first determined. Next, the difference between the predictor block and the Coding Unit is calculated. This difference is called the residual . Next, this residual is compressed. The actual encoded information of the Coding Unit is made of some information to indicate the way of determining the predictor block and the compressed residual . Best predictor blocks are blocks as similar as possible to the Coding Unit in order to get a small residual that could be efficiently compressed.
  • Encoding may be lossy, meaning that information is lost in the encoding process.
  • the decoded block of pixel is not exactly the same as the original Coding Unit.
  • the loss of information comes from a quantization applied to the residual before entropy coding. This quantization allows a higher compression rate at the price of the loss of accuracy.
  • high frequencies namely the high level of details, are removed in the block.
  • Encoding may be lossless, meaning that the residual is not quantized. This kind of encoding allows retrieving the exact copy of the original samples of the Coding Unit. The lossless encoding is obtained at the expense of compression rate which is much smaller compared to a lossy compression.
  • the coding mode is defined based on the method used to determine the predictor block for the predictive encoding method of a Coding Unit.
  • a first coding mode is referred to as INTRA mode.
  • the predictor block is built based on the value of pixels immediately surrounding the Coding Unit within the current image. It is worth noting that the predictor block is not a block of the current image but a construction. A direction is used to determine which pixels of the border are actually used to build the predictor block and how they are used.
  • the idea behind INTRA mode is that, due to the general coherence of natural images, the pixels immediately surrounding the Coding Unit are likely to be similar to pixels of the current Coding Unit. Therefore, it is possible to get a good prediction of the value of pixels of the Coding Unit using a predictor block based on these surrounding pixels.
  • a second coding mode is referred to as INTER mode.
  • the predictor block is a block of another image.
  • the idea behind the INTER mode is that successive images in a sequence are generally very similar. The main difference comes typically from a motion between these images due to the scrolling of the camera or due to moving objects in the scene.
  • the predictor block is determined by a vector giving its location in a reference image relatively to the location of the Coding Unit within the current image. This vector is referred to as a motion vector.
  • the encoding of such Coding Unit using this mode comprises motion information comprising the motion vector and the compressed residual.
  • a third coding mode called INTRA Block
  • the block predictor is an actual block of the current image.
  • a block vector is used to locate the predictor block. This block vector gives the location in the current image of the predictor block relatively to the location of the Coding Unit in the same current image. It comes that this block vector shares some similarities with the motion vector of the INTER mode. It is sometime called motion vector by analogy. As there could not be a motion within an image, strictly speaking, and for the sake of clarity, in this document motion vector always refer to the INTER mode while block vector is used for the INTRA Block Copy mode.
  • the causal principle is the principle that states that all information to decode a particular Coding Unit must be based on already reconstructed Coding Units. At encoding, the whole information may be considered as available. Namely, to encode a given Coding Unit it would be possible to use any information from the entire current images or from all decoded and available other images in the sequence. At decoding, things are different.
  • the decoding of the current images is typically done by decoding sequentially all Coding Unit.
  • the order of decoding follows typically a raster scan order, namely beginning in the upper left of the image, progressing from left to right and from top to bottom. It come that when decoding a given Coding Unit, only the part of the current image located up or left to the current Coding Unit has already been decoded.
  • a predictor block in INTRA Block Copy mode should pertain to the part of the image that will be available at decoding.
  • the predictor block is determined using the block vector.
  • the residual is decoded and applied to the predictor to obtain a raw reconstructed block.
  • some post filtering is applied.
  • a first filter is applied to remove some artefacts in the reconstructed image due to the block encoding. This filter is called the deblocking filter.
  • a sample adaptive loop filter (SAO) is then applied to get the final image.
  • SAO sample adaptive loop filter
  • the processing is parallelized in order to speed up the process.
  • a particular coding tree block is reconstructed while the previous one is filtered for example. Namely, reconstruction of some coding tree block and filtering of others are made in parallel.
  • the HEVC standard offers some high level of parallelism as Wavefront or Tiles or Slices for frame parallelism and flexible reference frames management for Inter parallelism. These tools are not mandatory, yet, the decoder needs to decode their related syntax even if they are not mandatory.
  • the Wavefront parallel processing is based on parallelizing the reconstruction of lines of Coding Tree blocks. Namely, a number of Coding Tree Blocks are reconstructed in parallel. A delay is introduced between the treatment of each line due to the fact that the reconstruction of a subsequent line of Coding Tree Blocks needs some information from the previous line. It means that the reconstruction of the different lines being parallelized is going on with a delay between each line.
  • This Wavefront parallel processing may prove to have a problem when reconstructing a particular coding unit encoded according to INTRA Block Copy mode.
  • the block predictor for a coding unit encoded according to INTRA Block Copy mode may be located anywhere in the complete causal area, namely the previous Coding Tree Block lines and the previous Coding Tree Blocks in the current line. As the previous lines are reconstructed in parallel with the considered one, it may happen that the predictor block has not yet been reconstructed at the time it is needed for the reconstruction of the coding unit encoded according to INTRA Block Copy mode. As such INTRA Block Copy mode is not fully compatible with Wavefront parallel reconstruction.
  • the present invention has been devised to address one or more of the foregoing concerns.
  • a method of encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is encoded based on a predictor block being a block of the current image, the method comprising: determining the search area for said one mode as an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
  • encoding is performed using Wavefront parallel processing.
  • a method of decoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is decoded based on a predictor block being a block of the current image, the method comprising: restricting the area from which said predictor block may be obtained for said one mode to an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where:
  • X represents the horizontal coordinate
  • Y represents the vertical one, the origin being in the top left corner of the image
  • (X Q , Y Q ) are the coordinates of the current Coding Tree block.
  • the decoding is performed using Wavefront parallel processing.
  • a device for encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is encoded based on a predictor block being a block of the current image, the device comprising: means for determining the search area for said one mode as an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
  • a device for decoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being decoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is decoded based on a predictor block being a block of the current image, the device comprising: means for restricting the area from which said predictor block may be obtained to an area constituted by the reconstructed blocks of the current Coding Tree block and the Coding Tree blocks of coordinates (X, Y) where :
  • restricting the area may take the form of not performing (e.g. stopping) the decoding process if the area from which a predictor block is to be obtained is found to be outside of the area constituted by the reconstructed blocks of the current Coding Tree block and the Coding Tree blocks of coordinates (X, Y) where :
  • X represents the horizontal coordinate
  • Y represents the vertical one, the origin being in the top left corner of the image
  • (X Q , Y Q ) are the coordinates of the current Coding Tree block.
  • a system for encoding and decoding an image comprising a device for encoding an image according to the preceding encoder aspects and a device for decoding an image according to the preceding decoder aspects.
  • the device for encoding and the device for decoding may be configured to use Wavefront parallel processing.
  • the device for encoding and the device for decoding may be configured to use the same number of synchronized threads for respectively encoding and decoding the image.
  • a bitstream comprising encoded images, wherein encoded images have been encoded according to the preceding encoding aspects.
  • a bitstream comprising an encoded sequence of images the images each comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels having been encoded according to a mode out of a plurality of modes, one mode being a mode in which the block is encoded based on a predictor block being a block of the current image, wherein the position of any predictor block indicated by the bitstream is restricted to an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
  • X represents the horizontal coordinate
  • Y represents the vertical one, the origin being in the top left corner of an encoded image
  • (X Q , Y Q ) are the coordinates of the current Coding Tree block.
  • a machine readable carrier or storage medium having stored thereon a bitstream according to the preceding bitstream aspects.
  • the carrier may also be a signal on which said bitstream is embodied.
  • a computer program product for a programmable apparatus comprising a sequence of instructions for implementing a method according to any of the preceding method aspects, when loaded into and executed by the programmable apparatus.
  • a computer-readable storage medium storing instructions of a computer program for implementing a method according to any one of the preceding method aspects.
  • a method of encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
  • the implementation is simple in order to allow the Wavefront process.
  • a method of encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
  • the search area is bigger, which leads to a better encoding.
  • a method of encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates ⁇ X, Y) such as :
  • the search area is bigger, which leads to a better encoding.
  • a method of encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
  • a method of encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates ⁇ X, Y) such as :
  • a method of encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the encoding being carried on by a plurality of parallel threads of encoding, each threads being dedicated to the encoding of a line of Coding Tree blocks, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by, for a current INTRA Block Copy block, all data reconstructed by all threads of the previous Coding Tree Block lines and the current Coding Tree Block line.
  • a method of encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the encoding being carried on by a plurality of synchronized parallel threads of encoding, each threads being dedicated to the encoding of a line of Coding Tree blocks, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by, for a current INTRA Block Copy block, all data reconstructed by all threads (including the current Coding Tree Block).
  • encoding is done according to Wavefront parallelized mode.
  • a method of decoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being decoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the decoding being carried on by a plurality parallel threads of decoding, each threads being dedicated to the decoding of a line of Coding Tree blocks, wherein said plurality of threads are synchronized.
  • a device for encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image
  • the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
  • a device for encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image
  • the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
  • a device for encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image
  • the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates ⁇ X, Y) such as :
  • a device for encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image
  • the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
  • a device for encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image
  • the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
  • a device for encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the encoding being carried on by a plurality of parallel threads of encoding, each threads being dedicated to the encoding of a line of Coding Tree blocks, the device comprising means for determining the search range for INTRA Block Copy mode as the area constituted by, for a current INTRA Block Copy block, all data reconstructed by all threads of the previous Coding Tree Block lines and the current Coding Tree Block line.
  • a device for encoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the encoding being carried on by a plurality of synchronized parallel threads of encoding, each thread being dedicated to the encoding of a line of Coding Tree blocks, the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by, for a current INTRA Block Copy block, all data reconstructed by all threads.
  • a device for decoding an image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being decoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image
  • the device comprising: means for processing a plurality of parallel threads of decoding, each thread being dedicated to the decoding of a line of Coding Tree blocks; and wherein synchronization means for synchronizing said plurality of threads.
  • a system for encoding and decoding an image comprising an encoder according to the invention and a decoder according to the invention.
  • the encoder and the decoder are using the same number of synchronized threads for respectively encoding and decoding the image.
  • bitstream comprising encoded images, wherein encoded images have been encoded according to the invention.
  • a computer program product for a programmable apparatus, the computer program product comprising a sequence of instructions for implementing a method according to the invention, when loaded into and executed by the programmable apparatus.
  • a computer-readable storage medium storing instructions of a computer program for implementing a method according to the invention.
  • Some aspects of the invention recited above mention the mode of the plurality of modes being an Intra block copy mode, however, as will be appreciated, this is merely an arbitrary label for this mode and is not intended to be limited. Accordingly, those aspects have within their intended scope any mode in which a block is encoded (or decoded) based on a predictor block being an actual block of the current image being encoded (or decoded) whether that mode is referred to as an Intra block copy mode or otherwise. At least parts of the methods according to the invention may be computer implemented.
  • the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a "circuit", "module” or "system”.
  • the present invention may take the form of a computer program product embodied in any tangible medium of expression having computer usable program code embodied in the medium.
  • a tangible carrier medium may comprise a storage medium such as a floppy disk, a CD-ROM, a hard disk drive, a magnetic tape device or a solid state memory device and the like.
  • a transient carrier medium may include a signal such as an electrical signal, an electronic signal, an optical signal, an acoustic signal, a magnetic signal or an electromagnetic signal, e.g. a microwave or RF signal.
  • Figure 1 illustrates the HEVC encoder architecture
  • Figure 2 illustrates the HEVC decoder architecture
  • Figure 3 illustrates the Level decomposition of Video frame
  • FIG. 4 illustrates the principle of Wavefront processing
  • Figure 5 illustrates the location of block for the initialization of context variable with Wavefront
  • Figure 6 illustrates the concept of the causal area
  • Figure 7 illustrates the INTRA Block Copy search area
  • FIG. 9 illustrates one embodiment of the invention
  • Figure 10 illustrates one embodiment of the invention
  • Figure 11 illustrates one embodiment of the invention
  • FIG. 12 illustrates one embodiment of the invention
  • FIG. 13 illustrates one embodiment of the invention.
  • Figure 14 is a schematic block diagram of a computing device for implementation of one or more embodiments of the invention.
  • Figure 1 illustrates the HEVC encoder architecture.
  • an original sequence 101 is divided into blocks of pixels 102.
  • a coding mode is then affected to each block.
  • An INTRA Coding Unit is generally predicted from the encoded pixels at its causal boundary by a process called INTRA prediction.
  • Temporal prediction first consists in finding in a previous or future frame called the reference frame 116 the reference area that most closely matches the Coding Unit in a motion estimation step 104. This reference area constitutes the predictor block. Next this Coding Unit is predicted using the predictor block to compute the residual in a motion compensation step 105.
  • a residual is computed by subtracting the Coding Unit from the original predictor block.
  • a prediction direction is encoded.
  • the temporal prediction at least one motion vector is encoded.
  • a motion vector is not directly encoded. Indeed, assuming that motion is homogeneous, it is particularly interesting to encode a motion vector as a difference between this motion vector, and a motion vector in its surrounding.
  • H.264/AVC coding standard for instance, motion vectors are encoded with respect to a median vector computed between 3 blocks located above and on the left of the current block.
  • the mode optimizing the rate distortion performance is selected in module 106.
  • a transform typically a DCT
  • a quantization is applied to the coefficients in module 108.
  • the quantized block of coefficients is then entropy coded in module 109 and the result is inserted in the bitstream 110.
  • the encoder then performs a decoding of the encoded frame for the future motion estimation in modules 111 to 116. These steps allow the encoder and the decoder to have the same reference frames.
  • the residual is inverse quantized in module 111 and inverse transformed in module 112 in order to provide the "reconstructed" residual in the pixel domain. According to the encoding mode (INTER or INTRA), this residual is added to the INTER predictor 114 or to the INTRA predictor 113.
  • this first reconstruction is filtered in module 115 by one or several kinds of post filtering.
  • These post filters are integrated in the encoded and decoded loop. It means that they need to be applied on the reconstructed frame at encoder and decoder side in order to use the same reference frame at encoder and decoder side.
  • the aim of this post filtering is to remove compression artifacts.
  • Figure 2 have been represented the principle of a decoder.
  • the video stream 201 is first entropy decoded in a module 202.
  • the residual data are then inverse quantized in a module 203 and inverse transformed in a module 204 to obtain pixel values.
  • the mode data are also entropy decoded in function of the mode, an INTRA type decoding or an INTER type decoding is performed.
  • an INTRA predictor is determined in function of the INTRA prediction mode specified in the bitstream 205. If the mode is INTER, the motion information is extracted from the bitstream 202. This is composed of the reference frame index and the motion vector residual. The motion vector predictor is added to the motion vector residual to obtain the motion vector 210. The motion vector is then used to locate the reference area in the reference frame 206. Note that the motion vector field data 211 is updated with the decoded motion vector in order to be used for the prediction of the next decoded motion vectors.
  • This first reconstruction of the decoded frame is then post filtered 207 with exactly the same post filter as used at encoder side.
  • the output of the decoder is the de-compressed video 209.
  • This INTRA Block Copy coding mode is particularly well suited for extremely repetitive patterns. In particular, it is known to help coding graphical elements such as glyphs, the graphical representation of a character, or traditional GUI elements, which are very difficult to code using traditional INTRA prediction methods.
  • prediction is based on coherence between neighboring Coding Units.
  • This coherence may be geographic when considered within the current frame or temporal when considered across successive frames.
  • This kind of coherence occurs in natural images.
  • INTRA Block Copy encoding mode is seen as a mode dedicated to text or symbolic images, predication is thought as useless for this kind of image. For instance, there is no reason to have two successive Coding Units in an image representing a text having good predictors close to each other. The first Coding Unit may be the part of letter "A", a good predictor block would therefore come from another "A” in the text. While the next Coding Unit would be a "P" letter having a predictor block from another "P” in the text. There is no reason, a-priori, to have the two predictor blocks in the same neighborhood. This is why prior art does not contemplate introducing prediction in INTRA Block Copy encoding mode.
  • SEI messages contain information related to the display process, and are therefore optional.
  • Figure 3 shows the coding structure used in HEVC.
  • the original video sequence 301 is a succession of digital images "images i".
  • images i digital images
  • a digital image is represented by one or more matrices the coefficients of which represent pixels.
  • the images 302 are divided into slices 303.
  • a slice is a part of the image or the entire image.
  • these slices are divided into non-overlapping Coding Tree Blocks (CTB) 304, generally blocks of size 64 pixels x 64 pixels.
  • Each Coding Tree Block may in its turn be iteratively divided into smaller variable size Coding Units (CUs) 305 using quadtree decomposition.
  • Coding units are the elementary coding elements and are constituted of two sub units which Prediction Unit (PU) and Transform Units (TU) of maximum size equal to the Coding Unit's size.
  • Prediction Unit corresponds to the partition of the Coding Unit for prediction of pixels values.
  • Each Coding Unit can be further partitioned into a maximum of 4 square Partition Units or 2 rectangular Partition Units 306.
  • Transform units are used to represent the elementary units that are spatially transform with DCT.
  • a Coding Unit can be partitioned in TU based on a quadtree representation 307.
  • Each slice is embedded in one NAL unit.
  • the coding parameters of the video sequence are stored in dedicated NAL units called parameter sets.
  • parameter sets In HEVC and H.264/AVC two kinds of parameter sets NAL units are employed: first, the Sequence Parameter Set (SPS) NAL unit that gathers all parameters that are unchanged during the whole video sequence. Typically, it handles the coding profile, the size of the video frames and other parameters. Secondly, Picture Parameter Sets (PPS) codes the different values that may change from one frame to another.
  • HEVC include also Video Parameter Set (VPS) which contains parameters describing the overall structure of the stream.
  • SPS Sequence Parameter Set
  • PPS Picture Parameter Sets
  • VPS Video Parameter Set
  • the HEVC standard offers some high level of parallelism as Wavefront or Tiles or Slices for frame parallelism and flexible reference frames management for Inter parallelism. These tools are not mandatory, yet, the decoder needs to decode their related syntax even if they are not mandatory.
  • the invention is dedicated to the Wavefront processing when it is combined with the INTRA Block Copy tools of the Screen Content extension of HEVC.
  • the principle of Wavefront processing is presented in Figure 4.
  • the principle is to parallelize the decoding process of several lines of Coding Tree Block.
  • the Wavefront keeps a large majority of predictions.
  • the Wavefront introduces a delay between each line for the parallelization.
  • 4 threads are run in parallel. So, 4 current Coding Tree Block are decoded in parallel. There is a delay between threads; for example, the second stream needs some information decoded by the first thread.
  • the context variable of the CABAC are initialized with the spatial neighbouring block T as depicted in Figure 5. More precisely the context variable of the CABAC takes the same values as those of block T. This block is the first block of the top right Coding Tree Block. If this block T is not available the context variables are initialized as the first Coding Tree Block of a frame.
  • the values of the context variables of the first Coding Tree Block of Coding Tree Block line is set equal to the values of the context variables of the last block of the last Coding Tree Block of the previous Coding Tree Block line.
  • this variable context are initialized with those of the top right Coding Tree Block (T). This is the only change that is needed at decoder side to use Wavefront at both encoder and decoder.
  • the Screen Content Coding extension of HEVC under definition contains additional tools to efficiently code screen coding sequences.
  • the current added tools are the Intra block copy mode, the Palette mode and the residual color transform.
  • the current invention is dedicated to the Intra block copy mode only, so only this mode is described in the following.
  • the Palette mode and the INTRA Block Copy mode are new Intra modes and consequently added to the modules 103 and 205 of respectively Figure 1 and 2.
  • the Intra Block Copy (IBC) was added as an additional mode for Screen content coding extension of HEVC.
  • This prediction method is particularly well suited for extremely repetitive patterns.
  • it is known to help coding graphical elements such as glyphs (i.e., the graphical representation of a character) or traditional GUI elements, which are very difficult to code using traditional intra prediction methods.
  • Figure 6 illustrates how this Intra Block Copy prediction mode works.
  • an image is divided into Coding Units that are encoded in raster scan order.
  • Area 603 is called the causal area of the Coding Unit 601.
  • This next Coding Unit, as well as all the next ones belongs to area 604 illustrated as doted area, and cannot be used for coding the current Coding Unit 601. It is worth noting that the causal area is constituted by raw reconstructed blocks.
  • the information used to encode a given Coding Unit is not the original blocks of the image for the reason that this information is not available at decoding.
  • the only information available at decoding is the reconstructed version of the blocks of pixels in the causal area, namely the decoded version of these blocks. For this reason, at encoding, previously encoded blocks of the causal area are decoded to provide this reconstructed version of these blocks.
  • INTRA Block Copy works by signaling a block 602 in the causal area which should be used to produce a prediction of block 601. For example, the block 602 may be found by using a matching algorithm. In the HEVC Screen content Extension, this block is indicated by a block vector 605, and the residual of this vector according to a predictor is transmitted in the bitstream.
  • the INTRA Block Copy predictor comes from all the reconstructed causal area of the current frame. As for other Intra modes, the causal area is not loop filtered.
  • This block vector is the difference in coordinates between a particular point of the Coding Unit 601 and the equivalent point in the predictor block 602. Although it would be possible to use subpixel accuracy as for INTER blocks, this displacement is typically in integer units of pixels, therefore not to require costly subpixel interpolation.
  • each INTRA Block Copy Coding Unit can be split into one or 2 PUs as depicted in Figure 3.
  • the Coding Unit can be also split into 4 PUs of 4x4 pixels each.
  • the NxN partition is not available. It means that the 4x4 block size can't be used for Inter mode.
  • the following table summarizes the block size for both modes.
  • the search area at encoder side depends on the blocks sizes. This is represented in the following table:
  • 2NxN and Nx2N PU sizes are tested only for 8x8 Coding Units in the current encoder implementation. These sizes are not depicted in this table.
  • the two Coding Tree Blocks search range corresponds to the left Coding Tree Block 703 and to the blocks of the current Coding Tree Block 702 already encoded/decoded.
  • the blocks of current Coding Tree Block already encoded are depicted in dotted area in Figure 7.
  • the full frame search corresponds to all the Coding Tree Blocks already encoded/decoded 704.
  • the "block" vector is the difference in coordinates between a particular point in a block 601 to encode and the equivalent point in the predictor block 602 of Figure 6.
  • This displacement is in integer units of pixels, therefore it doesn't require costly subpixel interpolation.
  • This block vector (BV) is itself predicted using a predictor which can be the left, the above BV or the latest decoded block vector of the current Coding Tree Block or the latest of the latest decoded BV.
  • This vector predictors come of course from the decoded Intra Block Copy block. With these methods a predictor index is transmitted.
  • INTRA Block Copy is an Intra mode so its predictors come from the raw reconstructed data before any loop filtering.
  • the decoder implementations using Wavefront processing should be decreased in decoding.
  • an INTRA Block Copy block predictor can come from a Coding Tree Block which has not been reconstructed. So, it means that the decoder can fully wait for the decoding process of this INTRA Block Copy predictor. So by considering the worst case, which is that each first block of each Coding Tree Block line points to the last block of each previous Coding Tree Block line, the decoding process with Wavefront can't be significantly faster than the classical decoding.
  • the INTRA Block Copy search range is limited to all left, top left and top Coding Tree Blocks and of course the reconstructed blocks of the current Coding Tree Block.
  • the INTRA Block Copy search range is the area in the image where a predictor block may be searched for the encoding of a given coding unit according to INTRA Block Copy mode. It means that the top right Coding Tree Block are considered as unavailable for INTRA Block Copy prediction at encoder side and consequently no INTRA Block Copy block predictor for the current Coding Tree Block can point to any top Right Coding Tree Block at decoder side.
  • Figure 9 shows this embodiment for the current Coding Tree Block of thread 4.
  • the search range for INTRA Block Copy mode is determined as the area constituted by the Coding Tree Blocks of coordinates (X, Y) such that: Y ⁇ Y 0 and X ⁇ X 0
  • all Coding Tree Blocks located to the left of a diagonal that starts at top right of the current Coding Tree Block and finishes at the top edge of the image are available for the INTRA Block Copy prediction.
  • the mentioned diagonal follows a stepped (i.e. ladder shaped) path that follows a line that travels one CTB in the positive direction along the x-axis followed by one CTB in the negative direction along the y-axis and so on until the line reaches the top edge of the image.
  • any already reconstructed CTBs of the current CTB row (thread) and any reconstructed blocks of the current CTB are also available for the INTRA Block Copy prediction.
  • Figure 10 shows this embodiment for the current Coding Tree Block of the 4 th thread.
  • the INTRA Block Copy predictors for the current Coding Tree Block come from this area only. It corresponds to an encoder using one Coding Tree Block delay between threads.
  • the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such that:
  • the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such that:
  • This embodiment increases the coding efficiency compared to the first one, but as shown in Figure 11 , potentially this search range increase the delay between thread.
  • each INTRA Block Copy block of a thread can access only the blocks reconstructed by this same thread. It corresponds to the use of a Coding Tree Block line search area for INTRA Block Copy at encoder side.
  • This embodiment offers a flexible decoding for Wavefront processing. Indeed, there are no additional dependencies between Coding Tree Block lines compared to an implementation without INTRA Block Copy but it reduces the coding efficiency.
  • the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such that:
  • both encoder and decoder use the wavefront parallelism with the required number of threads.
  • the threads are synchronized. It means that each thread, before starting the decoding of the following Coding Tree Block, waits for the decoding end of all Coding Tree Block of the other threads. In that case, at encoder and decoder, the reconstruction of all blocks is synchronized at Coding Tree Block level. In this embodiment, an INTRA Block Copy block of a thread can access all reconstructed Coding Tree Block available, even if the Coding Tree Block is in the below Coding Tree Block lines.
  • each INTRA Block Copy block of each decoded Coding Tree Block (marked by an X) can access all available data of thread 4 in the classical implementation.
  • each block of a Coding Tree Block can access all reconstructed blocks of its own Coding Tree Block.
  • the advantage of this embodiment is that it increases the average number of blocks for each Coding Tree Block and it increases the search area for the first Coding Tree Block line for which the bitrate is generally higher due to the lack of possible predictions compared to other following Coding Tree Block lines.
  • a mono-threaded decoder can obtain the same decoding results than multithreaded decoder. Indeed, only the Coding Tree Block synchronization is mandatory.
  • Figure 13 shows the decoding order of CTB when a mono-threaded decoder implementation is used for a 2 CTB delay. Another order is needed if the delay considered is 1 CTB.
  • FIG. 14 is a schematic block diagram of a computing device 1400 for implementation of one or more embodiments of the invention.
  • the computing device 1400 may be a device such as a micro-computer, a workstation or a light portable device.
  • the computing device 1400 comprises a communication bus connected to:
  • central processing unit 1401 such as a microprocessor
  • RAM random access memory 1402
  • the executable code of the method of embodiments of the invention as well as the registers adapted to record variables and parameters necessary for implementing the method for encoding or decoding at least part of an image according to embodiments of the invention, the memory capacity thereof can be expanded by an optional RAM connected to an expansion port for example;
  • ROM read only memory
  • a network interface 1404 is typically connected to a communication network over which digital data to be processed are transmitted or received.
  • the network interface 1404 can be a single network interface, or composed of a set of different network interfaces (for instance wired and wireless interfaces, or different kinds of wired or wireless interfaces). Data packets are written to the network interface for transmission or are read from the network interface for reception under the control of the software application running in the CPU 1401 ;
  • a user interface 1405 may be used for receiving inputs from a user or to display information to a user;
  • HD hard disk 1406 denoted HD may be provided as a mass storage device
  • an I/O module 1407 may be used for receiving/sending data from/to external devices such as a video source or display.
  • the executable code may be stored either in read only memory 1403, on the hard disk 1406 or on a removable digital medium such as for example a disk.
  • the executable code of the programs can be received by means of a communication network, via the network interface 1404, in order to be stored in one of the storage means of the communication device 1400, such as the hard disk 1406, before being executed.
  • the central processing unit 1401 is adapted to control and direct the execution of the instructions or portions of software code of the program or programs according to embodiments of the invention, which instructions are stored in one of the aforementioned storage means. After powering on, the CPU 1401 is capable of executing instructions from main RAM memory 1402 relating to a software application after those instructions have been loaded from the program ROM 1403 or the hard-disc (HD) 1406 for example. Such a software application, when executed by the CPU 1401 , causes the steps of the flowcharts described herein to be performed.
  • Any step of the algorithm described herein may be implemented in software by execution of a set of instructions or program by a programmable computing machine, such as a PC ("Personal Computer"), a DSP ("Digital Signal Processor") or a microcontroller; or else implemented in hardware by a machine or a dedicated component, such as an FPGA ("Field-Programmable Gate Array”) or an ASIC ("Application-Specific Integrated Circuit”).
  • a programmable computing machine such as a PC ("Personal Computer"), a DSP ("Digital Signal Processor") or a microcontroller
  • FPGA Field-Programmable Gate Array
  • ASIC Application-Specific Integrated Circuit

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention concerns a method and a device for encoding or decoding blocks of pixelsin the process of encoding or decoding a video. It concerns more particularly methods to handle parallelization when using INTRA Block Copy mode of HEVC Screen Content extension. It is based on the control of the area available for providing predictor blocks in INTRA Block Copy mode. Accordingly, Accordingly, the implementation is simple in order to allow parallelized process.

Description

METHOD AND APPARATUS FOR VECTOR ENCODING IN VIDEO CODING
AND DECODING
The present invention concerns a method and a device for encoding or decoding blocks of pixels in the process of encoding or decoding a video. It concerns more particularly methods to handle parallelization when using INTRA Block Copy mode of HEVC Screen Content extension. It is based on the control of the area available for providing predictor blocks in INTRA Block Copy mode. It applies more particularly to a mode of coding where a block of pixel is predictively encoded based on a predictor block pertaining to the same image. This mode of encoding a block of pixel is generally referred to as INTRA Block Copy mode. It is considered as a tool candidate for the Screen content Extension of the High Efficiency Video Coding (HEVC: ISO/IEC 23008-2 MPEG-H Part 21 ITU-T H.265) international standard and now in the Screen Content extension of the same.
When encoding an image in a video sequence, the image is first divided into coding elements that are entities of pixels of equal size referred to as Coding Tree Block (Coding Tree Block). The size of a Coding Tree Block is typically 64 by 64 pixels. Each Coding Tree Block may then be decomposed in a hierarchical tree of smaller blocks which size may vary and which are the actual blocks to encode. These smaller blocks to encode are referred to as Coding Unit (CU).
The encoding of a particular Coding Unit is typically predictive. This means that a predictor block is first determined. Next, the difference between the predictor block and the Coding Unit is calculated. This difference is called the residual . Next, this residual is compressed. The actual encoded information of the Coding Unit is made of some information to indicate the way of determining the predictor block and the compressed residual . Best predictor blocks are blocks as similar as possible to the Coding Unit in order to get a small residual that could be efficiently compressed.
Encoding may be lossy, meaning that information is lost in the encoding process. The decoded block of pixel is not exactly the same as the original Coding Unit. Typically, the loss of information comes from a quantization applied to the residual before entropy coding. This quantization allows a higher compression rate at the price of the loss of accuracy. Typically, high frequencies, namely the high level of details, are removed in the block.
Encoding may be lossless, meaning that the residual is not quantized. This kind of encoding allows retrieving the exact copy of the original samples of the Coding Unit. The lossless encoding is obtained at the expense of compression rate which is much smaller compared to a lossy compression.
The coding mode is defined based on the method used to determine the predictor block for the predictive encoding method of a Coding Unit.
A first coding mode is referred to as INTRA mode. According to INTRA mode, the predictor block is built based on the value of pixels immediately surrounding the Coding Unit within the current image. It is worth noting that the predictor block is not a block of the current image but a construction. A direction is used to determine which pixels of the border are actually used to build the predictor block and how they are used. The idea behind INTRA mode is that, due to the general coherence of natural images, the pixels immediately surrounding the Coding Unit are likely to be similar to pixels of the current Coding Unit. Therefore, it is possible to get a good prediction of the value of pixels of the Coding Unit using a predictor block based on these surrounding pixels.
A second coding mode is referred to as INTER mode. According to INTER mode, the predictor block is a block of another image. The idea behind the INTER mode is that successive images in a sequence are generally very similar. The main difference comes typically from a motion between these images due to the scrolling of the camera or due to moving objects in the scene. The predictor block is determined by a vector giving its location in a reference image relatively to the location of the Coding Unit within the current image. This vector is referred to as a motion vector. According to this mode, the encoding of such Coding Unit using this mode comprises motion information comprising the motion vector and the compressed residual. We focus in this document on a third coding mode called INTRA Block
Copy mode. According to the INTRA Block Copy mode, the block predictor is an actual block of the current image. A block vector is used to locate the predictor block. This block vector gives the location in the current image of the predictor block relatively to the location of the Coding Unit in the same current image. It comes that this block vector shares some similarities with the motion vector of the INTER mode. It is sometime called motion vector by analogy. As there could not be a motion within an image, strictly speaking, and for the sake of clarity, in this document motion vector always refer to the INTER mode while block vector is used for the INTRA Block Copy mode.
The causal principle is the principle that states that all information to decode a particular Coding Unit must be based on already reconstructed Coding Units. At encoding, the whole information may be considered as available. Namely, to encode a given Coding Unit it would be possible to use any information from the entire current images or from all decoded and available other images in the sequence. At decoding, things are different. The decoding of the current images is typically done by decoding sequentially all Coding Unit. The order of decoding follows typically a raster scan order, namely beginning in the upper left of the image, progressing from left to right and from top to bottom. It come that when decoding a given Coding Unit, only the part of the current image located up or left to the current Coding Unit has already been decoded. This is the only available information for the decoding of the current Coding Unit. This has to be taken into account at encoding. For example, a predictor block in INTRA Block Copy mode, should pertain to the part of the image that will be available at decoding. At decoding, to retrieve a block encoded using INTRA Block Copy mode, first of all, the predictor block is determined using the block vector. Then the residual is decoded and applied to the predictor to obtain a raw reconstructed block. When the complete image has been reconstructed, some post filtering is applied. Typically a first filter is applied to remove some artefacts in the reconstructed image due to the block encoding. This filter is called the deblocking filter. Typically, while not mandatory, a sample adaptive loop filter (SAO) is then applied to get the final image. in some decoding architecture, the processing is parallelized in order to speed up the process. In this situation, a particular coding tree block is reconstructed while the previous one is filtered for example. Namely, reconstruction of some coding tree block and filtering of others are made in parallel.
The HEVC standard offers some high level of parallelism as Wavefront or Tiles or Slices for frame parallelism and flexible reference frames management for Inter parallelism. These tools are not mandatory, yet, the decoder needs to decode their related syntax even if they are not mandatory.
We focus in this document to the Wavefront parallel processing and how it be efficiently combined with the INTRA Block Copy mode of encoding a particular Coding Unit.
The Wavefront parallel processing is based on parallelizing the reconstruction of lines of Coding Tree blocks. Namely, a number of Coding Tree Blocks are reconstructed in parallel. A delay is introduced between the treatment of each line due to the fact that the reconstruction of a subsequent line of Coding Tree Blocks needs some information from the previous line. It means that the reconstruction of the different lines being parallelized is going on with a delay between each line.
This Wavefront parallel processing may prove to have a problem when reconstructing a particular coding unit encoded according to INTRA Block Copy mode. Actually, the block predictor for a coding unit encoded according to INTRA Block Copy mode may be located anywhere in the complete causal area, namely the previous Coding Tree Block lines and the previous Coding Tree Blocks in the current line. As the previous lines are reconstructed in parallel with the considered one, it may happen that the predictor block has not yet been reconstructed at the time it is needed for the reconstruction of the coding unit encoded according to INTRA Block Copy mode. As such INTRA Block Copy mode is not fully compatible with Wavefront parallel reconstruction.
The present invention has been devised to address one or more of the foregoing concerns.
According to a first aspect of the present invention, there is provided a method of encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is encoded based on a predictor block being a block of the current image, the method comprising: determining the search area for said one mode as an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
Y≤ Y0 and (X - X0) ≤ - (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and (XQ, YQ) are the coordinates of the current Coding Tree block. In an embodiment encoding is performed using Wavefront parallel processing.
In a second aspect of the present invention, there is provided a method of decoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is decoded based on a predictor block being a block of the current image, the method comprising: restricting the area from which said predictor block may be obtained for said one mode to an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where:
Y≤ Y0 and (X - X0) ≤ - (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and (XQ, YQ) are the coordinates of the current Coding Tree block.
In an embodiment the decoding is performed using Wavefront parallel processing.
In a third aspect of the present invention, there is provided a device for encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is encoded based on a predictor block being a block of the current image, the device comprising: means for determining the search area for said one mode as an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
Y≤ Y0 and (X - X0) ≤ - (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and (XQ, YQ) are the coordinates of the current Coding Tree block. In a fourth aspect of the present invention, there is provided a device for decoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being decoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is decoded based on a predictor block being a block of the current image, the device comprising: means for restricting the area from which said predictor block may be obtained to an area constituted by the reconstructed blocks of the current Coding Tree block and the Coding Tree blocks of coordinates (X, Y) where :
Y≤ Y0 and (X - X0) ≤ - (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and (XQ, YQ) are the coordinates of the current Coding Tree block. For example, restricting the area may take the form of not performing (e.g. stopping) the decoding process if the area from which a predictor block is to be obtained is found to be outside of the area constituted by the reconstructed blocks of the current Coding Tree block and the Coding Tree blocks of coordinates (X, Y) where :
Y≤ Y0 and (X - X0) ≤ - (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and (XQ, YQ) are the coordinates of the current Coding Tree block.
In a fifth aspect of the present invention, there is provided a system for encoding and decoding an image, the system comprising a device for encoding an image according to the preceding encoder aspects and a device for decoding an image according to the preceding decoder aspects.
The device for encoding and the device for decoding may be configured to use Wavefront parallel processing. The device for encoding and the device for decoding may be configured to use the same number of synchronized threads for respectively encoding and decoding the image. According to a sixth aspect of the present invention, there is provided a bitstream comprising encoded images, wherein encoded images have been encoded according to the preceding encoding aspects.
According to a seventh aspect of the present invention, there is provided a bitstream comprising an encoded sequence of images the images each comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels having been encoded according to a mode out of a plurality of modes, one mode being a mode in which the block is encoded based on a predictor block being a block of the current image, wherein the position of any predictor block indicated by the bitstream is restricted to an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
Y≤ Y0 and (X - X0)≤ -(Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of an encoded image, and (XQ, YQ) are the coordinates of the current Coding Tree block.
According to an eighth aspect of the present invention, there is provided a machine readable carrier or storage medium having stored thereon a bitstream according to the preceding bitstream aspects. The carrier may also be a signal on which said bitstream is embodied.
According to a ninth aspect of the present invention, there is provided a computer program product for a programmable apparatus, the computer program product comprising a sequence of instructions for implementing a method according to any of the preceding method aspects, when loaded into and executed by the programmable apparatus. According to a tenth aspect of the present invention, there is provided a computer-readable storage medium storing instructions of a computer program for implementing a method according to any one of the preceding method aspects.
According to a further aspect of the invention, there is provided a method of encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
Y≤ Y0 and X≤ X0
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block.
Accordingly, the implementation is simple in order to allow the Wavefront process.
According to another aspect of the invention there is provided a method of encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
Y≤ Y0 and (X - X0)≤ -(Y - Y0) where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block.
Accordingly, the search area is bigger, which leads to a better encoding.
According to another aspect of the invention there is provided a method of encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates {X, Y) such as :
Y≤ Y0 and (X - X0)≤ -2 * (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block.
Accordingly, the search area is bigger, which leads to a better encoding.
According to another aspect of the invention there is provided a method of encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
Y≤ Y0 and (X - X0) < -2 * (Y - Y0) where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block.
Accordingly, implementation is simpler.
According to another aspect of the invention there is provided a method of encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates {X, Y) such as :
X≤ X0 and Y = Y0
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block.
According to another aspect of the invention there is provided a method of encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the encoding being carried on by a plurality of parallel threads of encoding, each threads being dedicated to the encoding of a line of Coding Tree blocks, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by, for a current INTRA Block Copy block, all data reconstructed by all threads of the previous Coding Tree Block lines and the current Coding Tree Block line. According to another aspect of the invention there is provided a method of encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the encoding being carried on by a plurality of synchronized parallel threads of encoding, each threads being dedicated to the encoding of a line of Coding Tree blocks, the method comprising: determining the search range for INTRA Block Copy mode as the area constituted by, for a current INTRA Block Copy block, all data reconstructed by all threads (including the current Coding Tree Block).
In an embodiment encoding is done according to Wavefront parallelized mode.
According to another aspect of the invention there is provided a method of decoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being decoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the decoding being carried on by a plurality parallel threads of decoding, each threads being dedicated to the decoding of a line of Coding Tree blocks, wherein said plurality of threads are synchronized.
According to another aspect of the invention there is provided a device for encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
Y≤ Y0 and X≤ X0
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block.
According to another aspect of the invention there is provided a device for encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
Y≤ Y0 and (X - X0)≤ -(Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block.
According to another aspect of the invention there is provided a device for encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates {X, Y) such as :
Y≤ Y0 and (X - X0)≤ -2 * (Y - Y0) where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the bottom left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block.
According to another aspect of the invention there is provided a device for encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
Y≤ Y0 and (X - X0) < -2 * (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block. According to another aspect of the invention there is provided a device for encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such as :
X≤ X0 and Y = Y0
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree block, plus the reconstructed blocks of the current Coding Tree block. According to another aspect of the invention there is provided a device for encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the encoding being carried on by a plurality of parallel threads of encoding, each threads being dedicated to the encoding of a line of Coding Tree blocks, the device comprising means for determining the search range for INTRA Block Copy mode as the area constituted by, for a current INTRA Block Copy block, all data reconstructed by all threads of the previous Coding Tree Block lines and the current Coding Tree Block line.
According to another aspect of the invention there is provided a device for encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the encoding being carried on by a plurality of synchronized parallel threads of encoding, each thread being dedicated to the encoding of a line of Coding Tree blocks, the device comprising: means for determining the search range for INTRA Block Copy mode as the area constituted by, for a current INTRA Block Copy block, all data reconstructed by all threads.
According to another aspect of the invention there is provided a device for decoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being decoded according to a mode out of a plurality of modes, one mode being called INTRA Block Copy mode in which the block is encoded based on a predictor block being an actual block of the current image, the device comprising: means for processing a plurality of parallel threads of decoding, each thread being dedicated to the decoding of a line of Coding Tree blocks; and wherein synchronization means for synchronizing said plurality of threads.
According to another aspect of the invention there is provided a system for encoding and decoding an image, the system comprising an encoder according to the invention and a decoder according to the invention.
In an embodiment, the encoder and the decoder are using the same number of synchronized threads for respectively encoding and decoding the image.
According to another aspect of the invention there is provided a bitstream comprising encoded images, wherein encoded images have been encoded according to the invention.
According to another aspect of the invention there is provided a computer program product for a programmable apparatus, the computer program product comprising a sequence of instructions for implementing a method according to the invention, when loaded into and executed by the programmable apparatus.
According to another aspect of the invention there is provided a computer-readable storage medium storing instructions of a computer program for implementing a method according to the invention. Some aspects of the invention recited above mention the mode of the plurality of modes being an Intra block copy mode, however, as will be appreciated, this is merely an arbitrary label for this mode and is not intended to be limited. Accordingly, those aspects have within their intended scope any mode in which a block is encoded (or decoded) based on a predictor block being an actual block of the current image being encoded (or decoded) whether that mode is referred to as an Intra block copy mode or otherwise. At least parts of the methods according to the invention may be computer implemented. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a "circuit", "module" or "system". Furthermore, the present invention may take the form of a computer program product embodied in any tangible medium of expression having computer usable program code embodied in the medium.
Since the present invention can be implemented in software, the present invention can be embodied as computer readable code for provision to a programmable apparatus on any suitable carrier medium. A tangible carrier medium may comprise a storage medium such as a floppy disk, a CD-ROM, a hard disk drive, a magnetic tape device or a solid state memory device and the like. A transient carrier medium may include a signal such as an electrical signal, an electronic signal, an optical signal, an acoustic signal, a magnetic signal or an electromagnetic signal, e.g. a microwave or RF signal. Embodiments of the invention will now be described, by way of example only, and with reference to the following drawings in which:
Figure 1 illustrates the HEVC encoder architecture;
Figure 2 illustrates the HEVC decoder architecture;
Figure 3 illustrates the Level decomposition of Video frame;
Figure 4 illustrates the principle of Wavefront processing;
Figure 5 illustrates the location of block for the initialization of context variable with Wavefront;
Figure 6 illustrates the concept of the causal area;
Figure 7 illustrates the INTRA Block Copy search area;
Figure 8 illustrates a problem solved by the invention;
Figure 9 illustrates one embodiment of the invention;
Figure 10 illustrates one embodiment of the invention; Figure 11 illustrates one embodiment of the invention;
Figure 12 illustrates one embodiment of the invention;
Figure 13 illustrates one embodiment of the invention; and
Figure 14 is a schematic block diagram of a computing device for implementation of one or more embodiments of the invention.
Figure 1 illustrates the HEVC encoder architecture. In the video encoder, an original sequence 101 is divided into blocks of pixels 102. A coding mode is then affected to each block. There are two families of coding modes typically used in HEVC: the modes based on spatial prediction or INTRA modes 103 and the modes based on temporal prediction or INTER modes based on motion estimation 104 and motion compensation 105. An INTRA Coding Unit is generally predicted from the encoded pixels at its causal boundary by a process called INTRA prediction.
Temporal prediction first consists in finding in a previous or future frame called the reference frame 116 the reference area that most closely matches the Coding Unit in a motion estimation step 104. This reference area constitutes the predictor block. Next this Coding Unit is predicted using the predictor block to compute the residual in a motion compensation step 105.
In both cases, spatial and temporal prediction, a residual is computed by subtracting the Coding Unit from the original predictor block. In the INTRA prediction, a prediction direction is encoded. In the temporal prediction, at least one motion vector is encoded. However, in order to further reduce the bitrate cost related to motion vector encoding, a motion vector is not directly encoded. Indeed, assuming that motion is homogeneous, it is particularly interesting to encode a motion vector as a difference between this motion vector, and a motion vector in its surrounding. In H.264/AVC coding standard for instance, motion vectors are encoded with respect to a median vector computed between 3 blocks located above and on the left of the current block. Only a difference, also called residual motion vector, computed between the median vector and the current block motion vector is encoded in the bitstream. This is processed in module "Mv prediction and coding" 117. The value of each encoded vector is stored in the motion vector field 118. The neighboring motion vectors, used for the prediction, are extracted from the motion vector field 118.
Then, the mode optimizing the rate distortion performance is selected in module 106. In order to further reduce the redundancies, a transform, typically a DCT, is applied to the residual block in module 107, and a quantization is applied to the coefficients in module 108. The quantized block of coefficients is then entropy coded in module 109 and the result is inserted in the bitstream 110. The encoder then performs a decoding of the encoded frame for the future motion estimation in modules 111 to 116. These steps allow the encoder and the decoder to have the same reference frames. To reconstruct the coded frame, the residual is inverse quantized in module 111 and inverse transformed in module 112 in order to provide the "reconstructed" residual in the pixel domain. According to the encoding mode (INTER or INTRA), this residual is added to the INTER predictor 114 or to the INTRA predictor 113.
Then, this first reconstruction is filtered in module 115 by one or several kinds of post filtering. These post filters are integrated in the encoded and decoded loop. It means that they need to be applied on the reconstructed frame at encoder and decoder side in order to use the same reference frame at encoder and decoder side. The aim of this post filtering is to remove compression artifacts. In Figure 2, have been represented the principle of a decoder. The video stream 201 is first entropy decoded in a module 202. The residual data are then inverse quantized in a module 203 and inverse transformed in a module 204 to obtain pixel values. The mode data are also entropy decoded in function of the mode, an INTRA type decoding or an INTER type decoding is performed. In the case of INTRA mode, an INTRA predictor is determined in function of the INTRA prediction mode specified in the bitstream 205. If the mode is INTER, the motion information is extracted from the bitstream 202. This is composed of the reference frame index and the motion vector residual. The motion vector predictor is added to the motion vector residual to obtain the motion vector 210. The motion vector is then used to locate the reference area in the reference frame 206. Note that the motion vector field data 211 is updated with the decoded motion vector in order to be used for the prediction of the next decoded motion vectors. This first reconstruction of the decoded frame is then post filtered 207 with exactly the same post filter as used at encoder side. The output of the decoder is the de-compressed video 209. This INTRA Block Copy coding mode is particularly well suited for extremely repetitive patterns. In particular, it is known to help coding graphical elements such as glyphs, the graphical representation of a character, or traditional GUI elements, which are very difficult to code using traditional INTRA prediction methods.
It is worth noting that prediction is based on coherence between neighboring Coding Units. This coherence may be geographic when considered within the current frame or temporal when considered across successive frames. This kind of coherence occurs in natural images. As INTRA Block Copy encoding mode is seen as a mode dedicated to text or symbolic images, predication is thought as useless for this kind of image. For instance, there is no reason to have two successive Coding Units in an image representing a text having good predictors close to each other. The first Coding Unit may be the part of letter "A", a good predictor block would therefore come from another "A" in the text. While the next Coding Unit would be a "P" letter having a predictor block from another "P" in the text. There is no reason, a-priori, to have the two predictor blocks in the same neighborhood. This is why prior art does not contemplate introducing prediction in INTRA Block Copy encoding mode.
In HEVC, it is possible to transmit specific NAL Units, called SEI messages of different types. SEI messages contain information related to the display process, and are therefore optional.
Figure 3 shows the coding structure used in HEVC. According to HEVC and one of its previous predecessors, the original video sequence 301 is a succession of digital images "images i". As is known by those skilled in the art, a digital image is represented by one or more matrices the coefficients of which represent pixels.
The images 302 are divided into slices 303. A slice is a part of the image or the entire image. In HEVC these slices are divided into non-overlapping Coding Tree Blocks (CTB) 304, generally blocks of size 64 pixels x 64 pixels. Each Coding Tree Block may in its turn be iteratively divided into smaller variable size Coding Units (CUs) 305 using quadtree decomposition. Coding units are the elementary coding elements and are constituted of two sub units which Prediction Unit (PU) and Transform Units (TU) of maximum size equal to the Coding Unit's size. Prediction Unit corresponds to the partition of the Coding Unit for prediction of pixels values. Each Coding Unit can be further partitioned into a maximum of 4 square Partition Units or 2 rectangular Partition Units 306. Transform units are used to represent the elementary units that are spatially transform with DCT. A Coding Unit can be partitioned in TU based on a quadtree representation 307.
Each slice is embedded in one NAL unit. In addition, the coding parameters of the video sequence are stored in dedicated NAL units called parameter sets. In HEVC and H.264/AVC two kinds of parameter sets NAL units are employed: first, the Sequence Parameter Set (SPS) NAL unit that gathers all parameters that are unchanged during the whole video sequence. Typically, it handles the coding profile, the size of the video frames and other parameters. Secondly, Picture Parameter Sets (PPS) codes the different values that may change from one frame to another. HEVC include also Video Parameter Set (VPS) which contains parameters describing the overall structure of the stream.
For real time or fast implementation, it is often needed to parallelize some encoding and decoding processes. The HEVC standard offers some high level of parallelism as Wavefront or Tiles or Slices for frame parallelism and flexible reference frames management for Inter parallelism. These tools are not mandatory, yet, the decoder needs to decode their related syntax even if they are not mandatory.
The invention is dedicated to the Wavefront processing when it is combined with the INTRA Block Copy tools of the Screen Content extension of HEVC. The principle of Wavefront processing is presented in Figure 4. The principle is to parallelize the decoding process of several lines of Coding Tree Block. In opposite to the Tiles or in classical Slices, which avoid some predictions to offer parallelism but generate some losses in coding efficiency, the Wavefront keeps a large majority of predictions. The Wavefront introduces a delay between each line for the parallelization. In the example of Figure 4, 4 threads are run in parallel. So, 4 current Coding Tree Block are decoded in parallel. There is a delay between threads; for example, the second stream needs some information decoded by the first thread. So it is run with a delay of one Coding Tree Block for the entropy decoding. In the same way, thread 3 needs some decoded information from thread 2 etc... If we consider that the parsing and the reconstruction of each Coding Tree Block is exactly the same, the delay should be at decoder of 2 Coding Tree Block as it is represented in Figure 4. Indeed for reconstruction, the top right Coding Unit of the top right Coding Tree Block could be needed to decode the current Coding Tree Block. So in order to prevent a thread from waiting for its previous thread, a 2 Coding Tree Blocks delay should be considered. In the HEVC standard, the Wavefront processing is not explicitly defined. Only some CABAC resettings are explicitly described. When the flag entropy_coding_sync_enabled_flag is enabled, and when the first pixel of the first Coding Tree Block of a Coding Tree Block line is decoded, the context variable of the CABAC are initialized with the spatial neighbouring block T as depicted in Figure 5. More precisely the context variable of the CABAC takes the same values as those of block T. This block is the first block of the top right Coding Tree Block. If this block T is not available the context variables are initialized as the first Coding Tree Block of a frame.
So, for the HEVC Wavefront only the CABAC dependencies for the first block of each Coding Tree Block differ from the classical decoding process. For the classical decoding process, the values of the context variables of the first Coding Tree Block of Coding Tree Block line is set equal to the values of the context variables of the last block of the last Coding Tree Block of the previous Coding Tree Block line. When Wavefront is enable, this variable context are initialized with those of the top right Coding Tree Block (T).This is the only change that is needed at decoder side to use Wavefront at both encoder and decoder.
Moreover, it is possible to reset the context variables CABAC as the first Coding Tree Block of a frame according to some entry point syntax elements. Yet, these specific entry points are not needed for the current solution. The Screen Content Coding extension of HEVC under definition contains additional tools to efficiently code screen coding sequences. The current added tools are the Intra block copy mode, the Palette mode and the residual color transform. The current invention is dedicated to the Intra block copy mode only, so only this mode is described in the following. Yet, please note that the Palette mode and the INTRA Block Copy mode are new Intra modes and consequently added to the modules 103 and 205 of respectively Figure 1 and 2. The Intra Block Copy (IBC) was added as an additional mode for Screen content coding extension of HEVC. This prediction method is particularly well suited for extremely repetitive patterns. In particular, it is known to help coding graphical elements such as glyphs (i.e., the graphical representation of a character) or traditional GUI elements, which are very difficult to code using traditional intra prediction methods.
Figure 6 illustrates how this Intra Block Copy prediction mode works.
At a high-level, an image is divided into Coding Units that are encoded in raster scan order. Thus, when coding block 601 , all the blocks of area 603 have already been encoded/decoded, and can be considered available to the encoder/decoder. Area 603 is called the causal area of the Coding Unit 601. Once Coding Unit 601 is encoded/decoded, it will belong to the causal area for the next Coding Unit. This next Coding Unit, as well as all the next ones, belongs to area 604 illustrated as doted area, and cannot be used for coding the current Coding Unit 601. It is worth noting that the causal area is constituted by raw reconstructed blocks. The information used to encode a given Coding Unit is not the original blocks of the image for the reason that this information is not available at decoding. The only information available at decoding is the reconstructed version of the blocks of pixels in the causal area, namely the decoded version of these blocks. For this reason, at encoding, previously encoded blocks of the causal area are decoded to provide this reconstructed version of these blocks. INTRA Block Copy works by signaling a block 602 in the causal area which should be used to produce a prediction of block 601. For example, the block 602 may be found by using a matching algorithm. In the HEVC Screen content Extension, this block is indicated by a block vector 605, and the residual of this vector according to a predictor is transmitted in the bitstream.
The INTRA Block Copy predictor comes from all the reconstructed causal area of the current frame. As for other Intra modes, the causal area is not loop filtered.
This block vector is the difference in coordinates between a particular point of the Coding Unit 601 and the equivalent point in the predictor block 602. Although it would be possible to use subpixel accuracy as for INTER blocks, this displacement is typically in integer units of pixels, therefore not to require costly subpixel interpolation.
In the current INTRA Block Copy design, each INTRA Block Copy Coding Unit can be split into one or 2 PUs as depicted in Figure 3. For the smallest Coding Unit size, 8x8, the Coding Unit can be also split into 4 PUs of 4x4 pixels each.
For Inter mode the NxN partition is not available. It means that the 4x4 block size can't be used for Inter mode. The following table summarizes the block size for both modes.
Block sizes IBC mode Inter mode
64x64 (2Nx2N) X X
64x32 (2NxN) X X
32x64 (Nx2N) X X
32x32 (2Nx2N) X X
32x16 (2NxN) X X
16x32 (Nx2N) X X
16x16 (2Nx2N) X X 16x8 (2NxN) X X
8x16 (Nx2N) X X
8x8 (2Nx2N) X X
8x4 (2NxN) X X
4x8 (Nx2N) X X
4x4 (NxN) X
In the current implementation of Intra Block Copy prediction mode, the search area at encoder side depends on the blocks sizes. This is represented in the following table:
Figure imgf000027_0001
Please note that the 2NxN and Nx2N PU sizes are tested only for 8x8 Coding Units in the current encoder implementation. These sizes are not depicted in this table. There are 2 types of Intra Block Copy block vector estimation. The first one is the classical INTRA Block Copy search and it corresponds to a dedicated block matching algorithm. The second one is based on the Hash search algorithm. Two search ranges are also defined.
As depicted in Figure 7, for a frame 701 , the two Coding Tree Blocks search range corresponds to the left Coding Tree Block 703 and to the blocks of the current Coding Tree Block 702 already encoded/decoded. The blocks of current Coding Tree Block already encoded are depicted in dotted area in Figure 7. The full frame search corresponds to all the Coding Tree Blocks already encoded/decoded 704.
In the Intra Block Copy mode the "block" vector is the difference in coordinates between a particular point in a block 601 to encode and the equivalent point in the predictor block 602 of Figure 6. Although it would be possible to use subpixel accuracy as for INTER blocks, this displacement is in integer units of pixels, therefore it doesn't require costly subpixel interpolation. This block vector (BV) is itself predicted using a predictor which can be the left, the above BV or the latest decoded block vector of the current Coding Tree Block or the latest of the latest decoded BV. This vector predictors come of course from the decoded Intra Block Copy block. With these methods a predictor index is transmitted.
As mentioned previously, INTRA Block Copy is an Intra mode so its predictors come from the raw reconstructed data before any loop filtering. As a consequence, the decoder implementations using Wavefront processing should be decreased in decoding. Indeed as depicted in Figure 8, an INTRA Block Copy block predictor can come from a Coding Tree Block which has not been reconstructed. So, it means that the decoder can fully wait for the decoding process of this INTRA Block Copy predictor. So by considering the worst case, which is that each first block of each Coding Tree Block line points to the last block of each previous Coding Tree Block line, the decoding process with Wavefront can't be significantly faster than the classical decoding.
In a first embodiment of the invention, for a current Coding Tree Block, the INTRA Block Copy search range is limited to all left, top left and top Coding Tree Blocks and of course the reconstructed blocks of the current Coding Tree Block. The INTRA Block Copy search range is the area in the image where a predictor block may be searched for the encoding of a given coding unit according to INTRA Block Copy mode. It means that the top right Coding Tree Block are considered as unavailable for INTRA Block Copy prediction at encoder side and consequently no INTRA Block Copy block predictor for the current Coding Tree Block can point to any top Right Coding Tree Block at decoder side. Figure 9 shows this embodiment for the current Coding Tree Block of thread 4.
Namely the search range for INTRA Block Copy mode is determined as the area constituted by the Coding Tree Blocks of coordinates (X, Y) such that: Y≤ Y0 and X≤ X0
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and where (XQ, YQ) are the coordinates of the current Coding Tree Block. Naturally, for the current Coding Tree Block, the area contains only already reconstructed blocks. This solution is very simple in term of implementation and simplifies significantly the Wavefront process.
In other embodiments, in order to improve the coding efficiency, more reconstructed blocks are available for INTRA Block Copy prediction or a current Coding Tree Block.
In one embodiment, all Coding Tree Blocks (CTBs) located to the left of a diagonal that starts at top right of the current Coding Tree Block and finishes at the top edge of the image are available for the INTRA Block Copy prediction. The mentioned diagonal follows a stepped (i.e. ladder shaped) path that follows a line that travels one CTB in the positive direction along the x-axis followed by one CTB in the negative direction along the y-axis and so on until the line reaches the top edge of the image. In addition, any already reconstructed CTBs of the current CTB row (thread) and any reconstructed blocks of the current CTB are also available for the INTRA Block Copy prediction. Figure 10 shows this embodiment for the current Coding Tree Block of the 4th thread. At decoder side the INTRA Block Copy predictors for the current Coding Tree Block come from this area only. It corresponds to an encoder using one Coding Tree Block delay between threads.
Namely, in this embodiment, the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such that:
• Y≤ Y0 an0 .X - X0)≤ -(Y - Y0) .
In another embodiment, all Coding Tree Block left to the diagonal with a delay of 2 Coding Tree Block are available for INTRA Block Copy prediction in addition to the reconstructed blocks of the current Coding Tree Block. Figure 11 shows this embodiment for the current Coding Tree Block of the 4th thread.
Namely, in this embodiment, the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such that:
• Y≤ Y0 and (X - X0)≤ -2 * (Y - Y0)
This embodiment increases the coding efficiency compared to the first one, but as shown in Figure 11 , potentially this search range increase the delay between thread.
In another embodiment, all Coding Tree Block left to the diagonal with a delay of 1 Coding tree block for the previous Coding Tree Block line and 2 Coding Tree Block all other previous Coding Tree Block lines are available for INTRA Block Copy prediction in addition to the reconstructed blocks of the current Coding Tree Block. Figure 12 shows this embodiment for the current Coding Tree Block of the 4th thread. It corresponds to a decoder which decodes the frame with 2 Coding Tree Block delays between threads in order to have the reconstructed top right Coding Unit available to decode the current Coding Tree Block. This area is more dedicated to decoder Wavefront processing. Namely, in this embodiment, the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such that:
Y≤ Y0 and (X - XQ < -2 * (Y - YQ)
In one embodiment, each INTRA Block Copy block of a thread can access only the blocks reconstructed by this same thread. It corresponds to the use of a Coding Tree Block line search area for INTRA Block Copy at encoder side. This embodiment offers a flexible decoding for Wavefront processing. Indeed, there are no additional dependencies between Coding Tree Block lines compared to an implementation without INTRA Block Copy but it reduces the coding efficiency.
Namely, in this embodiment, the search range for INTRA Block Copy mode as the area constituted by the Coding Tree blocks of coordinates (X, Y) such that:
-Y < -Y0 and Y = Yro.
In a particular embodiment, when the Wavefront is enabled, both encoder and decoder use the wavefront parallelism with the required number of threads. Moreover for this embodiment, the threads are synchronized. It means that each thread, before starting the decoding of the following Coding Tree Block, waits for the decoding end of all Coding Tree Block of the other threads. In that case, at encoder and decoder, the reconstruction of all blocks is synchronized at Coding Tree Block level. In this embodiment, an INTRA Block Copy block of a thread can access all reconstructed Coding Tree Block available, even if the Coding Tree Block is in the below Coding Tree Block lines. For example, in Figure 12 each INTRA Block Copy block of each decoded Coding Tree Block (marked by an X) can access all available data of thread 4 in the classical implementation. In addition, each block of a Coding Tree Block can access all reconstructed blocks of its own Coding Tree Block. The advantage of this embodiment is that it increases the average number of blocks for each Coding Tree Block and it increases the search area for the first Coding Tree Block line for which the bitrate is generally higher due to the lack of possible predictions compared to other following Coding Tree Block lines. Please note that in this embodiment, a mono-threaded decoder can obtain the same decoding results than multithreaded decoder. Indeed, only the Coding Tree Block synchronization is mandatory.
Figure 13 shows the decoding order of CTB when a mono-threaded decoder implementation is used for a 2 CTB delay. Another order is needed if the delay considered is 1 CTB.
Figure 14 is a schematic block diagram of a computing device 1400 for implementation of one or more embodiments of the invention. The computing device 1400 may be a device such as a micro-computer, a workstation or a light portable device. The computing device 1400 comprises a communication bus connected to:
- a central processing unit 1401 , such as a microprocessor, denoted
CPU;
- a random access memory 1402, denoted RAM, for storing the executable code of the method of embodiments of the invention as well as the registers adapted to record variables and parameters necessary for implementing the method for encoding or decoding at least part of an image according to embodiments of the invention, the memory capacity thereof can be expanded by an optional RAM connected to an expansion port for example;
- a read only memory 1403, denoted ROM, for storing computer programs for implementing embodiments of the invention;
- a network interface 1404 is typically connected to a communication network over which digital data to be processed are transmitted or received. The network interface 1404 can be a single network interface, or composed of a set of different network interfaces (for instance wired and wireless interfaces, or different kinds of wired or wireless interfaces). Data packets are written to the network interface for transmission or are read from the network interface for reception under the control of the software application running in the CPU 1401 ;
- a user interface 1405 may be used for receiving inputs from a user or to display information to a user;
- a hard disk 1406 denoted HD may be provided as a mass storage device;
- an I/O module 1407 may be used for receiving/sending data from/to external devices such as a video source or display. The executable code may be stored either in read only memory 1403, on the hard disk 1406 or on a removable digital medium such as for example a disk. According to a variant, the executable code of the programs can be received by means of a communication network, via the network interface 1404, in order to be stored in one of the storage means of the communication device 1400, such as the hard disk 1406, before being executed.
The central processing unit 1401 is adapted to control and direct the execution of the instructions or portions of software code of the program or programs according to embodiments of the invention, which instructions are stored in one of the aforementioned storage means. After powering on, the CPU 1401 is capable of executing instructions from main RAM memory 1402 relating to a software application after those instructions have been loaded from the program ROM 1403 or the hard-disc (HD) 1406 for example. Such a software application, when executed by the CPU 1401 , causes the steps of the flowcharts described herein to be performed.
Any step of the algorithm described herein may be implemented in software by execution of a set of instructions or program by a programmable computing machine, such as a PC ("Personal Computer"), a DSP ("Digital Signal Processor") or a microcontroller; or else implemented in hardware by a machine or a dedicated component, such as an FPGA ("Field-Programmable Gate Array") or an ASIC ("Application-Specific Integrated Circuit"). Although the present invention has been described hereinabove with reference to specific embodiments, the present invention is not limited to the specific embodiments, and modifications will be apparent to a skilled person in the art which lie within the scope of the present invention.
Many further modifications and variations will suggest themselves to those versed in the art upon making reference to the foregoing illustrative embodiments, which are given by way of example only and which are not intended to limit the scope of the invention, that being determined solely by the appended claims. In particular the different features from different embodiments may be interchanged, where appropriate.
In the claims, the word "comprising" does not exclude other elements or steps, and the indefinite article "a" or "an" does not exclude a plurality. The mere fact that different features are recited in mutually different dependent claims does not indicate that a combination of these features cannot be advantageously used.

Claims

A method of encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is encoded based on a predictor block being a block of the current image, the method comprising:
determining the search area for said one mode as an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
Y≤ Y0 and (X - X0) ≤ - (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and (XQ, YQ) are the coordinates of the current Coding Tree block.
The method of claim 1 , wherein encoding is performed using Wavefront parallel processing.
A method of decoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is decoded based on a predictor block being a block of the current image, the method comprising restricting the area from which said predictor block may be obtained for said one mode to an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
Y≤ Y0 and (X - X0) ≤ - (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and (XQ, YQ) are the coordinates of the current Coding Tree block. A method of decoding according to claim 3, wherein decoding is performed using Wavefront parallel processing.
A device for encoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being encoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is encoded based on a predictor block being a block of the current image, the device comprising:
means for determining the search area for said one mode as an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
Y≤ Y0 and (X - X0) ≤ - (Y - Y0)
where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and (XQ, YQ) are the coordinates of the current Coding Tree block.
A device for decoding an image, the image comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels being decoded according to a mode out of a plurality of modes, one such mode being a mode in which the block is decoded based on a predictor block being a block of the current image, the device comprising: means for restricting the area from which said predictor block may be obtained to an area constituted by the reconstructed blocks of the current Coding Tree block and the Coding Tree blocks of coordinates (X, Y) where :
Y≤ Y0 and (X - X0) ≤ - (Y - Y0) where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of the image, and (XQ, YQ) are the coordinates of the current Coding Tree block
A system for encoding and decoding an image, the system comprising a device for encoding an image according to claim 5 and a device for decoding an image according to claim 6.
8. The system of claim 7, wherein the device for encoding and the device for decoding are configured to use Wavefront parallel processing. 9. The system of claim 8, wherein device for encoding and the device for decoding are configured to use the same number of synchronized threads for respectively encoding and decoding the image.
10. A bitstream comprising encoded images, wherein encoded images have been encoded according to the method of claim 1 .
1 1 . A bitstream comprising an encoded sequence of images the images each comprising a plurality of Coding Tree blocks made of blocks of pixels, each block of pixels having been encoded according to a mode out of a plurality of modes, one mode being a mode in which the block is encoded based on a predictor block being a block of the current image, wherein the position of any predictor block indicated by the bitstream is restricted to an area constituted by any reconstructed blocks of the current Coding Tree block and Coding Tree blocks having coordinates (X, Y) where :
Y≤ Y0 and (X - X0) ≤ - (Y - Y0) where X represents the horizontal coordinate, Y represents the vertical one, the origin being in the top left corner of an encoded image, and (XQ, YQ) are the coordinates of the current Coding Tree block.
12. A carrier medium carrying a bitstream according to claim 1 1 .
13. A carrier medium according to claim 12 wherein the carrier medium is a storage medium on which said bitstream is stored.
14. A carrier medium according to claim 12 wherein the carrier medium is a signal embodying said bitstream.
15. A computer program product for a programmable apparatus, the computer program product comprising a sequence of instructions for implementing a method according to any one of claims 1 to 4, when loaded into and executed by the programmable apparatus.
16. A computer-readable storage medium storing instructions of a computer program for implementing a method according to any one of claims 1 to
PCT/EP2015/073060 2014-10-06 2015-10-06 Method and apparatus for vector encoding in video coding and decoding WO2016055484A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
KR1020177011152A KR102076398B1 (en) 2014-10-06 2015-10-06 Method and apparatus for vector encoding in video coding and decoding
CN201580053419.6A CN106797464B (en) 2014-10-06 2015-10-06 Method and apparatus for vector coding in video encoding and decoding
EP15781605.9A EP3205091B1 (en) 2014-10-06 2015-10-06 Methods, apparatuses and corresponding computer program and computer-readable storage medium for encoding and decoding an image
RU2017115409A RU2663348C1 (en) 2014-10-06 2015-10-06 Method and device for coding vector for coding and decoding video
US15/516,856 US11051037B2 (en) 2014-10-06 2015-10-06 Method and apparatus for vector encoding in video coding and decoding
JP2017517079A JP6590918B2 (en) 2014-10-06 2015-10-06 Image encoding method, image decoding method, image encoding device, image decoding device, and program

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB1417634.1A GB2531001B (en) 2014-10-06 2014-10-06 Method and apparatus for vector encoding in video coding and decoding
GB1417634.1 2014-10-06

Publications (1)

Publication Number Publication Date
WO2016055484A1 true WO2016055484A1 (en) 2016-04-14

Family

ID=51946909

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2015/073060 WO2016055484A1 (en) 2014-10-06 2015-10-06 Method and apparatus for vector encoding in video coding and decoding

Country Status (8)

Country Link
US (1) US11051037B2 (en)
EP (1) EP3205091B1 (en)
JP (1) JP6590918B2 (en)
KR (1) KR102076398B1 (en)
CN (1) CN106797464B (en)
GB (1) GB2531001B (en)
RU (2) RU2684200C2 (en)
WO (1) WO2016055484A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018076336A1 (en) * 2016-10-31 2018-05-03 富士通株式会社 Video decoding method, video decoding apparatus and electronic device
CN112204978A (en) * 2018-06-01 2021-01-08 夏普株式会社 Image decoding device and image encoding device
CN113170194A (en) * 2019-01-02 2021-07-23 北京字节跳动网络技术有限公司 Simplification of hash-based motion search
JPWO2020129698A1 (en) * 2018-12-21 2021-11-04 ソニーグループ株式会社 Image processing equipment and methods

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2981916C (en) * 2015-04-13 2021-08-31 Mediatek, Inc. Methods of constrained intra block copy for reducing worst case bandwidth in video coding
FR3068557A1 (en) * 2017-07-05 2019-01-04 Orange METHOD FOR ENCODING AND DECODING IMAGES, CORRESPONDING ENCODING AND DECODING DEVICE AND COMPUTER PROGRAMS
FR3062010A1 (en) 2017-07-05 2018-07-20 Orange METHODS AND DEVICES FOR ENCODING AND DECODING A DATA STREAM REPRESENTATIVE OF AN IMAGE SEQUENCE
FR3068558A1 (en) 2017-07-05 2019-01-04 Orange METHOD FOR ENCODING AND DECODING IMAGES, CORRESPONDING ENCODING AND DECODING DEVICE AND COMPUTER PROGRAMS
FI3808090T3 (en) * 2018-07-18 2024-09-18 Beijing Dajia Internet Information Tech Co Ltd Methods and apparatus of video coding using history-based motion vector prediction
CN112385228B (en) * 2018-08-03 2022-07-01 联发科技股份有限公司 Method and apparatus for enhanced intra block copy mode for video coding
WO2020108572A1 (en) * 2018-11-28 2020-06-04 Beijing Bytedance Network Technology Co., Ltd. Independent construction method for block vector list in intra block copy mode
CN113170195B (en) 2018-12-22 2024-09-03 北京字节跳动网络技术有限公司 Intra block copy mode with dual tree partitioning
WO2020156547A1 (en) 2019-02-02 2020-08-06 Beijing Bytedance Network Technology Co., Ltd. Buffer resetting for intra block copy in video coding
CN113366853B (en) 2019-02-02 2024-08-02 北京字节跳动网络技术有限公司 Buffer initialization for intra block copying in video codec
CN113545068B (en) 2019-03-01 2023-09-15 北京字节跳动网络技术有限公司 Order-based update for intra block copying in video codec
CN117395439A (en) 2019-03-01 2024-01-12 北京字节跳动网络技术有限公司 Direction-based prediction for intra block copying in video codec
KR20240132530A (en) 2019-03-04 2024-09-03 베이징 바이트댄스 네트워크 테크놀로지 컴퍼니, 리미티드 Implementation aspects in intra block copy in video coding
US11252442B2 (en) * 2019-04-08 2022-02-15 Tencent America LLC Method and apparatus for video coding
CA3146016C (en) 2019-07-06 2024-05-07 Beijing Bytedance Network Technology Co., Ltd. Virtual prediction buffer for intra block copy in video coding
CN114175633B (en) * 2019-07-10 2023-12-29 北京字节跳动网络技术有限公司 Sample identification for intra block copying in video codec
JP2022539887A (en) 2019-07-11 2022-09-13 北京字節跳動網絡技術有限公司 Bitstream Conformance Constraints for Intra-Block Copies in Video Coding
EP3991423A4 (en) * 2019-07-25 2022-09-07 Beijing Bytedance Network Technology Co., Ltd. Mapping restriction for intra-block copy virtual buffer
KR20220064968A (en) 2019-09-23 2022-05-19 베이징 바이트댄스 네트워크 테크놀로지 컴퍼니, 리미티드 Setting of intra block copy virtual buffer based on virtual pipeline data unit
CN115362674A (en) 2020-03-18 2022-11-18 抖音视界有限公司 Intra block copy buffer and palette predictor update

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2322770C2 (en) 2002-04-23 2008-04-20 Нокиа Корпорейшн Method and device for indication of quantizer parameters in video encoding system
CN101064849A (en) * 2006-04-29 2007-10-31 鲁海宁 Dynamic image coding method, apparatus and computer readable record medium
WO2011118486A1 (en) * 2010-03-23 2011-09-29 東レ株式会社 Separation membrane and method for producing same
US8837592B2 (en) * 2010-04-14 2014-09-16 Mediatek Inc. Method for performing local motion vector derivation during video coding of a coding unit, and associated apparatus
US8837577B2 (en) * 2010-07-15 2014-09-16 Sharp Laboratories Of America, Inc. Method of parallel video coding based upon prediction type
WO2012044707A1 (en) * 2010-10-01 2012-04-05 General Instrument Corporation Coding and decoding utilizing picture boundary variability in flexible partitioning
US20130121417A1 (en) * 2011-11-16 2013-05-16 Qualcomm Incorporated Constrained reference picture sets in wave front parallel processing of video data
US9332259B2 (en) 2012-01-18 2016-05-03 Qualcomm Incorporated Indication of use of wavefront parallel processing in video coding
US9838684B2 (en) 2012-04-11 2017-12-05 Qualcomm Incorporated Wavefront parallel processing for video coding
US10390034B2 (en) * 2014-01-03 2019-08-20 Microsoft Technology Licensing, Llc Innovations in block vector prediction and estimation of reconstructed sample values within an overlap area
US10477232B2 (en) * 2014-03-21 2019-11-12 Qualcomm Incorporated Search region determination for intra block copy in video coding
EP3160144B1 (en) 2014-06-20 2022-02-02 Sony Group Corporation Image encoding apparatus and method
CN111147846B (en) * 2014-07-07 2022-03-11 寰发股份有限公司 Video coding method using intra block copy mode coding
EP3917146A1 (en) * 2014-09-30 2021-12-01 Microsoft Technology Licensing, LLC Rules for intra-picture prediction modes when wavefront parallel processing is enabled
US10516882B2 (en) * 2015-01-29 2019-12-24 Vid Scale, Inc. Intra-block copy searching

Non-Patent Citations (12)

* Cited by examiner, † Cited by third party
Title
BALLE J ET AL: "Extended Texture Prediction for H.264/AVC Intra Coding", 1 September 2007 (2007-09-01), pages 93 - 96, XP008155518, Retrieved from the Internet <URL:http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4379529> DOI: 10.1109/ICIP.2007.4379529 *
CHI CHING CHI ET AL: "Parallel Scalability and Efficiency of HEVC Parallelization Approaches", IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, IEEE SERVICE CENTER, PISCATAWAY, NJ, US, vol. 22, no. 12, 1 December 2012 (2012-12-01), pages 1827 - 1838, XP011487165, ISSN: 1051-8215, DOI: 10.1109/TCSVT.2012.2223056 *
G.J. HAN ET AL: "Overview of the High Efficiency Video Coding (HEVC) Standard", IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1 January 2012 (2012-01-01), pages 1 - 1, XP055045358, ISSN: 1051-8215, DOI: 10.1109/TCSVT.2012.2221191 *
GORDON C ET AL: "Wavefront Parallel Processing for HEVC Encoding and Decoding", 6. JCT-VC MEETING; 97. MPEG MEETING; 14-7-2011 - 22-7-2011; TORINO; (JOINT COLLABORATIVE TEAM ON VIDEO CODING OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://WFTP3.ITU.INT/AV-ARCH/JCTVC-SITE/,, no. JCTVC-F274, 1 January 2011 (2011-01-01), XP030009297 *
KWON DO-KYOUNG ET AL: "Fast intra block copy (IntraBC) search for HEVC screen content coding", 2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), IEEE, 1 June 2014 (2014-06-01), pages 9 - 12, XP032624581, DOI: 10.1109/ISCAS.2014.6865052 *
LAI P ET AL: "AHG14: Intra Block Copy reference area for Wavefront Parallel Procsssing (WPP)", 19. JCT-VC MEETING; 17-10-2014 - 24-10-2014; STRASBOURG; (JOINT COLLABORATIVE TEAM ON VIDEO CODING OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://WFTP3.ITU.INT/AV-ARCH/JCTVC-SITE/,, no. JCTVC-S0101, 8 October 2014 (2014-10-08), XP030116850 *
LAI P ET AL: "Description of screen content coding technology proposal by MediaTek", 17. JCT-VC MEETING; 27-3-2014 - 4-4-2014; VALENCIA; (JOINT COLLABORATIVE TEAM ON VIDEO CODING OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://WFTP3.ITU.INT/AV-ARCH/JCTVC-SITE/,, no. JCTVC-Q0033-v4, 26 March 2014 (2014-03-26), XP030115920 *
LAROCHE G ET AL: "AHG14: On IBC constraint for Wavefront Parallel Processing", 19. JCT-VC MEETING; 17-10-2014 - 24-10-2014; STRASBOURG; (JOINT COLLABORATIVE TEAM ON VIDEO CODING OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://WFTP3.ITU.INT/AV-ARCH/JCTVC-SITE/,, no. JCTVC-S0070, 7 October 2014 (2014-10-07), XP030116810 *
LI B ET AL: "On WPP with palette mode and intra BC mode", 19. JCT-VC MEETING; 17-10-2014 - 24-10-2014; STRASBOURG; (JOINT COLLABORATIVE TEAM ON VIDEO CODING OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://WFTP3.ITU.INT/AV-ARCH/JCTVC-SITE/,, no. JCTVC-S0088, 7 October 2014 (2014-10-07), XP030116832 *
PANG C ET AL: "Non-RCE3: Intra Motion Compensation with 2-D MVs", 14. JCT-VC MEETING; 25-7-2013 - 2-8-2013; VIENNA; (JOINT COLLABORATIVE TEAM ON VIDEO CODING OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ); URL: HTTP://WFTP3.ITU.INT/AV-ARCH/JCTVC-SITE/,, no. JCTVC-N0256, 16 July 2013 (2013-07-16), XP030114776 *
RAPAKA KRISHNA ET AL: "Improved intra-block copy and motion search methods for screen content coding", PROCEEDINGS OF SPIE, S P I E - INTERNATIONAL SOCIETY FOR OPTICAL ENGINEERING, US, vol. 9599, 22 September 2015 (2015-09-22), pages 95991D - 95991D, XP060060838, ISSN: 0277-786X, ISBN: 978-1-62841-730-2, DOI: 10.1117/12.2193685 *
YU: "New Intra Prediction using Self-Frame MCP", 3. JVT MEETING; 60. MPEG MEETING; 06-05-2002 - 10-05-2002; FAIRFAX,US; (JOINT VIDEO TEAM OF ISO/IEC JTC1/SC29/WG11 AND ITU-T SG.16 ),, no. JVT-C151r1-L, 10 May 2002 (2002-05-10), XP030005267, ISSN: 0000-0442 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018076336A1 (en) * 2016-10-31 2018-05-03 富士通株式会社 Video decoding method, video decoding apparatus and electronic device
CN112204978A (en) * 2018-06-01 2021-01-08 夏普株式会社 Image decoding device and image encoding device
CN112204978B (en) * 2018-06-01 2024-03-15 夏普株式会社 Image decoding device and image encoding device
JPWO2020129698A1 (en) * 2018-12-21 2021-11-04 ソニーグループ株式会社 Image processing equipment and methods
JP7409320B2 (en) 2018-12-21 2024-01-09 ソニーグループ株式会社 Image processing device and method
CN113170194A (en) * 2019-01-02 2021-07-23 北京字节跳动网络技术有限公司 Simplification of hash-based motion search

Also Published As

Publication number Publication date
RU2018126868A3 (en) 2019-03-13
EP3205091A1 (en) 2017-08-16
US20180302645A1 (en) 2018-10-18
RU2684200C2 (en) 2019-04-04
JP2017535150A (en) 2017-11-24
GB201417634D0 (en) 2014-11-19
GB2531001A (en) 2016-04-13
KR102076398B1 (en) 2020-02-11
GB2531001B (en) 2019-06-05
JP6590918B2 (en) 2019-10-16
EP3205091B1 (en) 2023-05-17
US11051037B2 (en) 2021-06-29
RU2018126868A (en) 2019-03-13
KR20170063808A (en) 2017-06-08
RU2663348C1 (en) 2018-08-03
CN106797464B (en) 2020-10-30
CN106797464A (en) 2017-05-31

Similar Documents

Publication Publication Date Title
US11051037B2 (en) Method and apparatus for vector encoding in video coding and decoding
JP6882560B2 (en) Image prediction method and equipment
JP6931690B2 (en) How to encode content and arithmetic units
US10721469B2 (en) Line buffer reduction for adaptive loop filtering in video coding
US11044473B2 (en) Adaptive loop filtering classification in video coding
US10009615B2 (en) Method and apparatus for vector encoding in video coding and decoding
US20150350674A1 (en) Method and apparatus for block encoding in video coding and decoding
CN109565587B (en) Method and system for video encoding with context decoding and reconstruction bypass
WO2019010217A1 (en) Adaptive loop filter with enhanced classification methods
EP3560199A1 (en) Low-complexity sign prediction for video coding
WO2018175911A1 (en) Motion vector difference (mvd) prediction
GB2533905A (en) Method and apparatus for video coding and decoding
US10178405B2 (en) Enhanced coding and decoding using intra block copy mode
AU2013228045A1 (en) Method, apparatus and system for encoding and decoding video data
WO2015142833A1 (en) Method for motion estimation of non-natural video data
JP2024095835A (en) Shape adaptive discrete cosine transform for geometric partitioning with adaptive number of regions
GB2527354A (en) Method and apparatus for vector encoding in video coding and decoding

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15781605

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2017517079

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 15516856

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20177011152

Country of ref document: KR

Kind code of ref document: A

REEP Request for entry into the european phase

Ref document number: 2015781605

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2017115409

Country of ref document: RU

Kind code of ref document: A