WO2015077720A1 - Advanced screen content coding solution - Google Patents

Advanced screen content coding solution Download PDF

Info

Publication number
WO2015077720A1
WO2015077720A1 PCT/US2014/067155 US2014067155W WO2015077720A1 WO 2015077720 A1 WO2015077720 A1 WO 2015077720A1 US 2014067155 W US2014067155 W US 2014067155W WO 2015077720 A1 WO2015077720 A1 WO 2015077720A1
Authority
WO
WIPO (PCT)
Prior art keywords
color
color palette
specified
palette table
pixel
Prior art date
Application number
PCT/US2014/067155
Other languages
French (fr)
Inventor
Zhan MA
Wei Wang
Haoping Yu
Xian WANG
Jing Ye
Original Assignee
Futurewei Technologies, Inc.
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to JP2016533032A priority Critical patent/JP6294482B2/en
Priority to CA2931386A priority patent/CA2931386C/en
Priority to NZ720776A priority patent/NZ720776A/en
Priority to RU2016124544A priority patent/RU2646355C2/en
Priority to CN201480063141.6A priority patent/CN105745671B/en
Priority to BR112016011471-0A priority patent/BR112016011471B1/en
Priority to MX2016006612A priority patent/MX362406B/en
Priority to AU2014352656A priority patent/AU2014352656B2/en
Application filed by Futurewei Technologies, Inc., Huawei Technologies Co., Ltd. filed Critical Futurewei Technologies, Inc.
Priority to EP14864463.6A priority patent/EP3063703A4/en
Priority to KR1020167016238A priority patent/KR101972936B1/en
Priority to UAA201606679A priority patent/UA118114C2/en
Publication of WO2015077720A1 publication Critical patent/WO2015077720A1/en
Priority to IL245752A priority patent/IL245752B/en
Priority to HK16108372.3A priority patent/HK1220531A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/46Colour picture communication systems
    • H04N1/64Systems for the transmission or the storage of the colour picture signal; Details therefor, e.g. coding or decoding means therefor
    • H04N1/646Transmitting or storing colour television type signals, e.g. PAL, Lab; Their conversion into additive or subtractive colour signals or vice versa therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/174Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/186Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/93Run-length coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/0077Types of the still picture apparatus
    • H04N2201/0089Image display device

Definitions

  • the present disclosure is generally directed to screen content coding.
  • Screen content coding imposes new challenges for video compression technology because of its distinct signal characteristics compared with conventional natural videos.
  • There appear to be a few promising techniques for the advanced screen content coding e.g., pseudo string match, color palette coding, and intra motion compensation or intra block copy.
  • pseudo string match shows the highest gain for lossless coding, but with significant complexity overhead and difficulties on lossy coding mode.
  • the color palette coding is developed for screen content under the assumption that non-camera captured content typically contains a limited few distinct colors, rather than the continuous color tone in natural videos.
  • intra motion compensation or intra block copy was adopted into the working draft (WD) version 4 and reference software of on-going HEVC range extension (HEVC RExt) for screen content coding. This is mainly due to the fact that the motion estimation and compensation approach has been studied extensively over decades, as well as its idea and practical implementation is fairly easy (especially for hardware) .
  • This disclosure is directed to an advanced screen content coding solution.
  • a method for coding screen content into a bitstream selects a color palette table for a coding unit (CU) of screen content.
  • the color palette table created for the CU and a color palette table is created for a neighboring CU.
  • a color index map is created having indices for the coding unit (CU) of the screen content using the selected color palette table.
  • the selected color palette table and the color index map are encoded/compressed for each of a plurality of CUs into a bitstream.
  • FIGURE 1 illustrates a screen content encoding solution using color palette table and index map mode or palette mode according to one example embodiment of this disclosure
  • FIGURE 2 illustrates a screen content decoding solution for color palette table and index map mode or palette mode
  • FIGURE 3 illustrates a process or workflow of the screen content solution for this color palette table and index map mode or palette mode of a CU
  • FIGURE 4 illustrates a conventional G, B, R in planar mode (left) to Packed mode (right) ;
  • FIGURE 5 illustrates color palette table re-generation using neighboring reconstructed blocks
  • FIGURE 6 illustrates an index map is parsed from a real word screen content
  • FIGURE 7 illustrates a piece of a segment for a 1-D search after horizontal scanning
  • FIGURE 8 illustrates a U_PIXEL module
  • FIGURE 9 illustrates a U_ROW module
  • FIGURE 10 illustrates a U_CMP module
  • FIGURE 11 illustrates a U_COL module
  • FIGURE 12 illustrates a U_2D_BLOCK module
  • FIGURE 13 is an illustration of horizontal and vertical scan for index map processing of a exemplified CU
  • FIGURE 14A is an illustration of a 4:2:0 chroma sampling format
  • FIGURE 14B is an illustration of a 4:4:4 chroma sampling format;
  • FIGURE 15 illustrates an interpolation between 4:20 and 4:4:4;
  • FIGURE 16 illustrates Index Map processing with Upper/Left line buffer
  • FIGURE 17 illustrates the apparatus and methods/flows incorporated into the current HEVC
  • FIGURE 18 illustrates one example of a communication system
  • FIGURE 19A and FIGURE 19B illustrate example devices that may implement the methods and teachings according to this disclosure .
  • HEVC High-Efficiency Video Coding
  • HEVC Version 2 or HEVC RExt HEVC Version 2
  • This new solution includes several algorithms that are designed specifically for coding screen content. These algorithms include pixel representation using a color palette or a color table, referred to herein as a color palette table, color palette compression, color index map compression, string search, and residual compression.
  • This technology is developed, harmonized, and can be integrated with the HEVC range extension (RExt) and future HEVC extensions to support efficient screen content coding.
  • this technology could be implemented with any existing video standards.
  • HEVC RExt is used as an example in the description below, and HEVC RExt software is used to describe and demonstrate the compression efficiency.
  • This solution is integrated as an additional mode by using a color palette table and index map, defined herein as a color palette mode, in HEVC to demonstrate the performance.
  • FIGURE 1 shows an encoder 10 having a processor 12 including memory
  • FIGURE 2 shows a decoder 14 having a processor 16 and memory, together illustrating an example embodiment of an encoding and decoding solution for the color palette mode, respectively, in accordance with this disclosure.
  • the encoder 10 and decoder 14 each comprise a processor and memory and form a codec solution.
  • the codec solution includes the processor 12 of encoder 10 executing new algorithms or methods including Process 1 creating a Color Palette Table, Process 2 classifying colors or pixel values using a previously derived color palette table corresponding color indices, Process 3 encoding the Color Palette Table, Process 4 encoding the color index map, Process 5 encoding the residuals. and Process 6 writing new syntax elements into the compressed bitstream.
  • Processor 16 of decoder 14 executes new algorithms or methods including the reverse steps.
  • FIGURE 3 provides a process or workflow of the screen content solution according to this disclosure.
  • a coding unit is a basic operating unit in HEVC and HEVC RExt, which is a squared block of pixels consisting of three components (i.e., RGB, or YUV, or XYZ) .
  • the CPC method includes two major steps.
  • the processor 12 derives or generates a color palette table in the first step.
  • This table is ordered according to a histogram (i.e., occurrence frequency of each color value), or its actual color intensity, or any arbitrary method in order to increase the efficiency of the following encoding process.
  • a histogram i.e., occurrence frequency of each color value
  • each pixel in the original CU is converted to its color index within the color palette table.
  • a contribution of this disclosure is technology to efficiently encode, such as using compression, the color palette table and the color index map of each CU into the stream.
  • the processor 16 parses the compressed bitstream to reconstruct, for each CU, the complete color palette table and the color index map, and then further derive the pixel value at each position by combing the color index and color palette table.
  • the CU typically contains three chrominance (chroma) components (i.e., G, B, R, or Y, Cb, Cr, or X, Y Z) at a different sampling ratio (i.e., 4:4:4, 4:2:2, 4:2:0).
  • chroma chrominance
  • sequences of 4:4:4 are illustrated in the disclosure.
  • chroma upsampling could be applied to obtain the 4:4:4 sequences, or each color component could be processed independently.
  • the same procedure described in this disclosure can be applied.
  • 4:0:0 monochrome videos this can be treated as an individual plane of 4:4:4 without other two planes. All methods for 4:4:4 can be applied directly.
  • FIGURE 4 illustrates a conventional G, B, R in planar mode (left) to Packed mode (right) .
  • YUV or other color format could be processed in the same fashion as exemplified for RGB content .
  • Both the packed mode and the planar mode have its own advantage and disadvantage.
  • the planar mode supports parallel color component processing for G/B/R or Y/U/V.
  • the packed mode can share the header information (such as the color palette table and index map in this disclosure) for this CU among different color components.
  • R-D rate distortion
  • the enable_packed_component_flag is used to signal the encoding mode to the decoder explicitly.
  • enable_packed_component_flag at the CU level for low-level handling, it can be duplicated in slice header or even sequence level (e.g., Sequence Parameter Set or Picture Parameter Set) to allow slice level or sequence level handling, depending on the specific application requirement.
  • sequence level e.g., Sequence Parameter Set or Picture Parameter Set
  • each pixel is mapped to the corresponding color index to form the index map of the current CU.
  • index map is described in the subsequent section .
  • each color or chrominance component can have its individual color palette table, such as colorTable_Y, colorTable_U, colorTable_V or colorTable_R, colorTable_G, colorTable_B, naming a few here as an example.
  • the color palette table for a major component can be derived, such as Y in YUV or G in GBR, and shared for all components. Normally, by this sharing, other color components, other than Y or G, would have some mismatch relative to its original pixel colors from those shared in color palette table.
  • the residual engine (such as HEVC coefficients coding methods) is then applied to encode those mismatched residuals.
  • a single color palette table is shared among all components.
  • a pseudo code is provided to exemplify the color palette table and index map derivation as follows:
  • colorTablelntensity [ j ++] colorHist [i] ;
  • idxMap[pos] idx
  • color palette table processing involves the processor 12 encoding of the size of a color palette table (i.e., the total number of distinct colors) and each color itself. A majority of the bits are consumed by the encoding of each color in a color palette table. Hence, focus is placed on the color encoding (or encoding of each entry in color palette table) .
  • a Neighboring Color palette table Merge where a color_table_merge_flag is defined to indicate whether the current CU uses the color palette table from its left or upper CU. If not, the current CU will carry the color palette table signaling explicitly.
  • another color_table_merge_direction indicates the merging direction either from upper or from left CU.
  • the candidates could be more than current upper or left CU, e.g. upper-left, upper-right and etc.
  • the upper and left CU are used in this disclosure to exemplify the idea.
  • each pixel is compared with the entries in an existing color palette table and assigned an index yielding the least prediction difference (i.e., pixel subtracts the closest color in color palette table) via deriveldxMap ( ) .
  • the prediction difference is non-zero
  • all these residuals are encoded using the HEVC RExt residual engine. Note that whether using the merging process or not can be decided by the R-D cost.
  • the color palette table of neighbor CUs are generated upon the available reconstructed pixels, regardless of CU depth, size and etc. For each CU, the reconstructions are retrieved for its neighboring CU at the same size and same depth (assuming the color similarity would be higher in this case) .
  • Constrained Encoder Only Process for this method, the merging process occurs when a current CU shares the same size and depth as its upper and/or left CU.
  • the color palette tables of the available neighbors are used to derive the color index map of the current CU for subsequent operations. For example, for a current 16x16 CU, if its neighboring CU, i.e., either upper or left placed, are encoded using the color palette table and index method, its color palette table is used for the current CU directly to derive the R-D cost. This merge cost is compared with the case that the current CU derives its color palette table explicitly (as well as other conventional modes existing in the HEVC or HEVC RExt) .
  • FIGURE 6 an index map is parsed from a real word screen content.
  • FIGURE 7 shows a piece of a segment after a 1-D search (i.e., just beginning of this index map) .
  • pldx[j+len] ! pldx[len+uildx] break
  • uildx uildx + maxLen
  • a 4-pixel running hash structure is described in this disclosure.
  • a running hash is calculated for every pixel at horizontal direction to generate a horizontal hash array running_hash_h [ ] .
  • Another running hash is calculated on top of running_hash_h [ ] to generate a 2D hash array running_hash_hv[ ] .
  • Each value match in this 2D hash array represents a 4x4 block match.
  • To perform a 2D match as many as 4x4 block matches are to be found before performing pixel-wised comparison to their neighbors . Since pixel wised comparison is limited to 1-3 pixels, the search speed can be increased dramatically.
  • each row has to be processed separately.
  • a block based algorithm is disclosed, which can be used in both a hardware and software implementation. Much similar to standard motion estimation, this algorithm processes one rectangle block at a time.
  • U_PIXEL The basic unit in this design is called U_PIXEL, as shown in FIGURE 8.
  • the coded signal is a flag that indicates if the reference pixel has already been encoded from previous string match operation.
  • the input signal Cmp[n-1] can be forced to w 0", which allows removal of the last "OR" gate from U_PIXEL module.
  • the first step is to process each row in parallel.
  • Each pixel in one row of the rectangle is assigned to one U_PIXEL block; this processing unit is called U_ROW.
  • An example of the processing unit for the first row is shown in FIGURE 9.
  • the next step is to process each column of the cmp array in parallel.
  • Each cmp in a column of the cmp array is processed by processing unit U_COL, as shown in FIGURE 11.
  • r_width[n] the number of zeros in each row of rw[n] [0-3] is then counted and the 4 results are recorded to array r_width[n] .
  • r_width[n] equals to rwidth[n] in step #7.
  • l_width[n] is generated in the same fashion.
  • the min_width array in step #7 can be obtained as ⁇ ⁇ l_width[l] , r_width[l] ⁇ , ⁇ l_width[2] , r_width[2] ⁇ , ⁇ l_width[3], r_width[3] ⁇ ... ⁇
  • This hardware architecture can be modified to fit in the parallel processing framework of any modern CPU/DSP/GPU.
  • a simplified pseudo-code for fast software implementation is listed below.
  • tmp2 tmpl[0]
  • RW[0] [x] CMP[0] [x] ;
  • RW[y] [x] CMP[y] [x]
  • R_WIDTH[y] LZD (RW [y] [ 0 ] , RW[y][l], RW[y][2],
  • This method can also apply to a ID search if the number of rows is limited to one.
  • a simplified pseudo-code for fast software implementation of fix length based ID search is listed below.
  • tmp2 tmpl[0]
  • RW [x] C[x] I RW [x-1]
  • R_WIDTH LZD(RW[0], RW[1], RW[2], RW[3]);
  • next starting location is calculated using current_location + length if the previous match is a ID match, or current_location + (lwidth+rwidth) if the previous match is a 2D match.
  • ID match if any to-be-matched-pixel falls into any previous 2D match region where its location has already been covered by a 2D match, the next pixels will be scanned through until a pixel location is found where it has not been coded by previous match.
  • an entropy engine is applied to convert these symbols into the binary stream.
  • an entropy engine is applied to convert these symbols into the binary stream. Exemplified here is the idea of using the equal probability context mode. An advanced adaptive context mode could be applied as well for better compression efficiency.
  • encodeEPs (pldx[uildx] , uilndexBits) ; uildx++ ;
  • encodeEPs (pDist [uildx] , uiDistBits ) ;
  • parseEPs ( uiSymbol, uilndexBits ) ;
  • pldx[uildx] uiSymbol
  • parseEPs ( uiSymbol, uiLenBits);
  • index or delta output they usually contain limited number of unique value under certain encoding mode.
  • This disclosure introduces a second delta palette table to utilize this observation.
  • This delta palette table can be built after all literal data are obtained in this CU, it will be signaled explicitly in the bit stream. Alternatively, it can be built adaptively during the coding process, so that the table does not have to be included in the bit stream.
  • a delta_color_table_adaptive_flag is defined for this choice.
  • Another advanced scheme is provided, called Neighboring Delta Color palette table Merge.
  • an encoder can use a delta palette from top or left CU as the initial starting point.
  • the encoder can also use a delta palette from top or left CU and compare the RD cost among top, left and current CU.
  • a delta_color_table_merge_flag is defined to indicate whether a current CU uses the delta color palette table from its left or upper CU.
  • delta_color_table_merge_flag if delta_color_table_merge_flag is asserted, another delta_color_table_merge_direction is defined to indicate whether the merge candidate is from either upper or left CU.
  • An example of an encoding process for an adaptive delta palette generation is shown as follows. At a decoding side, whenever a decoder receives a literal data, it regenerates a delta palette based on reverse steps.
  • a mask flag is used to separate the text section and graphics section.
  • the text section is compressed by the above described method; the graphics section is compressed by another compression method.
  • the index map has to be compressed losslessly. This allows the efficient processing using a ID or a 2D string match.
  • the ID or the 2D string match is constrained at current LCU, but the search window can extend beyond the current LCU.
  • the matched distance can be encoded using a pair of motion vector in horizontal and vertical directions, i.e.,
  • the ID search can be allowed in either horizontal or vertical directions by defining the color_idx_map_pred_direction indicator.
  • the optimal index scanning direction can be made based on the R-D cost.
  • FIGURE 6 shows the scanning directions, starting from the very first position. Further illustrated is the horizontal and vertical scanning pattern in FIGURE 9.
  • the deriveMatchPairs ( ) and associated entropy coding steps are performed twice for both the horizontal and the vertical scanning pattern. Then, the final scanning direction is chosen with the smallest RD cost.
  • the color palette table and a pair of matched information for the color index map are encoded. They are encoded using fixed length binarization. Alternatively, variable-length binarization can be used.
  • the max value can be used to bound its binarization, given the constrained implementation of this approach within the area of the current CU.
  • the residual coding could be significantly improved by a different binarization method.
  • transform coefficient is binarization using the variable length codes at the assumption that the residual magnitude should be small after prediction, transform and quantization.
  • residuals with larger and random value (not close to "1", "2", "0" relative smaller value). If the current HEVC coefficients binarization are used, it turns out to yield a very long code word.
  • using the fixed length binarization saves the code length for the residuals produced by the color palette table and index coding mode.
  • the foregoing provides various techniques for high- efficiency screen content coding under the framework of the HEVC/HE C-RExt .
  • mixed content is treated with 4:4:4 chroma sampling.
  • the 4:2:0 chroma sampling may be sufficient to provide perceptual lossless quality. This is due to the fact that the human vision system is less sensitive to the spatial changes in chroma components compared with that from the luma components.
  • sub-sampling typically is performed on the chroma part (e.g., the popular 4:2:0 video format) to achieve noticeable bit rate reduction while maintaining same reconstructed quality.
  • the present disclosure provides a new flag (i.e., enable_chroma_subsampling) that is defined and signaled at the CU level recursively. For each CU, the encoder determines whether it is being coded using 4:2:0 or 4:4:4 according to the rate- distortion cost.
  • enable_chroma_subsampling i.e., enable_chroma_subsampling
  • FIGURE 14A and FIGURE 14B Shown in FIGURE 14A and FIGURE 14B are the 4:2:0 and 4:4:4 chroma sampling formats.
  • the rate-distortion cost is derived when encoding the CU at 4:2:0 space and comparing it with the cost when encoding the CU at 4:4:4. Whichever encoding gives the less rate-distortion cost will be chosen for the final encoding.
  • FIGURE 15 Illustrated in FIGURE 15 is the interpolation process from 4:4:4 to 4:2:0 and vice versa. Usually this video color sampling format conversion process requires a large number of interpolation filters.
  • an HEVC interpolation filter i.e., DCT-IF
  • DCT-IF HEVC interpolation filter
  • the process starts with the grey "circles” in the chroma components, the half-pel positions are interpolated horizontally to obtain all "circles,” and then the "squared box” are interpolated using DCT-IF vertically. All the interpolated "squared box” are chosen to form the reconstructed 4:4:4 source.
  • enable_packed_component_flag is used to indicate whether current CU uses its packed format or conventional planar format for encoding the processing. Whether to enable a packed format could depend on the R-D cost calculated at the encoder.
  • a low-complexity solution is achieved by analyzing the histogram of the CU and finding the best threshold for the decision, as shown in FIGURE 3.
  • Index map encoding direction could be determined by the R-D optimization, or using the local spatial orientation (such as sobel operator based direction estimation) .
  • the line buffer from its upper and left CU can be used, as shown in the FIGURE 16.
  • the search can be extended to further improve the coding efficiency.
  • upper/left buffers are formed using the reconstructed pixels from neighboring CUs, these pixels (as well as its corresponding indices) are available for reference before processing current CU index map.
  • the current CU index map could be 14, 14, 14, 1, 2, 1 (as ID presentation) .
  • the first "14" will be coded as an unmatched pair.
  • the string match can start at the very first pixel, as shown below (horizontal and vertical scanning patterns are shown as well) .
  • Coding unit syntax coding_unit ( xO, yO, log2CbSize ) ⁇ Descriptor if( transquant_bypass_enabled_flag )
  • FIGURE 17 illustrates the apparatus and methods/flows incorporated into the current HEVC .
  • FIGURE 18 illustrates an example communication system 100 that uses signaling to support advanced wireless receivers according to this disclosure.
  • the system 100 enables multiple wireless users to transmit and receive data and other content.
  • the system 100 may implement one or more channel access methods, such as code division multiple access (CDMA) , time division multiple access (TDMA) , frequency division multiple access (FDMA) , orthogonal FDMA (OFDMA) , or single-carrier FDMA (SC-FDMA) .
  • CDMA code division multiple access
  • TDMA time division multiple access
  • FDMA frequency division multiple access
  • OFDMA orthogonal FDMA
  • SC-FDMA single-carrier FDMA
  • the communication system 100 includes user equipment (UE) llOa-llOc, radio access networks (RANs) 120a- 120b, a core network 130, a public switched telephone network (PSTN) 140, the Internet 150, and other networks 160. While certain numbers of these components or elements are shown in FIGURE 18, any number of these components or elements may be included
  • the UEs llOa-llOc are configured to operate and/or communicate in the system 100.
  • the UEs llOa-llOc are configured to transmit and/or receive wireless signals or wired signals.
  • Each UE llOa-llOc represents any suitable end user device and may include such devices (or may be referred to) as a user equipment/device (UE) , wireless transmit/receive unit (WTRU) , mobile station, fixed or mobile subscriber unit, pager, cellular telephone, personal digital assistant (PDA) , smartphone, laptop, computer, touchpad, wireless sensor, or consumer electronics device.
  • UE user equipment/device
  • WTRU wireless transmit/receive unit
  • PDA personal digital assistant
  • the RANs 120a-120b here include base stations 170a- 170b, respectively.
  • Each base station 170a-170b is configured to wirelessly interface with one or more of the UEs llOa-llOc to enable access to the core network 130, the PSTN 140, the Internet 150, and/or the other networks 160.
  • the base stations 170a-170b may include (or be) one or more of several well-known devices, such as a base transceiver station (BTS) , a Node-B (NodeB) , an evolved NodeB (eNodeB) , a Home NodeB, a Home eNodeB, a site controller, an access point (AP) , or a wireless router, or a server, router, switch, or other processing entity with a wired or wireless network.
  • BTS base transceiver station
  • NodeB Node-B
  • eNodeB evolved NodeB
  • AP access point
  • AP access point
  • AP access point
  • a wireless router or a server, router, switch, or other processing entity with a wired or wireless network.
  • the base station 170a forms part of the RAN 120a, which may include other base stations, elements, and/or devices.
  • the base station 170b forms part of the RAN 120b, which may include other base stations, elements, and/or devices.
  • Each base station 170a-170b operates to transmit and/or receive wireless signals within a particular geographic region or area, sometimes referred to as a "cell.”
  • MIMO multiple-input multiple-output
  • the base stations 170a-170b communicate with one or more of the UEs llOa-llOc over one or more air interfaces 190 using wireless communication links.
  • the air interfaces 190 may utilize any suitable radio access technology.
  • the system 100 may use multiple channel access functionality, including such schemes as described above.
  • the base stations and UEs implement LTE, LTE-A, and/or LTE-B.
  • LTE Long Term Evolution
  • LTE-A Long Term Evolution
  • LTE-B Long Term Evolution-B
  • the RANs 120a-120b are in communication with the core network 130 to provide the UEs llOa-llOc with voice, data, application, Voice over Internet Protocol (VoIP) , or other services. Understandably, the RANs 120a-120b and/or the core network 130 may be in direct or indirect communication with one or more other RANs (not shown) .
  • the core network 130 may also serve as a gateway access for other networks (such as PSTN 140, Internet 150, and other networks 160) .
  • some or all of the UEs llOa-llOc may include functionality for communicating with different wireless networks over different wireless links using different wireless technologies and/or protocols.
  • FIGURE 18 illustrates one example of a communication system
  • the communication system 100 could include any number of UEs, base stations, networks, or other components in any suitable configuration, and can further include the EPC illustrated in any of the figures herein.
  • FIGURES 19A and 19B illustrate example devices that may implement the methods and teachings according to this disclosure.
  • FIGURE 19A illustrates an example UE 110
  • FIGURE 19B illustrates an example base station 170. These components could be used in the system 100 or in any other suitable system.
  • the UE 110 includes at least one processing unit 200.
  • the processing unit 200 implements various processing operations of the UE 110.
  • the processing unit 200 could perform signal coding, data processing, power control, input/output processing, or any other functionality enabling the UE 110 to operate in the system 100.
  • the processing unit 200 also supports the methods and teachings described in more detail above.
  • Each processing unit 200 includes any suitable processing or computing device configured to perform one or more operations.
  • Each processing unit 200 could, for example, include a microprocessor, microcontroller, digital signal processor, field programmable gate array, or application specific integrated circuit.
  • the UE 110 also includes at least one transceiver 202.
  • the transceiver 202 is configured to modulate data or other content for transmission by at least one antenna 204.
  • the transceiver 202 is also configured to demodulate data or other content received by the at least one antenna 204.
  • Each transceiver 202 includes any suitable structure for generating signals for wireless transmission and/or processing signals received wirelessly.
  • Each antenna 204 includes any suitable structure for transmitting and/or receiving wireless signals.
  • One or multiple transceivers 202 could be used in the UE 110, and one or multiple antennas 204 could be used in the UE 110.
  • a transceiver 202 could also be implemented using at least one transmitter and at least one separate receiver.
  • the UE 110 further includes one or more input/output devices 206.
  • the input/output devices 206 facilitate interaction with a user.
  • Each input/output device 206 includes any suitable structure for providing information to or receiving information from a user, such as a speaker, microphone, keypad, keyboard, display, or touch screen.
  • the UE 110 includes at least one memory 208.
  • the memory 208 stores instructions and data used, generated, or collected by the UE 110.
  • the memory 208 could store software or firmware instructions executed by the processing unit(s) 200 and data used to reduce or eliminate interference in incoming signals.
  • Each memory 208 includes any suitable volatile and/or non-volatile storage and retrieval device (s). Any suitable type of memory may be used, such as random access memory (RAM) , read only memory (ROM) , hard disk, optical disc, subscriber identity module (SIM) card, memory stick, secure digital (SD) memory card, and the like.
  • RAM random access memory
  • ROM read only memory
  • SIM subscriber identity module
  • SD secure digital
  • the base station 170 includes at least one processing unit 250, at least one transmitter 252, at least one receiver 254, one or more antennas 256, and at least one memory 258.
  • the processing unit 250 implements various processing operations of the base station 170, such as signal coding, data processing, power control, input/output processing, or any other functionality.
  • the processing unit 250 can also support the methods and teachings described in more detail above.
  • Each processing unit 250 includes any suitable processing or computing device configured to perform one or more operations.
  • Each processing unit 250 could, for example, include a microprocessor, microcontroller, digital signal processor, field programmable gate array, or application specific integrated circuit .
  • Each transmitter 252 includes any suitable structure for generating signals for wireless transmission to one or more UEs or other devices.
  • Each receiver 254 includes any suitable structure for processing signals received wirelessly from one or more UEs or other devices. Although shown as separate components, at least one transmitter 252 and at least one receiver 254 could be combined into a transceiver.
  • Each antenna 256 includes any- suitable structure for transmitting and/or receiving wireless signals. While a common antenna 256 is shown here as being coupled to both the transmitter 252 and the receiver 254, one or more antennas 256 could be coupled to the transmitter (s) 252, and one or more separate antennas 256 could be coupled to the receiver (s) 254.
  • Each memory 258 includes any suitable volatile and/or non-volatile storage and retrieval device(s).

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)
  • Image Processing (AREA)

Abstract

A method and device for coding screen content into a bitstream by selecting a color palette table for a coding unit (CU) of screen content, creating a color index map having indices for the coding unit (CU), and encoding the selected color palette table and the color index map for the CU into a bitstream.

Description

ADVANCED SCREEN CONTENT CODING SOLUTION
TECHNICAL FIELD
[0001] The present disclosure is generally directed to screen content coding.
BACKGROUND
[0002] Screen content coding imposes new challenges for video compression technology because of its distinct signal characteristics compared with conventional natural videos. There appear to be a few promising techniques for the advanced screen content coding, e.g., pseudo string match, color palette coding, and intra motion compensation or intra block copy.
[0003] Among these techniques, pseudo string match shows the highest gain for lossless coding, but with significant complexity overhead and difficulties on lossy coding mode. The color palette coding is developed for screen content under the assumption that non-camera captured content typically contains a limited few distinct colors, rather than the continuous color tone in natural videos. Even though the pseudo string match and color palette coding methods showed great potential, intra motion compensation or intra block copy was adopted into the working draft (WD) version 4 and reference software of on-going HEVC range extension (HEVC RExt) for screen content coding. This is mainly due to the fact that the motion estimation and compensation approach has been studied extensively over decades, as well as its idea and practical implementation is fairly easy (especially for hardware) .
[0004] However, the coding performance of intra block copy is bounded because of its fixed block structure partitions. On the other hand, performing block matching, something similar to motion estimation in intra picture, also brings up the encoder complexity significantly on both computing and memory access. SUMMARY
[0005] This disclosure is directed to an advanced screen content coding solution.
[0006] In one example embodiment, a method for coding screen content into a bitstream selects a color palette table for a coding unit (CU) of screen content. The color palette table created for the CU and a color palette table is created for a neighboring CU. A color index map is created having indices for the coding unit (CU) of the screen content using the selected color palette table. The selected color palette table and the color index map are encoded/compressed for each of a plurality of CUs into a bitstream.
BRIEF DESCRIPTION OF THE FIGURES
[0007] For a more complete understanding of the present disclosure, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, wherein like numbers designate like objects, and in which:
[0008] FIGURE 1 illustrates a screen content encoding solution using color palette table and index map mode or palette mode according to one example embodiment of this disclosure;
[0009] FIGURE 2 illustrates a screen content decoding solution for color palette table and index map mode or palette mode;
[0010] FIGURE 3 illustrates a process or workflow of the screen content solution for this color palette table and index map mode or palette mode of a CU;
[0011] FIGURE 4 illustrates a conventional G, B, R in planar mode (left) to Packed mode (right) ;
[0012] FIGURE 5 illustrates color palette table re-generation using neighboring reconstructed blocks;
[0013] FIGURE 6 illustrates an index map is parsed from a real word screen content;
[0014] FIGURE 7 illustrates a piece of a segment for a 1-D search after horizontal scanning;
[0015] FIGURE 8 illustrates a U_PIXEL module;
[0016] FIGURE 9 illustrates a U_ROW module;
[0017] FIGURE 10 illustrates a U_CMP module;
[0018] FIGURE 11 illustrates a U_COL module;
[0019] FIGURE 12 illustrates a U_2D_BLOCK module;
[0020] FIGURE 13 is an illustration of horizontal and vertical scan for index map processing of a exemplified CU;
[0021] FIGURE 14A is an illustration of a 4:2:0 chroma sampling format;
[0022] FIGURE 14B is an illustration of a 4:4:4 chroma sampling format; [0023] FIGURE 15 illustrates an interpolation between 4:20 and 4:4:4;
[0024] FIGURE 16 illustrates Index Map processing with Upper/Left line buffer;
[0025] FIGURE 17 illustrates the apparatus and methods/flows incorporated into the current HEVC;
[0026] FIGURE 18 illustrates one example of a communication system; and
[0027] FIGURE 19A and FIGURE 19B illustrate example devices that may implement the methods and teachings according to this disclosure .
DETAILED DESCRIPTION
[0028] In this disclosure, an advanced screen content coding solution is described that outperforms a High-Efficiency Video Coding (HEVC) range extension (such as HEVC Version 2 or HEVC RExt) . This new solution includes several algorithms that are designed specifically for coding screen content. These algorithms include pixel representation using a color palette or a color table, referred to herein as a color palette table, color palette compression, color index map compression, string search, and residual compression. This technology is developed, harmonized, and can be integrated with the HEVC range extension (RExt) and future HEVC extensions to support efficient screen content coding. However, this technology could be implemented with any existing video standards. For simplicity, HEVC RExt is used as an example in the description below, and HEVC RExt software is used to describe and demonstrate the compression efficiency. This solution is integrated as an additional mode by using a color palette table and index map, defined herein as a color palette mode, in HEVC to demonstrate the performance.
[0029] The concept and description of this disclosure is illustrated in the Figures. FIGURE 1 shows an encoder 10 having a processor 12 including memory, and FIGURE 2 shows a decoder 14 having a processor 16 and memory, together illustrating an example embodiment of an encoding and decoding solution for the color palette mode, respectively, in accordance with this disclosure. As shown, the encoder 10 and decoder 14 each comprise a processor and memory and form a codec solution. The codec solution includes the processor 12 of encoder 10 executing new algorithms or methods including Process 1 creating a Color Palette Table, Process 2 classifying colors or pixel values using a previously derived color palette table corresponding color indices, Process 3 encoding the Color Palette Table, Process 4 encoding the color index map, Process 5 encoding the residuals. and Process 6 writing new syntax elements into the compressed bitstream. Processor 16 of decoder 14 executes new algorithms or methods including the reverse steps. FIGURE 3 provides a process or workflow of the screen content solution according to this disclosure.
[0030] Basically, a high-efficiency color palette compression method (CPC) is performed on each coding unit (CU) . A coding unit is a basic operating unit in HEVC and HEVC RExt, which is a squared block of pixels consisting of three components (i.e., RGB, or YUV, or XYZ) .
[0031] At each CU level, the CPC method includes two major steps. First, the processor 12 derives or generates a color palette table in the first step. This table is ordered according to a histogram (i.e., occurrence frequency of each color value), or its actual color intensity, or any arbitrary method in order to increase the efficiency of the following encoding process. Based on the derived color palette table, each pixel in the original CU is converted to its color index within the color palette table. A contribution of this disclosure is technology to efficiently encode, such as using compression, the color palette table and the color index map of each CU into the stream. At the receiver side, the processor 16 parses the compressed bitstream to reconstruct, for each CU, the complete color palette table and the color index map, and then further derive the pixel value at each position by combing the color index and color palette table.
[0032] In an illustrative example of this disclosure, assume a CU with NxN pixels (N=8, 16, 32, 64 for compatibility with HEVC) . The CU typically contains three chrominance (chroma) components (i.e., G, B, R, or Y, Cb, Cr, or X, Y Z) at a different sampling ratio (i.e., 4:4:4, 4:2:2, 4:2:0). For simplicity, sequences of 4:4:4 are illustrated in the disclosure. For sequences of 4:2:2 and 4:2:0 videos, chroma upsampling could be applied to obtain the 4:4:4 sequences, or each color component could be processed independently. Then, the same procedure described in this disclosure can be applied. For 4:0:0 monochrome videos, this can be treated as an individual plane of 4:4:4 without other two planes. All methods for 4:4:4 can be applied directly.
Packed or Planar
[0033] This method is shown for the Block CTU or CU in FIGURE 1. First of all, a flag is defined, called the enable_packed_component_flag, for each CU to indicate whether the current CU is processed using packed fashion or conventional planar mode (i.e., G, B, R or Y, U, V components are processed independently) . FIGURE 4 illustrates a conventional G, B, R in planar mode (left) to Packed mode (right) . YUV or other color format could be processed in the same fashion as exemplified for RGB content .
[0034] Both the packed mode and the planar mode have its own advantage and disadvantage. For instance, the planar mode supports parallel color component processing for G/B/R or Y/U/V. However, it might suffer the low coding efficiency. The packed mode can share the header information (such as the color palette table and index map in this disclosure) for this CU among different color components. However, it might break the parallelism. An easy way to decide whether the current CU should be encoded in the packed fashion is to measure rate distortion (R-D) cost. The enable_packed_component_flag is used to signal the encoding mode to the decoder explicitly.
[0035] In addition, to define the enable_packed_component_flag at the CU level for low-level handling, it can be duplicated in slice header or even sequence level (e.g., Sequence Parameter Set or Picture Parameter Set) to allow slice level or sequence level handling, depending on the specific application requirement. Color palette table and Index Map Derivation
[0036] As shown in FIGURE 1, for Processes 1 and 3, for each CU, pixel locations are transversed and the color palette table and index map for the subsequent processing is derived. Each distinct color is ordered in the color palette table, depending on either its histogram (i.e., frequency of occurrence), or its intensity, or any arbitrary method in order to increase the efficiency of the following encoding process. For example, if the encoding process uses a differential pulse-code modulation (DPCM) method to code the difference between adjacent pixels, the optimal coding result can be obtained if the adjacent pixels are assigned with adjacent color index in Color palette table.
[0037] After obtaining the color palette table, each pixel is mapped to the corresponding color index to form the index map of the current CU. The processing of index map is described in the subsequent section .
[0038] For a conventional planar CU, each color or chrominance component can have its individual color palette table, such as colorTable_Y, colorTable_U, colorTable_V or colorTable_R, colorTable_G, colorTable_B, naming a few here as an example. Meanwhile, the color palette table for a major component can be derived, such as Y in YUV or G in GBR, and shared for all components. Normally, by this sharing, other color components, other than Y or G, would have some mismatch relative to its original pixel colors from those shared in color palette table. The residual engine (such as HEVC coefficients coding methods) is then applied to encode those mismatched residuals. On the other hand, for a packed CU, a single color palette table is shared among all components.
[0039] A pseudo code is provided to exemplify the color palette table and index map derivation as follows:
deriveColorTablelndexMap ( )
{
deriveColorTable ( ) ;
deriveIndexMap ( ) ;
} deriveColorTable (src, cuWidth, cuHeight, maxColorNum) {
// src - input video source in planar or packed mode
// cuWidth, cuHeight - width and height of current CU
/* maxColorNum - max num of colors allowed in color table*/
/*transverse */
//
// memset (colorHist , 0,
(l«bitDepth) *sizeof (UINT) )
pos=0 ;
cuSize=cuWidth*cuHeight;
while (pos<cuSize) {
colorHist [src [pos++] ] ++;
} /*just pick non-zero entry in colorHist [] for color intensity ordered table*/
j=0;
for (i=0 ; i< (l«bitDepth) ; i++)
{
if (colorHist [i] !=0)
colorTablelntensity [ j ++] = colorHist [i] ;
}
colorNum=j ;
/*quicksort for histgram*/ colorTableHist = quicksort (colorTablelntensity, colorNum) ; /*if maxColorNum >= colorNum, all colors will be picked*/
/*if maxColorNum < colorNum, only maxColorNum colors will be picked for colorTableHist . In this case, all pixels will find its best matched color and corresponding index with difference (actual pixel and its corresponding color) coded by the residual engine.*/
/*Best number of colors in color table could be determined by iterative R-D cost derivation!*/
} derivelndexMa ( )
{
pos=0;
cuSize=cuWidth*cuHeight ;
while ( pos < cuSize)
{
minErr=MAX_UIN ;
for (i=0; i<colorNum; i++)
{
err = abs(src[pos] - colorTable [i] ) ;
if (err<minErr)
{
minErr = err;
idx = i ;
}
}
idxMap[pos] = idx;
}
Color palette table Processing [0040] For Process 1 in FIGURE 1, color palette table processing involves the processor 12 encoding of the size of a color palette table (i.e., the total number of distinct colors) and each color itself. A majority of the bits are consumed by the encoding of each color in a color palette table. Hence, focus is placed on the color encoding (or encoding of each entry in color palette table) .
[0041] The most straightforward method to encode the colors in a color palette table is using the pulse-code modulation (PCM) style algorithm where each color is coded independently. Alternatively, the nearest prediction for a successive color can be applied, and then the prediction delta can be encoded rather than the default color intensity, which is DPCM (differential PCM) style. Both methods can be later entropy encoded using equal probability model or adaptive context model, depending on the trade-off between complexity costs and coding efficiency.
[0042] Here, another advanced scheme is disclosed, called a Neighboring Color palette table Merge, where a color_table_merge_flag is defined to indicate whether the current CU uses the color palette table from its left or upper CU. If not, the current CU will carry the color palette table signaling explicitly. For the merging process, another color_table_merge_direction indicates the merging direction either from upper or from left CU. Of course, the candidates could be more than current upper or left CU, e.g. upper-left, upper-right and etc. However, the upper and left CU are used in this disclosure to exemplify the idea. For any of which, each pixel is compared with the entries in an existing color palette table and assigned an index yielding the least prediction difference (i.e., pixel subtracts the closest color in color palette table) via deriveldxMap ( ) . For the case where the prediction difference is non-zero, all these residuals are encoded using the HEVC RExt residual engine. Note that whether using the merging process or not can be decided by the R-D cost.
[0043] There are several ways to generate the neighboring color palette tables for being used in the merging process in coding the current CU. Depending on its implementation, one of them requires updating at both the encoder and the decoder and the other one is an encoder side process only.
[0044] Updating at both the encoder and the decoder: In this method, the color palette table of neighbor CUs are generated upon the available reconstructed pixels, regardless of CU depth, size and etc. For each CU, the reconstructions are retrieved for its neighboring CU at the same size and same depth (assuming the color similarity would be higher in this case) . For example, if a current CU is 16x16 with depth = 2 , no matter the partition of its neighboring CUs (for example 8x8 with depth = 3 for left CU and 32x32 with depth =1 for upper CU) , the pixel offset (=16) will be located from the current CU origin to the left to process the left 16x16 block and to the upper for the upper 16x16 block, as shown in the FIGURE 5. Note that both the encoder and the decoder should maintain this process.
[0045] Constrained Encoder Only Process: for this method, the merging process occurs when a current CU shares the same size and depth as its upper and/or left CU. The color palette tables of the available neighbors are used to derive the color index map of the current CU for subsequent operations. For example, for a current 16x16 CU, if its neighboring CU, i.e., either upper or left placed, are encoded using the color palette table and index method, its color palette table is used for the current CU directly to derive the R-D cost. This merge cost is compared with the case that the current CU derives its color palette table explicitly (as well as other conventional modes existing in the HEVC or HEVC RExt) . Whichever produces the less R-D cost is chosen as the final mode to be written into the output bit stream. As seen, only the encoder is required to experiment/simulate different potential modes. At the decoder side, the color_table_merge_flag and the color_table_merge_direction infer the merge decision and merge direction without requiring additional processing workload.
Color Index Map Processing
[0046] For Process 3 in FIGURE 1, for coding the color index map, a few solutions have been studied, such as RUN mode, RUN and COPY_ABOVE, and adaptive neighbor index prediction. In this disclosure, a ID string matching approach and its 2D variation is disclosed to encode the index map coding. At each position, it finds its matched point and records the matched distance and length for a ID string match, or width/height for a 2D string match. For an unmatched position, its index intensity, or delta value between its index intensity and predicted index intensity, is encoded directly.
[0047] Disclosed here is a straightforward ID search method over the color index map. Referring to FIGURE 6, an index map is parsed from a real word screen content. FIGURE 7 shows a piece of a segment after a 1-D search (i.e., just beginning of this index map) .
[0048] On top of this 1-D color index vector, string match is applied. An example of this 1-D string match is given below. For the first position of each index map, such as 14 as shown in FIGURE 7, since there is no buffered reference yet, this very first index is treated as the "unmatched pair" , where it is given -1 and 1 to its corresponding distance and length, noted as (dist, len) = (-1, 1). For the 2nd index, again another "14", it is the first index coded as reference, therefore the dist=l. Because there is another "14" at 3rd position, the length is 2, i.e., len=2, (given the every proceed index could be served as the reference immediately for the subsequent index) . Moving forward to the 4th position, encountered is the "17" which has not been seen before. Hence, it is encoded as an unmatched pair again, i.e., (dist, len) = (-1, 1). For the unmatched pair, the flag is encoded (such as the "dist == -1") and followed by the real value of the index (like first appeared "14", "17", "6" and etc) . On the other hand, for the matched pairs, the flag is still encoded (such as the "dist != -1" ), and followed by the length of the matched string.
[0049] Here is a summary for the encoding procedure using the exemplified index shown in FIGURE 7.
dist = -1, len = 1, idx=14 (unmatched)
dist= 1, len = 2 (matched)
dist = -1, len = 1, idx=17 (unmatched)
dist= 1, len = 3 (matched)
dist = -1, len = 1, idx= 6 (unmatched)
dist= 1, len = 25 (matched)
dist= 30, len = 4 (matched) /*for the "17" which appeared before*/
[0050] A pseudo code is given for this matched pair derivation, i.e.,
Void deriveMatchedPairs ( TComDataCU* pcCU, Pel* pidx, Pel* pDist, Pel* pLen, UInt uiWidth, UInt uiHeight)
{
// pidx is a idx CU bounded within uiWidth*uiHeight
UInt uiTotal = uiWidth*uiHeight ;
UInt uildx = 0;
Int j = 0;
Int len = 0; // first pixel coded as itself if there isn't left/upper buffer
pDist[uiIdx] = -1;
pLen[uiIdx] = 0;
uildx++; while (uildx < uiTotal )
{
len = 0 ;
dist = -1;
for ( j=uildx-l; j >= 0; j-- )
{
// if finding matched pair, currently exhaust search is applied
// fast string search could be applied if ( pldx[j] == pldx[uildx] )
{
for (len = 0; len < (uiTotal-uildx) ; len++
{
if ( pldx[j+len] != pldx[len+uildx] break;
}
}
if ( len > maxLen ) /*better to change with R-D decision* /
{
maxLen = len;
dist = (uildx - j ) ;
}
} pDist[uiIdx] = dist;
pLen[uiIdx] = maxLen;
uildx = uildx + maxLen;
}
The following steps are made when a 2D search variation:
Identify the location of current pixel and reference pixel as starting point,
Apply a horizontal ID string match to the right direction of current pixel and reference pixel. Maximum search length is constrained by the end of current horizontal row. Record the maximum search length as right_width
Apply a horizontal ID string match to the left direction of current pixel and reference pixel . Maximum search length is constrained by the beginning of current horizontal row. Record the maximum search length as left_width
Perform same ID string match at next row, using pixels below current pixel and reference pixel as new current pixel and reference pixel
Stop until right_width == left_width == 0.
Now for each height [n] = {1, 2, 3...}, there is a corresponding array of widt [n] { {left_width[l] , right_width[l] } , { left_width [2 ] , right_width[2] } , Ueft_width[3] , right_width[3] }...}
Define a new min_width array {{lwidthfl], rwidth[l]}, {lwidth[2], rwidth[2]}, {lwidth [3 ] , rwidth[3] }...} for each height [n] , where lwidth [n] = min ( left_width [ 1 : n-1 ] ) , rwidth[n] = min(right_width[l :n-l] ) 8. A size array{size [1] , size[2], size[3]...} is also defined, where size[n] = height [n] x (lwidth[n] +hwidth[n] )
9. Assume size[n] holds the maximum value in size array, the width and height of 2D string match is selected using the corresponding {lwidth[n] , rwidth[n], height[n]}
[0052] One way to optimize the speed of the ID or 2D search is to use running hash. A 4-pixel running hash structure is described in this disclosure. A running hash is calculated for every pixel at horizontal direction to generate a horizontal hash array running_hash_h [ ] . Another running hash is calculated on top of running_hash_h [ ] to generate a 2D hash array running_hash_hv[ ] . Each value match in this 2D hash array represents a 4x4 block match. To perform a 2D match, as many as 4x4 block matches are to be found before performing pixel-wised comparison to their neighbors . Since pixel wised comparison is limited to 1-3 pixels, the search speed can be increased dramatically.
[0053] From the above description, the matched widths of each row are different from each other, thus each row has to be processed separately. To achieve efficiency and low complexity, a block based algorithm is disclosed, which can be used in both a hardware and software implementation. Much similar to standard motion estimation, this algorithm processes one rectangle block at a time.
[0054] Take a 4x4 block as example. The basic unit in this design is called U_PIXEL, as shown in FIGURE 8. The coded signal is a flag that indicates if the reference pixel has already been encoded from previous string match operation. Optionally, the input signal Cmp[n-1] can be forced to w0", which allows removal of the last "OR" gate from U_PIXEL module.
[0055] The first step is to process each row in parallel. Each pixel in one row of the rectangle is assigned to one U_PIXEL block; this processing unit is called U_ROW. An example of the processing unit for the first row is shown in FIGURE 9.
[0056] 4 U_ROW units are needed to process this 4x4 block, as shown in FIGURE 10. Its output is an array of cmp[4] [4] .
[0057] The next step is to process each column of the cmp array in parallel. Each cmp in a column of the cmp array is processed by processing unit U_COL, as shown in FIGURE 11.
[0058] 4 U_COL units are needed to process this 4x4 block. Its output is an array of rw[4] [4] as shown in FIGURE 12.
[0059] The number of zeros in each row of rw[n] [0-3] is then counted and the 4 results are recorded to array r_width[n] . It is noted r_width[n] equals to rwidth[n] in step #7. l_width[n] is generated in the same fashion. The min_width array in step #7 can be obtained as { {l_width[l] , r_width[l]}, { l_width[2] , r_width[2]}, { l_width[3], r_width[3] }...}
[0060] This hardware architecture can be modified to fit in the parallel processing framework of any modern CPU/DSP/GPU. A simplified pseudo-code for fast software implementation is listed below.
// I. Generate array C[][]
For(y = 0; y < height; ++y)
{
For(x = 0; x < width; ++x)
{
tmpl = cur_pixel Λ ref_pixel;
tmp2 = tmpl[0] | tmpl[l] | tmpl [2] | tmpl [3] | tmpl [4] I tmpl [5] | tmpl [6] | tmpl [7];
C[y][x] = tmp2 & ( ! coded [y] [x] ) ;
}
}
/ / 2. Generate array CMP [ ] [ ]
For(y = 0; y < height; ++y) {
CMP[y] [0] = C[y] [0] ;
}
For(x = 1; x < width; ++x)
{
For(y = 0; y < height; ++y)
{
C P[y] [x] = C[y][x] | CMP[y][x-l]
}
}
// 3. Generate array RW[] [] or LW[] []
For(x = 0; x < width; ++x)
{
RW[0] [x] = CMP[0] [x] ;
}
For(y = 1; y < height; ++y)
{
For(x = 0; x < width; ++x)
{
RW[y] [x] = CMP[y] [x] | RW[y-l] [x] ;
}
}
// 4. Convert RW[][] to R_WIDTH[]
For(y = 0; y < height; ++y)
{
// count zero, or leading zero detection
R_WIDTH[y] = LZD (RW [y] [ 0 ] , RW[y][l], RW[y][2],
RW[y] [3] ) ;
} [0061] There is no data dependence in each loop, so a traditional software parallel processing method, such as loop unrolling, MMX/SSE, can be applied to increase the execution speed.
[0062] This method can also apply to a ID search if the number of rows is limited to one. A simplified pseudo-code for fast software implementation of fix length based ID search is listed below.
// 1. Generate array C[]
For (x = 0; x < width; ++x)
{
tmpl = cur_pixel Λ ref_pixel;
tmp2 = tmpl[0] | tmpl[l] | tmpl [2] | tmpl [3] | tmpl [4] I tmpl [5] I tmpl [6] | tmpl [7];
C[x] = tmp2 & ( ! coded [x] ) ;
}
/ / 2. Generate array RW [ ] or LW [ ]
If (last "OR" operation in U_PIXEL module is removed)
Assign RW[] = C[]
Else {
RW [ 0 ] = C [ 0 ] ;
For(x = 1; x < width; ++x)
{
RW [x] = C[x] I RW [x-1]
}
]
/ / 3. Convert RW [ ] [ ] to R_WIDTH [ ]
// count zero, or leading zero detection
If (last "OR" operation in U_PIXEL module is removed)
R_WIDTH = LZD(RW[0], RW[1], RW[2], RW[3]);
Else
R_WIDTH [y] = COUNT_ZERO (RW [ 0 ] , RW[1], RW[2], RW[3]); [0063] After both ID match and 2D match is completed, max (Id length, 2d (width x height) ) is chosen as the winner. If the lwidth of 2D match is non-zero, the length of the prior ID match (length = length - lwidth) needs to be adjusted to avoid the overlap between prior ID match and current 2D match. If the length of the prior ID match becomes zero after adjustment, it is removed from the match list.
[0064] The next starting location is calculated using current_location + length if the previous match is a ID match, or current_location + (lwidth+rwidth) if the previous match is a 2D match. When a ID match is performed, if any to-be-matched-pixel falls into any previous 2D match region where its location has already been covered by a 2D match, the next pixels will be scanned through until a pixel location is found where it has not been coded by previous match.
[0065] After obtaining these matched pairs, an entropy engine is applied to convert these symbols into the binary stream. Exemplified here is the idea of using the equal probability context mode. An advanced adaptive context mode could be applied as well for better compression efficiency.
// loop for each CU, uiTotal=uiWidth*uiHeight , uildx=0; while ( uildx < uiTotal) {
// *pDist: store the distance value for each matched pair
// *pldx: store the index value for each matched pair // *pLen: store the length value for each matched pair // encodeEP ( ) and encodeEPs ( ) are reusing HEVC or similar by-pass entropy coding. if (pDist [uildx] == -1 )
{
//encode one-bin with equal-probability model to indicate the //whether current pair is matched or not.
unmatchedPairFlag = TRUE;
encodeEP (unmatchedPairFlag) ;
/ /uilndexBits is controlled by the color table size
// i.e., for 24 different colors, we need 5 bits, for 8 colors, 3 bits
encodeEPs (pldx[uildx] , uilndexBits) ; uildx++ ;
}
else
{
unmatchedPairFlag= FALSE;
encodeEP (unmatchedPairFlag) ;
/*bound binarization with max possible value*/ UInt uiDistBits =0;
// offset is used to add additional references from neighboring blocks
// here, we first let offset=0;
while ( ( l<<uiDistBits ) <= (uildx+offset) )
{
uiDistBits++;
}
encodeEPs (pDist [uildx] , uiDistBits ) ;
/*bound binarization with max possible value*/ UInt uiLenBits =0;
while ( (l<<uiLenBits) <= (uiTotal-uildx) )
{
uiLenBits++;
}
encodeEPs (pLen [uildx] , uiLenBits) ; uildx += pLen[uiIdx]
[0066] Shown is the encoding procedure for each matched pair. Correspondingly, the decoding process for the matched pair is as follows .
// loop for each CU, uiTotal—uiWidth*uiHeight , uildx=0 ; while ( uildx < uiTotal) {
// *pDist: store the distance value for each matched pair
// *pldx: store the index value for each matched pair // *pLen: store the length value for each matched pair // parseEPO and parseEPs ( ) are reusing HEVC or similar by-pass entropy coding.
// parse the unmatched pair flag
parseEP (&uiUnmatchedPairFlag) ; if (uiUnmatchedPairFlag )
{
parseEPs ( uiSymbol, uilndexBits ) ;
pldx[uildx] = uiSymbol;
uildx++ ;
}
else
{
/*bound binarization with max possible value*/ UInt uiDistBits =0;
// offset is used to add additional references from neighboring blocks
// here, we first let offset=0;
while{ (l«uiDistBits) <= (uildx+offset) ) uiDistBits++ ;
UInt uiLenBits = 0 ;
while ( ( l<<uiLenBits ) <= (uiTotal-uildx) )
uiLenBits++; parseEPs ( uiSymbol, uiDistBits) ;
pDist[uiIdx] = uiSymbol;
parseEPs ( uiSymbol, uiLenBits);
pLenfuildx] = uiSymbol; for(UInt i= 0 ; i< pLenfuildx]; i++)
pldx[i+uildx] = pldx[i+uildx- pDist[uiIdx] uildx += pLen [uildx];
[ 0067 ] Note that only pixels in an unmatched position are encoded into a bit stream. To have a more accurate statistics modal, use only these pixels and their neighbors for Color palette table Derivation, instead of using all pixels in this CU.
[ 0068 ] For these index or delta output, they usually contain limited number of unique value under certain encoding mode. This disclosure introduces a second delta palette table to utilize this observation. This delta palette table can be built after all literal data are obtained in this CU, it will be signaled explicitly in the bit stream. Alternatively, it can be built adaptively during the coding process, so that the table does not have to be included in the bit stream. A delta_color_table_adaptive_flag is defined for this choice.
[ 0069 ] Another advanced scheme is provided, called Neighboring Delta Color palette table Merge. For adaptive delta palette generation, an encoder can use a delta palette from top or left CU as the initial starting point. For non-adaptive palette generation, the encoder can also use a delta palette from top or left CU and compare the RD cost among top, left and current CU.
[0070] A delta_color_table_merge_flag is defined to indicate whether a current CU uses the delta color palette table from its left or upper CU. A current CU carries the delta color palette table signaling explicitly only when de11a_co1or_tab1e_adaptive_f1ag==0 and delta_color_table_merge_flag==0 at the same time.
[0071] For a merging process, if delta_color_table_merge_flag is asserted, another delta_color_table_merge_direction is defined to indicate whether the merge candidate is from either upper or left CU.
[0072] An example of an encoding process for an adaptive delta palette generation is shown as follows. At a decoding side, whenever a decoder receives a literal data, it regenerates a delta palette based on reverse steps.
10. Define palette_table [ ] and palette_count [ ]
11. Initialize palette_table (n) = n (n = 0...255) , alternatively, it can use palette_table [ ] from top or left CU as initial value
12. Initialize palette_count (n) = 0 (n = 0...255) , alternatively, it can use palette_count [ ] from top or left CU as initial value
13. For any delta value c' :
1) Locate n so that palette_table (n) == delta c'
2) Use n as the new index of delta c'
3) ++palette_count (n)
4) Sort palette_count [ ] so that it is in descendent order
5) Sort palette_table [ ] accordingly
14. Go back to step 1 until all delta c' in current LCU are processed
[0073] For any block that includes both text and graphics, a mask flag is used to separate the text section and graphics section. The text section is compressed by the above described method; the graphics section is compressed by another compression method.
[0074] Note that because the value of any pixel covered by the mask flag has been coded by a text layer losslessly, these pixels in graphics section can be as "don't-care-pixel". When the graphics section is compressed, any arbitrary value can be assigned to a don't-care-pixel in order to obtain optimal compression efficiency.
[0075] Since the lossy part could be handled by the color palette table derivation, the index map has to be compressed losslessly. This allows the efficient processing using a ID or a 2D string match. For this disclosure, the ID or the 2D string match is constrained at current LCU, but the search window can extend beyond the current LCU. Also note that the matched distance can be encoded using a pair of motion vector in horizontal and vertical directions, i.e.,
(MVy=matched_distance/cuWidth, MVy=matched_distance-cuWidth*MVy) .
[0076] Given that image would have a different spatial texture orientation at local regions, the ID search can be allowed in either horizontal or vertical directions by defining the color_idx_map_pred_direction indicator. The optimal index scanning direction can be made based on the R-D cost. FIGURE 6 shows the scanning directions, starting from the very first position. Further illustrated is the horizontal and vertical scanning pattern in FIGURE 9. Consider an 8x8 CU as an example. The deriveMatchPairs ( ) and associated entropy coding steps are performed twice for both the horizontal and the vertical scanning pattern. Then, the final scanning direction is chosen with the smallest RD cost.
Improved Binarization
[0077] As shown in FIGURE 13, the color palette table and a pair of matched information for the color index map are encoded. They are encoded using fixed length binarization. Alternatively, variable-length binarization can be used.
[0078] For example, as for the color palette table encoding, the table can have 8 different color values. Therefore, it only contains 8 different indices in the color index map. Instead of using fixed 3 bins to encode every index value equally, just one bit can be used to represent the background pixel, for instance 0. Then, the rest of 7 pixel values use a fixed-length codeword, such as 1000, 1001, 1010, 1011, 1100, 1101, and 1110 to encode the color index. This is based on the fact that the background color may occupy the largest percentile and therefore a special codeword for it saves the total bins . This scenario happens commonly for screen content. Consider a 16x16 CU, for fixed 3-bin binarization, it requires 3x16x16=768 bins. Also, let 0 index be background color, occupying 40%, while other colors are equally distributed. In this case, it only requires 2.8xl6xl6<768 bins.
[0079] For the matched pair encoding, the max value can be used to bound its binarization, given the constrained implementation of this approach within the area of the current CU. Mathematically, the matched distance and length could be as long as 64x64=4K in each case. However, this wouldn't be happening jointly. For every matched position, the matched distance is bounded by the distance between current position and the very first position in the reference buffer (such as the first position in current CU as an example), for instance L. Therefore, the maximum bins for this distance binarization is log2(L)+l (instead of fixed length), and the maximum bins for the length binarization is log2 (cuSize-L) +1 with cuSize=cuWidth*cuHeight .
[0080] In addition to the color palette table and index map, the residual coding could be significantly improved by a different binarization method. As for HEVC RExt and HEVC version, transform coefficient is binarization using the variable length codes at the assumption that the residual magnitude should be small after prediction, transform and quantization. However, after introducing the transform skip, especially for the transform skip on the screen content with distinct color, there commonly exists residuals with larger and random value (not close to "1", "2", "0" relative smaller value). If the current HEVC coefficients binarization are used, it turns out to yield a very long code word. Alternatively, using the fixed length binarization saves the code length for the residuals produced by the color palette table and index coding mode.
[0081] Adaptive chroma sampling for mixed content
[0082] The foregoing provides various techniques for high- efficiency screen content coding under the framework of the HEVC/HE C-RExt . In practice, in addition to pure screen content (such as text, graphics) or pure natural video, there is also content containing both screen material and camera-captured natural video -- called mixed content. Currently, mixed content is treated with 4:4:4 chroma sampling. However, for the embedded camera-captured natural video portion in such mixed content, the 4:2:0 chroma sampling may be sufficient to provide perceptual lossless quality. This is due to the fact that the human vision system is less sensitive to the spatial changes in chroma components compared with that from the luma components. Hence, sub-sampling typically is performed on the chroma part (e.g., the popular 4:2:0 video format) to achieve noticeable bit rate reduction while maintaining same reconstructed quality.
[0083] The present disclosure provides a new flag (i.e., enable_chroma_subsampling) that is defined and signaled at the CU level recursively. For each CU, the encoder determines whether it is being coded using 4:2:0 or 4:4:4 according to the rate- distortion cost.
[0084] Shown in FIGURE 14A and FIGURE 14B are the 4:2:0 and 4:4:4 chroma sampling formats. [0085] At the encoder side, for each CU, assuming the input is 4:4:4 source shown above, the rate-distortion cost is derived directly using the 4:4:4 encoding procedure with enable_chroma_subsampling = 0 or FALSE. Then, the process sub- samples 4:4:4 samples to 4:2:0 to derive its bit consumption. The reconstructed 4:2:0 format is interpolated back to the 4:4:4 format for distortion measurement (using SSE/SAD) . Together with the bit consumption, the rate-distortion cost is derived when encoding the CU at 4:2:0 space and comparing it with the cost when encoding the CU at 4:4:4. Whichever encoding gives the less rate-distortion cost will be chosen for the final encoding.
[0086] Illustrated in FIGURE 15 is the interpolation process from 4:4:4 to 4:2:0 and vice versa. Usually this video color sampling format conversion process requires a large number of interpolation filters.
[0087] To reduce the implementation complexity, an HEVC interpolation filter (i.e., DCT-IF) may be utilized. As shown in FIGURE 15, the "squared box" represents the original 4:4:4 samples. From 4:4:4 to 4:2:0, the half-pel pixels ("circle") are interpolated using DCT-IF vertically for the chroma components. Also shown are the quarter-pel positions ("diamond") for illustration purposes. The grey shaded "circles" are picked to form the 4:2:0 samples. For the interpolation from 4:2:0 to 4:4:4, the process starts with the grey "circles" in the chroma components, the half-pel positions are interpolated horizontally to obtain all "circles," and then the "squared box" are interpolated using DCT-IF vertically. All the interpolated "squared box" are chosen to form the reconstructed 4:4:4 source.
[0088] Encoder Control
[0089] As discussed in the previous sections, disclosed are flags to control the low-level processing. For instance, enable_packed_component_flag is used to indicate whether current CU uses its packed format or conventional planar format for encoding the processing. Whether to enable a packed format could depend on the R-D cost calculated at the encoder. For a practical encoder implementation, a low-complexity solution is achieved by analyzing the histogram of the CU and finding the best threshold for the decision, as shown in FIGURE 3.
[0090] The size of the color palette table has a direct impact on the complexity. maxColorNum is introduced to control the trade-off between complexity and coding efficiency. The most straightforward way is choosing the one yielding the least R-D cost.
[0091] Index map encoding direction could be determined by the R-D optimization, or using the local spatial orientation (such as sobel operator based direction estimation) .
[0092] This disclosure limits the processing within every CTU/CU. In practice, this constraint can be relaxed. For example, for a color index map processing, the line buffer from its upper and left CU can be used, as shown in the FIGURE 16. With an upper and a left buffer, the search can be extended to further improve the coding efficiency. Given that upper/left buffers are formed using the reconstructed pixels from neighboring CUs, these pixels (as well as its corresponding indices) are available for reference before processing current CU index map. For instance, after re-ordering, the current CU index map could be 14, 14, 14, 1, 2, 1 (as ID presentation) . Without a line buffer reference, the first "14" will be coded as an unmatched pair. However, with a neighboring line buffer, the string match can start at the very first pixel, as shown below (horizontal and vertical scanning patterns are shown as well) .
[0093] Decoder Syntax
[0094] The following information can be used to describe the decoder shown in FIGURE 2. The syntax of this disclosure is aligned with a committee draft of HEVC RExt.
[0095] 7.3.5.8 Coding unit syntax: coding_unit ( xO, yO, log2CbSize ) { Descriptor if( transquant_bypass_enabled_flag )
cu_transquant_bypass_flag ae (v) if( slice_type != I )
cu_skip_flag [ xO ] [ yO ] ae (v) nCbS = ( 1 « log2CbSize )
if ( cu_skip_flag [ xO ] [ yO ] )
prediction_unit ( xO, yO, nCbS, nCbS )
else {
if( intra_block_copy_enabled_flag )
intra_bc_flag [ xO ] [ yO ] ae (v) if( color_table_enabled_flag )
color_table_flag [ xO ] [ yO ] ae (v) if( delta_color_table_enabled_flag )
delta_color_table_flag [ xO ] [ yO ] ae (v) if( ! intra_bc_flag [ xO ] [ yO ] ) {
if( slice_type != I )
pred_mode_flag ae (v) if( CuPredMode[ xO ] [ yO ] != M0DE_INTRA log2CbSize = = MinCbLog2SizeY )
part_mode ae (v)
}
if( CuPredMode [ xO ] [ yO ] = = MODE_INTRA
) {
if( PartMode = = PART_2Nx2N &&
pcm_enabled_flag && ! intra_bc_flag
log2CbSize >= Log2MinIpcmCbSizeY &&
log2CbSize <= Log2MaxIpcmCbSizeY )
pcm_flag[ xO ] [ yO ] ae (v) if( pcm_flag[ xO ] [ yO ] ) {
while ( !byte_aligned( ) ) pcm_alignment_zero_bit f (1) pcm_sample( xO, yO, log2CbSize )
} else if ( intra_bc_flag [ xO ] [ yO ] ) {
mvd_coding( xO, yO, 2)
} else if ( color_table_flag [xO ] [yO] | |
delta_color_table_flag [xO ] [yO]) {
enable_packed_component_flag ae (v) if (color_table_flag[xO] [yO] ) {
color_table_merge_flag ae (v) if (color_table_merge_flag) {
co1or_tab1e_merge_idx ae (v)
}else{
color_table_size ae (v) for ( i=0 ; i<
color_table_size; i++)
color_table_entry [ i ] ae (v)
}
color_idx_map_pred_direction ae (v)
}
if (delta_color_table_flag[xO] [yO] ) {
de11a_co1or_tab1e_adaptive_flag ae (v) delta_color_table_merge_flag ae (v) if (delta_color_table_merge_flag) { delta_color_table_merge_idx ae (v)
}else if
{ ! delta_color_table_adaptive_flag) {
delta_color_table_size ae (v) for ( i=0 ; i<
delta_color_table_size; i++)
delta_color_table_entry [ i ] ae (v)
}
} Pos=0; cuWidth=l«log2CbSize;
cuHeight=l<<log2CbSize;
while (Pos<cuWidth*cuHeight) {
matched_flag ae (v) if (matched_flag ) {
matched_distance /*MVx, MVy*/ ae (v) matched_length ae (v)
}else{
index_delta ae (v)
}
}
} else {
pbOffset = ( PartMode = = PARTjSTx )
? ( nCbS 1 2 ) : nCbS
[0096] FIGURE 17 illustrates the apparatus and methods/flows incorporated into the current HEVC .
[0097] The above identified methods /flows and devices may be incorporated into a wireless or wired, or combination thereof, communications network and implemented in devices, such as that described below, and in the drawings below:
[0098] FIGURE 18 illustrates an example communication system 100 that uses signaling to support advanced wireless receivers according to this disclosure. In general, the system 100 enables multiple wireless users to transmit and receive data and other content. The system 100 may implement one or more channel access methods, such as code division multiple access (CDMA) , time division multiple access (TDMA) , frequency division multiple access (FDMA) , orthogonal FDMA (OFDMA) , or single-carrier FDMA (SC-FDMA) . [0099] In this example, the communication system 100 includes user equipment (UE) llOa-llOc, radio access networks (RANs) 120a- 120b, a core network 130, a public switched telephone network (PSTN) 140, the Internet 150, and other networks 160. While certain numbers of these components or elements are shown in FIGURE 18, any number of these components or elements may be included in the system 100.
[00100] The UEs llOa-llOc are configured to operate and/or communicate in the system 100. For example, the UEs llOa-llOc are configured to transmit and/or receive wireless signals or wired signals. Each UE llOa-llOc represents any suitable end user device and may include such devices (or may be referred to) as a user equipment/device (UE) , wireless transmit/receive unit (WTRU) , mobile station, fixed or mobile subscriber unit, pager, cellular telephone, personal digital assistant (PDA) , smartphone, laptop, computer, touchpad, wireless sensor, or consumer electronics device.
[00101] The RANs 120a-120b here include base stations 170a- 170b, respectively. Each base station 170a-170b is configured to wirelessly interface with one or more of the UEs llOa-llOc to enable access to the core network 130, the PSTN 140, the Internet 150, and/or the other networks 160. For example, the base stations 170a-170b may include (or be) one or more of several well-known devices, such as a base transceiver station (BTS) , a Node-B (NodeB) , an evolved NodeB (eNodeB) , a Home NodeB, a Home eNodeB, a site controller, an access point (AP) , or a wireless router, or a server, router, switch, or other processing entity with a wired or wireless network.
[00102] In the embodiment shown in FIGURE 18, the base station 170a forms part of the RAN 120a, which may include other base stations, elements, and/or devices. Also, the base station 170b forms part of the RAN 120b, which may include other base stations, elements, and/or devices. Each base station 170a-170b operates to transmit and/or receive wireless signals within a particular geographic region or area, sometimes referred to as a "cell." In some embodiments, multiple-input multiple-output (MIMO) technology may be employed having multiple transceivers for each cell.
[00103] The base stations 170a-170b communicate with one or more of the UEs llOa-llOc over one or more air interfaces 190 using wireless communication links. The air interfaces 190 may utilize any suitable radio access technology.
[00104] It is contemplated that the system 100 may use multiple channel access functionality, including such schemes as described above. In particular embodiments, the base stations and UEs implement LTE, LTE-A, and/or LTE-B. Of course, other multiple access schemes and wireless protocols may be utilized.
[00105] The RANs 120a-120b are in communication with the core network 130 to provide the UEs llOa-llOc with voice, data, application, Voice over Internet Protocol (VoIP) , or other services. Understandably, the RANs 120a-120b and/or the core network 130 may be in direct or indirect communication with one or more other RANs (not shown) . The core network 130 may also serve as a gateway access for other networks (such as PSTN 140, Internet 150, and other networks 160) . In addition, some or all of the UEs llOa-llOc may include functionality for communicating with different wireless networks over different wireless links using different wireless technologies and/or protocols.
[00106] Although FIGURE 18 illustrates one example of a communication system, various changes may be made to FIGURE 18. For example, the communication system 100 could include any number of UEs, base stations, networks, or other components in any suitable configuration, and can further include the EPC illustrated in any of the figures herein.
[00107] FIGURES 19A and 19B illustrate example devices that may implement the methods and teachings according to this disclosure. In particular, FIGURE 19A illustrates an example UE 110, and FIGURE 19B illustrates an example base station 170. These components could be used in the system 100 or in any other suitable system.
[00108] As shown in FIGURE 19A, the UE 110 includes at least one processing unit 200. The processing unit 200 implements various processing operations of the UE 110. For example, the processing unit 200 could perform signal coding, data processing, power control, input/output processing, or any other functionality enabling the UE 110 to operate in the system 100. The processing unit 200 also supports the methods and teachings described in more detail above. Each processing unit 200 includes any suitable processing or computing device configured to perform one or more operations. Each processing unit 200 could, for example, include a microprocessor, microcontroller, digital signal processor, field programmable gate array, or application specific integrated circuit.
[00109] The UE 110 also includes at least one transceiver 202. The transceiver 202 is configured to modulate data or other content for transmission by at least one antenna 204. The transceiver 202 is also configured to demodulate data or other content received by the at least one antenna 204. Each transceiver 202 includes any suitable structure for generating signals for wireless transmission and/or processing signals received wirelessly. Each antenna 204 includes any suitable structure for transmitting and/or receiving wireless signals. One or multiple transceivers 202 could be used in the UE 110, and one or multiple antennas 204 could be used in the UE 110. Although shown as a single functional unit, a transceiver 202 could also be implemented using at least one transmitter and at least one separate receiver.
[00110] The UE 110 further includes one or more input/output devices 206. The input/output devices 206 facilitate interaction with a user. Each input/output device 206 includes any suitable structure for providing information to or receiving information from a user, such as a speaker, microphone, keypad, keyboard, display, or touch screen.
[00111] In addition, the UE 110 includes at least one memory 208. The memory 208 stores instructions and data used, generated, or collected by the UE 110. For example, the memory 208 could store software or firmware instructions executed by the processing unit(s) 200 and data used to reduce or eliminate interference in incoming signals. Each memory 208 includes any suitable volatile and/or non-volatile storage and retrieval device (s). Any suitable type of memory may be used, such as random access memory (RAM) , read only memory (ROM) , hard disk, optical disc, subscriber identity module (SIM) card, memory stick, secure digital (SD) memory card, and the like.
[00112] As shown in FIGURE 19B, the base station 170 includes at least one processing unit 250, at least one transmitter 252, at least one receiver 254, one or more antennas 256, and at least one memory 258. The processing unit 250 implements various processing operations of the base station 170, such as signal coding, data processing, power control, input/output processing, or any other functionality. The processing unit 250 can also support the methods and teachings described in more detail above. Each processing unit 250 includes any suitable processing or computing device configured to perform one or more operations. Each processing unit 250 could, for example, include a microprocessor, microcontroller, digital signal processor, field programmable gate array, or application specific integrated circuit .
[00113] Each transmitter 252 includes any suitable structure for generating signals for wireless transmission to one or more UEs or other devices. Each receiver 254 includes any suitable structure for processing signals received wirelessly from one or more UEs or other devices. Although shown as separate components, at least one transmitter 252 and at least one receiver 254 could be combined into a transceiver. Each antenna 256 includes any- suitable structure for transmitting and/or receiving wireless signals. While a common antenna 256 is shown here as being coupled to both the transmitter 252 and the receiver 254, one or more antennas 256 could be coupled to the transmitter (s) 252, and one or more separate antennas 256 could be coupled to the receiver (s) 254. Each memory 258 includes any suitable volatile and/or non-volatile storage and retrieval device(s).
[00114] Additional details regarding UEs 110 and base stations 170 are known to those of skill in the art. As such, these details are omitted here for clarity.
[00115] It may be advantageous to set forth definitions of certain words and phrases used throughout this patent document. The terms "include" and "comprise, " as well as derivatives thereof, mean inclusion without limitation. The term "or" is inclusive, meaning and/or. The phrases "associated with" and "associated therewith, " as well as derivatives thereof, mean to include, be included within, interconnect with, contain, be contained within, connect to or with, couple to or with, be communicable with, cooperate with, interleave, juxtapose, be proximate to, be bound to or with, have, have a property of, or the like.
[00116] While this disclosure has described certain embodiments and generally associated methods, alterations and permutations of these embodiments and methods will be apparent to those skilled in the art. Accordingly, the above description of example embodiments does not define or constrain this disclosure. Other changes, substitutions, and alterations are also possible without departing from the spirit and scope of this disclosure, as defined by the following claims.

Claims

WHAT IS CLAIMED
1. A method for coding screen content into a bitstream, the method comprising:
selecting a color palette table for a coding unit (CU) of screen content;
creating a color index map having indices for the CU using the selected color palette table; and
encoding the selected color palette table and the color index map for the CU into a bitstream.
2. The method as specified in Claim 1 wherein the method is processed using a planar color format or an interleaved color format .
3. The method as specified in Claim 2 wherein the method is processed at a level selected from the group of: a CU level, a slice level, a picture level or a sequence level.
4. The method as specified in Claim 1 wherein the color palette table is derived from the CU or from a neighboring CU.
5. The method as specified in Claim 4 wherein the color palette table is derived from a neighboring CU using a reconstructed CU in a pixel domain.
6. The method as specified in Claim 5 further comprising generating a color palette table of the neighboring CU based on available reconstructed pixels, regardless of the CU depth and
7. The method as specified in Claim 4 further comprising generating a color palette table of the neighboring CU, wherein the neighboring CU is encoded using a color palette mode.
8. The method as specified in Claim 4 wherein the neighboring CU is not encoded using a color palette mode, and a neighboring CU color palette table is propagated from a previous CU which is encoded using a color palette mode.
9. The method as specified in Claim 1 wherein the color palette table is derived in a pixel domain at a decoder, wherein the encoded bitstream is parsed to reconstruct, for the CU, the color palette table and the color index map.
10. The method as specified in Claim 6 wherein a pixel value is derived at each position in the CU by combing the color index, and the color palette table.
11. The method as specified in Claim 1 further comprising classifying colors or pixel values of the CU based on a previous derived color palette table for corresponding indices .
12. The method as specified in Claim 1 further comprising writing new syntax elements into the encoded bitstream.
13. The method as specified in Claim 1 wherein the color palette table is generated and ordered according to a histogram, or its actual color intensity.
14. The method as specified in Claim 1 wherein each pixel of the CU is converted into a color index within the color palette table.
15. The method as specified in Claim 1 wherein a flag is defined for the CU to indicate whether the CU is processed using a packed fashion or a planar mode.
16. The method as specified in Claim 1 wherein the color palette table is processed by encoding a size of the color palette table and each color in the color palette table.
17. The method as specified in Claim 1 further comprising generating a flag indicating the CU uses the color palette table form its left or upper CU.
18. The method as specified in Claim 1 wherein the index map is encoded using a string match selected from the group comprising: one dimensional (1-D) string match, a hybrid 1-D string match, and a two dimensional (2-D) string match,
wherein the string match is signaled using matched pairs.
19. The method as specified in Claim 17 wherein the string match is performed using a running hash method.
20. The method as specified in Claim 1 wherein a two dimensional (2D) search method is performed over the color index map by identifying a location of a current pixel and a reference pixel in the CU as a starting point.
21. The method as specified in Claim 1 wherein the CU has a 4:4:4 format processed using a down-sampled 4:2:0 sampling format .
22. The method as specified in Claim 21 wherein the down- sampled format is processed at a level selected from the group of: a CU level, a slice level, a picture level or a sequence level .
23. A processor for coding screen content into a bitstream, the processor configured to:
select a color palette table for a coding unit (CU) of screen content;
create a color index map having indices for the CU using the selected color palette table; and
encode the selected color palette table and the color index map for the CU into a bitstream.
24. The processor as specified in Claim 23 wherein the color palette table is derived from the CU or from a neighboring CU.
25. The method as specified in Claim 24 wherein the color palette table is derived from a neighboring CU using a reconstructed CU in a pixel domain.
PCT/US2014/067155 2013-11-22 2014-11-24 Advanced screen content coding solution WO2015077720A1 (en)

Priority Applications (13)

Application Number Priority Date Filing Date Title
MX2016006612A MX362406B (en) 2013-11-22 2014-11-24 Advanced screen content coding solution.
NZ720776A NZ720776A (en) 2013-11-22 2014-11-24 Advanced screen content coding solution
RU2016124544A RU2646355C2 (en) 2013-11-22 2014-11-24 Solution for improved coding of screen content
CN201480063141.6A CN105745671B (en) 2013-11-22 2014-11-24 Level screen content scrambling scheme
BR112016011471-0A BR112016011471B1 (en) 2013-11-22 2014-11-24 METHOD AND SYSTEM FOR ENCODING SCREEN CONTENT IN A BITS STREAM
JP2016533032A JP6294482B2 (en) 2013-11-22 2014-11-24 Advanced screen content coding solution
AU2014352656A AU2014352656B2 (en) 2013-11-22 2014-11-24 Advanced screen content coding solution
CA2931386A CA2931386C (en) 2013-11-22 2014-11-24 Advanced screen content coding solution
EP14864463.6A EP3063703A4 (en) 2013-11-22 2014-11-24 Advanced screen content coding solution
KR1020167016238A KR101972936B1 (en) 2013-11-22 2014-11-24 Advanced screen content coding solution
UAA201606679A UA118114C2 (en) 2013-11-22 2014-11-24 Advanced screen content coding solution
IL245752A IL245752B (en) 2013-11-22 2016-05-19 Advanced screen content coding solution
HK16108372.3A HK1220531A1 (en) 2013-11-22 2016-07-15 Advanced screen content coding solution

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201361907903P 2013-11-22 2013-11-22
US61/907,903 2013-11-22
US14/549,405 US10291827B2 (en) 2013-11-22 2014-11-20 Advanced screen content coding solution
US14/549,405 2014-11-20

Publications (1)

Publication Number Publication Date
WO2015077720A1 true WO2015077720A1 (en) 2015-05-28

Family

ID=53180267

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2014/067155 WO2015077720A1 (en) 2013-11-22 2014-11-24 Advanced screen content coding solution

Country Status (14)

Country Link
US (1) US10291827B2 (en)
EP (1) EP3063703A4 (en)
JP (1) JP6294482B2 (en)
KR (1) KR101972936B1 (en)
CN (1) CN105745671B (en)
AU (1) AU2014352656B2 (en)
CA (1) CA2931386C (en)
CL (1) CL2016001224A1 (en)
HK (1) HK1220531A1 (en)
MX (1) MX362406B (en)
NZ (1) NZ720776A (en)
RU (1) RU2646355C2 (en)
UA (1) UA118114C2 (en)
WO (1) WO2015077720A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016115343A3 (en) * 2015-01-14 2016-10-13 Vid Scale, Inc. Palette coding for non-4:4:4 screen content video
WO2016197392A1 (en) * 2015-06-12 2016-12-15 Mediatek Singapore Pte. Ltd. Improvements for non-local index prediction
EP3055830A4 (en) * 2014-03-21 2017-02-22 Huawei Technologies Co., Ltd. Advanced screen content coding with improved color table and index map coding methods
CN107710760A (en) * 2015-10-15 2018-02-16 富士通株式会社 Method for encoding images, device and image processing equipment
US10091512B2 (en) 2014-05-23 2018-10-02 Futurewei Technologies, Inc. Advanced screen content coding with improved palette table and index map coding methods
US10291827B2 (en) 2013-11-22 2019-05-14 Futurewei Technologies, Inc. Advanced screen content coding solution
GB2539486B (en) * 2015-06-18 2019-07-31 Gurulogic Microsystems Oy Encoder, decoder and method employing palette compression

Families Citing this family (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BR122019025407B8 (en) * 2011-01-13 2023-05-02 Canon Kk IMAGE CODING APPARATUS, IMAGE CODING METHOD, IMAGE DECODING APPARATUS, IMAGE DECODING METHOD AND STORAGE MEDIA
KR102257269B1 (en) 2013-10-14 2021-05-26 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 Features of intra block copy prediction mode for video and image coding and decoding
CN105659602B (en) 2013-10-14 2019-10-08 微软技术许可有限责任公司 Coder side option for the intra block duplication prediction mode that video and image encode
KR102275639B1 (en) 2013-10-14 2021-07-08 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 Features of base color index map mode for video and image coding and decoding
US10321141B2 (en) 2013-12-18 2019-06-11 Hfi Innovation Inc. Method and apparatus for palette initialization and management
CA2934116C (en) 2013-12-18 2019-07-30 Tzu-Der Chuang Method and apparatus for palette table prediction
WO2015091879A2 (en) * 2013-12-19 2015-06-25 Canon Kabushiki Kaisha Improved encoding process using a palette mode
CN105981388B (en) 2013-12-27 2019-05-10 寰发股份有限公司 The method and apparatus that syntax redundancy removes in palette coding
CN110225345B (en) 2013-12-27 2022-07-19 寰发股份有限公司 Method and apparatus for primary color index map coding
US10182242B2 (en) * 2013-12-27 2019-01-15 Mediatek Inc. Method and apparatus for palette coding with cross block prediction
WO2015103496A2 (en) * 2014-01-02 2015-07-09 Vid Scale, Inc. Two-demensional palette coding for screen content coding
US10390034B2 (en) 2014-01-03 2019-08-20 Microsoft Technology Licensing, Llc Innovations in block vector prediction and estimation of reconstructed sample values within an overlap area
US10469863B2 (en) * 2014-01-03 2019-11-05 Microsoft Technology Licensing, Llc Block vector prediction in video and image coding/decoding
EP3061247A1 (en) * 2014-01-07 2016-08-31 MediaTek Inc. Method and apparatus for color index prediction
US11284103B2 (en) 2014-01-17 2022-03-22 Microsoft Technology Licensing, Llc Intra block copy prediction with asymmetric partitions and encoder-side search patterns, search ranges and approaches to partitioning
US10542274B2 (en) * 2014-02-21 2020-01-21 Microsoft Technology Licensing, Llc Dictionary encoding and decoding of screen content
US20150264345A1 (en) * 2014-03-13 2015-09-17 Mitsubishi Electric Research Laboratories, Inc. Method for Coding Videos and Pictures Using Independent Uniform Prediction Mode
CN106576152A (en) * 2014-03-13 2017-04-19 华为技术有限公司 Improved method for screen content coding
KR102319384B1 (en) * 2014-03-31 2021-10-29 인텔렉추얼디스커버리 주식회사 Method and apparatus for intra picture coding based on template matching
CN105323583B (en) * 2014-06-13 2019-11-15 财团法人工业技术研究院 Encoding method, decoding method, encoding/decoding system, encoder and decoder
WO2015192353A1 (en) 2014-06-19 2015-12-23 Microsoft Technology Licensing, Llc Unified intra block copy and inter prediction modes
US9906799B2 (en) 2014-06-20 2018-02-27 Qualcomm Incorporated Copy from previous rows for palette mode coding
US9955157B2 (en) 2014-07-11 2018-04-24 Qualcomm Incorporated Advanced palette prediction and signaling
US9544607B2 (en) * 2014-08-25 2017-01-10 Hfi Innovation Inc. Method of palette index signaling for image and video coding
EP3202150B1 (en) 2014-09-30 2021-07-21 Microsoft Technology Licensing, LLC Rules for intra-picture prediction modes when wavefront parallel processing is enabled
CN107005707A (en) * 2014-10-31 2017-08-01 三星电子株式会社 Method and apparatus for being encoded or being decoded to image
JP6122516B2 (en) 2015-01-28 2017-04-26 財團法人工業技術研究院Industrial Technology Research Institute Encoding method and encoder
US10448058B2 (en) * 2015-05-21 2019-10-15 Qualcomm Incorporated Grouping palette index at the end and index coding using palette size and run value
CN107637057A (en) * 2015-06-03 2018-01-26 联发科技股份有限公司 The palette decoding method of image and video data
CN106664405B (en) 2015-06-09 2020-06-09 微软技术许可有限责任公司 Robust encoding/decoding of escape-coded pixels with palette mode
US10148977B2 (en) 2015-06-16 2018-12-04 Futurewei Technologies, Inc. Advanced coding techniques for high efficiency video coding (HEVC) screen content coding (SCC) extensions
GB2542858A (en) * 2015-10-02 2017-04-05 Canon Kk Encoder optimizations for palette encoding of content with subsampled colour component
JP6593122B2 (en) * 2015-11-20 2019-10-23 富士通株式会社 Moving picture coding apparatus, moving picture coding method, and program
CN107071450B (en) * 2016-02-10 2021-07-27 同济大学 Coding and decoding method and device for data compression
US10986349B2 (en) 2017-12-29 2021-04-20 Microsoft Technology Licensing, Llc Constraints on locations of reference blocks for intra block copy prediction
US10949087B2 (en) 2018-05-15 2021-03-16 Samsung Electronics Co., Ltd. Method for rapid reference object storage format for chroma subsampled images
US11449256B2 (en) 2018-05-15 2022-09-20 Samsung Electronics Co., Ltd. Method for accelerating image storing and retrieving differential latency storage devices based on access rates
US11265579B2 (en) 2018-08-01 2022-03-01 Comcast Cable Communications, Llc Systems, methods, and apparatuses for video processing
US10848787B2 (en) * 2018-08-28 2020-11-24 Google Llc Lossy image compression using palettization of locally mixed colors
CN109819254B (en) * 2019-01-31 2022-05-03 深圳市战音科技有限公司 Lossy image compression transmission method and system
CN116684583A (en) 2019-08-26 2023-09-01 Lg电子株式会社 Decoding device, encoding device, and data transmitting device
WO2021040402A1 (en) * 2019-08-26 2021-03-04 엘지전자 주식회사 Image or video coding based on palette coding
US20220286700A1 (en) * 2019-08-26 2022-09-08 Lg Electronics Inc. Image or video coding based on palette escape coding
CN114375581A (en) * 2019-09-12 2022-04-19 字节跳动有限公司 Use of palette predictor in video coding
JP2022549011A (en) * 2019-09-24 2022-11-22 華為技術有限公司 Picture header signaling in video coding
US11076151B2 (en) * 2019-09-30 2021-07-27 Ati Technologies Ulc Hierarchical histogram calculation with application to palette table derivation
CN110996127B (en) * 2019-11-25 2022-12-09 西安万像电子科技有限公司 Image encoding and decoding method, device and system
CN111246208B (en) * 2020-01-22 2022-04-08 北京字节跳动网络技术有限公司 Video processing method and device and electronic equipment
US11792408B2 (en) 2020-03-30 2023-10-17 Alibaba Group Holding Limited Transcoder target bitrate prediction techniques
US11470327B2 (en) 2020-03-30 2022-10-11 Alibaba Group Holding Limited Scene aware video content encoding
US11386873B2 (en) 2020-04-01 2022-07-12 Alibaba Group Holding Limited Method and apparatus for efficient application screen compression
US11575916B2 (en) * 2020-10-30 2023-02-07 Advanced Micro Devices, Inc. Top palette colors selection using sorting for palette mode in video encoding
US11463716B2 (en) 2021-02-25 2022-10-04 Qualcomm Incorporated Buffers for video coding in palette mode
CN113192148B (en) * 2021-04-12 2023-01-03 中山大学 Attribute prediction method, device, equipment and medium based on palette
WO2023200302A1 (en) * 2022-04-14 2023-10-19 주식회사 케이티 Image encoding/decoding method and apparatus

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5848195A (en) * 1995-12-06 1998-12-08 Intel Corporation Selection of huffman tables for signal encoding
US20070116370A1 (en) 2002-06-28 2007-05-24 Microsoft Corporation Adaptive entropy encoding/decoding for screen capture content
US20090010533A1 (en) * 2007-07-05 2009-01-08 Mediatek Inc. Method and apparatus for displaying an encoded image
US20120275697A1 (en) * 2006-02-23 2012-11-01 Microsoft Corporation Pre-processing of image data for enhanced compression
US20130114893A1 (en) * 2011-11-03 2013-05-09 Google Inc. Image Compression Using Sub-Resolution Images

Family Cites Families (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5484416A (en) * 1977-12-19 1979-07-05 Ricoh Co Ltd Meothod and device for transmission and reception of telautogram information
US5463702A (en) 1992-05-12 1995-10-31 Sony Electronics Inc. Perceptual based color-compression for raster image quantization
US5930390A (en) * 1996-03-28 1999-07-27 Intel Corporation Encoding/decoding signals using a remap table
JPH11161782A (en) * 1997-11-27 1999-06-18 Seiko Epson Corp Method and device for encoding color picture, and method and device for decoding color picture
US6597812B1 (en) * 1999-05-28 2003-07-22 Realtime Data, Llc System and method for lossless data compression and decompression
US6522783B1 (en) * 1999-11-23 2003-02-18 Sharp Laboratories Of America, Inc. Re-indexing for efficient compression of palettized images
US6674479B2 (en) * 2000-01-07 2004-01-06 Intel Corporation Method and apparatus for implementing 4:2:0 to 4:2:2 and 4:2:2 to 4:2:0 color space conversion
US7162077B2 (en) 2001-10-19 2007-01-09 Sharp Laboratories Of America, Inc. Palette-based image compression method, system and data file
US6898313B2 (en) 2002-03-06 2005-05-24 Sharp Laboratories Of America, Inc. Scalable layered coding in a multi-layer, compound-image data transmission system
US7120297B2 (en) * 2002-04-25 2006-10-10 Microsoft Corporation Segmented layered image system
US7343037B1 (en) 2004-02-04 2008-03-11 Microsoft Corporation Dynamic, locally-adaptive, lossless palettization of color and grayscale images
JP4515832B2 (en) 2004-06-14 2010-08-04 オリンパス株式会社 Image compression apparatus and image restoration apparatus
EP2320380B1 (en) 2004-07-08 2014-11-12 Telefonaktiebolaget L M Ericsson (Publ) Multi-mode image processing
US7468733B2 (en) 2004-10-06 2008-12-23 Microsoft Corporation Method and system for improving color reduction
CN101233542B (en) 2005-05-27 2013-01-23 艾利森电话股份有限公司 Weight based image processing
JP2007108877A (en) 2005-10-11 2007-04-26 Toshiba Corp Information management system and information display device
JP4367418B2 (en) 2006-01-20 2009-11-18 セイコーエプソン株式会社 Print control device
US8130317B2 (en) * 2006-02-14 2012-03-06 Broadcom Corporation Method and system for performing interleaved to planar transformation operations in a mobile terminal having a video display
JP4816262B2 (en) 2006-06-06 2011-11-16 ソニー株式会社 Playback apparatus, playback method, and playback program
WO2009002603A1 (en) 2007-06-25 2008-12-31 L-3 Communications Avionics Systems, Inc. Systems and methods for generating, storing and using electronic navigation charts
TWI452537B (en) 2008-02-21 2014-09-11 Generalplus Technology Inc Method for processing image, methods for applying to digital photo frame and interaction image process
US8326067B2 (en) * 2009-02-27 2012-12-04 Research In Motion Limited Optimization of image encoding using perceptual weighting
JP5052569B2 (en) 2009-06-25 2012-10-17 シャープ株式会社 Image compression apparatus, image compression method, image expansion apparatus, image expansion method, image forming apparatus, computer program, and recording medium
CN105959688B (en) 2009-12-01 2019-01-29 数码士有限公司 Method for decoding high resolution image
KR101682147B1 (en) * 2010-04-05 2016-12-05 삼성전자주식회사 Method and apparatus for interpolation based on transform and inverse transform
US8600158B2 (en) 2010-11-16 2013-12-03 Hand Held Products, Inc. Method and system operative to process color image data
KR101506446B1 (en) 2010-12-15 2015-04-08 에스케이 텔레콤주식회사 Code Motion Information Generating/Motion Information Reconstructing Method and Apparatus Using Motion Information Merge and Image Encoding/Decoding Method and Apparatus Using The Same
EP2745290A1 (en) 2011-09-27 2014-06-25 Koninklijke Philips N.V. Apparatus and method for dynamic range transforming of images
CN102611888B (en) 2011-11-18 2014-07-23 北京工业大学 Encoding method for screen content
US9262986B2 (en) 2011-12-07 2016-02-16 Cisco Technology, Inc. Reference frame management for screen content video coding using hash or checksum functions
JP2014107742A (en) 2012-11-28 2014-06-09 Toshiba Corp Image encoding device, image decoding device, image encoding method, and image decoding method
US11259020B2 (en) * 2013-04-05 2022-02-22 Qualcomm Incorporated Determining palettes in palette-based video coding
US9558567B2 (en) 2013-07-12 2017-01-31 Qualcomm Incorporated Palette prediction in palette-based video coding
KR102275639B1 (en) 2013-10-14 2021-07-08 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 Features of base color index map mode for video and image coding and decoding
US10291827B2 (en) 2013-11-22 2019-05-14 Futurewei Technologies, Inc. Advanced screen content coding solution
WO2015103496A2 (en) 2014-01-02 2015-07-09 Vid Scale, Inc. Two-demensional palette coding for screen content coding
KR102268090B1 (en) 2014-03-14 2021-06-23 브이아이디 스케일, 인크. Palette coding for screen content coding
US9826242B2 (en) 2014-03-14 2017-11-21 Qualcomm Incorporated Palette-based video coding
US10638143B2 (en) 2014-03-21 2020-04-28 Futurewei Technologies, Inc. Advanced screen content coding with improved color table and index map coding methods

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5848195A (en) * 1995-12-06 1998-12-08 Intel Corporation Selection of huffman tables for signal encoding
US20070116370A1 (en) 2002-06-28 2007-05-24 Microsoft Corporation Adaptive entropy encoding/decoding for screen capture content
US20120275697A1 (en) * 2006-02-23 2012-11-01 Microsoft Corporation Pre-processing of image data for enhanced compression
US20090010533A1 (en) * 2007-07-05 2009-01-08 Mediatek Inc. Method and apparatus for displaying an encoded image
US20130114893A1 (en) * 2011-11-03 2013-05-09 Google Inc. Image Compression Using Sub-Resolution Images

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
D. IVANOV; YE. KUZMIN: "Computer Graphics Forum", vol. 19, 2000, WILEY-BLACKWELL PUBLISHING, article "Color Distribution - a new approach to texture compression"
See also references of EP3063703A4

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10291827B2 (en) 2013-11-22 2019-05-14 Futurewei Technologies, Inc. Advanced screen content coding solution
EP3055830A4 (en) * 2014-03-21 2017-02-22 Huawei Technologies Co., Ltd. Advanced screen content coding with improved color table and index map coding methods
US10638143B2 (en) 2014-03-21 2020-04-28 Futurewei Technologies, Inc. Advanced screen content coding with improved color table and index map coding methods
US10091512B2 (en) 2014-05-23 2018-10-02 Futurewei Technologies, Inc. Advanced screen content coding with improved palette table and index map coding methods
WO2016115343A3 (en) * 2015-01-14 2016-10-13 Vid Scale, Inc. Palette coding for non-4:4:4 screen content video
WO2016197392A1 (en) * 2015-06-12 2016-12-15 Mediatek Singapore Pte. Ltd. Improvements for non-local index prediction
GB2539486B (en) * 2015-06-18 2019-07-31 Gurulogic Microsystems Oy Encoder, decoder and method employing palette compression
US12015790B2 (en) 2015-06-18 2024-06-18 Gurulogic Microsystems Oy Encoder, decoder and method employing palette compression
CN107710760A (en) * 2015-10-15 2018-02-16 富士通株式会社 Method for encoding images, device and image processing equipment

Also Published As

Publication number Publication date
CL2016001224A1 (en) 2016-12-09
EP3063703A4 (en) 2016-10-19
HK1220531A1 (en) 2017-05-05
US10291827B2 (en) 2019-05-14
CN105745671A (en) 2016-07-06
CA2931386C (en) 2020-06-09
RU2016124544A (en) 2017-12-27
CN105745671B (en) 2019-09-13
BR112016011471A2 (en) 2021-05-18
AU2014352656B2 (en) 2017-06-22
US20150146976A1 (en) 2015-05-28
MX2016006612A (en) 2017-04-25
KR20160085893A (en) 2016-07-18
UA118114C2 (en) 2018-11-26
CA2931386A1 (en) 2015-05-28
JP6294482B2 (en) 2018-03-14
MX362406B (en) 2019-01-16
JP2017502561A (en) 2017-01-19
RU2646355C2 (en) 2018-03-02
KR101972936B1 (en) 2019-04-26
EP3063703A1 (en) 2016-09-07
AU2014352656A1 (en) 2016-06-23
NZ720776A (en) 2017-09-29

Similar Documents

Publication Publication Date Title
AU2014352656B2 (en) Advanced screen content coding solution
US10659791B2 (en) Hierarchy of motion prediction video blocks
KR102268090B1 (en) Palette coding for screen content coding
KR101951083B1 (en) Two-dimensional palette coding for screen content coding
CN110677656A (en) Method for performing palette decoding and decoding apparatus
AU2013217035A1 (en) Restriction of prediction units in B slices to uni-directional inter prediction
EP3984222A1 (en) Chroma coding enhancement in cross-component sample adaptive offset
US11800124B2 (en) Chroma coding enhancement in cross-component sample adaptive offset
US20230209093A1 (en) Chroma coding enhancement in cross-component sample adaptive offset
US20230199209A1 (en) Chroma coding enhancement in cross-component sample adaptive offset
KR20130050902A (en) Method for image encoding/decoding and apparatus thereof
BR112016011471B1 (en) METHOD AND SYSTEM FOR ENCODING SCREEN CONTENT IN A BITS STREAM

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14864463

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 245752

Country of ref document: IL

ENP Entry into the national phase

Ref document number: 2931386

Country of ref document: CA

Ref document number: 2016533032

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: MX/A/2016/006612

Country of ref document: MX

NENP Non-entry into the national phase

Ref country code: DE

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112016011471

Country of ref document: BR

REEP Request for entry into the european phase

Ref document number: 2014864463

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2014864463

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 20167016238

Country of ref document: KR

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: IDP00201604072

Country of ref document: ID

WWE Wipo information: entry into national phase

Ref document number: A201606679

Country of ref document: UA

ENP Entry into the national phase

Ref document number: 2016124544

Country of ref document: RU

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2014352656

Country of ref document: AU

Date of ref document: 20141124

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 112016011471

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20160520

ENPC Correction to former announcement of entry into national phase, pct application did not enter into the national phase

Ref document number: 112016011471

Country of ref document: BR

Kind code of ref document: A2

Free format text: ANULADA A PUBLICACAO CODIGO 1.3 NA RPI NO 2431 DE 08/08/2017 POR TER SIDO INDEVIDA.

REG Reference to national code

Ref country code: BR

Ref legal event code: B01E

Ref document number: 112016011471

Country of ref document: BR

Kind code of ref document: A2

Free format text: APRESENTAR, EM ATE 60 (SESSENTA) DIAS, DOCUMENTOS DE CESSAO ESPECIFICOS PARA AS PRIORIDADES US 61/907,903 DE 22/11/2013 E US 14/549,405 DE 20/11/2014, CONFORME DISPOSTO NO ART. 2O, 1O E 2O DA RES 179/2017, UMA VEZ QUE O DOCUMENTO ENVIADO NA PETICAO NO 870160036404 DE 15/07/2016 NAO APRESENTA TODOS OS DADOS IDENTIFICADORES DA PRIORIDADE, CONTEMPLANDO SOMENTE UM TITULO. A CESSAO DEVE CONTER, NO MINIMO, NUMERO ESPECIFICO DA PRIORIDADE A SER CEDIDA, DATA DE DEPOSITO DA PRIORIDADE, ASSINATURA DE TODOS OS INVENTORES E DATA.

ENP Entry into the national phase

Ref document number: 112016011471

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20160520