EP2767085A1 - Bereichsbasierte bildkompression - Google Patents
Bereichsbasierte bildkompressionInfo
- Publication number
- EP2767085A1 EP2767085A1 EP12778021.1A EP12778021A EP2767085A1 EP 2767085 A1 EP2767085 A1 EP 2767085A1 EP 12778021 A EP12778021 A EP 12778021A EP 2767085 A1 EP2767085 A1 EP 2767085A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- region
- image
- acceptability criteria
- regions
- compression
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/587—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal sub-sampling or interpolation, e.g. decimation or subsequent interpolation of pictures in a video sequence
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/119—Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/12—Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/12—Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
- H04N19/122—Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
- H04N19/192—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
- H04N19/194—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive involving only two passes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
- H04N19/196—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/96—Tree coding, e.g. quad-tree coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/154—Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
Definitions
- the present invention is generally directed to image compression and in particular, to a method for region-based image compression.
- Lossy compression techniques require methods to effectively encode images at lower bit rates without sacrificing significant image quality.
- Fixed rate compression schemes generally have poor image quality at rates significantly below four bits per pixel.
- Some existing variable rate compression techniques like Joint Photographic Experts Group (JPEG), apply some form of transform and quantization.
- Some methods of reducing the amount of data to be stored after compression may involve storing the data in a sparse manner and interpolating the results.
- Existing methods generally have not provided good levels of image quality, and in some cases may also introduce potentially undesired image artifacts (e.g., high frequency noise).
- Adding a local per-region transform and quantization step before subsequent compression steps may reduce the amount of data to be compressed, thereby reducing the required bit rate needed to maintain a high level of image quality.
- a reconstruction transformation is applied to generate the pixel values.
- a method for compressing an image includes decomposing the image into one or more regions.
- a region of the image is selected to be evaluated.
- the selected region is transformed and quantized if the region does not meet a predetermined compression acceptability criteria.
- the predetermined compression acceptability criteria may include a specific bit rate, a specific image quality, or combinations thereof. If the region does not meet the predetermined compression acceptability criteria after the region has been transformed and quantized, then the transformation and quantization settings are adjusted and the region is transformed and quantized using the adjusted settings.
- the region is then encoded when the predetermined compression acceptability criteria has been reached.
- a method for compressing an image includes decomposing the image into one or more regions.
- a region of the image is selected to be evaluated.
- the selected region is decomposed into subregions.
- the subregions are transformed and quantized if the region does not meet a predetermined compression acceptability criteria. If the region does not meet the predetermined compression acceptability criteria based on a combination of subregion split, transform, and quantization, then the subregion split is adjusted and the adjusted subregion is transformed and quantized.
- the region is then encoded when the predetermined compression acceptability criteria has been reached.
- a method for decompressing an image, the image including one or more regions includes selecting a region of the image to decode. The selected region and metadata associated with the selected region are decoded. A reconstruction transformation is applied to the selected region, wherein the metadata includes information regarding the reconstruction transformation.
- a method for decompressing an image including one or more regions, each of which includes one or more subregions.
- a region of the image is selected to be decompressed.
- the selected region and metadata associated with the selected region are decoded.
- Reconstruction transformations are applied to each of the subregions in the selected region, wherein the metadata includes information regarding the reconstruction transformations.
- a system for compressing an image includes an encoder.
- the encoder is configured to decompose the image into one or more regions, select a region of the image to evaluate, and transform and quantize the region if the region does not meet a predetermined compression acceptability criteria. If the region does not meet the predetermined compression acceptability criteria after the region has been transformed and quantized, the encoder is further configured to adjust the transformation and quantization settings, and transform and quantize the region using the adjusted settings. The region is then encoded when the predetermined compression acceptability criteria has been reached.
- a system for decompressing an image includes a decoder.
- the decoder is configured to select a region of the image to decode, decode the selected region and metadata associated with the selected region, and apply a reconstruction transformation to the selected region, wherein the metadata includes information regarding the reconstruction transformation.
- a non-transitory computer-readable storage medium storing a set of instructions for execution by a general purpose computer to compress an image includes a decomposing code segment, a selecting code segment, a transforming and quantizing code segment, an adjusting code segment, and an encoding code segment.
- the decomposing code segment is for decomposing the image into one or more regions.
- the selecting code segment is for selecting a region of the image to evaluate.
- the transforming and quantizing code segment is for transforming and quantizing the region if the region does not meet a predetermined compression acceptability criteria.
- the adjusting code segment is for adjusting the transformation and quantization settings if the region does not meet the predetermined compression acceptability criteria after the region has been transformed and quantized.
- a method for compressing an image includes decomposing the image into one or more regions.
- a region is selected, and is split into one or more subregions.
- a subregion, one of a plurality of transforms, and one of a plurality of quantizers are selected.
- the selected subregion split, transform, and quantizer are evaluated against a predetermined compression acceptability criteria.
- Each of the subregions, the subregion splits, the transforms, and the quantizers are iteratively selected to determine a best subregion split, transform, and quantizer in terms of the predetermined compression acceptability criteria. All of the subregions are encoded using the best subregion split, transform, and quantizer when the predetermined compression acceptability criteria has been reached.
- a method for compressing an image includes decomposing the image into one or more regions.
- a region of the image is selected to be evaluated.
- the selected region is compressed, and is transformed and quantized if the compressed region does not meet a predetermined compression acceptability criteria. If the transformed and quantized region does not meet the predetermined compression acceptability criteria after it has been transformed and quantized, then the transformation and quantization settings are adjusted.
- the compressed region is transformed and quantized using the adjusted settings.
- the transformed and quantized region is encoded when the predetermined compression acceptability criteria has been reached.
- Figure 1 is a block diagram of an example device in which one or more disclosed embodiments may be implemented
- Figure 2 is a flow chart of a method for compressing an image
- Figure 3 is a flow chart of an alternate method for compressing an image
- Figure 4 is a flow chart of a method for decompressing a region of an image
- Figure 5 is a flow chart of a method for compressing an image that evaluates combinations of transforms and quantizers.
- FIG. 1 is a block diagram of an example device 100 in which one or more disclosed embodiments may be implemented.
- the device 100 may include, for example, a computer, a gaming device, a handheld device, a set-top box, a television, a mobile phone, or a tablet computer.
- the device 100 includes a processor 102, a memory 104, a storage 106, one or more input devices 108, and one or more output devices 110.
- the device 100 may also optionally include an input driver 112 and an output driver 114. It is understood that the device 100 may include additional components not shown in Figure 1.
- the processor 102 may include a central processing unit (CPU), a graphics processing unit (GPU), a CPU and GPU located on the same die, or one or more processor cores, wherein each processor core may be a CPU or a GPU.
- the memory 104 may be located on the same die as the processor 102, or may be located separately from the processor 102.
- the memory 104 may include a volatile or non-volatile memory, for example, random access memory (RAM), dynamic RAM, or a cache.
- the storage 106 may include a fixed or removable storage, for example, a hard disk drive, a solid state drive, an optical disk, or a flash drive.
- the input devices 108 may include a keyboard, a keypad, a touch screen, a touch pad, a detector, a microphone, an accelerometer, a gyroscope, a biometric scanner, or a network connection (e.g., a wireless local area network card for transmission and/or reception of wireless IEEE 802 signals).
- the output devices 110 may include a display, a speaker, a printer, a haptic feedback device, one or more lights, an antenna, or a network connection (e.g., a wireless local area network card for transmission and/or reception of wireless IEEE 802 signals).
- the input driver 112 communicates with the processor 102 and the input devices 108, and permits the processor 102 to receive input from the input devices 108.
- the output driver 114 communicates with the processor 102 and the output devices 110, and permits the processor 102 to send output to the output devices 110. It is noted that the input driver 112 and the output driver 114 are optional components, and that the device 100 will operate in the same manner if the input driver 112 and the output driver 114 are not present.
- Figure 2 is a flow chart of a method 200 for compressing an image.
- An image to be encoded is selected (step 202) and the selected image is decomposed into several regions according to a predetermined method (step 204).
- the regions may be a fixed size or a variable size, and the decomposing method may be hierarchical. It is noted that the particular method used to decompose the image into regions does not affect the overall operation of the method 200.
- a region is selected for evaluation (step 206) and is examined to determine if the region meets a predetermined compression acceptability criteria (step 208).
- the predetermined compression acceptability criteria may include, but is not limited to, a specific bit rate, a specific image quality, or combinations thereof. It may be possible to encode the region to meet the predetermined compression acceptability criteria using the basic underlying compression system. In this case, no additional transform and quantization step is required, and the region can be processed directly in the encoding stage. This may be viewed as a special case where the transform is the identity transform.
- the region does not meet the predetermined compression acceptability criteria (step 208), then several refinements may be performed.
- the region is transformed and quantized (step 210). If the method determines that the region needs to be transformed and quantized to satisfy predefined compression acceptability criteria, then the method selects the transform and quantization from a predefined set.
- the set may include only linear transforms, for example filtering with a smoothing kernel, wavelet transforms, curvelet transforms, Gabor wavelet transforms, etc. In another embodiment, the set may include non-linear transforms.
- the encoder may evaluate multiple potential combinations of transform and quantization, selecting the combination that achieves the highest quality at the predetermined compression acceptability criteria.
- the encoder may have parameters to control the extent of any optimization steps at this stage to tradeoff overall compression quality against encoding performance. These controls may limit the extent of the search for optimal transforms and quantizations, and may also provide threshold values, permitting the technique to exit early when certain targets are reached.
- Quantization is performed by taking the coefficients output from the transform and rounding them to a predefined set of values, and the set may be different for each coefficient.
- sets of the values corresponding to some of the coefficients may consist of a single value of zero, which means that the corresponding coefficients are discarded (such as in downsampling). After quantization, the remaining coefficients are encoded.
- the region is encoded (step 216).
- the encoding may incorporate some underlying compression.
- the output data format includes some metadata to be stored and/or transmitted with the region, to indicate the transform and the quantization applied for that region and other information that may be required for decoding.
- the encoded region may be transmitted (not shown in Figure 2). If all of the regions of the image have not been examined (step 218), then the method continues by selecting another region to evaluate (step 206). If all of the regions of the image have been examined (step 218), then the method terminates.
- the transform and quantization in step 210 may be configured to be a downscaling operation.
- a region that is to be compressed is evaluated and downscaled with a selected aspect ratio (which encompasses the transform and quantization) prior to compression, to reduce the total number of pixels in the region while retaining as much of the information as possible.
- Performing the downscaling reduces the amount of data prior to encoding, allowing the encoding (which may include additional compression steps) to occur with a higher accuracy for a given bit rate.
- One of a set of different aspect ratios may be selected for downscaling the region.
- the selected aspect ratio provides the best results according to a selected error metric (for example, peak signal to noise ratio) by evaluating the results of quantizing to each possible ratio against this metric for the current region.
- the target bit rate is known (for example, in a fixed-rate compression scheme), and the amount of space available at the target bit rate can be calculated.
- this information there may be multiple ways a region could be scaled to fit in the available space.
- some of the high-frequency image information is discarded, effectively blurring the region.
- the choice of the scaling aspect ratio may have a significant impact on preserving the image quality.
- By applying a non-uniform scaling to the original data more of the important information in the original image can be preserved.
- the compression can respond to local characteristics in the image content for different regions.
- the implementation may potentially examine multiple possible choices of transform and quantization for each region in the image to optimize the predetermined compression acceptability criteria.
- FIG. 3 is a flow chart of an alternate method 300 for compressing an image.
- An image to be encoded is selected (step 302) and the selected image is decomposed into several regions according to a predetermined method (step 304).
- the regions may be a fixed size or a variable size, and the decomposing method may be hierarchical. It is noted that the particular method used to decompose the image into regions does not affect the overall operation of the method 300.
- a region is selected for evaluation (step 306) and is examined to determine if the region meets a predetermined compression acceptability criteria (step 308).
- the predetermined compression acceptability criteria may include, but is not limited to, a specific bit rate, a specific image quality, or combinations thereof. It may be possible to encode the region to meet the predetermined compression acceptability criteria using the basic underlying compression system. In this case, no transform and quantization step is required, and the region can be processed by the underlying compression scheme. This may be viewed as a special case where the transform is the identity transform.
- the region does not meet the predetermined compression acceptability criteria (step 308), then several refinements may be performed.
- the region is split into subregions (step 310), and the subregions are transformed and quantized (step 312). If the encoder determines that the region needs to be split, transformed, and quantized to satisfy the predetermined compression acceptability criteria, then the encoder selects a split, transform, and quantization from a set of predefined splits, transforms, and quantizations.
- the set may include only linear transforms, for example filtering with a smoothing kernel, wavelet transforms, curvelet transforms, Gabor wavelet transforms, etc. In another embodiment, the set may include nonlinear transforms.
- the encoder may evaluate multiple potential combinations of region split (how the region is split into subregions), transform, and quantization, selecting the combination that achieves the highest quality to meet the predetermined compression acceptability criteria.
- the encoder may have parameters to control the extent of any optimization steps at this stage to tradeoff overall compression quality against encoding performance. These controls may limit the extent of the search for optimal regions, subregion splits, transforms, and quantizations, and may also provide threshold values, permitting the technique to exit early when certain targets are reached.
- step 314 It is then determined whether the region meets the predetermined compression acceptability criteria based on a combination of split, transform, and quantization (step 314). If the region does not meet the predetermined compression acceptability criteria, then the split (how the region is split into subregions), transform, and/or quantization may be adjusted (step 316) and the adjusted subregions are transformed and quantized (step 312) based on the adjustment(s). If the split is adjusted (step 316), a different splitting technique may be used to generate alternative region splits that may result in achieving the predetermined compression acceptability criteria.
- the region is encoded (step 318).
- the encoding may incorporate some underlying compression.
- the output data format includes some metadata to be stored and/or transmitted with the region, to indicate the region split, the transform, and the quantization applied for that region and other information that may be required for decoding.
- the encoded region may be transmitted (not shown in Figure 3). If all of the regions of the image have not been examined (step 320), then the method continues by selecting another region to evaluate (step 306). If all of the regions of the image have been examined (step 320), then the method terminates.
- Figure 4 is a flow chart of a method 400 for decompressing a region of an image.
- a region of the image is selected for decoding (step 402), and the selected region and its associated metadata are decoded (step 404).
- a reconstruction transformation is applied to the region using information included in the metadata (step 406).
- Additional processing is then performed on the region as needed prior to displaying the image (step 408). Examples of the additional processing may include, for example, texture mapping operations, etc.
- the region maybe split into subregions.
- the subregions may share a single transform and quantization (specified for the whole region), or each subregion may have its own individual transform and quantization specified.
- step 406 may be an upscaling operation, if the region was downscaled during encoding.
- the data is expanded according to the underlying compression method for the region.
- the region is then upscaled using information included in the metadata describing the aspect ratio used for the downscaling (step 406).
- the upscaling may use any applicable filter, but to preserve image quality, the encoder needs to know what filter will be used by the decoder, as this allows the compression quality to be tuned more precisely.
- the upscaling filter may be bilinear, because this filter is simple and cheap to implement. Other types of upscaling filters may be used without substantially altering the operation of the method 400.
- the type of filter used for upscaling may be uniform over the entire image or may be selected independently for each region of the image.
- the encoder uses a fixed- rate region-based compression scheme with a given region size, e.g., 8x8.
- a given region size e.g. 8x8.
- Each region is compressed independently. If it is not possible to encode every pixel in the region explicitly at the required bit rate, then the region is downscaled by a predetermined ratio prior to compression. For example, the 8x8 region may be reduced in size to 8x6, which would reduce the amount of pixel information that needs to be stored by 25%. The level of information reduction is chosen to allow the region to be encoded at the desired compression acceptability criteria.
- the downscaling may be accomplished by any appropriate method, with higher quality methods being used to retain more useful information.
- the encoder may try different ratios, and use the ratio that provides the best image quality in terms of the predetermined compression acceptability criteria (selecting from the multiple different quantizations).
- the method may choose to use a higher level of downscaling (e.g., 8x5, 8x4, 6x5, etc.) and evaluate these ratios in conjunction with the encoder using back-end compression schemes that have a lower compression rate.
- a higher level of downscaling e.g., 8x5, 8x4, 6x5, etc.
- the remaining pixels may be encoded with a higher accuracy (i.e., a lower compression rate), while achieving the same predetermined compression acceptability criteria.
- One extension to this embodiment is to downsample information along a selected vector direction, to preserve more of the image quality in the region (rather than the approximation achieved using downsampling aligned to the X and Y axes but with a variable aspect ratio).
- the quantization could be the same as above, but the transform is now different.
- This extension may allow better preservation of detail in regions of the image where the high frequency content is aligned closer to the diagonals. For example, if an image can be downscaled with knowledge of the direction of motion in the image (if any), then the high-frequency information orthogonal to the direction of the motion can be retained, while other information may be discarded.
- a second extension to this embodiment is to subdivide the original region further into subregions, and independently scale each subregion (select a different transform and quantization for each subset) to better match the characteristics of the region to provide a higher image quality.
- FIG. 5 is a flow chart of a method 500 for compressing an image that evaluates combinations of regions, subregions, transforms, and quantizers.
- An image to be encoded is selected (step 502) and the selected image is decomposed into regions (step 504).
- a region of the image is selected (step 506).
- the selected region is split into subregions (step 508), a subregion is selected (step 510), a transform is selected (step 512), and a quantizer is selected (step 514).
- the selected subregion of the image is processed and evaluated to determine whether it meets predetermined compression acceptability criteria (step 516).
- the selection of the split (step 508), transform (step 512), and quantizer (step 514) may be performed in any order without affecting the overall operation of the method 500.
- the compression acceptability criteria that are determined by the selected subregion split, transform, and quantizer may be stored for later comparison.
- step 518 If all of the quantizers have not been evaluated or a threshold compression acceptability criteria has not been reached (step 518), then another quantizer is selected (step 514) and processing continues as described above. If all of the quantizers have been evaluated or if the threshold compression acceptability criteria has been reached (step 518), then a determination is made whether all of the transforms have been evaluated or the threshold compression acceptability criteria has been reached (step 520).
- step 520 If all of the transforms have not been evaluated or the threshold compression acceptability criteria has not been reached (step 520), then another transform is selected (step 512) and processing continues as described above. If all of the transforms have been evaluated or the threshold compression acceptability criteria has been reached (step 520), then a determination is made whether all subregions have been evaluated (step 522).
- step 522) then another subregion of the region is selected (step 510) and processing continues as described above. If all of the subregions have been evaluated (step 522), then a determination is made whether all of the subregion splits have been evaluated or the threshold compression acceptability criteria has been reached (step 524).
- step 524 If all of the subregion splits have not been evaluated and the threshold compression acceptability criteria has not been reached (step 524), then the region is split into different subregions (step 508) and processing continues as described above. In an alternative embodiment (not shown), the threshold compression acceptability criteria defined in steps 518, 520, and 524 may not be used.
- the best splits, transforms, and quantizers are selected (step 526). All of the subregions of the region are encoded using the best subregion splits, transforms, and quantizers (step 528). In one implementation, the encoding in step 528 may also include additional compression.
- the output data format includes some metadata to be stored and/or transmitted with the region, to indicate the subregion splits, transforms, and quantizers applied for that region and other information that may be required for decoding.
- step 530 a determination is made whether all of the regions of the image have been examined. If all of the regions of the image have not been examined, then another region of the image is selected (step 506) and processing continues as described above. If all of the regions of the image have been examined (step 530), then the method terminates.
- texture filtering operations will be performed on the decoded data, so an upscaling filter may be implemented by manipulating the texture filtering hardware, rather than by implementing an additional dedicated upscaler.
- the (encoding) generates index coefficients that are used to select colors.
- the transformation and quantization may be performed on the index coefficients produced by the underlying encoder, rather than on the original color data.
- the region is compressed prior to the region being transformed and quantized.
- processors include, by way of example, a general purpose processor, a special purpose processor, a conventional processor, a digital signal processor (DSP), a plurality of microprocessors, one or more microprocessors in association with a DSP core, a controller, a microcontroller, Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) circuits, any other type of integrated circuit (IC), and/or a state machine.
- DSP digital signal processor
- ASICs Application Specific Integrated Circuits
- FPGAs Field Programmable Gate Arrays
- Such processors may be manufactured by configuring a manufacturing process using the results of processed hardware description language (HDL) instructions and other intermediary data including netlists (such instructions capable of being stored on a computer readable media).
- HDL hardware description language
- netlists such instructions capable of being stored on a computer readable media.
- the results of such processing may be maskworks that are then used in a semiconductor manufacturing process to manufacture a processor which implements aspects of the present invention.
- non-transitory computer-readable storage medium for execution by a general purpose computer or a processor.
- non-transitory computer-readable storage mediums include a read only memory (ROM), a random access memory (RAM), a register, cache memory, semiconductor memory devices, magnetic media such as internal hard disks and removable disks, magneto-optical media, and optical media such as CD-ROM disks, and digital versatile disks (DVDs).
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Discrete Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computing Systems (AREA)
- Theoretical Computer Science (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161547648P | 2011-10-14 | 2011-10-14 | |
PCT/US2012/060069 WO2013056129A1 (en) | 2011-10-14 | 2012-10-12 | Region-based image compression |
Publications (1)
Publication Number | Publication Date |
---|---|
EP2767085A1 true EP2767085A1 (de) | 2014-08-20 |
Family
ID=51205651
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP12778021.1A Ceased EP2767085A1 (de) | 2011-10-14 | 2012-10-12 | Bereichsbasierte bildkompression |
Country Status (1)
Country | Link |
---|---|
EP (1) | EP2767085A1 (de) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6532308B1 (en) * | 1999-02-04 | 2003-03-11 | Quvis, Inc. | Quality priority image storage and communication |
-
2012
- 2012-10-12 EP EP12778021.1A patent/EP2767085A1/de not_active Ceased
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6532308B1 (en) * | 1999-02-04 | 2003-03-11 | Quvis, Inc. | Quality priority image storage and communication |
Non-Patent Citations (1)
Title |
---|
See also references of WO2013056129A1 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11503295B2 (en) | Region-based image compression and decompression | |
US9204169B2 (en) | System and method for compressing images and video | |
US9253507B2 (en) | Method and device for interpolating images by using a smoothing interpolation filter | |
US7634148B2 (en) | Image signal transforming and inverse-transforming method and computer program product with pre-encoding filtering features | |
EP2145476B1 (de) | Bildkomprimierung und -dekomprimierung mithilfe des pixon-verfahrens | |
Kumar et al. | A review: DWT-DCT technique and arithmetic-Huffman coding based image compression | |
KR20040005962A (ko) | 버터플라이 프로세서를 이용하여 이산 코사인 변환을인코딩하고 계산하는 장치 및 방법 | |
CN105392014B (zh) | 一种优化的小波变换图像压缩方法 | |
CN108182712B (zh) | 图像处理方法、装置及系统 | |
KR102113904B1 (ko) | 보간을 이용한 연산 방법, 인코더, 및 디코더 | |
Thakker et al. | Lossy Image Compression-A Comparison Between Wavelet Transform, Principal Component Analysis, K-Means and Autoencoders | |
EP2767085A1 (de) | Bereichsbasierte bildkompression | |
US11310496B2 (en) | Determining quality values for blocks of encoded video | |
WO2023178662A1 (en) | Image and video coding using multi-sensor collaboration and frequency adaptive processing | |
US9020291B2 (en) | Resized image compression based on frequency content | |
US11563945B2 (en) | Adaptive offset for variance based quantization | |
RU2645290C1 (ru) | Способ кодирования оцифрованных изображений с использованием адаптивного ортогонального преобразования | |
Sadanandan et al. | Image compression with modified skipline encoding and curve fitting | |
CN117939125A (zh) | 图像处理方法、计算机设备及计算机可读存储介质 | |
US20010024523A1 (en) | Reduction of artifacts in a DWT-based compression scheme by adjusting inverse quantized transform data according to noise values | |
Saeed | Compression technique for DubaiSat-2 images based on the DCT blocks | |
US20130051695A1 (en) | Image compressing device, image compressing method, and image compressing program | |
Reddy et al. | Image Compression with Variable Threshold and Adaptive Block Size |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20140411 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
DAX | Request for extension of the european patent (deleted) | ||
17Q | First examination report despatched |
Effective date: 20151125 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R003 |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: ADVANCED MICRO DEVICES, INC. |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: ADVANCED MICRO DEVICES, INC. |
|
18R | Application refused |
Effective date: 20171121 |