WO2006076602A1 - A nonlinear, in-the-loop, denoising filter for quantization noise removal for hybrid video compression - Google Patents
A nonlinear, in-the-loop, denoising filter for quantization noise removal for hybrid video compression Download PDFInfo
- Publication number
- WO2006076602A1 WO2006076602A1 PCT/US2006/001290 US2006001290W WO2006076602A1 WO 2006076602 A1 WO2006076602 A1 WO 2006076602A1 US 2006001290 W US2006001290 W US 2006001290W WO 2006076602 A1 WO2006076602 A1 WO 2006076602A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frame
- transform
- denoising
- prediction
- coefficient
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
- H04N19/86—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/117—Filters, e.g. for pre-processing or post-processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/18—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
- H04N19/192—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
- H04N19/82—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
Definitions
- the present invention relates to the field of processing video frames; more particularly, the present invention relates to filtering quantization noise from video frames, thereby improving video compression.
- Hybrid video compression consists of encoding an anchor video frame and then predictively encoding a set of predicted frames. Predictive encoding uses motion compensated prediction with respect to previously coded frames in order to obtain a prediction error frame followed by the encoding of this prediction error frame. Anchor frames and prediction errors are encoded using transform coders in a manner well-known in the art.
- Figure 1 is a block diagram of a generic hybrid video coder.
- a MC prediction module 110 generates a motion compensated prediction from a previously decoded frame 109.
- a first adder 102 subtracts the motion compensated prediction from a current frame 101 to obtain a residual frame 111.
- a transform coder 103 converts residual frame 111 to a coded differential 104, for example by using a combination of a transform, a quantizer, and an entropy encoder.
- a transform decoder 105 converts coded differential 104 to a reconstructed residual frame 112, for example by using a combination of an entropy decoder, an inverse quantizer, and an inverse transform.
- a second adder 106 adds reconstructed residual frame 112 to the motion compensated prediction from MC prediction module 110 to obtain a reconstructed frame 113.
- a delay element "Z-I" 108 stores the reconstructed frame for future reference by MC prediction module 110.
- Transform coded frames incur quantization noise. Due to the predictive coding of frames, quantization noise has two adverse consequences: (i) the quantization noise in frame n causes reduced quality in the display of frame n, (ii) the quantization noise in frame n causes reduced quality in all frames that use frame n as part of their prediction.
- Prior solutions have been applied to video frames that have smoothly varying pixel values. This is because prior solutions are derived using smooth image models.
- the filters derived are typically restricted to low-pass filters. These are not applicable on many types of image regions, such as on edges, textures, etc.
- Related art is typically restricted to a single type of quantization artifacts. Techniques typically specialize on blocking artifacts, or ringing artifacts, or other types of artifacts without providing a general means for addressing all types of artifacts.
- the video encoder comprises a transform coder to apply a transform to a residual frame representing a difference between a current frame and a first prediction, the transform coder outputting a coded differential frame as an output of the video encoder; a transform decoder to generate a reconstructed residual frame in response to the coded differential frame; a first adder to create a reconstructed frame by adding the reconstructed residual frame to the first prediction; a non-linear denoising filter to filter the reconstructed frame by deriving expectations and performing denoising operations based on the expectations; and a prediction module to generate predictions, including the first prediction, based on previously decoded frames.
- Figure 1 illustrates a prior art video encoder
- Figure 2A is a block diagram of one embodiment of a video encoder with an in-the-loop denoising filter embedded therein;
- Figure 2B is a block diagram of one embodiment of a video decoder
- Figure 3 is a flow diagram of one embodiment of a process for obtaining a denoised video frame
- Figure 4 is a flow diagram of an alternative embodiment of a process to alter the denoising parameters for each of the coefficients that is designed to perform denoising on a selected subset of pixels
- Figure 5 is a block diagram of one embodiment of a process for obtaining a denoised video frame using a multitude of transforms
- Figure 6 is a block diagram of one embodiment of a nonlinear denoising filter.
- Figure 7 is an example of a computer system.
- the filtering is designed to remove noise incurred during the compression of the frames.
- the filtering technique is based on deriving an expectation and implementing denoising operations based on the derived expectation. Mode based decisions are derived for the effective denoising of differentially compressed video frames.
- weighted denoising technique is applied which further improves performance.
- the described techniques are robust and general, being able to effectively handle a multitude of image region types and a multitude of compression techniques.
- the derived nonlinear, in-the-loop denoising filters adaptively and autonomously develop the proper frequency selectivity for the multitude of image region types by developing low-pass selectivity in smooth image regions, high-pass selectivity in high-frequency regions, etc.
- This apparatus may be specially constructed for the required purposes, or it may comprise a general purpose computer selectively activated or reconfigured by a computer program stored in the computer.
- a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus.
- a machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer).
- a machine-readable medium includes read only memory ("ROM”); random access memory (“RAM”); magnetic disk storage media; optical storage media; flash memory devices; electrical, optical, acoustical or other form of propagated signals (e.g., carrier waves, infrared signals, digital signals, etc.); etc.
- FIG. 2A is a block diagram of one embodiment of a hybrid video encoder with an in-the-loop denoising filter embedded therein.
- the enhanced video encoder includes a nonlinear denoising filter 207 that filters the reconstructed frame to remove compression noise before storing it in the delay element. That is, filter 207 is embedded inside the prediction loop so that the benefits of filtering can also be used in building predictions for all frames that depend on the filtered frame.
- FIG. 2B is a block diagram of one embodiment of a hybrid video decoder.
- MC prediction module 225 generates a motion compensated prediction from a previously decoded frame 232.
- a transform decoder 221 converts the coded differential to a reconstructed residual frame (decoded differential 231), for example by using a combination of an entropy decoder, an inverse quantizer, and an inverse transform.
- Adder 222 adds the reconstructed residual frame 233 to the motion compensated prediction to obtain a reconstructed frame 233.
- a nonlinear denoising filter 223 converts the reconstructed frame 233 to a filtered frame 234, which may be output to a display.
- a delay element "Z-I" 224 stores filtered frame 234 for future reference by MC prediction module 225.
- the operation of the nonlinear denoising filter 223 is further described below.
- the denoising filter derives a statistical expectation that is used to define a denoising operation. The expectation determines whether a coefficient of a linear transform of the quantized video frame should remain as is or should be set to a predetermined value in order to obtain a better estimate of the same coefficient in the original video frame. In one embodiment, the predetermined value is zero. In one embodiment, this operation of obtaining better estimates is applied to all linear transform coefficients of the quantized video frame. Better estimates are found for all coefficients yields a better estimate of the video frame.
- the denoising filter derives mode-based decisions that refine the expectation calculation and the denoising operation. Specifically, depending on the compression mode of the predictively encoded frame, the denoising filter determines the spatial locations that should be denoised. In one embodiment, the parameters that are used in the expectation calculation are also determined based on the compression mode.
- the expectation processing described above is performed for multiple linear transforms applied to the quantized video frame.
- Each linear transform determines an estimate for the video frame. Multiple linear transforms thereby result in multiple estimates.
- the denoising filter combines these estimates to form an overall estimate. In one embodiment, the overall estimate is better than each of the estimates individually.
- the linear transform applied to the quantized video frame described above is an orthonormal transform such as, for example, a block nxn DCT.
- Other transforms, including non-orthogonal transforms and non- block transforms can also be applied. It is desirable but not necessary for this transform to have a fast implementation so that denoising computations can be performed in an efficient manner.
- y denote a quantized video frame (arranged into an NxI vector)
- H denote the linear transform as specified in the above paragraph (an NxN matrix).
- x denote the original unquantized version of y
- C Hx be its transform coefficients.
- ⁇ a canonical variable that encapsulates all of the available information (y, any previously decoded video frames, side information about x, etc.)
- _P] is referred to herein as the conditional expectation of c(i), conditioned on all the available information.
- the above rule is referred to herein as the denoising rule to obtain 6(i).
- FIG. 3 is a flow diagram of one embodiment of a process for obtaining a denoised video frame.
- the process is performed by processing logic that may comprise hardware (e.g., circuitry, dedicated logic, etc.), software (such as is run on a general purpose computer system or a dedicated machine), or a combination of both.
- processing logic may comprise firmware.
- the processing logic is in the denoising filter.
- the process begins by processing logic obtaining a decoded frame y and collecting other available information (processing block 301).
- the other available information may include quantization parameters, motion information, and mode information.
- processing logic obtains a set of coefficients d by applying a transform H to the decoded frame y (processing block 302).
- the transform H may represent a block-wise two-dimensional DCT.
- processing logic also sets a set of image elements e equal to the elements of y.
- processing logic computes a conditional expectation of c(i) for each coefficient in d based on the set of image elements e and obtains a filtered coefficient c(i) by applying a denoising rule using the value of the coefficient in d and the conditional expectation of c(i) (processing block 303).
- processing logic obtains a filtered frame x by applying the inverse of transform H to the set of coefficients c (processing block 304).
- processing logic determines whether more iterations are needed (processing block 305). For example, a fixed number of iterations such as two, maybe preset. If more iterations are needed, processing logic sets the set of image elements e to x (processing block 307) and processing transactions to processing block 303. Otherwise, the processing flow proceeds to processing block 306 where the processing logic outputs the filtered frame x .
- FIG. 4 is a flow diagram of an alternative embodiment of a modified procedure to alter the denoising parameters for each of the coefficients.
- coefficient denoising parameters may be needed in cases other than "a requirement that not all pixel values should be changed" as well.
- the denoising parameters are determined using compression mode parameters of the quantized frame.
- the subset of pixels is determined by defining a mask using compression mode parameters of the quantized frame.
- processing logic may comprise hardware (e.g., circuitry, dedicated logic, etc.), software (such as is run on a general purpose computer system or a dedicated machine), or a combination of both.
- processing logic may comprise firmware.
- the processing logic is in a denoising filter.
- the process begins by processing logic obtaining a decoded frame y and collecting other available information (processing block 401).
- the other available information may include compression parameters such as quantization parameters, motion information, and mode information.
- processing logic obtains a set of coefficients d by applying a transform H to the decoded frame y (processing block 402).
- the transform H may represent a block-wise two-dimensional DCT.
- Processing logic also sets a set of image elements e equal to the elements of y.
- Processing logic determines coefficient denoising parameters for each coefficient based on compression parameters (processing block 403) and determines a mask based on compression parameters (processing block 404). [0050] Afterwards, processing logic computes a conditional expectation of c(i) for each coefficient in d based on e and coefficient parameters and a filtered coefficient c(i) by applying a denoising rule using the value of the coefficient in d and the conditional expectation of c(i) (processing block 405). [0051] Next, processing logic obtains a filtered frame x by applying the mask function to the result of the inverse of transform H applied to the set of coefficients c (processing block 406).
- Processing logic determines whether more iterations are needed
- processing block 407 For example, a fixed number of iterations such as two, may be preset. If more iterations are needed, processing logic sets the set of image elements e to x (processing block 408) and the process transitions to processing block 405; otherwise, processing transitions to processing block 408 where processing logic outputs the filtered frame x .
- processing logic sets the set of image elements e to x (processing block 408) and the process transitions to processing block 405; otherwise, processing transitions to processing block 408 where processing logic outputs the filtered frame x .
- Hi, H 2 HM- Each of these transforms are used in a basic procedure of its own to produce estimates of the original unquantized video frame x given by X 1 , x 2 , ... , x M . These individual estimates are combined to form an overall estimate x that is better than each of the estimates.
- Figure 5 One embodiment of such a process using multiple transforms is illustrated in Figure 5.
- processing logic may comprise hardware (e.g., circuitry, dedicated logic, etc.), software (such as is run on a general purpose computer system or a dedicated machine), or a combination of both.
- processing logic may comprise firmware.
- the processing logic is part of a denoising filter.
- processing logic begins by processing logic obtaining a decoded frame y and collecting other available information (processing block 501).
- the other available information may include compression parameters such as quantization parameters, motion information, and mode information.
- processing logic obtains a set of coefficients d 1:M by applying M transforms H j to the decoded frame y (processing block 502).
- each transform H j may represent a block-wise two-dimensional DCT, where the block alignment is dependent onj.
- Processing logic also sets a set of image elements e equal to the elements of y.
- Processing logic determines coefficient denoising parameters for each coefficient based on compression parameters (processing block 503) and determines a mask based on compression parameters (processing block 504).
- processing logic computes a conditional expectation of CI : M(I) for each coefficient in di :M based on e and coefficient parameters and obtains a filtered coefficient c hM (i) by applying a denoising rule using the value of the coefficient in di ⁇ and the conditional expectation of C 1: M(0 (processing block 505).
- processing logic obtains filtered frames x 1:M (i) by applying the mask function to the result of the inverses of transforms H 1 : M applied to the set of coefficients C 1: M (processing block 506).
- Processing logic determines an overall estimate x (processing block 507). This may be performed by averaging all the estimates together.
- the averaging may be a weighted average, hi one embodiment, the overall estimate block in Figure 5 is given by weighted averaging of the individual estimates X 11 X 211111 X N .
- This can be done with equal weights or using more sophisticated weight determination techniques known in the art, such as, for example, the techniques set forth in Onur G. Guleryuz, "Weighted Overcomplete Denoising," Proc. Asilomar Conference on Signals and Systems, Pacific Grove, CA, Nov. 2003, which identifies three different weighting techniques.
- the simplest of the three is used in the present invention. Therefore, an overall estimate is obtained, which is then masked. In an alternative embodiment, the individual estimates are masked and then an overall estimate is formed.
- processing logic 508 determines whether more iterations are needed (processing logic 508). For example, a fixed number of iterations such as two, may be preset. If more iterations are needed, processing logic sets the set of image elements e to i (processing block 509) and the process transitions to processing block 505; otherwise, processing transitions to processing block 510 where processing logic outputs the filtered frame x .
- Figure 6 is a block diagram describing a nonlinear denoising filter.
- a compression mode-based denoising parameter determination module 601 generates thresholds 602 and a mask function 603 based on compression parameters 604.
- a linear transform 605 is applied to decompressed video 606 to create coefficients 607.
- An expectation calculation module 608 generates expectations 609 for each coefficient of coefficients 607 based on decompressed video 606 and thresholds 602 in a first iteration, and based on altered thresholds 610 and a denoised estimate 611 in subsequent iterations.
- expectation calculation module 608 may also use additional information 612, such as data from previous decompressed video frames. The operation of the module is described above.
- a denoising rule module 620 generates denoised coefficients 621 based on coefficients 607 and expectations 609. The operation of module 620 is described above. An inverse linear transform 630 is applied to denoised coefficients
- An overall estimate construction module 640 combines initial denoised estimates 631 to generate an overall denoised estimate 641.
- a mask application module 650 selects between elements of the overall denoised estimate
- An iteration decision module 660 generates a final denoised estimate 661 equal to the denoised estimate if it is determined that a predetermined number of iterations is reached. Otherwise, thresholds are modified into altered thresholds 610. The altering of thresholds is described in greater detail below.
- Altered thresholds 610 and the denoised estimate 611 are fed back to expectation calculation module 608 for further iterations.
- T is taken as -/J times the standard deviation of the quantization noise that c(i) incurs.
- h be the i th row of H, i.e., the basis function responsible for the generation of c(i).
- This neighborhood can be taken as +/- j pixels around the default spatial position and j can be set to 0, 1, 2 or higher.
- k coefficients in the set have magnitudes greater than T (subset 1) and the remaining 1 coefficients have magnitudes less than T (subset 2). Construct the k average value of the coefficients in subset 1 and multiply the average with .
- the resulting value is assigned to ⁇ jef/Jlifj. This calculation can be extended to incorporate coefficients from previously decoded frames that match the coefficients in the subset using motion trajectories to determine matches.
- a general condition expectation calculation is used in conjunction with a general denoising rule with coefficients set to 0 or d(i) based on the expectation.
- a second round of denoising is performed again using the filtered frame x rather than y as follows.
- T' can be taken as T/2. This calculation can also be extended to incorporate matching coefficients from previously decoded/denoised frames using motion trajectories.
- mode-based decisions for the encoder utilized in hybrid video compression is an MPEG-like encoder and the compression mode alters the threshold used in the expectation calculation for each coefficient and also alters the mask function that determines the spatial locations to which denoising is applied. Below an example of mode based decisions that accomplish these two objectives is set forth.
- a block is an BxB rectangle of pixels, where B can be 4, 8, 16, or higher.
- B can be 4, 8, 16, or higher.
- For each coded block its mode is determined as one of:
- INTRA_QUANTIZED a block coded in the anchor or intra mode
- P_QUANTIZED a block coded differentially, having at least two nonzero quantized coefficients
- P_QUANTIZED_M (a block coded differentially, having only one nonzero quantized coefficient but with a large motion difference of 4 or more pixels with respect to a horizontal or vertical block),
- P_QUANTIZED_1 (a block coded differentially, having only one nonzero quantized coefficient with a small motion difference)
- SKIP_M (a block coded differentially, having no quantized coefficients but with a large motion difference of 4 or more pixels with respect to a horizontal or vertical block),
- SKIP (a block coded differentially, having no quantized coefficients with a small motion difference with respect to a horizontal or vertical block)
- each block has a spatial denoising influence that determines a spatial mask around its boundaries.
- the influence I is specified in pixel units and it identifies a rectangular shell of thickness of I pixels around the boundaries of the block.
- the masks of all blocks are combined to form a frame- wide mask.
- the influence factors I used in mask construction are determined as follows:
- mode-based denoising decisions are accomplished as follows: If the basis function generating a coefficient overlaps an INTRA_QUANTIZED block then that coefficient uses the given threshold T in its denoising operation, else, if a coefficient overlaps a P QUANTIZED block, it uses T, else, if a coefficient overlaps a P_QUANTIZED_M block, it uses T, else, if a coefficient overlaps a P_QUANTIZED_1 block, it uses 7/8 T, else, if a coefficient overlaps a SKIP_M block, it uses use 1/2 T, else, if a coefficient overlaps a SKIP block, it uses use 1/2 T, else, if a coefficient overlaps a OTHER block, it is not denoised.
- the denoising linear transforms are given by a nxn block DCT and all of its n 2 spatial translations, where n is determined by the size of the video frame.
- n is determined by the size of the video frame.
- n is expected to get larger for higher resolution video frames.
- Embodiments of the invention can accommodate video coders that use block as well as non-block transforms in transform coding.
- Embodiments of the invention are applicable to video frames that have pixel values due to a large range of statistics, such as low- pass, band-pass, high-pass, texture, edge, etc.
- the invention is not limited to video frames that have smoothly varying pixel values.
- Embodiments of the invention are effective in denoising a wide range of quantization artifacts, such as blocking artifacts, ringing artifacts, etc.
- the invention is not limited to blocking artifacts only or ringing artifacts only, etc.
- the rate-distortion performance of embodiment of the invention on typical video frames and the visual quality of the invention on typical video frames is significantly above related art.
- embodiment of the invention can be deployed hi a way that achieves low computational complexity.
- Figure 7 is a block diagram of an exemplary computer system that may perform one or more of the operations described herein.
- Computer system 700 may comprise an exemplary client or server computer system. Components described with respect to the computer system may be part of a handheld or mobile device (e.g., a cell phone).
- computer system 700 comprises a communication mechanism or bus 711 for communicating information, and a processor 712 coupled with bus 711 for processing information.
- Processor 712 includes a microprocessor, but is not limited to a microprocessor, such as, for example, PentiumTM processor, etc.
- System 700 further comprises a random access memory (RAM), or other dynamic storage device 704 (referred to as main memory) coupled to bus 711 for storing information and instructions to be executed by processor 712.
- RAM random access memory
- Mam memory 704 also may be used for storing temporary variables or other intermediate information during execution of instructions by processor 712.
- Computer system 700 also comprises a read only memory (ROM) and/or other static storage device 706 coupled to bus 711 for storing static information and instructions for processor 712, and a data storage device 707, such as a magnetic disk or optical disk and its corresponding disk drive.
- Data storage device 707 is coupled to bus 711 for storing information and instructions.
- Computer system 700 may further be coupled to a display device 721 , such as a cathode ray tube (CRT) or liquid crystal display (LCD), coupled to bus 711 for displaying information to a computer user.
- a display device 721 such as a cathode ray tube (CRT) or liquid crystal display (LCD)
- An alphanumeric input device 722, including alphanumeric and other keys, may also be coupled to bus 711 for communicating information and command selections to processor 712.
- cursor control 723 such as a mouse, trackball, trackpad, stylus, or cursor direction keys, coupled to bus 711 for communicating direction information and command selections to processor 712, and for controlling cursor movement on display 721.
- Another device that may be coupled to bus 711 is hard copy device
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2007551412A JP4920599B2 (ja) | 2005-01-13 | 2006-01-13 | ハイブリッドビデオ圧縮の量子化雑音除去用の非線形ループ内デノイズフィルタ |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US64423005P | 2005-01-13 | 2005-01-13 | |
| US60/644,230 | 2005-01-13 | ||
| US11/331,814 | 2006-01-12 | ||
| US11/331,814 US8218634B2 (en) | 2005-01-13 | 2006-01-12 | Nonlinear, in-the-loop, denoising filter for quantization noise removal for hybrid video compression |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2006076602A1 true WO2006076602A1 (en) | 2006-07-20 |
Family
ID=36653232
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2006/001290 Ceased WO2006076602A1 (en) | 2005-01-13 | 2006-01-13 | A nonlinear, in-the-loop, denoising filter for quantization noise removal for hybrid video compression |
Country Status (3)
| Country | Link |
|---|---|
| US (2) | US8218634B2 (enExample) |
| JP (1) | JP4920599B2 (enExample) |
| WO (1) | WO2006076602A1 (enExample) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2010507335A (ja) * | 2006-10-18 | 2010-03-04 | トムソン ライセンシング | 予測データ精緻化を使用してビデオを符号化する方法及び装置 |
| US8804831B2 (en) | 2008-04-10 | 2014-08-12 | Qualcomm Incorporated | Offsets at sub-pixel resolution |
| US8831086B2 (en) | 2008-04-10 | 2014-09-09 | Qualcomm Incorporated | Prediction techniques for interpolation in video coding |
| TWI463878B (zh) * | 2009-02-19 | 2014-12-01 | Sony Corp | Image processing apparatus and method |
| US9967590B2 (en) | 2008-04-10 | 2018-05-08 | Qualcomm Incorporated | Rate-distortion defined interpolation for video coding based on fixed filter or adaptive filter |
| US10681380B2 (en) | 2015-06-12 | 2020-06-09 | Panasonic Intellectual Property Management Co., Ltd. | Image encoding method, image decoding method, image encoding apparatus, and image decoding apparatus |
Families Citing this family (36)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7123172B1 (en) * | 2005-07-29 | 2006-10-17 | Hewlett-Packard Development Company, L.P. | Method and system for determining an optimal or near optimal set of contexts by constructing a multi-directional context tree |
| JP5160451B2 (ja) * | 2006-01-31 | 2013-03-13 | トムソン ライセンシング | エッジ・ベースの空間‐時間的フィルタリングの方法および装置 |
| WO2007116551A1 (ja) * | 2006-03-30 | 2007-10-18 | Kabushiki Kaisha Toshiba | 画像符号化装置及び画像符号化方法並びに画像復号化装置及び画像復号化方法 |
| US9277243B2 (en) | 2006-11-08 | 2016-03-01 | Thomson Licensing | Methods and apparatus for in-loop de-artifact filtering |
| US8879001B2 (en) * | 2006-12-28 | 2014-11-04 | Thomson Licensing | Detecting block artifacts in coded images and video |
| US20080316364A1 (en) * | 2007-06-25 | 2008-12-25 | The Hong Kong University Of Science And Technology | Rate distortion optimization for video denoising |
| US8437564B2 (en) * | 2007-08-07 | 2013-05-07 | Ntt Docomo, Inc. | Image and video compression using sparse orthonormal transforms |
| US8249144B2 (en) * | 2008-07-08 | 2012-08-21 | Imagine Communications Ltd. | Distributed transcoding |
| US8326075B2 (en) | 2008-09-11 | 2012-12-04 | Google Inc. | System and method for video encoding using adaptive loop filter |
| JP5697301B2 (ja) | 2008-10-01 | 2015-04-08 | 株式会社Nttドコモ | 動画像符号化装置、動画像復号装置、動画像符号化方法、動画像復号方法、動画像符号化プログラム、動画像復号プログラム、及び動画像符号化・復号システム |
| JP5542680B2 (ja) * | 2008-10-07 | 2014-07-09 | 株式会社Nttドコモ | 画像加工装置、方法及びプログラム、動画像符号化装置、方法及びプログラム、動画像復号装置、方法及びプログラム、並びに、符号化・復号システム及び方法 |
| EP2351374A1 (en) * | 2008-11-25 | 2011-08-03 | Thomson Licensing | Methods and apparatus for sparsity-based de-artifact filtering for video encoding and decoding |
| KR102003137B1 (ko) * | 2009-03-12 | 2019-07-23 | 인터디지탈 매디슨 페이튼트 홀딩스 | 디-아티팩트 필터링을 위한 영역-기반 필터 파라미터 선택 방법들 및 장치들 |
| US20100278231A1 (en) * | 2009-05-04 | 2010-11-04 | Imagine Communications Ltd. | Post-decoder filtering |
| WO2011086836A1 (ja) * | 2010-01-12 | 2011-07-21 | シャープ株式会社 | 符号化装置、復号装置、および、データ構造 |
| US20110249742A1 (en) * | 2010-04-07 | 2011-10-13 | Apple Inc. | Coupled video pre-processor and codec including reference picture filter that minimizes coding expense during pre-processing mode transitions |
| JP5393573B2 (ja) | 2010-04-08 | 2014-01-22 | 株式会社Nttドコモ | 動画像予測符号化装置、動画像予測復号装置、動画像予測符号化方法、動画像予測復号方法、動画像予測符号化プログラム、及び動画像予測復号プログラム |
| US9721333B2 (en) | 2010-09-15 | 2017-08-01 | Sharp Laboratories Of America, Inc. | Methods and systems for estimation of additive noise |
| US9361707B2 (en) | 2010-09-15 | 2016-06-07 | Sharp Laboratories Of America, Inc. | Methods and systems for detection and estimation of compression noise |
| US20120170653A1 (en) * | 2010-12-30 | 2012-07-05 | General Instrument Corporation | Block based sampling coding systems |
| JP5594841B2 (ja) * | 2011-01-06 | 2014-09-24 | Kddi株式会社 | 画像符号化装置及び画像復号装置 |
| US8781004B1 (en) | 2011-04-07 | 2014-07-15 | Google Inc. | System and method for encoding video using variable loop filter |
| US8780971B1 (en) | 2011-04-07 | 2014-07-15 | Google, Inc. | System and method of encoding using selectable loop filters |
| US8780996B2 (en) | 2011-04-07 | 2014-07-15 | Google, Inc. | System and method for encoding and decoding video data |
| US8885706B2 (en) | 2011-09-16 | 2014-11-11 | Google Inc. | Apparatus and methodology for a video codec system with noise reduction capability |
| JP6120490B2 (ja) | 2011-11-07 | 2017-04-26 | キヤノン株式会社 | 画像符号化装置、画像符号化方法及びプログラム、画像復号装置、画像復号方法及びプログラム |
| US9131073B1 (en) | 2012-03-02 | 2015-09-08 | Google Inc. | Motion estimation aided noise reduction |
| US9344729B1 (en) | 2012-07-11 | 2016-05-17 | Google Inc. | Selective prediction signal filtering |
| US9286653B2 (en) | 2014-08-06 | 2016-03-15 | Google Inc. | System and method for increasing the bit depth of images |
| US10102613B2 (en) * | 2014-09-25 | 2018-10-16 | Google Llc | Frequency-domain denoising |
| TWI511530B (zh) * | 2014-12-09 | 2015-12-01 | Univ Nat Kaohsiung 1St Univ Sc | Distributed video coding system and decoder for distributed video coding system |
| CN106470292B (zh) | 2015-08-20 | 2019-08-27 | 联咏科技股份有限公司 | 图像处理装置及图像处理方法 |
| JP6643884B2 (ja) * | 2015-12-04 | 2020-02-12 | 日本放送協会 | 映像符号化装置およびプログラム |
| US10380973B2 (en) * | 2017-02-01 | 2019-08-13 | Disney Enterprises, Inc. | Luminance comfort prediction and adjustment |
| CN109859124B (zh) * | 2019-01-11 | 2020-12-18 | 深圳奥比中光科技有限公司 | 一种深度图像降噪方法及装置 |
| CN114391254B (zh) * | 2019-06-28 | 2024-12-10 | 弗劳恩霍夫应用研究促进协会 | 视频解码器、视频编码器、用于编码和解码视频信号的方法以及调整一个或多个去噪操作的计算机程序 |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH03136586A (ja) * | 1989-10-23 | 1991-06-11 | Nec Corp | 圧縮符号化装置 |
| JPH05130592A (ja) * | 1991-10-31 | 1993-05-25 | Toshiba Corp | 動画像符号化装置 |
| US5565925A (en) * | 1992-10-26 | 1996-10-15 | Nec Corporation | Image sub-sampling apparatus |
| EP1282075A2 (en) * | 2001-07-31 | 2003-02-05 | Ricoh Company, Ltd. | Enhancement of compressed images |
| US20040022315A1 (en) * | 2002-05-03 | 2004-02-05 | Samsung Electronics Co., Ltd. | Filtering method and apparatus for removing blocking artifacts and/or ringing noise |
| WO2004030369A1 (en) * | 2002-09-27 | 2004-04-08 | Videosoft, Inc. | Real-time video coding/decoding |
Family Cites Families (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4189748A (en) * | 1977-08-23 | 1980-02-19 | Northrop Corporation | Video bandwidth reduction system using a two-dimensional transformation, and an adaptive filter with error correction |
| JPS6031325A (ja) * | 1983-07-29 | 1985-02-18 | Nec Corp | 予測停止adpcm符号化方式およびその回路 |
| DE4017375A1 (de) * | 1990-05-30 | 1991-12-05 | Philips Patentverwaltung | Adaptives filter zur reduktion von codierartefakten |
| JP3165296B2 (ja) * | 1992-12-25 | 2001-05-14 | 三菱電機株式会社 | フレーム間符号化処理方式及びフレーム間符号化処理方法及び符号化制御方式 |
| JPH0715631A (ja) * | 1993-06-29 | 1995-01-17 | Nippon Telegr & Teleph Corp <Ntt> | 画像信号雑音除去方法および装置 |
| JP3495766B2 (ja) * | 1993-10-01 | 2004-02-09 | テキサス インスツルメンツ インコーポレイテツド | 画像処理方法 |
| US6285710B1 (en) * | 1993-10-13 | 2001-09-04 | Thomson Licensing S.A. | Noise estimation and reduction apparatus for video signal processing |
| US5587708A (en) * | 1994-01-19 | 1996-12-24 | Industrial Technology Research Institute | Division technique unified quantizer-dequantizer |
| JP3674158B2 (ja) * | 1996-07-01 | 2005-07-20 | ソニー株式会社 | 画像符号化方法及び画像復号装置 |
| KR100213089B1 (ko) | 1997-01-29 | 1999-08-02 | 윤종용 | 루프필터링 방법 및 루프필터 |
| JPH10224790A (ja) * | 1997-02-07 | 1998-08-21 | Matsushita Electric Ind Co Ltd | 圧縮伸張された画像中のブロック状ノイズを除去するフィルタおよびフィルタ方法 |
| JPH11187400A (ja) * | 1997-12-22 | 1999-07-09 | Matsushita Electric Ind Co Ltd | 歪除去方法および歪除去フィルタ |
| US6188799B1 (en) | 1997-02-07 | 2001-02-13 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for removing noise in still and moving pictures |
| FI106071B (fi) | 1997-03-13 | 2000-11-15 | Nokia Mobile Phones Ltd | Mukautuva suodatin |
| KR100243225B1 (ko) * | 1997-07-16 | 2000-02-01 | 윤종용 | 블록화효과 및 링잉잡음 감소를 위한 신호적응필터링방법 및신호적응필터 |
| US6081552A (en) * | 1998-01-13 | 2000-06-27 | Intel Corporation | Video coding using a maximum a posteriori loop filter |
| KR100308016B1 (ko) * | 1998-08-31 | 2001-10-19 | 구자홍 | 압축 부호화된 영상에 나타나는 블럭현상 및 링현상 제거방법및 영상 복호화기 |
| GB9822092D0 (en) * | 1998-10-09 | 1998-12-02 | Snell & Wilcox Ltd | Analysis of compression decoded sequences |
| JP2000244921A (ja) * | 1999-02-24 | 2000-09-08 | Matsushita Electric Ind Co Ltd | 映像符号化方法および装置 |
| US7170941B2 (en) * | 1999-08-13 | 2007-01-30 | Patapsco Designs Inc. | Temporal compression |
| GB2362073B (en) * | 2000-05-03 | 2003-12-17 | Siemens Ag | Equaliser and method of state reduction therefor |
| CN1167034C (zh) * | 2001-02-27 | 2004-09-15 | 华为技术有限公司 | 图像预去噪的方法 |
| JP3964391B2 (ja) * | 2001-06-29 | 2007-08-22 | 株式会社エヌ・ティ・ティ・ドコモ | 画像符号化装置、画像復号装置、画像符号化方法及び画像復号方法 |
| DE60230666D1 (de) * | 2001-11-29 | 2009-02-12 | Panasonic Corp | Verfahren zur beseitigung von kodierungsverzerrung und verfahren zur videokodierung und -dekodierung |
| EP2894856A1 (en) * | 2002-01-31 | 2015-07-15 | Samsung Electronics Co., Ltd | Filtering method and apparatus for reducing block artifacts or ringing noise |
| US7430334B2 (en) * | 2003-07-31 | 2008-09-30 | Hewlett Packard Development Company, L.P. | Digital imaging systems, articles of manufacture, and digital image processing methods |
| MXPA06002212A (es) * | 2003-08-26 | 2006-05-19 | Thomson Licensing | Metodo y aparato para codificar bloques intra-inter-codificados hibridos. |
| US20050105817A1 (en) | 2003-11-17 | 2005-05-19 | Guleryuz Onur G. | Inter and intra band prediction of singularity coefficients using estimates based on nonlinear approximants |
| DE102004007185B3 (de) * | 2004-02-13 | 2005-06-30 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Prädiktives Codierungsschema |
-
2006
- 2006-01-12 US US11/331,814 patent/US8218634B2/en not_active Expired - Fee Related
- 2006-01-13 WO PCT/US2006/001290 patent/WO2006076602A1/en not_active Ceased
- 2006-01-13 JP JP2007551412A patent/JP4920599B2/ja not_active Expired - Fee Related
-
2012
- 2012-04-11 US US13/444,686 patent/US20120195374A1/en not_active Abandoned
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH03136586A (ja) * | 1989-10-23 | 1991-06-11 | Nec Corp | 圧縮符号化装置 |
| JPH05130592A (ja) * | 1991-10-31 | 1993-05-25 | Toshiba Corp | 動画像符号化装置 |
| US5565925A (en) * | 1992-10-26 | 1996-10-15 | Nec Corporation | Image sub-sampling apparatus |
| EP1282075A2 (en) * | 2001-07-31 | 2003-02-05 | Ricoh Company, Ltd. | Enhancement of compressed images |
| US20040022315A1 (en) * | 2002-05-03 | 2004-02-05 | Samsung Electronics Co., Ltd. | Filtering method and apparatus for removing blocking artifacts and/or ringing noise |
| WO2004030369A1 (en) * | 2002-09-27 | 2004-04-08 | Videosoft, Inc. | Real-time video coding/decoding |
Non-Patent Citations (2)
| Title |
|---|
| PATENT ABSTRACTS OF JAPAN vol. 015, no. 355 (E - 1109) 9 September 1991 (1991-09-09) * |
| PATENT ABSTRACTS OF JAPAN vol. 017, no. 511 (E - 1432) 14 September 1993 (1993-09-14) * |
Cited By (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2010507335A (ja) * | 2006-10-18 | 2010-03-04 | トムソン ライセンシング | 予測データ精緻化を使用してビデオを符号化する方法及び装置 |
| US8542736B2 (en) | 2006-10-18 | 2013-09-24 | Thomson Licensing | Method and apparatus for video coding using prediction data refinement |
| JP2013258771A (ja) * | 2006-10-18 | 2013-12-26 | Thomson Licensing | 予測データ精緻化を使用してビデオを符号化する方法及び装置 |
| US8804831B2 (en) | 2008-04-10 | 2014-08-12 | Qualcomm Incorporated | Offsets at sub-pixel resolution |
| US8831086B2 (en) | 2008-04-10 | 2014-09-09 | Qualcomm Incorporated | Prediction techniques for interpolation in video coding |
| US9967590B2 (en) | 2008-04-10 | 2018-05-08 | Qualcomm Incorporated | Rate-distortion defined interpolation for video coding based on fixed filter or adaptive filter |
| US10440388B2 (en) | 2008-04-10 | 2019-10-08 | Qualcomm Incorporated | Rate-distortion defined interpolation for video coding based on fixed filter or adaptive filter |
| US11683519B2 (en) | 2008-04-10 | 2023-06-20 | Qualcomm Incorporated | Rate-distortion defined interpolation for video coding based on fixed filter or adaptive filter |
| TWI463878B (zh) * | 2009-02-19 | 2014-12-01 | Sony Corp | Image processing apparatus and method |
| US10681380B2 (en) | 2015-06-12 | 2020-06-09 | Panasonic Intellectual Property Management Co., Ltd. | Image encoding method, image decoding method, image encoding apparatus, and image decoding apparatus |
Also Published As
| Publication number | Publication date |
|---|---|
| US20060153301A1 (en) | 2006-07-13 |
| JP4920599B2 (ja) | 2012-04-18 |
| US8218634B2 (en) | 2012-07-10 |
| US20120195374A1 (en) | 2012-08-02 |
| JP2008527932A (ja) | 2008-07-24 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8218634B2 (en) | Nonlinear, in-the-loop, denoising filter for quantization noise removal for hybrid video compression | |
| US8059902B2 (en) | Spatial sparsity induced temporal prediction for video compression | |
| Triantafyllidis et al. | Blocking artifact detection and reduction in compressed data | |
| US6950473B2 (en) | Hybrid technique for reducing blocking and ringing artifacts in low-bit-rate coding | |
| US8625682B2 (en) | Nonlinear, prediction filter for hybrid video compression | |
| US6115503A (en) | Method and apparatus for reducing coding artifacts of block-based image encoding and object-based image encoding | |
| US6983079B2 (en) | Reducing blocking and ringing artifacts in low-bit-rate coding | |
| US8743963B2 (en) | Image/video quality enhancement and super-resolution using sparse transformations | |
| EP2300982B1 (en) | Image/video quality enhancement and super-resolution using sparse transformations | |
| CN113873237B (zh) | 用于恢复由重构产生的劣化帧的劣化图块的方法和装置 | |
| Gandam et al. | An efficient post-processing adaptive filtering technique to rectifying the flickering effects | |
| US8000539B2 (en) | Geometrical image representation and compression | |
| US20060288065A1 (en) | Method and apparatus for lapped transform coding and decoding | |
| US6611361B1 (en) | Method for restoring compressed image of image processing system and apparatus therefor | |
| Cheung et al. | Improving MPEG-4 coding performance by jointly optimising compression and blocking effect elimination | |
| Makol et al. | Review of Deblocking Of Compressed Images Using Post Processing Spatial Techniques | |
| Shen | Compression artifact removal and inverse halftoning using robust nonlinear filtering |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 2007551412 Country of ref document: JP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 06718375 Country of ref document: EP Kind code of ref document: A1 |