WO2016070363A1

WO2016070363A1 - Merge with inter prediction offset

Info

Publication number: WO2016070363A1
Application number: PCT/CN2014/090357
Authority: WO
Inventors: Han HUANG
Original assignee: Mediatek Singapore Pte. Ltd.
Priority date: 2014-11-05
Filing date: 2014-11-05
Publication date: 2016-05-12

Abstract

Merge with inter prediction offset is proposed. Besides the motion information, an inter prediction offset is also derived from the neighboring coded block.

Description

MERGE WITH INTER PREDICTION OFFSET

TECHNICAL FIELD

The invention relates generally to image and video processing. In particular, the presented invention relates to image and video coding.

BACKGROUND

Merge mode in HEVC is a powerful mode to improve coding efficiency. In the merge mode, motion information is signaled only by an index. At the decoder side, a merge candidate list is constructed, and the candidate referred by the decoded index is used as the motion information for current block. The motion information includes: motion vector (s) and the reference index of the reference picture (s) .

SUMMARY

Methods of merge with inter prediction offset are proposed. Besides motion information, an inter prediction offset is also derived in merge candidate.

Other aspects and features of the invention will become apparent to those with ordinary skill in the art upon review of the following descriptions of specific embodiments.

BRIEF DESCRIPTION OF DRAWINGS

The invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:

Fig. 1 is a diagram illustrating the positions of spatial merge candidates in HEVC.

DETAILED DESCRIPTION

The following description is of the best-contemplated mode of carrying out the invention. This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.

In the proposed method, an inter prediction offset together with motion information are derived from a neighboring coded block. If this neighboring block is chosen as the merge candidate, its motion information is used for motion compensation in current block and the offset is also used for prediction. Fig. 1 is a diagram illustrating the positions of spatial merge candidates in HEVC.

Let Orig_X and Pred_X be the original and motion compensation prediction signal of current block, Offset_X be the derived inter prediction offset from the neighboring block. The final residual signal of current block is Resi_X＝Orig_X-Pred_X-Offset_X. At the decoder side, the reconstructed signal is Reco_X＝Resi'_X+Pred_X+Offset_X, where Resi'_X is the reconstructed residual signal. The Offset_X is derived by analyzing the prediction signal, reconstructed residual signal and the inter prediction offset of the neighboring block.

In one embodiment, Offset_X＝mean (Resi'_Y) is derived as the mean value of Resi'_Y, where Resi'_Y is the reconstructed residual signal of the neighboring coded block.

In another embodiment, if the neighboring coded block itself is merge mode, then Offset_X＝mean (Resi'_Y) +Offset_Y, where Offset_Y is the inter prediction offset for the neighboring coded block.

In still another embodiment, the inter prediction offset is only applied in luma component if the video is YUV or YCbCr format.

In still another embodiment, the neighboring coded block is the blocks used in merge list construction process.

In still another embodiment, the neighboring coded block used for inter prediction offset derivation can be a prediction unit (PU) or transform unit (TU) .

In still another embodiment, Offset_X is set to zero if Offset_Xis larger than a threshold.

In still another embodiment, Offset_X is set to zero if neighboring block is smaller than current block.

In still another embodiment, Offset_X is set to zero if the variance of neighboring block is larger than a threshold.

The methods described above can be used in a video encoder as well as in a video decoder. Embodiments of merge with inter prediction offset according to the present invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA) . These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware codes may be developed in different programming languages and different format or style. The software code may also be compiled for different target platform. However, different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.

The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. To the contrary, it is intended to cover various modifications and similar arrangements (as would be apparent to those skilled in the art) . Therefore, the scope of the appended claims should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements.

Claims

A method of merge with inter prediction offset：

a residual signal for a current block is calculated as Resi_X＝Orig_X-Pred_X-Offset_X， where Orig_Xis an original signal， Pred_Xis a prediction signal， Offset_Xis an inter prediction offset； at a decoder side， a reconstructed signal is Reco_X＝Resi′_x+Pred_X+Offset_X， where Resi′_x is a reconstructed residual signal；

wherein motion information used for obtaining Pred_X and inter prediction offset Offset_X are both derived from a neighboring coded block.
The method as claimed in claim 1， wherein said merge candidate is used to replace an original spatial merge candidate.
The method as claimed in claim 1 and claim 2， wherein an inter prediction offset Offset_X is also derived when construction spatial merge candidate.
The method as claimed in claim 1 and claim 2， wherein the residual signal for current block is calculated as Resi_X＝Orig_X-Pred_X-Offset_X when the current block is merge mode.
The method as claimed in claim 1， wherein Offset_X＝mean (Resi′_Y) is derived as the mean value of Resi′_Y， where Resi′_Y is the reconstructed residual signal of the neighboring coded block.
The method as claimed in claim 1， wherein Offset_X＝mean (Resi′_Y) +Offset_Y if the neighboring coded block itself is merge mode， Offset_Y is the inter prediction offset for the neighboring coded block.
The method as claimed in claim 1， wherein the inter prediction offset is only applied in luma component if the video is YUV or YCbCr format.
The method as claimed in claim 1， wherein the neighboring coded block is the blocks used in merge list construction process.
The method as claimed in claim 1， wherein the neighboring coded block used for inter prediction offset derivation can be a prediction unit (PU) or transform unit (TU) .
The method as claimed in claim 1， wherein Offset_X is set to zero if Offset_X is larger than a threshold.
The method as claimed in claim 1， wherein Offset_X is set to zero if neighboring block is smaller than current block.
The method as claimed in claim 1， wherein Offset_X is set to zero if the variance of neighboring block is larger than a threshold.
The method as claimed in claim 1， wherein one or more syntax elements can be used to signal whether merge with inter prediction offset is used. The syntax element can be coded.
The method as claimed in claim 13， wherein the syntax elements can be explicitly transmitted in the sequence level， view level， picture level， slice level， or other levels. For example， it can be coded in VPS， SPS， PPS， APS， slice header， LCU et al.
The method as claimed in claim 14， wherein the information about whether the merge with inter prediction offset is used can also be derived implicitly on decoder side according to statistics of mode selections.