WO2016070363A1 - Merge with inter prediction offset - Google Patents

Merge with inter prediction offset Download PDF

Info

Publication number
WO2016070363A1
WO2016070363A1 PCT/CN2014/090357 CN2014090357W WO2016070363A1 WO 2016070363 A1 WO2016070363 A1 WO 2016070363A1 CN 2014090357 W CN2014090357 W CN 2014090357W WO 2016070363 A1 WO2016070363 A1 WO 2016070363A1
Authority
WO
WIPO (PCT)
Prior art keywords
offset
inter prediction
resi
merge
neighboring
Prior art date
Application number
PCT/CN2014/090357
Other languages
French (fr)
Inventor
Han HUANG
Original Assignee
Mediatek Singapore Pte. Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mediatek Singapore Pte. Ltd. filed Critical Mediatek Singapore Pte. Ltd.
Priority to PCT/CN2014/090357 priority Critical patent/WO2016070363A1/en
Priority to PCT/CN2015/093763 priority patent/WO2016070808A1/en
Priority to EP15857286.7A priority patent/EP3202151B1/en
Priority to CN201580060217.4A priority patent/CN107079165B/en
Priority to US15/521,499 priority patent/US10785475B2/en
Publication of WO2016070363A1 publication Critical patent/WO2016070363A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/18Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H04N19/196Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • H04N19/463Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding
    • H04N19/52Processing of motion vectors by encoding by predictive encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/109Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/154Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96Tree coding, e.g. quad-tree coding

Definitions

  • the invention relates generally to image and video processing.
  • the presented invention relates to image and video coding.
  • Merge mode in HEVC is a powerful mode to improve coding efficiency.
  • motion information is signaled only by an index.
  • a merge candidate list is constructed, and the candidate referred by the decoded index is used as the motion information for current block.
  • the motion information includes: motion vector (s) and the reference index of the reference picture (s) .
  • Fig. 1 is a diagram illustrating the positions of spatial merge candidates in HEVC.
  • Fig. 1 is a diagram illustrating the positions of spatial merge candidates in HEVC.
  • Offset X be the derived inter prediction offset from the neighboring block.
  • the Offset X is derived by analyzing the prediction signal, reconstructed residual signal and the inter prediction offset of the neighboring block.
  • Offset X mean (Resi' Y ) is derived as the mean value of Resi' Y , where Resi' Y is the reconstructed residual signal of the neighboring coded block.
  • Offset X mean (Resi' Y ) +Offset Y , where Offset Y is the inter prediction offset for the neighboring coded block.
  • the inter prediction offset is only applied in luma component if the video is YUV or YCbCr format.
  • the neighboring coded block is the blocks used in merge list construction process.
  • the neighboring coded block used for inter prediction offset derivation can be a prediction unit (PU) or transform unit (TU) .
  • Offset X is set to zero if Offset X is larger than a threshold.
  • Offset X is set to zero if neighboring block is smaller than current block.
  • Offset X is set to zero if the variance of neighboring block is larger than a threshold.
  • an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein.
  • An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein.
  • DSP Digital Signal Processor
  • the invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA) .
  • processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention.
  • the software code or firmware codes may be developed in different programming languages and different format or style.
  • the software code may also be compiled for different target platform.
  • different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computing Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Merge with inter prediction offset is proposed. Besides the motion information, an inter prediction offset is also derived from the neighboring coded block.

Description

MERGE WITH INTER PREDICTION OFFSET TECHNICAL FIELD
The invention relates generally to image and video processing. In particular, the presented invention relates to image and video coding.
BACKGROUND
Merge mode in HEVC is a powerful mode to improve coding efficiency. In the merge mode, motion information is signaled only by an index. At the decoder side, a merge candidate list is constructed, and the candidate referred by the decoded index is used as the motion information for current block. The motion information includes: motion vector (s) and the reference index of the reference picture (s) .
SUMMARY
Methods of merge with inter prediction offset are proposed. Besides motion information, an inter prediction offset is also derived in merge candidate.
Other aspects and features of the invention will become apparent to those with ordinary skill in the art upon review of the following descriptions of specific embodiments.
BRIEF DESCRIPTION OF DRAWINGS
The invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:
Fig. 1 is a diagram illustrating the positions of spatial merge candidates in HEVC.
DETAILED DESCRIPTION
The following description is of the best-contemplated mode of carrying out the invention. This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.
In the proposed method, an inter prediction offset together with motion information are derived from a neighboring coded block. If this neighboring block is chosen as the merge candidate, its motion information is used for motion compensation in current block and the offset is also used for prediction. Fig. 1 is a diagram illustrating the positions of spatial merge candidates in HEVC.
Let OrigX and PredX be the original and motion compensation prediction signal of current block, OffsetX be the derived inter prediction offset from the neighboring block. The final residual signal of current block is ResiX=OrigX-PredX-OffsetX. At the decoder side, the reconstructed signal is RecoX=Resi'X+PredX+OffsetX, where Resi'X is the reconstructed residual signal. The OffsetX is derived by analyzing the prediction signal, reconstructed residual signal and the inter prediction offset of the neighboring block.
In one embodiment, OffsetX=mean (Resi'Y) is derived as the mean value of Resi'Y, where Resi'Y is the reconstructed residual signal of the neighboring coded block.
In another embodiment, if the neighboring coded block itself is merge mode, then OffsetX=mean (Resi'Y) +OffsetY, where OffsetY is the inter prediction offset for the neighboring coded block.
In still another embodiment, the inter prediction offset is only applied in luma component if the video is YUV or YCbCr format.
In still another embodiment, the neighboring coded block is the blocks used in merge list construction process.
In still another embodiment, the neighboring coded block used for inter prediction offset derivation can be a prediction unit (PU) or transform unit (TU) .
In still another embodiment, OffsetX is set to zero if OffsetXis larger than  a threshold.
In still another embodiment, OffsetX is set to zero if neighboring block is smaller than current block.
In still another embodiment, OffsetX is set to zero if the variance of neighboring block is larger than a threshold.
The methods described above can be used in a video encoder as well as in a video decoder. Embodiments of merge with inter prediction offset according to the present invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA) . These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware codes may be developed in different programming languages and different format or style. The software code may also be compiled for different target platform. However, different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. To the contrary, it is intended to cover various modifications and similar arrangements (as would be apparent to those skilled in the art) . Therefore, the scope of the appended claims should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements.

Claims (15)

  1. A method of merge with inter prediction offset:
    a residual signal for a current block is calculated as ResiX=OrigX-PredX-OffsetX, where OrigXis an original signal, PredXis a prediction signal, OffsetXis an inter prediction offset; at a decoder side, a reconstructed signal is RecoX=Resi′x+PredX+OffsetX, where Resi′x is a reconstructed residual signal;
    wherein motion information used for obtaining PredX and inter prediction offset OffsetX are both derived from a neighboring coded block.
  2. The method as claimed in claim 1, wherein said merge candidate is used to replace an original spatial merge candidate.
  3. The method as claimed in claim 1 and claim 2, wherein an inter prediction offset OffsetX is also derived when construction spatial merge candidate.
  4. The method as claimed in claim 1 and claim 2, wherein the residual signal for current block is calculated as ResiX=OrigX-PredX-OffsetX when the current block is merge mode.
  5. The method as claimed in claim 1, wherein OffsetX=mean (Resi′Y) is derived as the mean value of Resi′Y, where Resi′Y is the reconstructed residual signal of the neighboring coded block.
  6. The method as claimed in claim 1, wherein OffsetX=mean (Resi′Y) +OffsetY if the neighboring coded block itself is merge mode, OffsetY is the inter prediction offset for the neighboring coded block.
  7. The method as claimed in claim 1, wherein the inter prediction offset is only applied in luma component if the video is YUV or YCbCr format.
  8. The method as claimed in claim 1, wherein the neighboring coded block is the blocks used in merge list construction process.
  9. The method as claimed in claim 1, wherein the neighboring coded block used for inter prediction offset derivation can be a prediction unit (PU) or transform unit (TU) .
  10. The method as claimed in claim 1, wherein OffsetX is set to zero if  OffsetX is larger than a threshold.
  11. The method as claimed in claim 1, wherein OffsetX is set to zero if neighboring block is smaller than current block.
  12. The method as claimed in claim 1, wherein OffsetX is set to zero if the variance of neighboring block is larger than a threshold.
  13. The method as claimed in claim 1, wherein one or more syntax elements can be used to signal whether merge with inter prediction offset is used. The syntax element can be coded.
  14. The method as claimed in claim 13, wherein the syntax elements can be explicitly transmitted in the sequence level, view level, picture level, slice level, or other levels. For example, it can be coded in VPS, SPS, PPS, APS, slice header, LCU et al.
  15. The method as claimed in claim 14, wherein the information about whether the merge with inter prediction offset is used can also be derived implicitly on decoder side according to statistics of mode selections.
PCT/CN2014/090357 2014-11-05 2014-11-05 Merge with inter prediction offset WO2016070363A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
PCT/CN2014/090357 WO2016070363A1 (en) 2014-11-05 2014-11-05 Merge with inter prediction offset
PCT/CN2015/093763 WO2016070808A1 (en) 2014-11-05 2015-11-04 Method and apparatus of video coding with prediction offset
EP15857286.7A EP3202151B1 (en) 2014-11-05 2015-11-04 Method and apparatus of video coding with prediction offset
CN201580060217.4A CN107079165B (en) 2014-11-05 2015-11-04 Video coding method and apparatus using prediction residual
US15/521,499 US10785475B2 (en) 2014-11-05 2015-11-04 Method and apparatus of video coding with prediction offset

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2014/090357 WO2016070363A1 (en) 2014-11-05 2014-11-05 Merge with inter prediction offset

Publications (1)

Publication Number Publication Date
WO2016070363A1 true WO2016070363A1 (en) 2016-05-12

Family

ID=55908381

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/090357 WO2016070363A1 (en) 2014-11-05 2014-11-05 Merge with inter prediction offset

Country Status (1)

Country Link
WO (1) WO2016070363A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070177674A1 (en) * 2006-01-12 2007-08-02 Lg Electronics Inc. Processing multiview video
US20130016772A1 (en) * 2011-07-14 2013-01-17 Toru Matsunobu Image coding method and image decoding method
US20130114717A1 (en) * 2011-11-07 2013-05-09 Qualcomm Incorporated Generating additional merge candidates
WO2014166338A1 (en) * 2013-04-11 2014-10-16 Mediatek Inc. Method and apparatus for prediction value derivation in intra coding

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070177674A1 (en) * 2006-01-12 2007-08-02 Lg Electronics Inc. Processing multiview video
US20130016772A1 (en) * 2011-07-14 2013-01-17 Toru Matsunobu Image coding method and image decoding method
US20130114717A1 (en) * 2011-11-07 2013-05-09 Qualcomm Incorporated Generating additional merge candidates
WO2014166338A1 (en) * 2013-04-11 2014-10-16 Mediatek Inc. Method and apparatus for prediction value derivation in intra coding

Similar Documents

Publication Publication Date Title
AU2011354441B2 (en) Method and apparatus of improved intra luma prediction mode coding
WO2016165069A1 (en) Advanced temporal motion vector prediction in video coding
WO2017143467A1 (en) Localized luma mode prediction inheritance for chroma coding
WO2016074147A1 (en) Separated coding tree for luma and chroma
US20170078695A1 (en) Method and Apparatus for Intra Mode Coding
WO2017139937A1 (en) Advanced linear model prediction for chroma coding
WO2015180014A1 (en) An improved merge candidate list construction method for intra block copy
CN107079165B (en) Video coding method and apparatus using prediction residual
US20170171558A1 (en) Method of Temporal Derived Bi-Directional Motion Vector for Motion Vector Prediction
WO2015085575A1 (en) Methods for background residual prediction
WO2015100522A1 (en) Methods for inter-component residual prediction
WO2016115708A1 (en) Methods for chroma component coding with separate intra prediction mode
WO2015180166A1 (en) Improved intra prediction mode coding
WO2015192372A1 (en) A simplified method for illumination compensation in multi-view and 3d video coding
WO2013102299A1 (en) Residue quad tree depth for chroma components
WO2016115736A1 (en) Additional intra prediction modes using cross-chroma-component prediction
WO2016205999A1 (en) Adaptive coding group for image/video coding
WO2016049894A1 (en) Scaling in color transform
WO2016070363A1 (en) Merge with inter prediction offset
WO2015131404A1 (en) Methods for depth map coding
WO2017035833A1 (en) Neighboring-derived prediction offset (npo)
WO2016123749A1 (en) Deblocking filtering with adaptive motion vector resolution
WO2016165122A1 (en) Inter prediction offset
WO2015100732A1 (en) A padding method for intra block copying
WO2016183814A1 (en) Coded block flag coding using cross-component correlation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14905550

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14905550

Country of ref document: EP

Kind code of ref document: A1