WO2011053054A9 - Procédé et appareil de codage/décodage de vecteur de mouvement au moyen d'une limitation de résolution de vecteur de mouvement, et procédé et appareil de codage/décodage d'image dans lesquels ils interviennent - Google Patents

Procédé et appareil de codage/décodage de vecteur de mouvement au moyen d'une limitation de résolution de vecteur de mouvement, et procédé et appareil de codage/décodage d'image dans lesquels ils interviennent Download PDF

Info

Publication number
WO2011053054A9
WO2011053054A9 PCT/KR2010/007548 KR2010007548W WO2011053054A9 WO 2011053054 A9 WO2011053054 A9 WO 2011053054A9 KR 2010007548 W KR2010007548 W KR 2010007548W WO 2011053054 A9 WO2011053054 A9 WO 2011053054A9
Authority
WO
WIPO (PCT)
Prior art keywords
motion vector
resolution
image
encoding
decoding
Prior art date
Application number
PCT/KR2010/007548
Other languages
English (en)
Korean (ko)
Other versions
WO2011053054A3 (fr
WO2011053054A2 (fr
Inventor
김수년
임정연
이규민
최재훈
한종기
이영렬
문주희
김해광
전병우
서찬원
Original Assignee
에스케이텔레콤 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 에스케이텔레콤 주식회사 filed Critical 에스케이텔레콤 주식회사
Publication of WO2011053054A2 publication Critical patent/WO2011053054A2/fr
Publication of WO2011053054A3 publication Critical patent/WO2011053054A3/fr
Publication of WO2011053054A9 publication Critical patent/WO2011053054A9/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding

Definitions

  • the present invention relates to a motion vector encoding / decoding method and apparatus using a motion vector resolution limitation, and an image encoding / decoding method and apparatus using the same. More particularly, the present invention relates to a method and an apparatus for improving compression efficiency by determining and encoding a motion vector by adaptively limiting the resolution of the motion vector in encoding an image.
  • H.264 / AVC is the standard for the most highly compressed video codecs among the standardized video coders and decodes to date.
  • the H.264 / AVC standard includes intra prediction with directionality to improve compression efficiency, integer transform in 4x4 pixel units, and block mode with various sizes from 16x16 pixel size to 4x4 pixel size. Mode and a deblocking filter to predictively encode an image.
  • the H.264 / AVC standard performs motion estimation by interpolating images up to 1/2 pixel unit and 1/4 pixel unit in order to find a more accurate motion vector.
  • an embodiment of the present invention has a main object to improve compression efficiency by determining and encoding a motion vector by adaptively limiting the resolution of the motion vector.
  • an embodiment of the present invention provides an apparatus for encoding / decoding an image, wherein the motion vector of the image is determined and determined according to the limited step-by-step motion vector resolution in which the motion vector resolution is limited step by step.
  • Generate a predictive image of the image by using a motion vector, subtract, transform, and quantize the image and the predictive image, and then encode the resultant to generate image coded data for each step, and calculate encoding cost of the image coded data for each step.
  • Vector resolution Video encoder for encoding the display limitations index showing a step of outputting the display limitations index data; And decoding the motion vector encoded data extracted from the bitstream in accordance with the limitation of the motion vector resolution, which is obtained by decoding the resolution limit index data extracted from the bitstream, and converting and quantized from the bitstream.
  • An image decoder for restoring an image by restoring a residual image, inverse quantization and inverse transform, and generating a prediction image of the image by using the restored motion vector, and adding the residual image and the prediction image to restore the image. It provides a video encoding / decoding apparatus comprising a.
  • the motion vector resolution is determined and determined according to the motion vector resolution of the step-by-step limited motion vector resolution
  • a predictor for generating a predictive image of the image using the motion vector A subtractor configured to generate a residual image by subtracting the image and the predicted image;
  • a converter and quantizer for transforming and quantizing the residual image And encoding the transformed and quantized residual images to generate restriction image encoding data, calculating encoding cost of the restriction phase encoding image data, and selecting a restriction level of the motion vector resolution using the encoding cost of the restriction encoding image encoding data.
  • a video encoding apparatus is provided.
  • the bit according to the limitation step of the motion vector resolution to be recovered by decoding the resolution limit index data extracted from the bitstream A decoder which decodes motion vector encoded data extracted from the stream to reconstruct the motion vector, and decodes the encoded video extracted from the bitstream to reconstruct the transformed and quantized residual image; An inverse quantizer and an inverse converter that inverse quantizes and inverse transforms the transformed and quantized residual image to reconstruct the residual image; A predictor for generating a predicted image of the image by using the reconstructed motion vector; And an adder for reconstructing the image by adding the residual image and the predictive image.
  • an embodiment of the present invention in the encoding device for a motion vector, encoding cost calculator for calculating the encoding cost according to the motion vector resolution of the step-by-step limited motion vector resolution ;
  • a limiting resolution determiner which selects a limiting step of the motion vector resolution using the encoding cost calculated in the limiting step;
  • a motion information encoder for encoding the motion vector coded data in which the motion vector determined according to the limiting step of the selected motion vector resolution is encoded and the resolution limit index indicating the limiting step of the selected motion vector resolution to output resolution limit index data.
  • an embodiment of the present invention in the apparatus for decoding a motion vector, a limitation for restoring the motion vector resolution limitation step by decoding the resolution limitation index data extracted from the bitstream Resolution restorer; And a motion vector reconstructor for reconstructing the motion vector by decoding the motion vector encoded data extracted from the bitstream according to the limiting step of the reconstructed motion vector resolution.
  • an embodiment of the present invention in the method for encoding / decoding an image, determines the motion vector of the image according to the limited step-by-step motion vector resolution in which the motion vector resolution is limited step by step. Generating a predicted image of the image by using the determined motion vector, subtracting, transforming, and quantizing the image and the predicted image, and encoding the same to generate image coded data for each step, and encoding the image coded data for each step. Calculates a cost, selects a restriction step of motion vector resolution using the encoding cost of the restriction-level image coded data, and encodes image coded data encoded using a motion vector determined according to the motion vector resolution according to the selected restriction step.
  • the image encoding step by encoding the limited resolution index indicating a limiting step of the emitter which outputs a resolution of the display limitations index data; And decoding the motion vector encoded data extracted from the bitstream in accordance with the limitation of the motion vector resolution, which is obtained by decoding the resolution limit index data extracted from the bitstream, and converting and quantized from the bitstream.
  • An image decoding step of restoring a residual image, inverse quantization and inverse transformation, and restoring the image, generating a predicted image of the image using the restored motion vector, and adding the residual image and the predicted image to restore the image It provides a video encoding / decoding method comprising a.
  • a method of encoding an image comprising: determining a motion vector of an image according to a step-by-step motion vector resolution in which the motion vector resolution is limited step by step; Generating a predicted image of the image using the determined motion vector; Generating a residual image by subtracting the image and the predictive image; Converting and quantizing the residual image; Encoding the transformed and quantized residual images to generate image coded data for each step; Calculating a coding cost of the image coded data for each step; Selecting a limiting step of the motion vector resolution using the encoding cost of the video encoding data for each step; And generating a bitstream including image coded data encoded using the motion vector determined according to the motion vector resolution according to the selected limiting step and resolution limit index data indicating the limiting step of the selected motion vector resolution.
  • a video encoding method is provided.
  • an embodiment of the present invention to achieve another object of the present invention, in the method for decoding an image, the bit according to the limitation step of the motion vector resolution to be recovered by decoding the resolution limit index data extracted from the bitstream Reconstructing the motion vector by decoding the motion vector coded data extracted from the stream; Reconstructing the transformed and quantized residual image by decoding the image encoded data extracted from the bitstream; Inverse quantization and inverse transformation of the transformed and quantized residual image to reconstruct the residual image; Generating a predicted image of the image by using the reconstructed motion vector; And reconstructing the image by adding the residual image and the prediction image.
  • an embodiment of the present invention in the method for encoding a motion vector of the image, calculating the encoding cost according to the motion vector resolution of the step-by-step limited motion vector resolution ; Selecting a limiting step of the motion vector resolution using the encoding cost calculated in the limiting step; And encoding the motion vector encoded data encoded with the motion vector determined according to the limiting step of the selected motion vector resolution and the resolution limit index indicating the limiting step of the selected motion vector resolution to output resolution limit index data.
  • a motion vector encoding method is provided.
  • an embodiment of the present invention to achieve another object of the present invention, in the method for decoding a motion vector, the step of restoring the limit of the motion vector resolution by decoding the resolution limit index data extracted from the bitstream ; And reconstructing the motion vector by decoding the motion vector encoded data extracted from the bitstream according to the limiting step of the reconstructed motion vector resolution.
  • the compression efficiency can be improved by adaptively limiting the resolution of the motion vector to determine and encode the motion vector.
  • FIG. 1 is a block diagram schematically illustrating a video encoding apparatus according to an embodiment of the present invention
  • FIG. 2 is an exemplary view showing a state of determining a motion vector through motion estimation according to an embodiment of the present invention
  • FIG. 3 is a block diagram schematically illustrating an encoder according to an embodiment of the present invention.
  • FIG. 4 is a block diagram schematically illustrating a motion vector encoding apparatus according to an embodiment of the present invention.
  • FIG. 5 is an exemplary diagram for describing a process of estimating a motion vector having a motion vector resolution of a small pixel unit using template matching according to an embodiment of the present invention
  • FIG. 6 is an exemplary view for explaining a process of selecting a motion vector according to an embodiment of the present invention.
  • FIG. 7 is a flowchart illustrating a motion vector encoding method according to an embodiment of the present invention.
  • FIG. 8 is a flowchart illustrating an example of implementing a motion vector encoding method according to an embodiment of the present invention
  • FIG. 9 is a flowchart illustrating a video encoding method according to an embodiment of the present invention.
  • FIG. 10 is a flowchart for explaining another example of implementing an image encoding method according to an embodiment of the present invention.
  • FIG. 11 is a block diagram schematically illustrating an image decoding apparatus according to an embodiment of the present invention.
  • FIG. 12 is a block diagram schematically illustrating a decoder according to an embodiment of the present invention.
  • FIG. 13 is a block diagram schematically illustrating a motion vector decoding apparatus according to an embodiment of the present invention.
  • FIG. 14 is a flowchart illustrating a motion vector decoding method according to an embodiment of the present invention.
  • 15 is a flowchart illustrating an image decoding method according to an embodiment of the present invention.
  • the motion vector encoding apparatus (Motion Vector Encoding Apparatus), the motion vector decoding apparatus (Motion Vector Decoding Apparatus), the video encoding apparatus (Video Encoding Apparatus), and the video decoding apparatus (Video Decoding Apparatus) will be described later.
  • Computer Notebook Computer, Personal Digital Assistant (PDA), Portable Multimedia Player (PMP), PlayStation Portable (PSP), Wireless Communication Terminal (Wireless Communication Terminal), Smartphone ( It may be a user terminal such as a smart phone) or a TV, or a server terminal such as an application server or a service server, and may encode, decode, encode, or decode an image, a communication device such as a communication modem for communicating with various devices or a wired / wireless communication network.
  • Various programs and data for predicting inter or intra may mean a variety of devices including a memory for storing, a microprocessor for executing and controlling a program.
  • the motion vector or the image encoded in the bitstream by the motion vector encoding apparatus or the image encoding apparatus is real time or non-real time through a wired or wireless communication network such as the Internet, a local area wireless communication network, a wireless LAN network, a WiBro network, a mobile communication network, or the like.
  • a wired or wireless communication network such as the Internet, a local area wireless communication network, a wireless LAN network, a WiBro network, a mobile communication network, or the like.
  • USB universal serial bus
  • a moving picture is composed of a series of pictures, and each picture is divided into a predetermined area such as a block.
  • each picture is divided into a predetermined area such as a block.
  • the divided blocks are largely classified into intra blocks and inter blocks according to encoding methods.
  • An intra block refers to a block that is encoded by using an intra prediction coding scheme. Intra prediction coding is performed by using pixels of blocks that have been previously encoded, decoded, and reconstructed in a current picture that performs current encoding. A prediction block is generated by predicting pixels of a block, and a difference value with pixels of the current block is encoded.
  • An inter block refers to a block that is encoded using inter prediction coding.
  • Inter prediction coding generates a prediction block by predicting a current block in a current picture by referring to one or more past pictures or future pictures, and then generates a current block. This is a method of encoding the difference value with.
  • a picture referred to for encoding or decoding the current picture is referred to as a reference picture.
  • FIG. 1 is a block diagram schematically illustrating a video encoding apparatus according to an embodiment of the present invention.
  • the image encoding apparatus 100 is an apparatus for encoding an image, and includes a predictor 110, a subtractor 120, a transformer and quantizer 130, and an encoder.
  • the encoder may include an encoder 140, an inverse quantizer and an inverse transformer 150, an adder 160, and a reference picture memory 170.
  • the predictor 110 determines a motion vector of the image according to the limited step-by-step motion vector resolution in which the motion vector resolution is limited in stages, and uses the determined motion vector to predict the image.
  • the motion vector and the reference frame index of the current block are estimated by estimating the motion of a block to be encoded in the predictor 110 input image (hereinafter referred to as a current block). Determining the motion information (Motion Information) including, and by using the determined motion information of the current block to compensate for the motion of the current block to generate a prediction block of the current block.
  • the predictor 110 further estimates a motion vector having a motion vector resolution of a pixel unit larger than the minimum motion vector resolution according to the limitation of the motion vector resolution selected by the encoder 140, and predicts the motion vector using the estimated motion vector.
  • An image can be generated.
  • the predictor 110 may estimate a motion vector having a smaller motion unit resolution by using template matching (TM).
  • template matching refers to a method of estimating a motion vector of a current block by using pixels that are already encoded, decoded, and reconstructed among neighboring pixels (or adjacent pixels among neighboring pixels) of a current block to be encoded. A method of additionally estimating a motion vector using template matching will be described in detail later with reference to FIGS. 5 and 6.
  • the process of determining the motion information in units of blocks of the predictor 110 and generating a prediction block is performed by a coding unit (eg, a block, a slice, a picture, and a picture group (GOP). Picture), etc., and an image obtained by collecting each prediction block generated by performing the coding unit may be generated as a prediction image.
  • a coding unit eg, a block, a slice, a picture, and a picture group (GOP). Picture
  • an image obtained by collecting each prediction block generated by performing the coding unit may be generated as a prediction image.
  • the predictor 110 determines a motion vector by estimating the motion of each block in the slice, and generates a prediction image by collecting the prediction blocks generated using the motion vector in units of slices. can do.
  • the input image may be an image (eg, an image corresponding to an arbitrary slice in an arbitrary picture) for a predetermined coding unit constituting a video
  • the prediction image may be an input image. It may be an image (eg, a slice image in which prediction blocks are collected) for a predetermined coding unit including prediction blocks generated by predicting blocks in the block.
  • the predicted image generated by the predictor 110 may be generated according to the motion vector resolution for each step. That is, the predictor 110 determines a motion vector for each block in the input image according to the motion vector resolution for each step, generates a prediction block using the prediction vector, and generates a prediction image by collecting the generated prediction blocks. As many as the number of limiting steps of the vector resolution may be generated. In addition, when a motion vector having a motion vector resolution of a pixel unit smaller than the minimum motion vector resolution according to the selected motion vector resolution is further estimated, one prediction image may be generated for an image of a coding unit.
  • the subtractor 120 subtracts the input image and the prediction image to generate a residual image.
  • the residual image is an image generated by subtracting an input image and a prediction image to be encoded, and means an image including a residual signal that is a difference between a pixel of the input image and a pixel of the prediction image.
  • the residual image generated by the subtractor 120 may be generated as many as the number of limiting steps of the motion vector resolution.
  • a prediction image is generated according to the estimated motion vector, a residual image of the prediction image may be generated.
  • the converter and quantizer 130 transforms and quantizes the residual image. That is, the transformer and the quantizer 130 convert the residual signal of the residual image generated by the subtractor 120 into the frequency domain to generate a transformed residual image having a transform coefficient and convert the residual signal.
  • the transform coefficients of the residual image are quantized to generate transformed and quantized residual images.
  • a transform method used a method of transforming an image signal in a spatial domain into a frequency domain such as a Hadamard transform or a discrete cosine transform based integer transform is used.
  • various quantization techniques such as Dead Zone Uniform Threshold Quantization (DZUTQ) or quantization weighted matrix (DZUTQ) may be used.
  • DZUTQ Dead Zone Uniform Threshold Quantization
  • DZUTQ quantization weighted matrix
  • the encoder 140 encodes the transformed and quantized residual images to generate the limited-level image coded data, calculates the encoding cost of the limited-step image coded data, and uses the encoding cost of the limited-step image coded data to obtain the motion vector resolution.
  • a restriction step is selected, and the encoded image data is encoded using a motion vector determined according to the motion vector resolution according to the selected restriction step.
  • the entropy encoding technique may be used as a technique for encoding the quantized transform coefficients by the encoder 140, but various encoding techniques may be used without being limited thereto.
  • the encoder 140 will be described in detail with reference to FIGS. 2 and 3.
  • the inverse quantizer and inverse converter 150 inverse quantizes and inverse transforms the transformed and quantized residual image to reconstruct the residual image. That is, the inverse quantizer and the inverse transformer 150 inverse quantizes the transformed and quantized residual images transmitted from the transformer and the quantizer 130 to restore a residual image having a transform coefficient, and reconstructs the residual image having a transform coefficient. Inverse conversion restores a residual image having a residual signal. In this case, the inverse quantizer and the inverse transformer 150 may reconstruct the residual image by performing the transformed and quantized method in the inverse of the transformer and the quantizer 130.
  • the inverse quantizer and inverse transformer 150 inversely quantizes and inverse transforms all the transformed and quantized residual images transmitted from the transformer and quantizer 130, but limits the motion vector resolution selected by the encoder 140. Only the residual image according to the limitation of the motion vector resolution may be reconstructed by performing inverse quantization and inverse transformation on the residual image of the predicted image generated using the motion vector determined according to the step, and transforming only the residual image.
  • the adder 160 reconstructs the input image by adding the residual image reconstructed by the inverse quantizer and the inverse transformer 150 and the predictive image generated by the predictor 110.
  • the reference picture memory 170 stores a reference picture in which reconstructed input pictures are accumulated by picture units, and the stored reference pictures may be used by the predictor 110 to predict the next input picture or the next picture.
  • the image encoding apparatus 100 may add an intra predictor, a deblocking filter, or the like for intra prediction in the predictor 110. It can be included as.
  • the subtractor 120 may generate a residual image by subtracting the input image and the prediction image generated by the intra predictor, and the transformer and quantizer 130 and the inverse quantizer and inverse transformer 150 may generate the residual image. Transform and quantization for and inverse transform and inverse quantization on the transformed and quantized residual image may be further performed.
  • the encoder 140 may generate image encoded data by encoding the transformed and quantized residual image, which is included in the bitstream.
  • the deblocking filter deblocks the input image to be reconstructed.
  • the deblocking filtering refers to an operation of reducing block distortion generated by encoding an image in block units, and applying a deblocking filter to a block boundary and a macroblock boundary, or applying a deblocking filter only to a macroblock boundary or a deblocking filter. You can optionally use one of the methods that does not use.
  • FIG. 2 is an exemplary view showing a state of determining a motion vector through motion estimation according to an embodiment of the present invention.
  • the predictor 110 finds a reference block that is the block most similar to the current block among predetermined search regions within several reference pictures that can be used to determine a motion vector of the current block, and finds a reference picture index and a reference block representing the reference picture.
  • the motion of the current block is estimated by acquiring the indicated motion vector as motion information.
  • a method of determining a motion vector may be represented by Equation 1 and Equation 2.
  • MV represents a motion vector of a current block, and the position of the motion vector may exist anywhere in several reference pictures.
  • org represents the original current block
  • ref MV represents a reference block indicated by MV .
  • SAD ( org , ref MV ) represents a function that calculates the sum of absolute values for the difference of pixels between org and ref MV
  • represents a Lagrange Multiplier
  • PMV represents a predictive motion vector.
  • Ref_idx represents a reference picture index, which is an index representing a reference picture
  • Rate ( MV , PMV , ref_idx ) represents a function for calculating the difference between MV and PMV and the amount of bits required to encode ref_idx .
  • Mcost ( MV ) represents an encoding cost required for encoding motion information according to MV.
  • MV * represents a motion vector of the current block that is finally determined
  • S MV represents a predetermined motion search region.
  • the predictor 110 may determine a predetermined motion search region of several reference pictures.
  • S MV The coding cost for each MV that can be considered within is calculated using Equation 1, and the MV having the smallest coding cost can be found using Equation 2 and determined as the motion vector of the current block.
  • the encoding cost is calculated as the rate-distortion cost, but the encoding cost is not necessarily calculated as the rate-distortion cost and may be calculated in various other ways.
  • Equation 3 org, ⁇ , PMV, ref_idx each represent the same as described above through Equation 1, i represents a block mode index for identifying an interblock mode , and S mode from a block of 16x16 pixel size Rec i represents a reconstructed block which is recovered by encoding and decoding the current block using block mode i , and SSD ( org , rec i ) represents org and rec i.
  • MV i represents a function for calculating a pixel sum of the square error between
  • Coeff i represents the quantized transform coefficients produced by the transform, and both the residual block of the current block by using the block mode
  • i Rate (MV i, PMV , ref_idx , Coeff i) is and for coding the current block by using the block mode, i, calculating the bit rate required for encoding the MV and the difference value, ref_idx, and the quantized transform coefficient between PMV
  • rdcost (i) represents the coding cost of encoding the current block by using the block mode, i.
  • i * is an index indicating the block mode of the current block that is finally determined.
  • the predictor 110 calculates encoding costs for several interblock modes by using Equation 3, and finds i * , which is an index indicating a block mode having a minimum encoding cost, by using Equation 4. It can be determined as the block mode of the final current block.
  • the encoding cost is calculated as a rate-distortion cost, but the encoding cost is not necessarily calculated as the rate-distortion cost as shown in Equation 3 and may be calculated in various other ways.
  • the predictor 110 calculates the encoding cost, but the present invention is not limited thereto.
  • the encoder 140 calculates the encoding cost and transfers the calculated encoding cost to the predictor 110 so that the predictor 110 may calculate the encoding cost.
  • the motion vector may be determined.
  • the motion vector obtained through Equation 2 is predictively encoded, and the predictor 110 or the encoder 140 encodes a differential motion vector which is a difference value between the motion vector and its predicted motion vector as shown in Equation 5.
  • motion vectors are obtained by performing motion estimation at resolutions of integer pixel units, 1/2 pixel units, and 1/4 pixel units, and differential motion vectors of the obtained motion vectors are obtained. Is encoded using a code table to which a bit string for each multiple of quarter pixels as shown in Table 1 is assigned.
  • Long codewords are used to encode a motion vector having a small size, such as a codeword for encoding a motion vector having a resolution of 1/2 pixel and a resolution of a quarter pixel and a motion of a resolution of an integer pixel. This is because a codeword for encoding a vector is used together.
  • an embodiment of the present invention restricts the motion vector resolution stepwise to perform motion estimation, motion compensation, encoding of the residual image, and encoding of the motion vector at the limited motion vector resolution of each step.
  • the cost is calculated to select a limit of motion vector resolution having a minimum encoding cost, and a bitstream including image encoded data and motion vector encoded data encoded at the selected motion vector resolution of the selected limit level is generated.
  • FIG. 3 is a block diagram schematically illustrating an encoder according to an embodiment of the present invention.
  • the encoder 140 may include a predicted residual encoder 310 and a motion vector encoder 320.
  • the predictive residual encoder 310 encodes the transformed and quantized residual image to generate image coded data for each step. That is, the predictive residual encoder 310 encodes the residual image transformed and quantized by the transformer and the quantizer 130. The residual residual encoder 310 encodes the transformed and quantized residual image in the step of limiting the motion vector resolution. Generates video coded data.
  • the motion vector encoder 320 calculates an encoding cost of the image coded data for each step, selects a restriction step of the motion vector resolution using the encoding cost of the image coded data for each step, and according to the motion vector resolution according to the selected restriction step.
  • the motion vector encoded data obtained by encoding the determined motion vector is output. That is, the motion vector encoder 320 receives various types of information transmitted from the predictor 110 and the predictive residual encoder 310 in the process of encoding the image coded data of each step of the limitation of the motion vector resolution generated by the predictive residual encoder 310. Calculate the encoding cost for encoding each image coded data, select the motion vector resolution limit step among the motion vector resolution limit steps, and determine according to the limit level of the selected motion vector resolution.
  • the motion vector encoded data obtained by encoding the motion vector to be output is output.
  • the motion vector encoder 320 encodes a motion vector that is already determined by the limitation step of each motion vector resolution and generates motion vector encoded data for each restriction step when calculating the encoding cost for each restriction step of the motion vector resolution. After selecting the limit of the vector resolution, the generated motion vector encoded data may be output without generating the motion vector encoded data again.
  • the motion vector encoder 320 will be described in detail with reference to FIG. 4.
  • FIG. 4 is a block diagram schematically illustrating a motion vector encoding apparatus according to an embodiment of the present invention.
  • the motion vector encoding apparatus may be implemented as the motion vector encoder 320 in the encoder 140 described above with reference to FIG. 3.
  • the motion vector encoding apparatus is called a motion vector encoder 320.
  • the motion vector encoder 320 may include a coding cost calculator 410, a motion information encoder 430, and a limited resolution determiner 420.
  • the encoding cost calculator 410 calculates the encoding cost according to the motion vector resolution of the step-by-step limited motion vector resolution. That is, the encoding cost calculator 410 is required to encode a residual image which is a difference between the predicted image and the input image generated by using the motion vector including the motion vector and the reference picture index determined according to the motion vector resolution for each step. Calculate the cost.
  • the motion vector resolution may include one or more of an integer pixel resolution, a 1/2 pixel resolution, a 1/4 pixel resolution, and a 1/8 pixel resolution, but is not limited thereto. It may also include a pixel-by-pixel resolution.
  • the step of limiting the motion vector resolution refers to a group formed step by step in order of increasing motion vector resolution in a set of motion vector resolutions that can be used for motion estimation.
  • the first limiting step is a group containing only resolution in integer pixel units
  • the second limiting step of motion vector resolution is a group including resolution in integer pixel units and resolution in 1/2 pixel units
  • motion vector resolution is the resolution in integer pixel unit, 1/2 It may be a group including a resolution in units of pixels, a resolution in units of 1/4 pixels and a resolution in units of 1/8 pixels.
  • the encoding cost calculator 410 predicts and encodes the input image using the motion vector determined by estimating the motion at the resolution of the integer pixel unit included in the first limiting step of the motion vector resolution.
  • An input image is calculated by using the motion vector determined by calculating the encoding cost required as the encoding cost of the first constraint step and estimating the motion at a resolution of 1/2 pixel unit, which is an integer included in the second constraint stage of the motion vector resolution.
  • the coding cost required for predictive coding is calculated as the coding cost of the second limiting step, and the motion is estimated using the integer, 1/2, and 1/4 pixel resolutions included in the third limiting step of the motion vector resolution.
  • the encoding cost required to predict-encode the input image using the determined motion vector is calculated as the encoding cost of the third limited step.
  • the encoding cost is calculated as the encoding cost of the fourth restriction stage.
  • the encoding cost calculator 410 may determine at least one of a motion vector for each block and a block mode according to the motion vector resolution for each step, and calculate an encoding cost using at least one of the determined motion vector and the determined block mode. For example, the encoding cost calculator 410 may calculate the encoding cost considering the motion vector and the block mode using Equation 6.
  • cost ( k ) represents the encoding cost of each step.
  • Equation 6 k represents a resolution limit index for identifying a limit step of motion vector resolution.
  • the considered motion vector resolution is up to 1/2 k pixel units. Therefore, in the above example, k becomes 0, 1, 2, and 3, respectively, when the resolution limit index indicates the first to fourth limit steps of the motion vector resolution.
  • N MB denotes the number of blocks (macroblock or subblock in which a motion vector is determined) existing in a coding unit (eg, slice) in which a limiting step of motion vector resolution and a coding cost are respectively selected and calculated.
  • i * m represents a block mode for the block in the coding unit.
  • the N_MB value may have a positive integer value.
  • N_MB 1, the N_MB value is a case where the limit level of the motion vector resolution is determined for every subblock.
  • the encoding cost calculator 410 may calculate the encoding cost by encoding the motion vector using a different code table according to the motion vector resolution for each step. That is, the encoding cost calculator 410 may store another code table in advance for each step of limiting the motion vector resolution and use the same to encode a motion vector of the motion vector resolution for each step by step.
  • another code table for each step of limiting the motion vector resolution refers to a code table for encoding considering only the motion vector of the motion vector resolution within the limit step of each motion vector resolution.
  • the code table for the first limiting step of the motion vector resolution is a code table for encoding considering only the motion vector having the resolution of the integer pixel unit
  • the code table for the second limit step of the motion vector resolution is a code table for encoding by considering only the motion vector having the resolution of integer, 1/2 pixel unit.
  • the code table for the third limiting step is a code table for encoding by considering only motion vectors of resolutions of integer, 1/2, and 1/4 pixel units.
  • the code table for the fourth constraint step as exemplarily shown in Table 5, is a motion of resolution in integer, 1/2, 1/4, and 1/8 pixel units. Code table for encoding by considering only vectors.
  • the code table for encoding the motion vector determined by the step of limiting the motion vector resolution is configured and used differently to be suitable for the step of limiting the motion vector resolution.
  • the compression efficiency can be improved by encoding the vector efficiently.
  • the limited resolution determiner 420 selects a limited step of the motion vector resolution by using the encoding cost calculated in the limited step. In other words, the limited resolution determiner 420 compares the encoding costs of the limited stages calculated by the encoding cost calculator 410 with each other and selects the restriction of the motion vector resolution having the smallest encoding cost.
  • the limited resolution determiner 420 may select a limit step of the motion vector resolution for each coding unit including at least one of a block, a slice, a picture, and a picture group.
  • the limited resolution determiner 420 may calculate the motion vector resolution having the smallest encoding cost among the limited step coding costs calculated by summing the coding costs for all the blocks in the coding unit as shown in Equation 6 in the step of limiting the motion vector resolution.
  • the restriction step can be selected in the same manner as in Equation 7.
  • Equation 7 k denotes a resolution limit index, and k * denotes a resolution limit index for identifying a limit step of the motion vector resolution selected by the limit resolution determiner 420.
  • the resolution limit index k * for identifying the limit level of the motion vector resolution selected by the limited resolution determiner 420 is included in the bitstream when encoded by the motion information encoder 430 and generated as the resolution limit index data. .
  • k is represented by ⁇ 0, 1, 2, 3 ⁇ , and does not necessarily mean that the value of k should have only one value of ⁇ 0, 1, 2, 3 ⁇ . Therefore, k has an integer value of 0 or more and may have one or more values.
  • the motion information encoder 430 outputs motion vector encoded data obtained by encoding a motion vector determined according to a limitation step of the selected motion vector resolution. That is, when the motion vector encoder 430 selects the limit of the motion vector resolution by the limited resolution determiner 420, the motion vector coded data is generated by encoding in advance in the process of calculating the encoding cost by the encoding cost calculator 410. The motion vector encoded data obtained by encoding the motion vector determined according to the motion vector resolution of the limited step selected by the limited resolution determiner 420 is output.
  • the motion vector coded data output from the motion information encoder 430 is also determined for each block in the predetermined coding unit such as a slice. It may be generated motion vector encoded data.
  • the motion vector encoded data may further include a reference picture index as well as a motion vector.
  • the restriction information of the motion vector resolution is determined, and a unit for encoding the information may be not limited to a slice unit but may be a small subblock unit to a large frame and an entire sequence.
  • the motion information encoder 430 may output resolution restriction index data by encoding a resolution restriction index indicating a restriction level of the selected motion vector resolution.
  • the resolution limit index data is additionally included in the bitstream, and used to identify the limiting step of the motion vector resolution in the motion vector decoding apparatus or the image decoding apparatus.
  • FIG. 5 is an exemplary diagram for describing a process of estimating a motion vector having a motion vector resolution of a small pixel unit using template matching according to an embodiment of the present invention.
  • the predictor 110 is presently present in the reference picture at a motion vector resolution of a pixel unit smaller than the minimum motion vector resolution among the motion vector resolutions included in the motion vector resolution limiting step selected by the encoder 140.
  • a reference block having an adjacent pixel most similar to the adjacent pixels of the block is found to obtain a motion vector indicating the reference block.
  • the search region for estimating the motion by using template matching at a motion vector resolution of a small pixel unit may be limited to a peripheral region of the motion vector determined according to the limiting step of the selected motion vector resolution.
  • the peripheral area of the motion vector may be an area that is equally or differently spaced apart by a predetermined number of pixels with respect to the pixel indicated by the motion vector.
  • FIG. 6 is an exemplary diagram for describing a process of selecting a motion vector according to an embodiment of the present invention.
  • 6A illustrates a process of calculating an encoding cost for a motion vector determined according to the limitation of the motion vector resolution selected by the encoder 140
  • 6B illustrates a process of limiting the motion vector resolution selected by the encoder 140.
  • a process of calculating a coding cost for a motion vector determined by using template matching at a motion vector resolution of a pixel unit smaller than the minimum motion vector resolution is shown.
  • the coding costs calculated by the methods shown in 6A and 6B can be calculated using Equations 8 and 9, respectively.
  • T org ( i ) represents a current template representing adjacent pixels of the current block
  • N T denotes a pixel of the pixel included in the current template and the reference template.
  • the distortion cost is taken as an encoding cost, but the encoding cost is not necessarily limited thereto and may be calculated in various ways.
  • the predictor 110 Since the motion vector having the small SAD among the SADs calculated using Equations 8 and 9 has better coding efficiency, the predictor 110 outputs the predicted block by compensating for the motion of the block using the corresponding motion vector. do. In this case, the predictor 110 may select a motion vector for each block in the coding unit. Therefore, the predictor 110 may output a predicted block whose motion is compensated using the estimated motion vector only when the selected motion vector for each block is a motion vector additionally estimated using template matching. If the motion vector determined according to the limit of the motion vector resolution is selected, the prediction block is not additionally generated.
  • the image coded data encoded by using additional information or any motion vector for additionally encoding or identifying the bit is encoded. It is not necessary to encode information indicating whether the stream is included. This is because the image decoding apparatus may perform the same process as described with reference to FIGS. 5 and 6 in the same manner as the image encoding apparatus 100.
  • FIG. 7 is a flowchart illustrating a motion vector encoding method according to an embodiment of the present invention.
  • the motion vector encoder 320 calculates the encoding cost according to the motion vector resolution in which the motion vector resolution is limited step by step (S710), A step of limiting the motion vector resolution is selected using the encoding cost (S720), and motion vector encoded data encoded by the motion vector determined according to the limit of the selected motion vector resolution is output (S730).
  • the motion vector resolution may include at least one of resolution in integer pixel units, resolution in 1/2 pixel units, resolution in 1/4 pixel units, and resolution in 1/8 pixel units.
  • the motion vector encoder 320 may further output resolution limit index data by encoding a resolution limit index indicating a limit step of the selected motion vector resolution.
  • the motion vector encoder 320 may determine at least one of a motion vector for each block and a block mode according to the motion vector resolution for each step, and calculate a coding cost using at least one of the determined motion vector and the determined block mode.
  • the coding cost can be calculated by encoding the motion vector using a different code table according to the motion vector resolution for each step.
  • the motion vector encoder 320 may select a restriction step of the motion vector resolution for each coding unit including one or more of blocks, slices, pictures, and picture groups, and encode the smallest of the encoding costs calculated for each restriction step.
  • the limiting step with cost can be selected as the limiting step of the motion vector resolution.
  • FIG. 8 is a flowchart for explaining an example of implementing a motion vector encoding method according to an embodiment of the present invention.
  • the motion vector resolution that the motion vector encoder 320 can use to determine the motion vector is resolution in integer pixels, resolution in 1/2 pixel, resolution in 1/4 pixel, and 1/8 pixel. Assuming a resolution of and coding unit is a slice, a process of encoding a motion vector is illustrated.
  • the motion vector encoder 320 calculates cost (0), which is an encoding cost when encoding an image, by considering only a motion vector having a motion vector resolution of an integer pixel unit as a first limiting step of the motion vector resolution (S810). That is, the motion vector encoder 320 performs motion estimation on the motion vector resolution in integer pixel units by using Equation 1 and Equation 2 for all blocks in the slice in the input image. A motion vector is determined, and a block mode according to the motion vector determined by the motion vector resolution in units of integer pixels is determined using Equation 3 and Equation 4.
  • the motion vector encoder 320 predictively encodes each block according to the determined motion vector and the block mode to generate image coded data for each block of which the motion vector resolution is limited in units of integer pixels.
  • the encoding cost for each image encoded data is calculated using the, and the encoding cost for all blocks in the slice is summed to calculate the encoding cost cost (0) for the first limiting step of the motion vector resolution as shown in Equation (6).
  • the motion vector encoder 320 when the motion vector encoder 320 predictively encodes each block to generate image coded data, the motion vector encoder 320 also predicts and encodes a motion vector having a motion vector resolution of an integer pixel unit. Encode using a code table.
  • the motion vector encoder 320 calculates cost (1), which is an encoding cost when encoding an image, by considering only a motion vector having a motion vector resolution of an integer and 1/2 pixel unit as a second limiting step of the motion vector resolution. (S820). That is, the motion vector encoder 320 performs motion estimation on the motion vector resolutions of integer pixel units and 1/2 pixel units using Equation 1 and Equation 2 for all blocks in the slice in the input image. A motion vector having a motion vector resolution of a unit or 1/2 pixel unit is determined, and a block mode according to a motion vector determined by an integer pixel unit or a motion vector resolution of a unit of 1/2 pixel using Equations 3 and 4 Determine.
  • the motion vector encoder 320 predictively encodes each block according to the determined motion vector and the block mode, and generates image coded data for each block of which the motion vector resolution is limited only in integer pixel units and 1/2 pixel units. Equation 3 and Equation 4 are used to calculate the encoding cost for each piece of image encoded data and add the encoding cost for all the blocks in the slice to calculate the encoding cost cost for the second limited step of motion vector resolution as shown in Equation 6. Calculate 1).
  • the motion vector encoder 320 predictively encodes each block to generate image coded data
  • the motion vector encoder 320 also predicts and encodes a motion vector having a motion vector resolution of an integer pixel unit or 1/2 pixel unit, as shown in Table 3 below. Encode using the code table for the second constraint step of.
  • the motion vector encoder 320 is a third limitation step of the motion vector resolution, which is a cost cost of encoding an image by considering only a motion vector having a motion vector resolution of integer, 1/2, and 1/4 pixel units. 2) is calculated (S830). That is, the motion vector encoder 320 moves all the blocks in the slice in the input image using the equation 1 and the equation 2 at the motion vector resolution of integer pixel unit, 1/2 pixel unit and 1/4 pixel unit. By performing the estimation, a motion vector having a motion vector resolution of an integer pixel unit, 1/2 pixel unit, or 1/4 pixel unit is determined, and an integer pixel unit, 1/2 pixel unit or A block mode according to a motion vector determined by a motion vector resolution of a quarter pixel unit is determined.
  • the motion vector encoder 320 predicts and encodes each block according to the determined motion vector and the block mode, so that an image of each block whose motion vector resolution is limited to an integer pixel unit, 1/2 pixel unit, and 1/4 pixel unit only. Generating encoded data, calculating encoding cost for each image encoded data using Equation 3 and Equation 4, summing the encoding cost for all blocks in the slice, and limiting the motion vector resolution as shown in Equation 6. Calculate the coding cost cost (2) for.
  • the motion vector encoder 320 predictively encodes each block to generate image coded data
  • the motion vector encoder 320 also predicts and encodes a motion vector having a motion vector resolution of integer pixel unit, 1/2 pixel unit, or 1/4 pixel unit. As shown in FIG. 4, encoding is performed using a code table for a third limit step of motion vector resolution.
  • the motion vector encoder 320 encodes an image by encoding only a motion vector having a motion vector resolution of integer, 1/2, 1/4, and 1/8 pixel units as a fourth limiting step of the motion vector resolution.
  • the cost (3) is calculated (S840). That is, the motion vector encoder 320 uses equations 1 and 2 for all blocks in the slice in the input image to be integer pixel units, 1/2 pixel units, 1/4 pixel units, and 1/8 pixel units. By performing motion estimation at a motion vector resolution of, a motion vector having a motion vector resolution of integer pixel unit, 1/2 pixel unit, 1/4 pixel unit, or 1/8 pixel unit is determined.
  • a block mode according to a motion vector determined by a motion vector resolution of an integer pixel unit, 1/2 pixel unit, 1/4 pixel unit, or 1/8 pixel unit is determined.
  • the motion vector encoder 320 predictively encodes each block according to the determined motion vector and the block mode, and the motion vector resolution is increased only in integer pixel units, 1/2 pixel units, 1/4 pixel units, and 1/8 pixel units.
  • Generate image encoding data for each limited block calculate encoding cost for each image encoding data using equations (3) and (4), add the encoding cost for all blocks in the slice, and add the motion vector as shown in (6).
  • the coding cost cost (3) for the fourth limiting step of resolution is calculated.
  • the motion vector encoder 320 predictively encodes each block to generate the image coded data
  • the motion vector resolution moves in the integer pixel unit, 1/2 pixel unit, 1/4 pixel unit, or 1/8 pixel unit.
  • the vector is also predictively encoded using a code table for the fourth limited step of motion vector resolution, as shown in Table 5.
  • the motion vector encoder 320 selects a limit step of the motion vector resolution having the smallest cost () among cost (0), cost (1), cost (2), and cost (3) (S850). That is, the motion vector encoder 320 calculates a resolution limit index corresponding to a cost having a minimum value among cost (0), cost (1), cost (2), and cost (3) calculated in steps S810 to S840. Equation 7 is used to select.
  • the motion vector encoder 320 encodes the motion vector and the resolution limit index determined according to the limit of the selected motion vector resolution (S860). That is, the motion vector encoder 320 encodes the motion vector determined according to the restriction of the motion vector resolution selected in step S850 among the motion vectors determined according to the restriction step of each motion vector resolution when calculating the encoding cost in steps S810 to S840.
  • the encoded motion vector encoded data is outputted, and the resolution limited index representing the limited level of the motion vector resolution selected in step S850 is encoded to output the resolution limited index data.
  • restriction index data may be inserted into a block header or a slice header of the bitstream. If the coding unit is a picture other than a slice, it will be inserted into the picture header.
  • steps S810 to S840 are illustrated and described as being sequentially performed, but steps S810 to S840 may be performed in a changed order or in parallel.
  • FIG. 9 is a flowchart illustrating an image encoding method according to an embodiment of the present invention.
  • the image encoding apparatus 100 determines a motion vector of an image according to a limited step-by-step motion vector resolution in which the motion vector resolution is limited step by step (S910), and determines the determined motion vector.
  • Generate the coded image of the restriction stage (S950) calculate the encoding cost of the image coded data of the restriction stage (S960), and select the restriction stage of the motion vector resolution using the encoding cost of the image encoding data of the restriction stage (S970).
  • Image coded using the motion vector determined according to the motion vector resolution according to the selected restriction step A bitstream including the data stream is generated (S980).
  • the image encoding apparatus 100 estimates a motion vector having a motion vector resolution of a pixel unit smaller than the minimum motion vector resolution according to the limiting step of the selected motion vector resolution, and moves the estimated motion vector resolution of a small pixel unit.
  • the image may be predictively encoded using a vector to generate a bitstream including encoded image encoded data.
  • the image encoding apparatus 100 may estimate a motion vector having a motion vector resolution of a small pixel unit by using template matching, and predictively encode an image according to a motion vector determined according to a limitation step of the selected motion vector resolution.
  • a bitstream including image encoded data encoded using a motion vector having a small encoding cost among encoding costs required to predict and encode an image according to a motion vector having a motion vector resolution of a small pixel. Can be generated.
  • FIG. 10 is a flowchart for explaining another example of implementing an image encoding method according to an embodiment of the present invention.
  • Steps S1010 to S1070 are the same as or similar to steps S910 to S970 described above with reference to FIG. 9, and thus a detailed description thereof will be omitted.
  • the image encoding apparatus 100 determines according to the motion vector resolution of the pixel unit smaller than the minimum motion vector resolution among the motion vector resolutions within the limit of the motion vector resolution selected in step S1070 and the limit of the selected motion vector resolution.
  • a search region for motion estimation is determined based on the motion vector, in step S1080. For example, in step S1070, if a limiting step of motion vector resolution including a motion vector resolution in integer pixel units and 1/2 pixel units is selected, the minimum motion vector resolution is a motion vector resolution in units of 1/2 pixel, which is the minimum. do.
  • the image encoding apparatus 100 minimizes the motion vector resolution in the 1/4 pixel unit and the motion vector resolution in the 1/8 pixel unit that is smaller than the motion vector resolution in the 1/2 pixel unit.
  • the resolution is determined as the motion vector resolution in pixels smaller than the vector resolution.
  • step S1070 the step of limiting the motion vector resolution including the motion vector resolution of the integer pixel unit and the half pixel unit is selected. Therefore, additional motion estimation is performed to the peripheral region of the motion vector determined when the encoding cost is calculated in step S1060. Set to limit the search area.
  • the image encoding apparatus 100 determines a motion vector having a smaller motion unit resolution by performing motion estimation using template matching at a motion vector resolution of a smaller pixel unit in the search area determined in operation S1080 (S1090). ). That is, as described above with reference to FIG. 5, the image encoding apparatus 100 moves a motion of each block in a slice at a motion vector resolution of 1/4 pixel unit and a motion vector resolution of 1/8 pixel unit using template matching. Is further estimated to determine a motion vector with a motion vector resolution of 1/4 pixel unit or with a motion vector resolution of 1/8 pixel unit. In this case, as described above with reference to FIG.
  • the image encoding apparatus 100 uses the motion vector determined according to the limit of the motion vector resolution selected in step S1070 and the template matching in step S1090 to a motion vector resolution in small pixel units.
  • a bitstream including image coded data encoded using a motion vector having good encoding efficiency among the motion vectors determined by performing motion estimation may be output.
  • FIG. 11 is a block diagram schematically illustrating an image decoding apparatus according to an embodiment of the present invention.
  • the image decoding apparatus 1100 may include a decoder 1110, an inverse quantizer and an inverse converter 1120, a predictor 1130, an adder 1140, and a reference picture memory 1150. It can be configured to include.
  • the decoder 1110 reconstructs the motion vector by decoding motion vector encoded data extracted from the bitstream according to the motion vector resolution limiting step, which is obtained by decoding the resolution limit index data extracted from the bitstream, and extracting from the bitstream.
  • the transformed and quantized residual images are reconstructed by decoding the encoded image encoded data. That is, the decoder 1110 extracts and decodes the resolution limited index data, the motion vector encoded data, and the image encoded data from the bitstream, respectively, to restore the resolution limited index, the motion vector, and the transformed and quantized residual images.
  • the decoder 1110 decodes the motion vector encoded data by using the code table identified by the motion vector resolution limit step indicated by the reconstructed resolution limit index.
  • the decoder 1110 may decode using an encoding technique such as entropy encoding.
  • the decoder 1110 may reversely decode the encoder 140 described above with reference to FIG. 1 to decode.
  • the inverse quantizer and inverse transformer 1120 inverse quantizes and inverse transforms the transformed and quantized residual image to reconstruct the residual image. That is, the inverse quantizer and inverse transformer 1120 inverse quantizes and inversely transforms the quantized transform coefficients of the transformed and quantized residual image transmitted by the decoder 1110 to reconstruct the residual image having the residual signal.
  • the inverse quantizer and the inverse converter 1120 may perform inverse quantization and inverse transformation by performing the transformation and quantization of the converter and the quantizer 1130 described above with reference to FIG. 1.
  • the reconstructed residual image is an image in a predetermined coding unit including several residual blocks, and the image of the coding unit may be various images such as a block, a slice, a picture, a picture group, and the like.
  • the predictor 1130 generates a predicted image of the image by using the reconstructed motion vector. That is, the predictor 1130 generates a prediction block by compensating for the motion of blocks to be encoded by using the motion vector reconstructed by the decoder 1110, and the prediction blocks are collected in a predetermined coding unit to form a prediction image. .
  • the predictor 1130 is not a motion vector reconstructed by the decoder 1110, but a pixel unit smaller than the minimum motion vector resolution according to the limit of the motion vector resolution indicated by the resolution limit index reconstructed by the decoder 1110.
  • a motion vector having a motion vector resolution of H may be estimated, and a prediction image may be generated using the motion vector having a motion vector resolution of a small pixel unit.
  • the predictor 1130 may determine a smaller encoding cost between the encoding cost calculated using the motion vector reconstructed by the decoder 1110 and the encoding cost calculated using the motion vector estimated with the motion vector resolution of a small pixel.
  • the branch may generate a prediction image by using a motion vector.
  • the predictor 1130 may estimate a motion vector having a motion vector resolution of a small pixel unit by using template matching.
  • the adder 1140 reconstructs the image by adding the residual image and the predicted image. That is, the adder 1140 reconstructs the image by adding the residual image reconstructed by the inverse quantizer and the inverse transformer 1120 and the predicted image generated by the predictor 1130.
  • the image reconstructed by the adder 1140 may be accumulated in picture units and output as a reconstructed image.
  • the image reconstructed by the adder 1140 may be stored in the reference picture memory 1150 and used by the predictor 1130 to predict the next image.
  • the image decoding apparatus 1100 may include an intra predictor for intra prediction, a deblocking filter for deblocking filtering the reconstructed current block, and the like. It may further comprise.
  • FIG. 12 is a block diagram schematically illustrating a decoder according to an embodiment of the present invention.
  • the decoder 1110 may include a motion vector decoder 1210 and a predictive residual decoder 1220.
  • the motion vector decoder 1210 reconstructs the motion vector by decoding the motion vector coded data extracted from the bitstream according to the limitation of the motion vector resolution, which is obtained by decoding the resolution limit index data extracted from the bitstream.
  • the motion vector encoder 1210 will be described in detail with reference to FIG. 13.
  • the predictive residual decoder 1220 reconstructs the transformed and quantized residual image by decoding image encoded data extracted from the bitstream.
  • FIG. 13 is a block diagram schematically illustrating a motion vector decoding apparatus according to an embodiment of the present invention.
  • the motion vector decoding apparatus may be implemented as the motion vector decoder 1210 in the decoder 1110 described above with reference to FIG. 12.
  • the motion vector decoding apparatus according to an embodiment of the present invention is called a motion vector decoder 1210.
  • the motion vector decoder 1210 may include a restricted resolution reconstructor 1310 and a motion vector reconstructor 1320.
  • the limited resolution decompressor 1310 decodes the resolution limited index data extracted from the bitstream to restore the limiting step of the motion vector resolution. That is, the limited resolution restorer 1310 extracts and decodes the resolution limited index data from the bitstream to restore the resolution limited index, and restores the limiting step of the motion vector resolution indicated by the reconstructed resolution limited index.
  • the limited resolution reconstructor 1310 may extract the resolution limited index data from a header of a predetermined coding unit such as a block header, a slice header, a picture header, and the like of the bitstream.
  • the restraining step of the reconstructed motion vector resolution has limit information of the motion vector resolution for all blocks in the image of a predetermined coding unit. For example, assuming that the limiting step of the motion vector resolution is the second step of the motion vector resolution, the motion vectors of all blocks within a predetermined coding unit have only the motion vector resolution of integer pixels or 1/2 pixel units.
  • the motion vector reconstructor 1320 reconstructs the motion vector by decoding motion vector coded data extracted from the bitstream according to the limitation of the reconstructed motion vector resolution. That is, the motion vector decompressor 1320 extracts and decodes motion vector encoded data from the bitstream to reconstruct motion vectors of all blocks within a predetermined coding unit, and the motion vector resolution restored by the limited resolution decompressor 1310.
  • the motion vector encoded data may be decoded using the code table identified by the restriction step of. For example, assuming that the limiting step of the motion vector resolution is the second step of the motion vector resolution, the motion vector may be reconstructed by decoding the motion vector encoded data using a code table exemplarily shown in Table 3.
  • the motion vector decompressor 1320 includes a different code table in advance and stores the code table according to the limitation of the motion vector resolution, and the code table is previously agreed with the motion vector encoder 430 or the image encoding apparatus 100. This is the same table.
  • FIG. 14 is a flowchart illustrating a motion vector decoding method according to an embodiment of the present invention.
  • the motion vector decoder 1210 decodes the resolution limit index data extracted from the bitstream to restore the limit of the motion vector resolution (S1410).
  • the motion vector is reconstructed by decoding the motion vector encoded data extracted from the bitstream according to the motion vector resolution limiting step (S1420).
  • the motion vector decoder 1210 may decode the motion vector encoded data using another code table according to the limitation of the motion vector resolution restored in operation S1410.
  • 15 is a flowchart illustrating an image decoding method according to an embodiment of the present invention.
  • the image decoding apparatus 1100 is a motion vector extracted from a bitstream in accordance with a limitation step of motion vector resolution, which is reconstructed by decoding resolution limited index data extracted from a bitstream.
  • Decode the encoded data to restore the motion vector (S1510) decode the image coded data extracted from the bitstream to restore the transformed and quantized residual image (S1520), and inverse quantization and inverse transformation of the transformed and quantized residual image Reconstruct the residual image (S1530), generate a predicted image of the image using the reconstructed motion vector (S1540), and reconstruct the image by adding the residual image and the predicted image (S1550).
  • the image decoding apparatus 1100 may generate a prediction image by using the motion vector reconstructed in operation S1510, but the pixel smaller than the minimum motion vector resolution according to the limit of the motion vector resolution restored in operation S1510.
  • a prediction image may also be generated using a motion vector having.
  • the image decoding apparatus 1100 may estimate a motion vector having a motion vector resolution of a small pixel unit by using template matching.
  • each step described above with reference to FIGS. 14 and 15 is not necessarily all performed, and some steps may be selectively omitted or added.
  • the order of each step is not necessarily determined as shown, and the order of some or all of the steps may be changed or even performed in parallel.
  • the motion vectors of the motion vectors when encoding an image, it is determined by limiting the resolution of motion vectors for blocks in an image of a predetermined coding unit such as blocks and slices, and limited motion vector resolution. Since the motion vectors of the motion vectors can be encoded using a code table suitable for the limited motion vector resolution, the amount of bits required to encode the motion vectors can be reduced, and consequently, the compression efficiency of the image can be improved.
  • the image encoding apparatus 100 and the image decoding apparatus 1100 may perform more than a limited motion vector resolution through a motion estimation technique such as template matching that can be performed in the same manner. Since additional motion estimation can be performed at a smaller pixel-by-pixel motion vector resolution, the image can be encoded using motion vectors with more detailed motion vector resolution. Since it is not necessary to additionally encode a motion vector having a determined fine motion vector resolution, the compression efficiency of an image may be further improved.
  • a motion estimation technique such as template matching
  • the image encoding / decoding apparatus connects the encoded data output terminal of the image encoding apparatus 100 of FIG. 1 to the bitstream (encoding data) input terminal of the image decoding apparatus 1100 of FIG. Can be implemented.
  • An image encoding / decoding apparatus determines a motion vector of the image according to a limited step-by-step motion vector resolution in which the motion vector resolution is limited step by step, and predicts the image of the image by using the determined motion vector. Generate and subtract, transform, and quantize the image and the predictive image, and then encode the image to generate the limited-level image coded data, calculate the encoding cost of the limited-step image coded data, and encode the encoded image data of the limited-step image. Selects a limiting step of the motion vector resolution using; and is a resolution limit index indicating a limiting step of the image coded data encoded using the motion vector determined according to the motion vector resolution according to the selected limiting step and the selected motion vector resolution.
  • the motion vector is decoded by decoding the motion vector encoded data extracted from the bitstream according to a video encoding apparatus for encoding and outputting the resolution limited index data and the motion vector resolution, which is obtained by decoding the resolution limited index data extracted from the bitstream. And reconstructing the image by reconstructing and inversely transforming and quantizing the bitstream from the bitstream, and reconstructing the image by inverse quantization and inverse transformation.
  • the prediction image of the image is generated using the reconstructed motion vector.
  • an image decoder for reconstructing the image.
  • an image encoder may be implemented by the image encoding apparatus 100 according to an embodiment of the present invention, and the image decoder is an image decoding apparatus according to an embodiment of the present invention. It is possible to implement (1100).
  • a motion vector of the image is determined according to a limited step-by-step motion vector resolution in which the motion vector resolution is limited in stages, and the predicted image of the image is determined using the determined motion vector.
  • the image encoding method comprising: encoding and outputting the display limitations index data; And decoding the motion vector encoded data extracted from the bitstream in accordance with the limitation of the motion vector resolution, which is obtained by decoding the resolution limit index data extracted from the bitstream, and converting and quantized from the bitstream.
  • An image decoding step of restoring a residual image, inverse quantization and inverse transformation, and restoring the image, generating a predicted image of the image using the restored motion vector, and adding the residual image and the predicted image to restore the image It includes.
  • the image encoding step may be implemented as an image encoding step according to an embodiment of the present invention
  • the image decoding step is an image decoding step according to an embodiment of the present invention. It can be implemented.
  • the present invention is applied to an image compression processing field that encodes or decodes a video and encodes or decodes a motion vector therefor, thereby adaptively limiting the resolution of the motion vector to thereby remove the motion vector. It is a very useful invention that produces the effect of improving the compression efficiency by determining and encoding.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Un mode de réalisation de la présente invention concerne un procédé et un appareil de codage/décodage de vecteur de mouvement au moyen d'une limitation de résolution de vecteur de mouvement, et un procédé et un appareil de codage/décodage d'image dans lesquels ils interviennent. Le mode de réalisation de l'invention concerne un procédé de codage de vecteur de mouvement consistant à : calculer des frais de codage pour chaque niveau de limitation d'une résolution de vecteur de mouvement qui est limitée par des niveaux; sélectionner un niveau de limitation de la résolution de vecteur de mouvement au moyen des frais de codage calculés pour chaque niveau de limitation; et fournir des données codées de vecteur de mouvement obtenues par codage du vecteur de mouvement déterminé par le niveau de limitation sélectionné de la résolution de vecteur de mouvement, et des données d'indice de limitation de résolution obtenues par codage d'un indice de limitation de résolution pour indiquer le niveau de limitation sélectionné de la résolution de vecteur de mouvement. Selon le mode de réalisation de l'invention, la résolution du vecteur de mouvement est limitée de manière adaptative pour déterminer et coder un vecteur de mouvement, ce qui permet d'améliorer l'efficacité de compression.
PCT/KR2010/007548 2009-10-30 2010-10-29 Procédé et appareil de codage/décodage de vecteur de mouvement au moyen d'une limitation de résolution de vecteur de mouvement, et procédé et appareil de codage/décodage d'image dans lesquels ils interviennent WO2011053054A2 (fr)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR10-2009-0104411 2009-10-30
KR20090104411 2009-10-30
KR1020100106747A KR101449683B1 (ko) 2009-10-30 2010-10-29 움직임 벡터 해상도 제한을 이용한 움직임 벡터 부호화/복호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치
KR10-2010-0106747 2010-10-29

Publications (3)

Publication Number Publication Date
WO2011053054A2 WO2011053054A2 (fr) 2011-05-05
WO2011053054A3 WO2011053054A3 (fr) 2011-08-04
WO2011053054A9 true WO2011053054A9 (fr) 2012-01-19

Family

ID=43922877

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2010/007548 WO2011053054A2 (fr) 2009-10-30 2010-10-29 Procédé et appareil de codage/décodage de vecteur de mouvement au moyen d'une limitation de résolution de vecteur de mouvement, et procédé et appareil de codage/décodage d'image dans lesquels ils interviennent

Country Status (2)

Country Link
KR (1) KR101449683B1 (fr)
WO (1) WO2011053054A2 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10536701B2 (en) 2011-07-01 2020-01-14 Qualcomm Incorporated Video coding using adaptive motion vector resolution

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160065982A1 (en) * 2013-04-05 2016-03-03 Samsung Electronics Co., Ltd. Method for determining whether or not present image is reference image, and apparatus therefor
KR102349788B1 (ko) * 2015-01-13 2022-01-11 인텔렉추얼디스커버리 주식회사 영상의 부호화/복호화 방법 및 장치
KR102368621B1 (ko) * 2018-02-28 2022-02-28 삼성전자주식회사 비디오 복호화 방법 및 장치, 비디오 부호화 방법 및 장치
CN112565767B (zh) * 2020-06-04 2022-04-01 腾讯科技(深圳)有限公司 视频解码方法、视频编码方法及相关设备

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5276513A (en) * 1992-06-10 1994-01-04 Rca Thomson Licensing Corporation Implementation architecture for performing hierarchical motion analysis of video images in real time
KR970000683B1 (ko) * 1993-05-31 1997-01-16 삼성전자 주식회사 해상도 가변 적응적 화상압축/복원방법 및 장치
JP2008258769A (ja) * 2007-04-02 2008-10-23 Canon Inc 画像符号化装置及びその制御方法、並びに、コンピュータプログラム
JP2009027762A (ja) * 2008-11-07 2009-02-05 Renesas Technology Corp 動き補償画像符号化装置及び符号化方法

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10536701B2 (en) 2011-07-01 2020-01-14 Qualcomm Incorporated Video coding using adaptive motion vector resolution

Also Published As

Publication number Publication date
WO2011053054A3 (fr) 2011-08-04
KR101449683B1 (ko) 2014-10-15
KR20110048004A (ko) 2011-05-09
WO2011053054A2 (fr) 2011-05-05

Similar Documents

Publication Publication Date Title
WO2013109039A1 (fr) Procédé et appareil de codage/décodage d'images utilisant la prédiction de poids
WO2010044563A2 (fr) Procede et appareil pour coder/decoder les vecteurs de mouvement de plusieurs images de reference, et appareil et procede pour coder/decoder des images les utilisant
WO2010050706A2 (fr) Procédé et appareil de codage d'un vecteur mouvement, procédé et appareil de codage/décodage d'une image faisant appel à ces derniers
WO2020071829A1 (fr) Procédé de codage d'image basé sur l'historique, et appareil associé
WO2020071830A1 (fr) Procédé de codage d'images utilisant des informations de mouvement basées sur l'historique, et dispositif associé
WO2011111954A2 (fr) Procédé et appareil de codage/décodage de vecteur de mouvement, à l'aide de combinaison de résolutions de vecteur de mouvement, et procédé et appareil de codage/décodage d'images utilisant ceux-ci
WO2010120113A2 (fr) Procédé et appareil de sélection d'un mode de prédiction, et procédé et appareil de codage/décodage d'image utilisant ceux-ci
WO2010027182A2 (fr) Procédé et dispositif de codage/décodage d'images au moyen de pixels aléatoires dans un sous-bloc
WO2011145819A2 (fr) Dispositif et procédé de codage/décodage d'image
WO2013002549A2 (fr) Procédés et appareil de codage/décodage d'une image
WO2011031030A2 (fr) Procédé et dispositif de codage/décodage de vecteur de mouvement et procédé et dispositif de codage/décodage d'image les utilisant
WO2010038961A2 (fr) Procédé et appareil pour codage/décodage de vecteurs de mouvement au moyen d'une pluralité d'estimations de vecteurs de mouvement, et procédé et appareil pour codage/décodage d'image au moyen d'un tel appareil et d'un tel procédé
WO2010114283A9 (fr) Procédé et appareil pour traiter des signaux vidéo
WO2011031044A2 (fr) Procédé et dispositif d'encodage/décodage pour des images mobiles à haute résolution
WO2013070006A1 (fr) Procédé et appareil de codage et de décodage vidéo faisant appel à un mode de saut
WO2012081879A1 (fr) Procédé de décodage prédictif inter de films codés
WO2012011672A2 (fr) Procédé et dispositif de codage/décodage d'images par mode de saut étendu
WO2012015275A2 (fr) Procédé et dispositif de codage/décodage d'image à l'aide d'une prédiction fragmentée de bloc
WO2012099440A2 (fr) Appareil et procédé de génération/récupération d'informations de mouvement basées sur un encodage d'indice de vecteur de mouvement prédictif, et appareil et procédé d'encodage/décodage d'images utilisant ce dernier
WO2011142603A2 (fr) Procédé et appareil de filtrage d'images, et procédé et appareil de codage/décodage utilisant ces derniers
WO2012046979A2 (fr) Procédé et appareil de codage de bloc de conversion de fréquence au moyen d'une table de masques de fréquence, et procédé et appareil de codage/décodage vidéo au moyen de celle-ci
WO2017048008A1 (fr) Procédé et appareil de prédiction inter dans un système de codage vidéo
WO2013002550A2 (fr) Méthode et appareil de codage/décodage par décision de mode de l'unité de codage grande vitesse
WO2013077659A1 (fr) Procédé et appareil de codage/décodage prédictif d'un vecteur de mouvement
WO2011021910A2 (fr) Procédé et appareil de codage/décodage à prédiction intra

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10827138

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase in:

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS (EPO FORM 1205A DATED 02-08-2012)

122 Ep: pct app. not ent. europ. phase

Ref document number: 10827138

Country of ref document: EP

Kind code of ref document: A2