WO2011021912A2 - Method and apparatus for interpolating reference picture and method and apparatus for encoding/decoding image using same - Google Patents
Method and apparatus for interpolating reference picture and method and apparatus for encoding/decoding image using same Download PDFInfo
- Publication number
- WO2011021912A2 WO2011021912A2 PCT/KR2010/005569 KR2010005569W WO2011021912A2 WO 2011021912 A2 WO2011021912 A2 WO 2011021912A2 KR 2010005569 W KR2010005569 W KR 2010005569W WO 2011021912 A2 WO2011021912 A2 WO 2011021912A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- resolution
- motion vector
- reference picture
- filter
- region
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
- H04N19/82—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Definitions
- the present invention relates to a reference picture interpolation method and apparatus, and an image encoding / decoding method and apparatus using the same. More specifically, in inter-prediction encoding and inter-prediction decoding of an image, a filter or filter coefficient for interpolating a reference picture is determined according to characteristics of the image to interpolate the reference picture and interpolate the reference picture through multi-step filtering.
- the present invention relates to a method and an apparatus for improving coding efficiency by adaptively changing the resolution of a motion vector.
- the encoding of data for a video is made up of intra prediction encoding and inter prediction encoding.
- Such intra prediction encoding or inter prediction encoding is widely used for compressing various data in an effective way to reduce correlation between data.
- the motion vector of the current block determined by estimating the motion of the current block to be currently encoded by inter prediction encoding is closely correlated with the motion vector of the neighboring block, the motion vector of the neighboring block is determined from the motion vector of the current block.
- PMV Predicted Motion Vector
- DMV Differential Motion Vector
- the encoder encodes and transmits only a difference vector, which is a difference between the predicted motion vector and the current motion vector determined by estimating the motion of the current block in a previously encoded, decoded and reconstructed reference frame. .
- the decoder also predicts the motion vector of the current block by using the motion vector of the neighboring block previously decoded, and reconstructs the current motion vector by adding the transmitted difference vector and the predicted motion vector.
- the resolution when performing inter prediction encoding, the resolution may be collectively increased by interpolating a reference frame, and then encoded by a difference vector, which is a difference between the prediction motion vector determined by estimating the motion of the current block and the current motion vector, and then transmitted. have.
- a difference vector which is a difference between the prediction motion vector determined by estimating the motion of the current block and the current motion vector, and then transmitted.
- the resolution of the reference frame image that is, the reference image
- more accurate inter prediction is possible, thereby reducing the amount of bits generated by encoding the residual signal between the original image and the predicted image, but increasing the resolution vector of the motion vector.
- the amount of bits generated by encoding also increases.
- the resolution of the reference picture becomes smaller, the bit amount generated by encoding the residual signal becomes larger, but since the resolution of the motion vector becomes smaller, the bit amount generated by encoding the difference vector also becomes smaller.
- the present invention in inter-prediction encoding and inter-prediction decoding of an image, interpolates a reference picture by determining a filter or filter coefficient for interpolating the reference picture according to characteristics of the image, and performs multi-step filtering.
- the main purpose is to improve coding efficiency by interpolating a reference picture or adaptively changing the resolution of a motion vector.
- the present invention provides an apparatus for interpolating a reference picture, comprising: a filter selector for selecting a plurality of filters for interpolating a reference picture; And a filter for generating a reference picture having a target precision by gradually filtering the reference picture using a plurality of filters.
- an apparatus for interpolating a reference picture comprising: a filter information decoder configured to decode a bitstream to restore information for a plurality of filters; And a filter for generating a reference picture having a target precision by gradually filtering the reference picture using a plurality of filters identified by the information on the plurality of filters to be reconstructed, and providing a reference picture interpolation apparatus.
- an apparatus for encoding an image comprising: a reference picture interpolator for filtering a reference picture step by step using a plurality of filters and interpolating a reference picture with a target precision; And an inter prediction encoder configured to inter prediction encode an image using a reference picture interpolated at a target precision.
- a target picture is gradually filtered by using a plurality of filters identified by information on a plurality of filters reconstructed by decoding a bitstream.
- a reference picture interpolator for interpolating with a precision reference picture;
- an inter prediction decoder configured to reconstruct an image by inter-prediction-decoding a bitstream using a reference picture having an interpolated target precision.
- a method of interpolating a reference picture comprising: selecting a first filter for interpolating subpixels using integer pixels of a reference picture; Interpolating a reference picture using a first filter; Selecting a second filter for interpolating a subpixel having a target precision using the subpixel interpolated with the integer pixel; And interpolating the interpolated reference picture using the second filter.
- a method of interpolating a reference picture comprising: restoring information on a first filter and information on a second filter by decoding a bitstream; Interpolating a reference picture using the first filter identified by the information for the first filter; And interpolating the interpolated reference picture using the second filter identified by the information on the second filter.
- a method of encoding an image comprising: interpolating a reference picture step by step using a plurality of filters and interpolating a reference picture with a target precision; And inter-prediction encoding the image using the reference picture interpolated with the target precision.
- a reference picture is gradually filtered using a plurality of filters identified by information on a plurality of filters that are reconstructed by decoding a bitstream. Interpolating to a reference picture of precision; And reconstructing the image by inter-prediction decoding the bitstream using the reference picture of the interpolated target precision.
- a filter or filter coefficient for interpolating a reference picture is determined according to characteristics of an image to interpolate the reference picture, and the reference picture is interpolated through multi-step filtering or the image is divided into predetermined regions.
- FIG. 1 is a block diagram schematically illustrating a video encoding apparatus
- FIG. 2 is an exemplary diagram for explaining a process of interpolating a reference picture
- 3 is an exemplary diagram for explaining a process of determining a predicted motion vector
- FIG. 4 is an exemplary diagram showing a cutting type unary code having a maximum value T of 10
- FIG. 5 is an exemplary diagram illustrating an exponential golem code of 0, 1, and 2;
- 5 is an exemplary diagram showing exponential Gollum codes of 0, 1, and 2
- FIG. 6 is an exemplary view showing a cut-unary / K-order index gollum combined code when the maximum value T is 9 and K is 3;
- FIG. 7 is an exemplary diagram illustrating a zigzag scan order
- FIG. 8 is a block diagram schematically illustrating a video decoding apparatus
- FIG. 9 is a block diagram schematically illustrating a video encoding apparatus according to an embodiment of the present invention.
- FIG. 10 is an exemplary diagram hierarchically expressing a motion vector resolution in a quad tree structure according to an embodiment of the present invention
- FIG. 11 is an exemplary diagram for describing a process of encoding a motion vector resolution hierarchically represented in a quad tree structure according to an embodiment of the present invention
- FIG. 12 is an exemplary diagram illustrating a motion vector resolution of a region determined according to an embodiment of the present invention.
- FIG. 13 is an exemplary diagram hierarchically expressing a motion vector resolution in a tag tree structure according to an embodiment of the present invention
- FIG. 14 is an exemplary diagram illustrating a result of encoding a motion vector resolution expressed hierarchically expressed in a tag tree structure according to an embodiment of the present invention
- 15 is an exemplary diagram for describing a process of determining a motion vector resolution using pixels around a region according to an embodiment of the present invention
- 16 is an exemplary diagram for explaining a process of predicting a predicted motion vector according to an embodiment of the present invention.
- 17 is a flowchart illustrating a video encoding method using an adaptive motion vector resolution according to an embodiment of the present invention.
- FIG. 18 is a block diagram schematically illustrating an apparatus for decoding an image using adaptive motion vector resolution according to an embodiment of the present invention
- 19 is a flowchart illustrating an image decoding method using adaptive motion vector resolution according to an embodiment of the present invention.
- FIG. 20 is an exemplary diagram illustrating another method of dividing a node into lower layers according to an embodiment of the present invention.
- 21 is an exemplary diagram illustrating a binarization bit string allocated to each symbol according to a motion vector resolution according to an embodiment of the present invention
- 22 is a diagram illustrating an optimal motion vector of a neighboring block and a current block to explain a process of determining a resolution of a motion vector
- FIG. 23 is a diagram illustrating a conversion equation according to a motion vector resolution
- 24 is a diagram illustrating a table obtained by converting a resolution of a motion vector of a neighboring block into a resolution of a block to be currently encoded;
- 25 is a diagram illustrating a code number table of a differential motion vector according to a motion vector resolution
- FIG. 26 is a diagram illustrating an optimal motion vector of a neighboring block and a current block to explain a process of determining a resolution of a differential motion vector
- 27 is a diagram illustrating a differential motion vector code number table according to differential motion vector resolution
- FIG. 28 is a diagram illustrating a motion vector of a current block and a reference motion vector of a neighboring block
- 29 is a diagram illustrating a code number table of a differential reference motion vector according to a differential reference motion vector resolution
- FIG. 30 is a diagram illustrating an example of indexing and encoding a reference picture based on a distance between a current picture and a reference picture;
- FIG. 31 is a diagram illustrating a reference picture index according to a reference picture number and a resolution
- FIG. 32 is a diagram schematically illustrating an image encoding apparatus using an adaptive motion vector according to another embodiment of the present invention.
- 33 is a diagram illustrating a resolution identification flag when the specified resolutions are 1/2 and 1/4;
- 34 is a diagram illustrating a neighboring block of a current block
- 35 is a diagram illustrating a probability model according to a condition
- 36 is a diagram illustrating a resolution identification flag when the specified resolution is 1/2, 1/4, 1/8;
- FIG. 37 is a view showing another example of a probability model different from a condition
- 38 and 39 are diagrams illustrating the degree of adaptability according to the distance between a current picture and a reference picture
- 40 is a diagram showing an example of storing a different reference picture index number for each predetermined resolution set
- 41 is a diagram illustrating a structure in which reference pictures are encoded
- 43 is a diagram illustrating a resolution of a current block and a resolution of a neighboring block
- 44 is a diagram illustrating another example of a resolution of a current block and a resolution of a neighboring block
- 46 is a diagram illustrating a resolution of a current block and a resolution of a neighboring block
- 47 is a view showing a video encoding method using an adaptive motion vector resolution according to another embodiment of the present invention.
- 49 is a diagram illustrating a peripheral motion vector of the current block
- 50 is a diagram illustrating a converted value of a peripheral motion vector according to a current resolution
- 51 is a flowchart illustrating a method of decoding an image using adaptive motion vector resolution according to another embodiment of the present invention.
- FIG. 52 is a block diagram schematically illustrating a video encoding apparatus according to a third embodiment of the present invention.
- 53 is a block diagram schematically illustrating a reference picture interpolation apparatus for image encoding according to an embodiment of the present invention
- FIG. 54 and 55 are exemplary diagrams illustrating a filter used for multi-stage filtering according to an embodiment of the present invention
- FIG. 56 is an exemplary diagram illustrating a multi-stage filtering process according to an embodiment of the present invention.
- 57 is a flowchart illustrating a reference picture interpolation method for image encoding according to an embodiment of the present invention
- FIG. 58 is a flowchart for explaining a video encoding method according to a third embodiment of the present invention.
- FIG. 59 is a block diagram schematically illustrating an image decoding apparatus according to a third embodiment of the present invention.
- 60 is a block diagram schematically illustrating a reference picture interpolation apparatus for image decoding according to an embodiment of the present invention
- 61 is a flowchart illustrating a reference picture interpolation method for image decoding according to an embodiment of the present invention
- FIG. 62 is a flowchart for explaining a video decoding method according to a third embodiment of the present invention.
- 63 is a diagram illustrating a table of filter tabs according to resolutions
- 64 is a diagram illustrating a table showing types of filter taps depending on the resolution of a motion vector
- 65 is a view showing another example of the type of filter tap that varies depending on the resolution of a motion vector
- 66 is a diagram illustrating a table in a case of finding an optimal position using resolutions of 1/2 and 1/4 positions;
- 67 is a diagram illustrating an image encoding apparatus 6700 according to a fourth embodiment of the present invention.
- FIG. 68 is a view showing another example of types of filter taps according to the resolution of a motion vector
- 69 is a diagram illustrating an identification flag according to a resolution
- 70 is a diagram illustrating another example of a resolution identification flag according to resolution
- 71 is a diagram illustrating a video encoding method according to a fourth embodiment of the present invention.
- FIG. 72 is a block diagram schematically illustrating an image decoding apparatus according to a fourth embodiment of the present invention.
- 73 is a diagram illustrating a video decoding method according to the fourth embodiment of the present invention.
- a video encoding apparatus (Video Encoding Apparatus), a video decoding apparatus (Video Decoding Apparatus) to be described below is a personal computer (PC), notebook computer, personal digital assistant (PDA), portable multimedia player (PMP) : Portable Multimedia Player (PSP), PlayStation Portable (PSP: PlayStation Portable), Wireless Communication Terminal (Wireless Communication Terminal), TV, etc., and communication devices, such as communication modems and images for communicating with various devices or wired and wireless communication network It refers to various devices having various programs for encoding or decoding, a memory for storing data, a microprocessor for executing and controlling a program, and the like.
- the image encoded in the bitstream by the video encoding apparatus is real-time or non-real-time through the wired or wireless communication network, such as the Internet, local area wireless communication network, wireless LAN network, WiBro network, mobile communication network, or the like, or a cable, universal serial bus (USB: Universal)
- the image decoding apparatus may be transmitted to a video decoding apparatus through a communication interface such as a serial bus, decoded by the video decoding apparatus, reconstructed, and played back.
- image encoding apparatus and the image decoding apparatus may be provided with functions for performing intra prediction as well as inter prediction, detailed descriptions will be omitted in order to prevent confusion because they are not directly related to the embodiment of the present invention.
- a moving picture is composed of a series of pictures, and each picture is divided into a predetermined area such as a block.
- the divided blocks are largely classified into intra blocks and inter blocks according to encoding methods.
- An intra block refers to a block that is encoded by using an intra prediction coding method.
- An intra prediction coding is performed by using pixels of blocks that are previously encoded, decoded, and reconstructed in a current picture that performs current encoding.
- a prediction block is generated by predicting pixels of a block, and a difference value with pixels of the current block is encoded.
- An inter block refers to a block that is encoded using inter prediction coding.
- Inter prediction coding generates a prediction block by predicting a current block within a current picture by referring to one or more past or future pictures, and then generates a current block. This is a method of encoding a difference value with.
- a picture referred to for encoding or decoding the current picture is referred to as a reference picture.
- the block may be a macroblock of MxN size or a subblock or subblock of OxP size.
- encoding and decoding in units of blocks is an example, and an image is encoded into a standardized area or an unstructured area such as a block. You may be able to decrypt it.
- FIG. 1 is a block diagram schematically illustrating a video encoding apparatus.
- the image encoding apparatus 100 may include a predictor 110, a subtractor 120, a transformer 130, a quantizer 140, an encoder 150, an inverse quantizer, and an inverse quantizer. 160, an inverse transformer 170, an adder 180, and a memory 190.
- the predictor 110 inter-predictions the current block to generate a predicted block. That is, the predictor 110 determines the original pixel values of the pixels of the current block through motion estimation when the block to be currently encoded, that is, the current block, is input from the input picture. A prediction block having predicted pixel values is generated and output by predicting using a motion vector of the current block.
- the subtractor 120 subtracts the current block and the prediction block to generate a residual block of the current block.
- the output residual block includes a residual signal, which has a value obtained by subtracting the prediction pixel value of the prediction block from the original pixel value of the current block.
- the transformer 130 transforms the residual block to generate a transformed block. That is, the transformer 130 converts the residual signal of the residual block output from the subtractor 120 into a frequency domain to generate and output a transform block having a transform coefficient.
- a method of transforming the residual signal into the frequency domain a Discrete Cosine Transform (DCT) -based transform or a Hadamard Transform (DCT) may be used, but the present invention is not limited thereto.
- Various transformation techniques such as the technique may be used, and the residual signal is transformed into a frequency domain and transformed into transformation coefficients.
- the quantizer 140 quantizes the transform block to generate a quantized transform block. That is, the quantizer 140 quantizes the transform coefficients of the transform block output from the transformer 130 to generate and output a quantized transform block having quantized transform coefficients.
- DZUTQ dead zone uniform threshold quantization
- DZUTQ quantization weighted matrix
- various quantization methods such as quantization improved therefrom may be used.
- the encoder 150 may output a bitstream by encoding the quantized transform block. That is, the encoder 150 uses various encoding techniques, such as entropy encoding, on a frequency coefficient string scanned by various scan methods such as zigzag scan of the quantized transform coefficients output from the quantizer 140. To generate and output a bitstream including additional information necessary for encoding and decoding the corresponding block (for example, information on a prediction mode, a quantization coefficient, a motion vector, etc.). can do.
- Inverse quantizer 160 inverse quantizes the quantized transform block. That is, the inverse quantizer 160 inversely quantizes and outputs quantized transform coefficients of the quantized transform block output from the quantizer 140.
- Inverse transformer 170 inversely transforms an inverse quantized transform block. That is, the inverse transformer 170 inversely transforms the transform coefficients inversely quantized by the inverse quantizer 160 to restore a residual block having the reconstructed residual coefficients.
- the adder 180 reconstructs the current block by adding the prediction block predicted by the predictor 110 and the residual block inversely transformed and reconstructed by the inverse transformer 170.
- the reconstructed current block is stored in the memory 190, and the reconstructed current block stored in the memory 190 is accumulated in units of blocks or in units of pictures and transferred to the predictor 110 in units of pictures, and then other blocks such as the next block or the next picture are stored. Can be used to predict blocks.
- the predictor 110 determines the motion vector of the current block by estimating the motion of the current block by using the reference picture stored in the memory 190.
- the resolution of the reference picture is interpolated by interpolating the reference picture stored in the memory 190. Can be used to perform motion estimation.
- FIG. 2 is an exemplary diagram for describing a process of interpolating a reference picture.
- FIG. 2 illustrates pixels of a reference picture stored in the memory 190 and pixels interpolated with subpixels. Filtering the previously reconstructed pixels A to U of the reference picture with an interpolation filter can generate subpixels a to s, and the subpixels a to s are interpolated between the already reconstructed pixels as shown.
- the resolution of the reference picture may be four times higher.
- Motion Estimation is a process of finding a part most similar to the current block in the interpolated reference picture and outputting a block of the corresponding part and a motion vector indicating the corresponding block.
- the prediction block found in this process is subtracted from the current block in the subtractor 120, through which a residual block having a residual signal is generated.
- the motion vector is encoded by the encoder 160.
- the encoder 160 may predict the motion vector of the current block by using the motion vectors of blocks adjacent to the current block and encode the motion vector by using the predicted motion vector, that is, the predicted motion vector. .
- 3 is an exemplary diagram for describing a process of determining a predicted motion vector.
- the current block is X
- the motion vector of the adjacent block A on the left of the current block is MV_A (x component: MVx_A, y component: MVy_A), and the motion of the adjacent block B above the current block.
- Each component of the predicted motion vector MV_pred_X may be determined as a median value for each component of the motion vector of the neighboring block of the current block, as shown in Equation 1.
- the motion vector prediction method in the present invention is not limited to the method introduced herein.
- the encoder 160 may encode a difference vector having a difference between the predicted motion vector and the motion vector.
- a difference vector for example, Universal Variable Length Coding (UVLC) Entropy Coding (UVLC ') and Context-Adaptive Binary Arithmetic Coding (CABAC) may be used.
- UVLC Universal Variable Length Coding
- CABAC Context-Adaptive Binary Arithmetic Coding
- the encoding method in the encoder 160 is not limited to the method introduced here.
- the difference vector When the difference vector is encoded using UVLC, the difference vector may be encoded using a K-th Order Exp-Golomb Code. At this time, the value of K may be '0' or any other value.
- the prefix of the Kth order Gollum code It has a truncated unary code corresponding to, and the suffix can be represented as a binary bit string of x + 2 k (1-2 l (x) ) values having a length of k + l (x). have.
- FIG. 4 is an exemplary diagram showing a cutting type unary code having a maximum value T of 10
- FIG. 5 is an exemplary diagram illustrating an exponential Golem code of 0, 1, and 2.
- the difference vector when the difference vector is encoded using CABAC, the difference vector is to be encoded using the sign bit of the concatenated Truncated Unary / K-th Order Exp-Golomb Code. Can be.
- FIG. 6 is an exemplary view showing a cut-unary / K-order index gollum combined code when the maximum value T is 9 and K is 3.
- FIG. 7 is an exemplary diagram illustrating a zigzag scan sequence.
- the quantized frequency coefficients quantized by the quantizer 140 may be scanned by the encoder 160 and encoded in the form of quantized frequency coefficient sequences.
- the quantized frequency coefficients in the block form may be scanned in a zigzag order as shown in FIG. 7. However, it may be scanned in various other scanning orders.
- FIG. 8 is a block diagram schematically illustrating a video decoding apparatus.
- the image decoding apparatus 800 may include a decoder 810, an inverse quantizer 820, an inverse transformer 830, a predictor 840, an adder 850, and a memory 860.
- the decoder 810 extracts a quantized transform block by decoding the bitstream. That is, the decoder 810 decodes and inversely scans the bit string extracted from the input bitstream to restore the quantized transform block having the quantized transform coefficients. In this case, the decoder 810 may decode using an encoding technique such as entropy encoding used by the encoder 160 of the image encoding apparatus 100.
- the decoder 810 extracts and decodes the encoded difference vector from the bitstream, reconstructs the difference vector, predicts the motion vector of the current block, adds the predicted motion vector and the reconstructed difference vector, and adds the motion vector of the current block. Can be restored.
- Inverse quantizer 820 inverse quantizes the quantized transform block. That is, the inverse quantizer 820 inverse quantizes the quantized transform coefficients of the quantized transform block output from the decoder 810. In this case, the inverse quantizer 820 performs inverse quantization by inversely performing the quantization technique used by the quantizer 140 of the image encoding apparatus 100.
- the inverse transformer 830 inversely transforms the inverse quantized transform block output from the inverse quantizer 820 to restore the residual block. That is, the inverse transformer 830 restores the residual block having the reconstructed residual signal by inversely transforming the inverse quantized transform coefficients of the inverse quantized transform block output from the inverse quantizer 820. Inverse conversion is performed by performing the conversion technique used in the converter 130 of the inverse.
- the predictor 840 generates a prediction block by predicting the current block by using the motion vector of the current block extracted from the bitstream and decoded.
- the adder 850 reconstructs the current block by adding the prediction block and the reconstructed residual block. That is, the adder 850 adds the predicted pixel value of the predicted block output from the predictor 840 and the reconstructed residual signal of the restored residual block output from the inverse transformer 830 to add the reconstructed pixel value of the current block. Restore the current block.
- the current block reconstructed by the adder 850 is stored in the memory 860 and stored in block units or as reference pictures in picture units, so that the predictor 840 can be used to predict the next block.
- the image encoding apparatus 100 and the image decoding apparatus 800 described above with reference to FIGS. 1 through 8 interpolate the reference picture in subpixel units to increase the resolution of the reference picture and the motion vector, thereby inter prediction encoding and inter prediction decoding.
- the reference picture may be interpolated at the same resolution in units of pictures or in groups of pictures (GOPs) and the resolution of the motion vector may be increased.
- the prediction is more accurate by increasing the resolution of the reference picture and the bit amount generated by encoding the residual signal can be reduced accordingly.
- the resolution of the motion vector is also increased to encode the motion vector. Since the amount of bits to be collectively increased, as a result, even in the case of performing inter prediction encoding by increasing the resolution of the reference picture, the encoding efficiency may not be greatly increased or the encoding efficiency may be lowered depending on the image.
- inter prediction encoding is performed by adaptively increasing the resolution of a reference picture in an area unit having an arbitrary size, such as a picture unit, a slice unit, a block unit, etc. according to characteristics of an image, thereby increasing the resolution or complexity of an image.
- a method and apparatus for inter prediction encoding and decoding by increasing the resolution in a small region and lowering the resolution in a region where the image is not complicated or large in motion are described.
- FIG. 9 is a block diagram schematically illustrating a video encoding apparatus according to a first embodiment of the present invention.
- the image encoding apparatus 900 may include an inter prediction encoder (910), a resolution change flag encoder (920), a resolution determiner (930), A resolution encoder 940 and a differential vector encoder 950 may be included.
- the resolution change flag generator 920, the resolution encoder 940, and the difference vector encoder 950 are not necessarily all included in the image encoding apparatus 900, but may be selectively included according to an implementation method.
- the inter prediction encoder 910 inter-prediction encodes an image in units of regions using a motion vector according to a motion vector resolution determined for each region or motion vector of the image.
- the inter prediction encoder 910 may be implemented by the image encoding apparatus 100 described above with reference to FIG. 1.
- the functions of the additional components and the encoder 150 in the inter prediction encoder 910 are included.
- the overlapping function may be omitted in the encoder 150.
- the overlapping function may be omitted in the predictor 110.
- one or more components added among the resolution encoder 940 and the differential vector encoder 950 may be configured separately from the inter prediction encoder 910 as shown in FIG. 9, but may be configured as an encoder in the inter prediction encoder 910. It may be configured integrally with 150.
- the flag information generated by the resolution change flag generator 920 may be converted into a bitstream by the resolution change flag generator 920, the function of converting the bitstream into a bitstream is performed by the encoder 150 of the inter prediction encoder 910. It may be.
- the image encoding apparatus 100 described above with reference to FIG. 1 has been described as encoding an image in units of blocks.
- the inter prediction encoder 910 uses a variety of blocks such as macroblocks or subblocks, slices, pictures, and the like. It may be divided into regions of shape and size and encoded in an area unit having an arbitrary size.
- the predetermined area may be a macroblock of 16 ⁇ 16 size, but may be a block of various sizes and shapes such as a 64 ⁇ 64 size block and a 32 ⁇ 16 size block.
- the inter prediction encoder 910 is a motion vector resolution determined differently for each region.
- Each region may be inter prediction encoded by using a motion vector having.
- an image region in which the motion vector resolution may be determined may be a picture (frame or field), a slice, or an image block having a predetermined size.
- the inter prediction encoder 910 increases the resolution by interpolating a reference picture that is already encoded, decoded, and reconstructed, and then performs motion estimation.
- an interpolation filter for interpolating a reference picture various filters such as a Winner filter, a bilinear filter, a Kalman filter, and the like can be used.
- the resolution for interpolation is 2/1 pixel.
- the resolution of various fractional and integer pixel units such as 1/1 pixel, 1/2 pixel, 1/4 pixel, 1/8 pixel unit, etc. may be applied.
- different filter coefficients may be used or the number of filter coefficients may be changed according to the various resolutions.
- interpolation when the resolution is 1/2 pixel unit, interpolation may be performed using a Wiener filter, and when the resolution is 1/4 pixel unit, interpolation may be performed using a Kalman filter.
- the number of filter taps may be changed when interpolating each resolution. For example, when the resolution is 1/2 pixel unit, the interpolation may be performed using an 8 tap winner filter, and when the resolution is 1/4 pixel unit, the interpolation may be performed using a 6 tap winner filter.
- the inter prediction encoder 910 may encode the filter coefficients by determining the optimum filter coefficient for each motion vector resolution that minimizes the error between the current picture to be encoded and the reference picture.
- the filter may use any filter such as a Wiener filter or a Kalman filter, and the number of filter taps may be several, and the number of filters and filter taps may be different for each resolution.
- the inter prediction encoder 910 may inter predict using a reference picture interpolated using a different filter according to the resolution of a region or a motion vector. For example, in order to calculate an optimal filter that minimizes a sum of squared difference (SSD) between a current picture to be encoded and a reference picture, as shown in Equation 2, an optimal filter tap is selected for each resolution using the Wiener filter. Can be calculated
- Equation 2 S denotes a pixel of the current picture, h sp denotes a filter coefficient of the pixel domain, P denotes a pixel of the reference picture, e SP denotes an error, and x and y denote a position of the current pixel. Indicates.
- the inter prediction encoder 910 calculates filter coefficients for each resolution by using a Wiener hop equation as shown in Equation 2, encodes the optimal filter coefficients for each resolution, and includes them in the bitstream, and performs interpolation filtering on the reference picture.
- reference pictures for each resolution may be generated and encoded.
- the filter coefficient of the 6-tap Winner filter at 1/2 resolution, the 8-tap Kalman filter at 1/4 resolution, the filter coefficient of the linear filter at 1/8 resolution are calculated, and the filter coefficients are encoded and included in the bitstream.
- the reference picture for each resolution can be interpolated and encoded.
- the inter prediction encoder 910 encodes using a reference picture interpolated by a 6-tap winner filter when the resolution of the current region or motion vector is 1/2 resolution, and interpolates by an 8-tap Kalman filter when 1/4 resolution.
- the encoded reference picture may be used.
- the resolution change flag generator 920 may generate and include a resolution change flag indicating whether the motion vector resolution and / or the resolution of the differential motion vector is determined for each region or motion vector of the image.
- the area for changing the motion vector resolution and / or the resolution of the differential motion vector by the resolution change flag may be a block, a macroblock, a group grouping blocks, a group grouping macroblocks, or an area having an arbitrary size such as MxN. have. That is, the resolution change flag generator 920 inter-predictively encodes a motion vector having a fixed motion vector resolution for all regions or sub-regions of an image or motion vector resolution for each region (or motion vector).
- the inter prediction encoding may be performed using a motion vector having a determined motion vector resolution, and a resolution change flag indicating whether to generate a differential motion vector having a fixed resolution may be generated and included in the bitstream.
- the resolution change flag may be determined and generated according to setting information input by a user, or may be determined and generated according to a predetermined criterion by analyzing an image to be encoded. Such a resolution change flag may be included in a header of the bitstream.
- the header may be a picture parameter set, a sequence parameter set, a slice header, or the like.
- the inter prediction encoder 910 is fixed for the subregion defined in the header. Inter-prediction encoding of each region is performed by using the motion vector of each region having a set motion vector resolution. For example, if the resolution change flag included in a slice header of a slice indicates that the motion vector resolution is fixed, the motion vector resolution having the least rate-distortion cost is determined for the image of the slice and all regions of the slice are determined. For, the inter prediction encoding may be performed using a motion vector of each region having a corresponding motion vector resolution.
- the inter prediction encoder 910 may determine each region determined by the resolution determiner 930. Each region is inter prediction encoded by using the motion vector of each region having a motion vector resolution of. For example, if the resolution change flag included in the slice header of a slice indicates that the motion vector resolution and / or the resolution of the differential motion vector is adaptively changed for each region or motion vector, the slice is determined by the resolution determiner 910. An area of each slice may be inter prediction encoded by using a motion vector of each area having a motion vector resolution determined for each area within.
- the slice is determined by the resolution determiner 930.
- Inter prediction encoding may be performed using a motion vector resolution determined for each motion vector in the frame.
- the resolution determiner 930 When the resolution determiner 930 generates a flag indicating that the resolution of the motion vector resolution and / or the differential motion vector is adaptively changed for each area or motion vector by the resolution change flag generator 920, the resolution determiner 930 generates a unit of motion or unit of motion vector.
- Optimum motion vector resolution and / or differential motion vector resolution using some cost function, such as RD Cost (Rate-Distortion Cost) while changing the resolution of the motion vector resolution and / or differential motion vector. Determine.
- the optimal motion vector resolution and / or the resolution of the differential motion vector only represent the motion vector resolution and / or the resolution of the differential motion vector determined using a predetermined cost function, and the motion vector resolution and / or are determined in this way.
- the motion vector resolution and / or the resolution of the differential motion vector may be the optimal motion vector resolution and / or the resolution of the differential motion vector.
- the resolution encoder 940 may encode a motion vector resolution and / or a resolution of a differential motion vector determined for each region or motion vector. That is, the resolution encoder 940 encodes a motion vector resolution identification flag indicating the motion vector resolution of each region determined by the resolution determiner 930 and / or a resolution identification flag of the differential motion vector indicating the resolution of the differential motion vector. It can be included in the bitstream.
- the implementation of the motion vector resolution identification flag and / or the resolution identification flag of the differential motion vector is possible in a variety of ways.
- the resolution indicated by the resolution identification flag may be the resolution of the motion vector resolution or the differential motion vector, or may be used simultaneously for the motion vector resolution and the resolution of the differential motion vector.
- the differential vector encoder 950 may encode a differential motion vector that is a difference between a motion vector and a predicted motion vector according to a motion vector resolution determined for each region or motion vector.
- the differential motion vector may be encoded according to the resolution of the differential motion vector.
- the resolution identification flag indicating the motion vector resolution may indicate both the x component and the y component of the motion vector or may indicate respective resolutions. That is, when the camera photographing the image moves or an object in the image moves, the resolution determiner 930 may determine different resolutions of the x component and the y component of the motion vector for motion estimation. For example, the resolution determiner may determine resolution of 1/8 pixel unit for the x component of a motion vector of a certain area and resolution of 1/2 pixel unit for the y component.
- the 910 may inter-prediction the corresponding region by determining a motion vector with different resolutions for each of the x component and the y component for the motion vector of the corresponding region, and using the determined motion vector to perform motion estimation and motion compensation.
- FIG. 22 is a diagram illustrating an optimal motion vector of a neighboring block and a current block in order to explain a process of determining the resolution of a motion vector in the resolution determiner 930.
- the resolution changing flag generator 920 adaptively for each region or motion vector (in the second embodiment, the resolution designation flag generator 3220) Can be set whether or not the motion vector resolution and / or the resolution of the differential motion vector are changed or fixed by the resolution designation flag generated by the controller), and the types of resolutions of the current block and neighboring blocks are 1/2, 1/4, Assume that 1/8 and the optimal resolution is determined as shown in FIG.
- the resolution of block A is 1/2 and the motion vector is (4/2, -8/2)
- the resolution of block B is 1/4 and the motion vector is (36/4, -28/4)
- the resolution of block C is 1/8 and the motion vector is (136/8, -104/8)
- the current block X is 1/4 and the motion vector is (16/4, 20/4).
- the predicted motion vector may depend on the resolution of the current motion vector.
- a resolution conversion process may be performed such that the resolution of the neighboring motion vector is the same as the resolution of the current motion vector to calculate the predicted motion vector.
- FIG. 23 is a diagram illustrating a conversion equation according to a motion vector resolution
- FIG. 24 is a diagram illustrating a table obtained by converting a resolution of a motion vector of a neighboring block to a resolution of a block X to be currently encoded.
- the predicted motion vector can be obtained using the neighboring motion vectors. If the neighboring motion vectors are stored according to their resolutions, they can be converted using multiplication and division when the neighboring motion vectors are different from the current motion vectors. . In this case, the resolution conversion process may be performed when a prediction motion vector is obtained. Alternatively, if the neighboring motion vector is stored based on the highest resolution, the current motion vector may be converted using division if it is not the highest resolution. In this case, the resolution conversion process may perform a resolution conversion process of converting the encoded motion vector to the highest resolution if it is not the highest resolution.
- the current motion vector may be converted using multiplication and division when the current motion vector is different from the reference resolution in which the peripheral motion vectors are stored.
- the resolution conversion process may perform a resolution conversion process of converting the motion vector to the reference resolution when the motion vector resolution is not the reference resolution when the motion vector is stored.
- division rounding, rounding, and rounding can be performed. As shown in Figs. 23 and 24, rounding is used in the example.
- the example uses a peripheral motion vector stored according to each resolution.
- a predicted motion vector can be obtained with reference to the table of FIG. 23.
- the predicted motion vector can be obtained using median, and median can be obtained for each component.
- the predicted motion vector is (32/4, -28/7).
- the differential motion vector is obtained using the predicted motion vector thus obtained.
- the differential motion vector can be obtained by the difference between the motion vector and the predicted motion vector as shown in Equation (3).
- 25 is a diagram illustrating a code number table of differential motion vectors according to motion vector resolution.
- the differential motion vector may be encoded for each motion vector value by using the code number table of the differential motion vector according to the motion vector resolution as shown in FIG. 25.
- the prediction motion vector may be obtained as follows using the example of FIG. 22.
- the median for each component can be obtained without converting the surrounding motion vector according to the current resolution.
- the predicted motion vector is (36/4, -104/8).
- the differential motion vector is obtained using the predicted motion vector thus obtained.
- the differential motion vector may be obtained as a difference between the motion vector and the predicted motion vector as shown in Equation 4.
- the differential motion vector may be encoded with respect to the motion vector value for each resolution by using a code number table of the differential motion vector according to the motion vector resolution as shown in FIG. 25.
- the prediction motion vector may be obtained as follows using the example of FIG. 22.
- the median instead of converting the surrounding motion vector according to the current resolution, the median may be obtained for each component and then converted according to the current resolution.
- the predicted motion vector becomes (36/4, -52/4) with reference to FIG.
- the differential motion vector is obtained using the predicted motion vector thus obtained.
- the differential motion vector may be obtained as a difference between the motion vector and the predicted motion vector as shown in Equation 5.
- the differential motion vector may be encoded with respect to the motion vector value for each resolution by using a code number table of the differential motion vector according to the motion vector resolution as shown in FIG. 25.
- the prediction motion vector may be obtained as follows using the example of FIG. 22.
- the median may also be obtained by using only motion vectors around the same resolution as the current motion vector.
- the predicted motion vector becomes (36/4, -28/4).
- the differential motion vector is obtained using the predicted motion vector thus obtained.
- the differential motion vector may be obtained as a difference between the motion vector and the predicted motion vector as shown in Equation 6.
- the differential motion vector may be encoded with respect to the motion vector value for each resolution by using a code number table of the differential motion vector according to the motion vector resolution as shown in FIG. 25.
- the resolution encoder 940 may encode the resolution change flag and the type of resolution in the header (meaning the resolution designation flag in the second embodiment), and at this time, the optimally determined resolution identification flag is encoded by 1/4.
- the differential vector encoder 950 may encode the differential motion vector obtained by using the predicted motion vector calculated using the peripheral motion vector converted according to the resolution determined by the resolution determiner 930.
- FIG. 26 is a diagram illustrating an optimal motion vector of a neighboring block and a current block in order to explain a process of determining a resolution of a differential motion vector in the resolution determiner 930.
- the predicted motion vector may be calculated as shown in Equation 7 below.
- the MVDx can be encoded with 1/8 and the MVDy can be encoded with 1/4.
- FIG. 27 is a diagram illustrating a differential motion vector code number table according to differential motion vector resolution.
- the resolution encoder 940 encodes the differential motion vector resolution identification flag by (1/8, 1/4) for each of the x and y components, encodes the code number of the differential motion vector as (1, 1), and performs the differential motion. Signs can be separately encoded for x and y components of the vector.
- the differential vector encoder 950 encodes a differential motion vector
- a reference resolution is determined, and a motion vector having a resolution other than the reference resolution is converted into a motion vector having a reference resolution and obtained from a reference motion vector of a neighboring block.
- the differential motion vector is calculated using the predictive motion vector. If the motion vector has a resolution other than the reference resolution, a method of additionally encoding the reference resolution flag may be used.
- the reference resolution flag may include data indicating whether the resolution is the same as the same reference resolution and data indicating the position of the actual motion vector.
- the reference resolution may be defined in a header, and may be a picture parameter set, a sequence parameter set, a slice header, or the like.
- FIG. 28 is a diagram illustrating a motion vector of a current block X and a reference motion vector of a neighboring block.
- the resolution change flag meaning the resolution designation flag in the second embodiment
- the types of resolutions are 1/2, 1/4, 1/8, and the reference resolution is 1/4;
- the optimal resolution is determined, the motion vectors 4/8 and 5/8 of the current block are converted into the reference motion vector Ref_MV using Equation 9 using the reference resolution 1/4.
- the resolution of the motion vector of the current block is different from the resolution of the reference motion vector, it may be converted using multiplication and division.
- division rounding, rounding, rounding, etc. can be used, and rounding is used in this embodiment.
- the reference motion vector becomes (2/4, 3/4), and the position of the actual motion vector having a resolution other than the reference resolution may be represented by the reference resolution flag.
- the difference between the motion vector of the current block and the reference motion vector is (0, 1/8), and the value of the reference resolution flag may include location information such as (0, 1), for example.
- 0 indicates that the reference motion vector is the same as the motion vector
- 1 indicates the motion vector as small as -1/8 in the reference motion vector.
- the difference vector of the reference motion vector is calculated using the prediction reference motion vector, which is a median value of the reference motion vector of the neighboring block.
- the prediction reference motion vector can be obtained by using Equation 10.
- the encoder encodes Ref_MVD (-2/4, 7/4) and encodes a reference resolution flag (0, 1) indicating the position of the motion vector.
- FIG. 29 is a diagram illustrating a code number table of a differential reference motion vector according to differential reference motion vector resolution.
- the reference resolution is 1/4
- the value of the differential reference motion vector is 2/4
- the code number is 2
- the value of the differential reference motion vector is 3/4 as in the present embodiment.
- the code number is 3, and the code number for each component of the differential reference motion vector is included in the reference resolution flag.
- the resolution encoder 940 may variously encode a motion vector resolution and / or a resolution of a differential motion vector determined for each region or motion vector.
- various examples of encoding the motion vector resolution or the resolution of the differential motion vector will be described with reference to FIGS. 10 to 14. In the following description, only an example of encoding a motion vector resolution will be described. However, since the resolution of the differential motion vector may be encoded in the same manner as the motion vector resolution, an example of encoding the resolution of the differential motion vector is omitted.
- the resolution encoder 940 may integrate the motion vector resolution determined for each region or motion vector between adjacent regions having the same motion vector resolution and generate a resolution identification flag in units of the integrated region.
- the resolution identification flag may be generated hierarchically in a quadtree structure.
- the resolution encoder 940 may encode an identifier indicating the maximum number of quad tree layers and the area size indicated by the lowest node of the quad tree layer in the header for the corresponding area of the bitstream.
- FIG. 10 is an exemplary diagram hierarchically expressing a motion vector resolution in a quad tree structure according to an embodiment of the present invention.
- 10A shows regions having various motion vector resolutions in one picture.
- Each region in 10A may be macroblocks of size 16 ⁇ 16, and the number indicated in each region represents the motion vector resolution of each region.
- 10B is a group showing regions having the same motion vector resolution among the regions shown in 10A.
- 10C is a hierarchical representation of the motion vector resolution of regions bounded by 10B in a quad tree structure. In 10C, since the size of the region indicated by the lowest node is a 16 ⁇ 16 macroblock and the maximum number of hierarchies in the quad tree is 4, such information is encoded and included in the header for the region.
- FIG. 11 is an exemplary diagram illustrating a result of encoding a motion vector resolution expressed hierarchically in a quad tree structure according to an embodiment of the present invention.
- the final bit as shown in Fig. 11 can be obtained. Encodes whether a node is divided into one bit. For example, if the bit value is '1', the node is divided into lower layers. If the bit value is '0', the node is not split into lower layers. It is not shown.
- the node at level 0 is divided into lower layers, so that it is encoded with bit value '1'. Since the first node of the divided level 1 is no longer divided into 1/2 resolutions, the first node is encoded with a bit value '0' and the motion vector resolution 1/2 is encoded. Since the second node of level 1 is divided into lower layers, it is encoded with bit value '1'. Since the third node of level 1 is not divided into lower layers, the third node is encoded with bit value '0' and the motion vector resolution 1/4 is encoded. Since the last fourth node of level 1 has been divided into lower layers, it is encoded with bit value '1'. In the same way, each node of level 2 is encoded.
- the final bit hierarchically encoded with the quadtree structure of the motion vector resolution for the region shown in 10A may be generated as shown in FIG. 11.
- Motion vector resolutions such as 1/2, 1/4, and 1/8 shown in the last bit mean that they are encoded with bits representing them, but are not represented as bit values for convenience of description.
- the motion vector resolution may be expressed as a bit value in various ways depending on the implementation manner. For example, if the available motion vector resolution is two, it may be indicated by a 1-bit flag. If the available motion vector resolution is four or less, it may be indicated by a 2-bit flag.
- the resolution identification flag generated as described above may be included in the field of the slice data.
- the image decoding apparatus to be described later may extract and decode the resolution identification flag from the bitstream to restore the motion vector resolution of each region.
- nodes can be subdivided into lower layers in various ways, such as not divided into two regions, divided into two horizontally long regions, divided into two vertically long regions, and divided into four regions.
- the resolution encoder 940 may generate a resolution identification flag by encoding the motion vector resolution determined for each region or motion vector using the motion vector resolution predicted using the motion vector resolution of the peripheral region of each region. For example, assuming that an area is a block of size 64 ⁇ 64, the motion vector resolution of the corresponding area may be estimated using the motion vector resolutions of the left area and the upper area of the area. In this case, when the predicted motion vector resolution is the same as the motion vector resolution of the corresponding region, the resolution encoder 940 encodes the resolution identification flag for the corresponding region as a bit value '1'.
- the resolution identification flag for the corresponding area may be encoded into a bit value '0' and a bit value representing the motion vector resolution of the corresponding area. For example, if the resolution of the upper region and the left region of the region is 1/2 resolution and the resolution of the region is also 1/2 resolution, the resolution identification flag for the region is encoded with the bit value '1' and the region If the resolution of the motion vector is not encoded and the resolution of the upper and left regions of the region is 1/2 resolution and the resolution of the region is 1/4 resolution, the resolution identification flag for the region is set to the bit value '0'. Can be encoded and additionally encoded the motion vector resolution of the region.
- the resolution encoder 940 may generate a resolution identification flag by encoding the motion vector resolution determined for each region or motion vector by using the run and length of the motion vector resolution of each region or motion vector. have.
- FIG. 12 is an exemplary diagram illustrating a motion vector resolution of a region determined according to an embodiment of the present invention.
- regions in one picture are represented by macroblocks having a size of 16 ⁇ 16, and motion vector resolutions of the regions are illustrated in each region.
- the motion vector resolution of the regions shown in FIG. 12 is encoded using a run and a length.
- a motion vector resolution of 1/2 is obtained. 4 times in succession, 1/4 motion vector resolution comes out 1 time, 1/8 motion vector resolution comes out 2 times in succession, 1/2 motion vector resolution comes out 4 times in succession (after motion Vector resolution is omitted), when the motion vector resolution of each region is expressed by the run and length, (1 ⁇ 2, 4), (1 ⁇ 4, 1), (1 ⁇ 8, 2), (1 ⁇ 2, 4),... It can be expressed as Accordingly, the resolution encoder 940 may generate a resolution identification flag by encoding the motion vector resolution of each region represented by the run and the length by representing the bit value.
- the resolution encoder 940 may generate a resolution identification flag by hierarchically encoding a motion vector resolution determined for each region or motion vector using a tag tree.
- the resolution encoder 940 may include an identifier indicating the maximum number of hierarchies of the tag tree and the size of the region indicated by the lowest node in the header.
- FIG. 13 is an exemplary diagram hierarchically expressing a motion vector resolution in a tag tree structure according to an embodiment of the present invention.
- FIG. 13 hierarchically shows a motion vector resolution determined for each region within a portion of an image in a tag tree structure.
- each region is a macroblock of size 16 ⁇ 16.
- FIG. 14 is an exemplary diagram illustrating a result of encoding a motion vector resolution expressed hierarchically expressed in a tag tree structure according to an embodiment of the present invention.
- the difference between the number of motion vector resolutions of the upper layer and the current layer is expressed as a sequence of '0', and the bit value of the last bit is expressed as '1'.
- the motion vector resolution number of the upper layer is '0'
- the motion vector resolution number of 1/2 is '1'
- the motion vector resolution number of 1/4 is '2', 1 /
- the number of motion vector resolutions of 8 is '3'
- a resolution identification flag may be generated as shown in FIG. 14.
- the number assigned to each motion vector resolution may be changed.
- the motion vector resolution number of the upper layer at level 0 is '0' and the motion vector resolution number at 1/2 is number '1'. Therefore, the difference between the number of level 1 and level 0 is '1', and when '1' is converted into coded bits, it is '01'. In addition, since the difference value of the number of motion vector resolutions from the level 1 to the higher layer (level 0) becomes '0', the encoding bit becomes '1'. In addition, since the difference value of the number of the motion vector resolution with respect to the upper layer (level 1) at level 2 becomes '0', the encoding bit becomes '1'.
- the difference value of the number of the motion vector resolution with the higher layer (level 2) becomes '0', so the encoding bit becomes '1', and finally the motion vector resolution of the region identified by (0,0).
- the coded bit of becomes '0111'.
- level 0, level 1, and level 2 are already reflected in the resolution identification flag of the area identified by (0,0), so they are higher in level 3.
- '1' which is a difference value of the number of motion vector resolutions with respect to the layer (level 2), may be encoded, and the corresponding encoding bit becomes '01'. Therefore, in the region identified by (0, 1), only '01' is generated as the resolution identification flag of the region.
- level 0 is already reflected in the resolution identification flag of the region identified by (0,0), so that the level 1, By encoding only level 2 and level 3, the coded bit becomes '0111'.
- the resolution encoder 940 may generate a resolution identification flag by changing and encoding the number of bits allocated to the motion vector resolution according to the frequency of the motion vector resolution determined for each region or motion vector. To this end, the resolution encoder 940 may change and encode the number of bits allocated to the motion vector resolution of the corresponding area according to the frequency of occurrence of the motion vector resolution up to the immediately preceding area in units of areas, or include a plurality of areas. The number of bits allocated to the motion vector resolution of the partial region may be encoded according to the frequency of occurrence of the motion vector resolution of the immediately preceding partial region or the frequency of occurrence of the motion vector resolution to the immediately preceding partial region on an area basis.
- the resolution encoder 940 calculates the frequency of the motion vector resolution in units of regions or partial regions, assigns small numbers in order of increasing frequency of the calculated motion vector resolution, and assigns the small numbers of motion vector resolutions.
- a small number of bits can be allocated to encode the motion vector resolution of each region.
- the motion vector resolution of 1/2 is 10 times in all the regions up to the previous region. Is generated, 1/4 motion vector resolution is generated 15 times, and 1/8 motion vector resolution is generated 8 times, the smallest number (e.g., 1) is assigned to 1/4 motion vector resolution. Assign the next smaller number (e.g., 2) to 1/2 motion vector resolution, give the largest number (e.g. 3) to 1/8 motion vector resolution, After allocating small bits in numerical order, if the motion vector resolution of the region to be encoded with motion vector resolution is 1/4 pixel unit, the smallest bit is allocated to encode 1/4 motion vector resolution for the region. Can be.
- the resolution encoder 940 when the resolution encoder 940 changes the number of bits according to the frequency of occurrence of the motion vector resolution of the previous region group for each region group unit, the resolution encoder 940 generates the motion vector resolution of each region of the previous region group.
- the frequency vector may be updated to allocate small numbers in order of occurrence frequency and to allocate small bits in order of number order to encode motion vector resolution of each region of the region group to be encoded.
- An area group may be a quad tree, a quad tree bundle, a tag tree, a tag tree bundle, a macroblock, a macroblock bundle, and an area of any size.
- the frequency of occurrence of the motion vector resolution is updated by every two macroblocks, and the number of bits of the motion vector resolution is assigned, or if four region groups are designated as quadtrees.
- the frequency of occurrence of the motion vector resolution is updated every four to allocate the number of bits of the motion vector resolution.
- the resolution encoder 940 may change the encoding method of the resolution identification flag according to the distribution of the motion vector resolutions of the surrounding areas of each region based on the motion vector resolution determined for each region or motion vector. That is, the shortest number of bits is allocated to the resolution having the highest probability of being the resolution of the corresponding area according to the distribution of the motion vector resolution of the surrounding area or the area group. For example, if the motion vector resolution of the left region of the region is 1/2 and the motion vector resolution of the upper region is 1/2, the motion vector resolution of the region is likely to be 1/2, so 1 / The shortest number of bits is allocated to two motion vector resolutions and encoded.
- the motion vector resolution of the left region of the region is 1/4
- the motion vector resolution of the upper region is 1/2
- the motion vector resolution of the upper region is 1/2
- the motion vector resolution of the upper right region is 1
- the number of short bits is allocated and encoded in the order of the highest probability, such as the order of 1/2, 1/4, 1/8, etc., of the motion vector resolution of the corresponding region.
- the resolution encoder 940 when entropy encoding by arithmetic coding, the resolution encoder 940 generates a binary coded bit string of the resolution identification flag based on the distribution of the motion vector resolution determined for each region or motion vector according to the distribution of the motion vector resolution of the surrounding region of each region.
- the arithmetic coding and the probability update are performed by applying the probability model differently according to the distribution of the motion vector resolution of the surrounding area and the probability of the motion vector resolution which has occurred up to now.
- arithmetic coding and probability updating are performed using different probability models according to bit positions.
- the probability model when encoding the first binarization bit, may be divided into four probability models and encoded.
- the four probabilistic models are, firstly, a probability model in which both the left and top regions have the same resolution and are the same as the highest probability resolution so far; second, the left and the top regions all have the same resolution and the most Probability model when the resolution of the high probability differs from the third. Third, the probability model when the resolution of the left and upper regions are all different, and the resolution of the highest probability so far and the resolution of the left or upper region is the fourth. In this case, the resolution of the left region and the upper region and the highest probability resolution so far can be divided into probability models.
- the probability model When the second binarization bit is encoded, the probability model may be divided into two probability models and encoded.
- the two probabilistic models are the first when the motion vector resolutions of both the left and top regions of the corresponding region are different and have the highest probability motion vector resolution so far and the motion vector resolution such as the motion vector resolution of the left or upper region.
- Probability model can be classified into a probability model in which the motion vector resolution of the left region and the upper region of the corresponding region and the resolution of the highest probability so far are different.
- the CABAC is used for encoding
- the motion vector resolution used for encoding uses only three motion vector resolutions, such as 1/2, 1/4, and 1/8, and has the highest probability of occurrence. If the motion vector resolution is 1/4, assign the shortest binarization bit string '1' to 1/4 motion vector resolution, and set '00' to the remaining 1/2 motion vector resolution and 1/8 motion vector resolution, respectively. Assign '01'.
- a probability model may be divided into three probability models and encoded.
- the three probabilistic models are: first, a probability model where the motion vector resolution of the left region and the upper region of the region is the same as the highest resolution that has occurred so far; and second, only one of the resolutions of the left region and the upper region of the region. Probability model in which the probability that occurred so far is the same as the highest resolution, and third, the probability model when the motion vector resolution of the left and upper regions of the corresponding region differs from the motion vector resolution of the highest probability so far. Can be.
- the probability model may be divided into six probability models and encoded.
- the six probabilistic models are, firstly, probabilistic models when the resolution of the left and top regions of the region is 1/8 motion vector resolution, and second, the motion vector resolution of the left and upper regions of the region is 1/1. Probabilistic model with 2 motion vector resolutions, Third, Probabilistic model with motion vector resolutions of the left and upper regions of the corresponding region being 1/4 motion vector resolution, and Fourth, resolution of the left and upper regions of the region Probability model when one is 1/8 motion vector resolution and the other is 1/4 motion vector resolution.Fifth, one of the motion vector resolutions of the left and upper regions of the region is 1/2 motion vector resolution and the other Probabilistic model for one motion of 1/4 motion vector resolution, and sixth for motion vector resolutions of the left and upper areas of the area. Im a vector resolution and the other may be separated by one-half if the probability model of the motion vector resolution. The resolution with the highest probability so far may be the probability of the resolution encoded up to the previous region, the probability of any region, or the predetermined fixed resolution.
- the resolution encoder 940 determines whether the motion vector resolution determined for each region or motion vector can be estimated according to a pre-determined estimation method in the image decoding apparatus, and thus, for the region where the motion vector resolution can be estimated.
- a resolution identification flag may be generated by encoding an identifier indicating that it is possible to estimate, and a resolution identification flag may be generated by encoding an identifier indicating that the motion vector resolution cannot be estimated and a motion vector resolution of the corresponding area. .
- the motion encoder 940 when the encoder 940 wants to encode a motion vector resolution for each region or motion vector, the motion encoder calculates a motion vector and a predicted motion vector of the corresponding region by applying a plurality of motion vector resolutions, and encodes the difference vector. After decoding the difference vector, the reconstructed difference vector is assumed to be the optimum resolution, and the motion vector is decoded for each resolution. Assuming that each resolution is an optimal resolution, when the motion of the pixels around the region is compensated for by using the reconstructed motion vector, the motion vector resolution that minimizes the cost according to a predetermined cost function is determined.
- the resolution encoder 940 determines that the motion vector resolution thus determined is the motion vector resolution of the region to be originally encoded (the motion vector resolution determined as the optimal motion vector resolution of the region, except that the optimal motion vector resolution always provides the best performance.
- the identifier indicating that the motion vector resolution of the corresponding area can be estimated by the image decoding apparatus (eg, '1' ') May be generated as a resolution identification flag of the corresponding area, in which case the motion vector resolution of the corresponding area is not encoded.
- the identifier (eg, '0') indicating that the motion vector resolution of the region cannot be estimated by the image decoding apparatus and the corresponding
- the original motion vector resolution of the region may be encoded and generated as a resolution identification flag of the region.
- various distortion functions such as Mean Squre Error (MSE) or Sum of Absolute Transformed Differences (SATD) may be used.
- the resolution encoder 940 may not encode the resolution of the region or the motion vector.
- the motion vector resolution does not need to be encoded because the predictive motion vector is encoded as a motion vector.
- FIG. 15 is an exemplary diagram for describing a process of determining a motion vector resolution using pixels around a region according to an embodiment of the present invention.
- an optimal motion vector resolution determined by the motion encoder 940 performing motion estimation on a region to which motion vector resolution is to be encoded is a motion vector resolution of 1/2, and a motion vector is (4,10). If the predicted motion vector is (2,7), the difference vector is (2,3). At this time, the resolution encoder 940 assumes that the image decoding apparatus can decode and reconstruct only the difference vector, and changes the motion vector resolution to several to predict the predicted motion vector according to each motion vector resolution, thereby resolving each motion vector resolution. After reconstructing the motion vector according to the motion vector resolution, the motion vector resolution at which the distortion between the surrounding pixels of the current area and the surrounding pixels of the area indicated by the motion vector according to the reconstructed motion vector resolution is minimized is determined.
- the motion vector resolution is 1/4 pixel unit
- the difference vector reconstructed by the image decoding apparatus is (2,3), and thus the motion vector of the corresponding region to be reconstructed is (5,17).
- the motion vector resolution is 1/2 pixel unit and the predicted motion vector is (2,7)
- the motion vector of the corresponding region to be reconstructed is (4, 10). In this manner, the motion vector of the corresponding region restored by the image decoding apparatus when the motion vector resolution is 1/8 pixel units is also calculated.
- the resolution encoder 940 encodes only an identifier indicating that the motion vector resolution can be estimated by the image decoding apparatus, and generates the resolution vector as a resolution identification flag of the corresponding region. The resolution is not encoded.
- the resolution determiner 930 determines a motion vector resolution determined for each region or motion vector when the size of the predicted motion vector or the difference vector of the motion vector according to the motion vector resolution determined for each region or motion vector is larger than a threshold. Can be determined. For example, when the magnitude of the prediction motion vector or the difference vector of a certain region is larger than the threshold, the motion vector resolution of the region or motion vector is determined as a predetermined value or a random value. May not be encoded. In addition, when the size of the motion vector of the region or the surrounding vector of the motion vector is large or the size of the motion vector of the region or the motion vector is larger than the threshold, the motion vector resolution of the region is set to a predetermined value or a random value.
- the motion vector resolution of the corresponding region may not be encoded.
- the motion vector resolution of the region or motion vector may be changed to any resolution without any flag.
- the threshold may be a predetermined value or any value input and may be calculated from the motion vectors of the surrounding blocks.
- the resolution determiner 930 may encode information about the resolution by encoding the reference picture index without generating a resolution identification flag when the resolution of the current block is known using the reference picture index.
- the reference picture may be indexed and encoded based on the distance between the current picture and the reference picture as shown in FIG. 30. For example, if four reference pictures are written, candidates of indexable reference pictures can be indexed as shown in FIG. 31 when the current picture is five times.
- FIG. 31 is a diagram illustrating a reference picture index according to a reference picture number and a resolution.
- the reference picture index can be encoded with 3 In FIG. 3, reference picture number 3 is extracted from a bitstream, and reference picture is 3 and resolution is 1/8 using the same table as the encoder.
- the difference vector encoder 950 may encode the difference vector differently according to the motion vector resolution. In other words, as the motion vector resolution increases, the size of the motion vector increases, so that the amount of bits required for encoding the difference vector also increases. Accordingly, the amount of bits can be reduced by encoding the difference vector in different ways according to the size of the motion vector resolution.
- the difference vector encoder 950 may encode using the K-th order exponential golem code, and according to the motion vector resolution determined for each region, The order K can be changed. For example, when encoding a difference vector using UVLC, if the motion vector resolution is 1/4 pixel unit, the order K of the code may be set to '0', and if the motion vector resolution is 1/8 pixel unit, The order K of may be set to '1'.
- the difference vector encoder 950 may encode using the unary / K-order exponential golem combined code, wherein the unary / The order (K) and the maximum value (T) of the K-order exponential golem combined code may be changed and encoded.
- the order K of the code may be set to '3' and the maximum value T may be set to '6' and the motion
- the order K of the code may be set to '5' and the maximum value T may be set to '12'.
- the cumulative probability may be differently calculated and encoded according to the motion vector resolution determined for each region. For example, each time the difference vector of the regions is encoded, the probability model is updated according to the motion vector resolution determined for each region, and when the difference vector of the other regions is encoded, the probability model according to the updated motion vector resolution is updated. It is available.
- the difference vector is encoded using a probability model of 1/2 pixel unit, the probability model of 1/2 pixel unit is updated, and the motion vector of a region
- the differential vector may be encoded using a 1/8 pixel unit probability model and the probability model may be updated in 1/8 pixel unit.
- the difference vector encoder 950 may predict the predicted motion vector for the motion vector of each region by using the motion vector of the surrounding region of each region, in order to calculate the difference vector of each region.
- the motion vector resolution of the surrounding region may be converted into the motion vector resolution of each region to be predicted.
- rounding, rounding, or rounding may be used.
- the surrounding area should be understood as a concept including an adjacent area.
- 16 is an exemplary diagram for describing a process of predicting a predicted motion vector according to an embodiment of the present invention.
- the motion vectors of the surrounding area of the region to be predicted are (4,5), (10,7), (5,10) and rounding is used when converting the predicted motion vector
- the predicted motion vector may be (5,5). If the motion vector resolution of the desired region is 1/8, the predictive motion vector may be (10, 10).
- the difference vector encoder 950 is an area to predict the motion vector at the highest resolution among the motion vector resolutions of the surrounding regions of the region.
- the motion vector resolution of can be converted and predicted. Referring to the example illustrated in FIG. 16, when the region to be predicted is the skip mode, since the highest resolution among the resolutions of the motion vectors of the surrounding region is 1/8, it is assumed that the resolution of the region to be predicted is 1/8. Predicting the predicted motion vector may be (10, 10).
- the differential vector encoder 950 may convert and predict the motion vector of the surrounding region at an arbitrary resolution.
- the final predicted motion vector may be obtained by converting the predicted motion vector to the motion vector resolution of the region to be predicted. Referring to the example shown in FIG.
- the predicted motion vector is converted to (3,3), and if the motion vector resolution of the region to be predicted is 1/8 pixel unit Since it is not the same as any motion vector resolution, the predicted motion vector (3,3) may be converted in units of 1/8 pixels so that the final predicted motion vector may be (12,12). In the same way, if the motion vector resolution of the region to be predicted is 1/4 pixel unit, the final predicted motion vector may be (6,6).
- 17 is a flowchart illustrating a video encoding method according to a first embodiment of the present invention.
- the image encoding method determines the motion vector resolution for each region or motion vector, and inter-predictively encodes the image in units of regions using the motion vector according to the motion vector resolution determined for each region or motion vector. .
- the image encoding apparatus 900 determines whether to change the motion vector resolution for each region or motion vector of the image (S1710), and in case of changing the motion vector resolution for each region, The motion vector resolution is determined for each region or motion vector (S1720), the image is inter prediction encoded by region using a motion vector according to the motion vector resolution determined for each region or motion vector (S1730), and the motion vector resolution is determined for each region.
- the image is inter-predictively encoded in units of regions by using a motion vector according to a fixed motion vector resolution for all regions or sub-regions of the region (S1740).
- the motion vector resolution determined for each region may be different for each x component and y component of the motion vector.
- the image encoding apparatus 900 may generate a resolution change flag indicating whether the motion vector resolution is determined for each region or motion vector. For example, when it is determined in step S1710 that the motion vector resolution is changed, the image encoding apparatus 900 generates a resolution change flag (eg, '1') indicating that the motion vector resolution is changed for each region. When it is determined in S1710 that the motion vector resolution is fixed without being changed, a resolution change flag (eg, '0') indicating that the motion vector resolution is fixed for each region may be generated.
- a resolution change flag eg, '0'
- the image encoding apparatus 900 generates a resolution change flag according to the setting information input from the user or the outside, and determines whether to determine the motion vector resolution for each region in step S1710 based on the bit value of the generated resolution change flag. You can also judge.
- the image encoding apparatus 900 may encode a motion vector resolution determined for each region or motion vector.
- the image encoding apparatus 900 hierarchically encodes a motion vector resolution determined for each region or motion vector in a quad tree structure by tying regions having the same motion vector resolution, or a motion vector determined for each region or motion vector.
- the resolution is encoded using a motion vector resolution predicted using the motion vector resolution of the surrounding area of each region, or the motion vector resolution determined for each region or motion vector is encoded using run and length, or the motion vector determined for each region.
- Resolution is encoded hierarchically using tag tree, or it is encoded by changing the number of bits allocated to motion vector resolution according to the frequency of motion vector resolution determined for each region or motion vector, or motion vector determined for each region or motion vector.
- an identifier indicating that the motion vector resolution can be estimated is encoded, and the motion vector resolution cannot be estimated.
- An identifier indicating that the region cannot be estimated and a motion vector resolution may be encoded together.
- the image encoding apparatus 900 hierarchically encodes the quad tree structure or hierarchically encodes using the tag tree, the maximum number of quad tree layers and the area size indicated by the lowest node of the quad tree layer respectively.
- an identifier indicating the maximum number of tag tree hierarchies and the area size indicated by the lowest node of the tag tree hierarchies may be encoded and included in the header.
- the image encoding apparatus 900 may determine a motion vector resolution determined for each region if the size of the predicted motion vector or the difference vector of the motion vector according to the motion vector resolution determined for each region is larger than a threshold. Can be determined by value. In addition, when each component of the difference vector is '0', the resolution of the region or the motion vector may not be encoded.
- the image encoding apparatus 900 may encode a difference vector that is a difference between a motion vector and a predicted motion vector according to a motion vector resolution determined for each region or motion vector.
- the image encoding apparatus 900 may encode the difference vector differently according to the motion vector resolution.
- the image encoding apparatus 900 may encode using the K-th order exponential golem code, and in this case, the image of the exponential golem code according to the motion vector resolution determined for each region.
- the order K can be changed.
- the image encoding apparatus 900 may encode using the unary / K-order exponential golem combined code, in which case the unary / K according to the motion vector resolution determined for each region.
- the order K and the maximum value T of the difference exponential golem combined code may be changed and encoded.
- the cumulative probability may be calculated and encoded differently according to a motion vector resolution determined for each region.
- the image encoding apparatus 900 may predict the predicted motion vector of the motion vector of each region by using the motion vector of the surrounding region of each region.
- the motion vector resolution of the surrounding region and the motion vector resolution of each region may be predicted. If is not the same, the motion vector resolution of the surrounding area may be converted to the motion vector resolution of each area to be predicted.
- the image encoding apparatus 900 may change the method of encoding the resolution identification flag according to the distribution of the motion vector resolutions of the surrounding areas of each region based on the motion vector resolution determined for each region or motion vector.
- a method of generating a binarization bit string of a resolution identification flag based on a distribution of a motion vector resolution determined for each region or motion vector according to a distribution of motion vector resolutions of surrounding regions of each region are applied differently according to the distribution of the motion vector resolution of the surrounding area and the probability of the motion vector resolution that has occurred so far to perform arithmetic coding and probability update.
- arithmetic coding and probability updating are performed using different probability models according to bit positions.
- the image encoding apparatus 900 may convert and predict the motion vector resolution of the region to be predicted to the highest resolution among the motion vector resolutions of the surrounding region. .
- FIG. 32 is a diagram illustrating an image encoding apparatus 3200 according to a second embodiment of the present invention.
- the image encoding apparatus 3200 includes an inter prediction encoder 3210, a resolution change flag encoder 3220, a resolution determiner 3230, It may include a resolution encoder 3240, a differential vector encoder 3250, and a resolution conversion flag generator 3260.
- the resolution designation flag generator 3220, the resolution encoder 3240, the differential vector encoder 3250, and the resolution conversion flag generator 3260 are not necessarily all included in the image encoding apparatus 3200. May optionally be included.
- the inter prediction encoder 3210 inter-prediction encodes an image in units of regions by using a motion vector according to a motion vector resolution determined for each region or motion vector of the image.
- the inter prediction encoder 3210 may be implemented with the image encoding apparatus 100 described above with reference to FIG. 1.
- the functions of the components included in the inter prediction encoder 3210 and the functions of the encoder 150 in the inter prediction encoder 3210 may be included.
- the overlapping function may be omitted in the encoder 150.
- the overlapping function may be omitted in the predictor 110.
- the encoder in the inter prediction encoder 3210 may be used. It may be configured integrally with 150.
- the flag information generated by the resolution specifying flag generator 3220 and the resolution switching flag generator 3260 may be converted into a bitstream by the resolution specifying flag generator 3220 and the resolution switching flag generator 3260, respectively. The function may be performed by the encoder 150 in the inter prediction encoder 3210.
- inter prediction encoder 3210 The functions of the inter prediction encoder 3210, the resolution encoder 3240, and the difference vector encoder 3250 are the same as the functions of the inter prediction encoder 910, the resolution encoder 940, and the differential vector encoder 950 in FIG. 9. Since each may be the same or similar, detailed descriptions of the inter prediction encoder 3210, the resolution encoder 3240, and the differential vector encoder 3250 are omitted.
- the resolution designation flag generator 3220 may designate an adaptive degree of resolution differently for each region or motion vector of the image.
- the resolution designation flag generator 3220 may generate and include a resolution designation flag for designating a set of the motion vector resolution and / or the resolution of the differential motion vector for each region or motion vector of the image.
- the area for specifying the motion vector resolution and / or the differential motion vector resolution by the resolution specification flag may be a block, a macroblock, a group grouping blocks, a group grouping macroblocks, or an area having an arbitrary size such as MxN. have. That is, the resolution designation flag generator 3220 may generate and include a resolution designation flag indicating a resolution available for all regions or sub-regions in a partial region of the image in the bitstream.
- the resolution designation flag may be determined and generated according to setting information input by a user, or may be determined and generated according to a predetermined criterion by analyzing an image to be encoded. Such a resolution designation flag may be included in a header of the bitstream.
- the header may be a picture parameter set, a sequence parameter set, a slice header, or the like.
- the resolution designation flag specifies the resolution type as 1/2 or 1/4
- the optimal resolution determined by the resolution determiner 3230 and the resolution identification flag encoded by the resolution encoder 3240 are 1/2 and 1/4.
- the resolution identification flag can be encoded in any manner. 33 is a diagram illustrating a resolution identification flag when the designated resolutions are 1/2 and 1/4.
- the resolution identification flag may be encoded using Unary, CABAC, quad tree encoding, or the like.
- CABAC a binary bit string may be generated with reference to the table of FIG. 33, and then arithmetic and probability encoding may be performed.
- the probabilistic model can be divided into three cases by using the resolution of the surrounding motion vector or block.
- 34 illustrates A, B, and C, which are neighboring blocks of the current block X
- FIG. 35 is a diagram illustrating a probability model according to a condition.
- the resolution identification flag to be encoded is selected from resolutions of 1/2, 1/4, and 1/8, and the resolution identification flag is It can be encoded in any way.
- 36 is a diagram illustrating a resolution identification flag when the designated resolutions are 1/2, 1/4, and 1/8. Referring to FIG. 36, the resolution identification flag may be 0, 10, or 11.
- the resolution identification flag may be encoded using Unary, CABAC, quad tree encoding, or the like.
- CABAC when CABAC is encoded, a binary bit string may be generated with reference to the table of FIG. 36 and then arithmetic and probability encoded.
- the probabilistic model can be divided into six cases using a binary bit string index (binidx) and the surrounding motion vector or block resolution.
- FIG. 34 when A, B, and C, which are neighboring blocks of the current block X, are shown, FIG. 37 is a diagram illustrating a probability model according to a condition.
- the resolution specifying flag generated by the resolution specifying flag generator 3220 may indicate a single resolution.
- the resolution designation flag may be encoded to indicate that the resolution of the corresponding area is fixed at 1/2 resolution.
- the adaptive degree of the resolution (that is, the resolution set) can be changed for each reference picture according to an arbitrary criterion without encoding the resolution designation flag.
- the degree of adaptability of the resolution may vary according to the distance between the reference picture and the current picture.
- 38 and 39 are diagrams illustrating the degree of adaptability (ie, a resolution set) according to a distance between a current picture and a reference picture.
- the resolution selects an optimal resolution from a set of resolutions of 1/1, 1/2, 1/4, and 1/8. And encoding a resolution identification flag, and if the distance between the current picture and the reference picture is the furthest of the plurality of reference pictures, the resolution selects an optimal resolution from a resolution set of 1/2, 1/4, and encodes the resolution identification flag.
- the resolution can be set to select and encode the optimal resolution among the 1/2, 1/4, and 1/8 resolution sets. Can be. Referring to FIG. 39, it can be seen that a single resolution may be used.
- the degree of adaptability of the resolution may be changed by using error measuring means such as a sum of squared difference (SSD) between the resolutions when generating the reference picture.
- error measuring means such as a sum of squared difference (SSD) between the resolutions when generating the reference picture.
- SSD sum of squared difference
- the resolution 1/2 may not be used, and if the error value exceeds the threshold, the resolution 1/2 may be set to be used.
- the resolution of 1/2 is set not to be used, 1/4 is used when the error value obtained by using an error measuring means such as SSD for the resolution of 1/1 and 1/4 does not exceed an arbitrary threshold value.
- the error value exceeds a certain threshold, set both 1/1 and 1/4 resolutions.
- the resolution of 1/4 is set, do not use 1/8 when the error value obtained by using an error measuring means such as SSD for the resolution of 1/4 and 1/8 does not exceed an arbitrary threshold value.
- the error value exceeds a certain threshold, all resolutions of 1/1, 1/4, and 1/8 are used.
- the threshold may be different or the same by resolution or by quantization parameter.
- a reference picture index number (the resolution set index in FIG. 9, which may be a reference picture number) for each predetermined resolution set is stored in a header to the decoding apparatus.
- 41 is a diagram illustrating a structure in which reference pictures are encoded.
- the resolution designation flag generator 3220 may set the resolution set differently in case of a picture to be used as a reference picture and when it is not used as a reference picture. For example, it is assumed that reference pictures are encoded in the structure as shown in FIG. 41 and according to a time layer, TL0, TL1, and TL2 are pictures used as reference pictures, and pictures of TL3 are pictures not used as reference pictures. . In this case, when the resolution set is set to 1/2 and 1/4 by the resolution designation flag generator 3220, the resolution set for each reference picture may be set as shown in FIG. 42.
- the resolution sets at the time of encoding picture 6 are 1/2 and 1/4, and the resolution identification flag or the resolution designation flag is encoded in an arbitrary region or motion vector unit with respect to the resolution set thus determined. Can be.
- the resolution identification flag or the resolution designation flag may not be encoded.
- the resolution designation flag generator 3220 may include all the functions of the resolution change flag generator 920 mentioned in the description of FIG. 9.
- the resolution switch flag generator 3260 generates a resolution switch flag indicating a change from the surrounding or previous value of the area to encode the current resolution.
- 43 is a diagram illustrating a resolution of a current block and a resolution of a neighboring block.
- the resolution set is 1/2, 1/4, 1/8, and the resolution of the neighboring block and the current optimal resolution are as shown in FIG. 43, the resolution of the neighboring block is (1/8, 1/4, 1 / 4, 1/4), so the highest resolution is 1/4.
- the resolution conversion flag is encoded as 0.
- the decoder extracts the resolution switch flag from the bitstream, and when the resolution switch flag is 0, it can be seen that 1/4, the resolution having the highest frequency among the resolutions of the neighboring blocks, is the resolution of the current block X.
- FIG. 44 is a diagram illustrating another example of the resolution of the current block and the resolution of the neighboring block
- FIG. 45 is a diagram illustrating a resolution identification flag according to the resolution.
- the resolution conversion flag is coded as 1 to indicate that the resolution is different from the neighboring value.
- the resolution identification flag is encoded as 1 using the table. If the current resolution is 1/4, it is unlikely that 1/4 will be selected as the resolution to be switched. Therefore, resolution 1/4 is not given a resolution identification flag.
- 46 is a diagram illustrating a resolution of a current block and a resolution of a neighboring block.
- the resolution switch flag may indicate whether the resolution of the block A and the current block X are the same. Therefore, in the above case, since the resolutions of the block A and the current block X are not the same, the resolution switching flag may be 1. In addition, since the resolution sets are 1/2 and 1/4, it is understood that the resolution of the current block is 1/4 using only the resolution conversion flag without encoding the resolution identification flag.
- 47 is a diagram illustrating a video encoding method according to a second embodiment of the present invention.
- a resolution designation flag generation step S4702 a resolution determination step S4704, an inter prediction encoding step S4706, and a differential vector encoding step S4708
- the resolution designation flag generation step S4702 is performed by the operation of the resolution designation flag generator 3220, the resolution determination step S4704 is performed by the resolution determiner 3230, and the inter prediction encoding step S4706 is the inter prediction encoder 3210.
- the difference vector encoding operation S4708 operates in the operation of the difference vector encoder 3250
- the resolution encoding operation S4710 operates in the operation of the resolution encoder 3240
- the resolution conversion flag generating operation S4712 includes the resolution switching flag. Since the operation corresponds to the operation of the generator 3260, detailed description thereof will be omitted.
- FIG. 52 is a block diagram schematically illustrating a video encoding apparatus according to a third embodiment of the present invention.
- the image encoding apparatus 5200 is an apparatus for encoding an image, and may include a reference picture interpolator 5210 and an inter prediction encoder 5220. Can be.
- the reference picture interpolator 5210 may adaptively determine the type of filter or the filter coefficient according to the target precision determined in a predetermined area unit. To this end, the reference picture interpolator 5210 may select one filter that is optimal in a corresponding region from among a plurality of filters having fixed filter coefficients, and determine a filter coefficient that is optimal in a corresponding region of a preset or fixed filter. It can also be determined by calculation. The information on the filter or the filter coefficient determined as selected or calculated in this way may be encoded and included in the bitstream.
- the reference picture interpolator 5210 filters the reference picture step by step using a plurality of filters and interpolates the reference picture with a target precision. That is, the reference picture is generated by interpolating the reference picture, and the reference picture is generated by using one filter or filter coefficient, and the filter is not interpolated by using a single filter or filter coefficient. By filtering the reference picture step by step, the reference picture can be interpolated to the reference picture of the target precision.
- various filters such as a Winner filter, a bilinear filter, a Kalman filter, or the like may be used as a filter for interpolating the reference picture.
- the target precision refers to the target precision when the image encoding apparatus 5200 estimates the motion of the region to be encoded, and various precisions such as 1x precision, 2x precision, 4x precision, and 8x precision can be used. have.
- the reference picture interpolator 5210 will be described in detail with reference to FIG. 53 in a later process.
- the inter prediction encoder 5220 inter-predicts an image using the reference picture interpolated with the target precision. That is, the inter prediction encoder 5220 estimates and compensates for a motion of a predetermined area of an image, such as a block to be encoded, using the reference picture interpolated by the reference picture interpolator 5210 with a target precision to compensate for the image of the corresponding area. Is subjected to inter prediction encoding to generate a bitstream.
- the bitstream generated as described above also includes information encoded by the reference picture interpolator 5210.
- the inter prediction encoder 5220 may be implemented by the image encoding apparatus 100 described above with reference to FIG. 1.
- the image encoding apparatus 100 described above with reference to FIG. 1 has been described as encoding an image in units of blocks.
- the inter prediction encoder 920 may block a region such as a macroblock or a subblock, a slice, a picture, or the like. It may be divided into regions of shape and size and encoded in an area unit having an arbitrary size.
- the predetermined area may be a macroblock of 16 ⁇ 16 size, but may be a block of various sizes and shapes such as a 64 ⁇ 64 size block and a 32 ⁇ 16 size block.
- the image encoding apparatus 100 described above with reference to FIG. 1 inter-prediction-codes a motion vector having the same motion vector precision for all blocks of an image, and thereby estimates the motion by interpolating a reference picture with the same precision.
- the inter prediction encoder 5220 estimates the motion using the reference picture interpolated with different precision for each predetermined region using a filter or filter coefficient determined for each predetermined region by the reference picture interpolator 5210. By doing so, the motion vector is determined and inter prediction coded.
- 53 is a block diagram schematically illustrating a reference picture interpolation apparatus for image encoding according to an embodiment of the present invention.
- the reference picture interpolation apparatus may be implemented as a reference picture interpolator 910 in the image encoding apparatus 5200 according to the third embodiment of the present invention described above with reference to FIG. 52.
- the reference picture interpolation apparatus is referred to as a reference picture interpolator 5210.
- the reference picture interpolator 5210 may include a filter selector 5310, a filter 5320, and a filter information encoder 5330.
- the filter selector 5310 adaptively determines the type of filter or the filter coefficient according to the target precision determined in a predetermined area unit. To this end, the filter selector 5310 may select and determine one filter that is optimal in a corresponding region from a filter set including a plurality of filters having fixed filter coefficients. That is, when the filter selector 5310 filters and interpolates a reference picture using each of a plurality of filters having fixed filter coefficients, the filter selector 5310 optimizes one filter having a minimum difference between the interpolated reference picture and the current picture. It can be selected as a filter of.
- the filter selector 5310 may calculate a filter coefficient at which the difference between the interpolated reference picture and the current picture is minimum for any one filter and determine the calculated filter coefficient as an optimal filter coefficient.
- the difference between the interpolated reference picture and the current picture may be calculated by a difference function such as Sum of Absolute Difference (SAD) or Sum of Squared Difference (SSD), but is not necessarily limited thereto. Can be.
- the filter 5320 generates an interpolated reference picture having a target precision by filtering the reference picture using a filter or a filter coefficient determined by the filter selector 5310. That is, when one filter is selected from the plurality of filters by the filter selector 5310, the filter 5320 interpolates the reference picture by filtering the reference picture using the selected filter, and the filter coefficient is selected by the filter selector 5310. Once calculated, the reference picture is interpolated by filtering the reference picture using a filter having the calculated filter coefficients.
- the interpolated reference picture becomes a reference picture having a target precision having pixels (integer pixels and / or subpixels) of the target precision, and when the inter prediction encoder 5220 determines the motion vector by estimating the motion of a predetermined region. , As a reference picture. That is, the inter prediction encoder 5220 performs motion estimation using the reference picture interpolated by the filter 5320 to the target precision.
- the filter information encoder 5330 encodes information about a filter or information about a filter coefficient determined by the filter selector 5310.
- the information about the coded filter or the information about the filter coefficient may be included in the bitstream.
- the filter selector 5310 may determine filter coefficients differently according to target precision for interpolation. For example, the filter selector 5310 uses a Wiener filter to calculate an optimal filter coefficient that minimizes a sum of squared difference (SSD) between a current picture and a reference picture to be currently encoded, as shown in Equation 2 below. The optimum filter tap can be calculated for each precision.
- SSD sum of squared difference
- the filter selector 5310 may calculate filter coefficients for each precision by using a Wiener Hope equation such as Equation (2). For example, the filter selector 5310 calculates the filter coefficients of a six-tap Winner filter at twice the precision, the filter coefficients of the 8-tap Kalman filter at four times the precision, and the filter coefficients of the linear filter at eight times the precision. The calculated filter coefficients are encoded and included in the bitstream.
- the filter 5320 is encoded using a reference picture interpolated by a six-tap winner filter when the precision of the current region or the motion vector is twice the precision, and interpolated by an 8-tap Kalman filter when the precision is four times. It can be encoded using a reference picture.
- the filter selector 5310 may determine a plurality of filters or filter coefficients for one region. That is, the filter selector 5310 may determine an optimal filter or coefficient of the filter for interpolation of the reference picture, and may further determine a filter or filter coefficient for interpolating the interpolated reference picture using the same. That is, when the reference picture interpolator 5210 interpolates the reference picture of the target precision by interpolating the reference picture using multi-step filtering, the filter that selects the filter having the minimum difference between the interpolated reference picture and the current picture is selected or the filter coefficients are interpolated. After calculating, the filter that is the minimum between the interpolated reference picture and the current picture may be further selected, or the filter coefficient may be further calculated.
- the filter 5320 may further filter the interpolated reference picture using the selected filter to interpolate the reference picture of the target precision, or filter the interpolated reference picture using a filter having a filter coefficient that is further calculated to refer to the target precision.
- the picture can be interpolated.
- the filter selector 5310 may reduce the number of filter coefficients to be encoded by assuming that filter coefficients at similar positions are symmetrical. For example, as shown in Equation 11, S 02 position of the sub-pixels when interpolating to interpolate sub-pixel of the filter coefficients and S 20 where the application in the C1 pixels from the filter using used filter from A3 pixels filter coefficient applied to the F3 pixel from filter used to interpolate the filter coefficients and the sub-pixels in S 60 where the application to C6 the pixels in the filter used to interpolate the filter coefficients and S 06 where the sub-pixels of the to be applied to the The values can all be assumed to be the same value. Based on this assumption, the compression efficiency can be improved by reducing the number of filter coefficients for interpolation that the filter information encoder 5330 needs to encode.
- the filter selector 1010 may calculate the filter coefficient h SP at which the square of the error e SP is minimized as shown in Equation 2.
- FIG. 54 and 55 are exemplary diagrams illustrating a filter used for multi-stage filtering according to an embodiment of the present invention
- FIG. 56 is an exemplary diagram for describing a multi-stage filtering process according to an embodiment of the present invention.
- FIG. 54 exemplarily shows a 6x6 tap filter
- FIG. 55 exemplarily shows a 4x4 tap filter.
- the pixels shown in FIG. 56 are pixels of hatched portions in FIGS. 54 and 55, and the pixels interpolated using the filters shown in FIGS. 54 and 55.
- the reference picture interpolator 5210 may interpolate subpixels of hatched portions based on A1 to F6 which are integer pixels already reconstructed in the reference picture using the 6x6 tap filter illustrated in FIG. 54. Also, the reference picture interpolator 5210 is based on A1, B2, C3, D4, E5, and F6, which are integer pixels already reconstructed in the reference picture when using only the 6 tap filter among the 6x6 tap filters shown in FIG. Interpolate subpixels S 11, S 22, S 33, S 44, S 55, and S 66 shown in 56 or subpixels S 01, S based on integer pixels C1, C2, C3, C4, C5, C6 You can also interpolate 02, S 03, S 04, S 05, S 06 and S 07 . In addition, the reference picture interpolator 5210 can interpolate the subpixels of the hatched portions shown in FIG. 55 in a similar manner as described above using the 4x4 tap filter shown in FIG. 55.
- the reference picture interpolator 5210 may set 63 filter coefficient sets of the 6x6 tap filter to interpolate 63 subpixels (S 01 to S 77 ). Can be encoded.
- the reference picture interpolator 5210 interpolates a subpixel at a 1/4 position or a 1/2 pixel position based on integer pixels using a filter having an optimal filter coefficient calculated in the first step, and the second In the step, the subpixel at the 1/8 pixel position may be interpolated using an optimal interpolation filter or a bilinear interpolation filter based on the subpixel at the 1/4 or 1/2 pixel position and the integer pixel.
- the reference picture interpolator 5210 first uses a six-tap filter based on integer pixels C1, C2, C3, C4, C5, and C6 as a first step. Interpolate subpixels S 02, S 04, and S 06, and interpolate S 20, S 40, and S 60 using a six-tap filter based on integer pixels A3, B3, C3, D3, E3, and F3. The subpixels S 22, S 24, S 26, S 42, S 44, S 46, S 62, S 64 and S 66 are interpolated using a 6 ⁇ 6 tap filter.
- the subpixel at 1/8 pixel position can be interpolated again using the 4x4 tap filter, 4 tap filter, 6 tap filter, 6x6 tap filter, etc. based on the subpixels and integer pixels interpolated in the first step. have.
- 67 is a diagram illustrating an image encoding apparatus 6700 according to a fourth embodiment of the present invention.
- the image encoding apparatus 6700 includes an inter prediction encoder 3210, a resolution change flag encoder 3220, and a resolution determiner. It may include a determiner 3230, a resolution encoder 3240, a differential vector encoder 3250, a resolution conversion flag generator 3260, and a reference picture interpolator 6710. Meanwhile, in FIG. 67, the resolution designation flag generator 3220, the resolution determiner 3230, the resolution encoder 3240, the difference vector encoder 3250, and the resolution conversion flag generator 3260 must all include an image encoding device ( 6700), but may be optionally included depending on the implementation manner. In the present specification, when the same operation is performed twice between the operation of the reference picture interpolator 6710 or 5210 and the operation of the inter prediction encoder 3210, the operation in the inter prediction encoder 3210 is duplicated. May be omitted.
- the operation of the filter information encoder in the reference picture interpolator 6710 or 5210 may be integrated into the encoder 150 in the inter prediction encoder 3210.
- operations of the resolution designation flag generator 3220, the resolution determiner 3230, the resolution encoder 3240, the difference vector encoder 3250, and the resolution conversion flag generator 3260 illustrated in FIG. 67 are the resolutions shown in FIG. 32.
- the designated flag generator 3220, the resolution determiner 3230, the resolution encoder 3240, the differential vector encoder 3250, and the resolution switch flag generator 3260 may be the same as or similar to each other, and the resolution designated flag generator 3220 may be used.
- the operation of the resolution determiner 3230 and the resolution switch flag generator 3260 is different from FIG. 32 except for a function of generating a resolution designation flag, a resolution identification flag, and a resolution switch flag to the reference picture interpolator 6710.
- FIG. 67 may be the same as or similar to the functions in FIG. 32. Therefore, in the description of FIG. 67, detailed descriptions of operations of the resolution designation flag generator 3220, the resolution determiner 3230, the resolution encoder 3240, the difference vector encoder 3250, and the resolution conversion flag generator 3260 are omitted. do.
- the function of the inter prediction encoder 3210 in FIG. 67 may be the same as or similar to that of the inter prediction encoder 3210 in FIG. 32 or the inter prediction encoder 5220 in FIG. Detailed description of the operation of the encoder 3210 will be omitted.
- the reference picture interpolator 6710 may include the functions of the reference picture interpolator 5210 of FIG. 52 and may include the functions mentioned in the description of the reference picture interpolator 6710 below, or may include both. .
- the function of the reference picture interpolator 6710 may be described with reference to FIGS. 63 to 65.
- the filter When generating the reference picture, the filter may be selected using a degree similar to the current picture. In addition, the filter may use a different filter for each resolution. In this case, the filter tap may be applied on a picture or slice basis. Therefore, the filter information according to the resolution may be included in the bitstream on a picture or slice basis. The filter information may be informed by a filter flag when using fixed filter coefficients or may include filter coefficients when using adaptive filter coefficients.
- the reference picture when the reference picture is interpolated at 1/8 resolution, the reference picture may be interpolated for each resolution differently using the table of FIG. 63.
- an optimal filter is selected for each resolution. Selection selects the filter that most closely resembles the current picture.
- the difference may be calculated as a sum of absolute difference (SAD) or a sum of squared difference (SSD).
- SAD sum of absolute difference
- SSD sum of squared difference
- a 1/2 resolution for making 1/4 resolution may be interpolated using another filter when the pixel interpolated as an optimal filter at 1/2 resolution is not used.
- the reference picture interpolator 6710 and the reference picture interpolator 7110 of the image decoding apparatus to be described later may promise the same 1/2 resolution for making 1/4 resolution.
- the difference with the current picture is calculated for each filter in the same manner as the 1/2 resolution, and the filter having the smallest difference is selected as an optimal filter.
- a 1/8 resolution filter you can select a 1/8 resolution filter by using the interpolated pixels as the optimal filter at 1/2 and 1/4 resolution, and 1/2 and 1 for 1/8 resolution. You can also use / 4 resolution.
- the reference picture interpolator 6710 and the reference picture interpolator 7110 of the image decoding apparatus to be described later may make the same promise.
- 64 is a diagram illustrating a table showing types of filter taps that vary depending on the resolution of a motion vector.
- the reference picture When the reference picture is interpolated to 1/4 pixels or 1/8 pixels or less, the reference picture may be interpolated using only 1/1 pixel position instead of the previous resolution position, and the filter may be changed according to the resolution of the motion vector. .
- the reference picture when the reference picture is interpolated at 1/16 resolution, it may be interpolated for each resolution differently using the table of FIG. 64.
- the differential motion vector may be encoded based on the 1/16 resolution.
- 65 is a diagram illustrating another example of the type of filter tap that varies depending on the resolution of a motion vector.
- the motion vector may be encoded based on a resolution of 1/16.
- 66 is a diagram illustrating a table in a case of finding an optimal position using resolutions of 1/2 and 1/4 positions.
- an optimal filter may be found for each resolution in any area unit without applying the filter in the picture or slice unit.
- the arbitrary region may be 16x16, 32x32, 64x64, 128x128 or motion vector units.
- an optimal filter may be encoded for each resolution in an arbitrary region unit, or the reference picture interpolator 6710 and the reference picture interpolator 7110 of the image decoding apparatus to be described later may pre-determine the filter for each resolution. For example, if you use up to 1/4 resolution and determine the optimal filter for each resolution by motion vector unit, you can find the motion vector at 1/1 pixel position and change the filter around the resolution. In this case, first, the motion vector is found at the 1/1 pixel position, and the optimal positions are found by interpolating the resolutions of the 1/2 and 1/4 positions using the table of FIG. 66. It can be encoded based on the resolution.
- the decoder generates a reproduced image with reference to FIG. 66 according to the motion vector position.
- the differential motion vector encoder may encode the differential motion vector code number table according to the motion vector resolution as shown in FIG. 25.
- the decoder decodes the motion vector resolution identification flag according to the motion vector resolution, and the motion vector decoder decodes using the differential motion vector code number table according to the motion vector resolution as shown in FIG. 25, and generates a playback image according to the motion vector resolution. do.
- the reference picture interpolator 6710 uses a resolution of a motion vector to change the filter and is optimized among the plurality of filters for each resolution. Can be selected.
- interpolation is performed using only 1/1 pixel position instead of the previous resolution position, and the filter is changed according to the resolution of the motion vector.
- the resolution designation flag indicates (1/2, 1/4, 1/8)
- an optimal filter for each resolution may be selected and interpolated differently using the table of FIG. 63.
- the selection selects a filter most similar to the current picture.
- a 1/2 resolution filter the difference between the half-pixels interpolated using the 8-tap Wiener filter and the current picture, and the half-pixels interpolated using the 8-tap Kalman filter
- the filter having the smallest difference from the current picture is selected as the best filter.
- the difference may be calculated as a sum of absolute difference (SAD) or a sum of squared difference (SSD).
- SAD sum of absolute difference
- SSD sum of squared difference
- the reference picture interpolator 6710 and the reference picture interpolator 7110 of the image decoding apparatus to be described later may promise the same 1/2 resolution for making 1/4 resolution.
- the difference with the current picture is calculated for each filter in the same manner as the 1/2 resolution, and the filter having the smallest difference is selected as an optimal filter.
- you can select a 1/8 resolution filter you can select a 1/8 resolution filter by using the interpolated pixels as the optimal filter at 1/2 and 1/4 resolution, and 1/2 and 1 for 1/8 resolution. You can also use / 4 resolution.
- the reference picture interpolator 6710 and the reference picture interpolator 7110 of the image decoding apparatus to be described later may make the same promise.
- the reference picture interpolator 6710 encodes optimal filter information (filter flag or filter coefficient) for each resolution in an arbitrary area unit, and sets the resolution identification flag in motion vector unit. Can be coded as 4.
- the resolution identification flag is encoded in 1/4 in motion vector units and the reference picture interpolator 6710 is encoded.
- Optimal filter information (filter flag or filter coefficient) of 1/4 resolution may be encoded in motion vector units.
- the decoder decodes the resolution identification flag in units of a motion vector, decodes optimal filter information (filter flag or filter coefficient) of the corresponding resolution, and generates a reproduced image using the filter information.
- the reference picture interpolator 6710 may optimize the filter information (filter flag or filter coefficient for each resolution).
- the motion vector resolution identification flag are encoded in motion vector units.
- the decoder decodes optimal filter information for each resolution in units of slices or pictures, and decodes a motion vector resolution identification flag in units of motion vectors to generate a reproduced image using filter information of the corresponding motion vector resolution.
- the reference picture interpolator 6710 when the reference picture interpolator 6710 receives the resolution identification flag from the resolution determiner 3230, the reference picture interpolator 6710 uses different resolution filters using the resolution of the motion vector and uses a single filter for each resolution. You can filter.
- the reference picture When the reference picture is interpolated to 1/4 pixels or 1/8 pixels or less, the reference picture may be interpolated using only 1/1 pixel position instead of the previous resolution position, and the filter may be changed according to the resolution of the motion vector.
- the resolution designation flag indicates (1/2, 1/4, 1/8, 1/16)
- it may be interpolated for each resolution differently using the table of FIG. 64.
- the differential motion vector is encoded based on the resolution of 1/1, and when the motion vector is the 1/8 position, the resolution identification flag is encoded by 1/8.
- the filter may be changed as shown in the table of FIG. 65 using 1/1 and 1/2 resolution.
- the differential motion vector is encoded based on a resolution of 1/2, and when the motion vector is a 1/4 position, the resolution identification flag is encoded by 1/4.
- the reference picture interpolator 6710 receives a resolution identification flag from the resolution determiner 3230, when the reference picture is interpolated, a filter is not applied in units of pictures or slices and is optimal for each resolution in an arbitrary area unit. You can also select a filter.
- 68 is a diagram illustrating another example of the type of filter tap according to the resolution of a motion vector.
- the arbitrary region may be 16x16, 32x32, 64x64, 128x128 or motion vector units.
- an optimal filter may be encoded for each resolution in an arbitrary area unit, or the reference picture interpolator 6710 and the reference picture interpolator 7110 of the image decoding apparatus to be described later may promise a filter for each resolution. For example, if you use up to 1/4 resolution and determine the optimal filter for each resolution by motion vector unit, you can find the motion vector at 1/1 pixel position and change the filter around the resolution. In this case, the motion vector is first found at the 1/1 pixel position, and the optimal positions are found by interpolating the resolutions of the 1/2 and 1/4 positions using the table of FIG. 68. At this time, the differential motion vector is encoded on the basis of 1/1 resolution, and when the motion vector is at 1/4 position, the resolution identification flag is encoded at 1/4.
- the reference picture interpolator 6710 uses a resolution of the motion vector to change the filter and is optimal among the plurality of filters for each resolution. You can choose a filter. In this case, when interpolating the reference picture to 1/4 pixel or 1/8 pixel or less, interpolate using only 1/1 pixel position instead of the previous resolution position, and change the filter according to the resolution of the motion vector. can do. For example, when the resolution designation flag indicates (1/2, 1/4, 1/8), an optimal filter for each resolution may be selected and interpolated differently using the table of FIG. 63.
- an optimal filter is selected for each resolution, and the selection selects a filter most similar to the current picture.
- the difference may be calculated as a sum of absolute difference (SAD) or a sum of squared difference (SSD).
- 69 is a diagram illustrating an identification flag according to a resolution.
- the reference picture interpolator 6710 encodes optimal filter information (filter flag or filter coefficient) for each resolution in units of arbitrary regions
- the resolution change flag may be encoded as 1 in units of a motion vector
- the resolution identification flag may be encoded into 1/4 using the table of FIG. 69.
- the reference picture interpolator 6710 when the reference picture interpolator 6710 receives the resolution switch flag from the resolution switch flag generator 3260, the reference picture interpolator 6710 uses a resolution of the motion vector to change the filter and uses a single filter for each resolution. Can be filtered.
- the reference picture interpolator 6710 may encode the differential motion vector based on the resolution of 1/1 and encode the resolution conversion flag as 0 when the previous resolution is 1/8 and the current optimal resolution is 1/8.
- 70 is a diagram illustrating another example of a resolution identification flag according to resolution.
- the reference picture interpolator 6710 encodes the differential motion vector based on the resolution of 1/2, and if the previous resolution is 1/8 and the current resolution is 1/4 position, the resolution conversion flag is encoded as 1 and the resolution identification flag is The 1/4 may be encoded using the table of FIG. 70.
- the reference picture interpolator 6710 receives the resolution switch flag from the resolution switch flag generator 3260, when the reference picture is interpolated, the filter is not applied in units of pictures or slices, and every resolution in any area unit. You can also choose the best filter.
- the arbitrary region may be 16x16, 32x32, 64x64, 128x128 or motion vector units.
- an optimal filter may be encoded for each resolution in an arbitrary area unit, or the reference picture interpolator 6710 and the reference picture interpolator 7110 of the image decoding apparatus to be described later may promise a filter for each resolution. For example, if the resolution designation flag is (1/2, 1/4) and the optimal filter is determined for each resolution by the motion vector unit, the motion vector is searched for 1/1 pixel position, and the filter is changed depending on the resolution. can do. In this case, first, the motion vector can be found at the 1/1 pixel position, and the optimal positions can be found by interpolating the resolutions of the 1/2 and 1/4 positions using the table of FIG. 68.
- the reference picture interpolator 6710 may encode the differential motion vector based on 1/1 resolution and encode the resolution conversion flag as 0 when the previous resolution is 1/4 and the current optimal resolution is 1/4. .
- the reference picture interpolator 6710 receives a resolution designation flag received from the resolution designation flag generator 3220 is a flag indicating a single resolution, the filter is changed for each resolution, and the reference picture interpolator 6710 and the following will be described later.
- the reference picture interpolator 7110 of the video decoding apparatus may promise the same. For example, if the resolution designation flag specifies 1/4 resolution as a single resolution and uses an optimal filter using a plurality of filters for each resolution, the reference picture interpolator 6710 interpolates the current picture when interpolating the reference picture. See the difference between and select the best filter for each resolution.
- the optimal filter may be applied in units of arbitrary regions, such as slices, pictures, and images. In this case, the information (filter flag or filter coefficient) of the optimum filter may be encoded for each resolution in units of arbitrary regions.
- 57 is a flowchart illustrating a reference picture interpolation method for image encoding according to an embodiment of the present invention.
- the reference picture interpolator 5210 selects a first filter for interpolating subpixels using integer pixels of the reference picture (S5710).
- the reference picture is interpolated using the first filter (S5720), the second filter for interpolating the subpixels having the target precision is selected using the interpolated subpixels (S5730), and the second filter is used.
- the interpolated reference picture is interpolated (S5740).
- step S5710 when the reference picture interpolator 5210 interpolates subpixels using integer pixels of the reference picture, the difference between the current picture and the interpolated reference picture among the plurality of filters having fixed filter coefficients is minimized.
- One filter can be selected as the first filter.
- step S5710 when the reference picture interpolator 5210 interpolates the subpixels using integer pixels of the reference picture, the reference picture interpolator 5210 uses the filter coefficient at which the difference between the current picture and the interpolated reference picture is minimized as the filter coefficient of the first filter. Can be calculated
- the reference picture interpolator 5210 interpolates again with the current picture among a plurality of filters having fixed filter coefficients when interpolating a subpixel having a target precision by using an interpolated subpixel and an integer pixel of a reference picture.
- One filter can be selected as the second filter in which the difference between the referred reference pictures is minimized.
- the coefficient can be calculated as the filter coefficient of the second filter.
- the reference picture interpolator 5210 may encode information about the first filter and information about the second filter.
- the information about the first filter and the second filter encoded as described above are included in the bitstream.
- FIG. 58 is a flowchart for explaining a video encoding method according to a third embodiment of the present invention.
- the image encoding apparatus 5200 performs stepwise filtering of a reference picture using a plurality of filters to interpolate to a reference picture having a target precision (S5810), and then, to the target precision.
- the image is inter prediction encoded using the interpolated reference picture (S5820).
- the image encoding apparatus 5200 repeats the process of filtering the reference picture with one of a plurality of filters and interpolating the filter, and repeating the interpolation by filtering the interpolated reference picture with another filter. You can interpolate with. That is, the image encoding apparatus 5200 selects a filter or filter coefficient that minimizes the difference between the current picture and the interpolated reference picture and interpolates the reference picture as a reference picture having a target precision, and then uses the selected filter or filter coefficient. By interpolating the reference picture first by interpolating the reference picture, selecting a filter or filter coefficient that minimizes the difference between the current picture and the interpolated reference picture again, and interpolating the interpolated reference picture using the selected filter or filter coefficient.
- Step-by-step interpolation such as third-order and fourth-order, in a manner of interpolating the reference picture in the second order, allows the reference picture of the target precision to be generated.
- one of a plurality of filters having a fixed filter coefficient may be selected as the filter selected for each step, and an optimal filter coefficient of a predetermined filter may be calculated.
- 71 is a diagram illustrating a video encoding method according to a fourth embodiment of the present invention.
- a resolution designation flag generation step S7102
- a resolution determination step S7104
- a reference picture interpolation step S7106
- an inter prediction encoding step S7108
- the resolution designation flag generation step S7102 is an operation of the resolution designation flag generator 3220 of the image encoding apparatus 6700 according to the fourth embodiment
- the resolution determination step S7104 is the video encoding according to the fourth embodiment.
- the reference picture interpolation step S7106 is performed by the reference picture interpolator 6710
- the inter prediction encoding step S7108 is performed by the video encoding apparatus according to the fourth embodiment.
- the difference vector encoding step S7110 is performed by the operation of the differential vector encoder 3250 of the image encoding apparatus 6700 according to the fourth embodiment.
- the resolution conversion flag generation step S7114 is a resolution conversion flag generator of the image encoding apparatus 6700 according to the fourth embodiment. Corresponding to the operation of 3260, respectively. A description thereof will be omitted.
- FIG. 18 is a block diagram schematically illustrating an image decoding apparatus according to a first embodiment of the present invention.
- the image decoding apparatus 1800 includes a resolution change flag extractor 1810, a resolution decoder 1820, a differential vector decoder 1830, and an inter prediction decoder 1840. Can be configured.
- the resolution change flag extractor 1810 extracts a resolution change flag from the bitstream. That is, the resolution change flag extractor 1810 extracts a resolution change flag indicating whether the motion vector resolution is fixed or changed for each region in the header of the bitstream. When the resolution change flag extractor 1810 indicates that the motion change resolution is fixed, the resolution change flag extractor 1810 extracts and decodes the encoded motion vector resolution from the bitstream to inter-predict the fixed motion vector resolution or the preset motion vector resolution.
- the decoder 1840 controls inter prediction decoding on all sub-regions defined in the header, and controls the difference vector decoder 1830 to reconstruct the motion vector of each region at a fixed motion vector resolution.
- the resolution decoder flag 1820 causes the motion decoder 1820 to move the motion vector of each region or motion vector of the lower region defined in the header. Control to reconstruct the resolution, and control the inter prediction decoder 1840 to inter predict and decode the reconstructed motion vector resolution for the lower angular region or motion vector defined in the header, and cause the differential vector decoder 1830 to Control to restore the motion vector of each region to the reconstructed motion vector resolution.
- the resolution change flag extractor 1810 may determine the motion vector resolution determined for each region or motion vector when the size of the predicted motion vector or the difference vector of the motion vector according to the motion vector resolution determined for each region or motion vector is larger than the threshold. May be determined as a predetermined value. For example, when the magnitude of the prediction motion vector or the difference vector of a certain region is larger than the threshold, the motion vector resolution of the region or motion vector is determined as a predetermined value or a random value. It may not be decoded. In addition, when the size of the motion vector of the region or the surrounding vector of the motion vector is large or the size of the motion vector of the region is larger than the threshold value, the motion vector resolution of the region or the motion vector is set to a predetermined value or any value.
- the motion vector resolution may not be decoded.
- the motion vector resolution of the region or motion vector may be changed to any resolution without any flag.
- the threshold may be a predetermined value or any value input and may be calculated from motion vectors of surrounding blocks.
- the resolution decoder 1820 extracts and decodes the resolution identification flag encoded from the bitstream according to the resolution change flag extracted by the resolution change flag extractor 1810 to restore the motion vector resolution for each region.
- the resolution decoder 1820 decodes the motion vector resolution, this is for convenience of description, and in reality, the resolution of the motion vector resolution or the differential motion vector or both resolutions are decoded. You may. Accordingly, the resolution indicated by the resolution identification flag may be the resolution of the motion vector resolution or the differential motion vector, or may be used simultaneously for the motion vector resolution and the resolution of the differential motion vector.
- the resolution decoder 1810 may reconstruct the motion vector resolution for each region or motion vector by decoding resolution identification flags hierarchically encoded in a quad tree structure by tying regions having the same motion vector resolution.
- the resolution decoder 1820 reconstructs a motion vector resolution by decoding a resolution identification flag by dividing a quad tree form as shown in FIG. 10 into respective regions as shown in FIG. 12. For example, when decoding the resolution identification flag generated by encoding as shown in FIG. 11, the first bit is '1' and thus the layer is divided. The second bit is '0', so the first node of level 1 Since it is known that is not divided, the next bits are decoded to restore the motion vector resolution of 1/2. In this manner, the same as the encoding method described above with reference to FIGS. 10 and 11, but in reverse order, the resolution identification flags for level 1 and level 2 are decoded to restore motion vector resolution of the corresponding regions or motion vectors.
- the resolution decoder 1810 decodes an identifier indicating the maximum number of quad tree layers and the area size indicated by the lowest node of the quad tree layer, which is included in the header of the bitstream.
- nodes are divided into lower layers (that is, divided into four zones) or not divided are shown.
- the nodes are not divided into lower layers.
- a node can be divided into lower layers in various ways: divided into two horizontally long regions, divided into two vertically long regions, and divided into four regions.
- the resolution decoder 1820 decodes the resolution identification flag encoded by using the motion vector resolution predicted by using the motion vector resolution of each area or the surrounding area of the motion vector to determine the motion vector resolution for each area or motion vector. Can be restored For example, the resolution decoder 1820 indicates that the resolution identification flag extracted for each region or motion vector from the bitstream is the same as the motion vector resolution predicted using the motion vector resolution of the surrounding region (e.g., For example, when the bit value of the resolution identification flag is '1'), the motion vector resolution predicted using the motion vector resolution of the surrounding area of the corresponding region may be restored without reading the next resolution identification flag from the bitstream.
- the resolution identification flag indicates that it is not the same as the motion vector resolution predicted using the motion vector resolution of the surrounding area (for example, when the bit value of the resolution identification flag is '0'), the corresponding resolution is identified from the bitstream. Solution of motion vector restored by reading and decoding the next bit value of the flag You can restore the top view.
- the resolution decoder 1820 may restore the motion vector resolution for each region or motion vector by decoding the resolution identification flag in which the run and the length of the motion vector resolution for each region or the motion vector are encoded.
- the resolution decoder 1820 decodes the resolution identification flag encoded together with the motion vector resolution and / or the differential motion vector resolution for some regions including several regions to restore the run and length of the motion vector resolution.
- the motion vector resolution of the regions shown in FIG. 12 may be restored using the reconstructed run and length of the motion vector resolution.
- the resolution decoder 1820 may reconstruct the motion vector resolution for each region by decoding the resolution identification flag hierarchically coded using the tag tree for the motion vector resolution for each region or motion vector. For example, with reference to FIGS. 13 and 14, the resolution decoder 1820 may know that the bit corresponding to the level 0 is '01' because the bit of the first region illustrated in FIG. 14 is '0111'. Since the number of motion vector resolutions of the upper layer from 0 is assumed to be '0', it is possible to restore the motion vector resolution of 1/2 in which the difference value of the number is increased to '1'.
- next bit is '1'
- the difference between the number and the upper layer is '0', so that the level 1 is also restored to the motion vector resolution of 1/2.
- level 2 and level 3 may be restored to a motion vector resolution of 1/2 as well.
- the bit of the second region shown in FIG. 14 is '01', and since it has already been decoded for level 0, level 1, and level 2 in the first region, only level 3 needs to be decoded. Since the difference is '1', the motion vector resolution of 1/4 can be restored. In this way, the motion vector resolution of the remaining areas can also be restored.
- the resolution decoder 1820 may change and decode the number of bits allocated to the resolution identification flag according to the frequency of the motion vector resolution for each region or motion vector. For example, the resolution decoder 1820 may calculate the frequency of the motion vector resolution reconstructed up to the last region, number the motion vector resolution according to the calculated frequency, and allocate and decode the number of bits according to the number.
- An area group may be a quad tree, a quad tree bundle, a tag tree, a tag tree bundle, a macroblock, a macroblock bundle, and an area of any size.
- the area group is designated as two macroblocks
- the frequency of occurrence of the motion vector resolution is updated by every two macroblocks, and the number of bits of the motion vector resolution is allocated and decoded
- the area group is designated as four quad trees.
- the frequency of occurrence of the motion vector resolution is updated every four quad trees, and the number of bits of the motion vector resolution is allocated and decoded.
- the resolution decoder 1820 may vary the decoding method of the resolution identification flag based on the distribution of the motion vector resolution of the surrounding area of each region with the motion vector resolution determined for each region or motion vector. According to the distribution of the motion vector resolution of the surrounding area or region group, the shortest number of bits is allocated and decoded to the motion vector resolution having the highest probability of being the resolution of the corresponding area. For example, if the motion vector resolution of the left region of the region is 1/2 and the motion vector resolution of the upper region is 1/2, the motion vector resolution of the region is most likely to be 1/2, so 1 / The shortest number of bits is allocated to two motion vector resolutions and decoded.
- the motion vector resolution of the left region of the region is 1/4
- the motion vector resolution of the upper region is 1/2
- the motion vector resolution of the upper region is 1/2
- the motion vector resolution of the upper right region is 1
- the probability of motion vector resolution of the corresponding region is decoded by allocating short bits in order of high probability such as 1/2, 1/4, 1/8, and so on.
- the resolution decoder 1820 entropy decodes by arithmetic decoding, a method of generating a binarization bit string of the resolution identification flag based on a distribution of motion vector resolutions of the surrounding regions of each region according to a motion vector resolution determined for each region or motion vector.
- the arithmetic decoding and the probability update may be performed by differently applying the probabilistic model according to the distribution of the motion vector resolution of the surrounding area and the probability of the motion vector resolution generated up to now.
- arithmetic decoding and probability updating may be performed using different probability models according to bit positions.
- the bit string having '00' is allocated to the 1/8 motion vector resolution as shown in FIG. 21, and the bit having '01'. Assign columns to 1/2 motion vector resolution.
- the probability model may be divided into four probability models to be decoded.
- the four probabilistic models are, firstly, probabilistic models where both the left and top regions have the same resolution and are the same as the highest probabilities resolution so far. Second, the left and the top regions have the same resolution and the highest Probabilistic model when the resolution of the probability is different from the third.
- the probability model when the resolution of the left region and the upper region are all different and the resolution of the highest probability up to now and the resolution of the left region or the upper region is the fourth.
- the probability model may be divided into two probability models to be decoded.
- the two probabilistic models are the first when the motion vector resolutions of both the left and top regions of the corresponding region are different and have the highest probability motion vector resolution so far and the motion vector resolution such as the motion vector resolution of the left or upper region.
- Probability model can be classified into a probability model in which the motion vector resolution of the left region and the upper region of the corresponding region and the resolution of the highest probability so far are different.
- the probability model when encoding the first binarization bit, may be divided into three probability models and decoded.
- the three probabilistic models are: first, a probability model where the motion vector resolution of the left region and the upper region of the region is the same as the highest resolution that has occurred so far; second, only one of the resolutions of the left region and the upper region of the region Probability model when the probability that occurred so far is the same as the highest resolution, and third, the probability model when the motion vector resolution of the left region and the upper region of the region differ from the highest motion vector resolution that has occurred so far. Can be.
- the probability model When decoding the second binarization bit, the probability model may be divided into six probability models to be decoded.
- the six probabilistic models are, firstly, probabilistic models when the resolution of the left and top regions of the region is 1/8 motion vector resolution, and second, the motion vector resolution of the left and upper regions of the region is 1/1. Probabilistic model with 2 motion vector resolutions, Third, Probabilistic model with motion vector resolutions of the left and upper regions of the corresponding region being 1/4 motion vector resolution, and Fourth, resolution of the left and upper regions of the region Probability model when one is 1/8 motion vector resolution and the other is 1/4 motion vector resolution.Fifth, one of the motion vector resolutions of the left and upper regions of the region is 1/2 motion vector resolution and the other Probability model for one motion of 1/4 motion vector resolution.
- motion vector resolution of the left and upper areas of the area is 1 / 8th Im a vector resolution and the other may be separated by one-half if the probability model of the motion vector resolution.
- the resolution with the highest probability so far may be the probability of the resolution encoded up to the previous region, the probability of any region or the predetermined fixed resolution.
- the resolution decoder 1820 when the resolution decoder 1820 is a flag indicating that a resolution identification flag decoded for each region or motion vector can be estimated, the resolution decoder 1820 estimates the motion vector resolution according to a pre-determined estimation method and uses the estimated motion vector resolution as the region or the motion vector resolution. In the case of a flag indicating that the resolution identification flag decoded for each region or motion vector cannot be estimated, the motion vector resolution indicated by the decoded resolution identification flag can be restored as a motion vector of the corresponding region. have.
- the resolution decoder 1820 may perform each motion vector resolution decoded by the same or similar method as that of the image encoding apparatus 900.
- the predicted motion vector is predicted by using a change, and the motion vector is reconstructed using the predicted predictive motion vector and the difference vector reconstructed by the difference vector decoder 1830.
- the motion vector resolution of a region is 1/4 pixel unit, when the prediction motion vector is (3, 14), the difference vector is (2, 3), and thus the reconstructed motion vector of the region is (5, 17).
- the motion vector resolution of the region is 1/2 pixel
- the predicted motion vector is (2, 7)
- the reconstructed difference vector is (2, 3)
- the reconstructed motion vector of the region is (4, 10).
- the resolution decoder 1820 may additionally decode and reconstruct the motion vector resolution of the corresponding region or motion vector in the resolution identification flag. Can be.
- the resolution decoder 1820 may restore the motion vector resolution for each region or motion vector only when each component of the difference vector is not '0'. That is, when the component of the difference vector of the specific region is '0', since the prediction motion vector is decoded into the motion vector, the motion vector resolution of the corresponding region may not be restored.
- the difference vector decoder 1830 extracts and decodes the encoded difference vector from the bitstream.
- the difference vector decoder 1830 reconstructs the difference vector for each region or motion vector by decoding according to the motion vector resolution for each reconstructed region or motion vector.
- a motion vector for each area may be reconstructed by predicting the prediction motion vector for each area and using the reconstructed difference vector and the predictive motion vector.
- the difference vector decoder 1830 can decode the difference vector using UVLC, and in this case, decode using the K-order exponential golem code, but in accordance with the motion vector resolution for each reconstructed region. You can change the order (K) of the code.
- the difference vector decoder 1830 can decode the difference vector using CABAC. In this case, the vector is decoded using the unary / K-order exponential golem combined code, but the motion vector resolution for each reconstructed region or motion vector. The decoding can be performed by changing the order (K) and the maximum value (T) of the unary / K-order exponential golem combined code.
- the difference vector decoder 1830 may calculate and decode the cumulative probability differently according to the motion vector resolution for each reconstructed region or motion vector.
- the difference vector decoder 1830 may predict the predicted motion vector for each region or the motion vector by using the motion vector of the region or the surrounding region of the motion vector.
- the motion vector resolutions of ⁇ are not the same, the motion vector resolution of the surrounding area may be converted into the motion vector resolution of each area to be predicted.
- the predicted motion vector may be obtained in the same manner in the image encoding apparatus and the decoding apparatus. Accordingly, various embodiments of deriving a predictive motion vector by the image encoding apparatus and embodiments of motion vector resolution conversion with reference to FIGS. 22 to 26 may be similarly implemented in the embodiment of the image decoding apparatus according to the present invention. have.
- the differential vector decoder 1830 converts the motion vector resolution of the surrounding region to the highest resolution among the motion vector resolutions of the surrounding region. It can be predicted.
- the resolution identification flag indicating the motion vector resolution decoded by the resolution decoder 1820 may represent both the x component and the y component of the motion vector, or may indicate respective resolutions. That is, when the camera photographing the image moves or an object in the image moves, the resolution decoder 1820 may decode the resolutions of the x component and the y component of the motion vector for motion estimation differently. For example, the x component of a motion vector of a certain region may be decoded at a motion vector resolution of 1/8 pixel unit, and the y component may be decoded at a motion vector resolution of 1/2 pixel unit, thereby inter prediction.
- the decoder 1840 may inter-prediction-decode a corresponding region by motion estimation and motion compensation with a motion vector having different motion vector resolutions for each x component and y component.
- the inter prediction decoder 1840 inter-predicts and decodes each region by using motion vectors for each region according to the motion vector resolution of each reconstructed region or motion vector.
- the inter prediction decoder 1840 may be implemented by the image decoding apparatus 800 described above with reference to FIG. 8.
- the decoder. In 810 the overlapping function may be omitted.
- the overlapping operation may be omitted in the predictor 850.
- the resolution change flag extractor 1810, the resolution decoder 1820, and the differential vector decoder 1830 may be configured as separate components from the inter prediction decoder 1840 as shown in FIG. 18, the inter prediction decoder It may be integrated with the decoder 810 in 1800.
- the inter prediction decoder 1840 uses various regions such as macroblocks, blocks, subblocks, slices, pictures, and the like. It can be divided into areas of shape and size and decoded by area having an arbitrary size.
- the predetermined area may be a macroblock of 16 ⁇ 16 size, but may be a block of various sizes and shapes such as a 64 ⁇ 64 size block and a 32 ⁇ 16 size block.
- the image decoding apparatus 800 described above with reference to FIG. 8 performs inter prediction decoding on motion blocks and / or differential motion vectors having the same motion vector resolution and / or differential motion vector resolution for all blocks of the image, inter prediction is performed.
- the decoder 1840 may inter-prediction and decode each region with a motion vector and / or a differential motion vector having a motion vector resolution and / or a differential motion vector resolution determined differently for each region or motion vector. That is, when the inter prediction decoder 1840 inter predicts and decodes a predetermined region, the inter prediction decoder 1840 is already encoded according to the motion vector resolution and / or differential motion vector resolution of each region or motion vector reconstructed by the resolution decoder 1820.
- the interpolated, decoded and reconstructed reference picture to increase the resolution, and then the motion vector and / or differential according to the motion vector resolution and / or differential motion vector resolution of the corresponding region or motion vector reconstructed by the differential vector decoder 1830.
- Motion estimation is performed using the motion vector.
- an interpolation filter for interpolating a reference picture various filters such as a Wiener filter, a bilinear filter, a Kalman filter, and the like can be used.
- the resolution for interpolation is 2 pixel unit, 1 pixel unit, 1/2 pixel unit, 1 / Resolution of various fractional pixel units and integer pixel units such as 4 pixel units, 1/8 pixel units, and the like may be applied.
- filter coefficients may be used or the number of filter coefficients may be changed according to the various resolutions. For example, when the resolution is 1/2 pixel unit, interpolation is performed using a Wiener filter, and when the resolution is 1/4 pixel unit, interpolation is performed using a Kalman filter.
- the number of filter taps may be changed when interpolating each resolution. For example, when the resolution is 1/2 pixel unit, interpolation may be performed using an 8 tap winner filter, and when the resolution is 1/4 pixel unit, an interpolation may be performed using a 6 tap winner filter.
- the inter prediction decoder 1840 may decode filter coefficients for each motion vector resolution in the bitstream and interpolate the reference picture with optimal coefficients for each motion vector resolution.
- the filter may use any filter such as a Wiener ⁇ Kalman filter, and the number of filter taps may be several.
- the number of filters and filter taps may be changed for each motion vector resolution.
- inter prediction decoding may be performed using a reference picture interpolated using a different filter according to the motion vector resolution of the region or the motion vector.
- a filter with a 6-tap Winner filter for 1/2 motion vector resolution, an 8-tap Kalman filter for 1/4 motion vector resolution, and a filter coefficient of a linear filter for 1/8 motion vector resolution are decoded.
- the reference picture may be interpolated and decoded for each motion vector resolution.
- the motion vector resolution of the current region or the motion vector is 1/2 motion vector resolution
- it is decoded using the reference picture interpolated by the 6-tap winner filter, and when the 1/4 motion vector resolution is interpolated by the 8-tap Kalman filter.
- the reference picture can be used to decode.
- FIG. 48 is a block diagram schematically illustrating an image decoding apparatus according to a second embodiment of the present invention.
- the video decoding apparatus 4800 includes a resolution designation flag extractor 4810, a resolution decoder 4820, a differential vector decoder 4830, an inter prediction decoder 4840, and resolution switching. And a flag extractor 4850.
- the resolution designation flag extractor 4810, the resolution decoder 4820, the differential vector decoder 4830, and the resolution change flag extractor 4850 are not necessarily included in the image decoding apparatus 4800. It may be selectively included according to the encoding method of the video encoding apparatus for generating the bitstream.
- the inter prediction decoder 4840 inter-predicts and decodes each region by using motion vectors for each region according to the motion vector resolution of each reconstructed region or motion vector.
- the inter prediction decoder 4840 may be implemented by the image decoding apparatus 800 described above with reference to FIG. 8.
- the overlapping function may be omitted in the decoder 810.
- the operation of the resolution decoder 4820 and the operation of the predictor 850 in the inter prediction decoder 4840 overlap, the overlapping operation may be omitted in the predictor 850.
- the resolution designation flag extractor 4810, the resolution decoder 4820, the differential vector decoder 4830, and the resolution conversion flag extractor 4850 are separate components from the inter prediction decoder 4840 as shown in FIG. 48. It may be configured, but may be integrated with the decoder 810 in the inter prediction decoder 4800.
- the resolution specifying flag extractor 4810 extracts the resolution specifying flag from the input bitstream.
- the resolution designation flag is a flag indicating that it is fixed at a single resolution or a resolution set having one or more resolutions.
- the resolution specification flag extractor 4810 extracts the resolution specification flag from the bitstream. That is, the resolution designation flag extractor 4810 extracts a resolution change flag indicating whether the motion vector resolution is fixed to an arbitrary value or there is a resolution set changed for each region in the header of the bitstream. If the resolution designation flag extractor 4810 indicates that the resolution designation flag indicates that the motion vector resolution and / or the differential motion vector resolution is fixed to an arbitrary resolution, the resolution specification flag extractor 4840 and the differential vector encoder In operation 4830, the difference vector encoder 4830 encodes the difference vector at the received resolution and transmits the difference vector to the inter prediction decoder 4840. In this case, the inter-prediction decoder 4840 performs inter-prediction decoding using the received difference vector and the resolution received from the resolution designation flag extractor 4810 and the received bitstream.
- the resolution designation flag extractor 4810 causes the resolution decoder 4820 to perform motion vector resolution and / or differential motion vector of each region or motion vector of the lower region defined in the header when the extracted resolution designation flag is a predetermined resolution set. Control to reconstruct the resolution, and control the inter prediction decoder 4840 to inter predict and decode the reconstructed motion vector resolution for the lower angular region or motion vector defined in the header, and cause the differential vector decoder 4830 to Control to restore the motion vector of each region with the reconstructed differential motion vector resolution.
- an adaptive degree of resolution (that is, a resolution set) may be calculated for each reference picture according to an arbitrary reference.
- the degree of adaptability of the resolution may vary according to the distance between the reference picture and the current picture. Since the description thereof has been mentioned in the descriptions of FIGS. 38 and 39, a detailed description thereof will be omitted.
- the resolution set may be calculated by using an error measuring means such as a sum of squared difference (SSD) between the resolutions when generating the reference picture.
- SSD sum of squared difference
- the resolution set may be calculated by using an error measuring means such as a sum of squared difference (SSD) between the resolutions when generating the reference picture.
- an error measuring means such as a sum of squared difference (SSD) between the resolutions when generating the reference picture.
- SSD sum of squared difference
- the error value exceeds a certain threshold, set both 1/1 and 1/4 resolutions.
- the resolution of 1/4 is set, do not use 1/8 when the error value obtained by using an error measuring means such as SSD for the resolution of 1/4 and 1/8 does not exceed an arbitrary threshold value.
- the error value exceeds a certain threshold, all resolutions of 1/1, 1/4, and 1/8 are used.
- the threshold may be different or the same by resolution or by quantization parameter.
- the resolution designation flag extractor 4810 extracts the reference picture index number from the bitstream instead of the resolution designation flag, and for each predetermined resolution set as shown in FIG. A resolution set can be extracted by storing and referring to a corresponding reference picture index number.
- the resolution set that is actually used may be set differently according to whether the reference picture is used by setting the resolution set differently in case of a picture to be used as a reference picture and when it is not used as a reference picture.
- the table shown in FIG. 42 is also stored in the image decoding apparatus 4800 and may be used by the resolution decoder 4820 to decode the resolution. Since the description thereof has been mentioned in the descriptions of FIGS. 41 and 42, a detailed description thereof will be omitted.
- the resolution designation flag extractor 4810 may determine the area or the motion vector when the predicted motion vector or the differential motion vector of the motion vector according to the motion vector resolution and / or the differential motion vector resolution determined for each area or motion vector is larger than the threshold.
- the motion vector resolution and / or differential motion vector resolution determined for each motion vector may be determined as a predetermined value. Details of this have been mentioned in the description of the resolution change flag extractor 1810 of the image decoding apparatus 1800 according to the first embodiment, and thus detailed descriptions thereof will be omitted.
- the resolution decoder 4820 extracts and decodes the resolution identification flag encoded from the bitstream according to the resolution designation flag extracted by the resolution designation flag extractor 4810 to restore the motion vector resolution for each region.
- the resolution decoder 4810 may reconstruct the motion vector resolution for each region or motion vector by decoding regions of the same region having the same motion vector resolution and decoding the resolution identification flag hierarchically encoded in a quad tree structure. Since details thereof have been mentioned in the description of the resolution decoder 1810 of the image decoding apparatus 1800 according to the first embodiment, detailed descriptions thereof will be omitted.
- the resolution decoder 4820 decodes the resolution identification flag encoded by using the motion vector resolution predicted by using the motion vector resolution of each region or the surrounding region of the motion vector, thereby obtaining the motion vector resolution for each region or motion vector. Can be restored Since details thereof have been mentioned in the description of the resolution decoder 1810 of the image decoding apparatus 1800 according to the first embodiment, detailed descriptions thereof will be omitted.
- the resolution decoder 4820 may restore the motion vector resolution for each region or motion vector by decoding the resolution identification flag in which the run and the length of the motion vector resolution for each region or the motion vector are encoded. Since details thereof have been mentioned in the description of the resolution decoder 1810 of the image decoding apparatus 1800 according to the first embodiment, detailed descriptions thereof will be omitted.
- the resolution decoder 4820 may restore the motion vector resolution for each region by decoding the resolution identification flag hierarchically encoded using the tag tree. Since details thereof have been mentioned in the description of the resolution decoder 1810 of the image decoding apparatus 1800 according to the first embodiment, detailed descriptions thereof will be omitted.
- the resolution decoder 4820 may change the number of bits allocated to the resolution identification flag according to the frequency of the motion vector resolution for each region or the motion vector, and decode. For example, the resolution decoder 4820 may calculate the frequency of the motion vector resolution reconstructed up to the last region, number the motion vector resolution according to the calculated frequency, and allocate and decode the number of bits according to the number. Since details thereof have been mentioned in the description of the resolution decoder 1810 of the image decoding apparatus 1800 according to the first embodiment, detailed descriptions thereof will be omitted.
- the resolution decoder 4820 may vary the decoding method of the resolution identification flag according to the distribution of the motion vector resolutions of the surrounding areas of each area. According to the distribution of the motion vector resolution of the surrounding area or region group, the shortest number of bits is allocated and decoded to the motion vector resolution having the highest probability of being the resolution of the corresponding area. Since details thereof have been mentioned in the description of the resolution decoder 1810 of the image decoding apparatus 1800 according to the first embodiment, detailed descriptions thereof will be omitted.
- the resolution decoder 4820 performs entropy decoding by arithmetic decoding
- a method of generating a binary coded bit stream of the resolution identification flag based on the distribution of the motion vector resolution of each region or the motion vector resolution determined for each region or motion vector is determined.
- the arithmetic decoding and the probability update may be performed by differently applying the probabilistic model according to the distribution of the motion vector resolution of the surrounding area and the probability of the motion vector resolution generated up to now.
- arithmetic decoding and probability updating may be performed using different probability models according to bit positions. Since details thereof have been mentioned in the description of the resolution decoder 1810 of the image decoding apparatus 1800 according to the first embodiment, detailed descriptions thereof will be omitted.
- the resolution decoder 4820 when the resolution decoder 4820 is a flag indicating that a resolution identification flag decoded for each region or motion vector can be estimated, the resolution decoder 4820 estimates the motion vector resolution according to a pre-determined estimation method and uses the estimated motion vector resolution as the region or the motion vector resolution. In the case of a flag indicating that the resolution identification flag decoded for each region or motion vector cannot be estimated, the motion vector resolution indicated by the decoded resolution identification flag can be restored as a motion vector of the corresponding region. have. Since details thereof have been mentioned in the description of the resolution decoder 1810 of the image decoding apparatus 1800 according to the first embodiment, detailed descriptions thereof will be omitted.
- the resolution decoder 4820 may additionally decode and reconstruct the motion vector resolution of the corresponding region or motion vector in the resolution identification flag. Can be.
- the resolution decoder 4820 may restore the motion vector resolution for each region or motion vector only when each component of the difference vector is not '0'. That is, when the component of the difference vector of the specific region is '0', since the prediction motion vector is decoded into the motion vector, the motion vector resolution of the corresponding region may not be restored. Since details thereof have been mentioned in the description of the resolution decoder 1810 of the image decoding apparatus 1800 according to the first embodiment, detailed descriptions thereof will be omitted.
- the resolution decoder 4820 extracts a resolution identification flag according to the resolution type of the resolution change flag extracted and decoded from the header, and by using the difference vector decoder 4830 to the motion vector resolution as shown in FIG.
- the value of the differential motion vector corresponding to the resolution is extracted by referring to the extracted code number by storing the differential motion vector code number table according to the same content as the video encoding apparatus.
- the decoded resolution identification flag is 1/4, when the motion vector is decoded, it may be decoded using the prediction motion vector obtained by converting the motion vector extracted from the bitstream and the resolution 1/4 of the neighboring motion vector.
- the predicted motion vector can be obtained by taking a median using a neighboring motion vector transformed using multiplication and division similarly to the encoder, but the present invention is not limited thereto.
- FIG. 49 is a diagram illustrating a peripheral motion vector of the current block X
- FIG. 50 is a diagram illustrating a converted value of the peripheral motion vector according to the current resolution.
- the resolution decoder 4820 may calculate the resolution using the reference picture index without extracting the resolution identification flag. Since this is described in the description of FIGS. 30 to 31 in the image encoding apparatus 3200, detailed descriptions thereof will be omitted.
- the inter prediction decoder 4840 may obtain a motion vector and perform decoding using the prediction motion vector obtained using the table shown in FIG. 50 and the difference vector calculated from the difference vector decoder 4830.
- the resolution decoder 4820 extracts and decodes the resolution identification flag of the differential motion vector in the bitstream.
- the code number of the differential motion vector and the sign of the differential motion vector are decoded. If the code number table of the differential motion vector according to the differential motion vector resolution as shown in FIG. 27 is stored and the code number extracted from the bitstream is (1, 1), the differential motion vector code number table according to the differential motion vector resolution of FIG. By reference, the differential motion vectors (-1/8, -1/4) can be obtained.
- the inter prediction decoder 4840 may calculate the predictive motion vector in the same manner as the image encoding apparatus as follows.
- MV (1/8, -4/8) MVD (-1/8, -1/4)-PMV (1/4, -1/4)
- the differential vector decoder 1830 when the differential vector decoder 1830 receives the reference resolution flag, the differential vector decoder 1830 reconstructs the differential motion vector and decodes it at the reference resolution.
- the code number included in the reference resolution flag is extracted
- the differential reference motion vector is decoded by referring to the code number table according to the differential reference motion vector resolution as shown in FIG. 29, and the difference is obtained by using the position information included in the reference resolution flag.
- Restore the motion vector that is, when the differential vector decoder 1830 decodes the differential motion vector, the differential vector decoder 1830 extracts a reference resolution from the bitstream and calculates a differential reference motion vector using a code number table according to the differential reference motion vector resolution as shown in FIG. 29. do.
- the reference resolution flag representing the reference resolution and the position coordinate of the motion vector is extracted from the bitstream, and the differential motion vector is decoded using the reference resolution flag from the differential reference motion vector.
- the reference resolution flag may include data indicating whether the resolution is the same as the same reference resolution and data indicating the position of the actual motion vector.
- another function of the difference vector decoder 1830 may include a function of the difference vector decoder 1830 described in the image decoding apparatus 1800 according to the first embodiment. Since the detailed description has described the function of the differential vector decoder 1830 in the first embodiment, the detailed description thereof will be omitted.
- the resolution identification flag is extracted and decoded from the bitstream using a resolution identification flag table excluding the resolution having the highest frequency among the same peripheral resolutions of the encoder.
- the resolution switching flag extractor 4850 extracts the resolution switching flag from the bitstream and excludes the resolution having the highest frequency among the peripheral resolutions according to the value of the resolution switching flag (for example, 1). Table), the resolution identification flag can be extracted and decoded in the bitstream.
- the resolution conversion flag is encoded as 0 when the resolution of the current block is the same as the resolution among the A and B resolutions as well as the highest frequency resolution, and the resolution conversion flag is set to 1 when the resolution is different. It can also be encoded.
- the resolution switch flag is encoded as 1, the flag may be excluded from the resolution identification flag table.
- the resolution switch flag extractor 4850 calculates the resolution of the current block by extracting a resolution switch flag value (for example, 1) from the bitstream so that the resolution is different from the previous block A.
- a resolution switch flag value for example, 1
- the resolution set is 1/2, 1/4
- the resolution to be converted is 1/4 if the resolution of the previous block is 1/2.
- FIG. 59 is a block diagram schematically illustrating an image decoding apparatus according to a third embodiment of the present invention.
- the image decoding apparatus 5900 may include a reference picture interpolator 5910 and an inter prediction decoder 5920.
- the reference picture interpolator 5910 interpolates the reference picture by using a plurality of filters identified by the information on the plurality of filters reconstructed by decoding the bitstream, and interpolates the reference picture with a target precision. That is, the reference picture interpolator 5910 decodes the bitstream to restore information on the plurality of filters, and uses the plurality of filters or the filter coefficients of the plurality of filters identified by the information on the plurality of filters to be restored. By interpolating the reference picture, the reference picture of the target precision is interpolated.
- the reference picture interpolator 5910 interpolates the reference picture using one of the two filters.
- the interpolated reference picture is interpolated again using the other one of the two filters to the reference picture of the target precision.
- the reference picture interpolator 5910 may identify a filter having one filter coefficient among two filter coefficients. The interpolated reference picture is interpolated, and the interpolated reference picture is interpolated again using a filter having a filter coefficient of the other one of the two filters, thereby interpolating the reference picture with a target precision.
- the reference picture interpolator 5910 will be described in detail with reference to FIG. 60 in a later process. However, the information on the plurality of filters in which the bitstream is decoded and reconstructed is not necessarily reconstructed by the reference picture interpolator 5910, and the inter prediction decoder 5920 decodes the bitstream to provide information on the plurality of filters. When the information is reconstructed, the reference picture interpolator 5910 may receive information about the plurality of filters reconstructed from the inter prediction decoder 5920.
- the inter prediction decoder 5920 reconstructs an image by inter prediction decoding a bitstream using a reference picture having an interpolated target precision.
- the inter prediction decoder 5920 may be implemented by the image decoding apparatus 800 described above with reference to FIG. 8. However, although the above-described image decoding apparatus 800 described with reference to FIG. 8 encodes an image in units of blocks, the inter prediction decoder 5940 may encode the image in various forms such as macroblocks, blocks, subblocks, slices, pictures, and the like. It can be divided into areas of shape and size and decoded by area having an arbitrary size.
- the predetermined area may be a macroblock of 16 ⁇ 16 size, but may be a block of various sizes and shapes such as a 64 ⁇ 64 size block and a 32 ⁇ 16 size block.
- the inter prediction decoder 5920 determines differently for each region or motion vector. Each region may be inter-prediction decoded with a motion vector having a motion vector precision.
- FIG. 72 is a block diagram schematically illustrating an image decoding apparatus according to a fourth embodiment of the present invention.
- the image decoding apparatus 7200 includes a resolution designation flag extractor 4810, a resolution decoder 4820, a differential vector decoder 4830, an inter prediction decoder 4840, and resolution switching. And a flag extractor 4850 and a reference picture interpolator 7210.
- the resolution designation flag extractor 4810, the resolution decoder 4820, the differential vector decoder 4830, and the resolution change flag extractor 4850 are not necessarily included in the image decoding apparatus 4800. It may be selectively included according to the encoding method of the video encoding apparatus for generating the bitstream.
- the resolution designation flag extractor 4810, the resolution decoder 4820, and the resolution change flag extractor 4850 extract the resolution designation flag, the resolution identification flag, and the resolution conversion flag, respectively, and transmit the same to the reference picture interpolator 7210. Except for the points, the description is the same as or similar to that of the resolution designation flag extractor 4810, the resolution decoder 4820, and the resolution change flag extractor 4850 of FIG. 48, and thus a detailed description thereof will be omitted.
- the inter prediction decoder 4840 in FIG. 72 inter-predicts and decodes each region by using motion vectors for each region according to the motion vector resolution of each reconstructed region or motion vector.
- the overlapping function may be omitted in the decoder 810.
- the resolution designation flag extractor 4810, the resolution decoder 4820, the differential vector decoder 4830, and the resolution conversion flag extractor 4850 are separate components from the inter prediction decoder 4840 as shown in FIG. 72. It may be configured, but may be integrated with the decoder 810 in the inter prediction decoder 4800.
- the reference picture interpolator 7210 of FIG. 72 may include the functions of the reference picture interpolator 5910 of FIG. 59 and may include the functions mentioned in the description of the reference picture interpolator 7210 below and both. You may.
- the reference picture interpolator 7210 interpolates the reference picture by determining a filter tap using the motion vector resolution for each extracted region or motion vector.
- the reference picture interpolator 7210 uses the extracted resolution change flag.
- the filter tap is determined using the determined motion vector resolution to interpolate the reference picture.
- the reference picture interpolator 7210 is a single resolution when it means that the extracted resolution designation flag is a single resolution.
- the filter tab is determined according to the interpolation of the reference picture.
- the reference picture interpolator 7210 may set the type of filter tab for each picture resolution, and interpolate the reference picture by selecting a filter having the smallest difference from the current picture as a result of the interpolation among the types of the filter tab. .
- the reference picture interpolator 7210 may interpolate the reference picture by selecting a filter tap according to the resolution of the motion vector.
- the reference picture interpolator 7210 may interpolate the reference picture by selecting an optimum filter tap for each resolution in units of arbitrary regions within the picture or slice.
- the operation of the reference picture interpolator 7210 in the image decoding apparatus 7200 according to the fourth embodiment of the present invention is performed by the reference picture interpolator 6710 in the image encoding apparatus 6700 according to the fourth embodiment of the present invention.
- the detailed description may be omitted since it may be the same as or similar to the operation.
- 60 is a block diagram schematically illustrating a reference picture interpolation apparatus for image decoding according to an embodiment of the present invention.
- a reference picture interpolation apparatus for image decoding according to an embodiment of the present invention may be implemented as a reference picture interpolator 5910 in the image decoding apparatus 5900 according to the third embodiment of the present invention described above with reference to FIG. 59. Can be.
- the reference picture interpolation apparatus for image decoding according to an embodiment of the present invention is referred to as a reference picture interpolator 5910.
- the reference picture interpolator 5910 may be configured to include a filter information decoder 6010 and a filter 6020.
- the filter information decoder 6010 restores information on the plurality of filters by decoding the bitstream. That is, the filter information decoder 6010 extracts and decodes data encoded with information about the plurality of filters from the bitstream and restores the information about the plurality of filters.
- the information on the plurality of filters restored as described above may be information on the type of filter. In this case, information on which filter is selected among filter sets having fixed filter coefficients may be used. In this case, a plurality of filters selected from filter sets may be indicated.
- the information on the plurality of filters to be restored may be information on filter coefficients of a predetermined filter. When multi-level filtering is used, it may be information on the plurality of filter coefficients.
- various filters such as a Wiener filter, a bilinear interpolation filter, a Kalman filter, and the like may be used.
- the filter 6020 interpolates the reference picture by using the information about the filter reconstructed by the filter information decoder 6010.
- the filter 6020 may interpolate the reference picture by performing multi-stage filtering in filtering and interpolating the reference picture.
- the filter 6020 may step-by-step filter the reference picture by using the information about the plurality of filters reconstructed by the filter information decoder 6010 and interpolate the reference picture with the target precision.
- the filter 6020 uses the 6x6 tap filter having the reconstructed filter coefficients and the already reconstructed integer pixel of the reference picture. Interpolate subpixels based on In addition, when the information on the filter restored by the filter information decoder 6010 is the filter coefficient of the 6-tap filter, the filter 6020 uses integer 6 pixels A1, B2, Interpolate subpixels S 11, S 22, S 33, S 44, S 55, and S 66 based on C3, D4, E5, F6, or use negative integers C1, C2, C3, C4, C5, C6
- the pixels S 01, S 02, S 03, S 04, S 05, S 06 and S 07 may be interpolated.
- the reference picture interpolator 6010 may interpolate the reference picture using multi-step filtering. For example, in the case of interpolating the horizontal and vertical precisions of the reference picture by 8 times, respectively, as illustrated in FIG. 56, in order to interpolate 63 subpixels (S 01 to S 77 ), the filter information decoder 6010 The reference picture may be interpolated by restoring 63 filter coefficient sets of the decoded 6x6 tap filter and using the 63 filter coefficient sets from which the filter 6020 is reconstructed. Alternatively, when interpolating the horizontal and vertical precisions by 8 times, respectively, the filter 6020 has a filter coefficient identified by the information about the plurality of filters reconstructed by the filter information decoder 6010 as a first step.
- the subpixels at the 1/8 pixel position may be interpolated again based on the subpixels at the 1/4 or 1/2 pixel positions and the integer pixels.
- the filter 6020 is a first step, which is identified by the information on the plurality of filters reconstructed by the filter information decoder 6010.
- the six-tap filter is used to interpolate subpixels S 02, S 04, and S 06 based on integer pixels C1, C2, C3, C4, C5, and C6 of the reference picture, and is reconstructed by the filter information decoder 6010.
- the subpixels to the 1/2 or 1/4 pixel position of the reference picture are interpolated using a 6x6 tap filter identified by the information on the plurality of filters reconstructed by 6010.
- the filter 6020 uses a 4x4 tap filter or a 4 tap filter, a 6 tap filter, a 6x6 tap filter, and the like identified by the information on the plurality of filters restored by the filter information decoder 6010.
- the subpixels of the 1/8 pixel position are interpolated again by using the integer pixels of the interpolated reference picture and the subpixels interpolated in the first step.
- an image encoding / decoding apparatus includes a bitstream output terminal of the image encoding apparatus of FIG. 9, 32, 52, or 67 and image decoding of FIG. 18, 48, 59, or 72. This can be achieved by connecting the bitstream input of the device.
- An image encoding / decoding apparatus performs stepwise filtering of a reference picture using a plurality of filters to interpolate a reference picture with a target precision, and inter-predicts an image using the reference picture interpolated with the target precision.
- An image encoder to encode; And step-by-step filtering the reference picture using a plurality of filters identified by the information reconstructed by decoding the bitstream to interpolate to the reference picture of the target precision, and inter-predicting the bitstream using the reference picture of the interpolated target precision.
- an image decoder which decodes and restores an image.
- FIG. 19 is a flowchart illustrating an image decoding method according to a first embodiment of the present invention.
- the image decoding method extracts a resolution change flag from a bitstream, extracts and decodes an encoded resolution identification flag from the bitstream according to the extracted resolution change flag, and then moves each region or motion vector.
- the vector resolution is reconstructed, and each region is inter-predicted and decoded using a motion vector for each region according to the reconstructed region or the motion vector resolution of the motion vector.
- the image decoding apparatus 1800 extracts a resolution change flag from the bitstream (S1910), and determines whether the extracted resolution change flag changes the motion vector resolution for each region (S1920). If the vector resolution is changed for each region or motion vector, the resolution identification flag is extracted and decoded from the bitstream to restore the motion vector resolution of each region or motion vector (S1930), and the motion vector of each restored region or motion vector. Inter-prediction decoding is performed by reconstructing the motion vector of each region or the motion vector at the resolution (S1940). In addition, when the resolution change flag indicates that the motion vector resolution is fixed without changing for each region or motion vector, the image decoding apparatus 1800 extracts, decodes, and restores the motion vector resolution flag from the bitstream (S1950).
- Inter-prediction decoding is performed on each region by reconstructing the motion vector according to the fixed motion vector resolution for the lower region defined in the header according to the motion vector resolution (S1960).
- the motion vector resolution decoded for each region or motion vector may be different for each x component and y component of the motion vector.
- the image decoding apparatus 1800 decodes the resolution identification flag hierarchically coded in a quad tree structure by tying regions or motion vectors having the same motion vector resolution to restore motion vector resolution for each region or motion vector, or Decode the resolution identification flag encoded using the motion vector resolution predicted using the motion vector resolution of the surrounding area of the area to restore the motion vector resolution for each area, or the run and length of the motion vector resolution for each area or motion vector.
- the motion vector resolution is estimated according to a pre-determined estimation method, and the estimated motion vector resolution is restored as the motion vector resolution of the corresponding region or motion vector, and the resolution identification flag decoded for each region or motion vector
- the motion vector resolution indicated by the decoded resolution identification flag may be restored.
- the image decoding apparatus 1800 may identify an identifier indicating the maximum number of quad tree layers and the area size indicated by the lowest node of the quad tree layer from the header of the bitstream, the maximum number of tag tree layers, and the lowest node of the tag tree hierarchy. The identifier indicating the area size can be decrypted and restored.
- the image decoding apparatus 1800 extracts and decodes the encoded difference vector from the bitstream, and decodes the difference vector for each region or motion vector by decoding according to the reconstructed region or the motion vector resolution for each motion vector.
- the prediction motion vector for each region or motion vector may be predicted, and the motion vector for each region may be reconstructed using the reconstructed difference vector and the prediction motion vector.
- the image decoding apparatus 1800 may decode the difference vector using UVLC.
- the image decoding apparatus 1800 may decode using the K-th order exponential golem code, but according to the reconstructed region or the motion vector resolution for each motion vector.
- the order K of the exponential Gollum code can be changed.
- the image decoding apparatus 1800 may decode the difference vector by using context-based binary arithmetic coding.
- the image decoding apparatus 1800 may decode using the unary / K-order exponential golem combined code, but the motion vector for each reconstructed region or motion vector.
- the order (K) and the maximum value (T) of the unary / K-order exponential golem combined code can be changed and decoded according to the resolution.
- the image decoding apparatus 1800 may calculate and decode a cumulative probability differently according to the motion vector resolution for each reconstructed region or motion vector.
- the image decoding apparatus 1800 predicts a predicted motion vector with respect to the motion vector of each region or the motion vector by using the motion vector of the surrounding region of each region, but the motion vector resolution of the surrounding region and the motion vector resolution of each region If is not equal, the motion vector resolution of the surrounding area may be predicted by converting the motion vector resolution of each area or the motion vector.
- the predicted motion vector may be obtained in the same manner in the image encoding apparatus and the decoding apparatus. Accordingly, various embodiments of deriving a predictive motion vector by the image encoding apparatus and embodiments of motion vector resolution conversion with reference to FIGS. 22 to 26 may be similarly implemented in the image decoding apparatus according to the present invention.
- the image decoding apparatus 1800 may change the resolution identification flag decoding method according to the distribution of the motion vector resolution of each region or the motion vector resolution determined for each region or motion vector.
- a method of generating a binarization bit string of a resolution identification flag based on a distribution of a motion vector resolution determined for each region or motion vector according to the distribution of the motion vector resolution of the peripheral region of each region may be performed by differently applying the probability model according to the distribution of the motion vector resolution of the surrounding area and the probability of the motion vector resolution that has occurred up to now.
- arithmetic decoding and probability updating may be performed using different probability models according to bit positions.
- the image decoding apparatus 1800 may convert and predict the motion vector resolution of the surrounding region to the highest resolution among the motion vector resolutions of the surrounding region.
- 51 is a flowchart illustrating an image decoding method according to a second embodiment of the present invention.
- the resolution designation flag extraction step S5102 the resolution decoding step S5104, the differential vector decoding step S5106, the inter prediction decoding step S5108, and the resolution switching flag extraction are performed. It may be configured to include a step (S5110).
- the resolution designation flag extraction step S5102 is performed by the operation of the resolution designation flag extractor 4810
- the resolution decoding step S5104 is performed by the resolution decoder 4820
- the differential vector decoding step S5106 is differential vector decoding.
- the inter prediction decoding operation S5108 may correspond to the operation of the inter prediction decoder 4840
- the resolution switching flag extraction operation S5110 may correspond to an operation of the resolution conversion flag extractor 4850, respectively. Therefore, detailed description is omitted.
- 61 is a flowchart illustrating a reference picture interpolation method for image decoding according to an embodiment of the present invention.
- the reference picture interpolator 5910 decodes the bitstream to restore the information on the first filter and the information on the second filter (S6110).
- the reference picture is interpolated using the first filter identified by the information on the first filter, and the interpolated reference picture is interpolated using the second filter identified by the information on the second filter ( S6130).
- the information on the first filter and the information on the second filter may include information on the filter coefficient or information on the type of the selected filter among the plurality of filters.
- the reference picture interpolator 5910 may interpolate a subpixel of the reference picture based on the integer pixel of the reference picture using the first filter.
- the reference picture interpolator 5910 may interpolate subpixels having a target precision based on the integer pixels of the reference picture and the interpolated subpixels of the reference picture.
- FIG. 62 is a flowchart illustrating a video decoding method according to a third embodiment of the present invention.
- the image decoding apparatus 5900 may gradually perform a reference picture using a plurality of filters identified by information on a plurality of filters reconstructed by decoding a bitstream.
- the filter is interpolated to a reference picture having a target precision (S6210), and the image is reconstructed by inter-prediction decoding a bitstream using the interpolated target picture of the reference precision (S6220).
- the image decoding apparatus 5900 repeats the process of filtering the reference picture with one of a plurality of filters and interpolating the filter, and repeating the interpolation by filtering the interpolated reference picture with another filter. You can interpolate with.
- 73 is a diagram illustrating a video decoding method according to the fourth embodiment of the present invention.
- the resolution designation flag generation step (S7302) is an operation of the resolution designation flag extractor 3220 of the image decoding apparatus 7200 according to the fourth embodiment
- the resolution decoding step (S7304) is the video decoding according to the fourth embodiment.
- the reference picture interpolation step S7306 is performed by the reference picture interpolator 7210
- the differential vector decoding step S7308 is performed by the image decoding according to the fourth embodiment.
- the inter prediction decoding operation S7310 is performed by the inter prediction decoder 4840 of the image decoding device 7200 according to the fourth embodiment.
- the flag extracting step S7312 corresponds to an operation of the resolution switch flag extractor 4850 of the image decoding apparatus 7200 according to the fourth embodiment, and thus a detailed description thereof will be omitted.
- the motion vector by adaptively determining the precision of the reference picture for each region as a predetermined coding unit of an image such as a block, a macroblock, a slice, a picture, a picture group, etc.
- an interpolation filter for changing the precision of the reference picture can be selected from a set of filters having fixed filter coefficients, or adaptively calculating the optimal filter coefficients for a given filter to generate the reference picture. By interpolating, an optimal interpolation filter and filter coefficients can be adaptively determined for an area to be encoded.
- the reference picture can be filtered in multiple steps and interpolated into a reference picture with a target precision, a reference picture with more precise resolution can be generated to increase the accuracy of motion estimation to increase the accuracy of prediction. Compression efficiency and recovery efficiency can be improved.
- the image encoding / decoding method according to an embodiment of the present invention may be one of one of the image encoding methods according to the first to fourth embodiments of the present invention, and one of the image decoding methods according to the first to fourth embodiments. It can be implemented by combining the method of.
- An image encoding / decoding method includes an image encoding step of determining a motion vector resolution for each region or motion vector and inter prediction encoding using a motion vector according to the motion vector resolution determined for each region or motion vector; And an image decoding step of extracting the resolution information from the bitstream, restoring the resolution, and performing inter prediction decoding using a motion vector according to the motion vector resolution of each reconstructed region or motion vector.
- the motion vector resolution may be adjusted in units of an area or a motion vector of any size according to the characteristics of the image (for example, the complexity of the image or the degree of motion of the image). Since inter prediction encoding may be performed using a motion vector having a determined and adaptive motion vector resolution, the bit rate due to encoding may be reduced while improving the quality of an image, thereby improving compression efficiency. For example, in some pictures of an image, if some areas have a high degree of complexity but a small amount of motion, and other areas have a low degree of complexity but a large degree of motion, for some areas, the motion vector resolution may be increased for some areas.
- Predictive coding improves the accuracy of inter prediction, which reduces the residual signal and the amount of encoded bits.
- the amount of bits encoding the motion vector is not significantly increased, thereby improving image quality.
- the amount of encoded bits can be reduced.
- the motion vector resolution is lowered so that even if the inter prediction encoding is performed, the motion vector resolution is small without significantly reducing the image quality, thereby reducing the amount of bits encoding the motion vector, thereby reducing the overall image quality without significantly reducing the image quality.
- the amount of coding bits can be reduced, so that the compression efficiency can be improved.
- encoding is performed while adaptively determining the precision of a reference picture for each region as a predetermined coding unit of an image such as a block, a macroblock, a slice, a picture, a picture group, and the like while changing the motion vector resolution.
- the interpolation filter for changing the precision of the reference picture may be selected from among a set of filters having fixed filter coefficients, or adaptively calculating an optimal filter coefficient for a given filter to be encoded by interpolating the reference picture.
- the optimal interpolation filter and filter coefficients can be adaptively determined for the region.
- the reference picture can be filtered in multiple steps and interpolated into a reference picture with a target precision, a reference picture with more precise resolution can be generated to increase the accuracy of motion estimation to increase the accuracy of prediction. Compression efficiency and recovery efficiency can be improved.
- the present invention is applied to an image processing field for compressing a moving image, and determines a filter or filter coefficient for interpolating a reference picture according to characteristics of an image to interpolate a reference picture and multiply filter the reference picture.
- inter-prediction encoding by interpolating or adaptively changing a motion vector resolution of an image by a predetermined area unit, it is a very useful invention to generate an effect of efficiently encoding an image.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (51)
- 영상 부/복호화 장치에 있어서,In the video encoding / decoding device,참조 픽처를 복수 개의 필터를 이용하여 단계적으로 필터링하여 목표 정밀도의 참조 픽처로 보간하며, 상기 목표 정밀도로 보간된 참조 픽처를 이용하여 상기 영상을 인터 예측 부호화하는 영상 부호화부; 및An image encoder which filters the reference picture step by step using a plurality of filters to interpolate to a reference picture having a target precision, and inter-predictively encodes the image using the reference picture interpolated to the target precision; And비트스트림을 복호화하여 복원되는 정보에 의해 식별되는 상기 복수 개의 필터를 이용하여 참조 픽처를 단계적으로 필터링하여 목표 정밀도의 참조 픽처로 보간하며, 상기 보간된 목표 정밀도의 참조 픽처를 이용하여 상기 비트스트림을 인터 예측 복호화하여 영상을 복원하는 영상 복호화부Step-by-step filtering of a reference picture using the plurality of filters identified by the information reconstructed by decoding the bitstream to interpolate to a reference picture of a target precision, and using the reference picture of the interpolated target precision to determine the bitstream. An image decoder for reconstructing an image by inter prediction decoding를 포함하는 것을 특징으로 하는 영상 부/복호화 장치.Video encoding / decoding apparatus comprising a.
- 영상을 부호화하는 장치에 있어서,In the apparatus for encoding a video,참조 픽처를 복수 개의 필터를 이용하여 단계적으로 필터링하여 목표 정밀도의 참조 픽처로 보간하는 참조 픽처 보간기; 및A reference picture interpolator for filtering the reference picture step by step using a plurality of filters to interpolate the reference picture with a target precision; And상기 목표 정밀도로 보간된 참조 픽처를 이용하여 상기 영상을 인터 예측 부호화하는 인터 예측 부호화기An inter prediction encoder for inter prediction encoding the image using the reference picture interpolated with the target precision.를 포함하는 것을 특징으로 하는 영상 부호화 장치.An image encoding apparatus comprising a.
- 제 2항에 있어서, The method of claim 2,상기 영상 부호화 장치는,The video encoding apparatus,영역 또는 움직임 벡터마다 움직임 벡터 해상도를 결정하는 해상도 결정기를 추가로 포함하고,Further comprising a resolution determiner for determining a motion vector resolution for each region or motion vector,상기 참조 픽처 보간기는,The reference picture interpolator,상기 결정된 움직임 벡터 해상도를 이용하여 필터탭을 결정하는 것을 특징으로 하는 영상 부호화 장치.And a filter tap is determined using the determined motion vector resolution.
- 제 2항에 있어서, The method of claim 2,상기 영상 부호화 장치는,The video encoding apparatus,현재 해상도를 부호화할 영역의 주변 해상도 또는 이전 블록의 해상도와의 변화를 알려주는 해상도 전환 플래그를 발생하는 해상도 전환 플래그 생성기를 추가로 포함하고,And further including a resolution conversion flag generator for generating a resolution conversion flag indicating a change from the peripheral resolution of the region to be encoded with the current resolution or the resolution of the previous block,상기 참조 픽처 보간기는,The reference picture interpolator,상기 해상도 전환 플래그를 이용하여 결정된 움직임 벡터 해상도를 이용하여 필터탭을 결정하는 것을 특징으로 하는 영상 부호화 장치.And a filter tap is determined using a motion vector resolution determined using the resolution switch flag.
- 제 2항에 있어서, The method of claim 2,상기 영상 부호화 장치는,The video encoding apparatus,영상의 영역 또는 움직임 벡터마다 해상도 세트를 다르게 지정하는 해상도 지정 플래그를 생성하는 해상도 지정 플래그 생성기를 추가로 포함하고,Further comprising a resolution designation flag generator for generating a resolution designation flag for differently specifying a resolution set for each region or motion vector of the image,상기 참조 픽처 보간기는,The reference picture interpolator,상기 해상도 지정 플래그가 단일 해상도임을 의미하는 경우에 상기 단일 해상도에 따라 필터탭을 결정하는 것을 특징으로 하는 영상 부호화 장치.And a filter tap is determined according to the single resolution when the resolution designation flag is a single resolution.
- 제 2항에 있어서, 상기 참조 보간기는,The method of claim 2, wherein the reference interpolator,픽처의 해상도별로 필터탭의 종류를 설정하고 상기 필터탭의 종류 중에서 보간 결과 현재 픽처와 가장 차이가 적은 필터를 최적 필터로 선택하는 것을 특징으로 하는 영상 부호화 장치.And setting a filter tap type for each picture resolution, and selecting a filter having the smallest difference from the current picture as a result of the interpolation among the filter tap types as an optimum filter.
- 제 2항에 있어서, 상기 참조 보간기는,The method of claim 2, wherein the reference interpolator,움직임 벡터의 해상도에 따라 필터탭을 선택하는 것을 특징으로 하는 영상 부호화 장치.And a filter tap is selected according to the resolution of the motion vector.
- 제 2항에 있어서, 상기 참조 보간기는,The method of claim 2, wherein the reference interpolator,픽처 또는 슬라이스 내의 임의의 영역 단위로 해상도별 최적 필터탭을 선택하는 것을 특징으로 하는 영상 부호화 장치.And an optimum filter tap for each resolution in units of an arbitrary region within a picture or a slice.
- 영상을 복호화하는 장치에 있어서,In the apparatus for decoding an image,비트스트림을 복호화하여 복원되는 복수 개의 필터에 대한 정보에 의해 식별되는 복수 개의 필터를 이용하여 참조 픽처를 단계적으로 필터링하여 목표 정밀도의 참조 픽처로 보간하는 참조 픽처 보간기; 및A reference picture interpolator for stepwise filtering a reference picture using a plurality of filters identified by information on a plurality of filters reconstructed by decoding the bitstream, and interpolating the reference picture to a reference picture having a target precision; And상기 보간된 목표 정밀도의 참조 픽처를 이용하여 비트스트림을 인터 예측 복호화하여 영상을 복원하는 인터 예측 복호화기An inter prediction decoder configured to reconstruct an image by inter prediction decoding a bitstream using the reference picture of the interpolated target precision를 포함하는 것을 특징으로 하는 영상 복호화 장치.Video decoding apparatus comprising a.
- 제 9항에 있어서, The method of claim 9,상기 영상 복호화 장치는,The video decoding apparatus,영역 또는 움직임 벡터마다 움직임 벡터 해상도를 추출하는 해상도 복호화기를 추가로 포함하고,A resolution decoder for extracting a motion vector resolution for each region or motion vector,상기 참조 픽처 보간기는,The reference picture interpolator,상기 추출된 움직임 벡터 해상도를 이용하여 필터탭을 결정하는 것을 특징으로 하는 영상 복호화 장치.And a filter tap is determined using the extracted motion vector resolution.
- 제 9항에 있어서, The method of claim 9,상기 영상 복호화 장치는,The video decoding apparatus,현재 해상도를 부호화할 영역의 주변 해상도 또는 이전 블록의 해상도와의 변화를 알려주는 해상도 전환 플래그를 추출하는 해상도 전환 플래그 추출기를 추가로 포함하고,And further including a resolution conversion flag extractor for extracting a resolution conversion flag indicating a change from the peripheral resolution of the region to be encoded with the current resolution or the resolution of the previous block,상기 참조 픽처 보간기는,The reference picture interpolator,상기 추출된 해상도 전환 플래그를 이용하여 결정된 움직임 벡터 해상도를 이용하여 필터탭을 결정하는 것을 특징으로 하는 영상 복호화 장치.And a filter tap is determined using a motion vector resolution determined using the extracted resolution switch flag.
- 제 9항에 있어서, The method of claim 9,상기 영상 복호화 장치는,The video decoding apparatus,영상의 영역 또는 움직임 벡터마다 해상도 세트를 다르게 지정하는 해상도 지정 플래그를 추출하는 해상도 지정 플래그 생성기를 추가로 포함하고,And further including a resolution designation flag generator for extracting a resolution designation flag for differently specifying a resolution set for each region or motion vector of the image,상기 참조 픽처 보간기는,The reference picture interpolator,상기 추출된 해상도 지정 플래그가 단일 해상도임을 의미하는 경우에 상기 단일 해상도에 따라 필터탭을 결정하는 것을 특징으로 하는 영상 복호화 장치.And a filter tap is determined according to the single resolution when the extracted resolution designation flag is a single resolution.
- 제 9항에 있어서, 상기 참조 보간기는,The method of claim 9, wherein the reference interpolator comprises:픽처의 해상도별로 필터탭의 종류를 설정하고 상기 필터탭의 종류 중에서 보간 결과 현재 픽처와 가장 차이가 적은 필터를 최적 필터로 선택하는 것을 특징으로 하는 영상 복호화 장치.And setting a filter tap type for each picture resolution and selecting a filter having the smallest difference from the current picture as a result of the interpolation among the filter tap types as an optimum filter.
- 제 9항에 있어서, 상기 참조 보간기는,The method of claim 9, wherein the reference interpolator comprises:움직임 벡터의 해상도에 따라 필터탭을 선택하는 것을 특징으로 하는 영상 복호화 장치.And a filter tab is selected according to the resolution of the motion vector.
- 제 9항에 있어서, 상기 참조 보간기는,The method of claim 9, wherein the reference interpolator comprises:픽처 또는 슬라이스 내의 임의의 영역 단위로 해상도별 최적 필터탭을 선택하는 것을 특징으로 하는 영상 복호화 장치.And an optimum filter tap for each resolution in units of an arbitrary region within a picture or a slice.
- 참조 픽처를 보간하는 장치에 있어서,An apparatus for interpolating a reference picture,상기 참조 픽처를 보간하기 위한 복수 개의 필터를 선택하는 필터 선택기; 및A filter selector for selecting a plurality of filters for interpolating the reference picture; And상기 복수 개의 필터를 이용하여 상기 참조 픽처를 단계적으로 필터링하여 목표 정밀도를 가지는 참조 픽처를 생성하는 필터A filter for generating a reference picture having a target precision by filtering the reference picture step by step using the plurality of filters를 포함하는 것을 특징으로 하는 참조 픽처 보간 장치.Reference picture interpolation apparatus comprising a.
- 제 16항에 있어서, 상기 필터 선택기는,The method of claim 16, wherein the filter selector,상기 참조 픽처의 정수 화소를 이용하여 부화소를 보간하기 위한 제1 필터를 선택하고, 상기 정수 화소와 상기 보간된 부화소를 이용하여 목표 정밀도의 부화소를 보간하기 위한 제2 필터를 선택하는 것을 특징으로 하는 참조 픽처 보간 장치.Selecting a first filter for interpolating subpixels using integer pixels of the reference picture, and selecting a second filter for interpolating subpixels of target precision using the integer pixels and the interpolated subpixels And a reference picture interpolation device.
- 제 17항에 있어서, 상기 필터는,The method of claim 17, wherein the filter,상기 제1 필터를 이용하여 상기 참조 픽처를 보간하고, 상기 제2 필터를 이용하여 상기 보간된 참조 픽처를 보간하는 것을 특징으로 하는 참조 픽처 보간 장치.And interpolating the reference picture using the first filter and interpolating the interpolated reference picture using the second filter.
- 제 17항에 있어서, 상기 필터 선택기는,The method of claim 17, wherein the filter selector,상기 참조 픽처의 정수 화소를 이용하여 상기 부화소를 보간하는 경우, 고정된 필터 계수를 가지는 복수 개의 필터 중 현재 픽처와 보간된 참조 픽처 간의 차이가 최소가 되는 필터를 상기 제 1 필터로서 선택하는 것을 특징으로 하는 참조 픽처 보간 장치.When interpolating the subpixels using integer pixels of the reference picture, selecting a filter having a minimum difference between the current picture and the interpolated reference picture among the plurality of filters having a fixed filter coefficient as the first filter. And a reference picture interpolation device.
- 제 17항에 있어서, 상기 필터 선택기는,The method of claim 17, wherein the filter selector,상기 참조 픽처의 정수 화소를 이용하여 상기 부화소를 보간하는 경우, 현재 픽처와 보간된 참조 픽처 간의 차이가 최소가 되는 필터 계수를 상기 제 1 필터의 필터 계수로서 계산하는 것을 특징으로 하는 참조 픽처 보간 장치.In the case of interpolating the subpixels using the integer pixels of the reference picture, a reference picture interpolation is calculated as a filter coefficient of the first filter, the filter coefficient of which the difference between the current picture and the interpolated reference picture is minimized. Device.
- 제 17항에 있어서, 상기 필터 선택기는,The method of claim 17, wherein the filter selector,상기 참조 픽처의 정수 화소와 상기 보간된 부화소를 이용하여 상기 목표 정밀도의 부화소를 보간하는 경우, 고정된 필터 계수를 가지는 복수 개의 필터 중 현재 픽처와 다시 보간된 참조 픽처 간의 차이가 최소가 되는 필터를 상기 제 2 필터로서 선택하는 것을 특징으로 하는 참조 픽처 보간 장치.When interpolating the sub-pixel having the target precision using the integer pixel of the reference picture and the interpolated sub-pixel, the difference between the current picture and the interpolated reference picture is minimized among a plurality of filters having fixed filter coefficients. And a filter is selected as the second filter.
- 제 17항에 있어서, 상기 필터 선택기는,The method of claim 17, wherein the filter selector,상기 참조 픽처의 정수 화소와 상기 보간된 부화소를 이용하여 상기 목표 정밀도의 부화소를 보간하는 경우, 현재 픽처와 다시 보간된 참조 픽처 간의 차이가 최소가 되는 필터 계수를 상기 제 2 필터의 필터 계수로서 계산하는 것을 특징으로 하는 참조 픽처 보간 장치.When interpolating the sub-pixel of the target precision by using the integer pixel of the reference picture and the interpolated sub-pixel, the filter coefficient of which the difference between the current picture and the interpolated reference picture is minimized is determined as the filter coefficient of the second filter. And reference picture interpolation device.
- 제 17항에 있어서,The method of claim 17,상기 제1 필터에 대한 정보 및 상기 제2 필터에 대한 정보를 부호화하는 필터 정보 부호화기A filter information encoder for encoding information about the first filter and information about the second filter를 더 포함하는 것을 특징으로 하는 참조 픽처 보간 장치.The reference picture interpolation apparatus further comprises.
- 참조 픽처를 보간하는 장치에 있어서,An apparatus for interpolating a reference picture,비트스트림을 복호화하여 복수 개의 필터에 대한 정보를 복원하는 필터 정보 복호화기; 및A filter information decoder for decoding the bitstream and restoring information on the plurality of filters; And상기 복원되는 복수 개의 필터에 대한 정보에 의해 식별되는 복수 개의 필터를 이용하여 상기 참조 픽처를 단계적으로 필터링하여 목표 정밀도를 가지는 참조 픽처를 생성하는 필터A filter for generating a reference picture having a target precision by gradually filtering the reference picture using a plurality of filters identified by the information on the plurality of reconstructed filters.를 포함하는 것을 특징으로 하는 참조 픽처 보간 장치. Reference picture interpolation apparatus comprising a.
- 제 24항에 있어서, 상기 필터는,The method of claim 24, wherein the filter,상기 비트스트림을 복호화하여 복원된 제1 필터에 대한 정보에 의해 식별되는 제1 필터를 이용하여 상기 참조 픽처를 보간하며, 복원된 제2 필터에 대한 정보에 의해 식별되는 제2 필터를 이용하여 상기 보간된 참조 픽처를 보간하는 것을 특징으로 하는 참조 픽처 보간 장치.Interpolating the reference picture using a first filter identified by the information on the first filter reconstructed by decoding the bitstream, and using the second filter identified by the information on the reconstructed second filter. A reference picture interpolation device, characterized in that it interpolates an interpolated reference picture.
- 제 25항에 있어서,The method of claim 25,상기 제 1 필터에 대한 정보 및 상기 제 2 필터에 대한 정보는 필터 계수에 대한 정보 또는 복수 개의 필터 중 선택된 필터의 종류에 대한 정보를 포함하는 것을 특징으로 하는 참조 픽처 보간 장치. And the information on the first filter and the information on the second filter include information on filter coefficients or information on a type of a selected filter among a plurality of filters.
- 제 25항에 있어서, 상기 필터는,The method of claim 25, wherein the filter,상기 제 1 필터를 이용하여 상기 참조 픽처의 정수 화소를 기초로 상기 참조 픽처의 부화소를 보간하는 것을 특징으로 하는 참조 픽처 보간 장치.And a subpixel of the reference picture is interpolated based on the integer pixel of the reference picture using the first filter.
- 제 25항에 있어서, 상기 필터는,The method of claim 25, wherein the filter,상기 참조 픽처의 정수 화소와 상기 참조 픽처의 보간된 부화소를 기초로 상기 목표 정밀도의 부화소를 보간하는 것을 특징으로 하는 참조 픽처 보간 장치.And interpolating the sub-pixels of the target precision based on the integer pixels of the reference picture and the interpolated sub-pixels of the reference picture.
- 영상 부/복호화 방법에 있어서,In the video encoding / decoding method,참조 픽처를 복수 개의 필터를 이용하여 단계적으로 필터링하여 목표 정밀도의 참조 픽처로 보간하는 단계; Stepwise filtering the reference picture using a plurality of filters and interpolating the reference picture into a reference picture having a target precision;상기 목표 정밀도로 보간된 참조 픽처를 이용하여 상기 영상을 인터 예측 부호화하는 단계;Inter-prediction encoding the image using the reference picture interpolated at the target precision;비트스트림을 복호화하여 복원되는 정보에 의해 식별되는 상기 복수 개의 필터를 이용하여 참조 픽처를 단계적으로 필터링하여 목표 정밀도의 참조 픽처로 보간하는 단계; 및 Stepwise filtering a reference picture using the plurality of filters identified by the information reconstructed by decoding the bitstream and interpolating the reference picture to a reference picture having a target precision; And상기 보간된 목표 정밀도의 참조 픽처를 이용하여 상기 비트스트림을 인터 예측 복호화하여 영상을 복원하는 단계Reconstructing the image by inter-prediction-decoding the bitstream using the reference picture of the interpolated target precision를 포함하는 것을 특징으로 하는 영상 부/복호화 방법.Image encoding / decoding method comprising a.
- 영상을 부호화하는 방법에 있어서,In the method of encoding an image,참조 픽처를 복수 개의 필터를 이용하여 단계적으로 필터링하여 목표 정밀도의 참조 픽처로 보간하는 단계; 및Stepwise filtering the reference picture using a plurality of filters and interpolating the reference picture into a reference picture having a target precision; And상기 목표 정밀도로 보간된 참조 픽처를 이용하여 상기 영상을 인터 예측 부호화하는 단계Inter-prediction encoding the image using the reference picture interpolated at the target precision를 포함하는 것을 특징으로 하는 영상 부호화 방법.Image encoding method comprising a.
- 제 30 항에 있어서, 상기 목표 정밀도의 참조 픽처로 보간하는 단계는,31. The method of claim 30, wherein interpolating to the reference picture of the target precision comprises:상기 참조 픽처를 상기 복수 개의 필터 중 하나의 필터로 필터링하여 보간하고, 상기 보간된 참조 픽처를 다른 하나의 필터로 필터링하여 보간하는 과정을 반복함으로써 상기 목표 정밀도의 참조 픽처로 보간하는 것을 특징으로 하는 영상 부호화 방법.And interpolating the reference picture with one of the plurality of filters by interpolation, and repeating the interpolation by filtering the interpolated reference picture with another filter and interpolating with the reference picture having the target precision. Image coding method.
- 제 30항에 있어서, The method of claim 30,상기 영상 부호화 방법은,The video encoding method,영역 또는 움직임 벡터마다 움직임 벡터 해상도를 결정하는 해상도 결정단계를 추가로 포함하고,A resolution determining step of determining a motion vector resolution for each region or motion vector,상기 목표 정밀도의 참조 픽처로 보간하는 단계는,Interpolating to the reference picture of the target precision,상기 결정된 움직임 벡터 해상도를 이용하여 필터탭을 결정하는 것을 특징으로 하는 영상 부호화 방법.And a filter tap is determined using the determined motion vector resolution.
- 제 30항에 있어서, The method of claim 30,상기 영상 부호화 방법은,The video encoding method,현재 해상도를 부호화할 영역의 주변 해상도 또는 이전 블록의 해상도와의 변화를 알려주는 해상도 전환 플래그를 발생하는 해상도 전환 플래그 생성단계를 추가로 포함하고,And a resolution conversion flag generation step of generating a resolution conversion flag informing of a change in the peripheral resolution of the region to be encoded with the current resolution or the resolution of the previous block.상기 목표 정밀도의 참조 픽처로 보간하는 단계는,Interpolating to the reference picture of the target precision,상기 해상도 전환 플래그를 이용하여 결정된 움직임 벡터 해상도를 이용하여 필터탭을 결정하는 것을 특징으로 하는 영상 부호화 방법.And a filter tap is determined using a motion vector resolution determined using the resolution switch flag.
- 제 30항에 있어서, The method of claim 30,상기 영상 부호화 방법은,The video encoding method,영상의 영역 또는 움직임 벡터마다 해상도 세트를 다르게 지정하는 해상도 지정 플래그를 생성하는 해상도 지정 플래그 생성단계를 추가로 포함하고,A resolution designation flag generation step of generating a resolution designation flag for designating a different resolution set for each region or motion vector of the image;상기 목표 정밀도의 참조 픽처로 보간하는 단계는,Interpolating to the reference picture of the target precision,상기 해상도 지정 플래그가 단일 해상도임을 의미하는 경우에 상기 단일 해상도에 따라 필터탭을 결정하는 것을 특징으로 하는 영상 부호화 방법.And a filter tap is determined according to the single resolution when the resolution designation flag is a single resolution.
- 영상을 복호화하는 방법에 있어서,In the method of decoding an image,비트스트림을 복호화하여 복원되는 복수 개의 필터에 대한 정보에 의해 식별되는 복수 개의 필터를 이용하여 참조 픽처를 단계적으로 필터링하여 목표 정밀도의 참조 픽처로 보간하는 단계; 및Performing stepwise filtering on a reference picture using a plurality of filters identified by information on a plurality of filters reconstructed by decoding the bitstream, and interpolating the reference picture into a reference picture having a target precision; And상기 보간된 목표 정밀도의 참조 픽처를 이용하여 비트스트림을 인터 예측 복호화하여 영상을 복원하는 단계Reconstructing the image by inter-prediction decoding a bitstream using the interpolated target picture of the reference precision를 포함하는 것을 특징으로 영상 복호화 방법.Image decoding method comprising a.
- 제 35 항에 있어서, 상기 목표 정밀도의 참조 픽처로 보간하는 단계는,36. The method of claim 35, wherein interpolating to the reference picture of the target precision comprises:상기 참조 픽처를 상기 복수 개의 필터 중 하나의 필터로 필터링하여 보간하고, 상기 보간된 참조 픽처를 다른 하나의 필터로 필터링하여 보간하는 과정을 반복함으로써 상기 목표 정밀도의 참조 픽처로 보간하는 것을 특징으로 하는 영상 복호화 방법.And interpolating the reference picture with one of the plurality of filters by interpolation, and repeating the interpolation by filtering the interpolated reference picture with another filter and interpolating with the reference picture having the target precision. Image Decoding Method.
- 제 35항에 있어서, 상기 목표 정밀도의 참조 픽처로 보간하는 단계는,36. The method of claim 35, wherein interpolating to the reference picture of the target precision comprises:픽처의 해상도별로 필터탭의 종류를 설정하고 상기 필터탭의 종류 중에서 보간 결과 현재 픽처와 가장 차이가 적은 필터를 최적 필터로 선택하는 것을 특징으로 하는 영상 복호화 방법.And setting a filter tap type for each picture resolution and selecting a filter having the smallest difference from the current picture as a result of the interpolation among the filter tap types as an optimum filter.
- 제 40항에 있어서, 상기 목표 정밀도의 참조 픽처로 보간하는 단계는,41. The method of claim 40, wherein interpolating to the reference picture of the target precision comprises:움직임 벡터의 해상도에 따라 필터탭을 선택하는 것을 특징으로 하는 영상 복호화 방법.And a filter tap is selected according to the resolution of the motion vector.
- 제 35항에 있어서, 상기 목표 정밀도의 참조 픽처로 보간하는 단계는,36. The method of claim 35, wherein interpolating to the reference picture of the target precision comprises:픽처 또는 슬라이스 내의 임의의 영역 단위로 해상도별 최적 필터탭을 선택하는 것을 특징으로 하는 영상 복호화 장치.And an optimum filter tap for each resolution in units of an arbitrary region within a picture or a slice.픽처의 해상도별로 필터탭의 종류를 설정하고 상기 필터탭의 종류 중에서 보간 결과 현재 픽처와 가장 차이가 적은 필터를 최적 필터로 선택하는 것을 특징으로 하는 영상 복호화 방법.And setting a filter tap type for each picture resolution and selecting a filter having the smallest difference from the current picture as a result of the interpolation among the filter tap types as an optimum filter.
- 제 35항에 있어서, 상기 목표 정밀도의 참조 픽처로 보간하는 단계는,36. The method of claim 35, wherein interpolating to the reference picture of the target precision comprises:움직임 벡터의 해상도에 따라 필터탭을 선택하는 것을 특징으로 하는 영상 복호화 방법.And a filter tap is selected according to the resolution of the motion vector.
- 제 35항에 있어서, 상기 목표 정밀도의 참조 픽처로 보간하는 단계는,36. The method of claim 35, wherein interpolating to the reference picture of the target precision comprises:픽처 또는 슬라이스 내의 임의의 영역 단위로 해상도별 최적 필터탭을 선택하는 것을 특징으로 하는 영상 복호화 방법.An image decoding method comprising selecting an optimum filter tap for each resolution in units of an arbitrary region in a picture or a slice.
- 참조 픽처를 보간하는 방법에 있어서,In the method of interpolating a reference picture,상기 참조 픽처의 정수 화소를 이용하여 부화소를 보간하기 위한 제 1 필터를 선택하는 단계;Selecting a first filter for interpolating subpixels using integer pixels of the reference picture;상기 제 1 필터를 이용하여 상기 참조 픽처를 보간하는 단계;Interpolating the reference picture using the first filter;상기 정수 화소와 상기 보간된 부화소를 이용하여 목표 정밀도의 부화소를 보간하기 위한 제 2 필터를 선택하는 단계; 및Selecting a second filter for interpolating a subpixel having a target precision using the integer pixel and the interpolated subpixel; And제 2 필터를 이용하여 상기 보간된 참조 픽처를 보간하는 단계Interpolating the interpolated reference picture using a second filter를 포함하는 것을 특징으로 하는 참조 픽처 보간 방법.The reference picture interpolation method comprising a.
- 제 42 항에 있어서, 상기 제 1 필터를 선택하는 단계는,43. The method of claim 42, wherein selecting the first filter comprises:상기 참조 픽처의 정수 화소를 이용하여 상기 부화소를 보간하는 경우, 고정된 필터 계수를 가지는 복수 개의 필터 중 현재 픽처와 보간된 참조 픽처 간의 차이가 최소가 되는 하나의 필터를 상기 제 1 필터로서 선택하는 것을 특징으로 하는 참조 픽처 보간 방법.When interpolating the subpixels using integer pixels of the reference picture, one filter having a minimum difference between a current picture and an interpolated reference picture among a plurality of filters having fixed filter coefficients is selected as the first filter. And reference picture interpolation method.
- 제 42 항에 있어서, 상기 제 1 필터를 선택하는 단계는,43. The method of claim 42, wherein selecting the first filter comprises:상기 참조 픽처의 정수 화소를 이용하여 상기 부화소를 보간하는 경우, 현재 픽처와 보간된 참조 픽처 간의 차이가 최소가 되는 필터 계수를 상기 제 1 필터의 필터 계수로서 계산하는 것을 특징으로 하는 참조 픽처 보간 방법. In the case of interpolating the subpixels using the integer pixels of the reference picture, a reference picture interpolation is calculated as a filter coefficient of the first filter, the filter coefficient of which the difference between the current picture and the interpolated reference picture is minimized. Way.
- 제 42 항에 있어서, 상기 제 2 필터를 선택하는 단계는,43. The method of claim 42, wherein selecting the second filter comprises:상기 참조 픽처의 정수 화소와 상기 보간된 부화소를 이용하여 상기 목표 정밀도의 부화소를 보간하는 경우, 고정된 필터 계수를 가지는 복수 개의 필터 중 현재 픽처와 다시 보간된 참조 픽처 간의 차이가 최소가 되는 하나의 필터를 상기 제 2 필터로서 선택하는 것을 특징으로 하는 참조 픽처 보간 방법. When interpolating the sub-pixel having the target precision using the integer pixel of the reference picture and the interpolated sub-pixel, the difference between the current picture and the interpolated reference picture is minimized among a plurality of filters having fixed filter coefficients. And selecting one filter as the second filter.
- 제 42 항에 있어서, 상기 제 2 필터를 선택하는 단계는,43. The method of claim 42, wherein selecting the second filter comprises:상기 참조 픽처의 정수 화소와 상기 보간된 부화소를 이용하여 상기 목표 정밀도의 부화소를 보간하는 경우, 현재 픽처와 다시 보간된 참조 픽처 간의 차이가 최소가 되는 필터 계수를 상기 제 2 필터의 필터 계수로서 계산하는 것을 특징으로 하는 참조 픽처 보간 방법. When interpolating the sub-pixel of the target precision by using the integer pixel of the reference picture and the interpolated sub-pixel, the filter coefficient of the difference between the current picture and the interpolated reference picture is minimized And reference picture interpolation method.
- 제 42 항에 있어서, 상기 참조 픽처 보간 방법은,43. The method of claim 42, wherein the reference picture interpolation method comprises:상기 제 1 필터에 대한 정보 및 상기 제 2 필터에 대한 정보를 부호화하는 단계를 추가로 포함하는 것을 특징으로 하는 참조 픽처 보간 방법.And encoding the information about the first filter and the information about the second filter.
- 참조 픽처를 보간하는 방법에 있어서,In the method of interpolating a reference picture,비트스트림을 복호화하여 제 1 필터에 대한 정보 및 제 2 필터에 대한 정보를 복원하는 단계;Restoring information about the first filter and information about the second filter by decoding the bitstream;상기 제 1 필터에 대한 정보에 의해 식별되는 제 1 필터를 이용하여 상기 참조 픽처를 보간하는 단계; 및Interpolating the reference picture using a first filter identified by the information about the first filter; And상기 제 2 필터에 대한 정보에 의해 식별되는 제 2 필터를 이용하여 상기 보간된 참조 픽처를 보간하는 단계Interpolating the interpolated reference picture using a second filter identified by information about the second filter를 포함하는 것을 특징으로 하는 참조 픽처 보간 방법.The reference picture interpolation method comprising a.
- 제 48 항에 있어서,49. The method of claim 48 wherein상기 제 1 필터에 대한 정보 및 상기 제 2 필터에 대한 정보는 필터 계수에 대한 정보 또는 복수 개의 필터 중 선택된 필터의 종류에 대한 정보를 포함하는 것을 특징으로 하는 참조 픽처 보간 방법.And the information on the first filter and the information on the second filter include information on filter coefficients or information on a type of a selected filter among a plurality of filters.
- 제 48 항에 있어서,49. The method of claim 48 wherein상기 제 1 필터를 이용하여 상기 참조 픽처를 보간하는 단계는,Interpolating the reference picture using the first filter,상기 제 1 필터를 이용하여 상기 참조 픽처의 정수 화소를 기초로 상기 참조 픽처의 부화소를 보간하는 것을 특징으로 하는 참조 픽처 보간 방법.And a subpixel of the reference picture is interpolated based on the integer pixel of the reference picture using the first filter.
- 제 48 항에 있어서,49. The method of claim 48 wherein상기 제 2 필터를 이용하여 상기 보간된 참조 픽처를 보간하는 단계는,Interpolating the interpolated reference picture using the second filter,상기 참조 픽처의 정수 화소와 상기 참조 픽처의 보간된 부화소를 기초로 상기 목표 정밀도의 부화소를 보간하는 것을 특징으로 하는 참조 픽처 보간 방법.And interpolating the sub-pixels of the target precision based on the integer pixels of the reference picture and the interpolated sub-pixels of the reference picture.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/391,517 US9674546B2 (en) | 2009-08-21 | 2010-08-21 | Method and apparatus for interpolating reference picture and method and apparatus for encoding/decoding image using same |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR20090077452 | 2009-08-21 | ||
KR10-2009-0077452 | 2009-08-21 | ||
KR10-2010-0019208 | 2010-03-03 | ||
KR20100019208 | 2010-03-03 | ||
KR1020100081097A KR101678968B1 (en) | 2009-08-21 | 2010-08-20 | Reference Picture Interpolation Method and Apparatus and Video Coding Method and Apparatus Using Same |
KR10-2010-0081097 | 2010-08-20 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2011021912A2 true WO2011021912A2 (en) | 2011-02-24 |
WO2011021912A3 WO2011021912A3 (en) | 2011-06-09 |
Family
ID=43607507
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2010/005569 WO2011021912A2 (en) | 2009-08-21 | 2010-08-21 | Method and apparatus for interpolating reference picture and method and apparatus for encoding/decoding image using same |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2011021912A2 (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR19990066469A (en) * | 1998-01-26 | 1999-08-16 | 윤종용 | Reference block selection method and shape information resolution variable method for shape information pixel interpolation |
KR20070030825A (en) * | 2004-06-10 | 2007-03-16 | 가부시키가이샤 소니 컴퓨터 엔터테인먼트 | Encoder apparatus, encoding method, decoder apparatus, decoding method, program, program recording medium, data recording medium, data structure, and playback apparatus |
KR20080069110A (en) * | 2007-01-22 | 2008-07-25 | 삼성전자주식회사 | Method and apparatus for video encoding and decoding using adaptive interpolation filter |
-
2010
- 2010-08-21 WO PCT/KR2010/005569 patent/WO2011021912A2/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR19990066469A (en) * | 1998-01-26 | 1999-08-16 | 윤종용 | Reference block selection method and shape information resolution variable method for shape information pixel interpolation |
KR20070030825A (en) * | 2004-06-10 | 2007-03-16 | 가부시키가이샤 소니 컴퓨터 엔터테인먼트 | Encoder apparatus, encoding method, decoder apparatus, decoding method, program, program recording medium, data recording medium, data structure, and playback apparatus |
KR20080069110A (en) * | 2007-01-22 | 2008-07-25 | 삼성전자주식회사 | Method and apparatus for video encoding and decoding using adaptive interpolation filter |
Also Published As
Publication number | Publication date |
---|---|
WO2011021912A3 (en) | 2011-06-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020067835A1 (en) | Method and apparatus for processing video signal by using affine prediction | |
WO2017176030A1 (en) | Method and apparatus for processing video signal | |
WO2017043816A1 (en) | Joint inter-intra prediction mode-based image processing method and apparatus therefor | |
WO2017039117A1 (en) | Method for encoding/decoding image and device therefor | |
WO2018097692A2 (en) | Method and apparatus for encoding/decoding image, and recording medium in which bit stream is stored | |
WO2011040794A2 (en) | Method and apparatus for encoding/decoding image using variable-size macroblocks | |
WO2017043734A1 (en) | Method for processing image on basis of inter prediction mode and device therefor | |
WO2020096388A1 (en) | Method for encoding/decoding image signal, and apparatus therefor | |
WO2019182292A1 (en) | Method and apparatus for video signal processing | |
WO2011021914A2 (en) | Method and apparatus for encoding/decoding images using adaptive motion vector resolution | |
WO2017183751A1 (en) | Inter prediction mode-based image processing method and device therefor | |
WO2017003063A1 (en) | Method for processing image based on inter prediction mode and system therefor | |
WO2021015586A1 (en) | Video signal processing method and device | |
WO2020096426A1 (en) | Method for encoding/decoding image signal, and device for same | |
WO2020111843A1 (en) | Video signal processing method and device using intra prediction filtering | |
WO2020222588A1 (en) | Video signal processing method and apparatus using adaptive motion vector resolution | |
WO2019225994A1 (en) | Method and apparatus for processing video signal | |
WO2020184979A1 (en) | Method for encoding/decoding image signal, and device therefor | |
WO2018101700A1 (en) | Image encoding/decoding method and device, and recording medium having stored bitstream therein | |
WO2020159199A1 (en) | Image signal encoding/decoding method and device therefor | |
WO2020130714A1 (en) | Video signal encoding/decoding method, and device therefor | |
WO2020067700A1 (en) | Image encoding/decoding method and device | |
WO2011129672A2 (en) | Video encoding/decoding apparatus and method | |
WO2020004978A1 (en) | Method and apparatus for processing video signal | |
WO2017176092A1 (en) | Method and device for inducing motion prediction information |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10810220 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase in: |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13391517 Country of ref document: US |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 29/05/12) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 10810220 Country of ref document: EP Kind code of ref document: A2 |