WO2012044108A2 - Procédé et appareil de codage et de décodage d'un vecteur de mouvement adaptatif au moyen de données structurelles d'une vidéo, et procédé et appareil de codage et de décodage vidéo les utilisant - Google Patents

Procédé et appareil de codage et de décodage d'un vecteur de mouvement adaptatif au moyen de données structurelles d'une vidéo, et procédé et appareil de codage et de décodage vidéo les utilisant Download PDF

Info

Publication number
WO2012044108A2
WO2012044108A2 PCT/KR2011/007228 KR2011007228W WO2012044108A2 WO 2012044108 A2 WO2012044108 A2 WO 2012044108A2 KR 2011007228 W KR2011007228 W KR 2011007228W WO 2012044108 A2 WO2012044108 A2 WO 2012044108A2
Authority
WO
WIPO (PCT)
Prior art keywords
motion vector
block
current block
prediction
structure information
Prior art date
Application number
PCT/KR2011/007228
Other languages
English (en)
Korean (ko)
Other versions
WO2012044108A3 (fr
Inventor
송진한
임정연
최윤식
김용구
최영호
김종호
Original Assignee
에스케이텔레콤 주식회사
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 에스케이텔레콤 주식회사 filed Critical 에스케이텔레콤 주식회사
Publication of WO2012044108A2 publication Critical patent/WO2012044108A2/fr
Publication of WO2012044108A3 publication Critical patent/WO2012044108A3/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding
    • H04N19/52Processing of motion vectors by encoding by predictive encoding

Definitions

  • An embodiment of the present invention relates to an adaptive motion vector encoding / decoding method and apparatus using structural information of an image and an image encoding / decoding method and apparatus using the same. More specifically, in encoding the motion vector of the current block, a motion vector encoding / decoding method for improving the coding efficiency of the motion vector of the current block by extracting structural information of neighboring blocks adjacent to the current block and determining an efficient predictive motion vector. And an apparatus and method and apparatus for image encoding / decoding using the same.
  • the interblock using inter-screen prediction is encoded not only a prediction error but also a motion vector.
  • a median value of motion vectors of a neighboring block adjacent to the current block to be encoded is mediated.
  • the encoding efficiency of the motion vector is increased by encoding only the difference value between the motion vector to be encoded and the predictive motion vector.
  • JCT-VC Joint Collaborate Team for Video Coding
  • VCEG Video Coding Experts Group
  • MPEG ISO / IEC Moving Picture Experts Group
  • MVC Motion Vector Competition
  • the MVC technique creates a plurality of candidate predictive motion vectors in advance in the image encoding apparatus, and selects a candidate predictive motion vector having a minimum difference between the current motion vector and A method of predictively encoding a current motion vector and transmitting information on which candidate predictive motion vector is selected is also performed.
  • the image decoding apparatus receives information on the selected candidate prediction motion vector together with the difference value of the motion vector to decode the original motion vector.
  • this technique can be selected regardless of the shape and motion of the image or object including the current block.
  • the candidate prediction motion vector is always fixed, and as the number of candidate prediction motion vectors increases, the amount of indexing additional information for the candidate prediction motion vector to be transmitted to the image decoding apparatus increases.
  • an embodiment of the present invention extracts the structural information of a neighboring block adjacent to the current block to determine a predictive motion vector that most closely represents the shape and motion of an object including the current block.
  • the main purpose is to more efficiently encode the motion vector of.
  • a prediction block is generated by predicting a current block, and a residual block is generated by subtracting the prediction block from the current block.
  • a frequency conversion block is generated by transforming and quantizing the residual block, and a prediction motion vector is determined based on structural information of neighboring blocks of the current block, and a differential motion is a difference value between the motion vector of the current block and the prediction motion vector.
  • An image encoder for encoding a vector and the quantized frequency transform block; And determining a prediction motion vector based on the structure information of the neighboring block, extracting a differential motion vector from a bitstream, adding the extracted differential motion vector and the prediction motion vector, and restoring a motion vector of the current block to be restored. And an image decoder for decoding the quantized frequency transform block from the stream and reconstructing the current block by using the decoded frequency transform block and the reconstructed motion vector.
  • an apparatus for encoding an image comprising: a prediction unit for predicting a current block to generate a prediction block; A subtraction unit for generating a residual block by subtracting the prediction block from the current block; A transform and quantizer for generating a frequency transform block by transforming and quantizing the residual block; And an encoder configured to determine a prediction motion vector based on the structure information of a neighboring block, and to encode a differential motion vector that is a difference value between the motion vector of the current block and the prediction motion vector, and the quantized frequency transform block.
  • a video encoding apparatus is provided.
  • an embodiment of the present invention to achieve another object of the present invention, in the apparatus for decoding an image, to determine the predictive motion vector based on the structural information of the neighboring block and extracts the differential motion vector from the bitstream
  • a decoder which adds the difference motion vector and the predictive motion vector to reconstruct the motion vector of the current block and decodes the quantized frequency transform block from the bitstream
  • An inverse quantization and inverse transformation unit for restoring a residual block by inverse quantization and inverse transformation of the frequency transform block
  • a prediction unit for predicting a current block to generate a prediction block
  • an adder configured to reconstruct the current block by adding the reconstructed residual block and the prediction block.
  • a motion vector encoding apparatus comprising: a structure information extractor for extracting structure information of neighboring blocks; A prediction motion vector determiner which determines a prediction motion vector based on the structure information; And a motion vector predictor encoding unit encoding a difference motion vector that is a difference value between the motion vector of the current block and the prediction motion vector.
  • a motion vector decoding apparatus the structure information extracting unit for extracting the structure information of the neighboring block; A prediction motion vector determiner which determines a prediction motion vector based on the structure information; A differential motion vector extractor which extracts a differential motion vector from the bitstream; And a motion vector decoder configured to reconstruct the motion vector of the current block by adding the difference motion vector and the prediction motion vector.
  • an embodiment of the present invention in a method for encoding / decoding an image, predicts a current block to generate a prediction block and subtracts the prediction block from the current block A block is generated, the residual block is transformed and quantized to generate a frequency transform block, and a prediction motion vector is determined based on the structural information of neighboring blocks of the current block, and the difference between the motion vector and the prediction motion vector of the current block is determined.
  • an embodiment of the present invention a method for encoding an image, comprising: a prediction step of predicting a current block to generate a prediction block; Subtracting the prediction block from the current block to generate a residual block; A transform and quantization step of transforming and quantizing the residual block to generate a frequency transform block; And encoding a quantized frequency transform block and a differential motion vector, which is a difference value between the motion vector of the current block and the predictive motion vector, by determining a prediction motion vector based on structural information of a neighboring block.
  • a video encoding method is provided.
  • an embodiment of the present invention in the method for decoding an image, determines a predictive motion vector based on the structure information of the neighboring block, extracts the differential motion vector from the bitstream A decoding step of reconstructing the motion vector of the current block by adding the difference motion vector and the prediction motion vector and decoding the quantized frequency transform block from the bitstream; Inverse quantization and inverse transformation for restoring a residual block by inverse quantization and inverse transformation of the frequency transform block; A prediction step of predicting a current block to generate a prediction block; And an addition step of reconstructing the current block by adding the reconstructed residual block and the prediction block.
  • a motion vector encoding method the structure information extraction step of extracting the structure information of the neighboring block; A prediction motion vector determining step of determining a prediction motion vector based on the structure information; And a motion vector predictive encoding step of encoding a differential motion vector, which is a difference value between the motion vector of the current block and the predictive motion vector.
  • a motion vector decoding method the structure information extraction step of extracting the structure information of the neighboring block; A prediction motion vector determining step of determining a prediction motion vector based on the structure information; A differential motion vector extracting step of extracting a differential motion vector from the bitstream; And a motion vector decoding step of reconstructing the motion vector of the current block by adding the difference motion vector and the prediction motion vector.
  • the structural information of the neighboring block adjacent to the current block is extracted by extracting structural information from the neighboring block.
  • FIG. 1 is a block diagram schematically illustrating a video encoding apparatus according to an embodiment of the present invention.
  • FIG. 2 is a diagram illustrating a motion vector encoding apparatus 200 according to an embodiment of the present invention.
  • FIG. 3 is a diagram illustrating an example in which the structural information extracting unit 210 extracts edge information using a Sobel filter.
  • FIG. 4 is a diagram illustrating a neighboring block of the current block and a motion vector of the neighboring block.
  • 5 is a diagram illustrating the partitioning of neighboring blocks by edge lines when the current block is the same as the macroblock size.
  • FIG. 6 is a diagram illustrating a partitioning state of neighboring blocks by an edge line when the current block is a subblock divided from a macroblock.
  • FIG. 7 is a block diagram schematically illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
  • FIG. 8 is a diagram illustrating a motion vector decoding apparatus 800 according to an embodiment of the present invention.
  • a video encoding apparatus (Video Encoding Apparatus), a video decoding apparatus (Video Decoding Apparatus) to be described below is a personal computer (PC), notebook computer, personal digital assistant (PDA), portable multimedia player (PMP) It may be a user terminal such as a portable multimedia player (PSP), a PlayStation Portable (PSP), a wireless communication terminal, a smart phone, a TV, or a server terminal such as an application server or a service server.
  • a communication device such as a communication modem for communicating with various devices or a wired / wireless communication network, a memory for storing various programs and data for encoding or decoding an image or inter or intra prediction for encoding or decoding, and executing a program And a microprocessor for controlling and the like. It can mean a variety of devices.
  • the image encoded in the bitstream by the video encoding apparatus is real-time or non-real-time through the wired or wireless communication network, such as the Internet, local area wireless communication network, wireless LAN network, WiBro network, mobile communication network, or the like, or a cable, universal serial bus (USB: Universal) It may be transmitted to an image decoding apparatus through various communication interfaces such as a serial bus, and may be decoded by the image decoding apparatus to restore and reproduce the image.
  • wired or wireless communication network such as the Internet, local area wireless communication network, wireless LAN network, WiBro network, mobile communication network, or the like, or a cable, universal serial bus (USB: Universal) It may be transmitted to an image decoding apparatus through various communication interfaces such as a serial bus, and may be decoded by the image decoding apparatus to restore and reproduce the image.
  • USB universal serial bus
  • a video may be composed of a series of pictures, and each picture may be divided into a predetermined area such as a frame or a block.
  • the divided blocks may be classified into intra blocks and inter blocks according to an encoding method.
  • An intra block refers to a block that is encoded by using an intra prediction coding method.
  • An intra prediction coding is performed by using pixels of blocks previously encoded, decoded, and reconstructed in a current picture that performs current encoding.
  • a prediction block is generated by predicting pixels of a block and a difference value with pixels of the current block is encoded.
  • An inter block refers to a block that is encoded using inter prediction coding.
  • Inter prediction coding generates a prediction block by predicting a current block within a current picture by referring to one or more past or future pictures, and then generates a current block. This is a method of encoding the difference value with.
  • a frame referred to for encoding or decoding the current picture is referred to as a reference frame.
  • FIG. 1 is a block diagram schematically illustrating a video encoding apparatus according to an embodiment of the present invention.
  • the image encoding apparatus 100 is an apparatus for encoding an image, and includes an intra predictor 110, an inter predictor 120, a subtractor 130, a transform and quantizer 140, It may be configured to include an encoder 150, an inverse quantization and inverse transform unit 160, an adder 170, and a frame memory 180.
  • the input image to be encoded is input in units of macro blocks.
  • the macro blocks are in the form of M ⁇ N, M and N each have a size of 2 n , and M and N may be the same or different. have. Thus, it may be equal to or larger than the macro block of H.264.
  • the intra predictor 110 generates a prediction block using adjacent pixels to predict the current macroblock. That is, the intra prediction unit 110 generates a prediction block according to the mode of the intra prediction unit 110 by using neighboring pixels of the current macro block, which have already been encoded and reconstructed.
  • the inter prediction unit 120 generates a prediction block by using another frame to predict the current macro block. That is, the inter predictor 120 generates a motion vector through motion estimation according to the mode of the inter predictor 120 in a previous frame that has already been encoded and reconstructed, and generates a predictive block in a motion compensation process using the motion vector. do.
  • the subtraction unit 130 subtracts the prediction block from the current block to generate a residual block. That is, the subtractor 130 generates a residual block by subtracting the pixel value of the current block and the pixel value of the prediction block generated by the intra predictor 110 or the inter predictor 120.
  • the transform and quantization unit 140 converts and quantizes the residual block generated by the subtractor 130 into frequency coefficients. That is, the transform and quantization unit 140 generates a residual block having frequency coefficients from the residual coefficients of the residual block generated by the subtractor 130 and quantizes the residual blocks of the generated frequency coefficients. For example, a method of transforming an image signal in a spatial domain into a frequency domain such as a Hadamard transform or a discrete cosine transform based integer transform is used.
  • Various quantization techniques such as Dead Zone Uniform Threshold Quantization (DZUTQ) (hereinafter referred to as 'DZUTQ') or Quantization Weighted Matrix, may be used.
  • DZUTQ Dead Zone Uniform Threshold Quantization
  • 'DZUTQ' Quantization Weighted Matrix
  • the encoder 150 may generate encoded data by encoding the residual block transformed and quantized by the transform and quantizer 140.
  • an entropy encoding technique may be used, but various encoding techniques may be used without being limited thereto.
  • the encoder 150 may include not only a bit string encoding the quantized frequency coefficients, but also various pieces of information necessary to decode the encoded bit string in the encoded data. That is, the coded data may include a first field including a coded block pattern (CBP), a delta quantization coefficient, and a bit string in which the quantization frequency coefficients are encoded, and information necessary for prediction (eg, In the case of intra prediction, a second field including bits for an intra prediction mode or a motion vector in the case of inter prediction may be included.
  • CBP coded block pattern
  • a second field including bits for an intra prediction mode or a motion vector in the case of inter prediction may be included.
  • the encoder 150 extracts the structural information of the neighboring block, and uses the extracted structural information, the motion vector of the neighboring block to express the motion of the same object as the motion vector of the current block. Is determined as a predictive motion vector and a differential motion vector, which is a difference value between the motion vector of the current block and the predictive motion vector, can be encoded. This will be described in detail with reference to FIG. 2 in the following process.
  • the inverse quantization and inverse transform unit 160 inverse quantizes and inverse transforms the residual block transformed and quantized by the transform and quantization unit 140 to reconstruct the residual block.
  • Inverse quantization and inverse transformation may be performed by inversely performing a transform process and a quantization process performed by the transform and quantization unit 140. That is, the inverse quantization and inverse transform unit 160 may use the transform and quantization unit (for example, information on a transform and quantization type) generated and transmitted from the transform and quantization unit 140 (for example, information about a transform and quantization type).
  • the inverse quantization and inverse transformation may be performed by performing the transform and quantization inversely.
  • the adder 170 generates a reconstructed block by adding the prediction block generated by the intra predictor 110 or the inter predictor 120 and the residual block generated by the inverse quantization and inverse transformer 160.
  • the frame memory 180 is used as a reference block to store a block reconstructed by the adder 170 to generate a prediction block when performing intra or inter prediction.
  • FIG. 2 is a diagram illustrating a motion vector encoding apparatus 200 according to an embodiment of the present invention, and the encoder 150 of FIG. 1 includes a motion vector encoding apparatus 200 according to an embodiment of the present invention. can do.
  • the motion vector encoding apparatus 200 includes a structure information extractor 210, a predictive motion vector determiner 220, and a motion vector predictor encoder 230.
  • the structure information extractor 210 extracts the structure information of the neighboring block, and the predictive motion vector determiner 220 determines an optimal predictive motion vector based on the structure information of the neighboring block, and the motion vector predictor encoder 230. ) Encodes a differential motion vector, which is a difference value between the motion vector of the current block and the predictive motion vector.
  • the structure information extracting unit 210 extracts the structure information of the neighboring block to analyze what type of object the current block is connected to the neighboring block.
  • As representing the structure information of the neighboring block there can be many features such as edge, contour, and corner, and by extracting and analyzing such structure information, the neighboring block adjacent to the current block It is possible to distinguish whether the object is included in the object, and by selecting the motion vector of the neighboring block belonging to the same object as the current block as the predictive motion vector, it is possible to determine the predictive motion vector more similar to the current motion vector.
  • FIG. 3 is a diagram illustrating an example in which the structural information extracting unit 210 extracts edge information using a Sobel filter.
  • the neighboring block may refer to the left block, the upper block, and the upper right block of the current block as shown in FIG. 3.
  • the Sobel filter may be composed of two 3x3 filters, g x for obtaining a vertical gradient and g y for obtaining a horizontal gradient, as shown in Equation 1.
  • the size G of the edge of the pixel may be calculated by Equation 2.
  • the direction of the edge can be calculated by the equation (3).
  • the Sobel filter is applied to all pixels of the neighboring block adjacent to the current block, and the points where the edge size is larger than a certain threshold and the point G having the largest edge size among them are found.
  • Peripheral blocks are divided according to the direction of a straight line (edge line) composed of points having an edge size larger than the value.
  • the portion of the divided neighboring blocks adjacent to the current block may be determined as an area belonging to the same object as the current block. If the size of all edges of the neighboring block is not larger than a certain threshold, it is considered that there is no structural feature point and all neighboring blocks are determined to be the same object as the current block without dividing the neighboring block.
  • the prediction motion vector determiner 220 determines an optimal prediction motion vector based on the structure information of the neighboring blocks.
  • the prediction motion vector determiner 220 may determine the prediction motion vector using the motion vector of the neighboring block representing the motion of the same object as the current block based on the structure information analyzed by the structure information extractor 210. .
  • FIG. 4 is a diagram illustrating a neighboring block of the current block and a motion vector of the neighboring block.
  • the middle of three motion vectors in the left block (A), the upper block (B), and the upper right block (C) based on the current block as shown in FIG. It can be calculated as Median.
  • the predicted motion vector is obtained by using the median ⁇ mv a , mv b , mv c ⁇ of the motion vectors of the nearest subblocks based on the upper left boundary of the current block.
  • 5 is a diagram illustrating the partitioning of neighboring blocks by edge lines when the current block is the same as the macroblock size.
  • the prediction motion vector determiner 220 is divided by the position of the current block to be encoded and the edge lines extracted by the structure information extractor 210, as shown in FIGS. 5A and 5B, to determine the predicted motion vector.
  • a predicted motion vector is calculated by referring to motion vectors of a neighboring block (or a neighboring subblock) located in an adjacent division region close to the current block among the neighboring blocks.
  • the neighboring block or neighboring subblock closest to the upper left point of the current block in the region (adjacent division region) of the neighboring block adjacent to the current block.
  • the predicted motion vector can be determined from the motion vectors.
  • the edge line crosses the current block
  • a region having a large area among the areas of the current block divided into the edge lines may be used as a reference region of the current block.
  • the predicted motion vector may be calculated by referring to motion vectors of a neighboring block (or a neighboring subblock) located closest to the upper left point of the reference region in a partition adjacent to the current block as a reference.
  • a prediction motion vector may be set as an intermediate value of three motion vectors.
  • the predictive motion vector is converted into a larger or smaller value among the two motion vectors.
  • the prediction motion vector may be set to a value similar to the motion vector of the current block.
  • the motion vector may be set as a prediction motion vector.
  • FIG. 6 is a diagram illustrating a partitioning state of neighboring blocks by an edge line when the current block is a subblock divided from a macroblock.
  • the predictive motion vector determiner 220 determines the position of the current block to be encoded and the structure information extractor 210 to determine the predicted motion vector as shown in FIGS. 6A, 6B, 6D, and 6D.
  • a predicted motion vector is calculated by referring to motion vectors of a neighboring block (or a neighboring subblock) located in a region close to the current block among the regions of the neighboring block divided by the extracted edge information.
  • a prediction motion vector may be determined from motion vectors of neighboring blocks located closest to the current block in an adjacent division region. Even in this case, when the edge line crosses the current block, a region having a large area among the areas of the current block divided into the edge lines may be used as the reference region of the current block.
  • three, two, or one peripheral motion vector may be used according to a form in which neighboring blocks are divided by edge lines to determine a predicted motion vector.
  • the location of the neighboring block (or sub block) may also vary.
  • the predicted motion vector having a motion similar to the current motion vector can be more accurately determined by referring to motion vectors of neighboring blocks belonging to the same object as the current block. Therefore, when there are three motion vectors of neighboring blocks (or neighboring subblocks) located in a partition close to the current block, the predictive motion vector may be set as an intermediate value of the three motion vectors.
  • the predictive motion vector is set to the larger or smaller value of the two motion vectors, or the predictive motion vector is set to a value similar to the motion vector of the current block. Can be set.
  • the motion vector may be set as a prediction motion vector.
  • the motion vector predictor encoding unit 230 encodes a differential motion vector which is a difference value between the motion vector of the current block and the predictive motion vector received from the predictive motion vector determiner 220.
  • Entropy coding techniques such as fixed length coding, variable length coding, arithmetic coding, and the like may be used as a technique for encoding the differential motion vector.
  • FIG. 7 is a block diagram schematically illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
  • the image decoding apparatus 700 includes a decoder 710, an inverse quantization and inverse transform unit 720, an intra predictor 730, an inter predictor 740, and an adder 750. It may be configured to include a frame memory 760.
  • the decoder 710 may decode the encoded data to extract information necessary for block decoding. That is, the decoder 710 extracts frequency mask table information by decoding the coded data, extracts a quantized frequency coefficient sequence using the frequency mask table information, and inversely scans the quantized frequency coefficient sequence to have a residual having quantized frequency coefficients. You can create a block. In this case, the decoder 710 may extract and decode the encoded residual block from the first field included in the encoded data, extract information necessary for prediction from the second field included in the encoded data, and extract the extracted residual block. Information required for prediction may be transmitted to the intra predictor 730 or the inter predictor 740 so that the predictor predicts the current block in the same manner as the predictor of the image encoding apparatus 100.
  • the inverse quantization and inverse transformer 720 inversely quantizes the residual block to generate an inverse quantized residual block, and inversely transforms the generated quantization block to generate a residual block.
  • the intra predictor 730 generates a predictive block for the current block by using information required for intra prediction transmitted from the decoder 710.
  • the inter prediction unit 730 generates a prediction block for the current block by using information required for inter prediction transmitted from the decoder 710.
  • the adder 750 reconstructs the current block by adding the inverse transform residual block and the prediction block.
  • the current block reconstructed by the adder 750 may be transferred to the frame memory 760, and used by the predictor to predict another block.
  • the frame memory 760 stores the reconstructed image to enable generation of intra and inter prediction blocks.
  • the decoder 710 may decode or extract encoded data and decode or extract information necessary for decoding as well as transformed and quantized residual blocks.
  • the information necessary for decoding refers to information necessary for decoding the coded bit string in the encoded data. For example, information about a block type, information about an intra prediction mode when the prediction mode is an intra prediction mode, and an inter prediction mode In the case of the prediction mode, the information may be information on a motion vector, information on a transform and quantization type, or may be various other information.
  • FIG. 8 is a diagram illustrating a motion vector decoding apparatus 800 according to an embodiment of the present invention, and the decoder 710 of FIG. 7 includes a motion vector decoding apparatus 800 according to an embodiment of the present invention. can do.
  • the motion vector decoding apparatus 800 includes a structure information extractor 810, a predictive motion vector determiner 820, a differential motion vector extractor 830, and a motion vector decoder 840. Include.
  • the structure information extractor 810 extracts structure information of the neighboring block, and the predictive motion vector determiner 820 determines an optimal predictive motion vector based on the structure information of the neighboring block, and the differential motion vector extractor 830. ) Extracts the differential motion vector from the bitstream, and the motion vector decoder 840 reconstructs the motion vector of the current block by adding the predicted motion vector and the differential motion vector.
  • the structure information extractor 810 and the predictive motion vector determiner 820 have the same or similar functions as the structure information extractor 210 and the predictive motion vector determiner 130 of the motion vector encoding apparatus 200. Description is omitted.
  • the motion vector decoder 840 restores the motion vector of the current block by adding the predicted motion vector calculated from the predictive motion vector determiner 820 and the difference motion vector value extracted by the differential motion vector extractor 830.
  • the image and the motion vector are predictively encoded and decoded in units of blocks.
  • the image and the motion vector are not necessarily predictively encoded and decoded in units of blocks.
  • the prediction coder may be predictively coded in a predetermined coding unit such as a slice, a picture, or a sequence unit instead of a block unit, or may be predictively coded in the form of various regions of atypical shape instead of a block shape.
  • the image encoding / decoding apparatus uses the encoded data (bitstream) output terminal of the image encoding apparatus 100 of FIG. 1 to encode the encoded data (bitstream) of the image decoding apparatus 700 of FIG. 7. This can be achieved by connecting to the input.
  • An image encoding / decoding apparatus generates a prediction block by predicting a current block, subtracts the prediction block from the current block, generates a residual block, and transforms and quantizes the residual block to generate a frequency transform block. And determine the predictive motion vector based on the structural information of the neighboring blocks of the current block, and then encode the differential motion vector and the quantized frequency transform block, which is the difference between the motion vector and the predictive motion vector, Based on the structural information, the predictive motion vector is determined, the differential motion vector is extracted from the bitstream, and the extracted differential motion vector and the predictive motion vector are added to restore the motion vector of the current block to be recovered and quantized from the bitstream. And decode the decoded frequency transform block and the reconstructed motion vector. It includes a video decoder to restore the current block by using.
  • the image encoder may be implemented by the image encoding apparatus 100 according to an embodiment of the present invention
  • the image decoder may be implemented by the image decoding apparatus 700 according to the embodiment of the present invention.
  • the prediction step of generating a prediction block by predicting the current block (S910), the subtraction step of generating a residual block by subtracting the prediction block from the current block (S920), A transform and quantization step of transforming and quantizing the residual block to generate a frequency transform block (S930) and determining a predictive motion vector based on the structure information of the neighboring block to determine a difference value between the motion vector of the current block and the predictive motion vector.
  • An encoding step (S940) of encoding the difference motion vector and the quantized frequency transform block (S910), the subtraction step of generating a residual block by subtracting the prediction block from the current block (S920), A transform and quantization step of transforming and quantizing the residual block to generate a frequency transform block (S930) and determining a predictive motion vector based on the structure information of the neighboring block to determine a difference value between the motion vector of the current block and the predictive motion vector.
  • the prediction step S910 corresponds to the operation of the predictor 110 or 120
  • the subtraction step S920 corresponds to the operation of the subtractor 130
  • the transform and quantization step S930 is a transform and quantization unit ( Corresponding to the operation of 140, and the encoding step (S940) corresponds to the operation of the encoder 150, detailed description thereof will be omitted.
  • the prediction motion vector is determined based on the structure information of the neighboring block, the difference motion vector is extracted from the bitstream, and the difference motion vector and the prediction motion vector are added.
  • the decoding step S1010 corresponds to the operation of the decoding unit 710
  • the inverse quantization and inverse transform step S1020 corresponds to the operation of the inverse quantization and inverse transform unit 720
  • the prediction step S1030 is the prediction unit.
  • the addition step (S1040) corresponds to the operation of the adder 750, detailed description thereof will be omitted.
  • Motion vector encoding method for extracting the structural information of the neighboring block, the prediction motion vector determining step of determining a prediction motion vector based on the structural information (S1120) And a motion vector prediction encoding step (S1130) of encoding a difference motion vector that is a difference value between the motion vector of the current block and the prediction motion vector.
  • the structure information extraction step (S1110) corresponds to the operation of the structure information extractor 210
  • the prediction motion vector determining step (S1120) corresponds to the operation of the prediction motion vector determiner 220, and the motion vector prediction encoding is performed.
  • Step S1130 corresponds to the operation of the motion vector predictor encoding unit 230, and thus a detailed description thereof will be omitted.
  • the structural information extraction step (S1210) for extracting the structural information of the neighboring block
  • the prediction motion vector determination step (S1220) for determining a prediction motion vector based on the structure information
  • a differential motion vector extracting step (S1230) for extracting a differential motion vector from the bitstream
  • a motion vector decoding step (S1240) for reconstructing the motion vector of the current block by adding the differential motion vector and the prediction motion vector.
  • the structural information extraction step (S1210) corresponds to the operation of the structural information extraction unit 810
  • the prediction motion vector determination step (S1220) corresponds to the operation of the prediction motion vector determination unit 820, and extracts the differential motion vector.
  • Operation S1230 corresponds to the operation of the differential motion vector extractor 830
  • motion vector decoding operation S1240 corresponds to the operation of the motion vector decoder 840, and thus detailed description thereof will be omitted.
  • An image encoding / decoding method may be realized by combining the image encoding method according to an embodiment of the present invention and the image decoding method according to an embodiment of the present invention.
  • a prediction block is generated by predicting a current block
  • a residual block is generated by subtracting a prediction block from the current block
  • a frequency transform block is generated by transforming and quantizing the residual block.
  • Image encoding step and neighboring block encoding a motion vector and a quantized frequency transform block, which is a difference value between the motion vector and the predictive motion vector of the current block by determining the predictive motion vector based on the structural information of the neighboring block of the current block.
  • Determining the predictive motion vector from the bitstream extracting the differential motion vector from the bitstream, adding the extracted differential motion vector and the predictive motion vector, and restoring the motion vector of the current block to be recovered and quantizing the frequency from the bitstream.
  • Decoded block, decoded frequency transform block and reconstructed motion vector Used to include an image decoding method comprising: restoring the current block.
  • the image encoding step may be implemented as an image encoding step according to an embodiment of the present invention
  • the image decoding step may be implemented as an image decoding step according to an embodiment of the present invention.
  • a non-zero frequency is concentrated on low frequency by applying a frequency domain encoding information by dividing a frequency conversion block into a frequency domain, which is applied to a technical field of encoding and decoding an image. It is a very useful invention that can efficiently improve the coding efficiency in consideration of the characteristics of a general video.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Dans l'un de ses modes de réalisation, la présente invention se rapporte à un procédé et à un appareil de codage et de décodage d'un vecteur de mouvement adaptatif au moyen de données structurelles d'une vidéo. L'invention se rapporte d'autre part à un procédé et à un appareil de codage et de décodage vidéo les utilisant. Le mode de réalisation de la présente invention qui se rapporte au procédé et à l'appareil de codage et de décodage du vecteur de mouvement et qui se rapporte d'autre part au procédé et à l'appareil de codage et de décodage vidéo comprend les étapes suivantes consistant : à déterminer un vecteur de mouvement prédictif sur la base de données structurelles d'un bloc périphérique d'un bloc actuel ; à coder un vecteur de mouvement différentiel, qui correspond à la différence de valeur entre le bloc actuel et le vecteur de mouvement prédictif ; à déterminer le vecteur de mouvement prédictif sur la base de données structurelles du bloc périphérique ; à extraire le vecteur de mouvement différentiel d'un train de bits ; à ajouter le vecteur de mouvement différentiel extrait et le vecteur de mouvement prédictif ; et à récupérer et à décoder le vecteur de mouvement du bloc actuel devant être récupéré.
PCT/KR2011/007228 2010-09-30 2011-09-30 Procédé et appareil de codage et de décodage d'un vecteur de mouvement adaptatif au moyen de données structurelles d'une vidéo, et procédé et appareil de codage et de décodage vidéo les utilisant WO2012044108A2 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2010-0095118 2010-09-30
KR1020100095118A KR101658592B1 (ko) 2010-09-30 2010-09-30 영상의 구조적 정보를 이용한 적응적 움직임 벡터 부호화/복호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치

Publications (2)

Publication Number Publication Date
WO2012044108A2 true WO2012044108A2 (fr) 2012-04-05
WO2012044108A3 WO2012044108A3 (fr) 2012-05-31

Family

ID=45893686

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2011/007228 WO2012044108A2 (fr) 2010-09-30 2011-09-30 Procédé et appareil de codage et de décodage d'un vecteur de mouvement adaptatif au moyen de données structurelles d'une vidéo, et procédé et appareil de codage et de décodage vidéo les utilisant

Country Status (2)

Country Link
KR (1) KR101658592B1 (fr)
WO (1) WO2012044108A2 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113468905A (zh) * 2021-07-12 2021-10-01 深圳思谋信息科技有限公司 图形码识别方法、装置、计算机设备和存储介质

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11240526B2 (en) 2017-01-02 2022-02-01 Industry-University Cooperation Foundation Hanyang University Method and apparatus for decoding image using interpicture prediction

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08205194A (ja) * 1995-01-24 1996-08-09 Nec Corp 動き補償フレーム間予測装置
KR100212559B1 (ko) * 1996-03-22 1999-08-02 전주범 물체의 윤곽 부호화 시스템 및 그의 움직임 추정방법
KR20080088039A (ko) * 2007-03-28 2008-10-02 삼성전자주식회사 영상의 부호화, 복호화 방법 및 장치

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08205194A (ja) * 1995-01-24 1996-08-09 Nec Corp 動き補償フレーム間予測装置
KR100212559B1 (ko) * 1996-03-22 1999-08-02 전주범 물체의 윤곽 부호화 시스템 및 그의 움직임 추정방법
KR20080088039A (ko) * 2007-03-28 2008-10-02 삼성전자주식회사 영상의 부호화, 복호화 방법 및 장치

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113468905A (zh) * 2021-07-12 2021-10-01 深圳思谋信息科技有限公司 图形码识别方法、装置、计算机设备和存储介质
CN113468905B (zh) * 2021-07-12 2024-03-26 深圳思谋信息科技有限公司 图形码识别方法、装置、计算机设备和存储介质

Also Published As

Publication number Publication date
KR101658592B1 (ko) 2016-09-21
KR20120033546A (ko) 2012-04-09
WO2012044108A3 (fr) 2012-05-31

Similar Documents

Publication Publication Date Title
WO2012015275A2 (fr) Procédé et dispositif de codage/décodage d'image à l'aide d'une prédiction fragmentée de bloc
WO2013002549A2 (fr) Procédés et appareil de codage/décodage d'une image
WO2013070006A1 (fr) Procédé et appareil de codage et de décodage vidéo faisant appel à un mode de saut
WO2011062392A2 (fr) Procédé et appareil pour coder/décoder un vecteur de mouvement en sélectionnant un ensemble de vecteurs de mouvement candidats prédits, procédé et appareil de codage/décodage d'image associés
WO2010050706A2 (fr) Procédé et appareil de codage d'un vecteur mouvement, procédé et appareil de codage/décodage d'une image faisant appel à ces derniers
WO2011052897A2 (fr) Procédé et appareil de codage/décodage de vecteurs de mouvement par segmentation spatiale, et procédé et appareil pour le codage/décodage d'images au moyen de ceux-ci
WO2012077960A2 (fr) Procédé et dispositif de codage/décodage d'une image par inter-prédiction en utilisant un bloc aléatoire
WO2013005941A2 (fr) Appareil et procédé de codage et de décodage d'une image
WO2012099440A2 (fr) Appareil et procédé de génération/récupération d'informations de mouvement basées sur un encodage d'indice de vecteur de mouvement prédictif, et appareil et procédé d'encodage/décodage d'images utilisant ce dernier
WO2013002550A2 (fr) Méthode et appareil de codage/décodage par décision de mode de l'unité de codage grande vitesse
WO2012033344A2 (fr) Procédé et appareil destinés à coder/décoder des images à l'aide de la sélection efficace d'un groupe de mode de prédiction intra
WO2012046979A2 (fr) Procédé et appareil de codage de bloc de conversion de fréquence au moyen d'une table de masques de fréquence, et procédé et appareil de codage/décodage vidéo au moyen de celle-ci
WO2010044569A2 (fr) Procédé et appareil permettant de générer une trame de référence et procédé et appareil permettant le coder/décoder une image au moyen de la trame de référence
WO2012077959A2 (fr) Procédé et dispositif pour coder/décoder une image dans un domaine spatial pour une composante de bruit
WO2012011672A2 (fr) Procédé et dispositif de codage/décodage d'images par mode de saut étendu
WO2013069996A1 (fr) Procédé et appareil de codage/décodage d'image à l'aide d'un filtre à boucle adaptatif sur un domaine de fréquence faisant intervenir une conversion
WO2012081917A2 (fr) Procédé et appareil de génération d'informations de mouvement codées/de récupération d'informations de mouvement à l'aide d'une intégration d'informations de mouvement, et procédé et appareil de codage/de décodage d'images qui utilisent ledit procédé et ledit appareil
WO2011037337A2 (fr) Procédé et appareil de codage-décodage d'images tenant compte de composantes basse fréquence
WO2010047498A2 (fr) Procédé et dispositif de codage/décodage d'images utilisant des coefficients d'un filtre d'interpolation adaptatif
WO2012021040A2 (fr) Procédé et dispositif de codage/décodage d'image ayant un mode de filtrage amovible
WO2011111954A2 (fr) Procédé et appareil de codage/décodage de vecteur de mouvement, à l'aide de combinaison de résolutions de vecteur de mouvement, et procédé et appareil de codage/décodage d'images utilisant ceux-ci
WO2011052990A2 (fr) Procédé et appareil de codage/décodage d'images sur la base d'une résolution adaptative
US10715805B2 (en) Method and device for subband coding frequency conversion unit, and method and device for image encoding/decoding using same
KR20130057953A (ko) 세밀한 예측 단위를 사용하는 효과적인 부호화/복호화 방법 및 장치
WO2011053054A2 (fr) Procédé et appareil de codage/décodage de vecteur de mouvement au moyen d'une limitation de résolution de vecteur de mouvement, et procédé et appareil de codage/décodage d'image dans lesquels ils interviennent

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11829614

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase in:

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 12/07/13)

122 Ep: pct application non-entry in european phase

Ref document number: 11829614

Country of ref document: EP

Kind code of ref document: A2