WO2012011672A2 - 확장된 스킵모드를 이용한 영상 부호화/복호화 방법 및 장치 - Google Patents
확장된 스킵모드를 이용한 영상 부호화/복호화 방법 및 장치 Download PDFInfo
- Publication number
- WO2012011672A2 WO2012011672A2 PCT/KR2011/004521 KR2011004521W WO2012011672A2 WO 2012011672 A2 WO2012011672 A2 WO 2012011672A2 KR 2011004521 W KR2011004521 W KR 2011004521W WO 2012011672 A2 WO2012011672 A2 WO 2012011672A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- block
- motion vector
- backward
- mode
- current block
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/109—Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/137—Motion inside a coding unit, e.g. average field, frame or block difference
- H04N19/139—Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/567—Motion estimation based on rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/117—Filters, e.g. for pre-processing or post-processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
- H04N19/82—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
- H04N19/86—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
Definitions
- An embodiment of the present invention relates to a method and apparatus for image encoding / decoding using an extended skip mode. More specifically, when performing block-based motion prediction in a video data compression apparatus, redundancy between the current block and the reference image data can be applied by enabling a skip mode in one direction by using the data of the decoded reference image.
- the present invention relates to a method and apparatus for encoding / decoding using an extended skip mode for further improving the performance of video data compression by efficiently removing the same to obtain better reconstructed picture quality at the same bit rate.
- a mode in which no data (quantized transform coefficients, motion vectors, etc.) are transmitted except the mode information is defined as a skip mode.
- the skip mode in H.264 / AVC may be divided into a case of a P slice and a case of a B slice.
- motion compensation is performed by selecting the reference frame closest to L0 (List 0), which is a reference frame buffer, by using the median of the motion vector of the neighboring block of the current block E.
- L0 which is a reference frame buffer
- Skip mode in B slices can occur in two cases depending on the DIRECT mode.
- the DIRECT mode is a temporal direct mode
- a motion vector of a -located block is used to predict a motion vector of the current block.
- the motion compensation is performed by the weighted sum of two blocks indicated by the two prediction motion vectors thus obtained, and likewise, no additional information about the residual signal or the motion vector is sent.
- the DIRECT mode is a spatial direct mode
- two motion vectors are generated by using the L0 and L1 motion vectors of neighboring blocks A, B, and C of the current block E, similar to the SKIP mode of a P slice.
- the motion compensation of the current block is performed by adding the weights of the indicated blocks, and the encoder does not send additional information other than the mode information.
- both the temporal and spatial direct prediction modes refer to two motion vectors, and thus the motion block most similar to the current block. It is structured to generate.
- an embodiment of the present invention provides a skip mode in one direction by using data of a decoded reference image when performing block-based motion prediction in a video data compression apparatus.
- the purpose of the present invention is to improve compression efficiency by efficiently removing redundancy between blocks and reference image data.
- the main purpose is to further improve the performance of video data compression to obtain better reconstructed picture quality at the same bit rate.
- a motion vector of a backward reference block of a neighboring block of a current block is set as a predicted motion vector of the current block, or Set a predictive motion vector from the forward reference block motion vector of the block in the backward reference picture at the same position as the block, perform the motion compensation using the predicted motion vector, and predict the prediction mode if the result of the motion compensation satisfies the optimal skip condition.
- An image encoder configured to set and encode the prediction mode; And decoding a prediction mode by decoding the encoded data.
- a motion vector of a forward reference block in the same direction as the forward reference block motion vector of the block in the backward reference picture at the same position as the current block Predict the current block by using, and if the prediction mode is the backward temporal extended skip mode, the motion vector for the backward reference block in the opposite direction to the forward reference block motion vector of the block in the backward reference picture at the same position as the current block
- a prediction block is generated by predicting a current block, and when the prediction mode is a backward spatial extended skip mode, image decoding for generating a prediction block by predicting the current block using a motion vector of a backward reference block of a neighboring block of the current block.
- An image comprising a flag It offers luxury / decryption device.
- an embodiment of the present invention in the apparatus for encoding an image, the forward reference of the anchor block with reference to the anchor block which is a block in the back reference picture at the same position as the current block If the motion vector for the forward reference block in the same direction as the motion vector for the block is set as the predictive motion vector, the motion compensation is performed using the predicted motion vector, and if the result of the motion compensation satisfies the optimal skip condition, the prediction mode is selected.
- a mode determiner for setting the forward temporal extended skip mode; And an encoder for encoding the prediction mode.
- an embodiment of the present invention in the apparatus for encoding an image, the forward reference of the anchor block with reference to the anchor block which is a block in the back reference picture at the same position as the current block Set the motion vector for the backward reference block in the opposite direction to the motion vector for the block as the predicted motion vector, perform motion compensation using the predicted motion vector, and if the result of the motion compensation satisfies the optimal skip condition, the prediction mode is selected.
- a mode determiner for setting a backward temporal extended skip mode; And an encoder for encoding the prediction mode.
- the prediction motion vector of the current block is determined from the motion vector of the backward reference block of the neighboring block of the current block;
- a mode determiner configured to perform motion compensation using the predicted motion vector and to set a prediction mode to a backward spatial extended skip mode if the result of the motion compensation satisfies an optimal skip condition;
- an encoder for encoding the prediction mode is
- an apparatus for decoding an image comprising: a decoder for decoding the prediction mode by decoding the encoded data; And if the prediction mode is the forward temporal extended skip mode, the prediction block is predicted by using the motion vector for the forward reference block in the same direction as the forward reference block motion vector of the block in the backward reference frame at the same position as the current block. It provides a video decoding apparatus comprising a predictor for generating a.
- an apparatus for decoding an image comprising: a decoder for decoding the prediction mode by decoding the encoded data; And if the prediction mode is a backward temporal extended skip mode, the prediction block is predicted by predicting the current block using a motion vector of a forward reference block motion vector of a block in a backward reference frame at the same position as the current block and a backward reference block in the opposite direction. It provides a video decoding apparatus comprising a predictor for generating a.
- an apparatus for decoding an image comprising: a decoder for decoding the prediction mode by decoding the encoded data; And a predictor configured to predict the current block to generate a predicted block by using a motion vector of a backward reference block of a neighboring block of the current block when the prediction mode is a backward spatial extended skip mode.
- an embodiment of the present invention in the method for encoding / decoding an image, the motion vector for the backward reference block of the neighboring block of the current block to the predicted motion vector of the current block Set a predicted motion vector from a forward reference block motion vector of a block in a backward reference frame at the same position as the current block, perform motion compensation using the predicted motion vector, and determine the optimal skip condition as a result of the motion compensation. Setting a prediction mode and encoding the prediction mode if satisfied; And decoding a prediction mode by decoding the encoded data.
- the prediction mode is a forward temporal extended skip mode
- a motion vector of a forward reference block in the same direction as the forward reference block motion vector of the block in the backward reference frame at the same position as the current block Predict the current block by using, and if the prediction mode is the backward temporal extended skip mode, the motion vector for the backward reference block in the opposite direction to the forward reference block motion vector of the block in the backward reference frame at the same position as the current block
- Video code characterized in that it comprises Provide / decoding method.
- an embodiment of the present invention in the method for encoding an image, forward reference in the same direction as the forward reference block motion vector of the block in the backward reference frame at the same position as the current block Setting a motion vector for a block as a predicted motion vector, performing motion compensation using the predicted motion vector, and setting a prediction mode to a forward temporal extended skip mode if the result of the motion compensation satisfies an optimal skip condition; And it provides a video encoding method comprising the step of encoding the prediction mode.
- the block in the rear reference frame may be a block in the reference frame closest to the current block among all the rear reference frames.
- the rate-distortion of the forward temporal extended skip mode in consideration of the amount of bits and distortions generated when the current block is predicted and encoded for each of the inter prediction mode candidates in the set of all inter prediction predictable modes including the forward temporal extended skip mode. If the cost is small, it may be determined that the optimal skip condition is satisfied.
- an embodiment of the present invention to achieve another object of the present invention, in the method for encoding an image, backward reference in the opposite direction to the forward reference block motion vector of the block in the backward reference frame at the same position as the current block Setting a motion vector for a block as a predicted motion vector, performing motion compensation using the predicted motion vector, and setting a prediction mode to a backward temporal extended skip mode if the result of the motion compensation satisfies an optimal skip condition; And it provides a video encoding method comprising the step of encoding the prediction mode.
- the block in the rear reference frame may be a block in the reference frame closest to the current block among all the rear reference frames.
- the rate-distortion of the backward temporal extended skip mode in consideration of the amount of bits and distortions generated when the current block is predicted and encoded for each inter prediction mode candidate in the set of all inter prediction predictable modes including the backward temporal extended skip mode. If the cost is small, it may be determined that the optimal skip condition is satisfied.
- an embodiment of the present invention in the method for encoding an image, the motion vector for the backward reference block of the neighboring block of the current block as the prediction motion vector of the current block Performing motion compensation using the predicted motion vector and setting a prediction mode to a backward spatial extended skip mode if the result of the motion compensation satisfies an optimal skip condition; And it provides a video encoding method comprising the step of encoding the prediction mode.
- the rate-distortion of the backward spatial extended skip mode in consideration of the amount of bits and distortions generated when the current block is predicted and encoded for each inter prediction mode candidate in the set of all inter prediction predictable modes including the backward spatial extended skip mode. If the cost is small, it may be determined that the optimal skip condition is satisfied.
- the backward motion vector may be set to a median of backward motion vectors of neighboring blocks of the current block.
- a method of decoding an image comprising: decoding a prediction mode by decoding encoded data; And if the prediction mode is the forward temporal extended skip mode, the prediction block is predicted by using the motion vector for the forward reference block in the same direction as the forward reference block motion vector of the block in the backward reference frame at the same position as the current block. It provides a video decoding method comprising the step of generating a.
- a method of decoding an image comprising: decoding a prediction mode by decoding encoded data; And if the prediction mode is a backward temporal extended skip mode, the prediction block is predicted by predicting the current block using a motion vector of a forward reference block motion vector of a block in a backward reference frame at the same position as the current block and a backward reference block in the opposite direction. It provides a video decoding method comprising the step of generating a.
- a method of decoding an image comprising: decoding a prediction mode by decoding encoded data; And if the prediction mode is a backward spatial extended skip mode, predicting the current block using a motion vector of a backward reference block of a neighboring block of the current block to generate a prediction block.
- the block in the rear reference frame may be a block in the reference frame closest to the current block among all the rear reference frames.
- the backward motion vector may be set to a median of backward motion vectors of neighboring blocks of the current block.
- the present invention enables to apply a skip mode in one direction by using the data of the decoded reference image to efficiently remove and compress redundancy between the current block and the reference image data.
- the performance of video data compression is further improved, thereby obtaining a better reconstructed picture quality at the same bit rate.
- FIG. 1 is a block diagram schematically illustrating a video encoding apparatus according to an embodiment of the present invention.
- FIG. 2 is a diagram illustrating the positional relationship between the current block, the front reference frame L0 and the rear reference frame L1 of the current block.
- 3 is a diagram illustrating a predicted motion vector in the forward temporal extended skip mode.
- FIG. 4 is a diagram illustrating a predicted motion vector in a backward temporal extended skip mode.
- FIG. 5 is a diagram illustrating a positional relationship between a current block and a neighboring block.
- FIG. 6 is a block diagram schematically illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
- FIG. 7 is a flowchart illustrating an image encoding method according to a first embodiment of the present invention.
- FIG. 8 is a flowchart illustrating an image encoding method according to a second embodiment of the present invention.
- FIG. 9 is a flowchart illustrating a video encoding method according to a third embodiment of the present invention.
- FIG. 10 is a flowchart illustrating an image decoding method according to an embodiment of the present invention.
- a video encoding apparatus (Video Encoding Apparatus), a video decoding apparatus (Video Decoding Apparatus) to be described below is a personal computer (PC), notebook computer, personal digital assistant (PDA), portable multimedia player (PMP) : User terminal such as Portable Multimedia Player (PSP), PlayStation Portable (PSP: PlayStation Portable), Wireless Communication Terminal, Smart Phone, or a server terminal such as an application server or a service server.
- a communication device such as a communication modem for communicating with a wired / wireless communication network, a memory for storing various programs and data for encoding or decoding an image or inter / intra prediction for encoding or decoding, and executing and controlling a program.
- a variety of cabinets with microprocessors Can mean.
- the image encoded in the bitstream by the video encoding apparatus is real-time or non-real-time through the wired or wireless communication network, such as the Internet, local area wireless communication network, wireless LAN network, WiBro network, mobile communication network, or the like, or a cable, universal serial bus (USB: Universal) It may be transmitted to an image decoding apparatus through various communication interfaces such as a serial bus, and may be decoded by the image decoding apparatus to restore and reproduce the image.
- wired or wireless communication network such as the Internet, local area wireless communication network, wireless LAN network, WiBro network, mobile communication network, or the like, or a cable, universal serial bus (USB: Universal) It may be transmitted to an image decoding apparatus through various communication interfaces such as a serial bus, and may be decoded by the image decoding apparatus to restore and reproduce the image.
- USB universal serial bus
- a video is composed of a series of pictures, and each picture may be divided into a predetermined area such as a frame or a block.
- the divided blocks may be classified into intra blocks and inter blocks according to an encoding method.
- An intra block refers to a block that is encoded by using an intra prediction coding scheme. Intra prediction coding is performed by using pixels of blocks that have been previously encoded, decoded, and reconstructed in a current picture that performs current encoding. A prediction block is generated by predicting pixels of a block, and a difference value with pixels of the current block is encoded.
- An inter block refers to a block that is encoded using inter prediction coding.
- Inter prediction coding generates a prediction block by predicting a current block in a current picture by referring to one or more past pictures or future pictures, and then generates a current block. This is a method of encoding the difference value with.
- a frame referred to for encoding or decoding the current picture is called a reference frame, and a picture including the reference frame is called a reference picture.
- FIG. 1 is a block diagram schematically illustrating a video encoding apparatus according to an embodiment of the present invention.
- the image encoding apparatus 100 includes a mode determiner 110, a predictor 120, a subtractor 130, a transformer 140, a scanner 150, an encoder.
- the encoder 160 may include an encoder 160, an inverse transformer 170, an adder 180, and a filter 190.
- the input image to be encoded may be input in block units, and the block may be a macroblock.
- the shape of the macroblock may be various forms of M ⁇ N, where M and N may be natural numbers having a value of 2 n (where n is an integer of 1 or more).
- different types of blocks may be used for each frame to be encoded.
- information about a block type that is, information about the block type, is encoded for each frame, thereby encoding the encoded data in the image decoding apparatus.
- the shape of a block of a frame to be decoded may be determined.
- the image encoding apparatus 100 may further include a block type determiner (not shown) that determines the block type and encodes information about the block type in the encoded data.
- a block type determiner (not shown) that determines the block type and encodes information about the block type in the encoded data.
- the mode determiner 110 may select one of a set of prediction modes and set the prediction mode.
- the set of prediction modes used in the image encoding apparatus 100 may include at least one of a forward temporal extended skip mode, a backward temporal extended skip mode, and a backward spatial extended skip mode.
- the encoder 160 encodes the prediction mode determined by the mode determiner 110. Data for the encoded prediction mode may be transmitted to the image decoder.
- FIG. 2 is a diagram illustrating the positional relationship between the current block, the front reference frame L0 and the rear reference frame L1 of the current block.
- the mode determiner 110 may determine a block (anchor) in a backward reference picture (or a backward reference frame) at the same position as the current block. Block), set the motion vector for the forward reference block in the same direction as the motion vector (MV) for the forward reference block of the anchor block as the predictive motion vector, and use the set predictive motion vector to compensate for the motion compensation. If the result of motion compensation satisfies the optimal skip condition, the prediction mode is set to the forward temporal extended skip mode.
- the optimal skip condition is a rate-distortion cost of the forward temporal extended skip mode in consideration of the amount of bits and distortions generated when the current block is predicted and encoded for each of the inter prediction mode candidates in the set of all inter prediction modes.
- the prediction mode is set to the forward temporal extended skip mode.
- the mode determiner 110 forwards a motion vector of a forward reference block of the same position block that is a block in a backward reference frame L1 or List 1 having the same position as the position in the current frame of the current block.
- the forward motion vector MV L0 (that is, the motion vector for the forward reference block) having the same direction as the MV is set as the predicted motion vector. If the motion compensation is performed using the predicted motion vector set here and the result of the motion compensation satisfies the optimal skip condition, the prediction mode of the current block is set to the forward temporal extended skip mode.
- the mode determiner 110 selects a block (referred to as an anchor block) in a backward reference frame at the same position as the current block.
- the motion vector MV for the forward reference block of the anchor block and the backward motion vector MV L1 in the opposite direction are set as the prediction motion vectors.
- the prediction mode is set to the reverse temporal extended skip mode.
- Rate-distortion of the backward temporal extended skip mode in consideration of the amount of bits and distortions generated when the current block is predicted and encoded for each inter prediction mode candidate in all the inter prediction predictable mode sets including the backward temporal extended skip mode. If the cost is small, it may be determined that the optimal skip condition is satisfied. In this case, the prediction mode is set as the reverse temporal extended skip mode.
- the forward motion vector MV L0 and the backward motion vector MV L1 of the current block can be obtained from Equation 1.
- TR B is a time interval between the reference picture (L0) and the current encoding target picture (current picture)
- TR D is a time interval between the reference picture (L0) and the back reference picture.
- the block in the rear reference frame may be a block in the reference frame closest to the current picture among all the rear reference frames.
- FIG. 3 is a diagram illustrating a predicted motion vector in the forward temporal extended skip mode
- FIG. 4 is a diagram illustrating a predicted motion vector in the backward temporal extended skip mode.
- FIG. 5 is a diagram illustrating a positional relationship between a current block and a neighboring block.
- the mode determiner 110 may include neighboring blocks of the current block E (eg, A (left block), Determine the predicted motion vector of the current block E from the rear motion vectors of B (upper block) and C (right upper block), perform motion compensation using the determined predicted motion vector, and optimally skip the result of the motion compensation. If the condition is satisfied, the prediction mode is set to the backward spatial extended skip mode.
- the neighboring blocks of the current block E are not limited to A, B, and C, and may be A, B, C, or D (left upper block).
- Rate-distortion of the backward spatial extended skip mode in consideration of the amount of bits and distortions generated when the current block is predicted and encoded for each inter prediction mode candidate in the set of all interpredictable modes including the backward spatial extended skip mode. If the cost is small, it may be determined that the optimal skip condition is satisfied. In this case, the prediction mode is set to the backward spatial extended skip mode.
- the predicted motion vector may be set as a median of backward motion vectors of neighboring blocks A, B, and C of the current block, but the present invention is not limited thereto, and the neighboring block may be determined in various ways.
- the predicted motion vector can be calculated from various backward motion vectors of the neighboring blocks.
- the horizontal component of the prediction motion vector may be calculated from the horizontal component of the backward motion vector of the neighboring blocks A, B, and C, and the vertical component of the prediction motion vector is the backward motion of the neighboring blocks A, B, and C. Can be calculated from the vertical components of the vector.
- the predictor 120 generates a prediction block by predicting the current block. That is, the predictor 120 predicts a pixel value of each pixel of the current block to be encoded in the image to generate a predicted block having a predicted pixel value of each pixel predicted. do.
- the predictor 120 may predict the current block by using intra prediction or inter prediction.
- the prediction mode is one of the forward temporal extended skip mode, the backward temporal extended skip mode, and the backward spatial extended skip mode, the predictor 120 does not generate the predictive block.
- the subtractor 130 subtracts the prediction block from the current block to generate a residual block. That is, the subtractor 130 calculates a difference between the pixel value of each pixel of the current block to be encoded and the predicted pixel value of each pixel of the prediction block predicted by the predictor 120 to obtain a residual signal in the form of a block. Creates a residual block with
- the quantization process may include a transform process.
- the transform is completed when the quantization is completed.
- the transform method includes a spatial signal such as a Hadamard transform and a discrete cosine transform based integer transform (hereinafter, referred to as an integer transform) to the frequency domain. Transformation techniques may be used, and various quantization techniques such as Dead Zone Uniform Threshold Quantization (DZUTQ) or Quantization Weighted Matrix (DZUTQ) are used as quantization schemes. Can be.
- DZUTQ Dead Zone Uniform Threshold Quantization
- DZUTQ Quantization Weighted Matrix
- the scanner 150 scans the coefficients of the color space prediction block generated by the converter 140 to generate a coefficient sequence.
- the scanning method considers characteristics of a transform technique, a quantization technique, and a block (macroblock or subblock), and the scanning order may be determined such that the scanned coefficient sequence has a minimum length.
- the scanner 150 is illustrated and described as being implemented independently of the encoder 160, but the scanner 150 may be omitted, and its function may be integrated into the encoder 160.
- the encoder 160 may include not only the prediction mode but also various pieces of information necessary to decode the encoded bit string in the encoded data.
- the various pieces of information necessary for decoding the encoded bit string may be various pieces of information such as information on a block type.
- the inverse transformer 170 reconstructs the residual block by performing an inverse transform on the transformed residual block generated by the transformer 140. If the quantization is also performed in the transformer 140, the inverse transformer 170 performs inverse transform after inverse quantization, and may be performed by inversely performing a transform process and a quantization process performed by the transformer 140.
- the adder 180 reconstructs the current block by adding the prediction block predicted by the predictor 120 and the residual block generated by the inverse transformer 170.
- the filter 190 filters the current block reconstructed by the adder 180.
- the filter 190 reduces blocking effects occurring at the block boundary or the transform boundary by transformation and quantization of the block unit of the image.
- the subtractor 130, the converter 140, the scanner 150, the inverse transformer 170, and the adder ( 180, filter 190 may not work.
- FIG. 6 is a block diagram schematically illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
- the image decoding apparatus 600 includes a decoder 610, an inverse scanner 620, an inverse transformer 630, an adder 640, a predictor 650, and a filter 660.
- a decoder 610 receives a coded bit stream from a decoded data stream from a decoded data stream.
- the inverse scanner 620 and the filter 660 are not necessarily included, and may be optionally omitted depending on the implementation manner.
- the inverse scanner 620 is omitted, the function is integrated in the decoder 610. Can be implemented.
- the decoder 610 decodes the encoded data to decode the prediction mode.
- the decoder 610 may rescan the encoded data to restore the transformed residual block.
- the decoder 610 may decode and extract encoded data to decode or extract information necessary for decoding as well as a color space prediction block.
- the information necessary for decoding refers to information necessary for decoding the coded bit string in the encoded data. For example, information about a block type, information about an intra prediction mode when the prediction mode is an intra prediction mode, and an inter prediction mode In the case of the prediction mode, the information may be information on a motion vector, information on a transform and quantization type, or the like.
- information about the block type may be transmitted to the inverse transformer 630 and the predictor 650, and information about a transform type (or a transform and quantization type) may be transferred to the inverse transformer 630.
- information necessary for prediction such as information about a prediction mode and information about a motion vector, may be transmitted to the predictor 650.
- the inverse scanner 620 restores the transform coefficient sequence decoded by the decoder 610 and restores the prediction block by reverse scanning the transform coefficient sequence.
- the inverse scanner 620 inversely scans the extracted coefficient sequence by various inverse scanning methods such as inverse zigzag scan to generate a color space prediction block.
- the inverse scanning method may obtain information about the size of the transform from the decoder 610 and generate a residual block by using an inverse scanning method corresponding thereto.
- the inverse transformer 630 inversely transforms the transformed residual block to be recovered to restore the residual block.
- the inverse transformer 630 may inversely transform the residual block transformed according to the transformation type.
- the method of inversely transforming the transformed residual block according to the transform type by the inverse transformer 630 is the same as or similar to inversely performing the process of transforming the transformed block according to the transform type in the converter 140 of the image encoding apparatus 100. Therefore, detailed description of the inverse transform method is omitted.
- the predictor 650 predicts the current block to generate a predicted block.
- the predictor 650 determines the size and shape of the current block according to the block type identified by the information on the block type, and predicts the current block by using an intra prediction mode or a motion vector identified by the information required for prediction.
- a prediction block can be generated.
- the predictor 650 is the same or similar to the predictor 120 of the image encoding apparatus 100.
- the predictor 650 divides the current block into subblocks and combines the prediction subblocks generated by predicting the divided subblocks to predict the blocks. Can be generated.
- the adder 640 reconstructs the current block by adding the residual block reconstructed by the inverse transformer 630 and the predictive block generated by the predictor 650.
- the filter 660 filters the current block reconstructed by the adder 640, and the reconstructed filtered current block is accumulated in units of pictures and stored in a memory (not shown) as a reference picture to be stored in the next block or in the predictor 650. It can be used when predicting the next picture.
- the filter 190 of the image encoding apparatus 100 is the same as or similar to that of the deblocking filtering, and thus the detailed description of the filtering method is omitted.
- the prediction mode is one of the forward temporal extended skip mode, the backward temporal extended skip mode, and the reverse spatial extended skip mode
- the inverse scanner 620, the inverse transformer 630, the adder 640, and the filter 660 are operated. You can't.
- the predictor 650 predicts the current block by using the forward motion vector of the same direction as the forward reference block motion vector of the block in the backward reference frame at the same position as the current block. Create a block. I.e., obtain the current block is forward motion vector (MV L0) in the equation (1), such as to produce a forward motion vector (MV L0) the block indicated (see Fig. 3) to the prediction block. Meanwhile, since the information about the pixels of the residual block is not transmitted from the image encoder 100, the prediction block generated here becomes a reconstruction block.
- the predictor 650 predicts the current block by using the backward motion vector of the block in the opposite direction to the forward reference block motion vector of the block in the backward reference frame at the same position as the current block. Create a block. I.e., obtain the current block is a backward motion vector (MV L1) of as shown in Equation 1, it produces the backward motion vector (MV L1) block is pointing (see Fig. 4) as a prediction block. Meanwhile, since the information about the pixels of the residual block is not transmitted from the image encoder 100, the prediction block generated here becomes a reconstruction block.
- the block in the rear reference frame may be a block in the reference frame closest to the current block among all the rear reference frames.
- the predictor 650 predicts the current block using the backward motion vector of the neighboring block of the current block to generate the predicted block. That is, the median of the backward motion vectors of the neighboring blocks A, B, and C of the current block as shown in FIG. 5 may be set as the predicted motion vector, but the present invention is not limited thereto and the neighboring block may be determined in various ways. In addition, the predicted motion vector may be calculated from various backward methods from the backward motion vector of the neighboring block.
- the horizontal component of the prediction motion vector may be calculated from the horizontal component of the backward motion vector of the neighboring blocks A, B, and C
- the vertical component of the prediction motion vector is the backward motion of the neighboring blocks A, B, and C. Can be calculated from the vertical components of the vector.
- the backward motion vector may be set to a median value of backward motion vectors of neighboring blocks of the current block.
- the image encoding / decoding apparatus may be implemented by combining the image encoding apparatus 100 of FIG. 1 and the image decoding apparatus 600 of FIG. 6.
- the image encoding / decoding apparatus sets the backward motion vector of the neighboring block of the current block as the predicted motion vector of the current block or the forward reference block motion vector of the block in the backward reference frame at the same position as the current block.
- a video encoder (the video encoding apparatus 100) configured to set a predictive motion vector, perform a motion compensation using the predictive motion vector, and set a prediction mode and encode the prediction mode when the result of the motion compensation satisfies an optimal skip condition. And decoding the prediction mode by decoding the encoded data, and if the prediction mode is the forward temporal extended skip mode, the forward direction in the same direction as the forward reference block motion vector of the block in the backward reference frame at the same position as the current block.
- the prediction block is generated by predicting the current block using the backward motion vector of the block in the opposite direction to the forward reference block motion vector of the block in the backward reference frame at the same position as the current block.
- the extended skip mode includes an image decoder (which may be implemented using the image decoding apparatus 600) that predicts the current block using the backward motion vector of the neighboring block of the current block to generate the prediction block.
- FIG. 7 is a flowchart illustrating an image encoding method according to a first embodiment of the present invention.
- a forward motion vector in the same direction as a motion vector of a forward reference block of an anchor block with reference to a block (anchor block) in a rear reference frame at the same position as the current block.
- the predictive motion vector S702
- the method may include encoding a prediction mode in operation S808.
- the block in the rear reference frame may be a block in the reference frame closest to the current block among all the rear reference frames.
- the rate-distortion of the forward temporal extended skip mode in consideration of the amount of bits and distortions generated when the current block is predicted and encoded for each of the inter prediction mode candidates in the set of all inter prediction predictable modes including the forward temporal extended skip mode. If the cost is small, it may be determined that the optimal skip condition is satisfied.
- FIG. 8 is a flowchart illustrating an image encoding method according to a second embodiment of the present invention.
- the image encoding method refers to a block (anchor block) in a rear reference frame at the same position as the current block, and performs a backward motion vector in the opposite direction to the motion vector of the forward reference block of the anchor block.
- the operation may include encoding the prediction mode in operation S808.
- the block in the rear reference frame may be a block in the reference frame closest to the current block among all the rear reference frames.
- the rate-distortion of the backward temporal extended skip mode in consideration of the amount of bits and distortions generated when the current block is predicted and encoded for each inter prediction mode candidate in the set of all inter prediction predictable modes including the backward temporal extended skip mode. If the cost is small, it may be determined that the optimal skip condition is satisfied.
- FIG. 9 is a flowchart illustrating a video encoding method according to a third embodiment of the present invention.
- the method sets the backward motion vector of the neighboring block of the current block as the predicted motion vector of the current block (S902), and performs motion compensation using the predicted motion vector.
- the method may include setting the prediction mode to the backward spatial extended skip mode (S906) and encoding the prediction mode (S908).
- the rate-distortion of the backward spatial extended skip mode in consideration of the amount of bits and distortions generated when the current block is predicted and encoded for each inter prediction mode candidate in the set of all inter prediction predictable modes including the backward spatial extended skip mode. If the cost is small, it may be determined that the optimal skip condition is satisfied.
- the backward motion vector may be set to a median of backward motion vectors of neighboring blocks of the current block.
- FIG. 10 is a flowchart illustrating an image decoding method according to an embodiment of the present invention.
- the video decoding method decodes the encoded data to decode the prediction mode (S1002), determines the prediction mode (S1004), and if the prediction mode is the forward temporal extended skip mode, the same as the current block.
- the block in the rear reference frame may be a block in the reference frame closest to the current block among all the rear reference frames.
- the backward motion vector may be set to a median of backward motion vectors of neighboring blocks of the current block.
- a prediction mode that may be set in an image decoding method according to an embodiment of the present invention does not include a forward temporal extended skip mode. If not, step S1006 may be omitted. If the backward temporal extended skip mode is not included, step S1008 may be omitted. If the backward spatial extended skip mode is not included, step S1010 may be omitted.
- An image encoding / decoding method includes the image encoding method according to the first to third embodiments of the present invention of FIGS. 7 to 9 and the embodiment of the present invention of FIG. 10. This can be achieved by implementing a combination of video decoding methods.
- an image encoding / decoding method includes setting a backward motion vector of a neighboring block of a current block as a predicted motion vector of the current block or a forward reference block of a block in a backward reference frame at the same position as the current block. Setting a predictive motion vector from the motion vector, performing motion compensation using the predictive motion vector, setting a prediction mode and encoding a prediction mode if the result of the motion compensation satisfies an optimal skip condition, and decoding the encoded data.
- the prediction mode is decoded, and if the prediction mode is the forward temporal extended skip mode, the current block is predicted using the forward motion vector in the same direction as the forward reference block motion vector of the block in the backward reference frame at the same position as the current block. Is equal to the current block if backward temporal extended skip mode
- the prediction block is generated by predicting the current block using the backward motion vector of the block in the backward reference frame at the one position in the opposite direction, and when the prediction mode is the backward spatial extended skip mode, the neighboring block of the current block And predicting a current block using a backward motion vector to generate a predicted block.
- the context of the motion vector is generated based on the motion vector correlation of the neighboring block, and the candidate motion vector is generated as the context of the neighboring block.
- the present invention enables to apply a skip mode in one direction by using the data of the decoded reference image to efficiently remove and compress redundancy between the current block and the reference image data.
- it is suitable for obtaining the effect of further improving the performance of video data compression to obtain better reconstructed picture quality at the same bit rate, and thus it is industrially applicable.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims (32)
- 영상을 부호화/복호화하는 장치에 있어서,현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터를 상기 현재 블록의 예측 움직임 벡터로 설정하거나 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록의 전방 참조블록 움직임 벡터로부터 예측 움직임 벡터를 설정하고, 상기 예측 움직임 벡터를 이용하여 움직임 보상을 수행하고 상기 움직임 보상의 결과가 최적 스킵조건을 만족하면 예측모드를 설정하고 상기 예측모드를 부호화하는 영상 부호화기; 및부호화 데이터를 복호화하여 예측모드를 복호하고, 상기 예측모드가 순방향 시간적 확장스킵 모드이면 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록의 전방 참조블록 움직임 벡터와 동일한 방향의 전방 참조블록에 대한 움직임 벡터를 이용하여 현재 블록을 예측하고, 상기 예측모드가 역방향 시간적 확장스킵 모드이면 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록의 전방 참조블록 움직임 벡터와 정반대 방향의 후방 참조블록에 대한 움직임 벡터를 이용하여 현재 블록을 예측하여 예측 블록을 생성하고, 상기 예측모드가 역방향 공간적 확장스킵 모드이면 현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터를 이용하여 상기 현재 블록을 예측하여 예측 블록을 생성하는 영상 복호화기를 포함하는 것을 특징으로 하는 영상 부호화/복호화 장치.
- 영상을 부호화하는 장치에 있어서,현재 블록과 동일한 위치의 후방 참조픽처 내의 블록인 앵커블록을 참조하여 상기 앵커블록의 전방 참조블록에 대한 움직임 벡터와 동일한 방향의 전방 참조블록에 대한 움직임 벡터를 예측 움직임 벡터로 설정하고 상기 예측 움직임 벡터를 이용하여 움직임 보상을 수행하고 상기 움직임 보상의 결과가 최적 스킵조건을 만족하면 예측모드를 순방향 시간적 확장스킵 모드로 설정하는 모드결정기; 및상기 예측모드를 부호화하는 부호화기를 포함하는 것을 특징으로 하는 영상 부호화 장치.
- 제 2항에 있어서,상기 후방 참조픽처 내의 블록은,모든 후방 참조픽처 중에서 상기 현재 블록과 가장 가까운 참조픽처 내의 블록인 것을 특징으로 하는 영상 부호화 장치.
- 제 2항에 있어서,상기 최적 스킵조건은 상기 순방향 시간적 확장스킵 모드를 포함한 모든 인터예측 가능 모드 집합 내의 인터 예측 모드 후보 각각에 대해 현재 블록을 예측하고 부호화하는 경우에 발생하는 비트량과 왜곡치를 고려하여 상기 순방향 시간적 확장스킵 모드의 율-왜곡 비용이 작은 경우인 것을 특징으로 하는 영상 부호화 장치.
- 영상을 부호화하는 장치에 있어서,현재 블록과 동일한 위치의 후방 참조픽처 내의 블록인 앵커블록을 참조하여 상기 앵커블록의 전방 참조블록에 대한 움직임 벡터와 정반대 방향의 후방 참조블록에 대한 움직임 벡터를 예측 움직임 벡터로 설정하고 상기 예측 움직임 벡터를 이용하여 움직임 보상을 수행하고 상기 움직임 보상의 결과가 최적 스킵조건을 만족하면 예측모드를 역방향 시간적 확장스킵 모드로 설정하는 모드결정기; 및상기 예측모드를 부호화하는 부호화기를 포함하는 것을 특징으로 하는 영상 부호화 장치.
- 제 5항에 있어서,상기 후방 참조픽처 내의 블록은,모든 후방 참조픽처 중에서 상기 현재 블록과 가장 가까운 참조픽처 내의 블록인 것을 특징으로 하는 영상 부호화 장치.
- 제 5항에 있어서,상기 최적 스킵조건은 상기 역방향 시간적 확장스킵 모드를 포함한 모든 인터예측 가능 모드 집합 내의 인터 예측 모드 후보 각각에 대해 현재 블록을 예측하고 부호화하는 경우에 발생하는 비트량과 왜곡치를 고려하여 상기 역방향 시간적 확장스킵 모드의 율-왜곡 비용이 작은 경우인 것을 특징으로 하는 영상 부호화 장치.
- 영상을 부호화하는 장치에 있어서,현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터로부터 상기 현재 블록의 예측 움직임 벡터를 결정하고 상기 예측 움직임 벡터를 이용하여 움직임 보상을 수행하고 상기 움직임 보상의 결과가 최적 스킵조건을 만족하면 예측모드를 역방향 공간적 확장스킵 모드로 설정하는 모드결정기; 및상기 예측모드를 부호화하는 부호화기를 포함하는 것을 특징으로 하는 영상 부호화 장치.
- 제 8항에 있어서,상기 최적 스킵조건은 상기 역방향 공간적 확장스킵 모드를 포함한 모든 인터예측 가능 모드 집합 내의 인터 예측 모드 후보 각각에 대해 현재 블록을 예측하고 부호화하는 경우에 발생하는 비트량과 왜곡치를 고려하여 상기 역방향 공간적 확장스킵 모드의 율-왜곡 비용이 작은 경우인 것을 특징으로 하는 영상 부호화 장치.
- 제 8항에 있어서,상기 후방 참조블록에 대한 움직임 벡터는,상기 현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터들의 중앙값으로 설정된 것을 특징으로 하는 영상 부호화 장치.
- 영상을 복호화하는 장치에 있어서,부호화 데이터를 복호화하여 예측모드를 복호하는 복호화기; 및상기 예측모드가 순방향 시간적 확장스킵 모드이면, 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록인 앵커블록을 참조하여 상기 앵커블록의 전방 참조블록에 대한 움직임 벡터와 동일한 방향의 전방 참조블록에 대한 움직임 벡터를 이용하여 현재 블록을 예측하여 예측 블록을 생성하는 예측기를 포함하는 것을 특징으로 하는 영상 복호화 장치.
- 제 11항에 있어서,상기 후방 참조픽처 내의 블록은,모든 후방 참조픽처 중에서 상기 현재 블록과 가장 가까운 참조픽처 내의 블록인 것을 특징으로 하는 영상 복호화 장치.
- 영상을 복호화하는 장치에 있어서,부호화 데이터를 복호화하여 예측모드를 복호하는 복호화기; 및상기 예측모드가 역방향 시간적 확장스킵 모드이면, 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록의 전방 참조블록 움직임 벡터와 정반대 방향의 후방 참조블록에 대한 움직임 벡터를 이용하여 현재 블록을 예측하여 예측 블록을 생성하는 예측기를 포함하는 것을 특징으로 하는 영상 복호화 장치.
- 제 13항에 있어서,상기 후방 참조픽처 내의 블록은,모든 후방 참조픽처 중에서 상기 현재 블록과 가장 가까운 참조픽처 내의 블록인 것을 특징으로 하는 영상 복호화 장치.
- 영상을 복호화하는 장치에 있어서,부호화 데이터를 복호화하여 예측모드를 복호하는 복호화기; 및상기 예측모드가 역방향 공간적 확장스킵 모드이면, 현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터를 이용하여 상기 현재 블록을 예측하여 예측 블록을 생성하는 예측기를 포함하는 것을 특징으로 하는 영상 복호화 장치.
- 제 15항에 있어서,상기 후방 참조블록에 대한 움직임 벡터는,상기 현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터들의 중앙값으로 설정된 것을 특징으로 하는 영상 복호화 장치.
- 영상을 부호화/복호화하는 방법에 있어서,현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터를 상기 현재 블록의 예측 움직임 벡터로 설정하거나 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록의 전방 참조블록 움직임 벡터로부터 예측 움직임 벡터를 설정하고, 상기 예측 움직임 벡터를 이용하여 움직임 보상을 수행하고 상기 움직임 보상의 결과가 최적 스킵조건을 만족하면 예측모드를 설정하고 상기 예측모드를 부호화하는 단계; 및부호화 데이터를 복호화하여 예측모드를 복호하고, 상기 예측모드가 순방향 시간적 확장스킵 모드이면 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록의 전방 참조블록 움직임 벡터와 동일한 방향의 전방 참조블록에 대한 움직임 벡터를 이용하여 현재 블록을 예측하고, 상기 예측모드가 역방향 시간적 확장스킵 모드이면 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록의 전방 참조블록 움직임 벡터와 정반대 방향의 후방 참조블록에 대한 움직임 벡터를 이용하여 현재 블록을 예측하여 예측 블록을 생성하고, 상기 예측모드가 역방향 공간적 확장스킵 모드이면 현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터를 이용하여 상기 현재 블록을 예측하여 예측 블록을 생성하는 단계를 포함하는 것을 특징으로 하는 영상 부호화/복호화 방법.
- 영상을 부호화하는 방법에 있어서,현재 블록과 동일한 위치의 후방 참조픽처 내의 블록인 앵커블록을 참조하여 상기 앵커블록의 전방 참조블록에 대한 움직임 벡터와 동일한 방향의 전방 참조블록에 대한 움직임 벡터를 예측 움직임 벡터로 설정하고 상기 예측 움직임 벡터를 이용하여 움직임 보상을 수행하고 상기 움직임 보상의 결과가 최적 스킵조건을 만족하면 예측모드를 순방향 시간적 확장스킵 모드로 설정하는 단계; 및상기 예측모드를 부호화하는 단계를 포함하는 것을 특징으로 하는 영상 부호화 방법.
- 제 18항에 있어서,상기 후방 참조픽처 내의 블록은,모든 후방 참조픽처 중에서 상기 현재 블록과 가장 가까운 참조픽처 내의 블록인 것을 특징으로 하는 영상 부호화 방법.
- 제 18항에 있어서,상기 최적 스킵조건은 상기 순방향 시간적 확장스킵 모드를 포함한 모든 인터예측 가능 모드 집합 내의 인터 예측 모드 후보 각각에 대해 현재 블록을 예측하고 부호화하는 경우에 발생하는 비트량과 왜곡치를 고려하여 상기 순방향 시간적 확장스킵 모드의 율-왜곡 비용이 작은 경우인 것을 특징으로 하는 영상 부호화 방법.
- 영상을 부호화하는 방법에 있어서,현재 블록과 동일한 위치의 후방 참조픽처 내의 블록인 앵커블록을 참조하여 상기 앵커블록의 전방 참조블록에 대한 움직임 벡터와 정반대 방향의 후방 참조블록에 대한 움직임 벡터를 예측 움직임 벡터로 설정하고 상기 예측 움직임 벡터를 이용하여 움직임 보상을 수행하고 상기 움직임 보상의 결과가 최적 스킵조건을 만족하면 예측모드를 역방향 시간적 확장스킵 모드로 설정하는 단계; 및상기 예측모드를 부호화하는 단계를 포함하는 것을 특징으로 하는 영상 부호화 방법.
- 제 21항에 있어서,상기 후방 참조픽처 내의 블록은,모든 후방 참조픽처 중에서 상기 현재 블록과 가장 가까운 참조픽처 내의 블록인 것을 특징으로 하는 영상 부호화 방법.
- 제 21항에 있어서,상기 최적 스킵조건은 상기 역방향 시간적 확장스킵 모드를 포함한 모든 인터예측 가능 모드 집합 내의 인터 예측 모드 후보 각각에 대해 현재 블록을 예측하고 부호화하는 경우에 발생하는 비트량과 왜곡치를 고려하여 상기 역방향 시간적 확장스킵 모드의 율-왜곡 비용이 작은 경우인 것을 특징으로 하는 영상 부호화 방법.
- 영상을 부호화하는 방법에 있어서,현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터를 상기 현재 블록의 예측 움직임 벡터로 설정하고 상기 예측 움직임 벡터를 이용하여 움직임 보상을 수행하고 상기 움직임 보상의 결과가 최적 스킵조건을 만족하면 예측모드를 역방향 공간적 확장스킵 모드로 설정하는 단계; 및상기 예측모드를 부호화하는 단계를 포함하는 것을 특징으로 하는 영상 부호화 방법.
- 제 24항에 있어서,상기 최적 스킵조건은 상기 역방향 공간적 확장스킵 모드를 포함한 모든 인터예측 가능 모드 집합 내의 인터 예측 모드 후보 각각에 대해 현재 블록을 예측하고 부호화하는 경우에 발생하는 비트량과 왜곡치를 고려하여 상기 역방향 공간적 확장스킵 모드의 율-왜곡 비용이 작은 경우인 것을 특징으로 하는 영상 부호화 방법.
- 제 24항에 있어서,상기 후방 참조블록에 대한 움직임 벡터는,상기 현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터들의 중앙값으로 설정된 것을 특징으로 하는 영상 부호화 방법.
- 영상을 복호화하는 방법에 있어서,부호화 데이터를 복호화하여 예측모드를 복호하는 단계; 및상기 예측모드가 순방향 시간적 확장스킵 모드이면, 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록인 앵커블록을 참조하여 상기 앵커블록의 전방 참조블록에 대한 움직임 벡터와 동일한 방향의 전방 참조블록에 대한 움직임 벡터를 이용하여 현재 블록을 예측하여 예측 블록을 생성하는 단계를 포함하는 것을 특징으로 하는 영상 복호화 방법.
- 제 37항에 있어서,상기 후방 참조픽처 내의 블록은,모든 후방 참조픽처 중에서 상기 현재 블록과 가장 가까운 참조픽처 내의 블록인 것을 특징으로 하는 영상 복호화 방법.
- 영상을 복호화하는 방법에 있어서,부호화 데이터를 복호화하여 예측모드를 복호하는 단계; 및상기 예측모드가 역방향 시간적 확장스킵 모드이면, 현재 블록과 동일한 위치의 후방 참조픽처 내의 블록인 앵커블록을 참조하여 상기 앵커블록의 전방 참조블록에 대한 움직임 벡터와 정반대 방향의 후방 참조블록에 대한 움직임 벡터를 이용하여 현재 블록을 예측하여 예측 블록을 생성하는 단계를 포함하는 것을 특징으로 하는 영상 복호화 방법.
- 제 39항에 있어서,상기 후방 참조픽처 내의 블록은,모든 후방 참조픽처 중에서 상기 현재 블록과 가장 가까운 참조픽처 내의 블록인 것을 특징으로 하는 영상 복호화 방법.
- 영상을 복호화하는 방법에 있어서,부호화 데이터를 복호화하여 예측모드를 복호하는 단계; 및상기 예측모드가 역방향 공간적 확장스킵 모드이면, 현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터를 이용하여 상기 현재 블록을 예측하여 예측 블록을 생성하는 단계를 포함하는 것을 특징으로 하는 영상 복호화 방법.
- 제 31항에 있어서,상기 후방 참조블록에 대한 움직임 벡터는,상기 현재 블록의 주변블록의 후방 참조블록에 대한 움직임 벡터들의 중앙값으로 설정된 것을 특징으로 하는 영상 복호화 방법.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011800457098A CN103141093A (zh) | 2010-07-22 | 2011-06-21 | 使用扩展跳过模式编码/解码图像的方法和设备 |
US13/811,602 US20130202039A1 (en) | 2010-07-22 | 2011-06-21 | Method and device for encoding/decoding image using extended skip mode |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020100070755A KR20120009861A (ko) | 2010-07-22 | 2010-07-22 | 확장된 스킵모드를 이용한 영상 부호화/복호화 방법 및 장치 |
KR10-2010-0070755 | 2010-07-22 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2012011672A2 true WO2012011672A2 (ko) | 2012-01-26 |
WO2012011672A3 WO2012011672A3 (ko) | 2012-04-12 |
Family
ID=45497251
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2011/004521 WO2012011672A2 (ko) | 2010-07-22 | 2011-06-21 | 확장된 스킵모드를 이용한 영상 부호화/복호화 방법 및 장치 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20130202039A1 (ko) |
KR (1) | KR20120009861A (ko) |
CN (1) | CN103141093A (ko) |
WO (1) | WO2012011672A2 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110786012A (zh) * | 2018-04-02 | 2020-02-11 | 深圳市大疆创新科技有限公司 | 用于图像运动补偿的方法和装置 |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
MX2014000159A (es) * | 2011-07-02 | 2014-02-19 | Samsung Electronics Co Ltd | Metodo y aparato para la codificacion de video, y metodo y aparato para la decodificacion de video acompañada por inter prediccion utilizando imagen co-localizada. |
FR2986395A1 (fr) * | 2012-01-30 | 2013-08-02 | France Telecom | Codage et decodage par heritage progressif |
CN103338372A (zh) * | 2013-06-15 | 2013-10-02 | 浙江大学 | 一种视频处理方法及装置 |
US10244167B2 (en) | 2016-06-17 | 2019-03-26 | Gopro, Inc. | Apparatus and methods for image encoding using spatially weighted encoding quality parameters |
CN106375769B (zh) * | 2016-08-31 | 2019-04-30 | 西安万像电子科技有限公司 | 图像特征搜索方法和装置、存储介质及处理器 |
EP3713236A4 (en) * | 2017-12-14 | 2021-04-21 | LG Electronics Inc. | METHOD AND DEVICE FOR DECODING IMAGE ACCORDING TO INTER-PREDICTION IN AN IMAGE CODING SYSTEM |
AU2019292471A1 (en) * | 2018-06-30 | 2021-01-28 | Guangdong Oppo Mobile Telecommunications Corp., Ltd. | Merge mode-based inter-prediction method and apparatus |
WO2020096898A1 (en) * | 2018-11-05 | 2020-05-14 | Interdigital Vc Holdings, Inc. | Video encoding or decoding using block extension for overlapped block motion compensation |
CN110072112B (zh) * | 2019-03-12 | 2023-05-12 | 浙江大华技术股份有限公司 | 帧内预测方法、编码器及存储装置 |
CN114339231B (zh) * | 2021-12-27 | 2023-10-27 | 杭州当虹科技股份有限公司 | 利用运动矢量快速跳Cu级模式选择的方法 |
CN116168790B (zh) * | 2023-04-25 | 2023-07-14 | 深圳爱递医药科技有限公司 | 一种临床试验的大数据招募系统 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100905059B1 (ko) * | 2007-08-16 | 2009-06-30 | 한국전자통신연구원 | 동영상 부호화에 있어서 비트 발생 가능성 예측을 이용한블록 모드 결정 방법 및 장치 |
KR20100014553A (ko) * | 2007-04-25 | 2010-02-10 | 엘지전자 주식회사 | 비디오 신호의 인코딩/디코딩 방법 및 장치 |
KR100951164B1 (ko) * | 2008-01-28 | 2010-04-06 | 울산대학교 산학협력단 | 윤곽선을 이용한 h.264/avc의 효율적인 고속 모드결정 방법 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69031107T2 (de) * | 1989-10-14 | 1997-11-13 | Sony Corp | Anordnung und Verfahren zum Übertragen eines Videosignals |
GB9928022D0 (en) * | 1999-11-26 | 2000-01-26 | British Telecomm | Video coding and decording |
US7003035B2 (en) * | 2002-01-25 | 2006-02-21 | Microsoft Corporation | Video coding methods and apparatuses |
JP2004088722A (ja) * | 2002-03-04 | 2004-03-18 | Matsushita Electric Ind Co Ltd | 動画像符号化方法および動画像復号化方法 |
EP3324625B1 (en) * | 2002-04-19 | 2019-06-12 | Panasonic Intellectual Property Corporation of America | Motion vector calculating method |
KR100865034B1 (ko) * | 2002-07-18 | 2008-10-23 | 엘지전자 주식회사 | 모션 벡터 예측 방법 |
US8854486B2 (en) * | 2004-12-17 | 2014-10-07 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for processing multiview videos for view synthesis using skip and direct modes |
KR101003105B1 (ko) * | 2008-01-29 | 2010-12-21 | 한국전자통신연구원 | 어파인 변환 기반의 움직임 보상을 이용한 비디오 부호화 및 복호화 방법 및 장치 |
KR101505195B1 (ko) * | 2008-02-20 | 2015-03-24 | 삼성전자주식회사 | 직접 모드 부호화 및 복호화 방법 |
KR101619972B1 (ko) * | 2008-10-02 | 2016-05-11 | 한국전자통신연구원 | 이산 여현 변환/이산 정현 변환을 선택적으로 이용하는 부호화/복호화 장치 및 방법 |
WO2011075096A1 (en) * | 2009-12-15 | 2011-06-23 | Thomson Licensing | Method and apparatus for bi-directional prediction within p-slices |
-
2010
- 2010-07-22 KR KR1020100070755A patent/KR20120009861A/ko not_active Application Discontinuation
-
2011
- 2011-06-21 US US13/811,602 patent/US20130202039A1/en not_active Abandoned
- 2011-06-21 WO PCT/KR2011/004521 patent/WO2012011672A2/ko active Application Filing
- 2011-06-21 CN CN2011800457098A patent/CN103141093A/zh active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20100014553A (ko) * | 2007-04-25 | 2010-02-10 | 엘지전자 주식회사 | 비디오 신호의 인코딩/디코딩 방법 및 장치 |
KR100905059B1 (ko) * | 2007-08-16 | 2009-06-30 | 한국전자통신연구원 | 동영상 부호화에 있어서 비트 발생 가능성 예측을 이용한블록 모드 결정 방법 및 장치 |
KR100951164B1 (ko) * | 2008-01-28 | 2010-04-06 | 울산대학교 산학협력단 | 윤곽선을 이용한 h.264/avc의 효율적인 고속 모드결정 방법 |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110786012A (zh) * | 2018-04-02 | 2020-02-11 | 深圳市大疆创新科技有限公司 | 用于图像运动补偿的方法和装置 |
US11871032B2 (en) | 2018-04-02 | 2024-01-09 | SZ DJI Technology Co., Ltd. | Method and device for image motion compensation |
CN110786012B (zh) * | 2018-04-02 | 2024-01-30 | 深圳市大疆创新科技有限公司 | 用于图像运动补偿的方法和装置 |
US11949912B2 (en) | 2018-04-02 | 2024-04-02 | SZ DJI Technology Co., Ltd. | Method and device for video image processing |
US11949911B2 (en) | 2018-04-02 | 2024-04-02 | SZ DJI Technology Co., Ltd. | Method and device for obtaining motion vector of video image |
US11997312B2 (en) | 2018-04-02 | 2024-05-28 | SZ DJI Technology Co., Ltd. | Method and device for video image processing |
Also Published As
Publication number | Publication date |
---|---|
US20130202039A1 (en) | 2013-08-08 |
CN103141093A (zh) | 2013-06-05 |
WO2012011672A3 (ko) | 2012-04-12 |
KR20120009861A (ko) | 2012-02-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2012011672A2 (ko) | 확장된 스킵모드를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2010050706A2 (ko) | 움직임 벡터 부호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2013002549A2 (ko) | 영상 부호화/복호화 방법 및 장치 | |
WO2011145819A2 (ko) | 영상 부호화/복호화 장치 및 방법 | |
WO2013070006A1 (ko) | 스킵모드를 이용한 동영상 부호화 및 복호화 방법 및 장치 | |
WO2010027182A2 (ko) | 서브블록 내 임의 화소를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2013109039A1 (ko) | 가중치예측을 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2012099440A2 (ko) | 예측 움직임벡터 색인부호화에 기반한 움직임정보 생성/복원 장치 및 방법, 및 그것을 이용한 영상 부호화/복호화 장치 및 방법 | |
WO2013005941A2 (ko) | 영상 부호화 및 복호화 방법과 장치 | |
WO2011031030A2 (ko) | 움직임 벡터 부호화/복호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2011068331A2 (ko) | 비디오 인코딩 장치 및 그 인코딩 방법, 비디오 디코딩 장치 및 그 디코딩 방법, 및 거기에 이용되는 방향적 인트라 예측방법 | |
WO2011062392A2 (ko) | 후보 예측 움직임 벡터 집합 선택을 이용한 움직임 벡터 부호화/복호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2012077960A2 (ko) | 임의의 형태의 블록을 이용한 인터예측에 의한 영상의 부호화/복호화 방법 및 장치 | |
WO2013002550A2 (ko) | 고속 코딩 단위(Coding Unit) 모드 결정을 통한 부호화/복호화 방법 및 장치 | |
WO2011142603A2 (ko) | 영상의 필터링 방법 및 장치와 그를 이용한 부호화/복호화를 위한 방법 및 장치 | |
WO2012096550A2 (ko) | 양방향 인트라 예측을 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2011004986A2 (ko) | 영상 부호화/복호화 방법 및 장치 | |
WO2010038961A2 (ko) | 복수 개의 움직임 벡터 추정을 이용한 움직임 벡터 부호화/복호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2011037337A2 (ko) | 저주파수 성분을 고려한 영상 부호화/복호화 방법 및 장치 | |
WO2009157665A2 (ko) | 블록 변환을 이용한 인트라 예측 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2012046979A2 (ko) | 주파수 마스크 테이블을 이용한 주파수변환 블록 부호화 방법 및 장치와 그를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2013069996A1 (ko) | 변환을 이용한 주파수 도메인 상의 적응적 루프 필터를 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2012015275A2 (ko) | 블록 분할예측을 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2012033344A2 (ko) | 효과적인 화면내 예측모드 집합 선택을 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2011108879A2 (ko) | 영상 부호화 장치, 그 영상 부호화 방법, 영상 복호화 장치 및 그 영상 복호화 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201180045709.8 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11809799 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13811602 Country of ref document: US |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS (EPO F1205A DATED 17-05-2013) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 11809799 Country of ref document: EP Kind code of ref document: A2 |