US9271000B2 - Image-encoding apparatus and method, transform-encoding apparatus and method, apparatus and method for generating a transform base, and image-decoding apparatus and method used in same - Google Patents

Publication number
US9271000B2
Authority
US
United States
Prior art keywords
intra prediction
prediction mode
matrix
current block
transform
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US13/514,552
Other versions
US20120307895A1 (en
Inventor
Sunyeon KIM
Jeongyeon Lim
Gyumin Lee
Jaehoon Choi
Yonggoo Kim
Yoonsik Choe
Yungho Choi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SK Telecom Co Ltd
Original Assignee
SK Telecom Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SK Telecom Co Ltd
Assigned to SK TELECOM CO., LTD. Assignment of assignors' interest (see document for details). Assignors: KIM, YONGGOO; KIM, SUNYEON; CHOI, JAEHOON; LEE, GYUMIN; LIM, JEONGYEON; CHOI, YUNGHO; CHOE, YOONSIK
Publication of US20120307895A1
Application granted
Publication of US9271000B2
Legal status: Active
Adjusted expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51 Motion estimation or motion compensation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103 Selection of coding mode or of prediction mode
    • H04N19/11 Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/12 Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
    • H04N19/122 Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding

Definitions

  • The present disclosure relates to an image-encoding apparatus and method, a transform-encoding apparatus and method, an apparatus and method for generating a transform base, and an image-decoding apparatus and method used in the same. More particularly, the present disclosure relates to a video encoding apparatus and method that can significantly improve intra prediction encoding performance without adding any additional information, by adaptively generating a transform base according to changes in image characteristics as well as the intra prediction mode for a specific encoding unit and transform-encoding the intra prediction error, and to a transform encoding apparatus and method, a transform base generating apparatus and method, and a video decoding apparatus and method used in the same.
  • Multimedia data requires a storage medium having a large capacity due to its large amount or size, and requires a wide bandwidth for a transmission. Therefore, it is necessary to use a compression coding technique to transmit the multimedia data including text, video, and audio data.
  • The basic principle of data compression is to remove redundancy from the data.
  • Data can be compressed by removing spatial redundancy, such as the repetition of the same color or object in an image; temporal redundancy, such as the repetition of the same note in audio or little change between adjacent frames in a moving picture; or psychovisual redundancy, which exploits the fact that human visual perception is insensitive to high frequencies.
  • H.264/AVC has recently drawn increasing interest for its improved compression efficiency over MPEG-4 (Moving Picture Experts Group-4).
  • H.264, a digital video codec standard with a very high data compression rate, is also referred to as MPEG-4 Part 10 or AVC (Advanced Video Coding).
  • This standard is the result of joint standardization by the Joint Video Team, which was formed by the VCEG (Video Coding Experts Group) of the ITU-T (International Telecommunication Union Telecommunication Standardization Sector) and the MPEG of ISO/IEC (International Organization for Standardization/International Electrotechnical Commission).
  • Various methods are proposed to improve the compression efficiency in a compression encoding, and include methods using a temporal prediction and a spatial prediction as representative methods.
  • the temporal prediction corresponds to a scheme of performing a prediction with reference to a reference block 122 of another frame 120 temporally adjacent in predicting a current block 112 of a current frame 110 , as shown in FIG. 1 . That is, in inter-predicting the current block 112 of the current frame 110 , the temporally adjacent reference frame 120 is searched for, and the reference block 122 , which is the most similar to the current block within the reference frame 120 , is searched for.
  • the reference block 122 is a block, which can predict the current block 112 best, and a block, which has the smallest SAD (Sum of Absolute Difference) from the current block 112 , can be the reference block 122 .
  • the reference block 122 becomes a predicted block of the current block 112 , and a residual block is generated by subtracting the reference block 122 from the current block 112 .
  • the generated residual block is encoded and inserted in a bitstream.
  • a relative difference between a position of the current block in the current frame 110 and a position of the reference block 122 in the reference frame 120 corresponds to a motion vector 130 , and the motion vector 130 is encoded like the residual block.
  • the temporal prediction is also referred to as an inter prediction or an inter frame prediction.
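  • As a rough illustration of the block matching described above, the following Python sketch searches a reference frame exhaustively for the block with the smallest SAD and returns the motion vector together with the residual block. The function names (`sad`, `get_block`, `motion_search`), the full-search strategy, and the `search_range` parameter are assumptions made for this example; the text does not specify how the search is carried out.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(abs(a - b)
               for row_a, row_b in zip(block_a, block_b)
               for a, b in zip(row_a, row_b))


def get_block(frame, top, left, size):
    """Extract a size x size block whose upper-left corner is (top, left)."""
    return [row[left:left + size] for row in frame[top:top + size]]


def motion_search(cur_frame, ref_frame, top, left, size, search_range):
    """Full search (an assumed strategy) for the reference block with the
    smallest SAD. Returns ((dy, dx), residual_block)."""
    cur = get_block(cur_frame, top, left, size)
    height, width = len(ref_frame), len(ref_frame[0])
    best = None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            y, x = top + dy, left + dx
            # Skip candidates that fall outside the reference frame.
            if y < 0 or x < 0 or y + size > height or x + size > width:
                continue
            cost = sad(cur, get_block(ref_frame, y, x, size))
            if best is None or cost < best[0]:
                best = (cost, (dy, dx))
    _, (dy, dx) = best
    ref = get_block(ref_frame, top + dy, left + dx, size)
    # Residual block = current block minus reference (predicted) block.
    residual = [[c - r for c, r in zip(cur_row, ref_row)]
                for cur_row, ref_row in zip(cur, ref)]
    return (dy, dx), residual
```

  • Frames are plain lists of pixel rows here. The returned offset plays the role of the motion vector 130, and the residual is what would be encoded into the bitstream.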
  • the spatial prediction corresponds to a prediction that obtains a predicted pixel value of a target block by using reconstructed pixel values of a reference block adjacent to the target block within one frame, and is also referred to as a directional intra prediction (hereinafter, simply referred to as an “intra prediction”) or an intra frame prediction.
  • H.264 defines an encoding/decoding using the intra prediction.
  • the intra prediction corresponds to a scheme of predicting the values of a current subblock by copying, along a determined direction, adjacent pixels located above and to the left of the subblock, and encoding only the difference.
  • a predicted block for a current block is generated based on another block having a prior coding order.
  • what is coded is the value generated by subtracting the predicted block from the current block.
  • a video encoder based on the H.264 standard selects a prediction mode having the smallest difference between the current block and the predicted block for each block from prediction modes.
  • the intra prediction based on the H.264 standard defines nine prediction modes shown in FIG. 2 in consideration of the prediction directivity and positions of adjacent pixels used for generating predicted pixel values of a 4×4 luma block and an 8×8 luma block.
  • the nine prediction modes are divided into a vertical prediction mode (prediction mode 0 ), a horizontal prediction mode (prediction mode 1 ), a DC prediction mode (prediction mode 2 ), a diagonal_down_left prediction mode (prediction mode 3 ), a diagonal_down_right prediction mode (prediction mode 4 ), a vertical_right prediction mode (prediction mode 5 ), a horizontal_down prediction mode (prediction mode 6 ), a vertical_left prediction mode (prediction mode 7 ), and a horizontal_up prediction mode (prediction mode 8 ) according to their prediction directions.
  • the DC prediction mode uses an average value of eight adjacent pixels.
  • four intra prediction modes are used for an intra prediction processing for a 16×16 luma block, wherein the four intra prediction modes are the vertical prediction mode (prediction mode 0), the horizontal prediction mode (prediction mode 1), the DC prediction mode (prediction mode 2), and the plane prediction mode (prediction mode 3).
  • the same four intra prediction modes are used for an intra prediction processing for an 8×8 chroma block.
  • FIG. 3 illustrates a labeling example for the nine intra prediction modes shown in FIG. 2 .
  • In FIG. 3, the predicted block (the area including pixels a to p) is calculated from the previously decoded samples A to M.
  • FIG. 4 is a diagram for illustrating the nine prediction modes shown in FIG. 2 by using FIG. 3 .
  • a predicted block in a case of the prediction mode 0 predicts pixel values in the same vertical line as the same pixel value. That is, in pixels of the predicted block, pixel values are predicted from pixels, which are most adjacent to a reference block located in an upper side of the predicted block. Reconstructed pixel values of an adjacent pixel A are set to predicted pixel values of a pixel a, a pixel e, a pixel i, and a pixel m in a first column of the predicted block.
  • pixel values of a pixel b, a pixel f, a pixel j, and a pixel n in a second column are predicted from reconstructed pixel values of an adjacent pixel B
  • pixel values of a pixel c, a pixel g, a pixel k, and a pixel o in a third column are predicted from reconstructed pixel values of an adjacent pixel C
  • pixel values of a pixel d, a pixel h, a pixel l, and a pixel p in a fourth column are predicted from reconstructed pixel values of an adjacent pixel D.
  • a predicted block in a case of the prediction mode 1 predicts pixel values in the same horizontal line as the same pixel value. That is, in pixels of the predicted block, pixel values are predicted from pixels, which are most adjacent to a reference block located in a left side of the predicted block. Reconstructed pixel values of an adjacent pixel I are set to predicted pixel values of a pixel a, a pixel b, a pixel c, and a pixel d in a first row of the predicted block.
  • pixel values of a pixel e, a pixel f, a pixel g, and a pixel h in a second row are predicted from reconstructed pixel values of an adjacent pixel J
  • pixel values of a pixel i, a pixel j, a pixel k, and a pixel l in a third row are predicted from reconstructed pixel values of an adjacent pixel K
  • pixel values of a pixel m, a pixel n, a pixel o, and a pixel p in a fourth row are predicted from reconstructed pixel values of an adjacent pixel L.
  • pixels of a predicted block in a case of the prediction mode 2 are equally replaced with an average of pixel values of upper pixels A, B, C, and D, and left pixels I, J, K, and L.
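  • The three non-directional-weighting modes described above (vertical, horizontal, and DC) can be sketched for a 4×4 block as follows. This is a minimal illustration, not the encoder's actual implementation: the function names `predict_4x4` and `select_mode` are hypothetical, the DC value is assumed to be the rounded integer average of the eight neighbours, and SAD is assumed as the "smallest difference" criterion for choosing a mode.

```python
def predict_4x4(mode, top, left):
    """Build a 4x4 predicted block.

    top  = [A, B, C, D]: reconstructed pixels above the block
    left = [I, J, K, L]: reconstructed pixels to the left of the block
    """
    if mode == 0:   # vertical: every column repeats the pixel above it
        return [list(top) for _ in range(4)]
    if mode == 1:   # horizontal: every row repeats the pixel to its left
        return [[left[r]] * 4 for r in range(4)]
    if mode == 2:   # DC: rounded average of the eight neighbours (assumed rounding)
        dc = (sum(top) + sum(left) + 4) // 8
        return [[dc] * 4 for _ in range(4)]
    raise ValueError("only modes 0, 1 and 2 are covered by this sketch")


def select_mode(block, top, left):
    """Pick the mode whose prediction has the smallest SAD (assumed criterion)."""
    costs = {}
    for mode in (0, 1, 2):
        pred = predict_4x4(mode, top, left)
        costs[mode] = sum(abs(b - p)
                          for block_row, pred_row in zip(block, pred)
                          for b, p in zip(block_row, pred_row))
    return min(costs, key=costs.get)
```

  • The prediction error that is later transform-encoded is simply the element-wise difference between the current block and the block returned by `predict_4x4` for the selected mode.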
  • pixels of a predicted block in a case of the prediction mode 3 are interpolated in a lower-left direction at an angle of 45° between a lower-left side and an upper-right side of the predicted block
  • pixels of a predicted block in a case of the prediction mode 4 are extrapolated in a lower-right direction at an angle of 45° between a lower-left side and an upper-right side of the predicted block.
  • pixels of a predicted block in a case of the prediction mode 6 are extrapolated in a lower-right direction at an angle of about 26.6° with respect to a horizontal line
  • pixels of a predicted block in a case of the prediction mode 7 are extrapolated in a lower-left direction at an angle of about 26.6° with respect to a vertical line
  • pixels of a predicted block in a case of the prediction mode 8 are interpolated in an upper direction at an angle of about 26.6° with respect to a horizontal line.
  • the pixels of the predicted block can be generated from a weighted average of the pixels A to M of the reference block decoded in advance in the prediction mode 3 to 8 .
  • the pixel d located in an upper right side of the predicted block can be estimated as shown in Equation (1).
  • a round( ) function is a function of rounding off to the nearest whole number.
  • $d = \mathrm{round}(B/4 + C/2 + D/4)$    Equation (1)
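  • Equation (1) can be evaluated with integer arithmetic only. The sketch below assumes that "rounding off to the nearest whole number" means rounding halves up, in which case the weighted average equals (B + 2C + D + 2) shifted right by two bits. The names `round_half_up` and `predict_d` are hypothetical, and Python's built-in `round()` is deliberately avoided because it rounds halves to the nearest even number.

```python
import math


def round_half_up(x):
    """Round to the nearest whole number, with halves rounded up."""
    return math.floor(x + 0.5)


def predict_d(b, c, d):
    """Equation (1) in integer form: (B + 2C + D + 2) >> 2."""
    return (b + 2 * c + d + 2) >> 2


# Example with reconstructed neighbours B=10, C=20, D=40.
print(predict_d(10, 20, 40))                    # integer form
print(round_half_up(10 / 4 + 20 / 2 + 40 / 4))  # direct evaluation of Equation (1)
```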
  • pixels of the predicted block are interpolated from upper pixels
  • the pixels of the predicted block are interpolated from left pixels.
  • the pixels of the predicted block are calculated as an average of the upper pixels and the left pixels.
  • a linear “plane” function suitable for the upper pixels and the left pixels is used. The prediction mode 3 is more suitable for an area in which the luminance is smoothly changed.
  • pixel values of the predicted block are generated in directions corresponding to respective modes based on adjacent pixels of the predicted block to be currently encoded in the respective prediction modes except for the DC mode.
  • a prediction error between a predicted value predicted by each prediction mode and a current pixel value is transform-encoded using an integer transform scheme based on a DCT (Discrete Cosine Transform).
  • An integer transform in the 4×4 unit is applied when a 4×4 intra prediction mode and a 16×16 intra prediction mode are used according to a block size, and an integer transform in the 8×8 unit is applied when an 8×8 intra prediction mode is used.
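  • For reference, the 4×4 DCT-based integer transform of H.264 can be sketched as a pair of matrix products with the standard's 4×4 core matrix. The helper names (`matmul`, `transpose`, `forward_core_transform`) are hypothetical, and the scaling and quantization stages that follow the core transform in a real encoder are omitted from this sketch.

```python
# H.264 4x4 forward core transform matrix (integer approximation of the DCT).
CF = [
    [1,  1,  1,  1],
    [2,  1, -1, -2],
    [1, -1, -1,  1],
    [1, -2,  2, -1],
]


def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def transpose(m):
    return [list(col) for col in zip(*m)]


def forward_core_transform(residual):
    """Y = CF * X * CF^T for a 4x4 prediction error block X (no scaling)."""
    return matmul(matmul(CF, residual), transpose(CF))
```

  • The same fixed matrix is applied regardless of the prediction mode or the image content, which is the limitation that the mode-dependent and adaptive transform bases discussed in this disclosure aim to address.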
  • the Video Coding Experts Group of the ITU-T has further developed the H.264 standard recently, so that the predictive encoding performance is further improved.
  • the predictive encoding performance is improved by increasing the number of intra prediction modes through further diversifying the directivity of a pixel value used in the intra prediction and introducing a scheme of adding weights of two intra prediction modes in “Improvement of Bidirectional Intra Prediction”, ITU-T SG16/Q.6 Doc. VCEG-AG08, October 2007 by Shiodera Taichiro, Akiyuki Tanizawa, Takeshi Chujoh, and Tomoo Yamakage.
  • this scheme has the disadvantage that the number of intra prediction modes to be considered increases by up to 4 times, which greatly increases both the amount of computation needed to find an optimal mode and the amount of additional information needed to encode the increased prediction modes.
  • a transform scheme of using different KLT (Karhunen-Loeve Transform) based directivity bases is proposed based on the idea that there still remains the spatial redundancy in a prediction error after the intra prediction and such a spatial redundancy has a high correlation with an intra prediction direction in “Improved Intra Coding”, ITU-T SG16/Q.6 Doc. VCEG-AG11, October 2007 by Yan Ye and Marta Karczewicz.
  • the transform scheme has significantly improved the intra mode encoding performance by performing an adaptive prediction error encoding according to the intra prediction mode, without any additional information, by using KLT transform bases trained on several test images.
  • the transform scheme has a disadvantage that a generated transform base cannot have the optimal energy concentration efficiency for various video sequences having different characteristics or other partial local images having different characteristics within one sequence.
  • To solve the above-mentioned problem, an aspect of the present disclosure provides a video encoding apparatus and method that achieve a higher energy concentration effect by efficiently removing the spatial redundancy remaining in a prediction error. To transform-encode the prediction error more efficiently after an intra prediction, a transform base is adaptively generated according to a local characteristic change of the prediction error as well as an intra prediction mode, and the generated transform base is used in the transform encoding of the prediction error, which improves the compression efficiency of the intra prediction encoding. The present disclosure also provides a transform encoding apparatus and method, a transform base generating apparatus and method, and a video decoding apparatus and method used in the same.
  • An aspect of the present disclosure provides a video encoding apparatus, including: an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; a transform base generator for generating transform bases for respective intra prediction modes based on the prediction errors collected by the intra prediction error collector; an intra predictor for predicting a pixel value of a current pixel by using neighboring pixels of a target block within a current frame according to a directional intra prediction mode and generating a prediction error through a difference between a predicted pixel value and the current pixel; and a transform encoder for transform-encoding the prediction error generated by the intra predictor by using the transform bases generated by the transform base generator.
  • the transform base generator includes a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector.
  • the transform base generator generates a Karhunen-Loeve Transform or KLT-based transform base based on the autocorrelation matrix calculated by the correlation matrix calculator.
  • the transform base generator may include a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector; and an eigenvector calculator for calculating an eigenvector from the autocorrelation matrix calculated by the correlation matrix calculator.
  • the transform encoder preferably transform-encodes the prediction error generated by the intra predictor by using a calculated eigenvector.
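  • The generation of a KLT-based transform base from an autocorrelation matrix and its eigenvectors can be sketched as follows. Several points are assumptions of this example rather than details given in the text: each prediction error block is flattened row by row into a vector, the autocorrelation matrix is taken as the average of the outer products of those vectors, and the eigenvectors are found with a cyclic Jacobi iteration written in plain Python. All function names (`autocorrelation`, `jacobi_eigen`, `klt_basis`, `klt_forward`) are hypothetical.

```python
import math


def autocorrelation(vectors):
    """Assumed form: R = (1/N) * sum of x x^T over the collected error vectors."""
    n, count = len(vectors[0]), len(vectors)
    return [[sum(v[i] * v[j] for v in vectors) / count for j in range(n)]
            for i in range(n)]


def jacobi_eigen(matrix, sweeps=50):
    """Eigen-decomposition of a symmetric matrix by cyclic Jacobi rotations.

    Returns (eigenvalues, eigenvectors), sorted by decreasing eigenvalue;
    eigenvectors[k] is the unit-length eigenvector of eigenvalues[k].
    """
    n = len(matrix)
    a = [[float(x) for x in row] for row in matrix]
    v = [[float(i == j) for j in range(n)] for i in range(n)]
    total = sum(x * x for row in a for x in row)
    for _ in range(sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off <= 1e-30 * total:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if a[p][q] == 0.0:
                    continue
                # Rotation angle in [-pi/4, pi/4] that zeroes a[p][q].
                diff = a[q][q] - a[p][p]
                sign = 1.0 if diff >= 0 else -1.0
                theta = 0.5 * math.atan2(2.0 * sign * a[p][q], abs(diff))
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):   # update columns p and q of A and V
                    a[k][p], a[k][q] = (c * a[k][p] - s * a[k][q],
                                        s * a[k][p] + c * a[k][q])
                    v[k][p], v[k][q] = (c * v[k][p] - s * v[k][q],
                                        s * v[k][p] + c * v[k][q])
                for k in range(n):   # update rows p and q of A
                    a[p][k], a[q][k] = (c * a[p][k] - s * a[q][k],
                                        s * a[p][k] + c * a[q][k])
    order = sorted(range(n), key=lambda i: a[i][i], reverse=True)
    return ([a[i][i] for i in order],
            [[v[k][i] for k in range(n)] for i in order])


def klt_basis(error_blocks):
    """Transform base for one intra prediction mode: eigenvectors of R."""
    vectors = [[x for row in block for x in row] for block in error_blocks]
    _, basis = jacobi_eigen(autocorrelation(vectors))
    return basis


def klt_forward(basis, block):
    """Transform coefficients: projection of the block onto each basis vector."""
    x = [p for row in block for p in row]
    return [sum(b * xi for b, xi in zip(vec, x)) for vec in basis]
```

  • Because the basis vectors are ordered by decreasing eigenvalue, the energy of prediction errors that resemble the collected ones is concentrated in the first few coefficients, which is the energy concentration effect the disclosure refers to.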
  • Another aspect of the present disclosure provides a transform encoding apparatus for transforming and encoding a prediction error generated by a difference between a predicted pixel predicted by an intra prediction apparatus and a current pixel,
  • the transform encoding apparatus including: an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; and a transform base generator for generating transform bases for respective intra prediction modes based on the prediction errors collected by the intra prediction error collector.
  • the transform encoding apparatus preferably transform-encodes the prediction error generated by the intra prediction apparatus by using the transform bases generated by the transform base generator.
  • the transform encoding apparatus may further include a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector based on an equation of
  • Another aspect of the present disclosure provides a transform base generating apparatus for generating transform bases for intra prediction modes,
  • the transform base generating apparatus including: an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector; and an eigenvector calculator for calculating an eigenvector from the autocorrelation matrix calculated by the correlation matrix calculator, the transform base generating apparatus generating the transform base for each intra prediction mode based on the eigenvector calculated by the eigenvector calculator.
  • the transform base generating apparatus preferably generates a KLT-based transform base based on the autocorrelation matrix and the eigenvector.
  • Another aspect of the present disclosure provides an intra prediction apparatus, including: an intra predictor for predicting a pixel value of a current pixel by using neighboring pixels of a target block within a current frame according to a directional intra prediction mode and generating a prediction error through a difference between a predicted pixel value and the current pixel; and an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock.
  • the intra prediction apparatus preferably outputs the prediction errors for the macroblocks in the regular unit, which are encoded prior to the current macroblock, the prediction error being collected by the intra prediction error collector, together with the prediction error generated by the intra predictor for the current frame.
  • Yet another aspect of the present disclosure provides a video decoding apparatus, including: an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; a transform base generator for generating transform bases for respective intra prediction modes based on the prediction errors collected by the intra prediction error collector; an intra prediction mode reader for reading an intra prediction mode of a target block to be decoded for an input bitstream; an inverse transformer for inversely transforming a prediction error for the target block by using a transform base corresponding to the intra prediction mode read by the intra prediction mode reader among the transform bases generated by the transform base generator; and a current block reconstructer for predicting a pixel value of a current pixel by using neighboring pixels of the target block within a current frame according to the intra prediction mode read by the intra prediction mode reader and reconstructing a current block by adding a predicted pixel value and a value of the prediction error inversely transformed by the inverse transformer.
  • the transform base generator preferably includes a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector; and an eigenvector calculator for calculating an eigenvector from the autocorrelation matrix calculated by the correlation matrix calculator, and generates a KLT-based transform base based on the autocorrelation matrix and the eigenvector.
  • Yet another aspect of the present disclosure provides a video encoding method, including: collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock, and predicting a value of a current pixel by using neighboring pixels of a target block according to a directional intra prediction mode for a current frame and generating a prediction error through a difference between a predicted value and the value of the current pixel; generating transform bases for respective intra prediction modes based on the prediction errors collected in collecting of the prediction errors; and transform-encoding the prediction error generated for the current frame by using the transform bases generated in generating of the transform bases.
  • Another aspect of the present disclosure provides a transform encoding method of transforming and encoding a prediction error generated by a difference between a pixel predicted by an intra prediction apparatus and a current pixel,
  • the transform encoding method including: collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; generating transform bases for respective intra prediction modes based on the prediction errors collected in collecting of the prediction errors.
  • the transform encoding method preferably transform-encodes the prediction error generated by the intra prediction apparatus by using the transform bases generated in generating of the transform bases.
  • the transform encoding method preferably further includes calculating an autocorrelation matrix for a set of the prediction errors collected in collecting of the prediction errors.
  • the process of generating the transform bases generates the transform bases by using a calculated autocorrelation matrix.
  • Yet another aspect of the present disclosure provides a video decoding method, including: collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; generating transform bases for respective intra prediction modes based on the prediction errors collected in collecting of the prediction errors; reading an intra prediction mode of a target block to be decoded for an input bitstream; inversely transforming a prediction error for the target block by using a transform base corresponding to the intra prediction mode read in reading of the intra prediction mode among the transform bases generated in generating of the transform bases; and predicting a pixel value of a current pixel by using neighboring pixels of the target block within a current frame according to the intra prediction mode read in reading of the intra prediction mode and reconstructing a current block by adding a predicted pixel value and a value of the prediction error inversely transformed in inversely transforming of the prediction error.
  • the process of generating the transform bases preferably includes calculating an autocorrelation matrix for a set of the prediction errors collected in collecting of the prediction errors; and calculating an eigenvector from the autocorrelation matrix calculated in calculating of the correlation matrix.
  • a KLT-based transform base is preferably generated based on the autocorrelation matrix and the eigenvector.
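  • The decoding steps listed above (select the transform base for the intra prediction mode that was read, inversely transform the prediction error, and add the predicted pixel values) could look roughly like the sketch below. It assumes an orthonormal transform base stored as a list of basis vectors, a row-by-row flattening of the block, and a dictionary `bases` keyed by mode; these, and the function names, are hypothetical choices for this example.

```python
def forward_transform(basis, samples):
    """Coefficients = projections of the flattened block onto each basis vector."""
    return [sum(b * s for b, s in zip(vec, samples)) for vec in basis]


def inverse_transform(basis, coeffs):
    """Inverse of forward_transform for an orthonormal basis:
    x[j] = sum over i of coeffs[i] * basis[i][j]."""
    n = len(basis[0])
    return [sum(c * vec[j] for c, vec in zip(coeffs, basis)) for j in range(n)]


def reconstruct_block(mode, coeffs, predicted, bases):
    """Reconstruct a square block from its transform coefficients.

    mode      : intra prediction mode read from the bitstream
    coeffs    : decoded transform coefficients of the prediction error
    predicted : predicted block produced by intra prediction for that mode
    bases     : dict mapping mode -> transform base (assumed layout)
    """
    size = len(predicted)
    residual = inverse_transform(bases[mode], coeffs)
    # Built-in round() is enough for a sketch; it rounds exact halves to even.
    return [[predicted[r][c] + round(residual[r * size + c])
             for c in range(size)]
            for r in range(size)]
```

  • Since the decoder builds `bases` from blocks it has already decoded, in the same way the encoder does from previously encoded blocks, no transform base needs to be transmitted in the bitstream.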
  • the intra predictive encoding performance can be significantly improved without adding any additional information, by adaptively generating a transform base according to an image characteristic change as well as an intra prediction mode for the specific encoding unit and transform-encoding an intra prediction error, and thus the compression efficiency of a compression apparatus or the picture quality of a reconstructed image can be greatly improved.
  • FIG. 1 is a diagram for illustrating a general inter prediction
  • FIG. 2 is a diagram for illustrating the directivity of intra prediction modes
  • FIG. 3 is a diagram illustrating a labeling example for the intra prediction modes of FIG. 2 ;
  • FIG. 4 is a diagram for illustrating respective intra prediction modes of FIG. 2 ;
  • FIG. 5A is a diagram for illustrating a prediction mode 0 among the intra prediction modes of FIG. 2
  • FIG. 5B is a diagram for illustrating a prediction mode 1 among the intra prediction modes of FIG. 2 ;
  • FIG. 6 is a diagram schematically illustrating a video encoding apparatus according to an aspect of the present disclosure
  • FIG. 7 is a flowchart illustrating a video encoding method by the video encoding apparatus of FIG. 6 ;
  • FIG. 8 is a flowchart illustrating a transform encoding method according to another aspect of the present disclosure.
  • FIG. 9 is a diagram illustrating an example of a structure of a bitstream generated by the video encoding apparatus of FIG. 6 ;
  • FIG. 10 is a diagram schematically illustrating a video decoding apparatus according to an aspect of the present disclosure.
  • FIG. 11 is a flowchart illustrating the video decoding method by the video decoding apparatus of FIG. 10 .
  • Terms such as first, second, A, B, (a), and (b) are used solely to differentiate one component from another and do not imply or suggest the substance, order, or sequence of the components.
  • If a component is described as ‘connected’, ‘coupled’, or ‘linked’ to another component, this may mean that the components are not only directly ‘connected’, ‘coupled’, or ‘linked’ but may also be indirectly ‘connected’, ‘coupled’, or ‘linked’ via a third component.
  • FIG. 6 is a diagram schematically illustrating a video encoding apparatus according to an aspect of the present disclosure.
  • the video encoding apparatus 600 includes an intra prediction error collector 610 , a transform base generator 620 , an intra predictor 630 , and a transform encoder 640 .
  • the intra prediction error collector 610 , the transform base generator 620 , and the transform encoder 640 may be referred to as a transform encoding apparatus.
  • the intra prediction error collector 610 collects prediction errors of blocks having the same intra prediction mode from macroblocks in the regular unit, which are encoded prior to a current macroblock. That is, in order to generate transform bases for various intra prediction modes, the intra prediction error collector 610 receives macroblocks in the regular unit, which are encoded prior to the current macroblock, and collects prediction errors of blocks having the same intra prediction mode from blocks in which intra prediction modes have been determined. In this event, since 9 types of intra prediction modes are defined in the 4×4 intra prediction mode and the 8×8 intra prediction mode, 4×4 intra prediction errors and 8×8 intra prediction errors can be collected into 9 types, respectively.
  • 16×16 intra prediction errors can be collected into 4 types.
  • intra prediction errors for the 4×4 intra prediction mode can be collected into a set as defined in Equation (2).
  • $P_m = \{ P_k^m \}$  Equation 2
  • in Equation (2), m is the index of the intra prediction mode
  • $N_m$ denotes the number of blocks in which an intra prediction mode is determined as an intra prediction mode m
  • $P_m$ denotes a 4×4 prediction error block set of the blocks in which the intra prediction mode is determined as the intra prediction mode m among the macroblocks in the regular unit which are encoded prior to the current macroblock
  • $P_k^m$ denotes one 4×4 prediction error block which is a k-th element of $P_m$.
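The grouping described by Equation (2) can be sketched as follows. This is only an illustration under stated assumptions: the input format (a sequence of `(mode, error_block)` pairs) and the function name `collect_prediction_errors` are hypothetical and do not come from the disclosure.

```python
from collections import defaultdict


def collect_prediction_errors(coded_blocks):
    """Group prediction error blocks by their intra prediction mode.

    coded_blocks: iterable of (mode, error_block) pairs taken from blocks
    encoded before the current macroblock (hypothetical input format).
    Returns {m: [P_0^m, P_1^m, ...]}; len(result[m]) plays the role of N_m.
    """
    sets = defaultdict(list)
    for mode, error_block in coded_blocks:
        sets[mode].append(error_block)
    return dict(sets)
```

Modes that never occurred in the previously encoded macroblocks simply have no entry, so a caller would need a fallback transform base for them; how the disclosure handles that case is not stated in this text.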
  • the transform base generator 620 generates transform bases of respective intra prediction modes based on prediction errors collected by the intra prediction error collector 610 according to an intra prediction block size and an intra prediction mode.
  • the transform base is generated based on a Karhunen-Loeve Transform or KLT, which is theoretically known as a transform having the best energy concentration efficiency.
  • the transform base generator 620 can be implemented as an independent element, or may include a correlation matrix calculator 622 and an eigenvector calculator 624 . Further, the transform base generating apparatus can be implemented, including the intra prediction error collector 610 , the correlation matrix calculator 622 , and the eigenvector calculator 624 .
  • the correlation matrix calculator 622 calculates an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector 610 .
  • two transform bases for a column vector signal and a row vector signal should be generated in Equation (2) because the intra prediction error block P k m is a two-dimensional signal.
  • an autocorrelation matrix of the intra prediction error should be obtained in order to generate the KLT base, and the autocorrelation matrix can be obtained as defined in Equations (3) and (4).
  • $R_c^m$ denotes a 4×4 autocorrelation matrix for a column vector signal of a 4×4 intra prediction error in which an intra prediction mode is determined as an intra prediction mode m
  • $N_m$ denotes the number of blocks in which an intra prediction mode is determined as the intra prediction mode m among macroblocks in the regular unit which are encoded prior to the current macroblock
  • $P_k^m$ denotes one 4×4 prediction error block which is a k-th element of $P_m$, which denotes a 4×4 prediction error block set of the blocks in which the intra prediction mode is determined as the intra prediction mode m among the macroblocks in the regular unit which are encoded prior to the current macroblock.
  • $R_r^m$ denotes a 4×4 autocorrelation matrix for a row vector signal of the 4×4 intra prediction error in which the intra prediction mode is determined as the intra prediction mode m.
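A minimal sketch of the two autocorrelation matrices follows. Equations (3) and (4) are not reproduced in this text, so the sketch assumes the common sample estimates: the column autocorrelation is the average of P·Pᵀ and the row autocorrelation is the average of Pᵀ·P over the collected blocks. The normalization by the number of blocks is an assumption; it scales the eigenvalues but does not change the eigenvectors. All function names are illustrative.

```python
def transpose(a):
    return [list(col) for col in zip(*a)]


def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def autocorrelation(blocks):
    """Return (R_c, R_r) for a non-empty list of square error blocks.

    R_c averages P * P^T (statistics of the column vectors of each block);
    R_r averages P^T * P (statistics of the row vectors of each block).
    Dividing by the block count is an assumed normalization.
    """
    n = len(blocks[0])
    count = len(blocks)
    r_c = [[0.0] * n for _ in range(n)]
    r_r = [[0.0] * n for _ in range(n)]
    for p in blocks:
        pt = transpose(p)
        ppt = matmul(p, pt)
        ptp = matmul(pt, p)
        for i in range(n):
            for j in range(n):
                r_c[i][j] += ppt[i][j] / count
                r_r[i][j] += ptp[i][j] / count
    return r_c, r_r
```

Both results are symmetric by construction, which is what allows an orthogonal set of eigenvectors to be extracted from them in the next step.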
  • the KLT base for the 4×4 intra prediction error block can be obtained through an eigenvector of the autocorrelation matrix, and the eigenvector calculator 624 can calculate eigenvectors as defined in Equations (5) and (6) from the autocorrelation matrices defined in Equations (3) and (4), which are calculated by the correlation matrix calculator 622.
  • $R_c^m \phi_n^{m,c} = \lambda_n^{m,c} \phi_n^{m,c}, \quad 0 \le n \le 3$  Equation 5
  • $R_r^m \phi_n^{m,r} = \lambda_n^{m,r} \phi_n^{m,r}, \quad 0 \le n \le 3$  Equation 6
  • in Equation (5), $\phi_n^{m,c}$ denotes an eigenvector of $R_c^m$
  • $\lambda_n^{m,c}$ denotes an eigenvalue of $R_c^m$
  • in Equation (6), $\phi_n^{m,r}$ denotes an eigenvector of $R_r^m$
  • $\lambda_n^{m,r}$ denotes an eigenvalue of $R_r^m$
  • Equations (7) and (8) can be generated by obtaining eigenvectors satisfying Equations (5) and (6) and expressing the eigenvectors as matrices.
  • in Equation (7), $\Phi_c^m$ denotes a KLT base for a column vector signal of a prediction error block corresponding to the intra prediction mode m
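The eigenvector step of Equations (5) to (8) can be illustrated with a small, dependency-free solver. The disclosure does not say how the eigenvectors are computed; the cyclic Jacobi rotation method used here is one standard choice for small symmetric matrices and is an assumption of this sketch, as are the function name and the ordering of the eigenvectors by descending eigenvalue. The sign of each eigenvector is arbitrary.

```python
import math


def jacobi_eigen(matrix, sweeps=50):
    """Eigen-decomposition of a small symmetric matrix by Jacobi rotations.

    Returns (eigenvalues, base): eigenvalues in descending order and the
    matching unit eigenvectors stored as the columns of base.
    """
    n = len(matrix)
    a = [[float(x) for x in row] for row in matrix]
    v = [[float(i == j) for j in range(n)] for i in range(n)]
    for _ in range(sweeps):
        total = sum(x * x for row in a for x in row)
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off <= 1e-24 * total:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if a[p][q] == 0.0:
                    continue
                d = a[q][q] - a[p][p]
                sign = 1.0 if d >= 0 else -1.0
                # Small-angle root of tan(2*theta) = 2*a_pq / (a_qq - a_pp)
                theta = 0.5 * math.atan2(sign * 2.0 * a[p][q], sign * d)
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):      # columns p and q of A and V
                    akp, akq = a[k][p], a[k][q]
                    a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
                    vkp, vkq = v[k][p], v[k][q]
                    v[k][p], v[k][q] = c * vkp - s * vkq, s * vkp + c * vkq
                for k in range(n):      # rows p and q of A
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
    order = sorted(range(n), key=lambda i: a[i][i], reverse=True)
    eigenvalues = [a[i][i] for i in order]
    base = [[v[r][i] for i in order] for r in range(n)]
    return eigenvalues, base
```

Applying `jacobi_eigen` once to the column autocorrelation matrix and once to the row autocorrelation matrix of a given mode would yield the two bases used by the separable transform.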
  • the intra predictor 630 predicts a pixel value for a predicted block by using neighboring pixels of a target block within a current frame according to a directional intra prediction mode. Further, the intra predictor 630 generates a prediction error through a difference between the pixel value for the target block and the pixel value for the predicted block. That is, the intra predictor 630 includes a differentiator (not shown) for calculating a differential between the target block and the predicted block.
  • the transform encoder 640 transform-encodes a prediction error generated by the intra predictor 630 by using transform bases generated by the transform base generator 620 .
  • the aforementioned transform of the two-dimensional signal using the KLT base is performed as defined in Equation (9).
  • $V^m = (\Phi_c^m)^T \, U^m \, (\Phi_r^m)$  Equation 9
  • in Equation (9), $U^m$ denotes a prediction error signal of the intra prediction mode m
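The separable transform of Equation (9) amounts to two matrix products. In the sketch below the function names are illustrative, and the usage example substitutes a scaled 4×4 Hadamard matrix for the bases purely as a stand-in orthonormal matrix; it is not a KLT base derived from prediction errors.

```python
def transpose(a):
    return [list(col) for col in zip(*a)]


def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def klt_forward(u, phi_c, phi_r):
    """Transform coefficients: phi_c^T * U * phi_r, as in Equation (9)."""
    return matmul(matmul(transpose(phi_c), u), phi_r)


# Stand-in orthonormal base (scaled Hadamard), NOT a KLT base from data.
HADAMARD = [[0.5 * x for x in row] for row in (
    (1, 1, 1, 1), (1, 1, -1, -1), (1, -1, -1, 1), (1, -1, 1, -1))]

flat_block = [[4] * 4 for _ in range(4)]
coefficients = klt_forward(flat_block, HADAMARD, HADAMARD)
```

For the flat block in the example, all energy lands in the single coefficient at the top-left position, which is the kind of energy concentration the transform is meant to provide.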
  • a method of generating a KLT base for an intra prediction error of the 8×8 intra prediction mode is the same as in the case of the 4×4 intra prediction mode.
  • a method of generating a KLT base for an intra prediction error of the 16×16 intra prediction mode is the same as in the case of the 4×4 intra prediction mode; the only difference is that the number of intra prediction error sets and the number of KLT bases are each 4, fewer than the 9 of the 4×4 intra prediction mode.
  • the KLT base generated by the transform base generator 620 is not a transform base optimized for the prediction error generated by the intra predictor 630. However, because a general video signal exhibits a high correlation between the current frame and a previous frame, its performance does not differ significantly from that of a transform base optimized for the current frame. In addition, because the transform base is generated from macroblocks in the regular unit, which are encoded prior to the current macroblock, no additional information on the transform base needs to be transmitted for decoding.
  • the intra predictor 630 may include the intra prediction error collector 610. That is, the intra predictor 630 predicts the pixel value of the predicted block by using neighboring pixels of the target block within the current frame according to the directional intra prediction mode, and generates the prediction error through the difference between the pixel value of the predicted block and the pixel value of the target block. At the same time, it collects prediction errors of blocks having the same intra prediction mode from blocks in which intra prediction modes have been determined among macroblocks in the regular unit, which are encoded prior to the current macroblock, so that the intra predictor 630 can output the prediction error of the current frame together with the collected prediction errors of the blocks.
  • when an intra prediction encoding is performed according to the aforementioned method, not only can the performance be improved by applying different transform bases depending on intra prediction modes, but better intra prediction encoding efficiency can also be achieved by providing, in every specific encoding unit, an adaptive transform base that can immediately respond to a characteristic change of an image.
  • FIG. 7 is a flowchart illustrating a video encoding method by the video encoding apparatus of FIG. 6 .
  • the intra prediction error collector 610 collects prediction errors of blocks having the same intra prediction mode from macroblocks in the regular unit, which are encoded prior to a current macroblock in step S 701 .
  • the collected prediction errors for the macroblocks in the regular unit, which are encoded prior to the current macroblock, can be expressed in the form of a set as shown in Equation (2).
  • the transform base generator 620 calculates an autocorrelation matrix for an intra prediction mode set based on the prediction errors collected by the intra prediction error collector 610 in step S 703 .
  • the 4×4 intra prediction error block $P_k^m$ is a two-dimensional signal
  • two types of transform bases for a column vector signal and a row vector signal should be generated.
  • the autocorrelation matrix can be calculated as defined in Equations (3) and (4).
  • the KLT base can be calculated through an eigenvector of the autocorrelation matrix as defined in Equations (5) and (6) in step S 705 .
  • the calculated eigenvector can be expressed as matrices shown in Equations (7) and (8).
  • the intra predictor 630 predicts a pixel value of a current pixel by using neighboring pixels of a target block within a current frame according to a directional intra prediction mode, and generates a prediction error through a difference between the predicted pixel and the current pixel in step S 707 .
  • the transform encoder 640 transform-encodes the prediction error generated by the intra predictor 630 by using the transform bases generated by the transform base generator 620 as shown in Equation (9).
  • FIG. 8 is a flowchart illustrating a transform encoding method according to another aspect of the present disclosure.
  • the intra predictor 630 is independently constructed from the intra prediction error collector 610 , the transform base generator 620 , and the transform encoder 640 , the intra prediction error collector 610 , the transform base generator 620 , and the transform encoder 640 may be referred to as elements of the transform encoding apparatus.
  • steps S801 to S805 of the transform encoding apparatus are performed in the same way as steps S701 to S705 of FIG. 7, and the prediction error independently generated by the intra predictor 630 is transform-encoded using the generated transform base in step S807.
  • FIG. 9 is a diagram illustrating an example of a structure of a bitstream generated by the video encoding apparatus of FIG. 6 .
  • the bitstream is encoded in the unit of slices.
  • the bitstream includes a slice header 910 and slice data 920
  • the slice data 920 includes a plurality of macroblock data (MB) 921 to 924 .
  • one macroblock data 923 may include an mb_type field 930 , an mb_pred field 935 , and a texture data field 939 .
  • the mb_type field 930 records a value indicating a macroblock type. That is, the value recorded in the mb_type field 930 indicates whether a current macroblock is an intra macroblock or an inter macroblock.
  • the mb_pred field 935 records a detailed prediction mode according to the macroblock type.
  • information on a prediction mode selected in the intra prediction is recorded.
  • information on a motion vector and a reference frame number for each macroblock partition is recorded.
  • the mb_pred field 935 is divided into a plurality of block information pieces 941 to 944, and each information piece 942 is divided into a main_mode field 945 for recording a value of a main mode and a sub_mode field 946 for recording a value of a sub mode.
  • the texture data field 939 records an encoded residual image, that is, texture data.
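The bitstream layout of FIG. 9 can be modeled as nested records. The sketch below is only a reading aid: the class names and the Python types chosen for each field are assumptions, and the motion vector and reference frame information of inter macroblocks is left out.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class BlockInfo:                 # one of the block information pieces 941 to 944
    main_mode: int               # main_mode field 945
    sub_mode: int                # sub_mode field 946


@dataclass
class MacroblockData:            # e.g. macroblock data 923
    mb_type: str                 # mb_type field 930: "intra" or "inter" (assumed encoding)
    mb_pred: List[BlockInfo] = field(default_factory=list)  # mb_pred field 935
    texture_data: bytes = b""    # texture data field 939 (encoded residual)


@dataclass
class Slice:
    header: dict                 # slice header 910 (contents not modeled)
    data: List[MacroblockData]   # slice data 920
```

Note that no field carries a transform base: as described above, the base is regenerated from previously coded macroblocks rather than transmitted.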
  • FIG. 10 is a diagram schematically illustrating a video decoding apparatus according to an aspect of the present disclosure
  • FIG. 11 is a flowchart illustrating the video decoding method by the video decoding apparatus of FIG. 10 .
  • a detailed construction and operation of the video decoding apparatus 1000 will be described with reference to FIGS. 10 and 11 .
  • the video decoding apparatus 1000 includes an intra prediction error collector 1010 , a transform base generator 1020 , a prediction mode reader 1030 , an inverse transformer 1040 , and a current block reconstructer 1050 .
  • the transform base generator 1020 may include a correlation matrix calculator 1022 and an eigenvector calculator 1024 .
  • the intra prediction error collector 1010 collects prediction errors of blocks having the same intra prediction mode from macroblocks in the regular unit, which are encoded prior to a current macroblock in step S1101. That is, in order to generate transform bases for various intra prediction modes, the intra prediction error collector 1010 receives the macroblocks in the regular unit, which are encoded prior to the current macroblock, and collects the prediction errors of the blocks having the same intra prediction mode from blocks in which intra prediction modes have been selected, like the intra prediction error collector of FIG. 6. In this event, since 9 types of intra prediction modes are defined in the 4×4 intra prediction mode and the 8×8 intra prediction mode, 4×4 intra prediction errors and 8×8 intra prediction errors can be collected into 9 types, respectively. Further, since 4 types of intra prediction modes are defined in the 16×16 intra prediction mode, 16×16 intra prediction errors can be collected into 4 types. For example, intra prediction errors for the 4×4 intra prediction mode can be collected into a set as defined in Equation (2).
  • the transform base generator 1020 generates transform bases for respective intra prediction modes based on the prediction errors collected by the intra prediction error collector 1010 .
  • the transform base is generated based on the KLT, which is theoretically known as a transform having the best energy concentration efficiency like a case of the video encoding apparatus 600 .
  • the transform base generator 1020 can be implemented as an independent element, or implemented as the transform base generating apparatus including the intra prediction error collector 1010 , the correlation matrix calculator 1022 , and the eigenvector calculator 1024 .
  • the correlation matrix calculator 1022 calculates an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector 1010 in step S 1103 .
  • two transform bases for a column vector signal and a row vector signal should be generated in Equation (2) because the intra prediction error block P k m is a two-dimensional signal.
  • an autocorrelation matrix of the intra prediction error should be obtained in order to generate the KLT base, and the autocorrelation matrix can be obtained as defined in Equations (3) and (4).
  • the KLT base for the 4×4 intra prediction error block can be obtained through an eigenvector of the autocorrelation matrix, and the eigenvector calculator 1024 can calculate eigenvectors as defined in Equations (5) and (6) from the autocorrelation matrices defined in Equations (3) and (4), which are calculated by the correlation matrix calculator 1022 in step S1105.
  • the transform base generator 1020 can generate a KLT based transform base by obtaining eigenvectors satisfying Equations (5) and (6) and expressing the eigenvectors as matrices as defined in Equations (7) and (8).
  • the prediction mode reader 1030 reads an intra prediction mode of a target block to be decoded from the bitstream structure shown in FIG. 9 in step S 1109 . That is, the prediction mode reader 1030 receives a bitstream generated in the video encoding apparatus 600 and reads the intra prediction mode of the target block to be decoded of the current frame.
  • the inverse transformer 1040 inversely transforms a prediction error received through the bitstream by using a transform base corresponding to the intra prediction mode read by the intra prediction mode reader 1030 among transform bases generated by the transform base generator 1020 in step S 1111 .
  • the prediction error of the target block received through the bitstream generated by the video encoding apparatus 600 is transform-encoded using different transform bases depending on the intra prediction mode of the target block unlike the H.264 standard applying the integer transform and the inverse transform based on a fixed DCT (Discrete Cosine Transform) in a process of the transform and inverse transform of the prediction error regardless of the intra prediction mode.
  • the intra prediction mode reader 1030 determines the intra prediction mode of the target block to be decoded of the current frame from an input bitstream, and inversely transforms the prediction error by applying a transform base corresponding to the intra prediction mode read by the intra prediction mode reader 1030 .
  • an inverse transform of the two-dimensional signal using the aforementioned KLT based transform base can be performed as defined in Equation (10).
  • $\hat{U}^m = ((\Phi_c^m)^T)^{-1} \, V^m \, (\Phi_r^m)^{-1}$  Equation 10
  • in Equation (10), $V^m$ denotes a signal generated by transform-encoding a prediction error of the intra prediction mode m
  • $\hat{U}^m$ denotes a signal generated by inversely transforming $V^m$ by the KLT.
  • a transpose matrix is used in the inverse transform without acquiring the inverse matrix because the KLT base is an orthogonal matrix generated through an eigenvector and thus the inverse matrix of the KLT base and the transpose matrix of the KLT base are equal to each other.
  • the inverse transform of the two-dimensional signal can be performed as defined in Equation (11) by more simply using the transpose matrix instead of the inverse matrix.
  • $\hat{U}^m = (\Phi_c^m) \, V^m \, (\Phi_r^m)^T$  Equation 11
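The transpose-based inverse can be checked with a round trip: transforming a block and then applying the transposed bases should recover the block, provided both bases are orthonormal. The function names and the two stand-in bases (a scaled Hadamard matrix and a block rotation) are assumptions made for this sketch; they are not KLT bases computed from prediction errors.

```python
def transpose(a):
    return [list(col) for col in zip(*a)]


def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def klt_forward(u, phi_c, phi_r):
    """phi_c^T * U * phi_r (forward transform)."""
    return matmul(matmul(transpose(phi_c), u), phi_r)


def klt_inverse(v, phi_c, phi_r):
    """phi_c * V * phi_r^T (inverse transform using transposes only)."""
    return matmul(matmul(phi_c, v), transpose(phi_r))


# Two different stand-in orthonormal bases (illustrative, not derived from data).
PHI_C = [[0.5 * x for x in row] for row in (
    (1, 1, 1, 1), (1, 1, -1, -1), (1, -1, -1, 1), (1, -1, 1, -1))]
PHI_R = [[0.6, -0.8, 0.0, 0.0], [0.8, 0.6, 0.0, 0.0],
         [0.0, 0.0, 0.6, -0.8], [0.0, 0.0, 0.8, 0.6]]

error_block = [[1, 2, 3, 4], [5, 6, 7, 8], [-1, 0, 1, 0], [2, -2, 2, -2]]
restored = klt_inverse(klt_forward(error_block, PHI_C, PHI_R), PHI_C, PHI_R)
```

In a real codec the coefficients would be quantized between the two steps, so the restored block would only approximate the original; the sketch omits quantization to isolate the transform pair.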
  • the current block reconstructer 1050 predicts a pixel value of a current pixel by using neighboring pixels of a target block within a current frame according to the intra prediction mode read by the intra prediction mode reader 1030 , and reconstructs a current block by adding the predicted pixel value and the prediction error value inversely transformed by the inverse transformer 1040 in step S 1113 .
  • the video decoding apparatus 1000 can perform the inverse transform and decoding by generating exactly the same adaptive transform base with reference to a previous frame whose decoding has been completed, just as the video encoding apparatus 600 performs the transform encoding by generating different adaptive transform bases depending on the intra prediction mode with reference to the previous frame.
  • the present disclosure is not intended to limit itself to such embodiments. Rather, within the objective scope of the present disclosure, the respective components may be selectively and operatively combined in any numbers. Every one of the components may be also implemented by itself in hardware while the respective ones can be combined in part or as a whole selectively and implemented in a computer program having program modules for executing functions of the hardware equivalents. Codes or code segments to constitute such a program may be easily deduced by a person skilled in the art.
  • the computer program may be stored in computer readable media, which in operation can realize the aspects of the present disclosure.
  • the candidates include magnetic recording media, optical recording media, and carrier wave media.
  • the present disclosure is highly useful for application in the fields of an encoder and a decoder using an intra prediction, an image compression apparatus, etc. It improves the compression efficiency of an intra prediction encoding by adaptively generating a transform base according to a local characteristic change of a prediction error as well as an intra prediction mode, and by using the generated transform base in a transform encoding of the prediction error, in order to efficiently transform-encode the prediction error after the intra prediction.

Abstract

A video encoding apparatus includes an intra prediction error collector; a transform base generator; an intra predictor; and a transform encoder, the intra prediction error collector collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock, the transform base generator generating transform bases for respective intra prediction modes based on the prediction errors collected by the intra prediction error collector. Accordingly, the intra prediction encoding performance can be significantly improved without adding additional information, by adaptively generating a transform base according to an image characteristic change and transform-encoding an intra prediction error. As a result, the compression efficiency of a video compression apparatus or the picture quality of a reconstructed image can be greatly improved.

Description

CROSS REFERENCE TO RELATED APPLICATION
This application claims the priority of Korean Patent Application No. 10-2009-0121980, filed on Dec. 9, 2009 in the KIPO (Korean Intellectual Property Office). Further, this application is the National Phase application of International Application No. PCT/KR2010/008777 filed Dec. 9, 2010, which designates the United States and was published in Korean.
TECHNICAL FIELD
The present disclosure relates to an image-encoding apparatus and method, a transform-encoding apparatus and method, an apparatus and method for generating a transform base, and an image-decoding apparatus and method used in the same. More particularly, the present disclosure relates to a video encoding apparatus and method, which can significantly improve the intra prediction encoding performance without adding additional information, by adaptively generating a transform base according to an image characteristic change as well as an intra prediction mode for a specific encoding unit and transform-encoding an intra prediction error, and to a transform encoding apparatus and method, a transform base generating apparatus and method, and a video decoding apparatus and method used in the same.
BACKGROUND
The statements in this section merely provide background information related to the present disclosure and may not constitute the prior art.
As information and communication technologies, including the Internet, have developed, the use of video communication has increased along with voice communication. Conventional text-based communication is not sufficient to satisfy the various demands of consumers. Accordingly, multimedia services capable of accommodating diverse types of information such as text, video, and music are increasingly provided. Because multimedia data is large, it requires a high-capacity storage medium and a wide bandwidth for transmission. Therefore, a compression coding technique is necessary to transmit multimedia data including text, video, and audio data.
A basic principle of data compression is the removal of redundancy in the data. Data can be compressed by removing spatial redundancy, such as the repetition of the same color or object in an image; temporal redundancy, such as the repetition of the same note in audio or little change between adjacent frames in a moving picture; or psychovisual redundancy, which takes into account that human visual perception is insensitive to high frequencies.
As a video compression method, H.264/AVC has recently drawn increasing interest for its improved compression efficiency over MPEG-4 (Moving Picture Experts Group-4).
A digital video codec standard with a very high data compression rate, H.264 is also referred to as MPEG-4 Part 10 or AVC (Advanced Video Coding). This standard is the result of joint standardization by the Joint Video Team, formed by VCEG (Video Coding Experts Group) of ITU-T (International Telecommunication Union Telecommunication Standardization Sector) and MPEG of ISO/IEC (International Organization for Standardization/International Electrotechnical Commission).
Various methods are proposed to improve the compression efficiency in a compression encoding, and include methods using a temporal prediction and a spatial prediction as representative methods.
The temporal prediction corresponds to a scheme of performing a prediction with reference to a reference block 122 of another frame 120 temporally adjacent in predicting a current block 112 of a current frame 110, as shown in FIG. 1. That is, in inter-predicting the current block 112 of the current frame 110, the temporally adjacent reference frame 120 is searched for, and the reference block 122, which is the most similar to the current block within the reference frame 120, is searched for. Here, the reference block 122 is a block, which can predict the current block 112 best, and a block, which has the smallest SAD (Sum of Absolute Difference) from the current block 112, can be the reference block 122. The reference block 122 becomes a predicted block of the current block 112, and a residual block is generated by subtracting the reference block 122 from the current block 112. The generated residual block is encoded and inserted in a bitstream. In this event, a relative difference between a position of the current block in the current frame 110 and a position of the reference block 122 in the reference frame 120 corresponds to a motion vector 130, and the motion vector 130 is encoded like the residual block. The temporal prediction is also referred to as an inter prediction or an inter frame prediction.
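The reference block search described above can be sketched as an exhaustive SAD search. This is a minimal illustration, assuming frames are given as lists of rows of integer pixel values; the function names, the full-search strategy, and the `search_range` parameter are assumptions of the sketch, not details taken from the disclosure.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized 2-D blocks."""
    return sum(abs(a - b)
               for row_a, row_b in zip(block_a, block_b)
               for a, b in zip(row_a, row_b))


def get_block(frame, top, left, size):
    return [row[left:left + size] for row in frame[top:top + size]]


def full_search(current, reference, top, left, size, search_range):
    """Return (motion_vector, best_sad, residual) for one current block.

    The motion vector is (dx, dy): the offset from the current block
    position to the best matching block in the reference frame.
    """
    cur = get_block(current, top, left, size)
    height, width = len(reference), len(reference[0])
    best = None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            y, x = top + dy, left + dx
            if y < 0 or x < 0 or y + size > height or x + size > width:
                continue  # candidate lies outside the reference frame
            cost = sad(cur, get_block(reference, y, x, size))
            if best is None or cost < best[1]:
                best = ((dx, dy), cost, y, x)
    motion_vector, cost, y, x = best
    ref = get_block(reference, y, x, size)
    residual = [[c - r for c, r in zip(cr, rr)] for cr, rr in zip(cur, ref)]
    return motion_vector, cost, residual
```

The returned residual is the block that would be transform-encoded, and the motion vector is the side information that is encoded along with it.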
The spatial prediction corresponds to a prediction of obtaining a predicted pixel value of a target block by using a reconstructed pixel value of a reference block adjacent to the target block in one frame, and is also referred to as a directional intra prediction (hereinafter, simply referred to as an “intra prediction”) or an intra frame prediction. H.264 defines an encoding/decoding using the intra prediction.
The intra prediction corresponds to a scheme of predicting values of a current subblock by copying, in a determined direction, adjacent pixels located above and to the left of the subblock, and encoding only the differential. According to the intra prediction scheme based on the H.264 standard, a predicted block for a current block is generated based on another block having a prior coding order. Further, the value that is encoded is generated by subtracting the predicted block from the current block. A video encoder based on the H.264 standard selects, for each block, the prediction mode having the smallest difference between the current block and the predicted block from among the prediction modes.
The intra prediction based on the H.264 standard defines nine prediction modes shown in FIG. 2 in consideration of the prediction directivity and positions of adjacent pixels used for generating predicted pixel values of a 4×4 luma block and an 8×8 luma block. The nine prediction modes are divided into a vertical prediction mode (prediction mode 0), a horizontal prediction mode (prediction mode 1), a DC prediction mode (prediction mode 2), a diagonal_down_left prediction mode (prediction mode 3), a diagonal_down_right prediction mode (prediction mode 4), a vertical_right prediction mode (prediction mode 5), a horizontal_down prediction mode (prediction mode 6), a vertical_left prediction mode (prediction mode 7), and a horizontal_up prediction mode (prediction mode 8) according to their prediction directions. Here, the DC prediction mode uses an average value of eight adjacent pixels.
Further, four intra prediction modes are used for an intra prediction processing for a 16×16 luma block, wherein the four intra prediction modes are the vertical prediction mode (prediction mode 0), the horizontal prediction mode (prediction mode 1), the DC prediction mode (prediction mode 2), and the plane prediction mode (prediction mode 3). In addition, the same four intra prediction modes are used for an intra prediction processing for an 8×8 chroma block.
FIG. 3 illustrates a labeling example for the nine intra prediction modes shown in FIG. 2. In this event, a predicted block (an area including a to p) for the current block is generated using samples (A to M) decoded in advance. When E, F, G, and H cannot be decoded in advance, E, F, G, and H can be virtually generated by copying D in their positions.
FIG. 4 is a diagram for illustrating the nine prediction modes shown in FIG. 2 by using FIG. 3. Referring to FIG. 4, a predicted block in a case of the prediction mode 0 predicts pixel values in the same vertical line as the same pixel value. That is, in pixels of the predicted block, pixel values are predicted from pixels, which are most adjacent to a reference block located in an upper side of the predicted block. Reconstructed pixel values of an adjacent pixel A are set to predicted pixel values of a pixel a, a pixel e, a pixel i, and a pixel m in a first column of the predicted block. Further, in the same way, pixel values of a pixel b, a pixel f, a pixel j, and a pixel n in a second column are predicted from reconstructed pixel values of an adjacent pixel B, pixel values of a pixel c, a pixel g, a pixel k, and a pixel o in a third column are predicted from reconstructed pixel values of an adjacent pixel C, and pixel values of a pixel d, a pixel h, a pixel l, and a pixel p in a fourth column are predicted from reconstructed pixel values of an adjacent pixel D. As a result, a predicted block in which predicted pixel values of each column correspond to pixel values of the pixel A, pixel B, pixel C, and pixel D is generated as shown in FIG. 5A.
Further, a predicted block in a case of the prediction mode 1 predicts pixel values in the same horizontal line as the same pixel value. That is, in pixels of the predicted block, pixel values are predicted from pixels, which are most adjacent to a reference block located in a left side of the predicted block. Reconstructed pixel values of an adjacent pixel I are set to predicted pixel values of a pixel a, a pixel b, a pixel c, and a pixel d in a first row of the predicted block. Further, in the same way, pixel values of a pixel e, a pixel f, a pixel g, and a pixel h in a second row are predicted from reconstructed pixel values of an adjacent pixel J, pixel values of a pixel i, a pixel j, a pixel k, and a pixel l in a third row are predicted from reconstructed pixel values of an adjacent pixel K, and pixel values of a pixel m, a pixel n, a pixel o, and a pixel p in a fourth row are predicted from reconstructed pixel values of an adjacent pixel L. As a result, a predicted block in which predicted pixel values of each row correspond to pixel values of the pixel I, pixel J, pixel K, and pixel L is generated as shown in FIG. 5B.
Furthermore, pixels of a predicted block in a case of the prediction mode 2 are equally replaced with an average of pixel values of upper pixels A, B, C, and D, and left pixels I, J, K, and L.
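Prediction modes 0, 1, and 2 for a 4×4 block can be sketched as below. The function names and argument layout are illustrative, and the rounding used for the DC average (add 4, then integer-divide by 8) is an assumption of this sketch, since the text only states that an average of the eight neighbours is used.

```python
def predict_4x4(mode, above, left):
    """Predicted 4x4 block for modes 0, 1 and 2.

    above = [A, B, C, D] and left = [I, J, K, L] are the reconstructed
    neighbouring pixels (labels as in FIG. 3).
    """
    if mode == 0:    # vertical: every row repeats A, B, C, D
        return [list(above) for _ in range(4)]
    if mode == 1:    # horizontal: row r is filled with the r-th left pixel
        return [[left[r]] * 4 for r in range(4)]
    if mode == 2:    # DC: average of the eight neighbours (assumed rounding)
        dc = (sum(above) + sum(left) + 4) // 8
        return [[dc] * 4 for _ in range(4)]
    raise ValueError("only modes 0, 1 and 2 are sketched here")


def prediction_error(block, predicted):
    """Element-wise difference between the target block and its prediction."""
    return [[b - p for b, p in zip(br, pr)] for br, pr in zip(block, predicted)]
```

An encoder would evaluate each candidate mode this way and keep the one whose prediction error is smallest; the error block of the chosen mode is what gets transform-encoded.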
Meanwhile, pixels of a predicted block in a case of the prediction mode 3 are interpolated in a lower-left direction at an angle of 45° between a lower-left side and an upper-right side of the predicted block, and pixels of a predicted block in a case of the prediction mode 4 are extrapolated in a lower-right direction at an angle of 45° between a lower-left side and an upper-right side of the predicted block. Further, pixels of a predicted block in a case of the prediction mode 5 are extrapolated in a lower-right direction at an angle of about 26.6° (width/height=½) with respect to a vertical line. In addition, pixels of a predicted block in a case of the prediction mode 6 are extrapolated in a lower-right direction at an angle of about 26.6° with respect to a horizontal line, pixels of a predicted block in a case of the prediction mode 7 are extrapolated in a lower-left direction at an angle of about 26.6° with respect to a vertical line, and pixels of a predicted block in a case of the prediction mode 8 are interpolated in an upper direction at an angle of about 26.6° with respect to a horizontal line.
The pixels of the predicted block can be generated from a weighted average of the pixels A to M of the reference block decoded in advance in the prediction modes 3 to 8. For example, in a case of the prediction mode 4, the pixel d located in the upper-right corner of the predicted block can be estimated as shown in Equation (1). Here, round( ) is a function that rounds to the nearest whole number.
d=round(B/4+C/2+D/4)  Equation 1
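As a worked illustration of Equation (1), the weighted average can be evaluated with integer arithmetic. The helper name `filtered_sample` and the sample values are hypothetical, and the round-half-up behaviour is an assumption of this sketch (Python's built-in `round` rounds halves to the nearest even number, so it is not used here).

```python
def filtered_sample(p0, p1, p2):
    """Weighted average p0/4 + p1/2 + p2/4, rounded half up.

    Adding 2 before the shift by 2 bits implements the rounding
    without floating-point arithmetic.
    """
    return (p0 + 2 * p1 + p2 + 2) >> 2


# Pixel d in prediction mode 4 is estimated from the upper pixels B, C, D.
B, C, D = 100, 104, 110      # hypothetical reconstructed pixel values
d = filtered_sample(B, C, D)  # 100/4 + 104/2 + 110/4 = 104.5, rounded up
```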
Meanwhile, in a 16×16 prediction mode for luma components, there are 4 modes including the prediction mode 0, prediction mode 1, prediction mode 2, and prediction mode 3 as described above.
In a case of the prediction mode 0, pixels of the predicted block are interpolated from upper pixels, and, in a case of the prediction mode 1, the pixels of the predicted block are interpolated from left pixels. Further, in a case of the prediction mode 2, the pixels of the predicted block are calculated as an average of the upper pixels and the left pixels. Lastly, in a case of the prediction mode 3, a linear “plane” function suitable for the upper pixels and the left pixels is used. The prediction mode 3 is more suitable for an area in which the luminance is smoothly changed.
As described above, in the H.264 standard, pixel values of the predicted block are generated in directions corresponding to respective modes based on adjacent pixels of the predicted block to be currently encoded in the respective prediction modes except for the DC mode.
Meanwhile, a prediction error between a predicted value predicted by each prediction mode and a current pixel value is transform-encoded using an integer transform scheme based on a DCT (Discrete Cosine Transform). An integer transform in the 4×4 unit is applied when a 4×4 intra prediction mode and a 16×16 intra prediction mode are used according to a block size, and an integer transform in the 8×8 unit is applied when an 8×8 intra prediction mode is used.
The Video Coding Experts Group of the ITU-T has recently developed the H.264 standard further, so that the predictive encoding performance is further improved. Specifically, in “Improvement of Bidirectional Intra Prediction”, ITU-T SG16/Q.6 Doc. VCEG-AG08, October 2007 by Shiodera Taichiro, Akiyuki Tanizawa, Takeshi Chujoh, and Tomoo Yamakage, the predictive encoding performance is improved by increasing the number of intra prediction modes through further diversifying the directivity of a pixel value used in the intra prediction and by introducing a scheme of adding weights of two intra prediction modes. However, this scheme has the disadvantage that the amount of operations for finding an optimal mode increases up to 4 times as the number of intra prediction modes to be considered increases, and that the amount of additional information for encoding the increased prediction modes also increases.
Unlike conventional research that aims to improve the intra mode encoding by performing a more exact intra prediction, a transform scheme using different KLT (Karhunen-Loeve Transform) based directional bases is proposed in “Improved Intra Coding”, ITU-T SG16/Q.6 Doc. VCEG-AG11, October 2007 by Yan Ye and Marta Karczewicz, based on the idea that spatial redundancy still remains in a prediction error after the intra prediction and that such spatial redundancy has a high correlation with the intra prediction direction. The transform scheme has significantly improved the intra mode encoding performance by performing an adaptive prediction error encoding according to the intra prediction mode, without any additional information, by using KLT transform bases trained on several experimental images. However, the transform scheme has the disadvantage that a generated transform base cannot have the optimal energy concentration efficiency for various video sequences having different characteristics or for local image regions having different characteristics within one sequence.
DISCLOSURE
Technical Problem
An aspect of the present disclosure to solve the above-mentioned problem provides a higher energy concentration effect by efficiently removing the spatial redundancy remaining in a prediction error, and provides a video encoding apparatus and method, which can improve the compression efficiency of an intra prediction encoding by adaptively generating a transform base according to a local characteristic change of the prediction error as well as an intra prediction mode and using the generated transform base in a transform encoding of the prediction error in order to more efficiently transform-encode the prediction error after an intra prediction, and a transform encoding apparatus and method, a transform base generating apparatus and method, and a video decoding apparatus and method used in the same.
SUMMARY
An aspect of the present disclosure provides a video encoding apparatus, including: an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; a transform base generator for generating transform bases for respective intra prediction modes based on the prediction errors collected by the intra prediction error collector; an intra predictor for predicting a pixel value of a current pixel by using neighboring pixels of a target block within a current frame according to a directional intra prediction mode and generating a prediction error through a difference between a predicted pixel value and the current pixel; and a transform encoder for transform-encoding the prediction error generated by the intra predictor by using the transform bases generated by the transform base generator.
Preferably, the transform base generator includes a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector. In this event, the transform base generator generates a Karhunen-Loeve Transform or KLT-based transform base based on the autocorrelation matrix calculated by the correlation matrix calculator.
The transform base generator may include a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector; and an eigenvector calculator for calculating an eigenvector from the autocorrelation matrix calculated by the correlation matrix calculator. In this event, the transform encoder preferably transform-encodes the prediction error generated by the intra predictor by using a calculated eigenvector.
Another aspect of the present disclosure provides a transform encoding apparatus for transforming and encoding a prediction error generated by a difference between a predicted pixel predicted by an intra prediction apparatus and a current pixel, the transform encoding apparatus including: an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; and a transform base generator for generating transform bases for respective intra prediction modes based on the prediction errors collected by the intra prediction error collector. In this event, the transform encoding apparatus preferably transform-encodes the prediction error generated by the intra prediction apparatus by using the transform bases generated by the transform base generator.
Preferably, the intra prediction error collector may collect the prediction errors into a set as defined in an equation of $P^m = \{P_k^m \mid 1 \le k \le N_m\}$, where $m$ denotes an index indicating a 4×4 intra prediction mode number, the index having values from 0 to 8, $N_m$ denotes a number of blocks in which an intra prediction mode is determined as an intra prediction mode $m$ among macroblocks in a regular unit which are encoded prior to a current macroblock, $P^m$ denotes a 4×4 prediction error block set of the blocks in which the intra prediction mode is determined as the intra prediction mode $m$ among the macroblocks in the regular unit which are encoded prior to the current macroblock, and $P_k^m$ denotes one 4×4 prediction error block which is a kth element of $P^m$.
The transform encoding apparatus may further include a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector based on an equation of
$$R_c^m = E\left[P_k^m (P_k^m)^T\right] = \frac{1}{N_m}\sum_{k=1}^{N_m} P_k^m (P_k^m)^T,$$
where $R_c^m$ denotes a 4×4 autocorrelation matrix for a column vector signal of a 4×4 intra prediction error in which an intra prediction mode is determined as an intra prediction mode $m$, $m$ denotes an index indicating a 4×4 intra prediction mode number, the index having values from 0 to 8, $N_m$ denotes a number of blocks in which an intra prediction mode is determined as the intra prediction mode $m$ among macroblocks in a regular unit which are encoded prior to a current macroblock, and $P_k^m$ denotes one 4×4 prediction error block which is a kth element of $P^m$ denoting a 4×4 prediction error block set of the blocks in which the intra prediction mode is determined as the intra prediction mode $m$ among the macroblocks in the regular unit which are encoded prior to the current macroblock, wherein the transform base generator generates transform bases by using a calculated autocorrelation matrix.
The transform encoding apparatus may further include a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector based on an equation of
$$R_r^m = E\left[(P_k^m)^T P_k^m\right] = \frac{1}{N_m}\sum_{k=1}^{N_m} (P_k^m)^T P_k^m,$$
where $R_r^m$ denotes a 4×4 autocorrelation matrix for a row vector signal of a 4×4 intra prediction error in which an intra prediction mode is determined as an intra prediction mode $m$, $m$ denotes an index indicating a 4×4 intra prediction mode number, the index having values from 0 to 8, $N_m$ denotes a number of blocks in which an intra prediction mode is determined as the intra prediction mode $m$ among macroblocks in a regular unit which are encoded prior to a current macroblock, and $P_k^m$ denotes one 4×4 prediction error block which is a kth element of $P^m$ denoting a 4×4 prediction error block set of the blocks in which the intra prediction mode is determined as the intra prediction mode $m$ among the macroblocks in the regular unit which are encoded prior to the current macroblock, wherein the transform base generator generates transform bases by using a calculated autocorrelation matrix.
Yet another aspect of the present disclosure provides a transform base generating apparatus for generating transform bases for intra prediction modes, the transform base generating apparatus including: an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector; and an eigenvector calculator for calculating an eigenvector from the autocorrelation matrix calculated by the correlation matrix calculator, the transform base generating apparatus generating the transform base for each intra prediction mode based on the eigenvector calculated by the eigenvector calculator.
Here, the transform base generating apparatus preferably generates a KLT-based transform base based on the autocorrelation matrix and the eigenvector.
Yet another aspect of the present disclosure provides an intra prediction apparatus, including: an intra predictor for predicting a pixel value of a current pixel by using neighboring pixels of a target block within a current frame according to a directional intra prediction mode and generating a prediction error through a difference between a predicted pixel value and the current pixel; and an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock. In this event, the intra prediction apparatus preferably outputs the prediction errors for the macroblocks in the regular unit, which are encoded prior to the current macroblock, the prediction error being collected by the intra prediction error collector, together with the prediction error generated by the intra predictor for the current frame.
Yet another aspect of the present disclosure provides a video decoding apparatus, including: an intra prediction error collector for collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; a transform base generator for generating transform bases for respective intra prediction modes based on the prediction errors collected by the intra prediction error collector; an intra prediction mode reader for reading an intra prediction mode of a target block to be decoded for an input bitstream; an inverse transformer for inversely transforming a prediction error for the target block by using a transform base corresponding to the intra prediction mode read by the intra prediction mode reader among the transform bases generated by the transform base generator; and a current block reconstructer for predicting a pixel value of a current pixel by using neighboring pixels of the target block within a current frame according to the intra prediction mode read by the intra prediction mode reader and reconstructing a current block by adding a predicted pixel value and a value of the prediction error inversely transformed by the inverse transformer.
Here, the transform base generator preferably includes a correlation matrix calculator for calculating an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector; and an eigenvector calculator for calculating an eigenvector from the autocorrelation matrix calculated by the correlation matrix calculator, and generates a KLT-based transform base based on the autocorrelation matrix and the eigenvector.
Yet another aspect of the present disclosure provides a video encoding method, including: collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock, and predicting a value of a current pixel by using neighboring pixels of a target block according to a directional intra prediction mode for a current frame and generating a prediction error through a difference between a predicted value and the value of the current pixel; generating transform bases for respective intra prediction modes based on the prediction errors collected in collecting of the prediction errors; and transform-encoding the prediction error generated for the current frame by using the transform bases generated in generating of the transform bases.
Another aspect of the present disclosure provides a transform encoding method of transforming and encoding a prediction error generated by a difference between a pixel predicted by an intra prediction apparatus and a current pixel, the transform encoding method including: collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; generating transform bases for respective intra prediction modes based on the prediction errors collected in collecting of the prediction errors. In this event, the transform encoding method preferably transform-encodes the prediction error generated by the intra prediction apparatus by using the transform bases generated in generating of the transform bases.
The transform encoding method preferably further includes calculating an autocorrelation matrix for a set of the prediction errors collected in collecting of the prediction errors. In this event, the process of generating the transform bases generates the transform bases by using a calculated autocorrelation matrix.
Yet another aspect of the present disclosure provides a video decoding method, including: collecting prediction errors of blocks having an equal intra prediction mode from macroblocks in a regular unit, which are encoded prior to a current macroblock; generating transform bases for respective intra prediction modes based on the prediction errors collected in collecting of the prediction errors; reading an intra prediction mode of a target block to be decoded for an input bitstream; inversely transforming a prediction error for the target block by using a transform base corresponding to the intra prediction mode read in reading of the intra prediction mode among the transform bases generated in generating of the transform bases; and predicting a pixel value of a current pixel by using neighboring pixels of the target block within a current frame according to the intra prediction mode read in reading of the intra prediction mode and reconstructing a current block by adding a predicted pixel value and a value of the prediction error inversely transformed in inversely transforming of the prediction error.
The process of generating the transform bases preferably includes calculating an autocorrelation matrix for a set of the prediction errors collected in collecting of the prediction errors; and calculating an eigenvector from the autocorrelation matrix calculated in calculating of the correlation matrix. In this event, a KLT-based transform base is preferably generated based on the autocorrelation matrix and the eigenvector.
Advantageous Effects
According to the present disclosure as described above, the intra predictive encoding performance can be significantly improved without adding any additional information, by adaptively generating a transform base according to an image characteristic change as well as an intra prediction mode for each specific encoding unit and transform-encoding an intra prediction error with it, and thus the compression efficiency of a compression apparatus or the picture quality of a reconstructed image can be greatly improved.
DESCRIPTION OF DRAWINGS
FIG. 1 is a diagram for illustrating a general inter prediction;
FIG. 2 is a diagram for illustrating the directivity of intra prediction modes;
FIG. 3 is a diagram illustrating a labeling example for the intra prediction modes of FIG. 2;
FIG. 4 is a diagram for illustrating respective intra prediction modes of FIG. 2;
FIG. 5A is a diagram for illustrating a prediction mode 0 among the intra prediction modes of FIG. 2, and FIG. 5B is a diagram for illustrating a prediction mode 1 among the intra prediction modes of FIG. 2;
FIG. 6 is a diagram schematically illustrating a video encoding apparatus according to an aspect of the present disclosure;
FIG. 7 is a flowchart illustrating a video encoding method by the video encoding apparatus of FIG. 6;
FIG. 8 is a flowchart illustrating a transform encoding method according to another aspect of the present disclosure;
FIG. 9 is a diagram illustrating an example of a structure of a bitstream generated by the video encoding apparatus of FIG. 6;
FIG. 10 is a diagram schematically illustrating a video decoding apparatus according to an aspect of the present disclosure; and
FIG. 11 is a flowchart illustrating the video decoding method by the video decoding apparatus of FIG. 10.
DETAILED DESCRIPTION
Hereinafter, aspects of the present disclosure will be described in detail with reference to the accompanying drawings. In the following description, the same elements will be designated by the same reference numerals although they are shown in different drawings. Further, in the following description of the present disclosure, a detailed description of known functions and configurations incorporated herein will be omitted when it may make the subject matter of the present disclosure rather unclear.
Additionally, in describing the components of the present disclosure, there may be terms used like first, second, A, B, (a), and (b). These are solely for the purpose of differentiating one component from the other but not to imply or suggest the substances, order or sequence of the components. If a component were described as ‘connected’, ‘coupled’, or ‘linked’ to another component, they may mean the components are not only directly ‘connected’, ‘coupled’, or ‘linked’ but also are indirectly ‘connected’, ‘coupled’, or ‘linked’ via a third component.
FIG. 6 is a diagram schematically illustrating a video encoding apparatus according to an aspect of the present disclosure. Referring to FIG. 6, the video encoding apparatus 600 includes an intra prediction error collector 610, a transform base generator 620, an intra predictor 630, and a transform encoder 640. Here, the intra prediction error collector 610, the transform base generator 620, and the transform encoder 640 may be referred to as a transform encoding apparatus.
The intra prediction error collector 610 collects prediction errors of blocks having the same intra prediction mode from macroblocks in the regular unit, which are encoded prior to a current macroblock. That is, in order to generate transform bases for various intra prediction modes, the intra prediction error collector 610 receives macroblocks in the regular unit, which are encoded prior to the current macroblock, and collects prediction errors of blocks having the same intra prediction mode from blocks in which intra prediction modes have been determined. In this event, since 9 types of intra prediction modes are defined in the 4×4 intra prediction mode and the 8×8 intra prediction mode, 4×4 intra prediction errors and 8×8 intra mode prediction errors can be collected into 9 types, respectively. Further, since 4 types of intra prediction modes are defined in the 16×16 intra prediction mode, 16×16 intra prediction errors can be collected into 4 types. For example, intra prediction errors for the 4×4 intra prediction mode can be collected into a set as defined in Equation (2).
$$P^m = \{P_k^m \mid 1 \le k \le N_m\}$$  Equation 2
In Equation (2), $m$ denotes an index indicating a number of the 4×4 intra prediction mode, wherein the index has values from 0 to 8. $N_m$ denotes the number of blocks in which an intra prediction mode is determined as an intra prediction mode $m$ among macroblocks in the regular unit which are encoded prior to the current macroblock. Further, $P^m$ denotes a 4×4 prediction error block set of the blocks in which the intra prediction mode is determined as the intra prediction mode $m$ among the macroblocks in the regular unit which are encoded prior to the current macroblock, and $P_k^m$ denotes one 4×4 prediction error block which is a kth element of $P^m$.
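The grouping of Equation (2) can be sketched as a dictionary keyed by the mode index. This is only an illustration under stated assumptions: the function name `collect_prediction_errors` and the `(mode, error_block)` input format are hypothetical and are not taken from the disclosure.

```python
from collections import defaultdict


def collect_prediction_errors(coded_blocks):
    """Group 4x4 prediction error blocks by intra prediction mode.

    coded_blocks -- iterable of (mode, error_block) pairs taken from the
                    macroblocks encoded prior to the current macroblock
    Returns a dict mapping each mode m to the list of its error blocks,
    so that len(result[m]) plays the role of the block count for mode m.
    """
    sets = defaultdict(list)
    for mode, error_block in coded_blocks:
        if not 0 <= mode <= 8:
            raise ValueError("4x4 intra prediction modes run from 0 to 8")
        sets[mode].append(error_block)
    return dict(sets)
```

Modes that never occurred in the previously encoded macroblocks simply have no entry; how such empty sets are handled is not specified in this passage.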
The transform base generator 620 generates transform bases of respective intra prediction modes based on prediction errors collected by the intra prediction error collector 610 according to an intra prediction block size and an intra prediction mode. Here, it is preferable that the transform base is generated based on a Karhunen-Loeve Transform or KLT, which is theoretically known as a transform having the best energy concentration efficiency. The transform base generator 620 can be implemented as an independent element, or may include a correlation matrix calculator 622 and an eigenvector calculator 624. Further, the transform base generating apparatus can be implemented, including the intra prediction error collector 610, the correlation matrix calculator 622, and the eigenvector calculator 624.
The correlation matrix calculator 622 calculates an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector 610. In the case of the 4×4 intra prediction mode, two transform bases, one for a column vector signal and one for a row vector signal, should be generated because the intra prediction error block $P_k^m$ in Equation (2) is a two-dimensional signal. Further, an autocorrelation matrix of the intra prediction error should be obtained in order to generate the KLT base, and the autocorrelation matrices can be obtained as defined in Equations (3) and (4).
$$R_c^m = E\left[P_k^m (P_k^m)^T\right] = \frac{1}{N_m}\sum_{k=1}^{N_m} P_k^m (P_k^m)^T$$  Equation 3
$$R_r^m = E\left[(P_k^m)^T P_k^m\right] = \frac{1}{N_m}\sum_{k=1}^{N_m} (P_k^m)^T P_k^m$$  Equation 4
In Equation (3), $R_c^m$ denotes a 4×4 autocorrelation matrix for a column vector signal of a 4×4 intra prediction error in which an intra prediction mode is determined as an intra prediction mode $m$, and $m$ denotes an index indicating a number of the 4×4 intra prediction mode, wherein the index has values from 0 to 8. Further, $N_m$ denotes the number of blocks in which an intra prediction mode is determined as the intra prediction mode $m$ among macroblocks in the regular unit which are encoded prior to the current macroblock, and $P_k^m$ denotes one 4×4 prediction error block which is a kth element of $P^m$ denoting a 4×4 prediction error block set of the blocks in which the intra prediction mode is determined as the intra prediction mode $m$ among the macroblocks in the regular unit which are encoded prior to the current macroblock. Further, in Equation (4), $R_r^m$ denotes a 4×4 autocorrelation matrix for a row vector signal of the 4×4 intra prediction error in which the intra prediction mode is determined as the intra prediction mode $m$.
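The sample averages in Equations (3) and (4) can be computed directly with NumPy. The following is a minimal sketch that assumes the error blocks of one mode are available as a non-empty list of 4×4 arrays; the function name `autocorrelation_matrices` is hypothetical.

```python
import numpy as np


def autocorrelation_matrices(error_blocks):
    """Return (R_c, R_r) for a non-empty list of 4x4 prediction error blocks.

    R_c averages P_k @ P_k.T (column vector signal, Equation (3)).
    R_r averages P_k.T @ P_k (row vector signal, Equation (4)).
    """
    P = np.asarray(error_blocks, dtype=np.float64)   # shape (N, 4, 4)
    Pt = P.transpose(0, 2, 1)                        # per-block transpose
    R_c = (P @ Pt).mean(axis=0)
    R_r = (Pt @ P).mean(axis=0)
    return R_c, R_r
```

Both results are symmetric 4×4 matrices, which is what makes the subsequent eigenvector computation well behaved.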
The KLT base for the 4×4 intra prediction error block can be obtained through an eigenvector of the autocorrelation matrix, and the eigenvector calculator 624 can calculate eigenvectors as defined in Equations (5) and (6) from the autocorrelation matrices defined in Equations (3) and (4), which are calculated by the correlation matrix calculator 622.
$$R_c^m \phi_n^{m,c} = \lambda_n^{m,c} \phi_n^{m,c}, \quad 0 \le n \le 3$$  Equation 5
$$R_r^m \phi_n^{m,r} = \lambda_n^{m,r} \phi_n^{m,r}, \quad 0 \le n \le 3$$  Equation 6
In Equation (5), $\phi_n^{m,c}$ denotes an eigenvector of $R_c^m$, and $\lambda_n^{m,c}$ denotes an eigenvalue of $R_c^m$. Further, in Equation (6), $\phi_n^{m,r}$ denotes an eigenvector of $R_r^m$, and $\lambda_n^{m,r}$ denotes an eigenvalue of $R_r^m$. Equations (7) and (8) can be generated by obtaining eigenvectors satisfying Equations (5) and (6) and expressing the eigenvectors as matrices.
$$\Phi_c^m = \left[\phi_0^{m,c}\ \phi_1^{m,c}\ \phi_2^{m,c}\ \phi_3^{m,c}\right]$$  Equation 7
$$\Phi_r^m = \left[\phi_0^{m,r}\ \phi_1^{m,r}\ \phi_2^{m,r}\ \phi_3^{m,r}\right]$$  Equation 8
In Equation (7), $\Phi_c^m$ denotes a KLT base for a column vector signal of a prediction error block corresponding to the intra prediction mode $m$, and, in Equation (8), $\Phi_r^m$ denotes a KLT base for a row vector signal of the prediction error block corresponding to the intra prediction mode $m$.
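The eigenvector step of Equations (5) to (8) can be sketched with `numpy.linalg.eigh`, which is intended for symmetric matrices such as an autocorrelation matrix. The function name `klt_base` is hypothetical, and sorting the eigenvectors by decreasing eigenvalue is an assumption of this sketch, since the passage does not state an ordering.

```python
import numpy as np


def klt_base(R):
    """Return (eigenvalues, Phi) for a symmetric autocorrelation matrix R.

    The columns of Phi are orthonormal eigenvectors of R. They are
    reordered by decreasing eigenvalue (an assumption of this sketch),
    so that the first column captures the largest share of the energy.
    """
    eigenvalues, eigenvectors = np.linalg.eigh(R)   # ascending eigenvalues
    order = np.argsort(eigenvalues)[::-1]
    return eigenvalues[order], eigenvectors[:, order]
```

Applying `klt_base` once to the column autocorrelation matrix and once to the row autocorrelation matrix yields the two bases for one intra prediction mode.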
Meanwhile, the intra predictor 630 predicts a pixel value for a predicted block by using neighboring pixels of a target block within a current frame according to a directional intra prediction mode. Further, the intra predictor 630 generates a prediction error through a difference between the pixel value for the target block and the pixel value for the predicted block. That is, the intra predictor 630 includes a differentiator (not shown) for calculating a differential between the target block and the predicted block.
The transform encoder 640 transform-encodes a prediction error generated by the intra predictor 630 by using transform bases generated by the transform base generator 620. The aforementioned transform of the two-dimensional signal using the KLT base is performed as defined in Equation (9).
$$V^m = (\Phi_c^m)^T \, U^m \, \Phi_r^m$$  Equation 9
In Equation (9), $U^m$ denotes a prediction error signal of the intra prediction mode $m$, and $V^m$ denotes a signal transformed by a KLT of $U^m$.
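The separable transform of Equation (9) amounts to two matrix products. The sketch below uses hypothetical function names; the inverse shown is not stated in this passage but follows from the bases being orthonormal, which holds when they are built from eigenvectors of symmetric matrices.

```python
import numpy as np


def klt_forward(U, Phi_c, Phi_r):
    """Equation (9): transform the 4x4 prediction error block U."""
    return Phi_c.T @ U @ Phi_r


def klt_inverse(V, Phi_c, Phi_r):
    """Recover U from V, assuming Phi_c and Phi_r are orthonormal."""
    return Phi_c @ V @ Phi_r.T
```

Because the bases are orthonormal, the transform preserves the total energy of the block and only redistributes it among the coefficients.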
Although the 4×4 intra prediction mode has been described as an example, a KLT base for an intra prediction error of the 8×8 intra prediction mode is generated in the same way as for the 4×4 intra prediction mode. Further, a KLT base for an intra prediction error of the 16×16 intra prediction mode is also generated in the same way as for the 4×4 intra prediction mode, and the only difference is that the number of intra prediction error sets and the number of KLT bases are 4, smaller than those of the 4×4 intra prediction mode.
The KLT base generated by the transform base generator 620 is not a transform base optimized for the prediction error generated by the intra predictor 630. However, because a general video signal exhibits a high correlation between the current frame and a previous frame, the KLT base has no significant difference in performance from a transform base optimized for the current frame. Moreover, because the transform base is generated from macroblocks in the regular unit, which are encoded prior to the current macroblock, no additional information on the transform base needs to be transmitted for decoding.
Meanwhile, although it has been described that the intra predictor 630 is constructed independently of the intra prediction error collector 610 in FIG. 6, the intra predictor 630 may include the intra prediction error collector 610. That is, the intra predictor 630 predicts the pixel value of the predicted block by using neighboring pixels of the target block within the current frame according to the directional intra prediction mode, and generates the prediction error through the difference between the pixel value of the predicted block and the pixel value of the target block. At the same time, the intra predictor 630 collects prediction errors of blocks having the same intra prediction mode from blocks in which intra prediction modes have been determined among macroblocks in the regular unit, which are encoded prior to the current macroblock, so that the intra predictor 630 can output the prediction error of the current frame together with the collected prediction errors of the blocks.
When an intra prediction encoding is performed according to the aforementioned method, not only can the performance be improved by applying different transform bases depending on intra prediction modes, but a higher intra prediction encoding efficiency can also be achieved by providing an adaptive transform base, which can immediately respond to a characteristic change of an image, in every specific encoding unit.
FIG. 7 is a flowchart illustrating a video encoding method by the video encoding apparatus of FIG. 6.
Referring to FIG. 7, the intra prediction error collector 610 collects prediction errors of blocks having the same intra prediction mode from macroblocks in the regular unit, which are encoded prior to a current macroblock in step S701. The collected prediction errors for the macroblocks in the regular unit, which are encoded prior to the current macroblock, can be expressed in a form of a set as shown in Equation (2).
The transform base generator 620 calculates an autocorrelation matrix for each intra prediction mode set based on the prediction errors collected by the intra prediction error collector 610 in step S703. In this event, since the 4×4 intra prediction error block $P_k^m$ is a two-dimensional signal, two types of transform bases, one for a column vector signal and one for a row vector signal, should be generated. To generate the KLT base, the autocorrelation matrices are calculated as defined in Equations (3) and (4). The KLT base is then calculated through the eigenvectors of the autocorrelation matrices as defined in Equations (5) and (6) in step S705. The calculated eigenvectors can be expressed as the matrices shown in Equations (7) and (8).
Meanwhile, the intra predictor 630 predicts a pixel value of a current pixel by using neighboring pixels of a target block within a current frame according to a directional intra prediction mode, and generates a prediction error through a difference between the predicted pixel and the current pixel in step S707.
The transform encoder 640 transform-encodes the prediction error generated by the intra predictor 630 by using the transform bases generated by the transform base generator 620 as shown in Equation (9).
FIG. 8 is a flowchart illustrating a transform encoding method according to another aspect of the present disclosure. When the intra predictor 630 is constructed independently of the intra prediction error collector 610, the transform base generator 620, and the transform encoder 640, the intra prediction error collector 610, the transform base generator 620, and the transform encoder 640 may be referred to as elements of the transform encoding apparatus. In this event, steps S801 to S805 of the transform encoding apparatus are performed in the same manner as steps S701 to S705 of FIG. 7, and the prediction error independently generated by the intra predictor 630 is transform-encoded by using the generated transform base in step S807.
FIG. 9 is a diagram illustrating an example of a structure of a bitstream generated by the video encoding apparatus of FIG. 6. In the H.264 standard, the bitstream is encoded in the unit of slices. The bitstream includes a slice header 910 and slice data 920, and the slice data 920 includes a plurality of macroblock data (MB) 921 to 924. Further, one macroblock data 923 may include an mb_type field 930, an mb_pred field 935, and a texture data field 939.
Here, the mb_type field 930 records a value indicating a macroblock type. That is, the value recorded in the mb_type field 930 indicates whether a current macroblock is an intra macroblock or an inter macroblock.
Further, the mb_pred field 935 records a detailed prediction mode according to the macroblock type. In the case of the intra macroblock, information on a prediction mode selected in the intra prediction is recorded. In the case of the inter macroblock, information on a motion vector and a reference frame number for each macroblock partition is recorded.
When the mb_type field 930 indicates the intra macroblock, the mb_pred field 935 is divided into a plurality of block information 941 to 944, and each information piece 942 is divided into a main_mode field 945 for recording a value of a main mode and a sub_mode field 946 for recording a value of a sub mode.
Lastly, the texture data field 939 records an encoded residual image, that is, texture data.
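The field hierarchy of FIG. 9 can be pictured as nested records. The sketch below is only an illustration of that nesting, not a bitstream parser. The class names, the string values used for `mb_type`, and the use of `bytes` for the texture data are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class BlockInfo:
    main_mode: int          # main_mode field 945
    sub_mode: int           # sub_mode field 946

@dataclass
class Macroblock:
    mb_type: str            # mb_type field 930: "intra" or "inter" (assumed encoding)
    mb_pred: List[BlockInfo] = field(default_factory=list)  # mb_pred field 935, intra case
    texture: bytes = b""    # texture data field 939 (encoded residual)

@dataclass
class Slice:
    header: dict            # slice header 910 (contents not modeled here)
    macroblocks: List[Macroblock] = field(default_factory=list)  # slice data 920

# One slice holding a single intra macroblock with two block information entries.
mb = Macroblock(
    mb_type="intra",
    mb_pred=[BlockInfo(main_mode=0, sub_mode=1), BlockInfo(main_mode=2, sub_mode=0)],
    texture=b"\x01\x02",
)
slice_ = Slice(header={}, macroblocks=[mb])
intra_modes = [(b.main_mode, b.sub_mode)
               for m in slice_.macroblocks if m.mb_type == "intra"
               for b in m.mb_pred]
print(intra_modes)
```

A decoder-side prediction mode reader would walk the same hierarchy to obtain the mode of each target block.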
FIG. 10 is a diagram schematically illustrating a video decoding apparatus according to an aspect of the present disclosure, and FIG. 11 is a flowchart illustrating the video decoding method by the video decoding apparatus of FIG. 10. A detailed construction and operation of the video decoding apparatus 1000 will be described with reference to FIGS. 10 and 11.
Referring to FIGS. 10 and 11, the video decoding apparatus 1000 includes an intra prediction error collector 1010, a transform base generator 1020, a prediction mode reader 1030, an inverse transformer 1040, and a current block reconstructer 1050. Here, the transform base generator 1020 may include a correlation matrix calculator 1022 and an eigenvector calculator 1024.
The intra prediction error collector 1010 collects prediction errors of blocks having the same intra prediction mode from macroblocks in the regular unit, which are encoded prior to a current macroblock in step S1101. That is, in order to generate transform bases for various intra prediction modes, the intra prediction error collector 1010 receives the macroblocks in the regular unit, which are encoded prior to the current macroblock, and collects the prediction errors of the blocks having the same intra prediction mode from blocks in which intra prediction modes have been selected, like the intra prediction error collector of FIG. 6. In this event, since 9 types of intra prediction modes are defined in the 4×4 intra prediction mode and the 8×8 intra prediction mode, 4×4 intra prediction errors and 8×8 intra prediction errors can be collected into 9 types, respectively. Further, since 4 types of intra prediction modes are defined in the 16×16 intra prediction mode, 16×16 intra prediction errors can be collected into 4 types. For example, intra prediction errors for the 4×4 intra prediction mode can be collected into a set as defined in Equation (2).
The transform base generator 1020 generates transform bases for respective intra prediction modes based on the prediction errors collected by the intra prediction error collector 1010. Here, as in the case of the video encoding apparatus 600, it is preferable that the transform base is generated based on the KLT, which is theoretically known as the transform having the best energy concentration efficiency. The transform base generator 1020 can be implemented as an independent element, or implemented as the transform base generating apparatus including the intra prediction error collector 1010, the correlation matrix calculator 1022, and the eigenvector calculator 1024.
The correlation matrix calculator 1022 calculates an autocorrelation matrix for a set of the prediction errors collected by the intra prediction error collector 1010 in step S1103. In the case of the 4×4 intra prediction mode, two transform bases, one for a column vector signal and one for a row vector signal, should be generated because the intra prediction error block $P_k^m$ in Equation (2) is a two-dimensional signal. Further, an autocorrelation matrix of the intra prediction error should be obtained in order to generate the KLT base, and the autocorrelation matrix can be obtained as defined in Equations (3) and (4).
Further, the KLT base for the 4×4 intra prediction error block can be obtained through an eigenvector of the autocorrelation matrix, and the eigenvector calculator 1024 can calculate eigenvectors as defined in Equations (5) and (6) from the autocorrelation matrices defined in Equations (3) and (4), which are calculated by the correlation matrix calculator 1022 in step S1105. In this event, the transform base generator 1020 can generate a KLT based transform base by obtaining eigenvectors satisfying Equations (5) and (6) and expressing the eigenvectors as matrices as defined in Equations (7) and (8).
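The disclosure does not state how the eigenvectors are computed. As one possible illustration, the sketch below uses the cyclic Jacobi rotation method, which applies to the symmetric autocorrelation matrices involved here, and places the resulting unit eigenvectors in the columns of a matrix. The function name `klt_base`, the stopping rule, and the ordering by decreasing eigenvalue are assumptions of this example.

```python
import math

def klt_base(r, max_sweeps=50, tol=1e-24):
    """Eigen-decompose a symmetric matrix with cyclic Jacobi rotations.

    Returns (eigenvalues, phi). The columns of phi are unit eigenvectors,
    ordered by decreasing eigenvalue (ordering is an assumption).
    """
    n = len(r)
    a = [[float(x) for x in row] for row in r]
    v = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    total = sum(x * x for row in a for x in row)
    for _ in range(max_sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off <= tol * total:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if a[p][q] == 0.0:
                    continue
                # Rotation angle in [-pi/4, pi/4] that zeroes a[p][q].
                diff = a[q][q] - a[p][p]
                sign = -1.0 if diff < 0 else 1.0
                theta = 0.5 * math.atan2(sign * 2.0 * a[p][q], sign * diff)
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):      # a <- a J and v <- v J
                    akp, akq = a[k][p], a[k][q]
                    a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
                    vkp, vkq = v[k][p], v[k][q]
                    v[k][p], v[k][q] = c * vkp - s * vkq, s * vkp + c * vkq
                for k in range(n):      # a <- J^T a
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
    order = sorted(range(n), key=lambda i: a[i][i], reverse=True)
    eigenvalues = [a[i][i] for i in order]
    phi = [[v[row][i] for i in order] for row in range(n)]
    return eigenvalues, phi

# Toy symmetric matrix standing in for an autocorrelation matrix.
r = [[4.0, 1.0, 0.0], [1.0, 3.0, 1.0], [0.0, 1.0, 2.0]]
eigenvalues, phi = klt_base(r)
print(eigenvalues)
```

Applying the function once to the column autocorrelation matrix and once to the row autocorrelation matrix would give the two bases for one intra prediction mode. A production implementation would more likely call a numerical library routine for symmetric eigenproblems.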
The prediction mode reader 1030 reads an intra prediction mode of a target block to be decoded from the bitstream structure shown in FIG. 9 in step S1109. That is, the prediction mode reader 1030 receives a bitstream generated in the video encoding apparatus 600 and reads the intra prediction mode of the target block to be decoded of the current frame.
The inverse transformer 1040 inversely transforms a prediction error received through the bitstream by using a transform base corresponding to the intra prediction mode read by the intra prediction mode reader 1030 among transform bases generated by the transform base generator 1020 in step S1111.
The prediction error of the target block received through the bitstream generated by the video encoding apparatus 600 has been transform-encoded using different transform bases depending on the intra prediction mode of the target block. This differs from the H.264 standard, which applies an integer transform and inverse transform based on a fixed DCT (Discrete Cosine Transform) to the prediction error regardless of the intra prediction mode. Accordingly, the intra prediction mode reader 1030 determines the intra prediction mode of the target block to be decoded of the current frame from an input bitstream, and the inverse transformer 1040 inversely transforms the prediction error by applying a transform base corresponding to the intra prediction mode read by the intra prediction mode reader 1030. In this event, an inverse transform of the two-dimensional signal using the aforementioned KLT-based transform base can be performed as defined in Equation (10).
$$\hat{U}^m = \left((\Phi_c^m)^T\right)^{-1} V^m \left(\Phi_r^m\right)^{-1} \qquad \text{Equation 10}$$
In Equation (10), $V^m$ denotes a signal generated by transform-encoding a prediction error of the intra prediction mode m, and $\hat{U}^m$ denotes a signal generated by inversely transforming $V^m$ by the KLT. In general, an inverse matrix of the KLT base should be used in the inverse transform. However, because the KLT base is an orthogonal matrix formed from eigenvectors, the inverse matrix of the KLT base and the transpose matrix of the KLT base are equal to each other, so the transpose matrix can be used in the inverse transform without acquiring the inverse matrix. Accordingly, the inverse transform of the two-dimensional signal can be performed more simply as defined in Equation (11) by using the transpose matrix instead of the inverse matrix.
$$\hat{U}^m = \Phi_c^m \, V^m \, (\Phi_r^m)^T \qquad \text{Equation 11}$$
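The transpose-based inverse can be checked with a small round trip. The sketch below assumes that the forward transform of Equation (9) has the form $V^m = (\Phi_c^m)^T U^m \Phi_r^m$, which is the form that Equation (10) inverts. The two 2×2 orthogonal matrices are arbitrary stand-ins for KLT bases, and all function names are hypothetical.

```python
def transpose(a):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(row) for row in zip(*a)]

def matmul(a, b):
    """Return the matrix product a @ b for lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def forward_transform(u, phi_c, phi_r):
    """Assumed form of Equation (9): V = (Phi_c)^T U Phi_r."""
    return matmul(matmul(transpose(phi_c), u), phi_r)

def inverse_transform(v, phi_c, phi_r):
    """Equation (11): U_hat = Phi_c V (Phi_r)^T, valid for orthogonal bases."""
    return matmul(matmul(phi_c, v), transpose(phi_r))

# Arbitrary orthogonal matrices (rotations) used in place of real KLT bases.
phi_c = [[0.6, -0.8], [0.8, 0.6]]
phi_r = [[0.8, 0.6], [-0.6, 0.8]]

u = [[5.0, -3.0], [2.0, 7.0]]        # toy prediction error block
v = forward_transform(u, phi_c, phi_r)
u_hat = inverse_transform(v, phi_c, phi_r)
max_error = max(abs(x - y) for ru, rh in zip(u, u_hat) for x, y in zip(ru, rh))
print(max_error < 1e-9)
```

The round trip recovers the block up to floating-point rounding because $\Phi \Phi^T$ is the identity for an orthogonal base, so no matrix inversion is needed at the decoder.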
The current block reconstructer 1050 predicts a pixel value of a current pixel by using neighboring pixels of a target block within a current frame according to the intra prediction mode read by the intra prediction mode reader 1030, and reconstructs a current block by adding the predicted pixel value and the prediction error value inversely transformed by the inverse transformer 1040 in step S1113.
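Step S1113 can be sketched as a prediction followed by an addition. The example below uses vertical prediction, in which each row of the block copies the reconstructed pixels directly above the block, as one directional mode. The rounding and the clipping to the 8-bit range are assumptions of this sketch, and the function names are hypothetical.

```python
def predict_vertical(above_row, size):
    """Vertical prediction: every row of the block copies the pixels above it."""
    return [list(above_row[:size]) for _ in range(size)]

def reconstruct_block(predicted, error):
    """Add the inverse-transformed error to the prediction.

    Results are rounded and clipped to [0, 255] (assumed 8-bit samples).
    """
    return [[max(0, min(255, int(round(p + e)))) for p, e in zip(prow, erow)]
            for prow, erow in zip(predicted, error)]

above = [100, 102, 104, 106]              # reconstructed pixels above the block
error = [[1, -2, 0, 3],
         [0, 0, 0, 0],
         [-1, 1, -1, 1],
         [200, 0, 0, -200]]               # toy inverse-transformed error block
predicted = predict_vertical(above, 4)
reconstructed = reconstruct_block(predicted, error)
for row in reconstructed:
    print(row)
```

The last row shows the effect of the assumed clipping: 100 + 200 saturates at 255 and 106 - 200 saturates at 0.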
Through the video encoding and decoding performed in the above-described way, the video decoding apparatus 1000 can perform the inverse transform and decoding by generating exactly the same adaptive transform base with reference to a previous frame for which decoding has been completed, just as the video encoding apparatus 600 performs the transform encoding by generating different adaptive transform bases depending on the intra prediction mode with reference to the previous frame.
In the description above, although all of the components of the embodiments of the present disclosure may have been explained as assembled or operatively connected as a unit, the present disclosure is not intended to limit itself to such embodiments. Rather, within the objective scope of the present disclosure, the respective components may be selectively and operatively combined in any numbers. Every one of the components may be also implemented by itself in hardware while the respective ones can be combined in part or as a whole selectively and implemented in a computer program having program modules for executing functions of the hardware equivalents. Codes or code segments to constitute such a program may be easily deduced by a person skilled in the art. The computer program may be stored in computer readable media, which in operation can realize the aspects of the present disclosure. As the computer readable media, the candidates include magnetic recording media, optical recording media, and carrier wave media.
In addition, terms like ‘include’, ‘comprise’, and ‘have’ should be interpreted in default as inclusive or open rather than exclusive or closed unless expressly defined to the contrary. All the terms that are technical, scientific or otherwise agree with the meanings as understood by a person skilled in the art unless defined to the contrary. Common terms as found in dictionaries should be interpreted in the context of the related technical writings not too ideally or impractically unless the present disclosure expressly defines them so.
Although exemplary aspects of the present disclosure have been described for illustrative purposes, those skilled in the art will appreciate that various modifications, additions and substitutions are possible, without departing from essential characteristics of the disclosure. Therefore, exemplary aspects of the present disclosure have not been described for limiting purposes. Accordingly, the scope of the disclosure is not to be limited by the above aspects but by the claims and the equivalents thereof.
INDUSTRIAL APPLICABILITY
As described above, the present disclosure is highly useful for application in the fields of an encoder and a decoder using an intra prediction, an image compression apparatus, etc. In order to efficiently transform-encode the prediction error after the intra prediction, it adaptively generates a transform base according to a local characteristic change of the prediction error as well as the intra prediction mode and uses the generated transform base in the transform encoding of the prediction error, thereby improving the compression efficiency of the intra prediction encoding.

Claims (12)

The invention claimed is:
1. A video encoding apparatus, comprising:
an intra prediction error collector configured to collect, for an intra prediction mode of a current block, prediction errors of respective blocks having the same intra prediction mode as the intra prediction mode of the current block, the respective blocks being encoded prior to encoding the current block;
a transform base generator configured to generate a transform base for the intra prediction mode based on the collected prediction errors;
an intra predictor configured to
predict the current block to generate a predicted block by using neighboring pixels of the current block according to the intra prediction mode of the current block, and
generate a prediction error block of the current block according to a difference between the predicted block and the current block; and
a transform encoder configured to transform the prediction error block by using the transform base for the intra prediction mode,
wherein the transform base generator comprises a correlation matrix calculator configured to calculate an autocorrelation matrix for the intra prediction mode by using (i) each first matrix corresponding to each of the collected prediction errors of each of the respective blocks for the intra prediction mode and (ii) each transpose matrix of said each first matrix,
wherein the autocorrelation matrix for the intra prediction mode is generated based on an average of respective multiplications of said each first matrix and said each transpose matrix of said each first matrix, and
wherein the transform base for the intra prediction mode is generated based on the autocorrelation matrix for the intra prediction mode.
2. The video encoding apparatus of claim 1, wherein the transform base generator further comprises:
an eigenvector calculator configured to calculate an eigenvector from the autocorrelation matrix,
wherein the transform base for the intra prediction mode is generated by using the calculated eigenvector.
3. The video encoding apparatus of claim 1, wherein the intra prediction error collector is configured to produce the collected prediction errors based on an equation of

$$P^m = \{P_k^m \mid 1 \le k \le N_m\},$$
wherein m denotes an index indicating the intra prediction mode of the current block,
$N_m$ denotes the number of blocks in which an intra prediction mode is determined as intra prediction mode m, among blocks which are encoded prior to the current block,
$P^m$ denotes the collected prediction errors in which an intra prediction mode is determined as intra prediction mode m, and
$P_k^m$ denotes one prediction error block which is a kth element of $P^m$.
4. The video encoding apparatus of claim 1, wherein the correlation matrix calculator is configured to calculate the autocorrelation matrix for the collected prediction errors for the intra prediction mode, based on an equation of
$$R_c^m = E\left[P_k^m (P_k^m)^T\right] = \frac{1}{N_m} \sum_{k=1}^{N_m} P_k^m (P_k^m)^T,$$
wherein m denotes an index indicating the intra prediction mode of the current block,
$R_c^m$ denotes an autocorrelation matrix for a column vector signal of the collected prediction errors in which an intra prediction mode is determined as intra prediction mode m,
$N_m$ denotes the number of blocks in which an intra prediction mode is determined as intra prediction mode m, among blocks which are encoded prior to the current block, and $P_k^m$ denotes one prediction error block which is a kth element of $P^m$ denoting the collected prediction errors in which an intra prediction mode is determined as intra prediction mode m.
5. The video encoding apparatus of claim 1, wherein the correlation matrix calculator is configured to calculate the autocorrelation matrix for the collected prediction errors for the intra prediction mode, based on an equation of
$$R_r^m = E\left[(P_k^m)^T P_k^m\right] = \frac{1}{N_m} \sum_{k=1}^{N_m} (P_k^m)^T P_k^m,$$
where m denotes an index indicating an intra prediction mode,
$R_r^m$ denotes an autocorrelation matrix for a row vector signal of the collected prediction errors in which an intra prediction mode is determined as intra prediction mode m,
$N_m$ denotes the number of blocks in which an intra prediction mode is determined as intra prediction mode m, among blocks which are encoded prior to the current block, and $P_k^m$ denotes one prediction error block which is a kth element of $P^m$ denoting the collected prediction errors in which an intra prediction mode is determined as intra prediction mode m.
6. A transform base generating apparatus for generating a transform base for an intra prediction mode of a current block, the transform base generating apparatus comprising:
an intra prediction error collector configured to collect, for the intra prediction mode of the current block, prediction errors of respective blocks having the same intra prediction mode as the intra prediction mode of the current block, the respective blocks being encoded prior to encoding the current block;
a correlation matrix calculator configured to calculate an autocorrelation matrix for the intra prediction mode by using (i) each first matrix corresponding to each of the collected prediction errors of each of the respective blocks for the intra prediction mode and (ii) each transpose matrix of said each first matrix, wherein the autocorrelation matrix for the intra prediction mode is generated based on an average of respective multiplications of said each first matrix and said each transpose matrix of said each first matrix; and
an eigenvector calculator configured to calculate an eigenvector from the autocorrelation matrix,
wherein the transform base for the intra prediction mode is generated based on the calculated eigenvector.
7. The transform base generating apparatus of claim 6, wherein the transform base is a KLT-based transform base.
8. A video decoding apparatus, comprising:
an intra prediction mode reader configured to extract an intra prediction mode of a current block to be decoded, from a bitstream;
an intra prediction error collector configured to collect, for the intra prediction mode of the current block, prediction errors of respective blocks having the same intra prediction mode as the intra prediction mode of the current block, the respective blocks being encoded prior to encoding the current block;
a transform base generator configured to generate a transform base for the intra prediction mode based on the collected prediction errors;
an inverse transformer configured to inversely transform prediction error information of the current block by using the transform base for the intra prediction mode to generate an error block, the prediction error information being extracted from the bitstream; and
a current block reconstructer configured to
predict the current block to generate a predicted block by using neighboring pixels of the current block according to the intra prediction mode of the current block, and
reconstruct the current block by adding the predicted block and the error block,
wherein the transform base generator comprises a correlation matrix calculator configured to calculate an autocorrelation matrix for the intra prediction mode by using (i) each first matrix corresponding to each of the collected prediction errors of each of the respective blocks for the intra prediction mode and (ii) each transpose matrix of said each first matrix,
wherein the autocorrelation matrix for the intra prediction mode is generated based on an average of respective multiplications of said each first matrix and said each transpose matrix of said each first matrix, and
wherein the transform base for the intra prediction mode is generated based on the autocorrelation matrix for the intra prediction mode.
9. The video decoding apparatus of claim 8, wherein the transform base generator further comprises:
an eigenvector calculator configured to calculate an eigenvector from the autocorrelation matrix,
wherein the transform base for the intra prediction mode is generated based on the eigenvector.
10. A video encoding method, comprising:
collecting, for an intra prediction mode of a current block, prediction errors of respective blocks having the same intra prediction mode as the intra prediction mode of the current block, the respective blocks being encoded prior to encoding the current block;
calculating an autocorrelation matrix for the intra prediction mode by using (i) each first matrix corresponding to each of the collected prediction errors of each of the respective blocks for the intra prediction mode and (ii) each transpose matrix of said each first matrix, wherein the autocorrelation matrix for the intra prediction mode is generated based on an average of respective multiplications of said each first matrix and said each transpose matrix of said each first matrix;
predicting the current block to generate a predicted block by using neighboring pixels of the current block according to the intra prediction mode of the current block;
generating a prediction error block of the current block according to a difference between the predicted block and the current block;
generating a transform base for the intra prediction mode based on the autocorrelation matrix for the intra prediction mode; and
transforming the prediction error block of the current block by using the transform base for the intra prediction mode.
11. A video decoding method, comprising:
extracting an intra prediction mode of a current block to be decoded, from a bitstream;
collecting, for the intra prediction mode of the current block, prediction errors of respective blocks having the same intra prediction mode as the intra prediction mode of the current block, the respective blocks being encoded prior to encoding the current block;
calculating an autocorrelation matrix for the intra prediction mode by using (i) each first matrix corresponding to each of the collected prediction errors of each of the respective blocks for the intra prediction mode and (ii) each transpose matrix of said each first matrix, wherein the autocorrelation matrix for the intra prediction mode is generated based on an average of respective multiplications of said each first matrix and said each transpose matrix of said each first matrix;
generating a transform base for the intra prediction mode based on the autocorrelation matrix for the intra prediction mode;
inversely transforming prediction error information of the current block by using the transform base for the intra prediction mode to generate an error block, the prediction error information being extracted from the bitstream;
predicting the current block to generate a predicted block by using neighboring pixels of the current block according to the intra prediction mode of the current block; and
reconstructing the current block by adding the predicted block and the error block.
12. The video decoding method of claim 11, wherein generating of the transform base comprises:
calculating an eigenvector from the autocorrelation matrix,
wherein the transform base for the intra prediction mode is generated based on the eigenvector.
US13/514,552 2009-12-09 2010-12-09 Image-encoding apparatus and method, transform-encoding apparatus and method, apparatus and method for generating a transform base, and image-decoding apparatus and method used in same Active 2033-01-26 US9271000B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
KR10-2009-0121980 2009-12-09
KR1020090121980A KR101441879B1 (en) 2009-12-09 2009-12-09 Video encoding apparatus and method, transform encoding apparatus and method, basis transform generating apparatus and method, and video decoding apparatus and method
PCT/KR2010/008777 WO2011071316A2 (en) 2009-12-09 2010-12-09 Image-encoding apparatus and method, transform-encoding apparatus and method, apparatus and method for generating a transform base, and image-decoding apparatus and method used in same

Publications (2)

Publication Number Publication Date
US20120307895A1 US20120307895A1 (en) 2012-12-06
US9271000B2 true US9271000B2 (en) 2016-02-23

Family

ID=44146054

Family Applications (1)

Application Number Title Priority Date Filing Date
US13/514,552 Active 2033-01-26 US9271000B2 (en) 2009-12-09 2010-12-09 Image-encoding apparatus and method, transform-encoding apparatus and method, apparatus and method for generating a transform base, and image-decoding apparatus and method used in same

Country Status (3)

Country Link
US (1) US9271000B2 (en)
KR (1) KR101441879B1 (en)
WO (1) WO2011071316A2 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011259205A (en) 2010-06-09 2011-12-22 Sony Corp Image decoding device, image encoding device, and method and program thereof
WO2013095460A1 (en) * 2011-12-21 2013-06-27 Intel Corporation Perceptual lossless compression of image data to reduce memory bandwidth and storage
US20180115787A1 (en) * 2015-04-12 2018-04-26 Lg Electronics Inc. Method for encoding and decoding video signal, and apparatus therefor
TWI761551B (en) * 2017-07-13 2022-04-21 美商松下電器(美國)知識產權公司 Encoding device, encoding method, decoding device, and decoding method
KR20190014194A (en) 2017-07-28 2019-02-12 희성전자 주식회사 Light guide plate and backlight unit having the same
IL309549A (en) * 2019-06-25 2024-02-01 Guangdong Oppo Mobile Telecommunications Corp Ltd Image encoding method, image decoding method, encoder, decoder and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20050007607A (en) 2002-06-11 2005-01-19 노키아 코포레이션 Spatial prediction based intra coding
KR20050012762A (en) 2002-05-28 2005-02-02 샤프 가부시키가이샤 Methods and systems for image intra-prediction mode estimation, communication, and organization
KR20070037533A (en) 2005-10-01 2007-04-05 삼성전자주식회사 Method for intra predictive coding for image data and encoder thereof
US20080310504A1 (en) * 2007-06-15 2008-12-18 Qualcomm Incorporated Adaptive coefficient scanning for video coding
US20120121009A1 (en) * 2009-07-23 2012-05-17 Thomson Licensing Methods and apparatus for adaptive transform selecton for video encoding and decoding

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008099175A (en) 2006-10-16 2008-04-24 Renesas Technology Corp Video coder, and video coding method

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
International Search Report mailed Aug. 12, 2011 for PCT/KR2010/008777.
Shiodera Taichiro; Akiyuki Tanizawa; Takeshi Chujoh; and Tomoo Yamakage; "Improvement of Bidirectional Intra Prediction"; ITU-T SG16/Q.6 Doc. VCEG-AG08; Oct. 2007.
Yan Ye; and Marta Karczewicz; "Improved Intra Coding"; ITU-T SG16/Q.6 Doc. VCEG-AG11; Oct. 2007.
Yan Ye; Karczewicz, M., "Improved h.264 intra coding based on bi-directional intra prediction, directional transform, and adaptive coefficient scanning," Image Processing, 2008. ICIP 2008. 15th IEEE International Conference on, vol., No., pp. 2116-2119, Oct. 12-15, 2008. *

Also Published As

Publication number Publication date
KR101441879B1 (en) 2014-09-23
KR20110065116A (en) 2011-06-15
WO2011071316A3 (en) 2011-11-24
US20120307895A1 (en) 2012-12-06
WO2011071316A2 (en) 2011-06-16

Legal Events

Date Code Title Description
AS Assignment

Owner name: SK TELECOM CO., LTD., KOREA, REPUBLIC OF

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KIM, SUNYEON;LIM, JEONGYEON;LEE, GYUMIN;AND OTHERS;SIGNING DATES FROM 20120619 TO 20120716;REEL/FRAME:028854/0212

STCF Information on status: patent grant

Free format text: PATENTED CASE

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 8