WO2008035842A1 - Apparatus and method for encoding and decoding using alternative converter according to the correlation of residual signal - Google Patents

Apparatus and method for encoding and decoding using alternative converter according to the correlation of residual signal Download PDF

Info

Publication number
WO2008035842A1
WO2008035842A1 PCT/KR2007/001809 KR2007001809W WO2008035842A1 WO 2008035842 A1 WO2008035842 A1 WO 2008035842A1 KR 2007001809 W KR2007001809 W KR 2007001809W WO 2008035842 A1 WO2008035842 A1 WO 2008035842A1
Authority
WO
WIPO (PCT)
Prior art keywords
quantization
dst
inverse
coefficients
onto
Prior art date
Application number
PCT/KR2007/001809
Other languages
French (fr)
Inventor
Dae-Yeon Kim
Jeong-Il Seo
Seung-Kwon Beack
In-Seon Jang
Dae-Young Jang
Jae-Gon Kim
Kyung-Ae Moon
Jin-Woo Hong
Jin-Woong Kim
Seoung-Jun Oh
Chang-Beom Ahn
Se-Yoon Jeong
Hae-Chul Choi
Yung-Lyul Lee
Dong-Gyu Sim
Sung-Chang Lim
Original Assignee
Electronics And Telecommunications Research Institute
Kwangwoon University Research Institute For Industry Cooperation
Industry-Academia Cooperation Group Of Sejong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020070036089A external-priority patent/KR100927733B1/en
Application filed by Electronics And Telecommunications Research Institute, Kwangwoon University Research Institute For Industry Cooperation, Industry-Academia Cooperation Group Of Sejong University filed Critical Electronics And Telecommunications Research Institute
Priority to US12/441,940 priority Critical patent/US20090238271A1/en
Publication of WO2008035842A1 publication Critical patent/WO2008035842A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/12Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
    • H04N19/122Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/147Data rate or code amount at the encoder output according to rate distortion criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process

Definitions

  • the present invention relates to an apparatus and method for encoding and decoding using alternative transform unit according to the correlation of residual signals; and, more particularly, to an encoding apparatus and method for improving a compression rate of image blocks by performing both of discrete cosine transform
  • DCT discrete sine transform
  • DST discrete sine transform
  • video coding is divided into intra coding for encoding frames in a picture, such as an intra frame, and inter coding for encoding frames between pictures, such as a predictive coded picture frame or a bidirectional predictive coded picture frame.
  • Motion estimation is performed in a unit of a block in video compression standards H.263, MPEG-4, and H.264. That is, the motion estimation is performed in a unit of a plurality of macroblocks, or the motion estimation is performed in a unit of a sub-block which is obtained by dividing a macroblock into two equal parts or four equal parts.
  • the motion estimation is performed to reduce a bit rate by removing temporal redundancy while encoding video.
  • H.264 has a higher coding efficiency than the others because H.264 codes video using variable block-based motion estimation.
  • a motion vector is predicted with reference to past frames or with reference to both of past frames and future frames based on a time domain.
  • a reference frame is a frame referred to encode or decode a current frame. Since H.264 supports multiple reference frames, H.264 selects a block of a frame having the most redundancy for the current block as a reference frame. Therefore, H.264 provides a higher coding efficiency than the others using only a past frame as a reference frame. Also, H.264 further improves the coding efficiency of H.264 baseline profile (BP) using a rate-distortion optimizing technology for selecting the optimal mode among a variable block mode, three space prediction modes (Intra 16x16, Intra 4x4, and IBLOCK), and a SKIP mode.
  • BP H.264 baseline profile
  • a transform unit is used for reducing spatial correlation of residual coefficients in a block after performing inter prediction and intra prediction and improving a compression rate and a quantizer is used for improving compression efficiency by further reducing the energy of transform coefficient after using the transform unit.
  • the transform unit of the H.264/MPEG-4 AVC standard performs integer-approximated discrete cosine transform (DCT) on a 4x4 block basis onto residual coefficients that are generated after inter and intra prediction as shown in Eq. 1.
  • DCT integer-approximated discrete cosine transform
  • Eq. 1 In Eq. 1, Y denotes an integer-approximated discrete cosine-transformed 4x4 coefficient, and X denotes a 4x4 residual coefficient.
  • a quantizer After performing the integer-approximated DCT through Eq. 1, a quantizer quantizes the transformed coefficient through Eq. 2, thereby generating a quantized transform coefficient.
  • Y ⁇ j denotes the integer-approximated discrete cosine-transformed coefficient at a position
  • the transform coefficient Z ⁇ j is converted to a bitstream through zigzag scanning and entropy encoding and the bitstream is transmitted or stored.
  • a decoding procedure decodes a bitstream through entropy decoding, inverse quantization (inverse quantizer), and 4x4 integer-approximated discrete cosine inverse transform (inverse converter).
  • the inverse quantization (inverse quantizer) is performed after entropy decoding.
  • ⁇ 1 3 denotes the inverse transformed coefficient after inverse quantization and V ⁇ J denotes a scaling factor.
  • Table 2 shows scaling factors V 13 of the inverse quantization, and (0,0), (1,0), -.., (3,3) denotes a position (i,j) of a 4X4 matrix.
  • the inverse-transformed coefficient, a 4x4 matrix Y' is expressed as a restored residual coefficient X r through the integer-approximated discrete cosine inverse transform as shown in Eq. 4.
  • the residual coefficients are expressed as first order stationary Markov sequences having high correlativity, and the integer-approximated inverse discrete cosine transform and the inverse quantization have superior performance when the correlation coefficient value is close to 1.
  • the correlation of residual coefficients in a picture has been lowered due to the development of the video encoding technology.
  • video encoding efficiency deteriorates if the correlation of the residual coefficients decreases.
  • the video encoding method according to the related art has a problem of the degradation of compression efficiency because the video encoding method according to the related art performs only quantizing a DCT coefficient in a picture when video is encoded. That is, as shown in Fig. 2, the video encoding method according to the related art performs inter frame prediction and intra frame prediction at steps S201 and S203 and performs DCT, quantization, inverse quantization, IDCT, and entropy coding at steps S202 and S204. At step S205, the video encoding method according to the related art decides a mode that minimizes a rate-distortion cost
  • RDcost among all possible encoding modes used in H.264, such as a variable block mode, three spatial prediction modes, and a SKIP mode, as an encoding mode by performing rate-distortion optimization in order to select the optimal mode.
  • the spatial prediction mode denotes an intra prediction mode
  • the SKIP mode means a case not requiring encoding because a pixel value of a macroblock of a previous frame is identical to that of the current frame.
  • the RDcost is calculated in consideration of image quality distortion and rates of each mode.
  • the video encoding efficiency of the video encoding method according to the related art deteriorates if the correlation of the residual coefficients decreases although the video encoding method according to the related art provide good video encoding efficiency when the correlation of the residual coefficients is high. Therefore, there is a demand for developing a new transforming scheme (transform unit) suitable to the low correlation of residual coefficients in order to prevent the deterioration of encoding efficiency when video is encoded.
  • An embodiment of the present invention is directed to providing an encoding apparatus and method for improving a compression rate of image blocks by performing both discrete cosine transform (DCT) and discrete sine transform (DST) and selecting one having a higher compression rate than the other between the DCT and DST through rate-distortion optimization when a quantized transformed coefficient is generated through transform and quantization after performing intra prediction and inter prediction on a predetermined size of block (macroblock) , and a decoding apparatus and method thereof.
  • DCT discrete cosine transform
  • DST discrete sine transform
  • an encoding apparatus including a first transforming unit for performing discrete cosine transform (DCT), first quantization, first inverse quantization, and inverse DCT on a block basis onto residual coefficients generated after performing intra frame prediction or inter frame prediction; a second transforming unit for performing discrete sine transform (DST), second quantization, second inverse quantization, and inverse DST on a block basis onto the residual coefficients; a selecting unit for selecting one having a high compression rate between the first and second transforming unit for each block through performing rate-distortion optimization; and a flag marking unit for recording information about the selected transforming unit at a flag bit provided on a macroblock basis.
  • DCT discrete cosine transform
  • DST discrete sine transform
  • second quantization discrete sine transform
  • second inverse quantization second inverse quantization
  • inverse DST discrete sine transform
  • a selecting unit for selecting one having a high compression rate between the first and second transforming unit for each block through performing rate-distortion optimization
  • a flag marking unit
  • a video decoding apparatus including: a flag identifying unit for detecting an encoding method of the bitstream by identifying a flag value included in a received bitstream header; and a decoding unit for performing first inverse quantization and inverse discrete cosine transform or second inverse quantization and inverse discrete sine transform according to the encoding method figured out by the flag identifying unit.
  • a video encoding method including the steps of: performing discrete cosine transform (DCT) , first quantization, first inverse quantization, and inverse DCT on a block basis onto residual coefficients generated after intra frame prediction or inter frame prediction; performing discrete sine transform (DST), second quantization, second inverse quantization, and inverse DST on a block basis onto the residual coefficients in addition to the step of performing DCT, first quantization, first inverse quantization, and inverse DCT; selecting a transforming scheme having a high compression rate for each a block through performing rate-distortion optimization; and recording information about the selected transforming scheme at a flag bit provided on a macroblock basis.
  • DCT discrete cosine transform
  • DST discrete sine transform
  • a video decoding method including the steps of: detecting an encoding method of the bitstream by identifying a flag value included in a header of the received bitstream; and decoding the received bitstream on a block basis by performing first inverse quantization and inverse discrete cosine transform, or second inverse quantization and inverse discrete sine transform according to the detected encoding method.
  • An encoding/decoding apparatus and method can improve a compression rate by performing both DCT and DST in a transform unit and selecting one having a highei compression rate than the other between the DCT and DST through rate-distortion optimization when a quantized transform coefficient is generated through the transform unit and a quantizer after inter prediction and intra prediction are performed on a block of a predetermined size.
  • Fig. 1 illustrates a H .264/MPEG-4 AVC encoding apparatus where the present invention is applied.
  • Fig. 2 is a flowchart describing an encoding method for optimizing a rate-distortion optimizing structure in a H.264/MPEG-4 AVC encoding apparatus in accordance with a related art.
  • Fig. 3 is a block diagram illustrating an encoding apparatus selectively using transform units according to the correlation of residual coefficients in accordance with an embodiment of the present invention.
  • Fig. 4 is a block diagram illustrating a decoding apparatus in accordance with an embodiment of the present invention.
  • Fig. 5 is a flowchart describing an encoding method for optimizing a rate-distortion optimizing structure in an H.264/MPEG-4 AVC in accordance with an embodiment of the present invention.
  • Figs. 6 and 7 are rate-distortion graphs for comparing an encoding/decoding method according to the present invention with the encoding/decoding method according to a related art based on "Foreman” and “Coastguard” QCIF picture.
  • Figs. 8 and 9 are rate-distortion graphs for comparing an encoding/decoding method according to the present invention with the encoding/decoding method according to the related art based on "Stephen” and "HallMonitor” QCIF picture.
  • FIGS. 10 and 11 are rate-distortion graphs for comparing an encoding/decoding method according to the present invention with the encoding/decoding method according to the related art based on "Foreman” and “Coastguard” CIF picture.
  • Figs. 12 and 13 are rate-distortion graphs for comparing an encoding/decoding method according to the present invention with the encoding/decoding method according to the related art based on "MobileandCalender” and "Soccer” QCIF picture.
  • Fig. 1 illustrates a H.264/MPEG-4 AVC encoding apparatus where the present invention is applied.
  • the H.264/MPEG-4 AVC encoding apparatus includes a transform and quantization unit 11, an entropy encoder 12, a coding controller (rate-distortion optimizer) 13, an inverse quantization and inverse transform unit 14, a loop filter 15, a reference image storing unit 16, a motion estimation unit 17, and a motion compensation unit 18.
  • an encoding apparatus includes a transcoder function that performs an encoding process and a decoding process, and a decoding apparatus perform a decoding process. Since the decoding process of the decoding apparatus is identical to the decoding process of the encoding apparatus, the encoding apparatus will be mainly described.
  • the transform and quantization unit 11 receives an input image predicted by Intra or Inter prediction.
  • the transform and quantization unit 11 performs discrete cosine transform (DCT) and first quantization and discrete sine transform (DST) and second quantization on the received input image.
  • the entropy encoder 12 performs entropy coding onto the transformed and quantized coefficient data and outputs a bitstream thereof.
  • the input image is also input to the coding controller 13 (rate-distortion optimization unit).
  • the coding controller 13 decides an optimal block mode by performing inverse quantization and inverse DCT (IDCT) and inverse quantization and inverse DST (IDST) onto the input image and outputs the decided optimal block mode to the transform and quantization unit 11.
  • IDCT inverse quantization and inverse DCT
  • IDST inverse quantization and inverse DST
  • the inverse quantization and inverse transform unit 14 receives image data acquired after the DCT, first quantization, DST, and second quantization and performs first inverse quantization, IDCT, second inverse quantization, and IDST thereon.
  • the loop filter 15 smoothes a block boundary of the inverse transformed and inverse quantized image data through low pass filtering. Then, the filtered image data is stored in the reference image storing unit 16.
  • the motion estimation unit 17 performs motion estimation based on the stored reference image and the input image and transfers the result thereof to the motion compensation unit 18.
  • the motion compensation unit 18 decides whether the reference image is subtracted from the input image or not according to whether a target input image to encode is an inter frame or an intra frame. Then, the motion compensation unit 18 transfers the reference image to the transform and quantization unit 11.
  • the encoding apparatus performs the DST process and the second quantization process and the second inverse quantization process and the IDST process for each block as well as the DCT process and the IDCT process and selects one providing a higher compression rate (DCT/IDCT or DST/IDST) than the other between the transforming processes (transform units) through rate-distortion optimization. Therefore, the encoding apparatus according to the present embodiment can improve the compression rate of an image block. That is, the encoding apparatus according to the present embodiment decides the optimal rnacroblock. type used for motion estimation and compensation by performing rate-distortion optimization and performs the motion estimation and compensation using the decided macroblock.
  • the encoding apparatus records the selected transform information (DCT information or DST information) at a k-bit prediction flag in a header of a macroblock layer syntax which is composed of a header field and a data field and where k is an integer number and transmits the recorded information to the decoding apparatus. Therefore, a decoding apparatus is enabled to select a decoding method based on the flag value recorded in the prediction flag.
  • DCT information or DST information selected transform information
  • the DST provides energy compression performance identical to optimal Karhunen Loeve transform (KL transform unit) when the correlation of residual coefficients is not large and a region of the correlation coefficient values is in (-0.5, 0.5).
  • KL transform unit Karhunen Loeve transform
  • transform may be performed in a NxM block as a basic block processing unit, where N and M are integer numbers.
  • transform may be performed in 4x8, 8x4, 8x8, 8x16, 16x8, and 16x16 blocks as well as 4x4 block.
  • the encoding/decoding apparatus and method according to the present embodiment will be described to perform transform in a 4x4 block as a preferred embodiment.
  • the encoding apparatus selects one providing a higher compression rate the other between DCT and DST by performing rate- distortion optimization in a block when a quantized transform coefficient is generated through transformation and quantization after performing inter prediction and intra prediction for a predetermined size of a block (macroblock) , records information about the selected transforming scheme (DCT or DST) at a 1-bit flag bit: that is added on a macroblock basis and transmits the flag bit to the decoding apparatus .
  • DCT or DST transforming scheme
  • the encoding and decoding apparatus includes a first transform unit for performing DCT and first quantization, and first inverse quantization and IDCT on a block basis for residual coefficients that are generated after performing inter prediction and intra prediction, a second transform unit for performing DST and second quantization, and second inverse quantization and IDST on a block basis for the residual coefficients, a rate-distortion optimization unit 29 for selecting one having a higher compression rate than the other between the first transform unit and the second transform unit by performing rate-distortion optimization, and a flag marking unit 40 for recording information about the selected transform unit to a corresponding flag bit disposed on a macroblock basis.
  • the first transform unit includes a DCT processor 31 for performing integer approximated discrete cosine transform (DCT) (integer transform) for residual coefficients (see Eq. 1), a quantization unit 32 for generating a quantized transform coefficient by performing the first quantization (referred to Eq. 2) onto the integer-transformed coefficient, an inverse quantization unit 33 for generating an integer- transformed coefficient by performing first inverse quantization (see Eq. 3) onto the quantized transform coefficient, and an IDCT processor 34 for restoring a residual coefficient by performing integer approximated inverse discrete cosine transform (see Eq. 4) onto the integer-transformed coefficient .
  • the second transform unit includes a DST processor 35 for performing integer approximated discrete sine transform (DST) (see Eq.
  • a quantization unit 36 for generating quantized transform coefficients by performing second quantization (referred to Eq. 10) onto the integer- transformed coefficients
  • an inverse quantization unit 37 for generating integer-transformed coefficients by performing second inverse quantization (referred to Eq. 11) onto the quantized transform coefficients
  • an IDST processor 38 for restoring residual coefficients by performing integer approximated inverse discrete sine transform (referred to Eq. 9) onto the integer- transformed coefficients.
  • one of the transform units is selected according to the correlation of residual coefficients, information about the selected transform unit (DCT or DST information) is recorded at a 1-bit flag bit, and the flag bit is transmitted to a decoding apparatus of Fig. 4.
  • the decoding apparatus of Fig. 4 identifies the information about the selected transform unit through a flag identifying unit 41 and performs inverse quantization and IDCT onto a received bitstream on a block basis through an inverse quantization unit 44 and an IDST processor 45 or performs inverse quantization and IDST through an inverse quantization unit 44 and an IDST processor 45, thereby performing decode with a suitable block unit.
  • the decoding apparatus includes a flag identifying unit 41 for identifying a flag value included in a header of a received bitstream and detecting a coding method of the received bitstream based on the identified flag value and a decoding unit for decoding a bitstream on a block basis through inverse quantization and IDCT or inverse quantization and IDST.
  • the decoding unit includes an inverse quantization unit 42,, an IDCT processor 43, an inverse quantization unit 44, and an IDST processor 45.
  • a flag value included in a bitstream header indicates the selected one of the first transform unit and the second transform unit, which provides the higher compression efficiency.
  • the first transform unit performs the DCT (see Eq. 1), the first quantization (see Eq. 2), the first inverse quantization (see Eq. 3), and the IDCT (see Eq. 4) on a block basis onto residual coefficients generated after inter prediction and intra prediction.
  • the second transform unit performs the DST (Eq. 8), the second quantization (Eq. 10), the second inverse quantization (Eq. 11), and the IDST (Eq. 9) on a block basis for residual coefficients.
  • Eq. 6 and Eq. 7 express the first order discrete sine transform (DST) and the first order inverse discrete sine transform (IDST) .
  • X denotes a residual coefficient to be processed through DST
  • Y is a DST processed coefficient
  • N denotes a unit side of DST.
  • Eq. 6 and Eq. 7 are converted to a 4x4 discrete sine transform matrix and an inverse discrete sine transform matrix as shown in Eq. 8 and Eq. 9.
  • C denotes a DST matrix for each row of X and C ⁇ denotes a DST matrix transposed for each column of X.
  • C and C ⁇ are identical to those in Eq. 8.
  • X' denotes a restored residual coefficient
  • Y' denotes an inverse-quantized transform coefficient.
  • Elements a and b in the matrix denote constants Vi" 11 "?' and V ⁇ S111 ⁇ f" y .
  • the DST is performed by the DST processor 35 on a 4x4 block basis for the residual coefficient generated after inter predict-icn and intra prediction as shown in Eq. 8 as a method of a H.264/MPEG-4 AVC transform unit.
  • the discrete sine-transformed coefficient is quantized through the second quantization process of Eq. 10 by the quantization unit 36, thereby generating a quantized DST coefficient.
  • Z 13 denotes a quantized DST coefficient located at a position ⁇ i,j) of a matrix.
  • QStep denotes a step size of a quantization unit, and round () denotes a rounding off function.
  • the transformed bitstream is processed through inverse quantization using an inverse quantization unit 37 and 4x4 IDST using an IDST processor 38 in a decoding procedure.
  • the operations of the inverse quantization unit 37 and the IDST processor 38 will be described.
  • the inverse quantization unit 37 performs inverse quantization onto the quantized DST coefficient as shown in Eq. 11.
  • the DST coefficient 4x4 matrix ⁇ is converted to a 4x4 restored residual coefficient X through IDST by the IDST processor 38 as shown in Eq. 9.
  • X * ⁇ j denotes a final restored residual coefficient of a 4X4 block.
  • the DST, the second quantization, the second inverse quantization, and the IDST are completely performed.
  • the information about a transform unit (DCT or DST) selected according to the correlation of residual signals by the encoding apparatus is recorded in a 1-bit flag bit which is added on a macroblock basis. Then, the flag bit is transmitted to the decoding apparatus of Fig. 4. Therefore, the decoding apparatus is enabled to decode the bitstream with a proper method.
  • the flag bit having information about the selected transform unit may be applied to various unit blocks such as the maximum NxN unit block to minimum 4x4 unit block.
  • a compression rate can be improved by selecting a transform unit by modifying the structure of rate-distortion optimization in the H.264/MPEG-4 AVC encoding apparatus according to the related art to that shown in Fig. 5.
  • intra frame prediction and inter frame prediction are performed at steps S501 and 504.
  • integer approximated discrete cosine transform (DCT), first quantization, first inverse quantization, and integer approximated inverse DCT, and entropy encoding are performed at steps S505 and S506.
  • a mode that minimizes a rate-distortion cost (RDcost) is selected from all possible coding modes used in H.264, such as a variable block mode, three spatial prediction modes, and a SKIP mode at step S507. That is, a transform unit having high compression efficiency is selected.
  • the information about the selected transform unit is recorded at a corresponding flag bit disposed on a macroblock basis and transmitted to the decoding apparatus. Therefore, the decoding apparatus is enabled to decide a proper decoding method using the flag value recorded in the prediction flag.
  • the simulations were performed using a joint model (JM) 10.2 encoder that supports H.264/MPEG-4 AVC.
  • JM joint model
  • As test images four 176 x 144 quarter common intermediate format (QCIF) images and four 352 x 288 common intermediate format (CIF) images, which are stored at 30Hz frame rate.
  • Table 3 shows simulation conditions.
  • Table 4 shows compression rates obtained from simulations performed under the conditions of Table 3.
  • various images were compressed using the H.264/MPEG-4 AVC compressing method according to the related art and the encoding method according to the present embodiment.
  • Table 4 clearly shows that the performance of the encoding method selectively using the transform unit according to the correlation of residual coefficients according to the present embodiment is much better than the H.264 /MPEG-4 AVC compression method.
  • Figs. 6, 7, 8, and 9 are rate-distortion graphs of QCIF pictures used in Table 4 for comparing an encoding/decoding method (apparatus) according to the present invention with the encoding/decoding method according to the related art.
  • Figs. 10, 11, 12 and 13 are rate-distortion graphs of CIF pictures used in Table 4 for comparing an encoding/decoding method (apparatus) according to the present invention with the encoding/decoding method according to the related art.
  • the rate-distortion graphs also clearly shows that the performance of the encoding method selectively using the transform unit according to the correlation of residual coefficients according to the present embodiment is improved as much as maximum 3db compared to the H.264/MPEG-4 AVC compression method.
  • the method of the present invention described above may be programmed for a computer. Codes and code segments constituting the computer program may be easily inferred by a computer programmer of ordinary skill in the art to which the present invention pertains.
  • the computer program may be stored in a computer-readable recording medium, i.e., data storage, and it may be read and executed by a computer to realize the method of the present invention.
  • the recording medium includes all types of computer-readable recording media. While the present invention has been described with respect to certain preferred embodiments, it will be apparent to those skilled in the art that various changes and modifications may be made without departing from the scope of the invention as defined in the following claims.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Discrete Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Provided is an apparatus and method for encoding and decoding using alternative transform units according to the correlation of residual signals. The video encoding apparatus includes a first transforming unit for performing discrete cosine transform (DCT), first quantization, first inverse quantization, and inverse DCT on a block basis onto residual coefficients generated after intra frame prediction or inter frame prediction; a second transforming unit for performing discrete sine transform (DST), second quantization, second inverse quantization, and inverse DST on a block basis onto the residual coefficients; a selecting unit for selecting one having a high compression rate between the first and second transforming units for each block through performing rate-distortion optimization; and a flag marking unit for recording information about the selected transforming unit at a flag bit provided on a macroblock basis.

Description

DESCRIPTION
APPARATUS AND METHOD FOR ENCODING AND DECODING USING ALTERNATIVE CONVERTER ACCORDING TO THE CORRELATION OF
RESIDUAL SIGNAL
TECHNICAL FIELD
The present invention relates to an apparatus and method for encoding and decoding using alternative transform unit according to the correlation of residual signals; and, more particularly, to an encoding apparatus and method for improving a compression rate of image blocks by performing both of discrete cosine transform
(DCT) and discrete sine transform (DST) and selecting one having a higher compression rate than the other between DCT and DST through performing rate-distortion optimization when a quantized transform coefficient is generated through transform and quantization after performing intra and inter prediction onto a predetermined size of block (macroblock) , and a decoding apparatus and method thereof.
BACKGROUND ART
In general, video coding is divided into intra coding for encoding frames in a picture, such as an intra frame, and inter coding for encoding frames between pictures, such as a predictive coded picture frame or a bidirectional predictive coded picture frame.
Motion estimation is performed in a unit of a block in video compression standards H.263, MPEG-4, and H.264. That is, the motion estimation is performed in a unit of a plurality of macroblocks, or the motion estimation is performed in a unit of a sub-block which is obtained by dividing a macroblock into two equal parts or four equal parts. The motion estimation is performed to reduce a bit rate by removing temporal redundancy while encoding video. Particularly, H.264 has a higher coding efficiency than the others because H.264 codes video using variable block-based motion estimation.
A motion vector is predicted with reference to past frames or with reference to both of past frames and future frames based on a time domain. A reference frame is a frame referred to encode or decode a current frame. Since H.264 supports multiple reference frames, H.264 selects a block of a frame having the most redundancy for the current block as a reference frame. Therefore, H.264 provides a higher coding efficiency than the others using only a past frame as a reference frame. Also, H.264 further improves the coding efficiency of H.264 baseline profile (BP) using a rate-distortion optimizing technology for selecting the optimal mode among a variable block mode, three space prediction modes (Intra 16x16, Intra 4x4, and IBLOCK), and a SKIP mode.
According to a H.264/MPEG-4 AVC standard for encoding/decoding video data, a transform unit is used for reducing spatial correlation of residual coefficients in a block after performing inter prediction and intra prediction and improving a compression rate and a quantizer is used for improving compression efficiency by further reducing the energy of transform coefficient after using the transform unit.
That is, the transform unit of the H.264/MPEG-4 AVC standard performs integer-approximated discrete cosine transform (DCT) on a 4x4 block basis onto residual coefficients that are generated after inter and intra prediction as shown in Eq. 1.
Figure imgf000004_0001
Eq. 1 In Eq. 1, Y denotes an integer-approximated discrete cosine-transformed 4x4 coefficient, and X denotes a 4x4 residual coefficient.
After performing the integer-approximated DCT through Eq. 1, a quantizer quantizes the transformed coefficient through Eq. 2, thereby generating a quantized transform coefficient.
Zff
Figure imgf000005_0001
In Eq. 2, Y±j denotes the integer-approximated discrete cosine-transformed coefficient at a position
(i,j) of a 4X4 matrix and Z±j is a quantized transform coefficient at a position {i,j) of a 4x4 matrix. QP denotes a quantization parameter and MF is a multiplication factor. Table 1 shows multiplication factors (MF) for quantization of Eq. 2 and (0,0),
(1,0), ..., (3,3) denote a position {i,j) of a 4x4 matrix.
Table 1
Figure imgf000005_0002
The transform coefficient Z±j is converted to a bitstream through zigzag scanning and entropy encoding and the bitstream is transmitted or stored.
On the contrary, a decoding procedure decodes a bitstream through entropy decoding, inverse quantization (inverse quantizer), and 4x4 integer-approximated discrete cosine inverse transform (inverse converter).
Hereinafter, the inverse quantization (inverse quantizer) , and the 4x4 integer-approximated discrete cosine inverse transform (inverse converter) will be described.
As shown Eq. 3, the inverse quantization (inverse quantizer) is performed after entropy decoding.
Figure imgf000006_0001
In Eq. 3, Ϋ 13 denotes the inverse transformed coefficient after inverse quantization and VλJ denotes a scaling factor. Table 2 shows scaling factors V13 of the inverse quantization, and (0,0), (1,0), -.., (3,3) denotes a position (i,j) of a 4X4 matrix. Table 2
Figure imgf000006_0002
Then, the inverse-transformed coefficient, a 4x4 matrix Y' , is expressed as a restored residual coefficient Xr through the integer-approximated discrete cosine inverse transform as shown in Eq. 4.
Figure imgf000007_0001
Then, the restored residual coefficient X il IS expressed as x"' ±D through post-scaling as shown in Eq. 5.
Tf χτj " = round(—^)
Eq. 5
The residual coefficients are expressed as first order stationary Markov sequences having high correlativity, and the integer-approximated inverse discrete cosine transform and the inverse quantization have superior performance when the correlation coefficient value is close to 1. However, the correlation of residual coefficients in a picture has been lowered due to the development of the video encoding technology. Particularly, video encoding efficiency deteriorates if the correlation of the residual coefficients decreases.
The video encoding method according to the related art has a problem of the degradation of compression efficiency because the video encoding method according to the related art performs only quantizing a DCT coefficient in a picture when video is encoded. That is, as shown in Fig. 2, the video encoding method according to the related art performs inter frame prediction and intra frame prediction at steps S201 and S203 and performs DCT, quantization, inverse quantization, IDCT, and entropy coding at steps S202 and S204. At step S205, the video encoding method according to the related art decides a mode that minimizes a rate-distortion cost
(RDcost) among all possible encoding modes used in H.264, such as a variable block mode, three spatial prediction modes, and a SKIP mode, as an encoding mode by performing rate-distortion optimization in order to select the optimal mode. Here, the spatial prediction mode denotes an intra prediction mode, and the SKIP mode means a case not requiring encoding because a pixel value of a macroblock of a previous frame is identical to that of the current frame. The RDcost is calculated in consideration of image quality distortion and rates of each mode.
Since the video encoding method according to the related art only quantizes the DCT coefficient in a picture when the video is encoded, the video encoding efficiency of the video encoding method according to the related art deteriorates if the correlation of the residual coefficients decreases although the video encoding method according to the related art provide good video encoding efficiency when the correlation of the residual coefficients is high. Therefore, there is a demand for developing a new transforming scheme (transform unit) suitable to the low correlation of residual coefficients in order to prevent the deterioration of encoding efficiency when video is encoded.
DISCLOSURE TECHNICAL PROBLEM An embodiment of the present invention is directed to providing an encoding apparatus and method for improving a compression rate of image blocks by performing both discrete cosine transform (DCT) and discrete sine transform (DST) and selecting one having a higher compression rate than the other between the DCT and DST through rate-distortion optimization when a quantized transformed coefficient is generated through transform and quantization after performing intra prediction and inter prediction on a predetermined size of block (macroblock) , and a decoding apparatus and method thereof.
Other objects and advantages of the present invention can be understood by the following description, and become apparent with reference to the embodiments of the present invention. Also, it is obvious to those skilled in the art of the present invention that the objects and advantages of the present invention can be realized by the means as claimed and combinations thereof.
TECHNICAL SOLUTION
In accordance with an aspect of the present invention, there is provided an encoding apparatus including a first transforming unit for performing discrete cosine transform (DCT), first quantization, first inverse quantization, and inverse DCT on a block basis onto residual coefficients generated after performing intra frame prediction or inter frame prediction; a second transforming unit for performing discrete sine transform (DST), second quantization, second inverse quantization, and inverse DST on a block basis onto the residual coefficients; a selecting unit for selecting one having a high compression rate between the first and second transforming unit for each block through performing rate-distortion optimization; and a flag marking unit for recording information about the selected transforming unit at a flag bit provided on a macroblock basis.
In accordance with another aspect of the present invention, there is provided a video decoding apparatus including: a flag identifying unit for detecting an encoding method of the bitstream by identifying a flag value included in a received bitstream header; and a decoding unit for performing first inverse quantization and inverse discrete cosine transform or second inverse quantization and inverse discrete sine transform according to the encoding method figured out by the flag identifying unit.
In accordance with yet another aspect of the present invention, there is provided a video encoding method including the steps of: performing discrete cosine transform (DCT) , first quantization, first inverse quantization, and inverse DCT on a block basis onto residual coefficients generated after intra frame prediction or inter frame prediction; performing discrete sine transform (DST), second quantization, second inverse quantization, and inverse DST on a block basis onto the residual coefficients in addition to the step of performing DCT, first quantization, first inverse quantization, and inverse DCT; selecting a transforming scheme having a high compression rate for each a block through performing rate-distortion optimization; and recording information about the selected transforming scheme at a flag bit provided on a macroblock basis. In accordance with still another aspect of the present invention, there is provided a video decoding method including the steps of: detecting an encoding method of the bitstream by identifying a flag value included in a header of the received bitstream; and decoding the received bitstream on a block basis by performing first inverse quantization and inverse discrete cosine transform, or second inverse quantization and inverse discrete sine transform according to the detected encoding method. ADVANTAGEOUS EFFECTS
An encoding/decoding apparatus and method according to the present invention can improve a compression rate by performing both DCT and DST in a transform unit and selecting one having a highei compression rate than the other between the DCT and DST through rate-distortion optimization when a quantized transform coefficient is generated through the transform unit and a quantizer after inter prediction and intra prediction are performed on a block of a predetermined size.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 illustrates a H .264/MPEG-4 AVC encoding apparatus where the present invention is applied. Fig. 2 is a flowchart describing an encoding method for optimizing a rate-distortion optimizing structure in a H.264/MPEG-4 AVC encoding apparatus in accordance with a related art.
Fig. 3 is a block diagram illustrating an encoding apparatus selectively using transform units according to the correlation of residual coefficients in accordance with an embodiment of the present invention.
Fig. 4 is a block diagram illustrating a decoding apparatus in accordance with an embodiment of the present invention.
Fig. 5 is a flowchart describing an encoding method for optimizing a rate-distortion optimizing structure in an H.264/MPEG-4 AVC in accordance with an embodiment of the present invention. Figs. 6 and 7 are rate-distortion graphs for comparing an encoding/decoding method according to the present invention with the encoding/decoding method according to a related art based on "Foreman" and "Coastguard" QCIF picture. Figs. 8 and 9 are rate-distortion graphs for comparing an encoding/decoding method according to the present invention with the encoding/decoding method according to the related art based on "Stephen" and "HallMonitor" QCIF picture. Figs. 10 and 11 are rate-distortion graphs for comparing an encoding/decoding method according to the present invention with the encoding/decoding method according to the related art based on "Foreman" and "Coastguard" CIF picture. Figs. 12 and 13 are rate-distortion graphs for comparing an encoding/decoding method according to the present invention with the encoding/decoding method according to the related art based on "MobileandCalender" and "Soccer" QCIF picture.
BEST MODE FOR THE INVENTION
The advantages, features and aspects of the invention will become apparent from the following description of the embodiments with reference to the accompanying drawings, which is set forth hereinafter. Therefore, those skilled in the field of this art of the present invention can embody the technological concept and scope of the invention easily. In addition, if it is considered that detailed description on a related art may obscure the points of the present invention, the detailed description will not be provided herein. The preferred embodiments of the present invention will be described in detail hereinafter with reference to the attached drawings . Fig. 1 illustrates a H.264/MPEG-4 AVC encoding apparatus where the present invention is applied.
The H.264/MPEG-4 AVC encoding apparatus includes a transform and quantization unit 11, an entropy encoder 12, a coding controller (rate-distortion optimizer) 13, an inverse quantization and inverse transform unit 14, a loop filter 15, a reference image storing unit 16, a motion estimation unit 17, and a motion compensation unit 18.
In general, an encoding apparatus includes a transcoder function that performs an encoding process and a decoding process, and a decoding apparatus perform a decoding process. Since the decoding process of the decoding apparatus is identical to the decoding process of the encoding apparatus, the encoding apparatus will be mainly described.
The transform and quantization unit 11 receives an input image predicted by Intra or Inter prediction. The transform and quantization unit 11 performs discrete cosine transform (DCT) and first quantization and discrete sine transform (DST) and second quantization on the received input image. The entropy encoder 12 performs entropy coding onto the transformed and quantized coefficient data and outputs a bitstream thereof. Here, the input image is also input to the coding controller 13 (rate-distortion optimization unit). The coding controller 13 decides an optimal block mode by performing inverse quantization and inverse DCT (IDCT) and inverse quantization and inverse DST (IDST) onto the input image and outputs the decided optimal block mode to the transform and quantization unit 11.
In a decoder loop, the inverse quantization and inverse transform unit 14 receives image data acquired after the DCT, first quantization, DST, and second quantization and performs first inverse quantization, IDCT, second inverse quantization, and IDST thereon. The loop filter 15 smoothes a block boundary of the inverse transformed and inverse quantized image data through low pass filtering. Then, the filtered image data is stored in the reference image storing unit 16. The motion estimation unit 17 performs motion estimation based on the stored reference image and the input image and transfers the result thereof to the motion compensation unit 18. The motion compensation unit 18 decides whether the reference image is subtracted from the input image or not according to whether a target input image to encode is an inter frame or an intra frame. Then, the motion compensation unit 18 transfers the reference image to the transform and quantization unit 11.
As described above, the encoding apparatus according to the present embodiment performs the DST process and the second quantization process and the second inverse quantization process and the IDST process for each block as well as the DCT process and the IDCT process and selects one providing a higher compression rate (DCT/IDCT or DST/IDST) than the other between the transforming processes (transform units) through rate-distortion optimization. Therefore, the encoding apparatus according to the present embodiment can improve the compression rate of an image block. That is, the encoding apparatus according to the present embodiment decides the optimal rnacroblock. type used for motion estimation and compensation by performing rate-distortion optimization and performs the motion estimation and compensation using the decided macroblock. Here, the encoding apparatus records the selected transform information (DCT information or DST information) at a k-bit prediction flag in a header of a macroblock layer syntax which is composed of a header field and a data field and where k is an integer number and transmits the recorded information to the decoding apparatus. Therefore, a decoding apparatus is enabled to select a decoding method based on the flag value recorded in the prediction flag.
The DST provides energy compression performance identical to optimal Karhunen Loeve transform (KL transform unit) when the correlation of residual coefficients is not large and a region of the correlation coefficient values is in (-0.5, 0.5).
In the present embodiment, transform may be performed in a NxM block as a basic block processing unit, where N and M are integer numbers. For example, transform may be performed in 4x8, 8x4, 8x8, 8x16, 16x8, and 16x16 blocks as well as 4x4 block. Hereinafter, the encoding/decoding apparatus and method according to the present embodiment will be described to perform transform in a 4x4 block as a preferred embodiment.
An encoding apparatus for selectively using transform units according to correlation of residual coefficients in accordance with an embodiment of the present invention will be described.
Since processes of performing DCT (see Eq. 1), performing the first quantization (see Eq. 2), performing the first inverse quantization (see Eq. 3), and performing the IDCT (see Eq. 4) for the residual coefficients generated after motion estimation and motion compensation are identical to those described in the background art, detail descriptions thereof are omitted.
However, the encoding apparatus according to present embodiment selects one providing a higher compression rate the other between DCT and DST by performing rate- distortion optimization in a block when a quantized transform coefficient is generated through transformation and quantization after performing inter prediction and intra prediction for a predetermined size of a block (macroblock) , records information about the selected transforming scheme (DCT or DST) at a 1-bit flag bit: that is added on a macroblock basis and transmits the flag bit to the decoding apparatus .
The encoding and decoding apparatus according to the present embodiment will be described in more detail with reference to Fig. 3 in detail. The encoding and decoding apparatus according to the present embodiment includes a first transform unit for performing DCT and first quantization, and first inverse quantization and IDCT on a block basis for residual coefficients that are generated after performing inter prediction and intra prediction, a second transform unit for performing DST and second quantization, and second inverse quantization and IDST on a block basis for the residual coefficients, a rate-distortion optimization unit 29 for selecting one having a higher compression rate than the other between the first transform unit and the second transform unit by performing rate-distortion optimization, and a flag marking unit 40 for recording information about the selected transform unit to a corresponding flag bit disposed on a macroblock basis.
Here, the first transform unit includes a DCT processor 31 for performing integer approximated discrete cosine transform (DCT) (integer transform) for residual coefficients (see Eq. 1), a quantization unit 32 for generating a quantized transform coefficient by performing the first quantization (referred to Eq. 2) onto the integer-transformed coefficient, an inverse quantization unit 33 for generating an integer- transformed coefficient by performing first inverse quantization (see Eq. 3) onto the quantized transform coefficient, and an IDCT processor 34 for restoring a residual coefficient by performing integer approximated inverse discrete cosine transform (see Eq. 4) onto the integer-transformed coefficient . The second transform unit includes a DST processor 35 for performing integer approximated discrete sine transform (DST) (see Eq. 8) for residual coefficients to generate integer-transformed coefficients, a quantization unit 36 for generating quantized transform coefficients by performing second quantization (referred to Eq. 10) onto the integer- transformed coefficients, an inverse quantization unit 37 for generating integer-transformed coefficients by performing second inverse quantization (referred to Eq. 11) onto the quantized transform coefficients, and an IDST processor 38 for restoring residual coefficients by performing integer approximated inverse discrete sine transform (referred to Eq. 9) onto the integer- transformed coefficients. As described above, one of the transform units is selected according to the correlation of residual coefficients, information about the selected transform unit (DCT or DST information) is recorded at a 1-bit flag bit, and the flag bit is transmitted to a decoding apparatus of Fig. 4.
The decoding apparatus of Fig. 4 identifies the information about the selected transform unit through a flag identifying unit 41 and performs inverse quantization and IDCT onto a received bitstream on a block basis through an inverse quantization unit 44 and an IDST processor 45 or performs inverse quantization and IDST through an inverse quantization unit 44 and an IDST processor 45, thereby performing decode with a suitable block unit. The decoding apparatus includes a flag identifying unit 41 for identifying a flag value included in a header of a received bitstream and detecting a coding method of the received bitstream based on the identified flag value and a decoding unit for decoding a bitstream on a block basis through inverse quantization and IDCT or inverse quantization and IDST. The decoding unit includes an inverse quantization unit 42,, an IDCT processor 43, an inverse quantization unit 44, and an IDST processor 45.
Here, a flag value included in a bitstream header indicates the selected one of the first transform unit and the second transform unit, which provides the higher compression efficiency. As described above with reference to Fig. 3, the first transform unit performs the DCT (see Eq. 1), the first quantization (see Eq. 2), the first inverse quantization (see Eq. 3), and the IDCT (see Eq. 4) on a block basis onto residual coefficients generated after inter prediction and intra prediction. The second transform unit performs the DST (Eq. 8), the second quantization (Eq. 10), the second inverse quantization (Eq. 11), and the IDST (Eq. 9) on a block basis for residual coefficients.
Hereinafter, the operation of the encoding apparatus for selectively using transform units according to the correlation of residual coefficients according to the present embodiment will be described in detail.
At first, Eq. 6 and Eq. 7 express the first order discrete sine transform (DST) and the first order inverse discrete sine transform (IDST) .
Y(Jc) = . Y X(n) sm — — -, 0 ≤ k ≤ N -l i N + lti N -H Eq . 6
π(k + l)(w + 1)
X(n) = A V r(A) Sm W±l>Vl±±lt 0 < n ≤ N -l
Eq . 7
In Eq. 6 and Eq. 7, X denotes a residual coefficient to be processed through DST, Y is a DST processed coefficient, and N denotes a unit side of DST.
In order to use Eq. 6 and Eq. 7 in a video coding apparatus, Eq. 6 and Eq. 7 are converted to a 4x4 discrete sine transform matrix and an inverse discrete sine transform matrix as shown in Eq. 8 and Eq. 9.
Figure imgf000019_0001
Figure imgf000019_0002
In Eq. 8, C denotes a DST matrix for each row of X and Cτ denotes a DST matrix transposed for each column of X. In Eq. 9, C and Cτ are identical to those in Eq. 8. Also, X' denotes a restored residual coefficient and Y' denotes an inverse-quantized transform coefficient. Elements a and b in the matrix denote constants Vi"11"?' and V^S111<f"y .
Therefore, the DST is performed by the DST processor 35 on a 4x4 block basis for the residual coefficient generated after inter predict-icn and intra prediction as shown in Eq. 8 as a method of a H.264/MPEG-4 AVC transform unit.
After performing the DST through Eq. 8, the discrete sine-transformed coefficient is quantized through the second quantization process of Eq. 10 by the quantization unit 36, thereby generating a quantized DST coefficient.
Y
Zυ = round 4-0.5
QStep
Eq. 10
In Eq. 10, Z13 denotes a quantized DST coefficient located at a position {i,j) of a matrix. QStep denotes a step size of a quantization unit, and round () denotes a rounding off function. On the contrary, the transformed bitstream is processed through inverse quantization using an inverse quantization unit 37 and 4x4 IDST using an IDST processor 38 in a decoding procedure. Hereinafter, the operations of the inverse quantization unit 37 and the IDST processor 38 will be described.
At first, the inverse quantization unit 37 performs inverse quantization onto the quantized DST coefficient as shown in Eq. 11.
Figure imgf000020_0001
Then, the DST coefficient 4x4 matrix Ϋ is converted to a 4x4 restored residual coefficient X through IDST by the IDST processor 38 as shown in Eq. 9.
Then, the restored residual coefficient x'ij is transformed to x" ±j through rounding off as shown in Eq. 12.
Xϋ " = round (χy ' +0.5)
Eq. 12
In Eq. 12, X* ±j denotes a final restored residual coefficient of a 4X4 block.
As described above, the DST, the second quantization, the second inverse quantization, and the IDST are completely performed.
As described above, the information about a transform unit (DCT or DST) selected according to the correlation of residual signals by the encoding apparatus is recorded in a 1-bit flag bit which is added on a macroblock basis. Then, the flag bit is transmitted to the decoding apparatus of Fig. 4. Therefore, the decoding apparatus is enabled to decode the bitstream with a proper method. Here, the flag bit having information about the selected transform unit may be applied to various unit blocks such as the maximum NxN unit block to minimum 4x4 unit block.
Therefore, a compression rate can be improved by selecting a transform unit by modifying the structure of rate-distortion optimization in the H.264/MPEG-4 AVC encoding apparatus according to the related art to that shown in Fig. 5.
As shown in Fig. 5, intra frame prediction and inter frame prediction are performed at steps S501 and 504. Then, integer approximated discrete cosine transform (DCT), first quantization, first inverse quantization, and integer approximated inverse DCT, and entropy encoding are performed at steps S505 and S506. Then, a mode that minimizes a rate-distortion cost (RDcost) is selected from all possible coding modes used in H.264, such as a variable block mode, three spatial prediction modes, and a SKIP mode at step S507. That is, a transform unit having high compression efficiency is selected. The information about the selected transform unit is recorded at a corresponding flag bit disposed on a macroblock basis and transmitted to the decoding apparatus. Therefore, the decoding apparatus is enabled to decide a proper decoding method using the flag value recorded in the prediction flag.
Hereinafter, the performance of the encoding/decoding apparatus and method for selectively using transform units according to the correlation of residual coefficients according to the present embodiment will be described using results of simulations with various images.
The simulations were performed using a joint model (JM) 10.2 encoder that supports H.264/MPEG-4 AVC. As test images, four 176 x 144 quarter common intermediate format (QCIF) images and four 352 x 288 common intermediate format (CIF) images, which are stored at 30Hz frame rate. Table 3 shows simulation conditions.
Table 3
GOP Structure I PPP
I ntra Period Ev ery 1 0th frame
QP 4, 8, 1 2, 1 6, 20
Search Rang e 1 6
M ultiple Refere nce Frames
Rate Control Off
Entropy Coding Method CABAC
Rate — Distortion Optim ization On
Table 4 shows compression rates obtained from simulations performed under the conditions of Table 3. In the simulations, various images were compressed using the H.264/MPEG-4 AVC compressing method according to the related art and the encoding method according to the present embodiment.
Table 4
Figure imgf000023_0001
Table 4 clearly shows that the performance of the encoding method selectively using the transform unit according to the correlation of residual coefficients according to the present embodiment is much better than the H.264 /MPEG-4 AVC compression method.
Figs. 6, 7, 8, and 9 are rate-distortion graphs of QCIF pictures used in Table 4 for comparing an encoding/decoding method (apparatus) according to the present invention with the encoding/decoding method according to the related art.
Figs. 10, 11, 12 and 13 are rate-distortion graphs of CIF pictures used in Table 4 for comparing an encoding/decoding method (apparatus) according to the present invention with the encoding/decoding method according to the related art.
The rate-distortion graphs also clearly shows that the performance of the encoding method selectively using the transform unit according to the correlation of residual coefficients according to the present embodiment is improved as much as maximum 3db compared to the H.264/MPEG-4 AVC compression method. The method of the present invention described above may be programmed for a computer. Codes and code segments constituting the computer program may be easily inferred by a computer programmer of ordinary skill in the art to which the present invention pertains. The computer program may be stored in a computer-readable recording medium, i.e., data storage, and it may be read and executed by a computer to realize the method of the present invention. The recording medium includes all types of computer-readable recording media. While the present invention has been described with respect to certain preferred embodiments, it will be apparent to those skilled in the art that various changes and modifications may be made without departing from the scope of the invention as defined in the following claims.

Claims

WHAT IS CLAIMED IS:
1. A video encoding apparatus, comprising: a first transforming means for performing discrete cosine transform (DCT) , first quantization, first inverse quantization, and inverse DCT (IDCT) on a block basis onto residual coefficients that are generated after intra frame prediction or inter frame prediction; a second transforming means for performing discrete sine transform (DST) , second quantization, second inverse quantization, and inverse DST (IDST) onto the residual coefficients on a block basis; a selecting means for selecting a transforming means having a high compression rate for each block by performing rate-distortion optimization; and a flag marking means for recording information about the selected transforming means at a flag bit provided on a macroblock basis.
2. The video encoding apparatus of claim 1, wherein the block is an N x M block, where N and M are integer numbers .
3. The video encoding apparatus of claim 1, wherein the information about the selected transforming means is recorded at the flag bit in a macroblock layer header of bitstream.
4. The video encoding apparatus of claim 3, wherein the selecting means selects the first transforming means when correlation of the residual coefficients is high and selects the second transforming means when the correlativity of the residual coefficients is low.
5. The video encoding apparatus of claim 4, wherein the first transforming means includes: a DCT means for performing integer-approximated DCT onto the residual coefficients to thereby generate integer-transformed coefficients; a quantization means for performing first quantization onto the integer-transformed coefficients acquired in the DCT means generating quantized integer- transformed coefficients; an inverse quantization means for performing first inverse quantization onto the quantized integer- transformed coefficients acquired in the quantization means to thereby generate integer-transformed coefficients; and an inverse discrete cosine transform (IDCT) means for restoring the residual coefficients by performing integer-approximated IDCT onto the integer-transformed coefficients acquired in the Inverse quantization means.
6. The video encoding apparatus of claim 4, wherein the second transforming means includes: a discrete sine transform (DST) means for performing DST onto the residual coefficients to thereby generate discrete sine-transformed coefficients; a quantization means for performing second quantization onto the discrete sine-transformed coefficients acquired in the DST means to thereby generate quantized DST coefficiesnts; an inverse quantization means for performing second inverse quantization onto the quantized DST coefficients acquired in the quantization means to thereby generate discrete sine-transformed coefficients; and an inverse discrete sine transform (IDST) means for restoring residual coefficients by performing IDST onto the discrete sine-transformed coefficients acquired in the inverse quantization means.
7. The video encoding apparatus of claim 6, wherein the second transforming means has optimal compression performance when a region of correlation coefficient values is (-0.5, 0.5).
8. The video encoding apparatus of claim 6, wherein the DST means performs DST onto residual coefficients generated after intra frame prediction and inter frame prediction on a 4x4 block basis using an equation 1:
Figure imgf000027_0001
where X denotes a residual coefficient to be discrete sine-transformed;
C denotes a DST matrix for each row of X; and
Cτ denotes a DST matrix transposed for each column of X.
9. The video encoding apparatus of claim 8, wherein the quantization means generates quantized DST coefficients by performing second quantization onto the discrete sine-transformed coefficients using an equation
2:
Z = round + 0.5 ' ' QStep
Eq. 2
where Z13 denotes a quantized discrete sine-transformed coefficient located at a position (i,j) of a matrix; QStep denotes a step size of a quantization unit; and round () denotes a rounding off function.
10. The video encoding apparatus of claim 9, wherein the inverse quantization means performs second inverse quantization onto quantized DST coefficients using an equation 3:
Yp =Z1,- QStep- Eq. 3
where Y ij is an integer-transformed coefficient after inverse quantization.
11. The encoding apparatus of claim 10, wherein the IDST means generates a restored residual coefficient X' of a 4x4 matrix by performing IDST onto a discrete sine- transformed coefficient 4x4 matrix Yr based on an equation 4 :
Figure imgf000028_0001
where C denotes a DST matrix for each row of X;
Cτ denotes a DST matrix transposed for each column of X;
X' denotes a restored residual coefficient;
Y' denotes an inverse-quantized transformed coefficient; and
A— sin(--) elements a and b in the matrix denote constants »5 ^
jsin(-θ and respectively.
12. The video encoding apparatus of claim 11, wherein x"±j is generated by rounding off the restored residual coefficient x' λj using an equation 5:
Figure imgf000029_0001
where x" ±D is a finally restored residual coefficient of a 4x4 unit block .
13. The video encoding apparatus of claim 3, wherein the encoding apparatus performs a transcoder function including an encoding function and a decoding function .
14. A video decoding apparatus, comprising: a flag identifying means for detecting an encoding method of a received bitstream by identifying a flag value included in a header of the received bitstream; and a decoding means for decoding the received bitstream on a block basis by performing first inverse quantization and inverse discrete cosine transform, or second inverse quantization and inverse discrete sine transform according to the encoding method found out by the flag identifying means.
15. The video decoding apparatus of claim 14, wherein the flag value is inserted by an encoding apparatus into a flag bit provided on a macroblock basis, and the flag value corresponds to a transforming scheme having a high compression rate for each block by performing rate-distortion optimization between a first transforming scheme and a second transforming scheme, where the first transforming scheme is to perform discrete cosine transform (DCT), first quantization, first inverse quantization, and inverse DCT on the basis of a block onto residual coefficients generated after inter prediction and intra prediction, and the second transforming scheme is to perform discrete sine transform (DST), second quantization, second inverse quantization, and inverse DST onto the residual coefficients on a block basis .
16. The video decoding apparatus of claim 15, wherein the decoding means performs the second quantization onto quantized DST coefficients using an equation 1 expressed as:
Y' =Zif- QStep- Eq. 1
where Y'±j is an integer-transformed coefficient after inverse quantization.
17. The video decoding apparatus of claim 16, wherein the decoding means generates a restored residual coefficient X' of a 4x4 matrix by performing IDST onto a discrete sine-transformed coefficient 4x4 matrix Y' using an equation 2:
a b b a a b b a b a —a —b b a —a —b
X = C/ YC = b —a —a b [>] b —a —a b a ~b b —a a —b b —a
Eq . 2
where C denotes a DST matrix for each row of X;
Cτ denotes a DST matrix transposed for each column of X;
X' denotes a restored residual coefficient; Y' denotes an inverse-quantized transform coefficient; and elements a and b in the matrix denote constants v? sim 3 and
Figure imgf000031_0001
r respectively.
18. The decoding apparatus of claim 17, wherein x"ij is generated by rounding off the restored residual coefficient X ±j using an equation 3:
X".. = round (X\. +0.5) y V ? / Eq. 3
where X ±j is a final restored residual coefficient of a 4x4 unit block.
19. A video encoding method, comprising the steps of: performing discrete cosine transform (DCT), first quantization, first inverse quantization, and inverse DCT on a block basis onto residual coefficients generated after intra frame prediction or inter frame prediction; performing discrete sine transform (DST) , second quantization, second inverse quantization, and inverse DST on a block basis onto the residual coefficients in addition to the step of performing DCT, first quantization, first inverse quantization, and inverse DCT; selecting a transforming scheme having a high compression rate for each block by performing rate- distortion optimization; and compressing a video on a block basis by recording information about the selected transforming scheme at a flag bit provided on a macroblock basis.
20. The video encoding method of claim 19, wherein the block is an N x M block, where N and M are integer numbers .
21. The video encoding method of claim 19, wherein the information about the selected transforming scheme is recorded at the flag bit in a macroblock layer header of bitstream.
22. The video encoding method of claim 21, wherein the step of performing DCT, first quantization, first inverse quantization, and inverse DCT on a block basis includes the steps of: performing integer-approximated DCT onto the residual coefficients to thereby generate integer- transformed coefficients; performing first quantization onto the integer- transformed coefficients to thereby generate quantized integer-transformed coefficients; performing first inverse quantization onto the quantized integer-transformed coefficients to thereby generate integer-transformed coefficients; and restoring residual coefficients by performing integer-approximated inverse discrete cosine transform (IDCT) onto the integer-transformed coefficients acquired from the step of performing first inverse quantization to perform integer inverse transform.
23. The video encoding method of claim 21, wherein the step of performing DST, second quantization, second inverse quantization, and IDST includes the steps of : performing DST onto the residual coefficients to thereby generate discrete sine-transformed coefficients ; performing second quantization onto the discrete sine-transformed coefficients to thereby generate quantized DST coefficients; performing second inverse quantization onto the quantized DST coefficients to generate discrete sine- transformed coefficients; and restoring residual coefficients by performing IDST onto the discrete sine-transformed coefficients acquired from the step of performing second inverse quantization.
24. The video decoding apparatus of claim 23, wherein the step of performing DST, DST is performed onto residual coefficients generated after intra frame prediction and inter frame prediction on a 4x4 block basis using an equation 1:
Figure imgf000033_0001
where X denotes a residual coefficient to be discrete sine-transformed;
C denotes a DST matrix for each row of X; and
Cτ denotes a DST matrix transposed for each column of X.
25. The video encoding method of claim 24, wherein in the step of performing second quantization, quantized DST coefficients are generated by quantizing the discrete sine-transformed coefficients using an equation 2 expressed as:
Z,, — round
Figure imgf000033_0002
where Z±j denotes a quantized discrete sine-transformed coefficient located at a position (i,j) of a matrix; QStep denotes a step size of a quantization unit; and round () denotes a rounding off function.
26. The video encoding method of claim 25, wherein in the step of performing second inverse quantization onto the quantized DST coefficients, inverse quantization is performed onto the quantized DST coefficients using an equation 3 expressed as:
Yy = Z17 - CJStV1? Eq. 3
where Y' zj is an integer-transformed coefficient after inverse quantization.
27. The video encoding method of claim 26, wherein in the step of restoring residual coefficients, a restored residual coefficient X' of a 4x4 matrix is generated by performing IDST onto a discrete sine- transformed coefficient 4x4 matrix Y' based on an equation 4 :
Figure imgf000034_0001
where C denotes a DST matrix for each row of X;
Cτ denotes a DST matrix transposed for each column of X;
X' denotes a restored residual coefficient;
Y' denotes an inverse-quantized transform coefficient; and ,in<7) elements a and b in the matrix denote constants V | <3 5
I— sin(—O and ^5 5 , respectively.
28. The video encoding method of claim 27, wherein in the step of restoring residual coefficients, X IJ IS generated by rounding off the restored residual coefficient x' ±j using an equation 5:
X".. = round {x^ +0.5)
Eq. 5
where X IJ XS finally restored residual coefficient of a 4x4 unit block.
29. A video decoding method, comprising the steps of: detecting an encoding method of a received bitstream by identifying a flag value included in a header of the received bitstream; and decoding the received bitstream on a block basis by performing first inverse quantization and inverse discrete cosine transform (IDCT), or second inverse quantization and inverse discrete sine transform (IDST) according to the encoding method.
30. The video decoding method of claim 29, wherein the flag value is inserted into a flag bit provided on a macroblock basis, and the flag value indicates a transforming scheme having a high compression rate by performing rate-distortion optimization for each block between a first transforming scheme and a second transforming, where the first transforming scheme performs DCT, first quantization, first inverse quantization, and inverse DCT onto residual coefficients generated after inter prediction and intra prediction on a block basis, and the second transforming scheme performs DST, second quantization, second inverse quantization, and inverse DST onto the residual coefficients on a block basis.
31. The video decoding method of claim 30, wherein in the step of performing first inverse quantization and IDCT or second inverse quantization and IDST, the second quantization is performed onto quantized DST coefficients using an equation 1:
v = QStep Eq. 1
where Ϋ±1 is an integer-transformed coefficient after inverse quantization.
32. The video decoding apparatus of claim 31, wherein in the step of performing first inverse quantization and IDCT or second inverse quantization and IDST, a restored residual coefficient X' of a 4x4 matrix is generated by performing IDST onto a discrete sine- transformed coefficient 4x4 matrix Y' using an equation 2:
Figure imgf000036_0001
where C denotes a DST matrix for each row of X; Cτ denotes a DST matrix transposed for each column of X;
X' denotes a restored residual coefficient;
Y' denotes an inverse-quantized transform coefficient; and
elements a and b in the matrix denote constants
Figure imgf000037_0001
and
Figure imgf000037_0002
, respectively.
33. The video decoding method of claim 32, wherein x"ij is generated by rounding off the restored residual coefficient X ±j using an equation 3:
Xv " = round (Xv ' +0.5)
Eq. 3
where X ±j is a final restored residual coefficient of a 4x4 unit block.
PCT/KR2007/001809 2006-09-20 2007-04-13 Apparatus and method for encoding and decoding using alternative converter according to the correlation of residual signal WO2008035842A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/441,940 US20090238271A1 (en) 2006-09-20 2007-04-13 Apparatus and method for encoding and decoding using alternative converter accoding to the correlation of residual signal

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR10-2006-0091426 2006-09-20
KR20060091426 2006-09-20
KR10-2007-0036089 2007-04-12
KR1020070036089A KR100927733B1 (en) 2006-09-20 2007-04-12 An apparatus and method for encoding / decoding selectively using a transformer according to correlation of residual coefficients

Publications (1)

Publication Number Publication Date
WO2008035842A1 true WO2008035842A1 (en) 2008-03-27

Family

ID=39200652

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2007/001809 WO2008035842A1 (en) 2006-09-20 2007-04-13 Apparatus and method for encoding and decoding using alternative converter according to the correlation of residual signal

Country Status (1)

Country Link
WO (1) WO2008035842A1 (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101605255B (en) * 2008-06-12 2011-05-04 华为技术有限公司 Method and device for encoding and decoding video
EP2346258A2 (en) * 2008-10-02 2011-07-20 Electronics and Telecommunications Research Institute Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
US20120099642A1 (en) * 2009-07-06 2012-04-26 Joel Sole Methods and apparatus for spatially varying residue coding
CN102484707A (en) * 2009-07-04 2012-05-30 Sk电信有限公司 Image encoding/decoding method and apparatus
GB2487777A (en) * 2011-02-04 2012-08-08 Canon Kk Estimating motion in a sequence of digital images
JP2013522957A (en) * 2010-03-10 2013-06-13 トムソン ライセンシング Method and apparatus for performing constrained transforms for video encoding and decoding with transform selection
JP2014220624A (en) * 2013-05-07 2014-11-20 日本放送協会 Image processing apparatus, encoding apparatus, and program
AU2012326873B2 (en) * 2011-10-17 2015-12-24 Kt Corporation Method and apparatus for encoding/decoding image
CN107835414A (en) * 2011-10-18 2018-03-23 株式会社Kt Video signal decoding method
EP3399748A1 (en) * 2009-09-10 2018-11-07 Guangdong OPPO Mobile Telecommunications Corp., Ltd. Speedup techniques for rate distortion optimized quantization
US10356422B2 (en) 2015-03-06 2019-07-16 Qualcomm Incorporated Fast rate-distortion optimized quantization
CN111556319A (en) * 2020-05-14 2020-08-18 电子科技大学 Video coding method based on matrix decomposition

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0522715A (en) * 1991-07-12 1993-01-29 Sony Corp Picture encoder
US6611560B1 (en) * 2000-01-20 2003-08-26 Hewlett-Packard Development Company, L.P. Method and apparatus for performing motion estimation in the DCT domain
US6876703B2 (en) * 2000-05-11 2005-04-05 Ub Video Inc. Method and apparatus for video coding

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0522715A (en) * 1991-07-12 1993-01-29 Sony Corp Picture encoder
US6611560B1 (en) * 2000-01-20 2003-08-26 Hewlett-Packard Development Company, L.P. Method and apparatus for performing motion estimation in the DCT domain
US6876703B2 (en) * 2000-05-11 2005-04-05 Ub Video Inc. Method and apparatus for video coding

Cited By (54)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101605255B (en) * 2008-06-12 2011-05-04 华为技术有限公司 Method and device for encoding and decoding video
KR102129046B1 (en) * 2008-10-02 2020-07-01 인텔렉추얼디스커버리 주식회사 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
KR101619972B1 (en) 2008-10-02 2016-05-11 한국전자통신연구원 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
US11538198B2 (en) 2008-10-02 2022-12-27 Dolby Laboratories Licensing Corporation Apparatus and method for coding/decoding image selectively using discrete cosine/sine transform
KR102468143B1 (en) * 2008-10-02 2022-11-18 돌비 레버러토리즈 라이쎈싱 코오포레이션 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
US11176711B2 (en) 2008-10-02 2021-11-16 Intellectual Discovery Co., Ltd. Apparatus and method for coding/decoding image selectively using discrete cosine/sine transform
KR20210073509A (en) * 2008-10-02 2021-06-18 인텔렉추얼디스커버리 주식회사 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
WO2010039015A3 (en) * 2008-10-02 2013-01-03 한국전자통신연구원 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
KR20200078461A (en) * 2008-10-02 2020-07-01 인텔렉추얼디스커버리 주식회사 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
KR101805429B1 (en) * 2008-10-02 2017-12-06 한국전자통신연구원 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
CN102334337A (en) * 2008-10-02 2012-01-25 韩国电子通信研究院 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
KR102266576B1 (en) * 2008-10-02 2021-06-18 인텔렉추얼디스커버리 주식회사 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
KR20200015678A (en) * 2008-10-02 2020-02-12 인텔렉추얼디스커버리 주식회사 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
EP2346258A2 (en) * 2008-10-02 2011-07-20 Electronics and Telecommunications Research Institute Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
KR102076092B1 (en) * 2008-10-02 2020-02-11 인텔렉추얼디스커버리 주식회사 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
KR20190042538A (en) * 2008-10-02 2019-04-24 인텔렉추얼디스커버리 주식회사 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
CN105306939A (en) * 2008-10-02 2016-02-03 韩国电子通信研究院 Apparatus and method for coding/decoding videos
KR101971909B1 (en) * 2008-10-02 2019-04-24 인텔렉추얼디스커버리 주식회사 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
EP3154266A1 (en) * 2008-10-02 2017-04-12 Electronics and Telecommunications Research Institute Apparatus and method for coding/decoding image selectivly using discrete cosine/sine transtorm
KR20170135807A (en) * 2008-10-02 2017-12-08 한국전자통신연구원 Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
EP2346258A4 (en) * 2008-10-02 2014-03-19 Korea Electronics Telecomm Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm
CN104869419A (en) * 2009-07-04 2015-08-26 Sk电信有限公司 Image encoding/decoding method and apparatus
CN102484707A (en) * 2009-07-04 2012-05-30 Sk电信有限公司 Image encoding/decoding method and apparatus
CN102484707B (en) * 2009-07-04 2015-06-10 Sk电信有限公司 Image encoding/decoding method and apparatus
CN104869418A (en) * 2009-07-04 2015-08-26 Sk电信有限公司 Image encoding/decoding method and apparatus
US20120099642A1 (en) * 2009-07-06 2012-04-26 Joel Sole Methods and apparatus for spatially varying residue coding
CN102484701A (en) * 2009-07-06 2012-05-30 汤姆逊许可证公司 Methods and apparatus for spatially varying residue coding
US9736500B2 (en) * 2009-07-06 2017-08-15 Thomson Licensing Methods and apparatus for spatially varying residue coding
CN107277512A (en) * 2009-07-06 2017-10-20 汤姆逊许可证公司 Method and apparatus for spatial variations residual coding
EP3399748A1 (en) * 2009-09-10 2018-11-07 Guangdong OPPO Mobile Telecommunications Corp., Ltd. Speedup techniques for rate distortion optimized quantization
US11190780B2 (en) 2009-09-10 2021-11-30 Guangdong Oppo Mobile Telecommunications Corp., Ltd. Speedup techniques for rate distortion optimized quantization
US11039152B2 (en) 2009-09-10 2021-06-15 Guangdong Oppo Mobile Telecommunications Corp., Ltd. Speedup techniques for rate distortion optimized quantization
JP2013522957A (en) * 2010-03-10 2013-06-13 トムソン ライセンシング Method and apparatus for performing constrained transforms for video encoding and decoding with transform selection
US9277245B2 (en) 2010-03-10 2016-03-01 Thomson Licensing Methods and apparatus for constrained transforms for video coding and decoding having transform selection
GB2487777A (en) * 2011-02-04 2012-08-08 Canon Kk Estimating motion in a sequence of digital images
GB2487777B (en) * 2011-02-04 2015-01-07 Canon Kk Method and device for motion estimation in a sequence of images
US9560385B2 (en) 2011-10-17 2017-01-31 Kt Corporation Method and apparatus for encoding/decoding image
US9661346B2 (en) 2011-10-17 2017-05-23 Kt Corporation Method and apparatus for encoding/decoding image
AU2012326873B2 (en) * 2011-10-17 2015-12-24 Kt Corporation Method and apparatus for encoding/decoding image
US9661352B2 (en) 2011-10-17 2017-05-23 Kt Corporation Method and apparatus for encoding/decoding image
US9826251B2 (en) 2011-10-17 2017-11-21 Kt Corporation Method and apparatus for encoding/decoding image
US9560384B2 (en) 2011-10-17 2017-01-31 Kt Corporation Method and apparatus for encoding/decoding image
US9661354B2 (en) 2011-10-17 2017-05-23 Kt Corporation Method and apparatus for encoding/decoding image
CN107959857A (en) * 2011-10-18 2018-04-24 株式会社Kt Video signal decoding method
CN107835414B (en) * 2011-10-18 2020-11-06 株式会社Kt Video signal decoding method
US10575015B2 (en) 2011-10-18 2020-02-25 Kt Corporation Method and apparatus for decoding a video signal using adaptive transform
CN107835414A (en) * 2011-10-18 2018-03-23 株式会社Kt Video signal decoding method
CN107959858A (en) * 2011-10-18 2018-04-24 株式会社Kt Video signal decoding method
CN107959857B (en) * 2011-10-18 2022-03-01 株式会社Kt Video signal decoding method
US10264283B2 (en) 2011-10-18 2019-04-16 Kt Corporation Method and apparatus for decoding a video signal using adaptive transform
JP2014220624A (en) * 2013-05-07 2014-11-20 日本放送協会 Image processing apparatus, encoding apparatus, and program
US10356422B2 (en) 2015-03-06 2019-07-16 Qualcomm Incorporated Fast rate-distortion optimized quantization
CN111556319A (en) * 2020-05-14 2020-08-18 电子科技大学 Video coding method based on matrix decomposition
CN111556319B (en) * 2020-05-14 2021-12-17 电子科技大学 Video coding method based on matrix decomposition

Similar Documents

Publication Publication Date Title
US11538198B2 (en) Apparatus and method for coding/decoding image selectively using discrete cosine/sine transform
US10924734B2 (en) Method and apparatus of deriving quantization parameter
US20230247229A1 (en) Video encoding method for encoding division block, video decoding method for decoding division block, and recording medium for implementing the same
US20090238271A1 (en) Apparatus and method for encoding and decoding using alternative converter accoding to the correlation of residual signal
WO2008035842A1 (en) Apparatus and method for encoding and decoding using alternative converter according to the correlation of residual signal
KR101431545B1 (en) Method and apparatus for Video encoding and decoding
KR101095938B1 (en) Apparatus and Method for Encoding and Decoding Moving Picture using Adaptive Scanning
KR101375891B1 (en) Video coding with large macroblocks
EP2868080B1 (en) Method and device for encoding or decoding an image
KR101344115B1 (en) Video coding with large macroblocks
KR101228020B1 (en) Video coding method and apparatus using side matching, and video decoding method and appartus thereof
KR101232420B1 (en) Rate-distortion quantization for context-adaptive variable length coding (cavlc)
CA2822391C (en) Enhanced intra-prediction coding using planar representations
RU2734800C2 (en) Method of encoding and decoding images, an encoding and decoding device and corresponding computer programs
US20110150072A1 (en) Encoding method, decoding method and apparatus thereof
KR101712097B1 (en) Method and apparatus for encoding and decoding image based on flexible orthogonal transform
US20070171970A1 (en) Method and apparatus for video encoding/decoding based on orthogonal transform and vector quantization
KR20070005848A (en) Method and apparatus for intra prediction mode decision
WO2013067320A1 (en) Secondary boundary filtering for video coding
EP2156674A1 (en) Method and apparatus for intraprediction encoding/decoding using image inpainting
US20080107175A1 (en) Method and apparatus for encoding and decoding based on intra prediction
KR101496324B1 (en) Method and apparatus for video encoding, and method and apparatus for video decoding
US8306115B2 (en) Method and apparatus for encoding and decoding image
KR20070077609A (en) Method and apparatus for deciding intra prediction mode
JP2007266861A (en) Image encoding device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07745972

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 12441940

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07745972

Country of ref document: EP

Kind code of ref document: A1