WO2008020672A1 - Apparatus for encoding and decoding image using adaptive dct coefficient scanning based on pixel similarity and method therefor - Google Patents

Publication number
WO2008020672A1
Authority
WO
WIPO (PCT)
Prior art keywords
scanning
encoding
directional
horizontal
vertical
Application number
PCT/KR2007/001433
Other languages
French (fr)
Inventor
Se-Yoon Jeong
Hae-Chul Choi
Jeong-Il Seo
Seung-Kwon Beack
In-Seon Jang
Jae-Gon Kim
Kyung-Ae Moon
Dae-Young Jang
Jin-Woo Hong
Jin-Woong Kim
Yung-Lyul Lee
Dong-Gyu Sim
Seoung-Jun Oh
Chang-Beom Ahn
Dae-Yeon Kim
Dong-Kyun Kim
Original Assignee
Electronics And Telecommunications Research Institute
Kwangwoon University Research Institute For Industry Cooperation
Industry-Academia Cooperation Group Of Sejong University
Priority claimed from KR1020070008247A (KR100882949B1)
Application filed by Electronics And Telecommunications Research Institute, Kwangwoon University Research Institute For Industry Cooperation, Industry-Academia Cooperation Group Of Sejong University
Priority to US12/377,617 (US8548060B2)
Publication of WO2008020672A1
Priority to US13/975,213 (US9154784B2)
Priority to US13/975,251 (US9225982B2)
Priority to US14/823,273 (US9736484B2)
Priority to US14/823,186 (US9819942B2)
Priority to US15/784,623 (US10499064B2)
Priority to US16/438,785 (US11330274B2)
Priority to US17/219,964 (US11949881B2)
Priority to US17/219,985 (US20210218973A1)

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103 Selection of coding mode or of prediction mode
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding

Definitions

  • the present invention relates to an encoding/decoding apparatus and method using an adaptive Discrete Cosine Transform (DCT) coefficient scanning based on pixel similarity. More particularly, the present invention relates to an encoding/decoding apparatus and method which performs intra prediction onto input video, predicts pixel similarity based on pixel similarity information of coefficients to be encoded that is acquired from adjacent pixels in the intra-predicted video, and performs a most effective scanning, e.g., Discrete Cosine Transform (DCT) coefficient scanning, according to the predicted pixel similarity.
  • DCT Discrete Cosine Transform
  • a frame is divided into a plurality of macro blocks and a macro block may be divided into a plurality of sub-blocks.
  • the encoding/decoding is performed on the basis of a macro block unit or a sub-block unit based on temporal prediction and spatial prediction.
  • the temporal prediction is to predict motion of macro blocks or sub-blocks of a current frame by referring to blocks of adjacent frames.
  • the spatial prediction is to predict macro blocks or sub-blocks of a current frame to be encoded by using boundary pixels of already recovered adjacent blocks.
  • the spatial prediction is also called intra prediction.
  • the intra prediction takes advantage of a characteristic that when a pixel is predicted, pixels adjacent to it are highly likely to have similar values.
  • H.264/Advanced Video Coding (AVC) standard technology achieves a compression rate about twice that of Moving Picture Experts Group 2 (MPEG-2) and about one and a half times that of MPEG-4 by using such techniques as intra prediction encoding, 1/4-pixel variable block motion prediction and compensation, Context-Based Adaptive Variable Length Coding (CAVLC), and Context-Based Adaptive Binary Arithmetic Coding (CABAC).
  • MPEG-2 Moving Picture Experts Group 2
  • CABAC Context-Based Adaptive Binary Arithmetic Coding
  • the H.264/AVC standard predicts pixel values of a current block by using prediction modes of 9 directivities.
  • Fig. 1 illustrates 9 prediction modes used for intra prediction of 4 x 4 blocks.
  • the 9 prediction modes used for intra prediction of 4 x 4 blocks include a vertical mode (mode 0), a horizontal mode (mode 1), a direct current (DC) mode (mode 2), a diagonal_down_left mode (mode 3), a diagonal_down_right mode (mode 4), a vertical_right mode (mode 5), a horizontal_down mode (mode 6), a vertical_left mode (mode 7), and a horizontal_up mode (mode 8).
  • intra prediction is performed using a mean value of adjacent pixels.
  • the arrows indicate prediction directions.
  • intra 16 x 16 prediction encoding includes a total of four modes, which are a vertical mode, a horizontal mode, a DC mode, and a plane mode.
  • intra 8 x 8 prediction encoding includes a total of 9 modes, just like the intra 4 x 4 prediction encoding.
  • as for color difference signals, intra 8 x 8 prediction encoding is performed, and the intra 8 x 8 prediction encoding includes a DC mode, a vertical mode, a horizontal mode, a plane mode, and so on.
  • Fig. 2 exemplarily illustrates a pixel prediction method in a vertical direction.
  • pixel a 201, pixel e 202, pixel i 203, and pixel m 204 are predicted based on an adjacent pixel A in the vertical direction.
  • pixels b, f, j and n are predicted based on an adjacent pixel B in the vertical direction.
  • pixels c, g, k and o are predicted based on an adjacent pixel C in the vertical direction.
  • Pixels d, h, l and p are predicted based on an adjacent pixel D in the vertical direction.
  • Fig. 3 exemplarily illustrates a pixel prediction method in a horizontal direction.
  • pixel a 205, pixel b 206, pixel c 207, and pixel d 208 are predicted based on an adjacent pixel I in a horizontal direction.
  • pixels e, f, g and h are predicted based on an adjacent pixel J in the horizontal direction
  • pixels i, j, k and l are predicted based on an adjacent pixel K in the horizontal direction.
  • Pixels m, n, o and p are predicted based on an adjacent pixel L in the horizontal direction.
  • An encoder performs Discrete Cosine Transform (DCT) and quantization onto residual signals (which are in the pixel domain) acquired by taking the difference between the predicted pixels and the current pixels. Subsequently, the encoder performs zigzag scanning and entropy encoding onto the transformed coefficients obtained from DCT and quantization.
  • although the zigzag scanning takes advantage of the energy compaction characteristic of transformed coefficients, in which energy converges onto low frequencies and little energy appears at high frequencies, the energy compaction after intra prediction is not always effective.
  • the zigzag scanning is a method of scanning a transformed coefficient from low frequency components to high frequency components. When distribution of transformed coefficients appears more in the low frequency components, the zigzag scanning is effective.
  • when spatial prediction having directivity is used, the distribution of transformed coefficients is influenced by the direction of prediction. Therefore, it is ineffective to apply the zigzag scanning to the prediction of all directions.
  • An embodiment of the present invention which is devised to overcome the above problems, is directed to providing an encoding/decoding apparatus and method which performs intra prediction onto input video, predicts pixel similarity based on pixel similarity information of coefficients to be encoded acquired from adjacent pixels in the intra-predicted video, and performs a most effective scanning, e.g., DCT coefficient scanning, according to the predicted pixel similarity.
  • an encoding apparatus using a Discrete Cosine Transform (DCT) scanning which includes: a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
  • a decoding apparatus using a DCT scanning which includes: an entropy decoding means for performing entropy decoding onto encoded video; a scanning decision means for deciding a scanning mode for the video decoded in the entropy decoding means; and a video recovery means for recovering the video based on the scanning mode decided in the scanning decision means.
  • an encoding method using a DCT scanning which includes the steps of: selecting an optimal mode for intra prediction; performing intra prediction onto video inputted based on the mode selected in the mode selection step; performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction step; deciding pixel similarity of the residual coefficients; and performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided in the pixel similarity decision step.
  • a decoding method using a DCT scanning which includes the steps of: performing entropy decoding onto encoded video; deciding a scanning mode for the video decoded in the entropy decoding step; and recovering the video based on the scanning mode decided in the scanning decision step.
  • a luminance block may go through an intra 4 x 4 luminance encoding mode of H.264/Advanced Video Coding (AVC), which includes a vertical mode, a horizontal mode, a diagonal_down_left mode, a diagonal_down_right mode, a vertical_right mode, a horizontal_down mode, a vertical_left mode, and a horizontal_up mode, and an intra 16 x 16 luminance encoding mode of H.264/AVC, which includes a vertical mode, a horizontal mode, a plane mode, and a DC mode.
  • AVC Advanced Video Coding
  • a chrominance block may go through an intra M x N chrominance encoding mode of H.264/AVC, which includes a vertical mode, a horizontal mode, a plane mode and a DC mode.
  • the present invention can improve a compression rate of intra encoding by applying a most effective scanning method according to pixel similarity in order to encode/decode video.
  • the present invention can improve a video compression rate by being applied to a video compression technology using intra prediction, which will be developed in future.
  • the present invention can reduce a need for an additional module by applying the same similarity information to both encoder and decoder.
  • Fig. 1 illustrates 9 prediction modes used for intra prediction of 4 x 4 blocks according to H.264/AVC.
  • Fig. 2 exemplarily illustrates a pixel prediction method in a vertical mode.
  • Fig. 3 exemplarily illustrates a pixel prediction method in a horizontal direction.
  • Fig. 4 is a block view showing an encoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity in accordance with an embodiment of the present invention.
  • Fig. 5 exemplarily illustrates a zigzag scanning method used in the present invention.
  • Fig. 6 exemplarily illustrates a horizontal scanning method used in the present invention.
  • Fig. 7 exemplarily illustrates a vertical scanning method used in the present invention.
  • Fig. 8 illustrates a method for predicting pixel similarity in vertical and horizontal directions in accordance with an embodiment of the present invention.
  • Fig. 9 is a flowchart describing an adaptive scanning method based on pixel similarity in a vertical intra prediction mode in accordance with an embodiment of the present invention.
  • Fig. 10 is a flowchart describing an adaptive scanning method based on pixel similarity in a horizontal intra prediction mode in accordance with an embodiment of the present invention.
  • Fig. 11 is a block view showing a decoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity in accordance with an embodiment of the present invention.
  • Fig. 4 is a block view showing an encoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity in accordance with an embodiment of the present invention.
  • the encoding apparatus based on DCT coefficient scanning adaptive to pixel similarity includes a mode selection unit 10, an intra prediction unit 20, a DCT and quantization unit 30, and an entropy encoding unit 40.
  • the mode selection unit 10 selects an optimal mode among several available prediction modes for intra prediction. In other words, it selects one among a plurality of possible encoding modes when 4 x 4, 16 x 16, or 8 x 8 intra prediction is performed.
  • the mode selection unit 10 selects one mode according to a rate-distortion optimization method that minimizes the rate-distortion cost.
  • the intra prediction unit 20 receives a video, and performs 4 x 4 intra prediction for pixels of luminance blocks and 8 x 8 intra prediction for pixels of chrominance blocks based on a mode selected in the mode selection unit 10.
  • the DCT and quantization unit 30 performs DCT and quantization onto difference values outputted from the mode selection unit 10, that is, onto residual coefficient blocks indicating differences between pixel values of macro blocks of a current frame to be encoded and predicted pixel values, and transmits resulting coefficients to the entropy encoding unit 40.
  • the entropy encoding unit 40 arrays DCT coefficients obtained in the DCT and quantization unit 30 by using an adaptive DCT coefficient scanning based on pixel similarity, performs entropy encoding onto the arrayed DCT coefficients, and outputs the result.
  • the entropy encoding is an encoding technique for enhancing a compression rate by allocating fewer bits to data that are highly likely to occur and more bits to data that are unlikely to occur. Examples of the entropy encoding used in the present invention include Context Adaptive Variable Length Coding (CAVLC) and Context-Based Adaptive Binary Arithmetic Coding (CABAC).
  • Figs. 8 to 10, described hereafter, illustrate a method of predicting pixel similarity in the vertical and horizontal directions in the entropy encoding unit 40, and a scanning method in the vertical and horizontal intra prediction modes.
  • Fig. 5 exemplarily illustrates a typical zigzag scanning method used in the present invention.
  • Fig. 6 exemplarily illustrates a typical horizontal scanning method used in the present invention.
  • Fig. 7 exemplarily illustrates a typical vertical scanning method used in the present invention.
  • the zigzag scanning method used in the present invention is devised in consideration that low frequency components of the transformed coefficients acquired from the DCT and quantization are highly likely to be positioned in the upper left part of a two-dimensional plane. It takes advantage of the energy compaction characteristic of transformed coefficients: after DCT, coefficients appear mostly at low frequencies and rarely at high frequencies.
  • the zigzag scanning method may be more efficient when pixel similarity in the horizontal direction is similar to the pixel similarity in the vertical direction.
  • the vertical prediction mode is selected as an optimal mode in a rate-distortion process, when the pixel similarity in the vertical direction is high.
  • significant coefficients are distributed in the first row. Therefore, the horizontal scanning shown in Fig. 6 is more efficient than the typical zigzag scanning.
  • the horizontal prediction mode is selected as an optimal mode, when the pixel similarity in the horizontal direction is high.
  • significant coefficients are distributed in the first column. Therefore, the vertical scanning shown in Fig. 7 is more efficient.
  • Fig. 8 illustrates a method for predicting pixel similarity in vertical and horizontal directions in accordance with an embodiment of the present invention.
  • pixels A, B, C and D are positioned in the upper part of a current block to be encoded, whereas pixels E, F, G and H are positioned in the left part of the current block to be encoded.
  • the vertical-directional pixel similarity of the pixels a, e, i and m, which are positioned in the first column of the current block to be encoded, is the same as the vertical-directional pixel similarity of the residual coefficients a-A, e-A, i-A, and m-A after vertical prediction. This is because the residual coefficients a-A, e-A, i-A, and m-A are obtained by subtracting the same prediction pixel A from the pixels a, e, i and m, and thus the correlation does not change.
  • the vertical-directional pixel similarity of the pixels in the second, third and fourth columns of the block is the same as the vertical-directional pixel similarity of the residual coefficients after vertical prediction.
  • the horizontal-directional pixel similarity of the 1-row pixels a, b, c and d of the current block to be encoded is different from the horizontal-directional pixel similarity of residual coefficients a-A, b-B, c-C, and d-D after vertical prediction.
  • horizontal-directional pixel similarity before vertical prediction becomes higher than the horizontal-directional pixel similarity after the vertical prediction.
  • horizontal-directional pixel similarity of the pixels a, b, c and d in a first row of a block is the same as the horizontal-directional pixel similarity of residual coefficients a-E, b-E, c-E, and d-E after horizontal prediction.
  • the horizontal-directional pixel similarity of the pixels in the second, third and fourth rows of the block is the same as the horizontal-directional pixel similarity of the residual coefficients after horizontal prediction.
  • the vertical-directional pixel similarity of the first-column pixels a, e, i and m of the block is different from the vertical-directional pixel similarity of the residual coefficients a-E, e-F, i-G, and m-H after horizontal prediction.
  • the vertical-directional pixel similarity before horizontal prediction becomes higher than the vertical-directional pixel similarity after the horizontal prediction. Thus, it becomes similar to or higher than the horizontal-directional pixel similarity.
  • Variance () denotes a dispersion
  • a value obtained by multiplying S_HOR by a multiplication factor α (α ≥ 1) is used as a horizontal-directional pixel similarity prediction value of residual coefficients of the current block.
  • the α value is fixed at 2 in an experiment.
  • S_VER is used, as it is, as a vertical-directional pixel similarity prediction value of the residual coefficients of the current block.
  • a value obtained by multiplying S_VER by a multiplication factor β (β ≥ 1) is used as a vertical-directional pixel similarity prediction value of the residual coefficients of the current block.
  • the β value is fixed at 2 in an experiment.
  • S_HOR is used, as it is, as a horizontal-directional pixel similarity prediction value of the residual coefficients of the current block.
  • the vertical and horizontal-directional pixel similarity prediction values acquired in the above methods are compared to each other to decide a scanning method.
  • the present invention is not limited to the mode and the present invention can be applied to an M x N intra prediction mode, too.
  • Fig. 9 is a flowchart describing an adaptive scanning method based on pixel similarity in a vertical intra prediction mode in accordance with an embodiment of the present invention.
  • in a vertical intra prediction mode (step S601), the S_VER value and the value of α × S_HOR are compared in step S602.
  • a horizontal scanning method is used in step S603.
  • a zigzag scanning method is used in step S604.
  • Fig. 10 is a flowchart describing an adaptive scanning method based on pixel similarity in a horizontal intra prediction mode in accordance with an embodiment of the present invention.
  • in a horizontal intra prediction mode (step S701), the S_HOR value and the value of β × S_VER are compared in step S702.
  • When the S_HOR value is greater than the value of β × S_VER, a vertical scanning method is used in step S703. When the S_HOR value is smaller than the value of β × S_VER, a zigzag scanning method is used in step S704.
  • Fig. 11 is a block view showing a decoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity in accordance with an embodiment of the present invention.
  • the decoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity includes an entropy decoding unit 50, a scanning decision unit 60, and a video recovery unit 70.
  • the entropy decoding unit 50 receives video bitstream encoded in the encoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity and decodes it through an entropy decoding method such as CAVLC or CABAC. Then, the entropy decoding unit 50 transmits the entropy-decoded video bitstream to the scanning decision unit 60.
  • the scanning decision unit 60 decides a scanning method for the coefficients decoded in the entropy decoding unit 50 according to an intra prediction mode, as described in the above with reference to Figs. 8 to 11.
  • the video recovery unit 70 finally recovers the coefficients by using the scanning method decided in the scanning decision unit 60 to recover the video.
  • JM86 Joint Model 86
  • H.264/AVC Reference Codec
  • Table 2 presents video compression rates when the test videos were compressed using a conventional compression method, which is a zigzag scanning method of H.264/AVC, and the compression method of the present invention, which is the adaptive scanning method according to intra prediction mode under the same conditions as the Table 1.
  • Table 2 shows that the result of video compression using the adaptive scanning method according to the intra prediction mode, which is suggested in the present invention, is superior to that of video compression using only the conventional zigzag scanning method of H.264/AVC.
  • the method of the present invention described above may be realized as a program and stored in a computer-readable recording medium such as CD-ROM, RAM, ROM, floppy disks, hard disks, magneto-optical disks and so forth. Since the process can be easily implemented by those skilled in the art to which the present invention pertains, further description of it will not be provided herein. While the present invention has been described with respect to certain preferred embodiments, it will be apparent to those skilled in the art that various changes and modifications may be made without departing from the scope of the invention as defined in the following claims.
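
The scan decision described for Figs. 9 and 10 can be sketched roughly as follows. This is only an illustration under stated assumptions: the function names `select_scan`, `scan_order` and `scan` are hypothetical; S_VER and S_HOR are taken as already-computed inputs because their defining formulas are not reproduced in this text; the vertical-mode rule is assumed to mirror the horizontal-mode rule stated for steps S703 and S704; ties and all other intra modes are assumed to fall back to zigzag; and the horizontal and vertical scans are assumed to be plain row-by-row and column-by-column orders, which may differ from Figs. 6 and 7.

```python
def select_scan(intra_mode, s_ver, s_hor, alpha=2.0, beta=2.0):
    """Pick a scan name from the intra mode and the two similarity values."""
    if intra_mode == "horizontal":
        # Rule stated in the text for steps S702 to S704.
        return "vertical" if s_hor > beta * s_ver else "zigzag"
    if intra_mode == "vertical":
        # Assumption: mirror image of the horizontal-mode rule (steps S602 to S604).
        return "horizontal" if s_ver > alpha * s_hor else "zigzag"
    # Assumption: every other mode keeps the conventional zigzag scan.
    return "zigzag"


def scan_order(method, n=4):
    """Return the (row, column) visiting order for an n x n block."""
    if method == "horizontal":  # assumed row-by-row order
        return [(r, c) for r in range(n) for c in range(n)]
    if method == "vertical":    # assumed column-by-column order
        return [(r, c) for c in range(n) for r in range(n)]
    order = []                  # zigzag: walk the anti-diagonals, alternating direction
    for s in range(2 * n - 1):
        rows = range(max(0, s - n + 1), min(s, n - 1) + 1)
        if s % 2 == 0:
            rows = reversed(rows)
        order.extend((r, s - r) for r in rows)
    return order


def scan(block, method):
    """Flatten a 2-D coefficient block into a 1-D list using the chosen scan."""
    return [block[r][c] for r, c in scan_order(method, len(block))]


# Coefficients concentrated in the first row, as described for vertical prediction.
coeffs = [[9, 5, 3, 1], [0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
chosen = select_scan("vertical", s_ver=5.0, s_hor=1.0)
flat = scan(coeffs, chosen)
```

With these inputs the horizontal scan places all four significant coefficients at the front of the one-dimensional list, whereas a zigzag scan would interleave zeros between them.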

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention discloses an encoding apparatus using a Discrete Cosine Transform (DCT) scanning, which includes: a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.

Description

DESCRIPTION
APPARATUS FOR ENCODING AND DECODING IMAGE USING ADAPTIVE DCT COEFFICIENT SCANNING BASED ON PIXEL SIMILARITY AND METHOD THEREFOR
TECHNICAL FIELD
The present invention relates to an encoding/decoding apparatus and method using an adaptive Discrete Cosine Transform (DCT) coefficient scanning based on pixel similarity. More particularly, the present invention relates to an encoding/decoding apparatus and method which performs intra prediction onto input video, predicts pixel similarity based on pixel similarity information of coefficients to be encoded that is acquired from adjacent pixels in the intra-predicted video, and performs a most effective scanning, e.g., Discrete Cosine Transform (DCT) coefficient scanning, according to the predicted pixel similarity.
BACKGROUND ART
According to video compression standards for encoding/decoding video data, a frame is divided into a plurality of macro blocks and a macro block may be divided into a plurality of sub-blocks. The encoding/decoding is performed on the basis of a macro block unit or a sub-block unit based on temporal prediction and spatial prediction.
Herein, the temporal prediction is to predict motion of macro blocks or sub-blocks of a current frame by referring to blocks of adjacent frames.
The spatial prediction is to predict macro blocks or sub-blocks of a current frame to be encoded by using boundary pixels of already recovered adjacent blocks. The spatial prediction is also called intra prediction. The intra prediction takes advantage of a characteristic that when a pixel is predicted, pixels adjacent to it are highly likely to have similar values.
H.264/Advanced Video Coding (AVC) standard technology achieves a compression rate about twice that of Moving Picture Experts Group 2 (MPEG-2) and about one and a half times that of MPEG-4 by using such techniques as intra prediction encoding, 1/4-pixel variable block motion prediction and compensation, Context-Based Adaptive Variable Length Coding (CAVLC), and Context-Based Adaptive Binary Arithmetic Coding (CABAC).
The H.264/AVC standard predicts pixel values of a current block by using prediction modes of 9 directivities. Fig. 1 illustrates 9 prediction modes used for intra prediction of 4 x 4 blocks.
As illustrated in Fig. 1, the 9 prediction modes used for intra prediction of 4 x 4 blocks include a vertical mode (mode 0), a horizontal mode (mode 1), a direct current (DC) mode (mode 2), a diagonal_down_left mode (mode 3), a diagonal_down_right mode (mode 4), a vertical_right mode (mode 5), a horizontal_down mode (mode 6), a vertical_left mode (mode 7), and a horizontal_up mode (mode 8). Herein, in the DC mode (mode 2), intra prediction is performed using a mean value of adjacent pixels. The arrows indicate prediction directions.
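As a rough illustration of the mode numbering, the nine intra 4 x 4 modes can be written as an enumeration, with the DC mode sketched as a rounded mean of the neighbouring pixels. The names `IntraMode4x4` and `dc_predict` are hypothetical, and the rounding rule is an assumption; the text only states that a mean value of adjacent pixels is used.

```python
from enum import IntEnum


class IntraMode4x4(IntEnum):
    """Mode numbers as listed for intra prediction of 4 x 4 blocks."""
    VERTICAL = 0
    HORIZONTAL = 1
    DC = 2
    DIAGONAL_DOWN_LEFT = 3
    DIAGONAL_DOWN_RIGHT = 4
    VERTICAL_RIGHT = 5
    HORIZONTAL_DOWN = 6
    VERTICAL_LEFT = 7
    HORIZONTAL_UP = 8


def dc_predict(above, left):
    """Fill a 4 x 4 block with the rounded mean of the adjacent pixels.

    `above` and `left` hold the four reconstructed pixels above and to the
    left of the block. Rounding to nearest is an assumption of this sketch.
    """
    neighbours = list(above) + list(left)
    mean = (sum(neighbours) + len(neighbours) // 2) // len(neighbours)
    return [[mean] * 4 for _ in range(4)]


prediction = dc_predict([10, 10, 10, 10], [20, 20, 20, 20])
```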
Meanwhile, intra 16 x 16 prediction encoding includes a total of four modes, which are a vertical mode, a horizontal mode, a DC mode, and a plane mode.
Also, intra 8 x 8 prediction encoding includes a total of 9 modes, just like the intra 4 x 4 prediction encoding. As for color difference signals, intra 8 x 8 prediction encoding is performed, and the intra 8 x 8 prediction encoding includes a DC mode, a vertical mode, a horizontal mode, and a plane mode and so on.
Hereinafter, prediction methods in the vertical and horizontal modes for intra prediction of 4 x 4 blocks will be described with reference to Figs. 2 and 3. Fig. 2 exemplarily illustrates a pixel prediction method in a vertical direction.
As shown in Fig. 2, pixel a 201, pixel e 202, pixel i 203, and pixel m 204 are predicted based on an adjacent pixel A in the vertical direction. Also, pixels b, f, j and n are predicted based on an adjacent pixel B in the vertical direction, and pixels c, g, k and o are predicted based on an adjacent pixel C in the vertical direction. Pixels d, h, l and p are predicted based on an adjacent pixel D in the vertical direction.
Fig. 3 exemplarily illustrates a pixel prediction method in a horizontal direction.
As illustrated in Fig. 3, pixel a 205, pixel b 206, pixel c 207, and pixel d 208 are predicted based on an adjacent pixel I in a horizontal direction.
Also, pixels e, f, g and h are predicted based on an adjacent pixel J in the horizontal direction, and pixels i, j, k and l are predicted based on an adjacent pixel K in the horizontal direction. Pixels m, n, o and p are predicted based on an adjacent pixel L in the horizontal direction.
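The vertical and horizontal predictions of Figs. 2 and 3 can be sketched as follows. This is a minimal illustration, not the reference implementation: the function names and the sample pixel values are made up for the example, and the block is indexed as `block[row][column]` with pixels a to d in the first row.

```python
def predict_vertical(above):
    """Every pixel in a column copies the adjacent pixel above the block (A, B, C, D)."""
    return [list(above) for _ in range(4)]


def predict_horizontal(left):
    """Every pixel in a row copies the adjacent pixel left of the block (I, J, K, L)."""
    return [[left[r]] * 4 for r in range(4)]


def residual(block, prediction):
    """Difference between the current pixels and the predicted pixels."""
    return [[block[r][c] - prediction[r][c] for c in range(4)] for r in range(4)]


# Illustrative values only.
block = [
    [52, 60, 61, 70],
    [53, 61, 60, 71],
    [52, 59, 62, 70],
    [54, 60, 61, 69],
]
above = [52, 60, 61, 70]  # pixels A, B, C, D
left = [50, 55, 51, 56]   # pixels I, J, K, L

res_vertical = residual(block, predict_vertical(above))
res_horizontal = residual(block, predict_horizontal(left))
```

In this made-up block the columns vary little, so the vertical residual stays close to zero while the horizontal residual keeps the left-to-right gradient.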
An encoder performs Discrete Cosine Transform (DCT) and quantization onto residual signals (which are in the pixel domain) acquired by taking the difference between the predicted pixels and the current pixels. Subsequently, the encoder performs zigzag scanning and entropy encoding onto the transformed coefficients obtained from DCT and quantization.
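The transform and quantization step can be sketched with a textbook floating-point DCT-II and a uniform quantizer. Both are assumptions made for illustration: H.264/AVC itself uses an integer approximation of the transform, and `qstep` is a hypothetical step size. The example residual has identical rows, which is roughly what remains after vertical prediction when vertical similarity is high.

```python
import math


def dct_matrix(n=4):
    """Orthonormal DCT-II basis: row u holds the u-th cosine basis function."""
    rows = []
    for u in range(n):
        scale = math.sqrt((1 if u == 0 else 2) / n)
        rows.append([scale * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                     for x in range(n)])
    return rows


def dct2(block):
    """Two-dimensional DCT computed as C * X * C^T."""
    n = len(block)
    c = dct_matrix(n)
    tmp = [[sum(c[u][x] * block[x][y] for x in range(n)) for y in range(n)]
           for u in range(n)]
    return [[sum(tmp[u][y] * c[v][y] for y in range(n)) for v in range(n)]
            for u in range(n)]


def quantize(coeffs, qstep=1.0):
    """Uniform quantization to integer levels (hypothetical step size)."""
    return [[round(value / qstep) for value in row] for row in coeffs]


# Residual with no variation in the vertical direction.
residual_block = [[3, -1, 2, 0] for _ in range(4)]
levels = quantize(dct2(residual_block))
```

Because the residual does not change from row to row, every non-zero level lands in the first row of `levels`, so the coefficient distribution follows the prediction direction rather than the usual upper-left cluster.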
Herein, although the zigzag scanning takes advantage of an energy compaction characteristic of a transformed coefficient that energy converges onto low frequency and energy appears little in high frequency, the energy compaction after intra prediction is not always effective. In short, the zigzag scanning is a method of scanning a transformed coefficient from low frequency components to high frequency components. When distribution of transformed coefficients appears more in the low frequency components, the zigzag scanning is effective. However, when spatial prediction having directivity is used, the distribution of transformed coefficients is influenced by the direction of prediction. Therefore, it is ineffective to apply the zigzag scanning to the prediction of all directions.
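A short sketch makes the limitation concrete. The helper names `zigzag_order` and `zigzag_scan` are hypothetical; the order is generated by walking the anti-diagonals of the block from low to high frequency and alternating direction on each one.

```python
def zigzag_order(n=4):
    """Return (row, column) positions of an n x n block in zigzag order."""
    order = []
    for s in range(2 * n - 1):           # s = row + column indexes one anti-diagonal
        rows = range(max(0, s - n + 1), min(s, n - 1) + 1)
        if s % 2 == 0:                   # even diagonals run from bottom-left to top-right
            rows = reversed(rows)
        order.extend((r, s - r) for r in rows)
    return order


def zigzag_scan(block):
    """Flatten a square coefficient block into a 1-D list in zigzag order."""
    return [block[r][c] for r, c in zigzag_order(len(block))]


# Raster indices (row * 4 + column) visited by the 4 x 4 zigzag scan.
raster_indices = [r * 4 + c for r, c in zigzag_order(4)]

# Coefficients concentrated in the first row, as directional prediction can produce.
first_row_block = [[9, 5, 3, 1], [0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
scanned = zigzag_scan(first_row_block)
```

For `first_row_block` the zigzag scan separates the significant coefficients with runs of zeros, which is the inefficiency that motivates choosing the scan according to the prediction direction.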
DISCLOSURE
TECHNICAL PROBLEM
An embodiment of the present invention, which is devised to overcome the above problems, is directed to providing an encoding/decoding apparatus and method which performs intra prediction onto input video, predicts pixel similarity based on pixel similarity information of coefficients to be encoded acquired from adjacent pixels in the intra-predicted video, and performs a most effective scanning, e.g., DCT coefficient scanning, according to the predicted pixel similarity.
Other objects and advantages of the present invention can be understood by the following description, and become apparent with reference to the embodiments of the present invention. Also, it is obvious to those skilled in the art of the present invention that the objects and advantages of the present invention can be realized by the means as claimed and combinations thereof.
TECHNICAL SOLUTION
In accordance with an aspect of the present invention, there is provided an encoding apparatus using a Discrete Cosine Transform (DCT) scanning, which includes: a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
In accordance with another aspect of the present invention, there is provided a decoding apparatus using a DCT scanning, which includes: an entropy decoding means for performing entropy decoding onto encoded video; a scanning decision means for deciding a scanning mode for the video decoded in the entropy decoding means; and a video recovery means for recovering the video based on the scanning mode decided in the scanning decision means.
In accordance with another aspect of the present invention, there is provided an encoding method using a DCT scanning, which includes the steps of: selecting an optimal mode for intra prediction; performing intra prediction onto video inputted based on the mode selected in the mode selection step; performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction step; deciding pixel similarity of the residual coefficients; and performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided in the pixel similarity decision step.
In accordance with another aspect of the present invention, there is provided a decoding method using a DCT scanning, which includes the steps of: performing entropy decoding onto encoded video; deciding a scanning mode for the video decoded in the entropy decoding step; and recovering the video based on the scanning mode decided in the scanning decision step.
According to an embodiment of the present invention, a luminance block may go through an intra 4 x 4 luminance encoding mode of H.264/Advanced Video Coding (AVC), which includes a vertical mode, a horizontal mode, a diagonal_down_left mode, a diagonal_down_right mode, a vertical_right mode, a horizontal_down mode, a vertical_left mode, and a horizontal_up mode, and an intra 16 x 16 luminance encoding mode of H.264/AVC, which includes a vertical mode, a horizontal mode, a plane mode, and a DC mode.
Also, according to an embodiment of the present invention, a chrominance block may go through an intra M x N chrominance encoding mode of H.264/AVC, which includes a vertical mode, a horizontal mode, a plane mode and a DC mode.
ADVANTAGEOUS EFFECTS
As described above, the present invention can improve a compression rate of intra encoding by applying a most effective scanning method according to pixel similarity in order to encode/decode video.
Also, the present invention can improve a video compression rate by being applied to a video compression technology using intra prediction, which will be developed in future.
Also, the present invention can reduce a need for an additional module by applying the same similarity information to both encoder and decoder.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 illustrates 9 prediction modes used for intra prediction of 4 x 4 blocks according to H.264/AVC.
Fig. 2 exemplarily illustrates a pixel prediction method in a vertical mode.
Fig. 3 exemplarily illustrates a pixel prediction method in a horizontal direction.
Fig. 4 is a block view showing an encoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity in accordance with an embodiment of the present invention.
Fig. 5 exemplarily illustrates a zigzag scanning method used in the present invention.
Fig. 6 exemplarily illustrates a horizontal scanning method used in the present invention.
Fig. 7 exemplarily illustrates a vertical scanning method used in the present invention.
Fig. 8 illustrates a method for predicting pixel similarity in vertical and horizontal directions in accordance with an embodiment of the present invention.
Fig. 9 is a flowchart describing an adaptive scanning method based on pixel similarity in a vertical intra prediction mode in accordance with an embodiment of the present invention.
Fig. 10 is a flowchart describing an adaptive scanning method based on pixel similarity in a horizontal intra prediction mode in accordance with an embodiment of the present invention.
Fig. 11 is a block view showing a decoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity in accordance with an embodiment of the present invention.
BEST MODE FOR THE INVENTION
The advantages, features and aspects of the invention will become apparent from the following description of the embodiments with reference to the accompanying drawings, which is set forth hereinafter. When it is considered that detailed description on a related art may obscure a point of the present invention, the description will not be provided herein. Hereinafter, specific embodiments of the present invention will be described in detail with reference to the accompanying drawings.
Fig. 4 is a block view showing an encoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity in accordance with an embodiment of the present invention. As illustrated in Fig. 4, the encoding apparatus based on DCT coefficient scanning adaptive to pixel similarity includes a mode selection unit 10, an intra prediction unit 20, a DCT and quantization unit 30, and an entropy encoding unit 40.
The mode selection unit 10 selects an optimal mode among several available prediction modes for intra prediction. In other words, it selects one among a plurality of possible encoding modes when 4 x 4, 16 x 16, or 8 x 8 intra prediction is performed. Generally, the mode selection unit 10 selects one mode according to a rate-distortion optimization method for reducing the rate-distortion cost.
The intra prediction unit 20 receives a video, and performs 4 x 4 intra prediction for pixels of luminance blocks and 8 x 8 intra prediction for pixels of chrominance blocks based on a mode selected in the mode selection unit 10.
The DCT and quantization unit 30 performs DCT and quantization onto difference values outputted from the intra prediction unit 20, that is, onto residual coefficient blocks indicating differences between pixel values of macro blocks of a current frame to be encoded and predicted pixel values, and transmits the resulting coefficients to the entropy encoding unit 40.
The entropy encoding unit 40 arrays the DCT coefficients obtained in the DCT and quantization unit 30 by using an adaptive DCT coefficient scanning based on pixel similarity, performs entropy encoding onto the arrayed DCT coefficients, and outputs the result. Herein, the entropy encoding is an encoding technique for enhancing a compression rate by allocating few bits to data that are highly likely to occur and many bits to data that are not likely to occur. Examples of the entropy encoding used in the present invention include Context Adaptive Variable Length Coding (CAVLC) and Context-Based Adaptive Binary Arithmetic Coding (CABAC).
With reference to Figs. 8 to 10, described hereafter are a method of predicting pixel similarity in vertical and horizontal directions in the entropy encoding unit 40, and a scanning method in vertical and horizontal intra prediction modes.
Fig. 5 exemplarily illustrates a typical zigzag scanning method used in the present invention. Fig. 6 exemplarily illustrates a typical horizontal scanning method used in the present invention. Fig. 7 exemplarily illustrates a typical vertical scanning method used in the present invention.
As shown in Fig. 5, the zigzag scanning method used in the present invention is devised in consideration that the low frequency components of the transformed coefficients acquired from the DCT and quantization are highly likely to be positioned in the upper left part of a two-dimensional plane. It takes advantage of the energy compaction characteristic of transformed coefficients, in which coefficients after DCT appear mostly in the low frequencies and appear less in the high frequencies.
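The zigzag order described above can be sketched as follows. This is only an illustration: the index table is the frame-coding zigzag order commonly used for 4 x 4 blocks in H.264/AVC, written as raster indices (row * 4 + column), and the function name scan_block and the coefficient values are hypothetical and do not come from the specification.

```python
# Zigzag scan for a 4x4 block, expressed as raster indices (row * 4 + col).
# Assumption: this is the usual H.264/AVC frame zigzag order for 4x4 blocks.
ZIGZAG_4X4 = [0, 1, 4, 8, 5, 2, 3, 6, 9, 12, 13, 10, 7, 11, 14, 15]


def scan_block(block, order):
    """Flatten a 4x4 block (list of 4 rows) into 16 values following `order`."""
    flat = [value for row in block for value in row]
    return [flat[index] for index in order]


# Hypothetical quantized coefficients with energy packed in the upper-left corner.
block = [
    [9, 4, 1, 0],
    [3, 2, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

scanned = scan_block(block, ZIGZAG_4X4)
print(scanned)
```

With coefficients concentrated in the upper left part, the non-zero values come first in the scanned sequence and the trailing zeros form one long run, which is what makes the zigzag order attractive for entropy encoding in this case.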
The zigzag scanning method may be more efficient when pixel similarity in the horizontal direction is similar to the pixel similarity in the vertical direction.
However, when intra prediction encoding, particularly, vertical or horizontal intra prediction, is performed, the similarity of the residual coefficients in the vertical direction shows much difference from the similarity in the horizontal direction. Thus, the above-described coefficient distribution is not always effective. Therefore, it is inefficient to apply the zigzag scanning to prediction of all directions.
To describe an example of the vertical prediction mode, the vertical prediction mode is selected as an optimal mode in a rate-distortion process, when the pixel similarity in the vertical direction is high. Herein, significant coefficients are distributed in the first row. Therefore, the horizontal scanning shown in Fig. 6 is more efficient than the typical zigzag scanning.
Meanwhile, to describe an example of the horizontal prediction mode, the horizontal prediction mode is selected as an optimal mode, when the pixel similarity in the horizontal direction is high. Herein, significant coefficients are distributed in the first column. Therefore, the vertical scanning shown in Fig. 7 is more efficient.
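The effect of the scanning order on a directional coefficient distribution can be illustrated with a small sketch. The exact paths of Figs. 6 and 7 are not reproduced in the text, so the sketch assumes that horizontal scanning reads the block row by row and vertical scanning reads it column by column; the helper scan_length and the coefficient values are hypothetical.

```python
# Raster indices are row * 4 + col for a 4x4 block.
ZIGZAG_4X4 = [0, 1, 4, 8, 5, 2, 3, 6, 9, 12, 13, 10, 7, 11, 14, 15]
# Assumption: horizontal scanning is plain row-by-row order.
HORIZONTAL_4X4 = list(range(16))
# Assumption: vertical scanning is plain column-by-column order.
VERTICAL_4X4 = [r * 4 + c for c in range(4) for r in range(4)]


def scan_length(block, order):
    """Number of scanned positions up to and including the last non-zero value."""
    flat = [value for row in block for value in row]
    scanned = [flat[index] for index in order]
    nonzero_positions = [k for k, value in enumerate(scanned) if value != 0]
    return nonzero_positions[-1] + 1 if nonzero_positions else 0


# Hypothetical blocks: significant coefficients only in the first row / column.
first_row_block = [[7, 5, 3, 2], [0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
first_col_block = [[7, 0, 0, 0], [5, 0, 0, 0], [3, 0, 0, 0], [2, 0, 0, 0]]

for name, order in [("horizontal", HORIZONTAL_4X4),
                    ("vertical", VERTICAL_4X4),
                    ("zigzag", ZIGZAG_4X4)]:
    print(name, scan_length(first_row_block, order), scan_length(first_col_block, order))
```

Under these assumptions the horizontal scan reaches the last significant coefficient of the first-row block after 4 positions, whereas the zigzag scan needs 7 and the vertical scan 13. The first-column block behaves symmetrically in favor of the vertical scan.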
However, since the pixel similarity before intra prediction is different from pixel similarity of residual coefficients after the intra prediction, it is inefficient to simply use the scanning method of Fig. 6 or Fig. 7 according to the intra prediction mode.
Therefore, if pixel similarities in the vertical and horizontal directions of blocks around a block to be encoded are predicted based on similarity information among adjacent block boundary pixels which are already recovered and an adaptive scanning method according to the prediction result is used, the encoding efficiency can be increased.
Fig. 8 illustrates a method for predicting pixel similarity in vertical and horizontal directions in accordance with an embodiment of the present invention.
As illustrated in Fig. 8, pixels A, B, C and D are positioned in the upper part of a current block to be encoded, whereas pixels E, F, G and H are positioned in the left part of the current block to be encoded.
Herein, when vertical prediction encoding is performed, the vertical-directional pixel similarity of the pixels a, e, i and m, which are positioned in the first column of the current block to be encoded, is the same as the vertical-directional pixel similarity of the residual coefficients a-A, e-A, i-A, and m-A after vertical prediction. This is because the residual coefficients a-A, e-A, i-A, and m-A are obtained by subtracting the same prediction pixel A from the pixels a, e, i and m, and thus the correlation does not change.
Likewise, the vertical-directional pixel similarity of the pixels in the second, third and fourth columns of the block is the same as the vertical-directional pixel similarity of the residual coefficients after vertical prediction.
However, the horizontal-directional pixel similarity of the first-row pixels a, b, c and d of the current block to be encoded is different from the horizontal-directional pixel similarity of the residual coefficients a-A, b-B, c-C, and d-D after vertical prediction. Also, horizontal-directional pixel similarity before vertical prediction becomes higher than the horizontal-directional pixel similarity after the vertical prediction. Thus, it becomes similar to or higher than the vertical-directional pixel similarity.
Likewise, in case of the horizontal prediction encoding, the horizontal-directional pixel similarity of the pixels a, b, c and d in the first row of a block is the same as the horizontal-directional pixel similarity of the residual coefficients a-E, b-E, c-E, and d-E after horizontal prediction. Also, the horizontal-directional pixel similarity of the pixels in the second, third and fourth rows of the block is the same as the horizontal-directional pixel similarity of the residual coefficients after horizontal prediction.
However, the vertical-directional pixel similarity of the first-column pixels a, e, i and m of the block is different from the vertical-directional pixel similarity of the residual coefficients a-E, e-F, i-G, and m-H after horizontal prediction. Also, the vertical-directional pixel similarity before horizontal prediction becomes higher than the vertical-directional pixel similarity after the horizontal prediction. Thus, it becomes similar to or higher than the horizontal-directional pixel similarity.
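The relationship between the pixels and the residual coefficients under vertical prediction can be checked numerically. The following sketch uses hypothetical pixel values and assumes that each column of the 4 x 4 block is predicted from the pixel directly above it (A, B, C or D). It shows that differences between vertically adjacent values are unchanged by the prediction, while differences between horizontally adjacent values are changed.

```python
# Hypothetical recovered pixels A, B, C, D above the current block.
top = [100, 120, 90, 110]

# Hypothetical 4x4 block; row 0 holds a, b, c, d and column 0 holds a, e, i, m.
block = [
    [102, 121, 93, 111],
    [104, 122, 92, 113],
    [103, 125, 94, 112],
    [105, 124, 95, 114],
]

# Vertical prediction: every pixel in column c is predicted by top[c].
residual = [[block[r][c] - top[c] for c in range(4)] for r in range(4)]


def vertical_diffs(b):
    """Differences between vertically adjacent values, column by column."""
    return [[b[r + 1][c] - b[r][c] for c in range(4)] for r in range(3)]


def horizontal_diffs(b):
    """Differences between horizontally adjacent values, row by row."""
    return [[b[r][c + 1] - b[r][c] for c in range(3)] for r in range(4)]


print(vertical_diffs(block) == vertical_diffs(residual))      # vertical structure kept
print(horizontal_diffs(block) == horizontal_diffs(residual))  # horizontal structure changed
```

The same check applies to horizontal prediction with the roles of rows and columns exchanged.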
As described above, when the pixel similarities in the vertical and horizontal directions become similar, a general zigzag scanning method is more efficient than the horizontal and vertical scanning methods. Therefore, when the vertical intra prediction mode is performed and the vertical-directional pixel similarity of residual coefficients is high and their horizontal-directional pixel similarity is low, it is more efficient to use the horizontal scanning. Meanwhile, when the horizontal intra prediction mode is performed and the horizontal-directional pixel similarity of residual coefficients is high and their vertical-directional pixel similarity is low, it is more efficient to use the vertical scanning.
When the vertical-directional pixel similarity of the eight recovered pixels A, B, C, D, E, F, G and H of Fig. 8 is referred to as S_VER and their horizontal-directional pixel similarity is referred to as S_HOR, the pixel similarities for increasing the efficiency of 4 x 4 prediction encoding can be calculated by the following Equation 1.
Equation 1
S_VER = 1 / Variance(E, F, G, H)
S_HOR = 1 / Variance(A, B, C, D)
where Variance() denotes a dispersion; E, F, G and H denote pixels adjacent to the left part of the current block to be encoded; and A, B, C and D denote pixels adjacent to the upper part of the current block to be encoded.
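The similarity values of Equation 1 can be sketched as follows, assuming that each similarity is the reciprocal of the variance of the corresponding boundary pixels. The text does not state whether a population or a sample variance is meant, nor how a zero variance is handled; the sketch assumes the population variance and adds a small constant eps to avoid division by zero. Both choices, the function name similarity, and the pixel values are hypothetical.

```python
from statistics import pvariance


def similarity(pixels, eps=1e-6):
    """Reciprocal of the population variance; eps guards against zero variance.

    Assumption: eps and the use of the population variance are illustrative
    choices that are not specified in the text.
    """
    return 1.0 / (pvariance(pixels) + eps)


# Hypothetical recovered boundary pixels of Fig. 8.
left = [100, 101, 100, 101]   # E, F, G, H: nearly constant along the vertical direction
top = [100, 120, 90, 110]     # A, B, C, D: strongly varying along the horizontal direction

s_ver = similarity(left)  # vertical-directional similarity, from the left pixels
s_hor = similarity(top)   # horizontal-directional similarity, from the upper pixels
print(s_ver, s_hor)
```

In this example the left pixels vary little, so S_VER is much larger than S_HOR, which indicates a block whose content is far more similar in the vertical direction than in the horizontal direction.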
When the vertical prediction mode is carried out, a value obtained by multiplying S_HOR by a multiplication factor α (α ≥ 1) is used as the horizontal-directional pixel similarity prediction value of the residual coefficients of the current block. Herein, the α value is fixed at 2 in an experiment. S_VER is used as it is as the vertical-directional pixel similarity prediction value of the residual coefficients of the current block.
When the horizontal prediction mode is carried out, a value obtained by multiplying S_VER by a multiplication factor β (β ≥ 1) is used as the vertical-directional pixel similarity prediction value of the residual coefficients of the current block. Herein, the β value is fixed at 2 in an experiment. S_HOR is used as it is as the horizontal-directional pixel similarity prediction value of the residual coefficients of the current block.
The vertical and horizontal-directional pixel similarity prediction values acquired in the above methods are compared to each other to decide a scanning method.
Although the example of 4 x 4 intra prediction mode is described in the above, the present invention is not limited to the mode and the present invention can be applied to an M x N intra prediction mode, too.
Hereinafter, a method of selecting a scanning method in the vertical and horizontal prediction modes will be described in detail with reference to Figs. 9 and 10.
Fig. 9 is a flowchart describing an adaptive scanning method based on pixel similarity in a vertical intra prediction mode in accordance with an embodiment of the present invention. In case of a vertical intra prediction mode in step S601, the S_VER value and the value of α x S_HOR are compared in step S602. When the S_VER value is greater than the value of α x S_HOR, a horizontal scanning method is used in step S603. When the S_VER value is smaller than the value of α x S_HOR, a zigzag scanning method is used in step S604.
Herein, when the vertical-directional pixel similarity of the current block to be encoded, which is predicted based on the similarity of adjacent pixels, is higher than the horizontal-directional pixel similarity thereof, the transformed coefficients obtained after DCT and quantization are highly likely to be distributed horizontally along the first row of the block. Therefore, the horizontal scanning method can bring about a high encoding efficiency.
Fig. 10 is a flowchart describing an adaptive scanning method based on pixel similarity in a horizontal intra prediction mode in accordance with an embodiment of the present invention. In case of a horizontal intra prediction mode in step S701, the S_HOR value and the value of β x S_VER are compared in step S702. When the S_HOR value is greater than the value of β x S_VER, a vertical scanning method is used in step S703. When the S_HOR value is smaller than the value of β x S_VER, a zigzag scanning method is used in step S704.
Herein, when the horizontal-directional pixel similarity of the current block to be encoded, which is predicted based on the similarity of adjacent pixels, is higher than the vertical-directional pixel similarity thereof, the transformed coefficients obtained after DCT and quantization are highly likely to be distributed vertically along the first column of the block. Therefore, the vertical scanning method can bring about a high encoding efficiency.
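The decisions of the flowcharts of Figs. 9 and 10 can be summarized in a short sketch. The function name select_scan and the string labels are hypothetical. The text does not specify what happens when the two compared values are exactly equal, nor which scan is used for the other intra prediction modes; the sketch assumes zigzag scanning in both cases.

```python
def select_scan(intra_mode, s_ver, s_hor, alpha=2.0, beta=2.0):
    """Choose a scanning method from the intra mode and the predicted similarities.

    alpha and beta default to 2, the value reported as fixed in the experiment.
    Assumption: ties and all other intra modes fall back to zigzag scanning.
    """
    if intra_mode == "vertical":
        # Steps S602 to S604: compare S_VER with alpha x S_HOR.
        return "horizontal" if s_ver > alpha * s_hor else "zigzag"
    if intra_mode == "horizontal":
        # Steps S702 to S704: compare S_HOR with beta x S_VER.
        return "vertical" if s_hor > beta * s_ver else "zigzag"
    return "zigzag"


print(select_scan("vertical", s_ver=4.0, s_hor=0.008))   # strong vertical similarity
print(select_scan("vertical", s_ver=1.0, s_hor=0.9))     # similarities are comparable
print(select_scan("horizontal", s_ver=0.5, s_hor=2.0))   # strong horizontal similarity
```

Because the decision depends only on the intra prediction mode and on already recovered neighboring pixels, a decoder can repeat the same computation without any additional signaling in the bitstream.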
Fig. 11 is a block view showing a decoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity in accordance with an embodiment of the present invention. As shown in Fig. 11, the decoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity includes an entropy decoding unit 50, a scanning decision unit 60, and a video recovery unit 70.
The entropy decoding unit 50 receives a video bitstream encoded in the encoding apparatus using an adaptive DCT coefficient scanning based on pixel similarity and decodes it through an entropy decoding method such as CAVLC or CABAC. Then, the entropy decoding unit 50 transmits the entropy-decoded video bitstream to the scanning decision unit 60.
The scanning decision unit 60 decides a scanning method for the coefficients decoded in the entropy decoding unit 50 according to an intra prediction mode, as described in the above with reference to Figs. 8 to 10.
The video recovery unit 70 finally recovers the coefficients by using the scanning method decided in the scanning decision unit 60 to recover the video.
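The recovery of a coefficient block from the decoded one-dimensional sequence can be sketched as an inverse scan. The function names and coefficient values are hypothetical, and the column-by-column order used for vertical scanning is an assumption, since the exact path of Fig. 7 is not reproduced in the text.

```python
# Assumption: vertical scanning visits the 4x4 block column by column.
VERTICAL_4X4 = [r * 4 + c for c in range(4) for r in range(4)]


def forward_scan(block, order):
    """Encoder side: flatten a 4x4 block into 16 values following `order`."""
    flat = [value for row in block for value in row]
    return [flat[index] for index in order]


def inverse_scan(coeffs, order):
    """Decoder side: put 16 scanned values back at their 4x4 raster positions."""
    flat = [0] * 16
    for position, index in enumerate(order):
        flat[index] = coeffs[position]
    return [flat[r * 4:(r + 1) * 4] for r in range(4)]


# Hypothetical quantized coefficients concentrated in the first column.
block = [[7, 1, 0, 0], [5, 0, 0, 0], [3, 0, 0, 0], [2, 0, 0, 0]]

coeffs = forward_scan(block, VERTICAL_4X4)
recovered = inverse_scan(coeffs, VERTICAL_4X4)
print(coeffs)
print(recovered == block)
```

The block is recovered exactly only when the decoder applies the same scanning order that the encoder selected, which is why the scanning decision must be reproducible on both sides.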
An experiment was carried out for diverse test videos by using Joint Model 86 (JM86), which is the H.264/AVC Reference Codec, according to the above-described methods. The result of increase in compression efficiency was as follows. In the experiment, video recommended by H.264/AVC as test video was used. The following Table 1 shows conditions of the experiment.
Table 1
[Table 1 is available only as an image in the source: imgf000018_0001]
As shown in Table 1, five pieces of video with different sizes were used for the experiment. The following Table 2 presents video compression rates when the test videos were compressed using a conventional compression method, which is the zigzag scanning method of H.264/AVC, and the compression method of the present invention, which is the adaptive scanning method according to intra prediction mode, under the same conditions as in Table 1.
Table 2
[Table 2 is available only as an image in the source: imgf000019_0001]
Table 2 shows that the result of video compression using the adaptive scanning method according to the intra prediction mode, which is suggested in the present invention, is superior to that of video compression using only the conventional zigzag scanning method of H.264/AVC.
The method of the present invention described above may be realized as a program and stored in a computer-readable recording medium such as CD-ROM, RAM, ROM, floppy disks, hard disks, magneto-optical disks and so forth. Since the process can be easily implemented by those skilled in the art to which the present invention pertains, further description on it will not be provided herein.
While the present invention has been described with respect to certain preferred embodiments, it will be apparent to those skilled in the art that various changes and modifications may be made without departing from the scope of the invention as defined in the following claims.

Claims

WHAT IS CLAIMED IS
1. An encoding apparatus using a Discrete Cosine Transform (DCT) scanning, comprising: a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
2. The encoding apparatus of claim 1, wherein the scanning mode is any one among a horizontal-directional scanning, a vertical-directional scanning, and a zigzag scanning.
3. The encoding apparatus of claim 2, wherein the entropy encoding means: performs encoding using a horizontal-directional scanning, when vertical-directional pixel similarity of the residual signals is high; performs encoding using a vertical-directional scanning, when horizontal-directional pixel similarity of the residual signals is high; and performs encoding using a zigzag scanning, when vertical and horizontal-directional pixel similarities of the residual signals are similar.
4. The encoding apparatus of claim 3, wherein the entropy encoding means decides that the vertical-directional pixel similarity is high, when a vertical-directional pixel similarity value is greater than a value obtained by multiplying a horizontal-directional pixel similarity value by a first multiplication factor.
5. The encoding apparatus of claim 3, wherein the entropy encoding means decides that the horizontal-directional pixel similarity is high, when a horizontal-directional pixel similarity value is greater than a value obtained by multiplying a vertical-directional pixel similarity value by a second multiplication factor.
6. The encoding apparatus of claim 4, wherein the vertical-directional pixel similarity is calculated by performing dispersion onto pixels adjacent to a left part of a currently encoded block.
7. The encoding apparatus of claim 5, wherein the horizontal-directional pixel similarity is calculated by performing dispersion onto pixels adjacent to an upper part of a currently encoded block.
8. The encoding apparatus of claim 6, wherein the first and second multiplication factors are natural number 2.
9. A decoding apparatus using a DCT scanning, comprising: an entropy decoding means for performing entropy decoding onto encoded video; a scanning decision means for deciding a scanning mode for the video decoded in the entropy decoding means; and a video recovery means for recovering the video based on the scanning mode decided in the scanning decision means.
10. The decoding apparatus of claim 9, wherein the decided scanning mode is any one among a horizontal-directional scanning, a vertical-directional scanning, and a zigzag scanning.
11. An encoding method using a DCT scanning, comprising the steps of: selecting an optimal mode for intra prediction; performing intra prediction onto video inputted based on the mode selected in the mode selection step; performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction step; deciding pixel similarity of the residual coefficients; and performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided in the pixel similarity decision step.
12. The encoding method of claim 11, wherein the decided scanning mode is any one among a horizontal-directional scanning, a vertical-directional scanning, and a zigzag scanning.
13. The encoding method of claim 12, wherein the entropy encoding step includes the steps of: performing encoding using a horizontal-directional scanning, when vertical-directional pixel similarity of the residual signals is high; performing encoding using a vertical-directional scanning, when horizontal-directional pixel similarity of the residual signals is high; and performing encoding using a zigzag scanning, when vertical and horizontal-directional pixel similarities of the residual signals are similar.
14. The encoding method of claim 13, wherein the vertical-directional pixel similarity is decided high in the entropy encoding step, when a vertical-directional pixel similarity value is greater than a value obtained by multiplying a horizontal-directional pixel similarity value by a first multiplication factor.
15. The encoding method of claim 13, wherein the horizontal-directional pixel similarity is decided high in the entropy encoding step, when a horizontal-directional pixel similarity value is greater than a value obtained by multiplying a vertical-directional pixel similarity value by a second multiplication factor.
16. The encoding method of claim 14, wherein the vertical-directional pixel similarity is calculated by performing dispersion onto pixels adjacent to a left part of a currently encoded block.
17. The encoding method of claim 15, wherein the horizontal-directional pixel similarity is calculated by performing dispersion onto pixels adjacent to an upper part of a currently encoded block.
18. The encoding method of claim 16, wherein the first and second multiplication factors are natural number 2.
19. A decoding method using a DCT scanning, comprising the steps of: performing entropy decoding onto encoded video; deciding a scanning mode for the video decoded in the entropy decoding step; and recovering the video based on the scanning mode decided in the scanning decision step.
20. The decoding method of claim 19, wherein the decided scanning mode is any one among a horizontal-directional scanning, a vertical-directional scanning, and a zigzag scanning.
PCT/KR2007/001433 2006-08-17 2007-03-23 Apparatus for encoding and decoding image using adaptive dct coefficient scanning based on pixel similarity and method therefor WO2008020672A1 (en)

Priority Applications (9)

Application Number Priority Date Filing Date Title
US12/377,617 US8548060B2 (en) 2006-08-17 2007-03-23 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US13/975,213 US9154784B2 (en) 2006-08-17 2013-08-23 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US13/975,251 US9225982B2 (en) 2006-08-17 2013-08-23 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US14/823,273 US9736484B2 (en) 2006-08-17 2015-08-11 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US14/823,186 US9819942B2 (en) 2006-08-17 2015-08-11 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US15/784,623 US10499064B2 (en) 2006-08-17 2017-10-16 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US16/438,785 US11330274B2 (en) 2006-08-17 2019-06-12 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US17/219,964 US11949881B2 (en) 2006-08-17 2021-04-01 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US17/219,985 US20210218973A1 (en) 2006-08-17 2021-04-01 Apparatus for encoding and decoding image using adaptive dct coefficient scanning based on pixel similarity and method therefor

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR20060077851 2006-08-17
KR10-2006-0077851 2006-08-17
KR1020070008247A KR100882949B1 (en) 2006-08-17 2007-01-26 Apparatus and method of encoding and decoding using adaptive scanning of DCT coefficients according to the pixel similarity
KR10-2007-0008247 2007-01-26

Related Child Applications (3)

Application Number Title Priority Date Filing Date
US12/377,617 A-371-Of-International US8548060B2 (en) 2006-08-17 2007-03-23 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US13/975,213 Continuation US9154784B2 (en) 2006-08-17 2013-08-23 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US13/975,251 Continuation US9225982B2 (en) 2006-08-17 2013-08-23 Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor

Publications (1)

Publication Number Publication Date
WO2008020672A1 true WO2008020672A1 (en) 2008-02-21

Family

ID=39082184

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2007/001433 WO2008020672A1 (en) 2006-08-17 2007-03-23 Apparatus for encoding and decoding image using adaptive dct coefficient scanning based on pixel similarity and method therefor

Country Status (1)

Country Link
WO (1) WO2008020672A1 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011137814A1 (en) * 2010-09-30 2011-11-10 华为技术有限公司 Scanning method and device, reverse scanning method and device
CN102934432A (en) * 2010-04-05 2013-02-13 三星电子株式会社 Method and apparatus for encoding video by using transformation index, and method and apparatus for decoding video by using transformation index
US8982961B2 (en) 2010-04-05 2015-03-17 Samsung Electronics Co., Ltd. Method and apparatus for encoding video by using transformation index, and method and apparatus for decoding video by using transformation index
US9049450B2 (en) 2010-04-05 2015-06-02 Samsung Electronics Co., Ltd. Method and apparatus for encoding video based on internal bit depth increment, and method and apparatus for decoding video based on internal bit depth increment
CN104954790A (en) * 2011-11-04 2015-09-30 英孚布瑞智有限私人贸易公司 Method of generating quantized block
US9369736B2 (en) 2010-04-05 2016-06-14 Samsung Electronics Co., Ltd. Low complexity entropy-encoding/decoding method and apparatus
WO2016124154A1 (en) * 2015-02-08 2016-08-11 同济大学 Image coding method and apparatus, and image decoding method and apparatus
US9736484B2 (en) 2006-08-17 2017-08-15 Electronics And Telecommunications Research Institute Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
CN109547781A (en) * 2018-10-26 2019-03-29 西安科锐盛创新科技有限公司 Compression method and equipment based on image prediction
US10750180B2 (en) 2015-02-08 2020-08-18 Tongji University Image coding method and apparatus, and image decoding method and apparatus
CN111988630A (en) * 2020-09-11 2020-11-24 北京锐马视讯科技有限公司 Video transmission method and device, equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003006643A (en) * 2001-06-25 2003-01-10 Canon Inc Device and method for processing image and program
JP2004348741A (en) * 2003-05-20 2004-12-09 Mitsubishi Electric Information Technology Centre Europa Bv Image comparison method, computer readable storage medium for storing program for execution of the method, and device for execution of the method

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9736484B2 (en) 2006-08-17 2017-08-15 Electronics And Telecommunications Research Institute Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US11949881B2 (en) 2006-08-17 2024-04-02 Electronics And Telecommunications Research Institute Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US11330274B2 (en) 2006-08-17 2022-05-10 Electronics And Telecommunications Research Institute Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US10499064B2 (en) 2006-08-17 2019-12-03 Electronics And Telecommunications Research Institute Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US9819942B2 (en) 2006-08-17 2017-11-14 Electronics And Telecommunications Research Institute Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
US9049450B2 (en) 2010-04-05 2015-06-02 Samsung Electronics Co., Ltd. Method and apparatus for encoding video based on internal bit depth increment, and method and apparatus for decoding video based on internal bit depth increment
US9042459B1 (en) 2010-04-05 2015-05-26 Samsung Electronics Co., Ltd. Method and apparatus for encoding video by using transformation index, and method and apparatus for decoding video by using transformation index
CN102934432A (en) * 2010-04-05 2013-02-13 三星电子株式会社 Method and apparatus for encoding video by using transformation index, and method and apparatus for decoding video by using transformation index
US8982961B2 (en) 2010-04-05 2015-03-17 Samsung Electronics Co., Ltd. Method and apparatus for encoding video by using transformation index, and method and apparatus for decoding video by using transformation index
US10158890B2 (en) 2010-04-05 2018-12-18 Samsung Electronics Co., Ltd. Low complexity entropy-encoding/decoding method and apparatus
CN102934432B (en) * 2010-04-05 2016-02-17 三星电子株式会社 Method and apparatus for encoding video by using transformation index, and method and apparatus for decoding video by using transformation index
US9294781B2 (en) 2010-04-05 2016-03-22 Samsung Electronics Co., Ltd. Method and apparatus for encoding video by using transformation index, and method and apparatus for decoding video by using transformation index
US9369736B2 (en) 2010-04-05 2016-06-14 Samsung Electronics Co., Ltd. Low complexity entropy-encoding/decoding method and apparatus
US10027991B2 (en) 2010-04-05 2018-07-17 Samsung Electronics Co., Ltd. Low complexity entropy-encoding/decoding method and apparatus
US9602845B2 (en) 2010-04-05 2017-03-21 Samsung Electronics Co., Ltd. Low complexity entropy-encoding/decoding method and apparatus
US9042460B1 (en) 2010-04-05 2015-05-26 Samsung Electronics Co., Ltd. Method and apparatus for encoding video by using transformation index, and method and apparatus for decoding video by using transformation index
US9049460B1 (en) 2010-04-05 2015-06-02 Samsung Electronics Co., Ltd. Method and apparatus for encoding video by using transformation index, and method and apparatus for decoding video by using transformation index
US9866875B2 (en) 2010-04-05 2018-01-09 Samsung Electronics Co., Ltd. Low complexity entropy-encoding/decoding method and apparatus
RU2544788C2 (en) * 2010-09-30 2015-03-20 Хуавэй Текнолоджиз Ко., Лтд. Scanning method and apparatus and inverse scanning method and apparatus
US9137531B2 (en) 2010-09-30 2015-09-15 Huawei Technologies Co., Ltd. Scanning method and device, and reverse scanning method and device
WO2011137814A1 (en) * 2010-09-30 2011-11-10 华为技术有限公司 Scanning method and device, reverse scanning method and device
CN104954790A (en) * 2011-11-04 2015-09-30 英孚布瑞智有限私人贸易公司 Method of generating quantized block
WO2016124154A1 (en) * 2015-02-08 2016-08-11 同济大学 Image coding method and apparatus, and image decoding method and apparatus
US10750180B2 (en) 2015-02-08 2020-08-18 Tongji University Image coding method and apparatus, and image decoding method and apparatus
US11431977B2 (en) 2015-02-08 2022-08-30 Xi'an Zhongxing New Software Co., Ltd Image coding method and apparatus, and image decoding method and apparatus
CN109547781A (en) * 2018-10-26 2019-03-29 西安科锐盛创新科技有限公司 Compression method and equipment based on image prediction
CN109547781B (en) * 2018-10-26 2020-12-22 嘉兴奥恒进出口有限公司 Compression method and device based on image prediction
CN111988630A (en) * 2020-09-11 2020-11-24 北京锐马视讯科技有限公司 Video transmission method and device, equipment and storage medium

Similar Documents

Publication Publication Date Title
US11949881B2 (en) Apparatus for encoding and decoding image using adaptive DCT coefficient scanning based on pixel similarity and method therefor
WO2008020672A1 (en) Apparatus for encoding and decoding image using adaptive dct coefficient scanning based on pixel similarity and method therefor
TWI650993B (en) Method for decoding chroma images
KR100727972B1 (en) Method and apparatus for intra prediction of video
EP3059708B1 (en) Method for encoding a coding unit at a picture boundary
EP3282696B1 (en) Method for decoding and apparatus for encoding images by using large transformation units
EP2996340B1 (en) Method and apparatus for encoding and decoding image by using large transform unit
EP1729520A2 (en) Apparatus and method for image encoding and decoding with non square macroblocks
US20070098067A1 (en) Method and apparatus for video encoding/decoding
US8090025B2 (en) Moving-picture coding apparatus, method and program, and moving-picture decoding apparatus, method and program
US20070053433A1 (en) Method and apparatus for video intraprediction encoding and decoding
EP1796395A2 (en) Method and device for intra prediction coding and decoding of images
KR100727970B1 (en) Apparatus for encoding and decoding image, and method thereof, and a recording medium storing program to implement the method
KR20110112224A (en) Method and apparatus for encoding/decoding information regarding encoding mode
EP2753081A2 (en) Image encoding/decoding method for rate-distortion optimization and device for performing same
KR20110073263A (en) Method and apparatus for encoding and decoding for intra prediction
WO2011150805A1 (en) Method, apparatus and system for processing image residual block
CA2895855C (en) Video encoding and decoding apparatus and method using quantization in sub-blocks
JP2009111712A (en) Video encoding device and method
Wu et al. Screen image sequence compression method utilizing adaptive block size coding and hierarchical GOP structure

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 07715776; Country of ref document: EP; Kind code of ref document: A1)
WWE WIPO information: entry into national phase (Ref document number: 12377617; Country of ref document: US)
NENP Non-entry into the national phase (Ref country code: DE)
NENP Non-entry into the national phase (Ref country code: RU)
122 EP: PCT application non-entry in European phase (Ref document number: 07715776; Country of ref document: EP; Kind code of ref document: A1)