KR20060109246A - Video encoding and decoding method for improving coding efficiency and apparatus thereof - Google Patents
Video encoding and decoding method for improving coding efficiency and apparatus thereof Download PDFInfo
- Publication number
- KR20060109246A KR20060109246A KR1020050065635A KR20050065635A KR20060109246A KR 20060109246 A KR20060109246 A KR 20060109246A KR 1020050065635 A KR1020050065635 A KR 1020050065635A KR 20050065635 A KR20050065635 A KR 20050065635A KR 20060109246 A KR20060109246 A KR 20060109246A
- Authority
- KR
- South Korea
- Prior art keywords
- macroblock
- frequency
- inverse
- video
- entropy
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/13—Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
동영상 부호화 기술에서 압축 효율성은 가장 중요한 성능이 되고 있다. H.264 동영상 국제 표준은 현재까지 가장 압축 성능이 우수한 기술로서 하나의 매크로블록을 4x4 블록으로 나누어 4x4 블록 단위로 DCT를 수행하고 그 결과를 CABAC 혹은 CAVLC로 복호된 데이터를 콘텍스트로 사용하여 적응적 부호화를 실시한다. 이러한 종래의 기술은 움직임 보상 혹은 스케일러블 부호화 기술 등에 있어서 잔여 매크로블럭의 부호화에 있어 최적의 압축 효율을 달성하지 못한다. In video encoding technology, compression efficiency is becoming the most important performance. The H.264 video international standard is the most highly compressed technology to date, and one macroblock is divided into 4x4 blocks to perform DCT in 4x4 blocks, and the result is adaptively used by using CABAC or CAVLC decoded data as context. Perform the encoding. This conventional technique does not achieve optimal compression efficiency in encoding residual macroblocks in motion compensation or scalable encoding techniques.
움직임 보상 혹은 스케일러블 부호화 기술 등에 있어서 잔여 매크로블럭의 부호화에 있어 압축 효율을 향상한다.Compression efficiency is improved in encoding residual macroblocks in motion compensation or scalable coding techniques.
Abstract:Abstract:
In this proposal, a method for improving SVC coding efficiency on the basis of ARR is described. The presented method proposes to implement ARR on residual signal before integer transformation in SVC enhancement layer. Experiments based on JSVM2.0 are performed with different test sequences on Palma test points. Experimental results show that the presented method gives 0.08 to 0.2dB improvement over the tested sequence based on the same bit rate. It can also be observed that the presented method effectively improves the coding efficiency of FGS enhancement layer by 14% average bit rate reduction with 0.07dB PSNR loss in enhancement layer.In this proposal, a method for improving SVC coding efficiency on the basis of ARR is described. The presented method proposes to implement ARR on residual signal before integer transformation in SVC enhancement layer. Experiments based on JSVM2.0 are performed with different test sequences on Palma test points. Experimental results show that the presented method gives 0.08 to 0.2dB improvement over the tested sequence based on the same bit rate. It can also be observed that the presented method effectively improves the coding efficiency of FGS enhancement layer by 14% average bit rate reduction with 0.07dB PSNR loss in enhancement layer.
ARRARR for improving coding efficiency of video compression for improving coding efficiency of video compression
ARR(rearrangement): The residual pixel distributions in the original 8x8 block before rearrangement and the new one after rearrangement are compared in the following figure 0.ARR (rearrangement): The residual pixel distributions in the original 8x8 block before rearrangement and the new one after rearrangement are compared in the following figure 0.
Fig. 0 Rearrangement process.Fig. 0 Rearrangement process.
As shown in Figure 0, the original residual block is shown as (a), after rearrangement, the residual data are shown in (b).As shown in Figure 0, the original residual block is shown as (a), after rearrangement, the residual data are shown in (b).
Inverse ARR process in the decoder, transforms the block in Fig. 0 (b) to the block in Fig. 0 (a).Inverse ARR process in the decoder, transforms the block in Fig. 0 (b) to the block in Fig. 0 (a).
ARR is useful for improving coding efficiency of residual texture signal in video coding. The residual signal whether by temporal prediction or spatial prediction, the error pixel values are most of cases close to zero. So after rearrangement, the rearranged 4x4 blocks also contain almost zero error pixel values. However, before rearrangement, the neighboring 4x4 residual blocks, the statistics of error pixel value can be different because that for example, in the case of temporal prediction using motion vectors, the neighboring 4x4 blocks will use different reference information with different motion vectors. There are video coding technologies that use already decoded neighboring blocks as context for enhancing coding efficiency of current block coding such as CABAC in JVT. ARR helps improving coding efficiency by making the statistics of neighboring residual blocks be similar to each other. ARR is useful for improving coding efficiency of residual texture signal in video coding. The residual signal whether by temporal prediction or spatial prediction, the error pixel values are most of cases close to zero. So after rearrangement, the rearranged 4x4 blocks also contain almost zero error pixel values. However, before rearrangement, the neighboring 4x4 residual blocks, the statistics of error pixel value can be different because that for example, in the case of temporal prediction using motion vectors, the neighboring 4x4 blocks will use different reference information with different motion vectors. There are video coding technologies that use already decoded neighboring blocks as context for enhancing coding efficiency of current block coding such as CABAC in JVT. ARR helps improving coding efficiency by making the statistics of neighboring residual blocks be similar to each other.
ARRARR implementation for implementation for JVTJVT SVC SVC
The ARR operation on residual signal has been proposed for error resilient coding purpose. In this document, ARR is adopted for improving coding efficiency in SVC enhancement layer. In SVC, MCTF is first implemented on each spatial layer, after that, the input video signal is decomposed into high-pass and low-pass signal, then, the texture information and motion information derived from the MCTF process are coded for each low-pass and high-pass picture. The SVC encoder structure with the proposed ARR module is shown in Fig. 1. The ARR operation on residual signal has been proposed for error resilient coding purpose. In this document, ARR is adopted for improving coding efficiency in SVC enhancement layer. In SVC, MCTF is first implemented on each spatial layer, after that, the input video signal is decomposed into high-pass and low-pass signal, then, the texture information and motion information derived from the MCTF process are coded for each low- pass and high-pass picture. The SVC encoder structure with the proposed ARR module is shown in Fig. One.
Fig. 1 Example of the encoder structure using ARRFig. 1 example of the encoder structure using ARR
As the figure shown, ARR module is applied to the residual texture information. As the figure shown, ARR module is applied to the residual texture information.
The proposed method can also be applied to Intra texture information. The proposed method can also be applied to Intra texture information.
In SVC, the input video signal of each layer after MCTF is decomposed into a set of temporal low-pass and high-pass pictures. For each picture, after MCTF, texture information and motion information are encoded, respectively. When fine grain SNR scalability (FGS) is enabled, the quality scalability is achieved by progressive SNR refinement texture coding. The presented method implements ARR on residual signal before DCT transformation in the coding process of high-pass picture for enhancement layer. The proposed method can be easily implemented with simple modification to the current JSVM coding structure. Furthermore, as ARR is implemented on each 8x8 inter block where 4x4 transformation is adopted, hence only one 8x8x2 Octet memory block is required for the whole ARR process In SVC, the input video signal of each layer after MCTF is decomposed into a set of temporal low-pass and high-pass pictures. For each picture, after MCTF, texture information and motion information are encoded, respectively. When fine grain SNR scalability (FGS) is enabled, the quality scalability is achieved by progressive SNR refinement texture coding. The presented method implements ARR on residual signal before DCT transformation in the coding process of high-pass picture for enhancement layer. The proposed method can be easily implemented with simple modification to the current JSVM coding structure. Additionally, as ARR is implemented on each 8x8 inter block where 4x4 transformation is adopted, hence only one 8x8x2 Octet memory block is required for the whole ARR process
Above description is the implementation of the proposed method in SVC encoder. When the decoder is concerned, the inverse operation needs to be implemented for the decoding process. Corresponding to the ARR operation in encoder, the Inverse ARR is performed at decoder side to the reconstructed residual signal which is obtained from the inverse integer transform. Above description is the implementation of the proposed method in SVC encoder. When the decoder is concerned, the inverse operation needs to be implemented for the decoding process. Corresponding to the ARR operation in encoder, the Inverse ARR is performed at decoder side to the reconstructed residual signal which is obtained from the inverse integer transform.
Experimental resultsExperimental results
To evaluate the performance of the proposed method, experiments based on JSVM2.0 are performed on Palma test points using the configuration files provided along with the software. In order to get the preliminary experimental results, ARR is implemented on luma residual signals. The experimental results are shown in Fig. 2 to Fig. 8. The legend name "PictureFormat, FrameRate and CodingMethod" represents an encoding method for the comparative experiment. For example, "QCIF 15 JSVM" represents QCIF picture format, frame rate of 15 frame/sec with the JSVM 2.0 encoder. The encoding method 'JSVM-ARR' represents the invented, JSVM 2.0 with ARR process. To evaluate the performance of the proposed method, experiments based on JSVM2.0 are performed on Palma test points using the configuration files provided along with the software. In order to get the preliminary experimental results, ARR is implemented on luma residual signals. The experimental results are shown in Fig. 2 to Fig. 8.the legend name "PictureFormat, FrameRate and CodingMethod" represents an encoding method for the comparative experiment. For example, "QCIF 15 JSVM" represents QCIF picture format, frame rate of 15 frame / sec with the JSVM 2.0 encoder. The encoding method 'JSVM-ARR' represents the invented, JSVM 2.0 with ARR process.
From the preliminary results, it can be observed that the proposed method which implements ARR in FGS coding process, achieves 0.08 to 0.2dB PSNR improvement at high bit-rate. As the proposed method is only implemented to FGS enhancement layer, so at low bit-rate, the proposed method will have similar performance as the original JSVM2.0. In addition, in order to evaluate the effectiveness of the proposed method, a simple simulation with 1 FGS layer on QCIF format base layer is made on foreman sequence. The test condition for that is illustrated in Table 1. The curves of per-frame bits reduction on FGS layer by the proposed method is given in Fig. 8. Where the ratio of bits reduction in FGS layer is obtained by (1): From the preliminary results, it can be observed that the proposed method which implements ARR in FGS coding process, achieves 0.08 to 0.2dB PSNR improvement at high bit-rate. As the proposed method is only implemented to FGS enhancement layer, so at low bit-rate, the proposed method will have similar performance as the original JSVM2.0. In addition, in order to evaluate the effectiveness of the proposed method, a simple simulation with 1 FGS layer on QCIF format base layer is made on foreman sequence. The test condition for that is illustrated in Table 1.The curves of per-frame bits reduction on FGS layer by the proposed method is given in Fig. 8.Where the ratio of bits reduction in FGS layer is obtained by (1):
r = ( ra - R) / ra x 100 (1)r = (r a -R) / r a x 100 (1)
Where r is the ratio of bits reduction, minus value represents a decrease in bit rate, while positive value indicates an increase in bit rate. ra and R represent the bits used for coding each frame by the proposed method and the original JSVM2.0, respectively. Where r is the ratio of bits reduction, minus value represents a decrease in bit rate, while positive value indicates an increase in bit rate. r a and R represent the bits used for coding each frame by the proposed method and the original JSVM2.0, respectively.
Experimental results in Fig. 8 clearly show that the proposed method significantly reduced the bits used for coding each frame in FGS layer. Around 14% average bit-rate reduction can be obtained by implementing the proposed method in FGS layer with the penalty of 0.07dB quality loss. Experimental results in Fig. 8 clearly show that the proposed method significantly reduced the bits used for coding each frame in FGS layer. Around 14% average bit-rate reduction can be obtained by implementing the proposed method in FGS layer with the penalty of 0.07dB quality loss.
Fig. 2Fig. 2
Fig. 3Fig. 3
Fig. 4Fig. 4
Fig. 5Fig. 5
Fig. 6Fig. 6
Fig. 7Fig. 7
Table 1. Test condition for analysis on FGS enhancement layer bit-rate reductionTable 1.Test condition for analysis on FGS enhancement layer bit-rate reduction
Fig. 8 Foreman sequence frame by frame performance on bits reduction in FGS enhancement layerFig. 8 Foreman sequence frame by frame performance on bits reduction in FGS enhancement layer
ConclusionConclusion
Compared with the original JSVM2.0, the proposed method significantly reduced the bit rate on FGS layer by approximately 14% with ARR. Compared with the original JSVM2.0, the proposed method significantly reduced the bit rate on FGS layer by approximately 14% with ARR.
동영상 압축 효율을 향상하는 것에 의해 동영상 저장이나 전송에 있어서 매체를 효율적으로 사용한다. By improving the video compression efficiency, the medium is efficiently used for video storage and transmission.
Claims (9)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR20050030868 | 2005-04-13 | ||
KR1020050030868 | 2005-04-13 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20060109246A true KR20060109246A (en) | 2006-10-19 |
Family
ID=37615641
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020050065635A KR20060109246A (en) | 2005-04-13 | 2005-07-20 | Video encoding and decoding method for improving coding efficiency and apparatus thereof |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20060109246A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009110720A3 (en) * | 2008-03-04 | 2009-10-29 | 삼성전자 주식회사 | Image encoding and decoding method and device |
WO2011096662A3 (en) * | 2010-02-02 | 2011-12-22 | (주)휴맥스 | Image encoding/decoding method for rate-distortion optimization and apparatus for performing same |
WO2014107072A1 (en) * | 2013-01-04 | 2014-07-10 | 삼성전자 주식회사 | Lossless-coding-mode video encoding method and device, and decoding method and device |
-
2005
- 2005-07-20 KR KR1020050065635A patent/KR20060109246A/en not_active Application Discontinuation
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009110720A3 (en) * | 2008-03-04 | 2009-10-29 | 삼성전자 주식회사 | Image encoding and decoding method and device |
US8306115B2 (en) | 2008-03-04 | 2012-11-06 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding and decoding image |
WO2011096662A3 (en) * | 2010-02-02 | 2011-12-22 | (주)휴맥스 | Image encoding/decoding method for rate-distortion optimization and apparatus for performing same |
US8792740B2 (en) | 2010-02-02 | 2014-07-29 | Humax Holdings Co., Ltd. | Image encoding/decoding method for rate-distortion optimization and apparatus for performing same |
WO2014107072A1 (en) * | 2013-01-04 | 2014-07-10 | 삼성전자 주식회사 | Lossless-coding-mode video encoding method and device, and decoding method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5453253B2 (en) | Subband scanning technique for subband entropy coding | |
JP5048146B2 (en) | Bitplane coding and decoding for AC prediction state information | |
KR100991410B1 (en) | Methods and systems for significance coefficient coding in video compression | |
Marpe et al. | Performance evaluation of Motion-JPEG2000 in comparison with H. 264/AVC operated in pure intracoding mode | |
US7277587B2 (en) | System and method for lossless video coding | |
EP2345253B1 (en) | VIDEO CODING USING TRANSFORMS BIGGER THAN 4x4 AND 8x8 | |
CA2645230C (en) | Method and system for coding and decoding information associated with video compression | |
CA2828556C (en) | Quantized pulse code modulation in video coding | |
KR101247923B1 (en) | Video coding using transforms bigger than 4×4 and 8×8 | |
GB2434050A (en) | Encoding at a higher quality level based on mixed image prediction factors for different quality levels | |
JP2009531992A5 (en) | ||
Kim et al. | Efficient RGB video coding using adaptive inter-plane residual prediction | |
Zeng et al. | A tutorial on image/video coding standards | |
EP1841235A1 (en) | Video compression by adaptive 2D transformation in spatial and temporal direction | |
KR20060109246A (en) | Video encoding and decoding method for improving coding efficiency and apparatus thereof | |
Topiwala | Comparative Study of JPEG2000 and H. 264/AVC FRExt I–Frame Coding on High-Definition Video Sequences | |
Topiwala et al. | Performance comparison of JPEG2000 and H. 264/AVC high profile intra-frame coding on HD video sequences | |
Wu et al. | Brief analysis of the H. 264 coding standard | |
Lu et al. | Performance comparison of the SVC, WSVC, and Motion JPEG2000 advanced scalable video coding schemes | |
Meerwald | The dirac video codec | |
Chitradevi et al. | Efficient Document Compression Using Intra Frame Prediction Tecthnique | |
Peng et al. | Advances of MPEG Scalable Video Coding Standard | |
Yang et al. | Improved interlayer prediction for scalable video coding | |
Campana et al. | Comparison between H. 264/AVC and JPEG2000 arithmetic encoders | |
Wu et al. | The core experiment on the SP frame coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Withdrawal due to no request for examination |