KR20130107414A - Video coding method using adaptive division transform - Google Patents
Video coding method using adaptive division transform Download PDFInfo
- Publication number
- KR20130107414A KR20130107414A KR1020120029156A KR20120029156A KR20130107414A KR 20130107414 A KR20130107414 A KR 20130107414A KR 1020120029156 A KR1020120029156 A KR 1020120029156A KR 20120029156 A KR20120029156 A KR 20120029156A KR 20130107414 A KR20130107414 A KR 20130107414A
- Authority
- KR
- South Korea
- Prior art keywords
- block
- encoding
- prediction
- transform
- image
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/567—Motion estimation based on rate distortion criteria
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
The present invention relates to a video encoding method in a video encoding apparatus.
According to an embodiment of the present invention, an image encoding method includes acquiring a prediction block of each of a plurality of candidate encoding modes, generating a differential block based on a difference between a prediction block and an input block, and determining whether to divide or transform a differential block. Determining, determining a split transform size of the differential block, split transforming the differential block to a split transform size to generate a split transform block, calculating a rate-distortion based coding cost and splitting rate of the split transform block. And determining a final encoding mode of the input block among the plurality of candidate encoding modes based on the distortion-based encoding cost.
Description
The present invention relates to a method of encoding an image, and more particularly, to a method of determining an encoding mode of an image.
Recently, as the broadcasting system supporting HD (High Definition) resolution has been expanded not only in Korea but also in the world, many users are getting used to high resolution and high quality images, and many organizations are accelerating the development of the next generation video equipment. . In addition, as interest in Ultra High Definition (UHD), which supports four times the resolution of HDTV, is increased along with HDTV, a compression technology for higher resolution and higher quality images is required.
An inter prediction technique for predicting a pixel value included in the current picture from a preceding picture and / or a succeeding picture for compression of an image, an intra prediction method for predicting a pixel value using pixel information in a picture, An entropy coding technique may be used in which a short code is assigned to a symbol having a high frequency of occurrence and / or appearance frequency, and a long code is assigned to a symbol having a low appearance frequency.
The present invention provides a method of encoding an image using adaptive segmentation transform.
The present invention provides a method for adaptively determining whether or not to perform split transformation on an encoding target block.
The present invention provides a method for adaptively determining the size of a split transform of a block to be encoded.
According to an embodiment of the present invention, an image encoding method in an image encoding apparatus is provided. The image encoding method may include: obtaining a prediction block of each of a plurality of candidate encoding modes, generating a difference block based on a difference between a prediction block and an input block, determining whether or not to divide or transform a difference block, Determining a split transform size, generating a split transform block by splitting the difference block into a split transform size, calculating a rate-distortion based encoding cost of the split transform block, and based on the rate-distortion based encoding cost Determining a final encoding mode of the input block among the plurality of candidate encoding modes.
According to the present invention, the complexity of the image encoding apparatus can be reduced.
1 is a block diagram illustrating an example of a structure of a video encoding apparatus.
2 is a block diagram illustrating an example of a structure of an image decoding apparatus.
3 is a flowchart illustrating an image encoding method according to an embodiment of the present invention.
4 illustrates intra prediction modes according to 33 directions currently supported by the HEVC test model (HM).
5 is a conceptual diagram schematically illustrating an embodiment in which one block is divided into a plurality of sub-blocks.
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. However, in describing the embodiments of the present invention, when it is determined that the detailed description of the known configuration or function may obscure the gist of the present invention, the detailed description thereof will be omitted.
When a component is described as being "connected" or "connected" to another component, it may be directly connected to or connected to another component, but another component may be present in between. . In addition, when the present invention is described as "includes" a specific component, rather than excluding components other than the component, it is understood that additional components may be included in the scope of the embodiments or technical spirit of the present invention. it means.
Terms such as "first" and "second" may be used to describe various components, but the components are not limited by the terms. In other words, the terms are used only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, the first component may be referred to as the second component, and likewise, the second component may be referred to as the first component.
In addition, the components shown in the embodiments of the present invention are shown independently to indicate that they perform different characteristic functions, and do not mean that each component may not be implemented in one hardware or software. That is, each component is divided for convenience of description, and a plurality of components may be combined to operate as one component, or one component may be divided into and operate as a plurality of components, which does not depart from the essence of the present invention. Unless included in the scope of the present invention.
In addition, some components may be optional components for improving performance rather than essential components for performing essential functions of the present invention. The present invention may be implemented in a structure including only essential components except for optional components, and a structure including only essential components is also included in the scope of the present invention.
1 is a block diagram illustrating an example of a structure of a video encoding apparatus.
Referring to FIG. 1, the
The
In the intra prediction mode, the
In the inter prediction mode, the
The
The
The coded picture needs to be decoded and stored again to be used as a reference picture for performing inter prediction coding. Accordingly, the
The
The
2 is a block diagram illustrating an example of a structure of an image decoding apparatus.
2, the
The image decoding
The
The
In the intra prediction mode, the
In the inter-prediction mode, the
The
The reconstructed image may be stored in the
Hereinafter, a block means a unit of encoding / decoding. In the encoding / decoding process, an image is divided into a predetermined size and encoded / decoded. Therefore, a block may also be called a macro block (MB), a coding unit (CU), a prediction unit (PU), a transform unit (TU), or the like. It may be divided into smaller blocks of smaller size.
Here, the prediction unit means a basic unit of performing prediction and / or motion compensation. The prediction unit may be divided into a plurality of partitions, and each partition is called a prediction unit partition. When the prediction unit is divided into a plurality of partitions, the prediction unit partition may be a basic unit of performing prediction and / or motion compensation. Hereinafter, in an embodiment of the present invention, the prediction unit may mean a prediction unit partition.
Meanwhile, in the image encoding method, an optimal encoding mode is determined using a rate-distortion based optimization technique. That is, for every candidate coding mode, the coding unit is converted into a possible size, and among them, the candidate coding mode having the minimum rate-distortion cost based coding cost is determined as the final coding mode. This process is essential for encoding high resolution video because it maximizes coding efficiency. However, this process has a problem in that it is difficult to apply to a broadcast video encoding apparatus that needs to perform real-time encoding because of its high complexity.
In the rate-distortion-based optimization technique, the present invention provides a method for determining whether a coding unit is split transformed by using features of an image when calculating a rate-distortion-based encoding cost. It also provides a method for adaptively determining the split transform size. Therefore, according to the present invention, by eliminating a part of the rate-distortion-based encoding cost calculation process using the adaptive split transformation, it is possible to maintain the performance of the coding efficiency while reducing the complexity of the video encoding apparatus.
Meanwhile, the following description is based on an encoding method and apparatus, but a person having ordinary knowledge in the art to which the present invention belongs can easily apply the technical idea of the present invention to a decoding method and apparatus.
3 is a flowchart illustrating an image encoding method according to an embodiment of the present invention.
Referring to FIG. 3, in the image encoding method according to an embodiment of the present invention, a prediction block obtaining step (S310), a differential block generation step (S320), a split transform determination step (S330), and a split size determination step (S340) A transform block generation step S350, an encoding cost calculation step S360, and a final encoding mode determination step S370 are included.
The image encoding apparatus obtains prediction blocks of each of the plurality of candidate encoding modes (S310). That is, the image encoding apparatus generates a prediction block corresponding to the candidate mode from the neighboring samples for each of the plurality of candidate encoding modes.
In the intra prediction mode, a prediction block of a block to be currently encoded is generated using spatially decoded spatial peripheral information as a reference sample. Compared to the intra prediction technique in the past video compression standard, HEVC (High Efficiency Video Coding), which is currently being standardized, supports a larger number of prediction modes as shown in Table 1.
One of the past video compression standards, H.264 / AVC, supports up to nine intra prediction modes depending on the size of the prediction unit, while HEVC supports up to 35 intra prediction modes. 4 illustrates intra prediction modes according to 33 directions currently supported by the HEVC test model (HM). The coding method adopted as a standard is integrated into one software for fair verification and easy development of coding tools. This is called HM.
Referring back to FIG. 3, the image encoding apparatus generates a difference block based on the difference between the prediction block and the input block obtained through the prediction block obtaining step S310 (S320). That is, the difference image between the original image and the predicted image is obtained.
In addition, the apparatus for encoding an image determines whether or not to segment-convert the differential block generated through the differential block generation step S320 (S330). In this case, whether to divide the differential block may be determined in consideration of the complexity of the image, whether to divide the neighboring block, or a boundary error.
The rate-distortion based encoding cost is calculated based on the encoding result of coefficients with which the differential block generated through the differential block generation step S330 is transformed. Accordingly, the rate-distortion based coding cost calculated according to the split transform size of the differential block varies.
In the conventional method of determining a coding mode using a rate-distortion based optimization technique, after performing a split transform on a plurality of split transform sizes for all candidate coding modes, the mode having the least cost and the split transform size are compared with the final coding mode. Determine the final split transform size. However, since the rate-distortion-based encoding cost has a high complexity, the existing encoding mode determination method for obtaining the cost for each split transform size has a problem that the complexity continues to increase.
An image encoding apparatus according to an embodiment of the present invention adaptively performs partition transform to increase encoding efficiency compared to complexity. For example, in a method of determining an optimal encoding mode using a rate-distortion based optimization method according to an embodiment of the present invention, whether or not to perform a split transformation is determined according to the complexity of an image and whether or not to perform a split transformation of a neighboring block.
Referring back to FIG. 3, when it is determined to perform a division transform on the difference block, the image encoding apparatus determines the division transform size of the difference block (S340). In this case, the split transform size of the difference block may be one predetermined size or a predetermined lower size. That is, the difference block may be converted into a predetermined size, or divided into a predetermined lower size to be converted.
In addition, the apparatus for encoding an image generates a split transform block by splitting the difference block into a split transform size determined through the split transform size determination step S340 (S350), and calculates a rate-distortion based encoding cost of the split transform block. (S360).
For example, when the encoding mode is determined in consideration of the rate-distortion-based encoding cost of the split-converted differential block, after calculating the rate-distortion-based encoding cost of the differential block, the differential block is transformed to a predetermined depth. The rate-distortion based coding cost of each split transform block can be obtained.
5 is a conceptual diagram schematically illustrating an embodiment in which one block is divided into a plurality of sub-blocks.
The depth of a block means the degree to which the block is divided. For example, as shown in FIG. 5, the root node that is the highest node in the tree structure is the shallowest, and the
According to an embodiment of the present invention, when the predetermined split transform depth is 3, the video encoding apparatus may determine a difference block as (a, b, j), (c, h, i), (d, e, f, g). By performing split transform, the rate-distortion-based encoding cost of each split transform block can be obtained.
On the other hand, the rate-distortion-based coding cost is calculated from the viewpoint of rate and distortion, and the rate-distortion optimization technique selects an optimal coding mode in consideration of the rate-distortion-based coding cost. .
Therefore, the image encoding apparatus calculates a rate-distortion-based encoding cost based on the bit amount and the image quality difference between the input block and the reconstruction block when the transform coefficients of the transformed transform block are transformed through the transform block generation step S350. do.
Referring to FIG. 3 again, the apparatus for encoding an image determines a final encoding mode based on the rate-distortion based encoding cost calculated through the encoding cost calculating step S360 (S370). That is, the encoding mode having the lowest rate-distortion based encoding cost among the plurality of candidate encoding modes is determined as the final encoding mode.
In this case, whether or not to perform a split transform and a split transform size may be encoded or stored together with the determined mode.
On the other hand, while the above-described embodiments are described through a flowchart represented by a series of steps or blocks, the present invention is not limited to the order of the above-described steps, some steps may occur in a different order or at the same time with other steps. have. In addition, one of ordinary skill in the art will appreciate that the steps shown in the flowcharts are not exclusive, that other steps may be included or some steps may be deleted.
In addition, the above-described embodiments include examples of various aspects. While not all possible combinations can be described to illustrate various aspects, one of ordinary skill in the art will recognize that other combinations are possible. Accordingly, it is intended that the invention include all alternatives, modifications and variations that fall within the scope of the following claims.
Claims (1)
Obtaining a prediction block of each of the plurality of candidate encoding modes;
Generating a difference block based on the difference between the prediction block and the input block;
Determining whether to divide or transform the difference block;
In case of splitting the difference block,
Determining a split transform size of the difference block;
Generating a split transform block by splitting the difference block into the split transform size;
Calculating a rate-distortion based encoding cost of the split transform block; And
And determining a final encoding mode of the input block among the plurality of candidate encoding modes based on the rate-distortion based encoding cost.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020120029156A KR20130107414A (en) | 2012-03-22 | 2012-03-22 | Video coding method using adaptive division transform |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020120029156A KR20130107414A (en) | 2012-03-22 | 2012-03-22 | Video coding method using adaptive division transform |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20130107414A true KR20130107414A (en) | 2013-10-02 |
Family
ID=49630720
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020120029156A KR20130107414A (en) | 2012-03-22 | 2012-03-22 | Video coding method using adaptive division transform |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20130107414A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019009502A1 (en) * | 2017-07-06 | 2019-01-10 | 삼성전자 주식회사 | Video coding method and device, video decoding method and device |
-
2012
- 2012-03-22 KR KR1020120029156A patent/KR20130107414A/en not_active Application Discontinuation
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019009502A1 (en) * | 2017-07-06 | 2019-01-10 | 삼성전자 주식회사 | Video coding method and device, video decoding method and device |
US11196994B2 (en) | 2017-07-06 | 2021-12-07 | Samsung Electronics Co., Ltd. | Video coding method and device, video decoding method and device |
US11689721B2 (en) | 2017-07-06 | 2023-06-27 | Samsung Electronics Co.. Ltd. | Video coding method and device, video decoding method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102518242B1 (en) | Encoding And Decoding Methods For Video Information | |
KR101962183B1 (en) | Method for encoding/decoding an intra prediction mode and apparatus for the same | |
KR102086145B1 (en) | Method for intra prediction and apparatus thereof | |
US11044482B2 (en) | Method for encoding/decoding image and device using same | |
JP2024052919A (en) | Video Decoding Method | |
JP6518274B2 (en) | Video decoding method and video coding method | |
KR102175472B1 (en) | Method for intra prediction and apparatus thereof | |
KR102138828B1 (en) | Method and Apparatus for image encoding | |
JP2015136150A (en) | Method of encoding division block in video encoding, method of decoding division block in video decoding, and recording medium for implementing the same | |
KR20130112374A (en) | Video coding method for fast intra prediction and apparatus thereof | |
KR20230035300A (en) | Method and apparatus for deciding boundary filtering strength of deblocking filtering | |
KR101688085B1 (en) | Video coding method for fast intra prediction and apparatus thereof | |
KR20130107414A (en) | Video coding method using adaptive division transform | |
KR20140043014A (en) | Method and apparatus for image encoding | |
KR101436949B1 (en) | Method and apparatus for encoding picture, and apparatus for processing picture | |
KR101730593B1 (en) | Fast video coding method | |
KR20130107611A (en) | Methods of encoding and decoding using bottom-up prediction mode decision and apparatuses for using the same | |
KR20120095794A (en) | Fast video coding method | |
KR20120131038A (en) | Method and apparatus for predicting encoding information using temporal and spatial correlation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WITN | Withdrawal due to no request for examination |