KR101928185B1

KR101928185B1 - Apparatus for hevc coding and method for predicting coding unit depth range using the same

Info

Publication number: KR101928185B1
Application number: KR1020170060103A
Authority: KR
Inventors: 박상주
Original assignee: 홍익대학교 산학협력단
Priority date: 2017-05-15
Filing date: 2017-05-15
Publication date: 2018-12-11
Also published as: KR20180125326A

Abstract

본 발명은 HEVC 부호화 장치 및 그것을 이용한 코딩 유닛 깊이 범위 예측 방법에 관한 것이다.
본 발명에 따르면, HEVC 부호화 장치를 이용한 코딩 유닛 깊이 범위 예측 방법에 있어서, 코딩 유닛 깊이 범위 예측 방법은 현재 영상 프레임의 코딩 트리 유닛(CTU)에 대한 율-왜곡 비용을 연산하는 단계, 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 기 설정된 가중치를 곱한 값과 상기 현재 영상 프레임의 CTU에 대한 율-왜곡 비용을 비교하는 단계, 그리고 상기 비교 결과에 따라, 상기 현재 영상 프레임의 CTU에 대한 코딩 유닛(CU) 깊이 범위를 이용하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위를 예측하는 단계를 포함한다.
이와 같이 본 발명에 따르면, 예측된 CU 깊이 범위에 포함되지 않은 CU 분할에 대한 RDOQ를 수행하지 않으므로, HEVC 부호화를 고속으로 수행할 수 있다. 뿐만 아니라, 상호관련성이 높은 CTU의 율-왜곡 비용 정보를 이용하여 CU 깊이 범위를 예측하므로 고속 압축에도 불구하고 영상 압축시 성능의 열화가 거의 발생하지 않는 장점이 있다. The present invention relates to an HEVC coding apparatus and a coding unit depth range predicting method using the same.
According to the present invention, in a method of predicting a coding unit depth range using an HEVC coding apparatus, a coding unit depth range predicting method includes calculating a rate-distortion cost for a coding tree unit (CTU) Comparing the rate-distortion cost for the CTU of the current image frame by a predetermined weight and a rate-distortion cost for the CTU of the current image frame; (CU) depth range to predict the CU depth range for the CTU of the next image frame.
As described above, according to the present invention, since the RDOQ for CU division not included in the predicted CU depth range is not performed, HEVC coding can be performed at a high speed. In addition, since the CU depth range is predicted by using the rate-distortion cost information of the highly correlated CTU, there is an advantage that the degradation of the performance in the image compression hardly occurs despite the high compression.

Description

[0001] The present invention relates to an HEVC encoding apparatus and a coding unit depth range predicting method using the HEVC encoding apparatus and a method of predicting the depth range of a coding unit using the HEVC encoding apparatus.

본 발명은 HEVC 부호화 장치 및 그것을 이용한 코딩 유닛 깊이 범위 예측 방법에 관한 것으로서, 더욱 상세하게는 HEVC 영상 압축시 압축 성능의 열화가 적은 반면 압축 속도를 향상시킬 수 있는 HEVC 부호화 장치 및 그것을 이용한 코딩 유닛 깊이 범위 예측 방법에 관한 것이다.The present invention relates to an HEVC coding apparatus and a coding unit depth prediction method using the same, and more particularly, to a HEVC coding apparatus and a coding unit using the same, To a range predicting method.

최근 스마트폰, 태블릿 등 모바일 기기들이 HD 1080p과 같은 고화질 영상을 지원하고 있을 뿐만 아니라, QHD(Quad High Definition)나 UHD(Ultra High Definition)와 같은 초고해상도를 지원하는 디스플레이 장치들이 속속 등장하고 있다. 더불어, 3D 고화질 영상 서비스나 다시점 영상 서비스 등의 제공이 증가하고 있다. Recently, mobile devices such as smartphones and tablets not only support high-definition video such as HD 1080p but also display devices supporting ultrahigh-resolution such as QHD (Quad High Definition) or UHD (Ultra High Definition). In addition, 3D high-definition video services and multi-point video services are increasingly being provided.

이러한 고화질 영상이나 입체 영상의 경우 필연적으로 영상 데이터의 고용량화를 수반하며, 고용량의 영상 데이터를 전송하기 위한 문제가 발생하고 있다. 이에 따라, 화질의 열화는 최소화하되 종래의 영상 압축 기법보다 압축 효율이 비약적으로 높은 새로운 영상 압축기법에 대한 요구가 높다. Such a high-quality image or a stereoscopic image inevitably accompanies high-capacity image data, and a problem arises for transmitting high-capacity image data. Accordingly, there is a high demand for a new image compression technique in which deterioration of image quality is minimized but the compression efficiency is remarkably higher than that of the conventional image compression technique.

고효율 영상 압축 기법(High Efficiency Video Coding, HEVC)은 종래 영상 압축 기법인 H.264/AVC 대비 50%의 비트 레이트(bit rate)로 부호화하여 2배의 압축 성능을 제공하는 새로운 영상 압축 표준화 기술이다. High Efficiency Video Coding (HEVC) is a new image compression standardization technology that provides twice the compression performance by coding at a bit rate of 50% compared to the conventional image compression technique H.264 / AVC .

도 1은 HEVC 기법의 블록 단위를 설명하기 위한 도면이다. 1 is a diagram for explaining a block unit of the HEVC technique.

도 1에 나타난 바와 같이, HEVC는 쿼드트리(quad-tree) 기반의 블록을 이용하고 있으며, CTU(Coding tree Unit), CU(Coding Unit), PU(Prediction Unit), TU(Transform Unit)와 같은 블록 단위를 사용한다. 구체적으로 CTU는 H.264/AVC의 매크로블록에 대응하는 개념으로서, 각각의 CTU는 도 1의 (a)에서와 같이 복수개의 CU로 분할된다. PU는 화면 내 예측 또는 화면 간 예측에 따라 압축 성능을 향상시키기 위하여 CU로부터 분할된 블록으로서, 도 1의 (b)와 같이 나타난다. 도 1의 (c)에 나타난 TU는 2차원 변환이 적용되는 분할 영역을 의미한다. 이와 같이, 쿼드트리 기반의 HEVC는 CTU, CU, PU, TU의 크기를 유연하게 결정할 수 있어 기존의 압축 표준에 비해 압축률이 크게 향상되었다. As shown in FIG. 1, the HEVC uses a quad-tree based block, and the HEVC includes blocks such as a coding tree unit (CTU), a coding unit (CU), a prediction unit (PU) Block units are used. Specifically, a CTU corresponds to a macroblock of H.264 / AVC, and each CTU is divided into a plurality of CUs as shown in FIG. 1 (a). The PU is divided from the CU as shown in FIG. 1 (b) in order to improve the compression performance according to intra-picture prediction or inter-picture prediction. The TU shown in FIG. 1 (c) means a divided area to which the two-dimensional transformation is applied. In this way, the quadtree-based HEVC can flexibly determine the sizes of CTU, CU, PU, and TU, which greatly improves the compression ratio compared to the conventional compression standard.

이와 같이, HEVC는 종래의 표준 영상 압축 기술보다 뛰어난 압축 효율을 보여주나, 부호화의 복잡도가 크게 증가하여 상용화에 어려움을 겪고 있다. 그러므로 HEVC가 상용화되기 위해서는 화질의 열화가 적은 반면 빠른 압축 속도를 제공하는 고속 부호화 알고리즘을 필요로 한다. As described above, HEVC shows a compression efficiency higher than that of the conventional standard image compression technology, but has a difficulty in commercialization because the coding complexity increases greatly. Therefore, the commercialization of HEVC requires a high-speed encoding algorithm that provides fast compression rate while having little deterioration of image quality.

본 발명의 배경이 되는 기술은 한국등록특허 제10-1620755호(2016.05.12.공고)에 개시되어 있다. The technology of the background of the present invention is disclosed in Korean Patent No. 10-1620755 (published on May 12, 2016).

본 발명이 이루고자 하는 기술적 과제는 HEVC 영상 압축시 압축 성능의 열화가 적은 반면 압축 속도를 향상시킬 수 있는 EVC 부호화 장치 및 그것을 이용한 코딩 유닛 깊이 범위 예측 방법을 제공하기 위한 것이다.SUMMARY OF THE INVENTION The present invention is directed to an EVC encoding apparatus and a coding unit depth prediction method using the EVC encoding apparatus, which can reduce a compression performance deterioration during HEVC image compression while improving a compression rate.

이러한 기술적 과제를 이루기 위한 본 발명의 실시예에 따르면 HEVC 부호화 장치를 이용한 코딩 유닛 깊이 범위 예측 방법에 있어서, 코딩 유닛 깊이 범위 예측 방법은 현재 영상 프레임의 코딩 트리 유닛(CTU)에 대한 율-왜곡 비용을 연산하는 단계, 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 기 설정된 가중치를 곱한 값과 상기 현재 영상 프레임의 CTU에 대한 율-왜곡 비용을 비교하는 단계, 그리고 상기 비교 결과에 따라, 상기 현재 영상 프레임의 CTU에 대한 코딩 유닛(CU) 깊이 범위를 이용하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위를 예측하는 단계를 포함한다. According to an aspect of the present invention, there is provided a method of predicting a coding unit depth range using an HEVC coding apparatus, the method comprising: estimating a coding unit depth range based on a rate- Comparing the rate-distortion cost for the CTU of the previous image frame multiplied by a predetermined weight to the rate-distortion cost for the CTU of the current image frame, and comparing the rate- And using the coding unit (CU) depth range for the CTU of the image frame to predict the CU depth range for the CTU of the next image frame.

상기 현재 영상 프레임의 CTU, 상기 이전 영상 프레임의 CTU 및 상기 다음 영상 프레임의 CTU는 영상 내 서로 동일한 영역에 위치할 수 있다. The CTU of the current image frame, the CTU of the previous image frame, and the CTU of the next image frame may be located in the same area in the image.

상기 다음 영상 프레임의 CTU에 대한 CU 깊이 범위를 예측하는 단계는, 상기 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 기 설정된 제1 가중치를 곱한 값보다 작으면, 상기 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 다음 영상 프레임의 CTU에 대한 CU 깊이 범위로 예측하고, 상기 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 상기 제1 가중치를 곱한 값보다 크거나 같으면, 상기 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 확장하여 다음 영상 프레임의 CTU에 대한 코딩 유닛 깊이 범위로 예측할 수 있다. The step of predicting the CU depth range for the CTU of the next image frame may include calculating a rate-distortion cost for the CTU of the current image frame by multiplying the rate-distortion cost for the CTU of the previous image frame by a first weight, Estimates the CU depth range for the CTU of the current image frame to the CU depth range for the CTU of the next image frame and if the rate-distortion cost for the CTU of the current image frame is less than the CU depth for the CTU of the previous image frame, Rate-distortion cost multiplied by the first weight, the CU depth range for the CTU of the current image frame may be extended to predict the coding unit depth range for the CTU of the next image frame.

상기 다음 영상 프레임의 CTU에 대한 CU 깊이 범위를 예측하는 단계는, 상기 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 상기 제1 가중치를 곱한 값보다 크거나 같고 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 상기 제1 가중치보다 큰 제2 가중치를 곱한 값보다 작으면, 상기 현재 영상 프레임의 CTU에 대한 CU 깊이 범위의 최대값보다 1만큼 크게 확장하거나 최소값보다 1만큼 작게 확장하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위로 예측하고, 상기 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 상기 제2 가중치를 곱한 값보다 크거나 같으면, 상기 현재 영상 프레임의 CTU에 대한 CU 깊이 범위의 최대값보다 1만큼 크고 최소값보다 1만큼 작게 확장하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위로 예측할 수 있다. The step of predicting the CU depth range for the CTU of the next image frame may include comparing the rate-distortion cost for the CTU of the current image frame with the rate-distortion cost for the CTU of the previous image frame multiplied by the first weight Is greater than or equal to the maximum value of the CU depth range for the CTU of the current image frame if the ratio is equal to or greater than the value obtained by multiplying the rate-distortion cost for the CTU of the previous image frame by a second weight greater than the first weight, Distortion cost for the CTU of the next image frame to a rate-distortion cost for the CTU of the previous image frame by predicting the rate-distortion cost for the CTU of the current image frame to 1 / If the value of the CU is greater than or equal to the value obtained by multiplying the weight of the current image frame by 1, Can be predicted by the CU depth range for the CTU of the image frame.

상기 다음 영상 프레임의 CTU에 대한 CU 깊이 범위를 예측하는 단계는, 기 저장된 재설정 간격에 따라 상기 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 재설정할 수 있다.The step of predicting the CU depth range for the CTU of the next image frame may reset the CU depth range for the CTU of the current image frame to the maximum according to the pre-stored reset interval.

상기 다음 영상 프레임의 CTU에 대한 CU 깊이 범위를 예측하는 단계는, 이전 영상 프레임과 현재 영상 프레임의 픽셀값을 이용하여 장면 전환(scene change) 여부를 판단하는 단계, 그리고 상기 장면 전환이 되지 않았다고 판단되면 상기 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 재설정하지 않고, 상기 장면 전환이 되었다고 판단되면 상기 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 재설정하는 단계를 포함할 수 있다. The step of predicting the CU depth range for the CTU of the next image frame may include determining a scene change using the pixel values of the previous image frame and the current image frame, And resetting the CU depth range for the CTU of the current image frame to the maximum if it is determined that the scene change has been performed without resetting the CU depth range for the CTU of the current image frame to the maximum.

상기 기 설정된 가중치는, 0에서 무한대(∞)의 값 중 하나를 가지며, 영상 코딩 압축 효율이 높을수록 작은 값으로 설정되고, 영상 코딩 속도가 높을수록 큰 값으로 설정될 수 있다. The predetermined weight has one of values from 0 to infinity, and is set to a smaller value as the image coding compression efficiency is higher, and may be set to a larger value as the image coding rate is higher.

상기 예측된 CU 깊이 범위를 이용하여 다음 영상 프레임의 CTU에 대한 CU 분할을 수행하여 상기 다음 영상 프레임의 CTU에 대한 CU 최적 구조를 생성하는 단계를 더 포함할 수 있다. And performing a CU division on the CTU of the next image frame using the predicted CU depth range to generate a CU optimal structure for the CTU of the next image frame.

상기 현재 영상 프레임의 코딩 트리 유닛(CTU)에 대한 율-왜곡 비용을 연산하는 단계는, 상기 현재 영상 프레임의 CTU에 대한 CU 최적 구조가 형성된 상태에서 상기 현재 영상 프레임의 CTU에 대한 율-왜곡 비용을 연산하며, 아래의 수학식을 이용하여 상기 율-왜곡 비용(J_SATD)을 연산할 수 있다. Wherein the step of calculating a rate-distortion cost for a coding tree unit (CTU) of the current image frame comprises: calculating a rate-distortion cost for the CTU of the current image frame in a state in which a CU optimal structure for the CTU of the current image frame is formed; And calculates the rate-distortion cost (J _SATD ) using the following equation.

여기서, SATD는 CU 최적 구조가 형성된 CTU에 대한 원 화소값과 복원 화소값 사이의 하다마드 변환 왜곡값을 의미하고, λ_pred는 라그랑주 승수를 의미하고, B_pred는 상기 CU 최적 구조가 형성된 CTU의 부호화에 필요한 비트수를 의미한다. Here, SATD denotes a Hadamard transform distortion value between an original pixel value and a reconstructed pixel value for a CTU in which a CU optimum structure is formed _{,? Pred} denotes a Lagrangian multiplier, and _Bred denotes a CTU optimum value Means the number of bits necessary for encoding.

본 발명의 다른 실시예에 따른 HEVC 부호화 장치는 현재 영상 프레임의 코딩 트리 유닛(CTU)에 대한 율-왜곡 비용을 연산하는 연산부, 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 기 설정된 가중치를 곱한 값과 상기 현재 영상 프레임의 CTU에 대한 율-왜곡 비용을 비교하는 비교부, 그리고 상기 비교 결과에 따라, 상기 현재 영상 프레임의 CTU에 대한 코딩 유닛(CU) 깊이 범위를 이용하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위를 예측하는 예측부를 포함한다.The HEVC encoding apparatus according to another embodiment of the present invention includes an operation unit for calculating a rate-distortion cost for a current frame of a CTU, a rate-distortion cost for a CTU of a previous frame multiplied by a predetermined weight (CU) depth value of the current image frame to a CTU of the next image frame according to a result of the comparison, Lt; RTI ID = 0.0 > CU < / RTI >

이와 같이 본 발명에 따르면, 예측된 CU 깊이 범위에 포함되지 않은 CU 분할에 대한 RDOQ를 수행하지 않으므로, HEVC 부호화를 고속으로 수행할 수 있다. 뿐만 아니라, 상호관련성이 높은 CTU의 율-왜곡 비용 정보를 이용하여 CU 깊이 범위를 예측하므로 고속 압축에도 불구하고 영상 압축시 성능의 열화가 거의 발생하지 않는 장점이 있다. As described above, according to the present invention, since the RDOQ for CU division not included in the predicted CU depth range is not performed, HEVC coding can be performed at a high speed. In addition, since the CU depth range is predicted by using the rate-distortion cost information of the highly correlated CTU, there is an advantage that the degradation of the performance in the image compression hardly occurs despite the high compression.

도 1은 HEVC 기법의 블록 단위를 설명하기 위한 도면이다.
도 2은 본 발명의 실시예에 따른 HEVC 부호화 장치의 구성도이다.
도 3는 본 발명의 실시예에 따른 코딩 유닛 깊이 범위 예측 방법의 순서도이다.
도 4는 도 3의 S250 단계를 상세하게 설명하기 위한 도면이다.
도 6는 하나의 영상 프레임에 포함된 CTU간 율-왜곡 비용을 나타낸 그래프이다.
도 7는 영상 내 동일 영역에 위치하는 CTU간 율-왜곡 비용을 나타낸 그래프이다.1 is a diagram for explaining a block unit of the HEVC technique.
2 is a configuration diagram of an HEVC encoding apparatus according to an embodiment of the present invention.
3 is a flowchart of a method of predicting a coding unit depth range according to an embodiment of the present invention.
FIG. 4 is a diagram for explaining step S250 of FIG. 3 in detail.
FIG. 6 is a graph illustrating the CTU inter-distortion cost included in one image frame.
7 is a graph showing CTU inter-rate distortion costs located in the same area in an image.

아래에서는 첨부한 도면을 참고로 하여 본 발명의 실시예에 대하여 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 상세히 설명한다. 그러나 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 실시예에 한정되지 않는다. 그리고 도면에서 본 발명을 명확하게 설명하기 위해서 설명과 관계없는 부분은 생략하였으며, 명세서 전체를 통하여 유사한 부분에 대해서는 유사한 도면 부호를 붙였다.Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those skilled in the art can easily carry out the present invention. The present invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. In order to clearly illustrate the present invention, parts not related to the description are omitted, and similar parts are denoted by like reference characters throughout the specification.

명세서 전체에서, 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있다는 것을 의미한다. Throughout the specification, when an element is referred to as "comprising ", it means that it can include other elements as well, without excluding other elements unless specifically stated otherwise.

그러면 첨부한 도면을 참고로 하여 본 발명의 실시예에 대하여 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 상세히 설명한다. Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those skilled in the art can easily carry out the present invention.

우선, 도 2를 통해 본 발명의 실시예에 따른 HEVC 부호화 장치의 구성에 대해 살펴보도록 한다. First, a configuration of an HEVC encoding apparatus according to an embodiment of the present invention will be described with reference to FIG.

도 2는 본 발명의 실시예에 따른 HEVC 부호화 장치의 구성도이다. 2 is a configuration diagram of an HEVC encoding apparatus according to an embodiment of the present invention.

도 2에 도시된 바와 같이, 본 발명의 실시예에 따른 HEVC 부호화 장치(100)는 연산부(110), 비교부(120) 및 예측부(130)를 포함하며, 분할부(140)를 더 포함할 수 있다. 2, the HEVC encoding apparatus 100 according to the embodiment of the present invention includes an operation unit 110, a comparison unit 120, and a prediction unit 130, and further includes a division unit 140 can do.

먼저, 연산부(110)는 현재 영상 프레임의 코딩 트리 유닛(Coding Tree Unit, 이하 CTU라 한다)에 대한 율-왜곡 비용(Rate-Distortion cost, RD cost)을 연산한다. First, the operation unit 110 calculates a Rate-Distortion cost (RD cost) for a Coding Tree Unit (CTU) of a current image frame.

이때, 연산부(110)는 현재 영상 프레임의 CTU에 대한 최적 구조로 코딩 유닛(Coding Unit, 이하 CU라 한다) 분할이 수행된 상태, 즉, 현재 영상 프레임의 CTU가 최적 구조로 CU 분할된 상태에서 현재 영상 프레임의 CTU에 대한 율-왜곡 비용을 연산한다. At this time, the calculation unit 110 determines whether the CU unit is divided into CUs (i.e., CUs) with the optimal structure for the CTU of the current image frame, i.e., Calculates the rate-distortion cost for the CTU of the current video frame.

다음으로, 비교부(120)는 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 기 설정된 가중치를 곱한 값과 현재 영상 프레임의 CTU에 대한 율-왜곡 비용을 비교한다. Next, the comparison unit 120 compares the rate-distortion cost for the CTU of the previous image frame multiplied by the predetermined weight and the rate-distortion cost for the CTU of the current image frame.

여기서, 이전 영상 프레임의 CTU에 대한 율-왜곡 비용은 본 발명의 실시예에 따른 HEVC 부호화 장치(100)에 기 저장될 수 있다. Here, the rate-distortion cost for the CTU of the previous image frame may be stored in the HEVC encoding apparatus 100 according to the embodiment of the present invention.

그리고, 기 설정된 가중치는 0에서 무한대(∞)의 값 중 하나를 가진다. 그리고, 기 설정된 가중치는 영상 코딩 압축 효율이 높을수록 작은 값으로 설정되고, 영상 코딩 속도가 높을수록 큰 값으로 설정된다. 따라서, 사용자는 영상 코딩 압축 효율과 영상 코딩 속도를 고려하여 가중치를 설정할 수 있다. The predetermined weight has one of values of 0 to infinity. The predetermined weight is set to a smaller value as the image coding compression efficiency is higher, and is set to a larger value as the image coding speed is higher. Accordingly, the user can set a weight value in consideration of the image coding compression efficiency and the image coding speed.

다음으로, 예측부(130)는 비교 결과에 따라, 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 이용하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위를 예측한다. Next, the predictor 130 predicts the CU depth range for the CTU of the next image frame using the CU depth range for the CTU of the current image frame, according to the comparison result.

구체적으로, 예측부(130)는 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 기 설정된 제1 가중치를 곱한 값보다 작으면, 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 다음 영상 프레임의 CTU에 대한 CU 깊이 범위로 예측한다. Specifically, if the rate-distortion cost for the CTU of the current image frame is less than the value of the rate-distortion cost for the CTU of the previous image frame multiplied by the first weight, To the CU depth range for the CTU of the next image frame.

반면, 예측부(130)는 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 제1 가중치를 곱한 값보다 크거나 같으면, 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 확장하여 다음 영상 프레임의 CTU에 대한 코딩 유닛 깊이 범위로 예측한다. On the other hand, if the rate-distortion cost for the CTU of the current image frame is greater than or equal to the rate-distortion cost for the CTU of the previous image frame times the first weight, The CU depth range is extended to predict the coding unit depth range for the CTU of the next image frame.

이때, 예측부(130)는 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 제1 가중치를 곱한 값보다 크거나 같고 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 제1 가중치보다 큰 제2 가중치를 곱한 값보다 작으면, 현재 영상 프레임의 CTU에 대한 CU 깊이 범위의 최대값보다 1만큼 크게 확장하거나 최소값보다 1만큼 작게 확장하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위로 예측한다. At this time, the predicting unit 130 determines whether the rate-distortion cost for the CTU of the current image frame is greater than or equal to the rate-distortion cost for the CTU of the previous image frame multiplied by the first weight, Distortion cost is multiplied by a second weight greater than the first weight, it is expanded by one more than the maximum value of the CU depth range for the CTU of the current image frame or by one less than the minimum value, CU < / RTI >

그리고, 예측부(130)는 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 상기 제2 가중치를 곱한 값보다 크거나 같으면, 현재 영상 프레임의 CTU에 대한 CU 깊이 범위의 최대값보다 1만큼 크고 최소값보다 1만큼 작게 확장하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위로 예측한다. If the rate-distortion cost for the CTU of the current image frame is greater than or equal to the rate-distortion cost for the CTU of the previous image frame multiplied by the second weight, the predicting unit 130 determines whether the rate- Expansion is made by 1 larger than the maximum value of the CU depth range and by 1 smaller than the minimum value to predict the CU depth range for the CTU of the next image frame.

여기서, 현재 영상 프레임의 CTU, 이전 영상 프레임의 CTU 및 다음 영상 프레임의 CTU는 영상 내 서로 동일한 영역에 위치한다. Here, the CTU of the current image frame, the CTU of the previous image frame, and the CTU of the next image frame are located in the same area in the image.

구체적으로, 하나의 영상에 포함된 복수의 영상 프레임들은 서로 같은 크기로 형성된다. 그러므로, 각각의 프레임을 부호화하는 과정에서 복수의 CTU로 프레임을 분할하여 넘버링하게 되면, 각 프레임의 CTU 넘버가 동일한 CTU들은 영상내에서 동일한 위치에 형성된다. Specifically, a plurality of image frames included in one image are formed to have the same size. Therefore, when dividing a frame into a plurality of CTUs in the process of coding each frame, CTUs having the same CTU number of each frame are formed at the same position in the image.

예를 들어, 본 발명의 실시예에 따른 HEVC 부호화 장치(100)는 23번째 영상 프레임의 34번째 CTU 정보와 24번째 영상 프레임의 34번째 CTU 정보를 이용하여 25번째 영상 프레임의 34번째 CTU에 대한 CU 깊이 범위를 예측한다. For example, the HEVC encoding apparatus 100 according to an exemplary embodiment of the present invention may use the 34th CTU information of the 23rd image frame and the 34th CTU information of the 24th image frame to calculate the 34th CTU of the 25th image frame, Predicts the CU depth range.

그러므로, 본 발명의 실시예에 따른 HEVC 부호화 장치(100)는 각 영상 프레임별로 동일한 영역에 위치한 CTU의 정보를 이용하여 CU 깊이 범위를 예측한다. Therefore, the HEVC encoding apparatus 100 according to the embodiment of the present invention predicts the CU depth range using information of the CTUs located in the same area for each image frame.

한편, 예측부(130)는 기 저장된 재설정 간격에 따라 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 재설정한다. On the other hand, the predictor 130 resets the CU depth range for the CTU of the current image frame to the maximum according to the previously stored reset interval.

다른 한편으로, 예측부(130)는 이전 영상 프레임과 현재 영상 프레임의 픽셀값을 이용하여 장면 전환(scene change) 여부를 판단하며, 장면 전환이 되지 않았다고 판단되면 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 재설정하지 않고, 장면 전환이 되었다고 판단되면 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 재설정한다. On the other hand, the predicting unit 130 determines whether or not a scene change is performed using the pixel values of the previous image frame and the current image frame. If it is determined that the scene change has not occurred, the CU depth If it is determined that the scene change has been made without resetting the range to the maximum, the CU depth range for the CTU of the current image frame is reset to the maximum.

다음으로, 분할부(140)는 예측된 CU 깊이 범위를 이용하여 다음 영상 프레임의 CTU에 대한 CU 분할을 수행하여 다음 영상 프레임의 CU 최적 구조를 생성한다. Next, the partitioning unit 140 generates a CU optimal structure of the next image frame by performing CU division on the CTU of the next image frame using the predicted range of the CU depth.

한편, 본 발명의 실시예에 따르면, 코딩 유닛 깊이 범위는 0에서 4까지의 정수값 중 최소값 및 최대값을 가진다. Meanwhile, according to the embodiment of the present invention, the coding unit depth range has a minimum value and a maximum value among the integer values from 0 to 4.

HEVC 기법에서 CTU는 64X64, 32X32, 16X16, 8X8 크기 중 어느 하나로 선택이 가능하며, 쿼드트리(quad tree) 형태로 CU 분할을 수행한다. In the HEVC technique, CTU can be selected from 64X64, 32X32, 16X16, and 8X8 sizes, and performs CU partitioning in the form of a quad tree.

종래의 HEVC 기법의 경우, 64X64 크기의 CTU에서 최대 8X8까지 쿼드 트리 형태로 분할함을 가정하여 최대 깊이 범위를 [0,3]으로 설정하고 있으며, 4X4 크기의 쿼드 트리 분할은 예측 유닛(Prediction Unit, PU) 분할 단계에서 수행한다. In the conventional HEVC technique, the maximum depth range is set to [0, 3] assuming that the CTU is divided into a quadtree form from a CTX size of 64X64 to a maximum of 8X8. A quad- , PU).

그러나, 본 발명의 실시예에 따르면, 4X4 크기로의 분할까지 CU 분할 단계로 설정하고, 64X64크기의 CTU에서 최대 깊이 범위를 [0,4]로 설정할 수 있다. However, according to the embodiment of the present invention, it is possible to set the CU division step up to the division into the 4X4 size, and to set the maximum depth range to [0, 4] in the 64X64 CTU size.

다음으로, 도 3 및 도 4를 통해 본 발명의 실시예에 따른 코딩 유닛 깊이 범위 예측 방법에 대해 살펴보도록 한다. Next, a method of predicting the depth of a coding unit according to an embodiment of the present invention will be described with reference to FIGS. 3 and 4. FIG.

도 3은 본 발명의 실시예에 따른 코딩 유닛 깊이 범위 예측 방법의 순서도이다. 3 is a flowchart of a method of predicting a coding unit depth range according to an embodiment of the present invention.

우선, 연산부(110)는 현재 영상 프레임의 CTU에 대한 율-왜곡 비용을 연산한다(S210). First, the operation unit 110 calculates the rate-distortion cost for the CTU of the current image frame (S210).

구체적으로, 연산부(110)는 아래의 수학식 1을 이용하여 율-왜곡 비용(J_SATD)을 연산한다. Specifically, the operation unit 110 calculates the rate-distortion cost (J _SATD ) using the following equation (1).

여기서, SATD는 CU 최적 구조가 형성된 CTU에 대한 원 화소값과 복원 화소값 사이의 하다마드 변환 왜곡값(Sum of Absolute Transformed Differences)을 의미하고, λ_pred는 라그랑주 승수(Lagrangian multiplier)를 의미하고, B_pred는 CU 최적 구조가 형성된 CTU의 부호화에 필요한 비트수를 의미한다. Here, SATD denotes a Sum of Absolute Transformed Differences between an original pixel value and a reconstructed pixel value for a CTU in which a CU optimal structure is formed, _ldpred denotes a Lagrangian multiplier, B _pred means the number of bits required for coding the CTU in which the CU optimum structure is formed.

다음으로, 비교부(120)는 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 기 설정된 가중치를 곱한 값과 현재 영상 프레임의 CTU에 대한 율-왜곡 비용을 비교한다(S220). Next, the comparison unit 120 compares the rate-distortion cost for the CTU of the previous image frame multiplied by a predetermined weight and the rate-distortion cost for the CTU of the current image frame at step S220.

여기서, 기 설정된 가중치는 제1 가중치 및 제1 가중치보다 큰 값을 가지는 제2 가중치를 포함할 수 있다. Here, the predetermined weight may include a first weight and a second weight having a larger value than the first weight.

이때, 기 설정된 가중치는 0에서 무한대(∞)의 값 중 하나를 가지며, 영상 코딩 압축 효율이 높을수록 작은 값으로 설정되고, 영상 코딩 속도가 높을수록 큰 값으로 설정된다. At this time, the predetermined weight has one of values from 0 to infinity. The higher the image coding compression efficiency, the smaller the value is set, and the higher the image coding speed, the larger the value.

그러면, 예측부(130)는 비교 결과에 따라, 현재 영상 프레임의 CTU에 대한 코딩 유닛(CU) 깊이 범위를 이용하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위를 예측한다(S230, S240). The predictor 130 predicts the CU depth range for the CTU of the next image frame using the depth range of the coding unit (CU) for the CTU of the current image frame according to the comparison result (S230, S240).

우선, 예측부(130)는 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 제1 가중치를 곱한 값보다 크거나 같으면, 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 확장하여 다음 영상 프레임의 CTU에 대한 코딩 유닛 깊이 범위로 예측한다(S230). If the rate-distortion cost for the CTU of the current image frame is greater than or equal to the rate-distortion cost for the CTU of the previous image frame multiplied by the first weight, the predicting unit 130 estimates the CTU of the current image frame, The CU depth range is extended to predict the coding unit depth range for the CTU of the next image frame (S230).

그리고, 예측부(130)는 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 제2 가중치를 곱한 값보다 크거나 같으면, 현재 영상 프레임의 CTU에 대한 CU 깊이 범위의 최대값보다 1만큼 크고 최소값보다 1만큼 작게 확장하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위로 예측한다. 다만, CU 깊이 범위는 설정될 수 있는 최대 범위가 [0,4]로 제한되어 있으므로, 이를 초과하는 범위는 예측될 수 없다. If the rate-distortion cost for the CTU of the current image frame is greater than or equal to the rate-distortion cost for the CTU of the previous image frame multiplied by the second weight, the predicting unit 130 predicts the CTU of the current image frame Expansion is made by 1 larger than the maximum value of the CU depth range and by 1 smaller than the minimum value, and is predicted as the CU depth range for the CTU of the next image frame. However, since the maximum range in which the CU depth range can be set is limited to [0, 4], a range exceeding this range can not be predicted.

반면, 예측부(130)는 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 제1 가중치를 곱한 값보다 작으면, 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 다음 영상 프레임의 CTU에 대한 CU 깊이 범위로 예측한다(S240). On the other hand, if the rate-distortion cost for the CTU of the current image frame is smaller than the rate-distortion cost for the CTU of the previous image frame multiplied by the first weight, the predictor 130 determines that the CU The depth range is predicted by the CU depth range for the CTU of the next image frame (S240).

구체적으로, 도 3의 S230 단계 및 S240 단계는 아래의 수학식 2와 같이 표현될 수 있다. Specifically, steps S230 and S240 of FIG. 3 may be expressed as Equation 2 below.

여기서,

는 다음 영상 프레임(n+1)의 k번째 CTU에 대한 CU 깊이 범위를 의미하고,

는 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위를 의미한다.

는 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위의 최소값을 의미하고,

는 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위의 최대값을 의미한다.

는 현재 영상 프레임(n)의 k번째 CTU에 대한 율-왜곡 비용을 의미하고,

는 이전 영상 프레임(n-1)의 k번째 CTU에 대한 율-왜곡 비용을 의미한다.

은 제1 가중치를 의미하고,

는 제2 가중치를 의미한다. here,

Denotes the CU depth range for the kth CTU of the next image frame (n + 1)

Means the CU depth range for the kth CTU of the current video frame (n).

Denotes the minimum value of the CU depth range for the kth CTU of the current image frame n,

Means the maximum value of the CU depth range for the kth CTU of the current image frame (n).

Denotes the rate-distortion cost for the kth CTU of the current video frame n,

Denotes the rate-distortion cost for the kth CTU of the previous video frame (n-1).

Quot; means a first weight,

Quot; means a second weight.

예를 들어, 이전 영상 프레임(n-1)의 k번째 CTU에 대한 깊이 범위가 [2,3]이라고 가정한다. For example, assume that the depth range for the kth CTU of the previous image frame (n-1) is [2, 3].

이때, S220의 비교 결과, 현재 영상 프레임(n)의 k번째 CTU에 대한 율-왜곡 비용(

)이 이전 영상 프레임(n-1)의 k번째 CTU에 대한 율-왜곡 비용(

)에 제1 가중치(

)를 곱한 값보다 작으면(

), 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위인 [2,3]을 다음 영상 프레임의 k번째 CTU에 대한 CU 깊이 범위로 예측한다. At this time, as a result of the comparison at S220, the rate-distortion cost (n) for the kth CTU of the current image frame n

) Is the rate-distortion cost for the kth CTU of the previous video frame (n-1)

) To the first weight (

) Is smaller than the value obtained by multiplying

) Predicts the CU depth range [2,3] for the kth CTU of the current video frame (n) to the CU depth range for the kth CTU of the next video frame.

반면, S220의 비교 결과, 현재 영상 프레임(n)의 k번째 CTU에 대한 율-왜곡 비용(

)이 이전 영상 프레임(n-1)의 k번째 CTU에 대한 율-왜곡 비용(

)에 제1 가중치(

)를 곱한 값보다 크거나 같고 제2 가중치를 곱합 값보다 작으면(

), 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위인 [2,3]에서 최대값만 확장한 [2,4] 또는 최소값만 확장한 [1,3]을 다음 영상 프레임(n+1)의 k번째 CTU에 대한 CU 깊이 범위로 예측한다. On the other hand, as a result of the comparison of S220, the rate-distortion cost (n) for the kth CTU of the current image frame n

) Is the rate-distortion cost for the kth CTU of the previous video frame (n-1)

) To the first weight (

) And the second weight is less than or equal to the product of the product of (

), [2,4] or [1,3], which only extends the maximum value in [2,3], which is the CU depth range for the kth CTU of the current image frame (n) +1) with the CU depth range for the kth CTU.

그리고, 현재 영상 프레임(n)의 k번째 CTU에 대한 율-왜곡 비용(

)이 이전 영상 프레임(n-1)의 k번째 CTU에 대한 율-왜곡 비용(

)에 제2 가중치(

)를 곱한 값보다 크거나 같으면(

), 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위인 [2,3]에서 최대값 및 최소값을 모두 확장한 [1,4]를 다음 영상 프레임(n+1)의 k번째 CTU에 대한 CU 깊이 범위로 예측한다. Then, the rate-distortion cost (n) for the kth CTU of the current video frame n

) Is the rate-distortion cost for the kth CTU of the previous video frame (n-1)

) To the second weight (

) Is greater than or equal to the product of

), [1,4], which extends both the maximum value and the minimum value in the range [2, 3], which is the CU depth range for the kth CTU of the current image frame (n) CU < / RTI >

다른 예로, 이전 영상 프레임(n-1)의 k번째 CTU에 대한 깊이 범위가 [1,4]라고 가정한다.As another example, it is assumed that the depth range for the kth CTU of the previous image frame (n-1) is [1,4].

)이 이전 영상 프레임(n-1)의 k번째 CTU에 대한 율-왜곡 비용(

)에 기 설정된 가중치(

)를 곱한 값보다 작으면(

), 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위인 [1,4]를 다음 영상 프레임의 k번째 CTU에 대한 CU 깊이 범위로 예측한다. At this time, as a result of the comparison at S220, the rate-distortion cost (n) for the kth CTU of the current image frame n

) Is the rate-distortion cost for the kth CTU of the previous video frame (n-1)

) To a predetermined weight (

) Is smaller than the value obtained by multiplying

) Predicts the CU depth range [1,4] for the kth CTU of the current image frame (n) to the CU depth range for the kth CTU of the next image frame.

)이 이전 영상 프레임(n-1)의 k번째 CTU에 대한 율-왜곡 비용(

)에 제1 가중치(

), 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위인 [1,4]에서 최대값만 확장한 [1,4] 또는 최소값만 확장한 [0,4]을 다음 영상 프레임(n+1)의 k번째 CTU에 대한 CU 깊이 범위로 예측한다. On the other hand, as a result of the comparison of S220, the rate-distortion cost (n) for the kth CTU of the current image frame n

) Is the rate-distortion cost for the kth CTU of the previous video frame (n-1)

) To the first weight (

), [1,4] or [0,4], which is the maximum value expanded only in the CU depth range [1,4] for the kth CTU of the current image frame (n) +1) with the CU depth range for the kth CTU.

)이 이전 영상 프레임(n-1)의 k번째 CTU에 대한 율-왜곡 비용(

)에 제2 가중치(

)를 곱한 값보다 크거나 같으면(

), 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위인 [1,4]에서 최대값 및 최소값을 모두 확장한 [0,4]를 다음 영상 프레임(n+1)의 k번째 CTU에 대한 CU 깊이 범위로 예측한다. Then, the rate-distortion cost (n) for the kth CTU of the current video frame n

) Is the rate-distortion cost for the kth CTU of the previous video frame (n-1)

) To the second weight (

) Is greater than or equal to the product of

), [0, 4], which is the sum of the maximum value and the minimum value in the CU depth range [1,4] for the kth CTU of the current image frame n, to the kth CTU of the next image frame n + CU < / RTI >

여기서, 최대값만 확장한 범위가 [1,4]가 되고 최대값 및 최소값을 모두 확장한 범위가 [0,4]가 되는 것은 CU 깊이의 최대 범위가 [0,4]로 제한되기 때문이다. 즉, 현재 영상 프레임(n)의 k번째 CTU에 대한 CU 깊이 범위의 최대값인 4에서 1을 하면 5가 되지만, CU 깊이의 최대 범위인 4를 초과할 수 없으므로, 최대값만 확장한 범위는 [1,4]가 되고 최대값 및 최소값을 모두 확장한 범위는 [0,4]가 된다. Here, the range in which the maximum value is extended is [1, 4] and the range in which both the maximum value and the minimum value are extended is [0, 4] because the maximum range of the CU depth is limited to [0, 4] . That is, if the maximum value of the CU depth range for the kth CTU of the current image frame (n), which is the maximum value of 4, 1 is set to 5 but the maximum range of the CU depth can not exceed 4, [1,4], and the range in which both the maximum value and the minimum value are expanded is [0,4].

한편, 예측부(130)는 기 저장된 재설정 간격에 따라 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 설정한다. Meanwhile, the predicting unit 130 sets the CU depth range for the CTU of the current image frame to the maximum according to the pre-stored reset interval.

예를 들어, 현재 영상 프레임인 60번째의 CTU에 대한 CU 분할에 따른 CU 최적 구조의 CU 깊이 범위가 [2,3]이며, 영상의 프레임 율이 30Hz라고 가정한다. 그리고, 영상의 프레임 율(frame rate)과 동일하게 재설정 간격이 설정되었다고 가정한다. For example, it is assumed that the CU depth range of the CU optimum structure according to the CU division for the 60th CTU, which is the current image frame, is [2,3], and the frame rate of the image is 30Hz. It is assumed that a reset interval is set equal to the frame rate of the image.

이 경우, 현재 영상 프레임의 프레임 넘버가 영상 프레임 율인 30Hz의 2배이므로, 현재 영상 프레임의 CU 깊이 범위를 [2,3]이 아닌 최대 CU 깊이 범위인 [0,4]로 설정하여 61번째 영상 프레임의 CTU 깊이 범위를 예측한다. 즉, 현재 영상 프레임의 프레임 넘버가 영상의 프레임 율의 정수배가 되면, 현재 영상 프레임의 CU 깊이 범위를 최대 CU 깊이 범위인 [0,4]로 재설정하여 다음 영상 프레임의 CTU 깊이 범위를 예측한다. In this case, since the frame number of the current image frame is twice the image frame rate of 30 Hz, the CU depth range of the current image frame is set to [0, 4] which is the maximum CU depth range, Estimate the CTU depth range of the frame. That is, when the frame number of the current image frame is an integral multiple of the frame rate of the image, the CTU depth range of the next image frame is predicted by resetting the CU depth range of the current image frame to the maximum CU depth range [0, 4].

다른 한편으로, 예측부(130)는 이전 영상 프레임과 현재 영상 프레임의 픽셀값을 이용하여 장면 전환 여부를 판단하며, 장면 전환이 되지 않았다고 판단되면 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 재설정하지 않고, 장면 전환이 되었다고 판단되면 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 재설정한다. On the other hand, the predicting unit 130 determines whether or not the scene change is performed using the pixel values of the previous image frame and the current image frame. If it is determined that the scene change has not been performed, the CU depth range for the CTU of the current image frame is maximized If it is determined that the scene change has been made without resetting, the CU depth range for the CTU of the current image frame is reset to the maximum.

예를 들어, 이전 영상 프레임의 픽셀 평균값과 현재 영상 프레임의 픽셀 평균값의 차이가 임계값보다 크면, 예측부(130)는 프레임간 장면 전환이 발생한 것으로 판단하여 현재 영상 프레임의 CTU에 대한 CU 깊이 범위를 최대로 재설정한다. For example, if the difference between the pixel average value of the previous image frame and the pixel average value of the current image frame is greater than the threshold value, the predictor 130 determines that the inter-frame scene change has occurred, To the maximum.

다음으로, 분할부(140)는 예측된 CU 깊이 범위를 이용하여 다음 영상 프레임의 CTU에 대한 CU 분할을 수행하여 다음 영상 프레임의 CU 최적 구조를 생성한다(S240). Next, the dividing unit 140 generates a CU optimal structure of the next image frame by performing CU division on the CTU of the next image frame using the predicted range of the CU depth (S240).

도 4는 도 3의 S250 단계를 상세하게 설명하기 위한 도면이다. FIG. 4 is a diagram for explaining step S250 of FIG. 3 in detail.

예를 들어, S230 또는 S240 단계를 통해 다음 영상 프레임에 대한 CU 깊이 범위가 [1,2]로 예측되었다고 가정한다. 그러면, 분할부(140)는 도 4의 (a), (d), (e)에 대한 CU 분할은 고려하지 않는다. 즉, 분할부(140)는 도 4의 (b) 및 (c)에 따른 CU 분할만을 고려하여 총 16개의 CU 분할 구조에 대한 RDOQ(rate distortion optimized quantization)를 수행하여, 최적의 CU 구조를 생성한다. For example, it is assumed that the CU depth range for the next image frame is predicted as [1,2] through S230 or S240. Then, the partitioning unit 140 does not consider CU partitioning for (a), (d), and (e) in FIG. That is, the partitioning unit 140 performs rate distortion optimized quantization (RDOQ) on a total of 16 CU division structures considering only the CU division according to (b) and (c) do.

도 5는 본 발명의 실시예에 따른 하나의 영상 프레임에 대한 CU 분할 구조를 나타낸 도면이다. 5 is a diagram illustrating a CU division structure for one image frame according to an embodiment of the present invention.

도 5의 (a)는 하나의 영상 프레임 전체가 최적의 구조로 CU 분할된 것을 나타내며, 도 5의 (b)는 최적의 구조로 CU 분할된 일부 CTU들을 나타낸다. FIG. 5A shows that an entire image frame is divided into CUs by an optimum structure, and FIG. 5B shows some CTUs divided into CUs by an optimal structure.

도 5에 나타난 바와 같이, 본 발명의 실시예에 따른 HEVC 부호화 장치(100)는 S210 내지 S250 단계를 반복하여 각 영상 프레임에 대한 CU 최적 구조를 생성한다. As shown in FIG. 5, the HEVC encoding apparatus 100 according to the embodiment of the present invention repeats steps S210 through S250 to generate a CU optimal structure for each image frame.

도 6은 하나의 영상 프레임에 포함된 CTU간 율-왜곡 비용을 나타낸 그래프이고, 도 7은 영상 내 동일 영역에 위치하는 CTU간 율-왜곡 비용을 나타낸 그래프이다.FIG. 6 is a graph showing the CTU-distortion cost included in one image frame, and FIG. 7 is a graph illustrating the CTU-distortion cost located in the same area in the image.

도 6에 나타난 바와 같이, 하나의 프레임에 포함된 CTU들의 율-왜곡 비용은 특정한 패턴이 없이, 넓고 다양한 범위의 값을 가진다는 것을 알 수 있다. 따라서, 프레임내 인접한 CTU간의 율-왜곡 비용을 이용하여 고속 부호화 수행 시 압축 효율이 크게 개선되지 않을 수 있다. As shown in FIG. 6, it can be seen that the rate-distortion cost of the CTUs contained in one frame has a wide range of values, without any specific pattern. Therefore, the compression efficiency may not be significantly improved when fast coding is performed using the rate-distortion cost between adjacent CTUs in a frame.

반면, 도 7에 나타난 바와 같이, 하나의 영상에서 각 프레임별 동일 영역에 위치하는 CTU들의 율-왜곡 비용은 매우 강한 상관관계를 가짐을 알 수 있다. On the other hand, as shown in FIG. 7, the rate-distortion cost of the CTUs located in the same area of each frame in one image has a very strong correlation.

도 7에서 가로축은 현재 영상 프레임의 CTU에 대한 율-왜곡 비용과 이전 영상 프레임의 CTU에 대한 율-왜곡 비용의 비율을 의미하는데, 인접한 영상 프레임간 동일한 넘버의 CTU에 대한 율-왜곡 비용의 비율은 평균값인 1을 중심으로 밀집 분포되는 형태를 보여준다. In FIG. 7, the abscissa represents the ratio of the rate-distortion cost to the CTU of the current image frame and the rate-distortion cost to the CTU of the previous image frame. The ratio of the rate-distortion cost to the CTU of the same number between adjacent image frames Is a density distribution centered around the mean value of 1.

이를 통해, 각 영상 프레임의 동일 영역에 위치하는 CTU들에 대한 율-왜곡 비용을 이용하여 HEVC 고속 부호화를 수행하는 것이 영상 효율을 크게 개선시킨다는 것을 알 수 있다. Thus, it can be seen that performing the HEVC fast encoding using the rate-distortion cost for the CTUs located in the same area of each image frame greatly improves the image efficiency.

표 1은 본 발명의 실시예에 따른 HEVC 부호화 장치의 시뮬레이션 결과이다. Table 1 shows the simulation results of the HEVC encoding apparatus according to the embodiment of the present invention.

표 1은 HM14 참조 코드를 이용한 종래 부호화 장치에 대비한 본 발명의 실시예에 따른 HEVC 부호화 장치의 성능을 나타낸다. 표 1에서는, 현재 영상 프레임의 CTU에 대한 율-왜곡 비용이 이전 영상 프레임의 CTU에 대한 율-왜곡 비용에 제1 가중치를 곱한 값보다 크거나 같으면, 현재 영상 프레임의 CTU에 대한 CU 깊이 범위의 최대값보다 1만큼 크고 최소값보다 1만큼 작게 확장하여 다음 영상 프레임의 CTU에 대한 CU 깊이 범위로 예측하였다. Class A는 4K 화질의 영상, Class B는 1080p 화질의 영상, Class C는 WVGA 화질의 영상, Class D는 WQVGA 화질의 영상, Class E는 780p 화질의 영상이다. 표 1에서 BDR은 압축 효율을 의미하고, ATS는 시간 감축율을 의미한다. Table 1 shows performance of the HEVC encoding apparatus according to the embodiment of the present invention in comparison with the conventional encoding apparatus using the HM14 reference code. In Table 1, if the rate-distortion cost for the CTU of the current video frame is greater than or equal to the rate-distortion cost for the CTU of the previous video frame times the first weight, the CU depth range for the CTU of the current video frame The CU depth range for the CTU of the next image frame is estimated by expanding by 1 larger than the maximum value and 1 smaller than the minimum value. Class A is a 4K image, Class B is 1080p, Class C is a WVGA image, Class D is a WQVGA image, and Class E is a 780p image. In Table 1, BDR means compression efficiency and ATS means time reduction rate.

표 1에 나타난 바와 같이, 제1 가중치(ρ)를 1.02로 설정한 본 발명의 실시예에 따른 HEVC 부호화 장치(DP_ρ) 및 확률 p에 따라 탐색 범위를 무작위로 확장한 본 발명의 실시예에 따른 HEVC 부호화 장치(DP_random _(p))의 경우 모두 기존의 부호화 장치에 비해 압축 효율이 거의 동일한 반면, 시간 감축율은 상당히 높아졌음을 알 수 있다. As shown in Table 1, in the embodiment of the present invention in which the search range is randomly extended according to the HEVC encoding apparatus DP _? And the probability p according to the embodiment of the present invention in which the first weight? It can be seen that the compression efficiency of the HEVC encoder (DP _random _(p) ) according to the present invention is substantially the same as that of the conventional encoder, while the time reduction rate is significantly increased.

본 발명의 실시예에 따르면, 예측된 CU 깊이 범위에 포함되지 않은 CU 분할에 대한 RDOQ를 수행하지 않으므로, HEVC 부호화를 고속으로 수행할 수 있다. 뿐만 아니라, 상호관련성이 높은 CTU의 율-왜곡 비용 정보를 이용하여 CU 깊이 범위를 예측하므로 고속 압축에도 불구하고 영상 압축시 성능의 열화가 거의 발생하지 않는 장점이 있다. According to the embodiment of the present invention, since the RDOQ for the CU division not included in the predicted CU depth range is not performed, HEVC coding can be performed at a high speed. In addition, since the CU depth range is predicted by using the rate-distortion cost information of the highly correlated CTU, there is an advantage that the degradation of the performance in the image compression hardly occurs despite the high compression.

본 발명은 도면에 도시된 실시예를 참고로 설명되었으나 이는 예시적인 것에 불과하며, 본 기술 분야의 통상의 지식을 가진 자라면 이로부터 다양한 변형 및 균등한 다른 실시예가 가능하다는 점을 이해할 것이다. 따라서, 본 발명의 진정한 기술적 보호 범위는 첨부된 특허청구범위의 기술적 사상에 의하여 정해져야 할 것이다. While the present invention has been described with reference to exemplary embodiments, it is to be understood that the invention is not limited to the disclosed exemplary embodiments, but, on the contrary, is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims. Accordingly, the true scope of the present invention should be determined by the technical idea of the appended claims.

100 : HEVC 부호화 장치 110 : 연산부
120 : 비교부 130 : 예측부
140 : 분할부 100: HEVC encoding device 110:
120: comparison unit 130: prediction unit
140: minute installment

Claims

A method for estimating a coding unit depth range using an HEVC coding apparatus,
Calculating a rate-distortion cost for a coding tree unit (CTU) of a current video frame,
Comparing the rate-distortion cost for the CTU of the previous image frame by a predetermined weight and the rate-distortion cost for the CTU of the current image frame, and
And predicting a CU depth range for a CTU of a next image frame using a depth range of a coding unit (CU) for the CTU of the current image frame according to the comparison result,
Wherein the step of predicting the CU depth range for the CTU of the next image frame comprises:
Distortion rate for the CTU of the current image frame is smaller than a value obtained by multiplying the rate-distortion cost for the CTU of the previous image frame by a predetermined first weight, the CU depth range for the CTU of the current image frame is The CU depth range for the CTU of the video frame is predicted,
Wherein the rate-distortion cost for the CTU of the current video frame is greater than or equal to the rate-distortion cost for the CTU of the previous video frame multiplied by the first weight and the rate- The CU depth for the CTU of the next image frame is increased by one or more than the maximum value of the CU depth range for the CTU of the current image frame, Range,
Wherein the rate-distortion cost for the CTU of the current video frame is greater than or equal to the rate-distortion cost for the CTU of the previous video frame times the second weight, Value and a value smaller than the minimum value by 1 to predict the CU depth range for the CTU of the next image frame,
The step of calculating a rate-distortion cost for a coding tree unit (CTU) of the current video frame comprises:
Calculating a rate-distortion cost for the CTU of the current image frame in a state in which the CU optimal structure for the CTU of the current image frame is formed,
A coding unit depth range prediction method for calculating the rate-distortion cost (J _SATD ) using the following equation:

Here, SATD denotes a Hadamard transform distortion value between an original pixel value and a reconstructed pixel value for a CTU in which a CU optimal structure is formed, λ _pred denotes a Lagrangian multiplier, and B _pred denotes a CTU The number of bits required for encoding the bitstream.

The method according to claim 1,
Wherein the CTU of the current image frame, the CTU of the previous image frame, and the CTU of the next image frame are located in the same area in the image.

delete

The method according to claim 1,
Wherein the step of predicting the CU depth range for the CTU of the next image frame comprises:
Wherein the CU depth range for the CTU of the current image frame is reset to the maximum according to the pre-stored reset interval.

The method according to claim 1,
Wherein the step of predicting the CU depth range for the CTU of the next image frame comprises:
Determining whether a scene change is to be made using pixel values of a previous image frame and a current image frame, and
If it is determined that the scene change has not been made, the CU depth range for the CTU of the current image frame is reset to the maximum without resetting the CU depth range for the CTU of the current image frame to the maximum, / RTI > wherein the coding unit depth range prediction method comprises the steps of:

The method according to claim 1,
The predetermined weight may be expressed as:
Has one of the values of 0 to infinity,
Wherein the coding unit is set to a smaller value as the image coding compression efficiency is higher and is set to a larger value as the image coding rate is higher.

The method according to claim 1,
Generating a CU optimal structure for the CTU of the next image frame by performing CU division on the CTU of the next image frame using the predicted CU depth range.

delete

An operation unit for calculating a rate-distortion cost for a coding tree unit (CTU) of a current image frame,
A comparing unit for comparing the rate-distortion cost for the CTU of the previous image frame by a predetermined weight and the rate-distortion cost for the CTU of the current image frame, and
And a predictor for predicting a CU depth range for a CTU of a next image frame using a depth range of a coding unit (CU) for the CTU of the current image frame,
The predicting unit,
Distortion rate for the CTU of the current image frame is smaller than a value obtained by multiplying the rate-distortion cost for the CTU of the previous image frame by a predetermined first weight, the CU depth range for the CTU of the current image frame is The CU depth range for the CTU of the video frame is predicted,
Wherein the rate-distortion cost for the CTU of the current video frame is greater than or equal to the rate-distortion cost for the CTU of the previous video frame multiplied by the first weight and the rate- The CU depth for the CTU of the next image frame is increased by one or more than the maximum value of the CU depth range for the CTU of the current image frame, Range,
Wherein the rate-distortion cost for the CTU of the current video frame is greater than or equal to the rate-distortion cost for the CTU of the previous video frame times the second weight, The CU depth range for the CTU of the next image frame is predicted by expanding by 1 larger than the largest value and smaller than the minimum value by 1,
The operation unit,
Calculating a rate-distortion cost for the CTU of the current image frame in a state in which the CU optimal structure for the CTU of the current image frame is formed,
An HEVC encoder for calculating the rate-distortion cost (J _{SA TD} ) using the following equation:

Here, SATD denotes a Hadamard transform distortion value between an original pixel value and a reconstructed pixel value for a CTU in which a CU optimum structure is formed _{,? Pred} denotes a Lagrangian multiplier, and _Bred denotes a CTU optimum value Means the number of bits necessary for encoding.

11. The method of claim 10,
Wherein the CTU of the current image frame, the CTU of the previous image frame, and the CTU of the next image frame are located in the same area in the image.

delete

11. The method of claim 10,
The predicting unit,
Wherein the CU depth range for the CTU of the current image frame is reset to a maximum according to the pre-stored reset interval.

11. The method of claim 10,
The predicting unit,
A scene change is determined using pixel values of a previous image frame and a current image frame,
If it is determined that the scene change has not been made, the CU depth range for the CTU of the current image frame is not reset to the maximum, Encoding apparatus.

11. The method of claim 10,
The predetermined weight may be expressed as:
Has one of the values of 0 to infinity,
Wherein the HEVC coding unit is set to a smaller value as the image coding compression efficiency is higher and is set to a larger value as the image coding rate is higher.

11. The method of claim 10,
And a division unit for performing a CU division on the CTU of the next image frame using the predicted range of the CU depth to generate a CU optimal structure for the CTU of the next image frame.

delete