KR20060043435A

KR20060043435A - Integer transform matrix selection method in video coding and related integer transform method

Info

Publication number: KR20060043435A
Application number: KR1020050018437A
Authority: KR
Inventors: 꽝찌 쮸; 웬유 리우; 찌아오후아 티안; 야오 왕; 리 유
Original assignee: 삼성전자주식회사; 화중과기대
Priority date: 2004-03-18
Filing date: 2005-03-05
Publication date: 2006-05-15
Also published as: CN1564602A; KR100636225B1; CN100433837C

Abstract

본 발명은 이미지 처리 기술과 관련되며, 더욱 상세하게는 비디오 코덱에서 이미지 데이터 압축의 정수 변환과 관련된다. 8×8 DCT 정수 변환이 채택된 중국 제1 오디오 및 비디오 코딩 표준(AVS)에 따르면, 감상관 표율 및 에너지 집중화 효율로 변환 베이스의 품질을 평가하기 위하여 사용된 정수 변환 베이스 선택 방법이 제안된다. 계산 복잡성도 선택 절차에서 고려된다. 이러한 방법에 기초하여, 2그룹의 8×8 변환 베이스 (5, 6, 4, 1) 및 (4, 5, 3, 1)이 선택되며, 이러한 2 그룹을 위한 고속 변환 알고리즘이 제공된다.The present invention relates to image processing techniques, and more particularly to integer conversion of image data compression in a video codec. According to the Chinese First Audio and Video Coding Standard (AVS), which adopts an 8x8 DCT integer transform, an integer transform base selection method used for evaluating the quality of the transform base with audience rating and energy centralization efficiency is proposed. Computational complexity is also taken into account in the selection procedure. Based on this method, two groups of 8x8 transform bases (5, 6, 4, 1) and (4, 5, 3, 1) are selected, and a fast transform algorithm for these two groups is provided.

Description

Integer transform matrix selection method in video coding and related integer transform method

도 1은 변환 베이스 평가 절차의 흐름도이다.1 is a flowchart of a transform base evaluation procedure.

도 2는 (5, 6, 4, 1)의 변환 베이스를 갖는 변환의 고속 변환 알고리즘을 도시한다.2 shows a fast transform algorithm of a transform having a transform base of (5, 6, 4, 1).

도 3은 (5, 6, 4, 1)의 변환 베이스를 갖는 역변환의 고속 변환 알고리즘을 도시한다.3 shows a fast transform algorithm of inverse transform with a transform base of (5, 6, 4, 1).

도 4는 (4, 5, 3, 1)의 변환 베이스를 갖는 고속 변환 알고리즘을 도시한다.4 shows a fast transform algorithm with a transform base of (4, 5, 3, 1).

도 5는 (4, 5, 3, 1)의 변환 베이스를 갖는 역변환의 고속 변환 알고리즘을 도시한다.5 shows a fast transform algorithm of inverse transform with a transform base of (4, 5, 3, 1).

본 발명은 이미지 처리 기술에 관련되며, 더욱 상세하게는 비디오 코덱에서 이미지 데이터 압축의 정수 변환에 관련된다. 본 발명은 정수 변환의 변환 베이스(변환 행렬)를 선택하기 위한 방법 및 상기 변환 베이스의 선택에 기초하여 블록 변 환을 구현하기 위한 방법을 포함한다.The present invention relates to image processing techniques, and more particularly to integer conversion of image data compression in a video codec. The present invention includes a method for selecting a transform base (transform matrix) of an integer transform and a method for implementing block transform based on the selection of the transform base.

H.264 및 MPEG-4와 같은, 현재 국제 비디오 코딩 표준에서, 비디오 신호는 시퀀스, 프레임, 슬라이스, 매크로 블록 및 블록으로 계층적으로 분할 되며, 블록은 최소 처리 유닛이 된다. 인코딩 측면에서, 인트라-프레임 또는 인터-프레임 예측을 통하여, 블록의 예측 잔류 오류가 획득되며, 블록 변환은 에너지가 소수의 계수에 집중될 수 있도록 실행되며; 그리고 나서 양자화, 스캐닝, 런 렝스 코딩(Run Length Coding) 및 엔트로피 코딩을 통하여, 이미지 데이터는 압축되며, 코딩된 비트 스트림으로 기록된다. 디코딩 측면에서, 처리 절차는 반대가 된다. 우선, 엔트로피 코딩의 블록 변환 계수가 비트 스트림으로부터 추출된다. 그리고 나서, 역양자화 및 역변환을 통하여, 블록의 예측 잔류 오류는 재구성되며, 예측 정보는 블록의 비디오 데이터를 재구성하기 위하여 사용된다. 인코딩-디코딩 처리절차에서, 변환 모듈은 비디오 압축의 기초이며, 변환 성능은 코덱의 일반적인 성능에 직접적으로 영향을 준다.In current international video coding standards, such as H.264 and MPEG-4, video signals are hierarchically divided into sequences, frames, slices, macro blocks, and blocks, with blocks becoming the minimum processing unit. In terms of encoding, through intra-frame or inter-frame prediction, the prediction residual error of the block is obtained, and the block transform is performed so that energy can be concentrated on a few coefficients; Then, through quantization, scanning, run length coding and entropy coding, the image data is compressed and recorded in the coded bit stream. In terms of decoding, the processing procedure is reversed. First, the block transform coefficients of entropy coding are extracted from the bit stream. Then, through inverse quantization and inverse transformation, the prediction residual error of the block is reconstructed, and the prediction information is used to reconstruct the video data of the block. In the encoding-decoding process, the transform module is the basis of video compression, and the transform performance directly affects the general performance of the codec.

이산 코사인 변환(DCT)은 MPEG-1 및 H.261과 같은 초기 비디오 코딩 표준에서 채택되었다. 1974년 이산 코사인 변환의 제안 이후에, DCT는 이미지 및 비디오 코딩 분야에서 널리 사용되었다. 변환 도메인 내의 이미지 요소의 상관성을 제거하며, 고효율 이미지 압축을 위한 기반을 마련하기 때문에, 그것의 변환 성능은 모든 차선 변환(sub-optimal transform)중 우수하다. 그러나, DCT 변환 행렬은 부동 소수점 수로 표현되기 때문에, 대량의 부동 소수점 계산으로 인하여 많은 시스템 자원이 소모된다. 변환 효율을 개선하기 위하여, 고정 소수점 계산 또는 대형 정수 변환을 사용한 접근법은 부동 소수점 계산 DCT를 종료하기 위하여 개발되었다. 그러나, 양자화가 없을때 조차, 정밀한 오류의 등장으로 인하여, 역변환 후에 이미지 데이터는 완전히 재구성될 수 없다. 즉, 코딩의 역변환성은 충분치 않았다. 정수 변환은 계산 정확성 및 코딩 효율에 대한 문제점을 해결한다. 정수 변환의 특성은 이하를 포함한다: DCT의 부동 소수점 변환 행렬은 정수 변환 행렬로 대체되어, 정수 연산(interger operation)은 전체 변환 과정에서 실행되며, 정밀한 오류가 존재하지 않으며, 그러므로 코딩의 역변환성이 보장된다. 더욱이, 정수의 곱셈은 덧셈/뺄셈 및 시프팅(shifting) 연산으로 대체될 수 있다. 그러므로, 변환 과정은 덧셈/뺄셈 및 시프팅(shifting) 연산에 의해서 완전히 구현될 수 있으므로, 계산량은 매우 감소된다. 정수 변환은 최근 국제 비디오 코딩 표준 H.264/MPEG-4 파트 10에서 사용되며, 우수한 변환 결과가 획득되었다. 최근, 정수 변환에 행해진 연구는 이미지 및 비디오 처리 분야에서 상당하다. 정수 변환으로 다른 나라에서 획득한 특허는 다음과 같다:Discrete cosine transform (DCT) has been adopted in early video coding standards such as MPEG-1 and H.261. After the proposal of the discrete cosine transform in 1974, DCT was widely used in the field of image and video coding. Its transform performance is superior among all sub-optimal transforms because it removes the correlation of image elements in the transform domain and lays the foundation for high efficiency image compression. However, since the DCT transformation matrix is represented by a floating point number, a large amount of system resources are consumed due to the large amount of floating point calculations. In order to improve conversion efficiency, an approach using fixed-point calculations or large integer conversions has been developed to terminate floating-point calculation DCT. However, even in the absence of quantization, due to the appearance of precise error, the image data cannot be completely reconstructed after the inverse transformation. In other words, the inverse of the coding was not sufficient. Integer conversion solves the problem of computational accuracy and coding efficiency. The properties of integer conversion include: DCT's floating point conversion matrix is replaced with integer conversion matrix so that integer operation is performed in the whole conversion process, there is no precise error, therefore inverse transformability of coding This is guaranteed. Moreover, multiplication of integers can be replaced by addition / subtraction and shifting operations. Therefore, the conversion process can be fully implemented by addition / subtraction and shifting operations, so that the amount of calculation is greatly reduced. Integer conversion is recently used in international video coding standard H.264 / MPEG-4 Part 10, and excellent conversion results have been obtained. Recently, research conducted on integer conversion is considerable in the field of image and video processing. Patents obtained in other countries by integer conversion are:

1. U.S. 특허 제5,999,957호 "디지털 신호를 위한 손실없는 변환 시스템".1. U.S. Patent 5,999,957 "Lossless Conversion System for Digital Signals".

상기 특허에 따르면, 고정값은 DCT 변환 행렬의 각 행마다 곱해지며, 각 곱셈의 결과는 라운드되며, 변환 행렬의 계수는 역변환가능한 변환을 구현하기 위하여 정수로 전환된다. 그러나, 변환 직교성의 고려없이 변환 행렬의 이러한 유도는 정수 변환이 직교라는 것을 보장할 수 없다. 그러므로, 변환 효율은 영향을 받는다. 더욱이, 계산은 양자화 과정에서 수행된 복수의 곱셈/나눗셈으로 복잡해 진다. 더욱이, 고속 변환 알고리즘에서 복수의 곱셈은 변환 효율에 영향을 준다.According to the patent, the fixed value is multiplied for each row of the DCT transformation matrix, the result of each multiplication is rounded, and the coefficients of the transformation matrix are converted to integers to implement an invertible transformation. However, this derivation of the transformation matrix without consideration of the transformation orthogonality cannot guarantee that the integer transformation is orthogonal. Therefore, conversion efficiency is affected. Moreover, the calculation is complicated by a plurality of multiplications / divisions performed in the quantization process. Moreover, multiple multiplications in the fast conversion algorithm affect the conversion efficiency.

2. WO01/08001A1 "정수 연산을 이용한 정수 코사인 변환".2. WO01 / 08001A1 "Integer Cosine Transformation Using Integer Operations".

3. U.S. 특허 제20020111979A1호 "화상 코딩을 위한 정수 변환 행렬". 정수 변환 행렬의 변환 효율을 평가하기 위한 방법은 DCT와의 유사성 비교를 통하여 주로 제공된다. 상기 방법은 변환의 직교성을 보장한다. 상기 특허에 따르면, 이론적으로 가장 최적의 행렬은 4×4, 8×8 및 16×16의 3가지 조건하에서 제안되었다. 그러나, 변환 성능상의 계산 복잡성의 효과는 고려되지 않았다. 더욱이, 행렬의 각 라인 또는 행의 동일한 벡터 기준(norm)을 보장하기 위하여, 선택된 변환 행렬은 변환 효율에서 DCT에 가장 근접하지 않았다. 3. U.S. Patent No. 20020111979A1 "Integer Transformation Matrix for Image Coding". A method for evaluating the conversion efficiency of an integer conversion matrix is mainly provided through similarity comparison with DCT. The method ensures orthogonality of the transform. According to the patent, the most optimal matrix was theoretically proposed under three conditions of 4x4, 8x8 and 16x16. However, the effect of computational complexity on the conversion performance is not taken into account. Moreover, to ensure the same vector norm of each line or row of the matrix, the selected transform matrix was not closest to the DCT in the conversion efficiency.

4. U.S 특허 제2003/0093452A1호, "비디오 블록 변환". 직교 및 비직교에서 정수 변환 및 역변환의 행렬은 H.26L에 기초한 4×4 블록을 형성하며, 매크로블록 DC 계수의 변환 행렬 및 직교 변환에 상응하는 양자화된 단계 길이는 본 특허에서 제공된다. 상기 특허에 따른 변환 행렬의 크기는 본 발명의 크기와는 다르다. 더욱이, 상기 특허의 작은 크기의 변환 행렬은 HDTV와 같은 애플리케이션에 적절하지 않다. 4. U.S Patent 2003 / 0093452A1, “Video Block Conversion”. The matrix of integer and inverse transforms in orthogonal and non-orthogonal forms a 4x4 block based on H.26L, and the quantized step length corresponding to the transform matrix and orthogonal transform of macroblock DC coefficients is provided in this patent. The size of the transformation matrix according to the patent differs from that of the present invention. Moreover, the small size of the transformation matrix of the patent is not suitable for applications such as HDTV.

8×8 DCT는 이하의 식으로 표현될 수 있다.8 × 8 DCT can be expressed by the following equation.

(1)

(One)

여기서, C(0)=

, C(w)=1, (w=1,...,7)이다. 식은 Y=P₀XP₀ ^T와 같은 행렬의 형태로 표현되며, X는 8×8 픽셀 예측 잔류 오류 행렬이며, Y는 변환된 행렬이 다. Where C (0) =

, C (w) = 1, (w = 1, ..., 7). The equation is expressed in the form of a matrix such as Y = P ₀ XP ₀ ^T , where X is an 8 × 8 pixel prediction residual error matrix, and Y is a transformed matrix.

여기서,

here,

국제 표준 H.264에 의한 4×4 DCT 변환을 위한 수정 절차에 따르면, 8×8 변환은 다음과 같이 재기록될 수 있다: 공통 계수는 벡터 V8 = [a, m, f, m, a, m, f, m]를 획득하기 위하여 행렬의 각 행으로부터 추출되며, 여기서 m은 행렬 P₀의 짝수 넘버링된 행으로부터 추출된 공통 계수이며, k4 보다 크지 않은 양수값이다. 그리고 나서, 변환 행렬은 다음과 같이 재기록된다:According to the correction procedure for 4x4 DCT transformation according to international standard H.264, the 8x8 transformation can be rewritten as follows: the common coefficient is the vector V8 = [a, m, f, m, a, m , f, m] are extracted from each row of the matrix to obtain m, where m is a common coefficient extracted from the even numbered rows of the matrix P ₀ and is a positive value not greater than k4. Then, the transformation matrix is rewritten as follows:

, 여기서

.

, here

.

행렬 E8 = V₈ ^TV₈, 8×8행렬로 정의되며, 상기 변환은 다음과 같이 표현될 수 있다: Matrix E8 = V ₈ ^T V ₈ , defined by an 8x8 matrix, the transformation can be expressed as:

(2)

여기서,

는 벡터 곱(cross multiplication) 연산을 나타내며, 즉, 행렬의 상응하는 요소가 곱해진다. 식 (2)의 경우, 행렬 E₈의

연산은 변환을 간단화하기 위하여 양자화 연산과 함께 수행될 수 있다. 그러므로, 변환의 핵심은 P₁XP₁ ^T의 계산에 있으며, 여기서 X는 8×8 픽셀 예측 잔류 오류 행렬이며, 정수를 갖는다. 만약, P₁의 변수 k1, k2, k3, k4 및 k5가 정수라면, 전체 변환은 정수 연산으로 전환될 수 있다. 따라서, 남은 작업은 5개의 파라미터 k1, k2, k3, k4 및 k5의 선택을 결정하는 것이다. 본 발명에 따른 다수의 실험을 통하여, k1, k2, k3 및 k4가 선택된 후, k5의 값을 2로 설정할 때, 변환 성능이 최선임을 입증하였다. Cham은 '다이애딕 대칭의 원리에 의한 정수 코사인 변환의 개선'이라는 그의 논문(IEE Proceedings, 1989, 136(4): 276-288)에서 유사한 결론을 도출하였다. 그러므로, k5는 본 발명에서 고정값 2로 설정되며, 나머지 4개의 파라미터의 선택만 연구된다. (k1, k2, k3, k4)는 변환 베이스로서 정의된다. 상응하는 변환 행렬 P는 다음과 같다:here,

Denotes a cross multiplication operation, that is, the corresponding elements of the matrix are multiplied. For equation (2), the matrix E ₈

The operation can be performed in conjunction with a quantization operation to simplify the transformation. Therefore, the key to the transformation lies in the calculation of P ₁ XP ₁ ^T , where X is an 8x8 pixel prediction residual error matrix and has an integer. If the variables k1, k2, k3, k4, and k5 of P ₁ are integers, then the entire transformation can be converted to integer arithmetic. Thus, the remaining work is to determine the selection of the five parameters k1, k2, k3, k4 and k5. Through a number of experiments in accordance with the present invention, when k1, k2, k3 and k4 are selected, the conversion performance is best when the value of k5 is set to 2. Cham draws a similar conclusion in his paper, "Improvement of Integer Cosine Transformation by the Principle of Diadic Symmetry" (IEE Proceedings, 1989, 136 (4): 276-288). Therefore, k5 is set to a fixed value 2 in the present invention, and only the selection of the remaining four parameters is studied. (k1, k2, k3, k4) is defined as the transformation base. The corresponding transformation matrix P is

비디오 코딩시 정수 변환 행렬 선택 방법 및 관련 정수 변환 방법은 본 발명에서 제안된다. 8×8 정수 DCT 변환이 채택되는 중국의 제1 오디오 및 비디오 코딩 표준(AVS)을 고려하여, 정수 변환의 변환 베이스를 선택하기 위한 방법이 제안되며, 여기서, 감상관(de-correlation) 효율 및 변환 베이스의 에너지 집중화 효율, 변환 베이스의 다이내믹 변환 범위 및 계산 복잡성이 평가된다. 더욱이, 2개의 8×8 정수 변환 베이스 (5, 6, 4, 1) 및 (4, 5, 3, 1)은 이러한 방법에 따라 제안되며, 그리고 2개의 그룹 베이스에 기초한 고속 변환 알고리즘도 제안된다. An integer transform matrix selection method and related integer transform method in video coding are proposed in the present invention. In view of China's first Audio and Video Coding Standard (AVS), in which an 8x8 integer DCT transform is adopted, a method for selecting a transform base of integer transform is proposed, wherein de-correlation efficiency and The energy concentration efficiency of the transform base, the dynamic transform range of the transform base and the computational complexity are evaluated. Furthermore, two 8x8 integer transform bases (5, 6, 4, 1) and (4, 5, 3, 1) are proposed according to this method, and a fast transform algorithm based on two group bases is also proposed. .

변환 베이스의 선택은 이하의 원칙에 기초한다:The choice of transform base is based on the following principles:

원칙 1: 변환 직교성. 직교 변환은 변환이 단지 좌표 시스템의 회전이나, 이미지의 에너지는 변화하지 않는다는 것을 보장한다. 변환의 직교성을 보장하기 위하여, 식(2)에서 P는 다음의 조건을 만족해야 한다.Principle 1: Transform Orthogonality. Orthogonal transformations ensure that the transformation is only a rotation of the coordinate system, but the energy of the image does not change. In order to guarantee the orthogonality of the transform, P in Equation (2) must satisfy the following condition.

(3)

여기서, Diag는 대각 행렬이며, 즉 그것의 넌-리딩-대각(non-leading-diagonal) 요소는 제로이다. 그리고 나서, 양자화 절차는 양자화 행렬의 조절을 통하여 변환 직교성을 만족시킨다.Here Diag is a diagonal matrix, ie its non-leading-diagonal element is zero. The quantization procedure then satisfies the transform orthogonality through the adjustment of the quantization matrix.

원칙 2: 에너지 집중화. DCT 변환의 목적은 가능하면 적은 계수로 변환 후에 많은 에너지를 집중시키기 위하여 요소들중에 상관성을 제거하여, 양자화 후에 엔트로피 코딩의 압축 효율이 개선되도록 하는 것이다. 정수 변환 베이스의 선택도 이러한 원칙으로 행해진다.Principle 2: Energy Concentration. The purpose of the DCT transform is to eliminate correlation among the elements in order to concentrate as much energy after conversion with as few coefficients as possible, so that the compression efficiency of entropy coding after quantization is improved. The selection of the integer conversion base is also made on this principle.

원칙 3: 고속 변환 알고리즘의 간소화. 변환 베이스의 값은 매우 크지 않으며, 계산의 수는 가능하면 작은 것이 요구된다.Principle 3: Simplification of Fast Conversion Algorithms. The value of the transformation base is not very large, and the number of calculations should be as small as possible.

본 발명에 따른 비디오 코딩에서의 정수 변환 행렬 선택 방법은 연속적으로 특정 범위에서 직교 조건을 만족하는 모든 정수 변환 베이스를 검색하되, 8×8 변환 행렬 P를 위한 변환 베이스는 (k1, k2, k3, k4)로 정의되며, The integer transform matrix selection method in video coding according to the present invention continuously searches all integer transform bases satisfying orthogonal conditions in a specific range, but the transform bases for the 8 × 8 transform matrix P are (k1, k2, k3, k4),

변환 베이스 계수의 값 범위는 k1, k2, k3∈[1, 10]이며, k4∈[1, 4]이며, P·P ^T =Diag 를 만족하는 모든 정수 직교 변환 베이스가 획득되며, Diag는 대각 행렬 인 단계; The range of values of the transform base coefficients is k1, k2, k3∈ [1, 10], k4∈ [1, 4], and all integer orthogonal transform bases satisfying P · P ^T = Diag are obtained, and Diag is diagonal Being a matrix;

(b) 상관 계수

의 값이 0.75, 0.8, 0.85, 0.9 및 0.95일 때, 입력 이미지 잔류 오류 데이터의 공분산 행렬 COV(X_V)을 수립하되, (b) correlation coefficient

When the values of are 0.75, 0.8, 0.85, 0.9, and 0.95, establish a covariance matrix COV (X _V ) of the input image residual error data,

8 길이를 갖는 일차원 이미지 예측 잔류 오류 벡터는 X_V = [x₁, x₂, ...x₈]로 가정하며, 1차 마르코프(Markov) 모델에 기초하여 수립된 X_V요소의 공분산 행렬COV(X_V)은

이며,

는 인접 X_V 요소 사이의 상관 계수이며,

≤1인 단계;The one-dimensional image prediction residual error vector with 8 lengths is assumed to be X _V = [x ₁ , x ₂ , ... x ₈ ] and the covariance matrix COV of the X _V elements established based on the first-order Markov model (X _V ) is

Is,

Is the correlation coefficient between adjacent X _V elements,

≤ 1;

(c) 변환 베이스에 상응하는 변환 행렬 P를 통하여 변환 도메인의 공분산 행렬 COV(Y_V)를 획득하되, (c) obtaining the covariance matrix COV (Y _V ) of the transform domain through the transform matrix P corresponding to the transform base,

변환 베이스 (k1, k2, k3, k4)에 상응하는 변환 행렬 P는 표준화되며, 즉 P의 각 행은 직교 행렬 P_u을 획득하기 위하여 행의 벡터 길이만큼 분할되며, X_V는 Y_V=P_uX_V와 같이 직교적으로 변환되며, Y_V의 공분산 행렬은

인 단계;The transformation matrix P corresponding to the transformation base (k1, k2, k3, k4) is normalized, i.e. each row of P is divided by the vector length of the row to obtain an orthogonal matrix P _u , where X _V is Y _V = P as _u X _V is converted to orthogonally, _V is the covariance matrix of Y

Phosphorus step;

(d) 상기 (b), (c)단계를 통하여, 상관 계수

의 값이 0.75, 0.8, 0.85, 0.9 및 0.95일 때, 에너지 집중화 효율

E 및 감상관 효율

C을 계산하되,(d) through the steps (b) and (c), the correlation coefficient

When the values of are 0.75, 0.8, 0.85, 0.9 and 0.95, energy concentration efficiency

E and auditorium efficiency

Calculate C,

상기 에너지 집중화 효율

E은 이하와 같이 정의되며,The energy concentration efficiency

E is defined as

,

상기 감상관 효율

C는 이하와 같이 정의되는 단계;The appreciation hall efficiency

C is defined as follows;

(e) 소정의 상관 계수

의 값에서 각각의 변환 베이스를 위한 에너지 집중화 효율

E 및 감상관 효율

C의 표준화된 결과를 계산하되, 상기 동일한

에서 i번째 변환 베이스를 위한

E의 표준화된 결과는 다음과 같으며,(e) predetermined correlation coefficient

Energy concentration efficiency for each conversion base at the value of

E and auditorium efficiency

Compute a standardized result of C, but with the same

For the i th conversion base

The standardized result of E is

i번째 변환 베이스를 위한

C의 표준화된 결과는 다음과 같은 단계;for the i transform base

The standardized results of C are as follows;

(f) 각 상관 계수

에서 모든 그룹의 베이스에 대한 상기 에너지 집중화 효 율

E 및 상기 감상관 효율

C의 복합 평가값 Eval_E 및 Evla_C를 획득하기 위하여, 가중된 합을 계산하되, 5개의

지점에 상응하는 가중치는 각각 1/15, 2/15, 3/15, 4/15 및 5/15인 단계 및(f) each correlation coefficient

The energy concentration efficiency for the bases of all groups in

E and the appreciation hall efficiency

In order to obtain the composite evaluation values of Eval _E and Evla _C of _C , the weighted sum is calculated, but 5

The weights corresponding to the points are 1/15, 2/15, 3/15, 4/15 and 5/15, respectively, and

(g) 변환 베이스 성능에 대한 복합 평가값(Eval)을 획득하기 위하여 상기 Evla_C및 Eval_E의 가중된 합을 계산하되, Evla_C및 Eval_E의 가중치는 각각 0.4 및 0.6인 단계를 포함하는 것을 특징으로 한다.(g) calculating the weighted sum of Evla _C and Eval _E to obtain a composite evaluation value (Eval) for transform base performance, wherein the weights of Evla _C and Eval _E include 0.4 and 0.6, respectively. It features.

비디오 코딩시 전술한 정수 변환 행렬 선택 방법의 다른 특징은 변환 베이스의 성능에 대한 상기 복합 평가값(Eval)이 획득된 후, 변환 베이스 (k1, k2, k3, k4)를 위한 계산 복잡성을 평가하기 위한 단계가 추가되며; 우선 더 높은 복합 평가값 Eval을 갖는 변환 베이스가 선택되며, 만약 Eval 값들의 차이가 0.02 보다 작으면, 계산 복잡성에서 더 많은 이점을 제공하는 베이스 즉, 더 적은 덧셈/뺄셈 및 더 적은 시프팅 연산을 요구하는 베이스가 더 나은 실시간 성능을 요구하는 애플리케이션에 바람직한 것을 특징으로 한다.Another feature of the above-described integer transform matrix selection method in video coding is to evaluate the computational complexity for the transform base (k1, k2, k3, k4) after the composite evaluation value (Eval) for the performance of the transform base is obtained. Steps are added; First, the transform base with the higher composite evaluation value Eval is selected, and if the difference between the Eval values is less than 0.02, then a base that provides more benefit in computational complexity, i.e. fewer addition / subtraction and fewer shifting operations, The required base is characterized by being desirable for applications requiring better real time performance.

본 발명의 비디오 코딩시 정수 변환 방법에 따르면. 인트라-프레임 또는 인터-프레임을 통한 인코딩 측면에서, 블록의 예측 잔류 에러가 획득되며, 예측 및 블록 변환은 에너지가 소량의 계수에 집중될 수 있도록 실행되며, 그리고 나서, 양자화, 스캐닝, 런 렝스 코딩 및 엔트로피 코딩을 통하여, 이미지 데이터는 압축되며, 코딩 비트스트림에 기록되고, 디코딩 측면에서, 엔트로피 코딩의 블록 변환 계 수는 비트 스트림으로부터 추출되며, 그리고 나서 역양자화 및 역변환을 통하여, 블록의 예측 잔류 오류가 재구성되며, 비디오 데이터를 재구성하기 위하여 예측 정보가 함께 사용되는 방법으로서, According to the integer conversion method in video coding of the present invention. In terms of encoding via intra-frame or inter-frame, the predictive residual error of the block is obtained, and the prediction and block transforms are performed so that energy can be concentrated on a small amount of coefficients, and then quantization, scanning, run length coding And through entropy coding, the image data is compressed, recorded in the coding bitstream, and in terms of decoding, the block transform coefficients of the entropy coding are extracted from the bit stream, and then through inverse quantization and inverse transformation, the prediction residual of the block Error is reconstructed, and a method in which prediction information is used together to reconstruct video data,

(a) 상기 청구항 1 또는 청구항 2에서 청구된 바와 같은 비디오 코딩시 정수 변환 행렬 선택 방법을 통하여 비디오 코딩시 8×8 정수 변환시 사용된 변환 행렬 P를 획득하되, 상기 변환 행렬 P는 이하와 같이 표현되며, 상기 상응하는 정수 변환 베이스는 (5, 6, 4, 1)인 단계;(a) Acquire a transform matrix P used for 8 × 8 integer transform during video coding through an integer transform matrix selection method in video coding as claimed in claim 1, wherein the transform matrix P is expressed as follows. The corresponding integer conversion base is (5, 6, 4, 1);

(b) 8×8 이미지 잔류 오류 데이터 블록 상에서 Y=PXP^T 로 표현되는 정수변환을 실행하되, 기본 변환 유닛은 y=Px와 같이 표현되는 8-포인트 일차원 변환이며, 여기서, x=[x0, x1, x2, x3, x4, x5, x6, x7]^T , 출력 벡터 y=[y0, y1, y2, y3, y4, y5, y6, y7]^T 이며, 계산은 다음과 같은 단계;(b) perform an integer transform represented by Y = PXP ^T on an 8 × 8 image residual error data block, wherein the basic transform unit is an 8-point one-dimensional transform expressed as y = Px, where x = [x0, x1, x2, x3, x4, x5, x6, x7] ^T , output vector y = [y0, y1, y2, y3, y4, y5, y6, y7] ^T , and the calculation is performed as follows;

A. a=x0-x7, a1=x1-x6, a2=x2-x5, a3=x3-x4, a4=x0+x7, a5=x1+x6, a6=x2+x5, a7=x3+x4;A. a = x0-x7, a1 = x1-x6, a2 = x2-x5, a3 = x3-x4, a4 = x0 + x7, a5 = x1 + x6, a6 = x2 + x5, a7 = x3 + x4;

B. b0=a4+a7, b1=a5+a6, b2=a4-a7, b3=a5-a6;B. b0 = a4 + a7, b1 = a5 + a6, b2 = a4-a7, b3 = a5-a6;

C. y0=b0+b1, y4=b0-b1, y2=b2<<1+b3, y6=b2-b3<<1;C. y0 = b0 + b1, y4 = b0-b1, y2 = b2 << 1 + b3, y6 = b2-b3 << 1;

이하의 식과 동일하게 표현되는 계산 과정을 완성시키며,Complete the calculation process expressed in the same way as

D. c0=a0<<2+a0+a3; c1=a2-a1-a1<<2; c2=a1+a2+a2<<2; c3=a3<<2+a3-a0;D. c0 = a0 << 2 + a0 + a3; c1 = a2-a1-a1 << 2; c2 = a1 + a2 + a2 << 2; c3 = a3 << 2 + a3-a0;

E. y1=c0-c1+c2; y3=c0-c2-c3; y5=c0+c1+c3; y7=c1+c2-c3;E. y1 = c0-c1 + c2; y3 = c0-c2-c3; y5 = c0 + c1 + c3; y7 = c1 + c2-c3;

(c) 일차원 역변환을 수행하되, x=P^Ty를 일치원 변환의 기본 유닛으로 정의하며, y=[y0, y1, y2, y3, y4, y5, y6, y7]^T, x=[x0, x1, x2, x3, x4, x5, x6, x7]^T 이며, 상기 일차원 역변환은 이하와 같이 수행되며,(c) Perform one-dimensional inverse transformation, where x = P ^T y is defined as the base unit of coincidence transformation, y = [y0, y1, y2, y3, y4, y5, y6, y7] ^T , x = [x0 , x1, x2, x3, x4, x5, x6, x7] ^T , and the one-dimensional inverse transform is performed as follows.

A. m0=y0+y4; m1=y0-y4; m2=y2<<1+y6; m3=y2-y6<<1;A. m0 = y0 + y4; m1 = y0-y4; m2 = y2 << 1 + y6; m3 = y2-y6 << 1;

B. b0=m0+m2; b1=m1+m3; b2=m1-m3; b3=m0-m2;B. b0 = m0 + m2; b1 = m1 + m3; b2 = m1-m3; b3 = m0-m2;

C. 이하의 식을 이용하여 4×4 행렬곱을 계산하며;C. Calculate a 4x4 matrix product using the following formula;

상기 계산에서 변환시 4×4 행렬곱은 동일하며, 단지 입력 및 출력만이 교환 되며;In the calculation the 4 × 4 matrix product is the same at conversion, only input and output are exchanged;

D. x0=a0+b0; x1=a1+b1; x2=a2+b2; x3=a3+b3;D. x0 = a0 + b0; x1 = a1 + b1; x2 = a2 + b2; x3 = a3 + b3;

x7=-a0+b0; x6=-a1+b1; x5=-a2+b2; x4=-a3+b3; x7 = -a0 + b0; x6 = -a1 + b1; x5 = -a2 + b2; x4 = -a3 + b3;

여기서, "<<"연산은 레프트 시프팅 연산을 나타내며, 덧셈/뺄셈 연산의 우선 순위보다 더 높은 우선 순위를 갖는다. "a<<b"는 a가 비트 레프트 시프트된다는 것을 나타내는 단계를 포함한다.Here, the "<<" operation represents a left shifting operation and has a higher priority than the priority of the addition / subtraction operation. “a << b” includes a step indicating that a is left left shifted.

본 발명의 비디오 코딩시 다른 정수 변환 방법에 따르면, 인트라-프레임 또는 인터-프레임을 통한 인코딩 측면에서, 블록의 예측 잔류 에러가 획득되며, 예측 및 블록 변환은 에너지가 소량의 계수에 집중될 수 있도록 실행되며, 그리고 나서, 양자화, 스캐닝, 런 렝스 코딩 및 엔트로피 코딩을 통하여, 이미지 데이터는 압축되며, 코딩 비트스트림에 기록되고, 디코딩 측면에서, 엔트로피 코딩의 블록 변환 계수는 비트 스트림으로부터 추출되며, 그리고 나서 역양자화 및 역변환을 통하여, 블록의 예측 잔류 오류가 재구성되며, 비디오 데이터를 재구성하기 위하여 예측 정보가 함께 사용되는 방법으로서, According to another integer transformation method in the video coding of the present invention, in terms of encoding through an intra-frame or an inter-frame, a prediction residual error of a block is obtained, and the prediction and block transformation is performed so that energy can be concentrated in a small amount of coefficients. And then, through quantization, scanning, run length coding and entropy coding, the image data is compressed, written to the coding bitstream, in terms of decoding, the block transform coefficients of the entropy coding are extracted from the bit stream, and Then, through inverse quantization and inverse transformation, a prediction residual error of a block is reconstructed, and a method in which prediction information is used together to reconstruct video data,

(a) 상기 청구항 1 또는 청구항 2에서 청구된 바와 같은 비디오 코딩시 정수 변환 행렬 선택 방법을 통하여 비디오 코딩시 8×8 정수 변환시 사용된 변환 행렬 P를 획득하되, 상기 변환 행렬 P는 이하와 같이 표현되며, 상기 상응하는 정수 변환 베이스는 (4, 5, 3, 1)인 단계;(a) Acquire a transform matrix P used for 8 × 8 integer transform during video coding through an integer transform matrix selection method in video coding as claimed in claim 1, wherein the transform matrix P is expressed as follows. The corresponding integer conversion base is (4, 5, 3, 1);

D. c0=a0<<2+a3; c1=a2-a1<<2; c2=a1+a2<<2; c3=a3<<2-a0;D. c0 = a0 << 2 + a3; c1 = a2-a1 << 2; c2 = a1 + a2 << 2; c3 = a3 << 2-a0;

상기 계산에서 변환시 4×4 행렬곱은 동일하며, 단지 입력 및 출력만이 교환되며;In the calculation the 4 × 4 matrix product is the same at conversion, only input and output are exchanged;

본 발명에 따르면, 정수 변환 베이스의 성능을 위한 복합 평가 방법이 제안 되며, 그리고 이러한 방법에 기초한 더 나은 성능을 갖는 몇몇 그룹의 변환 베이스가 선택되며, 2개 그룹의 변환 베이스를 위한 고속 변환 방법이 제공된다. 고해상도 비디오 테스팅 시퀀스의 테스트 결과는 본 발명에 따른 바람직한 변환 베이스 그룹의 성능은 JVT의 ABT(적응성 블록 변환;Adaptive Block Transform) 8×8 변환 보다 우수하다는 것을 증명하며, 여기서 베이스(10, 9, 6, 2)는 최선의 변환 성능을 나타내며, (4, 5, 3, 1)은 가장 낮은 계산 복잡성을 제공하며, 그리고 (5, 6, 4, 1)의 성능은 상기 두 개의 중간에 위치한다. ABT 8×8 변환과 비교하여, 3개의 그룹 베이스는 변환 성능 및 계산 복잡성에서의 이점을 유지하였다. 더욱이, 선택된 변환 베이스의 테스트된 성능은 본 발명에 따른 변환 베이스 선택 방법의 정확성 및 실행 가능성을 입증한다. 상기 방법은 정수 변환 행렬에 적합할 뿐만 아니라, 변환 행렬의 선택을 위한 중요성을 포함하는 다양한 변환 행렬의 성능 평가에도 적합하다. According to the present invention, a complex evaluation method for the performance of integer transform base is proposed, and several groups of transform bases with better performance based on this method are selected, and a fast transform method for two groups of transform bases is selected. Is provided. Test results of high resolution video testing sequences demonstrate that the performance of the preferred transform base group according to the present invention is superior to JVT's ABT (Adaptive Block Transform) 8 × 8 transform, where the bases 10, 9, 6 , 2) represents the best conversion performance, (4, 5, 3, 1) provides the lowest computational complexity, and the performance of (5, 6, 4, 1) lies between the two. Compared to the ABT 8x8 transform, the three group bases retained the advantages in transform performance and computational complexity. Moreover, the tested performance of the selected transform base demonstrates the accuracy and feasibility of the transform base selection method according to the present invention. The method is not only suitable for integer transformation matrices, but also for performance evaluation of various transformation matrices, including the importance for the selection of the transformation matrices.

(1) 변환 베이스의 선택(1) Selection of conversion base

변환 베이스의 평가 절차는 도 1에 도시된다.The evaluation procedure of the transform base is shown in FIG.

다양한 이미지 잔류 에러 데이터의 상관 계수(

)의 값은 0.75와 0.95 사이에서 주로 분배된다. 0.75, 0.8, 0.85, 0.9 및 0.95의

값에서 각각의 변환 베이스에 상응하는 에너지 집중화 효율(

E) 값들이 계산되며, 동일한

에서 다양한 변환 베이스의

E 값들은 표준화된다. 상이한 상관 계수

에서 동일한 변환 베이스에 상응한

E의 표준화된 결과의 가중된 합은 베이스 그룹에 상응한 에너지 집중화 효율

E의 복합 평가값(Eval_E)을 획득하기 위하여 계산되며, 여기서 가중치는 상이한

값의 확률에 의해 결정된다. 본 발명에 따르면, 5개의

포인트에 상응하는 가중치는 연속적으로 1/15, 2/15, 3/15, 4/15, 5/15로 설정된다. 변환 베이스 그룹에 상응한 감상관 효율(de-correlation efficiency)

c의 복합 평가값(Eval_C)은 동일한 절차로 계산될 수 있다. Correlation Coefficients of Various Image Residual Error Data

) Is mainly distributed between 0.75 and 0.95. 0.75, 0.8, 0.85, 0.9 and 0.95

The energy concentration efficiency corresponding to each conversion base in the value (

E) The values are calculated and the same

Of various conversion bases

E values are normalized. Different correlation coefficients

Corresponds to the same conversion base in

The weighted sum of the standardized results of E is the energy concentration efficiency corresponding to the base group.

Calculated to obtain a composite estimate of _E (Eval _E ), where the weights are different

It is determined by the probability of the value. According to the invention, five

The weights corresponding to the points are successively set to 1/15, 2/15, 3/15, 4/15, 5/15. De-correlation efficiency corresponding to transform base group

The composite evaluation value of c (Eval _C ) can be calculated with the same procedure.

최종적으로, 변환 베이스에 상응한 에너지 집중화 효율의 복합 평가값(Eval) 및 감상관 효율은 Eval_E 및 Eval_C의 가중된 합을 계산함으로써 획득될 수 있다. 에너지 집중화 효율은 변환 후에 압축 성능에 직접적으로 영향을 주기 때문에, 그것의 가중치는 더 크다. 평가값(Eval_E 및 Eval_C)의 가중치는 본 발명에서 각각 0.6 및 0.4로 정의된다. Finally, the composite evaluation value Eval and the auditorium efficiency of the energy concentration efficiency corresponding to the conversion base can be obtained by calculating the weighted sum of Eval _E and Eval _C. Since the energy concentration efficiency directly affects the compression performance after conversion, its weight is greater. The weights of the evaluation values Eval _E and Eval _C are defined as 0.6 and 0.4 in the present invention, respectively.

Eval의 값을 마감(close)할 때, 더 낮은 계산 복잡성을 갖는 베이스는 더 잘 수행된다. When closing the value of Eval, the base with lower computational complexity performs better.

이하의 표는 5개의 그룹 베이스에 상응하는

E 및

C의 복합 평가값을 도시하며, 변환 베이스의 범위가 k1, k2, k3∈[1, 10] 및 k4∈[1, 4]일 때, 8-포인트 일차원 변환을 완성하기 위하여 필요한 덧셈 회수 및 시프팅 연산의 회수를 나타낸다. (변환 및 역변환을 위한 연산의 회수는 동일하다)The table below corresponds to five group bases.

E and

Addition number and shift necessary to complete an 8-point one-dimensional transform when the transform base ranges k1, k2, k3 '[1, 10] and k4' [1, 4] Indicates the number of ting operations. (The number of operations for transform and inverse transform is the same)

k1, k2, k3, k4 k1, k2, k3, k4

E 및

C의 복합 평가값

E and

Composite evaluation of C Number of additions (+/-) Number of shifting operations (<<) 10, 9, 6, 2 0.9859 36 10 5, 6, 4, 1 0.8579 32 6 6, 6, 3, 2 0.8441 36 10 6, 7, 5, 1 0.8409 32 10 4, 5, 3, 1 0.8249 28 6

(10, 9, 6, 2) 및 (6, 6, 3, 2)는 관련된 논문에서 제안되었다. 베이스 (5, 6, 4, 1)에 상응하는 감상관 효율 및 에너지 집중화 효율의 복합 평가값은 베이스 (10, 9, 6, 2)에 상응하는 것 다음이며, 계산 복잡성은 더 낮다. 베이스 (4, 5, 3, 1)에 상응하는 복합 평가값은 베이스 (6, 6, 3, 2)에 상응하는 복합 평가값 보다 다소 낮지만, 계산 복잡성에서의 이점은 명백하다. 실제 비디오 시퀀스 테스트는 베이스 (5, 6, 4, 1), (4, 5, 3, 1) 및 (6, 7, 5, 1)에 의해 제공된 왜곡율 성능(distortion rate performance)은 (6, 6, 3, 2)에 의한 것 보다 더 좋으며, 베이스 (10, 9, 6, 2)에 의한 성능에 가장 근접하다.(10, 9, 6, 2) and (6, 6, 3, 2) have been proposed in related papers. The composite estimates of the auditorium efficiency and energy concentration efficiency corresponding to the bases (5, 6, 4, 1) are next to those corresponding to the bases (10, 9, 6, 2) and the computational complexity is lower. The composite estimate corresponding to base (4, 5, 3, 1) is somewhat lower than the composite estimate corresponding to base (6, 6, 3, 2), but the advantage in computational complexity is evident. The actual video sequence test shows that the distortion rate performance provided by the bases (5, 6, 4, 1), (4, 5, 3, 1) and (6, 7, 5, 1) is (6, 6). , Better than by 3, 2), closest to the performance by the base 10, 9, 6, 2.

(2) 8×8 정수 변환 고속 알고리즘의 구현도(2) Implementation diagram of 8x8 integer conversion fast algorithm

도 2 내지 도 5에서, x0, x1, x2, x3, x4, x5, x6 및 x7은 정수 변환의 일차원 변환의 8 입력값을 나타내며, 동시에 역변환의 8 출력값이다; 그리고 y0, y1, y2, y3, y4, y5, y6 및 y7은 일차원 변환의 8 출력값이며, 동시에 역변환의 8 입력값이다. 데이터 처리 방향은 좌측에서 우측이다. 점에서 교차하는 2개의 라인은 2개의 수의 덧셈을 나타내며, 한 점에서 교차하는 3개의 라인은 3개의 수의 덧셈을 나타낸다. 사각형은 계수에 의한 곱셈을 나타내며, 여기서, "-" 부정(negation)을 나타내며, "2"는 2의 곱셈 즉, 1 비트씩 레프트 시프팅을 나타내며; "4"는 4의 곱 셈 즉, 2비트씩 레프트 시프팅을 나타낸다.2 to 5, x0, x1, x2, x3, x4, x5, x6 and x7 represent 8 input values of the one-dimensional transform of the integer transform, and at the same time 8 output values of the inverse transform; And y0, y1, y2, y3, y4, y5, y6, and y7 are 8 output values of the one-dimensional transform, and 8 input values of the inverse transform. The data processing direction is from left to right. Two lines intersecting at a point represent the addition of two numbers, and three lines intersecting at a point represent the addition of three numbers. The square represents multiplication by coefficient, where "-" represents negation, and "2" represents multiplication of two, ie left shifted by one bit; "4" represents a multiplication of 4, that is, left shift by 2 bits.

1. 변환1. Conversion

정수 변환은 8×8 이미지 잔류 오류 데이터 블록 상에서 실행되며, 기본 변환 유닛은 y=Px와 같은 8 포인트 일차원 변환이며, x=[x0, x1, x2, x3, x4, x5, x6, x7]^T 이며, 출력 y=[y0, y1, y2, y3, y4, y5, y6, y7]^T이다. 계산 절차는 다음과 같다:The integer transform is performed on an 8 × 8 image residual error data block, and the basic transform unit is an 8 point one-dimensional transform such as y = Px, where x = [x0, x1, x2, x3, x4, x5, x6, x7] ^T And output y = [y0, y1, y2, y3, y4, y5, y6, y7] ^T. The calculation procedure is as follows:

우선, 변환이 상이한 변환 행렬 P로 실행될 때, 공통 단계는 다음과 같다:First, when the transform is performed with a different transform matrix P, the common steps are as follows:

(1) a=0x0-x7, a1=x1-x6, a2=x2-x5, a3=x3-x4, a4=x0+x7, a5=x1+x6, a6=x2+x5, a7=x3+x4;(1) a = 0x0-x7, a1 = x1-x6, a2 = x2-x5, a3 = x3-x4, a4 = x0 + x7, a5 = x1 + x6, a6 = x2 + x5, a7 = x3 + x4 ;

(2) b0=a4+a7, b1=a5+a6, b2=a4-a7, b3=a5-a6;(2) b0 = a4 + a7, b1 = a5 + a6, b2 = a4-a7, b3 = a5-a6;

(3) y0=b0+b1, y4=b0-b1, y2=b2<<1+b3, y6=b2-b3<<1;(3) y0 = b0 + b1, y4 = b0-b1, y2 = b2 << 1 + b3, y6 = b2-b3 << 1;

여기서, 계산의 동일한 부분은 16 덧셈/뺄셈 및 2 시프팅 연산이 필요하다.Here, the same part of the calculation requires 16 addition / subtraction and two shifting operations.

그리고 나서, 이하의 식으로 계산되는 것과 동일한 개별 단계가 실행된다:Then, the same individual steps are executed as calculated by the equation:

베이스 (5, 6, 4, 1)에 상응하는 계산 단계는 다음과 같다:The calculation steps corresponding to the bases (5, 6, 4, 1) are as follows:

(1) c0=a0<<2+a0+a3; c1=a2-a1-a1<<2; c2=a1+a2+a2<<2; c3=a3<<2+a3-a0;(1) c0 = a0 << 2 + a0 + a3; c1 = a2-a1-a1 << 2; c2 = a1 + a2 + a2 << 2; c3 = a3 << 2 + a3-a0;

(2) y1=c0-c1+c2;y3=c0-c2-c3; y5=c0+c1+c3; y7=c1+c2-c3; (2) y1 = c0-c1 + c2; y3 = c0-c2-c3; y5 = c0 + c1 + c3; y7 = c1 + c2-c3;

전체 16 덧셈/뺄셈 및 4 시프팅 연산이 필요하다.A total of 16 addition / subtraction and four shifting operations are required.

베이스 (4, 5, 3, 1)을 위한 계산 단계는 다음과 같다:The calculation steps for the base (4, 5, 3, 1) are as follows:

(1) c0=a0<<2+a3; c1=a2-a1<<2; c2=a1+a2<<2; c3=a3<<2-a0;(1) c0 = a0 << 2 + a3; c1 = a2-a1 << 2; c2 = a1 + a2 << 2; c3 = a3 << 2-a0;

(2) y1=c0-c1+c2; y3=c0-c2-c3; y5=c0+c1+c3; y7=c1+c2-c3;(2) y1 = c0-c1 + c2; y3 = c0-c2-c3; y5 = c0 + c1 + c3; y7 = c1 + c2-c3;

전체 12 덧셈/뺄셈 및 4 시프팅 연산이 필요하다.A total of 12 addition / subtraction and four shifting operations are required.

따라서, y=Px를 한 번에 완성하기 위하여, 변환 베이스 (5, 6, 4, 1)에 대하여 전체 32 덧셈/뺄셈 및 6 시프팅 연산이 필요하며, 변환 베이스 (4, 5, 3, 1)에 대하여 28 덧셈/뺄셈 및 6 시프팅 연산이 필요하다. 8×8 블록에 한 번의 정수 변환을 완성시키기 위하여 필요한 계산량은 전술된 유닛 계산량의 16배이다. 베이스 (5, 6, 4, 1)을 위한 변환의 고속 알고리즘은 도 2에 도시된다. 베이스 (4, 5, 3, 1)을 위한 변환의 고속 알고리즘은 도 4에 도시된다.Thus, to complete y = Px at once, a total of 32 addition / subtraction and six shifting operations are required for the transform base (5, 6, 4, 1), and the transform base (4, 5, 3, 1). 28 addition / subtraction and 6 shifting operations are required. The amount of computation necessary to complete one integer transform in an 8x8 block is 16 times the unit computation described above. The fast algorithm of the transform for the base 5, 6, 4, 1 is shown in FIG. 2. The fast algorithm of the transform for the bases 4, 5, 3, 1 is shown in FIG.

2. 역변환2. Inverse transformation

기본 일차원 변환 유닛은 x=P^Ty로 정의되며, y=[y0, y1, y2, y3, y4, y5, y6, y7]^T, x=[x0, x1, x2, x3, x4, x5, x6, x7]^T 이다. 이하의 단계는 1회의 x=P^Ty 계산이다.The basic one-dimensional transform unit is defined as x = P ^T y, where y = [y0, y1, y2, y3, y4, y5, y6, y7] ^T , x = [x0, x1, x2, x3, x4, x5, x6, x7] ^T. The following steps are one x = P ^T y calculation.

(1) m0=y0+y4; m1=y0-y4; m2=y2<<1+y6; m3=y2-y6<<1;(1) m0 = y0 + y4; m1 = y0-y4; m2 = y2 << 1 + y6; m3 = y2-y6 << 1;

(2) b0=m0+m2; b1=m1+m3; b2=m1-m3; b3=m0-m2;(2) b0 = m0 + m2; b1 = m1 + m3; b2 = m1-m3; b3 = m0-m2;

(3) 이하의 식을 이용하여 4×4 행렬곱을 계산하는 단계:(3) calculating the 4 × 4 matrix product using the following equation:

계산식 및 변환에서, 행렬곱과 알고리즘은 동일하며, 단지 입력 및 출력 데이터 벡터만이 교환된다. 2가지 표현의 계산량은 동일하다. 베이스 (5, 6, 4, 1)을 위하여, 16 뎃셈/뺄셈 및 4 시프팅 연산이 필요하며, 베이스 (4, 5, 3, 1)을 위하여, 12 덧셈/뺄셈 및 4 시프팅 연산이 필요하다. In equations and transformations, matrix multiplication and algorithm are the same, only the input and output data vectors are exchanged. The computations of the two representations are the same. For the base (5, 6, 4, 1), 16 subtraction / subtraction and 4 shifting operations are needed, and for the base (4, 5, 3, 1), 12 addition / subtraction and 4 shifting operations are required. Do.

(4) x0=a0+b0; x1=a1+b1; x2=a2+b2; x3=a3+b3;(4) x0 = a0 + b0; x1 = a1 + b1; x2 = a2 + b2; x3 = a3 + b3;

"<<"연산은 레프트 시프팅 연산을 나타내며, 덧셈/뺄셈 연산의 우선 순위보다 더 높은 우선 순위를 갖는다. "a<<b"는 a가 비트 레프트 시프트된다는 것을 나타낸다.The "<<" operation represents a left shift operation and has a higher priority than the priority of the add / sub operation. "a << b" indicates that a is left left shifted.

공통 부분의 계산량은 16 덧셈/뺄셈 및 2 시프팅 연산이다.The amount of computation in the common part is 16 addition / subtraction and two shifting operations.

따라서, 1회의 x=P^Ty를 완성시키기 위하여, 베이스 (5, 6, 4, 1)을 위하여, 32 덧셈/뺄셈 및 6 시프팅 연산이 필요하며, 베이스 (4, 5, 3, 1)을 위하여, 28 덧셈/뺄셈 및 6 시프팅 연산이 필요하다. 베이스 (5, 6, 4, 1)을 위한 역변환의 고속 알고리즘은 도 3에 도시된다. 베이스 (4, 5, 3, 1)을 위한 역변환의 고속 알고리즘은 도 5에 도시된다. 8×8 블록에 1회의 정수 변환의 역변환을 완성시키기 위하여 필요한 계산량은 전술한 유닛 계산량의 16배이다.Thus, to complete one x = P ^T y, for the base (5, 6, 4, 1), 32 addition / subtraction and 6 shifting operations are needed, and the base (4, 5, 3, 1) For this, 28 addition / subtraction and 6 shifting operations are required. The fast algorithm of the inverse transform for the bases 5, 6, 4, 1 is shown in FIG. The fast algorithm of the inverse transform for the bases 4, 5, 3, 1 is shown in FIG. The amount of computation necessary to complete the inverse transformation of one integer transform in an 8x8 block is 16 times the unit computation described above.

본 발명에 따르면, 정수 변환 베이스의 성능을 위한 복합 평가 방법이 제안되며, 그리고 이러한 방법에 기초한 더 나은 성능을 갖는 몇몇 그룹의 변환 베이스가 선택되며, 2개 그룹의 변환 베이스를 위한 고속 변환 방법이 제공된다.According to the present invention, a complex evaluation method for the performance of an integer transform base is proposed, and several groups of transform bases with better performance based on this method are selected, and a fast transform method for two groups of transform bases is selected. Is provided.

Claims

In the method for selecting an integer transform matrix in video coding,

(a) Search all integer transform bases satisfying orthogonal conditions in a specific range, but the transform base for 8 × 8 transform matrix P is defined as (k1, k2, k3, k4),

The range of transform base coefficient values is k1, k2, k3∈ [1, 10], k4∈ [1, 4], and all integer orthogonal transform bases satisfying P · P ^T = Diag are obtained, and Diag is diagonal A matrix;

(b) correlation coefficient

The one-dimensional image prediction residual error vector with 8 lengths is assumed to be X _V = [x ₁ , x ₂ , ... x ₈ ], and the covariance matrix COV of the X _V elements established based on the first-order Markov model (X _V ) is

Is,

Is the correlation coefficient between adjacent X _V elements,

≤ 1;

(c) obtaining the covariance matrix COV (Y _V ) of the transform domain through the transform matrix P corresponding to the transform base,

The transformation matrix P corresponding to the transformation base (k1, k2, k3, k4) is normalized, i.e. each row of P is divided by the vector length of the row to obtain an orthogonal matrix P _u , where X _V is Y _V = P as _u X _V is converted to orthogonally, _V is the covariance matrix of Y

Phosphorus step;

(d) through the steps (b) and (c), the correlation coefficient

E and auditorium efficiency

Calculate C,

The energy concentration efficiency

E is defined as

,

The appreciation hall efficiency

C is defined as follows;

(e) predetermined correlation coefficient

Energy concentration efficiency for each conversion base at the value of

E and auditorium efficiency

Compute a standardized result of C, but with the same

For the i th conversion base

The standardized result of E is

for the i transform base

The standardized results of C are as follows;

(f) each correlation coefficient

The energy concentration efficiency for all groups of bases in

E and the appreciation hall efficiency

Corresponding weights are 1/15, 2/15, 3/15, 4/15 and 5/15, respectively, and

(g) Computing the weighted sum of Evla _C and Eval _E to obtain a composite evaluation value (Eval) for transform base performance, wherein the weights of Evla _C and Eval _E are 0.4 and 0.6, respectively.

The method according to claim 1, wherein after the composite evaluation value (Eval) of the performance of the transform base is obtained, a step for evaluating the computational complexity for the transform base (k1, k2, k3, k4) is added; First, the transform base with the higher composite evaluation value Eval is selected, and if the difference between the Eval values is less than 0.02, then a base that provides more benefit in computational complexity, i.e. fewer addition / subtraction and fewer shifting operations, Characterized in that the desired base is preferred for applications requiring better real time performance.

Integer conversion method for video coding,

In terms of encoding via intra-frame or inter-frame, the predictive residual error of the block is obtained, and the prediction and block transforms are performed so that energy can be concentrated on a small amount of coefficients, and then quantization, scanning, run length coding And through entropy coding, the image data is compressed and recorded in a coding bitstream,

In terms of decoding, the block transform coefficients of entropy coding are extracted from the bit stream, and then through inverse quantization and inverse transform, the predictive residual error of the block is reconstructed, and the prediction information is used together to reconstruct the video data. In

(a) Acquire a transform matrix P used for 8 × 8 integer transform during video coding through an integer transform matrix selection method in video coding as claimed in claim 1, wherein the transform matrix P is expressed as follows. The corresponding integer conversion base is (5, 6, 4, 1);

(b) perform an integer transform represented by Y = PXP ^T on an 8 × 8 image residual error data block, wherein the basic transform unit is an 8-point one-dimensional transform expressed as y = Px, where x = [x0, x1, x2, x3, x4, x5, x6, x7] ^T , output vector y = [y0, y1, y2, y3, y4, y5, y6, y7] ^T , and the calculation is performed as follows;

A. a = x0-x7, a1 = x1-x6, a2 = x2-x5, a3 = x3-x4, a4 = x0 + x7, a5 = x1 + x6, a6 = x2 + x5, a7 = x3 + x4;

B. b0 = a4 + a7, b1 = a5 + a6, b2 = a4-a7, b3 = a5-a6;

C. y0 = b0 + b1, y4 = b0-b1, y2 = b2 << 1 + b3, y6 = b2-b3 << 1;

Complete the calculation process expressed in the same way as

D. c0 = a0 << 2 + a0 + a3; c1 = a2-a1-a1 << 2; c2 = a1 + a2 + a2 << 2; c3 = a3 << 2 + a3-a0;

E. y1 = c0-c1 + c2; y3 = c0-c2-c3; y5 = c0 + c1 + c3; y7 = c1 + c2-c3;

(c) Perform one-dimensional inverse transformation, where x = P ^T y is defined as the base unit of coincidence transformation, y = [y0, y1, y2, y3, y4, y5, y6, y7] ^T , x = [x0 , x1, x2, x3, x4, x5, x6, x7] ^T , and the one-dimensional inverse transform is performed as follows.

A. m0 = y0 + y4; m1 = y0-y4; m2 = y2 << 1 + y6; m3 = y2-y6 << 1;

B. b0 = m0 + m2; b1 = m1 + m3; b2 = m1-m3; b3 = m0-m2;

C. Calculate a 4x4 matrix product using the following formula;

In the calculation the 4 × 4 matrix product is the same at conversion, only input and output are exchanged;

D. x0 = a0 + b0; x1 = a1 + b1; x2 = a2 + b2; x3 = a3 + b3;

x7 = -a0 + b0; x6 = -a1 + b1; x5 = -a2 + b2; x4 = -a3 + b3;

Here, the "<<" operation represents a left shifting operation and has a higher priority than the priority of the addition / subtraction operation. “a << b” includes a step indicating that a is left left shifted.

Integer conversion method for video coding,

In terms of decoding, the block transform coefficients of entropy coding are extracted from the bit stream, and then through inverse quantization and inverse transform, the prediction residual error of the block is reconstructed, and the prediction information is used together to reconstruct the video data. ,

(a) Acquire a transform matrix P used for 8 × 8 integer transform during video coding through an integer transform matrix selection method in video coding as claimed in claim 1, wherein the transform matrix P is expressed as follows. The corresponding integer conversion base is (4, 5, 3, 1);

B. b0 = a4 + a7, b1 = a5 + a6, b2 = a4-a7, b3 = a5-a6;

C. y0 = b0 + b1, y4 = b0-b1, y2 = b2 << 1 + b3, y6 = b2-b3 << 1;

Complete the calculation process expressed in the same way as

D. c0 = a0 << 2 + a3; c1 = a2-a1 << 2; c2 = a1 + a2 << 2; c3 = a3 << 2-a0;

E. y1 = c0-c1 + c2; y3 = c0-c2-c3; y5 = c0 + c1 + c3; y7 = c1 + c2-c3;

A. m0 = y0 + y4; m1 = y0-y4; m2 = y2 << 1 + y6; m3 = y2-y6 << 1;

B. b0 = m0 + m2; b1 = m1 + m3; b2 = m1-m3; b3 = m0-m2;

C. Calculate a 4x4 matrix product using the following formula;

D. x0 = a0 + b0; x1 = a1 + b1; x2 = a2 + b2; x3 = a3 + b3;

x7 = -a0 + b0; x6 = -a1 + b1; x5 = -a2 + b2; x4 = -a3 + b3;

In the method for selecting an integer transform matrix in video coding,

(a) searching for an integer transform base that satisfies an orthogonal condition in a predetermined range, wherein the transform base for the 8x8 transform matrix P is defined as (k1, k2, k3, k4);

(b) the correlation coefficient of various image residual error data

Setting);

(c) the set correlation coefficients (

Energy Concentration Efficiency for

E and auditorium efficiency

Calculating C;

(d) a predetermined correlation coefficient (

Energy Concentration Efficiency for Each Transformation Base

E and auditorium efficiency

Calculating a standardized result of C, and

(e) the correlation coefficients (

The normalized energy concentration efficiency for the conversion base

E and auditorium efficiency

Calculating a weighted sum to obtain a composite evaluation value of C (Eval _E and Evla _C ).

The method of claim 5, wherein step (c)

(c1) a predetermined correlation coefficient (

Calculating a covariance matrix COV (X _V ) of the input image residual error data for

(c2) calculating an orthogonal transform matrix P _u for the transform base, and

(c3) calculating a covariance matrix COV (Y _V ) of a transform domain through a transform matrix P corresponding to the transform base.

The method of claim 6,

The covariance matrix COV (X _V ) is

_{_{And, X V = [x 1,}} x 2, ... x 8] is assumed as, in the X _V element established on the basis of the primary Markov (Markov) model

Is the correlation coefficient between adjacent X _V elements,

≤1, and

The covariance matrix COV (Y _V ) is

Where the transform matrix P corresponding to the transform base (k1, k2, k3, k4) is normalized, i.e. each row of P is divided by the vector length of the row to obtain an orthogonal matrix P _u , where X _V is Y _V Is orthogonally transformed as = P _u X _V ,

The energy concentration efficiency

E and auditorium efficiency

C is each

,

Method as characterized in that as defined.

The method of claim 5,

(f) calculating a weighted sum of the Evla _C and Eval _E to obtain a composite estimate (Eval) of the performance of the transform base.

9. The method of claim 8, wherein the weights of Evla _C and Eval _E are 0.4 and 0.6, respectively.

The method of claim 8,

(g) estimating the computational complexity for the transform base.

The method of claim 10, wherein a transform base having a higher composite evaluation value (Eval) for the transform base performance is selected, and if the difference between the Eval values is smaller than a predetermined value, selecting a transform base having an advantageous computational complexity. How to feature.

6. The integer orthogonal transform base according to claim 5, wherein a range of the transform base coefficient values is k1, k2, k3 '[1, 10], k4' [1, 4], and satisfies P · P ^T = Diag . Is obtained, and Diag is a diagonal matrix.

The method of claim 5, wherein the correlation coefficient

The value of is set to 0.75, 0.8, 0.85, 0.9 and 0.95.

The method of claim 13, wherein the five correlation coefficients in the step (e)

Weights corresponding to 1/15, 2/15, 3/15, 4/15 and 5/15, respectively.

In the integer conversion method for video coding,

(a) obtaining a transformation matrix P used for 8 × 8 integer transformation during video coding through an integer transformation matrix selection method during video coding as claimed in claim 5;

(b) perform an integer transform represented by Y = PXP ^T on an 8 × 8 image residual error data block, wherein the basic transform unit is an 8-point one-dimensional transform expressed as y = Px, where x = [x0 , x1, x2, x3, x4, x5, x6, x7] ^T , output vector y = [y0, y1, y2, y3, y4, y5, y6, y7] ^T and

(c) Perform one-dimensional inverse transformation, where x = P ^T y is defined as the base unit of coincidence transformation, y = [y0, y1, y2, y3, y4, y5, y6, y7] ^T , x = [x0 , x1, x2, x3, x4, x5, x6, x7] ^T.