KR100744442B1

KR100744442B1 - Improved cascaded compression method and system for digital video and images

Info

Publication number: KR100744442B1
Application number: KR1020017008507A
Authority: KR
Inventors: 크리시나마차리산타나
Original assignee: 코닌클리케 필립스 일렉트로닉스 엔.브이.
Priority date: 1999-11-03
Filing date: 2000-10-13
Publication date: 2007-08-01
Also published as: JP2003513563A; CN1186940C; WO2001033860A1; CN1342369A; KR20010089765A; EP1145562A1

Abstract

캐스케이드 압축 방식을 이용하는 시스템에서 양자화 에러를 감소시키는 시스템 방법을 개시한다. 캐스케이드 압축 방식에 의해 도입되는 예상된 양자화 에러가 결정된다. 제 2 또는 더 높은 단의 테스트 양자화기들에 기초한 예상된 양자화 에러들이 비교된다. 이러한 비교에 기초하여, 제 2 또는 더 높은 단의 압축을 위한 양자화기가 선택됨으로써 캐스케이드 압축 방식에 의해 야기되는 양자화 에러를 감소 및/또는 최소화한다.A system method for reducing quantization error in a system using a cascade compression scheme is disclosed. The expected quantization error introduced by the cascade compression scheme is determined. The expected quantization errors based on the second or higher stage test quantizers are compared. Based on this comparison, a quantizer for the second or higher stage compression is selected to reduce and / or minimize the quantization error caused by the cascade compression scheme.

디지털 비디오, 캐스케이드 압축 방법, 양자화기, 양자화 에러, 확률 분포 함수Digital Video, Cascade Compression Method, Quantizer, Quantization Error, Probability Distribution Function

Description

Improved cascaded compression method and system for digital video and images

본 발명은 일반적으로 비디오/이미지 압축 분야에 관한 것으로, 특히, 본 발명은 디지털 비디오 및 이미지들의 캐스케이드 압축(cascaded compression)에서 도입되는 양자화 에러를 감소시키는 시스템들 및 방법들에 관한 것이다.FIELD OF THE INVENTION The present invention relates generally to the field of video / image compression, and in particular, the present invention relates to systems and methods for reducing quantization errors introduced in cascaded compression of digital video and images.

비디오폰, 디지털 텔레비전, 원격지간 회의(teleconferencing), 및 정보 하이웨이는 최근의 디지털 시대의 단지 소수의 요소들일 뿐이다. 디지털 이미지들 및 비디오 처리의 발전들은 디지털 시대로의 진행에 도움을 주었다. 특히, 디지털 이미지 압축 방법들은 이러한 발전에 중요한 역할을 하였다. 이미지 압축은 디지털 이미지를 나타내는데 필요한 데이터량을 감소시킨다. 예를 들면, 컬러, 그레이 스케일(grey scale), 또는 2진 이미지들이 압축된 다음 압축 해제되어 원래의 이미지를 정확하게 생성할 수 있다.Video phones, digital television, teleconferencing, and information highways are just a few of the elements of the modern digital age. Advances in digital images and video processing have helped advance the digital age. In particular, digital image compression methods have played an important role in this development. Image compression reduces the amount of data needed to represent a digital image. For example, color, gray scale, or binary images can be compressed and then decompressed to produce the original image correctly.

일반적으로 데이터를 저장 또는 전송하기 전에 압축이 수행된다. 이것은 방대한 양의 정보가 경제적인 방식으로 저장되거나 및/또는 신속하게 전송될 수 있게 한다. 명백한 바와 같이, 이미지 압축은 일반적으로 압축 및 압축 해제를 포함하는 양방향 프로세스이다. 이들 프로세스들은 대칭적이지 않을 수 있다. 즉 한 프로세스에 소요되는 시간 및/또는 계산 파워가, 사용되는 압축 알고리즘의 주어진 다른 타입에 따라 다를 수 있다.In general, compression is performed before storing or transmitting data. This allows a vast amount of information to be stored and / or sent quickly in an economical manner. As is apparent, image compression is generally a bidirectional process that includes compression and decompression. These processes may not be symmetrical. That is, the time and / or computational power spent on one process may vary depending upon the other type of compression algorithm used.

일반적으로 2 가지 유형의 이미지 압축으로서 손실 압축 및 무손실 압축이 있다. 손실 압축에서, 압축 해제된 이미지는 원 이미지와 유사하나 정확하게 동일하지는 않다. 이것은 원 데이터의 적어도 일부가 변경되거나 해체되기 때문이다. 손실 압축 기술들은 샘플 서브샘플링(sample subsampling), 차분 펄스 부호화 변조(DPCM:differential pulse coding modulation), 및 이산 코사인 변환(DCT) 계수들의 양자화를 포함한다. 반면, 무손실 압축은 원래 이미지의 모든 데이터를 보유하는 것으로, 근본적으로 완전 가역 부호화 프로세스이다. 무손실 압축기술들은 가변길이 부호화(VLC: Variable-Length Coding) 및 런-길이 부호화(RLC: Run-Length Coding)를 포함한다.There are generally two types of image compression, lossy compression and lossless compression. In lossy compression, the decompressed image is similar to the original image but not exactly the same. This is because at least part of the original data is changed or disassembled. Lossy compression techniques include sample subsampling, differential pulse coding modulation (DPCM), and quantization of discrete cosine transform (DCT) coefficients. Lossless compression, on the other hand, holds all the data of the original image, which is essentially a fully reversible encoding process. Lossless compression techniques include Variable-Length Coding (VLC) and Run-Length Coding (RLC).

압축비는 통상적으로, 압축될 데이터 콘텐트와 압축 후 결과적인 데이터 사이의 비로 규정된다. 손실 압축 방법들은 100:1 이상의 압축비들을 제공할 수 있다. 무손실 압축 방법들은 일반적으로 대략 3:1의 비를 달성한다. 일반적으로 손실 압축비가 증가함에 따라 이미지 열화가 증가하므로 절충한다.The compression ratio is typically defined as the ratio between the data content to be compressed and the resulting data after compression. Lossy compression methods can provide compression ratios of 100: 1 or more. Lossless compression methods generally achieve a ratio of approximately 3: 1. In general, as image loss increases as the lossy compression ratio increases, this is a compromise.

압축비는 1 단(stage) 압축 또는 캐스케이드된(cascaded) 복수 단들의 압축에 의해 달성될 수 있다. 이미지 또는 비디오 소스는 입력 소스 신호가 직렬 방식으로 다단 압축될 때, 캐스케이드 압축된다. 예를 들면, 캐스케이드 압축에서, 소스 신호(이미지 또는 비디오)가 먼저 특정의 압축비로 압축되고, 이어서 제 2 또는 그 이상의 레벨(들)로 압축하여 보다 높은 압축비를 달성한다. 실제로, 보다 높은 압축이 효율적인 저장 또는 제한된 대역폭 전송에 필요로 될 수 있다.The compression ratio may be achieved by stage compression or cascaded compression of multiple stages. The image or video source is cascaded when the input source signal is multistage compressed in a serial manner. For example, in cascade compression, the source signal (image or video) is first compressed to a specific compression ratio and then to a second or higher level (s) to achieve a higher compression ratio. Indeed, higher compression may be required for efficient storage or limited bandwidth transmission.

JPEG(Joint Photographic Experts Group) 및 MPEG(Motion Picture Experts Group) 표준은 이미지들 및 비디오들 각각에 대해 가장 일반적으로 이용되는 압축 방식이다. JPEG 표준은 자연적인 실세계 장면들의 컬러 또는 그레이 스케일 이미지들의 압축을 위한 것이다. JPEG 표준이 무손실 및 손실 모드를 모두 포함하고 있지만, 보다 큰 압축비를 달성하기 위해서 보통 손실 모드에서 이용된다. 통상, 이미지는 이산 코사인 변환(DCT)을 이용하여 주파수 영역으로 변환된다. 그 결과 나타난 작은 값의 주파수 성분들이 제거되고, 큰 값의 성분들만 남겨둔다. 이들 큰 값의 성분들은 차분 펄스 부호 변조(DPCM)로 부호화되고 허프만(Huffman) 부호화된다. JPEG 압축은 조정 가능 특징이 있기 때문에 가변 압축비들이 가능하고 또한 특정 응용 조건들을 위한 알고리즘을 미세하게 조정할 수 있다.The Joint Photographic Experts Group (JPEG) and Motion Picture Experts Group (MPEG) standards are the most commonly used compression schemes for each of images and videos. The JPEG standard is for the compression of color or gray scale images of natural real-world scenes. Although the JPEG standard includes both lossless and lossy modes, it is usually used in lossy mode to achieve greater compression ratios. Typically, images are transformed into the frequency domain using discrete cosine transform (DCT). The resulting small frequency components are removed, leaving only the larger components. These large value components are coded with differential pulse code modulation (DPCM) and Huffman coded. Since JPEG compression is an adjustable feature, variable compression ratios are possible and can also fine tune the algorithm for specific application conditions.

MPEG 표준은, 보다 나은 압축비들을 제공하는데 이용되는 프레임간 부호화 기술들과 더불어 DCT 및 허프만 부호화 방법들을 이용한다. MPEG-1 및 MPEG-2는 각각 저해상도 이미지 시퀀스들과 고해상도 시퀀스들에 이용된다. MPEG-4는 프레임들보다는 통합된 시청각 대상물들(audio-visual objects) 및 장면들(scenes)에 초점을 맞추고 있다. MPEG-7은 시청각 콘텐트(audio-visual content)를 찾는 데 도움을 준다.The MPEG standard uses DCT and Huffman coding methods along with interframe coding techniques used to provide better compression ratios. MPEG-1 and MPEG-2 are used for low resolution image sequences and high resolution sequences, respectively. MPEG-4 focuses on integrated audio-visual objects and scenes rather than frames. MPEG-7 helps to find audio-visual content.

상기 압축 기술들은 DCT 변환 및 이에 이어 DCT 계수들의 양자화 및 가변길이 부호화를 이용하여 데이터 압축을 달성한다. DCT 계수들을 양자화하므로 이들 압축 기술들은 손실 압축이 된다. 전술한 바와 같이, 손실 압축 방식은 압축 해제된 데이터가 원 데이터의 정확한 복제가 아니라는 것이다. 예를 들면, JPEG 또는 MPEG 압축 방식에서, 캐스케이드 압축에서의 각 압축 단은 사실상 손실 압축이다. 또한, 복수의 캐스케이드 압축을 수행하는 것은 추가 손실을 도입한다.The compression techniques achieve data compression using DCT transform followed by quantization of the DCT coefficients and variable length coding. These compression techniques are lossy compression because they quantize the DCT coefficients. As mentioned above, the lossy compression scheme is that the decompressed data is not an exact copy of the original data. For example, in the JPEG or MPEG compression scheme, each compression stage in cascade compression is in fact lossy compression. In addition, performing multiple cascade compressions introduces additional losses.

이 추가 손실을 설명하기 위해서, 도 1을 참조한다. 도 1은 2 개의 손실 압축 시나리오(a), (b)를 도시한 것이다. 시나리오(a)에서, 소스 데이터(10)는 압축 시스템(11)에 의해 20:1의 비로 압축된다. 시나리오(b)에서, 소스 데이터(10)는 먼저 10:1로 제 1 단 압축되고 이어서 캐스케이드 압축 시스템(12)에 의해 2:1로 제 2 단 압축된다. 시나리오(b)에서, 제 2 압축 단은 원 소스 데이터(10)에 액세스하는 것이 아니라, 제 1 압축 단의 출력으로서 얻어진 압축된 신호에만 액세스한다. 그럼에도 불구하고, 이들 시나리오(a), (b)는 동일한 20:1의 압축을 달성한다. 그러나, 시나리오(b)에서 도입된 평균 제곱 에러(Mean Square Error; MSE)는 캐스케이드 압축으로 인해 항상 시나리오(a)보다 크거나 같을 것이다. 이러한 추가 에러는 부분적으로는 제 2 또는 더 높은 레벨의 압축 단들에서 양자화기(quqntizer) 값들의 선택에 기인한다.To illustrate this additional loss, reference is made to FIG. 1. Figure 1 shows two lossy compression scenarios (a) and (b). In scenario (a), source data 10 is compressed by compression system 11 in a ratio of 20: 1. In scenario (b), source data 10 is first compressed first to 10: 1 and then secondly compressed to 2: 1 by cascade compression system 12. In scenario (b), the second compression stage does not access the original source data 10 but only the compressed signal obtained as the output of the first compression stage. Nevertheless, these scenarios (a) and (b) achieve the same 20: 1 compression. However, the mean square error (MSE) introduced in scenario (b) will always be greater than or equal to scenario (a) due to cascade compression. This additional error is due in part to the selection of quantizer values at the second or higher level compression stages.

일부 종래의 캐스케이드 압축 시스템들은 연속적인 압축 생성(generation)들을 위해 비축퇴(non-degenerative) 세트의 양자화 팩터들을 이용한다. 이 세트는 이를테면 q 값 = K*(3**n)과 같은 수치적인 관계들을 이용하여 선택된다. 양자화기들 중 하나가 제 1 생성에서 이용된다면, 다른 양자화기들 중 어느 하나는 후속하는 생성에서 이용될 수 있다. 그러나, 이들 시스템들은 캐스케이드 양자화 에러(cascading quantization error)를 최소화할 후속되는 생성에서의 양자화기들 선택방법에 어떠한 통찰력도 제공하지 않는다. 따라서, 캐스케이드 압축에서 도입되는 MSE를 최소화하는, 특히 제 2 또는 더 높은 단들의 종속 접속 압축 방식들에서(the second or higher order stages of the cascaded conpression schemes) 양자화기들의 적합한 선택에 의해 손실을 최소화하는 향상된 시스템들, 방법들 및 기술들의 필요성이 존재한다.
WO 98/38800은 MPEG-2에 대한 캐스케이드 코딩 시스템을 개시한다. 제 1 단 양자화기의 양자화 레벨은, 평균 제곱 에러가 감소되도록 제 2 단 양자화기의 2 개의 가능한 표현 레벨들 중 하나로 매핑된다. Some conventional cascade compression systems use a non-degenerative set of quantization factors for successive compression generations. This set is selected using numerical relationships such as q value = K * (3 ** n). If one of the quantizers is used in the first generation, then one of the other quantizers can be used in the subsequent generation. However, these systems do not provide any insight into how to select quantizers in subsequent generations that will minimize cascading quantization error. Thus, minimizing the MSE introduced in cascade compression, in particular the second or higher order stages of the cascaded conpression schemes, minimizes losses by proper selection of quantizers. There is a need for improved systems, methods and techniques.
WO 98/38800 discloses a cascade coding system for MPEG-2. The quantization level of the first stage quantizer is mapped to one of two possible representation levels of the second stage quantizer such that the mean squared error is reduced.

<발명의 요약>Summary of the Invention

본 발명의 목적은 전술한 종래의 캐스케이드 압축 시스템의 한계들을 해결하는 것이다.It is an object of the present invention to solve the above limitations of the conventional cascade compression system.

본 발명의 또 다른 목적은, 주어진 한 쌍의 양자화기들에 대해 캐스케이드 압축에 의해 도입되는 에러를 계산하는 방법을 제공하는 것이다.It is yet another object of the present invention to provide a method for calculating the error introduced by cascade compression for a given pair of quantizers.

본 발명의 또 다른 목적은, 적합한 양자화기 선택에 의해 캐스케이드 압축에 의해 도입된 손실을 최소화하는 방법을 제공하는 것이다.It is another object of the present invention to provide a method of minimizing the losses introduced by cascade compression by selecting the appropriate quantizer.

바람직한 일 실시예는 JPEG 및 MPEG 압축 방식들에서 도입되는 양자화 에러를 감소시키는 것에 관한 것이다.
본 발명에 따라, 청구항 제 1 항에 따른 방법, 제 9 항에 따른 메모리 매체, 제 13 항에 따른 장치 및 제 17 항에 따른 시스템이 제공된다. 바람직한 실시예들은 종속항들에서 규정된다. One preferred embodiment relates to reducing the quantization error introduced in JPEG and MPEG compression schemes.
According to the invention there is provided a method according to claim 1, a memory medium according to claim 9, an apparatus according to claim 13 and a system according to claim 17. Preferred embodiments are defined in the dependent claims.

본 발명의 한 특징은, 상기 캐스케이드 압축 시스템의 제 2 또는 더 높은 단에 대한 적어도 2 개의 다른 양자화기들에 대해 상기 제 2 또는 더 높은 단에 의해 도입되는 예상된 양자화 에러(quantization error)를 결정하는 단계 및 상기 적어도 2 개의 양자화기들의 상기 예상된 양자화 에러를 비교하는 단계를 포함하는 캐스케이드 압축 시스템용 방법에 관한 것이다. 이 방법은 또한 비교 결과에 따라 양자화기들 중 하나를 선택하여 캐스케이드 압축 시스템에 대해 예상된 양자화 에러를 최소화하는 단계를 포함한다.One feature of the present invention is to determine an expected quantization error introduced by the second or higher stage for at least two other quantizers for the second or higher stage of the cascade compression system. And comparing the expected quantization error of the at least two quantizers. The method also includes selecting one of the quantizers according to the comparison result to minimize the expected quantization error for the cascade compression system.

본 발명의 바람직한 일 실시예에서, 예상된 양자화 에러를 결정하기 위해 확률 분포 함수가 이용된다.In one preferred embodiment of the present invention, a probability distribution function is used to determine the expected quantization error.

본 발명의 또 다른 특징은 상기 방법을 실행하는 메모리 매체, 장치 및 시스템에 관한 것이다.Another aspect of the invention relates to a memory medium, an apparatus and a system for performing the method.

이들 및 다른 실시예와 본 발명의 특징들은 다음 상세한 설명에서 예시된다.
본 발명의 특성들 및 장점들이 이하의 도면들을 취하여 설명되는 바람직한 실시예들의 상세한 설명을 참조하여 이해될 수 있다. These and other embodiments and features of the invention are illustrated in the following detailed description.
The features and advantages of the present invention can be understood with reference to the detailed description of the preferred embodiments described by taking the following figures.

도 1은 비캐스케이드 압축 시스템 및 캐스케이드 압축 시스템의 개략도.1 is a schematic diagram of a non-cascade compression system and a cascade compression system.

도 2는 캐스케이드 압축 시스템 내 양자화기들을 도시한 도면.2 shows quantizers in a cascade compression system.

도 3은 캐스케이드 압축 시스템에서 양자화 에러를 도시한 도면.3 illustrates quantization error in a cascade compression system.

도 4는 본 발명의 한 특징에 따른 예시적인 컴퓨터 시스템의 블록도.4 is a block diagram of an exemplary computer system in accordance with an aspect of the present invention.

도 2에 도시한 도면을 참조하여 캐스케이드 압축에 의해 도입되는 추가 MSE를 설명한다. 도 2(a)는 스텝 사이즈 Q₁의 균일한 양자화기에 대한 재구성 포인트들 및 결정 경계들을 도시한다. 재구성 포인트들은 흑점으로 표시하였고 결정 경계들은 짧은 수직선으로 표시하였다. 제 n 재구성 포인트는 Qⁿ ₁(도시없음)에 위치하여 있고 Qⁿ ₁의 어느 일측의 결정 경계들은 Dⁿ ₁ 및 D^n-1 ₁(도시없음)에 위치하여 있다. 결정 경계들은 두 개의 연속한 재구성 포인트들 사이의 대략 중간 지점에 놓여 있다. 스텝 사이즈Q₁의 균일한 양자화기에 있어서, 재구성 포인트들은 Q₁의 배수가 되는 지점에 놓여있다. 두 개의 결정 경계들 사이에 드는 입력 신호 소스의 어떠한 값이든 두 개의 결정 경계들 사이에 놓인 재구성 포인트로 양자화된다.The additional MSE introduced by cascade compression is described with reference to the figure shown in FIG. 2 (a) shows the reconstruction points and the crystal boundaries for a uniform quantizer of step size Q ₁ . Reconstruction points are indicated by sunspots and crystal boundaries are indicated by short vertical lines. The nth reconstruction point is located at Q ⁿ ₁ (not shown) and the crystal boundaries on either side of Q ⁿ ₁ are located at D ⁿ ₁ and D ^n-1 ₁ (not shown). The crystal boundaries lie at approximately midpoint between two consecutive reconstruction points. For a uniform quantizer of step size Q ₁ , the reconstruction points lie at points that are multiples of Q ₁ . Any value of the input signal source that falls between the two decision boundaries is quantized with a reconstruction point that lies between the two decision boundaries.

양자화기(Q₁)는 캐스케이드 압축의 제 1 단(a)에서 이용되고, 양자화기(Q₂)는 압축의 제 2 단(b)에서 이용된다. 단일 압축 단에 있어서는, 예를 들면, 양자화기(Q₂)가 이용된다. 이 실시예에서, 양자화기(Q₂)의 스텝 사이즈는 양자화기(Q₁)보다 큰 것으로 도시되었다. 양자화기의 스텝 사이즈가 클수록, 더 많은 손실이 도입되지만 더 높은 압축비가 달성될 수 있음에 유의한다. 바람직하게, MPEG I-프레임들 및 JPEG 압축 방식들에서 이용되는 것과 같은 균일한 양자화기들(uniform quantizers)이 이용된다. 그러나, 다른 스텝 사이즈들 및 비균일 양자화기들도 이용될 수 있다.Quantizer Q ₁ is used in the first stage (a) of cascade compression, and quantizer Q ₂ is used in the second stage (b) of compression. In a single compression stage, for example, a quantizer Q ₂ is used. In this embodiment, the step size of quantizer Q ₂ is shown to be larger than quantizer Q ₁ . Note that the larger the step size of the quantizer, the more loss is introduced but a higher compression ratio can be achieved. Preferably uniform quantizers such as those used in MPEG I-frames and JPEG compression schemes are used. However, other step sizes and nonuniform quantizers may also be used.

도 2에서, x는 양자화될 입력 소스 신호의 값을 나타낸다. x가 [Q⁰ ₂, D⁰ ₂)의 범위 내에 있으면, 단일 단의 양자화기(이 경우 단(b)만을 이용함)에 의해서, 이 값은 Q⁰ ₂로 양자화될 것이다. "[" 기호는 값이 범위 내에 포함됨을 나타내며 ")" 기호는 값이 범위에 포함되지 않음을 나타낸다. 2 단 양자화기((a) 및 (b))의 경우에, 제 1 단 양자화기의 출력은 x가 [0, D⁰ ₁)의 범위 내에 있으면 Q⁰ ₁이 되고, [D⁰ ₁, D⁰ ₂)의 범위 내에 있으면 출력은 Q¹ ₁이 된다. 이어서 단(a)으로부터의 출력은 Q⁰ ₁에서 Q⁰ ₂, 및 Q¹ ₁에서 Q¹ ₂로 양자화하는 제 2 양자화기 단(b)을 통과하게 된다. 그러므로, 캐스케이드 압축에 의해, [0, D⁰ ₁) 범위 내 x 값들은 Q⁰ ₂로 양자화되고, [D⁰ ₁, D⁰ ₂) 범위 내 x 값은 Q¹ ₂로 양자화된다. 따라서 [D⁰ ₁, D⁰ ₂) 범위 내 x 값들은 단일 단의 양자화기에 비해 캐스케이드 압축의 이용으로 보다 큰 평균 제곱 에러를 갖고 (부정확하게) 양자화된다. 마찬가지로, [D¹ ₁, D¹ ₂) 범위 내 임의의 x 값은, 단일 단의 양자화기이면 이들 값들을 Q¹ ₂로 양자화할 것이지만, 2 단 양자화기들에 의해 Q² ₂의 값으로 부정확하게 양자화될 것임을 볼 수 있다. 이러한 추가 에러는 캐스케이드 압축의 직접적인 결과이다.In Figure 2, x represents the value of the input source signal to be quantized. If x is in the range of [Q ⁰ ₂ , D ⁰ ₂ ), this value will be quantized to Q ⁰ ₂ by a single stage quantizer (in this case using only stage b). The "[" symbol indicates that the value is within the range and the ")" symbol indicates that the value is not within the range. In the case of the two-stage quantizer ((a) and (b)), the output of the first-stage quantizer is Q ⁰ ₁ if x is within the range of [0, D ⁰ ₁ ), and [D ⁰ ₁ , D If it is in the range of ⁰ ₂ ), the output becomes Q ¹ ₁ . The output from stage a is then passed through a second quantizer stage b that quantizes from Q ⁰ ₁ to Q ⁰ ₂ and Q ¹ ₁ to Q ¹ ₂ . Therefore, by cascade compression, x values in the range [0, D ⁰ ₁ ) are quantized to Q ⁰ ₂ and x values in the range [D ⁰ ₁ , D ⁰ ₂ ) to Q ¹ ₂ . Thus, x values in the range [D ⁰ ₁ , D ⁰ ₂ ) are quantized (incorrectly) with a larger mean squared error with the use of cascade compression compared to a single stage quantizer. Likewise, any x value in the range [D ¹ ₁ , D ¹ ₂ ) will quantize these values to Q ¹ ₂ if it is a single stage quantizer, but incorrectly to the value of Q ² ₂ by the two stage quantizers. It can be seen that it will be quantized. This additional error is a direct result of cascade compression.

이러한 추가 에러를 감소 및/또는 제거하기 위해서, 양자화기들 Q₁ 및 Q₂이 주어졌을 때, 부정확하게 양자화되는 x의 값의 범위들이 먼저 결정되어야 한다. 이에 대해서, Dⁿ ₂에서 Q₂ 양자화기에 대한 특정의 결정경계를 고려한다. 이러한 결정 경계에 있어서, 양자화기(Q₁)의 2 개의 가장 가까운 결정 경계들이 도 3a 및 도 3b에 도시한 바와 같이 위치하여 있으며, 이들 중 하나는 Dⁿ ₂보다 큰 D^m ₁이고 다른 하나는 Dⁿ ₂보다 작은 D^m-1 ₁이다. In order to reduce and / or eliminate this additional error, given quantizers Q ₁ and Q ₂ , the ranges of values of x that are incorrectly quantized must first be determined. In this regard, consider a particular crystal boundary for D ⁿ ₂ to Q ₂ quantizers. In this crystal boundary, the two nearest crystal boundaries of the quantizer Q ₁ are located as shown in FIGS. 3A and 3B, one of which is D ^m ₁ greater than D ⁿ ₂ and the other D ⁿ is smaller than ₁ D ^m-1 _2.

실례로, 다음의 경우를 고려한다.As an example, consider the following case.

경우 1(도 3a):

Case 1 (FIG. 3A):

단일 단 Q₂ 양자화기는 x를 Qⁿ⁺¹ ₂로 양자화할 것이다. 두 개의 단의 양자화기(Q₁, Q₂)에서, Q₁ 양자화기는 x를 Q^m ₁으로 양자화할 것이며 다음 Q₂ 양자화기는 x를 Qⁿ ₂로 양자화할 것이므로, 이에 의해 추가 양자화 에러가 도입된다.A single stage Q ₂ quantizer will quantize x to Q ^{n + 1} ₂ . In the two stages of quantizers Q ₁ , Q ₂ , the Q ₁ quantizer will quantize x to Q ^m ₁ and the next Q ₂ quantizer will quantize x to Q ⁿ ₂ , thereby introducing additional quantization errors. do.

경우 2(도 3b):

Case 2 (Figure 3b):

단일 단 Q₂ 양자화기는 x를 Qⁿ ₂로 양자화할 것이다. 두 개의 단의 양자화기(Q₁, Q₂)에서, Q₁ 양자화기는 x를 Q^m ₁으로 양자화할 것이며, 다음 Q₂ 양자화기는 x를 Qⁿ⁺¹ ₂로 양자화할 것이므로, 이에 의해 추가 양자화 에러가 도입된다.A single stage Q ₂ quantizer will quantize x to Q ⁿ ₂ . In the two stage quantizers Q ₁ , Q ₂ , the Q ₁ quantizer will quantize x to Q ^m ₁ , and the next Q ₂ quantizer will quantize x to Q ^{n + 1} ₂ , thereby further quantization. An error is introduced.

부정확하게 양자화될 x 값들의 모든 범위들을 확인하기 위해서, Q₂의 모든 결정 경계들에 대해 상기 계산이 반복되어야 한다. 그러나, 예를 들면 JPEG 압축에서 이용되는 균일 양자화기들의 경우, 상기 계산은 0 내지 LCM(Q₁, Q₂)의 범위에서만 Q₂의 모든 결정 경계들에 대해 수행될 필요가 있을 뿐이다. LCM(Q₁, Q₂)은 두 개의 수 Q₁ 및 Q₂의 최소 공배수이다. 이 계산으로부터 얻어진 결과는 범위 [0, LCM(Q₁, Q₂)] 밖에 놓인 x값의 범위들을 얻는데 이용되고 부정확하게 양자화된다. 알 수 있는 바와 같이, 계산은 균일한 양자화기의 경우에 훨씬 간단하다.In order to identify all ranges of x values to be incorrectly quantized, the calculation must be repeated for all decision boundaries of Q ₂ . However, for uniform quantizers used in JPEG compression, for example, the calculation only needs to be performed for all crystal boundaries of Q ₂ in the range of 0 to LCM (Q ₁ , Q ₂ ). LCM (Q ₁ , Q ₂ ) is the least common multiple of two numbers Q ₁ and Q ₂ . The result obtained from this calculation is used to obtain ranges of x values that lie outside the range [0, LCM (Q ₁ , Q ₂ )] and are quantized incorrectly. As can be seen, the calculation is much simpler in the case of a uniform quantizer.

배경 내 화소값들의 각각의 조합에 있어서, 흑백 화소들의 확률 분포는 서로 다를 수 있다. 예를 들면, 모두 백색의 배경에서, 백 화소가 부호화될 확률은 흑 화소를 부호화할 경우보다 훨씬 클 것이다. 입력 소스 신호(x)에 대한 확률 분포 f(x)가 주어졌을 때, 양자화기들(Q₁ 및 Q₂)을 이용한 캐스케이드 양자화에 의해 도입되는 예상된 양자화 에러는 다음과 같이 계산될 수 있다.For each combination of pixel values in the background, the probability distribution of the black and white pixels may be different. For example, on an all white background, the probability that white pixels are encoded will be much greater than when black pixels are encoded. Given the probability distribution f (x) for the input source signal x, the expected quantization error introduced by cascade quantization using quantizers Q ₁ and Q ₂ can be calculated as follows.

기호 ξ는 상기 결정된 바와 같이 부정확하게 양자화되는 x값들의 모든 범위들을 포함하는 집합을 나타낸다.The symbol ξ represents a set containing all ranges of x values that are incorrectly quantized as determined above.

상기 식에서 양자화 에러의 계산은 제 2 단의 연속 계산에서 적합한 양자화기들을 선택하는 데 이용된다. 예를 들면, 양자화기(Q₁)가 제 1 단에서 이용되고 제 2 단에 대해 두 개의 가능한 양자화기들(Q₂, Q'₂)이 있다고 가정할 때, 이들 양자화기들에 대해 양자화 에러들이 계산될 수 있다. 이들 두 개의 가능한 양자화기들은 테스트 양자화기들이다. 최소 양자화 에러값은 양자화기의 가장 적합한 선택, 즉 Q₂또는 Q'₂을 결정하는데 이용된다.The calculation of the quantization error in the above equation is used to select suitable quantizers in the continuous calculation of the second stage. For example, assuming that quantizer Q ₁ is used in the first stage and there are two possible quantizers Q ₂ , Q ' ₂ for the second stage, quantization errors for these quantizers Can be calculated. These two possible quantizers are test quantizers. The minimum quantization error value is used to determine the most suitable choice of quantizer, namely Q ₂ or Q ' ₂ .

양자화기(Q₂)가 양자화 에러 E(Q₂)+E(Q₁, Q₂)를 갖는 비트 레이트 r(레이트가 클수록, 압축이 낮아진다)를 제공하고, 양자화기(Q'₂)가 양자화 에러 E(Q'₂)+E(Q₁, Q'₂)를 갖는 레이트 r'를 제공할 것으로 예상되면, 양자화 에러에 대한 레이트의 비는 양자화기의 선택에서 척도로서 이용될 수 있다. 여기서 E(Q₂) 및 E(Q'₂)는 양자화기에 의해 본연히 발생되는 양자화 에러이고 캐스케이드 양자화에 의해 야기되는 추가 에러와는 관계없다.A quantizer (Q _2), the quantization error _{E (Q 2) + E (} Q 1, Q 2) provides a bit rate r (The higher the rate, the lower the compression) having a, and a quantizer (Q _'2) are quantized If expected to provide a rate r 'with an error E (Q' ₂ ) + E (Q ₁ , Q ' ₂ ), the ratio of rate to quantization error can be used as a measure in the selection of the quantizer. Where E (Q ₂ ) and E (Q ' ₂ ) are the quantization errors inherently generated by the quantizer and are independent of the additional errors caused by cascade quantization.

물론, 전술한 Q₂ 또는 Q'₂ 이외의 양자화기들이 시작 점으로서 이용될 수도 있다. 양자화기들의 선택은 부분적으로는 원하는 전체 압축 비에 기초한다. 또한 초기 양자화기들을 선택함에 있어 일부 시행 착오가 이용될 수도 있다. 몇 개의 양자화기들에 대한 양자화 에러가 전술한 바와 같이 계산될 수 있고, 따라서 가장 적합한 양자화기가 선택된다.Of course, quantizers other than Q ₂ or Q ′ ₂ described above may be used as the starting point. The choice of quantizers is based in part on the overall desired compression ratio. Also, some trial and error may be used in selecting early quantizers. The quantization error for several quantizers can be calculated as described above, so the most suitable quantizer is selected.

전술한 바와 같이, 본 발명의 일 실시예는 JPEG 및 MPEG 압축 방식들을 이용하는 응용들에 관한 것이다. 이들 압축 방식들은 입력 소스 데이터를, DCT 변환되어 64개의 DCT 계수들로 되는 8 x 8 크기의 연속한 블록들로 공간적으로 분할한다. 이에 이어 DCT 계수들이 양자화된다. DC 계수는 차분 부호화된다. 63개의 나머지 AC 계수들은 제로 계수들의 런-길이를 명시한 후 다음 비-제로(non-zero) 계수값을 부호화함으로써 부호화된다.As mentioned above, one embodiment of the present invention relates to applications using JPEG and MPEG compression schemes. These compression schemes spatially partition the input source data into successive blocks of size 8 × 8 that are DCT transformed into 64 DCT coefficients. Following this, the DCT coefficients are quantized. DC coefficients are differentially coded. The 63 remaining AC coefficients are encoded by specifying the run-length of the zero coefficients and then encoding the next non-zero coefficient value.

JPEG의 경우, 양자화 테이블의 엔트리들은 서로 상이한 DCT 계수들에 대해 이용되는 양자화기를 결정한다. 서로 상이한 대역들(예를 들면, 휘도 및 크로미넌스)에 대해 서로 상이한 양자화 테이블들을 이용할 수 있지만, 단일 대역에 대해서는 양자화 테이블들은 고정되어 있다. 본 발명의 양자화 선택 방법을 이미 JPEG 압축된 데이터를 재압축하는 데 적용하기 위해서, 각각의 DCT 계수에 대한 확률 분포 f(x)를 알아야 한다. 실험적으로 AC DCT 계수들의 분포는 라플라시안 분포를 따르는 것으로 알려졌다. 라플라시안 분포에 연관된 파라미터는 상이한 DCT 계수들에 대해 서로 다름에 유의한다. 이 파라미터는 추정될 수 있거나, 이를테면 레일리(Rayliegh) 또는 가우시안과 같은 다른 분포가, 사용가능한 압축 데이터로부터 얻어질 수 있다.In the case of JPEG, the entries in the quantization table determine the quantizer used for the different DCT coefficients. Different quantization tables can be used for different bands (eg, luminance and chrominance), but for a single band the quantization tables are fixed. In order to apply the quantization selection method of the present invention to recompressing already compressed JPEG data, the probability distribution f (x) for each DCT coefficient must be known. Experimentally it is known that the distribution of AC DCT coefficients follows the Laplacian distribution. Note that the parameters associated with the Laplacian distribution are different for different DCT coefficients. This parameter can be estimated, or other distributions, such as Rayliegh or Gaussian, can be obtained from the available compressed data.

라플라시안 분포는 DCT 부호화된 이미지 및 MPEG 에러 항들의 휘도 및 크로미넌스 채널들에 모두 이용될 수 있음을 알아야 한다. MPEG 압축 방식은 화상 정보만이 아니라 에러항들을 부호화하기 위해 DCT를 이용한다. 에러항들은 MPEG 움직임 보상 알고리즘으로부터 얻어진다. 에러항은 시퀀스 내 또다른 화상 상의 블록에서 이미지 블록을 감하고 그 차에 DCT를 적용함으로써 얻어진다. 이에 따라 화상은 이미지들 내에 소수의 변화들만이 있다면 보다 적은 비트들을 이용하여 부화화될 수 있다.It should be noted that the Laplacian distribution can be used for both luminance and chrominance channels of the DCT coded image and MPEG error terms. The MPEG compression scheme uses DCT to encode error terms as well as picture information. Error terms are obtained from the MPEG motion compensation algorithm. The error term is obtained by subtracting an image block from a block on another picture in the sequence and applying a DCT to the difference. Thus the picture can be hatched using fewer bits if there are only a few changes in the images.

또한, MPEG 경우, 바람직한 실시예는 MPEG 포맷에서 I-(인트라 부호화된)프레임들에 중점을 둔다. I-프레임들은 다른 화상들을 참조함이 없이 인트라블록들만으로 구성된다. 이들 프레임들은 시퀀스 내 랜덤 액세스 포인트들로서 작용할 수 있다. P-(예측 부호화) 및 B-(양방향 보간)프레임과 같은 다른 MPEG 프레임 유형들도 또한 이용될 수 있다.In addition, in the MPEG case, the preferred embodiment focuses on I- (intra-encoded) frames in the MPEG format. I-frames are composed of only intrablocks without referring to other pictures. These frames can act as random access points in a sequence. Other MPEG frame types may also be used, such as P- (predictive encoding) and B- (bidirectional interpolation) frames.

예를 들면, MPEG 비디오에서 각각의 (I) 프레임에 대해, 양자화기 값은 양자화기_스케일 및 양자화 테이블에 의해 결정된다. 서로 상이한 양자화 테이블들이 크로미넌스 및 휘도에 이용될 수 있다. 양자화 테이블들은 각 프레임에 대해 고정되어 있으나, 양자화기_스케일은 각각의 매크로블록마다 변경될 수 있다. 일 실시예에서, 전술한 양자화기 선택 방법들을 이용해서 각각의 MPEG 프레임 매크로블록에 대해 양자화기_스케일을 선택한다. 그러나, 양자화기_스케일은 각각의 DCT 계수마다 변경될 수 없음에 유의한다. 이 값은 전체 매크로블록에 대해 고정되어 있다. 사람의 시각 체계는 저주파 계수들의 양자화 에러들에 대해 더욱 민감하기 때문에, 선택된 저주파 계수들의 평균 양자화 에러를 최소화하기 위해서 바람직하게 양자화_스케일이 선택된다.For example, for each (I) frame in MPEG video, the quantizer value is determined by quantizer_scale and quantization table. Different quantization tables can be used for chrominance and luminance. The quantization tables are fixed for each frame, but the quantizer_scale may be changed for each macroblock. In one embodiment, quantizer_scale is selected for each MPEG frame macroblock using the quantizer selection methods described above. However, note that the quantizer_scale cannot be changed for each DCT coefficient. This value is fixed for the entire macroblock. Since the human visual system is more sensitive to quantization errors of low frequency coefficients, quantization scale is preferably selected to minimize the average quantization error of the selected low frequency coefficients.

도 4는 본 발명이 구현될 수 있는 비디오/이미지 처리 시스템(20)을 도시한 것이다. 예로서, 시스템(20)은 텔레비전, 셋탑 박스, 데스크탑, 랩탑 또는 팜탑 컴퓨터, 개인용 디지털 보조기(PDA), 비디오 카세트 레코더(VCR), 디지털 비디오 레코더(DVR), TiVO 장치 등의 비디오/이미지 저장 장치, 및 이들의 일부 또는 조합 및 그 외의 장치들을 의미할 수 있다. 시스템(20)은 하나 이상의 비디오/이미지 소스들(22), 하나 이상의 입력/출력 장치들(24), 프로세서(25) 및 메모리(26)을 포함한다. 비디오/이미지 소스(들)(22)은 예를 들면 텔레비전 수신기, VCR 또는 그 외 비디오/이미지 저장 장치를 의미할 수 있다. 대안적으로, 소스(들)(22)는 이를테면 인터넷, 광역 네트워크, 대도시 지역 네트워크, 로컬 영역 네트워크, 지상 방송 시스템, 케이블 네트워크, 위성 네트워크, 무선 네트워크, 또는 전화 네트워크, 및 이들의 일부 또는 조합 및 그 외 네트워크 유형 등의 글러벌 컴퓨터 통신 네트워크를 통해 서버 또는 서버들로부터 비디오/이미지들을 수신하는 하나 이상의 네트워크 접속들을 의미할 수 있다.4 illustrates a video / image processing system 20 in which the present invention may be implemented. By way of example, system 20 may be a video / image storage device such as a television, set-top box, desktop, laptop or palmtop computer, personal digital assistant (PDA), video cassette recorder (VCR), digital video recorder (DVR), TiVO device, or the like. And some or a combination thereof and other devices. System 20 includes one or more video / image sources 22, one or more input / output devices 24, a processor 25, and a memory 26. Video / image source (s) 22 may refer to, for example, a television receiver, a VCR or other video / image storage device. Alternatively, the source (s) 22 may comprise, for example, the Internet, wide area networks, metropolitan area networks, local area networks, terrestrial broadcast systems, cable networks, satellite networks, wireless networks, or telephone networks, and some or combinations thereof, and Other may refer to one or more network connections that receive video / images from a server or servers via a global computer communication network, such as a network type.

입력/출력 장치들(24), 프로세서(25) 및 메모리(26)는 통신 매체(27)를 통해 통신한다. 통신 매체(27)는 버스, 통신 네트워크, 회로의 하나 이상의 내부 접속들, 회로 카드 또는 그 외 장치, 및 이들의 일부 및 조합과 그 외의 통신 매체를 의미할 수 있다. 소스(들)(22)로부터 입력 비디오/이미지들은 메모리(26) 내 저장된 하나 이상의 소프트웨어 프로그램들에 따라 처리되어 프로세서(25)에 의해 실행됨으로써, 텔레비전 디스플레이, 컴퓨터 모니터 등의 디스플레이 장치(28)에 공급되는 출력 비디오/이미지들을 발생한다.Input / output devices 24, processor 25 and memory 26 communicate via communication medium 27. Communication medium 27 may mean a bus, a communication network, one or more internal connections of a circuit, a circuit card or other device, and some and combinations thereof and other communication media. The input video / images from the source (s) 22 are processed in accordance with one or more software programs stored in the memory 26 and executed by the processor 25 to thereby display the display device 28, such as a television display, computer monitor, or the like. Generates output video / images to be supplied.

바람직한 실시예에서, 캐스케이드 압축 및 적합한 양자화기들의 선택에 기인한 예상된 양자화 에러의 계산은 시스템(20)에 의해 실행되는 컴퓨터로 읽을 수 있는 코드로 구현된다. 코드는 메모리(26)에 저장될 수 있으며 또는 CD-ROM이나 플로피 디스크와 같은 메모리 매체로부터 판독/다운로드될 수 있다. 다른 실시예에서, 하드웨어 회로는 본 발명을 구현하는 소프트웨어 명령들 대신에, 또는 이와 조합하여 이용될 수 있다.In a preferred embodiment, the calculation of expected quantization error due to cascade compression and the selection of suitable quantizers is implemented in computer readable code executed by system 20. The code may be stored in memory 26 or may be read / downloaded from a memory medium such as a CD-ROM or floppy disk. In other embodiments, hardware circuitry may be used in place of or in combination with software instructions that implement the invention.

도 4에 도시한 바와 같은 시스템(20)의 특정 구성은 단지 예임을 알아야 한다. 이 기술에 숙련된 자들은 본 발명은 광범위한 대안 시스템 구성을 이용하여 구현될 수 있음을 알 것이다.It should be noted that the specific configuration of the system 20 as shown in FIG. 4 is merely an example. Those skilled in the art will appreciate that the present invention can be implemented using a wide variety of alternative system configurations.

본 발명을 구체적인 실시예를 통해 기술하였지만, 본 발명은 여기 개시된 실시예로 제한되거나 한정되도록 의도된 것이 아님을 알아야 한다. 예를 들면, 본 발명은 임의의 구체저인 압축 방식, 프레임 유형 또는 확률 분포로 제한되지 않는다. 반대로, 본 발명은 첨부된 청구항의 정신 및 범위 내에 포함되는 여러 가지 구성 및 수정을 포괄하도록 의도된 것이다.Although the present invention has been described in terms of specific embodiments, it should be understood that the present invention is not intended to be limited or limited to the embodiments disclosed herein. For example, the present invention is not limited to any embodiment, compression scheme, frame type or probability distribution. On the contrary, the invention is intended to cover various modifications and variations that fall within the spirit and scope of the appended claims.

Claims

In a method for a cascaded compression system (12):

Determining an expected quantization error introduced by the second or higher stage for at least two other quantizers for the second or higher stage of the cascade compression system;

Comparing the expected quantization error of the at least two quantizers; And

Selecting one of the at least two quantizers according to a result of the comparison.

The method of claim 1,

And the expected quantization error is determined using a probability distribution function.

The method of claim 2,

And the probability distribution function is a Laplacian distribution.

The method of claim 2,

And said selecting step selects said one quantizer in accordance with a ratio of an expected bit rate and said expected quantization error.

The method of claim 2,

The probability distribution function is determined according to the input data (10) to be compressed.

The method of claim 1,

Wherein the at least two quantizers are determined according to the required compression ratio of the cascade compression system.

The method of claim 1,

The method is used in at least one of a JPEG or MPEG system.

The method of claim 1,

The determining step includes determining a set of ranges for values of input data to be incorrectly quantized by the cascade compression system.

A memory medium containing code for the cascade compression system 12,

The code is:

Decision code for calculating an expected quantization error introduced by the second or higher stage for at least two other quantizers for the second or higher stage of the cascade compression system;

Comparison code for comparing the expected quantization error of the at least two quantizers; And

And a selection code allowing selection of one of the at least two quantizers according to a result of the comparison code.

The method of claim 9,

The method of claim 10,

Wherein the probability distribution function is a Laplacian distribution.

The method of claim 9,

And the code is configured for use in at least one of a JPEG or MPEG system.

In the cascade compression device 20:

A memory 26 that stores executable code; And

(i) determine an expected quantization error introduced by the second or higher stage for at least two other quantizers for the second or higher stage of the cascade compression system, and (ii) the at least two quantizations A processor 25 for executing the code stored in the memory 26 to compare the expected quantization error of the groups and (iii) select one of the at least two quantizers according to the result of the comparison. Cascade compression apparatus 20, including.

The method of claim 13,

Wherein the expected quantization error is determined using a probability distribution function.

The method of claim 14,

The probability distribution function is a Laplacian distribution.

The method of claim 13,

Wherein the device is configured for use in at least one of a JPEG or MPEG system.

In the cascade compression system 12:

Means (25, 26) for determining an expected quantization error introduced by the second or higher stage for at least two other quantizers for the second or higher stage of the cascade compression system;

Means (25, 26) for comparing the expected quantization error of the at least two quantizers; And

A cascade compression system (12) comprising means (25, 26) for selecting one of the at least two quantizers according to the result of the comparison.