KR100240655B1

KR100240655B1 - Extraction method of compressed image content

Info

Publication number: KR100240655B1
Application number: KR1019970053937A
Authority: KR
Inventors: 유성준; 박동권; 장호욱; 원치선; 한희일; 이의택
Original assignee: 정선종; 한국전자통신연구원
Priority date: 1997-10-21
Filing date: 1997-10-21
Publication date: 2000-01-15
Also published as: KR19990032789A

Abstract

본 발명은 압축된 영상 정보의 내용 추출 방법에 관한 것으로, 영상 정보가 여러 가지 변수를 내포하고 있어 특징 정보로 규정하기 위한 알고리즘 및 하드웨어가 복잡해 지고 영상 정보 추출을 위해 많은 계산량이 요구되는 문제점을 해결하기 위하여, 허프만 디코딩 과정 이전의 부호화된 이산적 여현 변환(DCT) 계수로부터 디코딩을 수행하지 않고 직접 영상 정보의 내용을 추출하므로써 보다 빠르게 압추고딘 디지털 영상 정보를 처리할 수 있는 압축된 영상 정보의 내용 추출 방법이 제시된다.The present invention relates to a method for extracting content of compressed video information, which solves a problem in that algorithms and hardware for defining feature information are complicated, and that a large amount of calculation is required for extracting video information. In order to extract the image information directly from the encoded discrete cosine transform (DCT) coefficients prior to the Huffman decoding process, the content of the compressed image information can be processed faster by extracting the image information directly. Extraction methods are presented.

Description

Content Extraction Method of Compressed Image Information

본 발명은 압축된 영상 정보의 내용 추출 방법에 관한 것으로, 특히 허프만 디코딩 과정 이전의 부호화된 이산적 여현 변환(Discrete Cosine Transform ;이하 DCT라 함) 계수로부터 디코딩을 수행하지 않고 직접 영상 정보의 내용을 추출하므로써 빠른 시간 내에 간단한 알고리즘과 하드웨어로 보다 정확한 정보를 얻을 수 있는 압축된 영상 정보의 내용 추출 방법에 관한 것이다.The present invention relates to a method of extracting content of compressed image information, and more particularly, to directly extract content of image information without performing decoding from a coded discrete cosine transform (DCT) coefficient before Huffman decoding. The present invention relates to a method of extracting the content of compressed image information that can be obtained quickly with simple algorithm and hardware by extracting.

일반적으로 TV 및 비디오 데이터 등 대부분의 동영상 정보는 디지털화된 후 MPEG-1 또는 MPEG-2로 압축되어 저장 또는 전송된다. 이때 사용되는 MPEG-1 및 2는 동영상의 저장 전송 및 통신에 관한 압축 표준안으로 확정되어 널리 사용되고 있다.In general, most moving image information such as TV and video data is digitized and then compressed or stored in MPEG-1 or MPEG-2 and stored or transmitted. MPEG-1 and 2 used at this time are confirmed as a compression standard for storage transmission and communication of video and are widely used.

이렇게 저장 또는 수신된 비디오 데이터로부터 영상 정보의 내용을 추출하는 종래의 방법은 다음과 같다. MPEG-1 및 MPEG-2는 DCT를 기본적으로 사용하므로 8×8 블록 단위로 영상 정보를 부호화한다. 따라서 하나의 화상에서 DCT 블록내 영상의 평균 밝기 정보를 구하기 위해서는 DCT 계열 중 블록의 평균 밝기 성분을 갖는 직류(Direct Current ;이하 DC라 함) 계수를 디코딩해야 한다. 즉, 부호화된 DCT의 DC 비트열로부터 비트열 분석(parsing)을 위한 역 VLC(Variable Length Code ;이하 VLC라 함)를 수행한 후 역 허프만(Huffman) 부호화 및 역 차분 펄스 코드 변조(Differential Pulse Code Modulation ;이하 DPCM이라 함) 과정을 거쳐 차분 성분을 계산하므로써 블록의 밝기 정보를 구할 수 있다.The conventional method of extracting the content of the image information from the stored or received video data is as follows. Since MPEG-1 and MPEG-2 basically use DCT, image information is encoded in 8 × 8 block units. Accordingly, in order to obtain average brightness information of an image in a DCT block in one image, a DC (Direct Current) coefficient having an average brightness component of a block among DCT series should be decoded. That is, after performing inverse variable length code (VLC) for parsing the bit stream from the DC bit stream of the encoded DCT, inverse Huffman coding and inverse differential pulse code modulation (Differential Pulse Code) are performed. The brightness information of the block can be obtained by calculating the difference component through the process of Modulation (hereinafter referred to as DPCM).

이러한 영상 정보의 내용 추출 방법은 계산량을 증가시키고 실시간 영상처리를 요구하는 비디오 처리 분야에 적용하기 어려운 단점이 있다. 즉, 정확한 특징 정보를 구하기 위하여 DCT 계수를 복원할 때 하드웨어 및 알고리즘이 복잡해지는 문제점이 있다The content extraction method of image information has a disadvantage in that it is difficult to apply to a video processing field that increases computational amount and requires real-time image processing. That is, there is a problem in that hardware and algorithms become complicated when restoring DCT coefficients to obtain accurate feature information.

따라서, 본 발명은 디코딩 과정을 최소화하고 간단한 하드웨어 및 알고리즘으로 단시간 내에 영상의 특징 정보를 구할 수 있는 영상 정보의 내용 추출 방법을 제공하는데 그 목적이 있다.Accordingly, an object of the present invention is to provide a method of extracting content of image information which can minimize decoding process and obtain feature information of image in a short time with simple hardware and algorithm.

상술한 목적을 달성하기 위한 본 발명에 따른 압축된 영상 정보의 내용 추출 방법은 휘도 신호 정보를 갖는 다수의 직류 계수를 해석하기 위해 가변 길이 비트열을 분석하는 단계와, 상기 가변 길이 코드를 이용하여 휘도 신호에 대한 직류 성분의 크기를 구하는 단계와, 상기 휘도 신호에 대한 직류 성분의 크기를 이용하여 블록간 밝기 세기의 차분값의 범위를 구하는 단계와, 상기 차분값의 범위를 이용하여 영상의 밝기 정보 및 에지 정보를 추출하는 단계를 포함하여 이루어진 것을 특징으로 한다.According to an aspect of the present invention, there is provided a method of extracting content of compressed image information by analyzing a variable length bit string to analyze a plurality of direct current coefficients having luminance signal information, and using the variable length code. Obtaining a magnitude of a DC component with respect to the luminance signal, obtaining a range of difference values of brightness intensity between blocks using the magnitude of the DC component with respect to the luminance signal, and using the range of the difference value, And extracting the information and the edge information.

도 1은 MPEG-2로 압축된 비트열 내의 각 정보의 위치를 나타낸 구성도.BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 is a block diagram showing the position of each information in a bit string compressed with MPEG-2.

도 2는 DC 계수를 이용하여 차분값을 구하는 과정을 설명하기 위해 도시한 블럭도.2 is a block diagram illustrating a process of obtaining a difference value using a DC coefficient.

도 3(a) 및 3(b)는 여러 가지 방향에 대한 DC 성분 크기가 갖는 특성을 설명하기 위해 도시한 매크로 블록 내 DC 값 분포의 예시도.3 (a) and 3 (b) are exemplary diagrams of a DC value distribution in a macro block shown to explain characteristics of DC component sizes in various directions.

＜도면의 주요 부분에 대한 부호 설명＞<Explanation of symbols on the main parts of the drawing>

A : 매크로 블록 B : 매크로 블록 내의 하나의 DCT 블록A: macro block B: one DCT block in the macro block

C : 직류 계수 11 : 휘도에 대한 직류 성분 크기C: DC coefficient 11: DC component magnitude for luminance

12 : 차분값 13 : 교류 계수12: difference value 13: AC coefficient

14 : 블록 종단부14: block termination

이하, 첨부된 도면을 참조하여 본 발명을 상세히 설명하기로 한다.Hereinafter, with reference to the accompanying drawings will be described in detail the present invention.

도 1은 MPEG로 압축된 비트열 내의 각 정보의 위치를 나타낸 구성도이다.1 is a configuration diagram showing the position of each information in a bit string compressed with MPEG.

하나의 매크로 블록(A)은 다수의 DCT 블록(B)으로 구성되며, 하나의 DCT 블록(B)에는 64개의 데이터가 있다. 또한 매크로 블록(A) 내의 각각의 단위 DCT 블록(B)은 휘도에 대한 DC 성분의 크기(dct_dc_size)(11), 차분값(difference)(12), 교류(Alternative Current ;이하 AC라 함) 계수(13) 및 블록 종단부(end of block)(14)로 구성된다. 이때 휘도에 대한 DC 성분의 크기(dct_dc_size)(11)와 차분값(difference)(12)을 DC 계수(C)라 한다. 또한 하나의 DCT 블록(B)내의 64개의 데이터 중 첫 번째 데이터는 DC 계수이고 나머지 63개의 데이터는 AC 계수로 이루어 진다. 매크로 블록(A) 내의 DC 계수(C)의 개수는 휘도신호와 색차신호의 비율에 따라 달라진다. 즉, 휘도신호 : 제 1 색차신호 : 제 2 색차신호의 비율이 4:2:0인 경우에는 6개, 4:2:2인 경우에는 8개, 그리고 4:4:4인 경우에는 12개가 존재한다. 그런데 영상의 밝기 정보는 휘도신호에 존재하므로 발생하는 DC 계수 중 처음 4개의 신호를 해석하므로써 영상의 밝기 정보를 얻을 수 있다. 휘도 성분의 DC 계수들은 평균 부호 비트수를 줄이기 위해 그 발생 빈도수에 따라 가변적인 비트 길이를 할당하는 가변 길이 부호화(Variable Length Code ;VLC)된다. 이것을 허프만 테이블(Huffman table)이라 불리는 [표 1]에 나타내었다.One macro block (A) is composed of a plurality of DCT blocks (B), there is 64 data in one DCT block (B). In addition, each unit DCT block B in the macro block A has a DC component magnitude (dct_dc_size) 11, a difference 12, and an alternating current (AC) coefficient for luminance. 13 and end of block 14. At this time, the magnitude (dct_dc_size) 11 and the difference 12 of the DC component with respect to the luminance are referred to as the DC coefficient (C). In addition, the first of the 64 data in one DCT block (B) is the DC coefficient and the remaining 63 data is composed of the AC coefficient. The number of DC coefficients C in the macro block A depends on the ratio of the luminance signal and the color difference signal. That is, the ratio of luminance signal: first color difference signal: second color difference signal is 6 when 4: 2: 0, 8 when 4: 2: 2, and 12 when 4: 4: 4. exist. However, since the brightness information of the image exists in the luminance signal, the brightness information of the image can be obtained by analyzing the first four signals of the generated DC coefficients. The DC coefficients of the luminance component are variable length coded (VLC) which allocates a variable bit length in accordance with the frequency of occurrence to reduce the average number of code bits. This is shown in Table 1 called the Huffman table.

가변 길이 코드(VLC)Variable length code (VLC) 휘도에 대한 DC 성분 크기 (dct_dc_size)DC component size for luminance (dct_dc_size) 차분값(difference)범위Difference range 100100 00 00 00 1One ±1± 1 1One 22 ±2∼3± 2 to 3 101101 33 ±4∼7± 4 to 7 110110 44 ±8∼15± 8-15 11101110 55 ±16∼31± 16 to 31 1111011110 66 ±32∼63± 32 to 63 111110111110 77 ±64∼127± 64 to 127 11111101111110 88 ±128∼255± 128 to 255

[표 1]에서 볼 수 있듯이 DCT 블록당 DC 값의 차이가 커질수록 부호화된 비트수가 증가함을 알 수 있다. 이 부호화된 비트열을 이용하여 휘도에 대한 DC 성분의 크기(dct_dc_size)를 계산하고, 이후 DC 성분의 크기(dct_dc_size)로부터 DC의 차분값의 범위를 알 수 있다.As shown in [Table 1], it can be seen that as the difference of DC values per DCT block increases, the number of encoded bits increases. The size of the DC component (dct_dc_size) with respect to the luminance is calculated using the encoded bit string, and then the range of the difference value of the DC can be known from the size of the DC component (dct_dc_size).

도 2는 DC 계수를 이용하여 차분값을 구하는 과정을 설명하기 위해 도시한 블록도이다. 각각의 블록에는 64(8×8)개의 데이터가 있으며, 각 블록내의 빗금으로 표시한 원은 도시된 번호 순서대로 이전 블록과의 밝기 차이의 정보를 갖는 DC 계수를 나타낸다.2 is a block diagram illustrating a process of obtaining a difference value using a DC coefficient. Each block has 64 (8 × 8) pieces of data, and the circles indicated by the hatched in each block represent DC coefficients having information on the brightness difference from the previous block in the order of numbers shown.

[표 1]에서 알 수 있듯이 휘도에 대한 DC 성분의 크기(dct_dc_size)는 블록간의 밝기 차이가 커질수록 큰 값을 갖는다. 따라서 비트열의 분석(parsing)으로부터 얻은 휘도에 대한 DC 성분의 크기(dct_dc_size)를 이용하여 블록의 밝기 변화를 측정할 수 있다. 즉, 블록의 밝기 성분을 갖는 4개의 DC 계수는 도 2에 도시된 순서대로 이전 블록과의 밝기 차이의 정보를 갖고 있다.As shown in Table 1, the DC component size (dct_dc_size) for luminance has a larger value as the brightness difference between blocks increases. Therefore, the brightness change of the block can be measured using the size of the DC component (dct_dc_size) with respect to the luminance obtained from parsing the bit string. That is, the four DC coefficients having the brightness component of the block have information of the brightness difference from the previous block in the order shown in FIG.

예를 들어 휘도신호 성분 4개의 블록으로 구성된 매크로 블럭 내 에지(edge)가 있는 경우 해당 에지에 대해 수직으로 가로지르는 DCT 블록의 DC 성분값의 차이는 큰 값을 가지며 dct_dc_size 역시 긴 값을 갖게 된다.For example, if there is an edge in a macroblock composed of four blocks of luminance signal components, the difference between the DC component values of the DCT blocks perpendicular to the edges is large and dct_dc_size has a long value.

다음의 [표 2]는 휘도에 대한 DC 성분의 크기가 3인 경우의 실제 차분값을 나타낸 것이다. 차분값의 최초 비트를 사용하여 부호를 결정할 수 있고 연속되는 차분값을 계산할 수 있다. 즉, 차분값의 최상위 비트가 0이면 밝기의 세기 감소를 나타내는 음의 부호를 뜻하고 차분값의 최상위 비트가 1이면 밝기의 세기 증가를 나타내는 양의 부호를 의미한다.[Table 2] below shows the actual difference value when the magnitude of the DC component with respect to luminance is three. The first bit of the difference value can be used to determine the sign and to calculate successive differences. That is, if the most significant bit of the difference value is 0, it means a negative sign indicating a decrease in intensity of brightness. If the most significant bit of the difference value is 1, it means a positive sign indicating an increase in intensity of brightness.

dct_dc_size=3인 경우의 예Example when dct_dc_size = 3 차분(difference)Difference 실제 값Actual value 00 -7-7 1One -6-6 1010 -5-5 1111 -4-4 100100 44 101101 55 110110 66 111111 77

도 3(a) 및 3(b)는 여러 가지 방향에 대한 DC 성분 크기가 갖는 특성을 설명하기 위해 도시한 매크로 블록 내 DC 값 분포의 예시도이다.3 (a) and 3 (b) are exemplary diagrams of DC value distribution in a macro block shown to explain characteristics of DC component sizes in various directions.

d1은 블록1(31)과 블록2(32)간의 밝기 차이, d2는 블록2(32)과 블록3(33)간의 밝기 차이, 그리고 d3는 블록3(33)과 블록4(34)간의 밝기 차이를 나타낸다. 밝기 차이가 없이 평탄한 영역인 경우에 d1, d2 및 d3는 모두 작은 값을 갖는다.d1 is the brightness difference between block 1 (31) and block 2 (32), d2 is the brightness difference between block 2 (32) and block 3 (33), and d3 is the brightness between block 3 (33) and block 4 (34). Indicates a difference. In the case of flat areas without difference in brightness, d1, d2, and d3 all have small values.

도 3(b)에 도시된 것과 같은 수평 에지가 있는 경우에는 d2는 큰 값을, d1 및 d3는 작은 값을 갖는다.When there is a horizontal edge as shown in Fig. 3B, d2 has a large value, and d1 and d3 have a small value.

수직에지가 있는 경우에는 d1, d2 및 d3가 모두 큰 값을 갖지만 d1+d2, d2+d3은 작은 값을 갖는다. 45。의 대각선 에지가 있는 경우에는 d1+d2+d3는 큰 값을 갖고 d2가 작은 값을 갖으며, 135。의 대각선 에지가 있는 경우에는 45。의 대각선 에지가 있는 경우와 반대로 d1+d2+d3가 작은 값을 갖고 d2는 큰 값을 갖는다.In the case of the vertical edge, d1, d2, and d3 all have large values, but d1 + d2 and d2 + d3 have small values. D1 + d2 + d3 has a large value and d2 has a small value when there is a 45 ° diagonal edge, and d1 + d2 + as opposed to a 45 ° diagonal edge when there is a diagonal edge of 135 °. d3 has a small value and d2 has a large value.

상술한 바와 같이 본 발명에 따르면 비트열의 간단한 분석(parsing)만을 통하여 단시간 내에 정확한 비트길이 정보를 구할 수 있다. 이 비트길이 정보는 영상의 국부적인 특성 및 전체적인 특성을 반영할 수 있기 때문에 영상 검색 및 장면전환 등과 같은 영상 처리 응용 분야에 적용할 수 있으며 하드웨어 및 알고리즘의 복잡도를 낮출 수 있는 탁월한 효과가 있다.As described above, according to the present invention, accurate bit length information can be obtained in a short time through only parsing the bit string. Since this bit length information can reflect local and overall characteristics of the image, it can be applied to image processing applications such as image retrieval and scene change, and has an excellent effect of reducing the complexity of hardware and algorithms.

Claims

Analyzing a variable bit string to interpret a plurality of direct current coefficients having luminance information;

Obtaining a magnitude of a DC component of luminance information using the bit string of the variable length code;

Obtaining a range of a difference value for brightness intensity between blocks by using a magnitude of a DC component of the luminance signal;

And extracting brightness information and edge information of the image using the range of the difference value.