KR0160458B1

KR0160458B1 - Moving picture compressing device and method for estimation of motion vector with mask block matching method

Info

Publication number: KR0160458B1
Application number: KR1019950009262A
Authority: KR
Inventors: 김은숙; 전준현
Original assignee: 조백제; 한국전기통신공사
Priority date: 1995-04-19
Filing date: 1995-04-19
Publication date: 1999-01-15
Also published as: KR960040019A

Abstract

본 발명은 동영상 압축 장치 및 방법에 관한 것으로, 움직임 벡터를 보다 간단한 방법으로 예측할 수 있는 매스크 블록 매칭 방법을 사용하여 예측오차가 집중되는 부분만을 압축함으로서 전송 데이터량을 줄이는 동영상 압축 장치 및 방법을 제공하기 위하여, 움직임을 추정하는 움직임 추정 수단(11); 그리드를 변형시키는 그리드 변형수단(12); 영상을 합성해 내는 영상 합성 수단(13); 에러 영상을 출력하는 에러 영상 처리 수단(14); 재생 영상을 생성하는 영상 생성 수단(15); 영상을 저장하는 영상 저장 수단(16); 및 움직임 벡터와 압축 모드와 에러 영상을 외부로 출력하는 버퍼링 수단(17)을 구비하고, 상기 장치에 적용되는 방법에 있어서, 움직임 추정과 그리드를 변형한 후에 현재 입력 영상을 합성한 다음에 예측오차를 계산하는 제1단계(21 내지 25); 인트라 코딩(Intra coding)을 수행하는 `제2단계(26,28,29); 및 움직임 벡터와 에러 영상을 이용하여 압축하는 제3단계(27 내지 29)를 포함하여 움직임 벡터 추정을 간단하게 수행할 수 있고, 전송 데이터량을 줄일 수 있는 효과가 있다.The present invention relates to a video compression apparatus and method, and provides a video compression apparatus and method for reducing the amount of transmission data by compressing only a portion where a prediction error is concentrated by using a mask block matching method that can predict a motion vector in a simpler manner. Motion estimating means (11) for estimating motion; Grid deforming means 12 for deforming the grid; Image synthesizing means (13) for synthesizing the images; Error image processing means (14) for outputting an error image; Image generating means (15) for generating a reproduced image; Image storage means (16) for storing an image; And a buffering means (17) for outputting a motion vector, a compression mode, and an error image to the outside. In the method applied to the apparatus, a prediction error is performed after synthesizing the current input image after modifying the motion estimation and grid. First step (21 to 25) to calculate; A second step (26, 28, 29) for performing intra coding; And a third step (27 to 29) of compressing using the motion vector and the error image, the motion vector estimation can be easily performed, and the amount of transmission data can be reduced.

Description

Motion picture compression apparatus and method for predicting motion vectors using mask block matching

제1도는 본 발명에 따른 동영상 압축장치의 구성도.1 is a block diagram of a video compression apparatus according to the present invention.

제2도는 본 발명에 따른 흐름도.2 is a flow chart in accordance with the present invention.

제3도는 그리드를 씌운 영상의 형태도.3 is a diagram showing the shape of the grid overlaid image.

제4도는 움직임 벡터에 의하여 변형된 그리드의 형태도.4 is a shape diagram of a grid deformed by a motion vector.

제5도는 변형된 그리드와 이전 프레임 영상만을 이용하여 텍스쳐 매핑 방법으로 합성된 영상의 형태도.5 is a shape diagram of an image synthesized by a texture mapping method using only a modified grid and a previous frame image.

제6도는 합성된 영상과 원래 영상간 차이 영상의 형태도.6 is a shape diagram of a difference image between a synthesized image and an original image.

제7도는 에러블럭 수에 따라 재생된 영상의 형태도.7 is a diagram of a type of image reproduced according to the number of error blocks.

* 도면의 주요부분에 대한 부호의 설명* Explanation of symbols for main parts of the drawings

11 : 움직임 추정부 12 : 그리드 변형부11: motion estimation unit 12: grid deformation unit

13 : 영상 합성부 14 : 에러 영상 처리부13 image synthesis unit 14 error image processing unit

15 : 영상 생성부 16 : 영상저장부15: image generating unit 16: image storage unit

17 : 버퍼17: buffer

본 발명은 매스크 블록 매칭 방법으로 움직임 벡터를 예측하여 전송데이타량을 줄인 동영상 압축 장치 및 방법에 관한 것이다.The present invention relates to a video compression apparatus and method for reducing a transmission data amount by predicting a motion vector using a mask block matching method.

현존하는 대부분의 영상 압축 장치는 영상을 부블럭들로 분할하여 움직임 추정 및 예측을 수행하는 블록기반 방법을 기본으로 하고 있다. 각 블록의 움직임 벡터들은 독립적으로 예측되어 지며, 한 블록내의 모든 픽셀값은 동일한 움직임 벡터를 가지게 된다.Most existing image compression apparatuses are based on a block-based method of performing motion estimation and prediction by dividing an image into subblocks. The motion vectors of each block are predicted independently, and all pixel values in a block have the same motion vector.

따라서, 대상물체의 회전이나 확대, 축소등과 같은 운동은 기존의 블록기반 방법으로는 충분히 보상해 주기가 어려운 문제점이 있었다.Therefore, the movement such as rotation, enlargement, reduction, etc. of the object has a problem that it is difficult to sufficiently compensate by the conventional block-based method.

이러한 문제점을 해결하기 위하여 제안된 어핀변환을 이용한 텍스쳐 매핑 방법은 움직임 벡터 예측을 위하여 그래디언트(gradient) 제한 조건을 이용한 반복적 추정 방법을 사용한다.In order to solve this problem, the proposed texture mapping method using affine transform uses an iterative estimation method using gradient constraints for motion vector prediction.

그러나, 이러한 어핀변환을 이용한 텍스쳐 매핑 방법 역시 계산 시간이 많이 걸리고 움직임 벡터의 초기값에 따라 움직임 벡터의 정확도가 좌우될 수 있는 문제점이 있었다.However, the texture mapping method using the affine transformation also takes a lot of computation time and there is a problem that the accuracy of the motion vector can be dependent on the initial value of the motion vector.

상기 문제점을 해결하기 위하여 안출된 본 발명은 움직임 벡터를 보다 간단한 방법으로 예측할 수 있는 매스크 블록 매칭 방법을 사용하여 예측오차가 집중되는 부분만을 이산 여현 변환기(DCT)를 이용하여 압축함으로서 전송 데이터량을 줄이는 동영상 압축 장치 및 방법을 제공하는데 그 목적이 있다.In order to solve the above problems, the present invention compresses only a portion of the prediction error concentrated using a discrete cosine transform (DCT) using a mask block matching method that can predict a motion vector in a simpler manner. It is an object of the present invention to provide a video compression apparatus and method for reducing.

상기 목적을 달성하기 위한 본 발명의 장치는, 외부로부터 현재 입력되는 입력 영상과 이전 프레임 영상사이에서 움직임을 추정하여 움직임 벡터를 출력하는 움직임 추정 수단(Motion Estimation); 상기 움직임 추정 수단에서 구한 움직임 벡터를 이용하여 그리드를 변형시키는 그리드 변형 수단(Grid Deformation); 상기 그리드 변형 수단에서 변형된 그리드 정보와 상기 이전 프레임 영상을 입력받아 현재 입력 영상을 합성해 내는 영상 합성 수단(Image Synthesis); 상기 입력 영상과 상기 영상 합성 수단에서 생성된 합성 영상과의 차이 영상을 부블럭들로 분할하여 각 블록의 평균오차를 계산하여 평균오차값이 큰 블록부터 순서대로 소정의 갯수를 선택적으로 압축하여 에러 영상을 출력하고, 압출모드를 결정하여 출력하는 에러 영상 처리 수단; 상기 에러 영상 처리 수단으로부터 출력되는 에러 영상과 상기 영상합성 수단에서 생성된 합성 영상을 상기 에러 영상 처리 수단으로부터 출력되는 압축모드에 따라 더하여 재생 영상을 생성하는 영상 생성 수단(Image Gemeration); 상기 영상 생성 수단에서 재생된 영상을 입력받아 상기 움직임 추정 수단과 영상 합성 수단에 이전 프레임 영상을 출력하기 위하여 재생된 영상을 저장하는 영상 저장 수단(Frame store); 및 상기 움직임 추정 수단에서 계산된 움직임 벡터와 상기 에러 영상 처리 수단에서 결정된 압축 모드와 에러 영상을 입력받아 외부로 출력하는 버퍼링 수단을 구비하는 것을 특징으로 한다.An apparatus of the present invention for achieving the above object comprises: motion estimation means for outputting a motion vector by estimating a motion between an input image currently input from the outside and a previous frame image; Grid deforming means for deforming a grid using a motion vector obtained by the motion estimating means; Image synthesis means for synthesizing the current input image by receiving the grid information deformed by the grid deforming means and the previous frame image; The difference between the input image and the synthesized image generated by the image synthesizing means is divided into subblocks to calculate an average error of each block, and selectively compresses a predetermined number in order from a block having a large average error value. Error image processing means for outputting an image and determining and outputting an extrusion mode; Image generation means (Image Gemeration) for generating a reproduced image by adding the error image output from the error image processing means and the composite image generated by the image synthesis means according to the compression mode output from the error image processing means; Image storage means for receiving the image reproduced by the image generating means and storing the reproduced image for outputting a previous frame image to the motion estimation means and the image combining means; And a buffering means for receiving a motion vector calculated by the motion estimation means, a compression mode determined by the error image processing means, and an error image and outputting them to the outside.

상기 다른 목적을 달성하기 위한 본 발명의 방법은, 외부로부터 현재 입력되는 입력 영상과 이전 프레임 영상사이에서 움직임을 추정하여 움직임 벡터를 출력하는 움직임 추정 수단(Motion Estimation); 상기 움직임 추정 수단에서 구한 움직임 벡터를 이용하여 그리드를 변형시키는 그리드 변형 수단(Grid Deformation); 상기 그리드 변형 수단에서 변형된 그리드 정보와 상기 이전 프레임 영상을 입력받아 현재 입력 영상을 합성해 내는 영상 합성 수단(Image Synthesis); 상기 입력 영상과 상기 영상 합성 수단에서 생성된 합성 영상과의 차이 영상을 부블럭들로 분할하여 각 블록의 평균오차를 계산하여 평균오차값이 큰 블록부터 순서대로 소정의 갯수를 선택적으로 압축하여 에러 영상을 출력하고, 압축모드를 결정하여 출력하는 에러 영상 처리 수단; 상기 에러 영상 처리수단으로부터 출력되는 에러 영상과 상기 영상 합성 수단에서 생성된 합성 영상을 상기 에러 영상 처리 수단으로부터 출력되는 압축모드에 따라 더하여 재생 영상을 생성하는 영상 생성 수단(Image Gemeration); 상기 영상 생성 수단에서 재생된 영상을 입력받아 상기 움직임 추정 수단과 영상 합성 수단에 이전 프레임 영상을 출력하기 위하여 재생된 영상을 저장하는 영상 저장 수단(Frame store); 및 상기 움직임 추정 수단에서 계산된 움직임 벡터와 상기 영상 처리 수단에서 결정된 압축 모드와 에러 영상를 입력받아 외부로 출력하는 버퍼링 수단을 구비하는 동영상 압축 장치에 적용되는 방법에 있어서, 입력 영상을 입력받아 움직임 추정과 그리드 변형을 수행한 후에 상기 변형된 그리드와 이전 프레임 영상을 이용하여 텍스쳐 매핑 방법으로 현재 입력 영상을 합성한 다음에 예측오차를 계산하고 임계차를 설정하여 압축모드를 결정하는 제1단계; 상기 제1단계 수행 후, 예측오차가 임계치보다 크면 입력 영상의 정보만을 이용하여 입력 영상을 압축하는 인트라 코딩(Intra coding)을 수행한 후에 재생 영상을 생성하고 저장하는 제2단게; 및 상기 제1단계 수행 후, 예측오차가 임계치보다 작으면 움직임 벡터와 에러 영상을 이용하여 현재 입력 영상을 압축한 후에 재생 영상을 생성하고 저장하는 제3단계를 포함하는 것을 특징으로 한다.According to another aspect of the present invention, there is provided a method, comprising: motion estimation means for estimating motion between an input image currently input from the outside and a previous frame image to output a motion vector; Grid deforming means for deforming a grid using a motion vector obtained by the motion estimating means; Image synthesis means for synthesizing the current input image by receiving the grid information deformed by the grid deforming means and the previous frame image; The difference between the input image and the synthesized image generated by the image synthesizing means is divided into subblocks to calculate an average error of each block, and selectively compresses a predetermined number in order from a block having a large average error value. Error image processing means for outputting an image and determining and outputting a compression mode; Image generating means (Image Gemeration) for generating a reproduced image by adding an error image output from the error image processing means and a synthesized image generated by the image combining means in accordance with a compression mode output from the error image processing means; Image storage means for receiving the image reproduced by the image generating means and storing the reproduced image for outputting a previous frame image to the motion estimation means and the image combining means; And a buffering means for receiving a motion vector calculated by the motion estimating means, a compression mode determined by the image processing means, and an error image and outputting the result to the outside. A first step of synthesizing the current input image by a texture mapping method using the deformed grid and the previous frame image and then calculating a prediction error and setting a threshold difference to determine a compression mode; A second step of generating and storing a reproduced image after performing intra coding to compress the input image using only the information of the input image if the prediction error is greater than the threshold after performing the first step; And after performing the first step, if the prediction error is smaller than the threshold, a third step of compressing the current input image using the motion vector and the error image, and generating and storing the reproduced image.

이하, 첨부된 도면을 참조하여 본 발명에 따른 일실시예를 상세히 설명한다.Hereinafter, with reference to the accompanying drawings will be described an embodiment according to the present invention;

제1도는 본 발명에 따른 동영상 압축 장치의 구성도로서, 11은 움직임 추정부, 12는 그리드 변형부, 13은 영상 합성부, 14는 에러 영상 처리부, 15는 영상 생성부, 16은 영상 저장부, 17은 버퍼를 각각 나타낸다.1 is a block diagram of a video compression apparatus according to the present invention, 11 is a motion estimation unit, 12 is a grid deformation unit, 13 is an image synthesizer, 14 is an error image processing unit, 15 is an image generator, 16 is an image storage unit And 17 represent buffers, respectively.

그 구체적인 구성과 동작을 살펴보면, 움직임 추정부(11)는 외부로부터 현재 입력되는 입력 영상과 영상 저장부(16)로부터 출력되는 이전에 압축된 영상 사이에서 그리드의 움직임을 추정하여 움직임 벡터를 영상 합성부(13)과 버퍼(17)로 출력한다. 이때, 그리드의 움직임 추정은 다음과 같은 방법으로 이루어진다.Looking at the specific configuration and operation, the motion estimation unit 11 synthesizes the motion vector by estimating the motion of the grid between the current input image from the outside and the previously compressed image output from the image storage unit 16 Output to section 13 and buffer 17. At this time, the motion of the grid is estimated by the following method.

영상 저장부(16)로부터 출력되는 이전에 입축된 영상에 16×16 부 블록들로 구성된 그리드를 씌운다(제3도). 그리드의 교차점을 중심으로 15×15 매스크를 구성한 다음에 현재 입력되는 입력 영상과 비교하여 매스크간의 오차값을 최소로 만드는 블록을 찾아낸다. 이때, 탐색 영역은 수평과 수직 방향으로 각각 ±8화소를 하고, 이전 프레임 영상위의 그리드점을 중심으로 씌워진 매스크 블록과 현재 입력 영상의 탐색 영역내 블록들의 차이를 계산하여 가장 작은 오차를 갖는 블록을 찾은 다음에 이 블록과 매스크 블록과의 x좌표와 y좌표의 차이를 움직임 벡터로 한다.A grid composed of 16x16 sub-blocks is overlaid on a previously captured image output from the image storage unit 16 (FIG. 3). A 15 × 15 mask is constructed around the intersections of the grids and then compared with the currently input image to find a block that minimizes the error between the masks. In this case, the search area has ± 8 pixels in the horizontal and vertical directions, respectively, and the block having the smallest error by calculating the difference between the mask block covered by the grid point on the previous frame image and the blocks in the search area of the current input image. Next, the difference between the x coordinate and the y coordinate between the block and the mask block is the motion vector.

그리드 변형부(12)에서는 움직임 추정부(11)에서 구한 움직임 벡터를 이용하여 각 그리드의 교차점을 이동시킨 다음에 다시 그리드교차점을 이어주므로서 그리드를 변형시켜 주며(제4도) 변형된 그리드를 영상 합성부(13)로 출력한다.In the grid deforming unit 12, the grids are transformed by moving the intersections of the grids using the motion vectors obtained by the motion estimation unit 11, and then connecting the grid intersections again (FIG. 4). Output to the image synthesizing unit 13.

영상 합성부(13)에서는 그리드 변형부(12)에서 출력되는 변형된 그리드 정보와 영상 저장부(16)로부터 출력되는 이전에 압축된 영상을 이용하여 현재 입력 영상을 합성하여 합성 영상을 에러 영상 처리부(14)와 영상 생성부(15)로 출력한다. 즉, 변형되기 이전과 이후의 그리드를 구성하는 각각의 삼각형은 일대일 대응되므로 어핀변환을 이용한 텍스텨 매핑 방법으로 변형된 영상을 얻을 수 있으며, 이 영상이 이전 프레임 영상과 변형된 그리드를 이용하여 합성한 현재 영상이 된다. 텍스쳐 매핑이란 삼각형 ABC가 삼각형 A'B'C'으로 변형된 경우 삼각형 A'B'C'내의 모든 픽셀값은 삼각형 ABC내의 픽셀값들로부터 어핀변환을 이용하여 계산할 수 있는 방법이다.The image synthesizing unit 13 synthesizes the current input image by using the modified grid information output from the grid deforming unit 12 and a previously compressed image output from the image storing unit 16 to convert the synthesized image into an error image processing unit. 14 and the image generating unit 15 are output. That is, since each triangle constituting the grid before and after deformation is one-to-one correspondence, a modified image can be obtained by a texture mapping method using affine transformation, and this image is synthesized by using the modified grid with the previous frame image. Becomes the current video. Texture mapping is a method in which all the pixel values in the triangle A'B'C 'are calculated using affine transformation from the pixel values in the triangle ABC when the triangle ABC is transformed into the triangle A'B'C'.

에러 영상 처리부(14)에서는 상기 입력 영상과 영상 합성부(13)에서 출력되는 합성 영상과의 차이 영상을 구한 후에 차이 영상을 8×8부블럭들로 분할하여 각 부블럭의 평균값을 계산하여 값이 큰 순서대로 재배열한 다음에 M개의 블록만을 압축하여 에러 영상을 영상 생성부(15)와 버퍼(17)로 출력한다. 이때, 전체 에러 평균값을 계산하여 이 값을 임계치와 비교하여 입력 영상의 압축 모드를 결정하여 이 값을 버퍼(17)와 영상 생성부(15)로 출력한다.The error image processor 14 obtains a difference image between the input image and the synthesized image output from the image synthesizer 13, divides the difference image into 8 × 8 subblocks, and calculates an average value of each subblock. After rearranging in large order, only M blocks are compressed to output an error image to the image generating unit 15 and the buffer 17. At this time, the total error average value is calculated, and this value is compared with a threshold to determine a compression mode of the input image, and this value is output to the buffer 17 and the image generator 15.

영상 생성부(15)에서는 영상 합성부(13)로부터 출력되는 합성 영상과 에러 영상 처리부(14)로부터 출력되는 에러 영상을 입력받아 에러 영상 처리부(14)로부터 출력되는 압축 모드에 따라 재생 영상을 생성하여 영상 생성부(16)로 출력한다. 즉, 압축 모드가 인트라(Intra)인 경우에는 입력된 영상을 이전의 모든 과정을 무시한 채로 인트라 코딩(Intra coding)을 수행하고, 압축 모드가 인터(Inter)인 경우에는 상기 영상 합성부(13)에서 합성한 합성 영상과 에러 영상 처리부(14)에서 계산된 에러 영상을 더하여 재생 영상을 만들어주고, 이 영상을 영상저장부(15)로 출력한다.The image generating unit 15 receives a composite image output from the image synthesizing unit 13 and an error image output from the error image processing unit 14, and generates a reproduced image according to a compression mode output from the error image processing unit 14. And output to the image generator 16. That is, when the compression mode is Intra, intra coding is performed on the input image while ignoring all the previous processes. When the compression mode is Inter, the image synthesis unit 13 The synthesized video synthesized at and the error image calculated by the error image processor 14 are added to generate a reproduced image, and the image is output to the image storage unit 15.

버퍼(17)에서는 움직임 추정부(11)에서 계산된 움직임 벡터와 에러 영상 처리부(14)에서 결정된 압축 모드와 에러 영상를 입력받아 외부로 전송해 준다.The buffer 17 receives the motion vector calculated by the motion estimation unit 11 and the compression mode and the error image determined by the error image processor 14, and transmits them to the outside.

영상 저장부(16)에서는 움직임 추정부(11)의 움직임 추정과 영상 함성부(13)의 현재 영상 합성시 이전 프레임 영상으로 사용하기 위하여 영상 생성부(15)로부터 출력되는 재생 영상을 저장한다.The image storage unit 16 stores the playback image output from the image generating unit 15 for use as the previous frame image when the motion estimation unit 11 estimates the motion and the image combiner 13 synthesizes the current image.

제2도는 본 발명에 따른 흐름도이다.2 is a flow chart according to the present invention.

그 구체적인 동작을 살펴보면, 먼저 입력 영상을 입력받으면(21) 이전 프레임 영상에 16×16의 기본 블록으로 구성된 그리드를 씌운 다음에 그리드 교차점을 중심으로 15×15 매스크를 씌워 매스크 단위로 입력 영상과 비교하여 움직임 벡터를 찾아 이 움직임 벡터에 의하여 그리드를 변형시켜 현재 입력 영상에 씌운다(22). 이때, 탐색 영역은 수평과 수직 방향으로 각각 ±8 화소로 하고, 이전 프레임 영상위의 그리드점을 중심으로 씌워진 매스크 블록과 현재 입력 영상의 탐색영역내 블록들의 차이를 계산하여 가장 작은 오차를 갖는 블록을 찾은 다음에 이 블록과 매스크 블록과의 x좌표와 y좌표의 차이를 움직임 벡터로 한다.In detail, when the input image is received (21), a grid composed of 16 × 16 basic blocks is overlaid on a previous frame image, and a 15 × 15 mask is covered around a grid intersection to be compared with the input image in a mask unit. In order to find the motion vector, the grid is transformed by the motion vector and overlaid on the current input image (22). In this case, the search area is set to ± 8 pixels in the horizontal and vertical directions, respectively, and the block having the smallest error is calculated by calculating the difference between the mask block covered by the grid point on the previous frame image and the blocks in the search area of the current input image. Next, the difference between the x coordinate and the y coordinate between the block and the mask block is the motion vector.

이후, 변형된 그리드와 이전에 압축된 영상을 이용하여 현재 입력 영상의 합성 영상을 구한다(23). 이때, 합성 영상 재생은 어핀변환을 이용한 텍스쳐 매핑 방법을 이용한다. 즉, 변형되기 이전과 이후의 그리드를 구성하는 각각의 삼각형은 일대일 대응되므로 어핀변환을 이용한 텍스쳐 매핑 방법으로 변형된 영상을 얻을 수 있으며, 이 영상이 이전 프레임 영상과 변형된 그리드를 이용하여 합성한 현재 영상이 된다. 텍스쳐 매핑이란 삼각형 ABC가 삼각형 A'B'C' 내의 모든 픽셀값은 삼각형 ABC내의 픽셀값들로부터 어핀변환을 이용하여 계산할 수 있는 방법이다.Thereafter, a composite image of the current input image is obtained using the modified grid and the previously compressed image (23). In this case, the composite image reproduction uses a texture mapping method using affine transformation. That is, since each triangle constituting the grid before and after the deformation is one-to-one correspondence, the deformed image can be obtained by the texture mapping method using affine transformation, and the image is synthesized by using the deformed grid with the previous frame image. It becomes the current video. Texture mapping is a method in which triangle ABC can calculate all pixel values in triangle A'B'C 'using affine transformation from pixel values in triangle ABC.

다음에는 상기 입력 영상과 합성 영상가의 차이 영상을 구한 후에 차이 영상을 8×8 부블럭들로 분할하여 각 부블럭의 평균값을 계산하여 값이 큰 순서대로 재배열한 다음에 M개의 블록만을 압축하여 예측오차를 계산하고 임계치를 설정한다(25).Next, after obtaining the difference image between the input image and the composite image price, the difference image is divided into 8 × 8 subblocks, the average value of each subblock is calculated, rearranged in the order of the larger values, and then compressed by only M blocks. The error is calculated and the threshold is set 25.

이후, 상기 예측오차와 임계치와 비교하여 입력 영상의 압축모드를 결정한다(25). 즉, 예측오차가 임계치보다 크면 입력 영상의 정보만을 이용하여 입력 영상을 압축하는 인트라 코팅(Intra coding)을 수행한 후에(26) 재생 영상을 생성하고(28) 저장하며(29), 예측오차가 임계치보다 작으면 움직임 벡터와 에러 영상을 이용하여 현재 입력 영상을 압축한 후에(27) 재생 영상을 생성하고(28) 저장한다(29). 이때, 인트라 코딩(Intra coding) 모드에서는 기존의 정지 영상 압축에 사용되는 방법중 택일하여 압축한다.Then, the compression mode of the input image is determined by comparing the prediction error and the threshold (25). That is, if the prediction error is greater than the threshold, intra coding is performed using only the information of the input image to compress the input image (26), and then the reproduced image is generated (28) and stored (29). If the threshold value is smaller than the threshold, the current input image is compressed using the motion vector and the error image (27), and then the reproduced image is generated (28) and stored (29). In this case, in the intra coding mode, one of the methods used for conventional still image compression is compressed.

상기 움직임 벡터와 에러 영상을 이용하여 압축하는 과정을 상세히 살펴보면, 텍스쳐 매핑을 이용하여 합성한 영상과 입력 영상과의 차이 영상을 살펴보면(제6도) 에러가 눈,코,입등 특정 부분에 집중적으로 나타나기 때문에 에러 영상을 모두 압축하는 것이 아니라 에러가 큰 상위 M개의 블록만을 선택적으로 압축한다(28). 즉, 약 1/10의 에러 블록만을 선택적으로 압축하여도 재생 영상의 화질에는 큰 영향이 나타나지 않는다.Looking at the process of compressing using the motion vector and the error image in detail, when looking at the difference image between the input image and the image synthesized using texture mapping (Fig. 6), the error is concentrated on a specific part such as the eye, nose, and mouth. Instead of compressing all of the error images, only the upper M blocks with large errors are selectively compressed (28). That is, even if only about one-tenth of the error blocks are selectively compressed, there is no significant effect on the quality of the reproduced video.

이후, 압축한 에러 영상과 텍스쳐 매핑 방법으로 합성한 합성 영상을 더하여 재생 영상을 생성한 후에(28) 재생된 영상을 다음 영상의 압축에 사용하기 위하여 프레임 스토어(Frame Store) 저장한다(29).Subsequently, after generating the reproduced image by adding the compressed error image and the synthesized image synthesized by the texture mapping method (28), the reproduced image is stored in a frame store for use in compressing the next image (29).

본 발명에 따른 실험에는 352×288 크기의 CIF(Common Intermediate Format) 영상 Miss America를 사용하였다.In the experiment according to the present invention, a Common Intermediate Format (CIF) image of Miss 352 × 288 was used.

제3도는 그리드를 씌운 영상의 형태도로서, 변형되기 이전의 그리드를 씌운 영상이며, 제4도는 움직임 벡터에 의하여 변형된 그리드의 형태도로서, 움직임 벡터를 이용하여 변형시킨 그리드를 씌운 영상이며, 제5도는 변형된 그리드와 이전 프레임 영상만을 이용하여 텍스쳐 매핑 방법으로 합성된 영상의 형태도이다.FIG. 3 is a shape diagram of an image overlaid with a grid, and is an image overlaid with a grid before deformation. FIG. 4 is a shape diagram of a grid deformed with a motion vector. 5 is a shape diagram of an image synthesized by the texture mapping method using only the modified grid and the previous frame image.

제6도는 합성된 영상과 원래 영상간 차이 영상의 형태도로서, 예측오차가 눈과 코, 입 부분과 같이 부분적 움직임(Local motion)이 큰 부분에 집중적으로 나타나는 것을 볼 수 있다.FIG. 6 is a shape diagram of the difference image between the synthesized image and the original image, and it can be seen that the prediction error is concentrated in the large part of the local motion such as the eyes, nose, and mouth.

제7도는 에러블럭 수에 따라 재생된 영상의 형태도로서, 에러 집중 현상을 이용하여 에러가 큰 블록부터 선택적으로 압축한 경우의 실험 결과인데, 총 584개의 에러블럭중 100개 정도만 압축하여도 화질에 큰 차이가 없는 것을 볼 수 있다.FIG. 7 is a shape diagram of the image reproduced according to the number of error blocks. The experiment results are obtained by selectively compressing a block having a large error by using an error concentration phenomenon. The image quality is obtained by compressing only about 100 of the 584 error blocks. You can see that there is no big difference.

상기와 같은 본 발명은 매스크 블록 매칭 방법을 사용하여 움직임 벡터 추정을 간단하게 수행할 수 있고, 예측오차가 집중되는 부분만을 이산 여현 변환기를 이용하여 압축함으로서 전송 데이터량을 줄일수 있고, 에러 블록을 1/10 정도만 압축하여도 좋은 화질의 재생 영상을 얻을 수 있는 효과가 있다.As described above, the present invention can easily perform motion vector estimation using a mask block matching method, reduce the amount of transmitted data by compressing only a portion where the prediction error is concentrated using a discrete cosine converter, and reduce an error block. Compression of only about 1/10 can produce a good quality video.

Claims

Motion estimating means (11) for estimating motion between an input image currently input from the outside and a previous frame image to output a motion vector; Grid deformation means (12) for deforming a grid using a motion vector obtained by said motion estimation means (11); Image synthesis (13) for receiving the grid information modified by the grid modifying means (12) and the previous frame image to synthesize a current input image; The difference between the input image and the synthesized image generated by the image synthesizing means 13 is divided into subblocks, and the average error of each block is calculated to selectively select a predetermined number in order from blocks having the large average error value. Error image processing means (14) for compressing and outputting an error image, and for determining and outputting a compression mode; Image generation for generating a playback image by adding the error image output from the error image processing means 14 and the composite image generated by the image synthesizing means 13 in accordance with the compression mode output from the error image processing means 14. Means (Image Gemeration) 15; An image storage means for receiving the image reproduced by the image generating means 15 and storing the reproduced image for outputting a previous frame image to the motion estimating means 11 and the image synthesizing means 13; 16); And a masking means (17) for receiving the motion vector calculated by the motion estimation means (11), the compression mode determined by the error image processing means (14), and the error image, and outputting them to the outside. A video compression device that predicts motion vectors in a way.

Motion estimating means (11) for estimating motion between an input image currently input from the outside and a previous frame image to output a motion vector; Grid deformation means (12) for deforming a grid using a motion vector obtained by said motion estimation means (11); Image synthesis means (13) for receiving the grid information transformed by the grid transformation means (12) and the previous frame image to synthesize a current input image; The difference between the input image and the synthesized image generated by the image synthesizing means 13 is divided into subblocks, and the average error of each block is calculated to selectively select a predetermined number in order from blocks having the large average error value. Error image processing means (14) for compressing and outputting an error image, and for determining and outputting a compression mode; Image generation for generating a playback image by adding the error image output from the error image processing means 14 and the composite image generated by the image synthesizing means 13 in accordance with the compression mode output from the error image processing means 14. Means (Image Gemeration) 15; An image storage means for receiving the image reproduced by the image generating means 15 and storing the reproduced image for outputting a previous frame image to the motion estimating means 11 and the image synthesizing means 13; 16); And a buffering means (17) for receiving a motion vector calculated by the motion estimation means (11), a compression mode determined by the error image processing means (14), and an error image, and outputting them to the outside. After receiving the input image and performing motion estimation and grid transformation, the current input image is synthesized using the texture mapping method using the deformed grid and the previous frame image, and then the prediction error is calculated and the threshold is compressed. First steps 21-25 to determine the mode; After performing the first steps 21 to 25, if the prediction error is larger than the threshold, the second step of generating and storing the reproduced image after performing intra coding to compress the input image using only the information of the input image. (26,28,29); And after performing the first steps 21 to 25, if the prediction error is smaller than the threshold, the third step (27 to 29) of compressing the current input image by using the motion vector and the error image, and then generating and storing the reproduced image. The video compression method for predicting the motion vector by a mask block matching method comprising a.

The method of claim 2, wherein the motion estimation and the grid transformation of the first steps 21 to 25 are performed by overlaying a grid of 16 × 16 basic blocks on a previous frame image and then centering the grid intersections. A fourth step of finding a motion vector by covering a mask with a × 15 mask and comparing the input image with a mask unit; And a fifth step of modifying a grid by the motion vector and covering the current input image after performing the fourth step.

The method of claim 3, wherein the searching of the motion vector in the fourth step comprises: a search block having ± 8 pixels in the horizontal and vertical directions, respectively, and a mask block and a current input image covered by a grid point on a previous frame image. The motion vector is a mask block matching method in which a block having the smallest error is found by calculating a difference between blocks in a search region of the block, and then the difference between the x and y coordinates between the block and the mask block is a motion vector. Video compression method to predict.

The prediction error calculation process of the first steps 21 to 25 is performed by obtaining a difference image between the input image and the synthesized image, and then dividing the difference image into 8x8 subblocks. Calculating a mean value of the fourth step; And after performing the fourth step, rearranging the calculated average values in order of the greatest, and then compressing only a predetermined number of blocks to calculate a prediction error. The motion vector is predicted by a mask block matching method. Video compression method.

3. The method of claim 2, wherein the second steps 26, 28, and 29 are performed by performing intra coding to compress the input image using only the information of the input image when the prediction error is greater than a threshold value. 26); And a fifth step (28, 29) of storing a frame store for use in compressing a next video after generating the playback video after performing the fourth step 26. A video compression method for predicting a motion vector using a mask block matching method.

3. The method of claim 2, wherein the third step (27 to 29) comprises: a fourth step (27) of compressing the current input image using the motion vector and the error image when the prediction error is smaller than the threshold value; And after performing the fourth step 27, adding a compressed error image and a synthesized image synthesized by a texture mapping method to generate a reproduced image, and then using the reproduced image to compress the next image. And a fifth step (28, 29) of storing the motion vector compression method using the mask block matching method.

The method of claim 7, wherein the compressing by using the motion vector and the error image in the fourth step 27 comprises: the difference between the image synthesized by using texture mapping and the input image is an error of eyes, nose, mouth, etc. A video compression method for predicting a motion vector by using a mask block matching method, which compresses only a predetermined number of blocks having a large error, rather than compressing all of an error image because it appears intensively in a specific part.