KR100731981B1

KR100731981B1 - The method and system of multi-view video encoding based on grid type pyramid gop structure

Info

Publication number: KR100731981B1
Application number: KR1020050096980A
Authority: KR
Inventors: 최병호; 김제우; 호요성; 오관정; 오한
Original assignee: 전자부품연구원
Priority date: 2005-10-14
Filing date: 2005-10-14
Publication date: 2007-06-25
Also published as: KR20070041146A

Abstract

본 발명은 개별 시점에 대하여 복수개의 프레임으로 이루어지는 GOP(Group of Pictures)를 하나 또는 그 이상 포함하는 영상을 구성하고, 동일 대상의 영상을 복수 개의 시점에서 획득하여 압축하는 다시점 영상 압축 부호화 방법으로서, 상기 각각의 GOP를 I 프레임, B 프레임 및 RB 프레임으로 구성하되, 상호 인접하는 시점의 두 GOP는 상기 GOP를 구성하는 프레임 수의 1/2 만큼 시간 차이를 두어 상기 I 프레임이 배치되도록 구성된 격자 형태 GOP 구조를 형성하여 복호화하는 것인 다시점 영상 압축 부호화 방법이다.The present invention provides a multi-view image compression encoding method for constructing an image including one or more GOPs (Group of Pictures) composed of a plurality of frames for individual viewpoints, and obtaining and compressing images of the same object from a plurality of viewpoints. A grid configured such that each GOP comprises an I frame, a B frame, and an RB frame, wherein two GOPs at adjacent points in time are spaced apart by 1/2 of the number of frames constituting the GOP; A multi-view video compression encoding method for forming and decoding a shape GOP structure.

GOP, 다시점, 압축 부호화, H.264, 격자 GOP, Multiview, Compression Coding, H.264, Grid

Description

METHOD AND SYSTEM OF MULTI-VIEW VIDEO ENCODING BASED ON GRID TYPE PYRAMID GOP STRUCTURE}

도 1은 종래의 다시점 영상 부호화 방식을 설명한 도면.1 is a view for explaining a conventional multi-view video encoding method.

도 2는 본 발명에 따라 다시점 영상 부호화를 위하여 배치된 격자형 GOP 구조를 도시한 도면.2 is a diagram illustrating a lattice-type GOP structure arranged for multi-view image coding according to the present invention.

도 3은 본 발명의 일 실시예에 따라 다시점 영상 부호화를 하는 과정을 도시한 도면.3 is a diagram illustrating a process of performing multi-view image encoding according to an embodiment of the present invention.

도 4는 본 발명에 따른 부호화 방법에서 각 프레임의 계위를 도시한 도면.4 is a diagram showing the hierarchy of each frame in the encoding method according to the present invention;

도 5는 본 발명의 다른 일 실시예에 따라 다시점 영상 부호화 하는 과정을 도시한 도면. 5 is a diagram illustrating a process of multiview image encoding according to another embodiment of the present invention.

본 발명은 다시점 영상(multiview video)에 관한 것으로서, 특히 다시점 영상의 효율적 압축에 관한 것이다. The present invention relates to multiview video, and more particularly to efficient compression of multiview video.

다시점 영상은 복수 개의 카메라를 통해 촬영된 영상들을 기하학적으로 교정하고, 공간적인 합성 등을 통하여 여러 방향의 다양한 시점을 사용자에게 제공하는 3차원 영상처리의 한 분야이다. 이러한 다시점 영상은 사용자에게 시점의 자유를 증가시킬 수 있다는 특징을 지닌다. 이는 사용자들에게 자유로운 시점 및 넓은 프레임을 통한 입체감 제공이라는 장점을 지니고 있지만, 영상 획득시에 필수적으로 존재하는 중복 조건, 순간 처리 데이터 양의 증가, 그리고 고가의 장비가 요구되는 문제점이 있다. A multi-view image is a field of 3D image processing that geometrically corrects images taken by a plurality of cameras and provides a user with various viewpoints in various directions through spatial synthesis. Such a multi-view image is characterized by increasing freedom of view to a user. This has the advantage of providing a user with a free view and a three-dimensional effect through a wide frame, but there is a problem that requires an overlapping condition, an increase in the amount of instantaneous processing data, and expensive equipment that are essential when acquiring an image.

최근의 다시점 영상의 압축은 도 1에 도시된 바와 같이, 종래의 가장 효율적인 압축 방식인 H.264를 이용하여 각 시점별로 독립적으로 압축하는 방식을 취하고 있다.As shown in FIG. 1, the latest multi-view image compression is performed by independently compressing each view using H.264, which is the most efficient compression method of the related art.

도 1은 8개 시점에 대한 예를 든 것이다. 한 시점의 영상은 여러 개의 GOP(Group of Pictures)로 구성되어 있다. 여기서 GOP란 하나의 I 프레임과 다수의 P 프레임이나 B 프레임으로 구성된 프레임들의 집합을 말하는 데, 도 1의 경우 하나의 GOP는 하나의 I 프레임과 3개의 P 프레임, 그리고 8개의 B 프레임 총 12개의 프레임으로 구성되어 있다. 1 shows an example of eight viewpoints. An image of one viewpoint is composed of several GOPs (Group of Pictures). Here, the GOP refers to a set of frames composed of one I frame and a plurality of P frames or B frames. In FIG. 1, one GOP includes one I frame, three P frames, and eight B frames. It consists of a frame.

각 GOP의 I 프레임은 영상의 재생시 임의 접근이 가능한 위치로 기존 방법의 경우 0.5초 마다 나타난다. P 프레임은 I 프레임을 참조 프레임으로 이용하여 예측 부호화 되며, B 프레임의 경우는 I 프레임과 P 프레임을 참조 프레임으로 이용하여 예측 부호화 된다. The I frame of each GOP is a location that can be accessed randomly during video playback, and appears every 0.5 seconds in the conventional method. P frames are predictively encoded using I frames as reference frames, and in the case of B frames, they are predictively encoded using I frames and P frames as reference frames.

그런데, 종래의 방법은 각 시점의 영상들을 각각 독립적으로 부호화하는 방법을 취하고 있다. 각 시점의 독립적인 부호화에 이용되는 H.264의 경우 부호화 파라미터를 다음과 같이 설정한다. 비트율 제어를 사용하고 비트율 제어의 기본 단위 는 부호화 하고자 하는 영상 크기의 한 줄에 해당하는 매크로 블록의 수로 정한다. However, the conventional method takes a method of encoding images of each viewpoint independently. In the case of H.264 used for independent encoding of each view, encoding parameters are set as follows. The bit rate control is used and the basic unit of the bit rate control is the number of macro blocks corresponding to one line of the image size to be encoded.

비트율-왜곡 최적화 기법을 사용 하고, P 프레임이나 B 프레임 예측에 이용되는 움직임 예측에 대한 탐색 범위는 ±32이다. 그리고 참조 프레임의 수는 5개 이고, 시간적인 임의 접근은 15fps의 경우 1초, 25/30fps의 경우 0.5초 마다 가능해야 한다. 즉 이 단위마다 I 프레임이 삽입되어야 한다. Using the rate-distortion optimization technique, the search range for motion prediction used for P frame or B frame prediction is ± 32. The number of reference frames is five, and temporal random access should be possible every 1 second for 15 fps and 0.5 seconds for 25/30 fps. In other words, an I frame must be inserted in each unit.

GOP의 구조는 IBBP..... 구조를 이용한다. 루프 필터는 사용하고, VLC의 경우 CABAC (context-based adaptive binary arithmetic coding)을 이용한다. The structure of the GOP uses the IBBP ..... structure. Loop filter is used, and in the case of VLC, context-based adaptive binary arithmetic coding (CABAC) is used.

이러한 다시점 부호화에 관한 방법은 기본적으로 부호화 효율이 좋아야 하고, 시간적으로는 지연이 작아야 한다. 또한 시점간의 화질의 일관성이 유지되어야 하고, 시간·공간적인 임의 접근이 가능해야 한다. Such a method for multi-view coding should basically have good coding efficiency and have a small delay in time. In addition, consistency of image quality between viewpoints should be maintained, and temporal and spatial random access should be possible.

종래 방법의 경우는 하나의 시점에 대한 영상 부호화 마찬가지로 하나의 프레임내 공간적 상관관계는 고려하였으나 시점별 공간적 상관도를 고려하지 않았다.In the case of the conventional method, the spatial correlation in one frame is considered, but the spatial correlation in each view is not considered.

여기서 공간적인 상관도란 동일 시간의 인접한 시점의 영상들 간의 유사성을 일컫는다. 인접한 시점의 경우 카메라의 위치만 조금 바뀐 것이기 때문에 동일 시간의 인접한 시점의 영상들을 상당히 유사함을 알 수 있고 이를 실제로 예측 부호화 과정에서 이용한다면 비트율이나 영상의 화질 면에서도 종래의 방법보다 우수한 성능을 보일 수 있다. 그러나 종래 방법의 경우는 각 시점의 영상들을 인접 시점 영상을 고려치 않고 독립적으로 부호화 하여 개별 시점의 시간적인 상관도에만 부호화 효율을 의존하고 있다. 하지만 다시점 영상 부호화라는 점을 감안할 때 보다 나은 부호화 효율을 위해서는 시간적인 상관도와 함께 공간적인 상관도도 함께 고 려될 필요가 있다.In this case, the spatial correlation refers to the similarity between images of adjacent viewpoints at the same time. In the case of adjacent viewpoints, only the position of the camera has changed slightly, so that the images of the adjacent viewpoints of the same time are very similar, and if they are actually used in the prediction encoding process, they show better performance than conventional methods in terms of bit rate and image quality. Can be. However, in the conventional method, images of respective viewpoints are independently encoded without considering neighboring viewpoint images, and the encoding efficiency is dependent only on the temporal correlation of individual viewpoints. However, considering multi-view video coding, spatial correlation and spatial correlation need to be considered for better coding efficiency.

전술한 종래 기술의 문제점을 해결하기 위하여, 본 발명은 다시점 영상 압축 부호화에 있어서, 각 시점간의 시간적 상관도 및 공간적 상관도를 고려하는 압축 부호화 방법을 제공하는 데 그 목적이 있다.SUMMARY OF THE INVENTION In order to solve the above-mentioned problems of the prior art, an object of the present invention is to provide a compression encoding method that considers temporal and spatial correlation between viewpoints in multi-view video compression encoding.

상기 기술적 과제를 달성하기 위한 본 발명에 따라, 개별 시점에 대하여 복수개의 프레임으로 이루어지는 GOP(Group of Pictures)를 하나 또는 그 이상 포함하는 영상을 구성하고, 동일 대상의 영상을 복수 개의 시점에서 획득하여 압축하는 다시점 영상 압축 부호화 방법에 있어서, 상기 각각의 GOP를 I 프레임, B 프레임 및 RB 프레임으로 구성하되, 상호 인접하는 시점의 두 GOP는 상기 GOP를 구성하는 프레임 수의 1/2 만큼 시간 차이를 두어 상기 I 프레임이 배치되도록 구성된 격자 형태 GOP 구조를 형성하여 부호화하는 것인 다시점 영상 압축 부호화 방법이 제공된다. 상기 각 GOP는 I, B, B, RB, B, B, RB, B, B, RB, B, B 프레임의 순서로 구성되며, 각 GOP의 I 프레임을 먼저 부호화하는 단계와, 동일 시점의 I 프레임과, 동일 시간대의 인접 시점의 I 프레임을 이용하여 RB 프레임을 형성하는 단계와, 동일 시점의 I 프레임과, 동일 시점의 RB 프레임을 이용하여 상기 I 프레임과 RB 프레임의 중간에 위치하는 RB 프레임을 형성하는 단계와, 상기 RB 프레임과 상기 I 프레임을 이용하여 B 프레임을 형성하는 단계를 포함하여 압축이 진행된다.According to the present invention for achieving the above technical problem, to configure an image including one or more GOP (Group of Pictures) consisting of a plurality of frames for each viewpoint, and to obtain the image of the same object from a plurality of viewpoints In the multi-view video compression encoding method for compressing, each GOP includes an I frame, a B frame, and an RB frame, wherein two GOPs at adjacent points in time are separated by a half of the number of frames constituting the GOP. A multiview image compression encoding method is provided in which a lattice-shaped GOP structure configured to arrange the I frames is formed and encoded. Each GOP is configured in the order of I, B, B, RB, B, B, RB, B, B, RB, B, B frames, and first encoding the I frame of each GOP, and I at the same time. Forming an RB frame by using a frame, I frames of adjacent time points of the same time zone, and an RB frame positioned between the I frame and the RB frame by using an I frame at the same time point and an RB frame at the same time point. And forming a B frame using the RB frame and the I frame.

본 발명의 다른 일 면에 따라 다시점 영상 압축 부호화 시스템에 있어서,동 일 대상을 복수 개의 시점에서 복수 개의 영상 획득 장치와, 상기 영상 획득 장치별로 얻어진 영상을 I 프레임, B 프레임 및 RB 프레임으을 포함하는 복수개의 프레임으로 이루어지는 GOP(Group of Pictures) 하나 또는 그 이상으로 이루어지는 압축 영상을 형성하는 압축 부호화 수단을 포함하고, 상기 압축 부호화 수단은 상호 인접하는 영상 획득 장치에 대한 GOP는 상기 GOP를 구성하는 프레임 수의 1/2 만큼 시간 차이를 두어 상기 I 프레임이 배치되도록 구성된 격자 형태 GOP 구조를 형성하여 부호화하는 것인 다시점 영상 압축 부호화 시스템이 제공된다.In a multi-view video compression encoding system according to another aspect of the present invention, a plurality of image capturing apparatuses of the same object from a plurality of viewpoints, and the image obtained for each of the image capturing apparatus includes I frames, B frames and RB frames Compression encoding means for forming a compressed image consisting of one or more GOPs (Group of Pictures) consisting of a plurality of frames, wherein the compression encoding means comprises a GOP for an image acquisition device adjacent to each other A multi-view video compression encoding system is provided, which forms and encodes a grid-shaped GOP structure configured to arrange the I frames with a time difference of 1/2 of the number of frames.

상기 압축 부호화 수단은 각 GOP의 I 프레임을 먼저 부호화하는 과정과, 동일 시점의 I 프레임과, 동일 시간대의 둘 이상의 인접 시점의 I 프레임을 이용하여 RB 프레임을 형성하는 과정과, 동일 시점의 I 프레임과, 동일 시점의 RB 프레임을 이용하여 상기 I 프레임과 RB 프레임의 중간에 위치하는 RB 프레임을 형성하는 과정과, 상기 RB 프레임과 상기 I 프레임을 이용하여 B 프레임을 형성하는 과정을 수행하여 다시점 영상 압축 부호화 작업을 수행한다.The compression encoding means first encodes an I frame of each GOP, forms an RB frame using I frames of the same view, and I frames of two or more adjacent views of the same time zone, and I frames of the same view. And a process of forming an RB frame located between the I frame and the RB frame using the RB frame at the same time point, and forming a B frame using the RB frame and the I frame. Perform video compression coding.

이하, 첨부 도면을 참조하여 본 발명의 구성에 대하여 상세히 설명한다.EMBODIMENT OF THE INVENTION Hereinafter, with reference to an accompanying drawing, the structure of this invention is demonstrated in detail.

도 2는 본 발명의 바람직한 일 실시예에 따라 인접 시점의 프레임을 참조하여 다시점 영상 압축 부호화를 행하기 위한 격자형 GOP 구조를 도시한 도면이다. 도시된 바와 같이, 기본적으로 각 GOP는 I, B, B, RB, B, B, RB, B, B, RB, B, B 프레임의 순서로 구성되며, 각 시점의 I 프레임의 위치가 인접한 시점과 일치하지 않도록 하나의 GOP를 구성하는 프레임의 절반의 수만큼 짝수 시점에 해당하는 GOP에 지연을 두는 GOP 구조를 마련한다. 2 is a diagram illustrating a lattice GOP structure for performing multi-view video compression coding with reference to a frame of an adjacent view according to an exemplary embodiment of the present invention. As shown, basically, each GOP is composed of I, B, B, RB, B, B, RB, B, B, RB, B, and B frames in order, and the positions of the I frames of each viewpoint are adjacent to each other. A GOP structure is provided in which a delay is applied to a GOP corresponding to an even time point by half the number of frames constituting one GOP so as not to coincide with.

이와 같은 본 발명의 신규하고 진보한 GOP구조를 채택하는 이유는 다음과 같다. 한 시점의 대한 부호화 효율은 I 프레임에 가장 많이 의존한다. 이는 I 프레임이 다른 프레임의 참고 프레임으로 이용되기 때문에 다음 I 프레임이 나오기 전까지 부호화 효율에 큰 영향을 미친다. 따라서 I 프레임의 간격이 좁을수록 해당 영상의 화질은 좋아진다. 그러나 I 프레임이 많아지면 비트 발생이 많아져 압축 효율면에서는 손해를 보게 된다. The reason for adopting such a novel and advanced GOP structure of the present invention is as follows. The coding efficiency for a point in time is most dependent on the I frame. Since the I frame is used as a reference frame of another frame, it has a great influence on the coding efficiency until the next I frame comes out. Therefore, the narrower the interval between the I frames, the better the image quality. However, the more I frames, the more bits are generated, which results in a loss of compression efficiency.

이러한 단점을 보완하면서 효과적으로 부호화 효율을 증진시키기 위하여 본 발명에 따른 방법은 RB 프레임이라는 새로운 프레임을 정의하고 있다. 이는 참조가 가능한 B 프레임이다. RB 프레임은 종래 방법의 P 프레임과 동일한 위치에 있으며 하는 일도 같다. 다만 부호화시 B 프레임처럼 부호화되고 다른 B 프레임의 참조 프레임으로 쓰이는 특성을 갖는다. 그리고 제안하는 알고리즘에서 RB 프레임은 I 프레임만큼 고화질로 부호화 된다. 이런 방법을 통해 적은 비트를 이용하여 다수의 고화질 프레임을 부호화 하여 이를 다시 참조 화면으로 이용하면 다른 프레임들의 부호화 효율은 이에 따라 좋아지게 된다. In order to effectively improve the coding efficiency while compensating for this disadvantage, the method according to the present invention defines a new frame called an RB frame. This is a reference B frame. The RB frame is at the same position as the P frame of the conventional method and is the same. However, it has a characteristic of being encoded like a B frame and used as a reference frame of another B frame. In the proposed algorithm, RB frames are encoded with high quality as much as I frames. In this way, if a plurality of high quality frames are encoded using fewer bits and used as reference pictures, the encoding efficiency of other frames is improved accordingly.

도 3은 도2의 격자형 GOP 구조에서, 각 프레임이 어떤 과정으로 부호화되는지를 설명하기 위한 도면이다. 도 3에서, 화면상에서 맨 왼쪽에 위치한 두 시점의 부호화를 예로 들면서 본 발명의 방법을 설명하도록 한다.FIG. 3 is a diagram for describing how each frame is encoded in the lattice GOP structure of FIG. 2. In FIG. 3, the method of the present invention will be described by taking the encoding of two viewpoints located at the far left on the screen as an example.

먼저, 왼쪽 시점의 첫 번째 I 프레임[I(1)]이 부호화 되고, 인접하는 오른쪽 시점의 I 프레임[I(2)]이 부호화 된다. 이 두 프레임을 참조화면으로 두고 RB 프레임[RB(3), RB(4)}이 부호화 된다. 그 다음, 다른 RB 프레임[RB(5), RB(6)]이 각각 동일 시점의 전후 RB 프레임과 I 프레임을 참조 화면으로 두고 부호화 된다. First, the first I frame [I (1)] of the left view is encoded, and the I frame [I (2)] of the adjacent right view is encoded. The RB frames (RB (3) and RB (4)) are encoded with these two frames as reference pictures. Then, other RB frames (RB (5), RB (6)) are encoded with reference frames before and after the same RB frame and I frame, respectively.

도 3의 각 화살표는 위에 설명된 참조 관계를 보여주고 있다. Each arrow in FIG. 3 shows the reference relationship described above.

이와 같이 RB 프레임의 경우 I 프레임과 동일 시간대에 위치한 RB 프레임의 경우 동일 시간대의 인접 시점의 I 프레임과 동일 시점의 I 프레임을 이용하여 부호화 되고, I 프레임과 동일 시간대에 위치하지 않은 RB 프레임의 경우 동일 시점의 RB 프레임과 I 프레임을 참조 화면으로 이용한다. 나머지 B 프레임의 경우 종래의 방법과 마찬가지로 전후에 위치한 RB 프레임이나 I 프레임을 참조 화면으로 이용하여 부호화 된다. As described above, in the case of an RB frame, an RB frame located at the same time zone as the I frame is encoded using an I frame at the same time as an I frame at an adjacent time point in the same time zone, and an RB frame not located at the same time zone as the I frame. The RB frame and I frame at the same time are used as reference pictures. The remaining B frames are encoded by using RB frames or I frames positioned before and after, as in the conventional method, as reference pictures.

RB 프레임과 마찬가지로 도 3의 오른쪽에 B 프레임들에 대한 참조관계가 도시되어 있다. As with the RB frame, a reference relationship for the B frames is shown on the right side of FIG. 3.

위에서 언급했듯이 제안하는 방법의 경우 기본적으로 GOP 구조를 격자로 배치하여 I 프레임의 효율성을 다시점 영상의 전반에 골고루 나눌 수 있도록 하였다. 이를 위해 I 프레임과 동일한 시간대에 위치한 인접 시점의 영상의 경우 동일 시점의 I 프레임뿐만 아니라 인접한 시점의 I 프레임까지를 참조 화면으로 이용하여 시간·공간적으로 효율적인 예측을 행하여 적은 비트로 I 프레임과 유사한 화질로 부호화하여 I 프레임이 실제 간격보다 절반마다 나오는 효과를 주었다. 또한 종래의 방법의 P 프레임을 RB 프레임으로 대체하였고 부호화 순서를 조정하여 시간적인 상관도의 경우도 전후 시간대의 영상을 모두 이용할 수 있도록 하여 시간적인 상관도로 인한 부호화 효율도 효과적으로 개선할 수 있다. As mentioned above, in the proposed method, the GOP structure is basically arranged in a grid so that the efficiency of I frame can be divided evenly in the first half of multi-view image. To this end, in the case of images of adjacent viewpoints located in the same time zone as the I frame, time and space efficient prediction is performed using not only I frames of the same viewpoint but also I frames of adjacent viewpoints as a reference screen, so that the image quality is similar to that of the I frame with fewer bits. The coding resulted in an I frame appearing at half the actual interval. In addition, the P frame of the conventional method is replaced with the RB frame, and the coding order is adjusted to enable the use of all images in the front and rear time zones, so that the coding efficiency due to the temporal correlation can be effectively improved.

전술한 실시예에서, 상기 RB 프레임의 양자화 파라미터(QP)는 I 프레임의 QP 와 동일 또는 유사하도록 설정함으로써 RB 프레임의 화질을 I 프레임의 화질과 동일 또는 유사하게 한다.In the above-described embodiment, the quantization parameter (QP) of the RB frame is set to be the same or similar to the QP of the I frame to make the image quality of the RB frame the same or similar to the image quality of the I frame.

또한, I 프레임, RB 프레임, B 프레임의 순서로 QP를 증가시켜 압축 부호화함으로써 화질의 저하를 최소화하면서 압축 효율을 높게 할 수 있다.In addition, by compressing and encoding QP in the order of I frames, RB frames, and B frames, compression efficiency can be increased while minimizing degradation of image quality.

도 4는 각 프레임의 계층구조를 도시한 도면이다. 도 4는 인트라 간격이 12인 경우에 대한 피라미드 GOP 구조를 보여주고 있다. 그림에서와 같이 기준이 되는 I 장면들을 이용하여 중간 RB 장면을 만든다. 이 RB 장면과 I 장면들을 이용하여 다시 두 개의 RB 장면을 만든 후 RB 장면과 I 장면을 이용하여 B 장면들을 만든다. 4 is a diagram illustrating a hierarchical structure of each frame. 4 shows a pyramid GOP structure for the case where the intra interval is 12. Create an intermediate RB scene using the reference I scenes as shown. Create two RB scenes using these RB and I scenes, and then create B scenes using the RB and I scenes.

이를 통해 시간적으로 이전 장면과 이후 장면을 참조하여 B 장면의 부호화 효율을 극대화 시킬 수 있다. 이 과정에서는 기준이 되는 I 장면 위치의 영상들은 고화질로 부호화 하고 아래 계층으로 갈수록 QP를 증가시켜 이용한다. Through this, it is possible to maximize the coding efficiency of the B scene by referring to the previous scene and the subsequent scene in time. In this process, images of the I scene position, which is a reference, are encoded in high definition and used by increasing the QP toward the lower layer.

위에서 제안한 방법의 경우 다음과 같이 확장 적용이 가능 하다. 아래 그림과 같이 하나로 묶이는 시점을 두 개 뿐만이 아니라 3개 이상의 다수로 확장이 가능하다. 또한 GOP의 길이에 대한 조정도 가능하다The proposed method can be extended as follows. As shown in the figure below, not only two points of view can be extended but also three or more points. It is also possible to adjust the length of the GOP.

도 5는 3개의 시점을 연계하여 다시점 영상을 부호화하는 경우의 GOP 구조 및 각 프레임의 참조관계를 도시한 도면이다.FIG. 5 is a diagram illustrating a GOP structure and a reference relationship between frames when a multiview image is encoded by linking three viewpoints.

도시된 바와 같이, 3개의 시점에 대한 GOP의 I 프레임[I(1), I(2), I(3)]을 먼저 부호화하고, RB 프레임[예컨대, RB(4)]을 3개의 I 프레임[I(1), I(2), I(3)]을 이용하여 부호화한다. 기타 프레임의 경우에는 전술한 바와 같이 부호화 할 수 있으며, 인접 시점의 I 프레임 또는 RB 프레임을 이용하여 부호화 할 수 있다.As shown, I frames (I (1), I (2), I (3)) of the GOP for three viewpoints are first encoded, and RB frames (eg, RB (4)) are three I frames. Encoding is performed using [I (1), I (2), I (3)]. Other frames may be encoded as described above, and may be encoded using an I frame or an RB frame of an adjacent view.

이상 본 발명의 바람직한 실시예와 첨부 도면을 참조하여, 본 발명의 구성에 대하여 구체적으로 설명하였으나, 이는 예시에 불과한 것으로서 본 발명의 범위가 이에 한정되는 것은 아니다. 즉, 본 발명의 기술 분야에 통상의 지식을 가진 자라면 본 발명의 범위내에서 다양한 변형과 변경을 행할 수 있을 것이다. 따라서, 본 발명의 보호범위는 이하의 특허청구범위의 기재에 의하여 정하여져야 할 것이다.Although the configuration of the present invention has been described in detail with reference to the preferred embodiments of the present invention and the accompanying drawings, this is only an example and the scope of the present invention is not limited thereto. That is, those skilled in the art will be able to make various modifications and changes within the scope of the present invention. Therefore, the protection scope of the present invention should be defined by the following claims.

이상 설명한 바와 같이, 본 발명에 따르면 종래의 방법에서 단점으로 지적되었던 공간적인 상관성까지를 고려한 방법으로 다시점 영상 부호화의 특성에 걸맞게 종래 방법에서 구조를 효율적으로 변형하여 시간·공간적인 상관도를 효율적으로 고려할 수 있다. 격자 GOP 배치와 RB 장면을 통해 영상 부호화 효율을 좌우하는데 큰 역할을 하는 I 장면의 대한 효과를 극대화 시켰다. As described above, according to the present invention, in consideration of the spatial correlation that has been pointed out as a disadvantage in the conventional method, the structure is modified efficiently in the conventional method in accordance with the characteristics of the multi-view image coding to efficiently improve the spatial and spatial correlation. Can be considered. Lattice GOP placement and RB scenes maximize the effects of the I-scene, which plays a large role in determining image coding efficiency.

종래의 방법에 비하여 I 장면의 간격이 이론적으로 절반마다 나오는 효과를 가지면서도 적은 비트를 이용하기 때문에 영상의 화질과 비트 발생율 면에서 이득을 볼 수 있다. 또한 동일 시점의 영상의 경우도 시간적인 순서를 조금 변형하여 보다 향상된 시간적인 상관도를 이용할 수 있게 하였다. Compared to the conventional method, since the interval between I scenes is theoretically half-effective and uses fewer bits, the image quality and bit generation rate can be gained. In addition, in the case of the image of the same view, the temporal order is slightly modified to use an improved temporal correlation.

이를 통해 종래의 다시점 영상 부호화 방법에 비하여 보다 적은 비트로 좋은 화질을 갖도록 하였다. 또한 불필요한 참조 화면의 수를 줄여서 부호화 시간적인 면에서도 좋은 성능을 보인다. As a result, it is possible to have a better image quality with fewer bits than the conventional multi-view image coding method. In addition, the number of unnecessary reference pictures is reduced to show good performance in terms of encoding time.

Claims

A multi-view image compression encoding method for constructing an image including one or more GOPs (Group of Pictures) composed of a plurality of frames for individual viewpoints, and obtaining and compressing images of the same object from a plurality of viewpoints,

Each GOP is composed of an I frame, a B frame, and an RB frame, and two GOPs of adjacent time points are arranged in a lattice form in which the I frames are arranged with a time difference of 1/2 of the number of frames constituting the GOP. A multi-view video compression encoding method for forming and encoding a GOP structure.

The method of claim 1,

Wherein each GOP consists of I, B, B, RB, B, B, RB, B, B, RB, B, B frames.

The method of claim 1,

First encoding the I frames of each GOP,

Forming an RB frame using I frames at the same time and I frames at adjacent time points in the same time zone;

Forming an RB frame located between the I frame and the RB frame by using an I frame at the same time and an RB frame at the same time;

Forming a B frame using the RB frame and the I frame

How to include.

4. The method of claim 3, wherein the quantization parameter (QP) of the RB frame is the same as or similar to the QP of the I frame.

The method of claim 3, wherein the compression is performed by increasing the QP in the order of the I frame, the RB frame, and the B frame.

In the multi-view image compression encoding method for forming an image including one or more GOP (Group of Pictures) consisting of a plurality of frames for each viewpoint, and obtaining and compressing the same object from a plurality of viewpoints,

Each GOP comprises an I frame, a B frame, and an RB frame, wherein GOPs in the same time zone at three or more adjacent time points are spaced apart by 1/2 of the number of frames constituting the GOP. And forming and encoding a lattice-shaped GOP structure configured to be arranged.

The method of claim 6,

First encoding the I frames of each GOP,

Forming an RB frame by using an I frame at the same time and two or more adjacent frames at the same time.

Forming a B frame using the RB frame and the I frame

How to include.

8. The method of claim 7, wherein the quantization parameter (QP) of the RB frame is the same as or similar to the QP of the I frame.

The method of claim 7, wherein the compression is performed by increasing the QP in the order of the I frame, the RB frame, and the B frame.

In a multi-view video compression coding system,

A plurality of image acquisition apparatuses for the same object from a plurality of viewpoints,

Compression encoding means for forming a compressed image including one or more group of pictures (GOPs) including a plurality of frames including an I frame, a B frame, and an RB frame;

Wherein the compression encoding means is to form and encode a grid-shaped GOP structure configured such that the GOPs for the adjacent video acquisition apparatus by the time difference by 1/2 of the number of frames constituting the GOP is arranged to place the I frame Point video compression coding system.

The method of claim 10, wherein the compression encoding means

Encoding the I frame of each GOP first,

Forming an RB frame using I frames at the same time and two or more adjacent time frames at the same time;

And performing a process of forming a B frame by using the RB frame and the I frame.