KR20070106318A

KR20070106318A - Decoder, processing system and processing method for multi-view frame data, and recording medium having program performing this

Info

Publication number: KR20070106318A
Application number: KR1020060039038A
Authority: KR
Inventors: 김만배; 김종원; 호요성; 김윤
Original assignee: 광주과학기술원
Priority date: 2006-04-28
Filing date: 2006-04-28
Publication date: 2007-11-01
Also published as: KR100780844B1

Abstract

A multi-view image decoder, a system and a method for processing multi-view image data, and a recording medium storing a program for performing the method are provided to make it possible to process a view image using one encoder and one decoder, thereby reducing the complexity of the system. K view buffers(420a-420k) store data corresponding to K views, respectively. A bit sequence distributor(410) rearranges K view image sequences having the same GOP(Group Of Picture) structure to receive a single encoded bit sequence. The bit sequence distributor(410) divides the single bit sequence into view image data units corresponding to respective views to store the view image data units respectively in corresponding view buffers. A scheduler outputs a control signal for processing the view image data stored in the view buffers. An image decoding unit(430) decodes and outputs the view image data stored in the view buffers in response to the control signal.

Description

Multi-view image decoder, multi-view image data processing system, multi-view image data processing method and a recording medium recording a program for performing the same {Decoder, processing system and processing method for multi-view frame data, and recording medium having program performing this }

도 1a는 평행(parallel) 데이터인 다시점 화상 데이터의 획득을 위한 다수의 카메라 배열 방법을 나타낸 도면.1A is a diagram illustrating a method of arranging a plurality of cameras for obtaining multiview image data, which is parallel data.

도 1b는 아크(arc) 데이터인 다시점 화상 데이터의 획득을 위한 다수의 카메라 배열 방법을 나타낸 도면.1B is a diagram illustrating a plurality of camera arrangement methods for obtaining multi-view image data, which is arc data.

도 2는 본 발명의 바람직한 일 실시예에 따른 다시점 화상 부호화기의 간략한 구성블록도.2 is a simplified block diagram of a multi-view picture encoder in accordance with one preferred embodiment of the present invention.

도 3은 시점 화상 데이터를 재배열한 단일 비트열을 생성하는 방법을 나타낸 도면.3 is a diagram illustrating a method of generating a single bit string rearranged of viewpoint image data.

도 4는 본 발명의 바람직한 일 실시예에 따른 다시점 화상 복호화기의 간략한 구성블록도.4 is a simplified block diagram of a multi-view picture decoder in accordance with one preferred embodiment of the present invention.

도 5는 GoGOP의 각 GOP들의 복호화 시간의 일례.5 is an example of a decoding time of each GOP of a GoGOP.

도 6은 도 4의 다시점 화상 복호화기의 화상 스킵부의 간략한 구성블록도.FIG. 6 is a schematic block diagram of a picture skip part of the multi-view picture decoder of FIG. 4; FIG.

도 7은 본 발명의 바람직한 일 실시예에 다른 GOP 내에서 복호화 화상을 선택하는 방법.7 is a method for selecting a decoded picture within a GOP according to a preferred embodiment of the present invention.

도 8은 본 발명의 바람직한 다른 실시예에 다른 GOP 내에서 복호화 화상을 선택하는 방법.8 is a method for selecting a decoded picture within a GOP according to another preferred embodiment of the present invention.

<도면의 주요부분에 대한 부호의 설명><Description of the symbols for the main parts of the drawings>

200 : 다시점 화상 부호화기200: multiview image encoder

210 : 다시점 화상 재배열부210: multiview image rearrangement unit

220 : 화상 부호화부220: image encoder

400 : 다시점 화상 복호화기400: multiview image decoder

410 : 비트열 분배기410: Bit Train Splitter

420a, 420b, …, 420K : 시점 버퍼420a, 420b,... , 420K: Viewpoint buffer

430 : 화상 복호화부430: picture decoding unit

440 : 스케쥴러440: scheduler

본 발명은 다시점 화상 시스템에 관한 것으로, 보다 상세하게는 하나의 부호화기와 복호화기를 이용하는 다시점 화상 데이터 처리 시스템에 관한 것이다. The present invention relates to a multiview image system, and more particularly, to a multiview image data processing system using one encoder and a decoder.

일반적으로, 디지털 데이터는 일정한 타입의 전송 장치로부터 일정한 타입의 수신 장치로 전송된다. 그리고, 전송 장치는 전송을 위한 데이터를 부호화(encoding)하는 부호화기(encoder)를 통상적으로 포함하고, 수신 장치는 수신된 데이터를 복호화(decoding)하는 복호화기(decoder)를 포함한다. 비디오 데이터, 오디오 데이터, 오디오/비디오 데이터 등과 같은 다양한 형태의 디지털 데이터가 전송 장치로부터 수신 장치로 전송되어 수신 장치를 통해 출력될 수 있다.In general, digital data is transmitted from a certain type of transmitting device to a certain type of receiving device. In addition, the transmission apparatus typically includes an encoder for encoding data for transmission, and the reception apparatus includes a decoder for decoding the received data. Various types of digital data such as video data, audio data, audio / video data, etc. may be transmitted from the transmitting device to the receiving device and output through the receiving device.

지배적인 비디오 압축 및 전송 포맷들은 혼성 블록 기반 화상 신호 보정 변환 비디오 부호기(hybrid block-based motion-compensated transform video coder)라고 불리는 군(family)으로부터 유래하는데, 상기 부호기의 예는 ITU-T VCEG 비디오 코딩 표준들을 포함하고, 비디오 코딩 표준들로는 VCEG(Video Coding Experts Group) 및 ISO/IEC MPEG(Moving Picture Experts Group)의 H.261, MPEG-1, H.262/MPEG-2 비디오, H.263, MPEG-4 비쥬얼 및 인-프로세스 초안 표준(in-process draft standard) H.264/AVC 등이 있다. 또한, 정지 화상(still picture), 오디오, 문서, 웹 페이지 등을 포함하는 다양한 다른 유형의 매체를 위하여, 또한 이러한 신호들을 동기화하고 함께 멀티플렉싱하기 위하여 부호화 및 압축 표준들이 지정되어 있다.The dominant video compression and transmission formats come from a family called hybrid block-based motion-compensated transform video coder, an example of which is ITU-T VCEG video coding. Standards, and video coding standards include H.261, MPEG-1, H.262 / MPEG-2 video, H.263, MPEG from the Video Coding Experts Group (VCEG) and ISO / IEC Moving Picture Experts Group (MPEG). -4 visual and in-process draft standard H.264 / AVC. In addition, for various other types of media including still pictures, audio, documents, web pages, etc., encoding and compression standards are also specified to synchronize and multiplex these signals together.

일반적으로 비디오 스트림은 세 가지 타입의 프레임들(Frames) 또는 픽쳐들(pictures)로 구성(이하, '화상'이라는 용어로 통칭하여 설명함)된다. 세 가지 타입의 화상들이란 화면 내 화상(I 화상 : intra frame), 예측 화상(P 화상 : predictive frame) 및 양방향 예측 화상(B 화상 : bi-directionally predictive frame)이다.In general, a video stream is composed of three types of frames or pictures (hereinafter, collectively described as the term 'picture'). The three types of pictures are intra picture (I picture: intra frame), predictive picture (P picture: predictive frame), and bidirectional predictive picture (B picture: bi-directionally predictive frame).

I 화상은 움직임 벡터 추정/보상을 이용하지 않고 단순히 그 화상만을 DCT하여 부호화하고, P 화상은 I 화상 또는 다른 P 화상을 참조하면서 움직임 추정/보상을 한 후, 나머지 차분의 데이터를 DCT하여 부호화하며, B 화상은 P 화상과 같이 움직임 보상을 사용하지만 시간축 상에 있는 두 개의 화상으로부터 움직임 추정/보상을 수행한다.I pictures are DCT-encoded only without using motion vector estimation / compensation, P pictures are motion-tagged / compensated with reference to I-picture or other P-picture, and then DCT-coded with the remaining difference data. The B picture uses motion compensation like the P picture, but performs motion estimation / compensation from two pictures on the time axis.

비디오 스트림 또는 시퀀스는 GOP(Group of Pictures)라 불리는 세그먼트에 의해 정의되며, I, B, B, P, B, B, P, …와 같이 이루어진 구조에서 I 화상부터 다음의 I 화상까지를 GOP(Group of Picture)라 칭한다. 통상적으로, GOP는 의도된 속도로 디스플레이되는 경우, 미리 설정된 시간(예를 들어, 0.5초)의 지속시간(duration)을 갖는 한 세트의 화상으로 구성된다.A video stream or sequence is defined by a segment called a group of pictures (GOP), where I, B, B, P, B, B, P,... In the structure formed as described above, the I picture to the next I picture are referred to as a GOP (Group of Picture). Typically, a GOP is composed of a set of pictures with a duration of a preset time (eg, 0.5 seconds) when displayed at the intended speed.

상술한 바와 같은 비디오 스트림 등의 화상 정보 전달을 위한 매체는 텔레비전과 같은 2차원 단말기 기술로부터 발전해왔다. 즉, 흑백 화상에서 출발하여 칼라 화상, 즉 SD(Standard Definition)급 텔레비전에서 고해상도 텔레비전(예로, HDTV)으로 발전함에 따라 화상 정보의 데이터량은 증가하는 추세인 것이다. 또한, 정보 통신의 기술 발달과 사용자의 요구에 따라 오락, 게임, 입체 영화 등의 다양한 컨텐츠들이 개발되고 있다. 이에, 현재 화상 정보는 평면적인 2차원의 세계가 아니라 공간을 포함하는 3차원적인 실감 컨텐츠로서, 현실감 및 자연 자체의 멀티미디어 정보의 전달을 위해서는 그 자체 그대로 재현할 수 있는 3차원 화상 정보와 관련된 기술이 개발되고 있다. 이러한 3차원 화상 정보와 관련된 기술의 하나로 다시점 화 상(multi-view video)이 있다.As described above, a medium for transmitting image information such as a video stream has been developed from a two-dimensional terminal technology such as a television. That is, the data amount of image information is increasing as the color image, i.e., the development of a high definition television (e.g., HDTV) from a black and white image to a high definition television (e.g., HDTV). In addition, various contents such as entertainment, games, stereoscopic movies, etc. are being developed according to the development of information and communication technology and user's request. Therefore, current image information is not a planar two-dimensional world but three-dimensional realistic content that includes space, and technology related to three-dimensional image information that can be reproduced as it is to deliver multimedia information of reality and nature itself. Is being developed. One of the technologies related to such three-dimensional image information is a multi-view video.

다시점 화상 장치는 여러 대의 카메라로부터 획득한 화상을 사용자에게 전달해 원하는 시점(view)에서 화상을 시청할 수 있도록 하는 차세대 화상 장치이다. 즉, 사용자는 하나의 정적 시점보다는 다수의 시점 영상을 제공받아 임의의 시점으로 변경함으로써, 해당 시점의 영상 컨텐츠를 시청할 수 있다. 다시점 화상을 공급하기 위한 다시점 화상 시스템 방식은 일반적으로 다시점 입력 비디오 신호가 들어오면 카메라 개수와 같은 개수의 부호화기를 사용하여 개별적인 압축 비트열(compressed bitstream)으로 만드는 동시(simulcast) 방식이다. 각 비트열은 MUX(멀티플렉서)를 통해 전송되는데, MUX는 일반적으로 해당 전송 규격에 맞추어 데이터 구조를 만든 후 채널로 전송한다. 이렇게 전달된 압축 비트열, 즉 부호화 화상은 수신 측에서 각각의 복호화기를 통해 복호화되며, 최종적으로 사용자에게 전달된다. 따라서, 시점 카메라의 수가 증가할수록 필요로 하는 부호화기 및 복호화기의 수가 증가하게 되고, 시스템이 복잡해짐으로 인해 다시점 화상의 전송을 위한 실용적인 시스템의 구현에 어려움이 있다.The multi-view image device is a next-generation image device that delivers images obtained from several cameras to a user so that the user can view the image at a desired view. That is, the user may receive a plurality of viewpoint images rather than one static viewpoint and change the image to an arbitrary viewpoint, thereby viewing the image contents of the viewpoint. The multi-view image system scheme for supplying a multi-view image is generally a simulcast method in which a multi-view input video signal is input into individual compressed bitstreams using the same number of encoders as the number of cameras. Each bit string is transmitted through a MUX (multiplexer), which typically creates a data structure in accordance with the transmission specification and transmits it to the channel. The compressed bit string thus transmitted, that is, the encoded image, is decoded through each decoder at the receiving side and finally delivered to the user. Therefore, as the number of viewpoint cameras increases, the number of encoders and decoders required increases. As the system becomes complicated, it is difficult to implement a practical system for transmitting a multiview image.

따라서, 본 발명은 시점 카메라의 수와 무관하게 하나의 부호화기와 복호화기를 이용하여 다시점 화상을 전송하고 처리할 수 있는 다시점 화상 데이터 처리 시스템을 제공한다.Accordingly, the present invention provides a multiview image data processing system capable of transmitting and processing a multiview image using one encoder and a decoder regardless of the number of viewpoint cameras.

또한, 본 발명은 다시점 화상을 재배열함으로써 하나의 단일 압축 비트열로 부호화하는 다시점 화상 처리를 위한 다시점 화상 부호화기 및 그 방법을 제공한다. The present invention also provides a multiview image encoder and a method for multiview image processing for encoding a single compressed bit string by rearranging the multiview images.

또한, 본 발명은 하나의 단일 압축 비트열로 부호화된 다시점 화상을 복호화하여 사용자에게 각 시점에 따른 화상을 제공할 수 있는 다시점 화상 복호화기 및 그 방법을 제공한다. In addition, the present invention provides a multi-view image decoder and method capable of decoding a multi-view image encoded with one single compressed bit string to provide a user with an image corresponding to each viewpoint.

또한, 본 발명은 하나의 부호화기와 하나의 복호화기를 이용하여 다시점 화상을 처리함으로써 전체 시스템의 복잡도를 감소시키면서도 채널 대역폭 및 부호화하는 매크로 블록의 수 등은 영향을 받지 않는 다시점 화상 데이터 처리 시스템을 제공한다. In addition, the present invention provides a multi-view image data processing system using a single encoder and a decoder to reduce the complexity of the overall system by processing a multi-view image, while not affecting the channel bandwidth and the number of macro blocks to be encoded. to provide.

본 발명의 이외의 목적들은 하기의 설명을 통해 쉽게 이해될 수 있을 것이다. Other objects of the present invention will be readily understood through the following description.

상기 목적들을 달성하기 위하여, 본 발명의 일 측면에 따르면, K(임의의 자연수)개의 시점(view)에 상응하는 데이터를 각각 저장하는 K개의 시점 버퍼; 동일한 GOP(Group Of Picture) 구조를 가지는 K개의 시점 화상 시퀀스(sequence)를 재배열하여 부호화한 단일 비트열을 전송받고, 상기 단일 비트열을 상기 각 시점에 상응하는 시점 화상 데이터 단위로 분리하여 상응하는 상기 시점 버퍼에 각각 저장하는 비트열 분배기; 상기 시점 버퍼에 저장된 상기 시점 화상 데이터를 처리하도록 하는 제어신호를 출력하는 스케쥴러; 및 상기 제어신호에 따라 상기 시점 버퍼 에 저장된 상기 시점 화상 데이터를 각각 복호화하여 출력하는 화상 복호화부를 포함하는 다시점 화상 복호화기가 제공될 수 있다. In order to achieve the above objects, according to an aspect of the present invention, the K view buffer for storing data corresponding to the K (random natural number) view (view), respectively; A single bit string obtained by rearranging and encoding K viewpoint image sequences having the same group of picture (GOP) structure is received, and the single bit sequence is divided into viewpoint image data units corresponding to the respective viewpoints to correspond. A bit string divider for storing in each of the time buffers; A scheduler for outputting a control signal for processing the viewpoint image data stored in the viewpoint buffer; And a picture decoder to decode and output the view image data stored in the view buffer, respectively, according to the control signal.

상기 목적들을 달성하기 위하여, 본 발명의 다른 측면에 따르면, K(임의의 자연수)개의 카메라로부터 동일한 GOP 구조를 가지는 각각의 시점 화상 시퀀스를 입력받고, 상기 각 시점 화상 시퀀스를 GOP 단위로 구분하여 순차적으로 재배열한 단일 비트열을 생성하는 다시점 화상 재배열부; 상기 단일 비트열을 부호화하여 전송하는 화상 부호화부; 상기 각 카메라의 시점에 상응하는 데이터를 각각 저장하는 K개의 시점 버퍼; 상기 단일 비트열을 전송받고, 상기 단일 비트열을 상기 각 시점에 상응하는 시점 화상 데이터 단위로 분리하여 상응하는 상기 시점 버퍼에 각각 저장하는 비트열 분배기; 상기 시점 버퍼에 저장된 상기 시점 화상 데이터를 처리하도록 하는 제어신호를 출력하는 스케쥴러; 및 상기 제어신호에 따라 상기 시점 버퍼에 저장된 상기 시점 화상 데이터를 각각 복호화하여 출력하는 화상 복호화부를 포함하는 다시점 화상 데이터 처리 시스템이 제공될 수 있다. In order to achieve the above objects, according to another aspect of the present invention, each viewpoint image sequence having the same GOP structure is input from K (random natural numbers) cameras, and the respective viewpoint image sequences are divided into GOP units and sequentially A multi-view image rearrangement unit for generating a single bit sequence rearranged into An image encoder which encodes and transmits the single bit string; K view buffers respectively storing data corresponding to the viewpoints of the cameras; A bit string divider configured to receive the single bit string and to separate the single bit string into view image data units corresponding to the respective view points and to store the single bit stream in the corresponding view buffer; A scheduler for outputting a control signal for processing the viewpoint image data stored in the viewpoint buffer; And an image decoder which decodes and outputs the viewpoint image data stored in the viewpoint buffer according to the control signal, respectively.

바람직하게는, 상기 화상 부호화부는 상기 부호화된 단일 비트열을 단일 채널로 전송할 수 있다. 상기 다시점 화상 재배열부는 상기 각 시점 화상 시퀀스를 GOP 단위로 구분하며, 동일 시간대의 GOP 단위별로 그룹화하여 순차적으로 재배열할 수 있으며, 상기 시점 화상 데이터는 상기 단일 비트열 중 상기 각 시점에 상응하는 부호화된 상기 시점 화상 시퀀스 중 최소 하나의 GOP에 속하는 화상들을 포함하거나 최대 상기 시점 화상 시퀀스를 모두 포함할 수 있다. Preferably, the image encoder may transmit the encoded single bit string through a single channel. The multi-viewpoint image rearranger divides each of the viewpoint image sequences into GOP units, and may rearrange them sequentially by grouping the GOP units in the same time zone, and the viewpoint image data corresponds to the viewpoints of the single bit stream. One of the encoded view image sequences may include pictures belonging to at least one GOP, or may include all of the maximum view image sequences.

상기 다시점 화상 복호화기 또는 상기 다시점 화상 데이터 처리 시스템은 하기와 같은 특징을 더 포함할 수 있다. The multi-view image decoder or the multi-view image data processing system may further include the following features.

바람직하게는, 상기 시점 화상 데이터는 상기 단일 비트열 중 상기 각 시점에 상응하는 부호화된 상기 시점 화상 시퀀스 중 최소 하나의 GOP에 속하는 화상들을 포함하고 최대 상기 시점 화상 시퀀스를 모두 포함할 수 있다.Preferably, the viewpoint image data may include images belonging to at least one GOP among the encoded viewpoint image sequences corresponding to the respective viewpoints of the single bit string, and may include all of the maximum viewpoint image sequences.

여기서, 상기 스케쥴러는, 상기 화상 복호화부로부터 가장 최근에 복호화된 시간대의 GOP에서의 상기 각 시점 화상 데이터의 복호화 시간을 수신하고 상기 복호화 시간에 따른 화상 복호화부 이용률을 계산하는 이용률 산출부; 및 상기 화상 복호화부 이용률로부터 상기 화상 복호화부에서 직후에 복호화될 시간대의 GOP에서의 화상 스킵 방법을 결정하는 화상 스킵부를 포함할 수 있다. 상기 화상 스킵부는 상기 시점별로 상기 가장 최근에 복호화된 시간대의 GOP에서의 상기 화상 복호화부 이용률이 클수록 상기 직후에 복호화될 시간대의 GOP에서의 화상 스킵 수가 증가되도록 할 수 있다.Here, the scheduler may include: a utilization calculator configured to receive a decoding time of each viewpoint image data in a GOP of a time zone most recently decoded from the image decoding unit, and calculate a usage rate of the image decoding unit according to the decoding time; And a picture skip unit for determining a picture skip method in a GOP of a time zone to be decoded immediately after the picture decoder from the picture decoder utilization rate. The picture skip unit may increase the number of picture skips in the GOP of the time zone to be decoded immediately after the time as the use rate of the picture decoder in the GOP of the most recently decoded time zone increases for each time point.

상기 화상 스킵부는, 상기 가장 최근에 복호화된 시간대의 GOP에서의 상기 화상 복호화부 이용률에 따라 상기 직후에 복호화될 시간대의 GOP에서의 화상 스킵 수를 결정하는 스킵 화상 결정 모듈; 및 상기 화상 스킵 수에 따라 스킵될 화상을 선택하는 복호화 화상 선택 모듈을 포함할 수 있다. The picture skip unit may include: a skip picture determination module configured to determine a number of picture skips in a GOP in a time zone to be immediately decoded immediately after the picture decoding unit in accordance with a utilization rate of the picture decoder in the GOP in a most recently decoded time zone; And a decoded picture selecting module for selecting a picture to be skipped according to the picture skip number.

상기 복호화 화상 선택 모듈은 GOP 내에서 스킵 시작 화상 및 상기 스킵 시작 화상 이후의 화상들을 연속적으로 스킵할 수 있으며, 상기 시점 화상 데이터는 베이스밴드 프로파일 시퀀스 및 메인 프로파일 시퀀스 중 어느 하나일 수 있다.The decoded picture selecting module may continuously skip a skip start picture and pictures after the skip start picture in a GOP, and the view point image data may be any one of a baseband profile sequence and a main profile sequence.

또는 상기 복호화 화상 선택 모듈은 GOP 내에서 화상들을 불연속적으로 스킵할 수 있으며, 상기 시점 화상 데이터는 메인 프로파일 시퀀스일 수 있다. Alternatively, the decoded picture selection module may discontinuously skip pictures in a GOP, and the view picture data may be a main profile sequence.

여기서, 상기 스케쥴러는, 상기 화상 복호화부 이용률로부터 직후에 복호화될 시간대의 GOP에서의 시점 우선순위를 결정하는 시점 우선순위 결정부를 더 포함할 수 있다. 상기 시점 우선순위 결정부는 상기 시점별로 가장 최근에 복호화된 시간대의 GOP에서의 상기 화상 복호화부 이용률이 작을수록 상기 직후에 복호화될 시간대의 GOP에서의 시점 우선순위가 높아지도록 할 수 있다. Here, the scheduler may further include a viewpoint priority determiner that determines a viewpoint priority in a GOP of a time zone to be decoded immediately after the picture decoder utilization. The viewpoint priority determining unit may make the viewpoint priority in the GOP of the time zone to be decoded immediately after the smaller the utilization rate of the picture decoder in the GOP of the time zone most recently decoded for each viewpoint.

상기 목적들을 달성하기 위하여, 본 발명의 또 다른 측면에 따르면, (a) 동일한 GOP 구조를 가지는 K(임의의 자연수)개의 시점 화상 시퀀스를 재배열하여 부호화한 단일 비트열을 전송받는 단계; (b) 상기 단일 비트열을 각 시점에 상응하는 시점 화상 데이터 단위로 분리하여 상응하는 시점 버퍼에 저장하는 단계; (c) 상기 시점 버퍼에 저장된 시점 화상 데이터를 동일 시간대의 GOP 단위별로 복호화하는 단계; 및 (d) 상기 시점 화상 데이터별로 가장 최근에 복호화된 시간대의 GOP의 복호화 시간에 따라 직후에 복호화될 시간대의 GOP에서의 화상 스킵 방법을 결정하고, 순차적으로 상기 단계 (c)를 반복하는 단계를 포함하는 다시점 화상 데이터 처리 방법이 제공될 수 있다.According to another aspect of the present invention, (a) receiving a single bit string obtained by rearranging and encoding a K (arbitrary natural number) view image sequence having the same GOP structure; (b) dividing the single bit stream into view image data units corresponding to each view and storing the single bit string in a corresponding view buffer; (c) decoding the viewpoint image data stored in the viewpoint buffer for each GOP unit of the same time zone; And (d) determining a method of skipping pictures in the GOP of the time zone to be decoded immediately after the decoding time of the GOP of the time zone decoded most recently for each view image data, and sequentially repeating step (c). A multi-viewpoint image data processing method may be provided.

바람직하게는, 상기 단계 (d)는 상기 시점 화상 데이터별로 상기 가장 최근에 복호화된 시간대의 GOP의 복호화 시간이 길수록 상기 직후에 복호화될 시간대의 GOP에서의 화상 스킵 수가 증가되도록 할 수 있다. Preferably, the step (d) may increase the number of picture skips in the GOP of the time zone to be decoded immediately after the decoding time of the GOP of the most recently decoded time zone is longer for each view image data.

그리고 상기 단계 (d)는, (d1) 상기 시점 화상 데이터별로 상기 가장 최근에 복호화된 시간대의 복호화 시간에 따라 상기 직후에 복호화될 시간대의 화상 스킵 수를 결정하는 단계; (d2) 상기 화상 스킵 수에 따라 스킵될 화상을 선택하는 단계를 포함할 수 있다. 여기서, 상기 단계 (d2)는 GOP 내에서 스킵 시작 화상 및 상기 스킵 시작 화상 이후의 화상들을 연속적으로 스킵하거나 GOP 내에서 화상들을 불연속적으로 스킵할 수 있다. The step (d) may include: (d1) determining the number of image skips of the time zone to be decoded immediately after the time according to the decoding time of the most recently decoded time zone for each of the viewpoint image data; (d2) may include selecting an image to be skipped according to the number of image skips. Here, the step (d2) may skip the skip start picture and the pictures after the skip start picture in the GOP continuously or discontinuously skip pictures in the GOP.

또한, 상기 단계 (d)는 상기 시점 화상 데이터별로 상기 가장 최근에 복호화된 시간대의 GOP의 복호화 시간이 길수록 상기 직후에 복호화될 시간대의 GOP에서의 시점 우선순위가 높아지도록 할 수 있다. In addition, in the step (d), the longer the decoding time of the GOP in the most recently decoded time zone for each viewpoint image data, the higher the viewpoint priority in the GOP of the time zone to be decoded immediately after.

여기서, 상기 시점 화상 데이터는 상기 단일 비트열 중 상기 각 시점에 상응하는 부호화된 상기 시점 화상 시퀀스 중 최소 하나의 GOP에 속하는 화상들을 포함하거나 최대 상기 시점 화상 시퀀스를 모두 포함할 수 있다. Here, the viewpoint image data may include images belonging to at least one GOP of the encoded viewpoint image sequences corresponding to the respective viewpoints of the single bit string, or may include all of the maximum viewpoint image sequences.

상기 목적들을 달성하기 위하여, 본 발명의 또 다른 측면에 따르면, 동일한 GOP 구조를 가지는 K(임의의 자연수)개의 시점 화상 시퀀스를 재배열하여 부호화한 단일 비트열을 각 시점에 상응하는 시점 화상 데이터 단위로 분리하고 동일 시간대의 GOP 단위별로 순차적으로 복호화하는 방법에 있어서, (a) 가장 최근에 복호화된 시간대의 GOP의 복호화 시간 정보를 입력받는 단계; (b) 상기 복호화 시간에 따라 직후에 복호화될 시간대의 GOP에서의 화상 스킵 방법을 결정하는 단계; 및 (c) 상기 화상 스킵 방법에 따라 상기 직후에 복호화될 시간대의 GOP의 복호화를 제어하 는 신호를 출력하는 단계를 포함하는 다시점 화상 데이터 처리 방법이 제공될 수 있다.In order to achieve the above objects, according to another aspect of the present invention, a viewpoint image data unit corresponding to each viewpoint includes a single bit string obtained by rearranging and encoding K (arbitrary natural numbers) viewpoint image sequences having the same GOP structure. 10. A method of separating and sequentially decoding each GOP unit of a same time zone, the method comprising: (a) receiving decoding time information of a GOP of a most recently decoded time zone; (b) determining a method of skipping pictures in a GOP in a time zone to be decoded immediately after the decoding time; And (c) outputting a signal for controlling the decoding of the GOP in the time zone to be decoded immediately after the image according to the picture skipping method.

바람직하게는, 상기 단계 (b)는 상기 시점 화상 데이터별로 상기 가장 최근에 복호화된 시간대의 GOP의 복호화 시간이 길수록 상기 직후에 복호화될 시간대의 GOP에서의 화상 스킵 수가 증가되도록 할 수 있다. Preferably, the step (b) may increase the number of picture skips in the GOP of the time zone to be decoded immediately after the decoding time of the GOP of the most recently decoded time zone increases for each view image data.

또한, 상기 단계 (b)는, (b1) 상기 시점 화상 데이터별로 상기 가장 최근에 복호화된 시간대의 복호화 시간에 따라 상기 직후에 복호화될 시간대의 화상 스킵 수를 결정하는 단계; (b2) 상기 화상 스킵 수에 따라 스킵될 화상을 선택하는 단계를 포함할 수 있다. Further, the step (b) may include: (b1) determining the number of image skips of the time zone to be decoded immediately after the time according to the decoding time of the most recently decoded time zone for each of the viewpoint image data; (b2) selecting an image to be skipped according to the number of skipped images.

그리고 상기 단계 (b2)는 GOP 내에서 스킵 시작 화상 및 상기 스킵 시작 화상 이후의 화상들을 연속적으로 스킵하거나 GOP 내에서 화상들을 불연속적으로 스킵할 수 있다. In the step (b2), the skip start picture and the pictures after the skip start picture in the GOP may be continuously skipped or the pictures may be discontinuously skipped in the GOP.

상기 단계 (b)는 상기 시점 화상 데이터별로 상기 가장 최근에 복호화된 시간대의 GOP의 복호화 시간이 길수록 상기 직후에 복호화될 시간대의 GOP에서의 시점 우선순위가 높아지도록 할 수 있으며, 상기 시점 화상 데이터는 상기 단일 비트열 중 상기 각 시점에 상응하는 부호화된 상기 시점 화상 시퀀스 중 최소 하나의 GOP에 속하는 화상들을 포함하거나 최대 상기 시점 화상 시퀀스를 모두 포함할 수 있다. In the step (b), the longer the decoding time of the GOP of the most recently decoded time zone for each of the viewpoint image data, the higher the viewpoint priority in the GOP of the time zone to be decoded immediately afterwards. The single bit string may include pictures belonging to at least one GOP among the encoded view picture sequences corresponding to the respective view points, or may include all of the view picture sequences at a maximum.

상기 목적들을 달성하기 위하여, 본 발명의 또 다른 측면에 따르면, 디지털 처리 장치에 의해 실행될 수 있는 명령어들의 프로그램이 유형적으로 구현되어 있으며 디지털 처리 장치에 의해 판독될 수 있는 기록매체로서, 제32항 내지 제38항 중 어느 한 항에 기재된 다시점 화상 데이터 처리 방법을 수행하는 프로그램을 기록한 기록매체가 제공될 수 있다.In order to achieve the above objects, according to another aspect of the present invention, there is provided a recording medium in which a program of instructions executable by a digital processing apparatus is tangibly embodied and can be read by the digital processing apparatus. A recording medium on which a program for performing the multi-viewpoint image data processing method according to any one of claims 38 to 38 can be provided.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 실시예를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 상세한 설명에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 유사한 구성요소에 대해 사용하였다. As the invention allows for various changes and numerous embodiments, particular embodiments will be illustrated in the drawings and described in detail in the written description. However, this is not intended to limit the present invention to specific embodiments, it should be understood to include all modifications, equivalents, and substitutes included in the spirit and scope of the present invention. In describing the drawings, similar reference numerals are used for similar elements.

제1, 제2, A, B 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다. 및/또는 이라는 용어는 복수의 관련된 기재된 항목들의 조합 또는 복수의 관련된 기재된 항목들 중의 어느 항목을 포함한다. Terms such as first, second, A, and B may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, the first component may be referred to as the second component, and similarly, the second component may also be referred to as the first component. The term and / or includes a combination of a plurality of related items or any item of a plurality of related items.

어떤 구성요소가 다른 구성요소에 "연결되어" 있다거나 "접속되어" 있다고 언급된 때에는, 그 다른 구성요소에 직접적으로 연결되어 있거나 또는 접속되어 있 을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해되어야 할 것이다. 반면에, 어떤 구성요소가 다른 구성요소에 "직접 연결되어" 있다거나 "직접 접속되어" 있다고 언급된 때에는, 중간에 다른 구성요소가 존재하지 않는 것으로 이해되어야 할 것이다. When a component is said to be "connected" or "connected" to another component, it may be directly connected to or connected to that other component, but other components may be present in the middle. It should be understood. On the other hand, when a component is said to be "directly connected" or "directly connected" to another component, it should be understood that there is no other component in between.

본 출원에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 출원에서, "포함하다" 또는 "가지다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.The terminology used herein is for the purpose of describing particular example embodiments only and is not intended to be limiting of the present invention. Singular expressions include plural expressions unless the context clearly indicates otherwise. In this application, the terms "comprise" or "have" are intended to indicate that there is a feature, number, step, operation, component, part, or combination thereof described in the specification, and one or more other features. It is to be understood that the present invention does not exclude the possibility of the presence or the addition of numbers, steps, operations, components, components, or a combination thereof.

다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥 상 가지는 의미와 일치하는 의미를 가지는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다.Unless defined otherwise, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art. Terms such as those defined in the commonly used dictionaries should be construed as having meanings consistent with the meanings in the context of the related art and shall not be construed in ideal or excessively formal meanings unless expressly defined in this application. Do not.

이하, 첨부한 도면들을 참조하여, 본 발명의 바람직한 실시예를 보다 상세하게 설명하고자 한다. 이하, 도면상의 동일한 구성요소에 대해서는 동일한 참조부호를 사용하고 동일한 구성요소에 대해서 중복된 설명은 생략한다. 또한, 본 발명을 설명함에 있어서 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우 그 상세한 설명을 생략한다.Hereinafter, with reference to the accompanying drawings, it will be described in detail a preferred embodiment of the present invention. Hereinafter, the same reference numerals are used for the same components in the drawings, and duplicate descriptions of the same components are omitted. In addition, in describing the present invention, when it is determined that the detailed description of the related known technology may unnecessarily obscure the gist of the present invention, the detailed description thereof will be omitted.

도 1a는 평행(parallel) 데이터인 다시점 화상 데이터의 획득을 위한 다수의 카메라 배열 방법을 나타낸 도면이고, 도 1b는 아크(arc) 데이터인 다시점 화상 데이터의 획득을 위한 다수의 카메라 배열 방법을 나타낸 도면이다. FIG. 1A is a diagram illustrating a plurality of camera arrangement methods for acquiring multiview image data, which is parallel data, and FIG. 1B is a diagram of a plurality of camera arrangement methods for acquiring multiview image data, which is arc data. The figure shown.

도 1a에 도시된 평행 데이터인 다시점 화상 데이터를 획득하기 위한 다수의 카메라 배열 방법은 피사체(110)와 수직으로 배치된 임의의 카메라를 기준으로 K(임의의 자연수)개의 카메라들(120a, 120b, …, 120K-1, 120K)을 일렬로 배치하는 것이다. 도 1a에 도시된 방법에 따라 K개의 카메라(120a, 120b, …, 120K-1, 120K)를 배열하는 경우 각 카메라와 피사체 간의 거리는 일반적으로 동일하지 않게 된다. 물론, 중앙에 배치된 카메라를 기준으로 양측에 배치된 두 개의 카메라와 피사체(110) 간의 거리는 일치할 수 있다.A plurality of camera arrangement methods for acquiring multi-view image data, which is parallel data shown in FIG. 1A, may include K (random natural numbers) cameras 120a and 120b based on an arbitrary camera disposed perpendicular to the subject 110. , ..., 120K-1, 120K) are arrange | positioned in a line. In the case of arranging K cameras 120a, 120b, ..., 120K-1, 120K according to the method shown in FIG. 1A, the distance between each camera and the subject is generally not the same. Of course, the distance between the two cameras disposed on both sides and the subject 110 based on the camera disposed at the center may coincide.

도 1b에 도시된 아크 데이터인 다시점 화상 데이터를 획득하기 위한 다수의 카메라 배열 방법은 피사체(110)를 중심으로 일정 거리(d) 이격된 원주 상에 K개의 카메라(120a, 120b, …, 120K-1, 120K)를 배치하는 것이다. 도 1b에 도시된 방법에 따라 K개의 카메라(120a, 120b, …, 120K-1, 120K)를 배열하는 경우 각 카메라와 피사체(110) 간의 거리는 d로 동일하다. A plurality of camera arrangement methods for acquiring multi-view image data, which is arc data shown in FIG. 1B, may include K cameras 120a, 120b,..., 120K on a circumference spaced a certain distance d from the subject 110. -1, 120K). In the case of arranging K cameras 120a, 120b, ..., 120K-1, 120K according to the method shown in FIG. 1B, the distance between each camera and the subject 110 is equal to d.

도 1a 내지 도 1b에 각각 예시된 방법으로 배치된 각각의 카메라는 시간이 경과함에 따라 설치된 렌즈 각도에 통해 입력되는 피사체 화상을 입력받아 화상 정 보를 생성하며, 당해 화상 정보는 앞서 설명한 바와 같이 I, B, B, P, B, B, P, …와 같이 이루어진 GOP(Group Of Picture)라는 세그먼트들로 구성된다. Each camera disposed in the method illustrated in FIGS. 1A to 1B receives image of a subject input through a lens angle installed as time passes, and generates image information. B, B, P, B, B, P,... It consists of segments called GOP (Group Of Picture).

상술한 도 1a 내지 도 1b에 도시된 다수의 카메라에 의해 생성된 각 화상 정보는 서로 동기화(synchronization)되어 있으며, 동일한 GOP 구조를 가지고 있다. 이하에서는 각 카메라에서 생성된 화상 정보를 시점 화상 시퀀스(sequence)라 한다. Each image information generated by the plurality of cameras shown in Figs. 1A to 1B described above is synchronized with each other and has the same GOP structure. Hereinafter, the image information generated by each camera is called a viewpoint image sequence.

이하에서 사용될 변수들은 다음과 같이 정의된다. Variables to be used below are defined as follows.

N : GOP 내의 화상의 개수. 즉, 인트라 구간(intra period)N: number of pictures in a GOP. In other words, intra period

M : I/P 구간M: I / P section

K : 시점(view)의 개수. 즉, 다시점 카메라의 개수K: number of views. That is, the number of multiview cameras

V_k : k번째 시점의 화상 시퀀스(view sequence), k ∈ {1, 2, …, K}V _k : View sequence at k-th time point, k ∈ {1, 2,... , K}

GoGOP(Group of GOPs) : 시간 영역 [t1, t1+N]에 있는 GOP의 집합GoGOP (Group of GOPs): set of GOPs in the time domain [t1, t1 + N]

D_k : GoGOP 내의 V_k 화상들의 복호화 시간의 합D _k : Sum of decoding times of V _k pictures in GoGOP

U_k : GoGOP 내의 V_k 화상들의 화상 복호화부(후술함) 이용률U _k : Utilization of the picture decoding unit (to be described later) of the V _k pictures in the GoGOP

S : GoGOP에서 복호화되는 화상의 개수S: Number of pictures to be decoded in GoGOP

일반적으로 사용되는 동화상 데이터 압축 기술로는 국제전기통신연합(ITU: International Telecommunication Union)에서 권고하는 H.261, H.262, H.263등과 국제표준화기구(ISO: International Organization for Standardization)의 동화상 전문가 그룹(MPEG: Motion Picture Experts Group)에서 제안하는 MPEG-1, MPEG-2, MPEG-4 등이 있다. 그리고, 최근에는 국제전기통신연합과 국제표준화기구에서 공동으로 더 높은 압축률을 구현할 수 있는 차세대 동화상 압축기술인 H.264 또는 MPEG-4 part10 AVC가 제정되었다. H.264 표준은 기존의 압축기술들에 비해 복잡하여 실제 구현이 어렵다는 단점이 있지만, 높은 압축률을 구현할 수 있고, 다양한 네트워크 환경에 쉽게 적응시킬 수 있는 장점이 있어 차세대 동화상 압축기술로서 그 연구가 활발히 진행되고 있는 바 이를 기초로 하여 이하 설명한다.Commonly used moving picture compression techniques include moving picture experts from H.261, H.262, H.263, and the International Organization for Standardization (ISO) recommended by the International Telecommunication Union (ITU). MPEG-1, MPEG-2, MPEG-4, etc. proposed by the Motion Picture Experts Group (MPEG). In recent years, H.264 or MPEG-4 part10 AVC, a next-generation moving picture compression technology, has been enacted jointly by the International Telecommunication Union and the International Organization for Standardization. The H.264 standard has the disadvantage of being difficult to implement because it is more complicated than the existing compression technologies, but it has the advantage of being able to implement high compression rate and easily adapt to various network environments. Based on this progress, it will be described below.

한편, H.264 표준에서는 베이스라인 프로파일(Baseline profile)과 메인 프로파일(Main profile) 형태의 표준 프로파일들을 정의하고 있다. 프로파일이란 사용할 수 있는 툴을 나타내는 것이며, 이중 베이스라인 프로파일은 휴대통신, 비디오폰 비디오 회의용 단말기의 A/V통신에 주로 사용되고, 메인 프로파일은 HD 급까지 망라한 셋탑, TV방송, DVD저장 등과 같은 방송 서비스에서의 고화질의 동화상 데이터를 사용하는 단말들의 특성을 정의한 집합이다. Meanwhile, the H.264 standard defines standard profiles in the form of a baseline profile and a main profile. A profile is a tool that can be used, and the dual baseline profile is mainly used for A / V communication of mobile communication and videophone video conferencing terminals, and the main profile includes broadcasting services such as set-top, TV broadcasting, and DVD storage covering up to HD level Is a set of defining characteristics of terminals using high quality moving picture data.

일반적으로 앞서 상술한 바와 같이 화상을 I, P, B 화상으로 분류하는 경우, 본 발명에서는 B 화상이 없으면 베이스라인 프로파일로, B 화상이 있으면 메인 프로파일로 정의하도록 한다. 이 구조는 MPEG-1, MPEG-2, MPEG-4에서도 적용 가능함은 물론이다. In general, when the image is classified into I, P, and B pictures as described above, in the present invention, if there is no B picture, the baseline profile is defined, and if there is a B picture, the main profile is defined. Of course, this structure can be applied to MPEG-1, MPEG-2 and MPEG-4.

도 2는 본 발명의 바람직한 일 실시예에 따른 다시점 화상 부호화기의 간략한 구성블록도이고, 도 3은 시점 화상 데이터를 재배열한 단일 비트열을 생성하는 방법을 나타낸 도면이다. 2 is a schematic block diagram of a multi-view picture encoder according to an exemplary embodiment of the present invention, and FIG.

도 2를 참조하면, 다시점 화상 부호화기(multi-view frame encoder, 200)는 다시점 화상 재배열부(210)와, 화상 부호화부(220)를 포함하여 구성된다. Referring to FIG. 2, the multi-view frame encoder 200 includes a multi-view image rearranger 210 and an image encoder 220.

다시점 화상 재배열부(210)는 K(자연수)개의 카메라들(120a, 120b, …, 120K-1, 120K)이 각각 촬영한 시점 화상 시퀀스를 입력받는다. 시점 화상 시퀀스는 각 카메라들이 동기화되어 있는 바, 시점 화상 시퀀스들은 모두 동일한 GOP 구조(예를 들어, 하나의 GOP 내의 프레임 수가 4라면, IPPPI…)를 가진다. The multi-view image rearranger 210 receives a viewpoint image sequence photographed by K cameras 120a, 120b, ..., 120K-1, and 120K, respectively. The viewpoint image sequence is that each camera is synchronized, so that the viewpoint image sequences all have the same GOP structure (eg, IPPPI… if the number of frames in one GOP is 4).

도 3을 참조하면, 각 시점에 해당하는 카메라들은 다수의 GOP를 가지는 시점 화상 시퀀스들(예를 들어, 카메라가 4대인 경우 시점은 V₁, V₂, V₃, V₄의 4개가 있음)을 생성한다. 각 시점 화상 시퀀스는 시간 경과에 따라 GOP₁, GOP₂, …를 가지게 되며, 동일 번호의 GOP들은 동일한 시간대에 생성된 것을 의미한다. GOP_i(V_k)는 k번째 시점 화상 시퀀스 V_k의 i번째 GOP를 가리킨다. 이하에서는 K = 4, N = 4인 것을 기준으로 설명한다. Referring to FIG. 3, cameras corresponding to respective viewpoints have viewpoint image sequences having a plurality of GOPs (for example, when four cameras are present, four viewpoints are V ₁ , V ₂ , V ₃ , and V ₄ ). Create Each viewpoint image sequence is divided into GOP ₁ , GOP ₂ ,... This means that GOPs with the same number are generated in the same time zone. GOP _i (V _k ) indicates the i-th GOP of the k-th view picture sequence V _k . Hereinafter, description will be made based on the fact that K = 4 and N = 4.

다시점 화상 재배열부(210)는 4개의 시점 화상 시퀀스 V₁, V₂, V₃, V₄를 재배열한 1개의 단일 비트열을 생성한다. 도 3의 (b)에 도시된 것과 같이, 재배열의 방법은 GOP₁(V₁), GOP₁(V₂), GOP₁(V₃), GOP₁(V₄), GOP₂(V₁), GOP₂(V₂) 순으로 정렬하는 것이다. 같은 시간대의 GOP들의 그룹인 GoGOP_i(Group of GOPs, 310)는 {GOP_i(V₁), GOP_i(V₂), GOP_i(V₃), GOP_i(V₄)}로 구성된다. 그리고 각 GoGOP는 시간대 순서별로 재배 치된다. 즉, GoGOP₁, GoGOP₂, GoGOP₃ 순으로 배열되어 단일 비트열을 생성하게 된다. The multi-view image rearranger 210 generates one single bit string in which four viewpoint image sequences V ₁ , V ₂ , V ₃ , and V ₄ are rearranged. As shown in Figure 3 (b), the method of rearrangement is GOP ₁ (V ₁ ), GOP ₁ (V ₂ ), GOP ₁ (V ₃ ), GOP ₁ (V ₄ ), GOP ₂ (V ₁ ) , Sort by GOP ₂ (V ₂ ). GoGOP _i (Group of GOPs, 310), which is a group of GOPs in the same time zone, is composed of {GOP _i (V ₁ ), GOP _i (V ₂ ), GOP _i (V ₃ ), GOP _i (V ₄ )}. Each GoGOP is rearranged by time zone order. That is, GoGOP ₁ , GoGOP ₂ , and GoGOP ₃ are arranged in order to generate a single bit string.

카메라의 개수(즉, 시점의 개수), MPEG GOP의 크기, 복호화기 프로파일 등 중 하나 이상의 정보는 단일 비트열의 헤더에 포함되거나 별도의 비트열로 생성되어 후술할 다시점 화상 복호화기로 전송하는 것이 가능하다. One or more pieces of information of the number of cameras (ie, the number of viewpoints), the size of the MPEG GOP, the decoder profile, etc. may be included in a header of a single bit string or generated as a separate bit string and transmitted to a multi-view image decoder to be described later. Do.

화상 부호화부(220)는 GoGOP가 시간대 순서별로 재배치된 단일 비트열을 다시점 화상 재배열부(210)로부터 입력받아 H.264/AVC로 부호화한다. 이렇게 부호화된 단일 비트열은 하나의 비트열이므로, 기존에 다시점 화상 데이터를 동시 방식으로 부호화하기 위해 시점 개수와 동일한 개수의 부호화기가 필요하던 것과는 달리 하나의 부호화기로 다시점 화상 데이터, 즉 K개의 시점 화상 시퀀스를 부호화하는 것이 가능하다. 그리고 부호화된 단일 비트열은 단일 전송 채널(300)을 통해 전송된다. The image encoder 220 receives a single bit string in which GoGOP is rearranged by time zone order from the multi-view image rearranger 210 and encodes the H.264 / AVC. Since the encoded single bit string is one bit string, unlike the conventional number of encoders equal to the number of viewpoints for encoding the multi-view image data in a simultaneous manner, a single multi-view image data, that is, K It is possible to encode a viewpoint image sequence. The encoded single bit string is transmitted through the single transport channel 300.

이하에서는 단일 전송 채널(300)을 통해 전송된 부호화된 단일 비트열을 동시 방식으로 복호화하여 다시점 화상을 생성하면서도 각 시점 간의 동등한 화질 유지가 가능한 장치 및 그 방법에 대하여 상세히 설명한다. Hereinafter, an apparatus and method for generating a multi-view image by simultaneously decoding a coded single bit string transmitted through a single transmission channel 300 and maintaining the same image quality between respective viewpoints will be described in detail.

도 4는 본 발명의 바람직한 일 실시예에 따른 다시점 화상 복호화기의 간략한 구성블록도이고, 도 5는 GoGOP의 각 GOP들의 복호화 시간의 일례이다. 도 6은 도 4의 다시점 화상 복호화기의 화상 스킵부의 간략한 구성블록도이고, 도 7은 본 발명의 바람직한 일 실시예에 다른 GOP 내에서 복호화 화상을 선택하는 방법이고, 도 8은 본 발명의 바람직한 다른 실시예에 다른 GOP 내에서 복호화 화상을 선택하는 방법이다. 4 is a simplified block diagram of a multi-view picture decoder according to an exemplary embodiment of the present invention, and FIG. 5 is an example of a decoding time of each GOP of a GoGOP. FIG. 6 is a schematic block diagram of a picture skip part of the multi-view picture decoder of FIG. 4. FIG. 7 is a method of selecting a decoded picture in a GOP according to a preferred embodiment of the present invention. Another preferred embodiment is a method for selecting a decoded picture in another GOP.

도 4를 참조하면, 다시점 화상 복호화기(400)는 비트열 분배기(410), K개의 시점 버퍼(420a, 420b, …, 420K), 화상 복호화부(430) 및 스케쥴러(440)를 포함한다. 스케쥴러(440)는 이용률 산출부(441), 시점 우선순위 결정부(443), 화상 스킵부(445)를 포함한다. Referring to FIG. 4, the multiview image decoder 400 includes a bit stream divider 410, K view buffers 420a, 420b,..., 420K, an image decoder 430, and a scheduler 440. . The scheduler 440 includes a utilization calculator 441, a viewpoint priority determiner 443, and an image skip unit 445.

비트열 분배기(410)는 단일 전송 채널(300)을 통해 전송되는 단일 비트열을 입력받는다. 단일 비트열은 상술한 바와 같이 다시점 화상 부호화기(200)에 의해 K개의 시점 화상 시퀀스들이 재배열되고 부호화된 데이터를 의미한다. The bit string divider 410 receives a single bit string transmitted through a single transmission channel 300. As described above, the single bit string refers to data in which K view image sequences are rearranged and coded by the multi-view image encoder 200.

단일 비트열의 헤더 또는 단일 비트열과 함께 전송되는 별도의 비트열에는 카메라의 개수, MPEG GOP의 크기 등의 정보가 포함되어 있다. 비트열 분배기(410)는 이러한 정보를 통해 이용하고자 하는 시점 버퍼의 수를 결정하고, 단일 비트열에 순차적으로 배열되어 있는 GOP들을 시점 화상 데이터 단위로 분리하여 각 카메라(즉, 시점)에 상응하는 K개의 시점 버퍼(420a, 420b, …, 420K)에 저장한다. 시점 화상 데이터는 각 시점 화상 시퀀스를 최소 크기인 경우에는 GOP 크기만큼, 최대 크기인 경우에는 시점 화상 시퀀스를 모두 포함할 수 있다. 시점 버퍼는 시점 화상 데이터의 크기에 따라 하나의 GOP 크기만큼의 시점 화상 시퀀스 중 일부를 저장하고 각 GOP마다 복호화 후에 버퍼를 비우고 다음 GOP를 새롭게 저장하거나 복수의 GOP를 저장할 수 있다.A separate bit string transmitted along with a single bit string header or a single bit string includes information such as the number of cameras and the size of the MPEG GOP. The bit stream divider 410 determines the number of view buffers to be used based on this information, and divides GOPs sequentially arranged in a single bit stream into view image data units, and corresponds to K corresponding to each camera (ie, view). Are stored in the four viewpoint buffers 420a, 420b, ..., 420K. The viewpoint image data may include all of the viewpoint image sequences by the GOP size when the viewpoint image sequence is the minimum size, and when the maximum size. The viewpoint buffer may store some of the viewpoint image sequences corresponding to one GOP size according to the size of the viewpoint image data, empty the buffer after decoding for each GOP, newly store the next GOP, or store a plurality of GOPs.

각 시점 버퍼(420a, 420b, …, 420K)는 K개의 카메라에서 촬영한 시점 화상 시퀀스에 상응하는 부호화된 GOP들을 저장하게 된다. 시점 버퍼에는 시점 화상 데이터가 저장된다. 시점 화상 데이터는 최소 크기가 GOP 단위이며(이는 화상 복호화부(430)에서의 복호화 처리 기준이 GOP 이기 때문이다), 최대 크기는 각 시점의 시점 화상 시퀀스 전체가 될 수 있다. Each viewpoint buffer 420a, 420b, ..., 420K stores encoded GOPs corresponding to a sequence of viewpoint images photographed by K cameras. The viewpoint image data is stored in the viewpoint buffer. The viewpoint image data has a minimum size in units of GOP (since the decoding processing reference in the image decoding unit 430 is a GOP), and the maximum size may be the entire viewpoint image sequence of each viewpoint.

시점 화상 데이터가 최소 단위인 GOP 크기를 가지고 있는 경우에, 특정 시점 버퍼에서 후술할 우선순위에 따라 복호화가 이루어진 후 타 시점 버퍼에서 복호화가 이루어지는 동안에 특정 시점 버퍼에 저장되어 있던 시점 화상 데이터가 시점 버퍼에서 바로 삭제되고, 다음 시간대의 GOP의 시점 화상 데이터가 기록될 수 있다. 또는 특정 시간대의 각 시점의 시점 화상 데이터, 즉 GOP들의 복호화가 모두 완료된 후에 일괄적으로 각 시점 버퍼들을 다음 시간대의 GOP들로 갱신할 수도 있다. 그 외에도 시점 버퍼에 시점 화상 데이터를 저장하고 읽어내는 방법은 다양한 바 이는 해당 기술 분야에서 통상의 지식을 가진 자라면 변형이 가능하다. When the viewpoint image data has the GOP size that is the minimum unit, the viewpoint image data stored in the viewpoint buffer is decoded according to the priority to be described later in the viewpoint buffer after the decoding is performed in the other viewpoint buffer. Can be immediately deleted, and the viewpoint image data of the GOP of the next time zone can be recorded. Alternatively, each of the viewpoint buffers may be collectively updated with GOPs of the next time zone after decoding of viewpoint image data, that is, GOPs, of each viewpoint in a specific time zone is completed. In addition, various methods of storing and reading the viewpoint image data in the viewpoint buffer may be modified by those skilled in the art.

그리고 화상 복호화부(430)는 필요에 따라 각 시점 버퍼(420a, 420b, …, 420K)에 저장된 각 시점의 GOP들을 읽어낸다. The image decoding unit 430 reads the GOPs of respective viewpoints stored in the viewpoint buffers 420a, 420b,..., 420K as necessary.

화상 복호화부(430)는 각 시점 버퍼(420a, 420b, …, 420K)에 저장된 시점 화상 데이터 중에서 동일 시간대에 해당하는 각 시점의 GOP들, 즉 하나의 GoGOP 단위로 부호화된 데이터를 복호화 처리한다. 복호화 처리를 함에 있어서 스케쥴러(440)에서 출력되는 제어 신호에 따라 GoGOP 내의 각 시점의 GOP의 복호화 시간, 복호화 대상 화상, 스킵 화상, 스킵 화상 수 등이 변화될 수 있다. GOP의 복호화 시간, 복화화 대상 화상, 스킵 화상, 스킵 화상 수는 복호화하고자 하는 GOP의 시 간대에 따라 후술할 스케쥴러(440)에 의해 적응적으로 변화가 가능하다. The image decoding unit 430 decodes GOPs of each viewpoint corresponding to the same time zone, that is, data encoded in one GoGOP unit, among viewpoint image data stored in each viewpoint buffer 420a, 420b, ..., 420K. In the decoding process, the decoding time, decoding target picture, skip picture, skip picture number, and the like of the GOP at each time point in the GoGOP may be changed according to the control signal output from the scheduler 440. The decoding time, the decoding target image, the skipped image, and the number of skipped images of the GOP can be adaptively changed by the scheduler 440 which will be described later according to the time zone of the GOP to be decoded.

스케쥴러(440)는 화상 복호화부(430)에서의 복호화에 있어서 가장 최근에 복호화된 GOP 내의 시점 화상 데이터들의 복호화 시간으로부터 적응적으로 변화시킬 직후에 복호화될 GOP에서의 각 시점 화상 데이터들의 복호화 시간, 복호화 대상 화상, 스킵 화상, 스킵 화상 수 등을 제어하는 제어 신호를 화상 복호화부(430)에 전송한다. The scheduler 440 decodes each of the viewpoint image data in the GOP to be decoded immediately after adaptively changing from the decoding time of the viewpoint image data in the most recently decoded GOP in the decoding by the image decoding unit 430, A control signal for controlling the decoding target image, the skipped image, the number of skipped images, and the like is transmitted to the image decoder 430.

스케쥴러(440)는 각 시점의 시점 화상 데이터를 GOP 단위로 처리한다. The scheduler 440 processes the viewpoint image data of each viewpoint in GOP units.

K개의 시점이 있는 경우 각 시점의 GOP들이 N개의 화상으로 구성된 것으로 가정하면, 하나의 GoGOP의 화상 개수는 KㅧN이다. 따라서, KㅧN 화상을 정해진 프레임률(frame rate)(예를 들어, 24 fps, 30 fps)을 만족하며 모두 복호화하는 것은 시스템 성능 등의 문제로 어려움이 있다. 따라서, KㅧN 개의 화상 중에서 일부는 드롭(drop)/스킵(skip)(이하에서는 '스킵'이라 통칭한다)된다. 즉, 스킵된 화상들을 제외한 나머지 화상들만이 복호화되는 바 스킵할 화상을 결정하는 방법은 후술한다. If there are K viewpoints, it is assumed that the GOPs of each viewpoint are composed of N pictures, and the number of pictures of one GoGOP is K ㅧ N. Therefore, it is difficult to decode all K ㅧ N pictures with a predetermined frame rate (for example, 24 fps, 30 fps) due to problems such as system performance. Therefore, some of the K ㅧ N pictures are dropped / skip (hereinafter referred to as 'skip'). That is, a method of determining an image to skip since only remaining pictures except the skipped picture is decoded will be described later.

이용률 산출부(441)는 도 5에 도시된 바와 같이, 화상 복호화부(430)로부터 동일 시간대의 GOP들, 즉 동일 GoGOP(310) 내의 각 시점의 GOP들(GOP(V₁), GOP(V₂), GOP(V₃), …, GOP(V_K))의 가장 최근의 각각의 복호화 시간(D₁, D₂, D₃, …, D_K)을 입력받는다. 가장 최근의 GOP(V_k)의 복호화 시간 D_k으로부터 시점 V_k의 화상 복호화부 이용률 U_k을 산출하면, 하기의 수학식 1과 같다. As shown in FIG. 5, the utilization calculator 441 uses the GOPs of the same time zone, that is, the GOPs GOP (V ₁ ) and the GOP (V) at the same time within the same GoGOP 310 from the image decoder 430. ₂ ), the most recent respective decoding times D ₁ , D ₂ , D ₃ , ..., D _{K of} GOP (V ₃ ), ..., GOP (V _K )) are input. When the picture decoding unit utilization rate U _k of the viewpoint V _k is _calculated from the most recent decoding time D _k of GOP (V _k ), the following equation (1) is given.

여기서,

이다. here,

to be.

복호화 시간의 총합 중에서 당해 시점 V_k의 복호화 시간 D_k이 차지하는 비중을 화상 복호화부 이용률 U_k로 결정한다. Art from the sum of a decoding time to determine the decoding time D _k is the proportion of time occupied by V _k to the picture decoding unit usage rate U _k.

시점 우선순위 결정부(443)는 이용률 산출부(441)에서 산출된 가장 최근에 복호화된 시간대의 각 시점의 GOP의 화상 복호화부 이용률을 이용하여 화상 복호화부(430)에서 직후에 복호화될 시간대의 각 시점의 GOP의 복호화를 위한 우선순위를 결정한다. 예를 들어, n번째 GoGOP 처리 후 스케쥴러(440)에서의 제어 신호 생성 동안 n+1번째 GoGOP는 이미 복호화처리될 수 있으므로, n번째 GoGOP 처리에서의 복호화 시간을 이용하여 산출한 우선순위 또는 후술할 스킵 화상 수, 스킵 화상들은 n+1번째가 아닌 n+2번째, n+3번째 GoGOP가 될 수도 있다. The viewpoint priority determiner 443 uses the image decoder portion utilization rate of the GOP of each viewpoint in the most recently decoded time zone calculated by the utilization rate calculator 441 to determine the time zone to be decoded immediately after the image decoder 430. The priority for decoding the GOP at each time point is determined. For example, during the control signal generation in the scheduler 440 after the nth GoGOP processing, the n + 1th GoGOP may already be decoded, and thus the priority calculated using the decoding time in the nth GoGOP processing or described later. The number of skip pictures and the skip pictures may be n + 2th and n + 3th GoGOPs instead of the n + 1th.

가장 최근에 복호화된 GOP 내에서의 화상 복호화부 이용률에 따라 직후에 복호화될 GOP에서 복호화되는 시점의 우선순위는 매 GOP마다 동적으로 변화하게 된다. 가장 최근에 복호화된 시간대의 화상 복호화부 이용률이 낮은 시점은 상대적으로 값이 큰 시점에 비해 적은 개수의 화상이 복호화되었음을 의미하므로, 직후에 복호화될 시간대의 GOP에서 복호화를 위한 우선순위를 높임으로써, 재생 프레임률 을 동등하게 유지시키게 되고, 시간 축상으로 각 시점이 동등한 복호화 시간을 가지며, 또한 각 시점이 균등한 재생 기회를 가지도록 구현한다. 예를 들어, 하기의 표 1과 같이 가장 최근에 복호화된 GOP 내에서의 각 시점의 화상 복호화부 이용률을 산출하고, 이 중 가장 작은 화상 복호화부 이용률을 가지는 시점 V₄에 대해서 직후에 복호화될 GOP의 복호화시 가장 큰 우선순위를 할당하도록 한다. The priority of the time point to be decoded in the GOP to be decoded immediately after the decoded part in the most recently decoded GOP is dynamically changed for each GOP. The point at which the picture decoding unit utilization rate of the most recently decoded time zone is low means that a small number of pictures are decoded compared to the time point where the value is relatively large. The reproduction frame rate is kept equal, and each viewpoint has an equal decoding time on the time axis, and each viewpoint has an equal reproduction opportunity. For example, calculating a picture decoding unit usage rate of each point within a decoded GOP to the last as shown in Table 1 below, and of which to be decoded immediately after with respect to time of V ₄ with the smallest picture decoding unit utilization GOP To assign the highest priority when decoding.

시점Viewpoint 가장 최근에복호화된 GOP의 화상 복호화부 이용률Utilization rate of picture decoder of most recently decoded GOP 직후에 복호화될 GOP에서의 우선 순위Priority in GOP to be Decrypted Immediately V₁ V ₁ 0.350.35 44 V₂ V ₂ 0.240.24 33 V₃ V ₃ 0.210.21 22 V₄ V ₄ 0.200.20 1One

시점 우선순위 결정부(443)의 결정 결과는 화상 복호화부(430)에 입력되어, 화상 복호화부(430)에서 복호화를 함에 있어서 각 시점의 복호화 순서를 결정하게 된다. The determination result of the viewpoint priority determiner 443 is input to the image decoder 430 to determine the decoding order of each viewpoint in decoding by the image decoder 430.

화상 스킵부(445)는 각 시점의 현재 시간대의 GOP의 화상 복호화부 이용률을 활용하여 다음 시간대의 GOP의 복호화시 화상 스킵 방법을 결정한다. 다음 시간대의 GOP의 복호화가 시작되기 전에 화상 복호화부 이용률이 높은 시점의 화상을 상대적으로 많이 드롭시킴으로써 모든 시점들이 공평하게 복호화되도록 한다. 즉, 해당 버퍼의 모든 시점의 화상들의 복호화가 끝나면 화상 복호화부 이용률은 다시 계산되어 다음 시간대의 GOP의 화상 스킵을 결정하게 된다. The picture skip unit 445 determines a picture skip method when decoding the GOP in the next time zone by utilizing the picture decoder utilization rate of the GOP in the current time zone at each time point. Before the decoding of the GOP of the next time zone starts, all the viewpoints are decoded fairly evenly by dropping a relatively large number of images having a high rate of use of the image decoding unit. In other words, when the decoding of the pictures of all the time points of the buffer is finished, the picture decoding unit utilization rate is recalculated to determine the skip of the picture of the GOP of the next time zone.

화상 스킵부(445)는 도 6을 참조하여 설명한다. 화상 스킵부(445)는 각 시점마다 스킵되는 화상의 개수를 결정하는 스킵 화상 결정 모듈(510)과, 복호화되는 화상들을 선택하는 복호화 화상 선택 모듈(520)을 포함할 수 있다. The image skip part 445 is demonstrated with reference to FIG. The image skip unit 445 may include a skipped image determination module 510 that determines the number of skipped images at each time point, and a decoded image selection module 520 that selects the decoded images.

스킵 화상 결정 모듈(510)은 현재 시간대의 GOP의 화상 복호화부 이용률로부터 다음 시간대의 GOP에서의 화상 스킵 수를 결정하게 된다. 화상 스킵 수를 결정하는 방법은 다음과 같다. The skipped picture determination module 510 determines the number of skipped pictures in the GOP in the next time zone from the picture decoder utilization rate of the GOP in the current time zone. The method of determining the number of image skips is as follows.

N개의 화상으로 구성되는 1개의 GOP(V_k)가 있으며, ℓ번째 화상인 F_ℓ을 스킵하면 후속하는 F_ℓ+1, F_ℓ+2, F_ℓ+3, …, F_N도 스킵된다고 가정한다. 이 경우 실제 복호화되는 화상의 개수는 (ℓ-1)이 된다. 따라서, GoGOP에서 총 복호화되는 화상의 개수는 하기의 수학식 2와 같다.There is one GOP (V _k ) composed of N pictures, and skipping F _l , the l-th picture, causes subsequent F _{L + 1} , F _{L + 2} , F _{L + 3,.} Assume that F _{N is} also skipped. In this case, the number of pictures actually decoded is (l-1). Therefore, the total number of pictures to be totally decoded in GoGOP is expressed by Equation 2 below.

여기서, ℓ_k는 GOP(V_k)에서 스킵되는 첫번째 화상 번호이다. 예를 들어, ℓ₁은 첫번째 시점의 GOP(V₁)에서 스킵되는 첫번째 화상 번호, ℓ₂은 두번째 시점의 GOP(V₂)에서 스킵되는 첫번째 화상 번호이다. Here, l _k is the first picture number skipped in the GOP (V _k ). For example, l ₁ is the first picture number skipped in the GOP (V ₁ ) at the first time point, and l ₂ is the first picture number skipped in the GOP (V ₂ ) at the second time point.

GoGOP에서의 최적의 성능은 KㅧN 개의 화상, 즉 모든 화상을 복호화했을 때가 된다. 하지만, 실제 다시점 화상에서 모든 화상의 복호화가 가능한 시스템은 불가능하기 때문에, GoGOP에서 복호화되는 화상의 개수를 S로 가정한다. S는 [0, Kㅧ N]에 포함되는 어느 한 값이 되며, 시스템 성능에 따라 증감이 가능하다. 따라서, 상기의 수학식 2는 하기의 수학식 3, 4와 같은 결과를 획득하게 된다. The optimum performance in GoGOP is when K ㅧ N pictures, i.e., all pictures are decoded. However, since a system that is capable of decoding all pictures in an actual multi-view picture is impossible, it is assumed that the number of pictures to be decoded in GoGOP is S. S is any value included in [0, K ㅧ N], and can be increased or decreased depending on system performance. Therefore, Equation 2 above obtains the same result as Equation 3 and 4 below.

상기 수학식 4와 같은 결과를 만족하는 ℓ_k 값을 구하기 위하여 상기의 수학식 2를 다시 이용한다. 계산된 화상 복호화부 이용률 U_k에 따라 ℓ_k는 역선형 매핑을 이용하여 구한다. 즉, U = [0, 1] 은

= [S, 1]로 역매핑된다. 즉, 서로 수직하는 가로축과 세로축을 가정할 때, 가로축은 화상 복호화부 이용률 U 을, 세로축은

를 의미한다. U가 0일 때

는 S에, U가 1일 때

는 1에 대응하는 일차함수를 생각하면, 화상 복호화부 이용률 U에 따른

이 결정될 수 있다. Equation 2 above is used again to obtain a value of l _k that satisfies the same result as Equation 4 above. L _k is obtained using inverse linear mapping according to the calculated picture decoding unit utilization rate U _k . That is, U = [0, 1] is

= Reverse mapped to [S, 1]. That is, assuming that the horizontal axis and the vertical axis perpendicular to each other, the horizontal axis is the image decoding unit utilization rate U,

Means. When U is 0

Is in S, when U is 1

Considering a linear function corresponding to 1,

This can be determined.

이는 가장 최근에 복호화된 시간대의 GOP에서의 화상 복호화부 이용률이 높은 경우 직후에 복호화될 시간대의 GOP에서는 복호화되는 화상의 개수를 작게 한다. 이때 복호화되는 화상의 개수가 작다는 것은 스킵되는 첫번째 화상 번호인 ℓ_k 가 작다는 것을 의미하므로, 가장 최근에 복호화된 시간대의 GOP에서 특정 시점의 화상 복호화부 이용률과 직후에 복호화될 시간대의 GOP에서 해당 시점의 스킵되는 첫번째 화상 번호인 ℓ_k 는 화상 복호화부 이용률이 증가함에 따라 스킵되는 첫번째 화상 번호가 작아지는 역선형 관계에 있게 된다. This reduces the number of pictures to be decoded in the GOP in the time zone to be decoded immediately after the use of the picture decoding unit in the GOP in the most recently decoded time zone is high. In this case, the small number of pictures to be decoded means that the first picture number to be skipped, l _k, is small. 1 _k , which is the first picture number to be skipped at that time, is in an inverse linear relationship in which the first picture number to be skipped becomes smaller as the picture decoding unit utilization increases.

수식으로 표현하면 하기의 수학식 5와 같다. When expressed as an equation, Equation 5 below.

모든 K개의 시점에 대하여 산출된 스킵되는 첫번째 화상 번호를 합하게 되면, 하기의 수학식 6과 같다. If the first skipped picture number calculated for all K viewpoints is summed, Equation 6 below is obtained.

수학식 4와 수학식 6으로부터 GOP(V_k)에서 스킵하게 되는 첫번째 화상 번호

는 하기의 수학식 7과 같이 계산된다. First picture number to be skipped in GOP (V _k ) from

Equations

4 and 6

Is calculated as in Equation 7 below.

여기서,

는 플로어(floor) 함수이다. 플로어 함수는

안의 값보다 작거나 같은 최대의 정수를 출력값으로 한다. here,

Is a floor function. Floor function is

The maximum integer less than or equal to the value inside is used as the output value.

이렇게 구해진

값을 이용하여 해당 시점 이후에 복호화될 시간대의 GOP 복호화시 스킵해야 하는 화상의 개수를 결정하는 것이 가능하게 된다. So obtained

It is possible to determine the number of pictures to be skipped during GOP decoding of the time zone to be decoded after the corresponding time point using the value.

복호화 화상 선택 모듈(520)은 스킵 화상 결정 모듈(510)에서 결정된 스킵 화상 수로부터 화상 스킵 방법을 결정하게 된다. 화상 스킵 방법은 연속 스킵 또는 불연속 스킵이 있을 수 있다. The decoded picture selection module 520 determines a picture skip method from the number of skip pictures determined by the skip picture determination module 510. The picture skip method may be continuous skip or discontinuous skip.

연속 스킵부(521)는 구해진

값을 스킵 시작 화상 번호로 결정한다. 도 7에 도시된 바와 같이, 하나의 GOP 내에 10개의 화상이 있는 경우, 스킵 시작 화상은 6번째 화상으로 계산되었으면, 스킵 시작 화상 및 스킵 시작 화상 이후의 화상들(7, 8, 9, 10번째 화상들)은 모두 스킵되도록 한다. 불연속 스킵부(523)는 구해진

값으로부터 하나의 GOP 내에서 스킵되는 화상의 수를 결정하게 된다. 그리고 스킵 화상 수에 따라, 그리고 시각 화상 화질을 이용하여 스킵되는 화상의 간격을 조절하여 불연속으로 스킵 화상을 결정하여 스킵되도록 한다. Continuous skip portion 521 is obtained

The value is determined by the skip start picture number. As shown in Fig. 7, when there are 10 pictures in one GOP, the skip start picture and the pictures after the skip start picture (7, 8, 9, 10th) have been calculated as the 6th picture. Pictures) are all skipped. The discontinuous skip portion 523 is obtained

The value determines the number of pictures that are skipped in one GOP. The skipped image is determined to be skipped by adjusting the interval of the skipped image according to the number of skipped images and the visual image quality.

복호화 화상 선택 모듈(520)은 화상 스킵 방법을 결정함에 있어서, 프로파일 모듈(610) 및/또는 비용 함수 모듈(620)에서의 출력값에 영향을 받는다. 여기서, 프로파일 모듈(610) 및/또는 비용 함수 모듈(620)은 화상 스킵부(445)에 더 포함될 수 있다.The decoded picture selection module 520 is influenced by the output values from the profile module 610 and / or the cost function module 620 in determining the picture skipping method. In this case, the profile module 610 and / or the cost function module 620 may be further included in the image skip unit 445.

프로파일 모듈(610)은 복호화되는 화상이 베이스파인 프로파일 시퀀스인지 메인 프로파일 시퀀스인지 여부를 결정한다. 일반적으로 H.264/AVC의 베이스라인 프로파일은 I, P 화상만을 사용하므로 복호화는 상대적으로 간단한다. P 화상은 전방위 예측(forward perdiction)으로 부호화되기 때문에, 이전 참조 화상이 존재해야 복원이 가능하다. 다음은 베이스라인 프로파일에서 사용하는 GOP 구조를 나타낸다. N=6이고, M=1이다. The profile module 610 determines whether the decoded picture is a base fine profile sequence or a main profile sequence. In general, since the baseline profile of H.264 / AVC uses only I and P pictures, decoding is relatively simple. Since the P picture is encoded by forward perdiction, the previous reference picture must exist to be reconstructed. The following shows the GOP structure used in the baseline profile. N = 6 and M = 1.

I₁ P₁ P₂ P₃ P₄ P₅ I₂ P₆ P₇ P₈ …I ₁ P ₁ P ₂ P ₃ P ₄ P ₅ I ₂ P ₆ P ₇ P ₈ .

화상 스킵은 다음과 같은 특징을 가지고 수행된다. Image skip is performed with the following characteristics.

1) I₁ 화상이 스킵되면 다음 P₁, …, P₅ 화상들이 모두 스킵된다.1) If I ₁ image is skipped, then P ₁ ,. , P ₅ pictures are all skipped.

2) P₁ 화상이 스킵되면 P₂, …, P₅ 화상들이 모두 스킵된다. 따라서, P_i 화상이 스킵되면, P_i+1 부터의 해당 GOP의 모든 P 화상들이 스킵된다. 2) If P ₁ image is skipped, P ₂ ,. , P ₅ pictures are all skipped. Therefore, if the P _i picture is skipped, all P pictures of the corresponding GOP from P _{i + 1} are skipped.

3) 따라서, 현재 GOP에서 3개의 화상 및 다음 GOP에서 2개의 화상을 복호화하게 되면, I₁ P₁ P₂ 및 I₂ P₆가 재생된다. 3) Therefore, when three pictures in the current GOP and two pictures in the next GOP are decoded, I ₁ P ₁ P ₂ and I ₂ P ₆ are reproduced.

여기서, 베이스라인 프로파일은 상술한 화상 스킵 방법 중에서 연속 스킵 방법만이 사용가능하다. 복호화되는 화상들이 연속적으로 발생하고, 마찬가지로 스킵되는 화상들로 연속적으로 발생한다. Here, only the continuous skip method may be used as the baseline profile in the above-described image skip method. The pictures to be decoded occur in succession, and likewise in succession to the skipped pictures.

메인 프로파일에서의 선택 순서는 화상 타입에 따라 달라진다. 스킵되는 첫번째 프레임의 번호

가 정해지면, 즉 복호화되는 화상의 수가 결정되면, 화상 간의 예측 구조에 따라 연속적 또는 불연속적으로 복호화하고자 하는 화상 또는 스킵하고자 하는 화상을 선택하게 된다. The order of selection in the main profile depends on the picture type. The number of the first frame to be skipped

If is determined, that is, the number of pictures to be decoded is selected, the picture to be decoded continuously or discontinuously or the picture to be skipped is selected according to the prediction structure between pictures.

비용 함수 모듈(620)은 시각 화상 화질을 객관적으로 정량화한 비용 함수(cost function)를 통해 시각 화상 화질을 결정하고, 화상 스킵 방법을 결정하도록 한다. The cost function module 620 determines the visual image quality through a cost function that objectively quantifies the visual image quality, and determines the image skip method.

비용 함수는 다음과 같이 계산된다. The cost function is calculated as

1) 화상 i가 연속적으로 발생하는 드롭 화상에 속하고, 동시에 k_i번째의 연속적인 화상이라면, 해당 화상의 비용 c_i는 k_i이다. 1) If the image i belongs to a drop image that occurs continuously and at the same time a k _i -th consecutive image, the cost c _i of the image is k _i .

2) 연속적이 아닌 2개의 스킵 화상 간의 거리가 d_i이면, 비용 c_i는

이다. 2) If the distance between two non-contiguous skip pictures is d _i , the cost c _i is

to be.

3) 전체 비용 함수

이다. 여기서, F는 시퀀스에서 스킵되는 화상들의 집합이고, G는 고려하는 GOP 내의 전체 화상의 집합이다. 3) total cost function

to be. Here, F is a set of pictures skipped in a sequence, and G is a set of all pictures in the GOP under consideration.

도 7 및 도 8에 도시된 비용 함수의 값은 하기의 수학식 8, 9와 같이 계산된다. The value of the cost function shown in FIGS. 7 and 8 is calculated as in Equations 8 and 9 below.

따라서,

가

보다 작으며, 비용 함수가 작은 도 8의 방법이 우수한 시각 화상 화질을 가지게 된다. 동일한 개수의 화상을 스킵하지만, 실제 계산되는 비용은 다른 결과를 가져오게 된다. 따라서, 시스템 성능 및 화질에 따라 적응적으로 화상 스킵 방법을 결정하는 것이 가능하다. therefore,

end

The smaller, smaller cost function of FIG. 8 results in superior visual image quality. Skipping the same number of images, but the actual cost of the calculation will result in different results. Therefore, it is possible to adaptively determine the image skip method according to the system performance and the image quality.

화상 스킵부(445)는 스킵 화상 결정 모듈(510)에서 결정된 스킵 화상 수에 따라 프로파일 모듈(610) 및/또는 비용 함수 모듈(620)에서의 출력값을 참조하여 화상 스킵 방법을 결정하게 되고, 이에 따라 스킵되는 화상을 제외한 나머지, 즉 복호화되는 화상 번호(530)를 결정하여 화상 복호화부(430)에 제어 신호로서 이를 알려준다. 화상 복호화부(430)는 제어 신호의 한 형태인 복호화되는 화상 번호(530)에 상응하는 화상들을 직후 복호화될 시간대의 GOP 복호화시 복호화하게 된다. The image skip unit 445 determines an image skip method by referring to output values of the profile module 610 and / or the cost function module 620 according to the number of skipped images determined by the skip image determination module 510. Accordingly, the image number 530 to be decoded is determined except for the skipped image, and the image decoding unit 430 is informed of this as a control signal. The image decoding unit 430 decodes pictures corresponding to the decoded picture number 530, which is a form of a control signal, during GOP decoding of a time zone to be immediately decoded.

이하에서는 다시점 화상 복호화기(400)에서의 다시점 화상 데이터 처리 방법을 설명한다. Hereinafter, the multi-view image data processing method in the multi-view image decoder 400 will be described.

우선, 다시점 화상 복호화기(400)는 동일한 GOP 구조를 가지는 K(자연수)개의 시점 화상 시퀀스를 재배열하여 부호화한 단일 비트열을 단일 전송 채널(300)을 통하여 다시점 화상 부호화기(200)로부터 전송받는다. First, the multi-view image decoder 400 rearranges and encodes a K (natural number) viewpoint image sequence having the same GOP structure from the multi-view image encoder 200 through a single transmission channel 300. Receive.

그리고 비트열 분배기(410)를 통해 단일 비트열을 시점 화상 데이터 단위로 분리하여 상응하는 시점 버퍼(420a, 420b, …, 420K)에 저장한다. The bit stream divider 410 separates the single bit stream into view image data units and stores the single bit stream in the corresponding view buffers 420a, 420b, ..., 420K.

화상 복호화부(430)는 시점 버퍼(420a, 420b, …, 420K)에 저장된 시점 화상 데이터를 동일 시간대의 GOP 단위별로 복호화하게 된다. The image decoding unit 430 decodes the viewpoint image data stored in the viewpoint buffers 420a, 420b,..., 420K for each GOP unit of the same time zone.

그리고 복호화함에 있어서 각 시점 별로, 즉 시점 화상 데이터별로 현재 시간대의 GOP의 복호화 시간에 따라 다음 시간대의 GOP에서의 화상 스킵 방법을 스케쥴러(440)에서 결정하고, 시간 순서에 따라 GOP의 복호화를 반복하게 된다. 화상 스킵 방법은 도 5 내지 도 8을 참조하여 상세히 설명하였는 바 자세한 설명은 생략한다. In decoding, according to the decoding time of the GOP of the current time zone for each time point, that is, the viewpoint image data, the scheduler 440 determines a method of skipping the picture in the GOP of the next time zone, and repeats decoding of the GOP according to the time sequence. do. Since the image skip method has been described in detail with reference to FIGS. 5 to 8, a detailed description thereof will be omitted.

본 발명의 바람직한 다른 실시예에 따르면, 다시점 화상 부호화기(200)와 다시점 화상 복호화기(400)를 포함하는 다시점 화상 데이터 처리 시스템을 구현할 수 있다. 다시점 화상 부호화기(200)의 동작과, 다시점 화상 복호화기(400)의 동작은 앞서 상술한 바와 동일하다. According to another preferred embodiment of the present invention, a multi-view image data processing system including a multi-view image encoder 200 and a multi-view image decoder 400 may be implemented. The operation of the multiview image encoder 200 and the operation of the multiview image decoder 400 are the same as described above.

또한, 본 발명의 바람직한 또 다른 실시예에 따른 다시점 화상 데이터 처리 방법은 다음과 같다.동일한 GOP 구조를 가지는 K(임의의 자연수)개의 시점 화상 시퀀스를 재배열하여 부호화한 단일 비트열을 각 시점에 상응하는 시점 화상 데이터 단위로 분리하고 동일 시간대의 GOP 단위별로 순차적으로 복호화하는 방법에 있어서, 우선 가장 최근에 복호화된 시간대의 GOP의 복호화 시간 정보를 입력받는다. A multi-view image data processing method according to another preferred embodiment of the present invention is as follows. A single bit string obtained by rearranging and encoding a sequence of K (arbitrary natural numbers) view images having the same GOP structure at each viewpoint In the method of dividing into view image data units corresponding to and sequentially decoding the GOP units of the same time zone, first, decoding time information of the GOP of the most recently decoded time zone is received.

그리고 복호화 시간에 따라 직후에 복호화될 시간대의 GOP에서의 화상 스킵 방법을 결정한다. 화상 스킵 방법은 도 5 내지 도 8을 참조하여 상세히 설명하였는 바 자세한 설명은 생략한다.A picture skip method in the GOP in the time zone to be decoded immediately after the decoding time is determined. Since the image skip method has been described in detail with reference to FIGS. 5 to 8, a detailed description thereof will be omitted.

이후에 화상 스킵 방법에 따라 직후에 복호화될 시간대의 GOP의 복호화를 제어하는 신호를 출력한다. Subsequently, a signal for controlling decoding of the GOP in the time zone to be decoded immediately after the picture skip method is output.

상술한 다시점 화상 데이터 처리 방법을 수행할 수 있는 프로그램을 외부의 서버에서 저장하고 있으며, 유무선 네트워크를 통해 프로그램을 다운로드하여 다시점 화상 복호화기에 내장함으로써 다시점 화상 복호화를 할 수도 있다. A program capable of performing the above-described multiview image data processing method is stored in an external server, and multiview image decoding may be performed by downloading a program through a wired or wireless network and embedding the program in a multiview image decoder.

상술한 바와 같이, 본 발명에 따른 다시점 화상 시스템은 시점 카메라의 수와 무관하게 하나의 부호화기와 복호화기를 이용하여 다시점 화상을 전송하고 처리할 수 있다. As described above, the multiview image system according to the present invention can transmit and process a multiview image using one encoder and a decoder regardless of the number of viewpoint cameras.

또한, 하나의 부호화기와 하나의 복호화기를 이용하여 다시점 화상을 처리함으로써 전체 시스템의 복잡도를 감소시키면서도 채널 대역폭 및 부호화하는 매크로 블록의 수 등은 영향을 받지 않는다. In addition, by processing a multiview image using one encoder and one decoder, the channel bandwidth and the number of macro blocks to be encoded are not affected while reducing the complexity of the entire system.

또한, 다시점 화상을 재배열함으로써 하나의 단일 압축 비트열로 부호화하고, 이를 복호화하여 사용자에게 각 시점에 따른 화상을 제공할 수 있다. 단일 압축 비트열은 기존 H.264/AVC를 포함하는 MPEG 비디오 압축을 이용하여 부호화가 가능하다. In addition, by rearranging the multi-view images, a single compressed bit string may be encoded, decoded, and the user may be provided an image according to each viewpoint. A single compressed bit string can be encoded using MPEG video compression including existing H.264 / AVC.

또한, 동시 방식에 기반한 효율적인 다시점 화상을 부호화함에 따라 DPB(Decoded Picture Buffer) 개수의 증가가 불필요하다. 그리고 복호화시에도 단일 복호화기를 이용하면서도 화상 복호화부 이용률에 따라 다음 시간대에서의 복호화 우선순위, 스킵 화상 수, 스킵 화상 방법 등을 적응적으로 변화시킴으로써 시점 간에 동등한 화질을 유지하는 것이 가능하다. In addition, an increase in the number of Decoded Picture Buffers (DPBs) is unnecessary as an efficient multi-view image is encoded based on a simultaneous scheme. In the decoding process, it is possible to maintain the same image quality between viewpoints by adaptively changing the decoding priority, the number of skipped images, the skipped image method, etc. in the next time zone according to the utilization rate of the image decoding unit while using a single decoder.

상기에서는 본 발명의 바람직한 실시예를 참조하여 설명하였지만, 해당 기술 분야에서 통상의 지식을 가진 자라면 하기의 특허 청구의 범위에 기재된 본 발명의 사상 및 영역으로부터 벗어나지 않는 범위 내에서 본 발명을 다양하게 수정 및 변경시킬 수 있음을 이해할 수 있을 것이다.Although the above has been described with reference to a preferred embodiment of the present invention, those skilled in the art to which the present invention pertains without departing from the spirit and scope of the present invention as set forth in the claims below It will be appreciated that modifications and variations can be made.

Claims

K view buffers each storing data corresponding to K (random natural numbers) views;

A single bit string obtained by rearranging and encoding K viewpoint image sequences having the same group of picture (GOP) structure is received, and the single bit sequence is divided into viewpoint image data units corresponding to the respective viewpoints to correspond. A bit string divider for storing in each of the time buffers;

A scheduler for outputting a control signal for processing the viewpoint image data stored in the viewpoint buffer; And

An image decoding unit for decoding and outputting the viewpoint image data stored in the viewpoint buffer according to the control signal, respectively.

Multi-view image decoder comprising a.

The method of claim 1,

The viewpoint image data includes multi-viewpoint image decoding, wherein the viewpoint image data includes pictures belonging to at least one GOP of the encoded viewpoint sequence corresponding to the respective viewpoints of the single bit string, or includes all of the maximum viewpoint sequence. group.

The method of claim 1, wherein the scheduler,

A utilization rate calculation unit for receiving a decoding time of each of the viewpoint image data in the GOP of the most recently decoded time zone from the image decoding unit and calculating a utilization rate of the image decoding unit according to the decoding time; And

And a picture skip section for determining a picture skip method in a GOP in a time zone to be decoded immediately after the picture decoder from the picture decoder utilization rate.

The method of claim 3,

The picture skipping unit increases the number of picture skips in the GOP of the time zone to be immediately decoded as the picture decoding unit usage rate in the GOP of the most recently decoded time zone increases for each time point. .

The method of claim 3, wherein the image skip unit,

A skip picture determination module for determining a number of pictures skipped in a GOP in a time zone to be immediately decoded immediately after the picture decoding unit in a GOP in the most recently decoded time zone; And

And a decoded picture selecting module for selecting a picture to be skipped according to the picture skip number.

The method of claim 5,

And the decoded picture selecting module continuously skips a skip start picture and pictures after the skip start picture in a GOP.

The method of claim 6,

And the view point image data is any one of a baseband profile sequence and a main profile sequence.

The method of claim 5,

And the decoded picture selection module skips pictures discontinuously in a GOP.

The method of claim 8,

And said viewpoint image data is a main profile sequence.

The method of claim 3, wherein the scheduler,

And a viewpoint priority determiner for determining a viewpoint priority in a GOP of a time zone to be decoded immediately after the picture decoder utilization rate.

The method of claim 10,

The viewpoint priority determiner makes the viewpoint priority in the GOP of the time zone to be decoded immediately after the smaller as the utilization rate of the picture decoder in the GOP of the time zone decoded most recently for each viewpoint. Picture decoder.

A multi-view image rearrangement unit which receives respective viewpoint image sequences having the same GOP structure from K (random natural numbers) cameras, and generates a single bit sequence sequentially rearranged by dividing each viewpoint image sequence into GOP units;

An image encoder which encodes and transmits the single bit string;

K view buffers respectively storing data corresponding to the viewpoints of the cameras;

A bit string divider configured to receive the single bit string and to separate the single bit string into view image data units corresponding to the respective view points and to store the single bit stream in the corresponding view buffer;

Multi-view image data processing system comprising a.

The method of claim 12,

And the image encoder transmits the encoded single bit string through a single channel.

The method of claim 12,

The multi-viewpoint image rearranging unit divides each of the viewpoint image sequences into GOP units, and rearranges them sequentially by grouping them by GOP units in the same time zone.

The method of claim 12,

The viewpoint image data includes pictures belonging to at least one GOP of the encoded viewpoint image sequences corresponding to the respective viewpoints of the single bit string, or the maximum viewpoint image sequence includes all of the viewpoint image sequences. Processing system.

The method of claim 12, wherein the scheduler,

And a picture skip section for determining a picture skip method in a GOP in a time zone to be decoded immediately after the picture decoding section from the picture decoding section utilization rate.

The method of claim 16,

Multi-view image data processing, characterized in that the picture skip unit increases the number of picture skips in the GOP of the time zone to be decoded immediately after the larger the utilization rate of the picture decoding unit in the GOP of the most recently decoded time zone for each time point. system.

The method of claim 16, wherein the image skip unit,

And a decoded picture selection module for selecting a picture to be skipped according to the picture skip number.

The method of claim 18,

And the decoded picture selection module continuously skips a skip start picture and pictures after the skip start picture in a GOP.

The method of claim 19,

And said viewpoint image data is any one of a baseband profile sequence and a main profile sequence.

The method of claim 18,

Wherein said decoded picture selection module discontinuously skips pictures in a GOP.

The method of claim 21,

And said viewpoint image data is a main profile sequence.

The method of claim 16, wherein the scheduler,

The method of claim 23, wherein

Multiview image data, characterized in that the viewpoint priority in the GOP of the time zone to be decoded immediately after the time is smaller as the utilization rate of the picture decoder in the GOP of the time zone most recently decoded for each time point decreases Processing system.

(a) receiving a single bit string obtained by rearranging and encoding K (random natural numbers) viewpoint image sequences having the same GOP structure;

(b) dividing the single bit stream into view image data units corresponding to each view and storing the single bit string in a corresponding view buffer;

(c) decoding the viewpoint image data stored in the viewpoint buffer for each GOP unit of the same time zone; And

(d) determining a method of skipping pictures in the GOP of the time zone to be decoded immediately after the decoding time of the GOP of the time zone decoded most recently for each view image data, and sequentially repeating step (c)

Multi-view image data processing method comprising a.

The method of claim 25,

In step (d), the number of image skips in the GOP in the time zone to be immediately decoded increases as the decoding time of the GOP in the most recently decoded time zone increases for each view image data. Way.

The method of claim 25, wherein step (d)

(d1) determining the number of image skips of a time zone to be decoded immediately after the time according to the decoding time of the most recently decoded time zone for each view image data;

(d2) multi-view image data processing method comprising selecting an image to be skipped according to the image skip number.

The method of claim 27,

And said step (d2) continuously skips the skip start picture and the pictures after the skip start picture in a GOP.

The method of claim 27,

And said step (d2) discontinuously skips pictures in a GOP.

The method of claim 25,

In the step (d), the viewpoint priority of the GOP of the time zone to be decoded immediately after the decoded time of the GOP of the most recently decoded time zone becomes higher according to the viewpoint image data. Treatment method.

The method of claim 25,

The viewpoint image data includes pictures belonging to at least one GOP of the encoded viewpoint image sequences corresponding to the respective viewpoints of the single bit string, or the maximum viewpoint image sequence includes all of the viewpoint image sequences. Treatment method.

In a method of separating a single bit string obtained by rearranging and encoding K (arbitrary natural number) view image sequences having the same GOP structure into view image data units corresponding to each view, and sequentially decoding each GOP unit in the same time zone. ,

(a) receiving decoding time information of a GOP of a most recently decoded time zone;

(b) determining a method of skipping pictures in a GOP in a time zone to be decoded immediately after the decoding time; And

and (c) outputting a signal for controlling the decoding of the GOP in the time zone to be decoded immediately after the image according to the picture skipping method.

33. The method of claim 32,

In step (b), the number of image skips in the GOP in the time zone to be decoded immediately after the decoding is increased as the decoding time of the GOP in the most recently decoded time zone increases for each view image data. Way.

33. The method of claim 32, wherein step (b) comprises:

(b1) determining, according to the decoding time of the most recently decoded time zone, the number of image skips of the time zone to be decoded immediately after each view image data;

(b2) multi-view image data processing method comprising selecting an image to be skipped according to the number of image skips.

The method of claim 34, wherein

And said step (b2) continuously skips a skip start picture and pictures after the skip start picture in a GOP.

The method of claim 34, wherein

And said step (b2) discontinuously skips pictures in a GOP.

33. The method of claim 32,

In step (b), the viewpoint priority of the GOP of the time zone to be decoded immediately after the decoding is increased as the decoding time of the GOP of the most recently decoded time zone becomes longer according to the viewpoint image data. Treatment method.

33. The method of claim 32,

As a recording medium in which a program of instructions executable by a digital processing apparatus is tangibly embodied and can be read by a digital processing apparatus,

A recording medium on which a program for performing the multi-view image data processing method according to any one of claims 32 to 38 is recorded.