KR20100046485A

KR20100046485A - A method and apparatus for an 3d broadcasting service by using region of interest depth information

Info

Publication number: KR20100046485A
Application number: KR1020080105342A
Authority: KR
Inventors: 김태원; 김진웅; 허남호; 엄기문; 방건; 장은영; 이수인
Original assignee: 한국전자통신연구원
Priority date: 2008-10-27
Filing date: 2008-10-27
Publication date: 2010-05-07
Also published as: KR101005015B1

Abstract

PURPOSE: A method and an apparatus for an 3D broadcasting service by using a region of interest depth information are provided to induce the interest from a viewer by displaying 3D depth information at a desirable region based on an interest region concept. CONSTITUTION: A content provider obtains a depth information image(512) and 2D image information. An encoder(514) performs masking operation for the depth image information, and encodes a masked depth information image. A decoder(421) decodes the depth information image and the 2D image, and a multi-view generator(522) adds the depth information to the 2D image information.

Description

3D broadcast service method and apparatus based on ROI depth information {A METHOD AND APPARATUS FOR AN 3D BROADCASTING SERVICE BY USING REGION OF INTEREST DEPTH INFORMATION}

본 발명은 방송 서비스를 제공하기 위한 방법 및 장치에 관한 것으로, 3차원 방송 서비스를 제공하기 위한 방법 및 장치에 관한 것이다.The present invention relates to a method and apparatus for providing a broadcast service, and to a method and apparatus for providing a 3D broadcast service.

일반적으로 대표적인 방송 서비스는 공중파 방송 서비스가 있으며, 위성을 이용한 위성 방송 서비스, 지상파 방송 서비스, 인터넷 또는 전용 케이블을 이용하는 방송 서비스 등 다양한 방송 서비스가 존재한다. 이러한 방송 서비스는 실시간성을 중요시하는 뉴스와 같은 방송이 있으며, 미리 녹화된 콘텐츠를 제공하는 드라마, 코미디, 다큐멘터리 등의 다양한 콘텐츠가 제공되고 있다.In general, a typical broadcast service includes an over-the-air broadcast service, and there are various broadcast services such as a satellite broadcast service using a satellite, a terrestrial broadcast service, a broadcast service using the Internet or a dedicated cable. Such a broadcast service has a broadcast such as news that emphasizes real time, and various contents such as drama, comedy, and documentary that provide pre-recorded contents are provided.

한편, 현대 기술의 발전에 힘입어 영화 및 게임 산업 등에서 3차원 영상 서비스가 제공되고 있다. 현재에는 이러한 3차원 영상 서비스 기술이 특수 영화 또는 게임 등의 산업에서 제공되고 있다. 또한 현재에 방송 콘텐츠에 3차원 영상 서비스 를 제공하기 위한 다양한 시도와 연구가 이루어지고 있다. On the other hand, thanks to the development of modern technology, three-dimensional image services are provided in the movie and game industries. At present, such 3D video service technology is provided in industries such as special films or games. In addition, various attempts and studies are being made to provide 3D video services for broadcast contents.

일반적으로 3차원 영상 서비스를 제공하는 경우 3차원 그래픽 정보를 제공하기 위해서는 기존의 2차원 영상 정보와 2차원 영상에 표시되는 각 사물 또는 객체들간의 깊이 정보를 이용하여 3차원 그래픽 정보를 제공한다. 여기서 깊이 정보란, 카메라로부터 2차원 영상에 표시되는 각 사물간의 거리 또는 특정한 기준 위치로부터 사물간의 거리를 정보로 표현한 것을 의미한다.In general, when providing a 3D image service, in order to provide 3D graphic information, 3D graphic information is provided using depth information between existing 2D image information and each object or object displayed on the 2D image. Here, the depth information means that the distance between the objects displayed on the two-dimensional image from the camera or the distance between the objects from a specific reference position is expressed as information.

그러면 3차원 방식이 방송에서 사용되는 경우와 깊이 정보를 사용하는 예에 대하여 살펴보기로 한다. 유럽의 ATTEST 프로젝트에서 연구되었던 3차원 방식의 방송은, 초기 획득 단계에서 영상 정보와 깊이 정보를 동시 획득하여 이를 전송한다. 그러면 재생 단계에서는 영상과 깊이 정보 기반으로 하여 DIBR(Depth Image Based Rendering) 방법을 사용하여 양안/다시점 영상(Stereoscopic or multi-view video)을 생성한 후에 양안/다시점 디스플레이 장치에 디스플레이를 하는 과정으로 이루어져 있다. ATTEST 프로젝트에서 연구되었던 이 방식의 특징은 깊이 정보 획득을 위해 영상 획득 시 ZCam이라 알려진 영상 장비를 사용한다. 즉, ZCam을 이용하여 일반 비디오 영상과 함께 깊이 정보를 획득하고, 두 정보를 함께 재생단으로 전송함한다. 상기한 방식을 도 1을 참조하여 좀 더 상세히 살펴보기로 한다.Next, a case where the 3D method is used in broadcasting and an example of using depth information will be described. The three-dimensional broadcast, which was studied at the ATTEST project in Europe, simultaneously acquires and transmits image information and depth information in the initial acquisition stage. Then, in the playback step, a process of generating a stereoscopic or multi-view video using DIBR (Depth Image Based Rendering) based on the image and depth information, and then displaying it on the binocular / multi-view display device. Consists of The feature of this method, which was studied in the ATTEST project, uses an imaging device known as ZCam to acquire images for depth information. That is, depth information is acquired along with a general video image using ZCam, and both information are transmitted to a play end. The above-described method will be described in more detail with reference to FIG. 1.

도 1은 3차원 방송 서비스를 제공하기 위한 시스템의 개념도이다.1 is a conceptual diagram of a system for providing a 3D broadcast service.

참조부호 110은 영상을 획득하는 장치이며, 참조부호 120은 영상을 재생하는 장치이다. 상기 영상을 획득하는 장치는, 앞에서 설명한 바와 같이 ZCam으로 구현할 수 있으며, 도 1에 도시한 바와 같이 일반 영상 정보(111)와 깊이 정보 영 상(112)을 함께 획득한다. 그리고 이와 같이 획득된 일반 영상 정보(111)와 깊이 정보 영상(112)은 함께 또는 각각 부호화된다. 그러면 영상 정보(111)와 깊이 정보 영상(112)을 획득하여 부호화하는 장치의 구성 및 동작에 대하여 살펴보기로 한다.Reference numeral 110 denotes an apparatus for acquiring an image, and reference numeral 120 denotes an apparatus for reproducing an image. The apparatus for acquiring the image may be implemented by ZCam as described above, and as shown in FIG. 1, the general image information 111 and the depth information image 112 are acquired together. The general image information 111 and the depth information image 112 thus obtained are encoded together or separately. Next, the configuration and operation of the apparatus for obtaining and encoding the image information 111 and the depth information image 112 will be described.

도 2는 영상 정보 및 깊이 정보 영상을 함께 획득하기 위한 장치의 개념적인 블록 구성도이다.2 is a conceptual block diagram of an apparatus for acquiring image information and depth information image together.

센서부(211)는 도 1에서 설명한 바와 같이 일반 영상(111)과 깊이 정보 영상(112)을 함께 획득한다. 이와 같이 획득된 일반 영상(111)은 영상 처리부(212)로 입력되고, 깊이 정보 영상(112)은 깊이 정보 처리부(214)로 입력된다. 영상 처리부(212)는 센서부(211)에서 획득된 영상 정보를 방송 시스템 또는 제공하는 콘텐츠에서 사용할 수 있는 형태로 변환한다. 깊이 정보 처리부(214) 또한 센서부(211)에서 획득된 깊이 정보를 방송 시스템 또는 제공하는 콘텐츠에서 사용할 수 있는 형태로 변환한다. 이와 같은 처리가 이루어진 정보들은 영상 부호기(213)에서 영상 신호의 부호화가 이루어지고, 깊이 정보 부호기(215)에서 깊이 정보를 부호화한다. 그리고 부호화된 정보는 다중화기(216)에서 다중화되어 부호화된 3차원 영상 정보를 출력한다.As described above with reference to FIG. 1, the sensor unit 211 acquires the general image 111 and the depth information image 112 together. The general image 111 obtained as described above is input to the image processor 212, and the depth information image 112 is input to the depth information processor 214. The image processor 212 converts the image information acquired by the sensor unit 211 into a form that can be used in a broadcasting system or content provided. The depth information processor 214 also converts the depth information obtained by the sensor unit 211 into a form that can be used in a broadcast system or content provided. The information processed as described above is encoded by the video encoder 213, and the depth information encoder 215 encodes the depth information. The encoded information is multiplexed by the multiplexer 216 and outputs encoded 3D image information.

이상에서 설명한 바와 같이 부호화된 3차원 영상 정보는 무선 통신 시스템 예를 들어, 공중파 방송 또는 지상파 방송 또는 위성 방송 등의 방식을 통해 전송될 수도 있다. 또는 유선을 이용한 전송 방식 예를 들어 인터넷(Internet) 또는 전용 케이블 등의 전송 방식을 통해 부호화된 영상 및 깊이 정보가 전달될 수도 있다. 이때, 각 전송 방식에서 요구되는 부호화가 다시 수행될 수도 있다.As described above, the encoded 3D image information may be transmitted through a wireless communication system, for example, over-the-air broadcasting, terrestrial broadcasting, or satellite broadcasting. Alternatively, the encoded image and depth information may be transmitted through a transmission method using a wire, for example, a transmission method such as the Internet or a dedicated cable. In this case, encoding required for each transmission scheme may be performed again.

상기한 방식으로 영상 정보가 전송되면, 영상 재생 장치에서는 이를 해당하는 방식으로 수신한다. 이러한 영상 정보는 무선 또는 유선의 방식 중 어느 방식으로 전송되어도 무방하므로 도 1에서는 전송 방식에 따른 수신 과정은 도시하지 않았다. 소정의 방식으로 수신된 영상 정보는 참조부호 121과 같이 영상 정보와 깊이 정보에 대하여 복호화 과정을 거친다. 그러면 일반 영상과 깊이 정보 영상으로 다시 복원된다. 이후 영상 재생 장치의 DIBR 다시점 생성부(122)에서 다시점 영상으로 복원이 이루어지고, 다시점 디스플레이(123)에서 3차원 영상을 디스플레이하게 된다. 그러면 영상 정보(111)와 깊이 정보 영상(112)이 소정 통신 시스템을 통해 전송되어 디스플레이 하기 위한 장치의 구성 및 동작에 대하여 살펴보기로 한다.When the image information is transmitted in the above manner, the image reproducing apparatus receives it in the corresponding manner. Since the image information may be transmitted by either a wireless or wired method, the reception process according to the transmission method is not illustrated in FIG. 1. Image information received in a predetermined manner is subjected to a decoding process for the image information and depth information, as shown by reference numeral 121. Then, the normal image and the depth information image are restored again. Thereafter, the DIBR multi-view generator 122 of the image reproducing apparatus restores the multi-view image, and displays the 3D image on the multi-view display 123. Then, the configuration and operation of the apparatus for displaying and transmitting the image information 111 and the depth information image 112 through the predetermined communication system will be described.

도 3은 영상 정보 및 깊이 정보 영상을 디스플레이하기 위한 장치의 개념적인 블록 구성도이다.3 is a conceptual block diagram of an apparatus for displaying image information and depth information image.

상기와 같은 방법을 통해 수신된 데이터는 수신부(311)로 입력된다. 수신부(311)는 무선 시스템 또는 유선 시스템에서 요구되는 방식으로 신호를 수신하고, 소정 대역의 신호로 처리한다. 예를 들어 무선 시스템의 경우 RF 신호에서 캐리어(carrier) 신호를 제거하여 원하는 대역의 신호로 변환한 후 무선 신호의 전송에 따른 복호화를 수행하여 출력한다. 수신부(311)의 출력은 역다중화부(312)로 입력된다. 역다중화부(312)는 송신측에서 영상 정보와 깊이 정보를 다중화한 반대 방법으로 역다중화를 수행한다. 따라서 역다중화부(312)는 영상 정보와 깊이 정보를 구별하여 출력한다. 역다중화된 영상 정보는 영상 복호기(313)로 입력되고, 역다중화된 깊이 정보는 깊이 정보 복호기(314)로 입력된다.The data received through the above method is input to the receiver 311. The receiver 311 receives a signal in a manner required by a wireless system or a wired system, and processes the signal into a signal of a predetermined band. For example, in a wireless system, a carrier signal is removed from an RF signal, converted into a signal of a desired band, and then decoded according to transmission of a wireless signal and output. The output of the receiver 311 is input to the demultiplexer 312. The demultiplexer 312 performs demultiplexing by the opposite method of multiplexing the image information and the depth information at the transmitting side. Therefore, the demultiplexer 312 distinguishes and outputs image information and depth information. The demultiplexed image information is input to the image decoder 313, and the demultiplexed depth information is input to the depth information decoder 314.

영상 복호기(313)는 촬영된 2차원의ㅏ 영상 신호를 복호하여 출력한다. 그리고 깊이 정보 복호기(314)는 2차원 정보와 함께 획득된 깊이 정보를 복호하여 출력한다. 이와 같이 영상 복호기(313)와 깊이 정보 복호기(314)에서 복호된 정보는 다시점 영상 생성부(315)로 입력되어 다시점 영상을 생성한다. 즉, 3차원 영상을 생성하게 된다. 다시점 영상 생성부(315)에서 생성된 다시점 영상은 다시점 디스플레이부(316)에서 3차원 영상으로 디스플레이 된다. 이를 통해 시청자들은 3차원 영상을 청취할 수 있다.The video decoder 313 decodes and outputs the photographed two-dimensional? Video signal. The depth information decoder 314 decodes and outputs the depth information obtained together with the 2D information. As described above, the information decoded by the image decoder 313 and the depth information decoder 314 is input to the multiview image generator 315 to generate a multiview image. That is, a three-dimensional image is generated. The multiview image generated by the multiview image generator 315 is displayed as a 3D image by the multiview display 316. Through this, viewers can listen to the 3D image.

한편, 일반적으로 깊이 정보를 얻기 위한 방법으로는 앞에서 살펴본 ZCam과 같은 깊이 카메라(depth camera)를 이용하는 방법 이외에도 여러 가지 방법들이 존재한다. 3차원 방송에 활용하기에 적합한 방법으로 다른 한 가지를 예를 들면, 대표적으로는 스테레오 영상 정합(stereo matching) 기법이 있다. 그러면 스테레오 영상 정합 기법과 ZCam 기반 깊이(depth) 정보의 획득 방법에 대해 간략히 설명한다.In general, there are various methods for obtaining depth information in addition to using a depth camera such as ZCam described above. Another suitable method for utilizing in 3D broadcasting is, for example, stereo matching technique. Next, a stereo image matching technique and a method of acquiring ZCam-based depth information will be briefly described.

(1) 스테레오 영상 정합 기법(1) stereo image matching technique

스테레오 영상 정합 기법은 두 장의 영상을 입력으로 하여 화소값(pixel value) 정보를 바탕으로 각 화소의 대응점 및 깊이 값을 계산하는 방법이다. 일반적으로 영상 전역에 걸쳐서 일정 수준 이상의 정확도를 갖는 대응점을 찾는 것에 어려움이 있어 스테레오 영상 정합 기법은 일부 영역에서 계산된 깊이 값이 부정확한 값을 갖는다. 특히, 영상에 텍스쳐가 없는 부분(textureless region)의 대응점 계산은 이론적으로는 불가능한 한계점을 갖는다.Stereo image matching is a method of calculating corresponding points and depth values of each pixel based on pixel value information using two images as inputs. In general, it is difficult to find a corresponding point having a certain level or more of accuracy throughout the image, so that stereo image matching technique has an inaccurate depth value calculated in some regions. In particular, the calculation of the correspondence point of the textureless region in the image has a limit that is theoretically impossible.

(2) ZCam 기반 깊이(depth) 획득 방법(2) ZCam-based depth acquisition method

ZCam을 이용하면 일반 영상과 그에 해당하는 깊이 영상을 동시에 얻을 수 있는 있는 장점이 있다. 그러나 실제 ZCam의 깊이 획득 범위 한계로 인하여 영상의 전체 영역이 아니라 일부 영역 해당하는 제한된 영역에서만 깊이 정보를 획득할 수 있다는 한계가 있다.Using the ZCam has the advantage of obtaining a normal image and a corresponding depth image at the same time. However, due to the limitation of the depth acquisition range of the actual ZCam, there is a limitation that the depth information can be obtained only in a limited area corresponding to a part of the image, not the entire area of the image.

위 두 가지 방법 이외에도 영상 전체 영역에 걸쳐서 신뢰도 있는 깊이 정보를 얻기 위한 방법으로 영역 스케너(range scanner)와 같은 장치가 존재하나 정지 환경(static scene)에 대한 깊이 정보만을 얻을 수 있다. 그러므로 동영상을 재생해야 하는 3차원 방송에 사용하기에 적합하지 않다.In addition to the above two methods, a device such as a range scanner exists as a method for obtaining reliable depth information over an entire image area, but only depth information about a static scene can be obtained. Therefore, it is not suitable for use in three-dimensional broadcasting that requires video playback.

부연 설명하면, 스테레오 영상 정합 기법은 일부 영상 영역에서 부정확한 값을 가진다. 그리고 ZCam 방법은 약 1.4m의 깊이범위와 같이 깊이 범위의 한계를 가진다. 이러한 스테레오 영상 정합 기법과 ZCam으로 획득된 깊이 정보를 기반으로, DIBR 기법을 이용하여 다시점 영상을 생성할 수 있다. 이러한 방법으로 생성된 영상을 3차원 디스플레이 모니터에 디스플레이를 할 경우에, 부정확한 깊이 값 및 깊이 범위 한계로 인해 제대로 된 입체 영상을 보는데 어려움이 발생할 수 있다. 또한 깊이 정보의 신뢰도와 관계없이 일반 영상과 일반 영상에 해당하는 깊이 정보를 모두 보내게 됨으로 인해 전송 대역을 충분히 확보하지 못한 상황에서는 전송 문제점도 발생할 수 있게 된다.In other words, the stereo image matching technique has an incorrect value in some image regions. And the ZCam method has a depth range limit of about 1.4m. Based on the stereo image matching technique and the depth information acquired by ZCam, a multiview image can be generated using the DIBR technique. When displaying an image generated in this way on a three-dimensional display monitor, it may be difficult to see the correct stereoscopic image due to inaccurate depth value and depth range limitation. In addition, regardless of the reliability of the depth information, since both the depth information corresponding to the normal image and the general image is sent, a transmission problem may also occur in a situation where the transmission band is not sufficiently secured.

다른 한편, 3차원 영상 콘텐츠는 시청자 입장에서는 흥미를 유발할 수는 있지만, 시각적인 부분에서 피로도가 매우 크게 증가한다. 따라서 3차원 영상 콘텐츠 에 오랜 시간 노출된 경우 2차원 영상을 제공받는 경우보다 피로도가 매우 크며, 시력저하의 원인이 될 수도 있다.On the other hand, three-dimensional image content may be interesting to the viewer, but fatigue is greatly increased in the visual aspect. Therefore, when exposed to 3D image content for a long time, fatigue is much greater than when 2D image is provided, and it may cause vision loss.

따라서 본 발명에서는 신뢰할 수 있는 작은 양의 정보로 3차원 입체 방송 서비스를 제공할 수 있는 방법 및 장치를 제공한다.Accordingly, the present invention provides a method and apparatus capable of providing a 3D stereoscopic broadcast service with a reliable amount of information.

또한 본 발명에서는 범위 한계를 줄여 3차원 입체 방송 서비스를 제공할 수 있는 방법 및 장치를 제공한다.In addition, the present invention provides a method and apparatus that can provide a three-dimensional stereoscopic broadcast service by reducing the range limits.

또한 본 발명에서는 3차원 영상을 제공함에 있어 시청자의 피로도를 줄일 수 있는 방법 및 장치를 제공한다.In addition, the present invention provides a method and apparatus that can reduce the fatigue of the viewer in providing a three-dimensional image.

본 발명의 일 실시 예에 따른 방송 서비스 제공 방법은, 3차원 방송 서비스를 제공하기 위한 송신 방법으로, 2차원 영상 정보와 함께 상기 2차원 영상의 깊이 정보 영상을 획득하는 과정과, 상기 2차원 영상 정보 중 비관심 영역의 제거하기 위한 마스크 값을 생성하는 과정과, 상기 마스크 값을 이용하여 깊이 정보 영상을 마스킹하는 과정과, 상기 2차원 영상 정보와 상기 마스킹된 깊이 정보 영상을 부호화하여 전송하는 과정을 포함한다.The broadcast service providing method according to an embodiment of the present invention is a transmission method for providing a 3D broadcast service, the process of obtaining a depth information image of the 2D image together with 2D image information, and the 2D image. Generating a mask value for removing an uninterested region of information, masking a depth information image using the mask value, and encoding and transmitting the 2D image information and the masked depth information image It includes.

본 발명의 일 실시 예에 따른 방송 서비스 제공 장치는, 3차원 방송 서비스 를 제공하기 위한 송신 장치로, 2차원 영상 정보와 함께 상기 2차원 영상의 깊이 정보 영상을 획득하는 센서부와, 상기 2차원 영상 정보 중 비관심 영역의 깊이 정보를 제거하기 위한 마스크 값을 마스크 정보 생성부와, 상기 마스크 값을 이용하여 상기 깊이 정보 영상을 마스킹하는 결합부와, 상기 2차원 영상 정보와 상기 마스킹된 깊이 정보 영상을 부호화하여 전송하는 송신부를 포함한다.An apparatus for providing a broadcast service according to an embodiment of the present invention is a transmission apparatus for providing a 3D broadcast service, and includes a sensor unit which acquires a depth information image of the 2D image together with 2D image information, A mask value generator for removing depth information of an uninterested region of the image information, a combiner for masking the depth information image using the mask value, the two-dimensional image information and the masked depth information And a transmitter for encoding and transmitting an image.

본 발명의 일 실시 예에 따른 방송 서비스 수신 방법은, 3차원 방송 서비스를 제공받기 위한 방송 서비스 수신 방법으로, 수신된 데이터에서 2차원 영상 정보와 관심 영역에 대응하는 깊이 영상 정보와 마스크 정보를 분리하는 과정과, 상기 마스크 정보와 관심 영역 정보를 이용하여 상기 2차원 영상 정보 중 관심 영역에 3차원 다시점 영상을 생성하는 과정과, 상기 다시점 영상을 디스플레이하는 과정을 포함한다.The broadcast service receiving method according to an embodiment of the present invention is a broadcast service receiving method for receiving a 3D broadcast service, and separates 2D image information and depth image information and mask information corresponding to the ROI from the received data. And generating a 3D multiview image in the ROI of the 2D image information by using the mask information and the ROI information, and displaying the multiview image.

본 발명의 일 실시 예에 따른 방송 서비스 수신 장치는, 3차원 방송 서비스를 제공받기 위한 방송 서비스 수신 장치로, 수신된 데이터에서 2차원 영상 정보와 관심 영역 정보와 마스크 정보를 분리하는 역다중화부와, 상기 마스크 정보와 관심 영역 정보를 이용하여 상기 2차원 영상 정보 중 관심 영역에 3차원 다시점 영상을 생성하는 영상 제공부와, 상기 다시점 영상을 디스플레이하는 디스플레이부를 포함한다.The broadcast service receiving apparatus according to an embodiment of the present invention is a broadcast service receiving apparatus for receiving a 3D broadcast service, and includes a demultiplexer that separates 2D image information, ROI information, and mask information from received data; And an image providing unit generating a 3D multiview image in the ROI of the 2D image information using the mask information and the ROI information, and a display unit displaying the multiview image.

본 발명에 따른 3차원 입체 방송은 3차원 깊이 정보를 관심 영역 개념에 기 반하여 원하는 영역에만 입체감이 있도록 표현함으로써 시청자의 관심과 흥미를 집중하도록 할 수 있으며, 전체 영상에 대한 깊이정보를 필요치 않아 전송되는 정보의 양을 줄일 수 있다. 또한 3차원 깊이 정보 획득의 어려움에서 벗어날 수 있게 되고, 불필요한 부분의 3차원 영상으로 인한 눈의 피로를 줄일 수 있다.In the 3D stereoscopic broadcasting according to the present invention, the 3D depth information may be expressed in a desired area only based on the concept of the area of interest so that the viewer can focus attention and interest, and does not need the depth information of the entire image and transmits it. This can reduce the amount of information that is generated. In addition, it is possible to escape from the difficulty of acquiring three-dimensional depth information, and to reduce eye fatigue due to unnecessary three-dimensional images.

이하 첨부된 도면을 참조하여 본 발명을 설명한다. 본 발명을 설명함에 있어 당업자에게 자명한 부분에 대하여는 본 발명의 요지를 흩뜨리지 않도록 생략하기로 한다. 또한 이하에서 설명되는 각 용어들은 본 발명의 이해를 돕기 위해 사용된 것일 뿐이며, 각 제조 회사 또는 연구 그룹에서는 동일한 용도임에도 불구하고 서로 다른 용어로 사용될 수 있음에 유의해야 한다.Hereinafter, the present invention will be described with reference to the accompanying drawings. In the following description of the present invention, a part obvious to those skilled in the art will be omitted so as not to disturb the gist of the present invention. In addition, it is to be noted that each of the terms described below are only used to help the understanding of the present invention, and may be used in different terms despite the same purpose in each manufacturing company or research group.

먼저 본 발명의 전반적인 개념을 살피고, 구체적인 구성과 동작에 대하여 설명하기로 한다.First, the overall concept of the present invention will be described, and specific configurations and operations will be described.

본 발명에서는 관심 영역(Region of Interest : ROI)을 두어 3차원 영상을 제공하며, 관심 영역에 대하여만 3차원 디스플레이가 이루어지도록 한다. 본 발명에서 언급하는 관심 영역이란, 카메라가 담을 수 있는 모든 영역의 영상 중에서 콘텐츠의 제공자 또는 콘텐츠를 제공받는 시청자 입장에서 3차원 영상을 제공받기를 원하는 대상이 관심 영역이 된다. 이를 더 상세히 설명하면, 하나의 영상이 제공될 때, 제공되는 영상 중에서 주요 대상이 존재하게 된다. 관심 영역과 비관심 영역을 예를 들어 살펴보면, 두 사람이 대화를 하는 영상을 가정하면, 두 사람의 얼굴 또 는 두 사람의 신체가 주요 대상이 되고, 그 외의 배경들은 주요 대상에서 제외된다. 또한 방송 콘텐츠 중 뉴스나 대담 프로의 경우에는 사회자 및 일부 토론자들에 주로 관심이 가게 되므로 배경 보다는 사람, 데스크 등의 관심 영역 대한 입체감 표현만으로도 충분히 입체 영상 방송이 가능할 수 있다. 또한 광고 등에서는 배우가 물건을 광고할 시에 예를 들면 홍길동이 휴대폰을 들고 있는 장면에서는 배우의 얼굴 또는 특정한 신체 부위, 배우가 들고 있는 물건 등등의 특정한 영역에 대해 관심을 가지게 된다. 따라서 이러한 경우에 위와 같은 관심 영역에 대해서 입체감을 느끼게 하고 나머지 부분은 기존의 2D 영상을 보여줄 수 있다. 즉, 본 발명은 주요 대상인 관심 영역에 대하여는 3차원 영상을 제공하고, 나머지 영역인 비관심 영역에는 2차원 영상을 제공하도록 하는 것이다. 이와 같이 선택적으로 3차원 영상을 제공하면, 제공해야 하는 데이터의 양이 줄어들게 된다. 또한 실제로 주요 대상에 대하여는 3차원 영상을 제공할 수 있으므로, 시청자들의 흥미를 유발할 수 있다. 또한 필요한 부분에만 3차원 영상을 제공함으로써 시청자의 입체감 시청 시에 피로감 현상도 획기적으로 줄일 수 있다.In the present invention, a region of interest (ROI) is provided to provide a 3D image, and the 3D display is performed only on the ROI. In the region of interest referred to in the present invention, the region of interest is a region of interest that includes a content provider or an object that wants to receive a 3D image from a viewer who receives the content. In more detail, when one image is provided, a main object exists among the provided images. For example, in the region of interest and the region of indifferent interest, assuming a video of two people talking, two faces or two bodies are the main objects, and the other backgrounds are excluded. In addition, in the case of news or audacity pro among broadcast contents, the presenter and some debaters are mainly interested, so it may be possible to sufficiently broadcast a 3D image by expressing a 3D effect on a region of interest such as a person or a desk rather than a background. Also, in advertisements, when an actor advertises an object, for example, when Hong Gil-dong is holding a cell phone, he is interested in an actor's face or a specific body part, an object that the actor is holding, and the like. Therefore, in such a case, a 3D feeling may be felt for the above-mentioned ROI, and the remaining part may show a conventional 2D image. That is, the present invention is to provide a three-dimensional image for the region of interest, which is the main object, and a two-dimensional image for the uninterested region, which is the remaining region. If the 3D image is selectively provided in this way, the amount of data to be provided is reduced. In addition, since the 3D image may be provided to the main object, the viewer may be interested. In addition, by providing a three-dimensional image only in the necessary portion, it is possible to significantly reduce the fatigue phenomenon when viewing the stereoscopic view of the viewer.

도 4는 본 발명의 일 실시 예에 따라 방송 콘텐츠를 관심 영역에 대하여만 3차원으로 생성하여 제공하기 위한 개념도이다.4 is a conceptual diagram for generating and providing broadcast content in 3D only for a region of interest according to an embodiment of the present invention.

먼저 서비스 제공자 또는 콘텐츠 제공자 측에서 이루어지는 과정에 대하여 살펴보기로 한다. 본 발명에서는 종래 기술에서 설명한 바와 같이 일반 영상(411)을 2차원 정보로 획득한다. 그리고 관심 영역을 포함하는 깊이 정보 영상(512) 또는 관심 영역만의 깊이 정보 영상(512)을 획득한다. 일반적으로 관심 영역에 대하 여만 대한 깊이 정보 영상을 획득하는 것보다 전체적인 깊이 정보 영상을 획득하는 것이 보다 쉽기 때문에 이하에서는 전체의 깊이 정보 영상을 획득하는 것으로 가정하여 설명하기로 한다. 그러나 특정한 영역 즉, 관심 영역에 대하여만 깊이 정보 영상을 획득할 수 있는 경우에는 관심 영역에 대하여만 깊이 정보 영상을 획득하도록 구성할 수도 있다. 또한 본 발명에서는 관심 영역에 대하여만 3차원 영상을 제공하기 위해 관심 영역을 제외한 부분에는 마스킹(masking) 기법을 이용한다. 즉, 관심 영역이 아닌 부분 즉, 비관심 영역에 대하여는 깊이 영상 정보를 마스킹 한다. 따라서 관심 영역을 제외한 부분에 마스킹하기 위한 마스크 정보를 생성한다. 상기한 정보들은 모두 부호기로 입력되어 부호화된다. 이때, 마스크 정보를 전송하지 않아도 되는 경우에는 마스크 정보는 부호화되지 않도록 구성할 수 있다. 또한 2차원 영상 및 깊이 정보 영상은 각각 객체별로 객체화 되어 있다고 가정한다. 영상의 객체화 방법에 관해서는 이미 많은 논문이 발표되어 있으므로 여기서는 간단한 예를 하나만 살피기로 한다. 예를 들어 MPEG4의 경우 한 프레임의 영상을 객체별로 description하기 위한 방법이 이미 표준화 되어 있다. 또한 상기한 영상 정보는 2진 영상(binary image)으로 이루어져 있음을 가정한다. 이때, 생성된 마스크 정보는 관심 영역을 나타낼 수 있는 정보이므로, 2차원 영상 정보 및 깊이 영상 정보와 함께 전송할 수도 있고, 특별한 약속이 되어 있는 경우 전송되지 않을 수도 있다.First, the process performed at the service provider or content provider will be described. In the present invention, as described in the prior art, the general image 411 is acquired as two-dimensional information. The depth information image 512 including the ROI or the depth information image 512 of only the ROI is obtained. In general, since it is easier to acquire the overall depth information image than to acquire the depth information image only for the ROI, the following description will be based on the assumption that the entire depth information image is acquired. However, when the depth information image may be acquired only for a specific region, that is, the ROI, the depth information image may be acquired only for the ROI. In addition, in the present invention, a masking technique is used for a portion excluding the ROI in order to provide a 3D image only for the ROI. That is, the depth image information is masked on the portion that is not the region of interest, that is, the uninterested region. Therefore, mask information for masking a portion except for the ROI is generated. All of the above information is inputted into an encoder and encoded. In this case, when the mask information does not need to be transmitted, the mask information may be configured not to be encoded. In addition, it is assumed that the 2D image and the depth information image are objectized for each object. Many papers have already been published on how to objectize images, so we will look at one simple example. For example, in the case of MPEG4, a method for describing an image of one frame for each object is already standardized. In addition, it is assumed that the image information is composed of a binary image. In this case, since the generated mask information may indicate the ROI, the generated mask information may be transmitted together with the 2D image information and the depth image information, or may not be transmitted when a special appointment is made.

위와 같은 방송 서비스는 서비스되는 방송 시스템에 해당하는 형식으로 변환된 후 시청자들에게 제공된다. 여기서 방송 시스템은, 무선 네트워크를 이용한 방 송 시스템과 유선 네트워크를 이용한 방송 시스템 또는 복합적인 방법을 이용한 방송 시스템 등이 존재할 수 있다. 예를 들어 무선 네트워크를 이용하는 방송 시스템은 공중파 방송, 지상파 방송, 위성 방송 등이 있을 수 있으며, 유선 네트워크를 이용하는 방송 시스템은 인터넷 등을 이용한 방송 또는 케이블 방송 등이 존재할 수 있다. 또한 유선과 무선이 혼합되어 있는 형태의 방송 시스템도 가능하다. 뿐만 아니라 상기한 방송 시스템들은 단방향 방송 뿐 아니라 양방향 방송도 포함한다.Such a broadcast service is provided to viewers after being converted into a format corresponding to a service system. Here, the broadcasting system may include a broadcasting system using a wireless network, a broadcasting system using a wired network, or a broadcasting system using a complex method. For example, a broadcasting system using a wireless network may include air broadcasting, terrestrial broadcasting, satellite broadcasting, and the like, and a broadcasting system using a wired network may include broadcasting or cable broadcasting using the Internet. It is also possible to have a broadcasting system in which wired and wireless are mixed. In addition, the broadcast systems include two-way broadcast as well as one-way broadcast.

이와 같은 방법으로 전달된 방송 정보는 복호기(421)에서 2차원 영상 정보와 관심 영역의 깊이 정보 및 마스크 값이 복호되고, 복호된 정보를 이용하여 다시점 생성부(422)에서 다시점 영상을 생성한다. 다시점 영상은 다시점 디스플레이(423)에서 다시점 영상으로 시청자에게 제공된다. 즉, 3차원 영상을 시청자에게 제공하게 된다.In the broadcast information transmitted in this manner, the decoder 421 decodes the 2D image information, the depth information of the ROI, and the mask value, and generates a multiview image from the multiview generator 422 using the decoded information. do. The multiview image is provided to the viewer as a multiview image in the multiview display 423. That is, the 3D image is provided to the viewer.

도 5는 본 발명의 다른 실시 예에 따라 방송 콘텐츠의 관심 영역을 설정하여 3차원 방송을 제공하기 위한 개념도이다.5 is a conceptual diagram for providing a 3D broadcast by setting a region of interest of broadcast content according to another embodiment of the present invention.

도 5는 앞에서 설명한 도 4와 유사한 형태이다. 따라서 콘텐츠 제공자 또는 서비스 사업자는 2차원의 일반 영상 정보(511)와 관심 영역을 포함하는 깊이 정보 영상(512)을 획득한다. 이때, 관심 영역의 마스크 정보는 콘텐츠 제공자 또는 서비스 사업자가 제공하는 선별해서 제공하지 않고, 시청자가 선택한 정보를 이용하여 선택된 영역을 제외한 부분이 마스킹 된다. 이에 대하여는 이하에서 더 살피기로 한다.FIG. 5 is similar to FIG. 4 described above. Accordingly, the content provider or service provider acquires the 2D general image information 511 and the depth information image 512 including the ROI. In this case, the mask information of the region of interest is not selectively provided by the content provider or the service provider, and the portion except for the region selected using the information selected by the viewer is masked. This will be further discussed below.

상기와 같이 획득된 정보는 부호기(514)로 입력되어 획득된 마스크 정보에 따라 깊이 영상 정보에 마스킹 작업을 수행하고, 일반 영상과 마스킹된 깊이 정보 영상을 부호화한다. 이러한 방법으로 생성된 방송 콘텐츠 또는 방송 서비스는 양방향 전송 매체를 통해 전달된다. 이때, 도 5에서 방송 서비스가 제공되는 방법은 앞서 설명한 도 4에서 기술한 방법 중 하나의 방법으로 제공된다. 다만 도 5의 방법이 도 4와 달라지는 점은 방송 서비스를 제공하는 사업자에게 방송 서비스에 대하여 마스크 정보를 궤환(feedback)시키기 위한 방법이 새롭게 부가되는 점에서 차이가 있다. 즉, 도 5는 도 4와 대비하여 방송 서비스를 제공받는 시청자와 방송 서비스를 제공하는 사업자간 상호 작용(interaction)이 이루어진다는 점에서 큰 차이가 있다.The information obtained as described above is input to the encoder 514 to perform a masking operation on the depth image information according to the obtained mask information, and to encode the normal image and the masked depth information image. The broadcast content or broadcast service generated in this way is delivered through a bidirectional transmission medium. In this case, the method of providing a broadcast service in FIG. 5 is provided by one of the methods described in FIG. 4. However, the method of FIG. 5 is different from that of FIG. 4 in that a method for feeding back mask information for a broadcast service is newly added to a service provider that provides a broadcast service. That is, FIG. 5 has a big difference in that interaction between a viewer who receives a broadcast service and a provider that provides a broadcast service is performed in comparison with FIG. 4.

그러면 시청자의 단말(520)에서 이루어지는 과정에 대하여 살펴보기로 한다. 청취자의 단말의 복호기(521)는 2차원 영상과 마스킹된 깊이 정보 영상을 복호하고, 이를 일반 영상과 관심 영역에 대한 깊이 정보 영상으로 변환하여 다시점 생성부(522)로 제공한다. 그러면 다시점 생성부(522)는 2차원 영상 정보에 깊이 정보를 부가하여 관심 영역에 대하여만 3차원 정보를 생성한다. 따라서 부분적인 3차원 그래픽을 생성한다. 이와 같이 생성된 부분적인 3차원 그래픽 정보는 다시점 디스플레이(523)를 통해 사용자에게 제공된다. 도 5와 같은 방식에서는 시청자가 관심 영역을 선택할 수 있으므로 깊이 정보 영상의 마스크 값을 미리 저장한다면 전송하지 않을 수도 있다.This will be described with respect to the process performed in the terminal 520 of the viewer. The decoder 521 of the listener's terminal decodes the 2D image and the masked depth information image, converts the 2D image and the depth information image of the region of interest, and provides the same to the multiview generator 522. Then, the multi-view generator 522 adds depth information to the 2D image information to generate 3D information only for the ROI. Therefore, we create partial three-dimensional graphics. The partial three-dimensional graphic information generated in this way is provided to the user through the multi-view display 523. In the method as shown in FIG. 5, since the viewer can select a region of interest, if the mask value of the depth information image is stored in advance, it may not be transmitted.

상술한 서비스가 최초 제공될 때는 콘텐츠 제공자에 의해 선택된 관심 영역에 대하여만 3차원 정보를 제공할 수 있다. 이러한 경우 사용자가 새로운 부분을 관심 영역으로 선택할 수 있다. 이는 다시점 디스플레이어(523)에서 시청자의 선택 정보를 검출할 수 있도록 하여 구현할 수 있다. 이러한 검출 및 검출된 정보는 송신부(525)에서 선택 객체를 판별하고, 그 정보를 관심 영역 깊이 마스크 정보로 생성한 후 콘텐츠 제공자 또는 방송 서비스 제공자에게 전달함으로써 시청자가 요구한 관심 영역의 마스크 값을 제공할 수 있다.When the above service is initially provided, 3D information may be provided only for the ROI selected by the content provider. In this case, the user can select a new part as the region of interest. This may be implemented by allowing the multi-view display 523 to detect the viewer's selection information. The detected and detected information is provided by the transmitter 525 to determine a selection object, and generates the ROI depth mask information, and then transfer the information to the content provider or broadcast service provider to provide a mask value of the ROI requested by the viewer. can do.

이를 예를 들어 살펴보면, 시청자가 방송 서비스가 제공되는 현재의 화면에서 특정 영역 또는 특정 대상물을 선택한다. 그러면 다시점 디스플레이어(523)에서는 터치 스크린 형식 또는 시청자의 요청을 수신할 수 있는 형태의 단말(520)에서 시청자의 선택 정보를 획득한다. 이러한 방법을 좀 더 구체적으로 살펴보면, 객체에 기반한 부호화 방법을 통해서 전송된 영상은 이미 각 객체별로 객체화 되어 있으므로, 디스플레이에서 사용자가 원하는 객체를 선택함으로써 선택된 객체 정보를 바탕으로 초기 송신에 필요한 관심 영역의 깊이 정보 및 해당 관심 영역의 마스크를 만들 수 있다. 구체적인 예를 들면, N 개의 객체로 이루어진 관심 영역의 깊이 정보와 그에 해당하는 관심 영역 마스크 정보를 이용하여 디스플레이에서 사용자가 원하는 객체를 선택한다. 이와 같이 선택된 객체가 전체 즉, N개의 객체 중에서 원하는 적어도 1개의 객체를 선택하면 선택된 객체에 대한 마스크 값을 생성하여 방송 송신측으로 제공함으로써 원하는 영역에 대한 입체 영상을 획득한다.As an example, the viewer selects a specific area or a specific object on a current screen on which a broadcast service is provided. Then, the multi-view display 523 obtains the viewer's selection information from the terminal 520 in the form of a touch screen or a form in which the viewer's request can be received. In more detail, since the image transmitted through the object-based encoding method is already objectized for each object, the area of interest required for the initial transmission is selected based on the object information selected by the user selecting the desired object on the display. You can create a depth mask and a mask of the region of interest. As a specific example, the user selects the desired object on the display using depth information of the ROI composed of N objects and corresponding ROI mask information. When the selected object selects at least one desired object from among all, that is, N objects, a mask value for the selected object is generated and provided to the broadcast transmitter to acquire a stereoscopic image of a desired area.

상기 마스크 정보는 관심 영역과 비관심 영역을 구분하는 것으로, 시청자에 의해 선택된 부분을 제외한 나머지 영역에 대하여는 비관심 영역이 된다. 따라서 선택되지 않은 부분에 마스크 값을 "0"으로 설정하고, 선택된 부분만을 "1"로 선택 함으로써 마스크 값을 생성할 수 있다. 이는 앞에서 설명한 바와 같이 동영상 파일을 제공하는 MPEG4의 경우 하나의 영상 프레임 내에 다수의 객체들이 존재하며, 해당하는 객체 중 시청자가 선택하지 않은 객체의 영역에는 "0"의 값이 선택되고 시청자가 선택한 영역에는 "1"의 값을 생성하도록 하여 마스크 값을 획득할 수 있다. 앞에서 가정한 바와 같이 동영상은 2진 영상으로 구현되므로 깊이 정보에 마스크 값을 곱하면, 비관심 영역의 깊이 정보는 모두 사라지게 된다. 이와 같이 생성된 마스크 값은 방송 시스템에서 제공하는 궤환 방식에 따라 궤환이 이루어진다. 예를 들어 IPTV의 경우 자신의 IP 정보를 이용하여 유선상으로 관심 영역 마스크 값을 전달할 수 있으며, 이동통신 단말인 경우 해당하는 이동통신 단말의 고유한 전화번호를 이용할 수도 있다. 또한 위성 방송을 이용하는 경우에도 양방향 서비스를 위하여 유선 전화 또는 무선 단말을 통해 궤환 방식이 결정된 경우 해당하는 궤환 방식을 이용할 수도 있다.The mask information distinguishes a region of interest from an uninterested region and becomes an uninterested region for the remaining regions except for the portion selected by the viewer. Therefore, the mask value can be generated by setting the mask value to "0" in the unselected portion and selecting "1" only in the selected portion. As described above, in the case of MPEG4 that provides a video file, a plurality of objects exist in one image frame, and a value of "0" is selected in an area of an object that is not selected by the viewer among the corresponding objects, and the area selected by the viewer. It is possible to obtain a mask value by generating a value of "1". As assumed above, since the video is implemented as a binary image, multiplying the depth information by the mask value causes all the depth information of the uninterested region to disappear. The mask value generated as described above is fed back according to the feedback method provided by the broadcasting system. For example, in case of IPTV, a region of interest mask value may be transmitted over a wire by using its own IP information. In the case of a mobile communication terminal, a unique phone number of a corresponding mobile communication terminal may be used. In addition, even when using satellite broadcasting, a feedback method may be used when a feedback method is determined through a wired telephone or a wireless terminal for an interactive service.

그러면 이상에서 살펴본 방식에 따른 장치들의 블록 구성 및 동작에 대하여 살펴보기로 한다.Next, the block configuration and operation of devices according to the above-described method will be described.

도 6은 본 발명에 따라 선택적 3차원 정보를 제공하기 위한 장치의 개념적인 블록 구성도이다.6 is a conceptual block diagram of an apparatus for providing selective three-dimensional information according to the present invention.

센서부(611)는 도 1, 도 4 및 도 5에서 전술한 바와 같이 2차원의 일반 영상과 깊이 정보 영상을 함께 획득한다. 이와 같이 획득된 일반 영상은 영상 처리부(612)로 입력되고, 깊이 정보 영상은 깊이 정보 처리부(614)로 입력된다. 영상 처리부(612)는 센서부(611)에서 획득된 영상 정보를 방송 시스템 또는 제공하는 콘 텐츠에서 사용할 수 있는 형태로 변환한다. 또한 센서부(611)에서 획득된 깊이 정보 또는/및 영상 정보는 마스크 정보 생성부(615)로 입력될 수 있다. 이는 마스크 정보 생성부(615)는 운용자 또는 콘텐츠 제작자 또는 시청자에 의해 결정된 마스크 정보를 의미한다. 따라서 마스크 정보 생성부(615)에서는 2차원 영상 및 깊이 정보 영상의 크기에 맞춰 마스크 정보를 생성할 수 있다.As described above with reference to FIGS. 1, 4, and 5, the sensor unit 611 acquires a two-dimensional general image and a depth information image. The general image thus obtained is input to the image processor 612, and the depth information image is input to the depth information processor 614. The image processor 612 converts the image information acquired by the sensor unit 611 into a form that can be used in a broadcast system or content provided. In addition, the depth information and / or image information acquired by the sensor unit 611 may be input to the mask information generation unit 615. The mask information generation unit 615 means mask information determined by an operator, a content producer, or a viewer. Accordingly, the mask information generator 615 may generate mask information according to the size of the 2D image and the depth information image.

마스크 정보를 획득하기 위한 과정을 좀 더 설명하면, 마스크 정보는 운영자 또는 사용자 또는 시청자의 선택에 의해 마스크 정보를 획득하게 된다. 이는 도 4의 실시 예인 경우로 서비스 제공자 또는 콘텐츠 제작자가 미리 마스크 정보를 선택적으로 생성하도록 할 수 있다. 또한 도 5의 경우에서는 기본적으로는 방송 시스템의 운영자가 기본으로 설정한 값으로 선택적인 3차원 영상을 제공하기 위한 마스크 값으로 방송 서비스를 제공하고, 시청자로부터 새로운 마스크 값을 수신하면 시청자가 선택한 마스크 값을 마스크 값으로 생성하여 제공한다.The process for acquiring the mask information will be described in more detail. The mask information is obtained by selecting the operator or the user or the viewer. In the case of the embodiment of FIG. 4, the service provider or the content producer may selectively generate mask information in advance. In addition, in the case of FIG. 5, basically, a broadcast service is provided as a mask value for providing an optional 3D image as a value set by the operator of the broadcasting system and a mask selected by the viewer when a new mask value is received from the viewer. Create and provide a value as a mask value.

이와 같이 마스크 값이 생성되면, 깊이 정보 처리부(614)에서의 깊이 정보와 생성된 마스크 정보를 결합부(616)에서 결합한다. 따라서 깊이 정보 중 시스템의 운영자 또는 콘텐츠 제공자 또는 시청자가 선택한 관심 영역 정보에 의해 생성된 마스크 값을 이용하여 마크킹 된다. 이와 같이 마스킹된 깊이 정보는 깊이 정보 부호기(617)에서 마스킹된 깊이 정보가 부호화된다. 여기서 마스크 정보를 전송해야 하는 경우 깊이 정보 부호기(613)에서 마스크 정보를 함께 부호화할 수 있다. 또한 영상 처리부(612)에서 처리된 2차원 영상은 영상 부호기(613)에서 영상 부호화가 이루어진다. 마스킹되어 부호화된 깊이 정보 영상과 부호화된 2차원 영상은 다중화 부(618)로 입력되어 다중화된다. 이때에도 마스크 값이 함께 전송되는 경우라면, 마스크 정보가 함께 다중화부(618)에서 다중화된다. 이와 같이 다중화된 영상은 부호화된 3차원 영상으로 3차원 영상은 선택적인 부분에 한하여 3차원 영상을 생성하게 된다. 이와 같이 생성된 정보는 실제로 전체 영역에 대한 3차원 영상 정보보다 적은 양의 데이터로 구성된다.When the mask value is generated in this manner, the depth information processor 614 combines the depth information and the generated mask information in the combiner 616. Therefore, the depth information is marked using a mask value generated by the ROI information selected by the operator, the content provider, or the viewer of the system. In the masked depth information, the depth information masked by the depth information encoder 617 is encoded. When the mask information needs to be transmitted, the mask information may be encoded together by the depth information encoder 613. Also, the 2D image processed by the image processor 612 is subjected to image encoding by the image encoder 613. The masked and encoded depth information image and the encoded two-dimensional image are input to the multiplexer 618 and multiplexed. In this case, when the mask values are transmitted together, the mask information is multiplexed together by the multiplexer 618. The multiplexed image is an encoded 3D image, and the 3D image generates a 3D image only in an optional part. The information generated in this way is actually composed of a smaller amount of data than the 3D image information for the entire area.

상기한 바와 같은 방법으로 생성된 영상 정보는 송신기(도면에는 도시하지 않음)를 통해 전송된다. 여기서도 영상 정보의 송신 방법은, 앞에서 기술한 방법들 중 하나의 방법을 통해 전송된다.Image information generated by the above-described method is transmitted through a transmitter (not shown). Here, the method of transmitting image information is transmitted through one of the methods described above.

도 7은 본 발명의 일 실시 예에 따라 선택적 3차원 영상 정보를 디스플레이 하기 위한 장치의 개념적인 블록 구성도이다.7 is a conceptual block diagram of an apparatus for displaying selective 3D image information according to an embodiment of the present invention.

수신부(311)는 소정의 방송 시스템을 통해 전송된 데이터를 수신한다. 그러면 수신부(311)는 방송 시스템의 전송 방식에 따라 즉, 무선 시스템 또는 유선 시스템에서 요구되는 방식으로 신호를 수신하고, 소정 대역의 신호로 처리한다. 예를 들어 무선 시스템의 경우 RF 신호에서 캐리어(carrier) 신호를 제거하여 원하는 대역의 신호로 변환한 후 무선 신호의 전송에 따른 복호화를 수행하여 출력한다. 만일 유선 시스템인 경우 유선 전송 선로를 통해 전송된 데이터에 맞춰 데이터를 변환한다. 이와 같이 수신부(311)에서 변환된 데이터는 역다중화부(312)로 입력된다. The receiver 311 receives data transmitted through a predetermined broadcast system. Then, the receiver 311 receives a signal according to a transmission method of the broadcasting system, that is, a method required by a wireless system or a wired system, and processes the signal into a signal of a predetermined band. For example, in a wireless system, a carrier signal is removed from an RF signal, converted into a signal of a desired band, and then decoded according to transmission of a wireless signal and output. In the case of a wired system, the data is converted according to the data transmitted through the wired transmission line. The data converted by the receiver 311 is input to the demultiplexer 312.

역다중화부(312)는 송신측에서 송신한 영상 정보와 마스킹된 깊이 정보를 다중화한 반대 방법으로 역다중화를 수행한다. 따라서 역다중화부(312)는 영상 정보와 마스킹된 깊이 정보를 구별하여 출력한다. 여기서 역다중화부(312)는 마스크 정 보가 함께 전송된 경우 마스크 정보를 함께 역다중화 한다. 역다중화된 영상 정보는 영상 복호기(313)로 입력되고, 역다중화된 깊이 영상 정보 또는 깊이 영상 정보와 마스크 정보는 깊이 정보 복호기(314)로 입력된다.The demultiplexer 312 performs demultiplexing by an opposite method of multiplexing the image information transmitted from the transmitter and the masked depth information. Accordingly, the demultiplexer 312 distinguishes and outputs image information and masked depth information. Here, the demultiplexer 312 demultiplexes the mask information when the mask information is transmitted together. The demultiplexed image information is input to the image decoder 313, and the demultiplexed depth image information or the depth image information and the mask information are input to the depth information decoder 314.

영상 복호기(313)는 촬영된 2차원의 영상 신호를 복호하여 출력한다. 그리고 깊이 정보 복호기(314)는 마스킹된 깊이 정보를 복호한다. 이때에도 마스크 정보가 함께 전송된 경우 마스크 정보를 이용하여 마스킹된 위치를 판별하도록 할 수 있다. 즉, 마스킹된 깊이 정보는 앞에서 설명한 바와 같이 2차원 정보와 함께 획득된 깊이 정보에서 운용자 또는 시청자에 의해 마스크 값이 결정되어 깊이 정보 영상을 마스킹하여 출력된 깊이 영상 정보이다. 이와 같이 영상 복호기(313)와 깊이 정보 복호기(314)에서 복호된 정보는 다시점 영상 생성부(315)로 입력되어 선택된 부분에 대하여만 다시점 영상을 생성하고, 나머지 부분은 2차원 영상을 재생한다. 즉, 본 발명에 따른 선택적 3차원 영상을 생성하게 된다. 다시점 영상 생성부(315)에서 생성된 다시점 영상은 다시점 디스플레이부(316)에서 선택된 부분에 대하여는 3차원 영상으로 그리고 선택되지 않은 부분에 대하여는 2차원 영상으로 디스플레이 된다. 이를 통해 시청자들은 선택적인 3차원 영상을 청취할 수 있다. 이상에서 설명한 영상 복호기(713)와 깊이 정보 복호기(714) 및 다시점 영상 제공부(715)를 영상 제공부라 칭한다. The image decoder 313 decodes and outputs the photographed two-dimensional image signal. The depth information decoder 314 then decodes the masked depth information. In this case, when the mask information is transmitted together, the masked position may be determined using the mask information. That is, the masked depth information is depth image information that is output by masking a depth information image by determining a mask value by an operator or a viewer from depth information acquired together with 2D information as described above. In this way, the information decoded by the image decoder 313 and the depth information decoder 314 is input to the multiview image generator 315 to generate a multiview image only for the selected portion, and the rest of the image reproduces the 2D image. do. That is, the selective three-dimensional image according to the present invention is generated. The multiview image generated by the multiview image generator 315 is displayed as a 3D image for the portion selected by the multiview display 316 and as a 2D image for the portion not selected. This allows viewers to listen to the optional 3D image. The image decoder 713, the depth information decoder 714, and the multiview image providing unit 715 described above are called image providing units.

도 1은 3차원 방송 서비스를 제공하기 위한 시스템의 개념도,1 is a conceptual diagram of a system for providing a 3D broadcast service;

도 2는 영상 정보 및 깊이 정보 영상을 함께 획득하기 위한 장치의 개념적인 블록 구성도,2 is a conceptual block diagram of an apparatus for acquiring image information and depth information image together;

도 3은 영상 정보 및 깊이 정보 영상을 디스플레이하기 위한 장치의 개념적인 블록 구성도,3 is a conceptual block diagram of an apparatus for displaying image information and depth information image;

도 4는 본 발명의 일 실시 예에 따라 방송 콘텐츠를 관심 영역에 대하여만 3차원으로 생성하여 제공하기 위한 개념도,4 is a conceptual diagram for generating and providing broadcast content in three dimensions only for a region of interest according to an embodiment of the present invention;

도 5는 본 발명의 다른 실시 예에 따라 방송 콘텐츠의 관심 영역을 설정하여 3차원 방송을 제공하기 위한 개념도,5 is a conceptual diagram for providing a 3D broadcast by setting a region of interest of broadcast content according to another embodiment of the present invention;

도 6은 본 발명에 따라 선택적 3차원 정보를 제공하기 위한 장치의 개념적인 블록 구성도,6 is a conceptual block diagram of an apparatus for providing selective three-dimensional information according to the present invention;

도 7은 본 발명의 일 실시 예에 따라 선택적 3차원 영상 정보를 디스플레이 하기 위한 장치의 개념적인 블록 구성도.7 is a conceptual block diagram of an apparatus for displaying selective 3D image information according to an embodiment of the present invention.

Claims

In the transmission method for providing a three-dimensional broadcast service,

Acquiring a depth information image of the 2D image together with 2D image information;

Generating a mask value for removing an uninterested region of the 2D image information;

Masking a depth information image using the mask value;

And encoding and transmitting the 2D image information and the masked depth information image.

The method of claim 1,

And transmitting the mask information by encoding the mask information and transmitting the mask information together with the 2D image information and the masked depth information image.

The method according to claim 1 or 2,

3D broadcast service method based on ROI depth information, wherein the 2D image information and the depth information image are each composed of objectized information.

The method of claim 3, wherein the mask value,

3D broadcast service method based on ROI depth information generated based on a ROI selected by an operator of the broadcast service.

The method of claim 3, wherein the mask value,

3D broadcast service method based on ROI depth information generated based on a ROI selected by a viewer.

In the transmitting device for providing a three-dimensional broadcast service,

A sensor unit which acquires a depth information image of the 2D image together with 2D image information;

A mask information generator for removing a depth information of an uninterested region of the 2D image information;

A coupling unit for masking the depth information image using the mask value;

And a transmitter for encoding and transmitting the 2D image information and the masked depth information image.

The method of claim 6,

The coupling unit outputs the mask information together,

3. The apparatus of claim 3, wherein the mask information is transmitted together with the 2D image information and the masked depth information image.

The method according to claim 6 or 7,

3D broadcast service apparatus based on ROI depth information, wherein the 2D image information and the depth information image are each composed of objectized information.

The method of claim 8, wherein the mask value,

3D broadcast service apparatus based on ROI depth information generated based on a ROI selected by an operator of the broadcast service.

The method of claim 8, wherein the mask value,

3D broadcast service apparatus based on ROI depth information generated based on ROI received from a viewer's terminal.

In the broadcast service receiving method for receiving a 3D broadcast service,

Dividing the 2D image information and the depth image information and the mask information corresponding to the ROI from the received data;

Generating a 3D multi-view image in the ROI of the 2D image information by using the mask information and ROI information;

And a region of interest depth information based on the multiview image.

The method of claim 11,

3. The method of claim 3, wherein the 2D image information and the depth image information are each composed of objectized information.

The method of claim 12,

Obtaining, by the viewer, the ROI by selecting at least one object from among the objectized images in the displayed multi-view image;

And transmitting the acquired ROI information to a service provider for providing a broadcast service.

In the broadcast service receiving apparatus for receiving a 3D broadcast service,

A demultiplexer which separates 2D image information, ROI information, and mask information from the received data;

An image providing unit generating a 3D multi-view image in the ROI of the 2D image information by using the mask information and ROI information;

3. The apparatus for 3D broadcasting service based on the ROI depth information, including a display unit configured to display the multiview image.

The method of claim 14,

And 3D broadcast service apparatus based on ROI depth information, wherein the 2D image information and the depth image information are each composed of objectified information.

The method of claim 15,

An object selection unit for obtaining a region of interest information by selecting at least one or more objects of the objectized image from the displayed multi-view image;

3. The method of claim 3, further comprising a transmitter configured to transmit the acquired ROI information to a service provider providing a broadcast service.