WO2006016782A1 - Method and apparatus to encode image, and method and apparatus to decode image data - Google Patents

Method and apparatus to encode image, and method and apparatus to decode image data Download PDF

Info

Publication number
WO2006016782A1
WO2006016782A1 PCT/KR2005/002638 KR2005002638W WO2006016782A1 WO 2006016782 A1 WO2006016782 A1 WO 2006016782A1 KR 2005002638 W KR2005002638 W KR 2005002638W WO 2006016782 A1 WO2006016782 A1 WO 2006016782A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
omni
region
interest
directional
Prior art date
Application number
PCT/KR2005/002638
Other languages
French (fr)
Inventor
Gwang-Hoon Park
Original Assignee
Industry Academic Cooperation Foundation Kyunghee University
Samsung Electronics Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020040075972A external-priority patent/KR100739686B1/en
Application filed by Industry Academic Cooperation Foundation Kyunghee University, Samsung Electronics Co., Ltd. filed Critical Industry Academic Cooperation Foundation Kyunghee University
Priority to CN2005800269727A priority Critical patent/CN101002471B/en
Priority to EP05780541A priority patent/EP1782632A1/en
Priority to JP2007525547A priority patent/JP2008510357A/en
Publication of WO2006016782A1 publication Critical patent/WO2006016782A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8146Monomedia components thereof involving graphical data, e.g. 3D object, 2D graphics
    • H04N21/8153Monomedia components thereof involving graphical data, e.g. 3D object, 2D graphics comprising still images, e.g. texture, background image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234363Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the spatial resolution, e.g. for clients with a lower screen resolution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2662Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/4728End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for selecting a Region Of Interest [ROI], e.g. for requesting a higher resolution version of a selected region

Definitions

  • the present general inventive concept relates to an image encoding/decoding technique, and more particularly, to a method of encoding/decoding an omni-di- rectional image for three-dimensional (3D) realistic broadcasting.
  • Omni-directional video camera systems are camera systems that photograph a 360
  • Omni-directional video camera systems include a camera to which a special mirror, such as a hyperboloid mirror, or a special lens, such as a fish-eye lens, is installed or a plurality of cameras to photograph an omni-directional view.
  • a special mirror such as a hyperboloid mirror
  • a special lens such as a fish-eye lens
  • all image information regarding scenes viewed from diverse viewpoints including a viewpoint of a pitcher, a viewpoint of a catcher, a viewpoint of a hitter, and a viewpoint of an audience on the first base side in a baseball game is provided to a viewer's terminal.
  • the viewer can select a desired viewpoint and view a scene from the desired viewpoint.
  • FIG. 1 is a conceptual diagram of a con ⁇ ventional omni-directional video encoding/decoding system.
  • an omni-directional image is acquired using an omni-directional photographing unit 110.
  • An image converter 120 converts the omni-directional image into a predetermined format that can be processed by an existing MPEG-4 encoder 130.
  • An image photographed using an omni-directional camera system using a special lens or mirror or a plurality of cameras has characteristics corresponding to a 3D spherical environment. Since a conventional video codec receives, compresses, and transmits a 2D image, a 3D image photographed using an omni-directional camera system needs to be converted into a 2D image. Cartographical projection and polygonal projection have been presented to convert a 3D image into a 2D image.
  • Cartographical projection is a process of projecting a spherical shape onto a complete rectangular plane like producing a typical world map.
  • Polygonal projection is a process of projecting a spherical shape into a development figure of a polyhedron.
  • the MPEG-4 encoder 130 encodes the converted image to generate a bitstream and transmits the bitstream to a decoding unit of a user.
  • An MPEG-4 decoder 140 decodes the bitstream.
  • An image converter 150 converts the decoded bitstream into an omni ⁇ directional image.
  • a display unit 160 displays the omni-directional image.
  • the present general inventive concept provides an image encoding method and apparatus by which an omni-directional image is efficiently transmitted and a user's region-of-interest in the omni-directional image is provided to the user with improved picture quality.
  • the present general inventive concept also provides an image decoding method and apparatus by which a user's region-of-interest with improved picture quality in the omni-directional image is received and displayed.
  • a rough omni-directional image is transmitted first to a decoding apparatus through a channel having a restricted bandwidth and then a high-resolution image of a region-of-interest selected by a user from the omni-directional image is provided to the user. Therefore, the present invention can transmit an omni-directional image efficiently and improve the picture quality of a user's region-of-interest in the omni-directional image.
  • FlG. 1 is a conceptual diagram of a conventional omni-directional video encoding/ decoding system ;
  • FlG. 2 is a block diagram illustrating an image encoding apparatus according to an embodiment of the present general inventive concept
  • FlG. 3 is a block diagram illustrating an image decoding apparatus according to an embodiment of the present general inventive concept
  • FlG. 4 is a flowchart illustrating a method of encoding an image according to an embodiment of the present general inventive concept
  • FlG. 5 is a flowchart illustrating a method of decoding an image according to an embodiment of the present general inventive concept
  • FlG. 6 is a block diagram illustrating an image encoding apparatus according to another embodiment of the present general inventive concept.
  • FlG. 7 is a block diagram illustrating an image decoding apparatus according to another embodiment of the present general inventive concept.
  • the foregoing and/or other aspects of the present general inventive concept are achieved by providing a method of encoding an image, the method including generating a first bitstream by encoding an omni-directional image and transmitting the first bitstream to a decoding apparatus, receiving position information of a region- of-interest selected from an image reconstructed based on the first bitstream from the decoding apparatus, and generating a second bitstream by encoding an image of the region-of-interest based on the position information.
  • an apparatus to encode an image including a first encoder to encode an omni-directional image to generate a first bitstream, a data communicator to transmit the first bitstream to decoding apparatus and to receive position information of a region-of-interest selected from an image reconstructed based on the first bitstream from the decoding apparatus, and a second encoder to encode an image of the region- of-interest based on the position information to generate a second bitstream.
  • the foregoing and/or other aspects of the present general inventive concept are also achieved by providing a method of decoding an image, the method including receiving a first bitstream generated by encoding an omni-directional image from an encoding apparatus, decoding the first bitstream and displaying a reconstructed image, transmitting position information of a region-of-interest selected from the re ⁇ constructed image to the encoding apparatus, receiving a second bitstream generated by encoding an image of the region-of-interest from the encoding apparatus, and decoding the second bitstream.
  • an apparatus to decode an image including a first decoder to receive a first bitstream generated by encoding an omni-directional image from an encoding apparatus and to decode the first bitstream to generate a reconstructed omni ⁇ directional image, a first display unit to display the reconstructed omni-directional image output from the first decoder, a data communicator to transmit position in ⁇ formation of a region-of-interest selected from the reconstructed omni-directional image displayed through the first display unit to the encoding apparatus, and a second decoder to receive a second bitstream generated by encoding an image of the region- of-interest from the encoding apparatus and to decode the second bitstream.
  • a user may want to make a viewpoint transition based on information regarding the full omni-directional image and closely and partially observe a region-of-interest in the full omni-directional image.
  • embodiments of the present general inventive concept provide a method and apparatus to transmit a portion of the omni-directional image other than the user's region- of-interest to the user's terminal using a minimum bandwidth and transmitting an image of the region-of-interest at a high resolution.
  • a full panorama image is provided to a decoding apparatus at a low definition to provide the user a rough view of the panorama image.
  • a high-resolution image of the region-of-interest is provided to the decoding apparatus.
  • FlG. 2 is a block diagram illustrating an image encoding apparatus according to an embodiment of the present general inventive concept.
  • the image encoding apparatus includes a first encoder 210, a data communicator 220, a first conversion unit 240, a second conversion unit 230, a region-of-interest selector 250, a subtractor 260, and a second encoder 270.
  • the omni-directional image may be an annular image, but is not limited thereto.
  • the omni-directional camera system may be a camera system including a special lens or a combination of a mirror and a lens, and can photograph up to a 360 ° omni ⁇ directional view from a single viewpoint.
  • Sony's TVR-900 and HDW F900 are examples of such an omni-directional camera system.
  • the TVR-900 can photograph a 180 ° view and the HDW F900 can photograph a 360 ° view.
  • the omni ⁇ directional camera system can obtain an omni-directional image using a plurality of cameras.
  • An annular image is an image that is photographed after being reflected from a mirror in a mirror-based omni-directional camera system and implies a 360 ° omni ⁇ directional image.
  • the first encoder 210 receives and encodes the annular image using a pre ⁇ determined method to generate an annular image bitstream.
  • An encoder complying with a Motion Picture Experts Group (MPEG)-4 Part 2 standard or an H.264 (or MPEG-4 Part 10 AVC) standard may be used as the first encoder 210.
  • MPEG Motion Picture Experts Group
  • H.264 or MPEG-4 Part 10 AVC
  • the present general inventive concept is not restricted thereto, and an encoder modified to be suitable to an annular image may alternatively be used as the first encoder 210.
  • An annular image bitstream generated by the first encoder 210 is transmitted via the data communicator 220 to an image decoding apparatus, such as the image decoding apparatus illustrated in FIG. 3.
  • the image decoding apparatus decodes the annular image bitstream to obtain a reconstructed annular image, converts the re ⁇ constructed annular image into a panorama image, and displays the panorama image through a panorama image display unit 330 (see FIG. 3).
  • the first encoder 210 also generates a reconstructed annular image suitable to a particular bandwidth based on the annular image bitstream and stores it in a reconstructed annular image buffer (not shown).
  • the first encoder 210 generates the reconstructed annular image by decoding the annular image bitstream generated therein, and therefore the first encoder 210 has a decoding capability as well as an encoding capability.
  • the reconstructed annular image generated by the first encoder 210 is input to the second conversion unit 230.
  • the first conversion unit 240 includes a first annular-to-panorama converter (APC)
  • the second conversion unit 230 includes a second APC 231 and a second PPIC 233.
  • the first conversion unit 240 and the second conversion unit 230 respectively convert the original annular image and the reconstructed annular image into a pre ⁇ determined image format.
  • the first APC 241 and the second APC 231 convert the original annular image and the reconstructed annular image into first and second panorama images, respectively.
  • Cartographical projection and polygonal projection are methods that can be used by the first APC 241 and the second APC 231 to covert the original annular image and the reconstructed annular image into two-dimensional (2D) images (i.e. the first and second panorama images).
  • the first PPIC 243 and the second PPIC 233 convert the first and second panorama images into first and second perspective images, respectively.
  • Parallel projection and perspective projection are methods that can be used by the first PPIC 243 and the second PPIC 237 to convert the first and second panorama images into the first and second perspective images.
  • the region-of-interest selector 250 receives position information of a region- of-interest selected by a user from an image decoding apparatus, such as the image decoding apparatus illustrated in FlG. 3, and controls the first PPIC 243 and the second PPIC 233 to output the first and second perspective images corresponding to the region-of-interest.
  • the subtractor 260 outputs an error image between the first perspective image output from the first PPIC 243 and the second perspective image output from the second PPIC 233 to the second encoder 270.
  • the second encoder 270 encodes the error image using a predetermined method to generate a perspective image bitstream to be transmitted to the image decoding apparatus.
  • An encoder complying with the MPEG-4 Part 2 standard or the H.264 (or MPEG-4 Part 10 AVC) standard may be used as the second encoder 270, but the present general inventive concept is not restricted thereto.
  • FIG. 4 is a flowchart illustrating a method of encoding an image according to an embodiment of the present general inventive concept.
  • an omni-directional annular image is generated by an omni-di- rectional camera system (not shown).
  • the omni-directional annular image is input to the first encoder 210.
  • the omni-directional annular image is encoded by the first encoder 210 using a predetermined encoding method, such as the MPEG-4 Part 2 or the H.264, and thus an annular image bitstream, i.e., a first bitstream, is generated.
  • the annular image bitstream (first bitstream) is transmitted via the data communicator 220 to the image decoding apparatus, as il ⁇ lustrated in FIG. 3, over a predetermined channel.
  • the image decoding apparatus decodes the received annular image bitstream (first bitstream) to obtain a reconstructed annular image, converts the reconstructed annular image into a panorama image, and displays the panorama image through a panorama image display unit, such as the panorama image display unit 330 of FIG. 3 to be described in more detail infra.
  • the picture quality of the panorama image displayed through the panorama image display unit 330 cannot be guaranteed, but a user can view a full image through the panorama image display unit 330.
  • the user may input a command to select a region-of-interest, which the user wants to view more closely, in the full image displayed by the panorama image display unit 330, using a user interface (UI) 340 (see FIG. 3).
  • UI user interface
  • Position information of the region-of-interest is output from the UI 340 and transmitted through a data communicator 350 (see FlG. 3) to the image encoding apparatus of FlG. 2. Then, at operation S440, the position information of the region- of-interest is received by the region-of-interest selector 250 of the image encoding apparatus.
  • the region-of-interest selector 250 controls the first PPIC 243 and the second PPIC 233 to output images corresponding to the region-of-interest according to the position information of the region-of-interest.
  • the first PPIC 243 extracts an image corresponding to the region-of-interest from a first panorama image output from the first APC 241 and converts the extracted image into a first perspective image.
  • the second PPIC 233 extracts an image corresponding to the region- of-interest from a second panorama image output from the second APC 231 and converts the extracted image into a second perspective image.
  • the first perspective image output from the first PPIC 243 is a result of converting an original annular image into the first panorama image and then converting the first panorama image into the first perspective image.
  • the second perspective image output from the second PPIC 233 is a result of converting a reconstructed annular image output from the first encoder 210 into the second panorama image and then converting the second panorama image into the second perspective image.
  • the first perspective image output from the first PPIC 243 may be referred to as an original region-of-interest image and the output image from the second PPIC 233 may be referred to as a reconstructed region-of-interest image.
  • the subtractor 260 outputs an error image between the original region-of-interest image and the reconstructed region-of-interest image to the second encoder 270.
  • an amount of transmission data can be reduced as compared to encoding all of the original region-of-interest image.
  • the second encoder 270 encodes the error image using a predetermined encoding method, such as the MPEG-4 Part 2 or the H.264, to generate a perspective image bitstream, i.e. a second bitstream, to be transmitted to the image decoding apparatus.
  • the perspective image bitstream (second bitstream) is transmitted to the image decoding apparatus.
  • the user can view a high-resolution image of the region-of-interest.
  • FlG. 3 illustrates an image decoding apparatus according to an embodiment of the present general inventive concept.
  • the image decoding apparatus includes a first decoder 310, a conversion unit 320, the UI 340, the data communicator 350, a region-of-interest selector 360, a second decoder 370, and a mixer 380.
  • the image decoding apparatus of FlG. 3 receives an annular image bitstream and a perspective image bitstream from an image encoding apparatus, such as the image encoding apparatus of FlG. 2, and displays a full panorama image through the panorama image display unit 330 and a perspective image of a region-of-interest through a perspective image display unit (not shown).
  • the first decoder 310 receives and decodes the annular image bitstream generated by encoding an annular image.
  • the conversion unit 320 includes an APC 321 and a PPIC 323.
  • the conversion unit 320 receives a reconstructed annular image output from the first decoder 310 and converts the reconstructed annular image into a pre ⁇ determined image format.
  • the APC 321 converts the reconstructed annular image into a panorama image.
  • the PPIC 323 converts the panorama image into a perspective image.
  • the UI 340 receives a command input by a user.
  • the data communicator 350 performs data communication with an image encoding apparatus, such as the image encoding apparatus of FlG. 2.
  • the region-of-interest selector 360 receives position in ⁇ formation of a region-of-interest selected by the user from the UI 340 and controls the PPIC 323 to output an image of the region-of-interest.
  • the mixer 380 mixes an output image from the second decoder 370 and the image output from the PPIC 323 to generate a perspective image to be displayed by the perspective image display unit.
  • the output image from the second decoder 370 is the error image between the original region-of-interest image and the reconstructed region-of-interest image. Accordingly, a complete perspective image can be obtained by mixing the error image, i.e., the output image from the second decoder 370 and the reconstructed region-of-interest image, i.e., the output image from the PPIC 323.
  • FlG. 5 is a flowchart illustrating a method of decoding an image according to an embodiment of the present general inventive concept.
  • an annular image bitstream i.e., a first bitstream generated by encoding an omni-directional annular image
  • the annular image bitstream is decoded by the first decoder 310 and then converted into a panorama image by the APC 321, and the panorama image is displayed by the panorama image display unit 330.
  • a user can view the panorama image that is not a high-resolution image but provides an omni-directional image through the panorama image display unit 330.
  • the user may input a command to select a region-of-interest, which the user wants to view more closely in the omni-directional image displayed by the panorama image display unit 330, using the UI 340.
  • position information of the region- of-interest is output from the UI 340 and received by the region-of-interest selector 360.
  • the position information of the region-of-interest output from the UI 340 is transmitted through the data communicator 350 to an image encoding apparatus, such as the image encoding apparatus illustrated in FTG. 2.
  • the image encoding apparatus generates a perspective image bitstream including high-resolution perspective image data corresponding to the region-of-interest based on the position in ⁇ formation of the region-of-interest and transmits the perspective image bitstream to the image decoding apparatus of FTG.
  • the perspective image bitstream i.e., a second bitstream
  • the perspective image bitstream is received through the data communicator 350 and input to the second decoder 370.
  • the perspective image bitstream (second bitstream) is decoded by the second decoder 370 and then output to the mixer 380.
  • the mixer 380 mixes an output image from the second decoder 370 and an output image from the PPIC 323, thereby generating a perspective image of the region-of-interest.
  • the perspective image of the region-of-interest is displayed by the perspective image display unit (not shown).
  • FTG. 6 is a block diagram illustrating an image encoding apparatus according to another embodiment of the present general inventive concept.
  • the image encoding apparatus of FIG. 6 has a similar structure to the image encoding apparatus of FTG. 2, with the exception that the image encoding apparatus of FTG. 6 further includes a down-sampler 205 and an up-sampler 215 to provide spatial scalability.
  • FTG. 7 is a block diagram illustrating an image decoding apparatus according to another embodiment of the present general inventive concept.
  • the image decoding apparatus of FIG. 7 is provided to correspond to the image encoding apparatus of FTG. 6 and has a similar structure to the image decoding apparatus of FIG. 3, with the exception that the image decoding apparatus of FTG. 7 further includes an up-sampler 315 to up-sample an output image from the first decoder 310 in order to correspond to the image encoding apparatus of FIG. 6, which provides spatial scalability.
  • the present general inventive concept can also be embodied as computer readable codes on a computer readable recording medium.
  • the computer readable recording medium can be any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, optical data storage devices, and carrier waves (such as data transmission through the Internet).
  • ROM read-only memory
  • RAM random-access memory
  • CD-ROMs compact discs
  • magnetic tapes magnetic tapes
  • floppy disks floppy disks
  • optical data storage devices such as data transmission through the Internet
  • carrier waves such as data transmission through the Internet
  • a rough omni-directional image is transmitted first to a decoding apparatus through a channel having a restricted bandwidth and then a high-resolution image of a region-of-interest selected by a user from the omni-directional image is provided to the user.

Abstract

A method and apparatus to encode an image and a method and apparatus to decode an image are provided. The apparatus to encode an image includes a first encoder to encode an omni¬ directional image to generate a first bitstream, a data communicator to transmit the first bitstream to a decoding apparatus and to receive position information of a region-of-interest selected from an image reconstructed based on the first bitstream from the decoding apparatus, and a second encoder to encode an image of the region-of-interest based on the position information to generate a second bitstream.

Description

Description METHOD AND APPARATUS TO ENCODE IMAGE, AND
METHOD AND APPARATUS TO DECODE IMAGE DATA
Technical Field
[1] The present general inventive concept relates to an image encoding/decoding technique, and more particularly, to a method of encoding/decoding an omni-di- rectional image for three-dimensional (3D) realistic broadcasting.
Background Art
[2] Omni-directional video camera systems are camera systems that photograph a 360
° omni-directional view from a single viewpoint. Omni-directional video camera systems include a camera to which a special mirror, such as a hyperboloid mirror, or a special lens, such as a fish-eye lens, is installed or a plurality of cameras to photograph an omni-directional view. Studies on omni-directional video encoding for adapting video information generated by such an omni-directional video camera system to be broadcast are in progress.
[3] An example of using omni-directional video encoding is 3D realistic broadcasting.
For example, all image information regarding scenes viewed from diverse viewpoints including a viewpoint of a pitcher, a viewpoint of a catcher, a viewpoint of a hitter, and a viewpoint of an audience on the first base side in a baseball game is provided to a viewer's terminal. The viewer can select a desired viewpoint and view a scene from the desired viewpoint.
[4] Quick Time VR® is an example of 3D realistic broadcasting. According to the
Quick Time VR®, photos with a 360 ° cylindrical or cubical panoramic view can be produced and rotated 360 ° or zoomed in. However, users must download information regarding all panoramic images in advance of viewing the images, and the quality of these images is very low.
[5] Studies on a technique of applying conventional two-dimensional (2D) image encoding methods, such as Motion Picture Experts Group (MPEG)-4 and H.264, to omni-directional 3D images are in progress. FIG. 1 is a conceptual diagram of a con¬ ventional omni-directional video encoding/decoding system. Referring to FIG. 1, an omni-directional image is acquired using an omni-directional photographing unit 110. An image converter 120 converts the omni-directional image into a predetermined format that can be processed by an existing MPEG-4 encoder 130.
[6] An image photographed using an omni-directional camera system using a special lens or mirror or a plurality of cameras has characteristics corresponding to a 3D spherical environment. Since a conventional video codec receives, compresses, and transmits a 2D image, a 3D image photographed using an omni-directional camera system needs to be converted into a 2D image. Cartographical projection and polygonal projection have been presented to convert a 3D image into a 2D image.
[7] Cartographical projection is a process of projecting a spherical shape onto a complete rectangular plane like producing a typical world map. Polygonal projection is a process of projecting a spherical shape into a development figure of a polyhedron.
[8] The MPEG-4 encoder 130 encodes the converted image to generate a bitstream and transmits the bitstream to a decoding unit of a user. An MPEG-4 decoder 140 decodes the bitstream. An image converter 150 converts the decoded bitstream into an omni¬ directional image. A display unit 160 displays the omni-directional image.
[9] Since the amount of omni-directional image data to be transmitted to a user is large, a very broad bandwidth is needed to transmit the omni-directional image data to the user in real time. Moreover, problems like transmission delay and limits in performance of a user's decoding unit may occur. Furthermore, when conventional 2D image encoding is applied to an omni-directional image as it is, regardless of char¬ acteristic differences between an omni-directional image and a 2D image, encoding efficiency decreases. Disclosure of Invention
Technical Solution
[10] The present general inventive concept provides an image encoding method and apparatus by which an omni-directional image is efficiently transmitted and a user's region-of-interest in the omni-directional image is provided to the user with improved picture quality.
[11] The present general inventive concept also provides an image decoding method and apparatus by which a user's region-of-interest with improved picture quality in the omni-directional image is received and displayed.
[12] Additional aspects of the present general inventive concept will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the general inventive concept.
Advantageous Effects
[13] According to the embodiments of the present general inventive concept, a rough omni-directional image is transmitted first to a decoding apparatus through a channel having a restricted bandwidth and then a high-resolution image of a region-of-interest selected by a user from the omni-directional image is provided to the user. Therefore, the present invention can transmit an omni-directional image efficiently and improve the picture quality of a user's region-of-interest in the omni-directional image.
Description of Drawings [14] These and/or other aspects of the present general inventive concept will become apparent and more readily appreciated from the following description of the em¬ bodiments, taken in conjunction with the accompanying drawings of which:
[15] FlG. 1 is a conceptual diagram of a conventional omni-directional video encoding/ decoding system ;
[16] FlG. 2 is a block diagram illustrating an image encoding apparatus according to an embodiment of the present general inventive concept;
[17] FlG. 3 is a block diagram illustrating an image decoding apparatus according to an embodiment of the present general inventive concept;
[18] FlG. 4 is a flowchart illustrating a method of encoding an image according to an embodiment of the present general inventive concept;
[19] FlG. 5 is a flowchart illustrating a method of decoding an image according to an embodiment of the present general inventive concept;
[20] FlG. 6 is a block diagram illustrating an image encoding apparatus according to another embodiment of the present general inventive concept; and
[21] FlG. 7 is a block diagram illustrating an image decoding apparatus according to another embodiment of the present general inventive concept.
Best Mode
[22] The foregoing and/or other aspects of the present general inventive concept are achieved by providing a method of encoding an image, the method including generating a first bitstream by encoding an omni-directional image and transmitting the first bitstream to a decoding apparatus, receiving position information of a region- of-interest selected from an image reconstructed based on the first bitstream from the decoding apparatus, and generating a second bitstream by encoding an image of the region-of-interest based on the position information.
[23] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing an apparatus to encode an image, including a first encoder to encode an omni-directional image to generate a first bitstream, a data communicator to transmit the first bitstream to decoding apparatus and to receive position information of a region-of-interest selected from an image reconstructed based on the first bitstream from the decoding apparatus, and a second encoder to encode an image of the region- of-interest based on the position information to generate a second bitstream.
[24] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing a method of decoding an image, the method including receiving a first bitstream generated by encoding an omni-directional image from an encoding apparatus, decoding the first bitstream and displaying a reconstructed image, transmitting position information of a region-of-interest selected from the re¬ constructed image to the encoding apparatus, receiving a second bitstream generated by encoding an image of the region-of-interest from the encoding apparatus, and decoding the second bitstream.
[25] The foregoing and/or other aspects of the present general inventive concept are also achieved by providing an apparatus to decode an image, including a first decoder to receive a first bitstream generated by encoding an omni-directional image from an encoding apparatus and to decode the first bitstream to generate a reconstructed omni¬ directional image, a first display unit to display the reconstructed omni-directional image output from the first decoder, a data communicator to transmit position in¬ formation of a region-of-interest selected from the reconstructed omni-directional image displayed through the first display unit to the encoding apparatus, and a second decoder to receive a second bitstream generated by encoding an image of the region- of-interest from the encoding apparatus and to decode the second bitstream.
Mode for Invention
[26] Reference will now be made in detail to the embodiments of the present general inventive concept, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the like elements throughout. The em¬ bodiments are described below in order to explain the present general inventive concept while referring to the figures.
[27] Instead of viewing a full omni-directional image acquired by an omni-directional camera system at one time, a user may want to make a viewpoint transition based on information regarding the full omni-directional image and closely and partially observe a region-of-interest in the full omni-directional image. To meet the user's demands, embodiments of the present general inventive concept provide a method and apparatus to transmit a portion of the omni-directional image other than the user's region- of-interest to the user's terminal using a minimum bandwidth and transmitting an image of the region-of-interest at a high resolution. In other words, a full panorama image is provided to a decoding apparatus at a low definition to provide the user a rough view of the panorama image. Thereafter, when the user selects a region- of-interest in the full panorama image and transmits position information of the region- of-interest into an encoding apparatus, a high-resolution image of the region-of-interest is provided to the decoding apparatus.
[28] FlG. 2 is a block diagram illustrating an image encoding apparatus according to an embodiment of the present general inventive concept. Referring to FlG. 2, the image encoding apparatus includes a first encoder 210, a data communicator 220, a first conversion unit 240, a second conversion unit 230, a region-of-interest selector 250, a subtractor 260, and a second encoder 270.
[29] An omni-directional image photographed by an omni-directional camera system
(not shown) is input to the first encoder 210 and the first conversion unit 240. According to the embodiment illustrated in FlG. 2, the omni-directional image may be an annular image, but is not limited thereto.
[30] The omni-directional camera system may be a camera system including a special lens or a combination of a mirror and a lens, and can photograph up to a 360 ° omni¬ directional view from a single viewpoint. Sony's TVR-900 and HDW F900 are examples of such an omni-directional camera system. The TVR-900 can photograph a 180 ° view and the HDW F900 can photograph a 360 ° view. Alternatively, the omni¬ directional camera system can obtain an omni-directional image using a plurality of cameras. An annular image is an image that is photographed after being reflected from a mirror in a mirror-based omni-directional camera system and implies a 360 ° omni¬ directional image.
[31] The first encoder 210 receives and encodes the annular image using a pre¬ determined method to generate an annular image bitstream. An encoder complying with a Motion Picture Experts Group (MPEG)-4 Part 2 standard or an H.264 (or MPEG-4 Part 10 AVC) standard may be used as the first encoder 210. However, the present general inventive concept is not restricted thereto, and an encoder modified to be suitable to an annular image may alternatively be used as the first encoder 210.
[32] An annular image bitstream generated by the first encoder 210 is transmitted via the data communicator 220 to an image decoding apparatus, such as the image decoding apparatus illustrated in FIG. 3. The image decoding apparatus decodes the annular image bitstream to obtain a reconstructed annular image, converts the re¬ constructed annular image into a panorama image, and displays the panorama image through a panorama image display unit 330 (see FIG. 3). The first encoder 210 also generates a reconstructed annular image suitable to a particular bandwidth based on the annular image bitstream and stores it in a reconstructed annular image buffer (not shown). The first encoder 210 generates the reconstructed annular image by decoding the annular image bitstream generated therein, and therefore the first encoder 210 has a decoding capability as well as an encoding capability. The reconstructed annular image generated by the first encoder 210 is input to the second conversion unit 230.
[33] The first conversion unit 240 includes a first annular-to-panorama converter (APC)
241 and a first panorama-to-perspective image converter (PPIC) 243. The second conversion unit 230 includes a second APC 231 and a second PPIC 233.
[34] The first conversion unit 240 and the second conversion unit 230 respectively convert the original annular image and the reconstructed annular image into a pre¬ determined image format. The first APC 241 and the second APC 231 convert the original annular image and the reconstructed annular image into first and second panorama images, respectively. Cartographical projection and polygonal projection are methods that can be used by the first APC 241 and the second APC 231 to covert the original annular image and the reconstructed annular image into two-dimensional (2D) images (i.e. the first and second panorama images). The first PPIC 243 and the second PPIC 233 convert the first and second panorama images into first and second perspective images, respectively. Parallel projection and perspective projection are methods that can be used by the first PPIC 243 and the second PPIC 237 to convert the first and second panorama images into the first and second perspective images.
[35] The region-of-interest selector 250 receives position information of a region- of-interest selected by a user from an image decoding apparatus, such as the image decoding apparatus illustrated in FlG. 3, and controls the first PPIC 243 and the second PPIC 233 to output the first and second perspective images corresponding to the region-of-interest.
[36] The subtractor 260 outputs an error image between the first perspective image output from the first PPIC 243 and the second perspective image output from the second PPIC 233 to the second encoder 270. The second encoder 270 encodes the error image using a predetermined method to generate a perspective image bitstream to be transmitted to the image decoding apparatus. An encoder complying with the MPEG-4 Part 2 standard or the H.264 (or MPEG-4 Part 10 AVC) standard may be used as the second encoder 270, but the present general inventive concept is not restricted thereto.
[37] FIG. 4 is a flowchart illustrating a method of encoding an image according to an embodiment of the present general inventive concept. Referring to FIGS. 2 through 4, at operation S410, an omni-directional annular image is generated by an omni-di- rectional camera system (not shown). The omni-directional annular image is input to the first encoder 210. At operation S420, the omni-directional annular image is encoded by the first encoder 210 using a predetermined encoding method, such as the MPEG-4 Part 2 or the H.264, and thus an annular image bitstream, i.e., a first bitstream, is generated. At operation S430, the annular image bitstream (first bitstream) is transmitted via the data communicator 220 to the image decoding apparatus, as il¬ lustrated in FIG. 3, over a predetermined channel.
[38] The image decoding apparatus decodes the received annular image bitstream (first bitstream) to obtain a reconstructed annular image, converts the reconstructed annular image into a panorama image, and displays the panorama image through a panorama image display unit, such as the panorama image display unit 330 of FIG. 3 to be described in more detail infra. The picture quality of the panorama image displayed through the panorama image display unit 330 cannot be guaranteed, but a user can view a full image through the panorama image display unit 330. The user may input a command to select a region-of-interest, which the user wants to view more closely, in the full image displayed by the panorama image display unit 330, using a user interface (UI) 340 (see FIG. 3). [39] Position information of the region-of-interest is output from the UI 340 and transmitted through a data communicator 350 (see FlG. 3) to the image encoding apparatus of FlG. 2. Then, at operation S440, the position information of the region- of-interest is received by the region-of-interest selector 250 of the image encoding apparatus. The region-of-interest selector 250 controls the first PPIC 243 and the second PPIC 233 to output images corresponding to the region-of-interest according to the position information of the region-of-interest. The first PPIC 243 extracts an image corresponding to the region-of-interest from a first panorama image output from the first APC 241 and converts the extracted image into a first perspective image. Similarly, the second PPIC 233 extracts an image corresponding to the region- of-interest from a second panorama image output from the second APC 231 and converts the extracted image into a second perspective image.
[40] The first perspective image output from the first PPIC 243 is a result of converting an original annular image into the first panorama image and then converting the first panorama image into the first perspective image. The second perspective image output from the second PPIC 233 is a result of converting a reconstructed annular image output from the first encoder 210 into the second panorama image and then converting the second panorama image into the second perspective image. In other words, the first perspective image output from the first PPIC 243 may be referred to as an original region-of-interest image and the output image from the second PPIC 233 may be referred to as a reconstructed region-of-interest image.
[41] The subtractor 260 outputs an error image between the original region-of-interest image and the reconstructed region-of-interest image to the second encoder 270. When encoding the error image between the original region-of-interest image and the re¬ constructed region-of-interest image, an amount of transmission data can be reduced as compared to encoding all of the original region-of-interest image. At operation S450, the second encoder 270 encodes the error image using a predetermined encoding method, such as the MPEG-4 Part 2 or the H.264, to generate a perspective image bitstream, i.e. a second bitstream, to be transmitted to the image decoding apparatus. At operation S460, the perspective image bitstream (second bitstream) is transmitted to the image decoding apparatus. As a result, the user can view a high-resolution image of the region-of-interest.
[42] As briefly described above, FlG. 3 illustrates an image decoding apparatus according to an embodiment of the present general inventive concept. The embodiment of FlG. 3 will be described in more detail below. Referring to FlG. 3, the image decoding apparatus according to the present embodiment includes a first decoder 310, a conversion unit 320, the UI 340, the data communicator 350, a region-of-interest selector 360, a second decoder 370, and a mixer 380. [43] The image decoding apparatus of FlG. 3 receives an annular image bitstream and a perspective image bitstream from an image encoding apparatus, such as the image encoding apparatus of FlG. 2, and displays a full panorama image through the panorama image display unit 330 and a perspective image of a region-of-interest through a perspective image display unit (not shown).
[44] The first decoder 310 receives and decodes the annular image bitstream generated by encoding an annular image. The conversion unit 320 includes an APC 321 and a PPIC 323. The conversion unit 320 receives a reconstructed annular image output from the first decoder 310 and converts the reconstructed annular image into a pre¬ determined image format. The APC 321 converts the reconstructed annular image into a panorama image. The PPIC 323 converts the panorama image into a perspective image.
[45] The UI 340 receives a command input by a user. The data communicator 350 performs data communication with an image encoding apparatus, such as the image encoding apparatus of FlG. 2. The region-of-interest selector 360 receives position in¬ formation of a region-of-interest selected by the user from the UI 340 and controls the PPIC 323 to output an image of the region-of-interest.
[46] The mixer 380 mixes an output image from the second decoder 370 and the image output from the PPIC 323 to generate a perspective image to be displayed by the perspective image display unit. As described above, since the perspective image bitstream is not the result of encoding an original region-of-interest image but the result of encoding an error image between the original region-of-interest image and a reconstructed region-of-interest image, the output image from the second decoder 370 is the error image between the original region-of-interest image and the reconstructed region-of-interest image. Accordingly, a complete perspective image can be obtained by mixing the error image, i.e., the output image from the second decoder 370 and the reconstructed region-of-interest image, i.e., the output image from the PPIC 323.
[47] FlG. 5 is a flowchart illustrating a method of decoding an image according to an embodiment of the present general inventive concept. Referring to FIGS. 3 and 5, at operation S510, an annular image bitstream, i.e., a first bitstream generated by encoding an omni-directional annular image, is received through the data com¬ municator 350. At operation S520, the annular image bitstream (first bitstream) is decoded by the first decoder 310 and then converted into a panorama image by the APC 321, and the panorama image is displayed by the panorama image display unit 330.
[48] Then, a user can view the panorama image that is not a high-resolution image but provides an omni-directional image through the panorama image display unit 330. The user may input a command to select a region-of-interest, which the user wants to view more closely in the omni-directional image displayed by the panorama image display unit 330, using the UI 340. At operation S530, position information of the region- of-interest is output from the UI 340 and received by the region-of-interest selector 360. At operation S540, the position information of the region-of-interest output from the UI 340 is transmitted through the data communicator 350 to an image encoding apparatus, such as the image encoding apparatus illustrated in FTG. 2. The image encoding apparatus generates a perspective image bitstream including high-resolution perspective image data corresponding to the region-of-interest based on the position in¬ formation of the region-of-interest and transmits the perspective image bitstream to the image decoding apparatus of FTG. 3.
[49] At operation 550, the perspective image bitstream, i.e., a second bitstream, is received through the data communicator 350 and input to the second decoder 370. At operation S560, the perspective image bitstream (second bitstream) is decoded by the second decoder 370 and then output to the mixer 380. The mixer 380 mixes an output image from the second decoder 370 and an output image from the PPIC 323, thereby generating a perspective image of the region-of-interest. The perspective image of the region-of-interest is displayed by the perspective image display unit (not shown).
[50] FTG. 6 is a block diagram illustrating an image encoding apparatus according to another embodiment of the present general inventive concept. The image encoding apparatus of FIG. 6 has a similar structure to the image encoding apparatus of FTG. 2, with the exception that the image encoding apparatus of FTG. 6 further includes a down-sampler 205 and an up-sampler 215 to provide spatial scalability.
[51] FTG. 7 is a block diagram illustrating an image decoding apparatus according to another embodiment of the present general inventive concept. The image decoding apparatus of FIG. 7 is provided to correspond to the image encoding apparatus of FTG. 6 and has a similar structure to the image decoding apparatus of FIG. 3, with the exception that the image decoding apparatus of FTG. 7 further includes an up-sampler 315 to up-sample an output image from the first decoder 310 in order to correspond to the image encoding apparatus of FIG. 6, which provides spatial scalability.
[52] The present general inventive concept can also be embodied as computer readable codes on a computer readable recording medium. The computer readable recording medium can be any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, optical data storage devices, and carrier waves (such as data transmission through the Internet). The computer readable recording medium can also be distributed over network coupled computer systems such that the computer readable code is stored and executed in a distributed fashion. [53] As described above, according to the embodiments of the present general inventive concept, a rough omni-directional image is transmitted first to a decoding apparatus through a channel having a restricted bandwidth and then a high-resolution image of a region-of-interest selected by a user from the omni-directional image is provided to the user.
[54] Although a few embodiments of the present general inventive concept have been shown and described, it will be appreciated by those skilled in the art that changes may be made in these embodiments without departing from the principles and spirit of the general inventive concept, the scope of which is defined in the appended claims and their equivalents.

Claims

Claims
[ 1 ] L A method of encoding an image, comprising : generating a first bitstream by encoding an omni-directional image and transmitting the first bitstream to a decoding apparatus; receiving position information of a region-of-interest selected from an image re¬ constructed based on the first bitstream from the decoding apparatus; and generating a second bitstream by encoding an image of the region-of-interest based on the position information.
[2] 2. The method of claim 1, wherein the generating of the second bitstream comprises: obtaining a first image corresponding to the region-of-interest from the omni¬ directional image; obtaining a reconstructed omni-directional image by decoding the first bitstream; obtaining a second image corresponding to the region-of-interest from the re¬ constructed omni-directional image; and obtaining an error image between the first image and the second image and encoding the error image to generate the second bitstream.
[3] 3. The method of claim 2, wherein the obtaining of the first image comprises: converting the omni-directional image into a first panorama image, and obtaining the first image from the first panorama image, and the obtaining of the second image comprises: converting the reconstructed omni-directional image into a second panorama image, and obtaining the second image from the second panorama image.
[4] 4. The method of claim 3, wherein the obtaining of the first image from the first panorama image comprises: selecting the region-of-interest from the first panorama image, and converting the region-of-interest selected from the first panorama image into perspective image to obtain the first image, and the obtaining of the second image from the second panorama image comprises: selecting the region-of-interest from the second panorama image, and converting the region-of-interest selected from the second panorama image into a perspective image to obtain the second image.
[5] 5. The method of claim 2, further comprising: down-sampling the omni-directional image before generating the first bitstream, wherein the obtaining of the reconstructed omni-directional image comprises performing up-sampling corresponding to the down-sampling.
[6] 6. A method of encoding an image, comprising: encoding an input omni-directional image and outputting the encoded omni¬ directional image at a first resolution; and encoding an image corresponding to a region of the input omni-directional image determined according to input position information and outputting the encoded image corresponding to the region of the input omni-directional image at a second resolution.
[7] 7. The method of claim 6, wherein the second resolution is higher than the first resolution.
[8] 8. The method of claim 6, wherein the encoding of the image corresponding to the region of the input omni-directional image comprises: decoding the encoded omni-directional image output at the first resolution; generating a first perspective image of a region of the decoded omni-directional image according to the input position information; generating a second perspective image of the region of the input omni-directional image according to the input position information; calculating an error image between the second perspective image and the first perspective image; and encoding the calculated error image.
[9] 9. An apparatus to encode an image, comprising: a first encoder to encode an omni-directional image to generate a first bitstream; a data communicator to transmit the first bitstream to a decoding apparatus and to receive position information of a region-of-interest selected from an image re¬ constructed based on the first bitstream from the decoding apparatus; and a second encoder to encode an image of the region-of-interest based on the position information to generate a second bitstream.
[10] 10. The apparatus of claim 9, further comprising: a region-of-interest selector to receive the position information of the region- of-interest and to output a region selection control signal; a first conversion unit to output a first image corresponding to the region- of-interest in the omni-directional image in response to the region selection control signal; a second conversion unit to output a second image corresponding to the region- of-interest in a reconstructed omni-directional image, which is generated by the first encoder by decoding the first bitstream, in response to the region selection control signal; and a subtractor to output an error image between the first image and the second image to the second encoder as the image of the region-of-interest to be encoded by the second encoder.
[11] 11. The apparatus of claim 10, wherein the first conversion unit comprises: a first panorama image generator to convert the omni-directional image into a first panorama image and to output the first panorama image, and a first perspective image generator to convert a portion corresponding to the region-of-interest in the first panorama image into a first perspective image and to output the first perspective image, and the second conversion unit comprises: a second panorama image generator to convert the reconstructed omni-directional image into a second panorama image and to output the second panorama image, and a second perspective image generator to convert a portion corresponding to the region-of-interest in the second panorama image into a second perspective image and to output the second perspective image.
[12] 12. The apparatus of claim 10, further comprising: a down-sampler to down-sample the omni-directional image and to output the result of the down-sampling to the first encoder; and an up-sampler to perform up-sampling corresponding to the down-sampling with respect to the reconstructed omni-directional image generated by the first encoder and to output the result of the up-sampling.
[13] 13. An image encoding apparatus, comprising: a first encoding unit to encode an input omni-directional image and to decode the encoded input omni-directional image; a region-of-interest unit to receive input position information of a selected region of interest of the input omni-directional image and to generate first and second perspective images of the selected region of interest in the input omni-directional image and the decoded omni-directional image, respectively, according to the received position information; and a second encoding unit to encode an error image between the first and second perspective images of the selected region of interest.
[14] 14. The image encoding apparatus of claim 13, wherein the second encoding unit transmits the encoded error image at a higher resolution than the first encoding unit transmits the encoded omni-directional image.
[15] 15. The image encoding apparatus of claim 13, wherein the region-of-interest unit comprises: a first conversion unit to generate the first perspective image of the selected region in the input omni-directional image; a second conversion unit to generate the perspective second image of the selected region in the decoded omni-directional image; and a region-of-interest selector to receive the input position information of the selected region of interest and to control the first and second conversion units according to the received position information.
[16] 16. The image encoding apparatus of claim 15, wherein the first conversion unit comprises a first omni-to-panorama converter to convert the input omni-di¬ rectional image into a first panorama image and a first panorama-to-perspective image converter to convert a portion of the first panorama image into the first perspective image according to the input position information, and the second conversion unit comprises a second omni-to-panorama converter to convert the decoded omni-directional image into a second panorama image and a second panorama-to-perspective image converter to convert a portion of the second panorama image into the second perspective image according to the input position information.
[17] 17. The image encoding apparatus of claim 13, further comprising: a data communicator to transmit the encoded omni-directional image and the encoded error image to an external decoding apparatus and to receive the input position information of the selected region-of-interest from the external decoding apparatus and transmit the received position information to the region-of-interest unit.
[18] 18. The image encoding apparatus of claim 13, wherein the region-of-interest is selected from the decoded omni-directional image.
[19] 19. A method of decoding an image, comprising : receiving a first bitstream generated by encoding an omni-directional image from an encoding apparatus; decoding the first bitstream and displaying a reconstructed image; transmitting position information of a region-of-interest selected from the displayed reconstructed image to the encoding apparatus; receiving a second bitstream generated by encoding an image of the region- of-interest from the encoding apparatus; and decoding the second bitstream.
[20] 20. The method of claim 19, wherein the decoding of the second bitstream comprises: obtaining a first image corresponding to the region-of-interest by decoding the second bitstream; obtaining a second image corresponding to the region-of-interest from the re- constructed image; and generating a combined perspective image corresponding to the region-of-interest by mixing the first image and the second image.
[21] 21. The method of claim 20, wherein the obtaining of the second image comprises: converting the reconstructed image into a panorama image; selecting the region-of-interest from the panorama image; and converting the region-of-interest into a perspective image to obtain the second image.
[22] 22. The method of claim 20, further comprising, when down-sampling is performed when the encoding apparatus encodes the omni-directional image, performing up-sampling corresponding to the down-sampling with respect to the reconstructed omni-directional image.
[23] 23. A method of viewing an encoded image, comprising: decoding an input encoded omni-directional image and displaying the decoded omni-directional image; selecting a region-of interest in the decoded omni-directional image and outputting position information of the selected region-of-interest; decoding an input encoded error image and combining the decoded error image with an image of the selected region-of-interest in the decoded omni-directional image to form a combined region-of-interest image; and displaying the combined region-of-interest image.
[24] 24. The method of claim 23, wherein the input encoded error image is generated according to the output position information of the selected region of interest.
[25] 25. The method of claim 23, wherein the combining of the decoded error image with the image of the selected region-of-interest in the decoded omni-directional image comprises: converting the decoded omni-directional image into a panorama image; and converting a potion of the panorama image into a perspective image according to the selected region-of-interest.
[26] 26. An apparatus to decode an image, comprising: a first decoder to receive a first bitstream generated by encoding an omni¬ directional image from an encoding apparatus and to decode the first bitstream to generate a reconstructed omni-directional image; a first display unit to display the reconstructed omni-directional image decoded by the first decoder; a data communicator to transmit position information of a region-of-interest selected from the reconstructed omni-directional image displayed by the first display unit to the encoding apparatus; and a second decoder to receive a second bitstream generated by encoding an image of the region-of-interest from the encoding apparatus and to decode the second bitstream.
[27] 27. The apparatus of claim 26, further comprising: a region-of-interest selector to receive the position information of the selected region-of-interest and to output a region selection control signal; a conversion unit to output an image corresponding to the region-of-interest in the reconstructed omni-directional image in response to the region selection control signal; and a mixer to mix an image output from the second decoder and the image output from the conversion unit.
[28] 28. The apparatus of claim 27, wherein the conversion unit comprises: a panorama image generator to convert the reconstructed omni-directional image decoded by the first decoder into a panorama image; and a perspective image generator to convert a portion corresponding to the region- of-interest in the panorama image into a perspective image in response to the region selection control signal and to output the perspective image to the mixer.
[29] 29. The apparatus of claim 27, further comprising: an up-sampler to perform up-sampling corresponding to down-sampling performed when the encoding apparatus encodes the omni-directional image with respect to the reconstructed omni-directional image.
[30] 30. An image decoding apparatus, comprising: a first decoding unit to decode an input encoded omni-directional image; a region-of-interest unit to select a region-of-interest in the decoded omni¬ directional image, to output position information of the selected region of interest, and to generate a perspective image of the selected region-of-interest in the decoded omni-directional image; a second decoding unit to decode an input error image; and a calculating unit to combine the perspective image of the selected region- of-interest in the decoded omni-directional image and the decoded error image to form a combined region-of-interest image.
[31] 31. The image decoding apparatus of claim 30, wherein the region-of-interest unit comprises: a display to display the decoded omni-directional image; a user interface to allow a user to select the region-of-interest according to the displayed omni-directional image; and a conversion unit to generate the perspective image of the selected region- of-interest in the decoded omni-directional image.
[32] 32. The image decoding apparatus of claim 31, wherein the conversion unit comprises: an omni-to-panorama converter to convert the decoded omni-directional image into a panorama image and output the panorama image to the display to be displayed thereon; and a panorama-to-perspective image converter to convert a portion of the panorama image corresponding to the selected region-of-interest to the perspective image of the selected region-of-interest.
[33] 33. The image decoding apparatus of claim 30, further comprising: one or more displays to display the decoded omni-directional image and the combined region-of-interest image.
[34] 34. An image encoding and decoding apparatus, comprising: an encoding unit to encode an input omni-directional image, to transmit the encoded omni-directional image, to receive position information of a region- of-interest of the input omni-direction image, to encode an error image cor¬ responding to a portion of the input omni-directional image determined according to the received position information, and to transmit the encoded error signal; and a decoding unit to receive and decode the encoded omni-directional image, to select the region-of-interest based on the decoded omni-directional image, to transmit the position information of the region-of-interest to the encoding unit, to receive and decode the encoded error image, and to combine the decoded error image with an image of a portion of the decoded omni-directional image determined according to the position information to form a combined image of the selected region of interest.
[35] 35. A computer readable recording medium to record a program to implement a method of encoding an image, the method comprising: generating a first bitstream by encoding an omni-directional image and transmitting the first bitstream to a decoding apparatus; receiving position information of a region-of-interest selected from an image re¬ constructed based on the first bitstream from the decoding apparatus; and generating a second bitstream by encoding an image of the region-of-interest based on the position information.
[36] 36. A computer readable recording medium to record a program to implement a method of decoding an image, the method comprising: receiving a first bitstream generated by encoding an omni-directional image from and encoding apparatus; decoding the first bitstream and displaying a reconstructed image; transmitting position information of a region-of-interest selected from the re¬ constructed image to the encoding apparatus; receiving a second bitstream generated by encoding an image of the region- of-interest from the encoding apparatus; and decoding the second bitstream.
[37] 37. A computer readable recording medium to record a program to implement a method of encoding an image, the method comprising: encoding an input omni-directional image and outputting the encoded omni¬ directional image at a first resolution; and encoding an image corresponding to a region of the input omni-directional image determined according to input position information and outputting the encoded image corresponding to the region of the input omni-directional image at a second resolution.
[38] 38. A computer readable recording medium to record a program to implement a method of encoding an image, the method comprising: encoding an input omni-directional image and outputting the encoded omni¬ directional image; decoding the encoded omni-directional image; and encoding and outputting an error image between a region of the input omni-directional image and a cor¬ responding region of the decoded omni-directional image, the region determined according to input region-of interest information.
[39] 39. A computer readable recording medium to record a program to implement a method of viewing an encoded image, the method comprising: decoding an input encoded omni-directional image and displaying the decoded omni-directional image; selecting a region-of interest in the decoded omni-directional image and outputting position information of the selected region-of-interest; decoding an input encoded error image and combining the decoded error image with an image of the selected region-of-interest in the decoded omni-directional image to form a combined region-of-interest image; and displaying the combined region-of-interest image.
PCT/KR2005/002638 2004-08-13 2005-08-12 Method and apparatus to encode image, and method and apparatus to decode image data WO2006016782A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN2005800269727A CN101002471B (en) 2004-08-13 2005-08-12 Method and apparatus to encode image, and method and apparatus to decode image data
EP05780541A EP1782632A1 (en) 2004-08-13 2005-08-12 Method and apparatus to encode image, and method and apparatus to decode image data
JP2007525547A JP2008510357A (en) 2004-08-13 2005-08-12 Image encoding method, encoding device, image decoding method, and decoding device

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US60114704P 2004-08-13 2004-08-13
US60/601,147 2004-08-13
KR10-2004-0075972 2004-09-22
KR1020040075972A KR100739686B1 (en) 2004-08-13 2004-09-22 Method and apparatus for coding image, method and apparatus for decoding image data

Publications (1)

Publication Number Publication Date
WO2006016782A1 true WO2006016782A1 (en) 2006-02-16

Family

ID=35839504

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2005/002638 WO2006016782A1 (en) 2004-08-13 2005-08-12 Method and apparatus to encode image, and method and apparatus to decode image data

Country Status (3)

Country Link
EP (1) EP1782632A1 (en)
JP (1) JP2008510357A (en)
WO (1) WO2006016782A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2016220229A (en) * 2010-10-01 2016-12-22 サターン ライセンシング エルエルシーSaturn Licensing LLC Content reproduction device, content reproduction method and program
CN106911902A (en) * 2017-03-15 2017-06-30 微鲸科技有限公司 Video image transmission method, method of reseptance and device
EP3634005A1 (en) * 2018-10-05 2020-04-08 Nokia Technologies Oy Client device and method for receiving and rendering video content and server device and method for streaming video content
US10743000B2 (en) 2016-07-01 2020-08-11 Sk Telecom Co., Ltd. Video bitstream generation method and device for high-resolution video streaming

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014005297A1 (en) * 2012-07-04 2014-01-09 Intel Corporation Panorama based 3d video coding
KR101430985B1 (en) * 2013-02-20 2014-09-18 주식회사 카몬 System and Method on Providing Multi-Dimensional Content
US9466090B2 (en) 2013-06-20 2016-10-11 Intel Corporation Subset based compression and decompression of graphics data
KR101821352B1 (en) * 2016-03-30 2018-01-25 (주)씨소 Records the location of the event system of omnidirectional camera and method using the same
US10387991B2 (en) 2016-07-01 2019-08-20 Intel Corporation Method and apparatus for frame buffer compression

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6014671A (en) * 1998-04-14 2000-01-11 International Business Machines Corporation Interactive retrieval and caching of multi-dimensional data using view elements
US6121970A (en) * 1997-11-26 2000-09-19 Mgi Software Corporation Method and system for HTML-driven interactive image client
US6157747A (en) * 1997-08-01 2000-12-05 Microsoft Corporation 3-dimensional image rotation method and apparatus for producing image mosaics
US6192393B1 (en) * 1998-04-07 2001-02-20 Mgi Software Corporation Method and system for panorama viewing
US6272235B1 (en) * 1997-03-03 2001-08-07 Bacus Research Laboratories, Inc. Method and apparatus for creating a virtual microscope slide
KR20020007945A (en) * 2000-08-24 2002-01-29 양윤원 Enlarged Digital Image Providing Method and Apparatus Using Data Communication Networks
EP1286308A1 (en) * 2001-08-14 2003-02-26 Koninklijke Philips Electronics N.V. Panoramic video editing visualisation by application of navigation control to said panoramic video

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07284095A (en) * 1994-04-12 1995-10-27 Nippon Telegr & Teleph Corp <Ntt> View point adaptive-type image transmitting method and image transmitting device
JPH07288806A (en) * 1994-04-20 1995-10-31 Hitachi Ltd Moving image communication system
US6043837A (en) * 1997-05-08 2000-03-28 Be Here Corporation Method and apparatus for electronically distributing images from a panoptic camera system
JPH11205772A (en) * 1998-01-16 1999-07-30 Matsushita Joho System Kk Omnidirectionally picked-up image sending system and its method
JP2000270297A (en) * 1999-03-12 2000-09-29 Toshiba Video Products Japan Kk Monitor camera system having digital video recording and reproducing function
JP2002312778A (en) * 2001-04-09 2002-10-25 Be Here Corp Method and device for electronically distributing motion panoramic image
JP4146701B2 (en) * 2002-10-09 2008-09-10 松下電器産業株式会社 Moving picture coding method and moving picture coding apparatus
JP2004153605A (en) * 2002-10-31 2004-05-27 Victor Co Of Japan Ltd Image pickup device and system for transmitting pick-up image

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6272235B1 (en) * 1997-03-03 2001-08-07 Bacus Research Laboratories, Inc. Method and apparatus for creating a virtual microscope slide
US6157747A (en) * 1997-08-01 2000-12-05 Microsoft Corporation 3-dimensional image rotation method and apparatus for producing image mosaics
US6121970A (en) * 1997-11-26 2000-09-19 Mgi Software Corporation Method and system for HTML-driven interactive image client
US6192393B1 (en) * 1998-04-07 2001-02-20 Mgi Software Corporation Method and system for panorama viewing
US6014671A (en) * 1998-04-14 2000-01-11 International Business Machines Corporation Interactive retrieval and caching of multi-dimensional data using view elements
KR20020007945A (en) * 2000-08-24 2002-01-29 양윤원 Enlarged Digital Image Providing Method and Apparatus Using Data Communication Networks
EP1286308A1 (en) * 2001-08-14 2003-02-26 Koninklijke Philips Electronics N.V. Panoramic video editing visualisation by application of navigation control to said panoramic video

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2016220229A (en) * 2010-10-01 2016-12-22 サターン ライセンシング エルエルシーSaturn Licensing LLC Content reproduction device, content reproduction method and program
US10743000B2 (en) 2016-07-01 2020-08-11 Sk Telecom Co., Ltd. Video bitstream generation method and device for high-resolution video streaming
US10893278B2 (en) 2016-07-01 2021-01-12 Sk Telecom Co., Ltd. Video bitstream generation method and device for high-resolution video streaming
CN106911902A (en) * 2017-03-15 2017-06-30 微鲸科技有限公司 Video image transmission method, method of reseptance and device
CN106911902B (en) * 2017-03-15 2020-01-07 微鲸科技有限公司 Video image transmission method, receiving method and device
EP3634005A1 (en) * 2018-10-05 2020-04-08 Nokia Technologies Oy Client device and method for receiving and rendering video content and server device and method for streaming video content

Also Published As

Publication number Publication date
EP1782632A1 (en) 2007-05-09
JP2008510357A (en) 2008-04-03

Similar Documents

Publication Publication Date Title
US8217988B2 (en) Method and apparatus to encode image, and method and apparatus to decode image data
WO2006016782A1 (en) Method and apparatus to encode image, and method and apparatus to decode image data
US10600233B2 (en) Parameterizing 3D scenes for volumetric viewing
US10341632B2 (en) Spatial random access enabled video system with a three-dimensional viewing volume
US10419737B2 (en) Data structures and delivery methods for expediting virtual reality playback
US11057646B2 (en) Image processor and image processing method
US20180097867A1 (en) Video compression with adaptive view-dependent lighting removal
US20180035134A1 (en) Encoding and decoding virtual reality video
JP5241500B2 (en) Multi-view video encoding and decoding apparatus and method using camera parameters, and recording medium on which a program for performing the method is recorded
CN100586178C (en) Apparatus and method for transmitting and receiving image data
JP5337237B2 (en) Multipoint conference control method and apparatus
WO2016140060A1 (en) Image processing device and image processing method
US10244215B2 (en) Re-projecting flat projections of pictures of panoramic video for rendering by application
US20040086186A1 (en) Information providing system and method, information supplying apparatus and method, recording medium, and program
US20120162367A1 (en) Apparatus and method for converting image display mode
JP5700703B2 (en) Video decoding apparatus, video transmission / reception system, video decoding method, and video transmission / reception method
JP2011521570A5 (en)
CN110121065B (en) Multi-directional image processing in spatially ordered video coding applications
EP3434021B1 (en) Method, apparatus and stream of formatting an immersive video for legacy and immersive rendering devices
KR20120133006A (en) System and method for providing a service to streaming IPTV panorama image
JP2013128260A (en) Image encoder, image decoder, image encoding method and image decoding method
Peixoto et al. Progressive communication for interactive light field image data streaming
JP2009296135A (en) Video monitoring system
Cheung et al. Coding for Interactive Navigation in High‐Dimensional Media Data
WO2023199076A1 (en) Extended reality encoding

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DPEN Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2007525547

Country of ref document: JP

Ref document number: 200580026972.7

Country of ref document: CN

WWE Wipo information: entry into national phase

Ref document number: 2005780541

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 273/MUMNP/2007

Country of ref document: IN

WWP Wipo information: published in national office

Ref document number: 2005780541

Country of ref document: EP