WO2007057497A1 - Method and devices for generating, transferring and processing three-dimensional image data - Google Patents

Method and devices for generating, transferring and processing three-dimensional image data Download PDF

Info

Publication number
WO2007057497A1
WO2007057497A1 PCT/FI2005/000491 FI2005000491W WO2007057497A1 WO 2007057497 A1 WO2007057497 A1 WO 2007057497A1 FI 2005000491 W FI2005000491 W FI 2005000491W WO 2007057497 A1 WO2007057497 A1 WO 2007057497A1
Authority
WO
WIPO (PCT)
Prior art keywords
raw image
minimum
maximum
location
disparity
Prior art date
Application number
PCT/FI2005/000491
Other languages
French (fr)
Inventor
Lachlan Pockett
Original Assignee
Nokia Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Corporation filed Critical Nokia Corporation
Priority to PCT/FI2005/000491 priority Critical patent/WO2007057497A1/en
Priority to KR1020087014578A priority patent/KR100988894B1/en
Priority to JP2008540634A priority patent/JP2009516447A/en
Priority to EP05813919A priority patent/EP1952199B1/en
Priority to US12/085,124 priority patent/US8619121B2/en
Priority to TW095137888A priority patent/TW200745733A/en
Publication of WO2007057497A1 publication Critical patent/WO2007057497A1/en

Links

Classifications

    • GPHYSICS
    • G03PHOTOGRAPHY; CINEMATOGRAPHY; ANALOGOUS TECHNIQUES USING WAVES OTHER THAN OPTICAL WAVES; ELECTROGRAPHY; HOLOGRAPHY
    • G03BAPPARATUS OR ARRANGEMENTS FOR TAKING PHOTOGRAPHS OR FOR PROJECTING OR VIEWING THEM; APPARATUS OR ARRANGEMENTS EMPLOYING ANALOGOUS TECHNIQUES USING WAVES OTHER THAN OPTICAL WAVES; ACCESSORIES THEREFOR
    • G03B35/00Stereoscopic photography
    • G03B35/08Stereoscopic photography by simultaneous recording
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • H04N13/161Encoding, multiplexing or demultiplexing different image signal components
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/194Transmission of image signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20Image signal generators
    • H04N13/204Image signal generators using stereoscopic image cameras
    • H04N13/239Image signal generators using stereoscopic image cameras using two 2D image sensors having a relative position equal to or related to the interocular distance
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2213/00Details of stereoscopic systems
    • H04N2213/003Aspects relating to the "2D+depth" image format

Definitions

  • the invention concerns generally the technology of obtaining, transferring and outputting three-dimensional image data. Especially the invention concerns the problem of transferring three-dimensional image data in a form that allows it to be displayed with any display device.
  • the visual system of the brain produces a perception of three-dimensionality by combining the two slightly different images coming from the eyes.
  • An image displayed on a two-dimensional display screen can give rise to the same perception without the need of special viewing glasses or the like, if the display screen is autostereoscopic, i.e. in itself capable of emitting slightly different information to the right and left eye of the viewer.
  • the two autostereoscopic display technologies that are most widely used for this purpose at the time of writing this specification are known as the parallax barrier principle and the lenticular principle, although also other approaches are known as well.
  • Fig. 1 is a simple schematic example of a known parallax barrier type liquid crystal display.
  • a liquid crystal layer 101 comprises right-eye subpixels and left-eye sub- pixels marked with R and L respectively.
  • a backlighting layer 102 emits light from behind the liquid crystal display.
  • a parallax barrier layer 103 contains slits that only allow light to propagate through the right-eye subpixels to the right eye of the viewer and through the left-eye subpixels to the left eye of the viewer. It is also possible to have the parallax barrier layer 103 in front of the liquid crystal layer 101 instead of between it and the backlighting layer 102.
  • Fig. 2 is a simple schematic example of a known lenticular type liquid crystal dis- play.
  • the liquid crystal layer 201 comprises right-eye subpixels and left- eye subpixels.
  • the backlighting layer 202 emits light through the liquid crystal layer 201.
  • a layer 203 of lenticulars, i.e. cylindrical lenses, collimates the light so that light rays coming through a right-eye subpixel continue parallelly towards the right eye of the viewer and light rays coming through a left-eye subpixel continue parallelly towards the left eye of the viewer.
  • Fig. 3 illustrates schematically a known principle for generating three-dimensional image information of a group of imaged objects.
  • Two horizontally separated cameras 301 and 302 take pictures at the same time but otherwise independently of each other, resulting in two so-called raw images 303 and 304 respectively. Images of the objects appear at different locations in the raw images, because the cameras 301 and 302 see the imaged objects from different directions. It should be noted, though, that the differences in the raw images appear in highly exaggerated proportion in fig. 3 compared to most practical solutions, because for reasons of making fig. 3 graphically clear the imaged objects are drawn very close to the camera arrangement. Together the raw images 303 and 304 constitute a stereograph that could be displayed using any suitable display technology, including but not being limited to those illustrated in figs. 1 and 2.
  • a US patent publication number U S 2004/0218269 A1 discloses a 3D Data Formatter, which acts as a format converter between various known interlacing techniques and is also capable of certain basic picture processing operations, such as zooming, cropping and keystone correcting. Simply changing between presentation formats does not solve the problem of inherent incompatibility between dis- plays that may be of different size and may have a different default viewing distance.
  • a weakness of the reference publication is also that the solution considered therein can only work between formats and interlacing techniques that the formatter device knows in advance. The reference publication does not consider any way of generating good fusible 3D image content for any receiving device, the features of which are not yet known.
  • the objectives of the invention are achieved by recording a disparity range that describes limits of how much the raw images differ from each other, and rescaling the disparities related to different viewing depths to map the three-dimensional image into a comfortable viewing space between the maximum virtual distances in front of and behind the display at which objects in the image should appear.
  • a transmitting device is characterized by the features recited in the characterizing part of the independent claim directed to a transmitting device.
  • An imaging module according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to an imaging module.
  • a receiving device is characterized by the features recited in the characterizing part of the independent claim directed to a receiving device.
  • a transmission system according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to a transmission system.
  • a transmitting method according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to a transmit- ting method.
  • a receiving method according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to a receiving method.
  • Objects that appear in a three-dimensional image are located at various distances from the imaging arrangement.
  • the distance between the imaging arrangement and an imaged object is commonly referred to as the imaging depth.
  • the structural parameters of the camera arrangement determines, what is the disparity between raw images related to each imaging depth value.
  • the disparities related to the minimum imaging depth and the maximum imaging depth define a disparity range that is characteristic to each particular imaging arrangement.
  • the disparity range can be recorded, stored and transmitted together with a pair of raw images.
  • An autostereographic display that is to be used for fusing the raw images into a three-dimensional image has a characteristic comfortable viewing space, which extends from a virtual front edge in front of the display screen to a virtual back edge behind the display screen.
  • a characteristic comfortable viewing space which extends from a virtual front edge in front of the display screen to a virtual back edge behind the display screen.
  • the invention makes this particularly easy, because how much in front of or behind the plane of the display screen objects seem to appear depends directly on the corresponding disparity between the component images. If the user wants to e.g. shift the front edge of the comfortable viewing space towards the plane of the display screen, he simply tells the displaying device to decrease the corresponding maximum disparity value.
  • An important advantage of the invention is its adaptability to automatically and in- stantly display images on autostereographic displays of various sizes.
  • An image may be viewed on a small-size autostereographic display of a portable communications device or a display screen of a personal computer, or even on a giant screen of a 3D cinema system.
  • the same transmission format (raw images + disparity range) can be used to transmit a stereographic im- age to all these purposes, so that only some disparity mapping is needed to adapt the image for displaying in each case.
  • the invention allows flexible content sharing and seamless interaction between all kinds of devices, portable and non-portable, that are capable of displaying stereographic images.
  • Fig. 1 illustrates a known parallax barrier display principle
  • fig. 2 illustrates a known lenticular display principle
  • fig. 3 illustrates the known principle of producing a stereographic image
  • fig. 4 illustrates the concept of comfortable viewing space
  • fig. 5 illustrates an imaging arrangement imaging a close object and a distant object
  • fig. 6 illustrates a pair of raw images
  • fig. 7 illustrates certain angular relations
  • fig. 8 illustrates mapping the virtual appearance of the close and distant objects to the comfortable viewing space
  • fig. 9 illustrates subpixel separation associated with the image of a distant object
  • fig. 10 illustrates subpixel separation associated with the image of a close object
  • fig. 11 illustrates the transmission between a transmitting device, a receiving device and a display
  • fig. 12 illustrates an exemplary composition of functional blocks in the transmitting device, the receiving device and the display
  • fig. 13 illustrates a method and a software program product executed by a transmitting device
  • fig. 14 illustrates a method and a software program product executed by a receiving device and a display.
  • Fig. 4 illustrates schematically how the three-dimensional image taken in fig. 3 could appear to a viewer on an autostereogrphic display screen 401.
  • the imaged objects are transparent bubbles.
  • the corresponding right-eye subpixel should appear at AR and the corresponding left-eye subpixel should appear at AL.
  • Point A is the point of the imaged objects that was closest to the camera arrangement, so in the displayed image it appears closest to the viewer.
  • the corresponding right-eye and left-eye subpix- els should appear at BR and BL respectively.
  • Point A appears to be virtually located at a distance 402 in front of the display screen 401
  • point B appears to be virtually located at a distance 403 behind the display screen 401.
  • How large are the distances 402 and 403 depends on the disparity between AR and AL as well as BR and BL respectively as well as on the viewing distance (the distance between the eyes of the viewer and the display screen). For reasons of human visual ergonomy, it is not possible to strech the distances 402 and 403 more than certain limits that depend on the size of the display screen as well as on the default viewing distance.
  • the viewer's eyes should focus at the distance of the display screen 401 but converge on a point that is closer by the amount of distance 402, which contradicts the normal rules of operation of the human visual system.
  • a comfortable viewing space so that it extends from the maximum distance in front of the display screen where objects of the three-dimensional image may be made to virtually appear to the maximum distance behind the display screen where objects of the three-dimensional image may be made to virtually appear, so that said features of human visual ergonomy still allow said objects to be viewed comfortably.
  • the three- dimensional image has been made to utilize the whole depth of the comfortable viewing space in fig. 4, we may denote the depth of the comfortable viewing space as distance 404.
  • features of the display screen have a major effect on how far in front of the display screen and how far behind the display screen the comfortable viewing space will reach.
  • Said features include factors like size and shape of the display; the de- fault viewing distance; the size, shape and distribution of pixels; the sharpness and resolution of the display; reverse half occlusions (resulting from foreground objects being cut by a window that is perceived behind the object) and the structure and operation of the autostereography mechanism.
  • FIG. 5 illustrates an imaging arrangement in which two horizontally separated cameras 501 and 502 each take a picture of a scenery that comprises a very close object 503 and a very distant object (or background) 504.
  • minimum and maximum imaging depths have been defined for the imaging arrangement, and that the close object 503 happens to be exactly at the minimum imaging depth 505 while the distant object 504 happens to be at the maximum imaging depth 506.
  • the minimum imaging depth 505 and the maximum imaging depth 506 are features of the imaging arrangement and not features of any particular image, even if in the exemplary case of fig. 5 the objects 503 and 504 happen to be located at exactly the minimum and maximum imaging depths respectively.
  • the optical axes of the cameras 501 and 502 are parallel to each other in fig. 5, which means that the central axis mentioned above is an imaginary line that is parallel to the optical axes and located in the middle between them. It has been found that using parallel cameras rather than converged ones, the optical axes of which would intersect at some default imaging depth, produces superior image quality. Since the maximum imaging depth has been set at infinity in fig. 5, mathematically speaking this is the same as using converged cameras with just the optical axis intersection point taken to infinity. Thus, no contradiction is caused by saying that the optical axes of the cameras 501 and 502 intersect at the maximum imaging depth.
  • an object 504 located at the maximum imaging depth 506 has zero disparity in the raw images 601 and 602 illustrated in figs. 6a and 6b.
  • the distant object appears at the same horizontal location in each raw image.
  • there is a small but finite horizontal difference because even a distant object is never truly at infinite distance, but for practical considerations we may neglect the small difference for the time being.
  • the close object 503 does not appear at the same horizontal location in the raw images. In the left-hand raw image 601 it appears X/2 units to the right from the center of the raw image. Since the close object 503 was centered on the imaginary central axis between the optical axes of the cameras, in the right-hand raw image it appears correspondingly X/2 units to the left from the center. The close object 503 was located at the predefined minimum imaging depth 505, so we may say that the disparity X between its appearances in the raw images 601 and 602 is the maximum disparity. All disparity values associated with intermediate objects would fall between the maximum disparity X and the minimum disparity 0, which latter value could be denoted with Y.
  • X/2 which is one half of the maximum disparity corresponds to an angular separation between the optical axis of a camera and a line drawn from the camera to a centrally located object at the minimum imaging depth.
  • figs. 7 and 8 illustrate at least one alternative that holds for displays for which the maximum absolute value of dispar- ity is the same for objects appearing virtually in front of the screen and objects appearing virtually behind the screen.
  • the basic principle is that objects that were exactly at the minimum imaging depth should virtually appear exactly at the front edge of the comfortable viewing space, and objects that were exactly at the maximum imaging depth should virtually appear exactly at the back edge of the comfortable viewing space.
  • lines drawn exactly in the middle of the angle between the directions to the center of the front edge and the center of the back edge of the comfortable viewing space intersect exactly at the plane of the display screen 401.
  • Figs. 9 and 10 illustrate defining the disparities for the most distant object and closest object in the displayed image respectively.
  • fig. 9 to make an object vir- tually appear at the back edge of the comfortable viewing space, there should be a disparity the absolute value of which is Y 1 units between the left-eye and right-eye subpixels associated with said object.
  • fig. 10 to make an object virtually appear at the front edge of the comfortable viewing space, there should be a disparity the absolute value of which is X 1 units.
  • the sign of this disparity is opposite to the sign of the disparity associated with the most distant object. Selecting the signs of the disparities is just a matter of convention.
  • disparities where the left-eye subpixel is to the left of the right-eye subpixel are negative, and disparities where the left-eye subpixel is to the right of the right-eye subpixel are positive. If we make this selection, we must note that in the imaging arrangement of figs. 5 and 6 all disparities will have positive values (i.e. all objects closer than infinity will appear in the right-eye raw image more to the left than in the left-eye raw image).
  • - maps a disparity 0 (or more generally: a disparity associated with objects at the maximum imaging depth) between the raw images into a disparity -Y 1 between subpixels in the displayed image and
  • Linearly mapping a range of values into another range of values is a simple mathematical operation that only requires a scaling factor and a displacement value.
  • the most natural unit of disparity is pixels.
  • the imaging arrangement has a different resolution and thus a different number of pixels in horizontal direction across the image than the displaying arrangement. This has to be taken into account in determining the scaling factor.
  • the maximum disparity i.e. the disparity associated with the front edge of the comfortable viewing space
  • the minimum disparity i.e. the disparity associated with the back edge of the comfortable viewing space
  • the maximum disparity i.e. the disparity associated with the minimum imaging depth
  • the minimum disparity i.e.
  • ZD (Djn,max + Din.min — D O ut,max — D O ut,min) / 2 .
  • a concept of displacing the zero disparity plane without scaling is the same as performing a mapping from input disparities to intermediate disparities, assuming that the number of pixels in horizontal direction across the image remains the same.
  • Concerning processing order it is possible to first displace the zero disparity plane without scaling and to thereafter scale the intermediate disparities into output disparities to take into account the different number of pixels in horizontal direction across the image.
  • the other possibility is to scale first and displace the zero disparity plane thereafter. Of these two possibilities, the first-mentioned tends to produce more accurate results.
  • the maximum and minimum disparities D in ,m a x and Dj nim j n that may occur in a pair of raw images does not depend on image content but only on the properties of the imaging arrangement.
  • the maximum and minimum disparities D ou t,max and D ou t,min that correspond to objects virtually appearing at the front and back edge of the comfortable viewing space do not depend on image content but only on the properties of the displaying arrangement.
  • Naturally relatively few images will actually include objects at the very minimum or the very maximum imaging depth but something in between, but it will then be on the responsibility of the displaying ar- rangement to find the corresponding disparity values that will fall between the extreme limits.
  • the actual process of interlacing in which the displaying arrangement detects the pixels that represent close or distant objects and consequently associates each pixel pair in the raw images with the appropriate disparity value, is not important to the invention.
  • the present invention concerns the question of how does the displaying arrangement define the mapping function that maps an input disparity, once found, to an output disparity that will determine the horizontal distance between the left-eye and right-eye subpixels.
  • Fig. 11 illustrates a data flow process according to an embodiment of the invention when three-dimensional digital image data is transferred from a transmitting device
  • Fig. 12 illustrates an example of certain functional blocks of said devices that may take part in preparing, transferring, processing and outputting the image data.
  • the transmitting device 1101 transmits the raw images and an indication of an as- sociated disparity range to the receiving device.
  • the transmitting device is also the originator of the three-dimensional image data, for which purpose it comprises at least two parallel cameras 1201.
  • the raw images taken by the cameras are stored in an image memory 1202.
  • Characteristics of the imaging arrangement such as camera properties, camera separation, general image properties, possible standardized minimum and maximum imaging depth are stored in a characteristics memory 1203.
  • Control means 1204 are provided in the transmitting device for enabling a user to control the operations of taking, handling and transmitting three-dimensional images.
  • output means 1205 For transmitting raw images and the associated disparity ranges to outside of the transmit- ting device there is provided output means 1205, which in the case of a portable communications device typically include a wireless transceiver.
  • the cameras 1201 , the image memory 1202 and the characteristics memory 1203 or a part of these functional blocks could be implemented as an imaging module that can be manufactured and sold separately for installing to various kinds of electronic de- vices.
  • the imaging characteristics of the transmitting device are constant: the cameras are fixedly located in the transmitting device, they have a fixed focal length, the minimum and maximum imaging depths are constant and so on. In that case it is particularly simple to store indications of the maximum and minimum disparity values associated with a pair of raw images, because also said maximum and minimum disparity values will be constant.
  • the cameras are equipped with zoom objectives or exchangeable lenses that change the focal length, or the aperture or separation between cameras can be changed, or there may be more than two cameras so that the user may select which of them to use, or some other imaging characteristic is not constant.
  • the control means 1204 it is useful to have a coupling from the control means 1204 to the characteristics memory 1203 so that whatever changes the user makes to the im- aging arrangement, always the most appropriate corresponding indications of the maximum and minimum disparity values may be read from the characteristics memory 1203 and transmitted along with the raw images.
  • the coupling from the control means 1204 to the characteristics memory 1203 may be also indirect, so that when the user changes focal length or other imaging characteristic, the imaging arrangement produces an indication of the change that causes a subsequent read operation in the characteristics memory to find the most appropriate information.
  • the nature and content of the indication of the maximum and minimum disparity values may also vary.
  • the most straightforward alternative is to express the maximum and minimum disparity values in the units of pixels that have one-to-one correspondence with the horizontal pixel count in the raw images, and transmit the explicit values along with the raw images. It is also possible to use some other units than pixels.
  • Another alternative is that the transmitting device already compares the raw images enough to find the pixel pairs that correspond to the closest image point and the most image point; then the transmitting device could specifically identify these pixels in the raw images rather than announce any measure of their separation.
  • Yet another alternative may be used if the imaging characteristics of the transmitting device are constant.
  • the maximum and minimum disparity values typical to each commonly known type of transmitting device could be standardized, so that the transmitting device only needed to transmit an indication of its type. A receiving device could then consult some previously stored look-up table that associates imaging device types with their characteristic maximum and minimum disparity values.
  • the transmitting device transmits geometric details about the imaging arrangement along with the raw images, such as the maximum and minimum imaging depths, focal length, CCD size and/or others, from which a receiving device in turn can derive the appropriate maximum and minimum disparity values of the original imaging arrangement.
  • the minimum disparity between the raw image is always zero or some other constant value, so that no indication of minimum disparity needs to be transmitted. Constantly assuming a zero minimum disparity is synonymous to assuming that the two optical axes always intersect at the maximum imaging depth, which may but does not have to be infinity.
  • the receiving device 1102 receives it along with the raw images through a receiver 1211.
  • the raw image data goes to an image processor 1212, where it waits to be processed while a disparity mapper 1213 prepares the mapping between disparity values in the raw images and disparity values for the interlaced images to be displayed.
  • the disparity map- per 1213 In order to perform the mapping, the disparity map- per 1213 must know the disparity range associated with the raw images, as well as the allowable disparity range that may appear eventually in the displayed image. If the latter is constant, the disparity mapper 1213 may simply read it from a characteristics memory 1214.
  • the disparity mapper 1213 may calculate the disparity range allowable in the displayed image from stored information such as display size, display pixel pitch, and human ergonomic factors of the display (including default viewing distance). If the receiving device can be coupled to a variety of displays, it is advisable to arrange storing the appropriate, display- dependent values to the characteristics memory 1214 at the time when a new display is coupled. Most advantageously the receiving device 1102 comprises also control means 1215 through which a user may input his preferences about increasing or decreasing the disparity range allowable in the displayed image.
  • the disparity mapper 1213 may use it and its knowledge about the original disparity range as- sociated with the raw images to produce the mapping function (see e.g. formulas (1 )-(4) above), which it delivers to the image processor 1212.
  • the task of the image processor 1212 is ultimately to convert the raw images into the new image pair that will be conveyed to the display 1103.
  • This converting may include scaling and cropping of the images as well as making the changes to disparity pixel-wise. Cropping is needed because displacing the zero disparity plane effectively moves the background of each raw image sideways so that information is lost from a vertical bar at each side edge of the image. This vertical bars must be cut out
  • the proportions of the raw images are not the same as the proportions of the display, which means that either the image must be cropped to fit to the display or empty fields must be added to some sides of the image.
  • the image processor 1212 uses to identify mutually corresponding pixels in the raw images.
  • the invention affects the changes to be made in disparity: once the image processor 1212 has found a pixel pair with some initial disparity Dj n in the raw images, it relocates the pixels of that pixel pair in the new images so that their new disparity is calculated according to the mapping formula that is based on knowing the initial disparity range as well as the disparity range allowable in the displayed image.
  • the display 1103 is shown to comprise the interlacing means 1221 that directs the new images prepared in the image processor 1212 to the arrays of subpixels that constitute the display screen 1222.
  • Figs. 13 and 14 illustrate the methods performed at the transmitting and receiving ends respectively.
  • the drawings may also be considered as illustrations of the software program products employed at the transmitting and receiving ends.
  • the transmitting device records the raw images, and at step 1302 it obtains an indication of the disparity range, i.e. the maximum and minimum disparity values associated with the raw images.
  • the transmitting device combines the raw images and the indication of the disparity range for transmission, and at step 1304 it transmits them to a receiving device.
  • the receiving device receives the raw images and the indication of the disparity range associated with the raw images at step 1401. It obtains information about the display characteristics at step 1402 and uses that information to determine the disparity range allowable in the displayed image at step 1403.
  • the re-caliving device uses the information it has about the raw image disparity range and the output image disparity range to determine the appropriate disparity mapping.
  • the images are processed, which includes relocating the pixels in the horizontal direction according to the disparity mapping function determined in the previous step. Depending on what display technology and image file standard is to be used, the processing step 1405 may include format conversion operations such as those explained for example in the prior art publication US 2004/0218269 A1.
  • the left-eye and right-eye subimages are compressed in horizontal direction by a factor 2 and written side by side into a single image file that follows otherwise the JPEG (Joint Photographic Experts Group) standard but has a certain additional character string in its header and an extension .stj in its file name.
  • the completed new images are output on the display.
  • the exemplary embodiments described above should not be construed as limiting. For example, even if we have consistently considered using only two cameras, the invention does not exclude using three or more cameras. In a multi-camera arrangement the concept of disparity must be replaced with a more generally de- fined horizontal displacement of pixels depending on imaging depth, but otherwise the principle of the invention can be applied in a straightforward manner. For each camera there can be defined the characteristic horizontal displacements associated with objects at the minimum imaging depth and the maximum imaging depth on the central line of sight of the imaging arrangement. These characteristic hori- zontal displacements take the position of maximum and minimum disparities associated with a raw image pair in the description above.
  • the principle of the invention is also applicable to the obtaining, transmitting, processing and displaying of series of images, which as a sequence constitute a video signal. Since the initial disparity range is a property of the imaging arrangement and does not depend on image content, applying the invention to the processing of a video signal is particularly simple: the indication of the initial disparity range only needs to be transmitted once unless the characteristics of the imaging arrangement are dynamically changed during shooting, in which case the indication of the disparity range needs to be transmitted regularly so that it covers each change.
  • Yet another exemplary modification concerns the implementation of the cameras: instead of the (at least) two parallel cameras it is possible to equip a single camera (i.e. a single CCD array) with a branching lens system that sets up two parallel optical paths and includes a shutter system that allows taking a picture through each optical path in turn.
  • a single camera i.e. a single CCD array
  • a branching lens system that sets up two parallel optical paths and includes a shutter system that allows taking a picture through each optical path in turn.
  • image compression according to the JPEG format may average out adjacent pixels, which means that if such image compression were to be applied at an inappropriate phase of handling stereographic images, it might destroy the stereographic properties altogether.
  • numeric values used in the description are exemplary. For example, even if at the time of writing the description the default viewing distance of the autostereo- graphic displays of portable devices is in the order of 40-60 cm, other displays may involve shorter or longer default viewing distances. Displays for use in per- sonal computers have usually longer default viewing distances, up to 1-2 meters. Much longer default viewing distances occur in audiovisual presentation systems like 3D cinemas.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Image Processing (AREA)
  • Stereoscopic And Panoramic Photography (AREA)
  • Measurement Of Optical Distance (AREA)

Abstract

Three-dimensional digital image data comes from a stereographic imaging arrangement (501, 502, 1201) that takes a first raw image (601) along a first optical axis and a second raw image (602) along a second optical axis. The imaging arrangement (501, 502, 1201) has a maximum imaging depth (506) and a minimum imaging depth (505). It transmits the first raw image (601) and the second raw image (602) to a receiving device (1102) along with an indication of a disparity range between a maximum disparity value and a minimum disparity value. The maximum disparity value is a measure of a difference between locations in the first (601) and second (602) raw images that represent the minimum imaging depth (505). The minimum disparity value is a measure of a difference between locations in the first (601) and second (602) raw images that represent the maximum imaging depth (506).

Description

Method and devices for generating, transferring and processing three- dimensional image data
The invention concerns generally the technology of obtaining, transferring and outputting three-dimensional image data. Especially the invention concerns the problem of transferring three-dimensional image data in a form that allows it to be displayed with any display device.
The visual system of the brain produces a perception of three-dimensionality by combining the two slightly different images coming from the eyes. An image displayed on a two-dimensional display screen can give rise to the same perception without the need of special viewing glasses or the like, if the display screen is autostereoscopic, i.e. in itself capable of emitting slightly different information to the right and left eye of the viewer. The two autostereoscopic display technologies that are most widely used for this purpose at the time of writing this specification are known as the parallax barrier principle and the lenticular principle, although also other approaches are known as well.
Fig. 1 is a simple schematic example of a known parallax barrier type liquid crystal display. A liquid crystal layer 101 comprises right-eye subpixels and left-eye sub- pixels marked with R and L respectively. A backlighting layer 102 emits light from behind the liquid crystal display. A parallax barrier layer 103 contains slits that only allow light to propagate through the right-eye subpixels to the right eye of the viewer and through the left-eye subpixels to the left eye of the viewer. It is also possible to have the parallax barrier layer 103 in front of the liquid crystal layer 101 instead of between it and the backlighting layer 102.
Fig. 2 is a simple schematic example of a known lenticular type liquid crystal dis- play. Also here the liquid crystal layer 201 comprises right-eye subpixels and left- eye subpixels. The backlighting layer 202 emits light through the liquid crystal layer 201. A layer 203 of lenticulars, i.e. cylindrical lenses, collimates the light so that light rays coming through a right-eye subpixel continue parallelly towards the right eye of the viewer and light rays coming through a left-eye subpixel continue parallelly towards the left eye of the viewer.
Fig. 3 illustrates schematically a known principle for generating three-dimensional image information of a group of imaged objects. Two horizontally separated cameras 301 and 302 take pictures at the same time but otherwise independently of each other, resulting in two so-called raw images 303 and 304 respectively. Images of the objects appear at different locations in the raw images, because the cameras 301 and 302 see the imaged objects from different directions. It should be noted, though, that the differences in the raw images appear in highly exaggerated proportion in fig. 3 compared to most practical solutions, because for reasons of making fig. 3 graphically clear the imaged objects are drawn very close to the camera arrangement. Together the raw images 303 and 304 constitute a stereograph that could be displayed using any suitable display technology, including but not being limited to those illustrated in figs. 1 and 2.
There are no widespread standards that would define the parameters that affect the generation of stereographs or their presentation on display screens. Numerous parameters have a significant effect, such as the separation between cameras; fo- cal length; size, resolution and angular pixel pitch of the CCD (Charge-Coupled Device) arrays in the cameras; size, resolution and pixel structure of the display; default viewing distance; and the amount of scaling, cropping and other processing that is required to map the raw images to the subpixel arrays, time-interlaced fields or other display elements that eventually present the fused image to the viewer. The lack of standards means that a stereographic image taken with a certain imaging arrangement and prepared for presentation on a particular display type is not likely to work weil on any other display type.
The incompatibility problem will become more and more important when three- dimensional imaging and autostereoscopic displays find their way to simple and inexpensive consumer appliances, such as portable communication devices, where conventional cameras and high-quality two-dimensional displays are already in widespread use. A user that has taken a three-dimensional image with a portable communication device of one brand wants to be sure that he can transmit the image to another user, who can view it correctly irrespective of which brand of a device the recipient has.
A US patent publication number U S 2004/0218269 A1 discloses a 3D Data Formatter, which acts as a format converter between various known interlacing techniques and is also capable of certain basic picture processing operations, such as zooming, cropping and keystone correcting. Simply changing between presentation formats does not solve the problem of inherent incompatibility between dis- plays that may be of different size and may have a different default viewing distance. A weakness of the reference publication is also that the solution considered therein can only work between formats and interlacing techniques that the formatter device knows in advance. The reference publication does not consider any way of generating good fusible 3D image content for any receiving device, the features of which are not yet known.
An objective of the present invention is to present a method and devices for generating, transferring and processing three-dimensional image data in a form that does not require ensuring compatibility between the imaging arrangement and the displaying arrangement in advance. Another objective of the present invention is to enable efficient transfer of generic three-dimensional image content.
The objectives of the invention are achieved by recording a disparity range that describes limits of how much the raw images differ from each other, and rescaling the disparities related to different viewing depths to map the three-dimensional image into a comfortable viewing space between the maximum virtual distances in front of and behind the display at which objects in the image should appear.
A transmitting device according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to a transmitting device. An imaging module according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to an imaging module.
A receiving device according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to a receiving device.
A transmission system according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to a transmission system.
A transmitting method according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to a transmit- ting method.
A receiving method according to the invention is characterized by the features recited in the characterizing part of the independent claim directed to a receiving method.
Software program products for a transmission operation and a reception operation according to the invention are characterized by the features recited in the characterizing part of the independent claims directed such software program products.
Embodiments of the invention are described in the depending claims.
Objects that appear in a three-dimensional image are located at various distances from the imaging arrangement. The distance between the imaging arrangement and an imaged object is commonly referred to as the imaging depth. We may rea- sonably assume that there are minimum and maximum limits to imaging depth: for example, all imaged objects must appear between half a meter and infinity. The structural parameters of the camera arrangement determines, what is the disparity between raw images related to each imaging depth value. The disparities related to the minimum imaging depth and the maximum imaging depth define a disparity range that is characteristic to each particular imaging arrangement.
The disparity range can be recorded, stored and transmitted together with a pair of raw images. An autostereographic display that is to be used for fusing the raw images into a three-dimensional image has a characteristic comfortable viewing space, which extends from a virtual front edge in front of the display screen to a virtual back edge behind the display screen. By suitably scaling and shifting the disparities between the raw images it is possible to find new disparities that make the fused image appear within the comfortable viewing space: objects that were located at the maximum depth from the imaging arrangement are mapped to the back edge of the comfortable viewing space, and objects that were located at the minimum depth are mapped to the front edge.
Since the limits of the comfortable viewing space depend partly on the personal preferences of each user, there should be a possibility of dynamically modifying them. The invention makes this particularly easy, because how much in front of or behind the plane of the display screen objects seem to appear depends directly on the corresponding disparity between the component images. If the user wants to e.g. shift the front edge of the comfortable viewing space towards the plane of the display screen, he simply tells the displaying device to decrease the corresponding maximum disparity value.
An important advantage of the invention is its adaptability to automatically and in- stantly display images on autostereographic displays of various sizes. An image may be viewed on a small-size autostereographic display of a portable communications device or a display screen of a personal computer, or even on a giant screen of a 3D cinema system. According to the invention, the same transmission format (raw images + disparity range) can be used to transmit a stereographic im- age to all these purposes, so that only some disparity mapping is needed to adapt the image for displaying in each case. The invention allows flexible content sharing and seamless interaction between all kinds of devices, portable and non-portable, that are capable of displaying stereographic images. The exemplary embodiments of the invention presented in this patent application are not to be interpreted to pose limitations to the applicability of the appended claims. The verb "to comprise" is used in this patent application as an open limita- tion that does not exclude the existence of also unrecited features. The features recited in depending claims are mutually freely combinable unless otherwise explicitly stated.
The novel features which are considered as characteristic of the invention are set forth in particular in the appended claims. The invention itself, however, both as to its construction and its method of operation, together with additional objects and advantages thereof, will be best understood from the following description of specific embodiments when read in connection with the accompanying drawings.
Fig. 1 illustrates a known parallax barrier display principle, fig. 2 illustrates a known lenticular display principle, fig. 3 illustrates the known principle of producing a stereographic image fig. 4 illustrates the concept of comfortable viewing space, fig. 5 illustrates an imaging arrangement imaging a close object and a distant object, fig. 6 illustrates a pair of raw images, fig. 7 illustrates certain angular relations, fig. 8 illustrates mapping the virtual appearance of the close and distant objects to the comfortable viewing space, fig. 9 illustrates subpixel separation associated with the image of a distant object, fig. 10 illustrates subpixel separation associated with the image of a close object, fig. 11 illustrates the transmission between a transmitting device, a receiving device and a display, fig. 12 illustrates an exemplary composition of functional blocks in the transmitting device, the receiving device and the display, fig. 13 illustrates a method and a software program product executed by a transmitting device, and fig. 14 illustrates a method and a software program product executed by a receiving device and a display.
Fig. 4 illustrates schematically how the three-dimensional image taken in fig. 3 could appear to a viewer on an autostereogrphic display screen 401. We assume for simplicity that the imaged objects are transparent bubbles. In order to correctly display point A in the image, the corresponding right-eye subpixel should appear at AR and the corresponding left-eye subpixel should appear at AL. Point A is the point of the imaged objects that was closest to the camera arrangement, so in the displayed image it appears closest to the viewer. In order to correctly display the most distant point B in the image, the corresponding right-eye and left-eye subpix- els should appear at BR and BL respectively.
Point A appears to be virtually located at a distance 402 in front of the display screen 401 , and point B appears to be virtually located at a distance 403 behind the display screen 401. How large are the distances 402 and 403 depends on the disparity between AR and AL as well as BR and BL respectively as well as on the viewing distance (the distance between the eyes of the viewer and the display screen). For reasons of human visual ergonomy, it is not possible to strech the distances 402 and 403 more than certain limits that depend on the size of the display screen as well as on the default viewing distance. For example, when looking at point A, the viewer's eyes should focus at the distance of the display screen 401 but converge on a point that is closer by the amount of distance 402, which contradicts the normal rules of operation of the human visual system. There are no globally valid maximum values for the distances 402 and 403 in fig. 4: what a viewer considers comfortable is after all a matter of personal taste.
Generally we may define the concept of a comfortable viewing space so that it extends from the maximum distance in front of the display screen where objects of the three-dimensional image may be made to virtually appear to the maximum distance behind the display screen where objects of the three-dimensional image may be made to virtually appear, so that said features of human visual ergonomy still allow said objects to be viewed comfortably. Assuming that the three- dimensional image has been made to utilize the whole depth of the comfortable viewing space in fig. 4, we may denote the depth of the comfortable viewing space as distance 404. Experiments have shown that for a display screen of a portable communications device, the default viewing distance of which is in the order of 40- 60 cm, an upper limit of the disparity between AR and AL - or between BR and BL - is in the order of a few millimeters, but depends remarkably on the exact type and size of the display. At the time of writing this description there are portable- device-sized displays in which the disparity should not be more than about ±2 mm, but also some in which it can be conveniently about ±10 mm. The plus or minus sign comes from the fact that for close objects the right-eye subpixel should be more to the left than the left-eye subpixel (see AR vs. AL) while for distant objects the left-eye subpixel should be more to the left than the right-eye subpixel (see BL vs. BR).
Features of the display screen have a major effect on how far in front of the display screen and how far behind the display screen the comfortable viewing space will reach. Said features include factors like size and shape of the display; the de- fault viewing distance; the size, shape and distribution of pixels; the sharpness and resolution of the display; reverse half occlusions (resulting from foreground objects being cut by a window that is perceived behind the object) and the structure and operation of the autostereography mechanism. Thus it is possible to determine for each displaying device certain default values for distances 402 and 403 that set the comfortable viewing space at a default location in relation to the display screen. Since the question of comfortable viewing is ultimately subjective and depends on personal taste, it is advantageous to allow the default values to be changed according to user preferences.
Having defined the comfortable viewing space we consider in more detail the concept of disparity between left-eye and right-eye subimages. Fig. 5 illustrates an imaging arrangement in which two horizontally separated cameras 501 and 502 each take a picture of a scenery that comprises a very close object 503 and a very distant object (or background) 504. We assume that minimum and maximum imaging depths have been defined for the imaging arrangement, and that the close object 503 happens to be exactly at the minimum imaging depth 505 while the distant object 504 happens to be at the maximum imaging depth 506. For excluding possible ambiguities, it is useful to define the minimum imaging depth 505 and the maximum imaging depth 506 along a central axis of the imaging arrangement. It should also be noted that the minimum imaging depth 505 and the maximum imaging depth 506 are features of the imaging arrangement and not features of any particular image, even if in the exemplary case of fig. 5 the objects 503 and 504 happen to be located at exactly the minimum and maximum imaging depths respectively.
The optical axes of the cameras 501 and 502 are parallel to each other in fig. 5, which means that the central axis mentioned above is an imaginary line that is parallel to the optical axes and located in the middle between them. It has been found that using parallel cameras rather than converged ones, the optical axes of which would intersect at some default imaging depth, produces superior image quality. Since the maximum imaging depth has been set at infinity in fig. 5, mathematically speaking this is the same as using converged cameras with just the optical axis intersection point taken to infinity. Thus, no contradiction is caused by saying that the optical axes of the cameras 501 and 502 intersect at the maximum imaging depth. An important consequence thereof is that an object 504 located at the maximum imaging depth 506 has zero disparity in the raw images 601 and 602 illustrated in figs. 6a and 6b. In other words, the distant object appears at the same horizontal location in each raw image. To be very exact, there is a small but finite horizontal difference because even a distant object is never truly at infinite distance, but for practical considerations we may neglect the small difference for the time being.
Other values than zero for minimum disparity are possible either so that the cameras are converged, meaning that objects that are more distant than the intersecting point of the optical axes will have a negative disparity, or so that for some reason related to e.g. lighting or focusing possibilities it is practical to define maximum imaging depth to be considerably less than infinity. In the last-mentioned case the minimum disparity will have a small positive value.
The close object 503 does not appear at the same horizontal location in the raw images. In the left-hand raw image 601 it appears X/2 units to the right from the center of the raw image. Since the close object 503 was centered on the imaginary central axis between the optical axes of the cameras, in the right-hand raw image it appears correspondingly X/2 units to the left from the center. The close object 503 was located at the predefined minimum imaging depth 505, so we may say that the disparity X between its appearances in the raw images 601 and 602 is the maximum disparity. All disparity values associated with intermediate objects would fall between the maximum disparity X and the minimum disparity 0, which latter value could be denoted with Y. Speaking in angular terms, X/2 which is one half of the maximum disparity corresponds to an angular separation between the optical axis of a camera and a line drawn from the camera to a centrally located object at the minimum imaging depth.
The purpose is to map the eventually resulting three-dimensional image to the comfortable viewing space of a display so that the closest objects in the image will virtually appear in front of the display screen and the most distant objects will virtually appear behind the display screen. How should one decide, what object (if any) should virtually appear exactly in the plane of the display screen? Generally speaking that could be decided quite freely, but figs. 7 and 8 illustrate at least one alternative that holds for displays for which the maximum absolute value of dispar- ity is the same for objects appearing virtually in front of the screen and objects appearing virtually behind the screen.
The basic principle is that objects that were exactly at the minimum imaging depth should virtually appear exactly at the front edge of the comfortable viewing space, and objects that were exactly at the maximum imaging depth should virtually appear exactly at the back edge of the comfortable viewing space. We may draw an imaginary line at half way between the optical axis (i.e. the direction to the centrally located object at the maximum imaging depth) and the direction to the cen- trally located object at the minimum imaging depth for both cameras. These lines intersect at imaging depth 701 in fig. 7. Similarly in the displayed three- dimensional image of fig. 8, lines drawn exactly in the middle of the angle between the directions to the center of the front edge and the center of the back edge of the comfortable viewing space intersect exactly at the plane of the display screen 401. We may deduce that if the imaged scenery contained an object at imaging depth 701 , that object would virtually appear in the plane of the display screen. Basic trigonometry could be used to derive an exact formula for the imaging depth 701 , defined in terms of camera separation and the minimum and maximum imaging depths.
Concerning displays for which the maximum absolute value of disparity is not the same for objects appearing virtually in front of the screen and objects appearing virtually behind the screen, similar geometric considerations can be made. For ex- ample, if the absolute value of the maximum disparity for objects that virtually appear behind the screen is 3 mm and the absolute value of the maximum disparity for objects that virtually appear in front of the screen is 2 mm, instead of the simple half-way angle lines of figs. 7 and 8 we should draw lines in the middle that in each case divide the angle between the limiting directions into component angles that have the relative magnitudes of 3 : 2 instead of 1 : 1 as in figs. 7 and 8. The zero disparity plane would then be at the distance where these drawn lines intersect.
Figs. 9 and 10 illustrate defining the disparities for the most distant object and closest object in the displayed image respectively. In fig. 9, to make an object vir- tually appear at the back edge of the comfortable viewing space, there should be a disparity the absolute value of which is Y1 units between the left-eye and right-eye subpixels associated with said object. In fig. 10, to make an object virtually appear at the front edge of the comfortable viewing space, there should be a disparity the absolute value of which is X1 units. We must note that the sign of this disparity is opposite to the sign of the disparity associated with the most distant object. Selecting the signs of the disparities is just a matter of convention. Here we define that disparities where the left-eye subpixel is to the left of the right-eye subpixel are negative, and disparities where the left-eye subpixel is to the right of the right-eye subpixel are positive. If we make this selection, we must note that in the imaging arrangement of figs. 5 and 6 all disparities will have positive values (i.e. all objects closer than infinity will appear in the right-eye raw image more to the left than in the left-eye raw image).
In order to correctly map the image taken with the imaging arrangement of figs. 5 and 6 to the displaying arrangement of fig. 8, we should thus construct a disparity mapping function that
- maps a disparity X between the raw images into a disparity X1 between subpixels in the displayed image
- maps a disparity 0 (or more generally: a disparity associated with objects at the maximum imaging depth) between the raw images into a disparity -Y1 between subpixels in the displayed image and
- linearly maps all disparities between the limiting values X and 0 between the raw images into corresponding disparities between the limiting values X1 and -Y' between subpixels in the displayed image.
Linearly mapping a range of values into another range of values is a simple mathematical operation that only requires a scaling factor and a displacement value.
When handling digital images, the most natural unit of disparity is pixels. However, we must note that in general, the imaging arrangement has a different resolution and thus a different number of pixels in horizontal direction across the image than the displaying arrangement. This has to be taken into account in determining the scaling factor. Assuming that the maximum disparity (i.e. the disparity associated with the front edge of the comfortable viewing space) and the minimum disparity (i.e. the disparity associated with the back edge of the comfortable viewing space) of the displaying arrangement are Dout,maχ and Dout.min respectively, and that the maximum disparity (i.e. the disparity associated with the minimum imaging depth) and the minimum disparity (i.e. the disparity associated with the maximum imaging depth) of the imaging arrangement are Din,max and Dln,min respectively, the most natural selection for the scaling factor SF is SF = (Dout,max ~~ Dout,min) ' (Djn,max ~ Djmjn) (1 )
and the most natural selection for the displacement value DV is
DV = (Dout.minDjn.max ~~ DOut,maχDin,min) ' (Din,max ~" Djn.min) ■ (2)
Alternatively we may express the amount ZD of how much the zero disparity plane should be displaced as
ZD = (Djn,max + Din.min — DOut,max — DOut,min) / 2 . (3)
As an example, an imaging arrangement might have Din,maχ = +60 pixels and Din.min = 0 pixels, and a displaying arrangement might have Dout,max = +10 pixels and Dout,min = -10 pixels. Applying formulas (1 ) and (2) above, we get values SF = 1/3 and DV = -10, so for any arbitrary disparity Djn in the raw images we get the corresponding disparity Dout in the displayed image as
D0Ut = 1/3 * Din - 10 . (4)
A concept of displacing the zero disparity plane without scaling is the same as performing a mapping from input disparities to intermediate disparities, assuming that the number of pixels in horizontal direction across the image remains the same. Concerning processing order, it is possible to first displace the zero disparity plane without scaling and to thereafter scale the intermediate disparities into output disparities to take into account the different number of pixels in horizontal direction across the image. The other possibility is to scale first and displace the zero disparity plane thereafter. Of these two possibilities, the first-mentioned tends to produce more accurate results.
To be mathematically exact, we must note that the simple linear model above is an approximation, because exactly speaking the half-way angle between the direction to the closest possible object and the direction to the most distant possible object does not divide the corresponding width across the display into half. However, the difference between the linear approximation and the exact, sinusoidal relationship is so small taken the small angles that are involved in practice that it can be neglected.
For the purposes of the present invention is important to note that the maximum and minimum disparities Din,max and Djnimjn that may occur in a pair of raw images does not depend on image content but only on the properties of the imaging arrangement. Similarly the maximum and minimum disparities Dout,max and Dout,min that correspond to objects virtually appearing at the front and back edge of the comfortable viewing space do not depend on image content but only on the properties of the displaying arrangement. Naturally relatively few images will actually include objects at the very minimum or the very maximum imaging depth but something in between, but it will then be on the responsibility of the displaying ar- rangement to find the corresponding disparity values that will fall between the extreme limits.
For the purposes of the present invention is should also be noted that the actual process of interlacing, in which the displaying arrangement detects the pixels that represent close or distant objects and consequently associates each pixel pair in the raw images with the appropriate disparity value, is not important to the invention. Several known algorithms exist for comparing the raw images and finding the pixels that represent the same point in the imaged scenery although they appear horizontally displaced in the raw images. The present invention concerns the question of how does the displaying arrangement define the mapping function that maps an input disparity, once found, to an output disparity that will determine the horizontal distance between the left-eye and right-eye subpixels.
Fig. 11 illustrates a data flow process according to an embodiment of the invention when three-dimensional digital image data is transferred from a transmitting device
1101 to a receiving device 1102 and subsequently displayed on a display 1103 coupled to the receiving device 1102. Fig. 12 illustrates an example of certain functional blocks of said devices that may take part in preparing, transferring, processing and outputting the image data.
The transmitting device 1101 transmits the raw images and an indication of an as- sociated disparity range to the receiving device. In the example of fig. 12 we assume that the transmitting device is also the originator of the three-dimensional image data, for which purpose it comprises at least two parallel cameras 1201. The raw images taken by the cameras are stored in an image memory 1202. Characteristics of the imaging arrangement, such as camera properties, camera separation, general image properties, possible standardized minimum and maximum imaging depth are stored in a characteristics memory 1203. Control means 1204 are provided in the transmitting device for enabling a user to control the operations of taking, handling and transmitting three-dimensional images. For transmitting raw images and the associated disparity ranges to outside of the transmit- ting device there is provided output means 1205, which in the case of a portable communications device typically include a wireless transceiver. The cameras 1201 , the image memory 1202 and the characteristics memory 1203 or a part of these functional blocks could be implemented as an imaging module that can be manufactured and sold separately for installing to various kinds of electronic de- vices.
In the simplest possible case the imaging characteristics of the transmitting device are constant: the cameras are fixedly located in the transmitting device, they have a fixed focal length, the minimum and maximum imaging depths are constant and so on. In that case it is particularly simple to store indications of the maximum and minimum disparity values associated with a pair of raw images, because also said maximum and minimum disparity values will be constant. However, it is possible that the cameras are equipped with zoom objectives or exchangeable lenses that change the focal length, or the aperture or separation between cameras can be changed, or there may be more than two cameras so that the user may select which of them to use, or some other imaging characteristic is not constant. For such cases it is useful to have a coupling from the control means 1204 to the characteristics memory 1203 so that whatever changes the user makes to the im- aging arrangement, always the most appropriate corresponding indications of the maximum and minimum disparity values may be read from the characteristics memory 1203 and transmitted along with the raw images. The coupling from the control means 1204 to the characteristics memory 1203 may be also indirect, so that when the user changes focal length or other imaging characteristic, the imaging arrangement produces an indication of the change that causes a subsequent read operation in the characteristics memory to find the most appropriate information.
The nature and content of the indication of the maximum and minimum disparity values may also vary. The most straightforward alternative is to express the maximum and minimum disparity values in the units of pixels that have one-to-one correspondence with the horizontal pixel count in the raw images, and transmit the explicit values along with the raw images. It is also possible to use some other units than pixels. Another alternative is that the transmitting device already compares the raw images enough to find the pixel pairs that correspond to the closest image point and the most image point; then the transmitting device could specifically identify these pixels in the raw images rather than announce any measure of their separation. Yet another alternative may be used if the imaging characteristics of the transmitting device are constant. The maximum and minimum disparity values typical to each commonly known type of transmitting device could be standardized, so that the transmitting device only needed to transmit an indication of its type. A receiving device could then consult some previously stored look-up table that associates imaging device types with their characteristic maximum and minimum disparity values. Yet another alternative is that the transmitting device transmits geometric details about the imaging arrangement along with the raw images, such as the maximum and minimum imaging depths, focal length, CCD size and/or others, from which a receiving device in turn can derive the appropriate maximum and minimum disparity values of the original imaging arrangement. Yet another alternative is to define that the minimum disparity between the raw image is always zero or some other constant value, so that no indication of minimum disparity needs to be transmitted. Constantly assuming a zero minimum disparity is synonymous to assuming that the two optical axes always intersect at the maximum imaging depth, which may but does not have to be infinity.
Whatever is the nature and content of the indication of the maximum and minimum disparity values, the receiving device 1102 receives it along with the raw images through a receiver 1211. The raw image data goes to an image processor 1212, where it waits to be processed while a disparity mapper 1213 prepares the mapping between disparity values in the raw images and disparity values for the interlaced images to be displayed. In order to perform the mapping, the disparity map- per 1213 must know the disparity range associated with the raw images, as well as the allowable disparity range that may appear eventually in the displayed image. If the latter is constant, the disparity mapper 1213 may simply read it from a characteristics memory 1214. Otherwise the disparity mapper 1213 may calculate the disparity range allowable in the displayed image from stored information such as display size, display pixel pitch, and human ergonomic factors of the display (including default viewing distance). If the receiving device can be coupled to a variety of displays, it is advisable to arrange storing the appropriate, display- dependent values to the characteristics memory 1214 at the time when a new display is coupled. Most advantageously the receiving device 1102 comprises also control means 1215 through which a user may input his preferences about increasing or decreasing the disparity range allowable in the displayed image.
Once the disparity range allowable in the displayed image is known, the disparity mapper 1213 may use it and its knowledge about the original disparity range as- sociated with the raw images to produce the mapping function (see e.g. formulas (1 )-(4) above), which it delivers to the image processor 1212. The task of the image processor 1212 is ultimately to convert the raw images into the new image pair that will be conveyed to the display 1103. This converting may include scaling and cropping of the images as well as making the changes to disparity pixel-wise. Cropping is needed because displacing the zero disparity plane effectively moves the background of each raw image sideways so that information is lost from a vertical bar at each side edge of the image. This vertical bars must be cut out Also in many cases the proportions of the raw images are not the same as the proportions of the display, which means that either the image must be cropped to fit to the display or empty fields must be added to some sides of the image.
As we have pointed out earlier, for the present invention it is not important, what algorithm the image processor 1212 uses to identify mutually corresponding pixels in the raw images. The invention affects the changes to be made in disparity: once the image processor 1212 has found a pixel pair with some initial disparity Djn in the raw images, it relocates the pixels of that pixel pair in the new images so that their new disparity is calculated according to the mapping formula that is based on knowing the initial disparity range as well as the disparity range allowable in the displayed image.
The display 1103 is shown to comprise the interlacing means 1221 that directs the new images prepared in the image processor 1212 to the arrays of subpixels that constitute the display screen 1222.
Figs. 13 and 14 illustrate the methods performed at the transmitting and receiving ends respectively. The drawings may also be considered as illustrations of the software program products employed at the transmitting and receiving ends. At step 1301 the transmitting device records the raw images, and at step 1302 it obtains an indication of the disparity range, i.e. the maximum and minimum disparity values associated with the raw images. At step 1303 the transmitting device combines the raw images and the indication of the disparity range for transmission, and at step 1304 it transmits them to a receiving device.
The receiving device receives the raw images and the indication of the disparity range associated with the raw images at step 1401. It obtains information about the display characteristics at step 1402 and uses that information to determine the disparity range allowable in the displayed image at step 1403. At step 1404 the re- ceiving device uses the information it has about the raw image disparity range and the output image disparity range to determine the appropriate disparity mapping. At step 1405 the images are processed, which includes relocating the pixels in the horizontal direction according to the disparity mapping function determined in the previous step. Depending on what display technology and image file standard is to be used, the processing step 1405 may include format conversion operations such as those explained for example in the prior art publication US 2004/0218269 A1. For example, if the image is to be displayed on a parallax barrier display of Sharp Electronics Corporation, the left-eye and right-eye subimages are compressed in horizontal direction by a factor 2 and written side by side into a single image file that follows otherwise the JPEG (Joint Photographic Experts Group) standard but has a certain additional character string in its header and an extension .stj in its file name. At 1406 the completed new images are output on the display.
The exemplary embodiments described above should not be construed as limiting. For example, even if we have consistently considered using only two cameras, the invention does not exclude using three or more cameras. In a multi-camera arrangement the concept of disparity must be replaced with a more generally de- fined horizontal displacement of pixels depending on imaging depth, but otherwise the principle of the invention can be applied in a straightforward manner. For each camera there can be defined the characteristic horizontal displacements associated with objects at the minimum imaging depth and the maximum imaging depth on the central line of sight of the imaging arrangement. These characteristic hori- zontal displacements take the position of maximum and minimum disparities associated with a raw image pair in the description above. Also even if we have considered solely still images so far, it should be noted that the principle of the invention is also applicable to the obtaining, transmitting, processing and displaying of series of images, which as a sequence constitute a video signal. Since the initial disparity range is a property of the imaging arrangement and does not depend on image content, applying the invention to the processing of a video signal is particularly simple: the indication of the initial disparity range only needs to be transmitted once unless the characteristics of the imaging arrangement are dynamically changed during shooting, in which case the indication of the disparity range needs to be transmitted regularly so that it covers each change.
Yet another exemplary modification concerns the implementation of the cameras: instead of the (at least) two parallel cameras it is possible to equip a single camera (i.e. a single CCD array) with a branching lens system that sets up two parallel optical paths and includes a shutter system that allows taking a picture through each optical path in turn. Additionally, even if we have used the term "raw image" to generally describe an image taken along one optical path and transmitted to a re- ceiving device, this does not mean that the transmitted image should be "raw" in the sense that it would not have undergone any changes or processing after having read from the CCD array. Normal image processing can be applied, like color and brightness correction, computed corrections to remove undesired optical effects, and the like. However, care must be taken not to delete information that is important to the reconstruction of a stereographic image. For example, image compression according to the JPEG format may average out adjacent pixels, which means that if such image compression were to be applied at an inappropriate phase of handling stereographic images, it might destroy the stereographic properties altogether.
The numeric values used in the description are exemplary. For example, even if at the time of writing the description the default viewing distance of the autostereo- graphic displays of portable devices is in the order of 40-60 cm, other displays may involve shorter or longer default viewing distances. Displays for use in per- sonal computers have usually longer default viewing distances, up to 1-2 meters. Much longer default viewing distances occur in audiovisual presentation systems like 3D cinemas.

Claims

Claims
1. A device (1101 ) for obtaining and transmitting three-dimensional digital image data, comprising a stereographic imaging arrangement (501 , 502, 1201) adapted to take a first raw image (601 ) along a first optical axis and a second raw image (602) along a second optical axis, wherein:
- the first and second optical axes have a separation at the imaging arrangement (501 , 502, 1201 ), and
- the imaging arrangement (501 , 502, 1201) has a maximum imaging depth (506) and a minimum imaging depth (505); characterized in that:
- the device (1101 ) is arranged to transmit the first raw image (601 ) and the second raw image (602) to a receiving device (1102) along with an indication of a disparity range between a maximum disparity value and a minimum disparity value, - the maximum disparity value is a measure of a difference between a location in the first raw image (601) that represents the minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505), and
- the minimum disparity value is a measure of a difference between a location in the first raw image (601 ) that represents the maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imaging depth (506).
2. A device (1101 ) according to claim 1 , characterized in that in order to transmit said indication of a disparity range, the device (1101) is adapted to transmit a maximum disparity in pixels between a location in the first raw image (601 ) that represents the minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505).
3. A device (1101 ) according to claim 2, characterized in that the device (1101 ) is further adapted to transmit a minimum disparity in pixels between a location in the first raw image (601 ) that represents the maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imaging depth (506).
4. A device (1101) according to claim 1 , characterized in that in order to transmit said indication of a disparity range, the device (1101 ) is adapted to transmit an identifier of at least one of a type of the device (1101 ) or a type of the imaging arrangement (501 , 502).
5. A device (1101 ) according to claim 1 , characterized in that the device (1101 ) comprises:
- control means (1204) for changing optical characteristics of the imaging arrangement (501 , 502, 1201 ) and
- a characteristics memory (1203) for outputting said indication of a disparity range for transmission; wherein said characteristics memory (1203) is arranged to respond to changes made to the optical characteristics of the imaging arrangement (501 , 502, 1201 ) by outputting an indication of a disparity range that corresponds to the changes made.
6. A device (1101 ) according to claim 1 , characterized in that it is a portable communications device and comprises output means (1205) arranged to transmit the first raw image (601 ), the second raw image (602) and the indication of the disparity range to the receiving device (1102) through a wireless communications network.
7. An imaging module for providing an electronic device with the capability of obtaining three-dimensional digital image data, comprising: - a stereographic imaging arrangement (501 , 502, 1201) adapted to take a first raw image (601 ) along a first optical axis and a second raw image (602) along a second optical axis; characterized in that: - the imaging module is arranged to store the first raw image (601 ) and the second raw image (602) and an indication of a disparity range between a maximum disparity value and a minimum disparity value,
- the maximum disparity value is a measure of a difference between a location in the first raw image (601 ) that represents a minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505), wherein said minimum imaging depth (505) is a feature of said imaging arrangement (501 , 502, 1201) and
- the minimum disparity value is a measure of a difference between a location in the first raw image (601 ) that represents a maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imaging depth (506), wherein said maximum imaging depth (505) is a feature of said imaging arrangement (501 , 502, 1201 ).
8. An imaging module according to claim 7, characterized in that in order to store said indication of a disparity range, the imaging module is adapted to store a maximum disparity in pixels between a location in the first raw image (601) that represents the minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505).
9. An imaging module according to claim 8, characterized in that the imaging module is further adapted to store a minimum disparity in pixels between a location in the first raw image (601 ) that represents the maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imag- ing depth (506).
10. An imaging module according to claim 7, characterized in that in order to store said indication of a disparity range, the imaging module is adapted to store an identifier of at least one of a type of the imaging module or a type of the imag- ing arrangement (501 , 502).
11. An imaging module for providing an electronic device with the capability of obtaining three-dimensional digital image data, comprising: - a stereographic imaging arrangement (501 , 502, 1201) adapted to take a first raw image (601 ) along a first optical axis and a second raw image (602) along a second optical axis; characterized in that: - the imaging module has means for storing the first raw image (601 ) and the second raw image (602) and means for obtaining and storing an indication of a disparity range between a maximum disparity value and a minimum disparity value,
- the maximum disparity value is a measure of a difference between a location in the first raw image (601) that represents a minimum imaging depth (505) and a lo- cation in the second raw image (602) that represents the minimum imaging depth (505), wherein said minimum imaging depth (505) is a feature of said imaging arrangement (501 , 502, 1201 ) and
- the minimum disparity value is a measure of a difference between a location in the first raw image (601 ) that represents a maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imaging depth (506), wherein said maximum imaging depth (505) is a feature of said imaging arrangement (501 , 502, 1201).
12. An imaging module according to claim 11 , characterized in that in order to store said indication of a disparity range, the imaging module has means for storing a maximum disparity in pixels between a location in the first raw image (601 ) that represents the minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505).
13. An imaging module according to claim 12, characterized in that the imaging module further comprises means for storing a minimum disparity in pixels between a location in the first raw image (601 ) that represents the maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imaging depth (506).
14. An imaging module according to claim 11 , characterized in that in order to store said indication of a disparity range, the imaging module has means for stor- ing an identifier of at least one of a type of the imaging module or a type of the imaging arrangement (501 , 502).
15. A device (1102) for receiving and processing three-dimensional digital image data, comprising a receiver (1211 ); characterized in that:
- the device (1102) is arranged to receive from a transmitting device (1101) a first raw image (601 ) and a second raw image (602) along with an indication of a disparity range between a maximum input disparity value and a minimum input dis- parity value,
- said maximum input disparity value is a measure of a difference between a location in the first raw image (601) that represents a minimum imaging depth (505) of an imaging arrangement and a location in the second raw image (602) that represents the minimum imaging depth (505), and - the minimum disparity value is a measure of a difference between a location in the first raw image (601) that represents a maximum imaging depth (506) of the imaging arrangement and a location in the second raw image (602) that represents the maximum imaging depth (506).
16. A device (1102) according to claim 15, characterized in that as said indication of a disparity range between a maximum input disparity value and a minimum input disparity value, the device (1102) is arranged to receive a maximum disparity in pixels between a location in the first raw image (601) that represents the minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505).
17. A device (1102) according to claim 16, characterized in that the device is further arranged to receive a minimum disparity in pixels between a location in the first raw image (601 ) that represents the maximum imaging depth (506) and a lo- cation in the second raw image (602) that represents the maximum imaging depth (506).
18. A device (1102) according to claim 15, characterized in that: - as said indication of a disparity range between a maximum input disparity value and a minimum input disparity value, the device (1102) is arranged to receive an identifier of at least one of a type of the transmitting device (1101) or a type of an imaging arrangement (501 , 502) used to produce the first (601) and second (602) raw images, and
- the device (1102) is arranged to derive, from the received indication, a maximum disparity in pixels between a location in the first raw image (601 ) that represents the minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505) and a minimum disparity in pix- els between a location in the first raw image (601 ) that represents the maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imaging depth (506).
19. A device (1102) according to claim 15, characterized in that: - the device (1102) is arranged to store a maximum output disparity value and a minimum output disparity value, of which the maximum output disparity value is a measure of a difference in location between two output subpixels that represent a virtual distance (402) in front of a display screen (401 ), and the minimum output disparity value is a measure of a difference in location between two output subpix- els that represent a virtual distance (403) behind the display screen (401 ), and
- the device (1102) is arranged to perform a linear mapping from input disparities in the first (601 ) and second (602) raw images that are between the maximum input disparity value and the minimum input disparity value to output disparities that are between the maximum output disparity value and the minimum output disparity value, in order to find locations for output subpixels to be displayed on an autos- tereographic display (1103).
20. A device (1102) according to claim 19, characterized in that it comprises control means (1215) arranged to convert input information given by a user into changes of at least one of the maximum output disparity value and the minimum output disparity value.
21. A device (1102) according to claim 15, characterized in that the device (1102) comprises an autostereographic display (1103) for displaying a three- dimensional image derived from the first (601 ) and second (602) raw images and the indication of the disparity range.
22. An image transmission system comprising a transmitting device (1101) and a receiving device (1102), of which the transmitting device (1101 ) comprises a stereographic imaging arrangement (501 , 502, 1201 ) adapted to take a first raw image (601) along a first optical axis and a second raw image (602) along a sec- ond optical axis, and the receiving device (1102) comprises a receiver, characterized in that:
- the transmitting device (1101) and the receiving device (1102) are arranged to exchange the first raw image (601) and the second raw image (602) and an indication of a disparity range between a maximum disparity value and a minimum dis- parity value,
- the maximum disparity value is a measure of a difference between a location in the first raw image (601) that represents minimum imaging depth (505) of the imaging arrangement (501 , 502, 1201) and a location in the second raw image (602) that represents the minimum imaging depth (505), and - the minimum disparity value is a measure of a difference between a location in the first raw image (601 ) that represents a maximum imaging depth (506) of the imaging arrangement (501 , 502, 1201) and a location in the second raw image (602) that represents the maximum imaging depth (506).
23. An image transmission system according to claim 22, characterized in that in order to exchange said indication of a disparity range, the transmitting device (1101 ) and the receiving device (1102) are adapted to exchange a maximum disparity in pixels between a location in the first raw image (601 ) that represents the minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505).
24. An image transmission system according to claim 23, characterized in that the transmitting device (1101) and the receiving device (1102) are further adapted to exchange a minimum disparity in pixels between a location in the first raw image (601) that represents the maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imaging depth (506).
25. An imaging module according to claim 22, characterized in that in order to exchange said indication of a disparity range, the transmitting device (1101) and the receiving device (1102) are adapted to exchange an identifier of at least one of a type of the imaging module or a type of the imaging arrangement (501 , 502).
26. A method for transmitting three-dimensional digital image data, comprising:
- transmitting a first raw image (601) taken along a first optical axis and
- transmitting a second raw image (602) taken along a second optical axis; characterized in that the method comprises:
- transmitting an indication of a disparity range between a maximum disparity value and a minimum disparity value, wherein the maximum disparity value is a measure of a difference between a location in the first raw image (601) that represents a minimum imaging depth (505) of an imaging arrangement used to produce the first (601) and second (602) raw images and a location in the second raw image (602) that represents the minimum imaging depth (505), and the minimum disparity value is a measure of a difference between a location in the first raw image (601) that represents a maximum imaging depth (506) of the imaging arrangement and a location in the second raw image (602) that represents the maximum imaging depth (506).
27. A method according to claim 26, characterized in that it comprises transmitting a maximum disparity in pixels between a location in the first raw image (601) that represents the minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505), in order to transmit said indication of a disparity range.
28. A method according to claim 27, characterized in that it further comprises transmitting a minimum disparity in pixels between a location in the first raw image (601 ) that represents the maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imaging depth (506).
29. A method according to claim 26, characterized in that it comprises transmit- ting an identifier of at least one of a type of the device (1101 ) or a type of the imaging arrangement (501 , 502), in order to transmit said indication of a disparity range.
30. A method for receiving and processing three-dimensional digital image data, comprising:
- receiving a first raw image (601 ) taken along a first optical axis and
- receiving a second raw image (602) taken along a second optical axis; characterized in that the method comprises:
- receiving an indication of a disparity range between a maximum input disparity value and a minimum input disparity value, wherein the maximum input disparity value is a measure of a difference between a location in the first raw image (601) that represents a minimum imaging depth (505) of an imaging arrangement used to produce the first (601 ) and second (602) raw images and a location in the second raw image (602) that represents the minimum imaging depth (505), and the minimum input disparity value is a measure of a difference between a location in the first raw image (601) that represents a maximum imaging depth (506) of the imaging arrangement and a location in the second raw image (602) that represents the maximum imaging depth (506).
31. A method according to claim 30, characterized in that in order to find locations for output subpixels to be displayed on an autostereographic display (1103) the method comprises linearly mapping input disparities in the first (601 ) and second (602) raw images that are between the maximum input disparity value and the minimum input disparity value into output disparities that are between a maximum output disparity value and a minimum output disparity value, of which the maximum output disparity value is a measure of a difference in location between two output subpixels that represent a virtual distance (402) in front of a display screen (401), and the minimum output disparity value is a measure of a difference in loca- tion between two output subpixels that represent a virtual distance (403) behind the display screen (401).
32. A method according to claim 31 , characterized in that for displaying said output subpixels on an autostereographic display (1103) a default viewing distance of which is between 20 and 60 centimeters, the method comprises using a maximum output disparity value and a minimum output disparity value the absolute values of which are between 2 and 10 millimeters.
33. A method according to claim 31 , characterized in that it comprises changing the value of at least one of the maximum output disparity value and the minimum output disparity value in response to control inputs given by a user.
34. A software program product, comprising software means for, when executed in a computer, causing the computer to execute steps of:
- transmitting a first raw image (601) taken along a first optical axis and
- transmitting a second raw image (602) taken along a second optical axis; characterized in that the software program product comprises software means for, when executed in a computer, causing the computer to execute a step of: - transmitting an indication of a disparity range between a maximum disparity value and a minimum disparity value, wherein the maximum disparity value is a measure of a difference between a location in the first raw image (601 ) that represents a minimum imaging depth (505) of an imaging arrangement used to produce the first (601 ) and second (602) raw images and a location in the second raw im- age (602) that represents the minimum imaging depth (505), and the minimum disparity value is a measure of a difference between a location in the first raw image (601) that represents a maximum imaging depth (506) of the imaging arrangement and a location in the second raw image (602) that represents the maximum imaging depth (506).
35. A software program product according to claim 34, characterized in that the software program product comprises software means for, when executed in a computer, causing the computer to execute a step of transmitting a maximum dis- parity in pixels between a location in the first raw image (601 ) that represents the minimum imaging depth (505) and a location in the second raw image (602) that represents the minimum imaging depth (505), in order to transmit said indication of a disparity range.
36. A software program product according to claim 35, characterized in that the software program product comprises software means for, when executed in a computer, causing the computer to execute a step of transmitting a minimum disparity in pixels between a location in the first raw image (601) that represents the maximum imaging depth (506) and a location in the second raw image (602) that represents the maximum imaging depth (506).
37. A software program product according to claim 34, characterized in that the software program product comprises software means for, when executed in a computer, causing the computer to execute a step of transmitting an identifier of at least one of a type of the device (1101 ) or a type of the imaging arrangement (501 , 502), in order to transmit said indication of a disparity range.
38. A software program product, comprising software means for, when executed in a computer, causing the computer to execute steps of:
- receiving a first raw image (601) taken along a first optical axis and
- receiving a second raw image (602) taken along a second optical axis; characterized in that the software program product comprises software means for, when executed in a computer, causing the computer to execute a step of: - receiving an indication of a disparity range between a maximum input disparity value and a minimum input disparity value, wherein the maximum input disparity value is a measure of a difference between a location in the first raw image (601) that represents a minimum imaging depth (505) of an imaging arrangement used to produce the first (601 ) and second (602) raw images and a location in the sec- ond raw image (602) that represents the minimum imaging depth (505), and the minimum input disparity value is a measure of a difference between a location in the first raw image (601) that represents a maximum imaging depth (506) of the imaging arrangement and a location in the second raw image (602) that represents the maximum imaging depth (506).
39. A software program product according to claim 38, characterized in that in order to find locations for output subpixels to be displayed on an autostereographic display (1103), the software program product comprises software means for, when executed in a computer, causing the computer to execute a step of linearly mapping input disparities in the first (601 ) and second (602) raw images that are between the maximum input disparity value and the minimum input disparity value into output disparities that are between a maximum output disparity value and a minimum output disparity value, of which the maximum output disparity value is a measure of a difference in location between two output subpixels that represent a virtual distance (402) in front of a display screen (401 ), and the minimum output disparity value is a measure of a difference in location between two output subpix- els that represent a virtual distance (403) behind the display screen (401 ).
40. A software program product according to claim 39, characterized in that for displaying said output subpixels on an autostereographic display (1103) a default viewing distance of which is between 20 and 60 centimeters, the software program product comprises software means for, when executed in a computer, causing the computer to execute a step of using a maximum output disparity value and a minimum output disparity value the absolute values of which are between 2 and 10 millimeters.
41. A software program product according to claim 39, characterized in that the software program product comprises software means for, when executed in a computer, causing the computer to execute a step of changing the value of at least one of the maximum output disparity value and the minimum output disparity value in response to control inputs given by a user.
PCT/FI2005/000491 2005-11-17 2005-11-17 Method and devices for generating, transferring and processing three-dimensional image data WO2007057497A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
PCT/FI2005/000491 WO2007057497A1 (en) 2005-11-17 2005-11-17 Method and devices for generating, transferring and processing three-dimensional image data
KR1020087014578A KR100988894B1 (en) 2005-11-17 2005-11-17 Method and devices for generating, transferring and processing three-dimensional image data
JP2008540634A JP2009516447A (en) 2005-11-17 2005-11-17 Method and apparatus for generating, transferring and processing three-dimensional image data
EP05813919A EP1952199B1 (en) 2005-11-17 2005-11-17 Method and devices for generating, transferring and processing three-dimensional image data
US12/085,124 US8619121B2 (en) 2005-11-17 2005-11-17 Method and devices for generating, transferring and processing three-dimensional image data
TW095137888A TW200745733A (en) 2005-11-17 2006-10-14 Method and devices for generating, transferring and processing three-dimensional image data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/FI2005/000491 WO2007057497A1 (en) 2005-11-17 2005-11-17 Method and devices for generating, transferring and processing three-dimensional image data

Publications (1)

Publication Number Publication Date
WO2007057497A1 true WO2007057497A1 (en) 2007-05-24

Family

ID=38048318

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/FI2005/000491 WO2007057497A1 (en) 2005-11-17 2005-11-17 Method and devices for generating, transferring and processing three-dimensional image data

Country Status (6)

Country Link
US (1) US8619121B2 (en)
EP (1) EP1952199B1 (en)
JP (1) JP2009516447A (en)
KR (1) KR100988894B1 (en)
TW (1) TW200745733A (en)
WO (1) WO2007057497A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2308241A2 (en) * 2008-07-24 2011-04-13 Koninklijke Philips Electronics N.V. Versatile 3-d picture format
WO2011121397A1 (en) * 2010-04-01 2011-10-06 Nokia Corporation Method, apparatus and computer program for selecting a stereoscopic imaging viewpoint pair
EP2418857A1 (en) * 2010-08-12 2012-02-15 Thomson Licensing Stereoscopic menu control
EP2458880A2 (en) * 2010-11-26 2012-05-30 LG Electronics Inc. Mobile terminal and operation control method thereof
EP2478706A1 (en) * 2009-09-16 2012-07-25 Koninklijke Philips Electronics N.V. 3d screen size compensation
US8300086B2 (en) 2007-12-20 2012-10-30 Nokia Corporation Image processing for supporting a stereoscopic presentation
WO2012161734A1 (en) * 2011-05-26 2012-11-29 Thomson Licensing Scale-independent maps
US8587638B2 (en) 2006-09-25 2013-11-19 Nokia Corporation Supporting a 3D presentation
US10791314B2 (en) 2010-03-31 2020-09-29 Interdigital Ce Patent Holdings, Sas 3D disparity maps

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8300089B2 (en) * 2008-08-14 2012-10-30 Reald Inc. Stereoscopic depth mapping
US9251621B2 (en) * 2008-08-14 2016-02-02 Reald Inc. Point reposition depth mapping
EP2409294B1 (en) 2009-03-17 2020-05-06 Koninklijke Philips N.V. Methods of driving colour sequential displays
US8279269B2 (en) * 2009-04-29 2012-10-02 Ke-Ou Peng Mobile information kiosk with a three-dimensional imaging effect
US20110050857A1 (en) * 2009-09-03 2011-03-03 Electronics And Telecommunications Research Institute Apparatus and method for displaying 3d image in 3d image system
US9699434B2 (en) 2009-10-07 2017-07-04 Samsung Electronics Co., Ltd. Apparatus and method for adjusting depth
KR101699920B1 (en) * 2009-10-07 2017-01-25 삼성전자주식회사 Apparatus and method for controling depth
EP2510499A1 (en) 2009-12-09 2012-10-17 Thomson Licensing Method for distinguishing a 3d image from a 2d image and for identifying the presence of a 3d image format by feature correspondence determination
EP2510702A1 (en) * 2009-12-09 2012-10-17 Thomson Licensing Method and apparatus for distinguishing a 3d image from a 2d image and for identifying the presence of a 3d image format by image difference determination
JP5387399B2 (en) * 2009-12-28 2014-01-15 ソニー株式会社 Information processing apparatus and information processing method
JP2011139209A (en) * 2009-12-28 2011-07-14 Sony Corp Image processing apparatus and method
JP2011138354A (en) * 2009-12-28 2011-07-14 Sony Corp Information processing apparatus and information processing method
TWI401614B (en) * 2010-02-05 2013-07-11 Pixart Imaging Inc Data arrangement method and data arrangement device
KR101802238B1 (en) * 2010-02-23 2017-11-29 삼성전자주식회사 Apparatus and method for generating a three-dimension image data in portable terminal
US20120320153A1 (en) * 2010-02-25 2012-12-20 Jesus Barcons-Palau Disparity estimation for stereoscopic subtitling
US8830300B2 (en) * 2010-03-11 2014-09-09 Dolby Laboratories Licensing Corporation Multiscalar stereo video format conversion
JP2011223482A (en) 2010-04-14 2011-11-04 Sony Corp Image processing apparatus, image processing method, and program
US20110304618A1 (en) * 2010-06-14 2011-12-15 Qualcomm Incorporated Calculating disparity for three-dimensional images
WO2012002347A1 (en) * 2010-06-30 2012-01-05 富士フイルム株式会社 Image processing device, imaging device and image processing method
JP5238767B2 (en) * 2010-07-26 2013-07-17 株式会社東芝 Parallax image generation method and apparatus
KR101875615B1 (en) * 2010-09-01 2018-07-06 엘지전자 주식회사 Method and apparatus for processing and receiving digital broadcast signal for 3-dimensional display
US20150179218A1 (en) * 2010-09-10 2015-06-25 3DOO, Inc. Novel transcoder and 3d video editor
US9035939B2 (en) 2010-10-04 2015-05-19 Qualcomm Incorporated 3D video control system to adjust 3D video rendering based on user preferences
JP5002047B2 (en) * 2010-11-05 2012-08-15 シャープ株式会社 Stereoscopic image data playback device
US9049423B2 (en) * 2010-12-01 2015-06-02 Qualcomm Incorporated Zero disparity plane for feedback-based three-dimensional video
JP5879713B2 (en) * 2010-12-09 2016-03-08 ソニー株式会社 Image processing apparatus, image processing method, and program
KR101814798B1 (en) * 2011-01-26 2018-01-04 삼성전자주식회사 Apparatus for processing three dimension image and method for the same
JP2012156816A (en) * 2011-01-26 2012-08-16 Toshiba Corp Image display apparatus and image display method
JP2012205267A (en) * 2011-03-28 2012-10-22 Sony Corp Display control device, display control method, detection device, detection method, program, and display system
US20140111627A1 (en) * 2011-06-20 2014-04-24 Panasonic Corporation Multi-viewpoint image generation device and multi-viewpoint image generation method
US10491915B2 (en) * 2011-07-05 2019-11-26 Texas Instruments Incorporated Method, system and computer program product for encoding disparities between views of a stereoscopic image
JP5720475B2 (en) * 2011-07-29 2015-05-20 株式会社Jvcケンウッド 3D image control apparatus, 3D image control method, and 3D image control program
JP2013090019A (en) * 2011-10-14 2013-05-13 Hitachi Consumer Electronics Co Ltd Image output device and image output method
US20130321572A1 (en) * 2012-05-31 2013-12-05 Cheng-Tsai Ho Method and apparatus for referring to disparity range setting to separate at least a portion of 3d image data from auxiliary graphical data in disparity domain
KR101375648B1 (en) 2012-07-12 2014-03-18 메디칸(주) Apparatus for providing mobile terminal with 3D image and method therefor
US20190324283A1 (en) * 2018-04-19 2019-10-24 Htc Corporation Display device and display method
TWI680325B (en) * 2018-04-19 2019-12-21 宏達國際電子股份有限公司 Display device and display method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1089573A2 (en) * 1999-09-15 2001-04-04 Sharp Kabushiki Kaisha Method of producing a stereoscopic image
US20030035001A1 (en) * 2001-08-15 2003-02-20 Van Geest Bartolomeus Wilhelmus Damianus 3D video conferencing
US20040218269A1 (en) 2002-01-14 2004-11-04 Divelbiss Adam W. General purpose stereoscopic 3D format conversion system and method
US20040223640A1 (en) * 2003-05-09 2004-11-11 Bovyrin Alexander V. Stereo matching using segmentation of image columns

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5175616A (en) * 1989-08-04 1992-12-29 Her Majesty The Queen In Right Of Canada, As Represented By The Minister Of National Defence Of Canada Stereoscopic video-graphic coordinate specification system
US5179441A (en) * 1991-12-18 1993-01-12 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration Near real-time stereo vision system
JP3157384B2 (en) * 1994-06-20 2001-04-16 三洋電機株式会社 3D image device
US6163337A (en) * 1996-04-05 2000-12-19 Matsushita Electric Industrial Co., Ltd. Multi-view point image transmission method and multi-view point image display method
JP3477023B2 (en) * 1996-04-05 2003-12-10 松下電器産業株式会社 Multi-view image transmission method and multi-view image display method
JPH1127703A (en) 1997-06-30 1999-01-29 Canon Inc Display device and its control method
JP2001142166A (en) 1999-09-15 2001-05-25 Sharp Corp 3d camera
US6831677B2 (en) * 2000-02-24 2004-12-14 Yissum Research Development Company Of The Hebrew University Of Jerusalem System and method for facilitating the adjustment of disparity in a stereoscopic panoramic image pair
US7715591B2 (en) * 2002-04-24 2010-05-11 Hrl Laboratories, Llc High-performance sensor fusion architecture
US20040070667A1 (en) * 2002-10-10 2004-04-15 Fuji Photo Optical Co., Ltd. Electronic stereoscopic imaging system
AU2003221143A1 (en) * 2003-03-20 2004-10-11 Seijiro Tomita Stereoscopic video photographing/displaying system
JP4259913B2 (en) * 2003-05-08 2009-04-30 シャープ株式会社 Stereoscopic image processing apparatus, stereoscopic image processing program, and recording medium recording the program
JP2005026800A (en) 2003-06-30 2005-01-27 Konica Minolta Photo Imaging Inc Image processing method, imaging apparatus, image processing apparatus, and image recording apparatus
JP2005073049A (en) * 2003-08-26 2005-03-17 Sharp Corp Device and method for reproducing stereoscopic image
KR100682889B1 (en) * 2003-08-29 2007-02-15 삼성전자주식회사 Method and Apparatus for image-based photorealistic 3D face modeling
US20050201591A1 (en) * 2004-03-10 2005-09-15 Kiselewich Stephen J. Method and apparatus for recognizing the position of an occupant in a vehicle

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1089573A2 (en) * 1999-09-15 2001-04-04 Sharp Kabushiki Kaisha Method of producing a stereoscopic image
US20030035001A1 (en) * 2001-08-15 2003-02-20 Van Geest Bartolomeus Wilhelmus Damianus 3D video conferencing
US20040218269A1 (en) 2002-01-14 2004-11-04 Divelbiss Adam W. General purpose stereoscopic 3D format conversion system and method
US20040223640A1 (en) * 2003-05-09 2004-11-11 Bovyrin Alexander V. Stereo matching using segmentation of image columns

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP1952199A4

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8587638B2 (en) 2006-09-25 2013-11-19 Nokia Corporation Supporting a 3D presentation
US8300086B2 (en) 2007-12-20 2012-10-30 Nokia Corporation Image processing for supporting a stereoscopic presentation
EP2308241B1 (en) * 2008-07-24 2017-04-12 Koninklijke Philips N.V. Versatile 3-d picture format
US10567728B2 (en) 2008-07-24 2020-02-18 Koninklijke Philips N.V. Versatile 3-D picture format
EP2308241A2 (en) * 2008-07-24 2011-04-13 Koninklijke Philips Electronics N.V. Versatile 3-d picture format
KR101749893B1 (en) * 2008-07-24 2017-06-22 코닌클리케 필립스 엔.브이. Versatile 3-d picture format
US9432651B2 (en) 2008-07-24 2016-08-30 Koninklijke Philips N.V. Versatile 3-D picture format
EP2478706A1 (en) * 2009-09-16 2012-07-25 Koninklijke Philips Electronics N.V. 3d screen size compensation
US10791314B2 (en) 2010-03-31 2020-09-29 Interdigital Ce Patent Holdings, Sas 3D disparity maps
WO2011121397A1 (en) * 2010-04-01 2011-10-06 Nokia Corporation Method, apparatus and computer program for selecting a stereoscopic imaging viewpoint pair
EP2418857A1 (en) * 2010-08-12 2012-02-15 Thomson Licensing Stereoscopic menu control
WO2012019933A1 (en) * 2010-08-12 2012-02-16 Thomson Licensing Stereoscopic menu control
EP2458880A3 (en) * 2010-11-26 2013-03-27 LG Electronics Inc. Mobile terminal and operation control method thereof
US9088771B2 (en) 2010-11-26 2015-07-21 Lg Electronics Inc. Mobile terminal and operation control method thereof
EP2458880A2 (en) * 2010-11-26 2012-05-30 LG Electronics Inc. Mobile terminal and operation control method thereof
US9600923B2 (en) 2011-05-26 2017-03-21 Thomson Licensing Scale-independent maps
WO2012161734A1 (en) * 2011-05-26 2012-11-29 Thomson Licensing Scale-independent maps

Also Published As

Publication number Publication date
EP1952199A4 (en) 2010-02-03
JP2009516447A (en) 2009-04-16
EP1952199B1 (en) 2012-10-03
US20090295790A1 (en) 2009-12-03
KR100988894B1 (en) 2010-10-20
KR20080069693A (en) 2008-07-28
EP1952199A1 (en) 2008-08-06
US8619121B2 (en) 2013-12-31
TW200745733A (en) 2007-12-16

Similar Documents

Publication Publication Date Title
US8619121B2 (en) Method and devices for generating, transferring and processing three-dimensional image data
US20020030675A1 (en) Image display control apparatus
JP5575778B2 (en) Method for processing disparity information contained in a signal
KR101487587B1 (en) Method, apparatus and computer program for selecting a stereoscopic imaging viewpoint pair
US8629870B2 (en) Apparatus, method, and program for displaying stereoscopic images
CN101636747B (en) Two dimensional/three dimensional digital information acquisition and display device
JP4324435B2 (en) Stereoscopic video providing method and stereoscopic video display device
US20110298795A1 (en) Transferring of 3d viewer metadata
US20050207486A1 (en) Three dimensional acquisition and visualization system for personal electronic devices
EP2421268A2 (en) Method for processing images in display device outputting 3-dimensional contents and display using the same
WO2006062325A1 (en) Apparatus for correcting image distortion of stereo-camera and method thereof
US20090207237A1 (en) Method and Device for Autosterioscopic Display With Adaptation of the Optimal Viewing Distance
JP2004274642A (en) Transmission method for three dimensional video image information
CN103748872A (en) Receiver-side adjustment of stereoscopic images
TWI572899B (en) Augmented reality imaging method and system
JP4481275B2 (en) 3D video information transmission method
JP6179282B2 (en) 3D image display apparatus and 3D image display method
KR101746539B1 (en) System and device for processing stereo image, and glasses
WO2013061334A1 (en) 3d stereoscopic imaging device with auto parallax
Sheat et al. Three-dimensional imaging for video telephony
KR20110005515A (en) 3d real image displayable system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
ENP Entry into the national phase

Ref document number: 2008540634

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2005813919

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 1020087014578

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2005813919

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 12085124

Country of ref document: US