WO2007063478A2 - Stereoscopic image display method and apparatus, method for generating 3d image data from a 2d image data input and an apparatus for generating 3d image data from a 2d image data input - Google Patents

Stereoscopic image display method and apparatus, method for generating 3d image data from a 2d image data input and an apparatus for generating 3d image data from a 2d image data input Download PDF

Info

Publication number
WO2007063478A2
WO2007063478A2 PCT/IB2006/054458 IB2006054458W WO2007063478A2 WO 2007063478 A2 WO2007063478 A2 WO 2007063478A2 IB 2006054458 W IB2006054458 W IB 2006054458W WO 2007063478 A2 WO2007063478 A2 WO 2007063478A2
Authority
WO
WIPO (PCT)
Prior art keywords
regions
image
image data
depth
region
Prior art date
Application number
PCT/IB2006/054458
Other languages
French (fr)
Other versions
WO2007063478A3 (en
Inventor
Fabian E. Ernst
Bart G. B. Barenbrug
Original Assignee
Koninklijke Philips Electronics N.V.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V. filed Critical Koninklijke Philips Electronics N.V.
Priority to AT06831957T priority Critical patent/ATE542194T1/en
Priority to EP06831957A priority patent/EP1958149B1/en
Priority to JP2008542901A priority patent/JP5073670B2/en
Priority to CN2006800453217A priority patent/CN101322155B/en
Priority to KR1020087016167A priority patent/KR101370356B1/en
Priority to US12/095,183 priority patent/US8325220B2/en
Publication of WO2007063478A2 publication Critical patent/WO2007063478A2/en
Publication of WO2007063478A3 publication Critical patent/WO2007063478A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/003D [Three Dimensional] image rendering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/55Depth or shape recovery from multiple images
    • G06T7/571Depth or shape recovery from multiple images from focus
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20Image signal generators
    • H04N13/261Image signal generators with monoscopic-to-stereoscopic image conversion

Definitions

  • Stereoscopic image display method and apparatus method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input
  • the invention relates to a stereoscopic image display method wherein 2D image data are converted into 3D image data and wherein focus information is extracted from the 2D image data and used for generating the 3D image data.
  • the invention also relates to a stereoscopic image display device comprising an input for 2D image data and a converter to convert the input 2D image data into 3D image data the converter comprising a focus information extractor for extracting focus information from the 2D image data.
  • the invention also relates to an image display data conversion method wherein 2D image data are converted into 3D image data and wherein focus information is extracted from the 2D image data and used for generating the 3D image data.
  • the invention further relates to a 3D image signal.
  • the invention further relates to a computer program product to be loaded by a computer arrangement, comprising instructions to generate 3D image data on basis of a 2D image data input, the computer arrangement comprising processing means.
  • EP 1 021 049 A stereoscopic image display method and device of the kind described in the opening paragraph is disclosed in EP 1 021 049.
  • EP 1 021 049 a 3 dimensional video image is generated from a 2 dimensional video input.
  • the known device and method uses a foreground/background discriminating circuit which discriminates on the basis of focus information which is extracted from the 2 dimensional video input.
  • a parallax control signal is outputted on the basis of edge detecting wherein sharp edges are placed in the foreground of the 3D image.
  • the method in accordance with the invention is characterized in that on basis of focus characteristics the image is divided into two or more regions, it is determined to which region of the image an edge separating two regions belongs and a depth order is established between the regions following the rule that a region comprising an edge is closer to the viewer than an adjacent region and to the regions 3-D depth information is assigned in accordance with the established depth order of the regions.
  • the 3D sensation is then confusing at best and often lost, especially since the depth cue given by the known method is usually limited. It is assumed that the human brain is capable of reconstructing a stereoscopic sense from even an imperfect depth cue.
  • the depth cues in the prior art method and device are, however, sometimes at odds with each other, and may even change from scene to scene, i.e. in one scenes the depth cues may be correct, followed by a sudden shift to conflicting depth cues wherein a foreground figure hides behind a background tree.
  • the depth sensation is then lost or at least a very annoying conflict between depth cues is perceived by the viewer.
  • the method in accordance with the invention solves or at least reduces the problem.
  • the image is divided in regions on the basis of focus information, for instance the blur radius.
  • the pixels or blocks of the image are clustered into a number of regions having similar focus characteristic.
  • the focus information e.g. the average blur per block
  • the image is divided into two or more regions wherein each region has averaged focusing characteristics. It is determined to which region an edge separating two regions belongs. This may e.g. be done by comparing the sharpness (blur) of a detected edge to the average blur of the regions bordering either side of the edge.
  • a blurred edge belongs to a bordering region having a high blur, whereas a sharp edge to a region having a low blur.
  • a depth ordering is performed on the regions, wherein the rule is followed that a region comprising an edge is closer to the viewer than the adjacent region.
  • 3D information is assigned to the regions in accordance with the depth ordering.
  • the various regions of the image thus form depth layers.
  • Dividing the image into regions is performed by means of clustering pixels or blocks into regions. Although this clustering could be done on a pixel per pixel basis, it is found that more robust results are obtained when, prior to division of the image into regions, a focusing characteristic is determined per block of pixels and the blocks are clustered into regions.
  • Blocks are small parts of the image having nxm pixels, usually mxm, where n and m are typically 2, 4, 8 or 16. The advantage of the method in comparison to the known method is clear for e.g.
  • the method in accordance with the invention does not provide for conflicting depth cues.
  • the image is divided into regions and comprises at least two regions, for instance an in focus region comprising the person and an out-of-focus region comprising the flower arrangement.
  • the edges separating the regions comprising the flower arrangement and the region comprising the person are formed by the blurred edges of the flower arrangement.
  • the region of the image comprising the flower arrangement is placed on the foreground, in accordance with the rule that a region comprising the edge separating two regions is closer to a viewer than the other region.
  • Out-of-focus foreground regions which are bounded by blurred edges, are placed in front of in- focus background regions.
  • the correct parallax is assigned to both regions.
  • the correct 3D information is provided for the three regions.
  • the 3D depth information is assigned in dependence on the focusing characteristics of the regions.
  • the average focusing characteristics provides a clue as to the difference in depth between the regions. This can be used to improve the 3D effect.
  • the number of regions is three or two. Clustering the pixels or blocks of the image into two or three regions has proven to give good results, while requiring only limited calculating power. Almost all images have an in-focus part, and an out-of-focus part, the out-of-focus part sometimes being foreground, sometimes being background, so that two regions often suffice. Occasionally the out-of-focus part comprises a fore-ground and background part, for instance a foreground tree and a background forest and an intermediate in-focus region, in which case three regions usually suffice.
  • a statistical distribution is made of focusing characteristics of pixels or blocks of the image and the number of regions is determined in dependence on the statistical distribution.
  • the image display device in accordance with the invention comprises means for performing the method steps in accordance with the invention.
  • the invention is also embodied in a transmitter comprising means for performing the method steps in accordance with the invention.
  • Fig. 1 illustrates the thin lens model
  • Figs. 2A-2C illustrate a possible method for determining blur radius
  • Fig. 3 illustrates the relation between blur radius and focal plane
  • Fig. 4 illustrates a statistical distribution of blur radii
  • Figs. 5A and 5B illustrate a method of determining regions
  • Fig. 6 illustrates a method for deciding to which regions an edge belongs
  • Fig. 7 illustrates a method in accordance with the invention
  • Fig. 8 illustrates a display device in accordance with the invention
  • Fig. 9 illustrates a transmitter in accordance with invention.
  • the blur behavior is according to the thin lens formula:
  • uo denotes the distance for which points are in focus.
  • the parameter s is the image plane to lens distance and the parameter k is a constant determined by the characteristics of the lens system.
  • the parameters f, s and k are camera parameters, which can be determined from camera calibration.
  • estimating the distance u of an object involves determining the camera parameters and estimating the blur radius ⁇ .
  • disparity inverse depth
  • depth is a more relevant quantity than depth itself, as for instance the parallax for rendered views is linear in disparity.
  • the disparity difference to the focal plane is proportional to the blur radius.
  • the amount of disparity for rendered views can usually be changed to accommodate for the preference of the user and/or the possibilities of the display, accurate determination of the camera-related constant k/s is not necessary, all that is needed is determination of the blur radius ⁇ , i.e. of a focus characteristic.
  • the blur radius is taken for the focus characteristic for the simple reason that there is a simple relation between distance and blur radius.
  • determining the blur radius as the focus characteristic is preferred, due to the simple relation between blur radius and distance, other measures of blurriness could also be determined within the concept of the invention.
  • Figures 2A-C schematically illustrate a possible method for determining blur radius ⁇ .
  • a blurred edge with a blur radius ⁇ is schematically shown.
  • the horizontal axis denotes position, the vertical axis luminance.
  • a filtering function is shown which is the second derivative of a Gaussian filter with width s. Convolution of Figure 2A and Figure 2B provides for a function having two peaks. The distance dh between the peaks can be measured reasonably adequate and the relation between the blur radius ⁇ , filter width s and peak distance dh is as follows:
  • This exemplary algorithm is robust and the results obtained for various types of content were good. Taking various filter widths s for each pixel for each filter width a value for the blur radius ⁇ is found. Taking an average or median value of ⁇ per pixel and then determining an average or median value for ⁇ over a block wherein more pronounced edges, which have a larger height in part Figure 2C, are given a larger weight proved to give robust results. A reasonably good distinction in determined values for ⁇ between the in- focus and out-focus regions is found.
  • a first step the pixels or blocks of the image are clustered based on their focusing characteristic, thereby forming regions within the image.
  • pixels could be clustered.
  • the spread in values of ⁇ for pixels is even larger than for blocks.
  • a focusing characteristic in the examples given an average or median value for the blur radius ⁇ , is assigned on a block basis and the blocks are clustered into regions on the basis of the block values for ⁇ . To each region an average or medium blur radius is assigned. Clustering may be done in various manners.
  • a simple iterative clustering algorithm may be used which always divides the image into two or more clusters starting from a heuristic initial clustering. The decision whether we have one, two or more clusters is then based on the similarity of the characteristics of the clusters.
  • Figures 5 A and 5B illustrate such a method wherein it is assumed that there are two large regions, one in focus and more or less in the middle, surrounded by an out-of- focus region.
  • the initial clustering consists of assigning the blocks on the left, top and right border (say 1/4 of the image) to the "background' cluster C 2 , and the other pixels to the "foreground' cluster Ci (see Figure 5A). This choice originates from the selection of blocks for background motion model estimation.
  • the object of interest (usually the foreground) is somewhere in the center of the image, and the borders of the image do not contain objects of interest.
  • background motion model estimation it is assumed that the object of interest in the center is the foreground. It is, however, not necessary to make such an assumption in the clustering stage. It has been observed, however, that most of the time the center cluster is in focus.
  • the initial blur radius value ⁇ i respectively ⁇ 2 of a cluster is the median of the blur radii ⁇ of all those feature points.
  • Step 1 Reassign the blocks. A sweep is made over the image, and each block B on a cluster boundary is assigned to the cluster to which it has the smallest deviation to its mean focus estimate:
  • Step 2 Update the values for ⁇ i and ⁇ 2 .
  • Blocks have been reassigned to clusters Ci and C 2 so new average or median cluster blur radii ⁇ i and ⁇ 2 are computed for each of the two (or more if there are more) clusters.
  • Step 3 Iterate. A new sweep is made, see step 1.
  • Figure 5B shows the result of such iteration: two regions are formed, a foreground region Ci with a median blur radius ⁇ i, and background region C 2 with a median blur radius ⁇ 2 .
  • this method provides for two regions, an out-of- focus region and in a in- focus regions. These regions need not be connected, e.g. the in focus regions may comprise two separate sub regions, as may the out-of- focus region.
  • the statistics shows evidence of three regions, i.e. three peaks in the ⁇ distribution, it is possible to start with three regions.
  • An initial clustering may also be found by determining the peaks in the ⁇ diagram, and simply assigning each block to the peak with the best matching ⁇ .
  • FIG. 6 illustrates schematically a method for distinguishing from this principle which edge belongs to which regions.
  • Figure 6 shows along the horizontal axis a position parameter, such as the x, or y coordinate, or a coordinate perpendicular to a transition between two regions. Vertically the blur radius is shown.
  • FIG 6 schematically the transition between an in- focus region with a low value for blur radius ⁇ and an out-of- focus region with a high value for ⁇ is shown.
  • the width W illustrates schematically the blurriness of the edge.
  • An out-of- focus edge will have a larger width W than an in-focus edge.
  • this is shown in the top part of Figure 6, having a small W and thus a sharp transition, and the bottom part, showing a large width W and thus a blurred transition.
  • the edge separating the regions Ci and C 2 belongs to the region Ci with low blur radius ⁇ i.
  • region Ci is foreground, which is indicated in the figure by Ci(F).
  • Region C 2 is background indicated by C 2 (B).
  • the width W is large.
  • the edge separating the regions Ci and C 2 belongs to the region C 2 with high blur radius ⁇ 2 .
  • region C 2 is foreground, which is indicated in Figure 6 by C 2 (F).
  • Region Ci is background indicated by Ci(B).
  • a different method is for instance to segment the image, i.e. the find luminance or color edges in the image near the transitions between the regions and compare them to the edges between the regions as follows from the preceding clustering step.
  • luminance segmentation different methods may be used to find which edge belongs to which regions.
  • One way is to look at the orientation of luminance edges in the various regions near the transition between the regions.
  • the luminance edge corresponding to the transition between regions is determined solely by the foreground image and the edge or edges belonging to the foreground image often follow the transition, i.e. they are parallel to the transition.
  • Luminance edges in the background tend not to have a relation to the transition.
  • Yet another method is the following: the image is segmented based on focus, as explained above, and luminance edges are found near the transitions between the regions.
  • clustering of blocks tends on average to extend the region to which an edge belongs to slightly beyond the luminance edge because the whole edge or at least a major part of the edge is assigned the blur radius of the edge which belongs to the foreground object. There is thus a slight bias in clustering which extends a clustered region to include the edge belonging to said cluster. This bias does not occur for determination of edges when solely differences in luminance are concerned because in luminance segmentation the transition between the regions is drawn in the middle of the edge separating the regions.
  • Depth ordering can be done simply on the basis of what region is foreground and what region is background, i.e. a fixed difference in parallax can be used to distinguishing the foreground and background regions, or foremost, intermediate range and background regions, independent of the actual values C 1 .
  • the blur radius estimates for the regions are converted into a depth or inverse depth value.
  • the disparity of blurred objects is the disparity of in focus objects, i.e. the region with lowest ⁇ , plus a constant time the difference in blur radius between foreground and background.
  • is the difference in ⁇
  • K is a constant and uo is the focus plane. If ⁇ is very small ⁇ equals ⁇ of the out-of- focus plane.
  • the cluster with the lowest blur value is assigned the depth uo; all other clusters are assigned a depth value based on their depth ordering with respect to the cluster with the lowest radius value.
  • K is positive if the foreground in is focus and negative of the out-of-focus region is foreground.
  • step 6 From an input 2D signal, image blocks are formed in step 2, block focus characteristics, for instance the block blur radius ⁇ are determined in step 3, these blocks are clustered into two or more clusters in step 4.
  • step 6 the relation between the edge and the region is determined. This may be done directly from the focus characteristics, see Figure 6, or in parallel the image may be luminance segmented and image edge obtained by luminance segmentation (step 5) are compared in step 6 to edge determined by clustering wherein comparing the results leads to the determination of which edge belong to which regions and thereby which regions are positioned in front of which regions, i.e. the depth ordering of regions (step 7).
  • the depth is determined from the focus characteristics (step 8) in accordance with a preferred embodiment, which in the examples given is the blur radius, the resulting 3D output signal is provided (step 9).
  • Figure 8 shows an image device in accordance with the invention.
  • the image device has means for performing all the steps of the method, i.e. an input 1, for receiving a 2D input signal, a former 2 for formation image blocks, a computer 3 for computing block focus characteristics, a clusterer 4 for clustering image regions on basis of focus, an image edge detector 5, an edge-region relationship determinator 6, a depth orderer 7 and a depth information assigner 8. It furthermore comprises an output 9 for outputting a 3D signal to a 3D display screen 10.
  • a display device may for instance an autostereoscopic display device.
  • Figure 9 shows a transmitter in accordance with the invention. The difference with Figure 8 is that the display screen itself is not an integral part of the device.
  • Such a transmitter may for instance read DVD's having a 2D signal and converting the 2D signal into a 3D signal for use in 3D display device which may be separately sold. It may also be a device which makes a DVD having 3D signals from a DVD having a 2D signal, the 3D signals may thus be provided to a DVD burner, or for instance sent to another location.
  • 3D image signals comprising information on the division of the image in regions and the depth order of the regions and, in preferred embodiments, also the focus characteristic of the various regions also form embodiments of the invention.
  • the information may be given in a header in the signal, which header specifies which blocks belongs to the regions, or the dividing lines between the regions, the order of the regions and preferably also the focusing characteristics of the regions, preferably the region blur radii.
  • a 3D signal made with the prior art methods and devices does not comprise such information.
  • a 3D signal in accordance with the invention could for instance be generated as follows: A customer has a 3D display device but a normal 2D digital camera. A user sends a 2D home video or digital image to an internet site. The original 2D signal is converted into a 3D signal, which is sent back to the user which can display the video or image on his 3D display.
  • 2D image data are converted into 3D image data.
  • the image is divided, on the basis of focusing characteristics, into two or more regions, it is determined to which region an edge separating two regions belongs.
  • the regions are depth ordered in accordance with the rule that the rule that a region comprising an edge is closer to the viewer than an adjacent region and to the regions 3-D depth information is assigned in accordance with the established depth order of the regions.
  • a depth is assigned in dependence on an average or median focusing characteristic of the region.
  • the invention is also embodied in any computer program product for a method or device in accordance with the invention.
  • computer program product should be understood any physical realization of a collection of commands enabling a processor - generic or special purpose-, after a series of loading steps (which may include intermediate conversion steps, like translation to an intermediate language, and a final processor language) to get the commands into the processor, to execute any of the characteristic functions of an invention.
  • the computer program product may be realized as data on a carrier such as e.g. a disk or tape, data present in a memory, data traveling over a network connection -wired or wireless- , or program code on paper.
  • characteristic data required for the program may also be embodied as a computer program product.
  • the word "comprising” does not exclude the presence of other elements or steps than those listed in a claim.
  • the invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a device claim enumerating several means, several of these means can be embodied by one and the same item of hardware.
  • the invention may be implemented by any combination of features of various different preferred embodiments as described above. In particular it is mentioned that any embodiment shown or claimed in relation to an encoding method or encoder has, unless otherwise indicated or impossible, a corresponding embodiment for a decoding method or decoder and such decoding methods and decoder are embodiments of the invention and claimed herewith.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computer Graphics (AREA)
  • Image Analysis (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Image Processing (AREA)

Abstract

2D image data are converted into 3D image data. The image is divided, on the basis of focusing characteristics, into two or more regions, it is determined to which region an edge separating two regions belongs. The regions are depth ordered in accordance with the rule that the rule that a region comprising an edge is closer to the viewer than an adjacent region and to the regions 3-D depth information is assigned in accordance with the established depth order of the regions. Preferably to each of the regions a depth is assigned in dependence on an average or median focusing characteristic of the region.

Description

Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input
The invention relates to a stereoscopic image display method wherein 2D image data are converted into 3D image data and wherein focus information is extracted from the 2D image data and used for generating the 3D image data.
The invention also relates to a stereoscopic image display device comprising an input for 2D image data and a converter to convert the input 2D image data into 3D image data the converter comprising a focus information extractor for extracting focus information from the 2D image data.
The invention also relates to an image display data conversion method wherein 2D image data are converted into 3D image data and wherein focus information is extracted from the 2D image data and used for generating the 3D image data.
The invention further relates to a 3D image signal.
The invention further relates to a computer program product to be loaded by a computer arrangement, comprising instructions to generate 3D image data on basis of a 2D image data input, the computer arrangement comprising processing means.
A stereoscopic image display method and device of the kind described in the opening paragraph is disclosed in EP 1 021 049. In EP 1 021 049 a 3 dimensional video image is generated from a 2 dimensional video input. The known device and method uses a foreground/background discriminating circuit which discriminates on the basis of focus information which is extracted from the 2 dimensional video input. A parallax control signal is outputted on the basis of edge detecting wherein sharp edges are placed in the foreground of the 3D image.
Although the known method and device provide for a relatively simple device and method, it has been found that the rendered 3D images occasionally are confusing images wherein depth of vision, i.e. the 3D effect, is difficult to distinguish. It is an object of the invention to improve 3D image rendering based on a 2D image input.
To this end the method in accordance with the invention is characterized in that on basis of focus characteristics the image is divided into two or more regions, it is determined to which region of the image an edge separating two regions belongs and a depth order is established between the regions following the rule that a region comprising an edge is closer to the viewer than an adjacent region and to the regions 3-D depth information is assigned in accordance with the established depth order of the regions.
In the prior art method of EP 1 021 049 edge detection is also performed. Sharp edges are placed in the foreground. This scheme, however, sometimes provides for confusing results since parts of the images that are in reality in the foreground are given background parallax and vice versa in case the background happened to be in focus and the foreground out-of- focus. This provides for confusing images wherein the parallax information provides the viewer the cue that certain parts of the 3D image are on the foreground and others parts in the background, but the actual content of the image provides the viewer with a completely opposite cue, i.e. what is foreground according the parallax cue is background according to the actual content.
The 3D sensation is then confusing at best and often lost, especially since the depth cue given by the known method is usually limited. It is assumed that the human brain is capable of reconstructing a stereoscopic sense from even an imperfect depth cue. The depth cues in the prior art method and device are, however, sometimes at odds with each other, and may even change from scene to scene, i.e. in one scenes the depth cues may be correct, followed by a sudden shift to conflicting depth cues wherein a foreground figure hides behind a background tree. The depth sensation is then lost or at least a very annoying conflict between depth cues is perceived by the viewer.
The method in accordance with the invention solves or at least reduces the problem. The image is divided in regions on the basis of focus information, for instance the blur radius. The pixels or blocks of the image are clustered into a number of regions having similar focus characteristic. Based on the focus information, e.g. the average blur per block, the image is divided into two or more regions wherein each region has averaged focusing characteristics. It is determined to which region an edge separating two regions belongs. This may e.g. be done by comparing the sharpness (blur) of a detected edge to the average blur of the regions bordering either side of the edge. A blurred edge belongs to a bordering region having a high blur, whereas a sharp edge to a region having a low blur. A depth ordering is performed on the regions, wherein the rule is followed that a region comprising an edge is closer to the viewer than the adjacent region. 3D information is assigned to the regions in accordance with the depth ordering. The various regions of the image thus form depth layers. Dividing the image into regions is performed by means of clustering pixels or blocks into regions. Although this clustering could be done on a pixel per pixel basis, it is found that more robust results are obtained when, prior to division of the image into regions, a focusing characteristic is determined per block of pixels and the blocks are clustered into regions. Blocks are small parts of the image having nxm pixels, usually mxm, where n and m are typically 2, 4, 8 or 16. The advantage of the method in comparison to the known method is clear for e.g. an image in which a person is seated partially behind a flower arrangement. The person is in focus; the flower arrangement is not. Using the known method the person being in focus and thus having sharp image edges, is given a parallax so that it seems in the foreground and image portion depicting the flower arrangement, having a blurred edge, is given a parallax corresponding with background. This conflicts with the true situation since the person is partially behind the flower arrangement and not the other way around. The known method and device thus confronts the viewer with two conflicting, in fact irreconcilable, depth cues. The parallax depth cue, putting the person on the foreground in front of the flower arrangement, contradicts the image information depth cue, which shows the person seated behind the flower arrangement.
The method in accordance with the invention does not provide for conflicting depth cues. The image is divided into regions and comprises at least two regions, for instance an in focus region comprising the person and an out-of-focus region comprising the flower arrangement. The edges separating the regions comprising the flower arrangement and the region comprising the person are formed by the blurred edges of the flower arrangement.
Thus the region of the image comprising the flower arrangement is placed on the foreground, in accordance with the rule that a region comprising the edge separating two regions is closer to a viewer than the other region. Out-of-focus foreground regions, which are bounded by blurred edges, are placed in front of in- focus background regions. Thus, if there are two regions, an out-of-focus foreground flower arrangement in front of an in- focus person, the correct parallax is assigned to both regions. If there are three regions, an out-of-focus foreground flower arrangement, an in-focus person and an out-of-focus background, the correct 3D information is provided for the three regions. It is emphasized that the results of the method in accordance with the invention provide, in this example, results that are against the very core of the teaching of EP 0 121 049, which dictates that depth ordering is done by placing sharp edges on the foreground.
Preferably the 3D depth information is assigned in dependence on the focusing characteristics of the regions. The average focusing characteristics provides a clue as to the difference in depth between the regions. This can be used to improve the 3D effect.
In preferred embodiments the number of regions is three or two. Clustering the pixels or blocks of the image into two or three regions has proven to give good results, while requiring only limited calculating power. Almost all images have an in-focus part, and an out-of-focus part, the out-of-focus part sometimes being foreground, sometimes being background, so that two regions often suffice. Occasionally the out-of-focus part comprises a fore-ground and background part, for instance a foreground tree and a background forest and an intermediate in-focus region, in which case three regions usually suffice.
In a preferred embodiment a statistical distribution is made of focusing characteristics of pixels or blocks of the image and the number of regions is determined in dependence on the statistical distribution.
It is found that the focusing characteristics, such a blur radius, often cluster around a limited number of peaks, one corresponding to a small blur radius, i.e. in focus or nearly in focus, and another or others at larger blur radii, corresponding to out of focus parts of the image. Using these statistical data allows for a quick determination of the number of regions in which the region can be divided.
The image display device in accordance with the invention comprises means for performing the method steps in accordance with the invention.
The invention is also embodied in a transmitter comprising means for performing the method steps in accordance with the invention. These and other objects of the invention will be apparent from and elucidated with reference to the embodiments described hereinafter.
In the drawings: Fig. 1 illustrates the thin lens model;
Figs. 2A-2C illustrate a possible method for determining blur radius; Fig. 3 illustrates the relation between blur radius and focal plane; Fig. 4 illustrates a statistical distribution of blur radii; Figs. 5A and 5B illustrate a method of determining regions; Fig. 6 illustrates a method for deciding to which regions an edge belongs; Fig. 7 illustrates a method in accordance with the invention; Fig. 8 illustrates a display device in accordance with the invention; and Fig. 9 illustrates a transmitter in accordance with invention.
The figures are not drawn to scale. Generally, identical components are denoted by the same reference numerals in the figures.
In a simple optical system, like a convex thin lens, objects at a particular distance from the lens are clearly depicted (objects are in focus) on the image plane, while objects at other distances are mapped blurred (objects are defocused) proportional to their distance from the plane of focus. The latter situation for a point source is depicted in Figure 1.
The blur behavior is according to the thin lens formula:
Figure imgf000006_0001
in which f represents the focal length of the lens, u is the object distance and v is the image distance. From the geometric relations in Figure 1 and the lens formula, the formula for the distance u can be derived:
u = A_- ifu>u0 (2) s - f - kef
u = — fs — — if u<uo (3) s - f + kqf
wherein uo denotes the distance for which points are in focus. The parameter s is the image plane to lens distance and the parameter k is a constant determined by the characteristics of the lens system. The parameters f, s and k are camera parameters, which can be determined from camera calibration. Thus estimating the distance u of an object involves determining the camera parameters and estimating the blur radius σ. Thus there is a relation between the blurriness of an image, i.e. a focus characteristic and the distance. For 2D-to-3D conversion, disparity (inverse depth) is a more relevant quantity than depth itself, as for instance the parallax for rendered views is linear in disparity. Using the above expression it is possible to find a relation between disparity differences between points in focus and out of focus and the blur radius σ.
1 1 kσ_
(4) s
In other words, the disparity difference to the focal plane is proportional to the blur radius. Moreover, as the amount of disparity for rendered views can usually be changed to accommodate for the preference of the user and/or the possibilities of the display, accurate determination of the camera-related constant k/s is not necessary, all that is needed is determination of the blur radius σ, i.e. of a focus characteristic. In the following description, the blur radius is taken for the focus characteristic for the simple reason that there is a simple relation between distance and blur radius. However, although determining the blur radius as the focus characteristic is preferred, due to the simple relation between blur radius and distance, other measures of blurriness could also be determined within the concept of the invention.
Figures 2A-C schematically illustrate a possible method for determining blur radius σ. In Figure 2A a blurred edge with a blur radius σ is schematically shown. The horizontal axis denotes position, the vertical axis luminance. In Figure 2B a filtering function is shown which is the second derivative of a Gaussian filter with width s. Convolution of Figure 2A and Figure 2B provides for a function having two peaks. The distance dh between the peaks can be measured reasonably adequate and the relation between the blur radius σ, filter width s and peak distance dh is as follows:
σ2 = (dh/2)2 - s2 (5)
This exemplary algorithm is robust and the results obtained for various types of content were good. Taking various filter widths s for each pixel for each filter width a value for the blur radius σ is found. Taking an average or median value of σ per pixel and then determining an average or median value for σ over a block wherein more pronounced edges, which have a larger height in part Figure 2C, are given a larger weight proved to give robust results. A reasonably good distinction in determined values for σ between the in- focus and out-focus regions is found.
The relation between u and the blur radius σ is schematically shown in Figure 3 and follows from equation (4). If the parameters k and s are known from calibration, then a true estimate of the absolute distance to the focal plane can be made, once the blur radius σ is known. Since this does not reveal if a blurred object is in front of the focal plane or behind it, also at least two images for different focal distances need to be known for true depth estimation from the blur radius σ. However, neither of these requirements is usually known or obtainable for arbitrary externally given image data such as e.g. video. A good distinction can nevertheless be made between out-of- focus regions of the image and in focus regions of the image and, if there are more regions, between the various regions.
Since the formula between the disparity difference and the blur radius gives a relation between the absolute value of the disparity difference and the blur radius, the equation has two separate solutions. Hence determination of two different values of the blur radius σ does not enable depth ordering, as the same values of σ may result from an object closer to or further away. In Figure 4 this is schematically shown for two different values for the blur radius σ (σl and σ2). In principle there are four different possible combinations of image planes possible. Figure 4 shows a typical distribution of blur radii for blocks within an image wherein the horizontal axis denotes the percentage of blocks with a certain blur radius. Clearly two modes centered on peaks with values of σi and σ2 can be distinguished corresponding in this example with in- focus and out-of- focus parts of the image. Such a distribution alone, however, does not enable to provide an accurate depth ordering for two reasons. First of all, as explained in relation to Figure 3, there is ambiguity as to the actual relative position of image planes corresponding with the peaks in Figure 3 since more than one solution is possible. Secondly the peaks in the distribution in σ are quite broad. This indicates that the actual blur values have a high numerical uncertainty and may not be suitable for deriving depth ordering information, as blur radius difference (the spread in the peaks in Figure 3) in each mode (e.g. the out-of-focus region) may exceed blur radius differences between modes. Hence only using actual numerical values of the blur radius to decide on depth ordering and depth ordering of each block introduces a large amount of noise. To nevertheless obtain reliable depth ordering the method and device in accordance with the invention executes two steps.
In a first step the pixels or blocks of the image are clustered based on their focusing characteristic, thereby forming regions within the image. Within the broadest scope of the invention, also pixels could be clustered. However, the spread in values of σ for pixels is even larger than for blocks. More robust results are obtained when, prior to clustering a focusing characteristic, in the examples given an average or median value for the blur radius σ, is assigned on a block basis and the blocks are clustered into regions on the basis of the block values for σ. To each region an average or medium blur radius is assigned. Clustering may be done in various manners.
A simple iterative clustering algorithm may be used which always divides the image into two or more clusters starting from a heuristic initial clustering. The decision whether we have one, two or more clusters is then based on the similarity of the characteristics of the clusters. Figures 5 A and 5B illustrate such a method wherein it is assumed that there are two large regions, one in focus and more or less in the middle, surrounded by an out-of- focus region. The initial clustering consists of assigning the blocks on the left, top and right border (say 1/4 of the image) to the "background' cluster C2, and the other pixels to the "foreground' cluster Ci (see Figure 5A). This choice originates from the selection of blocks for background motion model estimation. Heuristically, one expects that the object of interest (usually the foreground) is somewhere in the center of the image, and the borders of the image do not contain objects of interest. For background motion model estimation, it is assumed that the object of interest in the center is the foreground. It is, however, not necessary to make such an assumption in the clustering stage. It has been observed, however, that most of the time the center cluster is in focus.
As the initial clustering is rather coarse and based on heuristics, a robust method to arrive at initial estimates of the blur radii for each cluster is as follows.
A number of feature points (in our case 28), regularly distributed inside the clusters is selected. The initial blur radius value σi respectively σ2 of a cluster is the median of the blur radii σ of all those feature points.
Then an iterative procedure is carried out to refine this cluster:
Step 1 : Reassign the blocks. A sweep is made over the image, and each block B on a cluster boundary is assigned to the cluster to which it has the smallest deviation to its mean focus estimate:
Figure imgf000010_0001
B→C2 else
Step 2: Update the values for σi and σ2 . Blocks have been reassigned to clusters Ci and C2 so new average or median cluster blur radii σi and σ2 are computed for each of the two (or more if there are more) clusters.
Step 3: Iterate. A new sweep is made, see step 1.
This process converges after a few (typically 4) iterations. Figure 5B shows the result of such iteration: two regions are formed, a foreground region Ci with a median blur radius σi, and background region C2 with a median blur radius σ2.
Typically this method provides for two regions, an out-of- focus region and in a in- focus regions. These regions need not be connected, e.g. the in focus regions may comprise two separate sub regions, as may the out-of- focus region. When the statistics shows evidence of three regions, i.e. three peaks in the σ distribution, it is possible to start with three regions. An initial clustering may also be found by determining the peaks in the σ diagram, and simply assigning each block to the peak with the best matching σ.
Once the image is divided into regions C1, C2, C3 ete, it is possible to assign a region blur radius C1 to each of the regions. The next step in the method and device in accordance with the invention is that the mutual position, i.e. which region is in front of which region, of the regions is determined. A decision on depth ordering has to be made. In order to do so use is made of the principle that an edge belongs to the foremost object. Figure 6 illustrates schematically a method for distinguishing from this principle which edge belongs to which regions. Figure 6 shows along the horizontal axis a position parameter, such as the x, or y coordinate, or a coordinate perpendicular to a transition between two regions. Vertically the blur radius is shown. In Figure 6 schematically the transition between an in- focus region with a low value for blur radius σ and an out-of- focus region with a high value for σ is shown. The width W illustrates schematically the blurriness of the edge. An out-of- focus edge will have a larger width W than an in-focus edge. Schematically this is shown in the top part of Figure 6, having a small W and thus a sharp transition, and the bottom part, showing a large width W and thus a blurred transition. Thus in the top part the edge separating the regions Ci and C2 belongs to the region Ci with low blur radius σi. Thus region Ci is foreground, which is indicated in the figure by Ci(F). Region C2 is background indicated by C2(B). In the bottom part the width W is large. The edge separating the regions Ci and C2 belongs to the region C2 with high blur radius σ2. Thus "blurred" region C2 is foreground, which is indicated in Figure 6 by C2(F). Region Ci is background indicated by Ci(B). By taking various measurement points along lines perpendicular to the transition lines between the regions, and taking an average or deciding for each measure point to which the region the edge seems to belong and then voting between the different measurements, it is easily found whether the edge belongs to the an in-focus region, in which case the in- focus region lies in front of the out-of- focus region, or to an in-focus region, in which case the in- focus region lies in front of the out-of- focus region. To put it differently, the width W is only dependent on the σ of one the regions, not or at least hardly on the σ of the other region. This characteristic can be used to determine to which regions an edge separating two regions belong.
This is one example of a method for establishing to which region an edge belongs. A different method is for instance to segment the image, i.e. the find luminance or color edges in the image near the transitions between the regions and compare them to the edges between the regions as follows from the preceding clustering step.
Using luminance segmentation, different methods may be used to find which edge belongs to which regions. One way is to look at the orientation of luminance edges in the various regions near the transition between the regions. The luminance edge corresponding to the transition between regions is determined solely by the foreground image and the edge or edges belonging to the foreground image often follow the transition, i.e. they are parallel to the transition. Luminance edges in the background tend not to have a relation to the transition. Yet another method is the following: the image is segmented based on focus, as explained above, and luminance edges are found near the transitions between the regions. By determining the edge between regions in two different ways, by luminance segmentation and by clustering on the basis of blur radius it may be established to which region an edge belongs. Ideally the two determinations would completely coincide, but this is not the case. It has been found that clustering of blocks tends on average to extend the region to which an edge belongs to slightly beyond the luminance edge because the whole edge or at least a major part of the edge is assigned the blur radius of the edge which belongs to the foreground object. There is thus a slight bias in clustering which extends a clustered region to include the edge belonging to said cluster. This bias does not occur for determination of edges when solely differences in luminance are concerned because in luminance segmentation the transition between the regions is drawn in the middle of the edge separating the regions. There is thus a small difference in the determined position of the edge, since the clustering method based on blur radius determination as described above tends to overextend the border of the clustered foreground region to include into a region the edge belonging to said region, whereas such tendency to overextend does not exist for edges solely determined on the basis of luminance segmentation. To put it differently: luminance segmentation puts the edge exactly in the middle of the luminance transition, whereas clustering segmentation overestimates the size of the foreground region. This effect is also called morphological dilatation, i.e. the clustering slightly dilates, i.e. increases in size, the form of the foreground object. This bias of the clustering method draws foreground object edges into the foreground cluster. This seemingly negative effect can be brought to good use by comparing the edge as determined by luminance segmentation to the same edge as determined by blur radius segmentation. This allows to establish to which regions an edge belongs. Blur radius determination or more in particular determination of focus characteristics may be done using alternative algorithms. Alternative algorithms for clustering may also be used. Depending on the used algorithms the so determined foreground region will overextend or underextend in respect of edge determined by luminance edges. In both cases it is possible to determine to which region an edge belongs by comparing the regions determined by luminance segmentation to the regions determined by determination and clustering of focusing characteristics.
Depth ordering can be done simply on the basis of what region is foreground and what region is background, i.e. a fixed difference in parallax can be used to distinguishing the foreground and background regions, or foremost, intermediate range and background regions, independent of the actual values C1.
Preferably the blur radius estimates for the regions are converted into a depth or inverse depth value. Given the depth orderings and σ values we may take the disparity of blurred objects as the disparity of in focus objects, i.e. the region with lowest σ, plus a constant time the difference in blur radius between foreground and background.
- ~ — + KAc u Un Wherein Δσ is the difference in σ, K is a constant and uo is the focus plane. If σ is very small Δσ equals σ of the out-of- focus plane. The cluster with the lowest blur value is assigned the depth uo; all other clusters are assigned a depth value based on their depth ordering with respect to the cluster with the lowest radius value. In case we have only two clusters, in- focus and out-of- focus, K is positive if the foreground in is focus and negative of the out-of-focus region is foreground.
For single image blur radius estimation, the constants uo and K can not be recovered, for this we would need multiple images with different focal settings. However, if we only use the depth map for rendering, most of the time the depth map is translated and scaled anyhow to match the capabilities of the screen and the preferences of the user. For an autostereoscopic display device, we may for instance take uo in such a way that the in- focus region is rendered in the plane of the screen to have a maximal sharp image. The out- focus region can then be rendered behind or in front of the screen, depending on the depth ordering. Figure 7 shows a method in accordance with the invention. From an input 2D signal, image blocks are formed in step 2, block focus characteristics, for instance the block blur radius σβ are determined in step 3, these blocks are clustered into two or more clusters in step 4. In step 6 the relation between the edge and the region is determined. This may be done directly from the focus characteristics, see Figure 6, or in parallel the image may be luminance segmented and image edge obtained by luminance segmentation (step 5) are compared in step 6 to edge determined by clustering wherein comparing the results leads to the determination of which edge belong to which regions and thereby which regions are positioned in front of which regions, i.e. the depth ordering of regions (step 7). The depth is determined from the focus characteristics (step 8) in accordance with a preferred embodiment, which in the examples given is the blur radius, the resulting 3D output signal is provided (step 9).
Figure 8 shows an image device in accordance with the invention. The image device has means for performing all the steps of the method, i.e. an input 1, for receiving a 2D input signal, a former 2 for formation image blocks, a computer 3 for computing block focus characteristics, a clusterer 4 for clustering image regions on basis of focus, an image edge detector 5, an edge-region relationship determinator 6, a depth orderer 7 and a depth information assigner 8. It furthermore comprises an output 9 for outputting a 3D signal to a 3D display screen 10. Such a display device may for instance an autostereoscopic display device. Figure 9 shows a transmitter in accordance with the invention. The difference with Figure 8 is that the display screen itself is not an integral part of the device. Such a transmitter may for instance read DVD's having a 2D signal and converting the 2D signal into a 3D signal for use in 3D display device which may be separately sold. It may also be a device which makes a DVD having 3D signals from a DVD having a 2D signal, the 3D signals may thus be provided to a DVD burner, or for instance sent to another location. 3D image signals comprising information on the division of the image in regions and the depth order of the regions and, in preferred embodiments, also the focus characteristic of the various regions also form embodiments of the invention. The information may be given in a header in the signal, which header specifies which blocks belongs to the regions, or the dividing lines between the regions, the order of the regions and preferably also the focusing characteristics of the regions, preferably the region blur radii. A 3D signal made with the prior art methods and devices does not comprise such information. A 3D signal in accordance with the invention could for instance be generated as follows: A customer has a 3D display device but a normal 2D digital camera. A user sends a 2D home video or digital image to an internet site. The original 2D signal is converted into a 3D signal, which is sent back to the user which can display the video or image on his 3D display.
In short the invention can be described as follows:
2D image data are converted into 3D image data. The image is divided, on the basis of focusing characteristics, into two or more regions, it is determined to which region an edge separating two regions belongs. The regions are depth ordered in accordance with the rule that the rule that a region comprising an edge is closer to the viewer than an adjacent region and to the regions 3-D depth information is assigned in accordance with the established depth order of the regions. Preferably to each of the regions a depth is assigned in dependence on an average or median focusing characteristic of the region.
The invention is also embodied in any computer program product for a method or device in accordance with the invention. Under computer program product should be understood any physical realization of a collection of commands enabling a processor - generic or special purpose-, after a series of loading steps (which may include intermediate conversion steps, like translation to an intermediate language, and a final processor language) to get the commands into the processor, to execute any of the characteristic functions of an invention. In particular, the computer program product may be realized as data on a carrier such as e.g. a disk or tape, data present in a memory, data traveling over a network connection -wired or wireless- , or program code on paper. Apart from program code, characteristic data required for the program may also be embodied as a computer program product.
Some of the steps required for the working of the method may be already present in the functionality of the processor instead of described in the computer program product, such as data input and output steps.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments without departing from the scope of the appended claims.
In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim.
The word "comprising" does not exclude the presence of other elements or steps than those listed in a claim. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a device claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The invention may be implemented by any combination of features of various different preferred embodiments as described above. In particular it is mentioned that any embodiment shown or claimed in relation to an encoding method or encoder has, unless otherwise indicated or impossible, a corresponding embodiment for a decoding method or decoder and such decoding methods and decoder are embodiments of the invention and claimed herewith.

Claims

CLAIMS:
1. A stereoscopic image display method wherein 2D image data (1) are converted into 3D image data (9) wherein focus information (σ) is extracted from the 2D image data and used for generating the 3D image data, wherein on the basis of focus characteristics (σ) the image is divided (4) into two or more regions (C1, C2) having a focusing characteristic (σi, σ2), it is determined (6) to which region of the image an edge separating two regions belongs, a depth order is established (7) between the regions following the rule that a region comprising an edge is closer to the viewer than an adjacent region and to the regions 3-D depth information is assigned in accordance with the established depth order of the regions.
2. A stereoscopic image display method as claimed in claim 1, wherein, prior to division of the image into regions, a focusing characteristic is assigned on a block basis (3) and the blocks are clustered into regions (4).
3. A stereoscopic image display method as claimed in claim 1, wherein the 3D depth information is assigned (8) in dependence on the focusing characteristics (σi, σ2) of the regions (C1, C2).
4. A stereoscopic image display method as claimed in claim 1, wherein the image is divided in two regions (C i, C2).
5. A stereoscopic image display method wherein the image is divided in three regions (C1, C2, C3).
6. A stereoscopic image display device comprising an input (1) for 2D image data and a converter to convert the input 2D image data into 3D image data, the converter comprising a focus information extractor (3) for extracting focus information from the 2D image data, wherein the device comprises a clusterer (4) for clustering the image on the basis of focus characteristics into two or more regions having a focusing characteristic, a determinator (6) for determining to which region of the image separating regions belongs, a depth orderer (7) for depth ordering of the regions following the rule that a region comprising an edge is closer to the viewer than an adjacent region and a display (10) for displaying the image wherein the apparent depth of the regions is in accordance with the depth ordering. A stereoscopic image display device as claimed in claim 6, comprising a computer (3) for computing, prior to division of the image into regions, a focusing characteristic on a block basis and wherein the clusterer (4) is arranged for clustering the blocks into regions.
7. A stereoscopic image display device as claimed in claim 6, wherein the device comprises a detector (5) for detecting luminance edges near region transitions.
8. A stereoscopic image display device as claimed in claim 6 wherein the device comprises a depth information assigner (8) for assigning depth information to regions on the basis of the focus characteristic of the regions.
9. An image display data conversion method wherein 2D image data are converted into 3D image data and wherein focus information is extracted from the 2D image data and used for generating the 3D image data, wherein on the basis of focus characteristics (σ) the image is divided (4) into two or more regions (C1, C2) having a focusing characteristic (σi, σ2), it is determined (6) to which region of the image an edge separating two regions belongs, a depth order is established (7) between the regions following the rule that a region comprising an edge is closer to the viewer than an adjacent region and to the regions 3-D depth information is assigned in accordance with the established depth order of the regions.
10. An image display data conversion method as claimed in claim 10, wherein the 3D depth information is assigned (8) in dependence on the focusing characteristics (σi, σ2) of the regions (C1, C2).
11. A 3D image signal comprising information on division of the image into two or more regions and depth ordering of the regions and an average focus characteristic for each of the regions.
12. A transmitter comprising an input (1) for 2D image data and a converter to convert the input 2D image data into 3D image data, the converter comprising a focus information extractor (3) for extracting focus information from the 2D image data, wherein the device comprises a clusterer (4) for clustering the image on the basis of focus characteristics into two or more regions (C1, C2) having a focusing characteristic (σi, σ2) , a determinator (6) for determining to which region of the image an edge separating two regions belongs, a depth orderer (7) for depth ordering of the regions following the rule that a region comprising an edge is closer to the viewer than an adjacent region and an output (9) for outputting the 3D image signal wherein to the regions 3-D depth information is assigned in accordance with the established depth order of the regions.
13. A computer program product to be loaded by a computer arrangement, comprising instructions to generate 3D image data on basis of a 2D image data input, the computer arrangement comprising processing means wherein the instructions are arranged for performing a method as claimed in any of the claims 1 to 5 or 10 to 11.
PCT/IB2006/054458 2005-12-02 2006-11-27 Stereoscopic image display method and apparatus, method for generating 3d image data from a 2d image data input and an apparatus for generating 3d image data from a 2d image data input WO2007063478A2 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
AT06831957T ATE542194T1 (en) 2005-12-02 2006-11-27 METHOD AND DEVICE FOR STEREO IMAGE DISPLAY, METHOD FOR GENERATING 3D IMAGE DATA FROM A 2D IMAGE DATA INPUT AND DEVICE FOR GENERATING 3D IMAGE DATA FROM A 2D IMAGE DATA INPUT
EP06831957A EP1958149B1 (en) 2005-12-02 2006-11-27 Stereoscopic image display method and apparatus, method for generating 3d image data from a 2d image data input and an apparatus for generating 3d image data from a 2d image data input
JP2008542901A JP5073670B2 (en) 2005-12-02 2006-11-27 Stereoscopic image display method and method and apparatus for generating three-dimensional image data from input of two-dimensional image data
CN2006800453217A CN101322155B (en) 2005-12-02 2006-11-27 Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input
KR1020087016167A KR101370356B1 (en) 2005-12-02 2006-11-27 Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input
US12/095,183 US8325220B2 (en) 2005-12-02 2006-11-27 Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP05111623 2005-12-02
EP05111623.4 2005-12-02

Publications (2)

Publication Number Publication Date
WO2007063478A2 true WO2007063478A2 (en) 2007-06-07
WO2007063478A3 WO2007063478A3 (en) 2007-10-11

Family

ID=38057450

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2006/054458 WO2007063478A2 (en) 2005-12-02 2006-11-27 Stereoscopic image display method and apparatus, method for generating 3d image data from a 2d image data input and an apparatus for generating 3d image data from a 2d image data input

Country Status (8)

Country Link
US (1) US8325220B2 (en)
EP (1) EP1958149B1 (en)
JP (1) JP5073670B2 (en)
KR (1) KR101370356B1 (en)
CN (1) CN101322155B (en)
AT (1) ATE542194T1 (en)
RU (1) RU2411690C2 (en)
WO (1) WO2007063478A2 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010062725A3 (en) * 2008-11-03 2010-08-05 Microsoft Corporation Converting 2d video into stereo video
WO2011097306A1 (en) * 2010-02-04 2011-08-11 Sony Corporation 2d to 3d image conversion based on image content
RU2487488C2 (en) * 2007-06-26 2013-07-10 Конинклейке Филипс Электроникс Н.В. Method and system for encoding three-dimensional video signal, encapsulated three-dimensional video signal, method and system for three-dimensional video decoder
RU2503062C2 (en) * 2008-08-26 2013-12-27 Конинклейке Филипс Электроникс Н.В. Method and system for encoding three-dimensional video signal, encoder for encoding three-dimensional video signal, encoded three-dimensional video signal, method and system for decoding three-dimensional video signal, decoder for decoding three-dimensional video signal
EP2680224A1 (en) * 2012-06-27 2014-01-01 Vestel Elektronik Sanayi ve Ticaret A.S. Method and device for determining a depth image
EP2426935A3 (en) * 2010-09-01 2014-01-08 Samsung Electronics Co., Ltd. Display apparatus and image generating method thereof
EP2416578A3 (en) * 2010-08-02 2014-09-24 Trdimize Ltd Multiclass clustering with side information from multiple sources and the application of converting 2D video to 3D
DE102007058779B4 (en) * 2007-12-06 2021-01-14 Robert Bosch Gmbh Device of a motor vehicle for generating an image suitable for image analysis

Families Citing this family (54)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8330801B2 (en) * 2006-12-22 2012-12-11 Qualcomm Incorporated Complexity-adaptive 2D-to-3D video sequence conversion
EP2130178A1 (en) * 2007-03-23 2009-12-09 Thomson Licensing System and method for region classification of 2d images for 2d-to-3d conversion
JP5337170B2 (en) 2008-02-08 2013-11-06 グーグル インコーポレイテッド Panorama camera with multiple image sensors using timed shutters
JP2010128450A (en) * 2008-12-01 2010-06-10 Nippon Telegr & Teleph Corp <Ntt> Three-dimensional display object, three-dimensional image forming apparatus, method and program for forming three-dimensional image
CN101751664B (en) * 2008-12-02 2013-04-17 奇景光电股份有限公司 Generating system and generating method for three-dimensional depth information
KR20100080704A (en) * 2009-01-02 2010-07-12 삼성전자주식회사 Method and apparatus for obtaining image data
JP4903240B2 (en) * 2009-03-31 2012-03-28 シャープ株式会社 Video processing apparatus, video processing method, and computer program
US9124874B2 (en) 2009-06-05 2015-09-01 Qualcomm Incorporated Encoding of three-dimensional conversion information with two-dimensional video sequence
JP5369952B2 (en) * 2009-07-10 2013-12-18 ソニー株式会社 Information processing apparatus and information processing method
US8878912B2 (en) * 2009-08-06 2014-11-04 Qualcomm Incorporated Encapsulating three-dimensional video data in accordance with transport protocols
US8629899B2 (en) * 2009-08-06 2014-01-14 Qualcomm Incorporated Transforming video data in accordance with human visual system feedback metrics
US9083958B2 (en) * 2009-08-06 2015-07-14 Qualcomm Incorporated Transforming video data in accordance with three dimensional input formats
US8254760B2 (en) 2009-08-28 2012-08-28 Apple Inc. Pixel analysis and frame alignment for background frames
KR101082545B1 (en) 2010-01-28 2011-11-10 주식회사 팬택 Mobile communication terminal had a function of transformation for a picture
KR101674568B1 (en) * 2010-04-12 2016-11-10 삼성디스플레이 주식회사 Image converting device and three dimensional image display device including the same
KR101690297B1 (en) * 2010-04-12 2016-12-28 삼성디스플레이 주식회사 Image converting device and three dimensional image display device including the same
KR20120005328A (en) 2010-07-08 2012-01-16 삼성전자주식회사 Stereoscopic glasses and display apparatus including the same
US20130113795A1 (en) * 2010-07-26 2013-05-09 City University Of Hong Kong Method for generating multi-view images from a single image
US9165367B2 (en) * 2010-09-02 2015-10-20 Samsung Electronics Co., Ltd. Depth estimation system for two-dimensional images and method of operation thereof
KR101638919B1 (en) * 2010-09-08 2016-07-12 엘지전자 주식회사 Mobile terminal and method for controlling the same
US9305398B2 (en) 2010-10-08 2016-04-05 City University Of Hong Kong Methods for creating and displaying two and three dimensional images on a digital canvas
TWI532009B (en) * 2010-10-14 2016-05-01 華晶科技股份有限公司 Method and apparatus for generating image with shallow depth of field
JP2012100116A (en) * 2010-11-02 2012-05-24 Sony Corp Display processing device, display processing method, and program
KR20120059367A (en) * 2010-11-30 2012-06-08 삼성전자주식회사 Apparatus for processing image based on energy value, and methods thereof
KR101188105B1 (en) * 2011-02-11 2012-10-09 팅크웨어(주) Apparatus and method for providing argumented reality using image information
KR101685418B1 (en) 2011-04-27 2016-12-12 한화테크윈 주식회사 Monitoring system for generating 3-dimensional picture
JP5868026B2 (en) 2011-05-24 2016-02-24 株式会社東芝 Ultrasonic diagnostic equipment
CN102857772B (en) * 2011-06-29 2015-11-11 晨星软件研发(深圳)有限公司 Image treatment method and image processor
WO2013009099A2 (en) 2011-07-12 2013-01-17 삼성전자 주식회사 Device and method for blur processing
US9438890B2 (en) * 2011-08-25 2016-09-06 Panasonic Intellectual Property Corporation Of America Image processor, 3D image capture device, image processing method, and image processing program
US8749548B2 (en) * 2011-09-01 2014-06-10 Samsung Electronics Co., Ltd. Display system with image conversion mechanism and method of operation thereof
CN102426693B (en) * 2011-10-28 2013-09-11 彩虹集团公司 Method for converting 2D into 3D based on gradient edge detection algorithm
WO2013077338A1 (en) * 2011-11-21 2013-05-30 株式会社ニコン Display device, and display control program
JP2013172190A (en) * 2012-02-17 2013-09-02 Sony Corp Image processing device and image processing method and program
US9286658B2 (en) * 2012-03-22 2016-03-15 Qualcomm Incorporated Image enhancement
KR20130127867A (en) * 2012-05-15 2013-11-25 삼성전자주식회사 Stereo vision apparatus and control method thereof
CN105531997B (en) * 2013-04-09 2018-07-13 贝塔尼美特股份有限公司 Method for transformation and system of the two-dimensional video to 3 D video
JP2015149547A (en) * 2014-02-05 2015-08-20 ソニー株式会社 Image processing method, image processing apparatus, and electronic apparatus
US9807372B2 (en) * 2014-02-12 2017-10-31 Htc Corporation Focused image generation single depth information from multiple images from multiple sensors
JP6603983B2 (en) * 2014-09-22 2019-11-13 カシオ計算機株式会社 Image processing apparatus, method, and program
CN104301706B (en) * 2014-10-11 2017-03-15 成都斯斐德科技有限公司 A kind of synthetic method for strengthening bore hole stereoscopic display effect
CN104796684A (en) * 2015-03-24 2015-07-22 深圳市广之爱文化传播有限公司 Naked eye 3D (three-dimensional) video processing method
US11024047B2 (en) * 2015-09-18 2021-06-01 The Regents Of The University Of California Cameras and depth estimation of images acquired in a distorting medium
EP3185209B1 (en) * 2015-12-23 2019-02-27 STMicroelectronics (Research & Development) Limited Depth maps generated from a single sensor
CN105701823A (en) * 2016-01-14 2016-06-22 无锡北邮感知技术产业研究院有限公司 Method of using occlusion relation to recover depth order
KR101825218B1 (en) * 2016-04-08 2018-02-02 한국과학기술원 Apparatus and method for generaing depth information
CN105957053B (en) * 2016-04-19 2019-01-01 深圳创维-Rgb电子有限公司 Two dimensional image depth of field generation method and device
FR3074385B1 (en) 2017-11-28 2020-07-03 Stmicroelectronics (Crolles 2) Sas SWITCHES AND PHOTONIC INTERCONNECTION NETWORK INTEGRATED IN AN OPTOELECTRONIC CHIP
KR101921608B1 (en) 2018-01-29 2018-11-26 한국과학기술원 Apparatus and method for generating depth information
JP7137313B2 (en) 2018-02-15 2022-09-14 キヤノン株式会社 Output device, image processing method and program
US10972714B2 (en) * 2018-02-15 2021-04-06 Canon Kabushiki Kaisha Image processing apparatus, image processing method and storage medium for storing program
US10523922B2 (en) * 2018-04-06 2019-12-31 Zspace, Inc. Identifying replacement 3D images for 2D images via ranking criteria
RU2690757C1 (en) 2018-08-21 2019-06-05 Самсунг Электроникс Ко., Лтд. System for synthesis of intermediate types of light field and method of its operation
US11941782B2 (en) * 2020-06-16 2024-03-26 Adobe Inc. GPU-based lens blur rendering using depth maps

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1021049A2 (en) 1999-01-14 2000-07-19 Sony Corporation Stereoscopic video display method and apparatus

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AUPN732395A0 (en) * 1995-12-22 1996-01-25 Xenotech Research Pty Ltd Image conversion and encoding techniques
JP3500056B2 (en) * 1997-11-10 2004-02-23 三洋電機株式会社 Apparatus and method for converting 2D image to 3D image
EP1314138A1 (en) * 2000-08-04 2003-05-28 Dynamic Digital Depth Research Pty. Ltd. Image conversion and encoding technique
WO2004061765A2 (en) 2003-01-06 2004-07-22 Koninklijke Philips Electronics N.V. Method and apparatus for depth ordering of digital images
WO2004107266A1 (en) * 2003-05-29 2004-12-09 Honda Motor Co., Ltd. Visual tracking using depth data
KR101038452B1 (en) * 2003-08-05 2011-06-01 코닌클리케 필립스 일렉트로닉스 엔.브이. Multi-view image generation
CN100353760C (en) * 2004-09-10 2007-12-05 张保安 Combined wide-screen television system
US8384763B2 (en) * 2005-07-26 2013-02-26 Her Majesty the Queen in right of Canada as represented by the Minster of Industry, Through the Communications Research Centre Canada Generating a depth map from a two-dimensional source image for stereoscopic and multiview imaging

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1021049A2 (en) 1999-01-14 2000-07-19 Sony Corporation Stereoscopic video display method and apparatus

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2487488C2 (en) * 2007-06-26 2013-07-10 Конинклейке Филипс Электроникс Н.В. Method and system for encoding three-dimensional video signal, encapsulated three-dimensional video signal, method and system for three-dimensional video decoder
DE102007058779B4 (en) * 2007-12-06 2021-01-14 Robert Bosch Gmbh Device of a motor vehicle for generating an image suitable for image analysis
RU2503062C2 (en) * 2008-08-26 2013-12-27 Конинклейке Филипс Электроникс Н.В. Method and system for encoding three-dimensional video signal, encoder for encoding three-dimensional video signal, encoded three-dimensional video signal, method and system for decoding three-dimensional video signal, decoder for decoding three-dimensional video signal
WO2010062725A3 (en) * 2008-11-03 2010-08-05 Microsoft Corporation Converting 2d video into stereo video
US8345956B2 (en) 2008-11-03 2013-01-01 Microsoft Corporation Converting 2D video into stereo video
WO2011097306A1 (en) * 2010-02-04 2011-08-11 Sony Corporation 2d to 3d image conversion based on image content
US8520935B2 (en) 2010-02-04 2013-08-27 Sony Corporation 2D to 3D image conversion based on image content
EP2416578A3 (en) * 2010-08-02 2014-09-24 Trdimize Ltd Multiclass clustering with side information from multiple sources and the application of converting 2D video to 3D
EP2426935A3 (en) * 2010-09-01 2014-01-08 Samsung Electronics Co., Ltd. Display apparatus and image generating method thereof
EP2683170A3 (en) * 2010-09-01 2014-01-15 Samsung Electronics Co., Ltd Display apparatus and image generating method thereof
EP2680224A1 (en) * 2012-06-27 2014-01-01 Vestel Elektronik Sanayi ve Ticaret A.S. Method and device for determining a depth image

Also Published As

Publication number Publication date
EP1958149B1 (en) 2012-01-18
US20080303894A1 (en) 2008-12-11
KR20080077391A (en) 2008-08-22
KR101370356B1 (en) 2014-03-05
ATE542194T1 (en) 2012-02-15
CN101322155A (en) 2008-12-10
RU2411690C2 (en) 2011-02-10
RU2008126927A (en) 2010-01-10
WO2007063478A3 (en) 2007-10-11
JP2009517951A (en) 2009-04-30
CN101322155B (en) 2013-03-27
EP1958149A2 (en) 2008-08-20
US8325220B2 (en) 2012-12-04
JP5073670B2 (en) 2012-11-14

Similar Documents

Publication Publication Date Title
US8325220B2 (en) Stereoscopic image display method and apparatus, method for generating 3D image data from a 2D image data input and an apparatus for generating 3D image data from a 2D image data input
CA2704479C (en) System and method for depth map extraction using region-based filtering
CN100514367C (en) Color segmentation-based stereo 3D reconstruction system and process
JP5587894B2 (en) Method and apparatus for generating a depth map
JP4879326B2 (en) System and method for synthesizing a three-dimensional image
US8542929B2 (en) Image processing method and apparatus
KR102464523B1 (en) Method and apparatus for processing image property maps
KR100953076B1 (en) Multi-view matching method and device using foreground/background separation
KR100888081B1 (en) Apparatus and method for converting 2D image signals into 3D image signals
JP2001229390A (en) Method and device for changing pixel image into segment
KR20060129371A (en) Creating a depth map
JP2009512246A (en) Method and apparatus for determining shot type of an image
US20180295289A1 (en) Image processing apparatus, method, and storage medium
KR101458986B1 (en) A Real-time Multi-view Image Synthesis Method By Using Kinect
EP3616399B1 (en) Apparatus and method for processing a depth map
JP4862004B2 (en) Depth data generation apparatus, depth data generation method, and program thereof
KR101849696B1 (en) Method and apparatus for obtaining informaiton of lighting and material in image modeling system
CN114677393A (en) Depth image processing method, depth image processing device, image pickup apparatus, conference system, and medium
AT&T
CN112991419A (en) Parallax data generation method and device, computer equipment and storage medium
Tian et al. Upsampling range camera depth maps using high-resolution vision camera and pixel-level confidence classification
KR20230117601A (en) Apparatus and method for processing depth maps
Reddy Automatic 2D-to-3D conversion of single low depth-of-field images

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200680045321.7

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2006831957

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 12095183

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 2008542901

Country of ref document: JP

Ref document number: 2756/CHENP/2008

Country of ref document: IN

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2008126927

Country of ref document: RU

Ref document number: 1020087016167

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2006831957

Country of ref document: EP