WO2004047008A1 - Reverse-rendering method for digital modeling - Google Patents

Reverse-rendering method for digital modeling

Info

Publication number
WO2004047008A1
Authority
WO
WIPO (PCT)
Prior art keywords
error function
scene
parameters
solution
determining
Prior art date
Application number
PCT/US2003/036710
Other languages
French (fr)
Inventor
Daniele Paolo David Piponi
Original Assignee
Esc Entertainment, A California Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Esc Entertainment, A California Corporation filed Critical Esc Entertainment, A California Corporation
Priority to JP2004553822A priority Critical patent/JP2006507585A/en
Priority to AU2003295582A priority patent/AU2003295582A1/en
Priority to EP03786780A priority patent/EP1565872A4/en
Publication of WO2004047008A1 publication Critical patent/WO2004047008A1/en

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/00 3D [Three Dimensional] image rendering
    • G06T15/10Geometric effects
    • G06T15/20Perspective computation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/80 2D [Two Dimensional] animation, e.g. using sprites
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/55Depth or shape recovery from multiple images
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2200/00Indexing scheme for image data processing or generation, in general
    • G06T2200/08Indexing scheme for image data processing or generation, in general involving all processing steps from image acquisition to 3D model generation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2210/00Indexing scheme for image generation or computer graphics
    • G06T2210/61Scene description

Definitions

  • the present invention relates to methods in the field of digital imaging, and more particularly, to reverse-rendering methods, such as photogrammetry and match-moving, for digital modeling.
  • one of the problems encountered in digital modeling involves the use of photogrammetry to build three-dimensional ("3-D") models using two-dimensional (“2-D”) photographic data.
  • Match-moving methods extend photogrammetry to the modeling of movement over a sequence of images.
  • in order to keep complexity and programming requirements at a manageable level.
  • many prior art methods do not permit a user to specify arbitrary expressions for defining relationships between parameters of a scene graph.
  • prior-art methods do not allow placement of the camera anywhere in the scene graph.
  • Present methods may also sometimes become locked in a computational loop or arrive at an incorrect solution, thereby failing to construct useful 3-D models from the available 2-D data at all. It is therefore desirable to provide an improved method for constructing 3-D digital models from 2-D photographic data that overcomes the limitations of prior-art photogrammetry techniques.
  • Photogrammetry, match-moving, and other reverse-rendering methods may be viewed as applications for differential calculus. Differential calculus has many applications in digital movie production.
  • SUMMARY OF THE INVENTION The present invention provides an improved reverse-rendering method that overcomes the limitations of the prior art.
  • the invention further provides a method for solving photogrammetry problems, and similar problems that use differential calculus, using generically-coded software. Methods according to the invention have been recently applied in the production of feature films, and may readily be extended to related applications for digital imaging.
  • images (which may include still photographic images, video or motion pictures) are taken of a scene from multiple camera positions.
  • the images may be provided in digital form and stored in a database to be accessed during later steps of the method.
  • Cameras may be positioned within the scene itself. Position, focal length, and orientation of the cameras may be noted for later use; however, such information need not be highly accurate.
  • a user may collect the camera data, and inspect representative images to gain an initial impression of the scene.
  • a user may construct a rough three-dimensional model of the scene as an initial estimate.
  • Any suitable modeling software, such as Maya™, may be used.
  • any other suitable method for defining a starting assumption for the solution process may be used. This may further speed the solution process and/or prevent incorrect solutions. Relationships between modeled objects in the scene may be defined by intervening transforms in a scene graph.
  • the scene graph may include selected viewable objects in the scene and known camera data for at least those cameras positioned in the scene itself. Any desired transforms between nodes in the scene graph that are consistent with the photographs may also be defined as part of the scene graph. Not all objects in the scene need be included in the scene graph, and not every transform need be defined.
  • a scene graph may be particularly useful when the nature of the photographs is such that subsequent photogrammetry steps have difficulty arriving at a correct solution, for example, when several possibilities are apparent for the shape of an object, all of which conform to the photographs.
  • Software tools for creating 3D scene graphs are known in the art.
  • a user should also designate the images that are to be used as inputs for the photogrammetry solver, and identify corresponding points or other features on the designated images.
  • a user interface may be provided to assist a user in designating common points on related images.
  • the designated images, the identified corresponding points on the images, the scene graph, and any related 3D models, may be designated as input to a photogrammetry solver.
  • Initial information that is not defined by the scene graph or modeled objects may be defined in any suitable fashion to an arbitrary, interpolated, or estimated value.
  • the initial visual coordinates and camera parameters represent a baseline for an error function, such as a least-squares function and/or any other suitable error function.
  • the principles of photogrammetry may then be applied, using the corresponding points identified in the photographs and/or other selected points or features to determine an amount of difference, i.e., error, between the initial estimate of the scene (including both geometry and camera parameters), and a scene calculated by projecting the selected points on the photographs based on the camera parameters.
  • a point "A1" may have actual spatial coordinates , in the defined coordinate system.
  • a corresponding point "A2” may have coordinates u 2 , representing an estimated position for A1.
  • the estimate A2 is based on an intersection of rays passing through an assumed point A1 in the photographic plane, as determined from each photograph and its corresponding camera parameters.
  • the error function may be minimized using any suitable error minimization method.
  • Minimization software typically computes the sum of the squares of the error (or sometimes an alternative to the square in "robust” methods) and then tries to find the solution that minimizes the error through an iterative process.
  • the iterative process of adjusting the "bundles" of rays used to define the three-dimensional geometry of the scene by minimizing an error function is sometimes referred to as "bundle adjustment" by those of ordinary skill in the art.
  • some parameters may be marked as parameters that the photogrammetry tool is required to solve for. For example, such parameters may include camera focal length, relative object scales, and rotation angles. Other parameters may be established as constant values throughout the bundle adjustment process.
  • the error measure e generally varies as a function of the unknown parameters, and a certain set of parameter values will minimize error.
  • Bundle adjustment may proceed using a second-order Newton method that includes a computation of both the first derivative of the error function (its "Jacobian") and the second derivative (its "Hessian”).
  • Second-order error minimization in this context is described, for example, in Triggs et al., "Bundle Adjustment - A Modern Synthesis" (2000).
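  As a sketch of the update such a second-order method iterates (the notation here is assumed for illustration, not taken from the patent text): writing $\theta$ for the vector of unknown parameters, $\nabla e(\theta)$ for the first derivative of the error function and $H(\theta)$ for its Hessian, each Newton iteration solves $H(\theta_k)\,\delta = -\nabla e(\theta_k)$ for the step $\delta$ and updates $\theta_{k+1} = \theta_k + \delta$.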
  • prior-art methods make use of a Gauss-Newton approximation and do not compute the full second derivative.
  • calculation of the second derivative is considered to be computationally complex, and so modified Newton methods, such as the Jacobian-based version of the Levenberg-Marquardt method, are used to avoid this complexity.
  • both the first derivative (Jacobian) and the second derivative (Hessian) of the error function may be accurately computed using the technique of automatic differentiation.
  • Principles and applications for automatic differentiation are known in other contexts, as published, for example, in On Automatic Differentiation by Andreas Griewank, Argonne National Laboratory, Mathematics and Computer Science Division, preprint ANL/MCS-P10-1088 (November 1988), which is incorporated herein by reference.
  • the techniques of automatic differentiation may be adapted to calculate the Jacobian and Hessian matrices by one of ordinary skill, with reference to the disclosure herein or to other published references on automatic differentiation. Automatic differentiation may also be used to compute the Jacobian without computing the full Hessian.
  • a solution of the error function is thereby realized much more quickly than using traditional methods, while using software that is comparatively efficient to program, to modify, and to add new features to.
  • programming for prior-art photogrammetry methods is comparatively expensive, time-consuming, and difficult to modify.
  • the full benefit of Newton's method can be realized, with rapid convergence to the error minimum, while computation of the derivatives is also accomplished quickly.
  • placing one or more of the cameras arbitrarily within the scene graph does not cause any difficulties when the solution technique of the invention is used.
  • This advantage makes it much more convenient to gather photographic images for photogrammetry, and may even enable application of photogrammetry to scenes that were heretofore difficult or impossible to adequately reconstruct from images.
  • prior-art photogrammetry methods have permitted little or no flexibility in the placement of cameras within the scene to be reconstructed, to the extent that such placement was permitted at all.
  • automatic differentiation implemented in a generic programming scheme is preferably used to compute an accurate first and/or second derivative of the error function, thereby guiding the bundle adjustment to a solution.
  • the bundle adjustment is guided by the partial information contained in the scene graph, thereby reducing the solution time and ensuring a more accurate result.
  • An additional advantage is that an algorithm according to the invention is much simpler to implement in source code than both the traditional Jacobian-based Levenberg-Marquardt method, and algebraic derivative methods.
  • the use of automatic differentiation according to the invention provides the further advantage of flexibly accommodating almost any initial scene graph. This is a key advantage because it frees the person constructing the scene from defining the initial scene graph in any particular manner. Corrections and changes are also readily accommodated.
  • any node of the scene graph can be connected to any other via a mathematical expression entered by a user.
  • Almost any expression may be used.
  • the height of an object may be designated to be a scalar multiple of the height of any other object.
  • the focal length of two cameras may be equated or otherwise related.
  • Almost any desired relationship between nodes that can be expressed mathematically may be entered as part of the scene graph. Even unusual relationships, for example, a relationship between the size of an object and an angle between two other objects, may be included.
  • a further advantage of the invention is the treatment of cameras.
  • camera data is represented at nodes, just as data for viewable objects.
  • Cameras can be placed in the scene graph in the same kinds of relationships as other objects.
  • a solution algorithm according to the invention is configured to simultaneously compute the scene geometry and the camera pose. This approach permits, unlike prior-art methods, posing a camera freely within the scene itself. In this context, "freely posing" means that the camera may be posed at any desired location
  • within the scene, without limitations imposed by a defined solution algorithm.
  • a first camera may be pointed at a freeway as a car travels past, and a second camera may be mounted on the car.
  • the second camera may be mounted on an actual scene object - the car - that is being solved for. This may be a great advantage for those situations in which the camera position and orientation depend on the geometry that is being solved for.
  • cameras may be placed in the scene on top of buildings of unknown height, obviating the need for a fly-over or for elevated vantage points external to the scene.
  • the scene to be reconstructed by photogrammetry is represented as a scene graph, which is a type of directed acyclic graph, or "DAG."
  • Scene graphs are generally not used in connection with photogrammetry for the post-production industry. Instead, in prior-art photogrammetry methods, the scene is typically represented by a collection of unrelated points. Parent/child relationships and transforms are not defined, unlike scene graphs. To the extent that the use of initial scene graphs containing partially-defined information as a precursor to a photogrammetry solution has been known at all, the ability to freely define transforms in an essentially unrestricted way is not known. Using these partial transform relationships, which are flexibly defined by the user, allows for much more accurate reconstructions and for reconstructions using far fewer images. It should be apparent that the invention is not limited to static photogrammetry.
  • the invention may readily be adapted to account for camera motion, or the motion of objects in the scene, according to principles understood in the art.
  • a sequence of five-hundred frames of film can be treated as five-hundred independent images, which are then solved for in the usual way.
  • Some parameters may vary from frame to frame (e.g. the position of a moving camera), while others (e.g. the height of a house) may remain constant.
  • the invention may readily be used for solving match-moving problems.
  • FIG. 1 is a diagram showing a three-dimensional scene to be reconstructed using a reverse-rendering method.
  • Fig. 2 is a flow chart showing exemplary steps of a reverse-rendering method according to an embodiment of the invention.
  • Fig. 3 is a block diagram showing an exemplary system for carrying out steps of the invention.
  • one aspect may comprise a method for solving bundle adjustment problems, and similar problems that use differential calculus, using generically-coded software for automatic differentiation.
  • Another aspect may comprise an improved photogrammetry method that is designed to exploit the benefits of automatic differentiation.
  • the analytical basis for the application of automatic differentiation to differential calculus problems in digital imaging is first described.
  • application of automatic differentiation using a generic programming approach is described in a second section.
  • a third section provides a description of an exemplary photogrammetry process that may make use of automatic differentiation and generic programming as disclosed herein.
  • the invention provides a method for computing derivatives of a function using a programming language, such as C++, that supports the introduction of new types and operator overloading.
  • the method is relatively easy to program without requiring a well-developed knowledge of differential calculus, does not require post-processing of source code, and is both more efficient and more accurate than finite difference methods.
  • the method makes use of automatic differentiation in the mathematical sense, as explained below.
  • the differential $d$ has some additional properties that may prove useful; in particular, $d \neq 0$ while $d^2 = 0$.
  • Equation 1 can then be rewritten as $f(x + d) = f(x) + f'(x)\,d$, because all higher-order terms of the expansion vanish when $d^2 = 0$.
  • the above automatic differentiation method may be generalized to partial derivatives.
  • a set of non-zero numbers $(d_0, d_1, \ldots, d_i)$, with $i \in I$ an index set, may be introduced.
  • a general differential of this type may now be written as $x = a + \sum_i b_i d_i$. That is, members of this new differential class may be represented as a pair consisting of the real number $a$ and the vector of coefficients $(b_i)_{i \in I}$.
  • per Equation 3, the partial derivatives of a multi-variable function $f(x_0, x_1, \ldots, x_i)$ may be obtained by computing $f(x_0 + d_0, x_1 + d_1, \ldots, x_i + d_i)$, and reading the desired $i$-th partial derivative from the coefficient of $d_i$.
  • partial derivatives for a function $f(x, y)$ mapping a pair of real numbers to another real number may be computed by computing $f(x + d_0, y + d_1)$.
  • the partial derivative with respect to $x$ is read off of the coefficient of $d_0$.
  • the partial derivative with respect to $y$ is read off of the coefficient of $d_1$.
  • All $d_i d_j$ terms are zero by definition. Note that this technique requires substantially fewer evaluations of the function $f(x, y)$ than finite difference methods.
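  As a brief worked example (illustrative; not from the patent text): for $f(x, y) = x^2 y$,
  $$f(x + d_0,\; y + d_1) = (x + d_0)^2 (y + d_1) = x^2 y + 2xy\,d_0 + x^2\,d_1,$$
  since all products of the $d_i$ vanish; the coefficients $2xy$ and $x^2$ of $d_0$ and $d_1$ are exactly $\partial f/\partial x$ and $\partial f/\partial y$.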
  • GENERIC PROGRAMMING Suitable programming languages, such as C++, may be used to operate on differential types because of their potential for "generic programming.”
  • Generic programming refers to the methodology by which software is written so as to be independent of the underlying data types used.
  • the Standard Template Library employs the C++ template mechanism to define abstract data types and algorithms independently of the types of their contents and arguments.
  • template-based mathematical libraries for C++ that are generic are also available.
  • One of the recognized purposes of generic programming is to enable code that is independent of information such as the machine representation of floating point numbers (for example, whether the type "float" or "double" is used).
  • Generically-programmed libraries may also be exploited to supply mathematical functions that will operate with a specially-defined object class for handling differentials.
  • the specially-defined object class should be designed to export an interface similar enough to that of the more usual number types that it can replace them.
  • computing derivatives of a wide class of functions may thereby be accomplished as easily as computing those functions applied to complex numbers.
  • An underlying data type float may be used.
  • the float type may be extended by adding in new elements implied by the differential $d$ in a new Differential class. Every element of the Differential class can be written in the form $a + bd$, for real $a$ and $b$.
  • the variable $a$ may be referred to as the real part and $b$ as the infinitesimal part.
  • the Differential class may be defined as:

      class Differential {
      public:
          float a; // Real part
          float b; // Infinitesimal part
          Differential(float a_, float b_) : a(a_), b(b_) {} // constructor used by the operators below
      };
  • In the alternative to making members of the Differential class public, accessor methods may be used.
  • the addition and multiplication operators may be overloaded for Differential objects:

      Differential operator+(const Differential &x, const Differential &y) {
          return Differential(x.a + y.a, x.b + y.b);
      }
      Differential operator*(const Differential &x, const Differential &y) {
          // (a1 + b1*d) * (a2 + b2*d) = a1*a2 + (a1*b2 + b1*a2)*d, since d*d == 0
          return Differential(x.a * y.a, x.a * y.b + x.b * y.a);
      }
  • a differential variable d may now be defined by:

      Differential d(0, 1);

  • d may then be used to compute the derivative of f(x) at any desired value of x, by evaluating the function with d added to the argument.
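  For instance (an illustrative function, not from the patent), with f(x) = x*x coded over Differential objects:

      Differential f(const Differential &x) { return x * x; } // f(x) = x^2

      Differential y = f(Differential(3.0f, 0.0f) + d); // evaluate f at x = 3, with d added
      // y.a == 9.0f : the value f(3)
      // y.b == 6.0f : the derivative f'(3), read from the coefficient of d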
  • the ratio of two differential objects may be defined according to Equation 6. It should be apparent that this result requires that the real part of the denominator be non-zero, but this may easily be accomplished.
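  A sketch of such a definition (this implementation is an assumption, derived by requiring that (x / y) * y == x; the patent's Equation 6 is not reproduced here):

      Differential operator/(const Differential &x, const Differential &y) {
          // (a1 + b1*d) / (a2 + b2*d) = a1/a2 + ((b1*a2 - a1*b2) / (a2*a2))*d,
          // valid only when the real part a2 of the denominator is non-zero
          return Differential(x.a / y.a, (x.b * y.a - x.a * y.b) / (y.a * y.a));
      }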
  • a cosine operation may be defined for Differential objects as follows:
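  A minimal sketch of such a definition (an assumption following the chain rule for these differential objects: cos(a + b*d) = cos(a) - sin(a)*b*d, since d*d == 0):

      #include <cmath>

      Differential cos(const Differential &x) {
          return Differential(std::cos(x.a), -std::sin(x.a) * x.b);
      }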
  • as in the FADBAD library, one approach to computing higher derivatives is to simply iterate the computation of the derivative.
  • a class of Differential objects may be defined over an arbitrary class:

      template <class X>
      class Differential {
      public:
          X a; // Real part
          X b; // Infinitesimal part
      };
  • This may be compared to the first example for defining a Differential class, which was defined over float variables.
  • We may now compute second derivatives by iterating the above method. For example, to compute the second derivative of a first function "f", a second C++ function "g" may be used to compute the derivative of f in a generic way; differentiating "g" in turn yields the second derivative of "f", as shown in the sketch below.
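  A self-contained sketch of this nesting (illustrative; the names and the test function are assumptions, and the arithmetic operators are templated versions of those defined above):

      #include <iostream>

      template <class X>
      struct Differential {
          X a; // real part
          X b; // infinitesimal part
          Differential(X a_, X b_) : a(a_), b(b_) {}
      };

      template <class X>
      Differential<X> operator+(const Differential<X> &x, const Differential<X> &y) {
          return Differential<X>(x.a + y.a, x.b + y.b);
      }

      template <class X>
      Differential<X> operator*(const Differential<X> &x, const Differential<X> &y) {
          return Differential<X>(x.a * y.a, x.a * y.b + x.b * y.a);
      }

      int main() {
          typedef Differential<float> D1;
          typedef Differential<D1> D2;
          D2 x(D1(3.0f, 1.0f), D1(1.0f, 0.0f)); // x = 3, seeded with both differentials
          D2 y = x * x;                         // f(x) = x^2
          std::cout << y.a.a << "\n";           // 9 = f(3)
          std::cout << y.a.b << "\n";           // 6 = f'(3)
          std::cout << y.b.b << "\n";           // 2 = f''(3)
          return 0;
      }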
  • interval arithmetic and affine arithmetic may also be combined with automatic differentiation for applications to inverse rendering problems, such as photogrammetry.
  • DIGITAL MOVIE APPLICATIONS Many applications in digital movie post-production may involve inverse rendering; that is, determining what input to a renderer will produce an output that matches a given image. For example, a number of parameters $(\theta_1, \theta_2, \ldots)$ may be provided as input to a 3D rendering process. These parameters may range from transformation parameters such as angle of rotation to shader parameters such as the exponent in a Blinn-Phong shader.
  • a rendering process may be considered to be a function $f : (\theta_1, \theta_2, \ldots) \mapsto (I_{ijc})$ (Equation 8), where $I_{ijc}$ represents the color of the "c" channel of the $(i,j)$-th pixel of the rendered image. If $J_{ijc}$ is some "correct" or desired result used as a baseline, then we can write a sum-of-squares error term $e = \sum_{i,j,c} (I_{ijc} - J_{ijc})^2$.
  • the automatic differentiation tool may be applied in the context of a minimization algorithm, such as non-linear conjugate gradients, to efficiently derive input parameters that result in an image $I$ that best matches $J$.
  • the technique may also be applied to a subsystem of a 3D renderer. For example, part of a ray-tracer takes as input the parameters of a light ray, and returns as output the texture coordinates that the ray intersects.
  • by writing this ray-tracing code generically, we can automatically compute the derivative of the texture coordinates as a function of the light ray parameters. This may be useful, for example, in performing anti-aliased texture mapping.
  • Another inverse rendering problem concerns transforming and projecting 3D points to 2D screen coordinates. The inverse of this operation may comprise deriving a 3D modeled geometry that best fits a collection of projected 2D points; i.e., photogrammetry and match-moving.
  • a set of 3-dimensional points $P_i$, $i \in I$ (for example, P1, P2, P3 and P4), with coordinates $(p_i)$, representing a three-dimensional scene 100.
  • the coordinates $(p_i)$ may be defined with respect to a suitable datum 110 for a coordinate system referenced to scene 100.
  • a set of images exists indexed by the set $J$, in which some of the $P_i$ appear projected.
  • a camera projection function $c_j$ exists, such that $c_j(p_i)$ is the projection of the point $P_i$ into a 2-dimensional screen space associated with camera $j$.
  • a screen space 104 may be associated with camera 102, and screen space 106 with camera 108.
  • Selected cameras, for example, camera 108 may be freely posed within scene 100.
  • a screen space associated with a camera need not encompass all points of interest within a scene.
  • screen space 106 does not encompass P4, which, however, is encompassed by screen space 104.
  • the projection function comprises a type of reverse-rendering function. Methods for defining a suitable projection function $c_j$ are well understood in the art of photogrammetry, and need not be described herein.
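  A minimal generically-programmed projection sketch (the simplified pinhole camera model, names, and parameters here are illustrative assumptions, not the patent's actual $c_j$; a production projection function would also model rotation and lens distortion, and the scalar type X is assumed to support subtraction, multiplication, and division):

      template <class X>
      struct Vec3 { X x, y, z; };

      template <class X>
      struct Vec2 { X x, y; };

      // Project a world-space point p through a camera at camPos with focal
      // length f, looking down the +z axis. Because the code is templated on
      // the scalar type X, it works unchanged with float or with Differential
      // objects, so derivatives with respect to any input fall out automatically.
      template <class X>
      Vec2<X> project(const Vec3<X> &camPos, const X &f, const Vec3<X> &p) {
          X dx = p.x - camPos.x;
          X dy = p.y - camPos.y;
          X dz = p.z - camPos.z;
          return Vec2<X>{ f * dx / dz, f * dy / dz };
      }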
  • a 2D position $z_{i,j}$ in an associated screen space may be measured from a corresponding image.
  • 2D positions for $z_{1,1}$ to $z_{4,1}$ may be measured in an image from camera 102.
  • positions for $z_{1,2}$ to $z_{3,2}$ may be measured in an image from camera 108.
  • the position $z_{i,j}$ is related to the projected position $c_j(p_i)$ by a varying amount of error $e_{i,j}$.
  • $e_{i,j}$ is the difference between actual and measured projections of $P_i$.
  • the error $e_{i,j}$ varies as a function of $c_j$ and $(p_i)$, while the measured 2D positions $z_{i,j}$ are generally fixed.
  • the amount of error may be defined over 2D coordinates $x, y$ in the relevant screen space.
  • the $x$ and $y$ coordinates of $e_{i,j}$ are independent normally distributed variables whose components have variance $\sigma^2$.
  • the positions $(p_i)$ of the points $P_i$ and any unknown parameters of $c_j$ may therefore be determined.
  • the maximum likelihood estimation for the positions $(p_i)$ and unknown parameters of $c_j$ may be determined from the minimum error $e$, wherein $e$ is defined by Equation 11: $e = \sum_{i,j} \left\| z_{i,j} - c_j(p_i) \right\|^2$.
  • the derivative can be used to minimize the error function using an iterative minimization algorithm, as known in the art, and sometimes referred to as bundle adjustment.
  • the solution process may be greatly facilitated by expressing Equation 11 using generic programming techniques as disclosed herein, and solving for the derivative by automatic differentiation.
  • various choices for a solution process may present themselves during the design of a photogrammetry application implementing the invention, and exemplary details are provided below.
  • one such solution method 200 may be summarized as follows.
  • image data representing a plurality of images of a scene are received into a solution process.
  • a user interface may be provided for a user to identify and select pertinent images from a database, and then initiate a solution process using the selected images. Images of the scene may be collected using any suitable camera, as known in the art.
  • user input designating corresponding points or other corresponding features appearing in two or more of the selected images is received.
  • the user input serves to mark and identify corresponding features appearing in more than one image.
  • Each corresponding feature should be located on a node of the scene graph. For example, while displaying multiple images on a computer display, a user may indicate corresponding points on the images using a pointing device. Any suitable method may be used to receive user input indicating a plurality of corresponding features in the image data. Methods for entering and recording such measurements are known in the art.
  • a preliminary solution estimate for the scene is received for use in the solution process.
  • the preliminary solution estimate may be developed by a user, based on any desired available information or estimate.
  • One convenient approach may be to accept a preliminary 3D model as the solution estimate, preferably in scene graph form.
  • Many users may be familiar with 3D modeling software, and may build an approximation to the scene using any suitable modeling program.
  • a solution algorithm was designed to accept an approximate model, in scene graph form, constructed using Alias|Wavefront Maya™ as input.
  • an initial solution estimate may be developed automatically, or adopted from an arbitrary set of values. Points in the scene graph or other solution estimate should be related to measurements of 2D position, e.g., z.
  • a scene graph is a type of directed acyclic graph, meaning that it is a one-way tree structure without looping, like a family tree. For example, a "parent" and/or "children" are identified for each object or node. A parent may have multiple children, and a child may have multiple parents.
  • a child is not permitted to be a parent to any node in its parental lineage.
  • Elements that have a data component, like viewable objects or camera locations, are represented at the nodes.
  • Each node represents a function that will return a value depending on input parameters including space and time.
  • An important aspect of scene graphs is the defined relationship between hierarchically-related objects, sometimes referred to as a "transform.”
  • the relative orientation, size, mode of attachment, or other relationship of a child object with respect to its parent is the transform of the child object.
  • An object's transform can be manipulated to adjust the relationships between the parent and the child objects. For example, to adjust the size of a hand (child) relative to an arm (parent), a size parameter of the hand transform may be increased.
  • Transforms are inherited, in that the transform of an object is inherited by its children. For example, when the transform for the arm is adjusted to make the arm twice as large, then the hand grows twice as large, too.
  • the entire collection of objects, parent-child relationships and transforms comprises the scene graph. So long as any desired transforms are expressed as differentiable functions, they may readily be incorporated into an expression for e and differentiated.
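  A sketch of how such a differentiable transform might look in the generic style used above (illustrative names, not taken from the patent; reuses the Vec3 template from the projection sketch earlier):

      // A scale transform node: the scale factors can be ordinary floats, or
      // Differential objects when marked for solving, because the node is
      // templated on the scalar type.
      template <class X>
      struct ScaleTransform {
          X sx, sy, sz; // scale parameters, possibly marked as unknowns
          Vec3<X> apply(const Vec3<X> &childPoint) const {
              return Vec3<X>{ sx * childPoint.x, sy * childPoint.y, sz * childPoint.z };
          }
      };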
  • Scene graphs are useful for modeling movement and scaling of objects.
  • When an object is moved, grows in size, or shrinks, normally all of its child objects move, grow, or shrink along with it.
  • a computer-generated actor may provide a simple example: when the actor's arm is moved, its attached hand normally moves along with it. In terms of the scene graph, the hand is defined as a child of the arm.
  • thus, when the actor's arm is moved, the animator doesn't need to animate the hand separately.
  • Static objects may also be included in a scene graph.
  • An example of a static object is a building with windows. If the building itself is the parent of the windows, then when the building is relocated during the photogrammetry method, the windows will automatically move with it. Additionally, if the size or proportion of the building is changed, the windows will also scale with it.
  • any suitable software application, for example Open Inventor™ or Maya™, may be used to build a scene graph for use with a method according to the invention.
  • the scene graph contains partial and/or approximate information.
  • windows may be related as child objects to a building whose size and position is not yet known.
  • a transform for each window may be configured to contain partial information about its relative size and orientation.
  • for example, the transform may specify that the window has rectangular edges, lies flat in a wall of the building, and is oriented parallel to the edges of the wall, without specifying the dimensions of the window.
  • Any parameter of the transform that is incomplete or incorrect is automatically computed using photogrammetry techniques.
  • Complete information about the scene is not required, because the information in the initial scene graph guides the photogrammetry solution, but does not determine it.
  • the 3D model, preferably in scene graph form, may be accepted as the initial solution estimate.
  • relationships in the preliminary solution estimate may be further defined from user input.
  • users may define transformations in a Maya™ scene graph to represent partially known information in the scene. For example, if an object in the scene is known to be a rectangular solid, but has unknown dimensions, then users may instantiate a "cube" object in the scene graph and scale it using a transform.
  • the user may then mark variables in the scene graph whose values are unknown.
  • a user may mark three numbers defining the unknown dimensions of the solid.
  • the photogrammetry projection function should be defined to include information or assumptions regarding camera parameters, including but not limited to camera position, camera orientation, focal length, lens and/or focal plane distortion, and other factors that affect the appearance of the image.
  • these parameters may be received for use in the solution process.
  • at least some projection function parameters may be marked as unknown, and solved for.
  • the camera pose may be unknown, particularly if the images represent a time sequence during which the camera moved, or if the pose was not measured for any other reason.
  • Other camera parameters are often known, for example, focal length. Relationships may be defined between camera parameters. For example, the focal length of two or more cameras may be equated. All of these camera parameters may be represented as nodes in the initial scene graph.
  • this may facilitate solving for unknown camera parameters in the same way - i.e., using the same homogeneous set of equations - as the geometry of the scene.
  • This may greatly enhance the power and flexibility of a method according to the invention, compared to prior art methods in which camera parameters are handled separately. For example, camera pose at any given time may be treated just like any other unknown parameter for solution, enabling cameras to be freely posed inside or outside of the scene.
  • an error function e for calculating a solution is determined.
  • the solution may comprise the desired positions $(p_i)$ and unknown parameters of the projection function, for example, any unknown camera parameters.
  • the error function e is defined such that it is differentiable, and represents a difference (for example, a least-squares difference as generally expressed in Equation 11) between predicted and measured values of the projected points $z_{i,j}$. It should be apparent that e generally comprises a system of equations that may be expressed in matrix form.
  • an error function may be defined using a rendering subsystem of an existing rendering and modeling program.
  • the rendering subsystem may be modified to compute the projections of points in this scene hierarchy (i.e., to perform reverse rendering) in a generic way.
  • the existing rendering subsystem utilizes scene graphs, including transforms and camera parameters, so the resulting generically-programmed projection function $c_j$ may readily accept input and provide output using a standard scene graph format.
  • the error function e may be iterated so as to discover the collection of unknown parameters, e.g., the unknown marked parameters and the points $(p_i)$ of the scene geometry, that minimizes its value.
  • the value of these parameters at the global minimum of e may be regarded as the solution for the originally-defined reverse- rendering problem.
  • the iteration may begin with a solution estimate comprising the information received at steps 206-210.
  • Minimization may be performed using an iterative minimization algorithm.
  • an active-set variation of the Levenberg-Marquardt algorithm, suitable for bounded and unbounded minimization, may be used for bundle adjustment.
  • the full exact Hessian and Jacobian may be calculated using automatic differentiation as disclosed herein.
  • a conjugate gradient algorithm may be used to solve the linear system in the Hessian.
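  In this variation, each iteration may solve a damped linear system of the general form $(H + \lambda D)\,\delta = -g$, where $H$ is the exact Hessian of $e$ computed by automatic differentiation, $g$ its gradient, $D$ a diagonal scaling matrix, and $\lambda$ the Levenberg-Marquardt damping parameter; the conjugate gradient algorithm then solves for the step $\delta$. (The exact damping and scaling strategy is an assumption here; the general form is standard for Levenberg-Marquardt methods.)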
  • the parameter values corresponding to the global minimum of e may be used to build a model of the three-dimensional scene. For example, solved parameters may be incorporated into the preliminary estimate scene graph as a parameter of an existing or newly-added node, to complete the solution.
  • the reconstructed scene may be presented to the user. If the reconstruction contains substantial errors, the user may be given the opportunity to make adjustments to the rough model and run the process again. If the result is substantially correct, it may be readily detailed by a graphics artist, advantageously already being in a standard scene graph format.
  • the solution should be generalized to include partial differentiation, because e in general depends on more than one unknown parameter. It may be advantageous to represent the vector $(b_i)_{i \in I}$ (the infinitesimal part of the partial differential object employed in automatic differentiation) sparsely, as index-value pairs rather than as a dense list of values. This permits sparse representation of the Hessian, and use of a sparse conjugate gradient method to solve for the iteration step. Exploiting sparsity may prove very helpful for obtaining good performance.
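  A sketch of such a sparse representation (hypothetical, illustrating the index-value-pair idea described above):

      #include <map>

      struct SparseDifferential {
          float a;                // real part
          std::map<int, float> b; // infinitesimal part: unknown-index -> coefficient

          SparseDifferential(float a_ = 0.0f) : a(a_) {}
      };

      // Product rule: only the indices actually present in either operand
      // appear in the result, so terms touching few unknowns stay small.
      SparseDifferential operator*(const SparseDifferential &x, const SparseDifferential &y) {
          SparseDifferential r(x.a * y.a);
          for (std::map<int, float>::const_iterator it = x.b.begin(); it != x.b.end(); ++it)
              r.b[it->first] += it->second * y.a;
          for (std::map<int, float>::const_iterator it = y.b.begin(); it != y.b.end(); ++it)
              r.b[it->first] += x.a * it->second;
          return r;
      }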
  • if each of the $L$ terms of e involves at most $M$ of the unknown variables, the Hessian has no more than $LM^2$ non-zero terms.
  • the number of non-zero terms is usually much less than $N^2$, where $N$ is the total number of unknowns.
  • In an embodiment of the invention that was used for post-production work on a motion picture, a form of automatic differentiation known as forward mode differentiation was implemented.
  • An alternative approach to automatic differentiation is reverse mode automatic differentiation, which is described in the mathematics and computer science art. Reverse mode differentiation may often be better suited to problems involving large numbers of input variables. However, in an embodiment of the invention in which reverse mode differentiation was implemented, the performance realized was inferior to that achieved in embodiments that implemented forward mode differentiation with sparse representation of the differentials. It may also be possible to implement the invention using a sparse variation of reverse mode differentiation, which is as yet untested for this application.
  • Method 200 readily encompasses the solution of complex match-moving problems. For example, sequences of live action images may be treated as sets of independent images, to permit simultaneous solution of both static and time-dependent parameters. In fact, instead of computing match-moves from frame to frame incrementally, as traditionally done, match-moves may be computed by minimizing over all frames simultaneously. To the extent that the input images represent a sequence over time, then the user may indicate whether or not a parameter is to be considered animated.
  • the invention affords users a high degree of flexibility to describe known information about a scene.
  • any transformation parameters such as rotation angles or relative scale could be marked for solving.
  • An arbitrary number of cameras could be placed anywhere in the scene hierarchy, and parameters for camera transforms could be marked for solving.
  • the solution method proved able to reconstruct animated articulated geometry, and even to reconstruct cameras mounted on articulated geometry (as with real camera rigs). Such cameras may be handled like any other animated parameter.
  • some 3D modeling applications, for example Maya™, support the connection of parameters in a scene graph using symbolic expressions. Such applications may be used to build input for the solution step.
  • an expression evaluator similar to that provided in Maya™ was written in C++ in a generic and thus differentiable manner. Expressions representing relationships between parameters of a scene were allowed to enter into the expression for e.
  • users may be able to express constraints that may be difficult to express using transforms. For example, users may be able to specify that two independent objects are of the same unknown height, or that two cameras have the same unknown focal length.
  • the invention has proved to be a highly efficient method for computing derivatives used for solving reverse-rendering problems.
  • the invention has provided an efficient solution to the problems of reconstruction of geometry and camera moves from film.
  • the invention has proven capable of solving for both structured and unstructured geometry.
  • An application embodying the invention has been found suitable for completely replacing prior-art commercial match-moving and photogrammetry software for time-sensitive and major visual effects projects for motion pictures.
  • the effectiveness of the invention as implemented for motion picture post- production rested in no small part on a consistently generic implementation of the expression for e .
  • Once reusable library components were written for defining e in a flexible and generic manner, then finding the solution for e was no more difficult to implement than the forward code that simply transforms and projects points.
  • the solver consisted almost entirely of generic code to compute e , and library code to optimize generic functions.
  • the invention and in particular, efficient automatic differentiation in a generic programming environment, enabled a solution using the full Newton method instead of the commonly-used Gauss-Newton approximation.
  • the full Newton method yields an accurate solution very quickly, but is often considered too complex for practical use.
  • the application of automatic differentiation in the generically- programmed environment removed these formerly daunting limitations.
  • the Gauss-Newton method may also be implemented, and may be preferable for some applications. Comparison between the two methods is a complex topic, and either method may be adopted, depending on the circumstances.
  • even for a large number of frames, when using the implementation according to the invention in this way, the time to solve was often significantly less than the time to communicate the results back to the 3D modeler.
  • Figure 3 shows one such system 300, comprising a computer 302 connected to receive image data from a database 304.
  • System 300 may further comprise a memory 306 operably associated with computer 302.
  • Memory 306 may contain coded instructions to enable one of ordinary skill to carry out a method according to the invention.
  • Memory 306 may comprise instructions for performing steps of a method according to the invention, for example: (i) receiving image data comprising a plurality of photographic images of a three-dimensional scene.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Geometry (AREA)
  • Computer Graphics (AREA)
  • Multimedia (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Signal Processing (AREA)
  • Software Systems (AREA)
  • Image Generation (AREA)
  • Image Processing (AREA)
  • Processing Or Creating Images (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)
  • Studio Devices (AREA)

Abstract

A method for automatically or semi-automatically constructing a digital 3D model of a scene from photographic data and photogrammetry data includes defining an initial rough model as a solution estimate. A reverse rendering step includes a second-order solution method that employs automatic differentiation techniques to accurately compute derivatives of an error function. In an embodiment of the method, at least one camera is placed within the scene being constructed, and photographic data from this camera is used in the solution process.

Description

REVERSE-RENDERING METHOD FOR DIGITAL MODELING
CROSS-REFERENCE TO RELATED APPLICATION
This application claims priority pursuant to 35 U.S.C. § 119(e) to U.S. Provisional Application Number 60/426,560, filed November 15, 2002, which application is specifically incorporated herein, in its entirety, by reference.
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to methods in the field of digital imaging, and more particularly, to reverse-rendering methods, such as photogrammetry and match-moving, for digital modeling.
2. Description of Related Art
In recent decades, computers have increasingly been used to generate and/or modify moving images for virtually every application that makes use of them, for example, television and motion pictures, computer games, and engineering model building. It is believed that use of digital modeling will become much more widespread in the future. Reverse-rendering methods - i.e., methods for determining what geometric, motion, lighting, camera, and other input will result in a desired image or image sequence - are likely to play an important role in developing digital models for a variety of applications.
For example, one of the problems encountered in digital modeling involves the use of photogrammetry to build three-dimensional ("3-D") models using two-dimensional ("2-D") photographic data. Match-moving methods extend photogrammetry to the modeling of movement over a sequence of images. Current photogrammetry and match-moving methods are difficult to implement in a flexible manner. Consequently, prior-art applications generally place undesirable constraints on acceptable input data, in order to keep complexity and programming requirements at a manageable level. For example, many prior art methods do not permit a user to specify arbitrary expressions for defining relationships between parameters of a scene graph. In addition, prior-art methods do not allow placement of the camera anywhere in the scene graph. Present methods may also sometimes become locked in a computational loop or arrive at an incorrect solution, thereby failing to construct useful 3-D models from the available 2-D data at all. It is therefore desirable to provide an improved method for constructing 3-D digital models from 2-D photographic data that overcomes the limitations of prior-art photogrammetry techniques. Photogrammetry, match-moving, and other reverse-rendering methods may be viewed as applications for differential calculus. Differential calculus has many applications in digital movie production. For example, its applications include rendering (anti-aliasing, motion-blurring), animation (inverse-kinematics, match-moving), modeling (determination of normals, photogrammetry), simulation (implicit integration of equations of motion), and other miscellaneous applications (lens and color calibration, optical flow estimation, and others). Traditionally, differential calculus is applied to such problems by writing software to compute derivatives of functions and approximations of derivatives, such as by deriving expressions by manual analysis and implementing these as expressions in software code, by post-processing, or using finite differences. Each of these traditional methods is subject to various limitations. Differentiating code by hand is a tedious and error-prone process, and generally makes code more difficult to modify. Post-processing of source code to automatically generate coded differential expressions complicates the programming process and constrains coding styles. Finite difference methods, which calculate an approximation of a differential using a finite difference δ, may not be sufficiently accurate. With these methods, there is some difficulty associated with selecting a proper value for δ: too large, and the approximation will be poor; too small, and the result may be dominated by rounding errors. However, there is another technique: automatic differentiation. Despite extensive use of differential calculus, automatic differentiation is not well known in the digital imaging art, although some scattered references exist. It is further desirable to apply automatic differentiation to the solution of various differential calculus problems in digital imaging, for photogrammetry and other applications, in a way that increases computational speed and/or lowers programming costs.
SUMMARY OF THE INVENTION
The present invention provides an improved reverse-rendering method that overcomes the limitations of the prior art. The invention further provides a method for solving photogrammetry problems, and similar problems that use differential calculus, using generically-coded software. Methods according to the invention have been recently applied in the production of feature films, and may readily be extended to related applications for digital imaging.
The following paragraphs summarize exemplary steps of a method according to the invention. Initially, images (which may include still photographic images, video or motion pictures) are taken of a scene from multiple camera positions. The images may be provided in digital form and stored in a database to be accessed during later steps of the method. Cameras may be positioned within the scene itself. Position, focal length, and orientation of the cameras may be noted for later use; however, such information need not be highly accurate. A user may collect the camera data, and inspect representative images to gain an initial impression of the scene.
Optionally, a user may construct a rough three-dimensional model of the scene as an initial estimate. Any suitable modeling software, such as Maya™, may be used.
In the alternative, any other suitable method for defining a starting assumption for the solution process may be used. This may further speed the solution process and/or prevent incorrect solutions. Relationships between modeled objects in the scene may be defined by intervening transforms in a scene graph.
The scene graph may include selected viewable objects in the scene and known camera data for at least those cameras positioned in the scene itself. Any desired transforms between nodes in the scene graph that are consistent with the photographs may also be defined as part of the scene graph. Not all objects in the scene need be included in the scene graph, and not every transform need be defined. A scene graph may be particularly useful when the nature of the photographs is such that subsequent photogrammetry steps have difficulty arriving at a correct solution, for example, when several possibilities are apparent for the shape of an object, all of which conform to the photographs. Software tools for creating 3D scene graphs are known in the art.
A user should also designate the images that are to be used as inputs for the photogrammetry solver, and identify corresponding points or other features on the designated images. A user interface may be provided to assist a user in designating common points on related images. The designated images, the identified corresponding points on the images, the scene graph, and any related 3D models, may be designated as input to a photogrammetry solver. Initial information that is not defined by the scene graph or modeled objects may be defined in any suitable fashion to an arbitrary, interpolated, or estimated value. The initial visual coordinates and camera parameters represent a baseline for an error function, such as a least-squares function and/or any other suitable error function.
The principles of photogrammetry may then be applied, using the corresponding points identified in the photographs and/or other selected points or features to determine an amount of difference, i.e., error, between the initial estimate of the scene (including both geometry and camera parameters), and a scene calculated by projecting the selected points on the photographs based on the camera parameters. For example, a point "A1" may have actual spatial coordinates $u_1$ in the defined coordinate system. A corresponding point "A2" may have coordinates $u_2$, representing an estimated position for A1. The estimate A2 is based on an intersection of rays passing through an assumed point A1 in the photographic plane, as determined from each photograph and its corresponding camera parameters. The difference $e = u_1 - u_2$ is the error for point A1. This is repeated for multiple points to define a least-squares error $e$ as a function of all the unknown parameters of interest. Techniques for defining the error are known in the art.
The error function may be minimized using any suitable error minimization method. Minimization software typically computes the sum of the squares of the error (or sometimes an alternative to the square in "robust" methods) and then tries to find the solution that minimizes the error through an iterative process. The iterative process of adjusting the "bundles" of rays used to define the three-dimensional geometry of the scene by minimizing an error function is sometimes referred to as "bundle adjustment" by those of ordinary skill in the art. In the present method, when the user builds the scene graph, some parameters may be marked as parameters that the photogrammetry tool is required to solve for. For example, such parameters may include camera focal length, relative object scales and rotation angles. Other parameters may be established as constant values throughout the bundle adjustment process. The error measure e generally varies as a function of the unknown parameters, and a certain set of parameter values will minimize error.
The invention permits one or more unknown parameters to be related to one or more other unknown parameters using an arbitrary expression. For example, if "a" and "b" represent scale factors for respective objects in the scene, and it is known that a first one of the objects is twice as large as a second one, this may be expressed symbolically by "a = 2b." Unlike prior-art bundle adjustment methods, such further relationships may be freely incorporated into the error function.
Bundle adjustment may proceed using a second-order Newton method that includes a computation of both the first derivative of the error function (its "Jacobian") and the second derivative (its "Hessian"). Second-order error minimization in this context is described, for example, in Triggs et al., "Bundle Adjustment - A Modern Synthesis" (2000). However, prior-art methods make use of a Gauss-Newton approximation and do not compute the full second derivative. Traditionally, accurate calculation of the second derivative is considered to be computationally complex, and so modified Newton methods, such as the Jacobian-based version of the Levenberg-Marquardt method, are used to avoid this complexity.
According to the present invention, however, both the first derivative (Jacobian) and the second derivative (Hessian) of the error function may be accurately computed using the technique of automatic differentiation. Principles and applications for automatic differentiation are known in other contexts, as published, for example, in On Automatic Differentiation by Andreas Griewank, Argonne National Laboratory, Mathematics and Computer Science Division, preprint ANL/MCS-P10-1088 (November 1988), which is incorporated herein by reference. The techniques of automatic differentiation may be adapted to calculate the Jacobian and Hessian matrices by one of ordinary skill, with reference to the disclosure herein or to other published references on automatic differentiation. Automatic differentiation may also be used to compute the Jacobian without computing the full Hessian. A solution of the error function is thereby realized much more quickly than using traditional methods, while using software that is comparatively efficient to program, to modify, and to add new features to. In contrast, programming for prior-art photogrammetry methods is comparatively expensive, time-consuming, and difficult to modify. The full benefit of Newton's method can be realized, with rapid convergence to the error minimum, while computation of the derivatives is also accomplished quickly. Advantageously, placing one or more of the cameras arbitrarily within the scene graph does not cause any difficulties when the solution technique of the invention is used. This advantage, in turn, makes it much more convenient to gather photographic images for photogrammetry, and may even enable application of photogrammetry to scenes that were heretofore difficult or impossible to adequately reconstruct from images. In contrast, prior-art photogrammetry methods have not permitted much or any flexibility in the placement of cameras within the scene to be reconstructed, to the extent that such placement was permitted at all.
Unlike prior-art bundle-adjustment methods, automatic differentiation implemented in a generic programming scheme is preferably used to compute an accurate first and/or second derivative of the error function, thereby guiding the bundle adjustment to a solution. In addition, the bundle adjustment is guided by the partial information contained in the scene graph, thereby reducing the solution time and ensuring a more accurate result. An additional advantage is that an algorithm according to the invention is much simpler to implement in source code than both the traditional Jacobian-based Levenberg-Marquardt method and algebraic derivative methods. In addition to these advantages, the use of automatic differentiation according to the invention provides the further advantage of flexibly accommodating almost any initial scene graph. This is a key advantage because it frees the person constructing the scene from defining the initial scene graph in any particular manner. Corrections and changes are also readily accommodated.
As an example of the flexibility afforded by the invention, any node of the scene graph can be connected to any other via a mathematical expression entered by a user. Almost any expression may be used. For example, the height of an object may be designated to be a scalar multiple of the height of any other object. For further example, the focal lengths of two cameras may be equated or otherwise related. Almost any desired relationship between nodes that can be expressed mathematically may be entered as part of the scene graph. Even unusual relationships, for example, a relationship between the size of an object and an angle between two other objects, may be included.
A further advantage of the invention is the treatment of cameras. In a scene graph, camera data is represented at nodes, just as data for viewable objects is. Cameras can be placed in the scene graph in the same kinds of relationships as other objects. At the same time, a solution algorithm according to the invention is configured to simultaneously compute the scene geometry and the camera pose. Unlike prior-art methods, this approach permits posing a camera freely within the scene itself. In this context, "freely posing" means that the camera may be posed at any desired location within the scene, without limitations imposed by a defined solution algorithm. For example, a first camera may be pointed at a freeway as a car travels past, and a second camera may be mounted on the car. Notably, the second camera may be mounted on an actual scene object - the car - that is being solved for. This may be a great advantage for those situations in which the camera position and orientation depend on the geometry that is being solved for. For example, cameras may be placed in the scene on top of buildings of unknown height, obviating the need for a fly-over or for elevated vantage points external to the scene.
In an embodiment of the invention, the scene to be reconstructed by photogrammetry is represented as a scene graph, which is a type of directed acyclic graph, or "DAG." Scene graphs are generally not used in connection with photogrammetry for the post-production industry. Instead, in prior-art photogrammetry methods, the scene is typically represented by a collection of unrelated points; unlike scene graphs, parent/child relationships and transforms are not defined. To the extent that the use of initial scene graphs containing partially-defined information as a precursor to a photogrammetry solution has been known at all, the ability to freely define transforms in an essentially unrestricted way is not known. Using these partial transform relationships, which are flexibly defined by the user, allows for much more accurate reconstructions and for reconstructions using far fewer images. It should be apparent that the invention is not limited to static photogrammetry.
With the addition of a time coordinate, the invention may readily be adapted to account for camera motion, or the motion of objects in the scene, according to principles understood in the art. For example, a sequence of five hundred frames of film can be treated as five hundred independent images, which are then solved for in the usual way. Some parameters may vary from frame to frame (e.g., the position of a moving camera), while others (e.g., the height of a house) may remain constant. Hence, the invention may readily be used for solving match-moving problems.
A more complete understanding of the methods according to the present invention will be afforded to those skilled in the art, as well as a realization of additional advantages and objects thereof, by a consideration of the following detailed description of the preferred embodiment. Reference will be made to the appended sheets of drawings, which will first be described briefly.
BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 is a diagram showing a three-dimensional scene to be reconstructed using a reverse-rendering method.
Fig. 2 is a flow chart showing exemplary steps of a reverse-rendering method according to an embodiment of the invention.
Fig. 3 is a block diagram showing an exemplary system for carrying out steps of the invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention may be considered to have several different aspects that overcome the limitations of the prior art. For example, one aspect may comprise a method for solving bundle adjustment problems, and similar problems that use differential calculus, using generically-coded software for automatic differentiation. Another aspect may comprise an improved photogrammetry method that is designed to exploit the benefits of automatic differentiation. In the detailed description that follows, the analytical basis for the application of automatic differentiation to differential calculus problems in digital imaging is first described. Then, application of automatic differentiation using a generic programming approach is described in a second section. A third section provides a description of an exemplary photogrammetry process that may make use of automatic differentiation and generic programming as disclosed herein.
I. AUTOMATIC DIFFERENTIATION

The invention provides a method for computing derivatives of a function using a programming language, such as C++, that supports the introduction of new types and operator overloading. The method is relatively easy to program without requiring a well-developed knowledge of differential calculus, does not require post-processing of source code, and is both more efficient and more accurate than finite difference methods. The method makes use of automatic differentiation in the mathematical sense, as explained below.
The derivative of a function $f$, written as $f'(x)$ or $f'$, may be defined as

$$f'(x) = \lim_{\delta \to 0} \frac{f(x + \delta) - f(x)}{\delta}. \qquad (1)$$
The following example illustrates the basis for automatic differentiation as a solution technique, for one of ordinary skill in computer graphics programming. A more complete treatment may be found in the mathematics literature; e.g., Automatic Differentiation of Algorithms: Theory, Implementation, and Application, SIAM, Philadelphia, Penn., 1991. Consider an example wherein $f(x) = 2x^2 + 1$. Set the variable $\delta = d$, wherein $d$ is a small non-zero number, very similar in concept to an infinitesimal as first introduced by Newton and Leibniz. Then, an approximation to $f'(x)$ may be obtained by

$$\frac{f(x + d) - f(x)}{d} = \frac{2(x + d)^2 + 1 - (2x^2 + 1)}{d} = \frac{4xd + 2d^2}{d} = 4x + 2d. \qquad (2)$$

Comparing this result with the exact derivative $4x$, an additional error term $2d$ is evident, having the same order of magnitude as $d$. Note that the error term arises from the $2d^2$ term in the numerator. Thus, if $d$ is chosen to be sufficiently close to zero and less than one, then $d^2 \ll d$ and is therefore much closer to zero. For example, if $d = 10^{-3}$, then $d^2 = 10^{-6}$. Ultimately, if $d \neq 0$ and $d^2 = 0$, then Equation (2) would yield an exact derivative, but there is no real number $d$ with such properties. The number system can be extended, however, to include a "differential" $d$ having the property $d \neq 0$ and $d^2 = 0$, as introduced in a different context by Clifford in the year 1873. As with the imaginary number $i$, the differential $d$ has some additional properties that may prove useful. For example, $d$ commutes with real numbers, so that $\alpha d = d\alpha$ and $\alpha + d = d + \alpha$ for any real number $\alpha$. Using the differential $d$ instead of the limit of $\delta$, Equation 1 can be rewritten as
$$f(x + d) = f(x) + d f'(x). \qquad (3)$$
The exact derivative may therefore be computed by computing $f(x + d)$, and reading the coefficient of $d$. For example, if $f(x) = x^n$, then

$$f(x + d) = (x + d)^n = x^n + n x^{n-1} d + d^2 \left[ \frac{n(n-1)}{2} x^{n-2} + \cdots \right]. \qquad (4)$$

All of the terms on the right-hand side that contain $d^2$ are equal to zero, leaving only the first two terms. The exact derivative is therefore the coefficient of $d$, which is $n x^{n-1}$, as expected.
The above automatic differentiation method may be generalized to partial derivatives. Instead of the single differential $d$, a set of non-zero numbers $(d_0, d_1, \ldots, d_i)$, with $i \in I$ an index set, may be introduced. The set $(d_0, d_1, \ldots, d_i)$ commutes with all real numbers, and has the additional property that $d_i d_j = 0$ for all $i, j \in I$. A general differential of this type may now be written as $x = a + \sum_i b_i d_i$. That is, members of this new differential class may be represented as a pair consisting of the real number $a$ (the real part) and a vector of real numbers, $(b_i)_{i \in I}$ (the infinitesimal part). Extending Equation 3 to this new differential class, the partial derivatives of a multi-variable function $f(x_0, x_1, \ldots, x_i)$ may be obtained by computing $f(x_0 + d_0, x_1 + d_1, \ldots, x_i + d_i)$, and reading the desired $i$th partial derivative from the coefficient of $d_i$. For example, partial derivatives for a function $f(x, y)$ mapping a pair of real numbers to another real number may be computed by computing $f(x + d_0, y + d_1)$. The partial derivative with respect to $x$ is read off of the coefficient of $d_0$, and the partial derivative with respect to $y$ is read off of the coefficient of $d_1$. All $d_0 d_1$ terms are zero by definition. Note that this technique requires substantially fewer computations of the function $f(x, y)$ than finite difference methods.
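For concreteness, the following self-contained C++ sketch, added here for illustration, carries a two-component infinitesimal part to obtain both partial derivatives of a two-variable function in a single evaluation; the class name Dual2 and the example function are hypothetical.

    #include <cstdio>

    // Real part plus a two-component infinitesimal part (b0*d0 + b1*d1).
    // All products of differentials vanish, so only first-order terms are kept.
    struct Dual2 {
        float a, b0, b1;
        Dual2(float a_, float b0_ = 0.0f, float b1_ = 0.0f)
            : a(a_), b0(b0_), b1(b1_) {}
    };

    Dual2 operator+(const Dual2 &x, const Dual2 &y) {
        return Dual2(x.a + y.a, x.b0 + y.b0, x.b1 + y.b1);
    }

    Dual2 operator*(const Dual2 &x, const Dual2 &y) {
        // d0*d0 = d1*d1 = d0*d1 = 0, so products of infinitesimals drop out.
        return Dual2(x.a * y.a,
                     x.a * y.b0 + x.b0 * y.a,
                     x.a * y.b1 + x.b1 * y.a);
    }

    int main() {
        // f(x, y) = x*y + x, evaluated at (3, 5) with x + d0 and y + d1.
        Dual2 x(3.0f, 1.0f, 0.0f), y(5.0f, 0.0f, 1.0f);
        Dual2 r = x * y + x;
        // df/dx = y + 1 = 6 (coefficient of d0); df/dy = x = 3 (coefficient of d1).
        std::printf("f = %g, df/dx = %g, df/dy = %g\n", r.a, r.b0, r.b1);
        return 0;
    }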
II. GENERIC PROGRAMMING

Suitable programming languages, such as C++, may be used to operate on differential types because of their potential for "generic programming." Generic programming refers to the methodology by which software is written so as to be independent of the underlying data types used. For example, generic programming is adopted by the Standard Template Library, which employs the C++ template mechanism to define abstract data types and algorithms independently of the types of their contents and arguments. In fact, there are now a number of template-based mathematical libraries for C++ that are generic. One of the recognized purposes of generic programming is to enable code that is independent of information such as the machine representation of floating point numbers (for example, of whether the type "float" or "double" is used).
Generically-programmed libraries may also be exploited to supply mathematical functions that will operate with a specially-defined object class for handling differentials. The specially-defined object class should be designed to export an interface similar enough to that of the more usual number types that it can replace them. The new class may be defined algebraically (e.g., through $d^2 = 0$), and therefore may be implemented similarly to object classes for complex numbers. Generally speaking, computing derivatives of a wide class of functions may thereby be accomplished as easily as computing those functions applied to complex numbers.
Illustrative examples in C++ are provided below. An underlying data type float may be used. The float type may be extended by adding in the new elements implied by the differential $d$ in a new Differential class. Every element of the Differential class can be written in the form $a + bd$, for real $a$ and $b$. The variable $a$ may be referred to as the real part and $b$ as the infinitesimal part. As coded in C++, the Differential class may be defined as:

    class Differential {
    public:
        float a;  // Real part
        float b;  // Infinitesimal part
        Differential(float a0, float b0 = 0.0f) : a(a0), b(b0) { }
    };

In the alternative to making members of the Differential class public, an accessor method may be used.
Operations on objects of type Differential should be defined in a specific way. For example, the sum of two Differential objects $a_0 + b_0 d$ and $a_1 + b_1 d$ should be defined as $(a_0 + b_0 d) + (a_1 + b_1 d) = (a_0 + a_1) + (b_0 + b_1) d$. Similarly, the product of two Differential objects should be defined as $(a_0 + b_0 d)(a_1 + b_1 d) = a_0 a_1 + (a_0 b_1 + a_1 b_0) d$. In C++, operators may be specially defined for special object classes. This may sometimes be referred to as "operator overloading." The C++ examples below demonstrate operator overloading for addition and multiplication of Differential data types, respectively:
    Differential operator+(const Differential &x, const Differential &y) {
        return Differential(x.a + y.a, x.b + y.b);
    }

    Differential operator*(const Differential &x, const Differential &y) {
        return Differential(x.a * y.a, x.a * y.b + x.b * y.a);
    }
The operators may be used to compute derivatives. For example, consider the function f(x) = (x+2)(x+1). This may be implemented in generic form as a polymorphic function:

    template<class X>
    X f(X x) {
        return (x + X(2)) * (x + X(1));
    }

Note that the constants in this expression have been cast to the "template" parameter type so that the arguments to the "+" operator are of the same type. An alternative is to overload the operators so that they can accept arguments of differing type. A differential variable d may now be defined by:

    Differential d(0, 1);

d may then be used to compute the derivative of f(x) at any desired value of x, by evaluating the function with d added to the argument. The derivative is the infinitesimal part of the returned value. For example, at x = 3, the value of f(X(3) + d) is
$$(3 + d + 2)(3 + d + 1) = (5 + d)(4 + d) = 20 + 9d, \qquad (5)$$
since $d^2 = 0$ by definition. Thus, computing the function f(X(3) + d) in generic form will return the differential object (20, 9); the exact derivative is given by its second term, 9. It should be apparent that this is the correct result. Note that exactly the same result for x = 3 may be obtained by defining Differential d(3, 1); and computing f(d). In this case, the desired value of x is supplied as the real part of the differential.
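Putting the pieces above together, the following short, self-contained program is offered as an illustrative sketch; the main() driver and printed output are additions for demonstration, not text from the original.

    #include <cstdio>

    class Differential {
    public:
        float a;  // Real part
        float b;  // Infinitesimal part
        Differential(float a0, float b0 = 0.0f) : a(a0), b(b0) { }
    };

    Differential operator+(const Differential &x, const Differential &y) {
        return Differential(x.a + y.a, x.b + y.b);
    }

    Differential operator*(const Differential &x, const Differential &y) {
        return Differential(x.a * y.a, x.a * y.b + x.b * y.a);
    }

    template<class X>
    X f(X x) { return (x + X(2)) * (x + X(1)); }  // f(x) = (x+2)(x+1)

    int main() {
        Differential r = f(Differential(3, 1));  // evaluate at x = 3 with d added
        std::printf("f(3) = %g, f'(3) = %g\n", r.a, r.b);  // prints 20 and 9
        return 0;
    }

Compiling and running this prints f(3) = 20 and f'(3) = 9, matching Equation 5.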
The foregoing examples illustrate the use of overloaded addition and multiplication operators to compute a derivative using a generic function, such as may be obtained from a template-based mathematical library. Subtraction should be a straightforward variation of addition, but defining a suitable division operator for Differential objects may be a little more subtle. One approach may be to adopt a generalized binomial expansion of $(1 + x)^{-1}$. For example:

$$\frac{a_0 + b_0 d}{a_1 + b_1 d} = \frac{a_0 + b_0 d}{a_1} \cdot \frac{1}{1 + (b_1 / a_1) d} = \frac{a_0 + b_0 d}{a_1} \left( 1 - \frac{b_1}{a_1} d \right) = \frac{a_0}{a_1} + \frac{a_1 b_0 - a_0 b_1}{a_1^2} d. \qquad (6)$$

Thus, the ratio of two Differential objects may be defined according to Equation 6. It should be apparent that this result requires that the real part $a_1$ of the denominator be non-zero, but this may easily be accomplished. This implementation of automatic differentiation is not limited to functions that are restricted to the foregoing operators. Any differentiable function defined over the real numbers can be extended to the objects of the Differential class by

$$f(a + bd) = f(a) + b f'(a) d. \qquad (7)$$
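By way of illustration, subtraction and division operators consistent with Equation 6 might be written as follows; this is a sketch added here, not code from the original.

    Differential operator-(const Differential &x, const Differential &y) {
        return Differential(x.a - y.a, x.b - y.b);
    }

    // Division per Equation 6; requires that y.a be non-zero.
    Differential operator/(const Differential &x, const Differential &y) {
        return Differential(x.a / y.a,
                            (y.a * x.b - x.a * y.b) / (y.a * y.a));
    }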
For example, a cosine operation may be defined for Differential objects as follows:

    Differential cos(const Differential &x) {
        return Differential(cos(x.a), -x.b * sin(x.a));
    }

Thus, once a relatively small number of operations have been defined for Differential objects, virtually any differentiable function such as commonly used in computer graphics and other applications can be automatically differentiated. In combination with a suitably generic vector and matrix library, vector and matrix expressions can be differentiated, as well. Complex code such as used in fluid dynamics simulation can also be differentiated to determine, for example, how the output parameters depend on an input parameter. Aspects of automatic differentiation in a C++ environment are further described in "Function Minimization and Automatic Differentiation Using C++," Jerrell, M.E., in Conference Proceedings on Object-Oriented Programming Systems, Languages, and Applications, ACM Press, 1989, and by Claus Bendtsen and Ole Stauning in "FADBAD, Flexible Automatic Differentiation Using Templates and Operator Overloading in ANSI C++," 2003, http://www.imm.dtu.dk/fadbad.
For many applications, for example, bundle adjustment as further described herein, it may be desirable to compute a second derivative. As suggested by the FADBAD library, one approach is to simply iterate the computation of the derivative. For example, in the single-variable case, a class of Differential objects may be defined over an arbitrary class:

    template<class X>
    class Differential {
    public:
        X a;  // Real part
        X b;  // Infinitesimal part
        Differential(X a0, X b0 = X(0)) : a(a0), b(b0) { }
        static Differential<X> d() { return Differential<X>(X(0), X(1)); }
    };

This may be compared to the first example for defining a Differential class, which was defined over a class of float variables. We may now compute second derivatives by iterating the above method. For example, to compute the second derivative of a first function "f", a second C++ function "g" may be used to compute the derivative of f in a generic manner. Then, g may be iterated on its result. The sample C++ code below should illustrate this method:

    template<class X>
    X f(X x) {
        return ...;  // Compute some function of x
    }

    //
    // Compute f'(x)
    //
    template<class X>
    X g(X x) {
        return f(Differential<X>(x) + Differential<X>::d()).b;
    }

    //
    // Compute f''(x) = g'(x)
    //
    ... = g(Differential<float>(x) + Differential<float>::d()).b;
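As a concrete, self-contained instance of this iterated pattern, consider the following illustrative program; the particular choice f(x) = x^3 is an assumption made for demonstration.

    #include <cstdio>

    template<class X>
    class Differential {
    public:
        X a, b;  // Real and infinitesimal parts
        Differential(X a0, X b0 = X(0)) : a(a0), b(b0) { }
        static Differential<X> d() { return Differential<X>(X(0), X(1)); }
    };

    template<class X>
    Differential<X> operator+(const Differential<X> &x, const Differential<X> &y) {
        return Differential<X>(x.a + y.a, x.b + y.b);
    }

    template<class X>
    Differential<X> operator*(const Differential<X> &x, const Differential<X> &y) {
        return Differential<X>(x.a * y.a, x.a * y.b + x.b * y.a);
    }

    template<class X>
    X f(X x) { return x * x * x; }  // f(x) = x^3

    template<class X>
    X g(X x) {  // f'(x), computed generically
        return f(Differential<X>(x) + Differential<X>::d()).b;
    }

    int main() {
        // f''(x) = g'(x); for f(x) = x^3, f''(2) = 6 * 2 = 12.
        float fpp = g(Differential<float>(2.0f) + Differential<float>::d()).b;
        std::printf("f''(2) = %g\n", fpp);
        return 0;
    }

Here g returns a Differential<float> whose real part is f'(2) and whose infinitesimal part, extracted by the final .b, is the desired second derivative.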
Generic programming techniques may also be suitable for use with interval arithmetic and affine arithmetic. By implementing these methods generically, interval arithmetic and affine arithmetic may also be combined with automatic differentiation for applications to inverse rendering problems, such as photogrammetry.
III. APPLICATIONS TO PHOTOGRAMMETRY, MATCH-MOVING, AND OTHER DIGITAL MOVIE APPLICATIONS

Many applications in digital movie post-production may involve inverse rendering; that is, determining what input to a renderer will produce an output that matches a given image. For example, a number of parameters $(\theta_1, \theta_2, \ldots)$ may be provided as input to a 3D rendering process. These parameters may range from transformation parameters such as angle of rotation to shader parameters such as the exponent in a Blinn-Phong shader. In general, a rendering process may be considered to be a function

$$f : (\theta_1, \theta_2, \ldots) \to (I_{i,j,c}), \qquad (8)$$

where $I_{i,j,c}$ represents the color of the $c$ channel of the $(i,j)$-th pixel of the rendered image. If $J_{i,j,c}$ is some "correct" or desired result used as a baseline, then we can write a sum-of-squares error term

$$e = \sum_{i,j,c} \left( I_{i,j,c} - J_{i,j,c} \right)^2. \qquad (9)$$
If the rendering code is written generically enough that some parameters may be replaced by Differential objects, then $e$ and its derivative may be automatically computed with respect to those parameters, as described above. The automatic differentiation tool may be applied in the context of a minimization algorithm, such as non-linear conjugate gradients, to efficiently derive input parameters that result in an image $I$ that best matches $J$.
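As a minimal sketch of this idea, assuming the float-based Differential class and overloaded operators defined in Section II above, the one-parameter "renderer" below is a hypothetical stand-in for real rendering code, and all names are illustrative.

    // Toy "renderer": one output intensity as a function of one shader
    // parameter theta. Replacing float by Differential propagates d through it.
    template<class X>
    X renderPixel(X theta) {
        return theta * theta + X(2) * theta;  // stand-in for real shading code
    }

    // Sum-of-squares error e(theta) against a target value J, per Equation 9.
    template<class X>
    X error(X theta, float J) {
        X diff = renderPixel(theta) + X(-J);
        return diff * diff;
    }

    // de/dtheta at theta = t, read from the infinitesimal part of the result.
    float errorDerivative(float t, float J) {
        return error(Differential(t, 1.0f), J).b;
    }

The returned infinitesimal part is exactly the gradient component that a minimization algorithm such as non-linear conjugate gradients would consume.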
Many problems in post-production can be solved by differentiating or inverting a subsystem of a 3D renderer. For example, part of a ray-tracer takes as input the parameters of a light ray, and returns as output the texture coordinates that the ray intersects. By implementing this ray-tracing code generically, we can automatically compute the derivative of the texture coordinates as a function of the light ray parameters. This may be useful, for example, in performing anti-aliased texture mapping. Another inverse rendering problem concerns transforming and projecting 3D points to 2D screen coordinates. The inverse of this operation may comprise deriving a 3D modeled geometry that best fits a collection of projected 2D points; i.e., photogrammetry and match-moving.
Referring to Fig. 1, it may be desired to reconstruct a set of 3-dimensional points $P_i, i \in I$ (for example, P1, P2, P3 and P4), with coordinates $(p_i)$, representing a three-dimensional scene 100. The coordinates $(p_i)$ may be defined with respect to a suitable datum 110 for a coordinate system referenced to scene 100. A set of images exists indexed by the set $J$, in which some of the $P_i$ appear projected. For each image $j$ a camera projection function $c_j$ exists, such that $c_j(p_i)$ is the projection of the point $P_i$ into a 2-dimensional screen space associated with camera $j$. For example, a screen space 104 may be associated with camera 102, and screen space 106 with camera 108. Selected cameras, for example, camera 108, may be freely posed within scene 100. In the general case, a screen space associated with a camera need not encompass all points of interest within a scene. For example, screen space 106 does not encompass P4, which, however, is encompassed by screen space 104. Note that the projection function comprises a type of reverse-rendering function. Methods for defining a suitable projection function $c_j$ are well understood in the art of photogrammetry, and need not be described herein.
An index set $R$ may be defined such that $(i,j) \in R$ for each $p_i$ that appears projected in an image $j$. For each $(i,j) \in R$, a 2D position $z_{i,j}$ in an associated screen space may be measured from a corresponding image. For example, 2D positions for $z_{1,1}$ to $z_{4,1}$ may be measured in an image from camera 102. Likewise, positions for $z_{1,2}$ to $z_{3,2}$ may be measured in an image from camera 108.
Because the positions of interest (e.g., the coordinates of P1 - P4 with respect to datum 110) and the projection function $c_j$ are at least partially undefined, the position $z_{i,j}$ is related to the projected position $c_j(p_i)$ by a varying amount of error $e_{i,j}$. In other words,

$$e_{i,j} = z_{i,j} - c_j(p_i), \qquad (10)$$

where $e_{i,j}$ is the difference between the measured and actual projections of $P_i$. The error $e_{i,j}$ varies as a function of $c_j$ and $(p_i)$, and the measured 2D positions $z_{i,j}$ are generally fixed.
The amount of error may be defined over 2D coordinates $x, y$ in the relevant screen space. Suppose that the $x$ and $y$ coordinates of $e_{i,j}$ are independent normally distributed variables whose components have variance $\sigma_{i,j}^2$. The positions $(p_i)$ of the points $P_i$ and any unknown parameters of $c_j$, such that the deviation of the $e_{i,j}$ is minimized in the least-squares sense, may therefore be determined. The maximum likelihood estimation for the positions $(p_i)$ and unknown parameters of $c_j$ may be determined from the minimum error $e$, wherein $e$ is defined by
$$e = \sum_{(i,j) \in R} \frac{\left| z_{i,j} - c_j(p_i) \right|^2}{\sigma_{i,j}^2}. \qquad (11)$$

It should be apparent that the minimum error may be determined from Equation 11 by finding its derivative and solving for those values of $(p_i)$ at which the derivative is zero.
One of ordinary skill may employ a different function for expressing the error, as known in the art. If desired, the derivative can be used to minimize the error function using an iterative minimization algorithm, as known in the art and sometimes referred to as bundle adjustment. As should be apparent, the solution process may be greatly facilitated by expressing Equation 11 using generic programming techniques as disclosed herein, and solving for the derivative by automatic differentiation. Of course, various choices for a solution process may present themselves during the design of a photogrammetry application implementing the invention, and exemplary details are provided below.
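To make the connection to photogrammetry concrete, the following sketch is offered under strong simplifying assumptions: a single camera at the origin looking down the z-axis, focal length as the only unknown, and unit variance. It is an illustration with hypothetical names, not the patented implementation, and it again assumes the Differential class of Section II.

    // Simple pinhole projection of a world point (px, py, pz) to screen
    // coordinates (u, v), with the focal length as the unknown parameter.
    // Written generically so that X may be float or Differential.
    template<class X>
    void project(X focal, float px, float py, float pz, X &u, X &v) {
        u = focal * X(px / pz);
        v = focal * X(py / pz);
    }

    // Squared reprojection error for one measured 2D position (zu, zv),
    // in the spirit of Equation 11.
    template<class X>
    X reprojError(X focal, float px, float py, float pz, float zu, float zv) {
        X u(0), v(0);
        project(focal, px, py, pz, u, v);
        X du = u + X(-zu), dv = v + X(-zv);
        return du * du + dv * dv;
    }

    // de/dfocal at focal = f0, via the infinitesimal part.
    float errorSlope(float f0, float px, float py, float pz, float zu, float zv) {
        return reprojError(Differential(f0, 1.0f), px, py, pz, zu, zv).b;
    }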
But before discussing the solution process in greater detail, certain other general observations should be made. In practice, further information about the structure of the $P_i$ is often available. For example, it may be known that certain ones of the $P_i$ are coplanar. Such information may be used to further constrain the solution. At the same time, at least some parameters of the projection function $c_j$ may be unknown. For example, the cameras may have unknown focal length and lens distortion, or may be located at an unknown position or orientation. Unknown parameters may be represented as variables in the projection function. Advantageously, when a generic programming/automatic differentiation approach is adopted, a great deal of flexibility may be accommodated in the definition and constraint of the solution. For example, a user interface may be provided for a user to define parameters of the solution using flexibly-defined relational expressions. These relations may be incorporated into the solution and differentiated automatically, without requiring manual differentiation.

The foregoing general principles may be implemented in various ways in a solution process. Referring to Fig. 2, according to an embodiment of the invention, one such solution method 200 may be summarized as follows. At step 202, image data representing a plurality of images of a scene are received into a solution process. For example, a user interface may be provided for a user to identify and select pertinent images from a database, and then initiate a solution process using the selected images. Images of the scene may be collected using any suitable camera, as known in the art. At step 204, user input designating corresponding points or other corresponding features appearing in two or more of the selected images is received. The user input serves to mark and identify corresponding features appearing in more than one image. Each corresponding feature should be located on a node of the scene graph. For example, while displaying multiple images on a computer display, a user may indicate corresponding points on the images using a pointing device. Any suitable method may be used to receive user input indicating a plurality of corresponding features in the image data. Methods for entering and recording such measurements are known in the art.
At step 206, a preliminary solution estimate for the scene is received for use in the solution process. The preliminary solution estimate may be developed by a user, based on any desired available information or estimate. One convenient approach may be to accept a preliminary 3D model as the solution estimate, preferably in scene graph form. Many users may be familiar with 3D modeling software, and may build an approximation to the scene using any suitable modeling program. For example, in an embodiment of the invention, a solution algorithm was designed to accept an approximate model, in scene graph form, constructed using Alias|Wavefront Maya™ as input. In the alternative, an initial solution estimate may be developed automatically, or adopted from an arbitrary set of values. Points in the scene graph or other solution estimate should be related to measurements of 2D position, e.g., $z_{i,j}$, in the image data, based on the user input from step 204. Representing the scene being reconstructed as a DAG, for example, a scene graph, achieved more accurate results from far smaller sets of input images than prior-art methods that do not make use of scene graphs. Scene graph hierarchies, for example, as described in Chapter 7, "Object Hierarchy and Simple PHIGS (SPHIGS)," in Computer Graphics, Principles and Practice in C by Foley et al., Addison-Wesley, 1995, ISBN 0-201-84840-6, are well known in the art of computer graphics. The use of scene graphs is well supported by standards for computer graphics. In many computer graphics applications, every discrete set of related graphics data (called "the scene") is represented in a corresponding scene graph. Each object in the scene graph stands in a hierarchical relationship to the other objects in a scene. More precisely, a scene graph is a type of directed, acyclic graph, meaning that it is a one-way tree structure without looping, like a family tree. For example, a "parent" and/or "children" are identified for each object or node. A parent may have multiple children, and a child may have multiple parents. A child is not permitted to be a parent to any node in its parental lineage. Elements that have a data component, like viewable objects or camera locations, are represented at the nodes. Each node represents a function that will return a value depending on input parameters including space and time.
An important aspect of scene graphs is the defined relationship between hierarchically-related objects, sometimes referred to as a "transform." In particular, the relative orientation, size, mode of attachment, or other relationship of a child object with respect to its parent is the transform of the child object. An object's transform can be manipulated to adjust the relationships between the parent and the child objects. For example, to adjust the size of a hand (child) relative to an arm (parent), a size parameter of the hand transform may be increased. Transforms are inherited, in that the transform of an object is inherited by its children. For example, when the transform for the arm is adjusted to make the arm twice as large, then the hand grows twice as large, too. The entire collection of objects, parent-child relationships, and transforms comprises the scene graph. So long as any desired transforms are expressed as differentiable functions, they may readily be incorporated into an expression for $e$ and differentiated.
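For illustration only, a toy, generically-typed node with a single uniform-scale transform shows how inherited transforms remain differentiable; this structure is an assumption for demonstration, not the patent's implementation, and it relies on the arithmetic operators of Section II when X is a Differential type.

    // A toy scene-graph node whose transform is one uniform scale factor.
    template<class X>
    struct Node {
        const Node<X> *parent;  // null for the root of the hierarchy
        X scale;                // transform: uniform scale relative to parent
    };

    // A child inherits its parent's transform: its world-space scale is the
    // product of the scales along its lineage.
    template<class X>
    X worldScale(const Node<X> *n) {
        X s = n->scale;
        for (const Node<X> *p = n->parent; p; p = p->parent)
            s = s * p->scale;
        return s;
    }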
Scene graphs are useful for modeling movement and scaling of objects. When an object is moved, grows in size, or shrinks, normally all of the child objects move, grow, or shrink along with it. A computer-generated actor may provide a simple
example. When the actor's arm is moved, its attached hand normally moves along with it. In terms of the scene graph, the hand is defined as a child of the arm. Advantageously, when the actor's arm is moved, the animator doesn't need to animate the hand separately. The relationship defined in the scene graph ensures that the hand moves along with the arm. Static objects may also be included in a scene graph. An example of a static object is a building with windows. If the building itself is the parent of the windows, then when the building is relocated during the photogrammetry method, the windows will automatically move with it. Additionally, if the size or proportion of the building is changed, the windows will also scale with it.
Any suitable software application, for example, Open Inventor™ or Maya™, may be used to build a scene graph for use with a method according to the invention. Initially (prior to the application of photogrammetry), the scene graph contains partial and/or approximate information. For example, windows may be related as child objects to a building whose size and position are not yet known. In such case, a transform for each window may be configured to contain partial information about its relative size and orientation. For example, it may be specified that the window has rectangular edges, lies flat in a wall of the building, and is oriented parallel to the edges of the wall, without specifying the dimensions of the window. Any parameter of the transform that is incomplete or incorrect is automatically computed using photogrammetry techniques. Complete information about the scene is not required, because the information in the initial scene graph guides the photogrammetry solution, but does not determine it.
The 3D model, preferably in scene graph form, may be accepted as the initial solution estimate. In addition, at step 208, relationships in the preliminary solution estimate may be further defined from user input. For example, in an embodiment of the invention, users may define transformations in a Maya™ scene graph to represent partially known information in the scene. For example, if an object in the scene is known to be a rectangular solid, but has unknown dimensions, then users may instantiate a "cube" object in the scene graph and scale it using a transform. The user may then mark variables in the scene graph whose values are unknown. In the rectangular solid example, a user may mark three numbers defining the unknown dimensions of the solid. Various parameters may be marked as unknown in a scene: scalings, rotation angles, translations, camera focal lengths, and so on.
In addition, the photogrammetry projection function should be defined to include information or assumptions regarding camera parameters, including but not limited to camera position, camera orientation, focal length, lens and/or focal plane distortion, and other factors that affect the appearance of the image. At step 210, such of these parameters as are known may be received for use in the solution process. As previously mentioned, at least some projection function parameters may be marked as unknown, and solved for. For example, the camera pose may be unknown, particularly if the images represent a time sequence during which the camera moved, or if the pose was not measured for any other reason. Other camera parameters are often known, for example, focal length. Relationships may be defined between camera parameters. For example, the focal lengths of two or more cameras may be equated. All of these camera parameters may be represented as nodes in the initial scene graph. Advantageously, this may facilitate solving for unknown camera parameters in the same way - i.e., using the same homogeneous set of equations - as the geometry of the scene. This may greatly enhance the power and flexibility of a method according to the invention, compared to prior-art methods in which camera parameters are handled separately. For example, camera pose at any given time may be treated just like any other unknown parameter for solution, enabling cameras to be freely posed inside or outside of the scene.
At step 212, an error function $e$ for calculating a solution is determined. The solution may comprise the desired positions $(p_i)$ and unknown parameters of the projection function, for example, any unknown camera parameters. The error function $e$ is defined such that it is differentiable, and represents a difference (for example, a least-squares difference as generally expressed in Equation 11) between predicted and measured values of the projected points $z_{i,j}$. It should be apparent that $e$ should generally comprise a system of equations that may be expressed in matrix form.
In an embodiment of the invention, an error function may be defined using a rendering subsystem of an existing rendering and modeling program. The rendering subsystem may be modified to compute the projections of points in this scene hierarchy (i.e., to perform reverse rendering) in a generic way. Preferably, the existing rendering subsystem utilizes scene graphs, including transforms and camera parameters, so the resulting generically-programmed projection function $c_j$ may readily accept input and provide output using a standard scene graph format.
At step 214, the error function $e$ may be iterated so as to discover the collection of unknown parameters, e.g., the unknown marked parameters and the points $(p_i)$ of the scene geometry, that minimizes its value. The value of these parameters at the global minimum of $e$ may be regarded as the solution for the originally-defined reverse-rendering problem. The iteration may begin with a solution estimate comprising the information received at steps 206-210. Minimization may be performed using an iterative minimization algorithm. As further described below, it should be advantageous to include a full computation of the exact second derivative (Hessian), using an automatic differentiation method disclosed herein, to guide the iterative solution process to a more rapid solution, according to Newton's method.
In particular, an active set variation of the Levenberg-Marquardt algorithm, suitable for bounded and unbounded minimization, may be used for bundle adjustment. However, instead of approximating the Hessian (the matrix of second derivatives) of e as a function of the Jacobian (as some have done in the past), the full exact Hessian and Jacobian may be calculated using automatic differentiation as disclosed herein. At each iteration, a conjugate gradient algorithm may be used to solve the linear system in the Hessian.
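For illustration, a single damped-Newton (Levenberg-Marquardt-style) update might be sketched as follows. The gradient g and exact Hessian H are assumed to have been produced by automatic differentiation; the two-parameter case is solved in closed form here, where a production system would use a sparse conjugate gradient solver. All names are hypothetical.

    struct Vec2 { float x, y; };
    struct Mat2 { float a, b, c, d; };  // row-major [[a, b], [c, d]]

    // Solve (H + lambda*I) * step = -g for a two-parameter problem.
    Vec2 lmStep(const Mat2 &H, const Vec2 &g, float lambda) {
        float a = H.a + lambda, b = H.b;
        float c = H.c, d = H.d + lambda;
        float det = a * d - b * c;  // assumed non-zero after damping
        Vec2 s;
        s.x = -( d * g.x - b * g.y) / det;
        s.y = -(-c * g.x + a * g.y) / det;
        return s;
    }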
At step 216, the parameter values corresponding to the global minimum of $e$ may be used to build a model of the three-dimensional scene. For example, solved parameters may be incorporated into the preliminary estimate scene graph as a parameter of an existing or newly-added node, to complete the solution. Finally, the reconstructed scene may be presented to the user. If the reconstruction contains substantial errors, the user may be given the opportunity to make adjustments to the rough model and run the process again. If the result is substantially correct, it may be readily detailed by a graphics artist, advantageously already being in a standard scene graph format.
Referring again to solution step 214, it should be apparent that the solution should be generalized to include partial differentiation, because $e$ in general depends on more than one unknown parameter. It may be advantageous to represent the vector $(b_i)_{i \in I}$ (the infinitesimal part of the partial differential object employed in automatic differentiation) sparsely, as index-value pairs rather than as a dense list of values. This permits sparse representation of the Hessian, and use of a sparse conjugate gradient method to solve for the iteration step. Exploiting sparsity may prove very helpful for obtaining good performance. Consider computing $N^2$ second derivatives, with respect to $(x_1, \ldots, x_N)$, of
$$f = \sum_{i=1}^{L} f_i, \qquad (12)$$

where each $f_i$ is a function of some lesser number $M$ of the $x_i$. It follows that the Hessian has no more than $LM^2$ non-zero terms. For match-moving problems, the number of non-zero terms is usually much less than $N^2$.
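A minimal sketch of the sparse index-value representation described above follows; it is illustrative only (just the addition operator is shown), and the names are hypothetical.

    #include <map>

    // Sparse differential: real part plus an infinitesimal part stored as
    // index-value pairs, so parameters that a term does not touch cost nothing.
    struct SparseDiff {
        float a;                  // real part
        std::map<int, float> b;   // parameter index -> coefficient of d_i
    };

    SparseDiff operator+(const SparseDiff &x, const SparseDiff &y) {
        SparseDiff r;
        r.a = x.a + y.a;
        r.b = x.b;  // copy x's sparse entries
        for (std::map<int, float>::const_iterator it = y.b.begin();
             it != y.b.end(); ++it)
            r.b[it->first] += it->second;  // merge y's entries
        return r;
    }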
In an embodiment of the invention that was used for post-production work on a motion picture, a form of automatic differentiation known as forward mode differentiation was implemented. An alternative approach is reverse mode automatic differentiation, which is described in the mathematics and computer science art. Reverse mode differentiation may often be better suited to problems involving large numbers of input variables. However, in an embodiment of the invention in which reverse mode differentiation was implemented, the performance realized was inferior to that achieved in embodiments that implemented forward mode differentiation with sparse representation of the differentials. It may also be possible to implement the invention using a sparse variation of reverse mode differentiation, which is as yet untested for this application.

Method 200 readily encompasses the solution of complex match-moving problems. For example, sequences of live action images may be treated as sets of independent images, to permit simultaneous solution of both static and time-dependent parameters. In fact, instead of computing match-moves from frame to frame incrementally, as traditionally done, match-moves may be computed by minimizing over all frames simultaneously. To the extent that the input images represent a time sequence, the user may indicate whether or not a parameter is to be considered animated. Those parameters designated as animated (changing as a function of time) may be handled at each time step as an independent parameter for solution. In practice, applying an embodiment of the invention over multiple frames in a sequence often resulted in successful Levenberg-Marquardt minimizations in spaces of dimension greater than 10,000, demonstrating a significant advance in the art.
Advances over the prior art have been demonstrated in other ways, as well. The invention affords users a high degree of flexibility in describing known information about a scene. In an embodiment of the invention, any transformation parameters, such as rotation angles or relative scale, could be marked for solving. An arbitrary number of cameras could be placed anywhere in the scene hierarchy, and parameters for camera transforms could be marked for solving. The solution method proved able to reconstruct animated articulated geometry, and even to reconstruct cameras mounted on articulated geometry (as with real camera rigs). Such cameras may be handled like any other animated parameter.
Additionally, some 3D modeling applications, for example, Maya™, support the connection of parameters in a scene graph using symbolic expressions. Such applications may be used to build input for the solution step. In an embodiment of the invention, an expression evaluator similar to that provided in Maya™ was written in C++, in a generic and thus differentiable manner. Expressions representing relationships between parameters of a scene were allowed to enter into the expression for $e$. By defining such expressions, users may be able to express constraints that may be difficult to express using transforms. For example, users may be able to specify that two independent objects are of the same unknown height, or that two cameras have the same unknown focal length.
It should be apparent that the foregoing capabilities tend to greatly increase the complexity of the expression for $e$. Despite this additional complexity, an embodiment of the invention proved capable, in an intensely demanding environment for a feature film production, of reliably, accurately, and swiftly identifying global minima for $e$. This too demonstrates the practical and proven advantages of the invention.
In general, the invention has proved to be a highly efficient method for computing derivatives used for solving reverse-rendering problems. In particular, the invention has provided an efficient solution to the problems of reconstruction of geometry and camera moves from film. As implemented in a production environment, the invention has proven capable of solving for both structured and unstructured geometry. An application embodying the invention has been found suitable for completely replacing prior-art commercial match-moving and photogrammetry software for time-sensitive and major visual effects projects for motion pictures.

The effectiveness of the invention as implemented for motion picture post-production rested in no small part on a consistently generic implementation of the expression for $e$. Once reusable library components were written for defining $e$ in a flexible and generic manner, finding the solution for $e$ was no more difficult to implement than the forward code that simply transforms and projects points. The solver consisted almost entirely of generic code to compute $e$, and library code to optimize generic functions.
What is more, the invention, and in particular, efficient automatic differentiation in a generic programming environment, enabled a solution using the full Newton method instead of the commonly-used Gauss-Newton approximation. The full Newton method yields an accurate solution very quickly, but is often considered too complex for practical application. The application of automatic differentiation in the generically-programmed environment removed these formerly formidable limitations. However, the Gauss-Newton method may also be implemented, and may be preferable for some applications. Comparison between the two methods is a complex topic, and either method may be adopted, depending on the circumstances.
Although there is no recognized standard set of benchmarks for comparing photogrammetry and match-moving applications, the invention as implemented for motion picture post-production is believed to be far more efficient than comparable prior-art applications. In a production environment, 3D models of scenes may be built incrementally (such as by using a commercially available modeler), and then the photogrammetry/match-moving solver may be applied to find the best local minimum using the most current information. When implemented in this fashion, with solution processing performed on a single 2.8 GHz Xeon™ CPU, the time required to reach a solution was generally an insignificant part of the workflow. Usually, a scene is constructed in an incremental manner, with additional approximations for scene elements after interim solution processing. Such incremental changes can be solved very quickly indeed.
Even batch solutions for an entire scene at once can be performed quickly. For example, in a scenario such as solving for a six-hundred-frame match-move, determining six camera transform parameters for each frame as well as approximately one hundred parameters that do not vary in time (requiring minimization in a thirty-seven-hundred-dimensional space), the resulting error function might require as little as about five minutes, or at most about forty-five minutes, to solve. Significant further reductions in processing time may be realized by using selected frames (for example, every tenth frame) to obtain a good approximation that may be used to initiate a full solve. In comparison, prior-art commercial match-moving tools do not use all frames simultaneously, but instead solve one frame at a time, or solve using a small batch at a time. Thus, prior-art match-moving tools would require considerably longer to solve a large number of frames. But when using the implementation according to the invention in this way, the time to solve was often significantly less than the time to communicate the results back to the 3D modeler.
When working with such characteristically large solution spaces, often having a dimensionality running into the thousands and a potentially complex topology due to the presence of parameters such as rotation angles, one might expect many local minima to be encountered, impeding discovery of the global minimum. But surprisingly, when the invention was implemented for real photogrammetry and match-moving problems that appeared during motion picture post-production activities, it was usually possible to start the Levenberg-Marquardt algorithm and arrive directly at the unique global minimum. In rare cases, a local minimum was discovered that was not the global minimum. Such local minima were often related to the global minimum by a simple symmetry operation. For example, a local minimum might be discovered having an object that was inverted with respect to the axis pointing directly forward from the camera in which it was projected. In these cases it was an easy process to correct the inversion and then repeat the solution algorithm with the corrected scene graph as the initial solution estimate.
Part of the effectiveness of the invention in directly reaching the global minimum is believed to result from the implementation of the full Newton method, including computation of the exact Hessian. This, in turn, was enabled by the efficiency and ease with which the differentials could be computed during each iteration using the generic programming and automatic differentiation methods described above. This was useful in ensuring that the algorithm never left the "basin" around the global minimum to become lost in the solution space, and made it less likely for the algorithm to become trapped in a local minimum.
According to the foregoing, therefore, one of ordinary skill may construct a system for performing a method according to the invention. Figure 3 shows one such system 300, comprising a computer 302 connected to receive image data from a database 304. System 300 may further comprise a memory 306 operably associated with the computer. Memory 306 may contain coded instructions to enable one of ordinary skill to carry out a method according to the invention. For example, memory 306 may comprise instructions for performing steps of a method according to the invention, for example:

(i) receiving image data comprising a plurality of photographic images of a three-dimensional scene;

(ii) receiving user input indicating a plurality of corresponding features each appearing in at least two of the plurality of photographic images;

(iii) determining an error function for a reverse-rendering function, the reverse-rendering function defining a relationship between three-dimensional coordinates in the three-dimensional scene and corresponding two-dimensional coordinates of the plurality of corresponding features; and

(iv) minimizing the error function to determine a solution corresponding to a global minimum of the error function, comprising calculating at least first derivatives of the error function using automatic differentiation, thereby computing intermediate solution estimates for successive iterations of the error function, until the solution estimates converge to the solution.

Instructions for any other desired step of a method according to the invention for performance by computer 302 may also be held in memory 306. Any of the foregoing instructions may also be encoded on removable media 308, for reading by computer 302 or another computer. Suitable computer systems for carrying out the invention are known in the art, and any suitable system may be used.
Having thus described the improved photogrammetry method, it should be apparent to those skilled in the art that certain advantages of the within system have been achieved. It should also be appreciated that various modifications, adaptations, and alternative embodiments thereof may be made within the scope and spirit of the present invention, as discussed above. For example, while a specific application to photogrammetry and match-moving has been disclosed, it should be apparent that the invention is not limited thereby. The invention is defined by the appended claims.

CLAIMS

What is Claimed is:
1. A method for determining parameters of a three-dimensional scene using a reverse-rendering function, the method comprising: receiving image data comprising a plurality of photographic images of a three-dimensional scene; receiving user input indicating a plurality of corresponding features each appearing in at least two of the plurality of photographic images; determining an error function for a reverse-rendering function, the reverse- rendering function defining a relationship between three-dimensional coordinates in the three-dimensional scene and corresponding two-dimensional coordinates of the plurality of corresponding features; and minimizing the error function to determine a solution corresponding to a global minimum of the error function, comprising calculating at least first derivatives of the error function using automatic differentiation, thereby computing intermediate solution estimates for successive iterations of the error function, until the solution estimates converge to the solution.
2. The method of Claim 1, wherein the determining step further comprises determining the error function comprising reverse-rendering parameters selected from the group consisting of camera position, camera orientation, focal length, aperture size, lens distortion, and distortion of focal plane.
3. The method of Claim 1 , wherein the determining step further comprises determining the error function comprising reverse-rendering parameters including at least one camera position located within the three-dimensional scene.
4. The method of Claim 1 , further comprising receiving an initial scene graph comprising at least a portion of an initial solution estimate.
5. The method of Claim 4, wherein the receiving an initial scene graph step further comprises receiving the initial scene graph comprising at least one transform defining a relationship between a parent object and a child object.
6. The method of Claim 1 , wherein the minimizing step further comprises calculating an exact Hessian of the error function.
7. The method of Claim 1, further comprising initializing at least selected three-dimensional coordinates of the plurality of corresponding features and camera parameters for the plurality of photographic images as an initial solution estimate.
8. The method of Claim 1 , further comprising defining a resulting scene graph for the scene consistent with the solution.
9. The method of Claim 1 , wherein the determining step further comprises determining the error function further defined by a user-selected differentiable relationship between user-selected parameters of the reverse-rendering function.
10. The method of Claim 1 , wherein the determining step further comprises determining the error function further defined by animation parameters to solve match- moving relationships between frames of a motion picture sequence.
11. The method of Claim 1 , wherein the receiving step further comprises receiving the plurality of photographic images representing a time sequence, wherein the determining step further comprises determining the error function further defined by time parameters for solving match-moving relationships between frames of a motion picture sequence, and wherein the minimizing step further comprises minimizing the error function simultaneously over the frames.
12. The method of Claim 1 , wherein the receiving image data step comprises receiving the photographic images comprising digital images from a digital camera.
13. A method for determining parameters of a three-dimensional scene using a reverse-rendering function, the method comprising: receiving a plurality of two-dimensional images, at least one of the images captured using a camera posed inside of the three-dimensional scene; determining an error function for a reverse-rendering function, the reverse-rendering function defining a relationship between three-dimensional coordinates in the three-dimensional scene and corresponding two-dimensional coordinates of a plurality of corresponding features in the two-dimensional images; and minimizing the error function to determine a solution corresponding to a global minimum of the error function, thereby computing intermediate solution estimates for successive iterations of the error function, until the solution estimates converge to the solution.
14. The method of Claim 13, further comprising receiving an initial scene graph comprising estimated scene parameters; and defining an initial solution estimate for the error function based on the estimated scene parameters.
15. The method of Claim 14, wherein the receiving an initial scene graph step further comprises receiving the estimated parameters of the scene comprising at least one transform defining a relationship between a parent object and a child object.
16. The method of Claim 13, wherein the determining step further comprises determining the error function further defined by a user-selected differentiable relationship between user-selected ones of the parameters.
17. The method of Claim 13, wherein the determining step further comprises determining the error function further defined by animation parameters to solve match-moving relationships between frames of a motion picture sequence.
18. The method of Claim 13, wherein the minimizing step further comprises calculating a Hessian matrix using automatic differentiation, thereby guiding the minimizing step according to Newton's method.
19. A system for defining a digital model of a three-dimensional scene using photogrammetry, the system comprising: a computer having a memory, the memory holding program instructions comprising: receiving image data comprising a plurality of photographic images of a three-dimensional scene; receiving user input indicating a plurality of corresponding features each appearing in at least two of the plurality of photographic images; determining an error function for a reverse-rendering function, the reverse-rendering function defining a relationship between three-dimensional coordinates in the three-dimensional scene and corresponding two-dimensional coordinates of the plurality of corresponding features; and minimizing the error function to determine a solution corresponding to a global minimum of the error function, comprising calculating at least first derivatives of the error function using automatic differentiation, thereby computing intermediate solution estimates for successive iterations of the error function, until the solution estimates converge to the solution.
20. The system of Claim 19, wherein the program instructions further comprise instructions for receiving an initial scene graph comprising at least a portion of an initial solution estimate.
21. The system of Claim 19, wherein the program instructions further comprise instructions for determining the error function further defined by a user-selected differentiable relationship between user-selected ones of the parameters.
22. The system of Claim 19, wherein the program instructions further comprise instructions for determining the error function further defined by animation parameters to solve match-moving relationships between frames of a motion picture sequence.
23. The system of Claim 20, wherein the program instructions further comprise instructions for minimizing the error function by calculating a Hessian matrix using automatic differentiation, thereby guiding the iteration step according to Newton's method.
24. The system of Claim 20, wherein the program instructions further comprise instructions for receiving the plurality of photographic images representing a time sequence, wherein the determining step further comprises determining the error function further defined by time parameters for solving match-moving relationships between frames of a motion picture sequence, and wherein the minimizing step further comprises minimizing the error function simultaneously over the frames.
25. The system of Claim 19, wherein the program instructions further comprise instructions for receiving the image data comprising at least one image from a camera at an unknown location inside the three-dimensional scene.
PCT/US2003/036710 2002-11-15 2003-11-17 Reverse-rendering method for digital modeling WO2004047008A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2004553822A JP2006507585A (en) 2002-11-15 2003-11-17 Reverse rendering method for digital modeling
AU2003295582A AU2003295582A1 (en) 2002-11-15 2003-11-17 Reverse-rendering method for digital modeling
EP03786780A EP1565872A4 (en) 2002-11-15 2003-11-17 Reverse-rendering method for digital modeling

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US42656002P 2002-11-15 2002-11-15
US60/426,560 2002-11-15

Publications (1)

Publication Number Publication Date
WO2004047008A1 (en) 2004-06-03

Family

ID=32326376

Family Applications (3)

Application Number Title Priority Date Filing Date
PCT/US2003/036839 WO2004047426A2 (en) 2002-11-15 2003-11-17 Reality-based light environment for digital imaging in motion pictures
PCT/US2003/036710 WO2004047008A1 (en) 2002-11-15 2003-11-17 Reverse-rendering method for digital modeling
PCT/US2003/036720 WO2004047009A2 (en) 2002-11-15 2003-11-17 Method for digitally rendering skin or like materials

Family Applications Before (1)

Application Number Title Priority Date Filing Date
PCT/US2003/036839 WO2004047426A2 (en) 2002-11-15 2003-11-17 Reality-based light environment for digital imaging in motion pictures

Family Applications After (1)

Application Number Title Priority Date Filing Date
PCT/US2003/036720 WO2004047009A2 (en) 2002-11-15 2003-11-17 Method for digitally rendering skin or like materials

Country Status (5)

Country Link
US (6) US7079137B2 (en)
EP (3) EP1565872A4 (en)
JP (3) JP4220470B2 (en)
AU (3) AU2003295586B2 (en)
WO (3) WO2004047426A2 (en)

Families Citing this family (147)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1074943A3 (en) * 1999-08-06 2004-03-24 Canon Kabushiki Kaisha Image processing method and apparatus
US7342489B1 (en) 2001-09-06 2008-03-11 Siemens Schweiz Ag Surveillance system control unit
US7239345B1 (en) * 2001-10-12 2007-07-03 Worldscape, Inc. Camera arrangements with backlighting detection and methods of using same
GB2393887B (en) * 2002-10-04 2005-10-26 Criterion Software Ltd Three-dimensional computer graphics
JP3962676B2 (en) * 2002-11-29 2007-08-22 キヤノン株式会社 Image processing method and apparatus
US7050078B2 (en) * 2002-12-19 2006-05-23 Accenture Global Services Gmbh Arbitrary object tracking augmented reality applications
US7714858B2 (en) * 2003-04-18 2010-05-11 Hewlett-Packard Development Company, L.P. Distributed rendering of interactive soft shadows
US7787692B2 (en) * 2003-09-25 2010-08-31 Fujifilm Corporation Image processing apparatus, image processing method, shape diagnostic apparatus, shape diagnostic method and program
FR2861858B1 (en) * 2003-10-29 2014-09-05 Snecma Moteurs MOVING A VIRTUAL ARTICULATED OBJECT INTO A VIRTUAL ENVIRONMENT BY AVOIDING COLLISIONS BETWEEN ARTICULATED OBJECT AND THE ENVIRONMENT
FR2861857B1 (en) * 2003-10-29 2006-01-20 Snecma Moteurs DISPLACEMENT OF A VIRTUAL ARTICULATED OBJECT IN A VIRTUAL ENVIRONMENT BY AVOIDING INTERNAL COLLISIONS BETWEEN THE ARTICULATED ELEMENTS OF THE ARTICULATED OBJECT
GB2410639A (en) * 2004-01-30 2005-08-03 Hewlett Packard Development Co Viewfinder alteration for panoramic imaging
JP4692956B2 (en) * 2004-11-22 2011-06-01 株式会社ソニー・コンピュータエンタテインメント Drawing processing apparatus and drawing processing method
KR100609145B1 (en) * 2004-12-20 2006-08-08 한국전자통신연구원 Rendering Apparatus and Method for real-time global illumination in real light environment
EP1686531B1 (en) * 2005-01-27 2018-04-25 QUALCOMM Incorporated A method, a software product and an electronic device for generating an image composition
US20060170956A1 (en) 2005-01-31 2006-08-03 Jung Edward K Shared image devices
US20060221197A1 (en) * 2005-03-30 2006-10-05 Jung Edward K Image transformation estimator of an imaging device
US9489717B2 (en) 2005-01-31 2016-11-08 Invention Science Fund I, Llc Shared image device
US9124729B2 (en) 2005-01-31 2015-09-01 The Invention Science Fund I, Llc Shared image device synchronization or designation
US9082456B2 (en) 2005-01-31 2015-07-14 The Invention Science Fund I Llc Shared image device designation
US9910341B2 (en) 2005-01-31 2018-03-06 The Invention Science Fund I, Llc Shared image device designation
US8606383B2 (en) * 2005-01-31 2013-12-10 The Invention Science Fund I, Llc Audio sharing
US8902320B2 (en) 2005-01-31 2014-12-02 The Invention Science Fund I, Llc Shared image device synchronization or designation
US8223845B1 (en) 2005-03-16 2012-07-17 Apple Inc. Multithread processing of video frames
US7710423B2 (en) * 2005-03-21 2010-05-04 Microsoft Corporation Automatic layout of items along an embedded one-manifold path
US7505883B2 (en) 2005-03-23 2009-03-17 Electronic Arts Inc. Computer simulation of body dynamics including a solver that solves in linear time for a set of constraints
KR101199498B1 (en) 2005-03-31 2012-11-09 삼성전자주식회사 Apparatus for encoding or generation of multi-view video by using a camera parameter, and a method thereof, and a recording medium having a program to implement thereof
US20060274070A1 (en) * 2005-04-19 2006-12-07 Herman Daniel L Techniques and workflows for computer graphics animation system
US8345252B2 (en) * 2005-04-25 2013-01-01 X-Rite, Inc. Method and system for enhanced formulation and visualization rendering
US9819490B2 (en) 2005-05-04 2017-11-14 Invention Science Fund I, Llc Regional proximity for shared image device(s)
US8964054B2 (en) 2006-08-18 2015-02-24 The Invention Science Fund I, Llc Capturing selected image objects
US9093121B2 (en) 2006-02-28 2015-07-28 The Invention Science Fund I, Llc Data management of an audio data stream
US9001215B2 (en) 2005-06-02 2015-04-07 The Invention Science Fund I, Llc Estimating shared image device operational capabilities or resources
US8681225B2 (en) 2005-06-02 2014-03-25 Royce A. Levien Storage access technique for captured data
US9191611B2 (en) 2005-06-02 2015-11-17 Invention Science Fund I, Llc Conditional alteration of a saved image
US9451200B2 (en) 2005-06-02 2016-09-20 Invention Science Fund I, Llc Storage access technique for captured data
US10003762B2 (en) 2005-04-26 2018-06-19 Invention Science Fund I, Llc Shared image devices
US9942511B2 (en) 2005-10-31 2018-04-10 Invention Science Fund I, Llc Preservation/degradation of video/audio aspects of a data stream
US9167195B2 (en) 2005-10-31 2015-10-20 Invention Science Fund I, Llc Preservation/degradation of video/audio aspects of a data stream
US9621749B2 (en) 2005-06-02 2017-04-11 Invention Science Fund I, Llc Capturing selected image objects
US9076208B2 (en) * 2006-02-28 2015-07-07 The Invention Science Fund I, Llc Imagery processing
US9967424B2 (en) 2005-06-02 2018-05-08 Invention Science Fund I, Llc Data storage usage protocol
US20060268360A1 (en) * 2005-05-12 2006-11-30 Jones Peter W J Methods of creating a virtual window
US20060274068A1 (en) * 2005-06-06 2006-12-07 Electronic Arts Inc. Adaptive contact based skeleton for animation of characters in video games
US7573477B2 (en) * 2005-06-17 2009-08-11 Honda Motor Co., Ltd. System and method for activation-driven muscle deformations for existing character motion
US7403202B1 (en) * 2005-07-12 2008-07-22 Electronic Arts, Inc. Computer animation of simulated characters using combinations of motion-capture data and external force modelling or other physics models
US20070120980A1 (en) 2005-10-31 2007-05-31 Searete Llc, A Limited Liability Corporation Of The State Of Delaware Preservation/degradation of video/audio aspects of a data stream
US20070273711A1 (en) * 2005-11-17 2007-11-29 Maffei Kenneth C 3D graphics system and method
JP2009520395A (en) * 2005-12-16 2009-05-21 トムソン ライセンシング Method, apparatus and system for providing reproducible digital video products from digitally captured images
US7589724B1 (en) 2006-02-15 2009-09-15 Adobe Systems, Incorporated Successive-convolution-compositing technique for rendering soft shadows
US7623137B1 (en) * 2006-02-15 2009-11-24 Adobe Systems, Incorporated Successive-convolution-compositing technique for rendering translucent surfaces
US7411688B1 (en) 2006-03-17 2008-08-12 Arius3D Inc. Method and system for laser intensity calibration in a three-dimensional multi-color laser scanning system
US20070242141A1 (en) * 2006-04-14 2007-10-18 Sony Corporation And Sony Electronics Inc. Adjustable neutral density filter system for dynamic range compression from scene to imaging sensor
US8633927B2 (en) 2006-07-25 2014-01-21 Nvidia Corporation Re-render acceleration of frame with lighting change
US8115774B2 (en) * 2006-07-28 2012-02-14 Sony Computer Entertainment America Llc Application of selective regions of a normal map based on joint position in a three-dimensional model
US8446509B2 (en) * 2006-08-09 2013-05-21 Tenebraex Corporation Methods of creating a virtual window
GB0616685D0 (en) * 2006-08-23 2006-10-04 Warwick Warp Ltd Retrospective shading approximation from 2D and 3D imagery
US8094182B2 (en) * 2006-11-16 2012-01-10 Imove, Inc. Distributed video sensor panoramic imaging system
JP4808600B2 (en) * 2006-11-22 2011-11-02 デジタルファッション株式会社 Rendering program, rendering apparatus, and rendering method
JP4842242B2 (en) * 2006-12-02 2011-12-21 韓國電子通信研究院 Method and apparatus for real-time expression of skin wrinkles during character animation
US9767599B2 (en) * 2006-12-29 2017-09-19 X-Rite Inc. Surface appearance simulation
US20080178087A1 (en) * 2007-01-19 2008-07-24 Microsoft Corporation In-Scene Editing of Image Sequences
KR100967701B1 (en) * 2007-02-26 2010-07-07 한국외국어대학교 연구산학협력단 Reconstructing three dimensional oil paintings
US20080320126A1 (en) * 2007-06-25 2008-12-25 Microsoft Corporation Environment sensing for interactive entertainment
US7929142B2 (en) * 2007-09-25 2011-04-19 Microsoft Corporation Photodiode-based bi-directional reflectance distribution function (BRDF) measurement
US20090079758A1 (en) * 2007-09-25 2009-03-26 Max-Planck-Gesellschaft Zur Forderung Per Wissenschaften E.V. Method and device for generating shadow maps
US8310481B2 (en) * 2007-10-12 2012-11-13 Edward Ernest Bailey Computer aided design method for enhancement of local refinement through T-splines
US8159490B2 (en) 2007-10-16 2012-04-17 Dreamworks Animation Llc Shading of translucent objects
JP5551075B2 (en) * 2007-11-16 2014-07-16 テネブラックス コーポレイション System and method for generating a virtual window
US20090290033A1 (en) * 2007-11-16 2009-11-26 Tenebraex Corporation Systems and methods of creating a virtual window
US8791984B2 (en) * 2007-11-16 2014-07-29 Scallop Imaging, Llc Digital security camera
KR100901270B1 (en) * 2007-12-15 2009-06-09 한국전자통신연구원 System and method for rendering surface materials
US20090172756A1 (en) * 2007-12-31 2009-07-02 Motorola, Inc. Lighting analysis and recommender system for video telephony
US8509569B2 (en) * 2008-02-11 2013-08-13 Apple Inc. Optimization of image processing using multiple processing units
US8243071B2 (en) * 2008-02-29 2012-08-14 Microsoft Corporation Modeling and rendering of heterogeneous translucent materials using the diffusion equation
US9098647B2 (en) 2008-03-10 2015-08-04 Apple Inc. Dynamic viewing of a three dimensional space
US8350850B2 (en) * 2008-03-31 2013-01-08 Microsoft Corporation Using photo collections for three dimensional modeling
IL190539A (en) * 2008-03-31 2015-01-29 Rafael Advanced Defense Sys Methods for transferring points of interest between images with non-parallel viewing directions
US7937245B2 (en) * 2008-04-02 2011-05-03 Dreamworks Animation Llc Rendering of subsurface scattering effects in translucent objects
AU2009201433B2 (en) * 2008-04-15 2013-11-21 Electronics And Telecommunications Research Institute Improved physics-based simulation
US8239822B2 (en) * 2008-04-18 2012-08-07 Microsoft Corp. Symbolic forward and reverse differentiation
US8238651B2 (en) * 2008-06-05 2012-08-07 Microsoft Corporation Image-guided abstraction of building facades
US9619917B2 (en) * 2008-10-03 2017-04-11 Apple Inc. Depth of field for a camera in a media-editing application
US8791951B2 (en) * 2008-12-01 2014-07-29 Electronics And Telecommunications Research Institute Image synthesis apparatus and method supporting measured materials properties
WO2010088840A1 (en) 2009-02-06 2010-08-12 The Hong Kong University Of Science And Technology Generating three-dimensional models from images
US9098926B2 (en) * 2009-02-06 2015-08-04 The Hong Kong University Of Science And Technology Generating three-dimensional façade models from images
US9098945B2 (en) * 2009-05-01 2015-08-04 Microsoft Technology Licensing, Llc Modeling anisotropic surface reflectance with microfacet synthesis
US8369564B2 (en) * 2009-06-30 2013-02-05 Apple Inc. Automatic generation and use of region of interest and domain of definition functions
EP2481209A1 (en) * 2009-09-22 2012-08-01 Tenebraex Corporation Systems and methods for correcting images in a multi-sensor system
US8860731B1 (en) * 2009-12-21 2014-10-14 Lucasfilm Entertainment Company Ltd. Refining animation
CN101872491B (en) * 2010-05-21 2011-12-28 清华大学 Free view angle relighting method and system based on photometric stereo
KR101633377B1 (en) * 2010-06-18 2016-07-08 삼성전자주식회사 Method and Apparatus for Processing Frames Obtained by Multi-Exposure
US8928659B2 (en) * 2010-06-23 2015-01-06 Microsoft Corporation Telepresence systems with viewer perspective adjustment
CN103155004B (en) * 2010-09-01 2016-05-18 玛斯柯有限公司 Demonstrate equipment, the system and method for illumination scheme by image rendering
KR101194364B1 (en) * 2011-07-14 2012-10-25 광주과학기술원 Appearance material design and manufacturing method and system
DE102011079380A1 (en) * 2011-07-19 2013-01-24 Siemens Aktiengesellschaft Method, computer program and system for computer-aided evaluation of image data sets
US8913065B2 (en) * 2011-08-05 2014-12-16 Jeffrey McCartney Computer system for animating 3D models using offset transforms
US9250966B2 (en) * 2011-08-11 2016-02-02 Otoy, Inc. Crowd-sourced video rendering system
JP2013127774A (en) * 2011-11-16 2013-06-27 Canon Inc Image processing device, image processing method, and program
US9183654B2 (en) * 2012-03-02 2015-11-10 Sean Geggie Live editing and integrated control of image-based lighting of 3D models
CN103327221B (en) * 2012-03-20 2016-12-14 华晶科技股份有限公司 Camera head and image prebrowsing system thereof and image method for previewing
GB2500405B (en) 2012-03-20 2014-04-16 Lightmap Ltd Point and click lighting for image based lighting surfaces
TWI520604B (en) * 2012-03-20 2016-02-01 華晶科技股份有限公司 Image pickup device and image preview system and image preview method thereof
US8416240B1 (en) * 2012-04-02 2013-04-09 Google Inc. Determining 3D model information from stored images
WO2014022833A2 (en) * 2012-08-03 2014-02-06 Dreamworks Animation Llc Temporal dependencies in dependency graphs
US20140067869A1 (en) 2012-08-30 2014-03-06 Atheer, Inc. Method and apparatus for content association and history tracking in virtual and augmented reality
US20140078144A1 (en) * 2012-09-14 2014-03-20 Squee, Inc. Systems and methods for avatar creation
US9947132B2 (en) * 2013-03-15 2018-04-17 Nvidia Corporation Material representation data structure and method of representing a material for digital image synthesis
US9261755B2 (en) 2013-04-11 2016-02-16 Satellite Lab, LLC System and method for producing virtual light source movement in motion pictures and other media
WO2015045501A1 (en) * 2013-09-27 2015-04-02 日立オートモティブシステムズ株式会社 External environment recognition device
EP3071294B1 (en) 2013-11-22 2019-03-06 Sonify Biosciences, LLC Skin cancer treatment using low intensity ultrasound
US9509905B2 (en) * 2013-12-17 2016-11-29 Google Inc. Extraction and representation of three-dimensional (3D) and bidirectional reflectance distribution function (BRDF) parameters from lighted image sequences
US9600904B2 (en) 2013-12-30 2017-03-21 Samsung Electronics Co., Ltd. Illuminating a virtual environment with camera light data
US9648699B2 (en) 2014-03-03 2017-05-09 LiveLocation, Inc. Automatic control of location-registered lighting according to a live reference lighting environment
JP6410451B2 (en) * 2014-03-31 2018-10-24 キヤノン株式会社 Information processing apparatus, measurement system, information processing method, and program.
US10169909B2 (en) * 2014-08-07 2019-01-01 Pixar Generating a volumetric projection for an object
US10133830B2 (en) * 2015-01-30 2018-11-20 Hover Inc. Scaling in a multi-dimensional building model
FR3034233B1 (en) * 2015-03-25 2018-08-10 Morpho METHOD OF CORRECTING AN IMAGE OF AT LEAST ONE REMOTELY PRESENTED OBJECT IN FRONT OF AN IMAGER AND LIGHTING BY A LIGHTING SYSTEM AND SHOOTING SYSTEM FOR IMPLEMENTING SAID METHOD
US11432046B1 (en) 2015-06-12 2022-08-30 Veepio Holdings, Llc Interactive, personalized objects in content creator's media with e-commerce link associated therewith
WO2017075452A1 (en) 2015-10-29 2017-05-04 True Image Interactive, Inc Systems and methods for machine-generated avatars
JP6792335B2 (en) 2016-01-19 2020-11-25 キヤノン株式会社 Image processing equipment and its method
EP3474236A4 (en) * 2016-06-16 2019-12-11 Sony Interactive Entertainment Inc. Image processing device
US10489968B1 (en) 2016-09-14 2019-11-26 Musco Corporation Apparatus, method, and system for three-dimensional (3D) visualization of light for evaluation of playability, glare, and gaps
US10594995B2 (en) * 2016-12-13 2020-03-17 Buf Canada Inc. Image capture and display on a dome for chroma keying
EP3336801A1 (en) * 2016-12-19 2018-06-20 Thomson Licensing Method and apparatus for constructing lighting environment representations of 3d scenes
EP3351899B1 (en) * 2017-01-24 2020-06-17 Leica Geosystems AG Method and device for inpainting of colourised three-dimensional point clouds
JP6859763B2 (en) * 2017-03-10 2021-04-14 株式会社リコー Program, information processing device
US11004173B2 (en) 2017-03-13 2021-05-11 Mediatek Inc. Method for processing projection-based frame that includes at least one projection face packed in 360-degree virtual reality projection layout
US11057643B2 (en) 2017-03-13 2021-07-06 Mediatek Inc. Method and apparatus for generating and encoding projection-based frame that includes at least one padding region and at least one projection face packed in 360-degree virtual reality projection layout
CN110506291B (en) 2017-04-05 2021-05-14 联发科技股份有限公司 Video processing method and device
US10181199B2 (en) * 2017-05-08 2019-01-15 Adobe Systems Incorporated Material capture using imaging
KR102149180B1 (en) * 2017-07-07 2020-08-28 한국전자통신연구원 Method for synthesizing virtual content for augmented reality and apparatus using the same
CN111034191A (en) 2017-08-18 2020-04-17 联发科技股份有限公司 Method and apparatus for reducing artifacts in projection-based frames
KR102107706B1 (en) * 2017-10-31 2020-05-07 에스케이텔레콤 주식회사 Method and apparatus for processing image
CN109147023A (en) * 2018-07-27 2019-01-04 北京微播视界科技有限公司 Three-dimensional special efficacy generation method, device and electronic equipment based on face
JP7328651B2 (en) * 2018-08-01 2023-08-17 東芝ライテック株式会社 Generation device, generation method and generation program
CN109587557B (en) * 2019-01-11 2022-03-08 京东方科技集团股份有限公司 Data transmission method and device and display device
US10986308B2 (en) * 2019-03-20 2021-04-20 Adobe Inc. Intelligent video reframing
US10949646B2 (en) 2019-04-30 2021-03-16 Samsung Electronics Co., Ltd. Performing an iterative bundle adjustment for an imaging device
EP3764249A1 (en) * 2019-07-08 2021-01-13 Dmitri Goloubentsev A streaming compiler for automatic adjoint differentiation
GB2586060B (en) * 2019-08-01 2022-09-21 Sony Interactive Entertainment Inc Surface characterisation apparatus and system
CN110930483B (en) * 2019-11-20 2020-11-24 腾讯科技(深圳)有限公司 Role control method, model training method and related device
US11461968B2 (en) * 2020-01-30 2022-10-04 Unity Technologies Sf Method of inferring microdetail on skin animation
US11620765B2 (en) * 2020-07-02 2023-04-04 Unity Technologies Sf Automatic detection of a calibration object for modifying image parameters
CN111815768B (en) * 2020-09-14 2020-12-18 腾讯科技(深圳)有限公司 Three-dimensional face reconstruction method and device
CN113034662B (en) * 2021-03-29 2023-03-31 网易(杭州)网络有限公司 Virtual scene rendering method and device, storage medium and electronic equipment
AU2022379381B2 (en) 2021-10-29 2023-09-14 Kara Technologies Limited Method for generating animated sentences for sign language translation
KR102555166B1 (en) * 2022-10-04 2023-07-12 인하대학교 산학협력단 Method and System for Facial Texture Synthesis with Skin Microelement Structure

Family Cites Families (92)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4645459A (en) * 1982-07-30 1987-02-24 Honeywell Inc. Computer generated synthesized imagery
JPH0743765B2 (en) 1987-10-20 1995-05-15 富士写真フイルム株式会社 Radiation image processing method and apparatus
CA1316591C (en) 1987-10-20 1993-04-20 Kazuhiro Hishinuma Method and apparatus for radiation image processing and x-ray image processing
US5191406A (en) * 1990-04-20 1993-03-02 Nikon Corporation Method and apparatus for rapid scanning of color images
US5625577A (en) * 1990-12-25 1997-04-29 Shukyohojin, Kongo Zen Sohonzan Shorinji Computer-implemented motion analysis method using dynamics
US5623428A (en) * 1990-12-25 1997-04-22 Shukyohoji, Kongo Zen Sohozan Shoriji Method for developing computer animation
US5586224A (en) * 1990-12-25 1996-12-17 Shukyohojin, Kongo Zen Sohonzan Shorinji Robot or numerical control programming method
WO1993025355A1 (en) * 1992-06-05 1993-12-23 Fujitsu Limited Simulation method for manipulator and apparatus therefor, simulation and control method for manipulator and apparatus therefor, and control method for manipulator and apparatus therefor
US5739820A (en) * 1992-11-19 1998-04-14 Apple Computer Inc. Method and apparatus for specular reflection shading of computer graphic images
US5490240A (en) 1993-07-09 1996-02-06 Silicon Graphics, Inc. System and method of generating interactive computer graphic images incorporating three dimensional textures
US5403238A (en) * 1993-08-19 1995-04-04 The Walt Disney Company Amusement park attraction
JPH0773339A (en) * 1993-09-03 1995-03-17 Sharp Corp Light source luminance calculator
US5546475A (en) * 1994-04-29 1996-08-13 International Business Machines Corporation Produce recognition system
US5600763A (en) * 1994-07-21 1997-02-04 Apple Computer, Inc. Error-bounded antialiased rendering of complex scenes
US5745759A (en) * 1994-10-14 1998-04-28 Qnx Software Systems, Ltd. Window kernel
JP3554616B2 (en) * 1994-12-13 2004-08-18 富士通株式会社 Drawing method and apparatus using radiosity method
GB9501832D0 (en) 1995-01-31 1995-03-22 Videologic Ltd Texturing and shading of 3-d images
IL113496A (en) 1995-04-25 1999-09-22 Cognitens Ltd Apparatus and method for recreating and manipulating a 3d object based on a 2d projection thereof
US5982389A (en) * 1996-06-17 1999-11-09 Microsoft Corporation Generating optimized motion transitions for computer animated objects
US5966134A (en) * 1996-06-28 1999-10-12 Softimage Simulating cel animation and shading
EP0816986B1 (en) * 1996-07-03 2006-09-06 Hitachi, Ltd. System for recognizing motions
GB9616262D0 (en) 1996-08-02 1996-09-11 Philips Electronics Nv Post-processing generation of focus/defocus effects for computer graphics images
US5748792A (en) * 1996-08-13 1998-05-05 Polaroid Corporation Large kernel filtering using a fixed-size block processor
US6104412A (en) * 1996-08-21 2000-08-15 Nippon Telegraph And Telephone Corporation Method for generating animations of a multi-articulated structure, recording medium having recorded thereon the same and animation generating apparatus using the same
JP3750830B2 (en) * 1996-08-30 2006-03-01 ソニー株式会社 Color correction apparatus in imaging apparatus
US6246420B1 (en) * 1996-10-11 2001-06-12 Matsushita Electric Industrial Co., Ltd. Movement data connecting method and apparatus therefor
US6078332A (en) 1997-01-28 2000-06-20 Silicon Graphics, Inc. Real-time lighting method using 3D texture mapping
US6052124A (en) 1997-02-03 2000-04-18 Yissum Research Development Company System and method for directly estimating three-dimensional structure of objects in a scene and camera motion from three two-dimensional views of the scene
US5894309A (en) * 1997-02-27 1999-04-13 Mitsubishi Electric Information Technology Center America, Inc. System for modifying lighting in photographs
US6310644B1 (en) 1997-03-26 2001-10-30 3Dm Devices Inc. Camera theodolite system
US6184899B1 (en) * 1997-03-31 2001-02-06 Treyarch Invention, L.L.C. Articulated figure animation using virtual actuators to simulate solutions for differential equations to display more realistic movements
US6088042A (en) * 1997-03-31 2000-07-11 Katrix, Inc. Interactive motion data animation system
US6057859A (en) * 1997-03-31 2000-05-02 Katrix, Inc. Limb coordination system for interactive computer animation of articulated characters with blended motion data
US6124864A (en) * 1997-04-07 2000-09-26 Synapix, Inc. Adaptive modeling and segmentation of visual image streams
US6160907A (en) * 1997-04-07 2000-12-12 Synapix, Inc. Iterative three-dimensional process for creating finished media content
US6097394A (en) * 1997-04-28 2000-08-01 Board Of Trustees, Leland Stanford, Jr. University Method and system for light field rendering
JP3747589B2 (en) * 1997-09-17 2006-02-22 コニカミノルタビジネステクノロジーズ株式会社 Image feature amount comparison device and recording medium storing image feature amount comparison program
US6166744A (en) 1997-11-26 2000-12-26 Pathfinder Systems, Inc. System for combining virtual images with real-world scenes
JPH11175762A (en) * 1997-12-08 1999-07-02 Katsushi Ikeuchi Light environment measuring instrument and device and method for shading virtual image using same
JP3688879B2 (en) * 1998-01-30 2005-08-31 株式会社東芝 Image recognition apparatus, image recognition method, and recording medium therefor
US6148113A (en) * 1998-02-03 2000-11-14 Micrografx, Inc. System for stimulating the depth of field of an image in two dimensional space and method of operation
US6272231B1 (en) * 1998-11-06 2001-08-07 Eyematic Interfaces, Inc. Wavelet-based facial motion capture for avatar animation
US5974168A (en) * 1998-04-16 1999-10-26 International Business Machines Corporation Acquiring bump maps from curved objects
US6333749B1 (en) 1998-04-17 2001-12-25 Adobe Systems, Inc. Method and apparatus for image assisted modeling of three-dimensional scenes
US6137491A (en) 1998-06-05 2000-10-24 Microsoft Corporation Method and apparatus for reconstructing geometry using geometrically constrained structure from motion with points on planes
US6271855B1 (en) 1998-06-18 2001-08-07 Microsoft Corporation Interactive construction of 3D models from panoramic images employing hard and soft constraint characterization and decomposing techniques
US6628830B1 (en) * 1998-06-24 2003-09-30 Canon Kabushiki Kaisha Image processing method and apparatus and storage medium
US6628298B1 (en) 1998-07-17 2003-09-30 The Regents Of The University Of California Apparatus and method for rendering synthetic objects into real scenes using measurements of scene illumination
KR20010015674A (en) 1998-07-30 2001-02-26 마츠시타 덴끼 산교 가부시키가이샤 Moving picture synthesizer
US6373496B1 (en) 1998-08-12 2002-04-16 S3 Graphics Co., Ltd. Apparatus and method for texture mapping
US6342887B1 (en) * 1998-11-18 2002-01-29 Earl Robert Munroe Method and apparatus for reproducing lighting effects in computer animated objects
US6278460B1 (en) 1998-12-15 2001-08-21 Point Cloud, Inc. Creating a three-dimensional model from two-dimensional images
CA2259882A1 (en) * 1999-01-22 2000-07-22 I.S.G. Technologies, Inc. Interactive sculpting for volumetric exploration and feature extraction
US6313842B1 (en) * 1999-03-03 2001-11-06 Discreet Logic Inc. Generating image data
US6496597B1 (en) 1999-03-03 2002-12-17 Autodesk Canada Inc. Generating image data
US6362822B1 (en) 1999-03-12 2002-03-26 Terminal Reality, Inc. Lighting and shadowing methods and arrangements for use in computer graphic simulations
US6400848B1 (en) * 1999-03-30 2002-06-04 Eastman Kodak Company Method for modifying the perspective of a digital image
US6483514B1 (en) 1999-04-15 2002-11-19 Pixar Animation Studios Motion blurring implicit surfaces
US6552731B1 (en) * 1999-04-16 2003-04-22 Avid Technology, Inc. Multi-tone representation of a digital image on a digital nonlinear editing system
JP4001435B2 (en) * 1999-04-19 2007-10-31 株式会社バンダイナムコゲームス Game device, image data creation tool, and information storage medium
US6297834B1 (en) 1999-06-10 2001-10-02 Hewlett-Packard Company Direction-dependent texture maps in a graphics system
US6504538B1 (en) 1999-07-01 2003-01-07 Microsoft Corporation Method and system for generating light values for a set of vertices
JP3486575B2 (en) * 1999-08-31 2004-01-13 キヤノン株式会社 Mixed reality presentation apparatus and method, and storage medium
US6373487B1 (en) 1999-09-17 2002-04-16 Hewlett-Packard Company Methods and apparatus for constructing a 3D model of a scene from calibrated images of the scene
FR2799022B1 (en) * 1999-09-29 2002-02-01 Oreal MAKEUP ASSISTANCE DEVICE AND ASSEMBLY CONSISTING OF SUCH A DEVICE AND A DEVICE FOR DELIVERING A PRODUCT HAVING A PREDETERMINED BRDF, SELECTED BY THE MAKEUP ASSISTANCE DEVICE
US6694064B1 (en) * 1999-11-19 2004-02-17 Positive Systems, Inc. Digital aerial image mosaic method and apparatus
US20020122589A1 (en) * 1999-11-29 2002-09-05 Donald M. Reiman Constructing profiles to compensate for non-linearities in image capture
WO2001048697A1 (en) * 1999-12-23 2001-07-05 Intel Corporation Methods of hierarchical static scene simplification and polygon budgeting for 3d models
US6515674B1 (en) 2000-03-17 2003-02-04 Hewlett-Packard Company Apparatus for and of rendering 3d objects with parametric texture maps
US6750866B1 (en) * 2000-04-21 2004-06-15 Realistic Dynamics, Inc. Method and system for dynamically filtering the motion of articulated bodies
US6564108B1 (en) 2000-06-07 2003-05-13 The Delfin Project, Inc. Method and system of auxiliary illumination for enhancing a scene during a multimedia presentation
US6750873B1 (en) * 2000-06-27 2004-06-15 International Business Machines Corporation High quality texture reconstruction from multiple scans
US7034825B2 (en) * 2000-08-24 2006-04-25 Stowe Jason A Computerized image system
JP2002152719A (en) * 2000-08-29 2002-05-24 Usc Corp Monitor method and monitor device utilizing curved surface image
US6765573B2 (en) 2000-10-26 2004-07-20 Square Enix Co., Ltd. Surface shading using stored texture map based on bidirectional reflectance distribution function
DE60143814D1 (en) * 2000-11-17 2011-02-17 Sony Corp DEVICE AND METHOD FOR CONTROLLING A MOVABLE ROBOT WITH LEGS AND METHOD FOR PRODUCING MOTION PATTERNS FOR A MOVABLE ROBOT WITH LEGS
JP3406965B2 (en) * 2000-11-24 2003-05-19 キヤノン株式会社 Mixed reality presentation device and control method thereof
JP3572025B2 (en) * 2001-03-07 2004-09-29 キヤノン株式会社 Image reproducing apparatus, image processing apparatus and their methods
US6941028B2 (en) * 2001-04-30 2005-09-06 Hewlett-Packard Development Company, L.P. System and method for image enhancement, dynamic range compensation and illumination correction
US6639594B2 (en) * 2001-06-03 2003-10-28 Microsoft Corporation View-dependent image synthesis
US6685326B2 (en) * 2001-06-08 2004-02-03 University Of Southern California Realistic scene lighting simulation
US7106325B2 (en) * 2001-08-03 2006-09-12 Hewlett-Packard Development Company, L.P. System and method for rendering digital images having surface reflectance properties
US6961058B2 (en) * 2001-08-10 2005-11-01 Microsoft Corporation Macrostructure modeling with microstructure reflectance slices
US6538396B1 (en) 2001-09-24 2003-03-25 Ultimatte Corporation Automatic foreground lighting effects in a composited scene
JP4443083B2 (en) 2001-10-09 2010-03-31 株式会社バンダイナムコゲームス Image generation system and information storage medium
US7215813B2 (en) 2001-12-03 2007-05-08 Apple Computer, Inc. Method and apparatus for color correction
US7221809B2 (en) * 2001-12-17 2007-05-22 Genex Technologies, Inc. Face recognition system and method
US20030202120A1 (en) 2002-04-05 2003-10-30 Mack Newton Eliot Virtual lighting system
US7009608B2 (en) * 2002-06-06 2006-03-07 Nvidia Corporation System and method of using multiple representations per object in computer graphics
US7075534B2 (en) * 2002-06-21 2006-07-11 Forrester Hardenbergh Cole Method and system for automatically generating factored approximations for arbitrary bidirectional reflectance distribution functions
KR100483806B1 (en) * 2002-07-18 2005-04-20 한국과학기술원 Motion Reconstruction Method from Inter-Frame Feature Correspondences of a Single Video Stream Using a Motion Library
JP3972784B2 (en) * 2002-09-30 2007-09-05 ソニー株式会社 Image processing apparatus and method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5499306A (en) * 1993-03-08 1996-03-12 Nippondenso Co., Ltd. Position-and-attitude recognition method and apparatus by use of image pickup means
US6571024B1 (en) * 1999-06-18 2003-05-27 Sarnoff Corporation Method and apparatus for multi-view three dimensional estimation

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8739137B2 (en) 2006-10-19 2014-05-27 Purdue Research Foundation Automatic derivative method for a computer programming language
US8281299B2 (en) 2006-11-10 2012-10-02 Purdue Research Foundation Map-closure: a general purpose mechanism for nonstandard interpretation
US8466917B2 (en) 2006-11-20 2013-06-18 Thomson Licensing Method and system for modeling light
WO2008091198A1 (en) * 2007-01-24 2008-07-31 Swiftfoot Graphics Ab Method, display adapter and computer program product for improved graphics performance by using a replaceable culling program
US9460552B2 (en) 2007-01-24 2016-10-04 Intel Corporation Method, display adapter and computer program product for improved graphics performance by using a replaceable culling program
US10140750B2 (en) 2007-01-24 2018-11-27 Intel Corporation Method, display adapter and computer program product for improved graphics performance by using a replaceable culling program
US9135746B2 (en) 2011-08-11 2015-09-15 Canon Kabushiki Kaisha Image processing apparatus and control method thereof
US9767620B2 (en) 2014-11-26 2017-09-19 Restoration Robotics, Inc. Gesture-based editing of 3D models for hair transplantation applications

Also Published As

Publication number Publication date
US20040150643A1 (en) 2004-08-05
EP1573653A2 (en) 2005-09-14
JP4220470B2 (en) 2009-02-04
US8515157B2 (en) 2013-08-20
EP1566052A4 (en) 2007-02-07
WO2004047009A2 (en) 2004-06-03
EP1565872A1 (en) 2005-08-24
WO2004047426A2 (en) 2004-06-03
EP1573653A4 (en) 2007-03-14
AU2003295586A1 (en) 2004-06-15
JP4276178B2 (en) 2009-06-10
US7536047B2 (en) 2009-05-19
US7079137B2 (en) 2006-07-18
WO2004047009A3 (en) 2004-07-08
US6990230B2 (en) 2006-01-24
AU2003295582A1 (en) 2004-06-15
WO2004047426A3 (en) 2004-07-15
US20040146197A1 (en) 2004-07-29
JP2006506745A (en) 2006-02-23
EP1565872A4 (en) 2007-03-07
EP1566052A2 (en) 2005-08-24
AU2003298666A1 (en) 2004-06-15
AU2003295586B2 (en) 2009-05-07
US20040150642A1 (en) 2004-08-05
US6983082B2 (en) 2006-01-03
EP1573653B1 (en) 2013-07-10
US20040169656A1 (en) 2004-09-02
US20040150641A1 (en) 2004-08-05
US20090174713A1 (en) 2009-07-09
JP2006507585A (en) 2006-03-02
JP2006506742A (en) 2006-02-23

Similar Documents

Publication Publication Date Title
US6990230B2 (en) Reverse-rendering method for digital modeling
Jambon et al. Nerfshop: Interactive editing of neural radiance fields
Coorg et al. Spherical mosaics with quaternions and dense correlation
Karsch et al. Depth transfer: Depth extraction from video using non-parametric sampling
JP3651590B2 (en) Method for restoring 3D scene structure and camera motion directly from points, lines and / or from image intensity
US7184071B2 (en) Method of three-dimensional object reconstruction from a video sequence using a generic model
Zhang et al. Path-space differentiable rendering of participating media
US6249285B1 (en) Computer assisted mark-up and parameterization for scene analysis
US20120081357A1 (en) System and method for interactive painting of 2d images for iterative 3d modeling
Piponi Automatic differentiation, C++ templates, and photogrammetry
CN115115780B (en) Three-dimensional reconstruction method and system based on multi-view RGBD camera
WO2023241065A1 (en) Method and apparatus for image inverse rendering, and device and medium
Cao et al. Single view 3D reconstruction based on improved RGB-D image
US6009437A (en) Linear fitting with missing data: applications to structure-from-motion and to characterizing intensity images
US6421049B1 (en) Parameter selection for approximate solutions to photogrammetric problems in interactive applications
KR20090075399A (en) Mechanism for reconstructing a 3d model using key-frames selected from image sequences
US11790606B2 (en) Determining camera rotations based on known translations
US20230206538A1 (en) Differentiable inverse rendering based on radiative backpropagation
Bazin et al. Integration of geometric elements, euclidean relations, and motion curves for parametric shape and motion estimation
Tykkälä Real-time image-based RGB-D camera motion tracking and environment mapping
Dominec 3D Surface Reconstruction From Video Sequences
Solem et al. Variational surface interpolation from sparse point and normal data
Bhotika Scene-space methods for Bayesian inference of three-dimensional shape and motion
Sun et al. Interactive optimization of 3D shape and 2D correspondence using multiple geometric constraints via POCS
Ruepp Recovery of Structure and Motion from Monocular Images under Poor Lighting and Texture Conditions.

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2003295582

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 2004553822

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2003786780

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2003786780

Country of ref document: EP