WO2017031113A1 - Multi-reviewer 3D model system - Google Patents

Multi-reviewer 3D model system

Info

Publication number: WO2017031113A1
Authority: WIPO (PCT)
Prior art keywords: reviewer, model, image, images, coordinator
Application number: PCT/US2016/047164
Other languages: English (en)
Inventors: Jared Sandrew, Tony Baldridge, Anthony Lopez, Jacqueline Mcfarland, Scott Jones, Thomas Schad
Original Assignee: Legend3D, Inc.
Priority claimed from US 14/827,482 (US9282321B2)
Application filed by Legend3D, Inc.
Publication of WO2017031113A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 13/00: Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N 13/20: Image signal generators
    • H04N 13/261: Image signal generators with monoscopic-to-stereoscopic image conversion
    • H04N 13/275: Image signal generators from 3D object models, e.g. computer-generated stereoscopic image signals
    • H04N 13/279: Image signal generators from 3D object models with the virtual viewpoint locations being selected by the viewers or determined by tracking
    • H04N 13/10: Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N 13/106: Processing image signals
    • H04N 13/128: Adjusting depth or disparity

Definitions

  • One or more embodiments of the invention are related to the field of image analysis and image enhancement, and computer graphics processing of two-dimensional (2D) images into three-dimensional (3D) stereoscopic images. More particularly, but not by way of limitation, one or more embodiments of the invention enable a system for simultaneous review of a 3D model by multiple reviewers, each potentially viewing the model from a different viewpoint, and for coordination of these reviews by a coordinator. Graphical indicators of the reviewers' viewpoints may be overlaid onto an overview image of the 3D model so the coordinator can determine the regions and perspectives of the model that are under review.
  • 3D viewing is based on stereographic vision, with different viewpoints from one or more images provided to the left and right eyes to provide the illusion of depth.
  • Many techniques are known in the art to provide 3D viewing.
  • specialized glasses may be utilized for viewing 3D images, such as glasses with color filters, polarized lenses, or anamorphic lenses.
  • Some 3D viewing methods use separate screens for left eye and right eye images, or project images directly onto the left eye and right eye.
  • Virtual reality environments typically are computer-generated environments that simulate user presence in either real world or computer-generated worlds.
  • the systems utilized to display the virtual reality environment typically include a stereoscopic display for 3D viewing and generally instrument a viewer with one or more sensors, in order to detect and respond to the position, orientation, and movements of the viewer. Based on these values, the virtual reality environment generates images to provide an immersive experience.
  • the immersive experience may also include other outputs such as sound or vibration. Images may be projected onto screens, or provided using specialized glasses worn by the user.
  • the primary challenge with creating a 3D virtual reality environment is the complexity of generating the necessary stereo images for all possible positions and orientations of the viewer. These stereo images must be generated dynamically in approximately real-time as the viewer moves through the virtual reality environment. This requirement distinguishes 3D virtual reality from the process of generating 3D movies from 2D images as the location of the viewer is essentially fixed at the location of the camera.
  • the process of creating a 3D virtual reality is typically iterative. 2D images and possibly computer-generated elements are modeled in 3D space using depth assignments. Stereoscopic 3D images are then rendered from the 3D models and reviewed for accuracy and artistic effects. Typically, these reviews identify adjustments that are needed to the 3D models. These adjustments are applied, and the revised model is re-rendered for a subsequent review. This process may repeat several times. It is time consuming because the modifications to the 3D models and the rendering of images from these 3D models are labor intensive and computationally intensive and may be impossible in real-time. There are no known methods that provide rapid depth update and review cycles for virtual reality models. There are no known methods that allow editors to modify depth and immediately observe the effect of these depth changes on the stereoscopic images. Hence there is a need for a method for real-time depth modification of stereo images for a virtual reality environment.
  • Embodiments of the invention enable a system for coordination of multiple, simultaneous reviews of a 3D model, potentially from different viewpoints.
  • a set of 2D images may be obtained from an environment, which may for example be a real physical environment such as a room, a computer-generated environment, or a mix of physical and computer-generated elements.
  • 2D images may be captured using a camera aimed at various angles to form a panoramic collection of images covering a desired part of a scene. In some embodiments the collection of images may cover an entire sphere, providing 360° viewing in all directions (including left-to-right and up and down).
  • Embodiments of the invention enable converting these 2D images into a 3D virtual reality experience.
  • a viewer in the virtual reality environment may be able to view the environment from various locations and orientations, and perceive three-dimensional depth reflecting the physical or modeled characteristics of the captured scene.
  • subsets of the 2D images are first stitched together, using for example common features or overlapping pixels.
  • the integrated, stitched images may then be projected onto a spherical surface to form a complete 360-degree view of the scene (or a desired portion thereof) in any direction (left to right as well as up and down).
  • the spherical surface provides a complete spherical view of the scene, but this view is still two-dimensional since it lacks any depth information.
  • Embodiments of the invention enable addition of depth information to the spherical display.
  • the spherical surface image is unwrapped onto a plane.
  • This unwrapped image may be divided into regions to assist in generating depth information.
  • Depth information is generated for the points of the regions. Depth may be applied to the regions manually, obtained from external depth information sources, or calculated from any portion of the underlying image data, including hue, luminance, saturation, or any other image feature, or from any combination of manual, external, and calculated depth.
  • Depth information may comprise for example, without limitation, depth maps, bump maps, parallax maps, U maps, UV maps, disparity maps, ST maps, point clouds, z maps, offset maps, displacement maps, or more generally any information that may provide a three-dimensional shape or three-dimensional appearance to an image.
  • 3D stereoscopic images may be generated for a viewer in a 3D virtual reality environment.
  • the depth information determines the amount of offset for each point between the left eye and right eye images, which provides a three-dimensional viewing experience.
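  • As an illustration only (this formula is not given in the patent text, which covers many depth representations), a common pinhole-camera approximation relates the left/right pixel offset d (disparity) to the depth Z, the interocular baseline B, and the focal length f in pixels:

```latex
d = \frac{B \cdot f}{Z}
```

Closer points (small Z) therefore receive larger offsets, which matches the inverse relationship between depth and pixel shift noted later for depth maps and parallax maps.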
  • the depth information may be projected onto a sphere, yielding spherical depth information that provides depth for all or a portion of the points on the sphere.
  • Spherical depth information may comprise for example, without limitation, spherical depth maps, spherical bump maps, spherical parallax maps, spherical U maps, spherical UV maps, spherical disparity maps, spherical ST maps, spherical point clouds, spherical z maps, spherical offset maps, spherical displacement maps, or more generally any information that may provide a three-dimensional shape or three-dimensional appearance to a spherical surface.
  • the unwrapped plane image is also projected onto a spherical surface to form the spherical image. Left and right eye images may then be generated using the spherical depth information.
  • the spherical depth information may be a spherical depth map that provides a z-value for each point on the sphere.
  • left and right images may be formed by projecting each image point from its spherical depth position onto left and right image planes.
  • the position and orientation of the left and right image planes may depend on the position and orientation of the viewer in the virtual reality environment. Thus the stereo images seen by the viewer will change as the viewer looks around the virtual reality environment in different directions.
  • the projections from the spherical depth map points onto the left and right image planes may for example use standard 3D to 2D projections to a plane using a different focal point for each eye.
  • a different method may be used to generate the stereographic images. This method first generates a stereo image in 2D using the unwrapped plane image and the plane depth information. The left and right images are then projected onto spheres, with a separate left sphere and right sphere for the left and right images. Based on the position and orientation of the viewer in the virtual reality environment, left and right image planes and eye positions are calculated, the left sphere is projected onto the left image plane, and the right sphere is projected onto the right image plane.
  • the regions of the unwrapped plane image may be used to assist in creating the depth information.
  • One or more regions may be mapped onto flat or curved surfaces, and these surfaces may be positioned and oriented in three-dimensional space.
  • constraints may be applied manually or automatically to reflect continuous or flexible boundaries between region positions in space.
  • Depth information may be generated directly from the region positions and orientations in three-dimensional space by relating the depth to the distance of each point from a specified viewpoint.
  • objects may be captured in the 2D images that should not appear in the virtual reality environment; conversely it may be desirable to insert additional objects that were not present in the 2D images.
  • Operators may edit the original images, the unwrapped 2D image, the spherical images, or combinations of these to achieve the desired effects.
  • Removing an object from the environment consists of replacing the pixels of the removed object with a suitable fill, which may be obtained automatically from surrounding regions.
  • Adding an object consists of inserting an image and applying the appropriate depth information to the region or regions of the added image.
  • Inserted images may be obtained from real objects or they may be computer generated, or they may be a combination of real images and computer generated images.
  • Objects in the 2D images may also be extended in some embodiments to fill areas that were not captured in the original images, or that are in areas where undesired objects have been removed.
  • Some embodiments may add objects in multiple layers at multiple depths, providing for automatic gap filling when the viewpoint of a viewer in the 3D virtual reality environment changes to reveal areas behind the original objects in the scene.
  • One or more embodiments enable a system and method for rapidly and efficiently updating stereoscopic images to reflect changes in the depths assigned to objects in the scene. Multiple, iterative depth modifications may be needed in some situations. Artists, editors, producers, and quality control teams may review left and right viewpoint images generated from the virtual environment, and may determine in some cases that the assigned depths are not optimal.
  • One or more embodiments provide a method for applying depth changes without re-rendering the entire virtual reality scene to create new left and right images from the 3D model of the virtual environment.
  • a 3D model of a virtual reality environment is created or obtained, for example using the image stitching techniques described above.
  • One or more embodiments may generate a spherical translation map from this 3D model that may be used to apply depth updates without re-rendering.
  • a spherical translation map may for example assign translation values to points on the sphere, where the translation value for a point is a function of the distance between the viewer (at the center of the sphere) and the closest object in the scene along the ray between the viewer and that point of the sphere.
  • a translation map may be for example, without limitation, a depth map, a parallax map, a disparity map, a displacement map, a U map, or a UV map.
  • a parallax map contains left and right pixel offsets directly.
  • a depth map contains depth values that are inversely proportional to pixel offsets. Any map that can be used to determine pixel offsets between left and right viewpoint images may be used as a spherical translation map.
  • an initial stereoscopic (left and right) pair of images may be rendered from the 3D model, along with the spherical translation map. Subsequent modifications to the depth of selected objects may be made by directly editing and modifying the spherical translation map, rather than modifying and re-rendering the 3D model. This process provides a rapid and efficient method for modifying depth after left and right images are created.
  • Any left and right images from any user viewpoint may be used to review the 3D model for the virtual reality environment.
  • reviewers may use one or more planar image projections from one or more directions.
  • reviewers may use panoramic images generated from the 3D virtual environment to review and possibly modify the depth assignments. These panoramic images may for example include a full 360-degree horizontal field of view and a full 180-degree vertical field of view, or they may in other cases be partial panoramas covering only a subset of the user's entire field of view. Updates to left and right viewpoint images from spherical translation map modifications may be made to any stereoscopic images generated from the scene, including both planar projections and panoramic images.
  • One or more embodiments may apply the above methods for real-time, rapid update of stereo images using spherical translation maps to any 3D model of a virtual reality environment.
  • a 3D model of a virtual reality environment may for example be created from camera images of a real scene, from computer-generated elements, or from combinations of real images and computer-generated elements.
  • a 3D model may be created by assigning depth to computer- generated elements or to geometrical surfaces onto which images are projected.
  • Embodiments may create spherical translation maps from the 3D model and use these spherical translation maps to modify stereoscopic images without re-rendering.
  • One or more embodiments may enable simultaneous reviews of 3D models by multiple reviewers. Because it may be desirable to review a 3D model from multiple viewpoints, use of multiple simultaneous reviewers may dramatically shorten the time needed to complete a review. However, simultaneous review by multiple reviewers presents a challenge in coordinating the activities of the reviewers. In particular, during reviews of complex models from many different viewpoints, it may be difficult to track what each reviewer is observing at any point in time. Lack of coordinated viewpoint tracking may for example lead to communication problems. For example, if a reviewer comments that the depth of an object should be changed, it may not be clear precisely which object the reviewer is referring to. In addition, real-time, iterative updates of 3D models may require rechecking objects from various viewpoints; therefore it may be critical to coordinate the viewpoints used by each reviewer during the review process.
  • One or more embodiments enable a system for multi-reviewer reviews of 3D models that adds a central coordinator who receives information about each reviewer's activities.
  • multiple review stations are operated by multiple reviewers, and a coordinator station is operated by a coordinator; each station has access to a 3D model under review. Review stations and the coordinator station have displays that show various views of the 3D model.
  • a review station may have a viewpoint renderer that renders one or more viewpoint images of the 3D model from a particular position and orientation. Viewpoint images may be monoscopic or stereoscopic. The position or orientation (or both) may for example be measured by sensors attached to a reviewer. These sensors may for example be integrated into a virtual reality headset that the reviewer uses to review the 3D model. In one or more embodiments a reviewer may select a viewpoint, using for example a mouse or joystick.
  • a review station may include a reviewer pose subsystem, which may measure or obtain the pose of the reviewer.
  • the reviewer pose subsystem may include sensors, input devices, graphical user interface controls, or any combination of hardware and software configured to measure or obtain the pose of the reviewer.
  • the reviewer pose subsystem provides the reviewer's pose to the viewpoint renderer, which renders one or more viewpoint images of the 3D model based on the pose.
  • a review station may also include a reviewer focus subsystem, which determines the current focus region or regions within the reviewer's field of view. These focus regions indicate the areas of the 3D model that the reviewer is currently observing, checking, or commenting upon.
  • Embodiments may use various techniques to determine a reviewer's focus regions. For example, one or more embodiments may consider a reviewer's entire field of view to be the focus region. One or more embodiments may use other techniques to locate objects or regions within the field of view that represent the reviewer's current focus.
  • reviewer focus regions are communicated to the coordinator station.
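  • As a concrete sketch of what such a communication might contain (the names and fields below are hypothetical; the patent does not specify a message format or implementation language), a review station could periodically send a small structured update to the coordinator station:

```python
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass
class ReviewerFocusUpdate:
    """Hypothetical message a review station might send to the coordinator station."""
    reviewer_id: str
    yaw_degrees: float        # horizontal component of the reviewer's view direction
    pitch_degrees: float      # vertical component of the reviewer's view direction
    fov_degrees: float        # angular size of the reviewer's field of view
    note: Optional[str] = None  # optional reviewer annotation for a designated focus area
    highlight: Optional[Tuple[float, float, float, float]] = None  # optional user-drawn region
                                # as (yaw_min, pitch_min, yaw_max, pitch_max) in degrees
```

The coordinator station could keep the latest update per reviewer and hand them to its overview renderer to draw the focus indicators described below.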
  • the coordinator station may include a model overview renderer that renders one or more overview images of the 3D model.
  • the overview renderer may also add graphical indicators to these overview images to represent the reviewer focus regions. These graphical indicators may be for example overlays of icons, shapes, curves, or labels onto the overview images. The coordinator may therefore be able to see the 3D model and the current focus regions of each reviewer in a single set of integrated overview images.
  • the location of the graphical indicators of reviewer focus areas may correspond to the location of the reviewer focus regions within the 3D model.
  • the center of a reviewer's field of view may be represented by a point on the surface of the sphere.
  • An overview image of the sphere that is a two dimensional projection of the sphere may for example locate graphical indicators of a reviewer's focus area at the corresponding point on the two dimensional spherical projection.
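  • A minimal sketch of this placement, assuming an equirectangular overview image and a yaw/pitch view direction (conventions the patent does not prescribe), might look like the following:

```python
def focus_marker_position(yaw_deg, pitch_deg, width, height):
    """Map a reviewer's view direction to pixel coordinates on an equirectangular
    (unwrapped spherical) overview image, where yaw in [-180, 180) runs left to
    right and pitch in [-90, 90] runs bottom to top."""
    u = (yaw_deg + 180.0) / 360.0 * width
    v = (90.0 - pitch_deg) / 180.0 * height
    return int(u) % width, min(max(int(v), 0), height - 1)

# Example: a reviewer looking 30 degrees to the right and 10 degrees up,
# marked on a 2048 x 1024 overview image.
x, y = focus_marker_position(30.0, 10.0, 2048, 1024)
# The coordinator UI could draw a colored dot, icon, or label at (x, y).
```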
  • One or more embodiments may differentiate the graphical indicators of focus areas for different reviewers using various visual characteristics, such as for example different colors, shapes, sizes, patterns, textures, icons, or labels.
  • Embodiments may determine reviewer focus areas based on the specific requirements of their application.
  • reviewer focus areas may be the entire field of view of a reviewer.
  • reviewer focus areas may be portions of this field of view, for example a central portion.
  • a point at the center of the field of view of a reviewer may be the reviewer focus area.
  • One or more embodiments may enable reviewers to identify all or a portion of the reviewer focus area. For example, a reviewer may be able to draw a boundary on his or her display to indicate a focus region within which the reviewer has observed an issue.
  • One or more embodiments may further enable a reviewer to add notes or annotations to a designated reviewer focus area. These user-designated focus areas, along with possibly notes or annotations, may be communicated to the coordinator station and displayed with the overview images.
  • One or more embodiments may enable a coordinator or a reviewer to update the 3D model, or to enter, store, or communicate desired updates that should be applied later to the 3D model.
  • a coordinator station may include a model update subsystem that provides 3D model editing capabilities.
  • updates to the 3D model may be propagated to the review stations to update the viewpoint images in real time or in near real time.
  • a coordinator may have the ability to receive, process, and then remove a reviewer focus region from the coordinator overview display. For example, a reviewer may highlight a region containing an object in the 3D model that should be modified; this region may then appear on the coordinator's overview image. The coordinator may then make a modification to the 3D model (or make a note for a subsequent update) to address the issue. After making the update, the coordinator may remove the item from the queue of review issues requiring attention; this action may result in the graphical indicator for this region being removed from the coordinator's overview screen.
  • Figure 1 illustrates a flowchart of at least one embodiment of a method for creating 3D virtual reality from 2D images.
  • Figure 1A illustrates a flowchart of at least one embodiment of a method for generating 3D virtual reality in which stereo images are generated using spherical depth information that positions 2D pixels relative to the surface of a sphere in 3D space.
  • Figure 1B illustrates a flowchart of at least one embodiment of a method for generating 3D virtual reality in which stereo images are generated first in 2D, then projected onto separate left and right spheres, and finally converted to stereo in a 3D viewing environment.
  • Figure 2 illustrates an embodiment of a step to capture a series of 2D images of an environment by aiming a camera in varying horizontal and vertical angles.
  • Figure 3 illustrates an embodiment of a step to stitch sets of 2D images together into integrated 2D images; in this illustration four 2D images from Figure 2 are stitched together into an integrated image by aligning matched features.
  • Figure 4 illustrates an embodiment of a step to project integrated 2D images onto a spherical surface.
  • Figure 5 illustrates an example of a spherical surface image that results from projecting 2D images onto a sphere.
  • Figure 6 illustrates an example of a spherical surface image that is unwrapped into a panoramic 2D image. (Note that this scene is different from the scene shown in Figure 5.)
  • Figure 7 illustrates an embodiment of a step of dividing the unwrapped image of Figure 6 into multiple regions; each region is color-coded with a color mask to identify its boundaries.
  • Figure 8 illustrates an embodiment of a step of assigning a depth map to the points of each of the regions of Figure 7; darker points are closer to the viewer.
  • Figure 9 illustrates an anaglyph 3D stereoscopic view of the scene depicted in Figure 7 with the depth map of Figure 8.
  • Figure 10 illustrates an embodiment of a step of creating a spherical depth map from a 2D depth map as outlined in the flowchart of Figure 1A.
  • Figure 11 illustrates an embodiment of a step of generating left and right stereo 2D views from a spherical depth map.
  • Figure 12 illustrates an embodiment of a step of generating separate left and right spherical surface images, which are subsequently used to create stereo views, using a 2D image and a 2D depth map, as outlined in the flowchart of Figure 1B.
  • Figure 13 illustrates an embodiment of a division of a 2D image of a human figure into regions, with each region assigned a distinct color for identification.
  • Figure 14 illustrates an embodiment of positioning and orienting the regions of Figure 13 in 3D space in order to generate depth information.
  • Figure 15 illustrates an embodiment of a depth map generated from the 3D model of Figure 14.
  • Figure 16 illustrates the 2D images captured from a scene as shown in Figure 2, highlighting features where modifications are desired to the images in order to form the 3D virtual reality environment.
  • Figure 17 illustrates an unwrapped image formed from the 2D images of Figure 16 with the desired modifications made to the image.
  • Figure 18 illustrates an extension of the flowchart of Figure 1, which shows an iterative cycle of reviewing images, updating a 3D model, and re-rendering a scene.
  • Figure 19 illustrates an embodiment of the method that shortens the iterative review and update cycle shown in Figure 18 using modifications to a spherical translation map that result in immediate changes to the stereo images.
  • Figure 20 illustrates a graphical user interface for an embodiment of the method, which provides controls for an editor to modify the translations of selected objects in a virtual reality scene.
  • Figure 21 shows an illustrative panoramic image in anaglyph stereo format; the top image is modified by the edits specified in Figure 20, resulting in the bottom image.
  • Figure 22 shows a different edit for the translations of objects in the graphical user interface shown in Figure 20.
  • Figure 23 shows another illustrative panoramic image in anaglyph stereo format; the top image is modified by the edits specified in Figure 22, resulting in the bottom image.
  • Figure 24 shows an architectural overview of an embodiment of a system that enables multiple reviewers to review a 3D model simultaneously from different viewpoints, with the reviews coordinated by a coordinator.
  • Figure 25 illustrates an embodiment of a multi-reviewer system that shows the current field of view of each reviewer as an overlay on the coordinator's screen.
  • Figure 26 illustrates a variation of the embodiment of Figure 25, with a colored dot shown on the coordinator's screen for the center of focus of each reviewer.
  • Figure 27 illustrates a variation of the embodiment of Figure 26, with an additional capability for a reviewer to highlight an area on the screen and have this area displayed on the coordinator's overview screen.
  • Figure 28 illustrates an embodiment of a multi-reviewer system that includes a capability for the coordinator to modify the 3D model; in this example the coordinator increases the size of a selected object.
  • Figure 29 illustrates the embodiment of Figure 28 after the object's size has been increased; it also illustrates actions a coordinator may take to process the highlighted region selected by a reviewer.
  • Figure 30 illustrates the embodiment of Figure 29 after the coordinator has completed the modification, with the highlighted region no longer shown since the edits are finished.
  • Figure 1 illustrates a flowchart of at least one embodiment of a method for creating 3D virtual reality from 2D images, including exemplary components that may be utilized therewith.
  • In step 101, multiple 2D images are obtained of environment 100, yielding a set of 2D images 102.
  • Environment 100 may for example be a room, an office, a building, a floor, a house, a factory, a landscape, or any other scene or combination of scenes for which a virtual reality experience is to be created.
  • This environment may be real, or it may itself be virtual or computer generated, or it may be a mix of real elements and computer generated elements.
  • the step 101 of obtaining 2D images may use, for example, one or more cameras (including single-image and video cameras), or any other sensor or sensors that capture non-visible frequencies, such as radar or lidar.
  • the cameras may be virtual and the 2D images may be viewing projections of a 2D or 3D computer generated scene.
  • the depth information associated with an image may be accepted by the system, for example by providing user interface elements for users to draw masks on regions or user interface elements to set parameters for auto-generation of masks for regions of luminance, color or other image parameters.
  • the system thus enables the user to assign depth information to regions in the 2D image or alternatively or in combination, obtain depth via radar or lidar to generate a depth map for example.
  • Step 103 stitches together subsets of the 2D images 102 into integrated 2D images 104.
  • the stitching process combines and aligns 2D images and eliminates overlap among the 2D images.
  • Stitching step 103 may combine all 2D images into a single panorama, or it may combine subsets of 2D images into various panoramic images that cover portions of the entire environment 100. Different embodiments may employ different stitching strategies.
  • Integrated images may cover all or any portion of the sphere of view directions visible from one or more cameras.
  • Any known technique for stitching together multiple images into a composite integrated image may be utilized. Stitching may be done manually, automatically, or with a hybrid manual-automatic procedure where one or more operators make rough stitching and software completes the smooth stitch. Embodiments of the invention may utilize any or all of these approaches.
  • Automated stitching typically aligns the overlap regions of multiple images using a best-fit based on feature differences or on pixel differences.
  • Feature-based methods perform a feature extraction pass first on the images, and then find the location of similar features in multiple images for alignment. See for example: M. Brown and D. Lowe (2007). Automatic Panoramic Image Stitching using Invariant Features. International Journal of Computer Vision, 74(1).
  • Pixel-based methods minimize the pixel differences between images in their regions of overlap. See for example: Suen et al. (2007). Photographic stitching with optimized object and color matching based on image derivatives. Optics Express, 15(12).
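  • For illustration, a feature-based stitch along these lines can be sketched with OpenCV's built-in stitcher (one possible implementation of step 103, not the specific method required by the invention):

```python
import cv2  # assumes the opencv-python package is installed

def stitch_images(image_paths):
    """Stitch a subset of 2D images into an integrated panoramic image using
    OpenCV's feature-based stitcher."""
    images = [cv2.imread(path) for path in image_paths]
    stitcher = cv2.Stitcher_create(cv2.Stitcher_PANORAMA)
    status, panorama = stitcher.stitch(images)
    if status != cv2.Stitcher_OK:
        raise RuntimeError(f"Stitching failed with status code {status}")
    return panorama
```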
  • In step 105, the integrated 2D images 104 are projected onto a spherical surface.
  • these projections may use a measured or estimated position and orientation of the camera or cameras used in step 101 to capture the 2D images that formed the integrated 2D images.
  • the output of step 105 is a spherical surface image 106.
  • a spherical surface represents an approximate 3D model of the location of the objects in the environment 100; this approximate model places all points on the objects equidistant from the center of the sphere. Adjustments to this approximate 3D model may be made in subsequent steps using depth information to form more realistic models of the environment.
  • In step 107, the spherical surface image 106 is "unwrapped" onto an unwrapped plane image 108.
  • This unwrapping process may use any of the spherical-to-plane projection mappings that are known.
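  • As one example of such a mapping (an equirectangular projection; the patent allows any known spherical-to-plane projection), a direction on the sphere can be mapped to coordinates in the unwrapped plane image as follows:

```python
import numpy as np

def unwrap_direction(direction, width, height):
    """Map a 3D direction on the spherical surface image to (u, v) pixel
    coordinates in an equirectangular unwrapped plane image.
    Conventions (y axis up, yaw measured from +z) are assumptions."""
    x, y, z = np.asarray(direction, float) / np.linalg.norm(direction)
    yaw = np.arctan2(x, z)                      # longitude in [-pi, pi]
    pitch = np.arcsin(np.clip(y, -1.0, 1.0))    # latitude in [-pi/2, pi/2]
    u = (yaw + np.pi) / (2 * np.pi) * width
    v = (np.pi / 2 - pitch) / np.pi * height
    return u, v
```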
  • In step 109, the unwrapped plane image 108 is divided into regions 110.
  • This step may be done by one or more operators, or it may be assisted by software. For example, software may tentatively generate region boundaries based on shapes, colors, or textures of objects in the unwrapped image 108.
  • depth information 112 is assigned to the points of the regions 110.
  • the depth information is used in subsequent steps to generate the 3D stereo images for a virtual reality experience.
  • Depth information reflects how far away each point of each region is from a viewer.
  • Many techniques for defining and using depth information are known in the art; any of these techniques may be used for generating or using the depth information 112.
  • depth information may comprise depth maps, bump maps, parallax maps, U maps, UV maps, disparity maps, ST maps, point clouds, z maps, offset maps, displacement maps, or more generally any information that may provide a three-dimensional shape or three-dimensional appearance to an image. Assigning of depth information may be done by one or more operators.
  • software may be used to assist the step 111 of assigning depth information. For example, operators may be able to rotate or reposition entire regions in a 3D scene, and depth information may be generated automatically for the regions based on this 3D positioning. Software may also be used to generate curved regions or to blend depth information at boundaries between regions.
  • the depth information 112 and the unwrapped image 108 are used as inputs to generate stereo images for a viewer at viewer position and orientation 113.
  • the stereo images consist of a left eye image 115 and a right eye image 116.
  • Any of the commonly available stereo 3D vision technologies such as special viewing glasses used to see 3D movies, may be used by the viewer to view the virtual reality environment in 3D using these stereo images.
  • viewers may use glasses with different colored left and right lenses, or glasses with different polarization in left and right lenses, or glasses with LCD lenses that alternately show left and right images.
  • Figures 2-9 illustrate exemplary embodiments of the steps of Figure 1 in greater detail, while Figures 1A and 1B are discussed after Figures 2-9.
  • Figure 2 illustrates an embodiment of step 101 - obtaining 2D images of environment 100.
  • the environment 100 is a room with a table 203 approximately in the center of the room.
  • a series of 2D images 201 is obtained of the room using a camera aimed at different angles.
  • the 2D images 201 are captured in five rows 202a, 202b, 202c, 202d and 202e, where each row corresponds to a vertical angle for the camera. Within each row the camera is aimed at 12 different horizontal angles, with an increment 204 of approximately 30° between each horizontal angle, forming a complete 360° panorama of the scene 100.
  • Other embodiments of the invention may use different techniques and angle increments for capturing a series of 2D images to cover a desired portion of an environment 100.
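  • A small sketch of this capture pattern simply enumerates the camera orientations (the exact per-row elevations are not stated in the text, so evenly spaced pitches are assumed here for illustration):

```python
def capture_angles(columns=12, horizontal_increment=30.0,
                   row_pitches=(60.0, 30.0, 0.0, -30.0, -60.0)):
    """Enumerate (pitch, yaw) camera angles for a capture pattern like Figure 2:
    five vertical rows, each swept through twelve 30-degree horizontal steps."""
    return [(pitch, column * horizontal_increment)
            for pitch in row_pitches
            for column in range(columns)]

angles = capture_angles()
print(len(angles))  # 60 orientations covering a full 360-degree panorama
```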
  • Figure 3 illustrates an embodiment of step 103 - stitching together 2D images into integrated images.
  • Images 301, 302, 303, and 304 are individual 2D images from row 202e of Figure 2.
  • a manual or automated scan for shared features identifies, for example, the back of the chair which appears as 305a in image 301 and as 305b in image 302, and the right front table leg which appears as 306a in image 303 and as 306b in image 304. Aligning the images on these (and other) shared features produces the rough stitch 307.
  • Different embodiments of the invention may use various grouping strategies for stitching together integrated images in step 103.
  • all of the images in a row may be stitched together, or portions of rows (as shown here in Figure 3) may be stitched together. Stitching may also be done vertically (in columns) in addition to or instead of horizontally (in rows), or with mixed approaches to combine similar images.
  • Figure 4 illustrates an embodiment of step 105 - projecting the integrated 2D images onto spherical surface 403.
  • Many techniques are known in the art for projecting plane images onto a sphere and for the reverse process of projecting spherical images onto a plane. These techniques are similar to known techniques of cartography, which generate 2D map images of a spherical surface or a portion thereof.
  • maps may use projections such as Mercator, Lambert cylindrical, Azimuthal, Orthographic, and many others. Projecting 2D plane images onto a sphere amounts to reversing the normal map projections that project a sphere onto plane images.
  • Figure 4 illustrates a spherical projection that may be used in one or more embodiments of the invention.
  • each 2D image is considered to be a plane perspective projection of a spherical surface using a fixed point of perspective for the projection.
  • the projection of the 2D image to the sphere simply reverses the perspective projection.
  • Multiple images may be projected to the same spherical surface using location and orientation of the camera when each image was captured.
  • In Figure 4, 2D images 401 and 402 are projected onto the sphere 403.
  • Sphere 403 has center point c 404, and radius R 405.
  • Image 401 was obtained using a camera with a viewer located at point v1 406;
  • image 402 was obtained using a camera with a viewer located at point v2 410.
  • the orientation of the planes of images 401 and 402 correspond to the orientation of the cameras used to capture those images.
  • Each point on the 2D images 401 and 402 is projected to the sphere along a ray from the camera's viewer.
  • point p 407 is projected onto point q 408 on sphere 403. Since the ray from v1 through p is parameterized as {v1 + t(p - v1) : t >= 0}, point q can be obtained easily by finding the parameter t such that ||v1 + t(p - v1) - c|| = R.
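  • Solving for t is a standard ray/sphere intersection; a small sketch (variable names chosen to match the notation above) is:

```python
import numpy as np

def project_point_to_sphere(p, v1, c, R):
    """Find q = v1 + t*(p - v1) with t >= 0 such that ||q - c|| = R, i.e. project
    image point p onto the sphere of center c and radius R along the ray from
    the camera viewpoint v1."""
    p, v1, c = (np.asarray(a, float) for a in (p, v1, c))
    d = p - v1
    m = v1 - c
    a = d.dot(d)
    b = 2.0 * m.dot(d)
    k = m.dot(m) - R * R
    disc = b * b - 4.0 * a * k
    if disc < 0:
        raise ValueError("ray does not intersect the sphere")
    t = (-b + np.sqrt(disc)) / (2.0 * a)  # positive root when v1 lies inside the sphere
    return v1 + t * d

q = project_point_to_sphere(p=[0.5, 0.2, 1.0], v1=[0.1, 0.0, 0.0], c=[0.0, 0.0, 0.0], R=5.0)
```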
  • Figure 5 illustrates a spherical projection of the 2D images from Figure 2. Images are projected onto spherical surface 501. For example, the 2D table image 203 from Figure 2 appears as image 502 on sphere 501.
  • Figure 6 illustrates an unwrapped image obtained from a spherical projection via step 107 - unwrap onto plane image. Converting the spherical image to a plane unwrapped image amounts to reversing the projections illustrated in Figure 4 using a single projection of the sphere onto a plane. Note that the scene illustrated in Figure 6 is not the same scene illustrated in Figures 2 and 4. In this unwrapped image the straight edges of the rug appear as curved lines such as 601. The unwrapped image is a 360 degree panorama; for example the left edge 602a of the chair corresponds to the right edge 602b.
  • Figure 7 illustrates an embodiment of step 109 - dividing the unwrapped image into regions - applied to the unwrapped image of Figure 6.
  • Each region is indicated by a different color mask.
  • the blue mask 701 defines the rug in the center of the room.
  • the system may enable the user to define masks for regions in the image and accept input for the masks by an operator, for example by implementing software on a computer system specifically for that purpose, or using a combination of methods.
  • the rug 601 in Figure 6 has a distinct color and pattern that may be used to automatically or semi-automatically identify the blue mask region 701 in Figure 7.
  • the system may thus enable the user to input a foreground distance and background distance for mask region 701 for example.
  • radar or lidar data may be obtained and utilized to auto-generate depths for portions of the image or for the entire image, or for masks or regions therein.
  • Figure 8 illustrates an embodiment of step 111 - generating depth information for the points of the regions defined in step 109.
  • the depth information is encoded as a depth map, with points closer to the viewer shown with darker shades of grey, and points further from the viewer shown with lighter shades of grey.
  • the front edge 801 of the table in the center of the room has a dark shade since it is close to a viewer in or near the center of the room; the wall 802 behind the couch has a lighter shade since it is further from the viewer.
  • Operators may assign depth information to individual pixels, or they may use the region masks to assist in defining depth information by positioning and rotating the regions in three dimensional space. Numerical depth information that is not visible, for example compressed or encoded depth, may also be utilized.
  • Figure 9 illustrates an embodiment of step 114 - generating stereo images.
  • the unwrapped image from Figure 6 is combined with the depth map from Figure 8 to generate left and right eye images, which are superimposed here on the same anaglyph image.
  • This anaglyph image provides a 3D stereoscopic view of the scene when viewed through anaglyph glasses with different color filters in the two lenses.
  • the amount of shift between left eye and right eye images for each pixel is a function of the depth map for that pixel.
  • embodiments of the invention may use various techniques in step 114 to generate the stereo images 115 and 116. As illustrated in Figure 9, embodiments may use the depth information 112 to shift the display of pixels in left eye images versus right eye images, with closer pixels being shifted a greater amount.
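  • A simplified sketch of this shifting (a naive forward-mapping version that ignores occlusion ordering and leaves gaps unfilled; the max_offset parameter is illustrative) is shown below:

```python
import numpy as np

def shift_pixels_by_depth(image, depth, max_offset=20):
    """Generate left and right eye images by shifting pixels horizontally
    according to depth, with closer pixels shifted by a greater amount."""
    h, w = depth.shape
    near, far = depth.min(), depth.max()
    closeness = (far - depth) / max(far - near, 1e-6)   # 1 = nearest, 0 = farthest
    offsets = np.round(closeness * max_offset).astype(int)

    left = np.zeros_like(image)
    right = np.zeros_like(image)
    for y in range(h):
        for x in range(w):
            o = offsets[y, x]
            if 0 <= x + o < w:
                left[y, x + o] = image[y, x]
            if 0 <= x - o < w:
                right[y, x - o] = image[y, x]
    return left, right
```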
  • Figures 1A and 1B illustrate specific techniques that may be used by some embodiments of the invention to perform step 114.
  • FIG. 1A illustrates a technique wherein step 114 comprises two additional steps 120 and 123.
  • In step 120, the unwrapped image 108 and the depth information 112 are both projected onto a sphere. This process yields spherical image 121 and spherical depth information 122.
  • Figure 10 illustrates the generation of a spherical depth map, which is an example of spherical depth information 122.
  • Spherical depth information may comprise for example, without limitation, spherical depth maps, spherical bump maps, spherical parallax maps, spherical U maps, spherical UV maps, spherical disparity maps, spherical ST maps, spherical point clouds, spherical z maps, spherical offset maps, spherical displacement maps, or more generally any information that may provide a three-dimensional shape or three-dimensional appearance to a spherical surface.
  • 2D depth map 1001 contains two regions: region 1002 is closer to the viewer (hence shaded darker) and region 1003 is further from the viewer (hence shaded lighter).
  • Depth map 1001 is projected via projection 1004 to the sphere 1005. Regions with greater depth (further from the viewer) are pushed further away from the center of the sphere to form a spherical depth map. Hence spherical region 1006, corresponding to planar region 1002, is closer to the center of sphere 1005 than spherical region 1007, corresponding to planar region 1003.
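  • A sketch of this projection, assuming the 2D depth map is laid out as an equirectangular image (a convention, not a requirement of the patent), places each pixel at its depth along the corresponding ray from the sphere center:

```python
import numpy as np

def spherical_depth_points(depth_map):
    """Convert an equirectangular 2D depth map into 3D points around the sphere
    center, with deeper pixels pushed farther from the center as in Figure 10."""
    h, w = depth_map.shape
    v, u = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    yaw = (u + 0.5) / w * 2 * np.pi - np.pi
    pitch = np.pi / 2 - (v + 0.5) / h * np.pi
    directions = np.stack([np.cos(pitch) * np.sin(yaw),     # x
                           np.sin(pitch),                   # y (up)
                           np.cos(pitch) * np.cos(yaw)],    # z
                          axis=-1)
    return directions * depth_map[..., None]   # shape (h, w, 3)
```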
  • spherical image 121 and spherical depth information 122 are used in step 123 to generate left eye image 115 and right eye image 116.
  • Figure 11 illustrates this process in greater detail.
  • Viewer 1110 is observing a virtual reality scene from a specific position and orientation.
  • the viewer 1110 has a left eye position 1111 and a right eye position 1112.
  • 2D images and a 2D depth map have been projected onto sphere 1005.
  • the image contains black colored region 1101, which is located near the center of the sphere, and speckled region 1102, which is located far from the center of the sphere.
  • the left eye image is formed by projecting points from the sphere onto the left eye plane 1113a; similarly the right eye image is formed by projecting points from the sphere onto the right eye plane 1114a.
  • the center point of region 1101 is projected onto point 1115a in the left eye plane, and onto point 1117a in the right eye plane.
  • the center point of region 1102 is projected onto point 1116a in the left eye plane, and onto point 1118a in the right eye plane.
  • The left eye image 1113b and the right eye image 1114b show that the relative shift of the regions 1101 and 1102 depends on the depth of each region on the spherical depth map:
  • Left eye image 1115b of region 1101 is offset significantly from right eye image 1117b of region 1101
  • left eye image 1116b of region 1102 is only slightly offset from right eye image 1118b of region 1102.
  • Figure 1B illustrates a different technique for forming left and right eye images that may be used by one or more embodiments of the invention.
  • this technique first forms planar stereo images and then projects these onto left and right spheres.
  • This is an alternative technique to that shown in Figure 1A, which first projects onto a sphere, and then forms stereo images thereafter.
  • the unwrapped planar image 108 is combined with the depth information 112 to form a left unwrapped image 131 and a right unwrapped image 132.
  • Each of the unwrapped images 131 and 132 is then projected onto a sphere, forming a left sphere 135 and a right sphere 136.
  • the spheres 135 and 136 are projected on left eye image 115 and right eye image 116 respectively.
  • Figure 12 illustrates an embodiment of the details of steps 130, 133 and 134.
  • Unwrapped planar image 108 contains regions 1202 and 1203.
  • Depth map 112 assigns a depth 1205 to region 1202 and a depth 1206 to region 1203; the darker shading of depth 1206 vs. depth 1205 indicates that object 1203 is closer to the viewer than object 1202.
  • Step 130 combines unwrapped image 108 and depth map 112 to form left unwrapped image 131 and right unwrapped image 132.
  • the left image position 1202a of object 1202 is offset only slightly from the right position 1202b of object 1202, based on object 1202's greater depth, whereas the left image position 1203a of object 1203 is offset considerably from the right position 1203b of object 1203.
  • In step 133, the left unwrapped image 131 is projected onto left sphere 135.
  • In step 134, the right unwrapped image 132 is projected onto right sphere 136.
  • These spherical projections can be performed as previously described using any suitable plane-to- sphere projection technique.
  • Left planar image 1202a is projected onto left sphere image 1202c
  • right planar image 1202b is projected onto right sphere image 1202d.
  • left planar image 1203a is projected onto left sphere image 1203c
  • right planar image 1203b is projected onto right sphere image 1203d.
  • the steps 137 and 138 generate left eye image 115 and right eye image 116 from the spherical images 135 and 136 respectively. Any of the previously described techniques or any other known technique for projecting from a sphere to a plane may be used for these steps 137 and 138.
  • the depth information 112 may be generated by assigning a depth to the points of the regions 110 of the unwrapped image.
  • one or more portions of the depth information may be generated by defining a flat or curved surface for one or more of the regions, and positioning and orienting these surfaces in three-dimensional space using rotations and translations.
  • the depth information for the points of a region can then be generated automatically using the three-dimensional model of the region surface, simply by picking a view position and calculating the depth of each point as the distance from the view position to the point.
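  • A minimal sketch of this calculation for a flat region surface (intersecting each pixel's viewing ray with the region's plane and taking the distance from the view position) might be:

```python
import numpy as np

def region_depth_from_plane(view_pos, ray_dirs, plane_point, plane_normal):
    """Depth for region pixels given a flat surface positioned in 3D space:
    intersect each unit viewing ray with the plane and return the distance
    from the view position to the intersection."""
    view_pos = np.asarray(view_pos, float)
    n = np.asarray(plane_normal, float)
    n = n / np.linalg.norm(n)
    t = ((np.asarray(plane_point, float) - view_pos) @ n) / (ray_dirs @ n)
    return t   # distance along each unit ray = depth of that region point

# Example: three viewing rays hitting a wall three units in front of the viewer.
rays = np.array([[0.0, 0.0, 1.0], [0.1, 0.0, 1.0], [0.0, 0.1, 1.0]])
rays = rays / np.linalg.norm(rays, axis=1, keepdims=True)
print(region_depth_from_plane([0, 0, 0], rays, plane_point=[0, 0, 3], plane_normal=[0, 0, -1]))
```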
  • Figures 13 through 15 illustrate an embodiment of this procedure for positioning and orienting region surfaces or masks.
  • Embodiments of the invention may utilize any and all methods and apparatus described in United States Patent Application entitled "EXTERNAL DEPTH MAP TRANSFORMATION METHOD FOR CONVERSION OF TWO-DIMENSIONAL IMAGES TO STEREOSCOPIC IMAGES", U.S. Serial No. 13874625, filed 1 May 2013, the specification of which is hereby incorporated herein by reference.
  • a 2D image of a human figure is divided into regions.
  • Each region is assigned a distinct color for identification.
  • Region 1301 is one side of the figure's chest; region 1302 is the front of the figure's neck.
  • a surface corresponding to each region is positioned and oriented in 3D space.
  • Surface 1401 in Figure 14 corresponds to region 1301 in Figure 13
  • surface 1402 in Figure 14 corresponds to region 1302 in Figure 13.
  • the regions 1301 and 1302 may be positioned using the generally noisy depth information as shown in Figure 14.
  • Surfaces may be flat, or they may be curved surfaces such as for example Bezier surfaces or NURBS surfaces.
  • each surface may be positioned or adjusted by an operator using 3D editing or compositing tools.
  • software may assist in determining approximate positioning and orientation of each surface, or in applying constraints reflecting joints along surface boundaries. This enables the elimination of depth noise in the data to effectively smooth the regions shown in Figure 14 to have edges that approximate the edges shown in Figure 13 for example.
  • In steps 109 or 111 or both of Figures 1, 1A and 1B, embodiments of the invention may generate regions and/or obtain depth information from an external system or sensor, for example one separate from, coupled to, or combined with the camera or cameras utilized to obtain the 2D images in step 101.
  • the system may auto-generate masks for the regions within a certain tolerance or curve and calculate best fit for the planar or curved mask in step 109.
  • Figure 15 shows a depth map generated from the 3D positions and orientations of the surfaces in Figure 14, for example through use of a function to eliminate noise and/or discontinuities in regions or masks, as performed by accepting user input or through use of smoothing algorithms or any combination thereof. Darker pixels indicate points closer to the viewer. Point 1501 in Figure 15 has a darker shade than point 1502, reflecting the positioning of surface 1401 in Figure 14 closer to the viewer than surface 1402.
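  • One simple way to perform such smoothing for a flat region (a least-squares best-fit plane to the noisy per-pixel depths; curved regions such as Bezier or NURBS surfaces would use a higher-order fit) is sketched below:

```python
import numpy as np

def best_fit_plane_depth(xs, ys, noisy_depths):
    """Replace noisy per-pixel depths within a region by the best-fit plane
    z = a*x + b*y + c, eliminating depth noise and discontinuities."""
    A = np.column_stack([xs, ys, np.ones_like(xs)])
    coeffs, *_ = np.linalg.lstsq(A, noisy_depths, rcond=None)
    return A @ coeffs   # smoothed depth for each (x, y) sample

xs = np.array([0.0, 1.0, 0.0, 1.0, 0.5])
ys = np.array([0.0, 0.0, 1.0, 1.0, 0.5])
noisy = np.array([2.0, 2.1, 2.4, 2.5, 2.3])
print(best_fit_plane_depth(xs, ys, noisy))
```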
  • modifications may be made to the images captured from the scene in order to create a modified 3D virtual reality environment. Such modifications may include additions, deletions, modifications, or any combinations of these changes to the images. Modifications may be made in some embodiments to the original captured 2D images, to the stitched integrated images, to the spherical projection, to the unwrapped plane image, to the depth information, to the stereo images, or to any combinations of these.
  • Figures 16 and 17 illustrate an embodiment with modifications made to the unwrapped plane image.
  • Figure 16 shows an example of a series of 2D images 201 captured from a scene, as is illustrated also in Figure 2.
  • Image 1601 contains equipment 1602 that was put in place to capture the images of the scene, as well as an image of an operator who was capturing the scene; it is desired to remove these items from the virtual reality environment.
  • Image 1603 shows wall 1604 behind the desk; it is desired to add an image of a person in front of this wall in the virtual reality environment.
  • Image 1605 shows a portion of the legs 1606 of the desk, but the 2D images did not capture the entire image of the legs; it is desired to extend these legs to form the complete desk in the virtual reality environment.
  • Figure 17 illustrates an unwrapped image 1701 formed from the images of Figure 16, with the desired modifications made to the unwrapped image.
  • Equipment 1602 is removed from 1701 at location 1702.
  • Human figure 1703 is inserted into empty area 1604.
  • the table legs 1606 are extended to form complete table legs 1704.
  • These modifications may be made using tools and techniques commonly utilized in the art for photo and image compositing.
  • Objects that are inserted or extended require modifications to the unwrapped image, or elsewhere in the processing chain, as well as depth information.
  • the composite image and depth information may be generated in multiple layers, so that multiple objects may exist along a radius from the center of the viewing sphere, but at different depth locations. With multi-layered depth information, a viewer in the virtual reality environment may see certain objects exposed as he changes his view position. This technique provides for automatic gap-filling as pixels in the stereographic images are shifted to provide the 3D view.
  • As illustrated in Figure 18, step 111a (Assign Depth Map to Points of Regions) may comprise two sub-steps: 1801 (Create 3D Model) and 1802 (Render).
  • creating the 3D model may for example involve mapping images or regions to planes or other surfaces and positioning these planes or surfaces in 3D space.
  • Rendering may for example involve generating spherical or cylindrical projections from these planes and surfaces.
  • stereoscopic images 115 and 116 are generated in step 114, as discussed above.
  • a review step 1803 may be necessary or desirable to ensure that the 3D model created in step 1801 is accurate and meets the requirements of the project. For example, mapping of images onto 3D surfaces may in some cases introduce artifacts that are not apparent until the stereoscopic images 115 and 116 are reviewed. In other cases the images may be free of artifacts, but the depth assigned to regions or objects may not match the desired structure or artistic effects for the virtual reality environment. Therefore, the review 1803 may in many cases lead to additional modifications 1801 to the 3D model. Normally these changes require an additional rendering step 1802, followed by another stereoscopic image generation step 114, and another review 1803. The process may iterate through several loops 1804 before the final 3D model of the virtual reality environment is complete. This iterative process may be time consuming, particularly since the modifications 1801 to the 3D model and the rendering process 1802 may be computationally intensive.
  • Figure 19 illustrates an embodiment of the method that shortens the iterative review/update/render process of Figure 18.
  • the depth information 112 is used directly for updates to the stereoscopic images 115 and 116.
  • Steps 1801 and 1802 are used for the initial 3D model, and possibly for major modifications, but many or all modifications after the initial modeling may be made using the simpler and faster process described below.
  • Depth information 112 is used to generate a spherical translation map 1901. This spherical translation map encodes information that determines the degree of horizontal pixel shifting between the left eye image 115 and the right eye image 116.
  • One or more embodiments may use any desired format for the spherical translation map.
  • a spherical translation map may be, for example, without limitation, a parallax map, a disparity map, a depth map, a U map, a UV map, or any combination of these maps. Any information that determines the amount of pixel translation between left and right images is in keeping with the spirit of the invention.
  • Figure 19 illustrates a spherical translation map, since the 3D model is mapped to a sphere in one or more embodiments.
  • One or more embodiments may use other geometries for the virtual environment, such as for example a cylinder or a cube; in these embodiments the spherical translation map would be modified for these geometries.
  • one or more embodiments may use a cylindrical translation map or a cubical translation map instead of or in addition to a spherical translation map.
  • any required depth modifications may be performed in step 1902 (Modify Translations), which directly edits the spherical translation map 1901.
  • the modified spherical translation map 1901 is then used directly at input 1903 to modify the left eye image 115 and the right eye image 116. This process may be iterated over multiple loops 1904. Because the steps 1801 and 1802 of creating the 3D model and rendering from this 3D model are not needed, the iterations 1904 may be very rapid and may be viewed in real-time by the editor or team performing the reviews and updates. This real-time review-update cycle may greatly improve the efficiency of the method for creating a 3D virtual environment.
  • the spherical translation map 1901 is a parallax map, which encodes the amount of horizontal pixel offset between left eye image 115 and right eye image 116
  • modifications to this parallax map may be applied directly and rapidly to the images.
  • Changes in offsets may be applied by shifting pixels in left and right images by the changes in the offsets. Re-rendering from the 3D model is not required.
  • modifying the shifts of pixels may generate gaps in the images with missing pixels. Techniques known in the art for gap-filling may be used to generate the missing pixels.
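  • The following is a minimal, hypothetical sketch (not taken from the disclosure) of how horizontal pixel shifts derived from a parallax map might be applied to produce a left/right image pair, with a very simple row-wise fill for the gaps mentioned above; the array shapes and the half-shift convention are assumptions made for illustration.
```python
import numpy as np

def apply_parallax(image, parallax):
    """Shift pixels horizontally by +/- half the parallax to form a stereo pair.

    image:    H x W x 3 array (a single panoramic frame).
    parallax: H x W array of signed horizontal offsets in pixels.
    Returns (left, right) images; unfilled pixels are then filled from the
    nearest filled pixel to their left on the same row (a crude gap fill).
    """
    h, w, _ = image.shape
    left = np.zeros_like(image)
    right = np.zeros_like(image)
    filled_l = np.zeros((h, w), dtype=bool)
    filled_r = np.zeros((h, w), dtype=bool)

    cols = np.arange(w)
    for y in range(h):
        half = np.round(parallax[y] / 2.0).astype(int)
        xl = np.clip(cols + half, 0, w - 1)   # left-eye target columns
        xr = np.clip(cols - half, 0, w - 1)   # right-eye target columns
        left[y, xl] = image[y, cols]
        right[y, xr] = image[y, cols]
        filled_l[y, xl] = True
        filled_r[y, xr] = True

    # Crude gap filling: copy the nearest filled pixel from the left.
    for target, filled in ((left, filled_l), (right, filled_r)):
        for y in range(h):
            for x in range(1, w):
                if not filled[y, x]:
                    target[y, x] = target[y, x - 1]
                    filled[y, x] = True
    return left, right
```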
  • One or more embodiments may use a graphical user interface to perform any of the steps of the method, including for example the step 1902 of modifying the spherical translation map.
  • Figure 20 shows an illustrative graphical user interface screen 2001 that may be used by one or more embodiments.
  • a spherical translation map 2002 is shown with the grayscale of each pixel corresponding to the amount of translation between left and right images at that pixel location.
  • One or more embodiments may use any spherical translation map that can be used to determine the amount of pixel shifting between left and right images.
  • the translation map 2002 shown in Figure 20 is the depth map shown in Figure 8 of the virtual environment shown in Figures 6 and 7 and discussed above.
  • a depth map is illustrative; other embodiments of the user interface 2001 may for example use a spherical parallax map or a spherical displacement map. (In general the amount of pixel shifting between left and right images is inversely proportional to depth, as is known in the art.)
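  • As a hedged illustration of the inverse relationship noted above, the sketch below converts a depth map into a parallax (pixel-shift) map; the interocular separation, scale factor, and clamping value are placeholder numbers invented for the example and are not taken from the disclosure.
```python
import numpy as np

def depth_to_parallax(depth, eye_separation=6.5, scale=100.0, max_shift=40.0):
    """Convert a depth map to a horizontal parallax map (in pixels).

    Pixel shift is taken to be inversely proportional to depth: nearer
    surfaces receive larger left/right offsets. The constants here are
    placeholder values for illustration only.
    """
    depth = np.maximum(np.asarray(depth, dtype=float), 1e-3)  # avoid divide-by-zero
    parallax = eye_separation * scale / depth
    return np.clip(parallax, 0.0, max_shift)

# Example: a 2x3 depth map in arbitrary units.
demo_depth = np.array([[100.0, 200.0, 400.0],
                       [ 50.0, 100.0, 800.0]])
print(depth_to_parallax(demo_depth))
```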
  • User interface screen 2001 includes a list of objects 2003. These objects may for example have been defined during the segmenting of the original stitched image into regions. One or more embodiments may use any desired method to create and modify masks that define objects.
  • the objects 2003 are selectable, with the objects highlighted in gray currently selected.
  • a selection boundary is shown in the translation map 2002 for each selected object; for example the boundary 2004 shows the selected rocking chair object.
  • the user interface provides one or more controls to edit the pixel translations of the selected objects.
  • the illustrative user interface 2001 in Figure 20 provides three controls: an offset control 2005 that adds or subtracts a fixed number of pixels to the translations for the selected objects; a scaling control 2006 that multiplies the pixel translations by a scaling factor; and a ramp control that applies a gradient to the pixel translations in the x and y directions.
  • These controls are illustrative; one or more embodiments may use any controls and any user interface to support any type of modifications to the spherical translation map.
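  • The sketch below suggests one way such controls might act on the translation map: an offset adds a constant to the masked pixels, a scale multiplies them, and a ramp applies a linear gradient across the selection. The function name and the mask convention are assumptions made for illustration only, not the disclosed implementation.
```python
import numpy as np

def edit_translations(tmap, mask, offset=0.0, scale=1.0, ramp_x=0.0, ramp_y=0.0):
    """Apply offset/scale/ramp edits to the selected region of a translation map.

    tmap: H x W array of per-pixel translations (e.g., parallax in pixels).
    mask: H x W boolean array marking the selected object(s).
    ramp_x, ramp_y: gradient applied across the selection, in pixels from
                    one edge of the selection to the other.
    Returns a new, edited translation map.
    """
    out = tmap.astype(float).copy()
    ys, xs = np.nonzero(mask)
    if ys.size == 0:
        return out

    sel = out[mask] * scale + offset

    # Normalize selection coordinates to [0, 1] for the ramp gradient.
    x_norm = (xs - xs.min()) / max(xs.max() - xs.min(), 1)
    y_norm = (ys - ys.min()) / max(ys.max() - ys.min(), 1)
    sel += ramp_x * x_norm + ramp_y * y_norm

    out[mask] = sel
    return out

# Example: add a 10-pixel offset to a selected object, as in Figure 20.
tmap = np.full((4, 6), 5.0)
mask = np.zeros((4, 6), dtype=bool)
mask[1:3, 2:5] = True
print(edit_translations(tmap, mask, offset=10.0))
```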
  • the Apply Changes button 2008 applies the pixel translation modifications to the selected objects and regenerates the left and right images accordingly.
  • the offset control 2005 is selected and the offset value 2010 is set to 10 pixels.
  • Figure 21 shows the effect of this modification.
  • the images in Figure 21 are anaglyph images with left and right images overlaid using different color filters.
  • the top image 2101 shows a panoramic image prior to the modifications from Figure 20.
  • the bottom image 2103 shows the panoramic image after applying pixel translation modifications 2102.
  • the couch, rocking chair, and table appear to be pulled further forward in the scene because the left and right pixel shifts for these objects have been increased.
  • Figure 22 shows the same graphical user interface screen 2001 as Figure 20, but scale control 2006 is selected instead of offset control 2005. All objects are selected in list 2003, as shown with boundary 2204 around the entire spherical translation map.
  • the scaling value 2210 is set to 5x, indicating that the pixel translation values of the entire scene should be multiplied by 5.
  • the result of this modification is shown in Figure 23.
  • the top image 2101 as before shows an anaglyph stereoscopic image of the scene prior to the modifications. After modifications 2302, the image 2303 shows that all objects in the scene appear closer to the viewer.
  • 3D models, in particular virtual reality environments, may be viewed from different viewpoints.
  • One or more embodiments enable a review process for validating or correcting a 3D model that uses multiple reviewers operating simultaneously to review the model from different viewpoints.
  • Figure 24 illustrates a block diagram showing components of an embodiment of a multi-reviewer system for reviewing 3D model 2401.
  • This model may be for example a virtual reality environment created from real images, from computer generated elements, or from a mixture of real images and computer generated elements.
  • the model may be either static or dynamic.
  • the 3D model may also be a 3D movie in one or more embodiments.
  • The illustrative example in Figure 24 shows two reviewers, Reviewer 1 and Reviewer 2, and a single Coordinator coordinating the review process. These numbers are illustrative; embodiments of the system may support any desired number of reviewers and coordinators operating in parallel or sequentially.
  • Reviewer 1 operates review station 2410a; Reviewer 2 operates review station 2410b; and the Coordinator operates coordinator station 2420.
  • Each review and coordinator station has access to 3D model 2401.
  • the 3D model may be replicated onto different stations; in other embodiments a central 3D model may be used and stations may access this central model over a network for example. Review stations may allow reviewers to observe the 3D model from various viewpoints.
  • a review station may include a device or subsystem to measure or to obtain input for the pose of the reviewer, in order to determine the desired viewpoint of the 3D model.
  • the pose of a reviewer may include for example the reviewer's orientation or position or both.
  • a review station may include a virtual reality headset that measures the orientation of the user's head or of the user's eyes; these measurements may be used as the pose of the reviewer.
  • One or more embodiments may use any device or sensor to measure any aspect of the pose of a reviewer.
  • one or more embodiments may provide input devices or user interface controls that allow a reviewer to select a pose manually.
  • a reviewer may use a joystick or mouse or other input device to move a viewpoint around a 3D model.
  • review station 2410a has reviewer pose subsystem 2412a, and review station 2410b has reviewer pose subsystem 2412b.
  • These reviewer pose subsystems may be any sensor or sensors, or any input device or input devices, or any user interface controls, that measure the pose of a reviewer or that accept input to control the viewpoint from which the 3D model is viewed.
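  • One possible representation of the pose information such a subsystem might report is sketched below; the field names and the fallback to manual, joystick-style input are assumptions for illustration, not part of the disclosure.
```python
from dataclasses import dataclass

@dataclass
class ReviewerPose:
    """Pose reported by a reviewer pose subsystem."""
    yaw: float = 0.0     # degrees, rotation about the vertical axis
    pitch: float = 0.0   # degrees, up/down tilt
    roll: float = 0.0    # degrees, head tilt
    x: float = 0.0       # optional position within the environment
    y: float = 0.0
    z: float = 0.0

class ManualPoseInput:
    """Fallback pose source driven by keyboard/joystick-style deltas."""
    def __init__(self):
        self.pose = ReviewerPose()

    def nudge(self, d_yaw=0.0, d_pitch=0.0):
        self.pose.yaw = (self.pose.yaw + d_yaw) % 360.0
        self.pose.pitch = max(-90.0, min(90.0, self.pose.pitch + d_pitch))
        return self.pose

# Example: a reviewer pans 30 degrees to the right and tilts down slightly.
source = ManualPoseInput()
print(source.nudge(d_yaw=30.0, d_pitch=-10.0))
```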
  • Review stations may generate one or more display images showing the 3D model from the viewpoint determined by the reviewer pose subsystem. Generating these images is the function of a viewpoint renderer. Using techniques known in the art, and techniques described above in this specification, viewpoint renderers may generate monoscopic or stereoscopic views of the 3D model from the viewpoint corresponding to the reviewer's pose. These views may be displayed on any number of display devices for each reviewer. Display devices may for example include computer monitors, as well as virtual reality headsets or other head-mounted displays. In Figure 24, viewpoint renderer 2413a in review station 2410a generates viewpoint images for display 2411a; viewpoint renderer 2413b in review station 2410b generates viewpoint images for display 2411b.
  • viewpoint renderers may be remote from the physical hardware of a review station, and viewpoint images may be sent for example over a network connection to the displays of the review station.
  • viewpoint renderers may use any combination of hardware and software to render viewpoint images.
  • multiple review stations may share viewpoint renderers or components thereof.
  • the information on the current viewpoint of one or more reviewers may be communicated to the coordinator station. This information assists the coordinator in coordinating the reviews. For example, a coordinator may be able to determine that all relevant viewpoints have been reviewed and checked. In addition, a coordinator may be able to communicate more effectively with a reviewer about the 3D model if the coordinator can observe the precise viewpoint from which the reviewer is viewing the 3D model.
  • each reviewer may have one or more focus regions within his or her viewpoint that define areas of interest to the reviewer. These focus regions may for example encompass the reviewer's entire field of view or they may be selected portions of the field of view.
  • a review station may include a reviewer focus subsystem that determines the focus region or focus regions for each reviewer and communicates this information to the coordinator station.
  • review station 2410a has reviewer focus subsystem 2414a, and review station 2410b has reviewer focus subsystem 2414b.
  • a reviewer focus subsystem may include any combination of hardware and software components that determine the region or regions of focus for the reviewer. Embodiments may use various approaches to determine the reviewer's focus regions.
  • the focus region for a reviewer may be presumed to be for example at the center of the field of view of the reviewer.
  • This example is illustrative; other embodiments may use different approaches to define focus regions, as described below.
  • the current focus region for review station 2410a is the central region 2415a at the center of the reviewer's field of view; similarly, the current focus region for review station 2410b is the central region 2415b at the center of the reviewer's field of view.
  • the coordinator station provides information that allows the coordinator to observe the entire review process from the collection of reviewers.
  • the coordinator station may generate one or more overview images of the 3D model that show the entire model.
  • the coordinator may for example focus on the model as a whole, while each reviewer focuses on the view of the model from various specific viewpoints.
  • an overview image may be a panoramic image of a complete spherical 3D model of a virtual reality environment. Any projection or image or collection of images may serve as overview images in one or more embodiments.
  • the overview images may be generated by a model overview renderer that has access to the 3D model.
  • An overview renderer may use any techniques known in the art, and techniques described in this specification, to generate one or more overview images of the 3D model.
  • Overview images may be monoscopic or stereoscopic.
  • overview images may be for example, without limitation, spherical projections, cylindrical projections, cubical projections, or any collection of planar projections.
  • model overview renderer 2422 accesses 3D model 2401 and creates overview image 2423 on the coordinator display 2421. This illustrative example shows only a single coordinator display; one or more embodiments may use any number and type of coordinator displays.
  • One or more embodiments may augment the overview images shown in the coordinator station with information depicting or describing the reviewer focus regions.
  • graphics showing the location of each reviewer focus region may be overlaid onto an overview image of the 3D model. This is illustrated in Figure 24, with graphic 2430a showing the location of Reviewer 1's focus region 2415a, and graphic 2430b showing the location of Reviewer 2's focus region 2415b.
  • These graphic overlays show the coordinator which parts of the 3D model each reviewer is currently observing.
  • color coding is used to differentiate the focus regions of the reviewers: Reviewer 1's focus region is shown in blue with graphic 2430a, and Reviewer 2's focus region is shown in red with graphic 2430b.
  • One or more embodiments may use any differentiating characteristic of graphics or text to identify the reviewer or reviewer station associated with each focus region. These characteristics may include for example, without limitation, color, size, shape, icon, pattern, texture, transparency, motion, or labeling. For example, in addition to or instead of using color coding, one or more embodiments may overlay the focus region with a label that shows the name of the reviewer or of the review station.
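  • The sketch below shows one hypothetical way to assign such differentiating characteristics on the coordinator display, pairing each reviewer with a color and a text label; the palette and the naming scheme are invented for the example and are not part of the disclosure.
```python
from itertools import cycle

# A small, fixed palette; a real system might let the coordinator choose colors.
PALETTE = ["blue", "red", "green", "orange", "purple"]

def assign_review_markers(reviewer_names):
    """Return a mapping from reviewer name to (color, label) for overlay graphics."""
    colors = cycle(PALETTE)
    return {name: (next(colors), name) for name in reviewer_names}

# Example legend: each reviewer gets a color-coded entry.
legend = assign_review_markers(["Reviewer 1", "Reviewer 2"])
for name, (color, label) in legend.items():
    print(f"{label}: shown in {color}")
```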
  • the graphics associated with the reviewer focus regions are overlaid onto a 3D model overview image.
  • these graphics may be shown only when and as requested by the coordinator.
  • a coordinator may have a selection list of reviewers, and may select one or more reviewers to enable display of the graphics associated with those users' focus regions. Any method of providing a coordinator with information that shows the focus regions of the reviewers is in keeping with the spirit of the invention.
  • Embodiments may define focus regions in any desired manner depending on the application for the multi-reviewer system.
  • Figure 25 illustrates an embodiment that displays the reviewers' entire fields of view as the focus regions.
  • Images 2411c, 2411d, and 2411e correspond to reviewer station displays, and image 2421a corresponds to a coordinator station display.
  • a spherical lat-long projection of a 3D scene is used as overview image 2421a.
  • the reviewer viewpoint images 2411c, 2411d, and 2411e are planar projections from selected viewpoints.
  • the entire field of view of each reviewer is displayed on the overview image 2421a as a dashed, color-coded outline.
  • the overview image also includes legend 2520 showing the color assigned to each reviewer.
  • reviewer Mo is looking left of center, and boundary 2511c shows his field of view;
  • reviewer Larry is looking right of center and is zoomed in on the window and picture, and boundary 2511d shows his field of view;
  • reviewer Curly is looking straight down, and boundary 2511e shows his field of view.
  • the lat-long projection of the overview image 2421a distorts the rectangular boundaries of the reviewer fields of view into curved boundaries.
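  • A minimal sketch of this projection follows: it maps a viewing direction (from a reviewer's yaw and pitch) to pixel coordinates on an equirectangular (lat-long) overview image and samples the border of a rectangular field of view, which is why straight boundaries appear curved. Treating the border angles directly as longitude/latitude is a simplification, and the equirectangular convention, field-of-view values, and image size are assumptions for illustration.
```python
import math

def direction_to_latlong_pixel(yaw_deg, pitch_deg, width, height):
    """Map a view direction to (x, y) pixel coordinates on a lat-long image.

    yaw_deg:   0 at the image center, increasing to the right.
    pitch_deg: 0 at the horizon, +90 straight up.
    """
    lon = math.radians(yaw_deg)      # longitude
    lat = math.radians(pitch_deg)    # latitude
    x = (lon / (2.0 * math.pi) + 0.5) * width
    y = (0.5 - lat / math.pi) * height
    return x % width, min(max(y, 0.0), height - 1)

def field_of_view_outline(yaw_deg, pitch_deg, hfov=90.0, vfov=60.0,
                          width=2048, height=1024, samples=16):
    """Sample the rectangular field-of-view border as lat-long pixel points."""
    points = []
    for i in range(samples):
        t = i / (samples - 1)
        edges = [(yaw_deg - hfov / 2 + t * hfov, pitch_deg + vfov / 2),   # top edge
                 (yaw_deg - hfov / 2 + t * hfov, pitch_deg - vfov / 2),   # bottom edge
                 (yaw_deg - hfov / 2, pitch_deg - vfov / 2 + t * vfov),   # left edge
                 (yaw_deg + hfov / 2, pitch_deg - vfov / 2 + t * vfov)]   # right edge
        points.extend(direction_to_latlong_pixel(yw, pt, width, height) for yw, pt in edges)
    return points

# Example: outline for a reviewer looking 40 degrees left of center.
print(field_of_view_outline(-40.0, 0.0)[:4])
```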
  • Figure 26 illustrates a variation of the embodiment of Figure 25 that uses central dots 2611c, 2611d, and 2611e instead of boundaries around the entire fields of view.
  • Each dot corresponds to the central region of the corresponding reviewer's field of view.
  • Dots are color-coded as before, and in addition in this example are partially transparent so that the coordinator can observe objects through the dots.
  • One or more embodiments may use any type of graphics to show any desired portion of a reviewer's field of view. For example, one or more embodiments may use different shapes instead of or in addition to different colors to differentiate between reviewers.
  • the size of the dots may be adjusted either by the system or by the coordinator.
  • One or more embodiments may combine the graphics of Figure 25 and Figure 26, to show for example the entire field of view with a boundary and the center of the field of view with a dot or other graphic.
  • One or more embodiments may show gridlines on the field of view for one or more reviewers.
  • reviewers may be able to designate specific areas within their field of view as focus regions. This capability may be useful for example if a reviewer wants to highlight a particular object to bring it to the attention of the coordinator. For example, a reviewer may notice that the 3D model has an anomaly or an inaccuracy at a particular location or a particular object. By highlighting this object or region the reviewer can notify the coordinator or other parties that this area requires attention.
  • Figure 27 illustrates an embodiment with this capability. Reviewer Larry, now viewing image 2411f, highlights the fern with selection box 2701. This creates an additional reviewer focus region that is communicated to the coordinator station.
  • the coordinator display 2421a also shows the corresponding highlighted fern with boundary 2702. This boundary is color-coded to show which reviewer made the selection.
  • reviewers may select any number of regions or objects and the system may communicate these as focus regions to the coordinator.
  • a reviewer may annotate selected regions or objects with notes, including possibly audio notes, and these notes may be communicated to the coordinator.
  • annotations from reviewers may be displayed on the coordinator overview images.
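  • One way the focus regions and annotations discussed above might be communicated from a review station to the coordinator station is sketched below as a simple JSON message; the field names, the "kind" values, and the transport are assumptions made for illustration, not part of the disclosure.
```python
import json
import time

def focus_region_message(reviewer, yaw_deg, pitch_deg, kind="field_of_view",
                         bounds=None, note=None):
    """Build a serializable message describing a reviewer focus region.

    kind:   "field_of_view", "center", or "selection" (hypothetical values).
    bounds: optional (x0, y0, x1, y1) box in the reviewer's viewpoint image.
    note:   optional text annotation attached by the reviewer.
    """
    return json.dumps({
        "reviewer": reviewer,
        "timestamp": time.time(),
        "pose": {"yaw": yaw_deg, "pitch": pitch_deg},
        "kind": kind,
        "bounds": bounds,
        "note": note,
    })

# Example: Larry highlights an object and attaches a note for the coordinator.
msg = focus_region_message("Larry", yaw_deg=25.0, pitch_deg=-5.0,
                           kind="selection", bounds=(410, 220, 560, 340),
                           note="Pillow should be larger")
print(msg)
```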
  • the coordinator may be able to modify the 3D model during the review process. In one or more embodiments some or all modifications may be performed after the review process, and the review process may be used to collect notes on changes that should be made or considered.
  • Figure 28 illustrates an embodiment that includes a model update capability in the coordinator station. In this illustrative example, reviewer Larry has highlighted a pillow on the couch with the annotation that it should be made larger. The selection and annotation 2801 appear on the reviewer display 2411g. A corresponding color-coded selection and annotation 2802 appear on the coordinator display 2421a. Reviewers may note any desired changes to a 3D model, including for example changes in object position, depth, size, color, orientation, texture, shadows, lighting, or motion. Reviewers may also note any observed artifacts that may for example arise during model creation.
  • the coordinator station has illustrative model update capabilities.
  • the model layers are shown in list 2810, and the coordinator may select a layer to modify. Modifications may include translations and rotations 2811, and scaling changes 2812.
  • These model update features are illustrative; embodiments may use any 3D modeling or editing techniques to provide coordinators or reviewers with the ability to make any desired changes to the 3D model.
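  • A hedged sketch of this kind of layer edit follows: each layer carries translation, rotation, and scale parameters, and applying an update modifies the selected layer so that re-rendered views reflect the change. The Layer class and its fields are invented for illustration and are not the disclosed data model.
```python
from dataclasses import dataclass, field

@dataclass
class Layer:
    """A hypothetical editable layer of the 3D model (e.g., the pillow)."""
    name: str
    translation: list = field(default_factory=lambda: [0.0, 0.0, 0.0])
    rotation: list = field(default_factory=lambda: [0.0, 0.0, 0.0])  # degrees
    scale: float = 1.0

def apply_layer_update(layers, layer_name, d_translation=(0, 0, 0),
                       d_rotation=(0, 0, 0), scale_factor=1.0):
    """Apply a coordinator edit to the named layer and return it."""
    layer = next(l for l in layers if l.name == layer_name)
    layer.translation = [t + d for t, d in zip(layer.translation, d_translation)]
    layer.rotation = [r + d for r, d in zip(layer.rotation, d_rotation)]
    layer.scale *= scale_factor
    return layer

# Example: double the size of the pillow layer, as in Figures 28-29.
model_layers = [Layer("couch"), Layer("pillow"), Layer("rocking chair")]
print(apply_layer_update(model_layers, "pillow", scale_factor=2.0))
```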
  • the coordinator selects the layer containing the pillow, chooses a scale update, and doubles the size 2812 of the pillow.
  • When the Apply Changes button 2813 is pressed, the modifications are made to the 3D model and are reflected in the coordinator and reviewer displays.
  • Figure 29 continues the example shown in Figure 28 and shows an illustrative display after the model update is made.
  • the size of the pillow is increased as shown in 2901 in the reviewer display 2411g and in 2902 in the coordinator display 2421a.
  • the coordinator station may include a capability to remove or otherwise process a reviewer focus region (along with possible annotations). For example, a coordinator may remove a region selected by a reviewer once a model update or correction for that region has been completed or scheduled.
  • the coordinator can right-click on the graphic 2802 showing the selected region, which brings up a pop-up menu 2903. This pop-up menu shows options for processing the focus region.
  • the coordinator selects "Complete" to indicate that the model update corresponding to this region has been completed.
  • Other illustrative options are shown, including Annotate to add further annotation, Dismiss to ignore the region without making an update, and Find Layer to locate the layer or layers within the boundary.
  • These options and the user interface illustrated in Figure 29 are illustrative; embodiments may use any user interface to provide coordinators or reviewers with capabilities to update models and process reviewer focus regions in any desired manner.
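  • The following sketch suggests how such menu options might be handled as simple status transitions on a stored focus region record; the status names mirror the options shown in Figure 29, while the data structure and return convention are assumptions for illustration only.
```python
def process_focus_region(region, action, note=None):
    """Apply a coordinator action to a reviewer focus region record.

    region: dict with at least "id", "reviewer", "status", and "annotations".
    action: "complete", "dismiss", "annotate", or "find_layer".
    Returns the updated region, or None if its graphics should be removed
    from the coordinator and reviewer displays.
    """
    if action == "complete":
        region["status"] = "complete"
        return None
    if action == "dismiss":
        region["status"] = "dismissed"
        return None
    if action == "annotate":
        region.setdefault("annotations", []).append(note or "")
        return region
    if action == "find_layer":
        region["status"] = "locating_layer"
        return region
    raise ValueError(f"unknown action: {action}")

# Example: the coordinator marks Larry's pillow region as complete.
pillow_region = {"id": 2802, "reviewer": "Larry", "status": "open", "annotations": []}
print(process_focus_region(pillow_region, "complete"))
```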
  • Figure 30 continues the example shown in Figure 29 and shows an illustrative display after the coordinator clicks the Complete option for the focus region.
  • the selection boxes and annotations for the pillow in the reviewer display 2411g and the coordinator display 2421a are removed.
  • the larger pillow in the 3D model after the update is shown in the reviewer display at 2901 and in the coordinator display at 2902.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Processing Or Creating Images (AREA)

Abstract

The invention relates to a system that coordinates multiple simultaneous reviews of a three-dimensional (3D) model, potentially from different viewpoints. Embodiments support multiple reviewers using review stations that render images of the model based on the reviewer's pose. For example, multiple reviewers may use virtual reality headsets to observe a 3D virtual environment from different orientations. A coordinator uses a coordinator station to observe the entire 3D model and the viewpoints of each of the reviewers within that 3D model. Embodiments may support the designation, by a reviewer, of regions in the model that require review or modification; these designated regions are also displayed on the coordinator station. Embodiments may support real-time updates of the 3D model and propagation of the updated images to the coordinator and the multiple reviewers.
PCT/US2016/047164 2015-08-17 2016-08-16 Système à multiples examinateurs de modèle 3d WO2017031113A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US14/827,482 2015-08-17
US14/827,482 US9282321B2 (en) 2011-02-17 2015-08-17 3D model multi-reviewer system

Publications (1)

Publication Number Publication Date
WO2017031113A1 true WO2017031113A1 (fr) 2017-02-23

Family

ID=58051401

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2016/047164 WO2017031113A1 (fr) 2015-08-17 2016-08-16 Système à multiples examinateurs de modèle 3d

Country Status (1)

Country Link
WO (1) WO2017031113A1 (fr)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6363170B1 (en) * 1998-04-30 2002-03-26 Wisconsin Alumni Research Foundation Photorealistic scene reconstruction by voxel coloring
US6426750B1 (en) * 1998-07-14 2002-07-30 Microsoft Corporation Run-time geomorphs
US20070238981A1 (en) * 2006-03-13 2007-10-11 Bracco Imaging Spa Methods and apparatuses for recording and reviewing surgical navigation processes
US20070236514A1 (en) * 2006-03-29 2007-10-11 Bracco Imaging Spa Methods and Apparatuses for Stereoscopic Image Guided Surgical Navigation
US20130234934A1 (en) * 2010-12-22 2013-09-12 Zspace, Inc. Three-Dimensional Collaboration

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024131479A1 (fr) * 2022-12-21 2024-06-27 腾讯科技(深圳)有限公司 Procédé et appareil d'affichage d'environnement virtuel, dispositif électronique pouvant être porté sur soi et support de stockage

Similar Documents

Publication Publication Date Title
US9282321B2 (en) 3D model multi-reviewer system
US9288476B2 (en) System and method for real-time depth modification of stereo images of a virtual reality environment
US9407904B2 (en) Method for creating 3D virtual reality from 2D images
US7643025B2 (en) Method and apparatus for applying stereoscopic imagery to three-dimensionally defined substrates
EP2546806B1 (fr) Rendu à base d'image pour ar - permettant à l'utilisateur la génération d'un contenu en 3d
US20120182403A1 (en) Stereoscopic imaging
US20110216160A1 (en) System and method for creating pseudo holographic displays on viewer position aware devices
US20110157155A1 (en) Layer management system for choreographing stereoscopic depth
US20020158873A1 (en) Real-time virtual viewpoint in simulated reality environment
US8666146B1 (en) Discontinuous warping for 2D-to-3D conversions
CN109791704B (zh) 用于自由运行fvv应用的基于多层uv映射的纹理渲染方法、系统和装置
CA2540538C (fr) Imagerie stereoscopique
WO2017031113A1 (fr) Système à multiples examinateurs de modèle 3d
WO2017031117A1 (fr) Système et procédé pour la modification de profondeur en temps réel d'images stéréo d'un environnement de réalité virtuelle
JPH03138784A (ja) 3次元モデルの再構成方法および表示方法
Andersen et al. An AR-guided system for fast image-based modeling of indoor scenes
JP3784104B2 (ja) 画像合成装置
EP2962290B1 (fr) Transmission d'informations 3d par simulation de profondeur à l'aide de déplacement de pixel 2d
CN103530869A (zh) 用于匹配移动质量控制的系统和方法
De Sorbier et al. Depth camera based system for auto-stereoscopic displays
Stavrakis et al. Interactive tools for image-based stereoscopic artwork
Chen et al. Augmented stereo panoramas
Thatte et al. Real-World Virtual Reality With Head-Motion Parallax
Ishikawa et al. Interactive indoor 3D modeling from a single photo with CV support
KR20130137356A (ko) 3d 영상의 뎁스 저작 도구 및 그 저작 방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16837695

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16837695

Country of ref document: EP

Kind code of ref document: A1