US20210125361A1 - Systems and methods for stereoscopic imaging - Google Patents

Systems and methods for stereoscopic imaging Download PDF

Info

Publication number
US20210125361A1
US20210125361A1 US17/143,849 US202117143849A US2021125361A1 US 20210125361 A1 US20210125361 A1 US 20210125361A1 US 202117143849 A US202117143849 A US 202117143849A US 2021125361 A1 US2021125361 A1 US 2021125361A1
Authority
US
United States
Prior art keywords
point
image
matching
block
candidate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US17/143,849
Inventor
Kang Yang
Mingming GAO
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SZ DJI Technology Co Ltd
Original Assignee
SZ DJI Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SZ DJI Technology Co Ltd filed Critical SZ DJI Technology Co Ltd
Priority to US17/143,849 priority Critical patent/US20210125361A1/en
Assigned to SZ DJI Technology Co., Ltd. reassignment SZ DJI Technology Co., Ltd. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: GAO, Mingming, YANG, KANG
Publication of US20210125361A1 publication Critical patent/US20210125361A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/55Depth or shape recovery from multiple images
    • G06T7/593Depth or shape recovery from multiple images from stereo images
    • G06K9/6202
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • H04N13/128Adjusting depth or disparity
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20Image signal generators
    • H04N13/204Image signal generators using stereoscopic image cameras
    • H04N13/239Image signal generators using stereoscopic image cameras using two 2D image sensors having a relative position equal to or related to the interocular distance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • G06T2207/10012Stereo images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20021Dividing image into blocks, subimages or windows
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N2013/0074Stereoscopic image analysis
    • H04N2013/0081Depth or disparity estimation from stereoscopic image signals

Definitions

  • the disclosed embodiments relate generally to computer vision and more particularly, but not exclusively, to systems and methods for stereoscopic imaging.
  • High-precision depth mapping is important for applications such as automatic obstacle avoidance in robotics and mobile platforms, geographic mapping, and space exploration.
  • Stereoscopic imaging is a technique for providing depth information using binocular vision. Such a technique can be used for constructing a depth map from two or more images by matching features in the images.
  • Existing techniques for matching features often have mismatching problems, especially when encountering complex images with significant depth variation or smooth texture.
  • a method of block matching between a first image and a second image for stereoscopic imaging comprising: selecting a block configuration from a plurality of block configurations; and matching a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • a stereoscopic imaging system comprising: a first imaging device configured to obtain a first image; a second imaging device configured to obtain a second image; and one or more processors configured to: select a block configuration from a plurality of block configurations; and match a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • an apparatus for stereoscopic imaging comprising one or more processors configured to: receive a first image; receive a second image; select a block configuration from a plurality of block configurations; and match a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • a computer readable storage medium comprising: instruction for selecting a block configuration from a plurality of block configurations for stereoscopic imaging; and instruction for matching a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • a processing system comprising: a receiving module for receiving a first image obtained by an first imaging device and a second image obtained by a second imaging device; a selecting module for selecting a block configuration from a plurality of block configurations for stereoscopic imaging; and a matching module for matching a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • FIG. 1 is an exemplary diagram illustrating an embodiment of a system for stereoscopic imaging.
  • FIG. 2 is an exemplary diagram illustrating an embodiment of block matching using the system of FIG. 1 .
  • FIG. 3 is an exemplary diagram illustrating an embodiment of a stereoscopic imaging apparatus of the system of FIG. 1 .
  • FIG. 4 is an exemplary top-level flow chart illustrating an embodiment of a method for block matching using the system of FIG. 1 .
  • FIG. 5 is an exemplary diagram illustrating an embodiment of a method for triangulation of distance using the system of FIG. 1 .
  • FIG. 6 is an exemplary diagram illustrating another embodiment of a method for triangulation of distance using the system of FIG. 1 .
  • FIG. 7 is an exemplary diagram illustrating an embodiment of block matching using the system of FIG. 1 .
  • FIG. 8 is an exemplary diagram illustrating an embodiment of the system of FIG. 1 , wherein an image pre-processor is shown.
  • FIG. 9 is an exemplary diagram illustrating an embodiment of a method for rectifying images using the image pre-processor of FIG. 8 .
  • FIG. 10 is an exemplary diagram illustrating an embodiment of the system of FIG. 1 , wherein a disparity validator is shown.
  • FIG. 11 is an exemplary diagram illustrating an embodiment of the system of FIG. 1 , wherein block matching using matching costs are shown.
  • FIG. 12 is an exemplary diagram illustrating an embodiment of a method for identifying corresponding points using the system of FIG. 1 .
  • FIG. 13 is an exemplary flow chart illustrating an embodiment of a method for identifying corresponding points using the system of FIG. 1 .
  • FIG. 14 is an exemplary diagram illustrating an embodiment of the system of FIG. 1 , wherein matching costs are combined to identify a candidate point.
  • FIG. 15 is an exemplary diagram illustrating an embodiment of the system of FIG. 1 , wherein candidate disparities are found for determining a binocular disparity.
  • FIG. 16 is an exemplary diagram illustrating another embodiment of the system of FIG. 1 , wherein candidate disparities are validated using a disparity validator.
  • FIG. 17 is an exemplary top level block diagram illustrating modules of a processing system for block matching.
  • FIG. 18 is exemplary diagram illustrating an embodiment of the system of FIG. 1 , wherein the system is an unmanned aerial vehicle (UAV).
  • UAV unmanned aerial vehicle
  • the present systems and methods are illustrated with respect to an exemplary stereoscopic imaging system 100 in FIG. 1 .
  • the stereoscopic imaging system 100 can include a first imaging device 201 and a second imaging device 202 for capturing a first image 221 and a second image 222 , respectively.
  • the stereoscopic imaging system 100 is shown to include two imaging devices 201 , 202 for illustrative purposes only, the present systems and methods can use any number of imaging devices (for example, one, two, three, four, five, six, or more imaging devices) for stereoscopic imaging, as desired.
  • two or more pairs of imaging devices can be used for enhanced stereoscopic imaging, as desired.
  • the imaging devices 201 , 202 can receive incident light from an external environment and convert the incident light into digital and/or analog signals (not shown). The signals can be processed to produce the images 221 , 222 that can be displayed to a user.
  • Exemplary imaging devices 201 , 202 suitable for use with the present systems and methods include, but are not limited to, commercially-available cameras and camcorders.
  • the imaging devices 201 , 202 can be relatively positioned in a selected manner, so the images 221 , 222 can perceive an object (shown in FIG. 2 ) from different perspectives.
  • the imaging devices 201 , 202 can be positioned in any desired manner in the stereoscopic imaging system 100 .
  • the imaging devices 201 , 202 can be positioned side-by-side so that the imaging devices 201 , 202 have parallel optical axes.
  • the stereoscopic imaging system 100 can include a stereoscopic imaging apparatus 300 for receiving the images 221 , 222 from the imaging devices 201 , 202 , respectively.
  • the stereoscopic imaging apparatus 300 can obtain depth information of the object based on the images 221 , 222 .
  • Exemplary depth information can include an object distance from the object to the first imaging device 201 and/or the second imaging device 202 .
  • the stereoscopic imaging apparatus 300 can obtain the depth information by matching the first image 221 with the second image 222 using block matching. As shown in FIG. 1 , the stereoscopic imaging apparatus 300 can select a plurality of block configurations 310 for block matching. Each block configuration 310 can include one or more configuration parameters for defining one or more blocks 341 , 342 (shown in FIG. 2 ) on each of the first image 221 and the second image 222 , respectively. Exemplary configuration parameters can include size (expressed by number of pixels, for example), shape, or a combination thereof.
  • a block configuration calculator 320 can be implemented based on a corresponding block configuration 310 . That is, the block configuration calculators 320 A, 320 B, . . . , 320 N can be implemented based on block configurations 310 A, 320 B, . . . , 310 N, respectively.
  • the block configuration calculator 320 can perform any calculation and/or operations used for evaluating matching between the blocks 341 , 342 .
  • evaluating matching can include calculating a matching cost c.
  • the matching cost c can quantify a similarity between pixels in the blocks 341 , 342 . Any suitable comparison method between pixels of two blocks can be used to find the matching cost c. Exemplary methods for finding the matching cost c include comparison of pixel values using correlation, covariance, sum of absolute difference, sum of squared differences, and the like.
  • results generated by the block configuration calculators 320 can be inputted to a cross-block-configuration calculator 330 to compare and contrast matching results using different block configurations 310 .
  • exemplary first and second images 221 , 222 of an object 230 are shown for block matching using an exemplary block configuration 310 that is circular.
  • a first block 341 that includes a first point 241 is defined on the first image 221 .
  • a second block 342 that includes the second point 242 can be defined.
  • Each point 241 , 242 can include one or more adjacent pixels representing a part of the image.
  • each point can be a single pixel.
  • each point can include a feature (for example, a set of pixels representing an object) represented by a plurality of pixels.
  • FIG. 2 shows the block configuration 310 as having a circular shape for illustrative purposes only, the block configurations 310 can have any uniform and/or different shapes. At least one of the block configurations 310 can have a non-rectangular shape. Exemplary block configurations 310 can have a triangular shape, a circular shape, an oval shape, a trapezoidal shape, a polygonal shape, or any combination thereof. The shapes can be regular or irregular, as desired. Additionally and/or alternatively, the block configurations 310 can have any uniform and/or different sizes. Exemplary dimensions include, for example, 7 pixels, 9 pixels, 13 pixels, 20 pixels, 25 pixels, or more. In one example, the block configuration 310 can have a square shape having a size of 7 pixels by 7 pixels, 9 pixels by 9 pixels, 13 pixels by 13 pixels, or 25 pixels by 25 pixels.
  • the exemplary stereoscopic imaging apparatus 300 is shown as including processor 302 and a memory 304 .
  • the processor 302 can be any type of processing system for implementing block matching functions and/or other operations described in the present disclosure.
  • Exemplary processors 302 include, without limitation, one or more general purpose microprocessors (for example, single or multi-core processors), application-specific integrated circuits (ASICs), application-specific instruction-set processors, graphics processing units (GPUs), physics processing units, digital signal processing units, coprocessors, network processing units, audio processing units, encryption processing units, and the like.
  • the processor 302 can include specialized hardware and/or software for performing various block matching functions and operations described herein.
  • the processor 302 can include an image processing engine or media processing unit, which can include specialized hardware for enhancing the speed and efficiency of certain operations for image processing and block matching.
  • Specialized hardware can include, but are not limited to, specialized parallel processors, caches, high speed buses, and the like.
  • the processor 302 can include field-programmable gate arrays (FPGAs), ASICs, central processing units (CPUs), GPUs, visual processing units (VPUs), or a combination thereof.
  • FIG. 3 shows the stereoscopic imaging apparatus 300 as including one processor 302 for illustrative purposes only, the stereoscopic imaging apparatus 300 can include any number of uniform and/or different processors 302 .
  • the memory 304 can comprise any type of memory and can be, for example, a random-access memory (RAM), a static RAM, a dynamic RAM, a read-only memory (ROM), a programmable ROM, an erasable programmable ROM, an electrically erasable programmable ROM, a flash memory, a secure digital (SD) card, and the like.
  • the memory 304 has a storage capacity that accommodates the needs of the block matching functions and operations described herein.
  • the memory 304 can have any commercially-available memory capacity suitable for use in image processing applications and, in some embodiments, has a storage capacity of at least 512 Megabytes, 1 Gigabyte, 2 Gigabytes, 4 Gigabytes, 16 Gigabytes, 32 Gigabytes, 64 Gigabytes, or more.
  • FIG. 3 shows the stereoscopic imaging apparatus 300 as including one memory 304 for illustrative purposes only, the stereoscopic imaging apparatus 300 can include any number of uniform and/or different memories 304 .
  • the stereoscopic imaging apparatus 300 can further include any hardware and/or software desired for performing the block matching functions and operations described herein.
  • the stereoscopic imaging apparatus 300 can include one or more input/output interfaces (not shown) for receiving the images 221 , 222 shown in FIG. 1 .
  • Exemplary interfaces include, but are not limited to, universal serial bus (USB), digital visual interface (DVI), display port, serial ATA (SATA), IEEE 1394 interface (also known as FireWire), serial, video graphics array (VGA), super video graphics array (SVGA), small computer system interface (SCSI), high-definition multimedia interface (HDMI), audio ports, and/or proprietary input/output interfaces.
  • USB universal serial bus
  • DVI digital visual interface
  • SATA serial ATA
  • IEEE 1394 interface also known as FireWire
  • serial video graphics array
  • SVGA super video graphics array
  • SCSI small computer system interface
  • HDMI high-definition multimedia interface
  • audio ports and/or proprietary input/output interfaces.
  • the stereoscopic imaging apparatus 300 can include one or more input/output devices (not shown), for example, buttons, a keyboard, keypad, trackball, displays, and/or a monitor.
  • the stereoscopic imaging apparatus 300 can include hardware for communication between components of the image processing system 100 (for example, between the processor 302 and the memory 304 ).
  • the first image 221 can be obtained, at 401 .
  • the second image 221 can be obtained, at 402 .
  • the stereoscopic imaging apparatus 300 can receive the images 221 , 222 respectively obtained from the imaging devices 201 , 202 shown in FIG. 2 .
  • the stereoscopic imaging apparatus 300 can obtain the images 221 , 222 stored on another computer system (not shown), and the stereoscopic imaging apparatus 300 is not necessarily in communication with the imaging devices 201 , 202 .
  • a block configuration 310 is selected from a plurality of block configurations 310 .
  • the block configuration 310 can be selected from a library of block configurations 310 .
  • the library of block configurations 310 can be predefined by an operator, as desired.
  • the block configuration 310 can be selected based on a texture of the object 230 (shown in FIG. 2 ). In the example, when the object 230 has a smooth texture (such as a monochromatic wall or smooth glass), the block configurations 310 having a small size may provide limited information and result in mismatching. Thus, a block configurations 310 having a large size can be selected to provide more information for block matching.
  • a block configuration 310 having a large size can lead to poor overlapping (or large amount of mismatching features) between the images 221 , 222 . Such poor overlapping can result in a very low degree of matching. Thus, a block configuration 310 having a small size can be selected. In still another example, a block configuration 310 having a shape and/or size matching shape of the object 230 or a portion of the object 230 can be selected, for the block configuration 310 to include a selected texture for matching.
  • the stereoscopic imaging apparatus 300 can select one or more predetermined block configurations 310 .
  • an operator can select block configurations 310 with a variety of shapes and/or sizes, with or without consideration of content of the images 221 , 222 .
  • the stereoscopic imaging apparatus 300 can select the block configuration 310 dynamically.
  • block configurations 310 can be selected based on content of the images 221 , 222 , such as based on the shape and/or size of object 230 in the images 221 , 222 .
  • the stereoscopic imaging apparatus 300 can perform one or more image feature recognition and/or feature detection methods to determine texture of the object 230 in the images 221 , 222 .
  • Exemplary feature recognition methods can include edge detection (for example, Shi & Tomasi corner detection), image binarization, SURF blob (Speeded Up Robust Features) detection, MSER blob (Maximally Stable Extremal Regions) detection, SIFT (Scale-Invariant Feature Transform) descriptors, FREAK (Fast REtinA Keypoint) descriptors, BRISK (Binary Robust Invariant Scalable Keypoints) descriptors, HOG (Histogram of Oriented Gradients) descriptors, and the like.
  • edge detection for example, Shi & Tomasi corner detection
  • image binarization for example, image binarization, SURF blob (Speeded Up Robust Features) detection, MSER blob (Maximally Stable Extremal Regions) detection, SIFT (Scale-Invariant Feature Transform) descriptors, FREAK (Fast REtinA Keypoint) de
  • the stereoscopic imaging apparatus 300 can select the block configuration 310 randomly. Randomizing the selection of the block configuration 310 can reduce matching error associated with using a fixed set of block configuration 310 for a specific type of texture, for example.
  • the first point 241 on the first image 221 is matched with the corresponding second point 242 on the second image 222 using the block configurations 310 .
  • the second point 242 that corresponds to the first point 241 can be identified based on the matching.
  • the method 400 can advantageously improve accuracy of block matching.
  • different size and/or shape of the block configuration 310 can result in different accuracy of block matching.
  • the block configuration 310 may not be able to adapt to all types of complex scenes or objects.
  • Using a plurality of block configurations 310 can ensure that one or more block configurations 310 can be suitable for achieving accurate matching and reducing probability of mismatching.
  • Using the plurality of block configurations 310 can thus compensate for limitations or inaccuracies associated with single block configuration 310 , thereby improving the accuracy and reliability of a block matching result.
  • FIG. 5 and FIG. 6 are shown to illustrate obtaining the depth information using the images 221 , 222 .
  • FIG. 5 shows the imaging devices 201 , 202 imaging the object 230 using stereopsis.
  • FIG. 6 illustrates a method of ascertaining an object distance D using triangulation with reference to the two imaging devices 201 , 202 .
  • Each of the imaging devices 201 , 202 perceives a same point 220 on the object 230 , but in different spatial coordinates as illustrated by the coordinate axes (x1, y1, z1) and (x2, y2, z2).
  • the imaging devices 201 , 202 perceive the point 220 along respective optical axes 261 , 262 and thereby arrive at two different images 221 , 222 of the point 220 .
  • the point 220 is captured in the images 221 , 222 as the first and second points 241 , 242 (shown in FIG. 2 ), respectively.
  • the images 221 , 222 can be two-dimensional.
  • the images 221 , 222 are typically different, being taken from different positions, unless the imaging devices 201 , 202 are positioned such that their optical axes 261 , 262 coincide. Accordingly, under most circumstances, a disparity d shown in Equation (4) can be found between the images 221 , 222 , as described below with reference to FIG. 6 .
  • the images 221 , 222 can be compared to ascertain the object distance D between the pair of imaging devices 201 , 202 and the point 220 .
  • a method of triangulation can be used to ascertain the object distance D using the disparity d shown in Equation (4).
  • the position of the point 220 having an index i, represented by coordinates (x i , y i , z i ) can be given as follows:
  • the stereoscopic imaging apparatus 300 shown in FIG. 1 can determine the coordinate z i .
  • the object distance D can be equal to and/or based on the coordinate z i .
  • FIG. 7 another illustrative example of matching points between two images is shown.
  • a matching process can be used to select a second point 242 on the second image 222 that corresponds to the first point 241 .
  • a plurality of candidate second points 282 a . . . 282 m can be selected for evaluation of a matching cost c for determining the suitability of matching.
  • any suitable comparison method between pixels of two blocks can be used to find the matching cost c.
  • Exemplary methods for finding the matching cost c include comparison of pixel values using correlation, covariance, sum of absolute difference, sum of squared differences, and the like.
  • the second point 242 can be selected from among the candidate second points 282 a . . . 282 m.
  • the candidate second points 282 a . . . 282 m for matching can be selected in any manner.
  • the candidate second points 282 a . . . 282 m can be selected in a predefined neighborhood about the coordinates of the first point 241 .
  • candidate second points 282 a . . . 282 m can be evaluated that are within a defined range that is 2 pixels, 4 pixels, 8 pixels, 16 pixels, 32 pixels, 64 pixels, 128 pixels, 256 pixels, or more of the coordinates of the first point 241 .
  • all pixels of the second image 222 can be evaluated.
  • all of the pixels within the defined range can be evaluated for the matching cost c.
  • pixels within the defined range can be sampled (for example, randomly or by interval sampling) for the matching cost c.
  • an exemplary stereoscopic imaging system 100 can include an image pre-processor 350 for pre-processing images 221 , 222 .
  • the image pre-processor 350 can execute any image transformation operations for projecting the images 221 , 222 onto a common image plane defined in a common coordinate system (shown in FIG. 9 ).
  • the image pre-processor 350 can be used for rectifying the images 221 , 222 .
  • the image pre-processor 350 can transform the images 221 , 222 so that the coordinate axes (x1, y1, z1) and (x2, y2, z2) are aligned.
  • the image pre-processor 350 can perform rotation to align the x1-axis with the x2-axis and/or align the y1-axis with the y2-axis. Additionally and/or alternatively, the image pre-processor 350 can perform scaling transformation to eliminate scaling difference between the images 221 , 222 . Additionally and/or alternatively, the image pre-processor 350 can perform rotation and/or skew adjustments in z direction to align the z1-axis with the z2-axis. By pre-processing the images 221 , 222 images, the pre-processor 350 can simplify subsequent block matching between the images 221 , 222 , advantageously improving matching efficiency.
  • the image pre-processor 350 can be configured to perform any image processing operations to improve quality of the images 221 , 222 . Subsequent block matching can advantageously be more easily performed. Exemplary image processing operations include, but are not limited to, image de-noising, image sharpening, and/or image de-blurring.
  • FIG. 8 shows the image pre-processor 350 and the stereoscopic imaging apparatus 300 as being individual units, the image pre-processor 350 and the stereoscopic imaging apparatus 300 can be at least partially integrated, without limitation.
  • Rectification advantageously aligns the images 221 , 222 along the same y-axis and z-axis, such that the images 221 , 22 are offset only along the x-axis.
  • the second point 242 can have a same y-axis coordinate as the first point 241 .
  • the candidate second points 282 can be chosen in the second image 222 along a horizontal line 226 that passes the first point 241 . Searching of the candidate second points 282 can thus be one-dimensional. Computation for the searching is advantageously simplified.
  • the image pre-processor 350 can transmit the rectified images 221 , 222 to the stereoscopic imaging apparatus 300 . Subsequently, a block configuration 310 can be selected from a plurality of block configurations. Points 241 , 242 on the rectified images 221 , 222 can be matched using the selected block configuration 310 using the present systems and methods.
  • the stereoscopic imaging apparatus 300 can include a disparity validator 360 .
  • the disparity validator 360 can perform any calculation operations used for evaluating accuracy of the disparity d using a predetermined criteria. When accuracy of the disparity d does not satisfy the predetermined criteria, the disparity validator 360 can determine that the disparity d is invalid.
  • FIG. 10 shows the disparity validator 360 and the cross-block-configuration calculator 330 as being individual units, the disparity validator 360 and the cross-block-configuration calculator 330 can be at least partially integrated, without limitation.
  • Each block configuration calculator 320 can generate an individual matching cost MC ij between the first point 241 and a selected candidate second point 282 shown in FIG. 7 .
  • i is an index over the block configurations 310
  • j is an index over the candidate second points 282 .
  • 320 N can calculate individual matching cost MC Aj , MC Bj , . . . , MC Nj , respectively.
  • the individual matching cost MC ij can be provided to the cross-block-configuration calculator 330 to be combined for identifying the second point 242 corresponding to the first point 241 .
  • the first block 341 and the second block 342 are shown to include the first point 241 and the candidate second point 282 a .
  • the first block 341 and the second block 342 can be based on the block configuration 310 A.
  • the block configuration 310 A can have a triangular shape shown in FIG. 11 for illustrative purposes only.
  • the individual matching cost MC Aj between the first point 241 and the candidate second point 282 a can thus be calculated using the blocks 341 , 342 based on the block configuration 310 A.
  • the first block 341 of pixels around the first point 241 of the first image 221 can be chosen, at 510 , based on the block configuration 310 .
  • the block configuration 310 can include any one of the block configurations 310 A, 310 B, . . . , 310 N shown in FIG. 11 .
  • a second block 342 of pixels around the candidate second point 282 of the second image 222 can be chosen based on the block configuration 310 .
  • the candidate second point 282 can include any one of the candidate second points 282 a , 282 b , . . . , 282 m shown in FIG. 12 .
  • the individual matching cost MC ij for the block configuration 310 between the first point 241 of the first image 221 and the selected candidate second point 282 of the second image 222 can be determined, at 530 .
  • the method 500 can be performed for each candidate second point 282 based on each block configuration 310 .
  • the individual cost MC ij is shown to be combined via the cross-block-configuration calculator 330 (shown in FIG. 11 ) to identify the second point 242 among the candidate second points 282 .
  • the individual matching costs MC ij can include MC Aa , MC Ba , . . . , MC Na .
  • the individual matching costs MC ij can include MC Am , MC Bm , . . . , MC Nm .
  • the corresponding individual matching cost MC ij can be combined to generate the matching cost MG.
  • the individual matching costs MC Aa , MC Ba , . . . , MC Na can be combined to generate matching cost MC a .
  • the sub-individual matching costs MC Am , MC Bm , . . . , MC Nm can be combined to generate matching cost MC m .
  • the matching cost MC j of the corresponding second point 282 can be generated in any suitable manner.
  • the matching cost MC j can be a function ⁇ of the individual matching costs MC ij of the corresponding candidate second point 282 :
  • MC j ⁇ (MC Aj ,MC Bj , . . . ,MC Nj ). Equation (5)
  • the matching cost MC j can be determined by selecting one of the individual matching costs MC Aj , MC Bj , . . . , MC Nj .
  • the matching cost MC j can be determined by selecting the greatest or best one of the individual matching costs MC Aj , MC Bj , . . . , MC Nj , that is, by selecting the individual matching cost that indicates the best matching.
  • the MC j can be determined by weighing the individual matching costs MC Aj , MC Bj , . . . , MC Nj , as expressed as follows for illustrative purposes:
  • w i is a weight of the ith block configuration 310 .
  • the w i can be pre-determined based on accuracy of the ith block configuration 310 for all images or for the specific type of content of images 221 , 222 shown in FIG. 12 .
  • the matching costs MC j of the candidate second points 282 can be compared to identify a candidate second point 282 .
  • the second point 242 having the greatest matching cost MC j can be selected as the matching second point 242 .
  • the matching cost MC j of each candidate second point 282 can be compared with a threshold matching cost. When the matching cost MC j indicates a better match than the threshold matching cost, the corresponding candidate second point 282 can be selected as the matching second point 242 .
  • a plurality of individual matching costs MC ij can be obtained for each of the candidate second point 282 . Problems of mismatch and/or poor match for a single block configuration 310 can thereby be prevented.
  • the identified second point 242 can have the best matching cost that is provided by the block configuration 310 that is optimally adapted to the scene of the images 221 , 222 .
  • each block configuration calculator 320 can generate a candidate disparity d CA .
  • Each candidate disparity d CA can be obtained using the second point 242 (shown in FIG. 2 ) that is identified based on the block configuration 310 .
  • a second point 242 can be identified among the candidate second points 282 .
  • the candidate disparity d CA can thus be obtained based on the second point 242 .
  • the candidate disparities d A , d B , . . . , d N shown in FIG. 15 can be obtained based on the second points 242 respectively identified using the block configuration 310 A, 310 B, . . . , 310 N.
  • the block configuration calculator 320 can identify the corresponding second point 242 in any suitable manner. For example, individual matching cost MC ij can be calculated for each candidate second point 282 based on the ith block configuration 310 . The candidate second point 282 associated with the best individual matching cost MC ij can be selected as the second point 242 for the ith block configuration 310 .
  • the candidate disparities d CA are inputted to the cross-block-configuration calculator 330 to be combined for identifying the disparity d.
  • FIG. 16 an illustrative example of combining the candidate disparities d CA is shown.
  • the candidate disparities d CA can be validated in any manner.
  • the candidate disparities d CA can be validated via the disparity validator 360 .
  • Validation of the candidate disparities d CA of a selected block configuration 310 shown in FIG. 15 can include, for example, comparing the threshold matching cost with the individual matching cost MC ij of the second point 242 identified using the selected block configuration 310 . If the individual matching cost MC ij of the identified second point 242 is worse than the threshold matching cost, the candidate disparities d CA associated with such a second point 242 is not valid.
  • a candidate disparity d CA associated with the block configuration 310 having the greatest size can be validated first.
  • the d CA can be identified as the disparity d.
  • a block configuration 310 having a large size can provide great amount of information for matching and thus allow block matching at high accuracy.
  • starting validation from the block configuration 310 having the greatest size can advantageously identify the disparity d accurately at early stage of the validation.
  • a candidate disparity d CA associated with a different block configuration 310 can be validated.
  • the candidate disparity d CA of the block configuration 310 having the greatest size can be invalid due to poor matching resulting from significant depth variation of imaging.
  • the block configuration 310 having the same size but a different shape can be validated.
  • the block configuration 310 having a smaller size with a different and/or same shape can be validated.
  • a probability of successful block matching can be improved and failed block matching can be prevented.
  • the validation can be implemented until at least one candidate disparity d CA is identified. Additionally and/or alternatively, the validation can be implemented on all candidate disparities d CA . In some cases, one or more candidate disparities d CA can be determined as valid, and each of the valid candidate disparities d CA can be identified as the disparity d. The valid candidate disparities d CA can be combined to form the disparity d, for example, via a filtering technique such as mean filtering and/or median filtering.
  • an exemplary processing system 600 is shown as including one or more modules to perform any of the disclosed methods.
  • the processing system 600 is shown as including a receiving module 610 , a selecting module 620 , and a matching module 630 .
  • the receiving module 610 can be configured to obtain a first image 221 and a second image 222 (both shown in FIG. 1 ).
  • the receiving module 610 can receive the images 221 , 222 respectively obtained by a first imaging device 201 and a second imaging device 202 (both shown in FIG. 1 ).
  • the selecting module 620 can be configured to select a plurality of block configurations 310 .
  • the matching module 630 can be configured to match the first point 241 on the first image 221 with the corresponding second point 242 on the second image 222 using the block configurations 310 , as shown in FIG. 2 .
  • the block configurations 310 can have different sizes. In some embodiments, at least one block configuration 310 can have a size of 7 pixels by 7 pixels, 9 pixels by 9 pixels, 13 pixels by 13 pixels, or 25 pixels by 25 pixels. In some embodiments, the block configurations 310 can have different shapes. In some embodiments, at least one of the block configurations 310 can have a non-rectangular shape. In some embodiments, the block configurations 310 can include a block configuration 310 with a circular shape, a block configuration with a triangular shape, a block configuration with a trapezoidal shape, or a combination thereof.
  • the selecting module 620 can be configured to select one or more pre-determined block configurations 310 . In some embodiments, the selecting module 620 can be configured to dynamically select the block configurations 310 . In some embodiments, the selecting module 620 can be configured to randomly select the block configurations 310 .
  • the matching module 630 can be configured to determine a matching cost c between the first point 241 on the first image 221 and each of a plurality of candidate second points 282 (shown in FIG. 7 ) of the second image 222 to identify the corresponding second point 242 . In some embodiments, the matching module 630 can be configured to determine a matching cost between the first point 241 on the first image 221 and each of a plurality of candidate second points 282 on a line of the second image 222 to identify the corresponding second point 242 . In some embodiments, the matching module 630 can be configured to determine the matching cost based on individual matching costs c for each of the block configurations 310 .
  • the matching module 630 can be configured to determine the matching cost by selecting one of the individual matching costs c. In some embodiments, the matching module 630 can be configured to determine the matching cost by weighing the individual matching costs c. In some embodiments, the matching module 630 can be configured to determine the matching cost using a predetermined weight for each block configuration 310 .
  • the matching module 630 can be configured to determine the individual matching costs for each block configuration 310 of the block configurations 310 by choosing a first block 341 of pixels around the first point 241 of the first image 221 based on the block configuration 310 , choosing a second block 342 of pixels around a selected candidate second point 282 of the second image 222 based on the block configuration 310 , and determining the individual matching cost for the block configuration 310 between the first point 241 of the first image 221 and the selected candidate second point 282 of the second image 222 .
  • the matching module 630 can be configured to determine an object distance D based on a disparity d between the first point 241 of the first image 221 and the corresponding second point 242 of the second image 222 .
  • the matching module 630 can be configured to determine individual matching costs between the first point on the first image and each of a plurality of candidate second points on the second image based on each of the block configurations to select candidate second points respectively corresponding to the block configurations.
  • the matching module 630 can be configured to obtain candidate disparities d CA between the first point 241 and the selected candidate second points 282 , respectively, and determine the disparity d based on the candidate disparities d CA . In some embodiments, the matching module 630 is configured to determine an object distance D based on the disparity d.
  • the matching module 630 can be configured to match a first pixel on the first image 221 with a corresponding second pixel on the second image 222 using the block configurations 310 . In some embodiments, the matching module 630 is configured to match a first feature on the first image 221 with a corresponding second feature on the second image 222 using the block configurations 310 .
  • the processing system 600 can optionally include a pre-processing module 640 for pre-processing the first and second images 221 , 222 prior to the matching.
  • the pre-processing module 640 is configured to rectify the second image 222 relative to the first image 221 .
  • an exemplary stereoscopic imaging system 100 is shown as being a mobile platform that is an unmanned aerial vehicle (UAV) 150 .
  • the stereoscopic imaging system 100 can be mounted on the mobile platform, such as the UAV 150 .
  • UAVs 150 colloquially referred to as “drones,” are aircraft without a human pilot onboard the vehicle whose flight is controlled autonomously or by a remote pilot (or sometimes both). UAVs are now finding increased usage in civilian applications that require various forms of aerial data-gathering.
  • Various types of UAVs 150 are suitable for use as stereoscopic imaging systems 100 .
  • UAV 150 is an aerial rotorcraft that is propelled by multiple rotors.
  • One suitable type of rotorcraft has four rotors and is known as a quadcopter, quadrotor helicopter, or quad rotor.
  • Exemplary quadcopters suitable for the present systems and methods for imaging include numerous models currently available commercially.
  • UAVs 150 suitable for the present systems and methods further include, but are not limited to, other rotor designs such as single rotor, dual rotor, trirotor, hexarotor, and octorotor designs. Fixed wing UAVs 150 and hybrid rotorcraft-fixed wing UAVs 150 can also be used.
  • Imaging devices 201 , 202 can be mounted to the UAV 150 in any suitable manner (for example, on an underside of the UAV 150 ).
  • Other components of the stereoscopic imaging system 100 can be included in a fuselage of the UAV 150 , as appropriate, for protection against wear and tear.

Abstract

One or more block configurations are selected, based on object information associated with an object, from a plurality of block configurations having at least one of different sizes or different shapes. A first point in a first image is matched with a corresponding second point in a second image using the one or more block configurations.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of patent application Ser. No. 16/240,704, filed on Jan. 4, 2019, which is a continuation of International Application No. PCT/CN2016/088829, filed on Jul. 6, 2016, the entire contents of both of which are incorporated herein by reference.
  • COPYRIGHT NOTICE
  • A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
  • FIELD
  • The disclosed embodiments relate generally to computer vision and more particularly, but not exclusively, to systems and methods for stereoscopic imaging.
  • BACKGROUND
  • High-precision depth mapping is important for applications such as automatic obstacle avoidance in robotics and mobile platforms, geographic mapping, and space exploration. Stereoscopic imaging is a technique for providing depth information using binocular vision. Such a technique can be used for constructing a depth map from two or more images by matching features in the images. Existing techniques for matching features, however, often have mismatching problems, especially when encountering complex images with significant depth variation or smooth texture.
  • In view of the foregoing, there is a need for systems and methods for stereoscopic imaging that overcome the disadvantages of currently systems and methods.
  • SUMMARY
  • In accordance with a first aspect disclosed herein, there is set forth a method of block matching between a first image and a second image for stereoscopic imaging, comprising: selecting a block configuration from a plurality of block configurations; and matching a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • In accordance with another aspect disclosed herein, there is set forth a stereoscopic imaging system, comprising: a first imaging device configured to obtain a first image; a second imaging device configured to obtain a second image; and one or more processors configured to: select a block configuration from a plurality of block configurations; and match a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • In accordance with another aspect disclosed herein, there is set forth an apparatus for stereoscopic imaging, comprising one or more processors configured to: receive a first image; receive a second image; select a block configuration from a plurality of block configurations; and match a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • In accordance with another aspect disclosed herein, there is set forth a computer readable storage medium, comprising: instruction for selecting a block configuration from a plurality of block configurations for stereoscopic imaging; and instruction for matching a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • In accordance with another aspect disclosed herein, there is set forth a processing system, comprising: a receiving module for receiving a first image obtained by an first imaging device and a second image obtained by a second imaging device; a selecting module for selecting a block configuration from a plurality of block configurations for stereoscopic imaging; and a matching module for matching a first point on the first image with a corresponding second point on the second image using the selected block configuration.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is an exemplary diagram illustrating an embodiment of a system for stereoscopic imaging.
  • FIG. 2 is an exemplary diagram illustrating an embodiment of block matching using the system of FIG. 1.
  • FIG. 3 is an exemplary diagram illustrating an embodiment of a stereoscopic imaging apparatus of the system of FIG. 1.
  • FIG. 4 is an exemplary top-level flow chart illustrating an embodiment of a method for block matching using the system of FIG. 1.
  • FIG. 5 is an exemplary diagram illustrating an embodiment of a method for triangulation of distance using the system of FIG. 1.
  • FIG. 6 is an exemplary diagram illustrating another embodiment of a method for triangulation of distance using the system of FIG. 1.
  • FIG. 7 is an exemplary diagram illustrating an embodiment of block matching using the system of FIG. 1.
  • FIG. 8 is an exemplary diagram illustrating an embodiment of the system of FIG. 1, wherein an image pre-processor is shown.
  • FIG. 9 is an exemplary diagram illustrating an embodiment of a method for rectifying images using the image pre-processor of FIG. 8.
  • FIG. 10 is an exemplary diagram illustrating an embodiment of the system of FIG. 1, wherein a disparity validator is shown.
  • FIG. 11 is an exemplary diagram illustrating an embodiment of the system of FIG. 1, wherein block matching using matching costs are shown.
  • FIG. 12 is an exemplary diagram illustrating an embodiment of a method for identifying corresponding points using the system of FIG. 1.
  • FIG. 13 is an exemplary flow chart illustrating an embodiment of a method for identifying corresponding points using the system of FIG. 1.
  • FIG. 14 is an exemplary diagram illustrating an embodiment of the system of FIG. 1, wherein matching costs are combined to identify a candidate point.
  • FIG. 15 is an exemplary diagram illustrating an embodiment of the system of FIG. 1, wherein candidate disparities are found for determining a binocular disparity.
  • FIG. 16 is an exemplary diagram illustrating another embodiment of the system of FIG. 1, wherein candidate disparities are validated using a disparity validator.
  • FIG. 17 is an exemplary top level block diagram illustrating modules of a processing system for block matching.
  • FIG. 18 is exemplary diagram illustrating an embodiment of the system of FIG. 1, wherein the system is an unmanned aerial vehicle (UAV).
  • It should be noted that the figures are not drawn to scale and that elements of similar structures or functions are generally represented by like reference numerals for illustrative purposes throughout the figures. It also should be noted that the figures are only intended to facilitate the description of the embodiments. The figures do not illustrate every aspect of the described embodiments and do not limit the scope of the present disclosure.
  • DETAILED DESCRIPTION OF THE EMBODIMENTS
  • Current systems and methods have drawbacks in ensuring high accuracy block matching between two images for stereoscopic imaging. The present systems and methods improve accuracy in block matching by using blocks of varying shapes and sizes, as described in further specificity below.
  • The present systems and methods are illustrated with respect to an exemplary stereoscopic imaging system 100 in FIG. 1. The stereoscopic imaging system 100 can include a first imaging device 201 and a second imaging device 202 for capturing a first image 221 and a second image 222, respectively. Although the stereoscopic imaging system 100 is shown to include two imaging devices 201, 202 for illustrative purposes only, the present systems and methods can use any number of imaging devices (for example, one, two, three, four, five, six, or more imaging devices) for stereoscopic imaging, as desired. For example, two or more pairs of imaging devices can be used for enhanced stereoscopic imaging, as desired. The imaging devices 201, 202 can receive incident light from an external environment and convert the incident light into digital and/or analog signals (not shown). The signals can be processed to produce the images 221, 222 that can be displayed to a user. Exemplary imaging devices 201, 202 suitable for use with the present systems and methods include, but are not limited to, commercially-available cameras and camcorders.
  • The imaging devices 201, 202 can be relatively positioned in a selected manner, so the images 221, 222 can perceive an object (shown in FIG. 2) from different perspectives. The imaging devices 201, 202 can be positioned in any desired manner in the stereoscopic imaging system 100. In some embodiments, for example, the imaging devices 201, 202 can be positioned side-by-side so that the imaging devices 201, 202 have parallel optical axes.
  • The stereoscopic imaging system 100 can include a stereoscopic imaging apparatus 300 for receiving the images 221, 222 from the imaging devices 201, 202, respectively. The stereoscopic imaging apparatus 300 can obtain depth information of the object based on the images 221, 222. Exemplary depth information can include an object distance from the object to the first imaging device 201 and/or the second imaging device 202.
  • The stereoscopic imaging apparatus 300 can obtain the depth information by matching the first image 221 with the second image 222 using block matching. As shown in FIG. 1, the stereoscopic imaging apparatus 300 can select a plurality of block configurations 310 for block matching. Each block configuration 310 can include one or more configuration parameters for defining one or more blocks 341, 342 (shown in FIG. 2) on each of the first image 221 and the second image 222, respectively. Exemplary configuration parameters can include size (expressed by number of pixels, for example), shape, or a combination thereof.
  • As shown in FIG. 1, a block configuration calculator 320 can be implemented based on a corresponding block configuration 310. That is, the block configuration calculators 320A, 320B, . . . , 320N can be implemented based on block configurations 310A, 320B, . . . , 310N, respectively. The block configuration calculator 320 can perform any calculation and/or operations used for evaluating matching between the blocks 341, 342. For example, evaluating matching can include calculating a matching cost c. The matching cost c can quantify a similarity between pixels in the blocks 341, 342. Any suitable comparison method between pixels of two blocks can be used to find the matching cost c. Exemplary methods for finding the matching cost c include comparison of pixel values using correlation, covariance, sum of absolute difference, sum of squared differences, and the like.
  • In some embodiments, results generated by the block configuration calculators 320 can be inputted to a cross-block-configuration calculator 330 to compare and contrast matching results using different block configurations 310.
  • Turning now to FIG. 2, exemplary first and second images 221, 222 of an object 230 are shown for block matching using an exemplary block configuration 310 that is circular. Based on the block configuration 310, a first block 341 that includes a first point 241 is defined on the first image 221. Similarly, a second block 342 that includes the second point 242 can be defined. Each point 241, 242 can include one or more adjacent pixels representing a part of the image. In some embodiments, each point can be a single pixel. In some embodiments, each point can include a feature (for example, a set of pixels representing an object) represented by a plurality of pixels.
  • Although FIG. 2 shows the block configuration 310 as having a circular shape for illustrative purposes only, the block configurations 310 can have any uniform and/or different shapes. At least one of the block configurations 310 can have a non-rectangular shape. Exemplary block configurations 310 can have a triangular shape, a circular shape, an oval shape, a trapezoidal shape, a polygonal shape, or any combination thereof. The shapes can be regular or irregular, as desired. Additionally and/or alternatively, the block configurations 310 can have any uniform and/or different sizes. Exemplary dimensions include, for example, 7 pixels, 9 pixels, 13 pixels, 20 pixels, 25 pixels, or more. In one example, the block configuration 310 can have a square shape having a size of 7 pixels by 7 pixels, 9 pixels by 9 pixels, 13 pixels by 13 pixels, or 25 pixels by 25 pixels.
  • Turning now to FIG. 3, the exemplary stereoscopic imaging apparatus 300 is shown as including processor 302 and a memory 304. The processor 302 can be any type of processing system for implementing block matching functions and/or other operations described in the present disclosure. Exemplary processors 302 include, without limitation, one or more general purpose microprocessors (for example, single or multi-core processors), application-specific integrated circuits (ASICs), application-specific instruction-set processors, graphics processing units (GPUs), physics processing units, digital signal processing units, coprocessors, network processing units, audio processing units, encryption processing units, and the like.
  • In certain embodiments, the processor 302 can include specialized hardware and/or software for performing various block matching functions and operations described herein. For example, the processor 302 can include an image processing engine or media processing unit, which can include specialized hardware for enhancing the speed and efficiency of certain operations for image processing and block matching. Specialized hardware can include, but are not limited to, specialized parallel processors, caches, high speed buses, and the like. In some embodiments, the processor 302 can include field-programmable gate arrays (FPGAs), ASICs, central processing units (CPUs), GPUs, visual processing units (VPUs), or a combination thereof.
  • Although FIG. 3 shows the stereoscopic imaging apparatus 300 as including one processor 302 for illustrative purposes only, the stereoscopic imaging apparatus 300 can include any number of uniform and/or different processors 302.
  • The memory 304 (alternatively referred to herein as a non-transitory computer readable medium) can comprise any type of memory and can be, for example, a random-access memory (RAM), a static RAM, a dynamic RAM, a read-only memory (ROM), a programmable ROM, an erasable programmable ROM, an electrically erasable programmable ROM, a flash memory, a secure digital (SD) card, and the like. In some embodiments, the memory 304 has a storage capacity that accommodates the needs of the block matching functions and operations described herein. The memory 304 can have any commercially-available memory capacity suitable for use in image processing applications and, in some embodiments, has a storage capacity of at least 512 Megabytes, 1 Gigabyte, 2 Gigabytes, 4 Gigabytes, 16 Gigabytes, 32 Gigabytes, 64 Gigabytes, or more. Although FIG. 3 shows the stereoscopic imaging apparatus 300 as including one memory 304 for illustrative purposes only, the stereoscopic imaging apparatus 300 can include any number of uniform and/or different memories 304.
  • The stereoscopic imaging apparatus 300 can further include any hardware and/or software desired for performing the block matching functions and operations described herein. For example, the stereoscopic imaging apparatus 300 can include one or more input/output interfaces (not shown) for receiving the images 221, 222 shown in FIG. 1. Exemplary interfaces include, but are not limited to, universal serial bus (USB), digital visual interface (DVI), display port, serial ATA (SATA), IEEE 1394 interface (also known as FireWire), serial, video graphics array (VGA), super video graphics array (SVGA), small computer system interface (SCSI), high-definition multimedia interface (HDMI), audio ports, and/or proprietary input/output interfaces. As another example, the stereoscopic imaging apparatus 300 can include one or more input/output devices (not shown), for example, buttons, a keyboard, keypad, trackball, displays, and/or a monitor. As yet another example, the stereoscopic imaging apparatus 300 can include hardware for communication between components of the image processing system 100 (for example, between the processor 302 and the memory 304).
  • Turning now to FIG. 4, an exemplary method 400 for stereoscopic imaging is shown. Optionally, the first image 221 can be obtained, at 401. Optionally, the second image 221 can be obtained, at 402. In one example, the stereoscopic imaging apparatus 300 can receive the images 221, 222 respectively obtained from the imaging devices 201, 202 shown in FIG. 2. In another example, the stereoscopic imaging apparatus 300 can obtain the images 221, 222 stored on another computer system (not shown), and the stereoscopic imaging apparatus 300 is not necessarily in communication with the imaging devices 201, 202.
  • At 410, a block configuration 310 is selected from a plurality of block configurations 310. For example, the block configuration 310 can be selected from a library of block configurations 310. The library of block configurations 310 can be predefined by an operator, as desired. In an illustrative and non-limiting example, the block configuration 310 can be selected based on a texture of the object 230 (shown in FIG. 2). In the example, when the object 230 has a smooth texture (such as a monochromatic wall or smooth glass), the block configurations 310 having a small size may provide limited information and result in mismatching. Thus, a block configurations 310 having a large size can be selected to provide more information for block matching. In another example, when the object 230 has a rough texture with significant depth variation, a block configuration 310 having a large size can lead to poor overlapping (or large amount of mismatching features) between the images 221, 222. Such poor overlapping can result in a very low degree of matching. Thus, a block configuration 310 having a small size can be selected. In still another example, a block configuration 310 having a shape and/or size matching shape of the object 230 or a portion of the object 230 can be selected, for the block configuration 310 to include a selected texture for matching.
  • In one embodiment, the stereoscopic imaging apparatus 300 can select one or more predetermined block configurations 310. For example, an operator can select block configurations 310 with a variety of shapes and/or sizes, with or without consideration of content of the images 221, 222.
  • In some embodiments, the stereoscopic imaging apparatus 300 can select the block configuration 310 dynamically. For example, block configurations 310 can be selected based on content of the images 221, 222, such as based on the shape and/or size of object 230 in the images 221, 222. For example, the stereoscopic imaging apparatus 300 can perform one or more image feature recognition and/or feature detection methods to determine texture of the object 230 in the images 221, 222. Exemplary feature recognition methods can include edge detection (for example, Shi & Tomasi corner detection), image binarization, SURF blob (Speeded Up Robust Features) detection, MSER blob (Maximally Stable Extremal Regions) detection, SIFT (Scale-Invariant Feature Transform) descriptors, FREAK (Fast REtinA Keypoint) descriptors, BRISK (Binary Robust Invariant Scalable Keypoints) descriptors, HOG (Histogram of Oriented Gradients) descriptors, and the like.
  • In yet other embodiments, the stereoscopic imaging apparatus 300 can select the block configuration 310 randomly. Randomizing the selection of the block configuration 310 can reduce matching error associated with using a fixed set of block configuration 310 for a specific type of texture, for example.
  • As shown in FIG. 4, at 420, the first point 241 on the first image 221 is matched with the corresponding second point 242 on the second image 222 using the block configurations 310. Stated somewhat differently, the second point 242 that corresponds to the first point 241 can be identified based on the matching.
  • The method 400 can advantageously improve accuracy of block matching. As previously described, different size and/or shape of the block configuration 310 can result in different accuracy of block matching. When a single block configuration 310 is used, the block configuration 310 may not be able to adapt to all types of complex scenes or objects. Using a plurality of block configurations 310 can ensure that one or more block configurations 310 can be suitable for achieving accurate matching and reducing probability of mismatching. Using the plurality of block configurations 310 can thus compensate for limitations or inaccuracies associated with single block configuration 310, thereby improving the accuracy and reliability of a block matching result.
  • FIG. 5 and FIG. 6 are shown to illustrate obtaining the depth information using the images 221, 222. FIG. 5 shows the imaging devices 201, 202 imaging the object 230 using stereopsis. FIG. 6 illustrates a method of ascertaining an object distance D using triangulation with reference to the two imaging devices 201, 202. Each of the imaging devices 201, 202 perceives a same point 220 on the object 230, but in different spatial coordinates as illustrated by the coordinate axes (x1, y1, z1) and (x2, y2, z2). The imaging devices 201, 202 perceive the point 220 along respective optical axes 261, 262 and thereby arrive at two different images 221, 222 of the point 220. The point 220 is captured in the images 221, 222 as the first and second points 241, 242 (shown in FIG. 2), respectively. The images 221, 222 can be two-dimensional. The images 221, 222 are typically different, being taken from different positions, unless the imaging devices 201, 202 are positioned such that their optical axes 261, 262 coincide. Accordingly, under most circumstances, a disparity d shown in Equation (4) can be found between the images 221, 222, as described below with reference to FIG. 6.
  • Referring now to FIG. 6, the images 221, 222 can be compared to ascertain the object distance D between the pair of imaging devices 201, 202 and the point 220. A method of triangulation can be used to ascertain the object distance D using the disparity d shown in Equation (4). Specifically, the position of the point 220 having an index i, represented by coordinates (xi, yi, zi), can be given as follows:
  • x i = b d ( x i K - c x ) , Equation ( 1 ) y i = b d ( y i K - c y ) , and Equation ( 2 ) z i = b d f Equation ( 3 )
  • where cx and cy represent respective center coordinates of the first imaging device 201 (or the second imaging device 202), xi K and yi K represent the coordinates representing the point 220 in the first image 221 where K=1 (or the second image 222 where K=2), b is the baseline (in other words, the distance between the center coordinates of the imaging devices 201, 202), ƒ is the focal length of each of the imaging devices 201, 202 (assuming here that the imaging devices 201, 202 have the same focal length), i is an index over multiple points 220 of the object 230, and d is the disparity between the images 221, 222, represented here as:

  • d i =x i 1 −x i 2  Equation (4)
  • Based on the images 221, 222 and using Equations (1)-(4), the stereoscopic imaging apparatus 300 shown in FIG. 1 can determine the coordinate zi. The object distance D can be equal to and/or based on the coordinate zi.
  • Turning to now FIG. 7, another illustrative example of matching points between two images is shown. As shown in FIG. 7, for a given first 241 on the first image 221, a matching process can be used to select a second point 242 on the second image 222 that corresponds to the first point 241. A plurality of candidate second points 282 a . . . 282 m can be selected for evaluation of a matching cost c for determining the suitability of matching. As described above with reference to FIG. 1, any suitable comparison method between pixels of two blocks can be used to find the matching cost c. Exemplary methods for finding the matching cost c include comparison of pixel values using correlation, covariance, sum of absolute difference, sum of squared differences, and the like. Based on the matching costs c, the second point 242 can be selected from among the candidate second points 282 a . . . 282 m.
  • The candidate second points 282 a . . . 282 m for matching can be selected in any manner. In some embodiments, the candidate second points 282 a . . . 282 m can be selected in a predefined neighborhood about the coordinates of the first point 241. For example, candidate second points 282 a . . . 282 m can be evaluated that are within a defined range that is 2 pixels, 4 pixels, 8 pixels, 16 pixels, 32 pixels, 64 pixels, 128 pixels, 256 pixels, or more of the coordinates of the first point 241. Alternatively, all pixels of the second image 222 can be evaluated. In some embodiments, all of the pixels within the defined range can be evaluated for the matching cost c. In other embodiments, pixels within the defined range can be sampled (for example, randomly or by interval sampling) for the matching cost c.
  • Turning now to FIG. 8, an exemplary stereoscopic imaging system 100 can include an image pre-processor 350 for pre-processing images 221, 222. The image pre-processor 350 can execute any image transformation operations for projecting the images 221, 222 onto a common image plane defined in a common coordinate system (shown in FIG. 9). For example, the image pre-processor 350 can be used for rectifying the images 221, 222. The image pre-processor 350 can transform the images 221, 222 so that the coordinate axes (x1, y1, z1) and (x2, y2, z2) are aligned. For example, the image pre-processor 350 can perform rotation to align the x1-axis with the x2-axis and/or align the y1-axis with the y2-axis. Additionally and/or alternatively, the image pre-processor 350 can perform scaling transformation to eliminate scaling difference between the images 221, 222. Additionally and/or alternatively, the image pre-processor 350 can perform rotation and/or skew adjustments in z direction to align the z1-axis with the z2-axis. By pre-processing the images 221, 222 images, the pre-processor 350 can simplify subsequent block matching between the images 221, 222, advantageously improving matching efficiency.
  • Additionally and/or alternatively, the image pre-processor 350 can be configured to perform any image processing operations to improve quality of the images 221, 222. Subsequent block matching can advantageously be more easily performed. Exemplary image processing operations include, but are not limited to, image de-noising, image sharpening, and/or image de-blurring.
  • Although FIG. 8 shows the image pre-processor 350 and the stereoscopic imaging apparatus 300 as being individual units, the image pre-processor 350 and the stereoscopic imaging apparatus 300 can be at least partially integrated, without limitation.
  • Turning now to FIG. 9, the images 221, 222 after rectification are shown. Rectification advantageously aligns the images 221, 222 along the same y-axis and z-axis, such that the images 221, 22 are offset only along the x-axis. Stated somewhat differently, the second point 242 can have a same y-axis coordinate as the first point 241. Thus, the candidate second points 282 can be chosen in the second image 222 along a horizontal line 226 that passes the first point 241. Searching of the candidate second points 282 can thus be one-dimensional. Computation for the searching is advantageously simplified.
  • After rectification of the images 221, 222, and/or other pre-processing steps, the image pre-processor 350 (shown in FIG. 8) can transmit the rectified images 221, 222 to the stereoscopic imaging apparatus 300. Subsequently, a block configuration 310 can be selected from a plurality of block configurations. Points 241, 242 on the rectified images 221, 222 can be matched using the selected block configuration 310 using the present systems and methods.
  • Turning to FIG. 10, the stereoscopic imaging apparatus 300 can include a disparity validator 360. The disparity validator 360 can perform any calculation operations used for evaluating accuracy of the disparity d using a predetermined criteria. When accuracy of the disparity d does not satisfy the predetermined criteria, the disparity validator 360 can determine that the disparity d is invalid. Although FIG. 10 shows the disparity validator 360 and the cross-block-configuration calculator 330 as being individual units, the disparity validator 360 and the cross-block-configuration calculator 330 can be at least partially integrated, without limitation.
  • Turning to FIG. 11, exemplary embodiments are now shown to illustrate generating the matching cost c to match the points 241, 242 (shown in FIG. 2) for obtaining the disparity d. Each block configuration calculator 320 can generate an individual matching cost MCij between the first point 241 and a selected candidate second point 282 shown in FIG. 7. In the individual matching cost MCij, i is an index over the block configurations 310 and j is an index over the candidate second points 282. Stated somewhat differently, between the first point 241 and the jth candidate second point 282, block configuration calculators 320A, 320B, . . . , 320N can calculate individual matching cost MCAj, MCBj, . . . , MCNj, respectively. The individual matching cost MCij can be provided to the cross-block-configuration calculator 330 to be combined for identifying the second point 242 corresponding to the first point 241.
  • Turning now to FIG. 12, the first block 341 and the second block 342 are shown to include the first point 241 and the candidate second point 282 a. The first block 341 and the second block 342 can be based on the block configuration 310A. The block configuration 310A can have a triangular shape shown in FIG. 11 for illustrative purposes only. The individual matching cost MCAj between the first point 241 and the candidate second point 282 a can thus be calculated using the blocks 341, 342 based on the block configuration 310A.
  • Turning now to FIG. 13, an exemplary method 500 for determining an individual matching cost MCij for each block configuration 310 is shown. The first block 341 of pixels around the first point 241 of the first image 221 can be chosen, at 510, based on the block configuration 310. The block configuration 310 can include any one of the block configurations 310A, 310B, . . . , 310N shown in FIG. 11. A second block 342 of pixels around the candidate second point 282 of the second image 222 can be chosen based on the block configuration 310. The candidate second point 282 can include any one of the candidate second points 282 a, 282 b, . . . , 282 m shown in FIG. 12.
  • The individual matching cost MCij for the block configuration 310 between the first point 241 of the first image 221 and the selected candidate second point 282 of the second image 222 can be determined, at 530. The method 500 can be performed for each candidate second point 282 based on each block configuration 310.
  • Turning to FIG. 14, the individual cost MCij is shown to be combined via the cross-block-configuration calculator 330 (shown in FIG. 11) to identify the second point 242 among the candidate second points 282. As shown in FIG. 14, for the candidate second point 282 a, the individual matching costs MCij can include MCAa, MCBa, . . . , MCNa. For the candidate second point 282 m, the individual matching costs MCij can include MCAm, MCBm, . . . , MCNm.
  • For each candidate second point 282, the corresponding individual matching cost MCij can be combined to generate the matching cost MG. As shown in FIG. 14, for the candidate second point 282 a, the individual matching costs MCAa, MCBa, . . . , MCNa can be combined to generate matching cost MCa. For the candidate second point 282 m, the sub-individual matching costs MCAm, MCBm, . . . , MCNm can be combined to generate matching cost MCm.
  • The matching cost MCj of the corresponding second point 282 can be generated in any suitable manner. For example, the matching cost MCj can be a function ƒ of the individual matching costs MCij of the corresponding candidate second point 282:

  • MCj=ƒ(MCAj,MCBj, . . . ,MCNj).  Equation (5)
  • In one embodiment, the matching cost MCj can be determined by selecting one of the individual matching costs MCAj, MCBj, . . . , MCNj. For example, the matching cost MCj can be determined by selecting the greatest or best one of the individual matching costs MCAj, MCBj, . . . , MCNj, that is, by selecting the individual matching cost that indicates the best matching.
  • In another embodiment, the MCj can be determined by weighing the individual matching costs MCAj, MCBj, . . . , MCNj, as expressed as follows for illustrative purposes:

  • MCji=A N w iMCij  Equation (6)
  • where wi is a weight of the ith block configuration 310. The wi can be pre-determined based on accuracy of the ith block configuration 310 for all images or for the specific type of content of images 221, 222 shown in FIG. 12.
  • The matching costs MCj of the candidate second points 282 can be compared to identify a candidate second point 282. In one example, the second point 242 having the greatest matching cost MCj can be selected as the matching second point 242. In another example, the matching cost MCj of each candidate second point 282 can be compared with a threshold matching cost. When the matching cost MCj indicates a better match than the threshold matching cost, the corresponding candidate second point 282 can be selected as the matching second point 242.
  • As shown in FIG. 14, by using the plurality of block configurations 310A, 310B, . . . , 310N, a plurality of individual matching costs MCij can be obtained for each of the candidate second point 282. Problems of mismatch and/or poor match for a single block configuration 310 can thereby be prevented. The identified second point 242 can have the best matching cost that is provided by the block configuration 310 that is optimally adapted to the scene of the images 221, 222.
  • Turning to FIG. 15, each block configuration calculator 320 can generate a candidate disparity dCA. Each candidate disparity dCA can be obtained using the second point 242 (shown in FIG. 2) that is identified based on the block configuration 310. Stated somewhat differently, based on a selected block configuration 310, a second point 242 can be identified among the candidate second points 282. The candidate disparity dCA can thus be obtained based on the second point 242. For example, the candidate disparities dA, dB, . . . , dN shown in FIG. 15 can be obtained based on the second points 242 respectively identified using the block configuration 310A, 310B, . . . , 310N.
  • Using a selected block configuration 310 with index of i, the block configuration calculator 320 can identify the corresponding second point 242 in any suitable manner. For example, individual matching cost MCij can be calculated for each candidate second point 282 based on the ith block configuration 310. The candidate second point 282 associated with the best individual matching cost MCij can be selected as the second point 242 for the ith block configuration 310.
  • The candidate disparities dCA are inputted to the cross-block-configuration calculator 330 to be combined for identifying the disparity d. Turning to FIG. 16, an illustrative example of combining the candidate disparities dCA is shown. The candidate disparities dCA can be validated in any manner. For example, the candidate disparities dCA can be validated via the disparity validator 360. Validation of the candidate disparities dCA of a selected block configuration 310 shown in FIG. 15 can include, for example, comparing the threshold matching cost with the individual matching cost MCij of the second point 242 identified using the selected block configuration 310. If the individual matching cost MCij of the identified second point 242 is worse than the threshold matching cost, the candidate disparities dCA associated with such a second point 242 is not valid.
  • In an illustrative and non-limiting example, a candidate disparity dCA associated with the block configuration 310 having the greatest size can be validated first. When the candidate disparity dCA is valid, the dCA can be identified as the disparity d. A block configuration 310 having a large size can provide great amount of information for matching and thus allow block matching at high accuracy. Thus, starting validation from the block configuration 310 having the greatest size can advantageously identify the disparity d accurately at early stage of the validation.
  • When such a candidate disparity dCA is invalid, a candidate disparity dCA associated with a different block configuration 310 can be validated. For example, the candidate disparity dCA of the block configuration 310 having the greatest size can be invalid due to poor matching resulting from significant depth variation of imaging. In that case, the block configuration 310 having the same size but a different shape can be validated. Additionally and/or alternatively, the block configuration 310 having a smaller size with a different and/or same shape can be validated. Advantageously, by using the plurality of block configurations 310, a probability of successful block matching can be improved and failed block matching can be prevented.
  • The validation can be implemented until at least one candidate disparity dCA is identified. Additionally and/or alternatively, the validation can be implemented on all candidate disparities dCA. In some cases, one or more candidate disparities dCA can be determined as valid, and each of the valid candidate disparities dCA can be identified as the disparity d. The valid candidate disparities dCA can be combined to form the disparity d, for example, via a filtering technique such as mean filtering and/or median filtering.
  • Turning now to FIG. 17, an exemplary processing system 600 is shown as including one or more modules to perform any of the disclosed methods. The processing system 600 is shown as including a receiving module 610, a selecting module 620, and a matching module 630. In some embodiments, the receiving module 610 can be configured to obtain a first image 221 and a second image 222 (both shown in FIG. 1). The receiving module 610 can receive the images 221, 222 respectively obtained by a first imaging device 201 and a second imaging device 202 (both shown in FIG. 1). The selecting module 620 can be configured to select a plurality of block configurations 310. The matching module 630 can be configured to match the first point 241 on the first image 221 with the corresponding second point 242 on the second image 222 using the block configurations 310, as shown in FIG. 2.
  • In some embodiments, the block configurations 310 can have different sizes. In some embodiments, at least one block configuration 310 can have a size of 7 pixels by 7 pixels, 9 pixels by 9 pixels, 13 pixels by 13 pixels, or 25 pixels by 25 pixels. In some embodiments, the block configurations 310 can have different shapes. In some embodiments, at least one of the block configurations 310 can have a non-rectangular shape. In some embodiments, the block configurations 310 can include a block configuration 310 with a circular shape, a block configuration with a triangular shape, a block configuration with a trapezoidal shape, or a combination thereof.
  • In some embodiments, the selecting module 620 can be configured to select one or more pre-determined block configurations 310. In some embodiments, the selecting module 620 can be configured to dynamically select the block configurations 310. In some embodiments, the selecting module 620 can be configured to randomly select the block configurations 310.
  • In some embodiments, the matching module 630 can be configured to determine a matching cost c between the first point 241 on the first image 221 and each of a plurality of candidate second points 282 (shown in FIG. 7) of the second image 222 to identify the corresponding second point 242. In some embodiments, the matching module 630 can be configured to determine a matching cost between the first point 241 on the first image 221 and each of a plurality of candidate second points 282 on a line of the second image 222 to identify the corresponding second point 242. In some embodiments, the matching module 630 can be configured to determine the matching cost based on individual matching costs c for each of the block configurations 310. In some embodiments, the matching module 630 can be configured to determine the matching cost by selecting one of the individual matching costs c. In some embodiments, the matching module 630 can be configured to determine the matching cost by weighing the individual matching costs c. In some embodiments, the matching module 630 can be configured to determine the matching cost using a predetermined weight for each block configuration 310. In some embodiments, the matching module 630 can be configured to determine the individual matching costs for each block configuration 310 of the block configurations 310 by choosing a first block 341 of pixels around the first point 241 of the first image 221 based on the block configuration 310, choosing a second block 342 of pixels around a selected candidate second point 282 of the second image 222 based on the block configuration 310, and determining the individual matching cost for the block configuration 310 between the first point 241 of the first image 221 and the selected candidate second point 282 of the second image 222.
  • In some embodiments, the matching module 630 can be configured to determine an object distance D based on a disparity d between the first point 241 of the first image 221 and the corresponding second point 242 of the second image 222.
  • In some embodiments, the matching module 630 can be configured to determine individual matching costs between the first point on the first image and each of a plurality of candidate second points on the second image based on each of the block configurations to select candidate second points respectively corresponding to the block configurations.
  • In some embodiments, the matching module 630 can be configured to obtain candidate disparities dCA between the first point 241 and the selected candidate second points 282, respectively, and determine the disparity d based on the candidate disparities dCA. In some embodiments, the matching module 630 is configured to determine an object distance D based on the disparity d.
  • In some embodiments, the matching module 630 can be configured to match a first pixel on the first image 221 with a corresponding second pixel on the second image 222 using the block configurations 310. In some embodiments, the matching module 630 is configured to match a first feature on the first image 221 with a corresponding second feature on the second image 222 using the block configurations 310.
  • As shown in FIG. 17, the processing system 600 can optionally include a pre-processing module 640 for pre-processing the first and second images 221, 222 prior to the matching. In some embodiments, the pre-processing module 640 is configured to rectify the second image 222 relative to the first image 221.
  • Turning now to FIG. 18, an exemplary stereoscopic imaging system 100 is shown as being a mobile platform that is an unmanned aerial vehicle (UAV) 150. Stated somewhat differently, the stereoscopic imaging system 100 can be mounted on the mobile platform, such as the UAV 150. UAVs 150, colloquially referred to as “drones,” are aircraft without a human pilot onboard the vehicle whose flight is controlled autonomously or by a remote pilot (or sometimes both). UAVs are now finding increased usage in civilian applications that require various forms of aerial data-gathering. Various types of UAVs 150 are suitable for use as stereoscopic imaging systems 100. One suitable type of UAV 150, for example, is an aerial rotorcraft that is propelled by multiple rotors. One suitable type of rotorcraft has four rotors and is known as a quadcopter, quadrotor helicopter, or quad rotor. Exemplary quadcopters suitable for the present systems and methods for imaging (for example, stereoscopic imaging) include numerous models currently available commercially. UAVs 150 suitable for the present systems and methods further include, but are not limited to, other rotor designs such as single rotor, dual rotor, trirotor, hexarotor, and octorotor designs. Fixed wing UAVs 150 and hybrid rotorcraft-fixed wing UAVs 150 can also be used. Imaging devices 201, 202 can be mounted to the UAV 150 in any suitable manner (for example, on an underside of the UAV 150). Other components of the stereoscopic imaging system 100 can be included in a fuselage of the UAV 150, as appropriate, for protection against wear and tear.
  • The disclosed embodiments are susceptible to various modifications and alternative forms, and specific examples thereof have been shown by way of example in the drawings and are herein described in detail. It should be understood, however, that the disclosed embodiments are not to be limited to the particular forms or methods disclosed, but to the contrary, the disclosed embodiments are to cover all modifications, equivalents, and alternatives.

Claims (20)

What is claimed is:
1. A block matching method comprising:
selecting, based on object information associated with an object, one or more block configurations from a plurality of block configurations having at least one of different sizes or different shapes; and
matching, using the one or more block configurations, a first point in a first image with a corresponding second point in a second image.
2. The method of claim 1, wherein:
the object information is indicative of a texture of the object; and
sizes in the one or more block configurations are negatively correlated to a roughness of the texture indicated by the object information.
3. The method of claim 1, wherein selecting the one or more block configurations comprises selecting a block configuration having a shape that matches at least one of a shape or a size of the object or a portion of the object, such that the selected block configuration matches a texture of the object or a texture of the portion of the object.
4. The method of claim 1, wherein:
selecting the one or more block configurations comprises selecting a first block configuration and a second block configuration from the plurality of block configurations that have at least one of different shapes or different sizes; and
matching the first point with the corresponding second point comprises:
obtaining a first matching cost by matching, using the first block configuration, the first point in the first image with the corresponding second point in the second image;
obtaining a second matching cost by matching, using the second block configuration, the first point in the first image with the corresponding second point in the second image; and
matching the first point in the first image with the corresponding second point in the second image based on the first matching cost and the second matching cost.
5. The method of claim 1, wherein selecting the one or more block configurations comprises dynamically selecting the one or more block configurations from the plurality of block configurations.
6. The method of claim 1, wherein matching the first point with the corresponding second point comprises:
determining matching costs between the first point in the first image and a plurality of candidate second points in the second image; and
identifying the corresponding second point from the plurality of candidate second points based on the matching costs.
7. The method of claim 6, wherein determining the matching costs between the first point and the plurality of candidate second points comprises at least one of:
determining matching costs between the first point in the first image and a plurality of candidate second points on a line of the second image to identify the corresponding second point; or
determining, using one block configuration from the block configurations, individual matching costs between the first point and the plurality of candidate second points to determine at least one matching cost that includes at least one of:
selecting the at least one matching cost from the individual matching costs;
determining the at least one matching cost based on weighted individual matching costs by applying first weights to the individual matching costs; or
determining the at least one matching cost based on weighted block configurations by applying second weights to the plurality of block configurations.
8. The method of claim 1, further comprising:
determining an object distance based on a disparity between the first point in the first image and the corresponding second point in the second image.
9. The method of claim 1, wherein matching the first point with the corresponding second point comprises determining, using each block configuration of the plurality of block configurations, individual matching costs between the first point and each of a plurality of candidate second points in the second image to select candidate second points respectively corresponding to the plurality of block configurations.
10. The method of claim 9, further comprising:
obtaining candidate disparities between the first point and the selected candidate second points, respectively;
determining a disparity between the first point and the corresponding second point based on the candidate disparities; and
determining an object distance based on the disparity.
11. The method of claim 1, wherein matching the first point with the corresponding second point comprises at least one of:
matching, using the one or more block configurations, a first pixel in the first image with a corresponding second pixel in the second image; or
matching, using the one or more block configurations, a first feature in the first image with a corresponding second feature in the second image.
12. A stereoscopic imaging system comprising:
a first imaging device configured to obtain a first image;
a second imaging device configured to obtain a second image; and
one or more processors configured to:
select, based on object information associated with an object, one or more block configurations from a plurality of block configurations having at least one of different sizes or different shapes; and
match, using the one or more block configurations, a first point in the first image with a corresponding second point in the second image.
13. The stereoscopic imaging system of claim 12, wherein:
the object information is indicative of a texture of the object; and
sizes in the one or more block configurations are negatively correlated to a roughness of the texture indicated by the object information.
14. The stereoscopic imaging system of claim 12, wherein the one or more processors are further configured to select a block configuration having a shape that matches at least one of a shape or a size of the object or a portion of the object, such that the selected block configuration matches a texture of the object or a texture of the portion of the object.
15. The stereoscopic imaging system of claim 12, where the one or more processors are further configured to:
select a first block configuration and a second block configuration from the plurality of block configurations that have a least one of different shapes or different sizes;
obtain a first matching cost by matching, using the first block configuration, the first point in the first image with the corresponding second point in the second image;
obtain a second matching cost by matching, using the second block configuration, the first point in the first image with the corresponding second point in the second image; and
match the first point in the first image with the corresponding second point in the second image based on the first matching cost and the second matching cost.
16. The stereoscopic imaging system of claim 12, wherein the one or more processors are further configured to:
determine matching costs between the first point in the first image and a plurality of candidate second points in the second image; and
identify the corresponding second point from the plurality of candidate second points based on the matching costs.
17. The stereoscopic imaging system of claim 12, the one or more processors are further configured to determine an object distance based on a disparity between the first point in the first image and the corresponding second point in the second image.
18. The stereoscopic imaging system of claim 12, wherein the one or more processors are further configured to determine, using each block configuration of the plurality of block configurations, individual matching costs between the first point and each of a plurality of candidate second points in the second image to select candidate second points respectively corresponding to the plurality of block configurations.
19. The stereoscopic imaging system of claim 18, wherein the one or more processors are further configured to:
obtain candidate disparities between the first point and the selected candidate second points, respectively;
determine a disparity between the first point and the corresponding second point based on the candidate disparities; and
determine an object distance based on the disparity.
20. The stereoscopic imaging system of claim 12, wherein the one or more processors are further configured to:
match, using the one or more block configurations, a first pixel in the first image with a corresponding second pixel in the second image; and
match, using the one or more block configurations, a first feature in the first image with a corresponding second feature in the second image.
US17/143,849 2016-07-06 2021-01-07 Systems and methods for stereoscopic imaging Abandoned US20210125361A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US17/143,849 US20210125361A1 (en) 2016-07-06 2021-01-07 Systems and methods for stereoscopic imaging

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
PCT/CN2016/088829 WO2018006296A1 (en) 2016-07-06 2016-07-06 Systems and methods for stereoscopic imaging
US16/240,704 US10896519B2 (en) 2016-07-06 2019-01-04 Systems and methods for stereoscopic imaging
US17/143,849 US20210125361A1 (en) 2016-07-06 2021-01-07 Systems and methods for stereoscopic imaging

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US16/240,704 Continuation US10896519B2 (en) 2016-07-06 2019-01-04 Systems and methods for stereoscopic imaging

Publications (1)

Publication Number Publication Date
US20210125361A1 true US20210125361A1 (en) 2021-04-29

Family

ID=60901444

Family Applications (2)

Application Number Title Priority Date Filing Date
US16/240,704 Active US10896519B2 (en) 2016-07-06 2019-01-04 Systems and methods for stereoscopic imaging
US17/143,849 Abandoned US20210125361A1 (en) 2016-07-06 2021-01-07 Systems and methods for stereoscopic imaging

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US16/240,704 Active US10896519B2 (en) 2016-07-06 2019-01-04 Systems and methods for stereoscopic imaging

Country Status (3)

Country Link
US (2) US10896519B2 (en)
CN (1) CN109314774B (en)
WO (1) WO2018006296A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108335322B (en) * 2018-02-01 2021-02-12 深圳市商汤科技有限公司 Depth estimation method and apparatus, electronic device, program, and medium
US11508079B2 (en) * 2019-06-28 2022-11-22 Intel Corporation Parallelism in disparity map generation

Family Cites Families (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0125774D0 (en) * 2001-10-26 2001-12-19 Cableform Ltd Method and apparatus for image matching
GB0708676D0 (en) * 2007-05-04 2007-06-13 Imec Inter Uni Micro Electr A Method for real-time/on-line performing of multi view multimedia applications
CN101321299B (en) * 2007-06-04 2011-06-01 华为技术有限公司 Parallax generation method, generation cell and three-dimensional video generation method and device
US8213706B2 (en) * 2008-04-22 2012-07-03 Honeywell International Inc. Method and system for real-time visual odometry
CN101498889B (en) * 2009-03-03 2011-09-21 无锡易斯科电子技术有限公司 Multi-eye stereo camera shooting method and device
CN102065313B (en) * 2010-11-16 2012-10-31 上海大学 Uncalibrated multi-viewpoint image correction method for parallel camera array
RU2583040C2 (en) * 2011-08-30 2016-05-10 Нокиа Текнолоджиз Ой Device, method and computer program for encoding and decoding video signals
US20130121559A1 (en) * 2011-11-16 2013-05-16 Sharp Laboratories Of America, Inc. Mobile device with three dimensional augmented reality
TWI489418B (en) * 2011-12-30 2015-06-21 Nat Univ Chung Cheng Parallax Estimation Depth Generation
WO2013106418A1 (en) * 2012-01-09 2013-07-18 Tk Holdings, Inc. Stereo-vision object detection system and method
CN103366354B (en) * 2012-03-27 2016-09-07 富士通株式会社 Method and system for stereo matching
US20140009462A1 (en) * 2012-04-17 2014-01-09 3Dmedia Corporation Systems and methods for improving overall quality of three-dimensional content by altering parallax budget or compensating for moving objects
US20140003711A1 (en) * 2012-06-29 2014-01-02 Hong Kong Applied Science And Technology Research Institute Co. Ltd. Foreground extraction and depth initialization for multi-view baseline images
CN103337064A (en) * 2013-04-28 2013-10-02 四川大学 Method for removing mismatching point in image stereo matching
CN103260043A (en) * 2013-04-28 2013-08-21 清华大学 Binocular stereo image matching method and system based on learning
CN104284172A (en) * 2013-07-04 2015-01-14 联咏科技股份有限公司 Image matching method and stereo matching system
US9761002B2 (en) * 2013-07-30 2017-09-12 The Boeing Company Stereo-motion method of three-dimensional (3-D) structure information extraction from a video for fusion with 3-D point cloud data
CN104427324A (en) * 2013-09-02 2015-03-18 联咏科技股份有限公司 Parallax error calculation method and three-dimensional matching device thereof
KR20150053438A (en) * 2013-11-08 2015-05-18 한국전자통신연구원 Stereo matching system and method for generating disparity map using the same
JP6417886B2 (en) * 2013-12-12 2018-11-07 株式会社リコー Parallax value deriving device, moving body, robot, parallax value production method, and program
JP6565188B2 (en) * 2014-02-28 2019-08-28 株式会社リコー Parallax value deriving apparatus, device control system, moving body, robot, parallax value deriving method, and program
US9390508B2 (en) * 2014-03-03 2016-07-12 Nokia Technologies Oy Method, apparatus and computer program product for disparity map estimation of stereo images
CN103822616B (en) * 2014-03-18 2016-01-20 武汉大学 A kind of figure segmentation retrains with topographic relief the Remote Sensing Images Matching Method combined
US9407896B2 (en) * 2014-03-24 2016-08-02 Hong Kong Applied Science and Technology Research Institute Company, Limited Multi-view synthesis in real-time with fallback to 2D from 3D to reduce flicker in low or unstable stereo-matching image regions
JP6561512B2 (en) * 2014-03-27 2019-08-21 株式会社リコー Parallax value deriving device, moving body, robot, parallax value deriving method, parallax value producing method, and program
JP6589313B2 (en) * 2014-04-11 2019-10-16 株式会社リコー Parallax value deriving apparatus, device control system, moving body, robot, parallax value deriving method, and program
EP2960858B1 (en) * 2014-06-25 2018-08-08 Honda Research Institute Europe GmbH Sensor system for determining distance information based on stereoscopic images
CN104639933A (en) * 2015-01-07 2015-05-20 前海艾道隆科技(深圳)有限公司 Real-time acquisition method and real-time acquisition system for depth maps of three-dimensional views
KR20170047780A (en) * 2015-10-23 2017-05-08 한국전자통신연구원 Low-cost calculation apparatus using the adaptive window mask and method therefor
US10477178B2 (en) * 2016-06-30 2019-11-12 Massachusetts Institute Of Technology High-speed and tunable scene reconstruction systems and methods using stereo imagery
US10462445B2 (en) * 2016-07-19 2019-10-29 Fotonation Limited Systems and methods for estimating and refining depth maps

Also Published As

Publication number Publication date
US20190180461A1 (en) 2019-06-13
US10896519B2 (en) 2021-01-19
CN109314774A (en) 2019-02-05
CN109314774B (en) 2021-05-25
WO2018006296A1 (en) 2018-01-11

Similar Documents

Publication Publication Date Title
US9177481B2 (en) Semantics based safe landing area detection for an unmanned vehicle
US10482681B2 (en) Recognition-based object segmentation of a 3-dimensional image
EP2833322B1 (en) Stereo-motion method of three-dimensional (3-D) structure information extraction from a video for fusion with 3-D point cloud data
CN111222395B (en) Target detection method and device and electronic equipment
US8199977B2 (en) System and method for extraction of features from a 3-D point cloud
US10521694B2 (en) 3D building extraction apparatus, method and system
US20210192761A1 (en) Image depth estimation method and device, readable storage medium, and electronic apparatus
US20170004345A1 (en) Survey data processing device, survey data processing method, and survey data processing program
US20210125361A1 (en) Systems and methods for stereoscopic imaging
CN112883850B (en) Multi-view space remote sensing image matching method based on convolutional neural network
WO2015108996A1 (en) Object tracking using occluding contours
US9405959B2 (en) System and method for classification of objects from 3D reconstruction
US11887346B2 (en) Systems and methods for image feature extraction
Cheng et al. Building boundary extraction from high resolution imagery and lidar data
AliAkbarpour et al. Fast structure from motion for sequential and wide area motion imagery
US20200226392A1 (en) Computer vision-based thin object detection
US11842440B2 (en) Landmark location reconstruction in autonomous machine applications
JP2023530449A (en) Systems and methods for air and ground alignment
CN110197104B (en) Distance measurement method and device based on vehicle
CN112001954B (en) Underwater PCA-SIFT image matching method based on polar curve constraint
CN111898396A (en) Obstacle detection method and device
Xing et al. Barrier Detection and Tracking from Parameterized Lidar Data
Yaqoob et al. Performance evaluation of mobile stereonet for real time navigation in autonomous mobile robots
Wang et al. Stereo Matching for Binocular Underwater Images Using SURF Operator and Epipolar Restriction
Mohammed et al. Registration of time of flight terrestrial laser scanner data for stop-and-go mode

Legal Events

Date Code Title Description
AS Assignment

Owner name: SZ DJI TECHNOLOGY CO., LTD., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:YANG, KANG;GAO, MINGMING;REEL/FRAME:054849/0372

Effective date: 20181204

STPP Information on status: patent application and granting procedure in general

Free format text: APPLICATION DISPATCHED FROM PREEXAM, NOT YET DOCKETED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION