US20190251677A1 - Homography rectification - Google Patents

Info

Publication number
US20190251677A1
Authority
US
United States
Prior art keywords
image
dimension
transform
transposed
intermediate image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US16/395,153
Inventor
Nathaniel Bogan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Cognex Corp
Original Assignee
Cognex Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cognex Corp
Priority to US16/395,153
Assigned to COGNEX CORPORATION. Assignment of assignors interest (see document for details). Assignor: BOGAN, NATHANIEL
Publication of US20190251677A1
Current legal status: Abandoned

Classifications

    • G06T5/80
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00 Image enhancement or restoration
    • G06T5/006 Geometric correction
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00 Geometric image transformation in the plane of the image

Definitions

  • Disclosed herein are apparatus, systems, and methods that relate to rectifying a homographic projection of an image for image processing applications.
  • Digital images can be formed by many devices and used for many practical purposes.
  • Devices include cameras with image sensors operating on visible or infrared light, such as a charge-coupled device (CCD) image sensor or a complementary metal-oxide-semiconductor (CMOS) image sensor, line-scan sensors, flying spot scanners, electron microscopes, X-ray devices including computed tomography (CT) scanners, magnetic resonance imagers, and other devices known to those skilled in the art.
  • Practical applications are found in industrial automation, medical diagnosis, satellite imaging for a variety of military, civilian, and scientific purposes, photographic processing, surveillance and traffic monitoring, document processing, and many other applications.
  • The images formed by the various devices can be analyzed by machine vision systems to extract appropriate information.
  • One form of analysis that is of considerable practical importance is determining the position, orientation, and size of patterns in an image that correspond to objects in the field of view of the imaging device.
  • Pattern detection methods are of particular importance in industrial automation, where they are used to guide robots and other automation equipment in semiconductor manufacturing, electronics assembly, pharmaceuticals, food processing, consumer goods manufacturing, and many others.
  • Pattern detection methods can operate on images that have been subject to perspective distortions.
  • Perspective distortions tend to increase the computational complexity and the computing time of pattern detection methods. Therefore, as a preprocessing step, images can be rectified to remove perspective distortions prior to performing the pattern detection methods.
  • Apparatus, systems, and methods are provided for rectifying a homographic distortion of an image for image processing applications.
  • Some embodiments include a machine vision system.
  • the system can include a processor configured to run a computer program stored in memory.
  • the computer program can be configured to cause the processor to receive a distorted image and a homography matrix associated with the distorted image, determine, using a homography decomposition module, a first transform and a second transform that collectively represent the homography matrix, wherein the first transform relates the distorted image to an intermediate image, and the second transform relates the intermediate image to a rectified image, wherein the first transform is constrained to map pixels along a first dimension of the intermediate image to locations along a first dimension of the distorted image, and wherein the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image, generate, using a homography removal module, the intermediate image by removing the first transform from the distorted image, and generate, using the homography removal module, the rectified image by removing the second transform from the intermediate image.
  • the second transform is constrained to map pixels along a first dimension of the rectified image to locations along the first dimension of the intermediate image, and wherein the locations along the first dimension of the intermediate image are separated by a same distance from adjacent locations along the first dimension of the intermediate image.
  • the processor is further configured to transpose the intermediate image using the homography removal module prior to removing the second transform from the intermediate image.
  • the processor is further configured, using the homography removal module, to cause a direct memory access controller to transpose the intermediate image prior to removing the second transform from the intermediate image.
  • the direct memory access controller is configured to transpose the intermediate image using a two-dimensional direct memory access.
  • the processor is further configured, using the homography removal module, to remove the first transform from the distorted image by determining, for a first pixel of the intermediate image, a corresponding first location in the distorted image, and convolving pixels surrounding the corresponding first location in the distorted image to determine an image value for the first pixel.
  • the processor is further configured, using the homography removal module, to determine the corresponding first location by operating the first transform on a coordinate of the first pixel.
  • the processor is configured, using the homography removal module, to determine, for a second pixel of the intermediate image having an identical coordinate as the first pixel in a second dimension, a corresponding second location in the distorted image by adding a constant to a coordinate of the corresponding first location in the first dimension.
  • the processor is configured, using the homography removal module, to provide the rectified image to a pattern detection system so that the pattern detection system can use the rectified image to determine whether the rectified image includes a particular pattern.
  • the processor is configured to receive the homography matrix from the pattern detection system, wherein the homography matrix is determined by the pattern detection system based on a hypothesis of homography associated with the distorted image.
  • the processor is configured, using the homography removal module, to select one of a set of kernels based on a sub-pixel coordinate of one of the locations along the first dimension of the distorted image, and apply the selected kernel at the one of the locations.
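  • The kernel selection described above can be sketched as a small bank of precomputed interpolation kernels indexed by the sub-pixel part of the sampling location. The following is a minimal illustration only: the number of phases (NUM_PHASES), the two-tap linear kernels, and the function name sample_row are assumptions made for this sketch and are not taken from the disclosure.

```python
import math

NUM_PHASES = 8  # hypothetical number of sub-pixel phases in the kernel bank

# One 2-tap linear-interpolation kernel per sub-pixel phase (assumed kernel shape).
KERNELS = [(1.0 - p / NUM_PHASES, p / NUM_PHASES) for p in range(NUM_PHASES)]


def sample_row(row, x):
    """Sample a 1-D row at fractional position x using a phase-selected kernel."""
    x0 = math.floor(x)
    # Quantize the sub-pixel offset in [0, 1) to an index into the kernel bank.
    phase = int((x - x0) * NUM_PHASES)
    k0, k1 = KERNELS[phase]
    # Apply the selected kernel to the pixels surrounding the location.
    return k0 * row[x0] + k1 * row[x0 + 1]


row = [0.0, 8.0, 16.0, 24.0]
mid_value = sample_row(row, 1.5)       # phase 4 selects kernel (0.5, 0.5)
quarter_value = sample_row(row, 2.25)  # phase 2 selects kernel (0.75, 0.25)
```

  • Because the kernels are computed once, the per-pixel work reduces to a table lookup and a short multiply-accumulate. A practical filter bank would likely use wider kernels than the two-tap ones assumed here.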
  • Some embodiments include a method for operating the disclosed system or the disclosed non-transitory computer readable medium.
  • the method includes receiving, at a homography decomposition module, a homography matrix associated with a distorted image, determining, using the homography decomposition module, a first transform and a second transform that collectively represent the homography matrix, wherein the first transform relates the distorted image to an intermediate image, and the second transform relates the intermediate image to a rectified image, wherein the first transform is constrained to map pixels along a first dimension of the intermediate image to locations along a first dimension of the distorted image, and wherein the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image, generating, using a homography removal module coupled to the homography decomposition module, the intermediate image by removing the first transform from the distorted image, and generating, using the homography removal module, the rectified image by removing the second transform from the intermediate image.
  • the method includes transposing, using the homography removal module, the intermediate image prior to removing the second transform from the intermediate image.
  • the method includes causing, using the homography removal module, a direct memory access controller to transpose the intermediate image prior to removing the second transform from the intermediate image.
  • The method includes determining, for a first pixel of the intermediate image, a corresponding first location in the distorted image, and convolving pixels surrounding the corresponding first location in the distorted image to determine an image value for the first pixel.
  • the method includes determining, for a second pixel of the intermediate image having an identical coordinate as the first pixel in a second dimension, a corresponding second location in the distorted image by adding a constant to a coordinate of the corresponding first location in the first dimension.
  • the method includes providing the rectified image to a pattern detection system so that the pattern detection system can use the rectified image to determine whether the rectified image includes a particular pattern.
  • the method includes selecting one of a set of kernels based on a sub-pixel coordinate of one of the locations along the first dimension of the distorted image, and applying the selected kernel at the one of the locations.
  • Some embodiments include a non-transitory computer readable medium having executable instructions associated with a homography decomposition module and a homography removal module.
  • the modules are operable to cause a machine vision system to receive a homography matrix associated with a distorted image, determine a first transform and a second transform that collectively represent the homography matrix, wherein the first transform relates the distorted image to an intermediate image, and the second transform relates the intermediate image to a rectified image, wherein the first transform is constrained to map pixels along a first dimension of the intermediate image to locations along a first dimension of the distorted image, and wherein the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image, generate the intermediate image by removing the first transform from the distorted image, and generate the rectified image by removing the second transform from the intermediate image.
  • the disclosed apparatus, systems, and methods described herein can improve the efficiency of rectifying homographic distortions in a distorted image by mapping the distorted image to a rectified image using a multi-stage rectification process.
  • Each stage of the rectification process is designed to reduce the computation and to reduce the amount of time needed to access images from memory.
  • FIG. 1 shows a machine vision system for rectifying a homography of a distorted image in accordance with some embodiments.
  • FIG. 2 illustrates an application of a second transform B to distort a rectified image into an intermediate image, and an application of a first transform A to distort the intermediate image into the distorted image, in accordance with some embodiments.
  • FIG. 3 illustrates a two-stage image mapping process in accordance with some embodiments.
  • FIG. 4 illustrates an example of a distorted image that has been subject to perspective distortion in accordance with some embodiments.
  • FIG. 5 illustrates an example of an intermediate image generated by removing the first transform A from the distorted image of FIG. 4 in accordance with some embodiments.
  • FIG. 6 illustrates an example of a rectified image generated by removing the second transform B from the intermediate image of FIG. 5 in accordance with some embodiments.
  • An image can include a two-dimensional function whose values correspond to physical characteristics of an object, such as brightness (radiant energy, reflected or otherwise), color, temperature, height above a reference plane, etc., and measured by any image-forming device, or whose values correspond to simulated characteristics of an object, and generated by any data processing device.
  • An image can be regarded as a two-dimensional matrix, where each bin (e.g., a pixel) in the two-dimensional matrix is associated with a value or a set of values.
  • Each pixel can be identified using a two-dimensional coordinate (x, y).
  • A row of an image can be identified by a set of pixels having an identical y-coordinate; a column of an image can be identified by a set of pixels having an identical x-coordinate.
  • A distorted image can include an image that has been subject to distortions, for example, due to a perspective effect.
  • A distorted image can be disposed on an image plane, and it can represent a perspective view of an object plane. Therefore, parallel lines on an object plane may converge on the image plane, either within the distorted image or outside of the distorted image.
  • A rectified image can include an image that is recovered by removing distortions (e.g., a perspective effect) from the distorted image.
  • The rectified image can be disposed on an object plane, and it can represent an orthographic view of the object plane.
  • A vanishing line of an object plane can be the line at infinity.
  • Image processing can refer to an automated analysis of images to determine the characteristics of features shown in them. It is used, for example, in automated manufacturing lines, where images of parts are analyzed to determine placement and alignment prior to assembly. It is also used, for example, in quality assurance where images of packages are analyzed to ensure that product labels, lot numbers, and “freshness” dates are properly positioned and legible.
  • Image processing has non-industrial applications as well. In biotechnology research, for example, it can be used to identify constituents of microscopically imaged samples, or growth patterns in culture media. On the macroscopic scale, it can be used in astronomical research to find objects in time-lapse images. Meteorological, agricultural and defense applications of image processing include the detection and analysis of objects in satellite images.
  • Homography refers to a relationship between two images of the same planar surface in space.
  • Homography has an application in image rectification, which can refer to a process for mapping a distorted image to a rectified image on a common image plane on which a vanishing line becomes a line at infinity. This transformation process can be referred to as a homography rectification.
  • Homography rectification removes distortions introduced by perspective properties of an imaging system.
  • An image mapping technique can be used to remove the homography from the distorted image to recover the rectified image.
  • This removal operation can include one or more image processing operations, which may include one or more of resizing, translation, rotation, skewing, and/or other image transformation operations.
  • For example, a pixel at a coordinate (4, 10) in the rectified image may map to a location that is off the pixel grid by a sub-pixel amount, e.g., a coordinate (2.5, 5.33), in the distorted image. This may necessitate interpolation and/or filtering to determine appropriate image values at the determined location.
  • In this example, an appropriate intensity for the pixel at coordinate (4, 10) in the rectified image might be determined as a weighted average of the intensities for the source pixel locations (2, 5), (3, 5), (2, 6), and (3, 6).
  • The term "location" is used to refer to a coordinate in a plane, which may or may not be located on a pixel grid.
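  • The weighted average over the four pixels surrounding the location (2.5, 5.33) can be illustrated with bilinear interpolation, which is one common choice of weights. The sketch below is illustrative only: the function name bilinear_sample and the synthetic 8x8 test image are assumptions, and the disclosure does not state that bilinear weights are the ones used.

```python
import math


def bilinear_sample(image, x, y):
    """Return the bilinearly interpolated value of image at fractional (x, y).

    image is a list of rows, so the pixel at (x, y) is image[y][x].
    """
    x0, y0 = math.floor(x), math.floor(y)
    fx, fy = x - x0, y - y0  # sub-pixel offsets in [0, 1)
    # Interpolate along x on the two neighboring rows, then along y.
    top = (1 - fx) * image[y0][x0] + fx * image[y0][x0 + 1]
    bottom = (1 - fx) * image[y0 + 1][x0] + fx * image[y0 + 1][x0 + 1]
    return (1 - fy) * top + fy * bottom


# Hypothetical test image whose value at (x, y) is 10*y + x.
image = [[10 * y + x for x in range(8)] for y in range(8)]

# Uses the pixels at (2, 5), (3, 5), (2, 6), and (3, 6).
value = bilinear_sample(image, 2.5, 5.33)
```

  • Since the test image is linear in x and y, the interpolated value is approximately 10 * 5.33 + 2.5 = 55.8, which gives a quick sanity check on the weights.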
  • Such image mapping techniques are usually both computationally expensive and time consuming because the corresponding pixel(s) in the distorted image must be independently identified for each pixel in the rectified image.
  • Image mapping techniques are generally inefficient in their memory usage because they access pixels stored in memory in a manner that is not aligned to the memory structure (e.g., the way data is stored in memory). For example, when an image mapping technique uses one or more pixels in the distorted image to interpolate an image value for a pixel in the rectified image, the one or more pixels are fetched from memory.
  • Certain kinds of memory such as dynamic random access memory (DRAM), have a property that access to sequential memory locations (e.g., consecutive x coordinates in an image) is generally much faster than non-sequential access. Therefore, the order in which the one or more pixels are accessed from the memory or the order in which the one or more pixels are written to the memory can be critical for fast memory access operations.
  • Conventional image mapping techniques do not guarantee that the order in which the one or more pixels are fetched is favorable to the memory. Because the one or more pixels accessed from the memory may not correspond to sequential memory locations, retrieving pixels from the memory can be time-consuming.
  • Catmull shows a method that decomposes an image mapping process into multiple stages so that pixels on the distorted image—referred to as source pixels—corresponding to a pixel in the rectified image—referred to as a destination pixel—are located in sequential locations in memory.
  • Catmull's method still requires performing substantial computation, notably a division, for each destination pixel.
  • Catmull appears to have combined its method with specialized hardware to improve the computational speed, but its method is not amenable to efficient execution on a general-purpose processor.
  • The present disclosure provides a multi-stage image mapping mechanism for mapping a distorted image to a rectified image.
  • The multi-stage image mapping mechanism can remove homography from a distorted image to reconstruct a rectified image in two stages: (1) a first stage in which distortion is partially removed from a distorted image to generate an intermediate image, and (2) a second stage in which residual distortion is removed from the intermediate image to recover the rectified image.
  • The multi-stage mechanism can be advantageous because it can allow each stage to be efficient in (1) identifying correspondence between pixels in two images, (2) accessing pixels from memory, and (3) filtering pixels to determine image values.
  • The multi-stage image mapping mechanism decomposes a homography transformation H into two separate transformations: a first transformation A and a second transformation B. Therefore, H = A * B.
  • The second transformation B is designed to map a coordinate of an object plane (e.g., a plane associated with a rectified image) to a coordinate of an intermediate image plane (e.g., a plane associated with an intermediate image).
  • The first transformation A is designed to map a coordinate of the intermediate image plane to a coordinate of an image plane (e.g., a plane associated with a distorted image). Therefore, the first stage of the multi-stage image mapping mechanism can be thought of as removing the first transformation A from the distorted image to generate an intermediate image, and the second stage of the multi-stage image mapping mechanism can be thought of as removing the second transformation B from the intermediate image to generate a rectified image.
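  • Reading the mapping directions above as matrix products, with B applied first (object plane to intermediate plane) and A applied second (intermediate plane to image plane), the composite homography can be written as H = A * B. The sketch below checks this numerically for one point. The matrix entries are hypothetical values chosen for illustration, and the helper names matmul3 and apply_homography are assumptions, not names from the disclosure.

```python
def matmul3(p, q):
    """Multiply two 3x3 matrices given as nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def apply_homography(m, x, y):
    """Map (x, y) through the 3x3 matrix m using homogeneous coordinates."""
    xh = m[0][0] * x + m[0][1] * y + m[0][2]
    yh = m[1][0] * x + m[1][1] * y + m[1][2]
    w = m[2][0] * x + m[2][1] * y + m[2][2]
    return xh / w, yh / w


# Hypothetical transforms (illustrative values only).
B = [[1.0, 0.0, 0.0],
     [0.2, 1.1, 3.0],
     [0.001, 0.0, 1.0]]   # rectified image -> intermediate image
A = [[1.2, 0.1, 5.0],
     [0.0, 1.0, 0.0],
     [0.0, 0.002, 1.0]]   # intermediate image -> distorted image

H = matmul3(A, B)         # rectified image -> distorted image

intermediate = apply_homography(B, 4, 10)
two_stage = apply_homography(A, *intermediate)
direct = apply_homography(H, 4, 10)
```

  • Here two_stage and direct agree up to floating-point rounding, because the homogeneous scale factor introduced by B cancels when A is applied.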
  • A source image refers to an image that is provided as an input to one of the decomposed transformations of the homography (e.g., a first transform A or a second transform B).
  • A destination image refers to an image that is provided as an output from one of the decomposed transformations of the homography (e.g., a first transform A or a second transform B).
  • For the second transform B, the source image is the rectified image and the destination image is the intermediate image.
  • For the first transform A, the source image is the intermediate image and the destination image is the distorted image.
  • One or more of the decomposed transformations are designed to map all pixels of a source image having a particular y-coordinate to locations of a destination image having a fixed y-coordinate.
  • The first transform A is configured to map a row of a source image to a row of a destination image. This feature allows a majority of memory accesses for the source image and/or the destination image to be aligned with the memory structure, thereby reducing cache misses and reducing the access time for accessing the external memory.
  • One or more of the decomposed transformations are configured to map pixels in a particular row of a source image to locations that are equidistant from adjacent locations in a fixed row of a destination image. For example, pixels (1, y_0), (2, y_0), (3, y_0) in the source image (e.g., the intermediate image for the first transform A) can be mapped to locations (2, y_1), (4, y_1), (6, y_1) in the destination image (e.g., the distorted image for the first transform A).
  • The fixed row of a destination image may be offset from the pixel grid by a sub-pixel value. Therefore, the coordinate y_1 can be a fractional coordinate value (e.g., 3.5).
  • The image mapping technique need not independently compute locations in the destination image corresponding to pixels in a particular row of a source image. Instead, once the image mapping technique identifies a first location in the destination image that corresponds to a first pixel in a source image, the image mapping technique can use the first location as a reference point to determine (e.g., by simply adding the sampling constant to the x-coordinate of the first location) a second location corresponding to an adjacent pixel in the same row as the first pixel in the source image. This characteristic of the decomposed transformations reduces the amount of computation needed for identifying one or more locations in the destination image that correspond to pixels in a source image.
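  • The incremental computation described above can be sketched as follows. The sketch assumes a transform whose first column is (a, 0, 0), which is one family of 3x3 matrices that maps each source row to a fixed destination y-coordinate with equally spaced x-locations; this particular matrix form, the example values, and the function name map_row_incrementally are assumptions for illustration and are not stated in the text above.

```python
def map_row_incrementally(m, y, width):
    """Map source pixels (0..width-1, y) through m, one division per row.

    Assumes m[1][0] == 0 and m[2][0] == 0, so the projective denominator
    and the destination y-coordinate depend only on the source row y.
    """
    inv_w = 1.0 / (m[2][1] * y + m[2][2])          # single division for the row
    y_dst = (m[1][1] * y + m[1][2]) * inv_w        # fixed (possibly fractional) row
    x_first = (m[0][1] * y + m[0][2]) * inv_w      # location of source pixel x = 0
    step = m[0][0] * inv_w                         # sampling constant for this row
    xs = []
    x_dst = x_first
    for _ in range(width):
        xs.append(x_dst)
        x_dst += step                              # next location: add a constant
    return xs, y_dst, step


# Hypothetical row-preserving transform (illustrative values only).
A = [[1.2, 0.1, 5.0],
     [0.0, 1.0, 0.5],
     [0.0, 0.002, 1.0]]

xs, y_dst, step = map_row_incrementally(A, y=3, width=6)
```

  • Under this assumed form, the sampling constant varies from row to row (it is a / (h*y + i)), which is consistent with each row being sampled at its own rate, while the per-pixel cost within a row is a single addition.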
  • FIG. 1 shows a machine vision system 100 for rectifying a homography of a distorted image in accordance with some embodiments.
  • the system 100 can include a computing device 102 , which includes a processor 104 , memory 106 , a homography rectification (HR) module 108 , a direct memory access (DMA) controller 114 , and an interface 116 .
  • the system 100 can also include an image capture device 118 , which can generate an image of a scene.
  • the processor 104 can execute instructions and one or more memory devices 106 can store instructions and/or data.
  • the memory device 106 can be a non-transitory computer readable medium, such as a dynamic random access memory (DRAM), a static random access memory (SRAM), flash memory, a magnetic disk drive, an optical drive, a programmable read-only memory (PROM), a read-only memory (ROM), or any other memory or combination of memories.
  • the memory device 106 can be used to temporarily store data.
  • the memory device 106 can also be used for long-term data storage.
  • the processor 104 and the memory device 106 can be supplemented by and/or incorporated into special purpose logic circuitry.
  • the HR module 108 can include a homography decomposition module 110 and a homography removal module 112 .
  • the homography decomposition module 110 can be configured to receive a representation of homography to be removed from a distorted image, and determine decomposed transformations corresponding to the homography.
  • the decomposed transformations can include, for example, a first transform A to be removed from a distorted image to generate an intermediate image, and a second transform B to be removed from the intermediate image to generate a rectified image.
  • the decomposed transformations can include more than two transforms, e.g., three or four transforms, that may be sequentially removed from the distorted image to generate a rectified image.
  • the homography decomposition module 110 can constrain one or more of the decomposed transformations so that all pixels in a particular row of a source image are mapped to a fixed y-coordinate of a destination image.
  • the fixed y-coordinate may be an integer, in which case all pixels in a particular row of a source image are mapped to a fixed row of a destination image.
  • the homography decomposition module 110 can constrain one or more of the decomposed transformations so that pixels in a particular row of a source image are mapped to locations that are equidistant from each other in a fixed y-coordinate of a destination image.
  • The homography removal module 112 can be configured to remove the decomposed transformations from the distorted image to recover the rectified image. For example, the homography removal module 112 can remove the first transform A from the distorted image to generate an intermediate image, and remove the second transform B from the intermediate image to recover a rectified image. In some embodiments, the homography removal module 112 can down-sample a row of a destination image to remove a decomposed transformation from the destination image. In this case, the homography removal module 112 can down-sample each row of the destination image with a different sampling rate to account for the projective distortion of the decomposed transformation.
  • The homography removal module 112 can be configured to down-sample a destination image as a part of the decomposed transformation removal process. This can be done by appropriately scaling the decomposed transformations, e.g., by a down-sampling rate.
  • The direct memory access (DMA) controller 114 is configured to allow certain hardware devices to send or receive data directly to or from the main memory, thereby bypassing the processor 104 to speed up memory operations.
  • The DMA controller 114 can allow the homography decomposition module 110 to access the memory 106 without involving the processor 104 .
  • The DMA controller 114 can perform a two-dimensional DMA. The two-dimensional DMA is able to automatically skip over certain bytes in memory that do not correspond to a column of an image.
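  • The stride-based access pattern of a two-dimensional DMA can be modeled in software as a copy that reads elements a fixed stride apart and writes them contiguously. The sketch below is only a software model of that idea for a row-major image stored in a flat buffer; it does not use any real DMA API, and the function names strided_copy and transpose_flat are assumptions made for illustration.

```python
def strided_copy(src, src_start, src_stride, count, dst, dst_start):
    """Copy `count` elements spaced `src_stride` apart into consecutive dst slots."""
    for i in range(count):
        dst[dst_start + i] = src[src_start + i * src_stride]


def transpose_flat(src, width, height):
    """Transpose a row-major width x height image stored in a flat list."""
    dst = [0] * (width * height)
    for x in range(width):
        # Column x of the source (stride = width) becomes row x of the result,
        # skipping the elements in between that belong to other columns.
        strided_copy(src, x, width, height, dst, x * height)
    return dst


# Hypothetical image, 3 pixels wide and 2 pixels high, stored row by row.
src = [0, 1, 2,
       3, 4, 5]
transposed = transpose_flat(src, width=3, height=2)  # 2 wide, 3 high
```

  • After the transpose, what used to be a column is laid out in consecutive memory locations, so a subsequent pass along the original y-axis can read memory sequentially.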
  • the interface 116 can be implemented in hardware to send and receive signals in a variety of mediums, such as optical, copper, and/or wireless interfaces, and in a number of different protocols, some of which may be non-transient.
  • one or more of the modules 108 , 110 , 112 can be implemented in software.
  • the software can run on a processor 104 capable of executing computer instructions or computer code.
  • the processor 104 is implemented in hardware using an application specific integrated circuit (ASIC), programmable logic array (PLA), digital signal processor (DSP), field programmable gate array (FPGA), or any other integrated circuit.
  • Processors 104 suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, digital signal processors, and any one or more processors of any kind of digital computer.
  • the processor 104 receives instructions and data from a read-only memory or a random access memory or both.
  • the modules 108 , 110 , 112 can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them.
  • the implementation can be as a computer program product, e.g., a computer program tangibly embodied in a machine-readable storage device, for execution by, or to control the operation of, a data processing apparatus, e.g., a programmable processor, a computer, and/or multiple computers.
  • a computer program can be written in any form of computer or programming language, including source code, compiled code, interpreted code and/or machine code, and the computer program can be deployed in any form, including as a stand-alone program or as a subroutine, element, or other unit suitable for use in a computing environment.
  • a computer program can be deployed to be executed on one computer or on multiple computers at one or more sites.
  • two or more modules 108 , 110 , 112 can be implemented on the same integrated circuit, such as ASIC, PLA, DSP, or FPGA, thereby forming a system on chip.
  • Subroutines can refer to portions of the computer program and/or the processor/special circuitry that implement one or more functions.
  • the computing device 102 can be operatively coupled to an image capture device 118 .
  • the image capture device 118 can provide, via the interface 116 , one or more images to be rectified.
  • the image capture device 118 can include a charge-coupled device (CCD) image sensor or a complementary metal-oxide-semiconductor (CMOS) image sensor, line-scan sensors, flying spot scanners, electron microscopes, X-ray devices including computed tomography (CT) scanners, magnetic resonance imagers, and/or other devices known to those skilled in the art.
  • the computing device 102 can be operatively coupled to external equipment, for example factory automation or logistics equipment, or to a communications network, for example a factory automation or logistics network, in order to receive instructions and/or data from the equipment or network and/or to transfer instructions and/or data to the equipment or network.
  • Computer-readable storage devices suitable for embodying computer program instructions and data include all forms of volatile and non-volatile memory, including by way of example semiconductor memory devices, e.g., DRAM, SRAM, EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and optical disks, e.g., CD, DVD, HD-DVD, and Blu-ray disks.
  • the computing device 102 can include user equipment.
  • the user equipment can communicate with one or more radio access networks and with wired communication networks.
  • the user equipment can be a cellular phone.
  • the user equipment can also be a smart phone providing services such as word processing, web browsing, gaming, e-book capabilities, an operating system, and a full keyboard.
  • the user equipment can also be a tablet computer providing network access and most of the services provided by a smart phone.
  • the user equipment operates using an operating system such as Symbian OS, iPhone OS, RIM's Blackberry, Windows Mobile, Linux, HP WebOS, and Android.
  • the screen might be a touch screen that is used to input data to the mobile device, in which case the screen can be used instead of the full keyboard.
  • the user equipment can also keep global positioning coordinates, profile information, or other location information.
  • the computing device 102 can include a server.
  • the server can operate using operating system (OS) software.
  • the OS software is based on a Linux software kernel and runs specific applications in the server such as monitoring tasks and providing protocol stacks.
  • the OS software allows server resources to be allocated separately for control and data paths. For example, certain packet accelerator cards and packet services cards are dedicated to performing routing or security control functions, while other packet accelerator cards/packet services cards are dedicated to processing user session traffic. As network requirements change, hardware resources can be dynamically deployed to meet the requirements in some embodiments.
  • the homography removal module 112 can be configured to rectify a distorted image to recover a rectified image using a two-stage image mapping process.
  • in the first stage of the two-stage image mapping process, the homography removal module 112 can remove a first transform A from the distorted image to generate an intermediate image by processing pixels along the x-axis of the distorted image, and in the second stage of the two-stage image mapping process, the homography removal module 112 can remove a second transform B by processing pixels along the y-axis of the intermediate image.
  • one or more decomposed transformations of homography can be designed so that the homography removal module 112 accesses pixels preferentially along an x-axis of a source image and along an x-axis of a destination image.
  • memory is generally aligned with the x-axis of an image.
  • memory generally stores an image by (1) storing all pixels having a first y-coordinate while preserving the pixel order along the x-axis, and then (2) storing all pixels having a second y-coordinate (oftentimes adjacent to the first y-coordinate) while preserving the pixel order along the x-axis, until all pixels in the image are stored.
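  • The row-by-row storage order described above can be sketched as follows. This is a minimal illustration only; the helper name flat_index and the 4×3 sample image are hypothetical and do not come from the disclosure.

```python
def flat_index(x: int, y: int, width: int) -> int:
    """Offset of pixel (x, y) in a row-major buffer of the given width."""
    return y * width + x

width, height = 4, 3
# Store the image row by row: all pixels with y = 0 first, then y = 1, and so on.
buffer = [(x, y) for y in range(height) for x in range(width)]

# Neighbours along x are adjacent in memory; neighbours along y are one full row apart.
step_x = flat_index(2, 1, width) - flat_index(1, 1, width)
step_y = flat_index(1, 2, width) - flat_index(1, 1, width)
```

  • Because stepping along x moves one element at a time while stepping along y jumps by a whole row, processing that walks along the x-axis touches memory sequentially.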
  • the homography removal module 112 can use direct memory access, e.g., using the direct memory access controller, to reduce cache misses and memory access time.
  • the homography removal module 112 can be configured to transpose an intermediate image generated by the first stage of the two-stage image mapping process, prior to initiating the second stage of the two-stage image mapping process.
  • the homography removal module 112 can still access pixels preferentially along an x-axis of the transposed intermediate image (which is in effect the y-axis of the intermediate image) to align the memory access to the memory structure during the second stage of the two-stage image mapping process.
  • the homography decomposition module 110 is configured to decompose the homography into a first transform and a second transform that collectively represent the homography.
  • the homography decomposition module 110 can use a predetermined relationship between the homography, the first transform, and the second transform to determine the decomposition. The following describes this predetermined relationship between the homography, the first transform, and the second transform.
  • Homography is typically represented using a 3×3 homography matrix H in homogeneous coordinates (x_h, y_h, z_h), where the subscript h indicates that the variable is associated with a homogeneous coordinate.
  • (x, y) is a coordinate system of the object plane (e.g., a plane associated with a rectified image) and (x′, y′) is a coordinate system of an image plane (e.g., a plane associated with a distorted image).
  • the homography decomposition module 110 can choose an origin point on the image plane, and define that location to be the origin of the object plane as well. This way, the homography matrix H can be simplified to remove translation:
  • the homography matrix H that defines the homography between an object plane and an image plane has six parameters and can be written as:
  • This representation of the homography matrix H allows for rectifying an arbitrary amount of rotation.
  • the homography matrix H can be further constrained to improve the quality of the rectified image.
  • the homography decomposition module 110 can constrain the rotation element of the homography matrix H.
  • the constraint on the rotation element of the homography matrix H determines (e.g., fixes) the amount of rotation associated with the homography matrix H.
  • the homography matrix H that defines the homography between an object plane and an image plane has five parameters and can be written as:
  • the actual rotation associated with the deformed image can deviate substantially from the amount of rotation associated with the homography matrix H. This may result in undesirable performance degradation.
  • the homography decomposition module 110 can instruct the homography removal module 112 to locally down-sample during stage 1 of the multi-stage rectification process and then locally up-sample during stage 2 of the multi-stage rectification process to reduce performance degradation.
  • the homography matrix H can be further constrained to include only two degrees of freedom with regards to the tilt.
  • the two degrees of freedom with regard to the tilt can be parameterized using: (1) a tilt direction τ (also known as azimuth) and (2) a tilt amount σ (also known as declination or just tilt).
  • $$\begin{bmatrix} x'_h \\ y'_h \\ z'_h \end{bmatrix} = \begin{bmatrix} (\cos\tau)^2 + (\sin\tau)^2\cos\sigma & \sin\tau\cos\tau\,(1-\cos\sigma) & 0 \\ \sin\tau\cos\tau\,(1-\cos\sigma) & (\sin\tau)^2 + (\cos\tau)^2\cos\sigma & 0 \\ -\dfrac{\sin\tau\,\sin\sigma}{f} & \dfrac{\cos\tau\,\sin\sigma}{f} & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix}$$
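  • The tilt parameterization above can be evaluated numerically as in the following sketch. The function names tilt_homography and project, and the reading of f as a positive scale that divides the bottom-row terms, are assumptions made for illustration; the sketch simply fills in the matrix entries as written above and is not presented as the disclosed implementation.

```python
import math

def tilt_homography(direction, amount, f):
    """Build the 3x3 matrix of the tilt parameterization (angles in radians)."""
    cd, sd = math.cos(direction), math.sin(direction)
    ca, sa = math.cos(amount), math.sin(amount)
    off_diagonal = sd * cd * (1.0 - ca)  # shared by the two off-diagonal entries
    return [
        [cd * cd + sd * sd * ca, off_diagonal, 0.0],
        [off_diagonal, sd * sd + cd * cd * ca, 0.0],
        [-sd * sa / f, cd * sa / f, 1.0],
    ]

def project(H, x, y):
    """Map an object-plane point (x, y) to the image plane via homogeneous division."""
    xh = H[0][0] * x + H[0][1] * y + H[0][2]
    yh = H[1][0] * x + H[1][1] * y + H[1][2]
    zh = H[2][0] * x + H[2][1] * y + H[2][2]
    return xh / zh, yh / zh

H_example = tilt_homography(direction=0.3, amount=0.2, f=1000.0)
```

  • In this form the two off-diagonal entries of the upper-left block are equal, and there is no translation term, so the origin maps to itself.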
  • the homography decomposition module 110 can constrain the rotation component of the homography matrix H through another set of parameterization.
  • This set of parameterization includes three angle parameters that specify (sequentially): (1) a rotation around the z axis, (2) a rotation around the y axis and (3) a rotation around the x axis. This is often called a “roll, pitch, and yaw” representation. There are at least two different ways to parameterize the “roll, pitch, and yaw” representation.
  • in a first example, the pitch and yaw represent rotations around the “new” axes after the previous rotations; in a second example, each of the roll, pitch, and yaw represents a rotation around a non-moving “world” axis.
  • the remainder of this disclosure uses the second example of the “roll, pitch, and yaw” representation, in which the rotation around the world z axis is denoted γ, the rotation around the world y axis is denoted β, and the rotation around the world x axis is denoted α.
  • the homography matrix H can take the following form:
  • the homography decomposition module 110 can constrain the rotation around the z-axis (γ) in the homography matrix H.
  • the homography matrix H can be simplified to the following form:
  • This parameterization has the favorable characteristic that both h_01 and h_10 have a small magnitude.
  • the goal of homography rectification is to remove this homography H from the distorted image on the image plane (x′_h, y′_h, z′_h).
  • the homography decomposition module 110 is configured to decompose the homography matrix H into a first transform A and a second transform B such that $H = A\,B$.
  • the first transform A relates the distorted image, on an image plane represented by the coordinate system (x′, y′), to an intermediate image, on an intermediate image plane represented by the coordinate system (x_I, y_I), and the second transform B relates the intermediate image, on the intermediate image plane represented by the coordinate system (x_I, y_I), to a rectified image, on an object plane represented by the coordinate system (x, y).
  • FIG. 2 illustrates an application of the second transform B to distort a rectified image (e.g., an orthographic view of the object plane, which is the desired outcome of the two-stage rectification process) into an intermediate image, and an application of the first transform A to distort the intermediate image into the distorted image (e.g., input image to the two-stage rectification process).
  • the homography H can be removed from the distorted image to recover the rectified image by (1) removing the first transform A from the distorted image to generate an intermediate image and (2) removing the second transform B from the intermediate image to generate the rectified image.
  • the second transform B can be represented as a concatenation of three operations: a transpose operation T that flips the x and y coordinates while leaving the z coordinate intact, the transposed second transform B_yx, and another transpose operation T: $B = T\,B_{yx}\,T$.
  • the transpose operation T can be represented as follows: $$T = \begin{bmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix}$$
  • the homography matrix H can be represented as follows:
  • $$B_{yx} = \begin{bmatrix} b_{11} & b_{10} & 0 \\ b_{01} & b_{00} & 0 \\ b_{21} & b_{20} & 1 \end{bmatrix}.$$
  • the subscript “yx” indicates that the transposed second transform B_yx operates on flipped x-y coordinates. Note that the transposed second transform B_yx is not merely the matrix transpose of the second transform B.
  • the transposed second transform B_yx can be derived from the second transform B by rearranging the terms of B, given that the x and y coordinates are swapped while the z coordinate remains the same.
  • the first transform A and the second transform B can be constrained to have properties discussed above: that (1) pixels in a particular row of a source image are mapped to equi-distance pixels (e.g., each pair of adjacent pixels are separated by an equal distance) in a fixed row of a destination image, hereinafter referred to as a first constraint, and that (2) a row of a source image is mapped to a row of a destination image, hereinafter referred to as a second constraint.
  • the first constraint can be introduced into the first transform A by constraining
  • the x′ can be represented directly using the parameters of the first transform A:
  • the multiplier a_20 can be set to zero.
  • the second constraint can be introduced into the first transform A by introducing a constraint that the y-coordinate of the distorted image must depend only on the y-coordinate of the intermediate image.
  • the denominator z′_h is already known to depend only on y_I. Therefore, if the numerator y′_h depends only on y_I, then y′ (i.e., the y-coordinate of the distorted image) depends only on the y-coordinate of the intermediate image, thereby mapping a row of the intermediate image to a row of the distorted image.
  • the y-coordinate of the distorted image can be expressed as follows:
  • the first constraint and the second constraint require that the first transform A takes the following form:
  • $$A = \begin{bmatrix} a_{00} & a_{01} & 0 \\ 0 & a_{11} & 0 \\ 0 & a_{21} & 1 \end{bmatrix}$$
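  • Writing out the projection for a transform of this form shows why both constraints hold. The following is a worked restatement using the coefficient names above, with an intermediate-image pixel taken as the homogeneous point (x_I, y_I, 1); it is offered as an illustration rather than as text from the original derivation.

```latex
x' = \frac{x'_h}{z'_h} = \frac{a_{00}\,x_I + a_{01}\,y_I}{a_{21}\,y_I + 1},
\qquad
y' = \frac{y'_h}{z'_h} = \frac{a_{11}\,y_I}{a_{21}\,y_I + 1}
```

  • For a fixed y_I, the value of y′ is constant (the second constraint), and x′ advances by the constant step a_00 / (a_21 y_I + 1) for each unit step in x_I (the first constraint).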
  • the first constraint and the second constraint should be imposed on the second transform B as well. Since the transposed second transform
  • the transposed second transform B_yx can be related back to the second transform B as follows:
  • the homography matrix H can be represented as follows:
  • homography matrix H can also be represented as follows:
  • the decomposition of the homography matrix H can be used to analytically solve for a relationship between the coefficients of H, the coefficients of A, and the coefficients of B. Based on this analytic calculation, it can be shown that the first transform A and the transposed second transform B_yx are related to the coefficients of H as follows:
  • the homography decomposition module 110 can generate the first transform A and the transposed second transform B_yx using the above relationship.
  • the first transform A and/or the second transform B can include a scaling factor that effectively down-samples the destination image when recovering the source image.
  • the following decomposition adheres to the first constraint and the second constraint, as described above, while allowing for down-sampling by a down-sampling rate in the first transform A:
  • homography matrix H can be represented as follows:
  • the sampling factors s_Ax and/or s_Ay may be less than 1 to effectively oversample the distorted image in the first stage. This way, the information in the distorted image can be preserved for processing in the second stage.
  • the homography removal module 112 can efficiently remove the first transform A and the transposed second transform B_yx from the distorted image.
  • the job of the homography removal module 112 in each stage of the two-stage rectification process is to remove a limited homography (e.g., a first transform A or a transposed second transform B_yx) with only four non-trivial coefficients:
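  • $$A = \begin{bmatrix} s_{Ax} & s_{Ay}\,h_{01} & 0 \\ 0 & s_{Ay}\,h_{11} & 0 \\ 0 & s_{Ay}\,h_{21} & 1 \end{bmatrix}$$
  • $$B_{yx} = \begin{bmatrix} \dfrac{1}{s_{Ay}} & \dfrac{1}{s_{Ay}}\left(\dfrac{h_{01}}{h_{11}}\right) & 0 \\ 0 & \dfrac{1}{s_{Ax}}\left(h_{00} - \dfrac{h_{01}^{2}}{h_{11}}\right) & 0 \\ 0 & h_{20} - \dfrac{h_{21}\,h_{01}}{h_{11}} & 1 \end{bmatrix}$$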
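  • The coefficient relationships above can be checked numerically. The sketch below assumes that the homography is recomposed as H = A·T·B_yx·T, that the bottom-right entry of B_yx is 1, and that H has equal off-diagonal terms (h_10 = h_01), which is what the h_01 terms in B_yx imply. The helper names matmul and decompose and the sample values are hypothetical.

```python
def matmul(P, Q):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

# Transpose operation: swaps the x and y coordinates, leaves z intact.
T = [[0, 1, 0], [1, 0, 0], [0, 0, 1]]

def decompose(H, s_ax=1.0, s_ay=1.0):
    """Return (A, B_yx) built from the coefficients of H and the sampling factors."""
    h00, h01, h11 = H[0][0], H[0][1], H[1][1]
    h20, h21 = H[2][0], H[2][1]
    A = [[s_ax, s_ay * h01, 0.0],
         [0.0, s_ay * h11, 0.0],
         [0.0, s_ay * h21, 1.0]]
    B_yx = [[1.0 / s_ay, h01 / (s_ay * h11), 0.0],
            [0.0, (h00 - h01 * h01 / h11) / s_ax, 0.0],
            [0.0, h20 - h21 * h01 / h11, 1.0]]
    return A, B_yx

# Sample homography with h_10 == h_01 and no translation (illustrative values).
H = [[0.98, 0.03, 0.0],
     [0.03, 0.95, 0.0],
     [1e-4, -2e-4, 1.0]]
A, B_yx = decompose(H, s_ax=2.0, s_ay=0.5)
B = matmul(matmul(T, B_yx), T)   # undo the coordinate swap around B_yx
H_rebuilt = matmul(A, B)
```

  • Under these assumptions, H_rebuilt agrees with H to floating-point precision for any non-zero sampling factors, since the factors cancel between A and B_yx.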
  • the homography removal module 112 can, in effect, perform the same procedure twice, once for the first transform A and once for the transposed second transform B_yx. Therefore, the first stage and the second stage of the two-stage rectification process can comprise substantially identical operations.
  • FIG. 3 illustrates a two-stage image mapping process in accordance with some embodiments.
  • the two-stage image mapping process uses the decomposition of the homography, as determined by the homography decomposition module 110 , to perform rectification in two stages.
  • the homography removal module 112 can process pixels of the distorted image along the x-axis
  • the homography removal module 112 can process pixels of the intermediate image along the y-axis. It is worth noting that, thanks to the transpose operation that occurs between the first stage and the second stage, the first stage of the multi-stage rectification process is substantially similar to the second stage of the multi-stage rectification process.
  • the homography removal module 112 can be configured to receive the distorted image, and the homography represented as a concatenation of a first transform A and a second transform B (or a transposed second transform).
  • the homography removal module 112 can receive the homography representation from the homography decomposition module 110 .
  • FIG. 4 illustrates an example of a distorted image that has been subject to perspective distortion. The perspective distortion manifests itself by transforming two vertical, parallel lines 402 , 404 into two non-parallel lines and two horizontal, parallel lines 406 , 408 into two non-parallel lines.
  • the homography removal module 112 is configured to remove the first transform from the distorted image by processing the distorted image along the x-axis, thereby generating the intermediate image.
  • the homography removal module 112 can optionally down-sample the intermediate image.
  • step 302 and step 302a can be performed simultaneously.
  • the first transform A can incorporate the down-sampling factor, as discussed above, so that by removing the first transform A from the distorted image, the homography removal module 112 can in effect also perform the down-sampling operation.
  • the homography removal module 112 can be configured to determine, for each pixel of an intermediate image, a corresponding location in the distorted image. For example, the homography removal module 112 can access one of the pixels in a row of an intermediate image, and determine which location within the distorted image corresponds to the one of the pixels in the intermediate image using the first transform A. In some cases, this location within the distorted image can have a sub-pixel coordinate value. For example, the location within the distorted image can have a coordinate of (2.24, 3.1).
  • the homography removal module 112 knows in advance that a pixel in a row of the intermediate image always corresponds to a location in a predetermined y-coordinate of the distorted image. Therefore, the pattern in which the homography removal module 112 should access the distorted image in memory 106 is predictable and deterministic.
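  • The predictable access pattern can be sketched as follows, assuming the first transform A has the constrained form with coefficients a_00, a_01, a_11, and a_21 and maps an intermediate pixel (x_I, y_I, 1) to the distorted image. The function name row_mapping and the sample coefficients are hypothetical.

```python
def row_mapping(A, y_i):
    """For intermediate row y_i, return (y', x' at x_i = 0, x' step per pixel)."""
    z = A[2][1] * y_i + 1.0          # denominator depends only on y_i
    y_src = A[1][1] * y_i / z        # one fixed source row for the whole row
    x_start = A[0][1] * y_i / z      # source x for the first pixel of the row
    x_step = A[0][0] / z             # constant spacing between source samples
    return y_src, x_start, x_step

A = [[1.0, 0.05, 0.0],
     [0.0, 0.9, 0.0],
     [0.0, 0.001, 1.0]]
y_src, x_start, x_step = row_mapping(A, y_i=10)

# Source locations for the first four pixels of intermediate row 10.
locations = [(x_start + x_i * x_step, y_src) for x_i in range(4)]
```

  • Only three numbers per row are needed to enumerate every source location, which is why the rows to fetch can be scheduled ahead of time.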
  • the homography removal module 112 can use a standard ping-pong two-buffer system and the DMA controller 114 to read and store a row or groups of rows of an image in parallel.
  • the homography removal module 112 determines the correspondence between a pixel in an intermediate image and a location in a distorted image.
  • the homography removal module 112 can determine the image value for that pixel in the intermediate image.
  • the homography removal module 112 can use the image value at the corresponding location in the distorted image.
  • the homography removal module 112 can filter the image values surrounding the corresponding location in the distorted image. In some cases, the homography removal module 112 may simply copy the image value of the corresponding location in the distorted image. When the corresponding location does not fall on the grid of the distorted image, the homography removal module 112 can determine the image value of the pixel in the intermediate image by interpolating image values of pixels surrounding the corresponding location in the distorted image.
  • the homography removal module 112 can convolve pixels surrounding the corresponding location in the distorted image with a kernel to interpolate image values of pixels surrounding the corresponding location in the distorted image.
  • the kernel can also be designed to perform appropriate smoothing (e.g., to satisfy the Nyquist sampling constraint) along a row of the distorted image.
  • the kernel can be a truncated sinc function; in other embodiments, the kernel can be a truncated Gaussian function.
  • the homography removal module 112 can predetermine a set of kernels to be used for pre-selected sub-pixel variations from the grid locations. When the corresponding location in the distorted image is off from the grid in the x-axis by 0.1 pixel, then the homography removal module 112 can use a first one of the predetermined set of kernels; when the corresponding location in the distorted image is off from the grid in the x-axis by 0.3 pixel, then the homography removal module 112 can use a second one of the predetermined set of kernels, etc.
  • each kernel in the set of kernels has the same functional form.
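  • One way to predetermine such a set from a single functional form is sketched below using a truncated Gaussian. The function name gaussian_kernel, the four-tap length, the width sigma = 0.8, and the choice of eight phases are assumptions for illustration; the sketch is not claimed to reproduce any particular kernel values.

```python
import math

def gaussian_kernel(phase, sigma=0.8, taps=4):
    """Truncated Gaussian sampled at the tap positions for one sub-pixel phase."""
    # Tap 1 sits on the grid pixel at or just left of the sample point,
    # so each tap's distance from the sample point is (k - 1 - phase).
    offsets = [k - 1 - phase for k in range(taps)]
    weights = [math.exp(-0.5 * (d / sigma) ** 2) for d in offsets]
    total = sum(weights)
    return [w / total for w in weights]   # normalize so the taps sum to 1

# One kernel per pre-selected phase: 0, 1/8, 2/8, ..., 7/8.
kernel_set = [gaussian_kernel(p / 8) for p in range(8)]
```

  • Because every kernel comes from the same function, kernels for mirrored phases (for example 1/8 and 7/8) are mirror images of each other, and the kernel for phase 1/2 is symmetric.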
  • the set of kernels can include the following:
  • K1: 36, 56, 36, 0
  • K2: 30, 56, 38, 4
  • K3: 25, 54, 41, 8
  • K4: 21, 50, 45, 12
  • K5: 16, 48, 48, 16
  • K6: 12, 45, 50, 21
  • K7: 8, 41, 54, 25
  • K8: 4, 38, 56, 30
  • These kernels K1-K8 all have the same functional form, but each kernel is associated with a different sub-pixel coordinate (also referred to as a sub-pixel phase).
  • when the location falls on the pixel grid (a sub-pixel phase of 0), the homography removal module 112 can use K1 to filter pixels around the location. Note that K1 is symmetric around the center.
  • when the location has a sub-pixel phase of 1/8, the homography removal module 112 can use K2 to filter pixels around the location. For example, the homography removal module 112 can directly multiply K2 with a sequence of four image pixels in order to simultaneously perform sub-pixel interpolation and filtering.
  • when the location has a sub-pixel phase of 1/2, the homography removal module 112 can use K5 to filter pixels around the location. Because a sub-pixel phase of 1/2 refers to identifying an image value at the exact boundary of two adjacent pixels, K5 is symmetric. Note that K2 and K8, which correspond to sub-pixel phases of 1/8 and 7/8, respectively, have the same values but are mirrors of each other. This makes sense because 1/8 is the same as −7/8 and −1/8 is the same as 7/8. The same is true for the pairs “1/4, 3/4” and “3/8, 5/8”.
  • when the sub-pixel phase of the location does not match any kernel exactly, the homography removal module 112 can use the kernel associated with the sub-pixel phase that is closest to that of the location. For example, when the sub-pixel phase of the location is 1/9, the homography removal module 112 can use K2 to filter that location, since 1/9 is closer to 1/8 than to any other sub-pixel phase associated with the kernels.
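  • The nearest-phase selection described above can be sketched as follows. The kernel values are those listed earlier; the function names, the mapping of K1 through K8 to phases 0 through 7/8, the alignment of the four taps to pixels base−1 through base+2, and the division by 128 (each listed kernel sums to 128) are assumptions made for illustration.

```python
import math

# Index i is taken to correspond to a sub-pixel phase of i/8.
KERNELS = [
    [36, 56, 36, 0],   # K1, phase 0
    [30, 56, 38, 4],   # K2, phase 1/8
    [25, 54, 41, 8],   # K3, phase 1/4
    [21, 50, 45, 12],  # K4, phase 3/8
    [16, 48, 48, 16],  # K5, phase 1/2
    [12, 45, 50, 21],  # K6, phase 5/8
    [8, 41, 54, 25],   # K7, phase 3/4
    [4, 38, 56, 30],   # K8, phase 7/8
]

def nearest_phase(x):
    """Return (base pixel, kernel index) for the phase closest to x's fraction."""
    base = math.floor(x)
    index = round((x - base) * 8)
    if index == 8:                 # closest phase is 0 of the next pixel
        base, index = base + 1, 0
    return base, index

def sample_row(row, x):
    """Filter and interpolate `row` at sub-pixel position x in one dot product.

    The caller must keep 1 <= base and base + 2 < len(row).
    """
    base, index = nearest_phase(x)
    taps = row[base - 1:base + 3]  # the four pixels covered by the kernel
    return sum(k * p for k, p in zip(KERNELS[index], taps)) / 128

ramp = [10 * i for i in range(8)]
value_at_half = sample_row(ramp, 2.5)
```

  • No kernel is computed at sampling time; the only per-pixel work is the table lookup and a four-element dot product, which is the operation a SIMD dot-product instruction performs.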
  • the homography removal module 112 can perform the convolution operation using a single-instruction-multiple-data (SIMD) processing unit.
  • the convolution operation can be easily performed using a SIMD multiply and/or a SIMD dot-product instruction operable by a SIMD processing unit.
  • the convolution operation can involve a two-dimensional kernel.
  • the homography removal module 112 can perform the convolution in two separate steps: (1) convolve along the x-axis (e.g., a row of an image), and (2) convolve along the y-axis (e.g., a column of an image).
  • the homography removal module 112 can apply a single kernel function to all corresponding locations within a fixed y-coordinate of the distorted image. Because the sub-pixel phase of locations within the distorted image may differ from one another, the homography removal module 112 can select, based on the sub-pixel phase of each location, one of the kernels in a set corresponding to that single kernel function. This can avoid the homography removal module 112 needing to generate the kernel during convolution along the fixed row of the distorted image.
  • the homography removal module 112 can use the same kernel function for different rows; in other embodiments, the homography removal module 112 can use different kernel functions for different rows.
  • the result of the convolution operation along the x-axis (also referred to as an x-convolution result) can be stored in a temporary buffer.
  • the homography removal module 112 can perform the convolution along the y-axis by convolving a y-directional kernel with the x-convolution result stored in the temporary buffer.
  • the temporary buffer can maintain a predetermined number of rows associated with the x-convolution result. This predetermined number of rows can be determined based on the size of the y-directional kernel. For example, when a y-directional kernel has a length of 5 pixels, the temporary buffer can maintain 5 rows associated with the x-convolution result.
  • the temporary buffer can be a rolling buffer (e.g., a circular buffer) that stores a row of the x-convolution result until it is no longer needed to compute the y-convolution.
  • the homography removal module 112 can store, in the temporary buffer, five rows of the x-convolution result, identified by y-coordinates: j+2, j+1, j, j−1, j−2.
  • the homography removal module 112 can increase the target y-coordinate by one pixel and compute the y-convolution of all pixels having the y-coordinate of j+1.
  • the homography removal module 112 can use five rows of the x-convolution result, identified by y-coordinates: j+3, j+2, j+1, j, j−1. Therefore, the temporary buffer no longer needs the row of the x-convolution result having the y-coordinate of j−2. In this case, the temporary buffer removes the row of the x-convolution result having the y-coordinate of j−2 and replaces it with the row of the x-convolution result having the y-coordinate of j+3.
  • the y-directional kernel can be identical for each pixel along the row of the output image. Therefore, this y-directional convolution can be implemented by multiplying every pixel of the x-convolved row by the kernel coefficient corresponding to that row, and then summing along columns. This operation can be efficiently implemented using a SIMD processing unit, for example, a SIMD multiply and/or SIMD add instructions.
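  • The rolling buffer and the row-wise y-directional convolution can be sketched together as follows. The generator name y_convolve and the integer kernel are hypothetical, and a plain Python loop stands in for the SIMD multiply and add instructions.

```python
from collections import deque

def y_convolve(x_rows, kernel):
    """Yield y-convolved rows from an iterable of x-convolved rows."""
    # Rolling buffer: holds exactly len(kernel) rows; appending a new row
    # automatically discards the oldest one, which is no longer needed.
    window = deque(maxlen=len(kernel))
    for row in x_rows:
        window.append(row)
        if len(window) == len(kernel):
            # Scale each buffered row by its kernel coefficient, then sum
            # along columns. The kernel here is symmetric, so the order in
            # which coefficients are paired with rows does not matter.
            yield [sum(k * r[c] for k, r in zip(kernel, window))
                   for c in range(len(row))]

# Seven x-convolved rows of two pixels each (illustrative values).
x_rows = [[y, 2 * y] for y in range(7)]
result = list(y_convolve(x_rows, kernel=[1, 2, 4, 2, 1]))
```

  • With a five-tap kernel and seven input rows, three output rows are produced, and at no point are more than five x-convolved rows held in the buffer.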
  • each row has its own kernel function that can be computed, for example, by (1) mapping a symmetric kernel function in the object plane to a corresponding not-necessarily-symmetric kernel function in the image plane, and (2) integrating it into pixel bins.
  • such a kernel function inherently allows correct sampling at non-integer positions in the y-axis, and also allows the sampling distance between any two rows to be different without loss of performance.
  • all the kernel functions used for convolution along the y-axis can be precomputed during a training phase.
  • FIG. 5 illustrates an example of an intermediate image generated by removing the first transform A from the distorted image of FIG. 4 in accordance with some embodiments. Note that the two vertical, parallel lines 402 , 404 are now parallel in the intermediate image. However, the first stage has not removed the distortion of the two horizontal parallel lines 406 and 408 .
  • the intermediate image can have a skewed trapezoid shape (e.g., two sides are parallel; the other two sides may not be).
  • the relevant start points and run lengths in each row in the distorted image, x-convolved row, and each row in the intermediate image can optionally be precomputed in a training step.
  • the homography removal module 112 can move onto the second stage of the two-stage image mapping process. However, as discussed above, prior to initiating the second stage, in step 304 , the homography removal module 112 is configured to transpose the intermediate image.
  • the homography removal module 112 can perform the transpose operation using a “vertical” buffer.
  • output values of the y-directional convolution (e.g., the intermediate image) can be stored in a buffer held in near-to-chip memory (e.g., user-controlled Level-2 cache).
  • These output values are written to the buffer in a column order, instead of a row order, and hence the name vertical buffer.
  • the homography removal module 112 need not store or access any external memory devices, which improves the memory access time. In effect, this embodiment performs the transpose operation in a local manner.
  • the homography removal module 112 can alternate writing to one of two vertical buffers in a “ping-pong” fashion so that the previous column, written to a first vertical buffer, can be written out to external memory while the current column is being written to a second vertical buffer.
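  • The column-order writes and the alternation between two vertical buffers can be sketched as follows. The function name, the buffer width, and the use of a list of lists to stand in for external memory are assumptions; the sketch runs sequentially, whereas the described system overlaps the write-out of one buffer with the filling of the other.

```python
def transpose_with_vertical_buffers(rows, buffer_width):
    """Transpose `rows` using two alternating fixed-width vertical buffers."""
    row_len = len(rows[0])
    # Each vertical buffer is row_len lines tall and buffer_width entries wide.
    buffers = [[[None] * buffer_width for _ in range(row_len)] for _ in range(2)]
    external = [[] for _ in range(row_len)]   # stands in for external memory
    active = 0
    for start in range(0, len(rows), buffer_width):
        chunk = rows[start:start + buffer_width]
        buf = buffers[active]
        for col, row in enumerate(chunk):
            for line, value in enumerate(row):
                buf[line][col] = value        # each output row fills one column
        # Write out the filled buffer (a DMA transfer in the described system).
        for line in range(row_len):
            external[line].extend(buf[line][:len(chunk)])
        active = 1 - active                   # switch to the other buffer
    return external

rows = [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12], [13, 14, 15]]
transposed = transpose_with_vertical_buffers(rows, buffer_width=2)
```

  • Each output row lands in a column of the buffer, so the buffer already holds a transposed tile by the time it is written out.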
  • the vertical buffer data can be transferred to a region in external memory using a DMA controller 114 .
  • the region in the external memory can be large enough to accommodate the full rectified image.
  • the DMA controller 114 can transfer the vertical buffer data to external memory using two-dimensional DMA.
  • the two-dimensional DMA is able to automatically skip over bytes in external memory that do not correspond to a vertical section of the image, which conceptually matches the data in the vertical buffer.
  • the external memory does not store adjacent sections (e.g., tiles) of the result image in consecutive memory locations, which may deteriorate the read and/or write speed.
  • the two-dimensional DMA can handle the read/write operation efficiently at a minimal additional cost (as long as the width of the local vertical buffer is sufficiently large).
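  • The effect of a two-dimensional transfer can be emulated in software as shown below. The function name dma_2d_write and its parameters are hypothetical and do not correspond to any particular DMA controller interface; the sketch only shows how a destination stride lets a narrow tile be written into a wider image while skipping the elements in between.

```python
def dma_2d_write(memory, dest_offset, dest_stride, tile):
    """Copy each tile row into flat `memory`, advancing dest_stride per row."""
    for r, row in enumerate(tile):
        start = dest_offset + r * dest_stride
        memory[start:start + len(row)] = row   # same-length slice assignment

# A 3-row image, 4 elements wide, stored as one flat row-major buffer.
memory = [0] * 12
tile = [[1, 2], [3, 4], [5, 6]]               # a 2-wide vertical section
dma_2d_write(memory, dest_offset=2, dest_stride=4, tile=tile)
```

  • After the call, the tile occupies columns 2 and 3 of every image row, and the elements in columns 0 and 1 are left untouched.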
  • the homography removal module 112 is configured to remove the second transform from the intermediate image by processing pixels in the x-axis of the transposed intermediate image (e.g., the y-axis of the intermediate image). Subsequently, in step 306a, the homography removal module 112 is optionally configured to down-sample the result of step 306, and in step 308, the homography removal module 112 is configured to transpose the result of step 306 (or the result of step 306a if that is performed).
  • the homography removal module 112, in steps 306/306a, can use similar operations as steps 302/302a, and in step 308, the homography removal module 112 can use a similar process as step 304.
  • FIG. 6 illustrates an example of a rectified image generated by removing the second transform B from the intermediate image of FIG. 5 in accordance with some embodiments.
  • the two horizontal, parallel lines 406 , 408 , as well as the two vertical, parallel lines 402 , 404 are parallel in the rectified image. Therefore, the resulting rectified image has all perspective distortion rectified, and is down-sampled by the target down-sampling rate.
  • the homography removal module 112 can down-sample the distorted image during homography rectification, and the homography removal module 112 can perform the down sampling in one or more of the two-stages of the multi-stage rectification process.
  • it can be advantageous to down-sample as much as possible in step 302a (e.g., in the first stage), as this can accelerate the first stage and further accelerate the second stage of the multi-stage rectification process.
  • In each stage of the multi-stage rectification process, the image can be down-sampled independently in the x or y direction.
  • the rectified image can be provided to a pattern detection system so that the pattern detection system can determine whether the rectified image includes a particular pattern. Because pattern detection systems generally operate more efficiently when the source image does not include non-linear distortion, the disclosed homography rectification process can improve the efficiency of pattern matching. In some cases, the pattern detection system can use a simple correlation-based pattern matching technique to determine whether the rectified image includes a particular pattern. In other cases, the pattern detection system can use embodiments of pattern detection techniques disclosed in U.S. Pat. No. 7,016,539, entitled “Method for fast, robust, multi-dimensional pattern recognition,” issued on Mar. 21, 2006, which is herein incorporated by reference in its entirety.
  • when the homography is not known in advance, the distorted image can be rectified using one of a plurality of hypothesized homography transforms, and the rectified image can be provided to a pattern detection system to determine whether a particular pattern exists under that hypothesized homography transform. This process can be iterated for other hypothesized homography transforms until the pattern detection system detects the particular pattern in the rectified image.
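  • The iteration over hypothesized homography transforms can be sketched as a simple search loop. The names find_pattern, rectify, and detect are hypothetical; rectify and detect are caller-supplied callables standing in for the rectification process and the pattern detection system, and the toy stand-ins at the end exist only to make the sketch runnable.

```python
def find_pattern(distorted, hypotheses, rectify, detect):
    """Return (hypothesis, rectified image) for the first hypothesis that matches."""
    for hypothesis in hypotheses:
        rectified = rectify(distorted, hypothesis)
        if detect(rectified):
            return hypothesis, rectified
    return None   # no hypothesized transform produced a detection

# Toy stand-ins: the "image" is a number and "rectifying" subtracts the hypothesis.
match = find_pattern(
    distorted=5,
    hypotheses=[1, 2, 3],
    rectify=lambda image, h: image - h,
    detect=lambda image: image == 3,
)
```

  • The loop stops at the first hypothesis for which the detector reports a match, so later hypotheses are never rectified.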
  • the disclosed multi-stage rectification process can be used merely to perform affine image sampling, without actually removing general homographic perspective distortion.
  • the parameters h_20 and h_21 can be set to zero.
  • the disclosed multi-stage rectification process can be used to merely remove image scaling.
  • the disclosed multi-stage rectification process can be configured to remove image scaling operations by setting the parameters h_10 and h_01 of the homography matrix H to zero, in addition to h_20 and h_21.
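  • As a worked illustration of the scaling-only case, substituting h_01 = h_10 = h_20 = h_21 = 0 into the coefficient relationships for the first transform A and the transposed second transform B_yx given earlier leaves only diagonal terms. This assumes the bottom-right entry of B_yx is 1 and uses the sampling factors s_Ax and s_Ay as defined above.

```latex
A = \begin{bmatrix} s_{Ax} & 0 & 0 \\ 0 & s_{Ay}\,h_{11} & 0 \\ 0 & 0 & 1 \end{bmatrix},
\qquad
B_{yx} = \begin{bmatrix} \dfrac{1}{s_{Ay}} & 0 & 0 \\ 0 & \dfrac{h_{00}}{s_{Ax}} & 0 \\ 0 & 0 & 1 \end{bmatrix}
```

  • Each stage then reduces to a pure resampling along one axis with a constant step, with no per-row shear or perspective term.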
  • the disclosed multi-stage rectification process can be computationally faster than the prior known methods for removing affine transformation or image scaling operation.
  • the computational benefit is derived from the way a kernel is applied to a row of an image.
  • the homography removal module 112 is configured to apply a single kernel function to a row of an image.
  • the homography removal module 112 can maintain a set of kernels that corresponds to this kernel function, where each kernel in the set corresponds to a particular sub-pixel phase.
  • the homography removal module 112 can select one of these kernels based on a sub-pixel phase of the location. Because the set of kernels corresponding to sub-pixel phases are already pre-determined, the homography removal module 112 can remove affine transformation or image scaling by applying a selected one of these kernels. When the sub-pixel phase of a location does not exactly correspond to one of the kernels in the set, the homography removal module 112 can select a kernel that is associated with a sub-pixel phase closest to that of the location. This removes nearly all interpolation operations, which tend to be computationally expensive.
  • the homography removal module 112 can further improve the accuracy by maintaining a large number of kernels, each kernel adapted to a finer sub-pixel variation (e.g., sub-pixel phases) from a pixel grid.
  • the homography removal module 112 can determine, for each pixel of a rectified image, the sub-pixel phase of the corresponding location in the distorted image, and use the sub-pixel variation to select one of the predetermined set of kernels. Subsequently, the homography removal module 112 can apply the selected kernel for the corresponding pixel in the distorted image to remove the image scaling.
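  • The kernel selection described above can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the two-tap linear weights stand in for whatever kernel function is actually used, and the function names and the clamping at the row boundary are assumptions.

```python
import math


def build_kernel_bank(num_phases):
    """Precompute one two-tap kernel per sub-pixel phase (linear weights)."""
    bank = []
    for p in range(num_phases):
        phase = p / num_phases
        bank.append([1.0 - phase, phase])
    return bank


def sample_row(row, x, bank):
    """Estimate the row value at fractional x with the nearest-phase kernel."""
    n = len(bank)
    base = math.floor(x)
    idx = int((x - base) * n + 0.5)  # index of the closest precomputed phase
    if idx == n:                     # closest phase is the next whole pixel
        base, idx = base + 1, 0
    total = 0.0
    for t, weight in enumerate(bank[idx]):
        i = min(max(base + t, 0), len(row) - 1)  # clamp taps to the row
        total += weight * row[i]
    return total
```

  • No weights are computed per pixel: the only per-pixel work is choosing an index and applying the stored kernel. A larger bank reduces the error introduced by snapping to the nearest phase.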
  • the homography removal module 112 when the homography removal module 112 is configured to remove image scaling only, the homography removal module 112 can perform the operation by only using the first stage of the multi-stage rectification process.
  • the homography removal module 112 may use only one of the stages in the multi-stage rectification process to remove the perspective distortion from the distorted image.
  • a distorted image may be subject to a perspective distortion that results from a rotation of an image sensor around an x-axis of the image sensor (e.g., a row of image sensor bit-cells).
  • an image sensor may be positioned such that the top of the object plane is tilted away from the image sensor, as if the image sensor were rotated around an axis that is parallel to one of the image sensor axes.
  • This relationship between the image sensor plane and the object plane may be represented by a homography transform H that actually satisfies the first and second constraints: that (1) pixels in a particular row of a rectified image are mapped to equi-distance pixels in a fixed row of a distorted image and that (2) a row of the rectified image is mapped to a row of the distorted image. Therefore, in this case, the homography removal module 112 can be configured to perform the rectification operation by only using one of the stages of the multi-stage rectification process.
  • the multi-stage rectification process can be used to align images captured by multiple cameras.
  • a machine vision system can include a plurality of cameras that view different parts of the scene but are roughly aligned.
  • each of the plurality of cameras is designed to have an identical in-plane rotation and distance from plane.
  • the pixel grid from each camera may be displaced from one another by a sub-pixel value because, due to physical limitations, it is difficult to align the pixels from each camera exactly.
  • the differences in the images captured by the multiple cameras can be calibrated offline. Therefore, the relationship between images captured by the multiple cameras can be determined offline. This relationship can be represented as a homography matrix. Therefore, the disclosed multi-stage rectification process can be used to align images captured by multiple cameras. In some cases, the disclosed multi-stage rectification process can also be used to down-sample the captured images to account for differences when one or more cameras have different numbers of pixels or different pixel sizes.

Abstract

The present disclosure provides a multi-stage image mapping mechanism for mapping a distorted image to a rectified image. For example, the multi-stage image mapping mechanism can remove homography from a distorted image to reconstruct a rectified image in two stages: (1) a first stage in which distortion is partially removed from a distorted image to generate an intermediate image, and (2) a second stage in which residual distortion is removed from the intermediate image to recover the rectified image.

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • This application is related to U.S. patent application Ser. No. ______, entitled “EFFICIENT IMAGE TRANSFORMATION,” identified by the Attorney Docket Number (C15-006B) 0103819-00137US2, filed on the even-date herewith, the entire content of which is herein incorporated by reference.
  • TECHNICAL FIELD
  • Disclosed herein are apparatus, systems, and methods that relate to rectifying a homographic projection of an image for image processing applications.
  • BACKGROUND
  • Digital images can be formed by many devices and used for many practical purposes. Devices include cameras with image sensors operating on visible or infrared light, such as a charge-coupled device (CCD) image sensor or a complementary metal-oxide-semiconductor (CMOS) image sensor, line-scan sensors, flying spot scanners, electron microscopes, X-ray devices including computed tomography (CT) scanners, magnetic resonance imagers, and other devices known to those skilled in the art. Practical applications are found in industrial automation, medical diagnosis, satellite imaging for a variety of military, civilian, and scientific purposes, photographic processing, surveillance and traffic monitoring, document processing, and many other applications.
  • To serve these applications, the images formed by the various devices can be analyzed by machine vision systems to extract appropriate information. One form of analysis that is of considerable practical importance is determining the position, orientation, and size of patterns in an image that correspond to objects in the field of view of the imaging device. Pattern detection methods are of particular importance in industrial automation, where they are used to guide robots and other automation equipment in semiconductor manufacturing, electronics assembly, pharmaceuticals, food processing, consumer goods manufacturing, and many others.
  • Pattern detection methods can operate on images that have been subject to perspective distortions. However, the perspective distortions tend to increase the computational complexity and the computing time of pattern detection methods. Therefore, as a preprocessing step, images can be rectified to remove perspective distortions prior to performing the pattern detection methods.
  • SUMMARY
  • In accordance with the disclosed subject matter, apparatus, systems, and methods are provided for rectifying a homographic distortion of an image for image processing applications.
  • Some embodiments include a machine vision system. The system can include a processor configured to run a computer program stored in memory. The computer program can be configured to cause the processor to receive a distorted image and a homography matrix associated with the distorted image, determine, using a homography decomposition module, a first transform and a second transform that collectively represent the homography matrix, wherein the first transform relates the distorted image to an intermediate image, and the second transform relates the intermediate image to a rectified image, wherein the first transform is constrained to map pixels along a first dimension of the intermediate image to locations along a first dimension of the distorted image, and wherein the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image, generate, using a homography removal module, the intermediate image by removing the first transform from the distorted image, and generate, using the homography removal module, the rectified image by removing the second transform from the intermediate image.
  • In some embodiments, the second transform is constrained to map pixels along a first dimension of the rectified image to locations along the first dimension of the intermediate image, and wherein the locations along the first dimension of the intermediate image are separated by a same distance from adjacent locations along the first dimension of the intermediate image.
  • In some embodiments, the processor is further configured to transpose the intermediate image using the homography removal module prior to removing the second transform from the intermediate image.
  • In some embodiments, the processor is further configured, using the homography removal module, to cause a direct memory access controller to transpose the intermediate image prior to removing the second transform from the intermediate image.
  • In some embodiments the direct memory access controller is configured to transpose the intermediate image using a two-dimensional direct memory access.
  • In some embodiments, the processor is further configured, using the homography removal module, to remove the first transform from the distorted image by determining, for a first pixel of the intermediate image, a corresponding first location in the distorted image, and convolving pixels surrounding the corresponding first location in the distorted image to determine an image value for the first pixel.
  • In some embodiments, the processor is further configured, using the homography removal module, to determine the corresponding first location by operating the first transform on a coordinate of the first pixel.
  • In some embodiments, the processor is configured, using the homography removal module, to determine, for a second pixel of the intermediate image having an identical coordinate as the first pixel in a second dimension, a corresponding second location in the distorted image by adding a constant to a coordinate of the corresponding first location in the first dimension.
  • In some embodiments, the processor is configured, using the homography removal module, to provide the rectified image to a pattern detection system so that the pattern detection system can use the rectified image to determine whether the rectified image includes a particular pattern.
  • In some embodiments, the processor is configured to receive the homography matrix from the pattern detection system, wherein the homography matrix is determined by the pattern detection system based on a hypothesis of homography associated with the distorted image.
  • In some embodiments, the processor is configured, using the homography removal module, to select one of a set of kernels based on a sub-pixel coordinate of one of the locations along the first dimension of the distorted image, and apply the selected kernel at the one of the locations.
  • Some embodiments include a method for operating the disclosed system or the disclosed non-transitory computer readable medium. The method includes receiving, at a homography decomposition module, a homography matrix associated with a distorted image, determining, using the homography decomposition module, a first transform and a second transform that collectively represent the homography matrix, wherein the first transform relates the distorted image to an intermediate image, and the second transform relates the intermediate image to a rectified image, wherein the first transform is constrained to map pixels along a first dimension of the intermediate image to locations along a first dimension of the distorted image, and wherein the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image, generating, using a homography removal module coupled to the homography decomposition module, the intermediate image by removing the first transform from the distorted image, and generating, using the homography removal module, the rectified image by removing the second transform from the intermediate image.
  • In some embodiments, the method includes transposing, using the homography removal module, the intermediate image prior to removing the second transform from the intermediate image.
  • In some embodiments, the method includes causing, using the homography removal module, a direct memory access controller to transpose the intermediate image prior to removing the second transform from the intermediate image.
  • In some embodiments, the method includes determining, for a first pixel of the intermediate image, a corresponding first location in the distorted image, and convolving pixels surrounding the corresponding location in the distorted image to determine an image value for the first pixel.
  • In some embodiments, the method includes determining, for a second pixel of the intermediate image having an identical coordinate as the first pixel in a second dimension, a corresponding second location in the distorted image by adding a constant to a coordinate of the corresponding first location in the first dimension.
  • In some embodiments, the method includes providing the rectified image to a pattern detection system so that the pattern detection system can use the rectified image to determine whether the rectified image includes a particular pattern.
  • In some embodiments, the method includes selecting one of a set of kernels based on a sub-pixel coordinate of one of the locations along the first dimension of the distorted image, and applying the selected kernel at the one of the locations.
  • Some embodiments include a non-transitory computer readable medium having executable instructions associated with a homography decomposition module and a homography removal module. The modules are operable to cause a machine vision system to receive a homography matrix associated with a distorted image, determine a first transform and a second transform that collectively represent the homography matrix, wherein the first transform relates the distorted image to an intermediate image, and the second transform relates the intermediate image to a rectified image, wherein the first transform is constrained to map pixels along a first dimension of the intermediate image to locations along a first dimension of the distorted image, and wherein the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image, generate the intermediate image by removing the first transform from the distorted image, and generate the rectified image by removing the second transform from the intermediate image.
  • The disclosed apparatus, systems, and methods described herein can improve the efficiency of rectifying homographic distortions in a distorted image by mapping the distorted image to a rectified image using a multi-stage rectification process. Each stage of the rectification process is designed to reduce the computation and to reduce the amount of time needed to access images from memory.
  • There has thus been outlined, rather broadly, the features of the disclosed subject matter in order that the detailed description thereof that follows may be better understood, and in order that the present contribution to the art may be better appreciated. There are, of course, additional features of the disclosed subject matter that will be described hereinafter and which will form the subject matter of the claims appended hereto. It is to be understood that the phraseology and terminology employed herein are for the purpose of description and should not be regarded as limiting.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • Various objects, features, and advantages of the disclosed subject matter can be more fully appreciated with reference to the following detailed description of the disclosed subject matter when considered in connection with the following drawings, in which like reference numerals identify like elements.
  • FIG. 1 shows a machine vision system for rectifying a homography of a distorted image in accordance with some embodiments.
  • FIG. 2 illustrates an application of a second transform B to distort a rectified image into an intermediate image, and an application of a first transform A to distort the intermediate image into the distorted image, in accordance with some embodiments.
  • FIG. 3 illustrates a two-stage image mapping process in accordance with some embodiments.
  • FIG. 4 illustrates an example of a distorted image that has been subject to perspective distortion in accordance with some embodiments.
  • FIG. 5 illustrates an example of an intermediate image generated by removing the first transform A from the distorted image of FIG. 4 in accordance with some embodiments.
  • FIG. 6 illustrates an example of a rectified image generated by removing the second transform B from the intermediate image of FIG. 5 in accordance with some embodiments.
  • DETAILED DESCRIPTION
  • In the following description, numerous specific details are set forth regarding the systems and methods of the disclosed subject matter and the environment in which such systems and methods may operate, etc., in order to provide a thorough understanding of the disclosed subject matter. It will be apparent to one skilled in the art, however, that the disclosed subject matter may be practiced without such specific details, and that certain features, which are well known in the art, are not described in detail in order to avoid complication of the disclosed subject matter. In addition, it will be understood that the examples provided below are exemplary, and that it is contemplated that there are other systems and methods that are within the scope of the disclosed subject matter.
  • An image can include a two-dimensional function whose values correspond to physical characteristics of an object, such as brightness (radiant energy, reflected or otherwise), color, temperature, height above a reference plane, etc., and measured by any image-forming device, or whose values correspond to simulated characteristics of an object, and generated by any data processing device. An image can be regarded as a two-dimensional matrix, where each bin (e.g., a pixel) in the two-dimensional matrix is associated with a value or a set of values. Each pixel can be identified using a two-dimensional coordinate (x, y). In some embodiments, a row of an image can be identified by a set of pixels having an identical y-coordinate; a column of an image can be identified by a set of pixels having an identical x-coordinate.
  • A distorted image can include an image that has been subject to distortions, for example, due to a perspective effect. A distorted image can be disposed on an image plane, and it can represent a perspective view of an object plane. Therefore, parallel lines on an object plane may converge on the image plane, either within the distorted image or outside of the distorted image.
  • A rectified image can include an image that is recovered by removing distortions (e.g., a perspective effect) from the distorted image. The rectified image can be disposed on an object plane, and it can represent an orthographic view of the object plane. In the rectified image, a vanishing line of an object plane can be the line at infinity.
  • Image processing can refer to an automated analysis of images to determine the characteristics of features shown in them. It is used, for example, in automated manufacturing lines, where images of parts are analyzed to determine placement and alignment prior to assembly. It is also used, for example, in quality assurance where images of packages are analyzed to ensure that product labels, lot numbers, and “freshness” dates are properly positioned and legible.
  • Image processing has non-industrial applications as well. In biotechnology research, for example, it can be used to identify constituents of microscopically imaged samples, or growth patterns in culture media. On the macroscopic scale, it can be used in astronomical research to find objects in time-lapse images. Meteorological, agricultural and defense applications of image processing include the detection and analysis of objects in satellite images.
  • One image processing technique for improving the accuracy of object/pattern/feature detection is homography rectification. Homography refers to a relationship between two images of the same planar surface in space. Homography has an application in image rectification, which can refer to a process for mapping a distorted image to a rectified image on a common image plane on which a vanishing line becomes a line at infinity. This transformation process can be referred to as a homography rectification. In some sense, homography rectification removes distortions introduced by perspective properties of an imaging system.
  • In homography rectification, once the homography (e.g., a mapping between the distorted image and the rectified image) is known, an image mapping technique can be used to remove the homography from the distorted image to recover the rectified image. This removal operation can include one or more image processing operations, which may include one or more of resizing, translation, rotation, skewing, and/or other image transformation operations.
  • Conventional image mapping techniques are unfortunately slow and too computationally intensive for most image processing applications. Conventional image mapping techniques often identify, for every pixel location in the rectified image, a corresponding pixel location in the distorted image, and the value of the corresponding pixel location in the distorted image is copied to the associated pixel location in the rectified image to complete the mapping. In a simplistic example, every pixel coordinate position in the rectified image maps directly to an existing pixel in the source. Thus, for example, the pixel at coordinate (4, 10) in the rectified image maps to coordinate (2, 5) in the distorted image; the pixel at (6, 10) in the rectified image maps to (3, 5) in the distorted image; and so on.
  • However, rarely do pixels in the rectified image map directly to grid positions in the distorted image. Thus, for example, a pixel at a coordinate (4, 10) in the rectified image may map to a location that is off from the pixel grid by a sub-pixel, e.g., a coordinate (2.5, 5.33), in the distorted image. This may necessitate interpolation and/or filtering to determine appropriate image values at the determined location. In the example provided above, an appropriate intensity for the pixel at coordinate (4, 10) in the rectified image might be determined as a weighted average of the intensities for the source pixel locations (2, 5), (3, 5), (2, 6), and (3, 6). Hereinafter, the term location is used to refer to a coordinate in a plane, which may or may not be located on a pixel grid.
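  • The weighted average in the example above can be sketched as follows. Bilinear weighting is used here as one common choice; the disclosure does not state which weights a conventional technique would use, and the function name is hypothetical.

```python
import math


def bilinear_weights(x, y):
    """Return {pixel: weight} for the four grid pixels surrounding (x, y)."""
    x0, y0 = math.floor(x), math.floor(y)
    fx, fy = x - x0, y - y0  # sub-pixel offsets from the grid
    return {
        (x0, y0): (1 - fx) * (1 - fy),
        (x0 + 1, y0): fx * (1 - fy),
        (x0, y0 + 1): (1 - fx) * fy,
        (x0 + 1, y0 + 1): fx * fy,
    }


# Location from the example: (2.5, 5.33) in the distorted image.
weights = bilinear_weights(2.5, 5.33)
```

  • For the location (2.5, 5.33), the four contributing pixels are (2, 5), (3, 5), (2, 6), and (3, 6), and the weights sum to one. Computing these weights independently for every destination pixel is part of what makes the conventional approach expensive.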
  • Such image mapping techniques are usually both computationally expensive and time consuming because the corresponding pixel(s) in the distorted image must be independently identified for each pixel in the rectified image.
  • Furthermore, such image mapping techniques are generally inefficient in their memory usage because they access pixels stored in memory in a manner that is not aligned to the memory structure (e.g., the way data is stored in memory). For example, when an image mapping technique uses one or more pixels in the distorted image to interpolate an image value for a pixel in the rectified image, the one or more pixels are fetched from memory. Certain kinds of memory, such as dynamic random access memory (DRAM), have a property that access to sequential memory locations (e.g., consecutive x coordinates in an image) is generally much faster than non-sequential access. Therefore, the order in which the one or more pixels are accessed from the memory or the order in which the one or more pixels are written to the memory can be critical for fast memory access operations. Unfortunately, conventional image mapping techniques do not guarantee that the order in which the one or more pixels are fetched would be favorable to memory. Because the one or more pixels accessed from the memory may not correspond to sequential memory locations in the memory, retrieving pixels from the memory can be time-consuming.
  • Some of the issues associated with conventional image mapping techniques were addressed to a certain degree in 3D Transformations of Images in Scanline Order (Catmull and Smith, 1980). Catmull shows a method that decomposes an image mapping process into multiple stages so that the pixels in the distorted image (referred to as source pixels) that correspond to a pixel in the rectified image (referred to as a destination pixel) are located in sequential locations in memory. However, Catmull's method still requires performing substantial computation, notably a division, for each destination pixel. Catmull appears to have combined its method with specialized hardware to improve the computational speed, but its method is not amenable to efficient execution on a general-purpose processor.
  • Some of the issues associated with conventional image mapping techniques were also addressed to a certain degree in High-quality multi-pass image resampling (Szeliski et al., 2010). Szeliski discloses methods for performing some classes of image mapping that arguably have certain desired properties. However, Szeliski's methods do not apply to homographic/perspective image mapping. In fact, Szeliski concedes that it “[has] not yet fully developed the theory of optimal multi-pass perspective algorithms because achieving full computational efficiency is tricky.” Therefore, there is a need to improve the efficiency of computation and memory access for homographic/perspective image mapping.
  • The present disclosure provides a multi-stage image mapping mechanism for mapping a distorted image to a rectified image. For example, the multi-stage image mapping mechanism can remove homography from a distorted image to reconstruct a rectified image in two stages: (1) a first stage in which distortion is partially removed from a distorted image to generate an intermediate image, and (2) a second stage in which residual distortion is removed from the intermediate image to recover the rectified image. The multi-stage mechanism can be advantageous because it can allow each stage to be efficient in (1) identifying correspondence between pixels in two images, (2) accessing pixels from memory, and (3) filtering pixels to determine image values.
  • To this end, the multi-stage image mapping mechanism decomposes a homography transformation H into two separate transformations: a first transformation A and a second transformation B. Therefore,

  • H=A*B
  • Under this representation, the second transformation B is designed to map a coordinate of an object plane (e.g., a plane associated with a rectified image) to a coordinate of an intermediate image plane (e.g., a plane associated with an intermediate image), and the first transformation A is designed to map a coordinate of the intermediate image plane to a coordinate of an image plane (e.g., a plane associated with a distorted image). Therefore, the first stage of the multi-stage image mapping mechanism can be thought of as removing the first transformation A from the distorted image to generate an intermediate image, and the second stage of the multi-stage image mapping mechanism can be thought of as removing the second transformation B from the intermediate image to generate a rectified image.
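  • The relationship H=A*B can be checked numerically with a short sketch. It assumes 3x3 matrices stored as nested lists and acting on homogeneous column vectors, so that applying B and then A to a coordinate of the object plane gives the same location as applying H directly. The function names are hypothetical, and any matrices used with them are illustrative only, not a decomposition taken from the disclosure.

```python
def compose(A, B):
    """Return the 3x3 matrix product A*B for nested-list matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def map_point(M, x, y):
    """Map (x, y) through M using homogeneous coordinates."""
    w = M[2][0] * x + M[2][1] * y + M[2][2]
    return ((M[0][0] * x + M[0][1] * y + M[0][2]) / w,
            (M[1][0] * x + M[1][1] * y + M[1][2]) / w)


def map_in_two_steps(A, B, x, y):
    """Object plane -> intermediate plane (B) -> image plane (A)."""
    xi, yi = map_point(B, x, y)
    return map_point(A, xi, yi)
```

  • For any A and B with non-zero denominators, map_point(compose(A, B), x, y) and map_in_two_steps(A, B, x, y) agree up to floating-point rounding, which is why removing A and then B from the distorted image undoes H.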
  • In the following description, a source image refers to an image that is provided as an input to one of decomposed transformations of homography (e.g., a first transform A or a second transform B), and a destination image refers to an image that is provided as an output from one of decomposed transformations of homography (e.g., a first transform A or a second transform B). For example, for the second transform B, the source image is the rectified image and the destination image is the intermediate image; and for the first transform A, the source image is the intermediate image and the destination image is the distorted image.
  • In some embodiments, one or more of the decomposed transformations (A and/or B) are designed to map all pixels of a source image having a particular y-coordinate to pixels of a destination image having a fixed y-coordinate. In one example, the first transform A is configured to map a row of a source image to a row of a destination image. This feature allows a majority of memory accesses for the source image and/or the destination image to be aligned with the memory structure, thereby reducing cache misses and reducing the access time for accessing the external memory.
  • In some embodiments, one or more of the decomposed transformations (A and/or B) are configured to map pixels in a particular row of a source image to locations that are equidistant from adjacent locations in a fixed row of a destination image. For example, pixels (1, y0), (2, y0), (3, y0) in the source image (e.g., intermediate image for the first transform A) can be mapped to locations (2, y1), (4, y1), (6, y1) in the destination image (e.g., distorted image for the first transform A). As before, the fixed row of a destination image may be offset from the pixel grid by a sub-pixel value. Therefore, the coordinate y1 can be a fractional coordinate value (e.g., 3.5).
  • In some embodiments, because the distance between locations in the destination image is constant (referred to as a “sampling constant”), the image mapping technique need not independently compute locations in the destination image corresponding to pixels in a particular row of a source image. Instead, once the image mapping technique identifies a first location in the destination image that corresponds to a first pixel in a source image, the image mapping technique can use the first location as a reference point to determine (e.g., by simply adding the sampling constant to the x-coordinate of the first location) a second location corresponding to an adjacent pixel in the same row as the first pixel in the source image. This characteristic of decomposed transformations reduces the amount of computation needed for identifying one or more locations in the destination image that correspond to pixels in a source image.
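  • The use of the sampling constant can be sketched as follows. The sketch assumes a transform of the form [[a, b, c], [0, e, f], [0, h, 1]], which is one matrix form that maps each source row to a single destination y-coordinate with equally spaced x-locations; this form and the function name are assumptions for illustration, not necessarily the decomposition used by the disclosure.

```python
def row_locations(A, y, width):
    """Destination locations for source pixels (0, y) .. (width - 1, y).

    A is assumed to have zeros at A[1][0] and A[2][0], so the denominator
    and the destination y-coordinate depend only on the source row y.
    """
    w = A[2][1] * y + A[2][2]              # shared denominator for the row
    y_fixed = (A[1][1] * y + A[1][2]) / w  # destination y for the whole row
    x = (A[0][1] * y + A[0][2]) / w        # location for source pixel x = 0
    step = A[0][0] / w                     # sampling constant for this row
    xs = []
    for _ in range(width):
        xs.append(x)
        x += step                          # one addition per pixel
    return xs, y_fixed, step
```

  • The divisions are performed once per row rather than once per pixel, and each subsequent location is obtained by a single addition. The sampling constant can differ from row to row because the denominator depends on y.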
  • FIG. 1 shows a machine vision system 100 for rectifying a homography of a distorted image in accordance with some embodiments. The system 100 can include a computing device 102, which includes a processor 104, memory 106, a homography rectification (HR) module 108, a direct memory access (DMA) controller 114, and an interface 116. The system 100 can also include an image capture device 118, which can generate an image of a scene.
  • In some embodiments, the processor 104 can execute instructions and one or more memory devices 106 can store instructions and/or data. The memory device 106 can be a non-transitory computer readable medium, such as a dynamic random access memory (DRAM), a static random access memory (SRAM), flash memory, a magnetic disk drive, an optical drive, a programmable read-only memory (PROM), a read-only memory (ROM), or any other memory or combination of memories. The memory device 106 can be used to temporarily store data. The memory device 106 can also be used for long-term data storage. The processor 104 and the memory device 106 can be supplemented by and/or incorporated into special purpose logic circuitry.
  • In some embodiments, the HR module 108 can include a homography decomposition module 110 and a homography removal module 112.
  • The homography decomposition module 110 can be configured to receive a representation of homography to be removed from a distorted image, and determine decomposed transformations corresponding to the homography. The decomposed transformations can include, for example, a first transform A to be removed from a distorted image to generate an intermediate image, and a second transform B to be removed from the intermediate image to generate a rectified image. In some embodiments, the decomposed transformations can include more than two transforms, e.g., three or four transforms, that may be sequentially removed from the distorted image to generate a rectified image.
  • The homography decomposition module 110 can constrain one or more of the decomposed transformations so that all pixels in a particular row of a source image are mapped to a fixed y-coordinate of a destination image. The fixed y-coordinate may be an integer, in which case all pixels in a particular row of a source image are mapped to a fixed row of a destination image. Also, the homography decomposition module 110 can constrain one or more of the decomposed transformations so that pixels in a particular row of a source image are mapped to locations that are equidistant from each other in a fixed y-coordinate of a destination image.
  • The homography removal module 112 can be configured to remove the decomposed transformations from the distorted image to recover the rectified image. For example, the homography removal module 112 can remove the first transform A from the distorted image to generate an intermediate image, and remove the second transform B from the intermediate image to recover a rectified image. In some embodiments, the homography removal module 112 can down-sample a row of a destination image to remove a decomposed transformation from the destination image. In this case, the homography removal module 112 can down-sample each row of the destination image with a different sampling rate to account for the projective distortion of the decomposed transformation.
  • In some embodiments, the homography removal module 112 can be configured to down-sample a destination image as a part of the decomposed transformation removal process. This can be done by appropriately scaling decomposed transformations, e.g., by a down-sampling rate.
  • In some embodiments, the direct memory access (DMA) controller 114 is configured to allow certain hardware devices to send or receive data directly to or from the main memory, thereby bypassing the processor 104 to speed up memory operations. For example, the DMA controller 114 can allow the homography decomposition module 110 to access the memory 106 without involving the processor 104. In some embodiments, the DMA controller 114 can perform a two-dimensional DMA. The two-dimensional DMA is able to automatically skip over certain bytes in memory that do not correspond to a column of an image.
  • In some embodiments, the interface 116 can be implemented in hardware to send and receive signals in a variety of mediums, such as optical, copper, and/or wireless interfaces, and in a number of different protocols, some of which may be non-transient.
  • In some embodiments, one or more of the modules 108, 110, 112 can be implemented in software. The software can run on a processor 104 capable of executing computer instructions or computer code. The processor 104 is implemented in hardware using an application specific integrated circuit (ASIC), programmable logic array (PLA), digital signal processor (DSP), field programmable gate array (FPGA), or any other integrated circuit. Processors 104 suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, digital signal processors, and any one or more processors of any kind of digital computer. Generally, the processor 104 receives instructions and data from a read-only memory or a random access memory or both.
  • The modules 108, 110, 112 can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. The implementation can be as a computer program product, e.g., a computer program tangibly embodied in a machine-readable storage device, for execution by, or to control the operation of, a data processing apparatus, e.g., a programmable processor, a computer, and/or multiple computers. A computer program can be written in any form of computer or programming language, including source code, compiled code, interpreted code and/or machine code, and the computer program can be deployed in any form, including as a stand-alone program or as a subroutine, element, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one or more sites.
  • In some embodiments, two or more modules 108, 110, 112 can be implemented on the same integrated circuit, such as ASIC, PLA, DSP, or FPGA, thereby forming a system on chip. Subroutines can refer to portions of the computer program and/or the processor/special circuitry that implement one or more functions.
  • The computing device 102 can be operatively coupled to an image capture device 118. The image capture device 118 can provide, via the interface 116, one or more images to be rectified. The image capture device 118 can include a charge-coupled device (CCD) image sensor or a complementary metal-oxide-semiconductor (CMOS) image sensor, line-scan sensors, flying spot scanners, electron microscopes, X-ray devices including computed tomography (CT) scanners, magnetic resonance imagers, and/or other devices known to those skilled in the art.
  • The computing device 102 can be operatively coupled to external equipment, for example factory automation or logistics equipment, or to a communications network, for example a factory automation or logistics network, in order to receive instructions and/or data from the equipment or network and/or to transfer instructions and/or data to the equipment or network. Computer-readable storage devices suitable for embodying computer program instructions and data include all forms of volatile and non-volatile memory, including by way of example semiconductor memory devices, e.g., DRAM, SRAM, EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and optical disks, e.g., CD, DVD, HD-DVD, and Blu-ray disks.
  • In some embodiments, the computing device 102 can include user equipment. The user equipment can communicate with one or more radio access networks and with wired communication networks. The user equipment can be a cellular phone. The user equipment can also be a smart phone providing services such as word processing, web browsing, gaming, e-book capabilities, an operating system, and a full keyboard. The user equipment can also be a tablet computer providing network access and most of the services provided by a smart phone. The user equipment operates using an operating system such as Symbian OS, iPhone OS, RIM's Blackberry, Windows Mobile, Linux, HP WebOS, and Android. The screen might be a touch screen that is used to input data to the mobile device, in which case the screen can be used instead of the full keyboard. The user equipment can also keep global positioning coordinates, profile information, or other location information.
  • In some embodiments, the computing device 102 can include a server. The server can operate using operating system (OS) software. In some embodiments, the OS software is based on a Linux software kernel and runs specific applications in the server such as monitoring tasks and providing protocol stacks. The OS software allows server resources to be allocated separately for control and data paths. For example, certain packet accelerator cards and packet services cards are dedicated to performing routing or security control functions, while other packet accelerator cards/packet services cards are dedicated to processing user session traffic. As network requirements change, hardware resources can be dynamically deployed to meet the requirements in some embodiments.
  • In some embodiments, the homography removal module 112 can be configured to rectify a distorted image to recover a rectified image using a two-stage image mapping process. In the first stage of the two-stage image mapping process, the homography removal module 112 can remove a first transformation A from the distorted image to generate an intermediate image by processing pixels along the x-axis of the distorted image, and in the second stage of the two-stage image mapping process, the homography removal module 112 can remove a second transform B by processing pixels along the y-axis of the intermediate image.
  • In some embodiments, one or more decomposed transformations of homography can be designed so that the homography removal module 112 accesses pixels preferentially along an x-axis of a source image and along an x-axis of a destination image. This is because memory is generally aligned with the x-axis of an image. In other words, memory generally stores an image by (1) storing all pixels having a first y-coordinate while preserving the pixel order along the x-axis, and then (2) storing all pixels having a second y-coordinate (oftentimes adjacent to the first y-coordinate) while preserving the pixel order along the x-axis, until all pixels in the image are stored. Therefore, by designing one or more decomposed transformations to access pixels preferentially along the x-axis of the image, the homography removal module 112 can use direct memory access, e.g., using the direct memory access controller, to reduce cache misses and memory access time.
  • In some embodiments, the homography removal module 112 can be configured to transpose an intermediate image generated by the first stage of the two-stage image mapping process, prior to initiating the second stage of the two-stage image mapping process. By doing so, although the second stage of the two-stage image mapping process is configured to process pixels along the y-axis of the intermediate image, the homography removal module 112 can still access pixels preferentially along an x-axis of the transposed intermediate image (which is in effect the y-axis of the intermediate image before transposition) to align the memory access to the memory structure during the second stage of the two-stage image mapping process.
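  • For illustration only, the memory-layout argument above can be sketched as follows, assuming a row-major image held in a flat list. The helper names `offset` and `transpose` are hypothetical and are not part of the disclosure. The sketch shows that a column of the intermediate image occupies addresses that are a full row apart, while the same pixels are contiguous once the image is transposed.

```python
def offset(x, y, width):
    """Flat index of pixel (x, y) in a row-major image of the given width."""
    return y * width + x


def transpose(pixels, width, height):
    """Return the transposed image (row-major; new width equals old height)."""
    return [pixels[offset(x, y, width)] for x in range(width) for y in range(height)]


width, height = 4, 3
image = list(range(width * height))      # each pixel value equals its flat index

# Column x = 1 of the original image: addresses are `width` apart.
column_offsets = [offset(1, y, width) for y in range(height)]

# After transposing, that column becomes row 1 and its pixels are contiguous.
transposed = transpose(image, width, height)
row_offsets = [offset(x, 1, height) for x in range(height)]
row_values = [transposed[i] for i in row_offsets]
```

    In this sketch the strided offsets 1, 5, 9 become the contiguous offsets 3, 4, 5, which is the access pattern that row-oriented DMA transfers favor.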
  • As discussed above, in some embodiments, the homography decomposition module 110 is configured to decompose the homography into a first transform and a second transform that collectively represent the homography. In some cases, once the homography decomposition module 110 receives the predetermined homography, the homography decomposition module 110 can use a predetermined relationship between the homography, the first transform, and the second transform to determine the decomposition. The following describes this predetermined relationship between the homography, the first transform, and the second transform.
  • Homography is typically represented using a 3×3 homography matrix H in a homogenous coordinate (xh, yh, zh), where the subscript h indicates that the variable is associated with a homogeneous coordinate. Suppose (x, y) is a coordinate system of the object plane (e.g., a plane associated with a rectified image) and (x′, y′) is a coordinate system of an image plane (e.g., a plane associated with a distorted image). By definition of homogenous coordinates, it follows that:
  • $$x = \frac{x_h}{z_h}, \quad y = \frac{y_h}{z_h}, \quad x' = \frac{x'_h}{z'_h}, \quad y' = \frac{y'_h}{z'_h}$$
  • A general perspective operation can be expressed as follows:
  • $$\begin{bmatrix} x'_h \\ y'_h \\ z'_h \end{bmatrix} = H \begin{bmatrix} x \\ y \\ 1 \end{bmatrix}$$
  • Or equivalently by explicitly identifying the coefficients of H:
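  • $$\begin{bmatrix} x'_h \\ y'_h \\ z'_h \end{bmatrix} = \begin{bmatrix} h_{00} & h_{01} & h_{02} \\ h_{10} & h_{11} & h_{12} \\ h_{20} & h_{21} & h_{22} \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix}$$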
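  • As a minimal sketch of the perspective operation above, the following maps an object-plane point (x, y) to an image-plane point (x′, y′) by multiplying by H and dividing by the homogeneous z component. The function name `apply_homography` and the numeric coefficients are illustrative assumptions, not values from the disclosure.

```python
def apply_homography(H, x, y):
    """Map object-plane (x, y) to image-plane (x', y') through a 3x3 matrix H."""
    xh = H[0][0] * x + H[0][1] * y + H[0][2]
    yh = H[1][0] * x + H[1][1] * y + H[1][2]
    zh = H[2][0] * x + H[2][1] * y + H[2][2]
    if zh == 0:
        raise ZeroDivisionError("point maps to infinity (z_h == 0)")
    return xh / zh, yh / zh


# Illustrative coefficients only; they are not taken from the disclosure.
H = [[1.0, 0.1, 0.0],
     [0.1, 1.0, 0.0],
     [0.5, 0.0, 1.0]]
x_img, y_img = apply_homography(H, 2.0, 0.0)
```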
  • Without loss of generality, the homography decomposition module 110 can choose an origin point on the image plane, and define that location to be the origin of the object plane as well. This way, the homography matrix H can be simplified to remove translation:

  • $h_{02} = h_{12} = 0$
  • Furthermore, as is the case with homogenous coordinate representations, the homography decomposition module 110 can arbitrarily set the overall scale of the homography matrix H. In some embodiments, the homography decomposition module 110 can set the scale of the homography matrix H by setting h22=1.
  • With these assumptions, the homography matrix H that defines the homography between an object plane and an image plane has six parameters and can be written as:
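  • $$\begin{bmatrix} x'_h \\ y'_h \\ z'_h \end{bmatrix} = \begin{bmatrix} h_{00} & h_{01} & 0 \\ h_{10} & h_{11} & 0 \\ h_{20} & h_{21} & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix}$$
  • This representation of the homography matrix H allows for rectifying an arbitrary amount of rotation.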
  • In some cases, the homography matrix H can be further constrained to improve the quality of the rectified image. To this end, the homography decomposition module 110 can constrain the rotation element of the homography matrix H. The constraint on the rotation element of the homography matrix H, in effect, determines (e.g., fixes) the amount of rotation associated with the homography matrix H.
  • In some embodiments, the homography decomposition module 110 is configured to constrain the rotation component of the homography matrix H by setting h01=h10.
  • Under this constraint, the homography matrix H that defines the homography between an object plane and an image plane has five parameters and can be written as:
  • $$\begin{bmatrix} x'_h \\ y'_h \\ z'_h \end{bmatrix} = \begin{bmatrix} h_{00} & h_{01} & 0 \\ h_{01} & h_{11} & 0 \\ h_{20} & h_{21} & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix}$$
  • In some cases, the actual rotation associated with the deformed image can deviate substantially from the amount of rotation associated with the homography matrix H. This may result in undesirable performance degradation. In such cases, the homography decomposition module 110 can instruct the homography removal module 112 to locally down-sample during stage 1 of the multi-stage rectification process and then locally up-sample during stage 2 of the multi-stage rectification process to reduce performance degradation.
  • In some embodiments, the homography matrix H can be further constrained to include only two degrees of freedom with regard to the tilt. The two degrees of freedom with regard to the tilt can be parameterized using: (1) a tilt direction τ (also known as azimuth) and (2) a tilt amount σ (also known as declination or just tilt). When the tilt constraint is combined with the previously described constraints and conventions, this is sufficient to compute the homography matrix H to rectify the tilt specified by σ and τ:
  • $$\begin{bmatrix} x'_h \\ y'_h \\ z'_h \end{bmatrix} = \begin{bmatrix} (\cos\tau)^2 + (\sin\tau)^2 \cos\sigma & \sin\tau \cos\tau\,(1 - \cos\sigma) & 0 \\ \sin\tau \cos\tau\,(1 - \cos\sigma) & (\sin\tau)^2 + (\cos\tau)^2 \cos\sigma & 0 \\ \frac{-\sin\tau \sin\sigma}{f} & \frac{\cos\tau \sin\sigma}{f} & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix}$$
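  • The tilt parameterization above can be sketched as a small function that builds H from τ, σ, and the scalar f appearing in the bottom row. The function name `tilt_homography`, the use of radians, and the sample values are assumptions for illustration; f is treated here simply as a non-zero scalar parameter.

```python
import math


def tilt_homography(tau, sigma, f):
    """Build H from tilt direction tau and tilt amount sigma (radians) and f."""
    ct, st = math.cos(tau), math.sin(tau)
    cs, ss = math.cos(sigma), math.sin(sigma)
    return [
        [ct * ct + st * st * cs, st * ct * (1.0 - cs), 0.0],
        [st * ct * (1.0 - cs), st * st + ct * ct * cs, 0.0],
        [-st * ss / f, ct * ss / f, 1.0],
    ]


# Sample values are illustrative assumptions only.
H = tilt_homography(math.radians(30.0), math.radians(20.0), f=1000.0)
H_flat = tilt_homography(math.radians(30.0), 0.0, f=1000.0)   # zero tilt
```

    In this sketch the off-diagonal entries h01 and h10 are equal by construction, matching the h01=h10 constraint, and a zero tilt amount yields the identity up to rounding.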
  • In some embodiments, the homography decomposition module 110 can constrain the rotation component of the homography matrix H through another parameterization. This parameterization includes three angle parameters that specify (sequentially): (1) a rotation around the z axis, (2) a rotation around the y axis and (3) a rotation around the x axis. This is often called a “roll, pitch, and yaw” representation. There are at least two different ways to parameterize the “roll, pitch, and yaw” representation. In a first example, the pitch and yaw represent rotations around the “new” axis after the previous rotations; and in a second example, each of the roll, pitch, and yaw represents rotation around a non-moving “world” axis. For the purpose of illustration, the following disclosure uses the second example of the “roll, pitch, and yaw” representation, in which the rotation around the world z axis is indicated as γ, the rotation around the world y axis is indicated as β, and the rotation around the world x axis is indicated as α. Based on this representation, the homography matrix H can take the following form:
  • $$\begin{bmatrix} x'_h \\ y'_h \\ z'_h \end{bmatrix} = \begin{bmatrix} \cos\beta \cos\gamma & -\cos\beta \sin\gamma & 0 \\ \sin\alpha \sin\beta \cos\gamma + \cos\alpha \sin\gamma & -\sin\alpha \sin\beta \sin\gamma + \cos\alpha \cos\gamma & 0 \\ \frac{-\cos\alpha \sin\beta \cos\gamma + \sin\alpha \sin\gamma}{f} & \frac{\cos\alpha \sin\beta \sin\gamma + \sin\alpha \cos\gamma}{f} & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix}$$
  • In some embodiments, the homography decomposition module 110 can constrain the rotation around the z-axis (γ) in the homography matrix H. By constraining the value of γ to zero, the homography matrix H can be simplified to the following form:
  • $$\begin{bmatrix} x'_h \\ y'_h \\ z'_h \end{bmatrix} = \begin{bmatrix} \cos\beta & 0 & 0 \\ \sin\alpha \sin\beta & \cos\alpha & 0 \\ \frac{-\cos\alpha \sin\beta}{f} & \frac{\sin\alpha}{f} & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix}$$
  • This parameterization has a favorable characteristic that both h01 and h10 have a small magnitude.
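  • For illustration, the “roll, pitch, and yaw” form can be sketched as below, with γ then set to zero to obtain the simplified form. The function name `rpy_homography`, the use of radians, and the sample angles are assumptions; the matrix entries follow the world-axis form given above.

```python
import math


def rpy_homography(alpha, beta, gamma, f):
    """Build H from world-axis rotations alpha (x), beta (y), gamma (z)."""
    ca, sa = math.cos(alpha), math.sin(alpha)
    cb, sb = math.cos(beta), math.sin(beta)
    cg, sg = math.cos(gamma), math.sin(gamma)
    return [
        [cb * cg, -cb * sg, 0.0],
        [sa * sb * cg + ca * sg, -sa * sb * sg + ca * cg, 0.0],
        [(-ca * sb * cg + sa * sg) / f, (ca * sb * sg + sa * cg) / f, 1.0],
    ]


# Sample angles are illustrative assumptions only.
alpha, beta, f = math.radians(10.0), math.radians(5.0), 1000.0
H = rpy_homography(alpha, beta, 0.0, f)   # gamma constrained to zero
```

    With γ = 0, h01 vanishes and h10 reduces to sin α sin β, which is small whenever either angle is small.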
  • The remainder of the present disclosure provides embodiments based on the following representation of the homography matrix H:
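  • $$\begin{bmatrix} x'_h \\ y'_h \\ z'_h \end{bmatrix} = \begin{bmatrix} h_{00} & h_{01} & 0 \\ h_{01} & h_{11} & 0 \\ h_{20} & h_{21} & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix}$$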
  • However, one of ordinary skill in the art can adapt the disclosed embodiments to use other homography matrix representations.
  • Given the homography matrix H, the goal of homography rectification is to remove this homography H from the distorted image on the image plane (x′h, y′h, z′h).
  • In some embodiments, the homography decomposition module 110 is configured to decompose the homography matrix H into a first transform A and a second transform B such that

  • $H = A * B$
  • where * indicates a concatenation operator. This relationship can be written out as follows:
  • $$H = \begin{bmatrix} a_{00} & a_{01} & 0 \\ a_{10} & a_{11} & 0 \\ a_{20} & a_{21} & 1 \end{bmatrix} * \begin{bmatrix} b_{00} & b_{01} & 0 \\ b_{10} & b_{11} & 0 \\ b_{20} & b_{21} & 1 \end{bmatrix}$$
  • In some sense, the first transform A relates the distorted image, on an image plane represented by the coordinate system (x′, y′), to an intermediate image, on an intermediate image plane represented by the coordinate system (xI, yI), and the second transform B relates the intermediate image, on the intermediate image plane represented by the coordinate system (xI, yI), to a rectified image, on an object plane represented by the coordinate system (x, y). FIG. 2 illustrates an application of the second transform B to distort a rectified image (e.g., an orthographic view of the object plane, which is the desired outcome of the two-stage rectification process) into an intermediate image, and an application of the first transform A to distort the intermediate image into the distorted image (e.g., input image to the two-stage rectification process).
  • Since the first transform A relates the distorted image with an intermediate image and the second transform B relates the intermediate image with a rectified image, the homography H can be removed from the distorted image to recover the rectified image by (1) removing the first transform A from the distorted image to generate an intermediate image and (2) removing the second transform B from the intermediate image to generate the rectified image.
  • In some embodiments, the second transform B can be represented as a concatenation of three operations: a transpose operation T that flips the x and y coordinates while leaving the z coordinate intact, the transposed second transform Byx, and another transpose operation T:

  • $B = T * B_{yx} * T$
  • where the transpose operation T can be represented as follows:
  • $$T = \begin{bmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix}$$
  • Therefore, the homography matrix H can be represented as follows:

  • $H = A * T * B_{yx} * T$
  • where
  • $$B_{yx} = \begin{bmatrix} b_{11} & b_{10} & 0 \\ b_{01} & b_{00} & 0 \\ b_{21} & b_{20} & 1 \end{bmatrix}.$$
  • The subscript “yx” is used to indicate that the transposed second transform Byx operates on flipped x-y coordinates. Note that the transposed second transform Byx is not merely a transpose of the second transform B (e.g., $\begin{bmatrix} b_{00} & b_{01} & 0 \\ b_{10} & b_{11} & 0 \\ b_{20} & b_{21} & 1 \end{bmatrix}$).
  • The transposed second transform Byx can be derived from the second transform B by rearranging the terms in B based on the fact that the x and y coordinates are swapped with the z coordinate remaining the same.
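  • The relationship B = T * Byx * T can be checked numerically with a minimal sketch, treating * as an ordinary 3×3 matrix product. The helper `matmul` and the placeholder integer coefficients are assumptions for illustration only.

```python
def matmul(P, Q):
    """3x3 matrix product P @ Q using plain lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


T = [[0, 1, 0],
     [1, 0, 0],
     [0, 0, 1]]

# Placeholder integers standing in for b00, b01, b10, b11, b20, b21.
b00, b01, b10, b11, b20, b21 = 2, 3, 5, 7, 11, 13
B = [[b00, b01, 0],
     [b10, b11, 0],
     [b20, b21, 1]]
B_yx = [[b11, b10, 0],
        [b01, b00, 0],
        [b21, b20, 1]]

recovered = matmul(matmul(T, B_yx), T)            # swaps rows, then columns
plain_transpose = [list(col) for col in zip(*B)]  # ordinary matrix transpose
```

    Here `recovered` equals B, while `plain_transpose` differs from `B_yx`, which illustrates why Byx is a coordinate swap rather than a matrix transpose.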
  • In some embodiments, the first transform A and the second transform B can be constrained to have the properties discussed above: that (1) pixels in a particular row of a source image are mapped to equidistant pixels (e.g., each pair of adjacent pixels is separated by an equal distance) in a fixed row of a destination image, hereinafter referred to as a first constraint, and that (2) a row of a source image is mapped to a row of a destination image, hereinafter referred to as a second constraint.
  • The first constraint can be introduced into the first transform A by constraining $\frac{dx'}{dx_I}$, which indicates the rate of change of x′ (the x-coordinate of the image plane) with respect to xI (the x-coordinate in the intermediate image plane). The x′ can be represented directly using the parameters of the first transform A:
  • $$x' = \frac{x'_h}{z'_h} = \frac{a_{00} x_I + a_{01} y_I}{a_{20} x_I + a_{21} y_I + 1}$$
  • Therefore, using the quotient rule of differentiation, the derivative of x′ with respect to xI can be represented as follows:
  • $$\frac{dx'}{dx_I} = \frac{a_{00}\,(a_{20} x_I + a_{21} y_I + 1) - (a_{00} x_I + a_{01} y_I)\,a_{20}}{(a_{20} x_I + a_{21} y_I + 1)^2} = \frac{a_{00} + a_{00} a_{21} y_I - a_{01} a_{20} y_I}{(a_{20} x_I + a_{21} y_I + 1)^2}$$
  • If this derivative $\frac{dx'}{dx_I}$ depends only on yI, then the pixels in a given row of the intermediate image map to locations in the distorted image that are separated by an identical distance. The derivative can be made to depend only on yI: the only term that depends on xI is found in the denominator. Therefore, in order to remove the dependency on xI, the multiplier a20 can be set to zero.
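  • As a worked step under the a20 = 0 setting, the derivative of x′ with respect to xI simplifies so that the per-row sampling step is explicit. This is a short derivation from the expression for x′ given above, not an additional formula from the disclosure:

```latex
\left.\frac{dx'}{dx_I}\right|_{a_{20}=0}
  = \frac{a_{00} + a_{00}\,a_{21}\,y_I}{(a_{21}\,y_I + 1)^2}
  = \frac{a_{00}\,(1 + a_{21}\,y_I)}{(a_{21}\,y_I + 1)^2}
  = \frac{a_{00}}{a_{21}\,y_I + 1}
```

    The result contains no xI, so within a row of the intermediate image the step between successive sample locations in the distorted image is constant, and it changes only from row to row.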
  • The second constraint can be introduced into the first transform A by introducing a constraint that the y-coordinate of the distorted image must depend only on the y-coordinate of the intermediate image. By setting a20=0 as above, the denominator zh′ is already known to depend only on yI. Therefore, if the numerator yh′ depends only on yI, then the y′ (e.g., the y-coordinate of the distorted image) would depend only on the y-coordinate of the intermediate image, thereby mapping the row of the intermediate image to a row of a distorted image.
  • By applying the first transform A to the intermediate image coordinates, the y-coordinate of the distorted image can be expressed as follows:

  • $y'_h = a_{10} x_I + a_{11} y_I$
  • Therefore, in order for the y-coordinate of the distorted image to depend only on yI, the coefficient of xI can be set to zero: a10=0.
  • Therefore, in summary, the first constraint and the second constraint require that the first transform A takes the following form:
  • $$A = \begin{bmatrix} a_{00} & a_{01} & 0 \\ 0 & a_{11} & 0 \\ 0 & a_{21} & 1 \end{bmatrix}$$
  • In some embodiments, the first constraint and the second constraint should be imposed on the second transform B as well. Since the transposed second transform $B_{yx} = \begin{bmatrix} b_{11} & b_{10} & 0 \\ b_{01} & b_{00} & 0 \\ b_{21} & b_{20} & 1 \end{bmatrix}$ is operating under the same condition as A, Byx should have the same form as A. Therefore, b21=0 per the first constraint and b01=0 per the second constraint. Therefore, Byx can be represented as follows:
  • $$B_{yx} = \begin{bmatrix} b_{11} & b_{10} & 0 \\ 0 & b_{00} & 0 \\ 0 & b_{20} & 1 \end{bmatrix}$$
  • The transposed second transform Byx can be related back to the second transform B as follows:
  • $$B = \begin{bmatrix} b_{00} & 0 & 0 \\ b_{10} & b_{11} & 0 \\ b_{20} & 0 & 1 \end{bmatrix}$$
  • In summary, the homography matrix H can be represented as follows:
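  • $$\begin{bmatrix} h_{00} & h_{01} & 0 \\ h_{01} & h_{11} & 0 \\ h_{20} & h_{21} & 1 \end{bmatrix} = \begin{bmatrix} a_{00} & a_{01} & 0 \\ 0 & a_{11} & 0 \\ 0 & a_{21} & 1 \end{bmatrix} * \begin{bmatrix} b_{00} & 0 & 0 \\ b_{10} & b_{11} & 0 \\ b_{20} & 0 & 1 \end{bmatrix}$$
  • Alternatively, the homography matrix H can also be represented as follows:
  • $$\begin{bmatrix} h_{00} & h_{01} & 0 \\ h_{01} & h_{11} & 0 \\ h_{20} & h_{21} & 1 \end{bmatrix} = \begin{bmatrix} a_{00} & a_{01} & 0 \\ 0 & a_{11} & 0 \\ 0 & a_{21} & 1 \end{bmatrix} * T * \begin{bmatrix} b_{11} & b_{10} & 0 \\ 0 & b_{00} & 0 \\ 0 & b_{20} & 1 \end{bmatrix} * T$$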
  • The decomposition of the homography matrix H can be used to analytically solve for a relationship between the coefficients of H, the coefficients of A, and the coefficients of B. Based on this analytic calculation, it can be shown that the first transform A and the transposed second transform Byx are related to the coefficients of H as follows:
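  • $$A = \begin{bmatrix} 1 & h_{01} & 0 \\ 0 & h_{11} & 0 \\ 0 & h_{21} & 1 \end{bmatrix}, \qquad B_{yx} = \begin{bmatrix} 1 & \frac{h_{01}}{h_{11}} & 0 \\ 0 & h_{00} - \frac{h_{01}^2}{h_{11}} & 0 \\ 0 & h_{20} - \frac{h_{21} h_{01}}{h_{11}} & 1 \end{bmatrix}$$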
  • Therefore, once the homography decomposition module 110 receives the coefficients of the homography matrix H, the homography decomposition module 110 can generate the first transform A and the transposed second transform Byx using the above relationship.
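  • A minimal sketch of this decomposition, assuming * is an ordinary 3×3 matrix product and H has the five-parameter form used above, is given below. The names `matmul` and `decompose` and the sample coefficients are hypothetical; the sketch rebuilds H as A * T * Byx * T to check the relationship numerically.

```python
def matmul(P, Q):
    """3x3 matrix product P @ Q using plain lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


T = [[0.0, 1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]


def decompose(H):
    """Return (A, B_yx) for H = [[h00,h01,0],[h01,h11,0],[h20,h21,1]]."""
    h00, h01, h11 = H[0][0], H[0][1], H[1][1]
    h20, h21 = H[2][0], H[2][1]
    if h11 == 0:
        raise ValueError("h11 must be non-zero for this decomposition")
    A = [[1.0, h01, 0.0],
         [0.0, h11, 0.0],
         [0.0, h21, 1.0]]
    B_yx = [[1.0, h01 / h11, 0.0],
            [0.0, h00 - h01 * h01 / h11, 0.0],
            [0.0, h20 - h21 * h01 / h11, 1.0]]
    return A, B_yx


# Sample coefficients are illustrative assumptions only.
H = [[0.9, 0.05, 0.0],
     [0.05, 1.1, 0.0],
     [1e-4, 2e-4, 1.0]]
A, B_yx = decompose(H)
H_rebuilt = matmul(matmul(matmul(A, T), B_yx), T)
max_error = max(abs(H_rebuilt[i][j] - H[i][j]) for i in range(3) for j in range(3))
```

    Both factors share the same sparsity pattern (zeros in the first column below the diagonal), which is what allows the two stages to reuse one row-oriented routine.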
  • In some embodiments, the first transform A and/or the second transform B can include a scaling factor that effectively down-samples the destination image when recovering the source image. The following decomposition adheres to the first constraint and the second constraint, as described above, while allowing for down-sampling by a down-sampling rate in the first transform A:
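  • $$\begin{bmatrix} h_{00} & h_{01} & 0 \\ h_{01} & h_{11} & 0 \\ h_{20} & h_{21} & 1 \end{bmatrix} = \begin{bmatrix} s_{Ax} & s_{Ay} h_{01} & 0 \\ 0 & s_{Ay} h_{11} & 0 \\ 0 & s_{Ay} h_{21} & 1 \end{bmatrix} * \begin{bmatrix} \frac{1}{s_{Ax}}\left(h_{00} - \frac{h_{01}^2}{h_{11}}\right) & 0 & 0 \\ \frac{1}{s_{Ay}}\left(\frac{h_{01}}{h_{11}}\right) & \frac{1}{s_{Ay}} & 0 \\ h_{20} - \frac{h_{21} h_{01}}{h_{11}} & 0 & 1 \end{bmatrix}$$
  • where sAx and sAy refer to sampling factors in the x dimension and the y dimension, respectively. Alternatively, the homography matrix H can be represented as follows:
  • $$\begin{bmatrix} h_{00} & h_{01} & 0 \\ h_{01} & h_{11} & 0 \\ h_{20} & h_{21} & 1 \end{bmatrix} = \begin{bmatrix} s_{Ax} & s_{Ay} h_{01} & 0 \\ 0 & s_{Ay} h_{11} & 0 \\ 0 & s_{Ay} h_{21} & 1 \end{bmatrix} * T * \begin{bmatrix} \frac{1}{s_{Ay}} & \frac{1}{s_{Ay}}\left(\frac{h_{01}}{h_{11}}\right) & 0 \\ 0 & \frac{1}{s_{Ax}}\left(h_{00} - \frac{h_{01}^2}{h_{11}}\right) & 0 \\ 0 & h_{20} - \frac{h_{21} h_{01}}{h_{11}} & 1 \end{bmatrix} * T$$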
  • In some embodiments, the sampling factors sAx and/or sAy may be less than 1 to effectively oversample the distorted image in the first stage. This way, the information in the distorted image can be preserved for processing in the second stage.
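  • The scaled decomposition can be sketched in the same way, again assuming * is a 3×3 matrix product. The name `decompose_scaled` and the sample values of the sampling factors are assumptions; the sketch only checks that the sampling factors cancel when the two factors are concatenated.

```python
def matmul(P, Q):
    """3x3 matrix product P @ Q using plain lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def decompose_scaled(H, s_ax, s_ay):
    """Return (A, B) with sampling factors folded in, so that A * B == H."""
    h00, h01, h11 = H[0][0], H[0][1], H[1][1]
    h20, h21 = H[2][0], H[2][1]
    A = [[s_ax, s_ay * h01, 0.0],
         [0.0, s_ay * h11, 0.0],
         [0.0, s_ay * h21, 1.0]]
    B = [[(h00 - h01 * h01 / h11) / s_ax, 0.0, 0.0],
         [(h01 / h11) / s_ay, 1.0 / s_ay, 0.0],
         [h20 - h21 * h01 / h11, 0.0, 1.0]]
    return A, B


# Sample coefficients and sampling factors are illustrative assumptions only.
H = [[0.9, 0.05, 0.0],
     [0.05, 1.1, 0.0],
     [1e-4, 2e-4, 1.0]]
A, B = decompose_scaled(H, s_ax=2.0, s_ay=0.5)
H_rebuilt = matmul(A, B)
max_error = max(abs(H_rebuilt[i][j] - H[i][j]) for i in range(3) for j in range(3))
```

    In this example a00 = 2, so one pixel step along a row of the intermediate image spans two pixels of the distorted image, while the second transform compensates so that the overall homography is unchanged.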
  • In some embodiments, the homography removal module 112 can efficiently remove the first transform A and the transposed second transform Byx from the distorted image. The job of the homography removal module 112 in each stage of the two-stage rectification process is to remove a limited homography (e.g., a first transform A or a transposed second transform Byx) with only four non-trivial coefficients:
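  • $$A = \begin{bmatrix} s_{Ax} & s_{Ay} h_{01} & 0 \\ 0 & s_{Ay} h_{11} & 0 \\ 0 & s_{Ay} h_{21} & 1 \end{bmatrix}, \quad \text{or} \quad B_{yx} = \begin{bmatrix} \frac{1}{s_{Ay}} & \frac{1}{s_{Ay}}\left(\frac{h_{01}}{h_{11}}\right) & 0 \\ 0 & \frac{1}{s_{Ax}}\left(h_{00} - \frac{h_{01}^2}{h_{11}}\right) & 0 \\ 0 & h_{20} - \frac{h_{21} h_{01}}{h_{11}} & 1 \end{bmatrix}$$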
  • Because the first transform A and the transposed second transform Byx have the same structure, the homography removal module 112 can, in effect, perform the same procedure twice, once for the first transform A and once for the transposed second transform Byx. Therefore, the first stage and the second stage of the two-stage rectification process can comprise substantially identical operations.
  • FIG. 3 illustrates a two-stage image mapping process in accordance with some embodiments. The two-stage image mapping process uses the decomposition of the homography, as determined by the homography decomposition module 110, to perform rectification in two stages. In the first stage of the two-stage image mapping process (identified as steps 302-304), the homography removal module 112 can process pixels of the distorted image along the x-axis, and in the second stage of the two-stage image mapping process (identified as steps 306-308), the homography removal module 112 can process pixels of the intermediate image along the y-axis. It is worth noting that, thanks to the transpose operation that occurs between the first stage and the second stage, the first stage of the multi-stage rectification process is substantially similar to the second stage of the multi-stage rectification process.
  • Prior to initiating the steps in FIG. 3, the homography removal module 112 can be configured to receive the distorted image, and the homography represented as a concatenation of a first transform A and a second transform B (or a transposed second transform). The homography removal module 112 can receive the homography representation from the homography decomposition module 110. FIG. 4 illustrates an example of a distorted image that has been subject to perspective distortion. The perspective distortion manifests itself by transforming two vertical, parallel lines 402, 404 into two non-parallel lines and two horizontal, parallel lines 406, 408 into two non-parallel lines.
  • In step 302, the homography removal module 112 is configured to remove the first transform A from the distorted image by processing the distorted image along the x-axis, thereby generating the intermediate image. In step 302 a, the homography removal module 112 can optionally down-sample the intermediate image. In some embodiments, the step 302 and the step 302 a can be performed simultaneously. For example, the first transform A can incorporate the down-sampling factor, as discussed above, so that by removing the first transform A from the distorted image, the homography removal module 112 can in effect also perform the down-sampling operation.
  • In step 302, the homography removal module 112 can be configured to determine, for each pixel of an intermediate image, a corresponding location in the distorted image. For example, the homography removal module 112 can access one of the pixels in a row of an intermediate image, and determine which location within the distorted image corresponds to the one of the pixels in the intermediate image using the first transform A. In some cases, this location within the distorted image can have a sub-pixel coordinate value. For example, the location within the distorted image can have a coordinate of (2.24, 3.1).
  • Because the first transform A is subject to the first and second constraints, the homography removal module 112 knows in advance that a pixel in a row of the intermediate image always corresponds to a location in a predetermined y-coordinate of the distorted image. Therefore, the pattern in which the homography removal module 112 should access the distorted image in memory 106 is predictable and deterministic. Thus, the homography removal module 112 can use a standard ping-pong two-buffer system and the DMA controller 114 to read and store a row or groups of rows of an image in parallel.
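  • The predictability described above can be illustrated with a minimal sketch that, for one row of the intermediate image, computes the single y-coordinate of the distorted image that the row maps to, together with the starting x location and the constant x step. The names `row_sampling` and `source_locations` and the sample coefficients are assumptions, and coordinates are taken relative to the chosen origin.

```python
def row_sampling(A, y_i):
    """For row y_i of the intermediate image, return (y_src, x_start, x_step)."""
    a00, a01 = A[0][0], A[0][1]
    a11, a21 = A[1][1], A[2][1]
    z = a21 * y_i + 1.0                  # depends on the row only (a20 == 0)
    if z == 0:
        raise ZeroDivisionError("row maps to infinity")
    return a11 * y_i / z, a01 * y_i / z, a00 / z


def source_locations(A, y_i, width):
    """Locations in the distorted image for pixels x_i = 0..width-1 of row y_i."""
    y_src, x_start, x_step = row_sampling(A, y_i)
    return [(x_start + x_i * x_step, y_src) for x_i in range(width)]


# Sample coefficients are illustrative assumptions only.
A = [[1.0, 0.25, 0.0],
     [0.0, 1.0, 0.0],
     [0.0, 0.25, 1.0]]
locations = source_locations(A, y_i=4, width=4)
```

    For this example every location shares y′ = 2.0 and the x′ values advance by a fixed 0.5, so only one source row needs to be fetched to fill the destination row.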
  • Once the homography removal module 112 determines the correspondence between a pixel in an intermediate image and a location in a distorted image, the homography removal module 112 can determine the image value for that pixel in the intermediate image. To determine the image value for that pixel, the homography removal module 112 can use the image value at the corresponding location in the distorted image.
  • When the corresponding location falls on a pixel grid of the distorted image, the homography removal module 112 can filter the image values surrounding the corresponding location in the distorted image. In some cases, the homography removal module 112 may simply copy the image value of the corresponding location in the distorted image. When the corresponding location does not fall on the grid of the distorted image, the homography removal module 112 can determine the image value of the pixel in the intermediate image by interpolating image values of pixels surrounding the corresponding location in the distorted image.
  • For example, the homography removal module 112 can convolve pixels surrounding the corresponding location in the distorted image with a kernel to interpolate image values of pixels surrounding the corresponding location in the distorted image. In some cases, the kernel can also be designed to perform appropriate smoothing (e.g., to satisfy the Nyquist sampling constraint) along a row of the distorted image. In some embodiments, the kernel can be a truncated sinc function; in other embodiments, the kernel can be a truncated Gaussian function.
  • In some embodiments, the homography removal module 112 can predetermine a set of kernels to be used for pre-selected sub-pixel variations from the grid locations. When the corresponding location in the distorted image is off from the grid in the x-axis by 0.1 pixel, then the homography removal module 112 can use a first one of the predetermined set of kernels; when the corresponding location in the distorted image is off from the grid in the x-axis by 0.3 pixel, then the homography removal module 112 can use a second one of the predetermined set of kernels, etc. In some embodiments, each kernel in the set of kernels has the same functional form.
  • For example, the set of kernels can include the following:
  • K1: 36, 56, 36, 0
    K2: 30, 56, 38, 4
    K3: 25, 54, 41, 8
    K4: 21, 50, 45, 12
    K5: 16, 48, 48, 16
    K6: 12, 45, 50, 21
    K7:  8, 41, 54, 25
    K8:  4, 38, 56, 30

    These kernels K1-K8 all have the same functional form, but each kernel is associated with a different sub-pixel coordinate (also referred to as a sub-pixel phase.)
  • When the homography removal module 112 is to filter a location that is on a pixel grid (e.g., 0 sub-pixel phase), then the homography removal module 112 can use K1 to filter pixels around the location. Note that K1 is symmetric around the center. When the homography removal module 112 is to filter a location that is separated from the pixel grid by ⅛ sub-pixel phase, then the homography removal module 112 can use K2 to filter pixels around the location. For example, the homography removal module 112 can directly multiply K2 to a sequence of four image pixels in order to simultaneously perform sub-pixel interpolation and filtering. When the homography removal module 112 is to filter a location that is separated from the pixel grid by ½ sub-pixel phase, then the homography removal module 112 can use K5 to filter pixels around the location. Because an offset of a ½ sub-pixel phase refers to identifying an image value at the exact boundary of two adjacent pixels, K5 is symmetric. Note that K2 and K8, which correspond to a sub-pixel phase of ⅛ and ⅞, respectively, have the same values but are mirrors of each other. This makes sense because ⅛ is the same as −⅞ and −⅛ is the same as ⅞. The same is true for the pairs “¼, ¾”, and “⅜, ⅝”.
  • In some embodiments, when a location on the distorted image does not have a sub-pixel phase that exactly matches the sub-pixel phase of one of the kernels in the set, then the homography removal module 112 can use a kernel associated with a sub-pixel phase that is closest to that of the location. For example, when the sub-pixel phase of the location is 1/9, the homography removal module 112 can use K2 to filter that location, since 1/9 is closer to ⅛ than to any other sub-pixel phase associated with the kernels.
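  • The nearest-phase kernel selection described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names `select_kernel` and `filter_at` are hypothetical, the division by 128 relies only on the fact that each listed kernel sums to 128, and the tap footprint (one pixel to the left of the anchor pixel through two pixels to the right) is an assumption inferred from the symmetry of K1 and K5.

```python
import math

# Kernel bank copied from the table above; row i is assumed to serve
# sub-pixel phase i/8 (K1 -> 0, K2 -> 1/8, ..., K8 -> 7/8).
KERNELS = [
    (36, 56, 36, 0),
    (30, 56, 38, 4),
    (25, 54, 41, 8),
    (21, 50, 45, 12),
    (16, 48, 48, 16),
    (12, 45, 50, 21),
    (8, 41, 54, 25),
    (4, 38, 56, 30),
]
KERNEL_SUM = 128  # every row above sums to 128


def select_kernel(x):
    """Return (base, kernel) for a real-valued location x.

    base is the integer pixel the kernel is anchored to; the four taps are
    assumed to cover pixels base-1 .. base+2.
    """
    base = math.floor(x)
    index = round((x - base) * len(KERNELS))  # nearest of the 8 phases
    if index == len(KERNELS):  # phase rounds up to the next grid pixel
        base, index = base + 1, 0
    return base, KERNELS[index]


def filter_at(row, x):
    """Interpolate and filter `row` at x (caller keeps x away from the edges)."""
    base, kernel = select_kernel(x)
    taps = row[base - 1:base + 3]
    return sum(k * p for k, p in zip(kernel, taps)) / KERNEL_SUM
```

    Because the kernel already encodes the sub-pixel offset, no separate interpolation step is needed before filtering.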
  • In some embodiments, the homography removal module 112 can perform the convolution operation using a single-instruction-multiple-data (SIMD) processing unit. In particular, the convolution operation can be easily performed using a SIMD multiply and/or a SIMD dot-product instruction operable by a SIMD processing unit.
  • In some embodiments, the convolution operation can involve a two-dimensional kernel. When the kernel is separable in the spatial axes (e.g., x and y axes), the homography removal module 112 can perform the convolution in two separate steps: (1) convolve along the x-axis (e.g., a row of an image), and (2) convolve along the y-axis (e.g., a column of an image).
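  • As an illustration of the two-step approach, the sketch below convolves along rows and then along columns, and includes a direct two-dimensional convolution for comparison. It assumes NumPy, a separable kernel formed as the outer product of two one-dimensional kernels, and output restricted to the fully overlapped ("valid") region; all function names are hypothetical.

```python
import numpy as np


def convolve_rows(image, kernel):
    """Convolve every row of `image` with a 1-D kernel ('valid' region only)."""
    return np.array([np.convolve(row, kernel, mode="valid") for row in image])


def separable_convolve(image, kx, ky):
    """Two-step convolution: along x (rows) first, then along y (columns)."""
    x_result = convolve_rows(image, kx)        # step (1): x-axis
    return convolve_rows(x_result.T, ky).T     # step (2): y-axis


def direct_convolve(image, kernel2d):
    """Reference 2-D 'valid' convolution, written out for comparison."""
    kh, kw = kernel2d.shape
    flipped = kernel2d[::-1, ::-1]
    out_h, out_w = image.shape[0] - kh + 1, image.shape[1] - kw + 1
    out = np.empty((out_h, out_w))
    for r in range(out_h):
        for c in range(out_w):
            out[r, c] = np.sum(image[r:r + kh, c:c + kw] * flipped)
    return out
```

    For a kernel of size m×n, the two-step form needs roughly m+n multiplies per output pixel instead of m·n.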
  • With respect to the first step, because the first transform A is subject to the first constraint, pixels on a row of an intermediate image are mapped to locations that are equidistant from adjacent locations within a fixed y-coordinate of the distorted image. Therefore, the homography removal module 112 can apply a single kernel function to all corresponding locations within a fixed y-coordinate of the distorted image. Because the sub-pixel phase of locations within the distorted image may differ from one another, the homography removal module 112 can select, based on the sub-pixel phase of each location, one of the kernels in a set corresponding to that single kernel function. This can avoid the homography removal module 112 needing to generate the kernel during convolution along the fixed row of the distorted image. In some embodiments, the homography removal module 112 can use the same kernel function for different rows; in other embodiments, the homography removal module 112 can use different kernel functions for different rows. The result of the convolution operation along the x-axis (also referred to as an x-convolution result) can be stored in a temporary buffer.
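  • A minimal sketch of the row-wise step: because the sampled locations within a row are equidistant, each location is obtained by adding a constant step to the previous one, and only the sub-pixel phase changes the kernel choice. The function name `resample_row`, the parameters `x_start` and `x_step`, and the tap footprint are assumptions made for illustration; the kernel bank is the example set K1-K8 listed earlier.

```python
import math

# Example kernel bank K1-K8 (row i serves phase i/8); each row sums to 128.
KERNELS = [(36, 56, 36, 0), (30, 56, 38, 4), (25, 54, 41, 8), (21, 50, 45, 12),
           (16, 48, 48, 16), (12, 45, 50, 21), (8, 41, 54, 25), (4, 38, 56, 30)]


def resample_row(row, x_start, x_step, count):
    """Filter `row` at x_start, x_start + x_step, ... (count equidistant locations)."""
    out = []
    for k in range(count):
        x = x_start + k * x_step          # constant spacing along the row
        base = math.floor(x)
        index = round((x - base) * 8)     # nearest precomputed phase
        if index == 8:                    # rounds up to the next grid pixel
            base, index = base + 1, 0
        taps = row[base - 1:base + 3]     # assumed tap footprint: base-1 .. base+2
        out.append(sum(c * p for c, p in zip(KERNELS[index], taps)) / 128)
    return out
```

    The caller is assumed to choose `x_start`, `x_step`, and `count` so that every tap stays inside the row; no kernel is generated inside the loop.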
  • When the homography removal module 112 completes the convolution along the x-axis, the homography removal module 112 can perform the convolution along the y-axis by convolving a y-directional kernel on the x-convolution result stored in the temporary buffer. In some embodiments, the temporary buffer can maintain a predetermined number of rows associated with the x-convolution result. This predetermined number of rows can be determined based on a size of the y-directional kernel. For example, when a y-directional kernel has a length of 5 pixels, the temporary buffer can maintain 5 rows associated with the x-convolution result.
  • In some embodiments, the temporary buffer can be a rolling buffer (e.g., a circular buffer) that stores a row of the x-convolution result until it is no longer needed to compute the y-convolution. For example, when the homography removal module 112 computes a y-convolution at a pixel having a y-coordinate of j and the y-directional kernel has five pixels, then the homography removal module 112 can store, in the temporary buffer, five rows of the x-convolution result, identified by y-coordinates: j+2, j+1, j, j−1, j−2. When the homography removal module 112 completes the y-convolution of all pixels having the y-coordinate of j, the homography removal module 112 can increase the target y-coordinate by one pixel and compute the y-convolution of all pixels having the y-coordinate of j+1. In this case, the homography removal module 112 can use five rows of the x-convolution result, identified by y-coordinates: j+3, j+2, j+1, j, j−1. Therefore, the temporary buffer no longer needs the row of the x-convolution result having the y-coordinate of j−2. In this case, the temporary buffer removes the row of the x-convolution result having the y-coordinate of j−2 and replaces it with the row of the x-convolution result having the y-coordinate of j+3.
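  • The rolling-buffer behavior can be sketched with a fixed-length double-ended queue, which discards the oldest row automatically when a new one arrives. This is an illustrative sketch only: the function name is hypothetical, rows are plain Python lists, and the coefficients are applied to the buffered rows from oldest to newest without flipping.

```python
from collections import deque


def y_convolve_streaming(x_rows, ky):
    """Yield y-convolved rows while keeping only len(ky) x-convolved rows.

    x_rows: iterable of rows (lists of numbers) from the x-convolution.
    ky:     y-directional kernel coefficients, applied oldest row first.
    """
    window = deque(maxlen=len(ky))   # rolling buffer; the oldest row drops out
    for row in x_rows:
        window.append(row)
        if len(window) == len(ky):
            yield [sum(c * r[i] for c, r in zip(ky, window))
                   for i in range(len(row))]
```

    With a five-tap kernel, the first output row is produced once five x-convolved rows are available; each later input row evicts exactly one buffered row.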
  • In some embodiments, the y-directional kernel can be identical for each pixel along the row of the output image. Therefore, this y-directional convolution can be implemented by multiplying every pixel of the x-convolved row by the kernel coefficient corresponding to that row, and then summing along columns. This operation can be efficiently implemented using a SIMD processing unit, for example, a SIMD multiply and/or SIMD add instructions.
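  • The multiply-then-sum formulation can be illustrated with whole-array operations, which here stand in for SIMD multiply and add instructions. The use of NumPy and the function name are assumptions for illustration.

```python
import numpy as np


def y_convolve_rows(buffered_rows, ky):
    """Combine len(ky) buffered x-convolved rows into one output row.

    Each buffered row is scaled by its own coefficient, then the scaled rows
    are summed column by column.
    """
    rows = np.asarray(buffered_rows, dtype=np.float64)   # shape (taps, width)
    coeffs = np.asarray(ky, dtype=np.float64)[:, None]   # shape (taps, 1)
    return (rows * coeffs).sum(axis=0)                   # shape (width,)
```

    Every pixel in a buffered row is multiplied by the same coefficient, so the work per row is a uniform scale followed by an accumulate, with no per-pixel kernel lookup.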
  • The constraints on the first transform A do not guarantee any special characteristics about the pixel sampling along the y-axis. Therefore, each row has its own kernel function that can be computed, for example, by (1) mapping a symmetric kernel function in the object plane to a corresponding not-necessarily-symmetric kernel function in the image plane, and (2) integrating it into pixel bins. Such a kernel inherently allows correct sampling at non-integer positions in the y-axis, and also allows the sampling distance between any two rows to be different without loss of performance. Optionally, all the kernel functions used for convolution along the y-axis can be precomputed during a training phase.
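  • One way to picture the two-step kernel construction is numerical integration: sample the image-plane axis finely, map each sample to the object plane, evaluate the symmetric kernel function there, and accumulate the values into pixel bins. The sketch below makes several assumptions that are not in the disclosure: a triangle function as the symmetric kernel, a caller-supplied mapping `image_to_object`, pixel bins spanning [n, n+1), and midpoint-rule integration.

```python
import numpy as np


def triangle(u):
    """Symmetric kernel function in the object plane (an assumed choice)."""
    return np.maximum(0.0, 1.0 - np.abs(u))


def row_kernel(y_center, image_to_object, half_width=4, samples_per_pixel=64):
    """Integrate the mapped kernel into pixel bins around image row y_center.

    image_to_object: hypothetical callable mapping an image-plane y coordinate
    to an object-plane coordinate. Returns (first_row, weights), normalized so
    the weights sum to 1.
    """
    first_row = int(np.floor(y_center)) - half_width
    n_bins = 2 * half_width + 1
    # Midpoint samples covering pixel bins [first_row, first_row + n_bins).
    offsets = (np.arange(n_bins * samples_per_pixel) + 0.5) / samples_per_pixel
    y = first_row + offsets
    u = image_to_object(y) - image_to_object(y_center)   # object-plane offset
    values = triangle(u)
    weights = values.reshape(n_bins, samples_per_pixel).sum(axis=1)
    return first_row, weights / weights.sum()
```

    When the mapping is a uniform scale, the resulting weights are symmetric; under a projective mapping they generally are not, which matches the "not-necessarily-symmetric" kernel described above. Since each row's weights depend only on the mapping, they can be computed once ahead of time.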
  • FIG. 5 illustrates an example of an intermediate image generated by removing the first transform A from the distorted image of FIG. 4 in accordance with some embodiments. Note that the two vertical, parallel lines 402, 404 are now parallel in the intermediate image. However, the first stage has not removed the distortion of the two horizontal parallel lines 406 and 408.
  • As shown in FIG. 5, the intermediate image can have a skewed trapezoid shape (e.g., two sides are parallel; the other two sides may not be). In some cases, the relevant start points and run lengths in each row in the distorted image, x-convolved row, and each row in the intermediate image can optionally be precomputed in a training step.
  • To remove the distortion of the two horizontal parallel lines 406 and 408, the homography removal module 112 can move onto the second stage of the two-stage image mapping process. However, as discussed above, prior to initiating the second stage, in step 304, the homography removal module 112 is configured to transpose the intermediate image.
  • In some embodiments, the homography removal module 112 can perform the transpose operation using a “vertical” buffer. For example, output values of the y-directional convolution (e.g., the intermediate image) can be stored as a temporary image of a fixed width, preferably that fits into near-to-chip memory (e.g. user-controlled Level-2 cache). These output values are written to the buffer in a column order, instead of a row order, and hence the name vertical buffer. Because the buffer can be entirely contained in near-to-chip memory, the homography removal module 112 need not store or access any external memory devices, which improves the memory access time. In effect, this embodiment performs the transpose operation in a local manner.
  • In some embodiments, the homography removal module 112 can alternate writing to one of two vertical buffers in a “ping-pong” fashion so that the previous column, written to a first vertical buffer, can be written out to external memory while the current column is being written to a second vertical buffer.
  • In some embodiments, the vertical buffer data can be transferred to a region in external memory using a DMA controller 114. The region in the external memory can be large enough to accommodate the full rectified image.
  • In some embodiments, the DMA controller 114 can transfer the vertical buffer data to external memory using two-dimensional DMA. The two-dimensional DMA is able to automatically skip over bytes in external memory that do not correspond to a vertical section of the image, which conceptually matches the data in the vertical buffer. Unlike the local vertical buffer, the external memory does not store adjacent sections (e.g., tiles) of the result image in consecutive memory locations, which may deteriorate the read and/or write speed. In such cases, the two-dimensional DMA can handle the read/write operation efficiently at a minimal additional cost (as long as the width of the local vertical buffer is sufficiently large).
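  • The vertical buffer and the two-dimensional transfer can be sketched together. In this illustration, a flat NumPy array stands in for external memory, a small two-dimensional array stands in for the near-to-chip vertical buffer, and a Python loop mimics what a two-dimensional DMA would do in hardware (copy a short run, then skip ahead by the line pitch). All names and the buffer width are hypothetical, and the alternation between two buffers is omitted for brevity.

```python
import numpy as np


def dma_2d_write(external, pitch, col0, tile):
    """Mimic a two-dimensional DMA: copy each tile row, then skip to the next line.

    external: flat 1-D array standing in for external memory.
    pitch:    elements per line of the full image held in external memory.
    """
    rows, width = tile.shape
    for r in range(rows):
        start = r * pitch + col0
        external[start:start + width] = tile[r]


def transpose_via_vertical_buffer(produce_rows, height, width, buf_width=4):
    """Transpose a row stream using a fixed-width buffer written in column order."""
    external = np.zeros(width * height)            # holds the transposed image
    vbuf = np.zeros((width, buf_width))
    filled = 0
    for j, row in enumerate(produce_rows):
        vbuf[:, filled] = row                      # a row goes in as a column
        filled += 1
        if filled == buf_width or j == height - 1:
            dma_2d_write(external, height, j + 1 - filled, vbuf[:, :filled])
            filled = 0
    return external.reshape(width, height)
```

    Each flush writes a vertical section of the transposed image; a wider buffer means longer contiguous runs per line and fewer transfers.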
  • In step 306, once the intermediate image is transposed, the homography removal module 112 is configured to remove the second transformation from the intermediate image by processing pixels in the x-axis of the transposed intermediate image (e.g., the y-axis of the intermediate image). Subsequently, in step 306 a, the homography removal module 112 is optionally configured to down-sample the result of step 306, and in step 308, the homography removal module 112 is configured to transpose the result of step 306 (or the result of step 306 a if that is performed). In some embodiments, in steps 306/306 a, the homography removal module 112 can use similar operations as steps 302/302 a, and in step 308, the homography removal module 112 can use a similar process as step 304.
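  • The overall sequence of steps 302, 304, 306, and 308 can be sketched for the simplest case, a pure scaling, in which every row uses the same equidistant sample locations. Linear interpolation via NumPy is used here only as a stand-in for the kernel-based filtering described earlier, and the function names and parameters (`sx`, `sy`, `out_w`, `out_h`) are hypothetical. For a general homography, each row would instead have its own start location and step.

```python
import numpy as np


def remove_1d(image, locations_per_row):
    """Resample each row at the given x locations (linear interpolation stand-in)."""
    xs = np.arange(image.shape[1])
    return np.array([np.interp(locs, xs, row)
                     for row, locs in zip(image, locations_per_row)])


def rectify_scale(distorted, sx, sy, out_w, out_h):
    """Steps 302-308 for a pure scaling: rectified (i, j) maps to distorted (sx*i, sy*j)."""
    h = distorted.shape[0]
    # Step 302: process along x; every row uses the same equidistant locations.
    intermediate = remove_1d(distorted, [sx * np.arange(out_w)] * h)
    # Step 304: transpose so the former y-axis becomes the x-axis.
    transposed = intermediate.T
    # Step 306: process along x of the transposed intermediate image.
    transposed_rectified = remove_1d(transposed, [sy * np.arange(out_h)] * out_w)
    # Step 308: transpose back to obtain the rectified image.
    return transposed_rectified.T
```

    Both stages reuse the same row-oriented routine; the transposes are what allow the second stage to read memory along rows as well.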
  • FIG. 6 illustrates an example of a rectified image generated by removing the second transform B from the intermediate image of FIG. 5 in accordance with some embodiments. Note that the two horizontal, parallel lines 406, 408, as well as the two vertical, parallel lines 402, 404 are parallel in the rectified image. Therefore, the resulting rectified image has all perspective distortion rectified, and is down-sampled by the target down-sampling rate.
  • As discussed above, the homography removal module 112 can down-sample the distorted image during homography rectification, and the homography removal module 112 can perform the down-sampling in one or both of the two stages of the multi-stage rectification process. In some embodiments, when there is a target down-sampling rate, it can be advantageous to down-sample as much as possible in step 302 a (e.g., in the first stage), as this can accelerate the first stage and further accelerate the second stage of the multi-stage rectification process. In each stage of the multi-stage rectification process, the image can be down-sampled independently in the x or y direction.
  • In some embodiments, the rectified image can be provided to a pattern detection system so that the pattern detection system can determine whether the rectified image includes a particular pattern. Because pattern detection systems generally operate more efficiently when the source image does not include non-linear distortion, the disclosed homography rectification process can improve the efficiency of pattern matching. In some cases, the pattern detection system can use a simple correlation-based pattern matching technique to determine whether the rectified image includes a particular pattern. In other cases, the pattern detection system can use embodiments of pattern detection techniques disclosed in U.S. Pat. No. 7,016,539, entitled “Method for fast, robust, multi-dimensional pattern recognition,” issued on Mar. 21, 2006, which is herein incorporated by reference in its entirety.
  • In some embodiments, when the homography is not known in advance, the distorted image can be rectified using one of a plurality of hypothesized homography transforms, and the rectified image can be provided to a pattern detection system to determine if a particular pattern exists under the one of a plurality of hypothesized homography transforms. This process can be iterated for other hypothesized homography transforms until the pattern detection system detects the particular pattern from the rectified image.
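  • The iteration over hypothesized homography transforms can be sketched as a simple search loop. The callables `rectify` and `detect` are hypothetical placeholders for the rectification process and the pattern detection system; nothing here reflects a specific interface from the disclosure.

```python
def find_pattern(distorted, hypotheses, rectify, detect):
    """Try hypothesized homographies until the detector reports the pattern.

    rectify(distorted, H) and detect(rectified) are hypothetical callables
    standing in for the rectification process and the pattern detection system.
    Returns (H, rectified) for the first hit, or None if no hypothesis matches.
    """
    for H in hypotheses:
        rectified = rectify(distorted, H)
        if detect(rectified):
            return H, rectified
    return None
```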
  • In some embodiments, the disclosed multi-stage rectification process can be used merely to perform affine image sampling, without removing a general homographic perspective distortion. For example, to limit the homography matrix H to an affine transformation, the parameters h20 and h21 can be set to zero. In other embodiments, the disclosed multi-stage rectification process can be used merely to remove image scaling. For example, the disclosed multi-stage rectification process can be configured to remove image scaling operations by setting the parameters h10 and h01 of the homography matrix H to zero, in addition to h20 and h21.
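  • The parameter restrictions can be illustrated on a 3×3 array. The sketch assumes that h_rc is stored at row r, column c of the matrix, so that h20 and h21 are the first two entries of the bottom row; the function names are hypothetical.

```python
import numpy as np


def to_affine(H):
    """Copy of H with h20 and h21 set to zero (assumes H[r, c] holds h_rc)."""
    A = np.array(H, dtype=np.float64)
    A[2, 0] = A[2, 1] = 0.0
    return A


def to_scaling(H):
    """Copy of H with h10 and h01 zeroed in addition to h20 and h21."""
    S = to_affine(H)
    S[1, 0] = S[0, 1] = 0.0
    return S
```

    In the scaling-only form the remaining off-diagonal entries, h02 and h12, are untouched, so a translation can still be present alongside the per-axis scale factors.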
  • Even when the disclosed multi-stage rectification process is used to merely remove affine transformation or image scaling operations, the disclosed multi-stage rectification process can be computationally faster than the prior known methods for removing affine transformation or image scaling operation. The computational benefit is derived from the way a kernel is applied to a row of an image. As discussed above, the homography removal module 112 is configured to apply a single kernel function to a row of an image. The homography removal module 112 can maintain a set of kernels that corresponds to this kernel function, where each kernel in the set corresponds to a particular sub-pixel phase. This way, when the homography removal module 112 applies the single kernel function to a row of an image, at each location in the row, the homography removal module 112 can select one of these kernels based on a sub-pixel phase of the location. Because the set of kernels corresponding to sub-pixel phases are already pre-determined, the homography removal module 112 can remove affine transformation or image scaling by applying a selected one of these kernels. When the sub-pixel phase of a location does not exactly correspond to one of the kernels in the set, the homography removal module 112 can select a kernel that is associated with a sub-pixel phase closest to that of the location. This removes nearly all interpolation operations, which tend to be computationally expensive.
  • In case the homography removal module 112 is configured to remove image scaling only, the homography removal module 112 can further improve the accuracy by maintaining a large number of kernels, each kernel adapted to a finer sub-pixel variation (e.g., sub-pixel phase) from a pixel grid. When the homography removal module 112 performs the down-sampling, the homography removal module 112 can determine, for each pixel of a rectified image, the sub-pixel phase of the corresponding location in the distorted image, and use the sub-pixel phase to select one of the predetermined set of kernels. Subsequently, the homography removal module 112 can apply the selected kernel for the corresponding pixel in the distorted image to remove the image scaling.
  • In some embodiments, when the homography removal module 112 is configured to remove image scaling only, the homography removal module 112 can perform the operation by only using the first stage of the multi-stage rectification process.
  • In some embodiments, when the homography transform H between a rectified image and a distorted image satisfies the first and second constraints described above, the homography removal module 112 may use only one of the stages in the multi-stage rectification process to remove the perspective distortion from the distorted image. For example, a distorted image may be subject to a perspective distortion that results from a rotation of an image sensor around an x-axis of the image sensor (e.g., a row of image sensor bit-cells). In this case, an image sensor may be positioned such that the top of the object plane is tilted away from the image sensor as if the image sensor is rotated around an axis that is parallel to one of the image sensor axes. This relationship between the image sensor plane and the object plane may be represented by a homography transform H that actually satisfies the first and second constraints: that (1) pixels in a particular row of a rectified image are mapped to equidistant pixels in a fixed row of a distorted image and that (2) a row of the rectified image is mapped to a row of the distorted image. Therefore, in this case, the homography removal module 112 can be configured to perform the rectification operation by only using one of the stages of the multi-stage rectification process.
  • In some embodiments, the multi-stage rectification process can be used to align images captured by multiple cameras. In some cases, a machine vision system can include a plurality of cameras that view different parts of the scene but are roughly aligned. In other words, each of the plurality of cameras is designed to have an identical in-plane rotation and distance from the plane. However, due to physical limitations, there may be “small” differences in the in-plane rotation and distance from the plane, which manifest themselves as in-plane rotation and scale, respectively. Furthermore, the pixel grids of the cameras may be displaced from one another by a sub-pixel value because, due to physical limitations, it is difficult to align the pixels from each camera exactly.
  • The differences in the images captured by the multiple cameras can be calibrated offline. Therefore, the relationship between images captured by the multiple cameras can be determined offline. This relationship can be represented as a homography matrix. Therefore, the disclosed multi-stage rectification process can be used to align images captured by multiple cameras. In some cases, the disclosed multi-stage rectification process can also be used to down-sample the captured images so that, if one or more cameras have different numbers of pixels or different pixel sizes, the disclosed multi-stage rectification process can down-sample the images to account for such differences.
  • It is to be understood that the disclosed subject matter is not limited in its application to the details of construction and to the arrangements of the components set forth in the following description or illustrated in the drawings. The disclosed subject matter is capable of other embodiments and of being practiced and carried out in various ways. Also, it is to be understood that the phraseology and terminology employed herein are for the purpose of description and should not be regarded as limiting.
  • As such, those skilled in the art will appreciate that the conception, upon which this disclosure is based, may readily be utilized as a basis for the designing of other structures, methods, and apparatus for carrying out the several purposes of the disclosed subject matter. It is important, therefore, that the claims be regarded as including such equivalent constructions insofar as they do not depart from the spirit and scope of the disclosed subject matter. For example, some of the disclosed embodiments relate one or more variables. This relationship may be expressed using a mathematical equation. However, one of ordinary skill in the art may also express the same relationship between the one or more variables using a different mathematical equation by transforming the disclosed mathematical equation. It is important that the claims be regarded as including such equivalent relationships between the one or more variables.
  • Although the disclosed subject matter has been described and illustrated in the foregoing exemplary embodiments, it is understood that the present disclosure has been made only by way of example, and that numerous changes in the details of implementation of the disclosed subject matter may be made without departing from the spirit and scope of the disclosed subject matter.

Claims (20)

1-20. (canceled)
21. A machine vision system comprising:
a processor configured to run a computer program stored in memory that is configured to cause the processor to:
receive a distorted image and a homography matrix associated with the distorted image;
determine, using a homography decomposition module, a first transform and a second transform that collectively represent the homography matrix, wherein:
the first transform relates the distorted image to an intermediate image,
the second transform relates a transposed intermediate image to a transposed rectified image,
the distorted image, the intermediate image, the transposed intermediate image, and the transposed rectified image comprise at least a first dimension and a second dimension,
the first transform is constrained to map pixels along the first dimension of the intermediate image to locations along the first dimension of the distorted image, and
the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image;
generate, using a homography removal module, the intermediate image by removing the first transform from the distorted image;
transpose, using the homography removal module, the intermediate image prior to removing the second transform from the intermediate image to generate the transposed intermediate image; and
generate, using the homography removal module, the transposed rectified image by removing the second transform from the transposed intermediate image.
22. The machine vision system of claim 21, wherein the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image by constraining a rate of change of (1) the first dimension of the distorted image with respect to (2) the first dimension of the intermediate image.
23. The machine vision system of claim 21, wherein the second transform is constrained to map pixels along the first dimension of the transposed rectified image to locations along the first dimension of the transposed intermediate image, and wherein the locations along the first dimension of the transposed intermediate image are separated by a same distance from adjacent locations along the first dimension of the transposed intermediate image.
24. The machine vision system of claim 23, wherein the locations along the first dimension of the transposed intermediate image are separated by a same distance from adjacent locations along the first dimension of the transposed intermediate image by constraining a rate of change of (1) the first dimension of the transposed intermediate image with respect to (2) the first dimension of the transposed rectified image.
25. The machine vision system of claim 21, wherein the processor is configured, using the homography removal module, to cause a direct memory access controller to transpose the intermediate image prior to removing the second transform from the intermediate image.
26. The machine vision system of claim 21, wherein the processor is configured, using the homography removal module, to remove the first transform from the distorted image by determining, for a first pixel of the intermediate image, a corresponding first location in the distorted image, and convolving pixels surrounding the corresponding first location in the distorted image to determine an image value for the first pixel.
27. The machine vision system of claim 26, wherein the processor is configured, using the homography removal module, to determine, for a second pixel of the intermediate image having an identical coordinate as the first pixel in a second dimension, a corresponding second location in the distorted image by adding a constant to a coordinate of the corresponding first location in the first dimension.
28. The machine vision system of claim 21, wherein the processor is configured, using the homography removal module, to:
transpose the transposed rectified image to generate a rectified image; and
provide the rectified image to a pattern detection system so that the pattern detection system can use the rectified image to determine whether the rectified image includes a particular pattern.
29. The machine vision system of claim 21, wherein the processor is configured, using the homography removal module, to select one of a set of kernels based on a sub-pixel coordinate of one of the locations along the first dimension of the distorted image, and apply the selected kernel at the one of the locations.
30. A computerized method comprising:
receiving, at a homography decomposition module, a distorted image and a homography matrix associated with the distorted image;
determining, using the homography decomposition module, a first transform and a second transform that collectively represent the homography matrix, wherein:
the first transform relates the distorted image to an intermediate image,
the second transform relates a transposed intermediate image to a transposed rectified image,
the distorted image, the intermediate image, the transposed intermediate image, and the transposed rectified image comprise at least a first dimension and a second dimension,
the first transform is constrained to map pixels along the first dimension of the intermediate image to locations along the first dimension of the distorted image, and
the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image;
generating, using a homography removal module, the intermediate image by removing the first transform from the distorted image;
transposing, using the homography removal module, the intermediate image prior to removing the second transform from the intermediate image to generate the transposed intermediate image; and
generating, using the homography removal module, the transposed rectified image by removing the second transform from the transposed intermediate image.
31. The method of claim 30, wherein the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image by constraining a rate of change of (1) the first dimension of the distorted image with respect to (2) the first dimension of the intermediate image.
32. The method of claim 30, wherein the second transform is constrained to map pixels along the first dimension of the transposed rectified image to locations along the first dimension of the transposed intermediate image, and wherein the locations along the first dimension of the transposed intermediate image are separated by a same distance from adjacent locations along the first dimension of the transposed intermediate image.
33. The method of claim 32, wherein the locations along the first dimension of the transposed intermediate image are separated by a same distance from adjacent locations along the first dimension of the transposed intermediate image by constraining a rate of change of (1) the first dimension of the transposed intermediate image with respect to (2) the first dimension of the transposed rectified image.
34. The method of claim 30, further comprising causing, using the homography removal module, a direct memory access controller to transpose the intermediate image prior to removing the second transform from the intermediate image.
35. The method of claim 30, wherein removing the first transform from the distorted image comprises determining, for a first pixel of the intermediate image, a corresponding first location in the distorted image, and convolving pixels surrounding the corresponding location in the distorted image to determine an image value for the first pixel.
36. The method of claim 35, further comprising determining, for a second pixel of the intermediate image having an identical coordinate as the first pixel in a second dimension, a corresponding second location in the distorted image by adding a constant to a coordinate of the corresponding first location in the first dimension.
37. The method of claim 30, further comprising:
transposing the transposed rectified image to generate a rectified image; and
providing the rectified image to a pattern detection system so that the pattern detection system can use the rectified image to determine whether the rectified image includes a particular pattern.
38. The method of claim 30, further comprising selecting one of a set of kernels based on a sub-pixel coordinate of one of the locations along the first dimension of the distorted image, and applying the selected kernel at the one of the locations.
39. A non-transitory computer readable medium having executable instructions associated with a homography decomposition module and a homography removal module, operable to cause a machine vision system to:
receive a distorted image and a homography matrix associated with the distorted image;
determine a first transform and a second transform that collectively represent the homography matrix, wherein:
the first transform relates the distorted image to an intermediate image,
the second transform relates a transposed intermediate image to a transposed rectified image,
the distorted image, the intermediate image, the transposed intermediate image, and the transposed rectified image comprise at least a first dimension and a second dimension,
the first transform is constrained to map pixels along the first dimension of the intermediate image to locations along the first dimension of the distorted image, and
the locations along the first dimension of the distorted image are separated by a same distance from adjacent locations along the first dimension of the distorted image;
generate the intermediate image by removing the first transform from the distorted image;
transpose the intermediate image prior to removing the second transform from the intermediate image to generate the transposed intermediate image; and
generate the transposed rectified image by removing the second transform from the transposed intermediate image.
US16/395,153 2015-04-03 2019-04-25 Homography rectification Abandoned US20190251677A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US16/395,153 US20190251677A1 (en) 2015-04-03 2019-04-25 Homography rectification

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US14/678,279 US10275863B2 (en) 2015-04-03 2015-04-03 Homography rectification
US16/395,153 US20190251677A1 (en) 2015-04-03 2019-04-25 Homography rectification

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US14/678,279 Continuation US10275863B2 (en) 2015-04-03 2015-04-03 Homography rectification

Publications (1)

Publication Number Publication Date
US20190251677A1 true US20190251677A1 (en) 2019-08-15

Family

ID=57017345

Family Applications (2)

Application Number Title Priority Date Filing Date
US14/678,279 Active US10275863B2 (en) 2015-04-03 2015-04-03 Homography rectification
US16/395,153 Abandoned US20190251677A1 (en) 2015-04-03 2019-04-25 Homography rectification

Country Status (1)

Country Link
US (2) US10275863B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11562551B1 (en) * 2022-04-14 2023-01-24 Scandit Ag Image analysis for decoding angled optical patterns

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10692172B2 (en) 2015-12-30 2020-06-23 Snap Inc. Method of plane tracking
TR201700580A2 (en) * 2017-01-13 2018-07-23 Anadolu Ueniversitesi AN INCREASED REALITY BASED TESTING SYSTEM
EP3644281A4 (en) * 2017-06-20 2021-04-28 Sony Interactive Entertainment Inc. Calibration device, calibration chart, chart pattern generation device, and calibration method
JP6570164B1 (en) 2018-11-28 2019-09-04 株式会社ツバサファクトリー Computer program, image processing method, and image processing apparatus
WO2020239212A1 (en) * 2019-05-29 2020-12-03 Huawei Technologies Co., Ltd. Image processing device and method

Family Cites Families (74)

Publication number Priority date Publication date Assignee Title
US4829587A (en) * 1987-03-02 1989-05-09 Digital Equipment Corporation Fast bitonal to gray scale image scaling
US5054097A (en) 1988-11-23 1991-10-01 Schlumberger Technologies, Inc. Methods and apparatus for alignment of images
US5299308A (en) 1990-02-28 1994-03-29 Ricoh Company, Ltd. Graphic data processing apparatus for producing a tone for an edge pixel and reducing aliasing effects
US5519823A (en) 1991-03-15 1996-05-21 Hewlett-Packard Company Apparatus for rendering antialiased vectors
GB2278524B (en) 1993-05-28 1997-12-10 Nihon Unisys Ltd Method and apparatus for rendering visual images employing area calculation and blending of fractional pixel lists for anti-aliasing and transparency
US6525723B1 (en) * 1998-02-17 2003-02-25 Sun Microsystems, Inc. Graphics system which renders samples into a sample buffer and generates pixels in response to stored samples at different rates
US6449664B1 (en) * 1998-11-16 2002-09-10 Viewahead Technology, Inc. Two dimensional direct memory access in image processing systems
KR100313217B1 (en) * 1998-12-23 2001-12-28 서평원 Pipeline DCT device
JP4332934B2 (en) 1999-06-17 2009-09-16 株式会社セガ Anti-aliasing method and image processing apparatus using the same
US6606404B1 (en) * 1999-06-19 2003-08-12 Microsoft Corporation System and method for computing rectifying homographies for stereo vision processing of three dimensional objects
US6608923B1 (en) * 1999-06-19 2003-08-19 Microsoft Corporation System and method for rectifying images of three dimensional objects
US7084905B1 (en) * 2000-02-23 2006-08-01 The Trustees Of Columbia University In The City Of New York Method and apparatus for obtaining high dynamic range images
US6608631B1 (en) 2000-05-02 2003-08-19 Pixar Amination Studios Method, apparatus, and computer program product for geometric warps and deformations
JP3429280B2 (en) 2000-09-05 2003-07-22 理化学研究所 How to correct lens distortion in images
US7738706B2 (en) * 2000-09-22 2010-06-15 Sri International Method and apparatus for recognition of symbols in images of three-dimensional scenes
US6756993B2 (en) 2001-01-17 2004-06-29 The University Of North Carolina At Chapel Hill Methods and apparatus for rendering images using 3D warping techniques
GB2372659A (en) 2001-02-23 2002-08-28 Sharp Kk A method of rectifying a stereoscopic image
US6963365B2 (en) 2001-02-28 2005-11-08 Hewlett-Packard Development Company, L.P. System and method for removal of digital image vertical distortion
US7035332B2 (en) * 2001-07-31 2006-04-25 Wis Technologies, Inc. DCT/IDCT with minimum multiplication
US6950469B2 (en) 2001-09-17 2005-09-27 Nokia Corporation Method for sub-pixel value interpolation
JP4144292B2 (en) * 2002-08-20 2008-09-03 ソニー株式会社 Image processing apparatus, image processing system, and image processing method
WO2004063991A1 (en) 2003-01-14 2004-07-29 The Circle For The Promotion Of Science And Engineering Multi-parameter highly-accurate simultaneous estimation method in image sub-pixel matching and multi-parameter highly-accurate simultaneous estimation program
JP3852934B2 (en) * 2003-05-20 2006-12-06 株式会社バンダイナムコゲームス Image processing system, program, and information storage medium
US7430335B2 (en) * 2003-08-13 2008-09-30 Apple Inc Pre-processing method and system for data reduction of video sequences and bit rate reduction of compressed video sequences using spatial filtering
US8161388B2 (en) * 2004-01-21 2012-04-17 Rodriguez Arturo A Interactive discovery of display device characteristics
US7239331B2 (en) 2004-02-17 2007-07-03 Corel Corporation Method and apparatus for correction of perspective distortion
US7379623B2 (en) 2004-04-30 2008-05-27 Microsoft Corporation Method to quickly warp a 2-D image using only integer math
US7583858B2 (en) * 2004-10-12 2009-09-01 Eastman Kodak Company Image processing based on direction of gravity
US20060078215A1 (en) * 2004-10-12 2006-04-13 Eastman Kodak Company Image processing based on direction of gravity
US7619658B2 (en) * 2004-11-15 2009-11-17 Hewlett-Packard Development Company, L.P. Methods and systems for producing seamless composite images without requiring overlap of source images
US20060104521A1 (en) * 2004-11-15 2006-05-18 Shu-Wen Teng Image processing devices and methods
US7403658B2 (en) * 2005-04-15 2008-07-22 Microsoft Corporation Direct homography computation by local linearization
EP1875458A1 (en) 2005-04-21 2008-01-09 Koninklijke Philips Electronics N.V. Sub-pixel mapping
US7369140B1 (en) 2005-06-03 2008-05-06 Nvidia Corporation System, apparatus and method for subpixel shifting of sample positions to anti-alias computer-generated images
US7456846B1 (en) 2005-06-03 2008-11-25 Nvidia Corporation Graphical processing system, graphical pipeline and method for implementing subpixel shifting to anti-alias texture
JP4529837B2 (en) * 2005-08-04 2010-08-25 カシオ計算機株式会社 Imaging apparatus, image correction method, and program
EP1768387B1 (en) * 2005-09-22 2014-11-05 Samsung Electronics Co., Ltd. Image capturing apparatus with image compensation and method therefor
US7769236B2 (en) * 2005-10-31 2010-08-03 National Research Council Of Canada Marker and method for detecting said marker
US7821570B2 (en) * 2005-11-30 2010-10-26 Eastman Kodak Company Adjusting digital image exposure and tone scale
US7800628B2 (en) 2006-06-16 2010-09-21 Hewlett-Packard Development Company, L.P. System and method for generating scale maps
US8023144B2 (en) * 2006-07-11 2011-09-20 Lexmark International, Inc. Method for generating transposed image data
EP2168079B1 (en) 2007-01-23 2015-01-14 Valeo Schalter und Sensoren GmbH Method and system for universal lane boundary detection
JP4958610B2 (en) * 2007-04-06 2012-06-20 キヤノン株式会社 Image stabilization apparatus, imaging apparatus, and image stabilization method
US7903899B2 (en) * 2007-05-23 2011-03-08 Videosurf, Inc. Method of geometric coarsening and segmenting of still images
US8681224B2 (en) 2007-06-26 2014-03-25 Dublin City University Method for high precision lens distortion calibration and removal
US7780084B2 (en) * 2007-06-29 2010-08-24 Microsoft Corporation 2-D barcode recognition
US8233045B2 (en) * 2007-07-16 2012-07-31 Trw Automotive U.S. Llc Method and apparatus for distortion correction and image enhancing of a vehicle rear viewing system
JP4461165B2 (en) * 2007-09-26 2010-05-12 株式会社東芝 Image processing apparatus, method, and program
ATE452379T1 (en) 2007-10-11 2010-01-15 Mvtec Software Gmbh SYSTEM AND METHOD FOR 3D OBJECT RECOGNITION
KR100966592B1 (en) * 2007-12-17 2010-06-29 한국전자통신연구원 Method for calibrating a camera with homography of imaged parallelogram
KR100924689B1 (en) * 2007-12-17 2009-11-03 한국전자통신연구원 Apparatus and method for transforming an image in a mobile device
US8098262B2 (en) * 2008-09-05 2012-01-17 Apple Inc. Arbitrary fractional pixel movement
CN101533513B (en) 2009-04-24 2012-12-26 天津大学 Picture falsification testing method based on double-vision geometry
US8355592B1 (en) * 2009-05-06 2013-01-15 Adobe Systems Incorporated Generating a modified image with semantic constraint
JP4528867B1 (en) * 2009-05-15 2010-08-25 株式会社東芝 Image processing apparatus, image processing method, and display apparatus
US8995793B1 (en) 2009-10-09 2015-03-31 Lockheed Martin Corporation Moving object super-resolution systems and methods
US8890934B2 (en) * 2010-03-19 2014-11-18 Panasonic Corporation Stereoscopic image aligning apparatus, stereoscopic image aligning method, and program of the same
US8908011B2 (en) * 2010-09-17 2014-12-09 Panasonic Corporation Three-dimensional video creating device and three-dimensional video creating method
US9118816B2 (en) 2011-12-06 2015-08-25 Mobileye Vision Technologies Ltd. Road vertical contour detection
US9313519B2 (en) 2011-03-11 2016-04-12 Google Technology Holdings LLC Interpolation filter selection using prediction unit (PU) size
US9179047B2 (en) * 2011-06-13 2015-11-03 University Of Florida Research Foundation, Inc. Systems and methods for estimating the structure and motion of an object
US9189892B2 (en) * 2011-07-01 2015-11-17 Google Inc. Systems and methods for activities solver development in augmented reality applications
EP2756682A4 (en) * 2011-09-12 2015-08-19 Intel Corp Networked capture and 3d display of localized, segmented images
EP2751521B1 (en) 2011-11-23 2016-03-02 Creaform Inc. Method and system for alignment of a pattern on a spatial coded slide image
US8942512B2 (en) * 2011-12-24 2015-01-27 Ecole De Technologie Superieure Methods and systems for processing a first image with reference to a second image
JP6071257B2 (en) 2012-06-07 2017-02-01 キヤノン株式会社 Image processing apparatus, control method therefor, and program
US8928730B2 (en) 2012-07-03 2015-01-06 DigitalOptics Corporation Europe Limited Method and system for correcting a distorted input image
US9111059B2 (en) * 2012-11-01 2015-08-18 Stc.Unm System and methods for dynamic management of hardware resources
US8970709B2 (en) * 2013-03-13 2015-03-03 Electronic Scripting Products, Inc. Reduced homography for recovery of pose parameters of an optical apparatus producing image data with structural uncertainty
US9317893B2 (en) * 2013-03-26 2016-04-19 Sharp Laboratories Of America, Inc. Methods and systems for correcting a document image
US9842423B2 (en) * 2013-07-08 2017-12-12 Qualcomm Incorporated Systems and methods for producing a three-dimensional face model
EP2843616A1 (en) * 2013-08-29 2015-03-04 Sick Ag Optoelectronic device and method for recording rectified images
KR102265109B1 (en) * 2014-01-24 2021-06-15 삼성전자주식회사 Method and apparatus for image processing
US9407896B2 (en) 2014-03-24 2016-08-02 Hong Kong Applied Science and Technology Research Institute Company, Limited Multi-view synthesis in real-time with fallback to 2D from 3D to reduce flicker in low or unstable stereo-matching image regions

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11562551B1 (en) * 2022-04-14 2023-01-24 Scandit Ag Image analysis for decoding angled optical patterns
US11893764B1 (en) 2022-04-14 2024-02-06 Scandit Ag Image analysis for decoding angled optical patterns

Also Published As

Publication number Publication date
US10275863B2 (en) 2019-04-30
US20160292831A1 (en) 2016-10-06

Similar Documents

Publication Publication Date Title
US9542732B2 (en) Efficient image transformation
EP3101623B1 (en) Homography rectification
US20190251677A1 (en) Homography rectification
US20210174471A1 (en) Image Stitching Method, Electronic Apparatus, and Storage Medium
US9235063B2 (en) Lens modeling
CN109753971B (en) Correction method and device for distorted text lines, character recognition method and device
US9569874B2 (en) System and method for perspective preserving stitching and summarizing views
US9824486B2 (en) High resolution free-view interpolation of planar structure
US20130011057A1 (en) Hough Transform Method for Linear Ribbon and Circular Ring Detection in the Gradient Domain
RU2631765C1 (en) Method and system of correcting perspective distortions in images occupying double-page spread
CN110517186B (en) Method, device, storage medium and computer equipment for eliminating invoice seal
GB2536430B (en) Image noise reduction
CN111383252B (en) Multi-camera target tracking method, system, device and storage medium
CN112435223B (en) Target detection method, device and storage medium
Lu et al. Multiperspective image stitching and regularization via hybrid structure warping
Jain et al. Panorama construction from multi-view cameras in outdoor scenes
Nagy et al. Super-resolution for traditional and omnidirectional image sequences
US11734855B2 (en) Rotation equivariant orientation estimation for omnidirectional localization
CN117036985B (en) Small target detection method and device for video satellite image
US20220318961A1 (en) Method and electronic device for removing artifact in high resolution image
CN111179166A (en) Image processing method, device, equipment and computer readable storage medium
CN117253022A (en) Object identification method, device and inspection equipment
TW202143169A (en) Photographing method for dynamic scene compensation and a camera using the method
CN115272519A (en) Dynamic area detection method and storage medium

Legal Events

Date Code Title Description
AS Assignment

Owner name: COGNEX CORPORATION, MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BOGAN, NATHANIEL;REEL/FRAME:049002/0300

Effective date: 20190123

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION