US20230066494A1 - Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images - Google Patents

Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images

Info

Publication number
US20230066494A1
Authority
US
United States
Prior art keywords
images
blocks
feature amount
region
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US18/054,501
Inventor
Takao Sasaki
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Priority to US18/054,501
Publication of US20230066494A1
Status: Abandoned

Classifications

    • H04N5/232125
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/95Computational photography systems, e.g. light-field imaging systems
    • H04N23/958Computational photography systems, e.g. light-field imaging systems for extended depth of field imaging
    • H04N23/959Computational photography systems, e.g. light-field imaging systems for extended depth of field imaging by adjusting depth of field during image capture, e.g. maximising or setting range based on scene characteristics
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/62Control of parameters via user interfaces
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/63Control of cameras or camera modules by using electronic viewfinders
    • H04N23/631Graphical user interfaces [GUI] specially adapted for controlling image capture or setting capture parameters
    • H04N23/632Graphical user interfaces [GUI] specially adapted for controlling image capture or setting capture parameters for displaying or modifying preview images prior to image capturing, e.g. variety of image resolutions or capturing parameters
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/63Control of cameras or camera modules by using electronic viewfinders
    • H04N23/633Control of cameras or camera modules by using electronic viewfinders for displaying additional information relating to control or operation of the camera
    • H04N23/635Region indicators; Field of view indicators
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/67Focus control based on electronic image sensor signals
    • H04N23/673Focus control based on electronic image sensor signals based on contrast or high frequency components of image signals, e.g. hill climbing method
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/67Focus control based on electronic image sensor signals
    • H04N23/675Focus control based on electronic image sensor signals comprising setting of focusing regions
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/67Focus control based on electronic image sensor signals
    • H04N23/676Bracketing for image capture at varying focusing conditions
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/95Computational photography systems, e.g. light-field imaging systems
    • H04N23/951Computational photography systems, e.g. light-field imaging systems by using two or more images to influence resolution, frame rate or aspect ratio
    • H04N5/232127
    • H04N5/232133
    • H04N5/23232

Definitions

  • Processing in steps S 810 and S 811 is similar to the processing in steps S 410 and S 411 according to the first exemplary embodiment.
  • FIG. 9 is a flowchart illustrating a modification of the alignment processing according to the present exemplary embodiment.
  • In the example illustrated in FIG. 9, the processing does not proceed directly to the step of calculating the conversion coefficient after the motion vectors are detected; instead, it includes a step of determining whether the detected motion vectors satisfy the predetermined condition, in a manner similar to step S406 in the first exemplary embodiment.
  • In a case where the motion vectors do not satisfy the predetermined condition, the blocks are re-arranged in step S907. Because the blocks are already arranged in consideration of the focusing area in step S904, in one embodiment, the blocks are arranged with higher density in the rearrangement in step S907.
  • According to the present exemplary embodiment, the detection accuracy of a motion vector can be improved by arranging the blocks used for detection of the motion vector based on the position of the focusing area.
  • The implementation is not limited to the digital camera; the exemplary embodiments may also be implemented in a mobile terminal with a built-in imaging device or in a network camera capable of capturing images.
  • The exemplary embodiments of the disclosure can be realized by supplying a program for realizing one or more functions of the exemplary embodiments to a system or an apparatus via a network or a storage medium and causing one or more processors of a computer of the system or the apparatus to read out and execute the program. Further, the exemplary embodiments of the disclosure can be realized by a circuit (e.g., an ASIC) capable of realizing one or more functions.
  • Embodiment(s) of the disclosure can also be realized by a computer of a system or apparatus that reads out and executes computer executable instructions (e.g., one or more programs) recorded on a storage medium (which may also be referred to more fully as a ‘non-transitory computer-readable storage medium’) to perform the functions of one or more of the above-described embodiment(s) and/or that includes one or more circuits (e.g., application specific integrated circuit (ASIC)) for performing the functions of one or more of the above-described embodiment(s), and by a method performed by the computer of the system or apparatus by, for example, reading out and executing the computer executable instructions from the storage medium to perform the functions of one or more of the above-described embodiment(s) and/or controlling the one or more circuits to perform the functions of one or more of the above-described embodiment(s).
  • The computer may comprise one or more processors (e.g., central processing unit (CPU), micro processing unit (MPU)) and may include a network of separate computers or separate processors to read out and execute the computer executable instructions.
  • The computer executable instructions may be provided to the computer, for example, from a network or the storage medium.
  • The storage medium may include, for example, one or more of a hard disk, a random-access memory (RAM), a read only memory (ROM), a storage of distributed computing systems, an optical disk (such as a compact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™), a flash memory device, a memory card, and the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Computing Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Studio Devices (AREA)
  • Image Analysis (AREA)

Abstract

An apparatus includes at least one memory configured to store instructions, and at least one processor in communication with the at least one memory and configured to execute the instructions to detect a feature amount from each of a plurality of images different in in-focus position in an optical axis direction, and align the plurality of images by using the feature amount. The at least one processor further executes instructions to detect the feature amount by preferentially using a focusing area of each of the plurality of images.

Description

    CROSS-REFERENCE TO RELATED APPLICATION
  • This application is a Continuation of U.S. application Ser. No. 17/153,637, filed Jan. 20, 2021, which claims priority from Japanese Patent Application No. 2020-014082, filed Jan. 30, 2020, which is hereby incorporated by reference herein in its entirety.
  • BACKGROUND OF THE DISCLOSURE
  • Field of the Disclosure
  • The aspect of the embodiments relates to an image processing apparatus that combines a plurality of images different in in-focus position in an optical axis direction.
  • Description of the Related Art
  • In a case where images of a plurality of objects that differ greatly in distance from a digital camera are captured, or in a case where an image of an object that is long in the depth direction is captured, only a part of the object can be in focus because of an insufficient depth of field. To solve such an issue, Japanese Patent Application Laid-Open No. 2015-186088 discusses a focus stacking composition technique in which a plurality of images different in in-focus position is captured, a focusing area is extracted from each of the images, and the extracted focusing areas are combined to generate one composite image in which the entire imaging area is in focus. To perform the focus stacking composition, the images are aligned, because the angles of view of the images different in in-focus position are slightly different and because of the effect of camera shake during imaging. Japanese Patent Application Laid-Open No. 2015-186088 discusses a technique for performing the alignment in which blocks are set in each of the images and a moving amount is calculated by using the blocks.
  • When the image alignment is performed, in one embodiment the blocks are preferentially arranged in the focusing area of the image, because a feature amount such as a motion vector calculated from the focusing area of the image is high in accuracy.
  • However, the images captured for the focus stacking composition differ in the position of the focusing area, so if the blocks are arranged in the same manner for every image, the accuracy of the alignment may deteriorate.
  • SUMMARY OF THE DISCLOSURE
  • According to an embodiment of the disclosure, an image processing apparatus includes at least one memory configured to store instructions, and at least one processor in communication with the at least one memory and configured to execute the instructions to detect a feature amount from each of a plurality of images different in in-focus position in an optical axis direction, and align the plurality of images by using the feature amount. The at least one processor further executes instructions to detect the feature amount by preferentially using a focusing area of each of the plurality of images.
  • Further features of the disclosure will become apparent from the following description of exemplary embodiments with reference to the attached drawings.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a block diagram illustrating a configuration of a digital camera according to an exemplary embodiment of the disclosure.
  • FIG. 2 is a flowchart illustrating generation of a composite image according to the exemplary embodiment of the disclosure.
  • FIG. 3 is a flowchart illustrating imaging according to the exemplary embodiment of the disclosure.
  • FIG. 4 is a flowchart illustrating alignment according to a first exemplary embodiment.
  • FIGS. 5A to 5D are diagrams illustrating arrangement of blocks according to the first exemplary embodiment.
  • FIGS. 6A to 6E are diagrams illustrating rearrangement of blocks according to the first exemplary embodiment.
  • FIG. 7 is a flowchart illustrating image composition according to the exemplary embodiment of the disclosure.
  • FIG. 8 is a flowchart illustrating alignment processing according to a second exemplary embodiment.
  • FIG. 9 is a flowchart illustrating another example of the alignment processing according to the second exemplary embodiment.
  • DESCRIPTION OF THE EMBODIMENTS
  • Exemplary embodiments of the disclosure are described in detail below with reference to accompanying drawings.
  • FIG. 1 is an example of a block diagram illustrating a configuration of a digital camera as an image processing apparatus according to a first exemplary embodiment. A digital camera 100 can capture a still image, record information on a focus position, calculate a contrast value, and combine images. Further, the digital camera 100 can perform enlargement processing or reduction processing on an image captured and stored, or an image input from outside.
  • A control unit 101 is a signal processor such as a central processing unit (CPU) or a micro processing unit (MPU), and controls the units of the digital camera 100 while reading out a program previously stored in a read only memory (ROM) 105 described below. For example, as described below, the control unit 101 instructs an imaging unit 104 described below to start and end imaging. The control unit 101 also instructs an image processing unit 107 described below to perform image processing based on the program stored in the ROM 105. An instruction from a user is input to the digital camera 100 via an operation unit 110 described below, and is transmitted to each of the units of the digital camera 100 via the control unit 101.
  • A driving unit 102 includes a motor, and mechanically operates an optical system 103 described below under the instruction of the control unit 101. For example, the driving unit 102 moves a position of a focus lens included in the optical system 103 to adjust a focal length of the optical system 103 based on the instruction of the control unit 101.
  • The optical system 103 includes a zoom lens, the focus lens, and a diaphragm. The diaphragm is a mechanism for adjusting a quantity of light to be transmitted. A focus position is changeable by changing the positions of the lenses.
  • The imaging unit 104 is a photoelectric conversion device that performs photoelectric conversion to convert an incident optical signal into an electric signal. For example, a charge-coupled device (CCD) sensor or a complementary metal-oxide semiconductor (CMOS) sensor is adoptable as the imaging unit 104. The imaging unit 104 has a moving image capturing mode, and can capture a plurality of temporally continuous images as frames of a moving image.
  • The ROM 105 is a nonvolatile read-only memory as a recording medium, and stores parameters for operation of each of the units in addition to an operation program for each of the units included in the digital camera 100. A random access memory (RAM) 106 is a rewritable volatile memory, and is used as a temporary storage area of data output in the operation of each of the units included in the digital camera 100.
  • The image processing unit 107 performs various image processing such as white balance adjustment, color interpolation, and filtering on the images output from the imaging unit 104 or data of an image signal recorded in a built-in memory 109 described below. The image processing unit 107 also performs compression processing conforming to a standard such as Joint Photographic Experts Group (JPEG) on data of the images captured by the imaging unit 104.
  • The image processing unit 107 includes an integrated circuit (application specific integrated circuit (ASIC)) in which circuits each performing specific processing are integrated. Alternatively, the control unit 101 may perform a part or all of the functions of the image processing unit 107 by performing processing based on the program read out from the ROM 105. In a case where the control unit 101 performs all of the functions of the image processing unit 107, the image processing unit 107 as hardware is unnecessary.
  • A display unit 108 is a liquid crystal display, an organic electroluminescence (EL) display, or other type of display that displays an image temporarily stored in the RAM 106, an image stored in the built-in memory 109 described below, and a setting screen of the digital camera 100.
  • The built-in memory 109 stores the image captured by the imaging unit 104, the image processed by the image processing unit 107, information on the focus position in imaging, etc. A memory card or the like may be used in place of the built-in memory 109.
  • The operation unit 110 includes, for example, buttons, switches, keys, and a mode dial provided on the digital camera 100, and examples of the operation unit 110 include a touch panel also serving as the display unit 108. The instruction from the user is transmitted to the control unit 101 via the operation unit 110.
  • FIG. 2 is a flowchart illustrating generation of a composite image according to the present exemplary embodiment. In step S201, the imaging unit 104 captures a plurality of images at respectively different in-focus positions in an optical axis direction. In step S202, the control unit 101 performs alignment on the plurality of images captured by the imaging unit 104 in step S201. In step S203, the image processing unit 107 combines the aligned images to generate a composite image having a deeper depth of field. In the following, each of the steps will be described in detail.
  • FIG. 3 is a flowchart illustrating the imaging in step S201 according to the present exemplary embodiment.
  • In step S301, the control unit 101 sets the in-focus positions. For example, the user designates a focus position via the touch panel also serving as the display unit 108, and a plurality of in-focus positions is designated at equal intervals on the front side and the rear side, in the optical axis direction, of an in-focus position corresponding to the designated focus position. At the same time, the control unit 101 determines an imaging order of the set in-focus positions in order of distance.
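  • As a rough illustration of how the in-focus positions of step S301 might be planned, the sketch below generates positions at equal intervals on the front and rear sides of a designated position and sorts them into an imaging order by distance; the function name, parameters, and values are assumptions made for illustration, not taken from the disclosure.

```python
def plan_in_focus_positions(designated, step, count_each_side):
    """Illustrative sketch of step S301 (names and values are assumptions):
    equally spaced in-focus positions around the designated position,
    returned in an imaging order sorted by distance."""
    positions = [designated + i * step
                 for i in range(-count_each_side, count_each_side + 1)]
    return sorted(positions)

# Example: positions 0.8, 0.9, 1.0, 1.1, 1.2 captured from near to far.
print(plan_in_focus_positions(designated=1.0, step=0.1, count_each_side=2))
```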
  • In step S302, the imaging unit 104 captures an image at an in-focus position, at which imaging has not been performed and which is the earliest in the imaging order among the in-focus positions set in step S301.
  • In step S303, the control unit 101 determines whether images have been captured at all of the in-focus positions set in step S301. In a case where images have been captured at all of the set in-focus positions (YES in step S303), the processing in the flowchart is ended. In a case where there is an in-focus position where the image has not been captured (NO in step S303), the processing returns to step S302.
  • In the case of a multi-lens camera including a plurality of imaging units 104, images may be captured simultaneously at the plurality of in-focus positions set in step S301.
  • FIG. 4 is a flowchart illustrating the alignment in step S202 according to the present exemplary embodiment. In the present exemplary embodiment, the control unit 101 detects a motion vector as a feature amount of each image, and performs the alignment by using the motion vector.
  • In step S401, the image processing unit 107 calculates a contrast value from each of the images captured in step S201. The contrast value calculated at this time is used for rearrangement of blocks and image composition described below.
  • As an example of a method of calculating the contrast value, the image processing unit 107 first calculates a luminance value Y from the color signals Sr, Sg, and Sb of each pixel by using the following equation (1).

  • Y=0.299Sr+0.587Sg+0.114Sb  (1)
  • Next, a Sobel filter is applied to a matrix L of luminance values Y of 3×3 pixels, and a contrast value I is calculated by using the following equations (2) to (4).
  • $I_h = \begin{pmatrix} -1 & 0 & 1 \\ -2 & 0 & 2 \\ -1 & 0 & 1 \end{pmatrix} \cdot L$  (2)
  • $I_v = \begin{pmatrix} -1 & -2 & -1 \\ 0 & 0 & 0 \\ 1 & 2 & 1 \end{pmatrix} \cdot L$  (3)
  • $I = \sqrt{I_h^2 + I_v^2}$  (4)
  • The above-described method of calculating the contrast value is merely an example, and for example, an edge detection filter such as a Laplacian filter, or a bandpass filter allowing frequencies in a predetermined band to pass therethrough may be used.
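  • A minimal NumPy sketch of equations (1) to (4) follows; the function name contrast_map, the edge padding, and the use of a plain cross-correlation for the filtering are assumptions made for illustration.

```python
import numpy as np

def contrast_map(rgb):
    """Contrast value I per pixel following equations (1)-(4): luminance Y
    from the colour signals, then Sobel responses Ih and Iv combined as
    sqrt(Ih**2 + Iv**2)."""
    sr, sg, sb = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    y = 0.299 * sr + 0.587 * sg + 0.114 * sb                  # equation (1)
    kh = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], float)
    kv = kh.T                                                  # kernel of equation (3)
    pad = np.pad(y, 1, mode="edge")
    ih = np.zeros_like(y)
    iv = np.zeros_like(y)
    for dy in range(3):                                        # 3x3 neighbourhood L
        for dx in range(3):
            win = pad[dy:dy + y.shape[0], dx:dx + y.shape[1]]
            ih += kh[dy, dx] * win
            iv += kv[dy, dx] * win
    return np.sqrt(ih ** 2 + iv ** 2)                          # equation (4)
```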
  • In step S402, the control unit 101 determines target images on which the alignment processing is to be performed. Typically, the control unit 101 determines, from among the images captured in step S201, two temporally adjacent images, i.e., images captured at times next to each other, as the target images of the alignment processing. For example, the first captured image and the second captured image are first determined as the target images.
  • In step S403, the control unit 101 detects a focusing area of each of the target images. As one method for detecting a focusing area, for example, information on the focusing area is recorded at the time of imaging, and the focusing area is extracted from that information in step S403. Alternatively, in another example of the method, contrast values of all of the captured images are compared with each other, and an area having a contrast value larger than the contrast value of the corresponding area in the other image may be determined as the focusing area. At this time, the interval between the in-focus positions of the two target images is relatively small, so it is expected that their focusing areas largely overlap.
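  • The contrast-comparison variant of the focusing-area detection could be sketched as follows; the margin parameter is an assumed tolerance, and the per-pixel comparison stands in for whatever area granularity an implementation actually uses.

```python
import numpy as np

def focusing_mask(contrast_this, contrast_other, margin=0.0):
    """Illustrative detection of the focusing area of one target image by
    comparing its contrast values with those of the other target image
    (margin is an assumed tolerance, not taken from the disclosure)."""
    return contrast_this > (contrast_other + margin)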
  • In step S404, the image processing unit 107 arranges blocks in each of the target images.
  • FIGS. 5A to 5D are diagrams illustrating arrangement of the blocks according to the present exemplary embodiment. FIG. 5A illustrates an example of the captured image. FIG. 5B illustrates an example of arrangement of blocks in the image illustrated in FIG. 5A. In the example illustrated in FIG. 5B, blocks are arranged on the entire area of the image illustrated in FIG. 5A. FIG. 5C illustrates an example in which blocks are arranged in a part of the area of the image illustrated in FIG. 5A. In FIG. 5B, blocks 501 and search ranges 502 including the respective blocks 501 are illustrated. The search ranges 502 are arranged for detection of a motion vector described below, based on the respective blocks.
  • Although the numbers of arranged blocks are the same in the examples illustrated in FIG. 5B and FIG. 5C, the number of blocks is not limited thereto. The number of blocks may be changed depending on the calculation capability of the control unit 101. To accurately detect the motion vector described below, however, the blocks are to be arranged similarly in all of the images.
  • Further, in FIG. 5B and FIG. 5C, the search ranges are arranged so as not to be overlapped with one another. However, the arrangement of the search ranges is not limited thereto. The search ranges may be arranged so as to be overlapped with one another as illustrated in FIG. 5D. In FIG. 5D, although the sizes of the blocks and the search ranges are similar to the sizes of the blocks and the search ranges in FIG. 5B, the blocks and the search ranges do not cover the entire area of the image unlike FIG. 5B because the search ranges are overlapped with one another.
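  • A sketch of the block arrangement of step S404 is given below; the block and search-range sizes are assumed example values, and the step parameter controls whether the search ranges overlap (FIG. 5D) or not (FIGS. 5B and 5C).

```python
def arrange_blocks(height, width, block=32, search=64, step=None):
    """Illustrative layout of square blocks centred inside square search
    ranges.  step == search gives non-overlapping search ranges; a smaller
    step makes them overlap.  All sizes here are assumptions."""
    step = step or search
    layout = []
    for top in range(0, height - search + 1, step):
        for left in range(0, width - search + 1, step):
            cy, cx = top + search // 2, left + search // 2
            layout.append({
                "block": (cy - block // 2, cx - block // 2, block, block),
                "search": (top, left, search, search),
            })
    return layout
```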
  • In step S405, the image processing unit 107 detects a motion vector from each of the blocks arranged in step S404. The control unit 101 calculates, in each of the search ranges in one of the target images, a correspondence point at which the sum of absolute differences (hereinafter referred to as SAD) of luminance values with respect to the corresponding block of the other image becomes minimum. The control unit 101 calculates the motion vector based on the center of the block and the correspondence point of the center of the block. In the above-described calculation of the correspondence point, the control unit 101 may also use the sum of squared differences (hereinafter referred to as SSD), normalized cross correlation (hereinafter referred to as NCC), etc., in addition to the SAD.
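  • The SAD-based matching of step S405 could look roughly as follows; the (top, left, height, width) rectangle convention matches the arrangement sketch above and is an assumption, not part of the disclosure.

```python
import numpy as np

def motion_vector(ref, tgt, block_rect, search_rect):
    """Illustrative step S405: inside the search range of the target image,
    find the position whose SAD of luminance values against the reference
    block is minimum, and return the displacement of the block centre."""
    by, bx, bh, bw = block_rect
    sy, sx, sh, sw = search_rect
    patch = ref[by:by + bh, bx:bx + bw].astype(float)
    best_sad, best_pos = None, (by, bx)
    for y in range(sy, sy + sh - bh + 1):
        for x in range(sx, sx + sw - bw + 1):
            sad = np.abs(tgt[y:y + bh, x:x + bw].astype(float) - patch).sum()
            if best_sad is None or sad < best_sad:
                best_sad, best_pos = sad, (y, x)
    return (best_pos[1] - bx, best_pos[0] - by)   # (dx, dy) of the block centre
```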
  • In step S406, the image processing unit 107 determines whether the motion vectors detected in step S405 satisfy a predetermined condition.
  • To perform the alignment with high accuracy, the motion vectors are required to satisfy a certain condition. For example, as described below, at least three motion vectors not parallel to one another are used to calculate an affine transformation coefficient. In other words, the alignment cannot be performed unless a predetermined number or more of motion vectors are detected. Further, in a case where the detected motion vectors are excessively small, the influence of any detection error is relatively large, and the accuracy of the alignment may be low. In other words, the alignment cannot be performed unless the detected motion vectors are each greater than or equal to a threshold value. Moreover, in a case where the motion vectors detected from the blocks are not directed in a consistent direction, i.e., in a case where the variation among them is large, the accuracy of the alignment may be low. The image processing unit 107 determines in advance a condition that the motion vectors are to satisfy, and determines, in step S406, whether the motion vectors satisfy that condition. In a case where the motion vectors do not satisfy the predetermined condition (NO in step S406), the accuracy of the alignment may be low even if the alignment is performed, so the processing proceeds to step S407 without directly proceeding to the alignment processing. In contrast, in a case where the detected motion vectors satisfy the predetermined condition (YES in step S406), the processing proceeds to step S409.
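  • The predetermined condition of step S406 is not spelled out numerically in the text, so the following check is only a sketch with assumed thresholds: enough vectors, each of sufficient magnitude, and directions that do not deviate too much from one another.

```python
import numpy as np

def vectors_satisfy_condition(vectors, min_count=3, min_norm=0.5, max_angle_std=0.5):
    """Illustrative step S406 with assumed thresholds.  A simple standard
    deviation of angles stands in for the 'deviation' of vector directions."""
    v = np.asarray(vectors, dtype=float)
    if len(v) < min_count:
        return False
    if np.any(np.linalg.norm(v, axis=1) < min_norm):
        return False
    angles = np.arctan2(v[:, 1], v[:, 0])
    return float(np.std(angles)) <= max_angle_std
```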
  • In step S407, the image processing unit 107 re-arranges the blocks in each of the target images with reference to the focusing area detected in step S403. In one embodiment, the image processing unit 107 refers to a portion where the focusing areas of the two respective target images are overlapped with each other, and arranges the blocks at positions corresponding to the overlapped portion.
  • FIGS. 6A to 6E are diagrams illustrating rearrangement of the blocks according to the present exemplary embodiment. An image 610 in FIG. 6A and an image 620 in FIG. 6B are the two target images determined by the control unit 101 in step S402. An area 611 is a focusing area of the image 610, and an area 621 is a focusing area of the image 620. FIG. 6C illustrates a state where the image 610 and the image 620 are overlapped with each other. In FIG. 6C, a state where blocks are arranged in an area 631 where the focusing area 611 of the image 610 and the focusing area 621 of the image 620 are overlapped is illustrated. FIG. 6D illustrates an area where the blocks are re-arranged in the image 610, and FIG. 6E illustrates an area where the blocks are re-arranged in the image 620.
  • In the example of FIG. 6C, although the blocks are arranged over the entire area 631, the arrangement of the blocks is not limited thereto, and the blocks may be arranged in a part of the area 631.
  • When the control unit 101 determines a block rearrangement area, the control unit 101 re-arranges the blocks in the block rearrangement area with a density higher than that of the blocks arranged in step S404. For example, the sizes of the blocks and the search ranges are made smaller, and the same number of blocks are re-arranged. In a case where the search ranges arranged in step S404 are not overlapped with one another, the sizes of the blocks and the search ranges are not changed, and the blocks are re-arranged such that the search ranges of the blocks are overlapped with one another as illustrated in FIG. 5D. On the other hand, in a case where the search ranges arranged in step S404 are overlapped with one another, the sizes of the blocks and the search ranges may be left unchanged, and the blocks may be re-arranged such that the overlapped portions become larger.
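  • One way to sketch the rearrangement of step S407 is to restrict the layout to the portion where the two focusing-area masks overlap and to use smaller, more densely placed blocks; the mask inputs and the specific sizes are assumptions for illustration.

```python
import numpy as np

def rearrange_blocks(mask_a, mask_b, block=16, search=32, step=16):
    """Illustrative step S407: place blocks only inside the overlap of the two
    focusing areas (area 631 in FIG. 6C) at a higher density than in step S404."""
    overlap = np.logical_and(mask_a, mask_b)
    h, w = overlap.shape
    layout = []
    for top in range(0, h - search + 1, step):
        for left in range(0, w - search + 1, step):
            if overlap[top:top + search, left:left + search].all():
                cy, cx = top + search // 2, left + search // 2
                layout.append({
                    "block": (cy - block // 2, cx - block // 2, block, block),
                    "search": (top, left, search, search),
                })
    return layout
```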
  • In step S408, the image processing unit 107 re-detects the motion vector from each of the blocks re-arranged in step S407. The method of re-detecting a motion vector may be similar to the method of detecting the motion vector in step S405.
  • In step S409, the control unit 101 determines whether the motion vectors of all of the images have been calculated. In a case where there is an image for which a motion vector has not been calculated (NO in step S409), the processing returns to step S402, and the control unit 101 determines target images again. For example, the control unit 101 determines two target images in order of imaging. In other words, in the first motion vector calculation, the first image and the second image are determined as the target images; in the second motion vector calculation, the second image and a third image are determined as the target images.
  • If the motion vectors of all of the images are calculated (YES in step S409), the processing proceeds to step S410. In step S410, a conversion coefficient is calculated. The conversion coefficient is a coefficient for deformation of an image in next step S411. For example, an image can be deformed by using a conversion coefficient A expressed in the following equation (5).
  • $I' = \begin{pmatrix} x' \\ y' \\ 1 \end{pmatrix} = A I = \begin{pmatrix} a & b & c \\ d & e & f \\ g & h & i \end{pmatrix} \begin{pmatrix} x \\ y \\ 1 \end{pmatrix}$  (5)
  • In the equation (5), (x′, y′) indicates coordinates after deformation, and (x, y) indicates coordinates before the deformation. When coordinates at a start point and an end point of each of the motion vectors detected from the two target images are respectively defined as (x, y) and (x′, y′) and three motion vectors not parallel to one another have been detected, the conversion coefficient A in the equation (5) can be calculated. Next, in step S411, the equation (5) is applied to all coordinates in one of the target images by using the calculated conversion coefficient. As a result, the two target images can be aligned. Applying such a method to all pairs of the target images determined in step S402 allows for alignment of all of the images.
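  • A least-squares reading of equation (5) for an affine coefficient (bottom row fixed to 0, 0, 1) and a nearest-neighbour warp could be sketched as follows; the function names and the inverse-mapping sampling are assumptions for illustration, not the disclosed implementation.

```python
import numpy as np

def estimate_affine(src_pts, dst_pts):
    """Illustrative step S410: solve equation (5) for coefficients a..f in a
    least-squares sense from start points (x, y) and end points (x', y') of
    at least three motion vectors that are not parallel to one another."""
    src = np.asarray(src_pts, dtype=float)
    dst = np.asarray(dst_pts, dtype=float)
    n = len(src)
    m = np.zeros((2 * n, 6))
    m[0::2, 0:2], m[0::2, 2] = src, 1.0      # rows for x' = a*x + b*y + c
    m[1::2, 3:5], m[1::2, 5] = src, 1.0      # rows for y' = d*x + e*y + f
    coeffs, *_ = np.linalg.lstsq(m, dst.reshape(-1), rcond=None)
    a = np.eye(3)
    a[0, :] = coeffs[:3]
    a[1, :] = coeffs[3:]
    return a                                  # conversion coefficient A of equation (5)

def warp_affine(image, a):
    """Illustrative step S411: apply equation (5) to every coordinate of one
    target image, sampling by the inverse mapping with nearest neighbours."""
    h, w = image.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w]
    coords = np.stack([xs, ys, np.ones_like(xs)]).reshape(3, -1).astype(float)
    src = np.linalg.inv(a) @ coords
    sx = np.clip(np.rint(src[0]).astype(int), 0, w - 1)
    sy = np.clip(np.rint(src[1]).astype(int), 0, h - 1)
    return image[sy, sx].reshape(image.shape)
```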
  • Next, in step S203, the image processing unit 107 performs image composition on the aligned images.
  • FIG. 7 is a flowchart illustrating image composition according to the present exemplary embodiment.
  • In step S701, the image processing unit 107 calculates contrast values of the respective aligned images. The contrast values can be calculated by a method similar to the method described above in step S401. Alternatively, the contrast values calculated in step S401 may be coordinate-transformed in consideration of the deformation, and the converted contrast values may be used.
  • In step S702, the image processing unit 107 generates a composition map. In a method of generating the composition map, the image processing unit 107 compares the contrast values of the pixels located at the same position in the images, and calculates a composition ratio corresponding to the magnitude of the contrast values. More specifically, among the pixels located at the same position, a composition ratio of 100% is given to the pixel having the largest contrast value, and a composition ratio of 0% is given to the other pixels located at that position. In other words, the following equation (6) is obtained.
  • $$A_m(x, y) = \max_{k} C_k(x, y) \qquad (6)$$
  • In the equation (6), Ck(x, y) indicates the contrast value calculated in step S701, and Am(x, y) indicates the ratio of the composition map. Further, m indicates an m-th image among the plurality of images different in in-focus position, x indicates a horizontal coordinate of the image, and y indicates a vertical coordinate of the image.
  • In step S702, the composition ratio is appropriately adjusted to prevent a boundary portion from becoming unnatural. As a result, the composition ratio of the composition map in one image is not binarized to 0% and 100% but continuously changes.
  • In step S703, the image processing unit 107 generates a composite image based on the composition map. The generated composite image has a deeper depth of field than the depth of field of each of the captured images because the composite image is generated by extracting the focusing areas in the captured images.
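  • The composition in steps S701 to S703 can be illustrated with the following Python sketch; the Laplacian-based contrast measure and the Gaussian smoothing of the winner-take-all map are assumptions standing in for the patent's own contrast calculation (step S401) and boundary adjustment, and the function name focus_stack is hypothetical:

```python
import numpy as np
from scipy import ndimage

def focus_stack(aligned_images):
    """Blend aligned images with different in-focus positions into one
    composite, following the outline of steps S701-S703. The Laplacian-based
    contrast measure and the Gaussian smoothing of the winner-take-all map
    are assumptions, not the patent's exact formulas.
    aligned_images: list of 2-D grayscale arrays of identical shape."""
    stack = np.stack([np.asarray(im, dtype=float) for im in aligned_images])  # (M, H, W)

    # Step S701: per-pixel contrast values (assumed Laplacian magnitude).
    contrast = np.stack([np.abs(ndimage.laplace(im)) for im in stack])

    # Step S702: winner-take-all composition map, as in equation (6) ...
    winner = contrast.argmax(axis=0)
    ratios = (winner[None] == np.arange(len(stack))[:, None, None]).astype(float)

    # ... then smooth it so the ratio changes continuously near boundaries.
    ratios = np.stack([ndimage.gaussian_filter(r, sigma=2.0) for r in ratios])
    ratios /= ratios.sum(axis=0, keepdims=True) + 1e-12

    # Step S703: weighted blend of the aligned images.
    return (ratios * stack).sum(axis=0)
```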
  • The above-described exemplary embodiment is merely illustrative, and various modifications can be implemented. For example, the following modifications can be made.
  • In the example described above, in the case where the detected motion vectors satisfy the predetermined condition in step S406, the conversion coefficient is calculated in step S410 by using all of the detected motion vectors. The calculation, however, is not limited thereto. For example, even in the case where the detected motion vectors satisfy the predetermined condition, in one embodiment, the control unit 101 uses the motion vectors from the focusing area detected in step S403 in the calculation of the conversion coefficient in step S410.
  • According to the first exemplary embodiment, in a case where motion vectors that satisfy the predetermined condition cannot be calculated, the blocks are re-arranged in the focusing area, which allows for alignment with higher accuracy.
  • A second exemplary embodiment is described below with reference to FIG. 8 . Unlike the first exemplary embodiment, the blocks are arranged first based on the focusing area in the second exemplary embodiment. In the following, the present exemplary embodiment will be described by focusing on differences from the first exemplary embodiment.
  • A processing flow of the focus stacking composition according to the present exemplary embodiment is similar to the processing flow in FIG. 2 according to the first exemplary embodiment. Processing flows of the imaging and the image composition are also similar to the processing flows according to the first exemplary embodiment.
  • FIG. 8 is a flowchart illustrating alignment processing according to the present exemplary embodiment. Processing in steps S801 to S803 is similar to the processing in steps S401 to S403 according to the first exemplary embodiment.
  • In step S804, the control unit 101 arranges blocks based on the focusing area detected in step S803. A specific arrangement method may be, for example, the method described with reference to FIGS. 6A to 6E in the first exemplary embodiment.
  • In step S805, the control unit 101 detects a motion vector of each of the target images based on the blocks arranged in step S804.
  • In step S809, the control unit 101 determines whether motion vectors have been detected from all of the images. In a case where motion vectors have been detected from all of the images (YES in step S809), the processing proceeds to step S810. In a case where there is an image from which a motion vector has not been detected (NO in step S809), the processing returns to step S802.
  • Processing in steps S810 and S811 is similar to the processing in steps S410 and S411 according to the first exemplary embodiment.
  • Further, as a modification of the present exemplary embodiment, the following implementation method is applicable. FIG. 9 is a flowchart illustrating a modification of the alignment processing according to the present exemplary embodiment. In the example illustrated in FIG. 8 , after the motion vector is detected in step S805, the processing proceeds to the step of calculating the conversion coefficient. In the example illustrated in FIG. 9 , the processing includes a step of determining whether the motion vectors satisfy the predetermined condition. In other words, in the example illustrated in FIG. 9 , it is determined whether the detected motion vectors satisfy the predetermined condition in a manner similar to step S406 in the first exemplary embodiment. In a case where the detected motion vectors do not satisfy the predetermined condition (NO in step S906), the blocks are re-arranged in step S907. In the present exemplary embodiment, since the blocks are arranged in consideration of the focusing area in step S904, in one embodiment, the blocks are arranged with higher density in the rearrangement in step S907.
  • According to the second exemplary embodiment, the detection accuracy of motion vectors can be improved by arranging the blocks used for motion vector detection based on the position of a focusing area.
  • Other Exemplary Embodiments
  • Although the above-described exemplary embodiments have been described based on implementation in a digital camera, the implementation is not limited to the digital camera. For example, the exemplary embodiments may be implemented in a mobile terminal with a built-in imaging device, or in a network camera that is capable of capturing an image.
  • The exemplary embodiments of the disclosure can be realized by supplying a program for realizing one or more functions of the exemplary embodiments to a system or an apparatus via a network or a storage medium and causing one or more processors of a computer of the system or the apparatus to read out and execute the program. Further, the exemplary embodiments of the disclosure can be realized by a circuit (e.g., ASIC) capable of realizing one or more functions.
  • Further features of the disclosure will become apparent from the following description of exemplary embodiments with reference to the attached drawings.
  • Embodiment(s) of the disclosure can also be realized by a computer of a system or apparatus that reads out and executes computer executable instructions (e.g., one or more programs) recorded on a storage medium (which may also be referred to more fully as a ‘non-transitory computer-readable storage medium’) to perform the functions of one or more of the above-described embodiment(s) and/or that includes one or more circuits (e.g., application specific integrated circuit (ASIC)) for performing the functions of one or more of the above-described embodiment(s), and by a method performed by the computer of the system or apparatus by, for example, reading out and executing the computer executable instructions from the storage medium to perform the functions of one or more of the above-described embodiment(s) and/or controlling the one or more circuits to perform the functions of one or more of the above-described embodiment(s). The computer may comprise one or more processors (e.g., central processing unit (CPU), micro processing unit (MPU)) and may include a network of separate computers or separate processors to read out and execute the computer executable instructions. The computer executable instructions may be provided to the computer, for example, from a network or the storage medium. The storage medium may include, for example, one or more of a hard disk, a random-access memory (RAM), a read only memory (ROM), a storage of distributed computing systems, an optical disk (such as a compact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™), a flash memory device, a memory card, and the like.
  • While the disclosure has been described with reference to exemplary embodiments, it is to be understood that the disclosure is not limited to the disclosed exemplary embodiments. The scope of the following claims is to be accorded the broadest interpretation so as to encompass all such modifications and equivalent structures and functions.

Claims (16)

What is claimed is:
1. An apparatus, comprising:
at least one memory configured to store instructions; and
at least one processor in communication with the at least one memory and configured to execute the instructions to:
arrange a region in each of a plurality of images different in in-focus position in an optical axis direction;
detect a feature amount from the region in each of the plurality of images; and
align the plurality of images by using the feature amount, and
in a case where the detected feature amount does not satisfy a predetermined condition, re-arrange the region.
2. The apparatus according to claim 1, wherein the at least one processor further executes instructions to generate a composite image by extracting respective focusing areas of the plurality of images.
3. The apparatus according to claim 2, wherein the composite image has a depth of field deeper than a depth of field of each of the plurality of images.
4. The apparatus according to claim 1, wherein the feature amount is a motion vector.
5. The apparatus according to claim 1, wherein the at least one processor further executes instructions to arrange the region based on a focusing area of each of the plurality of images.
6. The apparatus according to claim 2, wherein the at least one processor further executes instructions to detect the focusing area of each of the plurality of images by using contrast of the plurality of images.
7. The apparatus according to claim 4, wherein the at least one processor further executes instructions to:
arrange a plurality of blocks in the region in each of the plurality of images; and
detect a feature amount from each of the plurality of blocks in one image of the plurality of images.
8. The apparatus according to claim 7, wherein the at least one processor further executes instructions to arrange a plurality of blocks similar to the plurality of blocks in each of the plurality of blocks.
9. The apparatus according to claim 8, wherein the at least one processor further executes instructions to re-arrange the plurality of blocks based on a focusing area.
10. The apparatus according to claim 9, wherein the at least one processor further executes instructions to re-arrange the plurality of blocks with at least one of a changed number of blocks, a changed size of each of the blocks, and changed positions of the blocks.
11. The apparatus according to claim 7, wherein the predetermined condition is a condition where a number of the motion vectors detected is greater than a predetermined number.
12. The apparatus according to claim 7, wherein the predetermined condition is a condition where a magnitude of each of the motion vectors is greater than a predetermined threshold value.
13. The apparatus according to claim 7, wherein the predetermined condition is a condition where deviation of the motion vectors is less than a predetermined threshold value.
14. An apparatus comprising:
a sensor to pick up a plurality of images at different in-focus positions in an optical axis direction;
at least one memory configured to store instructions; and
at least one processor in communication with the at least one memory and configured to execute the instructions to:
arrange a region in each of the plurality of images;
detect a feature amount from the region in each of the plurality of images; and
align the plurality of images by using the feature amount, and
in a case where the detected feature amount does not satisfy a predetermined condition, re-arrange the region.
15. A method comprising:
arranging a region in each of a plurality of images different in in-focus position in an optical axis direction;
detecting a feature amount from the region in each of the plurality of images; and
aligning the plurality of images by using the feature amount, and
in a case where the detected feature amount does not satisfy a predetermined condition, re-arranging the region.
16. A non-transitory computer-readable storage medium which stores a program for causing a computer of an apparatus to execute a method, the method comprising:
arranging a region in each of a plurality of images different in in-focus position in an optical axis direction;
detecting a feature amount from the region in each of the plurality of images; and
aligning the plurality of images by using the feature amount, and
in a case where the detected feature amount does not satisfy a predetermined condition, re-arranging the region.
US18/054,501 2020-01-30 2022-11-10 Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images Abandoned US20230066494A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US18/054,501 US20230066494A1 (en) 2020-01-30 2022-11-10 Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2020014082A JP2021121067A (en) 2020-01-30 2020-01-30 Image processing device, imaging apparatus, image processing method, program, and recording medium
JP2020-014082 2020-01-30
US17/153,637 US11616902B2 (en) 2020-01-30 2021-01-20 Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images
US18/054,501 US20230066494A1 (en) 2020-01-30 2022-11-10 Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US17/153,637 Continuation US11616902B2 (en) 2020-01-30 2021-01-20 Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images

Publications (1)

Publication Number Publication Date
US20230066494A1 true US20230066494A1 (en) 2023-03-02

Family

ID=77063033

Family Applications (2)

Application Number Title Priority Date Filing Date
US17/153,637 Active 2041-04-14 US11616902B2 (en) 2020-01-30 2021-01-20 Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images
US18/054,501 Abandoned US20230066494A1 (en) 2020-01-30 2022-11-10 Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US17/153,637 Active 2041-04-14 US11616902B2 (en) 2020-01-30 2021-01-20 Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images

Country Status (2)

Country Link
US (2) US11616902B2 (en)
JP (1) JP2021121067A (en)

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6788802B2 (en) * 1998-05-21 2004-09-07 Sanyo Electric Co., Ltd. Optical flow estimation method and image synthesis method
JP4678603B2 (en) * 2007-04-20 2011-04-27 富士フイルム株式会社 Imaging apparatus and imaging method
JP5713752B2 (en) * 2011-03-28 2015-05-07 キヤノン株式会社 Image processing apparatus and control method thereof
JP2013020527A (en) * 2011-07-13 2013-01-31 Sony Corp Image processing device, method, and program
US10051274B2 (en) * 2013-10-25 2018-08-14 Canon Kabushiki Kaisha Image processing apparatus, method of calculating information according to motion of frame, and storage medium
JP6280780B2 (en) 2014-03-25 2018-02-14 オリンパス株式会社 IMAGING DEVICE, IMAGING DEVICE CONTROL METHOD, AND PROGRAM
JP6347675B2 (en) * 2014-06-06 2018-06-27 キヤノン株式会社 Image processing apparatus, imaging apparatus, image processing method, imaging method, and program

Also Published As

Publication number Publication date
US11616902B2 (en) 2023-03-28
JP2021121067A (en) 2021-08-19
US20210243377A1 (en) 2021-08-05

Similar Documents

Publication Publication Date Title
US10313577B2 (en) Focus detection apparatus, focus detection method, and image capturing apparatus
US10511779B1 (en) Image capturing apparatus, method of controlling same, and storage medium
US10116865B2 (en) Image processing apparatus and image processing method for calculating motion vector between images with different in-focus positions
US11196925B2 (en) Image processing apparatus that detects motion vectors, method of controlling the same, and storage medium
US11044396B2 (en) Image processing apparatus for calculating a composite ratio of each area based on a contrast value of images, control method of image processing apparatus, and computer-readable storage medium
US11375110B2 (en) Image processing apparatus, image pickup apparatus, image processing method, and non-transitory computer-readable storage medium
US11653107B2 (en) Image pick up apparatus, image pick up method, and storage medium
US11616902B2 (en) Apparatus to perform alignment to images, image processing method to perform alignment to images, and computer readable non-transitory memory to perform alignment to images
US11206344B2 (en) Image pickup apparatus and storage medium
JP2023165555A (en) Image processing apparatus, imaging apparatus, control method, and program
US20200311893A1 (en) Image processing apparatus with images having different in-focus positions, image pickup apparatus, image processing method, and computer readable storage medium
US11206350B2 (en) Image processing apparatus, image pickup apparatus, image processing method, and storage medium
US12100122B2 (en) Image processing apparatus for combining a plurality of images, imaging apparatus, imaging method, and recording medium
US20240089597A1 (en) Image capturing apparatus for capturing and compositing images different in in-focus position, control method, and storage medium
US11057560B2 (en) Image pickup apparatus, image pickup method, and recording medium
US11206348B2 (en) Image apparatus to generate a combined image, control method for controlling an apparatus to generate a combined image, and storage medium
US11843867B2 (en) Imaging apparatus, imaging method, and storage medium for correcting brightness of an image based on a predetermined value of exposure
US20230419459A1 (en) Image processing apparatus, imaging apparatus, image processing method, and recording medium background
US11778321B2 (en) Image capturing apparatus capable of performing omnifocal photographing, method of controlling same, and storage medium
JP2023026997A (en) Imaging device, imaging method, program, and recording medium

Legal Events

Date Code Title Description
STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION