Detailed Description
To explain the technical content, objectives, and effects of the present invention in detail, the following description is given with reference to the accompanying drawings in combination with the embodiments.
Referring to Fig. 1, a method for fusing super depth-of-field images includes the following steps:
S1, aligning an image sequence to be fused, wherein the focus point of each image in the image sequence is different;
S2, performing Laplacian pyramid splitting on each image in the aligned image sequence, extracting the high-frequency and low-frequency information of each image, and obtaining a high-frequency information set and a low-frequency information set corresponding to the image sequence;
S3, obtaining synthesized high-frequency information from the high-frequency information set, performing guided filtering on the low-frequency information set to obtain synthesized low-frequency information, and performing Laplacian pyramid reconstruction from the synthesized high-frequency and low-frequency information to obtain a super depth-of-field image.
From the above description, the beneficial effects of the present invention are as follows: the images to be fused are split through the Laplacian pyramid to obtain a high-frequency information set and a low-frequency information set, guided filtering is performed on the low-frequency information set to obtain synthesized low-frequency information, and the synthesized high-frequency and low-frequency information are reconstructed to obtain the super depth-of-field image.
Further, the step S1 includes:
extracting feature points of each image in the image sequence through the SURF matching algorithm, and screening out a preset number of matching points;
calculating SURF feature descriptors of a preset dimension for the preset number of matching points, and performing coarse matching between images according to the SURF feature descriptors;
and calculating a transformation matrix between the coarsely matched images through the RANSAC algorithm, and aligning the corresponding images according to the transformation matrix.
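As an illustration of the RANSAC step, the following numpy-only sketch estimates a transformation from coarsely matched (and partly wrong) point pairs. For brevity it fits a pure translation rather than a full transformation matrix, and the function name `ransac_translation` is ours, not the patent's; in practice the SURF extraction and RANSAC fitting would typically be done with a library such as OpenCV.

```python
import numpy as np

def ransac_translation(src, dst, n_iters=200, tol=2.0, rng=None):
    """Estimate a 2-D translation between matched point sets with RANSAC.

    src, dst: (N, 2) arrays of roughly matched coordinates, which may
    contain outliers (as after a nearest-neighbor coarse match).
    Returns the translation vector supported by the most inliers.
    """
    rng = np.random.default_rng(rng)
    best_t, best_inliers = np.zeros(2), 0
    for _ in range(n_iters):
        i = rng.integers(len(src))          # minimal sample: one pair
        t = dst[i] - src[i]                 # candidate translation
        err = np.linalg.norm(src + t - dst, axis=1)
        inliers = int((err < tol).sum())    # pairs consistent with t
        if inliers > best_inliers:
            best_t, best_inliers = t, inliers
    return best_t
```

The same loop generalizes to affine or projective models by sampling enough pairs per iteration to determine the model (three for affine, four for a homography).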
As described above, the SURF matching algorithm is used to extract image feature points, compute their feature descriptors, and perform coarse matching, and the RANSAC algorithm is used to calculate the transformation matrix between the coarsely matched images; accurate matching between the images to be fused is thus achieved, improving the accuracy of the subsequent fusion.
Further, the step S3 of obtaining the synthesized high-frequency information according to the high-frequency information set, and performing the guided filtering process on the low-frequency information set to obtain the synthesized low-frequency information includes:
selecting the high-frequency information with the maximum absolute value in the high-frequency information set as synthesized high-frequency information;
calculating the weight corresponding to each low-frequency information in the low-frequency information set by adopting a guided filtering method;
and weighting each piece of low-frequency information in the low-frequency information set by its corresponding weight and summing, to obtain the synthesized low-frequency information.
As described above, the high-frequency information with the largest absolute value is selected as the synthesized high-frequency information, the weight corresponding to each piece of low-frequency information in the low-frequency information set is calculated by guided filtering, and each piece of low-frequency information is weighted by its weight to obtain the synthesized low-frequency information. This prevents the small-particle phenomenon in low-frequency synthesis and avoids small particles in the fused image, so that the synthesized super depth-of-field image is clear, fine, and transparent, and can present more detailed information.
Further, after the step S3 of performing the guided filtering process on the low frequency information set to obtain the synthesized low frequency information, the method further includes:
and performing region growing of a preset neighborhood on the synthesized low-frequency information, judging whether the grown region of each pixel point in the synthesized low-frequency information is smaller than a preset value, and if so, removing the pixel point.
From the above description, it can be determined whether the synthesized low-frequency information has a "hole" by the region growing method, and if so, the "hole" is removed, so that an isolated small region can be removed, and the integrity of the fused image is ensured.
Further, the step S2 includes:
respectively carrying out Gaussian filtering on each image in the aligned image sequence according to a preset level, extracting high-frequency information of each layer of each image and low-frequency information of the highest layer of each image, and obtaining a high-frequency information set and a low-frequency information set corresponding to the image sequence;
in step S3, selecting the high frequency information with the largest absolute value in the high frequency information set as the synthesized high frequency information includes:
selecting, for each layer, the high-frequency information with the largest absolute value among the high-frequency information of that layer in the high-frequency information set as the synthesized high-frequency information of that layer;
in the step S3, the performing laplacian pyramid reconstruction according to the synthesized high-frequency information and low-frequency information to obtain a super depth-of-field image includes:
and performing the following recursion on the synthesized low-frequency information from the highest layer to the bottom: and after the low-frequency information is up-sampled and subjected to Gaussian filtering, adding the high-frequency information of the corresponding level to serve as the low-frequency information of the next level.
As can be seen from the above description, by performing laplacian pyramid splitting and reconstruction of a preset level on an image sequence to be fused, features and details on different frequency bands of different decomposition layers can be extracted and displayed, features and details from different images can be fused together, and the fusion effect is good.
Referring to Fig. 2, a super-depth-of-field image fusion terminal includes a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements steps S1 to S3 of the fusion method described above; the refinements of steps S1 to S3 and their beneficial effects are the same as those described above for the method and are not repeated here.
Example one
Referring to Fig. 1, a method for fusing super depth-of-field images includes the following steps:
S1, aligning an image sequence to be fused, wherein the focus point of each image in the image sequence is different;
specifically, feature points of each image in the image sequence are extracted through the SURF matching algorithm, and a preset number of matching points are screened out;
SURF feature descriptors of a preset dimension are calculated for the preset number of matching points, and coarse matching between images is performed according to the SURF feature descriptors;
a transformation matrix between the coarsely matched images is calculated through the RANSAC algorithm, and the corresponding images are aligned according to the transformation matrix;
preferably, all feature points of each image may be extracted through the SURF matching algorithm, 500 matching points screened out, and 64-dimensional SURF feature descriptors calculated for the 500 matching points;
coarse matching between images is performed according to the SURF feature descriptors, using nearest-neighbor coarse matching;
finally, a transformation matrix between the coarsely matched images is calculated through the RANSAC algorithm, and the corresponding images are aligned through the transformation matrix;
during image alignment, the above alignment process may be performed pairwise between two images, so that all images in the image sequence are aligned;
S2, performing Laplacian pyramid splitting on each image in the aligned image sequence, extracting the high-frequency and low-frequency information of each image, and obtaining a high-frequency information set and a low-frequency information set corresponding to the image sequence;
specifically, Gaussian filtering is performed on each image in the aligned image sequence according to a preset number of levels, the high-frequency information of each layer of each image and the low-frequency information of the highest layer of each image are extracted, and the high-frequency information set and the low-frequency information set corresponding to the image sequence are obtained;
the specific implementation operation is as follows:
S2.1, take the original image A as the bottommost layer image LA_0 (layer 0 of the Laplacian pyramid) and convolve it with a Gaussian kernel W to obtain the image GA_0;
S2.2, subtract GA_0 from LA_0 to obtain the high-frequency information HA_0 of layer 0;
S2.3, downsample GA_0 (removing every other row and column) to obtain the next-layer image LA_1 (layer 1 of the Laplacian pyramid); repeating S2.1 and S2.2 yields the high-frequency information HA_0, HA_1, ..., HA_N of each layer and the low-frequency information LA_N of the highest layer, where N is the preset number of levels;
the decomposition of each image is realized through steps S2.1 to S2.3; if there are M images, M groups of high-frequency information HA_0, HA_1, ..., HA_N and top-layer low-frequency information LA_N are obtained, forming the high-frequency information set and the low-frequency information set corresponding to the image sequence;
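The decomposition of steps S2.1 to S2.3 can be sketched in numpy as follows. This is a simplified illustration: the 5x5 Gaussian, the edge padding, and the helper names are our assumptions, `highs[i]` plays the role of HA_i, the returned `low` plays the role of LA_N, and for simplicity the sketch keeps high-frequency layers at indices 0 to N-1 plus the top low frequency.

```python
import numpy as np

def gaussian_blur(img, sigma=1.0, radius=2):
    """Separable Gaussian filter (window = 2*radius+1), edge-padded."""
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x**2 / (2 * sigma**2))
    k /= k.sum()
    pad = np.pad(img, radius, mode="edge")
    rows = np.apply_along_axis(lambda r: np.convolve(r, k, "valid"), 1, pad)
    return np.apply_along_axis(lambda c: np.convolve(c, k, "valid"), 0, rows)

def laplacian_split(img, levels):
    """Return per-level high-frequency maps and the top-layer low frequency,
    as in steps S2.1-S2.3: HA_i = LA_i - GA_i, LA_{i+1} = downsample(GA_i)."""
    highs, low = [], img.astype(float)
    for _ in range(levels):
        g = gaussian_blur(low)
        highs.append(low - g)          # high frequency of this layer
        low = g[::2, ::2]              # drop every other row/column
    return highs, low
```

Applying `laplacian_split` to every aligned image yields the high-frequency and low-frequency information sets described above.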
S3, obtaining synthesized high-frequency information from the high-frequency information set, performing guided filtering on the low-frequency information set to obtain synthesized low-frequency information, and performing Laplacian pyramid reconstruction from the synthesized high-frequency and low-frequency information to obtain a super depth-of-field image;
in step S3, the obtaining of the synthesized high-frequency information according to the high-frequency information set includes:
selecting the high-frequency information with the maximum absolute value in the high-frequency information set as synthesized high-frequency information;
specifically, for each layer, the high-frequency information with the largest absolute value among the high-frequency information of that layer in the high-frequency information set is selected as the synthesized high-frequency information of that layer;
assuming there are two images in total, the obtained high-frequency information set is {HA_0, HA_1, ..., HA_N, HB_0, HB_1, ..., HB_N}, and the synthesized high-frequency information is {H_0, H_1, ..., H_N}, each level having its corresponding synthesized high-frequency information: H_i(m, n) is whichever of HA_i(m, n) and HB_i(m, n) has the larger absolute value, i = 0, 1, ..., N, where (m, n) denotes the pixel position;
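For two images, the per-pixel selection of H_i can be written directly; `fuse_high` is an illustrative name:

```python
import numpy as np

def fuse_high(ha, hb):
    """Per-pixel fusion of two high-frequency maps of the same layer:
    keep whichever coefficient has the larger absolute value."""
    return np.where(np.abs(ha) >= np.abs(hb), ha, hb)
```

Applied layer by layer, this produces the synthesized high-frequency information {H_0, ..., H_N}.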
preferably, a 5-layer Laplacian pyramid decomposition may be performed: a Gaussian filter with a window of 5 and σ = 1 is applied to the original image, the filtered image is subtracted from the original image to serve as the high-frequency information of the corresponding layer, and the filtered image is decimated (every other row and column) to serve as the input image of the next layer;
Gaussian filtering with a window of 3 and σ = 1 is then applied to the high-frequency information of each layer, and the high-frequency information with the largest absolute value is selected as the synthesized high-frequency information of each layer;
the step of performing guided filtering processing on the low-frequency information set to obtain synthesized low-frequency information comprises:
calculating the weight corresponding to each low-frequency information in the low-frequency information set by adopting a guided filtering method;
weighting and summing each low-frequency information in the low-frequency information set and the corresponding weight thereof to obtain synthesized low-frequency information;
specifically, assuming there are two images in total, pyramid decomposition yields the top-layer low-frequency information LA_N and LB_N of the two images;
the synthesis weights W1 and W2 of LA_N and LB_N are calculated by guided filtering, and the synthesized low-frequency information is L_N = W1 × LA_N + W2 × LB_N;
the calculation process of the guided filtering is as follows:
LA_N is taken as the input image P and the weight W1 is calculated with G as the guide map (W2 is obtained in the same way with LB_N as the input image), where G(m, n) = max(LA_N(m, n), LB_N(m, n)) and (m, n) denotes the pixel position.
A guide image G, an input image P, and an output image Q are set. The goals of guided filtering are: making the input P and the output Q as identical as possible, while the texture part is similar to the guide map G.
To meet the first goal, making the input P and the output Q as similar as possible, the squared difference min Σ_i (q_i − p_i)² is minimized.
To satisfy the second goal, the texture of the output image Q is required to be similar to that of the guide map G, i.e. ∇Q = a∇G.
Integrating gives Q = aG + b.
Consider a small window W_k; within W_k it is assumed that a and b remain unchanged, written a_k and b_k. The pixels in W_k then satisfy
q_i = a_k g_i + b_k, i ∈ W_k (1)
Substituting (1) into the first goal, so that the pixels in the window satisfy the above two conditions simultaneously, gives the cost function
E(a_k, b_k) = Σ_{i∈W_k} [ (a_k g_i + b_k − p_i)² + ε a_k² ] (2)
where ε is a regularization term penalizing large a_k; preferably ε = 0.01, and the guide window is 3.
To minimize (2), the partial derivatives with respect to a_k and b_k must vanish:
Σ_{i∈W_k} g_i (a_k g_i + b_k − p_i) + |W| ε a_k = 0 (3)
Σ_{i∈W_k} (a_k g_i + b_k − p_i) = 0 (4)
where |W| is the total number of pixels in the window W_k. Solving gives
a_k = ( (1/|W|) Σ_{i∈W_k} g_i p_i − μ_k p̄_k ) / (σ_k² + ε) (5)
b_k = p̄_k − a_k μ_k (6)
where p̄_k is the mean of the input image P in the window W_k, μ_k and σ_k² are the mean and variance of the guide map G in the window W_k, and the numerator of (5) is the covariance of the guide map G and the input image P in W_k.
After a_k and b_k are calculated, the output pixels in the window W_k can be computed according to (1). For a pixel i, however, the output value q_i is related to all windows W_k covering the pixel i, and when W_k differs, the resulting q_i also differs; a simple strategy is to average all possible q_i values. Calculating a_k and b_k for all windows W_k covering i (there are |W| such windows) gives
q_i = (1/|W|) Σ_{k: i∈W_k} (a_k g_i + b_k) (7)
q_i = ā_i g_i + b̄_i (8)
where ā_i and b̄_i are the averages of a_k and b_k over all windows covering pixel i.
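The quantities in (5) to (8) are local means, variances, and covariances, so the whole filter reduces to a handful of box (mean) filters. The following numpy sketch implements it that way; the edge padding and the function names are our assumptions, not the patent's exact implementation.

```python
import numpy as np

def box(img, r):
    """Mean over a (2r+1)x(2r+1) window, edge-padded, via an integral image."""
    p = np.pad(img, r, mode="edge")
    c = np.cumsum(np.cumsum(p, 0), 1)
    c = np.pad(c, ((1, 0), (1, 0)))            # zero row/column for differencing
    w = 2 * r + 1
    s = c[w:, w:] - c[:-w, w:] - c[w:, :-w] + c[:-w, :-w]
    return s / w**2

def guided_filter(G, P, r=1, eps=0.01):
    """Guided filter: a_k = cov_k(G,P)/(var_k(G)+eps), b_k = mean_k(P)-a_k*mean_k(G),
    output q_i = mean(a)*g_i + mean(b), as in equations (5)-(8)."""
    G = np.asarray(G, dtype=float)
    P = np.asarray(P, dtype=float)
    mu_g, mu_p = box(G, r), box(P, r)
    var_g = box(G * G, r) - mu_g * mu_g        # variance of the guide
    cov_gp = box(G * P, r) - mu_g * mu_p       # covariance of guide and input
    a = cov_gp / (var_g + eps)                 # equation (5)
    b = mu_p - a * mu_g                        # equation (6)
    return box(a, r) * G + box(b, r)           # equations (7)-(8)
```

With G = P and ε = 0 the output reproduces the input wherever the local variance is nonzero, consistent with the edge-preserving analysis of the a_k = σ_k²/(σ_k²+ε) case.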
Specifically, when the guide map G is the same as the input image P, the guided filter exhibits an edge-preserving smoothing characteristic, which is analyzed as follows.
When G = P, clearly p̄_k = μ_k and the covariance in (5) becomes the variance σ_k², so from (5) and (6):
a_k = σ_k² / (σ_k² + ε), b_k = μ_k (1 − a_k)
When ε = 0, a_k = 1 and b_k = 0, i.e. the output is the same as the input image. If ε > 0, consider two cases:
First, high variance: if the image P varies considerably in the window W_k, then σ_k² ≫ ε, so a_k ≈ 1 and b_k ≈ 0.
Second, flat patch: if the image P is nearly constant in the window W_k, then σ_k² ≪ ε, so a_k ≈ 0 and b_k ≈ μ_k. If the whole input image is as flat as the window W_k, then after a_k and b_k are averaged, ā_i ≈ 0, b̄_i ≈ μ_k, and q_i ≈ μ_k.
Thus, when a pixel lies in a high-variance window its output value stays unchanged, while in a flat region its output value becomes the average of the surrounding window pixels. The criterion separating "high variance" from "flat" is controlled by the parameter ε: a window whose variance is much smaller than ε is smoothed, and one whose variance is much larger is preserved. The window size determines how large a neighborhood of pixels is referenced when computing the variance and mean.
In this way, the parameters of the guided filtering can be calculated according to equations (5) to (8), and the output image Q obtained.
The filtering result of the guided filter at the pixel point i can be expressed as a weighted average
q_i = Σ_j W_ij(G) p_j (9)
where i and j are both pixel indices, and the filter weight W_ij is a function of the guide map G and is independent of P.
To calculate the filter weights, substitute (6) into (8) and eliminate b:
q_i = (1/|W|) Σ_{k: i∈W_k} ( a_k (g_i − μ_k) + p̄_k ) (10)
Calculating the partial derivative with respect to p_j:
W_ij = ∂q_i/∂p_j = (1/|W|) Σ_{k: i∈W_k} ( (g_i − μ_k) ∂a_k/∂p_j + ∂p̄_k/∂p_j ) (11)
wherein, for j ∈ W_k,
∂p̄_k/∂p_j = 1/|W| (12)
∂a_k/∂p_j = (g_j − μ_k) / ( |W| (σ_k² + ε) ) (13)
and both derivatives are 0 when j is not in the window W_k. Substituting (12) and (13) into (11) gives
W_ij(G) = (1/|W|²) Σ_{k: (i,j)∈W_k} ( 1 + (g_i − μ_k)(g_j − μ_k) / (σ_k² + ε) ) (14)
i.e. the weight of the output image, so that the output image is obtained by weighting the input image pixel by pixel: Q(i, j) = W(i, j) × P(i, j).
Here W_ij is the weight corresponding to the pixel point (i, j) of the low-frequency information (i.e., the input image). For example, taking the low-frequency information LA_N and LB_N as the input images, the weights WA_ij and WB_ij of each pixel point in LA_N and LB_N are calculated by the guided filtering method described above; the weighted outputs are QA = WA_ij × LA_N(i, j) and QB = WB_ij × LB_N(i, j), and the finally synthesized low-frequency information is L_N = QA + QB.
The performing Laplacian pyramid reconstruction according to the synthesized high-frequency information and low-frequency information to obtain a super depth-of-field image includes:
and performing the following recursion on the synthesized low-frequency information from the highest layer to the bottom: performing up-sampling and Gaussian filtering on the low-frequency information, and adding the high-frequency information of the corresponding level as the low-frequency information of the next level;
specifically, the synthesized low-frequency information is up-sampled by a factor of 2 and filtered with a Gaussian of window 5 and σ = 1, and the synthesized top-layer high-frequency information H_N is added to obtain the input G_(N−1) of the next layer;
G_(N−1) is then up-sampled by 2 and filtered with a Gaussian of window 5 and σ = 1, and the synthesized high-frequency information H_(N−1) of layer N−1 is added to obtain the input G_(N−2) of the next layer; the recursion proceeds from the upper layers of the pyramid to the bottom layer, finally yielding a fused super depth-of-field image as large as the input images.
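The reconstruction recursion can be sketched as follows. Nearest-neighbor upsampling followed by a 5x5 Gaussian is a simplification of the up-sampling described above, and the function names are ours, so this is illustrative rather than an exact inverse of the decomposition.

```python
import numpy as np

def gaussian_blur(img, sigma=1.0, radius=2):
    """Separable Gaussian filter (window 5 for radius 2), edge-padded."""
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x**2 / (2 * sigma**2))
    k /= k.sum()
    pad = np.pad(img, radius, mode="edge")
    rows = np.apply_along_axis(lambda r: np.convolve(r, k, "valid"), 1, pad)
    return np.apply_along_axis(lambda c: np.convolve(c, k, "valid"), 0, rows)

def laplacian_reconstruct(highs, low):
    """Recurse from the top layer down: upsample by 2, Gaussian-filter,
    add the synthesized high frequency of that level.
    highs[0] is the bottom (largest) level."""
    img = low.astype(float)
    for h in reversed(highs):
        up = np.repeat(np.repeat(img, 2, axis=0), 2, axis=1)
        up = gaussian_blur(up)[: h.shape[0], : h.shape[1]]  # crop to level size
        img = up + h
    return img
```

The loop mirrors the recursion above: each pass produces the low-frequency input of the next lower layer until the bottom layer, whose size matches the input images.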
Example two
The difference between the present embodiment and the first embodiment is:
in step S3, after the performing the guided filtering process on the low frequency information set to obtain the synthesized low frequency information, the method further includes:
performing region growing of a preset neighborhood on the synthesized low-frequency information, judging whether the grown region of each pixel point in the synthesized low-frequency information is smaller than a preset value, and if so, removing the pixel point;
preferably, 4-neighborhood region growing is performed on the synthesized low-frequency information; if the grown region of a point contains fewer than 10000 pixel points, the point is judged to be a hole and is removed.
Specifically, after the guided filtering of the low-frequency information is calculated, the weights W1 and W2 of the two pieces of low-frequency information are obtained, and W1 and W2 are compared at each pixel point (i, j) to obtain a binary matrix C (for example, C_ij = 1 where W1_ij > W2_ij, and C_ij = 0 otherwise).
Four-neighborhood region growing is performed for each point of the matrix C with value 1: starting from C_ij, check whether the four points above, below, to the left, and to the right are 1, continue the growth from each point that is 1, and count the number N of connected points with value 1 until the boundary of the region (or of the matrix C) is reached; if N < 10000, the region containing C_ij is judged to be a hole and is removed.
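The four-neighborhood region growing with a size threshold amounts to removing small connected components. A breadth-first sketch (the function name is ours):

```python
from collections import deque
import numpy as np

def remove_small_regions(mask, min_size):
    """4-neighborhood region growing: any connected region of 1s smaller
    than min_size is treated as a hole and set to 0."""
    mask = mask.astype(bool).copy()
    seen = np.zeros_like(mask)
    H, W = mask.shape
    for i in range(H):
        for j in range(W):
            if mask[i, j] and not seen[i, j]:
                comp, q = [(i, j)], deque([(i, j)])   # grow a new region
                seen[i, j] = True
                while q:
                    y, x = q.popleft()
                    for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                        ny, nx = y + dy, x + dx
                        if 0 <= ny < H and 0 <= nx < W and mask[ny, nx] and not seen[ny, nx]:
                            seen[ny, nx] = True
                            comp.append((ny, nx))
                            q.append((ny, nx))
                if len(comp) < min_size:              # too small: a hole
                    for y, x in comp:
                        mask[y, x] = False
    return mask.astype(np.uint8)
```

In the embodiment above, `mask` would be the matrix C and `min_size` would be 10000.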
Example three
Referring to Fig. 2, a super-depth-of-field image fusion terminal 1 includes a memory 2, a processor 3, and a computer program stored in the memory 2 and executable on the processor 3, where the processor 3 implements the steps in the first embodiment when executing the computer program.
Example four
Referring to Fig. 2, a super-depth-of-field image fusion terminal 1 includes a memory 2, a processor 3, and a computer program stored in the memory 2 and executable on the processor 3, where the processor 3 implements the steps in the second embodiment when executing the computer program.
In summary, in the fusion method and terminal for super depth-of-field images provided by the present invention, the images to be fused are split through the Laplacian pyramid to obtain a high-frequency information set and a low-frequency information set; the high-frequency information set is synthesized by taking the maximum absolute value; the low-frequency information set is processed by guided filtering to obtain the weights of the low-frequency information, which is then synthesized by weighting; isolated small regions are removed from the synthesized low-frequency information by region growing; and the synthesized high-frequency and low-frequency information are reconstructed to obtain the super depth-of-field image. This solves the problems of a large number of particles and water stains appearing in depth-of-field synthesized images with existing fusion methods, thoroughly eliminates small particle blocks in the fused image, and makes the obtained super depth-of-field image closer to the original images: clear, fine, and transparent, presenting more detailed information.
The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent changes made by using the contents of the present specification and the drawings, or applied directly or indirectly to the related technical fields, are included in the scope of the present invention.