CN111784572B - Image fusion and super-resolution joint implementation method based on discriminant dictionary learning - Google Patents
- Publication number: CN111784572B (application CN202010425926.0A)
- Authority
- CN
- China
- Prior art keywords
- image
- low
- dictionary
- resolution
- sparse
- Prior art date
- Legal status: Active (assumed; Google has not performed a legal analysis)
Classifications
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
- G06T3/4053—Scaling of whole images or parts thereof based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
- G06T5/50—Image enhancement or restoration using two or more images, e.g. averaging or subtraction
- G06T2207/20081—Training; Learning
- G06T2207/20221—Image fusion; Image merging
Abstract
The invention relates to a method for jointly realizing image fusion and super-resolution based on discriminative dictionary learning, belonging to the technical field of digital image processing. Specifically, two pairs of low-rank and sparse dictionaries, together with a conversion matrix between the coding coefficients of high-resolution and low-resolution images, are jointly trained first. One pair of dictionaries is used to represent the low-rank and sparse components of the input images; the other pair is used to reconstruct the low-rank and sparse components of the high-resolution fused image; and the conversion matrix establishes the latent relationship between the high-resolution and low-resolution images. Then, a low-rank and sparse separation model is constructed that effectively decomposes the input images into low-rank and sparse components, so that a high-resolution fused image can be constructed through the different dictionaries. The invention thereby realizes image fusion and super-resolution reconstruction jointly. Experimental results show that the invention achieves better fusion performance in both visual effect and objective indexes.
Description
Technical Field
The invention relates to a method for jointly realizing image fusion and super-resolution based on discriminative dictionary learning, belonging to the technical field of digital image processing.
Background
Image fusion can integrate complementary information about the same scene acquired by different sensors into one image, providing a more comprehensive and accurate description of the scene and thereby facilitating the identification of events and objects. In recent years, this technology has received increasing attention from researchers, and significant research progress has been made.
Existing image fusion methods can be roughly classified into three categories: multiscale transform (MST) based methods, dictionary learning (DL) based methods, and deep learning based methods. Among MST-based methods, commonly used transforms include the wavelet transform, dual-tree complex wavelet transform (DTCWT), shearlet transform, curvelet transform, contourlet transform, and non-subsampled contourlet transform (NSCT). However, the bases of MSTs are usually fixed and only weakly sparse, so they cannot adaptively and sparsely represent local image information. In contrast, sparse representation (SR) based on dictionary learning can effectively overcome these defects of MST methods and shows good fusion performance. With the development of deep learning, deep-learning-based image fusion has attracted increasing attention, and some excellent fusion methods have appeared accordingly. However, the above methods perform well only when the source images have high resolution. If the input images are of low resolution, the resolution of the fusion result will also be low, which hinders its application.
In order to improve the resolution of the fused image, a common solution is to carry out fusion and super-resolution reconstruction step by step. However, this approach tends to introduce artifacts created in the first step into the next step, thereby degrading the visual quality of the final result.
Disclosure of Invention
Aiming at the defects and shortcomings in the prior art, the invention provides a multi-source image fusion method based on discriminative dictionary learning and morphological component decomposition.
The technical scheme adopted by the invention is as follows: a multi-source image fusion method based on discriminant dictionary learning and morphological component decomposition comprises the following steps:
1) Construct the training samples for dictionary learning. Eight HR training samples are selected for training the high-resolution discriminative dictionary pair; they are then down-sampled and up-sampled back to the size of the high-resolution images by bicubic interpolation to serve as the corresponding LR training images;
2) Randomly generate an initial dictionary. A new dictionary learning method is then proposed for decomposing the input image into low-rank and sparse components. To realize image fusion and super-resolution reconstruction synchronously, a pair of HR dictionaries and a pair of LR dictionaries are jointly trained, together with a coefficient conversion matrix H between the coding coefficients of high-resolution and low-resolution image blocks, wherein the pair of HR dictionaries is D_{h,l} and D_{h,s}, and the pair of LR dictionaries is D_{l,l} and D_{l,s};
3) After the low-rank dictionary D_{l,l} and sparse dictionary D_{l,s} are obtained from step 2), the paired LR images are taken and the LR image blocks Y_l are decomposed to obtain the low-rank and sparse coding coefficients A_{l,l} and A_{l,s};
4) Using the LR coding coefficients A_{l,s} and A_{l,l} obtained in step 3) and the conversion matrix H obtained in step 2), the low-rank and sparse coding coefficients of the HR fused image are constructed by the maximum-absolute-value rule, and the high-resolution fused image blocks are finally obtained.
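As a hedged illustration of step 1), the LR training images can be generated by down-sampling the HR samples and bicubically up-sampling them back to the original size. A minimal sketch using SciPy; the down-sampling factor of 2 is an assumption, since the patent does not specify it:

```python
import numpy as np
from scipy.ndimage import zoom

def make_lr_training_image(hr, factor=2):
    """Down-sample an HR image, then bicubically up-sample it back to
    the HR size, as in step 1) (factor 2 is an assumption)."""
    lr_small = zoom(hr, 1.0 / factor, order=3)   # bicubic (cubic-spline) down-sampling
    lr = zoom(lr_small, factor, order=3)         # bicubic up-sampling back to HR size
    # Crop to guard against off-by-one sizes from rounding.
    return lr[:hr.shape[0], :hr.shape[1]]

hr = np.random.rand(64, 64)
lr = make_lr_training_image(hr)
```

The blurred `lr` keeps the HR geometry, so HR/LR patches stay spatially aligned for joint dictionary training.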
Specifically, the step 2) includes the following steps:
Step2.1 Learn dictionaries using the constructed training samples; the proposed discriminative dictionary learning model is as follows:
D_{h,l} and D_{h,s} are the low-rank dictionary and sparse dictionary of the HR image. X_h is a high-resolution training image and X_l is the corresponding low-resolution image. The low-resolution low-rank and sparse dictionaries are denoted D_{l,l} and D_{l,s}, where M × K denotes a matrix dimension of M rows and K columns, and ε_1, ε_2, ε_3 and ε_4 are constants controlling the amplitude of each atom in the different dictionaries. Z_{l,l} and Z_{l,s} are the coding coefficients of X_l over the dictionaries D_{l,l} and D_{l,s}; Z_{h,l} and Z_{h,s} are respectively the coding coefficients of X_h over the dictionaries D_{h,l} and D_{h,s}. H is the conversion matrix between Z_{l,i} and Z_{h,i} (i = l, s); ‖·‖_F is the F-norm operator; Ψ(H, Z_{l,l}, Z_{h,l}, Z_{l,s}, Z_{h,s}) and Φ(D_{l,l}, D_{h,l}, Z_{l,l}, Z_{h,l}, Z_{l,s}, Z_{h,s}) are discriminative regularization terms that ensure the discrimination ability of the learned dictionaries D_{l,l}, D_{h,l}, D_{l,s} and D_{h,s}.
For convenience of processing, the LR images are brought to the same size as the HR images by bicubic interpolation. The relationship between the coding coefficients of the LR and HR images can then be described as:
In equation (2), λ_1, λ_2 and λ_5 are regularization parameters.
Also in equation (2), the term ‖H‖_F^2 serves to avoid over-fitting, and the relational transformation term ‖Z_{h,s} − HZ_{l,s}‖_F^2 represents the relationship between the sparse-component coding coefficients, where ‖·‖_F is the F-norm operator. For an LR image and its corresponding HR image, the analogous term establishes the relationship between Z_{h,l} and Z_{l,l}. Clearly, the low-rank components have strong linear correlation among them, and the elements within the same coding-coefficient vector have similar values. Based on this fact, an all-ones matrix is introduced, and a regularization term built on it is used to characterize the low-rank-component coding coefficients.
To improve the discrimination of the learning dictionary, the following regularization terms are defined:
In equation (3), λ_3 and λ_4 are regularization parameters. ‖D_{h,l}Z_{h,l}‖_* and ‖D_{l,l}Z_{l,l}‖_* are used to ensure that the components D_{h,l}Z_{h,l} and D_{l,l}Z_{l,l} separated from the input image pair are low-rank, where ‖·‖_* is the nuclear-norm operator. After the low-rank component is obtained, the sparse component can also be obtained as X_i − D_{i,l}Z_{i,l}, where X_i represents the input source data. Therefore, the objective function of the discriminative dictionary learning can be expressed as:
Step2.2 Optimization and solution of the dictionary learning model.
The variables to be solved are D_{l,l}, D_{l,s}, D_{h,l}, D_{h,s}, Z_{h,s}, Z_{h,l}, Z_{l,s}, Z_{l,l} and H; the problem is non-convex in them jointly and difficult to solve directly. However, if one variable is solved while the others are fixed, each sub-problem is convex. Therefore, each variable in (4) is solved in turn by an alternating iterative method.
Step2.2.1 For ease of optimization, four variable matrices X_{h,s}, X_{h,l}, X_{l,s} and X_{l,l} are introduced, namely the sparse and low-rank components of the input high-resolution training image and the sparse and low-rank components of the low-resolution training image; the optimization problem in equation (4) is thereby converted into:
Step2.2.2 Update X_{h,s}, X_{h,l} and the coding-coefficient matrices Z_{h,s} and Z_{h,l}. First, fixing the other variables and updating X_{h,s}, equation (5) is converted into the following expression:
the above formula has the following closed solution form:
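The closed-form solution itself does not survive in this text. In sparse coding, ℓ1-regularized sub-problems of this type are classically solved by elementwise soft-thresholding (shrinkage); a minimal sketch of that operator, offered as an illustration rather than the patent's exact formula:

```python
import numpy as np

def soft_threshold(x, t):
    """Elementwise soft-thresholding: the proximal operator of t*||.||_1,
    i.e. the closed-form minimizer of 0.5*(z - x)^2 + t*|z| per entry."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

shrunk = soft_threshold(np.array([3.0, -0.5, 1.5]), 1.0)
```

Entries with magnitude below the threshold are set to zero, which is what produces sparse components.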
Similarly, fixing the other variables, the solution for X_{h,l} takes the following form:
equation (8) can be solved efficiently by the SVT algorithm.
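The SVT (singular value thresholding) algorithm referenced here soft-thresholds the singular values, which is the proximal operator of the nuclear norm. A minimal sketch, with an illustrative threshold τ:

```python
import numpy as np

def svt(a, tau):
    """Singular value thresholding: soft-threshold the singular values of a,
    the proximal operator of tau*||.||_* (nuclear norm)."""
    u, s, vt = np.linalg.svd(a, full_matrices=False)
    s_shrunk = np.maximum(s - tau, 0.0)   # shrink the spectrum
    return (u * s_shrunk) @ vt            # u * s_shrunk scales column j of u by s_shrunk[j]

a = np.diag([3.0, 1.0])
low_rank = svt(a, 2.0)   # singular values 3, 1 become 1, 0
```

Shrinking the spectrum reduces rank, which is exactly why SVT solves the nuclear-norm sub-problem for the low-rank component.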
After X_{h,s} and X_{h,l} are updated, the coding-coefficient matrices Z_{h,s} and Z_{h,l} are further updated:
Equation (9) is an ℓ_1-minimization problem and is solved by the Two-Step Iterative Shrinkage/Thresholding algorithm (TwIST). For equation (10), the solution is as follows:
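The Two-Step Iterative Shrinkage/Thresholding algorithm itself is not reproduced here; the simpler one-step member of the same family, ISTA, conveys the iteration used for such ℓ1 problems. A hedged sketch for min_z ||D z − x||² + lam·||z||₁, illustrative rather than the patent's exact solver:

```python
import numpy as np

def ista(D, x, lam, iters=200):
    """Iterative Shrinkage/Thresholding for min ||D z - x||^2 + lam*||z||_1."""
    L = 2.0 * np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the gradient
    t = 1.0 / L
    z = np.zeros(D.shape[1])
    for _ in range(iters):
        grad = 2.0 * D.T @ (D @ z - x)           # gradient of the data-fit term
        z = z - t * grad                          # gradient step
        z = np.sign(z) * np.maximum(np.abs(z) - lam * t, 0.0)  # shrinkage step
    return z

z = ista(np.eye(2), np.array([3.0, 0.1]), lam=1.0)
```

With D = I the iteration reaches the soft-thresholded solution soft(x, lam/2) = [2.5, 0]; TwIST accelerates the same gradient-plus-shrinkage structure with a two-step update.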
Step2.2.3 Update X_{l,s}, X_{l,l} and the coding-coefficient matrices Z_{l,s} and Z_{l,l}. First, fixing the other variables, the expressions for optimizing and solving X_{l,s} and X_{l,l} are respectively:
Further, by fixing the other variables, the update expression of Z_{l,s} and the closed-form solution of Z_{l,l} are obtained as follows:
Step2.2.4 Update the conversion matrix H. Fixing the other variable matrices, the update expression of H obtained from equation (5) is as follows:
Equation (16) is an F-norm problem; taking the partial derivative with respect to H and setting it to zero yields the following expression for H:
The above equation can be solved efficiently for H by the sylvester function of MATLAB.
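As a hedged illustration, the Python analogue of the MATLAB sylvester call is scipy.linalg.solve_sylvester, which solves AX + XB = Q; the matrices below are illustrative stand-ins, not the patent's actual coefficient matrices:

```python
import numpy as np
from scipy.linalg import solve_sylvester

# Solve A @ X + X @ B = Q for X (a Sylvester equation).
rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3)) + 3 * np.eye(3)   # diagonal shift keeps eig(A) and eig(-B) disjoint
B = rng.standard_normal((2, 2)) + 3 * np.eye(2)
Q = rng.standard_normal((3, 2))

X = solve_sylvester(A, B, Q)
residual = np.linalg.norm(A @ X + X @ B - Q)
```

The Sylvester equation has a unique solution exactly when A and −B share no eigenvalue, hence the diagonal shifts in the example.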
Step2.2.5 Update the dictionary pairs D_{h,s}, D_{h,l} and D_{l,s}, D_{l,l}. Keeping the other variables fixed, the dictionary variables are solved by the Lagrange dual method, giving the update expressions of D_{h,s} and D_{h,l} as follows:
In the same way, the closed-form expressions of D_{l,s} and D_{l,l} are as follows:
In the above four expressions, Λ_1, Λ_2, Λ_3 and Λ_4 are diagonal matrices of the corresponding optimal dual variables.
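The Lagrange-dual update itself is not reproduced in this text. A simpler stand-in that enforces the same atom-amplitude constraint ‖d_k‖² ≤ ε — a ridge least-squares dictionary update followed by projecting over-long atoms back onto the constraint — can be sketched as follows (illustrative, not the patent's solver):

```python
import numpy as np

def dictionary_update(X, Z, eps=1.0, ridge=1e-6):
    """Least-squares update of D in ||X - D Z||_F^2, then project each
    atom (column) onto the ball ||d||^2 <= eps."""
    D = X @ Z.T @ np.linalg.inv(Z @ Z.T + ridge * np.eye(Z.shape[0]))
    norms = np.linalg.norm(D, axis=0)
    over = norms > np.sqrt(eps)
    D[:, over] *= np.sqrt(eps) / norms[over]   # shrink violating atoms back to the constraint
    return D

rng = np.random.default_rng(1)
Z = rng.standard_normal((4, 50))               # coding coefficients (illustrative)
D_true = rng.standard_normal((8, 4))
D = dictionary_update(D_true @ Z, Z, eps=4.0)
```

The Lagrange-dual method of the patent solves the same constrained least-squares problem exactly via the dual variables Λ; the projection above is only a cheap approximation of that constraint handling.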
Specifically, the steps of step 3) are as follows:
Following step 2), in order to ensure successful separation of the different components, a new image decomposition model is provided that decomposes the source images into a low-rank part and a sparse part.
Step3.1 After the low-rank dictionary D_{l,l} and sparse dictionary D_{l,s} are obtained, the paired LR images are taken and N image-block data are collected with a sliding window of size (n × n); each image block is used as a column vector, and the column vectors then form a matrix. The LR image blocks Y_l are decomposed to obtain the low-rank and sparse coding coefficients A_{l,l} and A_{l,s}. The low-rank and sparse decomposition model is as follows:
In the model, A_{l,l}(i, j) is the (i, j)-th value of A_{l,l}. If each row of A_{l,l} is taken as a group, minimizing ‖A_{l,l}‖_{2,1} will make the values in each row of A_{l,l} the same. From the inequality R(AB) ≤ min{R(A), R(B)} (R denotes the rank of a matrix), it is known that minimizing ‖A_{l,l}‖_{2,1} can further ensure that D_{l,l}A_{l,l} is low-rank, while avoiding the defects caused by directly minimizing ‖D_{l,l}A_{l,l}‖_*.
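The sliding-window collection of image blocks described in Step3.1 — each n × n block vectorized as one column of the data matrix — can be sketched as follows; the stride of 1 pixel is an assumption:

```python
import numpy as np

def image_to_patch_matrix(img, n):
    """Collect all n x n blocks of img with a stride-1 sliding window and
    stack each vectorized block as one column (im2col-style)."""
    h, w = img.shape
    cols = []
    for i in range(h - n + 1):
        for j in range(w - n + 1):
            cols.append(img[i:i + n, j:j + n].reshape(-1))
    return np.stack(cols, axis=1)   # shape (n*n, number of blocks)

Y = image_to_patch_matrix(np.arange(16.0).reshape(4, 4), n=3)
```

A 4 × 4 image with n = 3 yields four overlapping blocks, giving a 9 × 4 matrix whose columns are the vectorized blocks Y_l operates on.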
Step3.2 The decomposition model can likewise be solved by the alternating iterative method, solving the image decomposition model (22) and updating A_{l,s} and A_{l,l}.
Step3.2.1 For ease of optimization, the variables Y_{l,l} and Y_{l,s} are introduced, i.e., the low-rank component and the sparse component of the LR image blocks; equation (22) can then be transformed into:
Step3.2.2 Update Y_{l,s} and Y_{l,l}: first fixing Y_{l,l}, A_{l,l} and A_{l,s}, the update expression of Y_{l,s} is as follows:
Similarly, the TwIST algorithm is used to solve the ℓ_1-norm optimization for Y_{l,s}.
Further, fixing Y_{l,s}, A_{l,l} and A_{l,s}, the update expression of Y_{l,l} is as follows:
Step3.2.3 Optimize the sparse coding coefficients A_{l,s} and A_{l,l}. The update expressions are as follows:
Equation (26) is an ℓ_1-norm optimization problem and is solved by the TwIST algorithm described above. Equation (27) is an ℓ_{2,1}-norm optimization problem.
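For ℓ2,1 sub-problems of this type, the standard closed-form proximal step shrinks each row of the coefficient matrix by its ℓ2 norm; a hedged sketch (the patent's exact update for equation (27) is not reproduced here):

```python
import numpy as np

def prox_l21(A, t):
    """Row-wise shrinkage: proximal operator of t*||.||_{2,1},
    scaling row i by max(1 - t/||A_i||_2, 0)."""
    norms = np.linalg.norm(A, axis=1, keepdims=True)
    scale = np.maximum(1.0 - t / np.maximum(norms, 1e-12), 0.0)
    return scale * A

A = np.array([[3.0, 4.0], [0.1, 0.0]])
A_shrunk = prox_l21(A, 1.0)   # row norms 5 and 0.1 give scales 0.8 and 0
```

Rows with small norm are zeroed as whole groups, which is the group-sparsity effect the ‖A_{l,l}‖_{2,1} penalty is chosen for.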
Specifically, the steps of step 4) are as follows:
Assume N LR images to be fused, together with the sparse dictionary D_{l,s}, low-rank dictionary D_{l,l} and conversion matrix H learned in step 2). The decomposition model in step 3) decomposes the images to be fused to obtain the coding coefficients A_{l,l} and A_{l,s} of the low-rank and sparse components, which represent the low-rank and sparse coding coefficients of the LR input images. The low-rank and sparse fusion coding coefficients of the HR image are then constructed as follows:
In equation (28), m represents the number of input images; from the fused coding coefficients, the high-resolution fused image blocks can be obtained.
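The maximum-absolute-value fusion rule — taking, at each coefficient position, the value of largest magnitude among the input images — can be sketched as:

```python
import numpy as np

def fuse_max_abs(coeffs):
    """Entrywise maximum-absolute-value fusion across the input
    coefficient matrices (the max-abs rule of the fusion step)."""
    stack = np.stack(coeffs, axis=0)            # shape (m, rows, cols)
    idx = np.abs(stack).argmax(axis=0)          # winning input index per entry
    return np.take_along_axis(stack, idx[None, ...], axis=0)[0]

a1 = np.array([[1.0, -5.0]])
a2 = np.array([[-3.0, 2.0]])
fused = fuse_max_abs([a1, a2])
```

The rule keeps the sign of the winning coefficient, so salient (large-magnitude) structure from either source survives in the fused coefficients.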
The invention has the beneficial effects that:
(1) The invention provides an effective discriminative dictionary learning model, which can learn a conversion matrix between high- and low-resolution coding coefficients and two pairs of discriminative dictionaries that jointly represent the low-rank and sparse components of low-resolution and high-resolution image pairs.
(2) The conversion matrix in the invention reveals the relationship between the coding coefficients of a low-resolution image and its corresponding high-resolution version, and can improve the resolution of the fused low-resolution images, making the image details clearer.
(3) The invention realizes fusion and high-resolution reconstruction of low-resolution images simultaneously, and can obtain fused images with better visual and objective evaluation performance, thereby providing a more comprehensive and accurate image description for the identification of events and objects.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is the HR source images, in which the first row is a pair of infrared and visible light images of "street" (240 × 320); the second row is the MR-T1/MR-T2 medical image pair (256 × 256); the third row is the "bookshelf" and "clock" multi-focus image pair (240 × 320);
FIG. 3 is an LR source image, in which (a) is an infrared and visible light image pair (120 × 160); (b) for a medical image pair (128 x 128); (c) for a pair of multifocal images (120 × 160);
FIG. 4 shows the fusion and high-resolution results (240 × 320) of the infrared and visible images of "street", in which (a) to (d) respectively represent the fusion and super-resolution reconstruction result images of Ours, Li's, Zhu's (bicubic) and Zhu's (SRSR);
FIG. 5 shows the fusion and high-resolution results (256 × 256) of the MR-T1/MR-T2 medical images, in which (a) to (d) respectively show the fusion and super-resolution reconstruction result images of Ours, Li's, Zhu's (bicubic) and Zhu's (SRSR);
FIG. 6 shows the fusion and high-resolution results (240 × 320) of the multi-focus image of "bookshelf", in which (a) to (d) respectively show the fusion and super-resolution reconstruction result images of Ours, Li's, Zhu's (bicubic) and Zhu's (SRSR).
Detailed Description
The invention is described in further detail below with reference to the figures and the specific embodiments.
Example 1: as shown in figs. 1 to 6, a multi-source image fusion method based on discriminative dictionary learning and morphological component decomposition is carried out according to steps 1) to 4), including sub-steps Step2.1 to Step2.2.5 and Step3.1 to Step3.2.3, exactly as described above.
The present invention will be described in detail with reference to specific examples.
In the dictionary learning part, 8 training samples are selected as HR training images; they are down-sampled and then up-sampled back to the size of the high-resolution images by bicubic interpolation to serve as the corresponding LR training images. From these training samples, the required low-rank dictionary, sparse dictionary and conversion matrix are obtained by iterative updating according to the proposed dictionary learning algorithm. For the test images, a pair of HR infrared and visible images (first row of FIG. 2), a pair of HR medical images (second row of FIG. 2) and a pair of HR multi-focus images (last row of FIG. 2) are selected; the low-resolution versions of these source images are shown in FIG. 3. Dictionary learning and image decomposition involve six parameters in total: the five regularization parameters λ_1, λ_2, λ_3, λ_4 and λ_5, and the maximum iteration number K. According to experimental experience, λ_i (i = 1, 2, …, 5) are set to 1, 0.01, 1.5, 0.01 and 0.00001, and K = 10. In order to verify the effectiveness of the method, experiments are carried out on medical images, infrared and visible light images, and multi-focus images respectively.
In order to verify the superiority of the invention, the proposed method is compared with the latest image fusion and super-resolution methods. Li's method can realize image fusion and super-resolution simultaneously, so it is one of the comparison methods. In addition, because most fusion methods cannot do this jointly, fusion results produced by an excellent fusion algorithm such as Zhu's method are super-resolved afterwards; for that super-resolution step, high-resolution results are constructed using bicubic interpolation and sparse-representation-based super-resolution (SRSR). Accordingly, the super-resolution results of Zhu's method are compared with the results obtained by the invention.
In order to evaluate the quality of the fusion results produced by different methods objectively and fairly, besides visual comparison, four objective evaluation indexes are adopted to measure the quality of the fusion results. These metrics are the spatial-frequency (SF) based metric Q_SF, the quality-aware clustering metric Q_QAC, the image information entropy Q_ENT, and the image mean gradient Q_GD. Q_SF uses the ratio of SF errors to measure the quality of the fused image. Specifically, if the value of Q_SF is below zero, active image information is lost in the fusion result; if Q_SF is greater than 0, and in the absence of any artifacts and noise, the details of the source image are enhanced. For the other three indexes, the higher the objective evaluation value, the better the fusion quality.
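As a hedged illustration, the commonly used definitions of spatial frequency (SF) and mean gradient, on which Q_SF and Q_GD are based, can be computed as follows (the exact ratio-of-SF-errors form of Q_SF used here is not reproduced):

```python
import numpy as np

def spatial_frequency(img):
    """SF = sqrt(RF^2 + CF^2) from row/column first-difference energies."""
    rf = np.sqrt(np.mean((img[:, 1:] - img[:, :-1]) ** 2))  # row frequency
    cf = np.sqrt(np.mean((img[1:, :] - img[:-1, :]) ** 2))  # column frequency
    return float(np.sqrt(rf ** 2 + cf ** 2))

def mean_gradient(img):
    """Average gradient magnitude over the interior of the image."""
    dx = img[:, 1:] - img[:, :-1]
    dy = img[1:, :] - img[:-1, :]
    return float(np.mean(np.sqrt((dx[:-1, :] ** 2 + dy[:, :-1] ** 2) / 2.0)))

sf_flat = spatial_frequency(np.ones((8, 8)))   # a constant image has no detail
ramp = np.tile(np.arange(4.0), (4, 1))
sf_ramp = spatial_frequency(ramp)
```

Both measures grow with edge and texture content, which is why higher values are read as richer fused detail.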
Experiment 1: fusion and super-resolution reconstruction of infrared and visible light images
The first set of experiments was performed on the low resolution infrared and visible images of fig. 3(a) as test images. The fusion and super-resolution reconstruction results of the different methods are shown in fig. 4.
"Ours" in fig. 4 indicates the fusion and hyper-resolution reconstruction effect of the proposed method. As can be seen from fig. 4 and table 1 objective evaluation index data, the fusion results of Ours are superior to other methods in terms of visual perception and objective evaluation. The digital detail in the upper left corner of fig. 4 shows that the image after fusion reconstruction by the method of the present invention is clearer and brighter. The visual result of the method of the invention at the details proves to be better, i.e. the proposed method is very efficient and rational. In addition, three objective evaluation results of different methods are shown in table 1. From these data, it can be seen that objective evaluation leads to a conclusion that is consistent with subjective evaluation, which further verifies the superiority of the method of the invention.
TABLE 1 quantitative evaluation of different fusion and super-resolution reconstruction methods for the "Street" infrared and visible light images
Experiment 2: fusion and super-resolution reconstruction of medical images
In the second experiment, the low-resolution medical image pair of fig. 3(b) is fused and super-resolution reconstructed. From these source images it can be seen that the image information of the same scene acquired by different devices is complementary. The purpose of medical image fusion and super-resolution is to extract the complementary information from the low-resolution source images, inject it into the fused image, and improve the resolution of the fusion result. Fig. 5 shows the fusion and super-resolution reconstruction results of the different methods. As can be seen from fig. 5(a) to 5(d), the proposed Ours is visually clearly superior to the method of Li and to the two-step methods Zhu (bicubic) and Zhu (SRSR). In addition, it can be seen from Table 2 that the proposed method is slightly lower than the others only on the index Q_ENT, and outperforms the Li method and the super-resolved Zhu results on all other evaluation indexes.
TABLE 2 quantitative evaluation of the Performance of different fusion and super-resolution reconstruction methods for MR-T1/MR-T2 medical images
Experiment 3: fusion and super-resolution reconstruction of multi-focus images
In the third experiment, the image pair of fig. 3(c) is tested. Each multi-focus image is captured by the same sensor modality but with a different focus area; multi-focus image fusion aims to merge them into a single image in which all objects are in focus. The fusion and super-resolution results of the different fusion methods are given in fig. 6. Comparing fig. 6(a) with fig. 6(c)-(d), it can be found that both the alarm clock in the near field and the bookshelf in the far field have higher resolution in the result image of the proposed method, which proves that the method sufficiently fuses the images with different focus areas; the proposed Ours is also superior to the other methods in image contrast. As can be seen in Table 3, Q_SF of the proposed method is also less than 0, but it still loses less information than the other three methods. The objective evaluation data in Table 3 further demonstrate the superiority of the proposed method over the other methods.
TABLE 3 quantitative evaluation of the performance of different fusion and super-resolution reconstruction methods for the "Bookshelf" multi-focus image
While the present invention has been described in detail with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, and various changes can be made without departing from the spirit and scope of the present invention.
Claims (4)
1. A multi-source image fusion method based on discriminant dictionary learning and morphological component decomposition is characterized in that: the method comprises the following specific steps:
1) constructing training samples for dictionary learning: selecting 8 HR training samples to train the high-resolution discriminant dictionary pair, then down-sampling each image and up-sampling it by bicubic interpolation back to the size of the high-resolution image, to serve as the corresponding LR training image;
2) randomly generating an initial dictionary: a new dictionary learning method is proposed that decomposes an input image into low-rank and sparse components and jointly trains a pair of HR dictionaries, a pair of LR dictionaries and a coefficient conversion matrix H between the coding coefficients of high-resolution and low-resolution image blocks, for synchronously realizing image fusion and super-resolution reconstruction, wherein the pair of HR dictionaries is D_{h,l} and D_{h,s} and the pair of LR dictionaries is D_{l,l} and D_{l,s};
3) after obtaining the low-rank dictionary D_{l,l} and the sparse dictionary D_{l,s} from step 2), taking the LR image pair and decomposing the LR image blocks Y_l to obtain the low-rank and sparse coding coefficients A_{l,l} and A_{l,s};
2. The multi-source image fusion method based on discriminative dictionary learning and morphological component decomposition according to claim 1, wherein the step 2) comprises the following steps:
Step2.1 utilizes the constructed training samples to learn the dictionaries; the proposed discriminant dictionary learning model is as follows:
where D_{h,l} and D_{h,s} are the low-rank dictionary and sparse dictionary of the HR images, X_h is a high-resolution training image and X_l is the corresponding low-resolution image; D_{l,l} and D_{l,s} denote the low-resolution low-rank and sparse dictionaries, where M × K represents a matrix of M rows and K columns; ε1, ε2, ε3 and ε4 are constants controlling the amplitude of each atom in the different dictionaries; Z_{l,l} and Z_{l,s} are the coding coefficients of X_l over the dictionaries D_{l,l} and D_{l,s}, and Z_{h,l} and Z_{h,s} are respectively the coding coefficients of X_h over the dictionaries D_{h,l} and D_{h,s}; H is the conversion matrix between Z_{l,i} and Z_{h,i} (i = l, s); ||·||_F is the F-norm operator; Ψ(H, Z_{l,l}, Z_{h,l}, Z_{l,s}, Z_{h,s}) and Φ(D_{l,l}, D_{h,l}, Z_{l,l}, Z_{h,l}, Z_{l,s}, Z_{h,s}) are discriminant regularization terms that ensure the discrimination ability of the learned dictionaries D_{l,l}, D_{h,l}, D_{l,s} and D_{h,s};
the LR image is set to the same size as the HR image by the bicubic interpolation process, and then the relationship between the encoding coefficients of the LR image and the HR image is described as:
in the formula (2), λ1, λ2 and λ5 are regularization parameters;
in the formula (2), one term is used to avoid over-fitting, and a relational transform term represents the relationship between the sparse-component coding coefficients, where ||·||_F is the F-norm operator; in an LR image and its corresponding HR image, the relationship between the low-rank components Z_{h,l} and Z_{l,l} exhibits obviously strong linear correlation, and the elements within the same coding-coefficient vector take similar values; based on this fact, an all-ones matrix is introduced and a regularization term built on it is used to characterize the low-rank component coding coefficients;
the following regularization term is defined:
in formula (3), λ3 and λ4 are regularization parameters; the terms ||D_{h,l} Z_{h,l}||_* and ||D_{l,l} Z_{l,l}||_* ensure that the components D_{h,l} Z_{h,l} and D_{l,l} Z_{l,l} separated from the input image pair are of low rank, where ||·||_* is the nuclear-norm operator; after the low-rank component is obtained, the sparse component is obtained through X_i − D_{i,l} Z_{i,l}, where X_i represents the input source data; therefore, the objective function of the discriminant dictionary learning is expressed as:
Step2.2 optimized solution of the dictionary learning model
The variables to be solved, D_{l,l}, D_{l,s}, D_{h,l}, D_{h,s}, Z_{h,s}, Z_{h,l}, Z_{l,s}, Z_{l,l} and H, make the problem non-convex and difficult to solve directly; however, if one variable is solved while the other variables are fixed, each sub-problem is convex, so each variable in (4) is solved in turn by an alternating iterative method;
Step2.2.1 introduces four variable matrices X_{h,s}, X_{h,l}, X_{l,s} and X_{l,l}, namely the sparse component and low-rank component of the input high-resolution training image and the sparse component and low-rank component of the low-resolution training image; the optimization problem in equation (4) is thereby converted into:
Step2.2.2 updates X_{h,s}, X_{h,l} and the coding coefficient matrices Z_{h,s} and Z_{h,l}: first, fixing the other variables and updating X_{h,s}, equation (5) is converted into the following expression:
the above formula has the following closed-form solution:
similarly, fixing the other variables, the solution form of X_{h,l} is as follows:
formula (8) can be solved effectively by the singular value thresholding (SVT) algorithm;
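The SVT step can be sketched in a few lines: it soft-thresholds the singular values of its input, which is the closed-form minimiser of a nuclear-norm-plus-quadratic subproblem. This is an illustrative sketch of the generic operator, not the patent's exact subproblem for X_{h,l}.

```python
import numpy as np

def svt(Y, tau):
    """Singular-value thresholding: closed-form minimiser of
    tau*||X||_* + 0.5*||X - Y||_F^2.  Shrinks each singular value of Y
    by tau and discards those that fall below zero, lowering the rank."""
    U, s, Vt = np.linalg.svd(Y, full_matrices=False)
    s_shrunk = np.maximum(s - tau, 0.0)
    return U @ np.diag(s_shrunk) @ Vt

# Shrinking diag(3, 1) with tau = 1.5 keeps only the dominant direction.
X = svt(np.diag([3.0, 1.0]), 1.5)
```

Because smaller singular values are zeroed outright, the output is genuinely low-rank rather than merely small in norm, which is why SVT is the standard tool for the low-rank component updates.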
after X_{h,s} and X_{h,l} are updated, the coding coefficient matrices Z_{h,s} and Z_{h,l} are further updated:
formula (9) is an l_1-minimization problem and is solved by the two-step iterative shrinkage/thresholding (TwIST) algorithm; for formula (10), the solving result is as follows:
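The shrinkage core of such l_1 solvers can be sketched as follows. Plain ISTA is shown as a one-step stand-in for TwIST, which adds a momentum-like combination of the two previous iterates but shares the same soft-thresholding step; the dictionary and data below are illustrative, not the patent's matrices.

```python
import numpy as np

def soft_threshold(x, t):
    """Proximal operator of t*||.||_1 (elementwise shrinkage)."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def ista(D, X, lam, n_iter=200):
    """ISTA for min_Z 0.5*||X - D Z||_F^2 + lam*||Z||_1: alternate a
    gradient step on the quadratic term with soft-thresholding."""
    L = np.linalg.norm(D, 2) ** 2                 # Lipschitz constant of the gradient
    Z = np.zeros((D.shape[1], X.shape[1]))
    for _ in range(n_iter):
        grad = D.T @ (D @ Z - X)
        Z = soft_threshold(Z - grad / L, lam / L)
    return Z

# With an identity dictionary the solution is just shrinkage of X itself.
Z = ista(np.eye(2), np.array([[3.0], [0.05]]), lam=0.1)
```

Small coefficients are driven exactly to zero, which is the sparsity that the l_1 terms in formulas (9) and (26) are designed to produce.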
Step2.2.3 updates X_{l,s}, X_{l,l} and the coding coefficient matrices Z_{l,s} and Z_{l,l}: first, fixing the other variables, the optimization expressions for X_{l,s} and X_{l,l} are solved respectively as follows:
further, by fixing the other variables, the update expression of Z_{l,s} and the closed-form solution of Z_{l,l} are obtained as:
Step2.2.4 updates the transformation matrix H: the other variable matrices are fixed, and the update expression of H obtained from formula (5) is as follows:
formula (16) is an F-norm problem; by taking the partial derivative with respect to H and setting it to zero, the following expression for H is obtained:
according to this formula, H can be solved effectively through the sylvester function of MATLAB;
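Assuming the stationarity condition takes the Sylvester form A H + H B = Q, as the use of MATLAB's sylvester() implies, the equivalent in Python is scipy.linalg.solve_sylvester. The matrices below are random stand-ins, not the actual coefficient matrices of formula (17); the diagonal shifts merely keep the spectra of A and −B disjoint so a unique solution exists.

```python
import numpy as np
from scipy.linalg import solve_sylvester

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 4)) + 4 * np.eye(4)   # placeholder left coefficient
B = rng.standard_normal((3, 3)) - 4 * np.eye(3)   # placeholder right coefficient
Q = rng.standard_normal((4, 3))                   # placeholder right-hand side

# Solves A @ H + H @ B = Q directly, without forming a Kronecker system.
H = solve_sylvester(A, B, Q)
```

This avoids vectorising H into a large linear system, which is the same reason the patent relies on a dedicated Sylvester solver rather than a generic one.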
Step2.2.5 updates the dictionary pairs D_{h,s}, D_{h,l} and D_{l,s}, D_{l,l}: keeping the other variables fixed, the dictionary variables are solved by the Lagrange dual method, and the update expressions of D_{h,s} and D_{h,l} are as follows:
in the same way, the closed-form expressions of D_{l,s} and D_{l,l} are as follows:
in the above four expressions, Λ1, Λ2, Λ3 and Λ4 are the diagonal matrices of the corresponding optimal dual variables.
3. The multi-source image fusion method based on the discriminant dictionary learning and morphological component decomposition as claimed in claim 2, wherein the step 3) comprises the steps of:
Step3.1: after the low-rank dictionary D_{l,l} and sparse dictionary D_{l,s} are obtained, the paired LR images are taken, N image blocks are collected by a sliding window of size (n × n), each image block is reshaped into a column vector, and the column vectors are assembled into a matrix; the LR image-block matrix Y_l is then decomposed to obtain the low-rank and sparse coding coefficients A_{l,l} and A_{l,s}; the low-rank and sparse decomposition model is as follows:
in the formula, A_{l,l}(i, j) is the (i, j)-th value of A_{l,l}; taking each row of A_{l,l} as a group, minimizing ||A_{l,l}||_{2,1} makes the rows of A_{l,l} sparse; from the inequality R(AB) ≤ min{R(A), R(B)}, where R denotes the rank of a matrix, it can be seen that minimizing ||A_{l,l}||_{2,1} can further ensure the low rank of D_{l,l} A_{l,l}, while avoiding some defects caused by directly minimizing ||D_{l,l} A_{l,l}||_*;
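The sliding-window collection of image blocks described in Step3.1 is the classic im2col operation and can be sketched as follows; the patch size and stride are illustrative.

```python
import numpy as np

def im2col(img, n, stride=1):
    """Collect n-by-n patches with a sliding window and stack each patch
    as a column vector, producing the (n*n) x N data matrix Y_l that the
    decomposition model operates on."""
    H, W = img.shape
    cols = []
    for i in range(0, H - n + 1, stride):
        for j in range(0, W - n + 1, stride):
            cols.append(img[i:i + n, j:j + n].reshape(-1))
    return np.stack(cols, axis=1)

img = np.arange(36, dtype=float).reshape(6, 6)
patches = im2col(img, n=4, stride=2)   # 4 overlapping patches, 16 pixels each
```

Working on overlapping patches rather than whole images keeps the dictionaries small and lets the reconstruction average over overlaps.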
the solution of Step3.2 decomposition model is carried out by adopting an alternative iteration method for the image decomposition model (22) and updating Al,sAnd Al,l;
Step3.2.1 introduces the variables Y_{l,l} and Y_{l,s}, i.e. the low-rank component and the sparse component of the LR image blocks, so that equation (22) can be transformed into:
Step3.2.2 updates Y_{l,s} and Y_{l,l}: first Y_{l,l}, A_{l,l} and A_{l,s} are fixed, and the update expression of Y_{l,s} is as follows:
similarly, the TwIST algorithm is used to solve the l_1-norm optimization problem for Y_{l,s};
further, Y_{l,s}, A_{l,l} and A_{l,s} are fixed, and the update expression of Y_{l,l} is as follows:
Step3.2.3 optimizes the sparse coding coefficients A_{l,s} and A_{l,l}; the update expressions are as follows:
formula (26) is an l_1-norm optimization problem solved by the TwIST algorithm, and formula (27) is an l_{2,1}-norm optimization problem.
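The l_{2,1} subproblem has a well-known row-wise closed form: each row is shrunk toward zero in Euclidean norm, zeroing out whole rows at once. The sketch below shows the standard proximal operator, assuming (as formula (27) suggests) that the l_{2,1} norm is taken over rows; it is illustrative, not the patent's exact update.

```python
import numpy as np

def prox_l21(B, t):
    """Row-wise proximal operator of t*||.||_{2,1}: shrink each row of B
    by t in l2 norm, producing the row sparsity that minimising
    ||A_{l,l}||_{2,1} enforces."""
    norms = np.linalg.norm(B, axis=1, keepdims=True)
    scale = np.maximum(1.0 - t / np.maximum(norms, 1e-12), 0.0)
    return scale * B

# A strong row survives (scaled down); a weak row is zeroed entirely.
A = prox_l21(np.array([[3.0, 4.0], [0.1, 0.0]]), 1.0)
```

Zeroed rows mean unused dictionary atoms, which is exactly how the l_{2,1} penalty keeps D_{l,l} A_{l,l} low-rank without invoking the nuclear norm.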
4. The multi-source image fusion method based on the discriminant dictionary learning and morphological component decomposition as claimed in claim 3, wherein the step 4) comprises the steps of:
assuming there are N LR images to be fused, the low-rank dictionary D_{l,l}, the sparse dictionary D_{l,s} and the transformation matrix H are obtained through step 2); the decomposition model in step 3) is used to decompose the images to be fused, obtaining the coding coefficients A_{l,l} and A_{l,s} of the low-rank and sparse components; letting the low-rank and sparse coding coefficients of each LR input image be given, the HR low-rank and sparse fusion coding coefficients are constructed as follows:
in formula (28), m represents the number of input images; from the fused coefficients and the HR dictionary pair, a high-resolution fused image block can be obtained.
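A common convention for fusing the coding coefficients of m input images, used here as a stand-in for formula (28) whose exact weighting is not reproduced in this text, is to pick, per patch, the coefficient vector with the largest l_1 norm (the most "active" patch):

```python
import numpy as np

def fuse_coefficients(coeff_list):
    """Patch-wise choose-max-l1 fusion of m coefficient matrices, each of
    shape (K, N): for every patch (column), keep the coefficient vector
    of whichever input image is most active there."""
    stacked = np.stack(coeff_list)            # (m, K, N)
    l1 = np.abs(stacked).sum(axis=1)          # (m, N): activity per patch
    pick = np.argmax(l1, axis=0)              # winning input per patch
    n = stacked.shape[2]
    return stacked[pick, :, np.arange(n)].T   # (K, N) fused coefficients

F = fuse_coefficients([np.array([[5.0, 0.0], [0.0, 0.1]]),
                       np.array([[1.0, 0.0], [0.0, 2.0]])])
```

The fused coefficients would then be rendered through the HR dictionaries to produce the high-resolution fused image blocks, in line with the reconstruction described above.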
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010425926.0A CN111784572B (en) | 2020-05-19 | 2020-05-19 | Image fusion and super-resolution joint implementation method based on discriminant dictionary learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111784572A CN111784572A (en) | 2020-10-16 |
CN111784572B true CN111784572B (en) | 2022-06-28 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107563968A (en) * | 2017-07-26 | 2018-01-09 | 昆明理工大学 | A kind of method based on the group medicine image co-registration denoising for differentiating dictionary learning |
CN107977949A (en) * | 2017-07-26 | 2018-05-01 | 昆明理工大学 | A kind of method improved based on projection dictionary to the Medical image fusion quality of study |
CN108985320A (en) * | 2018-05-31 | 2018-12-11 | 昆明理工大学 | Based on the multisource image anastomosing method for differentiating that dictionary learning and anatomic element decompose |
CN109410157A (en) * | 2018-06-19 | 2019-03-01 | 昆明理工大学 | The image interfusion method with PCNN is decomposed based on low-rank sparse |
CN110706156A (en) * | 2019-09-16 | 2020-01-17 | 昆明理工大学 | Image fusion and super-resolution reconstruction combined implementation method based on multi-component analysis and residual compensation |
Non-Patent Citations (4)
Title |
---|
"Analysis-synthesis dictionary pair learning and patch saliency measure for image fusion";Yafei Zhang等;《Signal Processing》;20200229;第167卷;1-13 * |
"Super-resolution images fusion via compressed sensing and low-rank matrix decomposition";Kan Ren等;《Infrared Physics & Technology》;20150131;第68卷;61-68 * |
"基于判别低秩稀疏字典学习的医学图像融合质量改善算法研究";和晓歌;《中国优秀博硕士学位论文全文数据库(硕士) 信息科技辑》;20190115;I138-3746 * |
"基于判别字典学习与形态成分分解的多源图像融合";王一棠 等;《光学技术》;20190131;第45卷(第1期);63-69 * |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||