CN111080566A - Visible light and infrared image fusion method based on structural group double-sparse learning - Google Patents
Visible light and infrared image fusion method based on structural group double-sparse learning
- Publication number
- CN111080566A CN111080566A CN201911270444.6A CN201911270444A CN111080566A CN 111080566 A CN111080566 A CN 111080566A CN 201911270444 A CN201911270444 A CN 201911270444A CN 111080566 A CN111080566 A CN 111080566A
- Authority
- CN
- China
- Prior art keywords
- image
- sparse
- dictionary
- visible light
- learning
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/50—Image enhancement or restoration using two or more images, e.g. averaging or subtraction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10048—Infrared image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20212—Image combination
- G06T2207/20221—Image fusion; Image merging
Landscapes
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Image Processing (AREA)
Abstract
The invention relates to a visible light and infrared image fusion method based on structural group double-sparse learning. The method comprises the following steps: (1) performing sliding window processing on the input visible light and infrared images, searching for the similar blocks of each original image block, performing group vectorization, and establishing an image similar structure group matrix; (2) taking the image similar structure group matrix as a training sample, forming a base dictionary from a Kronecker product of shear wavelets, obtaining a sparse dictionary through online learning, and performing linear reconstruction on the base dictionary and the sparse dictionary to obtain the final double-sparse dictionary; (3) combining the double-sparse dictionary, performing group sparse solution on the image similar structure groups with SOMP to obtain group sparse coefficients, and obtaining the final fused image through a maximum-absolute-value fusion rule and image reconstruction. The method addresses the low image fusion quality caused by existing sparse fusion algorithms ignoring the correlation among image blocks and by poor dictionary adaptability, and can be applied in fields such as remote sensing detection, medical diagnosis, intelligent driving, and security monitoring.
Description
Technical Field
The invention relates to an image fusion method in the field of image processing, in particular to a visible light and infrared image fusion method based on structural group double sparse learning.
Background
Visible light and infrared imaging technologies have important applications in remote sensing, medical diagnosis, intelligent driving, security monitoring, and related fields. A visible light sensor describes scene information through reflected-light imaging and offers high spatial resolution, but is easily affected by illumination conditions and weather changes; an infrared sensor captures the radiation characteristics of target and background through thermal imaging, but lacks the structural features and texture information of the target. The two modalities exploit different physical characteristics of the target and are strongly complementary: only by fusing the two types of images can their different imaging advantages be combined, information loss be reduced, and target recognition and human observation be supported, meeting practical demands. Image fusion is therefore an important prerequisite for improving the detection and recognition performance of visible light and infrared imaging.
The key technology for realizing the fusion of the visible light image and the infrared image is to integrate the significant characteristics between the two images into one image so as to exert the integrated advantages of the visible light image and the infrared image. The multi-scale transformation fusion method is to approximate the salient feature information of the image by using a given mathematical model, but because the salient feature types of the image are complicated and changeable, the multi-scale transformation cannot extract the salient features of all types of the image. In order to improve the multi-scale transformation fusion effect, a redundant dictionary is constructed through online learning from the perspective of image signal sparsity, image signals are represented on the redundant dictionary through sparse representation, and the significant features of the images are described by using representation coefficients and atoms corresponding to the redundant dictionary. At present, the traditional sparse representation fusion model has two problems: firstly, in the process of dictionary learning and sparse coding, each image block is considered independently, and the correlation among the blocks is ignored, so that the sparse coding coefficient is inaccurate; secondly, the respective advantages of the analysis dictionary and the learning dictionary are not combined, and the adaptability of the dictionary is not strong.
Research shows that non-local similarity is an important property of images: many similar structures (such as detail and texture information) exist at different positions in an image, and exploiting the information they contain improves image processing, as demonstrated in image denoising, compressed sensing, super-resolution, and related fields. Visible light images in fact have rich repetitive structures and a large amount of redundant information; image blocks reflecting local geometric structure appear repeatedly at different positions, exhibiting non-local structural similarity. In an infrared image, most of the content is background area whose gray scale changes slowly, so small background image blocks are strongly correlated and show obvious non-local structural similarity. Therefore, by exploiting the non-local similarity of the images, similar structure groups of the visible and infrared images can be established, building the correlation among image blocks and improving the accuracy of sparse coding.
At present, sparse representation dictionaries fall into two categories: analytic dictionaries and learning dictionaries. An analytic dictionary builds a formulaic mathematical model for the data; it is therefore highly structured and allows fast numerical computation, but adapts poorly to the data. A learning dictionary is learned from training samples and is more adaptive, but its learning model is complex. Research shows that organically combining an analytic dictionary with a learning dictionary unites the advantages of both, improving dictionary adaptability while reducing model complexity, which is an urgent need for the development of sparse representation fusion.
In summary, there is an urgent need for an image fusion method that can effectively establish the correlation of image blocks, improve the accuracy of sparse coding, enhance the applicability of redundant dictionaries, reduce the complexity of dictionary learning models, and further effectively improve the fusion effect of visible light and infrared images.
Disclosure of Invention
The invention provides a visible light and infrared image fusion method based on structural group double-sparse learning, and aims to solve the problem that the existing sparse representation fusion algorithm ignores the correlation among blocks and is poor in dictionary learning adaptability, so that the fusion quality of visible light and infrared images is poor.
The invention is realized by adopting the following technical scheme: a visible light and infrared image fusion method based on structural group double sparse learning comprises the following steps:
s1: performing sliding window processing on the input visible light and infrared images, searching for similar image blocks of the original image blocks, performing group vectorization on the original image blocks and the similar image blocks, and establishing an image similar structure group matrix;
s2: constructing a double-sparse learning model, forming a base dictionary by using a Kronecker product of a shear wavelet, obtaining a sparse dictionary through online learning, and performing linear reconstruction on the base dictionary and the sparse dictionary to obtain a final double-sparse dictionary;
s3: and combining the double sparse dictionaries, carrying out sparse solution on the image similar structure group by adopting the SOMP to obtain a group sparse coefficient, and obtaining a final fusion image by adopting a large fusion rule and image reconstruction.
The above-mentioned structure group matrix construction process of the visible light and infrared image is as follows: the method comprises the steps of carrying out image blocking on an input image through a sliding window, using Euclidean distance as a criterion, searching for similar image blocks of an original image block, enabling the original image block and the similar image blocks to form similar structure groups, arranging the image blocks in each structure similar group according to a column vector sequence, connecting the image block vectors end to end, and obtaining similar structure group matrixes of the visible light and the infrared image respectively.
The process for constructing the double sparse dictionaries facing the similar structure group matrix comprises the following steps: the method comprises the steps of taking a similar structure group matrix of visible light images and infrared images as training samples, obtaining a base dictionary by using a Kronecker product of shear wavelets, obtaining a learning dictionary by using a sparse learning model and adopting a sequential updating iterative learning method, and finally, carrying out linear reconstruction on the base dictionary and the learning dictionary to obtain a final double-sparse dictionary.
Compared with the existing sparse representation fusion technology, the method has the following advantages:
1. according to the method, the non-local similarity of the images is utilized, the image similar structure group matrix is constructed, the relation between the image blocks is established, the capability of capturing the remarkable features of the images by the dictionary atoms can be effectively enhanced, and the accuracy of dictionary learning sparse coding is improved.
2. The method comprises the steps of forming a base dictionary by a Kronecker product of shear wavelets, obtaining a sparse dictionary through online learning, and performing linear reconstruction on the base dictionary and the sparse dictionary to obtain a double-sparse dictionary; the double sparse dictionaries are combined with respective advantages of the analytic dictionary and the learning dictionary, complexity of a dictionary learning model is reduced, and applicability of the enhanced redundant dictionary is enhanced.
3. The image fusion method for image structure group matrix double sparse learning is established, the fusion effect is obvious, the method can also be applied to the fusion of multi-mode images, multi-focus images and medical images, and the method has high application value in the field of image fusion.
Drawings
FIG. 1 is a schematic diagram of the structure of the method of the present invention.
Fig. 2 shows a first set of visible light and infrared image fusion experiments, which sequentially include a visible light image, an infrared image and a fusion image from left to right.
Fig. 3 shows a second set of visible light and infrared image fusion experiments, which sequentially include a visible light image, an infrared image and a fusion image from left to right.
Fig. 4 shows a third set of visible light and infrared image fusion experiment, which sequentially includes a visible light image, an infrared image and a fusion image from left to right.
Fig. 5 shows a fourth set of the visible light and infrared image fusion experiment, which sequentially includes a visible light image, an infrared image and a fusion image from left to right.
Detailed Description
A visible light and infrared image fusion method based on structural group double sparse learning comprises the following steps:
s1: performing sliding window processing on the input visible light and infrared images, searching for similar image blocks of the original image blocks, performing group vectorization on the original image blocks and the similar image blocks, and establishing an image similar structure group matrix;
s11: the method comprises the steps of adopting a sliding window technology, wherein the size of a sliding window is N multiplied by N, the sliding step length is 1 pixel, and dividing a visible light image V and an infrared image I with the size of M multiplied by N into (M-N +1) · (N-N +1) image blocks.
S12: for each original image block piIn L × L neighborhood, the Euclidean distance is used as the measurement criterion to calculate the sum piS most similar image blocks, and the original image block piForming a similarity group g with s similar image blocksiThere are s +1 image blocks in each similarity group. For visible and infrared images, (M-N +1) · (N-N +1) groups of similar structures, respectively, are obtained, respectivelyAnd
s13: similar structure group for visible light and infrared imageAndfirstly, arranging image blocks according to the sequence of column vectors to obtain image block vectorsThen the s +1 image block vectors in the similar structure group are connected end to obtain a similar structure group vector with higher dimensionality
S14: combining each similar structure group vector as a matrix column to respectively obtain a similar structure group matrix of the visible light and the infrared image
S2: constructing a double-sparse learning model, forming a base dictionary by using a Kronecker product of a shear wavelet, obtaining a sparse dictionary through online learning, and performing linear reconstruction on the base dictionary and the sparse dictionary to obtain a final double-sparse dictionary;
s21: for two-dimensional separable shear wavelets, the Kronecker product of the shear wavelets is used to form a base dictionaryLet A be an element of Rw×mFor sparse learning dictionaries, X ∈ Rm×(M-n+1)·(N-n+1)For sparse coefficient matrix, similar structure of visible light and infrared image is grouped into matrixAs a training sample, the dual sparse online learning model can then be represented as:
wherein x isiIs an arbitrary row of the sparse coefficient matrix X, ajIs an arbitrary column of the sparse dictionary A, | | | | | non-woven phosphor0Is L0Norm, solving the number of nonzero elements in the vector (namely sparsity); and p and k respectively control the sparsity of the sparse coefficient X and the sparse dictionary A.
S22: by using a pair dictionary atom ajThe learning method of sequential update obtains the sparse learning dictionary a through iteration, and the learning process can be expressed as:
S23: and multiplying the base dictionary phi and the sparse learning dictionary A, and performing linear reconstruction to obtain a final double-sparse dictionary D.
S3: combining double sparse dictionaries, carrying out sparse solution on the image similar structure group matrix by using SOMP to obtain a group sparse coefficient, and obtaining a final fusion image by using a large fusion rule and image reconstruction;
s31: similar structure group matrix of visible light and infrared images by using SOMP algorithm in combination with double sparse dictionaries DPerforming sparse decomposition to respectively obtainGroup sparsity coefficients for visible and infrared imagesAnd
s32: the group sparse coefficient combination adopts a fusion rule with the maximum absolute value, and can be expressed as follows:
wherein the content of the first and second substances,is composed ofT is 1,2, …, m. Final fused set of similar structures vector passThus obtaining the product.
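The choose-max rule of S32 can be sketched in a few lines; the array values below are illustrative only.

```python
import numpy as np

def fuse_max_abs(Xv, Xi):
    """Maximum-absolute-value rule: at every coefficient position keep the
    coefficient (visible or infrared) with the larger absolute value."""
    return np.where(np.abs(Xv) >= np.abs(Xi), Xv, Xi)

# toy coefficient vectors (illustrative values)
Xv = np.array([0.5, -2.0, 0.0])
Xi = np.array([-1.0, 1.5, 0.3])
Xf = fuse_max_abs(Xv, Xi)   # -> [-1.0, -2.0, 0.3]
```

The same call works element-wise on whole coefficient matrices, fusing all structure groups at once.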
S33: for each obtained fusion similarity structure group vectorIt is equally divided into s +1 subvectors. And reconstructing each sub vector into an image block with the size of n multiplied by n, placing the image block at a position corresponding to the reconstructed fusion image, and averaging the value at each position of the reconstructed fusion image according to the pixel superposition times to obtain the final fusion image.
Claims (3)
1. A visible light and infrared image fusion method based on structural group double sparse learning is characterized by comprising the following steps:
s1: performing sliding window processing on the input visible light and infrared images, searching for similar image blocks of the original image blocks, performing group vectorization on the original image blocks and the similar image blocks, and establishing an image similar structure group matrix;
s2: constructing a double-sparse learning model, forming a base dictionary by using a Kronecker product of a shear wavelet, obtaining a sparse dictionary through online learning, and performing linear reconstruction on the base dictionary and the sparse dictionary to obtain a final double-sparse dictionary;
s3: and combining the double sparse dictionaries, carrying out sparse solution on the image similar structure group by adopting the SOMP to obtain a group sparse coefficient, and obtaining a final fusion image by adopting a large fusion rule and image reconstruction.
2. The visible light and infrared image fusion method based on structure group double sparse learning of claim 1, characterized in that the similar structure group matrix construction process of the visible light and infrared image is as follows: and carrying out image blocking on the input image through a sliding window, searching for similar image blocks of the original image block by adopting Euclidean distance as a criterion, forming similar structure groups by the original image block and the similar image blocks, arranging the image blocks in each structure similar group according to a column vector sequence, and connecting the image block vectors end to obtain a similar structure group matrix of the visible light and the infrared image.
3. The visible light and infrared image fusion method based on structure group double sparse learning according to claim 1 or 2, characterized in that the process of constructing the double sparse dictionary facing to the similar structure group matrix is as follows: the method comprises the steps of taking a similar structure group matrix of visible light images and infrared images as training samples, obtaining a base dictionary by using a Kronecker product of shear wavelets, obtaining a learning dictionary by using a sparse learning model and adopting a sequential updating iterative learning method, and finally, carrying out linear reconstruction on the base dictionary and the learning dictionary to obtain a final double-sparse dictionary.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911270444.6A CN111080566A (en) | 2019-12-12 | 2019-12-12 | Visible light and infrared image fusion method based on structural group double-sparse learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111080566A true CN111080566A (en) | 2020-04-28 |
Family
ID=70313890
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911270444.6A Pending CN111080566A (en) | 2019-12-12 | 2019-12-12 | Visible light and infrared image fusion method based on structural group double-sparse learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111080566A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111652832A (en) * | 2020-07-09 | 2020-09-11 | 南昌航空大学 | Infrared and visible light image fusion method based on sliding window technology |
CN111815732A (en) * | 2020-07-24 | 2020-10-23 | 西北工业大学 | Method for coloring intermediate infrared image |
CN117218048A (en) * | 2023-11-07 | 2023-12-12 | 天津市测绘院有限公司 | Infrared and visible light image fusion method based on three-layer sparse smooth model |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104200451A (en) * | 2014-08-28 | 2014-12-10 | 西北工业大学 | Image fusion method based on non-local sparse K-SVD algorithm |
CN105761234A (en) * | 2016-01-28 | 2016-07-13 | 华南农业大学 | Structure sparse representation-based remote sensing image fusion method |
CN106251320A (en) * | 2016-08-15 | 2016-12-21 | 西北大学 | Remote sensing image fusion method based on joint sparse Yu structure dictionary |
CN110097501A (en) * | 2019-04-12 | 2019-08-06 | 武汉大学 | A kind of NDVI image interfusion method based on the sparse regularization of non-local mean gradient |
Non-Patent Citations (3)
Title |
---|
RUBINSTEIN R et al.: "Double Sparsity: Learning Sparse Dictionaries for Sparse Signal Approximation", IEEE Transactions on Signal Processing * |
ZHANG Xiao et al.: "Remote sensing image fusion based on structure group sparse representation", Journal of Image and Graphics * |
GAO Shan et al.: "Medical image fusion algorithm based on double sparse dictionaries and its application in cerebrovascular disease diagnosis", Beijing Biomedical Engineering * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111652832A (en) * | 2020-07-09 | 2020-09-11 | 南昌航空大学 | Infrared and visible light image fusion method based on sliding window technology |
CN111652832B (en) * | 2020-07-09 | 2023-05-12 | 南昌航空大学 | Infrared and visible light image fusion method based on sliding window technology |
CN111815732A (en) * | 2020-07-24 | 2020-10-23 | 西北工业大学 | Method for coloring intermediate infrared image |
CN111815732B (en) * | 2020-07-24 | 2022-04-01 | 西北工业大学 | Method for coloring intermediate infrared image |
CN117218048A (en) * | 2023-11-07 | 2023-12-12 | 天津市测绘院有限公司 | Infrared and visible light image fusion method based on three-layer sparse smooth model |
CN117218048B (en) * | 2023-11-07 | 2024-03-08 | 天津市测绘院有限公司 | Infrared and visible light image fusion method based on three-layer sparse smooth model |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109376804B (en) | Hyperspectral remote sensing image classification method based on attention mechanism and convolutional neural network | |
Zhang et al. | Sparse representation based multi-sensor image fusion for multi-focus and multi-modality images: A review | |
CN109741256B (en) | Image super-resolution reconstruction method based on sparse representation and deep learning | |
CN107657217B (en) | Infrared and visible light video fusion method based on moving target detection | |
Shekhar et al. | Analysis sparse coding models for image-based classification | |
CN111080566A (en) | Visible light and infrared image fusion method based on structural group double-sparse learning | |
Ji et al. | Nonlocal tensor completion for multitemporal remotely sensed images’ inpainting | |
Yang et al. | Multitask dictionary learning and sparse representation based single-image super-resolution reconstruction | |
Xu et al. | Hyperspectral computational imaging via collaborative Tucker3 tensor decomposition | |
Marivani et al. | Multimodal deep unfolding for guided image super-resolution | |
CN107301630B (en) | CS-MRI image reconstruction method based on ordering structure group non-convex constraint | |
WO2017110836A1 (en) | Method and system for fusing sensed measurements | |
Zou et al. | Robust compressive sensing of multichannel EEG signals in the presence of impulsive noise | |
Aghamaleki et al. | Image fusion using dual tree discrete wavelet transform and weights optimization | |
CN108257093A (en) | The single-frame images ultra-resolution method returned based on controllable core and Gaussian process | |
CN104820967B (en) | In-orbit calculating imaging method | |
CN114693577B (en) | Infrared polarized image fusion method based on Transformer | |
Heiser et al. | Compressive hyperspectral image reconstruction with deep neural networks | |
Wu et al. | A distributed fusion framework of multispectral and panchromatic images based on residual network | |
CN108596866B (en) | Medical image fusion method based on combination of sparse low-rank decomposition and visual saliency | |
Liu et al. | GJTD-LR: A trainable grouped joint tensor dictionary with low-rank prior for single hyperspectral image super-resolution | |
Ye et al. | Cross-scene hyperspectral image classification based on DWT and manifold-constrained subspace learning | |
CN116071226B (en) | Electronic microscope image registration system and method based on attention network | |
Niresi et al. | Robust hyperspectral inpainting via low-rank regularized untrained convolutional neural network | |
CN110689510B (en) | Sparse representation-based image fusion method introducing dictionary information |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
Application publication date: 20200428 |