CN115578621B

CN115578621B - Image recognition method based on multi-source data fusion

Info

Publication number: CN115578621B
Application number: CN202211355233.4A
Authority: CN
Inventors: 王浩宇; 冯小杰; 程玉虎; 王雪松; 刘晓敏
Original assignee: China University of Mining and Technology CUMT
Current assignee: China University of Mining and Technology CUMT
Priority date: 2022-11-01
Filing date: 2022-11-01
Publication date: 2023-06-20
Anticipated expiration: 2042-11-01
Also published as: CN115578621A

Abstract

The invention discloses an image recognition method based on multi-source data fusion, which can realize multi-source image fusion of visible light, infrared and the like and is applied to image recognition tasks. Aiming at the problem that the single-mode data cannot provide enough information for the image recognition task, the image recognition method based on multi-source data fusion is provided, and the image recognition task can be effectively performed by fusing the multi-source data. Firstly, decomposing visible light and infrared images through scale decomposition to obtain a basic image of reinforced contour information; then, respectively carrying out sub-division on the visible light and infrared images; then, respectively carrying out weighted fusion on the subgraphs corresponding to the two images, and obtaining a basic fusion image from the fused subgraphs; in order to obtain the original detail information, the basic fusion image and the original image are subjected to weighted fusion to highlight the salient features of the image, so that a final fusion image is obtained; and finally, inputting the fused image into an image recognition network to perform an image recognition task.

Description

Image recognition method based on multi-source data fusion

Technical Field

The invention relates to an image recognition method based on multi-source data fusion, and belongs to the field of pattern recognition.

Background

With the continued development of sensor technology, a variety of different types of data are continually being collected by researchers. The single-mode data with insufficient information quantity cannot meet the high requirement of each task on precision. The multi-mode fusion technology can fuse different data about the same target acquired from different sensors through a unified frame, so that sufficient information is provided for training of a model. Infrared and visible light fusion is an important and frequently occurring problem. In recent years, a plurality of fusion methods have been proposed to fuse an infrared image with a visible light image to obtain a fused image with more retained detailed information, which can be applied to various tasks such as object detection, image classification, segmentation, and the like.

The multi-modal fusion method can be broadly divided into three types: and (3) data fusion, decision fusion and feature fusion. The data fusion directly processes the data acquired by different sensors and then inputs the data into the model. The most common data fusion method based on the spatial domain and the transformation domain comprises a logic filtering method, a weighted average method, a pyramid decomposition method, a wavelet transformation method and the like. The data fusion can reserve more detail information, and the data utilization rate is increased.

The decision fusion is to extract the characteristics of the multi-mode data respectively, then to fuse and utilize the multi-mode characteristics in a targeted manner in a decision layer, and finally to make an optimal decision according to a certain criterion. Common decision-based fusion methods include expert systems, bayesian reasoning, voting methods, and the like.

The feature fusion firstly extracts the features of the image, and then the fused feature images are obtained through analysis, processing and integration, and the features with discriminant are extracted from the fused feature images, so that the detection precision is improved. Compared with the disadvantages that the data fusion processing is long in time consumption and easy to be affected by noise, strict registration is needed, and late fusion is relatively dependent on the feature extraction capability of the previous level, the feature fusion not only furthest reserves the effective identification information of multiple features participating in fusion, but also eliminates the redundant information of the correlation among the multiple features caused by subjective and objective factors to a great extent, thereby reserving effective target detection information and realizing considerable information compression.

Many multi-mode fusion methods have good effects in infrared and visible light image fusion tasks, but the loss of many detail areas is difficult to avoid during image fusion, so that the fusion quality is low, and the fusion picture is unfavorable for downstream tasks. This patent is therefore considered to be designed.

Disclosure of Invention

The invention aims to: in order to overcome the defects in the prior art, the invention provides an image recognition method based on multi-source data fusion, which can fuse a visible light image and an infrared image into an image with enhanced significance characteristics so as to facilitate the execution of an image recognition task.

The invention discloses an image recognition method based on multi-source data fusion, which comprises the following steps:

step 1, respectively carrying out scale decomposition on a visible light original image and an infrared light original image to obtain a visible light basic image and an infrared light basic image of enhanced contour information;

step 2, respectively carrying out sub-division on the visible light basic image and the infrared light basic image; weighting and fusing the corresponding subgraphs to obtain a basic fusion image;

step 3, carrying out weighted fusion on the basic fusion image and the visible light original image to obtain a final fusion image;

and step 4, inputting the fused images into an image recognition network to perform an image recognition task.

Further, in step 1, the visible light base image and the infrared light base image may be obtained by the following formula:

wherein ,X_S Representing an original image, which is a visible original image or an infrared original image,

represents a visible light base image or an infrared light base image g _x and g_y Respectively horizontal and vertical gradient operators, and lambda is a weight factor.

Further, the step 2 specifically includes the following:

first, sub-division is performed on a visible light base image and an infrared light base image, respectively:

wherein ,

representing an nth sub-graph after dividing the basic image;

then, respectively carrying out weighted fusion on the subgraphs corresponding to the two images:

wherein ,

and />

The ith sub-picture, respectively representing visible and infrared light,/->

Subgraph fusion weights based on second order statistics and representing visible light base image, < ->

Sub-graph fusion weight of table infrared basic image based on second order statistics,/->

And the final output basic fusion image.

Further, in step 3, the basic fusion image and the original image are subjected to weighted fusion to highlight the salient features of the image, so as to obtain a final fusion image, which is specifically expressed as follows:

wherein ,

and X_I Respectively a basic fusion image and an original visible light image, wherein mu is a significance fusion coefficient based on second order statistics, F _k Is the final fused image.

The beneficial effects are that: the method comprises the steps of decomposing an original image to obtain a visible light basic image with rich distinguishing characteristics and an infrared light basic image with obvious contour characteristics, dividing the two basic images into subgraphs, fusing the subgraphs, obtaining local area characteristics of the two images to obtain a fused image with the obvious contour characteristics, carrying out weighted fusion on the basic fused image and the original image to obtain original detail information, and highlighting the obvious characteristics of the image to obtain a final fused image.

Drawings

FIG. 1 is a schematic flow chart of the method of the present invention.

Detailed Description

The invention is further explained below with reference to the drawings.

As shown in fig. 1, an image recognition method based on multi-source data fusion includes the following steps:

step 1, respectively carrying out scale decomposition on a visible light original image and an infrared light original image to obtain a visible light basic image and an infrared light basic image of the enhanced contour information;

visible light images can provide a richer discriminative feature, but for scenes with lower visibility, such as smoke, rain, fog, evening and night scenes, it is difficult to obtain the salient features of the object, while infrared images can provide salient features in these scenes. And decomposing the visible light and infrared images through scale decomposition to obtain a basic image of the enhanced contour information.

For each original image X _S Base image

Can be obtained by solving the following formula:

wherein ,g_x and g_y Respectively horizontal and vertical gradient operators, wherein lambda is a weight factor; both the visible light base image and the infrared light base image context can be obtained by the above equation.

Step S2: firstly, respectively carrying out sub-division on visible light and infrared images; then respectively carrying out weighted fusion on the subgraphs corresponding to the two images; finally, obtaining a basic fusion image from the fused subgraphs;

the visible light image is difficult to obtain the salient contour features for scenes with low visibility, the infrared is sensitive to the heat source, and the basic images obtained by decomposing the two images are subjected to sub-image weighted fusion to obtain the basic fusion image with the salient contour features so as to fully utilize the local area features of the two images.

wherein ,

representing the nth sub-graph after dividing the base image

wherein ,

and />

The ith sub-picture, respectively representing visible and infrared light,/->

And the final output basic fusion image.

Step S3: in order to obtain the original detail information, the basic fusion image and the visible light original image are subjected to weighted fusion to highlight the salient features of the image, so that a final fusion image is obtained;

and fusing visible light and infrared significance contour information by the basic fusion image obtained through sub-image weighted fusion to obtain a contour information reinforced fusion image. However, in the case of scale decomposition, a part of detail images are discarded, so that the original images need to be weighted and fused to supplement the detail parts.

Weighting and fusing the basic fusion image and the original image to highlight the salient features of the image, so as to obtain a final fusion image:

wherein ,

and X_I Respectively a basic fusion image and a visible light original image, wherein mu is a significance fusion coefficient based on second order statistics, F _k Is the final fused image.

Step S4: and inputting the final fusion image into an image recognition network to perform an image recognition task.

Claims

1. An image recognition method based on multi-source data fusion is characterized by comprising the following steps:

step 1, respectively performing scale decomposition on a visible light original image and an infrared light original image to obtain a visible light basic image and an infrared light basic image, wherein the visible light basic image and the infrared light basic image can be obtained by the following formula:

wherein ,X_S The original image is presented in a table,

represents a visible light base image or an infrared light base image g _x and g_y Respectively horizontal and vertical gradient operators, wherein lambda is a weight factor;

step 2, respectively carrying out sub-division on the visible light basic image and the infrared light basic image; the corresponding subgraphs are subjected to weighted fusion to obtain a basic fusion image, which comprises the following specific contents: