CN112991200B

CN112991200B - Method and device for adaptively enhancing infrared image

Info

Publication number: CN112991200B
Application number: CN202110183673.5A
Authority: CN
Inventors: 张辉; 曹剑中; 刘学斌; 王浩
Original assignee: XiAn Institute of Optics and Precision Mechanics of CAS
Current assignee: XiAn Institute of Optics and Precision Mechanics of CAS
Priority date: 2021-02-08
Filing date: 2021-02-08
Publication date: 2024-02-02
Anticipated expiration: 2041-02-08
Also published as: CN112991200A

Abstract

The invention provides an infrared image self-adaptive enhancement method and device, which aims to solve the technical problems that the existing infrared image enhancement method has the defects of image local detail loss, insufficient image detail and incapability of forming an end-to-end solution, capability of exploring and enriching image details and capability of automatically repairing the image details. The invention designs an infrared image enhancement model based on an countermeasure generation network, fully utilizes the characteristics of the countermeasure neural network, realizes the dynamic balance of the generation sub-network and the identification sub-network through the neural network design and the loss function design, and can realize the end-to-end infrared image enhancement processing; in the model training process, the real visible light gray level image is used as a condition, the identification sub-network D is used for supervising and generating the sub-network G, and the mutual game between the sub-network and the identification sub-network is generated, so that the network model has good infrared image enhancement capability.

Description

Method and device for adaptively enhancing infrared image

Technical Field

The invention relates to a method and a device for adaptively enhancing an infrared image.

Background

With development and maturity of infrared imaging technology, various infrared imaging devices suitable for civil use are continuously emerging, and the infrared imaging devices play an increasingly important role in various fields of national economy. The infrared imaging has important applications of day and night observation and thermal target detection, and is widely applied to the fields of security monitoring, auxiliary driving, fire-fighting police, industrial monitoring, electric power monitoring and the like. However, due to the limitation of the infrared imaging principle, the infrared image has the characteristics of dark whole body, low contrast, blurred edge, large noise, unobvious detailed information, poor visual effect and the like.

To overcome the above-mentioned drawbacks of the infrared image, a histogram equalization method of the infrared image is disclosed in patent document publication No. CN104252700B, comprising: reading an infrared image, dividing the infrared image into at least two sub-images, carrying out histogram equalization processing on each sub-image to obtain an equalized sub-image of the sub-image, and then splicing the histogram equalized sub-images; the method comprises the core ideas that an infrared image is subjected to segmentation processing to obtain a plurality of sub-images, then each sub-image is subjected to histogram equalization enhancement processing in parallel, and the sub-images after histogram equalization are spliced and fused into a final infrared image after histogram equalization; the method has the defects that: (1) the contrast of the tidying image depends on the original infrared image, the local contrast of the image is insufficient, and the phenomena of detail loss and the like exist; (2) the image features of the response visible light are not combined, the image detail is not abundant enough, and the subsequent application such as target detection is difficult.

The patent document with publication number of CN110009569A discloses an infrared and visible light image fusion method based on a lightweight convolutional neural network. The key point of the scheme is that the neural network is utilized to extract the infrared image characteristics and the visible light image characteristics respectively, then a characteristic weight map is constructed, and then the fusion strategy is utilized to realize the fusion of the infrared image and the visible light image. Although the infrared image and the visible light image are fused, an end-to-end solution cannot be formed in engineering application, and each extracted feature is independent, and the method does not have the capability of exploring and enriching the image details and the capability of automatically repairing the image details.

Disclosure of Invention

The invention provides an infrared image self-adaptive enhancement method and device, which aims to solve the technical problems that the existing infrared image enhancement method has the defects of image local detail loss, insufficient image detail and incapability of forming an end-to-end solution, capability of exploring and enriching image details and capability of automatically repairing the image details.

The technical scheme of the invention is as follows:

the method for adaptively enhancing the infrared image is characterized by comprising the following steps of:

step 1: constructing an image dataset

Selecting an existing ImageNet data set as a basis for constructing an image data set, converting an image in the ImageNet data set into a visible light gray image by using gray level conversion, converting the gray level image into an infrared image by using contrast conversion or histogram conversion, and taking each obtained infrared image and a visible light gray level image corresponding to the infrared image as one sample of the image data set;

step 2: model training

Constructing an antagonistic neural network, inputting a sample in an image data set into the constructed antagonistic neural network to perform deep learning training on the antagonistic neural network to obtain an infrared image enhancement model;

step 3: downloading the infrared image enhancement model obtained in the step 2 into a parallel processing module;

step 4: the infrared and visible light image acquisition device acquires infrared images and visible light images of a target scene in real time, when the number of groups of the acquired infrared images and visible light gray images is 10% greater than the number of samples in the current image data set, randomly replacing the corresponding number of samples in the image data set with the acquired infrared images and visible light gray images, and then returning to the step 2; simultaneously, copying one part of infrared image acquired in real time and sending the copied part of infrared image to a parallel processing module;

step 5: and simultaneously carrying out parallel enhancement processing on a plurality of infrared images by utilizing an infrared image enhancement model arranged in a parallel processing module, and outputting corresponding enhanced images.

Further, the countermeasure neural network constructed in the step 2 includes generating a sub-network G and an authentication sub-network D;

the generation sub-network is used for generating an enhanced infrared image; the identification sub-network D is used for judging the authenticity of the visible light gray level image and the enhanced infrared image to obtain a probability value of 0-1, wherein 0 represents that the enhanced infrared image quality is very low and has no credibility, and 1 represents that the enhanced infrared image quality is very high and approaches to the real image.

Further, the network structure of the generated sub-network is as follows:

CR-CBR-CBR-D-Tanh

wherein, C is a convolution layer, R is a ReLu activation function, B is a batch normalization layer, D is a deconvolution layer, and Tanh is:z is a feature image obtained after deconvolution; e is the bottom of natural logarithm;

the loss function of the generated subnetwork is:

L _G ＝E[λ×L _cout +log(D(G(X),X)]

wherein:

e is Euclidean distance;

L _cont a quality loss function representing an enhanced infrared image and a visible gray scale image, defined by an L2 distance,

y is a visible light gray scale image corresponding to the input infrared image;

g (X) is an enhanced infrared image obtained by utilizing the generation sub-network; d is the output of the authentication sub-network;

the network structure of the authentication sub-network D is:

CL-CBL-CBL-CBL-C-Sigmoid；

wherein: sigmoid:c is a convolution layer, B is a batch normalization layer, and L is a leakage ReLU activation function;

the loss function of the authentication subnetwork D is:

wherein:

x is an infrared image, Y is a visible light gray scale image, and G (X) is an enhanced infrared image obtained by utilizing a generation sub-network.

The invention also provides an infrared image self-adaptive enhancement device, which is characterized in that: the system comprises an image data set, an infrared image enhancement module based on deep learning, an infrared image enhancement model, a model downloading module, an infrared and visible light image acquisition device, an image transmission module and a parallel processing module;

the image dataset is used for providing image samples for model training for the infrared image enhancement module based on the deep learning;

the infrared image enhancement module based on the deep learning is used for the deep learning training to obtain the infrared image enhancement model;

the model lower loading module is used for downloading the infrared image enhancement model into the parallel processing module in a remote service bus mode;

the infrared and visible light image acquisition device is used for acquiring infrared images and corresponding visible light gray images in real time and transmitting the infrared images and the corresponding visible light gray images to the image transmission module and the parallel processing module;

the parallel processing module is used for carrying out parallel enhancement processing on the received multiple infrared images;

the image transmission module is used for transmitting the visible light gray level image and the infrared image acquired by the infrared and visible light image acquisition device to the image data set.

Further, the image dataset is constructed according to the following method:

the visible light image is converted into a visible light gray image by gray level conversion, then the gray level image is converted into an infrared image by contrast conversion or histogram conversion, and each obtained infrared image and the corresponding visible light gray level image are taken as one sample of an image data set.

Further, the infrared image enhancement module based on deep learning comprises a generation sub-network G and an identification sub-network D;

Further, the network structure of the generated sub-network is as follows:

CR-CBR-CBR-D-Tanh

the loss function of the generated subnetwork is:

L _G ＝E[λ×L _cout +log(D(G(X),X)]

wherein:

e is Euclidean distance;

the network structure of the authentication sub-network D is:

CL-CBL-CBL-CBL-C-Sigmoid；

wherein: sigmoid:c is a convolution layer, B is a batch normalization layer, and L is a Leaky ReLU laserA living function;

the loss function of the authentication subnetwork D is:

wherein:

Compared with the prior art, the invention has the advantages that:

1. in the model training process, the invention uses the real visible light gray level image as the condition, and monitors the generation sub-network G by means of the identification sub-network D, and generates the mutual game between the sub-network and the identification sub-network, so that the network model has good infrared image enhancement capability.

2. The invention designs an infrared image enhancement model based on an countermeasure generation network, fully utilizes the characteristics of the countermeasure neural network, realizes the dynamic balance of the generation sub-network and the identification sub-network through the neural network design and the loss function design, and can realize the end-to-end infrared image enhancement processing.

3. The invention provides a synchronous update mechanism for infrared image enhancement training and dynamic acquisition of an infrared and visible light image acquisition device, which realizes automatic optimization of an infrared image enhancement model, further improves the infrared image enhancement processing effect and finally realizes the infrared image enhancement application of off-line modeling-on-line application-sustainable update.

4. The invention has been proved by system verification under the monitoring scene of the electric power system equipment, the result shows that the invention can effectively promote the detail characteristics of the infrared image, and the image contrast is improved by a self-adaptive mode, and the detail level of the image is effectively enhanced by introducing the visible light characteristics, thereby providing a good image foundation for the image recognition and state monitoring of the subsequent equipment monitoring.

Drawings

Fig. 1 is a schematic diagram of an infrared image adaptive enhancement device of the present invention.

FIG. 2 is an infrared image enhancement model of the present invention based on an countermeasure generation network.

FIG. 3 is a graph of the contrast of the effect of an original image before and after enhancement with an enhanced image according to the present invention, (a) is an original infrared image; (b) is an enhanced infrared image.

Detailed Description

The invention is further described below with reference to the accompanying drawings.

As shown in fig. 1, the method for adaptively enhancing an infrared image provided by the invention comprises the following steps:

step 1: constructing an image dataset

step 2: model training

step 5: and simultaneously carrying out parallel enhancement processing on a plurality of infrared images by utilizing an infrared image enhancement model arranged in a parallel processing module, and outputting the enhanced images.

As shown in FIG. 1, the invention also provides an infrared image self-adaptive enhancement device for realizing the infrared image self-adaptive enhancement method, which comprises an image data set, an infrared image enhancement module based on deep learning, an infrared image enhancement model, a model downloading module, an infrared and visible light image acquisition device, an image transmission module and a parallel processing module.

1. Image data set

The image data set is an image set used for training an infrared image enhancement module based on deep learning, and in view of the fact that infrared images are generally lacking in the infrared image enhancement based on a neural network at present, the infrared image is one of key factors capable of achieving a better image processing effect based on the deep learning infrared image enhancement technology. The present invention therefore primarily builds an image dataset with a sufficient number of samples.

The construction method of the image dataset comprises the following steps:

selecting an existing ImageNet data set as a basis for constructing an image data set, converting an image in the ImageNet data set into a visible light gray image by using gray level conversion, obtaining infrared images by using contrast conversion or histogram conversion technology on the gray level image, and taking each obtained infrared image and the corresponding visible light gray level image as one sample of the image data set; in this embodiment, the basic data set includes 10000 samples, where the resolution of the visible light gray scale image is: 512x512, infrared image resolution is: 512x512.

The method has the advantages that the problem of infrared image shortage can be solved, a training set with enough sample numbers is provided for the infrared image enhancement module based on deep learning, but because the infrared images in the basic data set constructed by the method are not real infrared images, if the infrared image enhancement module based on deep learning is trained by using only the basic data set, the infrared image enhancement model obtained by training is insufficient in infrared image coverage characteristics, and the infrared image enhancement effect is not ideal. To solve this problem, the present invention dynamically updates the image dataset based on the application scenario.

A method for dynamically updating an image dataset:

the infrared and visible light image acquisition device acquires a group of visible light gray level images and infrared images every set time (for example, 15 minutes), the acquisition time per day is 09:00-17:00, and the infrared and visible light gray level images of 32 groups are acquired by a single infrared and visible light image acquisition device per day. The acquisition time period is selected to be 09:00-17:00, and under the condition of better imaging conditions, infrared images and corresponding visible light gray level images are acquired at the same time, when the whole system is applied, not less than 50 sets of infrared and visible light image acquisition devices are deployed, and the acquisition time period is distributed in different scenes such as roads, forests, schools and the like, so that application pictures covering different scenes are ensured, namely: 1600 additional images are generated daily.

When the number of the acquired images is 10% greater than the number of the images in the current image dataset, randomly extracting 10% of the images from the current image dataset, carrying out replacement processing on the images by the acquired images, marking the images which are used for replacement and are acquired by the infrared and visible light image acquisition device in the replacement process, and when the images which are marked are updated and replaced next time, not carrying out replacement on the marked images, finally, when all the images in the whole image dataset are marked, indicating that the whole image dataset is updated, deleting the marks marked by the current sample images in the image dataset after the whole image dataset is updated, and then entering the next round of dynamic updating according to the same method.

Table 1 update threshold selection for image dataset

Dynamically updating thresholds	Infrared image enhanced PSNR
		3％	25.36
6％	25.78
		9％	26.02
10％	26.39

Table 1 is an experiment made by adopting different thresholds when the image dataset is dynamically updated, and after each dynamic update, an infrared image enhancement module based on deep learning is triggered to train to obtain a new infrared image enhancement model, and then the infrared image enhancement is performed by using the new infrared image enhancement model; by PSNR analysis of 512 infrared image enhancement training, it was found that as the threshold increases, the quality of the infrared image after enhancement gradually increases, and in view of quality and speed, 10% is finally selected as the threshold at the time of dynamic update.

2. Infrared image enhancement module based on deep learning

The infrared image enhancement module based on the deep learning is mainly used for the deep learning training, the input of the infrared image enhancement module is an image data set, and the output of the infrared image enhancement module is a trained infrared image enhancement model.

As shown in fig. 2, the network architecture of the deep learning-based infrared image enhancement module is based on an countermeasure generation network, and includes 2 sub-networks in total: a subnetwork G and an authentication subnetwork D are generated.

2.1 Generation of subnetworks

2.1.1 network formation

The generating sub-network is composed of 3 convolution layers conv and 1 deconvolution layer Deconv, and a batch of normalization networks are added after the second convolution layer and the third convolution layer, namely: in deep learning training, the Batch NN is a data normalization method proved by practice, and the effect of the Batch NN can accelerate the convergence rate of model training, so that the model training process is more stable, and a certain regularization effect is achieved.

2.1.2 generating a subnetwork design:

(1) the resolution of the input image of the generation sub-network is 256x256, so that the image in the image dataset needs to be scaled before being input into the generation sub-network;

(2) the network structure is as follows:

CR-CBR-CBR-D-Tanh

the size of the CR input image of the first convolution layer is 256x256, and the better characteristics are reserved through convolution operation and ReLu processing, so that 64 characteristic vectors are finally obtained, and then 2 convolution layer operations are followed to further strengthen the extracted characteristics.

(3) In order to reduce loss of image details, as shown in fig. 2, residual Connection Skip Connection is added after a convolution layer 1 and after a convolution layer 2, and residual Connection is performed again between the feature vector obtained after weighting processing and a convolution layer 3, so that the image feature vectors with different dimensions obtained by each convolution layer flow in each layer in a generation sub-network, and image detail features are reserved;

(4) generating a sub-network for generating an enhanced image (enhanced infrared image);

(5) and (3) loss function design: l (L) _G ＝E[λ×L _cout +log(D(G(X),X)]The method comprises the steps of carrying out a first treatment on the surface of the Wherein: l (L) _cont A quality loss function representing an enhanced infrared image and a visible gray scale image, defined by an L2 distance,e is Euclidean distance; y is a visible light gray scale image corresponding to the input infrared image; g (X) is an enhanced infrared image obtained by utilizing the generation sub-network; d is the output of the authentication sub-network; the authentication sub-network is used for judging the authenticity of the visible light gray level image and the enhanced infrared image to obtainProbability value of 0-1, 0 represents that the enhanced infrared image quality is very low and has no credibility, 1 represents that the enhanced infrared image quality is very high and approaches to a real image, L _G Using mathematical expectations to measure characteristics of generating and discriminating sub-networks in infrared image enhancement, where λ×L _cout The generation of the sub-network is encouraged to generate more realistic infrared enhanced images, and the log (G (X), X) in the formula encourages the identification of the sub-network to distinguish the infrared enhanced images, and the two are finally balanced dynamically.

2.1.3 related network parameter settings:

type(s)	Convolution kernel	Convolution step length	Filling
				conv	4×4	2×2	1×1
conv	3×3	1×1	1×1
				conv	3×3	1×1	1×1
deconv	4×4	2×2	1×1

2.2 authentication subnetworks

2.2.1 network formation

The authentication sub-network is composed of 5 convolutional layers conv.

2.2.2 authentication subnetwork design:

(1) the network input image is: generating an enhanced image output by a sub-network and a visible light gray image corresponding to the enhanced image

(2) The network structure is as follows: CL-CBL-CBL-CBL-C-Sigmoid, wherein: sigmoid:c is a convolution layer, B is a batch normalization layer, and L is a leakage ReLU activation function;

(3) the network adopts 5 convolution layers to obtain characteristic images, and uses a Sigmoid function to perform normalization processing to judge the similarity degree [0-1] of the infrared enhanced image and the visible light gray image;

(4) and (3) loss function design:wherein: l (L) _D For identifying the loss function of the sub-network, X is an infrared image, Y is a visible light gray scale image, G (X) is an enhanced infrared image obtained by generating the sub-network; the loss function is designed by adopting a cross entropy loss function, and aims to maximize the expected value of the loss function when the identification sub-network encounters real data such as a visible light gray level image, and judge the image as a generated image when the identification sub-network encounters generated data such as an infrared enhanced image, wherein the aim of the identification sub-network is that the generated image is identified in the whole training process, L _D Maximum, but for the generation of a sub-network the goal is for the generated image not to be identified, L _D Tends to be minimal, and adopts a maximumThe loss function design of the small game mode can enable the network model to have good infrared image enhancement capability.

2.2.3 related network parameter settings:

type(s)	Convolution kernel	Convolution step length	Filling
				conv	4×4	2×2	1×1
conv	4×4	2×2	1×1
				conv	4×4	2×2	1×1
conv	4×4	1×1	1×1
				conv	4×4	1×1	1×1

The image quality data obtained after the infrared image enhancement module based on deep learning is used for enhancing 512 infrared test images are shown in the following table, 512 images are randomly tested, and after the infrared image processing is carried out by the method, the image evaluation indexes PSNT and SSIM are better represented, so that the method has a good infrared image enhancement effect.

	PSNR	SSIM
			512 infrared test images	26.67	0.974

3. Infrared image enhancement model

The infrared image enhancement model is a result obtained after the image dataset is trained by the infrared image enhancement module based on deep learning.

4. Lower die assembly module of die

The model underfilling module is used for underfilling the infrared image enhancement model which is output after the training of the infrared image enhancement module based on the deep learning into the parallel processing module through a remote service bus mode.

5. Infrared and visible light image acquisition device

The infrared and visible light image acquisition device is used for acquiring infrared images and corresponding visible light gray images in real time, the acquired infrared images are used for being sent into the parallel processing module to be subjected to image enhancement processing, and the infrared images and the visible light gray images are also sent to the image data set through the image transmission module and used for dynamically updating the image data set.

6. Parallel processing module

The input of the parallel processing module is an infrared image acquired by an infrared and visible light image acquisition device in real time, and the received infrared image is enhanced by using an infrared image enhancement model arranged in the parallel processing module; in order to improve the real-time processing speed of infrared image enhancement, the parallel processing module adopts a Cuda-based acceleration technology, fully utilizes the GPU computing capability, and simultaneously adopts a multithread parallel technology to improve the processing speed of image enhancement.

7. Image transmission module

The image transmission module is responsible for transmitting visible light gray level images and infrared images acquired by the infrared and visible light image acquisition device to an image data set, when the number of incremental image groups is greater than or equal to a dynamic update threshold value of the image data set, the infrared image enhancement module based on deep learning is triggered to automatically train, automatic update of an infrared image enhancement model installed in a parallel processing module is realized, continuous update of the infrared image enhancement model is realized in a subsequent image enhancement process, the infrared image enhancement effect is continuously optimized, and finally, the infrared image enhancement application of offline modeling-online application-sustainable update is realized.

Fig. 3 is a graph comparing the effects of the original image before and after enhancement with the enhanced image after enhancement, wherein the left side is the original infrared image, the right side is the enhanced infrared image, and the picture content of the right side image is richer, the texture is obvious, the detail is prominent, and the visual effect is better.

Claims

1. An infrared image self-adaptation reinforcing device is characterized in that: the system comprises an image data set, an infrared image enhancement module based on deep learning, an infrared image enhancement model, a model downloading module, an infrared and visible light image acquisition device, an image transmission module and a parallel processing module;

the image dataset is used for providing image samples for model training for the infrared image enhancement module based on the deep learning; the updating mode of the image data set is as follows: when the number of groups of the collected infrared images and the visible light gray images is 10% greater than the number of samples in the current image data set, randomly replacing the corresponding number of samples in the image data set by the collected infrared images and the visible light gray images, marking the images collected by the infrared and visible light image collecting devices for replacement in the replacement process, and when the images are updated and replaced next time, not replacing the marked images; after the whole image data set is updated, deleting marks marked by the current sample image in the image data set, and then entering the next round of dynamic updating according to the same method;

the infrared image enhancement module based on deep learning comprises a generation sub-network G and an identification sub-network D;

the generation sub-network is used for generating an enhanced infrared image; the identification sub-network D is used for judging the authenticity of the visible light gray level image and the enhanced infrared image to obtain a probability value of 0-1, wherein 0 represents that the enhanced infrared image quality is very low and has no credibility, and 1 represents that the enhanced infrared image quality is very high and approaches to the real image;

the network structure of the generation sub-network is as follows:

CR-CBR-CBR-D-Tanh

the loss function of the generated subnetwork is:

L _G ＝E[λ×L _cout +log(D(G(X),X)]

wherein:

e is Euclidean distance;

the network structure of the authentication sub-network D is:

CL-CBL-CBL-CBL-C-Sigmoid；

the loss function of the authentication subnetwork D is:

wherein:

x is an infrared image, Y is a visible light gray scale image, and G (X) is an enhanced infrared image obtained by utilizing a generation sub-network;

the parallel processing module is used for carrying out parallel enhancement processing on a plurality of received infrared images, and simultaneously processing a plurality of images by utilizing a multithreading parallel technology, so that the processing speed of image enhancement is improved;

the image transmission module is used for transmitting the visible light gray level images and the infrared images acquired by the infrared and visible light image acquisition device to the image data set, and triggering the automatic training of the infrared image enhancement module when the number of the incremental image groups is greater than or equal to the dynamic update threshold value of the image data set, so as to realize the automatic update of the infrared image enhancement model which is downloaded in the parallel processing module.

2. The infrared image adaptive enhancement device of claim 1, wherein:

the image dataset is constructed according to the following method:

3. A method of adaptive enhancement of an infrared image based on the infrared image adaptive enhancement device according to any one of claims 1 or 2, comprising the steps of:

step 1: constructing an image dataset

step 2: model training

the generation sub-network of the infrared image enhancement model consists of 3 convolution layers and 1 deconvolution layer, wherein a residual error processing module is used between the convolution layers, and the method specifically comprises the following steps: adding residual connection after the first convolution layer and after the second convolution layer, carrying out residual connection again on the feature vector obtained after weighting treatment and the third convolution layer, enabling the image feature vectors with different dimensions obtained by all convolution layers to flow in all layers in the generation sub-network, and reserving image detail features;

step 4: the infrared and visible light image acquisition device acquires infrared images and visible light images of a target scene in real time, when the number of groups of the acquired infrared images and visible light gray images is 10% greater than the number of samples in the current image data set, randomly replacing the corresponding number of samples in the image data set by the acquired infrared images and visible light gray images, marking the images acquired by the infrared and visible light image acquisition device for replacement in the replacement process, and when the images marked for the next updating and replacement are replaced, not replacing the marked images, and then returning to the step 2; after the whole image data set is updated, deleting marks marked by the current sample image in the image data set, and then entering the next round of dynamic updating according to the same method; simultaneously, copying one part of infrared image acquired in real time and sending the copied part of infrared image to a parallel processing module;