CN112991200B - Method and device for adaptively enhancing infrared image - Google Patents

Method and device for adaptively enhancing infrared image Download PDF

Info

Publication number
CN112991200B
CN112991200B CN202110183673.5A CN202110183673A CN112991200B CN 112991200 B CN112991200 B CN 112991200B CN 202110183673 A CN202110183673 A CN 202110183673A CN 112991200 B CN112991200 B CN 112991200B
Authority
CN
China
Prior art keywords
image
infrared
images
network
visible light
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110183673.5A
Other languages
Chinese (zh)
Other versions
CN112991200A (en
Inventor
张辉
曹剑中
刘学斌
王浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
XiAn Institute of Optics and Precision Mechanics of CAS
Original Assignee
XiAn Institute of Optics and Precision Mechanics of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by XiAn Institute of Optics and Precision Mechanics of CAS filed Critical XiAn Institute of Optics and Precision Mechanics of CAS
Priority to CN202110183673.5A priority Critical patent/CN112991200B/en
Publication of CN112991200A publication Critical patent/CN112991200A/en
Application granted granted Critical
Publication of CN112991200B publication Critical patent/CN112991200B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/90Dynamic range modification of images or parts thereof
    • G06T5/92Dynamic range modification of images or parts thereof based on global image properties
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10048Infrared image

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)

Abstract

The invention provides an infrared image self-adaptive enhancement method and device, which aims to solve the technical problems that the existing infrared image enhancement method has the defects of image local detail loss, insufficient image detail and incapability of forming an end-to-end solution, capability of exploring and enriching image details and capability of automatically repairing the image details. The invention designs an infrared image enhancement model based on an countermeasure generation network, fully utilizes the characteristics of the countermeasure neural network, realizes the dynamic balance of the generation sub-network and the identification sub-network through the neural network design and the loss function design, and can realize the end-to-end infrared image enhancement processing; in the model training process, the real visible light gray level image is used as a condition, the identification sub-network D is used for supervising and generating the sub-network G, and the mutual game between the sub-network and the identification sub-network is generated, so that the network model has good infrared image enhancement capability.

Description

Method and device for adaptively enhancing infrared image
Technical Field
The invention relates to a method and a device for adaptively enhancing an infrared image.
Background
With development and maturity of infrared imaging technology, various infrared imaging devices suitable for civil use are continuously emerging, and the infrared imaging devices play an increasingly important role in various fields of national economy. The infrared imaging has important applications of day and night observation and thermal target detection, and is widely applied to the fields of security monitoring, auxiliary driving, fire-fighting police, industrial monitoring, electric power monitoring and the like. However, due to the limitation of the infrared imaging principle, the infrared image has the characteristics of dark whole body, low contrast, blurred edge, large noise, unobvious detailed information, poor visual effect and the like.
To overcome the above-mentioned drawbacks of the infrared image, a histogram equalization method of the infrared image is disclosed in patent document publication No. CN104252700B, comprising: reading an infrared image, dividing the infrared image into at least two sub-images, carrying out histogram equalization processing on each sub-image to obtain an equalized sub-image of the sub-image, and then splicing the histogram equalized sub-images; the method comprises the core ideas that an infrared image is subjected to segmentation processing to obtain a plurality of sub-images, then each sub-image is subjected to histogram equalization enhancement processing in parallel, and the sub-images after histogram equalization are spliced and fused into a final infrared image after histogram equalization; the method has the defects that: (1) the contrast of the tidying image depends on the original infrared image, the local contrast of the image is insufficient, and the phenomena of detail loss and the like exist; (2) the image features of the response visible light are not combined, the image detail is not abundant enough, and the subsequent application such as target detection is difficult.
The patent document with publication number of CN110009569A discloses an infrared and visible light image fusion method based on a lightweight convolutional neural network. The key point of the scheme is that the neural network is utilized to extract the infrared image characteristics and the visible light image characteristics respectively, then a characteristic weight map is constructed, and then the fusion strategy is utilized to realize the fusion of the infrared image and the visible light image. Although the infrared image and the visible light image are fused, an end-to-end solution cannot be formed in engineering application, and each extracted feature is independent, and the method does not have the capability of exploring and enriching the image details and the capability of automatically repairing the image details.
Disclosure of Invention
The invention provides an infrared image self-adaptive enhancement method and device, which aims to solve the technical problems that the existing infrared image enhancement method has the defects of image local detail loss, insufficient image detail and incapability of forming an end-to-end solution, capability of exploring and enriching image details and capability of automatically repairing the image details.
The technical scheme of the invention is as follows:
the method for adaptively enhancing the infrared image is characterized by comprising the following steps of:
step 1: constructing an image dataset
Selecting an existing ImageNet data set as a basis for constructing an image data set, converting an image in the ImageNet data set into a visible light gray image by using gray level conversion, converting the gray level image into an infrared image by using contrast conversion or histogram conversion, and taking each obtained infrared image and a visible light gray level image corresponding to the infrared image as one sample of the image data set;
step 2: model training
Constructing an antagonistic neural network, inputting a sample in an image data set into the constructed antagonistic neural network to perform deep learning training on the antagonistic neural network to obtain an infrared image enhancement model;
step 3: downloading the infrared image enhancement model obtained in the step 2 into a parallel processing module;
step 4: the infrared and visible light image acquisition device acquires infrared images and visible light images of a target scene in real time, when the number of groups of the acquired infrared images and visible light gray images is 10% greater than the number of samples in the current image data set, randomly replacing the corresponding number of samples in the image data set with the acquired infrared images and visible light gray images, and then returning to the step 2; simultaneously, copying one part of infrared image acquired in real time and sending the copied part of infrared image to a parallel processing module;
step 5: and simultaneously carrying out parallel enhancement processing on a plurality of infrared images by utilizing an infrared image enhancement model arranged in a parallel processing module, and outputting corresponding enhanced images.
Further, the countermeasure neural network constructed in the step 2 includes generating a sub-network G and an authentication sub-network D;
the generation sub-network is used for generating an enhanced infrared image; the identification sub-network D is used for judging the authenticity of the visible light gray level image and the enhanced infrared image to obtain a probability value of 0-1, wherein 0 represents that the enhanced infrared image quality is very low and has no credibility, and 1 represents that the enhanced infrared image quality is very high and approaches to the real image.
Further, the network structure of the generated sub-network is as follows:
CR-CBR-CBR-D-Tanh
wherein, C is a convolution layer, R is a ReLu activation function, B is a batch normalization layer, D is a deconvolution layer, and Tanh is:z is a feature image obtained after deconvolution; e is the bottom of natural logarithm;
the loss function of the generated subnetwork is:
L G =E[λ×L cout +log(D(G(X),X)]
wherein:
e is Euclidean distance;
L cont a quality loss function representing an enhanced infrared image and a visible gray scale image, defined by an L2 distance,
y is a visible light gray scale image corresponding to the input infrared image;
g (X) is an enhanced infrared image obtained by utilizing the generation sub-network; d is the output of the authentication sub-network;
the network structure of the authentication sub-network D is:
CL-CBL-CBL-CBL-C-Sigmoid;
wherein: sigmoid:c is a convolution layer, B is a batch normalization layer, and L is a leakage ReLU activation function;
the loss function of the authentication subnetwork D is:
wherein:
x is an infrared image, Y is a visible light gray scale image, and G (X) is an enhanced infrared image obtained by utilizing a generation sub-network.
The invention also provides an infrared image self-adaptive enhancement device, which is characterized in that: the system comprises an image data set, an infrared image enhancement module based on deep learning, an infrared image enhancement model, a model downloading module, an infrared and visible light image acquisition device, an image transmission module and a parallel processing module;
the image dataset is used for providing image samples for model training for the infrared image enhancement module based on the deep learning;
the infrared image enhancement module based on the deep learning is used for the deep learning training to obtain the infrared image enhancement model;
the model lower loading module is used for downloading the infrared image enhancement model into the parallel processing module in a remote service bus mode;
the infrared and visible light image acquisition device is used for acquiring infrared images and corresponding visible light gray images in real time and transmitting the infrared images and the corresponding visible light gray images to the image transmission module and the parallel processing module;
the parallel processing module is used for carrying out parallel enhancement processing on the received multiple infrared images;
the image transmission module is used for transmitting the visible light gray level image and the infrared image acquired by the infrared and visible light image acquisition device to the image data set.
Further, the image dataset is constructed according to the following method:
the visible light image is converted into a visible light gray image by gray level conversion, then the gray level image is converted into an infrared image by contrast conversion or histogram conversion, and each obtained infrared image and the corresponding visible light gray level image are taken as one sample of an image data set.
Further, the infrared image enhancement module based on deep learning comprises a generation sub-network G and an identification sub-network D;
the generation sub-network is used for generating an enhanced infrared image; the identification sub-network D is used for judging the authenticity of the visible light gray level image and the enhanced infrared image to obtain a probability value of 0-1, wherein 0 represents that the enhanced infrared image quality is very low and has no credibility, and 1 represents that the enhanced infrared image quality is very high and approaches to the real image.
Further, the network structure of the generated sub-network is as follows:
CR-CBR-CBR-D-Tanh
wherein, C is a convolution layer, R is a ReLu activation function, B is a batch normalization layer, D is a deconvolution layer, and Tanh is:z is a feature image obtained after deconvolution; e is the bottom of natural logarithm;
the loss function of the generated subnetwork is:
L G =E[λ×L cout +log(D(G(X),X)]
wherein:
e is Euclidean distance;
L cont a quality loss function representing an enhanced infrared image and a visible gray scale image, defined by an L2 distance,
y is a visible light gray scale image corresponding to the input infrared image;
g (X) is an enhanced infrared image obtained by utilizing the generation sub-network; d is the output of the authentication sub-network;
the network structure of the authentication sub-network D is:
CL-CBL-CBL-CBL-C-Sigmoid;
wherein: sigmoid:c is a convolution layer, B is a batch normalization layer, and L is a Leaky ReLU laserA living function;
the loss function of the authentication subnetwork D is:
wherein:
x is an infrared image, Y is a visible light gray scale image, and G (X) is an enhanced infrared image obtained by utilizing a generation sub-network.
Compared with the prior art, the invention has the advantages that:
1. in the model training process, the invention uses the real visible light gray level image as the condition, and monitors the generation sub-network G by means of the identification sub-network D, and generates the mutual game between the sub-network and the identification sub-network, so that the network model has good infrared image enhancement capability.
2. The invention designs an infrared image enhancement model based on an countermeasure generation network, fully utilizes the characteristics of the countermeasure neural network, realizes the dynamic balance of the generation sub-network and the identification sub-network through the neural network design and the loss function design, and can realize the end-to-end infrared image enhancement processing.
3. The invention provides a synchronous update mechanism for infrared image enhancement training and dynamic acquisition of an infrared and visible light image acquisition device, which realizes automatic optimization of an infrared image enhancement model, further improves the infrared image enhancement processing effect and finally realizes the infrared image enhancement application of off-line modeling-on-line application-sustainable update.
4. The invention has been proved by system verification under the monitoring scene of the electric power system equipment, the result shows that the invention can effectively promote the detail characteristics of the infrared image, and the image contrast is improved by a self-adaptive mode, and the detail level of the image is effectively enhanced by introducing the visible light characteristics, thereby providing a good image foundation for the image recognition and state monitoring of the subsequent equipment monitoring.
Drawings
Fig. 1 is a schematic diagram of an infrared image adaptive enhancement device of the present invention.
FIG. 2 is an infrared image enhancement model of the present invention based on an countermeasure generation network.
FIG. 3 is a graph of the contrast of the effect of an original image before and after enhancement with an enhanced image according to the present invention, (a) is an original infrared image; (b) is an enhanced infrared image.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
As shown in fig. 1, the method for adaptively enhancing an infrared image provided by the invention comprises the following steps:
step 1: constructing an image dataset
Selecting an existing ImageNet data set as a basis for constructing an image data set, converting an image in the ImageNet data set into a visible light gray image by using gray level conversion, converting the gray level image into an infrared image by using contrast conversion or histogram conversion, and taking each obtained infrared image and a visible light gray level image corresponding to the infrared image as one sample of the image data set;
step 2: model training
Constructing an antagonistic neural network, inputting a sample in an image data set into the constructed antagonistic neural network to perform deep learning training on the antagonistic neural network to obtain an infrared image enhancement model;
step 3: downloading the infrared image enhancement model obtained in the step 2 into a parallel processing module;
step 4: the infrared and visible light image acquisition device acquires infrared images and visible light images of a target scene in real time, when the number of groups of the acquired infrared images and visible light gray images is 10% greater than the number of samples in the current image data set, randomly replacing the corresponding number of samples in the image data set with the acquired infrared images and visible light gray images, and then returning to the step 2; simultaneously, copying one part of infrared image acquired in real time and sending the copied part of infrared image to a parallel processing module;
step 5: and simultaneously carrying out parallel enhancement processing on a plurality of infrared images by utilizing an infrared image enhancement model arranged in a parallel processing module, and outputting the enhanced images.
As shown in FIG. 1, the invention also provides an infrared image self-adaptive enhancement device for realizing the infrared image self-adaptive enhancement method, which comprises an image data set, an infrared image enhancement module based on deep learning, an infrared image enhancement model, a model downloading module, an infrared and visible light image acquisition device, an image transmission module and a parallel processing module.
1. Image data set
The image data set is an image set used for training an infrared image enhancement module based on deep learning, and in view of the fact that infrared images are generally lacking in the infrared image enhancement based on a neural network at present, the infrared image is one of key factors capable of achieving a better image processing effect based on the deep learning infrared image enhancement technology. The present invention therefore primarily builds an image dataset with a sufficient number of samples.
The construction method of the image dataset comprises the following steps:
selecting an existing ImageNet data set as a basis for constructing an image data set, converting an image in the ImageNet data set into a visible light gray image by using gray level conversion, obtaining infrared images by using contrast conversion or histogram conversion technology on the gray level image, and taking each obtained infrared image and the corresponding visible light gray level image as one sample of the image data set; in this embodiment, the basic data set includes 10000 samples, where the resolution of the visible light gray scale image is: 512x512, infrared image resolution is: 512x512.
The method has the advantages that the problem of infrared image shortage can be solved, a training set with enough sample numbers is provided for the infrared image enhancement module based on deep learning, but because the infrared images in the basic data set constructed by the method are not real infrared images, if the infrared image enhancement module based on deep learning is trained by using only the basic data set, the infrared image enhancement model obtained by training is insufficient in infrared image coverage characteristics, and the infrared image enhancement effect is not ideal. To solve this problem, the present invention dynamically updates the image dataset based on the application scenario.
A method for dynamically updating an image dataset:
the infrared and visible light image acquisition device acquires a group of visible light gray level images and infrared images every set time (for example, 15 minutes), the acquisition time per day is 09:00-17:00, and the infrared and visible light gray level images of 32 groups are acquired by a single infrared and visible light image acquisition device per day. The acquisition time period is selected to be 09:00-17:00, and under the condition of better imaging conditions, infrared images and corresponding visible light gray level images are acquired at the same time, when the whole system is applied, not less than 50 sets of infrared and visible light image acquisition devices are deployed, and the acquisition time period is distributed in different scenes such as roads, forests, schools and the like, so that application pictures covering different scenes are ensured, namely: 1600 additional images are generated daily.
When the number of the acquired images is 10% greater than the number of the images in the current image dataset, randomly extracting 10% of the images from the current image dataset, carrying out replacement processing on the images by the acquired images, marking the images which are used for replacement and are acquired by the infrared and visible light image acquisition device in the replacement process, and when the images which are marked are updated and replaced next time, not carrying out replacement on the marked images, finally, when all the images in the whole image dataset are marked, indicating that the whole image dataset is updated, deleting the marks marked by the current sample images in the image dataset after the whole image dataset is updated, and then entering the next round of dynamic updating according to the same method.
Table 1 update threshold selection for image dataset
Dynamically updating thresholds Infrared image enhanced PSNR
3% 25.36
6% 25.78
9% 26.02
10% 26.39
Table 1 is an experiment made by adopting different thresholds when the image dataset is dynamically updated, and after each dynamic update, an infrared image enhancement module based on deep learning is triggered to train to obtain a new infrared image enhancement model, and then the infrared image enhancement is performed by using the new infrared image enhancement model; by PSNR analysis of 512 infrared image enhancement training, it was found that as the threshold increases, the quality of the infrared image after enhancement gradually increases, and in view of quality and speed, 10% is finally selected as the threshold at the time of dynamic update.
2. Infrared image enhancement module based on deep learning
The infrared image enhancement module based on the deep learning is mainly used for the deep learning training, the input of the infrared image enhancement module is an image data set, and the output of the infrared image enhancement module is a trained infrared image enhancement model.
As shown in fig. 2, the network architecture of the deep learning-based infrared image enhancement module is based on an countermeasure generation network, and includes 2 sub-networks in total: a subnetwork G and an authentication subnetwork D are generated.
2.1 Generation of subnetworks
2.1.1 network formation
The generating sub-network is composed of 3 convolution layers conv and 1 deconvolution layer Deconv, and a batch of normalization networks are added after the second convolution layer and the third convolution layer, namely: in deep learning training, the Batch NN is a data normalization method proved by practice, and the effect of the Batch NN can accelerate the convergence rate of model training, so that the model training process is more stable, and a certain regularization effect is achieved.
2.1.2 generating a subnetwork design:
(1) the resolution of the input image of the generation sub-network is 256x256, so that the image in the image dataset needs to be scaled before being input into the generation sub-network;
(2) the network structure is as follows:
CR-CBR-CBR-D-Tanh
wherein, C is a convolution layer, R is a ReLu activation function, B is a batch normalization layer, D is a deconvolution layer, and Tanh is:z is a feature image obtained after deconvolution; e is the bottom of natural logarithm;
the size of the CR input image of the first convolution layer is 256x256, and the better characteristics are reserved through convolution operation and ReLu processing, so that 64 characteristic vectors are finally obtained, and then 2 convolution layer operations are followed to further strengthen the extracted characteristics.
(3) In order to reduce loss of image details, as shown in fig. 2, residual Connection Skip Connection is added after a convolution layer 1 and after a convolution layer 2, and residual Connection is performed again between the feature vector obtained after weighting processing and a convolution layer 3, so that the image feature vectors with different dimensions obtained by each convolution layer flow in each layer in a generation sub-network, and image detail features are reserved;
(4) generating a sub-network for generating an enhanced image (enhanced infrared image);
(5) and (3) loss function design: l (L) G =E[λ×L cout +log(D(G(X),X)]The method comprises the steps of carrying out a first treatment on the surface of the Wherein: l (L) cont A quality loss function representing an enhanced infrared image and a visible gray scale image, defined by an L2 distance,e is Euclidean distance; y is a visible light gray scale image corresponding to the input infrared image; g (X) is an enhanced infrared image obtained by utilizing the generation sub-network; d is the output of the authentication sub-network; the authentication sub-network is used for judging the authenticity of the visible light gray level image and the enhanced infrared image to obtainProbability value of 0-1, 0 represents that the enhanced infrared image quality is very low and has no credibility, 1 represents that the enhanced infrared image quality is very high and approaches to a real image, L G Using mathematical expectations to measure characteristics of generating and discriminating sub-networks in infrared image enhancement, where λ×L cout The generation of the sub-network is encouraged to generate more realistic infrared enhanced images, and the log (G (X), X) in the formula encourages the identification of the sub-network to distinguish the infrared enhanced images, and the two are finally balanced dynamically.
2.1.3 related network parameter settings:
type(s) Convolution kernel Convolution step length Filling
conv 4×4 2×2 1×1
conv 3×3 1×1 1×1
conv 3×3 1×1 1×1
deconv 4×4 2×2 1×1
2.2 authentication subnetworks
2.2.1 network formation
The authentication sub-network is composed of 5 convolutional layers conv.
2.2.2 authentication subnetwork design:
(1) the network input image is: generating an enhanced image output by a sub-network and a visible light gray image corresponding to the enhanced image
(2) The network structure is as follows: CL-CBL-CBL-CBL-C-Sigmoid, wherein: sigmoid:c is a convolution layer, B is a batch normalization layer, and L is a leakage ReLU activation function;
(3) the network adopts 5 convolution layers to obtain characteristic images, and uses a Sigmoid function to perform normalization processing to judge the similarity degree [0-1] of the infrared enhanced image and the visible light gray image;
(4) and (3) loss function design:wherein: l (L) D For identifying the loss function of the sub-network, X is an infrared image, Y is a visible light gray scale image, G (X) is an enhanced infrared image obtained by generating the sub-network; the loss function is designed by adopting a cross entropy loss function, and aims to maximize the expected value of the loss function when the identification sub-network encounters real data such as a visible light gray level image, and judge the image as a generated image when the identification sub-network encounters generated data such as an infrared enhanced image, wherein the aim of the identification sub-network is that the generated image is identified in the whole training process, L D Maximum, but for the generation of a sub-network the goal is for the generated image not to be identified, L D Tends to be minimal, and adopts a maximumThe loss function design of the small game mode can enable the network model to have good infrared image enhancement capability.
2.2.3 related network parameter settings:
type(s) Convolution kernel Convolution step length Filling
conv 4×4 2×2 1×1
conv 4×4 2×2 1×1
conv 4×4 2×2 1×1
conv 4×4 1×1 1×1
conv 4×4 1×1 1×1
The image quality data obtained after the infrared image enhancement module based on deep learning is used for enhancing 512 infrared test images are shown in the following table, 512 images are randomly tested, and after the infrared image processing is carried out by the method, the image evaluation indexes PSNT and SSIM are better represented, so that the method has a good infrared image enhancement effect.
PSNR SSIM
512 infrared test images 26.67 0.974
3. Infrared image enhancement model
The infrared image enhancement model is a result obtained after the image dataset is trained by the infrared image enhancement module based on deep learning.
4. Lower die assembly module of die
The model underfilling module is used for underfilling the infrared image enhancement model which is output after the training of the infrared image enhancement module based on the deep learning into the parallel processing module through a remote service bus mode.
5. Infrared and visible light image acquisition device
The infrared and visible light image acquisition device is used for acquiring infrared images and corresponding visible light gray images in real time, the acquired infrared images are used for being sent into the parallel processing module to be subjected to image enhancement processing, and the infrared images and the visible light gray images are also sent to the image data set through the image transmission module and used for dynamically updating the image data set.
6. Parallel processing module
The input of the parallel processing module is an infrared image acquired by an infrared and visible light image acquisition device in real time, and the received infrared image is enhanced by using an infrared image enhancement model arranged in the parallel processing module; in order to improve the real-time processing speed of infrared image enhancement, the parallel processing module adopts a Cuda-based acceleration technology, fully utilizes the GPU computing capability, and simultaneously adopts a multithread parallel technology to improve the processing speed of image enhancement.
7. Image transmission module
The image transmission module is responsible for transmitting visible light gray level images and infrared images acquired by the infrared and visible light image acquisition device to an image data set, when the number of incremental image groups is greater than or equal to a dynamic update threshold value of the image data set, the infrared image enhancement module based on deep learning is triggered to automatically train, automatic update of an infrared image enhancement model installed in a parallel processing module is realized, continuous update of the infrared image enhancement model is realized in a subsequent image enhancement process, the infrared image enhancement effect is continuously optimized, and finally, the infrared image enhancement application of offline modeling-online application-sustainable update is realized.
Fig. 3 is a graph comparing the effects of the original image before and after enhancement with the enhanced image after enhancement, wherein the left side is the original infrared image, the right side is the enhanced infrared image, and the picture content of the right side image is richer, the texture is obvious, the detail is prominent, and the visual effect is better.

Claims (3)

1. An infrared image self-adaptation reinforcing device is characterized in that: the system comprises an image data set, an infrared image enhancement module based on deep learning, an infrared image enhancement model, a model downloading module, an infrared and visible light image acquisition device, an image transmission module and a parallel processing module;
the image dataset is used for providing image samples for model training for the infrared image enhancement module based on the deep learning; the updating mode of the image data set is as follows: when the number of groups of the collected infrared images and the visible light gray images is 10% greater than the number of samples in the current image data set, randomly replacing the corresponding number of samples in the image data set by the collected infrared images and the visible light gray images, marking the images collected by the infrared and visible light image collecting devices for replacement in the replacement process, and when the images are updated and replaced next time, not replacing the marked images; after the whole image data set is updated, deleting marks marked by the current sample image in the image data set, and then entering the next round of dynamic updating according to the same method;
the infrared image enhancement module based on the deep learning is used for the deep learning training to obtain the infrared image enhancement model;
the infrared image enhancement module based on deep learning comprises a generation sub-network G and an identification sub-network D;
the generation sub-network is used for generating an enhanced infrared image; the identification sub-network D is used for judging the authenticity of the visible light gray level image and the enhanced infrared image to obtain a probability value of 0-1, wherein 0 represents that the enhanced infrared image quality is very low and has no credibility, and 1 represents that the enhanced infrared image quality is very high and approaches to the real image;
the network structure of the generation sub-network is as follows:
CR-CBR-CBR-D-Tanh
wherein, C is a convolution layer, R is a ReLu activation function, B is a batch normalization layer, D is a deconvolution layer, and Tanh is:z is a feature image obtained after deconvolution; e is the bottom of natural logarithm;
the loss function of the generated subnetwork is:
L G =E[λ×L cout +log(D(G(X),X)]
wherein:
e is Euclidean distance;
L cont a quality loss function representing an enhanced infrared image and a visible gray scale image, defined by an L2 distance,
y is a visible light gray scale image corresponding to the input infrared image;
g (X) is an enhanced infrared image obtained by utilizing the generation sub-network; d is the output of the authentication sub-network;
the network structure of the authentication sub-network D is:
CL-CBL-CBL-CBL-C-Sigmoid;
wherein: sigmoid:c is a convolution layer, B is a batch normalization layer, and L is a leakage ReLU activation function;
the loss function of the authentication subnetwork D is:
wherein:
x is an infrared image, Y is a visible light gray scale image, and G (X) is an enhanced infrared image obtained by utilizing a generation sub-network;
the model lower loading module is used for downloading the infrared image enhancement model into the parallel processing module in a remote service bus mode;
the infrared and visible light image acquisition device is used for acquiring infrared images and corresponding visible light gray images in real time and transmitting the infrared images and the corresponding visible light gray images to the image transmission module and the parallel processing module;
the parallel processing module is used for carrying out parallel enhancement processing on a plurality of received infrared images, and simultaneously processing a plurality of images by utilizing a multithreading parallel technology, so that the processing speed of image enhancement is improved;
the image transmission module is used for transmitting the visible light gray level images and the infrared images acquired by the infrared and visible light image acquisition device to the image data set, and triggering the automatic training of the infrared image enhancement module when the number of the incremental image groups is greater than or equal to the dynamic update threshold value of the image data set, so as to realize the automatic update of the infrared image enhancement model which is downloaded in the parallel processing module.
2. The infrared image adaptive enhancement device of claim 1, wherein:
the image dataset is constructed according to the following method:
the visible light image is converted into a visible light gray image by gray level conversion, then the gray level image is converted into an infrared image by contrast conversion or histogram conversion, and each obtained infrared image and the corresponding visible light gray level image are taken as one sample of an image data set.
3. A method of adaptive enhancement of an infrared image based on the infrared image adaptive enhancement device according to any one of claims 1 or 2, comprising the steps of:
step 1: constructing an image dataset
Selecting an existing ImageNet data set as a basis for constructing an image data set, converting an image in the ImageNet data set into a visible light gray image by using gray level conversion, converting the gray level image into an infrared image by using contrast conversion or histogram conversion, and taking each obtained infrared image and a visible light gray level image corresponding to the infrared image as one sample of the image data set;
step 2: model training
Constructing an antagonistic neural network, inputting a sample in an image data set into the constructed antagonistic neural network to perform deep learning training on the antagonistic neural network to obtain an infrared image enhancement model;
the generation sub-network of the infrared image enhancement model consists of 3 convolution layers and 1 deconvolution layer, wherein a residual error processing module is used between the convolution layers, and the method specifically comprises the following steps: adding residual connection after the first convolution layer and after the second convolution layer, carrying out residual connection again on the feature vector obtained after weighting treatment and the third convolution layer, enabling the image feature vectors with different dimensions obtained by all convolution layers to flow in all layers in the generation sub-network, and reserving image detail features;
step 3: downloading the infrared image enhancement model obtained in the step 2 into a parallel processing module;
step 4: the infrared and visible light image acquisition device acquires infrared images and visible light images of a target scene in real time, when the number of groups of the acquired infrared images and visible light gray images is 10% greater than the number of samples in the current image data set, randomly replacing the corresponding number of samples in the image data set by the acquired infrared images and visible light gray images, marking the images acquired by the infrared and visible light image acquisition device for replacement in the replacement process, and when the images marked for the next updating and replacement are replaced, not replacing the marked images, and then returning to the step 2; after the whole image data set is updated, deleting marks marked by the current sample image in the image data set, and then entering the next round of dynamic updating according to the same method; simultaneously, copying one part of infrared image acquired in real time and sending the copied part of infrared image to a parallel processing module;
step 5: and simultaneously carrying out parallel enhancement processing on a plurality of infrared images by utilizing an infrared image enhancement model arranged in a parallel processing module, and outputting corresponding enhanced images.
CN202110183673.5A 2021-02-08 2021-02-08 Method and device for adaptively enhancing infrared image Active CN112991200B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110183673.5A CN112991200B (en) 2021-02-08 2021-02-08 Method and device for adaptively enhancing infrared image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110183673.5A CN112991200B (en) 2021-02-08 2021-02-08 Method and device for adaptively enhancing infrared image

Publications (2)

Publication Number Publication Date
CN112991200A CN112991200A (en) 2021-06-18
CN112991200B true CN112991200B (en) 2024-02-02

Family

ID=76393138

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110183673.5A Active CN112991200B (en) 2021-02-08 2021-02-08 Method and device for adaptively enhancing infrared image

Country Status (1)

Country Link
CN (1) CN112991200B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108154523A (en) * 2017-12-25 2018-06-12 北京航空航天大学 A kind of real-time modeling method system and method in airborne photoelectric platform
CN109544487A (en) * 2018-09-30 2019-03-29 西安电子科技大学 A kind of infrared image enhancing method based on convolutional neural networks
CN110084757A (en) * 2019-04-15 2019-08-02 南京信息工程大学 A kind of infrared depth image enhancement method based on generation confrontation network
CN110889813A (en) * 2019-11-15 2020-03-17 安徽大学 Low-light image enhancement method based on infrared information
CN111160307A (en) * 2019-12-31 2020-05-15 帷幄匠心科技(杭州)有限公司 Face recognition method and face recognition card punching system
CN111612725A (en) * 2020-04-29 2020-09-01 中国科学院西安光学精密机械研究所 Image fusion method based on contrast enhancement of visible light image
CN112116573A (en) * 2020-09-16 2020-12-22 四川嘉能佳网创新能源科技有限责任公司 High-precision infrared image anomaly detection method and system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10475174B2 (en) * 2017-04-06 2019-11-12 General Electric Company Visual anomaly detection system
CN108446617B (en) * 2018-03-09 2022-04-22 华南理工大学 Side face interference resistant rapid human face detection method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108154523A (en) * 2017-12-25 2018-06-12 北京航空航天大学 A kind of real-time modeling method system and method in airborne photoelectric platform
CN109544487A (en) * 2018-09-30 2019-03-29 西安电子科技大学 A kind of infrared image enhancing method based on convolutional neural networks
CN110084757A (en) * 2019-04-15 2019-08-02 南京信息工程大学 A kind of infrared depth image enhancement method based on generation confrontation network
CN110889813A (en) * 2019-11-15 2020-03-17 安徽大学 Low-light image enhancement method based on infrared information
CN111160307A (en) * 2019-12-31 2020-05-15 帷幄匠心科技(杭州)有限公司 Face recognition method and face recognition card punching system
CN111612725A (en) * 2020-04-29 2020-09-01 中国科学院西安光学精密机械研究所 Image fusion method based on contrast enhancement of visible light image
CN112116573A (en) * 2020-09-16 2020-12-22 四川嘉能佳网创新能源科技有限责任公司 High-precision infrared image anomaly detection method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Single infrared image enhancement using a deep convolutional neural network;Xiaodong Kuang, et.al;《Neurocomputing》;20190307;摘要,第1-10页 *
基于生成对抗网络的红外图像数据增强;陈佛计等;《计算机应用》;20200331(第07期);第228-232页 *

Also Published As

Publication number Publication date
CN112991200A (en) 2021-06-18

Similar Documents

Publication Publication Date Title
CN113178255B (en) GAN-based medical diagnosis model attack resistance method
CN113592736B (en) Semi-supervised image deblurring method based on fused attention mechanism
CN111292264A (en) Image high dynamic range reconstruction method based on deep learning
CN104809443A (en) Convolutional neural network-based license plate detection method and system
CN111539343B (en) Black smoke vehicle detection method based on convolution attention network
CN111242868B (en) Image enhancement method based on convolutional neural network in scotopic vision environment
CN111696021B (en) Image self-adaptive steganalysis system and method based on significance detection
CN109685772B (en) No-reference stereo image quality evaluation method based on registration distortion representation
CN116342894B (en) GIS infrared feature recognition system and method based on improved YOLOv5
CN110751591A (en) Self-adaptive color image steganography method based on fuzzy inference system
CN113689382A (en) Tumor postoperative life prediction method and system based on medical images and pathological images
Zhou et al. Deep multi-scale features learning for distorted image quality assessment
CN116798070A (en) Cross-mode pedestrian re-recognition method based on spectrum sensing and attention mechanism
CN106355210A (en) Method for expressing infrared image features of insulators on basis of depth neuron response modes
CN113537110A (en) False video detection method fusing intra-frame and inter-frame differences
CN112149526A (en) Lane line detection method and system based on long-distance information fusion
CN112102186A (en) Real-time enhancement method for underwater video image
CN112330562B (en) Heterogeneous remote sensing image transformation method and system
CN110503049B (en) Satellite video vehicle number estimation method based on generation countermeasure network
CN112991200B (en) Method and device for adaptively enhancing infrared image
CN112528788A (en) Re-recognition method based on domain invariant features and space-time features
CN116612355A (en) Training method and device for face fake recognition model, face recognition method and device
CN115170985B (en) Remote sensing image semantic segmentation network and segmentation method based on threshold attention
CN115630361A (en) Attention distillation-based federal learning backdoor defense method
CN112991257B (en) Heterogeneous remote sensing image change rapid detection method based on semi-supervised twin network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant