CN116363009B - Method and system for enhancing rapid light-weight low-illumination image based on supervised learning - Google Patents


Info

Publication number
CN116363009B
CN116363009B CN202310343585.6A
Authority
CN
China
Prior art keywords
image
low
convolution
illumination
illumination image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310343585.6A
Other languages
Chinese (zh)
Other versions
CN116363009A (en
Inventor
成行
张雨
杨国辉
王越
王玉
漆保凌
王春晖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Institute of Technology
Original Assignee
Harbin Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Institute of Technology filed Critical Harbin Institute of Technology
Priority to CN202310343585.6A priority Critical patent/CN116363009B/en
Publication of CN116363009A publication Critical patent/CN116363009A/en
Application granted granted Critical
Publication of CN116363009B publication Critical patent/CN116363009B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/40Image enhancement or restoration by the use of histogram techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/50Image enhancement or restoration by the use of more than one image, e.g. averaging, subtraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/56Extraction of image or video features relating to colour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/778Active pattern-learning, e.g. online learning of image or video features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20112Image segmentation details
    • G06T2207/20132Image cropping

Abstract

A rapid, lightweight low-illumination image enhancement method and system based on supervised learning, belonging to the technical field of digital image processing. The method aims to solve the problems of low processing speed and large parameter counts in existing supervised-learning-based low-illumination image enhancement methods. The invention first extracts the V-channel image I_V of the low-illumination image and obtains its histogram information, then splices the histogram information with an expected average brightness value and extracts global information through a multi-layer perceptron; high-order curve adjustment is applied to I_V to obtain I_V′. The low-illumination image to be enhanced is simultaneously input into a local enhancement network module for processing: the input is first spliced with its corresponding V channel and fed into a first convolution unit; the output of the first convolution unit is spliced with I_V′ and fed into a second convolution unit; the output of the first convolution layer in the second convolution unit is spliced with the output of the second convolution unit and fed into a third convolution unit, which outputs the enhanced image after processing.

Description

Method and system for enhancing rapid light-weight low-illumination image based on supervised learning
Technical Field
The invention belongs to the technical field of digital image processing, and particularly relates to a method and a system for enhancing a low-illumination image.
Background
Images are one of the important sources of human information acquisition, however, when the ambient light is insufficient or the acquisition performance of equipment is limited, the acquired images are often low-illumination images, and the images have the problems of low brightness, low contrast, high noise, large color deviation and the like, so that the acquisition of useful information by people is seriously influenced. The low-illumination image enhancement methods commonly used at present can be divided into two main types, namely a traditional image enhancement method and a method based on deep learning:
Conventional image enhancement methods mainly include histogram equalization, gamma correction, defogging- or Retinex-model-based methods, and improvements on these. While such methods can significantly improve the brightness and contrast of an image, the subsequent separate denoising step or the variation-based joint iterative denoising process is time-consuming, making them unsuitable for real-time low-illumination image enhancement applications.
Image enhancement methods based on deep learning can be divided into two main categories: unsupervised learning and supervised learning. Generally, unsupervised-learning-based methods have faster processing speed and stronger robustness in various environments, but they usually lack corresponding color correction and denoising mechanisms and are therefore more limited for subsequent tasks with high precision requirements. In contrast, supervised-learning-based methods can generally handle the various degradations present in low-illumination images and remarkably improve image quality, but most of them require complex network structures, depend on large numbers of parameters and have low processing speed.
In addition, for problems such as unsuitable brightness, poor contrast and color deviation, image retouching methods similar to image enhancement can be adopted. These methods can operate on individual pixels through basic retouching operations and are therefore very fast and lightweight, but most of them do not take noise into account.
In summary, current low-illumination image enhancement methods mainly have the following problems: 1. traditional low-illumination image enhancement methods take too long and cannot meet the requirement of real-time low-illumination image enhancement; 2. existing unsupervised-learning-based methods cannot simultaneously solve low contrast, poor brightness, color distortion and serious noise; 3. existing supervised-learning-based methods generally have complex network structures, large parameter counts and low processing speed; 4. image retouching methods are lightweight and very fast, but most cannot handle noise. Rapidly, lightly and effectively solving the low contrast, poor brightness, color distortion and serious noise of low-illumination images therefore remains a great challenge in the field.
Disclosure of Invention
The invention aims to solve the problems of low processing speed and large parameter counts in existing supervised-learning-based low-illumination image enhancement methods, which also make it difficult to effectively address the low contrast, poor brightness, color distortion and serious noise of low-illumination images.
A rapid light-weight low-illumination image enhancement method based on supervised learning comprises the following steps:
inputting the low-illumination image I to be enhanced into an image enhancement network, and enhancing the low-illumination image by utilizing the image enhancement network to obtain a final enhanced image E; the low-illumination image I to be enhanced is a matrix of M x N x 3, M is the number of rows, N is the number of columns, and 3 is the three RGB color channels { R, G, B } of the image I;
the process of enhancing the low-illumination image by using the image enhancement network comprises the following steps:
Step 111, extracting the V-channel image I_V of the low-illumination image I in the HSV color space;
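In HSV, the V channel is simply the per-pixel maximum over R, G and B, so this step needs no full colour-space conversion. A minimal numpy sketch (the function name and the [0, 1] value range are illustrative assumptions):

```python
import numpy as np

def extract_v_channel(rgb: np.ndarray) -> np.ndarray:
    """Extract the HSV V channel from an M x N x 3 RGB image in [0, 1].

    In HSV, V = max(R, G, B) per pixel, so no full conversion is needed.
    """
    assert rgb.ndim == 3 and rgb.shape[2] == 3
    return rgb.max(axis=2)

# Example: a 2 x 2 RGB image.
img = np.array([[[0.2, 0.5, 0.1], [0.9, 0.1, 0.3]],
                [[0.0, 0.0, 0.0], [0.4, 0.4, 0.4]]])
v = extract_v_channel(img)  # -> [[0.5, 0.9], [0.0, 0.4]]
```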
Step 112, obtaining V-channel image I V The histogram information is spliced with an expected average brightness value mu and then is input into a global feature extraction module of the network, and the global feature extraction module extracts global information through a multi-layer perceptron:
0,1…t }G(H(I V ),μ)
wherein alpha is 0 、α 1 、……、α t The μ is an average brightness value, which is an element in the output of the multi-layer perceptron and is used for controlling the image enhancement degree; h (I) V ) Representing acquired image I V An operation of a histogram; g (·) represents the global feature extraction module process;
representing elements in the output of the multi-layer perceptron as higher-order curve coefficients;
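Step 112 can be sketched end to end as a normalised histogram spliced with μ and pushed through a small multi-layer perceptron. The bin count, layer widths, ReLU activation and random weights below are illustrative assumptions; the patent does not fix them at this point:

```python
import numpy as np

def mlp_forward(x, weights, biases):
    """Plain fully connected forward pass with ReLU on hidden layers."""
    for i, (w, b) in enumerate(zip(weights, biases)):
        x = x @ w + b
        if i < len(weights) - 1:
            x = np.maximum(x, 0.0)  # ReLU on hidden layers only
    return x

def global_curve_coefficients(i_v, mu, weights, biases, bins=256):
    """H(I_V) spliced with mu, then an MLP -> curve coefficients alpha."""
    hist, _ = np.histogram(i_v, bins=bins, range=(0.0, 1.0))
    hist = hist / i_v.size                    # normalise to a distribution
    feat = np.concatenate([hist, [mu]])       # splice histogram with mu
    return mlp_forward(feat, weights, biases)

rng = np.random.default_rng(0)
t = 8                                   # number of coefficients (assumed)
sizes = [257, 64, 64, t]                # 256-bin histogram + mu -> t alphas
weights = [rng.normal(0, 0.1, (a, b)) for a, b in zip(sizes[:-1], sizes[1:])]
biases = [np.zeros(b) for b in sizes[1:]]

i_v = rng.random((32, 32))
alphas = global_curve_coefficients(i_v, mu=0.6, weights=weights, biases=biases)
assert alphas.shape == (t,)
```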
Step 113, performing high-order curve adjustment on the input low-illumination V-channel image to obtain a global brightness adjustment scheme, and finally outputting the global brightness adjustment image I_V′ of I_V:

I_V^(k) = I_V^(k−1) + α_(k−1) · I_V^(k−1) · (1 − I_V^(k−1))

where k represents the iteration number, I_V^(0) = I_V is the input low-illumination V-channel image, and α_(k−1) ∈ {α_0, α_1, …, α_t}; for example, when k = 7, I_V^(7) = I_V^(6) + α_6 · I_V^(6) · (1 − I_V^(6));
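The high-order curve adjustment can be sketched with the standard quadratic brightness curve I_V^(k) = I_V^(k−1) + α_(k−1)·I_V^(k−1)·(1 − I_V^(k−1)), applied once per coefficient; treating this as the curve form is an assumption consistent with the coefficients α and iteration count described here, not a quotation of the patent's figure:

```python
import numpy as np

def high_order_curve(i_v: np.ndarray, alphas) -> np.ndarray:
    """Iteratively apply the quadratic brightness curve, one coefficient
    per iteration; values stay in [0, 1] when alphas lie in [-1, 1]."""
    out = i_v.astype(float)
    for a in alphas:
        out = out + a * out * (1.0 - out)
    return out

i_v = np.array([[0.1, 0.4], [0.7, 0.9]])
brightened = high_order_curve(i_v, alphas=[0.8] * 7)  # k = 7 iterations
assert brightened.shape == i_v.shape
assert (brightened >= i_v - 1e-12).all()  # positive alphas only brighten
assert brightened.max() <= 1.0 + 1e-12    # curve keeps values in range
```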
the low-illumination image to be enhanced is input into a local enhancement network module for processing; the local enhancement network module comprises three convolution units, namely a first convolution unit to a third convolution unit, wherein each convolution unit comprises a plurality of convolution layers;
The local enhancement network module first performs a splicing operation on the input and its corresponding V channel, then feeds the result into the first convolution unit; the output of the first convolution unit is spliced with the global brightness adjustment image I_V′ and input into the second convolution unit; the output of the first convolution layer in the second convolution unit and the final output of the second convolution unit are spliced and input into the third convolution unit; the third convolution unit is connected to the output layer, and the output of the output layer is the enhanced image.
Further, the first to third convolution units each include two convolution layers.
Further, the convolution layers in the first to third convolution units are all 3×3 convolution layers.
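The data flow of the local enhancement module (splice input with V; splice unit 1 with I_V′; skip connection from the first layer of unit 2 into unit 3) can be sketched in numpy. Channel widths, weight initialisation and using a final 3×3 layer as the output layer are assumptions; the patent fixes only the 3×3 kernels and the splicing pattern:

```python
import numpy as np

def conv3x3(x, w, b, relu=True):
    """3x3 'same' convolution; x: (H, W, Cin), w: (3, 3, Cin, Cout)."""
    h, wd, _ = x.shape
    xp = np.pad(x, ((1, 1), (1, 1), (0, 0)))
    out = np.zeros((h, wd, w.shape[-1]))
    for dy in range(3):
        for dx in range(3):
            out += xp[dy:dy + h, dx:dx + wd, :] @ w[dy, dx]
    out += b
    return np.maximum(out, 0.0) if relu else out

def make_layer(rng, cin, cout):
    return rng.normal(0.0, 0.05, (3, 3, cin, cout)), np.zeros(cout)

def local_enhance(rgb, v, v_adj, p):
    """concat(I, V) -> unit 1; concat(unit 1, V') -> unit 2;
    concat(unit-2 layer 1, unit 2) -> unit 3 -> output layer."""
    x = np.concatenate([rgb, v[..., None]], axis=2)        # splice V channel
    u1 = conv3x3(conv3x3(x, *p["u1a"]), *p["u1b"])
    x = np.concatenate([u1, v_adj[..., None]], axis=2)     # splice I_V'
    u2a = conv3x3(x, *p["u2a"])                            # first layer, unit 2
    u2 = conv3x3(u2a, *p["u2b"])
    x = np.concatenate([u2a, u2], axis=2)                  # skip connection
    u3 = conv3x3(conv3x3(x, *p["u3a"]), *p["u3b"])
    return conv3x3(u3, *p["out"], relu=False)              # enhanced image E

rng = np.random.default_rng(0)
c = 16  # feature width (assumed; the patent does not fix channel counts)
p = {"u1a": make_layer(rng, 4, c),     "u1b": make_layer(rng, c, c),
     "u2a": make_layer(rng, c + 1, c), "u2b": make_layer(rng, c, c),
     "u3a": make_layer(rng, 2 * c, c), "u3b": make_layer(rng, c, c),
     "out": make_layer(rng, c, 3)}

rgb = rng.random((8, 8, 3))
v = rgb.max(axis=2)
e = local_enhance(rgb, v, v, p)  # v_adj = v here, just for shape checking
assert e.shape == (8, 8, 3)
```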
Further, the training process of the image enhancement network comprises the following steps:
taking the collected low-illumination image and the corresponding normal-illumination image as an image pair, and constructing a training data set, wherein the low-illumination image is taken as an input, and the corresponding normal-illumination image is taken as a reference image;
training the image enhancement network by using a training data set, wherein the specific training process comprises the following steps:
Cutting the low-illumination image and the normal-illumination image in each image pair into blocks in the same manner, the size of each cropped image block being M′ × N′; training uses randomly cropped low-illumination image blocks, so the actual training input is M′ × N′ × 3 × n, where n is the number of image blocks input per training step;
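Cropping both images of a pair at identical positions keeps each low-illumination block aligned with its ground-truth reference. A sketch with assumed function names (the embodiment uses 100 × 100 blocks):

```python
import numpy as np

def paired_random_crops(low, ref, block=100, n=4, seed=0):
    """Crop n blocks at identical positions from a low/normal pair,
    so every low-illumination block keeps its reference block."""
    assert low.shape == ref.shape
    rng = np.random.default_rng(seed)
    h, w, _ = low.shape
    lows, refs = [], []
    for _ in range(n):
        y = rng.integers(0, h - block + 1)
        x = rng.integers(0, w - block + 1)
        lows.append(low[y:y + block, x:x + block])
        refs.append(ref[y:y + block, x:x + block])
    return np.stack(lows), np.stack(refs)

low = np.zeros((240, 320, 3))
ref = np.ones((240, 320, 3))
lb, rb = paired_random_crops(low, ref, block=100, n=4)
assert lb.shape == (4, 100, 100, 3) and rb.shape == (4, 100, 100, 3)
```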
Before inputting image blocks into the image enhancement network, the corresponding V-channel image is obtained for each low-illumination image and its histogram information is computed; meanwhile, the average brightness value of the normal-illumination image corresponding to the low-illumination image is taken as the expected average brightness value μ; global information is extracted through the multi-layer perceptron, and the elements of its output serve as the high-order curve coefficients;
Since the training process inputs a plurality of image blocks at a time, for each image block the V-channel image I_V is extracted and then adjusted using the high-order curve coefficients of the whole image to which the current block belongs, obtaining the global brightness adjustment image I_V′;
Training of the image enhancement network is performed on the image blocks in the training data set, yielding the trained image enhancement network.
Further, the loss function of the image enhancement network during training is:
L_ALL = L_1 + L_SSIM + L_color + L_brightness + L_structure
wherein the first term L_1 represents the mean absolute error; the second term L_SSIM is the SSIM loss between the enhanced image and the reference image; the third term L_color is the color loss; the fourth term L_brightness is the brightness loss; and the fifth term L_structure is the structure loss;
The color loss function L_color is:

L_color = (1/(mn)) Σ_{i=1..m} Σ_{j=1..n} (1 − ⟨T(I)_(i,j), Y_(i,j)⟩)

wherein T represents the processing of the image enhancement network; I_(i,j) and Y_(i,j) are the row-i, column-j pixels of the low-illumination image and the reference normal-illumination image, respectively; m and n are the numbers of pixels per column and per row of the input image; ⟨·,·⟩ denotes the cosine similarity of two vectors;
The brightness loss function L_brightness is:

L_brightness = (1/(3mn)) Σ_{c∈{R,G,B}} Σ_{i=1..m} Σ_{j=1..n} | b(T(I)_c^(i,j)) − b(Y_c^(i,j)) |

wherein c takes R, G, B, corresponding to the three red, green and blue color channels of the RGB color space; b(T(I)_c^(i,j)) and b(Y_c^(i,j)) denote the pixels T(I)_c^(i,j) and Y_c^(i,j) of the image block centered by subtracting its minimum value; the purpose of subtracting the minimum value is to eliminate the effect of a constant in b(·);
The structure loss function L_structure is:

L_structure = (1/(mn)) Σ_{i=1..m} Σ_{j=1..n} ‖ ∇T(I)_(i,j) − ∇Y_(i,j) ‖

wherein ∇ is the gradient operator.
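The color, brightness and structure terms can be sketched in numpy as follows. The exact normalisations are assumptions based on the descriptions above (cosine similarity of pixel vectors, channel-wise centering by the minimum, gradient differences), and the L_1 and SSIM terms are omitted for brevity:

```python
import numpy as np

def color_loss(pred, ref, eps=1e-8):
    """Mean (1 - cosine similarity) between per-pixel RGB vectors."""
    num = (pred * ref).sum(axis=2)
    den = np.linalg.norm(pred, axis=2) * np.linalg.norm(ref, axis=2) + eps
    return float(np.mean(1.0 - num / den))

def brightness_loss(pred, ref):
    """L1 distance between per-channel images centered by their minima."""
    b = lambda x: x - x.min(axis=(0, 1), keepdims=True)  # subtract channel min
    return float(np.mean(np.abs(b(pred) - b(ref))))

def structure_loss(pred, ref):
    """L1 distance between forward-difference gradients of the images."""
    gx = np.abs(np.diff(pred, axis=1) - np.diff(ref, axis=1)).mean()
    gy = np.abs(np.diff(pred, axis=0) - np.diff(ref, axis=0)).mean()
    return float(gx + gy)

def total_loss(pred, ref):
    """L_color + L_brightness + L_structure (L_1 and SSIM omitted here)."""
    return color_loss(pred, ref) + brightness_loss(pred, ref) + structure_loss(pred, ref)

rng = np.random.default_rng(1)
x = rng.random((16, 16, 3)) + 0.1  # keep pixels away from zero vectors
assert color_loss(x, x) < 1e-5     # identical images: near-zero color loss
assert brightness_loss(x, x) == 0.0
assert structure_loss(x, x) == 0.0
```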
A rapid light-weight low-illumination image enhancement system based on supervised learning comprises an image acquisition unit and an image enhancement unit;
an image acquisition unit: the method comprises the steps of receiving a low-illumination image to be enhanced, and processing the image I into a matrix of M x N x 3, wherein M is the number of rows, N is the number of columns, and 3 is three RGB color channels { R, G, B } of the image I;
an image enhancement unit: the method comprises the steps of calling an image enhancement network to enhance a low-illumination image to obtain a final enhanced image E; the process of enhancing the low-illumination image by the image enhancement network comprises the following steps:
Step 111, extracting the V-channel image I_V of the low-illumination image I in the HSV color space;
Step 112, obtaining the histogram information of the V-channel image I_V; the histogram information is spliced with an expected average brightness value μ and then input into the global feature extraction module of the network, which extracts global information through a multi-layer perceptron:

{α_0, α_1, …, α_t} = G(H(I_V), μ)

wherein α_0, α_1, …, α_t are elements in the output of the multi-layer perceptron and serve as the high-order curve coefficients; μ is the expected average brightness value, used to control the degree of image enhancement; H(I_V) represents the operation of acquiring the histogram of image I_V; G(·) represents the processing of the global feature extraction module;
Step 113, performing high-order curve adjustment on the input low-illumination V-channel image to obtain a global brightness adjustment scheme, and finally outputting the global brightness adjustment image I_V′ of I_V:

I_V^(k) = I_V^(k−1) + α_(k−1) · I_V^(k−1) · (1 − I_V^(k−1))

where k represents the iteration number, I_V^(0) = I_V is the input low-illumination V-channel image, and α_(k−1) ∈ {α_0, α_1, …, α_t}; for example, when k = 7, I_V^(7) = I_V^(6) + α_6 · I_V^(6) · (1 − I_V^(6));
the low-illumination image to be enhanced is input into a local enhancement network module for processing; the local enhancement network module comprises three convolution units, namely a first convolution unit to a third convolution unit, wherein each convolution unit comprises a plurality of convolution layers;
The local enhancement network module first performs a splicing operation on the input and its corresponding V channel, then feeds the result into the first convolution unit; the output of the first convolution unit is spliced with the global brightness adjustment image I_V′ and input into the second convolution unit; the output of the first convolution layer in the second convolution unit and the final output of the second convolution unit are spliced and input into the third convolution unit; the third convolution unit is connected to the output layer, and the output of the output layer is the enhanced image.
Further, the first to third convolution units each include two convolution layers.
Further, the convolution layers in the first to third convolution units are all 3×3 convolution layers.
A computer storage medium, wherein at least one instruction is stored, the at least one instruction being loaded and executed by a processor to implement the rapid lightweight low-illumination image enhancement system based on supervised learning.
A rapid lightweight low-illumination image enhancement device based on supervised learning, the device comprising a processor and a memory, the memory storing at least one instruction, the at least one instruction being loaded and executed by the processor to implement the rapid lightweight low-illumination image enhancement system based on supervised learning.
The beneficial effects are that:
By introducing the designed rapid lightweight image enhancement network to solve the typical degradations of low-illumination images, the invention can not only process with a smaller number of parameters but also significantly reduce network complexity, and thus achieves a faster processing speed; at the same time it restores the brightness, contrast, color and structural information of the image and removes noise. Compared with the current most advanced low-illumination image enhancement methods, the invention has excellent comprehensive performance.
Drawings
Fig. 1 is a schematic diagram of a low-light image enhancement flow.
Fig. 2 is a schematic diagram of a fast lightweight low-light image enhancement network architecture.
FIG. 3 is an original low-light image effect before enhancement in an embodiment.
Fig. 4 is an enhanced image effect in an embodiment.
Fig. 5 is a reference normal illuminance image effect in the embodiment.
Detailed Description
The invention provides a rapid and lightweight low-illumination image enhancement network, which comprises a global feature extraction module and a local enhancement network module, constructs a loss function based on relative information and is used for end-to-end training of low-illumination images so as to improve the overall performance of the images. The invention is further described in connection with the following detailed description.
The first embodiment is as follows: this embodiment will be described with reference to fig. 1.
The embodiment is a rapid light-weight low-illumination image enhancement method based on supervised learning, comprising the following steps:
step one, designing a rapid lightweight low-illumination image enhancement network;
as shown in fig. 2, the processing procedure of the fast lightweight low-light image enhancement network is as follows:
For the low-illumination image to be enhanced, first extract the V-channel image I_V of its HSV color space and obtain the histogram information of the V-channel image; then splice the histogram information with the expected average brightness value and input the result into the global feature extraction module of the network, which is implemented as a five-layer multi-layer perceptron; a global brightness adjustment scheme is then generated by the high-order curve adjustment method to obtain the globally brightness-adjusted image of the V-channel image;
the specific process comprises the following steps:
Step 111, extracting the V-channel image I_V of the low-illumination image I in the HSV color space;
Step 112, obtaining V-channel image I V The histogram information is spliced with an expected average brightness value mu and then is input into a global feature extraction module of the network, and the global feature extraction module extracts global information through a five-layer multi-layer perceptron:
0,1…t }G(H(I V ),μ)
wherein alpha is 0 、α 1 、……、α t The μ is an average brightness value, which is an element in the output of the multi-layer perceptron and is used for controlling the image enhancement degree; h (I) V ) Representing acquired image I V An operation of a histogram; g (·) represents the global feature extraction module process;
representing elements in the output of the multi-layer perceptron as higher-order curve coefficients;
the average luminance value μ is an expected average luminance value when the enhancement is actually performed, and is set according to the expectation.
Step 113, performing high-order curve adjustment on the input low-illumination V-channel image to obtain a global brightness adjustment scheme, and finally outputting the global brightness adjustment image I_V′ of I_V:

I_V^(k+1) = I_V^(k) + α_k · I_V^(k) · (1 − I_V^(k))

wherein k = 0, 1, …, t; I_V^(0) = I_V represents the input low-illumination V-channel image, and I_V^(k) represents the image after k iterations;
The low-illumination image to be enhanced is input into the local enhancement network module for processing; the local enhancement network module first splices the input with its corresponding V channel in the HSV color space and then feeds the result into the first convolution layer, which is connected to the second convolution layer; both are 3×3 convolution layers;
The output of the second convolution layer and the global brightness adjustment image I_V′ are spliced and input into the third convolution layer, which is connected to the fourth convolution layer; both are 3×3 convolution layers;
The output of the third convolution layer and the output of the fourth convolution layer are spliced and input into the fifth convolution layer, which is connected to the sixth convolution layer; both are 3×3 convolution layers;
The sixth convolution layer is connected to the output layer, which outputs the enhanced image E.
And splicing and subsequent processing are carried out on the global brightness adjustment image output by the global feature extraction module and the output of the second convolution layer of the local enhancement network module, so that local enhancement can be effectively realized.
Step two, acquiring any number of image pairs formed by low-illumination images and normal-illumination images to construct a training data set, wherein the input of an image enhancement network is only n low-illumination images, and the corresponding normal-illumination images are used as reference images;
training the rapid lightweight low-illumination image enhancement network by using a training data set, wherein the training process comprises the following steps of:
Step 311, cutting the low-illumination image and the normal-illumination image in each image pair into blocks in the same manner, the size of each cropped block being M′ × N′, where M′ and N′ can be set arbitrarily (in some embodiments the block size is 100 × 100); training uses randomly cropped low-illumination image blocks, so the actual training input is M′ × N′ × 3 × n, where n is the number of image blocks input per training step (in some embodiments 171 blocks are randomly selected per training step);
It should be noted that before inputting image blocks into the rapid lightweight low-illumination image enhancement network, the corresponding V-channel image is obtained for each low-illumination image and its histogram information is computed; meanwhile, the average brightness value of the normal-illumination image corresponding to each low-illumination image is taken as the expected average brightness value μ (during training, μ is determined from the normal-illumination image paired with each low-illumination image); global information is extracted through the multi-layer perceptron, and the elements of its output serve as the high-order curve coefficients;
Since the training process inputs a plurality of image blocks at a time, for each image block the V-channel image I_V is extracted and then adjusted using the high-order curve coefficients of the whole image to which the current block belongs, obtaining the global brightness adjustment image I_V′;
For the first input of the training process, the global brightness adjustment image I_V′ is spliced with the output of the second convolution layer of the local enhancement network module and further processed, realizing local enhancement and producing the preliminarily enhanced image E′;
step 312, comparing the preliminary enhanced image E' with the normal illuminance reference image Y paired with the input low illuminance image, and constructing a loss function based on the relative information to enhance the generalization ability of the neural network, wherein the loss function is as follows:
L_ALL = L_1 + L_SSIM + L_color + L_brightness + L_structure
wherein the first term L_1 represents the mean absolute error; the second term L_SSIM is the SSIM loss between the enhanced image and the reference image; the third term L_color is the color loss; the fourth term L_brightness is the brightness loss; and the fifth term L_structure is the structure loss.
The color loss function L_color is:

L_color = (1/(mn)) Σ_{i=1..m} Σ_{j=1..n} (1 − ⟨T(I)_(i,j), Y_(i,j)⟩)

wherein T represents the processing of the rapid lightweight low-illumination image enhancement network; I_(i,j) and Y_(i,j) are the row-i, column-j pixels of the low-illumination image and the reference normal-illumination image, respectively; m and n are the numbers of pixels per column and per row of the input image; ⟨·,·⟩ denotes the cosine similarity of two vectors.
The brightness loss function L_brightness is:

L_brightness = (1/(3mn)) Σ_{c∈{R,G,B}} Σ_{i=1..m} Σ_{j=1..n} | b(T(I)_c^(i,j)) − b(Y_c^(i,j)) |

wherein c takes R, G, B, corresponding to the three red, green and blue color channels of the RGB color space; b(T(I)_c^(i,j)) and b(Y_c^(i,j)) denote the pixels T(I)_c^(i,j) and Y_c^(i,j) of the image block centered by subtracting its minimum value; the purpose of subtracting the minimum value is to eliminate the effect of a constant in b(·).
The structure loss function L_structure is:

L_structure = (1/(mn)) Σ_{i=1..m} Σ_{j=1..n} ‖ ∇T(I)_(i,j) − ∇Y_(i,j) ‖

wherein ∇ is the gradient operator.
Step 313, training the network using Adam random optimization algorithm, and setting the update rate to 0.0001.
Based on the image blocks in the training data set, training the image enhancement network according to the loss function to obtain a trained image enhancement network, and taking the output of the trained image enhancement network as an enhanced image E.
Step five, inputting the low-illumination image to be enhanced into the trained image enhancement network; the network output is the enhanced image.
The low-illumination image I to be enhanced is a matrix of M x N x 3, M is the number of rows, N is the number of columns, and 3 is the three RGB color channels { R, G, B } of the image I; the final enhanced image E obtained by enhancing the image I is a matrix of m×n×3.
The second embodiment is as follows:
the embodiment is a rapid light-weight low-illumination image enhancement system based on supervised learning, which is characterized by comprising an image acquisition unit and an image enhancement unit;
an image acquisition unit: the method comprises the steps of receiving a low-illumination image to be enhanced, and processing the image I into a matrix of M x N x 3, wherein M is the number of rows, N is the number of columns, and 3 is three RGB color channels { R, G, B } of the image I;
an image enhancement unit: the method comprises the steps of calling an image enhancement network to enhance a low-illumination image to obtain a final enhanced image E; the process of enhancing the low-illumination image by the image enhancement network comprises the following steps:
step 111, extracting the V-channel image I_V of the low-illumination image I in the HSV color space;
Step 112, obtaining the histogram information of the V-channel image I_V, splicing it with an expected average brightness value μ, and inputting the result into the global feature extraction module of the network; the global feature extraction module extracts global information through a multi-layer perceptron:
{α_{0,1…t}} = G(H(I_V), μ)
wherein α_0, α_1, ……, α_t are elements in the output of the multi-layer perceptron; μ is the expected average brightness value, used to control the degree of image enhancement; H(I_V) denotes the operation of obtaining the histogram of image I_V; G(·) denotes the processing of the global feature extraction module;
the elements in the output of the multi-layer perceptron are taken as the higher-order curve coefficients;
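As a hedged sketch of step 112 only: the histogram of the V channel is spliced with the desired mean brightness μ and mapped by a small multi-layer perceptron to t+1 curve coefficients. The bin count, layer width, tanh activation, and random (untrained) weights below are placeholders not specified in this text:

```python
import numpy as np

def global_features(v, mu, t=7, rng=None):
    # H(I_V): normalized 64-bin histogram of the V channel (assumed bin count).
    rng = np.random.default_rng(0) if rng is None else rng
    hist, _ = np.histogram(v, bins=64, range=(0.0, 1.0))
    feat = np.concatenate([hist / v.size, [mu]])       # splice histogram with mu
    # G(.): a tiny two-layer perceptron with placeholder (untrained) weights.
    w1 = rng.standard_normal((feat.size, 32)) * 0.1
    w2 = rng.standard_normal((32, t + 1)) * 0.1
    return np.tanh(feat @ w1) @ w2                     # alpha_0 ... alpha_t

v = np.random.default_rng(1).random((16, 16))          # toy V-channel image
alphas = global_features(v, mu=0.6)
assert alphas.shape == (8,)                            # t + 1 = 8 coefficients
```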
step 113, performing high-order curve adjustment on the input low-illumination image V channel image to obtain a global brightness adjustment scheme, and finally outputting I V Global brightness adjustment image I of (2) V ′:
Where k represents the number of iterations, for example:representing an input low-intensity image V-channel image I V When k=7, the number of the cells,α k-1 ∈{α 0,1…t };
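The iterative adjustment of step 113 can be sketched as below. The quadratic update rule is the Zero-DCE-style curve from the patent's cited literature; since the formula image is not reproduced in this text, the exact form and the use of scalar per-iteration coefficients are assumptions:

```python
import numpy as np

def curve_adjust(v, alphas):
    # v_k = v_{k-1} + alpha_{k-1} * v_{k-1} * (1 - v_{k-1}); maps [0,1] -> [0,1]
    # for alpha in [0,1], so brightness rises without clipping.
    out = v.copy()
    for a in alphas:
        out = out + a * out * (1.0 - out)
    return out

v = np.full((4, 4), 0.2)                    # dark V channel
v_adj = curve_adjust(v, alphas=[0.5] * 7)   # k = 7 iterations, as stated above
assert np.all(v_adj > v)                    # brightened everywhere
assert np.all(v_adj <= 1.0)                 # still a valid V channel
```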
the low-illumination image to be enhanced is input into a local enhancement network module for processing; the local enhancement network module comprises three convolution units, namely a first convolution unit to a third convolution unit, each convolution unit comprising a plurality of convolution layers;
the local enhancement network module first performs a splicing operation on the input and its corresponding V channel and inputs the result into the first convolution unit; the output of the first convolution unit and the global brightness adjustment image I_V′ are spliced and input into the second convolution unit; the output of the first convolution layer in the second convolution unit and the final output of the second convolution unit are input into the third convolution unit; the third convolution unit is connected to the output layer, and the output of the output layer is the enhanced image.
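The splice-and-convolve topology just described can be traced with the toy forward pass below. Only the wiring follows the text; the 16-channel width, ReLU activations, two layers per unit, and random untrained weights are all assumptions for illustration:

```python
import numpy as np

def conv3x3(x, out_ch, rng):
    # Toy 3x3 'same' convolution with random placeholder weights + ReLU.
    h, w, in_ch = x.shape
    k = rng.standard_normal((3, 3, in_ch, out_ch)) * 0.1
    pad = np.pad(x, ((1, 1), (1, 1), (0, 0)))
    out = np.zeros((h, w, out_ch))
    for i in range(3):
        for j in range(3):
            out += np.einsum('hwc,co->hwo', pad[i:i + h, j:j + w, :], k[i, j])
    return np.maximum(out, 0.0)

rng = np.random.default_rng(0)
I = rng.random((16, 16, 3))                 # low-illumination RGB input
V = I.max(axis=-1, keepdims=True)           # HSV V channel = max(R, G, B)
I_v_adj = V                                 # stand-in for the curve-adjusted V

x1 = np.concatenate([I, V], axis=-1)        # input spliced with its V channel
u1 = conv3x3(conv3x3(x1, 16, rng), 16, rng)          # first convolution unit
x2 = np.concatenate([u1, I_v_adj], axis=-1)          # splice in I_V'
u2a = conv3x3(x2, 16, rng)                           # 1st layer of second unit
u2 = conv3x3(u2a, 16, rng)                           # final output of second unit
x3 = np.concatenate([u2a, u2], axis=-1)              # both feed the third unit
u3 = conv3x3(conv3x3(x3, 16, rng), 16, rng)          # third convolution unit
E = conv3x3(u3, 3, rng)                              # output layer -> enhanced image
assert E.shape == (16, 16, 3)
```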
Preferably, each of the first to third convolution units includes two convolution layers.
Preferably, the convolution layers in the first to third convolution units are all 3×3 convolution layers.
The third embodiment is as follows:
the embodiment is a computer storage medium having at least one instruction stored therein, the at least one instruction being loaded and executed by a processor to implement the supervised learning based rapid lightweight low illumination image enhancement system.
It should be understood that the instructions comprise a computer program product, software, or computerized method corresponding to any of the methods described herein; the instructions may be used to program a computer system or other electronic device. Computer storage media may include readable media having instructions stored thereon, and may include, but are not limited to, magnetic storage media, optical storage media, magneto-optical storage media, read-only memory (ROM), random-access memory (RAM), erasable programmable memory (e.g., EPROM and EEPROM), flash memory, or other types of media suitable for storing electronic instructions.
The fourth embodiment is as follows:
the embodiment is a rapid light-weight low-illumination image enhancement device based on supervised learning, the device comprises a processor and a memory, and it should be understood that any device comprising the processor and the memory described by the invention can also comprise other units and modules for displaying, interacting, processing, controlling and other functions through signals or instructions;
the memory stores at least one instruction that is loaded and executed by the processor to implement the supervised learning based fast lightweight low illumination image enhancement system.
Examples
The low-light image enhancement experiment was performed according to the method of the first embodiment.
Embodiments of the present invention use pre-prepared datasets. The original low-luminance image before enhancement is shown in fig. 3, the effect after enhancement is shown in fig. 4, and the reference normal-luminance image is shown in fig. 5.
From the experimental results, it can be seen that the rapid light-weight low-illumination image enhancement network based on supervised learning can remarkably enhance the brightness and contrast of images while retaining image detail and color information. In addition, the network has a simple structure, a small number of parameters, and a high processing speed in the experiments. In conclusion, the rapid light-weight low-illumination image enhancement method based on supervised learning is fast and light-weight and achieves a remarkable low-illumination enhancement effect.
The above examples of the present invention are only for describing the calculation model and calculation flow of the present invention in detail, and are not limiting of the embodiments of the present invention. Other variations and modifications of the above description will be apparent to those of ordinary skill in the art, and it is not intended to be exhaustive of all embodiments, all of which are within the scope of the invention.

Claims (10)

1. A rapid light-weight low-illumination image enhancement method based on supervised learning is characterized by comprising the following steps:
inputting the low-illumination image I to be enhanced into an image enhancement network, and enhancing the low-illumination image by utilizing the image enhancement network to obtain a final enhanced image E; the low-illumination image I to be enhanced is a matrix of M x N x 3, M is the number of rows, N is the number of columns, and 3 is the three RGB color channels { R, G, B } of the image I;
the process of enhancing the low-illumination image by using the image enhancement network comprises the following steps:
step 111, extracting the V-channel image I_V of the low-illumination image I in the HSV color space;
Step 112, obtaining the histogram information of the V-channel image I_V, splicing it with an expected average brightness value μ, and inputting the result into the global feature extraction module of the network; the global feature extraction module extracts global information through a multi-layer perceptron:
{α_{0,1...t}} = G(H(I_V), μ)
wherein α_0, α_1, ……, α_t are elements in the output of the multi-layer perceptron; μ is the expected average brightness value, used to control the degree of image enhancement; H(I_V) denotes the operation of obtaining the histogram of image I_V; G(·) denotes the processing of the global feature extraction module;
the elements in the output of the multi-layer perceptron are taken as the higher-order curve coefficients;
step 113, performing high-order curve adjustment on the V-channel image of the input low-illumination image to obtain a global brightness adjustment scheme, and finally outputting the global brightness adjustment image I_V′ of I_V:
I_V^k = I_V^{k−1} + α_{k−1} · I_V^{k−1} · (1 − I_V^{k−1})
wherein k denotes the iteration index; I_V^0 denotes the input low-illumination V-channel image I_V; k = 7; α_{k−1} ∈ {α_{0,1...t}};
The low-illumination image to be enhanced is input into a local enhancement network module for processing; the local enhancement network module comprises three convolution units, namely a first convolution unit to a third convolution unit, each convolution unit comprising a plurality of convolution layers;
the local enhancement network module first performs a splicing operation on the input and its corresponding V channel and inputs the result into the first convolution unit; the output of the first convolution unit and the global brightness adjustment image I_V′ are spliced and input into the second convolution unit; the output of the first convolution layer in the second convolution unit and the final output of the second convolution unit are input into the third convolution unit; the third convolution unit is connected to the output layer, and the output of the output layer is the enhanced image.
2. The method of claim 1, wherein the first to third convolution units each include two convolution layers.
3. The method of claim 2, wherein the convolution layers in the first to third convolution units are all 3×3 convolution layers.
4. A method for rapid lightweight low-light image enhancement based on supervised learning as set forth in claim 1, 2, or 3, wherein the training process of the image enhancement network comprises the steps of:
taking the collected low-illumination image and the corresponding normal-illumination image as an image pair, and constructing a training data set, wherein the low-illumination image is taken as an input, and the corresponding normal-illumination image is taken as a reference image;
training the image enhancement network by using a training data set, wherein the specific training process comprises the following steps:
cutting the low-illumination image and the normal-illumination image in each image pair into blocks in the same manner, the cut image blocks having size M′ × N′; training is performed using randomly cut low-illumination image blocks, the input in actual training being M′ × N′ × 3 × n, where n is the number of image blocks input in each training step;
before inputting an image block into an image enhancement network, respectively obtaining corresponding V-channel images for each low-illumination image, and obtaining histogram information of the V-channel images; meanwhile, taking the average brightness value of the normal illumination image corresponding to the low illumination image as an expected average brightness value mu; extracting global information through a multi-layer perceptron; representing elements in the output of the multi-layer perceptron as higher-order curve coefficients;
since the training process inputs a plurality of image blocks at a time, for each image block, a V-channel image I is extracted V Then, the high-order curve coefficient of the whole image corresponding to the current image block is used for adjustment to obtain a global brightness adjustment image I V′
Training of the image enhancement network is achieved based on the image blocks in the training data set, and the trained image enhancement network is obtained.
5. The method for enhancing a rapid light-weight low-illumination image based on supervised learning as recited in claim 4, wherein the loss function of the image enhancement network in the training process is as follows:
L_ALL = L_1 + L_SSIM + L_color + L_brightness + L_structure
wherein the first term L_1 represents the mean absolute error; the second term L_SSIM is the SSIM loss between the enhanced image and the reference image; the third term L_color is the color loss; the fourth term L_brightness is the brightness loss; and the fifth term L_structure is the structure loss;
the color loss function L_color is:
wherein T represents the processing of the image enhancement network; I_(i,j) and Y_(i,j) are the pixels in row i, column j of the low-illumination image and the reference normal-illumination image, respectively; m and n denote that each row and each column of the input image has m and n pixels; <·,·> denotes the cosine similarity of two vectors;
the brightness loss function L_brightness is:
wherein c takes values R, G, B, corresponding to the three red, green, and blue color channels of the RGB color space; b(T(I^c_(i,j))) and b(Y^c_(i,j)) denote the image blocks centered at pixels T(I^c_(i,j)) and Y^c_(i,j), respectively, each with its minimum value subtracted; the purpose of subtracting the minimum value is to eliminate the effect of the constant in b(·);
the structure loss function L_structure is:
wherein ∇ is the gradient operator.
6. A rapid light-weight low-illumination image enhancement system based on supervised learning is characterized by comprising an image acquisition unit and an image enhancement unit;
an image acquisition unit: the method comprises the steps of receiving a low-illumination image to be enhanced, and processing the image I into a matrix of M x N x 3, wherein M is the number of rows, N is the number of columns, and 3 is three RGB color channels { R, G, B } of the image I;
an image enhancement unit: the method comprises the steps of calling an image enhancement network to enhance a low-illumination image to obtain a final enhanced image E; the process of enhancing the low-illumination image by the image enhancement network comprises the following steps:
step 111, extracting the V-channel image I_V of the low-illumination image I in the HSV color space;
Step 112, obtaining the histogram information of the V-channel image I_V, splicing it with an expected average brightness value μ, and inputting the result into the global feature extraction module of the network; the global feature extraction module extracts global information through a multi-layer perceptron:
{α_{0,1...t}} = G(H(I_V), μ)
wherein α_0, α_1, ……, α_t are elements in the output of the multi-layer perceptron; μ is the expected average brightness value, used to control the degree of image enhancement; H(I_V) denotes the operation of obtaining the histogram of image I_V; G(·) denotes the processing of the global feature extraction module;
the elements in the output of the multi-layer perceptron are taken as the higher-order curve coefficients;
step 113, performing high-order curve adjustment on the V-channel image of the input low-illumination image to obtain a global brightness adjustment scheme, and finally outputting the global brightness adjustment image I_V′ of I_V:
I_V^k = I_V^{k−1} + α_{k−1} · I_V^{k−1} · (1 − I_V^{k−1})
wherein k denotes the iteration index; I_V^0 denotes the input low-illumination V-channel image I_V; k = 7; α_{k−1} ∈ {α_{0,1...t}};
The low-illumination image to be enhanced is input into a local enhancement network module for processing; the local enhancement network module comprises three convolution units, namely a first convolution unit to a third convolution unit, each convolution unit comprising a plurality of convolution layers;
the local enhancement network module first performs a splicing operation on the input and its corresponding V channel and inputs the result into the first convolution unit; the output of the first convolution unit and the global brightness adjustment image I_V′ are spliced and input into the second convolution unit; the output of the first convolution layer in the second convolution unit and the final output of the second convolution unit are input into the third convolution unit; the third convolution unit is connected to the output layer, and the output of the output layer is the enhanced image.
7. A supervised learning based fast light weight low illumination image enhancement system according to claim 6, wherein the first to third convolution units each comprise two convolution layers.
8. The supervised learning based fast light weight low illumination image enhancement system according to claim 7, wherein the convolution layers in the first to third convolution units are each 3×3 convolution layers.
9. A computer storage medium having stored therein at least one instruction that is loaded and executed by a processor to implement a supervised learning based fast lightweight low illumination image enhancement system as recited in any of claims 6 to 8.
10. A supervised learning based fast lightweight low illumination image enhancement apparatus, the apparatus comprising a processor and a memory having at least one instruction stored therein, the at least one instruction being loaded and executed by the processor to implement a supervised learning based fast lightweight low illumination image enhancement system as recited in any of claims 6 to 8.
CN202310343585.6A 2023-03-31 2023-03-31 Method and system for enhancing rapid light-weight low-illumination image based on supervised learning Active CN116363009B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310343585.6A CN116363009B (en) 2023-03-31 2023-03-31 Method and system for enhancing rapid light-weight low-illumination image based on supervised learning

Publications (2)

Publication Number Publication Date
CN116363009A CN116363009A (en) 2023-06-30
CN116363009B true CN116363009B (en) 2024-03-12

Family

ID=86915638

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310343585.6A Active CN116363009B (en) 2023-03-31 2023-03-31 Method and system for enhancing rapid light-weight low-illumination image based on supervised learning

Country Status (1)

Country Link
CN (1) CN116363009B (en)

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109255758A (en) * 2018-07-13 2019-01-22 杭州电子科技大学 Image enchancing method based on full 1*1 convolutional neural networks
CN111402145A (en) * 2020-02-17 2020-07-10 哈尔滨工业大学 Self-supervision low-illumination image enhancement method based on deep learning
CN112308803A (en) * 2020-11-25 2021-02-02 哈尔滨工业大学 Self-supervision low-illumination image enhancement and denoising method based on deep learning
CN112465727A (en) * 2020-12-07 2021-03-09 北京邮电大学 Low-illumination image enhancement method without normal illumination reference based on HSV color space and Retinex theory
CN113191995A (en) * 2021-04-30 2021-07-30 东北大学 Video image automatic exposure correction method based on deep learning
WO2022052445A1 (en) * 2020-09-09 2022-03-17 苏州科达科技股份有限公司 Deep-learning-based image enhancement method, system and device, and storage medium
CN114881879A (en) * 2022-05-17 2022-08-09 燕山大学 Underwater image enhancement method based on brightness compensation residual error network
CN115035011A (en) * 2022-06-09 2022-09-09 重庆邮电大学 Low-illumination image enhancement method for self-adaptive RetinexNet under fusion strategy
CN115147311A (en) * 2022-08-01 2022-10-04 北京吉利学院 Image enhancement method based on HSV and AM-RetinexNet
CN115205147A (en) * 2022-07-13 2022-10-18 福州大学 Multi-scale optimization low-illumination image enhancement method based on Transformer

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Zero-reference deep curve estimation for low-light image enhancement; Chunle Guo et al.; In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; pp. 1780–1789 *
Deep-learning-based low-light color code image enhancement; Fang Luping et al.; Journal of Zhejiang University of Technology (No. 04); full text *
Research on image restoration and quality assessment methods based on deep neural networks; Zhang Yu; China Masters' Theses Full-text Database, Information Science and Technology (No. 01); full text *
Low-illumination image enhancement based on grayscale transformation and improved Retinex; You Dazhang et al.; Infrared Technology, Vol. 45 (No. 02); full text *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant