CN115239603A - Unmanned aerial vehicle aerial image dim light enhancing method based on multi-branch neural network - Google Patents
Unmanned aerial vehicle aerial image dim light enhancing method based on multi-branch neural network
- Publication number
- CN115239603A (application CN202211166059.9A)
- Authority
- CN
- China
- Prior art keywords
- image
- neural network
- enhanced
- dim light
- branch
- Prior art date
- 2022-09-23
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/50—Image enhancement or restoration using two or more images, e.g. averaging or subtraction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
- G06V20/17—Terrestrial scenes taken from planes or by drones
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G06T2207/20132—Image cropping
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20172—Image enhancement details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20212—Image combination
- G06T2207/20221—Image fusion; Image merging
Abstract
The invention provides an unmanned aerial vehicle aerial image dim-light enhancement method based on a multi-branch neural network. The method comprises: acquiring unmanned aerial vehicle aerial dim-light images in different scenes and constructing a dim-light image data set; selecting image data from the dim-light image data set, inputting it into a pre-established multi-branch dim-light enhancement neural network, and outputting a color image of the same size as the input image by performing feature extraction and feature enhancement on the input image data at different levels; fusing the color image features of the different branches to obtain a final enhanced image; constructing the model loss of the multi-branch dim-light enhancement neural network and training the network on the enhanced images as training samples until the model loss converges, to obtain a trained network model; and enhancing the unmanned aerial vehicle aerial dim-light image to be tested with the trained model. The proposed method effectively solves the problem of poor aerial image quality under dim-light conditions.
Description
Technical Field
The invention relates to the technical field of computer vision image processing, in particular to an unmanned aerial vehicle aerial image dim light enhancing method based on a multi-branch neural network.
Background
In a dim-light environment, the camera cannot receive sufficient light during shooting because of the limited illumination. The resulting color images exhibit obvious defects: colors are dark, local features are not prominent, and dark regions suffer information loss and unexpected noise. These defects reduce the accuracy of unmanned aerial vehicle aerial recognition and degrade the visual impression of the images, severely affecting the aerial photography results.
Disclosure of Invention
In order to make up for the defects in the prior art, the invention provides an unmanned aerial vehicle aerial image dim-light enhancement method based on a multi-branch neural network, which solves the problem of poor aerial image quality under dim-light conditions, improves the accuracy of unmanned aerial vehicle aerial recognition, and enhances the shooting results of the unmanned aerial vehicle in dim-light environments.
The purpose of the invention is realized by adopting the following technical scheme:
an unmanned aerial vehicle aerial image dim light enhancing method based on a multi-branch neural network comprises the following steps:
acquiring unmanned aerial vehicle aerial dim-light images in different scenes, and constructing a dim-light image data set;
selecting image data from the dim-light image data set, inputting it into a pre-established multi-branch dim-light enhancement neural network, and outputting a color image of the same size as the input image by performing feature extraction and feature enhancement on the input image data at different levels;
fusing the color image features of the different branches to obtain a final enhanced image;
constructing the model loss of the multi-branch dim-light enhancement neural network, and training the network on the enhanced images as training samples until the model loss converges, to obtain a trained network model;
and enhancing the unmanned aerial vehicle aerial dim-light image to be tested by using the trained model.
Preferably, the constructing of the dim-light image data set comprises: acquiring photos shot under normal-light and dim-light conditions in different scenes at preset angles and positions of the unmanned aerial vehicle, and cropping and scaling each photo into a three-channel color image of size 255×255;
collecting the three-channel color image data and dividing the collected data proportionally, with 80% used as the training set and 20% as the test set.
Preferably, the pre-establishing of the multi-branch dim-light enhancement neural network comprises:
acquiring image data in the training set as an input image, performing feature extraction at different levels on the input image through a feature extraction module, and performing feature enhancement on the extracted image features through an enhancement module to generate a color image of the same size as the input image; the feature extraction module is a 10-layer convolutional neural network in which each layer uses 32 3×3 convolution kernels for feature extraction, and a ReLU activation function follows each convolution for non-linear mapping.
Preferably, the enhancement module comprises a 10-layer convolutional neural network in which each layer is formed by connecting 8 3×3 convolution kernels, 16 5×5 deconvolution kernels, 8 5×5 deconvolution kernels, and 3 5×5 deconvolution kernels in series, with a ReLU activation function after each convolution for non-linear mapping.
Preferably, the enhanced image is obtained by fusing the enhanced image features through a fusion module using a 3-channel 1×1 convolution kernel.
Preferably, the model loss of the multi-branch dim-light enhancement neural network comprises a structure loss and a context loss;
wherein the structure loss is used to measure the structural difference between the enhanced image and its corresponding normal-light image and to guide the learning process;
and the context loss is used to measure the feature-level difference between the enhanced image and its corresponding normal-light image.
Further, the structural loss is determined by the following formula (in the standard structural-similarity (SSIM) form implied by the quantities described below):

$$L_{str} = 1 - \mathrm{SSIM}(E,G) = 1 - \frac{(2\mu_E\mu_G + C_1)\,(2\sigma_{EG} + C_2)}{(\mu_E^2 + \mu_G^2 + C_1)\,(\sigma_E^2 + \sigma_G^2 + C_2)}$$

wherein $L_{str}$ is the structure loss; $\mu_E$ and $\mu_G$ are the mean pixel values of the enhanced image $E$ and the normal-light image $G$, respectively; $\sigma_E^2$, $\sigma_G^2$, and $\sigma_{EG}$ are the variances and covariance of the two images' pixel values; and $C_1$, $C_2$ are hyper-parameters.
Further, the context loss is determined by (in the standard perceptual-loss form implied by the variables described below):

$$L_{context} = \frac{1}{W_{i,j} H_{i,j} C_{i,j}} \sum_{x=1}^{W_{i,j}} \sum_{y=1}^{H_{i,j}} \sum_{z=1}^{C_{i,j}} \left\lVert \phi_{i,j}(E)_{x,y,z} - \phi_{i,j}(G)_{x,y,z} \right\rVert$$

wherein $E$ and $G$ are the enhanced image and the real (normal-light) image, respectively; $W_{i,j}$, $H_{i,j}$, and $C_{i,j}$ describe the size of the feature map in the pre-trained VGG-19 network; and $\phi_{i,j}$ denotes the feature map obtained from the $j$-th convolutional layer of the $i$-th block of the pre-trained VGG-19 network.
Preferably, the constructing of the model loss of the multi-branch dim-light enhancement neural network and the training of the network on the enhanced images as training samples until the model loss converges to obtain a trained network model comprises: inputting the corresponding training samples according to the predefined sample batch size and iteratively training the model; for the features extracted from the normal-light image by the pre-trained VGG-19 network in the context loss, using the output of the fourth convolutional layer of the third block of the VGG-19 network as the context-loss extraction layer during training;
and updating the network parameters through the Adam optimization algorithm, and saving the model with the highest test result as the final model after the iterations finish.
Preferably, the enhancing of the unmanned aerial vehicle aerial dim-light image to be tested using the trained model comprises:
acquiring a dim-light image shot by the unmanned aerial vehicle in a dim-light environment, scaling the shot image to 255×255, inputting it into the trained model, and obtaining the enhanced picture.
Compared with the prior art, the invention has the following beneficial effects:
the invention provides a dark light enhancement method for aerial images of unmanned aerial vehicles based on a multi-branch neural network, which comprises the steps of firstly, obtaining aerial dark light images of unmanned aerial vehicles under different scenes to construct a dark light image data set; selecting image data in a dim light image data set, inputting a pre-established multi-branch dim light enhancement neural network, and outputting a color image with the same size as an input image by performing feature extraction and feature enhancement on the input image data at different levels; secondly, fusing the color image characteristics of different branches to obtain a final enhanced image; finally, constructing model loss of the multi-branch dim light enhanced neural network, taking the enhanced image as a training sample, and training the multi-branch dim light enhanced neural network until the model loss is converged to obtain a trained network model; and enhancing the aerial dark light image of the unmanned aerial vehicle to be tested by using the trained model. Therefore, the problem of poor image quality of aerial images under the dark light condition is effectively solved.
The invention innovates at the level of the method rather than relying on the inherent mode of traditional unmanned aerial vehicle shooting, and compensates for shooting defects caused by limited illumination, such as dark image colors and unobtrusive local features. It improves the accuracy of unmanned aerial vehicle aerial recognition and enhances the shooting results in dim-light environments.
Drawings
In order to more clearly illustrate the detailed description of the invention or the technical solutions in the prior art, the drawings used in the detailed description or the prior art description will be briefly described below. Throughout the drawings, like elements or portions are generally identified by like reference numerals. In the drawings, elements or portions are not necessarily drawn to scale.
FIG. 1 shows a flow chart of a method for enhancing dim light of an aerial image of an unmanned aerial vehicle based on a multi-branch neural network, provided by the invention;
FIG. 2 shows a schematic structural diagram of a system for enhancing the dim light of an aerial image of an unmanned aerial vehicle based on a multi-branch neural network, provided by the invention;
fig. 3 shows a schematic structural diagram of an electronic device provided by the present invention.
Detailed Description
Embodiments of the present invention will be described in detail below with reference to the accompanying drawings. The following examples are only for illustrating the technical solutions of the present invention more clearly, and therefore are only examples, and the protection scope of the present invention is not limited thereby.
It is to be noted that, unless otherwise specified, technical or scientific terms used herein shall have the ordinary meaning as understood by those skilled in the art to which the invention pertains.
As shown in fig. 1, the embodiment of the present invention provides a method for enhancing dim light in an aerial image of an unmanned aerial vehicle based on a multi-branch neural network, the method comprising the following steps:
S1, acquiring unmanned aerial vehicle aerial dim-light images in different scenes, and constructing a dim-light image data set;
S2, selecting image data from the dim-light image data set, inputting it into a pre-established multi-branch dim-light enhancement neural network, and outputting a color image of the same size as the input image by performing feature extraction and feature enhancement on the input image data at different levels; and fusing the color image features of the different branches to obtain a final enhanced image;
S3, constructing the model loss of the multi-branch dim-light enhancement neural network, and training the network on the enhanced images as training samples until the model loss converges, to obtain a trained network model;
S4, enhancing the unmanned aerial vehicle aerial dim-light image to be tested by using the trained model.
In step S1, the process of constructing the dim-light image data set includes: acquiring photos shot under normal-light and dim-light conditions in different scenes at preset angles and positions of the unmanned aerial vehicle, and cropping and scaling each photo into a three-channel color image of size 255×255;
collecting the three-channel color image data and dividing the collected data proportionally, with 80% used as the training set and 20% as the test set. The training set stores the image data for model training; the test set stores the image data to be tested.
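As a concrete illustration, the following is a minimal data-preparation sketch in Python; the directory layout, PNG file naming, bilinear resampling, and the `build_dataset`/`preprocess` helper names are assumptions made for illustration, not details taken from the patent.

```python
# Hypothetical data-preparation sketch; directory layout, file naming and
# helper names are assumptions, not details from the patent.
import random
from pathlib import Path

from PIL import Image


def preprocess(img: Image.Image, out_size: int = 255) -> Image.Image:
    """Center-crop to a square, then scale to a three-channel color image."""
    img = img.convert("RGB")
    side = min(img.size)
    left, top = (img.width - side) // 2, (img.height - side) // 2
    return img.crop((left, top, left + side, top + side)).resize(
        (out_size, out_size), Image.BILINEAR)


def build_dataset(dim_dir: str, normal_dir: str, train_ratio: float = 0.8,
                  seed: int = 0):
    """Pair dim-light/normal-light photos shot at the same preset angle and
    position (assumed to share a file name), crop/scale each to 255x255,
    and split the pairs 80/20 into training and test sets."""
    pairs = []
    for dim_path in sorted(Path(dim_dir).glob("*.png")):
        normal_path = Path(normal_dir) / dim_path.name
        pairs.append((preprocess(Image.open(dim_path)),
                      preprocess(Image.open(normal_path))))
    random.Random(seed).shuffle(pairs)
    split = int(len(pairs) * train_ratio)
    return pairs[:split], pairs[split:]  # training set, test set
```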
In step S2, establishing the multi-branch dim-light enhancement neural network includes:
acquiring image data in the training set as an input image, performing feature extraction at different levels on the input image through a feature extraction module, and performing feature enhancement on the extracted image features through an enhancement module to generate a color image of the same size as the input image. The feature extraction module is a 10-layer convolutional neural network in which each layer uses 32 3×3 convolution kernels for feature extraction, and a ReLU activation function follows each convolution for non-linear mapping.
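A minimal PyTorch sketch of such a feature-extraction module follows. The class name, the `padding=1` choice (so each of the 10 layers preserves the spatial size), and the decision to return every layer's output as a separate branch input are assumptions; the patent fixes only the depth (10 layers), the 32 3×3 kernels per layer, and the ReLU after each convolution.

```python
# Minimal sketch of the feature-extraction module; class name and padding
# are assumptions.
import torch
import torch.nn as nn


class FeatureExtractionModule(nn.Module):
    """10 layers, each with 32 3x3 convolution kernels followed by ReLU.
    Every layer's output is kept so that each level can feed its own
    enhancement branch; padding=1 preserves the spatial size."""

    def __init__(self, in_channels: int = 3, width: int = 32, depth: int = 10):
        super().__init__()
        self.layers = nn.ModuleList(
            nn.Sequential(
                nn.Conv2d(in_channels if i == 0 else width, width,
                          kernel_size=3, padding=1),
                nn.ReLU(inplace=True),
            )
            for i in range(depth)
        )

    def forward(self, x: torch.Tensor) -> list[torch.Tensor]:
        features = []
        for layer in self.layers:
            x = layer(x)
            features.append(x)  # one feature map per level / branch
        return features
```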
The enhancement module comprises a 10-layer convolutional neural network in which each layer is formed by connecting 8 3×3 convolution kernels, 16 5×5 deconvolution kernels, 8 5×5 deconvolution kernels, and 3 5×5 deconvolution kernels in series, with a ReLU activation function after each convolution for non-linear mapping.
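A sketch of one such enhancement branch, continuing the sketch above, might look as follows; the stride-1/`padding=2` choices that keep the 5×5 deconvolutions size-preserving are assumptions, while the kernel counts and sizes follow the text above.

```python
# Sketch of one enhancement branch; stride-1 deconvolutions with padding=2
# (an assumption) keep the 5x5 stages size-preserving, so the branch output
# is a 3-channel image of the same size as its input feature map.
import torch
import torch.nn as nn


class EnhancementBranch(nn.Module):
    """A 3x3 convolution with 8 kernels, then 5x5 deconvolutions with
    16, 8, and 3 kernels in series, with a ReLU after every stage,
    as described above."""

    def __init__(self, in_channels: int = 32):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(in_channels, 8, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(8, 16, kernel_size=5, padding=2),
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(16, 8, kernel_size=5, padding=2),
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(8, 3, kernel_size=5, padding=2),
            nn.ReLU(inplace=True),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.body(x)
```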
The enhanced image in step S2 is obtained by fusing the enhanced image features through a fusion module using a 3-channel 1×1 convolution kernel.
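Building on the sketches above, the fusion step can be illustrated as concatenating the 3-channel branch outputs along the channel axis before the 1×1 convolution; the concatenation and the overall assembly class are assumptions about how the branch outputs reach the 1×1 kernel.

```python
# Sketch of the fusion step and the overall assembly; channel-wise
# concatenation before the 1x1 convolution is an assumption.
import torch
import torch.nn as nn


class FusionModule(nn.Module):
    """Fuse the 3-channel outputs of all branches into one 3-channel
    enhanced image with a 1x1 convolution."""

    def __init__(self, num_branches: int = 10):
        super().__init__()
        self.fuse = nn.Conv2d(3 * num_branches, 3, kernel_size=1)

    def forward(self, branch_outputs: list[torch.Tensor]) -> torch.Tensor:
        return self.fuse(torch.cat(branch_outputs, dim=1))


class MultiBranchDimLightNet(nn.Module):
    """Assembly of the sketches above: 10 feature levels, one enhancement
    branch per level, and 1x1-convolution fusion to the final image."""

    def __init__(self):
        super().__init__()
        self.fem = FeatureExtractionModule()
        self.branches = nn.ModuleList(EnhancementBranch() for _ in range(10))
        self.fusion = FusionModule(num_branches=10)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        features = self.fem(x)
        outputs = [b(f) for b, f in zip(self.branches, features)]
        return self.fusion(outputs)
```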
In step S3, the model loss of the multi-branch dim-light enhancement neural network comprises a structure loss and a context loss. The structure loss is used to measure the structural difference between the enhanced image and its corresponding normal-light image and to guide the learning process;
and the context loss is used to measure the feature-level difference between the enhanced image and its corresponding normal-light image.
The structural loss is determined by the following formula (in the standard structural-similarity (SSIM) form implied by the quantities described below):

$$L_{str} = 1 - \mathrm{SSIM}(E,G) = 1 - \frac{(2\mu_E\mu_G + C_1)\,(2\sigma_{EG} + C_2)}{(\mu_E^2 + \mu_G^2 + C_1)\,(\sigma_E^2 + \sigma_G^2 + C_2)}$$

wherein $L_{str}$ is the structure loss; $\mu_E$ and $\mu_G$ are the mean pixel values of the enhanced image $E$ and the normal-light image $G$, respectively; $\sigma_E^2$, $\sigma_G^2$, and $\sigma_{EG}$ are the variances and covariance of the two images' pixel values; and $C_1$, $C_2$ are hyper-parameters.
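Continuing the sketches above, the structure loss can be illustrated as 1 − SSIM computed with local statistics from average pooling; the 11×11 window and the conventional SSIM constants C1 = 0.01² and C2 = 0.03² are assumed defaults, not values from the patent.

```python
# Sketch of the structure loss as 1 - SSIM; the 11x11 window and the
# conventional constants C1, C2 are assumed defaults, not patent values.
import torch
import torch.nn.functional as F


def structure_loss(enhanced: torch.Tensor, normal: torch.Tensor,
                   c1: float = 0.01 ** 2, c2: float = 0.03 ** 2,
                   window: int = 11) -> torch.Tensor:
    """1 - mean SSIM, with local means/variances/covariance estimated by
    average pooling over a sliding window (inputs assumed in [0, 1])."""
    pad = window // 2
    mu_e = F.avg_pool2d(enhanced, window, stride=1, padding=pad)
    mu_g = F.avg_pool2d(normal, window, stride=1, padding=pad)
    var_e = F.avg_pool2d(enhanced * enhanced, window, 1, pad) - mu_e ** 2
    var_g = F.avg_pool2d(normal * normal, window, 1, pad) - mu_g ** 2
    cov = F.avg_pool2d(enhanced * normal, window, 1, pad) - mu_e * mu_g
    ssim = ((2 * mu_e * mu_g + c1) * (2 * cov + c2)) / (
        (mu_e ** 2 + mu_g ** 2 + c1) * (var_e + var_g + c2))
    return 1.0 - ssim.mean()
```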
The context loss is determined by (in the standard perceptual-loss form implied by the variables described below):

$$L_{context} = \frac{1}{W_{i,j} H_{i,j} C_{i,j}} \sum_{x=1}^{W_{i,j}} \sum_{y=1}^{H_{i,j}} \sum_{z=1}^{C_{i,j}} \left\lVert \phi_{i,j}(E)_{x,y,z} - \phi_{i,j}(G)_{x,y,z} \right\rVert$$

wherein $E$ and $G$ are the enhanced image and the real image, respectively; $W_{i,j}$, $H_{i,j}$, and $C_{i,j}$ describe the size of each feature map in the pre-trained VGG-19 network; and $\phi_{i,j}$ denotes the feature map obtained from the $j$-th convolutional layer of the $i$-th block of the pre-trained VGG-19 network. The VGG-19 network is a classical convolutional neural network structure comprising 5 blocks with 2, 2, 4, 4, and 4 convolutional layers, respectively; the convolutional layers within a block use the same number of convolution kernels, namely 64, 128, 256, 512, and 512 across the five blocks. The convolutional layers within each block also share the same output size, respectively: 112×112, 56×56, 28×28, 14×14, and 7×7. That is, $i$ in the formula above takes the values 1-5, and $j$ ranges over the convolutional layers of the corresponding block: 1-2, 1-2, 1-4, 1-4, and 1-4, respectively. $W_{i,j}$ and $H_{i,j}$ are the same within each block, taking the values (112, 112), (56, 56), (28, 28), (14, 14), and (7, 7), i.e. the length and width of the output feature maps; $C_{i,j}$, the number of convolution kernels, is likewise the same within each block: 64, 128, 256, 512, and 512. As step S3 states below, the output of the fourth convolutional layer of the third block of the VGG-19 network is used as the context-loss extraction layer during training, so the formula is evaluated with $i = 3$, $j = 4$, $W_{i,j} = H_{i,j} = 28$, and $C_{i,j} = 256$.
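A hedged PyTorch sketch of this context loss follows; the torchvision layer index for the fourth convolution of block 3 (`conv3_4`), the mean-absolute-difference norm, and the omission of ImageNet input normalization are assumptions.

```python
# Sketch of the context loss on pre-trained VGG-19 features; the layer
# index, the mean-absolute-difference norm, and the omission of ImageNet
# input normalization are assumptions.
import torch
import torch.nn as nn
import torchvision.models as models


class ContextLoss(nn.Module):
    """Mean absolute difference between VGG-19 feature maps of the enhanced
    and real images, i.e. the 1/(W*H*C)-normalized sum in the formula."""

    def __init__(self, layer_index: int = 16):
        # Index 16 corresponds to conv3_4 in torchvision's vgg19.features.
        super().__init__()
        vgg = models.vgg19(weights=models.VGG19_Weights.IMAGENET1K_V1).features
        self.extractor = nn.Sequential(*list(vgg.children())[:layer_index + 1])
        self.extractor.eval()
        for p in self.extractor.parameters():
            p.requires_grad = False  # the VGG-19 network stays frozen

    def forward(self, enhanced: torch.Tensor, real: torch.Tensor) -> torch.Tensor:
        return torch.abs(self.extractor(enhanced) - self.extractor(real)).mean()
```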
in step S3, training the multi-branch dim light enhanced neural network with the enhanced image as a training sample includes: inputting corresponding training samples according to the sample input quantity of the predefined model training, and performing iterative training on the model; for the characteristic color extracted from the VGG-19 network model of the normal light image in the context loss, the output of a fourth convolution layer of a third block of the VGG-19 network is used as a context loss extraction layer in the training process;
and updating network parameters through an Adam optimization algorithm, and saving the model with the highest test result as a final model after iteration is finished.
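Putting the pieces together, a minimal training-loop sketch under the above assumptions is shown below; the batch size, learning rate, equal weighting of the two losses, and the PSNR score used to rank checkpoints are all assumptions, since the patent fixes only the Adam optimizer and the keep-the-best-test-model rule.

```python
# Training-loop sketch combining the sketches above; batch size, learning
# rate, equal loss weighting, and the PSNR checkpoint score are assumptions.
import torch
from torch.utils.data import DataLoader


@torch.no_grad()
def evaluate(model, test_set, device: str) -> float:
    """Mean PSNR over the test set (an assumed quality score)."""
    model.eval()
    total = 0.0
    for dim_img, normal_img in DataLoader(test_set, batch_size=1):
        out = model(dim_img.to(device)).clamp(0, 1)
        mse = torch.mean((out - normal_img.to(device)) ** 2)
        total += (10 * torch.log10(1.0 / mse)).item()
    return total / len(test_set)


def train(model, train_set, test_set, epochs: int = 100,
          batch_size: int = 16, lr: float = 1e-4, device: str = "cuda"):
    """Iterative Adam training on (dim, normal) tensor pairs; keeps the
    checkpoint with the highest test result as the final model."""
    model = model.to(device)
    context_loss = ContextLoss().to(device)
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    loader = DataLoader(train_set, batch_size=batch_size, shuffle=True)
    best_score = float("-inf")
    for _ in range(epochs):
        model.train()
        for dim_img, normal_img in loader:
            dim_img, normal_img = dim_img.to(device), normal_img.to(device)
            enhanced = model(dim_img)
            loss = (structure_loss(enhanced, normal_img)
                    + context_loss(enhanced, normal_img))
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
        score = evaluate(model, test_set, device)
        if score > best_score:  # save the model with the highest test result
            best_score = score
            torch.save(model.state_dict(), "best_model.pt")
```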
In step S4, enhancing the unmanned aerial vehicle aerial dim-light image to be tested with the trained model comprises the following steps:
acquiring a dim-light image shot by the unmanned aerial vehicle in a dim-light environment, scaling the shot image to 255×255, inputting it into the trained model, and obtaining the enhanced picture.
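An inference sketch consistent with this step might look as follows; the bilinear resampling and the clamping of the output to [0, 1] are assumptions about details the patent leaves open.

```python
# Inference sketch; bilinear resampling and clamping to [0, 1] are
# assumptions about details the patent leaves open.
import torch
from PIL import Image
from torchvision.transforms.functional import to_pil_image, to_tensor


@torch.no_grad()
def enhance(model, photo_path: str, device: str = "cuda") -> Image.Image:
    """Scale a dim-light shot to 255x255, run the trained model, and
    return the enhanced picture."""
    model.eval()
    img = Image.open(photo_path).convert("RGB").resize((255, 255),
                                                       Image.BILINEAR)
    x = to_tensor(img).unsqueeze(0).to(device)  # 1 x 3 x 255 x 255
    return to_pil_image(model(x).clamp(0, 1).squeeze(0).cpu())
```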
Example 1: Based on the same technical concept as the above unmanned aerial vehicle aerial image dim-light enhancement method based on a multi-branch neural network, embodiment 1 of the invention provides an unmanned aerial vehicle aerial image dim-light enhancement system based on a multi-branch neural network.
As shown in fig. 2, in this embodiment, the unmanned aerial vehicle aerial image dim light enhancement system 100 based on the multi-branch neural network includes:
the building module 110 is configured to obtain the dark light images of the unmanned aerial vehicle aerial photography in different scenes, and build a dark light image dataset;
the image acquisition module 120 is configured to select image data from the dim-light image data set, input a pre-established multi-branch dim-light enhancement neural network, perform feature extraction and feature enhancement on the input image data at different levels, and output a color image having the same size as the input image; fusing the color image characteristics of different branches to obtain a final enhanced image;
the model training module 130 is configured to construct a model loss of the multi-branch dim light enhanced neural network, train the multi-branch dim light enhanced neural network with the enhanced image as a training sample until the model loss converges, and obtain a trained network model;
and the image enhancement module 140 is configured to enhance the unmanned aerial vehicle aerial dim-light image to be tested by using the trained model.
Example 2: referring to fig. 3, embodiment 2 provides an electronic device 200 according to an embodiment of the present application. The electronic device 200 includes: a transceiver 210, a memory 220, a communication bus 230, and a processor 240.
The elements of the transceiver 210, the memory 220, and the processor 240 are electrically connected to one another, directly or indirectly, to enable data transmission or interaction. For example, these components may be electrically connected to one another via one or more communication buses 230 or signal lines. The transceiver 210 is used for transmitting and receiving data. The memory 220 is used for storing a computer program, such as the software functional module shown in fig. 2, namely the unmanned aerial vehicle aerial image dim-light enhancement system 100 based on the multi-branch neural network. The system 100 includes at least one software functional module that is stored in the memory 220 in the form of software or firmware or solidified in the operating system (OS) of the electronic device 200. The processor 240 is configured to execute the executable modules stored in the memory 220, for example the software functional modules or computer programs included in the unmanned aerial vehicle aerial image dim-light enhancement system 100 based on the multi-branch neural network.
At this time, the processor 240 is configured to acquire unmanned aerial vehicle aerial dim-light images in different scenes and construct a dim-light image data set;
select image data from the dim-light image data set, input it into the pre-established multi-branch dim-light enhancement neural network, and output a color image of the same size as the input image by performing feature extraction and feature enhancement on the input image data at different levels, fusing the color image features of the different branches to obtain a final enhanced image;
construct the model loss of the multi-branch dim-light enhancement neural network and train the network on the enhanced images as training samples until the model loss converges, to obtain a trained network model;
and enhance the unmanned aerial vehicle aerial dim-light image to be tested by using the trained model.
The Memory 220 may be, but is not limited to, a Random Access Memory (RAM), a Read-Only Memory (ROM), a Programmable Read-Only Memory (PROM), an Erasable Programmable Read-Only Memory (EPROM), an Electrically Erasable Programmable Read-Only Memory (EEPROM), and the like.
Finally, it should be noted that the above embodiments are only used to illustrate the technical solution of the present invention, not to limit it. While the invention has been described in detail with reference to the foregoing embodiments, those skilled in the art will understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be equivalently replaced; such modifications and substitutions do not depart from the spirit and scope of the present invention and should be construed as falling within the scope of the claims and the description.
Claims (10)
1. An unmanned aerial vehicle aerial image dim light enhancing method based on a multi-branch neural network is characterized by comprising the following steps:
acquiring unmanned aerial vehicle aerial photography dim light images in different scenes, and constructing a dim light image data set;
selecting image data in a dim light image data set, inputting a pre-established multi-branch dim light enhancement neural network, and outputting a color image with the same size as an input image by performing feature extraction and feature enhancement on the input image data at different levels; fusing the color image characteristics of different branches to obtain a final enhanced image;
constructing the model loss of the multi-branch dim-light enhancement neural network, and training the multi-branch dim-light enhancement neural network on the enhanced image as a training sample until the model loss converges, to obtain a trained network model;
and enhancing the unmanned aerial vehicle aerial dim-light image to be tested by using the trained model.
2. The method of claim 1, wherein constructing the dim-light image data set comprises: acquiring photos shot under normal-light and dim-light conditions in different scenes at preset angles and positions of the unmanned aerial vehicle, and cropping and scaling each photo into a three-channel color image of size 255×255;
collecting the three-channel color image data and dividing the collected data proportionally, with 80% used as the training set and 20% as the test set.
3. The method of claim 1, wherein the pre-establishing of the multi-branch dim light enhancing neural network comprises:
acquiring image data in the training set as an input image, performing feature extraction at different levels on the input image through a feature extraction module, and performing feature enhancement on the extracted image features through an enhancement module to generate a color image of the same size as the input image; the feature extraction module is a 10-layer convolutional neural network in which each layer uses 32 3×3 convolution kernels for feature extraction, and a ReLU activation function follows each convolution for non-linear mapping.
4. The method of claim 1, wherein the enhancement module comprises a 10-layer convolutional neural network in which each layer is formed by connecting 8 3×3 convolution kernels, 16 5×5 deconvolution kernels, 8 5×5 deconvolution kernels, and 3 5×5 deconvolution kernels in series, with a ReLU activation function after each convolution for non-linear mapping.
5. The method of claim 1, wherein the enhanced image is obtained by fusing the enhanced image features through a fusion module using a 3-channel 1×1 convolution kernel.
6. The method of claim 1, wherein the model loss of the multi-branch dim-light enhancement neural network comprises a structure loss and a context loss;
wherein the structure loss is used to measure the structural difference between the enhanced image and its corresponding normal-light image and to guide the learning process;
and the context loss is used to measure the feature-level difference between the enhanced image and its corresponding normal-light image.
7. The method of claim 6, wherein the structure loss is determined by:

$$L_{str} = 1 - \frac{(2\mu_E\mu_G + C_1)\,(2\sigma_{EG} + C_2)}{(\mu_E^2 + \mu_G^2 + C_1)\,(\sigma_E^2 + \sigma_G^2 + C_2)}$$

wherein $L_{str}$ is the structure loss; $\mu_E$ and $\mu_G$ are the mean pixel values of the enhanced image $E$ and the normal-light image $G$, respectively; $\sigma_E^2$, $\sigma_G^2$, and $\sigma_{EG}$ are the variances and covariance of the two images' pixel values; and $C_1$, $C_2$ are hyper-parameters.
8. The method of claim 7, wherein the context loss is determined by:

$$L_{context} = \frac{1}{W_{i,j} H_{i,j} C_{i,j}} \sum_{x=1}^{W_{i,j}} \sum_{y=1}^{H_{i,j}} \sum_{z=1}^{C_{i,j}} \left\lVert \phi_{i,j}(E)_{x,y,z} - \phi_{i,j}(G)_{x,y,z} \right\rVert$$
9. The method of claim 1, wherein constructing the model loss of the multi-branch dim-light enhancement neural network and training the network on the enhanced image as a training sample until the model loss converges to obtain a trained network model comprises: inputting the corresponding training samples according to the predefined sample batch size and iteratively training the model; for the features extracted from the normal-light image by the pre-trained VGG-19 network in the context loss, using the output of the fourth convolutional layer of the third block of the VGG-19 network as the context-loss extraction layer during training;
and updating the network parameters through the Adam optimization algorithm, and saving the model with the highest test result as the final model after the iterations finish.
10. The method of claim 1, wherein enhancing the unmanned aerial vehicle aerial dim-light image to be tested using the trained model comprises:
acquiring a dim-light image shot by the unmanned aerial vehicle in a dim-light environment, scaling the shot dim-light image to 255×255, inputting the scaled image into the trained model, and obtaining the enhanced picture.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211166059.9A CN115239603A (en) | 2022-09-23 | 2022-09-23 | Unmanned aerial vehicle aerial image dim light enhancing method based on multi-branch neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211166059.9A CN115239603A (en) | 2022-09-23 | 2022-09-23 | Unmanned aerial vehicle aerial image dim light enhancing method based on multi-branch neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115239603A true CN115239603A (en) | 2022-10-25 |
Family
ID=83667043
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211166059.9A Pending CN115239603A (en) | 2022-09-23 | 2022-09-23 | Unmanned aerial vehicle aerial image dim light enhancing method based on multi-branch neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115239603A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108986050A (en) * | 2018-07-20 | 2018-12-11 | 北京航空航天大学 | A kind of image and video enhancement method based on multiple-limb convolutional neural networks |
CN112183236A (en) * | 2020-09-10 | 2021-01-05 | 佛山聚卓科技有限公司 | Unmanned aerial vehicle aerial photography video content identification method, device and system |
CN112819707A (en) * | 2021-01-15 | 2021-05-18 | 电子科技大学 | End-to-end anti-blocking effect low-illumination image enhancement method |
CN114998145A (en) * | 2022-06-07 | 2022-09-02 | 湖南大学 | Low-illumination image enhancement method based on multi-scale and context learning network |
- 2022
- 2022-09-23 CN CN202211166059.9A patent/CN115239603A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108986050A (en) * | 2018-07-20 | 2018-12-11 | 北京航空航天大学 | A kind of image and video enhancement method based on multiple-limb convolutional neural networks |
CN112183236A (en) * | 2020-09-10 | 2021-01-05 | 佛山聚卓科技有限公司 | Unmanned aerial vehicle aerial photography video content identification method, device and system |
CN112819707A (en) * | 2021-01-15 | 2021-05-18 | 电子科技大学 | End-to-end anti-blocking effect low-illumination image enhancement method |
CN114998145A (en) * | 2022-06-07 | 2022-09-02 | 湖南大学 | Low-illumination image enhancement method based on multi-scale and context learning network |
Non-Patent Citations (1)
Title |
---|
FEIFAN LV et al.: "MBLLEN: Low-light Image/Video Enhancement Using CNNs", Conference: BMVC *
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11521011B2 (en) | Method and apparatus for training neural network model for enhancing image detail | |
US20220188999A1 (en) | Image enhancement method and apparatus | |
CN108921161B (en) | Model training method and device, electronic equipment and computer readable storage medium | |
CN108304821B (en) | Image recognition method and device, image acquisition method and device, computer device and non-volatile computer-readable storage medium | |
CN111598799A (en) | Image toning enhancement method and image toning enhancement neural network training method | |
CN108764372A (en) | Construction method and device, mobile terminal, the readable storage medium storing program for executing of data set | |
CN112446380A (en) | Image processing method and device | |
CN109102885B (en) | Automatic cataract grading method based on combination of convolutional neural network and random forest | |
WO2019223513A1 (en) | Image recognition method, electronic device and storage medium | |
CN110276831B (en) | Method and device for constructing three-dimensional model, equipment and computer-readable storage medium | |
CN112465840A (en) | Semantic segmentation model training method, semantic segmentation method and related device | |
US11138693B2 (en) | Attention-driven image manipulation | |
CN110991412A (en) | Face recognition method and device, storage medium and electronic equipment | |
CN114565539B (en) | Image defogging method based on online knowledge distillation | |
WO2024153156A1 (en) | Image processing method and apparatus, and device and medium | |
CN111862040A (en) | Portrait picture quality evaluation method, device, equipment and storage medium | |
CN110717864A (en) | Image enhancement method and device, terminal equipment and computer readable medium | |
CN111354058B (en) | Image coloring method and device, image acquisition equipment and readable storage medium | |
CN115239603A (en) | Unmanned aerial vehicle aerial image dim light enhancing method based on multi-branch neural network | |
CN114175093A (en) | Detection device and detection method for display panel, electronic device and readable medium | |
CN111583166A (en) | Image fusion network model construction and training method and device | |
CN116137023A (en) | Low-illumination image enhancement method based on background modeling and detail enhancement | |
Yuan et al. | A plug-and-play image enhancement model for end-to-end object detection in low-light condition | |
US11606544B2 (en) | Neural network based auto-white-balancing | |
CN111612690B (en) | Image splicing method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20221025 |