CN111767964A - Improved DenseNet-based multi-channel feature re-labeling image classification method - Google Patents
- Publication number
- CN111767964A (application number CN202010649381.1A)
- Authority
- CN
- China
- Prior art keywords: training, data set, network, DenseNet, training data
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G06F18/24: Pattern recognition; Classification techniques
- G06F18/214: Pattern recognition; Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06N3/045: Neural networks; Combinations of networks
- G06N3/08: Neural networks; Learning methods
Abstract
The invention relates to a multi-channel feature re-labeling image classification method based on an improved DenseNet, comprising the following steps. Step S1: collect a training data set and preprocess it to obtain a preprocessed training data set. Step S2: build a classification model based on a multi-channel feature re-labeling densely connected network. Step S3: train the classification model on the training data set, introducing the evaluation indices Acc and F1-Score to evaluate the training process in real time, saving the model and data in real time, and plotting the final iteration results to obtain a trained classification model. Step S4: input the data set to be classified into the trained classification model to obtain the classification result. The invention effectively improves image recognition and classification accuracy.
Description
Technical Field
The invention relates to the technical field of image recognition and classification, and in particular to a multi-channel feature re-labeling image classification method based on an improved DenseNet.
Background
In recent years, convolutional neural networks (CNNs) have driven a series of advances in computer vision, with great progress in image recognition, object detection, medical imaging, semantic segmentation, and related fields. Research on convolutional neural networks is ongoing, and a succession of efficient, well-designed models such as AlexNet, VGGNet, GoogLeNet, ResNet, and DenseNet has achieved significant performance gains across a range of computer vision tasks.
At present, driven by hardware such as GPUs, convolutional neural networks are developing toward greater depth and width. In research on network depth, the VGG19 network demonstrated that depth is a key factor in improving architecture performance, and the Highway and ResNet structures pushed networks past 100 layers by adopting data bypasses and skip connections. Subsequently, DenseNet proposed a new dense connection pattern in which information flows between all layers: each layer is connected to every subsequent layer in a feed-forward manner, so that each layer in the network receives the features of all preceding layers as input. Other researchers have explored the influence of channel width on CNN performance: GoogLeNet widens the network by applying multi-scale convolution kernels within a single convolutional layer to enhance feature extraction, and Wide Residual Networks, following the principle that depth is not the only important parameter, achieve a wider network by reducing depth while increasing the feature width of each layer.
However, research shows that making the network structure ever deeper and wider does not yield ever better performance; such networks are prone to poor convergence, severe overfitting, and vanishing gradients. Moreover, once a network reaches a certain depth and width, its accuracy tends to saturate. These problems have hindered further progress in image recognition and classification, prompting many researchers to study new structures.
Disclosure of Invention
In view of this, the present invention provides an improved DenseNet-based multi-channel feature re-labeling image classification method that effectively improves image recognition and classification accuracy.
In order to achieve the purpose, the invention adopts the following technical scheme:
A multi-channel feature re-labeling image classification method based on an improved DenseNet comprises the following steps:
step S1: collecting a training data set and preprocessing it to obtain a preprocessed training data set;
step S2: building a classification model based on a multi-channel feature re-labeling densely connected network;
step S3: training the classification model on the training data set, introducing the evaluation indices Acc and F1-Score to evaluate the training process in real time, saving the model and data in real time, and plotting the final iteration results to obtain a trained classification model;
step S4: inputting the data set to be classified into the trained classification model to obtain the classification result.
Further, the training data set preprocessing specifically comprises: applying stretching and rotation transformations and data expansion to the data samples with an image generator to obtain a training sample set larger than the original data set.
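The preprocessing described above can be sketched as follows. The patent does not name a specific picture-generator library, so this is a minimal numpy stand-in; the function name `augment_batch` and the particular transforms (horizontal flip, small shift) are illustrative:

```python
import numpy as np

def augment_batch(images, rng=None):
    """Expand a batch of (N, H, W, C) images into a larger sample set.

    A minimal stand-in for the patent's picture generator: each source
    batch yields a horizontally flipped copy and a randomly shifted
    copy, so the augmented set is 3x the size of the original.
    """
    rng = rng if rng is not None else np.random.default_rng(0)
    flipped = images[:, :, ::-1, :]              # horizontal flip
    shift = int(rng.integers(1, 4))              # small translation amount
    shifted = np.roll(images, shift, axis=2)     # shift along the width axis
    return np.concatenate([images, flipped, shifted], axis=0)

batch = np.zeros((8, 32, 32, 3), dtype=np.float32)  # e.g. CIFAR-sized images
augmented = augment_batch(batch)                    # shape (24, 32, 32, 3)
```

A real pipeline would add rotations and stretches as the patent describes; the point is only that the augmented training set is strictly larger than the original data set.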
Further, step S2 specifically comprises:
step S21: reconstructing an ascending and descending dimensional Bottleneck structure based on a DenseNet-BC network 1 x 1conv +3 x 3conv to form a Reconstructed Bottleneck structure suitable for the same dimension 3 x 3conv + 3conv of a low-depth and narrow-width network;
step S22: the method comprises the following steps of forming a dense connection block DenseBlock by using a multichannel parallel connection DenseBlock while forming the dense connection block structure of a multipath;
step S23: introducing a Squeeze-and-Excitation (SE) structure with feature re-calibration capability after the multi-path dense connection blocks, and marking important features.
Further, step S21 specifically comprises: improving the Bottleneck unit structure in DenseNet by replacing the 1 × 1 conv of the original structure with a 3 × 3 conv, stacking two 3 × 3 convolutional layers in place of one 5 × 5 convolutional layer; and, in keeping with the shallow-depth, narrow-width design, changing the 4k-k raise-then-reduce dimension feature extraction of the original unit structure directly into a same-dimension 4k-4k structure, further widening the network channels.
Further, step S22 specifically comprises: adding parallel dense blocks of similar depth to the DenseBlocks built from the Reconstructed Bottleneck structure to increase the width of the network, i.e., connecting the network in parallel over multiple channels;
since the number of paths n and the network depth d do not exhibit a simple inverse relationship, their values must be determined for each training data set by a small-range search to find a suitable number of paths;
for a single DenseBlock:
let x_0 be the input, H_1 the composite mapping applied to x_0 with output x_1, H_2 the mapping applied to [x_0, x_1], and so on; that is, an L-layer DenseNet has L(L+1)/2 direct connections, each layer taking the concatenation of all preceding layers as input:
x_l = H_l([x_0, x_1, …, x_(l-1)])
The multi-path DenseBlock is then expressed as:
x_j = x_l1 + x_l2 + … + x_ln = H_l1([x_0, x_1, …, x_(l-1)]) + H_l2([x_0, x_1, …, x_(l-1)]) + … + H_ln([x_0, x_1, …, x_(l-1)]),
where x_j is the feature output by the multiple parallel channels and H_ln denotes the feature output by the nth DenseBlock.
Compared with the prior art, the invention has the following beneficial effects:
the invention effectively improves the image recognition classification detection precision.
Drawings
FIG. 1 is an overall block diagram of an embodiment of the present invention;
FIG. 2 is a flow chart of the overall structure improvement in one embodiment of the present invention;
FIG. 3 is a flow chart of network training based on multi-channel feature re-labeling in the DenseNet network in an embodiment of the present invention.
Detailed Description
The invention is further explained below with reference to the drawings and the embodiments.
Referring to FIG. 1, the present invention provides a multi-channel feature re-labeling image classification method based on an improved DenseNet, comprising the following steps:
Step S1: collecting public training data sets such as CIFAR-10/100, SVHN, and MNIST, and applying transformations such as stretching and rotation, together with data expansion, to the data samples using an image generator to obtain a training sample set larger than the original data set; this improves the network's image recognition capability and model fitting, and yields the preprocessed training data set;
step S2: building a classification model based on a multi-channel feature re-labeling densely connected network;
step S3: training the classification model on the training data set, introducing the evaluation indices Acc and F1-Score to evaluate the training process in real time, saving the model and data in real time, and plotting the final iteration results to obtain a trained classification model;
step S4: inputting the data set to be classified into the trained classification model to obtain the classification result.
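The Acc and F1-Score indices introduced in step S3 can be computed as below. The source does not specify the averaging mode for F1-Score, so macro averaging over classes is assumed here, and the function name is illustrative:

```python
def acc_and_macro_f1(y_true, y_pred, num_classes):
    """Accuracy and macro-averaged F1-Score for a multi-class classifier.

    Mirrors the Acc and F1-Score evaluation indices of step S3; the
    macro averaging is an assumption, since the patent does not state it.
    """
    acc = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    f1s = []
    for c in range(num_classes):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1s.append(2 * prec * rec / (prec + rec) if prec + rec else 0.0)
    return acc, sum(f1s) / num_classes

# e.g. 3 of 4 predictions correct: acc = 0.75, macro F1 = (2/3 + 4/5) / 2
acc, f1 = acc_and_macro_f1([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
```

Evaluating both indices each epoch, as step S3 describes, guards against a model that scores high accuracy while neglecting minority classes.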
In this embodiment, step S2 specifically comprises:
step S21: reconstructing the dimension-raising and dimension-reducing Bottleneck structure (1 × 1 conv + 3 × 3 conv) of the DenseNet-BC network into a Reconstructed Bottleneck structure of same-dimension 3 × 3 conv + 3 × 3 conv, suited to a low-depth, narrow-width network;
step S22: using the Reconstructed Bottleneck structure to build dense connection blocks (DenseBlocks), and connecting multiple DenseBlocks in parallel to form a multi-path densely connected structure, enhancing feature reuse;
step S23: introducing a Squeeze-and-Excitation (SE) structure with feature re-calibration capability after the multi-path dense connection blocks, marking important features, reducing redundant features, and improving classification speed.
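The SE re-calibration of step S23 can be sketched as follows. This is a minimal numpy illustration of the standard Squeeze-and-Excitation gate (global average pooling, two fully connected layers with ReLU and sigmoid, channel-wise rescaling); the function name, weight arguments, and reduction ratio are illustrative, not taken from the patent:

```python
import numpy as np

def se_recalibrate(x, w1, w2):
    """Squeeze-and-Excitation channel re-calibration (numpy sketch).

    x  : feature map of shape (C, H, W)
    w1 : (C // r, C) squeeze FC weights, for reduction ratio r
    w2 : (C, C // r) excitation FC weights
    """
    z = x.mean(axis=(1, 2))               # squeeze: global average pool -> (C,)
    s = np.maximum(w1 @ z, 0.0)           # FC + ReLU -> (C // r,)
    s = 1.0 / (1.0 + np.exp(-(w2 @ s)))   # FC + sigmoid -> per-channel gate in (0, 1)
    return x * s[:, None, None]           # re-scale (re-label) each channel

rng = np.random.default_rng(1)
x = np.abs(rng.normal(size=(8, 4, 4)))   # one 8-channel feature map
w1 = rng.normal(size=(2, 8))             # C = 8, r = 4 assumed
w2 = rng.normal(size=(8, 2))
y = se_recalibrate(x, w1, w2)            # same shape, channels re-weighted
```

Channels whose gate value is near 1 are the "important features" the patent marks; channels gated toward 0 contribute less to the following layers.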
In this embodiment, step S21 specifically comprises: improving the Bottleneck unit structure in DenseNet by replacing the 1 × 1 conv of the original structure with a 3 × 3 conv, stacking two 3 × 3 convolutional layers in place of one 5 × 5 convolutional layer; and, in keeping with the shallow-depth, narrow-width design, changing the 4k-k raise-then-reduce dimension feature extraction of the original unit structure directly into a same-dimension 4k-4k structure, further widening the network channels.
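As a worked comparison of the two unit structures, the weight counts can be tallied directly (a sketch: bias and batch-norm parameters are ignored, and `c_in` denotes the concatenated input channels entering the unit, with k the growth rate):

```python
def bottleneck_weight_params(c_in, k):
    """Weight counts of the original vs. Reconstructed Bottleneck units.

    DenseNet-BC unit:    1x1 conv (c_in -> 4k), then 3x3 conv (4k -> k)
    Reconstructed unit:  3x3 conv (c_in -> 4k), then 3x3 conv (4k -> 4k);
    two stacked 3x3 convs also cover a 5x5 receptive field, replacing
    a single 5x5 layer as the patent describes.
    """
    original = 1 * 1 * c_in * 4 * k + 3 * 3 * 4 * k * k
    reconstructed = 3 * 3 * c_in * 4 * k + 3 * 3 * 4 * k * 4 * k
    return original, reconstructed

# illustrative values: 64 input channels, growth rate k = 12
orig_p, rec_p = bottleneck_weight_params(c_in=64, k=12)
```

The same-dimension 4k output makes the reconstructed unit wider (and costlier per unit), which is why the patent pairs it with a network of shallower depth and narrower overall width.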
In this embodiment, step S22 specifically comprises: adding parallel dense blocks of similar depth to the DenseBlocks built from the Reconstructed Bottleneck structure to increase the width of the network, i.e., connecting the network in parallel over multiple channels;
since the number of paths n and the network depth d do not exhibit a simple inverse relationship, their values must be determined for each training data set by a small-range search to find a suitable number of paths;
for a single DenseBlock:
let x_0 be the input, H_1 the composite mapping applied to x_0 with output x_1, H_2 the mapping applied to [x_0, x_1], and so on; that is, an L-layer DenseNet has L(L+1)/2 direct connections, each layer taking the concatenation of all preceding layers as input:
x_l = H_l([x_0, x_1, …, x_(l-1)])
The multi-path DenseBlock is then expressed as:
x_j = x_l1 + x_l2 + … + x_ln = H_l1([x_0, x_1, …, x_(l-1)]) + H_l2([x_0, x_1, …, x_(l-1)]) + … + H_ln([x_0, x_1, …, x_(l-1)]),
where x_j is the feature output by the multiple parallel channels and H_ln denotes the feature output by the nth DenseBlock.
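The expressions above can be sketched numerically as follows. Here 1-D feature vectors stand in for feature maps, and a linear map plus ReLU stands in for each composite function H_l (the real unit would be BN-ReLU-conv); all names and shapes are illustrative:

```python
import numpy as np

def dense_layer(prev_feats, w):
    """One dense layer H_l: concatenate all previous features
    [x_0, ..., x_{l-1}] and apply a linear map + ReLU (a stand-in
    for the BN-ReLU-conv composite) to produce new features."""
    x = np.concatenate(prev_feats, axis=0)
    return np.maximum(w @ x, 0.0)

def multipath_denseblock(x0, weight_paths):
    """n parallel DenseBlocks over the same input; outputs are summed:
    x_j = H_l1([x_0, ..., x_{l-1}]) + ... + H_ln([x_0, ..., x_{l-1}])."""
    outs = []
    for weights in weight_paths:       # one weight list per parallel path
        feats = [x0]
        for w in weights:              # layers within one DenseBlock
            feats.append(dense_layer(feats, w))
        outs.append(feats[-1])         # x_ln, the path's final feature
    return sum(outs)                   # element-wise sum across paths

x0 = np.ones(4)                                                    # input x_0
paths = [[np.zeros((4, 4)), np.zeros((4, 8))] for _ in range(2)]   # n = 2 paths
x_j = multipath_denseblock(x0, paths)                              # summed output
```

Note the summed outputs must share one dimension, which the same-dimension Reconstructed Bottleneck of step S21 guarantees.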
The above description is only a preferred embodiment of the present invention; all equivalent changes and modifications made in accordance with the claims of the present invention shall fall within the scope of the present invention.
Claims (5)
1. A multi-channel feature re-labeling image classification method based on an improved DenseNet, characterized by comprising the following steps:
step S1: collecting a training data set and preprocessing it to obtain a preprocessed training data set;
step S2: building a classification model based on a multi-channel feature re-labeling densely connected network;
step S3: training the classification model on the training data set, introducing the evaluation indices Acc and F1-Score to evaluate the training process in real time, saving the model and data in real time, and plotting the final iteration results to obtain a trained classification model;
step S4: inputting the data set to be classified into the trained classification model to obtain the classification result.
2. The improved-DenseNet-based multi-channel feature re-labeling image classification method according to claim 1, characterized in that the training data set preprocessing specifically comprises: applying stretching and rotation transformations and data expansion to the data samples with an image generator to obtain a training sample set larger than the original data set.
3. The improved-DenseNet-based multi-channel feature re-labeling image classification method according to claim 1, wherein step S2 specifically comprises:
step S21: reconstructing the dimension-raising and dimension-reducing Bottleneck structure (1 × 1 conv + 3 × 3 conv) of the DenseNet-BC network into a Reconstructed Bottleneck structure of same-dimension 3 × 3 conv + 3 × 3 conv, suited to a low-depth, narrow-width network;
step S22: using the Reconstructed Bottleneck structure to build dense connection blocks (DenseBlocks), and connecting multiple DenseBlocks in parallel to form a multi-path densely connected structure;
step S23: introducing a Squeeze-and-Excitation (SE) structure with feature re-calibration capability after the multi-path dense connection blocks, and marking important features.
4. The improved-DenseNet-based multi-channel feature re-labeling image classification method according to claim 3, wherein step S21 specifically comprises: improving the Bottleneck unit structure in DenseNet by replacing the 1 × 1 conv of the original structure with a 3 × 3 conv, stacking two 3 × 3 convolutional layers in place of one 5 × 5 convolutional layer; and, in keeping with the shallow-depth, narrow-width design, changing the 4k-k raise-then-reduce dimension feature extraction of the original unit structure directly into a same-dimension 4k-4k structure, further widening the network channels.
5. The improved-DenseNet-based multi-channel feature re-labeling image classification method according to claim 1, characterized in that step S22 specifically comprises: adding parallel dense blocks of similar depth to the DenseBlocks built from the Reconstructed Bottleneck structure to increase the width of the network, i.e., connecting the network in parallel over multiple channels;
since the number of paths n and the network depth d do not exhibit a simple inverse relationship, their values must be determined for each training data set by a small-range search to find a suitable number of paths;
for a single DenseBlock:
let x_0 be the input, H_1 the composite mapping applied to x_0 with output x_1, H_2 the mapping applied to [x_0, x_1], and so on; that is, an L-layer DenseNet has L(L+1)/2 direct connections, each layer taking the concatenation of all preceding layers as input:
x_l = H_l([x_0, x_1, …, x_(l-1)])
The multi-path DenseBlock is then expressed as:
x_j = x_l1 + x_l2 + … + x_ln = H_l1([x_0, x_1, …, x_(l-1)]) + H_l2([x_0, x_1, …, x_(l-1)]) + … + H_ln([x_0, x_1, …, x_(l-1)]),
where x_j is the feature output by the multiple parallel channels and H_ln denotes the feature output by the nth DenseBlock.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010649381.1A CN111767964A (en) | 2020-07-08 | 2020-07-08 | Improved DenseNet-based multi-channel feature re-labeling image classification method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010649381.1A CN111767964A (en) | 2020-07-08 | 2020-07-08 | Improved DenseNet-based multi-channel feature re-labeling image classification method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111767964A true CN111767964A (en) | 2020-10-13 |
Family
ID=72725166
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010649381.1A Pending CN111767964A (en) | 2020-07-08 | 2020-07-08 | Improved DenseNet-based multi-channel feature re-labeling image classification method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111767964A (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20170091963A (en) * | 2016-02-02 | 2017-08-10 | 삼성전자주식회사 | Gesture classification apparatus and method using electromyogram signals |
CN110276148A (en) * | 2019-06-27 | 2019-09-24 | 上海交通大学 | The feature extraction of micro-structure dimensionality reduction and reconstruct implementation method based on self-encoding encoder |
CN110458842A (en) * | 2019-06-26 | 2019-11-15 | 南京邮电大学 | Brain tumor dividing method based on the three-dimensional intensive connection network of binary channels |
CN110472668A (en) * | 2019-07-22 | 2019-11-19 | 华北电力大学(保定) | A kind of image classification method |
CN110570431A (en) * | 2019-09-18 | 2019-12-13 | 东北大学 | Medical image segmentation method based on improved convolutional neural network |
US10635951B1 (en) * | 2018-10-24 | 2020-04-28 | Alibaba Group Holding Limited | Fast computation of a convolutional neural network |
- 2020-07-08: CN application CN202010649381.1A filed; published as CN111767964A, status Pending
Non-Patent Citations (3)
Title |
---|
ZHANG Tong (张彤): "Research on image super-resolution reconstruction based on deep learning", China Masters' Theses Full-text Database (Information Science and Technology, Series I) *
ZHANG Xiying (张锡英) et al.: "A deep learning network fusing STN and DenseNet and its application", Computer Engineering and Applications *
GUO Yurong (郭玉荣) et al.: "End-to-end two-channel feature recalibration DenseNet for image classification", Journal of Image and Graphics *
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112308158B (en) | Multi-source field self-adaptive model and method based on partial feature alignment | |
CN111325751B (en) | CT image segmentation system based on attention convolution neural network | |
CN108509978B (en) | Multi-class target detection method and model based on CNN (CNN) multi-level feature fusion | |
CN111695467B (en) | Spatial spectrum full convolution hyperspectral image classification method based on super-pixel sample expansion | |
CN115049936B (en) | High-resolution remote sensing image-oriented boundary enhanced semantic segmentation method | |
CN113344188A (en) | Lightweight neural network model based on channel attention module | |
CN114821249B (en) | Vehicle weight recognition method based on grouping aggregation attention and local relation | |
CN108596044B (en) | Pedestrian detection method based on deep convolutional neural network | |
CN116740439A (en) | Crowd counting method based on trans-scale pyramid convertors | |
CN114693577B (en) | Infrared polarized image fusion method based on Transformer | |
Al-Amaren et al. | RHN: A residual holistic neural network for edge detection | |
CN115100165A (en) | Colorectal cancer T staging method and system based on tumor region CT image | |
CN114972753A (en) | Lightweight semantic segmentation method and system based on context information aggregation and assisted learning | |
CN113239949A (en) | Data reconstruction method based on 1D packet convolutional neural network | |
CN117058386A (en) | Asphalt road crack detection method based on improved deep Labv3+ network | |
CN111767964A (en) | Improved DenseNet-based multi-channel feature re-labeling image classification method | |
CN115937693A (en) | Road identification method and system based on remote sensing image | |
CN113192076B (en) | MRI brain tumor image segmentation method combining classification prediction and multi-scale feature extraction | |
CN113691863B (en) | Lightweight method for extracting video key frames | |
CN113052810B (en) | Small medical image focus segmentation method suitable for mobile application | |
CN113436198A (en) | Remote sensing image semantic segmentation method for collaborative image super-resolution reconstruction | |
CN114565511A (en) | Lightweight image registration method, system and device based on global homography estimation | |
Cao et al. | Deep feature interactive aggregation network for single image deraining | |
CN116698410B (en) | Rolling bearing multi-sensor data monitoring method based on convolutional neural network | |
CN115457611B (en) | Vein recognition method based on characteristic decoupling network |
Legal Events
Date | Code | Title | Description
---|---|---|---
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| RJ01 | Rejection of invention patent application after publication | Application publication date: 20201013 |