CN114821097A - Multi-scale feature image classification method based on transfer learning - Google Patents

Multi-scale feature image classification method based on transfer learning

Info

Publication number
CN114821097A
CN114821097A (application CN202210365307.6A)
Authority
CN
China
Prior art keywords
network
classification
convolution
data set
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210365307.6A
Other languages
Chinese (zh)
Other versions
CN114821097B (en)
Inventor
龚勋 (Gong Xun)
樊琳 (Fan Lin)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southwest Jiaotong University
Original Assignee
Southwest Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southwest Jiaotong University filed Critical Southwest Jiaotong University
Priority to CN202210365307.6A priority Critical patent/CN114821097B/en
Publication of CN114821097A publication Critical patent/CN114821097A/en
Application granted granted Critical
Publication of CN114821097B publication Critical patent/CN114821097B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 - Pattern recognition
    • G06F18/20 - Analysing
    • G06F18/24 - Classification techniques
    • G06F18/241 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 - Pattern recognition
    • G06F18/20 - Analysing
    • G06F18/25 - Fusion techniques
    • G06F18/253 - Fusion techniques of extracted features
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/04 - Architecture, e.g. interconnection topology
    • G06N3/045 - Combinations of networks
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/04 - Architecture, e.g. interconnection topology
    • G06N3/048 - Activation functions
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/08 - Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention relates to the technical field of multi-scale feature image classification, and in particular to a multi-scale feature image classification method based on transfer learning, comprising the following steps: S1, constructing a data set comprising a source domain and a target domain; S2, training a model on the source domain with a multi-scale feature extraction network to extract features; S3, feeding the extracted features into a classification network for classification training; and S4, performing classification training on data sets of the same task but different domains under transfer learning. The invention allows knowledge learned from one type of data in a model training task to be readily applied to different problems in the same field.

Description

Multi-scale feature image classification method based on transfer learning
Technical Field
The invention relates to the technical field of multi-scale feature image classification, in particular to a multi-scale feature image classification method based on transfer learning.
Background
At present, the most common obstacle when solving problems with deep learning is that a model has a large number of parameters to train, which in turn requires massive amounts of training data. When faced with a specific problem in a given field, data at the scale required to build such a model is generally unavailable.
Disclosure of Invention
The invention provides a multi-scale feature image classification method based on transfer learning, through which knowledge learned on one type of data can be easily applied to different problems in the same field.
The invention discloses a multi-scale feature image classification method based on transfer learning, comprising the following steps:
S1, constructing a data set comprising a source domain and a target domain;
S2, training a model on the source domain with a multi-scale feature extraction network to extract features;
S3, feeding the extracted features into a classification network for classification training;
S4, performing classification training on data sets of the same task but different domains under transfer learning.
Preferably, in step S1, the MNIST data set is used as the source domain, and an MNIST-M data set is constructed as the target domain. MNIST-M is formed by blending MNIST digits with random color patches from the BSDS500 data set; 10 samples are selected from each class to build a small-sample data set under MNIST-M, and the source-domain and target-domain data sets are then each divided into a training set and a test set.
Preferably, after step S1, image preprocessing is performed: the source-domain images are rotated and resized to conform to the network input, and preprocessing operations such as color, saturation, and brightness adjustment are applied; the target-domain images are resized to match the source-domain image size.
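For illustration, a minimal torchvision sketch of this preprocessing follows; the rotation range, jitter strengths, and 32 × 32 input size are assumptions, not values fixed by the patent:

```python
from torchvision import transforms

# Source-domain (MNIST) preprocessing: rotate, resize to the network
# input, and adjust color/saturation/brightness. The concrete angle
# and jitter strengths below are assumed, not specified by the patent.
source_transform = transforms.Compose([
    transforms.Lambda(lambda img: img.convert("RGB")),  # MNIST is grayscale
    transforms.RandomRotation(degrees=10),
    transforms.Resize((32, 32)),
    transforms.ColorJitter(brightness=0.2, saturation=0.2),
    transforms.ToTensor(),
])

# Target-domain (MNIST-M) images are only resized to match the source size.
target_transform = transforms.Compose([
    transforms.Resize((32, 32)),
    transforms.ToTensor(),
])
```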
Preferably, in step S2, the multi-scale feature extraction network is divided into three branches. The first branch contains 12 convolution layers with 3 × 3 kernels, 12 normalization layers, and 12 activation layers, where the 7th to 10th convolutions are dilated convolutions; the second branch contains 12 convolution layers with 5 × 5 kernels, 12 normalization layers, and 12 activation layers, where the 7th to 10th convolutions are dilated convolutions; the third branch contains 12 convolution layers with 7 × 7 kernels, 12 normalization layers, and 12 activation layers, where the 7th to 10th convolutions are dilated convolutions. The network takes as training input the source-domain MNIST images and their corresponding labels; an input image first enters the feature extraction network, whose three branches extract features at different scales in parallel; the different-scale features are then interpolated and fused, yielding the final feature:
F_concatenate = F_{3×3} ⊕ F_{5×5} ⊕ F_{7×7}

where F_concatenate denotes the overall feature after multi-scale fusion, F_{i×i} (i = 3, 5, 7) denotes the features extracted by the network at each scale, and ⊕ denotes concatenation of the different-scale features along the first dimension.
Preferably, in step S3, the classification network consists of three convolution layers and three fully connected layers; the training input is F_concatenate together with the label of each picture; Adam is used to jointly update the parameters of the classification network and the feature extraction network; the cross-entropy loss is used as the classification loss of the network:
L_s = -(1/N) Σ_{i=1}^{N} [ y_{s,i} log(p_{s,i}) + (1 - y_{s,i}) log(1 - p_{s,i}) ]

where N is the number of samples, y_{s,i} is the label of source-domain sample i (1 for the positive class, 0 for the negative class), and p_{s,i} is the probability that sample i is predicted positive;
the model is trained using source domain data until the loss function no longer falls, and the multi-scale feature extraction network and the classification network are tested using a source domain test set.
Preferably, in step S4, steps S2 and S3 yield a multi-scale feature extraction and classification network trained on the source domain; a transfer learning strategy is then adopted in which the feature sub-network layers are frozen and iterative training is performed on the target-domain data set, producing the final small-sample classification model and enabling classification of the MNIST-M data set; the cross-entropy loss is used as the classification loss of the network:
L_t = -(1/N) Σ_{i=1}^{N} [ y_{t,i} log(p_{t,i}) + (1 - y_{t,i}) log(1 - p_{t,i}) ]

where N is the number of samples, y_{t,i} is the label of target-domain sample i (1 for the positive class, 0 for the negative class), and p_{t,i} is the probability that sample i is predicted positive.
The method uses a data-rich source domain to assist in training on a data-poor target domain from a different field, and constructs a multi-scale feature extraction network that fuses features from different receptive fields of the image using convolution kernels of different sizes together with dilated convolutions. The invention allows knowledge learned from one type of data in a model training task to be readily applied to different problems in the same field.
Drawings
Fig. 1 is a flowchart of a multi-scale feature image classification method based on transfer learning in embodiment 1.
Detailed Description
For a further understanding of the invention, reference should be made to the following detailed description taken in conjunction with the accompanying drawings and examples. It is to be understood that the examples are illustrative of the invention and not limiting.
Example 1
As shown in fig. 1, the present embodiment provides a multi-scale feature image classification method based on transfer learning, comprising the following steps:
s1, constructing a data set; the MNIST dataset is used as a source domain (source). And constructing an MNIST-M data set as a target domain (target), wherein the MNIST-M data set is formed by mixing MNIST numbers and random color blocks in the BSDS500 data set, 10 samples are selected from each class, and a small sample data set under MNIST-M is constructed. Dividing a source domain (source) data set and a target domain (target) data set into a training set and a testing set;
S2, training a model on the source domain (source) with a multi-scale feature extraction network to extract features. The multi-scale feature extraction network is divided into three branches. The first branch contains 12 convolution layers with 3 × 3 kernels, 12 normalization layers, and 12 activation layers, of which the 7th to 10th convolutions are dilated convolutions; the second branch contains 12 convolution layers with 5 × 5 kernels, 12 normalization layers, and 12 activation layers, of which the 7th to 10th convolutions are dilated convolutions; the third branch contains 12 convolution layers with 7 × 7 kernels, 12 normalization layers, and 12 activation layers, of which the 7th to 10th convolutions are dilated convolutions. The network takes as training input the source-domain (source) MNIST images and their corresponding labels; an input image first enters the feature extraction network, whose three branches extract features at different scales in parallel; the different-scale features are then interpolated and fused, yielding the final feature:
F_concatenate = F_{3×3} ⊕ F_{5×5} ⊕ F_{7×7}

where F_concatenate denotes the overall feature after multi-scale fusion, F_{i×i} (i = 3, 5, 7) denotes the features extracted by the network at each scale, and ⊕ denotes concatenation of the different-scale features along the first dimension.
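A minimal PyTorch sketch of this three-branch extractor follows. The patent fixes the kernel sizes, the 12-layer depth per branch, and that the 7th to 10th convolutions are dilated; the channel width (32), the dilation rate (2), the 3-channel input, and bilinear interpolation are assumptions made for illustration:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def make_branch(kernel_size, channels=32, num_layers=12):
    """One branch: 12 x (conv + batch norm + ReLU); the 7th to 10th
    convolutions are dilated. Channel width and dilation rate 2 are
    illustrative assumptions."""
    layers, in_ch = [], 3  # assumes 3-channel input images
    for i in range(num_layers):
        dilation = 2 if 6 <= i <= 9 else 1       # 7th-10th convs (0-indexed 6-9)
        padding = dilation * (kernel_size // 2)  # preserve spatial size
        layers += [
            nn.Conv2d(in_ch, channels, kernel_size,
                      padding=padding, dilation=dilation),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        ]
        in_ch = channels
    return nn.Sequential(*layers)

class MultiScaleExtractor(nn.Module):
    """Three parallel branches with 3x3, 5x5, and 7x7 kernels."""
    def __init__(self):
        super().__init__()
        self.branch3 = make_branch(3)
        self.branch5 = make_branch(5)
        self.branch7 = make_branch(7)

    def forward(self, x):
        feats = [self.branch3(x), self.branch5(x), self.branch7(x)]
        # Interpolate every branch output to a common size, then
        # concatenate along the channel dimension (F_concatenate).
        h, w = feats[0].shape[-2:]
        feats = [F.interpolate(f, size=(h, w), mode="bilinear",
                               align_corners=False) for f in feats]
        return torch.cat(feats, dim=1)
```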
S3, feeding the extracted features into a classification network for classification training. The classification network consists of three convolution layers and three fully connected layers. The training input is F_concatenate together with the label of each picture. Adam is used to jointly update the parameters of the classification network and the feature extraction network. The cross-entropy loss is used as the classification loss of the network:
L_s = -(1/N) Σ_{i=1}^{N} [ y_{s,i} log(p_{s,i}) + (1 - y_{s,i}) log(1 - p_{s,i}) ]

where N is the number of samples, y_{s,i} is the label of source-domain (source) sample i (1 for the positive class, 0 for the negative class), and p_{s,i} is the probability that sample i is predicted positive.
The model is trained with source-domain (source) data until the loss no longer decreases, and the multi-scale feature extraction network and the classification network are then tested on the source-domain (source) test set.
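Continuing the sketch above, the classification head and the joint Adam update could look as follows; the channel widths, hidden sizes, pooling step, learning rate, and the source_loader DataLoader are assumptions, and nn.CrossEntropyLoss is the standard multi-class form of the loss written above:

```python
class Classifier(nn.Module):
    """Three convolution layers followed by three fully connected layers;
    channel widths, hidden sizes, and the pooling step are assumptions."""
    def __init__(self, in_ch=96, num_classes=10):  # 96 = 3 branches x 32 ch
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(in_ch, 64, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool2d(4),
        )
        self.fc = nn.Sequential(
            nn.Flatten(),
            nn.Linear(32 * 4 * 4, 256), nn.ReLU(inplace=True),
            nn.Linear(256, 64), nn.ReLU(inplace=True),
            nn.Linear(64, num_classes),
        )

    def forward(self, x):
        return self.fc(self.conv(x))

extractor, classifier = MultiScaleExtractor(), Classifier()
# Adam jointly updates the feature extraction and classification networks.
optimizer = torch.optim.Adam(
    list(extractor.parameters()) + list(classifier.parameters()), lr=1e-3)
criterion = nn.CrossEntropyLoss()  # multi-class form of the loss above

for images, labels in source_loader:  # source_loader: assumed DataLoader
    logits = classifier(extractor(images))
    loss = criterion(logits, labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```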
S4, performing classification training on data sets of the same task but different domains under transfer learning: steps S2 and S3 yield a multi-scale feature extraction and classification network trained on the source domain; a transfer learning strategy is then adopted in which the feature sub-network layers are frozen and iterative training is performed on the target-domain (target) data set, producing the final small-sample classification model and enabling classification of the MNIST-M data set. The cross-entropy loss is used as the classification loss of the network:
L_t = -(1/N) Σ_{i=1}^{N} [ y_{t,i} log(p_{t,i}) + (1 - y_{t,i}) log(1 - p_{t,i}) ]

where N is the number of samples, y_{t,i} is the label of target-domain (target) sample i (1 for the positive class, 0 for the negative class), and p_{t,i} is the probability that sample i is predicted positive. The final test accuracy on the target data set is 99%.
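The transfer step can be sketched as below, reusing the modules from the previous sketches; freezing the entire extractor (one reading of 'freezing the feature sub-network layer'), the epoch count, the learning rate, and target_loader are assumptions:

```python
# Freeze the source-trained feature extractor ("feature sub-network"),
# then fine-tune only the classifier on the 10-shot MNIST-M training set.
for p in extractor.parameters():
    p.requires_grad = False
extractor.eval()  # also freezes BatchNorm running statistics

ft_optimizer = torch.optim.Adam(classifier.parameters(), lr=1e-4)
for epoch in range(50):  # fixed count stands in for "until loss stops falling"
    for images, labels in target_loader:  # target_loader: assumed DataLoader
        logits = classifier(extractor(images))
        loss = criterion(logits, labels)
        ft_optimizer.zero_grad()
        loss.backward()
        ft_optimizer.step()
```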
The invention and its embodiments have been described above schematically, and the description is not limiting; what is shown in the drawings is only one embodiment of the invention, and the actual structure is not limited thereto. Therefore, structures and embodiments similar to this technical solution, devised by those skilled in the art in light of this teaching without inventive effort and without departing from the spirit of the invention, shall fall within the protection scope of the invention.

Claims (6)

1. A multi-scale feature image classification method based on transfer learning, characterized by comprising the following steps:
S1, constructing a data set comprising a source domain and a target domain;
S2, training a model on the source domain with a multi-scale feature extraction network to extract features;
S3, feeding the extracted features into a classification network for classification training;
S4, performing classification training on data sets of the same task but different domains under transfer learning.
2. The multi-scale feature image classification method based on transfer learning according to claim 1, characterized in that: in step S1, the MNIST data set is used as the source domain, and an MNIST-M data set is constructed as the target domain; MNIST-M is formed by blending MNIST digits with random color patches from the BSDS500 data set; 10 samples are selected from each class to build a small-sample data set under MNIST-M, and the source-domain and target-domain data sets are then each divided into a training set and a test set.
3. The multi-scale feature image classification method based on transfer learning according to claim 2, characterized in that: after step S1, image preprocessing is performed: the source-domain images are rotated and resized to conform to the network input, and preprocessing operations such as color, saturation, and brightness adjustment are applied; the target-domain images are resized to match the source-domain image size.
4. The multi-scale feature image classification method based on transfer learning according to claim 3, characterized in that: in step S2, the multi-scale feature extraction network is divided into three branches; the first branch contains 12 convolution layers with 3 × 3 kernels, 12 normalization layers, and 12 activation layers, where the 7th to 10th convolutions are dilated convolutions; the second branch contains 12 convolution layers with 5 × 5 kernels, 12 normalization layers, and 12 activation layers, where the 7th to 10th convolutions are dilated convolutions; the third branch contains 12 convolution layers with 7 × 7 kernels, 12 normalization layers, and 12 activation layers, where the 7th to 10th convolutions are dilated convolutions; the network takes as training input the source-domain MNIST images and their corresponding labels; an input image first enters the feature extraction network, whose three branches extract features at different scales in parallel; the different-scale features are then interpolated and fused, yielding the final feature:
F_concatenate = F_{3×3} ⊕ F_{5×5} ⊕ F_{7×7}

where F_concatenate denotes the overall feature after multi-scale fusion, F_{i×i} (i = 3, 5, 7) denotes the features extracted by the network at each scale, and ⊕ denotes concatenation of the different-scale features along the first dimension.
5. The multi-scale feature image classification method based on transfer learning according to claim 4, characterized in that: in step S3, the classification network comprises three convolution layers and three fully connected layers; the training input is F_concatenate together with the label of each picture; Adam is used to jointly update the parameters of the classification network and the feature extraction network; the cross-entropy loss is used as the classification loss of the network:
L_s = -(1/N) Σ_{i=1}^{N} [ y_{s,i} log(p_{s,i}) + (1 - y_{s,i}) log(1 - p_{s,i}) ]

where N is the number of samples, y_{s,i} is the label of source-domain sample i (1 for the positive class, 0 for the negative class), and p_{s,i} is the probability that sample i is predicted positive;
the model is trained using source domain data until the loss function no longer falls, and the multi-scale feature extraction network and the classification network are tested using a source domain test set.
6. The multi-scale feature image classification method based on transfer learning according to claim 5, characterized in that: in step S4, steps S2 and S3 yield a multi-scale feature extraction and classification network trained on the source domain; a transfer learning strategy is then adopted in which the feature sub-network layers are frozen and iterative training is performed on the target-domain data set, producing the final small-sample classification model and enabling classification of the MNIST-M data set; the cross-entropy loss is used as the classification loss of the network:
L_t = -(1/N) Σ_{i=1}^{N} [ y_{t,i} log(p_{t,i}) + (1 - y_{t,i}) log(1 - p_{t,i}) ]

where N is the number of samples, y_{t,i} is the label of target-domain sample i (1 for the positive class, 0 for the negative class), and p_{t,i} is the probability that sample i is predicted positive.
CN202210365307.6A 2022-04-07 2022-04-07 Multi-scale feature image classification method based on transfer learning Active CN114821097B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210365307.6A CN114821097B (en) 2022-04-07 2022-04-07 Multi-scale feature image classification method based on transfer learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210365307.6A CN114821097B (en) 2022-04-07 2022-04-07 Multi-scale feature image classification method based on transfer learning

Publications (2)

Publication Number Publication Date
CN114821097A (en) 2022-07-29
CN114821097B CN114821097B (en) 2023-04-18

Family

ID=82535449

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210365307.6A Active CN114821097B (en) 2022-04-07 2022-04-07 Multi-scale feature image classification method based on transfer learning

Country Status (1)

Country Link
CN (1) CN114821097B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116630679A (en) * 2023-03-29 2023-08-22 Nantong University Osteoporosis identification method based on CT image and domain invariant feature
CN118520357A (en) * 2024-07-23 2024-08-20 Hunan University of Science and Technology Bearing fault diagnosis method, system and storage medium based on characterization guidance
CN118520357B (en) * 2024-07-23 2024-09-24 Hunan University of Science and Technology Bearing fault diagnosis method, system and storage medium based on characterization guidance

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110509109A (en) * 2019-07-16 2019-11-29 Xi'an Jiaotong University Tool Wear Monitoring method based on multiple dimensioned depth convolution loop neural network
US20200226368A1 (en) * 2019-01-15 2020-07-16 International Business Machines Corporation Weakly and fully labeled mammogram classification and localization with a dual branch deep neural network
CN111814854A (en) * 2020-06-28 2020-10-23 Beijing Jiaotong University Target re-identification method adaptive to unsupervised domain
CN112906756A (en) * 2021-01-28 2021-06-04 Kunshan Zelicheng Intelligent Equipment Co., Ltd. High-image classification method and system for cross-channel quantity transfer learning
CN112926547A (en) * 2021-04-13 2021-06-08 Beihang University Small sample transfer learning method for classifying and identifying aircraft electric signals

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200226368A1 (en) * 2019-01-15 2020-07-16 International Business Machines Corporation Weakly and fully labeled mammogram classification and localization with a dual branch deep neural network
CN110509109A (en) * 2019-07-16 2019-11-29 Xi'an Jiaotong University Tool Wear Monitoring method based on multiple dimensioned depth convolution loop neural network
CN111814854A (en) * 2020-06-28 2020-10-23 Beijing Jiaotong University Target re-identification method adaptive to unsupervised domain
CN112906756A (en) * 2021-01-28 2021-06-04 Kunshan Zelicheng Intelligent Equipment Co., Ltd. High-image classification method and system for cross-channel quantity transfer learning
CN112926547A (en) * 2021-04-13 2021-06-08 Beihang University Small sample transfer learning method for classifying and identifying aircraft electric signals

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
"融合多尺度特征的多监督人脸活体检测算法" *
YAROSLAV GANIN等: "Unsupervised Domain Adaptation by Backpropagation", 《PROCEEDINGS OF THE 32ND INTERNATIONAL CONFERENCE ON MACHINE LEARNING》 *
YUN JIANG等: "Multi-Scale and Multi-Branch Convolutional Neural Network for Retinal Image Segmentation" *
YUAN CAIYAN: "Research on bearing fault diagnosis via multi-scale multi-channel feature learning and transfer in deep convolutional networks" (in Chinese) *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116630679A (en) * 2023-03-29 2023-08-22 Nantong University Osteoporosis identification method based on CT image and domain invariant feature
CN116630679B (en) * 2023-03-29 2024-06-04 Nantong University Osteoporosis identification method based on CT image and domain invariant feature
CN118520357A (en) * 2024-07-23 2024-08-20 Hunan University of Science and Technology Bearing fault diagnosis method, system and storage medium based on characterization guidance
CN118520357B (en) * 2024-07-23 2024-09-24 Hunan University of Science and Technology Bearing fault diagnosis method, system and storage medium based on characterization guidance

Also Published As

Publication number Publication date
CN114821097B (en) 2023-04-18

Similar Documents

Publication Publication Date Title
CN108764292B (en) Deep learning image target mapping and positioning method based on weak supervision information
CN113486981B (en) RGB image classification method based on multi-scale feature attention fusion network
CN110232394B (en) Multi-scale image semantic segmentation method
CN109741341B (en) Image segmentation method based on super-pixel and long-and-short-term memory network
CN111160217B (en) Method and system for generating countermeasure sample of pedestrian re-recognition system
CN106951911A (en) A kind of quick multi-tag picture retrieval system and implementation method
CN105528638A (en) Method for grey correlation analysis method to determine number of hidden layer characteristic graphs of convolutional neural network
CN112529005B (en) Target detection method based on semantic feature consistency supervision pyramid network
CN112949929B (en) Knowledge tracking method and system based on collaborative embedded enhanced topic representation
CN114821097B (en) Multi-scale feature image classification method based on transfer learning
CN113128455B (en) Cell image reconstruction model training method and system
CN111832580B (en) SAR target recognition method combining less sample learning and target attribute characteristics
CN112131403B (en) Knowledge graph representation learning method in dynamic environment
CN108229557A (en) The acceleration training method and system of a kind of neural network with label
CN111768792B (en) Audio steganalysis method based on convolutional neural network and domain countermeasure learning
CN116129219A (en) SAR target class increment recognition method based on knowledge robust-rebalancing network
CN111753966A (en) Implementation method for implementing multi-label model training framework by using missing multi-label data
CN113378962B (en) Garment attribute identification method and system based on graph attention network
CN110991563A (en) Capsule network random routing algorithm based on feature fusion
CN111353583A (en) Deep learning network based on group convolution characteristic topological space and training method thereof
CN114511733A (en) Fine-grained image identification method and device based on weak supervised learning and readable medium
CN112132867B (en) Remote sensing image change detection method and device
CN110751061B (en) SAR image recognition method, device, equipment and storage medium based on SAR network
CN117058394A (en) Zero sample semantic segmentation method
CN116343111A (en) Abandoned land monitoring method and system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant