CN113989575A - Small sample image classification method and system based on specific parameter distribution generation - Google Patents
Small sample image classification method and system based on specific parameter distribution generation
- Publication number
- CN113989575A CN113989575A CN202111465249.6A CN202111465249A CN113989575A CN 113989575 A CN113989575 A CN 113989575A CN 202111465249 A CN202111465249 A CN 202111465249A CN 113989575 A CN113989575 A CN 113989575A
- Authority
- CN
- China
- Prior art keywords
- small sample
- sample image
- image classification
- network
- neural network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Abstract
The invention relates to the field of small sample (few-shot) image classification, and in particular to a small sample image classification method and system based on specific parameter distribution generation, which can effectively improve the accuracy of small sample image classification. The technical scheme comprises the following steps: constructing the overall architecture of a small sample image classification neural network; using the parameters of a variety of trained convolutional neural networks as a parameter training set; training a distribution learning network with the parameter training set; generating the initial parameters of the small sample image classification neural network through the trained distribution learning network; constructing a parameter adjustment network; training the small sample image classification neural network on a target data set under the adjustment of the parameter adjustment network; and inputting an image to be classified, extracting its features through the small sample image classification neural network, and judging its class. The method is suitable for classifying small sample images.
Description
Technical Field
The invention relates to the field of small sample image classification, in particular to a small sample image classification method and system based on specific parameter distribution generation.
Background
Training an image classification neural network has always been highly complex, because the network must learn "knowledge" from large amounts of data through repeated iterative training. In a small sample learning task, however, there are not enough samples for the neural network to learn from. One common remedy is to compensate for the insufficient training caused by too few samples by transferring the knowledge of other, adequately trained networks into the small sample learning neural network. This method of transferring knowledge from one network to another is called transfer learning. In recent years, more and more small sample learning work has been combined with transfer learning, which can transfer the knowledge of a neural network adequately trained on a large data set into a new network that handles a new task. The mainstream neural network transfer learning methods comprise direct parameter transfer and parameter distribution transfer: direct parameter transfer takes the trained network parameters directly as part of the current task network's parameters, while parameter distribution transfer takes the distribution of the trained network parameters as a regularization term to guide the training of the current task network.
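The two mainstream transfer strategies described above can be sketched as follows. This is an illustrative sketch, not the patented method; the Gaussian fit of the source weights and the `strength` coefficient are assumptions made for illustration.

```python
import numpy as np

# Direct parameter transfer: the trained source weights are used directly
# as (part of) the parameters of the current task network.
def direct_transfer(source_weights):
    return source_weights.copy()

# Parameter distribution transfer: the distribution of the trained source
# weights (here fitted as a single Gaussian) is used as a regularization
# term that guides training of the current task network.
def distribution_penalty(current_weights, source_weights, strength=0.01):
    mu = source_weights.mean()
    sigma = source_weights.std() + 1e-8
    return strength * float(np.mean(((current_weights - mu) / sigma) ** 2))

source = np.random.randn(64)
copied = direct_transfer(source)
penalty = distribution_penalty(source, source)  # small when weights match the source distribution
```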
However, the current mainstream neural network transfer learning methods train the network model directly on the existing data, and their classification accuracy is not high.
Disclosure of Invention
The invention aims to provide a small sample image classification method and system based on specific parameter distribution generation, which can effectively improve the result accuracy of the small sample image classification method.
The invention achieves this aim with the following technical scheme. The small sample image classification method based on specific parameter distribution generation comprises the following steps:
step 1, constructing an overall architecture of a small sample image classification neural network, wherein the overall architecture comprises an architecture of a distribution learning network, an architecture of a parameter adjustment network and an architecture of the small sample image classification neural network;
step 2, obtaining the parameters of a variety of convolutional neural networks as a parameter training set;
step 3, training the distribution learning network with the parameter training set;
step 4, generating the initial parameters of the small sample image classification neural network through the trained distribution learning network;
step 5, under the adjustment of the parameter adjustment network, training the small sample image classification neural network with a target data set, the target data set being the set of images to be classified;
and step 6, inputting an image to be classified, extracting its features through the small sample image classification neural network, and judging its class.
Further, in step 1, the parameter adjustment network and the distribution learning network have the same structure and parameters.
Further, in step 2, the various convolutional neural networks include the VGG, DenseNet, AlexNet and ResNet convolutional neural networks.
Further, in step 3, the distribution learning network is constructed as a generative adversarial network (GAN), and the optimization objective function of the distribution learning network is:

min_G max_D V(D, G) = E_{x~P_data}[log D(x)] + E_{z~P_z}[log(1 - D(G(z)))]

where G and D denote the generator and the discriminator respectively, x is data from the training data set, z is data sampled from the noise model, P_data denotes the distribution of the real data, P_z denotes the distribution of the simulated data, and E_{x~P_data}[·] denotes the expectation over x drawn from P_data.
Further, according to the strategy of first optimizing D and then optimizing G, the optimization objective function is decomposed as:

max_D V(D, G) = E_{x~P_data}[log D(x)] + E_{z~P_z}[log(1 - D(G(z)))]
min_G V(D, G) = E_{z~P_z}[log(1 - D(G(z)))]
further, in step 4, a specific method for generating the initial parameter includes:
randomly generating random Gaussian noise images with the number of convolution kernels meeting the requirement of the small sample learning network being N and 7 multiplied by 7, and then inputting the Gaussian noise images into the distribution learning network to generate initial parameters of the small sample image classification neural network.
Further, in step 5, the specific method of training the small sample image classification neural network with the target data set comprises:
performing adjustment training of the small sample image classification neural network on the target data set to obtain new parameters; adjusting the new parameters with the parameter adjustment network; then replacing the corresponding original parameters in the small sample image classification neural network with the adjusted parameters, and performing adjustment training on the target data set again, until the loss function of the small sample image classification neural network on the target data set reaches a set condition.
Further, in step 6, the specific method of extracting the features of the image through the small sample image classification neural network and judging its class comprises:
performing feature extraction on the input image with the small sample image classification neural network, the features being the feature maps obtained after convolution with the convolution kernel layers of the network; pooling the feature maps to reduce their dimensionality to a set value; and classifying the feature maps with the final fully connected layer to judge the class of the image.
The small sample image classification system based on specific parameter distribution generation applies the above small sample image classification method based on specific parameter distribution generation and comprises:
a parameter generation module for constructing the overall architecture of the small sample image classification neural network, the overall architecture comprising the architecture of the distribution learning network, the architecture of the parameter adjustment network and the architecture of the small sample image classification neural network, taking the parameters of a variety of convolutional neural networks as a parameter training set, training the distribution learning network with the parameter training set, and generating the initial parameters of the small sample image classification neural network through the trained distribution learning network;
a network training module for training the small sample image classification neural network with a target data set under the adjustment of the parameter adjustment network, the target data set being the set of images to be classified;
and an online classification module for inputting an image to be classified, extracting its features through the small sample image classification neural network, and judging its class.
The beneficial effects of the invention are as follows: a network framework comprising a distribution learning network, a parameter adjustment network and a small sample image classification network is constructed, so that the parameters of the small sample image classification network can be generated directly, rather than the network being trained directly on the existing data as in traditional methods. This ensures the rationality of the network framework. Since only a small number of labeled samples are needed for high-accuracy classification, the method of the invention is extremely low in cost and easy to implement, and can effectively improve the accuracy of small sample image classification.
Drawings
FIG. 1 is a flowchart of the small sample image classification method based on specific parameter distribution generation according to the invention.
Fig. 2 is a block diagram of the small sample image classification system based on specific parameter distribution generation according to the invention.
FIG. 3 is a schematic diagram of the overall architecture of the invention.
In the drawings, 101 denotes the transfer source network parameters, 102 the initial parameters, 103 the small sample learning network, and 104 the small sample training data; a denotes the parameter generation method flow, b the network training method flow, and c the online classification method flow.
Detailed Description
The invention is described in more detail below with reference to the drawings and embodiments.
Fig. 1 is a flowchart of the small sample image classification method based on specific parameter distribution generation according to the invention, which comprises the parameter generation method flow a, the network training method flow b and the online classification method flow c. The specific steps are as follows:
step 1, constructing the overall architecture of a small sample image classification neural network;
step 2, obtaining the parameters of a variety of convolutional neural networks as a parameter training set;
step 3, training a distribution learning network with the parameter training set;
step 4, generating the initial parameters of the small sample image classification neural network through the trained distribution learning network;
step 5, constructing a parameter adjustment network, wherein the parameter adjustment network and the distribution learning network have the same parameters;
step 6, under the adjustment of the parameter adjustment network, training the small sample image classification neural network with a target data set, the target data set being the set of images to be classified;
and step 7, inputting an image to be classified, extracting its features through the small sample image classification neural network, and judging its class.
Steps 1-4 constitute the parameter generation method flow, steps 5-6 the network training method flow, and step 7 the online classification method flow.
In step 1, the overall architecture comprises the distribution learning network, the parameter adjustment network and the small sample image classification neural network to be constructed.
In step 2, the various convolutional neural networks include the VGG, DenseNet, AlexNet and ResNet convolutional neural networks.
In step 3, the distribution learning network is constructed as a generative adversarial network (GAN), and the optimization objective function of the distribution learning network is:

min_G max_D V(D, G) = E_{x~P_data}[log D(x)] + E_{z~P_z}[log(1 - D(G(z)))]

where G and D denote the generator and the discriminator respectively, x is data from the training data set, z is data sampled from the noise model, P_data denotes the distribution of the real data, P_z denotes the distribution of the simulated data, and E_{x~P_data}[·] denotes the expectation over x drawn from P_data.
According to the strategy of first optimizing D and then optimizing G, the optimization objective function is decomposed as:

max_D V(D, G) = E_{x~P_data}[log D(x)] + E_{z~P_z}[log(1 - D(G(z)))]
min_G V(D, G) = E_{z~P_z}[log(1 - D(G(z)))]
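Under the usual GAN sign conventions, the two decomposed objectives can be checked numerically. This is a sketch only: the logistic discriminator `d` and the Gaussian samples are stand-ins for illustration, not the networks of the invention.

```python
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

# Discriminator objective, maximized over D:
#   E_x[log D(x)] + E_z[log(1 - D(G(z)))]
def d_objective(d, real, fake):
    return float(np.mean(np.log(d(real))) + np.mean(np.log(1.0 - d(fake))))

# Generator objective, minimized over G (the only term that depends on G):
#   E_z[log(1 - D(G(z)))]
def g_objective(d, fake):
    return float(np.mean(np.log(1.0 - d(fake))))

rng = np.random.default_rng(0)
real = rng.normal(1.0, 1.0, 1000)    # stand-in "real" samples
fake = rng.normal(-1.0, 1.0, 1000)   # stand-in generated samples
d = sigmoid                          # toy discriminator

v_d = d_objective(d, real, fake)
v_g = g_objective(d, fake)
```

The generator term appears unchanged in both objectives, which is why alternating optimization of D and G optimizes the single value function V(D, G).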
in step 4, the specific method for generating the initial parameters includes:
randomly generating random Gaussian noise images with the number of convolution kernels meeting the requirement of the small sample learning network being N and 7 multiplied by 7, and then inputting the Gaussian noise images into the distribution learning network to generate initial parameters of the small sample image classification neural network.
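This step can be sketched as follows. The generator `g` here is a stand-in for the trained distribution learning network, and N = 16 is an assumed kernel count for illustration.

```python
import numpy as np

N = 16                               # assumed number of convolution kernels required
noise = np.random.randn(N, 7, 7)     # one 7x7 Gaussian noise image per kernel

# Stand-in for the trained distribution learning network: maps each noise
# image to an initial 7x7 convolution kernel of the classification network.
def generate_initial_kernels(g, z):
    return np.stack([g(zi) for zi in z])

kernels = generate_initial_kernels(np.tanh, noise)  # shape (N, 7, 7)
```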
In step 6, the specific method of training the small sample image classification neural network with the target data set is as follows:
perform adjustment training of the small sample image classification neural network on the target data set to obtain new parameters; adjust the new parameters with the parameter adjustment network; then replace the corresponding original parameters in the small sample image classification neural network with the adjusted parameters, and perform adjustment training on the target data set again, until the loss function of the small sample image classification neural network on the target data set reaches a set condition.
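The alternating adjust-and-replace loop above can be sketched as the following skeleton. `train_step`, `adjust` and `loss` are stand-ins for the actual fine-tuning procedure, the parameter adjustment network and the loss function; the toy instantiation is for illustration only.

```python
def train_with_adjustment(params, train_step, adjust, loss, tol=1e-3, max_rounds=100):
    """Alternate: fine-tune on the target set, pass the new parameters through
    the parameter adjustment network, write them back, and repeat until the
    loss on the target data set reaches the stopping condition."""
    for _ in range(max_rounds):
        params = train_step(params)   # adjustment training -> new parameters
        params = adjust(params)       # parameter adjustment network
        if loss(params) < tol:        # set condition on the loss function
            break
    return params

# Toy instantiation: a scalar "parameter" contracted towards 0 each round.
final = train_with_adjustment(
    params=10.0,
    train_step=lambda p: 0.5 * p,   # stand-in for one round of fine-tuning
    adjust=lambda p: 0.9 * p,       # stand-in for the adjustment network
    loss=abs,
)
```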
In step 7, the specific method of extracting the features of the image through the small sample image classification neural network and judging its class is as follows:
perform feature extraction on the input image with the small sample image classification neural network, the features being the feature maps obtained after convolution with the convolution kernel layers of the network; pool the feature maps to reduce their dimensionality to a set value; and classify the feature maps with the final fully connected layer to judge the class of the image.
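The forward pass of step 7 can be sketched as follows. This is a minimal sketch, with valid convolution, global average pooling as the dimensionality-reducing pooling, and a final fully connected layer; all shapes and the class count are illustrative assumptions.

```python
import numpy as np

def conv2d_valid(img, kernel):
    """Plain 'valid' 2-D convolution (correlation form) of a single channel."""
    h, w = img.shape
    kh, kw = kernel.shape
    out = np.empty((h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(img[i:i + kh, j:j + kw] * kernel)
    return out

def classify(img, kernels, fc_weights):
    # Feature maps from the convolution kernel layer, pooled to one value each
    # (global average pooling reduces each map to the set dimensionality of 1).
    feats = np.array([conv2d_valid(img, k).mean() for k in kernels])
    logits = fc_weights @ feats        # final fully connected layer
    return int(np.argmax(logits))      # judged class index

rng = np.random.default_rng(0)
image = rng.random((28, 28))
kernels = rng.standard_normal((16, 7, 7))
fc = rng.standard_normal((10, 16))     # 10 assumed classes
pred = classify(image, kernels, fc)
```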
Fig. 2 is a block diagram of the small sample image classification system based on specific parameter distribution generation according to the invention, which comprises:
a parameter generation module for constructing the overall architecture of the small sample image classification neural network, taking the parameters of a variety of convolutional neural networks as a parameter training set, training the distribution learning network with the parameter training set, and generating the initial parameters of the small sample image classification neural network through the trained distribution learning network;
a network training module for constructing the parameter adjustment network, which has the same parameters as the distribution learning network, and for training the small sample image classification neural network with a target data set under the adjustment of the parameter adjustment network, the target data set being the set of images to be classified;
and an online classification module for inputting an image to be classified, extracting its features through the small sample image classification neural network, and judging its class.
When the corresponding parameters are acquired from a convolutional neural network, a normalization operation needs to be performed on each convolution kernel. The specific operation is as follows:
the convolution kernel of the largest size, here 7 × 7, is selected as the basic template, and convolution kernels of other sizes are expanded to this uniform size. Kernels whose area is less than a quarter of the 7 × 7 template, such as 1 × 1, 2 × 2 and 3 × 3 kernels, are tiled by an integer multiple and the remaining positions are filled with 0; kernels of other sizes are padded directly with 0 to 7 × 7. Finally, all convolution kernels are normalized by their mean and variance.
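The normalization operation can be sketched as follows, under the reading above: kernels smaller than a quarter of the template area are tiled and then zero-padded, other sizes are zero-padded directly, and each result is normalized to zero mean and unit variance. The last step is an assumption about the garbled original wording.

```python
import numpy as np

def to_template(kernel, target=7):
    """Expand a square convolution kernel onto the 7x7 basic template."""
    s = kernel.shape[0]
    out = np.zeros((target, target))
    if s * s < (target * target) / 4:            # e.g. 1x1, 2x2, 3x3 kernels:
        reps = target // s                       # tile by the largest integer multiple,
        tiled = np.tile(kernel, (reps, reps))
        out[:tiled.shape[0], :tiled.shape[1]] = tiled  # pad the remainder with 0
    else:                                        # other sizes: pad directly with 0
        out[:s, :s] = kernel
    mu = out.mean()
    sigma = out.std() + 1e-8                     # assumed mean/variance normalization
    return (out - mu) / sigma

t3 = to_template(np.ones((3, 3)))   # 3x3 tiled to 6x6, last row/column zero-padded
t5 = to_template(np.ones((5, 5)))   # 5x5 padded directly to 7x7
```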
FIG. 3 is a schematic diagram of the overall architecture of the invention, comprising the architecture of the distribution learning network, the architecture of the parameter adjustment network and the architecture of the small sample image classification neural network 103. The distribution learning network is trained with the transfer source network parameters 101 and generates the initial parameters 102; under the adjustment of the parameter adjustment network, the small sample image classification neural network is trained with the small sample training data 104, which is the set of images to be classified. After training, an image to be classified is input, its features are extracted through the small sample image classification neural network, and its class is judged.
It should be understood that the specific embodiments described are merely illustrative of some embodiments of the invention, and not of all embodiments. All other embodiments obtained by a person skilled in the art on the basis of the embodiments given herein without creative effort shall fall within the protection scope of the invention.
Claims (9)
1. A small sample image classification method based on specific parameter distribution generation, characterized by comprising the following steps:
step 1, constructing an overall architecture of a small sample image classification neural network, wherein the overall architecture comprises an architecture of a distribution learning network, an architecture of a parameter adjustment network and an architecture of the small sample image classification neural network;
step 2, obtaining the parameters of a variety of convolutional neural networks as a parameter training set;
step 3, training the distribution learning network with the parameter training set;
step 4, generating initial parameters of the small sample image classification neural network through the trained distribution learning network;
step 5, under the adjustment of the parameter adjustment network, training the small sample image classification neural network with a target data set, the target data set being the set of images to be classified;
and step 6, inputting an image to be classified, extracting its features through the small sample image classification neural network, and judging its class.
2. The small sample image classification method based on specific parameter distribution generation according to claim 1, characterized in that the parameter adjustment network and the distribution learning network have the same structure and parameters.
3. The small sample image classification method based on specific parameter distribution generation according to claim 1, characterized in that in step 2, the various convolutional neural networks comprise the VGG, DenseNet, AlexNet and ResNet convolutional neural networks.
4. The small sample image classification method based on specific parameter distribution generation according to claim 1, characterized in that in step 3, the distribution learning network is constructed as a generative adversarial network, and the optimization objective function of the distribution learning network is:

min_G max_D V(D, G) = E_{x~P_data}[log D(x)] + E_{z~P_z}[log(1 - D(G(z)))]

where G and D denote the generator and the discriminator respectively, x is data from the training data set, z is data sampled from the noise model, P_data denotes the distribution of the real data, and P_z denotes the distribution of the simulated data.
6. The small sample image classification method based on specific parameter distribution generation according to claim 1, characterized in that in step 4, the specific method of generating the initial parameters comprises:
randomly generating N Gaussian noise images of size 7 × 7, where N is the number of convolution kernels required by the small sample learning network, and then inputting the Gaussian noise images into the distribution learning network to generate the initial parameters of the small sample image classification neural network.
7. The small sample image classification method based on specific parameter distribution generation according to claim 1, characterized in that in step 5, the specific method of training the small sample image classification neural network with the target data set comprises:
performing adjustment training of the small sample image classification neural network on the target data set to obtain new parameters; adjusting the new parameters with the parameter adjustment network; then replacing the corresponding original parameters in the small sample image classification neural network with the adjusted parameters, and performing adjustment training on the target data set again, until the loss function of the small sample image classification neural network on the target data set reaches a set condition.
8. The small sample image classification method based on specific parameter distribution generation according to claim 1, characterized in that in step 6, the specific method of extracting the features of the image through the small sample image classification neural network and judging its class comprises:
performing feature extraction on the input image with the small sample image classification neural network, the features being the feature maps obtained after convolution with the convolution kernel layers of the network; pooling the feature maps to reduce their dimensionality to a set value; and classifying the feature maps with the final fully connected layer to judge the class of the image.
9. A small sample image classification system based on specific parameter distribution generation, applying the small sample image classification method based on specific parameter distribution generation according to any one of claims 1 to 8, characterized by comprising:
a parameter generation module for constructing an overall architecture of the small sample image classification neural network, the overall architecture comprising an architecture of the distribution learning network, an architecture of the parameter adjustment network and an architecture of the small sample image classification neural network, taking the parameters of a variety of convolutional neural networks as a parameter training set, training the distribution learning network with the parameter training set, and generating the initial parameters of the small sample image classification neural network through the trained distribution learning network;
a network training module for training the small sample image classification neural network with a target data set under the adjustment of the parameter adjustment network, the target data set being the set of images to be classified;
and an online classification module for inputting an image to be classified, extracting its features through the small sample image classification neural network, and judging its class.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111465249.6A CN113989575A (en) | 2021-12-03 | 2021-12-03 | Small sample image classification method and system based on specific parameter distribution generation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111465249.6A CN113989575A (en) | 2021-12-03 | 2021-12-03 | Small sample image classification method and system based on specific parameter distribution generation |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113989575A true CN113989575A (en) | 2022-01-28 |
Family
ID=79733131
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111465249.6A Pending CN113989575A (en) | 2021-12-03 | 2021-12-03 | Small sample image classification method and system based on specific parameter distribution generation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113989575A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020200030A1 (en) * | 2019-04-02 | 2020-10-08 | 京东方科技集团股份有限公司 | Neural network training method, image processing method, image processing device, and storage medium |
CN112149755A (en) * | 2020-10-12 | 2020-12-29 | 自然资源部第二海洋研究所 | Small sample seabed underwater sound image substrate classification method based on deep learning |
CN112348792A (en) * | 2020-11-04 | 2021-02-09 | 广东工业大学 | X-ray chest radiography image classification method based on small sample learning and self-supervision learning |
- 2021-12-03: CN202111465249.6A patent/CN113989575A (en), status Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020200030A1 (en) * | 2019-04-02 | 2020-10-08 | 京东方科技集团股份有限公司 | Neural network training method, image processing method, image processing device, and storage medium |
CN112149755A (en) * | 2020-10-12 | 2020-12-29 | 自然资源部第二海洋研究所 | Small sample seabed underwater sound image substrate classification method based on deep learning |
CN112348792A (en) * | 2020-11-04 | 2021-02-09 | 广东工业大学 | X-ray chest radiography image classification method based on small sample learning and self-supervision learning |
Non-Patent Citations (2)
Title |
---|
CANG MINGJIE: "SAR target recognition method based on ICNN and IGAN", Radar Science and Technology, vol. 18, no. 3, 30 June 2020 (2020-06-30), pages 287-294 *
ZHENG ZONGSHENG: "Research on typhoon grade classification based on transfer learning and meteorological satellite cloud images", Remote Sensing Technology and Application, vol. 35, no. 1, 20 February 2020 (2020-02-20), pages 202-210 *
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109035149B (en) | License plate image motion blur removing method based on deep learning | |
CN110516596B (en) | Octave convolution-based spatial spectrum attention hyperspectral image classification method | |
CN110929603B (en) | Weather image recognition method based on lightweight convolutional neural network | |
CN110188685B (en) | Target counting method and system based on double-attention multi-scale cascade network | |
WO2022160771A1 (en) | Method for classifying hyperspectral images on basis of adaptive multi-scale feature extraction model | |
CN108491765B (en) | Vegetable image classification and identification method and system | |
CN109754017B (en) | Hyperspectral image classification method based on separable three-dimensional residual error network and transfer learning | |
CN111079795B (en) | Image classification method based on CNN (content-centric networking) fragment multi-scale feature fusion | |
CN108256482B (en) | Face age estimation method for distributed learning based on convolutional neural network | |
CN108665005B (en) | Method for improving CNN-based image recognition performance by using DCGAN | |
CN110929602A (en) | Foundation cloud picture cloud shape identification method based on convolutional neural network | |
CN108847223B (en) | Voice recognition method based on deep residual error neural network | |
CN111861906B (en) | Pavement crack image virtual augmentation model establishment and image virtual augmentation method | |
CN109461458B (en) | Audio anomaly detection method based on generation countermeasure network | |
CN114943345B (en) | Active learning and model compression-based federal learning global model training method | |
CN110689039A (en) | Trunk texture identification method based on four-channel convolutional neural network | |
CN112991493A (en) | Gray level image coloring method based on VAE-GAN and mixed density network | |
CN114897884A (en) | No-reference screen content image quality evaluation method based on multi-scale edge feature fusion | |
CN110956201A (en) | Image distortion type classification method based on convolutional neural network | |
CN114580517A (en) | Method and device for determining image recognition model | |
CN112561054B (en) | Neural network filter pruning method based on batch characteristic heat map | |
CN110554667A (en) | convolutional Neural Network (CNN) based intermittent industrial process fault diagnosis | |
CN117911252A (en) | Method and system for removing illumination in image based on deep learning | |
CN112686268A (en) | Crop leaf disorder identification method based on SVD-ResNet50 neural network | |
CN111652238B (en) | Multi-model integration method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||