CN114511739A - Task-adaptive small sample image classification method based on meta-migration learning - Google Patents

Task-adaptive small sample image classification method based on meta-migration learning

Info

Publication number
CN114511739A
CN114511739A (application CN202210089412.1A)
Authority
CN
China
Prior art keywords: meta, task, learning, samples, learner
Prior art date
Legal status: Pending (the listed status is an assumption and is not a legal conclusion)
Application number
CN202210089412.1A
Other languages
Chinese (zh)
Inventor
初妍
谢天文
莫士奇
李松
时洁
曹宇辰
Current Assignee
Harbin Engineering University
Original Assignee
Harbin Engineering University
Application filed by Harbin Engineering University
Priority to CN202210089412.1A
Publication of CN114511739A
Legal status: Pending

Classifications

    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06F — ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 — Pattern recognition
    • G06F 18/20 — Analysing
    • G06F 18/24 — Classification techniques
    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06F — ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 — Pattern recognition
    • G06F 18/20 — Analysing
    • G06F 18/21 — Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F 18/214 — Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06N — COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 20/00 — Machine learning
    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06N — COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 — Computing arrangements based on biological models
    • G06N 3/02 — Neural networks
    • G06N 3/04 — Architecture, e.g. interconnection topology
    • G06N 3/045 — Combinations of networks
    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06N — COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 — Computing arrangements based on biological models
    • G06N 3/02 — Neural networks
    • G06N 3/08 — Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Biomedical Technology (AREA)
  • Evolutionary Biology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention belongs to the technical field of computers, and particularly relates to a task-adaptive small sample image classification method based on meta-migration learning. By incorporating meta-migration learning, the method compensates for the insufficient features extracted by the MAML model's shallow 4Conv network. Trainable parameters are added to learn how much meta-knowledge to use in each task, addressing the task-imbalance, class-imbalance, and distribution-imbalance problems of small sample learning in real scenes. The invention selects the samples with low accuracy in each task and recombines their data into harder tasks, so that the accuracy of the model improves as the meta-learner learns these harder tasks. The proposed hard task mining algorithm collects poorly classified samples online to form difficult tasks, enabling the learner to learn them faster and to better effect.

Description

Task-adaptive small sample image classification method based on meta-migration learning
Technical Field
The invention belongs to the technical field of computers, and particularly relates to a task-adaptive small sample image classification method based on meta-migration learning.
Background
Deep learning has achieved great success in various recognition tasks. However, deep neural networks have numerous parameters and require sufficient labeled samples for training, which severely limits their scalability. For many rare classes, collecting a large number of training samples is infeasible. By contrast, people often need to see only a few examples of something new to quickly recognize a new object class. Inspired by this human learning ability, researchers hope that machine learning models can likewise learn quickly from few training samples. The key to such fast learning is acquiring prior knowledge: a large number of tasks are used to train the model so that it continually accumulates prior knowledge while quickly learning new knowledge. This allows the model to make correct classifications with only a few samples when it encounters unseen classes, which is the concept of small sample learning (Few-Shot Learning). In the small sample learning problem, the training set consists of a group of base classes, each with enough training samples, and a group of new classes, each with only a few labeled samples (shots). Small sample learning aims to achieve better generalization by transferring the knowledge of the base classes through training the model on a few new-class samples.
Meta-learning is applied to small sample learning as a framework: it uses a large number of similar tasks to learn how to adapt to a new task that has only a few labeled samples. However, meta-learning uses a shallow network as the feature extractor to avoid overfitting, which leads to insufficient feature extraction. In addition, existing meta-learning methods for small sample classification assume that the number of instances per task and per class is fixed; even when these numbers differ greatly, the meta-learner still applies meta-knowledge equally across all tasks. They also do not account for unseen differences in task distribution, so the meta-knowledge learned on the training set may not be useful. Together these constitute the class-imbalance, task-imbalance, and distribution-imbalance problems in small sample learning. At present, research on imbalanced small sample learning is scarce, and handling small sample learning problems closer to real scenes is one of the problems to be solved urgently.
Disclosure of Invention
The invention aims to solve two problems: insufficient feature extraction when meta-learning methods use a shallow network to extract features, and the failure of existing small sample learning methods to consider imbalance in real scenes. To this end, it provides a task-adaptive small sample image classification method based on meta-migration learning.
A task-adaptive small sample image classification method based on meta-migration learning comprises the following steps:
Step 1: acquire a large-scale image data set, pre-train a deep network with its samples, and output the weight parameter vectors Θ and θ of the feature extractor and classifier;
Step 2: freeze the weight parameter vector Θ of the feature extractor and reset the weight parameter vector θ of the classifier;
Step 3: acquire the small sample image data set to be classified and divide it into a meta-training set and a meta-test set; the meta-training set comprises a support set, used to update the parameters of the basic learner, and a query set, used to compute the loss function and update the meta-learner;
Step 4: initialize the parameters of the meta-learner and the basic learner, and set the maximum number of iterations; the basic learner quickly learns new knowledge from a new task, while the meta-learner summarizes experience across all learned knowledge; small sample learning is organized in tasks, each containing N classes;
Step 5: randomly extract samples from the meta-training set to form training tasks, then randomly extract training tasks to train the basic learner, continually updating and optimizing the meta-migration learning parameters and balance parameters; in this step, the classes containing samples with low recognition accuracy are collected as difficult classes;
Step 6: randomly sample from the difficult class set to form difficult tasks;
Step 7: randomly extract difficult tasks to train the basic learner, continually updating and optimizing the learner parameters, meta-migration learning parameters, and balance parameters until the loss falls below a threshold;
Step 8: judge whether the maximum number of iterations is reached; if not, clear the difficult class set and return to step 5 for the next iteration; if so, output the meta-migration learning parameters and balance parameters after the final iteration to obtain the trained basic learner;
Step 9: input the meta-test set into the trained basic learner to obtain the small sample image classification result.
Further, during the pre-training in step 1, the loss function is:

$$\mathcal{L}^{\mathcal{D}}\big([\Theta;\theta]\big)=\frac{1}{|\mathcal{D}|}\sum_{(x,y)\in\mathcal{D}}l\big(f_{[\Theta;\theta]}(x),\,y\big)$$

where D is the sample data; f_[Θ;θ] denotes the model function containing the weight parameter vectors Θ and θ of the feature extractor and classifier; x and y denote the sample and label, respectively.

The update formula of the weight parameter vectors Θ and θ of the feature extractor and classifier is:

$$[\Theta;\theta]\leftarrow[\Theta;\theta]-\alpha\nabla\mathcal{L}^{\mathcal{D}}\big([\Theta;\theta]\big)$$
further, the collection process of the difficulty classes in the step 5 is as follows:
step 5.1: acquiring a support set and a query set of a current input task;
step 5.2: calculating a loss function supporting centralized samples and updating a basic learner parameter [ theta ]; theta]Meta migration learning parameters
Figure BDA0003488567620000023
Equilibrium parameter omeganii,zi
(Θ,θ0)=g(zi)(Θ,θ)
Where the subscript i denotes the index of the task, i 1, 2.., N; g (·) is a non-negative activation function, and the above equation is an initialization equation of the model by balancing the variable zτTo determine how much meta-knowledge to use;
Figure BDA0003488567620000031
wherein mu is the learning rate of the meta migration learning parameter; l isT(query)A loss function that is a sample of the query set;
Figure BDA0003488567620000032
wherein f (·) max (0, min (·,1)) is a clipping function; alpha is the task learning rate; series of scalars
Figure BDA0003488567620000033
Processing the class imbalance as a coefficient of the per-task gradient descent step loss;
step 5.3: and extracting samples from the query set, calculating the classification accuracy of the samples, arranging the samples in ascending order according to the accuracy, and taking the class where the m lowest samples are located as a difficult class, wherein m is a preset constant.
The invention has the following beneficial effects:
By incorporating meta-migration learning, the method compensates for the insufficient features extracted by the MAML model's shallow 4Conv network. Trainable parameters are added to learn how much meta-knowledge to use in each task, addressing the task-imbalance, class-imbalance, and distribution-imbalance problems of small sample learning in real scenes. The invention selects the samples with low accuracy in each task and recombines their data into harder tasks, so that the accuracy of the model improves as the meta-learner learns these harder tasks. The proposed hard task mining algorithm collects poorly classified samples online to form difficult tasks, enabling the learner to learn them faster and to better effect.
Drawings
FIG. 1 is a general block diagram of a model in the present invention.
FIG. 2 is a flow chart of the training of the model of the present invention.
FIG. 3 is a block diagram of a feature extractor ResNet-12 of the present invention.
Fig. 4 is a structural diagram of an inference network in the present invention.
FIG. 5 is a pseudo code diagram of a specific algorithm in the present invention.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
In order to solve the problems that feature extraction is insufficient when a shallow network is used to extract features in meta-learning methods, and that existing small sample learning methods do not consider imbalance in real scenes, the invention provides a task-adaptive small sample image classification method based on meta-migration learning (MT-TAML), which adopts the following technical scheme:
a task-adaptive small sample image classification method based on meta-migration learning is disclosed, wherein a model is generally shown in FIG. 1, and a model training process is shown in FIG. 2, and comprises the following steps:
step 1: acquiring a large-scale image data set (such as ImageNet), pre-training a deep network by using samples in the large-scale image data set, and outputting weight parameter vectors theta and theta of a group of feature extractors and classifiers;
in pre-training, the loss function is:
Figure BDA0003488567620000041
wherein D is sample data; f. of[Θ;θ]Representing a model equation comprising weight parameter vectors Θ and θ of the feature extractor and the classifier; x, y denote the specimen and label, respectively;
the feature extractor employs ResNet-12, as shown in FIG. 3; the updating formula of the weight parameter vectors theta and theta of the feature extractor and the classifier is as follows:
Figure BDA0003488567620000042
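For illustration only, the pre-training loss and update above can be sketched as follows. The toy linear softmax model and the helper name `pretrain_step` are assumptions standing in for the actual ResNet-12 feature extractor and classifier; the learning rate is likewise illustrative:

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over the class dimension
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def pretrain_step(W, X, y, lr=0.1):
    """One gradient step on the cross-entropy loss L_D([Theta; theta]).

    W plays the role of the joint parameters [Theta; theta]; X and y are
    the samples and labels of the large-scale data set D."""
    n = X.shape[0]
    probs = softmax(X @ W)                          # f_[Theta;theta](x)
    onehot = np.eye(W.shape[1])[y]
    loss = -np.mean(np.sum(onehot * np.log(probs + 1e-12), axis=1))
    grad = X.T @ (probs - onehot) / n               # dL_D / dW
    return W - lr * grad, loss                      # [Theta;theta] - alpha * grad
```

Repeating the step over the data set drives the loss down, after which Θ is frozen and θ is reset for the meta-training stage (steps 2 to 4).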
Step 2: freeze the weight parameter vector Θ of the feature extractor and reset the weight parameter vector θ of the classifier;
Step 3: acquire the small sample image data set to be classified and divide it into a meta-training set and a meta-test set; the meta-training set comprises a support set, used to update the parameters of the basic learner, and a query set, used to compute the loss function and update the meta-learner;
Step 4: initialize the parameters of the meta-learner and the basic learner, and set the maximum number of iterations; the basic learner quickly learns new knowledge from a new task, while the meta-learner summarizes experience across all learned knowledge; small sample learning is organized in tasks, each containing N classes;
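The task structure of steps 3 and 4 (N classes per task, split into support and query sets) can be illustrated with a small sampling routine. The dataset layout (a dict from class label to samples) and the default 5-way 1-shot sizes are assumptions for illustration, not fixed by the patent:

```python
import random

def sample_task(dataset, n_way=5, k_shot=1, q_query=15):
    """Sample one N-way K-shot task from the meta-training set.

    `dataset` maps class label -> list of samples. Returns (support, query)
    lists of (sample, label) pairs."""
    classes = random.sample(sorted(dataset), n_way)
    support, query = [], []
    for c in classes:
        picked = random.sample(dataset[c], k_shot + q_query)
        support += [(x, c) for x in picked[:k_shot]]   # updates the base learner
        query += [(x, c) for x in picked[k_shot:]]     # updates the meta-learner
    return support, query
```

In training, step 5 draws such tasks repeatedly; the support set drives the inner update of the basic learner and the query set drives the meta-level update.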
Step 5: randomly extract samples from the meta-training set to form training tasks, then randomly extract training tasks to train the basic learner, continually updating and optimizing the meta-migration learning parameters and balance parameters; the inference network of the balance parameters is shown in FIG. 4; in this step, the classes containing samples with low recognition accuracy are collected as difficult classes;
the collection procedure for the difficult classes is:
step 5.1: acquiring a support set and a query set of a current input task;
step 5.2: calculation branchHolding the loss function of the concentrated sample and updating the parameters [ theta ] of the basic learner; theta]Meta migration learning parameters
Figure BDA0003488567620000043
Balance parameter omeganii,zi
(Θ,θ0)=g(zi)(Θ,θ)
Where the subscript i denotes the index of the task, i 1, 2.., N; g (·) is a non-negative activation function, and the above equation is an initialization equation of the model by balancing the variable zτTo determine how much meta-knowledge to use;
Figure BDA0003488567620000044
wherein mu is the learning rate of the meta migration learning parameter; l isT(query)A loss function that is a sample of the query set;
Figure BDA0003488567620000051
wherein f (·) max (0, min (·,1)) is a clipping function; alpha is the task learning rate; series of scalars
Figure BDA0003488567620000052
Processing the class imbalance as a coefficient of the per-task gradient descent step loss;
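A minimal sketch of the balanced inner-loop step described above, assuming the per-class gradients have already been computed; the array shapes, the helper names `clip` and `inner_update`, and the scalar values are illustrative assumptions:

```python
import numpy as np

def clip(v):
    """The clipping function f(.) = max(0, min(., 1)), applied elementwise."""
    return np.clip(v, 0.0, 1.0)

def inner_update(theta, class_grads, gamma, omega, alpha=0.01):
    """One task-adaptive gradient step on the support set.

    class_grads[c] is the gradient of the loss on class c's support samples;
    omega re-weights the per-class losses (handling class imbalance), and
    f(gamma) modulates the step size per parameter."""
    weighted = sum(w * g for w, g in zip(omega, class_grads))
    return theta - alpha * clip(gamma) * weighted
```

The clipping of γ keeps the learned per-parameter step modulation inside [0, 1], so the balance variables can only attenuate, never amplify or reverse, the gradient step.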
Step 5.3: extract samples from the query set, compute their classification accuracy, sort them in ascending order of accuracy, and take the classes containing the m lowest-accuracy samples as difficult classes, where m is a preset constant. The specific algorithm of the task-adaptive small sample image classification method based on meta-migration learning is shown in FIG. 5;
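Step 5.3 can be sketched as follows; the (class, accuracy) pair format is an assumed bookkeeping format for illustration, not fixed by the patent:

```python
def collect_hard_classes(per_sample, m=3):
    """Rank query samples by classification accuracy in ascending order
    and return the set of classes containing the m lowest-accuracy ones.

    `per_sample` is a list of (class_label, accuracy) pairs."""
    ranked = sorted(per_sample, key=lambda p: p[1])
    return {cls for cls, _ in ranked[:m]}
```

The returned set is accumulated across tasks during step 5 and then used by step 6 to resample difficult tasks.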
Step 6: randomly sample from the difficult class set to form difficult tasks;
Step 7: randomly extract difficult tasks to train the basic learner, continually updating and optimizing the learner parameters, meta-migration learning parameters, and balance parameters until the loss falls below a threshold;
Step 8: judge whether the maximum number of iterations is reached; if not, clear the difficult class set and return to step 5 for the next iteration; if so, output the meta-migration learning parameters and balance parameters after the final iteration to obtain the trained basic learner;
Step 9: input the meta-test set into the trained basic learner to obtain the small sample image classification result.
On the MAML base model, the invention provides a task-adaptive small sample learning algorithm based on meta-migration learning (MT-TAML). By incorporating meta-migration learning, it compensates for the insufficient features extracted by the MAML model's shallow 4Conv network; trainable parameters are added to learn how much meta-knowledge to use in each task, addressing the task-imbalance, class-imbalance, and distribution-imbalance problems of small sample learning in real scenes.
The proposed hard task mining algorithm collects poorly classified samples online to form difficult tasks, enabling the learner to learn them faster and to better effect. A traditional meta-batch consists of randomly sampled tasks, where randomness implies random difficulty. In this algorithm, the samples with low accuracy in each task are deliberately selected and their data recombined into harder tasks, so that the accuracy of the model improves as the meta-learner learns these harder tasks.
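The overall loop of steps 5 to 8 (train on random tasks, mine difficult classes online, train on the recombined harder task, then clear the set) can be sketched as a skeleton. Here `train_fn` and `mine_fn` are placeholders for the patent's update and mining procedures, and the task encoding is an illustrative assumption:

```python
def meta_train(tasks, train_fn, mine_fn, max_iters=3):
    """Skeleton of the outer loop: per iteration, train on each random task,
    accumulate difficult classes mined from its query results, train once on
    the harder task recombined from them, then clear the set (step 8)."""
    hard_classes, history = set(), []
    for _ in range(max_iters):
        for task in tasks:
            hard_classes |= mine_fn(train_fn(task))   # step 5: online mining
        if hard_classes:
            history.append(tuple(sorted(hard_classes)))
            train_fn(history[-1])                      # steps 6-7: harder task
        hard_classes = set()                           # step 8: clear the set
    return history
```

In the full method the loop also checks the loss threshold of step 7 and outputs the meta-migration learning and balance parameters once the maximum iteration count is reached.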
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (3)

1. A task-adaptive small sample image classification method based on meta-migration learning, characterized by comprising the following steps:
Step 1: acquire a large-scale image data set, pre-train a deep network with its samples, and output the weight parameter vectors Θ and θ of the feature extractor and classifier;
Step 2: freeze the weight parameter vector Θ of the feature extractor and reset the weight parameter vector θ of the classifier;
Step 3: acquire the small sample image data set to be classified and divide it into a meta-training set and a meta-test set; the meta-training set comprises a support set, used to update the parameters of the basic learner, and a query set, used to compute the loss function and update the meta-learner;
Step 4: initialize the parameters of the meta-learner and the basic learner, and set the maximum number of iterations; the basic learner quickly learns new knowledge from a new task, while the meta-learner summarizes experience across all learned knowledge; small sample learning is organized in tasks, each containing N classes;
Step 5: randomly extract samples from the meta-training set to form training tasks, then randomly extract training tasks to train the basic learner, continually updating and optimizing the meta-migration learning parameters and balance parameters; in this step, the classes containing samples with low recognition accuracy are collected as difficult classes;
Step 6: randomly sample from the difficult class set to form difficult tasks;
Step 7: randomly extract difficult tasks to train the basic learner, continually updating and optimizing the learner parameters, meta-migration learning parameters, and balance parameters until the loss falls below a threshold;
Step 8: judge whether the maximum number of iterations is reached; if not, clear the difficult class set and return to step 5 for the next iteration; if so, output the meta-migration learning parameters and balance parameters after the final iteration to obtain the trained basic learner;
Step 9: input the meta-test set into the trained basic learner to obtain the small sample image classification result.
2. The task-adaptive small sample image classification method based on meta-migration learning according to claim 1, characterized in that, during the pre-training in step 1, the loss function is:

$$\mathcal{L}^{\mathcal{D}}\big([\Theta;\theta]\big)=\frac{1}{|\mathcal{D}|}\sum_{(x,y)\in\mathcal{D}}l\big(f_{[\Theta;\theta]}(x),\,y\big)$$

where D is the sample data; f_[Θ;θ] denotes the model function containing the weight parameter vectors Θ and θ of the feature extractor and classifier; x and y denote the sample and label, respectively;

the update formula of the weight parameter vectors Θ and θ of the feature extractor and classifier is:

$$[\Theta;\theta]\leftarrow[\Theta;\theta]-\alpha\nabla\mathcal{L}^{\mathcal{D}}\big([\Theta;\theta]\big)$$
3. The task-adaptive small sample image classification method based on meta-migration learning according to claim 1, characterized in that the collection process of the difficult classes in step 5 is as follows:
Step 5.1: acquire the support set and query set of the current input task;
Step 5.2: compute the loss function of the support set samples and update the basic learner parameters [Θ; θ], the meta-migration learning parameters Φ_S, and the balance parameters ω_i, γ_i, z_i:

$$(\Theta,\theta_0)=g(z_i)\,(\Theta,\theta)$$

where the subscript i denotes the index of the task, i = 1, 2, ..., N; g(·) is a non-negative activation function; this is the initialization equation of the model, in which the balance variable z_i determines how much meta-knowledge to use;

$$\Phi_S \leftarrow \Phi_S-\mu\nabla_{\Phi_S}\mathcal{L}^{T(\mathrm{query})}$$

where μ is the learning rate of the meta-migration learning parameters; L^{T(query)} is the loss function of the query set samples;

$$\theta_i \leftarrow \theta_{i-1}-\alpha\,f(\gamma_i)\circ\nabla_{\theta}\sum_{c}\omega_{i,c}\,\mathcal{L}_c^{T(\mathrm{support})}$$

where f(·) = max(0, min(·, 1)) is a clipping function; α is the task learning rate; the series of scalars ω_{i,c} serve as coefficients of the per-class loss in each gradient descent step to handle class imbalance;
Step 5.3: extract samples from the query set, compute their classification accuracy, sort them in ascending order of accuracy, and take the classes containing the m lowest-accuracy samples as difficult classes, where m is a preset constant.
CN202210089412.1A 2022-01-25 2022-01-25 Task-adaptive small sample image classification method based on meta-migration learning Pending CN114511739A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210089412.1A CN114511739A (en) 2022-01-25 2022-01-25 Task-adaptive small sample image classification method based on meta-migration learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210089412.1A CN114511739A (en) 2022-01-25 2022-01-25 Task-adaptive small sample image classification method based on meta-migration learning

Publications (1)

Publication Number Publication Date
CN114511739A (en) 2022-05-17

Family

ID=81550188

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210089412.1A Pending CN114511739A (en) 2022-01-25 2022-01-25 Task-adaptive small sample image classification method based on meta-migration learning

Country Status (1)

Country Link
CN (1) CN114511739A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115409124A * 2022-09-19 2022-11-29 小语智能信息科技(云南)有限公司 Small sample sensitive information identification method based on fine-tuning prototype network
CN116109627A * 2023-04-10 2023-05-12 广东省科技基础条件平台中心 Defect detection method, device and medium based on migration learning and small sample learning
CN116109627B * 2023-04-10 2023-08-01 广东省科技基础条件平台中心 Defect detection method, device and medium based on migration learning and small sample learning
WO2024082374A1 * 2022-10-19 2024-04-25 电子科技大学长三角研究院(衢州) Few-shot radar target recognition method based on hierarchical meta transfer


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination