CN113378942B - Small sample image classification method based on multi-head feature cooperation - Google Patents

Small sample image classification method based on multi-head feature cooperation

Info

Publication number
CN113378942B
CN113378942B (application CN202110667364.5A)
Authority
CN
China
Prior art keywords
feature
classifier
features
representing
data
Prior art date
Legal status
Active
Application number
CN202110667364.5A
Other languages
Chinese (zh)
Other versions
CN113378942A (en)
Inventor
刘宝弟
兴雷
邵帅
刘伟锋
王延江
Current Assignee
China University of Petroleum East China
Original Assignee
China University of Petroleum East China
Priority date
Filing date
Publication date
Application filed by China University of Petroleum East China
Priority to CN202110667364.5A
Publication of CN113378942A
Application granted
Publication of CN113378942B
Legal status: Active (current)
Anticipated expiration

Classifications

    • G06F18/2411 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches, based on the proximity to a decision surface, e.g. support vector machines
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06F18/25 Fusion techniques
    • G06F18/285 Selection of pattern recognition techniques, e.g. of classifiers in a multi-classifier system
    • G06N3/045 Combinations of networks
    • G06N3/08 Learning methods


Abstract

The invention discloses a small sample image classification method based on multi-head feature cooperation, belonging to the technical field of pattern recognition. The method simultaneously uses embedded features extracted by a plurality of feature extractors and introduces a subspace learning method to convert the original multi-head features into a uniform low-dimensional representation space, which helps reduce redundant information and effectively solves the problem that different embedded features have inconsistent metric scales when they lie in different feature spaces. In addition, a weight calculation part is designed to automatically update the combination weight of each multi-head feature, and the processed multi-head embedded features are concatenated to obtain a cooperative representation of the sample, so that the problem of making reasonable use of the multi-head features is effectively solved.

Description

Small sample image classification method based on multi-head feature cooperation
Technical Field
The invention relates to the technical field of pattern recognition, in particular to a small sample image classification method based on multi-head feature cooperation.
Background
Inspired by human cognitive learning, researchers have proposed the problem of small sample image classification: after learning from a large number of samples of a limited set of classes, a model should be able to use this prior knowledge to learn quickly and accurately from only a small amount of sample data when it encounters new classes. In recent years, small sample learning has become an important new research direction in the field of machine learning and is regarded as one of the development directions of next-generation artificial intelligence.
At present, the main small sample image classification methods include the following:
(1) Small sample image classification methods based on data expansion: a small sample image classification method based on data expansion was proposed in 2018, which uses a generative adversarial network to generate a new sample data set from the original data set; to address the shortage of sample data when training the generative adversarial network for small sample image classification, a generator is designed to map sample data of high-probability classes to sample data of low-probability classes. By expanding the data, this kind of method has a number of concrete advantages in pattern recognition. However, the generation process only guarantees the correctness of the generated samples and does not consider their distribution, which is very disadvantageous for classification.
(2) Small sample image classification methods based on prototype networks: the prototype-network-based small sample image classification method was proposed by Snell J and Swersky K in 2017; it uses the mean of the features of each class of samples in the support set as one representation of that class, measures similarity by the Euclidean distance, and predicts the labels of unknown samples. The method is simple and effective and achieves good performance on small sample image classification tasks. However, since the training data of small sample images are very limited, it is very difficult to estimate the sample distribution from the training samples alone, which may cause a certain deviation in the final classification.
(3) Small sample image classification methods based on optimization: an optimization-based small sample image classification method was proposed by Ravi S and Larochelle H in 2017; it provides a model-agnostic meta-learning framework that has only one weight initialization and can adapt with any number of gradient steps. The model is trained by stochastic gradient descent, is easy to fine-tune to adapt to new sample data, and can be trained quickly on a small sample data set. However, the testing phase does not adequately mine the value of the unlabeled samples.
Disclosure of Invention
In order to solve the problems existing in the image classification process of the small sample image classification method in the prior art, the embodiment of the invention provides a small sample image classification method based on multi-head feature cooperation. The technical scheme is as follows:
the invention provides a small sample image classification method based on multi-head feature cooperation, which comprises the following steps:
extracting image features by adopting a convolutional neural network;
training a classifier by directly optimizing a first objective function, and predicting the category of the test sample by using the classifier, wherein the first objective function is:
min_W ||Y − WX||_F^2 + μ||W||_F^2
wherein X = [x_1, x_2, ..., x_N] ∈ R^{dim1×N} and Y = [y_1, y_2, ..., y_N] ∈ R^{C×N}, dim1 and N denote the dimension and the number of samples respectively, C denotes the number of classes, x_n and y_n (n = 1, 2, ..., N) denote the embedded feature vector and the label vector of the n-th sample, W ∈ R^{C×dim1} represents the classifier to be learned, ||·||_F represents the Frobenius-norm regularization of (·), and μ represents the weight of the constraint term of the classifier W;
the classifier W is as follows:
W = YX^T(XX^T + μI)^{-1}
wherein I represents an identity matrix;
the category of the test sample feature x̂ is:
ŷ = max(Wx̂)
wherein max represents the operator for obtaining the index of the maximum value in a vector;
introducing a subspace learning method, reconstructing the original multi-head features into a uniform low-dimensional space, and obtaining new embedded features through the learned subspace;
solving for the optimal weight combination of the new embedded features;
calculating the final cooperative features through a first formula, calculating a final cooperative classifier, and predicting the category of the cooperative feature according to the final cooperative classifier, wherein the first formula is:
z_n = [Ω_1 p_n^1; Ω_2 p_n^2; ...; Ω_H p_n^H]
wherein z_n represents the final cooperative feature of the n-th sample, p_n^h represents the n-th feature of P^h, Ω_h represents the combination weight of the h-th feature, and Z = [z_1, z_2, ..., z_N] collects the cooperative features of all samples;
the final cooperative classifier is:
W_z = YZ^T(ZZ^T + μI)^{-1}
wherein W_z is the final cooperative classifier;
the predicted category of the cooperative feature is:
ŷ = max(W_z ẑ)
wherein ẑ is the cooperative feature of the test sample feature x̂;
expanding the multi-head-feature-cooperation small sample image feature classification to a semi-supervised setting, strengthening the classifier with unlabeled data, and predicting the categories of the query samples with the optimal classifier, wherein the categories of the query samples are:
Ŷ_q = max(W_z Z_q)
wherein Z_q represents the cooperative features of the query set data.
Optionally, the solving for the optimal weight combination of the new embedded features specifically comprises: recalculating the loss F^h of the first objective function on the h-th feature using the new embedded feature and the new classifier, and calculating the optimal weight combination using a second objective function, wherein the loss F^h of the first objective function on the h-th feature is:
F^h = ||Y − W_P^h P^h||_F^2 + μ||W_P^h||_F^2
wherein P^h represents the new embedded feature and W_P^h represents the new classifier;
the second objective function is:
min_Ω Σ_{h=1}^{H} Ω_h F^h + η||Ω||_2^2,  s.t.  Σ_{h=1}^{H} Ω_h = 1,  Ω_h ≥ 0
wherein Ω = [Ω_1, Ω_2, ..., Ω_H]^T represents the weight combination, Ω_h represents the weight of the h-th feature, ||·||_2 denotes the l_2 regularization, l_2 denotes taking the square root of the sum of squares of all elements of a vector, and η is a parameter;
the optimal weight of the h-th feature calculated with the second objective function is:
Ω_h^* = max( 1/H + (F_avg − F^h − Λ̂_avg) / (2η), 0 )
wherein Ω_h^* is the optimal weight of the h-th feature, F_avg is the average value of the losses F^1, ..., F^H, and Λ̂_avg is the average value of the optimal Lagrange multipliers Λ̂.
Optionally, the calculating of the optimal weight combination using the second objective function specifically comprises: introducing a Lagrangian on the basis of the second objective function, and obtaining the optimal weight combination by Newton's method.
Optionally, the expanding of the multi-head-feature-cooperation small sample image feature classification to the semi-supervised setting and the strengthening of the classifier with unlabeled data specifically comprise:
training a basic classifier with each feature of the support set data to obtain the classifier:
W_s^h = Y_s(P_s^h)^T(P_s^h(P_s^h)^T + μI)^{-1}
wherein P_s^h represents the h-th feature of the support set data X_s, X_s, X_u and X_q respectively represent the support set data, the unlabeled data and the query set data, P^h represents the h-th feature of X, W_s^h is the classifier obtained by training with the support set data, and Y_s is the label matrix of the support set data;
obtaining the support set cooperative features and the support set cooperative classifier from each feature of the support set data, and predicting the unlabeled data X_u through a second formula, wherein the second formula is:
Y_pseudo = W_z Z_u
wherein Z_u represents the cooperative features of the unlabeled data, z_{un} represents the n-th feature in Z_u, and Y_pseudo represents the soft pseudo labels predicted for the unlabeled data;
selecting the most reliable samples through the soft pseudo labels predicted for the unlabeled data, expanding them into the support set, and repeating the training to obtain the optimal classifier with stable performance;
predicting the categories of the query samples with the optimal classifier, wherein the categories of the query samples are:
Ŷ_q = max(W_z Z_q)
wherein Z_q represents the cooperative features of the query set data.
The technical scheme provided by the embodiment of the invention has the following beneficial effects:
the small sample image classification method based on multi-head feature cooperation provided by the embodiment of the invention simultaneously uses the embedded features extracted by multiple feature extractors, and introduces a subspace learning method to convert the original multi-head features into a uniform low-dimensional representation space, which is simultaneously beneficial to reducing redundant information and effectively solves the problem of inconsistent measuring scale caused by different embedded features in different feature spaces. In addition, the combined weight of each multi-head feature is automatically updated by designing a weight calculation part, and the processed multi-head embedded features are cascaded to obtain the cooperative representation of the sample, so that the problem of reasonable use of the multi-head features is effectively solved.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings required for the description of the embodiments are briefly introduced below. It is obvious that the drawings described below show only some embodiments of the present invention, and that those skilled in the art can obtain other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of a small sample image classification method based on multi-head feature cooperation according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in detail with reference to the accompanying drawings.
The method for classifying small sample images based on multi-head feature cooperation according to the embodiment of the present invention will be described in detail below with reference to fig. 1.
Referring to fig. 1, a small sample image classification method based on multi-head feature collaboration according to an embodiment of the present invention includes:
step 110: and extracting image features by adopting a convolutional neural network.
And extracting image features by adopting a convolutional neural network model Resnet-12 model. Specifically, firstly, the image scale size is changed into 84x84 size, and then the Resnet-12 model is called to obtain the characteristics of the image to be processed. The process of extracting image features by using the convolutional neural network is not the protection content of the present invention, the process of extracting image features by using the convolutional neural network belongs to the prior art, and the process of extracting image features by using the convolutional neural network is a common image feature extraction method.
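For illustration only, the following Python sketch shows one way to realize this step. It assumes a torchvision ResNet-18 backbone as a stand-in for the ResNet-12 mentioned above (ResNet-12 is not shipped with torchvision), and the 84×84 resize follows the description; repeating the extraction with several different backbones would yield the multiple feature heads used in the later steps.

```python
# Minimal sketch of step 110 (feature extraction); the ResNet-18 backbone is an
# assumed stand-in for the patent's ResNet-12.
import torch
import torchvision.transforms as T
from torchvision.models import resnet18

def build_extractor():
    backbone = resnet18(weights="IMAGENET1K_V1")
    backbone.fc = torch.nn.Identity()  # drop the classification head, keep embeddings
    backbone.eval()
    return backbone

preprocess = T.Compose([
    T.Resize((84, 84)),                # the description resizes images to 84x84
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

@torch.no_grad()
def extract_features(pil_images, extractor):
    """Return an array of shape (dim1, N): one embedded feature column per image."""
    batch = torch.stack([preprocess(img) for img in pil_images])
    feats = extractor(batch)           # (N, dim1)
    return feats.numpy().T             # (dim1, N), matching X in the text
```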
Step 120: training a classifier by directly optimizing a first objective function, and predicting the category of a test sample x̂ using the classifier.
Define X = [x_1, x_2, ..., x_N] ∈ R^{dim1×N} and Y = [y_1, y_2, ..., y_N] ∈ R^{C×N}, where dim1 and N denote the dimension and the number of samples, C denotes the number of classes, x_n and y_n (n = 1, 2, ..., N) denote the embedded feature vector and the label vector of the n-th sample, and W ∈ R^{C×dim1} represents the classifier to be learned.
The classifier is trained with the first objective function
min_W ||Y − WX||_F^2 + μ||W||_F^2
wherein ||·||_F represents the Frobenius-norm regularization of (·) and μ represents the weight of the constraint term of the classifier W.
Directly optimizing the first objective function yields the classifier W:
W = YX^T(XX^T + μI)^{-1}
wherein I represents an identity matrix.
The category of the test sample x̂ is predicted as:
ŷ = max(Wx̂)
where max represents the operator that obtains the index of the maximum value in a vector.
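As a minimal sketch of this closed-form classifier (not code from the patent), the ridge solution W = YX^T(XX^T + μI)^{-1} and the max-index prediction can be written as follows; the helper names and the default value of μ are illustrative assumptions.

```python
import numpy as np

def ridge_classifier(X, Y, mu=1.0):
    """Closed-form solution of min_W ||Y - WX||_F^2 + mu*||W||_F^2.

    X: (dim1, N) embedded features, one column per sample.
    Y: (C, N) one-hot label matrix.
    Returns W of shape (C, dim1).
    """
    dim1 = X.shape[0]
    return Y @ X.T @ np.linalg.inv(X @ X.T + mu * np.eye(dim1))

def predict(W, x_test):
    """Return the index of the maximum score, i.e. max(W x_hat) in the text."""
    return int(np.argmax(W @ x_test))
```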
Step 130: introducing a subspace learning method, reconstructing the original multi-head features into a uniform low-dimensional space, and obtaining new embedded features through the learned subspace.
Suppose there are H kinds of sample embedding features in total, and x^h denotes the h-th feature, where h = 1, 2, ..., H.
A subspace learning mapping f(·) is introduced to reconstruct the original features into a uniform low-dimensional space. The H features of the same sample are regarded as H samples of an extended data set, the features of the extended data set are expressed as X^h = [x_1^h, x_2^h, ..., x_N^h], and the subspace learning operation P^h = f(X^h) is performed to obtain the new embedded features P^h ∈ R^{dim2×N}, wherein P^h represents the h-th feature after the subspace transformation and dim2 represents the dimension of the features after the subspace transformation.
Step 140: solving for the optimal weight combination of the new embedded features.
Different features have different importance, so the optimal weight combination Ω = [Ω_1, Ω_2, ..., Ω_H]^T is sought to give these features different influence on the final decision, where Ω denotes the weight vector and Ω_h (h = 1, 2, ..., H) denotes the h-th element of Ω.
Replacing x^h with the transformed feature P^h in the formula W = YX^T(XX^T + μI)^{-1} of step 120 gives the new classifier:
W_P^h = Y(P^h)^T(P^h(P^h)^T + μI)^{-1}
Using the new embedded feature P^h and the new classifier W_P^h, the loss of the first objective function on the h-th feature is recalculated as:
F^h = ||Y − W_P^h P^h||_F^2 + μ||W_P^h||_F^2
The optimal weight combination is calculated with a second objective function:
min_Ω Σ_{h=1}^{H} Ω_h F^h + η||Ω||_2^2,  s.t.  Σ_{h=1}^{H} Ω_h = 1,  Ω_h ≥ 0
wherein ||·||_2 denotes the l_2 regularization, l_2 denotes taking the square root of the sum of squares of all elements of a vector, and η is a parameter.
Introducing Lagrange multipliers, the second objective function is rewritten as:
L(Ω, ζ, Λ) = Σ_h Ω_h F^h + η Σ_h Ω_h^2 − ζ(Σ_h Ω_h − 1) − Σ_h Λ_h Ω_h    (1)
where ζ is a scalar and Λ = [Λ_1, Λ_2, ..., Λ_H]^T is a vector of multipliers.
Equation (1) is rewritten in matrix form as:
L(Ω, ζ, Λ) = Ω^T F + η Ω^T Ω − ζ(Ω^T 1 − 1) − Λ^T Ω
where F = [F^1, F^2, ..., F^H]^T and 1 is the all-ones vector. Suppose (Ω̂, ζ̂, Λ̂) is the optimal solution; according to the Karush-Kuhn-Tucker (KKT) conditions, setting the derivative with respect to Ω to zero at Ω̂ gives:
F + 2ηΩ̂ − ζ̂1 − Λ̂ = 0    (2)
Solving equation (2) for Ω̂ gives:
Ω̂ = (ζ̂1 + Λ̂ − F) / (2η)    (3)
Solving for ζ̂ with the constraint 1^TΩ̂ = 1 gives:
ζ̂ = (2η + Σ_h F^h − Σ_h Λ̂_h) / H
Let F_avg = (1/H) Σ_h F^h and Λ̂_avg = (1/H) Σ_h Λ̂_h, where Λ̂_avg is the average value of Λ̂. Then equation (3) is rewritten element-wise as:
Ω̂_h = 1/H + (F_avg − F^h + Λ̂_h − Λ̂_avg) / (2η)    (4)
The KKT conditions also require Λ̂_h ≥ 0, Ω̂_h ≥ 0 and Λ̂_h Ω̂_h = 0. Combining these with equation (4), the optimal weight of the h-th feature is expressed as:
Ω̂_h = max( 1/H + (F_avg − F^h − Λ̂_avg) / (2η), 0 )    (5)
and the corresponding multiplier is rewritten as:
Λ̂_h = max( Λ̂_avg + F^h − F_avg − 2η/H, 0 )    (6)
Let f(Λ̂_avg) = (1/H) Σ_h Λ̂_h − Λ̂_avg. Combining the second objective function, equation (5) and equation (6) gives:
f(Λ̂_avg) = (1/H) Σ_{h=1}^{H} max( Λ̂_avg + F^h − F_avg − 2η/H, 0 ) − Λ̂_avg = 0    (7)
Applying Newton's method gives:
Λ̂_avg^{t+1} = Λ̂_avg^{t} − f(Λ̂_avg^{t}) / f'(Λ̂_avg^{t})
wherein f'(·) represents the derivative function of f(·) and t is the iteration number; the optimal solution Λ̂_avg is obtained after t iterations.
The optimal weight of the h-th feature is then obtained by substituting Λ̂_avg into equation (5):
Ω_h^* = max( 1/H + (F_avg − F^h − Λ̂_avg) / (2η), 0 )
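For a compact implementation of this weight subproblem, note that minimizing Σ_h Ω_h F^h + η||Ω||_2^2 over the probability simplex is equivalent to the Euclidean projection of the point −F/(2η) onto the simplex; the sketch below uses the standard sort-based projection in place of the Newton iteration above and returns the same optimal Ω. Helper names and default parameters are assumptions.

```python
import numpy as np

def head_loss(P, Y, mu=1.0):
    """Loss F^h = ||Y - W P||_F^2 + mu*||W||_F^2 with W the ridge classifier on P."""
    W = Y @ P.T @ np.linalg.inv(P @ P.T + mu * np.eye(P.shape[0]))
    return np.linalg.norm(Y - W @ P, "fro") ** 2 + mu * np.linalg.norm(W, "fro") ** 2

def project_simplex(v):
    """Euclidean projection of v onto {w : w >= 0, sum(w) = 1}."""
    u = np.sort(v)[::-1]
    css = np.cumsum(u)
    rho = np.nonzero(u + (1.0 - css) / (np.arange(len(v)) + 1) > 0)[0][-1]
    theta = (1.0 - css[rho]) / (rho + 1.0)
    return np.maximum(v + theta, 0.0)

def optimal_weights(projected_heads, Y, eta=1.0, mu=1.0):
    """Solve min_Omega sum_h Omega_h F^h + eta*||Omega||^2 over the simplex."""
    F = np.array([head_loss(P, Y, mu) for P in projected_heads])
    return project_simplex(-F / (2.0 * eta))
```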
step 150: calculating a final cooperative characteristic through a first formula, calculating a final cooperative classifier and predicting the category of the cooperative characteristic according to the final cooperative classifier, wherein the first formula is as follows:
Figure BDA00031174111900000810
wherein the content of the first and second substances,
Figure BDA00031174111900000811
the final collaboration feature is represented as such,
Figure BDA00031174111900000812
represents PhAnd ZhThe nth feature of (1);
according to the formula W ═ YX in step 120T(XXT+μI)-1And replacing x with Z to obtain the final collaborative classifier
Figure BDA00031174111900000813
Expression (c):
Wz=YZT(ZZT+μI)-1
obtaining the characteristics of the test sample by the first formula
Figure BDA0003117411190000091
Of
Figure BDA0003117411190000092
Predicting collaboration features
Figure BDA0003117411190000093
The categories of (A) are:
Figure BDA0003117411190000094
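A short sketch of this step under the same assumptions as the previous snippets: the weighted per-head features are concatenated into the cooperative representation Z, the final classifier W_z reuses the closed form of step 120, and the predicted category is the index of the maximum score.

```python
import numpy as np

def cooperative_features(projected_heads, weights):
    """Concatenate the weighted head features: z_n = [w_1*p_n^1; ...; w_H*p_n^H]."""
    return np.concatenate([w * P for w, P in zip(weights, projected_heads)], axis=0)

def fit_and_predict(train_heads, Y_train, test_heads, weights, mu=1.0):
    """Train the final cooperative classifier W_z and predict test categories."""
    Z = cooperative_features(train_heads, weights)                  # (H*dim2, N)
    Wz = Y_train @ Z.T @ np.linalg.inv(Z @ Z.T + mu * np.eye(Z.shape[0]))
    Z_test = cooperative_features(test_heads, weights)
    return np.argmax(Wz @ Z_test, axis=0)                           # class index per sample
```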
step 160: expanding the small sample image feature classification of multi-head feature cooperation to semi-supervised setting, utilizing unlabelled data to strengthen a classifier, and utilizing an optimal classifier to predict the category of a query tag, wherein the category of the query tag is as follows:
Figure BDA0003117411190000095
wherein Z isqRepresenting collaboration features of the query set data.
Definition of
Figure BDA0003117411190000096
Is characterized by
Figure BDA0003117411190000097
Wherein, the first and the second end of the pipe are connected with each other,
Figure BDA0003117411190000098
and
Figure BDA0003117411190000099
respectively represent support set data, unlabeled data, and query set data, and thus
Figure BDA00031174111900000910
Is characterized by being defined as
Figure BDA00031174111900000911
According to different use data, the design of the current classifier is divided into induction setting, semi-supervision setting and conversion setting, wherein the semi-supervision setting adopts a support set
Figure BDA00031174111900000912
And tagless collections
Figure BDA00031174111900000913
The classifier is trained and then the query label is predicted.
The method comprises the following steps of expanding the small sample image feature classification of multi-head feature cooperation to semi-supervised setting, and utilizing unlabelled data to strengthen a classifier, wherein the method specifically comprises the following steps:
training a basic classifier by using each feature of the support set data to obtain a classifier:
Figure BDA00031174111900000914
wherein the content of the first and second substances,
Figure BDA00031174111900000915
is a classifier obtained by training with support set data, YsIs a label matrix supporting the set data;
the combined weight for each feature is calculated according to the formula in step 140:
Figure BDA00031174111900000916
obtaining the collaborative features of the support set and the classifier according to the formula in step 150:
Figure BDA00031174111900000917
wherein the content of the first and second substances,
Figure BDA00031174111900000918
to represent
Figure BDA00031174111900000919
And ZsSupporting the cooperative property of set samples, zsnIs shown at ZsThe nth feature of (1).
Support set cooperation feature zsn and support set cooperation classifier W are obtained by using each feature of support set datazUsing a second formula
Figure BDA0003117411190000101
Predicting unlabeled data
Figure BDA0003117411190000102
Wherein Z isuRepresenting the collaboration feature of the non-tagged data,
Figure BDA0003117411190000103
to represent
Figure BDA0003117411190000104
And ZuCollaboration feature of unlabeled exemplars, zunIs shown at ZuThe nth feature of (1), YpseudoSoft pseudo labels representing unlabeled data predictions;
soft pseudo label Y predicted by unlabeled datapseudoSelecting a most reliable sample, and defining a corresponding pseudo label and characteristic as YSelectAnd pSelectExtension to support set acquisition
Figure BDA0003117411190000105
And repeating the training to obtain the optimal classifier with stable performance.
By the formula
Figure BDA0003117411190000106
Obtaining a collaborative embedding feature of the query data, wherein,
Figure BDA0003117411190000107
to represent
Figure BDA0003117411190000108
And ZqCollaborative properties of query set samples, zqnIs shown at ZqThe nth feature of (1).
Predicting the category of the query label by using the optimal classifier, wherein the category of the query label is as follows:
Figure BDA0003117411190000109
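The semi-supervised extension can be sketched as the pseudo-labelling loop below; the number of rounds, the number of samples selected per round, and the helper names are illustrative assumptions, not values specified in the patent.

```python
import numpy as np

def semi_supervised_rounds(Zs, Ys, Zu, Zq, mu=1.0, rounds=3, top_k=5):
    """Iteratively enlarge the support set with the most reliable pseudo-labelled samples.

    Zs, Zu, Zq: cooperative features of support, unlabeled and query data (columns = samples).
    Ys: one-hot labels of the support set.
    """
    C = Ys.shape[0]

    def fit(Z, Y):
        return Y @ Z.T @ np.linalg.inv(Z @ Z.T + mu * np.eye(Z.shape[0]))

    for _ in range(rounds):
        if Zu.shape[1] == 0:
            break
        Wz = fit(Zs, Ys)
        scores = Wz @ Zu                               # soft pseudo labels Y_pseudo
        conf = scores.max(axis=0)                      # reliability of each unlabeled sample
        pick = np.argsort(conf)[::-1][:top_k]          # keep only the most reliable ones
        pseudo = np.eye(C)[scores[:, pick].argmax(axis=0)].T
        Zs = np.concatenate([Zs, Zu[:, pick]], axis=1) # expand the support set
        Ys = np.concatenate([Ys, pseudo], axis=1)
        Zu = np.delete(Zu, pick, axis=1)

    Wz = fit(Zs, Ys)                                   # optimal classifier after the rounds
    return np.argmax(Wz @ Zq, axis=0)                  # predicted query categories
```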
the small sample image classification method based on multi-head feature cooperation provided by the embodiment of the invention simultaneously uses the embedded features extracted by multiple feature extractors, and introduces a subspace learning method to convert the original multi-head features into a uniform low-dimensional representation space, which is simultaneously beneficial to reducing redundant information and effectively solves the problem of inconsistent measuring scale caused by different embedded features in different feature spaces. In addition, the combined weight of each multi-head feature is automatically updated by designing a weight calculation part, and the processed multi-head embedded features are cascaded to obtain the cooperative representation of the sample, so that the problem of reasonable use of the multi-head features is effectively solved.

Claims (4)

1. A small sample image classification method based on multi-head feature collaboration, characterized by comprising the following steps:
extracting image features by adopting a convolutional neural network;
training a classifier by directly optimizing a first objective function, and predicting the category of the test sample by using the classifier, wherein the first objective function is:
min_W ||Y − WX||_F^2 + μ||W||_F^2
wherein X = [x_1, x_2, ..., x_N] ∈ R^{dim1×N} and Y = [y_1, y_2, ..., y_N] ∈ R^{C×N}, dim1 and N denote the dimension and the number of samples respectively, C denotes the number of classes, x_n and y_n (n = 1, 2, ..., N) denote the embedded feature vector and the label vector of the n-th sample, W ∈ R^{C×dim1} represents the classifier to be learned, ||·||_F represents the Frobenius-norm regularization of (·), and μ represents the weight of the constraint term of the classifier W;
the classifier W is as follows:
W = YX^T(XX^T + μI)^{-1}
wherein I represents an identity matrix;
the category of the test sample feature x̂ is:
ŷ = max(Wx̂)
wherein max represents the operator for obtaining the index of the maximum value in a vector;
introducing a subspace learning method, reconstructing the original multi-head features into a uniform low-dimensional space, and obtaining new embedded features through the learned subspace, wherein the multi-head features are the image embedding features extracted by simultaneously using a plurality of feature extractors;
solving for the optimal weight combination of the new embedded features;
calculating the final collaboration features through a first formula, calculating the final collaborative classifier, and predicting the category of the collaboration feature according to the final collaborative classifier, wherein the first formula is:
z_n = [Ω_1 p_n^1; Ω_2 p_n^2; ...; Ω_H p_n^H]
wherein z_n represents the final collaboration feature of the n-th sample, Ω = [Ω_1, Ω_2, ..., Ω_H]^T represents the optimal weight combination, Ω_h represents the weight of the h-th feature, H represents the number of kinds of features, p_n^h represents the n-th feature of P^h, P^h represents the h-th new embedded feature, and the weights Ω_1 to Ω_H correspond one-to-one to the features p_n^1 to p_n^H;
the final collaborative classifier is:
W_z = YZ^T(ZZ^T + μI)^{-1}
wherein W_z is the final collaborative classifier and Z = [z_1, z_2, ..., z_N];
the predicted category of the collaboration feature is:
ŷ = max(W_z ẑ)
wherein ẑ is the collaboration feature of the test sample feature x̂;
expanding the multi-head-feature-collaboration small sample image feature classification to a semi-supervised setting, strengthening the classifier with unlabeled data, and predicting the categories of the query samples with the optimal classifier, wherein the categories of the query samples are:
Ŷ_q = max(W_z Z_q)
wherein Z_q represents the collaboration features of the query set data.
2. The image classification method according to claim 1, wherein the solving for the optimal weight combination of the new embedded features specifically comprises: recalculating the loss F^h of the first objective function on the h-th feature using the new embedded feature and the new classifier, and calculating the optimal weight combination using a second objective function, wherein the loss F^h of the first objective function on the h-th feature is:
F^h = ||Y − W_P^h P^h||_F^2 + μ||W_P^h||_F^2
wherein P^h represents the new embedded feature and W_P^h represents the new classifier;
the second objective function is:
min_Ω Σ_{h=1}^{H} Ω_h F^h + η||Ω||_2^2,  s.t.  Σ_{h=1}^{H} Ω_h = 1,  Ω_h ≥ 0
wherein Ω = [Ω_1, Ω_2, ..., Ω_H]^T represents the optimal weight combination, Ω_h represents the weight of the h-th feature, ||·||_2 denotes the l_2 regularization, l_2 denotes taking the square root of the sum of squares of all elements of a vector, and η is a parameter;
the optimal weight of the h-th feature calculated with the second objective function is:
Ω_h^* = max( 1/H + (F_avg − F^h − Λ̂_avg) / (2η), 0 )
wherein Ω_h^* is the optimal weight of the h-th feature, H represents the number of kinds of features, F^h represents the loss on the h-th feature, F_avg represents the average value of F^1, ..., F^H, Λ = [Λ_1, Λ_2, ..., Λ_H]^T represents the vector of Lagrange multipliers, Λ̂ represents the optimal solution of Λ, and Λ̂_avg represents the average value of Λ̂.
3. The image classification method according to claim 2, wherein the calculating of the optimal weight combination using the second objective function specifically comprises: introducing a Lagrangian on the basis of the second objective function, and obtaining the optimal weight combination by Newton's method.
4. The image classification method according to claim 1, wherein expanding the multi-head-feature-collaboration small sample image feature classification to the semi-supervised setting and strengthening the classifier with unlabeled data specifically comprises:
training a basic classifier with each feature of the support set data to obtain the classifier:
W_s^h = Y_s(P_s^h)^T(P_s^h(P_s^h)^T + μI)^{-1}
wherein P_s^h represents the h-th feature of the support set data X_s, X_s, X_u and X_q respectively represent the support set data, the unlabeled data and the query set data, the query set data being the test set data, P^h represents the h-th feature of X, W_s^h is the classifier obtained by training with the support set data, and Y_s is the label matrix of the support set data;
obtaining the support set collaboration features and the support set collaborative classifier from each feature of the support set data, and predicting the unlabeled data X_u through a second formula, wherein the second formula is:
Y_pseudo = W_z Z_u
wherein Z_u represents the collaboration features of the unlabeled data, z_{un} represents the n-th feature in Z_u, and Y_pseudo represents the soft pseudo labels predicted for the unlabeled data;
selecting the most reliable samples through the soft pseudo labels predicted for the unlabeled data, expanding them into the support set, and repeating the training to obtain the optimal classifier with stable performance;
predicting the categories of the query samples with the optimal classifier, wherein the categories of the query samples are:
Ŷ_q = max(W_z Z_q)
wherein Z_q represents the collaboration features of the query set data.
CN202110667364.5A 2021-06-16 2021-06-16 Small sample image classification method based on multi-head feature cooperation Active CN113378942B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110667364.5A CN113378942B (en) 2021-06-16 2021-06-16 Small sample image classification method based on multi-head feature cooperation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110667364.5A CN113378942B (en) 2021-06-16 2021-06-16 Small sample image classification method based on multi-head feature cooperation

Publications (2)

Publication Number Publication Date
CN113378942A CN113378942A (en) 2021-09-10
CN113378942B (en) 2022-07-01

Family

ID=77572839

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110667364.5A Active CN113378942B (en) 2021-06-16 2021-06-16 Small sample image classification method based on multi-head feature cooperation

Country Status (1)

Country Link
CN (1) CN113378942B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114782752B (en) * 2022-05-06 2023-09-05 兰州理工大学 Small sample image integrated classification method and device based on self-training

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104166982A (en) * 2014-06-30 2014-11-26 复旦大学 Image optimization clustering method based on typical correlation analysis
CN106663037A (en) * 2014-06-30 2017-05-10 亚马逊科技公司 Feature processing tradeoff management
CN106909946A (en) * 2017-03-02 2017-06-30 深圳明创自控技术有限公司 A kind of picking system of multi-modal fusion
CN109359694A (en) * 2018-10-24 2019-02-19 中国石油大学(华东) A kind of image classification method and device of the classifier indicated based on mixing collaboration
CN111783831A (en) * 2020-05-29 2020-10-16 河海大学 Complex image accurate classification method based on multi-source multi-label shared subspace learning
CN112101381A (en) * 2020-08-30 2020-12-18 西南电子技术研究所(中国电子科技集团公司第十研究所) Tensor collaborative drawing discriminant analysis remote sensing image feature extraction method
CN112232438A (en) * 2020-11-05 2021-01-15 华东理工大学 High-dimensional image representation-oriented multi-kernel subspace learning framework
CN112819110A (en) * 2021-04-19 2021-05-18 中国科学院自动化研究所 Incremental small sample target detection method and system based on weight generation

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8224042B2 (en) * 2009-03-12 2012-07-17 Seiko Epson Corporation Automatic face recognition
CN111931795B (en) * 2020-09-25 2020-12-25 湖南大学 Multi-modal emotion recognition method and system based on subspace sparse feature fusion

Also Published As

Publication number Publication date
CN113378942A (en) 2021-09-10


Legal Events

PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant