CN111079790B - Image classification method for constructing class center - Google Patents

Image classification method for constructing class center

Info

Publication number
CN111079790B
CN111079790B (application number CN201911129753.1A)
Authority
CN
China
Prior art keywords
image
center
class
vectors
vector
Prior art date
Legal status
Active
Application number
CN201911129753.1A
Other languages
Chinese (zh)
Other versions
CN111079790A (en)
Inventor
Wang Haoqian (王好谦)
Liu Zhihong (刘志宏)
Zhang Yongbing (张永兵)
Yang Fang (杨芳)
Current Assignee
Shenzhen International Graduate School of Tsinghua University
Original Assignee
Shenzhen International Graduate School of Tsinghua University
Priority date
Filing date
Publication date
Application filed by Shenzhen International Graduate School of Tsinghua University
Priority to CN201911129753.1A
Publication of CN111079790A
Application granted
Publication of CN111079790B

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

An image classification method for constructing class centers, comprising: extracting image feature vectors and constructing class center vectors in the feature space; classifying the image features using the Euclidean distance between the image feature vector and the class center vectors as the classification basis; calculating the dispersion degree of the different classes from the pairwise Euclidean distances between the class center vectors; and calculating the loss function of the network from the classification result and the dispersion degree of the different classes, and learning the network parameters and the center vectors with the loss function. The method controls the inter-class distances by directly constructing the class centers, so that the distribution of the image features is more favorable for classification and a better classification effect is obtained. Compared with the prior art, the method enables the features extracted by the network to have better intra-class and inter-class distribution characteristics.

Description

Image classification method for constructing class center
Technical Field
The invention relates to the field of computer vision and image processing, in particular to an image classification method.
Background
Image classification is a traditional computer vision problem: a picture is input into a computer, and the computer identifies the category of the object in the picture, for example whether the picture shows a cat or a dog. Image classification has many applications in practical scenarios, such as face recognition at border inspection, where it must be determined whether the face in the camera and a face picture in the database belong to the same person.
Before deep learning was applied to image classification, conventional image recognition adopted pattern recognition methods, which mainly comprise three parts: image feature acquisition, classifier training and image prediction. Acquiring image features means obtaining from the image features that can be used by a classifier; common image features take the form of high-dimensional vectors. The input of the classifier is the image features and its output is the classification result; the classifier contains unknown trainable parameters that can be continuously optimized with a training set. The training set contains the label of each image, i.e. the category of the object in the picture, and the training process lets the classifier learn the classification prior information contained in the training set. After the classifier is trained, a picture with an unknown label can be input, and its category is output through the computation of the classifier.
With the development of hardware resources and research on algorithm theory, deep learning has been applied extensively to computer vision problems. Image features are extracted with multi-layer convolutional neural networks, which contain a large number of trainable parameters trained on the training set. After the features are extracted, the image still needs to be classified by a classifier; in deep learning the image features are classified with a fully connected layer and a softmax function, which yields the probability that the features belong to each class. Training the network requires a loss function, and the cross entropy loss function is the most common in classification problems.
Features extracted by common deep learning image classification methods are separable in the high-dimensional space, but the intra-class distance may turn out larger than the inter-class distance, so that when two pictures of unknown class are given and it must be judged whether they belong to the same class, no effective decision threshold can be selected. Existing methods classify with a fully connected layer, which essentially relies on the modulus and angle characteristics of the features in the high-dimensional space. In the same spirit, Euclidean distances in the high-dimensional space may be used as the classification basis, and the loss function of the network may be calculated from these Euclidean distances.
The foregoing background is provided only to facilitate an understanding of the inventive concepts and technical aspects of the invention; it does not necessarily belong to the prior art of the present application, and, absent clear evidence that the above content was disclosed before the filing date of the present application, it shall not be used to evaluate the novelty and inventiveness of the present application.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a deep learning image classification method based on a construction type center.
In order to achieve the above purpose, the present invention adopts the following technical scheme:
a deep learning image classification method based on a construction category center comprises the following steps:
1) Inputting a training image, extracting feature vectors of the image by using a multi-layer convolutional neural network, wherein the feature vectors of the image are high-dimensional vectors distributed in a high-dimensional feature space, and meanwhile, constructing center vectors of different categories in the feature space, wherein the center vectors are consistent with the feature vector dimensions of the image;
2) Calculating Euclidean distances between the feature vector of the image and the center vectors of different categories, and taking the category of the center vector with the smallest distance as the category of the feature vector of the image;
3) Calculating the dispersion degree of the feature vector distribution of the images of different categories according to the Euclidean distance between the center vectors of each category;
4) Calculating probability scores of the image belonging to each class according to the Euclidean distances between the feature vector of the image and the center vectors of the classes, and introducing a margin parameter in the probability score calculation to control the intra-class distance;
5) And calculating a network loss function, and updating the values of the network weights and the center vectors of the categories by using a method of inverse gradient propagation.
Further:
In the step 1), the constructed class center vectors are trainable parameters; the class center vectors are initialized by a random initialization method, and their values are updated by the gradient back-propagation method in the step 5). Assuming that the number of classification classes is m, all the center vectors are expressed as $c_i$, $i = 1, 2, \dots, m$.
In the step 2), the feature vector of the image is represented as f, and its distance to the class center vector of the i-th class is $L_i$, $i = 1, 2, \dots, m$. The minimum distance

$$L_k = \min_{i = 1, \dots, m} L_i$$

is found, and the category of the image feature is judged as the k-th category.

In the step 3), assume that the center vectors of the different classes are $c_i$, $i = 1, 2, \dots, m$; the Euclidean distance between every two center vectors is calculated and expressed as $D_{ij}$, where $i, j = 1, 2, \dots, m$ and $i \neq j$. Averaging all or part of the elements of $D_{ij}$ gives the dispersion degree of the different classes.
In the step 3), averaging part of the elements of $D_{ij}$ means averaging the Euclidean distance between each class center vector and its nearest center vector.
In the step 4), the distances $L_i$, $i = 1, 2, \dots, m$, between the feature vector f of the image and the center vectors of the different categories are used to calculate the probability scores, specifically as follows. All the distances are first normalized to obtain the relative distances

$$R_i = \frac{L_i}{\sum_{j=1}^{m} L_j}, \quad i = 1, 2, \dots, m$$

The larger $R_i$ is, the larger the distance between the feature vector of the image and the center vector of the i-th class. In order to constrain the intra-class distance, a margin is introduced when calculating the probability scores. Assuming that the true label of the feature vector of the image is the k-th class, the probability score of the feature vector belonging to the k-th class is calculated as

$$P_k = \frac{e^{m R_k}}{e^{m R_k} + \sum_{j \neq k} e^{R_j}} \tag{1}$$

and the probability score of the feature vector belonging to any other category $i \neq k$ is expressed as

$$P_i = \frac{e^{R_i}}{e^{m R_k} + \sum_{j \neq k} e^{R_j}} \tag{2}$$

The smaller the probability score $P_i$, the larger the probability that the feature vector of the image belongs to the i-th class; m represents the margin.
m is greater than 1.
In the step 5), the calculation of the loss function comprises two parts, namely the classification loss $Loss_1$ of the image features and the dispersion degree loss $Loss_2$ of the class centers. The two losses are combined with a hyper-parameter $\lambda$ that adjusts the weights of the different constraints, i.e. the final training loss is

$$Loss = Loss_1 + \lambda \, Loss_2 \tag{3}$$

The classification loss $Loss_1$ is computed using cross entropy. After the loss function Loss is obtained, the values of the network weights and the center points are updated by the gradient back-propagation method.
A computer readable storage medium storing a computer program which when executed by a processor implements the image classification method.
The invention has the following beneficial effects:
the invention provides an image classification method for constructing a class center, wherein image features are extracted, and the class center is constructed in a feature space; taking Euclidean distance between the image features and the class center as the classifying basis of the image features to classify the image features; calculating the dispersion degree of different categories according to the Euclidean distance between every two center vectors; and calculating a loss function of the network according to the classification result and the dispersion degree of different categories, and learning the network parameters and the center vector by using the loss function. The image classification method has the characteristic of controlling the distances between the classes, and the method controls the distances between the classes by directly constructing the class center, so that the distribution of the image features is more beneficial to classification, and a better classification effect is obtained. Compared with the prior art, the method can lead the characteristics extracted by the network to have better intra-class and inter-class distribution characteristics. Compared with a classification network of full-connection layer plus softmax operation, the center point thought and Euclidean distance calculation provided by the invention can directly control the distribution condition of the feature vectors of different categories in a high-dimensional space, so that not only can the feature vector of each category be ensured to be close to the center of the category as much as possible, but also the feature vectors of different categories can be kept away as much as possible. The method of the invention has very visual geometric interpretation, and is different from the full-connection layer and softmax method in that the geometric interpretation of the full-connection layer and softmax method is vector angle characteristic of high-dimensional space, the invention is Euclidean distance characteristic of high-dimensional space, and a trainable class center point is introduced. The loss function obtains a very good classifying effect on the task of image classification, and better intra-class and inter-class feature distribution characteristics can be obtained by adjusting the super parameters in the method.
Drawings
Fig. 1 is a basic flow of an image classification method according to an embodiment of the present invention.
FIG. 2 is a schematic diagram of the first dispersion degree measure according to an embodiment of the present invention.
FIG. 3 is a schematic diagram of the second dispersion degree measure according to an embodiment of the present invention.
FIG. 4 is a schematic diagram of two-class classification without the margin parameter according to an embodiment of the present invention.
FIG. 5 is a schematic diagram of two-class classification with the margin parameter according to an embodiment of the present invention.
Detailed Description
The following describes embodiments of the present invention in detail. It should be emphasized that the following description is merely exemplary in nature and is in no way intended to limit the scope of the invention or its applications.
Referring to fig. 1 to 5, in one embodiment, a deep learning image classification method based on a build category center includes the steps of:
step 1), constructing a deep learning model, inputting a training image, extracting feature vectors of the image by using a multi-layer convolutional neural network, wherein the feature vectors of the image are high-dimensional vectors distributed in a high-dimensional feature space, and meanwhile, constructing center vectors of different types in the feature space, wherein the center vectors are consistent with the feature vector dimensions of the image;
step 2), calculating Euclidean distances between the feature vector of the image and the center vectors of different categories, and taking the category of the center vector with the smallest distance as the category of the feature vector of the image;
step 3), calculating the dispersion degree of the feature vector distribution of the images of different categories according to the Euclidean distance between the center vectors of each category;
step 4), calculating probability scores of the images belonging to each class according to Euclidean distances between the feature vectors of the images and the center vectors of the classes, and introducing margin parameters in the probability score calculation to control the intra-class distances;
step 5), calculating a network loss function, and learning network parameters and center vectors by using the loss function.
The image features extracted in the step 1) are high-dimensional vectors distributed in a high-dimensional feature space; a center feature vector is built for each category in the feature space, with dimensions consistent with those of the image feature vectors. Specifically, the constructed class center vectors are trainable parameters, initialized by a random initialization method, and their values are updated by the gradient back-propagation method in the step 5). Assuming that the number of classes is m, all the center vectors are expressed as $c_i$, $i = 1, 2, \dots, m$.
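As an illustration of the construction in step 1), the following is a minimal sketch assuming a PyTorch implementation; the names CenterClassifier, backbone, feat_dim and num_classes are hypothetical placeholders rather than terms from the patent.

```python
# Minimal sketch (assumption: PyTorch); backbone is any multi-layer CNN
# that maps images to 1 x n feature vectors.
import torch
import torch.nn as nn

class CenterClassifier(nn.Module):
    def __init__(self, backbone: nn.Module, feat_dim: int, num_classes: int):
        super().__init__()
        self.backbone = backbone
        # One trainable center vector c_i per class, with the same dimension
        # as the image features; randomly initialized (the patent allows a
        # Gaussian or uniform initialization).
        self.centers = nn.Parameter(torch.randn(num_classes, feat_dim))

    def forward(self, images: torch.Tensor) -> torch.Tensor:
        # Feature vectors f, shape (batch, feat_dim).
        return self.backbone(images)
```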
In the step 2), the Euclidean distance between the feature vector of the image and the center vector of each category is calculated, and the image is classified into the category of the center vector with the smallest Euclidean distance to the image feature. Representing the feature vector of the image as f, the distance between it and the class center vector of the i-th class is

$$L_i = \| f - c_i \|_2, \quad i = 1, 2, \dots, m$$
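Under the same PyTorch assumption, the nearest-center decision rule of step 2) can be sketched as follows (the helper name classify is hypothetical):

```python
# Classify each feature vector by the class center at the smallest
# Euclidean distance L_i = ||f - c_i||_2.
def classify(features: torch.Tensor, centers: torch.Tensor) -> torch.Tensor:
    distances = torch.cdist(features, centers, p=2)  # (batch, m) distance matrix
    return distances.argmin(dim=1)                   # index k of the closest center
```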
In the step 3), the Euclidean distances between the class center vectors are used to express the degree of dispersion of the feature vectors of the different classes. There are two ways of expressing the dispersion degree: the first is to average the pairwise Euclidean distances between the center vectors, and the second is to average the Euclidean distance between each class center vector and its nearest center vector.
Specifically, assume that the center vectors of the different classes are $c_i$, $i = 1, 2, \dots, m$; the Euclidean distance between every two center vectors is calculated and expressed as $D_{ij}$, where $i, j = 1, 2, \dots, m$ and $i \neq j$. Averaging all or part of the elements of $D_{ij}$ gives the dispersion degree of the different classes. In one embodiment, the averaging over part of the elements of $D_{ij}$ may be averaging the Euclidean distance between each class center vector and its nearest center vector.
In the step 4), the probability score of the image belonging to each class is calculated according to the Euclidean distance between the image characteristics and the center of each class. In the calculation process of the probability score, in order to make the intra-class distance as small as possible, a margin parameter is introduced, so that the Euclidean distance between the image feature and the center of the correct class is ensured to be far smaller than the Euclidean distance between the image feature and the center of the wrong class.
In a preferred embodiment, the distances $L_i$, $i = 1, 2, \dots, m$, between the feature vector f of the image and the center vectors of the different classes are used to calculate the probability scores, specifically as follows. All the distances are first normalized to obtain the relative distances

$$R_i = \frac{L_i}{\sum_{j=1}^{m} L_j}, \quad i = 1, 2, \dots, m$$

The larger $R_i$ is, the larger the distance between the feature vector of the image and the center vector of the i-th class. In order to constrain the intra-class distance, the softmax-based classification of image features is modified and a margin is introduced when calculating the probability score. Assuming that the true label of the feature vector of the image is the k-th class, the probability score of the feature vector belonging to the k-th class is calculated with the method of the application as

$$P_k = \frac{e^{m R_k}}{e^{m R_k} + \sum_{j \neq k} e^{R_j}} \tag{1}$$

and the probability score of the feature vector belonging to any other category $i \neq k$ is expressed as

$$P_i = \frac{e^{R_i}}{e^{m R_k} + \sum_{j \neq k} e^{R_j}} \tag{2}$$

The smaller the probability score $P_i$, the larger the probability that the feature vector of the image belongs to the i-th class; m represents the margin. Preferably, the value of m is greater than 1 in order to strengthen the intra-class distance constraint.
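A sketch of these margin-based probability scores under the same PyTorch assumption; the parameter is called margin here only to keep it distinct from the class count, which the patent also denotes m:

```python
# Equations (1) and (2): a softmax over the relative distances R_i, with
# the true-class distance scaled by the margin (> 1).
def probability_scores(distances: torch.Tensor, labels: torch.Tensor,
                       margin: float = 2.0) -> torch.Tensor:
    R = distances / distances.sum(dim=1, keepdim=True)  # relative distances R_i
    rows = torch.arange(R.size(0))
    R_scaled = R.clone()
    R_scaled[rows, labels] = margin * R[rows, labels]   # m * R_k for the true class
    # Row b: P_i = e^{R'_i} / sum_j e^{R'_j}; rows sum to 1, and a smaller
    # P_i means a higher probability of belonging to class i.
    return torch.softmax(R_scaled, dim=1)
```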
In the step 5), the loss function of the network is calculated on the basis of the image classification probability scores and the dispersion degree of the different classes. The loss function comprises two parts, a classification constraint and a dispersion degree constraint: the classification constraint is calculated using the cross entropy loss, the dispersion degree constraint is expressed by the dispersion degree of the different classes, and a weight coefficient distributes different weights between the two constraints. The network weights and center vectors are then updated using the back-propagation method.

Specifically, the calculation of the loss function comprises the classification loss $Loss_1$ of the image features and the dispersion degree loss $Loss_2$ of the class centers; the two losses use the hyper-parameter $\lambda$ to adjust the weights of the different constraints, i.e. the final training loss is

$$Loss = Loss_1 + \lambda \, Loss_2 \tag{3}$$
in a preferred embodiment, after the Loss function Loss is obtained, the values of the network weights and center points are updated using a gradient back-propagation method.
In some embodiments, the present invention provides an image classification method for constructing a class center, which mainly includes: extracting image features and constructing a category center in a feature space; the Euclidean distance between the image features and the class center is used as the classifying basis of the image features to classify the image features; calculating the dispersion degree of different categories according to the Euclidean distance between every two center vectors; and calculating a loss function of the network according to the classification result and the dispersion degree of different categories, and learning the network parameters and the center vector by using the loss function.
In some embodiments, the present invention generally comprises the following steps. First, a feature extraction network is constructed: abstract features of the input image are extracted with a multi-layer convolutional network to obtain high-dimensional feature vectors usable for classification, and center vectors of the different classes are constructed in the feature space. Second, the feature category is judged: the Euclidean distance between the image feature vector and each class center vector is calculated, and the class of the center vector with the smallest distance is taken as the class of the image feature. Third, the dispersion degree of the feature distributions of the different classes is calculated from the Euclidean distances between the class center vectors. Fourth, the probability scores of the image features belonging to each class are calculated, with a margin parameter introduced to control the intra-class distance. Fifth, the network loss function is calculated, and the values of the network weights and the center vectors are updated by gradient back-propagation. In these embodiments, the loss function is calculated with Euclidean distances, and the intra-class and inter-class distances of the image features are controlled through the margin and the dispersion degree.
The first step specifically comprises the following steps: features of the input image are extracted using a multi-layer convolution, the image features being high-dimensional vectors distributed in feature space. And constructing center vectors of different categories in the feature space, wherein the dimension of the center vector is consistent with the feature vector of the image.
The second step specifically comprises: the Euclidean distance between the image feature and the class center vector is calculated, and the feature is classified into the class of the center vector with the smallest Euclidean distance.
The third step specifically comprises: calculating the Euclidean distances between the class center vectors to represent the dispersion degree of the feature vectors of the different classes. There are two ways of expressing the dispersion degree: the first is to average the pairwise Euclidean distances between the center vectors, and the second is to average the Euclidean distance between each class center vector and its nearest center vector.
The fourth step specifically comprises: and calculating the probability score of the image belonging to each class according to the Euclidean distance between the image characteristics and the center of each class. In the calculation process of the probability score, in order to make the intra-class distance as small as possible, a margin parameter is introduced, so that the Euclidean distance between the image feature and the center of the correct class is ensured to be far smaller than the Euclidean distance between the image feature and the center of the wrong class.
The fifth step specifically includes: and calculating a loss function of the network on the basis of the dispersion degree of different categories and the image classification probability score, wherein the loss function comprises two parts of dispersion degree constraint and classification constraint, the classification constraint is calculated by using cross entropy loss, the dispersion degree constraint is expressed by the dispersion degree of different categories, and different weights are distributed between the two constraints through a weight coefficient. The network weights and center vectors are updated using a back propagation method.
As will be described in further detail below.
Center vector construction: as shown in FIG. 1, the flow of the image classification task comprises two parts, feature extraction and classification using the features. Feature extraction relies on a convolutional neural network: the input is a picture to be classified, and after several convolutional layers a high-dimensional vector is obtained, namely the image features in FIG. 1. Assuming that the dimension of the image features is $1 \times n$ and the number of categories is m, m feature vectors $c_i$, $i = 1, 2, \dots, m$, are built in the feature space $\mathbb{R}^n$ and serve as the center vectors of the different categories; the values of the center vectors are randomly initialized with a Gaussian or uniform distribution.
Degree of dispersion: to ensure the classification effect, the features of different classes must be kept as far apart as possible; in the present application the inter-class distance of the image features is controlled by controlling the distances between the centers of the different classes. Let all the class centers be $c_i$, $i = 1, 2, \dots, m$. There are two specific implementations for calculating the dispersion degree of the different classes. The first method is shown in FIG. 2, where the small dots represent the distribution of the class centers. The Euclidean distances between each center and all the other center points are calculated; FIG. 2 draws the Euclidean distances between center point 1 and the other center points, and the corresponding distances are calculated for the other points in the same way, so that each point has m−1 Euclidean distances and, with m points in total, there are m(m−1) distances, expressed as $D_{ij}$, where $i, j = 1, 2, \dots, m$ and $i \neq j$. The average of all the Euclidean distances is taken as the dispersion degree of all the center points, i.e.

$$Dis_{avg} = \frac{1}{m(m-1)} \sum_{i \neq j} D_{ij}$$

The intuitive understanding is that the Euclidean distance between every two center points is calculated; the larger $Dis_{avg}$ is, the farther the center points are from each other as a whole, the larger the dispersion degree, and the larger the inter-class distances of the image features. The disadvantage of this method is that the training process makes $Dis_{avg}$ as large as possible, but it may do so by pushing centers that are already far from each other even farther apart, while centers that are close together remain relatively unchanged or even move closer, which is detrimental to the final classification effect. Therefore, the present application proposes a second method for calculating the dispersion degree, shown in FIG. 3, where each small dot still represents a class center and the number on the dot is the index of the center point. For a center $c_i$, the distances to all the other center points are still calculated, but only the distance to its nearest center point is retained; for example, for the center point $c_3$ in FIG. 3, point 1 is the nearest of the remaining points and only $D_{31}$ is kept. In the same way, all the $D_{i k_i}$, $i = 1, 2, \dots, m$, are found, where the value of $k_i$ satisfies $k_i = \arg\min_{j \neq i} D_{ij}$. The lines in FIG. 3 represent all the $D_{i k_i}$; averaging them gives the dispersion degree, expressed as

$$Dis_{min} = \frac{1}{m} \sum_{i=1}^{m} D_{i k_i}$$
Compared with the first method, the second considers only the distances between relatively near points, so that two centers with a small distance keep moving apart during training, avoiding the situation where the distance between two center points becomes too small.
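Both dispersion measures can be sketched under the same PyTorch assumption; the mode strings "avg" and "min" mirror $Dis_{avg}$ and $Dis_{min}$ and are hypothetical names:

```python
# Dis_avg averages all m(m-1) pairwise center distances; Dis_min averages
# each center's distance to its nearest neighbouring center.
def dispersion(centers: torch.Tensor, mode: str = "min") -> torch.Tensor:
    m = centers.size(0)
    D = torch.cdist(centers, centers, p=2)   # D[i, j] = ||c_i - c_j||_2
    if mode == "avg":
        return D.sum() / (m * (m - 1))       # the zero diagonal contributes nothing
    # Exclude the diagonal so a center is never its own nearest neighbour.
    eye = torch.eye(m, dtype=torch.bool, device=centers.device)
    return D.masked_fill(eye, float("inf")).min(dim=1).values.mean()
```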
Classification probability: the abstract feature f of the image is obtained through the feature extraction network, and the center vector of the i-th class is $c_i$. The Euclidean distance between the image feature and each center vector is calculated and denoted $L_i = \| f - c_i \|_2$, $i = 1, 2, \dots, m$, i.e. the Euclidean distance between the image feature vector and the center of the i-th class. The image is classified into the category of the center point closest to its features, i.e. the minimum $L_k = \min_{i=1,\dots,m} L_i$ is found and the image is classified into the k-th class. To limit the range of values of the distances, they are normalized to obtain the relative distances

$$R_i = \frac{L_i}{\sum_{j=1}^{m} L_j}, \quad i = 1, 2, \dots, m$$

The smaller the value of $R_i$, the smaller the distance between the feature and the i-th center, and the more likely the image feature belongs to the i-th class.
Consider the case of only two classes, with relative distances $R_1$ and $R_2$ satisfying $R_1 + R_2 = 1$. When $R_1 < R_2$ the image is classified into class 1, and when $R_1 > R_2$ into class 2, so the classification plane is $R_1 = R_2 = 0.5$. As shown in FIG. 4, the cross-hatched area represents the range of $R_1$ classified as class 1, and the vertically hatched area the range of $R_1$ classified as class 2. In the region around $R_1 = 0.5$ a sample may fall into either class, and a small fluctuation may change the classification, making it unstable; a margin is therefore considered.
Likewise, for the two-class problem, once the two relative distances $R_1$ and $R_2$ are obtained, the margin parameter m enters the classification condition: when $m R_1 < R_2$ the image is classified into class 1, and when $R_1 > m R_2$ into class 2, so the classification surfaces are calculated as

$$R_1 = \frac{1}{1+m} \quad \text{and} \quad R_1 = \frac{m}{1+m}$$

When the value of m is greater than 1 there is a certain margin between the two classification surfaces, as in the classification case of m = 3 shown in FIG. 5, where the surfaces lie at $R_1 = 0.25$ and $R_1 = 0.75$: the horizontal and vertical hatched regions are the ranges of $R_1$ classified into class 1 and class 2 respectively, and the hatched area in the middle is the margin. After the margin is added, compared with the method without a margin, an image whose $R_1$ lies in the region around 0.5 is not classified, because the relative distances there are not reliable enough; a classification is made only when the relative distances are sufficiently reliable.
In a general classification task the loss function is computed with cross entropy, which requires a probability score of the image for each class. With the margin, and assuming the true label is the k-th class, the probability scores of the image for each class are those of equations (1) and (2), which satisfy

$$\sum_{i=1}^{m} P_i = 1$$

However, $P_i$ is not a direct probability: the smaller $P_i$ is, the greater the probability that the image belongs to the i-th class. When m > 1 the intra-class constraint is strengthened, so that the image features of the same class gather as closely as possible, a certain margin is left at the classification surface, and the learned image features are classified more reliably.
Loss function: the loss function comprises two parts, the classification loss and the dispersion degree loss. After the probability score of the image features for each class is obtained, the classification loss is calculated with cross entropy: assuming that the image label is the k-th class and that the probability score of the image belonging to the k-th class is $P_k$, the classification loss is expressed as

$$Loss_1 = \log P_k \tag{5}$$

The smaller the probability score of the correct classification, the smaller the loss value, so the network parameters are optimized towards a smaller probability score for the correct class.
The dispersion degree of the class centers can be expressed as $Dis_{avg}$ or $Dis_{min}$, both of the same order of magnitude as the Euclidean distances. In order to ensure fast convergence in the early stage of training and small gradients in the late stage, so that the network converges stably, a logarithm is applied on top of the Euclidean-distance-based dispersion, i.e. the dispersion loss is expressed as

$$Loss_2 = -\log Dis \tag{6}$$

where Dis may be $Dis_{avg}$ or $Dis_{min}$; when Dis is larger, the value of this loss is smaller and so is its gradient.
The weights of the two losses are adjusted by the parameter $\lambda$, and the final loss function is expressed as

$$Loss = Loss_1 + \lambda \, Loss_2 \tag{7}$$
Then the values of the network weights and the center vectors are optimized by gradient back-propagation, so that the loss value decreases continuously; correspondingly, the classification accuracy improves continuously, and the distribution of the image features meets the requirements of a smaller intra-class distance and a larger inter-class distance.
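Putting the pieces together, a single training step could look like the following sketch; it reuses the hypothetical helpers above, ConvNet stands for any multi-layer CNN backbone, and the optimizer choice, learning rate, λ and margin values are illustrative assumptions rather than values prescribed by the patent.

```python
# One training step for Loss = Loss_1 + lambda * Loss_2 (equation (7)).
model = CenterClassifier(backbone=ConvNet(), feat_dim=128, num_classes=10)  # ConvNet is a placeholder
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)  # centers are Parameters, so they are updated too

def training_step(images: torch.Tensor, labels: torch.Tensor,
                  lam: float = 0.1, margin: float = 2.0) -> float:
    features = model(images)
    distances = torch.cdist(features, model.centers, p=2)
    P = probability_scores(distances, labels, margin)
    # Loss_1 = log P_k (equation (5)): smaller correct-class score, smaller loss.
    loss_cls = torch.log(P[torch.arange(len(labels)), labels]).mean()
    # Loss_2 = -log Dis (equation (6)): larger center dispersion, smaller loss.
    loss_disp = -torch.log(dispersion(model.centers, mode="min"))
    loss = loss_cls + lam * loss_disp
    optimizer.zero_grad()
    loss.backward()    # gradient back-propagation through weights and centers
    optimizer.step()
    return loss.item()
```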
The foregoing is a further detailed description of the invention in connection with specific/preferred embodiments, and it is not intended that the invention be limited to such description. It will be apparent to those skilled in the art that several alternatives or modifications can be made to the described embodiments without departing from the spirit of the invention, and these alternatives or modifications should be considered to be within the scope of the invention. In the description of the present specification, reference to the terms "one embodiment," "some embodiments," "preferred embodiments," "examples," "specific examples," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, the different embodiments or examples described in this specification and the features of the different embodiments or examples may be combined and combined by those skilled in the art without contradiction.

Claims (7)

1. A deep learning image classification method based on a construction category center comprises the following steps:
1) Inputting a training image, extracting feature vectors of the image by using a multi-layer convolutional neural network, wherein the feature vectors of the image are high-dimensional vectors distributed in a high-dimensional feature space, and meanwhile, constructing center vectors of different categories in the feature space, wherein the center vectors are consistent with the feature vector dimensions of the image;
2) Calculating Euclidean distances between the feature vector of the image and the center vectors of different categories, and taking the category of the center vector with the smallest distance as the category of the feature vector of the image;
3) Calculating the dispersion degree of the feature vector distributions of the images of different categories according to the Euclidean distances between the center vectors of each category; assume that the center vectors of the different classes are $c_i$, $i = 1, 2, \dots, m$, and that the Euclidean distance between every two center vectors is calculated and expressed as $D_{ij}$, where $i, j = 1, 2, \dots, m$ and $i \neq j$; the Euclidean distance between each class center vector and its nearest center vector is averaged to obtain the dispersion degree of the different classes; wherein for a center $c_i$ the distances to all the other center points are calculated but only the distance to its nearest center point is retained, so as to find all the $D_{i k_i}$, $i = 1, 2, \dots, m$, where the value of $k_i$ satisfies $k_i = \arg\min_{j \neq i} D_{ij}$; averaging the $D_{i k_i}$ gives the dispersion degree, expressed as $Dis_{min} = \frac{1}{m} \sum_{i=1}^{m} D_{i k_i}$;
4) Calculating probability scores of the image belonging to each class according to the Euclidean distances between the feature vector of the image and the center vectors of the classes, and introducing a margin parameter in the probability score calculation to control the intra-class distance;
5) Calculating a network loss function based on the image classification probability scores and the dispersion degrees of the different classes, wherein the calculation of the loss function comprises two parts, namely the classification loss $Loss_1$ of the image features and the dispersion degree loss $Loss_2$ of the class centers, and learning the network parameters and the center vectors of the categories by using the loss function.
2. The image classification method according to claim 1, wherein in the step 1), the constructed class center vectors are trainable parameters, the class center vectors are initialized by a random initialization method, and in the step 5) the values of the class center vectors are updated by the gradient back-propagation method; assuming that the number of classification classes is m, all the center vectors are expressed as $c_i$, $i = 1, 2, \dots, m$.
3. The image classification method according to claim 1 or 2, wherein in the step 2), the feature vector of the image is represented as f, the distance between the feature vector of the image and the center vector of the i-th class is $L_i$, $i = 1, 2, \dots, m$, and the minimum distance $L_k = \min_{i=1,\dots,m} L_i$ is found, i.e. the category of the image feature is judged as the k-th category.
4. The image classification method according to any one of claims 1 to 2, wherein in the step 4), the distances $L_i$, $i = 1, 2, \dots, m$, between the feature vector f of the image and the center vectors of the different classes are used to calculate the probability scores, specifically as follows: all the distances are first normalized to obtain the relative distances $R_i = \frac{L_i}{\sum_{j=1}^{m} L_j}$; the larger $R_i$ is, the larger the distance between the feature vector of the image and the center vector of the i-th class; a margin is introduced when calculating the probability scores in order to constrain the intra-class distance; assuming that the true label of the feature vector of the image is the k-th class, the probability score of the feature vector belonging to the k-th class is calculated as

$$P_k = \frac{e^{m R_k}}{e^{m R_k} + \sum_{j \neq k} e^{R_j}}$$

where e is a natural constant, and the probability score of the feature vector belonging to any other category $i \neq k$ is expressed as

$$P_i = \frac{e^{R_i}}{e^{m R_k} + \sum_{j \neq k} e^{R_j}}$$

the smaller the probability score, the greater the probability that the feature vector of the image belongs to the corresponding class, and m represents the margin.
5. The image classification method of claim 4, wherein m has a value greater than 1.
6. The image classification method according to any one of claims 1 to 2, wherein in the step 5), the two loss functions use the hyper-parameter $\lambda$ to adjust the weights of the different constraints, i.e. the final training loss is

$$Loss = Loss_1 + \lambda \, Loss_2 \tag{3}$$

the classification loss $Loss_1$ is calculated using cross entropy; and the values of the network weights and the center vectors of the categories are updated by the gradient back-propagation method.
7. A computer-readable storage medium storing a computer program, characterized in that the computer program, when executed by a processor, implements the image classification method of any one of claims 1 to 6.
CN201911129753.1A 2019-11-18 2019-11-18 Image classification method for constructing class center Active CN111079790B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911129753.1A CN111079790B (en) 2019-11-18 2019-11-18 Image classification method for constructing class center

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911129753.1A CN111079790B (en) 2019-11-18 2019-11-18 Image classification method for constructing class center

Publications (2)

Publication Number Publication Date
CN111079790A CN111079790A (en) 2020-04-28
CN111079790B true CN111079790B (en) 2023-06-30

Family

ID=70311126

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911129753.1A Active CN111079790B (en) 2019-11-18 2019-11-18 Image classification method for constructing class center

Country Status (1)

Country Link
CN (1) CN111079790B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021214943A1 (en) * 2020-04-23 2021-10-28 日本電信電話株式会社 Parameter optimization method, non-temporary recording medium, feature amount extraction method, and parameter optimization device
CN112633407B (en) * 2020-12-31 2023-10-13 深圳云天励飞技术股份有限公司 Classification model training method and device, electronic equipment and storage medium
CN112836629B (en) * 2021-02-01 2024-03-08 清华大学深圳国际研究生院 Image classification method
CN116912920B (en) * 2023-09-12 2024-01-05 深圳须弥云图空间科技有限公司 Expression recognition method and device

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110084318A (en) * 2019-05-07 2019-08-02 哈尔滨理工大学 A kind of image-recognizing method of combination convolutional neural networks and gradient boosted tree

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107067020B (en) * 2016-12-30 2019-11-15 腾讯科技(上海)有限公司 Image identification method and device
CN108235770B (en) * 2017-12-29 2021-10-19 达闼机器人有限公司 Image identification method and cloud system
CN108108807B (en) * 2017-12-29 2020-06-02 北京达佳互联信息技术有限公司 Learning type image processing method, system and server
CN108304859B (en) * 2017-12-29 2021-03-30 达闼科技(北京)有限公司 Image identification method and cloud system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110084318A (en) * 2019-05-07 2019-08-02 哈尔滨理工大学 A kind of image-recognizing method of combination convolutional neural networks and gradient boosted tree

Also Published As

Publication number Publication date
CN111079790A (en) 2020-04-28

Similar Documents

Publication Publication Date Title
CN111079790B (en) Image classification method for constructing class center
CN111079639B (en) Method, device, equipment and storage medium for constructing garbage image classification model
CN109241817B (en) Crop image recognition method shot by unmanned aerial vehicle
CN108647583B (en) Face recognition algorithm training method based on multi-target learning
CN105488528B (en) Neural network image classification method based on improving expert inquiry method
CN111178208B (en) Pedestrian detection method, device and medium based on deep learning
KR101254177B1 (en) A system for real-time recognizing a face using radial basis function neural network algorithms
CN109977757B (en) Multi-modal head posture estimation method based on mixed depth regression network
KR101254181B1 (en) Face recognition method using data processing technologies based on hybrid approach and radial basis function neural networks
US20200218971A1 (en) Training of deep neural networks on the basis of distributions of paired similarity measures
CN109063719B (en) Image classification method combining structure similarity and class information
CN110516537B (en) Face age estimation method based on self-learning
US20210326661A1 (en) Determining an explanation of a classification
López-Rubio Probabilistic self-organizing maps for continuous data
CN111104831B (en) Visual tracking method, device, computer equipment and medium
US20220391611A1 (en) Non-linear latent to latent model for multi-attribute face editing
CN114283345A (en) Small sample city remote sensing image information extraction method based on meta-learning and attention
Majeed et al. Uncertain fuzzy self-organization based clustering: interval type-2 fuzzy approach to adaptive resonance theory
CN113705596A (en) Image recognition method and device, computer equipment and storage medium
KR101676101B1 (en) A Hybrid Method based on Dynamic Compensatory Fuzzy Neural Network Algorithm for Face Recognition
CN112836629B (en) Image classification method
CN109446368B (en) Bird retrieval method and system based on category information
Hu et al. An integrated classification model for incremental learning
Bailly et al. Boosting feature selection for neural network based regression
CN114549969A (en) Saliency detection method and system based on image information fusion

Legal Events

Code Title
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant