CN113255695A - Feature extraction method and system for target re-identification - Google Patents
Feature extraction method and system for target re-identification
- Publication number
- CN113255695A (application CN202110557889.3A)
- Authority
- CN
- China
- Prior art keywords
- feature vector
- loss function
- feature
- neural network
- sub
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
The invention provides a feature extraction method and system for target re-identification. The feature extraction method comprises: acquiring a picture to be extracted; and extracting features from the picture based on a neural network model trained in advance with a slicing method. The neural network model is trained as follows: acquiring an input image; inputting the input image into a preset neural network model to obtain a first feature vector; slicing the first feature vector to obtain a plurality of second feature vectors; slicing the category center parameters to obtain a plurality of category center parameter slices; determining a total loss function based on the category center parameter slices, the second feature vectors, the first feature vector and the category center parameters; and updating the category center parameters and the parameters of the neural network model based on the total loss function. This feature extraction method for target re-identification can simultaneously meet the different requirements of different services on retrieval precision, retrieval time and storage space.
Description
Technical Field
The invention relates to the technical field of image recognition, and in particular to a feature extraction method and system for target re-identification.
Background
At present, different service scenarios place different requirements on the retrieval precision, retrieval time and storage space of target re-identification. For example, some services need very high retrieval precision and have relatively loose requirements on retrieval time and storage space, while other services require faster retrieval and less storage space. The conventional practice for handling these different services is to train several deep learning models separately, each extracting feature vectors of a different length, and then to select a model according to the service requirements.
If multiple different service requirements exist simultaneously in the same target re-identification system, each with its own demands on retrieval precision, retrieval time and storage space, the conventional approach requires training multiple models, which increases the complexity and space cost of the whole system as well as the development cost.
Disclosure of Invention
One of the objectives of the present invention is to provide a feature extraction method for target re-identification, which can simultaneously meet different requirements of different services on retrieval accuracy, retrieval time and storage space.
The feature extraction method for target re-identification provided by the embodiment of the invention comprises the following steps:
acquiring a picture to be extracted;
and carrying out feature extraction on the picture to be extracted based on a neural network model which is trained by adopting a slicing method in advance.
Preferably, the training step of the neural network model is as follows:
acquiring an input image;
inputting the input image into a preset neural network model to obtain a first feature vector;
slicing the first feature vector to obtain a plurality of second feature vectors;
slicing the category center parameters to obtain a plurality of category center parameter slices;
determining a total loss function based on the category-centric parameter slice, the second feature vector, the first feature vector, and the category-centric parameter;
updating the class center parameters and the parameters of the neural network model based on a total loss function.
Preferably, a total loss function is determined based on the class center parameter slice, the second feature vector, the first feature vector and the class center parameter; the method comprises the following steps:
determining a plurality of first sub-loss functions based on the category-centric parameter slice and the second feature vector;
determining a second sub-loss function based on the first feature vector and the class center parameter;
determining the total loss function based on the plurality of first sub-loss functions and the second sub-loss function; the total loss function is calculated as:
L = Σ_i μ_i · L_i (summed over the selected slice lengths i, up to i = N)
where L is the total loss function; the terms μ_i · L_i with i < N are the first sub-loss functions; μ_N · L_N is the second sub-loss function; μ_i is the coefficient of the sub-loss function for feature length i, with μ_N ≠ 0; N is the length of the feature vector output by the model; and L_i denotes the sub-loss function for feature length i.
In the sub-loss functions, BS denotes the number of images in one training batch; θ_(i,y_b) denotes the angle between the length-i feature vector of the target in the b-th picture and its corresponding actual class-center vector; θ_(i,j) denotes the angle between the length-i feature vector of the target in the b-th picture and the center vectors of the other classes; cn denotes the number of classification categories; and s, m, k are all hyper-parameters.
Preferably, the first feature vector is sliced to obtain a plurality of second feature vectors as follows:
starting from the foremost element of the first feature vector, sequentially taking preset numbers of leading elements as the second feature vectors.
Preferably, the category center parameter slices correspond to the second feature vectors one to one.
Preferably, the class center parameters and the parameters of the neural network model are updated by a back propagation method.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
FIG. 1 is a schematic diagram of model training of a feature extraction method for target re-identification in an embodiment of the present invention;
FIG. 2 is a diagram illustrating the positions of the selected feature vector elements and their corresponding loss functions;
FIG. 3 is a diagram illustrating the angles of the target feature vector and the class center vector.
Detailed Description
The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.
Referring to fig. 1 to 3, batch_size in fig. 1 is the number of images in one batch during model training; h is the image height; w is the image width; c is the number of image channels; N is the length of the feature vector output by the model; class_num is the number of target classes in the training images. A feature vector slice is obtained by taking a number of leading elements of the feature vector, starting from its first element, as a new, shorter feature vector.
A conventional training process is as follows:
training with an image classification network, where each distinct target is treated as a separate class. The classification network can be divided by function into two sub-networks: the first, sub-network A, extracts picture features; the second, sub-network B, classifies based on the features output by A. After the classification network is trained, sub-network B is discarded and sub-network A is retained to extract picture features for the target re-identification task.
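A minimal NumPy sketch of this conventional two-sub-network setup may help; the linear maps standing in for sub-networks A and B, and all shapes, are illustrative assumptions rather than the patent's actual architecture:

```python
import numpy as np

N, class_num, batch_size = 8, 5, 4
rng = np.random.default_rng(0)

# Sub-network A: the feature extractor. A fixed linear map stands in here for
# a real convolutional backbone; it maps flattened pixels to N-dim features.
W_a = rng.standard_normal((32, N))
def subnetwork_a(images):            # images: (batch_size, 32)
    return images @ W_a              # -> (batch_size, N) feature vectors

# Sub-network B: the classifier head. Its weight matrix is what the patent
# calls the "category center parameter"; it is discarded after training.
centers = rng.standard_normal((N, class_num))
def subnetwork_b(features):          # features: (batch_size, N)
    return features @ centers        # -> (batch_size, class_num) class scores

images = rng.standard_normal((batch_size, 32))
features = subnetwork_a(images)
scores = subnetwork_b(features)
print(features.shape, scores.shape)  # (4, 8) (4, 5)
```

At inference time for re-identification, only `subnetwork_a` would be kept and `features` used directly for retrieval.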
the neural network model of the present application is trained as follows:
acquiring an input image;
inputting the input image into a preset neural network model to obtain a first feature vector;
slicing the first feature vector to obtain a plurality of second feature vectors;
slicing the category center parameters to obtain a plurality of category center parameter slices;
determining a total loss function based on the category-centric parameter slice, the second feature vector, the first feature vector, and the category-centric parameter;
updating the class center parameters and the parameters of the neural network model based on a total loss function.
The "category-centric parameter" in fig. 1 is the parameter (weight) matrix of sub-network B. A slice is a leading part of the original feature vector, and each category-center-parameter slice corresponds to a feature-vector slice. Assume the dimension (shape) of the original feature vector is batch_size × N; the original category center parameter then has dimension N × class_num (batch_size is the number of pictures in a training batch, class_num is the number of classification categories). In the forward pass, the feature vector is multiplied by the category center parameter, giving an output of dimension batch_size × class_num, from which the classification result is determined. When slicing, if the feature-vector slice has dimension batch_size × m (m ≤ N), then the category center parameter slice must have dimension m × class_num, so that the product keeps the same output dimension. Parameter adjustment uses back propagation, the method on which all neural network training is based; parameters are updated once per iteration, and the parameters to be adjusted (updated) are all trainable parameters (weights) in the network, including those of sub-network A and sub-network B.
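The shape bookkeeping above can be checked with a short NumPy sketch (all dimensions are illustrative assumptions):

```python
import numpy as np

batch_size, N, class_num = 4, 8, 5
rng = np.random.default_rng(1)

features = rng.standard_normal((batch_size, N))   # first feature vectors
centers = rng.standard_normal((N, class_num))     # category center parameter

# Full-length forward pass: (batch_size, N) @ (N, class_num) -> (batch_size, class_num)
full_scores = features @ centers

# Slice of length m: take the leading m elements of each feature vector and the
# leading m rows of the category center parameter, so the product shape is unchanged.
m = 3
feat_slice = features[:, :m]       # (batch_size, m): a second feature vector
center_slice = centers[:m, :]      # (m, class_num): a category-center-parameter slice
slice_scores = feat_slice @ center_slice

print(full_scores.shape, slice_scores.shape)  # both (4, 5)
```

Both products yield batch_size × class_num scores, which is what lets each slice length feed the same kind of classification loss.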
Determining a total loss function based on the category-centric parameter slice, the second feature vector, the first feature vector, and the category-centric parameter; the method comprises the following steps:
determining a plurality of first sub-loss functions based on the category-centric parameter slice and the second feature vector;
determining a second sub-loss function based on the first feature vector and the class center parameter;
determining the total loss function based on a plurality of the first and second sub-loss functions.
The structure of the total loss function used in the present technique is:
L = Σ_i μ_i · L_i (summed over the selected slice lengths i, up to i = N)
where L is the total loss function; the terms μ_i · L_i with i < N are the first sub-loss functions; μ_N · L_N is the second sub-loss function; μ_i is the coefficient of the sub-loss function for feature length i, with μ_N ≠ 0; N is the length of the feature vector output by the model; and L_i denotes the sub-loss function for feature length i.
In the sub-loss functions, BS denotes the number of images in one training batch; θ_(i,y_b) denotes the angle between the length-i feature vector of the target in the b-th picture and its corresponding actual class-center vector; θ_(i,j) denotes the angle between the length-i feature vector of the target in the b-th picture and the center vectors of the other classes; cn denotes the number of classification categories; and s, m, k are all hyper-parameters.
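The rendering of the per-length sub-loss L_i did not survive extraction; judging from the symbols defined above (BS, θ_(i,y_b), θ_(i,j), cn, s, m), it matches the form of an additive-angular-margin softmax loss. The following is a sketch of that standard form under this assumption — the role of the hyper-parameter k is not recoverable from the text:

```latex
L_i \approx -\frac{1}{BS}\sum_{b=1}^{BS}
  \log\frac{e^{\,s\cos\left(\theta_{i,y_b}+m\right)}}
           {e^{\,s\cos\left(\theta_{i,y_b}+m\right)}
            +\sum_{j=1,\; j\neq y_b}^{cn} e^{\,s\cos\theta_{i,j}}}
```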
FIG. 2 is a diagram illustrating the positions of the selected feature vector elements and their corresponding loss functions; in the figure, Vi denotes an element of the feature vector, where i denotes its position in the feature vector.
Preferably, the first feature vector is sliced to obtain a plurality of second feature vectors as follows:
starting from the foremost element of the first feature vector, sequentially taking preset numbers of leading elements as the second feature vectors.
With this target feature extraction technique, the retrieval precision of the full length-N feature vector is preserved, while the feature elements that contribute most are moved to the front of the vector, so that retrieval using only the first M elements of the feature vector also achieves good precision.
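The effect described above — ranking preserved when only the leading M elements are compared — can be illustrated with a toy NumPy example (the feature values are fabricated for illustration, not produced by the patent's model):

```python
import numpy as np

def cosine_sim(a, b):
    """Cosine similarity between two 1-D vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy gallery of N=8 features whose informative values sit in the leading
# elements, standing in for features trained with the sliced loss.
query    = np.array([0.90, 0.80, 0.10, 0.05, 0.01, 0.01, 0.00, 0.00])
match    = np.array([0.85, 0.82, 0.12, 0.04, 0.02, 0.00, 0.01, 0.00])
nonmatch = np.array([-0.70, 0.60, -0.20, 0.10, 0.00, 0.02, 0.01, 0.00])

M = 2  # retrieval may use only the first M elements to save time and storage
full = (cosine_sim(query, match), cosine_sim(query, nonmatch))
short = (cosine_sim(query[:M], match[:M]), cosine_sim(query[:M], nonmatch[:M]))

# The true match outranks the non-match at both feature lengths.
print(full[0] > full[1], short[0] > short[1])  # True True
```

In a real system the trade-off is chosen per service: full-length vectors where precision matters most, truncated vectors where retrieval time and storage dominate.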
This target feature extraction technique consolidates the multiple models of the conventional approach: while preserving retrieval precision, the functions of several models (each extracting feature vectors of a different length) are integrated into a single model, achieving an efficient balance of retrieval precision, retrieval time and storage space. The integration is also flexible for business use, since the extracted feature length can be switched freely for different services, greatly reducing development cost, system complexity and space cost.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is intended to include such modifications and variations.
Claims (8)
1. A feature extraction method for target re-identification is characterized by comprising the following steps:
acquiring a picture to be extracted;
and extracting the characteristics of the picture to be extracted based on a neural network model which is trained by adopting a slicing method in advance.
2. The method for extracting features of target re-identification according to claim 1, wherein the training of the neural network model comprises the following steps:
acquiring an input image;
inputting the input image into a preset neural network model to obtain a first feature vector;
slicing the first feature vector to obtain a plurality of second feature vectors;
slicing the category center parameters to obtain a plurality of category center parameter slices;
determining a total loss function based on the category-centric parameter slice, the second feature vector, the first feature vector, and the category-centric parameter;
updating the class center parameters and the parameters of the neural network model based on a total loss function.
3. The method for feature extraction for object re-identification according to claim 2, wherein the total loss function is determined based on the class center parameter slice, the second feature vector, the first feature vector and the class center parameter; the method comprises the following steps:
determining a plurality of first sub-loss functions based on the category-centric parameter slice and the second feature vector;
determining a second sub-loss function based on the first feature vector and the class center parameter;
determining the total loss function based on the plurality of first sub-loss functions and the second sub-loss function; the total loss function is calculated as:
L = Σ_i μ_i · L_i (summed over the selected slice lengths i, up to i = N)
where L is the total loss function; the terms μ_i · L_i with i < N are the first sub-loss functions; μ_N · L_N is the second sub-loss function; μ_i is the coefficient of the sub-loss function for feature length i, with μ_N ≠ 0; N is the length of the feature vector output by the model; and L_i denotes the sub-loss function for feature length i.
4. The feature extraction method of object re-recognition according to claim 3,
where BS denotes the number of images in one training batch; θ_(i,y_b) denotes the angle between the length-i feature vector of the target in the b-th picture and its corresponding actual class-center vector; θ_(i,j) denotes the angle between the length-i feature vector of the target in the b-th picture and the center vectors of the other classes; cn denotes the number of classification categories; and s, m, k are all hyper-parameters.
5. The feature extraction method for target re-identification according to claim 2, wherein the first feature vector is sliced to obtain a plurality of second feature vectors as follows:
starting from the foremost element of the first feature vector, sequentially taking preset numbers of leading elements as the second feature vectors.
6. The method for extracting features of object re-identification according to claim 2, wherein the class center parameter slices correspond to the second feature vectors one to one.
7. The method for extracting features of object re-identification according to claim 2, wherein the class center parameters and the parameters of the neural network model are updated by using a back propagation method.
8. A feature extraction system for object re-recognition, comprising:
the acquisition module is used for acquiring a picture to be extracted;
and the extraction module is used for extracting the characteristics of the picture to be extracted based on a neural network model which is trained by adopting a slicing method in advance.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110557889.3A CN113255695A (en) | 2021-05-21 | 2021-05-21 | Feature extraction method and system for target re-identification |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110557889.3A CN113255695A (en) | 2021-05-21 | 2021-05-21 | Feature extraction method and system for target re-identification |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113255695A (en) | 2021-08-13 |
Family
ID=77183631
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110557889.3A Pending CN113255695A (en) | 2021-05-21 | 2021-05-21 | Feature extraction method and system for target re-identification |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113255695A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116912535A (en) * | 2023-09-08 | 2023-10-20 | 中国海洋大学 | Unsupervised target re-identification method, device and medium based on similarity screening |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018068416A1 (en) * | 2016-10-14 | 2018-04-19 | 广州视源电子科技股份有限公司 | Neural network-based multilayer image feature extraction modeling method and device and image recognition method and device |
CN108875693A (en) * | 2018-07-03 | 2018-11-23 | 北京旷视科技有限公司 | A kind of image processing method, device, electronic equipment and its storage medium |
CN110414368A (en) * | 2019-07-04 | 2019-11-05 | 华中科技大学 | A kind of unsupervised pedestrian recognition methods again of knowledge based distillation |
CN111401113A (en) * | 2019-01-02 | 2020-07-10 | 南京大学 | Pedestrian re-identification method based on human body posture estimation |
CN111814584A (en) * | 2020-06-18 | 2020-10-23 | 北京交通大学 | Vehicle weight identification method under multi-view-angle environment based on multi-center measurement loss |
CN111814845A (en) * | 2020-03-26 | 2020-10-23 | 同济大学 | Pedestrian re-identification method based on multi-branch flow fusion model |
- 2021-05-21: application CN202110557889.3A filed, published as CN113255695A (status: Pending)
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116912535A (en) * | 2023-09-08 | 2023-10-20 | 中国海洋大学 | Unsupervised target re-identification method, device and medium based on similarity screening |
CN116912535B (en) * | 2023-09-08 | 2023-11-28 | 中国海洋大学 | Unsupervised target re-identification method, device and medium based on similarity screening |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106845529B (en) | Image feature identification method based on multi-view convolution neural network | |
CN110147700B (en) | Video classification method, device, storage medium and equipment | |
CN110188239B (en) | Double-current video classification method and device based on cross-mode attention mechanism | |
CN109815826B (en) | Method and device for generating face attribute model | |
CN109344855B (en) | Depth model face beauty evaluation method based on sequencing guided regression | |
US20200265597A1 (en) | Method for estimating high-quality depth maps based on depth prediction and enhancement subnetworks | |
US12001959B2 (en) | Neural network model training method and device, and time-lapse photography video generating method and device | |
CN110969250A (en) | Neural network training method and device | |
CN110060236B (en) | Stereoscopic image quality evaluation method based on depth convolution neural network | |
CN110263215B (en) | Video emotion positioning method and system | |
CN109740679B (en) | Target identification method based on convolutional neural network and naive Bayes | |
CN111414815B (en) | Pedestrian re-recognition network searching method and pedestrian re-recognition method | |
CN108197669B (en) | Feature training method and device of convolutional neural network | |
CN107871103B (en) | Face authentication method and device | |
CN111488815A (en) | Basketball game goal event prediction method based on graph convolution network and long-time and short-time memory network | |
CN109325435B (en) | Video action recognition and positioning method based on cascade neural network | |
CN111881716A (en) | Pedestrian re-identification method based on multi-view-angle generation countermeasure network | |
CN110852224B (en) | Expression recognition method and related device | |
CN108734145A (en) | A kind of face identification method based on degree adaptive face characterization model | |
CN113888501A (en) | Non-reference image quality evaluation method based on attention positioning network | |
CN110363218A (en) | A kind of embryo's noninvasively estimating method and device | |
CN110598097B (en) | Hair style recommendation system, method, equipment and storage medium based on CNN | |
CN115713546A (en) | Lightweight target tracking algorithm for mobile terminal equipment | |
CN113255695A (en) | Feature extraction method and system for target re-identification | |
CN114492755A (en) | Target detection model compression method based on knowledge distillation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||