CN114332787A - Passive domain unsupervised domain self-adaptive vehicle re-identification method - Google Patents
- Publication number
- CN114332787A CN114332787A CN202111680087.8A CN202111680087A CN114332787A CN 114332787 A CN114332787 A CN 114332787A CN 202111680087 A CN202111680087 A CN 202111680087A CN 114332787 A CN114332787 A CN 114332787A
- Authority
- CN
- China
- Prior art keywords
- domain
- model
- source
- target
- loss
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 33
- 238000013508 migration Methods 0.000 claims abstract description 14
- 230000005012 migration Effects 0.000 claims abstract description 14
- 238000013140 knowledge distillation Methods 0.000 claims description 18
- 238000012549 training Methods 0.000 claims description 18
- 230000003044 adaptive effect Effects 0.000 claims description 9
- 230000014759 maintenance of location Effects 0.000 claims description 7
- 239000011159 matrix material Substances 0.000 claims description 6
- 238000012423 maintenance Methods 0.000 claims description 3
- 238000013507 mapping Methods 0.000 claims description 3
- 238000004321 preservation Methods 0.000 claims description 3
- 239000002131 composite material Substances 0.000 claims 1
- 238000010586 diagram Methods 0.000 claims 1
- 230000006978 adaptation Effects 0.000 description 6
- 238000013461 design Methods 0.000 description 2
- 238000004821 distillation Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000011176 pooling Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000001010 compromised effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Images
Landscapes
- Image Analysis (AREA)
Abstract
The invention provides a passive-domain (source-free) unsupervised domain-adaptive vehicle re-identification method. Instead of using source domain data, it uses the source domain knowledge implicitly learned by the source domain model as guidance, thereby driving the style of the target domain data toward the style of the source domain data.
Description
Technical Field
The invention belongs to the technical field of machine learning and vehicle re-identification, and particularly relates to a passive domain unsupervised domain self-adaptive vehicle re-identification method.
Background
At present, most vehicle re-identification methods based on supervised learning achieve high precision, but they depend heavily on effective supervision labels and therefore do not extend well to the unsupervised setting. For this reason, a number of unsupervised vehicle re-identification methods have emerged. The main differences between the unsupervised re-identification task and the currently popular unsupervised pre-training (Unsupervised Pre-training) task are: (1) unsupervised pre-training starts from a randomly initialized network, whereas unsupervised re-identification starts from a pre-trained network; (2) an unsupervised pre-trained network can be applied to downstream tasks only after fine-tuning, whereas unsupervised re-identification can itself be regarded as an unsupervised downstream task, and the trained network can be deployed directly. Unsupervised vehicle re-identification is further divided into two directions: unsupervised domain adaptation and fully unsupervised learning.
The unsupervised domain-adaptive task and the fully unsupervised task are similar in nature. The difference is that the unsupervised domain-adaptive method additionally requires a labeled source domain dataset: the model is pre-trained with the source domain data; after pre-training, the source domain data is no longer used, and only unlabeled target domain data is used for re-training to obtain the final model. A fully unsupervised method, in contrast, trains only with unlabeled target domain data.
Among unsupervised vehicle re-identification tasks, the unsupervised domain adaptation task currently receives the most attention, while the fully unsupervised task, which does not rely on source domain data, is less studied.
Vehicle re-identification algorithms based on unsupervised domain adaptation usually require a source domain model, which is then fine-tuned in a series of steps using labeled source domain data and unlabeled target domain data, so that the model performs well on the target domain. However, as researchers pay increasing attention to data privacy and security, source domain data is becoming difficult to acquire, whereas the source domain model itself remains relatively easy to obtain.
Conventional unsupervised domain-adaptive methods based on image generation generally rely on the source domain data itself as content and style guidance in order to drive the style of target domain data toward the style of source domain data.
Disclosure of Invention
Although most vehicle re-identification methods based on supervised learning have achieved high precision in recent years, they depend heavily on effective supervision labels; deploying a vehicle re-identification model on an unlabeled dataset while retaining good performance is therefore a major difficulty. In addition, because of data security and privacy constraints, source domain data is also difficult to obtain in practical applications.
To fill this gap and remedy the defects of the prior art, the invention provides a passive-domain unsupervised domain-adaptive vehicle re-identification method. A generator is trained with a relationship-preservation consistency loss and a knowledge distillation loss, using only the source domain model and the target domain data, so as to generate pseudo target samples with the source domain style; the model is then fine-tuned with these pseudo target samples to improve its performance. The method no longer uses source domain data; instead, it uses the source domain knowledge implicitly learned by the source domain model as guidance, thereby driving the style of the target domain data toward the style of the source domain data.
Compared with other unsupervised domain-adaptive methods based on domain migration, the greatest advantage of this method is that it does not need to access source domain data.
The invention specifically adopts the following technical scheme:
A passive-domain unsupervised domain-adaptive vehicle re-identification method, characterized in that: in the vehicle re-identification process, a generator is trained with a relationship-preservation consistency loss and a knowledge distillation loss, using the source domain model and the target domain data, so as to generate pseudo target samples with the source domain style; the model is then fine-tuned with the pseudo target samples to improve its performance.
Further, source domain data is not used; instead, the source domain knowledge implicitly learned by the source domain model is used as guidance, thereby driving the style of the target domain data toward the style of the source domain data.
Further, synthetic images are applied: pseudo target samples are generated from the synthetic images through SPGAN for pre-training, and the source domain data and target domain data are then received together for joint re-training, thereby completing the unsupervised domain-adaptive task.
Further, two network models are designed from the synthetic images to learn the direction similarity and background similarity of vehicles; the direction similarity and camera similarity are then subtracted from the vehicle ID similarity, so as to reduce the interference of similar directions and similar backgrounds on the model.
Further, the source domain model is obtained from a public pre-trained model or by pre-training on a public dataset with cross-entropy loss and triplet loss; the target domain model is obtained by loading the source domain model parameters and fine-tuning with target domain data.
Further, it is assumed that only the feature extractor in the source model needs to be adapted to the target domain. Given a source model f_S(·) and a target model f_T(·), a generator g(·) is trained for the passive knowledge migration module. The target image is denoted x and the generated image g(x), for which the source model outputs the feature map f_S(g(x)) and the probability distribution p(f_S(g(x))). To describe the adapted knowledge in the generated image, in addition to the knowledge distillation loss, a new relationship preservation loss is introduced, which maintains the relative channel relationship between the target-model feature map f_T(x) of the target image and the source-model feature map f_S(g(x)) of the generated image.

The knowledge distillation loss is: in the passive knowledge migration module, the combination f_S(g(·)) of the source model and the generator is used to describe the knowledge adapted in the target model f_T(·), which is regarded as a special case of knowledge distillation; the knowledge differences between the source domain data and the target domain data are distilled into the generator. The knowledge distillation loss is constructed from the output p(f_S(g(x))) of feeding the generated image to the source model and the output p(f_T(x)) of feeding the target image to the target model:

L_kd = KL( p(f_T(x)) || p(f_S(g(x))) ).

The relationship preservation loss is: after successful knowledge distillation, the global features extracted from the target image by the target model and the global features extracted from the generated source-style image by the source model should be similar, so a relationship preservation loss is used as a constraint.

Given the source domain feature map f_S(g(x)) and the target domain feature map f_T(x), they are first reshaped into feature matrices F_S, F_T ∈ R^(D×HW):

F_S = reshape(f_S(g(x))), F_T = reshape(f_T(x)),

where D, H, and W are the depth (number of channels), height, and width of the feature map, respectively. Then their channel-level autocorrelations, i.e., Gram matrices, are computed:

G_S = F_S F_S^T, G_T = F_T F_T^T,

where G_S, G_T ∈ R^(D×D), and row-wise L2 normalization is applied:

Ĝ[i,:] = G[i,:] / ||G[i,:]||_2,

where [i,:] denotes the i-th row of the matrix. Finally, the relationship preservation loss is defined as the mean square error (MSE) between the normalized Gram matrices:

L_rp = MSE(Ĝ_S, Ĝ_T).

The total loss is:

L_total = L_kd + λ·L_rp,

where λ is a balancing weight.

Under these two loss constraints, a source-style image can be generated from the target image and used to further fine-tune the model, improving its recognition capability.
The present invention and its preferred aspects provide a new unsupervised domain adaptive vehicle re-identification framework for passive knowledge migration. Compared with other unsupervised domain adaptation methods based on domain migration, the method has the greatest advantage that source domain data does not need to be accessed. Only a model trained by source domain data needs to be obtained, and then the knowledge hidden in the source domain model can be specifically migrated to the target domain data by using the passive knowledge migration module.
Compared with the prior art, the invention and the preferred scheme thereof have the following technical advantages:
1. Existing methods usually pre-train a model directly with source domain data; after pre-training, the source domain data is no longer used and only the target domain is used for training, so the real labels in the source domain data are not fully exploited.
The invention and its preferred scheme apply synthetic images to the field of vehicle re-identification: pseudo target samples are generated from the synthetic images through SPGAN for pre-training, and the source domain data and target domain data can also be received for joint re-training, thereby completing the unsupervised domain-adaptive task.
2. In existing methods, because the target images carry no label information, the model cannot accurately distinguish interference caused by changes in vehicle direction or viewing angle.
The invention and its preferred scheme design two network models from the synthetic images to learn the direction similarity and background similarity of vehicles; the direction similarity and camera similarity are then subtracted from the vehicle ID similarity, so as to reduce the interference of similar directions and backgrounds on the model.
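The similarity subtraction described above can be sketched as follows. This is a hedged illustration only: the function name and the weights w_dir and w_cam are assumptions for exposition, since the patent does not fix specific values.

```python
def adjusted_similarity(id_sim, direction_sim, camera_sim,
                        w_dir=0.1, w_cam=0.1):
    """Suppress matches that look similar mainly because of a shared
    orientation or background: subtract weighted direction and camera
    similarities from the vehicle-ID similarity. The weights w_dir and
    w_cam are illustrative hyperparameters, not values from the patent."""
    return id_sim - w_dir * direction_sim - w_cam * camera_sim
```

A pair with high ID similarity but also high direction/camera similarity is thus ranked below a pair whose similarity cannot be explained by viewpoint or background alone.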
Drawings
FIG. 1 is a schematic flow chart of an overall method according to an embodiment of the present invention.
Detailed Description
In order to make the features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in detail as follows:
as shown in fig. 1, the method for re-identifying a vehicle in an unsupervised domain of a passive domain specifically includes the following design details:
(1) Obtaining the source domain model and the target domain model: the source domain model can be obtained by downloading a public pre-trained model or by pre-training on a public dataset with cross-entropy loss and triplet loss, while the target domain model can be obtained by loading the source domain model parameters and fine-tuning with target domain data.
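As a hedged illustration of the triplet-loss pre-training objective mentioned above, the following minimal Python sketch computes the standard triplet loss on embedding vectors; the margin value and function names are illustrative, not taken from the patent.

```python
import math

def triplet_loss(anchor, positive, negative, margin=0.3):
    """Standard triplet loss with Euclidean distance: the anchor-positive
    distance should be smaller than the anchor-negative distance by at
    least `margin`; otherwise the excess is penalized."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    return max(dist(anchor, positive) - dist(anchor, negative) + margin, 0.0)
```

When the negative is already far enough from the anchor, the loss is zero and the triplet contributes no gradient.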
(2) Source domain style image generation: following existing unsupervised domain-adaptive practice, this embodiment assumes that only the feature extractor in the source model needs to be adapted to the target domain. Given a source model f_S(·) and a target model f_T(·), this embodiment trains a generator g(·) for the passive knowledge migration module. Since the training process is passive (source-free), only target images are involved; for simplicity, a target image is denoted as x in the following.
Given the generated image g(x), the source model outputs the feature map f_S(g(x)) and the probability distribution p(f_S(g(x))). To describe the adapted knowledge in the generated image, in addition to the conventional knowledge distillation loss, this embodiment introduces a new relationship preservation loss, which maintains the relative channel relationship between the target-model feature map f_T(x) of the target image and the source-model feature map f_S(g(x)) of the generated image.
Knowledge distillation loss: in the passive knowledge migration module proposed by this embodiment, the combination f_S(g(·)) of the source model and the generator is used to describe the knowledge adapted in the target model f_T(·), which can be regarded as a special case of knowledge distillation. The goal of this embodiment is to distill the knowledge differences between the source domain data and the target domain data into the generator. To this end, the knowledge distillation loss is built from the output p(f_S(g(x))) of feeding the generated image to the source model and the output p(f_T(x)) of feeding the target image to the target model:

L_kd = KL( p(f_T(x)) || p(f_S(g(x))) ).
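A minimal, hedged sketch of such a distillation loss in Python follows; the use of KL divergence between the two softmax distributions, its direction, and the temperature parameter are illustrative assumptions rather than details fixed by the patent.

```python
import math

def softmax(logits, temperature=1.0):
    # Convert raw logits into a probability distribution.
    exps = [math.exp(z / temperature) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kd_loss(target_logits, source_logits, temperature=1.0):
    """KL( p(f_T(x)) || p(f_S(g(x))) ): penalizes the source model's
    output on the generated image for diverging from the target model's
    output on the original target image."""
    p = softmax(target_logits, temperature)   # p(f_T(x))
    q = softmax(source_logits, temperature)   # p(f_S(g(x)))
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))
```

The loss is zero exactly when the two distributions coincide, which is the condition the generator is being pushed toward.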
Relationship preservation loss: after successful knowledge distillation, the global features extracted from the target image by the target model and the global features extracted from the generated source-style image by the source model should be similar, so this embodiment uses a relationship preservation loss as a constraint. Encouraging similar channel-level relationships between the feature maps f_T(x) and f_S(g(x)) helps achieve this goal.
Previous knowledge distillation work usually imposes constraints by maintaining batch-level or pixel-level relationships. However, such constraints are not applicable to the current task. First, batch-level relationships cannot provide good supervision for generating each individual image, which harms generation quality. The validity of pixel-level relationships is likewise compromised after global pooling. In contrast, channel-level relationships are computed on a per-image basis and are unaffected by global pooling; they are therefore better suited to computing the relationship preservation loss.
Given the source domain feature map f_S(g(x)) and the target domain feature map f_T(x), this embodiment first reshapes them into feature matrices F_S, F_T ∈ R^(D×HW):

F_S = reshape(f_S(g(x))), F_T = reshape(f_T(x)),

where D, H, and W are the depth (number of channels), height, and width of the feature map, respectively. Next, this embodiment computes their channel-level autocorrelations, i.e., Gram matrices:

G_S = F_S F_S^T, G_T = F_T F_T^T,

where G_S, G_T ∈ R^(D×D). As in other similarity-based knowledge distillation losses, this embodiment applies row-wise L2 normalization:

Ĝ[i,:] = G[i,:] / ||G[i,:]||_2,

where [i,:] denotes the i-th row of the matrix. Finally, this embodiment defines the relationship preservation loss as the mean square error (MSE) between the normalized Gram matrices:

L_rp = MSE(Ĝ_S, Ĝ_T).
the total loss was:
through the two loss constraints, the embodiment can generate the source style image from the target image, so that the source style image is used for further fine adjustment of the model, and the model identification capability is improved.
The present invention is not limited to the above preferred embodiments, and various other passive domain unsupervised domain adaptive vehicle re-identification methods can be derived by anyone based on the teaching of the present invention.
Claims (6)
1. A passive-domain unsupervised domain-adaptive vehicle re-identification method, characterized in that: in the vehicle re-identification process, a generator is trained with a relationship-preservation consistency loss and a knowledge distillation loss, using the source domain model and the target domain data, so as to generate pseudo target samples with the source domain style; the model is then fine-tuned with the pseudo target samples to improve its performance.
2. The passive domain unsupervised domain adaptive vehicle re-identification method of claim 1, wherein: instead of using source domain data, the source domain knowledge implicitly learned by the source domain model is used as guidance to drive the style of the target domain data toward the style of the source domain data.
3. The passive domain unsupervised domain adaptive vehicle re-identification method of claim 2, wherein: synthetic images are applied, pseudo target samples are generated from the synthetic images through SPGAN for pre-training, and the source domain data and target domain data are received for joint re-training, so as to complete the unsupervised domain-adaptive task.
4. The passive domain unsupervised domain adaptive vehicle re-identification method of claim 3, wherein: two network models are designed from the synthetic images to learn the direction similarity and background similarity of vehicles, and the direction similarity and camera similarity are then subtracted from the vehicle ID similarity, so as to reduce the interference of similar directions and similar backgrounds on the model.
5. The passive domain unsupervised domain adaptive vehicle re-identification method of claim 4, wherein: the source domain model is obtained through an open pre-training model or through pre-training with cross entropy loss and triplet loss by utilizing an open data set; and the target domain model is obtained by loading the obtained source domain model parameters and utilizing target domain data to carry out fine adjustment.
6. The passive domain unsupervised domain adaptive vehicle re-identification method of claim 5, wherein:
assuming that only the feature extractor in the source model is adapted to the target domain; given a source model fS(. o) and an object model fT(. for passive knowledge migrationMoving the module to train a generator; setting the target image to x, given the generated imageFeature mapping of all source domain data output by source modelAnd probability distributionIn order to describe the adapted knowledge in the generated image, besides the loss of knowledge distillation, a new relationship maintenance loss is introduced, which maintains the target image target model characteristic diagram fT(x) And the generated image source model feature mapRelative channel relationship between them;
the knowledge distillation loss is: in the passive knowledge migration module, a combination of source model and generator will be utilizedS(g (-) describing the target model fTThe knowledge adapted in (h) is regarded as a special case of knowledge distillation; extracting knowledge differences between the source domain data and the target domain data into a generator; using output from a model of a generating image feed sourceAnd the target image feeds the output p (f) of the target modelT(x) Constitutes a loss of knowledge distillation:
the relationship retention penalty is: after successful knowledge distillation, the global features of the target image from the target model and the global features of the generated source domain style image from the source model should be similar, so a relationship preservation loss is used for constraint;
given source domain feature mapsAnd target domain feature map fT(x) It is first reshaped into a feature vector FSAnd FT,
Wherein D, H, W are feature map depth (number of channels), height, and width, respectively; then, their channel level autocorrelation, or gram matrix,
wherein G iss,GT∈RD×DAnd applying line L2 to normalize:
wherein [ i ]: represents the ith row in the matrix; finally, the relationship of the retention loss is defined as the mean square error MSE between the normalized glans matrices:
the total loss was:
through the two loss constraints, the source style image can be generated from the target image, so that the method is used for further fine adjustment of the model and the identification capability of the model is improved.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111680087.8A CN114332787B (en) | 2021-12-30 | 2021-12-30 | Passive domain unsupervised domain self-adaptive vehicle re-identification method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111680087.8A CN114332787B (en) | 2021-12-30 | 2021-12-30 | Passive domain unsupervised domain self-adaptive vehicle re-identification method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114332787A true CN114332787A (en) | 2022-04-12 |
CN114332787B CN114332787B (en) | 2024-09-06 |
Family
ID=81022497
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111680087.8A Active CN114332787B (en) | 2021-12-30 | 2021-12-30 | Passive domain unsupervised domain self-adaptive vehicle re-identification method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114332787B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116543237A (en) * | 2023-06-27 | 2023-08-04 | 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) | Image classification method, system, equipment and medium for non-supervision domain adaptation of passive domain |
CN116612445A (en) * | 2023-07-20 | 2023-08-18 | 华侨大学 | Unsupervised vehicle re-identification method based on self-adaptive clustering and difficult sample weighting |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110414368A (en) * | 2019-07-04 | 2019-11-05 | 华中科技大学 | A kind of unsupervised pedestrian recognition methods again of knowledge based distillation |
CN112801019A (en) * | 2021-02-09 | 2021-05-14 | 福州大学 | Method and system for eliminating re-identification deviation of unsupervised vehicle based on synthetic data |
US20210390686A1 (en) * | 2020-06-15 | 2021-12-16 | Dalian University Of Technology | Unsupervised content-preserved domain adaptation method for multiple ct lung texture recognition |
US20210390355A1 (en) * | 2020-06-13 | 2021-12-16 | Zhejiang University | Image classification method based on reliable weighted optimal transport (rwot) |
-
2021
- 2021-12-30 CN CN202111680087.8A patent/CN114332787B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110414368A (en) * | 2019-07-04 | 2019-11-05 | 华中科技大学 | A kind of unsupervised pedestrian recognition methods again of knowledge based distillation |
US20210390355A1 (en) * | 2020-06-13 | 2021-12-16 | Zhejiang University | Image classification method based on reliable weighted optimal transport (rwot) |
US20210390686A1 (en) * | 2020-06-15 | 2021-12-16 | Dalian University Of Technology | Unsupervised content-preserved domain adaptation method for multiple ct lung texture recognition |
CN112801019A (en) * | 2021-02-09 | 2021-05-14 | 福州大学 | Method and system for eliminating re-identification deviation of unsupervised vehicle based on synthetic data |
Non-Patent Citations (1)
Title |
---|
CHEN Zhihong: "Research on Multi-Target Tracking Optimization in Video Images", Computer Simulation, 30 September 2020 (2020-09-30) *
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116543237A (en) * | 2023-06-27 | 2023-08-04 | 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) | Image classification method, system, equipment and medium for non-supervision domain adaptation of passive domain |
CN116543237B (en) * | 2023-06-27 | 2023-11-28 | 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) | Image classification method, system, equipment and medium for non-supervision domain adaptation of passive domain |
CN116612445A (en) * | 2023-07-20 | 2023-08-18 | 华侨大学 | Unsupervised vehicle re-identification method based on self-adaptive clustering and difficult sample weighting |
CN116612445B (en) * | 2023-07-20 | 2023-10-31 | 华侨大学 | Unsupervised vehicle re-identification method based on self-adaptive clustering and difficult sample weighting |
Also Published As
Publication number | Publication date |
---|---|
CN114332787B (en) | 2024-09-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114332787A (en) | Passive domain unsupervised domain self-adaptive vehicle re-identification method | |
CN111783521B (en) | Pedestrian re-identification method based on low-rank prior guidance and based on domain invariant information separation | |
CN107392131A (en) | A kind of action identification method based on skeleton nodal distance | |
CN108268871A (en) | A kind of licence plate recognition method end to end and system based on convolutional neural networks | |
CN112668648A (en) | Infrared and visible light fusion identification method based on symmetric fusion network | |
CN111242061B (en) | Synthetic aperture radar ship target detection method based on attention mechanism | |
CN106338733A (en) | Forward-looking sonar object tracking method based on frog-eye visual characteristic | |
CN107092931B (en) | Method for identifying dairy cow individuals | |
CN110659586A (en) | Cross-view gait recognition method based on identity maintenance cyclic generation type countermeasure network | |
CN110097033A (en) | A kind of single sample face recognition method expanded based on feature | |
CN112380374B (en) | Zero sample image classification method based on semantic expansion | |
CN109359549A (en) | A kind of pedestrian detection method based on mixed Gaussian and HOG_LBP | |
CN111291669A (en) | Two-channel depression angle human face fusion correction GAN network and human face fusion correction method | |
CN112329832A (en) | Passive positioning target track data enhancement method and system based on deep convolution generation countermeasure network | |
CN115205903B (en) | Pedestrian re-recognition method based on identity migration generation countermeasure network | |
CN112017263B (en) | Intelligent test paper generation method and system based on deep learning | |
Kwan et al. | Compressive vehicle tracking using deep learning | |
CN114972904B (en) | Zero sample knowledge distillation method and system based on fighting against triplet loss | |
CN101286236B (en) | Infrared object tracking method based on multi- characteristic image and average drifting | |
CN114091510A (en) | Cross-domain vehicle weight identification method based on domain self-adaptation | |
Sun et al. | Multi-AUV target recognition method based on GAN-meta learning | |
CN113591545A (en) | Deep learning-based multistage feature extraction network pedestrian re-identification method | |
CN112651329A (en) | Low-resolution ship classification method for generating countermeasure network through double-flow feature learning | |
CN116665097A (en) | Self-adaptive target tracking method combining context awareness | |
CN111507276A (en) | Construction site safety helmet detection method based on hidden layer enhancement features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |