CN111428650B - Pedestrian re-recognition method based on SP-PGGAN style migration - Google Patents


Info

Publication number
CN111428650B
CN111428650B (application CN202010226128.5A)
Authority
CN
China
Prior art keywords
pedestrian
picture
pggan
generator
discriminator
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010226128.5A
Other languages
Chinese (zh)
Other versions
CN111428650A (en)
Inventor
孙艳丰 (Sun Yanfeng)
胡芸萍 (Hu Yunping)
Current Assignee
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN202010226128.5A priority Critical patent/CN111428650B/en
Publication of CN111428650A publication Critical patent/CN111428650A/en
Application granted granted Critical
Publication of CN111428650B publication Critical patent/CN111428650B/en


Classifications

    • G: PHYSICS
        • G06: COMPUTING; CALCULATING OR COUNTING
            • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
                • G06V40/00: Recognition of biometric, human-related or animal-related patterns in image or video data
                    • G06V40/10: Human or animal bodies, e.g. vehicle occupants or pedestrians; body parts, e.g. hands
                        • G06V40/103: Static body considered as a whole, e.g. static pedestrian or occupant recognition
            • G06F: ELECTRIC DIGITAL DATA PROCESSING
                • G06F18/00: Pattern recognition
                    • G06F18/20: Analysing
                        • G06F18/21: Design or setup of recognition systems or techniques; extraction of features in feature space; blind source separation
                            • G06F18/214: Generating training patterns; bootstrap methods, e.g. bagging or boosting
                        • G06F18/24: Classification techniques
            • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
                • G06N3/00: Computing arrangements based on biological models
                    • G06N3/02: Neural networks
                        • G06N3/04: Architecture, e.g. interconnection topology
                            • G06N3/045: Combinations of networks

Abstract

The invention provides a pedestrian re-identification method based on SP-PGGAN style migration. The method comprises the following steps: constructing an SP-PGGAN model based on CycleGAN; inputting the training set of a labeled pedestrian re-identification dataset and the training set of an unlabeled pedestrian re-identification dataset simultaneously into the SP-PGGAN model for training, so that the generator G produces the migrated training set of the labeled dataset; training a classification network on the migrated labeled training set with the pedestrian re-identification model IDE to obtain a trained IDE model; and inputting the test set of the unlabeled pedestrian re-identification dataset into the trained IDE model to realize pedestrian re-identification on the unlabeled dataset. Because the SP-PGGAN migration model designed by the invention is more accurate in the style-migration process, it can improve the pedestrian re-identification effect on unlabeled datasets to a great extent.

Description

Pedestrian re-recognition method based on SP-PGGAN style migration
Technical Field
The invention belongs to the field of computer vision, and particularly relates to technologies such as deep learning, generative adversarial networks, image processing and feature extraction. The pedestrian re-recognition method based on SP-PGGAN style migration can realize style migration from a labeled dataset to an unlabeled dataset, and a pedestrian re-recognition network trained on the migrated labeled dataset can improve the effect of pedestrian re-recognition on the unlabeled dataset.
Background
Pedestrian re-recognition (Person Re-identification) is a technique that uses computer vision to judge whether a specific pedestrian is present in an image or video sequence; it is widely recognized as a sub-problem of image retrieval. Given a monitored pedestrian image, the same pedestrian is retrieved across camera devices. It compensates for the visual limitations of fixed cameras, can be combined with pedestrian detection and pedestrian tracking, and can be widely applied to intelligent video surveillance, intelligent security and related fields. In recent years, pedestrian re-recognition has gained increasing attention in the field of computer vision. It has broad application prospects, including pedestrian retrieval, pedestrian tracking, street event detection, and pedestrian action and behavior analysis. One of its challenges is the requirement for a large amount of labeled training data. For this challenge the invention proposes an effective migration model, so that a model trained on labeled data can be better applied to an unlabeled dataset, improving the pedestrian re-recognition accuracy on unlabeled data.
Most existing approaches to pedestrian re-identification on unlabeled data train on a labeled dataset and then directly use the trained model to test on the unlabeled dataset. Because the underlying distributions of the two datasets differ, such direct transfer yields low re-identification accuracy at test time. In recent years, with the development of adversarial networks and the progress of transfer techniques, style migration using unpaired datasets has become possible, and even datasets with different underlying distributions can undergo good style conversion. Against this background, how to perform more accurate migration on the basis of CycleGAN is one of the hot spots of image recognition research and has broad application prospects.
Disclosure of Invention
Aiming at the challenge of difficult data annotation in pedestrian re-recognition, the invention provides a new migration model, SP-PGGAN, using deep learning, and realizes pedestrian re-recognition on the basis of this migration model. Most existing approaches for unlabeled data train on a labeled dataset and directly test the trained model on the unlabeled dataset; because the underlying distributions of the two datasets differ, such direct transfer yields low accuracy. The invention first builds the SP-PGGAN model, then inputs the training set of the labeled pedestrian re-recognition dataset and the training set of the unlabeled pedestrian re-recognition dataset simultaneously into the built SP-PGGAN model for style migration, obtains the migrated training set of the labeled dataset for IDE training, and finally realizes pedestrian re-recognition on the test set of the unlabeled dataset. The main flow of the invention is shown in figure 1 and can be divided into three steps: constructing the SP-PGGAN model, performing style migration with the SP-PGGAN model, and realizing pedestrian re-identification.
(1) Construction of SP-PGGAN model
The invention first builds an SP-PGGAN model, whose structure is improved on the basis of CycleGAN. CycleGAN is essentially two mirror-symmetric GANs forming a ring network: the two GANs share two generators, and each has its own local discriminator, i.e. two local discriminators and two generators in total. SP-PGGAN is based on CycleGAN: the generators of CycleGAN are retained, a twin network is added after the generators to guide the generation process, and the two discriminators are replaced by parallel local and global discriminators. The inventive points of the proposed SP-PGGAN model are shown intuitively in fig. 2.
(2) Style migration of SP-PGGAN model
In order to test better on the test set of the unlabeled dataset, the training set of the labeled pedestrian re-identification dataset and the training set of the unlabeled pedestrian re-identification dataset are input simultaneously into the built SP-PGGAN model for style migration. Two properties are maintained during migration: first, when an image from the labeled dataset is migrated toward the unlabeled dataset, the migrated image must be consistent in style with the unlabeled dataset; second, the ID information of the pedestrian in the labeled image must remain unchanged before and after migration. This ID information is not the background or the style of the image, but the pedestrian region of the image, excluding the background, that carries the labeling information of the pedestrian.
(3) Implementation of pedestrian re-recognition
After style migration, the invention further carries out pedestrian re-identification experiments. The training set of the labeled dataset obtained after SP-PGGAN style migration is used to train a classification network with the pedestrian re-identification IDE model, giving a trained IDE model, and pedestrian re-identification is then realized on the test set of the unlabeled dataset.
Drawings
In order to more clearly illustrate the technical solution of the embodiments of the present invention, the drawings required in the description of the embodiments will be briefly described as follows:
FIG. 1 is a flow chart of an SP-PGGAN style migration method for pedestrian re-identification;
FIG. 2 is a diagram showing the comparison of SP-PGGAN model and cycleGAN model;
FIG. 3 is a schematic diagram of a cycleGAN model network;
FIG. 4 is a network architecture diagram based on SP-PGGAN style migration;
FIG. 5 is a network architecture diagram of a pedestrian re-identification method (IDE);
Advantageous Effects
The invention adds a twin network on the basis of CycleGAN; the twin network makes two pictures with similar pedestrian information more similar and two pictures with dissimilar pedestrian information more dissimilar. Meanwhile, in order to account for both local and global information during pedestrian migration and improve its effect, the invention uses a global discriminator and a local discriminator together as the SP-PGGAN discriminator. The improved migration network allows the pictures generated during migration to be better used for training the classification network; experiments show that the invention improves mAP (mean average precision) by 12% over direct transfer.
Detailed Description
In light of the foregoing, the following is a specific implementation, but the scope of protection of this patent is not limited to this implementation.
Step 1: construction of SP-PGGAN model
The invention first builds the SP-PGGAN migration model. The model can be used for the migration process of pedestrian re-identification but is not limited to it; pedestrian re-identification is one application scenario of the invention, and other similar scenarios involving style migration are equally applicable.
The SP-PGGAN model is an improvement over CycleGAN. CycleGAN is bidirectional: it is a ring network formed by two mirror-symmetric GANs, where the two GANs share two generators and each has its own local discriminator, i.e. two local discriminators and two generators in total. As shown in FIG. 3, G and F are the two generators of CycleGAN, and D_x and D_y are its two local discriminators. In this model, the goal of each generator is to make the pictures generated during training more and more realistic so as to deceive the discriminator, while the goal of each discriminator is to become better and better during training at identifying whether a picture is real or fake.
As shown in FIG. 4, the SP-PGGAN model provided by the invention consists of two major parts: a generator and a discriminator. The generator of SP-PGGAN consists of two parts. One part is the generator G and generator F of CycleGAN; from shallow to deep, each generator comprises two convolution layers with stride 2, six residual blocks and two deconvolution layers with stride 1/2. The other part is a twin network, which from shallow to deep comprises four convolution layers with stride 2, four max-pooling layers with stride 2 and one fully connected layer. The SP-PGGAN discriminator comprises a discriminator D_T and a discriminator D_S; discriminator D_T comprises a local discriminator and a global discriminator, and so does discriminator D_S. D_T and D_S have the same structure but do not share parameters. The global and local discriminators of D_T share four convolution layers with stride 2 and split into two branches after the fourth layer: one branch is the global discriminator of D_T, which finally outputs a single binary value deciding whether the whole picture is real or fake; the other branch is the local discriminator of D_T, which outputs a 256-dimensional vector through a fully connected layer, each entry deciding whether the image block at the corresponding position is real or fake. The specific network structure of the SP-PGGAN model is shown in Table 1.
TABLE 1
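As a rough illustration of the generator geometry described above (not the patent's actual TensorFlow code), the following plain-Python sketch traces how the spatial size of a feature map evolves through the two stride-2 convolutions, six residual blocks and two stride-1/2 deconvolutions; the 256-pixel input size is an assumed example.

```python
def conv_out(size, stride):
    """Spatial size after a stride-s convolution (assumes 'same' padding)."""
    return (size + stride - 1) // stride  # ceiling division

def generator_spatial_trace(h):
    """Trace the spatial height through the generator described above:
    two stride-2 convolutions (halving), six residual blocks
    (size-preserving), and two stride-1/2 deconvolutions (doubling)."""
    trace = [h]
    for _ in range(2):      # two stride-2 convolution layers
        h = conv_out(h, 2)
        trace.append(h)
    for _ in range(6):      # six residual blocks: size unchanged
        trace.append(h)
    for _ in range(2):      # two stride-1/2 deconvolution layers
        h = h * 2
        trace.append(h)
    return trace
```

With a 256-pixel input, the size is halved twice to 64 and then restored to 256, so the generated picture matches the input resolution.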
Step 2: style migration of SP-PGGAN model
After the model is built, in order to test better on the test set of the unlabeled dataset, the training set of the labeled pedestrian re-identification dataset and the training set of the unlabeled pedestrian re-identification dataset are input simultaneously into the built SP-PGGAN model for style migration. As shown in fig. 4, in the forward direction a picture x from the labeled dataset generates a picture G(x) through the generator G, and G(x) then generates a picture F(G(x)) through the other generator F. One part of the generator loss is the difference loss between x and F(G(x)); another part comes from the twin network: the distance between picture x and picture G(x) is shortened after passing through the twin network, while the distance between picture G(x) and a picture y from the unlabeled dataset is lengthened; this part of the loss is the distance loss of the two pairs of samples. The discriminator loss is the cross-entropy loss of the generated picture G(x) and picture y under the global discriminator and the local discriminator of D_T. Likewise, in the opposite direction, the picture y from the unlabeled dataset generates a picture F(y) through the generator F, and F(y) then generates a picture G(F(y)) through the other generator G. One part of the generator loss is the difference loss between y and G(F(y)); the other part comes from the twin network: the distance between picture y and picture F(y) is shortened after passing through the twin network, while the distance between picture F(y) and picture x from the labeled dataset is lengthened; this part of the loss is the distance loss of the two pairs of samples. The discriminator loss is the cross-entropy loss of the generated picture F(y) and picture x under the global discriminator and the local discriminator of D_S.
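The forward-direction data flow just described can be sketched as follows. This is a hedged, framework-free illustration: G, F, embed and dist are stand-in callables, not the patent's networks.

```python
def forward_step(x, y, G, F, embed, dist):
    """Sketch of the forward-direction quantities described above.

    G, F  : generator callables (labeled -> unlabeled style, and back)
    embed : the twin network's embedding function
    dist  : distance between two embeddings
    Returns the migrated picture G(x), the cycled picture F(G(x)),
    the positive-pair distance (x vs G(x), to be shortened) and the
    negative-pair distance (G(x) vs y, to be lengthened).
    """
    gx = G(x)                              # x migrated toward the unlabeled style
    fgx = F(gx)                            # cycled back; compared with x for the cycle loss
    d_pos = dist(embed(x), embed(gx))      # same pedestrian: twin network pulls together
    d_neg = dist(embed(gx), embed(y))      # different pedestrians: twin network pushes apart
    return gx, fgx, d_pos, d_neg
```

A toy run with numeric stand-ins (G adds 1, F subtracts 1) shows the four quantities the losses are built from; the reverse direction is symmetric with the roles of G and F swapped.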
In the training process, the discriminator parameters are first held fixed while the parameters of the generation process are trained; then the generator parameters are fixed and the discriminator parameters are trained. This process is repeated so that the generator and the discriminator each gradually evolve. The overall loss of this process is shown in equation (1):
L = L_Tadv + L_Sadv + L_PTadv + L_PSadv + γ_1·L_cyc + γ_2·L_ide + γ_3·L_con    (1)
L_Tadv and L_Sadv are the losses of the two mirror-symmetric local discriminators D_T and D_S in the SP-PGGAN model; both are cross-entropy losses, as shown in equations (2) and (3). Here x and y denote pictures in the labeled and unlabeled training sets respectively, and P_x, P_y denote the distributions that x and y obey. G and D_T denote the generator and local discriminator of the forward direction; F and D_S denote the generator and local discriminator of the opposite direction. G(x) denotes the picture generated from the labeled training-set picture x by the generator G; D_T(G(x)) and D_T(y) denote the results of judging G(x) and y with the local discriminator D_T. F(y) denotes the picture generated from the unlabeled training-set picture y by the generator F; D_S(x) and D_S(F(y)) denote the results of judging x and F(y) with the local discriminator D_S.
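Assuming the standard binary cross-entropy form for these discriminator losses (the images of equations (2) and (3) are not reproduced in this text), a minimal plain-Python sketch:

```python
import math

def bce(pred, target):
    """Binary cross-entropy for a single probability prediction."""
    eps = 1e-12  # guards against log(0)
    return -(target * math.log(pred + eps) + (1 - target) * math.log(1 - pred + eps))

def adversarial_loss(d_real, d_fake):
    """Discriminator cross-entropy in the spirit of L_Tadv / L_Sadv:
    real samples (e.g. y for the forward direction) should be scored 1,
    generated samples (e.g. G(x)) should be scored 0.
    d_real, d_fake are lists of discriminator output probabilities."""
    loss_real = sum(bce(p, 1.0) for p in d_real) / len(d_real)
    loss_fake = sum(bce(p, 0.0) for p in d_fake) / len(d_fake)
    return loss_real + loss_fake
```

A perfect discriminator (scoring real pictures 1 and generated pictures 0) drives this loss to zero; a maximally uncertain one (scoring everything 0.5) yields 2·ln 2.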
L_PTadv and L_PSadv are the losses of the two mirror-symmetric global discriminators D_T and D_S in the SP-PGGAN model; they take the same form as the losses of the local discriminators, i.e. L_PTadv = L_Tadv and L_PSadv = L_Sadv in form.
L_cyc is the sum of the losses of the two mirror-symmetric generators in the SP-PGGAN model, i.e. the sum of the Euclidean distances between x and F(G(x)) and between y and G(F(y)), as shown in equation (4). For the forward direction, x denotes a picture in the labeled training set, G(x) the picture obtained through the generator G, and F(G(x)) the picture obtained through the generator G and then the generator F; the Euclidean distance between picture x and the generated picture F(G(x)) constitutes a generation loss. For the reverse direction, y denotes a picture in the unlabeled training set, F(y) the picture obtained through the generator F, and G(F(y)) the picture obtained through the generator F and then the generator G; the Euclidean distance between picture y and the generated picture G(F(y)) constitutes a generation loss.
L_ide denotes the color-consistency loss of the forward and reverse generation processes, as shown in equation (5). For the forward direction, x denotes a picture in the labeled training set and F(x) the picture generated from x by the generator F; the Euclidean distance between picture x and the generated picture F(x) constitutes the color-consistency loss. For the reverse direction, y denotes a picture in the unlabeled training set and G(y) the picture generated from y by the generator G; the Euclidean distance between picture y and the generated picture G(y) constitutes the color-consistency loss.
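The cycle loss and the color-consistency loss described above are both sums of Euclidean distances. A minimal plain-Python sketch, operating on flattened pictures represented as lists of numbers (a real implementation would use tensors):

```python
def l2(a, b):
    """Euclidean distance between two flattened pictures (lists of floats)."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b)) ** 0.5

def cycle_loss(x, fgx, y, gfy):
    """L_cyc per the description of equation (4): sum of Euclidean distances
    between x and F(G(x)), and between y and G(F(y))."""
    return l2(x, fgx) + l2(y, gfy)

def identity_loss(x, fx, y, gy):
    """L_ide per the description of equation (5): color-consistency loss,
    the Euclidean distances between x and F(x) and between y and G(y)."""
    return l2(x, fx) + l2(y, gy)
```

When the cycled (or identity-mapped) pictures exactly match the inputs, both losses vanish, which is the behavior the generators are pushed toward.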
L_con is the generation loss of the twin network and consists of the Euclidean distance of the positive sample pair and the Euclidean distance of the negative sample pair, as shown in equation (6). x_1, x_2 denote the two input pictures of the twin network, and i denotes the label of the input pair. When i = 1, (x_1, x_2) is a positive sample pair, i.e. two pictures of the same pedestrian: x and G(x) in the forward direction, and y and F(y) in the opposite direction. When i = 0, (x_1, x_2) is a negative sample pair, i.e. two pictures of different pedestrians: y and G(x) in the forward direction, and x and F(y) in the opposite direction. d denotes the Euclidean distance between the two input pictures, and m ∈ [0, 2] bounds the margin.
L_con(i, x_1, x_2) = (1 − i)·{max(0, m − d)}^2 + i·d^2    (6)
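Equation (6) can be written directly in plain Python, with i, d and m as defined above:

```python
def contrastive_loss(i, d, m=2.0):
    """Twin-network loss of equation (6):
    L_con = (1 - i) * max(0, m - d)^2 + i * d^2
    i = 1 for a positive pair (same pedestrian, e.g. x and G(x)):
        the loss is d^2, pulling the pair together;
    i = 0 for a negative pair (different pedestrians, e.g. y and G(x)):
        the loss is max(0, m - d)^2, pushing the pair apart up to margin m;
    d is the Euclidean distance between the two twin-network embeddings."""
    return (1 - i) * max(0.0, m - d) ** 2 + i * d ** 2
```

Note that a negative pair already farther apart than the margin m contributes zero loss, so the twin network only works on pairs that are "wrongly" close.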
In equation (1), γ_1, γ_2, γ_3 are importance parameters controlling L_cyc, L_ide and L_con respectively, each in the range [1, 10].
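Combining the terms of equation (1) is then a weighted sum; the default weights below are the values γ_1 = 10, γ_2 = 5, γ_3 = 3 reported in the experiments, shown here only as a sketch:

```python
def total_loss(l_tadv, l_sadv, l_ptadv, l_psadv, l_cyc, l_ide, l_con,
               gamma1=10.0, gamma2=5.0, gamma3=3.0):
    """Overall SP-PGGAN objective of equation (1): the four adversarial
    terms enter unweighted, while the cycle, color-consistency and
    twin-network terms are scaled by gamma1, gamma2, gamma3."""
    return (l_tadv + l_sadv + l_ptadv + l_psadv
            + gamma1 * l_cyc + gamma2 * l_ide + gamma3 * l_con)
```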
The invention trains SP-PGGAN using Market-1501 and DukeMTMC-reID as training sets. Market-1501 contains 1,501 pedestrians, 12,936 training-set pictures and 19,732 search-set pictures; 751 pedestrians are used for training and 750 for testing, and each pedestrian is captured by at most six cameras. DukeMTMC-reID contains 34,183 pictures of 1,404 pedestrians: 702 pedestrians are used for training and the remaining 702 for testing, with 2,228 query images and 17,661 search-set images.
The invention uses TensorFlow as the framework and trains SP-PGGAN on Market-1501 and DukeMTMC-reID without using any ID information during training. In all migration experiments we set γ_1 = 10, γ_2 = 5, γ_3 = 3 in equation (1) and m = 2 in equation (6). The initial learning rate of SP-PGGAN is set to 0.0002, and the model stops training after 5 epochs. In the test phase we use generator G for the Market-1501 → DukeMTMC-reID migration and generator F for the DukeMTMC-reID → Market-1501 migration.
Step 3: implementation of pedestrian re-recognition
After style migration, the invention further carries out pedestrian re-identification experiments. Since the invention is mainly an improvement of the style-migration process, this second step adopts the basic IDE (ID-discriminative Embedding) method. IDE can be replaced by any pedestrian re-recognition method; it is chosen here mainly because it is a baseline method for pedestrian re-recognition, and the migration method of the invention obtains good results on this model.
In the pedestrian re-recognition experiment, the labeled training set obtained after SP-PGGAN style migration is used to train the classification network with the pedestrian re-recognition IDE model, yielding a trained classification network. The test set of the unlabeled dataset is then input into the trained classification network to realize pedestrian re-recognition. As shown in fig. 5, the SP-PGGAN style-migrated pictures G(x) are used to train the IDE model, yielding a trained IDE network. During testing, the pedestrian pictures to be queried from the unlabeled dataset are input into the trained IDE network together with the search set, and the pictures in the search set showing the same pedestrian as the query picture are retrieved, realizing the pedestrian re-identification process.
As shown in FIG. 5, the IDE of the present invention adopts ResNet-50 as the baseline model and adjusts the output dimension of the last fully connected layer to the number of identities in the training dataset. IDE is trained as a classification network: the first five layers are the convolutional stages of ResNet-50, layers 6 and 7 are fully connected layers with 1024 neurons each, and layer 8 is a classification layer whose size equals the number of identities. The network is then trained as a classification task.
The present invention fine-tunes ResNet-50 pre-trained on ImageNet on the migrated training set. The final fully connected layer is resized to 751 outputs for Market-1501 and 702 for DukeMTMC-reID. In this step, the experiments train the CNN model with mini-batch SGD on a GTX 1080 Ti GPU. The batch size, maximum number of epochs, momentum and gamma during training are set to 16, 50, 0.9 and 0.1 respectively. The initial learning rate is 0.001, decaying to 0.0001 after 40 epochs.
To further enhance the effect of pedestrian re-recognition on the target dataset, the invention introduces a feature-pooling method called Local Maximum Pooling (LMP). It can be applied directly to the trained IDE model and reduces the impact of erroneous pictures from the migration process on re-recognition. The original ResNet-50 uses global average pooling after the fifth convolutional stage. With LMP, the feature map produced by the fifth convolutional stage is first divided into P parts along the horizontal direction, and global max or average pooling is applied to each part. Finally, the invention concatenates the pooled results of all parts; this procedure can be used directly at test time. In the experiments, the invention compares global max pooling with global average pooling and selects the better-performing one for the LMP concatenation.
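The LMP procedure described above (split the final feature map into P horizontal parts, pool each part, concatenate) can be sketched in plain Python using nested lists for an H×W×C feature map; a real implementation would operate on framework tensors.

```python
def local_max_pooling(feature_map, p):
    """Local Maximum Pooling (LMP) as described above: split the feature
    map into p horizontal parts, max-pool each part per channel, and
    concatenate the results. feature_map is H x W x C as nested Python
    lists: a list of rows, each row a list of per-channel pixel lists."""
    h = len(feature_map)
    c = len(feature_map[0][0])
    part = h // p  # assumes h is divisible by p
    pooled = []
    for k in range(p):
        rows = feature_map[k * part:(k + 1) * part]  # the k-th horizontal stripe
        for ch in range(c):
            pooled.append(max(px[ch] for row in rows for px in row))
    return pooled  # length p * c
```

Swapping `max` for a mean would give the average-pooled variant the experiments compare against.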
The results of the examples of the present invention are shown in table 2:
TABLE 2
As can be seen from table 2, migration with SP-PGGAN (m = 2) yields better pedestrian re-recognition than migration with CycleGAN: rank-1 and mAP tested on Market-1501 with SP-PGGAN (m = 2) are higher than with CycleGAN by 11.2% and 6.2% respectively, and when tested on DukeMTMC-reID they are higher by 6.7% and 3.5% respectively. This demonstrates the effectiveness of the proposed SP-PGGAN model. At the same time, using LMP on SP-PGGAN is shown to further improve pedestrian re-recognition to some extent.

Claims (5)

1. The pedestrian re-identification method based on SP-PGGAN style migration is characterized by comprising the following three steps:
step (1) construction of SP-PGGAN model
The model is used for style migration between a labeled pedestrian re-identification dataset and an unlabeled pedestrian re-identification dataset; the model structure is improved based on CycleGAN; CycleGAN is two mirror-symmetric GANs forming a ring network, where the two GANs share a generator G and a generator F and each has a local discriminator, D_X and D_Y respectively; SP-PGGAN is based on CycleGAN: the generators of CycleGAN are retained, a twin network is added after the generators to guide the generation process, and at the same time the local discriminators D_X and D_Y of CycleGAN are replaced by parallel local and global discriminators;
step (2) style migration of SP-PGGAN model
The training set of the labeled pedestrian re-recognition dataset and the training set of the unlabeled pedestrian re-recognition dataset are input simultaneously into the constructed SP-PGGAN model for style migration, obtaining the migrated training set of the labeled pedestrian re-recognition dataset; two properties are maintained during migration: first, when an image from the labeled dataset is migrated toward the unlabeled dataset, the migrated image is consistent in style with the unlabeled dataset; second, the ID information of the pedestrian in the labeled image remains unchanged before and after migration; the ID information refers to the pedestrian region of the image, excluding the background, that carries the labeling information of the pedestrian;
implementation of pedestrian re-identification in step (3)
Training a classification network on the migrated labeled training set obtained after SP-PGGAN style migration, using the IDE model for pedestrian re-identification, to obtain a trained IDE model, and realizing pedestrian re-identification on the test set of the unlabeled dataset;
the step (2) comprises the following steps:
in the migration process, for the positive direction, the generation process is that a training set picture x from a marked data set generates a picture G (x) through a generator G, the picture G (x) generates a picture F (G (x)) through a generator F, then the pictures x and G (x), the picture G (x) and a picture y of a non-marked data set training set are respectively input into a twin network, and the twin network is used for improving the accuracy of pedestrian ID information in marked data in the style migration process; the discrimination process is that the picture G (x) and the picture y of the training set of the unmarked data set are simultaneously input into the global discriminator D T1 And local discriminant D T2 Is performed in the middle ofTraining a discriminator, wherein the global discriminator discriminates the true and false of the whole picture, and the local discriminator discriminates the local true and false of the picture; for the opposite direction, the generation process is that a picture y from a training set without a marked data set generates a picture F (y) through a generator F, the picture F (y) generates a picture G (F (y)) through a generator G, and then the pictures y and F (y), the picture F (y) and a picture x with the training set with the marked data set are respectively input into a twin network; the discrimination process is that the picture F (y) and the picture x with the marked data set training set are simultaneously input into the global discriminator D S1 And local discriminant D S2 Training the discriminator;
in the training process, parameters of the discriminant are kept unchanged, parameters of the generating process are trained, then parameters of the generator and the discriminant are fixed, and the processes are repeated, so that the generator and the discriminant respectively gradually evolve.
2. The pedestrian re-recognition method based on SP-PGGAN style migration according to claim 1, characterized in that: the SP-PGGAN model in step (1) consists of a generator and a discriminator;
wherein the generator of SP-PGGAN is composed of two parts: one part is a generator G and a generator F of the CycleGAN, and each generator is sequentially provided with two convolution layers with the step length of 2, six residual blocks and two deconvolution layers with the step length of 1/2 from shallow to deep; the other part is a twin network, which sequentially comprises four layers of convolution layers with the step length of 2, four layers of maximum pooling layers with the step length of 2 and one layer of full connection layer from shallow to deep;
the SP-PGGAN discriminator includes a discriminator D T Sum discriminator D S Wherein the discriminator D T Comprising a local discriminant D T2 And global arbiter D T1 Discriminator D s Comprising a local discriminant D S2 And global arbiter D S1 The method comprises the steps of carrying out a first treatment on the surface of the Distinguishing device D T Sum discriminator D S The structure is the same, the parameters are not shared, wherein, the global arbiter D T1 And local discriminant D T2 The four convolution layers with the step length of 2 are shared, and the four convolution layers are divided into two paths from the end of the fourth convolution layer, wherein one path is a global discriminator D T1 Finally, a binary number is output, and the binary number determines whether the whole picture is judged to be true or false; the other path is a local discriminator D T2 It outputs a 256-dimensional vector through the full-connection layer, and each number determines whether the image block at the corresponding position is true or false.
3. The pedestrian re-recognition method based on SP-PGGAN style migration according to claim 1, wherein the loss function of the SP-PGGAN model in the repeated training process is:
L = L_Tadv + L_Sadv + L_PTadv + L_PSadv + γ1·L_cyc + γ2·L_ide + γ3·L_con
L_Tadv and L_Sadv are the losses of the two mirror-symmetric local discriminators D_T2 and D_S2 in the SP-PGGAN model, both cross-entropy losses; L_PTadv and L_PSadv are the losses of the two mirror-symmetric global discriminators D_T1 and D_S1, which, like the local discriminator losses, are cross-entropy losses; L_cyc is the sum of the losses of the two mirror-symmetric generators in the SP-PGGAN model; L_ide denotes the color-consistency loss of the generation processes in the forward and backward directions; L_con is the loss of the twin network; γ1, γ2 and γ3 are importance parameters controlling L_cyc, L_ide and L_con respectively, each taking a value in the range [1, 10].
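The weighted sum above can be sketched as a plain function; the default γ values below are illustrative picks from the stated [1, 10] range, not values from the patent:

```python
def sp_pggan_loss(L_Tadv, L_Sadv, L_PTadv, L_PSadv,
                  L_cyc, L_ide, L_con,
                  gamma1=10.0, gamma2=5.0, gamma3=2.0):
    """Total SP-PGGAN objective: four adversarial terms plus the
    cycle, color-consistency and twin-network terms, weighted by
    gamma1..gamma3 (each in [1, 10])."""
    return (L_Tadv + L_Sadv + L_PTadv + L_PSadv
            + gamma1 * L_cyc + gamma2 * L_ide + gamma3 * L_con)
```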
4. The pedestrian re-recognition method based on SP-PGGAN style migration of claim 3, wherein:
the L_Tadv is calculated as:
L_Tadv = E_y[log D_T(y)] + E_x[log(1 − D_T(G(x)))]
wherein G(x) represents the picture generated by the generator G from the labeled training set picture x, and D_T(G(x)) and D_T(y) denote the results of discriminating G(x) and y, respectively, through the local discriminator D_T2;
the L_Sadv is calculated as:
L_Sadv = E_x[log D_S(x)] + E_y[log(1 − D_S(F(y)))]
wherein F(y) represents the picture generated by the generator F from the unlabeled training set picture y, and D_S(x) and D_S(F(y)) denote the results of discriminating x and F(y), respectively, through the local discriminator D_S2;
the L_cyc is the sum of the Euclidean distances between x and F(G(x)) and between y and G(F(y));
the L_ide is calculated as:
L_ide = ‖F(x) − x‖ + ‖G(y) − y‖
wherein F(x) represents the picture generated from x by the generator F, and G(y) represents the picture generated from y by the generator G;
the L_con is calculated as:
L_con(i, x_1, x_2) = (1 − i){max(0, m − d)}^2 + i·d^2
wherein x_1 and x_2 denote the two input pictures of the twin network, and i denotes the label of the input pair: when i = 1, x_1 and x_2 form a positive sample pair, i.e. two pictures of the same pedestrian, namely x and G(x) in the forward direction and y and F(y) in the backward direction; when i = 0, x_1 and x_2 form a negative sample pair, i.e. two pictures of different pedestrians, namely y and G(x) in the forward direction and x and F(y) in the backward direction; d denotes the Euclidean distance between the two input pictures; and m is a margin with m ∈ [0, 2].
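The contrastive loss of the twin network can be sketched directly; feature vectors are represented here as plain Python lists, and the margin default m = 1.0 is an illustrative choice within the stated [0, 2] range:

```python
import math

def euclidean(x1, x2):
    """Euclidean distance d between two feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x1, x2)))

def contrastive_loss(i, x1, x2, m=1.0):
    """L_con(i, x1, x2) = (1 - i) * max(0, m - d)^2 + i * d^2.

    i = 1: positive pair (same pedestrian), penalized by d^2;
    i = 0: negative pair (different pedestrians), penalized only
           when the pair is closer than the margin m."""
    d = euclidean(x1, x2)
    return (1 - i) * max(0.0, m - d) ** 2 + i * d ** 2
```

A well-separated negative pair (d ≥ m) contributes zero loss, while an identical positive pair also contributes zero, which is the intended behavior of the twin-network term.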
5. The pedestrian re-recognition method based on SP-PGGAN style migration of claim 1,
the method is characterized in that: the step (3) is specifically as follows:
the picture G(x) obtained after SP-PGGAN style migration is used to train an IDE model, yielding a trained IDE model; in the test process, the picture of the pedestrian to be tested and the search set of the unlabeled data set are input together into the trained IDE model, and the picture showing the same pedestrian as the pedestrian to be tested is found in the search set, thereby realizing pedestrian re-recognition.
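The retrieval step above reduces to ranking the search-set (gallery) pictures by feature distance to the query picture; a minimal sketch, assuming the trained IDE model has already mapped every picture to a feature vector (the feature-extraction step itself is omitted here):

```python
import math

def rank_gallery(query_feat, gallery_feats):
    """Return gallery indices sorted by Euclidean distance to the
    query feature; the rank-1 index is the best candidate for the
    same pedestrian as the query."""
    def dist(g):
        return math.sqrt(sum((q - v) ** 2 for q, v in zip(query_feat, g)))
    return sorted(range(len(gallery_feats)),
                  key=lambda k: dist(gallery_feats[k]))

ranking = rank_gallery([1.0, 0.0],
                       [[0.0, 1.0], [1.0, 0.1], [5.0, 5.0]])
print(ranking)  # gallery item 1 is closest to the query
```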
CN202010226128.5A 2020-03-26 2020-03-26 Pedestrian re-recognition method based on SP-PGGAN style migration Active CN111428650B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010226128.5A CN111428650B (en) 2020-03-26 2020-03-26 Pedestrian re-recognition method based on SP-PGGAN style migration

Publications (2)

Publication Number Publication Date
CN111428650A CN111428650A (en) 2020-07-17
CN111428650B true CN111428650B (en) 2024-04-02

Family

ID=71548862

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010226128.5A Active CN111428650B (en) 2020-03-26 2020-03-26 Pedestrian re-recognition method based on SP-PGGAN style migration

Country Status (1)

Country Link
CN (1) CN111428650B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112232422A (en) * 2020-10-20 2021-01-15 北京大学 Target pedestrian re-identification method and device, electronic equipment and storage medium
CN113569627A (en) * 2021-06-11 2021-10-29 北京旷视科技有限公司 Human body posture prediction model training method, human body posture prediction method and device
CN113658178B (en) * 2021-10-14 2022-01-25 北京字节跳动网络技术有限公司 Tissue image identification method and device, readable medium and electronic equipment

Citations (2)

Publication number Priority date Publication date Assignee Title
CN109670528A * 2018-11-14 2019-04-23 中国矿业大学 Data augmentation method based on a paired-sample random occlusion strategy for the pedestrian re-identification task
CN110163110A * 2019-04-23 2019-08-23 中电科大数据研究院有限公司 Pedestrian re-recognition method based on the fusion of transfer learning and deep features

Non-Patent Citations (2)

Title
Satoshi Iizuka et al., "Globally and Locally Consistent Image Completion," ACM Transactions on Graphics, 2017, vol. 36, no. 4, pp. 1-14. *
Weijian Deng et al., "Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification," arXiv:1711.07027v3, 2018, pp. 1-10. *


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant