Disclosure of Invention
In view of the foregoing analysis, embodiments of the present invention provide an unsupervised domain adaptive pedestrian re-identification system and method, so as to solve the problems of large migration loss of a pedestrian re-identification network and low accuracy of a result of pedestrian re-identification in the prior art.
In one aspect, the present invention provides an unsupervised domain adaptive pedestrian re-identification system, including:
a data acquisition module for acquiring a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets;
the network model training module is used for obtaining a classification loss function and a sample invariance loss function of the pedestrian re-recognition network model, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity between each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model;
and the re-identification module is used for identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the pedestrian image to be identified.
Further, the pedestrian re-identification network model comprises a residual error network structure, a full connection layer and a Softmax normalization layer which are connected in sequence and correspond to the classification loss function, a target domain memory and an L, wherein the target domain memory and the L are connected in sequence and correspond to the sample invariance loss function2The system comprises a normalization layer, a source domain memory and a similarity measurement axis network structure which are sequentially connected and correspond to a hierarchical loss function, wherein the residual error network structure is respectively connected with the full connection layer, the target domain memory, the source domain memory and the similarity measurement axis network structure.
Further, the network model training module obtains the hierarchical loss function specifically by:
respectively inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain and respectively store the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset, and multiplying the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset to obtain corresponding similarity;
based on the similarity, sequencing the pedestrian pictures in the target domain sample subset in a descending order, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures;
obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting target domain sample subset input pedestrian re-recognitionInputting a pedestrian picture with the sequence of i, f (x) in the network modelt,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
Further, the layer weights of the three layers of pictures are set as follows:
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting the number of categories of pedestrian pictures in all the stored subsets of the target domain samples.
Further, the network model training module obtains a classification loss function specifically by the following means:
inputting the source domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain the features of each pedestrian picture in the source domain sample subset;
sequentially inputting the characteristics of each pedestrian picture into a full connection layer and a softmax regression layer of a pedestrian re-identification network model, and performing characteristic dimension conversion and characteristic normalization;
classifying a loss function by adopting the following formula based on the characteristics of each pedestrian picture after dimension conversion and normalization;
wherein L issrcRepresenting the classification loss function, nsRepresenting the number of pedestrian pictures, x, in a subset of source domain sampless,mRepresents the m-th pedestrian picture in the source domain sample subset, f (x)s,m) Picture x representing pedestrians,mCharacteristic of (a), ys,mRepresenting a pedestrian picture x in a subset of source domain sampless,mClass label of p (y)s,m|xs,m) Picture x representing pedestrians,mBelong to the category ys,mProbability of (a), f (x)s,j) Representing features of class j pedestrian pictures in the source domain sample subset, NsRepresenting the number of categories of pedestrian pictures in all the stored subsets of the active domain samples.
Further, the network model training module obtains a sample invariance loss function specifically by the following means:
inputting the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain the features of each pedestrian picture in the target domain sample subset, and performing L on the features of each pedestrian picture2Carrying out standardization treatment;
obtaining a sample invariance loss function according to the characteristics of each pedestrian picture after the normalization processing through the following formula:
wherein L isTRepresenting the sample invariance loss function, ntRepresenting the number, x, of pictures of a person in a sample subset of the target fieldt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iIs characterized by wi,lPicture x representing pedestriant,iWeight of pedestrian pictures belonging to class I, p (l | x)t,i) Picture x representing pedestriant,iProbability of a pedestrian belonging to class I picture, NtAnd the stored category number of the pedestrian pictures in all the target domain sample subsets is represented, and beta is a temperature coefficient.
Further, the network model training module performs iterative optimization on the pedestrian re-recognition network model specifically by the following means:
obtaining a total loss function according to the classification loss function, the sample invariance loss function and the layering loss function:
L=λ1Lsrc+λ2LT+λ3LSL,
wherein L represents the total loss function, LsrcRepresenting the classification loss function, λ1Weight representing the classification loss function, LTRepresenting the sample invariance loss function, λ2Weight, L, representing the sample invariance loss functionSLRepresenting the hierarchical loss function, λ3A weight representing a hierarchical loss function;
and traversing each source domain sample subset and each target domain sample subset, and iteratively updating the total loss function until the variable quantity of the total loss function value is smaller than a preset value so as to complete the optimization of the pedestrian re-identification network model.
On the other hand, the invention provides an unsupervised domain self-adaptive pedestrian re-identification method, which comprises the following steps:
obtaining a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets;
respectively obtaining a classification loss function and a sample invariance loss function of a pedestrian re-identification network model based on the source domain sample subset and the target domain sample subset, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity of each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model;
and identifying the image of the pedestrian to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the image of the pedestrian to be identified.
Further, the hierarchical loss function is obtained by specifically:
respectively inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain and respectively store the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset, and multiplying the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset to obtain corresponding similarity;
based on the similarity, sequencing the pedestrian pictures in the target domain sample subset in a descending order, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures;
obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
Further, the layer weights of the three layers of pictures are set as follows:
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting the number of categories of pedestrian pictures in all the stored subsets of the target domain samples.
Compared with the prior art, the invention can realize at least one of the following beneficial effects:
1. the unsupervised domain self-adaptive pedestrian re-identification system and the unsupervised domain self-adaptive pedestrian re-identification method effectively combine the supervised pedestrian re-identification with the unsupervised pedestrian re-identification through the combination of the source domain sample with the label and the target domain sample without the label, and firstly propose to layer the target domain sample through the similarity between the source domain sample and the target domain sample, so that the discrimination of the pedestrian re-identification network on the target domain sample is improved, the migration loss of the network is reduced, and the accuracy of the pedestrian re-identification result of the pedestrian re-identification network is further improved.
2. The unsupervised domain self-adaptive pedestrian re-recognition system and the unsupervised domain self-adaptive pedestrian re-recognition method, provided by the invention, utilize the similarity of the characteristics of the source domain sample and the target domain sample to carry out layering and weighting on the negative sample to obtain a layered loss function, the loss function enables the network to be self-adaptive in the training of a pedestrian re-recognition network model so as to reduce the learning of the characteristics of the negative sample, thereby reducing the migration loss of the network, and on the basis, the classification loss function and the sample invariance loss function are combined to carry out iterative optimization on the pedestrian re-recognition network, so that the accuracy of the pedestrian re-recognition result of the pedestrian re-recognition network.
In the invention, the technical schemes can be combined with each other to realize more preferable combination schemes. Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and drawings.
Detailed Description
The accompanying drawings, which are incorporated in and constitute a part of this application, illustrate preferred embodiments of the invention and together with the description, serve to explain the principles of the invention and not to limit the scope of the invention.
System embodiment
The invention discloses an unsupervised domain self-adaptive pedestrian re-identification system. As shown in fig. 1, the system includes:
a data acquisition module 110 for acquiring a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets. Specifically, a large number of pictures of pedestrians are randomly acquired from different angles, each picture contains a pedestrian, the acquired pictures are divided into a training set and a testing set, the training set is used for training the pedestrian re-recognition network model, and the testing set is used for testing the trained pedestrian re-recognition network model so as to ensure the recognition accuracy of the pedestrian re-recognition network model. The method comprises the steps of randomly selecting partial pictures in a training set, adding a label to each picture, specifically, adding the same label to a plurality of pictures of the same pedestrian, wherein different labels represent different pedestrians, dividing the pictures with the labels added into a plurality of source domain sample subsets, taking other pictures without the labels in the training set as a target domain sample set, adding a number to each picture, and randomly dividing the pictures with the numbers added into a plurality of target domain sample subsets.
The network model training module 120 is configured to obtain a classification loss function and a sample invariance loss function of the pedestrian re-recognition network model, rank and layer pedestrian pictures in the target domain sample subset according to a similarity between each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtain a layer loss function based on layered pedestrian picture features and corresponding layer weights. And traversing each source domain sample subset and each target domain sample subset based on a classification loss function, a sample invariance loss function and a hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model.
And the re-identification module is used for identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model to obtain the image which is the same as or similar to the pedestrian image to be identified. Specifically, a to-be-identified pedestrian picture is input into a trained pedestrian re-identification network model, the model can output a picture most similar to the to-be-identified pedestrian picture, exemplarily, the model is set to output the number of the first three pictures with the highest similarity to the to-be-identified pedestrian picture, the first three pictures are pictures in a target domain sample set, and then the to-be-identified pedestrian picture and the first three pictures are manually compared to determine the category of the to-be-identified pedestrian picture.
Preferably, as shown in fig. 2, the pedestrian re-identification network model includes a residual network structure, a fully-connected layer and a Softmax normalization layer connected in sequence corresponding to the classification loss function, a target domain memory connected in sequence corresponding to the sample invariance loss function, and an L2The system comprises a normalization layer, a source domain memory and a similarity measurement axis network structure which are connected in sequence and correspond to a hierarchical loss function, wherein the residual error network structure is connected with a full connection layer, a target domain memory, the source domain memory and the similarity measurement axis network structure respectively. Specifically, the target domain memory and the source domain memory are both in a key value storage structure, the key is used for storing the characteristics of the pedestrian picture, the value is used for storing a number or a label corresponding to the pedestrian picture, and the residual network structure is ResNet 50.
Preferably, the network model training module obtains the classification loss function, the sample invariance loss function and the hierarchical loss function in the following way:
step 1, inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model respectively to extract image features so as to obtain the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset.
And storing the characteristics of each pedestrian picture in the target domain sample subset in a target domain memory, and storing the characteristics of each pedestrian picture in the source domain sample subset in a source domain memory.
Step 2, specifically, obtaining a classification loss function in the following way:
and sequentially inputting the characteristics of each pedestrian picture in the source domain sample subset into a full connection layer and a softmax regression layer of the pedestrian re-identification network model, and performing characteristic dimension conversion and characteristic normalization to strengthen the nonlinearity of the pedestrian re-identification network model in learning.
Classifying a loss function by adopting the following formula based on the characteristics of each pedestrian picture after dimension conversion and normalization;
wherein L issrcRepresenting the classification loss function, nsRepresenting the number of pedestrian pictures, x, in a subset of source domain sampless,mRepresents the m-th pedestrian picture in the source domain sample subset, f (x)s,m) Picture x representing pedestrians,mCharacteristic of (a), ys,mRepresenting a pedestrian picture x in a subset of source domain sampless,mClass label of p (y)s,m|xs,m) Picture x representing pedestrians,mBelong to the category ys,mProbability of (a), f (x)s,j) Representing features of class j pedestrian pictures in the source domain sample subset, NsRepresenting the number of categories of pedestrian pictures in all the stored subsets of the active domain samples.
Step 3, specifically, obtaining a sample invariance loss function through the following method:
l is carried out on the characteristics of each pedestrian picture in the current target domain sample subset stored in the target domain memory2And (6) carrying out standardization processing.
Obtaining a sample invariance loss function according to the characteristics of each pedestrian picture after the normalization processing through the following formula:
wherein L isTRepresenting the sample invariance loss function, ntRepresenting the number, x, of pictures of a person in the current target field sample subsett,iRepresenting the pedestrian picture with the input sequence i when the current target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iIs characterized by wi,lPicture x representing pedestriant,iWeight of pedestrian pictures belonging to class I, p (l | x)t,i) Picture x representing pedestriant,iProbability of a pedestrian belonging to class I picture, NtAnd the stored category number of the pedestrian pictures in all the target domain sample subsets is represented, and beta is a temperature coefficient.
Step 4, obtaining a layering loss function by the following method:
multiplying the characteristics of each pedestrian picture in the current source domain sample subset and the characteristics of each pedestrian picture in the current target domain sample subset based on the similarity measurement axis network structure to obtain corresponding similarity;
and sequencing the pedestrian pictures in the target domain sample subset in a descending order based on the similarity, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures. Illustratively, the number of first layer pictures is 3, and the number of second layer pictures is 147.
Obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
Preferably, the layer weights of the three layers of pictures are set as follows:
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting pedestrians in all stored subsets of target domain samplesNumber of categories of pictures.
Specifically, the steps 2 to 4 are not sequentially performed, and can be performed simultaneously.
Step 5, specifically, the pedestrian re-identification network model is subjected to iterative optimization in the following mode:
obtaining a total loss function according to the classification loss function, the sample invariance loss function and the layering loss function:
L=λ1Lsrc+λ2LT+λ3LSL,
wherein L represents the total loss function, LsrcRepresenting the classification loss function, λ1Weight representing the classification loss function, LTRepresenting the sample invariance loss function, λ2Weight, L, representing the sample invariance loss functionSLRepresenting the hierarchical loss function, λ3Weights representing a hierarchical loss function, exemplary, λ1A value of 0.7, λ2A value of 0.3, λ3Is 0.2.
And traversing each source domain sample subset and each target domain sample subset, and repeating the steps 1 to 5 to update the total loss function in an iterative manner until the variable quantity of the total loss function value is smaller than a preset value so as to complete the optimization of the pedestrian re-identification network model.
Specifically, in the iterative update process, the pedestrian image features of the target domain sample subset stored in the target domain memory are updated in real time in the following manner:
x
t,irepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)
t,i) Picture x representing pedestrian
t,iWarp L
2Normalized characteristic, R
iImage x representing pedestrian in target domain stored in target domain memory
t,iIs characterized in that it is a mixture of two or more of the above-mentioned components,
is a hyper-parameter that controls the speed of feature updates.
Method embodiment
The embodiment of the method is based on the same principle as the system embodiment, so that the method is not repeated herein, and repeated parts can refer to the system embodiment on the market.
Specifically, as shown in fig. 3, the method includes:
s110, obtaining a source domain sample set comprising a plurality of source domain sample subsets and a target domain sample set comprising a plurality of target domain sample subsets.
S120, respectively obtaining a classification loss function and a sample invariance loss function of the pedestrian re-identification network model based on the source domain sample subset and the target domain sample subset, sequencing and layering pedestrian pictures in the target domain sample subset according to the similarity of each pedestrian picture in the source domain sample subset and each pedestrian picture in the target domain sample subset, and obtaining a layering loss function based on layered pedestrian picture characteristics and corresponding layering weights; and traversing each source domain sample subset and each target domain sample subset based on the classification loss function, the sample invariance loss function and the hierarchical loss function to perform iterative optimization on the pedestrian re-identification network model.
S130, identifying the pedestrian image to be identified by utilizing the optimized pedestrian re-identification network model, and obtaining the image which is the same as or similar to the pedestrian image to be identified.
Preferably, the stratification loss function is obtained in particular by:
respectively inputting the source domain sample subset and the target domain sample subset into a residual error network structure of a pedestrian re-identification network model to extract image features so as to obtain and respectively store the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset, and multiplying the features of each pedestrian picture in the source domain sample subset and the features of each pedestrian picture in the target domain sample subset to obtain corresponding similarity;
based on the similarity, sequencing the pedestrian pictures in the target domain sample subset in a descending order, sequentially selecting a first preset number of pictures as first-layer pictures, selecting a second preset number of pictures as second-layer pictures, taking the rest pictures as third-layer pictures, and respectively setting layer weights of the three layers of pictures;
obtaining a hierarchical loss function based on the characteristics of each pedestrian picture in the target domain sample subset and the level weight corresponding to the level to which the pedestrian picture belongs:
wherein L isSLRepresenting the hierarchical loss function, ntRepresenting the number of pedestrian pictures, w, in the target field sample subsett,rRepresenting the level weight, r representing the sequence number of the pedestrian pictures in the target domain sample subset according to the similarity, xt,iRepresenting the pedestrian picture with the input sequence i when the target domain sample subset is input into the pedestrian re-identification network model, f (x)t,i) Picture x representing pedestriant,iP (m | x)t,i) Picture x representing pedestriant,iProbability of similarity to all classes of pedestrian pictures in all stored subsets of source domain samples, R1[m]Features, R, representing all classes of pedestrian pictures in all stored subsets of active domain samples1[j]Features representing class j pedestrian pictures in all stored subsets of active domain samples, NsAnd the stored category number of the pedestrian pictures in all the active domain sample subsets is represented, and beta is a temperature coefficient.
Preferably, the layer weights of the three layers of pictures are set as follows:
where b denotes the number of first layer pictures, k1Represents the sum of the numbers of the first layer picture and the second layer picture, and p (l | i) represents the pedestrian picture xt,iProbability of class i pedestrian pictures in all target domain sample subsets, R1[l]Features representing class i pedestrian pictures in all stored subsets of target domain samples, NtRepresenting the number of categories of pedestrian pictures in all the stored subsets of the target domain samples.
Compared with the prior art, the unsupervised domain self-adaptive pedestrian re-identification system and the unsupervised domain self-adaptive pedestrian re-identification method provided by the embodiment have the advantages that on one hand, the combination of the source domain sample with the label and the target domain sample without the label effectively combines the supervised pedestrian re-identification with the unsupervised pedestrian re-identification, namely, the target domain sample is layered through the similarity between the source domain sample and the target domain sample for the first time, so that the discrimination of the pedestrian re-identification network on the target domain sample is improved, the migration loss of the network is reduced, and the accuracy of the pedestrian re-identification result by the pedestrian re-identification network is improved; on the other hand, the negative samples are layered and weighted by utilizing the similarity of the characteristics of the source domain samples and the target domain samples to obtain a layered loss function, the loss function enables the network to adaptively reduce the learning of the characteristics of the negative samples in the training of the pedestrian re-recognition network model, so that the migration loss of the network is reduced, on the basis, the classification loss function and the sample invariance loss function are combined to perform iterative optimization on the pedestrian re-recognition network, and the accuracy of the pedestrian re-recognition network on the pedestrian re-recognition result can be greatly improved.
Those skilled in the art will appreciate that all or part of the flow of the method implementing the above embodiments may be implemented by a computer program, which is stored in a computer readable storage medium, to instruct related hardware. The computer readable storage medium is a magnetic disk, an optical disk, a read-only memory or a random access memory.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention.