CN116863313B

CN116863313B - Target re-identification method and system based on label increment refining and symmetrical scoring

Info

Publication number: CN116863313B
Application number: CN202311134901.5A
Authority: CN
Inventors: 黄文心; 苏帅朋; 韩希钰; 钟忺; 刘文璇; 贾雪梅; 赵石磊; 巫世峰
Original assignee: Hubei University
Current assignee: Hubei University
Priority date: 2023-09-05
Filing date: 2023-09-05
Publication date: 2024-01-12
Anticipated expiration: 2043-09-05
Also published as: CN116863313A

Abstract

The invention discloses a target re-identification method and a target re-identification system based on label increment refining and symmetrical scoring, wherein the method comprises the following steps: pre-training a first depth model by using data containing noise labels, and extracting the characteristics of all samples through the first depth model; calculating the distance from the extracted sample characteristics to the class center, and dividing the sample into a clean sample and a suspicious sample according to the distance; calculating the characteristic distance from each suspicious sample to a clean sample, and constructing a characteristic distance matrix; generating a tag diffusion weight matrix according to the characteristic distance matrix, and diffusing the tag of the clean sample to the suspicious sample based on the tag diffusion weight matrix to obtain an optimized tag of the suspicious sample; training a second depth model with the clean samples and their labels, the suspicious samples and their optimized labels for target re-identification. According to the method, the sample is divided into the clean sample and the suspicious sample, and the labels of the suspicious sample are refined according to the label scattering weight, so that the label noise is effectively reduced.

Description

Target re-identification method and system based on label increment refining and symmetrical scoring

Technical Field

The invention belongs to the technical field of target re-identification, and particularly relates to a target re-identification method and system based on label increment refining and symmetrical scoring.

Background

Computer vision is an important branch of the field of artificial intelligence, aimed at using computer algorithms to simulate and understand the human visual system. It obtains useful information by analyzing image and video data to solve various practical problems such as object recognition, object localization, and anomaly detection. Target re-recognition is an important task in the field of computer vision, generally involving recognition of objects such as pedestrians and vehicles. In the task of pedestrian re-identification, the goal is to identify and track the identity of different pedestrians, even though they are present at different times and places, and to be able to track in a video sequence. The vehicle re-identification task is to find a specific vehicle in images shot by different cameras, and further lock and track the same vehicle by searching. The target re-identification task is widely applied to a plurality of fields such as safety monitoring, intelligent transportation and the like, and has important significance for the application of artificial intelligence in modern life.

However, current target re-identification is still highly dependent on large-scale precisely annotated data. Existing large-scale images or videos are usually automatically marked by a computer, but due to factors such as gestures, shielding, deformation, illumination and the like, incorrect marking is easy to occur, and even under the condition of expensive manual marking, similar images are difficult to accurately identify. Therefore, the tag noise problem is difficult to avoid. Conventional target re-identification research is mainly focused on improving identification accuracy or model network, and negative effects caused by tag noise are not fully considered. Some methods with noise immunity in certain fields are often limited to a single task, lacking generalization capability. For example, the tag noise method in the image classification field is not applicable or does not achieve the desired effect in the pedestrian or vehicle re-recognition field. Some approaches in pedestrian or vehicle re-identification tasks also fail to correct false labels or make full use of the relationship information between samples. Recent studies utilize neighborhood label optimization to improve labels, however, when there are more noisy labels, the reliability of the neighborhood labels is not guaranteed.

There are methods for label noise that achieve significant results in image classification tasks, but involve only a few categories in image classification tasks, each category containing thousands of images. However, the task of pedestrian re-identification involves many different identities, with a sample of each identity typically not exceeding 30 images. Thus, a well-behaved method or a tag optimization method in image classification may not be suitable for pedestrian re-recognition tasks with tag noise. Some studies have explored this problem, e.g., purifyNet in PurifyNet: A robust person re-identification model with noisy labels, which proposes a hard-aware instance re-weighting strategy, assigning more weight to difficult samples with correct labels, thus optimizing the loss function; CORE, collaborative refining for person re-identification with label noise, proposes a self-label refinement strategy for joint optimization of labels and networks. These studies all use models to optimize tags, however, the models suffer from tag noise and problems of large intra-class differences, small inter-class differences, etc. due to cross-device presence when predicting the correct tags. The aforementioned methods each treat each sample as an isolated individual, and ignore neighborhood information, which is important in optimizing noise tags. In addition, there are some efforts to target recognition that employ sparse pairwise loss, with a few suitable pairwise adaptive forward mining strategies to dynamically adapt to changes within different categories or to improve the accuracy of target recognition by generating missing image portions. However, these methods do not adequately address the tag noise problems caused by the presence of occlusions, illumination, and cross-device factors.

In view of the foregoing, there is still a need to develop more intensive research and design more applicable methods to solve this problem for the task of re-identifying pedestrians with tag noise.

Disclosure of Invention

In view of the above, the invention provides a target re-identification method and a target re-identification system based on label increment refining and symmetrical scoring, which are used for solving the problem that the influence of label noise cannot be effectively reduced.

The invention discloses a target re-identification method based on label increment refining and symmetrical scoring, which comprises the following steps:

pre-training a first depth model by using data containing noise labels, and extracting the characteristics of all samples through the first depth model;

calculating the distance from the extracted sample characteristics to the class center, and dividing the sample into a clean sample and a suspicious sample according to the distance;

calculating the characteristic distance from each suspicious sample to a clean sample, and constructing a characteristic distance matrix;

generating a tag diffusion weight matrix according to the characteristic distance matrix, and diffusing the tag of the clean sample to the suspicious sample based on the tag diffusion weight matrix to obtain an optimized tag of the suspicious sample;

training a second depth model with the clean samples and their labels, the suspicious samples and their optimized labels for target re-identification.

On the basis of the above technical solution, preferably, calculating the distance from the extracted sample feature to the class center includes:

calculating the distance from the extracted sample feature to the category center;

calculating the average distance from the sample in each category to the center of the category;

and taking the samples with the average distance smaller than or equal to the average distance of the category as clean samples and taking the samples with the average distance larger than the average distance of the category as suspicious samples.

On the basis of the above technical solution, preferably, the calculating the feature distance from each suspicious sample to the clean sample and constructing the feature distance matrix specifically includes:

calculating suspicious samplesiTo clean samplejIs the characteristic distance of (2)：

if _k Representative sampleiIs the first of (2)kThe dimensional characteristics of the object are defined,jf _k representative samplejIs the first of (2)kThe dimensional characteristics of the object are defined,k=1,2,...,n，nis the dimension of the sample feature;

using a softmax function pairNormalizing to obtain normalized characteristic distance:

constructing an NxN feature distance matrix S from the normalized feature distances, whereinS is the firstiLine 1jColumn elements.

On the basis of the above technical solution, preferably, the formula for generating the tag diffusion weight matrix according to the feature distance matrix is:

wherein the method comprises the steps ofDiffusing weight matrix for tagsPIs the first of (2)iLine 1jColumn elements.

On the basis of the above technical solution, preferably, the diffusing the label of the clean sample to the suspicious sample based on the label diffusing weight matrix specifically includes:

calculating a new tag matrix of suspicious samples:

new tag matrix for suspicious sample, +.>Original tag matrix for suspicious sample, +.>The matrix is N rows and C columns, and C is the total number of sample categories;

for suspicious samplesiOptimized tagThe method comprises the following steps:

wherein,crepresents the firstcThe number of categories of the product,c=1,2,...,C，for new tag matrix->Is the first of (2)iLine 1cColumns.

On the basis of the above technical solution, preferably, in the process of training the second depth model by using the clean sample and the label thereof, the suspicious sample and the optimization label thereof to perform target re-identification, the suspicious sample is weighted, and a weight calculation formula is as follows:

wherein,for suspicious samplesiIndividual weight parameters of->Representing the turn on which the model was trained.

On the basis of the above technical solution, preferably, in the process of training the second depth model by using the clean sample and the label thereof, the suspicious sample and the optimized label thereof to re-identify the target, a loss function is calculated by an individual symmetry scoring mechanism:

wherein,Lrepresenting a loss function and,CE=plnq、RCE=qlnprespectively suspicious samplesiIs a standard cross entropy and an inverse cross entropy of (a),pfor a true tag distribution,qa category distribution is predicted for the model.

In a second aspect of the present invention, a target re-identification system based on tag delta refinement and symmetry scoring is disclosed, the system comprising:

and the feature extraction module is used for: pre-training a first depth model by using data containing noise labels, and extracting the characteristics of all samples through the first depth model;

sample dividing module: the method comprises the steps of calculating the distance from an extracted sample feature to a class center, and dividing the sample into a clean sample and a suspicious sample according to the distance;

and a label optimizing module: the method comprises the steps of calculating the characteristic distance from each suspicious sample to a clean sample, and constructing a characteristic distance matrix; generating a tag diffusion weight matrix according to the characteristic distance matrix, and diffusing the tag of the clean sample to the suspicious sample based on the tag diffusion weight matrix to obtain an optimized tag of the suspicious sample;

target re-identification module: for training a second depth model with the clean samples and their labels, suspicious samples and their optimized labels for target re-identification.

In a third aspect of the present invention, an electronic device is disclosed, comprising: at least one processor, at least one memory, a communication interface, and a bus;

the processor, the memory and the communication interface complete communication with each other through the bus;

the memory stores program instructions executable by the processor which the processor invokes to implement the method according to the first aspect of the invention.

In a fourth aspect of the invention, a computer-readable storage medium is disclosed, storing computer instructions that cause a computer to implement the method according to the first aspect of the invention.

Compared with the prior art, the invention has the following beneficial effects:

1) The method and the device distinguish the clean sample and the suspicious sample by using the extracted characteristics and the intra-class average distance, and refine the label of the suspicious sample according to the similarity of the suspicious sample and the clean sample as the diffusion weight of the clean sample label, thereby reducing the label noise and solving the problem that an optimization algorithm is invalid when the neighborhood labels of the sample are mostly wrong labels when the noise labels are more.

2) According to the invention, the learning degree of the model on the suspicious sample is measured by using the individual symmetrical score in the training process, and the model prediction result is added as the score parameter, so that the learning of the model on the potential noise sample and the hard sample far from the class center is balanced, the fault tolerance rate of the model on the tag noise is improved, the model can be better adapted to the existence of the tag noise, the sensitivity on the wrong tag is reduced as much as possible, and the influence of the potential tag noise on the depth model is further reduced.

Drawings

In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

FIG. 1 is a flow chart of a target re-identification method based on label increment refining and symmetric scoring of the present invention.

Detailed Description

The following description of the embodiments of the present invention will clearly and fully describe the technical aspects of the embodiments of the present invention, and it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present invention without making any inventive effort, are intended to fall within the scope of the present invention.

Referring to fig. 1, the invention provides a target re-identification method based on label increment refining and symmetry scoring, which comprises the following steps:

s1, pre-training a first depth model by using sample data containing noise labels, and extracting the characteristics of all samples through the first depth model.

The resnet50, convolutional neural network, etc. may be employed as a first depth model, trained using sample data containing noise labels, extracting features of all samples.

S2, calculating the distance from the extracted sample characteristic to the class center, and dividing the sample into a clean sample and a suspicious sample according to the distance.

The specific way of dividing into clean samples and suspicious samples is:

calculating Euclidean distance from the extracted sample characteristics to the category center;

Because the sample data contains noise labels, the characteristics of the samples are inaccurate, and the accuracy of target re-identification is affected. Therefore, the method and the device distinguish the clean sample from the suspicious sample by calculating the average distance in the class, so that the suspicious sample can be conveniently subjected to label optimization in the follow-up process.

S3, calculating the characteristic distance from each suspicious sample to the clean sample, and constructing a characteristic distance matrix.

For ease of calculation, a matrix S of N x N is provided for preserving the characteristic distance between samples, N being the number of samples,S _ij representing suspicious samplesiAnd clean samplejIf the sample isjFor suspicious sampleS _ij =0. If a sample isiFor clean samples, only the matrix SS _ii =1，iThe other columns of rows are all 0.

Calculating suspicious samples using the following formulaiTo clean samplejIs the characteristic distance of (2)：

Wherein,if _k representative sampleiIs the first of (2)kThe dimensional characteristics of the object are defined,jf _k representative samplejIs the first of (2)kThe dimensional characteristics of the object are defined,k=1,2,...,n，nis the characteristic dimension of the sample.

constructing an NxN feature distance matrix S from the normalized feature distances, whereinS is the firstiLine 1jThe column elements are arranged in a row,i,j=1,2,...,N。

s4, generating a tag diffusion weight matrix according to the feature distance matrix, and diffusing the tags of the clean samples to the suspicious samples based on the tag diffusion weight matrix to obtain optimized tags of the suspicious samples.

The formula for generating the tag diffusion weight matrix according to the characteristic distance matrix is as follows:

wherein the method comprises the steps ofDiffusing weight matrix for tagsPIs the first of (2)iLine 1jColumn element, tag diffusion weight matrixPA matrix of N rows and N columns, whereinP _ij Representative sampleiAndjis a similarity of (3). By aligningPEach row is ordered according to the ascending order of the rows and records the front of each row after being orderedkThe number of columns in which each sample is most similar to each sample can be foundkClean samples. If a sample isjNot a sampleiFront of (2)kNeighbors, thenP _ij =0. In this way, a neighborhood relationship can be established.

Diffusion of weight matrix through tagsPCalculating a new tag matrix of suspicious samples:

for suspicious samplesiOptimized tagThe method comprises the following steps:

According to the method, the characteristic distances between the clean samples and the suspicious samples are measured to obtain all the clean samples which are most similar to the suspicious samples, the characteristic distances between each suspicious sample and the clean samples are normalized, the similarity obtained according to the normalization result is used as clean sample diffusion label information to generate the weight of a new label for the suspicious samples, and the label type with the largest weight is found according to the label diffusion result to obtain the new optimized label, so that the problem that an optimization algorithm is invalid when the neighborhood labels of the samples are mostly wrong labels when noise labels are more can be solved.

S5, training a second depth model by using the clean sample and the label thereof, the suspicious sample and the optimized label thereof to re-identify the target.

In the process of training a second depth model by using the clean sample and the label thereof, the suspicious sample and the optimized label thereof to carry out target re-identification, the suspicious sample is weighted, and a weight calculation formula is as follows:

wherein,for suspicious samplesiIs a weight parameter of the individual.

Furthermore, for suspicious samples, the loss function is calculated by an individual symmetry scoring mechanism:

wherein,Lrepresenting the loss function, i.e., the individual symmetry score of the measurable sample.CE=plnq、RCE=qlnpRespectively suspicious samplesiIs a standard cross entropy and an inverse cross entropy of (a),representing the turn on which the model is trained,pfor a true tag distribution,qa category distribution is predicted for the model.

For clean samples, the individual symmetry scores default toThe weight defaults to 1.

Under training data containing label noise, the true labels of the model may be erroneous, thus the original distributionpNot necessarily representing the true distribution, in which case the distribution predicted by the modelqInstead, the invention has a certain representativeness, so the invention increases the model prediction result based on the individual symmetry scoring mechanismqAs a means oflnpBy a coefficient of (1)The method realizes the prediction result of the exchange model and the original label distribution to achieve the aim of reducing the harm of label noise, thereby improving the fault tolerance rate of the label noise. In this way, the model can better adapt to the presence of label noise during training and minimize sensitivity to false labels. In addition, the individual symmetry scoring mechanism scores suspicious samples and clean samples respectively, scoring of each individual instance level is achieved, the learning degree of the model is indirectly measured through the scoring, the learning process of difficult samples and samples possibly with error labels is helped to be balanced, more accurate feedback is provided for the model, and the performance and the robustness of the model in a target re-recognition task are improved.

Corresponding to the embodiment of the method, the invention also provides a target re-identification system based on label increment refining and symmetrical scoring, which comprises the following steps:

The system embodiments and the method embodiments are in one-to-one correspondence, and the brief description of the system embodiments is just to refer to the method embodiments.

The invention also discloses an electronic device, comprising: at least one processor, at least one memory, a communication interface, and a bus; the processor, the memory and the communication interface complete communication with each other through the bus; the memory stores program instructions executable by the processor that the processor invokes to implement the aforementioned methods of the present invention.

The invention also discloses a computer readable storage medium storing computer instructions for causing a computer to implement all or part of the steps of the methods of the embodiments of the invention. The storage medium includes: a usb disk, a removable hard disk, a ROM, a RAM, a magnetic disk, or an optical disk, or other various media capable of storing program codes.

The system embodiments described above are merely illustrative, wherein the elements illustrated as separate elements may or may not be physically separate, and the elements shown as elements may or may not be physical elements, i.e., may be distributed over a plurality of network elements. One of ordinary skill in the art may select some or all of the modules according to actual needs without performing any inventive effort to achieve the objectives of the present embodiment.

The foregoing description of the preferred embodiments of the invention is not intended to be limiting, but rather is intended to cover all modifications, equivalents, alternatives, and improvements that fall within the spirit and scope of the invention.

Claims

1. A target re-identification method based on label increment refining and symmetry scoring, the method comprising:

；

wherein the method comprises the steps ofDiffusing weight matrix for tagsPIs the first of (2)iLine 1jColumn elements; wherein->Is the first of the feature distance matrix SiLine 1jColumn elements;

based on the tag diffusion weight matrix, diffusing the tag of the clean sample to the suspicious sample, and obtaining the optimized tag of the suspicious sample specifically comprises the following steps:

calculating a new tag matrix of suspicious samples:

；

for suspicious samplesiOptimized tagThe method comprises the following steps:

；

wherein,crepresents the firstcThe number of categories of the product,c=1,2,...,C，for new tag matrix->Is the first of (2)iLine 1cA column;

training a second depth model by using the clean sample and the label thereof, the suspicious sample and the optimized label thereof to carry out target re-identification;

in the process of training the second depth model by using the clean sample and the label thereof, the suspicious sample and the optimized label thereof to re-identify the target, a loss function is calculated through an individual symmetry scoring mechanism:

；

wherein,Lrepresenting a loss function and,CE=plnq、RCE=qlnprespectively suspicious samplesiIs a standard cross entropy and an inverse cross entropy of (a),pfor a true tag distribution,qthe class distribution is predicted for the model,，/>for suspicious samplesiIndividual weight parameters of->Representing the turn on which the model was trained.

2. The target re-identification method based on label increment refining and symmetric scoring according to claim 1, wherein calculating the distance from the extracted sample feature to the class center comprises dividing the sample into a clean sample and a suspicious sample according to the distance size:

3. The method for re-identifying a target based on incremental refining and symmetric scoring of claim 1, wherein the calculating the feature distance from each suspicious sample to a clean sample and constructing the feature distance matrix specifically comprises:

；

4. A target re-identification system based on tag delta refinement and symmetry scoring using the method of any one of claims 1-3, the system comprising:

and the feature extraction module is used for: pre-training a first depth model by using data containing noise labels, and extracting features of all samples through the first depth model;

5. An electronic device, comprising: at least one processor, at least one memory, a communication interface, and a bus;

the memory stores program instructions executable by the processor, the processor invoking the program instructions to implement the method of any of claims 1-3.

6. A computer readable storage medium storing computer instructions for causing a computer to implement the method of any one of claims 1 to 3.