CN110826417B - Cross-view pedestrian re-identification method based on discriminant dictionary learning - Google Patents

Cross-view pedestrian re-identification method based on discriminant dictionary learning Download PDF

Info

Publication number
CN110826417B
Authority
CN
China
Prior art keywords
pedestrian
domain
dictionary
view
learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910966029.8A
Other languages
Chinese (zh)
Other versions
CN110826417A (en)
Inventor
谢明鸿
颜悦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kunming University of Science and Technology
Original Assignee
Kunming University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kunming University of Science and Technology filed Critical Kunming University of Science and Technology
Priority to CN201910966029.8A priority Critical patent/CN110826417B/en
Publication of CN110826417A publication Critical patent/CN110826417A/en
Application granted granted Critical
Publication of CN110826417B publication Critical patent/CN110826417B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/50 Context or environment of the image
    • G06V20/52 Surveillance or monitoring of activities, e.g. for recognising suspicious objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/217 Validation; Performance evaluation; Active pattern learning techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

The invention relates to a cross-view pedestrian re-identification method based on discriminative dictionary learning, and belongs to the technical field of digital image processing. First, based on the fact that pedestrian images from the same camera view share the same domain, the pedestrian features of different views are divided into a view-specific domain information component and a domain-invariant pedestrian appearance feature component, and a discriminative dictionary learning algorithm is designed to create a domain-general dictionary that describes the domain information component and a domain-invariant dictionary that describes the domain-invariant component, while forcing the coding coefficients of pedestrians under the same view to have strong similarity. Then, a stretching regularization term is proposed to force the coding coefficients of different pedestrians to keep a certain distance apart while keeping the coding coefficients of the same pedestrian as close as possible. Finally, a pedestrian matching scheme using the Euclidean distance is designed on the model that contains only the pedestrian feature information. The pedestrian re-identification method provided by the invention separates the domain information in the image to overcome the domain shift between different views, and produces a good recognition effect.

Description

Cross-view pedestrian re-identification method based on discriminant dictionary learning
Technical Field
The invention relates to a cross-view pedestrian re-identification method based on discriminative dictionary learning, and belongs to the technical field of digital image processing.
Background
Pedestrian re-identification is a technique that uses computer vision to determine whether a target pedestrian is present in images or video sequences taken by different cameras. In recent years, pedestrian re-identification has attracted increasing attention from researchers because of its wide application in pedestrian search, pedestrian tracking, and pedestrian behavior analysis, and many pedestrian re-identification methods have been proposed. Although computer vision researchers have made great efforts to improve the performance of pedestrian re-identification systems, the technique still faces significant challenges because the appearance of a pedestrian is often highly ambiguous across camera views.
Disclosure of Invention
The invention aims to provide a cross-view pedestrian re-identification method based on discriminative dictionary learning, which is used to solve the problem of domain shift in pedestrian re-identification in the prior art.
The technical scheme of the invention is as follows: a cross-view pedestrian re-identification method based on discriminative dictionary learning comprises the following steps:
1) determining an overall model framework of cross-view pedestrian re-identification based on discriminative dictionary learning;
2) dividing the pedestrian image features of different views into a view-specific domain information component and a domain-invariant pedestrian appearance feature component, and designing a discriminative dictionary learning algorithm to create a domain-general dictionary that describes the domain information component and a domain-invariant dictionary that describes the domain-invariant component;
3) training the discrimination-promoting term of the dictionaries;
4) proposing a stretching regularization term that forces the coding coefficients of different pedestrians to keep a certain distance apart while keeping the coding coefficients of the same pedestrian as close as possible;
5) training the discrimination-promoting term of the coding coefficients, forcing the coding coefficients of pedestrian images from the same view to have strong similarity;
6) determining the overall objective function of cross-view pedestrian re-identification based on discriminative dictionary learning;
7) solving the variables to be updated in the overall objective function;
8) designing a pedestrian matching scheme using the Euclidean distance, based on the model that retains only the domain-invariant pedestrian appearance features.
Specifically, the overall model framework of step 1) comprises:
X_a and X_b (whose exact definitions are given as an equation image in the original) denote the training sample sets under the two camera views. Robust feature representation learning and discriminative metric learning need to be integrated into a single framework, and the overall model framework is shown in formula (1):

[formula (1), given as an equation image in the original]

In the formula, D denotes the domain dictionary of the pedestrian images under all cameras, and D_t denotes the domain-specific dictionary used to encode the pedestrian appearance features after the domain information has been separated. Z_a, Z_b are the coding coefficients of the domain information of X_a and X_b over the dictionary D, and Z_ta, Z_tb are the coding coefficients, over the dictionary D_t, of the pedestrian-specific information. Φ(D, D_t, Z_a, Z_b, Z_ta, Z_tb) is the data fidelity term; minimizing it allows the dictionaries D and D_t to acquire representation ability. Ψ(D, D_t) is the discrimination-promoting term of the dictionaries, and Γ(Z_a, Z_b, Z_ta, Z_tb) is the discrimination-promoting term of the coding coefficients; these two terms are minimized so that the dictionaries and the coding coefficients have strong discriminative ability. The remaining symbols in formula (1) denote the i-th row of D and the j-th column of D_t, respectively (given as equation images in the original).
Specifically, the discriminative dictionary algorithm of step 2) includes:
To mitigate the domain shift between different camera views, the domain information is separated from the pedestrian image features, and the data fidelity term Φ(D, D_t, Z_a, Z_b, Z_ta, Z_tb) is expressed as:

[formula (2), given as an equation image in the original]

In the formula, the first group of terms (given as an equation image) establishes the domain information of the a and b camera views, and the second group of terms (given as an equation image) separates the domain information from the pedestrian appearance features that are unaffected by the domain.
Specifically, the dictionary discrimination-promoting term of step 3) includes:
The dictionary D is used to represent the domain information of the different camera views. Since images from the same camera share the same domain features, these images are expected to be linearly related to one another in terms of their domain features. To separate the domain information from the samples X_a and X_b, the proposed dictionary discrimination-promoting term is:

[formula (3), given as an equation image in the original]

In the formula, ||D||_* is the nuclear norm of the dictionary D. Because the domain information component and the true pedestrian appearance features have different spatial morphological characteristics, a structural incoherence regularization term (given as an equation image in the original) is introduced to promote mutual independence between the domain dictionary D and the pedestrian feature dictionary D_t. α_1 and α_2 are two scalar parameters that weight the ||D||_* term and the structural incoherence term, respectively.
Specifically, the stretching regularization term of step 4) includes:
The same pedestrian observed from different camera views is expected to have the same coding coefficients over the domain-specific dictionary D_t, while the algorithm should keep the distance between the coding coefficients of different pedestrians from different camera views larger than a constant. To meet this requirement, the following function is proposed for view a; a similar function is constructed for view b in the same way and is not repeated here:

[formula (4), given as an equation image in the original]

In the formula, {z}_+ = max{z, 0} and c is an arbitrary constant. The three image symbols (given as equation images in the original) denote, respectively: the k-th image of the l-th pedestrian under camera view a; the k*-th image of the same l-th pedestrian under view b whose coding coefficient is the most dissimilar to that of the k-th image, where k* ≠ k; and the k*-th image of a different pedestrian l* under view b that is the most similar to the k-th image of the l-th pedestrian, where l* ≠ l. The distance between the coding coefficients of the first pair (same pedestrian) does not cause misjudgment of the pedestrian identity, whereas the distance between the coding coefficients of the second pair (different pedestrians) means that pedestrian matching based on these coding coefficients would cause misrecognition. In this case, minimizing formula (4) promotes a margin between these two distances.
Specifically, the coding coefficient discrimination promoting term in step 5) includes:
For the coding coefficient matrices Z_a and Z_b of the a and b view domains, the same domain should have the same sparse representation. Based on this consideration, Γ(Z_a, Z_b, Z_ta, Z_tb) in the overall model framework (1) is defined as:

[formula (5), given as an equation image in the original]

In the formula, ||Z||_{2,1} denotes the l_{2,1} norm (its definition is given as an equation image in the original). Minimizing ||Z||_{2,1} makes the entries in each row of Z the same; this term causes the same atoms to be selected from D to represent the original features of the same domain, and makes the coding coefficients of these features share the same sparse representation over D. α_3, α_4, α_5 are three scalar parameters that weight the ||Z_a||_{2,1} + ||Z_b||_{2,1} term, the ||Z_ta||_1 + ||Z_tb||_1 term, and the remaining term (given as an equation image), respectively.
Specifically, the overall objective function of step 6) includes:
[formula (6), given as an equation image in the original]

In the formula, M_a and M_b denote the numbers of pedestrians under the two camera views, and N_al and N_bl denote the numbers of images of the l-th pedestrian under the two camera views, respectively.
Specifically, the variable solving of step 7) includes:
The variables D, D_t, Z_a, Z_b, Z_ta, Z_tb to be solved in the overall objective function (6) are not jointly convex, but the function is convex with respect to each variable when all the other variables are fixed. They can therefore be optimized by an alternating iterative process; the solution for each variable is as follows:
To update the coding coefficient Z_a (the variable Z_b is updated in the same way as Z_a and is not repeated here), first assume that D, D_t, Z_b, Z_ta, Z_tb are all fixed, which gives the following objective function:

[formula (7), given as an equation image in the original]

This is a typical l_{2,1} minimization problem, and the analytic solution of Z_a can be expressed as:

Z_a = (4D^T D + α_3 Λ_1)^{-1} (4D^T X_a + 2D^T D_t Z_ta)    (8)

In the formula, Λ_1 is a diagonal matrix formed from quantities given by an equation image in the original, in which the symbol denotes the j-th column of Z_i.
Then, Z_ta is updated by fixing D, D_t, Z_a, Z_b, Z_tb (the variable Z_tb is updated in the same way as Z_ta and is not repeated here), which gives the following objective function:

[formula (9), given as an equation image in the original]

For convenience of optimization, formula (9) is rewritten in vector form:

[formula (10), given as an equation image in the original]

In the formula, the feature symbol (given as an equation image) is the visual feature of the k-th image of the l-th pedestrian under view a. To solve (10), a relaxation variable (given as an equation image) is introduced, and formula (10) can then be relaxed as:

[formula (11), given as an equation image in the original]

The variables can then be updated by solving the subproblems given as equation images in the original. The above problem can be solved by an iterative shrinkage algorithm, with the coefficient vector updated by the rule given as an equation image in the original, where h denotes the h-th iteration. Using the updated coefficient vectors, Z_ta can be reconstructed as indicated by the equation image in the original.
After the coding coefficients Z_a and Z_ta have been updated, the dictionaries D and D_t can be updated alternately, with the following objective function:

[formula (14), given as an equation image in the original]

To update D, an intermediate variable C is introduced, and formula (14) becomes the relaxed problem given as an equation image in the original. C can then be solved from the subproblem given as an equation image in the original. This is a typical nuclear norm minimization problem that can be solved by the singular value thresholding algorithm. To update D_t, a relaxation variable H is introduced (the relaxed problem is given as an equation image in the original), and the closed-form solution for the relaxation variable H can be expressed as:

H = (α_2 D_t D_t^T + I_1)^{-1} D    (18)

where I_1 is an identity matrix. Using the updated C and H, D can be optimized by solving formula (19), given as an equation image in the original; this problem can be solved by the Lagrange dual. Finally, D_t is optimized by solving the corresponding problem, given as an equation image in the original, which is solved in the same way as the problem in formula (19).
Specifically, the pedestrian matching scheme of step 8) includes:
During testing, with the learned dictionaries D and D_t, the separation of the domain information from the pedestrian-specific information is achieved by solving the coding problem given as an equation image in the original. In the formula, Z_a, Z_b denote the domain coding coefficient matrices under views a and b, respectively, and Z_ta, Z_tb denote the coding coefficient matrices of the pedestrian-specific information under views a and b, respectively. This problem is solved by an alternating iteration method, and the iteration stops when the convergence conditions given as equation images in the original are satisfied. Let the two coding coefficient vectors defined by the equation images in the original be the coding coefficient vectors of the two pedestrians to be compared; the similarity between pedestrians is then measured by calculating the distance given as an equation image in the original (the Euclidean distance between the two coding coefficient vectors).
the invention has the beneficial effects that:
1. In current pedestrian re-identification methods, most studies assume that the pedestrian images to be identified have no domain difference between the two views; as a result, image information is lost and false information is introduced into the result, which degrades the recognition of the pedestrian images. The pedestrian re-identification method provided by the invention separates the domain information from the pedestrian image, avoids the propagation of false information, reduces time consumption, and improves the discriminative ability for pedestrians.
2. Compared with other methods, the pedestrian re-identification method provided by the invention significantly improves recognition performance.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 shows pedestrian image pairs from the two camera views of the PRID2011 dataset, provided by an embodiment of the present invention;
FIG. 3 is the CMC curve for the parameter α_1 of the algorithm on the PRID2011 dataset, provided by an embodiment of the present invention;
FIG. 4 is the CMC curve for the parameter α_2 of the algorithm on the PRID2011 dataset, provided by an embodiment of the present invention;
FIG. 5 is the CMC curve for the parameter α_3 of the algorithm on the PRID2011 dataset, provided by an embodiment of the present invention;
FIG. 6 is the CMC curve for the parameter α_4 of the algorithm on the PRID2011 dataset, provided by an embodiment of the present invention;
FIG. 7 is the CMC curve for the parameter α_5 of the algorithm on the PRID2011 dataset, provided by an embodiment of the present invention.
Detailed Description
The invention is further described with reference to the following drawings and detailed description.
Example 1: domain shift between pedestrian images from different camera perspectives is one of the major factors contributing to pedestrian appearance ambiguity. In addition, domain information in the same camera view is stable for a certain time, and all images in the same view share the same domain information. If the domain information can be separated from the pedestrian image, the remaining information will not be interfered by the domain information, and domain shift will not occur between pedestrian images from different camera perspectives. Based on the thought, the invention provides a novel domain invariant dictionary learning method which is used for cross-view pedestrian re-identification. In this approach, it is assumed that images from the same camera perspective share the same domain. In order to achieve a domain-invariant visual feature, pedestrian features at different viewing angles are divided into two components, one of which is a domain-specific component and the other of which is a domain-invariant feature component.
As shown in FIG. 1, a cross-view pedestrian re-identification method based on discriminative dictionary learning includes the following steps:
1) determining an overall model framework of cross-view pedestrian re-identification based on discriminative dictionary learning;
2) dividing the pedestrian image features of different views into a view-specific domain information component and a domain-invariant pedestrian appearance feature component, and designing a discriminative dictionary learning algorithm to create a domain-general dictionary that describes the domain information component and a domain-invariant dictionary that describes the domain-invariant component;
3) training the discrimination-promoting term of the dictionaries;
4) proposing a stretching regularization term that forces the coding coefficients of different pedestrians to keep a certain distance apart while keeping the coding coefficients of the same pedestrian as close as possible;
5) training the discrimination-promoting term of the coding coefficients, forcing the coding coefficients of pedestrian images from the same view to have strong similarity;
6) determining the overall objective function of cross-view pedestrian re-identification based on discriminative dictionary learning;
7) solving the variables to be updated in the overall objective function;
8) designing a pedestrian matching scheme using the Euclidean distance, based on the model that retains only the domain-invariant pedestrian appearance features.
The specific implementation process is as follows. First, based on the fact that pedestrian images from the same camera view share the same domain, the pedestrian features of different views are divided into a view-specific domain information component and a domain-invariant pedestrian appearance feature component, and a discriminative dictionary learning algorithm is designed to create a domain-general dictionary that describes the domain information component and a domain-invariant dictionary that describes the domain-invariant component, while forcing the coding coefficients of pedestrians under the same view to have strong similarity. Then, to overcome the appearance ambiguity, a stretching regularization term is proposed to force the coding coefficients of different pedestrians to keep a certain distance apart while keeping the coding coefficients of the same pedestrian as close as possible. Finally, a pedestrian matching scheme using the Euclidean distance is designed on the model that contains only the pedestrian feature information.
Further, the overall model framework of step 1) comprises:
X_a and X_b (whose exact definitions are given as an equation image in the original) denote the training sample sets under the two camera views. In this case, robust feature representation learning and discriminative metric learning need to be integrated into a single framework, and the overall model framework is shown in formula (1):

[formula (1), given as an equation image in the original]

In the formula, D denotes the domain dictionary of the pedestrian images under all cameras, and D_t denotes the domain-specific dictionary used to encode the pedestrian appearance features after the domain information has been separated. Z_a, Z_b are the coding coefficients of the domain information of X_a and X_b over the dictionary D, and Z_ta, Z_tb are the coding coefficients, over the dictionary D_t, of the pedestrian-specific information. Φ(D, D_t, Z_a, Z_b, Z_ta, Z_tb) is the data fidelity term; minimizing it allows the dictionaries D and D_t to acquire representation ability. Ψ(D, D_t) is the discrimination-promoting term of the dictionaries, and Γ(Z_a, Z_b, Z_ta, Z_tb) is the discrimination-promoting term of the coding coefficients; these two terms are minimized so that the dictionaries and the coding coefficients have strong discriminative ability. The remaining symbols in formula (1) denote the i-th row of D and the j-th column of D_t, respectively (given as equation images in the original).
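For illustration, the following is a minimal sketch (in Python/NumPy) of the variables assumed by framework (1). The feature dimension m and the sample counts are placeholders chosen only for the sketch; the dictionary sizes are the values used later in the embodiment, and whether the unit-norm constraint applies to rows or columns of D follows the equation image in the original.

import numpy as np

# Assumed shapes (illustrative only): m-dimensional features,
# N_a / N_b training images under camera views a and b.
m, N_a, N_b = 1000, 100, 100
d, d_t = 50, 760                 # dictionary sizes used in the embodiment

X_a = np.random.randn(m, N_a)    # training features, view a
X_b = np.random.randn(m, N_b)    # training features, view b

def init_dictionary(m, k):
    # Random dictionary with unit-norm atoms (taken here as columns).
    D = np.random.randn(m, k)
    return D / np.linalg.norm(D, axis=0, keepdims=True)

D   = init_dictionary(m, d)      # domain dictionary shared by all views
D_t = init_dictionary(m, d_t)    # dictionary for domain-invariant pedestrian appearance

# Coding coefficients to be learned by the alternating optimization
Z_a,  Z_b  = np.zeros((d,  N_a)), np.zeros((d,  N_b))
Z_ta, Z_tb = np.zeros((d_t, N_a)), np.zeros((d_t, N_b))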
Further, the discriminant dictionary algorithm in step 2) includes:
To mitigate the domain shift between different camera views, the domain information is separated from the pedestrian image features, and the data fidelity term Φ(D, D_t, Z_a, Z_b, Z_ta, Z_tb) is expressed as:

[formula (2), given as an equation image in the original]

In the formula, the first group of terms (given as an equation image) establishes the domain information of the a and b camera views, and the second group of terms (given as an equation image) separates the domain information from the pedestrian appearance features that are unaffected by the domain.
Further, the dictionary discrimination-promoting term of step 3) includes:
The dictionary D is used to represent the domain information of the different camera views. Since images from the same camera share the same domain features, these images are expected to be linearly related to one another in terms of their domain features. To separate the domain information from the samples X_a and X_b, the proposed dictionary discrimination-promoting term is:

[formula (3), given as an equation image in the original]

In the formula, ||D||_* is the nuclear norm of the dictionary D. Because the domain information component and the true pedestrian appearance features have different spatial morphological characteristics, a structural incoherence regularization term (given as an equation image in the original) is introduced to promote mutual independence between the domain dictionary D and the pedestrian feature dictionary D_t. α_1 and α_2 are two scalar parameters that weight the ||D||_* term and the structural incoherence term, respectively.
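A minimal sketch of how the two quantities in this dictionary term can be evaluated. The ||D_t^T D||_F^2 form of the structural incoherence term is an assumption for illustration; the patent only states that the term promotes independence between D and D_t, and the exact expression is given as an equation image.

import numpy as np

def nuclear_norm(D):
    # ||D||_*: sum of the singular values of the domain dictionary.
    return np.linalg.svd(D, compute_uv=False).sum()

def structural_incoherence(D, D_t):
    # Assumed incoherence penalty ||D_t^T D||_F^2, encouraging the two
    # dictionaries to span mutually independent subspaces.
    return np.linalg.norm(D_t.T @ D, 'fro') ** 2

def psi(D, D_t, alpha_1, alpha_2):
    # Dictionary discrimination-promoting term in the spirit of formula (3),
    # up to the exact weighting used in the original equation image.
    return alpha_1 * nuclear_norm(D) + alpha_2 * structural_incoherence(D, D_t)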
Further, the stretching regularization term of step 4) includes:
The same pedestrian observed from different camera views is expected to have the same coding coefficients over the domain-specific dictionary D_t, while the algorithm should keep the distance between the coding coefficients of different pedestrians from different camera views larger than a constant. To meet this requirement, the following function is proposed for view a; a similar function is constructed for view b in the same way and is not repeated here:

[formula (4), given as an equation image in the original]

In the formula, {z}_+ = max{z, 0} and c is an arbitrary constant. The three image symbols (given as equation images in the original) denote, respectively: the k-th image of the l-th pedestrian under camera view a; the k*-th image of the same l-th pedestrian under view b whose coding coefficient is the most dissimilar to that of the k-th image, where k* ≠ k; and the k*-th image of a different pedestrian l* under view b that is the most similar to the k-th image of the l-th pedestrian under view a, where l* ≠ l. The distance between the coding coefficients of the first pair (same pedestrian) does not cause misjudgment of the pedestrian identity, whereas the distance between the coding coefficients of the second pair (different pedestrians) means that pedestrian matching based on these coding coefficients would cause misrecognition. In this case, minimizing formula (4) promotes a margin between these two distances.
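A sketch of the stretching term for view a, assuming it takes the common margin form {c + d(hardest same-pedestrian pair) - d(hardest different-pedestrian pair)}_+; the exact expression is given only as an equation image in the original, so the function below is illustrative rather than the patented formula.

import numpy as np

def stretching_term_view_a(Z_ta, Z_tb, labels_a, labels_b, c=1.0):
    # Assumed hinge-style term: for each coding vector under view a, take the
    # most dissimilar same-pedestrian code and the most similar
    # different-pedestrian code under view b, and penalize margin violations.
    loss = 0.0
    for k in range(Z_ta.shape[1]):
        z = Z_ta[:, k]
        same = labels_b == labels_a[k]
        if not same.any() or same.all():
            continue                     # need both positives and negatives
        d_same = np.linalg.norm(Z_tb[:, same] - z[:, None], axis=0)
        d_diff = np.linalg.norm(Z_tb[:, ~same] - z[:, None], axis=0)
        # hardest positive (most dissimilar same pedestrian) vs.
        # hardest negative (most similar different pedestrian)
        loss += max(0.0, c + d_same.max() - d_diff.min())
    return loss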
Further, the coding coefficient discrimination-promoting term of step 5) includes:
For the coding coefficient matrices Z_a and Z_b of the a and b view domains, the same domain should have the same sparse representation. Based on this consideration, Γ(Z_a, Z_b, Z_ta, Z_tb) in the overall model framework (1) is defined as:

[formula (5), given as an equation image in the original]

In the formula, ||Z||_{2,1} denotes the l_{2,1} norm (its definition is given as an equation image in the original). Minimizing ||Z||_{2,1} makes the entries in each row of Z the same; this term causes the same atoms to be selected from D to represent the original features of the same domain, and makes the coding coefficients of these features share the same sparse representation over D. α_3, α_4, α_5 are three scalar parameters that weight the ||Z_a||_{2,1} + ||Z_b||_{2,1} term, the ||Z_ta||_1 + ||Z_tb||_1 term, and the remaining term (given as an equation image), respectively.
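A minimal sketch of the l_{2,1} norm used in formula (5) (row-wise l_2 norms summed, which is what drives whole rows of Z toward zero so that images of the same domain select the same atoms of D). The assembly of Γ below assumes that the third weighted term is the stretching term of formula (4); the exact combination is given as an equation image in the original.

import numpy as np

def l21_norm(Z):
    # ||Z||_{2,1}: sum of the l2 norms of the rows of Z. Minimizing it
    # encourages row sparsity, i.e. the same dictionary atoms are selected
    # for all columns (images of the same domain).
    return np.linalg.norm(Z, axis=1).sum()

def gamma(Z_a, Z_b, Z_ta, Z_tb, alpha_3, alpha_4, alpha_5, stretch):
    # Coding-coefficient discrimination-promoting term in the spirit of (5);
    # `stretch` stands for the stretching term of formula (4).
    return (alpha_3 * (l21_norm(Z_a) + l21_norm(Z_b))
            + alpha_4 * (np.abs(Z_ta).sum() + np.abs(Z_tb).sum())
            + alpha_5 * stretch)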
Further, the overall objective function of step 6) includes:
[formula (6), given as an equation image in the original]

In the formula, M_a and M_b denote the numbers of pedestrians under the two camera views, and N_al and N_bl denote the numbers of images of the l-th pedestrian under the two camera views, respectively.
Further, the variable solving of step 7) includes:
The variables D, D_t, Z_a, Z_b, Z_ta, Z_tb to be solved in the overall objective function (6) are not jointly convex, but the function is convex with respect to each variable when all the other variables are fixed. They can therefore be optimized by an alternating iterative process; the solution for each variable is as follows:
To update the coding coefficient Z_a (the variable Z_b is updated in the same way as Z_a and is not repeated here), first assume that D, D_t, Z_b, Z_ta, Z_tb are all fixed, which gives the following objective function:

[formula (7), given as an equation image in the original]

This is a typical l_{2,1} minimization problem, and the analytic solution of Z_a can be expressed as:

Z_a = (4D^T D + α_3 Λ_1)^{-1} (4D^T X_a + 2D^T D_t Z_ta)    (8)

In the formula, Λ_1 is a diagonal matrix formed from quantities given by an equation image in the original, in which the symbol denotes the j-th column of Z_i.
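A minimal sketch of the closed-form update (8). The construction of Λ_1 from the previous iterate is the standard reweighting used for l_{2,1} problems and is an assumption here, since its exact entries appear only as an equation image in the original.

import numpy as np

def update_Z_a(D, D_t, X_a, Z_a_prev, Z_ta, alpha_3, eps=1e-8):
    # Closed-form update of formula (8):
    # Z_a = (4 D^T D + alpha_3 * Lambda_1)^{-1} (4 D^T X_a + 2 D^T D_t Z_ta)
    row_norms = np.linalg.norm(Z_a_prev, axis=1) + eps
    Lambda_1 = np.diag(1.0 / (2.0 * row_norms))     # assumed l_{2,1} reweighting
    A = 4.0 * D.T @ D + alpha_3 * Lambda_1
    B = 4.0 * D.T @ X_a + 2.0 * D.T @ D_t @ Z_ta
    return np.linalg.solve(A, B)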
Then, Z_ta is updated by fixing D, D_t, Z_a, Z_b, Z_tb (the variable Z_tb is updated in the same way as Z_ta and is not repeated here), which gives the following objective function:

[formula (9), given as an equation image in the original]

For convenience of optimization, formula (9) is rewritten in vector form:

[formula (10), given as an equation image in the original]

In the formula, the feature symbol (given as an equation image) is the visual feature of the k-th image of the l-th pedestrian under view a. To solve (10), a relaxation variable (given as an equation image) is introduced, and formula (10) can then be relaxed as:

[formula (11), given as an equation image in the original]

The variables can then be updated by solving the subproblems given as equation images in the original. The above problem can be solved by an iterative shrinkage algorithm, with the coefficient vector updated by the rule given as an equation image in the original, where h denotes the h-th iteration. Using the updated coefficient vectors, Z_ta can be reconstructed as indicated by the equation image in the original.
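A generic iterative shrinkage (ISTA) sketch for the per-image coefficient update. The data term, step size, and threshold are assumptions, since formulas (9)-(13) appear only as equation images in the original; the columns obtained this way are stacked to rebuild Z_ta, as stated above.

import numpy as np

def soft_threshold(v, tau):
    # Element-wise shrinkage operator used by iterative shrinkage algorithms.
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def ista_update(D_t, residual, z0, alpha_4, n_iter=50):
    # Assumed ISTA iteration for one coding vector z of Z_ta, minimizing
    # ||residual - D_t z||_2^2 + alpha_4 ||z||_1 (illustrative form).
    # `residual` stands for the part of the image feature not explained
    # by the domain dictionary.
    L = np.linalg.norm(D_t, 2) ** 2        # Lipschitz constant of the gradient
    z = z0.copy()
    for _ in range(n_iter):                # h-th iteration of the update rule
        grad = D_t.T @ (D_t @ z - residual)
        z = soft_threshold(z - grad / L, alpha_4 / (2 * L))
    return z

# Z_ta is rebuilt by stacking the updated vectors column by column, e.g.
# Z_ta = np.column_stack([ista_update(D_t, r_k, z_k, alpha_4) for r_k, z_k in pairs])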
After the coding coefficients Z_a and Z_ta have been updated, the dictionaries D and D_t can be updated alternately, with the following objective function:

[formula (14), given as an equation image in the original]

To update D, an intermediate variable C is introduced, and formula (14) becomes the relaxed problem given as an equation image in the original. C can then be solved from the subproblem given as an equation image in the original. This is a typical nuclear norm minimization problem that can be solved by the singular value thresholding algorithm. To update D_t, a relaxation variable H is introduced (the relaxed problem is given as an equation image in the original), and the closed-form solution for the relaxation variable H can be expressed as:

H = (α_2 D_t D_t^T + I_1)^{-1} D    (18)

where I_1 is an identity matrix. Using the updated C and H, D can be optimized by solving formula (19), given as an equation image in the original; this problem can be solved by the Lagrange dual. Finally, D_t is optimized by solving the corresponding problem, given as an equation image in the original, which is solved in the same way as the problem in formula (19).
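A sketch of the two auxiliary updates that have explicit forms: singular value thresholding for the nuclear-norm subproblem in C, and the closed-form solution (18) for the relaxation variable H. The threshold tau and the subsequent Lagrange-dual updates of D and D_t are not spelled out in the text, so they are left as assumptions or omitted.

import numpy as np

def svt(M, tau):
    # Singular value thresholding: proximal operator of the nuclear norm,
    # used to solve the C-subproblem.
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return U @ np.diag(np.maximum(s - tau, 0.0)) @ Vt

def update_H(D, D_t, alpha_2):
    # Closed-form solution of formula (18): H = (alpha_2 D_t D_t^T + I_1)^{-1} D.
    I_1 = np.eye(D_t.shape[0])
    return np.linalg.solve(alpha_2 * D_t @ D_t.T + I_1, D)

# The dictionaries D and D_t are then refreshed with the updated C and H,
# e.g. via the Lagrange dual as stated above (details given only in the original images).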
Further, the pedestrian matching scheme of step 8) includes:
During testing, with the learned dictionaries D and D_t, the separation of the domain information from the pedestrian-specific information is achieved by solving the coding problem given as an equation image in the original. In the formula, Z_a, Z_b denote the domain coding coefficient matrices under views a and b, respectively, and Z_ta, Z_tb denote the coding coefficient matrices of the pedestrian-specific information under views a and b, respectively. This problem is solved by an alternating iteration method, and the iteration stops when the convergence conditions given as equation images in the original are satisfied. Let the two coding coefficient vectors defined by the equation images in the original be the coding coefficient vectors of the two pedestrians to be compared; the similarity between pedestrians is then measured by calculating the distance given as an equation image in the original (the Euclidean distance between the two coding coefficient vectors).
in the step 3), since images from the same camera view have domain similarity, dictionaries used for representing domain components are refined by low-rank terms, and meanwhile structural incoherent regular terms are introduced to enable a domain dictionary D and a pedestrian feature dictionary D to be promoted t The two judgment promoting terms aiming at the dictionary are added, so that the dictionary has stronger judgment capability.
In the steps 4) and 5), two discrimination promoting items aiming at the coding coefficient are added, so that the coding coefficient has stronger discrimination capability, and meanwhile, the coding coefficient Z is updated ta ,Z tb In this case, a gradient descent method is used.
In the step 8), a pedestrian matching scheme is designed by adopting an Euclidean distance based on the model with only unchanged pedestrian appearance characteristics of the domain, so that adverse effects on the recognition result caused by domain deviation are avoided.
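A minimal sketch of the matching in step 8): once the test images have been coded over D_t so that only domain-invariant appearance information remains, gallery identities are ranked by Euclidean distance between coding vectors. The function name and the ranking output format are illustrative.

import numpy as np

def match_pedestrians(Z_ta_probe, Z_tb_gallery):
    # For every probe coding vector (view a), rank the gallery coding vectors
    # (view b) by Euclidean distance; the closest gallery entry comes first.
    rankings = []
    for i in range(Z_ta_probe.shape[1]):
        d = np.linalg.norm(Z_tb_gallery - Z_ta_probe[:, [i]], axis=0)
        rankings.append(np.argsort(d))
    return np.asarray(rankings)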
The invention is further illustrated below with reference to specific experimental data.
In the experiments, each dataset was randomly divided into two non-overlapping parts, one used as training samples and the other as test samples. Cumulative matching characteristic (CMC) curves are used to quantitatively evaluate the recognition performance. There are seven parameters in the model, namely the sizes d and d_t of the dictionaries D and D_t, and five scalar parameters α_1, α_2, α_3, α_4 and α_5. Throughout the experiments, these parameters were set to d = 50, d_t = 760, α_1 = 1, α_2 = 0.01, α_3 = 28, α_4 = 1 and α_5 = 5. The effect of the parameters α_1, α_2, α_3, α_4 and α_5 on the recognition performance is shown in FIGS. 3-7. Table 1 shows the performance comparison with recent results on the PRID2011 dataset, with the maximum values shown in bold.
[Table 1, given as an image in the original]
Table 1: Performance comparison with recent results on the PRID2011 dataset
The comparison shows that the recognition rate of the proposed method is the highest at every rank, exceeding the second-best methods at ranks 1, 5, 10 and 20 by 5.4%, 3.9%, 4.9% and 0.5%, respectively.
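The CMC curves used in this evaluation can be computed as in the following sketch; the input rankings are the output of the matching step, and the protocol details (e.g. single-shot versus multi-shot selection of gallery images) are assumptions.

import numpy as np

def cmc_curve(rankings, probe_ids, gallery_ids, max_rank=20):
    # rankings[i] lists gallery indices for probe i, best match first.
    # Assumes every probe identity appears in the gallery.
    gallery_ids = np.asarray(gallery_ids)
    hits = np.zeros(max_rank)
    for i, order in enumerate(rankings):
        ranked_ids = gallery_ids[order]
        first_correct = np.where(ranked_ids == probe_ids[i])[0][0]
        if first_correct < max_rank:
            hits[first_correct:] += 1      # a hit at rank r counts for all ranks >= r
    return hits / len(rankings)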
While the present invention has been described in detail with reference to the embodiments, the present invention is not limited to the embodiments, and various changes can be made without departing from the spirit of the present invention within the knowledge of those skilled in the art.

Claims (5)

1. A cross-view pedestrian re-identification method based on discriminative dictionary learning, characterized in that the method comprises the following steps:
1) determining an overall model framework of cross-view pedestrian re-identification based on discriminative dictionary learning;
2) dividing the pedestrian image features of different views into a view-specific domain information component and a domain-invariant pedestrian appearance feature component, and designing a discriminative dictionary learning algorithm to create a domain-general dictionary that describes the domain information component and a domain-invariant dictionary that describes the domain-invariant component;
3) training the discrimination-promoting term of the dictionaries;
4) forcing, through a stretching regularization term, the coding coefficients of different pedestrians to keep a certain distance apart while keeping the coding coefficients of the same pedestrian as close as possible;
5) training the discrimination-promoting term of the coding coefficients, forcing the coding coefficients of pedestrian images from the same view to have strong similarity;
6) determining the overall objective function of cross-view pedestrian re-identification based on discriminative dictionary learning;
7) solving the variables to be updated in the overall objective function;
8) designing a pedestrian matching scheme using the Euclidean distance, based on the model that retains only the domain-invariant pedestrian appearance features;
the overall model framework of the step 1) comprises the following steps:
X_a and X_b (given as an equation image in the original) denote the training sample sets under the two camera views; in this case, robust feature representation learning and discriminative metric learning need to be integrated into one framework, and the overall model framework is shown as formula (1):

[formula (1), given as an equation image in the original]

in the formula, D denotes the domain dictionary of the pedestrian images under all cameras, D_t denotes the domain-specific dictionary for coding the pedestrian appearance features after the domain information has been separated, Z_a, Z_b are the coding coefficients of the domain information of X_a and X_b over the dictionary D, Z_ta, Z_tb are the corresponding coding coefficients of the pedestrian-specific information over the dictionary D_t, Φ(D, D_t, Z_a, Z_b, Z_ta, Z_tb) is the data fidelity term, Ψ(D, D_t) is the discrimination-promoting term of the dictionaries, Γ(Z_a, Z_b, Z_ta, Z_tb) is the discrimination-promoting term of the coding coefficients, and the remaining symbols denote the i-th row of D and the j-th column of D_t, respectively (given as equation images in the original);
the discriminant dictionary algorithm in the step 2) comprises the following steps:
the data fidelity term Φ(D, D_t, Z_a, Z_b, Z_ta, Z_tb) is expressed as:

[formula (2), given as an equation image in the original]

in the formula, the first group of terms (given as an equation image) establishes the domain information of the a and b camera views, and the second group of terms (given as an equation image) separates the domain information from the pedestrian appearance features that are unaffected by the domain;
the dictionary discrimination-promoting term of step 3) comprises:
the proposed dictionary discrimination-promoting term is:

[formula (3), given as an equation image in the original]

in the formula, ||D||_* is the nuclear norm of the dictionary D, the structural incoherence regularization term (given as an equation image) promotes mutual independence between the domain dictionary D and the pedestrian feature dictionary D_t, and α_1 and α_2 are two scalar parameters that weight the ||D||_* term and the structural incoherence term, respectively;
the stretching regularization term of step 4) comprises:
the following function is proposed for view a, and a similar function is constructed for view b in the same way and is not repeated here:

[formula (4), given as an equation image in the original]

in the formula, {z}_+ = max{z, 0} and c is an arbitrary constant; the three image symbols (given as equation images in the original) denote, respectively, the k-th image of the l-th pedestrian under camera view a, the k*-th image of the same l-th pedestrian under view b whose coding coefficient is the most dissimilar to that of the k-th image (k* ≠ k), and the k*-th image of a different pedestrian l* under view b that is the most similar to the k-th image of the l-th pedestrian (l* ≠ l); the distance between the coding coefficients of the first pair (same pedestrian) does not cause misjudgment of the pedestrian identity, whereas the distance between the coding coefficients of the second pair (different pedestrians) means that pedestrian matching based on these coding coefficients would cause misrecognition; in this case, minimizing formula (4) promotes a margin between these two distances.
2. The cross-view pedestrian re-identification method based on discriminative dictionary learning according to claim 1, characterized in that the coding coefficient discrimination-promoting term of step 5) comprises:
Γ(Z_a, Z_b, Z_ta, Z_tb) in the overall model framework (1) is defined as:

[formula (5), given as an equation image in the original]

in the formula, ||Z||_{2,1} denotes the l_{2,1} norm (its definition is given as an equation image in the original), and α_3, α_4, α_5 are three scalar parameters that weight the ||Z_a||_{2,1} + ||Z_b||_{2,1} term, the ||Z_ta||_1 + ||Z_tb||_1 term, and the remaining term (given as an equation image), respectively.
3. The cross-view pedestrian re-identification method based on discriminative dictionary learning as claimed in claim 2, wherein the overall objective function of step 6) comprises:
[formula (6), given as an equation image in the original]

in the formula, M_a and M_b denote the numbers of pedestrians under the two camera views, and N_al and N_bl denote the numbers of images of the l-th pedestrian under the two camera views, respectively.
4. The cross-view pedestrian re-identification method based on discriminative dictionary learning according to claim 3, characterized in that the variable solving of step 7) comprises the following steps:
for the variables D, D_t, Z_a, Z_b, Z_ta, Z_tb to be solved in the overall objective function (6), the function is not jointly convex, but it is convex with respect to each variable when all the other variables are fixed, so the variables are optimized by an alternating iterative process, the solution for each variable being as follows:
to update the coding coefficient Z_a (the variable Z_b is updated in the same way as Z_a and is not repeated here), first assume that D, D_t, Z_b, Z_ta, Z_tb are all fixed, giving the following objective function:

[formula (7), given as an equation image in the original]

this is a typical l_{2,1} minimization problem, and the analytic solution of Z_a can be expressed as:

Z_a = (4D^T D + α_3 Λ_1)^{-1} (4D^T X_a + 2D^T D_t Z_ta)    (8)

in the formula, Λ_1 is a diagonal matrix formed from quantities given by an equation image in the original, in which the symbol denotes the j-th column of Z_i;
then, Z_ta is updated by fixing D, D_t, Z_a, Z_b, Z_tb (the variable Z_tb is updated in the same way as Z_ta and is not repeated here), giving the following objective function:

[formula (9), given as an equation image in the original]

for convenience of optimization, formula (9) is rewritten in vector form:

[formula (10), given as an equation image in the original]

in the formula, the feature symbol (given as an equation image) is the visual feature of the k-th image of the l-th pedestrian under view a; to solve (10), a relaxation variable (given as an equation image) is introduced, and formula (10) can then be relaxed as:

[formula (11), given as an equation image in the original]

the variables can be updated by solving the subproblems given as equation images in the original; the above problem can be solved by an iterative shrinkage algorithm, with the coefficient vector updated by the rule given as an equation image in the original, wherein h denotes the h-th iteration; using the updated coefficient vectors, Z_ta is reconstructed as indicated by the equation image in the original;
after the coding coefficients Z_a and Z_ta have been updated, the dictionaries D and D_t can be updated alternately, with the following objective function:

[formula (14), given as an equation image in the original]

to update D, an intermediate variable C is introduced, and formula (14) becomes the relaxed problem given as an equation image in the original; C can then be solved from the subproblem given as an equation image in the original, which is a typical nuclear norm minimization problem that can be solved by the singular value thresholding algorithm; to update D_t, a relaxation variable H is introduced (the relaxed problem is given as an equation image in the original), and the closed-form solution for the relaxation variable H can be expressed as:

H = (α_2 D_t D_t^T + I_1)^{-1} D    (18)

wherein I_1 is an identity matrix; using the updated C and H, D can be optimized by solving formula (19), given as an equation image in the original, which can be solved by the Lagrange dual; finally, D_t is optimized by solving the corresponding problem, given as an equation image in the original, which is solved in the same way as the problem in formula (19).
5. The cross-view pedestrian re-identification method based on discriminative dictionary learning according to claim 4, characterized in that the pedestrian matching scheme of step 8) comprises the following steps:
during testing, with the learned dictionaries D and D_t, the separation of the domain information from the pedestrian-specific information is achieved by solving the coding problem given as an equation image in the original; in the formula, Z_a, Z_b denote the domain coding coefficient matrices under views a and b, respectively, and Z_ta, Z_tb denote the coding coefficient matrices of the pedestrian-specific information under views a and b, respectively; the problem is solved by the alternating iteration method, and the iteration stops when the convergence conditions given as equation images in the original are satisfied; letting the two coding coefficient vectors defined by the equation images in the original be the coding coefficient vectors of the two pedestrians to be compared, the similarity between pedestrians is measured by calculating the distance given as an equation image in the original (the Euclidean distance between the two coding coefficient vectors).
CN201910966029.8A 2019-10-12 2019-10-12 Cross-view pedestrian re-identification method based on discriminant dictionary learning Active CN110826417B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910966029.8A CN110826417B (en) 2019-10-12 2019-10-12 Cross-view pedestrian re-identification method based on discriminant dictionary learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910966029.8A CN110826417B (en) 2019-10-12 2019-10-12 Cross-view pedestrian re-identification method based on discriminant dictionary learning

Publications (2)

Publication Number Publication Date
CN110826417A CN110826417A (en) 2020-02-21
CN110826417B true CN110826417B (en) 2022-08-16

Family

ID=69548968

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910966029.8A Active CN110826417B (en) 2019-10-12 2019-10-12 Cross-view pedestrian re-identification method based on discriminant dictionary learning

Country Status (1)

Country Link
CN (1) CN110826417B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111783521B (en) * 2020-05-19 2022-06-07 昆明理工大学 Pedestrian re-identification method based on low-rank prior guidance and based on domain invariant information separation
CN111783526B (en) * 2020-05-21 2022-08-05 昆明理工大学 Cross-domain pedestrian re-identification method using posture invariance and graph structure alignment
CN113554569B (en) * 2021-08-04 2022-03-08 哈尔滨工业大学 Face image restoration system based on double memory dictionaries

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001202516A (en) * 2000-01-19 2001-07-27 Victor Co Of Japan Ltd Device for individual identification
CN103729462A (en) * 2014-01-13 2014-04-16 武汉大学 Pedestrian search method for processing shield on the basis of sparse representation
CN104298992A (en) * 2014-10-14 2015-01-21 武汉大学 Self-adaptive scale pedestrian re-identification method based on data driving
CN104778446A (en) * 2015-03-19 2015-07-15 南京邮电大学 Method for constructing image quality evaluation and face recognition efficiency relation model
CN107194378A (en) * 2017-06-28 2017-09-22 深圳大学 A kind of face identification method and device based on mixing dictionary learning
CN107679461A (en) * 2017-09-12 2018-02-09 国家新闻出版广电总局广播科学研究院 Pedestrian's recognition methods again based on antithesis integration analysis dictionary learning
CN108509925A (en) * 2018-04-08 2018-09-07 东北大学 A kind of pedestrian's recognition methods again of view-based access control model bag of words
CN109214442A (en) * 2018-08-24 2019-01-15 昆明理工大学 A kind of pedestrian's weight recognizer constrained based on list and identity coherence
CN109284668A (en) * 2018-07-27 2019-01-29 昆明理工大学 A kind of pedestrian's weight recognizer based on apart from regularization projection and dictionary learning
CN109409201A (en) * 2018-09-05 2019-03-01 昆明理工大学 A kind of pedestrian's recognition methods again based on shared and peculiar dictionary to combination learning
CN109447123A (en) * 2018-09-28 2019-03-08 昆明理工大学 A kind of pedestrian's recognition methods again constrained based on tag compliance with stretching regularization dictionary learning
CN109766748A (en) * 2018-11-27 2019-05-17 昆明理工大学 A kind of pedestrian based on projective transformation and dictionary learning knows method for distinguishing again

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7796121B2 (en) * 2005-04-28 2010-09-14 Research In Motion Limited Handheld electronic device with reduced keyboard and associated method of providing improved disambiguation with reduced degradation of device performance

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001202516A (en) * 2000-01-19 2001-07-27 Victor Co Of Japan Ltd Device for individual identification
CN103729462A (en) * 2014-01-13 2014-04-16 武汉大学 Pedestrian search method for processing shield on the basis of sparse representation
CN104298992A (en) * 2014-10-14 2015-01-21 武汉大学 Self-adaptive scale pedestrian re-identification method based on data driving
CN104778446A (en) * 2015-03-19 2015-07-15 南京邮电大学 Method for constructing image quality evaluation and face recognition efficiency relation model
CN107194378A (en) * 2017-06-28 2017-09-22 深圳大学 A kind of face identification method and device based on mixing dictionary learning
CN107679461A (en) * 2017-09-12 2018-02-09 国家新闻出版广电总局广播科学研究院 Pedestrian's recognition methods again based on antithesis integration analysis dictionary learning
CN108509925A (en) * 2018-04-08 2018-09-07 东北大学 A kind of pedestrian's recognition methods again of view-based access control model bag of words
CN109284668A (en) * 2018-07-27 2019-01-29 昆明理工大学 A kind of pedestrian's weight recognizer based on apart from regularization projection and dictionary learning
CN109214442A (en) * 2018-08-24 2019-01-15 昆明理工大学 A kind of pedestrian's weight recognizer constrained based on list and identity coherence
CN109409201A (en) * 2018-09-05 2019-03-01 昆明理工大学 A kind of pedestrian's recognition methods again based on shared and peculiar dictionary to combination learning
CN109447123A (en) * 2018-09-28 2019-03-08 昆明理工大学 A kind of pedestrian's recognition methods again constrained based on tag compliance with stretching regularization dictionary learning
CN109766748A (en) * 2018-11-27 2019-05-17 昆明理工大学 A kind of pedestrian based on projective transformation and dictionary learning knows method for distinguishing again

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
A novel dictionary learning approach for multi-modality medical image fusion; Zhu Z; Neurocomputing; 2016-12-31; pp. 471-482 *
Pedestrian re-identification method based on dictionary learning and Fisher discriminant sparse representation; Zhang Jianwei et al.; Journal of South China University of Technology (Natural Science Edition); 2017-07-15 (No. 07); pp. 55-62 *
Gait recognition based on kernel collaborative representation; Li Zhanli et al.; Journal of Guangxi University (Natural Science Edition); 2017-04-25 (No. 02); pp. 705-711 *
Pedestrian re-identification fusing low-level and mid-level dictionary features; Wang Li; Chinese Optics; 2016-10-15 (No. 05); pp. 540-546 *

Also Published As

Publication number Publication date
CN110826417A (en) 2020-02-21

Similar Documents

Publication Publication Date Title
US10153001B2 (en) Video skimming methods and systems
Zhu et al. Multi-view deep subspace clustering networks
CN110826417B (en) Cross-view pedestrian re-identification method based on discriminant dictionary learning
Yang et al. Super normal vector for activity recognition using depth sequences
CN105590091B (en) Face recognition method and system
Lee et al. Collaborative expression representation using peak expression and intra class variation face images for practical subject-independent emotion recognition in videos
Deng et al. Equidistant prototypes embedding for single sample based face recognition with generic learning and incremental learning
US9697614B2 (en) Method for segmenting and tracking content in videos using low-dimensional subspaces and sparse vectors
Qin et al. Compressive sequential learning for action similarity labeling
CN110889375B (en) Hidden-double-flow cooperative learning network and method for behavior recognition
Xu et al. Dynamic texture reconstruction from sparse codes for unusual event detection in crowded scenes
CN109409201B (en) Pedestrian re-recognition method based on shared and special dictionary pair joint learning
CN111783521B (en) Pedestrian re-identification method based on low-rank prior guidance and based on domain invariant information separation
CN108389189B (en) Three-dimensional image quality evaluation method based on dictionary learning
Chen et al. 3D object tracking via image sets and depth-based occlusion detection
Cao et al. Robust depth-based object tracking from a moving binocular camera
Shao et al. Action recognition using correlogram of body poses and spectral regression
Paul et al. A conditional random field approach for audio-visual people diarization
Alavi et al. Multi-shot person re-identification via relational stein divergence
Zhang et al. Kernel dictionary learning based discriminant analysis
Bak et al. Brownian descriptor: A rich meta-feature for appearance matching
Zhu et al. Correspondence-free dictionary learning for cross-view action recognition
Torpey et al. Human action recognition using local two-stream convolution neural network features and support vector machines
Guha et al. A sparse reconstruction based algorithm for image and video classification
Al Ghamdi et al. Alignment of nearly-repetitive contents in a video stream with manifold embedding

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant