CN110046669B

CN110046669B - Pedestrian retrieval method based on sketch image half-coupling metric identification dictionary learning

Info

Publication number: CN110046669B
Application number: CN201910325189.4A
Authority: CN
Inventors: 荆晓远; 马飞; 黄鹤; 姚永芳; 訾璐
Original assignee: Guangdong University of Petrochemical Technology
Current assignee: Guangdong University of Petrochemical Technology
Priority date: 2019-04-22
Filing date: 2019-04-22
Publication date: 2019-12-24
Anticipated expiration: 2039-04-22
Also published as: CN110046669A

Abstract

The invention belongs to the technical field of traffic pedestrian image processing, and discloses a pedestrian retrieval method and a system for learning a semi-coupling metric identification dictionary based on a sketch image, wherein a heterogeneous pedestrian retrieval database is established, then characteristic extraction is carried out, characteristics are extracted from an image, and different people are marked by corresponding colors; processing the extracted sample characteristics, introducing a dictionary learning method, and learning a dictionary pair of heterogeneous data; learning a mapping matrix from the sketch image set and the regular image set; and introducing a learning metric of a discrimination algorithm. The invention has the advantages of solving the problem of lack of heterogeneous pedestrian data sets in the field of heterogeneous pedestrian retrieval and providing half-coupling metric discriminative dictionary learning (SMD) for the first time²L) technique. The technique can learn the half-coupling mapping matrix from the heterogeneous sample and dictionary pairs, reducing the difference between heterogeneous data to some extent. The ideal searching effect is achieved on the new SINPID data set.

Description

Pedestrian retrieval method based on sketch image half-coupling metric identification dictionary learning

Technical Field

The invention belongs to the technical field of traffic pedestrian image processing, and particularly relates to a pedestrian retrieval method based on sketch image semi-coupling metric identification dictionary learning.

Background

Currently, the closest prior art:

heterogeneous pedestrian retrieval between sketch images and ordinary images plays an important role in public safety and criminal investigation, and the purpose of Heterogeneous Pedestrian Retrieval (HPR) is to retrieve images of the same person from heterogeneous image sets for identification. Although pedestrian retrieval plays an important role in public safety and criminal investigation, research is not yet abundant, and no data set for pedestrian retrieval problem (SINPR) between a sketch image and a general image exists in the field of pedestrian recognition so far. It is therefore necessary to acquire pedestrian data Sets (SINPID) of a sketch image and a normal image.

Most of the existing pedestrian re-identification problems are mainly focused on the matching problem in a normal scene, and the problem is solved to a certain extent.

However, directly applying existing pedestrian re-identification methods to SINPR would limit their performance. The half-coupling matrix is an effective technology suitable for different data source applications, and can link the relationship between different data sources. Dictionary learning is an effective computer vision application and object representation technique. Meanwhile, the invention can learn a projection matrix to reduce the difference between heterogeneous samples and learn dictionary pairs for different data sources.

In summary, the problems of the prior art are as follows:

(1) directly applying existing pedestrian re-identification methods to SINPR would limit their performance. The existing pedestrian reconstruction method cannot be directly used because the previous method is based on a heterogeneous pedestrian retrieval method, and the purpose of the previous method is to retrieve images of the same person from a heterogeneous image set for recognition. And pedestrian retrieval between the sketch image and the ordinary image, which cannot be analyzed.

(2) In the prior art, a half-coupling mapping strategy is not adopted, the relationship between a sketch image and a common photo is connected, and the difference between heterogeneous samples is reduced.

(3) No combination of learning metric matrices can reveal intrinsic projections of heterogeneous data.

(4) For sketch images and common photos in complex scenes, the dictionary pair in the prior art does not have good applicability.

(5) The prior art does not combine discrimination constraint to make the same category compact and different categories separate, which is not beneficial to retrieval and classification.

To date, there is no data set for the pedestrian retrieval problem (SINPR) between a sketch image and a normal image in the field of pedestrian recognition. It is therefore necessary to acquire pedestrian data Sets (SINPID) of a sketch image and a normal image.

The difficulty of solving the technical problems is as follows:

according to the method, a data set of the sketch image and the common photo needs to be established, and in the prior art, a half-coupling mapping strategy is not adopted, so that the relationship between the sketch image and the common photo is connected, and the difference between heterogeneous samples is reduced.

A better dictionary pair needs to be established to satisfy the relationship between the sketch picture and the common picture in the complex scene.

For intrinsic projection of heterogeneous data, a learning metric matrix needs to be incorporated.

In order to make the retrieval and classification more effective, it is necessary to make the same category compact and separate different categories.

The significance of solving the technical problems is as follows:

in the technical field of pedestrian traffic image processing, heterogeneous pedestrian retrieval between a sketch image and a common image plays an important role in public safety and criminal investigation, and the purpose of Heterogeneous Pedestrian Retrieval (HPR) is to retrieve images of the same person from heterogeneous image sets for identification. Although pedestrian retrieval plays an important role in public safety and criminal investigation, research is not yet abundant, and no data set for pedestrian retrieval problem (SINPR) between a sketch image and a general image exists in the field of pedestrian recognition so far. It is therefore necessary to acquire pedestrian data Sets (SINPID) of a sketch image and a normal image.

Disclosure of Invention

Aiming at the problems in the prior art, the invention provides a pedestrian retrieval method based on sketch image semi-coupling metric identification dictionary learning. Currently, there is no data set for the pedestrian retrieval problem (SINPR) between a sketch image and a normal image in the field of pedestrian recognition. It is therefore necessary to acquire pedestrian data Sets (SINPID) of a sketch image and a normal image. The half-coupling matrix is an effective technology suitable for different data source applications, and can link the relationship between different data sources. Dictionary learning is an effective computer vision application and object representation technique. Meanwhile, the invention can learn a projection matrix to reduce the difference between heterogeneous samples and learn dictionary pairs for different data sources.

The invention is realized in such a way that a pedestrian retrieval method (SMD) based on the learning of the half-coupling metric identification dictionary of sketch images²L), comprising the following steps:

step one, shooting a photo by using a camera and establishing a database of the photo;

step two, feature extraction;

step three, learning a heterogeneous dictionary pair D by utilizing a dictionary learning technology_SAnd D_N；

Step four, learning a projection matrix P from the sketch image set and the common image set;

fifthly, learning a metric matrix W by using the idea of discriminant learning;

solving parameters;

and seventhly, re-identifying the pedestrians.

Further, in the first step, a camera is used for taking pictures and establishing a database of the camera, and the specific method is as follows:

data was first collected on a campus using 2 cameras to take pictures of a real scene, for a total of 400 pictures of 200 pedestrians. Then the data of one camera is used as a common image set, and the data of the other camera is processed to form a sketch image set. Therefore, the pedestrian data Set (SINPID) collected by the invention is composed of 2 parts, namely 1) a common image set; 2) a set of sketch images. Half of samples of sketch images and common images are randomly selected as training sets, and the rest samples are used as testing sets. And the invention performs segmentation processing on the image set, the common image is 4160 × 3120pixels, the invention manually segments each person from the original photograph, and finally each pedestrian image is 560 × 230 pixels. Wherein the common image set is a common RGB image shot by a camera, and the common image set is used as a gallery data set; the sketch image set is a water powder sketch image generated through computer software and manual assistance instead of generating sketch style images through artists and witnesses, and the group of sketch image sets are used as probe sets. Where the software that generates the Sketch set may use Sketch Guru, the type of Sketch image may be selected by the present invention. Because the witness cannot provide suspicious court information, but the witness can give a rough and comprehensive description, the gouache sketch image ignores some details of the person to some extent but the color and layout of the clothing is the same, i.e. there is little change in the colors of the coat and pants, which helps to identify a person from the gallery dataset. The SINPID dataset is available from https:// sites.

Further, in the second step, feature extraction, the invention extracts two types of features from the collected pedestrian data Set (SINPID) for evaluation, wherein the two types of features include a LOMO patch feature and a PCB depth feature. Because the LOMO can extract HSV color features and SILTP texture features from the image represented by the three-scale pyramid, the features are extracted by the patch, and have certain robustness to the change of different viewpoints; and the PCB may extract a deep convolution descriptor consisting of a number of part-level features that may take the entire image as input and obtain a feature vector for each image.

Further, in the third step, the dictionary learning technology is utilized to learn the heterogeneous dictionary pair D_SAnd D_N. The dictionary learning technique is introduced to obtain good data representation of images of different styles, so that dictionary pairs are learned separately for heterogeneous data. Let X be ═ X₁，x₂...x_N]And Y ═ Y₁，y₂...y_N]Respectively a sketch image set and a normal image set. X is formed by R^d*N,Y∈R^d* ^ND is the dimension of the image feature and N represents the total number of sample images. D_SAnd D_NRespectively represent elementDescribing dictionary pairs of images and common images, therefore learning an objective function of a heterogeneous dictionary pair using dictionary learning techniques can be defined as:

wherein the dictionary pair D_SAnd D_NRespectively as follows: d_S∈R^d*m，D_N∈R^d*mAnd m represents the number of elements in the dictionary pair. A ═ a₁,a₂...a_N],B＝[b₁,b₂...b_N]A denotes X in D_SB represents Y in D_NThe coding coefficient matrix of (2).

Further, in step four, the projection matrix P is learned from the sketch image and the normal image set. The method aims to establish the relationship between a sketch image set and a common image set and search a half-coupling projection matrix to reduce the difference between the sketch image set and the common image set to a certain extent. Suppose P ∈ R^d*dIs a half-coupled projection matrix. By minimizing the distance between the a and B coding coefficient matrices, a half-coupled projection matrix is obtained, thereby reducing the difference between samples. The half-coupling mapping matrix can thus be calculated as follows:

further, in step five, the metric matrix W is learned using the concept of the discriminative learning. The method aims to improve good feature representation capability and can make samples of the same class compact and samples of different classes separate by adding discrimination constraint terms. Metric learning and dictionary learning using discriminative ideas can be computed as follows:

where S denotes that the (i, j) elements belong to the same class, and D denotes that the (i, j) elements belong to different classes.

M＝W^TW W∈R^d*d (5)

Furthermore, to prevent overfitting, the present invention regularizes the following: a parameter projection matrix P, coefficients A, B and a discrimination constraint term W. The regularization term can be expressed as:

combine the above equations (1), (2), (3), (6) to produce the SMD²The objective function of L can be rewritten as:

where λ is the regularization parameter balance factor.

Further, in step six, the parameters are solved. Equation (7) for D_S,D_NNeither P, W is a joint convex function, but if the other variables are fixed, it is a convex function solution for each variable. Therefore, the invention divides the formula (7) into 4 subproblem solutions, namely dictionary pair updating, representing coefficient updating and updating of the half-coupling projection matrix. The method specifically comprises the following steps:

1) first the invention fixes other parameters to update a and B. Updating a preserves the term where only a exists, and equation (7) can be rewritten as:

to solve equation (8), the present invention mayBy mixing alpha_iIs set to 0 to solve. Alpha is alpha_iCan be used for

Expressed as follows:

α_i＝(D_S ^TD_S+P^TP+λI+(1-β)W^TW)^-1

b is similar to the solution of a in that,

b_i＝(D_N ^TD_N+(λ-1)I+(1-β)W^TW)^-1

2) update D_SAnd D_NUpdate D_SEquation (7) can be rewritten as follows:

to solve equation (11), D_SCan be obtained by the formula D_S＝XA^T(AA^T+∧))^-1(12) (ii) a Λ is a diagonal matrix, similar to equation (11), updating D_NEquation (7) can be rewritten as follows:

3) updating P, fixing other parameters, the invention can rewrite the formula (7) as:

solving equation (14), which can be solved by setting the derivative of P to 0, P is solved as follows:

P＝BA^T(AA^T+λI))^-1 (15)；

4) finally, W is updated, and other parameters are fixed, and equation (7) can be rewritten as follows:

the present invention may update W by a gradient descent algorithm,

where t denotes the number of iterations of the algorithm as t.

Further, in step seven, the pedestrian is re-identified. And randomly selecting half of sketch images and common images as training sets, and the rest of sketch images and common images as test sets. And the sketch image is used as a probe set, and the ordinary image is used as a gallery set. And giving a sketch image from the probe to search pedestrians. Inputting a sketch image set and a common image set, and solving parameters D according to the formulas (12), (15) and (18)_S,D_NP, W. Suppose F is a feature of the sketch image set, G is a feature of the general photo set,

the invention performs pedestrian retrieval in the following manner:

1) firstly, the formula (9) is used for solving the pixel image set dictionary D_SSolving for D using the learned projection matrix P_SThe corresponding coefficient matrix f is calculated as follows:

2) solving a common image set dictionary D using equation (10)_NThen solve for D_NThe corresponding coefficient matrix g is calculated as follows:

3) the coefficient matrix f corresponding to the sketch image set and the coefficient matrix g corresponding to the common image set are solved by the formula (19) and the formula (20), the sketch image retrieval method can retrieve the sketch image corresponding to the common image set, and can obtain the sketch image by calculating the distance between the two images, and the calculation method is as follows:

and (3) calculating a formula (21), solving the corresponding distance, and sequencing the distances, wherein the common picture with the minimum distance is the picture searched by using the sketch image.

Another object of the present invention is to provide a pedestrian retrieval control system based on half-coupling metric discriminative dictionary learning of sketch images.

The invention also aims to provide a traffic road pedestrian image retrieval terminal of the pedestrian retrieval method based on the half-coupling metric identification dictionary learning of the sketch image.

In summary, the advantages and positive effects of the invention are:

the invention solves the problem of lack of heterogeneous pedestrian data sets in the field of heterogeneous pedestrian retrieval and provides half-coupling metric discriminative dictionary learning (SMD) for the first time²L) technique. The technique can learn the half-coupling mapping matrix from the heterogeneous sample and dictionary pairs, reducing the difference between heterogeneous data to some extent. The ideal searching effect is achieved on the new SINPID data set.

In order to verify whether the algorithm has good superiority or not, a pedestrian retrieval algorithm based on sketch image half-coupling metric identification dictionary learning and 6 comparison algorithms of KISSME, XQDA, TDL and SLD are used²L, JDML were compared to a PCB. These 6 comparison algorithms include metric-based, dictionary-based learning, deep learning-based, pedestrian re-recognition-based algorithms. And finally, verifying the new data set SINPID as experimental data.

The evaluation index of retrieval and identification is accumulated matching characteristic curve CMC, CMC curve is a matching probability of top-k, each sample in the common image set is sequentially calculated to be a distance from the probe image set, then the samples are sequenced, rank is the value of top-k selected by the invention, the value of CMC matching rate is about 1, if the number of times of testing is more, the identification accuracy rate is better.

In order to verify the method SMD of the invention²L performance, experiments were performed on the new SINPID dataset, with the experimental results shown in table 1 and fig. 2.

TABLE 1 Top-r match ratio on SINPID dataset

Fig. 2 (a) shows the results of the experiment using the LOMO characteristics, and (b) shows the results of the experiment using the PCB characteristics. As can be seen from the experimental results in table 1: the algorithm of the present invention achieves a higher match rate in the comparison algorithm, e.g., SMD²Compared with a comparison algorithm XQDA with L having LOMO characteristics in a SINPID data set, the matching rate of Rank-1 is improved by 2.1% (-36.2% -34.1%). Meanwhile, the invention also utilizes the depth characteristics for evaluation, and shows that some results are lower than those of the LOMO characteristic method. The reason for this may be three-fold: 1) the SINPID data set had only 400 images of 200 people, which resulted in the PCB not having sufficient sample training. 2) Watercolor sketch images contain a smaller amount of information than ordinary photographs of another camera. 3) The network architecture is not suitable for heterogeneous pedestrian samples, but for two normal samples.

Through the above experiments, it can be seen that: method SMD of the invention²Most of the matching rate of L is better than other methods with LOMO and PCB depth features. There are mainly 3 aspects: 1) by adopting a half-coupling mapping strategy, the relationship between the sketch image and the common photo can be connected, and the difference between heterogeneous samples is reduced. 2) The learning metric matrix may reveal intrinsic projections of heterogeneous data. 3) The dictionary pair has better applicability to sketch images and common photos in complex scenes. 4) The discrimination constraint can make the same category compact and different categories separate, which is beneficial to retrieval and classificationAnd (5) performing tasks. From the above analysis, it can be seen that the new SINPID dataset is stable for different types of features, suitable for further evaluation of pedestrian retrieval, indicating SMD²The superiority of the L algorithm.

Drawings

Fig. 1 is a flowchart of a pedestrian retrieval method based on sketch image half-coupling metric identification dictionary learning according to an embodiment of the present invention.

Fig. 2 is a graph of the performance of various methods and features provided by embodiments of the present invention on a new SINPID data set.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

In the prior art, a half-coupling mapping strategy is not adopted, the relationship between a sketch image and a common photo is connected, and the difference between heterogeneous samples is reduced. No combination of learning metric matrices can reveal intrinsic projections of heterogeneous data. For sketch images and common photos in complex scenes, the dictionary pair in the prior art does not have good applicability. The prior art does not combine discrimination constraint to make the same category compact and different categories separate, which is not beneficial to retrieval and classification.

In order to solve the above technical problems, the present invention will be described in detail with reference to specific embodiments.

As shown in fig. 1, a pedestrian retrieval method based on sketch image half-coupling metric identification dictionary learning according to an embodiment of the present invention includes:

step one, taking a picture by using a camera and establishing a database of the camera: data was first collected on a campus using 2 cameras to take pictures of a real scene, for a total of 400 pictures of 200 pedestrians. Then the data of one camera is used as a common image set, and the data of the other camera is processed to form a sketch image set. Therefore, the pedestrian data Set (SINPID) collected by the invention is composed of 2 parts, namely 1) a common image set; 2) a set of sketch images. Half of samples of sketch images and common images are randomly selected as training sets, and the rest samples are used as testing sets. And the invention performs segmentation processing on the image set, the common image is 4160 × 3120pixels, the invention manually segments each person from the original photograph, and finally each pedestrian image is 560 × 230 pixels. Wherein the common image set is a common RGB image shot by a camera, and the common image set is used as a gallery data set; the sketch image set is a water powder sketch image generated through computer software and manual assistance instead of generating sketch style images through artists and witnesses, and the group of sketch image sets are used as probe sets. Where the software that generates the Sketch set may use Sketch Guru, the type of Sketch image may be selected by the present invention. Because the witness cannot provide suspicious court information, but the witness can give a rough and comprehensive description, the gouache sketch image ignores some details of the person to some extent but the color and layout of the clothing is the same, i.e. there is little change in the colors of the coat and pants, which helps to identify a person from the gallery dataset. The SINPID dataset is available from https:// sites.

And step two, feature extraction, namely extracting two types of features from the collected pedestrian data Set (SINPID) for evaluation, wherein the features comprise LOMO patch features and PCB depth features. Because the LOMO can extract HSV color features and SILTP texture features from the image represented by the three-scale pyramid, the features are extracted by the patch, and have certain robustness to the change of different viewpoints; and the PCB may extract a deep convolution descriptor consisting of a number of part-level features that may take the entire image as input and obtain a feature vector for each image.

Step three, learning a heterogeneous dictionary pair D by utilizing a dictionary learning technology_SAnd D_N. The dictionary learning technique is introduced to obtain good data representation of images of different styles, so that dictionary pairs are learned separately for heterogeneous data. Let X be ═ X₁，x₂...x_N]And Y ═ Y₁，y₂...y_N]Respectively a sketch image set and a normal image set. X is formed by R^d*N,Y∈R^d*ND is the dimension of the image feature and N represents the total number of sample images. D_SAnd D_NDictionary pairs representing a sketch image and a normal image, respectively, so learning the objective function of a heterogeneous dictionary pair using dictionary learning techniques can be defined as:

wherein the dictionary pair D_SAnd D_NRespectively as follows: d_S∈R^d*m,D_N∈R^d*mAnd m represents the number of elements in the dictionary pair. A ═ a₁,a₂...a_N],B＝[b₁,b₂...b_N]A denotes X in D_SB represents Y in D_NThe coding coefficient matrix of (2).

And step four, learning a projection matrix P from the sketch image and the common image set. The method aims to establish the relationship between a sketch image set and a common image set, find a half-coupling projection matrix,

reducing the difference between them to some extent. Suppose P ∈ R^d*dIs a half-coupled projection matrix. By minimizing the distance between the A and B coding coefficient matrixes, a half-coupling projection matrix is obtained, thereby reducing the number of samples

The difference between them. The half-coupling mapping matrix can thus be calculated as follows:

and step five, learning a metric matrix W by using the idea of the discriminant learning. The method aims to improve good feature representation capability and can make samples of the same class compact and samples of different classes separate by adding discrimination constraint terms. Metric learning and dictionary learning using discriminative ideas can be computed as follows:

M＝W^TW W∈R^d*d (5)；

where λ is the regularization parameter balance factor.

In step six, the parameters are solved. Equation (7) for D_S,D_NNeither P, W is a joint convex function, but if the other variables are fixed, it is a convex function solution for each variable. Therefore, the invention divides the formula (7) into 4 subproblem solutions, namely dictionary pair updating, representing coefficient updating and updating of the half-coupling projection matrix. The method specifically comprises the following steps:

to solve equation (8), the present invention can be implemented by converting α_iIs set to 0 to solve. Alpha is alpha_iCan be expressed as follows:

α_i＝(D_S ^TD_S+P^TP+λI+(1-β)W^TW)^-1

b is similar to the solution of a in that,

b_i＝(D_N ^TD_N+(λ-1)I+(1-β)W^TW)^-1

2) update D_SAnd D_NUpdate D_SEquation (7) can be rewritten as follows:

to solve equation (11), D_SCan be obtained by the formula D_S＝XA^T(AA^T+∧))^-1(12) (ii) a Λ is a diagonal matrix, similar to equation (11), updating D_NEquation (7) can be rewritten as

The following were used:

P＝BA^T(AA^T+λI))^-1 (15)；

the present invention may update W by a gradient descent algorithm,

where t denotes the number of iterations of the algorithm as t.

the invention performs pedestrian retrieval in the following manner:

The invention is further described below in connection with experiments.

To verify the bookMethod of the invention SMD²L performance, experiments were performed on the new SINPID dataset, with the experimental results shown in table 1 and fig. 2.

TABLE 1 Top-r match ratio on SINPID dataset

Through the above experiments, it can be seen that: method SMD of the invention²Most of the matching rate of L is better than other methods with LOMO and PCB depth features. There are mainly 3 aspects: 1) by adopting a half-coupling mapping strategy, the relationship between the sketch image and the common photo can be connected, and the difference between heterogeneous samples is reduced. 2) The learning metric matrix may reveal intrinsic projections of heterogeneous data. 3) The dictionary pair has better applicability to sketch images and common photos in complex scenes. 4) Discrimination constraints enable the same category to be compact and different categories to be separated, and retrieval and classification tasks are facilitated. From the above analysis, it can be seen that the new SINPID dataset is stable for different types of features, suitable for further evaluation of pedestrian retrieval, indicating SMD²L is calculatedThe superiority of the method.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims

1. A pedestrian retrieval method based on sketch image semi-coupling metric identification dictionary learning is characterized by comprising the following steps:

establishing a heterogeneous pedestrian retrieval database SINPID comprising a sketch image set and a regular image set; extracting features, namely extracting the features from the images and marking the images with different corresponding colors;

carrying out heterogeneous data processing on the extracted sample characteristics; a dictionary learning method is introduced to learn a dictionary pair of heterogeneous data, and a mapping matrix is learned from a sketch image set and a regular image set;

and introducing an identification algorithm learning metric to carry out pedestrian retrieval based on the sketch image semi-coupling metric identification dictionary learning;

the pedestrian retrieval calculation method based on the sketch image semi-coupling metric identification dictionary learning comprises the following steps:

step two, feature extraction;

fifthly, learning a metric matrix W by using the idea of discriminant learning;

solving parameters;

seventhly, identifying the pedestrians again;

in step three, X ═ X₁，x₂...x_N]And Y ═ Y₁，y₂...y_N]Are sketches respectivelyAn image set and a common image set; x is formed by R^d*N,Y∈R^d*N(ii) a d is the dimension of the image feature, N represents the total number of sample images; d_SAnd D_NThe dictionary pair respectively represents the sketch image and the common image, and the target function of the heterogeneous dictionary pair is defined as follows by utilizing a dictionary learning technology to learn:

wherein the dictionary pair D_SAnd D_NRespectively as follows: d_S∈R^d*m,D_N∈R^d*mM denotes the number of elements in the dictionary pair; a ═ a₁,a₂...a_N],B＝[b₁,b₂...b_N]A represents X in D_SB represents Y in D_NA matrix of coding coefficients of (c);

in step four, P is equal to R^d*dIs a half-coupled projection matrix; obtaining a half-coupling projection matrix by minimizing the distance between the A and B coding coefficient matrixes; the half-coupling mapping matrix is calculated as follows:

in the fifth step, the metric learning and the dictionary learning are calculated by utilizing the identification thought according to the following modes:

wherein S indicates that the (i, j) elements belong to the same class, and D indicates that the (i, j) elements belong to different classes;

M＝W^TW W∈R^d*d

(5)；

a parameter projection matrix P, coefficients A and B and an identification constraint term W; the regularization term is expressed as:

combining the above formulas (1), (2), (3), (6); then SMD²The objective function of L is rewritten as:

wherein λ is a regularization parameter balance factor;

in the sixth step, the dictionary pair divided by the formula (7) is updated, and the subproblems of representing coefficient updating and half-coupling projection matrix updating comprise:

1) fixing parameters of formula (8) except for parameters A and B and updating parameters A and B; updating and keeping the terms of A and the parameters A only exist, and rewriting the formula (7) as follows:

by mixing alpha_iThe derivative of (a) is set to 0 for solution; alpha is alpha_iExpressed as follows:

α_i＝(D_S ^TD_S+P^TP+λI+(1-β)W^TW)^-1

updating B the entry for which only parameter B exists and keeping it,

b_i＝(D_N ^TD_N+(λ-1)I+(1-β)W^TW)^-1

2) update D_SAnd D_NUpdate D_SEquation (7) is rewritten as follows:

D_Scan be obtained by the formula D_S＝XA^T(AA^T+∧))^-1 (12)；

Λ is a diagonal matrix, update D_NRewrite equation (7) as follows:

3) updating P, fixing other parameters, and rewriting equation (7) as:

solving by setting the derivative of P to 0, P is solved as follows:

P＝BA^T(AA^T+λI))^-1 (15)；

4) finally, W is updated, other parameters are fixed, and formula (7) is rewritten as follows:

w is updated by a gradient descent algorithm,

wherein t represents the iteration number of the algorithm as t;

in the seventh step, the sketch image set and the common image set are input, and the parameters D are solved according to the formulas (12), (15) and (18)_S,D_NP, W; f is the characteristic of the sketch image set, G is the characteristic of the common photo set, and the method specifically comprises the following steps:

solving a pixilated image set dictionary D using equation (9)_SSolving for D using the learned projection matrix P_SThe corresponding coefficient matrix f is calculated as follows:

solving a common image set dictionary D using equation (10)_NThen solve for D_NThe corresponding coefficient matrix g is calculated as follows:

the coefficient matrix f corresponding to the sketch image set and the coefficient matrix g corresponding to the common image set are solved through a formula (19) and a formula (20), the sketch image corresponding to the common image set is searched, and the distance between the two images is calculated to obtain the sketch image, wherein the calculation mode is as follows:

2. The sketch image-based pedestrian retrieval method for identification dictionary learning by using half-coupling metrics as claimed in claim 1, wherein the step one specifically comprises:

shooting pictures in a real scene by using a plurality of cameras to collect data, and collecting data of a plurality of pictures in total; then, taking the data of one camera as a common image set, and processing the data of the other camera to form a sketch image set;

randomly selecting half samples of the sketch images and the common images as a training set, and using the rest samples as a test set; and performs a segmentation process on the image set.

3. The pedestrian retrieval method based on learning of the sketch image semi-coupling metric identification dictionary of claim 1, wherein in the second step, two types of features extracted from the collected pedestrian data set SINPID are evaluated, wherein the two types of features comprise LOMO patch features and PCB depth features; LOMO patch characteristics are extracted by patch; the PCB depth feature is a depth convolution descriptor composed of a plurality of part-level features, takes the whole image as input, and obtains a feature vector of each image.

4. A pedestrian retrieval control system for sketch image based semi-coupling metric identification dictionary learning implementing the pedestrian retrieval method for sketch image based semi-coupling metric identification dictionary learning according to claim 1.

5. A traffic road pedestrian image retrieval terminal implementing the pedestrian retrieval method based on the sketch image half-coupling metric identification dictionary learning of claim 1.