CN110046669A - Half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning - Google Patents

Half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning Download PDF

Info

Publication number
CN110046669A
CN110046669A CN201910325189.4A CN201910325189A CN110046669A CN 110046669 A CN110046669 A CN 110046669A CN 201910325189 A CN201910325189 A CN 201910325189A CN 110046669 A CN110046669 A CN 110046669A
Authority
CN
China
Prior art keywords
image
sketch
dictionary
pedestrian
half coupling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910325189.4A
Other languages
Chinese (zh)
Other versions
CN110046669B (en
Inventor
荆晓远
马飞
黄鹤
姚永芳
訾璐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Petrochemical Technology
Original Assignee
Guangdong University of Petrochemical Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Petrochemical Technology filed Critical Guangdong University of Petrochemical Technology
Priority to CN201910325189.4A priority Critical patent/CN110046669B/en
Publication of CN110046669A publication Critical patent/CN110046669A/en
Application granted granted Critical
Publication of CN110046669B publication Critical patent/CN110046669B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/28Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands

Abstract

The invention belongs to traffic pedestrian image processing technology fields, disclose the pedestrian retrieval method and system of a kind of half Coupling Metric identification dictionary learning based on sketch image, establish the isomery pedestrian retrieval database of oneself, then feature extraction is carried out, feature is extracted from image and marks different people with corresponding color;The sample characteristics extracted are handled, dictionary learning method is introduced, learns the dictionary pair of isomeric data;From sketch map image set and regular image focusing study mapping matrix;Introduce identification algorithm study measurement.The advantage of the invention is that solving the problems, such as that isomery pedestrian's data set lacks and be put forward for the first time half Coupling Metric identification dictionary learning (SMD in isomery pedestrian retrieval field2L) technology.The technology can learn half coupled maps matrix from isomery sample and dictionary centering, reduce the difference between isomeric data to a certain extent.Ideal retrieval effectiveness is achieved on new SINPID data set.

Description

Half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning
Technical field
The invention belongs to traffic pedestrian image processing technology field more particularly to a kind of half degrees of coupling based on sketch image Amount identifies the pedestrian retrieval method of dictionary learning.
Background technique
Currently, the immediate prior art:
Isomery pedestrian retrieval between sketch image and normal image plays in terms of public safety and criminal investigation focuses on It acts on, the purpose of isomery pedestrian retrieval (HPR) is exactly to retrieve the image of the same person from foreign peoples's image set to identify. Although pedestrian retrieval plays a very important role in public safety and criminal investigation, it is still rare to study, up to the present, Pedestrian identifies that field has not been used in the data set of the pedestrian retrieval problem (SINPR) between sketch map image and normal image. It is therefore desirable to acquire pedestrian's data set (SINPID) of sketch image and normal image.
Current most of pedestrians identify the matching problem that problem is laid particular emphasis on mostly under normal scene again, and in certain journey It is solved this problem in that on degree.
However, existing pedestrian again recognition methods is directly applied to the performance that SINPR will will limit them.Half coupling moment Battle array is a kind of effective technology suitable for different data sources application, it can get up the relationship between different data sources. And dictionary learning is that a kind of effective computer vision is applied and object presentation technology.The present invention can learn a projection simultaneously Matrix to reduce the difference between isomery sample, and learns dictionary pair to different data sources.
In conclusion problem of the existing technology is:
(1) existing pedestrian again recognition methods is directly applied to the performance that SINPR will will limit them.It cannot directly use Existing pedestrian's weighing method, because of method of the pervious method based on isomery pedestrian retrieval, its purpose is from foreign peoples's image set In retrieve the image of the same person and identify.And the pedestrian retrieval between unanalyzable sketch map image and normal image Problem.
(2) in the prior art, half coupled maps strategy is not used, the relationship of sketch image and common photo is connected Come, reduces the difference between isomery sample.
(3) being not bound with study metric matrix can reveal that the inherent projection of isomeric data.
(4) for the sketch image and common photo under complex scene, prior art dictionary is preferable applicable to not having Property.
(5) prior art be not bound with differentiation constraint keep same category compact and different classes of separation, be unfavorable for retrieval with Classification.
Up to the present, identify that field has not been used in the pedestrian retrieval between sketch map image and normal image in pedestrian The data set of problem (SINPR).It is therefore desirable to acquire pedestrian's data set (SINPID) of sketch image and normal image.
Solve the difficulty of above-mentioned technical problem:
The present invention needs to establish the data set of vegetarian noodles image and common photo, in the prior art, is not reflected using half coupling Strategy is penetrated, the relationship of sketch image and common photo is connected, reduces the difference between isomery sample.
Need to establish preferable dictionary pair, to meet the relationship of the sketch picture under complex scene and common photo.
Inherence projection for isomeric data, needs associative learning metric matrix.
In order to enable the effect of retrieval and classification is more preferable, need to keep same category compact and different classes of separation.
Solve the meaning of above-mentioned technical problem:
In traffic pedestrian image processing technology field, the isomery pedestrian retrieval between sketch image and normal image is public It is played an important role in terms of safety and criminal investigation, the purpose of isomery pedestrian retrieval (HPR) is exactly to examine from foreign peoples's image set The image that rope goes out the same person identifies.Although pedestrian retrieval plays a very important role in public safety and criminal investigation, Be study it is still rare, up to the present, pedestrian identify field have not been used between sketch map image and normal image The data set of pedestrian retrieval problem (SINPR).It is therefore desirable to acquire pedestrian's data set of sketch image and normal image (SINPID)。
However, existing pedestrian again recognition methods is directly applied to the performance that SINPR will will limit them.Half coupling moment Battle array is a kind of effective technology suitable for different data sources application, it can get up the relationship between different data sources. And dictionary learning is that a kind of effective computer vision is applied and object presentation technology.The present invention can learn a projection simultaneously Matrix to reduce the difference between isomery sample, and learns dictionary pair to different data sources.
Summary of the invention
In view of the problems of the existing technology, the present invention provides a kind of, and half Coupling Metric based on sketch image identifies word The pedestrian retrieval method of allusion quotation study.Currently, identifying that field has not been used between sketch map image and normal image in pedestrian The data set of pedestrian retrieval problem (SINPR).It is therefore desirable to acquire pedestrian's data set of sketch image and normal image (SINPID).Half coupling matrix be it is a kind of suitable for different data sources application effective technology, it can by different data sources it Between relationship get up.And dictionary learning is that a kind of effective computer vision is applied and object presentation technology.This hair simultaneously It is bright to learn a projection matrix, to reduce the difference between isomery sample, and dictionary pair is learnt to different data sources.
The invention is realized in this way a kind of half Coupling Metric based on sketch image identifies the pedestrian retrieval of dictionary learning Method (SMD2L), comprising the following steps:
Step 1: shooting photo with camera and establishing the database of oneself;
Step 2: feature extraction;
Step 3: using dictionary learning technological learning isomery dictionary to DSAnd DN
Step 4: learning projection matrix P from sketch map image set and normal image collection;
Step 5: learning metric matrix W using the thought for identifying study;
Step 6: parametric solution;
Step 7: pedestrian identifies again.
Further, in step 1, photo is shot with camera and establishes the database of oneself, specific practice is:
The photo under real scene is shot to collect data with 2 cameras first in campus, has collected 200 in total A pedestrian totally 400 picture datas.Then using the data of one of camera as normal image collection, another camera Data by processing formed sketch map image set.Therefore pedestrian's data set (SINPID) that the present invention is collected into is by 2 parts Form 1) normal image collection;2) sketch map image set.The present invention has randomly selected the sketch image of half and the sample of normal image As training set, remaining is as test set.And the present invention is split processing to image set, and normal image is 4160* 3120pixels, the present invention are partitioned into everyone from original photo by hand, and last each pedestrian image is 560* 230pixels.Wherein normal image collection is the common RGB image shot with camera, using this group of normal image collection as gallery Data set;Sketch map image set be pigment sketch collection image generated by computer software and human assistance rather than by artist and Witness generates the image of sketch style, using this group of sketch map image set as probe collection.Here the software for generating sketch collection can be with Using Sketch Guru, the type present invention for generating sketch image can choose pigment sketch type.Because witness cannot provide Suspicious appears in court information, but witness can provide rough and comprehensive description, and the image of pigment sketch is ignored to a certain extent Some details of people but the color of clothes and layout are all that the color of identical i.e. housing and trousers has almost no change, this Facilitate from gallery data set to identify a people.SINPID data set can be from https: //sites.google.com/ Site/SINPID2018 is obtained.
Further, in step 2, feature extraction, the present invention extracts collected pedestrian's data set (SINPID) Two kinds of feature is evaluated out, including LOMO patch feature and PCB depth characteristic.Because LOMO can be from three rulers It spends in the image of pyramid representation and extracts hsv color feature and SILTP textural characteristics, this feature is extracted by patch , there is certain robustness to the variation of different points of view;And PCB can extract a kind of depth being made of multiple portions grade feature Convolution descriptor is spent, this feature can be using whole image as input, and obtains the feature vector of each image.
Further, in step 3, using dictionary learning technological learning isomery dictionary to DSAnd DN.Introduce dictionary learning The purpose of technology is that the good data of different-style image in order to obtain indicates, therefore learns dictionary pair respectively to isomeric data. Assuming that X=[x1, x2...xN] and Y=[y1, y2...yN] it is sketch map image set and normal image collection respectively.X∈Rd*N,Y∈Rd* NIt .d is the dimension of characteristics of image, N indicates the total quantity of sample image. DSAnd DNRespectively indicate sketch image and normal image Dictionary pair, therefore can be with using the objective function of dictionary learning technological learning isomery dictionary pair is defined as:
Wherein dictionary is to DSAnd DNIt is respectively as follows: DS∈Rd*m, DN∈Rd*m, what m was indicated is the quantity of the element of dictionary centering.A =[a1,a2...aN], B=[b1,b2...bN] .A indicate be X in DSOn code coefficient matrix, B indicate be Y in DNOn Code coefficient matrix.
Further, in step 4, from sketch image and normal image focusing study projection matrix P.Its purpose be for Sketch map image set and normal image collection relationship between the two are established, one and half coupling projection matrixes are found, to a certain degree The upper difference reduced between them.Assuming that P ∈ Rd*dIt is half coupling projection matrix.By minimizing A and B code coefficient matrix Distance obtains half coupling projection matrix, to reduce the difference between sample.Therefore half coupled maps matrix can be by with lower section Formula calculates:
Further, in step 5, learn metric matrix W using the thought for identifying study.Its purpose is to improve Good character representation ability can make of a sort sample compact in addition identifying bound term, inhomogeneous sample separation. It can be calculated in the following manner using thought progress metric learning and dictionary learning is identified:
What wherein S was indicated is that (i, j) element belongs to same class, and what D was indicated is that (i, j) element belongs to inhomogeneity.
M=WTW W∈Rd*d (5)
In addition, over-fitting in order to prevent, below regularization of the present invention: parameter projection matrix P, coefficient A, B and identifying about Beam item W.Regularization term can indicate are as follows:
In conjunction with above-mentioned formula (1), (2), (3), (6) then SMD2The objective function of L can be rewritten are as follows:
Wherein λ is the regularization parameter balance factor.
Further, in step 6, parametric solution.Formula (7) is for DS,DN, P, W are not joint convex functions, but such as It is that convex function solves for each variable in the case that its dependent variable of fruit is fixed.Therefore formula (7) is divided into 4 by the present invention A sub- problem solving, i.e. dictionary indicate the update of coefficient update and half coupling projection matrix to update.It specifically includes:
1) present invention fixes other parameters to update A and B first.It updates A to retain the item for only existing A, formula (7) It can rewrite are as follows:
For solution formula (8), the present invention can be by by αiDerivative be set as 0 to solve.αiIt can
It is expressed as:
αi=(DS TDS+PTP+λI+(1-β)WTW)-1
B is similar with the solution of A,
bi=(DN TDN+(λ-1)I+(1-β)WTW)-1
2) D is updatedSAnd DN, update DSFormula (7) can be rewritten as follows:
For solution formula (11), DSCalculating can be obtained by following formula: DS=XAT(AAT+∧))-1(12); ∧ is a diagonal matrix, similar to formula (11), updates DNFormula (7) can be rewritten as follows:
3) P, fixed other parameters are updated, the present invention can rewrite formula (7) are as follows:
Solution formula (14) can be solved by setting 0 for the derivative of P, and the solution of P is as follows:
P=BAT(AAT+λI))-1(15);
4) final updating W, fixed other parameters, formula (7) can be rewritten as follows:
The present invention can update W by gradient descent algorithm,
Wherein t indicate be algorithm the number of iterations be t.
Further, in step 7, pedestrian identifies again.The sketch image and normal image for having randomly selected half are as instruction Practice collection, remaining is as test set.And using sketch image as probe collection, normal image is as gallery collection.Provide a width From the sketch image of probe, pedestrian retrieval is carried out.Sketch map image set and normal image collection are inputted, according to above-mentioned formula (12), (15), (18) solve parameter DS,DN,P,W.Vacation lets f be the feature of sketch map image set, and G is the feature of common photo collection,
The present invention executes pedestrian retrieval in the following way:
1) sketch map image set dictionary D is solved first with formula (9)S, using the projection matrix P learnt, to solve DSIt is right The coefficient matrix f answered, calculation are as follows:
2) normal image collection dictionary D is solved using formula (10)N, then solve DNCorresponding coefficient matrix g, calculation It is as follows:
3) coefficient matrix f and normal image collection corresponding to sketch map image set are solved by formula (19) and formula (20) Corresponding coefficient matrix g, the present invention, which can be retrieved, concentrates corresponding sketch image in normal image, can be by calculating two The distance between a image obtains, and calculation is as follows:
Calculation formula (21) solves corresponding distance, then adjusts the distance and be ranked up, wherein apart from the smallest common photograph Piece is exactly the picture being retrieved using sketch image.
Another object of the present invention is to provide a kind of, and half Coupling Metric based on sketch image identifies the row of dictionary learning People retrieves control system.
Another object of the present invention is to provide a kind of, and half Coupling Metric based on sketch image identifies the row of dictionary learning The traffic route pedestrian image searching terminal of people's search method.
In conclusion advantages of the present invention and good effect are as follows:
The present invention solves the problems, such as that isomery pedestrian's data set lacks and is put forward for the first time in isomery pedestrian retrieval field Half Coupling Metric identifies dictionary learning (SMD2L) technology.The technology can be reflected from isomery sample and half coupling of dictionary centering study Matrix is penetrated, reduces the difference between isomeric data to a certain extent.Ideal inspection is achieved on new SINPID data set Suo Xiaoguo.
To verify whether this algorithm has good superiority, half Coupling Metric based on sketch image is identified into dictionary learning Pedestrian retrieval algorithm and 6 kinds of comparison algorithms KISSME, XQDA, TDL, SLD2L, JDML and PCB are compared.This 6 kinds comparisons Algorithm includes being based on dictionary learning based on measurement, is based on deep learning, the algorithm identified again based on pedestrian.Finally in the present invention New data set SINPID verified as experimental data.
The evaluation index of retrieval is the matching probability that accumulative matching properties curve CMC, CMC curve is a kind of top-k, Each sample that normal image is concentrated successively is taken into a distance with the calculation of its probe image collection, is then ranked up, rank is The value of the value of top-k selected by the present invention, CMC matching rate tends to 1, if test number it is more, identification it is accurate Rate is better.
In order to verify the method for the present invention SMD2The performance of L is tested in new SINPID data set, and experimental result is table 1 And Fig. 2.
The matching rate of top-r of the table 1 on SINPID data set
What (a) was indicated in Fig. 2 be using the experimental result of LOMO feature, (b) in indicate is reality using PCB feature Test result.From the experimental result in table 1 it can be seen that algorithm of the invention obtains higher matching rate in comparison algorithm, For example, SMD2L has the comparison algorithm XQDA of LOMO feature to compare in SINPID data set, and the matching rate of Rank-1 improves 2.1% (=36.2%-34.1%).The present invention also utilizes depth characteristic and assesses simultaneously, it will be seen that some results is lower than The method of LOMO feature.Its reason may have three aspects: 1) SINPID data set only has the image of 400 200 people, this makes PCB does not pass through enough sample trainings.2) watercolor sketch formula image is than information that the common photo of another video camera includes Amount is few.3) network architecture is not suitable for isomery pedestrian sample, and is suitable for two kinds of normal samples.
By being tested above it can be seen that the method for the present invention SMD2Most of matching rates of L are better than other with LOMO The method of feature and PCB depth characteristic.Mainly have 3 aspects: 1) using half coupled maps strategy, can by sketch image with it is general The relationship of logical photo connects, and reduces the difference between isomery sample.2) study metric matrix can reveal that isomeric data Inherence projection.3) for the sketch image and common photo under complex scene, dictionary is to preferable applicability.4) differentiate about Beam can make the compact and different classes of separation of same category, be conducive to retrieval and classification task.As seen from the above analysis, newly SINPID data set be to different types of feature it is stable, suitable for the further evaluation of pedestrian retrieval, show SMD2L is calculated The superiority of method.
Detailed description of the invention
Fig. 1 is the pedestrian retrieval that half Coupling Metric provided in an embodiment of the present invention based on sketch image identifies dictionary learning Method flow diagram.
Fig. 2 is the performance map of distinct methods provided in an embodiment of the present invention and feature on new SINPID data set.
Specific embodiment
In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to embodiments, to the present invention It is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not used to Limit the present invention.
In the prior art, half coupled maps strategy is not used, the relationship of sketch image and common photo is connected, Reduce the difference between isomery sample.Being not bound with study metric matrix can reveal that the inherent projection of isomeric data.For multiple Sketch image and common photo under miscellaneous scene, prior art dictionary is to without preferable applicability.The prior art is not tied It closes and differentiates that constraint keeps same category compact and different classes of separation, be unfavorable for retrieving and classify.
In order to solve the above technical problems, below with reference to concrete scheme, the present invention is described in detail.
As shown in Figure 1, half Coupling Metric provided in an embodiment of the present invention based on sketch image identifies the row of dictionary learning People's search method includes:
Step 1: shooting photo with camera and establishing the database of oneself: true with 2 camera shootings first in campus Photo under real field scape collects data, has collected 200 pedestrians totally 400 picture datas in total.Then one of them is taken the photograph As the data of head are as normal image collection, the data of another camera form sketch map image set by processing.Therefore this hair The bright pedestrian's data set (SINPID) being collected into forms 1 by 2 parts) normal image collection;2) sketch map image set.The present invention with Machine has chosen the sketch image of half and the sample of normal image as training set, remaining is as test set.And the present invention Processing is split to image set, normal image is 4160*3120pixels, and the present invention is partitioned into often by hand from original photo One people, last each pedestrian image is 560*230pixels.Wherein normal image collection is the common RGB shot with camera Image, using this group of normal image collection as gallery data set;Sketch map image set is to generate water by computer software and human assistance Powder sketch collection image rather than the image that sketch style is generated by artist and witness, using this group of sketch map image set as spy Needle collection.Here Sketch Guru can be used in the software for generating sketch collection, and the type present invention for generating sketch image can choose Pigment sketch type.Because witness cannot provide suspicious information of appearing in court, but witness can provide rough and comprehensive description, water The image of powder sketch has ignored some details of people to a certain extent but the color of clothes and layout are all identical i.e. outer The color of set and trousers has almost no change, this facilitates from gallery data set to identify a people.SINPID data set can To be obtained from https: //sites.google.com/site/SINPID2018.
Step 2: feature extraction, the present invention extracts collected pedestrian's data set (SINPID) two kinds of Feature is evaluated, including LOMO patch feature and PCB depth characteristic.Because LOMO can be from three scale pyramid tables Hsv color feature and SILTP textural characteristics are extracted in the image shown, this feature is extracted by patch, to different views The variation of point has certain robustness;And PCB can extract a kind of depth convolution description being made of multiple portions grade feature Symbol, this feature can be using whole image as input, and obtains the feature vector of each image.
Step 3: using dictionary learning technological learning isomery dictionary to DSAnd DN.The purpose for introducing dictionary learning technology is The good data of different-style image indicates in order to obtain, therefore learns dictionary pair respectively to isomeric data.Assuming that X=[x1, x2...xN] and Y=[y1, y2...yN] it is sketch map image set and normal image collection respectively. X∈Rd*N,Y∈Rd*NIt .d is image spy The dimension of sign, N indicate the total quantity of sample image.DSAnd DNRespectively indicate the dictionary pair of sketch image and normal image, therefore benefit It can be with the objective function of dictionary learning technological learning isomery dictionary pair is defined as:
Wherein dictionary is to DSAnd DNIt is respectively as follows: DS∈Rd*m,DN∈Rd*m, what m was indicated is the quantity of the element of dictionary centering.A =[a1,a2...aN], B=[b1,b2...bN] .A indicate be X in DSOn code coefficient matrix, B indicate be Y in DNOn Code coefficient matrix.
Step 4: from sketch image and normal image focusing study projection matrix P.Its purpose is to establish sketch map Image set and normal image collection relationship between the two, find one and half coupling projection matrixes,
The difference between them is reduced to a certain extent.Assuming that P ∈ Rd*dIt is half coupling projection matrix.By minimizing A With the distance of B code coefficient matrix, obtain half coupling projection matrix, thus reduce sample it
Between difference.Therefore half coupled maps matrix can calculate in the following manner:
Step 5: learning metric matrix W using the thought for identifying study.Its purpose is to improve good mark sheet Show ability, in addition identifying bound term, of a sort sample can be made compact, inhomogeneous sample separation.Utilize identification thought Carrying out metric learning and dictionary learning can calculate in the following manner:
What wherein S was indicated is that (i, j) element belongs to same class, and what D was indicated is that (i, j) element belongs to inhomogeneity.
M=WTW W∈Rd*d(5);
In addition, over-fitting in order to prevent, below regularization of the present invention: parameter projection matrix P, coefficient A, B and identifying about Beam item W.Regularization term can indicate are as follows:
In conjunction with above-mentioned formula (1), (2), (3), (6) then SMD2The objective function of L can be rewritten are as follows:
Wherein λ is the regularization parameter balance factor.
In step 6, parametric solution.Formula (7) is for DS,DN, P, W are not joint convex functions, but if other It is that convex function solves for each variable in the case that variable is fixed.Therefore formula (7) is divided into 4 sons and asked by the present invention Topic solves, i.e., dictionary indicates the update of coefficient update and half coupling projection matrix to update.It specifically includes:
1) present invention fixes other parameters to update A and B first.It updates A to retain the item for only existing A, formula (7) It can rewrite are as follows:
For solution formula (8), the present invention can be by by αiDerivative be set as 0 to solve.αiIt is represented by as follows:
αi=(DS TDS+PTP+λI+(1-β)WTW)-1
B is similar with the solution of A,
bi=(DN TDN+(λ-1)I+(1-β)WTW)-1
2) D is updatedSAnd DN, update DSFormula (7) can be rewritten as follows:
For solution formula (11), DSCalculating can be obtained by following formula: DS=XAT(AAT+∧))-1(12); ∧ is a diagonal matrix, similar to formula (11), updates DNFormula (7) can be rewritten as
It is as follows:
3) P, fixed other parameters are updated, the present invention can rewrite formula (7) are as follows:
Solution formula (14) can be solved by setting 0 for the derivative of P, and the solution of P is as follows:
P=BAT(AAT+λI))-1(15);
4) final updating W, fixed other parameters, formula (7) can be rewritten as follows:
The present invention can update W by gradient descent algorithm,
Wherein t indicate be algorithm the number of iterations be t.
Further, in step 7, pedestrian identifies again.The sketch image and normal image for having randomly selected half are as instruction Practice collection, remaining is as test set.And using sketch image as probe collection, normal image is as gallery collection.Provide a width From the sketch image of probe, pedestrian retrieval is carried out.Sketch map image set and normal image collection are inputted, according to above-mentioned formula (12), (15), (18) solve parameter DS,DN,P,W.Vacation lets f be the feature of sketch map image set, and G is the feature of common photo collection,
The present invention executes pedestrian retrieval in the following way:
1) sketch map image set dictionary D is solved first with formula (9)S, using the projection matrix P learnt, to solve DSIt is right The coefficient matrix f answered, calculation are as follows:
2) normal image collection dictionary D is solved using formula (10)N, then solve DNCorresponding coefficient matrix g, calculation It is as follows:
3) coefficient matrix f and normal image collection corresponding to sketch map image set are solved by formula (19) and formula (20) Corresponding coefficient matrix g, the present invention, which can be retrieved, concentrates corresponding sketch image in normal image, can be by calculating two The distance between a image obtains, and calculation is as follows:
Calculation formula (21) solves corresponding distance, then adjusts the distance and be ranked up, wherein apart from the smallest common photograph Piece is exactly the picture being retrieved using sketch image.
The present invention is described further below with reference to experiment.
To verify whether this algorithm has good superiority, half Coupling Metric based on sketch image is identified into dictionary learning Pedestrian retrieval algorithm and 6 kinds of comparison algorithms KISSME, XQDA, TDL, SLD2L, JDML and PCB are compared.This 6 kinds comparisons Algorithm includes being based on dictionary learning based on measurement, is based on deep learning, the algorithm identified again based on pedestrian.Finally in the present invention New data set SINPID verified as experimental data.
The evaluation index of retrieval is the matching probability that accumulative matching properties curve CMC, CMC curve is a kind of top-k, Each sample that normal image is concentrated successively is taken into a distance with the calculation of its probe image collection, is then ranked up, rank is The value of the value of top-k selected by the present invention, CMC matching rate tends to 1, if test number it is more, identification it is accurate Rate is better.
In order to verify the method for the present invention SMD2The performance of L is tested in new SINPID data set, and experimental result is table 1 And Fig. 2.
The matching rate of top-r of the table 1 on SINPID data set
What (a) was indicated in Fig. 2 be using the experimental result of LOMO feature, (b) in indicate is reality using PCB feature Test result.From the experimental result in table 1 it can be seen that algorithm of the invention obtains higher matching rate in comparison algorithm, For example, SMD2L has the comparison algorithm XQDA of LOMO feature to compare in SINPID data set, and the matching rate of Rank-1 improves 2.1% (=36.2%-34.1%).The present invention also utilizes depth characteristic and assesses simultaneously, it will be seen that some results is lower than The method of LOMO feature.Its reason may have three aspects: 1) SINPID data set only has the image of 400 200 people, this makes PCB does not pass through enough sample trainings.2) watercolor sketch formula image is than information that the common photo of another video camera includes Amount is few.3) network architecture is not suitable for isomery pedestrian sample, and is suitable for two kinds of normal samples.
By being tested above it can be seen that the method for the present invention SMD2Most of matching rates of L are better than other with LOMO The method of feature and PCB depth characteristic.Mainly have 3 aspects: 1) using half coupled maps strategy, can by sketch image with it is general The relationship of logical photo connects, and reduces the difference between isomery sample.2) study metric matrix can reveal that isomeric data Inherence projection.3) for the sketch image and common photo under complex scene, dictionary is to preferable applicability.4) differentiate about Beam can make the compact and different classes of separation of same category, be conducive to retrieval and classification task.As seen from the above analysis, newly SINPID data set be to different types of feature it is stable, suitable for the further evaluation of pedestrian retrieval, show SMD2L is calculated The superiority of method.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention Made any modifications, equivalent replacements, and improvements etc., should all be included in the protection scope of the present invention within mind and principle.

Claims (10)

1. the pedestrian retrieval method that a kind of half Coupling Metric based on sketch image identifies dictionary learning, which is characterized in that described The pedestrian retrieval method that half Coupling Metric based on sketch image identifies dictionary learning includes:
Establish isomery pedestrian retrieval database SINPID, including sketch map image set and regular image set;Feature extraction is carried out, from figure Feature is extracted as in, and different image labelings is carried out with corresponding color;
Data processing to isomery is carried out to the sample characteristics extracted;Introduce the word of dictionary learning method study isomeric data Allusion quotation pair, from sketch map image set and regular image focusing study mapping matrix;
And identification algorithm study measurement is introduced, carry out pedestrian's inspection that half Coupling Metric based on sketch image identifies dictionary learning Rope.
2. half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning as described in claim 1, It is characterized in that, it includes following step that the pedestrian retrieval that half Coupling Metric based on sketch image identifies dictionary learning, which calculates method, It is rapid:
Step 1: shooting photo with camera and establishing the database of oneself;
Step 2: feature extraction;
Step 3: using dictionary learning technological learning isomery dictionary to DSAnd DN
Step 4: learning projection matrix P from sketch map image set and normal image collection;
Step 5: learning metric matrix W using the thought for identifying study;
Step 6: parametric solution;
Step 7: pedestrian identifies again.
3. half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning as claimed in claim 2, It is characterized in that, is specifically included in step 1:
The photo under real scene is shot with multiple cameras to collect data, collects multiple pictures data in total;Then by one The data of camera are as normal image collection, and in addition the data of a camera form sketch map image set by processing;
Sketch map image set randomly selects the sketch image of half and the sample of normal image as training set, and remaining be used as is surveyed Examination collection;And processing is split to image set.
4. half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning, step as claimed in claim 2 In rapid two, two kinds of feature is extracted to collected pedestrian's data set SINPID and is evaluated, including LOMO patch Feature and PCB depth characteristic;LOMO patch feature is extracted by patch;PCB depth characteristic is to be made of multiple portions grade feature Depth convolution descriptor and obtain the feature vector of each image using whole image as input.
5. half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning, step as claimed in claim 2 In rapid three, X=[x1, x2...xN] and Y=[y1, y2...yN] it is sketch map image set and normal image collection respectively;X∈Rd*N,Y∈ Rd*N;D is the dimension of characteristics of image, and N indicates the total quantity of sample image;DSAnd DNRespectively indicate sketch image and normal image Dictionary pair utilizes the objective function of dictionary learning technological learning isomery dictionary pair is defined as:
Wherein dictionary is to DSAnd DNIt is respectively as follows: DS∈Rd*m,DN∈Rd*m, what m was indicated is the quantity of the element of dictionary centering.A= [a1,a2...aN], B=[b1,b2...bN] .A indicate be X in DSOn code coefficient matrix, B indicate be Y in DNOn volume Code coefficient matrix.
6. half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning, step as claimed in claim 2 In rapid four, P ∈ Rd*dIt is half coupling projection matrix;By minimizing the distance of A and B code coefficient matrix, half coupling projection is obtained Matrix;Half coupled maps matrix calculates in the following manner:
In step 5, using identifying, thought carries out metric learning and dictionary learning calculates in the following manner:
What wherein S was indicated is that (i, j) element belongs to same class, and what D was indicated is that (i, j) element belongs to inhomogeneity;
M=WTW W∈Rd*d(5);
Parameter projection matrix P, coefficient A, B and identification bound term W;Regularization term indicates are as follows:
In conjunction with above-mentioned formula (1), (2), (3), (6);Then SMD2The objective function of L is rewritten are as follows:
Wherein λ is the regularization parameter balance factor.
7. half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning, step as claimed in claim 2 The dictionary for dividing formula (7) in rapid six indicates that the update subproblem of coefficient update and half coupling projection matrix includes: to update
1) parameter and undated parameter A and B of the set formula (8) in addition to parameter A and B;A is updated to only existing the item of parameter A simultaneously Retained, formula (7) is rewritten are as follows:
By by αiDerivative be set as 0 solution;αiIt is expressed as:
αi=(DS TDS+PTP+λI+(1-β)WTW)-1
B is updated to the item for only existing parameter B and to retain,
bi=(DN TDN+(λ-1)I+(1-β)WTW)-1
2) D is updatedSAnd DN, update DSFormula (7) is rewritten as follows:
DSCalculating can be obtained by following formula: DS=XAT(AAT+∧))-1(12);
∧ is a diagonal matrix, updates DNFormula (7) is rewritten as follows:
3) P is updated, fixed other parameters rewrite formula (7) are as follows:
By setting 0 solution for the derivative of P, the solution of P is as follows:
P=BAT(AAT+λI))-1(15);
4) final updating W, fixed other parameters, formula (7) is rewritten as follows:
W is updated by gradient descent algorithm,
Wherein t indicate be algorithm the number of iterations be t;
8. half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning, step as claimed in claim 2 In rapid seven, sketch map image set and normal image collection are inputted, according to formula (12), (15), (18) solve parameter DS,DN,P,W;F is The feature of sketch map image set, G are the features of common photo collection, are specifically included:
Sketch map image set dictionary D is solved using formula (9)S, using the projection matrix P learnt, to solve DSCorresponding coefficient square Battle array f, calculation are as follows:
Normal image collection dictionary D is solved using formula (10)N, then solve DNCorresponding coefficient matrix g, calculation are as follows:
It is solved corresponding to coefficient matrix f and normal image collection corresponding to sketch map image set by formula (19) and formula (20) Coefficient matrix g, retrieve and concentrate corresponding sketch image in normal image, obtained by calculating the distance between two images , calculation is as follows:
Calculation formula (21) solves corresponding distance, then adjusts the distance and be ranked up, and is exactly benefit apart from the smallest common photo The picture being retrieved with sketch image.
9. a kind of pedestrian retrieval side for implementing half Coupling Metric described in claim 1 based on sketch image and identifying dictionary learning Half Coupling Metric based on sketch image of method identifies the pedestrian retrieval control system of dictionary learning.
10. the pedestrian retrieval side that kind of implementation half Coupling Metric described in claim 1 based on sketch image identifies dictionary learning The traffic route pedestrian image searching terminal of method.
CN201910325189.4A 2019-04-22 2019-04-22 Pedestrian retrieval method based on sketch image half-coupling metric identification dictionary learning Active CN110046669B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910325189.4A CN110046669B (en) 2019-04-22 2019-04-22 Pedestrian retrieval method based on sketch image half-coupling metric identification dictionary learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910325189.4A CN110046669B (en) 2019-04-22 2019-04-22 Pedestrian retrieval method based on sketch image half-coupling metric identification dictionary learning

Publications (2)

Publication Number Publication Date
CN110046669A true CN110046669A (en) 2019-07-23
CN110046669B CN110046669B (en) 2019-12-24

Family

ID=67278414

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910325189.4A Active CN110046669B (en) 2019-04-22 2019-04-22 Pedestrian retrieval method based on sketch image half-coupling metric identification dictionary learning

Country Status (1)

Country Link
CN (1) CN110046669B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113240715A (en) * 2021-05-29 2021-08-10 刘文平 Multiple feature-driven similarity measurement learning figure retrieval method
CN113553947A (en) * 2021-07-23 2021-10-26 湖南大学 Method and device for re-identifying pedestrian by embedding multi-mode into generation description and electronic equipment
CN116052095A (en) * 2023-03-31 2023-05-02 松立控股集团股份有限公司 Vehicle re-identification method for smart city panoramic video monitoring

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7567715B1 (en) * 2004-05-12 2009-07-28 The Regents Of The University Of California System and method for representing and encoding images
US20120162244A1 (en) * 2010-12-28 2012-06-28 Microsoft Corporation Image search color sketch filtering
US20140379730A1 (en) * 2013-06-24 2014-12-25 Fujitsu Limited Multimodality-based image tagging apparatus and method
CN105550649A (en) * 2015-12-09 2016-05-04 武汉工程大学 Extremely low resolution human face recognition method and system based on unity coupling local constraint expression
CN106919919A (en) * 2017-02-28 2017-07-04 西安电子科技大学 A kind of SAR target discrimination methods based on multiple features fusion word bag model
CN108027836A (en) * 2015-09-24 2018-05-11 高通股份有限公司 With the system and method for the query image access images of capture
US20180174313A1 (en) * 2016-12-20 2018-06-21 Toshiba Tec Kabushiki Kaisha Information processing apparatus and information processing method
CN108647690A (en) * 2017-10-17 2018-10-12 南京工程学院 The sparse holding projecting method of differentiation for unconstrained recognition of face
CN108932518A (en) * 2018-06-22 2018-12-04 大连理工大学 A kind of feature extraction of shoes watermark image and search method of view-based access control model bag of words
CN109064403A (en) * 2018-08-10 2018-12-21 安徽师范大学 Fingerprint image super-resolution method based on classification coupling dictionary rarefaction representation
CN109117860A (en) * 2018-06-27 2019-01-01 南京邮电大学 A kind of image classification method based on subspace projection and dictionary learning
CN109284668A (en) * 2018-07-27 2019-01-29 昆明理工大学 A kind of pedestrian's weight recognizer based on apart from regularization projection and dictionary learning
CN109409201A (en) * 2018-09-05 2019-03-01 昆明理工大学 A kind of pedestrian's recognition methods again based on shared and peculiar dictionary to combination learning

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7567715B1 (en) * 2004-05-12 2009-07-28 The Regents Of The University Of California System and method for representing and encoding images
US20120162244A1 (en) * 2010-12-28 2012-06-28 Microsoft Corporation Image search color sketch filtering
US20140379730A1 (en) * 2013-06-24 2014-12-25 Fujitsu Limited Multimodality-based image tagging apparatus and method
CN108027836A (en) * 2015-09-24 2018-05-11 高通股份有限公司 With the system and method for the query image access images of capture
CN105550649A (en) * 2015-12-09 2016-05-04 武汉工程大学 Extremely low resolution human face recognition method and system based on unity coupling local constraint expression
US20180174313A1 (en) * 2016-12-20 2018-06-21 Toshiba Tec Kabushiki Kaisha Information processing apparatus and information processing method
CN106919919A (en) * 2017-02-28 2017-07-04 西安电子科技大学 A kind of SAR target discrimination methods based on multiple features fusion word bag model
CN108647690A (en) * 2017-10-17 2018-10-12 南京工程学院 The sparse holding projecting method of differentiation for unconstrained recognition of face
CN108932518A (en) * 2018-06-22 2018-12-04 大连理工大学 A kind of feature extraction of shoes watermark image and search method of view-based access control model bag of words
CN109117860A (en) * 2018-06-27 2019-01-01 南京邮电大学 A kind of image classification method based on subspace projection and dictionary learning
CN109284668A (en) * 2018-07-27 2019-01-29 昆明理工大学 A kind of pedestrian's weight recognizer based on apart from regularization projection and dictionary learning
CN109064403A (en) * 2018-08-10 2018-12-21 安徽师范大学 Fingerprint image super-resolution method based on classification coupling dictionary rarefaction representation
CN109409201A (en) * 2018-09-05 2019-03-01 昆明理工大学 A kind of pedestrian's recognition methods again based on shared and peculiar dictionary to combination learning

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MA FEI ETC: "Multi-View based Coupled Dictionary Learning for Person Re-identification", 《2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS,》 *
XIAO-YUAN JING ETC: "Multi-spectrallow-rankstructureddictionarylearning", 《PATTERNRECOGNITION》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113240715A (en) * 2021-05-29 2021-08-10 刘文平 Multiple feature-driven similarity measurement learning figure retrieval method
CN113553947A (en) * 2021-07-23 2021-10-26 湖南大学 Method and device for re-identifying pedestrian by embedding multi-mode into generation description and electronic equipment
CN116052095A (en) * 2023-03-31 2023-05-02 松立控股集团股份有限公司 Vehicle re-identification method for smart city panoramic video monitoring

Also Published As

Publication number Publication date
CN110046669B (en) 2019-12-24

Similar Documents

Publication Publication Date Title
CN109948425B (en) Pedestrian searching method and device for structure-aware self-attention and online instance aggregation matching
You et al. Relative CNN-RNN: Learning relative atmospheric visibility from images
Mouine et al. A shape-based approach for leaf classification using multiscaletriangular representation
CN106446150B (en) A kind of method and device of vehicle precise search
CN105809146B (en) A kind of image scene recognition methods and device
CN111104867A (en) Recognition model training and vehicle heavy recognition method and device based on component segmentation
CN110046669A (en) Half Coupling Metric based on sketch image identifies the pedestrian retrieval method of dictionary learning
CN107506703A (en) A kind of pedestrian's recognition methods again for learning and reordering based on unsupervised Local Metric
CN109344851B (en) Image classification display method and device, analysis instrument and storage medium
CN109299707A (en) A kind of unsupervised pedestrian recognition methods again based on fuzzy depth cluster
CN108764269A (en) A kind of cross datasets pedestrian recognition methods again based on space-time restriction incremental learning
CN109558833A (en) A kind of face recognition algorithms evaluating method and device
Clarke et al. Perceptual similarity: A texture challenge.
CN114758362B (en) Clothing changing pedestrian re-identification method based on semantic perception attention and visual shielding
CN112288761A (en) Abnormal heating power equipment detection method and device and readable storage medium
CN112507853A (en) Cross-mode pedestrian re-identification method based on mutual attention mechanism
CN116897012A (en) Method and device for identifying physique of traditional Chinese medicine, electronic equipment, storage medium and program
Burghardt et al. Automated visual recognition of individual African penguins
CN109670423A (en) A kind of image identification system based on deep learning, method and medium
Zhang et al. Deep feature learning with attributes for cross-modality person re-identification
CN114579794A (en) Multi-scale fusion landmark image retrieval method and system based on feature consistency suggestion
CN113220915B (en) Remote sensing image retrieval method and device based on residual attention
CN109492120A (en) Model training method, search method, device, electronic equipment and storage medium
CN113705310A (en) Feature learning method, target object identification method and corresponding device
CN109241315A (en) A kind of fast face search method based on deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant