CN109508663A - A kind of pedestrian's recognition methods again based on multi-level supervision network - Google Patents

A kind of pedestrian's recognition methods again based on multi-level supervision network Download PDF

Info

Publication number
CN109508663A
CN109508663A CN201811299473.0A CN201811299473A CN109508663A CN 109508663 A CN109508663 A CN 109508663A CN 201811299473 A CN201811299473 A CN 201811299473A CN 109508663 A CN109508663 A CN 109508663A
Authority
CN
China
Prior art keywords
pedestrian
network
feature vector
level supervision
recognition methods
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811299473.0A
Other languages
Chinese (zh)
Other versions
CN109508663B (en
Inventor
张君鹏
申瑞民
姜飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University filed Critical Shanghai Jiaotong University
Priority to CN201811299473.0A priority Critical patent/CN109508663B/en
Publication of CN109508663A publication Critical patent/CN109508663A/en
Application granted granted Critical
Publication of CN109508663B publication Critical patent/CN109508663B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/103Static body considered as a whole, e.g. static pedestrian or occupant recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24147Distances to closest patterns, e.g. nearest neighbour classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The present invention relates to a kind of pedestrian's recognition methods again based on multi-level supervision network, this method extracts the feature of different semantic hierarchies to pedestrian image by a multi-level supervision network, and then realizes that pedestrian identifies again;The multi-level supervision network is including a multilayer depth convolutional neural networks as core network and multiple categorization modules as feature extraction sub-network;Pedestrian image is converted to the characteristic pattern of different semantic hierarchies by core network, each categorization module is separately converted to the feature vector with distinction by each layer characteristic pattern that supervised learning extracts core network, feature vector splicing shape on all levels is final feature vector, realizes that pedestrian identifies again based on the final feature vector.Compared with prior art, the present invention extracts the feature of pedestrian image difference semantic hierarchies, improves the distinction of feature, and the stability of training process is improved in the way of the supervised learning of half-separation type, network accuracy rate performance is improved, has many advantages, such as that heavy recognition accuracy is high.

Description

A kind of pedestrian's recognition methods again based on multi-level supervision network
Technical field
The present invention relates to a kind of pedestrians again recognition methods, more particularly, to a kind of pedestrian's weight based on multi-level supervision network Recognition methods.
Background technique
Pedestrian in video identify again be computer vision and artificial intelligence field an important subject.Its task Target can summarize are as follows: the image (query image) of given one (or multiple) pedestrian to be searched needs having prison All images of the pedestrian are searched out in control video image set (gallery images).Pedestrian identify again intelligent security guard, The fields such as urban safety are of great immediate significance and are worth, and are a big hot spots of Recent study.
However, in reality scene, since video camera shooting angle, shooting distance, locating light environment etc. are different, With a group traveling together, there are significant vision differences in different video.In addition to this, attitudes vibration caused by human motion, block Phenomena such as further improve the difficulty of this task.Then, how pedestrian's image to be extracted under the influence of these factors and is had The feature of high discrimination and be for identification this technical field problem.
Existing pedestrian's weight identification technology is commonly divided into 3 steps.Scheme firstly, preparing the largely pedestrian with identity label Decent, as training set data library.Later, the depth convolutional neural networks of training set data training specific structure are utilized.Mind Structure and training method through network generally determine the accuracy rate performance of pedestrian weight identifying system, are a most important steps. Finally, using trained convolutional network to search library image zooming-out feature.When needing to identify pedestrian again, only need to utilize Trained convolutional network extracts feature to pedestrian image to be checked (query image), by obtained feature vector and retrieval The feature vector of library image compares cosine similarity or Euclidean distance one by one and is ranked up.The most similar several library images are For the output result of pedestrian's weight identifying system.
Under the above technological frame, prior art focuses primarily on the structure design and instruction of depth convolutional neural networks The mode of white silk designs.This kind of technology can substantially be divided into the pedestrian of two classes (1) based on provincial characteristics and identify again.(2) it is based on metric learning Pedestrian identify again.
Pedestrian's weight identifying schemes based on provincial characteristics divide an image into multiple horizontal zones generally according to spatial position Or net region.After region division, feature is extracted using depth convolutional neural networks to each block respectively.Feature mentions The process taken may be summarized to be: original image (or block) is sent into convolutional neural networks, by multiple convolutional layers, batch normalization The network units such as layer, nonlinear activation layer obtain the characteristic pattern containing high-level semantic feature.Later, by obtained characteristic pattern into The global average pond of row, as the feature vector for representing this image block.Finally, the feature vector of all blocks is melted It closes or merges, obtain the feature vector for representing this group traveling together.For example, document " Glad:global-local-alignment descriptor for pedestrian retrieval”(Wei L,Zhang S,Yao H,et al.Proceedings of The 2017ACM on Multimedia Conference.ACM, 2017:420-428) human body is drawn according to human body key point It is divided into three head, the upper part of the body, lower part of the body regions, and respectively to these three extracted region features.Document " Beyond Part Models:Person Retrieval with Refined Part Pooling”(Sun Y,Zheng L,Yang Y,et Al.arXiv preprint arXiv:1711.09349,2017) pedestrian image level is divided into 6 blocks, and right respectively Six extracted region features.
Pedestrian's weight identifying schemes based on metric learning are common usually using well-designed loss function training network Metric learning loss function has control loss function, triple loss function, big distance s oftmax loss function etc..
There is also following disadvantages for the above-mentioned prior art:
1, during extracting feature, such methods only with depth convolutional network the last layer feature, to net The utilization rate of network is not high.
2, such methods do not utilize multilayer semantic information caused by network.Although the characteristic pattern of network the last layer wraps Containing stronger semantic information, but it also can be therefore easily lost image detail, and then cause the distinction of feature limited.
3, the pedestrian based on metric learning identifies that network is generally more difficult with training again.
Summary of the invention
It is an object of the present invention to overcome the above-mentioned drawbacks of the prior art and provide one kind based on multi-level prison Superintend and direct pedestrian's recognition methods again of network.
An object of the present invention is to solve existing pedestrian weight identification technology to convolutional network middle layer characteristic use rate not High problem improves the distinction and robustness of general characteristic.
The second object of the present invention is to improve the stability of network training process, and promote network accuracy rate performance.
The purpose of the present invention can be achieved through the following technical solutions:
A kind of pedestrian's recognition methods again based on multi-level supervision network, this method are based on a multi-level supervision network implementations Pedestrian identifies that the multi-level supervision network includes a multilayer depth convolutional neural networks as core network and multiple points again Pedestrian image is converted to the characteristic pattern of different semantic hierarchies as feature extraction sub-network, the core network by generic module, respectively The categorization module is separately converted to the feature with distinction by each layer characteristic pattern that supervised learning extracts core network Vector, the feature vector splicing shape on all levels is final feature vector, realizes that pedestrian knows again based on the final feature vector Not.
Further, the multilayer depth convolutional neural networks are made of multiple residual error convolution modules, each residual error convolution mould Block includes several convolutional layers, batch normalization layer and nonlinear activation layer.
Further, the input of multiple categorization modules respectively corresponds the defeated of multiple residual error convolution modules in core network Out.
Further, each categorization module not shared parameter.
Further, the categorization module include the convolutional layer set gradually, it is batch normalization layer, nonlinear activation layer, complete The average pond layer of office, dropout layers, the full articulamentum of dimension, batch normalization layer and softmax layers.
Further, the training process of the multi-level supervision network specifically includes:
1) by acquiring a collection of sample in pedestrian's database for collecting, input the multi-level supervision network carry out it is preceding to biography It broadcasts;
2) intersect entropy loss according to the classification results of each categorization module and sample label calculating;
3) half-separation type backpropagation, the backpropagation are carried out to the multi-level supervision network based on intersection entropy loss Cheng Zhong, only the last one corresponding categorization module of each convolutional neural networks module is completed to the reversed of entire multi-level supervision network It propagates, remaining only carries out backpropagation to categorization module;
4) gradient descent algorithm and undated parameter are executed to network according to the gradient that backpropagation obtains, until network is received It holds back.
Further, the final feature vector is spliced by the feature vector that each categorization module obtains.
Further, realize that pedestrian identifies again based on the final feature vector specifically:
By final feature vector and library image feature vector one by one compared with cosine similarity, with the highest preceding k of similarity Image is as the query result identified again.
Further, the sample acquisition process in pedestrian's database specifically includes:
Pedestrian's video is obtained using different cameras by different spatial, is frame by pedestrian's video slicing, to every Each of one frame image difference pedestrian marks an independent sample label, carries out resolution ratio scaling and reversion to every image Processing forms pedestrian's database.
Compared with prior art, the present invention have with following the utility model has the advantages that
1, the present invention devises categorization module, realizes the depth convolutional network that can effectively extract different semantic hierarchies features Structure merges multi-level merging features in identification, so as to improve the distinction and robustness of general characteristic, solves existing The problem for having pedestrian's weight identification technology not high to convolutional network middle layer characteristic use rate, is more efficiently utilized computing resource.
2, the present invention devises a kind of effective multilayer supervision network training mode, the i.e. training method of half-separation type, has Effect improves the stability of training process, and promotes network accuracy rate performance.
3, the inquiry phase identified again in pedestrian, the merging features Cheng Genggao for all levels that multiple categorization modules are obtained The feature vector of dimension, the identification of lifting feature emphatically, and then significant improve the accuracy rate identified again.
4, the method for the present invention is suitble to pedestrian's weight identification mission under complex scene.
Detailed description of the invention
Fig. 1 is the structural schematic diagram of the multi-level supervision network of the present invention;
Fig. 2 is testing result schematic diagram of the invention.
Specific embodiment
The present invention is described in detail with specific embodiment below in conjunction with the accompanying drawings.The present embodiment is with technical solution of the present invention Premised on implemented, the detailed implementation method and specific operation process are given, but protection scope of the present invention is not limited to Following embodiments.
The invention proposes a kind of pedestrian's recognition methods again based on multi-level supervision network, in depth residual error network (ResNet) on the basis of, using the categorization module (Classification Block) of multiple not shared parameters, in network Supervised learning is carried out on different depth, and then the feature of different semantic hierarchies is extracted to pedestrian image.The overall structure of network is such as Shown in Fig. 1.In the network training stage, the present invention uses a kind of supervised learning mode of half-separation type, improves training process Stability, and improve network accuracy rate performance.In inquiry (query) stage that pedestrian identifies again, by the feature of all levels It is spliced into more high-dimensional feature vector, emphatically the identification of lifting feature, and then significant improves the accuracy rate identified again.
The multi-level supervision network that the present invention uses include multilayer depth convolutional neural networks as core network with Multiple categorization modules are turned residual error convolution module pedestrian image by a series of as feature extraction sub-network, the core network It is changed to the characteristic pattern of different semantic hierarchies, each categorization module passes through each layer characteristic pattern that supervised learning extracts core network It is separately converted to the feature vector with distinction, the feature vector splicing shape on all levels is final feature vector, is based on The final feature vector realizes that pedestrian identifies again.The categorization module includes the convolutional layer set gradually, criticizes normalization layer, is non-thread Property active coating, global average pond layer, dropout layer, the full articulamentum of dimension, criticize and normalize layer and softmax layers.Each classification Module not shared parameter.
As shown in Figure 1, in the present embodiment, it is multi-level supervise network be based on existing ResNet50 (He K, Zhang X, Ren S,et al.Deep residual learning for image recognition.Proceedings of the IEEE conference on computer vision and pattern recognition.2016:770-778) it realizes. ResNet50 is constituted with ResNet Block (residual error convolution module), and then extracts characteristic pattern to pedestrian image, and network is whole by 5 A stage composition, is referred to as Block1 to Block5.Each stage includes multiple residual error convolution modules, such as Block4 is by 6 A residual error convolution module composition, respectively Block4_1 to Block4_6;Block5 is made of 3 residual error convolution modules, respectively For Block5_1 to Block5_3.These residual error convolution modules are by several convolutional layers, batch normalization layer, nonlinear activation layer structure At.Traditional pedestrian identifies that network is usually only used to the feature of whole network the last layer (Block5_3) output again, In the present embodiment, 9 categorization modules are provided with, are extracted Block4_1 to Block4_6, Block5_1 to Block5_3 totally 9 The feature of level, and be respectively fed to categorization module and exercise supervision study, to improve the distinction and robustness of feature.
In the present embodiment, the detailed process that categorization module carries out characteristic processing can be divided into:
(1) firstly, the triple channel pedestrian image that dimension is 256*128*3 enters network, by Block1, it is changed into 128* The characteristic pattern of 64*64.
(2) characteristic pattern of 128*64*64 passes through Block2, is changed into the characteristic pattern of 64*32*256.
(3) characteristic pattern of 64*32*256 passes through Block3, is changed into the characteristic pattern of 32*16*512.
(4) characteristic pattern of 32*16*512 sequentially passes through Block4-1 to Block4-6, exports the feature of 6 16*8*1024 Figure.
(5) characteristic pattern of 16*8*1024 sequentially passes through Block5-1 to Block5-3, exports the feature of 3 16*8*2048 Figure.Different from original ResNet50, which removes the down-sampling operation of Block5.
The characteristic pattern of (6) 6 16*8*1024 is respectively fed to categorization module 1 to 6.Categorization module is 1*1* by core size 2048 convolutional layer, batch normalization layer, nonlinear activation layer, global average pond layer, dropout layers, 512 tie up full articulamentum, Batch normalization layer, softmax layers form in order.It should be noted that the not shared parameters of categorization module 1 to 9.
(7) characteristic pattern of 16*8*1024 is converted into the characteristic pattern of 16*8*2048 through 1*1 convolutional layer, by the average pond of the overall situation Changing layer becomes the feature vector of 1*1*2048, using full articulamentum dimensionality reduction, 512 dimensional feature vector of boil down to.
(8) 512 dimensional feature vectors are sent into softmax layers as the feature for representing pedestrian, complete the supervision of classification task It practises.
(9) categorization module 7-9 is similar with categorization module 1 to 6, only difference is that Block5_1 to Block5_3 is exported Characteristic pattern be 16*8*2048 tie up rather than 16*8*1024 tie up.
The present invention is trained multi-level supervision network using the training method of a half-separation type, improves training process Stability, and promote network accuracy rate performance.Training process specifically includes:
Step 1: by acquiring a collection of sample in pedestrian's database for collecting, input the multi-level supervision network carry out it is preceding to It propagates.
The collection of pedestrian's database includes: with preprocessing process
1) pedestrian's video needs to shoot in different spatial using different cameras, to guarantee the more of pedestrian image Sample and otherness form the training set for possessing a large amount of different pedestrian's videos.
It 2) is that frame uses the mode or calculation manually marked for each frame image by video slicing after video collect The mode of method automatic marking intercepts out the pedestrian in frame, saves as individual picture, and mark one to each different pedestrian Independent sample label.
3) every picture is zoomed to the resolution sizes of 256*128, is needed with adapting to the input of subsequent convolutional neural networks It asks, while being adapted to the physical size of human body, avoid anamorphose.
4) by treated, pedestrian's picture carries out horizontal 180 degree reversion one by one, to expand training data, is formed final Pedestrian's database.
Step 2: intersecting entropy loss according to the classification results of each categorization module and sample label calculating.
Step 3: half-separation type backpropagation, the reversed biography being carried out to the multi-level supervision network based on entropy loss is intersected During broadcasting, only the last one corresponding categorization module of each convolutional neural networks module is completed to entire multi-level supervision network Backpropagation, remaining only carries out backpropagation to categorization module.
In the present embodiment, as shown in the dotted line of Fig. 1, for categorization module 1,2,3,4,5,7,8 backpropagation only to point Generic module itself carries out, and the gradient of backpropagation is not back to the ResNet network of trunk, and dotted arrow indicates backpropagation ladder Degree is truncated herein;For categorization module 6 and 9, categorization module itself and entire ResNet core network participate in reversely passing together It broadcasts, completes the supervised learning of entire ResNet core network.
Step 4: gradient descent algorithm and undated parameter being executed to network according to the gradient that backpropagation obtains, until network Convergence.
In the present embodiment, when carrying out pedestrian's weight identification mission using above-mentioned trained multi-level supervision network, one is given Pedestrian image to be searched is opened, completes identification process according to following process:
(1) pedestrian image to be searched is zoomed to the resolution sizes of 256*128, with the good neural network of adaptation training Input demand.
(2) pedestrian image is sent into trained multi-level supervision network and carries out propagated forward.Each categorization module output The feature of one 512 dimension.
(3) feature vector that 9 categorization modules export is spliced, forms the feature vector of one 4608 dimension, represented The final feature of pedestrian.
(4) by feature vector and the feature vector of library image of 4608 dimensions obtained in previous step one by one compared with cosine it is similar Degree, and be ranked up according to similarity.The highest preceding k of similarity images are the query result of identification of attaching most importance to.K can be as needed Setting.
Table 1 illustrates accuracy rate effect of the present invention on Duke-MTMC Reid [4] data set, it can be seen that the party It is higher that method compares accuracy rate with some existing methods, and weight recognition effect is more preferable.In table, Rank@1, Rank@5, Rank@10 distinguish 1 before when indicating using CMC curve statistical, preceding 5, preceding 10 accuracy rate.
1 present invention of table is compared with prior art accuracy rate
Fig. 2 illustrates some effect of visualization of the present invention on Duke-MTMC Reid data set, effect of the present invention compared with It is good.Query indicates that image to be searched, subsequent 10 picture are and the highest 10 library images of pedestrian's similarity.
The preferred embodiment of the present invention has been described in detail above.It should be appreciated that those skilled in the art without It needs creative work according to the present invention can conceive and makes many modifications and variations.Therefore, all technologies in the art Personnel are available by logical analysis, reasoning, or a limited experiment on the basis of existing technology under this invention's idea Technical solution, all should be within the scope of protection determined by the claims.

Claims (9)

1. a kind of pedestrian's recognition methods again based on multi-level supervision network, which is characterized in that this method is based on a multi-level prison It superintends and directs network implementations pedestrian to identify again, the multi-level supervision network includes a multilayer depth convolutional neural networks as backbone network As feature extraction sub-network, pedestrian image is converted to different semantic hierarchies by the core network for network and multiple categorization modules Characteristic pattern, each categorization module are separately converted to have and distinguish by each layer characteristic pattern that supervised learning extracts core network Property feature vector, the feature vector splicing shape on all levels is final feature vector, is realized based on the final feature vector Pedestrian identifies again.
2. pedestrian's recognition methods again according to claim 1 based on multi-level supervision network, which is characterized in that described more Layer depth convolutional neural networks are made of multiple residual error convolution modules, and each residual error convolution module includes several convolutional layers, batch normalizing Change layer and nonlinear activation layer.
3. pedestrian's recognition methods again according to claim 2 based on multi-level supervision network, which is characterized in that Duo Gesuo The input for stating categorization module respectively corresponds the output of multiple residual error convolution modules in core network.
4. pedestrian's recognition methods again according to claim 1 based on multi-level supervision network, which is characterized in that each described Categorization module not shared parameter.
5. pedestrian's recognition methods again according to claim 1 based on multi-level supervision network, which is characterized in that described point Generic module include the convolutional layer set gradually, batch normalization layer, nonlinear activation layer, global average pond layer, dropout layers, It ties up full articulamentum, criticize normalization layer and softmax layers.
6. pedestrian's recognition methods again according to claim 1 based on multi-level supervision network, which is characterized in that described more The training process of level supervision network specifically includes:
1) by acquiring a collection of sample in pedestrian's database for collecting, the multi-level supervision network progress propagated forward is inputted;
2) intersect entropy loss according to the classification results of each categorization module and sample label calculating;
3) half-separation type backpropagation, the back-propagation process are carried out to the multi-level supervision network based on intersection entropy loss In, only part classification module completes the backpropagation to entire core network, remaining only carries out backpropagation to categorization module;
4) gradient descent algorithm and undated parameter are executed to network according to the gradient that backpropagation obtains, until network convergence.
7. pedestrian's recognition methods again according to claim 1 based on multi-level supervision network, which is characterized in that it is described most Whole feature vector is spliced by the feature vector that each categorization module obtains.
8. pedestrian's recognition methods again according to claim 1 based on multi-level supervision network, which is characterized in that be based on institute It states final feature vector and realizes that pedestrian identifies again specifically:
By final feature vector and library image feature vector one by one compared with cosine similarity, with the highest preceding k of similarity images As the query result identified again.
9. pedestrian's recognition methods again according to claim 6 based on multi-level supervision network, which is characterized in that the row Sample acquisition process in personal data library specifically includes:
Pedestrian's video is obtained using different cameras by different spatial, is frame by pedestrian's video slicing, to each frame Each of image difference pedestrian marks an independent sample label, carries out resolution ratio scaling and horizontal inversion to every image Processing forms pedestrian's database.
CN201811299473.0A 2018-10-31 2018-10-31 Pedestrian re-identification method based on multi-level supervision network Active CN109508663B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811299473.0A CN109508663B (en) 2018-10-31 2018-10-31 Pedestrian re-identification method based on multi-level supervision network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811299473.0A CN109508663B (en) 2018-10-31 2018-10-31 Pedestrian re-identification method based on multi-level supervision network

Publications (2)

Publication Number Publication Date
CN109508663A true CN109508663A (en) 2019-03-22
CN109508663B CN109508663B (en) 2021-07-13

Family

ID=65747474

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811299473.0A Active CN109508663B (en) 2018-10-31 2018-10-31 Pedestrian re-identification method based on multi-level supervision network

Country Status (1)

Country Link
CN (1) CN109508663B (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110188635A (en) * 2019-05-16 2019-08-30 南开大学 A kind of plant pest recognition methods based on attention mechanism and multi-level convolution feature
CN110533184A (en) * 2019-08-31 2019-12-03 南京人工智能高等研究院有限公司 A kind of training method and device of network model
CN110781755A (en) * 2019-09-28 2020-02-11 深圳中物智建科技有限公司 Intelligent identification and metering method and system for construction steel bars
CN110969610A (en) * 2019-12-03 2020-04-07 杭州天铂云科光电科技有限公司 Power equipment infrared chart identification method and system based on deep learning
CN111291821A (en) * 2020-02-20 2020-06-16 上海眼控科技股份有限公司 Vehicle weight recognition method and device, computer equipment and storage medium
CN111340141A (en) * 2020-04-20 2020-06-26 天津职业技术师范大学(中国职业培训指导教师进修中心) Crop seedling and weed detection method and system based on deep learning
CN111414815A (en) * 2020-03-04 2020-07-14 清华大学深圳国际研究生院 Pedestrian re-identification network searching method and pedestrian re-identification method
CN112308000A (en) * 2020-11-06 2021-02-02 安徽清新互联信息科技有限公司 High-altitude parabolic detection method based on space-time information
CN112330743A (en) * 2020-11-06 2021-02-05 安徽清新互联信息科技有限公司 High-altitude parabolic detection method based on deep learning
CN112528756A (en) * 2020-11-20 2021-03-19 华南理工大学 Unsupervised pedestrian re-identification method based on heteromorphic image
CN112528059A (en) * 2021-02-08 2021-03-19 南京理工大学 Deep learning-based traffic target image retrieval method and device and readable medium
CN112733695A (en) * 2021-01-04 2021-04-30 电子科技大学 Unsupervised key frame selection method in pedestrian re-identification field
WO2021203801A1 (en) * 2020-04-08 2021-10-14 苏州浪潮智能科技有限公司 Person re-identification method and apparatus, electronic device, and storage medium
CN116912633A (en) * 2023-09-12 2023-10-20 深圳须弥云图空间科技有限公司 Training method and device for target tracking model

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106203318A (en) * 2016-06-29 2016-12-07 浙江工商大学 The camera network pedestrian recognition method merged based on multi-level depth characteristic
CN107103308A (en) * 2017-05-24 2017-08-29 武汉大学 A kind of pedestrian's recognition methods again learnt based on depth dimension from coarse to fine
CN107832672A (en) * 2017-10-12 2018-03-23 北京航空航天大学 A kind of pedestrian's recognition methods again that more loss functions are designed using attitude information
US20180253596A1 (en) * 2017-03-06 2018-09-06 Conduent Business Services, Llc System and method for person re-identification using overhead view images

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106203318A (en) * 2016-06-29 2016-12-07 浙江工商大学 The camera network pedestrian recognition method merged based on multi-level depth characteristic
US20180253596A1 (en) * 2017-03-06 2018-09-06 Conduent Business Services, Llc System and method for person re-identification using overhead view images
CN107103308A (en) * 2017-05-24 2017-08-29 武汉大学 A kind of pedestrian's recognition methods again learnt based on depth dimension from coarse to fine
CN107832672A (en) * 2017-10-12 2018-03-23 北京航空航天大学 A kind of pedestrian's recognition methods again that more loss functions are designed using attitude information

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
FEI JIANG 等: "GRAPH REGULARIZED TENSOR SPARSE CODING FOR IMAGE REPRESENTATION", 《PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) 2017》 *
XIAOBIN CHANG等: "Multi-level Factorisation Net for Person Re-identification", 《2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》 *

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110188635B (en) * 2019-05-16 2021-04-30 南开大学 Plant disease and insect pest identification method based on attention mechanism and multi-level convolution characteristics
CN110188635A (en) * 2019-05-16 2019-08-30 南开大学 A kind of plant pest recognition methods based on attention mechanism and multi-level convolution feature
CN110533184A (en) * 2019-08-31 2019-12-03 南京人工智能高等研究院有限公司 A kind of training method and device of network model
CN110781755A (en) * 2019-09-28 2020-02-11 深圳中物智建科技有限公司 Intelligent identification and metering method and system for construction steel bars
CN110969610A (en) * 2019-12-03 2020-04-07 杭州天铂云科光电科技有限公司 Power equipment infrared chart identification method and system based on deep learning
CN111291821A (en) * 2020-02-20 2020-06-16 上海眼控科技股份有限公司 Vehicle weight recognition method and device, computer equipment and storage medium
CN111414815A (en) * 2020-03-04 2020-07-14 清华大学深圳国际研究生院 Pedestrian re-identification network searching method and pedestrian re-identification method
CN111414815B (en) * 2020-03-04 2023-11-14 清华大学深圳国际研究生院 Pedestrian re-recognition network searching method and pedestrian re-recognition method
WO2021203801A1 (en) * 2020-04-08 2021-10-14 苏州浪潮智能科技有限公司 Person re-identification method and apparatus, electronic device, and storage medium
CN111340141A (en) * 2020-04-20 2020-06-26 天津职业技术师范大学(中国职业培训指导教师进修中心) Crop seedling and weed detection method and system based on deep learning
CN112330743B (en) * 2020-11-06 2023-03-10 安徽清新互联信息科技有限公司 High-altitude parabolic detection method based on deep learning
CN112308000B (en) * 2020-11-06 2023-03-07 安徽清新互联信息科技有限公司 High-altitude parabolic detection method based on space-time information
CN112330743A (en) * 2020-11-06 2021-02-05 安徽清新互联信息科技有限公司 High-altitude parabolic detection method based on deep learning
CN112308000A (en) * 2020-11-06 2021-02-02 安徽清新互联信息科技有限公司 High-altitude parabolic detection method based on space-time information
CN112528756A (en) * 2020-11-20 2021-03-19 华南理工大学 Unsupervised pedestrian re-identification method based on heteromorphic image
CN112528756B (en) * 2020-11-20 2023-06-23 华南理工大学 Unsupervised pedestrian re-identification method based on different composition
CN112733695A (en) * 2021-01-04 2021-04-30 电子科技大学 Unsupervised key frame selection method in pedestrian re-identification field
CN112733695B (en) * 2021-01-04 2023-04-25 电子科技大学 Unsupervised keyframe selection method in pedestrian re-identification field
CN112528059A (en) * 2021-02-08 2021-03-19 南京理工大学 Deep learning-based traffic target image retrieval method and device and readable medium
CN116912633A (en) * 2023-09-12 2023-10-20 深圳须弥云图空间科技有限公司 Training method and device for target tracking model
CN116912633B (en) * 2023-09-12 2024-01-05 深圳须弥云图空间科技有限公司 Training method and device for target tracking model

Also Published As

Publication number Publication date
CN109508663B (en) 2021-07-13

Similar Documents

Publication Publication Date Title
CN109508663A (en) A kind of pedestrian's recognition methods again based on multi-level supervision network
CN110427813B (en) Pedestrian re-recognition method of twin generation type countermeasure network based on gesture guidance pedestrian image generation
CN112651262B (en) Cross-modal pedestrian re-identification method based on self-adaptive pedestrian alignment
CN101493887B (en) Eyebrow image segmentation method based on semi-supervision learning and Hash index
CN113963032A (en) Twin network structure target tracking method fusing target re-identification
CN109492528A (en) A kind of recognition methods again of the pedestrian based on gaussian sum depth characteristic
CN111723600B (en) Pedestrian re-recognition feature descriptor based on multi-task learning
Wang et al. Global relation reasoning graph convolutional networks for human pose estimation
El-Ghaish et al. Human action recognition based on integrating body pose, part shape, and motion
CN112434599A (en) Pedestrian re-identification method based on random shielding recovery of noise channel
CN111291612A (en) Pedestrian re-identification method and device based on multi-person multi-camera tracking
CN113221770A (en) Cross-domain pedestrian re-identification method and system based on multi-feature hybrid learning
CN113761995A (en) Cross-mode pedestrian re-identification method based on double-transformation alignment and blocking
CN114333062B (en) Pedestrian re-recognition model training method based on heterogeneous dual networks and feature consistency
CN117351518B (en) Method and system for identifying unsupervised cross-modal pedestrian based on level difference
CN112613474B (en) Pedestrian re-identification method and device
CN116453192A (en) Self-attention shielding face recognition method based on blocking
CN113869151B (en) Cross-view gait recognition method and system based on feature fusion
CN116935300A (en) Gesture driving and dynamic alignment shielding pedestrian re-recognition method
CN113435248A (en) Mask face recognition base enhancement method, device, equipment and readable storage medium
Tajik et al. Video-based Person Re-Identification Using Attention Based on Separation Index
Huang et al. A Person Re-identification Method Fusing Bottleneck Transformer andRelation-aware Global Attention
CN112712042B (en) Pedestrian re-identification end-to-end network architecture with embedded key frame extraction
Chang et al. Improving Retrieval Efficiency of Person Re-Identification Based on Resnet50
CN112487232B (en) Face retrieval method and related products

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 200030 Dongchuan Road, Minhang District, Minhang District, Shanghai

Applicant after: Shanghai Jiaotong University

Address before: 200030 Huashan Road, Shanghai, No. 1954, No.

Applicant before: Shanghai Jiaotong University

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant