CN107577990A

CN107577990A - A kind of extensive face identification method for accelerating retrieval based on GPU

Info

Publication number: CN107577990A
Application number: CN201710675398.2A
Authority: CN
Inventors: 邹复好; 曹锋; 李开; 王浩; 白兴强; 栾朝阳
Original assignee: WUHAN SHIJI JINQIAO SAFETY TECHNOLOGY Co Ltd; Huazhong University of Science and Technology
Current assignee: WUHAN SHIJI JINQIAO SAFETY TECHNOLOGY Co Ltd; Huazhong University of Science and Technology
Priority date: 2017-08-09
Filing date: 2017-08-09
Publication date: 2018-01-12
Anticipated expiration: 2037-08-09
Also published as: CN107577990B

Abstract

The invention discloses a kind of extensive face identification method for accelerating retrieval based on GPU, it is related to computer vision field, including Face datection is with aliging, face characteristic extraction, the coarse matching that Hash feature is obtained, face index data base is established, more GPU accelerate, Candidate Set based on Hash obtains, the accurate matching based on distance metric and ballot obtain and most match the steps such as people.The two benches characteristic matching of extensive face identification method based on hash index and more GPU speed-up computations disclosed by the invention for accelerating retrieval based on GPU, the screening of candidate feature vector can be accelerated using computation capability powerful GPU, it is greatly reduced when being retrieved on large-scale dataset and consumes, the types of applications demand that based on the realization of depth convolutional neural networks and requirement of real-time is higher can be met well.

Description

Large-scale face recognition method based on GPU (graphics processing Unit) accelerated retrieval

Technical Field

The invention relates to the field of computer vision, in particular to a large-scale face recognition method based on GPU (graphics processing unit) accelerated retrieval.

Background

In recent years, with the rapid improvement of computer performance and the continuous completion of deep learning methods, the fields of pattern recognition and artificial intelligence have made significant breakthroughs. People have excellent effects on many pattern recognition tasks through a deep learning method, and face recognition is not exceptional. With the advent of the big data era, the face image data is more and more abundant, and how to efficiently and accurately identify the identity information of a person in a large-scale face data set is a hotspot of research in the field of pattern recognition and information retrieval at present.

The human face recognition is an important means for identity recognition, and has extremely high theoretical and application values. And the image retrieval based on the human face is a very meaningful direction in the field of information retrieval, and has very wide application. For example, in the entertainment field, the most similar star face can be found by submitting own images; in the public security field, criminals can be searched for through face comparison and retrieval; in the security field, the method can also relate to the applications of an access control system, blacklist monitoring, water visitor identification and the like; in addition, the method has great application requirements in the fields of self-service of banks, people and certificates unification of hotels, information security and the like. Therefore, the face classifier which has both recognition efficiency and accuracy under the large-scale data environment is developed and has extremely high practical significance.

The traditional face retrieval method is that face features are extracted manually, then face feature library is searched based on nearest neighbor, and search based on face images is converted into similarity measurement based on real-value feature vectors. This approach works well on small-scale datasets, but once the dataset grows, the efficiency and accuracy of recognition drops dramatically. In addition, the feature vector of the face is usually a high-dimensional feature vector, and under the condition of high dimension, if we also perform nearest neighbor search on the whole database, the efficiency is very low.

The face recognition under the condition of large-scale data is essentially a retrieval problem of multimedia data, and a plurality of data with the closest face to be recognized on a feature space are returned, namely an approximate nearest neighbor search algorithm. In the field of image approximate search, two implementation methods can be used, one is to directly perform similarity search in a high-dimensional feature space, and the other is to map the high-dimensional space to a Hamming space and convert the high-dimensional space into a retrieval problem based on a semantic hash method. The former has a significant disadvantage in the case of large data volumes, namely the "dimensional disaster" problem. In order to solve the problem of dimension disaster, students make a lot of researches on semantic hashing methods, the semantic hashing methods can generate very compact hash codes to directly reflect semantic information of an original feature space, and the hamming distances are close if the features are close to each other in the original feature space, otherwise, the hamming distances are far. However, in the approximate search research work based on the semantic hash method, most of the approximate search research works are hash generation methods, and few researches are made on designing hash indexes to improve the retrieval efficiency. Therefore, how to speed up the reduction of time consumption of approximate search of the hash method in large-scale hash data is a valuable research direction.

Disclosure of Invention

Aiming at the defects in the prior art, the invention aims to provide a large-scale face recognition method based on GPU (graphics processing unit) accelerated retrieval, which can accelerate the screening of candidate feature vectors by utilizing the powerful parallel computing capability of the GPU, greatly reduce the retrieval time consumption on a large-scale data set and well meet various application requirements based on the realization of a deep convolutional neural network and higher real-time requirement.

In order to achieve the above purposes, the technical scheme adopted by the invention is as follows:

a large-scale face recognition method based on GPU accelerated retrieval comprises the following steps:

s1, inputting the picture to be detected into an MTCNN network, detecting the position of the face and the position of key points in the picture by adopting a face detection algorithm, and aligning the detected face;

s2, extracting real-valued feature vectors of the face picture and the picture mirror image processed in the step S1 by using a trained deep learning model, and then fusing the two real-valued feature vectors and reducing dimensions to obtain real-valued features of the face;

s3, converting the real-value features into hash features;

s4, repeating the steps S1-S3 to detect the faces to be detected one by one, and establishing a key value type face database by using the hash feature as an index and the real-value feature vector as a value;

s5, using a multi-GPU accelerated Hash search algorithm to obtain k Hash features which are adjacent to the Hash features of the picture to be detected;

s6, using the k Hash features obtained in S5 as indexes, searching in the face database to obtain a candidate set consisting of k real-value feature vectors;

s7, calculating the vector similarity measurement distance between the real-valued feature vector of the photo to be inquired and the real-valued feature vector in the candidate set;

and S8, voting to obtain the score of each photo to be inquired according to the vector similarity measurement distance between the real-value feature vector in the candidate set and the real-value feature vector of the photo to be inquired, and taking the highest score as a face recognition result.

On the basis of the above technical solution, the step S1 specifically includes the following steps:

s101, inputting a picture to be detected into an MTCNN (multiple terminal connected network), generating a face candidate set window and regression position coordinates thereof by using a first CNN, and combining face windows with high overlapping degree by using a non-maximum suppression algorithm to generate a face window candidate set;

s102, sending the result obtained in the step S101 to a second CNN which is more complex than the first CNN, and performing re-filtering and fine adjustment on the position of the face window;

s103, sending the result obtained in the step S102 to a third CNN which is more complex than the second CNN for fine adjustment, and generating the position of a final face window in each picture to be detected and coordinates of five face key points;

s104, judging whether a final face window is generated in the step S103, and if not, finishing the identification; if so, extracting a final face window image, correcting and aligning the face to the center, and storing the corrected face window image as the specified resolution.

On the basis of the technical scheme, a face feature extraction network is used for extracting a face photo and a real-value feature vector of a photo mirror image;

the face feature extraction network is a 32-layer deep convolutional neural network and comprises a convolutional layer, a down-sampling layer, a PRelu activation layer, a full connection layer and a loss function layer.

On the basis of the technical scheme, the loss function layer comprises two loss functions of softmax-loss and center-loss, and the softmax-loss function is used for improving the intra-class polymerization degree of the sample in the feature space after network mapping; the center-loss function is used for increasing the inter-class distance of the sample in the feature space after network mapping.

On the basis of the above technical solution, in the step S2, the real-valued eigenvector of the face photo processed in the step S1 and the real-valued eigenvector of the mirror image of the photo are fused according to the following formula:

fi＝max(xi，yi)i＝1，2，…，n

wherein xi and yi are the ith dimension of the vector x and y to be fused respectively, n is the dimension of the real-valued eigenvector, and fi is the ith dimension of the fused real-valued eigenvector.

On the basis of the above technical solution, in the step S3, the face real-valued feature obtained in the step S2 is converted into a hash feature according to the following formula:

f(x)＝0.5×(sign(x)+1)

wherein,

on the basis of the above technical solution, in the step S5, obtaining the hash features of k neighbors by using a multiple-GPU accelerated hash lookup algorithm specifically includes the following steps:

s501, dividing a data set consisting of all N Hash features into M parts according to the set GPU number M, wherein each part does not exceed (N + M-1)/M Hash features, and copying all the Hash features of the divided subdata sets from a host to corresponding equipment ends;

s502, setting a calculation thread number for each GPU, and calculating Hamming distances between the hash code to be inquired and all hash codes in a data set in parallel;

s503, dividing all Hamming distances into SUBN/K groups according to K adjacent data as a group, executing in parallel, merging and sorting the Hamming distances of all SUBN/K groups into order in the group;

s504, sequentially finding out the K data with the minimum hamming distance in the array subscripts [ nK, (n +1) K) and [ (n +1) K, (n +2) K), and moving it to the position of [ nK, (n +1) K), where n ═ 0, 2, 4 …, each time comparing the maximum value in the array subscript [ nK, (n +1) K) with the minimum value in the array subscripts [ (n +1) K, (n +2) K), and exchanging the two values if the maximum value in [ nK, (n +1) K) is larger;

s505, repeating the steps S503 and S504, respectively sequencing [0, K ], [2K, 3K ], [4K, 5K) …, grouping the 2K arrays of [0, K) and [2K, 3K ]), finding out the first K minimum Hamming distances, moving the Hamming distances to [0, K ], and so on until the first K Hamming distances of the data packets on all M GPUs are stored at the positions of [0, K);

s506, copying the calculation results of the M GPUs to a host-side memory, and traversing the M data by the maximum heap sorting for all the M data by the M Hamming distances to obtain the minimum K Hamming distances in the M groups of data.

Based on the above technical solution, in the step S7, the distance between the real-valued feature vector of the face to be queried obtained in the step S5 and each real-valued feature vector in the candidate set is calculated by using a Cosine metric or a euclidean metric.

Based on the above technical solution, in step S8, the voter formula used is as follows:

score (ID) is the final voting score of each face ID in the candidate set, sim is the distance between the real-valued feature vector of the face to be queried obtained in step S7 and the real-valued feature vector in the candidate set, and threshold is a set threshold.

Compared with the prior art, the invention has the advantages that:

(1) the large-scale face recognition method based on GPU accelerated retrieval is based on two-stage feature matching of the Hash index and multi-GPU accelerated calculation, can accelerate screening of candidate feature vectors by utilizing the strong parallel computing capability of the GPU, greatly reduces retrieval time consumption on a large-scale data set, and can well meet various application requirements which are based on realization of a deep convolutional neural network and have high real-time requirements.

(2) The large-scale face recognition method based on GPU accelerated retrieval has the advantages that the accuracy rate on an LFW face test set reaches 99.48%, the recognition rate in a MegaFace test task reaches 72.5%, and the effect of ensuring high recognition accuracy rate while greatly reducing the retrieval time consumption on a large-scale data set is realized.

(3) The large-scale face recognition method based on GPU accelerated retrieval has the advantages that each step is relatively independent, certain step can be replaced and adjusted along with technical progress or actual requirements without influencing implementation of other steps, and the expansibility is good.

Drawings

FIG. 1 is a schematic diagram of a large-scale face recognition retrieval method based on GPU acceleration according to an embodiment of the present invention;

FIG. 2 is a MTCNN frame diagram for face detection and key point location in an embodiment of the present invention;

FIG. 3 is a diagram of a network structure for extracting facial features based on deep learning according to an embodiment of the present invention;

FIG. 4 is a frame diagram of face real-valued feature extraction and fusion proposed in the embodiment of the present invention;

fig. 5 is a flowchart of a hash lookup algorithm based on GPU acceleration according to an embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings and examples.

The terms used in the examples of the present invention are explained as follows:

MTCNN: multi-task conditional neural network, multitasking convolutional neural network;

CNN: a convolutional neural network;

PReLU (parametric reconstructed Linear Unit): activation function with parameters.

Referring to fig. 1, an embodiment of the present invention provides a large-scale face recognition method based on GPU accelerated retrieval, including the following steps:

s2, extracting real-valued feature vectors of the face picture and the picture mirror image processed in the step S1 by using a trained deep learning model, and then fusing the two vectors and reducing dimensions to obtain the real-valued features of the face;

s3, designing a hash function to convert the human face real value features obtained in the step S2 into hash features;

s4, repeating the steps S1-S3 to detect the faces to be detected one by one, using the hash features in the step S3 as indexes and real-value feature vectors as values, and establishing a key value type face database;

s5, processing the query photo according to the steps S1-S3 to obtain hash characteristics, and obtaining the hash characteristics of k neighbors by using a multi-GPU accelerated hash search algorithm;

s6, using the hash feature obtained in S5 as an index, searching in the face database established in the step S4 to obtain a candidate set corresponding to the real-value feature vector;

s7, calculating the Hamming distance between the real-value feature vector of the photo to be inquired and the feature vector in the candidate set;

and S8, after subtracting a threshold value according to the Hamming distance between the vector in the candidate set and the vector to be inquired, voting to obtain the score of each candidate ID, and taking the highest score as a face recognition result.

The large-scale face recognition method based on GPU accelerated retrieval provided by the invention divides the query process into a rough matching stage and a fine matching stage. In the rough matching stage, a depth hashing technology is firstly utilized to generate hash features corresponding to each face image, efficient indexes are built for all the hash features in a data set, a hash searching algorithm of GPU accelerated calculation is used for inquiring the Hamming distances between the hash features of the face image to be searched and all the hash features, faces corresponding to the first k hash features which meet the requirement that the distances are smaller than a specific Hamming distance and the Hamming distances are the smallest serve as candidate sets, and results obtained in the rough matching stage are obtained. In the accurate matching stage, the corresponding real-valued features are taken out according to the Hash features in the candidate set, and a proper similarity measurement method is selected to compare the real-valued features in the candidate set with the real-valued features of the face to be retrieved. And (4) sending the comparison result to a voter, and finally obtaining the face ID with the highest vote score as the face identification result.

The following steps detail the detailed process of the large-scale face recognition method based on GPU accelerated retrieval in one embodiment of the method of the invention:

step S1, inputting the picture to be detected into an MTCNN network, detecting the position of the face and the position of key points in the picture by adopting a face detection algorithm, and aligning the detected face;

the invention adopts MTCNN method as face detection and key point positioning method, the overall frame diagram is shown in figure 2, and the processing of face detection and key point positioning specifically comprises the following steps:

s103, fine-tuning the result obtained in the step S102 through a third CNN which is more complex than the second CNN, and generating the final face window position of each picture to be detected and the coordinates of five face key points;

s104, judging whether a final face window is generated in the step S103, and if not, finishing the identification; if yes, extracting a final face window image, correcting the face to be in the middle, and storing the face to be in the specified resolution.

As shown in fig. 3, the MTCNN in the embodiment of the present invention processes an image in three stages: the first CNN uses a full convolution network P-Net (Proposal network) to obtain a part of a face window candidate set, wherein a bounding box regression is used for calibrating and an NMS is used for merging candidate boxes; then sending the data into a more complex second CNN, and removing more non-face areas by using a full convolution network R-Net (refine network); and finally, inputting the result into a third more complex CNN network O-Net (output network) for fine processing, and outputting a final face frame and five face key point positions.

The method designs three network structures for cascade optimization processing. Compared with a multi-classification target detection task, the human face detection task is a two-classification problem, so that the human face detection task needs fewer filters compared with the target detection, but needs better discrimination, and a deeper network structure is designed in the O-Net to extract better semantic features. In order to achieve the real-time object, the sizes of the designed convolution kernels are both 3 × 3 and 2 × 2, so that the amount of calculation can be reduced, and the three CNN structures are shown in fig. 3.

Inputting the picture to be detected into the MTCNN network, obtaining whether the input picture has a face, the position of a face window and the coordinates of key points of the face after the processing in the steps S101-S103, and obtaining the processed face picture required in the step S2 after the processing in the step S104.

Step S2, extracting the real-valued feature vectors of the face picture and the picture mirror image processed in the step S1 by using a trained deep learning model, and then fusing the two vectors and reducing the dimension to be used as the real-valued features of the face;

the human face feature extraction network designed by the invention is formed by stacking the structures of the residual blocks according to the residual network-Resnet, and a 32-layer deep convolutional neural network is designed, and comprises a convolutional layer, a down-sampling layer, a PRelu activation layer, a full connection layer and other different types of structures, complex nonlinear transformation is fitted through the combination of the structures, and the whole network structure is shown in figure 4.

The specific configuration and parameter settings of the network are shown in the following table:

the input to the network is an image with a resolution of 96 × 112 × 3, and 512-dimensional features are output. The network structure has 32 layers, Conv represents a convolutional layer, MP represents a down-sampling layer (adopting a maximum pooling method), and FC represents a full connection layer. The repetition represents the number of times the structure overlaps and the output is the output size of the feature after passing through the layer. It can be seen from the table that the more the number of parameters is, the more the network structure is, the number of parameters of the last fully-connected layer is half of the total number of parameters, and the final output feature vector is 512-dimensional. The loss function layer is arranged behind the last FC layer, and the feature extraction network used by the method simultaneously uses two loss functions of softmax-loss and center-loss to improve the cluster aggregation and the cluster distance separation and finally improve the accuracy. On the basis of softmax-loss, the Center-loss records a class Center in a feature space of each class of a training set respectively, in the training process, distance constraint between a sample and the class Center in the feature space after network mapping is increased, the aggregation degree of the mapped features in the classes is improved, and meanwhile, the distance between the classes is increased by combining softmax-loss, so that the learned features have better generalization and discrimination capability.

because the data scale processed by the method is in hundred million level, in order to search the real-valued feature vector corresponding to the hash feature more quickly, Redis can be adopted to store the real-valued vector, each hash index corresponds to a plurality of feature vectors, if the hash feature generated by a certain feature vector does not exist in the database, the corresponding hash index is added, otherwise, the feature vector is added to the corresponding hash index. In order to store information about a human face, the present invention uses three tables to store corresponding information, which are hash _ set, face _ info _ hash, and person _ info _ hash. Wherein hash _ set is a set type data structure that stores all hash indices. The face _ info _ Hash and the person _ info _ Hash are Hash type data structures in the Redis, and store data in a key-value pair form, wherein the face _ info _ Hash stores related information of each face, the person _ info _ Hash stores information of each person, each person has a unique ID, and meanwhile, each person can have multiple faces.

The concrete structure of the person _ info _ hash key is as follows:

the specific structure of the face _ info _ hash key is as follows:

because both tables are key-value data structures, new information can be freely added, each human face sheet stores a corresponding hash index in the face _ info _ hash table, the corresponding hash index is a key with a key name, real-value feature vectors of a plurality of human faces are stored in the key, and the key name of each feature vector is formed by the id and the number of the human face as shown in the following table:

and step S5, performing rough matching on the real-value feature vector and the hash feature vector corresponding to the face photo to be inquired obtained in the pre-order step. In the large-scale face recognition method, the rough matching stage is to retrieve K hash characteristics which are closest to the Hamming distance of the picture to be inquired. As is well known, a GPU is a graphics processor for image rendering, which integrates a very large number of computational cores, commonly used for data processing and scientific computing. The strong parallel computing capability of the GPU meets the application scene of large-scale feature distance computing, namely K hash features meeting query conditions are searched in large-scale Hash features. Therefore, the invention designs a Top K Hash search algorithm based on multi-GPU acceleration. The main flow of the algorithm is shown in fig. 5, and specifically comprises the following steps:

s501, dividing a data set consisting of all N hash features into M parts according to the set GPU number M, wherein each part does not exceed SUBN (N + M-1)/M hash features, and copying all the hash features of the divided sub data set from a host to a corresponding device end;

The algorithm aims at searching K hash features with the minimum Hamming distance from the query hash feature in the N hash features, firstly dividing all the N hash features into M parts according to the number M of available GPUs, wherein each part contains (N + M-1)/M hash features, respectively copying the hash features of each part from a host to a device end, and then, respectively carrying out the following operations on each GPU, setting the Block (Block) and Grid (Grid) sizes used for calculation of each GPU, quickly calculating the Hamming distances between the hash code to be inquired and all the hash codes in the data set by using the parallel calculation capacity of the GPU, finally selecting the first K minimum Hamming distances from all the Hamming distances and returning the Hamming distances, and then combining the results calculated by the M GPUs at the host end according to the concept of distance use merging and sorting to obtain the K hash features with the minimum Hamming distance with the inquired hash features. The specific flow is as described in the above steps S501 to S506.

The whole algorithm realizes the quick calculation and retrieval of the hash on the GPU by using the ideas of Merge (Merge Sort) and Bitonic Sort (Bitonic Sort). In the Hash indexing method based on GPU acceleration, each GPU needs to maintain an index structure the same as a Hamming distance list, when the Hamming distance changes, the position of the index is moved at the same time, and finally the position of the Top K Hamming distance is the position of the corresponding index.

The main video memory overhead of the algorithm is to store all Hamming distances, and the space complexity is O_(N)In the time overhead, the calculation of Hamming distance and the overhead of binary merging and sorting are mainly included, and in the two processes, the time complexity of the Hamming distance is O_(N)The latter time complexity is O_(NlogN)Thus overall time complexity of O_(NlogN)。

After candidate hash indexes are found out by using a GPU accelerated hash lookup algorithm, face feature vectors which are correspondingly stored in all the hash indexes are obtained from a face database according to the hash indexes, and the face feature vectors form a candidate set.

In step S6, the hash feature vector obtained in step S5 is used as a key name, a key value corresponding to the key name in Redis is queried, so as to obtain a candidate feature vector, and according to the establishment process of the face database in step 4, the sub-key name of the feature vector stored in each hash index contains Id of a person corresponding to the feature vector, and according to Id and a corresponding real-valued feature vector, all hash indexes obtained in step 5 in Redis sequentially queried, and all obtained sub-key value pairs form a feature vector candidate set with a map structure, where the map structure is as follows:

Id	feature vector
		Id1_face_feature_0	Id1 real-valued feature of the person's photograph 1
Id2_face_feature_0	Id2 real-valued feature of the person's photograph 1
		Id2_face_feature_1	Id2 real-valued feature of person's photograph 2
…	…

Step S7, calculating the Hamming distance between the real-valued eigenvector of the photo to be inquired and the eigenvector in the candidate set;

in this step, the distance between the real-valued feature vector of the face to be queried obtained in step S5 and each feature vector in the candidate set is calculated, and the implementation of the present invention uses Cosine as the similarity metric, but is not limited to Cosine metric, and may also use euclidean metric, etc. When the Cosine distance of two vectors is closer to 1, the two vectors are more similar. And storing the calculated key name and distance according to each feature vector in a map structure for the next step of processing.

And step S8, after subtracting a threshold value according to the Hamming distance between the vector in the candidate set and the vector to be inquired, voting to obtain the score of each candidate ID, and taking the highest score as a face recognition result.

After the similarity scores between the facial features to be queried and all people in the candidate set are obtained, since each person in the candidate set may have more than one picture, a voter needs to be designed to vote for the face ID, and the voter in this embodiment is designed as follows:

score (ID) is the final voting score of each face ID in the candidate set, sim is the distance between the real-valued feature vector of the face to be queried obtained in step S7 and the feature vector in the candidate set, and threshold is a set threshold. It can be seen that when the cosine distance is greater than the threshold, the score of the person corresponding to the picture is increased, otherwise, the score of the corresponding person is decreased, and the ID with the largest voting score is the final recognition result.

The present invention is not limited to the above-described embodiments, and it will be apparent to those skilled in the art that various modifications and improvements can be made without departing from the principle of the present invention, and such modifications and improvements are also considered to be within the scope of the present invention. Those not described in detail in this specification are within the skill of the art.

Claims

1. A large-scale face recognition method based on GPU accelerated retrieval is characterized by comprising the following steps:

s3, converting the real-value features into hash features;

s4, repeating the steps S1-S3 to detect the faces to be detected one by one, using the hash features as indexes and real-value feature vectors as values, and establishing a key value type face database;

2. The large-scale face recognition method based on GPU-accelerated retrieval as recited in claim 1, wherein: the step S1 specifically includes the following steps:

s101, inputting a picture to be detected into an MTCNN (multiple terminal connected network), generating a face candidate set window and regression position coordinates thereof by using a first CNN, and combining the face windows with high overlapping degree by using a non-maximum suppression algorithm to generate a face window candidate set;

3. The large-scale face recognition method based on GPU-accelerated retrieval as recited in claim 1, wherein:

extracting a face photo and a real-value feature vector of a photo mirror image by using a face feature extraction network;

4. The large-scale face recognition method based on GPU-accelerated retrieval as recited in claim 3, wherein: the loss function layer comprises two loss functions of softmax-loss and center-loss, and the softmax-loss function is used for improving the intra-class polymerization degree of the sample in the feature space after network mapping; the center-loss function is used for increasing the inter-class distance of the sample in the feature space after network mapping.

5. The large-scale face recognition method based on GPU-accelerated retrieval as recited in claim 1, wherein: in step S2, the real-valued eigenvector of the face photograph processed in step S1 and the real-valued eigenvector of the mirror image of the photograph are fused according to the following formula:

f_i＝max(x_i，y_i)i＝1，2，…，n

wherein x is_iAnd y_iRespectively, the ith dimension of the vector x, y to be fused, n the dimension of the real-valued eigenvector, f_iIs the ith dimension of the fused real-valued eigenvector.

6. The large-scale face recognition method based on GPU-accelerated retrieval as recited in claim 1, wherein: in the step S3, the face real-valued features obtained in the step S2 are converted into hash features according to the following formula:

f(x)＝0.5×(sign(x)+1)

wherein,

7. the large-scale face recognition method based on GPU-accelerated retrieval as recited in claim 1, wherein: in step S5, obtaining the hash features of the k neighbors by using the multiple GPU accelerated hash lookup algorithm specifically includes the following steps:

s501, dividing a data set consisting of all N hash features into M parts according to the set GPU number M, wherein each part does not exceed SUBN (N + M-1)/M hash features, and copying all the hash features of the divided subdata set from a host to a corresponding device end;

s506, copying the calculation results of the M GPUs to a memory of the host end, and traversing the M data by the maximum heap sorting for all the M data by the maximum heap sorting to obtain the minimum K Hamming distances in the M groups of data.

8. The large-scale face recognition method based on GPU-accelerated retrieval as recited in claim 1, wherein: in step S7, the distance between the real-valued feature vector of the face to be queried obtained in step S5 and each real-valued feature vector in the candidate set is calculated by using Cosine metric or euclidean metric.

9. The large-scale face recognition method based on GPU-accelerated retrieval as recited in claim 1, wherein: in step S8, the voter formula used is as follows: