CN109145986B - Large-scale face recognition method - Google Patents

Large-scale face recognition method Download PDF

Info

Publication number
CN109145986B
CN109145986B CN201810956517.6A CN201810956517A CN109145986B CN 109145986 B CN109145986 B CN 109145986B CN 201810956517 A CN201810956517 A CN 201810956517A CN 109145986 B CN109145986 B CN 109145986B
Authority
CN
China
Prior art keywords
loss function
face recognition
optimization
steps
softmax loss
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810956517.6A
Other languages
Chinese (zh)
Other versions
CN109145986A (en
Inventor
杨世杰
黄坤山
彭文瑜
林玉山
杨表
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Foshan Tongguan Technology Co ltd
Foshan Nanhai Guangdong Technology University CNC Equipment Cooperative Innovation Institute
Original Assignee
Foshan Tongguan Technology Co ltd
Foshan Nanhai Guangdong Technology University CNC Equipment Cooperative Innovation Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Foshan Tongguan Technology Co ltd, Foshan Nanhai Guangdong Technology University CNC Equipment Cooperative Innovation Institute filed Critical Foshan Tongguan Technology Co ltd
Priority to CN201810956517.6A priority Critical patent/CN109145986B/en
Publication of CN109145986A publication Critical patent/CN109145986A/en
Application granted granted Critical
Publication of CN109145986B publication Critical patent/CN109145986B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/172Classification, e.g. identification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)
  • Collating Specific Patterns (AREA)

Abstract

A large-scale face recognition method comprises the following steps: the method comprises the following steps: training data and cleaning the trained data; step two: setting a network; step three: inputting settings, adopting 5 key points to calibrate the human face and normalizing the human face to a 112 × 112 color image; step four: outputting the setting; step five: and optimizing the Softmax loss function to obtain a new Softmax loss function, and improving the accuracy of face recognition according to the new Softmax loss function. The invention provides a large-scale face recognition method, which improves the face recognition precision, thereby realizing large-scale face recognition.

Description

Large-scale face recognition method
Technical Field
The invention relates to the technical field of face recognition, in particular to a large-scale face recognition method.
Background
The face recognition identity authentication technology is widely applied to the fields of security, entrance guard, attendance checking and the like due to the characteristics of convenience, rapidness, non-contact and the like. The convolutional neural network has the characteristic of large capacity in the aspect of characteristic, and is suitable for large-scale face recognition. However, in the face recognition process, intra-class changes caused by factors such as facial expressions, postures, ages, positions and coverings of a person and inter-class changes caused by different identities from outside light, backgrounds and the like need to be considered, the two changes are highly complex and nonlinear, and are generally analyzed by using a Softmax loss function, but the Softmax loss function only separates inter-class features and does not gather features belonging to the same class, and the features are not effective enough for face recognition.
Disclosure of Invention
The invention aims to provide a large-scale face recognition method, which improves the face recognition precision and further realizes large-scale face recognition.
In order to achieve the purpose, the invention adopts the following technical scheme:
a large-scale face recognition method comprises the following steps:
the method comprises the following steps: training data and cleaning the trained data;
step two: setting a network;
step three: inputting settings, adopting 5 key points to calibrate the human face and normalizing the human face to a 112 × 112 color image;
step four: outputting the setting;
step five: optimizing the Softmax loss function to obtain a new Softmax loss function, and improving the accuracy of face recognition according to the new Softmax loss function, wherein the Softmax loss function is defined as follows:
Figure BDA0001772802320000021
where m represents the number of samples, n represents the number of sample classifications, s represents the scale coefficient,m+θyrepresents an angle increment manner, wherein m is 0.5.
Preferably, in step five, the optimization of the Softmax loss function includes a primary optimization and a secondary optimization, and the primary optimization includes the following steps:
setting bias as 0, and adding a margin to the included angle of projection to obtain a primary optimization formula as follows:
Figure BDA0001772802320000022
where m represents the number of samples, n represents the number of sample classifications, s represents the scale coefficient,m+θyrepresents an angle increment manner, wherein m is 0.5.
Preferably, the second optimization comprises adding a fixed value to the angle.
Preferably, in step one, the training data comprises training using a VGG network.
Preferably, in step four, the output setting includes a method of using the optimal structure (contribution- > BN- > Dropout- > Fullyconnected- > BN) to realize the connection of the last Convolution layer to the feature vector.
Has the advantages that:
by optimizing a traditional softmax loss function, theta is expressed as an included angle formed by normalizing w and x, so that only one straight line exists between classes, the boundary is enlarged, and classification is facilitated. Meanwhile, training data are optimized and cleaned, and an optimal network structure is selected, so that the recognition precision is greatly improved, and along with the improvement of the recognition precision, the recognition efficiency and accuracy of large-scale human faces are improved.
Drawings
Fig. 1 is a flow chart of the present invention for improving the accuracy of large-scale face recognition.
Detailed Description
The technical scheme of the invention is further explained by the specific implementation mode in combination with the attached drawings.
As shown in fig. 1, the large-scale face recognition method of this embodiment includes the following steps:
the method comprises the following steps: training data and cleaning the trained data; the training set used VGG network training 8000 people, contained 3000000 images, the test set contained 160000 images of 500 people, image quality spanning pose, age, lighting, race, occupation, etc. The training data is cleaned.
Step two: setting a network; various network settings were verified using the MxNet architecture, using VGG2 as training data, softmax as a loss function. The batch size is set to 512, the learning rate starts from 0.1, the learning rate is reduced by 10 times at 100k, 140k and 160k respectively, the impulse unit takes a value of 0.9, and the weight attenuation is set to 5 e-4.
Step three: inputting settings, adopting 5 key points to calibrate the human face and normalizing the human face to a 112 × 112 color image; the face was aligned using 5 keypoints and normalized to a 112 × 112 color image, and in order to ensure the resolution of the feature map, this example replaced conv7 × 7stride2 with conv3 × 3stride 1. The output of the last convolution is 7 x 7.
Step four: outputting the setting; including the implementation of the last Convolution layer connected to the feature vector by the optimal structure (constraint- > BN- > Dropout- > Fullyconnected- > BN).
Step five: optimizing the Softmax loss function to obtain a new Softmax loss function, and improving the accuracy of face recognition according to the new Softmax loss function, wherein the definition of the common Softmax loss function is as follows:
Figure BDA0001772802320000041
the traditional Softmax can not optimize the inter-class distance and the intra-class distance of the feature points, and bias is set to be 0 for calculation convenience, so that the included angle after normalization is represented. In such Softmax, class-to-class boundaries are simply one line. Such points that fall near the boundary may make the generalization capability of the entire model poor. To make this limit larger, the points between the different classes are made as far apart as possible. This can be achieved by adding a margin to the included angle of the projection. The function after the initial optimization is therefore as follows:
Figure BDA0001772802320000042
after the primary optimization, the Softmax function stops converging in the initial stage, so that secondary optimization is performed, a fixed value is added to the angle, and the function after the secondary optimization is as follows:
Figure BDA0001772802320000043
after optimization, the angular distance is more directly affected on the angle than the cosine distance.
The technical principle of the present invention is described above in connection with specific embodiments. The description is made for the purpose of illustrating the principles of the invention and should not be construed in any way as limiting the scope of the invention. Based on the explanations herein, those skilled in the art will be able to conceive of other embodiments of the present invention without inventive effort, which would fall within the scope of the present invention.

Claims (1)

1. A large-scale face recognition method is characterized in that: the method comprises the following steps:
the method comprises the following steps: training data and cleaning the trained data;
step two: setting a network, using an MxNet architecture, using a VGG2 network as training data, and using softmax as a loss function to verify various network settings; the batch size is set to 512, the learning rate is reduced by 10 times from 0.1 at 100k, 140k and 160k respectively, the impulse unit takes a value of 0.9, and the weight attenuation is set to 5 e-4;
step three: inputting settings, adopting 5 key points to calibrate the human face and normalizing the human face to a 112 × 112 color image;
step four: output settings including methods for implementing the last Convolution layer connected to the feature vector using the optimal structure Convolume- > BN- > Dropout- > Fullyconnected- > BN;
step five: optimizing the Softmax loss function to obtain a new Softmax loss function, and improving the accuracy of face recognition according to the new Softmax loss function, wherein the method comprises the following steps:
optimizing the Softmax loss function comprises a primary optimization and a secondary optimization, wherein the primary optimization comprises the following steps:
setting bias as 0, and adding a margin to the included angle of projection to obtain a primary optimization formula as follows:
Figure DEST_PATH_IMAGE002
m represents the number of samples, i represents the number of samples, represents the ith sample, n represents the number of sample classifications, j represents the jth number of sample classifications, s represents the scale coefficient,
Figure DEST_PATH_IMAGE004
representing a fixed value, representing the angle of projection of the feature points,
Figure DEST_PATH_IMAGE006
representing the included angle of the feature point after projection normalization;
and performing secondary optimization on the formula after the primary optimization, wherein the secondary optimization comprises adding a fixed value on the angle as follows:
Figure DEST_PATH_IMAGE008
where m represents the number of samples, i represents the ith number of samples, n represents the number of sample classifications, j represents the jth number of sample classifications, s represents the scale coefficient,
Figure DEST_PATH_IMAGE010
indicating an angular incremental manner in which
Figure DEST_PATH_IMAGE012
Figure DEST_PATH_IMAGE014
Which represents a fixed value of the time-varying,
Figure DEST_PATH_IMAGE016
and representing the included angle of the projection normalization of the characteristic points.
CN201810956517.6A 2018-08-21 2018-08-21 Large-scale face recognition method Active CN109145986B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810956517.6A CN109145986B (en) 2018-08-21 2018-08-21 Large-scale face recognition method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810956517.6A CN109145986B (en) 2018-08-21 2018-08-21 Large-scale face recognition method

Publications (2)

Publication Number Publication Date
CN109145986A CN109145986A (en) 2019-01-04
CN109145986B true CN109145986B (en) 2021-12-24

Family

ID=64790832

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810956517.6A Active CN109145986B (en) 2018-08-21 2018-08-21 Large-scale face recognition method

Country Status (1)

Country Link
CN (1) CN109145986B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106372581A (en) * 2016-08-25 2017-02-01 中国传媒大学 Method for constructing and training human face identification feature extraction network
CN107103281A (en) * 2017-03-10 2017-08-29 中山大学 Face identification method based on aggregation Damage degree metric learning
CN108182409A (en) * 2017-12-29 2018-06-19 北京智慧眼科技股份有限公司 Biopsy method, device, equipment and storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10509952B2 (en) * 2016-08-30 2019-12-17 Irida Labs S.A. Fast, embedded, hybrid video face recognition system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106372581A (en) * 2016-08-25 2017-02-01 中国传媒大学 Method for constructing and training human face identification feature extraction network
CN107103281A (en) * 2017-03-10 2017-08-29 中山大学 Face identification method based on aggregation Damage degree metric learning
CN108182409A (en) * 2017-12-29 2018-06-19 北京智慧眼科技股份有限公司 Biopsy method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN109145986A (en) 2019-01-04

Similar Documents

Publication Publication Date Title
CN110443143B (en) Multi-branch convolutional neural network fused remote sensing image scene classification method
CN108898180B (en) Depth clustering method for single-particle cryoelectron microscope images
CN104966097B (en) A kind of complex script recognition methods based on deep learning
CN108520202B (en) Method for extracting image characteristics with robustness resistance based on variational spherical projection
CN105138998B (en) Pedestrian based on the adaptive sub-space learning algorithm in visual angle recognition methods and system again
CN109190446A (en) Pedestrian's recognition methods again based on triple focused lost function
CN112580590A (en) Finger vein identification method based on multi-semantic feature fusion network
CN111126386A (en) Sequence field adaptation method based on counterstudy in scene text recognition
CN111401156B (en) Image identification method based on Gabor convolution neural network
CN107203752A (en) A kind of combined depth study and the face identification method of the norm constraint of feature two
CN107480723B (en) Texture Recognition based on partial binary threshold learning network
CN108038504A (en) A kind of method for parsing property ownership certificate photo content
Hu et al. Toward driver face recognition in the intelligent traffic monitoring systems
CN111160538B (en) Method and system for updating margin parameter value in loss function
CN115546553A (en) Zero sample classification method based on dynamic feature extraction and attribute correction
CN108520201A (en) A kind of robust human face recognition methods returned based on weighted blend norm
CN109508670B (en) Static gesture recognition method based on infrared camera
CN109145986B (en) Large-scale face recognition method
CN103226698B (en) A kind of method for detecting human face
CN111191027B (en) Generalized zero sample identification method based on Gaussian mixture distribution (VAE)
CN116704585A (en) Face recognition method based on quality perception
CN111445540A (en) Automatic registration method for RGB colored three-dimensional point cloud
CN111126173A (en) High-precision face detection method
Hassan et al. Intelligent sign language recognition using enhanced fourier descriptor: a case of Hausa sign language
CN110378414A (en) The personal identification method of multi-modal biological characteristic fusion based on evolution strategy

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant