CN110738071A

CN110738071A - face algorithm model training method based on deep learning and transfer learning

Info

Publication number: CN110738071A
Application number: CN201810787805.3A
Authority: CN
Inventors: 刘中秋; 陈高曙; 唐松鹤
Original assignee: Miaxis Biometrics Co Ltd
Current assignee: Miaxis Biometrics Co Ltd
Priority date: 2018-07-18
Filing date: 2018-07-18
Publication date: 2020-01-31

Abstract

The invention provides face algorithm model training methods based on deep learning and transfer learning, which are characterized in that a convolutional neural network is trained on a pure portrait dataset of a large-capacity sample to obtain a pre-training model, then, on the basis, the final model is obtained by retraining a portrait dataset (a data set of a portrait and an identity card photo) through transfer learning, and the problem of how to use fewer portrait training samples to train a high-accuracy portrait recognition model is effectively solved.

Description

face algorithm model training method based on deep learning and transfer learning

[ technical field ] A method for producing a semiconductor device

The invention relates to the field of face image recognition, in particular to testimony recognition training methods based on deep learning and transfer learning.

[ background of the invention ]

The human face recognition is branches of image recognition, two human face photos can be fully automatically verified by using a human face verification algorithm, and whether the two human face photos are the same people or not can be judged.

[ summary of the invention ]

In order to solve the problems, the invention provides face algorithm model training methods based on deep learning and transfer learning, a convolutional neural network is trained on a pure portrait dataset of a large-capacity sample to obtain a pre-training model, then on the basis, the transfer learning is carried out, and retraining is carried out on a testimony dataset (a portrait and an identification card photo dataset) to obtain a final model, so that the problem of how to use fewer testimony training samples to train a testimony recognition model with high accuracy is effectively solved.

The invention discloses an face algorithm model training method based on deep learning and transfer learning, which comprises the following steps:

step 1: training a pure portrait dataset of a collected large-capacity sample by adopting a convolutional neural network to obtain a pre-training model, wherein the convolutional neural network model sequentially comprises a training data layer, a convolutional layer, an activation layer, a pooling layer, a full-link layer and a loss layer;

step 2: and (3) taking the pre-training model in the step (1) as a training starting point, copying network parameters before the full-connection layer in the step (1) by adopting transfer learning, and training the full-connection layer and the loss layer of the testimonial convolutional neural network on the testimonial data set in an important way to obtain a final model.

Because the sample of the human image data set is less, when the convolutional neural network training is directly carried out on the human image data set, a network model with good effect is difficult to train, the sample of the human image data set is more, the human image data set and the human image data set are all picture sets containing human faces, and the learning task is face recognition, so that the human image data set can be used for transfer learning. Therefore, in order to improve the accuracy of the human face identification of the testimony, a convolutional neural network is trained on a human image data set, parameters of a data layer, a convolutional layer, an activation layer and a pooling layer of a pre-training model are copied into the testimony convolutional neural network, and a full connection layer and a loss layer of the testimony convolutional neural network are trained in an important mode.

As technical solutions, the training of the data layer includes the following steps:

s1, detecting the human face, namely, inputting pictures containing the human face, and detecting the position of the human face by adopting a cascade structure and a multilayer neural network;

s2: key point positioning: according to the detected face, 5 key points are extracted by adopting a coarse-fine self-encoder network: left eye center, right eye center, nose tip, left mouth corner and right mouth corner;

s3: face preprocessing: the face images detected in step S1 are aligned using similarity transformation of 5 key points detected in step S2 with 5 given standard key points, and the aligned images have the same size.

As technical solutions, the loss function of the convolutional neural network training adopts softmax loss and center loss, and the calculation formula is as follows:

wherein,

is the total loss;

the difference between classes can be increased if the model is softmax;the method is characterized in that the method comprises the steps of learning classes of centers and reducing the intra-class distance, wherein lambda is the weight occupied by the center loss.

As technical solutions, in step 2, the learning rate of each layer before the full connection layer is set to 0 or reduced, because the parameters of each layer before the full connection layer are generalization-capable, with little or no need of retraining, the full connection layer and the loss layer need to be retrained to extract the specific features of the testimonial data set.

As technical solutions, the pre-training model obtained according to step 1 is compared with an evaluation set of face recognition international authority, the evaluation set includes LFW and megaface, and the accuracy of the pre-training model can be further guaranteed by comparing the pre-training model with the evaluation set of face recognition international authority.

In conclusion, the invention has the advantages that based on deep learning and transfer learning, convolutional neural network training is carried out on the pure portrait dataset of the large-capacity sample, and on the basis of the obtained training result, the testimony recognition model with higher accuracy is obtained by carrying out fine adjustment on the testimony dataset, thereby solving the problems that the testimony dataset has fewer samples and can not be directly applied to deep learning.

[ description of the drawings ]

FIG. 1 is a general flow chart of example 1 of the present invention

FIG. 2 is a flowchart of convolutional neural network training of human image data set according to embodiment 1 of the present invention

FIG. 3 is a flowchart of transfer learning according to embodiment 1 of the present invention

[ detailed description ] embodiments

Example 1

As shown in fig. 1, the face algorithm model training methods based on deep learning and transfer learning provided by the present invention firstly train a convolutional neural network on a pure human image data set of a large-capacity sample, as shown in fig. 2, including the following steps:

1. face detection, pictures containing faces are input, and the positions of the faces are detected by adopting a cascade structure and a multilayer neural network;

2. key point positioning: according to the detected face, 5 key points are extracted by adopting a coarse-fine self-encoder network: left eye center, right eye center, nose tip, left mouth corner and right mouth corner;

3. face preprocessing: aligning the face image detected in the step 1 by using the similarity transformation of the 5 key points detected in the step 2 and 5 given standard key points, wherein the aligned images have the same size;

4. network training: training is carried out by adopting a convolutional neural network, and softmax loss and center loss are adopted as loss functions, and the formula is as follows:

wherein,

is the total loss that is to be expected,

is softmax, the inter-class gap can be increased,

is centerloss, it is possible to learn the centers of classes and reduce the intra-class distance, and λ is the weight taken by the center loss.

The convolutional neural network simultaneously comprises a data layer, a convolutional layer, an activation layer, a pooling layer, a full-link layer and a loss layer, wherein the data layer can be obtained through the steps 1-3, the convolutional layer is used for convolving images so as to implicitly extract features from training data, the activation layer is added with nonlinear factors so as to improve the expression capacity of the convolutional neural network, the pooling layer is used for downsampling a feature mapping surface and mainly reduces dimensions, the full-link layer plays a role of a classifier in the whole convolutional neural network, the features are obtained in the full-link layer, and the features are groups of fixed-length numbers.

After the convolutional neural network of the portrait data set is trained, an evaluation set LFW for identifying international authority by a human face is used for comparison and test, and a pre-training model after accuracy test is used for training the testimony data set.

Because the portrait data set and the testimony data set are all picture sets containing human faces, and the learning task is human face recognition, the transfer learning can be used, and the transfer learning process is shown in fig. 3, and the specific method is as follows:

A. inputting a testimony data set, and taking a pre-training model of the portrait data set as a training starting point;

B. copying parameters of a data layer, a convolutional layer, an activation layer and a pooling layer of a pre-training model into a testimonial convolutional neural network, and setting the learning rate of the convolutional layer to be 0;

C. and training a full connection layer and a loss layer of the testimonial convolutional neural network to obtain a final model.

Claims

1, a face algorithm model training method based on deep learning and transfer learning, which is characterized by comprising the following steps:

2. The method for training facial algorithm models based on deep learning and transfer learning of claim 1, wherein the training of the data layer comprises the following steps:

3. The deep learning and transfer learning-based face algorithm model training method of claim 1, wherein the loss function of convolutional neural network training is softmax loss and center loss, and the calculation formula is as follows:

wherein,is the total loss;the difference between classes can be increased if the model is softmax;

4. The human face algorithm model training methods based on deep learning and transfer learning of claim 1, wherein in the step 2, the learning rate of each layer before the fully connected layer is set to 0 or reduced.

5. The training method of human face algorithm models based on deep learning and transfer learning according to claim 1, wherein the pre-training model obtained according to step 1 is compared with an evaluation set of human face recognition international authority, and the accuracy of the pre-training model is tested, wherein the evaluation set comprises LFW and MegaFace.