CN112183190A - Human face quality evaluation method based on local key feature recognition


Publication number
CN112183190A
Authority: CN (China)
Prior art keywords: face, human face, local key, features, key feature
Legal status: Pending
Application number: CN202010831673.7A
Other languages: Chinese (zh)
Inventors: 张诗辉, 金梦醒, 刘恒
Current Assignee: Hangzhou Yiwei Technology Co ltd
Original Assignee: Hangzhou Yiwei Technology Co ltd
Application filed by Hangzhou Yiwei Technology Co ltd
Priority application: CN202010831673.7A
Publication: CN112183190A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V 40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V 40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V 40/168 Feature extraction; Face representation
    • G06V 40/171 Local features and components; Facial parts; Occluding parts, e.g. glasses; Geometrical relationships
    • G06V 40/172 Classification, e.g. identification
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/24 Classification techniques
    • G06F 18/25 Fusion techniques
    • G06F 18/253 Fusion techniques of extracted features
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/045 Combinations of networks
    • G06N 3/08 Learning methods

Abstract

The invention discloses a human face quality evaluation method based on local key feature recognition. Compared with the prior art, the method adds multi-task output judgment on top of the original face detection, so that the relevant local key features of the face are judged without adding model parameters, and face recognition is performed on those local key features. Taking the eyes and eyebrows as the core, an attention mechanism is added to extract the local key features of the face, and these features are extracted and paired in combination with facial contour features. This realizes high-quality evaluation of the local key features of the face and, together with a deep learning algorithm and face recognition technology, improves both the accuracy and the efficiency of face recognition, thereby achieving the purpose of the method.

Description

Human face quality evaluation method based on local key feature recognition
Technical Field
The invention relates to a face quality evaluation method, in particular to a face quality evaluation method based on local key feature recognition.
Background
At present, most face recognition equipment on the market does not support local key feature recognition, so the detection and recognition rate is low under special conditions (e.g., when the face is partially occluded or the subject is wearing sunglasses).
Therefore, a face quality evaluation method based on local key feature recognition is particularly needed to solve the existing problems.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a human face quality evaluation method based on local key feature recognition that realizes high-quality evaluation of the local key features of the human face and, combined with a deep learning algorithm and face recognition technology, improves both the accuracy and the efficiency of face recognition.
The technical problem addressed by the invention is solved by the following technical scheme:
a human face quality evaluation method based on local key feature recognition is characterized by comprising the following steps:
step 1: acquiring an image through a camera device, entering the next step if the image is a human face image, and re-acquiring the image through the camera device if the image is not the human face image;
step 2: detecting whether the obtained face image has face local key features, if not, entering a step 3, and if so, entering a step 4;
and step 3: comparing the face image with a face database through a face algorithm, releasing if the comparison is passed, and returning to the step 1 if the comparison is not passed;
and 4, step 4: and (3) comparing the face image with a face local key feature database through a face local key feature algorithm, releasing if the comparison is passed, and returning to the step 1 if the comparison is not passed.
In an embodiment of the present invention, in step 2, whether the acquired face image contains local key features of the face is detected through a network model structure based on a feature pyramid and a multitask loss function, the loss being defined as

Loss = L_cls(y_i, ŷ_i) + λ1·L_box(t_i, t̂_i) + λ2·L_feat(m_i, m̂_i)

where λ1 and λ2 are task-balancing weights. The loss function is thus divided into a combination of three loss functions: a two-class softmax loss L_cls for whether the region is a face, a smooth loss L_box over the face detection frame coordinates, and a two-class softmax loss L_feat for whether the local key features are present (e.g., whether a mask is worn). Here y_i is the ground truth (GT) and ŷ_i is the model inference result; likewise t_i and m_i are GT, while t̂_i and m̂_i are the model inference outputs.

L_cls is the two-class softmax (cross-entropy) loss for whether the region is a face:

L_cls(y_i, ŷ_i) = −[y_i·log(ŷ_i) + (1 − y_i)·log(1 − ŷ_i)]

L_box is the face frame coordinate loss, using the smooth_L1 function:

L_box(t_i, t̂_i) = Σ_j smooth_L1(t_i^(j) − t̂_i^(j)),  where smooth_L1(x) = 0.5x² if |x| < 1, and |x| − 0.5 otherwise

L_feat, the classification function for whether the face has the local key features, likewise uses a softmax (cross-entropy) loss of the same form as L_cls.
In one embodiment of the invention, the local key features of the human face are obtained by re-annotating the eyes and eyebrows of the data set pictures on top of the whole-face features; an attention mechanism is added to the network so that depth feature extraction emphasizes the eyes and eyebrows; new face features are generated by combining the whole-face features with the eye and eyebrow features; and finally the face features are taken from the layer preceding the fully-connected (FC) layer of a classification network trained with L-Softmax, in which the original softmax loss

L_i = −log( e^(W_{y_i}·x_i) / Σ_j e^(W_j·x_i) )

is deformed into the L-Softmax loss

L_i = −log( e^(‖W_{y_i}‖·‖x_i‖·ψ(θ_{y_i})) / ( e^(‖W_{y_i}‖·‖x_i‖·ψ(θ_{y_i})) + Σ_{j≠y_i} e^(‖W_j‖·‖x_i‖·cos θ_j) ) )

where ψ is the L-Softmax angular-margin function.
Further, the attention mechanism performs a weighted summation of the overall features and the local features of the picture, increasing the weight of regions with a large contribution and decreasing the weight of regions with a small contribution. Combined with the characteristic of the softmax function that its output lies in the interval [0, 1], which makes it suitable for distributing the weights, the weight calculation formula is

α_i = e^(s_i) / Σ_j e^(s_j)

where s_i is the contribution score of region i, and the fused feature is then calculated as

F = Σ_i α_i·f_i

where f_i is the feature of region i.
further, the features are mapped into n +1 dimensions through a softmax function, each dimension represents a category, and a one-dimensional convolution is used for replacing a full-link layer to be classified and branched.
In an embodiment of the present invention, the evaluation index for the comparison is the common mean average precision (mAP) index, with the formula

AP = ∫₀¹ p(r) dr

where p is the precision and r is the recall. During calculation, the overlap ratio between the prediction frame and the annotation frame is denoted a, computed as

a = area(B_pred ∩ B_gt) / area(B_pred ∪ B_gt)

and a prediction frame with a ≥ 0.5 is regarded as a positive example.
in an embodiment of the invention, by adopting the human face quality evaluation method based on local key feature recognition, the human face detection speed is less than 30ms, the human face feature extraction speed is less than 200ms, the human face detection accuracy is greater than 99.3%, and the human face recognition accuracy is greater than 92%.
Compared with the prior art, the human face quality evaluation method based on local key feature recognition adds multi-task output judgment on top of the original face detection, so that the relevant local key features of the face are judged without adding model parameters, and face recognition is performed on those local key features. Taking the eyes and eyebrows as the core, an attention mechanism is added to extract the local key features of the face, and the recognized local key features are extracted and paired in combination with facial contour features, thereby realizing the purpose of the invention.
The features of the present invention will be apparent from the accompanying drawings and from the detailed description of the preferred embodiments which follows.
Drawings
FIG. 1 is a schematic flow chart of a human face quality evaluation method based on local key feature recognition according to the present invention;
FIG. 2 is a schematic diagram of a local key feature model structure of a human face according to the present invention;
FIG. 3 is a schematic diagram of the network structure with and without Attention according to the present invention;
FIG. 4 is a schematic structural diagram of an Attention module according to the present invention;
FIG. 5 is a schematic structural diagram of a ResBlock layer according to the present invention;
FIG. 6 is a structural diagram of the one-dimensional convolution according to the present invention.
Detailed Description
In order to make the technical means, creative features, objectives and effects of the invention easy to understand, the invention is further explained below with reference to the specific drawings.
Examples
As shown in fig. 1 to 6, the method for evaluating the face quality based on local key feature recognition of the present invention includes the following steps:
Step 1: acquire an image through a camera device; if the image is a face image, proceed to the next step; if not, re-acquire an image through the camera device;
Step 2: detect whether the acquired face image contains local key features of the face; if not, proceed to step 3; if so, proceed to step 4;
Step 3: compare the face image against a face database using a face algorithm; if the comparison passes, the subject is released (passed); if not, return to step 1;
Step 4: compare the face image against a local key feature database of the face using a local key feature algorithm; if the comparison passes, the subject is released (passed); if not, return to step 1.
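For illustration only, the step 1-4 decision flow above can be sketched in plain Python. The helper callables `capture_image`, `is_face`, `has_local_key_features`, `match_face_db` and `match_local_feature_db` are hypothetical placeholders standing in for the camera device, the face detector and the two database comparisons; they are not named in the disclosure.

```python
def evaluate(capture_image, is_face, has_local_key_features,
             match_face_db, match_local_feature_db, max_attempts=10):
    """Return True (release/pass) as soon as a comparison succeeds.

    Each argument is a callable supplied by the caller; max_attempts is
    an assumption added so the sketch terminates (the patent loops back
    to step 1 indefinitely on failure).
    """
    for _ in range(max_attempts):
        image = capture_image()                   # step 1: acquire an image
        if not is_face(image):
            continue                              # not a face: re-acquire
        if not has_local_key_features(image):     # step 2: occlusion check
            if match_face_db(image):              # step 3: whole-face match
                return True
        else:
            if match_local_feature_db(image):     # step 4: local-feature match
                return True
        # comparison failed: return to step 1
    return False
```

The branch structure mirrors the claim: an occluded face (local key features present) is routed to the local-feature database, an unoccluded face to the ordinary face database.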
In this embodiment, in step 2, whether the acquired face image contains local key features of the face is detected through a network model structure based on a feature pyramid and a multitask loss function, the loss being defined as

Loss = L_cls(y_i, ŷ_i) + λ1·L_box(t_i, t̂_i) + λ2·L_feat(m_i, m̂_i)

where λ1 and λ2 are task-balancing weights, y_i, t_i and m_i are the ground truth (GT), and ŷ_i, t̂_i and m̂_i are the model inference outputs.

L_cls is the two-class softmax (cross-entropy) loss for whether the region is a face:

L_cls(y_i, ŷ_i) = −[y_i·log(ŷ_i) + (1 − y_i)·log(1 − ŷ_i)]

L_box is the face frame coordinate loss, using the smooth_L1 function:

L_box(t_i, t̂_i) = Σ_j smooth_L1(t_i^(j) − t̂_i^(j)),  where smooth_L1(x) = 0.5x² if |x| < 1, and |x| − 0.5 otherwise

L_feat, the classification function for whether the face has the local key features, likewise uses a softmax (cross-entropy) loss of the same form as L_cls.
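As an illustrative sketch (not part of the disclosure), the loss terms described above — the two-class softmax (cross-entropy) term and the smooth-L1 box term — can be written in plain Python as follows. The weighting factors `lam1` and `lam2` are assumptions; the patent does not give their values.

```python
import math

def binary_ce(y, y_hat, eps=1e-12):
    """Two-class softmax (cross-entropy) loss for one sample: y is the
    ground-truth label (0 or 1), y_hat the predicted probability of the
    positive class; eps clips the log argument for numerical safety."""
    y_hat = min(max(y_hat, eps), 1.0 - eps)
    return -(y * math.log(y_hat) + (1 - y) * math.log(1 - y_hat))

def smooth_l1(x):
    """Smooth-L1 penalty used for box-coordinate regression:
    quadratic near zero, linear for |x| >= 1."""
    return 0.5 * x * x if abs(x) < 1.0 else abs(x) - 0.5

def multitask_loss(y, y_hat, box, box_hat, m, m_hat, lam1=1.0, lam2=1.0):
    """Combined loss: face/non-face term + box-coordinate term +
    local-key-feature term, matching the three-part structure above."""
    l_box = sum(smooth_l1(t - t_hat) for t, t_hat in zip(box, box_hat))
    return binary_ce(y, y_hat) + lam1 * l_box + lam2 * binary_ce(m, m_hat)
```

With perfect predictions every term vanishes, which is a quick sanity check on the formulation.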
In this embodiment, the local key features of the human face are obtained by re-annotating the eyes and eyebrows of the data set pictures on top of the whole-face features; an attention mechanism is added to the network so that depth feature extraction emphasizes the eyes and eyebrows; new face features are generated by combining the whole-face features with the eye and eyebrow features; and finally the face features are taken from the layer preceding the fully-connected (FC) layer of a classification network trained with L-Softmax, in which the original softmax loss

L_i = −log( e^(W_{y_i}·x_i) / Σ_j e^(W_j·x_i) )

is deformed into the L-Softmax loss

L_i = −log( e^(‖W_{y_i}‖·‖x_i‖·ψ(θ_{y_i})) / ( e^(‖W_{y_i}‖·‖x_i‖·ψ(θ_{y_i})) + Σ_{j≠y_i} e^(‖W_j‖·‖x_i‖·cos θ_j) ) )

where ψ is the L-Softmax angular-margin function.
Further, the attention mechanism performs a weighted summation of the overall features and the local features of the picture, increasing the weight of regions with a large contribution and decreasing the weight of regions with a small contribution. Combined with the characteristic of the softmax function that its output lies in the interval [0, 1], which makes it suitable for distributing the weights, the weight calculation formula is

α_i = e^(s_i) / Σ_j e^(s_j)

where s_i is the contribution score of region i, and the fused feature is then calculated as

F = Σ_i α_i·f_i

where f_i is the feature of region i.
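A minimal sketch of this weighting scheme, assuming the per-region contribution scores `scores` and per-region feature vectors `features` are available (how they are produced by the network is not specified here):

```python
import math

def attention_weights(scores):
    """Softmax over contribution scores: each weight lies in (0, 1) and
    the weights sum to 1, matching the [0, 1] interval noted above."""
    m = max(scores)                       # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def fuse(features, scores):
    """Weighted sum of per-region feature vectors: regions with larger
    contribution scores receive larger weights."""
    w = attention_weights(scores)
    dim = len(features[0])
    return [sum(w[i] * features[i][d] for i in range(len(features)))
            for d in range(dim)]
```

Because the weights sum to 1, fusing identical feature vectors returns that same vector, and raising one region's score strictly raises its weight at the expense of the others.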
resblock layer: (1 × 1 convolution + relu +1 × 1 convolution +3 × 3 convolution) × 8(12 or 16), the first 1 × 1 convolution is to make more detailed features of the original extended learning image for the channel, the second 1 × 1 convolution is to compress the number of channels in proportion, reduce the size of parameters and accelerate calculation, then obtain the output feature diagram of the layer through one 3 × 3 convolution, and the final output result of the layer is obtained by adding the input layer and the output layer. The Resblock layer may be repeated 8, 12 or 16 times.
Feature pyramid: convolutional neural network layers at different levels extract features ranging from low-level image features to high-level semantic features. As shown in the structure diagram, the last four features at different levels of the model are fused, so that the final output feature simultaneously contains low-level and high-level information, improving the accuracy of both network recognition and position estimation.
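A top-down fusion sketch of such a pyramid, under the simplifying assumptions (not stated in the disclosure) that the levels have equal channel counts and each level is half the spatial size of the one below it:

```python
import numpy as np

def upsample2x(x):
    """Nearest-neighbour 2x upsampling of a (C, H, W) feature map."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

def fuse_pyramid(features):
    """Top-down fusion: starting from the highest-level (smallest) map,
    repeatedly upsample and add into the next-lower level, so every
    output map mixes high-level and low-level information.
    `features` is ordered low level (large) to high level (small)."""
    fused = [features[-1]]
    top = features[-1]
    for f in reversed(features[:-1]):
        top = f + upsample2x(top)
        fused.append(top)
    return list(reversed(fused))   # back to low-to-high order
```

In a full detector, 1×1 convolutions would first align channel counts between levels; the add-after-upsample step shown here is the core of the fusion.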
One-dimensional convolution: the features are mapped into n + 1 dimensions through a softmax function, each dimension representing a category, and a one-dimensional convolution is used in place of a fully-connected layer for the classification branch, preserving detection and classification precision while effectively reducing the network parameters and shortening the inference time.
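The parameter saving can be seen from the fact that a 1×1 convolution applies the same fully-connected weights at every spatial position, so its parameter count does not grow with the feature-map size. A small demonstration (the shapes and names are illustrative, not from the disclosure):

```python
import numpy as np

rng = np.random.default_rng(1)

def fc_per_position(x, w):
    """Fully-connected classifier applied at one spatial position:
    x is a (C,) channel vector, w is (n_classes, C)."""
    return w @ x

def conv1x1_classify(x, w):
    """1x1 convolution as a classification head: the same (n_classes, C)
    weights applied at every position of a (C, H, W) feature map, so the
    parameter count n_classes * C is independent of H and W."""
    return np.einsum('kc,chw->khw', w, x)
```

A flattened fully-connected head over the whole map would instead need n_classes × C × H × W weights.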
In this embodiment, the evaluation index for the comparison is the common mean average precision (mAP) index, with the formula

AP = ∫₀¹ p(r) dr

where p is the precision and r is the recall. During calculation, the overlap ratio between the prediction frame and the annotation frame is denoted a, computed as

a = area(B_pred ∩ B_gt) / area(B_pred ∪ B_gt)

and a prediction frame with a ≥ 0.5 is regarded as a positive example.
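The overlap ratio a (intersection over union) and the a ≥ 0.5 positive-example rule can be sketched directly; boxes are taken here as (x1, y1, x2, y2) corner coordinates, a common convention not fixed by the disclosure:

```python
def iou(box_a, box_b):
    """Overlap ratio a between a prediction frame and an annotation
    frame: intersection area divided by union area."""
    x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x2, y2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

def is_positive(pred_box, gt_box, threshold=0.5):
    """A prediction frame counts as a positive example when a >= 0.5."""
    return iou(pred_box, gt_box) >= threshold
```

Precision and recall are then computed over these positives, and AP is the area under the resulting precision-recall curve.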
in this embodiment, by using the method for evaluating the face quality based on local key feature recognition, the face detection speed is less than 30ms, the face feature extraction speed is less than 200ms, the face detection accuracy is greater than 99.3%, and the face recognition accuracy is greater than 92%.
The foregoing shows and describes the general principles, principal features and advantages of the present invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which merely illustrate its principles, and that various changes and modifications may be made without departing from the spirit and scope of the invention, which is defined by the appended claims and their equivalents.

Claims (7)

1. A human face quality evaluation method based on local key feature recognition, characterized by comprising the following steps:
Step 1: acquire an image through a camera device; if the image is a face image, proceed to the next step; if not, re-acquire an image through the camera device;
Step 2: detect whether the acquired face image contains local key features of the face; if not, proceed to step 3; if so, proceed to step 4;
Step 3: compare the face image against a face database using a face algorithm; if the comparison passes, the subject is released (passed); if not, return to step 1;
Step 4: compare the face image against a local key feature database of the face using a local key feature algorithm; if the comparison passes, the subject is released (passed); if not, return to step 1.
2. The method for evaluating the quality of the human face based on local key feature recognition according to claim 1, wherein step 2 detects whether the acquired face image contains local key features of the face through a network model structure based on a feature pyramid and a multitask loss function, the loss being defined as

Loss = L_cls(y_i, ŷ_i) + λ1·L_box(t_i, t̂_i) + λ2·L_feat(m_i, m̂_i)

wherein L_cls is the two-class softmax (cross-entropy) loss for whether the region is a face:

L_cls(y_i, ŷ_i) = −[y_i·log(ŷ_i) + (1 − y_i)·log(1 − ŷ_i)]

L_box is the face frame coordinate loss, using the smooth_L1 function:

L_box(t_i, t̂_i) = Σ_j smooth_L1(t_i^(j) − t̂_i^(j)),  where smooth_L1(x) = 0.5x² if |x| < 1, and |x| − 0.5 otherwise

and L_feat, the classification function for whether the face has the local key features, likewise uses a softmax (cross-entropy) loss.
3. The human face quality evaluation method based on local key feature recognition according to claim 1, wherein the local key features of the human face are obtained by re-annotating the eyes and eyebrows of the data set pictures on top of the whole-face features; an attention mechanism is added to the network so that depth feature extraction emphasizes the eyes and eyebrows; new face features are generated by combining the whole-face features with the eye and eyebrow features; and finally the face features are taken from the layer preceding the fully-connected (FC) layer of a classification network trained with L-Softmax, in which the original softmax loss

L_i = −log( e^(W_{y_i}·x_i) / Σ_j e^(W_j·x_i) )

is deformed into the L-Softmax loss

L_i = −log( e^(‖W_{y_i}‖·‖x_i‖·ψ(θ_{y_i})) / ( e^(‖W_{y_i}‖·‖x_i‖·ψ(θ_{y_i})) + Σ_{j≠y_i} e^(‖W_j‖·‖x_i‖·cos θ_j) ) )

where ψ is the L-Softmax angular-margin function.
4. The method for evaluating the face quality based on local key feature recognition according to claim 3, wherein the attention mechanism performs a weighted summation of the overall features and the local features of the picture, increasing the weight of regions with a large contribution and decreasing the weight of regions with a small contribution; combined with the characteristic of the softmax function that its output lies in the interval [0, 1], which makes it suitable for distributing the weights, the weight calculation formula is

α_i = e^(s_i) / Σ_j e^(s_j)

where s_i is the contribution score of region i, and the fused feature is then calculated as

F = Σ_i α_i·f_i
5. The method for evaluating the quality of the human face based on local key feature recognition according to claim 2, wherein the features are mapped into n + 1 dimensions by a softmax function, each dimension representing a category, and a one-dimensional convolution is used in place of a fully-connected layer for the classification branch.
6. The method for evaluating the quality of the human face based on local key feature recognition according to claim 1, wherein the evaluation index of the comparison is the common mean average precision (mAP) index, with the formula

AP = ∫₀¹ p(r) dr

where p is the precision and r is the recall; during calculation, the overlap ratio between the prediction frame and the annotation frame is denoted a, computed as

a = area(B_pred ∩ B_gt) / area(B_pred ∪ B_gt)

and a prediction frame with a ≥ 0.5 is regarded as a positive example.
7. The method for evaluating the quality of a human face based on local key feature recognition according to claim 1, wherein the face detection speed is less than 30 ms, the face feature extraction speed is less than 200 ms, the face detection accuracy is greater than 99.3%, and the face recognition accuracy is greater than 92%.
CN202010831673.7A 2020-08-18 2020-08-18 Human face quality evaluation method based on local key feature recognition Pending CN112183190A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010831673.7A CN112183190A (en) 2020-08-18 2020-08-18 Human face quality evaluation method based on local key feature recognition


Publications (1)

Publication Number Publication Date
CN112183190A true CN112183190A (en) 2021-01-05

Family

ID=73919410

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010831673.7A Pending CN112183190A (en) 2020-08-18 2020-08-18 Human face quality evaluation method based on local key feature recognition

Country Status (1)

Country Link
CN (1) CN112183190A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110232369A (en) * 2019-06-20 2019-09-13 深圳和而泰家居在线网络科技有限公司 A kind of face identification method and electronic equipment
CN110610129A (en) * 2019-08-05 2019-12-24 华中科技大学 Deep learning face recognition system and method based on self-attention mechanism
CN111178183A (en) * 2019-12-16 2020-05-19 深圳市华尊科技股份有限公司 Face detection method and related device
CN111523431A (en) * 2020-04-16 2020-08-11 支付宝(杭州)信息技术有限公司 Face recognition method, device and equipment


Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
DMITRY YASHUNIN et al.: "MaskFace: multi-task face and landmark detector", arXiv *
HEFEI LING et al.: "Self Residual Attention Network For Deep Face Recognition", Digital Object Identifier *
JIANKANG DENG et al.: "ArcFace: Additive Angular Margin Loss for Deep Face Recognition", 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) *
YONG LI et al.: "Occlusion Aware Facial Expression Recognition Using CNN With Attention Mechanism", IEEE Transactions on Image Processing *
CHEN Yunji et al.: "Intelligent Computing Systems" (《智能计算系统》), 30 April 2020 *


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination