CN111738242A - Face recognition method and system based on self-adaption and color normalization - Google Patents

Face recognition method and system based on self-adaption and color normalization

Publication number: CN111738242A (application granted; also published as CN111738242B)
Application number: CN202010848894.5A
Original language: Chinese (zh)
Inventors: 陈晓莉, 丁一帆, 徐菁, 杨世宏, 徐云华, 林建洪
Applicant and assignee: Zhejiang Ponshine Information Technology Co ltd
Legal status: Active (granted)

Classifications

    • G06V40/161 — Human faces: detection; localisation; normalisation
    • G06V40/172 — Human faces: classification, e.g. identification

Abstract

The invention discloses a face recognition method based on self-adaptation and color normalization, which comprises the following steps: S1, creating a face image data set; S2, performing adaptive scale selection on the images in the face data set based on a face detection network to obtain new face images; S3, performing face positioning on the output new face images and color normalization on the periocular region of each positioned face to obtain processed face images; S4, inputting each processed face image into a face recognition network, calculating the Euclidean distance between its vector and that of a prestored face image, and judging whether the distance is smaller than a first preset threshold; if so, a training sample is obtained and input into the face recognition network for training to obtain the final face recognition network model; and S5, inputting the image to be recognized into the face recognition network model for face recognition to obtain the final recognition result.

Description

Face recognition method and system based on self-adaption and color normalization
Technical Field
The invention relates to the technical field of face recognition, in particular to a face recognition method and a face recognition system based on self-adaption and color normalization.
Background
When training a face detection network, the face size in a sample image is not fixed, owing to differences in angle and focal length. To accurately locate all possible faces in an image, the original image is therefore usually enlarged or reduced to form an image pyramid, i.e. as many scales as possible are selected to form images of different resolutions. When the enlargement/reduction scales are very close to one another, however, this causes redundancy among the detection frames.
For example, the patent with publication No. CN109684931A discloses a face recognition method based on color channels, comprising the following steps. S1: train a face recognition model and normalize the frequencies of the different color channels. S2: select the face image to be recognized and a face image from the stored information; using the weights of the different face features, compute the similarities x_B, x_R and x_G of the two face images in the different color channels; substitute each similarity into the corresponding similarity probability function N_r, N_g or N_b to obtain the maximum similarity probability max{N_r(x_R), N_g(x_G), N_b(x_B)}; and judge from the relation between this maximum probability and a threshold whether the face image to be recognized and the stored image show the same person. That method provides a more reliable and stable result for the glasses-reflection problem common in face recognition. However, although it can identify a face, its detection targets the full face, where interference from light, shadow and the like is large, so face recognition cannot be performed accurately.
To solve this problem, the invention designs an adaptive scale selection mechanism. Experiments further show that color normalization of the periocular region of the detected face image can improve the accuracy of face recognition.
Disclosure of Invention
The invention aims to provide a face recognition method and system based on self-adaptation and color normalization that overcome the defects of the prior art: the proposed adaptive scale selection mechanism reduces the redundancy of face detection frames, and color normalization of the periocular region, applied before the detected face image is input into the face recognition network, reduces the influence of factors such as light and color channels, improving the accuracy of face recognition.
In order to achieve the purpose, the invention adopts the following technical scheme:
a face recognition method based on self-adaptation and color normalization comprises the following steps:
s1, creating a face image data set;
s2, carrying out self-adaptive scale selection processing on the images in the face data set based on a face detection network to obtain new face images;
s3, carrying out face positioning on the output new face image, and carrying out color normalization processing on the eye circumference area of the positioned face to obtain a processed face image;
s4, inputting the processed face image into a face recognition network, calculating the Euclidean distance between the vector of the processed face image and a prestored face image, judging whether the Euclidean distance is smaller than a first preset threshold value, if so, obtaining a training sample, and inputting the training sample into the face recognition network for training to obtain a final face recognition network model;
and S5, inputting the image to be recognized into a face recognition network model for face recognition to obtain a final recognition result.
Further, the step S2 specifically includes:
S21, converting each image P in the face data set, according to a preset set of n scales {s_1, …, s_n}, into new images P_i of resolution w_i × h_i, where i = 1, …, n;
S22, inputting each converted image P_i into a face detection network of 12 × 12 resolution to obtain a confidence score map C_i of dimension a_i × b_i, and counting all elements of C_i greater than a second preset threshold to obtain their total number m_i;
S23, calculating the ratio r_i = m_i / (w_i × h_i) of the total number m_i to the resolution w_i × h_i of image P_i, obtaining n ratios r_i; sorting the n ratios and selecting the images with the three largest ratios as input data;
S24, inputting the three selected images into a face detection network of 24 × 24 resolution, adjusting the detection frames to obtain candidate frames with confidence scores greater than the second preset threshold and IoU smaller than a third preset threshold, and storing the obtained candidate frames together with the other candidate frames in a candidate list;
and S25, inputting the candidate frames in the candidate list into a face detection network of 48 × 48 resolution to adjust the detection frames, and taking the candidate frames with IoU smaller than the third preset threshold and confidence greater than a fourth preset threshold as the final output face frames.
Further, the adjustment of the detection frames input into the 24 × 24 resolution face detection network in step S24 consists of resizing each detection frame to 24 × 24 × 3 and obtaining a 1 × 2 confidence array indicating whether a face is present and a 1 × 4 boundary information array constraining the boundary of each candidate bounding box.
Further, the adjustment of the detection frames input into the 48 × 48 resolution face detection network in step S25 consists of resizing each detection frame to 48 × 48 × 3 and obtaining a 1 × 2 confidence array indicating whether a face is present and a 1 × 4 boundary information array constraining the boundary of each candidate bounding box.
Further, the second preset threshold is 0.9; the third preset threshold value is 0.7; the fourth preset threshold is 0.95.
Further, the step S3 specifically includes:
s31, cutting the output face frame, and positioning the face in the face frame to obtain a positioned face image;
s32, performing 2d smoothing on the positioned face image to obtain a periocular region with a main part of a positioning color from the upper left corner [0.375,0.375] to the lower right corner [0.625,0.625 ];
and S33, calculating three-channel mean values of all the face image colors, and smoothing the values of all the channels of the eye surrounding area of each face image to the mean value to obtain the processed face image.
Further, the step S4 specifically comprises:
defining triplets in the face recognition network as <a, p, n>, where a and p belong to face images of the same person and n belongs to a face image of another person; inputting <a, p, n> into the face recognition network and outputting the embeddings f(a), f(p) and f(n), where the outputs satisfy
||f(a) − f(p)||² + α < ||f(a) − f(n)||²
with margin α; and performing condition screening on the training samples, screening out suitable triplets, inputting them into the face recognition network, and training to obtain the final face recognition network model.
Further, the condition screening of the training samples specifically comprises: selecting argmax_p ||f(a) − f(p)||² and the training samples satisfying the condition ||f(a) − f(p)||² < ||f(a) − f(n)||².
Further, in the step S5, the image to be recognized is input into the face recognition network model for face recognition, specifically, the image to be recognized is processed based on the steps S2-S4, so as to obtain a final recognition result.
Correspondingly, a face recognition system based on self-adaptation and color normalization is also provided, which comprises:
the creation module is used for creating a face image data set;
the first processing module is used for carrying out self-adaptive scale selection processing on the images in the face data set based on a face detection network to obtain new face images;
the second processing module is used for carrying out face positioning on the output new face image and carrying out color normalization processing on the periocular region of the positioned face to obtain a processed face image;
the training module is used for inputting the processed face image into a face recognition network, calculating the Euclidean distance between the vector of the processed face image and a prestored face image, judging whether the Euclidean distance is smaller than a first preset threshold value, if so, obtaining a training sample, and inputting the training sample into the face recognition network for training to obtain a final face recognition network model;
and the recognition module is used for inputting the image to be recognized into the face recognition network model for face recognition to obtain a final recognition result.
Compared with the prior art, the invention reduces the redundancy of face detection frames through the proposed adaptive scale selection mechanism before a face image is input into the face detection network, and reduces the influence of factors such as light and color channels through color normalization of the periocular region before the detected face image is input into the face recognition network, thereby improving the accuracy of face recognition.
Drawings
FIG. 1 is a flow chart of a face recognition method based on self-adaptation and color normalization according to an embodiment;
FIG. 2 is a schematic structural diagram of a training phase according to an embodiment;
FIG. 3 is a schematic structural diagram of an identification phase according to an embodiment;
FIG. 4 is a diagram illustrating the color normalization result of the eye region according to an embodiment;
FIG. 5 is a schematic view of a cut-away area of a face after positioning according to an embodiment;
fig. 6 is a structural diagram of a face recognition system based on adaptation and color normalization according to a third embodiment.
Detailed Description
The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention. It is to be noted that the features in the following embodiments and examples may be combined with each other without conflict.
The invention aims to provide a face recognition method and system based on self-adaption and color normalization aiming at the defects of the prior art.
Example one
The embodiment provides a face recognition method based on self-adaptation and color normalization, as shown in fig. 1, including the steps of:
s1, creating a face image data set;
s2, carrying out self-adaptive scale selection processing on the images in the face data set based on a face detection network to obtain new face images;
s3, carrying out face positioning on the output new face image, and carrying out color normalization processing on the eye circumference area of the positioned face to obtain a processed face image;
s4, inputting the processed face image into a face recognition network, calculating the Euclidean distance between the vector of the processed face image and a prestored face image, judging whether the Euclidean distance is smaller than a first preset threshold value, if so, obtaining a training sample, and inputting the training sample into the face recognition network for training to obtain a final face recognition network model;
and S5, inputting the image to be recognized into a face recognition network model for face recognition to obtain a final recognition result.
The method for face recognition based on self-adaptation and color normalization of the embodiment comprises a training phase and a recognition phase.
Before face detection in the training stage, an adaptive scale selection mechanism is designed to reduce the number of redundant candidate face detection frames before network convolution. The candidates are then sent to a face detection stage that screens at multiple resolutions; this stage positions the five facial features of each detected face, performs 2d smoothing of the image using the five points, and applies color normalization of the periocular region to the face image to be recognized. The image is then sent to a face recognition stage that uses a triplet loss: the convolutional features are converted into L2-normalized vectors, and training of face recognition is completed by comparing whether the Euclidean distance between the vector of the image to be recognized and that of a face of known identity is smaller than a given threshold. The structure is shown in fig. 2.
In the recognition stage, the face image to be recognized is mapped to the Euclidean space through a convolutional network, and the face recognition is completed according to the principle that the face of the same person has a smaller distance and the face of different persons has a larger distance. As shown in fig. 3.
Steps S1-S4 of this embodiment constitute the training stage, and step S5 is the recognition stage.
In step S1, a face image data set is created.
Creation of the data set: the CASIA-FaceV5 Asian face data set is used, containing 500 persons with 5 photos each, 2500 photos in total. All face images are 16-bit color BMP files with a resolution of 640 × 480; 80% are selected as the training set and 20% as the test set.
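The 80/20 split described above can be sketched as follows. This is an illustrative sketch only: the file-name pattern, the function name `split_dataset` and the fixed seed are assumptions, not part of the patent.

```python
import random

def split_dataset(image_paths, train_frac=0.8, seed=42):
    """Shuffle a list of face-image paths and split it into train/test sets."""
    paths = list(image_paths)
    random.Random(seed).shuffle(paths)
    cut = int(len(paths) * train_frac)
    return paths[:cut], paths[cut:]

# 500 subjects x 5 photos each, matching the data set described above;
# the file-name pattern is hypothetical.
all_images = [f"subject{s:03d}_{k}.bmp" for s in range(500) for k in range(5)]
train_set, test_set = split_dataset(all_images)
print(len(train_set), len(test_set))  # 2000 500
```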
In step S2, an adaptive scale selection process is performed on the image in the face data set based on the face detection network, and a new face image is obtained.
The method specifically comprises the following steps: image adaptive scale selection and face detection training.
Image adaptive scale selection: because the size of any face that may exist in the image is unknown, an adaptive scale selection mechanism is provided to handle detection of different face sizes.
S21, converting each image P in the face data set, according to a preset set of scales, into new images P_i of resolution w_i × h_i, where i = 1, …, n.
A default scale set {s_1, …, s_n} is defined. Using each scale s_i in the set, the pictures in the data set are enlarged or reduced according to s_i; the image P in the data set can then be converted into a new image P_i of resolution w_i × h_i, for each i = 1, …, n.
The face detection training comprises the following steps:
S22, inputting each converted image P_i into the face detection network of 12 × 12 resolution to obtain a confidence score map C_i of dimension a_i × b_i, and counting all elements of C_i greater than the second preset threshold to obtain their total number m_i.
Once a new image P_i is determined, it is sent into the 12 × 12 resolution network (a convolutional network) to obtain the confidence score map C_i of dimension a_i × b_i, and all elements of C_i greater than the second preset threshold 0.9 are counted to obtain the total number m_i.
In the present embodiment, "all elements" refers to the points in the generated ROI region, and the total number is the number of all such points.
In the 12 × 12 resolution convolutional network, the input image size is 12 × 12 × 3 (height × width × channels). The final output of the network is a 1 × 1 × 2 confidence map that gives whether a face is present in the 12 × 12 image, and a 1 × 1 × 4 bounding-box regression map that gives the corresponding ROI boundary if a face is detected.
S23, calculating the ratio r_i of the total number m_i to the resolution w_i × h_i of image P_i, obtaining n ratios; sorting the n ratios and selecting the images with the three largest ratios as input data.
The formula for calculating the ratio is:
r_i = m_i / (w_i × h_i)
Since n scales are preset in step S21, there are n total numbers m_i and n images P_i, so n ratios r_i are obtained from the above formula. The n ratios are then sorted, and the three images with the largest ratios r_i are input into the 24 × 24 network; this aims to reduce detection-frame redundancy, solving the redundancy caused by the enlarge/reduce image pyramid.
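Steps S21-S23 can be sketched as follows. This is a hedged illustration, not the patent's code: `score_fn` stands in for the 12 × 12-resolution face detection network, and the slicing-based "resize" is a placeholder for a real resize routine (e.g. `cv2.resize`).

```python
import numpy as np

def adaptive_scale_selection(image, scales, score_fn, conf_thresh=0.9, top_k=3):
    """Score each pyramid scale with the 12x12 detector and keep the top_k scales.

    score_fn(img) is assumed to return a 2-D confidence map for the resized
    image; the slicing below is only a placeholder for a real resize.
    """
    ratios = []
    for s in scales:
        h = max(1, int(image.shape[0] * s))
        w = max(1, int(image.shape[1] * s))
        resized = image[:h, :w]                  # placeholder resize
        conf_map = score_fn(resized)
        m = int((conf_map > conf_thresh).sum())  # total number m_i of confident points
        ratios.append((m / (w * h), s))          # ratio r_i = m_i / (w_i * h_i)
    ratios.sort(reverse=True)                    # sort the n ratios, largest first
    return [s for _, s in ratios[:top_k]]
```

The scales returned here correspond to the three images that would be passed on to the 24 × 24 network.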
S24, inputting the three images with the largest ratios into the face detection network of 24 × 24 resolution, adjusting the detection frames to obtain candidate frames with confidence scores greater than the second preset threshold and IoU smaller than the third preset threshold, and storing the obtained candidate frames together with the other candidate frames in a candidate list.
Only the images ranked in the top three by confidence ratio are input into the 24 × 24 resolution face detection network; low-confidence inputs are excluded from the network.
In the 24 × 24 resolution network, each detection box is resized to 24 × 24 × 3, yielding a 1 × 2 confidence array giving whether a face exists and a 1 × 4 boundary information array constraining each candidate bounding box; the candidate boxes with confidence score greater than 0.9 and IoU smaller than the third preset threshold 0.7 are kept in the candidate list together with all other candidate boxes.
And S25, inputting the candidate frames in the candidate list into a face detection network with 48 × 48 resolution to adjust the detection frames, and taking the candidate frames with IoU smaller than a third preset threshold and confidence greater than a fourth preset threshold as final output face frames.
The candidate boxes in the candidate list are input to a face detection network of 48 x 48 resolution.
In the 48 × 48 resolution network, each detection frame is resized to 48 × 48 × 3, yielding a 1 × 2 confidence array giving whether a face exists and a 1 × 4 boundary information array. At this stage, the candidate boxes with confidence greater than the fourth preset threshold 0.95 and IoU smaller than the third preset threshold 0.7 are taken as the final output.
In this embodiment, a group of scaled pictures is generated and input into the 12 × 12 convolutional network; the score of each scaled picture is calculated, and the top three are selected and input into the 24 × 24 convolutional network, reducing the load of the face detection network and shortening the detection time.
It should be noted that the face detection network in this embodiment is a concatenated convolutional network.
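The IoU-based screening used in steps S24 and S25 can be illustrated with the following sketch. The pairwise `iou` function is standard; `filter_candidates` is a simplified, non-maximum-suppression-style reading of the candidate screening (the confidence and IoU thresholds 0.95 and 0.7 come from the text, the function itself is an assumption).

```python
def iou(box_a, box_b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def filter_candidates(boxes, scores, conf_thresh=0.95, iou_thresh=0.7):
    """Keep confident boxes whose overlap with already-kept boxes stays below iou_thresh."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    kept = []
    for i in order:
        if scores[i] <= conf_thresh:
            continue
        if all(iou(boxes[i], boxes[j]) < iou_thresh for j in kept):
            kept.append(i)
    return [boxes[i] for i in kept]
```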
In step S3, face localization is performed on the output new face image, and color normalization processing is performed on the periocular region of the located face, so as to obtain a processed face image.
The method specifically comprises the following steps:
s31, cutting the output face frame, and positioning the face in the face frame to obtain a positioned face image;
cutting and positioning: the face detection network cuts the detected face frame, the cutting size is 224 multiplied by 224, the facial features in the image are positioned, and the positioned face position is centered.
S32, performing 2d smoothing on the positioned face image to obtain the periocular region, the interval from the upper left corner [0.375, 0.375] to the lower right corner [0.625, 0.625] in which the main part of the color lies;
Obtaining the face interval: after the image is cut and positioned, 2d smoothing is applied to the image using the five points, and the interval from the upper left corner [0.375, 0.375] to the lower right corner [0.625, 0.625], in which the main part of the positioned color lies, is obtained.
The coordinates from the upper left corner [0.375, 0.375] to the lower right corner [0.625, 0.625] are fractions of the image: after face detection, the cropping proportions of the positioned face region are 0.375 and 0.625, i.e. the interval from the upper left corner [0.375, 0.375] to the lower right corner [0.625, 0.625].
FIG. 5 is a schematic view of a cut-out region of a human face after positioning; the image in the left image box of fig. 5 is a face finally obtained after the original image is positioned; the diagram in the right diagram box of fig. 5 is the face obtained by clipping the positioned face according to the ratio of 0.375 to 0.625, wherein the vertex of the upper left corner of the box is [0.375,0.375 ]; the lower right corner of the box is located at [0.625,0.625], and then color processing operations are performed on this region.
In this embodiment, multiple experiments prove that the face can be completely covered by the cropping ratios of 0.375 and 0.625 after the selection and the positioning.
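Extracting this fractional interval can be sketched as follows; the function name is hypothetical, but the proportions 0.375 and 0.625 are those given above.

```python
import numpy as np

def periocular_region(face_img):
    """Crop the interval from [0.375, 0.375] (upper left, fractional
    coordinates) to [0.625, 0.625] (lower right) of a positioned face image."""
    h, w = face_img.shape[:2]
    top, left = int(0.375 * h), int(0.375 * w)
    bottom, right = int(0.625 * h), int(0.625 * w)
    return face_img[top:bottom, left:right]

face = np.zeros((224, 224, 3), dtype=np.uint8)  # the 224 x 224 crop from S31
print(periocular_region(face).shape)  # (56, 56, 3)
```

On the 224 × 224 crop, the interval runs from pixel 84 to pixel 140 in each axis, giving a 56 × 56 region.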
And S33, calculating three-channel mean values of all the face image colors, and smoothing the values of all the channels of the eye surrounding area of each face image to the mean value to obtain the processed face image.
Calculating the average value of the channels: 500 faces are randomly sampled and their three-channel color mean values are calculated; the means are R: 122.15, G: 95.98 and B: 80.90 respectively.
Color normalization of the periocular region: in the detection calculation, this embodiment is described using the B channel as an example.
Smoothing: the value of each channel is smoothed to the mean using B − np.mean(B[left_margin:right_margin, left_margin:right_margin]) + B_mean, where np.mean denotes the averaging operation, B_mean is the mean of the B channel, and left_margin and right_margin delimit the interval from the upper left corner [0.375, 0.375] to the lower right corner [0.625, 0.625] of the image to be detected. The result of color normalization of the periocular region is shown in fig. 4, with the image before processing on the left and the image after periocular color normalization on the right.
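Applying that per-channel formula to all three channels can be sketched as follows. The channel means are the sampled values given above; the channel order (R, G, B) and the function name are assumptions of this sketch.

```python
import numpy as np

def normalize_periocular(img, channel_means=(122.15, 95.98, 80.90)):
    """Shift each channel of the periocular crop so its mean matches the
    dataset mean: C - mean(C[region]) + C_mean, with the region being the
    [0.375, 0.375] .. [0.625, 0.625] interval. Channel order assumed R, G, B."""
    h, w = img.shape[:2]
    lm_h, rm_h = int(0.375 * h), int(0.625 * h)
    lm_w, rm_w = int(0.375 * w), int(0.625 * w)
    out = img.astype(np.float64)
    for c, mean_c in enumerate(channel_means):
        region = out[lm_h:rm_h, lm_w:rm_w, c]
        out[lm_h:rm_h, lm_w:rm_w, c] = region - region.mean() + mean_c
    return out
```

Pixels outside the periocular interval are left untouched; only the crop is shifted so its per-channel mean equals the dataset mean.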
This embodiment processes a face range because the face in an image detected by the face detection network generally lies in the interval from the upper left corner [0.375, 0.375] to the lower right corner [0.625, 0.625]. The periocular region is chosen because this part is close to the facial skin color and is only slightly disturbed by light and shadow; normalizing the colors in this interval with the mean values therefore improves the accuracy of face recognition.
In step S4, the processed face image is input into a face recognition network, and the euclidean distance between the vector of the processed face image and the pre-stored face image is calculated, and it is determined whether the euclidean distance is smaller than a first preset threshold, if so, a training sample is obtained, and the training sample is input into the face recognition network for training, so as to obtain a final face recognition network model.
Face recognition training: the face recognition network is a convolutional network using a triplet loss function. Triplets are defined as <a, p, n>, where a and p correspond to the same id (face images of the same person) and n corresponds to a different id (a face image of another person). The aim is to separate each positive pair from the negative by a distance margin, so that the distances between all faces of the same person are very small while the distance between a pair of face images from different persons is large; formally, the output embeddings f(a), f(p), f(n) satisfy
||f(a) − f(p)||² + α < ||f(a) − f(n)||²
with margin α. Selecting argmax_p ||f(a) − f(p)||² and the training samples satisfying ||f(a) − f(p)||² < ||f(a) − f(n)||², the training samples are condition-screened using online learning, and suitable triplets are selected for training to obtain the final face recognition network model.
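The triplet loss and one plausible reading of the condition screening can be sketched on plain embedding vectors as follows; the semi-hard window d(a,p) < d(a,n) < d(a,p) + α is an assumption reconstructed in the style of FaceNet-like triplet mining, and the margin value 0.2 is illustrative.

```python
import numpy as np

def triplet_loss(f_a, f_p, f_n, alpha=0.2):
    """Triplet loss: penalize unless ||f(a)-f(p)||^2 + alpha < ||f(a)-f(n)||^2."""
    d_pos = float(np.sum((f_a - f_p) ** 2))
    d_neg = float(np.sum((f_a - f_n) ** 2))
    return max(d_pos - d_neg + alpha, 0.0)

def select_semi_hard(f_a, f_p, negatives, alpha=0.2):
    """Screen negatives with d(a,p) < d(a,n) < d(a,p) + alpha -- one plausible
    reading of the condition screening; an assumption of this sketch."""
    d_pos = float(np.sum((f_a - f_p) ** 2))
    return [f_n for f_n in negatives
            if d_pos < float(np.sum((f_a - f_n) ** 2)) < d_pos + alpha]
```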
In step S5, the image to be recognized is input to the face recognition network model for face recognition, and the final recognition result is obtained.
The processing of steps S2-S4 described above is performed on the pictures of the test set (images to be recognized). Step S2 reduces the number of detection frames of the image input into the face detection network, and face detection is performed by the cascaded convolutional network (the face detection network). Step S3 processes the interval from the upper left corner [0.375, 0.375] to the lower right corner [0.625, 0.625] of the detected face image, normalizing the colors of this interval. The processed image is then sent into the face recognition network model trained in step S4, which computes whether the Euclidean distance between the image to be recognized and an image in the face library is smaller than the set threshold and thereby judges whether they share the same identity, giving the final recognition result.
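The recognition-stage distance check can be sketched as follows; the gallery structure, the function name and the threshold value 1.1 are illustrative assumptions (the patent only speaks of a preset threshold).

```python
import numpy as np

def recognize(query_vec, gallery, threshold=1.1):
    """Return the gallery id whose embedding is closest to query_vec in
    Euclidean distance, if that distance is below the threshold; else None.
    The threshold 1.1 is an illustrative placeholder."""
    best_id, best_dist = None, float("inf")
    for face_id, vec in gallery.items():
        dist = float(np.linalg.norm(query_vec - vec))
        if dist < best_dist:
            best_id, best_dist = face_id, dist
    return best_id if best_dist < threshold else None
```

Faces of the same person map to nearby vectors, so a query within the threshold of a stored embedding is accepted as that identity.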
The face recognition method provided by this embodiment comprises 4 stages: an image adaptive scale selection stage before face detection, a face detection stage, a periocular color normalization stage before face recognition, and a face recognition stage. In the first stage, because the size of a face possibly present in an image is not fixed, different scaling ratios are set to find faces quickly. In the second stage, 12 × 12, 24 × 24 and 48 × 48 multi-resolution convolutional networks screen step by step, eliminating a large number of non-face detection frames while keeping a high recall rate. In the third stage, because even photos of the same person differ greatly under the influence of illumination, expression, angle, occlusion and other factors, the colors of the periocular regions of the images to be recognized are normalized, preserving the richness of the extracted face features. In the fourth stage, a triplet-loss face recognition network learns the differences between face feature vectors and gives a recognition result quickly.
This embodiment aims to reduce the redundancy of face detection frames through the proposed adaptive scale selection mechanism before a face image is input into the face detection network, and to reduce the influence of factors such as lighting and color channels through color normalization of the periocular region before the detected face image is input into the face recognition network, thereby improving face recognition accuracy.
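A minimal sketch of the periocular color normalization follows, assuming the crop is taken in relative coordinates from [0.375, 0.375] to [0.625, 0.625] and that "smoothing to the mean" is a simple per-channel mean shift (the patent does not fix the exact smoothing). `target_mean` stands for the three-channel mean computed over all face images:

```python
import numpy as np

def normalize_periocular(image, target_mean):
    """Crop the periocular region in relative coordinates and shift each
    of its color channels so the region's channel means match `target_mean`.
    The mean-shift smoothing is an assumption for illustration."""
    img = image.astype(np.float64).copy()
    h, w = img.shape[:2]
    r0, r1 = int(0.375 * h), int(0.625 * h)
    c0, c1 = int(0.375 * w), int(0.625 * w)
    region = img[r0:r1, c0:c1]
    shift = np.asarray(target_mean) - region.reshape(-1, 3).mean(axis=0)
    img[r0:r1, c0:c1] = np.clip(region + shift, 0, 255)
    return img

# A flat gray face image; only the periocular crop is shifted.
face = np.full((64, 64, 3), 100.0)
out = normalize_periocular(face, target_mean=[120.0, 110.0, 100.0])
```

Pixels outside the periocular crop are left untouched, matching the method's focus on the eye region only.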
Example two
The face recognition method based on self-adaptation and color normalization provided by this embodiment differs from Example One as follows:
This example uses the CASIA-FaceV5 Asian face data set, with 2000 images as the training set and 500 images as the test set. The images were processed with the method of Example One; Table 1 compares speed and accuracy with and without the proposed method.
TABLE 1 — speed and accuracy comparison (rendered as an image in the original document)
As Table 1 shows, the adaptive mechanism improves detection speed, and color normalization of the periocular region improves face recognition accuracy.
EXAMPLE III
The present embodiment provides a face recognition system based on adaptive and color normalization, as shown in fig. 6, including:
a creation module 11 for creating a face image dataset;
the first processing module 12 is configured to perform adaptive scale selection processing on an image in a face data set based on a face detection network to obtain a new face image;
the second processing module 13 is configured to perform face positioning on the output new face image, and perform color normalization processing on the periocular region of the positioned face to obtain a processed face image;
the training module 14 is configured to input the processed face image into a face recognition network, calculate an euclidean distance between a vector of the processed face image and a prestored face image, determine whether the euclidean distance is smaller than a first preset threshold, if so, obtain a training sample, and input the training sample into the face recognition network for training to obtain a final face recognition network model;
and the recognition module 15 is configured to input the image to be recognized into a face recognition network model for face recognition, so as to obtain a final recognition result.
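The triplet screening performed by the training module can be sketched as below, under the assumption that the screening condition is the FaceNet-style semi-hard rule d(a,p) < d(a,n) < d(a,p) + margin; the function name and margin value are illustrative:

```python
import numpy as np

def semi_hard_triplets(anchors, positives, negatives, margin=0.2):
    """Filter candidate triplets <a, p, n>, keeping those where the
    negative is farther than the positive but still within the margin:
    d(a,p) < d(a,n) < d(a,p) + margin.  This semi-hard rule is an
    assumed reading of the patent's screening condition."""
    kept = []
    for a, p, n in zip(anchors, positives, negatives):
        d_ap = np.linalg.norm(np.asarray(a) - np.asarray(p))
        d_an = np.linalg.norm(np.asarray(a) - np.asarray(n))
        if d_ap < d_an < d_ap + margin:
            kept.append((a, p, n))
    return kept

# One semi-hard triplet (d_ap=0.1, d_an=0.2) and one easy triplet (d_an=2.0).
triplets = semi_hard_triplets(
    anchors=[[0.0], [0.0]],
    positives=[[0.1], [0.1]],
    negatives=[[0.2], [2.0]],
    margin=0.2,
)
```

Screening out easy triplets this way keeps only samples that still produce a useful gradient, which is why the training module filters before feeding the recognition network.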
It should be noted that the face recognition system based on self-adaptation and color normalization provided in this embodiment corresponds to the method of Example One, so its details are not repeated here.
Compared with the prior art, the invention reduces the redundancy of face detection frames through the proposed adaptive scale selection mechanism before a face image is input into the face detection network, and reduces the influence of factors such as lighting and color channels through color normalization of the periocular region before the detected face image is input into the face recognition network, thereby improving face recognition accuracy.
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims (10)

1. A face recognition method based on self-adaptation and color normalization is characterized by comprising the following steps:
s1, creating a face image data set;
s2, carrying out self-adaptive scale selection processing on the images in the face data set based on a face detection network to obtain new face images;
s3, carrying out face positioning on the output new face image, and carrying out color normalization processing on the periocular region of the positioned face to obtain a processed face image;
s4, inputting the processed face image into a face recognition network, calculating the Euclidean distance between the vector of the processed face image and a prestored face image, judging whether the Euclidean distance is smaller than a first preset threshold value, if so, obtaining a training sample, and inputting the training sample into the face recognition network for training to obtain a final face recognition network model;
and S5, inputting the image to be recognized into a face recognition network model for face recognition to obtain a final recognition result.
2. The method for face recognition based on self-adaptation and color normalization according to claim 1, wherein the step S2 specifically includes:
S21, converting each image P in the face data set according to a preset scaling ratio α_i into a new image P_i of resolution w_i × h_i, wherein i = 1, 2, …, n;
S22, inputting each converted new image P_i into the face detection network with 12 × 12 input resolution to obtain a confidence score map C_i of dimension m_i × k_i, and counting all elements of C_i that are greater than a second preset threshold to obtain their total number t_i;
S23, calculating the ratio r_i of the total number t_i to the resolution w_i × h_i of the image P_i, obtaining n ratios r_i; sorting the n ratios r_i and selecting the images corresponding to the first three r_i in the sorting as input data;
S24, inputting the first three selected images into the face detection network with 24 × 24 input resolution to adjust the detection frames, obtaining candidate frames whose confidence scores are greater than the second preset threshold and whose IoU is smaller than a third preset threshold, and storing the obtained candidate frames together with the other candidate frames in a candidate list;
and S25, inputting the candidate frames in the candidate list into the face detection network with 48 × 48 input resolution to adjust the detection frames, and taking the candidate frames whose IoU is smaller than the third preset threshold and whose confidence is greater than a fourth preset threshold as the final output face frames.
3. The adaptive and color-normalization based face recognition method according to claim 2, wherein adjusting the detection frames in the 24 × 24 resolution face detection network in step S24 comprises resizing each candidate to a size of 24 × 24 × 3 and obtaining a 1 × 2 confidence array indicating whether a face is present and a 1 × 4 boundary information array for constraining the boundary of each candidate bounding box.
4. The adaptive and color-normalization based face recognition method according to claim 2, wherein adjusting the detection frames input into the 48 × 48 resolution face detection network in step S25 comprises resizing each candidate to a size of 48 × 48 × 3 and obtaining a 1 × 2 confidence array indicating whether a face is present and a 1 × 4 boundary information array for constraining the boundary of each candidate bounding box.
5. The adaptive and color-normalization-based face recognition method according to claim 2, wherein the second preset threshold is 0.9; the third preset threshold value is 0.7; the fourth preset threshold is 0.95.
6. The adaptive and color-normalization-based face recognition method according to claim 2, wherein the step S3 specifically comprises:
s31, cutting the output face frame, and positioning the face in the face frame to obtain a positioned face image;
s32, performing 2D smoothing on the positioned face image to locate the periocular region, whose main part extends from the upper-left corner [0.375, 0.375] to the lower-right corner [0.625, 0.625] in relative coordinates;
and S33, calculating the three-channel color mean values over all face images, and smoothing the values of each channel of the periocular region of each face image toward the corresponding mean value to obtain the processed face image.
7. The method for face recognition based on self-adaptation and color normalization according to claim 1, wherein the step S4 specifically comprises:
defining a triplet in the face recognition network as <a, p, n>, where a and p are face images of the same person and n is a face image of another person; inputting <a, p, n> into the face recognition network, whose outputs f(a), f(p), f(n) satisfy ||f(a) − f(p)||² + α < ||f(a) − f(n)||², where α is a preset margin; and performing condition screening on the training samples, screening out suitable triplets, inputting them into the face recognition network, and training to obtain the final face recognition network model.
8. The adaptive and color normalization-based face recognition method according to claim 7, wherein the conditional screening of the training samples specifically comprises: selecting the triplets <a, p, n> that satisfy the condition ||f(a) − f(p)||² < ||f(a) − f(n)||² < ||f(a) − f(p)||² + α.
9. The adaptive and color-normalization based face recognition method according to claim 1, wherein the image to be recognized is input into the face recognition network model in step S5 for face recognition, specifically, the image to be recognized is processed based on steps S2-S4, and a final recognition result is obtained.
10. A face recognition system based on adaptation and color normalization, comprising:
the creation module is used for creating a face image data set;
the first processing module is used for carrying out self-adaptive scale selection processing on the images in the face data set based on a face detection network to obtain new face images;
the second processing module is used for carrying out face positioning on the output new face image and carrying out color normalization processing on the periocular region of the positioned face to obtain a processed face image;
the training module is used for inputting the processed face image into a face recognition network, calculating the Euclidean distance between the vector of the processed face image and a prestored face image, judging whether the Euclidean distance is smaller than a first preset threshold value, if so, obtaining a training sample, and inputting the training sample into the face recognition network for training to obtain a final face recognition network model;
and the recognition module is used for inputting the image to be recognized into the face recognition network model for face recognition to obtain a final recognition result.
CN202010848894.5A 2020-08-21 2020-08-21 Face recognition method and system based on self-adaption and color normalization Active CN111738242B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010848894.5A CN111738242B (en) 2020-08-21 2020-08-21 Face recognition method and system based on self-adaption and color normalization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010848894.5A CN111738242B (en) 2020-08-21 2020-08-21 Face recognition method and system based on self-adaption and color normalization

Publications (2)

Publication Number Publication Date
CN111738242A true CN111738242A (en) 2020-10-02
CN111738242B CN111738242B (en) 2020-12-25

Family

ID=72658720

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010848894.5A Active CN111738242B (en) 2020-08-21 2020-08-21 Face recognition method and system based on self-adaption and color normalization

Country Status (1)

Country Link
CN (1) CN111738242B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102902986A (en) * 2012-06-13 2013-01-30 上海汇纳网络信息科技有限公司 Automatic gender identification system and method
CN103902958A (en) * 2012-12-28 2014-07-02 重庆凯泽科技有限公司 Method for face recognition
CN110717481A (en) * 2019-12-12 2020-01-21 浙江鹏信信息科技股份有限公司 Method for realizing face detection by using cascaded convolutional neural network


Also Published As

Publication number Publication date
CN111738242B (en) 2020-12-25

Similar Documents

Publication Publication Date Title
JP3999964B2 (en) Multi-mode digital image processing method for eye detection
US9824258B2 (en) Method and apparatus for fingerprint identification
CN110348319B (en) Face anti-counterfeiting method based on face depth information and edge image fusion
JP4903854B2 (en) Object detection method in digital image
Guo et al. Eyes tell all: Irregular pupil shapes reveal gan-generated faces
JP4712563B2 (en) Face detection method, apparatus and program
JP4708909B2 (en) Method, apparatus and program for detecting object of digital image
JP4657934B2 (en) Face detection method, apparatus and program
US7366330B2 (en) Method, apparatus, and program for detecting faces
JP2003030667A (en) Method for automatically locating eyes in image
US8325998B2 (en) Multidirectional face detection method
JP4588575B2 (en) Method, apparatus and program for detecting multiple objects in digital image
EP1217572A2 (en) Digital image processing method and computer program product for detecting human irises in an image
CN108491786B (en) Face detection method based on hierarchical network and cluster merging
JPWO2019026104A1 (en) Information processing apparatus, information processing program, and information processing method
CN112784810A (en) Gesture recognition method and device, computer equipment and storage medium
JP2008146329A (en) Face feature point detection device and method
CN112434647A (en) Human face living body detection method
CN107153806B (en) Face detection method and device
CN111738242B (en) Face recognition method and system based on self-adaption and color normalization
CN112258536A (en) Integrated positioning and dividing method for corpus callosum and lumbricus cerebellum
JP4749884B2 (en) Learning method of face discriminating apparatus, face discriminating method and apparatus, and program
JP2011170890A (en) Face detecting method, face detection device, and program
CN110889418A (en) Gas contour identification method
CN112508168B (en) Frame regression neural network construction method based on automatic correction of prediction frame

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant