CN114842543A - Three-dimensional face recognition method and device, electronic equipment and storage medium - Google Patents

Three-dimensional face recognition method and device, electronic equipment and storage medium

Info

Publication number
CN114842543A
Authority
CN
China
Prior art keywords
point cloud
cloud data
layer
face
dimensional
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210615124.5A
Other languages
Chinese (zh)
Inventor
曹一波
刘顺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China Normal University
Original Assignee
South China Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China Normal University
Priority to CN202210615124.5A
Publication of CN114842543A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00 Three dimensional [3D] modelling, e.g. data description of 3D objects
    • G06T17/20 Finite element generation, e.g. wire-frame surface description, tesselation
    • G06T17/205 Re-meshing
    • G06T5/70 Denoising; Smoothing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/60 Analysis of geometric attributes
    • G06T7/66 Analysis of geometric attributes of image moments or centre of gravity
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/70 Determining position or orientation of objects or cameras
    • G06T7/73 Determining position or orientation of objects or cameras using feature-based methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/26 Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74 Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/761 Proximity, similarity or dissimilarity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/60 Type of objects
    • G06V20/64 Three-dimensional objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168 Feature extraction; Face representation
    • G06V40/169 Holistic features and representations, i.e. based on the facial image taken as a whole
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168 Feature extraction; Face representation
    • G06V40/171 Local features and components; Facial parts; Occluding parts, e.g. glasses; Geometrical relationships
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10028 Range image; Depth image; 3D point clouds
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20112 Image segmentation details
    • G06T2207/20132 Image cropping
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/30 Subject of image; Context of image processing
    • G06T2207/30196 Human being; Person
    • G06T2207/30201 Face

Abstract

The invention relates to a three-dimensional face recognition method, which comprises the following steps: acquiring three-dimensional face point cloud data to be recognized; inputting the three-dimensional face point cloud data to be recognized into an RP-net network model to obtain local-global face features; and carrying out similarity measurement on the local-global face features and the face features of a candidate face to obtain a similarity measurement value, and judging whether the three-dimensional face point cloud data to be recognized is the candidate face according to the similarity measurement value. Compared with the prior art, the invention provides a three-dimensional face recognition method that performs feature extraction on three-dimensional face point cloud data through an RP-net network model, which captures the local features and the global features of the face simultaneously during feature extraction; the local features describe the face in greater detail, so the recognition accuracy of the three-dimensional face can be improved.

Description

Three-dimensional face recognition method and device, electronic equipment and storage medium
Technical Field
The present invention relates to the field of three-dimensional face recognition technologies, and in particular, to a three-dimensional face recognition method, an apparatus, an electronic device, and a storage medium.
Background
With the continuously rising degree of informatization in modern society, the associated information security problems receive more and more attention, and all of them ultimately depend on the authentication of personal identity. Whether for personal privacy information, property security, or government confidential documents and administrative authority, the identity of the relevant personnel needs to be authenticated to ensure security. Traditional identity authentication means such as certificates, passwords, seals and cards all have defects and hidden dangers: authentication tokens such as certificates and cards are easily damaged or lost, and passwords are easily confused or forgotten. Owing to its reliability and convenience, the emerging biometric identification technology has advantages that traditional identity recognition and authentication technologies cannot match, and it has attracted wide attention and use in society.
Three-dimensional face recognition technology has unique advantages, mainly embodied as follows:
(1) Three-dimensional face recognition is invariant to illumination and pose: the three-dimensional shape data of a face can be regarded as unchanged under changes of lighting and viewing direction, and accessories such as makeup, which strongly affect two-dimensional image data, have no obvious influence on three-dimensional data.
(2) Three-dimensional data has a definite spatial shape representation, so its information is richer than two-dimensional images.
In existing three-dimensional face recognition methods, the position information and depth information of a three-dimensional image are represented by three-dimensional face point cloud data, and the point cloud data are classified and recognized by a deep learning model. However, existing deep learning models extract insufficiently detailed features from three-dimensional face point cloud data, and the recognition rate of three-dimensional face recognition is therefore low.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provide a three-dimensional face recognition method which can simultaneously extract the local features and the global features of three-dimensional face point cloud data and improve the recognition rate of three-dimensional face recognition.
The invention is realized by the following technical scheme: a three-dimensional face recognition method comprises the following steps:
acquiring three-dimensional face point cloud data to be recognized;
inputting the three-dimensional face point cloud data to be recognized into an RP-net network model to obtain local-global face features, wherein the RP-net network model comprises a sampling module, a grouping module and a feature extraction module, and the sampling module is used for sampling the three-dimensional face point cloud data to be recognized to obtain a plurality of key points;
the grouping module is used for taking each key point as a circle center and taking the feature extraction radius value r as a radius to obtain a plurality of circular areas; and grouping the three-dimensional face point cloud data to be recognized according to the circular areas to obtain a plurality of key point cloud data sets;
the feature extraction module comprises a first transformation layer, a local feature marking layer, a first MLP layer, a second transformation layer, a second MLP layer and a pooling layer, wherein the first transformation layer is used for carrying out an alignment operation on each key point cloud data set; the local feature marking layer is used for rotating the key point cloud data set output by the first transformation layer around the three-dimensional coordinate axes by T rotation angles {θ_k | k = 1, 2, ..., T} respectively, obtaining the rotated point cloud data sets Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k) corresponding to the three-dimensional coordinate axes; projecting the rotated point cloud data sets Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k) onto the three planes in which the three-dimensional coordinate axes lie, respectively, to obtain the corresponding projection point cloud data sets; dividing each projection point cloud data set evenly into N_b × N_b grids, and counting the point cloud data quantity in each grid to obtain the corresponding distribution matrices D_x, D_y, D_z; obtaining the corresponding central moments μ_mn and Shannon entropy e from the distribution matrices D_x, D_y, D_z respectively; obtaining the sub-feature descriptors f_x(θ_k), f_y(θ_k), f_z(θ_k) corresponding to the three-dimensional coordinate axes from the central moments μ_mn and the Shannon entropy e respectively; and aggregating the sub-feature descriptors f_x(θ_k), f_y(θ_k), f_z(θ_k) to obtain a local feature descriptor f; the first MLP layer is used for performing a dimension-raising operation on the key point cloud data set output by the first transformation layer according to the local feature descriptor f; the second transformation layer is used for carrying out an alignment operation on the key point cloud data set output by the first MLP layer; the second MLP layer is used for performing a dimension-raising operation on the key point cloud data set output by the second transformation layer; the pooling layer is used for performing a maximum pooling operation on the key point cloud data set output by the second MLP layer to obtain the local-global face features;
and carrying out similarity measurement on the local-global face features and the face features of a candidate face to obtain a similarity measurement value, and judging whether the three-dimensional face point cloud data to be recognized is the candidate face according to the similarity measurement value.
Compared with the prior art, the invention provides a three-dimensional face recognition method that performs feature extraction on the three-dimensional face point cloud data through an RP-net network model, which captures the local features and the global features of the face simultaneously during feature extraction; the local features describe the face in greater detail, so the recognition accuracy of the three-dimensional face can be improved.
Further, after acquiring the three-dimensional face point cloud data to be recognized, the method further comprises the following steps:
horizontally slicing the three-dimensional face point cloud data to be recognized to obtain a plurality of horizontal contour maps;
aiming at each horizontal contour map, placing a plurality of detection points on a horizontal contour edge line, setting a detection circle by taking each detection point as a circle center, obtaining detection distances from two intersection points of the detection circle and the horizontal contour edge line to corresponding detection points, and determining the detection point corresponding to the largest detection distance as a nose tip candidate point;
determining the nose tip candidate point with the maximum detection distance as a nose tip point;
and calculating the distance between each data point in the three-dimensional face point cloud data to be recognized and the nose tip point, and removing the data points with the distance larger than a preset distance.
Further, after acquiring the three-dimensional face point cloud data to be recognized, the method further comprises the following steps: for each data point in the three-dimensional face point cloud data to be recognized, acquiring the median coordinate within its neighborhood, and replacing the coordinate of the data point with the median coordinate.
Further, after, for each data point in the three-dimensional face point cloud data to be recognized, acquiring the median coordinate within its neighborhood and replacing the coordinate of the data point with the median coordinate, the method further comprises the following step: filling the missing holes in the three-dimensional face point cloud data to be recognized through cubic interpolation.
Further, after acquiring the three-dimensional face point cloud data to be recognized, the method further comprises the following steps: and acquiring a normal vector of each data point in the three-dimensional face point cloud data to be recognized, and adding the normal vector into the three-dimensional face point cloud data to be recognized.
Further, the central moment μ_mn is expressed as
μ_mn = Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} (i − ī)^m (j − j̄)^n D(i, j)
wherein
ī = Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} i · D(i, j), j̄ = Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} j · D(i, j),
D(i, j) denotes the element in the i-th row and j-th column of the distribution matrix D, D ∈ {D_x, D_y, D_z}, and μ_mn ∈ {μ_11, μ_12, μ_21, μ_22}.
Further, the similarity metric is a nearest neighbor distance ratio.
Based on the same inventive concept, the invention also provides a three-dimensional face recognition device, comprising:
the data acquisition module is used for acquiring three-dimensional face point cloud data to be recognized;
the system comprises a feature acquisition module, a feature extraction module and a feature extraction module, wherein the feature acquisition module is used for inputting the three-dimensional face point cloud data to be recognized into an RP-net network model to obtain local-global face features, the RP-net network model comprises a sampling module, a grouping module and a feature extraction module, and the sampling module is used for sampling the three-dimensional face point cloud data to be recognized to obtain a plurality of key points;
the grouping module is used for taking each key point as a circle center and taking the feature extraction radius value r as a radius to obtain a plurality of circular areas; and grouping the three-dimensional face point cloud data to be recognized according to the circular areas to obtain a plurality of key point cloud data sets;
the feature extraction module comprises a first transformation layer, a local feature marking layer, a first MLP layer, a second transformation layer, a second MLP layer and a pooling layer, wherein the first transformation layer is used for carrying out an alignment operation on each key point cloud data set; the local feature marking layer is used for rotating the key point cloud data set output by the first transformation layer around the three-dimensional coordinate axes by T rotation angles {θ_k | k = 1, 2, ..., T} respectively, obtaining the rotated point cloud data sets Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k) corresponding to the three-dimensional coordinate axes; projecting the rotated point cloud data sets Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k) onto the three planes in which the three-dimensional coordinate axes lie, respectively, to obtain the corresponding projection point cloud data sets; dividing each projection point cloud data set evenly into N_b × N_b grids, and counting the point cloud data quantity in each grid to obtain the corresponding distribution matrices D_x, D_y, D_z; obtaining the corresponding central moments μ_mn and Shannon entropy e from the distribution matrices D_x, D_y, D_z respectively; obtaining the sub-feature descriptors f_x(θ_k), f_y(θ_k), f_z(θ_k) corresponding to the three-dimensional coordinate axes from the central moments μ_mn and the Shannon entropy e respectively; and aggregating the sub-feature descriptors f_x(θ_k), f_y(θ_k), f_z(θ_k) to obtain a local feature descriptor f; the first MLP layer is used for performing a dimension-raising operation on the key point cloud data set output by the first transformation layer according to the local feature descriptor f; the second transformation layer is used for carrying out an alignment operation on the key point cloud data set output by the first MLP layer; the second MLP layer is used for performing a dimension-raising operation on the key point cloud data set output by the second transformation layer; the pooling layer is used for performing a maximum pooling operation on the key point cloud data set output by the second MLP layer to obtain the local-global face features;
and the matching module is used for carrying out similarity measurement on the local-global face features and the face features of a candidate face to obtain a similarity measurement value, and judging whether the three-dimensional face point cloud data to be recognized is the candidate face according to the similarity measurement value.
Based on the same inventive concept, the present invention also provides an electronic device, comprising:
a processor;
a memory for storing a computer program for execution by the processor;
wherein the processor implements the steps of the above method when executing the computer program.
Based on the same inventive concept, the present invention also provides a computer-readable storage medium, characterized in that a computer program is stored thereon, which, when executed, implements the steps of the above-described method.
For a better understanding and practice, the invention is described in detail below with reference to the accompanying drawings.
Drawings
Fig. 1 is a schematic flow chart of a three-dimensional face recognition method according to an embodiment;
FIG. 2 is a schematic diagram of a network structure of an embodiment of an RP-net network model;
fig. 3 is a schematic structural diagram of the three-dimensional face recognition apparatus according to the embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more clear, embodiments of the present application will be described in further detail below with reference to the accompanying drawings.
It should be understood that the described embodiments are only some embodiments of the invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present application. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the application, as detailed in the appended claims.
In the description of the present application, it is to be understood that the terms "first," "second," "third," and the like are used solely for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order, nor should be construed to indicate or imply relative importance. The specific meaning of the above terms in the present application can be understood by those of ordinary skill in the art as appropriate. Further, in the description of the present application, "a plurality" means two or more unless otherwise specified. "and/or" describes the association relationship of the associated objects, meaning that there may be three relationships, e.g., a and/or B, which may mean: a exists alone, A and B exist simultaneously, and B exists alone. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship.
Please refer to fig. 1, which is a schematic flow chart of the three-dimensional face recognition method of the present embodiment; the method includes steps S1 to S3:
and step S1, acquiring the three-dimensional face point cloud data to be recognized, and preprocessing the three-dimensional face point cloud data.
The three-dimensional face point cloud data is a representation mode of three-dimensional face information, the three-dimensional face point cloud data comprises a plurality of data points, each data point corresponds to a three-dimensional coordinate in a three-dimensional coordinate system, and the data points with the three-dimensional coordinates jointly form a three-dimensional face. The three-dimensional axes in the three-dimensional coordinate system are respectively an x axis, a y axis and a z axis, the plane where the x axis and the y axis are located is an xy plane, the plane where the x axis and the z axis are located is an xz plane, and the plane where the y axis and the z axis are located is a yz plane.
The three-dimensional face point cloud data can be acquired by scanning the face through special scanning equipment, and can be stored in any computer storage medium such as a memory.
The preprocessing comprises a face cropping operation, a face smoothing operation and a data enhancement operation. The face cropping operation is used for cropping away redundant data points other than the face, such as the body below the shoulders, when the three-dimensional face point cloud data to be recognized contains such points, so as to obtain three-dimensional face point cloud data dominated by the face. The face cropping operation specifically comprises the following steps S111 to S114:
S111: horizontally slicing the three-dimensional face point cloud data to be recognized to obtain a plurality of horizontal contour maps of the three-dimensional face.
In a preferred embodiment, step S111 may also perform uniform interpolation on the obtained horizontal contour maps to fill the holes in the horizontal contour maps.
S112: in each horizontal contour map, placing a plurality of detection points on the horizontal contour edge line at a certain density, setting a detection circle with a fixed radius centered at each detection point, obtaining the detection distance h from the two intersection points of the detection circle with the horizontal contour edge line to the corresponding detection point, and determining the detection point corresponding to the maximum detection distance h as a nose tip candidate point.
S113: carrying out refined screening on the nose tip candidate points to determine the nose tip point.
In an alternative implementation, the nose tip candidate points can be regarded as a set of data points on the bridge of the nose, and the nose tip candidate point corresponding to the maximum detection distance h is determined as the nose tip point.
In an alternative implementation, the nose tip candidate points can be subjected to the refined screening by a random sample consensus (RANSAC) algorithm.
S114: calculating the distance between each data point in the three-dimensional face point cloud data to be recognized and the nose tip point, and removing the data points whose distance is greater than a predetermined distance, thereby cropping out three-dimensional face point cloud data dominated by the face.
In one implementation, the predetermined distance in step S114 may be set to 90 mm.
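The per-slice candidate search of step S112, together with the simple screening of step S113, can be sketched as follows. This is a minimal NumPy reading of the scheme, assuming each horizontal slice has already been reduced to an ordered 2-D contour polyline; the 20 mm detection radius and the interpretation of the detection distance h as the distance from a detection point to the chord joining the two circle/contour intersections are assumptions, not fixed by the text.

```python
import numpy as np

def nose_tip_candidate(contour, radius=20.0):
    """Return the contour point with the largest detection distance h for
    one horizontal contour, given as an ordered (N, 2) array.

    For each detection point, the circle/contour intersections are
    approximated by the contour point on each side whose distance to the
    centre is closest to the radius; h is then the perpendicular distance
    from the detection point to the chord joining those two intersections,
    a convexity measure that peaks at the nose tip."""
    n = len(contour)
    best_h, best_pt = -1.0, None
    for i, c in enumerate(contour):
        d = np.linalg.norm(contour - c, axis=1)
        left, right = np.arange(i), np.arange(i + 1, n)
        if len(left) == 0 or len(right) == 0:
            continue
        a = contour[left[np.argmin(np.abs(d[left] - radius))]]
        b = contour[right[np.argmin(np.abs(d[right] - radius))]]
        chord = b - a
        length = np.linalg.norm(chord)
        if length < 1e-9:
            continue
        v = c - a
        h = abs(chord[0] * v[1] - chord[1] * v[0]) / length  # point-to-chord
        if h > best_h:
            best_h, best_pt = h, c
    return best_pt, best_h
```

Taking the candidate with the largest h over all slices then gives the nose tip point of step S113.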
The face smoothing operation is used for eliminating noise spikes in the three-dimensional face point cloud data to be recognized; such spikes appear easily in areas such as the eyes, the nose tip and the teeth, and eliminating them improves the subsequent recognition accuracy. The operation specifically comprises step S121: for each data point of the three-dimensional face point cloud data to be recognized, acquiring the median coordinate within its neighborhood and replacing the coordinate of the data point with the median coordinate.
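A minimal sketch of the median smoothing of step S121 follows; the k-nearest-neighbour definition of the neighbourhood and the size k = 25 are assumptions, since the text does not fix how the neighbourhood is formed.

```python
import numpy as np
from scipy.spatial import cKDTree

def median_smooth(points, k=25):
    """Replace each point's coordinates with the per-axis median of its
    k nearest neighbours; points is an (N, 3) array."""
    _, idx = cKDTree(points).query(points, k=k)  # (N, k) neighbour indices
    return np.median(points[idx], axis=1)        # per-axis median
```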
Since hole defects may also appear on the three-dimensional surface of the point cloud while the noise spikes are being eliminated, and holes are likewise caused during acquisition by dark facial regions, an open mouth, occluding objects, and specular reflection from the sclera, pupil and eyelashes, in a preferred embodiment the face smoothing operation may further comprise step S122: filling the holes by cubic interpolation. The holes can thereby be eliminated to improve the recognition accuracy.
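Step S122 can be sketched with SciPy's scattered-data interpolation. The sketch assumes the face cloud has been resampled onto a regular x/y grid so that z behaves as a depth image with holes stored as NaN; griddata and the nearest-neighbour fallback outside the convex hull are implementation choices, not prescribed by the text.

```python
import numpy as np
from scipy.interpolate import griddata

def fill_holes(depth):
    """Fill NaN holes in an m x n depth map by cubic interpolation of
    the valid samples."""
    gy, gx = np.mgrid[0:depth.shape[0], 0:depth.shape[1]]
    valid = ~np.isnan(depth)
    pts, vals = (gy[valid], gx[valid]), depth[valid]
    filled = griddata(pts, vals, (gy, gx), method="cubic")
    holes = np.isnan(filled)       # cubic leaves NaNs outside the hull
    if holes.any():
        filled[holes] = griddata(pts, vals, (gy[holes], gx[holes]),
                                 method="nearest")
    return filled
```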
The data enhancement operation is used for acquiring the normal vector of each data point in the three-dimensional face point cloud data to be recognized and adding these normal vectors to the three-dimensional face point cloud data. The normal vectors of the data points expose more local features and facilitate the extraction and recognition of facial features. Specifically, the normal vector of each data point is the normal vector of the local surface at that data point, and it can be obtained by fitting the local neighborhood of the data point through the minimization of a cost function. A three-dimensional face point cloud containing m data points is expressed as P = [p_1, p_2, ..., p_m]^T, where the three-dimensional coordinate of the i-th data point is p_i = [p_ix, p_iy, p_iz]^T and its normal vector is n_i = [n_ix, n_iy, n_iz]^T, with n_ix, n_iy, n_iz the normal components along the x, y and z channels of the three-dimensional coordinate system. The set of l neighboring points around the data point p_i is denoted Q_i = [p_i1, p_i2, ..., p_il]^T. Minimizing the cost function min A drives the dot products between the normal vector of the data point p_i and the vectors lying in the plane fitted to the neighbor set Q_i to 0, i.e. the normal vector of the data point p_i is perpendicular to the plane formed by the neighbor set Q_i. The expression of the minimized cost function min A is
min A = min_{n_i} Σ_{j=1}^{l} ((p_ij − p̄_i) · n_i)², subject to ‖n_i‖ = 1,
where p̄_i = (1/l) Σ_{j=1}^{l} p_ij is the centroid of the neighbor set Q_i.
In a specific implementation, a 5 × 5 neighborhood can be selected for the data enhancement operation, which yields m × n × 6 three-dimensional face point cloud data, where m and n describe the data range size of the three-dimensional face point cloud data.
Step S2: inputting the three-dimensional face point cloud data preprocessed in step S1 into an RP-net network model to obtain local-global face features.
Please refer to fig. 2, which is a schematic diagram of the network structure of the RP-net network model of this embodiment. The RP-net network model comprises a sampling module, a grouping module and a feature extraction module. The sampling module is used for downsampling the three-dimensional face point cloud data, specifically by farthest point sampling (FPS), to obtain a plurality of key points, where the initial key point may be a random data point or the nose tip point obtained in step S113.
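Farthest point sampling itself is compact; a sketch is given below, with the initial key point chosen as the text allows, either at random or as the nose tip index from step S113.

```python
import numpy as np

def farthest_point_sampling(points, n_samples, start=None):
    """Greedily pick n_samples indices so that each new point is the one
    farthest from the set already chosen."""
    chosen = np.empty(n_samples, dtype=int)
    chosen[0] = np.random.randint(len(points)) if start is None else start
    dist = np.linalg.norm(points - points[chosen[0]], axis=1)
    for i in range(1, n_samples):
        chosen[i] = int(np.argmax(dist))   # farthest from current set
        new_d = np.linalg.norm(points - points[chosen[i]], axis=1)
        dist = np.minimum(dist, new_d)     # distance to nearest chosen point
    return chosen
```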
The grouping module is used for grouping the three-dimensional face point cloud data according to the key points acquired by the sampling module. Specifically, each key point is taken as a circle center and the feature extraction radius value r is taken as a radius to obtain a plurality of circular areas; the three-dimensional face point cloud data are divided into a plurality of key point cloud data sets according to the circular areas, and each key point cloud data set is input successively into the feature extraction module. When the RP-net network model is trained, the feature extraction radius value r in the grouping module can be set to a number of different stepped values, and the final feature extraction radius value r is determined according to the recognition rate of the model.
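The grouping step is a ball query around each key point; a sketch follows, in which the cap max_pts that pads or truncates every group to a fixed size is an assumption made for batching, not part of the text.

```python
import numpy as np
from scipy.spatial import cKDTree

def group_by_radius(points, keypoints, r, max_pts=64):
    """Collect the points within radius r of each key point; returns an
    (n_keypoints, max_pts, 3) array. Assumes the key points were sampled
    from `points`, so no group is empty."""
    tree = cKDTree(points)
    groups = []
    for idx in tree.query_ball_point(keypoints, r):
        idx = np.resize(np.asarray(idx), max_pts)  # truncate or repeat-pad
        groups.append(points[idx])
    return np.stack(groups)
```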
The feature extraction module is used for extracting features from each key point cloud data set output by the grouping module. Specifically, the feature extraction module comprises a first transformation layer, a local feature marking layer, a first MLP layer, a second transformation layer, a second MLP layer and a pooling layer, wherein the first transformation layer is used for performing an alignment operation on each key point cloud data set through a T-net and matrix multiplication.
The local feature marking layer is used for acquiring a local feature descriptor of the key point cloud data set output by the first transformation layer and marking the local features of the key point cloud data set through the local feature descriptor. The method specifically comprises the following steps: rotating the key point cloud data set around the x-axis, the y-axis and the z-axis of the three-dimensional coordinate system by T rotation angles {θ_k | k = 1, 2, ..., T} respectively, obtaining the corresponding three groups of rotated point cloud data sets Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k); projecting each rotated point cloud data set Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k) onto the xy, xz and yz planes respectively to obtain the projection point cloud data sets corresponding to the three planes; dividing each projection point cloud data set evenly into N_b × N_b grids and counting the point cloud data quantity in each grid, thereby obtaining, for the three groups of rotated point cloud data sets, the corresponding distribution matrices D_x, D_y, D_z of size N_b × N_b; normalizing the distribution matrices D_x, D_y, D_z to achieve invariance to changes of grid resolution; and compressing the distribution matrices D_x, D_y, D_z to obtain the central moments μ_mn and the Shannon entropy e, where the central moment μ_mn is expressed as
μ_mn = Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} (i − ī)^m (j − j̄)^n D(i, j)
wherein
ī = Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} i · D(i, j), j̄ = Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} j · D(i, j),
D(i, j) denotes the element in the i-th row and j-th column of the distribution matrix D, and D ∈ {D_x, D_y, D_z}. In this embodiment, μ_mn ∈ {μ_11, μ_12, μ_21, μ_22}.
The expression of the Shannon entropy e is
e = −Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} D(i, j) · log D(i, j).
According to the central moments μ_mn and the Shannon entropy e, the sub-feature descriptor f_x(θ_k) for rotation around the x-axis, the sub-feature descriptor f_y(θ_k) for rotation around the y-axis and the sub-feature descriptor f_z(θ_k) for rotation around the z-axis are obtained.
The sub-feature descriptors f_x(θ_k), f_y(θ_k) and f_z(θ_k) are aggregated to obtain the local feature descriptor f, whose expression is
f = {f_x(θ_k), f_y(θ_k), f_z(θ_k)}
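This rotate-project-compress construction mirrors rotational projection statistics (RoPS)-style descriptors. The sketch below computes, for one key point cloud data set, the normalized distribution matrix of each projection together with its central moments μ_11, μ_12, μ_21, μ_22 and Shannon entropy, and aggregates them over rotations about the three axes; the grid resolution N_b = 5 and the angle count T = 4 are assumptions, and the rotation sign conventions are immaterial for the statistics.

```python
import numpy as np

def distribution_stats(proj, n_b=5):
    """Compress one projected 2-D point set into [mu_11, mu_12, mu_21,
    mu_22, e] via its normalized n_b x n_b distribution matrix."""
    D, _, _ = np.histogram2d(proj[:, 0], proj[:, 1], bins=n_b)
    D /= max(D.sum(), 1e-12)                 # normalization step
    i = np.arange(1, n_b + 1)
    i_bar = (i[:, None] * D).sum()           # centroid row index
    j_bar = (i[None, :] * D).sum()           # centroid column index
    ci, cj = i[:, None] - i_bar, i[None, :] - j_bar
    mu = lambda m, n: ((ci ** m) * (cj ** n) * D).sum()
    e = -(D[D > 0] * np.log(D[D > 0])).sum()  # Shannon entropy
    return [mu(1, 1), mu(1, 2), mu(2, 1), mu(2, 2), e]

def local_descriptor(patch, thetas=np.linspace(0, np.pi, 4, endpoint=False)):
    """Local feature descriptor f of one (N, 3) key point cloud data set."""
    def rot(axis, t):                        # rotation matrix about one axis
        c, s = np.cos(t), np.sin(t)
        a, b = [(1, 2), (0, 2), (0, 1)][axis]
        R = np.eye(3)
        R[a, a] = R[b, b] = c
        R[a, b], R[b, a] = -s, s
        return R
    f = []
    for axis in range(3):                    # Q'_x, Q'_y, Q'_z rotations
        for t in thetas:
            q = patch @ rot(axis, t).T
            for plane in [(0, 1), (0, 2), (1, 2)]:  # xy, xz, yz projections
                f += distribution_stats(q[:, plane])
    return np.asarray(f)
```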
The first MLP layer is a multilayer perceptron and is used for performing a dimension-raising operation on each key point cloud data set according to the local feature descriptor f.
The second transformation layer is used for performing an alignment operation on the key point cloud data set output by the first MLP layer through the T-net and matrix multiplication.
The second MLP layer is a multilayer perceptron and is used for performing a second dimension-raising operation on the key point cloud data set output by the second transformation layer; in a specific implementation, the feature dimension after the second dimension-raising operation is 1024.
The pooling layer is used for performing a maximum pooling operation on the key point cloud data set output by the second MLP layer to obtain the local-global face features.
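A compact PyTorch reading of the dimension-raising and pooling stages is sketched below. The T-net alignment layers are omitted for brevity, and every layer width except the final 1024 is an assumption; the input channels would be the per-point coordinates (plus normals) concatenated with the local feature descriptor f.

```python
import torch
import torch.nn as nn

class FeatureExtractor(nn.Module):
    """Shared per-point MLPs followed by max pooling over each group."""
    def __init__(self, in_dim, feat_dim=1024):
        super().__init__()
        self.mlp1 = nn.Sequential(            # first MLP layer (dim-raising)
            nn.Conv1d(in_dim, 64, 1), nn.BatchNorm1d(64), nn.ReLU(),
            nn.Conv1d(64, 128, 1), nn.BatchNorm1d(128), nn.ReLU())
        self.mlp2 = nn.Sequential(            # second MLP layer, up to 1024
            nn.Conv1d(128, feat_dim, 1), nn.BatchNorm1d(feat_dim), nn.ReLU())

    def forward(self, x):                     # x: (batch, in_dim, n_points)
        x = self.mlp2(self.mlp1(x))
        return torch.max(x, dim=2).values     # max pooling over the group
```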
Step S3: performing a similarity measurement between the local-global face features obtained in step S2 and the face features of a candidate face, and judging whether the three-dimensional face point cloud data to be recognized is the candidate face according to the similarity measurement result.
In one embodiment, a nearest neighbor distance ratio (NNDR) may be used to perform the similarity measurement between the local-global face features and the face features of the candidate faces; the obtained nearest neighbor distance ratio is compared with a preset comparison threshold, and when the nearest neighbor distance ratio is greater than the preset comparison threshold, the face corresponding to the three-dimensional face point cloud data to be recognized can be determined to be the corresponding candidate face. In a specific implementation, the preset comparison threshold can be set to a number of different stepped values, and the final preset comparison threshold is determined according to the recognition accuracy of the results.
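The nearest neighbor distance ratio itself can be sketched as follows; how the resulting ratio is compared against the preset comparison threshold follows the text above.

```python
import numpy as np

def nndr(query, gallery):
    """Ratio of the distances from a query feature vector to its nearest
    and second-nearest gallery features; gallery is (M, d) with M >= 2.
    Also returns the index of the nearest candidate face."""
    d = np.linalg.norm(gallery - query, axis=1)
    first, second = np.partition(d, 1)[:2]   # two smallest distances
    return first / max(second, 1e-12), int(np.argmin(d))
```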
Experiments were carried out on the public three-dimensional face dataset Bosphorus using the three-dimensional face recognition method of this embodiment, a three-dimensional face recognition method based on voxel representation of depth images, and a large-scale 3D face recognition method, with experimental environments covering both a normal environment and a low-light environment, and the recognition rates in the corresponding environments were obtained. Among them, the three-dimensional face recognition method based on voxel representation of depth images ("3D face recognition based on volumetric representation of range image") is the method disclosed by Koushik Dutta, Debotosh Bhattacharjee, Mita Nasipuri and Anik Poddar in Advanced Computing and Systems for Security, 2019; the large-scale 3D face recognition method ("Towards large-scale 3D face recognition") is the method disclosed by Syed Zulqarnain Gilani and Ajmal Mian at the 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA). As shown in Table 1, compared with these two prior-art three-dimensional face recognition methods, the three-dimensional face recognition method of this embodiment improves the recognition rate in both the normal environment and the low-light environment.
TABLE 1
(Table 1: recognition rates of the three methods on the Bosphorus dataset under the normal and low-light environments; the table values are not recoverable from the source images.)
Compared with the prior art, this embodiment performs feature extraction on the three-dimensional face point cloud data through the RP-net network model, which captures the local features and the global features of the face simultaneously during feature extraction; the local features describe the face in greater detail, so the accuracy of three-dimensional face recognition can be improved. In addition, because surface normal vectors are added to the three-dimensional face point cloud data input into the RP-net network model, the three-dimensional face features under weak-light or dark environments are enhanced, and high recognition accuracy can be achieved in weak-light and dark environments.
Based on the same inventive concept, the invention also provides a three-dimensional face recognition device. Please refer to fig. 3, which is a schematic structural diagram of the three-dimensional face recognition device of this embodiment. The device comprises a data acquisition module 10, a feature acquisition module 20 and a matching module 30, wherein the data acquisition module 10 is used for acquiring the three-dimensional face point cloud data to be recognized and preprocessing it; the preprocessing includes a face cropping operation, a face smoothing operation and a data enhancement operation whose specific step flows are the same as those described in the foregoing method embodiment and are not repeated here;
the feature obtaining module 20 is configured to input the three-dimensional face point cloud data preprocessed in the data obtaining module 10 into an RP-net network model to obtain local-global face features, where a network structure and a process flow of steps included in the RP-net network model are completely the same as those of the RP-net network model described in the above method embodiment, and are not described herein again.
The matching module 30 is configured to perform similarity measurement on the local-global face features obtained in the feature obtaining module 20 and face features of a candidate face, and determine whether the three-dimensional face point cloud data to be recognized is the candidate face according to a similarity measurement result.
For the device embodiments, reference is made to the description of the method embodiments for relevant details, since they correspond essentially to the method embodiments.
Based on the same inventive concept, the invention also provides an electronic device, which can be a server, a desktop computing device or a mobile computing device (e.g., a laptop computing device, a handheld computing device, a tablet computer, a netbook, etc.), or another terminal device. The electronic device comprises one or more processors and a memory, wherein the processor is used for executing a computer program to implement the three-dimensional face recognition method of the method embodiment, and the memory is used for storing the computer program executable by the processor. The electronic device can also comprise a display screen for displaying the retrieval result image obtained by the processor.
Based on the same inventive concept, the present invention further provides a computer-readable storage medium, corresponding to the aforementioned embodiments of the three-dimensional face recognition method, wherein the computer-readable storage medium has a computer program stored thereon, and the computer program, when executed by a processor, implements the steps of the three-dimensional face recognition method described in any of the above embodiments.
This application may take the form of a computer program product embodied on one or more storage media (including, but not limited to, disk storage, CD-ROM, optical storage, etc.) having program code embodied therein. Computer-usable storage media include permanent and non-permanent, removable and non-removable media, and information storage may be implemented by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of the storage medium of the computer include, but are not limited to: phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technologies, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic tape storage or other magnetic storage devices, or any other non-transmission medium, may be used to store information that may be accessed by a computing device.
The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, to those skilled in the art, changes and modifications may be made without departing from the spirit of the present invention, and it is intended that the present invention encompass such changes and modifications.

Claims (10)

1. A three-dimensional face recognition method is characterized by comprising the following steps:
acquiring three-dimensional face point cloud data to be recognized;
inputting the three-dimensional face point cloud data to be recognized into an RP-net network model to obtain local-global face features, wherein the RP-net network model comprises a sampling module, a grouping module and a feature extraction module, and the sampling module is used for sampling the three-dimensional face point cloud data to be recognized to obtain a plurality of key points;
the grouping module is used for taking each key point as a circle center and taking the feature extraction radius value r as a radius to obtain a plurality of circular areas; and grouping the three-dimensional face point cloud data to be recognized according to the circular areas to obtain a plurality of key point cloud data sets;
the feature extraction module comprises a first transformation layer, a local feature marking layer, a first MLP layer, a second transformation layer, a second MLP layer and a pooling layer, wherein the first transformation layer is used for carrying out an alignment operation on each key point cloud data set; the local feature marking layer is used for rotating the key point cloud data set output by the first transformation layer around the three-dimensional coordinate axes by T rotation angles {θ_k | k = 1, 2, ..., T} respectively, obtaining the rotated point cloud data sets Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k) corresponding to the three-dimensional coordinate axes; projecting the rotated point cloud data sets Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k) onto the three planes in which the three-dimensional coordinate axes lie, respectively, to obtain the corresponding projection point cloud data sets; dividing each projection point cloud data set evenly into N_b × N_b grids, and counting the point cloud data quantity in each grid to obtain the corresponding distribution matrices D_x, D_y, D_z; obtaining the corresponding central moments μ_mn and Shannon entropy e from the distribution matrices D_x, D_y, D_z respectively; obtaining the sub-feature descriptors f_x(θ_k), f_y(θ_k), f_z(θ_k) corresponding to the three-dimensional coordinate axes from the central moments μ_mn and the Shannon entropy e respectively; and aggregating the sub-feature descriptors f_x(θ_k), f_y(θ_k), f_z(θ_k) to obtain a local feature descriptor f; the first MLP layer is used for performing a dimension-raising operation on the key point cloud data set output by the first transformation layer according to the local feature descriptor f; the second transformation layer is used for carrying out an alignment operation on the key point cloud data set output by the first MLP layer; the second MLP layer is used for performing a dimension-raising operation on the key point cloud data set output by the second transformation layer; the pooling layer is used for performing a maximum pooling operation on the key point cloud data set output by the second MLP layer to obtain the local-global face features;
and carrying out similarity measurement on the local-global face features and the face features of a candidate face to obtain a similarity measurement value, and judging whether the three-dimensional face point cloud data to be recognized is the candidate face according to the similarity measurement value.
2. The method of claim 1, wherein after acquiring the three-dimensional face point cloud data to be recognized, the method further comprises the steps of:
horizontally slicing the three-dimensional face point cloud data to be recognized to obtain a plurality of horizontal contour maps;
aiming at each horizontal contour map, placing a plurality of detection points on a horizontal contour edge line, setting a detection circle by taking each detection point as a circle center, obtaining detection distances from two intersection points of the detection circle and the horizontal contour edge line to corresponding detection points, and determining the detection point corresponding to the largest detection distance as a nose tip candidate point;
determining the nose tip candidate point with the maximum detection distance as a nose tip point;
and calculating the distance between each data point in the three-dimensional human face point cloud data to be recognized and the nose tip point, and removing the data points with the distance larger than a preset distance.
3. The method of claim 1, wherein after acquiring the three-dimensional face point cloud data to be recognized, the method further comprises the steps of: for each data point in the three-dimensional face point cloud data to be recognized, acquiring the median coordinate within its neighborhood, and replacing the coordinate of the data point with the median coordinate.
4. The method of claim 3, wherein after, for each data point in the three-dimensional face point cloud data to be recognized, acquiring the median coordinate within its neighborhood and replacing the coordinate of the data point with the median coordinate, the method further comprises the following step: filling the missing holes in the three-dimensional face point cloud data to be recognized through cubic interpolation.
5. The method of claim 1, wherein after acquiring the three-dimensional face point cloud data to be recognized, the method further comprises the steps of: and acquiring a normal vector of each data point in the three-dimensional face point cloud data to be recognized, and adding the normal vector into the three-dimensional face point cloud data to be recognized.
6. The method according to claim 1, characterized in that the central moment μ_mn is expressed as
μ_mn = Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} (i − ī)^m (j − j̄)^n D(i, j)
wherein
ī = Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} i · D(i, j), j̄ = Σ_{i=1}^{N_b} Σ_{j=1}^{N_b} j · D(i, j),
D(i, j) denotes the element in the i-th row and j-th column of the distribution matrix D, D ∈ {D_x, D_y, D_z}, and μ_mn ∈ {μ_11, μ_12, μ_21, μ_22}.
7. The method of claim 1, wherein the similarity metric is a nearest neighbor distance ratio.
8. A three-dimensional face recognition apparatus, comprising:
the data acquisition module is used for acquiring three-dimensional face point cloud data to be recognized;
the system comprises a feature acquisition module, a feature extraction module and a feature extraction module, wherein the feature acquisition module is used for inputting the three-dimensional face point cloud data to be recognized into an RP-net network model to obtain local-global face features, the RP-net network model comprises a sampling module, a grouping module and a feature extraction module, and the sampling module is used for sampling the three-dimensional face point cloud data to be recognized to obtain a plurality of key points;
the grouping module is used for taking each key point as a circle center and taking the feature extraction radius value r as a radius to obtain a plurality of circular areas; and grouping the three-dimensional face point cloud data to be recognized according to the circular areas to obtain a plurality of key point cloud data sets;
the feature extraction module comprises a first transformation layer, a local feature marking layer, a first MLP layer, a second transformation layer, a second MLP layer and a pooling layer, wherein the first transformation layer is used for carrying out an alignment operation on each key point cloud data set; the local feature marking layer is used for rotating the key point cloud data set output by the first transformation layer around the three-dimensional coordinate axes by T rotation angles {θ_k | k = 1, 2, ..., T} respectively, obtaining the rotated point cloud data sets Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k) corresponding to the three-dimensional coordinate axes; projecting the rotated point cloud data sets Q′_x(θ_k), Q′_y(θ_k), Q′_z(θ_k) onto the three planes in which the three-dimensional coordinate axes lie, respectively, to obtain the corresponding projection point cloud data sets; dividing each projection point cloud data set evenly into N_b × N_b grids, and counting the point cloud data quantity in each grid to obtain the corresponding distribution matrices D_x, D_y, D_z; obtaining the corresponding central moments μ_mn and Shannon entropy e from the distribution matrices D_x, D_y, D_z respectively; obtaining the sub-feature descriptors f_x(θ_k), f_y(θ_k), f_z(θ_k) corresponding to the three-dimensional coordinate axes from the central moments μ_mn and the Shannon entropy e respectively; and aggregating the sub-feature descriptors f_x(θ_k), f_y(θ_k), f_z(θ_k) to obtain a local feature descriptor f; the first MLP layer is used for performing a dimension-raising operation on the key point cloud data set output by the first transformation layer according to the local feature descriptor f; the second transformation layer is used for carrying out an alignment operation on the key point cloud data set output by the first MLP layer; the second MLP layer is used for performing a dimension-raising operation on the key point cloud data set output by the second transformation layer; the pooling layer is used for performing a maximum pooling operation on the key point cloud data set output by the second MLP layer to obtain the local-global face features;
and the matching module is used for carrying out similarity measurement on the local-global face features and the face features of a candidate face to obtain a similarity measurement value, and judging whether the three-dimensional face point cloud data to be recognized is the candidate face according to the similarity measurement value.
9. An electronic device, comprising:
a processor;
a memory for storing a computer program for execution by the processor;
wherein the processor, when executing the computer program, implements the steps of the method of any one of claims 1-7.
10. A computer-readable storage medium, having a computer program stored thereon, wherein the computer program, when executed, performs the steps of the method of any of claims 1-7.
CN202210615124.5A 2022-06-01 2022-06-01 Three-dimensional face recognition method and device, electronic equipment and storage medium Pending CN114842543A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210615124.5A CN114842543A (en) 2022-06-01 2022-06-01 Three-dimensional face recognition method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210615124.5A CN114842543A (en) 2022-06-01 2022-06-01 Three-dimensional face recognition method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN114842543A (en) 2022-08-02

Family

ID=82572657

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210615124.5A Pending CN114842543A (en) 2022-06-01 2022-06-01 Three-dimensional face recognition method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN114842543A (en)


Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105243374A (en) * 2015-11-02 2016-01-13 湖南拓视觉信息技术有限公司 Three-dimensional human face recognition method and system, and data processing device applying same
WO2020199693A1 (en) * 2019-03-29 2020-10-08 中国科学院深圳先进技术研究院 Large-pose face recognition method and apparatus, and device
CN110147721A (en) * 2019-04-11 2019-08-20 阿里巴巴集团控股有限公司 A kind of three-dimensional face identification method, model training method and device
WO2021051539A1 (en) * 2019-09-18 2021-03-25 平安科技(深圳)有限公司 Face recognition method and apparatus, and terminal device

Similar Documents

Publication Publication Date Title
Pflug et al. Ear biometrics: a survey of detection, feature extraction and recognition methods
JP6681729B2 (en) Method for determining 3D pose of object and 3D location of landmark point of object, and system for determining 3D pose of object and 3D location of landmark of object
CN108182397B (en) Multi-pose multi-scale human face verification method
CN110728196B (en) Face recognition method and device and terminal equipment
JP2013012190A (en) Method of approximating gabor filter as block-gabor filter, and memory to store data structure for access by application program running on processor
CN111091075B (en) Face recognition method and device, electronic equipment and storage medium
Islam et al. A review of recent advances in 3D ear-and expression-invariant face biometrics
CN102509138A (en) Authentication system based on second-generation ID card and human face feature recognition and working method therefor
CN108960088A (en) The detection of facial living body characteristics, the recognition methods of specific environment
US20200218885A1 (en) Detecting artificial facial images using facial landmarks
Soltanpour et al. Multimodal 2D–3D face recognition using local descriptors: pyramidal shape map and structural context
CN112686191B (en) Living body anti-counterfeiting method, system, terminal and medium based on three-dimensional information of human face
Li et al. An efficient 3D face recognition method using geometric features
CN106778491A (en) The acquisition methods and equipment of face 3D characteristic informations
CN110866442B (en) Real-time face recognition-based testimony-of-person integrated checking system and method
Kordelas et al. Viewpoint independent object recognition in cluttered scenes exploiting ray-triangle intersection and SIFT algorithms
CN114842543A (en) Three-dimensional face recognition method and device, electronic equipment and storage medium
Zhang et al. Face recognition using SIFT features under 3D meshes
Hasan et al. 3-d face recognition using improved 3d mixed transform
Gao et al. Expression robust 3D face recognition by matching multi-component local shape descriptors on the nasal and adjoining cheek regions
CN112395919B (en) Off-line signature authentication method and system based on multi-feature metric learning
Liu et al. Recognizing occluded 3D faces using an efficient ICP variant
Purahong et al. Hybrid Facial Features with Application in Person Identification.
Pears RBF shape histograms and their application to 3D face processing
Gavrilova Computational geometry and image processing in biometrics: On the path to convergence

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination