WO2019033571A1

WO2019033571A1 - Facial feature point detection method, apparatus and storage medium

Info

Publication number: WO2019033571A1
Application number: PCT/CN2017/108750
Authority: WO
Inventors: 陈林; 张国辉
Original assignee: 平安科技（深圳）有限公司
Priority date: 2017-08-17
Filing date: 2017-10-31
Publication date: 2019-02-21
Also published as: CN107679447A

Abstract

A facial feature point detection method, an electronic apparatus and a computer readable storage medium. The method comprises: capturing a real-time image via an imaging apparatus, and extracting a real-time facial image from the real-time image via a human face identification algorithm; inputting the real-time facial image to a pre-trained facial average model, and identifying t facial feature points from the real-time facial image via the facial average model.

Description

Facial feature point detecting method, device and storage medium

Priority claim

This application is based on the priority of the Chinese Patent Application entitled "Face Feature Point Detection Method, Apparatus and Storage Medium" filed on August 17, 2017, with the application number CN 201710709109.6 submitted on August 17, 2017, the overall content of the Chinese patent application It is incorporated herein by reference.

Technical field

The present application relates to the field of computer vision processing technologies, and in particular, to a facial feature point detecting method and apparatus, and a computer readable storage medium.

Background technique

Face recognition is a biometric recognition technology based on human facial feature information for user recognition. At present, face recognition has a wide range of applications, and plays a very important role in many areas such as access control attendance and identity recognition, which brings great convenience to people's lives. Face recognition, the general product approach is to use the deep learning method to train the facial feature point recognition model through deep learning, and then use the facial feature point recognition model to identify facial features.

Face recognition includes facial micro-expression recognition. Micro-expression recognition is widely used in psychology, advertising effect evaluation, human factors engineering and human-computer interaction. Therefore, how to accurately recognize facial micro-expression is very important.

However, the industry can currently detect 5 and 68 feature points. The 5 feature points include two eyeballs, the tip of the nose and the corners of the mouth; 68 feature points do not include the eyeball. For facial micro-expression recognition, the above identification The feature points are not enough.

Summary of the invention

The present application provides a facial feature point detecting method, device and computer readable storage medium, the main purpose of which is to identify a more comprehensive feature point, which can make the face recognition and the facial micro expression judgment more accurate.

To achieve the above object, the present application provides an electronic device, including: a memory, a processor, and an imaging device, wherein the memory includes a facial feature point detecting program, and the facial feature point detecting program is executed by the processor Implement the following steps:

Real-time facial image acquisition step: capturing a real-time image by using a camera device, and extracting a real-time facial image from the real-time image by using a face recognition algorithm;

Feature point identification step: input the real-time facial image into a pre-trained facial average model, and use the facial average model to identify t facial feature points from the real-time facial image.

Preferably, the feature point identification step further comprises:

The real-time facial image is aligned with the facial average model, and the feature extraction algorithm searches for the t facial feature points matching the t facial feature points of the facial average model in the real-time facial image.

Preferably, the training step of the facial average model comprises:

Establishing a sample library with n face sample images, and marking t facial feature points in each face sample image, the t facial feature points including: eyes, eyebrows, nose, mouth, and facial contour a position feature point, wherein the position feature point of the eye includes a position feature point of the eyeball; and

The face feature recognition model is trained by using the face sample image marked with t facial feature points to obtain a face average model for the face feature points.

Preferably, each eyeball marks 4 position feature points.

In addition, to achieve the above object, the present application further provides a facial feature point detecting method, the method comprising:

Preferably, the feature point identification step further comprises:

Preferably, the training step of the facial average model comprises:

Preferably, the face feature recognition model is an ERT algorithm, and the formula is as follows:

Where t represents the cascading sequence number, τ _t (·, ·) represents the current level of the regression, each regression is composed of a number of regression trees, S (t) is the shape estimation of the current model, each regression τ _t (·,·) predicts an increment based on the input current image I and S(t)

In the process of model training, a part of the feature points of each sample picture of n sample pictures is taken to train the first regression tree, and the predicted value of the first regression tree and the part of the feature points are The residual of the true value is used to train the second tree... and so on, until the predicted value of the Nth tree is trained and the true value of the partial feature point is close to 0, and all the regression trees of the ERT algorithm are obtained. A facial average model for facial feature points is obtained from these regression trees.

Preferably, each eyeball marks 4 position feature points.

Preferably, the feature extraction algorithm comprises: a SIFT algorithm, a SURF algorithm, an LBP algorithm, and an HOG algorithm.

In addition, in order to achieve the above object, the present application further provides a computer readable storage medium including a facial feature point detecting program, when the facial feature point detecting program is executed by a processor, implementing the above Any of the steps of the facial feature point detection method described.

The facial feature point detecting method and device and the computer readable storage medium proposed by the present application can identify a feature point more comprehensively by recognizing a plurality of feature points including a position feature point of an eyeball from a real-time facial image. Face recognition and facial micro-expressions are more accurate.

DRAWINGS

1 is a schematic diagram of an operating environment of a preferred embodiment of a facial feature point detecting method of the present application;

2 is a block diagram of a facial feature point detecting program of FIG. 1;

3 is a flow chart of a preferred embodiment of a facial feature point detecting method of the present application.

The implementation, functional features and advantages of the present application will be further described with reference to the accompanying drawings.

Detailed ways

It is understood that the specific embodiments described herein are merely illustrative of the application and are not intended to be limiting.

The application provides a facial feature point detecting method. Referring to FIG. 1 , it is a schematic diagram of an operating environment of a preferred embodiment of a facial feature point detecting method of the present application.

In the embodiment, the facial feature point detecting method is applied to an electronic device 1. The electronic device 1 may be a terminal device having a computing function, such as a server, a smart phone, a tablet computer, a portable computer, or a desktop computer.

The electronic device 1 includes a processor 12, a memory 11, an imaging device 13, a network interface 14, and a communication bus 15. The camera device 13 is installed in a specific place, such as an office place and a monitoring area, and real-time images are taken in real time for the target entering the specific place, and the captured real-time image is transmitted to the processor 12 through the network. Network interface 14 may optionally include a standard wired interface, a wireless interface (such as a WI-FI interface). Communication bus 15 is used to implement connection communication between these components.

The memory 11 includes at least one type of readable storage medium. The at least one type of readable storage medium may be a non-volatile storage medium such as a flash memory, a hard disk, a multimedia card, a card type memory, or the like. In some embodiments, the readable storage medium may be an internal storage unit of the electronic device 1, such as a hard disk of the electronic device 1. In other embodiments, the readable storage medium may also be an external memory of the electronic device 1, such as a plug-in hard disk equipped on the electronic device 1, a smart memory card (SMC), Secure Digital (SD) card, Flash Card, etc.

In the present embodiment, the readable storage medium of the memory 11 is generally used to store the facial feature point detecting program 10 installed on the electronic device 1, the face image sample library, and the constructed and trained facial average model and the like. The memory 11 can also be used to temporarily store the output that has been output or will be output The data.

The processor 12, in some embodiments, may be a Central Processing Unit (CPU), microprocessor or other data processing chip for running program code or processing data stored in the memory 11, such as performing facial features. Point detection program 10, etc.

Figure 1 shows only the electronic device 1 having the components 11-15 and the facial feature point detection program 10, but it should be understood that not all illustrated components may be implemented, and more or fewer components may be implemented instead. .

Optionally, the electronic device 1 may further include a user interface, and the user interface may include an input unit such as a keyboard, a voice input device such as a microphone, a device with a voice recognition function, a voice output device such as an audio, a headphone, and the like. Optionally, the user interface may also include a standard wired interface and a wireless interface.

Optionally, the electronic device 1 may further include a display, which may also be appropriately referred to as a display screen or a display unit. In some embodiments, it may be an LED display, a liquid crystal display, a touch liquid crystal display, an OLED (Organic Light-Emitting Diode) touch sensor, or the like. The display is used to display information processed in the electronic device 1 and a user interface for displaying visualizations.

Optionally, the electronic device 1 further comprises a touch sensor. The area provided by the touch sensor for the user to perform a touch operation is referred to as a touch area. Further, the touch sensor described herein may be a resistive touch sensor, a capacitive touch sensor, or the like. Moreover, the touch sensor includes not only a contact type touch sensor but also a proximity type touch sensor or the like. Furthermore, the touch sensor may be a single sensor or a plurality of sensors arranged, for example, in an array.

In addition, the area of the display of the electronic device 1 may be the same as or different from the area of the touch sensor. Optionally, a display is stacked with the touch sensor to form a touch display. The device detects a user-triggered touch operation based on a touch screen display.

Optionally, the electronic device 1 may further include an RF (Radio Frequency) circuit, a sensor, an audio circuit, and the like, and details are not described herein.

In the apparatus embodiment shown in FIG. 1, an operating system and a facial feature point detecting program 10 may be included in the memory 11 as a computer storage medium; when the processor 12 executes the facial feature point detecting program 10 stored in the memory 11, Implement the following steps:

Real-time facial image acquisition step: a real-time image is captured by the camera device 13, and a real-time facial image is extracted from the real-time image by using a face recognition algorithm. When the camera 13 captures a real-time image, the camera 13 transmits the real-time image to the processor 12. When the processor 12 receives the real-time image, it first acquires the size of the image to create a grayscale image of the same size. Converting the acquired color image into a grayscale image and creating a memory space; equalizing the grayscale image histogram can reduce the amount of grayscale image information to speed up the detection, and then load the training library to detect the image. The face of the face, and return an object containing the face information, obtain the data of the location of the face, and record the number; finally obtain the area of the avatar and save it, thus completing a real-time facial image extraction process.

Specifically, the face recognition algorithm for extracting a real-time facial image from the real-time image may also be: Geometric feature based methods, local feature analysis methods, feature face methods, elastic model based methods, neural network methods, and the like.

Establishing a sample library with n face sample images, and marking t facial feature points in each face sample image, the t facial feature points including: eyes, eyebrows, nose, mouth, and facial contour A position feature point, wherein the position feature point of the eye includes a position feature point of the eyeball. A sample library having n face images is created, and t facial feature points are manually marked in each face image, and the position feature points of the eye include: a position feature point of the eyelid and a position feature point of the eyeball.

The face feature recognition model is trained by using the face sample image marked with t facial feature points to obtain a face average model for the face feature points. The face feature recognition model is an Ensemble of Regression Tress (ERT) algorithm. The ERT algorithm is expressed as follows:

Where t represents the cascading sequence number and τ _t (·, ·) represents the regression of the current stage. Each regression is composed of a number of regression trees, and the purpose of training is to obtain these regression trees.

Where S(t) is the shape estimate of the current model; each regression τ _t (·, ·) predicts an increment based on the input images I and S(t)

Add this increment to the current shape estimate to improve the current model. Each level of regression is based on feature points for prediction. The training data set is: (I1, S1), ..., (In, Sn) where I is the input sample image and S is the shape feature vector composed of the feature points in the sample image.

In the process of model training in this embodiment, each sample picture has 76 face feature points, and part of the feature points of all sample images are taken (for example, 70 features are randomly selected among 76 feature points of each sample image). Point) training the first regression tree, using the residual of the predicted value of the first regression tree and the true value of the partial feature points (weighted average of 70 feature points taken from each sample picture) Training the second tree... and so on, until the predicted value of the Nth tree is trained and the true value of the part of the feature points is close to 0, and all the regression trees of the ERT algorithm are obtained, and the average of the face points is obtained according to the regression trees. The model is saved to the memory 11 and the model file and the sample library.

In this embodiment, since 76 facial feature points are marked in each face sample image in the sample library, there are also 76 facial feature points in the face average model, and the trained facial average model is called from the memory. Aligning the real-time facial image with the facial average model, and then using the feature extraction algorithm to search for 76 facial feature points matching the 76 facial feature points of the facial average model in the real-time facial image, and identifying the recognized facial features The 76 facial feature points are still recorded as P1 to P76, and the coordinates of the 76 facial feature points are: (x ₁ , y ₁ ), (x ₂ , y ₂ ), (x ₃ , y ₃ ), ..., (x ₇₆ , y ₇₆ ).

Among them, the outer contour of the face has 17 feature points (P1 ~ P17, evenly distributed on the outer contour of the face), and the left and right eyebrows respectively have 5 feature points (respectively recorded as P18 ~ P22, P23 ~ P27, evenly distributed in the eyebrows) The upper end), the nose has 9 feature points (P28 ~ P36), the left and right eyelids have 6 feature points (respectively labeled as P37 ~ P42, P43 ~ P48), the left and right eyeballs have 4 feature points (respectively recorded as P49 ~ P52, P53～P56), there are 20 feature points in the lip (P57～P76), and there are 8 feature points on the upper and lower lips of the lip (respectively labeled as P57～P64, P65～P72), respectively. There are 2 feature points (remember separately It is P73 to P74, P75 to P76). Of the 8 feature points of the upper lip, 5 are located on the outer contour line of the upper lip (P57-61), 3 are located on the contour line of the upper lip (P62-P64, P63 is the central feature point on the inner side of the upper lip); 8 of the lower lip Of the feature points, 5 are located on the outer contour line of the lower lip (P65 to P69), and 3 are located in the outline of the lower lip (P70 to P72, and P71 is the central feature point on the inner side of the lower lip). One of the two feature points of the left and right lip angles is located on the outer contour line of the lips (for example, P74 and P76, which can be called outer lip feature points), and one is located on the outer contour line of the lips (for example, P73 and P75, which can be called Inner lip corner feature point).

In this embodiment, the feature extraction algorithm is a SIFT (scale-invariant feature transform) algorithm. The SIFT algorithm extracts the local features of each facial feature point from the facial average model of the facial feature points, selects a facial feature point as the reference feature point, and searches for the same or similar local feature of the reference feature point in the real-time facial image. The feature points (for example, the difference of the local features of the two feature points are within a preset range), according to this principle until all the face feature points are found in the real-time face image. In other embodiments, the feature extraction algorithm may also be a SURF (Speeded Up Robust Features) algorithm, an LBP (Local Binary Patterns) algorithm, a HOG (Histogram of Oriented Gridients) algorithm, or the like.

The electronic device 1 of the present embodiment extracts a real-time facial image from a real-time image, and uses the facial average model to identify a facial feature point in the real-time facial image, and the recognized feature point is more comprehensive, and the face recognition and the face recognition can be performed. The judgment of the facial micro-expression is more accurate.

In other embodiments, facial feature point detection program 10 may also be partitioned into one or more modules, one or more modules being stored in memory 11 and executed by processor 12 to complete the application. A module as referred to in this application refers to a series of computer program instructions that are capable of performing a particular function. Referring to FIG. 2, it is a block diagram of the facial feature point detecting program 10 of FIG. In this embodiment, the facial feature point detecting program 10 can be divided into: an obtaining module 110, an identifying module 120, and a calculating module 130. The functions or operational steps implemented by the modules 110-130 are similar to the above, and are not described in detail herein, by way of example, for example:

The acquiring module 110 is configured to acquire a real-time image captured by the camera device 13 and extract a real-time facial image from the real-time image by using a face recognition algorithm; and

The identification module 120 is configured to input the real-time facial image into a facial average model, and use the facial average model to identify t facial feature points from the real-time facial image.

In addition, the present application also provides a facial feature point detecting method. Referring to FIG. 3, it is a flowchart of a preferred embodiment of the facial feature point detecting method of the present application. The method can be performed by a device that can be implemented by software and/or hardware.

In this embodiment, the facial feature point detecting method includes:

In step S10, a real-time image is captured by the camera device, and a real-time face image is extracted from the real-time image by using a face recognition algorithm. When the camera captures a real-time image, the camera sends the real-time image to the processor. When the processor receives the real-time image, the image is first acquired to create a grayscale image of the same size; Color image, converted into grayscale image, and create a memory space; equalize the grayscale image histogram to make grayscale image Reduce the amount of information to speed up the detection, then load the training library, detect the face in the picture, and return an object containing the face information, obtain the data of the location of the face, and record the number; finally get the area of the avatar And save it, this completes the process of real-time facial image extraction.

Specifically, the face recognition algorithm for extracting the real-time facial image from the real-time image may also be: a geometric feature-based method, a local feature analysis method, a feature face method, an elastic model-based method, a neural network method, and the like.

Step S20: input the real-time facial image into a pre-trained facial average model, and use the facial average model to identify t facial feature points from the real-time facial image.

The face feature recognition model is trained by using the face sample image marked with t facial feature points to obtain a face average model for the face feature points. The face feature recognition model is an ERT algorithm. The ERT algorithm is expressed as follows:

In the process of model training in this embodiment, each sample picture has 76 face feature points, and part of the feature points of all sample images are taken (for example, 70 features are randomly selected among 76 feature points of each sample image). Point) training the first regression tree, using the residual of the predicted value of the first regression tree and the true value of the partial feature points (weighted average of 70 feature points taken from each sample picture) Training the second tree... and so on, until the predicted value of the Nth tree is trained and the true value of the part of the feature points is close to 0, and all the regression trees of the ERT algorithm are obtained, and the average of the face points is obtained according to the regression trees. Model and save the model file and sample library to memory.

Among them, the outer contour of the face has 17 feature points (P1 ~ P17, evenly distributed on the outer contour of the face), and the left and right eyebrows respectively have 5 feature points (respectively recorded as P18 ~ P22, P23 ~ P27, evenly distributed in the eyebrows) The upper end), the nose has 9 feature points (P28 ~ P36), the left and right eyelids have 6 feature points (respectively labeled as P37 ~ P42, P43 ~ P48), the left and right eyeballs have 4 feature points (respectively recorded as P49 ~ P52, P53～P56), there are 20 feature points in the lip (P57～P76), and there are 8 feature points on the upper and lower lips of the lip (respectively labeled as P57～P64, P65～P72), respectively. There are two feature points (respectively labeled as P73 to P74 and P75 to P76). Of the 8 feature points of the upper lip, 5 are located on the outer contour line of the upper lip (P57-61), 3 are located on the contour line of the upper lip (P62-P64, P63 is the central feature point on the inner side of the upper lip); 8 of the lower lip Of the feature points, 5 are located on the outer contour line of the lower lip (P65 to P69), and 3 are located in the outline of the lower lip (P70 to P72, and P71 is the central feature point on the inner side of the lower lip). One of the two feature points of the left and right lip angles is located on the outer contour line of the lips (for example, P74 and P76, which can be called outer lip feature points), and one is located on the outer contour line of the lips (for example, P73 and P75, which can be called Inner lip corner feature point).

In this embodiment, the feature extraction algorithm is a SIFT algorithm. The SIFT algorithm extracts the local features of each facial feature point from the facial average model of the facial feature points, selects a facial feature point as the reference feature point, and searches for the same or similar local feature of the reference feature point in the real-time facial image. The feature points (for example, the difference of the local features of the two feature points are within a preset range), according to this principle until all the face feature points are found in the real-time face image. In other embodiments, the feature extraction algorithm may also be a SURF algorithm, an LBP algorithm, an HOG algorithm, or the like.

The facial feature point detecting method proposed in the embodiment extracts a real-time facial image from a real-time image, and uses the facial average model to identify a facial feature point in the real-time facial image, and the recognized feature point is more comprehensive and can make a face The recognition and facial micro-expressions are more accurate.

In addition, the embodiment of the present application further provides a computer readable storage medium, where the computer readable storage medium includes a facial feature point detecting program, and when the facial feature point detecting program is executed by the processor, the following operations are implemented:

Real-time facial image acquisition step: capturing a real-time image by using a camera device, and extracting a real-time facial image from the real-time image by using a face recognition algorithm; and

Optionally, the training step of the facial average model includes:

Optionally, the face feature recognition model is an ERT algorithm, and the formula is as follows:

The specific implementation manner of the computer readable storage medium of the present application is substantially the same as the specific embodiment of the facial feature point detecting method described above, and details are not described herein again.

It is to be understood that the term "comprises", "comprising", or any other variants thereof, is intended to encompass a non-exclusive inclusion, such that a process, apparatus, article, or method that comprises a series of elements includes those elements. It also includes other elements not explicitly listed, or elements that are inherent to such a process, device, item, or method. An element that is defined by the phrase "comprising a ..." does not exclude the presence of additional equivalent elements in the process, the device, the item, or the method that comprises the element.

The serial numbers of the embodiments of the present application are merely for the description, and do not represent the advantages and disadvantages of the embodiments. Through the description of the above embodiments, those skilled in the art can clearly understand that the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is better. Implementation. Based on such understanding, the technical solution of the present application, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM as described above). , a disk, an optical disk, including a number of instructions for causing a terminal device (which may be a mobile phone, a computer, a server, or a network device, etc.) to perform the methods described in the various embodiments of the present application.

The above is only a preferred embodiment of the present application, and is not intended to limit the scope of the patent application, and the equivalent structure or equivalent process transformations made by the specification and the drawings of the present application, or directly or indirectly applied to other related technical fields. The same is included in the scope of patent protection of this application.

Claims

An electronic device, comprising: a memory, a processor, and an imaging device, wherein the memory includes a facial feature point detecting program, and the facial feature point detecting program is executed by the processor to implement the following steps :

Real-time facial image acquisition step: capturing a real-time image by using a camera device, and extracting a real-time facial image from the real-time image by using a face recognition algorithm;

Feature point identification step: input the real-time facial image into a pre-trained facial average model, and use the facial average model to identify t facial feature points from the real-time facial image.
The electronic device according to claim 1, wherein the feature point identification step further comprises:

The real-time facial image is aligned with the facial average model, and the feature extraction algorithm searches for the t facial feature points matching the t facial feature points of the facial average model in the real-time facial image.
The electronic device according to claim 2, wherein the training step of the facial average model comprises:

Establishing a sample library with n face sample images, and marking t facial feature points in each face sample image, the t facial feature points including: eyes, eyebrows, nose, mouth, and facial contour a position feature point, wherein the position feature point of the eye includes a position feature point of the eyeball; and

The face feature recognition model is trained by using the face sample image marked with t facial feature points to obtain a face average model for the face feature points.
The electronic device according to claim 3, wherein the face feature recognition model is an ERT algorithm, and the formula is as follows:

Where t represents the cascading sequence number, τ t (·, ·) represents the current level of the regression, each regression is composed of a number of regression trees, S (t) is the shape estimation of the current model, each regression τ t (·,·) predicts an increment based on the input current image I and S(t)
In the process of model training, a part of the feature points of each sample picture of n sample pictures is taken to train the first regression tree, and the predicted value of the first regression tree and the part of the feature points are The residual of the true value is used to train the second tree... and so on, until the predicted value of the Nth tree is trained and the true value of the partial feature point is close to 0, and all the regression trees of the ERT algorithm are obtained. A facial average model for facial feature points is obtained from these regression trees.
The electronic device according to claim 2, wherein the feature extraction algorithm comprises: a SIFT algorithm, a SURF algorithm, an LBP algorithm, and an HOG algorithm.
The electronic device according to claim 3, wherein each eyeball marks four position feature points.
The electronic device according to claim 1, wherein the face recognition algorithm comprises: a geometric feature based method, a local feature analyzing method, a feature face method, an elastic model based method, and a neural network method.
A facial feature point detecting method, the application electronic device, wherein the method comprises:

Real-time facial image acquisition step: capturing a real-time image by using a camera device, and extracting a real-time facial image from the real-time image by using a face recognition algorithm;

Feature point identification step: input the real-time facial image into a pre-trained facial average model, and use the facial average model to identify t facial feature points from the real-time facial image.
The facial feature point detecting method according to claim 8, wherein the feature point identifying step further comprises:

The real-time facial image is aligned with the facial average model, and the feature extraction algorithm searches for the t facial feature points matching the t facial feature points of the facial average model in the real-time facial image.
The facial feature point detecting method according to claim 9, wherein the training step of the facial average model comprises:

Establishing a sample library with n face sample images, and marking t facial feature points in each face sample image, the t facial feature points including: eyes, eyebrows, nose, mouth, and facial contour a position feature point, wherein the position feature point of the eye includes a position feature point of the eyeball; and

The face feature recognition model is trained by using the face sample image marked with t facial feature points to obtain a face average mode for the face feature points.
The facial feature point detecting method according to claim 10, wherein the facial feature recognition model is an ERT algorithm, and the formula is as follows:

Where t represents the cascading sequence number, τ t (·, ·) represents the current level of the regression, each regression is composed of a number of regression trees, S (t) is the shape estimation of the current model, each regression τ t (·,·) predicts an increment based on the input current image I and S(t)
In the process of model training, a part of the feature points of each sample picture of n sample pictures is taken to train the first regression tree, and the predicted value of the first regression tree and the part of the feature points are The residual of the true value is used to train the second tree... and so on, until the predicted value of the Nth tree is trained and the true value of the partial feature point is close to 0, and all the regression trees of the ERT algorithm are obtained. A facial average model for facial feature points is obtained from these regression trees.
The facial feature point detecting method according to claim 9, wherein the feature extraction algorithm comprises: a SIFT algorithm, a SURF algorithm, an LBP algorithm, and an HOG algorithm.
The facial feature point detecting method according to claim 10, wherein each eyeball marks four position feature points.
The facial feature point detecting method according to claim 8, wherein the face recognition algorithm comprises: a geometric feature based method, a local feature analyzing method, a feature face method, an elastic model based method, and a neural network method.
A computer readable storage medium, comprising: a facial feature point detecting program, wherein the facial feature point detecting program is executed by a processor to implement the following steps:

Real-time facial image acquisition step: capturing a real-time image by using a camera device, and extracting a real-time facial image from the real-time image by using a face recognition algorithm;

Feature point identification step: input the real-time facial image into a pre-trained facial average model, and use the facial average model to identify t facial feature points from the real-time facial image.
The computer readable storage medium according to claim 15, wherein the feature point identification step further comprises:

The real-time facial image is aligned with the facial average model, and the feature extraction algorithm searches for the t facial feature points matching the t facial feature points of the facial average model in the real-time facial image.
The computer readable storage medium of claim 16, wherein the training step of the facial average model comprises:

Establishing a sample library with n face sample images, and marking t facial feature points in each face sample image, the t facial feature points including: eyes, eyebrows, nose, mouth, and facial contour a position feature point, wherein the position feature point of the eye includes a position feature point of the eyeball; and

The face feature recognition model is trained by using the face sample image marked with t facial feature points to obtain a face average model for the face feature points.
The computer readable storage medium according to claim 17, wherein the face feature recognition model is an ERT algorithm, and the formula is as follows:

Where t represents the cascading sequence number, τ t (·, ·) represents the current level of the regression, each regression is composed of a number of regression trees, S (t) is the shape estimation of the current model, each regression τ t (·,·) predicts an increment based on the input current image I and S(t)
In the process of model training, a part of the feature points of each sample picture of n sample pictures is taken to train the first regression tree, and the predicted value of the first regression tree and the part of the feature points are The residual of the true value is used to train the second tree... and so on, until the predicted value of the Nth tree is trained and the true value of the partial feature point is close to 0, and all the regression trees of the ERT algorithm are obtained. A facial average model for facial feature points is obtained from these regression trees.
The computer readable storage medium according to claim 16, wherein the feature extraction algorithm comprises: a SIFT algorithm, a SURF algorithm, an LBP algorithm, and an HOG algorithm.
A computer readable storage medium according to claim 17, wherein each eyeball marks 4 position feature points.