CN108268838B - Facial expression recognition method and facial expression recognition system - Google Patents

Facial expression recognition method and facial expression recognition system

Info

Publication number
CN108268838B
CN108268838B
Authority
CN
China
Prior art keywords
expression
feature
face
facial
information
Prior art date
Legal status
Active
Application number
CN201810001358.4A
Other languages
Chinese (zh)
Other versions
CN108268838A (en)
Inventor
付璐斯
周盛宗
于志刚
Current Assignee
Fujian Institute of Research on the Structure of Matter of CAS
Original Assignee
Fujian Institute of Research on the Structure of Matter of CAS
Priority date
Filing date
Publication date
Application filed by Fujian Institute of Research on the Structure of Matter of CAS filed Critical Fujian Institute of Research on the Structure of Matter of CAS
Priority to CN201810001358.4A
Publication of CN108268838A
Application granted
Publication of CN108268838B
Legal status: Active

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00: Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10: Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16: Human faces, e.g. facial parts, sketches or expressions
    • G06V40/174: Facial expression recognition
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00: Pattern recognition
    • G06F18/20: Analysing
    • G06F18/21: Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214: Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00: Pattern recognition
    • G06F18/20: Analysing
    • G06F18/24: Classification techniques
    • G06F18/241: Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411: Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00: Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10: Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16: Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168: Feature extraction; Face representation
    • G06V40/171: Local features and components; Facial parts; Occluding parts, e.g. glasses; Geometrical relationships

Abstract

The application discloses a facial expression recognition method comprising the following steps: detecting a human face in an original image; performing face alignment and feature point positioning on the detected face; extracting facial feature information from the face image; and classifying the expression according to the acquired feature data to recognize the facial expression. By chaining face detection, feature point positioning, feature extraction and expression classification to predict the most likely expression, the method and system ensure the accuracy of expression recognition and have broad application prospects.

Description

Facial expression recognition method and facial expression recognition system
Technical Field
The application relates to a facial expression recognition method and a facial expression recognition system, and belongs to the technical field of facial expression recognition.
Background
The generation of human emotion is a very complex psychological process, and the expression of emotion is accompanied by multiple modalities; the three most commonly studied in computing are facial expression, voice, and action. Among these three, facial expression contributes as much as 55% of the conveyed emotion, and as human-computer interaction technology becomes ever more widely applied, facial expression recognition is of great significance to that field. As one of the main research topics in pattern recognition and machine learning, facial expression recognition has attracted a large number of proposed algorithms.
However, facial expression recognition technology also has its weaknesses: 1. inter-person variation: facial expressions differ because different people express emotion differently; 2. intra-person variation: the same person's expression varies in real time with context in daily life; 3. external conditions, such as background, illumination, angle, and distance, strongly influence emotion recognition. All of the above affect the accuracy of facial expression recognition.
Disclosure of Invention
The present application aims to provide a facial expression recognition method and system that achieve accurate recognition of expressions.
In order to achieve the purpose, the invention provides a facial expression recognition method.
The facial expression recognition method is characterized by comprising the following steps:
detecting a human face from an original image;
carrying out face alignment and feature point positioning on the detected face;
extracting facial feature information from the face image;
and carrying out expression classification according to the acquired feature data to realize facial expression recognition.
Face detection means detecting the presence of a human face in original images of various scenes and accurately separating out the face region.
Further, the detecting the human face from the original image includes:
scanning an original image line by line based on a local binary mode to obtain a response image;
adopting an AdaBoost algorithm to carry out face detection on the response image, and detecting the existence of a face;
and performing human eye detection with the AdaBoost algorithm and separating out the face region.
Optionally, during detection with the AdaBoost algorithm, multi-scale detection is performed according to 1.25-0.9.
Further, the performing face alignment and feature point positioning on the detected face includes:
and marking the face characteristic points by adopting a local constraint model.
Optionally, the facial feature points are labeled with the local constraint model; after the feature point coordinates are obtained, regions that reflect the differences among expressions are selected, and two types of features are extracted: deformation-based expression features and motion-based expression features;
feature evaluation is then performed with recursive feature elimination and a linear support vector machine, and feature selection is further applied to the selected features.
Further, the extracting facial feature information from the face image includes:
selecting regions that represent the differences among expressions, and extracting two types of features: deformation-based expression features and motion-based expression features;
and performing feature evaluation with recursive feature elimination and a linear support vector machine, and further performing feature selection on the selected features.
Optionally, the regions representing differences among the expressions include the eyes, the nose tip, the mouth corner points, the eyebrows, and the contour points of the parts of the face.
Further, the extracting facial feature information from the face image further includes: and performing feature selection on the extracted facial feature information, acquiring a facial feature subset, and storing the facial feature information for expression recognition.
Further, the classifying the expressions according to the acquired feature data, and implementing facial expression recognition includes:
selecting samples according to the extracted facial feature information and training an expression classifier with prior knowledge, where each sample carries a corresponding expression label;
and realizing expression classification by adopting a least square rule through an expression classifier.
Further, the classifying the expressions according to the acquired feature data, and implementing facial expression recognition further includes:
and (3) manufacturing a base vector space by using the expression characteristics of the known label, and projecting the characteristics of the expression to be detected to the space to judge the expression category so as to identify the facial expression.
As a specific implementation manner, the facial expression recognition method includes the following steps: (1) detecting a human face from an original image; (2) carrying out face alignment and feature point positioning on the detected face; (3) extracting facial feature information from the face image; (4) and carrying out expression classification according to the acquired feature data to realize facial expression recognition.
Wherein, step (1) further includes: (11) scanning the original image line by line based on the local binary pattern to obtain a response image; (12) performing face detection on the response image with the AdaBoost algorithm to detect the presence of a face; (13) performing human eye detection with the AdaBoost algorithm and separating out the face region.
Further, the AdaBoost algorithm is adopted to carry out face detection or human eye detection, and multi-scale detection is carried out according to 1.25-0.9.
The step (2) further comprises: and marking the face characteristic points by adopting a local constraint model.
The step (3) further comprises: (31) selecting the three main regions of the mouth, eyebrows and eyes, which represent the differences among expressions, and extracting two types of features: deformation-based expression features and motion-based expression features; (32) performing feature evaluation with recursive feature elimination and a linear support vector machine, and further performing feature selection on the selected features.
Further, feature selection is carried out on the extracted facial feature information, a facial feature subset is obtained, and the facial feature information is stored and used for expression recognition.
The step (4) further comprises: (41) selecting samples according to the extracted facial feature information, training an expression classifier by using priori knowledge, wherein each sample corresponds to a corresponding expression label; (42) and realizing expression classification by adopting a least square rule through an expression classifier.
Furthermore, a basis vector space is constructed from expression features with known labels, and the category of the expression under test is determined by projecting its features into this space, thereby performing facial expression recognition.
In another aspect of the present application, a facial expression recognition system is provided, where the system includes: the system comprises a face detection module, a feature point positioning module, a feature extraction module and a facial expression recognition module;
the face detection module is used for detecting a human face from an original image;
the characteristic point positioning module is connected with the face detection module and is used for carrying out face alignment and characteristic point positioning on the detected face;
the feature extraction module is connected with the feature point positioning module and used for extracting facial feature information from the face image;
the facial expression recognition module is connected with the feature extraction module and is configured to predict, with the trained expression classifier and according to the extracted facial feature information, the likelihood of each expression for the facial data to be recognized, and to output the most likely expression category, thereby realizing facial expression recognition.
Optionally, the face detection module scans the original image line by line based on a local binary mode to obtain a response image;
adopting an AdaBoost algorithm to carry out face detection on the response image, and detecting the existence of a face;
and (5) adopting an AdaBoost algorithm to carry out human eye detection and separating out a human face area.
Optionally, the multi-scale detection is performed according to 1.25-0.9 in the detection process by using the AdaBoost algorithm.
Optionally, the feature point positioning module labels the face feature points by using a local constraint model.
Optionally, the feature extraction module selects regions reflecting differences among various expressions, and extracts two types of features of expression features based on deformation and expression features based on movement;
and performing feature evaluation with recursive feature elimination and a linear support vector machine, and further performing feature selection on the selected features.
Optionally, the regions representing the differences between the types of expressions comprise at least one of a mouth, eyebrows, eyes, and nose tips.
Optionally, the feature extraction module performs feature selection on the extracted facial feature information, obtains a facial feature subset, and stores the facial feature information for expression recognition.
Optionally, the facial expression recognition module classifies expressions according to the acquired feature data, and implementing facial expression recognition includes: selecting samples according to the extracted facial feature information, training an expression classifier by using priori knowledge, wherein each sample corresponds to a corresponding expression label;
and realizing expression classification by adopting a least square rule through an expression classifier.
Optionally, the facial expression recognition module constructs a basis vector space from expression features with known labels; the category of the expression under test is determined by projecting its features into this space, thereby performing facial expression recognition.
The beneficial effects that this application can produce include:
the method and the device have the advantages that the maximum possibility of the facial expressions is predicted by face detection, feature point positioning, feature extraction and expression classification, so that the accuracy of expression recognition is guaranteed, and the method and the device have wide application prospects.
Drawings
Fig. 1 is a schematic flow chart of the facial expression recognition method of the present application.
Fig. 2 is a schematic diagram of the architecture of the facial expression recognition system of the present application.
Detailed Description
The present application will be described in detail with reference to examples, but the present application is not limited to these examples.
Example 1
The following describes the facial expression recognition method and system provided by the present invention in detail with reference to the accompanying drawings.
Referring to fig. 1, a flow diagram of a facial expression recognition method according to the present invention is shown. The method comprises the following steps: s11: detecting a human face from an original image; s12: carrying out face alignment and feature point positioning on the detected face; s13: extracting facial feature information from the face image; s14: and carrying out expression classification according to the acquired feature data to realize facial expression recognition. The above steps are described in detail below with reference to the accompanying drawings.
S11: a face is detected from an original image.
Face detection: the presence of a human face is detected in original images of various scenes, and the face region is accurately separated out. As a preferred embodiment, step S11 can be completed by the following steps: 11) scanning the original image line by line based on the local binary pattern to obtain a response image; 12) performing face detection on the response image with the AdaBoost algorithm to detect the presence of a face; 13) performing human eye detection with the AdaBoost algorithm and separating out the face region.
The Local Binary Pattern (LBP) is an effective texture descriptor with excellent power to characterize the local texture features of an image. The LBP operator works like a template operation in filtering: the original image is scanned line by line; for each pixel, its gray value is taken as a threshold and the 8 neighbours of its 3×3 neighbourhood are binarized against it; the binary results are then assembled, in a fixed order, into an 8-bit binary number whose value (0-255) is used as that pixel's response.
Table 1 shows the gray values of a 3×3 region of an original image in one embodiment. For the centre point of the region, the 8 neighbours are binarized with the centre gray value 88 as the threshold, and the binary results, read clockwise starting from the top-left point (the order may be arbitrary but must be consistent), form the binary number 10001011, i.e. 139 in decimal, which is taken as the centre's response. After the whole line-by-line scan is finished, the LBP response image is obtained and can serve as the feature for subsequent work; the gray values of the resulting response image are shown in Table 2.
180  52    5
213  88   79
158  84  156
Table 1. Gray values of a 3×3 region of the original image in one embodiment.
1    0    0
1  139    0
1    0    1
Table 2. The corresponding region of the resulting response image (binarized neighbours, centre response 139).
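To make the operator concrete, the following is a minimal NumPy sketch of the LBP response computation (the function name and the clockwise neighbour order are choices made here for illustration); applied to the 3×3 patch of Table 1, it reproduces the centre response 139 of Table 2.

```python
import numpy as np

def lbp_response(gray):
    """8-neighbour LBP response; neighbours are read clockwise from the
    top-left pixel, matching the example above. Border pixels are skipped."""
    h, w = gray.shape
    out = np.zeros((h, w), dtype=np.uint8)
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]  # clockwise from top-left
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            center = gray[y, x]
            code = 0
            for dy, dx in offsets:
                code = (code << 1) | (1 if gray[y + dy, x + dx] >= center else 0)
            out[y, x] = code
    return out

# The 3x3 patch of Table 1 yields 0b10001011 = 139 at its centre.
patch = np.array([[180, 52, 5], [213, 88, 79], [158, 84, 156]], dtype=np.uint8)
print(lbp_response(patch)[1, 1])  # 139
```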
The AdaBoost algorithm, proposed by Freund and Schapire on the basis of the online allocation algorithm, lets the designer keep adding new weak classifiers until some predetermined, sufficiently small error rate is reached. In AdaBoost, each training sample is assigned a weight characterizing the probability that it is selected into the training set of a component classifier. If a sample has been classified accurately, its probability of being selected decreases when the next training set is constructed; conversely, if a sample is classified incorrectly, its weight increases. Through T rounds of training, AdaBoost thus focuses on the samples that are hard to detect, and the weak classifiers are combined into a strong classifier for target detection.
The AdaBoost algorithm is described as follows:
1) Given a calibrated training sample set $(x_1, y_1), (x_2, y_2), \ldots, (x_L, y_L)$, where $g_j(x_i)$ denotes the $j$-th Haar-like feature of the $i$-th training image, $x_i \in X$ is an input training sample, and $y_i \in Y = \{1, -1\}$ marks true and false samples respectively.
2) Initialize the weights $w_{1,i} = 1/2m$ for true samples and $w_{1,i} = 1/2n$ for false samples, where $m$ and $n$ are the numbers of true and false samples respectively and the total number of samples is $L = m + n$.
3) For each of the $T$ training rounds, $t = 1, 2, \ldots, T$:
Normalize the weights of all samples:

$$w_{t,i} \leftarrow \frac{w_{t,i}}{\sum_{j=1}^{L} w_{t,j}}$$

For the $j$-th Haar-like feature in each sample, a simple classifier is obtained by determining the threshold $\theta_j$ and offset $p_j$ that bring the error $\varepsilon_j$ to its minimum:

$$\varepsilon_j = \sum_i w_i \, \mathbf{1}\{ h_j(x_i) \ne y_i \}, \qquad h_j(x) = \begin{cases} 1, & p_j\, g_j(x) < p_j\, \theta_j \\ -1, & \text{otherwise} \end{cases}$$

The offset $p_j$ determines the direction of the inequality and takes only the values $\pm 1$.
Among these simple classifiers, select the weak classifier $h_t$ with the minimum error $\varepsilon_t$.
4) Update the weights of all samples:

$$w_{t+1,i} = w_{t,i}\, \beta_t^{\,1 - e_i}, \qquad \beta_t = \frac{\varepsilon_t}{1 - \varepsilon_t},$$

where $e_i = 0$ if $x_i$ is correctly classified by $h_t$, and $e_i = 1$ otherwise.
5) The final strong classifier is:

$$H(x) = \operatorname{sign}\!\left( \sum_{t=1}^{T} \alpha_t\, h_t(x) \right),$$

where $\alpha_t = \ln(1/\beta_t)$ weights $h_t$ according to its prediction error.
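As a concrete illustration of the training loop above, here is a minimal sketch using scikit-learn (version 1.2 or later for the estimator argument), with depth-1 decision trees standing in for the threshold-and-parity weak classifiers; the random data is a toy stand-in for Haar-like feature vectors, not data from this application.

```python
import numpy as np
from sklearn.ensemble import AdaBoostClassifier
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))                    # toy Haar-like feature vectors
y = np.where(X[:, 0] + 0.5 * X[:, 1] > 0, 1, -1)  # true/false sample labels

# Depth-1 trees play the role of the simple threshold classifiers h_t;
# n_estimators = T rounds of reweighting yield the strong classifier H(x).
strong = AdaBoostClassifier(
    estimator=DecisionTreeClassifier(max_depth=1),
    n_estimators=50,
)
strong.fit(X, y)
print(strong.score(X, y))                         # training accuracy of H(x)
```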
The human face can thus be detected through the above steps. During detection, multi-scale detection can be performed according to 1.25-0.9, and the detected windows are finally merged to output the result.
On the basis of the detected face, the AdaBoost algorithm is then used for human eye detection. Its basic principle is the same as for face detection and is not repeated here. During human eye detection, multi-scale detection can likewise be performed according to 1.25-0.9, and a rejection mechanism is established (for example, based on characteristics such as the position and size of the eyes).
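For experimentation, the detection stage can be approximated with OpenCV's pretrained boosted cascades, which are trained in the same AdaBoost cascade framework; this sketch is a stand-in for the detectors described above, and the input path and the eyes-in-upper-half rejection rule are illustrative assumptions.

```python
import cv2

# Pretrained cascades shipped with OpenCV stand in for the trained detectors.
face_cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")
eye_cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_eye.xml")

img = cv2.imread("input.jpg")  # hypothetical input image
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

# scaleFactor mirrors the multi-scale step; minNeighbors merges windows.
faces = face_cascade.detectMultiScale(gray, scaleFactor=1.25, minNeighbors=3)
for (x, y, w, h) in faces:
    roi = gray[y:y + h, x:x + w]
    eyes = eye_cascade.detectMultiScale(roi, scaleFactor=1.25, minNeighbors=3)
    # Simple rejection rule: keep the face only if an eye lies in its upper half.
    if any(ey + eh / 2.0 < h / 2.0 for (ex, ey, ew, eh) in eyes):
        print("face region:", (x, y, w, h))
```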
S12: and carrying out face alignment and feature point positioning on the detected face.
Feature point positioning: key feature points of the face, such as the eyes, the nose tip, the mouth corner points, the eyebrows, and the contour points of the facial parts, are automatically located in the input face image. As a preferred embodiment, step S12 can be completed by labeling the facial feature points with the local constraint model.
The local constraint model (CLM) detects facial feature points by initializing the positions of an average face and then letting each feature point of the average face search its neighbourhood for the best match. The whole process is divided into two stages: a model building stage and a point fitting stage. The model building stage in turn constructs two different models: the shape model and the patch model. Shape model construction models the shape of the face and describes the rules that shape variation follows. The patch model models the neighbourhood around each feature point and establishes a feature point matching criterion for judging the best match of a feature point.
The local constraint model (CLM) algorithm is described as follows:
1) shape model construction
The average shape of all aligned face samples in the training set is computed. Suppose there are $M$ images, each with $N$ feature points, the coordinates of a feature point being $(x_i, y_i)$. The vector formed by the $N$ feature point coordinates of one image is written $x = [x_1\ y_1\ x_2\ y_2\ \cdots\ x_N\ y_N]^T$, and the average face over all images is

$$\bar{x} = \frac{1}{M} \sum_{i=1}^{M} x_i.$$
Subtracting the average face from the shape vector of each sample image gives the zero-mean shape variation matrix

$$X = \left[\, x_1 - \bar{x},\; x_2 - \bar{x},\; \ldots,\; x_M - \bar{x} \,\right].$$
The principal components of the variation in face shape are obtained by a PCA transform of the matrix $X$, i.e. by determining the main eigenvalues $\lambda_i$ and corresponding eigenvectors $p_i$. Since the eigenvectors of the larger eigenvalues generally carry the main information of the samples, the eigenvectors of the $k$ largest eigenvalues are selected to form the orthogonal matrix $P = (p_1, p_2, \ldots, p_k)$.
The weight vector of the shape variation is $b = (b_1, b_2, \ldots, b_k)^T$, each component giving the magnitude of the variation along the corresponding eigenvector:

$$b = P^{T}(x - \bar{x}).$$
The sample shape vector of any face test image can then be expressed as

$$x \approx \bar{x} + P\, b.$$
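A minimal NumPy sketch of this shape model, assuming the landmark vectors are already aligned (the function names and the 68-point landmark count are illustrative choices):

```python
import numpy as np

def build_shape_model(shapes, k):
    """shapes: (M, 2N) array, each row [x1 y1 ... xN yN] of N aligned landmarks.
    Returns the mean shape and the k principal deformation directions."""
    mean = shapes.mean(axis=0)
    X = shapes - mean                        # zero-mean shape variation matrix
    cov = X.T @ X / len(shapes)
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order
    P = eigvecs[:, ::-1][:, :k]              # eigenvectors of the k largest
    return mean, P

def project(shape, mean, P):
    b = P.T @ (shape - mean)                 # shape variation weight vector b
    return mean + P @ b                      # reconstruction x ~ mean + P b

rng = np.random.default_rng(1)
shapes = rng.normal(size=(100, 2 * 68))      # 100 samples of 68 landmarks
mean, P = build_shape_model(shapes, k=10)
print(project(shapes[0], mean, P).shape)     # (136,)
```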
2) patch model construction
Suppose there are $M$ face images in the training sample. On each image, $N$ key facial feature points are selected; around each feature point a patch region of fixed size is cropped and, since it contains the feature point, labeled a positive sample; patches of the same size are then cut from non-feature-point regions and labeled negative samples.
Assuming each feature point has $r$ patches in total, they form the vector $(x^{(1)}, x^{(2)}, \ldots, x^{(r)})^T$ for each image in the sample set. The output then takes only positive and negative values, according to whether the patch is a feature point region or a non-feature-point region: $y^{(i)} \in \{-1, 1\}$, $i = 1, 2, \ldots, r$, where $y^{(i)} = 1$ marks a positive sample and $y^{(i)} = -1$ a negative sample. The trained linear support vector machine is

$$f(x) = \sum_{i=1}^{M_s} \alpha_i\, x_i^{T} x + b,$$

where the $x_i$ are the subspace vectors of the sample set, i.e. the support vectors, $\alpha_i$ is a weight coefficient, $M_s$ is the number of support vectors per feature point, and $b$ is the offset. From this one obtains

$$y^{(i)} = w^{T} \cdot x^{(i)} + \theta,$$

where $w^T = [w_1\ w_2\ \cdots\ w_n]$ holds the weight coefficients of the support vectors and $\theta$ is the offset. A patch model is thus established for each feature point.
3) Point fitting
A similarity response map, denoted $R(X, Y)$, is generated for each feature point by performing a local search within a bounded region around the currently estimated feature point location.
A quadratic function is then fitted to the response map. Suppose $R(X, Y)$ attains its maximum at $(x_0, y_0)$ within the neighbourhood; a function is fitted at this position so that its peak corresponds one-to-one with the maximum of $R(X, Y)$. The quadratic function can be written

$$r(x, y) = a(x - x_0)^2 + b(y - y_0)^2 + c,$$

where $a$, $b$ and $c$ are the coefficients of the quadratic, solved by minimizing the error between $r(x, y)$ and $R(x, y)$, i.e. by the least-squares problem

$$\min_{a,b,c} \sum_{x,y} \big( R(x, y) - r(x, y) \big)^2.$$

With the parameters $a$, $b$ and $c$ determined, $r(x, y)$ is the objective cost function for the feature point location; a deformation-constraint cost function is then added to form the objective function for the feature point search (a typical form sums the fitted costs over all feature points plus a regularization term on the shape weights $b$, penalizing deviations from the shape model).
Each optimization of the objective function yields new feature point positions, which are updated iteratively until the maximum converges, completing the facial point fitting.
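The least-squares solve for $a$, $b$ and $c$ reduces to a small linear system; the sketch below fits the quadratic to a response patch and, on a synthetic map (an illustrative stand-in for a real response map), recovers its own coefficients.

```python
import numpy as np

def fit_quadratic_peak(R):
    """Fit r(x,y) = a(x-x0)^2 + b(y-y0)^2 + c to a response patch R,
    taking (x0, y0) at the maximum of R as described above."""
    ys, xs = np.indices(R.shape)
    y0, x0 = np.unravel_index(np.argmax(R), R.shape)
    A = np.column_stack([(xs - x0).ravel() ** 2,
                         (ys - y0).ravel() ** 2,
                         np.ones(R.size)])
    (a, b, c), *_ = np.linalg.lstsq(A, R.ravel(), rcond=None)
    return a, b, c, (x0, y0)

# Synthetic response map peaked at (x0, y0) = (7, 5).
ys, xs = np.indices((11, 11))
R = -0.5 * (xs - 7) ** 2 - 0.3 * (ys - 5) ** 2 + 2.0
print(fit_quadratic_peak(R))  # approx (-0.5, -0.3, 2.0, (7, 5))
```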
S13: facial feature information is extracted from the face image.
Feature extraction: representative feature information of the face is extracted from the normalized face image. As a preferred embodiment, step S13 can be completed by the following steps: (31) selecting the three main regions of the mouth, eyebrows and eyes, which represent the differences among expressions, and extracting two types of features: deformation-based expression features and motion-based expression features; (32) performing feature evaluation with recursive feature elimination and a linear support vector machine, and further performing feature selection on the selected features.
The facial feature points are labeled with the local constraint model and the feature point coordinates obtained; the shape features of the three main regions, the mouth, the eyebrows and the eyes, are selected, the slope information between the key points within these regions is calculated, and the deformation-based expression features are extracted. At the same time, the key points in the three regions are tracked and the corresponding displacement information extracted; the distance information between specific feature points of the expression picture is extracted and the corresponding distances in a calm (neutral) picture subtracted, giving the change information of the distances, from which the motion-based expression features are extracted.
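A small sketch of the two geometric feature types just described; the landmark names, coordinates and point pairs are illustrative placeholders rather than the application's exact point set.

```python
import numpy as np

def deformation_features(pts):
    """Slopes between key landmark pairs (deformation-based features)."""
    def slope(p, q):
        dx = q[0] - p[0]
        return (q[1] - p[1]) / dx if dx != 0 else np.inf
    return np.array([
        slope(pts["mouth_left"], pts["mouth_right"]),
        slope(pts["brow_inner"], pts["brow_outer"]),
        slope(pts["eye_left"], pts["eye_right"]),
    ])

def motion_features(pts_expr, pts_neutral, pairs):
    """Distance changes relative to the calm (neutral) frame (motion-based)."""
    def dist(pts, a, b):
        return float(np.hypot(pts[a][0] - pts[b][0], pts[a][1] - pts[b][1]))
    return np.array([dist(pts_expr, a, b) - dist(pts_neutral, a, b)
                     for a, b in pairs])

neutral = {"mouth_left": (30, 60), "mouth_right": (50, 60),
           "brow_inner": (35, 20), "brow_outer": (25, 22),
           "eye_left": (30, 30), "eye_right": (50, 30)}
smiling = {**neutral, "mouth_left": (28, 58), "mouth_right": (52, 58)}
pairs = [("mouth_left", "mouth_right"), ("eye_left", "mouth_left")]
print(deformation_features(smiling))
print(motion_features(smiling, neutral, pairs))
```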
Feature evaluation is performed with recursive feature elimination and a linear support vector machine: the weight values calculated by the support vector machine serve as the ranking criterion, and the selected features are further de-noised.
The feature selection algorithm is described as follows:
Input: training sample set $\{(x_i, y_i)\}_{i=1}^{N}$, with $l$ the number of categories.
Output: feature ranking set $R$.
1) Initialize the original feature set $S = \{1, 2, \ldots, D\}$ and the feature ranking set $R = [\,]$.
2) Generate the $l(l-1)/2$ training subsets: among the training samples, form all pairwise combinations of different categories to obtain the final training subsets.
Loop over the following process until $S = [\,]$:
3) Take the $l(l-1)/2$ training subsamples $X_j$, $j = 1, 2, \ldots, l(l-1)/2$;
train a support vector machine on each $X_j$, obtaining the weight vectors $w_j$;
calculate the ranking criterion score for each feature $k$,

$$c_k = \sum_j (w_{j,k})^2;$$

find the feature with the minimum ranking criterion score, $p = \arg\min_k c_k$;
update the feature ranking set $R = \{p\} \cup R$;
and remove this feature from $S$: $S = S \setminus \{p\}$.
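scikit-learn's RFE class implements the same remove-the-lowest-ranked-feature loop, scoring features by the squared weights of a linear SVM; a binary-class sketch on synthetic data (all parameter values here are illustrative):

```python
import numpy as np
from sklearn.feature_selection import RFE
from sklearn.svm import LinearSVC

rng = np.random.default_rng(2)
X = rng.normal(size=(300, 20))                 # 300 samples, 20 candidate features
y = (X[:, 3] + X[:, 7] - X[:, 12] > 0).astype(int)

# RFE scores features by the squared weights of a linear SVM and repeatedly
# eliminates the lowest-ranked feature, as in the loop above.
selector = RFE(LinearSVC(C=1.0, max_iter=10000),
               n_features_to_select=5, step=1)
selector.fit(X, y)
print(np.where(selector.support_)[0])          # surviving feature indices
print(selector.ranking_)                       # 1 = kept; higher = removed earlier
```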
S14: and carrying out expression classification according to the acquired feature data to realize facial expression recognition.
Classification: human expressions are roughly divided into seven categories: happiness, anger, sadness, disgust, surprise, fear, and neutrality. As a preferred embodiment, step S14 can be completed by the following steps: (41) selecting samples according to the extracted facial feature information and training an expression classifier with prior knowledge, where each sample carries a corresponding expression label; (42) performing expression classification with the expression classifier using the least squares rule.
Training an expression classifier: and training the extracted facial features by using a support vector machine algorithm, and obtaining an expression classifier after the training is finished.
Support Vector Machine (SVM) algorithm description:
Input training set $T = \{(x_1, y_1), (x_2, y_2), \ldots, (x_N, y_N)\}$,
where $x_i \in R^D$, $y_i \in \{+1, -1\}$; $x_i$ is the $i$-th sample, $N$ is the sample size, and $D$ is the number of sample features. The SVM finds the optimal classification hyperplane $w \cdot x + b = 0$.
The optimization problem the SVM needs to solve is:

$$\min_{w, b, \xi}\ \frac{1}{2} \|w\|^2 + C \sum_{i=1}^{N} \xi_i$$
$$\text{s.t.}\quad y_i (w \cdot x_i + b) \ge 1 - \xi_i, \quad i = 1, 2, \ldots, N$$
$$\xi_i \ge 0, \quad i = 1, 2, \ldots, N$$

The original problem can be converted into its dual problem:

$$\min_{\alpha}\ \frac{1}{2} \sum_{i=1}^{N} \sum_{j=1}^{N} \alpha_i \alpha_j y_i y_j (x_i \cdot x_j) - \sum_{i=1}^{N} \alpha_i$$
$$\text{s.t.}\quad \sum_{i=1}^{N} \alpha_i y_i = 0, \quad 0 \le \alpha_i \le C, \quad i = 1, 2, \ldots, N,$$

where the $\alpha_i$ are Lagrange multipliers.
The final solution for $w$ is:

$$w = \sum_{i=1}^{N} \alpha_i y_i x_i$$

The discriminant function of the SVM is:

$$f(x) = \operatorname{sign}\!\left( \sum_{i=1}^{N} \alpha_i y_i (x_i \cdot x) + b \right)$$
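A minimal training-and-prediction sketch with scikit-learn's SVC, whose linear kernel solves exactly this soft-margin problem and handles the seven expression categories by one-vs-one voting; the feature vectors and integer labels are synthetic placeholders.

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(3)
X_train = rng.normal(size=(140, 12))       # placeholder geometric feature vectors
y_train = rng.integers(0, 7, size=140)     # 0..6 ~ happiness ... neutrality

# A linear-kernel SVC solves the soft-margin problem above; multi-class
# classification is handled internally by one-vs-one voting.
clf = SVC(kernel="linear", C=1.0)
clf.fit(X_train, y_train)

x_test = rng.normal(size=(1, 12))
print(clf.predict(x_test))                 # predicted expression label
```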
Expression classification: the extracted facial feature information is input into the trained classifier, which gives the expression prediction value. That is, by applying the least squares rule, the best functional match to the data is found by minimizing the sum of squared errors. This completes one full facial expression recognition pass.
Referring to fig. 2, an architecture of a facial expression recognition system according to the present invention is schematically illustrated; the system comprises: a face detection module 21, a feature point positioning module 22, a feature extraction module 23 and a facial expression recognition module 24.
The face detection module 21 is configured to detect a face from an original image. The face detection module 21 may scan the original image line by line based on a local binary mode to obtain a response image; then, adopting an AdaBoost algorithm to carry out face detection on the response image, and detecting the existence of a face; and then, adopting an AdaBoost algorithm to carry out human eye detection, and separating out a human face area. The specific implementation of face detection refers to the aforementioned method flow, and is not described herein again.
The feature point positioning module 22 is connected to the face detection module 21, and is configured to perform face alignment and feature point positioning on the detected face. And marking the face characteristic points by adopting a local constraint model, and positioning key characteristic points of the face, such as eyes, nose tips, corner points of the mouth, eyebrows and contour points of all parts of the face. The specific implementation of feature point positioning refers to the aforementioned method flow, and is not described herein again.
The feature extraction module 23 is connected to the feature point positioning module 22, and is configured to extract facial feature information from a face image. The feature extraction module 23 may extract two types of features of expression features based on deformation and expression features based on movement by selecting three main regions, i.e., mouth, eyebrow, and eye, that represent differences among various expressions; and then, performing feature evaluation by adopting recursive feature elimination and a linear vector machine, and further performing feature selection on the selected features. And in the characteristic extraction stage, the extracted facial characteristic information is subjected to characteristic selection, a facial characteristic subset is obtained, and the facial characteristic information is stored and used for expression recognition. The specific implementation manner refers to the aforementioned method flow, and is not described herein again.
The facial expression recognition module 24 is connected to the feature extraction module 23 and is configured to classify expressions according to the acquired feature data, thereby implementing facial expression recognition. The facial expression recognition module 24 may select samples according to the extracted facial feature information and train an expression classifier with prior knowledge, each sample carrying a corresponding expression label; expression classification is then realized by the expression classifier using the least squares rule. The classification process constructs a basis vector space from expression features with known labels, and the category of the expression under test is determined by projecting its features into this space, thereby recognizing the facial expression. The specific implementation refers to the aforementioned method flow and is not repeated here.
Embodiment 2 facial expression recognition method
The method for recognizing the facial expressions in the embodiment comprises the following steps:
step 11: detecting a human face from an original image;
in this step, a specific embodiment includes step 101, step 102, and step 103.
Step 101: and scanning the original image line by line based on the local binary mode to obtain a response image.
Step 102: and adopting an AdaBoost algorithm to detect the human face of the response image, and detecting the existence of the human face.
Step 103: performing human eye detection with the AdaBoost algorithm and separating out the face region.
In a specific mode, the AdaBoost algorithm is adopted to carry out human face detection or human eye detection, and multi-scale detection is carried out according to 1.25-0.9.
Step 12: carrying out face alignment and feature point positioning on the detected face;
in this step, a specific implementation manner is: and marking the face characteristic points by adopting a local constraint model.
Step 13: extracting facial feature information from the face image;
in this step, a specific embodiment includes step 301 and step 302.
Step 301: selecting three main areas of a mouth, eyebrows and eyes which represent differences among various expressions, and extracting two types of characteristics of expression characteristics based on deformation and expression characteristics based on movement;
In this step, another specific embodiment is: selecting the main regions that represent the differences among expressions, namely the eyes, the nose tip, the mouth corner points, the eyebrows, and the contour points of the parts of the face, and extracting two types of features: deformation-based expression features and motion-based expression features;
step 302: performing feature evaluation with recursive feature elimination and a linear support vector machine, and further performing feature selection on the selected features.
In a specific implementation mode, feature selection is carried out on the extracted facial feature information, facial feature subsets are obtained, and the facial feature information is stored and used for expression recognition.
Step 14: and carrying out expression classification according to the acquired feature data to realize facial expression recognition.
In this step, a specific embodiment includes step 401 and step 402.
Step 401: selecting samples according to the extracted facial feature information, training an expression classifier by using priori knowledge, wherein each sample corresponds to a corresponding expression label;
step 402: and realizing expression classification by adopting a least square rule through an expression classifier.
In a specific implementation mode, a base vector space is manufactured by using expression features of known labels, and the expression category is judged by projecting the features of the expression to be detected to the space, so that facial expression recognition is performed.
The various algorithms involved in this example are the same as those in example 1.
Embodiment 3 facial expression recognition system
The facial expression recognition system in the embodiment includes: the system comprises a face detection module, a feature point positioning module, a feature extraction module and a facial expression recognition module;
the face detection module is used for detecting a human face from an original image;
in a specific implementation manner, the face detection module scans an original image line by line based on a local binary mode to obtain a response image;
adopting an AdaBoost algorithm to carry out face detection on the response image, and detecting the existence of a face;
and (5) adopting an AdaBoost algorithm to carry out human eye detection and separating out a human face area.
In a specific embodiment, the multi-scale detection is performed according to 1.25-0.9 in the detection process by using the AdaBoost algorithm.
The characteristic point positioning module is connected with the face detection module and is used for carrying out face alignment and characteristic point positioning on the detected face;
in a specific embodiment, the feature point positioning module labels the face feature points by using a local constraint model.
The feature extraction module is connected with the feature point positioning module and used for extracting facial feature information from the face image;
in a specific embodiment, the feature extraction module selects regions reflecting differences among various expressions, and extracts two types of features of expression features based on deformation and expression features based on movement;
and performing feature evaluation with recursive feature elimination and a linear support vector machine, and further performing feature selection on the selected features.
In a specific embodiment, the regions that represent the differences among expressions include the eyes, the nose tip, the mouth corner points, the eyebrows, and the contour points of each part of the human face.
The facial expression recognition module is connected with the feature extraction module and used for predicting the maximum possibility of facial expression data to be recognized through a trained expression classifier according to the extracted facial feature information, finding out the expression category with the highest possibility and realizing facial expression recognition;
in a particular embodiment: the facial expression recognition module classifies expressions according to the acquired feature data, and the facial expression recognition comprises the following steps: selecting samples according to the extracted facial feature information, training an expression classifier by using priori knowledge, wherein each sample corresponds to a corresponding expression label;
and realizing expression classification by adopting a least square rule through an expression classifier.
In a specific embodiment, the facial expression recognition module further uses expression features of known labels to construct a basis vector space; the category of the expression under test is determined by projecting its features into this space, thereby recognizing the facial expression.
The various algorithms involved in this example are the same as those in example 1.
Although the present application has been described with reference to a few embodiments, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the application as defined by the appended claims.

Claims (11)

1. A facial expression recognition method is characterized by comprising the following steps:
detecting a human face from an original image;
carrying out face alignment and feature point positioning on the detected face;
extracting facial feature information from the face image;
according to the acquired feature data, performing expression classification to realize facial expression recognition;
the detecting of the human face from the original image comprises:
scanning an original image line by line based on a local binary mode to obtain a response image;
adopting an AdaBoost algorithm to carry out face detection on the response image, and detecting the existence of a face;
adopting an AdaBoost algorithm to carry out human eye detection and separating out a human face area;
the extracting of the facial feature information from the face image includes:
selecting areas which represent differences among various expressions, and extracting two types of characteristics of expression characteristics based on deformation and expression characteristics based on movement;
performing feature evaluation by adopting recursive feature elimination and a linear support vector machine, and further performing feature selection on the selected features by adopting the weight value calculated by the support vector machine as a sorting criterion;
selecting regions reflecting differences among various expressions, and extracting two types of characteristics of expression characteristics based on deformation and expression characteristics based on movement comprises the following steps: selecting three areas of a mouth, eyebrows and eyes, calculating correlation slope information among key points in the three areas, and extracting expression characteristics based on deformation according to the correlation slope information; tracking key points in the three regions, extracting corresponding displacement information, extracting distance information between specific feature points of the face image, subtracting the distance information from the distance between the specific feature points in the calm picture to obtain distance change information between the specific feature points, and extracting expression features based on movement according to the distance change information;
the face alignment and feature point positioning of the detected face comprises: and initializing the positions of the average faces by adopting a local constraint model, and then searching and matching the feature points on each average face in the neighborhood position to complete the detection and labeling of the face feature points.
2. The method of claim 1, wherein the multi-scale detection is performed according to 1.25-0.9 during the detection process by using the AdaBoost algorithm.
3. The method of claim 1, wherein the regions representing differences between the types of expressions comprise the eyes, the nose tip, the mouth corner points, the eyebrows, and contour points of each part of the human face.
4. The method of claim 1, wherein extracting facial feature information from the face image further comprises: and performing feature selection on the extracted facial feature information, acquiring a facial feature subset, and storing the facial feature information for expression recognition.
5. The method of claim 1, wherein the performing expression classification according to the acquired feature data to realize facial expression recognition comprises:
selecting samples according to the extracted facial feature information, training an expression classifier by using priori knowledge, wherein each sample corresponds to a corresponding expression label;
and realizing expression classification by adopting a least square rule through an expression classifier.
6. The method of claim 5, wherein the classifying the expression according to the acquired feature data to realize facial expression recognition further comprises:
and (3) manufacturing a base vector space by using the expression characteristics of the known label, and projecting the characteristics of the expression to be detected to the space to judge the expression category so as to identify the facial expression.
7. A system for facial expression recognition, the system comprising: the system comprises a face detection module, a feature point positioning module, a feature extraction module and a facial expression recognition module;
the face detection module is used for detecting a face from an original image;
the characteristic point positioning module is connected with the face detection module and is used for carrying out face alignment and characteristic point positioning on the detected face;
the feature extraction module is connected with the feature point positioning module and used for extracting facial feature information from the face image;
the facial expression recognition module is connected with the feature extraction module and used for predicting the maximum possibility of facial expression data to be recognized through a trained expression classifier according to the extracted facial feature information, finding out the expression category with the highest possibility and realizing facial expression recognition;
the face detection module scans an original image line by line based on a local binary mode to obtain a response image;
adopting an AdaBoost algorithm to carry out face detection on the response image, and detecting the existence of a face;
adopting an AdaBoost algorithm to carry out human eye detection and separating out a human face area;
the feature extraction module selects regions reflecting differences among various expressions, and extracts two types of features of expression features based on deformation and expression features based on movement;
performing feature evaluation by adopting recursive feature elimination and a linear support vector machine, and further performing feature selection on the selected features by adopting the weight value calculated by the support vector machine as a sorting criterion;
the feature extraction module selects three areas, namely a mouth area, an eyebrow area and an eye area, calculates the correlation slope information among key points in the three areas, and extracts expression features based on deformation according to the correlation slope information; tracking key points in the three regions, extracting corresponding displacement information, extracting distance information between specific feature points of the face image, subtracting the distance information from the distance between the specific feature points in the calm picture to obtain distance change information between the specific feature points, and extracting expression features based on movement according to the distance change information;
the feature point positioning module initializes the positions of the average faces by adopting a local constraint model, and then searches and matches feature points on each average face in the neighborhood position of the feature points to complete the detection and labeling of the face feature points.
8. The system according to claim 7, wherein the regions representing differences between the types of expressions comprise the eyes, the nose tip, the mouth corner points, the eyebrows, and contour points of each part of the human face.
9. The system of claim 7, wherein the feature extraction module performs feature selection on the extracted facial feature information, obtains a facial feature subset, and stores the facial feature information for expression recognition.
10. The system of claim 7, wherein the facial expression recognition module performs expression classification according to the acquired feature data, and implementing facial expression recognition comprises: selecting samples according to the extracted facial feature information, training an expression classifier by using priori knowledge, wherein each sample corresponds to a corresponding expression label;
and realizing expression classification by adopting a least square rule through an expression classifier.
11. The system of claim 10, wherein the facial expression recognition module uses expression features of known labels to create a base vector space, and the expression to be tested determines the expression category by projecting the features of the expression to the space, so as to perform facial expression recognition.
CN201810001358.4A 2018-01-02 2018-01-02 Facial expression recognition method and facial expression recognition system Active CN108268838B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810001358.4A CN108268838B (en) 2018-01-02 2018-01-02 Facial expression recognition method and facial expression recognition system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810001358.4A CN108268838B (en) 2018-01-02 2018-01-02 Facial expression recognition method and facial expression recognition system

Publications (2)

Publication Number Publication Date
CN108268838A CN108268838A (en) 2018-07-10
CN108268838B true CN108268838B (en) 2020-12-29

Family

ID=62773093

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810001358.4A Active CN108268838B (en) 2018-01-02 2018-01-02 Facial expression recognition method and facial expression recognition system

Country Status (1)

Country Link
CN (1) CN108268838B (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109409273A (en) * 2018-10-17 2019-03-01 中联云动力(北京)科技有限公司 A kind of motion state detection appraisal procedure and system based on machine vision
CN109712144A (en) * 2018-10-29 2019-05-03 百度在线网络技术(北京)有限公司 Processing method, training method, equipment and the storage medium of face-image
CN109829363A (en) * 2018-12-18 2019-05-31 深圳壹账通智能科技有限公司 Expression recognition method, device, computer equipment and storage medium
CN113302619B (en) * 2018-12-27 2023-11-14 浙江大华技术股份有限公司 System and method for evaluating target area and characteristic points
CN109948672A (en) * 2019-03-05 2019-06-28 张智军 A kind of wheelchair control method and system
CN109948541A (en) * 2019-03-19 2019-06-28 西京学院 A kind of facial emotion recognition methods and system
CN110166836B (en) * 2019-04-12 2022-08-02 深圳壹账通智能科技有限公司 Television program switching method and device, readable storage medium and terminal equipment
CN110020638B (en) * 2019-04-17 2023-05-12 唐晓颖 Facial expression recognition method, device, equipment and medium
CN110059650A (en) * 2019-04-24 2019-07-26 京东方科技集团股份有限公司 Information processing method, device, computer storage medium and electronic equipment
CN110348899A (en) * 2019-06-28 2019-10-18 广东奥园奥买家电子商务有限公司 A kind of commodity information recommendation method and device
CN110334643B (en) * 2019-06-28 2023-05-23 知鱼智联科技股份有限公司 Feature evaluation method and device based on face recognition
CN110941993A (en) * 2019-10-30 2020-03-31 东北大学 Dynamic personnel classification and storage method based on face recognition
CN111144374B (en) * 2019-12-31 2023-10-13 泰康保险集团股份有限公司 Facial expression recognition method and device, storage medium and electronic equipment
CN112307942A (en) * 2020-10-29 2021-02-02 广东富利盛仿生机器人股份有限公司 Facial expression quantitative representation method, system and medium
CN112132117A (en) * 2020-11-16 2020-12-25 黑龙江大学 Fusion identity authentication system assisting coercion detection

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1794265A (en) * 2005-12-31 2006-06-28 北京中星微电子有限公司 Method and device for distinguishing face expression based on video frequency
CN1996344A (en) * 2006-12-22 2007-07-11 北京航空航天大学 Method for extracting and processing human facial expression information
US20130163829A1 (en) * 2011-12-21 2013-06-27 Electronics And Telecommunications Research Institute System for recognizing disguised face using gabor feature and svm classifier and method thereof
CN105069447A (en) * 2015-09-23 2015-11-18 河北工业大学 Facial expression identification method
CN106919884A (en) * 2015-12-24 2017-07-04 北京汉王智远科技有限公司 Human facial expression recognition method and device

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9639743B2 (en) * 2013-05-02 2017-05-02 Emotient, Inc. Anonymization of facial images
CN104021384B (en) * 2014-06-30 2018-11-27 深圳中智科创机器人有限公司 A kind of face identification method and device
CN104268580B (en) * 2014-10-15 2017-09-05 南京大学 A kind of class caricature laying out images management method based on scene classification
CN104951743A (en) * 2015-03-04 2015-09-30 苏州大学 Active-shape-model-algorithm-based method for analyzing face expression
US10769255B2 (en) * 2015-11-11 2020-09-08 Samsung Electronics Co., Ltd. Methods and apparatuses for adaptively updating enrollment database for user authentication
CN106022391A (en) * 2016-05-31 2016-10-12 哈尔滨工业大学深圳研究生院 Hyperspectral image characteristic parallel extraction and classification method
CN106407958B (en) * 2016-10-28 2019-12-27 南京理工大学 Face feature detection method based on double-layer cascade
CN106934375A (en) * 2017-03-15 2017-07-07 中南林业科技大学 The facial expression recognizing method of distinguished point based movement locus description

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1794265A (en) * 2005-12-31 2006-06-28 北京中星微电子有限公司 Method and device for distinguishing face expression based on video frequency
CN1996344A (en) * 2006-12-22 2007-07-11 北京航空航天大学 Method for extracting and processing human facial expression information
US20130163829A1 (en) * 2011-12-21 2013-06-27 Electronics And Telecommunications Research Institute System for recognizing disguised face using gabor feature and svm classifier and method thereof
CN105069447A (en) * 2015-09-23 2015-11-18 河北工业大学 Facial expression identification method
CN106919884A (en) * 2015-12-24 2017-07-04 北京汉王智远科技有限公司 Human facial expression recognition method and device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Paul Viola et al.; "Rapid Object Detection using a Boosted Cascade of Simple Features"; Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 14 December 2001; vol. 1; p. I-516, Results, part 3 *
Ma Fei; "Research on Expression Recognition Based on Geometric Features"; China Master's Theses Full-text Database, Information Science and Technology; 15 December 2006 (No. 12); pp. 17-20 of the main text *

Also Published As

Publication number Publication date
CN108268838A (en) 2018-07-10

Similar Documents

Publication Publication Date Title
CN108268838B (en) Facial expression recognition method and facial expression recognition system
CN108510000B (en) Method for detecting and identifying fine-grained attribute of pedestrian in complex scene
CN106682598B (en) Multi-pose face feature point detection method based on cascade regression
CN109800648B (en) Face detection and recognition method and device based on face key point correction
Sung et al. Example-based learning for view-based human face detection
CN108520226B (en) Pedestrian re-identification method based on body decomposition and significance detection
US8842883B2 (en) Global classifier with local adaption for objection detection
US8965762B2 (en) Bimodal emotion recognition method and system utilizing a support vector machine
CN111126482B (en) Remote sensing image automatic classification method based on multi-classifier cascade model
US20050226509A1 (en) Efficient classification of three dimensional face models for human identification and other applications
CN100478979C (en) Status identification method by using body information matched human face information
JP2008310796A (en) Computer implemented method for constructing classifier from training data detecting moving object in test data using classifier
CN111709311A (en) Pedestrian re-identification method based on multi-scale convolution feature fusion
Li et al. Efficient 3D face recognition handling facial expression and hair occlusion
CN112381047B (en) Enhanced recognition method for facial expression image
Liliana et al. Human emotion recognition based on active appearance model and semi-supervised fuzzy C-means
CN112149538A (en) Pedestrian re-identification method based on multi-task learning
Amores et al. Fast spatial pattern discovery integrating boosting with constellations of contextual descriptors
CN113486902A (en) Three-dimensional point cloud classification algorithm automatic selection method based on meta-learning
Boruah et al. Different face regions detection based facial expression recognition
Curran et al. The use of neural networks in real-time face detection
Rasines et al. Feature selection for hand pose recognition in human-robot object exchange scenario
Hahmann et al. Eye localization using the discriminative generalized hough transform
Hahmann et al. Combination of facial landmarks for robust eye localization using the Discriminative Generalized Hough Transform
CN112287919B (en) Power equipment identification method and system based on infrared image

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant