CN109753950A - Dynamic human face expression recognition method - Google Patents


Info

Publication number
CN109753950A
Authority
CN
China
Prior art keywords
human face expression
triangular region
Prior art date
Legal status
Granted
Application number
CN201910109704.5A
Other languages
Chinese (zh)
Other versions
CN109753950B
Inventor
于明
苗少栋
王岩
郭迎春
刘依
朱叶
阎刚
于洋
师硕
郝小可
Current Assignee
Hebei University of Technology
Tianjin University of Commerce
Original Assignee
Hebei University of Technology
Tianjin University of Commerce
Priority date
Filing date
Publication date
Application filed by Hebei University of Technology and Tianjin University of Commerce
Priority to CN201910109704.5A
Publication of CN109753950A
Application granted
Publication of CN109753950B
Legal status: Expired - Fee Related
Anticipated expiration

Abstract

The dynamic human face expression recognition method of the present invention relates to methods for recognizing image features of figures or characters, and is a dynamic human face expression recognition method based on geometric features and semantic features. Its steps are: preprocessing of the dynamic human face image sequence; detection of the human face expression bounding box and annotation of feature points on the human face expression grayscale images; calibration of human face expression triangular regions on the human face expression grayscale images; extraction of geometric features of the human face expression triangular regions on the human face expression grayscale images; analysis and extraction of semantic features on the human face expression grayscale images; SVM classifier training and obtaining of classification results; and completion of the recognition of the dynamic human face expression. The present invention overcomes the defects generally existing in the prior art of poor real-time performance, susceptibility to illumination, and high feature dimensionality and time complexity, which in turn affect the human face expression recognition rate.

Description

Dynamic human face expression recognition method
Technical field
The technical solution of the present invention relates to methods for recognizing image features of figures or characters, and specifically to a dynamic human face expression recognition method.
Background art
Human face expression is the most effective means of human emotional communication. With the development of computer technology, human face expression recognition has important applications in fields involving machine vision systems and pattern recognition, such as psychological research, video conferencing, intelligent human-computer interaction, affective computing and the medical industry. With the all-round development of human-computer interaction technology, how to make computers automatically perceive human emotions has become a focus of artificial intelligence research.
Early human face expression recognition methods concentrated on studying human face expression features in still images. However, human face expression recognition based on static images lacks the motion information of the expression and cannot reflect its spatio-temporal characteristics. A human face expression is a dynamic change process in which spatio-temporal characteristics play an important role. Dynamic human face expression recognition methods can comprehensively extract the spatio-temporal characteristics of human face expressions and thereby reflect the changes of the expression itself, so as to improve the robustness and accuracy of human face expression recognition.
Research on existing dynamic human face expression recognition methods has been reported in the following documents. The document "Recognition of facial expressions based on salient geometric features and support vector machines" (Ghimire D, Lee J, Li Z N, et al. Recognition of facial expressions based on salient geometric features and support vector machines [J]. Multimedia Tools & Applications, 2016: 1-26.) proposes initializing facial feature points by Elastic Bunch Graph Matching, tracking the feature points with a Kanade-Lucas-Tomasi (KLT) tracker, and then selecting features with a multi-stage AdaBoost classifier that uses ELM as the weak classifier; since the number of ELM weak classifiers reaches 23426, the time consumption of feature selection is long. CN108256426A discloses a human face expression recognition method based on convolutional neural networks, which first calibrates the dynamic human face image sequence through facial key points and then recognizes the human face expression with a convolutional neural network; because the convolutional neural network has too many layers, the time complexity of extracting human face expression features is high. CN108921042A discloses a face sequence expression recognition method based on deep learning, which proposes extracting multi-scale features of the dynamic human face image sequence through a deep learning framework and completing human face expression recognition with them; because the multi-scale features extracted are the convolutional spatio-temporal features of image sequences at different resolutions, feature extraction consumes too much time. The document "Facial Expression Recognition from Video Sequences Based on Spatial-Temporal Motion Local Binary Pattern and Gabor Multiorientation Fusion Histogram" (Zhao L, Wang Z, Zhang G. Facial Expression Recognition from Video Sequences Based on Spatial-Temporal Motion Local Binary Pattern and Gabor Multiorientation Fusion Histogram [J]. Mathematical Problems in Engineering, 2017: 1-12.) proposes a human face expression analysis method that combines the dynamic and static information in the dynamic human face image sequence, extracting texture features by combining spatio-temporal motion local binary patterns (STM-LBP) with Gabor filters; the STM-LBP and Gabor algorithms themselves have the defects of high computational complexity, susceptibility to illumination interference and high feature dimensionality. CN105139004A discloses a human face expression recognition method based on video sequences, which proposes extracting texture features of the dynamic human face image sequence with the Haar-like center binary pattern on three orthogonal planes (HCBP-TOP); because the HCBP-TOP texture features are obtained from the sub-blocks produced by layered partitioning and the number of sub-blocks is large, the feature dimensionality is high.
CN104036255A discloses a human face expression recognition method that completes human face expression recognition by comparing, via the Euclidean distance, the similarity between a human face expression feature library formed by feature point position information vector differences and the expression feature vector to be tested; because the number of input human face expression images and calibrated human face expression feature points is large, the feature dimensionality of the human face expression feature vector is high. CN103971137B discloses a three-dimensional dynamic human face expression recognition method based on structured sparse feature learning, which extracts the LBP-TOP texture features of three-dimensional modules in the dynamic human face image sequence as training samples for an encoder dictionary, reduces their dimensionality with PCA, and inputs the reduced features into a conditional random field model to complete training and recognition; because the LBP-TOP texture features are extracted on the three-dimensional modules into which the dynamic human face image sequence is divided, the number of three-dimensional modules is large and texture features themselves suffer from high dimensionality, so the feature dimensionality remains too high even after PCA dimensionality reduction. The document "Dynamic image sequences expression recognition based on active appearance model and optical flow" (Shao Hong, Wang Yang, Wang Wei. Dynamic image sequences expression recognition based on active appearance model and optical flow [J]. Computer Engineering and Design, 2017, 38(6): 1642-1646.) reports locating 68 feature points of the human face expression image in the initial frame of the dynamic human face image sequence with the active appearance model (AAM), tracking these feature points with a Gaussian pyramid, and taking the difference between the feature point coordinates of the peak frame and the neutral frame of the human face expression in the dynamic human face image sequence as the human face expression feature; this method has poor real-time performance because the tracking effect deteriorates when the dynamic human face image sequence has too many frames.
CN106934375A discloses a human face expression recognition method based on feature point motion trajectory description, which extracts human face expression features from the inter-frame slope variation of the feature points tracked in the dynamic human face image sequence and inputs them into an RBF neural network to recognize the human face expression; because the number of frames of the tracked dynamic human face image sequence and the number of feature points are large, the real-time performance is poor. CN101908149A discloses a method for recognizing facial expressions from human face image sequences, which proposes obtaining geometric features by tracking the displacement changes of 20 feature points on the dynamic human face image sequence and analyzing the features with canonical correlation analysis to complete human face expression recognition; this method extracts only the two kinds of geometric features of feature point displacement and length, without considering texture features or semantic features, and therefore suffers from a low recognition rate. The document "Spatio-temporal convolutional features with nested LSTM for facial expression recognition" (Zhenbo Yu, Guangcan Liu, Qingshan Liu, Jiankang Deng. Spatio-temporal convolutional features with nested LSTM for facial expression recognition [J]. Neurocomputing, 2018: 50-57.) reports learning multi-level appearance features and temporal dynamic information of human face expressions with a deep learning framework; the problem of this method is high time complexity. CN106980811A discloses a human face expression recognition method and a human face expression recognition device that perform human face expression recognition with a training model containing multiple deep learning frameworks; because many deep learning frameworks are used, this method has the defect of high time complexity.
In short, because the features based on dynamic human face image sequences are complex and feature point tracking performs poorly, research on dynamic human face expression recognition methods in the prior art generally suffers from the defects of poor real-time performance, susceptibility to illumination, and high feature dimensionality and time complexity, which in turn affect the human face expression recognition rate.
Summary of the invention
The technical problem to be solved by the present invention is to provide a dynamic human face expression recognition method, namely a dynamic human face expression recognition method based on geometric features and semantic features, which overcomes the defects generally existing in the prior art of poor real-time performance, susceptibility to illumination, and high feature dimensionality and time complexity, which in turn affect the human face expression recognition rate.
The technical solution adopted by the present invention to solve the technical problem is a dynamic human face expression recognition method, namely a dynamic human face expression recognition method based on geometric features and semantic features, with the following specific steps:
First step, preprocessing of the dynamic human face image sequence:
First, every frame human face expression image in the input dynamic human face image sequence is size-normalized to M × N pixels; then every frame human face expression image in the input dynamic human face image sequence is transformed from RGB space into gray space using the following formula (1), obtaining every frame human face expression grayscale image I_gray_tn,
I_gray_tn = 0.299·I_R + 0.587·I_G + 0.114·I_B (1),
In formula (1), I_R, I_G and I_B are respectively the red, green and blue channel components of every frame human face expression image in the input dynamic human face image sequence. Every frame human face expression grayscale image I_gray_tn is retained for the human face expression bounding box detection and feature point annotation in the following second step;
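The preprocessing of this first step can be illustrated with a short sketch. The following is a minimal example, assuming OpenCV and NumPy are available; the target size and the input frames are placeholders, and the weighted sum reproduces formula (1).

```python
import cv2
import numpy as np

def preprocess_sequence(frames, size=(640, 480)):
    """Size-normalize each frame and convert it to gray space with formula (1)."""
    gray_frames = []
    for img in frames:                      # img: BGR image as loaded by OpenCV
        img = cv2.resize(img, size)         # normalize to M x N pixels
        b, g, r = cv2.split(img.astype(np.float32))
        gray = 0.299 * r + 0.587 * g + 0.114 * b   # formula (1)
        gray_frames.append(gray.astype(np.uint8))
    return gray_frames
```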
Second step, detection of the human face expression bounding box and annotation of the feature points of the human face expression grayscale image:
The Multiview_Reinforce interface of the LibFace library is used to perform human face expression bounding box detection on every frame human face expression grayscale image I_gray_tn obtained in the above first step, and feature point annotation is performed on the 68 feature points therein. The total coordinate vector of these 68 feature points is shown in the following formula (2),
X = ((x_1, y_1), (x_2, y_2), ..., (x_k, y_k), ..., (x_68, y_68))^T (2),
In formula (2), x_k and y_k are respectively the abscissa and ordinate corresponding to the k-th feature point in every frame human face expression grayscale image, k ∈ [1, 68];
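For reference, the 68-point annotation can be sketched as follows. The patent uses the Multiview_Reinforce interface of the LibFace library; since that library is not reproduced here, this sketch substitutes dlib's frontal face detector and 68-point shape predictor, which yield the same kind of coordinate vector X as formula (2). The predictor model path is a placeholder.

```python
import dlib
import numpy as np

detector = dlib.get_frontal_face_detector()
predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")  # placeholder path

def detect_landmarks(gray):
    """Return the face box and the 68 x 2 landmark coordinate array X of formula (2)."""
    faces = detector(gray, 1)
    if not faces:
        return None, None
    box = faces[0]
    shape = predictor(gray, box)
    X = np.array([(p.x, p.y) for p in shape.parts()], dtype=np.float32)  # shape (68, 2)
    return box, X
```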
Third step, calibration of the human face expression triangular regions on the human face expression grayscale image:
From the 68 feature points annotated on every frame human face expression grayscale image I_gray_tn in the above second step, 30 feature points on the eyebrows, eyes, nose and mouth are selected to calibrate the human face expression triangular regions on the human face expression grayscale image, forming 10 calibrated human face expression triangular regions in total. The vector combination TR of the human face expression triangular regions thus formed is shown in the following formula (3),
TR = {tr_1, tr_2, ..., tr_i, ..., tr_10} (3),
In formula (3), tr_i = {X_S_{i,1}, X_S_{i,2}, X_S_{i,3}}, i ∈ [1,10]; tr_i is the i-th human face expression triangular region, and X_S_{i,1}, X_S_{i,2} and X_S_{i,3} are respectively the vertex coordinates of the 1st, 2nd and 3rd vertices of the i-th human face expression triangular region,
The 10 calibrated human face expression triangular regions include: left eye and lips; eyes and eyebrows; lips and eyebrow center point; eyebrows and nose; nose and lips; eyebrows, eyes and nose; nose center point and mouth corners; right eye and lips; lips only; eyes only;
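As an illustration of this calibration step, the sketch below builds the vector combination TR from the 68-point landmark array X of formula (2). The patent does not list which landmark indices form each of the 10 triangular regions, so the index triples used here are hypothetical placeholders chosen only to show the data structure.

```python
import numpy as np

# Hypothetical landmark-index triples; the patent's actual 30-point selection is not given here.
TRIANGLE_INDICES = [
    (36, 48, 54),   # e.g. left eye and lips
    (36, 45, 19),   # e.g. eyes and an eyebrow point
    # ... further triples for the remaining regions
]

def build_TR(X):
    """TR = {tr_1, ..., tr_10}; each tr_i holds the three vertex coordinates X_S_{i,1..3}."""
    return [np.array([X[a], X[b], X[c]], dtype=np.float32) for a, b, c in TRIANGLE_INDICES]
```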
Fourth step, extraction of the geometric features of the human face expression triangular regions on the human face expression grayscale image:
The calculations are performed on the vector combination TR of the human face expression triangular regions calibrated on every frame human face expression grayscale image in the dynamic human face image sequence in the above third step. The specific steps are as follows:
Step 4.1, extraction of the distance features between the vertices of the human face expression triangular regions of every frame human face expression grayscale image in the dynamic human face image sequence:
The horizontal and vertical coordinates of the three vertices of the i-th human face expression triangular region tr_i in the vector combination TR of the human face expression triangular regions of every frame human face expression grayscale image in the dynamic human face image sequence in the above third step are X_S_{i,1}(x_{i,1}, y_{i,1}), X_S_{i,2}(x_{i,2}, y_{i,2}) and X_S_{i,3}(x_{i,3}, y_{i,3}). The Euclidean distances between each pair of vertices of the i-th human face expression triangular region tr_i in that vector combination TR are calculated with the following formulas (4), (5) and (6) respectively,
The Euclidean distance d_{i,1} between vertex X_S_{i,1} and vertex X_S_{i,2} is calculated as shown in the following formula (4):
d_{i,1} = sqrt((x_{i,1} - x_{i,2})^2 + (y_{i,1} - y_{i,2})^2) (4),
The Euclidean distance d_{i,2} between vertex X_S_{i,1} and vertex X_S_{i,3} is calculated as shown in the following formula (5):
d_{i,2} = sqrt((x_{i,1} - x_{i,3})^2 + (y_{i,1} - y_{i,3})^2) (5),
The Euclidean distance d_{i,3} between vertex X_S_{i,2} and vertex X_S_{i,3} is calculated as shown in the following formula (6):
d_{i,3} = sqrt((x_{i,2} - x_{i,3})^2 + (y_{i,2} - y_{i,3})^2) (6),
In formulas (4), (5) and (6), x_{i,1}, x_{i,2} and x_{i,3} are respectively the abscissas of the 1st, 2nd and 3rd vertices of the i-th human face expression triangular region, and y_{i,1}, y_{i,2} and y_{i,3} are respectively the ordinates of the 1st, 2nd and 3rd vertices of the i-th human face expression triangular region, i ∈ [1,10],
Thus the extraction of the distance features between the vertices of the human face expression triangular regions of every frame human face expression grayscale image in the dynamic human face image sequence is completed;
Step 4.2, extraction of the angle features of the vertices of the human face expression triangular regions of every frame human face expression grayscale image in the dynamic human face image sequence:
Calculate the angle features of the three vertex coordinates of the i-th human face expression triangular region tr_i in the vector combination TR of the human face expression triangular regions of every frame human face expression grayscale image in the dynamic human face image sequence in the above third step. The horizontal and vertical coordinates of the three vertices of tr_i are X_S_{i,1}(x_{i,1}, y_{i,1}), X_S_{i,2}(x_{i,2}, y_{i,2}) and X_S_{i,3}(x_{i,3}, y_{i,3}); using the coordinates of these three vertices, the angle feature r_{i,1} of the 1st vertex, the angle feature r_{i,2} of the 2nd vertex and the angle feature r_{i,3} of the 3rd vertex are calculated as shown in the following formulas (7), (8) and (9),
In formulas (7), (8) and (9), x_{i,1}, x_{i,2} and x_{i,3} are respectively the abscissas of the 1st, 2nd and 3rd vertices of the i-th human face expression triangular region, and y_{i,1}, y_{i,2} and y_{i,3} are respectively the ordinates of the 1st, 2nd and 3rd vertices of the i-th human face expression triangular region, i ∈ [1,10];
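A compact sketch of steps 4.1 and 4.2 follows, computing the three pairwise Euclidean distances of formulas (4) to (6) for one triangular region and its three vertex angles. The bodies of formulas (7) to (9) are not reproduced in this text, so the angle features are computed here with the law of cosines as an assumption; any formulation that yields the three interior angles would match the description.

```python
import numpy as np

def triangle_geometric_features(tri):
    """tri: (3, 2) array of vertex coordinates X_S_{i,1..3}; returns (d1, d2, d3, r1, r2, r3)."""
    v1, v2, v3 = tri
    d1 = np.linalg.norm(v1 - v2)   # formula (4)
    d2 = np.linalg.norm(v1 - v3)   # formula (5)
    d3 = np.linalg.norm(v2 - v3)   # formula (6)
    # Interior angles via the law of cosines (assumed form of formulas (7)-(9)).
    r1 = np.degrees(np.arccos(np.clip((d1**2 + d2**2 - d3**2) / (2 * d1 * d2), -1.0, 1.0)))
    r2 = np.degrees(np.arccos(np.clip((d1**2 + d3**2 - d2**2) / (2 * d1 * d3), -1.0, 1.0)))
    r3 = np.degrees(np.arccos(np.clip((d2**2 + d3**2 - d1**2) / (2 * d2 * d3), -1.0, 1.0)))
    return d1, d2, d3, r1, r2, r3
```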
Step 4.3, extraction of the geometric features of the human face expression triangular regions of every frame human face expression grayscale image in the dynamic human face image sequence:
The geometric features consist of the distance features and the angle features. For the i-th human face expression triangular region tr_i in the vector combination TR of the human face expression triangular regions of the human face expression grayscale image of the neutral frame in the dynamic human face image sequence, the distance features obtained in the above step 4.1 and the angle features obtained in the above step 4.2, six values in total, namely d_{i,1}, d_{i,2}, d_{i,3}, r_{i,1}, r_{i,2}, r_{i,3}, are stored in the vector h_i, where h_i = {d_{i,1}, d_{i,2}, d_{i,3}, r_{i,1}, r_{i,2}, r_{i,3}} and i ∈ [1,10]. For the i-th human face expression triangular region tr_i in the vector combination TR of the human face expression triangular regions of the human face expression grayscale image of the peak frame in the same dynamic human face image sequence as the above neutral frame, the six distance and angle feature values, namely d_{i,1}, d_{i,2}, d_{i,3}, r_{i,1}, r_{i,2}, r_{i,3}, are stored in the vector w_i, where w_i = {d_{i,1}, d_{i,2}, d_{i,3}, r_{i,1}, r_{i,2}, r_{i,3}} and i ∈ [1,10]. The feature ratios of the vector w_i to the vector h_i are put into the array z, z = {z_{(i-1)×6+j}}, z_{(i-1)×6+j} = w_{(i-1),j} / h_{(i-1),j}, where h is the totality of the distance and angle feature values of all triangular regions in the vector combination TR of the human face expression triangular regions of the human face expression grayscale image of the above neutral frame, w is the totality of the distance and angle feature values of all triangular regions in the vector combination TR of the human face expression triangular regions of the human face expression grayscale image of the above peak frame, z is the ratio of the distance and angle feature values of all triangular regions in the vector combinations TR of the human face expression grayscale images of the peak frame and the neutral frame, i ∈ [1,10], j ∈ [0,5]. This fully completes the extraction of the geometric features of the human face expression triangular region combination TR of every frame human face expression grayscale image of this dynamic human face image sequence;
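Step 4.3 can then be sketched as a single function that stores the six values of each triangular region of the neutral frame in h_i, the corresponding six values of the peak frame in w_i, and fills the 60-dimensional ratio array z = w / h. The helpers triangle_geometric_features and build_TR used here are the sketches introduced above, not functions of the patent itself.

```python
import numpy as np

def geometric_feature_vector(neutral_TR, peak_TR):
    """z[(i-1)*6 + j] = w_{i,j} / h_{i,j} for the 10 triangular regions (60 values in total)."""
    z = np.zeros(60, dtype=np.float32)
    for i, (tri_n, tri_p) in enumerate(zip(neutral_TR, peak_TR)):
        h_i = np.array(triangle_geometric_features(tri_n))   # neutral-frame values d, r
        w_i = np.array(triangle_geometric_features(tri_p))   # peak-frame values d, r
        z[i * 6:(i + 1) * 6] = w_i / h_i
    return z
```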
Step 4.4, extraction of the geometric features of all dynamic human face image sequences in the training set and test set:
The operations of the above steps 4.1 to 4.3 are executed in a loop. The arrays z obtained from each dynamic human face image sequence corresponding to the six classes of human face expressions in the training set, namely surprise, fear, happiness, sadness, disgust and anger, are stored as the human face expression geometric feature vector f used to train the SVM classifier, and the six different human face expression geometric feature vectors f obtained from the six classes of human face expressions form a six-class human face expression set; then the arrays z obtained from each dynamic human face image sequence corresponding to the six classes of human face expressions in the test set, namely surprise, fear, happiness, sadness, disgust and anger, are stored as the human face expression geometric feature vector te used to test the SVM classification model, and the six different human face expression geometric feature vectors te obtained from the six classes of human face expressions also form a six-class human face expression set; the six-class human face expression sets referred to in the above two places are combined into one six-class human face expression set. This is repeated until the loop over all dynamic human face image sequences in the training set and test set is finished, which completes the geometric feature extraction of all dynamic human face image sequences in the training set and test set;
Fifth step, analysis and extraction of the semantic features on the human face expression grayscale image:
Semantic analysis is performed on the human face expression geometric feature vector f of the training set and the human face expression geometric feature vector te of the test set obtained in the above fourth step, so as to realize the analysis and extraction of the semantic features on the human face expression grayscale images. The specific steps are as follows:
Step 5.1, construction of the human face expression semantic feature description set:
Human face expression semantics is defined as a kind of natural language description of human face expression features, including explanations of the dominant and recessive attributes of the geometric form, the relative positions of the different organs and the emotion in the human face expression features.
Let Μ be a set. When a human face expression has A human face expression features, a is the a-th human face expression feature therein; when the a-th human face expression feature is composed of U attributes, u is the u-th attribute therein; and when the u-th attribute is composed of B different strength grades, b is the b-th grade therein; then m_{a,b} is called a human face expression semantic, and Μ is a human face expression semantic description set.
The following three strength grades, small, medium and large, are formulated to describe the human face expression semantic features. A human face expression semantic is abbreviated as m_{a,b}, where a is the a-th human face expression semantic feature, a ∈ [1,59], b is the strength grade, b ∈ [1,3], and 1, 2 and 3 respectively represent the small, medium and large strength grades,
After the extraction of the geometric features of the human face expression triangular regions on the human face expression grayscale images in the above fourth step is completed, the human face expression semantic feature description set YU is constructed using the side lengths and angles of the human face expression triangular regions in the human face expression triangular region vector combination TR divided in every frame human face expression of the dynamic human face image sequence, as shown in the following formula (10),
YU = {yu_1, yu_2, ..., yu_n, ..., yu_59}, 1 ≤ n ≤ 59 (10),
In formula (10), YU is the human face expression semantic feature description set, yu_n is the semantic description, within the human face expression semantic feature description set, of the n-th side length and angle feature of the human face expression triangular regions in the human face expression triangular region vector combination TR, and n is the number of human face expression semantic feature descriptions;
The human face expression semantic feature set YU contains the human face expression semantic features obtained by semantically describing, for the 10 human face expression triangular regions in the human face expression triangular region vector combination TR described in the above third step, the six distance and angle features of each triangular region according to the above fourth step, that is, 60 human face expression features in total; since the descriptions of two of these human face expression semantic features are identical, 1 human face expression semantic feature is omitted;
Step 5.2, formulation of the semantic feature strength grade decision rule:
All the semantic feature values in the human face expression geometric feature vector f of the training set in the above step 4.4 are sorted in ascending order to obtain the semantic feature range set PF, PF = {pf_1, pf_2, ..., pf_v, ..., pf_59}, 1 ≤ v ≤ 59, where pf_v is the feature range corresponding to each semantic description yu_n; pf_v is divided into semantic feature strength grades according to thirds of the size of f, and the semantic feature strength grades are defined in the same way as in step 5.1;
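Under the rule described in step 5.2, the value range of each semantic description over the training set is split into thirds, and a value is assigned grade 1 (small), 2 (medium) or 3 (large) according to which third it falls into. The sketch below is one plausible reading of that rule; the exact cut used by the patent is only described as "according to the size of f/3".

```python
import numpy as np

def make_grade_rule(train_values):
    """train_values: 1-D array of one semantic feature over the training set; returns a grading function."""
    lo, hi = float(np.min(train_values)), float(np.max(train_values))
    step = (hi - lo) / 3.0
    def grade(value):
        if value < lo + step:
            return 1          # small
        if value < lo + 2 * step:
            return 2          # medium
        return 3              # large
    return grade
```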
Step 5.3, semantic analysis of all human face expression triangular regions in every frame human face expression grayscale image in the dynamic human face image sequence:
Calculate the mean value TR_AVG and the standard deviation TR_SD of the human face expression semantic feature values of the i-th human face expression triangular region tr_i in the vector combination TR of the human face expression triangular regions of every frame human face expression grayscale image in each dynamic human face image sequence in the training set of the above step 4.4,
The calculation of the mean value TR_AVG is shown in the following formula (11):
TR_AVG = (d_{0+i×6} + d_{1+i×6} + d_{2+i×6} + r_{3+i×6} + r_{4+i×6} + r_{5+i×6}) / 6 (11),
The calculation of the standard deviation TR_SD is shown in the following formula (12):
TR_SD = sqrt(((d_{0+i×6} - TR_AVG)^2 + (d_{1+i×6} - TR_AVG)^2 + (d_{2+i×6} - TR_AVG)^2 + (r_{3+i×6} - TR_AVG)^2 + (r_{4+i×6} - TR_AVG)^2 + (r_{5+i×6} - TR_AVG)^2) / 6) (12),
In formulas (11) and (12), d_{j+i×6} is the j-th distance feature value of the i-th human face expression triangular region tr_i in the vector combination TR of the human face expression triangular regions of every frame human face expression grayscale image in each dynamic human face image sequence, and r_{m+i×6} is the m-th angle feature value of the i-th human face expression triangular region tr_i in that vector combination TR, where i ∈ [1,10], j ∈ [0,2], m ∈ [3,5]. Thus the semantic analysis of all human face expression triangular regions in every frame human face expression grayscale image in the dynamic human face image sequence is completed;
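The per-region statistics of step 5.3 reduce to the mean and standard deviation of the six geometric values of each triangular region. A minimal sketch, assuming the 60-dimensional feature array z of step 4.3 as input:

```python
import numpy as np

def region_statistics(z):
    """Return (TR_AVG, TR_SD) per triangular region, as in formulas (11) and (12)."""
    values = np.asarray(z, dtype=np.float32).reshape(10, 6)   # one row of d, r values per region
    tr_avg = values.mean(axis=1)                               # formula (11)
    tr_sd = values.std(axis=1)                                 # formula (12), population form
    return tr_avg, tr_sd
```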
Step 5.4, obtaining the optimal human face expression triangular region combination of the six-class human face expression set:
First, the 10 human face expression triangular regions in the human face expression triangular region vector combination TR of each dynamic human face image sequence in the six-class human face expression set of step 4.4 are numbered consecutively. Then the 10 human face expression triangular region numbers in TR are sorted in ascending order according to the standard deviation TR_SD obtained in the above step 5.3, and the 5 top-ranked human face expression triangular region numbers are selected. Next, the quantities of these top-5 numbers across the six-class human face expression set of the above step 4.4 are counted, and the 5 numbers with the largest counts are obtained. Finally, these 5 numbers are sorted in ascending order according to the standard deviation TR_SD, obtaining the optimal human face expression triangular region combination of the six-class human face expression set;
Step 5.5, extraction of the final semantic features of the six-class human face expression set:
According to the semantic feature strength grade decision rule formulated in the above step 5.2, the strength grades of the semantic features of the human face expression triangular regions in the optimal human face expression triangular region combination of the six-class human face expression set obtained in step 5.4 are determined, and the quantity of each semantic feature strength grade in the six-class human face expression set is counted; the strength grade with the largest quantity is selected as the strength grade corresponding to the corresponding semantic description yu_n. Finally, the above operations are executed in a loop to count the quantities of the strength grades corresponding to all semantic descriptions yu_n in the six-class human face expression set, thereby obtaining the strength grades corresponding to all semantic descriptions yu_n in the six-class human face expression set; the final semantic features of the six-class human face expression set are extracted in this way;
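Steps 5.4 and 5.5 amount to a ranking-and-voting procedure: for every sequence of an expression class, the 5 triangular regions with the smallest standard deviation are noted; the 5 region numbers that occur most often across the class form its optimal combination, and for each semantic description the most frequent strength grade is kept. The sketch below shows one way to express this, under the assumption that the earlier region_statistics and make_grade_rule sketches supply its inputs.

```python
from collections import Counter

def optimal_regions(per_sequence_sd, top_k=5):
    """per_sequence_sd: list of length-10 std-dev lists, one per sequence of an expression class."""
    votes = Counter()
    for sd in per_sequence_sd:
        ranked = sorted(range(len(sd)), key=lambda i: sd[i])[:top_k]   # 5 smallest-std regions
        votes.update(ranked)
    return [idx for idx, _ in votes.most_common(top_k)]                # 5 most frequent region numbers

def final_strength_grade(grades_for_one_description):
    """Majority vote over the strength grades (1/2/3) observed for one semantic description."""
    return Counter(grades_for_one_description).most_common(1)[0][0]
```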
So far the analysis and extraction of the semantic features on the human face expression grayscale images are complete;
Sixth step, SVM classifier training and obtaining the classification results:
The semantic feature data obtained from the analysis and extraction of the semantic features on the human face expression grayscale images in the above fifth step are input into the SVM classifier for training and prediction, to judge which class of human face expression the dynamic human face image sequence input in the above first step belongs to. Ten-fold cross-validation is used, and the average result of the experiments is taken as the final human face expression recognition rate. The specific operation process is as follows:
(6.1) The semantic feature data obtained from the analysis and extraction of the semantic features on the human face expression grayscale images in the above fifth step are input into the SVM classifier for training. The semantic feature matrix of the training samples is constructed according to the human face expression geometric feature vector f of the training set of the above step 4.4, the semantic feature matrix of the test samples is constructed according to the human face expression geometric feature vector te of the test set of the above step 4.4, and the corresponding training classification sample matrix is then constructed according to the semantic feature matrix of the training samples; the values in the training classification sample matrix are the human face expression classes;
(6.2) A linear kernel function is used, the iteration stopping criterion is 100, and the type of the SVM classifier is C_SVC. First, the semantic feature matrix of the training samples, the training classification sample matrix and the parameters are fed into the train function of the SVM classifier to obtain the classification model; then the semantic feature matrix of the test samples is input into the predict function of the classification model for prediction. The SVM classifier training is thus completed and the classification results are obtained, after which experiments on the CK+ database and the MMI database obtain the classification results of the six kinds of human face expressions: surprise, fear, happiness, sadness, disgust and anger;
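The classification stage of the sixth step can be reproduced with any C-SVC implementation with a linear kernel. The sketch below uses scikit-learn's SVC and ten-fold cross-validation as a stand-in for the train/predict interface named in the patent; the feature matrix X_sem (one row of semantic features per sequence) and the label vector y (six expression classes) are placeholders.

```python
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

def evaluate(X_sem, y):
    """X_sem: (n_sequences, n_semantic_features) matrix; y: expression class labels."""
    clf = SVC(kernel="linear", C=1.0, max_iter=100)    # linear kernel, C-SVC, 100-iteration stop
    scores = cross_val_score(clf, X_sem, y, cv=10)     # ten-fold cross-validation
    return scores.mean()                               # average recognition rate
```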
The recognition of the dynamic human face expression is thus completed.
In the above dynamic human face expression recognition method, the grayscale normalization algorithm, the geometric normalization algorithm, the LibFace face detection and feature point annotation algorithm and the SVM classifier that are used are all well known in the art.
Compared with the prior art, the remarkable effects and advantages of the present invention are as follows:
(1) The dynamic human face expression recognition method of the present invention is a dynamic human face expression recognition method based on geometric features and semantic features. By annotating the feature points of the key facial positions, combinations of human face expression triangular regions are calibrated on every frame human face expression grayscale image of the dynamic human face image sequence; distance features and angle features are then extracted for each human face expression triangular region; the ratios of the distance features and angle features of the neutral frame and the peak frame are taken as the geometric features; semantic analysis is then performed and the semantic features are finally extracted. Extracting dynamic information for the key facial positions of the dynamic human face image sequence on the basis of the geometric features of the human face expression frames not only reduces time complexity but also improves robustness to the variations in face scale, size, head orientation and texture caused by age differences. The extracted semantic features are robust for human face expression recognition, which not only reduces feature dimensionality and time complexity but also improves the human face expression recognition rate.
(2) The geometric features based on human face expression triangular regions selected by the present invention consist of the distance features between the vertices of the human face expression triangular regions of every frame human face expression grayscale image in the dynamic human face image sequence and the angle features of those vertices. The tracking result obtained by calculating the differences of the distances between the vertices and of the angles at the vertices of the human face expression triangular regions of every frame human face expression grayscale image in the dynamic human face image sequence is robust to the variations in face scale, size, head orientation and texture caused by age differences, and the two kinds of geometric features calculated by the method of the present invention, namely the distance features and the angle features, involve simple steps and short computation time.
(3) The human face expression geometric features proposed by the present invention are the feature ratios of the peak frame and the neutral frame of the dynamic human face image sequence; the features have only 60 dimensions, and after semantic analysis the dimensionality is reduced by half again, which reduces the feature dimensionality.
(4) The present invention semantically describes the geometric features and performs statistics on the semantically described geometric features, so that the decision rule of the semantic feature strength grades is formulated; semantic analysis is performed on the semantically described geometric features to obtain the optimal human face expression triangular region combination among the regions, and the strength grades of the semantic features of the optimal human face expression triangular region combination are determined according to the formulated decision rule to obtain the final semantic features. Experiments verify that the present invention can further improve the dynamic human face expression recognition rate.
(5) Comparison of the present invention with CN108256426A: CN108256426A calibrates the human face image sequence through facial key points and then recognizes the human face expression with a convolutional neural network. Because the convolutional neural network has too many layers, the time complexity of extracting facial features is high, whereas the geometric features extracted by the present invention are obtained from the feature ratios of the peak frame and the neutral frame, and the computational complexity is low.
(6) Comparison of the present invention with CN108921042A: CN108921042A extracts multi-scale features of the dynamic human face image sequence through a deep learning framework and completes human face expression recognition with them. Because the multi-scale features extracted are the convolutional spatio-temporal features of image sequences at different resolutions, the algorithm itself takes a long time and the time complexity is high, whereas the geometric features of the human face expression triangular region combination extracted by the present invention only require calculating the distance and angle feature values within the human face expression triangular regions of a frame, without extracting convolutional spatio-temporal features, which reduces time consumption; the time complexity is therefore lower than that of CN108921042A.
(7) Comparison of the present invention with CN105139004A: CN105139004A extracts the texture features of the human face expression sequence with the Haar-like center binary pattern on three orthogonal planes (HCBP-TOP). Because the HCBP-TOP texture features are obtained from the sub-blocks produced by layered partitioning and the number of sub-blocks is large, the extracted feature dimensionality is high, whereas the geometric features of the human face expression triangular region combination extracted by the present invention have only 60 dimensions, and the semantic features obtained after semantic analysis have only half the dimensionality of the original geometric features; the method of the present invention therefore reduces the feature dimensionality.
(8) Comparison of the present invention with CN104036255A: CN104036255A completes human face expression recognition by comparing, via the Euclidean distance, the similarity between the expression feature library formed by the feature point position information vector differences of the human face image sequence and the expression feature vector to be tested; because the number of input facial expression images and calibrated facial feature points is too large, the feature dimensionality of the expression feature vector is high. The present invention only tracks the ratios of the angles and distances of the human face expression triangular region combinations in the peak frame and the neutral frame of the dynamic human face image sequence, the extracted feature dimensionality is only 60, and the semantic features obtained after further semantic analysis have only half of that dimensionality, which further reduces the feature dimensionality.
(9) Comparison of the present invention with CN106934375A: CN106934375A extracts expression features from the inter-frame slope variation of the feature points tracked on the dynamic human face image sequence and inputs them into an RBF neural network to recognize the human face expression; because the number of frames of the tracked dynamic human face image sequence and the number of feature points are too large, the real-time performance is poor. The present invention not only tracks fewer human face image frames than CN106934375A, but the number of feature points of the selected human face expression triangular regions is also far lower than that of CN106934375A, and the types of features extracted by the present invention are more numerous than those of CN106934375A, including distance, angle and semantic features; the present invention therefore guarantees real-time performance while improving the effect of tracking the dynamic human face image sequence.
(10) Comparison of the present invention with CN101908149A: CN101908149A proposes extracting geometric features by tracking the displacement changes of 20 feature points on the dynamic human face image sequence and analyzing them with canonical correlation analysis to complete human face expression recognition; because that method extracts only the two geometric features of feature point displacement and length, without considering texture features or semantic features, its recognition rate is low. The present invention not only considers the two kinds of geometric features of angle and distance in the human face expression triangular region combinations, but also extracts high-level semantic features, which improves the human face expression recognition rate.
(11) Comparison of the present invention with CN106980811A: CN106980811A performs human face expression recognition with a training model containing multiple classes of deep learning frameworks; because too many deep learning frameworks are used, its time complexity is high. The semantic features of the human face expression triangular region combinations extracted by the present invention have only 30 dimensions, the calculation is simple, and no deep learning framework needs to be built to recognize the human face expression; a good recognition effect can be obtained with only the SVM classifier, which reduces the time complexity.
(12) CN103971137B extracts the LBP-TOP texture features of three-dimensional modules in the dynamic human face image sequence as training samples for an encoder dictionary, reduces their dimensionality with PCA, and inputs the reduced features into a conditional random field model to complete training and recognition. Because the LBP-TOP texture features are extracted on the three-dimensional modules into which the dynamic human face image sequence is divided, the number of three-dimensional modules is large and texture features themselves suffer from high dimensionality, so the feature dimensionality remains too high even after PCA dimensionality reduction, whereas the geometric features of the dynamic human face image sequence extracted by the present invention have only 60 dimensions, and the semantic features extracted after semantic analysis have half the geometric feature dimensionality; the method of the present invention can therefore overcome the defect of high feature dimensionality.
(13) The small-sample face recognition method of CN106529447A, the human face expression recognition method based on video sequences of CN105139004A, the human face expression recognition method of CN105069447B, the recognition method of facial micro-expressions in video sequences of CN105139039B, the classification and recognition method of human face expressions based on dynamic texture features of CN106127196A, the recognition method of facial micro-expression image sequences in surveillance video of CN106548149A and the automatic human face expression recognition method based on multi-feature fusion of CN106599854A are previous patented technologies of the team of the present inventors. In practice they still have the defects of poor robustness, relatively high feature dimensionality and time complexity, and relatively low human face expression recognition rates. In order to overcome these defects, the team of the present inventors continued to research and develop updated face recognition method technologies, and developed the dynamic human face expression recognition method of the present invention through creative labor. The claimed technical solution of the method of the present invention, obtained on the basis of the six patented technologies of the above previous applications, is not something that a person skilled in the art could obtain easily.
Brief description of the drawings
The present invention is further described below with reference to the drawings and embodiments.
Fig. 1 is a schematic flow diagram of the method of the present invention.
Fig. 2 is a schematic diagram of feature point annotation on the human face expression bounding box using LibFace in the method of the present invention.
Fig. 3 is a schematic diagram of the feature point annotation process on two surprised human face expression images in the data set and of the initialized feature points in the method of the present invention.
Fig. 4 is a schematic diagram of the result after the calibration of the human face expression triangular regions and of the process of extracting the geometric features of the human face expression triangular regions in the method of the present invention.
Fig. 5(a) is a histogram of the quantity proportions of the top-5 ranked features in the semantic analysis of the human face expression triangular regions for the Happy expression.
Fig. 5(b) is a histogram of the quantity proportions of the top-5 ranked features in the semantic analysis of the human face expression triangular regions for the Angry expression.
Fig. 5(c) is a histogram of the quantity proportions of the top-5 ranked features in the semantic analysis of the human face expression triangular regions for the Disgust expression.
Fig. 5(d) is a histogram of the quantity proportions of the top-5 ranked features in the semantic analysis of the human face expression triangular regions for the Sad expression.
Fig. 5(e) is a histogram of the quantity proportions of the top-5 ranked features in the semantic analysis of the human face expression triangular regions for the Fear expression.
Fig. 5(f) is a histogram of the quantity proportions of the top-5 ranked features in the semantic analysis of the human face expression triangular regions for the Surprise expression.
Fig. 6 is a comparison diagram of the recognition effects of the geometric feature method and the geometric-plus-semantic feature extraction method after SVM classification on the CK+ and MMI data sets in the method of the present invention.
Specific embodiment
The embodiment shown in Fig. 1 shows that the flow of the method of the present invention is: preprocessing of the dynamic human face image sequence; detection of the human face expression bounding box and feature point annotation of the human face expression grayscale images; calibration of the human face expression triangular regions on the human face expression grayscale images; extraction of the geometric features of the human face expression triangular regions; analysis and extraction of the semantic features; SVM classifier training and obtaining of the classification results; and completion of the recognition of the dynamic human face expression.
The embodiment shown in Fig. 2 shows a schematic diagram of performing human face expression bounding box detection on a human face expression sequence and annotating its feature points using LibFace in the present invention.
The embodiment shown in Fig. 3 shows the feature point annotation of two surprised human face expression sequences in the data set, from which the changes of the feature points in the human face expression images can be observed, together with a schematic diagram of the initialized feature points, namely the annotated feature points on a pure white background; this diagram intuitively shows the parts where the feature points change most in the surprised expression.
The embodiment shown in Fig. 4 shows a schematic diagram of the 10 human face expression triangular regions calibrated on the human face expression image after the feature point annotation and of the distance and angle features of a human face expression triangular region, where A, B and C are the vertices of a triangular region, AB, AC and BC are the side lengths, namely the distance features calculated pairwise between the vertices A, B and C, and α, β and γ are the angle features calculated from the interior angles at the vertices A, B and C respectively.
The embodiment shown in Fig. 5 shows, after semantic analysis of the geometric features of the human face expression triangular regions of the six classes of human face expression frames in the method of the present invention, the histograms of the quantity proportions of the human face expression triangular region numbers in the human face expression triangular region vector combination TR whose standard deviations rank in the top 5; the optimal human face expression triangular region combination is obtained by counting the shared quantities.
The embodiment shown in Fig. 5(a) is a schematic diagram of the quantity proportions of the human face expression triangular region numbers in the human face expression triangular region vector combination TR of the Happy expression sequences that rank in the top 5 by standard deviation after semantic analysis. The quantity proportions of the human face expression triangular region numbers in the vector combination TR are ranked, and the optimal human face expression triangular region combination of the Happy expression sequences, arranged from small to large, is finally tr_9, tr_1, tr_5, tr_2, tr_6, where tr_0 to tr_9 denote the numbers corresponding to the 10 triangular regions, the values on the ordinate denote the quantity of each triangular region number in the Happy human face expression set, the abscissa denotes the top five standard deviation rankings, and the quantities of the triangular region numbers in each standard deviation ranking sum to the total size of the Happy human face expression set.
The embodiment shown in Fig. 5(b) is a schematic diagram of the quantity proportions of the human face expression triangular region numbers in the human face expression triangular region vector combination TR of the Angry expression sequences that rank in the top 5 by standard deviation after semantic analysis. The quantity proportions of the human face expression triangular region numbers in the vector combination TR are ranked, and the optimal human face expression triangular region combination of the Angry expression sequences, arranged from small to large, is finally tr_1, tr_2, tr_7, tr_6, tr_3, where tr_0 to tr_9 denote the numbers corresponding to the 10 triangular regions, the values on the ordinate denote the quantity of each triangular region number in the Angry human face expression set, the abscissa denotes the top five standard deviation rankings, and the quantities of the triangular region numbers in each standard deviation ranking sum to the total size of the Angry human face expression set.
The embodiment shown in Fig. 5(c) is a schematic diagram of the quantity proportions of the human face expression triangular region numbers in the human face expression triangular region vector combination TR of the Disgust expression sequences that rank in the top 5 by standard deviation after semantic analysis. The quantity proportions of the human face expression triangular region numbers in the vector combination TR are ranked, and the optimal human face expression triangular region combination of the Disgust expression sequences, arranged from small to large, is finally tr_8, tr_6, tr_2, tr_3, tr_4, where tr_0 to tr_9 denote the numbers corresponding to the 10 triangular regions, the values on the ordinate denote the quantity of each triangular region number in the Disgust human face expression set, the abscissa denotes the top five standard deviation rankings, and the quantities of the triangular region numbers in each standard deviation ranking sum to the total size of the Disgust human face expression set.
The embodiment shown in Fig. 5(d) is a schematic diagram of the quantity proportions of the human face expression triangular region numbers in the human face expression triangular region vector combination TR of the Sad expression sequences that rank in the top 5 by standard deviation after semantic analysis. The quantity proportions of the human face expression triangular region numbers in the vector combination TR are ranked, and the optimal human face expression triangular region combination of the Sad expression sequences, arranged from small to large, is finally tr_2, tr_1, tr_6, tr_3, tr_9, where tr_0 to tr_9 denote the numbers corresponding to the 10 triangular regions, the values on the ordinate denote the quantity of each triangular region number in the Sad human face expression set, the abscissa denotes the top five standard deviation rankings, and the quantities of the triangular region numbers in each standard deviation ranking sum to the total size of the Sad human face expression set.
The embodiment shown in Fig. 5(e) is a schematic diagram of the quantity proportions of the human face expression triangular region numbers in the human face expression triangular region vector combination TR of the Fear expression sequences that rank in the top 5 by standard deviation after semantic analysis. The quantity proportions of the human face expression triangular region numbers in the vector combination TR are ranked, and the optimal human face expression triangular region combination of the Fear expression sequences, arranged from small to large, is finally tr_9, tr_8, tr_1, tr_2, tr_6, where tr_0 to tr_9 denote the numbers corresponding to the 10 triangular regions, the values on the ordinate denote the quantity of each triangular region number in the Fear human face expression set, the abscissa denotes the top five standard deviation rankings, and the quantities of the triangular region numbers in each standard deviation ranking sum to the total size of the Fear human face expression set.
The embodiment shown in Fig. 5(f) is a schematic diagram of the quantity proportions of the human face expression triangular region numbers in the human face expression triangular region vector combination TR of the Surprise expression sequences that rank in the top 5 by standard deviation after semantic analysis. The quantity proportions of the human face expression triangular region numbers in the vector combination TR are ranked, and the optimal human face expression triangular region combination of the Surprise expression sequences, arranged from small to large, is finally tr_1, tr_5, tr_2, tr_3, tr_9, where tr_0 to tr_9 denote the numbers corresponding to the 10 triangular regions, the values on the ordinate denote the quantity of each triangular region number in the Surprise human face expression set, the abscissa denotes the top five standard deviation rankings, and the quantities of the triangular region numbers in each standard deviation ranking sum to the total size of the Surprise human face expression set.
The embodiment shown in Fig. 6 shows a comparison diagram of the effects of the geometric feature method and the semantic feature extraction method of the present invention after SVM training and classification on the CK+ and MMI data sets, where the values on the ordinate denote the recognition rates with and without semantic analysis on the CK+ and MMI data sets, the abscissa denotes the numbers of the 10 experiments, the interval on the abscissa is 1, the interval on the ordinate is 5%, and the lowest recognition rate on the ordinate is 50%.
Embodiment 1
This embodiment of the dynamic human face expression recognition method of the present invention based on geometric features and semantic features has the following specific steps:
First step, preprocessing of the dynamic human face image sequence:
First, every frame human face expression image in the input dynamic human face image sequence is size-normalized to 640 × 480 pixels; then every frame human face expression image in the input dynamic human face image sequence is transformed from RGB space into gray space using the following formula (1), obtaining every frame human face expression grayscale image I_gray_tn,
Igray_tn=0.299IR+0.587IG+0.114IB(1),
In formula (1), IR、IG、IBIt is every frame Facial Expression Image in inputted dynamic human face image sequence respectively Red, green and blue three channel components, retain every frame human face expression gray level image Igray_tn, for people in following second step The detection of face expression frame is used with characteristic point mark;
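A minimal sketch of this preprocessing step, assuming OpenCV and NumPy are available and that the input frames arrive as BGR images (the function names are illustrative, not part of the patent):

```python
import cv2
import numpy as np

def preprocess_frame(frame_bgr):
    """Size-normalize one frame to 640 x 480 and convert it to gray per formula (1)."""
    resized = cv2.resize(frame_bgr, (640, 480))            # size normalization
    b, g, r = cv2.split(resized.astype(np.float32))
    gray = 0.299 * r + 0.587 * g + 0.114 * b               # formula (1)
    return gray.astype(np.uint8)

def preprocess_sequence(frames):
    """Apply the same preprocessing to every frame of a dynamic facial image sequence."""
    return [preprocess_frame(f) for f in frames]
```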
The second step, facial expression frame detection and feature point annotation of the facial expression gray image:
Facial expression frame detection is carried out on each frame of the facial expression gray image Igray_tn obtained in the first step above using the Multiview_Reinforce interface of the LibFace library, and feature point annotation is carried out on 68 feature points therein; the overall coordinate vector of these 68 feature points is shown in the following formula (2),
X = ((x1,y1), (x2,y2), ..., (xk,yk), ..., (x68,y68))^T (2),
In formula (2), xk and yk are respectively the abscissa and ordinate corresponding to the k-th feature point in each frame of the facial expression gray image, k ∈ [1,68];
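The patent names the Multiview_Reinforce interface of the LibFace library for this step; its exact API is not reproduced here, so the sketch below substitutes dlib's 68-point shape predictor (an assumption) purely to illustrate how the coordinate vector X of formula (2) could be assembled; the model path is hypothetical:

```python
import dlib
import numpy as np

detector = dlib.get_frontal_face_detector()                                # stand-in for LibFace face detection
predictor = dlib.shape_predictor("shape_predictor_68_face_landmarks.dat")  # hypothetical model file

def detect_landmarks(gray_image):
    """Detect the facial expression frame and return the 68 landmark coordinates X of formula (2)."""
    faces = detector(gray_image)
    if not faces:
        return None
    shape = predictor(gray_image, faces[0])
    # X = ((x1, y1), (x2, y2), ..., (x68, y68))^T
    return np.array([(shape.part(k).x, shape.part(k).y) for k in range(68)])
```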
The third step, calibration of the facial expression triangular regions on the facial expression gray image:
From the 68 feature points of each frame of the facial expression gray image Igray_tn annotated in the second step above, 30 feature points on the eyebrows, eyes, nose and mouth are selected to calibrate the facial expression triangular regions on the facial expression gray image, forming 10 calibrated facial expression triangular regions in total; the vector combination TR of facial expression triangular regions formed in this way is shown in the following formula (3),
TR = {tr1, tr2, ..., tri, ..., tr10} (3),
In formula (3), tri = {X_Si,1, X_Si,2, X_Si,3}, i ∈ [1,10]; tri is the i-th facial expression triangular region, and X_Si,1, X_Si,2 and X_Si,3 are the vertex coordinates of the 1st, 2nd and 3rd vertices of the i-th facial expression triangular region,
The 10 calibrated facial expression triangular regions are: left eye and lips; eyes and eyebrows; lips and eyebrow center point; eyebrows and nose; nose and lips; eyebrows, eyes and nose; nose center point and mouth corners; right eye and lips; lips only; eyes only;
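The mapping from the 30 selected feature points to the 10 triangular regions is not spelled out numerically in the text, so the structure below is only a sketch with hypothetical landmark index triples:

```python
# Hypothetical index triples: the actual vertex indices per region are defined by the patent's
# calibration (left eye and lips, eyes and eyebrows, ..., lips only, eyes only).
TRIANGLE_REGIONS = {
    1: (36, 48, 54),    # placeholder for "left eye and lips"
    2: (36, 45, 27),    # placeholder for "eyes and eyebrows"
    # ... regions 3 to 10 would be filled in analogously from the 30 selected feature points
}

def region_vertices(landmarks, region_id):
    """Return the three vertex coordinates X_S_{i,1..3} of one triangular region tr_i."""
    return [tuple(landmarks[k]) for k in TRIANGLE_REGIONS[region_id]]
```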
The 4th step, extraction of the geometric features of the facial expression triangular regions on the facial expression gray image:
The computation is carried out on the vector combination TR of the facial expression triangular regions calibrated for each frame of the facial expression gray image in the dynamic facial image sequence of the third step above; the specific steps are as follows:
The 4.1st step, extraction of the distance features between the facial expression triangular region vertices of each frame of the facial expression gray image in the dynamic facial image sequence:
Let the horizontal and vertical coordinates of the three vertices of the i-th facial expression triangular region tri in the vector combination TR of facial expression triangular regions of each frame of the facial expression gray image in the dynamic facial image sequence of the third step above be X_Si,1(xi,1, yi,1), X_Si,2(xi,2, yi,2) and X_Si,3(xi,3, yi,3); the Euclidean distances between each pair of vertices of tri are calculated with the following formulas (4), (5) and (6) respectively,
The Euclidean distance di,1 between vertex X_Si,1 and vertex X_Si,2 is calculated as shown in the following formula (4):
di,1 = √((xi,1 - xi,2)² + (yi,1 - yi,2)²) (4),
The Euclidean distance di,2 between vertex X_Si,1 and vertex X_Si,3 is calculated as shown in the following formula (5):
di,2 = √((xi,1 - xi,3)² + (yi,1 - yi,3)²) (5),
The Euclidean distance di,3 between vertex X_Si,2 and vertex X_Si,3 is calculated as shown in the following formula (6):
di,3 = √((xi,2 - xi,3)² + (yi,2 - yi,3)²) (6),
In formulas (4), (5) and (6), xi,1, xi,2 and xi,3 are respectively the abscissas of the 1st, 2nd and 3rd vertices of the i-th facial expression triangular region, and yi,1, yi,2 and yi,3 are respectively their ordinates, i ∈ [1,10],
This completes the extraction of the distance features between the facial expression triangular region vertices of each frame of the facial expression gray image in the dynamic facial image sequence;
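A direct sketch of formulas (4)-(6), assuming each vertex is an (x, y) pair:

```python
import math

def pairwise_distances(v1, v2, v3):
    """Euclidean distances d_{i,1}, d_{i,2}, d_{i,3} between the three vertices (formulas (4)-(6))."""
    d1 = math.dist(v1, v2)    # |X_S_{i,1} X_S_{i,2}|
    d2 = math.dist(v1, v3)    # |X_S_{i,1} X_S_{i,3}|
    d3 = math.dist(v2, v3)    # |X_S_{i,2} X_S_{i,3}|
    return d1, d2, d3
```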
The 4.2nd step, extraction of the angle features of the facial expression triangular region vertices of each frame of the facial expression gray image in the dynamic facial image sequence:
The angle features of the three vertex coordinates of the i-th facial expression triangular region tri in the vector combination TR of facial expression triangular regions of each frame of the facial expression gray image in the dynamic facial image sequence of the third step above are calculated. The horizontal and vertical coordinates of the three vertices of tri are X_Si,1(xi,1, yi,1), X_Si,2(xi,2, yi,2) and X_Si,3(xi,3, yi,3); from the coordinates of these three vertices, the angle feature ri,1 of the 1st vertex, the angle feature ri,2 of the 2nd vertex and the angle feature ri,3 of the 3rd vertex are calculated with the following formulas (7), (8) and (9),
In formulas (7), (8) and (9), xi,1, xi,2 and xi,3 are respectively the abscissas of the 1st, 2nd and 3rd vertices of the i-th facial expression triangular region, and yi,1, yi,2 and yi,3 are respectively their ordinates, i ∈ [1,10];
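Formulas (7)-(9) themselves are not reproduced in the text; a common way to obtain the interior angle at each vertex, assumed here only for illustration, is the law of cosines over the side lengths of the 4.1st step:

```python
import math

def vertex_angles(d1, d2, d3):
    """Interior angles r_{i,1}, r_{i,2}, r_{i,3} at the three vertices, assuming a
    law-of-cosines form for the unreproduced formulas (7)-(9)."""
    # d1 = |v1 v2|, d2 = |v1 v3|, d3 = |v2 v3|
    r1 = math.acos((d1**2 + d2**2 - d3**2) / (2 * d1 * d2))   # angle at vertex 1
    r2 = math.acos((d1**2 + d3**2 - d2**2) / (2 * d1 * d3))   # angle at vertex 2
    r3 = math.acos((d2**2 + d3**2 - d1**2) / (2 * d2 * d3))   # angle at vertex 3
    return r1, r2, r3
```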
The 4.3rd step, extraction of the geometric features of the facial expression triangular regions of each frame of the facial expression gray image in the dynamic facial image sequence:
The geometric feature is composed of the distance features and the angle features. The distance features obtained in the 4.1st step and the angle features obtained in the 4.2nd step for the i-th facial expression triangular region tri in the vector combination TR of facial expression triangular regions of the facial expression gray image of the neutral frame of the dynamic facial image sequence, six values in total, namely di,1, di,2, di,3, ri,1, ri,2, ri,3, are stored in a vector hi, hi = {di,1, di,2, di,3, ri,1, ri,2, ri,3}, where i ∈ [1,10]. The distance features and angle features of the i-th facial expression triangular region tri in the vector combination TR of facial expression triangular regions of the facial expression gray image of the peak frame of the same dynamic facial image sequence, likewise six values di,1, di,2, di,3, ri,1, ri,2, ri,3, are stored in a vector wi, wi = {di,1, di,2, di,3, ri,1, ri,2, ri,3}, where i ∈ [1,10]. The feature ratios of vector wi to vector hi are put into an array z, z = {z(i-1)×6+j}, z(i-1)×6+j = w(i-1),j / h(i-1),j, where h denotes the distance and angle feature values of all triangular regions in the vector combination TR of the facial expression gray image of the neutral frame, w denotes those of the peak frame, and z is the ratio of the peak-frame values to the neutral-frame values over the distance and angle features of all triangular regions, i ∈ [1,10], j ∈ [0,5]. This completes the geometric feature extraction of the facial expression triangular region combination TR of each frame of the facial expression gray image of this dynamic facial image sequence;
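A sketch of the neutral-to-peak ratio feature of the 4.3rd step, assuming the illustrative helpers from the earlier sketches (region_vertices, pairwise_distances, vertex_angles):

```python
import numpy as np

def region_features(vertices):
    """Six geometric features {d1, d2, d3, r1, r2, r3} of one triangular region."""
    d1, d2, d3 = pairwise_distances(*vertices)
    r1, r2, r3 = vertex_angles(d1, d2, d3)
    return np.array([d1, d2, d3, r1, r2, r3])

def ratio_features(neutral_landmarks, peak_landmarks):
    """60-dimensional array z: peak-frame features divided by neutral-frame features
    over the 10 triangular regions."""
    z = np.empty(60)
    for i in range(1, 11):
        h = region_features(region_vertices(neutral_landmarks, i))   # vector h_i (neutral frame)
        w = region_features(region_vertices(peak_landmarks, i))      # vector w_i (peak frame)
        z[(i - 1) * 6:(i - 1) * 6 + 6] = w / h
    return z
```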
The 4.4th step, extraction of the geometric features of all dynamic facial image sequences in the training set and the test set:
The operations of the 4.1st to 4.3rd steps above are executed in a loop: the arrays z obtained from each dynamic facial image sequence of the six classes of facial expressions in the training set (surprise, fear, happiness, sadness, disgust and anger) are stored as the facial expression geometric feature vectors f used to train the SVM classifier on the training set, and the six different facial expression geometric feature vectors f obtained from the six expression classes form a six-class facial expression set; likewise, the arrays z obtained from each dynamic facial image sequence of the six classes of facial expressions in the test set (surprise, fear, happiness, sadness, disgust and anger) are stored as the facial expression geometric feature vectors te used to test the SVM classification model on the test set, and the six different facial expression geometric feature vectors te obtained from the six expression classes also form a six-class facial expression set; the six-class facial expression sets mentioned in these two places are combined into one six-class facial expression set. The loop runs until all dynamic facial image sequences in the training set and the test set have been processed, which completes the geometric feature extraction of all dynamic facial image sequences in the training set and the test set;
The 5th step, analysis and extraction of the semantic features on the facial expression gray image:
Semantic analysis is carried out on the facial expression geometric feature vectors f of the training set and the facial expression geometric feature vectors te of the test set obtained in the 4th step above, so as to realize the analysis and extraction of the semantic features on the facial expression gray image; the specific steps are as follows:
The 5.1st step, constructing the facial expression semantic feature description set:
Facial expression semantics is defined as a natural language description of the facial expression features, including the explanation of dominant and recessive attributes such as the geometric shape of the facial expression features, the relative positions of the different organs, and the emotion.
Let M be a set. When a facial expression has A facial expression features, a denotes the a-th facial expression feature; when the a-th facial expression feature is composed of U attributes, u denotes the u-th attribute; and when the u-th attribute is composed of B different intensity grades, b denotes the b-th grade. An element of M determined by a, u and b is then called a facial expression semantic, and M is the facial expression semantic description set.
Three intensity grades (small, medium and large) are formulated to describe the intensity of a facial expression semantic feature. A facial expression semantic is abbreviated as ma,b, where a is the a-th facial expression semantic feature, a ∈ [1,59], b is the intensity grade, b ∈ [1,3], and 1, 2 and 3 represent the small, medium and large intensity grades respectively,
After the extraction of the geometric features of the facial expression triangular regions on the facial expression gray image in the 4th step above is completed, the side lengths and angles of the facial expression triangular regions in the facial expression triangular region vector combination TR divided in each frame of the facial expression in the dynamic facial image sequence are used to construct the facial expression semantic feature description set YU, as shown in the following formula (10),
YU = {yu1, yu2, ..., yun, ..., yu59}, 1 ≤ n ≤ 59 (10),
In formula (10), YU is the facial expression semantic feature description set, yun is the semantic description, within the facial expression semantic feature description set, of the n-th side length or angle feature of the facial expression triangular regions in the facial expression triangular region vector combination TR (the specific statements are given in Table 1 below), and n indexes the facial expression semantic feature descriptions;
The facial expression semantic feature set YU contains the facial expression semantic features obtained by semantically describing, for the 10 facial expression triangular regions in the facial expression triangular region vector combination TR described in the third step above, the six distance and angle features of each triangular region from the 4th step above, 60 facial expression features in total; since the descriptions of two of these facial expression semantic features are identical, one facial expression semantic feature is omitted, i.e. n ∈ [1,59];
Table 1. Semantic feature description set YU
The 5.2nd step, formulating the semantic feature intensity grade decision rules:
All semantic feature values in the facial expression geometric feature vectors f of the training set of the 4.4th step above are sorted in ascending order to obtain the semantic feature range set PF, PF = {pf1, pf2, ..., pfv, ..., pf59}, 1 ≤ v ≤ 59, where pfv is the feature range corresponding to each semantic description yun; pfv is divided into semantic feature intensity grades according to the size of f/3, the semantic feature intensity grades are defined as in the 5.1st step, and the decision rules of the semantic feature intensity grades are shown in Table 2 below:
Table 2. Decision rules of semantic feature intensity grades
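Table 2 is not reproduced in the text; a plausible sketch of the decision rule, assuming each feature range pfv is split into three equal-width thirds (grade 1 = small, 2 = medium, 3 = large):

```python
import numpy as np

def intensity_grade(value, training_values):
    """Map one semantic feature value to intensity grade 1/2/3, assuming equal-width
    thirds of the training range pf_v (this split is an assumption, not Table 2 itself)."""
    lo, hi = float(np.min(training_values)), float(np.max(training_values))
    third = (hi - lo) / 3.0
    if value < lo + third:
        return 1      # small
    if value < lo + 2 * third:
        return 2      # medium
    return 3          # large
```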
The 5.3rd step, semantic analysis of all facial expression triangular regions in each frame of the facial expression gray image in the dynamic facial image sequence:
The facial expression semantic feature description combination of the i-th facial expression triangular region tri in the vector combination TR of facial expression triangular regions of each frame of the facial expression gray image in each dynamic facial image sequence is shown in Table 3 below:
Table 3. Semantic description combination of facial expression triangular region tri
∧ in Table 3 is the logical AND symbol from discrete mathematics;
Compute the facial expression semantic feature mean value TR_AVG and standard deviation TR_SD of the i-th facial expression triangular region tri in the vector combination TR of facial expression triangular regions of each frame of the facial expression gray image in each dynamic facial image sequence of the training set of the 4.4th step above,
The calculation of the mean value TR_AVG is shown in the following formula (11):
TR_AVG = (d0+i×6 + d1+i×6 + d2+i×6 + r3+i×6 + r4+i×6 + r5+i×6) / 6 (11),
The calculation of the standard deviation TR_SD is shown in the following formula (12):
TR_SD = √(((d0+i×6 - TR_AVG)² + (d1+i×6 - TR_AVG)² + (d2+i×6 - TR_AVG)² + (r3+i×6 - TR_AVG)² + (r4+i×6 - TR_AVG)² + (r5+i×6 - TR_AVG)²) / 6) (12),
In formulas (11) and (12), dj+i×6 is the j-th distance feature value of the i-th facial expression triangular region tri in the vector combination TR of facial expression triangular regions of each frame of the facial expression gray image in each dynamic facial image sequence, rm+i×6 is the m-th angle feature value of the same region, and i ∈ [1,10], j ∈ [0,2], m ∈ [3,5]. This completes the semantic analysis of all facial expression triangular regions in each frame of the facial expression gray image in the dynamic facial image sequence;
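A short sketch of the per-region mean and standard deviation of the 5.3rd step (the population form with divisor 6 is assumed for TR_SD):

```python
import numpy as np

def region_mean_std(features_i):
    """TR_AVG and TR_SD over the six distance/angle features [d1, d2, d3, r1, r2, r3]."""
    features_i = np.asarray(features_i, dtype=float)
    tr_avg = features_i.mean()          # formula (11)
    tr_sd = features_i.std()            # assumed population standard deviation (divisor 6)
    return tr_avg, tr_sd
```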
The 5.4th step, obtaining the optimal facial expression triangular region combinations of the six-class facial expression set:
First, the 10 facial expression triangular regions in the facial expression triangular region vector combination TR of each dynamic facial image sequence of the six-class facial expression set of the 4.4th step are numbered consecutively. Then, according to the standard deviation TR_SD obtained in the 5.3rd step above, the 10 facial expression triangular region numbers in TR are sorted in ascending order and the top 5 facial expression triangular region numbers are selected. Next, the counts of the numbers appearing in the top 5 within the six-class facial expression set of the 4.4th step above are tallied, and the 5 numbers with the largest counts are obtained. Finally, these 5 numbers are sorted in ascending order according to the standard deviation TR_SD, which yields the optimal facial expression triangular region combinations of the six-class facial expression set;
The optimal facial expression triangular region combinations of the six facial expression classes are shown in Table 4 below:
Table 4. Optimal facial expression triangular region combinations of the six facial expression classes
Expression class    Optimal facial expression triangular region combination
Surprise tr1∧tr5∧tr2∧tr3∧tr9
Fear tr9∧tr8∧tr1∧tr2∧tr6
Happy tr9∧tr1∧tr5∧tr2∧tr6
Sad tr2∧tr1∧tr6∧tr3∧tr4
Disgust tr8∧tr6∧tr2∧tr3∧tr4
Angry tr1∧tr2∧tr7∧tr6∧tr3
∧ in Table 4 is the logical AND symbol from discrete mathematics;
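The region-selection procedure of the 5.4th step might look like the following sketch; the per-sequence TR_SD vectors are assumed to be available, region indices are 0-based here, and the final ordering by the class-mean standard deviation is an assumption:

```python
from collections import Counter
import numpy as np

def optimal_regions(per_sequence_sd):
    """Pick the 5 region numbers that most often fall in the top 5 smallest-TR_SD rankings
    of one expression class, then sort them by standard deviation in ascending order.
    per_sequence_sd: list of length-10 TR_SD arrays, one per dynamic sequence."""
    votes = Counter()
    for sd in per_sequence_sd:
        top5 = np.argsort(sd)[:5]                 # 5 regions with the smallest standard deviation
        votes.update(int(r) for r in top5)
    best5 = [region for region, _ in votes.most_common(5)]
    mean_sd = np.mean(per_sequence_sd, axis=0)    # class-level standard deviation per region
    return sorted(best5, key=lambda region: mean_sd[region])
```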
The 5.5th step, extracting the final semantic features of the six-class facial expression set:
According to the semantic feature intensity grade decision rules formulated in the 5.2nd step above, the intensity grades of the semantic features of the facial expression triangular regions in the optimal facial expression triangular region combinations of the six-class facial expression set obtained in the 5.4th step are determined, the number of samples holding each semantic feature intensity grade within each of the six facial expression classes is counted, and the intensity grade held by the most samples is selected as the intensity grade corresponding to the semantic description yun. This operation is executed in a loop, and the counts of the intensity grades corresponding to all semantic descriptions yun in the six-class facial expression set are obtained; the statistics are shown in Tables 5 to 10 below:
Table 5. Statistics of semantic feature intensity grade counts within the Surprise expression triangular region combination
Table 6. Statistics of semantic feature intensity grade counts within the Fear expression triangular region combination
Table 7. Statistics of semantic feature intensity grade counts within the Sad expression triangular region combination
Table 8. Statistics of semantic feature intensity grade counts within the Disgust expression triangular region combination
Table 9. Statistics of semantic feature intensity grade counts within the Angry expression triangular region combination
Table 10. Statistics of semantic feature intensity grade counts within the Happy expression triangular region combination
According to the counts of the semantic feature intensity grades corresponding to all semantic descriptions yun in the six-class facial expression set in Tables 5 to 10 above, the intensity grade of each semantic feature within each expression triangular region combination is obtained by statistics, and the final semantic features of the six-class facial expression set are extracted accordingly;
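The grade voting of the 5.5th step reduces, for each semantic description yun within each expression class, to a majority vote; a minimal sketch (the grades are the 1/2/3 values from the 5.2nd step):

```python
from collections import Counter

def final_intensity_grade(grades_per_sequence):
    """Majority-vote intensity grade for one semantic description yu_n within one
    expression class; grades_per_sequence holds one grade (1/2/3) per sequence."""
    return Counter(grades_per_sequence).most_common(1)[0][0]
```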
So far the analysis and extraction of the semantic features on the facial expression gray image are all complete;
The final semantic features of the six-class facial expression set are shown in Tables 11 to 16;
Table 11. Surprise expression semantic feature combination
Triangular region    Semantic feature combination
tr1 m7,3∧m8,3∧m9,2∧m10,1∧m11,1∧m12,3
tr5 m31,3∧m32,3∧m3,1∧m33,1∧m34,3∧m35,3
tr2 m13,3∧m14,3∧m15,3∧m16,1∧m17,1∧m18,3
tr3 m19,3∧m20,3∧m21,3∧m22,1∧m23,3∧m24,3
tr9 m54,3∧m55,3∧m56,1∧m57,1∧m58,3∧m59,3
Table 12. Fear expression semantic feature combination
Triangular region    Semantic feature combination
tr9 m54,2∧m55,2∧m56,3∧m57,2∧m58,2∧m59,2
tr8 m48,3∧m49,2∧m50,3∧m51,3∧m52,2∧m53,2
tr1 m7,3∧m8,3∧m9,3∧m10,3∧m11,2∧m12,2
tr2 m13,3∧m14,3∧m15,2∧m16,3∧m17,2∧m18,2
tr6 m36,3∧m37,2∧m38,3∧m39,3∧m40,1∧m41,2
Table 13. Happy expression semantic feature combination
Triangular region    Semantic feature combination
tr9 m54,3∧m55,3∧m56,3∧m57,3∧m58,1∧m59,1
tr1 m7,2∧m8,2∧m9,3∧m10,3∧m11,2∧m12,1
tr5 m31,1∧m32,3∧m3,3∧m33,3∧m34,1∧m35,1
tr2 m13,2∧m14,3∧m15,3∧m16,3∧m17,2∧m18,1
tr6 m36,2∧m37,2∧m38,1∧m39,3∧m40,1∧m41,1
Table 14. Sad expression semantic feature combination
Triangular region    Semantic feature combination
tr2 m13,2∧m14,3∧m15,1∧m16,1∧m17,3∧m18,3
tr1 m7,3∧m8,3∧m9,1∧m10,1∧m11,3∧m12,3
tr6 m36,2∧m37,2∧m38,1∧m39,1∧m40,3∧m41,3
tr3 m19,2∧m20,3∧m21,1∧m22,2∧m23,2∧m24,3
tr9 m54,1∧m55,1∧m56,1∧m57,1∧m58,3∧m59,3
Table 15. Disgust expression semantic feature combination
Triangular region    Semantic feature combination
tr8 m48,1∧m49,1∧m50,2∧m51,2∧m52,2∧m53,3
tr6 m36,1∧m37,1∧m38,2∧m39,2∧m40,2∧m41,2
tr2 m13,1∧m14,1∧m15,1∧m16,2∧m17,3∧m18,1
tr3 m19,1∧m20,1∧m21,1∧m22,3∧m23,1∧m24,1
tr4 m25,1∧m26,1∧m27,1∧m28,3∧m29,3∧m30,1
Table 16. Angry expression semantic feature combination
Triangular region    Semantic feature combination
tr1 m7,1∧m8,1∧m9,1∧m10,2∧m11,3∧m12,1
tr2 m13,1∧m14,2∧m15,1∧m16,2∧m17,3∧m18,1
tr7 m42,1∧m43,1∧m44,1∧m45,3∧m46,1∧m47,1
tr6 m36,1∧m37,1∧m38,2∧m39,2∧m40,2∧m41,2
tr3 m19,1∧m20,1∧m21,1∧m22,3∧m23,1∧m24,1
∧ in Tables 11 to 16 above is the logical AND symbol from discrete mathematics;
The 6th step, SVM classifier training and obtaining the classification results:
The semantic feature data obtained from the analysis and extraction of the semantic features on the facial expression gray image in the 5th step above are input to the SVM classifier for training and prediction, judging which class of facial expression the dynamic facial image sequence input in the first step above belongs to; ten-fold cross-validation is used, and the average result of the experiments is taken as the final facial expression recognition rate. The specific operation flow is as follows:
(6.1) The semantic feature data obtained from the analysis and extraction of the semantic features on the facial expression gray image in the 5th step above are input for SVM classifier training: the semantic feature matrix of the training samples is constructed from the facial expression geometric feature vectors f of the training set of the 4.4th step above, the semantic feature matrix of the test samples is constructed from the facial expression geometric feature vectors te of the test set of the 4.4th step above, and the corresponding training class-label sample matrix is then constructed from the semantic feature matrix of the training samples; the values in this training class-label matrix are the facial expression classes;
(6.2) A linear kernel function is used, the iteration stopping criterion is 100 iterations, and the SVM classifier type is C_SVC. The semantic feature matrix of the training samples, the training class-label sample matrix and the parameters are first fed into the train function of the SVM classifier to obtain the classification model, and the semantic feature matrix of the test samples is then input into the predict function of the classification model for prediction; SVM classifier training is thus completed and the classification results are obtained, and experiments on the CK+ and MMI libraries then yield the classification results of the six facial expressions of surprise, fear, happiness, sadness, disgust and anger;
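A minimal sketch of the classifier stage, assuming OpenCV's ml module stands in for the SVM implementation used in the patent (C_SVC type, linear kernel, 100-iteration stopping criterion); the feature matrices are float32 rows and the labels are integer class ids:

```python
import cv2
import numpy as np

def train_and_predict(train_features, train_labels, test_features):
    """Train a linear C_SVC SVM and predict the expression class of each test sample."""
    svm = cv2.ml.SVM_create()
    svm.setType(cv2.ml.SVM_C_SVC)
    svm.setKernel(cv2.ml.SVM_LINEAR)
    svm.setTermCriteria((cv2.TERM_CRITERIA_MAX_ITER, 100, 1e-6))
    svm.train(np.float32(train_features), cv2.ml.ROW_SAMPLE, np.int32(train_labels))
    _, predictions = svm.predict(np.float32(test_features))
    return predictions.ravel().astype(int)
```

Ten-fold cross-validation would repeat this call over the ten train/test splits and average the per-split recognition rates.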
The recognition of the dynamic facial expression is thus completed.
Embodiment 2
This embodiment carries out experimental verification of the dynamic facial expression recognition method of the invention.
A. 262 dynamic facial image sequences are selected from the CK+ data set; each dynamic facial image sequence includes 2 images, namely a neutral frame and a peak frame, so 524 facial expression image frames in total are tested.
On the CK+ data set, the recognition rates obtained by the geometric feature extraction method TGF and the geometric-semantic feature extraction method SA-TGF of the present invention after ten-fold cross-validation experiments are compared with the recognition rates of Document 1, Document 2, Document 3 and Document 4 in the background art, as shown in Table 17:
Table 17. Comparison of recognition rates on the CK+ data set
B. 208 dynamic facial image sequences are selected from the MMI data set; each dynamic facial image sequence includes 2 images, namely a neutral frame and a peak frame, so 416 facial expression image frames in total are tested.
On the MMI data set, the recognition rates obtained by the geometric feature extraction method TGF and the geometric-semantic feature extraction method SA-TGF of the present invention after ten-fold cross-validation experiments are compared with the recognition rates of Document 1, Document 2 and Document 4 in the background art, as shown in Table 18:
Table 18. Comparison of recognition rates on the MMI data set
In the above embodiments, the gray normalization algorithm, the geometric normalization algorithm, the LibFace face detection and feature point annotation algorithm, and the SVM classifier used are all well known in the art.

Claims (1)

1. A dynamic facial expression recognition method, characterized in that it is a dynamic facial expression recognition method based on geometric features and semantic features, with the following specific steps:
The first step, the pretreatment of dynamic human face image sequence:
First every frame Facial Expression Image in the dynamic human face image sequence of input is carried out size to be normalized to size being M × N Pixel, it is then that every frame Facial Expression Image in the dynamic human face image sequence inputted is empty by RGB using following formula (1) Between be transformed into gray space, obtain every frame human face expression gray level image Igray_tn,
Igray_tn = 0.299IR + 0.587IG + 0.114IB (1),
In formula (1), IR、IG、IBIt is the red of every frame Facial Expression Image in inputted dynamic human face image sequence respectively Three channel components of color, green and blue retain every frame human face expression gray level image Igray_tn, for face in following second step The detection of expression frame is used with characteristic point mark;
Second step, the detection of human face expression frame and the characteristic point of human face expression gray level image mark:
Every frame human face expression gray scale that the above-mentioned first step is obtained using the Multiview_Reinforce interface in the library LibFace Image Igray_tnThe detection of human face expression frame is carried out, and characteristic point mark, this 68 characteristic points are carried out to 68 characteristic points therein The following formula of total coordinate vector (2) shown in,
X = ((x1,y1), (x2,y2), ..., (xk,yk), ..., (x68,y68))^T (2),
In formula (2), xk, ykAbscissa corresponding to k-th of characteristic point and vertical seat in respectively every frame human face expression gray level image Mark, k ∈ [1,68];
Third step, the calibration of human face expression delta-shaped region on human face expression gray level image:
The every frame human face expression gray level image I marked from above-mentioned second stepgray_tnIn 68 characteristic points in select eyebrow, eyes, 30 characteristic points on nose and mouth carry out the calibration of human face expression delta-shaped region on human face expression gray level image, form altogether 10 human face expression delta-shaped regions of calibration, the vector combination TR that human face expression delta-shaped region is consequently formed is following formula (3) shown in,
TR = {tr1, tr2, ..., tri, ..., tr10} (3),
In formula (3), tri={ X_SI, 1,X_SI, 2,X_SI, 3, i ∈ [1,10], triFor i-th of human face expression delta-shaped region, X_SI, 1、X_SI, 2、X_SI, 3The apex coordinate on the 1st, 2,3 vertex in respectively i-th of human face expression delta-shaped region,
The 10 human face expression delta-shaped regions demarcated include: left eye and lip, eyes and eyebrow, lip and eyebrow center Point, eyebrow and nose, nose and lip, eyebrow, eyes and nose, nose central point and the corners of the mouth, right eye eyeball and lip, pure mouth Lip, pure eyes;
4th step, the extraction of the geometrical characteristic of human face expression delta-shaped region on human face expression gray level image:
The human face expression triangle of every frame human face expression gray level image calibration in the dynamic human face image sequence in above-mentioned third step It is calculated on the vector combination TR in shape region, the specific steps are as follows:
4.1st step, in dynamic human face image sequence between the human face expression delta-shaped region vertex of every frame human face expression gray level image The extraction of distance feature:
By the human face expression delta of frame human face expression gray level image every in the dynamic human face image sequence in above-mentioned third step I-th of human face expression delta-shaped region tr in the vector combination TR in domainiIn three vertex transverse and longitudinal coordinate groups be combined into as X_SI, 1 (xI, 1, yI, 1)、X_SI, 2(xI, 2, yI, 2)、X_SI, 3(xI, 3, yI, 3), it is calculated respectively with following formula (4), formula (5), formula (6) In dynamic human face image sequence in above-mentioned third step the human face expression delta-shaped region of every frame human face expression gray level image to I-th of human face expression delta-shaped region tr in amount combination TRiIn the Euclidean distance between vertex two-by-two,
The Euclidean distance di,1 between vertex X_Si,1 and vertex X_Si,2 is calculated as shown in the following formula (4):
di,1 = √((xi,1 - xi,2)² + (yi,1 - yi,2)²) (4),
The Euclidean distance di,2 between vertex X_Si,1 and vertex X_Si,3 is calculated as shown in the following formula (5):
di,2 = √((xi,1 - xi,3)² + (yi,1 - yi,3)²) (5),
The Euclidean distance di,3 between vertex X_Si,2 and vertex X_Si,3 is calculated as shown in the following formula (6):
di,3 = √((xi,2 - xi,3)² + (yi,2 - yi,3)²) (6),
Formula (4), (5), in (6), xi,1, xi,2, xI, 3The the 1st, 2,3 top in respectively i-th of human face expression delta-shaped region The abscissa of point, yI, 1, yI, 2, yI, 3The ordinate on the 1st, 2,3 vertex in respectively i-th of human face expression delta-shaped region, I ∈ [1,10],
Thus it completes in dynamic human face image sequence between the human face expression delta-shaped region vertex of every frame human face expression gray level image The extraction of distance feature;
4.2nd step, the human face expression delta-shaped region vertex of every frame human face expression gray level image in dynamic human face image sequence The extraction of angle character:
Calculate the human face expression triangle of every frame human face expression gray level image in the dynamic human face image sequence in above-mentioned third step I-th of human face expression delta-shaped region tr in the vector combination TR in regioniIn three apex coordinates angle character, tri's Three vertex transverse and longitudinal coordinate groups are combined into as X_SI, 1(xI, 1, yI, 1)、X_SI, 2(xI, 2, yI, 2)、X_SI, 3(xI, 3, yI, 3), use X_Si,1、 X_Si,2、X_Si,3The coordinate on these three vertex calculates the angle character r on corresponding 1st vertexi,1, the 2nd vertex angle Spend feature rI, 2, the 3rd vertex angle character rI, 3Formula (7), formula (8), formula (9) as follows,
In formula (7), formula (8) and formula (9), xi,1、xi,2、xI, 3In respectively i-th of human face expression delta-shaped region 1, the abscissa on 2,3 vertex, yI, 1, yI, 2, yI, 3The the 1st, 2,3 vertex in respectively i-th of human face expression delta-shaped region Ordinate, i ∈ [1,10];
4.3rd step, the geometry of the human face expression delta-shaped region of every frame human face expression gray level image in dynamic human face image sequence The extraction of feature:
The geometrical characteristic is made of distance feature and angle character, the dynamic human face image sequence that above-mentioned 4.1st step is obtained I-th of human face expression in the vector combination TR of the human face expression delta-shaped region of the human face expression gray level image of neutral frame in column Delta-shaped region triDistance feature and the neutral frame in the obtained dynamic human face image sequence of above-mentioned 4.2nd step face table I-th of human face expression delta-shaped region tr in the vector combination TR of the human face expression delta-shaped region of feelings gray level imageiAngle Feature is worth i.e. for six totally: di,1, di,2, di,3, ri,1, ri,2, ri,3It is stored in vector hiIn, h herei={ di,1,di,2,di,3,ri,1, ri,2,ri,3, wherein [1,10] i ∈, by the face table of the peak value frame with above-mentioned neutral frame in same dynamic human face image sequence I-th of human face expression delta-shaped region tr in the vector combination TR of the human face expression delta-shaped region of feelings gray level imageiDistance Feature and angle character are worth i.e. for six totally: di,1, di,2, di,3, ri,1, ri,2, ri,3It is stored in vector wiIn, wi={ di,1,di,2,di,3, ri,1,ri,2,ri,3, wherein [1,10] i ∈, by vector wiWith vector hiCharacteristic Ratios be put into array z, z= {z(i-1)×6+j, z(i-1)×6+j=w(i-1),j/h(i-1),j, wherein h is the face table of human face expression gray level image in above-mentioned neutral frame The distance feature of all delta-shaped regions and the total value of angle character in the vector combination TR of feelings delta-shaped region, w is above-mentioned peak It is worth the distance of all delta-shaped regions in the vector combination TR of the human face expression delta-shaped region of human face expression gray level image in frame The total value of feature and angle character, z are the human face expression delta-shaped region of human face expression gray level image in peak value frame and neutral frame Vector combination TR in the distance feature of all delta-shaped regions and the ratio of angle character value, i ∈ [1,10], j ∈ [0,5], So far it is fully completed the human face expression delta-shaped region combination TR of the every frame human face expression gray level image of this dynamic human face image sequence Extraction of Geometrical Features;
The extraction of the geometrical characteristic of all dynamic human face image sequences in 4.4th step, training set and test set:
Circulation executes the operation of above-mentioned the 4.1st step to the 4.3rd step, i.e. by six class human face expressions in training set: it is surprised, fear, The array z storage that glad, sad, detest and angry corresponding each dynamic human face image sequence obtain is used to instruct to training set Practice the human face expression geometrical characteristic vector f of SVM classifier, the six different human face expression geometry obtained by six class human face expressions Feature vector f forms six class human face expression collection, then i.e. by six class human face expressions in test set: surprised, fear, is glad, wound The array z storage that the heart, detest and angry corresponding each dynamic human face image sequence obtain is used to test SVM points to test set The human face expression geometrical characteristic vector te of class model, the six different human face expression geometrical characteristics obtained by six class human face expressions Vector te, also forms six class human face expression collection, and above-mentioned two place refers to that six class human face expression collection are the six class faces for being combined into one Expression collection so far completes training set until all dynamic human face image sequences circulation in training set and test set executes completion With the Extraction of Geometrical Features of dynamic human face image sequences all in test set;
5th step, the analysis and extraction of the semantic feature on human face expression gray level image:
The human face expression geometry of human face expression geometrical characteristic vector f and test set to training set obtained in above-mentioned 4th step is special It levies vector te and carries out semantic analysis, to realize the analysis and extraction of the semantic feature on human face expression gray level image, specific steps It is as follows:
5.1st step constructs human face expression semantic characteristics description set:
Facial expression semantics is defined as a natural language description of the facial expression features, including the explanation of dominant and recessive attributes such as the geometric shape of the facial expression features, the relative positions of the different organs, and the emotion.
Let M be a set. When a facial expression has A facial expression features, a denotes the a-th facial expression feature; when the a-th facial expression feature is composed of U attributes, u denotes the u-th attribute; and when the u-th attribute is composed of B different intensity grades, b denotes the b-th grade. An element of M determined by a, u and b is then called a facial expression semantic, and M is the facial expression semantic description set.
Three intensity grades (small, medium and large) are formulated to describe the intensity of a facial expression semantic feature. A facial expression semantic is abbreviated as ma,b, where a is the a-th facial expression semantic feature, a ∈ [1,59], b is the intensity grade, b ∈ [1,3], and 1, 2 and 3 represent the small, medium and large intensity grades respectively,
On above-mentioned 4th step human face expression gray level image after the completion of the extraction of the geometrical characteristic of human face expression delta-shaped region, utilize Human face expression in the human face expression delta-shaped region vector combination TR divided in every frame human face expression in dynamic human face image sequence The side length and angle of delta-shaped region construct human face expression semantic characteristics description set YU, shown in following formula (10),
YU = {yu1, yu2, ..., yun, ..., yu59}, 1 ≤ n ≤ 59 (10),
In formula (10), YU is the conjunction of face expression semanteme characteristic descriptor set, yunIt is right in the conjunction of face expression semanteme characteristic descriptor set N-th of side length of human face expression delta-shaped region and the semanteme of angle character are retouched in human face expression delta-shaped region vector combination TR It states, n is the quantity of face expression semanteme feature description;
It include the combination of human face expression delta-shaped region vector described in above-mentioned third step in human face expression semantic feature set YU It is special according to six distances of each delta-shaped region of above-mentioned 4th step and angle in 10 human face expression delta-shaped regions in TR Sign carries out the human face expression semantic feature of 60 face expressive features that semantic description obtains, due to wherein two human face expression languages The description of adopted feature is identical, therefore omits 1 face expression semanteme feature;
5.2nd step formulates semantic feature strength grade decision rule:
All semantic feature values in the facial expression geometric feature vectors f of the training set of the 4.4th step above are sorted in ascending order to obtain the semantic feature range set PF, PF = {pf1, pf2, ..., pfv, ..., pf59}, 1 ≤ v ≤ 59, where pfv is the feature range corresponding to each semantic description yun; pfv is divided into semantic feature intensity grades according to the size of f/3, and the semantic feature intensity grades are defined as in the 5.1st step;
5.3rd step, to all human face expression delta-shaped regions in frame human face expression gray level image every in dynamic human face image sequence Carry out semantic analysis:
Calculate every frame human face expression gray level image in each dynamic human face image sequence in the training set in above-mentioned 4.4th step I-th of human face expression delta-shaped region tr in the vector combination TR of human face expression delta-shaped regioniHuman face expression semantic feature is equal Value TR_AVG and standard deviation TR_SD,
The calculation of the mean value TR_AVG is shown in the following formula (11):
TR_AVG = (d0+i×6 + d1+i×6 + d2+i×6 + r3+i×6 + r4+i×6 + r5+i×6) / 6 (11),
The calculation of the standard deviation TR_SD is shown in the following formula (12):
TR_SD = √(((d0+i×6 - TR_AVG)² + (d1+i×6 - TR_AVG)² + (d2+i×6 - TR_AVG)² + (r3+i×6 - TR_AVG)² + (r4+i×6 - TR_AVG)² + (r5+i×6 - TR_AVG)²) / 6) (12),
D in formula (11), (12)j+i×6For the human face expression of every frame human face expression gray level image in each dynamic human face image sequence I-th of human face expression delta-shaped region tr in the vector combination TR of delta-shaped regioniJ-th interior of distance feature value, rm+i×6For In each dynamic human face image sequence in the vector combination TR of the human face expression delta-shaped region of every frame human face expression gray level image I-th of human face expression delta-shaped region triM-th interior of angle character value, wherein setting i ∈ [1,10], j ∈ [0,2], m ∈ [3,5] thus complete in dynamic human face image sequence all human face expression delta-shaped regions in every frame human face expression gray level image Semantic analysis;
5.4th step obtains the optimal human face expression delta-shaped region combination of six class human face expression collection:
Human face expression delta-shaped region in each dynamic human face image sequence is concentrated to six class human face expressions in the 4.4th step first Vector combines 10 human face expression delta-shaped region number consecutivelies in TR, the standard deviation TR_SD then obtained according to above-mentioned 5.3 step Ascending sort is carried out to 10 human face expression delta Field Numbers in the TR, selects 5 before ranking human face expression triangles Then zone number counts before the six class human face expressions concentration ranking in above-mentioned 4.4th step 5 number quantity, obtains This 5 numbers are finally carried out ascending sort according to standard deviation TR_SD, obtain six classes by 5 most numbers of shared number quantity The optimal human face expression delta-shaped region of human face expression collection combines;
5.5th step extracts the final semantic feature of six class human face expression collection:
According to the above-mentioned prepared semantic feature strength grade decision rule of 5.2nd step, to six class people of the acquisition in the 5.4th step The semantic feature strength grade of human face expression delta-shaped region in the optimal human face expression delta-shaped region combination of face expression collection Determined, and count each semantic feature strength grade quantity shared by six class human face expression collection, selects shared quantity most Strength grade, as corresponding semantic description yunCorresponding strength grade, final circulation executes aforesaid operations, to six class faces All semantic description yu in expression collectionnThe shared quantity of corresponding strength grade is counted, and then obtains six class human face expressions All semantic description yu in collectingnCorresponding strength grade extracts the final semantic feature of six class human face expression collection with this;
So far the analysis of the semantic feature on human face expression gray level image all terminates with extraction;
6th step, SVM classifier training simultaneously obtain classification results:
By the analysis of the semantic feature on above-mentioned 5th step human face expression gray level image and extract obtained semantic feature data Input SVM classifier is trained and predicts, judges which class the dynamic human face image sequence inputted in the above-mentioned first step belongs to Human face expression takes the average result of experiment as final facial expression recognition rate, concrete operations stream using ten times of cross-validation methods Journey is as follows:
(6.1) by the analysis of the semantic feature on above-mentioned 5th step human face expression gray level image and the obtained semantic feature of extraction Data input SVM classifier training, construct instruction according to the human face expression geometrical characteristic vector f of the training set of above-mentioned 4.4th step The semantic feature matrix for practicing sample, constructs further according to the human face expression geometrical characteristic vector te of the test set of above-mentioned 4.4th step The semantic feature matrix of test sample, then its corresponding trained classification sample according to the semantic feature matrix construction of training sample Matrix, the value in the training sample classification matrix are human face expression classification;
(6.2) A linear kernel function is used, the iteration stopping criterion is 100 iterations, and the SVM classifier type is C_SVC; the semantic feature matrix of the training samples, the training class-label sample matrix and the parameters are first fed into the train function of the SVM classifier to obtain the classification model, and the semantic feature matrix of the test samples is then input into the predict function of the classification model for prediction, thereby completing SVM classifier training and obtaining the classification results, and experiments on the CK+ and MMI libraries then yield the classification results of the six facial expressions of surprise, fear, happiness, sadness, disgust and anger;
Thus the identification of dynamic human face expression is completed.
CN201910109704.5A 2019-02-11 2019-02-11 Dynamic facial expression recognition method Expired - Fee Related CN109753950B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910109704.5A CN109753950B (en) 2019-02-11 2019-02-11 Dynamic facial expression recognition method

Publications (2)

Publication Number Publication Date
CN109753950A true CN109753950A (en) 2019-05-14
CN109753950B CN109753950B (en) 2020-08-04

Family

ID=66407447

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910109704.5A Expired - Fee Related CN109753950B (en) 2019-02-11 2019-02-11 Dynamic facial expression recognition method

Country Status (1)

Country Link
CN (1) CN109753950B (en)

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101059836A (en) * 2007-06-01 2007-10-24 华南理工大学 Human eye positioning and human eye state recognition method
KR101510798B1 (en) * 2008-12-10 2015-04-10 광주과학기술원 Portable Facial Expression Training System and Methods thereof
CN103854306A (en) * 2012-12-07 2014-06-11 山东财经大学 High-reality dynamic expression modeling method
CN106228145A (en) * 2016-08-04 2016-12-14 网易有道信息技术(北京)有限公司 A kind of facial expression recognizing method and equipment
CN107358169A (en) * 2017-06-21 2017-11-17 厦门中控智慧信息技术有限公司 A kind of facial expression recognizing method and expression recognition device
CN107704810A (en) * 2017-09-14 2018-02-16 南京理工大学 A kind of expression recognition method suitable for medical treatment and nursing
CN108108677A (en) * 2017-12-12 2018-06-01 重庆邮电大学 One kind is based on improved CNN facial expression recognizing methods
CN109034099A (en) * 2018-08-14 2018-12-18 华中师范大学 A kind of expression recognition method and device

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CHUNNA TIAN et al.: "Multiview face recognition: from tensorface to v-tensorface and k-tensorface", IEEE *
MIRRIRYUCHEN: "libfacedetection (1): the simplest and most direct configuration method", CSDN *
YU Ming et al.: "Facial expression recognition based on LGBP features and sparse representation", Computer Engineering and Design *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021027553A1 (en) * 2019-08-15 2021-02-18 深圳壹账通智能科技有限公司 Micro-expression classification model generation method, image recognition method, apparatus, devices, and mediums
CN110647856A (en) * 2019-09-29 2020-01-03 大连民族大学 Method for recognizing facial expressions based on theory of axiomatic fuzzy set
CN110647856B (en) * 2019-09-29 2023-04-18 大连民族大学 Method for recognizing facial expressions based on theory of axiomatic fuzzy set
CN111652073B (en) * 2020-05-08 2023-02-28 腾讯科技(深圳)有限公司 Video classification method, device, system, server and storage medium
CN111652073A (en) * 2020-05-08 2020-09-11 腾讯科技(深圳)有限公司 Video classification method, device, system, server and storage medium
CN112287524A (en) * 2020-10-13 2021-01-29 泉州津大智能研究院有限公司 Emotion classification method and device based on sparse Gaussian conditional random field
CN112733616A (en) * 2020-12-22 2021-04-30 北京达佳互联信息技术有限公司 Dynamic image generation method and device, electronic equipment and storage medium
CN112733616B (en) * 2020-12-22 2022-04-01 北京达佳互联信息技术有限公司 Dynamic image generation method and device, electronic equipment and storage medium
CN112613416A (en) * 2020-12-26 2021-04-06 中国农业银行股份有限公司 Facial expression recognition method and related device
CN112766145A (en) * 2021-01-15 2021-05-07 深圳信息职业技术学院 Method and device for identifying dynamic facial expressions of artificial neural network
CN112766145B (en) * 2021-01-15 2021-11-26 深圳信息职业技术学院 Method and device for identifying dynamic facial expressions of artificial neural network
CN114578968B (en) * 2022-03-09 2022-09-23 润芯微科技(江苏)有限公司 Switching method for 3D/2D display state of instrument
CN114578968A (en) * 2022-03-09 2022-06-03 润芯微科技(江苏)有限公司 Switching method for 3D/2D display state of instrument
CN115665399A (en) * 2022-10-21 2023-01-31 人民百业科技有限公司 Liquid crystal grating-based 3D display switching method
CN115665399B (en) * 2022-10-21 2024-02-06 人民百业科技有限公司 3D display switching method based on liquid crystal grating

Also Published As

Publication number Publication date
CN109753950B (en) 2020-08-04

Similar Documents

Publication Publication Date Title
CN109753950A (en) Dynamic human face expression recognition method
CN106960202B (en) Smiling face identification method based on visible light and infrared image fusion
CN110532900B (en) Facial expression recognition method based on U-Net and LS-CNN
CN106599854B (en) Automatic facial expression recognition method based on multi-feature fusion
Jung et al. Deep temporal appearance-geometry network for facial expression recognition
CN105139004B (en) Facial expression recognizing method based on video sequence
Torralba Contextual priming for object detection
CN108268859A (en) A kind of facial expression recognizing method based on deep learning
CN104281853B (en) A kind of Activity recognition method based on 3D convolutional neural networks
Zhu et al. Learning a hierarchical deformable template for rapid deformable object parsing
CN105825183B (en) Facial expression recognizing method based on partial occlusion image
CN108280397A (en) Human body image hair detection method based on depth convolutional neural networks
CN108830237B (en) Facial expression recognition method
CN106778852A (en) A kind of picture material recognition methods for correcting erroneous judgement
Neche et al. Arabic handwritten documents segmentation into text-lines and words using deep learning
CN105825168A (en) Golden snub-nosed monkey face detection and tracking algorithm based on S-TLD
CN108710916A (en) The method and device of picture classification
CN109033978A (en) A kind of CNN-SVM mixed model gesture identification method based on error correction strategies
CN107818299A (en) Face recognition algorithms based on fusion HOG features and depth belief network
CN110837777A (en) Partial occlusion facial expression recognition method based on improved VGG-Net
Ke et al. Weakly supervised fine-grained image classification via two-level attention activation model
US11521427B1 (en) Ear detection method with deep learning pairwise model based on contextual information
CN112598056A (en) Software identification method based on screen monitoring
CN113096079A (en) Image analysis system and construction method thereof
CN111444860A (en) Expression recognition method and system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200804