CN113191135A

CN113191135A - Multi-category emotion extraction method fusing facial characters

Info

Publication number: CN113191135A
Application number: CN202110412378.2A
Authority: CN
Inventors: 骆曦; 刘晓晓
Original assignee: Beijing Union University
Current assignee: Beijing Union University
Priority date: 2021-01-26
Filing date: 2021-04-16
Publication date: 2021-07-30

Abstract

The invention provides a multi-class emotion extraction method fused with characters, which comprises the following steps of preprocessing a text set: putting the preprocessed text set into a Skip-Gram model in Word2Vec for training, and embedding the context relationship of words into a low-dimensional space to obtain Word vectors corresponding to all the words; constructing a facial character emotion dictionary; calculating the emotion probability of the characters in the document; calculating the text emotion probability; and calculating the comprehensive emotion probability of the document. The method extracts various emotion probabilities of the face characters by calculating the similarity and generates a face character emotion dictionary, integrates face character emotion information on the basis of the text by calculating the document face character emotion probability, helps to improve comprehensiveness and accuracy of emotion extraction of a user, further improves decision-making accuracy, provides reliable basis for emotion extraction by utilizing high efficiency and strong feature learning capacity of a neural network and a recurrent neural network, and reduces dependence on manual construction of emotion dictionaries and rules.

Description

Multi-category emotion extraction method fusing facial characters

Technical Field

The invention relates to the technical field of natural language processing technology and emotion analysis, in particular to a multi-class emotion extraction method fusing facial characters.

Background

With the development of information technology and network technology, social media become a main platform for modern people to communicate with each other and transfer information, such as forums, microblogs, online comments and the like, and a large amount of information rich in subjective emotion emerges every day. Through analyzing the information published by the user, the emotional information implicit in the information can be identified, the evolution rule of the user emotion can be found, and valuable information prediction is carried out, so that the method has important value in internet information mining. The emotion analysis is to analyze information such as the viewpoint, emotion, evaluation, attitude, emotion and the like of people by using methods such as natural language processing, text analysis, computational linguistics and the like, and mainly aims to predict valuable information based on a mining result and display the prediction result in a more intuitive mode. In recent years, emotion analysis technology has wide application in marketing, public opinion monitoring, policy analysis and public relationship management, and has high economic and social values.

The existing emotion analysis technology has two main means:

(1) the method based on the emotion dictionary comprises the following steps: the emotion words play an important role in expressing the text emotional tendency, and the method based on the dictionary mainly uses the related information of the emotion words to judge the emotional tendency. The method comprises the steps of making an emotion dictionary, utilizing rules such as sentence grammar and word occurrence positions, splitting a sentence, analyzing and matching the dictionary for a text, weighting emotion words, and finally using emotion values as the basis for judging the emotion tendency of the text. The emotion dictionary has high accuracy, but the recall rate is low; the construction and the perfection of the rules and the dictionaries need a large amount of manpower, the quality of the rules and the dictionaries determines the emotion analysis quality, and for different fields, the difficulty of constructing the emotion dictionaries is different, and the accurate construction cost is higher; furthermore, this approach does not take into account the effect of word context on emotion changes.

(2) Method based on machine learning: the method is used as a supervised classification problem, a model is trained by using a labeled text, and then the trained model is used for predicting the emotion polarity of the unlabeled text, so that the method is mature at present. The Convolutional Neural Network (CNN) performs convolution calculation by using a plurality of convolution kernels, can well extract local features of texts from different angles, but cannot solve the context dependence of long texts. Long-short term memory network (LSTM) is a kind of recurrent neural network, and uses a three-gate design method, which can capture a user's changing emotion by using the ability of a text sequence, but has a weak ability to recognize local features.

The character-based emoticon is a character-based emoticon, and the combination sequence of the emoticon is arranged by utilizing the display appearance of specific characters in a computer character code table to form a pattern for describing the emoticon of a character. In social media, more and more people frequently use the characters to express and express rich internal emotions, the imagination space of network communication is enriched, the social media is deeply loved by young users, and the social media are developed into network culture symbols affecting the world at present. The use of the color characters can bring about the changes of semantics and context emotions, so that the traditional emotion analysis based on the text alone cannot meet the requirements, more and more accurate information needs to be provided for the emotion decision of the user by combining the color characters, and the decision accuracy is further improved.

The invention patent application with the application number of 201910976409.X discloses a multi-class emotion classification method based on model fusion. The method has the disadvantages that a larger data set is needed for adjustment and pre-training, the influence of the characters in the text is not considered, the capability of capturing sentence sequence information is poor, and more complex semantic features cannot be obtained.

Disclosure of Invention

In order to solve the technical problems, the method for extracting the multi-class emotion fusing the color characters, provided by the invention, extracts the multi-emotion probability of the color characters by calculating the similarity and generates a color character emotion dictionary, and integrates the color character emotion information on the basis of the text by calculating the color character emotion probability of the document so as to help improve the comprehensiveness and accuracy of emotion extraction of a user and further improve the accuracy of decision making, and meanwhile, the method provides a reliable basis for emotion extraction by utilizing the high efficiency and strong feature learning capability of a neural network and a cyclic neural network, and reduces the dependence on manually constructed emotion dictionaries and rules.

The invention aims to provide a multi-class emotion extraction method fusing characters and words, which comprises the following steps of preprocessing a text set:

step 1: putting the preprocessed text set into a Skip-Gram model in Word2Vec for training, and embedding the context relationship of words into a low-dimensional space to obtain Word vectors corresponding to all the words;

step 2: constructing a facial character emotion dictionary;

and step 3: calculating the emotion probability of the characters in the document;

and 4, step 4: calculating the text emotion probability;

and 5: and calculating the comprehensive emotion probability of the document.

Preferably, the pre-treatment step comprises the sub-steps of:

step 01: extracting the face characters from the text set by using a regular expression to generate a face character dictionary;

and step 02, adding the facial character dictionary into a user-defined dictionary such as a Chinese word segmentation tool, performing word segmentation processing on all texts in the text set, and stopping words.

In any of the above schemes, preferably, the step 2 includes the following sub-steps:

step 21: dividing the emotion into four groups of opposite emotions according to the Plutchik emotion wheel, and respectively acquiring eight emotion words and word vectors corresponding to each word in a word dictionary from the trained Skip-Gram model;

step 22: respectively calculating the similarity between each color word vector and eight emotion word vectors, namely the cosine distance s₁,s₂,...,s₈The cosine distance between two word vectors X and Y is calculated as follows:

wherein X is (X)₁,x₂,x₃,…x_D),Y＝(y₁,y₂,y₃,…y_D) All the word vectors comprise D-dimension characteristics, X is the word vector representation of a word X, Y is the word vector representation of a word Y, D is the dimension of the word vector, and i is the ith component of the word vector;

step 23: the cosine distance sim₁,sim₂,...,sim₈Performing a normalization process, P (w)_iThe classification probability of the ith emotion corresponding to the character w can be calculated by the following formula:

wherein, sim_iRepresents the cosine distance between the character and the ith emotional word, and finally, P (w)₁+P(w)₂+…+P(w)₈＝1；

Step 24: calculating the emotion probability of all the face characters and generating a face character emotion dictionary.

Preferably, in any of the above schemes, the emotion includes eight: happy music

Sadness and weakness,Xi Huan

Aversion and surprise

Anger and anger

Terrorism, where happiness, trust, expectation are positive emotions, sadness, disgust, anger, terrorism are negative emotions, and surprise is neutral emotions.

In any of the above schemes, preferably, the step 3 includes collecting all the words { w } for a document₁,w₂,...w_mAnd (4) by inquiring a facial character emotion dictionary, averaging all kinds of emotion probabilities to obtain a document facial character emotion probability value:

wherein S is_iThe ith emotion value of the document text is m is the number of the documents containing the text, and j represents the jth text in the document.

In any of the above schemes, preferably, the step 4 includes the following sub-steps:

step 41: carrying out word vector representation;

step 42: inputting a bidirectional LSTM network;

step 43: inputting a text convolution neural network;

step 44: using maximum pooling to perform down-sampling processing to obtain sequence characteristic z ═ z { (z)₁,z₂,…,z_q}；

Step 45: and inputting the softmax layer.

In any of the above schemes, preferably, the step 41 includes representing the text by using a word vector output by Skip-Gram, and obtaining a word vector sequence t ═ t of the text₁,t₂,…,t_n]Wherein t is_iTo representThe ith word in the text, n is the maximum number of words that can be input.

In any of the above schemes, preferably, the step 42 includes setting the word vector sequence t ═ t₁,t₂,…,t_n]Respectively inputting the forward and reverse long and short term memory network, fully fusing the context information of the text to obtain the forward characteristic sequence t_f＝[t_f1,t_f2,…,t_fn]And reverse signature sequence t_b＝[t_b1,t_b2,…,t_bn]Will t_fAnd t_bSplicing two word vectors of the same word to obtain a spliced word vector sequence t_fb＝[t_fb1,t_fb2,…,t_fbn]Wherein t is_fbi＝[t_fi；t_bi]Dimension 2D.

In any of the above solutions, preferably, the step 43 includes using a text convolution model to pair the matrix t_fbPerforming convolution operation, wherein a convolution kernel w belongs to R [ h ] 2D]With a convolution kernel height h and a convolution kernel w at the matrix t_fbOne-dimensional convolution with step size 1, sequence feature c_iIs formed by a convolution kernel w and a matrix region x_i:i+h-1Performing convolution operation to obtain:

c_i＝f(w·x_i:i+h-1+b)

where f is a non-linear activation function such as the tanh, and b ∈ R is the bias term. Convolution kernel w to t_fbEach region { x_1:h,x_2:h+1,x_3:h+2,…,x_n-h+1:nThe convolution operation will obtain a sequence feature C ═ C with 1 columns₁,c₂,…,c_n-h+1]. The convolution kernel height h may be set to 1,2,3, …, q (where q is the number of convolution kernels), and q sequence features may be obtained.

In any of the above embodiments, preferably, the step 45 includes inputting z, and outputting a vector P ═ P of T × 1₁,P₂,…,P_i,…,P_T}＝softmax[w﹒(z。r)+b]Where w is the weight matrix, r is used to introduce Dropout operations, b is the bias vector, T is the number of classes, P_iAnd indicating the probability value of the current text belonging to the ith emotion category.

In any of the above schemes, preferably, the document integrated emotion probability G ═ G₁,G₂,…,G_i,…,G_TIn which G is_iThe probability value of the document belonging to the ith emotion category is calculated by the color word emotion probability value and the text emotion probability value:

G_i＝αS_i+(1-α)P_i，

wherein alpha is a weight coefficient of emotion probability of the face characters, and the value range is 0< alpha < 1.

In any of the above schemes, preferably, the step 5 includes using the class with the highest probability value as the final emotion classification of the document.

The invention provides a multi-category emotion extraction method fusing facial characters, which can effectively mine and extract a large amount of information published by users in social media, and carry out valuable information prediction and decision based on results, can be applied to multiple fields of politics, economy, services, medical treatment and the like, and has higher economic and social values.

Word2Vec is a tool developed by Google corporation for training Word vectors, and includes both CBOW (Continuous Bag-of-Words Model) and Skip-gram (Continuous Skip-gram Model) training models.

The Skip-Gram model is a training of word vectors based on the context of the target word.

Drawings

FIG. 1 is a flow chart of a preferred embodiment of a multi-category emotion extraction method with text and color fusion according to the invention.

FIG. 2 is a flowchart of an embodiment of a method for constructing a facial character emotion dictionary in accordance with the method for extracting multi-class emotion fusing facial characters of the present invention.

FIG. 3 is a flowchart of an embodiment of a text emotion probability calculation method of a multi-class emotion extraction method with color and text fusion according to the invention.

FIG. 4 is a flow chart of another preferred embodiment of the multi-category emotion extraction method with color text fusion according to the invention.

FIG. 5 is a flowchart of an embodiment of text emotion probability calculation according to the multi-class emotion extraction method with color word fusion of the present invention.

Detailed Description

The invention is further illustrated with reference to the figures and the specific examples.

Example one

As shown in fig. 1, step 100 is performed to preprocess the text set. In the step, step 101 is executed, wherein a regular expression is used for extracting the facial characters from the text set to generate a facial character dictionary; and step 102, adding the facial-character dictionary into a user-defined dictionary such as a Chinese word segmentation tool, performing word segmentation processing on all texts in the text set, and stopping words.

And step 110 is executed, the preprocessed text set is put into a Skip-Gram model in Word2Vec for training, the context relation of the words is embedded into a low-dimensional space, and Word vectors corresponding to all the words are obtained.

Step 120 is executed to construct a facial-word emotion dictionary. As shown in fig. 2, step 121 is executed, the emotions are divided into four sets of opposite emotions according to the Plutchik emotion wheel, and eight emotion words and a word vector corresponding to each word in the word dictionary are respectively obtained from the Skip-Gram model after training;

step 122 is executed to calculate the similarity between each color word vector and the eight emotion word vectors, i.e. the cosine distance s₁,s₂,...,s₈The cosine distance between two word vectors X and Y is calculated as follows:

wherein X is (X)₁,x₂,x₃,…x_D),Y＝(y₁,y₂,y₃,…y_D) All contain D-dimensional features, X is the word vector representation of the word X, Y is the word vector representation of the word Y, D represents the dimension of the word vector, and i represents the ith component of the word vector.

Step 123 is executed to determine the cosine distance sim₁,sim₂,...,sim₈Performing a normalization process, P (w)_iThe classification probability of the ith emotion corresponding to the character w can be calculated by the following formula:

wherein, sim_iRepresents the cosine distance between the character and the ith emotional word, and finally, P (w)₁+P(w)₂+…+P(w)₈＝1。

Step 124 is executed to calculate the emotion probabilities of all the color words and generate a color word emotion dictionary. The emotion includes eight types: happy music

Sadness and liking

Aversion and surprise

Anger and anger

Step 130 is executed to calculate the emotion probability of the characters in the document. All words and phrases set for a document w₁,w₂,...w_mAnd (4) by inquiring a facial character emotion dictionary, averaging all kinds of emotion probabilities to obtain a document facial character emotion probability value:

wherein S is_iThe ith emotion value of the document text is mThe document contains the number of the text words, and j represents the jth text word in the document.

Step 140 is executed to calculate the text emotion probability. As shown in fig. 3, step 141 is executed to perform word vector representation, represent the text using the word vector output by Skip-Gram, and obtain a word vector sequence t of the text as [ t ═ t₁,t₂,…,t_n]Wherein t is_iRepresents the ith word in the text, and n is the maximum number of inputtable words.

Step 142 is executed, inputting the bidirectional LSTM network, and setting the word vector sequence t ═ t₁,t₂,…,t_n]Respectively inputting the forward and reverse long and short term memory network, fully fusing the context information of the text to obtain the forward characteristic sequence t_f＝[t_f1,t_f2,…,t_fn]And reverse signature sequence t_b＝[t_b1,t_b2,…,t_bn]Will t_fAnd t_bSplicing two word vectors of the same word to obtain a spliced word vector sequence t_fb＝[t_fb1,t_fb2,…,t_fbn]Wherein t is_fbi＝[t_fi；t_bi]Dimension 2D.

Step 143 is executed, the text convolution neural network is input, and the matrix t is aligned by the text convolution model_fbPerforming convolution operation, wherein a convolution kernel w belongs to R [ h ] 2D]With a convolution kernel height h and a convolution kernel w at the matrix t_fbOne-dimensional convolution with step size 1, sequence feature c_iIs formed by a convolution kernel w and a matrix region x_i:i+h-1Performing convolution operation to obtain:

c_i＝f(w·x_i:i+h-1+b)

Execution step144, downsampling using maximum pooling to obtain sequence feature z ═ z { (z)₁,z₂,…,z_q}。

Step 145 is executed to input softmax layer, input z, and output T × 1 vector P ═ { P }₁,P₂,…,P_i,…,P_T}＝softmax[w﹒(z。r)+b]Where w is the weight matrix, r is used to introduce Dropout operations, b is the bias vector, T is the number of classes, P_iAnd indicating the probability value of the current text belonging to the ith emotion category.

And step 150, calculating the comprehensive emotion probability of the document, and taking the class with the maximum probability value as the final emotion classification of the document. Document integrated emotion probability G ═ G₁,G₂,…,G_i,…,G_TIn which G is_iThe probability value of the document belonging to the ith emotion category is calculated by the color word emotion probability value and the text emotion probability value:

G_i＝αS_i+(1-α)P_i，

Example two

The main drawbacks of the prior art solutions are:

(1) and (3) emotion dictionary: the construction and the perfection of the rules and the dictionaries need a large amount of manpower, the difficulty of constructing the emotion dictionaries is different in different fields, the cost of accurate construction is high, and the influence of word context on emotion change is not considered.

(2) Machine learning: the convolutional neural network can better extract local features of the text from different angles, but cannot solve the context dependence of the long text. The long-term and short-term memory network can effectively integrate the adjacent position information, solves the problems of gradient disappearance, gradient explosion and the like caused by long-term dependence, and has poor capability of identifying local characteristics.

(3) No consideration of the text: currently, emotion analysis is mostly based on simple text information, but as the color characters are developed, more and more users frequently use the color characters to express and express the internal emotion. Therefore, the traditional emotion analysis based on the text cannot meet the requirement, and the characters can provide more and more accurate information for the emotion decision of the user, so that the decision accuracy is improved.

Aiming at the problems, the invention provides a multi-class emotion extraction method fusing facial characters. Compared with simple text data, the characters are more beneficial to emotion expression, so that the cooperative text use can help to improve comprehensiveness and accuracy of emotion extraction. Firstly, generating a facial character emotion dictionary, extracting facial characters in a corpus set through a regular expression, representing the facial characters by using word vectors, calculating the similarity of each facial character and various emotion vocabularies, and performing normalization processing to obtain various emotion probability values of the facial characters. When extracting document emotion information, the method is divided into two parts of processing of a face character and a text: the facial character part calculates the facial character emotion probability of the document by inquiring a facial character emotion dictionary; the text part converts words into low-dimensional vectors blended into context information based on a Skip-gram word vector model, further extracts context characteristic information through a bidirectional long-short term memory network, then performs convolution operation on a text matrix by using convolution kernels with different heights to extract text local characteristics, and calculates text emotion probability through pooling, full connection and a Softmax function; and finally, weighting and calculating the emotion information of the document by the results of the characters and the text.

As shown in fig. 2, the detailed steps are described as follows:

step 1 pretreatment

Extracting the face characters from the text set by using a regular expression to generate a face character dictionary; and adding the word dictionary into a self-defined dictionary of a Chinese word segmentation tool such as Jieba, NLPIR and the like, performing word segmentation processing on all texts in the text set, and stopping words.

Step 2 word embedding

And putting the preprocessed text set into a Skip-Gram model in Word2Vec for training, and embedding the context relationship of the words into a low-dimensional space to obtain Word vectors corresponding to all the words. The size of the window in the model parameters can be 10, and the number of the neurons of the hidden layer can be 300.

Step 3, constructing a facial character emotion dictionary

3.1 Emotion is divided into four groups according to Plutchik Emotion wheel disk, and the four groups are opposite and are eight types: happy music

Sadness and liking

Aversion and surprise

Anger and anger

Terrorism, where happiness, trust, expectation belong to positive emotions, sadness, disgust, anger, terrorism belong to negative emotions, and surprise belongs to neutral emotions; respectively acquiring eight emotion words and a word vector corresponding to each word in a word dictionary from the trained Skip-Gram model;

3.2 calculating the similarity between each color word vector and eight emotion word vectors, namely the cosine distance s₁,s₂,...,s₈The cosine distance between two word vectors X and Y is calculated as follows:

wherein X ═ X1, X2, X3, … xD, Y ═ Y1, Y2, Y3, … yD, all including D dimensional features.

3.3 the cosine distance sim obtained above₁,sim₂,...,sim₈Performing a normalization process, P (w)_iThe classification probability of the ith emotion corresponding to the character w can be calculated by the following formula:

wherein, sim_iRepresents the cosine distance between the character and the ith emotional word, and finally, P (w)₁+P(w)₂+…+P(w)₈1. Calculating the emotion probability of all the face characters and generating a face character emotion dictionary.

Step 4, calculating the emotional probability of the document characters

All words and phrases set for a document w₁,w₂,...w_mAnd (4) by inquiring a facial character emotion dictionary, averaging all kinds of emotion probabilities to obtain a document facial character emotion probability value:

wherein S is_iThe ith emotion value of the document text is m, and the m is the number of the documents containing the text.

Step 5, calculating the text emotion probability

FIG. 2 is a flowchart of text emotion probability calculation.

5.1 the word vector represents: using the word vector output by Skip-Gram in step 1.2 to represent the text, and obtaining a word vector sequence t ═ t of the text₁,t₂,…,t_n]Wherein t is_iAnd (4) representing the ith word in the text, wherein n is the maximum number of inputtable words, and the word vectors with insufficient number are subjected to 0 complementing treatment.

5.2 input two-way LSTM network: converting the word vector sequence t to [ t ]₁,t₂,…,t_n]Respectively inputting the forward and reverse long-short term memory network (LSTM) networks, and fully fusing the context information of the text to obtain a forward characteristic sequence t_f＝[t_f1,t_f2,…,t_fn]Reverse signature sequence t_b＝[t_b1,t_b2,…,t_bn]Will t_fAnd t_bSplicing two word vectors of the same word to obtain a spliced word vector sequence t_fb＝[t_fb1,t_fb2,…,t_fbn]Wherein t is_fbi＝[t_fi；t_bi]Dimension of 2D

5.3 convolution of input textA neural network: matrix t pair using text convolution model_fbAnd performing convolution operation to further extract local features of the text. Convolution kernel w epsilon R [ h x 2D [ ]]The convolution kernel height is h, multiple heights can be set, and the convolution kernel width remains unchanged to 2 times the word vector dimension D. Convolution kernel w in matrix t_fbThe above one-dimensional convolution is carried out by step length 1, the convolution operation is similar to the traditional N-gram feature extraction mode, and the sequence feature c_iIs formed by a convolution kernel w and a matrix region x_i:i+h-1Performing convolution operation to obtain

c_i＝f(w·x_i:i+h-1+b)

Where f is a non-linear activation function such as the tanh and b ∈ R is the bias term. Convolution kernel w to t_fbEach region { x_1:h,x_2:h+1,x_3:h+2,…,x_n-h+1:nThe convolution operation will obtain a sequence feature C ═ C with 1 columns₁,c₂,…,c_n-h+1]. The convolution kernel height h may be set to 1,2,3, …, q (where q is the number of convolution kernels), and q sequence features may be obtained.

5.4 pooling: using maximum pooling to perform down-sampling processing to obtain sequence characteristic z ═ z { (z)₁,z₂,…,z_q}。

5.5 input softmax layer: vector P ═ { P } of input z, output T1₁,P₂,…,P_i,…,P_T}＝softmax[w﹒(z。r)+b]W is a weight matrix, r is used for introducing Dropout operation, b is a bias vector, T is a category number of 8, and Pi represents a probability value that the current text belongs to the ith emotion category.

Step 6, calculating the comprehensive emotion probability of the document

Document comprehensive emotion probability value G ═ { G ═ G₁,G₂,…,G_i,…,G_TIn which G is_iThe probability value of the document belonging to the ith emotion category is calculated by the color word emotion probability value and the text emotion probability value:

G_i＝αS_i+(1-α)P_i，

wherein alpha is a facial character emotion probability weight coefficient, the value range is 0< alpha <1, and finally the class with the maximum probability value is used as the final emotion classification of the document. If positive, negative and neutral emotional polarities are required to be output, three types of probabilities of happiness, trust and expectation are added to be used as positive emotional probabilities, four types of emotions of sadness, disgust, anger and terror are added to be used as negative emotional probabilities, surprise is directly used as neutral emotional probabilities, and finally the type with the maximum positive, negative and neutral probability values is used as the final emotional polarity.

The method and the device fuse the characters on the basis of the text to extract various emotions: in social media, characters are frequently used and are more beneficial to expression of emotion, and changes in semantic and contextual emotion are caused, so that the traditional emotion analysis based on text only cannot meet the requirement. The method extracts various emotion probabilities of the face characters by calculating the similarity and generates a face character emotion dictionary, and integrates face character emotion information on the basis of the text by calculating the document face character emotion probability so as to help improve the comprehensiveness and accuracy of emotion extraction of the user and further improve the accuracy of decision making.

The method integrates various models to extract text features: the method comprises the steps of firstly obtaining an embedded word vector by using a Skip-gram model, then further extracting context feature information through a bidirectional LSTM, and finally performing convolution operation by using convolution kernels with different heights to extract text local features, so that the finally extracted feature space has low dimensionality, not only contains the whole context information, but also can concern the text local information. The method provided by the invention integrates the advantages of various models, provides reliable basis for emotion extraction by utilizing the high efficiency and strong characteristic learning capability of the neural network and the recurrent neural network, and reduces the dependence on artificially constructed emotion dictionaries and rules.

For a better understanding of the present invention, the foregoing detailed description has been given in conjunction with specific embodiments thereof, but not with the intention of limiting the invention thereto. Any simple modifications of the above embodiments according to the technical essence of the present invention still fall within the scope of the technical solution of the present invention. In the present specification, each embodiment is described with emphasis on differences from other embodiments, and the same or similar parts between the respective embodiments may be referred to each other. For the system embodiment, since it basically corresponds to the method embodiment, the description is relatively simple, and for the relevant points, reference may be made to the partial description of the method embodiment.

Claims

1. A multi-category emotion extraction method fusing facial characters comprises the steps of preprocessing a text set, and is characterized by further comprising the following steps:

step 2: constructing a facial character emotion dictionary;

and 4, step 4: calculating the text emotion probability;

and 5: and calculating the comprehensive emotion probability of the document.

2. The method for extracting multi-category emotion fusing text and text as claimed in claim 1, wherein said step 2 comprises the sub-steps of:

wherein X is (X)₁,x₂,x₃,…x_D),Y＝(y₁,y₂,y₃,…y_D) All contain D-dimensional features, X is the word vector representation of the word X, and y isA word vector representation of word Y, D represents the dimension of the word vector, and i represents the ith component of the word vector;

3. The method as claimed in claim 2, wherein said step 3 comprises collecting all the color words { w } for a document₁,w₂,...w_mAnd (4) by inquiring a facial character emotion dictionary, averaging all kinds of emotion probabilities to obtain a document facial character emotion probability value:

4. The method for extracting multi-category emotion fusing text and text as claimed in claim 3, wherein said step 4 comprises the sub-steps of:

step 41: carrying out word vector representation;

step 42: inputting a bidirectional LSTM network;

step 43: inputting a text convolution neural network;

Step 45: and inputting the softmax layer.

5. The method as claimed in claim 4, wherein the step 41 comprises using the word vector output from Skip-Gram to represent the text, and obtaining a word vector sequence t ═ t [ t ] of the text₁,t₂,…,t_n]Wherein t is_iRepresents the ith word in the text, and n is the maximum number of inputtable words.

6. The method of claim 5, wherein the step 42 comprises selecting the word vector sequence t ═ t [ t ] according to the multi-category emotion extraction method₁,t₂,…,t_n]Respectively inputting the forward and reverse long and short term memory network, fully fusing the context information of the text to obtain the forward characteristic sequence t_f＝[t_f1,t_f2,…,t_fn]And reverse signature sequence t_b＝[t_b1,t_b2,…,t_bn]Will t_fAnd t_bSplicing two word vectors of the same word to obtain a spliced word vector sequence t_fb＝[t_fb1,t_fb2,…,t_fbn]Wherein t is_fbi＝[t_fi；t_bi]Dimension 2D.

7. The method of claim 6, wherein said step 43 comprises using a text convolution model to pair said matrix t_fbPerforming convolution operation, wherein a convolution kernel w belongs to R [ h ] 2D]With a convolution kernel height h and a convolution kernel w at the matrix t_fbOne-dimensional convolution with step size 1, sequence feature c_iIs formed by a convolution kernel w and a matrix region x_i:i+h-1Performing convolution operation to obtain:

c_i＝f(w·x_i:i+h-1+b)

8. The method of claim 7, wherein step 45 comprises inputting z and outputting T1 vector P { P ═ P { (m ═ P) } in the text-to-text fusion₁,P₂,…,P_i,…,P_T}＝softmax[w﹒(z。r)+b]Where w is the weight matrix, r is used to introduce Dropout operations, b is the bias vector, T is the number of classes, P_iAnd indicating the probability value of the current text belonging to the ith emotion category.

9. The method of claim 8, wherein the document integrated emotion probability G ═ G₁,G₂,…,G_i,…,G_TIn which G is_iThe probability value of the document belonging to the ith emotion category is calculated by the color word emotion probability value and the text emotion probability value:

G_i＝αS_i+(1-α)P_i，

10. The method as claimed in claim 9, wherein the step 5 comprises classifying the class with the highest probability value as the final emotion classification of the document.