CN108984724B - Method for improving emotion classification accuracy of specific attributes by using high-dimensional representation - Google Patents

Info

Publication number
CN108984724B
CN108984724B (application CN201810754022.5A)
Authority
CN
China
Prior art keywords: word, clause, attribute, representation, layer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810754022.5A
Other languages
Chinese (zh)
Other versions
CN108984724A (en)
Inventor
谢珏
吴含前
李露
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kaier Bote Information Technology Kunshan Co ltd
Original Assignee
Kaier Bote Information Technology Kunshan Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kaier Bote Information Technology Kunshan Co ltd filed Critical Kaier Bote Information Technology Kunshan Co ltd
Priority to CN201810754022.5A priority Critical patent/CN108984724B/en
Publication of CN108984724A publication Critical patent/CN108984724A/en
Application granted granted Critical
Publication of CN108984724B publication Critical patent/CN108984724B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Abstract

The invention discloses a method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation. First, the invention provides a clause segmentation algorithm that segments a comment text into several clauses; second, the words in each clause are encoded with multiple bidirectional long short-term memory networks to obtain a representation of each clause; finally, another bidirectional long short-term memory network encodes the clause representations obtained in the previous step to obtain the final representation of the whole sentence. In this way, information more relevant to the specific attribute is captured from three different dimensions (words, clauses and sentences), which ultimately improves the accuracy of emotion classification for the specific attribute.

Description

Method for improving emotion classification accuracy of specific attributes by using high-dimensional representation
Technical Field
The invention relates to an emotion analysis method for comment texts, and in particular to a method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation.
Background
To obtain the emotion polarity of each attribute in a comment text, sentiment analysis (SA) techniques identify the attribute words, emotion words and emotion modifiers in the text and analyze them further to judge the emotion polarity the comment text expresses toward a specific attribute. Such analysis is applied in fields such as event analysis, online public opinion analysis and spam processing.
When judging the emotion polarity of a comment text, traditional coarse-grained emotion analysis methods process the whole text as a single unit and cannot make fine-grained polarity judgments for the specific attributes mentioned in it. Recent emotion analysis research has therefore turned toward fine granularity, which has become a hot research topic at home and abroad.
Currently, deep neural network (DNN) techniques are used to perform emotion analysis on specific attributes in text. For the problem of classifying the sentiment of a sentence with respect to a specific attribute, Tang et al. propose a Target-Dependent Long Short-Term Memory network (TD-LSTM) and a Target-Connection Long Short-Term Memory network (TC-LSTM) in "Target-Dependent Sentiment Classification with Long Short Term Memory". TD-LSTM takes the target information into account when generating the sentence representation, and TC-LSTM further associates the target information with its context; this method takes the average of the word vectors in the target phrase as the target vector. However, simply averaging the word vectors in the target phrase cannot fully express the semantics of the target phrase, so optimal results cannot be obtained. Dong et al. propose an Adaptive Recursive Neural Network (AdaRNN) in "Adaptive Recursive Neural Network for Target-dependent Twitter Sentiment Classification", which adaptively propagates the emotion words to the specific attribute according to the context and the syntactic relations between the emotion words and that attribute. The method converts the dependency tree of a sentence into a recursive structure for the specific attribute and obtains a higher-level representation based on this structure. Experiments show that classifiers built on AdaRNN outperform traditional machine learning methods and basic recurrent neural network methods, but their classification performance still leaves room for improvement.
Disclosure of Invention
The purpose of the invention is as follows: to address the defects of the prior art, the invention provides a method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation.
The technical scheme is as follows:
A method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation comprises a training stage and a testing stage; the specific steps are as follows:
a training stage:
S1) a sentence is segmented into several clauses by the clause segmentation algorithm, and each word in the clauses is expressed as a word vector; the concatenation of each word vector with the attribute word vector is used as the input of the deep neural network model; all unknown words are initialized by random sampling from the uniform distribution U(-0.01, 0.01); the dimensionality of the word vectors and of the bidirectional long short-term memory network is set to 300, and the other hyperparameters are tuned on a development data set, yielding a trained deep neural network model;
S2) the deep neural network model comprises a 3-layer architecture consisting of a word coding layer, a clause coding layer and a softmax layer: the word coding layer captures the relevance of each word in a clause to the specific attribute, the clause coding layer maps the specific attribute into the clauses, and the softmax layer inputs the final representation s of the comment text into a softmax classifier, finally obtaining the class probability distribution of the comment text for the given attribute;
S3) the input word sequence of the deep neural network model consists of (d + d')-dimensional word vectors, where d denotes the dimension of the word vector and d' denotes the dimension of the attribute word vector; the value of d can be adjusted according to the experimental setting;
S4) for the training loss function of the model, a cross-entropy loss function is adopted to train the high-dimensional-representation-based attribute-specific emotion classification model in an end-to-end manner;
S5) given training data (x_t, a_t, y_t), x_t denotes the t-th sample to be predicted, a_t denotes the attribute present in that sample, and y_t denotes the true category label of sample x_t for the specific attribute a_t;
S6) the high-dimensional-representation-based attribute-specific emotion classification model is regarded as a black-box function f(x_t, a_t), the output of which is a vector representing the probability that the input text belongs to each class label; the goal of training is to minimize the loss function:

loss = -∑_{t=1}^{M} ∑_{k=1}^{K} y^k_t · log f_k(x_t, a_t) + λ‖θ‖²

where M denotes the number of training samples, K denotes the number of class labels, and λ denotes the weight of the L2 regularization term over the model parameters θ;
S7) the Adagrad optimization function is adopted, and the parameters of all matrices and vectors are initialized by sampling from a uniform distribution whose bounds depend on r and c', the number of rows and columns of the matrix; in the training process, a Dropout strategy is adopted in the Bi-LSTM to avoid overfitting;
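The text specifies only that the bounds of the uniform distribution depend on r and c'; the sketch below assumes the common Glorot-style bound √(6/(r + c')), which should be read as an assumption rather than the patented constant:

import numpy as np

def init_matrix(r, c):
    """Initialize an r x c parameter matrix from a shape-dependent uniform
    distribution (Glorot-style bound assumed here)."""
    bound = np.sqrt(6.0 / (r + c))
    return np.random.uniform(-bound, bound, size=(r, c))

def init_unknown_word(dim=300):
    """Unknown words are sampled from U(-0.01, 0.01), as specified in step S1)."""
    return np.random.uniform(-0.01, 0.01, size=dim)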
Testing stage:
S8) the comment text to be processed is input into the trained deep neural network model to obtain the emotion polarity of the comment text for the specific attribute.
Further, the clause segmentation algorithm segments sentences at punctuation marks and connection words (collectively referred to as separators): a minnum parameter is defined to limit the minimum number of words a clause must contain, and a segment is split off as a clause if and only if its length reaches minnum;
in addition, a maxnum parameter is defined to ensure that every sentence is cut into the same number of clauses, because the subsequent neural network requires a fixed number of clauses as input;
the separators include the punctuation marks and connection words ",", ";", "and", "but", "so", "especially", "however", "then", "though", "except".
Further, the other hyperparameters are tuned on the development data set: specifically, the initial value of the learning rate is set to 0.1, the regularization weight of the parameters is set to 10^-5, and the Dropout rate is set to 0.25.
Further, in the clause segmentation algorithm, the parameter minnum is set to 3 and the parameter maxnum is set to 4, so that all possible clauses can be mined from the sentence and the model achieves the best performance on the development data set.
Further, the specific process of the high-dimensional-representation-based bidirectional long short-term memory network model composed of a word coding layer, a clause coding layer and a softmax layer is as follows:
First, the word coding layer: assume that the comment text contains C clauses in total, where c_i denotes the i-th clause, each clause contains N_i words, and I_ij denotes the word appearing at the j-th position in the i-th clause, with j ∈ [1, N_i];
each word appearing in clause c_i is represented by a word vector w_ij ∈ R^d, where j ∈ [1, N_i]; the word vectors w_ij = E_w · I_ij are all stored in a word-embedding matrix E_w ∈ R^(d×|V|), where d denotes the dimension of the word vector and V denotes the vocabulary;
The attribute category (aspect category) that appears consists of two parts, an entity and a feature (attribute):
specifically, assume an entity string e_1 of length L_1, expressed as the vector sequence w^e_1, …, w^e_{L_1}, where w^e_n ∈ R^(d') denotes the d'-dimensional vector representation of the n-th word in the entity string;
correspondingly, the feature string is represented in the same way as a sequence of d'-dimensional word vectors w^f_n;
Usually, a word vector representation has a linear structure, which makes it have an overlapping or subtractive property at a semantic level, so that the purpose of combining words can be achieved by adding elements of the word vector;
the entity word vectors and the feature word vectors are therefore added to obtain the final representation of the attribute word vector:

v_a = ∑_n w^e_n + ∑_n w^f_n
then the attribute word vector is appended to each word-vector representation to obtain the attribute-extended representation of each word:

w'_ij = w_ij ⊕ v_a

where the dimension of w'_ij is (d + d'), i ∈ [1, C], j ∈ [1, N_i], ⊕ denotes the vector concatenation operator, C denotes the number of clauses, and N_i denotes the number of words contained in clause c_i;
Taking the obtained word vectors w'_ij as input, a bidirectional long short-term memory network (Bi-LSTM) is adopted to integrate the information of each word in the forward and backward directions, so that the input word-vector matrix is converted into a new representation:
Bi-LSTM means that each training sequence is presented forwards and backwards to two separate long short-term memory networks (LSTMs), both of which are connected to the same output layer;
this structure provides complete past and future context information for each point in the input sequence;
the forward LSTM contained in the Bi-LSTM, denoted →LSTM, reads clause c_i from I_{i,1} to I_{i,N_i}, i.e. from front to back; the corresponding backward LSTM, denoted ←LSTM, reads clause c_i from I_{i,N_i} to I_{i,1}, i.e. from back to front:

→h_ij = →LSTM(w'_ij)
←h_ij = ←LSTM(w'_ij)
The forward hidden state →h_ij and the backward hidden state ←h_ij are concatenated to obtain the final hidden-state representation of each word I_ij in the clause, which fuses the contextual information of the clause related to word I_ij:

h_ij = [→h_ij ; ←h_ij]

Finally, the hidden states h_ij of all words I_ij in the clause are average-pooled by the Mean-Pooling layer to obtain the final representation of the clause:

c_i = (1/N_i) ∑_{j=1}^{N_i} h_ij
Second, the clause coding layer: for the clause vectors c_i obtained in the previous step, a Bi-LSTM is again used to encode the given clause vectors and fuse their context information:

→h_i = →LSTM(c_i)
←h_i = ←LSTM(c_i)

Similar to the word coding layer, the forward hidden state →h_i and the backward hidden state ←h_i are concatenated to obtain the final hidden-state representation of each clause c_i in the comment text, which fuses the related information of the other clauses in the comment text:

h_i = [→h_i ; ←h_i]

The hidden states h_i of all clauses c_i in the comment text are then average-pooled by the Mean-Pooling layer to obtain the final representation of the comment text:

s = (1/C) ∑_{i=1}^{C} h_i
Third, the softmax layer: the final representation s of the comment text is input into a softmax classifier, finally obtaining the class probability distribution of the comment text for the given attribute:

o = W_l · s + b_l

where o ∈ R^K denotes the output, W_l denotes a weight matrix, and b_l denotes a bias;
the probability that a given sentence belongs to each category k ∈ [1, K] is computed as:

P(y = k | s; θ) = exp(o_k) / ∑_{k'=1}^{K} exp(o_{k'})

where θ denotes all model parameters; the class label with the highest probability computed by this formula is taken as the final class label of the comment text.
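A minimal NumPy sketch of this classification step (names illustrative):

import numpy as np

def softmax_classify(s, W_l, b_l):
    """Map the comment representation s to class probabilities.

    s:   final representation of the comment text
    W_l: (K, dim(s)) weight matrix;  b_l: (K,) bias
    Returns the index of the most probable class and the distribution.
    """
    o = W_l @ s + b_l
    e = np.exp(o - np.max(o))          # subtract max for numerical stability
    probs = e / e.sum()
    return int(np.argmax(probs)), probs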
Compared with the prior art, the method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation provided by the invention constructs a multi-level, high-dimensional deep neural network model from the comment text and its specific attribute information across three different dimensions (words, clauses and sentences), thereby achieving better classification performance.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a diagram of a specific attribute emotion classification model architecture constructed by the present invention;
FIG. 3 is a restaurant domain review text example.
Detailed Description
The technical scheme of the invention is further explained below with reference to the accompanying drawings.
This embodiment presents a method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation:
a bidirectional long short-term memory network model based on high-dimensional representation is constructed for specific attributes, and its use comprises a training stage and a testing stage:
in the training stage, the word coding layer captures the correlation between each word in the clauses and the specific attribute, and the clause coding layer maps the specific attribute into the clauses as input to the deep neural network model; all unknown words are initialized by random sampling from the uniform distribution U(-0.01, 0.01), the dimensionality of the word vectors and of the bidirectional long short-term memory network is set to 300, and the other hyperparameters are tuned on the development data set, yielding the trained deep neural network model;
in the testing stage, the comment text to be processed is input into the trained deep neural network model to obtain the emotion polarity of the comment text for the specific attribute;
when judging the emotion polarity expressed by the comment text toward a specific attribute, two observations matter:
on the one hand, not all components of the comment text are relevant to the specific attribute;
on the other hand, a comment text may contain several attributes, and different attributes may need information from different parts of the sentence for emotion classification;
therefore, referring to FIG. 1, the clause segmentation algorithm proposed in this embodiment segments a sentence into different clauses so that specific attributes can be mapped into the clauses.
The basic idea of the clause segmentation algorithm proposed in this embodiment is to segment sentences at punctuation marks and conjunctions (collectively referred to as separators):
as shown in FIG. 3, "great and tasty" should not be divided into two clauses at the conjunction "and", so not all separators can serve as clause boundaries;
the scheme therefore defines a minnum parameter to limit the minimum number of words a clause must contain, and a segment is split off as a clause only when its length reaches minnum;
in addition, a maxnum parameter ensures that every sentence is cut into the same number of clauses, because the subsequent neural network requires a fixed number of clauses as input;
the clause segmentation method is detailed in Table 1, where the separators include the punctuation marks and connection words ",", ";", "and", "but", "so", "especially", "however", "then", "though", "except".
TABLE 1 Clause segmentation algorithm (pseudo-code reproduced as an image in the original publication)
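Since the pseudo-code of Table 1 survives only as an image, the following Python sketch gives one plausible reading of the algorithm described above; the handling of too-short trailing segments and the merging of surplus clauses are assumptions:

SEPARATORS = {",", ";", "and", "but", "so", "especially", "however",
              "then", "though", "except"}

def segment_clauses(tokens, minnum=3, maxnum=4):
    """Split a tokenized sentence into clauses at separator tokens.

    A separator only closes a clause when the segment accumulated so far
    contains at least `minnum` words; otherwise the words keep accumulating
    across the separator. The result is then merged/padded so that every
    sentence yields exactly `maxnum` clauses, because the downstream
    network requires a fixed number of clauses as input.
    """
    clauses, current = [], []
    for tok in tokens:
        if tok.lower() in SEPARATORS and len(current) >= minnum:
            clauses.append(current)
            current = []
        elif tok.lower() not in SEPARATORS:
            current.append(tok)
    if current:
        if clauses and len(current) < minnum:
            clauses[-1].extend(current)      # merge a short tail into the last clause
        else:
            clauses.append(current)
    while len(clauses) > maxnum:             # merge surplus clauses
        clauses[-2].extend(clauses.pop())
    while len(clauses) < maxnum:             # pad so every sentence has maxnum clauses
        clauses.append([])
    return clauses

# "great" and "tasty" stay in the same clause because the segment before
# "and" is shorter than minnum (punctuation must be tokenized separately):
print(segment_clauses("great and tasty but too expensive , i will not return".split()))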
Referring to FIG. 2, the bidirectional long short-term memory network model based on high-dimensional representation constructed for specific attributes comprises a 3-layer architecture of a word coding layer, a clause coding layer and a softmax layer:
the word coding layer captures the relevance of each word in the clauses to the specific attribute;
the clause coding layer maps the specific attribute into the clauses;
the softmax layer inputs the final representation s of the comment text into a softmax classifier, finally obtaining the class probability distribution of the comment text for the given attribute;
the cross-entropy loss function is selected as the training loss function of the model, and Adagrad is used as the optimization function:
The specific process of the high-dimensional-representation-based bidirectional long short-term memory network model composed of a word coding layer, a clause coding layer and a softmax layer is as follows:
First, the word coding layer: assume that the comment text contains C clauses in total, where c_i denotes the i-th clause, each clause contains N_i words, and I_ij denotes the word appearing at the j-th position in the i-th clause, with j ∈ [1, N_i];
each word appearing in clause c_i is represented by a word vector w_ij ∈ R^d, where j ∈ [1, N_i]; the word vectors w_ij = E_w · I_ij are all stored in a word-embedding matrix E_w ∈ R^(d×|V|), where d denotes the dimension of the word vector and V denotes the vocabulary;
The attribute category (aspect category) that appears consists of two parts, an entity and a feature (attribute):
specifically, assume an entity string e_1 of length L_1, expressed as the vector sequence w^e_1, …, w^e_{L_1}, where w^e_n ∈ R^(d') denotes the d'-dimensional vector representation of the n-th word in the entity string;
correspondingly, the feature string is represented in the same way as a sequence of d'-dimensional word vectors w^f_n;
Usually, a word vector representation has a linear structure, which makes it have an overlapping or subtractive property at a semantic level, so that the purpose of combining words can be achieved by adding elements of the word vector;
the entity word vectors and the feature word vectors are therefore added to obtain the final representation of the attribute word vector:

v_a = ∑_n w^e_n + ∑_n w^f_n
then the attribute word vector is appended to each word-vector representation to obtain the attribute-extended representation of each word:

w'_ij = w_ij ⊕ v_a

where the dimension of w'_ij is (d + d'), i ∈ [1, C], j ∈ [1, N_i], ⊕ denotes the vector concatenation operator, C denotes the number of clauses, and N_i denotes the number of words contained in clause c_i;
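A NumPy sketch of this attribute-extension step; the dimensions and the FOOD#QUALITY example are illustrative assumptions:

import numpy as np

d, d_prime = 300, 300      # word-vector and attribute-vector dimensions

def attribute_vector(entity_vecs, feature_vecs):
    """Sum the d'-dimensional entity and feature word vectors into a single
    attribute word vector v_a, exploiting additive compositionality."""
    return entity_vecs.sum(axis=0) + feature_vecs.sum(axis=0)

def extend_with_attribute(word_vecs, v_a):
    """Concatenate v_a onto every word vector of a clause, producing the
    (d + d')-dimensional attribute-extended representations w'_ij."""
    tiled = np.tile(v_a, (word_vecs.shape[0], 1))
    return np.concatenate([word_vecs, tiled], axis=1)

# e.g. attribute category FOOD#QUALITY over a 5-word clause:
entity, feature = np.random.randn(1, d_prime), np.random.randn(1, d_prime)
clause = np.random.randn(5, d)
w_prime = extend_with_attribute(clause, attribute_vector(entity, feature))
assert w_prime.shape == (5, d + d_prime)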
Taking the obtained word vectors w'_ij as input, a bidirectional long short-term memory network (Bi-LSTM) is adopted to integrate the information of each word in the forward and backward directions, so that the input word-vector matrix is converted into a new representation:
Bi-LSTM means that each training sequence is presented forwards and backwards to two separate long short-term memory networks (LSTMs), both of which are connected to the same output layer;
this structure provides complete past and future context information for each point in the input sequence;
the forward LSTM contained in the Bi-LSTM, denoted →LSTM, reads clause c_i from I_{i,1} to I_{i,N_i}, i.e. from front to back; the corresponding backward LSTM, denoted ←LSTM, reads clause c_i from I_{i,N_i} to I_{i,1}, i.e. from back to front:

→h_ij = →LSTM(w'_ij)
←h_ij = ←LSTM(w'_ij)
The forward hidden state →h_ij and the backward hidden state ←h_ij are concatenated to obtain the final hidden-state representation of each word I_ij in the clause, which fuses the contextual information of the clause related to word I_ij:

h_ij = [→h_ij ; ←h_ij]

Finally, the hidden states h_ij of all words I_ij in the clause are average-pooled by the Mean-Pooling layer to obtain the final representation of the clause:

c_i = (1/N_i) ∑_{j=1}^{N_i} h_ij
Second, the clause coding layer: for the clause vectors c_i obtained in the previous step, a Bi-LSTM is again used to encode the given clause vectors and fuse their context information:

→h_i = →LSTM(c_i)
←h_i = ←LSTM(c_i)

Similar to the word coding layer, the forward hidden state →h_i and the backward hidden state ←h_i are concatenated to obtain the final hidden-state representation of each clause c_i in the comment text, which fuses the related information of the other clauses in the comment text:

h_i = [→h_i ; ←h_i]

The hidden states h_i of all clauses c_i in the comment text are then average-pooled by the Mean-Pooling layer to obtain the final representation of the comment text:

s = (1/C) ∑_{i=1}^{C} h_i
Third, the softmax layer: the final representation s of the comment text is input into a softmax classifier, finally obtaining the class probability distribution of the comment text for the given attribute:

o = W_l · s + b_l

where o ∈ R^K denotes the output, W_l denotes a weight matrix, and b_l denotes a bias;
the probability that a given sentence belongs to each category k ∈ [1, K] is computed as:

P(y = k | s; θ) = exp(o_k) / ∑_{k'=1}^{K} exp(o_{k'})

where θ denotes all model parameters; the class label with the highest probability computed by this formula is taken as the final class label of the comment text.
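The three-layer architecture just described can be sketched as follows; this uses the modern Keras API rather than the keras-1.2.2/Theano stack of the verification experiments, and the per-direction hidden size, padding lengths and three-class output are assumptions for illustration:

from tensorflow.keras import layers, models

C, N = 4, 20          # maxnum clauses per sentence, padded words per clause
DIM = 600             # d + d': attribute-extended word-vector dimension
HID, K = 300, 3       # Bi-LSTM hidden size per direction; number of class labels

inputs = layers.Input(shape=(C, N, DIM))

# word coding layer: one Bi-LSTM applied to every clause, then mean pooling
word_encoder = layers.Bidirectional(
    layers.LSTM(HID, return_sequences=True, dropout=0.25))
h_words = layers.TimeDistributed(word_encoder)(inputs)            # (C, N, 2*HID)
clause_vecs = layers.TimeDistributed(
    layers.GlobalAveragePooling1D())(h_words)                     # (C, 2*HID)

# clause coding layer: a second Bi-LSTM over clause vectors, then mean pooling
h_clauses = layers.Bidirectional(
    layers.LSTM(HID, return_sequences=True, dropout=0.25))(clause_vecs)
sentence_vec = layers.GlobalAveragePooling1D()(h_clauses)         # (2*HID,)

# softmax layer: o = W_l . s + b_l followed by softmax
outputs = layers.Dense(K, activation="softmax")(sentence_vec)

model = models.Model(inputs, outputs)
model.summary()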
Verification:
To verify the advantages of the deep neural network model provided by the invention over other emotion classification algorithms, a series of comparison experiments were carried out:
the experimental environment configuration comprises hardware and software:
the hardware used to train the model is an Intel Xeon 2.5 GHz CPU with 4 cores and 8 GB of memory;
the software comprises the Windows 10 operating system and the machine learning front-end library keras-1.2.2 with a theano-0.8.2 back end, based on Python 2.7 and several scientific computing libraries;
the experimental procedure mainly covers three aspects:
1) Data preparation
Experiments are carried out on two datasets (the Laptop domain and the Restaurant domain) of the semantic evaluation Task 12 to verify the effectiveness of the proposed method; each dataset consists of a number of user comments, and each comment includes a list of attributes and the emotion polarity corresponding to each attribute, where the polarities are positive, neutral and negative; the data distribution of the two domains is shown in Table 2;
in addition, 10% of the data is randomly selected from the training set as a development data set for tuning the algorithm parameters, and GloVe is selected as the pre-trained word vectors.
TABLE 2 Restaurant and Laptop domain dataset distribution (table reproduced as an image in the original publication)
2) Model training
A cross-entropy loss function is adopted to train the high-dimensional-representation-based attribute-specific emotion classification model in an end-to-end manner. Given training data (x_t, a_t, y_t), x_t denotes the t-th sample to be predicted, a_t denotes the attribute present in that sample, and y_t denotes the true category label of sample x_t for the specific attribute a_t;
The high-dimensional-representation-based attribute-specific emotion classification model is regarded as a black-box function f(x_t, a_t), the output of which is a vector representing the probability that the input text belongs to each class label, and the goal of training is to minimize the loss function:

loss = -∑_{t=1}^{M} ∑_{k=1}^{K} y^k_t · log f_k(x_t, a_t) + λ‖θ‖²

where M denotes the number of training samples, K denotes the number of class labels, and λ denotes the weight of the L2 regularization term over the model parameters θ;
adagrad is adopted as an optimization function, and parameters of all matrixes and vectors are uniformly distributed
Figure BDA0001725546690000122
Figure BDA0001725546690000123
Wherein r and c' are the number of rows and columns in the matrix;
and in order to avoid overfitting during training, the Dropout strategy is adopted in Bi-LSTM.
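A hedged sketch of this training configuration for the Keras model sketched earlier; the data arrays, batch size and epoch count are placeholders:

from tensorflow.keras.optimizers import Adagrad

# cross-entropy loss and Adagrad with the initial learning rate 0.1 from the
# text; the 1e-5 L2 weight would be attached per layer via kernel_regularizer.
model.compile(optimizer=Adagrad(learning_rate=0.1),
              loss="categorical_crossentropy",
              metrics=["accuracy"])

# x_train: (M, C, N, DIM) attribute-extended clauses; y_train: (M, K) one-hot
model.fit(x_train, y_train, validation_data=(x_dev, y_dev),
          batch_size=32, epochs=25)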
3) Experimental results
The deep neural network model is compared with baseline methods to comprehensively evaluate its performance:
both the baseline methods and the method proposed in this scheme use GloVe word vectors during training;
The baseline methods are as follows:
1) Majority algorithm (Majority): a basic baseline that assigns the majority emotion polarity observed in the training set to every test sample for the specific attribute;
2) long short-term memory network (LSTM): uses a single LSTM to model the context and obtain the hidden representation of each word; the average of all hidden representations is then taken as the final representation of the input and fed to the softmax layer to obtain the predicted probability of each label;
3) target-connection long short-term memory network (TC-LSTM): extends the basic LSTM with two LSTMs, one forward and one backward, for the attribute information; the model also blends the attribute information into the sentence representation and finally concatenates the two attribute-aware representations for emotion polarity prediction on the specific attribute;
4) attention-based long short-term memory network (ATAE-LSTM): models the context words with an LSTM and embeds the attribute vector into each word vector;
5) Interactive Attention Network (IAN): an interactive learning method that first models the context and the attribute with LSTMs and then interactively learns attention representations for both;
The method proposed in this scheme is a hierarchical bidirectional LSTM (Hierarchical Bi-LSTM): a multi-layer Bi-LSTM that constructs a multi-level, high-dimensional deep neural network model from the comment text and its specific attribute information across three dimensions of the high-dimensional representation (words, clauses and sentences): first, a sentence is segmented into several clauses by the clause segmentation algorithm; then all clauses are encoded with multiple bidirectional long short-term memory networks; finally, the clause representations are encoded with another bidirectional long short-term memory network, and the probability that the comment text belongs to each category for the specific attribute is obtained through the softmax layer.
TABLE 3 Performance comparison of different attribute-level emotion classification methods on plain text (table reproduced as an image in the original publication)
Table 3 shows the performance comparison between this scheme and the other baseline methods:
as observed from Table 3, the Majority algorithm performs worst; the classifiers it produces achieve classification accuracies of 53.7% and 57.0% in the Restaurant and Laptop domains respectively;
all the other methods are implemented on the basis of the LSTM neural network model and outperform the Majority algorithm; the experimental results show that the LSTM model not only has the potential to generate representations automatically but can also bring performance improvements to attribute-level emotion classification;
in addition, Table 3 shows that the classification accuracies of TC-LSTM, ATAE-LSTM and IAN are all better than that of LSTM, which demonstrates that taking attribute information into account when performing emotion classification for a specific attribute helps improve classification performance;
finally, the Hierarchical Bi-LSTM method proposed by the invention outperforms all the aforementioned methods, which highlights the superiority of using clause information.
In summary, the method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation provided in this embodiment constructs a multi-level, high-dimensional deep neural network model from the comment text and its specific attribute information across three different dimensions (words, clauses and sentences), thereby achieving better classification performance.

Claims (5)

1. A method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation, characterized in that the method comprises a training stage and a testing stage, with the following specific steps:
a training stage:
S1) a sentence is segmented into several clauses by the clause segmentation algorithm, and each word in the clauses is expressed as a word vector; the concatenation of each word vector with the attribute word vector is used as the input of the deep neural network model; all unknown words are initialized by random sampling from the uniform distribution U(-0.01, 0.01); the dimensionality of the word vectors and of the bidirectional long short-term memory network is set to 300, and the other hyperparameters are tuned on a development data set, yielding a trained deep neural network model;
S2) the deep neural network model comprises a 3-layer architecture consisting of a word coding layer, a clause coding layer and a softmax layer: the word coding layer captures the relevance of each word in a clause to the specific attribute, the clause coding layer maps the specific attribute into the clauses, and the softmax layer inputs the final representation s of the comment text into a softmax classifier, finally obtaining the class probability distribution of the comment text for the given attribute;
S3) the input word sequence of the deep neural network model consists of (d + d')-dimensional word vectors, where d denotes the dimension of the word vector and d' denotes the dimension of the attribute word vector; the value of d can be adjusted according to the experimental setting;
S4) a cross-entropy loss function is adopted to train the high-dimensional-representation-based attribute-specific emotion classification model in an end-to-end manner;
S5) given training data (x_t, a_t, y_t), x_t denotes the t-th sample to be predicted, a_t denotes the attribute present in that sample, and y_t denotes the true category label of sample x_t for the specific attribute a_t;
S6) the high-dimensional-representation-based attribute-specific emotion classification model is regarded as a black-box function f(x_t, a_t), the output of which is a vector representing the probability that the input text belongs to each class label; the goal of training is to minimize the loss function:

loss = -∑_{t=1}^{M} ∑_{k=1}^{K} y^k_t · log f_k(x_t, a_t) + λ‖θ‖²

where M denotes the number of training samples, K denotes the number of class labels, and λ denotes the weight of the L2 regularization term over the model parameters θ;
S7) the Adagrad optimization function is adopted, and the parameters of all matrices and vectors are initialized by sampling from a uniform distribution whose bounds depend on r and c', the number of rows and columns of the matrix; in the training process, a Dropout strategy is adopted in the Bi-LSTM to avoid overfitting;
Testing stage:
S8) the comment text to be processed is input into the trained deep neural network model to obtain the emotion polarity of the comment text for the specific attribute.
2. The method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation according to claim 1, characterized in that: the clause segmentation algorithm segments sentences at punctuation marks and connection words: a minnum parameter is defined to limit the minimum number of words a clause must contain, and a segment is split off as a clause if and only if its length reaches minnum;
in addition, a maxnum parameter is defined to ensure that every sentence is cut into the same number of clauses, because the subsequent neural network requires a fixed number of clauses as input;
the separators include the punctuation marks and connection words ",", ";", "and", "but", "so", "especially", "however", "then", "though", "except".
3. The method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation according to claim 2, characterized in that: the other hyperparameters are tuned on the development data set; specifically, the initial value of the learning rate is set to 0.1, the regularization weight of the parameters is set to 10^-5, and the Dropout rate is set to 0.25.
4. The method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation according to claim 3, characterized in that: in the clause segmentation algorithm, the parameter minnum is set to 3 and the parameter maxnum is set to 4, so that all possible clauses can be mined from the sentences and the model achieves the best performance on the development data set.
5. The method for improving the emotion classification accuracy of specific attributes by using high-dimensional representation according to claim 4, characterized in that: the specific process of the high-dimensional-representation-based bidirectional long short-term memory network model composed of a word coding layer, a clause coding layer and a softmax layer is as follows:
First, the word coding layer: assume that the comment text contains C clauses in total, where c_i denotes the i-th clause, each clause contains N_i words, and I_ij denotes the word appearing at the j-th position in the i-th clause, with j ∈ [1, N_i];
each word appearing in clause c_i is represented by a word vector w_ij ∈ R^d, where j ∈ [1, N_i]; the word vectors w_ij = E_w · I_ij are all stored in a word-embedding matrix E_w ∈ R^(d×|V|), where d denotes the dimension of the word vector and V denotes the vocabulary;
The attribute category (aspect category) that appears consists of two parts, an entity and a feature (attribute):
specifically, assume an entity string e_1 of length L_1, expressed as the vector sequence w^e_1, …, w^e_{L_1}, where w^e_n ∈ R^(d') denotes the d'-dimensional vector representation of the n-th word in the entity string;
correspondingly, the feature string is represented in the same way as a sequence of d'-dimensional word vectors w^f_n;
Usually, a word vector representation has a linear structure, which makes it have an overlap or subtraction characteristic at the semantic level, so that the purpose of combining words can be achieved by adding elements of the word vector;
the entity word vectors and the feature word vectors are therefore added to obtain the final representation of the attribute word vector:

v_a = ∑_n w^e_n + ∑_n w^f_n
then the attribute word vector is appended to each word-vector representation to obtain the attribute-extended representation of each word:

w'_ij = w_ij ⊕ v_a

where the dimension of w'_ij is (d + d'), i ∈ [1, C], j ∈ [1, N_i], ⊕ denotes the vector concatenation operator, C denotes the number of clauses, and N_i denotes the number of words contained in clause c_i;
Taking the obtained word vectors w'_ij as input, a bidirectional long short-term memory network (Bi-LSTM) is adopted to integrate the information of each word in the forward and backward directions, so that the input word-vector matrix is converted into a new representation:
Bi-LSTM means that each training sequence is presented forwards and backwards to two separate long short-term memory networks (LSTMs), both of which are connected to the same output layer;
this structure provides complete past and future context information for each point in the input sequence;
the forward LSTM contained in the Bi-LSTM, denoted →LSTM, reads clause c_i from I_{i,1} to I_{i,N_i}, i.e. from front to back; the corresponding backward LSTM, denoted ←LSTM, reads clause c_i from I_{i,N_i} to I_{i,1}, i.e. from back to front:

→h_ij = →LSTM(w'_ij)
←h_ij = ←LSTM(w'_ij)
The forward hidden state →h_ij and the backward hidden state ←h_ij are concatenated to obtain the final hidden-state representation of each word I_ij in the clause, which fuses the contextual information of the clause related to word I_ij:

h_ij = [→h_ij ; ←h_ij]

Finally, the hidden states h_ij of all words I_ij in the clause are average-pooled by the Mean-Pooling layer to obtain the final representation of the clause:

c_i = (1/N_i) ∑_{j=1}^{N_i} h_ij
Second, the clause coding layer: for the clause vectors c_i obtained in the previous step, a Bi-LSTM is again used to encode the given clause vectors and fuse their context information:

→h_i = →LSTM(c_i)
←h_i = ←LSTM(c_i)

Similar to the word coding layer, the forward hidden state →h_i and the backward hidden state ←h_i are concatenated to obtain the final hidden-state representation of each clause c_i in the comment text, which fuses the related information of the other clauses in the comment text:

h_i = [→h_i ; ←h_i]

The hidden states h_i of all clauses c_i in the comment text are then average-pooled by the Mean-Pooling layer to obtain the final representation of the comment text:

s = (1/C) ∑_{i=1}^{C} h_i
Third, the softmax layer: the final representation s of the comment text is input into a softmax classifier, finally obtaining the class probability distribution of the comment text for the given attribute:

o = W_l · s + b_l

where o ∈ R^K denotes the output, W_l denotes a weight matrix, and b_l denotes a bias;
the probability that a given sentence belongs to each category k ∈ [1, K] is computed as:

P(y = k | s; θ) = exp(o_k) / ∑_{k'=1}^{K} exp(o_{k'})

where θ denotes all model parameters; the class label with the highest probability computed by this formula is taken as the final class label of the comment text.
CN201810754022.5A 2018-07-10 2018-07-10 Method for improving emotion classification accuracy of specific attributes by using high-dimensional representation Active CN108984724B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810754022.5A CN108984724B (en) 2018-07-10 2018-07-10 Method for improving emotion classification accuracy of specific attributes by using high-dimensional representation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810754022.5A CN108984724B (en) 2018-07-10 2018-07-10 Method for improving emotion classification accuracy of specific attributes by using high-dimensional representation

Publications (2)

Publication Number Publication Date
CN108984724A CN108984724A (en) 2018-12-11
CN108984724B (en) 2021-09-28

Family

ID=64537672

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810754022.5A Active CN108984724B (en) 2018-07-10 2018-07-10 Method for improving emotion classification accuracy of specific attributes by using high-dimensional representation

Country Status (1)

Country Link
CN (1) CN108984724B (en)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109447021B (en) * 2018-11-08 2020-11-27 北京灵汐科技有限公司 Attribute detection method and attribute detection device
CN109766557B (en) * 2019-01-18 2023-07-18 河北工业大学 Emotion analysis method and device, storage medium and terminal equipment
CN109710769A (en) * 2019-01-23 2019-05-03 福州大学 A kind of waterborne troops's comment detection system and method based on capsule network
CN109902174B (en) * 2019-02-18 2023-06-20 山东科技大学 Emotion polarity detection method based on aspect-dependent memory network
CN109993216B (en) * 2019-03-11 2021-05-11 深兰科技(上海)有限公司 Text classification method and device based on K nearest neighbor KNN
CN110083829A (en) * 2019-04-03 2019-08-02 平安科技(深圳)有限公司 Feeling polarities analysis method and relevant apparatus
CN110083785A (en) * 2019-04-29 2019-08-02 清华大学 The Sex, Age method of discrimination and device of record are searched for based on user
CN110175237B (en) * 2019-05-14 2023-02-03 华东师范大学 Multi-category-oriented secondary emotion classification method
CN111353040A (en) * 2019-05-29 2020-06-30 北京工业大学 GRU-based attribute level emotion analysis method
CN110309769B (en) * 2019-06-28 2021-06-15 北京邮电大学 Method for segmenting character strings in picture
CN110502633A (en) * 2019-07-19 2019-11-26 中山大学 Network comment management method based on machine learning
CN110569338B (en) * 2019-07-22 2022-05-03 中国科学院信息工程研究所 Method for training decoder of generative dialogue system and decoding method
CN110765769B (en) * 2019-08-27 2023-05-02 电子科技大学 Clause feature-based entity attribute dependency emotion analysis method
CN110609899B (en) * 2019-08-29 2022-04-19 成都信息工程大学 Specific target emotion classification method based on improved BERT model
CN110717325B (en) * 2019-09-04 2020-11-13 北京三快在线科技有限公司 Text emotion analysis method and device, electronic equipment and storage medium
CN110781273B (en) * 2019-09-17 2022-05-31 华东交通大学 Text data processing method and device, electronic equipment and storage medium
CN111144130A (en) * 2019-12-26 2020-05-12 辽宁工程技术大学 Context-aware-based fine-grained emotion classification method for hybrid neural network
CN111241290B (en) * 2020-01-19 2023-05-30 车智互联(北京)科技有限公司 Comment tag generation method and device and computing equipment
CN111242083B (en) * 2020-01-21 2024-01-26 腾讯云计算(北京)有限责任公司 Text processing method, device, equipment and medium based on artificial intelligence
CN111325027B (en) * 2020-02-19 2023-04-28 东南大学 Sparse data-oriented personalized emotion analysis method and device
CN111553147A (en) * 2020-03-27 2020-08-18 南京工业大学 BERT model based on N-gram and semantic segmentation method
CN111897954B (en) * 2020-07-10 2024-04-02 西北大学 User comment aspect mining system, method and storage medium
CN112199956B (en) * 2020-11-02 2023-03-24 天津大学 Entity emotion analysis method based on deep representation learning
CN112285664B (en) * 2020-12-18 2021-04-06 南京信息工程大学 Method for evaluating countermeasure simulation confidence of radar-aircraft system
CN112597302B (en) * 2020-12-18 2022-04-29 东北林业大学 False comment detection method based on multi-dimensional comment representation
CN113268592B (en) * 2021-05-06 2022-08-05 天津科技大学 Short text object emotion classification method based on multi-level interactive attention mechanism
CN113379032A (en) * 2021-06-08 2021-09-10 全球能源互联网研究院有限公司 Layered bidirectional LSTM sequence model training method and system


Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105335352A (en) * 2015-11-30 2016-02-17 武汉大学 Entity identification method based on Weibo emotion
CN107168945A (en) * 2017-04-13 2017-09-15 广东工业大学 A kind of bidirectional circulating neutral net fine granularity opinion mining method for merging multiple features

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
High dimensional data classification and feature selection using support vector machines; Bissan Ghaddar et al.; European Journal of Operational Research; 2018-03-16; vol. 265, no. 3; pp. 993-1004 *
Research on multi-strategy fine-grained emotion analysis of Chinese microblogs; Ouyang Chunping et al.; Journal of Peking University (Natural Science Edition); 2014-01-31; vol. 50, no. 1; pp. 67-72 *

Also Published As

Publication number Publication date
CN108984724A (en) 2018-12-11

Similar Documents

Publication Publication Date Title
CN108984724B (en) Method for improving emotion classification accuracy of specific attributes by using high-dimensional representation
CN107992597B (en) Text structuring method for power grid fault case
Xiang et al. A convolutional neural network-based linguistic steganalysis for synonym substitution steganography
CN110569508A (en) Method and system for classifying emotional tendencies by fusing part-of-speech and self-attention mechanism
CN115794999B (en) Patent document query method based on diffusion model and computer equipment
Zhang et al. Multi-modal multi-label emotion recognition with heterogeneous hierarchical message passing
CN112749274B (en) Chinese text classification method based on attention mechanism and interference word deletion
CN108536735B (en) Multi-mode vocabulary representation method and system based on multi-channel self-encoder
Bae et al. Flower classification with modified multimodal convolutional neural networks
CN113051914A (en) Enterprise hidden label extraction method and device based on multi-feature dynamic portrait
CN113035311A (en) Medical image report automatic generation method based on multi-mode attention mechanism
CN111222318A (en) Trigger word recognition method based on two-channel bidirectional LSTM-CRF network
CN114417851A (en) Emotion analysis method based on keyword weighted information
CN111145914B (en) Method and device for determining text entity of lung cancer clinical disease seed bank
CN110569355B (en) Viewpoint target extraction and target emotion classification combined method and system based on word blocks
CN114048314A (en) Natural language steganalysis method
CN112560440A (en) Deep learning-based syntax dependence method for aspect-level emotion analysis
CN115631504B (en) Emotion identification method based on bimodal graph network information bottleneck
CN111382333A (en) Case element extraction method in news text sentence based on case correlation joint learning and graph convolution
CN115730232A (en) Topic-correlation-based heterogeneous graph neural network cross-language text classification method
CN115758159A (en) Zero sample text position detection method based on mixed contrast learning and generation type data enhancement
CN111723301B (en) Attention relation identification and labeling method based on hierarchical theme preference semantic matrix
CN114943216A (en) Case microblog attribute-level viewpoint mining method based on graph attention network
Lai et al. Bi-directional attention comparison for semantic sentence matching
CN114492458A (en) Multi-head attention and word co-occurrence based aspect-level emotion analysis method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant