CN111325027A - Sparse data-oriented personalized emotion analysis method and device - Google Patents


Info

Publication number
CN111325027A
Authority
CN
China
Prior art keywords
document
emotion
sentence
user
representation
Prior art date
Legal status
Granted
Application number
CN202010102417.4A
Other languages
Chinese (zh)
Other versions
CN111325027B (en)
Inventor
周德宇 (Zhou Deyu)
张朦 (Zhang Meng)
张林海 (Zhang Linhai)
Current Assignee
Southeast University
Original Assignee
Southeast University
Priority date
Filing date
Publication date
Application filed by Southeast University
Priority to CN202010102417.4A
Publication of CN111325027A
Application granted
Publication of CN111325027B
Active legal status
Anticipated expiration

Classifications

    • G06N3/045 — Combinations of networks (G06N3/04 Architecture; G06N3/02 Neural networks; G06N3/00 Computing arrangements based on biological models)
    • G06N3/08 — Neural network learning methods
    • Y02D10/00 — Energy efficient computing


Abstract

The invention discloses a sparse-data-oriented personalized emotion analysis method and device, which group users with similar scoring habits and use the grouping information to enhance user representations, thereby realizing personalized emotion analysis. The method comprises the following steps: preprocessing a document; computing an emotion scoring basis with a basic emotion analysis model based on a deep neural network; computing an emotion scoring offset and fluctuation with a group-based personalized analysis model; and combining the emotion scoring basis and the emotion scoring offset to compute the final emotion score. Compared with existing personalized emotion analysis methods, the method can learn good user representations even when a user's text data are sparse, can effectively model users in personalized emotion analysis, and performs personalized emotion analysis more accurately.

Description

Sparse data-oriented personalized emotion analysis method and device
Technical Field
The invention relates to emotion analysis of text using user text data under sparse-data conditions, and belongs to the technical field of machine learning.
Background
Sentiment analysis of user-generated text aims to compute a corresponding sentiment score (e.g., satisfaction) from text composed by a user (e.g., a tweet or a shopping review). Traditional sentiment analysis methods assume that the mapping between text and sentiment score is the same for all users, i.e., they do not distinguish individual differences between users. However, this assumption does not match reality: users differ in educational background, social experience, and so on, so their modes of emotional expression differ, and user-specific personalized sentiment analysis is necessary. Existing personalized sentiment analysis methods generally represent each user with a fixed-dimension user vector, typically randomly initialized and then learned by the network; this style of user representation depends strongly on the data and the network. According to network statistics, most Twitter users tweet rarely, and nearly 80% of tweets are sent by 10% of active users. This means that sparse user data is common in real life, so solving personalized sentiment analysis under data sparsity has important practical significance.
Disclosure of Invention
The purpose of the invention is as follows: to overcome the deficiencies of the prior art, the invention provides a sparse-data-oriented personalized emotion analysis method and device that address the data-sparsity problem in current personalized emotion analysis.
The technical scheme is as follows: to achieve the above purpose, the sparse-data-oriented personalized emotion analysis method provided by the invention comprises the following steps:
(1) preprocessing a document;
(2) using a basic emotion analysis model based on a deep neural network, taking words of a document as input, respectively calculating semantic representation of each sentence in the document and semantic representation of the document through sentence-level semantic representation learning and document-level semantic representation learning, and taking a numerical value obtained by mapping the semantic representation of the document as an emotion scoring basis;
(3) using a group-based personalized emotion analysis model, taking semantic representation, user vectors and global group vectors of a document obtained by a deep neural network-based basic emotion analysis model as input, respectively calculating and obtaining user representation of each sentence and user representation of the document in the document through sentence-level user representation learning and document-level user representation learning, cascading the user representation of the document and the semantic representation obtained by the deep neural network-based basic emotion analysis model as final representation of the document, and mapping the final representation of the document to two values to be used as emotion scoring offset and fluctuation respectively; the emotion scoring offset is used for final scoring calculation, and emotion scoring fluctuation is used for network optimization;
(4) adding the emotion scoring basis and the emotion scoring offset to obtain the final emotion score.
Further, the document preprocessing in step (1) comprises: segmenting the document into words, and filtering out stop words and words that appear only once in the data set.
Further, the step (2) of calculating the emotion scoring basis by using the basic emotion analysis model based on the deep neural network comprises the following steps:
(2.1) for each word in the sentence, mapping it to a pre-trained word vector, then encoding each word in the sentence with a bidirectional long short-term memory network (Bi-LSTM) to obtain its corresponding hidden state; calculating a weight for each word using an attention mechanism; and finally taking the weighted sum over the words to obtain the semantic representation of each sentence;
(2.2) regarding each sentence in the document, taking semantic representation of the sentence as input, and coding each sentence in the document by utilizing Bi-LSTM to obtain a corresponding hidden state of each sentence; calculating a weight for each sentence using an attention mechanism; finally, weighting and summing each sentence to obtain semantic representation of the document;
(2.3) using a multi-layer perceptron to map the document-level semantic representation to a numerical value, namely the emotion scoring basis.
Further, the calculating of the emotion scoring offset and fluctuation by using the group-based personalized emotion analysis model in the step (3) comprises:
(3.1) calculating to obtain the user hidden state of each word on the basis of the hidden state of each word in the Bi-LSTM, the group global vector and the user vector corresponding to the document; calculating the weight of the user hidden state corresponding to each word by using an attention mechanism; finally, weighting and summing the hidden states of the users corresponding to each word to obtain the user representation of the sentence;
(3.2) calculating to obtain the user hidden state of each sentence on the basis of the hidden state of each sentence, the group global vector and the sentence user representation in the Bi-LSTM; calculating a weight of the hidden state of each sentence user using an attention mechanism; finally, weighting and summing the hidden states of the users of each sentence to obtain the user representation of the document;
(3.3) concatenating the semantic representation of the document and the user representation as a final representation of the document;
(3.4) using two multi-layer perceptrons to map the final document representation to two numerical values, namely the emotion scoring offset and the emotion scoring fluctuation.
Further, the user representation of the sentence is:

v_i = Σ_j α^u_{ij} p_{ij}

where p_{ij} is the enhanced user hidden state of word w_{ij} and α^u_{ij} is its attention weight (the detailed formulas appear as images in the original publication), e_k is the global vector of the k-th group, h_{ij} is the hidden state corresponding to word w_{ij}, u is the user vector corresponding to the document, the various W and b are model parameters, softmax(·) is the normalized exponential (softmax) function, and tanh(·) is the hyperbolic tangent activation function.
Further, the user representation of the document is:

r_u = Σ_i α^v_i q_i

where q_i is the enhanced user hidden state of sentence s_i and α^v_i is its attention weight (the detailed formulas appear as images in the original publication), h_i is the hidden state corresponding to sentence s_i, and the various W and b are model parameters.
Further, optimizing the network with a joint loss comprises: using a mean-square-error loss for the basic emotion analysis model based on the deep neural network; using a Gaussian penalty loss for the group-based personalized emotion analysis model, so that the emotion fluctuation is learned from the loss function and the influence of samples with excessive fluctuation on the network is reduced; adding a group-vector-based penalty term so that the learned group vectors are discriminative; and adding an L2 regularization term on the network parameters to avoid overfitting.
Further, the loss function of the network is defined as:

L = (1/T) Σ_t (y*_t − y^b_t)² + (1/T) Σ_t [ (y*_t − y_t)² / (2σ_t²) + (1/2) log σ_t² ] + ‖E Eᵀ − I‖²_F + λ‖Θ‖²

where λ‖Θ‖² is the L2 regularization term on the network parameters, T is the number of samples, y*_t is the true sentiment score of the t-th document, y^b_t is the output of the basic emotion analysis model based on the deep neural network for the t-th document, y_t is the final emotion score output by the group-based personalized emotion analysis model for the t-th document, σ_t² is the emotion scoring fluctuation output by that model for the t-th document, I is an identity matrix, E = {e_1, …, e_K} is the matrix of group vectors, and ‖·‖_F is the Frobenius norm of a matrix (the equation is an image in the original publication; the form above follows the textual description).
Based on the same inventive concept, the sparse-data-oriented personalized emotion analysis device comprises a memory, a processor, and a computer program stored on the memory and runnable on the processor; when loaded into the processor, the computer program implements the above sparse-data-oriented personalized emotion analysis method.
Beneficial effects: compared with existing personalized emotion analysis methods, the method effectively addresses the sparse-user-data problem that is ubiquitous in real-life personalized emotion analysis; modeling a user's emotion score as a Gaussian distribution and accounting for the offset and fluctuation of the user's emotion score improves personalized emotion analysis performance.
Drawings
FIG. 1 is a flow chart of a method in an embodiment of the invention.
Fig. 2 is a schematic diagram of the sentence-level user representation learning computation in the user encoder (U-Encoder) in the embodiment of the present invention.
Detailed Description
The present invention is further illustrated below in conjunction with the accompanying drawings and specific embodiments. It should be understood that these examples are given solely for the purpose of illustration and are not intended to limit the scope of the invention; various equivalent modifications that occur to those skilled in the art upon reading the present invention fall within the scope of the appended claims.
The problem can be described as follows: given a document and its user u, the personalized emotion analysis task is to predict the emotion score y corresponding to the document (e.g., a satisfaction score from 1 to 5, where 1 indicates dissatisfaction and 5 indicates satisfaction). Since users share a generalized mode of emotional expression, an emotion scoring basis y_b can be obtained; because of individual differences, a user also has an emotion scoring offset y_s and a fluctuation σ². The predicted emotion score can therefore be modeled as a value following the Gaussian distribution

y ~ N(y_b + y_s, σ²)

so the personalized sentiment analysis problem can be converted into the problem of predicting y_b, y_s and σ².
The sparse-data-oriented personalized emotion analysis method disclosed by the embodiment of the invention, as shown in Fig. 1, mainly comprises the following steps:
S1: document preprocessing: segment the document into words and remove stop words and words appearing only once in the data set, obtaining a processed document d comprising M sentences, where each sentence c_i comprises N_i words.
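The preprocessing in S1 can be sketched as follows. This is a minimal stand-in: the stop-word list and the naive sentence/word splitting are illustrative assumptions, not the actual segmenter used by the invention.

```python
import re
from collections import Counter

# Illustrative stop-word subset (the patent does not specify a list)
STOP_WORDS = {"the", "a", "an", "of", "to", "and", "is", "in", "it"}

def preprocess(documents):
    """Tokenize each document into sentences and words, then drop stop words
    and words that occur only once in the whole data set."""
    # Naive sentence split on '.' and lowercase word tokenization stand in
    # for a real segmenter here.
    tokenized = [
        [re.findall(r"[a-z']+", s.lower()) for s in doc.split(".") if s.strip()]
        for doc in documents
    ]
    counts = Counter(w for doc in tokenized for sent in doc for w in sent)
    return [
        [[w for w in sent if w not in STOP_WORDS and counts[w] > 1] for sent in doc]
        for doc in tokenized
    ]
```

Each returned document is a list of sentences, each sentence a list of surviving words, matching the d / c_i / N_i structure above.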
S2: compute the emotion scoring basis using the basic emotion analysis model based on a deep neural network, specifically:
(1) Sentence-level semantic representation learning. Each word in a sentence is first mapped to a pre-trained word vector, so a sentence can be represented as c_i = {x_{i1}, x_{i2}, …, x_{iN_i}}. Each word in the sentence is then sequence-encoded with a Bi-LSTM:

h→_{ij} = LSTM→(x_{ij}, h→_{i(j−1)})
h←_{ij} = LSTM←(x_{ij}, h←_{i(j+1)})

The forward and backward states are then concatenated to obtain the hidden state of word w_{ij}: h_{ij} = [h→_{ij}; h←_{ij}].
Since each word contributes differently to the semantic representation of the sentence, an attention mechanism is used so that important words in the sentence receive higher weights:

α_{ij} = softmax(u_w^T tanh(W_w h_{ij} + b_w))

where α_{ij} is the weight of word w_{ij}, and W_w, b_w, u_w are model parameters. The semantic representation s_i of the sentence is then obtained as the weighted sum of the word hidden states:

s_i = Σ_j α_{ij} h_{ij}
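The tanh-attention pooling used here — score each hidden state, normalize the scores with a softmax, and take the weighted sum — can be sketched in plain Python. Parameter shapes are illustrative; a real implementation would use a tensor library.

```python
import math

def attention_pool(hidden_states, W, b, u):
    """Tanh attention pooling: score each hidden state h with u^T tanh(W h + b),
    softmax the scores, and return the weighted sum plus the weights.
    W: d_a x d matrix, b: d_a vector, u: d_a vector (model parameters)."""
    def matvec(M, v):
        return [sum(m_i * v_i for m_i, v_i in zip(row, v)) for row in M]

    scores = []
    for h in hidden_states:
        z = [math.tanh(x + b_i) for x, b_i in zip(matvec(W, h), b)]
        scores.append(sum(u_i * z_i for u_i, z_i in zip(u, z)))
    m = max(scores)                       # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    alphas = [e / total for e in exps]    # softmax weights
    d = len(hidden_states[0])
    pooled = [sum(a * h[k] for a, h in zip(alphas, hidden_states)) for k in range(d)]
    return pooled, alphas
```

The same routine covers both the word-level pooling into s_i and the sentence-level pooling into r_b below, with different parameters.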
(2) Document-level semantic representation learning. A Bi-LSTM encodes the sentence representations of the document:

h→_i = LSTM→(s_i, h→_{i−1})
h←_i = LSTM←(s_i, h←_{i+1})

which are concatenated to obtain the hidden state of sentence s_i: h_i = [h→_i; h←_i].
Since different sentences contribute differently to the final semantic representation of the document, an attention mechanism further computes the weight of each sentence:

α_i = softmax(u_s^T tanh(W_s h_i + b_s))

where α_i is the weight of sentence s_i, and W_s, b_s, u_s are model parameters. The document semantic representation r_b is then the weighted sum of the sentence hidden states:

r_b = Σ_i α_i h_i
(3) Computing the emotion scoring basis. A multi-layer perceptron maps the document-level semantic representation to a numerical value, namely the emotion scoring basis:
yb=MLP(rb)
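A scalar MLP head of the kind that maps a representation vector to a single score can be sketched as follows; the layer sizes and the tanh hidden activation are assumptions for illustration, as the patent does not specify the perceptron's architecture.

```python
import math

def mlp_scalar(x, W1, b1, w2, b2):
    """Minimal multi-layer perceptron head: one tanh hidden layer followed by
    a linear output, mapping vector x to a single score."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(W1, b1)]
    return sum(w * h for w, h in zip(w2, hidden)) + b2
```

The same head shape, with separate parameters, serves for y_b, and later for the offset y_s and fluctuation σ².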
s3: and calculating the emotion scoring offset and fluctuation by using a group-based personalized emotion analysis model. Assume that there are a total of K groups, each group having a corresponding global vector ekThis vector is initialized randomly and learned by network adaptation. The method specifically comprises the following steps:
(1) Sentence-level user representation learning. On the basis of the hidden state h_{ij} of each word and the user vector u corresponding to the document, a user encoder (U-Encoder) is designed to enhance the user representation, as shown in Fig. 2. It comprises two parts: computing the group representation g_{ij} corresponding to word w_{ij}, and computing an enhanced user representation p_{ij}.
First, to model the relationship between words and groups, an attention mechanism computes the group representation g_{ij} of word w_{ij}; the more similar a word is to a group's habits, the higher that group's weight:

β_{ijk} = softmax_k(e_k^T tanh(W_g h_{ij} + b_g)),  g_{ij} = Σ_k β_{ijk} e_k

Then, since a user representation learned under sparse data may be unreliable, the group representation is used to enhance the user representation through a gate:

g^u_{ij} = σ(W_1 u + W_2 g_{ij} + b_u),  p_{ij} = g^u_{ij} ⊙ u + (1 − g^u_{ij}) ⊙ g_{ij}

where p_{ij} is the enhanced user representation of each word, the gate g^u_{ij} controls the influence of g_{ij} on p_{ij}, and W_g, b_g, W_1, W_2, b_u are model parameters (the exact formulas are images in the original publication; the forms above follow the textual description). It should be noted that a conventional recurrent neural network computes the hidden state of each word sequentially, whereas the U-Encoder has no sequential structure, so the user representations can be computed in parallel, effectively improving computational performance.
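The U-Encoder step described above can be sketched as follows. Only the attend-over-groups-then-gate structure comes from the text; the exact parametrization of the attention scores and the gate (here a single shared scalar parameter, w_att, a hypothetical simplification) is an assumption.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def enhance_user_rep(u, group_vectors, h, w_att):
    """Hypothetical U-Encoder sketch: (1) attend over the K group vectors using
    the word's hidden state h to get a group representation g; (2) gate
    elementwise between the (possibly unreliable) user vector u and g."""
    # (1) group attention: weight each group vector by dot-product similarity to h
    scores = [sum(e_i * h_i for e_i, h_i in zip(e, h)) for e in group_vectors]
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    betas = [x / sum(exps) for x in exps]
    d = len(u)
    g = [sum(b * e[k] for b, e in zip(betas, group_vectors)) for k in range(d)]
    # (2) elementwise gate; w_att stands in for learned gate parameters
    gate = [sigmoid(w_att * (u_k + g_k)) for u_k, g_k in zip(u, g)]
    return [z * u_k + (1 - z) * g_k for z, u_k, g_k in zip(gate, u, g)]
```

Because each word's enhanced representation depends only on its own hidden state, the loop over words can run in parallel, which is the point made about the U-Encoder's non-sequential structure.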
Since different words contribute differently to the user representation of the sentence, an attention mechanism further computes the weight of the user hidden state of each word:

α^u_{ij} = softmax(u_u^T tanh(W_u p_{ij} + b'_u))

where α^u_{ij} is the weight of the user representation p_{ij}, and W_u, b'_u, u_u are model parameters. Finally, the user hidden states of the words are weighted and summed to obtain the user representation v_i of the sentence:

v_i = Σ_j α^u_{ij} p_{ij}
(2) Document-level user representation learning. On the basis of the hidden state h_i of each sentence and the corresponding user representation v_i, the U-Encoder is applied again, comprising: computing the group representation g_i of sentence s_i, and computing an enhanced user representation q_i.
First, to model the relationship between sentences and groups, an attention mechanism computes the group representation g_i of sentence s_i:

β_{ik} = softmax_k(e_k^T tanh(W'_g h_i + b'_g)),  g_i = Σ_k β_{ik} e_k

Then, since the learned user representation may be unreliable under sparse data, the group representation enhances the user representation through a gate:

g^v_i = σ(W_3 v_i + W_4 g_i + b_v),  q_i = g^v_i ⊙ v_i + (1 − g^v_i) ⊙ g_i

where q_i is the enhanced user representation of each sentence, the gate g^v_i controls the influence of g_i on q_i, and W'_g, b'_g, W_3, W_4, b_v are model parameters (the exact formulas are images in the original publication; the forms above follow the textual description).
Since different sentences contribute differently to the user representation of the document, an attention mechanism further computes the weight of the user hidden state of each sentence:

α^v_i = softmax(u_v^T tanh(W_v q_i + b''_v))

where α^v_i is the weight of the user representation q_i, and W_v, b''_v, u_v are model parameters. Finally, the user hidden states of the sentences are weighted and summed to obtain the user representation r_u of the document:

r_u = Σ_i α^v_i q_i
(3) Computing the emotion scoring offset and fluctuation. The user representation r_u of the document and the semantic representation r_b obtained by the basic emotion analysis model based on the deep neural network are concatenated as the final document representation, which two multi-layer perceptrons map to the offset and fluctuation respectively:

y_s = MLP_1([r_u, r_b])
σ² = MLP_2([r_u, r_b])
s4: base y combined with sentiment scoringbSentiment score offset ysAnd emotional score fluctuation sigma2The final prediction result can be obtained
Figure BDA00023873031800000816
Due to yb+ysIs an unbiased estimate of y, so will yb+ysAs the value of y, i.e. y ═ yb+ys. And the emotional expression fluctuation is further used as a constraint in the optimization of the model.
Since the present invention can be considered a multi-task architecture, a joint loss is used to optimize the network. For the traditional generalized emotion analysis model, the mean square error serves as the loss function:

L_base = (1/T) Σ_t (y*_t − y^b_t)²

where T is the number of samples, y*_t is the true sentiment score of the t-th document, and y^b_t is the output of the basic emotion analysis model based on the deep neural network for the t-th document.
For the group-based personalized emotion analysis model, a Gaussian penalty function serves as the loss, comprising two parts: (1) a residual regression term weighted by the emotion fluctuation, so that the fluctuation is learned from the loss function and samples with excessive fluctuation influence the network less; and (2) a regularization term on the emotion fluctuation that prevents the predicted fluctuation from growing too large. It is defined as:

L_user = (1/T) Σ_t [ (y*_t − y_t)² / (2σ_t²) + (1/2) log σ_t² ]

where y_t is the final emotion score output by the group-based personalized emotion analysis model for the t-th document, and σ_t² is the emotion scoring fluctuation it outputs for the t-th document (the equation is an image in the original publication; the form above follows the textual description).
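A standard heteroscedastic Gaussian loss with exactly these two parts — a residual term down-weighted by the predicted fluctuation, plus a log-variance regularizer — can be sketched as follows. Since the patent renders its loss as an image, this form is an assumption consistent with the textual description.

```python
import math

def gaussian_penalty_loss(y_true, y_pred, var_pred):
    """Mean over samples of (residual^2 / (2 sigma^2)) + 0.5 * log(sigma^2).
    Large predicted fluctuation shrinks a sample's residual term, while the
    log term keeps the fluctuation from growing without bound."""
    T = len(y_true)
    return sum((yt - yp) ** 2 / (2.0 * v) + 0.5 * math.log(v)
               for yt, yp, v in zip(y_true, y_pred, var_pred)) / T
```

Note the trade-off: a sample with a large residual can be down-weighted by predicting a larger σ², but only at the cost of the log-variance penalty.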
In addition, to make the global vectors of different groups distinguishable, a group-vector-based penalty term is further introduced:

L_group = ‖E Eᵀ − I‖²_F

where I is the identity matrix, E = {e_1, …, e_K} is the matrix of group vectors, and ‖·‖_F is the Frobenius norm of a matrix.
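The penalty can be sketched directly from its definition (here the squared Frobenius norm of E Eᵀ − I for a K × d matrix of group vectors, which is zero exactly when the group vectors are orthonormal):

```python
def group_orthogonality_penalty(E):
    """||E E^T - I||_F^2 for a K x d matrix E of group vectors: pushes distinct
    group vectors toward orthogonality and each toward unit norm, keeping the
    learned groups discriminative."""
    K = len(E)
    total = 0.0
    for i in range(K):
        for j in range(K):
            dot = sum(a * b for a, b in zip(E[i], E[j]))
            target = 1.0 if i == j else 0.0
            total += (dot - target) ** 2
    return total
```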
The overall loss function of the network is then defined as:

L = L_base + L_user + L_group + λ‖Θ‖²

where λ‖Θ‖² is the L2 regularization term on the network parameters Θ.
In the experiments, the parameters were set as follows: pre-trained GloVe vectors serve as word embeddings, and the dimensions of the word embeddings, the user vectors, and the word hidden states are all 300. The dropout rate is 0.5, the learning rate is 0.001, and the L2 regularization weight is 0.01. The network is optimized with Adam, and the number of groups K is 6. On the user-data-sparse subset of Yelp 2013, the method achieves an RMSE of 0.7166 and an MAE of 0.5501, outperforming existing personalized emotion analysis methods.
Compared with existing methods, the sparse-data-oriented personalized emotion analysis method takes into account the sparse-user-data problem that is ubiquitous in real-life personalized emotion analysis, and further enhances user representations by grouping users with similar modes of emotional expression and exploiting the group information. Moreover, whereas earlier methods generally treat the emotion score of a user's text as a single numerical value, this method models a user's emotion score as a Gaussian distribution and accounts for the offset and fluctuation of the score, which helps improve personalized emotion analysis performance.
Based on the same inventive concept, the sparse-data-oriented personalized emotion analysis device disclosed by the embodiment of the invention comprises a memory, a processor, and a computer program stored on the memory and runnable on the processor; when loaded into the processor, the computer program implements the above sparse-data-oriented personalized emotion analysis method.
The above description is only of the preferred embodiments of the present invention, and it should be noted that: it will be apparent to those skilled in the art that various modifications and adaptations can be made without departing from the principles of the invention and these are intended to be within the scope of the invention.

Claims (9)

1. A sparse data-oriented personalized emotion analysis method is characterized by comprising the following steps:
(1) preprocessing a document;
(2) using a basic emotion analysis model based on a deep neural network, taking words of a document as input, respectively calculating semantic representation of each sentence in the document and semantic representation of the document through sentence-level semantic representation learning and document-level semantic representation learning, and taking a numerical value obtained by mapping the semantic representation of the document as an emotion scoring basis;
(3) using a group-based personalized emotion analysis model, taking semantic representation, user vectors and global group vectors of a document obtained by a deep neural network-based basic emotion analysis model as input, respectively calculating and obtaining user representation of each sentence and user representation of the document in the document through sentence-level user representation learning and document-level user representation learning, cascading the user representation of the document and the semantic representation obtained by the deep neural network-based basic emotion analysis model as final representation of the document, and mapping the final representation of the document to two values to be used as emotion scoring offset and fluctuation respectively; the emotion scoring offset is used for final scoring calculation, and emotion scoring fluctuation is used for network optimization;
(4) adding the emotion scoring basis and the emotion scoring offset to obtain the final emotion score.
2. The sparse-data-oriented personalized emotion analysis method of claim 1, wherein the document preprocessing in step (1) comprises: the method comprises the steps of segmenting words of a document, and filtering out stop words in the document and words which only appear once in a processed data set.
3. The sparse data-oriented personalized emotion analysis method of claim 1, wherein the step (2) of calculating the emotion scoring basis by using the deep neural network-based basic emotion analysis model comprises:
(2.1) for each word in the sentence, mapping it to a pre-trained word vector, then encoding each word in the sentence with a bidirectional long short-term memory network (Bi-LSTM) to obtain its corresponding hidden state; calculating a weight for each word using an attention mechanism; and finally taking the weighted sum over the words to obtain the semantic representation of each sentence;
(2.2) regarding each sentence in the document, taking semantic representation of the sentence as input, and coding each sentence in the document by utilizing Bi-LSTM to obtain a corresponding hidden state of each sentence; calculating a weight for each sentence using an attention mechanism; finally, weighting and summing each sentence to obtain semantic representation of the document;
(2.3) using a multi-layer perceptron to map the document-level semantic representation to a numerical value, namely the emotion scoring basis.
4. The sparse data-oriented personalized emotion analysis method of claim 1, wherein calculating the emotion scoring offset and fluctuation with the group-based personalized emotion analysis model in step (3) comprises:
(3.1) computing the user hidden state of each word from the hidden state of each word in the Bi-LSTM, the group global vector, and the user vector corresponding to the document; calculating a weight for the user hidden state corresponding to each word using an attention mechanism; and finally computing the weighted sum over the user hidden states of the words to obtain the user representation of the sentence;
(3.2) computing the user hidden state of each sentence from the hidden state of each sentence in the Bi-LSTM, the group global vector, and the user representation of the sentence; calculating a weight for the user hidden state of each sentence using an attention mechanism; and finally computing the weighted sum over the user hidden states of the sentences to obtain the user representation of the document;
(3.3) concatenating the semantic representation and the user representation of the document as the final representation of the document;
(3.4) mapping the final document representation to two numerical values, namely the emotion scoring offset and the emotion scoring fluctuation, using two multi-layer perceptrons.
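Steps (3.3)–(3.4) can be sketched as follows, with random vectors standing in for the learned representations; the two-layer perceptron shape and the softplus used to keep the fluctuation positive are our assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
d = 8  # illustrative hidden size

def mlp(x, W1, b1, W2, b2):
    """Two-layer perceptron mapping a vector to a single scalar."""
    return float(np.tanh(x @ W1 + b1) @ W2 + b2)

def head(in_dim, hidden=4):
    """Randomly initialized parameters for one perceptron head."""
    return (rng.normal(size=(in_dim, hidden)), np.zeros(hidden),
            rng.normal(size=hidden), 0.0)

doc_semantic = rng.normal(size=d)  # semantic representation (step 2)
doc_user = rng.normal(size=d)      # user representation (step 3.2)

# Step (3.3): concatenate the two representations into the final one.
final_repr = np.concatenate([doc_semantic, doc_user])  # shape (2d,)

# Step (3.4): two separate perceptrons produce the offset and the fluctuation;
# softplus keeps the fluctuation (a variance-like quantity) strictly positive.
offset = mlp(final_repr, *head(2 * d))
fluctuation = np.log1p(np.exp(mlp(final_repr, *head(2 * d))))  # softplus
```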
5. The sparse data-oriented personalized emotion analysis method of claim 4, wherein the user representation of the sentence is:

[formula rendered as an image in the source; not recoverable as text]

wherein e_k is the global vector of the k-th group, [symbol] is the hidden state corresponding to the word w_ij, u is the user vector corresponding to the document, the remaining symbols are model parameters, softmax(·) is the normalized logistic regression (softmax) function, and tanh(·) is the hyperbolic tangent activation function.
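The formula for the sentence-level user representation in claim 5 is an image in the source, so the sketch below shows one plausible reading of the described computation in NumPy: the word hidden states are fused with the user vector u and the group global vector e_k (an additive fusion chosen by us; the exact parameterization is not recoverable), then attention-pooled. Random arrays stand in for the Bi-LSTM hidden states.

```python
import numpy as np

rng = np.random.default_rng(2)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

d, T = 8, 5                   # hidden size, words in the sentence (illustrative)
H = rng.normal(size=(T, d))   # Bi-LSTM hidden states of the words (stand-ins)
u = rng.normal(size=d)        # user vector corresponding to the document
e_k = rng.normal(size=d)      # global vector of the k-th group

Wh, Wu, We = (rng.normal(size=(d, d)) for _ in range(3))
v = rng.normal(size=d)        # attention scoring vector

# User hidden state of each word: fuse the word's hidden state with the
# user vector and the group global vector (one plausible additive fusion).
Hu = np.tanh(H @ Wh + u @ Wu + e_k @ We)   # shape (T, d)

# Attention weights over the user hidden states; the weighted sum is the
# user representation of the sentence.
alpha = softmax(Hu @ v)
sent_user_repr = alpha @ Hu
```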
6. The sparse data-oriented personalized emotion analysis method of claim 5, wherein the user representation of the document is:

[formula rendered as an image in the source; not recoverable as text]

wherein [symbol] is the hidden state corresponding to the sentence s_i, and the remaining symbols are model parameters.
7. The sparse data-oriented personalized emotion analysis method of claim 1, wherein a joint loss is used to optimize the network, comprising: a mean square error loss for the basic emotion analysis model based on the deep neural network; a Gaussian penalty loss for the group-based personalized emotion analysis model, so that the emotion fluctuation is learned through the loss function and the influence of samples with excessive fluctuation on the network is reduced; a penalty term based on the group vectors, so that the learned group vectors are discriminative; and an L2 regularization term on the network parameters to avoid overfitting.
8. The sparse data-oriented personalized emotion analysis method of claim 7, wherein the loss function of the network is defined as:

[loss function rendered as an image in the source; not recoverable as text]

wherein λ‖Θ‖₂ is the L2 regularization term on the network parameters Θ, T is the number of samples, [symbol] is the true emotion score of the t-th document, [symbol] is the output of the basic emotion analysis model based on the deep neural network for the t-th document, y_t is the final emotion score output by the group-based personalized emotion analysis model for the t-th document, [symbol] is the emotion scoring fluctuation output by the group-based personalized emotion analysis model for the t-th document, I is an identity matrix, E = {e_1, …, e_K} is the matrix of group vectors, and ‖·‖_F is the Frobenius norm of the matrix.
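Claims 7 and 8 describe the joint loss in prose while the formula itself is an image. The sketch below implements one consistent reading: MSE for the base model; a Gaussian negative-log-likelihood-style penalty in which a large predicted fluctuation down-weights a sample's squared error but is itself penalized through the log term; an orthogonality-style penalty pushing group vectors to be discriminative; and L2 regularization. The names `sigma`, `lam`, and `mu` and the exact penalty forms are our assumptions.

```python
import numpy as np

def joint_loss(y_true, y_base, y_pers, sigma, E, theta, lam=1e-4, mu=1e-3):
    """Joint loss sketch: MSE + Gaussian penalty + group-vector penalty + L2.

    y_true: true emotion scores, shape (T,)
    y_base: outputs of the basic deep-neural-network model, shape (T,)
    y_pers: final scores of the group-based personalized model, shape (T,)
    sigma:  predicted emotion scoring fluctuations, shape (T,), positive
    E:      matrix of group vectors, shape (K, dim)
    theta:  flattened network parameters
    """
    # Mean square error loss for the basic emotion analysis model.
    mse = np.mean((y_base - y_true) ** 2)
    # Gaussian penalty: a large fluctuation sigma down-weights that sample's
    # squared error, but is itself penalized through the log term.
    gauss = np.mean((y_pers - y_true) ** 2 / (2 * sigma ** 2)
                    + 0.5 * np.log(sigma ** 2))
    # Discriminability penalty: push group vectors toward orthonormality.
    K = E.shape[0]
    ortho = np.linalg.norm(E @ E.T - np.eye(K), ord="fro") ** 2
    # L2 regularization on the network parameters.
    l2 = lam * np.sum(theta ** 2)
    return mse + gauss + mu * ortho + l2

# Toy check: orthonormal group vectors, unit fluctuation, zero parameters.
y = np.zeros(4)
loss = joint_loss(y, y + 0.1, y + 0.1, np.ones(4), np.eye(3, 5), np.zeros(10))
```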
9. A sparse data-oriented personalized emotion analysis apparatus, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the computer program, when loaded into the processor, implements the sparse data-oriented personalized emotion analysis method according to any one of claims 1-8.
CN202010102417.4A 2020-02-19 2020-02-19 Sparse data-oriented personalized emotion analysis method and device Active CN111325027B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010102417.4A CN111325027B (en) 2020-02-19 2020-02-19 Sparse data-oriented personalized emotion analysis method and device

Publications (2)

Publication Number Publication Date
CN111325027A true CN111325027A (en) 2020-06-23
CN111325027B CN111325027B (en) 2023-04-28

Family

ID=71171083

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010102417.4A Active CN111325027B (en) 2020-02-19 2020-02-19 Sparse data-oriented personalized emotion analysis method and device

Country Status (1)

Country Link
CN (1) CN111325027B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108363753A (en) * 2018-01-30 2018-08-03 南京邮电大学 Comment text sentiment classification model is trained and sensibility classification method, device and equipment
CN108446275A (en) * 2018-03-21 2018-08-24 北京理工大学 Long text emotional orientation analytical method based on attention bilayer LSTM
CN108804417A (en) * 2018-05-21 2018-11-13 山东科技大学 A kind of documentation level sentiment analysis method based on specific area emotion word
CN108984724A (en) * 2018-07-10 2018-12-11 凯尔博特信息科技(昆山)有限公司 It indicates to improve particular community emotional semantic classification accuracy rate method using higher-dimension
CN110334187A (en) * 2019-07-09 2019-10-15 昆明理工大学 Burmese sentiment analysis method and device based on transfer learning

Non-Patent Citations (1)

Title
HU JUNYI ET AL.: "Hierarchical text representation sentiment classification method based on sentiment scoring", Computer Engineering *

Cited By (2)

Publication number Priority date Publication date Assignee Title
CN112990430A (en) * 2021-02-08 2021-06-18 辽宁工业大学 Group division method and system based on long-time and short-time memory network
CN112990430B (en) * 2021-02-08 2021-12-03 辽宁工业大学 Group division method and system based on long-time and short-time memory network

Also Published As

Publication number Publication date
CN111325027B (en) 2023-04-28

Similar Documents

Publication Publication Date Title
Jin et al. Multi-task learning model based on multi-scale CNN and LSTM for sentiment classification
CN108536679B (en) Named entity recognition method, device, equipment and computer readable storage medium
Qian et al. Hierarchical CVAE for fine-grained hate speech classification
CN110796160B (en) Text classification method, device and storage medium
KR20180125905A (en) Method and apparatus for classifying a class to which a sentence belongs by using deep neural network
CN111061843A (en) Knowledge graph guided false news detection method
CN109189933A (en) A kind of method and server of text information classification
CN112749274B (en) Chinese text classification method based on attention mechanism and interference word deletion
CN112883714B (en) ABSC task syntactic constraint method based on dependency graph convolution and transfer learning
CN112527966A (en) Network text emotion analysis method based on Bi-GRU neural network and self-attention mechanism
CN109214006A (en) The natural language inference method that the hierarchical semantic of image enhancement indicates
CN111538841B (en) Comment emotion analysis method, device and system based on knowledge mutual distillation
Zhang et al. A BERT fine-tuning model for targeted sentiment analysis of Chinese online course reviews
CN114528490B (en) Self-supervision sequence recommendation method based on long-term and short-term interests of user
CN113704460A (en) Text classification method and device, electronic equipment and storage medium
CN111259147B (en) Sentence-level emotion prediction method and system based on self-adaptive attention mechanism
Luo et al. EmotionX-DLC: self-attentive BiLSTM for detecting sequential emotions in dialogue
CN111046157A (en) Universal English man-machine conversation generation method and system based on balanced distribution
CN111325027B (en) Sparse data-oriented personalized emotion analysis method and device
CN113761910A (en) Comment text fine-grained emotion analysis method integrating emotional characteristics
CN116644759A (en) Method and system for extracting aspect category and semantic polarity in sentence
CN111368524A (en) Microblog viewpoint sentence recognition method based on self-attention bidirectional GRU and SVM
CN113779244B (en) Document emotion classification method and device, storage medium and electronic equipment
CN115659990A (en) Tobacco emotion analysis method, device and medium
CN115309894A (en) Text emotion classification method and device based on confrontation training and TF-IDF

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant