CN113255366A

CN113255366A - Aspect-level text emotion analysis method based on heterogeneous graph neural network

Info

Publication number: CN113255366A
Application number: CN202110593991.9A
Authority: CN
Inventors: 田锋; 安文斌; 陈妍; 徐墨; 高瞻; 郭倩; 文华; 郑庆华
Original assignee: Xian Jiaotong University
Current assignee: Xian Jiaotong University
Priority date: 2021-05-28
Filing date: 2021-05-28
Publication date: 2021-08-13
Anticipated expiration: 2041-05-28
Also published as: CN113255366B

Abstract

The invention discloses an aspect-level text emotion analysis method based on a heterogeneous graph neural network, and belongs to the field of language processing. According to the co-occurrence relation of words and sentences in the text and the evaluation aspect contained in the sentences, a three-level graph structure network in the word-sentence-evaluation aspect is constructed; then obtaining initial embedded vector representation of each node; and then using the parameters of the graph attention network training model, continuously updating the embedded vector representation of the nodes in the graph network according to the connection relation of each node in the graph network through a multi-head attention mechanism, and finally predicting the aspect-level emotional tendency of the text. And calculating the correlation between the sentence nodes and the evaluation nodes by using a self-attention mechanism according to the finally obtained embedded vector representation of the sentence nodes and the evaluation nodes, thereby obtaining the predicted text aspect level emotional tendency. The invention effectively improves the expression capability and generalization capability of the model.

Description

Aspect-level text emotion analysis method based on heterogeneous graph neural network

Technical Field

The invention belongs to the field of language processing, and particularly relates to an aspect-level text emotion analysis method based on a heterogeneous graph neural network.

Background

Aspect-Based Sentiment Analysis (ABSA) is a fine-grained text Sentiment Analysis method, and compared with the traditional Sentiment Analysis, the ABSA can provide more detailed and rich Sentiment information. The aspect level emotion analysis is mainly used for analyzing the corresponding emotional tendency (positive, neutral and negative) of the text in different aspects, and can be divided into two sub-tasks: aspect-based sentiment classification (ATSA) and category-based sentiment classification (ACSA). For example, for the text "food is very good, i.e. the server attitude is too bad", ACSA needs to give a positive emotion for rating aspect "food" and a negative emotion for rating aspect "service".

Most models at this stage are based on attention mechanisms and neural networks. The method comprises the steps of capturing semantic information of a text by using a neural network, capturing information in an evaluation aspect by using an attention mechanism, and strengthening the attention between the text semantic and the evaluation aspect. Classified from the network structure, the existing methods can be roughly classified into the following four categories: 1. methods based on the Recurrent Neural Network (RNN), such as Wang et al, propose an attention-based long-short term memory network (LSTM) to generate an embedded representation of the evaluation aspect. 2. Convolutional Neural Network (CNN) based methods, such as Xue et al, propose convolutional gating cells to extract text features and evaluate aspect features. 3. Graph Neural Network (GNN) based approaches, such as Li et al, propose using graph neural networks to model grammatical structures of text to assist in classification. 4. Pre-training model-based methods, such as Sun, utilize a pre-training model BERT to model semantic relationships between text and evaluation aspects. The four technical schemes have the following disadvantages: first, existing models assume that input texts are independently and identically distributed, however, strong correlation often exists between texts with comment properties, which are main research objects of aspect-level emotion analysis, and ignoring the correlation results in loss of a large amount of information, so that the performance of the models is reduced. Secondly, the existing models ignore the characteristic of structural similarity between texts with the same emotion aiming at the same evaluation aspect, which results in that information between the texts cannot be shared, so that the expression capability of the models is poor. Thirdly, the existing models ignore the characteristic of semantic expression diversity among texts with the same emotion aiming at the same evaluation aspect, and the loss of the diversity information can cause the generalization capability of the models to be poor.

Disclosure of Invention

The invention aims to overcome the defects of the prior art and provides an aspect-level text emotion analysis method based on a heterogeneous graph neural network.

In order to achieve the purpose, the invention adopts the following technical scheme to realize the purpose:

an aspect level text emotion analysis method based on a heterogeneous graph neural network comprises the following steps:

(1) constructing a three-level graph network structure of a word-sentence-evaluation aspect according to the co-occurrence relation of the word and the text and the evaluation aspect related in the text;

(2) initializing the embedded vector representation of each node in the graph network structure by using a pre-trained model to respectively obtain an initial embedded representation matrix of the word node

Initial embedded representation matrix of text nodes

And initial embedded representation matrix in evaluation

(3) Continuously updating the embedded representation of each node in the graph network structure by a multi-head self-attention mechanism according to the semantic relation between the graph network structure and the text by adopting a graph attention network GAT training model so as to continuously exchange information among the nodes, thereby obtaining the embedded representation matrix of each node in the step (t +1)

Finally, the text embedded expression matrix is obtained

And an evaluation-wise embedded representation matrix

(4) Embedding a representation matrix with the text

And an evaluation-side embedded representation matrix

Calculating the correlation between the text and each emotional tendency in the evaluation aspect through a self-attention mechanism, taking the emotion with the maximum correlation as the predicted emotion of the text in the evaluation aspect, calculating the difference between the predicted emotion de-tendency and the text real emotional tendency through a loss function, and finally optimizing model parameters through back propagation until the proximity of the predicted emotional tendency and the text real emotional tendency is in a preset range to obtain a trained model;

(5) inputting the text to be classified into a trained model for feature extraction, calculating the correlation between the extracted text feature vector and the trained evaluation vector by adopting a self-attention mechanism, and finally classifying by using a softmax classifier.

Further, the graph network structure G tableShown as follows: g ═ V_w,V_s,V_a,E_ws,E_sa}；

Wherein, V_wRepresenting word nodes contained in the text; v_sRepresenting a text node; v_aRepresenting evaluation aspect nodes; e_wsRepresenting an edge between a word node and a text node, the weight of which represents the position of the word appearing in the text; e_saRepresenting an edge between the text node and the evaluation aspect node.

Further, the specific operation of initializing the word node embedding vector in step (2) is as follows:

aiming at word nodes in a graph network structure, initializing the word nodes by using a pre-trained GloVe word vector library to obtain word embedded vectors, and splicing all the word embedded vectors to obtain a word initial embedded matrix.

Further, the specific operation of initializing the text node embedding vector in the step (2) is as follows:

aiming at a text node in a graph network structure, initializing the text node by using a pre-training language model BERT to obtain an initial embedded vector, and splicing all the initial embedded vectors of the text to obtain an initial embedded matrix of the text.

Further, the specific operation of initializing the node embedding vector in the evaluation aspect in the step (2) is as follows:

aiming at evaluation nodes in a graph network structure, coding the evaluation nodes by using one-hot coding, mapping coding vectors to a feature space by using a layer of full-connection network FCN with learnable parameters to obtain initial embedded vectors of the evaluation nodes, and splicing all the evaluation initial embedded vectors to obtain an evaluation initial embedded matrix.

Further, the specific operation of updating the embedded representation of each node in the graph network structure in the step (3) is as follows:

embedding vector h for a given node in a graph network structure_iAnd a neighbor N connected to said given node_iObtaining a new embedded vector representation for a given node using a multi-head attention mechanism

The embedded representation of a given node n in step t is noted

The embedded representation of the neighbors of a given node at step t is noted

Let the embedding of a given node n at (t +1) be denoted

New embedded vector representation based on given nodes

Construction of

The relationship between the three is as follows:

based on the initial embedded matrix corresponding to the word node, the text node and the evaluation node in the graph network structure, the embedded expression matrix of each node in the step (t +1) is obtained through repeated iteration of the formula (5)

Further, in step (4), the correlation between the text and each emotional tendency in the evaluation aspect is calculated, and the calculation formula is as follows:

wherein the content of the first and second substances,

is composed of

Embedding vectors corresponding to the ith text node;

is composed of

The embedded vector corresponding to the jth evaluation aspect; beta is a_ijThe attention weight between the text node vector and the evaluation node vector is obtained;

embedding the representation for the text nodes after the attention weight weighting;

a probability distribution representing an emotional tendency of the predicted text in the current evaluation aspect; w_a,b_aIs a learnable parameter; softmax () is an exponential normalization function, and the calculation formula is as follows:

further, in the step (4), the predicted text emotional tendency distribution is carried out

With text true emotion label

Comparing, passing through cross entropy loss function

The difference between the two is calculated and the loss over all samples is the sum over all text nodes i and all evaluation-aspect nodes j:

and finally, continuously updating the model parameters through a back propagation algorithm until the degree of closeness between the predicted emotional tendency and the real emotional tendency of the text is in a preset range.

Compared with the prior art, the invention has the following beneficial effects:

the invention relates to an aspect level text emotion analysis method based on a heterogeneous graph neural network, which comprises the steps of constructing a three-level graph structure network of a word-sentence-evaluation aspect according to the co-occurrence relation of words and sentences in a text and the evaluation aspect contained in the sentences; then generating embedded representation of each node in the graph network, and respectively initializing word nodes, sentence nodes and evaluation nodes in the graph network by using a pre-trained language model so as to obtain initial embedded vector representation of each node; and then using the parameters of the graph attention network training model, continuously updating the embedded vector representation of the nodes in the graph network according to the connection relation of each node in the graph network through a multi-head attention mechanism, and finally predicting the aspect-level emotional tendency of the text. And calculating the correlation between the sentence nodes and the evaluation nodes by using a self-attention mechanism according to the finally obtained embedded vector representation of the sentence nodes and the evaluation nodes, thereby obtaining the predicted text aspect level emotional tendency. According to the method, the structural similarity information and semantic expression diversity information between texts with the same evaluation aspect and emotional tendency are captured by means of the graph neural network, embedded vector representation of the texts and nodes in the evaluation aspect is obtained through model training, and the expression capability and generalization capability of the model are effectively improved.

Drawings

Fig. 1 is a schematic diagram of a network structure of the present invention.

Detailed Description

In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the invention described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.

Aiming at the problems of the existing model, the invention provides a method for modeling the relationship between texts with the same evaluation aspect and emotional tendency and the relationship between the texts and the evaluation aspect by combining a graph neural network, so that the model can learn the structural similarity characteristics between similar texts, thereby improving the expression capability of the model; meanwhile, the model can learn the semantic diversity characteristics among texts, so that the generalization capability of the model is improved.

The invention is described in further detail below with reference to the accompanying drawings:

step 1: constructing a three-level graph network structure of a word-sentence-evaluation aspect according to the co-occurrence relation of the word and the text and the evaluation aspect related in the text;

graph structure G is represented as: g ═ V_w,V_s,V_a,E_ws,E_saIn which V_wRepresenting nodes of words, V, contained in the text_sRepresenting nodes of text, V_aRepresenting nodes in evaluation, E_wsRepresenting the edges between the word nodes and the text nodes, the weights of which represent the positions of the words appearing in the text, E_saRepresenting an edge between the text node and the evaluation aspect node;

step 2: aiming at word nodes in a graph network structure, initializing the word nodes by using pre-trained GloVe word vectors to obtain word embedded vectors, and splicing all the word embedded vectors to obtain a word initial embedded matrix;

for the word w, the GloVe word vector library is consulted through the sequence number of the word w in the dictionary to obtain the initial embedded vector of the word w as

Wherein d is_wFor the dimension of the word embedding vector, all word embedding vectors are spliced to obtain a word initial embedding matrix

Where n is the number of words.

And step 3: initializing a text node in a graph network structure by using a pre-training language model BERT to obtain an initial embedded vector of the text, and splicing all the initial embedded vectors of the text to obtain an initial embedded matrix of the text;

for the text s ═ w₁,w₂,…,w_lWherein w is_i(i e 1 … l) are the words that make up the text, l is the sentence length, and the initial embedded vector for the text s is:

X_s＝MeanPooling(BERT(s))#(1)

wherein, MeanPooling represents that the final output of the BERT model is subjected to average pooling,

d_ssplicing all the text initial embedded vectors for the dimensionality of the text embedded vectors to obtain a text initial embedded matrix

Where m is the number of texts.

And 4, step 4: aiming at evaluation nodes in a graph network structure, coding is carried out by using unique hot codes, a layer of parameter learnable full-connection network (FCN) is utilized to map coding vectors to a feature space, initial embedded vectors of the coding vectors are obtained, and all the evaluation initial embedded vectors are spliced to obtain an evaluation initial embedded matrix;

for the evaluation-side node a, the initial embedded vector is:

X_a＝FCN(OneHot(a))#(2)

wherein OneHot represents one-hot encoding; the FCN represents a fully-connected network,

d_aembedding the dimensions of the vector for evaluation; splicing all the initial embedded vectors in the evaluation aspect to obtain an initial embedded matrix in the evaluation aspect

k is the number of evaluation flanks.

And 5: embedding vector h for a given node in a graph network structure_iAnd neighbor node N connected with it_iObtaining new embedded vectors of given nodes by using a multi-head attention mechanism, and obtaining new embedded vectors of given nodes

Comprises the following steps:

wherein, | | represents the splicing operation of the vector, and σ represents the ReLU activationFunction, W_nIn order for the parameters to be learnable,

the attention scores for node i and node j at the nth head,

the calculation formula is as follows:

wherein the content of the first and second substances,

for learnable parameters, e_ijThe weight representing the edge between node i and node j depends on where the word node appears in the text.

Embedded representation of given node n at step t

The embedded representation of its neighbors in step t is recorded as

The embedding of node n at (t +1) is shown as

The calculation formula is shown as formula (3) and is expressed as:

step 6: aiming at word nodes, text nodes and evaluation nodes in the graph network structure, corresponding initial embedding matrixes are given, and embedding in the step t is obtained through repeated iteration of a formula (5)

Then, the embedded meter in step (t +1)The calculation formula is as follows:

wherein the content of the first and second substances,

is the calculated intermediate value; FFN () is a feedforward network, the input of the feedforward network is the connection of the calculated intermediate value and the residual error of the embedded representation at the time t, and the advantage of doing so is that the expression capability of the model can be greatly improved and the model can be rapidly converged, and the calculation formula is as follows:

FFN(x)＝max(0,xW₁+b₁)W₂+b₂

where max () denotes taking the larger of the two elements, W₁,W₂,b₁,b₂Are learnable parameters.

And 7: given the final text-embedded representation matrix

And an evaluation-wise embedded representation matrix

And calculating the correlation between the two by adopting a self-attention mechanism, wherein the calculation formula is as follows:

wherein the content of the first and second substances,

is composed of

Embedding vectors corresponding to the ith text node;

is composed of

The embedded vector corresponding to the jth evaluation aspect; beta is a_ijThe attention weight between the text node vector and the evaluation node vector represents the correlation weight distribution between the text node vector and the evaluation node vector;

a probability distribution representing an emotional tendency of the predicted text in the current evaluation aspect; w_a,b_aIs a learnable parameter; softmax () is an exponential normalization function, which is calculated by the formula:

and 8: distributing predicted text emotional tendency

With text true emotion label

Comparing, passing through cross entropy loss function

and finally, continuously updating model parameters through a back propagation algorithm, so that the predicted emotional tendency of the text is continuously close to the real emotional tendency of the text.

And step 9: model training

The gradient is updated by using an Adam optimizer, the learning rate is set to be 0.001, the first-order momentum parameter of Adam is 0.1, the second-order momentum parameter is 0.999, the number of data set training iterations (Epoch) is set to be 200, the parameters of the pre-trained BERT model are fixed, and the pre-trained GloVe word vector is 300-dimensional.

Model use:

and (3) performing feature extraction on the text input model to be classified, calculating the correlation between the extracted text feature vector and the trained evaluation aspect vector by adopting a self-attention mechanism, and finally classifying by using a softmax classifier.

Referring to fig. 1, fig. 1 is a schematic diagram of a network model of the present invention, which mainly includes a node-embedded representation initialization module, a graph attention module and a prediction module. The node embedded representation initialization module is used for initializing embedded representations of words, texts and evaluation nodes; the graph attention module is used for iteratively updating the embedded representation of the network node; the prediction module uses the final node embedding to represent the emotional tendency of the predicted text.

To measure model performance, comparative experiments were performed on five widely used public data sets, a training set of data sets, a test set partition, and the number of texts containing different emotions as shown in table 1. Table 2 shows the results of the comparison experiments, which are compared with thirteen common models in terms of index accuracy (Acc.) and F1 values, and it can be seen from the table that the models HAGNN-GloVe and HAGNN-BERT of the present invention achieve the best results in most indexes, and are greatly improved in model performance compared to the conventional methods.

TABLE 1 statistical information for data sets used to measure model performance

Table 2 shows the accuracy (Acc.) and F1 values of the comparison model on different data sets, where HAGNN-GloVe and HAGNN-BERT are two methods of the present invention, which employ different initialization data.

Table 2 comparison of model accuracy (Acc.) and F1 values on different datasets

The above-mentioned contents are only for illustrating the technical idea of the present invention, and the protection scope of the present invention is not limited thereby, and any modification made on the basis of the technical idea of the present invention falls within the protection scope of the claims of the present invention.

Claims

1. An aspect-level text emotion analysis method based on a heterogeneous graph neural network is characterized by comprising the following steps of:

Initial embedded representation matrix of text nodes

And initial embedded representation matrix in evaluation

Finally, the text embedded expression matrix is obtained

And an evaluation-wise embedded representation matrix

(4) Embedding a representation matrix with the text

And an evaluation-side embedded representation matrix

2. The method for analyzing aspect-level text emotion based on an abnormal pattern neural network of claim 1, wherein the graph network structure G is represented as: g ═ V_w，V_s，V_a，E_ws，E_sa}；

3. The method for analyzing aspect-level text emotion based on heterographic neural network of claim 1, wherein the specific operations of initializing word node embedding vectors in step (2) are as follows:

4. The method for analyzing aspect-level text emotion based on heterographic neural network of claim 1, wherein the specific operation of initializing text node embedding vectors in step (2) is as follows:

5. The method for analyzing the aspect-level text emotion based on the heteromorphic neural network of claim 1, wherein the specific operation of initializing the evaluation aspect node embedding vector in the step (2) is as follows:

6. The method for analyzing aspect-level text emotion based on a heteromorphic neural network of claim 1, wherein the specific operation of updating the node-embedded representation in the graph network structure in step (3) is as follows:

The embedded representation of a given node n in step t is noted

The embedded representation of the neighbors of a given node at step t is noted

Let the embedding of a given node n at (t +1) be denoted

New embedded vector representation based on given nodes

Construction of

The relationship between the three is as follows:

7. The method for analyzing the emotion of the aspect-level text based on the heteromorphic neural network as claimed in claim 1, wherein in the step (4), the correlation between the text and each emotional tendency of the evaluation aspect is calculated by the following formula:

wherein the content of the first and second substances,

is composed of

Embedding vectors corresponding to the ith text node;

is composed of

a probability distribution representing an emotional tendency of the predicted text in the current evaluation aspect; w_a，b_aIs a learnable parameter; softmax () is an exponential normalization function, and the calculation formula is as follows:

8. the method for analyzing aspect-level text emotion based on neural network of heterogeneous map as claimed in claim 7, wherein in step (4), the method is performed in advanceMeasured text emotional tendency distribution

With text true emotion label

Comparing, passing through cross entropy loss function