CN114781352A - Emotion analysis method based on association between grammar dependency type and aspect - Google Patents
- Publication number: CN114781352A (application CN202210373785.1A)
- Authority: CN (China)
- Legal status: Granted
Classifications
- G06F40/205 — Natural language analysis; parsing
- G06F16/355 — Information retrieval; class or cluster creation or modification
- G06F40/253 — Grammatical analysis; style critique
- G06F40/279 — Recognition of textual entities
- G06F40/284 — Lexical analysis, e.g. tokenisation or collocates
- G06N3/045 — Neural networks; combinations of networks
Abstract
The invention discloses an emotion analysis method based on the association between grammatical dependency types and aspects, comprising the steps of: S1, obtaining the text to be analyzed and converting it into a word-vector representation with a pre-training model; S2, concatenating a specific aspect with each word of the sentence representation, introducing grammatical dependency types, and giving greater weight to the important dependency types in the syntactic dependency tree; S3, using the aspect-aware sentence representation with grammatical dependency types as the input of the model; S4, introducing an inter-aspect association matrix and obtaining, through a graph convolutional network, a sentence representation containing the inter-aspect associations; S5, merging the sentence representation containing the inter-aspect associations with the sentence representation containing the aspect features; and S6, after the sentence representation containing both aspect information and inter-aspect associations is obtained, judging the emotion polarity of the target aspect in combination with the query vector. The invention improves the model's ability to recognize inter-aspect information and grammatical dependency relations.
Description
Technical Field
The invention relates to an emotion analysis method based on the association between grammatical dependency types and aspects, and belongs to the field of natural language processing.
Background
The development of electronic commerce generates a large number of review texts with emotional polarity; these reviews have important commercial value and have attracted researchers to sentiment analysis. In general, a sentence contains several different aspects, which may carry the same or different sentiments. Past work on aspect-level sentiment analysis usually considered each aspect independently, severing the emotional connection between aspects and limiting the analysis; in some cases, the sentiment of a target aspect can only be inferred from the sentiments of other aspects. For example, in the sentence "the menu is very limited - i think we counted 4 or 5 entries", the aspect word "entries" has no explicit opinion word, and its sentiment cannot be judged from the second half of the sentence alone; however, it can be judged to be negative from the negative sentiment of the aspect word "menu". In recent years, research on aspect-level sentiment analysis has developed rapidly. Han Hu et al. propose using a knowledge graph to incorporate background knowledge into the text, providing rich contextual information for the sentence. Li Zhang et al. propose a BERT-based memory-network model that fully interacts the output of the memory network with the attention of the [CLS] vectors of the aspect terms, avoiding the loss of important information. Other researchers propose screening the initially acquired features with a self-attention mechanism, sending the screened features to CNNs with different convolution kernels to extract different local features, and finally selecting the important information with another self-attention layer. Lin et al. propose using position information to select the features of a deep memory network, and design a cross-aspect module to obtain the emotional associations between aspects.
Liang et al. corrected the syntactic dependency tree by centering it on the aspect words, enhanced the dependency graph, and constructed an inter-aspect association graph to obtain the emotional dependencies between aspects. However, most existing research focuses only on the sentiment of a single aspect of a sentence and ignores the relationships between aspects. Meanwhile, most studies use the original syntactic dependency tree and do not consider the influence of different dependency types on emotion polarity.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides an emotion analysis method based on the association between grammatical dependency types and aspects. It solves the problem that traditional methods cannot take the emotional associations between different aspects into account; meanwhile, by introducing grammatical dependency types, the dependency types that matter more for judging the sentiment of the target aspect can be given greater weight. The emotional information between aspects and the grammatical information are thus combined effectively, improving the accuracy and stability of aspect-level sentiment analysis across different datasets.
In order to achieve this purpose, the technical scheme of the invention is as follows: an emotion analysis method based on the association between grammatical dependency types and aspects, the method specifically comprising the following steps:
s1, obtaining a text to be analyzed, and converting the text into word vector representation through a pre-training model;
s2, the specific aspect is concatenated with each word of the sentence representation so that the sentence representation focuses on the sentiment of one specific aspect at a time; grammatical dependency types are introduced, and the important dependency types in the syntactic dependency tree are given greater weight; a sentence representation containing richer grammatical information is obtained through a graph convolutional network, and an attention layer is then used to refine the aspect-aware sentence representation with grammatical dependency types;
s3, the aspect-aware sentence representation with grammatical dependency types is used as the input of the model; the target aspect serves as the query vector, the non-target aspects are passed through a Bi-GRU so that the aspect features propagate further within the sentence representation, and the inter-aspect attention is then computed between the resulting sentence representation containing richer aspect features and the query vector;
s4, introducing an inter-aspect correlation matrix, and acquiring a sentence representation containing inter-aspect correlation through a graph convolution network;
s5, merging the sentence representation containing the inter-aspect associations with the sentence representation containing the aspect features, and using the inter-aspect association coefficient to control how many inter-aspect association features are introduced;
s6, after sentence representation containing aspect information and inter-aspect correlation is obtained, inter-aspect attention is used for controlling influence of non-target aspects on target aspects to obtain final sentence representation, and the emotion polarity of the target aspects is judged by combining the query vector.
Preferably, in step S2, giving greater weight to the important dependency types in the syntactic dependency tree specifically comprises: obtaining a sentence representation containing aspect information with a Bi-GRU, introducing the grammatical dependency types, obtaining a sentence representation containing grammatical features through a graph convolutional network, and using an attention layer to amplify the influence of the words that are important for judging the sentiment of the specific aspect.
Preferably, the sentence representation containing aspect information is obtained through the following specific process:
the aspect word a_i is embedded into the word-vector representation of the sentence, resulting in a word-vector representation that carries the aspect word;
this representation is then passed through a Bi-GRU, denoted GRU_1, to obtain its hidden representation;
The specific construction process of the graph convolutional network that focuses on grammatical dependency types is as follows: first, SpaCy is used to obtain the syntactic dependency information, which can be represented as a list of dependency triples (w_i, w_j, c_ij), where c_ij denotes the dependency type between word w_i and word w_j; the dependency information is then encoded in an adjacency matrix A, in which the element a_ij represents the relationship between w_i and w_j: a_ij = 1 if the two words are connected by a dependency edge, and a_ij = 0 otherwise;
each element c_ij of the dependency-type matrix is mapped to its type embedding e_ij, and each edge is then given a weight g_ij^(l), computed from e_ij together with the hidden states h_i^(l-1) and h_j^(l-1) of words w_i and w_j at layer (l-1), where the initial hidden states come from the output of GRU_1;
the output of word w_i at layer l of the graph convolutional network is then
h_i^(l) = σ( Σ_j g_ij^(l) a_ij W^(l) h_j^(l-1) + b^(l) ),
where W^(l) and b^(l) are the trainable parameters of the l-th graph-convolution layer and σ is the ReLU activation function; the sentence representation with aspect information obtained after this type-fused graph convolutional network (TFGCN) is used in the subsequent steps;
when forming the aspect-aware sentence representation, an attention layer is introduced to refine it; the attention weights are computed as
α = softmax(z),
where z contains the attention scores of the words with respect to the specific aspect, and the refined sentence representation is the α-weighted combination of the hidden states.
Preferably, computing the inter-aspect attention in step S3 specifically comprises: taking the sentence representation containing the inter-aspect associations and grammatical dependency types as input and passing it through a Bi-GRU, so that the aspect information propagates further within the sentence representation and a representation containing richer aspect features is obtained; the target aspect is then used as the query vector, and the attention between the target aspect and the non-target aspects is computed.
Preferably, the specific procedure for computing the inter-aspect attention is as follows: first, the sentence representation is input to another gated recurrent unit, GRU_2, whose hidden layers have size D_0; the non-target aspects passed through GRU_2 yield their hidden representations, and the target-aspect representation is converted into a query vector q with a fully connected layer;
to obtain the correlation between the query vector and the other vectors, the attention scores z_a between q and the hidden representations of the other aspects are computed, and
β = softmax(z_a),
where β_i represents the attention score between the target aspect and the other aspects;
Preferably, the sentence representation of step S4 is obtained as follows: the inter-aspect association matrix is obtained, and then the association matrix together with the sentence representation of aspect features is used as the input of a graph convolutional network, which produces the sentence representation of inter-aspect associations.
Preferably, the sentence representation of the inter-aspect associations is obtained as follows:
first, an inter-aspect adjacency matrix is constructed to capture the contextual associations between aspects: an element of the matrix is set to 1 when one of the two corresponding words belongs to the word set of the target aspect (whose starting position is p_t) and the other belongs to the word set of another aspect, and 0 otherwise;
after the inter-aspect adjacency matrix is obtained, it is used together with the hidden representation with aspect information produced by GRU_2 as the input of the graph convolutional network, so as to obtain the mutual emotional features of the specific aspect and the other aspects; the hidden representation of the i-th node at the l-th layer of the graph convolutional network is updated as
h_i^(l) = ReLU( Σ_j a_ij W^(l) h_j^(l-1) + b^(l) ),
where h_i^(l-1) denotes the hidden representation of the i-th node at layer l-1, and W^(l) and b^(l) are the trainable parameters of the l-th graph-convolution layer;
the hidden representation with the inter-aspect association information finally obtained through the graph convolutional network is denoted Q^Inter.
Preferably, the sentence representation of step S5 that contains both aspect features and inter-aspect associations is obtained as follows: the sentence representation containing aspect features and the sentence representation of inter-aspect associations are merged, and an inter-aspect association coefficient is introduced to control how much inter-aspect association information is introduced, finally yielding a sentence representation containing both;
specifically, the above process yields a sentence representation with rich aspect information and the sentence representation Q^Inter with inter-aspect associations; to make full use of both, the two representations are merged so as to exploit their mutual relationship, where the coefficient γ ∈ [0,1] controls how many inter-aspect features are introduced.
Preferably, the final sentence representation of step S6 is obtained as follows: the sentence representation containing both aspect features and inter-aspect associations is multiplied by the inter-aspect attention to control the influence of the different non-target aspects on the target aspect, yielding the final sentence representation; this is added to the query vector of the target aspect and input to a softmax layer to obtain the final representation for emotion analysis.
Preferably, the final representation for emotion analysis is embodied as:
firstly, the belt is provided with a squareSentence representation Q of interplanar associationsInterMultiplying with inter-aspect attention beta to obtain final output of sentence representationThe query vector q for the target facet is then added to the output to generate a target facet sentence representation with rich inter-facet association information, which is then sorted by softmax, which contains C classes, to obtain the final representation p for emotion analysis, as shown below:
ρ=softmax((q+o)Wρ+bρ)
wherein the weight matrix WρAnd bias bρIs trainable; after the final representation is obtained, the emotion polarity is predicted as follows:
The technical principles and beneficial effects of the invention are as follows:
(1) By weighting the more important grammatical dependency relations through the dependency types, the model can focus on the grammatical features that benefit the emotion analysis of the target aspect, enriching the grammatical features of the target aspect.
(2) The sentence representations of the non-target aspects are passed through a further Bi-GRU so that, after the aspect information has propagated further within the sentence representation, the target aspect is used as a query vector to compute the inter-aspect attention with the non-target aspects; in this way, the sentence representations of the non-target aspects with stronger emotional association to the target aspect play a more important role.
(3) The invention introduces an inter-aspect association matrix and, after obtaining the sentence representation of inter-aspect associations through graph convolution, introduces an inter-aspect association coefficient to control how much inter-aspect association information is introduced, ensuring that the introduced inter-aspect information benefits the emotion analysis of the target aspect.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings required for describing the embodiments are briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without creative effort.
FIG. 1 is an emotion analysis model based on associations between grammatical dependency types and aspects;
FIG. 2 is a specific aspect sentence representation with aspect information and grammar dependency types;
FIG. 3 is a construction of adjacency matrix and syntax-dependent type matrix.
Detailed Description
The technical solutions of the present invention will be described clearly and completely with reference to the accompanying drawings, and it is to be understood that the described embodiments are merely preferred embodiments of the present invention, rather than all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Examples
As shown in FIG. 1, the present disclosure provides an emotion analysis method based on association between syntax dependency type and aspect, including:
s1, obtaining the text to be analyzed and converting it into a word-vector representation with a pre-training model, specifically: the text is converted into word vectors that a computer can recognize using the GloVe pre-training model;
s2, the specific aspect is concatenated with each word of the sentence representation so that the sentence representation focuses on the sentiment of one specific aspect at a time; grammatical dependency types are introduced, and the important dependency types in the syntactic dependency tree are given greater weight; a sentence representation containing richer grammatical information is obtained through a graph convolutional network, and an attention layer is then used to refine the aspect-aware sentence representation with grammatical dependency types;
s3, the sentence representation is used as the input of the model; the target aspect serves as the query vector, the non-target aspects are passed through a Bi-GRU so that the aspect features propagate further within the sentence representation, and the inter-aspect attention is then computed between the resulting sentence representation containing richer aspect features and the query vector;
s4, introducing an inter-aspect correlation matrix, and acquiring sentence representation containing inter-aspect correlation through a graph convolution network;
s5, merging the sentence representation containing the inter-aspect associations with the sentence representation containing the aspect features, and using the inter-aspect association coefficient to control how many inter-aspect association features are introduced;
s6, after the sentence representation containing the aspect information and the association between the aspects is obtained, the influence of the non-target aspect on the target aspect is controlled by the attention between the aspects to obtain the final sentence representation, and the emotion polarity of the target aspect is judged by combining the query vector.
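As a concrete illustration of step S1 above, the GloVe conversion can be sketched as follows. This is a minimal sketch assuming the standard GloVe text-file format (one word followed by its vector values per line); the dimensionality and the zero-vector fallback for out-of-vocabulary words are illustrative choices, not specified by this embodiment:

```python
import numpy as np

def load_glove(path):
    """Parse a GloVe text file into a {word: vector} lookup table."""
    table = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            word, *vals = line.rstrip().split(" ")
            table[word] = np.asarray(vals, dtype=np.float32)
    return table

def embed_sentence(tokens, table, dim=300):
    """Map each token to its GloVe vector; unknown words fall back to zeros."""
    return np.stack(
        [table.get(t, np.zeros(dim, dtype=np.float32)) for t in tokens]
    )
```

The resulting (n, dim) matrix is the word-vector representation of the sentence consumed by the later steps.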
Further, the grammatical dependency types are used to give greater weight to the important dependency types in the syntactic dependency tree, specifically: a sentence representation containing aspect information is obtained with a Bi-GRU, the grammatical dependency types are introduced, a sentence representation containing grammatical features is obtained through a graph convolutional network, and an attention layer is used to amplify the influence of the words that are important for judging the sentiment of the specific aspect.
The inter-aspect attention module based on the target aspect specifically comprises: the sentence representation which simultaneously contains inter-aspect association and grammar dependency types is used as input and is passed through Bi-GRU, so that aspect information is further propagated in the sentence representation to obtain the sentence representation containing more aspect features, and the target aspect is used as a query vector to calculate attention between the target aspect and the non-target aspect.
The sentence representation of the inter-aspect associations is obtained as follows: the inter-aspect association matrix is obtained, and then the association matrix together with the sentence representation of aspect features is used as the input of a graph convolutional network, which produces the sentence representation of inter-aspect associations.
The sentence expression simultaneously containing aspect characteristics and correlation among aspects is specifically as follows: and merging the sentence representation containing the aspect characteristics and the sentence representation associated between the aspects, and introducing an inter-aspect association coefficient to control the amount of the introduced inter-aspect association information so as to finally obtain the sentence representation simultaneously containing the aspect characteristics and the inter-aspect associations.
And the final representation used for emotion analysis is embodied as follows: and simultaneously multiplying the sentence representation containing the aspect characteristics and the association between the aspects by the attention between the aspects to control the influence of different non-target aspects on the target aspects to obtain a final sentence representation, adding the final sentence representation with the query vector of the target aspect, and inputting the final sentence representation into a softmax layer to obtain a final representation of emotion analysis.
Further, the specific process of obtaining the sentence representation containing aspect information is as follows:
the aspect word a_i is embedded into the word-vector representation of the sentence, resulting in a word-vector representation that carries the aspect word;
this representation is then passed through a Bi-GRU, denoted GRU_1, to obtain its hidden representation;
the graph convolutional network that focuses on grammatical dependency types is then constructed as follows: first, SpaCy is used to obtain the syntactic dependency information, which can be represented as a list of dependency triples (w_i, w_j, c_ij), where c_ij denotes the dependency type between word w_i and word w_j; the dependency information is then encoded in the adjacency matrix A, in which the element a_ij represents the relationship between w_i and w_j: a_ij = 1 if the two words are connected by a dependency edge, and a_ij = 0 otherwise.
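The construction of the adjacency matrix A and the dependency-type matrix can be sketched as follows. The dependency triples would normally come from SpaCy's parse; here they are hard-coded for illustration, and the self-loops on the diagonal are a common graph-convolution convention that this embodiment's text does not explicitly specify:

```python
import numpy as np

def build_matrices(n, triples, type2id):
    """Build the undirected adjacency matrix A and the dependency-type
    matrix C from a list of (i, j, dep_type) triples over n tokens."""
    A = np.eye(n, dtype=np.float32)   # self-loops (assumed convention)
    C = np.zeros((n, n), dtype=np.int64)
    for i, j, dep in triples:
        A[i, j] = A[j, i] = 1.0       # a_ij = 1 if a dependency edge exists
        C[i, j] = C[j, i] = type2id[dep]
    return A, C
```

With SpaCy, the triples could be gathered as `(tok.i, tok.head.i, tok.dep_)` for each token of the parsed sentence.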
To make full use of the dependency types, each element c_ij of the dependency-type matrix is mapped to its type embedding e_ij. Since, in combination with its dependency type, each edge in the graph contributes differently to the sentiment, this embodiment gives each edge a different weight: the weight g_ij^(l) of each edge is computed from the type embedding e_ij together with the hidden states h_i^(l-1) and h_j^(l-1) of words w_i and w_j at layer (l-1), where the initial hidden states come from the output of GRU_1.
The output of word w_i at layer l of the graph convolutional network is
h_i^(l) = σ( Σ_j g_ij^(l) a_ij W^(l) h_j^(l-1) + b^(l) ),
where W^(l) and b^(l) are the trainable parameters of the l-th graph-convolution layer and σ is the ReLU activation function. This process incorporates the dependency types into the graph convolutional network and weights each edge by its dependency type, enabling the model to focus on the contextual information that is more important for the emotion analysis of the aspect word. The sentence representation with aspect information obtained after this type-fused graph convolutional network (TFGCN) is used in the subsequent steps.
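One layer of the type-fused graph convolution can be sketched as follows. The exact edge-weight formula appears only in the figures of the patent, so the per-edge score used here (a dot product between the neighbor's hidden state and the dependency-type embedding, followed by a masked normalization over neighbors) is an assumed form:

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def tfgcn_layer(H, A, E, W, b):
    """One type-fused graph-convolution layer.
    H: (n, d) hidden states; A: (n, n) adjacency matrix;
    E: (n, n, d) per-edge dependency-type embeddings;
    W: (d, d) and b: (d,) trainable parameters.
    The edge weight g_ij is scored from h_j and the type embedding e_ij
    (an assumed form), masked by A and normalized over each row."""
    n, d = H.shape
    G = np.einsum("jd,ijd->ij", H, E) / np.sqrt(d)  # per-edge scores
    G = A * np.exp(G)                               # mask non-edges
    G = G / np.clip(G.sum(axis=1, keepdims=True), 1e-9, None)
    return relu(G @ H @ W + b)                      # sigma = ReLU
```

Stacking several such layers (with the GRU_1 output as the layer-0 states) yields the TFGCN sentence representation described above.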
Then, in order to amplify the influence of the words that play an important role in judging the sentiment of the specific aspect a_i, an attention layer is introduced to refine the aspect-aware sentence representation; the attention weights are computed as
α = softmax(z),
where z contains the attention scores of the words with respect to the aspect, and the refined sentence representation is the α-weighted combination of the hidden states.
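The attention refinement can be sketched as follows. The text only states α = softmax(z), so the bilinear scoring matrix W and the weighted-sum pooling are assumptions:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attention_pool(H, a_repr, W):
    """Score each word's hidden state against the aspect representation
    (bilinear score, an assumed form), then combine the states with
    alpha = softmax(z)."""
    z = H @ W @ a_repr        # (n,) word scores w.r.t. the aspect
    alpha = softmax(z)
    return alpha, alpha @ H   # attention weights, pooled representation
```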
Further, the inter-aspect attention module based on the target aspect is specifically as follows:
first, the sentence representation is input to another gated recurrent unit, GRU_2, whose hidden layers have size D_0; the purpose is to let the aspect information spread sufficiently within the sentence representation and obtain a representation carrying more aspect information. The non-target aspects passed through GRU_2 yield their hidden representations. For the target-aspect representation, this embodiment uses a fully connected layer to convert it into a query vector q for computing the attention between the target aspect and the other aspects.
To obtain the correlation between the query vector and the other vectors, the attention scores z_a between q and the hidden representations of the other aspects are computed, and
β = softmax(z_a),
where β_i represents the attention score between the target aspect and the other aspects; through this attention mechanism, the other aspects that are strongly correlated with the target aspect receive larger weights and play a more important role in judging the emotion of the target aspect.
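The query-vector construction and inter-aspect attention can be sketched as follows; the scaled dot-product form of z_a is an assumption, since the scoring formula is given only in the figures:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def inter_aspect_attention(h_target, H_others, Wq, bq):
    """Project the target-aspect state to a query vector q with a fully
    connected layer, then score every non-target aspect against q and
    normalize with beta = softmax(z_a)."""
    q = h_target @ Wq + bq               # fully connected layer
    z = H_others @ q / np.sqrt(q.size)   # scaled dot-product (assumed)
    return q, softmax(z)                 # query vector, attention beta
```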
Further, the specific calculation process of the sentence representation of inter-aspect associations is as follows:
first, an inter-aspect adjacency matrix is constructed to capture the contextual associations between aspects: an element of the matrix is set to 1 when one of the two corresponding words belongs to the word set of the target aspect (whose starting position is p_t) and the other belongs to the word set of the other aspects, and 0 otherwise. To obtain richer dependency information from the input sentence, this embodiment constructs an undirected adjacency matrix, i.e. the matrix is symmetric.
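The inter-aspect adjacency construction can be sketched as follows; token spans are used here as a stand-in for the word sets of the aspects:

```python
import numpy as np

def aspect_adjacency(n, target_span, other_spans):
    """Undirected adjacency linking every word of the target aspect to
    every word of every other aspect; spans are [start, end) indices."""
    A = np.zeros((n, n), dtype=np.float32)
    for span in other_spans:
        for i in range(*target_span):
            for j in range(*span):
                A[i, j] = A[j, i] = 1.0   # symmetric, i.e. undirected
    return A
```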
After the inter-aspect adjacency matrix is obtained, it is fed, together with the aspect-informed hidden representation produced by GRU2, into the graph convolution network to obtain the mutual emotional features between the specific aspect and the other aspects. The hidden representation of the i-th node in the l-th layer of the graph convolution network is updated as follows:
where ReLU denotes the activation function, h_i^(l-1) denotes the hidden representation of the i-th node at layer l-1, and W^(l) and b^(l) denote trainable parameters of the l-th graph-convolution layer.
Finally, the hidden representation with inter-aspect association information obtained through the graph convolution network is denoted Q_Inter.
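A minimal sketch of one such graph-convolution layer (shapes and values are hypothetical; degree normalization of A, common in practice, is omitted for brevity):

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def gcn_layer(A, H, W, b):
    # One graph-convolution layer: each node aggregates its neighbours'
    # hidden states via A, applies a linear map W, b and the ReLU
    # activation, matching the update rule described in the text.
    return relu(A @ H @ W + b)

rng = np.random.default_rng(1)
A = np.array([[1., 1., 0.],
              [1., 1., 1.],
              [0., 1., 1.]])               # adjacency with self-loops
H = rng.standard_normal((3, 4))            # node hidden states h^(l-1)
W = rng.standard_normal((4, 4))            # trainable W^(l)
b = np.zeros(4)                            # trainable b^(l)

H1 = gcn_layer(A, H, W, b)                 # hidden representations at layer l
```

Stacking two such layers (the layer count used in the experiments) lets each aspect node absorb features from aspects two hops away.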
Further, the sentence expression simultaneously containing aspect features and associations between aspects specifically includes:
The above process yields a sentence representation rich in aspect information and a sentence representation Q_Inter carrying inter-aspect associations. To make full use of both, this section combines the two representations to exploit the correlation between them; the combined sentence representation is given by the following expression.
where the coefficient γ ∈ [0,1] controls how much of the inter-aspect features is introduced.
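The combining formula itself is omitted from the text; a convex combination controlled by γ is one natural reading and is assumed in this sketch:

```python
import numpy as np

def combine(Q_aspect, Q_inter, gamma):
    # Assumed combining rule: γ of the inter-aspect representation,
    # (1 - γ) of the aspect-feature representation.
    assert 0.0 <= gamma <= 1.0
    return gamma * Q_inter + (1.0 - gamma) * Q_aspect

# Hypothetical two-dimensional representations for illustration.
Q_aspect = np.array([1.0, 0.0])
Q_inter = np.array([0.0, 1.0])
Q = combine(Q_aspect, Q_inter, 0.25)   # introduce 25% inter-aspect features
```

Setting γ = 0 ignores inter-aspect associations entirely, while γ = 1 relies on them exclusively; the useful values lie in between.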
Further, the final representation for emotion analysis is specifically represented as:
To introduce inter-aspect associations into the final sentiment analysis and model the target-aspect sentence representation more accurately, the sentence representation Q_Inter with inter-aspect associations is first multiplied by the inter-aspect attention β to obtain the final output o of the sentence representation. The query vector q of the target aspect is then added to this output to generate a target-aspect sentence representation with rich inter-aspect association information, after which a softmax classifier with C classes (C is generally 2 or 3 in sentiment-analysis tasks; 3 in this embodiment) produces the final representation ρ for sentiment analysis, as shown in the following formula:
ρ = softmax((q + o)W_ρ + b_ρ)
where the weight matrix W_ρ and bias b_ρ are trainable.
After the final representation is obtained, the emotion polarity is predicted as shown in the following equation.
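A minimal numeric sketch of this classification step (dimensions and values are hypothetical; in the model, q and o come from the preceding attention modules, and the polarity is the arg-max class of ρ):

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

rng = np.random.default_rng(2)
d, C = 4, 3                          # representation size (assumed) and C = 3 classes
q = rng.standard_normal(d)           # query vector of the target aspect
o = rng.standard_normal(d)           # attention-weighted sentence output
W_rho = rng.standard_normal((d, C))  # trainable weight matrix W_ρ
b_rho = np.zeros(C)                  # trainable bias b_ρ

rho = softmax((q + o) @ W_rho + b_rho)   # ρ = softmax((q + o)W_ρ + b_ρ)
polarity = int(np.argmax(rho))           # predicted class, e.g. 0/1/2
```

With C = 3 the three classes correspond to the positive, neutral, and negative polarities used in the experiments.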
This solves the problem that traditional methods cannot consider emotional associations between different aspects. At the same time, by introducing grammatical dependency types, the dependency types most important for judging the emotion of the target aspect can be given higher weight, effectively combining inter-aspect emotional information with grammatical information and improving the accuracy and stability of aspect-level sentiment analysis across different datasets.
In order to verify the effectiveness of the model provided by the embodiment of the invention in solving the aspect-level emotion analysis task, three public data sets are adopted for carrying out experiments:
These include the REST14 and LAP14 datasets; in addition, the TWITTER dataset, built from social-networking posts, is employed. Each dataset is pre-labeled and split into a training set and a test set: the training data are used to fit the model, and the test set measures the generalization ability of the trained model. Each dataset contains three categories: positive, neutral, and negative.
Experimental settings: initial representations of sentences and aspect words are obtained with GloVe and BERT word vectors, whose input dimensions are 300 and 768 respectively. An early-stopping mechanism terminates training if the model's loss does not decrease for 5 consecutive epochs. The weight and bias matrices are randomly initialized from the uniform distribution U(-0.01, 0.01), and the number of graph-convolution layers is set to 2. Other hyper-parameter settings are shown in Table 1.
TABLE 1 setting of hyper-parameters
Accuracy (Acc) and Macro-F1 (F1 for short), both widely used in aspect-level sentiment analysis, are adopted as evaluation metrics. Accuracy is the proportion of correctly predicted samples among all samples. Macro-F1 takes the average of the per-class F1 scores as the F1 value of the whole sample. Larger values of either metric indicate better classification.
The calculation of the accuracy Acc and F1 values is shown below.
where TN denotes true negative samples, FN false negative samples, FP false positive samples, and TP true positive samples; P denotes precision and R denotes recall.
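As an illustrative check of these two metrics (the label vectors below are hypothetical), Accuracy and Macro-F1 over the three sentiment classes can be computed as:

```python
# 0 = negative, 1 = neutral, 2 = positive (class coding assumed).
def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def macro_f1(y_true, y_pred, classes=(0, 1, 2)):
    f1s = []
    for c in classes:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        prec = tp / (tp + fp) if tp + fp else 0.0   # P = TP / (TP + FP)
        rec = tp / (tp + fn) if tp + fn else 0.0    # R = TP / (TP + FN)
        f1s.append(2 * prec * rec / (prec + rec) if prec + rec else 0.0)
    return sum(f1s) / len(f1s)                      # average per-class F1

y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
acc = accuracy(y_true, y_pred)   # 4 of 6 correct
f1 = macro_f1(y_true, y_pred)
```

Macro averaging weights each class equally, so it is sensitive to performance on the smaller neutral class, unlike plain accuracy.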
First, to verify the superiority of the proposed model, comparative experiments were performed on the three datasets; the results are shown in Tables 2 and 3.
TABLE 2 accuracy and F1 values for each model on the TWITTER dataset
TABLE 3 accuracy (%) (in% of the different models) on REST14 and LAP14
Note: TF-IAGCN is the model proposed by the invention; bold figures indicate the best results.
Secondly, to further verify the superiority of the model proposed in the present invention in emotion analysis of sentences containing multiple aspects, the REST14 and LAP14 data sets were further divided according to the Single Aspect (SA) and Multiple Aspects (MA) contained in the sentences, as shown in table 5. The accuracy of each model in single and multiple aspects of the REST14 and LAP14 datasets is shown in table 6.
TABLE 5 distribution of single and multiple aspects in LAP14 and REST14 for individual emotion categories
TABLE 6 accuracy (%) -of each model in different aspect numbers
Thirdly, in order to verify the effectiveness of several modules proposed by the present invention, several sets of ablation experiments were performed as follows, and the results are shown in table 7.
w/o TFGCN: TFGCN denotes the type-focused graph convolution network. w/o TFGCN means that, when constructing the aspect-embedded sentence representation, the hidden representation obtained from the GRU is input directly to the attention layer to obtain the sentence representation embedding a given aspect.
w/o IAatt: IAatt denotes inter-aspect attention. w/o IAatt means that inter-aspect attention is not computed; the sentence representation containing the specific aspect and the inter-aspect-association sentence representation are directly fused and concatenated with the target aspect's query vector to predict sentiment.
w/o IAGCN: IAGCN denotes the graph convolution network with inter-aspect associations. w/o IAGCN means that sentence representations with inter-aspect associations are not extracted through this network; only the sentence representation containing the specific aspect is used, and the final representation is generated with inter-aspect attention.
The final results of the ablation experiments are shown in table 7.
Table 7 ablation experimental results (%)
The embodiment provides an emotion analysis model based on the association between the grammatical dependency types and the aspects, and the superiority of the emotion analysis model is verified through the embodiment.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.
Claims (10)
1. An emotion analysis method based on association between a grammatical dependency type and an aspect, the method specifically comprising the steps of:
s1, acquiring a text to be analyzed, and converting the text into word vector representation through a pre-training model;
s2, splicing the specific aspect with each word of the sentence representation so that each sentence representation focuses only on the emotion of that specific aspect; introducing grammatical dependency types and giving larger weight to the important dependency types in the grammar dependency tree; obtaining, through a graph convolution network, a sentence representation containing more grammatical information; and then adopting an attention layer to improve the sentence representation with aspect perception and grammatical dependency types;
s3, using the sentence representation with aspect perception and grammatical dependency types as the model input and the target aspect as the query vector; propagating the aspect features of the non-target aspects further through the sentence representation via a Bi-GRU; and then computing inter-aspect attention between the feature-enriched sentence representation and the query vector;
s4, introducing an inter-aspect correlation matrix, and acquiring a sentence representation containing inter-aspect correlation through a graph convolution network;
s5, merging the sentence expression containing the inter-aspect association and the sentence expression containing the aspect characteristics, and controlling the introduction amount of the inter-aspect association characteristics by using the inter-aspect association coefficient;
s6, after the sentence representation containing the aspect information and the association between the aspects is obtained, the influence of the non-target aspect on the target aspect is controlled by the attention between the aspects to obtain the final sentence representation, and the emotion polarity of the target aspect is judged by combining the query vector.
2. The emotion analysis method according to claim 1, wherein giving larger weight to important dependency types in the grammar dependency tree in step S2 specifically comprises: obtaining a sentence representation containing aspect information with the Bi-GRU, introducing grammatical dependency types, obtaining a sentence representation containing grammatical features through a graph convolution network, and adopting an attention layer to amplify the influence of words that play an important role in judging the emotion of the specific aspect.
3. The method of claim 2, wherein the emotion analysis method comprises the following steps: the specific process of sentence representation containing aspect information is as follows:
the aspect word a_i is embedded into the word-vector representation of the sentence to obtain a word-vector representation carrying the aspect word:
The specific construction process of the graph convolution network focusing on grammatical dependency types comprises: first, spaCy is used to obtain syntactic dependency information, which can be represented as a list of dependency tuples (w_i, w_j, c_ij), where c_ij denotes the dependency type between word w_i and word w_j; the dependency information is then represented by an adjacency matrix A whose element a_ij denotes the relationship between w_i and w_j: a_ij = 1 if an edge connects them, and a_ij = 0 otherwise;
each element c_ij of the dependency-type matrix is mapped to its word embedding, and a different weight is assigned to each edge, calculated by the following formula:
where h_i^(l-1) and h_j^(l-1) denote the layer-(l-1) hidden states of words w_i and w_j respectively, and the initial hidden states are taken from the output of GRU1;
finally, the output of word w_i at layer l of the graph convolution network is as follows:
where W^(l) and b^(l) denote trainable parameters of the l-th graph-convolution layer and σ denotes the ReLU activation function; the sentence representation with aspect information obtained after the TFGCN is denoted accordingly;
when constructing the aspect-aware sentence representation, an attention layer is introduced to improve it;
The specific operation is shown in the following formula:
α = softmax(z)
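The type-focused edge weighting described in this claim can be sketched as follows. The exact scoring function is not given in the text, so this sketch uses an assumed dependency-type embedding table and a dot-product score squashed by a sigmoid, purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(3)
d = 4
# Hypothetical embedding table for dependency types c_ij; real models
# would learn one embedding per type produced by the spaCy parser.
type_emb = {"nsubj": rng.standard_normal(d), "amod": rng.standard_normal(d)}

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def edge_weight(h_i, h_j, dep_type):
    # Assumed scoring function: combine the two words' hidden states with
    # the embedding of their dependency type c_ij, then squash to (0, 1).
    e = type_emb[dep_type]
    return sigmoid((h_i + h_j) @ e)

h_i = rng.standard_normal(d)   # GRU1 hidden state of w_i at layer l-1
h_j = rng.standard_normal(d)   # GRU1 hidden state of w_j at layer l-1
w = edge_weight(h_i, h_j, "nsubj")
```

Edges whose dependency type matters more for the aspect's sentiment (e.g. a subject or modifier relation) can thus receive a larger weight in the graph convolution.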
4. The method of claim 2, wherein the emotion analysis method comprises: step S3 calculates inter-aspect attention specifically as: and taking the sentence representation containing the inter-aspect association and the grammar dependency type as an input, passing the sentence representation through a Bi-GRU, further transmitting aspect information in the sentence representation to obtain a sentence representation containing more aspect features, taking the target aspect as a query vector, and calculating the attention between the target aspect and the non-target aspect.
5. The emotion analysis method according to claim 4, wherein the specific flow of computing inter-aspect attention is as follows: first, the sentence representation is input to another gated recurrent unit GRU2 with hidden-layer size D0; the hidden representations of the other aspects obtained through GRU2 are used, and the target-aspect representation is converted into a query vector q by a fully connected layer; the query vector is obtained as follows:
In order to obtain the correlation between the query vector and other vectors, the following method is adopted for calculation:
β = softmax(z_a)
where β_i denotes the attention score between the target aspect and the other aspects;
6. The emotion analysis method according to claim 4, wherein the associated sentence representation in step S4 is specifically: obtaining an inter-aspect correlation matrix, then using it together with the aspect-feature sentence representation as input to the graph convolution network, and obtaining the sentence representation with inter-aspect associations through the graph convolution network.
7. The method of claim 6, wherein the emotion analysis method comprises: the sentence expression of the association between the aspects comprises the following specific calculation processes:
an adjacency matrix between the aspects is first constructed to obtain the contextual dependencies between the aspects; the specific construction method is shown in the following formula:
where the first set denotes the words of the target aspect and p_t is the starting position of the target aspect;
after the inter-aspect adjacency matrix is obtained, it is fed, together with the aspect-informed hidden representation produced by GRU2, into the graph convolution network to obtain the mutual emotional features between the specific aspect and the other aspects; the hidden representation of the i-th node in the l-th layer of the graph convolution network is updated as follows:
where ReLU denotes the activation function, h_i^(l-1) denotes the hidden representation of the i-th node at layer l-1, and W^(l) and b^(l) denote trainable parameters of the l-th graph-convolution layer;
the hidden representation with the inter-aspect association information finally obtained through the graph convolution network is as follows:
8. The emotion analysis method according to claim 6, wherein step S5 specifically comprises: combining the sentence representation containing aspect features with the inter-aspect-association sentence representation, introducing an inter-aspect correlation coefficient to control the amount of inter-aspect association information introduced, and finally obtaining a sentence representation containing both the aspect features and the inter-aspect associations;
the sentence representation containing both aspect features and inter-aspect associations is obtained as follows: the above process yields a sentence representation rich in aspect information and a sentence representation Q_Inter with inter-aspect associations; to make full use of both, the two representations are combined to exploit the correlation between them, and the combined sentence representation is given by the following formula:
where the coefficient γ ∈ [0,1] indicates how much of the inter-aspect features is introduced.
9. The method of claim 8, wherein the emotion analysis method comprises the following steps: the final sentence representation of step S6 is specifically: multiplying the sentence representation containing the aspect features and the association between the aspects with the attention between the aspects to control the influence of different non-target aspects on the target aspects to obtain a final sentence representation, adding the final sentence representation with the query vector of the target aspect, and inputting the final sentence representation to a softmax layer to obtain a final representation of emotion analysis.
10. The method of claim 9, wherein the emotion analysis method comprises: the final representation for emotion analysis is embodied as:
first, a sentence with an association between aspects is represented as QInterMultiplying with inter-aspect attention beta to obtain final output of sentence representationThe query vector q for the target aspect is then added to the output to produce a query vector with abundanceThe target aspect sentence representation of the rich inter-aspect association information is then used with a softmax classifier containing C classes to obtain the final representation p for emotion analysis, as shown in the following formula:
ρ = softmax((q + o)W_ρ + b_ρ)
where the weight matrix W_ρ and bias b_ρ are trainable; after the final representation is obtained, the emotion polarity is predicted as shown in the following formula:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210373785.1A CN114781352B (en) | 2022-04-07 | Emotion analysis method based on grammar dependence type and inter-aspect association |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114781352A true CN114781352A (en) | 2022-07-22 |
CN114781352B CN114781352B (en) | 2024-06-28 |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110866405A (en) * | 2019-11-14 | 2020-03-06 | 电子科技大学 | Statement information-based aspect level emotion classification method |
US20200356724A1 (en) * | 2019-05-06 | 2020-11-12 | University Of Electronic Science And Technology Of China | Multi-hop attention and depth model, method, storage medium and terminal for classification of target sentiments |
CN111985205A (en) * | 2020-08-05 | 2020-11-24 | 重庆大学 | Aspect level emotion classification model |
CN111985245A (en) * | 2020-08-21 | 2020-11-24 | 江南大学 | Attention cycle gating graph convolution network-based relation extraction method and system |
CN112417157A (en) * | 2020-12-15 | 2021-02-26 | 华南师范大学 | Emotion classification method of text attribute words based on deep learning network |
CN112528672A (en) * | 2020-12-14 | 2021-03-19 | 北京邮电大学 | Aspect-level emotion analysis method and device based on graph convolution neural network |
US20210256355A1 (en) * | 2020-02-13 | 2021-08-19 | International Business Machines Corporation | Evolving graph convolutional networks for dynamic graphs |
CN113723075A (en) * | 2021-08-28 | 2021-11-30 | 重庆理工大学 | Specific target emotion analysis method for enhancing and counterlearning of fused word shielding data |
KR20210156152A (en) * | 2020-06-17 | 2021-12-24 | 주식회사 엔씨소프트 | Method and apparatus for relation extraction between entities |
US20220092267A1 (en) * | 2020-09-23 | 2022-03-24 | Jingdong Digits Technology Holding Co., Ltd. | Method and system for aspect-level sentiment classification by graph diffusion transformer |
Non-Patent Citations (6)
Title |
---|
WAQAR ALI et al.: "Aspect-Level Sentiment Analysis Based on Bidirectional-GRU in SIoT", IEEE ACCESS, 6 May 2021 (2021-05-06), pages 69938-69950, XP011854365, DOI: 10.1109/ACCESS.2021.3078114 * |
YING YANG et al.: "A Joint Model for Aspect-Category Sentiment Analysis with TextGCN and Bi-GRU", 2020 IEEE FIFTH INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE, 30 July 2020 (2020-07-30), pages 1-10 * |
XIA Hongbin et al.: "Graph Convolution Over-Attention (ASGCN-AOA) Model for Aspect-Specific Sentiment Analysis", Journal of Chinese Information Processing, vol. 36, no. 3, 15 March 2022 (2022-03-15), pages 1-8 * |
WANG Jianing; HE Yi; ZHU Renyu; LIU Tingting; GAO Ming: "Relation Extraction Techniques Based on Distant Supervision", Journal of East China Normal University (Natural Science), no. 05, 25 September 2020 (2020-09-25), pages 113-130 * |
HU Chaoju; ZHAO Xiaowei: "Sentiment Analysis Based on Word Vector Technology and Hybrid Neural Networks", Application Research of Computers, vol. 35, no. 12, 12 December 2017 (2017-12-12), pages 3556-3559 * |
MA Xiang: "Research on Aspect-Level Sentiment Analysis of Review Texts Based on Deep Learning", Wanfang Data, 31 May 2023 (2023-05-31), pages 1-30 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115936077A (en) * | 2022-12-30 | 2023-04-07 | 湖北工业大学 | Dependency tree based aspect level emotion analysis interactive convolution network |
CN115936077B (en) * | 2022-12-30 | 2023-09-15 | 湖北工业大学 | Dependency tree-based aspect-level emotion analysis interactive convolution network |
Legal Events

Date | Code | Title | Description
---|---|---|---
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| GR01 | Patent grant | |