CN113705197A - Fine-grained emotion analysis method based on position enhancement - Google Patents
Fine-grained emotion analysis method based on position enhancement
- Publication number
- CN113705197A (application number CN202111000430.XA)
- Authority
- CN
- China
- Prior art keywords
- word
- layer
- vector
- words
- fine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Mathematical Physics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Machine Translation (AREA)
Abstract
The invention provides a fine-grained emotion analysis method based on position enhancement, which addresses the low accuracy of fine-grained emotion analysis of text in the prior art. The text is first preprocessed and then analyzed by a fine-grained emotion analysis model comprising an embedding layer, a semantic representation layer, an information interaction layer, and an output layer. The embedding layer maps sentences into context word embeddings and aspect word embeddings; the semantic representation layer strengthens the model's text semantic representation through a position-enhanced attention mechanism; the information interaction layer strengthens the interaction between aspect words and their contexts through a memory network, using the aspect-based semantically enhanced context representation as the external memory unit that interacts with the aspect, so that the external memory unit can learn semantic information in complex text; finally, the output layer predicts the emotion. By reasonably delimiting the context range that expresses emotion toward an aspect, the invention improves the accuracy of fine-grained emotion analysis.
Description
Technical Field
The invention belongs to the technical field of information processing, and relates to a fine-grained emotion analysis method based on position enhancement.
Background
The rapid development of social networks and e-commerce platforms allows people to voice opinions online more conveniently, generating a large amount of text data containing user emotion information of great practical value. Because a product has multidimensional attributes, consumers comment on products and services from different angles, such as quality, price, and service. Traditional text emotion analysis generally produces a single overall emotion judgment and cannot judge the emotional tendency toward different aspects mentioned in a review, so the granularity of text emotion analysis needs to be refined. For example, for the sentence "The food in this restaurant is delicious, but the service quality is poor.", the fine-grained emotion analysis task aims to judge the emotion tendencies (food, positive) and (service, negative) of the aspects "food" and "service". In this example, "food" and "service" are called aspects, and the remaining text that does not belong to any aspect is referred to as the context. This task is a popular research direction in the field of natural language processing; it benefits consumer choice and enterprise product decisions, and has broad commercial prospects and application value.
As deep learning technology has matured, it has been applied effectively to text emotion analysis. Research on fine-grained text emotion analysis has focused mainly on improving the basic structure of neural networks. Thanks to the attention mechanism, deep learning models have gained stronger text representation and data processing capabilities and have made good progress on fine-grained emotion analysis tasks. However, building text feature representations for specific aspects remains challenging due to the complexity of linguistic structure and implicit expressions of emotional attributes.
Disclosure of Invention
To solve the problem of low accuracy in fine-grained emotion analysis of text in the prior art, the invention provides a fine-grained emotion analysis method based on position enhancement. The method preprocesses the text, constructs word vectors, and performs emotion analysis with a fine-grained emotion analysis model consisting of an embedding layer, a semantic representation layer, an information interaction layer, and an output layer. First, the embedding layer maps the sentences of the text into context word embedding and aspect word embedding representations according to the word vectors. Second, the semantic representation layer enhances the model's text semantic representation capability through a position-enhanced attention mechanism. Third, the information interaction layer enhances the interaction between aspect words and their contexts through a memory network, taking the aspect-specific semantically enhanced context representation as the external memory unit that interacts with the aspect, so that the external memory unit can learn semantic information in complex text. Finally, the output layer predicts the emotion of the specific aspect words.
To achieve this purpose, the invention adopts the following technical scheme:
A fine-grained emotion analysis method based on position enhancement comprises the following steps:
step 1 text preprocessing
(1) Case conversion: all upper-case letters are converted to lower-case letters.
(2) Word segmentation: the text data set is segmented into words using a general-purpose word segmentation module.
(3) Removing stop words: words with no practical meaning are removed from the text data.
(4) Constructing a position weight matrix M for the text data using a masking mechanism, with the weight computed from the positions of words in the sentence:

M_ij = 1 - |i - j| / h_max, if |i - j| ≤ h_max / 2; otherwise M_ij = 0 (formula 1)

where h_max is the maximum length of the input sentence. The masking mechanism computes the position weight from the relative position of the aspect word and the context word in the sentence; M_ij denotes the position weight of the word pair centered on word w_i, and i and j are the position indices of the words. Word pairs within a distance of h_max/2 are weighted according to their distance; otherwise M_ij is set to 0.
Step 2, constructing word vectors
Each word of the preprocessed text data is mapped into a vector space, so that each word corresponds to a vector of dimension d. For each word in the text data, if the word exists in the pre-trained word vector table, the corresponding vector in the table is used as its word vector; if not, a vector randomly initialized from a normal distribution is used.
Step 3, constructing a fine-grained emotion analysis model and using it to perform aspect-level fine-grained emotion prediction on the text to be analyzed. The model is composed as follows:
3.1 embedding layer
Each sentence in the text data contains one or more aspects, and each aspect has a corresponding emotion value. According to the word vectors constructed in step 2, each sentence is mapped into low-dimensional dense word embedding representations. A sentence consists of aspects and their context, where the context refers to the parts of the sentence that do not belong to any aspect; correspondingly, the word embedding representation is divided into context word embedding and aspect word embedding. If an aspect is composed of multiple words, the word embedding vectors of those words are average-pooled as the vector representation of the aspect.
3.2 semantic representation layer
The semantic representation layer extracts a high-level abstract representation of the text. It is composed of Blocks connected in series and obtains deeper abstract features H of the text through continuous iterative computation. Each Block combines a position weight fusion mechanism and a feedforward neural network through residual connections and layer normalization. The output of each of the two submodules in a single Block can be expressed formally as follows:
output = LayerNorm(x + Sublayer(x)) (formula 2)
where LayerNorm(·) is layer normalization and Sublayer(·) is the function implemented by the submodule itself. Each Block is designed in detail as follows:
(1) A self-attention mechanism is used. The context word embedding E is linearly mapped into three different spaces to obtain the corresponding query matrix Q, key matrix K, and value matrix V, and a key-value-pair attention mechanism computes the interdependencies between context word pairs from the query and key vectors. The i-th word embedding e_i (i = 1, 2, …, n) in E is mapped to a query vector q_i, a key vector k_i, and a value vector v_i, where n denotes the number of context words. The linear mapping process is expressed as follows:

Q = E W_q, K = E W_k, V = E W_v (formula 3)

where W_q, W_k, and W_v are the parameter matrices of the linear mappings, and Q, K, and V are the matrices formed by the query vectors q_i, key vectors k_i, and value vectors v_i, respectively.
(2) A position weight fusion mechanism is used. In the self-attention mechanism, weight enhancement and weight masking are applied to the relevance weight matrix through the position weight matrix M, strengthening information highly relevant to the aspect words and weakening the influence of irrelevant or erroneous emotion information. Whether a weight is enhanced or masked is measured by the distance of the context word from the aspect.
First, each query vector q_i is scored against each key vector k_j with a compatibility function f, yielding the association score matrix S ∈ R^(n×n) between word pairs. The association score S_ij of query vector q_i and key vector k_j is expressed as follows:

S_ij = f(q_i, k_j) = w^T σ(W q_i + V k_j), i, j ∈ {1, 2, …, n} (formula 4)

where W, V, and w are parameters to be trained, σ is the Sigmoid activation function, and n is the number of context words.
The position weight M_ij is then added to the association score f(q_i, k_j) to strengthen or weaken the influence of the context on the aspect, and the value vectors v_i are aggregated to extract the features head = [h_1, h_2, …, h_n]:

A_m = softmax(S + βM) (formula 5)

h_i = Σ_{j=1}^{n} A_m[i, j] v_j (formula 6)

where A_m ∈ R^(n×n) is the position-fused weight matrix constructed from S and βM, M_ij is the position weight of the word pair, β is the expansion coefficient of the position weight, and n is the number of context words.
(3) A multi-head self-attention mechanism is used to capture different aspects of the textual information. The calculation is as follows:

LF-MultiHead(Q, K, V) = Concat(head_1, …, head_h) W_O (formula 7)

where Concat(·) denotes vector concatenation, W_O is a linear compression transformation that compresses the matrix formed by the multi-head attention mechanism back to the original dimension, h is the number of heads, and head_i is the feature extracted by the self-attention mechanism of the i-th head;
(4) Residual connection and layer normalization operations are used. The residual connection takes the context word embedding as input, fuses it with the result of the multi-head self-attention mechanism, and applies layer normalization to obtain the output of the position weight fusion mechanism.
(5) The output of the position weight fusion mechanism is taken as the input of the feedforward neural network layer, and after another residual connection and layer normalization the result is the final output of the Block. The feedforward neural network is implemented as a fully connected layer with a Relu activation function.
3.3 information interaction layer
A memory network is used for interaction, enhancing the relation between the abstract features H and the aspect and ensuring the interactivity of the aspect and its context. The memory network takes the abstract text features H = [h_1, h_2, …, h_n] obtained by the semantic representation layer as memory units. The first computation layer takes the aspect word embedding v_aspect and the weighted combination r of the memory units as initial input; the output of each layer serves as the input of the next computation layer, and the layers are computed iteratively in turn. The weighted combination formula is as follows:

r = Σ_{i=1}^{n} α_i h_i (formula 8)

where n is the memory size and α_i ∈ [0, 1] is the weight of memory unit h_i, with Σ_{i=1}^{n} α_i = 1. The weight α_i reflects the semantic relevance of the context to the aspect, computed with a feedforward neural network as follows:

ω_i = tanh(W_att[h_i; v_aspect] + b_att) (formula 9)

where W_att ∈ R^(1×2n) is a parameter vector and b_att ∈ R^(1×1) is the bias; α_i, the weight assigned to memory unit h_i, is obtained by normalizing the scores ω_i so that they sum to 1.
3.4 output layer
After the aspect information has interacted with the memory units several times, the resulting final representation serves as the emotion feature for the specific aspect and is fed into the softmax function to obtain the emotion distribution and predict the aspect emotion.
The model needs to be trained before prediction, as follows:
the fine-grained emotion analysis model is trained by minimizing the cross-entropy loss function and the L2 regularization term, with the entire loss function optimizing the model parameters by gradient descent. The loss function is as follows:
wherein D is a training data set, c represents the context of the sentence, e represents the aspect of the sentence, l represents the emotion label of the aspect, S is an emotion category set, ysFor one-hot codes generated according to emotion classes, fsAnd (c, e and theta) are prediction emotion distribution of the model, lambda is a regularization term coefficient, strength of control regularization is strong or weak, and theta is a model weight coefficient.
Advantageous effects
(1) The influence of position information on emotion prediction for a specific aspect is fully considered: the position-enhanced attention mechanism delimits the context range that expresses emotion toward an aspect, strengthening the model's aspect-specific text semantic representation. By reasonably delimiting this context range, the invention enhances the semantic representation and information interaction of the emotion analysis system and improves fine-grained emotion analysis accuracy.
(2) The aspect-specific semantically enhanced context representation serves as the external memory unit interacting with the aspect, overcoming the difficulty that a memory network whose external memory is built on single word vectors struggles to learn the semantic information of more complex text.
(3) The invention enables parallel computation during model training, improving training efficiency.
Drawings
FIG. 1 is a flow chart of a model structure;
FIG. 2 is a diagram of a location embedding model architecture;
FIG. 3 is a structure of a position-enhanced fine-grained sentiment analysis model.
Detailed Description
The following examples are intended to illustrate the present invention but are not intended to limit the scope of the invention.
To verify the effectiveness of the model, the invention uses the public laptop-domain and restaurant-domain data sets from the SemEval-2014 Task 4 semantic evaluation competition and the multi-aspect emotion data set MAMS as training corpora. The specific implementation steps are as follows:
step 1 text preprocessing
Firstly, text preprocessing is carried out on a training corpus, and the processing steps are as follows:
(1) Case conversion: all upper-case letters are converted to lower-case letters.
(2) Word segmentation: the text data set is segmented into words using a general-purpose word segmentation module.
(3) Removing stop words: words with no practical meaning are removed from the text data.
(4) Constructing a position weight matrix M: the weight is computed from the relative positions of words in the sentence:

M_ij = 1 - |i - j| / h_max, if |i - j| ≤ h_max / 2; otherwise M_ij = 0 (formula 1)

where h_max is the maximum length of the input sentence. The masking mechanism computes the position weight from the relative positions of the aspect words and the context words in the sentence; M_ij denotes the position weight of the word pair centered on word w_i, and i and j are the position indices of the words. Word pairs within a distance of h_max/2 are weighted according to their distance; otherwise M_ij is set to 0.
Take the sentence "The food in this restaurant is delicious, but the service is really poor." as an example of constructing the position weight matrix M. The sentence has length 13, with aspects "food" and "service". After removing stop words, the sentence is represented as s = {(food, 2), (restaurant, 5), (delicious, 7), (service, 10), (really, 12), (poor, 13)}. The position weight of the word "poor" relative to the aspect "service" is M_{10,13} = 1 - |10 - 13| / 13 = 0.769.
The aspect words and context words are already annotated in the training corpus; automatic recognition of aspect words is outside the research scope of the invention.
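To make the construction concrete, the following Python sketch (illustrative only, not part of the patent; the function name and 0-based indexing are choices of this sketch) builds M and reproduces the 0.769 weight from the example above.

```python
import numpy as np

def position_weight_matrix(h_max: int) -> np.ndarray:
    """Position weight matrix M of formula 1 (a sketch).

    M[i, j] = 1 - |i - j| / h_max if |i - j| <= h_max / 2, else 0.
    Indices are 0-based here; the example above counts from 1, which
    leaves the distance |i - j| unchanged.
    """
    M = np.zeros((h_max, h_max))
    for i in range(h_max):
        for j in range(h_max):
            if abs(i - j) <= h_max / 2:
                M[i, j] = 1.0 - abs(i - j) / h_max
    return M

M = position_weight_matrix(13)
# "service" at position 10 and "poor" at position 13 (1-based):
print(round(M[9, 12], 3))  # 0.769, matching the worked example
```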
Step 2: word vector construction
For the preprocessed training samples, each word is mapped into the vector space through GloVe word vectors, so each word corresponds to a vector of dimension d = 300. For each word in a training sample, if the word exists in the pre-trained word vector table, the vector in the table is used as its word vector; if not, the word vector is randomly initialized from the uniform distribution U(-0.25, 0.25).
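A minimal sketch of this lookup, assuming the pre-trained GloVe vectors have already been loaded into a Python dict mapping each word to its vector (the loading step and all names here are illustrative):

```python
import numpy as np

def build_word_vectors(vocab, glove, dim=300, seed=0):
    """Assign each vocabulary word a d=300 vector (a sketch).

    Words found in the pre-trained table keep their GloVe vector;
    out-of-vocabulary words get a U(-0.25, 0.25) random
    initialization, as described above.
    """
    rng = np.random.default_rng(seed)
    return {w: glove[w] if w in glove
            else rng.uniform(-0.25, 0.25, size=dim)
            for w in vocab}
```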
Step 3, constructing the fine-grained emotion analysis model and using the trained model to perform fine-grained emotion analysis on the text to be analyzed.
3.1 embedding layer
First, for a sentence s = [w_1, w_2, …, w_n], each word w_i (i = 1, 2, …, n) is mapped by the GloVe word embedding technique to a low-dimensional dense vector as its word embedding representation. The word embeddings are divided into the context word embedding E = [e_1, e_2, …, e_n] and the aspect word embedding v_aspect; if an aspect is composed of multiple words, the vector representations of those words are average-pooled as the vector representation of the aspect.
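For a multi-word aspect, the pooling is simply the element-wise mean of the word embeddings; in sketch form (names illustrative):

```python
import numpy as np

def aspect_embedding(word_vectors):
    """Average-pool the embeddings of a multi-word aspect (a sketch)."""
    return np.mean(np.stack(word_vectors), axis=0)

# e.g. v_aspect = aspect_embedding([vectors["service"], vectors["quality"]])
```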
3.2 semantic representation layer
The semantic representation layer is composed of 6 Blocks. Each Block connects two submodules in series, a position weight fusion mechanism and a feedforward neural network, each followed by a residual connection and layer normalization. The output of each sublayer can be expressed formally as follows:

output = LayerNorm(x + Sublayer(x)) (formula 2)

where LayerNorm(·) is layer normalization, Sublayer(·) is the function implemented by the sublayer itself, and x is the input feature.
The specific design steps of the Block are as follows:
First, the context word embedding E is linearly mapped into three different spaces; each word embedding e_i (i = 1, 2, …, n) yields the corresponding query vector q_i, key vector k_i, and value vector v_i. The linear mapping process is expressed as follows:

Q = E W_q, K = E W_k, V = E W_v (formula 3)

where W_q, W_k, and W_v are the parameter matrices of the linear mappings, and Q, K, and V are the matrices formed by the query vectors q_i, key vectors k_i, and value vectors v_i, respectively.
Second, position weight fusion is performed. Weight enhancement and weight masking are added to the self-attention mechanism; whether a weight is enhanced or masked is measured by its distance from the aspect words, strengthening information highly relevant to the aspect words and weakening the influence of irrelevant or erroneous emotion information. First, each query vector q_i is scored against each key vector k_j with a compatibility function f, yielding the association score matrix S ∈ R^(n×n) between word pairs, expressed as follows:

S_ij = f(q_i, k_j) = w^T σ(W q_i + V k_j), i, j ∈ {1, 2, …, n} (formula 4)

where W, V, and w are parameters to be trained and σ is the Sigmoid activation function.
The position weight M_ij is then added to the association score f(q_i, k_j) to strengthen or weaken the influence of the context on the aspect, and the value vectors v_i are aggregated to extract the features head = [h_1, h_2, …, h_n]:

A_m = softmax(S + βM) (formula 5)

h_i = Σ_{j=1}^{n} A_m[i, j] v_j (formula 6)

where A_m ∈ R^(n×n) is the position-fused weight matrix, M_ij is the relative position weight of the word pair, and β is the expansion coefficient of the position weight, set to 10.
To capture different aspects of the textual information, the self-attention mechanism is expanded into a multi-head self-attention mechanism, expressed as follows:

LF-MultiHead(Q, K, V) = Concat(head_1, …, head_h) W_O (formula 7)

where Concat(·) denotes vector concatenation, W_O is a linear compression transformation, h is the number of heads, and head_i is the feature extracted by the self-attention mechanism of the i-th head;
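The following sketch puts formulas 3-6 together for a single attention head. It is illustrative: parameter shapes are assumptions, and the softmax over the position-fused scores is a reconstruction of the garbled formulas 5-6 rather than a verbatim quote of the patent. Formula 7's multi-head version would run h such heads with separate parameters, concatenate their outputs, and compress with W_O.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def position_fused_attention(E, M, Wq, Wk, Wv, W, V, w, beta=10.0):
    """One head of the position weight fusion mechanism (a sketch).

    E: n x d context embeddings; M: n x n position weight matrix.
    Formula 3 maps E to queries/keys/values, formula 4 scores word
    pairs additively, and the scaled position weights beta * M are
    fused into the scores before normalization (formulas 5-6).
    """
    Q, K, Vm = E @ Wq, E @ Wk, E @ Wv            # formula 3
    n = E.shape[0]
    S = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            S[i, j] = w @ sigmoid(W @ Q[i] + V @ K[j])  # formula 4
    A = softmax(S + beta * M, axis=-1)           # position-fused weights
    return A @ Vm                                # aggregated features h_i
```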
Finally, the output obtained by the position weight fusion mechanism from the word embeddings is used as the input of the feedforward neural network layer, and the output of the Block is obtained after residual connection and layer normalization. The feedforward neural network is implemented as a fully connected layer with a Relu activation function, expressed as follows:

FFN(x) = Relu(x W_1 + b_1) W_2 + b_2 (formula 8)

where W_1 and W_2 are training parameters and b_1 and b_2 are bias terms.
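One Block then chains the two sublayers with formula 2's residual-plus-layer-normalization wrapper; a sketch under the same illustrative assumptions (the layer normalization here omits the learned gain and bias for brevity):

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    """Per-position layer normalization (no learned gain/bias here)."""
    mu = x.mean(axis=-1, keepdims=True)
    sd = x.std(axis=-1, keepdims=True)
    return (x - mu) / (sd + eps)

def ffn(x, W1, b1, W2, b2):
    """Formula 8: FFN(x) = Relu(x W1 + b1) W2 + b2."""
    return np.maximum(x @ W1 + b1, 0.0) @ W2 + b2

def block(x, attention_fn, ffn_params):
    """One Block: each sublayer wrapped as LayerNorm(x + Sublayer(x))."""
    x = layer_norm(x + attention_fn(x))      # position weight fusion sublayer
    x = layer_norm(x + ffn(x, *ffn_params))  # feedforward sublayer
    return x
```

Stacking six such calls, each Block consuming the previous one's output, gives the iterative computation of the abstract feature H described below.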
The whole semantic representation layer is continuously calculated in an iterative mode through the serial connection among a plurality of Block layers, so that a deeper abstract feature H of the text is obtained, and emotion semantic information aiming at a specific aspect is deeply mined.
3.3 information interaction layer
The information interaction layer uses a memory network to ensure the interactivity of aspects and their contexts. The text features H = [h_1, h_2, …, h_n] extracted by the position-enhanced semantic representation layer serve as memory units, and the layer consists of 3 computation layers. The aspect word vector v_aspect is the initial input; a weighted combination r is adaptively selected from the context hidden states H as the input of the next computation layer. The weighted combination formula is as follows:

r = Σ_{i=1}^{n} α_i h_i (formula 9)

where n is the memory size and α_i ∈ [0, 1] is the weight of memory unit h_i, with Σ_{i=1}^{n} α_i = 1. Since different parts of the context affect the emotion judgment of a particular target differently, the semantic relevance between the aspect and the context is computed with a feedforward neural network, and weights are adaptively assigned to each memory unit h_i according to its semantic relation to the aspect. The scoring function is computed as follows:

ω_i = tanh(W_att[h_i; v_aspect] + b_att) (formula 10)

where W_att ∈ R^(1×2n) is a parameter vector and b_att ∈ R^(1×1) is the bias; α_i, the weight assigned to memory unit h_i, is obtained by normalizing the scores ω_i so that they sum to 1.
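A sketch of the three computation layers follows. It is illustrative: the patent fixes Σα_i = 1 but does not spell out the normalizer, so the softmax here, and carrying the retrieved summary r forward as the next layer's query, are assumptions.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def memory_hops(H, v_aspect, W_att, b_att, hops=3):
    """Information interaction layer as 3 memory-network hops (a sketch).

    H is the n x d memory [h_1, ..., h_n]; formula 10 scores each
    memory unit against the current aspect representation, the scores
    are normalized into weights alpha, and formula 9 retrieves the
    weighted combination r used as the next layer's input.
    """
    v = v_aspect
    for _ in range(hops):
        omega = np.array([np.tanh(W_att @ np.concatenate([h, v]) + b_att)
                          for h in H]).reshape(-1)
        alpha = softmax(omega)   # memory-unit weights, sum to 1
        v = alpha @ H            # weighted combination r (formula 9)
    return v                     # final aspect-specific representation
```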
3.4 output layer
After the aspect information has interacted with the memory units several times, the resulting final representation serves as the emotion feature for the specific target and is input to the softmax layer to predict the aspect emotion.
Before emotion analysis, the fine-grained emotion analysis model must be trained. It is trained by minimizing a cross-entropy loss function with an L2 regularization term, and the whole loss function optimizes the model parameters by gradient descent:

Loss = -Σ_{(c,e,l)∈D} Σ_{s∈S} y_s log f_s(c; e; θ) + λ‖θ‖^2 (formula 11)

where S is the set of emotion categories, D is the training data set, y_s is the one-hot code generated from the emotion category, f_s(c; e; θ) is the predicted emotion distribution of the model, λ is the regularization coefficient controlling the strength of regularization (set to 0.001), and θ are the model weights. The L2 regularization attenuation coefficient is set to 10e-4. All weight matrices are randomly initialized from the uniform distribution U(-0.01, 0.01), and the biases are initialized to 0.
Step 4 Experimental analysis
To verify the performance of the model, experiments were carried out on the three data sets Restaurant, Laptop, and MAMS, and the results were compared with other baseline models to verify the effectiveness of the method.
TABLE 1 comparison of the results
Claims (6)
1. A fine-grained emotion analysis method based on position enhancement is characterized by comprising the following steps:
step 1, text preprocessing;
step 2, word vector construction: mapping each word in the preprocessed text data to a vector space to obtain a word vector of each word;
step 3, performing aspect-level fine-grained emotion prediction on the text to be analyzed using a fine-grained emotion analysis model, wherein the fine-grained emotion analysis model comprises an embedding layer, a semantic representation layer, an information interaction layer, and an output layer,
the specific prediction process is as follows:
firstly, the embedding layer maps sentences in the text into context word embedding and aspect word embedding according to the word vectors obtained in the step 2;
then, the semantic representation layer applies a self-attention mechanism to the context word embedding, enhanced with aspect word position information to strengthen the model's text semantic representation capability;
next, enhancing the interactivity of the aspect words and the contexts thereof through an information interaction layer;
and finally, the output layer predicts the emotion of the aspect words.
2. The fine-grained emotion analysis method based on location enhancement according to claim 1, wherein:
step 1 the text preprocessing comprises the following steps:
(1) case conversion: converting all existing upper-case letters into lower-case letters;
(2) word segmentation: performing word segmentation on the text data by adopting a general language word segmentation module;
(3) removing stop words: removing some words without practical meaning in the text data;
(4) constructing a position weight matrix M for the text data using a masking mechanism, with the calculation formula:

M_ij = 1 - |i - j| / h_max, if |i - j| ≤ h_max / 2; otherwise M_ij = 0 (formula 1)

where h_max is the maximum length of the input sentence; the masking mechanism computes the position weight from the relative positions of the aspect words and the context words in the sentence, M_ij denotes the position weight of the word pair centered on word w_i, and i and j are the position indices of the words; word pairs within a distance of h_max/2 are weighted according to their distance, and otherwise M_ij is set to 0.
3. The fine-grained emotion analysis method based on location enhancement according to claim 1, characterized in that the word vector in step 2 is obtained as follows: for each word in the text data, if the word exists in the pre-trained word vector table, the vector in the table is used as its word vector; if not, a vector randomly initialized from a normal distribution is used as its word vector.
4. The fine-grained emotion analysis method based on location enhancement according to claim 1, wherein: the specific steps of each layer of the fine-grained emotion analysis model are as follows:
the embedding layer obtains the word vector of each word according to step 2 and maps the sentences of the text into word embedding representations in the form of low-dimensional dense vectors; a sentence with annotated aspects is regarded as being composed of aspects and their context, and the corresponding word embedding representation is divided into context word embedding and aspect word embedding; if an aspect is composed of multiple words, the vector representations of those words are average-pooled as the vector representation of the aspect;
the semantic representation layer extracts a high-level abstract representation of the text; its network structure is composed of K Blocks connected in series, and deeper abstract features H of the text are obtained through continuous iterative computation, wherein each Block adds position weights to the attention mechanism and applies residual connection, layer normalization, and a feedforward neural network layer;
the information interaction layer uses a memory network for interaction, enhancing the relation between the abstract features H and the aspect and ensuring the interactivity between the aspect and its context; the memory network takes the abstract text features H = [h_1, h_2, …, h_n] obtained by the semantic representation layer as memory units and is composed of L computation layers, wherein the first computation layer takes the aspect word embedding v_aspect and the weighted combination r of the memory units as initial input, the output of each memory layer serves as the input of the next computation layer, and the layers are computed iteratively in turn; the weighted combination formula is as follows:

r = Σ_{i=1}^{n} α_i h_i (formula 2)

where n is the memory size and α_i ∈ [0, 1] is the weight of memory unit h_i, with Σ_{i=1}^{n} α_i = 1; the weight α_i reflects the semantic relevance of the context to the aspect, computed with a feedforward neural network as follows:

ω_i = tanh(W_att[h_i; v_aspect] + b_att) (formula 3)

where W_att ∈ R^(1×2n) is a parameter vector and b_att ∈ R^(1×1) is the bias; α_i, the weight assigned to memory unit h_i, is obtained by normalizing the scores ω_i;
the output layer takes the result of the information interaction layer as input and predicts the emotion of the aspect using the softmax function.
5. The fine-grained emotion analysis method based on location enhancement according to claim 4, characterized by comprising the steps of: the Block of the semantic representation layer in the step 3 is designed as follows:
(1) using a self-attention mechanism: the context word embedding E is linearly mapped into three different spaces to obtain the corresponding query matrix Q, key matrix K, and value matrix V, wherein the i-th word embedding e_i (i = 1, 2, …, n) in E is mapped to a query vector q_i, a key vector k_i, and a value vector v_i, where n denotes the number of context words; the linear mapping process is expressed as follows:

q_i = W_q e_i, k_i = W_k e_i, v_i = W_v e_i (formula 4)

Q = [q_1, …, q_n], K = [k_1, …, k_n], V = [v_1, …, v_n] (formula 5)

where W_q, W_k, and W_v are the parameter matrices of the linear mappings, and Q, K, and V are the matrices formed by the query vectors q_i, key vectors k_i, and value vectors v_i, respectively;
(2) using a position weight fusion mechanism: weight enhancement and weight masking are added to the self-attention mechanism to strengthen information highly relevant to the aspect words and weaken the influence of irrelevant or erroneous emotion information; whether a weight is enhanced or masked is measured by the distance between the context word and the aspect word, specifically as follows:
first, each query vector q_i is scored against each key vector k_j with a compatibility function f, yielding the association score matrix S ∈ R^(n×n) between word pairs; the association score S_ij of query vector q_i and key vector k_j is expressed as follows:

S_ij = f(q_i, k_j) = w^T σ(W q_i + V k_j), i, j ∈ {1, 2, …, n} (formula 6)

where W, V, and w are parameters to be trained, σ is the Sigmoid activation function, and n is the number of context words;
the position weight M_ij is then added to the association score f(q_i, k_j) to strengthen or weaken the influence of the context on the aspect, and the value vectors v_i are aggregated to extract the features head = [h_1, h_2, …, h_n]:

A_m = softmax(S + βM) (formula 7)

h_i = Σ_{j=1}^{n} A_m[i, j] v_j (formula 8)

where A_m ∈ R^(n×n) is the position-fused weight matrix, M_ij is the relative position weight of the word pair, n is the number of context words, and β is the expansion coefficient of the position weight;
(3) using a multi-head self-attention mechanism to capture different aspects of the textual information, calculated as follows:

LF-MultiHead(Q, K, V) = Concat(head_1, …, head_h) W_O (formula 9)

where Concat(·) denotes vector concatenation, W_O is a linear compression transformation that compresses the matrix formed by the multi-head attention mechanism back to the original dimension, h is the number of heads, and head_i is the feature extracted by the self-attention mechanism of the i-th head;
(4) using residual connection and layer normalization operations: the residual connection takes the context word embedding as input, fuses it with the result of the multi-head self-attention mechanism, and applies layer normalization to obtain the output of the position weight fusion mechanism;
(5) taking the output of the position weight fusion mechanism as the input of the feedforward neural network layer and applying residual connection and layer normalization again, the result being the final output of the Block; the feedforward neural network is implemented as a fully connected layer with a Relu activation function.
6. The fine-grained emotion analysis method based on location enhancement according to claim 1, characterized by training the fine-grained emotion analysis model as follows: the model is trained by minimizing a cross-entropy loss function with an L2 regularization term, and the whole loss function optimizes the model parameters by gradient descent; the loss function is as follows:

Loss = -Σ_{(c,e,l)∈D} Σ_{s∈S} y_s log f_s(c; e; θ) + λ‖θ‖^2 (formula 11)

where D is the training data set, c denotes the context of a sentence, e denotes an aspect of the sentence, l denotes the emotion label of the aspect, S is the set of emotion categories, y_s is the one-hot code generated from the emotion category, f_s(c; e; θ) is the predicted emotion distribution of the model, λ is the regularization coefficient controlling the strength of regularization, and θ are the model weights.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111000430.XA CN113705197B (en) | 2021-08-30 | 2021-08-30 | Fine granularity emotion analysis method based on position enhancement |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111000430.XA CN113705197B (en) | 2021-08-30 | 2021-08-30 | Fine granularity emotion analysis method based on position enhancement |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113705197A true CN113705197A (en) | 2021-11-26 |
CN113705197B CN113705197B (en) | 2024-04-02 |
Family
ID=78656373
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111000430.XA Active CN113705197B (en) | 2021-08-30 | 2021-08-30 | Fine granularity emotion analysis method based on position enhancement |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113705197B (en) |
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109522548A (en) * | 2018-10-26 | 2019-03-26 | 天津大学 | A kind of text emotion analysis method based on two-way interactive neural network |
US20200356724A1 (en) * | 2019-05-06 | 2020-11-12 | University Of Electronic Science And Technology Of China | Multi-hop attention and depth model, method, storage medium and terminal for classification of target sentiments |
CN110472042A (en) * | 2019-07-02 | 2019-11-19 | 桂林电子科技大学 | A kind of fine granularity sensibility classification method |
WO2021164199A1 (en) * | 2020-02-20 | 2021-08-26 | 齐鲁工业大学 | Multi-granularity fusion model-based intelligent semantic chinese sentence matching method, and device |
CN112100376A (en) * | 2020-09-11 | 2020-12-18 | 湖南大学 | Mutual enhancement conversion network for fine-grained emotion analysis |
Non-Patent Citations (3)
Title |
---|
XINYU CAO 等: "Microblog-oriented Multi-scale CNN Multi-label Sentiment Classification Model", 2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT), 24 June 2021 (2021-06-24) * |
YINGHONG SUN 等: "A High-Dimensional and Multi-granularity Feature Selection Method Based on CNN and RF", ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, 31 January 2020 (2020-01-31) * |
徐德华 等: "基于深度记忆网络的在线评论细粒度情感分类", 电子制作, no. 01, 1 January 2020 (2020-01-01) * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118013045A (en) * | 2024-04-02 | 2024-05-10 | 深圳市奥福德电子科技有限公司 | Sentence emotion detection method and device based on artificial intelligence |
CN118013045B (en) * | 2024-04-02 | 2024-06-18 | 深圳市奥福德电子科技有限公司 | Sentence emotion detection method and device based on artificial intelligence |
Also Published As
Publication number | Publication date |
---|---|
CN113705197B (en) | 2024-04-02 |
Legal Events
Date | Code | Title | Description |
---|---|---|---
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| GR01 | Patent grant | |