CN114764564A - Aspect-level emotion polarity classification method based on fusion linguistic knowledge - Google Patents

Aspect-level emotion polarity classification method based on fusion linguistic knowledge

Info

Publication number
CN114764564A
CN114764564A (application CN202210465093.XA)
Authority
CN
China
Prior art keywords
layer
speech
linguistic knowledge
test
equal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210465093.XA
Other languages
Chinese (zh)
Inventor
王笛
田玉敏
万波
岳瑞峰
王泉
罗雪梅
王义峰
安玲玲
潘蓉
赵辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University
Priority to CN202210465093.XA
Publication of CN114764564A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/243 Classification techniques relating to the number of classes
    • G06F18/2431 Multiple classes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/25 Fusion techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/253 Grammatical analysis; Style critique
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses an aspect-level emotion polarity classification method based on fused linguistic knowledge, which comprises the following steps: (1) establishing a training sample set and a test sample set; (2) building an aspect-level emotion polarity classification model based on fused linguistic knowledge; (3) performing iterative training on the aspect-level emotion polarity classification model; (4) obtaining the aspect-level emotion polarity classification results. The invention constructs an aspect-level emotion polarity classification network based on fused linguistic knowledge: it analyzes and extracts specific linguistic knowledge from comment data using existing SOTA models, builds a linguistic knowledge fusion network based on a graph neural network and an attention mechanism, and fuses the linguistic knowledge implied in the comment data into the final word representations, thereby improving the accuracy of aspect-level emotion polarity classification.

Description

Aspect-level emotion polarity classification method based on fusion linguistic knowledge
Technical Field
The invention belongs to the technical field of natural language processing, relates to a text emotion polarity classification method, and particularly relates to an aspect-level emotion polarity classification method based on fusion linguistic knowledge.
Background
Today, people often express emotions such as joy, anger, sorrow, and happiness by posting comments on the Internet. The sentiment information contained in these comments can help consumers quickly judge whether a certain service is worth purchasing, and can also help service providers improve their products. However, it is difficult to determine the sentiment expressed by a comment, especially when the same comment contains diametrically opposite sentiment information for different aspects of the same thing. Therefore, how to accurately and efficiently mine the potential fine-grained subjective sentiment in comment data is an urgent problem in the current field of text sentiment analysis.
The aspect-level sentiment polarity classification method is a powerful tool for mining potential fine-grained subjective sentiment information in comment data. Aspect-level sentiment polarity classification (ASC) judges the sentiment polarity of a given aspect word in a comment sentence, where an aspect word is a word or phrase describing a specific aspect of an entity in the comment sentence, and the sentiment polarity is negative, neutral, or positive. For example, a comment sentence about a certain historical relic, "beautiful in shape but not highly famous", implies two diametrically opposite sentiment polarities, "positive" and "negative", for the given aspect words "shape" and "famous", respectively.
As researchers have continued to study the aspect-level emotion polarity classification task in depth, a number of aspect-level emotion polarity classification methods have been proposed:
the patent document of Chongqing university at its application, "an aspect level sentiment classification model" (application No.: 202010778078.1, application publication No.: CN 111985205A) discloses an aspect level sentiment polarity classification model. The method comprises the steps of firstly converting sentences into word vectors by using an embedding layer, then inputting the word vectors into a neural network layer based on a gating cycle unit GRU to convert the word vectors into corresponding hidden state sequences, and then capturing information important for the emotion polarity of a given aspect from the hidden state sequences by using a multi-head attention mechanism MHA instead of a common cyclic neural network RNN, so that the relation between the sentences and the given aspect is strengthened. However, the invention has the defect that the RNN is simply replaced by MHA, and the classification accuracy of comment data with different language styles is still deficient regardless of the expression mode and language habit specific to the comment data.
Kai Shuang et al., in the article "Interactive POS-aware network for aspect-level sentiment classification" published in Neurocomputing in 2020, disclosed an aspect-level sentiment polarity classification method. Considering that comment data sets contain a portion of "limited" sample data that can affect the final performance of the model, the article proposes an interactive POS-aware network (IPAN) that explicitly introduces part-of-speech (POS) information: it first distinguishes different POS type information through a POS filter gate, and then uses a POS-based attention mechanism to help the model focus on the lemmas carrying important sentiment orientations. The problem, however, is that the method only notices the influence of the parts of speech of the lemmas in a comment sentence on the sentiment polarity, while ignoring the influence that special grammatical relations possibly existing between the lemmas may have on the aspect-level sentiment polarity.
In summary, for fine-grained sentiment polarity classification applications, the existing methods ignore the expression modes and language styles specific to comment data, and do not effectively utilize linguistic knowledge (which is rich in meaning and, in this invention, specifically refers to part-of-speech tags and grammatical relations) to mine the aspect-level sentiment polarities contained in the comment data, resulting in low classification accuracy.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides an aspect-level emotion polarity classification method based on fused linguistic knowledge. The invention fully considers the uniqueness of comment data and integrates linguistic knowledge into a deep learning model, in order to solve the problem that many current aspect-level emotion polarity classification methods ignore the language styles and expression modes specific to comment data, cannot effectively use linguistic knowledge to mine the emotion polarities implied in the comment data, and therefore suffer from low classification accuracy.
The technical scheme adopted by the invention comprises the following steps:
(1) establishing a training sample set X_train and a test sample set X_test:
(1a) obtaining N comment sentences C = {c_n | 1 ≤ n ≤ N}, wherein c_n denotes the nth comment sentence, c_n = {w_m^n | 1 ≤ m ≤ M}, w_m^n denotes the mth lemma in c_n, N ≥ 1000, and 32 ≤ M ≤ 64;
(1b) performing IOB labeling on each lemma w_m^n of each comment sentence c_n to obtain the IOB tag sequence set R = {r_n | 1 ≤ n ≤ N} corresponding to C, wherein r_n denotes the IOB tag sequence corresponding to c_n, r_n = {γ_m | 1 ≤ m ≤ M, γ_m ∈ {O, I, B}}, γ_m denotes the IOB tag corresponding to the mth lemma, and O, I, and B respectively indicate that the lemma does not belong to an aspect word, belongs to an aspect word, and is the first lemma of an aspect word;
(1c) extracting the part-of-speech (POS) tag corresponding to each lemma w_m^n of each comment sentence c_n to obtain the part-of-speech tag sequence set P = {p_n | 1 ≤ n ≤ N} corresponding to C, and constructing the part-of-speech dependency graph set G_pos = {g_pos^n | 1 ≤ n ≤ N} according to P, wherein p_n denotes the part-of-speech tag sequence corresponding to c_n, p_n = {ρ_m | 1 ≤ m ≤ M}, ρ_m denotes the part-of-speech tag corresponding to w_m^n, ρ_m ∈ {noun, verb, conj, adj, adv, unk}, where noun, verb, conj, adj, adv, and unk respectively denote nouns, verbs, conjunctions, adjectives, adverbs, and all other parts of speech, and g_pos^n denotes the part-of-speech dependency graph corresponding to c_n;
(1d) extracting the syntax parse tree corresponding to each comment sentence c_n to obtain the syntax parse tree set T = {t_n | 1 ≤ n ≤ N} corresponding to C, and constructing the syntactic dependency graph set G_syn = {g_syn^n | 1 ≤ n ≤ N} according to T, wherein t_n and g_syn^n respectively denote the syntax parse tree and the syntactic dependency graph corresponding to c_n;
(1e) constructing the data set S = {s_n | 1 ≤ n ≤ N}, wherein s_n denotes the nth data item in the data set, combining c_n with its IOB tag sequence, part-of-speech dependency graph, and syntactic dependency graph; randomly selecting N1 data items together with their corresponding true emotion polarities to form the training sample set X_train = (S_train; Y_train), and combining the remaining N2 data items and their corresponding true emotion polarities into the test sample set X_test = (S_test; Y_test), wherein S_train denotes the training data set, S_train = {s^train_n1 | 1 ≤ n1 ≤ N1}, s^train_n1 denotes the n1-th training data item, Y_train denotes the set of true emotion polarities corresponding to S_train, Y_train = {y^train_n1 | 1 ≤ n1 ≤ N1}, y^train_n1 denotes the true emotion polarity corresponding to s^train_n1; S_test denotes the test data set, S_test = {s^test_n2 | 1 ≤ n2 ≤ N2}, s^test_n2 denotes the n2-th test data item, Y_test denotes the set of true emotion polarities corresponding to S_test, Y_test = {y^test_n2 | 1 ≤ n2 ≤ N2}, y^test_n2 denotes the true emotion polarity corresponding to s^test_n2, and N2 = N - N1;
(2) building an aspect-level emotion polarity classification model M based on fused linguistic knowledge:
building the aspect-level emotion polarity classification model M comprising a sequentially connected text feature embedding module M_embed, linguistic knowledge extraction module M_extract, and linguistic knowledge fusion module M_pool, wherein M_embed adopts the BERT network structure; M_extract comprises a graph neural network composed of two double-layer graph convolution branches arranged in parallel; and M_pool comprises a sequentially connected attention analyzer composed of several fully connected layers, a normalization layer, and a fully connected layer;
(3) performing iterative training on the aspect-level emotion polarity classification model M:
(3a) let the iteration number be t and the maximum iteration number be T ≥ 20, denote the aspect-level emotion polarity classification model fusing linguistic knowledge at the t-th iteration by M_t, and initialize t = 1 and M_t = M;
(3b) taking the training sample set X_train as the input of the aspect-level emotion polarity classification model M_t; the text feature embedding module M_embed^t performs text feature embedding on each lemma w_m^n1 of each comment sentence in S_train one by one to obtain the corresponding text feature sequence I_n1;
(3c) the two double-layer graph convolution branches in the linguistic knowledge extraction module M_extract^t respectively perform graph convolution on the text feature embedding sequence I_n1 based on its corresponding part-of-speech dependency graph g_pos^n1 and syntactic dependency graph g_syn^n1, obtaining the feature sequence F_pos^n1 fused with part-of-speech knowledge and the feature sequence F_syn^n1 fused with grammatical knowledge, respectively;
(3d) the attention analyzer in the linguistic knowledge fusion module M_pool^t performs significance evaluation on the feature sequence F_pos^n1 fusing part-of-speech knowledge and the feature sequence F_syn^n1 fusing grammatical knowledge extracted by M_extract^t, obtaining attention weights a_pos^n1 and a_syn^n1; the normalization layer normalizes a_pos^n1 and a_syn^n1 to obtain the final attention weights α_pos^n1 and α_syn^n1; F_pos^n1 and F_syn^n1 are weighted by α_pos^n1 and α_syn^n1 respectively and spliced to obtain the final feature sequence F^n1 fusing linguistic knowledge; finally, the fully connected layer classifies F^n1 to obtain the predicted emotion category ŷ^train_n1 of s^train_n1; the aspect-level emotion polarity classification result of S_train is then Ŷ_train = {ŷ^train_n1 | 1 ≤ n1 ≤ N1};
(3e) adopting the cross-entropy loss, computing the loss value L_t of the linguistic knowledge fusion module M_pool^t from Ŷ_train and Y_train; then computing the weight parameter gradient dω_t of M_pool^t from the loss value L_t through the back-propagation algorithm, and updating the weight parameters ω_t of M_pool^t through dω_t using stochastic gradient descent;
(3f) judging whether t ≥ T; if so, obtaining the trained aspect-level emotion polarity classification model M'; otherwise, letting t = t + 1 and returning to step (3b);
(4) obtaining the aspect-level emotion polarity classification results:
taking the test sample set X_test as the input of the trained aspect-level emotion polarity classification model M' and performing forward propagation to obtain the aspect-level emotion polarity classification result set Ŷ_test = {ŷ^test_n2 | 1 ≤ n2 ≤ N2} of X_test, wherein ŷ^test_n2 denotes the predicted aspect-level emotion polarity of the n2-th test sample.
Compared with the prior art, the invention has the following advantages:
Firstly, the invention analyzes the expression modes specific to comment data from a linguistic angle and fully mines the linguistic knowledge contained in the comment data, overcoming the defect that existing fine-grained sentiment analysis models cannot effectively adapt to the language style of comment data and therefore classify aspect-level emotion polarity poorly; this gives the method high practical value and improves the performance of text sentiment analysis systems.
Secondly, the invention performs graph convolution on word embeddings according to the part-of-speech dependency graph and the syntactic dependency graph respectively, and further fuses the text representations carrying part-of-speech knowledge and grammatical knowledge with an attention mechanism, thereby achieving a better fusion of linguistic knowledge.
Drawings
FIG. 1 is a flow chart of the present invention.
FIG. 2 is a schematic diagram of the linguistic knowledge extraction module of the present invention.
Detailed Description
The invention is described in further detail below with reference to the figures and embodiments.
Referring to fig. 1, the present invention includes the steps of:
step 1) establishing a training sample set XtrainAnd test sample set Xtest
Step 1a) obtaining N comment sentences C ═ { C ═ CnN is more than or equal to 1 and less than or equal to N, wherein cnThe n-th comment sentence is represented,
Figure BDA0003615120800000051
Figure BDA0003615120800000052
is shown by cnIn the mth lemma, N is more than or equal to 1000, and M is more than or equal to 32 and less than or equal to 64.
Step 1b) performing IOB labeling on each lemma w_m^n of each comment sentence c_n to obtain the IOB tag sequence set R = {r_n | 1 ≤ n ≤ N} corresponding to C, wherein r_n denotes the IOB tag sequence corresponding to c_n, r_n = {γ_m | 1 ≤ m ≤ M, γ_m ∈ {O, I, B}}, γ_m denotes the IOB tag corresponding to the mth lemma, and O, I, and B respectively indicate that the lemma does not belong to an aspect word, belongs to an aspect word, and is the first lemma of an aspect word.
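As a minimal illustration of the IOB scheme above (the function name and the example sentence are illustrative, not part of the patent), labeling a sentence whose aspect-word spans are known can be sketched as:

```python
def iob_label(lemmas, aspect_spans):
    """Assign IOB tags: B = first lemma of an aspect word, I = inside an
    aspect word, O = not part of any aspect word."""
    tags = ["O"] * len(lemmas)
    for start, end in aspect_spans:  # end is exclusive
        tags[start] = "B"
        for m in range(start + 1, end):
            tags[m] = "I"
    return tags

# "battery life" is the aspect word, occupying lemmas 1..2
tags = iob_label(["the", "battery", "life", "is", "great"], [(1, 3)])
```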
Step 1c) extracting the part-of-speech (POS) tag corresponding to each lemma w_m^n of each comment sentence c_n to obtain the part-of-speech tag sequence set P = {p_n | 1 ≤ n ≤ N} corresponding to C, and constructing the part-of-speech dependency graph set G_pos = {g_pos^n | 1 ≤ n ≤ N} according to P, wherein p_n denotes the part-of-speech tag sequence corresponding to c_n, p_n = {ρ_m | 1 ≤ m ≤ M}, ρ_m denotes the part-of-speech tag corresponding to w_m^n, ρ_m ∈ {noun, verb, conj, adj, adv, unk}, where noun, verb, conj, adj, adv, and unk respectively denote nouns, verbs, conjunctions, adjectives, adverbs, and all other parts of speech, and g_pos^n denotes the part-of-speech dependency graph corresponding to c_n.
g_pos^n is a matrix of size M × M whose entries are determined by the part-of-speech tags of the lemmas of comment sentence c_n: if the part-of-speech tag of lemma w_m1^n is noun and the part-of-speech tag of lemma w_m2^n is noun, verb, conj, or adj; or the tag of w_m1^n is verb and the tag of w_m2^n is noun, verb, conj, or adv; or the tag of w_m1^n is conj and the tag of w_m2^n is noun, verb, conj, adj, or adv; or the tag of w_m1^n is adj and the tag of w_m2^n is noun, conj, adj, or adv; or the tag of w_m1^n is adv and the tag of w_m2^n is verb, conj, adj, or adv, then the entry of g_pos^n in row m1, column m2 is 1. If the part-of-speech tag of lemma w_m1^n is unk, then regardless of the part-of-speech tag of lemma w_m2^n, the entries of g_pos^n in row m1, column m2 and in row m2, column m1 are both 0.
Step 1d) extracting the syntax parse tree corresponding to each comment sentence c_n to obtain the syntax parse tree set T = {t_n | 1 ≤ n ≤ N} corresponding to C, and constructing the syntactic dependency graph set G_syn = {g_syn^n | 1 ≤ n ≤ N} according to T, wherein t_n and g_syn^n respectively denote the syntax parse tree and the syntactic dependency graph corresponding to c_n. g_syn^n is a matrix of size M × M built from the syntax parse tree t_n of comment sentence c_n: if lemma w_m1^n and lemma w_m2^n are connected by a direct edge in t_n, the entries of g_syn^n in row m1, column m2 and in row m2, column m1 are 1; otherwise they are 0.
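Given the direct edges of a parse tree as index pairs, the symmetric adjacency matrix g_syn can be sketched as follows (a hypothetical helper; the patent does not prescribe a particular parser, only the resulting matrix):

```python
def syn_dependency_graph(M, edges):
    """Symmetric M x M adjacency matrix from the direct edges of a syntax
    parse tree. edges: iterable of (m1, m2) index pairs of directly
    connected lemmas."""
    g = [[0] * M for _ in range(M)]
    for m1, m2 in edges:
        g[m1][m2] = 1
        g[m2][m1] = 1
    return g

# Hypothetical 3-lemma sentence whose root (index 2) governs lemmas 0 and 1.
g_syn = syn_dependency_graph(3, [(2, 0), (2, 1)])
```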
Step 1e) constructing the data set S = {s_n | 1 ≤ n ≤ N}, wherein s_n denotes the nth data item in the data set, combining c_n with its IOB tag sequence, part-of-speech dependency graph, and syntactic dependency graph; randomly selecting N1 data items together with their corresponding true emotion polarities to form the training sample set X_train = (S_train; Y_train), and combining the remaining N2 data items and their corresponding true emotion polarities into the test sample set X_test = (S_test; Y_test), wherein S_train denotes the training data set, S_train = {s^train_n1 | 1 ≤ n1 ≤ N1}, s^train_n1 denotes the n1-th training data item, Y_train denotes the set of true emotion polarities corresponding to S_train, Y_train = {y^train_n1 | 1 ≤ n1 ≤ N1}, y^train_n1 denotes the true emotion polarity corresponding to s^train_n1; S_test denotes the test data set, S_test = {s^test_n2 | 1 ≤ n2 ≤ N2}, s^test_n2 denotes the n2-th test data item, Y_test denotes the set of true emotion polarities corresponding to S_test, Y_test = {y^test_n2 | 1 ≤ n2 ≤ N2}, y^test_n2 denotes the true emotion polarity corresponding to s^test_n2, and N2 = N - N1.
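The random N1/N2 split of step 1e) can be sketched as follows (function name and seed are illustrative):

```python
import random

def split_dataset(samples, labels, n_train, seed=0):
    """Randomly select N1 = n_train items for training; the remaining
    N2 = N - N1 items form the test set."""
    idx = list(range(len(samples)))
    random.Random(seed).shuffle(idx)
    pick = lambda ids: ([samples[i] for i in ids], [labels[i] for i in ids])
    return pick(idx[:n_train]), pick(idx[n_train:])

(train_s, train_y), (test_s, test_y) = split_dataset(
    list(range(10)), [i % 3 for i in range(10)], n_train=7)
```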
Step 2) building an aspect-level emotion polarity classification model M based on fused linguistic knowledge:
building the aspect-level emotion polarity classification model M comprising a sequentially connected text feature embedding module M_embed, linguistic knowledge extraction module M_extract, and linguistic knowledge fusion module M_pool, wherein M_embed adopts the BERT network structure; M_extract comprises a graph neural network composed of two double-layer graph convolution branches arranged in parallel; and M_pool comprises a sequentially connected attention analyzer composed of several fully connected layers, a normalization layer, and a fully connected layer.
text feature embedding module MembedFormed by connecting in order 12 encoders that the structure is the same, the concrete structure of every encoder is: multi-headed self-attention → layer regularization layer → forward propagation layer → layer regularization layer.
The two double-layer graph convolution branches in the linguistic knowledge extraction module M_extract have identical structure; the specific structure of each double-layer branch is: first graph convolution layer → random dropout layer → second graph convolution layer. The formula for the graph convolution is:

h_i^l = σ( Σ_{j=1}^{M} g_ij · W^l · h_j^(l-1) )

wherein h_i^l denotes the hidden state corresponding to the ith lemma at layer l, σ denotes an activation function, W^l denotes the parameter matrix of layer l, g_ij ∈ {0, 1} denotes the value connecting the ith and jth lemmas in the graph g, and g ∈ {g_pos, g_syn}.
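A minimal pure-Python sketch of the graph convolution above, with ReLU standing in for the unspecified activation σ (aggregating neighbours before the linear map is equivalent by linearity):

```python
def relu(x):
    return x if x > 0 else 0.0

def graph_conv(H, W, g):
    """One graph convolution layer: h_i = relu( sum_j g[i][j] * W @ h_j ).
    H: M x d_in hidden states, W: d_in x d_out weights, g: M x M adjacency."""
    M, d_in, d_out = len(H), len(W), len(W[0])
    out = []
    for i in range(M):
        # aggregate the hidden states of lemma i's neighbours in g
        agg = [0.0] * d_in
        for j in range(M):
            if g[i][j]:
                for k in range(d_in):
                    agg[k] += H[j][k]
        # linear transform followed by the activation
        out.append([relu(sum(agg[k] * W[k][o] for k in range(d_in)))
                    for o in range(d_out)])
    return out
```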
The attention analyzer included in the linguistic knowledge fusion module M_pool comprises 2 fully connected layers.
Step 3) performing iterative training on the aspect-level emotion polarity classification model M:
Step 3a) let the iteration number be t and the maximum iteration number be T ≥ 20, denote the aspect-level emotion polarity classification model fusing linguistic knowledge at the t-th iteration by M_t, and initialize t = 1 and M_t = M.
Step 3b) taking the training sample set X_train as the input of the aspect-level emotion polarity classification model M_t; the text feature embedding module M_embed^t performs text feature embedding on each lemma w_m^n1 of each comment sentence in S_train one by one to obtain the corresponding text feature sequence I_n1.
Step 3c) the two double-layer graph convolution branches in the linguistic knowledge extraction module M_extract^t respectively perform graph convolution on the text feature embedding sequence I_n1 based on its corresponding part-of-speech dependency graph g_pos^n1 and syntactic dependency graph g_syn^n1, obtaining the feature sequence F_pos^n1 fused with part-of-speech knowledge and the feature sequence F_syn^n1 fused with grammatical knowledge, respectively.
Step 3d) the attention analyzer in the linguistic knowledge fusion module M_pool^t performs significance evaluation on the feature sequence F_pos^n1 fusing part-of-speech knowledge and the feature sequence F_syn^n1 fusing grammatical knowledge extracted by M_extract^t, obtaining attention weights a_pos^n1 and a_syn^n1; the normalization layer normalizes a_pos^n1 and a_syn^n1 to obtain the final attention weights α_pos^n1 and α_syn^n1; F_pos^n1 and F_syn^n1 are weighted by α_pos^n1 and α_syn^n1 respectively and spliced to obtain the final feature sequence F^n1 fusing linguistic knowledge; finally, the fully connected layer classifies F^n1 to obtain the predicted emotion category ŷ^train_n1 of s^train_n1; the aspect-level emotion polarity classification result of S_train is then Ŷ_train = {ŷ^train_n1 | 1 ≤ n1 ≤ N1}.
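A simplified sketch of the fusion in step 3d), with each branch's significance reduced to a single dot-product scoring vector (v_pos and v_syn are hypothetical stand-ins for the attention analyzer's fully connected layers) and softmax as the normalization layer:

```python
import math

def fuse_with_attention(F_pos, F_syn, v_pos, v_syn):
    """Score both branch features per lemma, softmax-normalize the two
    scores, then weight and splice (concatenate) the branches."""
    fused = []
    for h_pos, h_syn in zip(F_pos, F_syn):
        s_pos = sum(a * b for a, b in zip(v_pos, h_pos))  # significance of POS branch
        s_syn = sum(a * b for a, b in zip(v_syn, h_syn))  # significance of syntax branch
        e_pos, e_syn = math.exp(s_pos), math.exp(s_syn)   # normalization layer (softmax)
        a_pos, a_syn = e_pos / (e_pos + e_syn), e_syn / (e_pos + e_syn)
        fused.append([a_pos * x for x in h_pos] + [a_syn * x for x in h_syn])
    return fused

# With zero scoring vectors both branches get weight 0.5
fused = fuse_with_attention([[1.0, 0.0]], [[0.0, 1.0]], [0.0, 0.0], [0.0, 0.0])
```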
Step 3e) adopting the cross-entropy loss, whose calculation formula is:

L = -Σ_{i=1}^{C} y_i · log(ŷ_i) + λ‖Θ‖²

wherein L denotes the cross-entropy loss function, C = 3 denotes the number of emotion polarity classes, y_i indicates whether the sample truly belongs to the ith class, ŷ_i denotes the predicted probability that the sample belongs to the ith class, and λ‖Θ‖² denotes a regularization term. The loss value L_t of the linguistic knowledge fusion module M_pool^t is computed from Ŷ_train and Y_train; the weight parameter gradient dω_t of M_pool^t is then computed from the loss value L_t through the back-propagation algorithm, and the weight parameters ω_t of M_pool^t are updated through dω_t using stochastic gradient descent.
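The loss formula above can be sketched directly (λ and the parameter vector are illustrative):

```python
import math

def cross_entropy_l2(y_true, y_pred, params, lam):
    """L = -sum_i y_i * log(yhat_i) + lambda * ||Theta||^2 over C = 3 classes."""
    ce = -sum(t * math.log(p) for t, p in zip(y_true, y_pred))
    return ce + lam * sum(w * w for w in params)

# One-hot true label (class 2 of 3), predicted probabilities, no regularization
loss = cross_entropy_l2([0, 1, 0], [0.2, 0.7, 0.1], params=[0.5, -0.5], lam=0.0)
```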
Step 3f) judging whether t ≥ T; if so, obtaining the trained aspect-level emotion polarity classification model M'; otherwise, letting t = t + 1 and returning to step 3b).
Step 4) obtaining the aspect-level emotion polarity classification results:
taking the test sample set X_test as the input of the trained aspect-level emotion polarity classification model M' and performing forward propagation to obtain the aspect-level emotion polarity classification result set Ŷ_test = {ŷ^test_n2 | 1 ≤ n2 ≤ N2} of X_test, wherein ŷ^test_n2 denotes the predicted aspect-level emotion polarity of the n2-th test sample.

Claims (4)

1. An aspect-level sentiment polarity classification method based on fused linguistic knowledge, characterized by comprising the following steps:
(1) establishing a training sample set X_train and a test sample set X_test:
(1a) obtaining N comment sentences C = {c_n | 1 ≤ n ≤ N}, where c_n denotes the n-th comment sentence, c_n = {w_m^n | 1 ≤ m ≤ M}, w_m^n denotes the m-th lemma in c_n, N ≥ 1000, and 32 ≤ M ≤ 64;
(1b) performing IOB labeling on each lemma w_m^n of each comment sentence c_n to obtain the IOB tag sequence set R = {r_n | 1 ≤ n ≤ N} corresponding to C, where r_n denotes the IOB tag sequence corresponding to c_n, r_n = {γ_m | 1 ≤ m ≤ M, γ_m ∈ {O, I, B}}, γ_m denotes the IOB tag corresponding to the m-th lemma, and O, I, B respectively indicate that the lemma does not belong to an aspect word, belongs to an aspect word, and is the first lemma of an aspect word;
(1c) extracting the part-of-speech tag corresponding to each lemma w_m^n of each comment sentence c_n to obtain the part-of-speech tag sequence set P = {p_n | 1 ≤ n ≤ N} corresponding to C, and constructing the part-of-speech dependency graph set G_pos = {g_n^pos | 1 ≤ n ≤ N} according to P, where p_n denotes the part-of-speech tag sequence corresponding to c_n, p_n = {ρ_m^n | 1 ≤ m ≤ M}, ρ_m^n denotes the part-of-speech tag corresponding to w_m^n, ρ_m^n ∈ {noun, verb, conj, adj, adv, unk}, in which noun, verb, conj, adj, adv and unk respectively denote nouns, verbs, conjunctions, adjectives, adverbs and other parts of speech, and g_n^pos denotes the part-of-speech dependency graph corresponding to c_n;
(1d) extracting the syntactic parse tree corresponding to each comment sentence c_n to obtain the syntactic parse tree set T = {t_n | 1 ≤ n ≤ N} corresponding to C, and constructing the syntactic dependency graph set G_syn = {g_n^syn | 1 ≤ n ≤ N} according to T, where t_n and g_n^syn respectively denote the syntactic parse tree and the syntactic dependency graph corresponding to c_n;
(1e) constructing a data set S = {s_n | 1 ≤ n ≤ N}, randomly selecting N1 data items together with their corresponding true sentiment polarities to form the training sample set X_train = (S_train; Y_train), and combining the remaining N2 data items together with their corresponding true sentiment polarities into the test sample set X_test = (S_test; Y_test), where s_n denotes the n-th data item in the data set, S_train denotes the training data set, S_train = {s_{n1}^train | 1 ≤ n1 ≤ N1}, s_{n1}^train denotes the n1-th training data item, Y_train denotes the true sentiment polarity set corresponding to S_train, Y_train = {y_{n1}^train | 1 ≤ n1 ≤ N1}, y_{n1}^train denotes the true sentiment polarity corresponding to s_{n1}^train, S_test denotes the test data set, S_test = {s_{n2}^test | 1 ≤ n2 ≤ N2}, s_{n2}^test denotes the n2-th test data item, Y_test denotes the true sentiment polarity set corresponding to S_test, Y_test = {y_{n2}^test | 1 ≤ n2 ≤ N2}, y_{n2}^test denotes the true sentiment polarity corresponding to s_{n2}^test, and N2 = N - N1;
(2) building an aspect-level sentiment polarity classification model M based on fused linguistic knowledge:
building an aspect-level sentiment polarity classification model M comprising a sequentially connected text feature embedding module M_embed, linguistic knowledge extraction module M_extract and linguistic knowledge fusion module M_pool, where M_embed adopts a BERT network structure; M_extract comprises a graph neural network composed of two double-layer graph convolution branches arranged in parallel; M_pool comprises a sequentially connected attention analyzer composed of a plurality of fully connected layers, a normalization layer and a fully connected layer;
(3) performing iterative training on the aspect-level sentiment polarity classification model M:
(3a) letting the iteration number be t and the maximum iteration number be T with T ≥ 20, letting the t-th aspect-level sentiment polarity classification model fusing linguistic knowledge be M^t, and initializing t = 1 and M^t = M;
(3b) taking the training sample set X_train as the input of the aspect-level sentiment polarity classification model M^t; the text feature embedding module M_embed^t performs text feature embedding on each lemma w_m^n of each comment sentence s_{n1}^train in S_train one by one to obtain the corresponding text feature sequence I_{n1};
(3c) the double-layer graph convolution branches in the linguistic knowledge extraction module M_extract^t respectively perform graph convolution on the text feature embedding sequence I_{n1} based on its corresponding part-of-speech dependency graph g_{n1}^pos and syntactic dependency graph g_{n1}^syn, respectively obtaining the feature sequence h_{n1}^pos fused with part-of-speech knowledge and the feature sequence h_{n1}^syn fused with syntactic knowledge;
(3d) the attention analyzer of the linguistic knowledge fusion module M_pool^t performs significance evaluation on the feature sequence h_{n1}^pos fused with part-of-speech knowledge and the feature sequence h_{n1}^syn fused with syntactic knowledge extracted by M_extract^t to obtain the attention weights a_{n1}^pos and a_{n1}^syn; the normalization layer normalizes a_{n1}^pos and a_{n1}^syn to obtain the final attention weights α_{n1}^pos and α_{n1}^syn; h_{n1}^pos and h_{n1}^syn are respectively weighted by α_{n1}^pos and α_{n1}^syn and spliced to obtain the final feature sequence h_{n1}^fuse fused with linguistic knowledge; finally, the fully connected layer classifies h_{n1}^fuse to obtain the predicted sentiment polarity ŷ_{n1} of s_{n1}^train; the aspect-level sentiment polarity classification result of S_train is then Ŷ_train = {ŷ_{n1} | 1 ≤ n1 ≤ N1};
(3e) using the cross-entropy loss, computing the loss value L_t of the aspect-level sentiment polarity classification model M^t fusing linguistic knowledge from Ŷ_train and Y_train; computing the gradient dω_t of the weight parameters of M^t from the loss value L_t by the back-propagation algorithm, and then updating the weight parameters ω_t of M^t through dω_t using the stochastic gradient descent method;
(3f) judging whether t ≥ T holds; if so, obtaining the trained aspect-level sentiment polarity classification model M'; otherwise, letting t = t + 1 and returning to step (3b);
(4) obtaining the aspect-level sentiment polarity classification results:
taking the test sample set X_test as the input of the trained aspect-level sentiment polarity classification model M' for forward propagation to obtain the aspect-level sentiment polarity classification result set of X_test, Ŷ_test = {ŷ_{n2}^test | 1 ≤ n2 ≤ N2}, where ŷ_{n2}^test denotes the aspect-level sentiment polarity of the n2-th test data item.
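The IOB labeling of step (1b) can be illustrated with a small sketch. The helper name `iob_tags` and the exact-subsequence search for the aspect word are assumptions made for the example; the claim itself only defines the meaning of the O, I and B tags.

```python
def iob_tags(tokens, aspect):
    """B: first lemma of the aspect word; I: inside the aspect word; O: outside.

    `tokens` is the lemma sequence of a comment sentence, `aspect` the lemma
    sequence of one aspect word; matching by exact subsequence is assumed.
    """
    tags = ["O"] * len(tokens)
    n = len(aspect)
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == aspect:
            tags[i] = "B"                 # first lemma of the aspect word
            for j in range(i + 1, i + n):
                tags[j] = "I"             # remaining lemmas of the aspect word
    return tags
```

For example, `iob_tags(["the", "battery", "life", "is", "great"], ["battery", "life"])` yields `["O", "B", "I", "O", "O"]`.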
2. The method for aspect-level sentiment polarity classification based on fused linguistic knowledge according to claim 1, wherein the part-of-speech dependency graph g_n^pos corresponding to the comment sentence c_n in step (1c) satisfies:
g_n^pos is a matrix of size M × M;
if, for lemmas w_{m1}^n and w_{m2}^n of the comment sentence c_n, the part-of-speech tag of w_{m1}^n is noun and the part-of-speech tag of w_{m2}^n is noun, verb, conj or adj; or the part-of-speech tag of w_{m1}^n is verb and the part-of-speech tag of w_{m2}^n is noun, verb, conj or adv; or the part-of-speech tag of w_{m1}^n is conj and the part-of-speech tag of w_{m2}^n is noun, verb, conj, adj or adv; or the part-of-speech tag of w_{m1}^n is adj and the part-of-speech tag of w_{m2}^n is noun, conj, adj or adv; or the part-of-speech tag of w_{m1}^n is adv and the part-of-speech tag of w_{m2}^n is verb, conj, adj or adv, then the element value of g_n^pos in row m1, column m2 is 1;
if the part-of-speech tag of lemma w_{m1}^n of the comment sentence c_n is unk, then regardless of the part-of-speech tag of lemma w_{m2}^n, the element values of g_n^pos in row m1, column m2 and in row m2, column m1 are both 1.
3. The method for aspect-level sentiment polarity classification based on fused linguistic knowledge according to claim 1, wherein the syntactic dependency graph g_n^syn corresponding to the comment sentence c_n in step (1d) satisfies:
g_n^syn is a matrix of size M × M;
according to the syntactic parse tree t_n corresponding to the comment sentence c_n, if there is a direct edge between lemma w_{m1}^n and lemma w_{m2}^n of c_n in t_n, the element values of g_n^syn in row m1, column m2 and in row m2, column m1 are 1, and otherwise 0.
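Claim 3's construction of the syntactic dependency graph from parse-tree edges can be sketched directly. The function name `syn_graph` and the edge-list input format are assumptions for the example.

```python
import numpy as np

def syn_graph(num_tokens, edges):
    """Build the M x M syntactic dependency matrix.

    `edges` lists pairs (m1, m2) of lemma indices that are directly
    connected in the syntactic parse tree; the matrix is symmetric.
    """
    g = np.zeros((num_tokens, num_tokens), dtype=int)
    for m1, m2 in edges:
        g[m1, m2] = g[m2, m1] = 1   # both entries set, all other entries stay 0
    return g
```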
4. The method for aspect-level sentiment polarity classification based on fused linguistic knowledge according to claim 1, wherein the aspect-level sentiment polarity classification model M based on fused linguistic knowledge in step (2) satisfies:
the text feature embedding module M_embed is formed by sequentially connecting 12 encoders with the same structure, and the specific structure of each encoder is: multi-head attention layer → layer normalization layer → feed-forward layer → layer normalization layer;
the two double-layer graph convolution branches of the linguistic knowledge extraction module M_extract have the same structure, and the specific structure of each double-layer graph convolution branch is: first graph convolution layer → random inactivation (dropout) layer → second graph convolution layer; the formula of the graph convolution is:

h_i^{l+1} = σ( Σ_{j=1}^{M} g_ij · W^l · h_j^l )

where h_i^l denotes the hidden-layer state corresponding to the i-th lemma at layer l, σ denotes an activation function, W^l denotes the parameter matrix of layer l, g_ij ∈ {0, 1} denotes the value corresponding to the i-th and j-th lemmas in the graph g, and g ∈ {g^pos, g^syn};
the attention analyzer of the linguistic knowledge fusion module M_pool comprises 2 fully connected layers.
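The graph convolution h_i^{l+1} = σ(Σ_j g_ij · W^l · h_j^l) can be written in vectorized form. The choice of tanh as the activation σ and the row-vector convention (features as rows, so W multiplies on the right) are assumptions for the sketch.

```python
import numpy as np

def graph_conv(h, g, W, sigma=np.tanh):
    """One graph convolution layer: h^{l+1} = sigma(g @ h^l @ W^l).

    h: (M, d) hidden states, one row per lemma
    g: (M, M) dependency matrix with entries g_ij in {0, 1}
    W: (d, d') layer parameter matrix
    """
    # g @ h computes sum_j g_ij * h_j for every lemma i; W is then applied
    return sigma((g @ h) @ W)
```

Stacking two such calls with a dropout step in between matches the double-layer branch structure described above.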
CN202210465093.XA 2022-04-25 2022-04-25 Aspect-level emotion polarity classification method based on fusion linguistic knowledge Pending CN114764564A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210465093.XA CN114764564A (en) 2022-04-25 2022-04-25 Aspect-level emotion polarity classification method based on fusion linguistic knowledge

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210465093.XA CN114764564A (en) 2022-04-25 2022-04-25 Aspect-level emotion polarity classification method based on fusion linguistic knowledge

Publications (1)

Publication Number Publication Date
CN114764564A true CN114764564A (en) 2022-07-19

Family

ID=82364643

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210465093.XA Pending CN114764564A (en) 2022-04-25 2022-04-25 Aspect-level emotion polarity classification method based on fusion linguistic knowledge

Country Status (1)

Country Link
CN (1) CN114764564A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116562305A (en) * 2023-07-10 2023-08-08 江西财经大学 Aspect emotion four-tuple prediction method and system
CN116562305B (en) * 2023-07-10 2023-09-12 江西财经大学 Aspect emotion four-tuple prediction method and system


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination