CN112883167A - Text emotion classification model based on a hierarchical self-attention capsule network - Google Patents
Text emotion classification model based on a hierarchical self-attention capsule network
- Publication number
- CN112883167A CN112883167A CN202110292481.8A CN202110292481A CN112883167A CN 112883167 A CN112883167 A CN 112883167A CN 202110292481 A CN202110292481 A CN 202110292481A CN 112883167 A CN112883167 A CN 112883167A
- Authority
- CN
- China
- Prior art keywords
- network
- attention
- text
- hierarchical
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Abstract
To address the shortcomings of the capsule network and the hierarchical attention network, a hybrid neural network based on a hierarchical self-attention network and a capsule network is proposed. The invention aims to integrate the advantages of the capsule network, the hierarchical model, and the attention mechanism: a self-attention mechanism is introduced into the hierarchical attention network to enhance the model's feature-expression capability and reduce its dependence on external parameters, so that the model can learn more of the text's key features. The method uses a hierarchical self-attention network to model the emotion of a text at two levels, words and sentences, while selectively attending to important words and sentences in the text. The capsule network encodes the spatial relations between the parts of the text and the whole, extracting richer textual emotional features, and the advantages of the two are fused to improve classification performance.
Description
Technical Field
The invention belongs to the field of natural language processing and is applied to sentiment analysis tasks.
Background
First, definitions of terms: 1. Text sentiment analysis (Sentiment Analysis): the process of analyzing, processing, and extracting subjective texts with emotional coloring by means of natural language processing and text mining techniques.
2. Capsule networks (CapsNet): in 2017, Hinton's group proposed the capsule network, extending scalar-valued neurons to vectors for the first time and using a dynamic routing algorithm to perform the routing computations between capsules. It was first used in the image processing field and later applied to natural language processing tasks such as text emotion classification, retaining more textual semantic information than a convolutional neural network.
3. Bidirectional GRU network (Bidirectional Gated Recurrent Unit): to address the vanishing-gradient problem of the standard RNN (recurrent neural network), the GRU uses an update gate and a reset gate. These two gating mechanisms can preserve information across long sequences, neither clearing it over time nor discarding it merely because it appears irrelevant to the current prediction.
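The gating computation described above can be sketched as follows. This is an illustrative scalar-state example with made-up weights, not the invention's implementation (a real GRU layer uses weight matrices over vector states):

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def gru_step(h_prev, x, w):
    """One GRU step for a scalar state and scalar input (illustration only)."""
    z = sigmoid(w["wz"] * x + w["uz"] * h_prev)               # update gate
    r = sigmoid(w["wr"] * x + w["ur"] * h_prev)               # reset gate
    h_cand = math.tanh(w["wh"] * x + w["uh"] * (r * h_prev))  # candidate state
    # The update gate interpolates between keeping the old state and
    # adopting the candidate, which is what lets information persist.
    return (1.0 - z) * h_prev + z * h_cand

w = dict(wz=0.5, uz=0.5, wr=0.5, ur=0.5, wh=1.0, uh=1.0)  # hypothetical weights
h = 0.0
for x in [1.0, -1.0, 0.5]:
    h = gru_step(h, x, w)
```

A bidirectional GRU simply runs one such recurrence left-to-right and another right-to-left and concatenates the two hidden states.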
4. Self-attention mechanism: derived from studies of human vision. In cognitive science, owing to a bottleneck in information processing, humans selectively attend to part of the available information while ignoring the rest; this idea was later applied to image processing and natural language processing with good results. The purpose of introducing a self-attention mechanism is to better focus on the words that are important for emotion classification.
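As a concrete illustration of selective attention over a sequence, the following is a minimal dot-product self-attention sketch in which queries, keys, and values are all the input vectors themselves (no learned projections); it is not the patent's exact formulation:

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def self_attention(seq):
    """Scaled dot-product self-attention over a list of vectors (Q = K = V = seq)."""
    d = len(seq[0])
    out = []
    for q in seq:
        # Similarity of this position to every position, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in seq]
        weights = softmax(scores)  # contribution degree of each position
        # Each output is a weighted average of all positions.
        out.append([sum(w * v[j] for w, v in zip(weights, seq)) for j in range(d)])
    return out

seq = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy word vectors
ctx = self_attention(seq)
```

Each output vector is a convex combination of the inputs, weighted by how strongly the model attends to each position.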
Secondly, the prior art: 1. (1) RNN-based text emotion classification: Socher et al. use a tree-structured LSTM (long short-term memory) network to improve semantic representation; its memory cells can hold associations between instances and thereby capture relations between words. Cho et al. propose the GRU unit, which has fewer parameters and trains faster than LSTM models while still capturing global semantic features. (2) Sentiment analysis combining hierarchical neural networks with an attention mechanism: Tang et al. build a hierarchical model from convolutional neural networks and LSTM to obtain document-level feature representations from both the word and sentence perspectives. Yang et al. combine an attention mechanism with a hierarchical model, using attention to focus on keywords within sentences and key sentences within documents. (3) Capsule networks: in 2011, Geoffrey Hinton et al. first introduced the capsule idea in the paper "Transforming Auto-encoders". In 2018, Zhao et al. were the first to apply the capsule network to a text classification task, and its classification performance exceeded that of common neural network models on several data sets. In the same year, Kim et al. proposed a capsule network with a static routing mechanism tailored to the properties of text, improving classification performance while effectively reducing computational complexity.
2. The CapsNet model: the model applies the capsule network to the text classification task and comprises four layers. Input layer: feeds the text into the network as word vectors. Convolution layer: uses a convolutional gating module, an idea borrowed from the gating mechanisms of recurrent neural networks (LSTM and GRU), to filter out noise and better extract the local features of the text. Convolutional capsule layer: because text is highly variable, this layer replaces the traditional dynamic routing algorithm with a static routing algorithm to obtain the global semantic features of the text. Classification capsule layer: outputs the predicted label of the text.
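The routing between capsule layers mentioned above can be sketched as follows. This is a minimal pure-Python version of Sabour-style dynamic routing by agreement, not the patent's static-routing variant; the prediction vectors `u_hat` are made-up toy values:

```python
import math

def squash(v):
    """Shrink a vector so its length lies in (0, 1) while keeping its direction."""
    n2 = sum(x * x for x in v)
    scale = n2 / (1.0 + n2) / math.sqrt(n2 + 1e-9)
    return [scale * x for x in v]

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def dynamic_routing(u_hat, n_iter=3):
    """u_hat[i][j]: prediction vector from input capsule i to output capsule j."""
    n_in, n_out = len(u_hat), len(u_hat[0])
    d = len(u_hat[0][0])
    b = [[0.0] * n_out for _ in range(n_in)]          # routing logits
    v = []
    for _ in range(n_iter):
        c = [softmax(row) for row in b]               # coupling coefficients
        s = [[sum(c[i][j] * u_hat[i][j][k] for i in range(n_in))
              for k in range(d)] for j in range(n_out)]
        v = [squash(sj) for sj in s]                  # output capsules
        # Increase the logit when a prediction agrees with the output.
        for i in range(n_in):
            for j in range(n_out):
                b[i][j] += sum(u_hat[i][j][k] * v[j][k] for k in range(d))
    return v

u_hat = [[[1.0, 0.0], [0.0, 1.0]],
         [[0.9, 0.1], [0.1, 0.9]]]
v = dynamic_routing(u_hat)
```

The squash nonlinearity guarantees every output capsule has length below 1, so the length can be read directly as a probability-like score.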
3. Hierarchical attention model: built on a recurrent neural network combined with attention to form a hierarchical model. It models the text at two levels, words and sentences, uses a recurrent neural network to extract textual features, attends to important words and sentences respectively, and finally classifies the text.
Thirdly, the technical problem: 1. Although the capsule network remedies some shortcomings of the traditional convolutional neural network, it still essentially extracts local text features through convolution; it can neither selectively attend to important words in the text nor encode long-distance dependencies, so it is severely limited when recognizing text containing semantic reversals. In the hierarchical attention model, the attention mechanism requires many external parameters, and the model cannot focus sufficiently on the internal sequential relations of the text.
2. To address these shortcomings of the capsule network and the hierarchical attention network, the invention integrates the advantages of the capsule network, the hierarchical model, and the attention mechanism, introducing self-attention into the hierarchical attention network to reduce the dependence on external parameters and let the model learn more of the text's key features. A hierarchical self-attention network learns the global semantic features of the text and attends to its important words and sentences, while a capsule network learns the spatial relations between the parts of the text and the whole, effectively improving the model's classification performance.
Disclosure of Invention
1. To address the shortcomings of the capsule network and the hierarchical attention network, the invention integrates the advantages of the capsule network, the hierarchical model, and the attention mechanism, introducing self-attention into the hierarchical attention network to reduce the dependence on external parameters and let the model learn more of the text's key features. A hierarchical self-attention network learns the global semantic features of the text and attends to its important words and sentences, while a capsule network learns the spatial relations between the parts of the text and the whole, effectively improving the model's classification performance.
2. The technical innovations of the invention are as follows: (1) a hybrid neural network based on a hierarchical self-attention network and a capsule network is designed; the capsule network encodes the relation between the parts of the text and the whole to extract richer semantic feature information, while the hierarchical self-attention network extracts context information at the word and sentence levels, so fusing the advantages of the two improves classification performance; (2) self-attention replaces traditional attention in the hierarchical model, reducing the dependence on external parameters, capturing the internal dependency relations of the text, attending to important words and sentences, and enhancing the model's feature-expression capability.
drawings
FIG. 1 is a diagram of a hierarchical self-attention capsule network model architecture.
Detailed Description
The attached drawing is the model structure diagram of the invention, which mainly comprises two modules. The hierarchical network module consists of a bidirectional GRU network and a self-attention-based hierarchical network; it uses the BiGRU to learn text context information and self-attention to capture important word and sentence features. The capsule network module consists of a convolutional capsule layer and a classification capsule layer; based on the text representation output by the hierarchical network module, it encodes textual semantics and structural information, learns the feature associations between the parts of the text and the whole, and finally performs classification. According to the function of each layer in the hierarchical network module, the model divides the module into five layers: a word embedding layer, a word-level coding layer, a word-level self-attention layer, a sentence-level coding layer, and a sentence-level self-attention layer. Word embedding layer: performs word-embedding mapping on the text to obtain continuous low-dimensional real-valued vectors representing the semantics of words. The model uses GloVe pre-trained word vectors, maps the words of a text into a 300-dimensional word-vector matrix, and builds an initial feature matrix representing the text as the model input X. Word-level coding layer: takes the words X as the feature-extraction objects and uses the BiGRU to collect features over the text, obtaining the global semantic information h_it. Word-level self-attention layer: lets the model attend to the important information in the text; self-attention assigns each word a different weight representing its contribution degree, finally yielding the sentence sequence representation S_i.
Sentence-level coding layer: takes the sentences S_i as the feature-extraction objects and uses the BiGRU to collect features over the text, obtaining the hidden sentence feature h_i. Sentence-level self-attention layer: assigns each sentence a different contribution weight measuring its importance to the text, obtaining the final document representation V. The capsule network module feeds the representation V into the convolutional capsule layer, learns the spatial relations between the parts of the text and the whole through a dynamic routing algorithm, and obtains the high-level feature representation V_out. Finally, the classification capsule layer performs normalization to complete the text emotion classification task.
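The final classification step can be sketched as follows: in a capsule classifier, the length of each classification capsule serves as the score of its class. This is an illustrative sketch with made-up capsule values and label names, not the patent's implementation:

```python
import math

def capsule_lengths(capsules):
    """Euclidean length of each classification capsule = class activation score."""
    return [math.sqrt(sum(x * x for x in v)) for v in capsules]

def predict(capsules, labels):
    """Pick the label whose capsule is longest."""
    lengths = capsule_lengths(capsules)
    return labels[lengths.index(max(lengths))]

# Hypothetical 2-D output capsules for a binary emotion task.
caps = [[0.1, 0.2],   # "negative" capsule, length ~0.22
        [0.7, 0.6]]   # "positive" capsule, length ~0.92
label = predict(caps, ["negative", "positive"])
```

Because the squash nonlinearity bounds capsule lengths in (0, 1), the lengths behave like (unnormalized) class probabilities, which is what the normalization in the classification capsule layer exploits.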
Claims (2)
1. A hybrid neural network model based on a hierarchical self-attention network and a capsule network, wherein the capsule network encodes the relation between the parts of a text and the whole to extract richer semantic feature information, and the hierarchical self-attention network extracts context information at the word and sentence levels, so that fusing the advantages of the two improves classification performance.
2. The model of claim 1, wherein self-attention replaces traditional attention, reducing the dependence on external parameters, capturing the internal dependency relations of the text, attending to important words and sentences, and enhancing the feature-expression capability of the model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110292481.8A CN112883167A (en) | 2021-03-18 | 2021-03-18 | Text emotion classification model based on a hierarchical self-attention capsule network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110292481.8A CN112883167A (en) | 2021-03-18 | 2021-03-18 | Text emotion classification model based on a hierarchical self-attention capsule network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112883167A true CN112883167A (en) | 2021-06-01 |
Family
ID=76041259
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110292481.8A Pending CN112883167A (en) | 2021-03-18 | 2021-03-18 | Text emotion classification model based on a hierarchical self-attention capsule network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112883167A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114036366A (en) * | 2021-11-19 | 2022-02-11 | 四川大学 | Social network deception detection method based on text semantics and hierarchical structure |
CN114118200A (en) * | 2021-09-24 | 2022-03-01 | 杭州电子科技大学 | Multi-modal emotion classification method based on attention-guided bidirectional capsule network |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109241283A (en) * | 2018-08-08 | 2019-01-18 | 广东工业大学 | A kind of file classification method based on multi-angle capsule network |
CN110826336A (en) * | 2019-09-18 | 2020-02-21 | 华南师范大学 | Emotion classification method, system, storage medium and equipment |
US20200210526A1 (en) * | 2019-01-02 | 2020-07-02 | Netapp, Inc. | Document classification using attention networks |
CN112131391A (en) * | 2020-11-25 | 2020-12-25 | 江苏电力信息技术有限公司 | Power supply service client appeal text classification method based on capsule network |
-
2021
- 2021-03-18 CN CN202110292481.8A patent/CN112883167A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109241283A (en) * | 2018-08-08 | 2019-01-18 | 广东工业大学 | A kind of file classification method based on multi-angle capsule network |
US20200210526A1 (en) * | 2019-01-02 | 2020-07-02 | Netapp, Inc. | Document classification using attention networks |
CN110826336A (en) * | 2019-09-18 | 2020-02-21 | 华南师范大学 | Emotion classification method, system, storage medium and equipment |
CN112131391A (en) * | 2020-11-25 | 2020-12-25 | 江苏电力信息技术有限公司 | Power supply service client appeal text classification method based on capsule network |
Non-Patent Citations (4)
Title |
---|
Liu Chunjia: "Sentiment orientation analysis of Chinese text for university public opinion", China Masters' Theses Full-text Database (Information Science and Technology), pages 1-47 *
Wang Hongzhong et al.: "A capsule network text classification method based on hybrid word vectors", Journal of Chinese Computer Systems *
Cheng Yan et al.: "Sentiment orientation analysis with multi-channel CNN and BiGRU based on an attention mechanism", Journal of Computer Research and Development *
Cheng Yan et al.: "Chinese text sentiment orientation analysis combining convolutional neural networks and hierarchical attention networks", Journal of Chinese Information Processing *
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114118200A (en) * | 2021-09-24 | 2022-03-01 | 杭州电子科技大学 | Multi-modal emotion classification method based on attention-guided bidirectional capsule network |
CN114036366A (en) * | 2021-11-19 | 2022-02-11 | 四川大学 | Social network deception detection method based on text semantics and hierarchical structure |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108596039B (en) | Bimodal emotion recognition method and system based on 3D convolutional neural network | |
CN113158875B (en) | Image-text emotion analysis method and system based on multi-mode interaction fusion network | |
CN108830287A (en) | The Chinese image, semantic of Inception network integration multilayer GRU based on residual error connection describes method | |
CN109918671A (en) | Electronic health record entity relation extraction method based on convolution loop neural network | |
CN111626116B (en) | Video semantic analysis method based on fusion of multi-attention mechanism and Graph | |
CN111339837B (en) | Continuous sign language recognition method | |
CN112036276B (en) | Artificial intelligent video question-answering method | |
CN101187990A (en) | A session robotic system | |
CN110347831A (en) | Based on the sensibility classification method from attention mechanism | |
CN111046179A (en) | Text classification method for open network question in specific field | |
Li et al. | Sign language recognition based on computer vision | |
CN114549850B (en) | Multi-mode image aesthetic quality evaluation method for solving modal missing problem | |
CN112883167A (en) | Text emotion classification model based on a hierarchical self-attention capsule network | |
CN111598183A (en) | Multi-feature fusion image description method | |
CN111985205A (en) | Aspect level emotion classification model | |
CN113239153B (en) | Text and image mutual retrieval method based on example masking | |
CN114662497A (en) | False news detection method based on cooperative neural network | |
CN113283234A (en) | Emotion classification method based on capsule network | |
CN117236338B (en) | Named entity recognition model of dense entity text and training method thereof | |
CN113255602A (en) | Dynamic gesture recognition method based on multi-modal data | |
CN115393948A (en) | Sign language video generation method based on improved Transformer model | |
CN109508640A (en) | A kind of crowd's sentiment analysis method, apparatus and storage medium | |
CN111445545B (en) | Text transfer mapping method and device, storage medium and electronic equipment | |
Fan et al. | An improved yolov5 marine biological object detection algorithm | |
Luqin | A survey of facial expression recognition based on convolutional neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 2021-06-01