CN110879938A - Text emotion classification method, device, equipment and storage medium - Google Patents
- Publication number
- CN110879938A (application number CN201911110950.9A)
- Authority
- CN
- China
- Prior art keywords
- feature
- feature representation
- context
- vector
- text data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Abstract
The invention provides a text emotion classification method, apparatus, device, and storage medium. The method comprises the following steps: acquiring word vectors from the text data to be processed and extracting feature vectors corresponding to the word vectors; extracting a context feature representation from the feature vectors using a bidirectional long short-term memory (Bi-LSTM) network; applying an Attention mechanism to the extracted context feature representation and then introducing top-k max pooling to fully extract the text feature representation; and feeding the extracted features to a classifier to obtain higher accuracy. The method of the embodiments of the invention improves the accuracy of text emotion classification and achieves a good classification effect.
Description
Technical Field
The invention relates to the field of computer technology, and in particular to a text emotion classification method, apparatus, device, and storage medium.
Background
With the development of the internet and the growth in the number of internet users, network users generate a large amount of text on the internet, such as comments on a commodity, a movie, or a shop; extracting useful information from this text benefits merchants, consumers, and others. Text emotion tendency analysis (i.e., text emotion classification), a branch of natural language processing (NLP), has therefore become increasingly important. Traditional text emotion classification methods do not consider the context information of words or the word order of the text, require a large amount of manual effort to extract text features, and may fail to extract deeper important features of the text.
In recent years, with the development of deep learning, recurrent neural network (RNN) and convolutional neural network (CNN) models have been proposed. The CNN model mainly uses convolutional and downsampling layers for feature extraction, while in the RNN model the state of the current node (or of the preceding nodes) affects the state of the next node, and the state of the last node is used as the feature. However, these models still yield a poor text emotion classification effect.
Disclosure of Invention
The invention provides a text emotion classification method, apparatus, device, and storage medium for improving the text emotion classification effect.
In a first aspect, the present invention provides a text emotion classification method, including:
acquiring a word vector in text data to be processed, and extracting a feature vector corresponding to the word vector by using convolution operation;
extracting a context feature representation from the feature vectors using a bidirectional long short-term memory (Bi-LSTM) network;
determining semantic codes corresponding to the context feature representations according to the extracted context feature representations;
performing maximum pooling on the semantic codes corresponding to the context feature representations, and concatenating the max-pooled semantic codes to obtain a concatenated feature representation;
and classifying the concatenated feature representation to obtain the emotion category corresponding to the text data.
In a second aspect, the present invention provides a text emotion classification apparatus, including:
the extraction module is used for acquiring word vectors in the text data to be processed and extracting the feature vectors corresponding to the word vectors;
the extraction module is further used for extracting a context feature representation from the feature vectors using a bidirectional long short-term memory (Bi-LSTM) network;
the determining module is used for determining semantic codes corresponding to the context feature representations according to the extracted context feature representations;
the processing module is used for performing maximum pooling on the semantic codes corresponding to the context feature representations and concatenating the max-pooled semantic codes to obtain a concatenated feature representation;
and the classification module is used for classifying the concatenated feature representations to obtain the emotion categories corresponding to the text data.
In a third aspect, the present invention provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the computer program implements the method described in any one of the first aspect.
In a fourth aspect, an embodiment of the present invention provides an electronic device, including:
a processor; and
a memory for storing executable instructions of the processor;
wherein the processor is configured to perform the method of any of the first aspects via execution of the executable instructions.
The text emotion classification method, apparatus, device, and storage medium provided by the embodiments of the invention acquire word vectors from the text data to be processed and extract feature vectors corresponding to the word vectors by a convolution operation; extract a context feature representation from the feature vectors using a bidirectional long short-term memory (Bi-LSTM) network; obtain the importance of different features with an Attention mechanism applied to the extracted context feature representation, then feed the representation into a top-k max pooling layer to extract the k most important features, thereby determining the semantic code corresponding to the context feature representation; and classify the semantic codes to obtain the emotion category of the text data. The Bi-LSTM model can fully capture the context features of the words in the text data, and the semantic coding distinguishes important features from unimportant ones and filters the latter out, so that important features receive higher weight. This improves the accuracy of text emotion classification and yields a better classification effect.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.
FIG. 1 is a flowchart illustrating a text emotion classification method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram illustrating a text emotion classification effect method according to an embodiment of the present invention;
FIG. 3 is a schematic diagram illustrating the principle of pooling according to one embodiment of the method of the present invention;
FIG. 4 is a schematic diagram of Bi-LSTM model principle of an embodiment of the method provided by the present invention;
FIG. 5 is a schematic illustration of an attention mechanism according to an embodiment of the method of the present invention;
FIG. 6 is a schematic structural diagram of an embodiment of a text emotion classification apparatus provided in the present invention;
fig. 7 is a schematic structural diagram of an embodiment of an electronic device provided in the present invention.
With the foregoing drawings in mind, certain embodiments of the disclosure have been shown and described in more detail below. These drawings and written description are not intended to limit the scope of the disclosed concepts in any way, but rather to illustrate the concepts of the disclosure to those skilled in the art by reference to specific embodiments.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.
The terms "comprising" and "having," and any variations thereof, in the description and claims of this invention and the drawings described herein are intended to cover non-exclusive inclusions. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those steps or elements listed, but may alternatively include other steps or elements not listed, or inherent to such process, method, article, or apparatus.
First, the application scenario of the invention is introduced:
the text emotion classification method provided by the embodiment of the invention is applied to a scene for carrying out emotion classification on text data so as to improve classification accuracy.
Emotion classification here judges whether a piece of text data expresses a positive or a negative emotion, for example in online comments such as purchase reviews, movie reviews, and microblog comments.
The method provided by the invention can be implemented by an electronic device, such as a processor executing corresponding software code, or by the electronic device exchanging data with a server while executing that code, for example with the server performing part of the operations and controlling the electronic device to execute the method.
The following embodiments are all described with electronic devices as the executing bodies.
The technical solution of the present invention will be described in detail below with specific examples. The following several specific embodiments may be combined with each other, and details of the same or similar concepts or processes may not be repeated in some embodiments.
FIG. 1 is a flowchart illustrating a text emotion classification method according to an embodiment of the present invention. As shown in fig. 1, the method provided by this embodiment includes:
Step 101: acquire word vectors from the text data to be processed, and extract feature vectors corresponding to the word vectors using a convolution operation.
Specifically, the word vectors may be trained in advance, for example on a 30 GB Sogou news corpus. The corpus may be segmented with the Jieba tokenizer under Python, and the word vectors trained with the CBOW model of word2vec. The parameters may be set as follows: context window length 5; learning rate alpha at its default of 0.025; minimum frequency min-count at its default of 5 (i.e., a word occurring fewer than 5 times in the corpus is discarded); and word vector dimension 100.
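As a toy illustration of the min-count rule described above (words occurring fewer than 5 times are discarded before word-vector training), the filtering step can be sketched in plain Python; in practice a library such as gensim applies this filter internally via its `min_count` parameter. The function and variable names below are our own, not the patent's:

```python
from collections import Counter

def filter_rare_words(tokenized_docs, min_count=5):
    """Drop every word that occurs fewer than min_count times in the corpus."""
    counts = Counter(w for doc in tokenized_docs for w in doc)
    return [[w for w in doc if counts[w] >= min_count] for doc in tokenized_docs]

docs = [["good", "movie"], ["good", "film"], ["good", "good", "good"]]
kept = filter_rare_words(docs, min_count=3)
# "good" occurs 5 times and survives; "movie" and "film" occur once and are dropped
```

The surviving vocabulary is then what the CBOW model is trained on, with the window length, learning rate, and vector dimension set as described above.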
And segmenting the text data to be processed to obtain a plurality of words, and converting the words in the text data into word vectors according to the trained word vector model. The text data includes, for example, a plurality of sentences, each sentence corresponding to a plurality of word vectors. For example by the word embedding layer shown in fig. 2.
Further, as shown in fig. 2, after text data is converted into corresponding word vectors, preliminary feature vectors are extracted through one layer of one-dimensional convolution layer.
Step 102: extract a context feature representation from the feature vectors using a bidirectional long short-term memory (Bi-LSTM) network.
Specifically, in the method of the embodiment of the invention, the feature vectors obtained by the convolution operation (the output of the convolutional layer) are fed into the Bi-LSTM model (as shown in fig. 2), which can fully extract the text features.
A bidirectional long short-term memory network has clear advantages in processing sequential data (and text data is sequential), so in this embodiment the feature vectors produced by the first convolution layer are fed into a Bi-LSTM model. Compared with a traditional recurrent neural network (RNN), a long short-term memory (LSTM) network does not suffer from vanishing or exploding gradients and performs well in natural language processing. To let the LSTM fuse the vocabulary information of the current time step with all of its context, this embodiment uses a Bi-LSTM model that reads the text in both directions.
The context feature representation is extracted from the convolved feature vectors through the Bi-LSTM model.
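The two-direction reading can be sketched as follows. This is a minimal illustration, not the patent's model: a plain tanh recurrence stands in for the full LSTM cell, and all names and shapes are our own assumptions. The key point is that the forward and backward hidden states at each position are concatenated, so every position's feature carries both its preceding and following context:

```python
import numpy as np

def bidirectional_context(feats, W, U, hidden=4):
    """Run a simple recurrence forward and backward over the feature sequence
    and concatenate the two hidden states at each time step."""
    def scan(seq):
        h = np.zeros(hidden)
        out = []
        for x in seq:
            h = np.tanh(W @ h + U @ x)  # toy stand-in for an LSTM cell update
            out.append(h)
        return out

    fwd = scan(feats)            # left-to-right pass
    bwd = scan(feats[::-1])[::-1]  # right-to-left pass, realigned to positions
    return np.array([np.concatenate([f, b]) for f, b in zip(fwd, bwd)])
```

With a hidden size of 4, each position in the output carries an 8-dimensional context feature (forward state plus backward state).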
Step 103: determine the semantic codes corresponding to the context feature representations from the extracted context feature representations.
Specifically, as shown in fig. 2, the semantic code corresponding to the context feature representation is determined by an Attention mechanism: by calculating the attention distribution probability, the influence of important features on the text emotion classification can be highlighted, since different keywords in a sentence affect the classification result differently.
For example, when reading a review of a product it is impossible to remember the entire description; only some keywords such as "good", "not good", or "like" are remembered, and these words are important for expressing the emotional tendency of the text. Different features in the text data therefore have different effects on the classification result.
The semantic code is calculated from the output of the Bi-LSTM layer (i.e., the context feature representation) and the probability weights.
Step 104: perform maximum pooling on the semantic codes corresponding to the context feature representations, and concatenate the max-pooled semantic codes to obtain the concatenated feature representation.
To further reduce the data dimensionality, a maximum pooling operation, k-max pooling, can be performed after the semantic codes are generated: a fixed sliding window selects the k largest values in the generated semantic coding result, extracting the k most important features and filtering out the unimportant ones. This reduces the dimensionality and thereby improves the convergence speed and prediction accuracy of the model.
Step 105: classify the concatenated feature representation to obtain the emotion category corresponding to the text data.
Specifically, the text data is classified according to the semantic codes after the top-k max pooling, yielding the emotion category corresponding to the text data. The classification may be performed by a preset classification function, and different classifiers may be used.
For example, for a comment on a certain product, if binary classification is performed, the comment can be classified as a good review or a bad review. Good reviews may include words such as "good"; bad reviews may include statements about a poor experience or advice not to purchase.
The method of this embodiment acquires word vectors from the text data to be processed and extracts the corresponding feature vectors; extracts a context feature representation from the feature vectors using a bidirectional long short-term memory (Bi-LSTM) network; determines the semantic codes corresponding to the extracted context feature representations; and classifies those semantic codes to obtain the emotion category of the text data. The Bi-LSTM model can fully capture the context features of the words in the text data, and the semantic coding distinguishes important features and filters out unimportant ones, so that important features receive higher weight, improving the accuracy and effect of text emotion classification.
On the basis of the foregoing embodiment, optionally, step 104 may be specifically implemented by:
selecting the k largest semantic codes within a preset sliding window, obtaining the k largest semantic codes for each of a plurality of sliding windows;
and concatenating the k largest semantic codes of the plurality of sliding windows to obtain the concatenated feature representation.
Specifically, as shown in fig. 2, k-max pooling is performed after semantic coding; as shown in fig. 3, the top-k calculation formula is:

top-k = max_k{c_1, c_2, c_3, …, c_p}

where k is the number of largest values to take; c_1, c_2, …, c_p are the semantic code values; p is the size of the sliding window; and ⊕ denotes concatenation of the vectors. The step size of the sliding window may be k − 1.
That is, the k largest semantic code values are selected from p semantic code values at a time; the sliding window then moves right by k − 1 positions, and the next group of p semantic code values is processed.
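The sliding-window selection just described can be sketched in Python. This is a toy illustration under the stated assumptions (window size p, top-k per window, stride k − 1, results concatenated); the function and parameter names are our own:

```python
def top_k_max_pool(codes, k=3, p=5):
    """Slide a window of size p with stride k-1 over the semantic code values;
    keep the k largest values per window and concatenate the results."""
    out = []
    step = k - 1
    for start in range(0, len(codes) - p + 1, step):
        window = codes[start:start + p]
        out.extend(sorted(window, reverse=True)[:k])  # k largest, descending
    return out

pooled = top_k_max_pool([1, 5, 2, 8, 3, 7, 4], k=3, p=5)
# windows [1,5,2,8,3] and [2,8,3,7,4] yield [8,5,3] and [8,7,4]
```

Whether the k values are kept in descending order or in their original positions is not specified in the source; descending order is assumed here.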
Finally, the concatenated feature representation is classified with a preset classification function to obtain the emotion category corresponding to the text data.
On the basis of the foregoing embodiment, optionally, the extracting of the feature vector corresponding to the word vector in step 101 may specifically be implemented in the following manner:
inputting the word vectors into the convolutional layer to obtain a feature matrix F:
each row of the F matrix represents the feature vector generated by convolving the word vectors of the text data with a convolution window of a given size;
where the entries of row i are c_ij = ReLU(s_j · f_i + θ), with ReLU the activation function; f_i ∈ R^{k×D} a filter of convolution length k (i.e., convolution window size k) applied to D-dimensional word vectors; θ the bias; and s_j = [w_j, w_{j+1}, …, w_{j+k−1}] the word vector matrix formed by k consecutive words starting from the j-th word of the text data, where w_j ∈ R^D is the D-dimensional word vector of the j-th word. The index i ranges from 1 to m, where m is the number of convolution window types; j ranges from 1 to n, where n is the number of words after segmentation of the text data.
For example, with m = 3 there are three window types: the type-1 convolution window has size k = 2, the type-2 window k = 3, and the type-3 window k = 4.
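One row of the feature matrix can be sketched as follows. This is a minimal NumPy illustration of c_ij = ReLU(s_j · f + θ) under the definitions above, with one filter; names and shapes are our own:

```python
import numpy as np

def conv_row(word_vecs, filt, theta=0.0):
    """Slide a window of k consecutive D-dim word vectors over the sentence
    and compute ReLU(sum(s_j * f) + theta) at each position j."""
    k, D = filt.shape
    n = word_vecs.shape[0]
    return np.array([max(0.0, float(np.sum(word_vecs[j:j + k] * filt) + theta))
                     for j in range(n - k + 1)])
```

Stacking m such rows (one per filter / window size) gives the feature matrix F the text describes; here a sentence of n words and a window of size k yield n − k + 1 feature values per filter.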
Further, as shown in fig. 4, step 102 may be specifically implemented by:
determining the preceding-context feature representation corresponding to the feature vector according to formula (2);
determining the following-context feature representation corresponding to the feature vector according to formula (3);
obtaining the context feature representation corresponding to the feature vector from the preceding- and following-context feature representations using formula (4);
wherein h istA hidden state h corresponding to the t-th word in the text datat=ot⊙tanh(ct),ot=δ(Wo·X+bo),ct=ft⊙t-1+it⊙tanh(Wc·X+bc),ft=δ(Wf·X+bf),,it=δ(Wi·X+bi);
Wherein, Wf、Wi、Wo、WcWeight matrix for LSTM, bf、bi、bo、bcOffset of LSTM, w1tColumn vectors of the t-th column of the F matrix, delta (·) is an activation function, ⊙ is a dot product operation of the matrix, and n is the number of word vectors.
In particular, δ (·) may be an activation function sigmoid.
The Bi-LSTM layer can fuse the current vocabulary information and all the context information thereof together to obtain the characteristic representation of the context.
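One step of the gate equations above can be sketched in NumPy. This is an illustrative sketch, not the patent's implementation: X is assumed to be the concatenation of the previous hidden state and the current input, each gate has its own weight matrix and bias (collected in dicts here), and δ is the sigmoid:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h_prev, c_prev, W, b):
    """One LSTM time step following the gate equations in the text."""
    X = np.concatenate([h_prev, x])
    f = sigmoid(W["f"] @ X + b["f"])                    # forget gate f_t
    i = sigmoid(W["i"] @ X + b["i"])                    # input gate i_t
    o = sigmoid(W["o"] @ X + b["o"])                    # output gate o_t
    c = f * c_prev + i * np.tanh(W["c"] @ X + b["c"])   # cell state c_t
    h = o * np.tanh(c)                                  # hidden state h_t
    return h, c
```

Running this step left-to-right and right-to-left over the columns of F, and concatenating the two hidden states per position, gives the Bi-LSTM context features.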
Further, as shown in fig. 5, step 103 may be specifically implemented as follows:
determining the semantic code corresponding to the context feature representation by formula (5), according to the context feature representations extracted by the Bi-LSTM model and the probability weights;
where out_t is the t-th context feature representation and a_t denotes the importance, i.e., the probability weight, of the t-th feature representation, calculated by formula (6):

a_t = exp(r_t) / Σ_{l=1..n} exp(r_l)

where r_t = v^T tanh(W_A·out_t + b); W_A is a parameter matrix; b is a bias term; and v^T is the transpose of the randomly initialized vector v.
The index l ranges from 1 to n, and n is the number of word vectors.
The semantic code corresponding to the context feature representation is calculated from the output of the Bi-LSTM layer and the probability weights, and the Attention mechanism highlights the influence of the important features.
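The attention weighting can be sketched as follows. A minimal NumPy illustration under the definitions above: scores r_t = v^T tanh(W_A·out_t + b), softmax weights a_t, and a semantic code formed as the weighted sum of the Bi-LSTM outputs (the source does not spell out the pooling form of formula (5); the weighted sum is an assumption). Names are our own:

```python
import numpy as np

def attention_code(outs, W_A, v, b):
    """Compute attention weights over Bi-LSTM outputs and the weighted code."""
    r = np.array([v @ np.tanh(W_A @ o + b) for o in outs])  # scores r_t
    e = np.exp(r - r.max())                                  # stable softmax
    a = e / e.sum()                                          # weights a_t
    return a, (a[:, None] * outs).sum(axis=0)                # semantic code
```

When all positions carry identical features the weights are uniform; distinctive keyword positions receive larger scores and hence larger weight, which is exactly the "important features get higher weight" behavior the text describes.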
The method of the embodiment of the invention inserts the Bi-LSTM model as a single layer between the convolutional layer and the pooling layer. The convolutional layer first performs preliminary feature extraction on the text; the feature vectors are then fed into the Bi-LSTM model to fully capture the contextual features of the words in the text data. To distinguish important features and filter out unimportant ones, an attention mechanism and top-k max pooling are applied to the output of the Bi-LSTM model: the attention mechanism gives important features higher weight, and the stride of the top-k max pooling sliding window can be k − 1, which mimics the N-gram operation of natural language processing.
To sum up, in the method of the embodiment of the invention, after the convolutional layer computes the feature vectors, the results are placed into the feature matrix F in calculation order; to prevent a pooling layer from disturbing the order of the original sentence, the Bi-LSTM is introduced directly after the convolutional layer; the Bi-LSTM output is passed through an Attention mechanism so that important features receive higher weight; top-k max pooling is introduced to reduce the feature dimensionality and improve classification accuracy; and finally the extracted feature representations are sent to a strong classifier, yielding a better classification effect.
Based on the method of the embodiments above, the CBLTK model of the embodiment of the present invention is built by merging the CNN and Bi-LSTM models, using deep learning's ability to extract deeper features of the text. To improve the accuracy of the final classification, several commonly used strong classifiers are compared below, and the extracted features are sent to these strong classifiers, such as a support vector machine (SVM) or a random forest (RF), for classification:
The text data used below is, for example, movie review data in which each review is already labeled with the number of stars the user gave (five stars meaning good and one star meaning poor). Five-star and one-star reviews are extracted for the text emotion classification study (i.e., binary classification). Thirty thousand texts per class (positive and negative) are used as training data and twenty thousand per class as test data. On statistical average each review contains 45 words, so the model of the embodiment fixes the sentence length at 45 words: sentences longer than 45 words are truncated, and sentences shorter than 45 words are padded with nulls. The Jieba tokenizer under Python is used for word segmentation, and TensorFlow 1.6 is used to build the CBLTK model of the embodiment. The word-vector training corpus may be the 30 GB Sogou news corpus.
Because of the sequential relationship among the words of text data, the embodiment uses only one convolution layer as the first layer of the CBLTK model, followed, after the attention mechanism, by a max-pooling layer and a classification layer such as softmax. During training, dropout can be set to 50% and L2 regularization is applied. The minibatch size can be set to 100, with three sizes of convolution windows and 150 convolution kernels per size; the best of the three window sizes is selected.
TABLE 1 selection of convolution window size
From the results in Table 1, convolution windows of lengths 2, 3, and 4 are selected for the convolution operation, and 150 hidden-layer neurons can be used for each LSTM in the Bi-LSTM layer. For the k-max pooling layer, values of k from 1 to 6 were tried; from the results in the table below, k = 3 can be selected.
TABLE 2 selection of k values
Top-k value | Accuracy
1 | 71.2% |
2 | 74.5% |
3 | 77.6% |
4 | 75.3% |
5 | 76.8% |
6 | 75.6% |
The CBLTK model of the embodiment mainly uses the final classification accuracy as the evaluation index. Several groups of comparison experiments were run, classifying with softmax and with strong classifiers respectively, and the classification result of a traditional text emotion classification method is added as a reference. The specific results are as follows:
TABLE 3 use of softmax classifier
Model | Accuracy rate |
CNN | 80.3% |
LSTM | 81.1% |
CNN+LSTM | 82.8% |
LSTM+CNN | 83.3% |
CBLTK | 85.2% |
TABLE 4 Use of a strong classifier (SVM)
It can be seen from tables 3 and 4 that a conventional text emotion classification method that does not use deep learning (for example, the term frequency-inverse document frequency (Tf-idf) algorithm) achieves lower accuracy, because it can only extract shallow text features, and that using LSTM first and then CNN performs better than using CNN first and then LSTM. It can be seen from table 4 that although the CBLTK model proposed in the embodiment of the present invention uses an SVM, the improvement is not large, so the following experiments combine the model with four commonly used strong classifiers: SVM, random forest (RF), naive Bayes and GBDT, to find the combination with the best classification effect.
TABLE 5 combination of different classifiers
Combination mode | Accuracy rate |
CBLTK+SVM | 86.1% |
CBLTK+RF | 87.6% |
CBLTK+GBDT | 89.3% |
CBLTK + naive Bayes | 85.3% |
The results in table 5 are related to factors other than the currently selected data, such as the amount of data and whether the features extracted by the current model suit the current classifier. It can be seen from the results in table 5 that using a strong classifier is not necessarily always effective; for example, the naive Bayes combination does not perform as well as expected, which may be related to the characteristics of the data used and of naive Bayes: naive Bayes assumes that the features are mutually independent, whereas words and phrases in text classification are strongly correlated. The results also suggest that GBDT outperforms random forest (RF) probably because RF uses the bagging idea of ensemble learning while GBDT belongs to the boosting idea, which samples according to the error rate, that is, during training a weak classifier with a large classification error is given a relatively low weight. The training process of GBDT is therefore similar to that of a deep learning model integrating the Attention mechanism, in which weight values are used to highlight important features.
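The boosting behaviour invoked above (reweighting according to the error rate so that misclassified samples receive more attention) can be illustrated with a one-step AdaBoost-style weight update; this is a generic sketch of the boosting idea, not the GBDT variant actually combined with CBLTK.

```python
import numpy as np

def boosting_reweight(sample_weights, correct):
    """One AdaBoost-style round: compute the weighted error of a weak
    classifier, then raise the weights of misclassified samples."""
    err = np.sum(sample_weights[~correct]) / np.sum(sample_weights)
    alpha = 0.5 * np.log((1 - err) / err)        # classifier weight: low when err is high
    w = sample_weights * np.exp(np.where(correct, -alpha, alpha))
    return alpha, w / w.sum()                    # renormalized sample weights

w0 = np.full(4, 0.25)
correct = np.array([True, True, True, False])    # one misclassified sample
alpha, w1 = boosting_reweight(w0, correct)
# the misclassified sample now carries more weight than each correct one
```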
Fig. 6 is a structural diagram of an embodiment of a text emotion classification device provided in the present invention, and as shown in fig. 6, the text emotion classification device of the present embodiment includes:
the extraction module 601 is configured to obtain a word vector in text data to be processed, and extract a feature vector corresponding to the word vector;
the extraction module 601 is further configured to extract a context feature representation from the feature vector by using a bidirectional long short-term memory network (Bi-LSTM) model;
a determining module 602, configured to determine, according to the extracted context feature representation, a semantic code corresponding to the context feature representation;
a processing module 603, configured to perform maximal pooling on the semantic codes corresponding to the context feature representations, and splice multiple semantic codes after the maximal pooling to obtain a spliced feature representation;
and classifying the spliced feature representation to acquire the emotion type corresponding to the text data.
In a possible implementation manner, the extracting module 601 is specifically configured to:
inputting the word vector into the convolutional layer to obtain a characteristic matrix as follows:
each row of the F matrix is the feature vector generated by convolving the word vectors in the text data with a convolution window of one of the different sizes;

wherein each element of a row satisfies c_ij = ReLU(s_j · f + θ), where ReLU is the activation function, f ∈ R^{k×D} denotes a filter with convolution window length k performing the convolution operation on D-dimensional word vectors, θ denotes the offset, and s_j is the word vector matrix composed of the k consecutive words starting from the jth word in the text data, s_j = [w_j, w_{j+1}, …, w_{j+k-1}], where w_j ∈ R^D is the word vector of the jth word in the text data and has dimension D; the value range of i is 1 to m, m being the number of convolution window sizes, and the value range of j is 1 to n, n being the number of words after word segmentation of the text data.
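The convolution feature extraction above can be sketched in numpy for a single filter; the dimensions D and k and the random values are illustrative assumptions, not the patent's parameters.

```python
import numpy as np

def conv_features(words, f, theta):
    """Slide a filter f (k x D) over the word-vector matrix words (n x D):
    c_j = ReLU(sum(s_j * f) + theta), with s_j the k words starting at j."""
    k, D = f.shape
    n = words.shape[0]
    out = []
    for j in range(n - k + 1):
        s_j = words[j:j + k]                                  # k consecutive word vectors
        out.append(max(0.0, float(np.sum(s_j * f) + theta)))  # ReLU activation
    return np.array(out)

rng = np.random.default_rng(0)
words = rng.standard_normal((45, 8))           # 45 words, D = 8
f = rng.standard_normal((3, 8))                # window length k = 3
c = conv_features(words, f, theta=0.1)         # one row of the F matrix
```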
In a possible implementation manner, the extracting module 601 is specifically configured to:
determining the above feature representation corresponding to the feature vector according to the following formula (2);
determining the following feature representation corresponding to the feature vector according to the following formula (3);
obtaining the feature representation of the context corresponding to the feature vector by using the following formula (4) according to the feature representation and the following feature representation;
wherein h_t is the hidden state corresponding to the t-th word in the text data, h_t = o_t ⊙ tanh(c_t), o_t = δ(W_o·X + b_o), c_t = f_t ⊙ c_{t-1} + i_t ⊙ tanh(W_c·X + b_c), f_t = δ(W_f·X + b_f), i_t = δ(W_i·X + b_i);

wherein W_f, W_i, W_o and W_c are the weight matrices of the LSTM, b_f, b_i, b_o and b_c are the offsets of the LSTM, w_t is the column vector of the t-th column of the F matrix, δ(·) is an activation function, ⊙ is the element-wise product of matrices, n is the number of word vectors, and the value range of t is 1 to n.
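A single step of the gate equations above can be sketched in numpy, taking δ(·) as the sigmoid and X as the concatenation of the previous hidden state with the current column of F; this wiring of X is a common reading of the formulas, and the exact arrangement in the patent may differ.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(w_t, h_prev, c_prev, W, b):
    """One LSTM step: f_t, i_t, o_t gates and cell update as in the
    equations above. W maps the concatenated input X = [h_prev; w_t]."""
    X = np.concatenate([h_prev, w_t])
    f_t = sigmoid(W["f"] @ X + b["f"])                  # forget gate
    i_t = sigmoid(W["i"] @ X + b["i"])                  # input gate
    o_t = sigmoid(W["o"] @ X + b["o"])                  # output gate
    c_t = f_t * c_prev + i_t * np.tanh(W["c"] @ X + b["c"])
    h_t = o_t * np.tanh(c_t)                            # hidden state for word t
    return h_t, c_t

rng = np.random.default_rng(1)
H, D = 4, 3                                             # hidden size, word-vector size
W = {g: rng.standard_normal((H, H + D)) for g in "fioc"}
b = {g: np.zeros(H) for g in "fioc"}
h, c = lstm_step(rng.standard_normal(D), np.zeros(H), np.zeros(H), W, b)
```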
In a possible implementation manner, the determining module 602 is specifically configured to:
determining semantic codes corresponding to the feature representation of the context by using the following formula (5) according to the feature representation of the context extracted by the Bi-LSTM model and the probability weight;
wherein out_t is the feature representation of the context and a_t represents the degree of importance of the t-th feature representation, i.e., the probability weight of the t-th feature representation; a_t is calculated from the following equation (6):

wherein r_t = v^T tanh(W_A out_t + b), W_A is a parameter matrix, b is a bias term, and v^T is the transpose of the randomly initialized matrix v.
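The attention weighting of equations (5) and (6) — score each context feature out_t as r_t = v^T tanh(W_A out_t + b), softmax the scores into probability weights a_t, then form the weighted semantic code — can be sketched as follows; the feature dimensions and the weighted-sum pooling at the end are illustrative assumptions.

```python
import numpy as np

def attention_pool(outs, W_A, b, v):
    """outs: (n, d) context feature representations.
    Returns the probability weights a_t and the weighted semantic code."""
    r = np.tanh(outs @ W_A.T + b) @ v          # r_t = v^T tanh(W_A out_t + b)
    a = np.exp(r - r.max())
    a /= a.sum()                               # softmax -> probability weights a_t
    return a, a @ outs                         # weighted sum of the features

rng = np.random.default_rng(2)
outs = rng.standard_normal((45, 6))            # 45 words, feature dimension 6
W_A = rng.standard_normal((6, 6))
a, code = attention_pool(outs, W_A, np.zeros(6), rng.standard_normal(6))
```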
In a possible implementation manner, the processing module 603 is specifically configured to:
selecting the first k largest semantic codes in the sliding window by using a preset sliding window to obtain the first k largest semantic codes corresponding to a plurality of sliding windows;
and splicing the first k maximum semantic codes corresponding to the plurality of sliding windows to obtain the spliced feature representation.
In a possible implementation manner, the processing module 603 is specifically configured to:
and classifying the spliced feature representation by using a preset classification function to obtain the emotion classification corresponding to the text data.
The apparatus of this embodiment may be configured to implement the technical solutions of the above method embodiments, and the implementation principles and technical effects are similar, which are not described herein again.
Fig. 7 is a structural diagram of an embodiment of an electronic device provided in the present invention, and as shown in fig. 7, the electronic device includes:
a processor 701, and a memory 702 for storing executable instructions for the processor 701.
Optionally, the method may further include: a communication interface 703 for enabling communication with other devices.
The above components may communicate over one or more buses.
The processor 701 is configured to execute the corresponding method in the foregoing method embodiment by executing the executable instructions, and for the specific implementation process reference may be made to the foregoing method embodiment, which is not described herein again.
The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored, and when the computer program is executed by a processor, the method in the foregoing method embodiment is implemented.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.
Claims (10)
1. A text emotion classification method is characterized by comprising the following steps:
acquiring a word vector in text data to be processed, and extracting a feature vector corresponding to the word vector by using convolution operation;
extracting a context feature representation from the feature vector by adopting a bidirectional long short-term memory network Bi-LSTM model;
determining semantic codes corresponding to the context feature representations according to the extracted context feature representations;
performing maximum pooling on semantic codes corresponding to the context feature representations, and splicing the semantic codes subjected to maximum pooling to obtain spliced feature representations;
and classifying the spliced feature representations to acquire emotion categories corresponding to the text data.
2. The method according to claim 1, wherein the extracting the feature vector corresponding to the word vector comprises:
inputting the word vector into the convolutional layer to obtain a characteristic matrix as follows:
each row of the F matrix is the feature vector generated by convolving the word vectors in the text data with a convolution window of one of the different sizes;

wherein each element of a row satisfies c_ij = ReLU(s_j · f + θ), where ReLU is the activation function, f ∈ R^{k×D} denotes a filter with convolution window length k performing the convolution operation on D-dimensional word vectors, θ denotes the offset, and s_j is the word vector matrix composed of the k consecutive words starting from the jth word in the text data, s_j = [w_j, w_{j+1}, …, w_{j+k-1}], where w_j ∈ R^D is the word vector of the jth word in the text data and has dimension D; the value range of i is 1 to m, m being the number of convolution window sizes, and the value range of j is 1 to n, n being the number of words after word segmentation of the text data.
3. The method of claim 2, wherein the extracting a context feature representation from the feature vector by adopting a bidirectional long short-term memory network Bi-LSTM model comprises:
determining the above feature representation corresponding to the feature vector according to the following formula (2);
determining the following feature representation corresponding to the feature vector according to the following formula (3);
obtaining the feature representation of the context corresponding to the feature vector by using the following formula (4) according to the feature representation and the following feature representation;
wherein h_t is the hidden state corresponding to the t-th word in the text data, h_t = o_t ⊙ tanh(c_t), o_t = δ(W_o·X + b_o), c_t = f_t ⊙ c_{t-1} + i_t ⊙ tanh(W_c·X + b_c), f_t = δ(W_f·X + b_f), i_t = δ(W_i·X + b_i);

wherein W_f, W_i, W_o and W_c are the weight matrices of the LSTM, b_f, b_i, b_o and b_c are the offsets of the LSTM, w_t is the column vector of the t-th column of the F matrix, δ(·) is an activation function, ⊙ is the element-wise product of matrices, n is the number of word vectors, and the value range of t is 1 to n.
4. The method according to claim 3, wherein the determining semantic coding corresponding to the context feature representation according to the extracted context feature representation comprises:
determining semantic codes corresponding to the feature representation of the context by using the following formula (5) according to the feature representation and the probability weight of the context extracted by the Bi-LSTM model;
wherein out_t is the feature representation of the context and a_t represents the degree of importance of the t-th feature representation, i.e., the probability weight of the t-th feature representation; a_t is calculated from the following equation (6):

wherein r_t = v^T tanh(W_A out_t + b), W_A is a parameter matrix, b is a bias term, and v^T is the transpose of the randomly initialized matrix v.
5. The method according to claim 1, wherein performing maximal pooling on semantic codes corresponding to the context feature representations and splicing the multiple semantic codes after the maximal pooling to obtain a spliced feature representation comprises:
selecting the first k largest semantic codes in the sliding window by using a preset sliding window to obtain the first k largest semantic codes corresponding to a plurality of sliding windows;
and splicing the first k maximum semantic codes corresponding to the plurality of sliding windows to obtain the spliced feature representation.
6. The method according to any one of claims 1 to 5, wherein the classifying the spliced feature representation comprises:
and classifying the spliced feature representations by using different preset classification models to obtain emotion categories corresponding to the text data.
7. A text emotion classification device, comprising:
the extraction module is used for acquiring word vectors in the text data to be processed and extracting the feature vectors corresponding to the word vectors;
the extraction module is also used for extracting a context feature representation from the feature vector by adopting a bidirectional long short-term memory network Bi-LSTM model;
the determining module is used for determining, by utilizing an Attention mechanism, semantic codes corresponding to the context feature representations according to the extracted context feature representations;
the processing module is used for performing maximum pooling processing on the semantic codes corresponding to the context feature representation and splicing the semantic codes subjected to maximum pooling processing to obtain spliced feature representation;
and classifying the spliced feature representation to acquire the emotion type corresponding to the text data.
8. The apparatus of claim 7, wherein the processing module is specifically configured to:
selecting the first k largest semantic codes in the sliding window by using a preset sliding window to obtain the first k largest semantic codes corresponding to a plurality of sliding windows;
and splicing the first k maximum semantic codes corresponding to the plurality of sliding windows to obtain the spliced feature representation.
9. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the method of any one of claims 1-6.
10. An electronic device, comprising:
a processor; and
a memory for storing executable instructions of the processor;
wherein the processor is configured to perform the method of any of claims 1-6 via execution of the executable instructions.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911110950.9A CN110879938A (en) | 2019-11-14 | 2019-11-14 | Text emotion classification method, device, equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911110950.9A CN110879938A (en) | 2019-11-14 | 2019-11-14 | Text emotion classification method, device, equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110879938A true CN110879938A (en) | 2020-03-13 |
Family
ID=69730444
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911110950.9A Pending CN110879938A (en) | 2019-11-14 | 2019-11-14 | Text emotion classification method, device, equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110879938A (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111538835A (en) * | 2020-03-30 | 2020-08-14 | 东南大学 | Social media emotion classification method and device based on knowledge graph |
CN111737467A (en) * | 2020-06-22 | 2020-10-02 | 华南师范大学 | Object-level emotion classification method based on segmented convolutional neural network |
CN111930938A (en) * | 2020-07-06 | 2020-11-13 | 武汉卓尔数字传媒科技有限公司 | Text classification method and device, electronic equipment and storage medium |
CN112069307A (en) * | 2020-08-25 | 2020-12-11 | 中国人民大学 | Legal law citation information extraction system |
CN113361252A (en) * | 2021-05-27 | 2021-09-07 | 山东师范大学 | Text depression tendency detection system based on multi-modal features and emotion dictionary |
CN113393276A (en) * | 2021-06-25 | 2021-09-14 | 食亨(上海)科技服务有限公司 | Comment data classification method and device and computer readable medium |
CN113469365A (en) * | 2021-06-30 | 2021-10-01 | 上海寒武纪信息科技有限公司 | Inference and compilation method based on neural network model and related products thereof |
CN114168730A (en) * | 2021-11-26 | 2022-03-11 | 一拓通信集团股份有限公司 | Consumption tendency analysis method based on BilSTM and SVM |
CN114298019A (en) * | 2021-12-29 | 2022-04-08 | 中国建设银行股份有限公司 | Emotion recognition method, emotion recognition apparatus, emotion recognition device, storage medium, and program product |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107092596A (en) * | 2017-04-24 | 2017-08-25 | 重庆邮电大学 | Text emotion analysis method based on attention CNNs and CCR |
CN108363753A (en) * | 2018-01-30 | 2018-08-03 | 南京邮电大学 | Comment text sentiment classification model is trained and sensibility classification method, device and equipment |
WO2019079922A1 (en) * | 2017-10-23 | 2019-05-02 | 腾讯科技(深圳)有限公司 | Session information processing method and device, and storage medium |
CN109710761A (en) * | 2018-12-21 | 2019-05-03 | 中国标准化研究院 | The sentiment analysis method of two-way LSTM model based on attention enhancing |
WO2019085328A1 (en) * | 2017-11-02 | 2019-05-09 | 平安科技(深圳)有限公司 | Enterprise relationship extraction method and device, and storage medium |
CN109740148A (en) * | 2018-12-16 | 2019-05-10 | 北京工业大学 | A kind of text emotion analysis method of BiLSTM combination Attention mechanism |
US20190188260A1 (en) * | 2017-12-14 | 2019-06-20 | Qualtrics, Llc | Capturing rich response relationships with small-data neural networks |
CN110162636A (en) * | 2019-05-30 | 2019-08-23 | 中森云链(成都)科技有限责任公司 | Text mood reason recognition methods based on D-LSTM |
Non-Patent Citations (2)
Title |
---|
CHENG Lu: "Research on an attention-mechanism-based bidirectional LSTM model for sentiment classification of Chinese product reviews" * |
BAI Jing; LI Fei; JI Donghong: "An attention-based BiLSTM-CNN model for Chinese microblog stance detection" * |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111538835A (en) * | 2020-03-30 | 2020-08-14 | 东南大学 | Social media emotion classification method and device based on knowledge graph |
CN111538835B (en) * | 2020-03-30 | 2023-05-23 | 东南大学 | Social media emotion classification method and device based on knowledge graph |
CN111737467B (en) * | 2020-06-22 | 2023-05-23 | 华南师范大学 | Object-level emotion classification method based on segmented convolutional neural network |
CN111737467A (en) * | 2020-06-22 | 2020-10-02 | 华南师范大学 | Object-level emotion classification method based on segmented convolutional neural network |
CN111930938A (en) * | 2020-07-06 | 2020-11-13 | 武汉卓尔数字传媒科技有限公司 | Text classification method and device, electronic equipment and storage medium |
CN112069307A (en) * | 2020-08-25 | 2020-12-11 | 中国人民大学 | Legal law citation information extraction system |
CN113361252A (en) * | 2021-05-27 | 2021-09-07 | 山东师范大学 | Text depression tendency detection system based on multi-modal features and emotion dictionary |
CN113393276A (en) * | 2021-06-25 | 2021-09-14 | 食亨(上海)科技服务有限公司 | Comment data classification method and device and computer readable medium |
CN113393276B (en) * | 2021-06-25 | 2023-06-16 | 食亨(上海)科技服务有限公司 | Comment data classification method, comment data classification device and computer-readable medium |
CN113469365A (en) * | 2021-06-30 | 2021-10-01 | 上海寒武纪信息科技有限公司 | Inference and compilation method based on neural network model and related products thereof |
CN113469365B (en) * | 2021-06-30 | 2024-03-19 | 上海寒武纪信息科技有限公司 | Reasoning and compiling method based on neural network model and related products thereof |
CN114168730A (en) * | 2021-11-26 | 2022-03-11 | 一拓通信集团股份有限公司 | Consumption tendency analysis method based on BilSTM and SVM |
CN114298019A (en) * | 2021-12-29 | 2022-04-08 | 中国建设银行股份有限公司 | Emotion recognition method, emotion recognition apparatus, emotion recognition device, storage medium, and program product |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20200313 |