CN110472245B - Multi-label emotion intensity prediction method based on hierarchical convolutional neural network - Google Patents

Multi-label emotion intensity prediction method based on hierarchical convolutional neural network

Info

Publication number
CN110472245B
CN110472245B (application CN201910751989.2A)
Authority
CN
China
Prior art keywords
emotion
label
data
social media
short text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910751989.2A
Other languages
Chinese (zh)
Other versions
CN110472245A (en)
Inventor
冯时
谢宏亮
王大玲
张一飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northeastern University China
Original Assignee
Northeastern University China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northeastern University China filed Critical Northeastern University China
Priority to CN201910751989.2A priority Critical patent/CN110472245B/en
Publication of CN110472245A publication Critical patent/CN110472245A/en
Application granted granted Critical
Publication of CN110472245B publication Critical patent/CN110472245B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/049 Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)

Abstract

The invention provides a multi-label emotion intensity prediction method based on a hierarchical convolutional neural network, comprising the following steps: dividing original multi-label social media short texts into a training set and a test set; preprocessing each piece of original multi-label short text data in the training set to obtain basic-emotion single-label data; constructing a single-label emotion classification model based on a hierarchical convolutional neural network; constructing an emotion intensity value model based on an attention convolutional neural network; and, for the multi-label short text test data, predicting with the single-label emotion classification model of the hierarchical convolutional neural network to obtain an optimized multi-label emotion intensity vector. The method further improves the accuracy of emotion intensity prediction for social media text and is particularly suited to scenarios in which several basic emotions coexist in a text.

Description

Multi-label emotion intensity prediction method based on hierarchical convolutional neural network
Technical Field
The invention belongs to the field of text mining and public opinion analysis, and in particular relates to a multi-label emotion intensity prediction method based on a hierarchical convolutional neural network.
Background
With the development of mobile internet technology in recent years, social media has become an important channel through which people conveniently share their views and opinions. At the same time, the huge user base generates a large volume of short text data on social media every day, and this data has become an important source for online public opinion analysis systems. Emotion analysis is a core component of such systems, so its study is of practical significance. Since short texts account for a high proportion of social media data, emotion analysis for short texts is a research direction with real application value.
Current research on textual emotion analysis focuses mainly on the classification problem, i.e. assigning a text to an appropriate category such as "happy", "angry" or "disgusted". However, a text conveys more than emotion categories: the intensity with which the same emotion is expressed can differ greatly between texts. Moreover, human emotional expression is complex; a single social media short text can express several emotions with different intensities. A single-label emotion analysis method can only capture the strongest emotion in the text. Existing multi-label emotion classification algorithms can correctly identify the other emotions a text contains, but they cannot predict the intensity value of each emotion, so they cannot tell which emotion dominates a sentence. A practical multi-label emotion intensity prediction algorithm matches the complexity of human emotional expression and has great application value in fields such as social media public opinion early warning and emergency public opinion tracking.
Disclosure of Invention
Aiming at this problem, the invention provides a multi-label emotion intensity prediction method based on a hierarchical convolutional neural network (HCNN). The goal is to learn, through deep learning, a mapping function from a training data set so that, given a social media short text, the method predicts its emotion label: a vector of n real numbers, each value in [0,1] representing the intensity of the corresponding basic emotion.
A multi-label emotion intensity prediction method based on a hierarchical convolutional neural network comprises the following specific processes:
step 1: dividing an original multi-label social media short text into a training set and a testing set;
Step 2: preprocess a piece of original multi-label social media short text data from the training set to obtain preprocessed single-label data; each piece of data in the original multi-label training set is labeled with an n-dimensional real vector [e_1, e_2, …, e_n] of basic emotion intensity values;
Step 2.1: remove punctuation irrelevant to emotion analysis from the short text data, retaining question marks and exclamation marks and removing all other punctuation, to obtain the punctuation-cleaned short text data;
Step 2.2: if the punctuation-cleaned short text data contains numerals, replace them with a specified placeholder value to obtain the numeral-replaced short text data;
Step 2.3: let the largest emotion category contain num texts. If the emotion category distribution of the numeral-replaced short text data is unbalanced, i.e. some other emotion category has fewer than (3·num)/4 texts, resample the short text data of that category so that all emotion categories end up at a similar scale, defined as: the data volume of the smallest category is no less than 0.75 times that of the largest category. If every other emotion category already has at least (3·num)/4 texts, skip resampling and go to step 2.4;
Step 2.4: for the resampled short text data, put each text containing basic emotion e_i into the corresponding basic-emotion single-label data set D_i; a resampled short text carrying n basic emotion intensity values thus generates n pieces of basic-emotion single-label data, where the emotion is considered present if e_i > 0 and absent if e_i = 0;
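The preprocessing of steps 2.1 to 2.4 above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the sample texts, the two-emotion setup, and the "<num>" placeholder token are assumptions (the patent only says numerals are replaced with "a specified value"), and the resampling of step 2.3 is omitted for brevity.

```python
import re

def clean_text(text):
    # Step 2.1: keep question/exclamation marks, drop other punctuation.
    text = re.sub(r"[^\w\s?!？！]", "", text)
    # Step 2.2: replace every numeral run with one placeholder token
    # ("<num>" is an assumed placeholder, not specified by the patent).
    return re.sub(r"\d+", "<num>", text)

def split_single_label(samples, n_emotions):
    # Step 2.4: a text labeled [e_1..e_n] spawns one single-label example
    # per basic emotion; e_i > 0 marks the emotion as present.
    datasets = [[] for _ in range(n_emotions)]
    for text, intensities in samples:
        cleaned = clean_text(text)
        for i, e in enumerate(intensities):
            datasets[i].append((cleaned, int(e > 0), e))
    return datasets

samples = [("So happy!!! 100 likes.", [0.8, 0.0])]
d = split_single_label(samples, 2)
# d[0] holds the single-label data D_1 (emotion present, intensity 0.8),
# d[1] holds D_2 (emotion absent, intensity 0.0).
```

Each D_i produced this way feeds both the classifier of step 4 and, after filtering the e_i = 0 texts, the intensity model of step 5.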
and step 3: obtaining a plurality of sections of original multi-label social media short text data to form a training set, and processing each section of original multi-label social media short text data by adopting the method in the step 2 to obtain basic emotion single label data D of the training set i
Step 4: construct a single-label emotion classification model based on a hierarchical convolutional neural network (HCNN);
Step 4.1: convert the basic-emotion single-label data D_i of the training set into a word vector matrix X and use it to initialize the embedding layer of the neural network model;
Step 4.2: apply the convolution windows and max-pooling operation of a convolutional neural network (CNN) to the word vector matrix X to extract local features v_w:
v_w = CNN(X)
Step 4.3: encode the word vector matrix X with a bidirectional long short-term memory network (BiLSTM) to obtain, for each word, an enhanced vector representation that takes context into account; representing the sentence S with these enhanced vectors yields the matrix X_c, from which a convolutional neural network extracts logic-level features to obtain the vector v_c:
X_c = BiLSTM(X)
v_c = CNN(X_c)
Step 4.4: fuse the local features and the logic-level features into a new text vector v_f:
v_f = v_w ⊕ v_c
where the symbol ⊕ denotes either vector concatenation or element-wise vector addition.
Step 4.5: input the new text vector v_f into a fully connected layer to obtain the single-label emotion classification model of the hierarchical convolutional neural network;
Step 4.6: input the new text vector v_f into the fully connected layer and obtain the output of the single-label emotion classification model with a softmax function:
ŷ = softmax(W·v_f + b)
The single-label emotion classification model of the hierarchical convolutional neural network uses the cross-entropy loss function:
L = -(1/N) Σ_{i=1}^{N} [ y_i·log ŷ_i + (1-y_i)·log(1-ŷ_i) ]
where N is the number of training examples, y_i is a binary variable indicating whether the i-th sample belongs to the given class, and ŷ_i is the model's predicted probability that the i-th sample belongs to that class.
The single-label emotion classification model is optimized under the cross-entropy loss by iterative gradient descent; the optimization ends after L full passes over the training data set, yielding the final single-label emotion classification model of the hierarchical convolutional neural network;
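The fusion, softmax output, and cross-entropy loss of steps 4.4 to 4.6 can be illustrated with a small NumPy sketch. The feature vectors, weight matrix, and dimensions below are toy assumptions; in the actual model v_w and v_c come from the CNN and BiLSTM+CNN branches and W, b are learned.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())       # subtract max for numerical stability
    return e / e.sum()

def cross_entropy(y, y_hat, eps=1e-12):
    # L = -(1/N) * sum_i [ y_i*log(y_hat_i) + (1-y_i)*log(1-y_hat_i) ]
    y = np.asarray(y, dtype=float)
    y_hat = np.clip(np.asarray(y_hat, dtype=float), eps, 1 - eps)
    return -np.mean(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

rng = np.random.default_rng(0)
v_w = rng.normal(size=3)                  # toy local CNN features
v_c = rng.normal(size=3)                  # toy BiLSTM+CNN logic features

v_f = np.concatenate([v_w, v_c])          # splicing variant of the fusion
v_f_add = v_w + v_c                       # addition variant (dim preserved)

W, b = rng.normal(size=(2, 6)), np.zeros(2)
y_hat = softmax(W @ v_f + b)              # [P(absent), P(present)]

loss = cross_entropy([1.0], [y_hat[1]])   # sample labeled as "present"
```

Note the design choice the ⊕ symbol leaves open: concatenation doubles the fused dimension (and the fully connected layer's input size), while addition preserves it.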
and 5: constructing an emotion intensity value model based on an Attention Convolution Neural Network (ACNN);
step 5.1: for single marker data set D i Filtering out e i Text of =0, next for e i >0, training a mood intensity value prediction model by utilizing the following steps;
and step 5.2: single marker data set D i Converting each text S into a word vector matrix X' for initializing an embedded layer of the neural network model;
step 5.3: word vector matrix X using long and short time memory model P Coding is carried out to obtain a task related expression vector v of the text S s Wherein:
v s =LSTM(X P )
step 5.4: representing a vector v by a sentence s And the original word vector matrix X P And calculating the related weight of the word vector through an attention mechanism, wherein the attention vector calculation method comprises the following steps:
v a =X P Wv s
wherein v is a Is the attention vector, W is the weight;
weighting the word vector by the attention vector, namely scaling the word vector in the subsequent window, wherein the formula is as follows:
α i =l*softmax(v a [i:i+l]),i∈{0,1,…,n-l}
Figure GDA0003842507290000041
where l is the window size and x_i, …, x_{i+l-1} are the word vectors of the words inside the current window. In other words, the similarity scores within the current window are converted into a probability distribution by the softmax function and multiplied by l to obtain the scaling weights, which are then multiplied with the original word vectors X_P to scale them. For each window, a new weighted representation Z of the text is generated.
Step 5.5: extract features from the weighted representation Z with a convolutional neural network. For the weighted representation Z_l generated with window size l, the most significant features are extracted with a CNN and max pooling:
v_l = CNN(Z_l)
The feature vectors v_l obtained for the different window sizes l are concatenated to form the final representation vector v_g of the input text;
Step 5.6: input v_g into the fully connected layer and obtain the final model output, the emotion intensity value of the text, with a softmax function:
p̂ = softmax(W·v_g + b)
Step 5.7: optimize the model with the training data and the loss function to obtain the optimal parameters and the optimized emotion intensity model. The mean squared error between the actual and the model-predicted emotion intensity values serves as the loss function:
L′ = (1/N′) Σ_{i=1}^{N′} (p_i - p̂_i)²
where N′ is the number of instances in the single-label emotion training data set of the given emotion, p_i is the labeled emotion intensity value of the i-th sample, and p̂_i is the intensity value predicted by the model h′ for the i-th sample.
Iterative optimization uses a stochastic gradient descent algorithm; the optimization ends after L′ full passes over the training data set, yielding the single-label emotion intensity prediction model.
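The loss and optimization of step 5.7 can be sketched with a toy substitute for the ACNN: a linear intensity model p̂ = x·w trained by gradient steps on the mean squared error. The data, learning rate, and pass count standing in for L′ are all illustrative assumptions; only the MSE formula and the descent update mirror the text.

```python
import numpy as np

def mse(p, p_hat):
    # L' = (1/N') * sum_i (p_i - p_hat_i)^2
    return float(np.mean((np.asarray(p) - np.asarray(p_hat)) ** 2))

rng = np.random.default_rng(2)
x = rng.normal(size=(8, 4))        # toy text feature vectors
p = rng.uniform(0.0, 1.0, size=8)  # labeled intensity values in [0, 1]
w = np.zeros(4)                    # parameters of the stand-in model

lr, passes = 0.05, 300             # "passes" plays the role of L'
initial = mse(p, x @ w)
for _ in range(passes):            # full-batch steps, for brevity
    grad = -2.0 * x.T @ (p - x @ w) / len(p)
    w -= lr * grad
final = mse(p, x @ w)              # loss after optimization
```

In the patent the descent is stochastic (per-example updates) over the ACNN's parameters; the stopping rule is the same, a fixed number L′ of passes over the training set.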
Step 6: for the multi-label social media short text test data, predict with the single-label emotion classification model of the hierarchical convolutional neural network to obtain the optimized multi-label emotion intensity vector;
Step 6.1: preprocess a piece of original multi-label short text data from the test set to obtain preprocessed single-label data; each piece of data in the multi-label test set is labeled with an n′-dimensional real vector [e′_1, e′_2, …, e′_{n′}] of basic emotion intensity values;
Step 6.1.1: remove punctuation irrelevant to emotion analysis from the short text data, retaining question marks and exclamation marks and removing all other punctuation, to obtain the punctuation-cleaned short text data;
Step 6.1.2: if the punctuation-cleaned short text data contains numerals, replace them with a specified placeholder value to obtain the numeral-replaced short text data;
Step 6.1.3: for the numeral-replaced short text data, put each text containing basic emotion e′_i into the corresponding basic-emotion single-label data set D′_i; a numeral-replaced short text carrying n′ basic emotion intensity values thus generates n′ pieces of basic-emotion single-label data, where the emotion is considered present if e′_i > 0 and absent if e′_i = 0;
Step 6.2: convert the preprocessed single-label test data D′_i into a word vector matrix X′ and use it to initialize the embedding layer of the neural network model;
Step 6.3: apply the convolution windows of a convolutional neural network (CNN) to the word vector matrix X′ to extract local features v′_w;
Step 6.4: encode the word vector matrix X′ with a bidirectional long short-term memory network to obtain an enhanced, context-aware vector representation for each word; representing the sentence S with these enhanced vectors yields the matrix X′_c, on which convolution windows and max pooling extract logic-level features to obtain the vector v′_c;
Step 6.5: fuse the local features and the logic-level features into the new text vector v′_f, the output vector of the network's convolution and pooling layers;
Step 6.6: input the new text vector v′_f into the fully connected layer and obtain the output of the single-label emotion classification model with a softmax function:
ŷ′ = softmax(W·v′_f + b)
Step 6.7: for each emotion output by the single-label emotion classification model, compute its emotion intensity value with the ACNN emotion intensity model;
Step 6.8: combine the outputs p̂′_i of the per-emotion ACNN intensity models into the optimized multi-label emotion intensity value vector [p̂′_1, p̂′_2, …, p̂′_{n′}].
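The combination performed in steps 6.7 and 6.8 reduces to: ask each binary classifier C_i whether emotion i is present and, only if so, ask the intensity model h_i for its value. A plain-Python sketch, with stub classifiers and intensity models that are purely illustrative (the real ones are the trained HCNN and ACNN):

```python
def predict_multilabel(text, classifiers, intensity_models):
    # Emotion i gets intensity h_i(text) if C_i says "present", else 0.
    return [h(text) if c(text) else 0.0
            for c, h in zip(classifiers, intensity_models)]

# Toy stand-ins for two basic emotions (say "joy" and "anger"):
classifiers = [lambda t: "happy" in t, lambda t: "hate" in t]
intensity_models = [lambda t: 0.9, lambda t: 0.7]

vec = predict_multilabel("so happy today", classifiers, intensity_models)
# vec == [0.9, 0.0]: "joy" detected with intensity 0.9, "anger" absent
```

The resulting vector is exactly the multi-label emotion intensity vector of step 6.8, with zeros for the emotions the classifiers judge absent.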
The beneficial technical effects are as follows:
Adopting the multi-label emotion intensity prediction method based on the hierarchical convolutional neural network further improves the accuracy of emotion intensity prediction for social media text; the method is particularly suitable for scenarios in which several basic emotions coexist in a text.
Drawings
FIG. 1 is the overall framework of the multi-label emotion intensity prediction method based on a hierarchical convolutional neural network according to an embodiment of the invention;
FIG. 2 is the HCNN model architecture according to an embodiment of the invention;
FIG. 3 is experimental comparison result 1 against a conventional CNN model;
FIG. 4 is experimental comparison result 2 against a conventional CNN model.
Detailed Description
The invention is further described below with reference to the accompanying drawings and a specific embodiment. The multi-label emotion intensity prediction method based on a hierarchical convolutional neural network proceeds as follows:
step 1: dividing an original multi-label social media short text into a training set and a testing set;
Step 2: preprocess a piece of original multi-label social media short text data from the training set to obtain preprocessed single-label data; each piece of data in the original multi-label training set is labeled with an n-dimensional real vector [e_1, e_2, …, e_n] of basic emotion intensity values;
Step 2.1: remove punctuation irrelevant to emotion analysis from the short text data, retaining question marks and exclamation marks and removing all other punctuation, to obtain the punctuation-cleaned short text data;
Step 2.2: if the punctuation-cleaned short text data contains numerals, replace them with a specified placeholder value to obtain the numeral-replaced short text data;
Step 2.3: let the largest emotion category contain num texts. If the emotion category distribution of the numeral-replaced short text data is unbalanced, i.e. some other emotion category has fewer than (3·num)/4 texts, resample the short text data of that category so that all emotion categories end up at a similar scale, defined as: the data volume of the smallest category is no less than 0.75 times that of the largest category. If every other emotion category already has at least (3·num)/4 texts, skip resampling and go to step 2.4;
Step 2.4: for the resampled short text data, put each text containing basic emotion e_i into the corresponding basic-emotion single-label data set D_i; a resampled short text carrying n basic emotion intensity values thus generates n pieces of basic-emotion single-label data, where the emotion is considered present if e_i > 0 and absent if e_i = 0;
Step 3: collect several pieces of original multi-label social media short text data to form the training set, and process each piece with the method of step 2 to obtain the basic-emotion single-label data sets D_i of the training set;
The overall framework of the algorithm is shown in FIG. 1 and mainly comprises two parts, model training and prediction; the main procedures are given as Algorithm 1 and Algorithm 2.
[Algorithm 1 and Algorithm 2: pseudocode figures, not reproduced]
Step 4: construct a single-label emotion classification model based on a hierarchical convolutional neural network (HCNN), as shown in FIG. 2;
Step 4.1: convert the basic-emotion single-label data D_i of the training set into a word vector matrix X and use it to initialize the embedding layer of the neural network model. In this embodiment, Chinese word vectors trained on Chinese Wikipedia initialize the embedding layer; training uses the Skip-gram model of the word2vec tool with a context window of 5 and the negative-sampling optimization method.
Step 4.2: apply the convolution windows and max-pooling operation of a convolutional neural network (CNN) to the word vector matrix X to extract local features v_w:
v_w = CNN(X)
Step 4.3: encode the word vector matrix X with a bidirectional long short-term memory network (BiLSTM) to obtain, for each word, an enhanced vector representation that takes context into account; representing the sentence S with these enhanced vectors yields the matrix X_c, from which a convolutional neural network extracts logic-level features to obtain the vector v_c:
X_c = BiLSTM(X)
v_c = CNN(X_c)
Step 4.4: fuse the local features and the logic-level features into a new text vector v_f:
v_f = v_w ⊕ v_c
where the symbol ⊕ denotes either vector concatenation or element-wise vector addition.
Step 4.5: input the new text vector v_f into a fully connected layer to obtain the single-label emotion classification model of the hierarchical convolutional neural network;
Step 4.6: input the new text vector v_f into the fully connected layer and obtain the output of the single-label emotion classification model with a softmax function:
ŷ = softmax(W·v_f + b)
The single-label emotion classification model of the hierarchical convolutional neural network uses the cross-entropy loss function:
L = -(1/N) Σ_{i=1}^{N} [ y_i·log ŷ_i + (1-y_i)·log(1-ŷ_i) ]
where N is the number of training examples, y_i is a binary variable indicating whether the i-th sample belongs to the given class, and ŷ_i is the model's predicted probability that the i-th sample belongs to that class.
The single-label emotion classification model is optimized under the cross-entropy loss by iterative gradient descent; the optimization ends after L full passes over the training data set, yielding the final single-label emotion classification model of the hierarchical convolutional neural network;
Algorithm 1 trains, for each basic emotion, a binary classifier model {C_i} and an emotion intensity prediction model {h_i}. On this basis, the method can predict the intensity value of each basic emotion in a given short text; when several basic emotions are expressed in the text, multi-label emotion intensity prediction is completed as detailed in Algorithm 2.
[Algorithm 2: pseudocode figure, not reproduced]
Using the trained classification and prediction models, Algorithm 2 effectively predicts the intensity values of multiple emotions present in a text at the same time; experimental results show that the proposed method further improves the text emotion intensity prediction effect, see FIG. 3 and FIG. 4.
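The training loop that Algorithm 1 describes can be sketched in plain Python: per basic emotion, build the single-label data set D_i, train a binary classifier C_i on it, and train an intensity model h_i on the e_i > 0 subset only (step 5.1). The trainer closures below are illustrative stand-ins for the HCNN classifier and the ACNN regressor, not the patent's models.

```python
def train_all(samples, n_emotions, train_clf, train_reg):
    classifiers, intensity_models = [], []
    for i in range(n_emotions):
        # D_i: (text, present?, intensity) triples for basic emotion i
        d_i = [(text, label[i] > 0, label[i]) for text, label in samples]
        classifiers.append(train_clf(d_i))
        # Step 5.1: the intensity model sees only texts with e_i > 0
        present = [(text, e) for text, p, e in d_i if p]
        intensity_models.append(train_reg(present))
    return classifiers, intensity_models

# Stand-in trainers: the "classifier" memorises positive texts, and the
# "regressor" always predicts the mean intensity of its training subset.
def train_clf(d):
    positives = {text for text, present, _ in d if present}
    return lambda text: text in positives

def train_reg(d):
    mean = sum(e for _, e in d) / max(len(d), 1)
    return lambda text: mean

samples = [("great day", [0.8, 0.0]), ("awful day", [0.0, 0.6])]
C, H = train_all(samples, 2, train_clf, train_reg)
```

Swapping the stand-in trainers for the HCNN of step 4 and the ACNN of step 5 yields the model sets {C_i}, {h_i} that Algorithm 2 consumes at prediction time.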
Step 5: construct an emotion intensity value model based on an attention convolutional neural network (ACNN);
Step 5.1: for each single-label data set D_i, filter out the texts with e_i = 0; the texts with e_i > 0 are then used to train an emotion intensity value prediction model through the following steps;
Step 5.2: convert each text S in the single-label data set D_i into a word vector matrix X_P, used to initialize the embedding layer of the neural network model;
Step 5.3: encode the word vector matrix X_P with a long short-term memory (LSTM) model to obtain a task-related representation vector v_s of the text S:
v_s = LSTM(X_P)
Step 5.4: from the sentence representation vector v_s and the original word vector matrix X_P, compute attention weights for the word vectors; the attention vector is computed as:
v_a = X_P·W·v_s
The attention vector is then used to weight, i.e. scale, the word vectors within each sliding window:
α_i = l·softmax(v_a[i:i+l]),  i ∈ {0, 1, …, n-l}
z_i = α_i ⊙ X_P[i:i+l]
where l is the window size and x_i, …, x_{i+l-1} are the word vectors of the words inside the current window. The similarity scores within the current window are converted into a probability distribution by the softmax function and multiplied by l to obtain the scaling weights, which are then multiplied with the original word vectors X_P to scale them. For each window, a new weighted representation Z of the text is generated.
Step 5.5: extract features from the weighted representation Z with a convolutional neural network. For the weighted representation Z_l generated with window size l, the most significant features are extracted with a CNN and max pooling:
v_l = CNN(Z_l)
The feature vectors v_l obtained for the different window sizes l are concatenated to form the final representation vector v_g of the input text;
Step 5.6: input v_g into the fully connected layer and obtain the final model output, the emotion intensity value of the text, with a softmax function:
p̂ = softmax(W·v_g + b)
Step 5.7: optimize the model with the training data and the loss function to obtain the optimal parameters and the optimized emotion intensity model. The mean squared error between the actual and the model-predicted emotion intensity values serves as the loss function:
L′ = (1/N′) Σ_{i=1}^{N′} (p_i - p̂_i)²
where N′ is the number of instances in the single-label emotion training data set of the given emotion, p_i is the labeled emotion intensity value of the i-th sample, and p̂_i is the intensity value predicted by the model h′ for the i-th sample.
Iterative optimization uses a stochastic gradient descent algorithm; the optimization ends after L′ full passes over the training data set, yielding the single-label emotion intensity prediction model.
Step 6: aiming at multi-label social media short text test data, predicting by using a single label emotion classification model of a hierarchical convolution neural network to obtain an optimized multi-label emotion intensity vector;
step 6.1: preprocessing a section of original multi-label social media short text test centralized data in the test set to obtain preprocessed single label data; wherein the data in the multi-label social media short text test set are n ' real number vectors [ e ' representing basic emotion intensity values ' 1 ,e’ 2 …e’ i ,e’ n’ ];
Step 6.1.1: remove the punctuation irrelevant to emotion analysis from the social media short text data, retaining question marks and exclamation marks and deleting all other punctuation, to obtain the punctuation-stripped social media short text data;
Step 6.1.2: if the punctuation-stripped social media short text data contains numerals, replace them with a specified token to obtain the numeral-replaced social media short text data;
Step 6.1.3: for the numeral-replaced social media short text data, place each text into the single-label dataset D′_i of every basic emotion e′_i it contains; a text carrying n′ basic emotion intensity values thus generates n′ pieces of basic-emotion single-label data, where e′_i > 0 means the emotion is present and e′_i = 0 means it is absent;
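Steps 6.1.1 to 6.1.3 reduce, after punctuation and numeral normalization, to routing each multi-label text into per-emotion single-label sets. A minimal sketch, assuming the texts and their intensity vectors arrive as parallel lists (the data layout is an assumption, not part of the patent):

```python
def split_multilabel(texts, intensities):
    """Step 6.1.3: route each text into the single-label set D'_i of every
    basic emotion it expresses (e'_i > 0 means the emotion is present).

    texts       : list of preprocessed short texts
    intensities : list of length-n' vectors of basic emotion intensities
    returns     : dict mapping emotion index -> list of (text, intensity)
    """
    single = {}
    for text, vec in zip(texts, intensities):
        for i, e in enumerate(vec):
            if e > 0:                      # emotion i is present in this text
                single.setdefault(i, []).append((text, e))
    return single
```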
Step 6.2: convert the preprocessed single-label test data D′_i into a word vector matrix X′ and initialize the embedding layer of the neural network model;
Step 6.3: for the word vector matrix X′, extract local features v′_w with the convolution windows of the convolutional neural network (CNN);
Step 6.4: encode the word vector matrix X′ with a bidirectional long short-term memory (BiLSTM) network to obtain an enhanced, context-aware vector representation for each word; represent the sentence S with these enhanced vectors to obtain the matrix X′_c, and on X′_c apply convolution windows and max pooling to extract logical-layer features, giving the vector v′_c;
Step 6.5: fusing local features and logical layer features to form a new vector v 'of text' f Obtaining the convolution of the network and the output vector of the pooling layer;
Step 6.6: feed the new text vector v′_f into the fully connected layer and apply the softmax function to obtain the output of the single-label emotion classification model of the hierarchical convolutional neural network:
ŷ = softmax(W·v′_f + b)
Step 6.7: compute the emotion intensity value of the single-label emotion classification model's output with the emotion intensity model ACNN;
Step 6.8: combine the outputs p̂′_i of the ACNN emotion intensity model for each emotion to obtain the optimized multi-label emotion intensity value vector [p̂′_1, p̂′_2, …, p̂′_n′].
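Steps 6.7 and 6.8 can be sketched as below. Gating each ACNN intensity prediction by the binary output of the classification model is an assumption about how the two models' outputs are combined; the description states only that the per-emotion outputs are assembled into the multi-label vector.

```python
def combine(outputs_cls, outputs_intensity):
    """Assemble the optimized multi-label vector [p̂'_1, ..., p̂'_n'].

    outputs_cls       : per-emotion binary decisions of the HCNN classifier
    outputs_intensity : per-emotion predictions of the ACNN intensity model
    An emotion judged absent by the classifier is given intensity 0
    (an illustrative gating rule, not stated in the patent).
    """
    return [inten if cls == 1 else 0.0
            for cls, inten in zip(outputs_cls, outputs_intensity)]
```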
The core innovation of the invention is a hierarchical convolutional neural network (HCNN) model that can be used for emotion classification and emotion intensity prediction on social media texts. A specific embodiment of the HCNN model is given below.
(1) Training and test data. A Chinese blog dataset of 19,751 sentences is used, with eight basic emotions: anger, anxiety, expectation, aversion, joy, love, sadness, and surprise. Each sentence in the dataset is annotated with the emotions it expresses, and the intensity value of each emotion lies in [0,1], where intensity 0 indicates that the sentence does not express that basic emotion.
(2) Word vector pre-training. Chinese word vectors are trained on the Chinese Wikipedia, with the raw corpus downloaded directly from Wiki Dump. word2vec is used as the training tool, specifically a Skip-gram model with vector dimension 200 and a context window of 5; training uses the Negative Sampling optimization method with a sampling value of 1e-4 and 15 model iterations.
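For illustration, the (center word, context word) training pairs consumed by the Skip-gram model with a symmetric context window can be generated as in this sketch; the actual pre-training is done with the word2vec tool as described above, and the tokenization is assumed.

```python
def skipgram_pairs(tokens, window=5):
    """Generate (center, context) pairs as used by Skip-gram training.

    For each position i, every other token within `window` positions on
    either side becomes a context word (the embodiment uses window = 5).
    """
    pairs = []
    for i, center in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs
```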
(3) HCNN network training. The HCNN is trained with the Adam optimization method. The number of convolution kernels in the network is set to 200. The final fully connected block of the model contains two hidden layers, with 200 and 100 hidden units respectively, whose dropout rates are set to 0.2 and 0.1. Different convolution window sizes and numbers of bidirectional long short-term memory (BiLSTM) hidden units can be chosen for each basic emotion; this tuning is done on a validation set.
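The final fully connected block described above (two hidden layers of 200 and 100 units with dropout rates 0.2 and 0.1, followed by a softmax output) can be sketched as a numpy forward pass. The ReLU activations and inverted-dropout scaling are implementation assumptions not stated in the text.

```python
import numpy as np

def dense_head(v, W1, b1, W2, b2, W3, b3, rng=None, train=False):
    """Two hidden layers with dropout 0.2 / 0.1, then a softmax output.

    Weight shapes are the caller's choice (the embodiment uses 200 and
    100 hidden units); dropout is applied only during training.
    """
    def dropout(x, rate):
        if not train:
            return x
        mask = rng.random(x.shape) >= rate
        return x * mask / (1.0 - rate)        # inverted dropout scaling

    h1 = dropout(np.maximum(0, v @ W1 + b1), 0.2)
    h2 = dropout(np.maximum(0, h1 @ W2 + b2), 0.1)
    z = h2 @ W3 + b3
    e = np.exp(z - z.max())
    return e / e.sum()                        # softmax probabilities
```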
Fig. 3 shows experimental comparison 1 against a conventional CNN model, where DCNN and ACNN are HCNN variants that fuse multi-layer features by vector concatenation and vector addition respectively; RL is ranking loss, HL is Hamming loss, MSE is mean squared error, and SA is subset accuracy; an upward arrow marks metrics where larger is better, and a downward arrow marks metrics where smaller is better. Fig. 4 shows experimental comparison 2 against the conventional CNN model, with DCNN and ACNN as above; OE is one-error, MaF is the macro-averaged F value, MiF is the micro-averaged F value, and AP is average precision; the arrows are read the same way.

Claims (3)

1. A multi-label emotion intensity prediction method based on a hierarchical convolutional neural network is characterized in that the multi-label emotion intensity prediction method based on the hierarchical convolutional neural network comprises the following specific processes:
step 1: dividing an original multi-label social media short text into a training set and a testing set;
Step 2: preprocess a piece of original multi-label social media short text data in the training set to obtain preprocessed single-label data, where each item in the original multi-label social media short text training set carries an n-dimensional real vector of basic emotion intensity values [e_1, e_2, …, e_i, …, e_n];
Step 3: obtain multiple pieces of original multi-label social media short text data to form the training set, and process each piece with the method of step 2 to obtain the basic-emotion single-label data D_i of the training set;
And 4, step 4: constructing a single-label emotion classification model based on a hierarchical convolutional neural network;
Step 4.1: convert the basic-emotion single-label training data D_i into a word vector matrix X and initialize the embedding layer of the neural network model;
Step 4.2: for the word vector matrix X, extract local features v_w with the convolution windows and max pooling operation of the convolutional neural network:
v_w = CNN(X)
Step 4.3: encode the word vector matrix X with a bidirectional long short-term memory (BiLSTM) network to obtain an enhanced, context-aware vector representation for each word; represent the sentence S with these enhanced vectors to obtain the matrix X_c, and on the basis of X_c use a convolutional neural network to extract logical-layer features, giving the vector v_c:
X_c = BiLSTM(X)
v_c = CNN(X_c)
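The convolution-and-max-pooling extractor used in steps 4.2 and 4.3 can be sketched in numpy as follows; this shows a single filter bank with ReLU, and the BiLSTM encoder that produces X_c is omitted for brevity.

```python
import numpy as np

def conv_maxpool(X, filters, b):
    """v = CNN(X): slide each filter over every window of l consecutive
    word vectors, apply ReLU, then max-pool over window positions.

    X       : (n, d) word-vector matrix of a sentence
    filters : (k, l, d) k convolution kernels with window size l
    b       : (k,) biases
    returns : (k,) feature vector (v_w, or v_c when X is X_c)
    """
    n, d = X.shape
    k, l, _ = filters.shape
    feats = np.empty((k, n - l + 1))
    for i in range(n - l + 1):
        window = X[i:i + l]                                   # (l, d)
        feats[:, i] = np.tensordot(filters, window,
                                   axes=([1, 2], [0, 1])) + b
    return np.maximum(0, feats).max(axis=1)                   # ReLU + max pool
```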
Step 4.4: fuse the local features and the logical-layer features into a new text vector v_f:
v_f = v_w ⊕ v_c
where the symbol ⊕ denotes a vector concatenation operation or a vector addition operation;
Step 4.5: feed the new text vector v_f into the fully connected layer to obtain the single-label emotion classification model of the hierarchical convolutional neural network;
Step 4.6: feed the new text vector v_f into the fully connected layer and apply the softmax function to obtain the output of the single-label emotion classification model of the hierarchical convolutional neural network:
ŷ = softmax(W·v_f + b)
The single-label emotion classification model of the hierarchical convolutional neural network uses the following cross-entropy loss function:
L = −(1/N) Σ_{i=1}^{N} [y_i log ŷ_i + (1 − y_i) log(1 − ŷ_i)]
where N is the number of training examples, y_i is a binary variable indicating whether the i-th sample belongs to the given class, and ŷ_i is the probability with which the model predicts that the i-th sample belongs to that class;
The single-label emotion classification model of the hierarchical convolutional neural network is optimized against the cross-entropy loss function by iterative gradient descent; the optimization process ends after L full passes over the training set, and the resulting loss-optimized model is the final single-label emotion classification model of the hierarchical convolutional neural network;
and 5: constructing an emotion intensity value model based on the attention convolution neural network;
Step 5.1: for a single-label dataset D_i, filter out the texts with e_i = 0; then, on the e_i > 0 data, train an emotion intensity value prediction model with the following steps;
Step 5.2: convert each text S in the single-label dataset D_i into a word vector matrix X_P used to initialize the embedding layer of the neural network model;
Step 5.3: encode the word vector matrix X_P with a long short-term memory (LSTM) model to obtain a task-related representation vector v_s of the text S, where:
v_s = LSTM(X_P)
Step 5.4: from the sentence representation vector v_s and the original word vector matrix X_P, compute the relevance weights of the word vectors through an attention mechanism; the attention vector is computed as:
v_a = X_P·W·v_s
where v_a is the attention vector and W is the weight matrix;
The word vectors are then weighted by the attention vector, i.e., the word vectors in each subsequent window are scaled, according to:
α_i = l·softmax(v_a[i:i+l]), i ∈ {0, 1, …, n − l}
Z_i = α_i ⊙ X_P[i:i+l]
where l is the window size and X_P[i:i+l] denotes the word vectors of the l consecutive words starting at the i-th position of the sentence; the similarity scores in the current window are thus converted into a probability distribution with the softmax function and multiplied by l to obtain the scaling weights, which are then multiplied with the original word vectors X_P to scale them; a new weighted representation Z of the text is generated for each window;
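The window-wise attention scaling of step 5.4 can be sketched as follows: each window's attention scores are passed through softmax, multiplied by l, and applied to the corresponding word vectors. Applying the weights position-wise within each window is an assumption consistent with the scaling description.

```python
import numpy as np

def scale_by_attention(X_P, v_a, l):
    """For each window of size l, compute α_i = l·softmax(v_a[i:i+l])
    and scale the window's word vectors, producing one weighted
    representation Z_i per window position i.

    X_P : (n, d) original word-vector matrix
    v_a : (n,)  attention scores of the n words
    """
    n = len(v_a)
    Z = []
    for i in range(n - l + 1):
        s = v_a[i:i + l]
        e = np.exp(s - s.max())                  # numerically stable softmax
        alpha = l * e / e.sum()                  # α_i = l·softmax(v_a[i:i+l])
        Z.append(alpha[:, None] * X_P[i:i + l])  # scale each word vector
    return Z
```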
Step 5.5: extract features from the weighted representation with a convolutional neural network; for the weighted representation Z_l generated with window size l, the most significant features are extracted with the CNN network and the max pooling method:
v_l = CNN(Z_l)
The features v_l obtained with different window sizes l are concatenated to form the final characterization vector v_g of the input text;
Step 5.6: feed v_g into the fully connected layer and apply the softmax function to obtain the final output of the model, namely the emotion intensity value of the text:
p̂ = softmax(W·v_g + b)
Step 5.7: optimize the model with the training data and the loss function to obtain the optimal parameters and the optimized emotion intensity model:
The mean squared error between the annotated emotion intensity values and the model-predicted emotion intensity values is used as the loss function of the emotion intensity model:
MSE = (1/N′) Σ_{i=1}^{N′} (p_i − p̂_i)²
where N′ is the number of instances in the single-label emotion training set for the given emotion, p_i is the annotated emotion intensity value of the i-th sample, and p̂_i is the emotion intensity value predicted for the i-th sample by the model h′;
Iterative optimization is carried out with the stochastic gradient descent algorithm; the optimization process ends after L′ full passes over the training set, yielding the single-label emotion intensity prediction model;
Step 6: for the multi-label social media short text test data, predict with the single-label emotion classification model of the hierarchical convolutional neural network to obtain the optimized multi-label emotion intensity vector.
2. The method for predicting multi-label emotional intensity based on the hierarchical convolutional neural network as claimed in claim 1, wherein the step 2 specifically comprises:
Step 2.1: remove the punctuation irrelevant to emotion analysis from the social media short text data, retaining question marks and exclamation marks and deleting all other punctuation, to obtain the punctuation-stripped social media short text data;
Step 2.2: if the punctuation-stripped social media short text data contains numerals, replace them with a specified token to obtain the numeral-replaced social media short text data;
Step 2.3: let the largest emotion class contain num texts; when the emotion class distribution of the numeral-replaced social media short text data is unbalanced, i.e., when some emotion class contains fewer than (3 × num)/4 texts, resample the social media short text data of that class until all emotion classes hold similar amounts of short text data, 'similar' being defined as: the smallest class contains no less than 0.75 times the number of texts of the largest class; if every other emotion class already contains at least (3 × num)/4 texts, skip resampling and proceed to step 2.4;
Step 2.4: for the resampled social media short text data, place each text into the single-label dataset D_i of every basic emotion e_i it contains; a piece of resampled social media short text data carrying n basic emotion intensity values thus generates n pieces of basic-emotion single-label data, where e_i > 0 means the emotion is present and e_i = 0 means it is absent.
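The resampling rule of step 2.3 can be sketched as below; oversampling with replacement is an assumed resampling strategy, since the claim does not specify how the resampling is performed.

```python
import random

def rebalance(datasets, seed=0):
    """Step 2.3 rule: if the largest emotion class holds num texts,
    oversample any class with fewer than (3*num)/4 texts until every
    class holds at least 0.75x the size of the largest one.

    datasets : dict mapping emotion label -> list of texts
    """
    rng = random.Random(seed)
    num = max(len(v) for v in datasets.values())
    target = (3 * num) // 4
    out = {}
    for emo, texts in datasets.items():
        texts = list(texts)
        while len(texts) < target:       # oversample with replacement
            texts.append(rng.choice(texts))
        out[emo] = texts
    return out
```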
3. The method for predicting multi-label emotional intensity based on the hierarchical convolutional neural network of claim 1, wherein the step 6 specifically comprises:
Step 6.1: preprocess a piece of original data from the multi-label social media short text test set to obtain preprocessed single-label data, where each item in the test set carries an n′-dimensional real vector of basic emotion intensity values [e′_1, e′_2, …, e′_i, …, e′_n′];
Step 6.1.1: remove the punctuation irrelevant to emotion analysis from the social media short text data, retaining question marks and exclamation marks and deleting all other punctuation, to obtain the punctuation-stripped social media short text data;
Step 6.1.2: if the punctuation-stripped social media short text data contains numerals, replace them with a specified token to obtain the numeral-replaced social media short text data;
Step 6.1.3: for the numeral-replaced social media short text data, place each text into the single-label dataset D′_i of every basic emotion e′_i it contains; a text carrying n′ basic emotion intensity values thus generates n′ pieces of basic-emotion single-label data, where e′_i > 0 means the emotion is present and e′_i = 0 means it is absent;
Step 6.2: convert the preprocessed single-label test data D′_i into a word vector matrix X′ and initialize the embedding layer of the neural network model;
Step 6.3: for the word vector matrix X′, extract local features v′_w with the convolution windows of the convolutional neural network;
Step 6.4: encode the word vector matrix X′ with a bidirectional long short-term memory (BiLSTM) network to obtain an enhanced, context-aware vector representation for each word; represent the sentence S with these enhanced vectors to obtain the matrix X′_c, and on X′_c apply convolution windows and max pooling to extract logical-layer features, giving the vector v′_c;
Step 6.5: fusing local features and logical layer features to form a new vector v 'of text' f Obtaining the convolution of the network and the output vector of the pooling layer;
Step 6.6: feed the new text vector v′_f into the fully connected layer and apply the softmax function to obtain the output of the single-label emotion classification model of the hierarchical convolutional neural network:
ŷ = softmax(W·v′_f + b)
Step 6.7: compute the emotion intensity value of the single-label emotion classification model's output with the emotion intensity model ACNN;
Step 6.8: combine the outputs p̂′_i of the ACNN emotion intensity model for each emotion to obtain the optimized multi-label emotion intensity value vector [p̂′_1, p̂′_2, …, p̂′_n′].
CN201910751989.2A 2019-08-15 2019-08-15 Multi-label emotion intensity prediction method based on hierarchical convolutional neural network Active CN110472245B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910751989.2A CN110472245B (en) 2019-08-15 2019-08-15 Multi-label emotion intensity prediction method based on hierarchical convolutional neural network


Publications (2)

Publication Number Publication Date
CN110472245A CN110472245A (en) 2019-11-19
CN110472245B true CN110472245B (en) 2022-11-29

Family

ID=68511433


Country Status (1)

Country Link
CN (1) CN110472245B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110957039A (en) * 2019-12-16 2020-04-03 河南科技学院 Campus psychological coaching method and device based on deep learning
CN111985532B (en) * 2020-07-10 2021-11-09 西安理工大学 Scene-level context-aware emotion recognition deep network method
CN111862068B (en) * 2020-07-28 2022-09-13 福州大学 Three-model comprehensive decision emotion prediction method fusing data missing data and images
CN116306686B (en) * 2023-05-22 2023-08-29 中国科学技术大学 Method for generating multi-emotion-guided co-emotion dialogue

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170308790A1 (en) * 2016-04-21 2017-10-26 International Business Machines Corporation Text classification by ranking with convolutional neural networks
CN109299253A (en) * 2018-09-03 2019-02-01 华南理工大学 A kind of social text Emotion identification model construction method of Chinese based on depth integration neural network
CN110097894B (en) * 2019-05-21 2021-06-11 焦点科技股份有限公司 End-to-end speech emotion recognition method and system



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant