CN113095069A - Data-driven neural network architecture system based on emotion analysis - Google Patents


Info

Publication number
CN113095069A
Authority
CN
China
Prior art keywords
layer module
data
neural network
network architecture
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110255563.5A
Other languages
Chinese (zh)
Inventor
王小华
潘晓光
焦璐璐
张娜
宋晓晨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanxi Sanyouhe Smart Information Technology Co Ltd
Original Assignee
Shanxi Sanyouhe Smart Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanxi Sanyouhe Smart Information Technology Co Ltd filed Critical Shanxi Sanyouhe Smart Information Technology Co Ltd
Priority to CN202110255563.5A priority Critical patent/CN113095069A/en
Publication of CN113095069A publication Critical patent/CN113095069A/en
Pending legal-status Critical Current


Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 40/00 - Handling natural language data
    • G06F 40/20 - Natural language analysis
    • G06F 40/237 - Lexical tools
    • G06F 40/247 - Thesauruses; Synonyms
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 40/00 - Handling natural language data
    • G06F 40/20 - Natural language analysis
    • G06F 40/205 - Parsing
    • G06F 40/216 - Parsing using statistical methods
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 - Computing arrangements based on biological models
    • G06N 3/02 - Neural networks
    • G06N 3/04 - Architecture, e.g. interconnection topology
    • G06N 3/045 - Combinations of networks
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 - Computing arrangements based on biological models
    • G06N 3/02 - Neural networks
    • G06N 3/08 - Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the field of applying convolutional neural networks to emotion analysis, and in particular to a data-driven neural network architecture system based on emotion analysis, which comprises the following modules: an embedding layer module, a feature convolution layer module, a max pooling layer module, and a compact layer module. The embedding layer module provides the input to the whole system; the feature convolution layer module performs feature extraction; the max pooling layer module retains local information of word combinations in the text; and the compact layer module determines which class the input belongs to according to the weight held at each feature node.

Description

Data-driven neural network architecture system based on emotion analysis
Technical Field
The invention relates to the field of applying convolutional neural networks to emotion analysis, and in particular to a data-driven neural network architecture system based on emotion analysis.
Background
In applications of convolutional neural networks to emotion analysis, good results are achieved with a small amount of computation, but a neural network model usually needs a large amount of data and a large training sample set, and when different types of networks are used it is difficult to find the optimal hyper-parameter settings or design choices.
Problems or disadvantages of the prior art: when building convolutional neural network architectures, it is difficult to find enough data to feed these networks, optimize their parameters, and make the correct design choices.
Disclosure of Invention
To address the difficulty of finding enough data, the invention provides a data-driven neural network architecture system based on emotion analysis.
In order to solve the above technical problems, the invention adopts the following technical scheme:
A data-driven neural network architecture system based on emotion analysis comprises the following modules: an embedding layer module, a feature convolution layer module, a max pooling layer module, and a compact layer module, all of which are communicatively connected in parallel. The embedding layer module provides the input to the whole system; the feature convolution layer module performs feature extraction; the max pooling layer module retains local information of word combinations in the text; and the compact layer module determines which class the input belongs to according to the weight held at each feature node.
In the embedding layer module, a word-embedding representation is created or looked up for each text document; pre-trained word vectors created from a large text corpus are used and then adjusted by the neural network during training.
In the feature convolution layer module, different layers are used in parallel to extract word (1-gram), 2-gram, 3-gram, and longer features; a fixed number of 80 filters is applied in each layer to generate a feature map.
In the max pooling layer module, the data is sub-sampled, the most relevant and effective features are selected for the classification stage, and local information of word combinations in the text is retained.
In the max pooling layer module, experiments are performed on different data sets with different values of the region length; the length of the generated feature map is given by the following formula:
[formula image omitted in source: feature-map length as a function of the pooling region size]
The feature maps are concatenated and pushed to a feed-forward layer that works as a classifier. An L2 regularization coefficient of 0.1 and a dropout rate of 0.35 are used to mitigate overfitting, and binary cross-entropy and the Adam optimizer are then used to compute the loss and optimize training.
In the max pooling layer module, a deeper network structure (D > 2) is used with a larger training data set. A series of W parallel convolutions with increasing filter sizes is followed by a max pooling stack over regions of size r, which reduces the data by selecting the most relevant features for classification. The same convolution and max-pooling stack is repeated multiple times to form a network of D stacks in total; the output of the last max pooling layer is the final set of features to be used.
In the compact layer module, the pooling region size is adjusted to generate a feature map with a length of 6 to 18 units, which yields the optimal classification score.
Advantageous effects: through a series of experiments on neural structures of various architectures, the optimal architecture design choices at the current stage are determined, producing a high-performance emotion analysis model. The associated hyper-parameter settings and design choices are also determined. Parallel convolutions with a filter length of 3 are usually sufficient to capture relevant text features. Furthermore, the max pooling region size should be adapted to the length of the text document to produce an optimal feature map. When the pooling region is resized to generate a feature map of length 6 to 18, the best classification score can be obtained.
Drawings
FIG. 1 is a system flow diagram of the present invention;
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
A data-driven neural network architecture system based on emotion analysis, as shown in FIG. 1, comprises the following modules: an embedding layer module, a feature convolution layer module, a max pooling layer module, and a compact layer module, all of which are communicatively connected in parallel. The embedding layer module provides the input to the whole system; the feature convolution layer module performs feature extraction; the max pooling layer module retains local information of word combinations in the text; and the compact layer module determines which class the input belongs to according to the weight held at each feature node.
Further, the embedding layer module is a non-trainable embedding layer that requires creating or looking up a word-embedding representation for each text document. One possible approach is to initialize the vector for each word with random values and then adjust them during training using the neural network; this requires a large set of training documents. Where only a small training data set is available, higher performance can be obtained with pre-trained word vectors created from a large text corpus; the pre-trained vector for each word is then taken directly from that source.
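The two initialization strategies described above (random vectors tuned later during training versus a pre-trained lookup) can be sketched in plain NumPy. This is an illustrative sketch, not the patent's implementation; the `embed_document` helper and the sample vectors are hypothetical:

```python
import numpy as np

def embed_document(tokens, pretrained=None, dim=100, rng=None):
    """Return an (n_tokens, dim) matrix: pre-trained vectors where
    available, small random vectors (to be tuned in training) otherwise."""
    rng = rng if rng is not None else np.random.default_rng(0)
    pretrained = pretrained or {}
    return np.stack([
        pretrained.get(tok, rng.standard_normal(dim) * 0.01)
        for tok in tokens
    ])

# hypothetical pre-trained vectors; "plot" falls back to random init
vectors = {"good": np.ones(100), "movie": np.zeros(100)}
doc = embed_document(["good", "movie", "plot"], pretrained=vectors)
print(doc.shape)  # (3, 100)
```

A real system would load the pre-trained table from a corpus-derived file rather than a literal dict, but the lookup-with-fallback shape is the same.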
Further, in the feature convolution layer module, W filter layers with filter sizes k = 1, 2, 3, and so on are used in parallel to extract word (1-gram), 2-gram, 3-gram, and longer features. A fixed number of 80 filters is applied in each layer to generate a feature map. An important parameter of the network is its depth D, the total number of layer stacks.
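A minimal sketch of this parallel feature-extraction step, assuming 'valid' 1-D convolutions over a sequence of word embeddings (an assumption, since the text does not state the padding mode). The shapes (20 words, 50-dimensional embeddings, 80 filters per layer) follow the text; the `conv1d_valid` helper is hypothetical:

```python
import numpy as np

def conv1d_valid(x, filters):
    """'Valid' 1-D convolution over word vectors.
    x: (n, dim) embedded document; filters: (num_filters, k, dim).
    Returns an (n - k + 1, num_filters) feature map."""
    num_filters, k, dim = filters.shape
    n = x.shape[0]
    out = np.empty((n - k + 1, num_filters))
    for i in range(n - k + 1):
        window = x[i:i + k]  # k consecutive word vectors
        out[i] = np.tensordot(filters, window, axes=([1, 2], [0, 1]))
    return out

rng = np.random.default_rng(0)
x = rng.standard_normal((20, 50))  # 20 words, 50-dim embeddings
# W = 3 parallel layers with filter sizes k = 1, 2, 3 and 80 filters each
maps = [conv1d_valid(x, rng.standard_normal((80, k, 50))) for k in (1, 2, 3)]
print([m.shape for m in maps])  # [(20, 80), (19, 80), (18, 80)]
```

Each parallel layer shortens the sequence by k - 1 positions, which is why the downstream pooling region must be matched to document length.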
Further, the role of the max pooling layer module is to sub-sample the data and select the most relevant and effective features for the classification stage; the max pooling layer is known to retain local information of word combinations in the text.
Further, in the max pooling layer module, experiments are performed on different data sets with different values of the region length r; the length of the generated feature map is given by the following formula:
[formula image omitted in source: feature-map length as a function of the region length r]
where n is the number of words in the training document. The feature maps are concatenated and pushed to a feed-forward layer that works as a classifier; the latter consists of 80 nodes. Overfitting is mitigated with an L2 regularization coefficient of 0.1 and a dropout rate of 0.35; the loss is computed and training optimized with binary cross-entropy and the Adam optimizer; and a 70/10/20% data split is used for training, development, and testing, respectively.
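The classifier settings above (binary cross-entropy loss, an L2 coefficient of 0.1, and a 70/10/20% split; the source's "binary cross copy" is read here as binary cross-entropy) can be illustrated with a small NumPy sketch. The helper names are hypothetical and this is not the patent's training code:

```python
import numpy as np

def binary_cross_entropy(y_true, y_pred, eps=1e-12):
    """Mean binary cross-entropy loss over a batch of predictions."""
    y_pred = np.clip(y_pred, eps, 1 - eps)
    return float(np.mean(-(y_true * np.log(y_pred)
                           + (1.0 - y_true) * np.log(1.0 - y_pred))))

def l2_penalty(weights, lam=0.1):
    """L2 regularization term, with the coefficient 0.1 from the text."""
    return lam * float(np.sum(weights ** 2))

def split_70_10_20(n_samples, rng):
    """Shuffle indices into the 70/10/20% train/dev/test split."""
    idx = rng.permutation(n_samples)
    a, b = int(0.7 * n_samples), int(0.8 * n_samples)
    return idx[:a], idx[a:b], idx[b:]

rng = np.random.default_rng(0)
train, dev, test = split_70_10_20(100, rng)
print(len(train), len(dev), len(test))     # 70 10 20

loss = binary_cross_entropy(np.array([1.0, 0.0]), np.array([0.9, 0.2]))
penalty = l2_penalty(np.array([1.0, 2.0]))  # 0.1 * (1 + 4) = 0.5
```

In a full implementation the penalty would be added to the loss before the Adam update; dropout with rate 0.35 would zero 35% of the 80-node layer's activations during training only.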
Further, in the max pooling layer module, a deeper network structure (D > 2) is used with a larger training data set. A series of W parallel convolutions with increasing filter sizes acts as a word and phrase feature extractor, and the width W can vary depending on the task and the type of data being analyzed. This is followed by a max pooling stack over regions of size r; as with W, the value of r can be adjusted to match the number of words in each document of the training set. The same convolution and max-pooling stack is repeated multiple times to form a network of D stacks in total; the value of D is flexible and should be selected according to the size of the training data set. The output of the last max pooling layer is the final set of features to be used.
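How the sequence length shrinks through D stacks of convolution and pooling can be traced with a small helper. This is an illustrative calculation assuming 'valid' convolutions and non-overlapping pooling regions, neither of which is stated explicitly in the text:

```python
def stacked_feature_length(n_words, filter_size, region, depth):
    """Trace sequence length through `depth` stacks of a 'valid'
    convolution (n -> n - k + 1) followed by max pooling over
    non-overlapping regions of size `region` (m -> m // region)."""
    length = n_words
    for _ in range(depth):
        length = length - filter_size + 1   # convolution step
        length = length // region           # max pooling step
    return length

# a 200-word document, filter size 3, region r = 4, depth D = 2
print(stacked_feature_length(200, 3, 4, 2))  # 11
```

This makes the text's advice concrete: larger r or larger D shrinks the final feature set faster, so both must be chosen against the document length of the training set.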
Furthermore, in the compact layer module, the length of the document and the size of the feature map follow a certain regularity: adjusting the pooling region size to generate a feature map with a length of 6 to 18 units yields the optimal classification score. In terms of data-set size, applying deeper networks to larger data sets yields the best results.
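One simple way to realize the rule "adjust the region size so the feature map is 6 to 18 units long" is to derive r from the document length. The `region_size_for_target` helper below is an assumption for illustration, not a formula given in the text:

```python
import math

def region_size_for_target(n_words, target_len):
    """Pick a pooling region size r so that a feature map of roughly
    n_words units is reduced to about `target_len` units."""
    return math.ceil(n_words / target_len)

# aim for the middle of the 6..18 range reported as optimal
for n in (60, 200, 900):
    r = region_size_for_target(n, 12)
    print(n, r, math.ceil(n / r))   # final length stays within 6..18
```

The ceiling keeps the resulting map at or below the target length, so short and long documents alike land inside the reported optimal band.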
Although only the preferred embodiments of the present invention have been described in detail, the present invention is not limited to the above embodiments, and various changes can be made without departing from the spirit of the present invention within the knowledge of those skilled in the art, and all changes are encompassed in the scope of the present invention.

Claims (7)

1. A data-driven neural network architecture system based on emotion analysis, characterized by comprising the following modules: an embedding layer module, a feature convolution layer module, a max pooling layer module, and a compact layer module, all of which are communicatively connected in parallel; the embedding layer module provides the input to the whole system; the feature convolution layer module performs feature extraction; the max pooling layer module retains local information of word combinations in the text; and the compact layer module determines which class the input belongs to according to the weight held at each feature node.
2. The data-driven neural network architecture system based on emotion analysis according to claim 1, wherein: in the embedding layer module, a word-embedding representation is created or looked up for each text document; pre-trained word vectors created from a large text corpus are used and then adjusted by the neural network during training.
3. The data-driven neural network architecture system based on emotion analysis according to claim 2, wherein: in the feature convolution layer module, different layers are used in parallel to extract word (1-gram), 2-gram, 3-gram, and longer features; a fixed number of 80 filters is applied in each layer to generate a feature map.
4. The data-driven neural network architecture system based on emotion analysis according to claim 3, wherein: in the max pooling layer module, the data is sub-sampled, the most relevant and effective features are selected for the classification stage, and local information of word combinations in the text is retained.
5. The data-driven neural network architecture system based on emotion analysis according to claim 4, wherein: in the max pooling layer module, the feature map generated by performing experiments on different data sets with different values has a length given by the following formula:
[formula image omitted in source]
The feature maps are concatenated and pushed to a feed-forward layer that works as a classifier; an L2 regularization coefficient of 0.1 and a dropout rate of 0.35 are used to mitigate overfitting, and binary cross-entropy and the Adam optimizer are then used to compute the loss and optimize training.
6. The data-driven neural network architecture system based on emotion analysis according to claim 5, wherein: in the max pooling layer module, a deeper network structure (D > 2) is used with a larger training data set; a series of W parallel convolutions with increasing filter sizes is followed by a max pooling stack over regions of size r, which reduces the data by selecting the most relevant features for classification; the same convolution and max-pooling stack is repeated multiple times to form a network of D stacks in total, and the output of the last max pooling layer is the final set of features to be used.
7. The data-driven neural network architecture system based on emotion analysis according to claim 6, wherein: in the compact layer module, the pooling region size is adjusted to generate a feature map with a length of 6 to 18 units, obtaining the optimal classification score.
CN202110255563.5A 2021-03-09 2021-03-09 Data-driven neural network architecture system based on emotion analysis Pending CN113095069A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110255563.5A CN113095069A (en) 2021-03-09 2021-03-09 Data-driven neural network architecture system based on emotion analysis


Publications (1)

Publication Number Publication Date
CN113095069A true CN113095069A (en) 2021-07-09

Family

ID=76667029

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110255563.5A Pending CN113095069A (en) 2021-03-09 2021-03-09 Data-driven neural network architecture system based on emotion analysis

Country Status (1)

Country Link
CN (1) CN113095069A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE202023102803U1 (en) 2023-05-22 2023-07-17 Pradeep Bedi System for emotion detection and mood analysis through machine learning



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination