CN114461791A - Social text sentiment analysis system based on deep quantum neural network - Google Patents

Social text sentiment analysis system based on deep quantum neural network

Info

Publication number
CN114461791A
CN114461791A · application CN202111350644.XA
Authority
CN
China
Prior art keywords
quantum
text
neural network
time
vqc
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111350644.XA
Other languages
Chinese (zh)
Inventor
刘宇鹏
冯贤杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin University of Science and Technology
Original Assignee
Harbin University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin University of Science and Technology
Priority to CN202111350644.XA
Publication of CN114461791A
Legal status: Pending

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 - Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 - Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 - Clustering; Classification
    • G06F16/353 - Clustering; Classification into predefined classes
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N10/00 - Quantum computing, i.e. information processing based on quantum-mechanical phenomena
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/04 - Architecture, e.g. interconnection topology
    • G06N3/044 - Recurrent networks, e.g. Hopfield networks
    • G06N3/045 - Combinations of networks
    • G06N3/048 - Activation functions
    • G06N3/08 - Learning methods

Abstract

A social text sentiment analysis system based on a deep quantum neural network, belonging to the technical field at the intersection of quantum computing and natural language processing. The invention addresses the large storage overhead, high time complexity and low prediction accuracy of existing deep learning methods. It provides a tunable-parameter quantum deep learning method for social text sentiment analysis: the quantum computation used only needs to store inputs and outputs rather than intermediate results, which saves the storage space that binary coding would require and thus resolves its high storage cost, while quantum parallel computation avoids the step-by-step classical computation of each deep neural network layer, reducing the time complexity of the deep neural network. By adopting staged training, a better model can be obtained, with higher prediction accuracy and stronger robustness than prior methods. The invention can be applied to analyzing the sentiment of social texts.

Description

Social text sentiment analysis system based on deep quantum neural network
Technical Field
The invention belongs to the technical field at the intersection of quantum computing and natural language processing, and particularly relates to a social text sentiment analysis system based on a deep quantum neural network.
Background
Sentiment analysis of social media topics aims to explore people's opinions and attitudes about a given topic or event on a social network. The spread of the internet has made information channels ever more numerous and convenient. Sina Weibo, a major social network platform in China, already has more than 224 million daily active users, who post an average of about two hundred million comments per day. This mass of social media data contains rich real-time information: people push life updates and opinions to the platform and comment on trending events. Such subjectively colored data greatly facilitates research on sentiment analysis.
Social text data is real-time and time-sensitive; capturing this timeliness and analyzing the latest topic data lets the value of the data be exploited more fully. At present, most research on social text sentiment analysis focuses on using deep learning to improve the performance of the sentiment analyzer. Deep learning methods, however, rest on binary coding, in which every code is 0 or 1; this incurs heavy time consumption, storage overhead and high time complexity, and the problem becomes especially pronounced when the data volume is large and the neural network has many parameters. Moreover, existing deep learning methods achieve only low accuracy in predicting social text sentiment.
Disclosure of Invention
The invention aims to solve the problems of high storage cost, high time complexity and low prediction accuracy of existing deep learning methods, and provides a social text sentiment analysis system based on a deep quantum neural network.
The technical solution adopted by the invention to solve the above technical problem is as follows:
a social text sentiment analysis system based on a deep quantum neural network comprises a data acquisition module, a data processing module, a text sequence information acquisition model and a sentiment analysis model, wherein:
the data acquisition module is used for acquiring a sentiment classification data set comprising comment texts and the labels corresponding to the comment texts;
the data processing module is used for segmenting the comment texts in the acquired sentiment classification data set to obtain comment text word sequences, and then mapping the comment text word sequences into a word vector space to obtain vectorized text sequence information;
the text sequence information acquisition model is trained using the comment text word sequences produced by the data processing module and the corresponding vectorized text sequence information, and the output of the trained text sequence information acquisition model is vectorized text sequence information;
the sentiment analysis model comprises a quantum long short-term memory (QLSTM) neural network and a network fusion module, wherein the QLSTM neural network comprises an Input layer, a QLSTM layer and a fully connected layer, and the QLSTM layer comprises t QLSTM units;
the QLSTM neural network is pre-trained using the vectorized text sequence information and the labels corresponding to the data in the sentiment classification data set; after n pre-trained QLSTM neural networks are obtained, the network fusion module fuses them, and the output of the fused networks serves as the output of the sentiment analysis model;
the comment text to be analyzed is segmented to obtain the word sequence to be analyzed, which is input into the trained text sequence information acquisition model to obtain the vectorized text sequence information corresponding to the comment text to be analyzed;
the vectorized text sequence information corresponding to the comment text to be analyzed is input into the sentiment analysis model, which outputs the sentiment analysis result for that text.
Further, the data acquired by the data acquisition module also comprises a target data set whose data format is the same as that of the sentiment classification data set, i.e. the target data set comprises comment texts and the labels corresponding to them;
the pre-trained sentiment analysis model is fine-tuned using the vectorized text sequence information and labels corresponding to the comment texts in the target data set, yielding the final sentiment analysis model.
Further, the sentiment classification data set is obtained by crawling Sina Weibo, using the Scrapy distributed crawler framework;
the target data set is a data set of Ctrip hotel reviews organized and published in 2018.
Further, the QLSTM unit is defined as:
v_t = h_{t-1} * x_t
f_t = Activate(VQC1(v_t))
i_t = Activate(VQC2(v_t))
\tilde{c}_t = Activate(VQC3(v_t))
c_t = f_t * c_{t-1} + i_t * \tilde{c}_t
o_t = Activate(VQC4(v_t))
h_t = VQC5(o_t * Activate(c_t))
y_t = VQC6(o_t * Activate(c_t))
where x_t is the input vector of the QLSTM unit at time t, h_{t-1} is the hidden state at time t-1, v_t is the input vector of the VQC cell bodies at time t, Activate(·) is an activation function, f_t is the forget gate vector at time t, i_t is the input gate vector at time t, \tilde{c}_t is the temporary cell state at time t, c_{t-1} is the cell state at time t-1, c_t is the cell state at time t, o_t is the output gate vector at time t, h_t is the hidden state at time t, y_t is the output state at time t, and VQC1, VQC2, VQC3, VQC4, VQC5, VQC6 denote the 1st through 6th VQC cell bodies respectively.
Further, the activation function Activate is a Sigmoid activation function or a hyperbolic tangent activation function.
Further, the working process of the QLSTM neural network is as follows:
the output of the Input layer serves as the input of the QLSTM layer. Feature extraction in the QLSTM layer comprises two processes: forward-propagation feature extraction among the QLSTM units and backward-propagation feature extraction among the QLSTM units. The outputs of the last QLSTM unit in the two directions are concatenated, and the concatenation result is the final output of the QLSTM layer, which serves as the input of the fully connected layer. The features are mapped into the [0, 1] interval by a Sigmoid activation function, and the mapping result is taken as the probability of the social text's sentiment polarity: if the probability P >= 0.5, the sentiment expressed by the social text is positive; otherwise it is negative.
Further, when the QLSTM neural network is pre-trained, the loss function used is the cross-entropy loss, and the network parameters are updated with the RMSprop algorithm.
Further, the method adopted by the network fusion module is the bagging algorithm.
Further, the VQC (variational quantum circuit) cell body structurally comprises a quantum encoding layer, a variational circuit layer and a quantum measurement layer;
the quantum operations of the variational circuit layer comprise multi-qubit Toffoli gates and single-qubit rotation gates, the multi-qubit Toffoli gate replacing the CNOT quantum gate and entangling three quantum wires.
Further, the text sequence information acquisition model is a BERT pre-trained model.
The invention has the beneficial effects that:
the invention provides an adjustable parameter sub-deep learning method for social text sentiment analysis, the used quantum computation only needs to store input and output without storing intermediate results, so that the storage space of binary codes is saved, the problem of high storage cost of the binary codes is solved, and the quantum parallel computation does not need to gradually compute the classical computation of each deep neural network, so that the time complexity of the deep neural networks is reduced. The method can obtain a better model by adopting staged training, and has higher prediction precision and stronger robustness compared with the prior method.
Meanwhile, the invention utilizes quantum entanglement and quantum probability to calculate, enlarges the expression space, has high training operation speed and is easy to realize.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a diagram of the initial sentiment analysis model architecture according to an embodiment;
FIG. 3 is a schematic diagram of the cell body structure of the QLSTM neural network in an embodiment;
FIG. 4 is a schematic diagram of the general structure of a VQC in an embodiment;
FIG. 5 is a schematic diagram of the principle of quantum encoding in an embodiment;
FIG. 6 is a schematic diagram of the VQC structure of the QLSTM in an embodiment;
FIG. 7 is a schematic diagram of the QLSTM neural network model training flow of an embodiment;
FIG. 8 is a schematic diagram of the model structure of the QLSTM neural network in an embodiment;
FIG. 9 is a diagram of neural network training in an embodiment.
Detailed Description
First embodiment: this embodiment is described with reference to FIG. 1. The social text sentiment analysis system based on a deep quantum neural network comprises a data acquisition module, a data processing module, a text sequence information acquisition model and a sentiment analysis model, wherein:
the data acquisition module is used for acquiring a sentiment classification data set comprising comment texts and the labels corresponding to the comment texts;
the labels are sentiment labels, namely: comments with positive sentiment tendency and comments with negative sentiment tendency;
the data processing module is used for segmenting the comment texts in the acquired sentiment classification data set to obtain comment text word sequences, and then mapping the comment text word sequences into a word vector space to obtain vectorized text sequence information;
the text sequence information acquisition model is trained using the comment text word sequences produced by the data processing module and the corresponding vectorized text sequence information, and the output of the trained text sequence information acquisition model is vectorized text sequence information;
the sentiment analysis model comprises a QLSTM neural network and a network fusion module, wherein the QLSTM neural network comprises an Input layer, a QLSTM layer and a fully connected layer, and the QLSTM layer comprises t QLSTM units;
the QLSTM neural network is pre-trained using the vectorized text sequence information and the labels corresponding to the data in the sentiment classification data set; after n pre-trained QLSTM neural networks are obtained, the network fusion module fuses them, and the output of the fused networks serves as the output of the sentiment analysis model;
when pre-training the QLSTM neural network, the samples need to be standardized into the [0, 1] interval to ease network convergence, specifically:
x_j' = (x_j - x_min) / (x_max - x_min)
where x_max and x_min are the maximum and minimum values in the known sample data, and x_j is the sample value currently being adjusted.
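For illustration, a minimal Python sketch of this normalization (the helper name is ours, not the patent's):

```python
import numpy as np

def min_max_normalize(x):
    """Rescale sample data into the [0, 1] interval before QLSTM
    pre-training; x_min and x_max come from the known sample data."""
    x = np.asarray(x, dtype=float)
    x_min, x_max = x.min(), x.max()
    return (x - x_min) / (x_max - x_min)

# Example: min_max_normalize([2.0, 5.0, 11.0]) -> [0.0, 0.3333..., 1.0]
```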
The comment text to be analyzed is segmented to obtain the word sequence to be analyzed, which is input into the trained text sequence information acquisition model to obtain the vectorized text sequence information corresponding to the comment text to be analyzed;
the vectorized text sequence information corresponding to the comment text to be analyzed is input into the pre-trained and fine-tuned sentiment analysis model, which outputs the sentiment analysis result.
Second embodiment: this embodiment differs from the first in that the data acquired by the data acquisition module also comprises a target data set whose format is the same as that of the sentiment classification data set, i.e. it comprises comment texts and the labels corresponding to them;
the pre-trained sentiment analysis model is fine-tuned using the vectorized text sequence information and labels corresponding to the comment texts in the target data set, yielding the final sentiment analysis model.
Other steps and parameters are the same as those in the first embodiment.
Third embodiment: this embodiment differs from the first or second in that the sentiment classification data set is obtained by crawling Sina Weibo with the Scrapy distributed crawler framework;
the target data set is a data set of Ctrip hotel reviews organized and published in 2018.
Other steps and parameters are the same as those in the first or second embodiment.
Fourth embodiment: this embodiment differs from the first to third embodiments in that the QLSTM unit is defined as:
v_t = h_{t-1} * x_t
f_t = Activate(VQC1(v_t))
i_t = Activate(VQC2(v_t))
\tilde{c}_t = Activate(VQC3(v_t))
c_t = f_t * c_{t-1} + i_t * \tilde{c}_t
o_t = Activate(VQC4(v_t))
h_t = VQC5(o_t * Activate(c_t))
y_t = VQC6(o_t * Activate(c_t))
where x_t is the input vector of the QLSTM unit at time t, h_{t-1} is the hidden state at time t-1, v_t is the input vector of the VQC cell bodies at time t, Activate(·) is an activation function, f_t is the forget gate vector at time t, i_t is the input gate vector at time t, \tilde{c}_t is the temporary cell state at time t, c_{t-1} is the cell state at time t-1, c_t is the cell state at time t, o_t is the output gate vector at time t, h_t is the hidden state at time t, y_t is the output state at time t, and VQC1 through VQC6 denote the 1st through 6th VQC cell bodies respectively.
Other steps and parameters are the same as those in one of the first to third embodiments.
Fifth embodiment: this embodiment differs from the first to fourth embodiments in that the activation function Activate is a Sigmoid activation function or the hyperbolic tangent activation function.
Other steps and parameters are the same as in one of the first to fourth embodiments.
Sixth embodiment: this embodiment differs from the first to fifth embodiments in the working process of the QLSTM neural network, which is as follows:
the output of the Input layer serves as the input of the QLSTM layer. Feature extraction in the QLSTM layer comprises two processes: forward-propagation feature extraction among the QLSTM units and backward-propagation feature extraction among the QLSTM units. The outputs of the last QLSTM unit in the two directions are concatenated, and the concatenation result is the final output of the QLSTM layer, which serves as the input of the fully connected layer. The features are mapped into the [0, 1] interval by a Sigmoid activation function, and the mapping result is taken as the probability of the social text's sentiment polarity: if the probability P >= 0.5, the sentiment expressed by the social text is positive; otherwise it is negative.
A unidirectional network stores only past information, ignores future information, and can make predictions only from the past. Unlike a unidirectional neural network, the bidirectional neural network in this embodiment stores past and future information and combines the two as two hidden states, so the prediction can draw on both past and future information, improving prediction accuracy.
Other steps and parameters are the same as those in one of the first to fifth embodiments.
Seventh embodiment: this embodiment differs from the first to sixth embodiments in that, when the QLSTM neural network is pre-trained, the loss function used is the cross-entropy loss and the network parameters are updated with the RMSprop algorithm.
The parameters of the QLSTM neural network model are adjusted by iterative training to obtain the optimal neural network model:
the parameters of the quantum neural network model are mainly those of tunable-parameter quantum gates; for example, the single-qubit rotation gate R_I must learn the three parameters α_I, β_I and γ_I. When the output of the network differs from the expected value, an error function exists, and the error is back-propagated to adjust the parameters of the tunable-parameter quantum gates, training the QLSTM neural network.
During optimization, the parameter-shift method is used to obtain an analytic gradient of the quantum circuit. Given an observable \hat{B}, the expected value is
f(x; θ_i) = ⟨0| U_0†(x) U_i†(θ_i) \hat{B} U_i(θ_i) U_0(x) |0⟩
where x is the input value of the quantum circuit, U_0(x) prepares the encoding of the classical x into a quantum state, i is the index of the circuit parameter whose gradient is to be computed, and U_i(θ_i) is a single-qubit rotation generated by a Pauli matrix. The gradient of f with respect to the parameter θ_i is
∇_{θ_i} f = (1/2) [ f(x; θ_i + π/2) − f(x; θ_i − π/2) ]
Thus the expectation values needed to evaluate the gradient with respect to θ_i can be obtained analytically, and gradient-descent optimization from classical machine learning can be applied to the VQC-based machine learning model.
In other words, the parameter-shift method expresses the gradient of f with respect to θ_i as a linear combination of the evaluation function at two shifted parameter values.
Other steps and parameters are the same as those in one of the first to sixth embodiments.
Eighth embodiment: this embodiment differs from the first to seventh embodiments in that the method adopted by the network fusion module is the bagging algorithm.
The QLSTM neural networks trained on different data sets are combined into a strong learner by voting, improving the robustness of the model (a sketch of the voting fusion follows below). The bootstrap sampling of the bagging algorithm effectively avoids overfitting, and generalization improves as the degree of difference between the individual networks grows; the method can handle high-dimensional features without feature selection; and it is structurally simple, easy to implement, cheap to compute and efficient to train.
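A minimal sketch of the voting fusion and bootstrap sampling, assuming each pre-trained network is a callable returning a probability (the helper names are ours):

```python
import numpy as np

def bootstrap_sample(data, labels, rng):
    """Sampling with replacement: builds one of the n training sets
    that give the individual QLSTM learners their diversity."""
    idx = rng.integers(0, len(data), size=len(data))
    return data[idx], labels[idx]

def bagging_vote(models, x):
    """Fuse n pre-trained networks by majority voting: each model's
    probability P is thresholded at 0.5, and the majority label wins."""
    votes = [1 if model(x) >= 0.5 else 0 for model in models]
    return int(sum(votes) * 2 >= len(votes))  # ties count as positive
```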
Other steps and parameters are the same as those in one of the first to seventh embodiments.
Ninth embodiment: this embodiment differs from the first to eighth embodiments in that the VQC cell body structurally comprises a quantum encoding layer, a variational circuit layer and a quantum measurement layer;
the quantum operations of the variational circuit layer comprise multi-qubit Toffoli gates and single-qubit rotation gates, the multi-qubit Toffoli gate replacing the CNOT quantum gate and entangling three quantum wires.
The VQC cell body is a quantum circuit with tunable parameters that can be optimized iteratively. The principle is to encode classical data into a quantum state, send it into the quantum circuit, and transform it with the tunable-parameter quantum gates; finally, the transformed result is read out with a quantum measurement.
The quantum measurement layer considers the expected value of each qubit via measurement in the computational basis. Using a quantum simulation platform, the numerical calculation can be performed on a classical computer, whereas on a real quantum computer these values are estimated statistically by repeated measurements, which should theoretically approach the simulated values in the zero-noise limit. The returned result is a fixed-length vector that is processed further on a classical computer.
The CNOT quantum gate can only consider the relationship between two adjacent quantum wires, the state of the first wire determining whether the second performs a NOT operation, which is often insufficient. The Toffoli gate can consider the relationship of three quantum wires, the states of the first two wires determining whether the last one performs a NOT operation; this provides more basis for the prediction result and produces a better effect.
Other steps and parameters are the same as in one of the first to eighth embodiments.
Tenth embodiment: this embodiment differs from the first to ninth embodiments in that the text sequence information acquisition model is a BERT pre-trained model.
Other steps and parameters are the same as in one of the first to ninth embodiments.
The invention is further explained below with reference to the drawings and examples:
as shown in FIG. 1, the present embodiment provides a social text sentiment analysis method based on a quantum long short-term memory neural network, comprising the following steps:
S201, obtaining a sentiment classification data set and a target data set.
In this step, obtaining the sentiment analysis data set specifically comprises:
crawling comment texts and labels from Sina Weibo at large scale. Specifically, a crawler collects comment texts from Sina Weibo posts, using the Scrapy distributed crawler framework (a skeleton spider is sketched below). Each text comment corresponds to one comment label and is stored in binary format, yielding the sentiment analysis data set.
The target data set adopts the Ctrip hotel review data set organized and published in 2018, using its positive and negative samples; its data format is the same as that of the sentiment analysis data set, and it is smaller.
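A skeleton of such a Scrapy spider, for illustration only; the start URL and CSS selectors are placeholders, not the ones actually used:

```python
import scrapy

class CommentSpider(scrapy.Spider):
    """Collects comment texts from listing pages; each yielded item is
    later paired with its sentiment label and stored in binary format."""
    name = "comments"
    start_urls = ["https://example.com/hot-posts"]  # placeholder URL

    def parse(self, response):
        for comment in response.css("div.comment"):      # hypothetical selector
            yield {"text": comment.css("::text").get()}
```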
S202, constructing the initial sentiment analysis model.
In step S202, as shown in FIG. 2, building the initial sentiment analysis model mainly consists of constructing the QLSTM neural network and the model fusion.
The QLSTM neural network is specifically as follows: to extract sentiment features related to the social text context, a bidirectional QLSTM neural network is constructed, comprising an Input layer, a QLSTM layer and a fully connected layer. The QLSTM layer comprises t QLSTM units, connected in parallel and in opposite directions, for extracting context-related sentiment features.
The QLSTM unit adopts the cell body structure of the QLSTM neural network shown in FIG. 3. It consists of six VQC structures and classical binary gates. The QLSTM unit is defined here as follows (the Activate block represents the activation function, which can be the Sigmoid or the hyperbolic tangent; x_t is the input vector of the QLSTM unit at time t, h_t is the hidden state at time t, v_t is the input vector of the VQC cell bodies at time t, c_t is the cell state at time t):
v_t = h_{t-1} * x_t (1)
f_t = Sigmoid(VQC1(v_t)) (2)
i_t = Sigmoid(VQC2(v_t)) (3)
\tilde{c}_t = tanh(VQC3(v_t)) (4)
c_t = f_t * c_{t-1} + i_t * \tilde{c}_t (5)
o_t = Sigmoid(VQC4(v_t)) (6)
h_t = VQC5(o_t * tanh(c_t)) (7)
y_t = VQC6(o_t * tanh(c_t)) (8)
Equation (1) multiplies the hidden state h_{t-1} with the input vector x_t of the QLSTM unit to obtain the input vector v_t of the VQC cell bodies at time t.
Equation (2) implements the forget gate. VQC1 examines the cell-body input v_t at time t, and the Sigmoid activation outputs a forget gate vector f_t with values in the [0, 1] interval. By acting on c_{t-1} (i.e. f_t * c_{t-1}), f_t decides through the gating mechanism which parts of the cell state c_{t-1} are "forgotten" or "retained".
Equation (3) implements the input gate. VQC2 examines the input v_t at time t, and the Sigmoid activation outputs an input gate vector i_t with values in the [0, 1] interval. Equation (4) generates the temporary cell state \tilde{c}_t, applying VQC3 to the input v_t followed by the tanh activation. Finally, equation (5) multiplies the input gate i_t with the temporary cell state \tilde{c}_t, adds the product of the cell state c_{t-1} and the vector f_t, and thereby updates the cell state c_t at time t.
Equations (6), (7) and (8) implement the output. Equation (6) uses VQC4 on the input v_t at time t and the Sigmoid activation to output an output gate vector o_t in the [0, 1] interval. Equation (7) generates the hidden state h_t, applying VQC5 to the output gate vector o_t times the tanh-activated cell state c_t. Equation (8) generates the output state y_t, applying VQC6 to the output gate vector o_t times the tanh-activated cell state c_t.
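A minimal classical sketch of one QLSTM step following equations (1) through (8); the six `vqc` entries stand in for the trainable variational circuits VQC1 through VQC6 and can be any vector-to-vector maps of matching size:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def qlstm_cell(x_t, h_prev, c_prev, vqc):
    """One QLSTM step; vqc is a sequence of six callables (VQC1..VQC6)."""
    v_t = h_prev * x_t                      # Eq. (1): elementwise product
    f_t = sigmoid(vqc[0](v_t))              # Eq. (2): forget gate
    i_t = sigmoid(vqc[1](v_t))              # Eq. (3): input gate
    c_tilde = np.tanh(vqc[2](v_t))          # Eq. (4): temporary cell state
    c_t = f_t * c_prev + i_t * c_tilde      # Eq. (5): cell-state update
    o_t = sigmoid(vqc[3](v_t))              # Eq. (6): output gate
    h_t = vqc[4](o_t * np.tanh(c_t))        # Eq. (7): hidden state
    y_t = vqc[5](o_t * np.tanh(c_t))        # Eq. (8): output state
    return h_t, c_t, y_t
```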
The VQC structure is shown in FIG. 4 and comprises a quantum encoding layer U(x), a variational circuit layer V(θ) and a quantum measurement layer (x is the classical data to be processed by the quantum wires; θ is the angle by which the quantum data is rotated).
The principle of quantum encoding is shown in FIG. 5. The quantum state of N qubits can be expressed as
|ψ⟩ = Σ_{(q_1, ..., q_N) ∈ {0,1}^N} c_{q_1,...,q_N} |q_1⟩ ⊗ |q_2⟩ ⊗ ... ⊗ |q_N⟩
where each |q_i⟩ is a basis state with q_i ∈ {0, 1}. The squared amplitude |c_{q_1,...,q_N}|^2 is the probability of measuring the state |q_1, ..., q_N⟩, so the total probability satisfies
Σ_{(q_1, ..., q_N) ∈ {0,1}^N} |c_{q_1,...,q_N}|^2 = 1.
Following the quantum encoding principle, a schematic of the specific VQC structure designed here is shown in FIG. 6. The quantum encoding layer U(x) is composed of H, R_y and R_z gates. The variational circuit layer V(θ) is composed of Toffoli and R_I gates. The quantum measurement layer measures the output of the quantum wires through measurement gates by the Monte Carlo method.
H = (1/√2) [[1, 1], [1, −1]]
R_y(θ) = exp(−i θ σ_y / 2)
R_z(θ) = exp(−i θ σ_z / 2)
R_I = R(α_I, β_I, γ_I) = exp(−i (α_I σ_x + β_I σ_y + γ_I σ_z) θ / 2) = cos(θ/2) − i sin(θ/2) (α_I σ_x + β_I σ_y + γ_I σ_z)
x_{t+1} = R_I(x_t)
where x_t is the quantum state of the classical data at time t; the H gate is a reversible single-qubit gate that can rapidly prepare uniformly superposed qubits; R_y, R_z and R_I are single-qubit rotation gates that rotate the input state by a given angle about the corresponding axis; the Toffoli gate is a controlled gate that flips the controlled bit only when the control bits are all |1⟩; i is the imaginary unit; I indexes the I-th quantum wire; θ is the angle of rotation about a given axis; α_I, β_I, γ_I are the three rotation angles about the x, y and z axes respectively.
The Monte Carlo method is a numerical simulation method that takes probabilistic phenomena as its object of study.
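For illustration, a small NumPy state-vector simulation of this three-wire VQC layout (H plus R_y/R_z encoding, a Toffoli for entanglement, an R_z variational rotation, then ⟨Z⟩ on each wire); this is a toy sketch under our own parameter choices, not the patent's exact circuit:

```python
import numpy as np

I2 = np.eye(2)
H = np.array([[1, 1], [1, -1]]) / np.sqrt(2)

def ry(t):
    return np.array([[np.cos(t / 2), -np.sin(t / 2)],
                     [np.sin(t / 2),  np.cos(t / 2)]])

def rz(t):
    return np.diag([np.exp(-1j * t / 2), np.exp(1j * t / 2)])

def apply_1q(state, gate, wire, n=3):
    """Apply a single-qubit gate to one wire of an n-qubit state vector."""
    full = np.array([[1.0]])
    for w in range(n):
        full = np.kron(full, gate if w == wire else I2)
    return full @ state

# Toffoli on wires (0, 1 -> 2): flip qubit 2 iff qubits 0 and 1 are |1>
TOFFOLI = np.eye(8)
TOFFOLI[[6, 7], [6, 7]] = 0
TOFFOLI[6, 7] = TOFFOLI[7, 6] = 1

def vqc_forward(x, theta):
    """Encode x, entangle with the Toffoli, rotate by theta, measure <Z>."""
    state = np.zeros(8, dtype=complex)
    state[0] = 1.0                                   # |000>
    for w in range(3):                               # encoding layer U(x)
        state = apply_1q(state, H, w)
        state = apply_1q(state, ry(x[w]), w)
        state = apply_1q(state, rz(x[w]), w)
    state = TOFFOLI @ state                          # variational layer V(theta)
    for w in range(3):
        state = apply_1q(state, rz(theta[w]), w)
    probs = np.abs(state) ** 2                       # measurement layer
    return np.array([sum(probs[k] * (1 if ((k >> (2 - w)) & 1) == 0 else -1)
                         for k in range(8)) for w in range(3)])
```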
The model fusion is specifically as follows: the QLSTM neural networks trained on different data sets are combined into a strong learner by voting.
S203, vectorizing the comment text sequences of the sentiment classification data set and training the word vector model on them to obtain vectorized comment text sequences.
In step S203, a word segmenter is used to segment the comment texts, the comment text sequences of the sentiment data set are mapped into a word vector space, and the vectorized length of each comment text is computed.
In this step, S203 pre-trains the comment text vectors using the pre-trained model: the segmented comment text sequences and their lengths are input into the pre-trained model, which outputs the vectorized text sequence information.
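A hedged sketch of this vectorization step using the Hugging Face transformers library; "bert-base-chinese" is a common public checkpoint standing in for whatever pre-trained model is actually used:

```python
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-chinese")  # assumed checkpoint
bert = BertModel.from_pretrained("bert-base-chinese")

def vectorize(comment: str) -> torch.Tensor:
    """Map a comment text to vectorized sequence information:
    one hidden vector per token, shape [seq_len, hidden_size]."""
    inputs = tokenizer(comment, return_tensors="pt", truncation=True)
    with torch.no_grad():
        outputs = bert(**inputs)
    return outputs.last_hidden_state.squeeze(0)
```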
S204, inputting the pre-trained text sequences and labels into the initial sentiment analysis model for training to obtain the pre-trained sentiment analysis model.
In step S204, the pre-trained text sequences and labels are input into the initial sentiment analysis model for training, as shown in FIG. 7, specifically as follows:
the text sequences trained by the word vector model and the comment labels are split randomly into a training set and a test set in a 4:1 ratio, avoiding the overfitting that too little data would cause during model training. As shown in FIG. 8, a training set represented by m matrices of dimension n × d is input from the Input layer into the QLSTM layer. Feature extraction in the QLSTM layer comprises two processes: forward propagation [QLSTM(f)] feature extraction among the QLSTM units and backward propagation [QLSTM(b)] feature extraction among the QLSTM units. Finally the outputs of the last QLSTM unit in the two directions are concatenated as the final output of the QLSTM layer. The forward- and backward-propagating QLSTM computations are
h_t^f, c_t^f = QLSTM(f)(x_t, h_{t-1}^f, c_{t-1}^f)
h_t^b, c_t^b = QLSTM(b)(x_t, h_{t+1}^b, c_{t+1}^b)
where c_t denotes the cell state of the QLSTM at time t, h_t the hidden state at time t, x_t the input at time t, and c_{t-1}, h_{t-1} and c_{t+1}, h_{t+1} the cell state and hidden state at the adjacent moments. The parameters of the QLSTM in the two directions are shared, and the output of the QLSTM layer is the concatenation of the outputs of the last QLSTM unit in the two directions, n being the length of the social text sequence.
The output of the QLSTM layer is input into the fully connected layer for feature dimension reduction, and the features are mapped into the [0, 1] interval by a Sigmoid activation function, giving the probability of the social text's sentiment polarity. If the probability P >= 0.5, the sentiment expressed by the social text is positive; otherwise it is negative.
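A minimal sketch of the bidirectional pass and the final classification, reusing the `qlstm_cell` sketch above with its VQCs already bound (e.g. via functools.partial). The initial states are our own choice; the patent does not specify the initialization, and we use ones for the hidden state so the elementwise product in Eq. (1) does not zero out the first input:

```python
import numpy as np

def bi_qlstm_layer(xs, cell, d):
    """Run the shared-parameter QLSTM cell forward and backward over the
    sequence xs and concatenate the two final hidden states."""
    h_f, c_f = np.ones(d), np.zeros(d)
    for x in xs:                              # forward pass QLSTM(f)
        h_f, c_f, _ = cell(x, h_f, c_f)
    h_b, c_b = np.ones(d), np.zeros(d)
    for x in reversed(xs):                    # backward pass QLSTM(b)
        h_b, c_b, _ = cell(x, h_b, c_b)
    return np.concatenate([h_f, h_b])         # final output of the QLSTM layer

def classify(features, w, b):
    """Fully connected layer + Sigmoid: P >= 0.5 means positive sentiment."""
    p = 1.0 / (1.0 + np.exp(-(w @ features + b)))
    return p, ("positive" if p >= 0.5 else "negative")
```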
Adjusting the model parameters of the QLSTM neural network, specifically: the QLSTM neural network of step S202 is trained on the training data. Model training computes the loss between the model predictions and the labels with the cross-entropy function
L = −Σ_i [ l[i] log o[i] + (1 − l[i]) log(1 − o[i]) ]
and updates the parameters of each network layer with the RMSprop algorithm:
E[g^2]_t = α E[g^2]_{t-1} + (1 − α) g_t^2
θ_{t+1} = θ_t − (η / (√(E[g^2]_t) + ε)) g_t
where l[i] is the label of sample i, o[i] is the probability predicted for sample i, g_t is the gradient at step t, and E[g^2]_t is the exponentially weighted moving average of the squared gradients. For the QLSTM, the hyper-parameters are set as follows: learning rate η = 0.01, smoothing constant α = 0.99, and ε = 10^{-8}.
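A minimal sketch of the loss and one RMSprop update with exactly these hyper-parameters (the function names are ours):

```python
import numpy as np

def cross_entropy(labels, probs, tiny=1e-12):
    """Loss between labels l[i] and predicted probabilities o[i]."""
    p = np.clip(probs, tiny, 1 - tiny)
    return -np.mean(labels * np.log(p) + (1 - labels) * np.log(1 - p))

def rmsprop_step(theta, grad, sq_avg, eta=0.01, alpha=0.99, eps=1e-8):
    """One RMSprop update: E[g^2]_t = alpha*E[g^2]_{t-1} + (1-alpha)*g_t^2,
    theta_{t+1} = theta_t - eta * g_t / (sqrt(E[g^2]_t) + eps)."""
    sq_avg = alpha * sq_avg + (1 - alpha) * grad ** 2
    theta = theta - eta * grad / (np.sqrt(sq_avg) + eps)
    return theta, sq_avg
```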
The quantum gate parameters of the QLSTM neural network are adjusted through this computation, and the next round of neural network training is carried out.
The QLSTM neural network is trained iteratively, specifically: the model converges after 10 training iterations, and the model at that point is taken as the feature extraction model; the trained QLSTM model is then used to extract social text features, giving the neural network training process of FIG. 9. Since n well-performing QLSTM neural network models need to be trained for step S202, the neural network training is traversed n times. These models are fused by voting to form a strong learner.
S205, fine-tuning on the target data set with the pre-trained sentiment analysis model to obtain the sentiment analysis model.
Specifically, the parameters learned by the pre-trained sentiment analysis model are transferred and verified on the target data set, the fusion relying on the several QLSTM neural networks. If the effect is poor, the QLSTM model parameters of S204 need to be retrained and optimized, so that n QLSTM networks are obtained anew for model fusion. The target data set is then used for testing.
S206, segmenting the text to be analyzed and applying the word vector model, and finally inputting the result into the sentiment analysis model for sentiment analysis.
The sentiment tendency of public opinion can thus be predicted efficiently and quickly, guiding social media comments to develop in the right direction.
The above calculation examples of the present invention merely explain its calculation model and calculation flow in detail and are not intended to limit the embodiments of the present invention. It will be apparent to those skilled in the art that other variations and modifications can be made on the basis of the above description; it is neither possible nor necessary to exhaust all embodiments here, and all obvious variations and modifications derived therefrom fall within the scope of the invention.

Claims (10)

1. A social text sentiment analysis system based on a deep quantum neural network, characterized by comprising a data acquisition module, a data processing module, a text sequence information acquisition model and a sentiment analysis model, wherein:
the data acquisition module is used for acquiring a sentiment classification data set comprising comment texts and the labels corresponding to the comment texts;
the data processing module is used for segmenting the comment texts in the acquired sentiment classification data set to obtain comment text word sequences, and then mapping the comment text word sequences into a word vector space to obtain vectorized text sequence information;
the text sequence information acquisition model is trained using the comment text word sequences produced by the data processing module and the corresponding vectorized text sequence information, and the output of the trained text sequence information acquisition model is vectorized text sequence information;
the sentiment analysis model comprises a quantum long short-term memory (QLSTM) neural network and a network fusion module, wherein the QLSTM neural network comprises an Input layer, a QLSTM layer and a fully connected layer, and the QLSTM layer comprises t QLSTM units;
the QLSTM neural network is pre-trained using the vectorized text sequence information and the labels corresponding to the data in the sentiment classification data set; after n pre-trained QLSTM neural networks are obtained, the network fusion module fuses them, and the output of the fused networks serves as the output of the sentiment analysis model;
the comment text to be analyzed is segmented to obtain the word sequence to be analyzed, which is input into the trained text sequence information acquisition model to obtain the vectorized text sequence information corresponding to the comment text to be analyzed;
the vectorized text sequence information corresponding to the comment text to be analyzed is input into the sentiment analysis model, which outputs the sentiment analysis result.
2. The social text sentiment analysis system based on a deep quantum neural network according to claim 1, characterized in that the data acquired by the data acquisition module also comprises a target data set whose data format is the same as that of the sentiment classification data set, i.e. the target data set comprises comment texts and the labels corresponding to them;
the pre-trained sentiment analysis model is fine-tuned using the vectorized text sequence information and labels corresponding to the comment texts in the target data set, yielding the final sentiment analysis model.
3. The social text sentiment analysis system based on a deep quantum neural network according to claim 2, characterized in that the sentiment classification data set is obtained by crawling Sina Weibo with the Scrapy distributed crawler framework;
the target data set is a data set of Ctrip hotel reviews organized and published in 2018.
4. The social text sentiment analysis system based on a deep quantum neural network according to claim 3, characterized in that the QLSTM unit is defined as:
v_t = h_{t-1} * x_t
f_t = Activate(VQC1(v_t))
i_t = Activate(VQC2(v_t))
\tilde{c}_t = Activate(VQC3(v_t))
c_t = f_t * c_{t-1} + i_t * \tilde{c}_t
o_t = Activate(VQC4(v_t))
h_t = VQC5(o_t * Activate(c_t))
y_t = VQC6(o_t * Activate(c_t))
where x_t is the input vector of the QLSTM unit at time t, h_{t-1} is the hidden state at time t-1, v_t is the input vector of the VQC cell bodies at time t, Activate(·) is an activation function, f_t is the forget gate vector at time t, i_t is the input gate vector at time t, \tilde{c}_t is the temporary cell state at time t, c_{t-1} is the cell state at time t-1, c_t is the cell state at time t, o_t is the output gate vector at time t, h_t is the hidden state at time t, y_t is the output state at time t, and VQC1 through VQC6 denote the 1st through 6th VQC cell bodies respectively.
5. The social text sentiment analysis system based on a deep quantum neural network according to claim 4, characterized in that the activation function Activate is a Sigmoid activation function or the hyperbolic tangent activation function.
6. The social text sentiment analysis system based on a deep quantum neural network according to claim 5, characterized in that the working process of the QLSTM neural network is as follows:
the output of the Input layer serves as the input of the QLSTM layer; feature extraction in the QLSTM layer comprises two processes, namely forward-propagation feature extraction among the QLSTM units and backward-propagation feature extraction among the QLSTM units; the outputs of the last QLSTM unit in the two directions are concatenated, and the concatenation result is the final output of the QLSTM layer, which serves as the input of the fully connected layer; the features are mapped into the [0, 1] interval by a Sigmoid activation function, and the mapping result is taken as the probability of the social text's sentiment polarity: if the probability P >= 0.5, the sentiment expressed by the social text is positive, otherwise it is negative.
7. The social text sentiment analysis system based on a deep quantum neural network according to claim 6, characterized in that when the QLSTM neural network is pre-trained, the loss function used is the cross-entropy loss and the network parameters are updated with the RMSprop algorithm.
8. The social text sentiment analysis system based on a deep quantum neural network according to claim 7, characterized in that the method adopted by the network fusion module is the bagging algorithm.
9. The social text sentiment analysis system based on a deep quantum neural network according to claim 8, characterized in that the VQC cell body structurally comprises a quantum encoding layer, a variational circuit layer and a quantum measurement layer;
the quantum operations of the variational circuit layer comprise multi-qubit Toffoli gates and single-qubit rotation gates, the multi-qubit Toffoli gate replacing the CNOT quantum gate and entangling three quantum wires.
10. The social text sentiment analysis system based on a deep quantum neural network according to claim 9, characterized in that the text sequence information acquisition model is a BERT pre-trained model.
CN202111350644.XA · Priority 2021-11-15 · Filed 2021-11-15 · Social text sentiment analysis system based on deep quantum neural network · Pending · CN114461791A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111350644.XA CN114461791A (en) 2021-11-15 2021-11-15 Social text sentiment analysis system based on deep quantum neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111350644.XA CN114461791A (en) 2021-11-15 2021-11-15 Social text sentiment analysis system based on deep quantum neural network

Publications (1)

Publication Number Publication Date
CN114461791A 2022-05-10

Family

ID=81405938

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111350644.XA Pending CN114461791A (en) 2021-11-15 2021-11-15 Social text sentiment analysis system based on deep quantum neural network

Country Status (1)

Country Link
CN (1) CN114461791A (en)


Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115130655A (en) * 2022-05-22 2022-09-30 上海图灵智算量子科技有限公司 Method for solving product reaction center prediction in inverse synthesis
CN116992942A (en) * 2023-09-26 2023-11-03 苏州元脑智能科技有限公司 Natural language model optimization method, device, natural language model, equipment and medium
CN116992942B (en) * 2023-09-26 2024-02-02 苏州元脑智能科技有限公司 Natural language model optimization method, device, natural language model, equipment and medium

Similar Documents

Publication Publication Date Title
CN109492822B (en) Air pollutant concentration time-space domain correlation prediction method
CN109284506B (en) User comment emotion analysis system and method based on attention convolution neural network
Shen et al. Wind speed prediction of unmanned sailboat based on CNN and LSTM hybrid neural network
CN106156003B (en) A kind of question sentence understanding method in question answering system
CN111860982A (en) Wind power plant short-term wind power prediction method based on VMD-FCM-GRU
CN109829541A (en) Deep neural network incremental training method and system based on learning automaton
CN114461791A (en) Social text sentiment analysis system based on deep quantum neural network
CN111222992A (en) Stock price prediction method of long-short term memory neural network based on attention mechanism
Wu et al. Optimized deep learning framework for water distribution data-driven modeling
Pavlov et al. Machine learning and the Schrödinger equation
CN110019796A (en) A kind of user version information analysis method and device
CN116543289B (en) Image description method based on encoder-decoder and Bi-LSTM attention model
Feng et al. A dual-staged attention based conversion-gated long short term memory for multivariable time series prediction
CN116579408A (en) Model pruning method and system based on redundancy of model structure
Miao Emotion Analysis and Opinion Monitoring of Social Network Users Under Deep Convolutional Neural Network
Shen et al. Online teaching course recommendation based on autoencoder
CN115759343A (en) E-LSTM-based user electric quantity prediction method and device
CN115063972A (en) Traffic speed prediction method and system based on graph convolution and gate control cyclic unit
CN110852505B (en) Smart city traffic flow prediction method based on quantum genetic optimization LVQ neural network
Jiang et al. Deep belief improved bidirectional LSTM for multivariate time series forecasting
Zuhri et al. Probability Prediction for Graduate Admission Using CNN-LSTM Hybrid Algorithm
Mandhasiya et al. Evaluation of Machine Learning Performance Based on BERT Data Representation with LSTM Model to Conduct Sentiment Analysis in Indonesian for Predicting Voices of Social Media Users in the 2024 Indonesia Presidential Election
Yun et al. Quality evaluation and satisfaction analysis of online learning of college students based on artificial intelligence
Bai AGA-LSTM: An Optimized LSTM Neural Network Model Based on Adaptive Genetic Algorithm
Wang An Evaluation Method of English Composition Automatic Grading Based on Genetic Optimization Algorithm and CNN Model

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination