CN110222185A - Entity-associated sentiment information representation method - Google Patents

Entity-associated sentiment information representation method

Info

Publication number
CN110222185A
CN110222185A (application CN201910511692.9A)
Authority
CN
China
Prior art keywords
entity
text
word
word vector
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910511692.9A
Other languages
Chinese (zh)
Inventor
徐睿峰
梁斌
杜嘉晨
黄锦辉
何瑜岚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Graduate School Harbin Institute of Technology
Original Assignee
Shenzhen Graduate School Harbin Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Graduate School Harbin Institute of Technology filed Critical Shenzhen Graduate School Harbin Institute of Technology
Priority to CN201910511692.9A priority Critical patent/CN110222185A/en
Publication of CN110222185A publication Critical patent/CN110222185A/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F 16/35 Clustering; Classification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/24 Classification techniques
    • G06F 18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F 18/2411 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines

Abstract

The invention relates to a sentiment information classification method for associated entities. The method comprises: step 1), training large-scale word vectors on a Wikipedia corpus as the general word-vector representations of the words in a text; step 2), fine-tuning the word vectors for the different entities and entity attributes in the text using the Q-learning method from reinforcement learning, so that a word has different vector representations when modifying different entities or entity attributes; step 3), applying the learned word sentiment-information vector representations to an entity-level text sentiment analysis task. With this method, the sentiment polarities of different entities or entity attributes can be effectively distinguished without using an attention mechanism.

Description

Entity-associated sentiment information representation method
Technical field
The invention belongs to the technical field of sentiment information representation, and in particular relates to an entity-associated sentiment information representation method.
Background technique
Text sentiment analysis determines the sentiment polarity of a text by analyzing, summarizing, and otherwise processing the text. In a text sentiment analysis task, the words of the text, and in particular words that carry emotional coloring, directly affect the text's sentiment polarity. In an entity-level sentiment analysis task, the sentiment polarity must be judged separately for the different entities in the text, which requires considering not only the text itself but also the information about the different entities it contains. In real text data, a single text often mentions multiple entities, and different entities carry different sentiment expressions. Moreover, even when the same modifier is applied to different entities, completely opposite sentiment polarities can result. For example, in "the noise of the car is very big" and "the space of the car is very big", the same word "big" describes an attribute of the entity "car", yet it expresses negative sentiment when describing the car's noise and positive sentiment when describing the car's space.
There are many traditional word representation methods, such as one-hot representations and word-vector representations that express a word as a vector of continuous values (continuous bag-of-words, CBOW, and skip-gram). By expressing a word as a multi-dimensional vector that a model can learn and adjust, such methods capture the word's characteristic information in the text. However, these methods usually consider only the word itself and its dependencies on other words in the text, so a word has exactly the same vector representation across different scenes, different entities, and different entity attributes. For entity-level sentiment analysis tasks, the commonly used approaches either concatenate the representation of a specific entity with the individual word representations to construct new word representations, or introduce external knowledge bases or dependency parsing to obtain the links between words and entities. Although these methods can, to some extent, address the word representation problem in multi-entity text sentiment analysis, they still have shortcomings:
1. Methods that concatenate entity vectors add the same vector information to every word and therefore cannot effectively distinguish how much different words contribute to an entity or entity attribute;
2. Methods that incorporate external knowledge depend heavily on the quality of that knowledge; when the introduced information is inappropriate, it instead makes the model harder to train;
3. None of these methods constructs distinct vector representations for different words with respect to a specific entity or entity attribute, so a word does not obtain different representations when modifying different entities, and the importance of individual words is not distinguished.
Summary of the invention
To overcome the shortcomings of the prior art, the present invention proposes an entity-associated sentiment information representation method that performs targeted fine-tuning of word vectors without using external knowledge, so that a word has different vector representations when associated with different entities and the sentiment polarities of different entities or entity attributes can be effectively distinguished.
To achieve the above goal, the technical solution adopted by the present invention is as follows:
A kind of emotion information representation method of associated entity, which is characterized in that this method includes the following steps:
Step 1), training large-scale word vectors on a Wikipedia corpus as the general word-vector representations of the words in the text;
Step 2), fine-tuning the word vectors for the different entities and entity attributes in the text using the Q-learning method from reinforcement learning, so that a word has different vector representations when modifying different entities or entity attributes;
Step 3), applying the learned word sentiment-information vector representations to a specific text sentiment analysis task.
The next word is chosen with an ε-greedy policy, and different entities are assigned different reward values.
Compared with the prior art, the advantages of the present invention are:
1. The proposed method of fine-tuning word vectors with Q-learning from reinforcement learning performs targeted vector fine-tuning of words without using external knowledge, so that a word has different vector representations when associated with different entities;
2. With the ε-greedy method, the sentiment links between an entity or entity attribute and words that lie far from it in the text can still be captured;
3. By representing the input text with the fine-tuned word vectors proposed by the present invention, the sentiment polarities of different entities or entity attributes can be effectively distinguished without using an attention mechanism.
Detailed description of the invention
Fig. 1 shows the training of the general word vectors;
Fig. 2 shows the classification model that uses the fine-tuned word vectors.
Specific embodiment
The present invention is further described below with reference to the accompanying drawings and specific embodiments.
The present invention is an entity-associated sentiment information representation method.
The main steps of the method are:
Step 1: train large-scale word vectors on a Wikipedia corpus as the general word-vector representations of the words in the text;
Step 2: fine-tune the word vectors for the different entities and entity attributes in the text using the Q-learning method from reinforcement learning, so that a word has different vector representations when modifying different entities or entity attributes;
Step 3: apply the learned word sentiment-information vector representations to a specific text sentiment analysis task.
A schematic diagram of the method is shown in Figs. 1 and 2.
In step 1 of the above method, the general word vectors are trained on a large-scale Wikipedia corpus, specifically as follows (shown in Fig. 1; an illustrative code sketch follows the list):
1. Crawl a sufficient amount of corpus text from Wikipedia and preprocess it, filtering out text that is of no use to the task;
2. Train word vectors on the Wikipedia corpus with a deep language-model network (ASGD Weight-Dropped Long-Short Term Memory, AWD-LSTM) to obtain the word-vector set of the vocabulary.
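The patent does not spell out the training loop, so the following is only a minimal sketch under stated assumptions: a plain PyTorch LSTM language model stands in for AWD-LSTM (its ASGD optimizer and weight-drop regularization are omitted), and the learned embedding table is taken as the general word-vector set. All class and function names, dimensions, and hyperparameters are illustrative, not from the patent.

```python
import torch
import torch.nn as nn

class LMEmbeddingTrainer(nn.Module):
    """Simplified language model; its embedding table serves as the
    general word vectors of step 1 (weight drop / ASGD omitted)."""
    def __init__(self, vocab_size, emb_dim=300, hidden_dim=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, token_ids):
        h, _ = self.lstm(self.embed(token_ids))
        return self.out(h)                        # next-word logits per position

def train_general_vectors(batches, vocab_size, epochs=3):
    """batches: iterable of LongTensor [batch, seq_len] built from the
    preprocessed Wikipedia sentences."""
    model = LMEmbeddingTrainer(vocab_size)
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for token_ids in batches:
            logits = model(token_ids[:, :-1])     # predict the next token
            loss = loss_fn(logits.reshape(-1, vocab_size),
                           token_ids[:, 1:].reshape(-1))
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model.embed.weight.detach()            # general word-vector set
```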
In step 2 of the above method, the word vectors are fine-tuned on the task-specific corpus using Q-learning from reinforcement learning together with the AWD-LSTM network:
v_{s,w} = v_{s,w} + α(r_i + γ max_{w'} v_{s',w'} - v_{s,w})
where v_{s,w} is the vector representation of the current word, v_{s',w'} is the vector representation reached when moving from the current word to the next word, r_i is the reward given for this word move with respect to entity or entity attribute i, α is the learning rate, and γ is the reward decay coefficient. In the present invention, words are traversed one by one toward a chosen entity or entity attribute: each intermediate move receives a reward of 0, and a specific reward r_i is given when the traversal reaches entity or entity attribute i. By setting different rewards for different entities and entity attributes, different words can be adjusted in a targeted way during learning; at the same time, moving word by word also distinguishes the degree to which different words influence the sentiment of the entity or entity attribute.
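For illustration only, the sketch below applies the update above in the simplest setting the description suggests: the traversal steps deterministically toward the entity, so the max over the next word w' reduces to that single word's vector, rewards are 0 until the entity is reached, and the update is applied element-wise to the word's entity-specific vector. These are assumptions; the patent leaves the exact formulation open, and every name here is hypothetical.

```python
import numpy as np

def q_finetune_pass(vectors, sent, entity_pos, reward_i, alpha=0.01, gamma=0.9):
    """One fine-tuning pass over one sentence for one entity/attribute.

    vectors    : dict word -> np.ndarray, entity-specific copy of the word vectors
    sent       : list of words in the sentence
    entity_pos : index of the entity or entity attribute in `sent`
    reward_i   : reward r_i paid when the traversal reaches the entity
    """
    for pos in range(len(sent)):
        if pos == entity_pos:
            continue
        nxt = pos + (1 if pos < entity_pos else -1)   # step toward the entity
        r = reward_i if nxt == entity_pos else 0.0    # 0 for intermediate moves
        v_cur, v_nxt = vectors[sent[pos]], vectors[sent[nxt]]
        # v_{s,w} <- v_{s,w} + alpha * (r_i + gamma * v_{s',w'} - v_{s,w})
        vectors[sent[pos]] = v_cur + alpha * (r + gamma * v_nxt - v_cur)
    return vectors
```

In this reading, `vectors` would be a per-entity (or per-attribute) copy of the general word vectors from step 1, so that after fine-tuning the same word carries different representations for different entities.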
In addition, in real text, words that are highly relevant to an entity may appear far away from that entity; in such cases the fine-tuning method above cannot learn the sentiment links between those words and the entity or entity attribute well. To address this, the present invention chooses the next word with an ε-greedy policy: in the method above, a word is selected at random from the text with probability ε. In this way, the sentiment links to the entity or entity attribute of words that are far from the entity but have an important influence can still be captured effectively.
In the word-vector fine-tuning method, the mean squared error is used as the objective function:
L(v) = E[(r_i + γ max_{w'} v_{s',w'} - v_{s,w})^2]
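Continuing the hypothetical helpers above, here is a minimal sketch of the ε-greedy choice of the next word and of the per-move squared error over which the expectation in L(v) would be estimated; again, the concrete formulation is an assumption, not prescribed by the patent.

```python
import numpy as np

def choose_next(pos, entity_pos, sent_len, epsilon=0.1, rng=np.random.default_rng()):
    """epsilon-greedy move: usually step toward the entity, but with probability
    epsilon jump to a random word, so that distant but sentiment-bearing words
    still get linked to the entity."""
    if rng.random() < epsilon:
        return int(rng.integers(sent_len))            # random word in the text
    return pos + (1 if pos < entity_pos else -1)      # greedy step toward the entity

def squared_td_error(v_cur, v_nxt, r, gamma=0.9):
    """Per-move term of L(v) = E[(r_i + gamma * v_{s',w'} - v_{s,w})^2],
    averaged over vector dimensions; the expectation is taken empirically
    over the moves sampled during fine-tuning."""
    return float(np.mean((r + gamma * v_nxt - v_cur) ** 2))
```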
In step 3 of the above method, a conventional long short-term memory network (Long Short-Term Memory, LSTM) is used to perform entity-level text sentiment analysis on the specific corpus. The specific method is as follows (shown in Fig. 2):
Step 31): represent the input text with the fine-tuned word vectors, and feed the text into the LSTM network in time order.
Step 32): learnt and adjusted the abstract of the available text of ginseng to the term vector matrix in 1 by LSTM network Change character representation:
H = [h_1, h_2, ..., h_n]
Step 33): taking the abstract features of the last network layer obtained in step 32) as the input of a fully connected layer, the sentiment analysis result for the associated entity is obtained through the softmax function:
y = softmax(W h_n + b).
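A minimal sketch of the step-3 classifier: the sentence, embedded with the fine-tuned entity-specific vectors, is run through an LSTM, and the last hidden state h_n is mapped to sentiment classes by a fully connected layer with softmax. Layer sizes and the class count are illustrative assumptions, not values given in the patent.

```python
import torch
import torch.nn as nn

class EntitySentimentLSTM(nn.Module):
    def __init__(self, emb_dim=300, hidden_dim=128, num_classes=3):
        super().__init__()
        self.lstm = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, num_classes)   # W, b of y = softmax(W h_n + b)

    def forward(self, finetuned_vectors):
        # finetuned_vectors: [batch, seq_len, emb_dim], built from the
        # entity-specific word vectors produced in step 2
        H, _ = self.lstm(finetuned_vectors)            # H = [h_1, ..., h_n]
        h_n = H[:, -1, :]                              # last hidden state
        return torch.softmax(self.fc(h_n), dim=-1)     # sentiment class distribution
```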
In summary, the method performs targeted fine-tuning of word vectors without using external knowledge, so that a word has different vector representations when associated with different entities; with the ε-greedy method, the sentiment links between an entity or entity attribute and words far from it in the text can be captured; and by representing the input text with the fine-tuned word vectors, the sentiment polarities of different entities or entity attributes can be effectively distinguished without using an attention mechanism.
The above is a further detailed description of the present invention in conjunction with specific preferred embodiments, but the specific implementation of the present invention shall not be considered limited to these descriptions. For those of ordinary skill in the art to which the present invention belongs, several simple deductions or substitutions may be made without departing from the concept of the present invention, and all of these shall be regarded as falling within the protection scope of the present invention.

Claims (6)

1. An entity-associated sentiment information representation method, characterized in that it comprises:
Step 1), training large-scale word vectors on a large-scale text corpus as the general word-vector representations of the words in the text;
Step 2), fine-tuning the word vectors for the different entities and entity attributes in the text using the Q-learning method from reinforcement learning, so that a word has different vector representations when modifying different entities or entity attributes;
Step 3), applying the learned word sentiment-information vector representations to a specific text sentiment analysis task.
2. The method according to claim 1, characterized in that in step 1), the specific steps of training the general word vectors are as follows: crawl a large amount of text corpus from the internet and preprocess the corpus, removing irrelevant symbols and stop words from the text; then train word vectors on the large-scale corpus with a deep language-model network (ASGD Weight-Dropped Long-Short Term Memory, AWD-LSTM) to obtain the word-vector set of the words.
3. The method according to claim 1 or 2, characterized in that in step 2), the word vectors are fine-tuned on the task-specific corpus using Q-learning from reinforcement learning together with the AWD-LSTM network:
v_{s,w} = v_{s,w} + α(r_i + γ max_{w'} v_{s',w'} - v_{s,w})
where v_{s,w} is the vector representation of the current word, v_{s',w'} is the vector representation reached when moving from the current word to the next word, r_i is the reward given for this word move with respect to entity or entity attribute i, α is the learning rate, and γ is the reward decay coefficient.
4. The method according to claim 1 or 2, characterized in that in step 3), a long short-term memory network (Long Short-Term Memory, LSTM) is used to perform entity-level text sentiment analysis on the specific corpus.
5. The method according to claim 4, characterized in that the specific steps of performing entity-level text sentiment analysis on the specific corpus with the LSTM network are as follows:
Step 31): represent the input text with the fine-tuned word vectors, and feed the text into the LSTM network in time order;
Step 32): the LSTM network learns from, and adjusts the parameters over, the word-vector matrix of step 31) to obtain the abstract feature representation of the text:
H = [h_1, h_2, ..., h_n]
Step 33): taking the abstract features of the last network layer obtained in step 32) as the input of a fully connected layer, obtain the sentiment analysis result for the associated entity through the softmax function:
y = softmax(W h_n + b)
where W is a weight matrix and b is a bias.
6. The method according to claim 3, characterized in that the next word is chosen with ε-greedy, and different entities are assigned different reward values.
CN201910511692.9A 2019-06-13 2019-06-13 Entity-associated sentiment information representation method Pending CN110222185A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910511692.9A CN110222185A (en) 2019-06-13 2019-06-13 Entity-associated sentiment information representation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910511692.9A CN110222185A (en) 2019-06-13 2019-06-13 Entity-associated sentiment information representation method

Publications (1)

Publication Number Publication Date
CN110222185A true CN110222185A (en) 2019-09-10

Family

ID=67816893

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910511692.9A Pending CN110222185A (en) Entity-associated sentiment information representation method

Country Status (1)

Country Link
CN (1) CN110222185A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112241453A (en) * 2020-10-20 2021-01-19 虎博网络技术(上海)有限公司 Emotion attribute determining method and device and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6963839B1 (en) * 2000-11-03 2005-11-08 At&T Corp. System and method of controlling sound in a multi-media communication application
CN107066446A (en) * 2017-04-13 2017-08-18 广东工业大学 A kind of Recognition with Recurrent Neural Network text emotion analysis method of embedded logic rules
CN107301171A (en) * 2017-08-18 2017-10-27 武汉红茶数据技术有限公司 A kind of text emotion analysis method and system learnt based on sentiment dictionary
CN108629690A (en) * 2018-04-28 2018-10-09 福州大学 Futures based on deeply study quantify transaction system
CN109857848A (en) * 2019-01-18 2019-06-07 深圳壹账通智能科技有限公司 Interaction content generation method, device, computer equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6963839B1 (en) * 2000-11-03 2005-11-08 At&T Corp. System and method of controlling sound in a multi-media communication application
CN107066446A (en) * 2017-04-13 2017-08-18 广东工业大学 A kind of Recognition with Recurrent Neural Network text emotion analysis method of embedded logic rules
CN107301171A (en) * 2017-08-18 2017-10-27 武汉红茶数据技术有限公司 A kind of text emotion analysis method and system learnt based on sentiment dictionary
CN108629690A (en) * 2018-04-28 2018-10-09 福州大学 Futures based on deeply study quantify transaction system
CN109857848A (en) * 2019-01-18 2019-06-07 深圳壹账通智能科技有限公司 Interaction content generation method, device, computer equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
He Yanxiang, Sun Songtao, Niu Feifei, Li Fei: "用于微博情感分析的一种情感语义增强的" (a sentiment-semantics-enhanced ... for microblog sentiment analysis; title truncated in the source), 《计算机学报》 (Chinese Journal of Computers) *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112241453A (en) * 2020-10-20 2021-01-19 虎博网络技术(上海)有限公司 Emotion attribute determining method and device and electronic equipment
CN112241453B (en) * 2020-10-20 2023-10-13 虎博网络技术(上海)有限公司 Emotion attribute determining method and device and electronic equipment

Similar Documents

Publication Publication Date Title
CN108021616B (en) Community question-answer expert recommendation method based on recurrent neural network
US8818926B2 (en) Method for personalizing chat bots
US9396724B2 (en) Method and apparatus for building a language model
US9454958B2 (en) Exploiting heterogeneous data in deep neural network-based speech recognition systems
CN107818164A (en) A kind of intelligent answer method and its system
US20060206333A1 (en) Speaker-dependent dialog adaptation
CN110188202A (en) Training method, device and the terminal of semantic relation identification model
Armbrust A history of new media in the Arab Middle East
KR102415101B1 (en) A device that analyzes the emotions of the examinee using voice data, text data, and picture data extracted from the subject's voice
CN110825850B (en) Natural language theme classification method and device
EP3270374A1 (en) Systems and methods for automatic repair of speech recognition engine output
CN110210027A (en) Fine granularity sentiment analysis method, apparatus, equipment and medium based on integrated study
CN110134863A (en) The method and device that application program is recommended
Zhao et al. Domain-oriented prefix-tuning: Towards efficient and generalizable fine-tuning for zero-shot dialogue summarization
CN114818703A (en) Multi-intention recognition method and system based on BERT language model and TextCNN model
CN115630145A (en) Multi-granularity emotion-based conversation recommendation method and system
CN110222185A (en) Entity-associated sentiment information representation method
CN107734123A (en) A kind of contact sequencing method and device
CN112434165A (en) Ancient poetry classification method and device, terminal equipment and storage medium
Hirzel et al. I can parse you: Grammars for dialogs
Wijayanti et al. Illocutionary Acts in Main Character's Dialogue of “Maleficent: Mistress of Evil” Movie
Tomko et al. Speech graffiti vs. natural language: Assessing the user experience
Zhang et al. The iscslp 2022 intelligent cockpit speech recognition challenge (icsrc): Dataset, tracks, baseline and results
CN113627155A (en) Data screening method, device, equipment and storage medium
Zhan et al. Application of machine learning and image target recognition in English learning task

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190910