CN102929861B - Method and system for calculating text emotion index - Google Patents

Method and system for calculating text emotion index Download PDF

Info

Publication number
CN102929861B
CN102929861B CN201210404224.XA CN201210404224A CN102929861B CN 102929861 B CN102929861 B CN 102929861B CN 201210404224 A CN201210404224 A CN 201210404224A CN 102929861 B CN102929861 B CN 102929861B
Authority
CN
China
Prior art keywords
emotion
word
mark
field
negative
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210404224.XA
Other languages
Chinese (zh)
Other versions
CN102929861A (en
Inventor
廖建新
范文
张雷
赵贝尔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xinxun Digital Technology (Hangzhou) Co.,Ltd.
Original Assignee
Hangzhou Dongxin Beiyou Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Dongxin Beiyou Information Technology Co Ltd filed Critical Hangzhou Dongxin Beiyou Information Technology Co Ltd
Priority to CN201210404224.XA priority Critical patent/CN102929861B/en
Publication of CN102929861A publication Critical patent/CN102929861A/en
Application granted granted Critical
Publication of CN102929861B publication Critical patent/CN102929861B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a method and a system for calculating a text emotion index. The method comprises the following steps of: (A), establishing a non-field limitation emotion word library, selecting a plurality of non-field limitation emotion words, and saving the non-field limitation emotion words and corresponding emotion scores in the non-field limitation emotion word library; and (B), dividing a text to be calculated into a plurality of clauses according to punctuation marks, performing word segmentation on each clause, sequentially searching non-field limitation emotion words contained in each clause, adjusting an emotion score of each non-field limitation emotion word contained in the clauses by using an emotion index calculation rule, and finally, counting the emotion score of each clause according to the adjusted emotion scores of the non-field limitation emotion words, so that an emotion index value of the text is calculated. The method belongs to the fields of natural language processing technology and pattern recognition, and the emotion analysis accuracy and user satisfaction of the text can be effectively improved.

Description

A kind of text emotion index calculation method and system
Technical field
The present invention relates to a kind of text emotion index calculation method and system, belong to natural language processing technique and area of pattern recognition.
Background technology
Along with the progressively development of Internet technology, increasing netizen is by blog, and micro-blog, forum, the channels such as news online comment express them to commercial product, accident, the suggestion of government work etc.The commercial product company of some specific areas, as digital product industry, grocery trade, hotel industry etc., need to understand in time client to the evaluation of their product, understand client to the satisfaction of its product, and then make suitable product adjustment and adapt to competition among enterprises fierce at present; In addition, departments of government also needs to understand netizen in time to the attitude of government work or the attitude of focusing media event, grasps the mood of the people in time, makes correct choice, prevent the generation of malignant event.Because network information is large, increase also very fast, therefore we by computing machine, will carry out the sentiment analysis work of robotization simultaneously.
From current research conditions both domestic and external, the object of sentiment analysis assessment text is grouped into front evaluate (Positive) or negative (Negative) evaluation.Such as given product evaluation, system needs to judge expressed by this evaluation to be front suggestion or the negative comment (also introducing neutral evaluation in some cases) of the person of reading and appraising.Sentiment analysis is mainly used in judging the popular prevailing paradigm to an object fast.This task is similar based on the text classification (text classification) of theme to tradition, therefore identical technology is mostly used to solve this problem from technically seeing of current research use, as supervised learning algorithm (supervisedlearning), semi-supervised learning algorithm (semi ?supervised learning) and unsupervised learning algorithm (unsupervised learning).But both distinguish again to some extent, based on the text classification of theme document assigned in the different themes classification pre-defined, such as politics, science, physical culture etc.Based in the classification of theme, theme relative words are important.And in suggestion classification, theme relative words are unessential.On the contrary, the suggestion vocabulary characterizing front or negative view is important, such as, and " good ", " outstanding ", " sad ", " poor " etc.Meanwhile, these fields residing for viewpoint word are also very important, and different suggestion tendencies can be expressed in same word in different fields.Therefore, according to the domain feature of sentiment analysis, we can be divided into single field sentiment analysis technology and cross-cutting sentiment analysis technology.
1, single field sentiment analysis technology
Single field sentiment analysis, namely by the Emotional Corpus in certain field marked, is trained a disaggregated model on this basis, has then been continued the sentiment analysis task in this field follow-up by this model.In this classification, the application of supervised learning algorithm is occupied an leading position, as K ?nearest neighbor algorithm (kNN), NB Algorithm (NaiveBayes), and algorithm of support vector machine (SVM).
Due to the triviality of corpus annotation, semi-supervised learning algorithm, as expectation-maximization algorithm (EM) and unsupervised learning algorithm, score function method is also all widely used in sentiment analysis research.But experiment proves that supervised learning method is better than semi-supervised and unsupervised algorithm in the classification of single field suggestion.
2, cross-cutting sentiment analysis technology
In sentiment analysis application, field migration (domain ?transfer) problem is more common.As given an emotion word " height ".If " room rate is high ", so this word is exactly negative; If " earning a large income ", so this word is exactly front.If we adopt supervised learning algorithm conventional in single field in this case, the decline of sorter accuracy rate will be caused.In current research, solve sentiment analysis field migration problem and mainly contain three kinds of schemes.The first uses unsupervised learning (unsupervised learning) method exactly, namely semantic point (semantic orientation) method in English sentiment analysis in application, first the method carries out part-of-speech tagging to each participle, then the part of speech sequence of specifying is installed filter, the sequence satisfied condition is remained, then uses point to mutual information (PointWise Mutual Information) algorithm and AltaVista search engine completes and carries out feeling polarities marking to each phrase remained.The marking of last these phrases comprehensive, completes the feeling polarities marking to one or one section words.First scheme is the public characteristic (generalizable features) finding training set field (also referred to as source domain) and test set field (also referred to as target domain), then completes migration task by semi-supervised learning (semi ?supervised learning) algorithm.Last a solution uses corpus migration algorithm, and this algorithm finds the similar portion of target domain corpus and source domain corpus by calculating, then re-training sorter obtains disaggregated model.This scheme completes based on supervised learning algorithm.Although two kinds of solutions below avoid the drawback of the first scheme, all need re-training sorter, therefore the operational efficiency of whole algorithm is not high yet.
At present, robotization sentiment analysis for text it is also proposed some solutions: such as patented claim CN201210154332.6 (application title: a kind of text sentiment classification method and system, application time: 2012 ?05 ?17, applicant: University Of Suzhou) disclose a kind of text sentiment classification method, comprise: contrast preset emotion vocabulary, in text to be sorted, find out emotion word, and obtain the feeling polarities corresponding with emotion word according to emotion vocabulary; Use two polarity transformation rule judgment emotion word whether polarity transformation to occur, according to the feeling polarities of emotion word and the polarity transformation result of emotion word, calculate each word in text to be sorted and appear at the probability in the text of each polarity; Appear at the probability in the text of each polarity according to each word in text to be sorted, utilize Bayesian classifier model to treat classifying text and classify.These technical schemes are mainly used in judging text feeling polarities, as front, neutrality or negative.Because emotional semantic classification is different from text classification, to same section of comment, different people may think that it have expressed different Sentiment orientation, such as text: " this mobile phone working procedure travelling speed is very fast, and screen is also fine, is short of to some extent exactly in stand-by time." somebody may think neutral, somebody may think negative, and even some people may think front.Too thought in absolute terms the feeling polarities that text provides by technique scheme, be difficult to meet proprietary Sentiment orientation, analysis result accuracy rate is not high.
Therefore, how effectively to improve the accuracy rate that text emotion is analyzed? be still the technical barrier that a urgent need will solve.
Summary of the invention
In view of this, the object of this invention is to provide a kind of text emotion index calculation method and system, effectively can improve the accuracy rate that text emotion is analyzed.
In order to achieve the above object, the invention provides a kind of text emotion index calculation method, described method includes:
Steps A, build the emotion dictionary that non-field limits, select the emotion word that multiple non-field limits, and the emotion mark of the emotion word limited in described non-field and correspondence thereof is kept in the emotion dictionary that non-field limits;
Step B, according to punctuation mark, text to be calculated is divided into multiple clause, and participle is carried out to each clause, then the emotion word of the non-field restriction comprised in each clause is found successively, the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits is used to adjust, finally according to the emotion mark of the emotion word of the non-field restriction after adjustment, add up the emotion mark of each clause, thus calculate the affection index value of text to be calculated
In described step B, find the emotion word of the non-field restriction comprised in each clause successively, use the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits to adjust, include further:
Step B1, judge the emotion word that whether has non-field to limit in the participle of each clause, if so, then from the emotion dictionary that non-field limits, extract the emotion mark F that emotion word that described non-field limits is corresponding, continue next step; If not, then this flow process terminates;
Whether step B2, the participle judging to be positioned in described clause before emotion word that non-field limits have adversative, and if so, then the emotion mark F of the emotion word limited in described non-field is adjusted to 2*F, continues next step; If not, then next step is continued; Described adversative includes but are not limited to: but but but, wilfully, just but, as, so that, unexpectedly, surprisingly;
Step B3, the participle judging before emotion word that in described clause, non-field limits whether have negative word and the word distance of emotion word that negative word and non-field limit is less than or equal to 2, if, the emotion mark F of the emotion word then limited in described non-field is adjusted to-F, continues next step; If not, then next step is continued; Described negative word includes but are not limited to: not, do not have;
Step B4, judge whether to have adverbial word in described clause and the word distance of emotion word that adverbial word and non-field limit is less than or equal to 2, if so, then the emotion mark F of the emotion word limited in described non-field is adjusted to 2*F, continues next step; If not, then next step is continued; Described adverbial word includes but are not limited to: very, especially;
Step B5, judge whether emotion word that non-field limits is in the end of the sentence of clause, if so, then the emotion mark F of the emotion word limited in described non-field is adjusted to 2*F.
In order to achieve the above object, present invention also offers a kind of text emotion index computing system, include:
Non-field emotion dictionary construction device, for building the emotion dictionary that non-field limits, selects the emotion word that multiple non-field limits, and the emotion mark of the emotion word limited in described non-field and correspondence thereof is kept in the emotion dictionary of non-field restriction;
Text emotion exponential calculation device, for the text to be calculated of input being divided into multiple clause according to punctuation mark, and participle is carried out to each clause, then the emotion word of the non-field restriction comprised in each clause is found successively, the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits is used to adjust, finally according to the emotion mark of the emotion word of the non-field restriction after adjustment, add up the emotion mark of each clause, thus calculate the affection index value of text to be calculated
Text emotion exponential calculation device includes further:
Text input unit, for inputting text to be calculated, and sends to text emotion computing unit by described text to be calculated;
Text emotion computing unit, for text to be calculated is divided into multiple clause, and participle is carried out to each clause, the emotion word that the non-field of preserving in the emotion the dictionary then each participle in clause and non-field limited one by one limits contrasts, if consistent, then described clause and the emotion word of the non-field restriction of its correspondence are sent to emotion word score calculating unit; Receive the emotion mark of the emotion word of the non-field restriction that emotion word score calculating unit returns, and according to the emotion word that the non-field that each clause comprises limits, add up the emotion mark of each clause, finally according to the emotion mark of all clauses, add up the affection index value of text to be calculated;
Emotion word score calculating unit, the emotion word that the clause sent for receiving text emotion computing unit limits with the non-field of its correspondence, the emotion mark that the emotion word of described non-field restriction is corresponding is extracted from the emotion dictionary that non-field limits, then the emotion mark of affection index computation rule to the emotion word that the non-field comprised in clause limits is used to adjust, the emotion mark of the emotion word finally limited in the non-field after adjustment sends to text emotion computing unit, described affection index computation rule can based on Chinese parsing, the emotion word limited according to field non-in clause and adversative, negative word, the position of adverbial word in clause, the emotion mark of the emotion word that field non-in clause limits is adjusted.
Compared with prior art, the invention has the beneficial effects as follows: the present invention does not need re-training sorter in actual applications, and execution efficiency is higher, take into full account the field animal migration of emotion word, and consider the feature of Chinese expression, emotional semantic classification is different from text classification, to same section of comment, different people may think that it have expressed different Sentiment orientation, therefore the present invention represents the result that text emotion is analyzed instead of concrete feeling polarities by text emotion index, and a given affection index interval: [text emotion Index Min, text emotion index maximal value], affection index value is more close to text emotion index maximal value, then illustrate that the positive emotion tendency that the text is expressed is larger, more close to text emotion minimum value, then illustrate that its negative emotion tendency is larger, thus avoid and too think in absolute terms, effectively improve accuracy rate and the user satisfaction of text emotion analysis.
Accompanying drawing explanation
Fig. 1 is the emotion word schematic diagram that non-field limits.
Fig. 2 is a kind of text emotion index calculation method of the present invention process flow diagram.
Fig. 3 is the concrete operations process flow diagram of Fig. 2 steps A.
Fig. 4 is in Fig. 2 step B, finds the emotion word of the non-field restriction comprised in each clause successively, uses the concrete operations process flow diagram that the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits adjusts.
Fig. 5 is the composition structural representation of non-field emotion dictionary construction device.
Fig. 6 is the composition structural representation of text emotion exponential calculation device.
Embodiment
For making the object, technical solutions and advantages of the present invention clearly, below in conjunction with drawings and Examples, the present invention is described in further detail.
In sentiment analysis research, emotion word plays important role, is the emotion word that non-field limits especially.Such as " good " and " bad " is just the emotion word that non-field limits respectively, because in the field of the overwhelming majority, " good " is all the word of expressing positive emotion, and " bad " is the word of expressing negative emotion.But the word as " height " is exactly field to be limited, such as: " his income is very high ", what now " height " this vocabulary reached is exactly positive emotion, but: " present room rate is very high ", what now " height " expressed is exactly negative emotion.Therefore, the present invention first obtains a large amount of emotion word from various approach such as networks, the Emotional Corpus of multiple FIELD Data is included subsequently by one, calculate the emotion mark of these emotion word, result represents no matter at positive emotion or in negative emotion, and the higher or lower word of score is the word that limits of non-field; Then, by emotion word and the emotion mark of wherein non-field restriction, and based on Chinese parsing, the affection index value of text to be measured is calculated, described affection index can be used for the emotion intensity described expressed by text to be measured, interval index can select-150 to 150, and the mark more trending towards the two poles of the earth illustrates that the negative of text to be measured or positive emotion tend to stronger.
As shown in Figure 1, three circles represent the emotion word having distinguished positive and negative emotion in three fields respectively, and intermediate interdigitated the part of blacking represents three field compathy words, therefore can think that black part is the emotion word that the non-field in these three fields limits.
As shown in Figure 2, a kind of text emotion index calculation method of the present invention includes:
Steps A, build the emotion dictionary that non-field limits, select the emotion word that multiple non-field limits, and the emotion mark of the emotion word limited in described non-field and correspondence thereof is kept in the emotion dictionary that non-field limits;
Step B, according to punctuation mark, text to be calculated is divided into multiple clause, and participle is carried out to each clause, then the emotion word of the non-field restriction comprised in each clause is found successively, the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits is used to adjust, finally according to the emotion mark of the emotion word of the non-field restriction after adjustment, add up the emotion mark of each clause, thus calculate the affection index value of text to be calculated.
The position of emotion word in clause that described affection index computation rule can limit according to other participles in clause (as adversative, negative word, adverbial word) and non-field, adjusts the emotion mark of the emotion word that non-field limits.
As shown in Figure 3, Fig. 2 steps A includes further:
Steps A 1, obtain multiple emotion word, and calculate in corpus according to emotion mark the probability that in the probability and front/or negative document that probability that the front/or negative number of documents, emotion word and front/that include emotion word or negative document occur jointly, front/or negative document occur, emotion word occurs, calculate the co-occurrence mark of each emotion word and positive emotion/or negative emotion respectively.
Described emotion mark calculates corpus can select an Emotional Corpus including multiple FIELD Data, and such as, shown in table 1, the emotion mark selected calculates the suggestion grouped data that corpus includes notebook computer, hotel, these three fields of books:
Table 1 emotion mark calculates corpus
In described steps A 1, the computing formula of the co-occurrence mark of emotion word w and positive emotion is: wherein, c (e p, be w) calculate at emotion mark the front number of documents including emotion word w in corpus; P (e p, w|e p) be the probability that emotion mark calculates that in corpus, emotion word w and front document occur jointly, its value is: n pthe quantity that emotion mark calculates front document in corpus; P (e p) be the probability that emotion mark calculates that in corpus, front document occurs, its value is: c (e p) be the quantity that emotion mark calculates front document in corpus, N dthe quantity that emotion mark calculates all documents in corpus; P (w|e p) be the probability that emotion mark calculates that in the front document of corpus, emotion word w occurs, its value is: c (w|e p) be the number of times that emotion mark calculates that in the front document of corpus, emotion word w occurs, words pit is total word frequency number (the word sum namely in the document of front) that emotion mark calculates in the front document of corpus;
The computing formula of the co-occurrence mark of emotion word w and negative emotion is: wherein, c (e n, be w) calculate at emotion mark the negative number of documents including emotion word w in corpus; P (e n, w|e n) be the probability that emotion mark calculates that in corpus, emotion word w and negative document occur jointly, its value is: n nthe quantity that emotion mark calculates negative document in corpus; P (e n) be the probability that emotion mark calculates that in corpus, negative document occurs, its value is: c (e n) be the quantity that emotion mark calculates negative document in corpus; P (w|e n) be the probability that emotion mark calculates that in the negative document of corpus, emotion word w occurs, its value is: c (w|e n) be the number of times that emotion mark calculates that in the negative document of corpus, emotion word w occurs, words nit is total word frequency number that emotion mark calculates in the negative document of corpus.
Steps A 2, use maximum-minimum method for normalizing, respectively the co-occurrence mark of each emotion word and positive emotion/or negative emotion be normalized, wherein, to the computing formula that the co-occurrence mark of emotion word w and positive emotion is normalized be: cP (e p, w) minthe minimum value of the co-occurrence mark of all emotion word and positive emotion, CP (e p, w) maxit is the maximal value of the co-occurrence mark of all emotion word and positive emotion; To the computing formula that the co-occurrence mark of emotion word w and negative emotion is normalized be: cN (e n, w) minthe minimum value of the co-occurrence mark of all emotion word and negative emotion, CN (e n, w) maxit is the maximal value of the co-occurrence mark of all emotion word and negative emotion;
Steps A 3, difference according to the co-occurrence mark of emotion word and positive emotion, negative emotion, the front polarity calculating each emotion word is poor, and wherein, the computing formula of the front polarity difference of emotion word w is: DValue (w, p, n)=(CP (e p, w)-CN (e nw)) β, DValue (w, p, n) be that the front polarity of emotion word w is poor, β is Dynamic gene parameter, and object makes the front polarity difference of cross-cutting emotion word be greater than 1.0, to facilitate the adjustment follow-up NB Algorithm being carried out to conditional probability, the value of such as β can be set to 10000.
Steps A 4, (described T is a real number being greater than 0 to arrange polarity difference limen value T, such as: T=1.0), and do you judge that the front polarity difference of each emotion word is greater than T or is less than-T? if, then using the front polarity of described emotion word difference as its emotion mark, and described emotion word and emotion mark thereof to be kept in the emotion dictionary that non-field limits.Wherein, if the front polarity difference of described emotion word is greater than T, then illustrate that described emotion word is the positive emotion word that non-field limits, if its front polarity difference is less than-T, then illustrate that described emotion word is the negative emotion word that non-field limits.
If the value of DValue (w, p, n) is greater than T or is less than-T, then illustrate that described emotion word is the emotion word that non-field limits, be saved in the emotion dictionary of non-field restriction; If DValue is (w, p, n) value is at [-T, T] in scope, then illustrate that the difference on the frequency that described emotion word occurs in the front and negative document of emotion mark calculating corpus is few, that is described emotion word may express positive emotion in the field had, and negative emotion then can be expressed in some fields, and therefore it is not the emotion word that non-field limits.
As shown in Figure 4, in Fig. 2 step B, find the emotion word of the non-field restriction comprised in each clause successively, use the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits to adjust, include further:
Do you step B1, judge the emotion word that has non-field to limit in the participle of each clause? if so, then from the emotion dictionary that non-field limits, extract the emotion mark F that the emotion word of described non-field restriction is corresponding, continue next step; If not, then this flow process terminates;
Described step B1 can further include: extract each participle in clause successively, and and the emotion word that limits of the non-field of preserving in the emotion dictionary that limits of non-field contrast, if unanimously, then illustrate that described participle is the emotion word that non-field limits.
Do step B2, the participle judging to be positioned in described clause before emotion word that non-field limits have adversative? if so, then the emotion mark F of the emotion word limited in described non-field is adjusted to 2*F, continues next step; If not, then next step is continued;
Described adversative includes but are not limited to: but but but, wilfully, just but, as, so that, unexpectedly, surprisingly.
Described step B2 can further include: build turnover dictionary, and the participle be arranged in clause before emotion word that non-field limits and the adversative preserved of turnover dictionary are contrasted, if consistent, then have adversative before the emotion word that in described clause, non-field limits is described.
Step B3, the participle judging before non-field limits in described clause emotion word have negative word and the word distance of emotion word that negative word and non-field limit is less than or equal to 2? if, the emotion mark F of the emotion word then limited in described non-field is adjusted to-F, continues next step; If not, then next step is continued;
Described negative word includes but are not limited to: not, do not have.
Do you step B4, judge to have adverbial word in described clause and the word distance of emotion word that adverbial word and non-field limit is less than or equal to 2? if so, then the emotion mark F of the emotion word limited in described non-field is adjusted to 2*F, continues next step; If not, then next step is continued;
Described adverbial word includes but are not limited to: very, especially.
Do you step B5, judge that emotion word that non-field limits is in the end of the sentence of clause? if so, then the emotion mark F of the emotion word limited in described non-field is adjusted to 2*F, and this flow process terminates; If not, then this flow process terminates.
In Fig. 2 step B, according to the emotion mark of the emotion word that the non-field after adjustment limits, add up the emotion mark of each clause, thus calculate the affection index value of text to be calculated, include further:
Step 1, the emotion mark of emotion word limited according to all non-field comprised in clause, calculate the emotion mark of described clause: FU=F (A1)+F (A2)+... + F (An), wherein A1, A2 ... An is the emotion word that the non-field comprised in clause limits respectively, F (A1), F (A2) ... F (An) be non-field limit emotion word A1, A2 ... emotion mark after An adjustment;
Do you step 2, judge that the participle number of described clause is less than participle maximal value (such as: participle maximal value is 20)? if so, then the emotion mark FU of described clause is adjusted to 2*FU, then continues next step; If not, then next step is continued;
Step 3, the affection index value of adding up text to be calculated are the emotion mark sum of all clauses;
Do you step 4, judge that the affection index value of described text to be calculated is greater than text emotion index maximal value (if text emotion index maximal value is 15000)? if, then the affection index value of described text to be calculated is adjusted to text emotion index maximal value, this flow process terminates; If not, then next step is continued;
Do you step 5, judge that the affection index value of described text to be calculated is less than text emotion Index Min (if text emotion Index Min is-15000)? if, then the affection index value of described text to be calculated is adjusted to text emotion Index Min, this flow process terminates; If not, then this flow process terminates.
A kind of text emotion index computing system of the present invention includes non-field emotion dictionary construction device and text emotion exponential calculation device, wherein:
Non-field emotion dictionary construction device, for building the emotion dictionary that non-field limits, selects the emotion word that multiple non-field limits, and the emotion mark of the emotion word limited in described non-field and correspondence thereof is kept in the emotion dictionary of non-field restriction;
Text emotion exponential calculation device, for the text to be calculated of input being divided into multiple clause according to punctuation mark, and participle is carried out to each clause, then the emotion word of the non-field restriction comprised in each clause is found successively, the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits is used to adjust, finally according to the emotion mark of the emotion word of the non-field restriction after adjustment, add up the emotion mark of each clause, thus calculate the affection index value of text to be calculated.
As shown in Figure 5, non-field emotion dictionary construction device includes emotion co-occurrence score calculating unit, normalization unit, front polarity difference computing unit and non-field emotion word judging unit further, wherein:
Emotion co-occurrence score calculating unit, for obtaining multiple emotion word, and include the front/of emotion word or negative number of documents according in emotion mark calculating corpus, the probability that emotion word and front/or negative document occur jointly, the probability that front/or negative document occur, and the probability that in front/or negative document, emotion word occurs, calculate the co-occurrence mark of each emotion word and positive emotion/or negative emotion respectively, and the co-occurrence mark of described emotion word and positive emotion/or negative emotion is sent to normalization unit, described emotion mark calculates corpus can select an Emotional Corpus including multiple FIELD Data, the computing formula of the co-occurrence mark of emotion word w and positive emotion is: wherein, c (e p, be w) calculate at emotion mark the front number of documents including emotion word w in corpus, P (e p, w|e p) be the probability that emotion mark calculates that in corpus, emotion word w and front document occur jointly, its value is: n pthe quantity that emotion mark calculates front document in corpus, p (e p) be the probability that emotion mark calculates that in corpus, front document occurs, its value is: c (e p) be the quantity that emotion mark calculates front document in corpus, N dthe quantity that emotion mark calculates all documents in corpus, p (w|e p) be the probability that emotion mark calculates that in the front document of corpus, emotion word w occurs, its value is: c (w|e p) be the number of times that emotion mark calculates that in the front document of corpus, emotion word w occurs, words pit is total word frequency number (the word sum namely in the document of front) that emotion mark calculates in the front document of corpus, the computing formula of the co-occurrence mark of emotion word w and negative emotion is: wherein, c (e n, be w) calculate at emotion mark the negative number of documents including emotion word w in corpus, P (e n, w|e n) be the probability that emotion mark calculates that in corpus, emotion word w and negative document occur jointly, its value is: n nthe quantity that emotion mark calculates negative document in corpus, p (e n) be the probability that emotion mark calculates that in corpus, negative document occurs, its value is: c (e n) be the quantity that emotion mark calculates negative document in corpus, p (w|e n) be the probability that emotion mark calculates that in the negative document of corpus, emotion word w occurs, its value is: c (w|e n) be the number of times that emotion mark calculates that in the negative document of corpus, emotion word w occurs, words nit is total word frequency number that emotion mark calculates in the negative document of corpus,
Normalization unit, for using maximum-minimum method for normalizing, the emotion word calculate emotion co-occurrence score calculating unit and the co-occurrence mark of positive emotion/or negative emotion are normalized, and the co-occurrence mark of the emotion word after process and positive emotion/or negative emotion is sent to front polarity difference computing unit, wherein: cP (e p, w) minthe minimum value of the co-occurrence mark of all emotion word and positive emotion, CP (e p, w) maxit is the maximal value of the co-occurrence mark of all emotion word and positive emotion; cN (e n, w) minthe minimum value of the co-occurrence mark of all emotion word and negative emotion, CN (e n, w) maxit is the maximal value of the co-occurrence mark of all emotion word and negative emotion;
Front polarity difference computing unit, for the difference of the co-occurrence mark according to the emotion word after normalization cell processing and positive emotion, negative emotion, the front polarity calculating each emotion word is poor, and the front polarity difference of emotion word is sent to non-field emotion word judging unit, wherein: DValue (w, p, n)=(CP (e p, w)-CN (e n, w)) and β, DValue (w, p, n) they are that the front polarity of emotion word w is poor, β is Dynamic gene parameter;
Non-field emotion word judging unit, for arranging polarity difference limen value T (described T is a real number being greater than 0), and judge whether the front polarity difference of the emotion word that front polarity difference computing unit is sent is greater than T or is less than-T, if, then using the front polarity of described emotion word difference as its emotion mark, and described emotion word and emotion mark thereof to be kept in the emotion dictionary that non-field limits.
As shown in Figure 6, text emotion exponential calculation device includes text input unit, text emotion computing unit and emotion word score calculating unit further, wherein:
Text input unit, for inputting text to be calculated, and sends to text emotion computing unit by described text to be calculated;
Text emotion computing unit, for text to be calculated is divided into multiple clause, and participle is carried out to each clause, the emotion word that the non-field of preserving in the emotion the dictionary then each participle in clause and non-field limited one by one limits contrasts, if consistent, then described clause and the emotion word of the non-field restriction of its correspondence are sent to emotion word score calculating unit; Receive the emotion mark of the emotion word of the non-field restriction that emotion word score calculating unit returns, and according to the emotion word that the non-field that each clause comprises limits, add up the emotion mark of each clause, finally according to the emotion mark of all clauses, add up the affection index value of text to be calculated;
Emotion word score calculating unit, the emotion word that the clause sent for receiving text emotion computing unit limits with the non-field of its correspondence, the emotion mark that the emotion word of described non-field restriction is corresponding is extracted from the emotion dictionary that non-field limits, then the emotion mark of affection index computation rule to the emotion word that the non-field comprised in clause limits is used to adjust, the emotion mark of the emotion word finally limited in the non-field after adjustment sends to text emotion computing unit, described affection index computation rule can based on Chinese parsing, the emotion word limited according to field non-in clause and adversative, negative word, the position of adverbial word in clause, the emotion mark of the emotion word that field non-in clause limits is adjusted.
Explain further the present invention in order to clearer, citing is below illustrated:
1, first from knowing download 8936 emotion word, wherein positive emotion word 4566 net (HOWNET), negative emotion word 4370.
2, by the present invention, the positive emotion word that 1074 non-fields limit can be selected, the negative emotion word that 365 non-fields limit.Table 2 lists the emotion mark of emotion word that non-field that emotion mark comes the first five and latter five limits and their correspondences:
The emotion mark of the emotion word that the non-field of table 2 the first five and latter five limits and correspondence
As can be seen from the data of table 2, it is the situation that positive emotion word or negative emotion word meet the restriction of non-field all very much, such as positive emotion word first " liking ", also be that non-field limits in our subjective determination, what express when seldom having field to occur " liking " this word is negative emotion.In like manner, " love ", " satisfaction ", " disappointment ", " gloomy " etc. in table 2 are also all the emotion word that obvious non-field limits.
3, text to be calculated is: weather is good especially, and mood is also good.
(1), clause 1 is: weather is good especially; Clause 2 is: and mood is also good.
(2), for clause 1, the emotion word that ' good ' limits for the non-field be present in the emotion dictionary of non-field restriction, emotion mark is 10 points, occurred again adverbial word ' especially ' in clause 1, and be 1 with the distance of ' good ', and ' good ' is in end of the sentence, therefore, the emotion mark of ' good ' is adjusted to: 10*2*2=40, and because the participle number of clause 1 is less than 20, the emotion mark of then clause 1 is: 40*2=80.
(3), for clause 2, the emotion word that ' well ' limits for the non-field be present in the emotion dictionary of non-field restriction, emotion mark is 10 points, and ' well ' is in clause's end of the sentence, therefore, the emotion mark of ' well ' is adjusted to: 10*2=20, and because the participle number of clause 2 is less than 20, the emotion mark of then clause 2 is: 20*2=40.
(4), the affection index value of text to be calculated is: 80+40=120.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment made, equivalent replacement, improvement etc., all should be included within the scope of protection of the invention.

Claims (11)

1. a text emotion index calculation method, is characterized in that, described method includes:
Steps A, build the emotion dictionary that non-field limits, select the emotion word that multiple non-field limits, and the emotion mark of the emotion word limited in described non-field and correspondence thereof is kept in the emotion dictionary that non-field limits;
Step B, according to punctuation mark, text to be calculated is divided into multiple clause, and participle is carried out to each clause, then the emotion word of the non-field restriction comprised in each clause is found successively, the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits is used to adjust, finally according to the emotion mark of the emotion word of the non-field restriction after adjustment, add up the emotion mark of each clause, thus calculate the affection index value of text to be calculated
In described step B, find the emotion word of the non-field restriction comprised in each clause successively, use the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits to adjust, include further:
Step B1, judge the emotion word that whether has non-field to limit in the participle of each clause, if so, then from the emotion dictionary that non-field limits, extract the emotion mark F that emotion word that described non-field limits is corresponding, continue next step; If not, then this flow process terminates;
Whether step B2, the participle judging to be positioned in described clause before emotion word that non-field limits have adversative, and if so, then the emotion mark F of the emotion word limited in described non-field is adjusted to 2*F, continues next step; If not, then next step is continued; Described adversative includes but are not limited to: but but but, wilfully, just but, as, so that, unexpectedly, surprisingly;
Step B3, the participle judging before emotion word that in described clause, non-field limits whether have negative word and the word distance of emotion word that negative word and non-field limit is less than or equal to 2, if, the emotion mark F of the emotion word then limited in described non-field is adjusted to-F, continues next step; If not, then next step is continued; Described negative word includes but are not limited to: not, do not have;
Step B4, judge whether to have adverbial word in described clause and the word distance of emotion word that adverbial word and non-field limit is less than or equal to 2, if so, then the emotion mark F of the emotion word limited in described non-field is adjusted to 2*F, continues next step; If not, then next step is continued; Described adverbial word includes but are not limited to: very, especially;
Step B5, judge whether emotion word that non-field limits is in the end of the sentence of clause, if so, then the emotion mark F of the emotion word limited in described non-field is adjusted to 2*F.
2. method according to claim 1, is characterized in that, described steps A includes further:
Steps A 1, obtain multiple emotion word, and calculate in corpus according to emotion mark the probability that in the probability and front/or negative document that probability that the front/or negative number of documents, emotion word and front/that include emotion word or negative document occur jointly, front/or negative document occur, emotion word occurs, calculate the co-occurrence mark of each emotion word and positive emotion/or negative emotion respectively, described emotion mark calculates corpus and selects an Emotional Corpus including multiple FIELD Data;
Steps A 2, use maximum-minimum method for normalizing, respectively the co-occurrence mark of each emotion word and positive emotion/or negative emotion is normalized;
Steps A 3, difference according to the co-occurrence mark of emotion word and positive emotion, negative emotion, the front polarity calculating each emotion word is poor;
Steps A 4, polarity difference limen value T is set, described T is a real number being greater than 0, and judge whether the front polarity difference of each emotion word is greater than T or is less than-T, if, then using the front polarity of described emotion word difference as its emotion mark, and described emotion word and emotion mark thereof to be kept in the emotion dictionary that non-field limits.
3. method according to claim 2, is characterized in that, also includes: in described steps A 1, and the computing formula of the co-occurrence mark of emotion word w and positive emotion is: CP ( e p , w ) = c ( e p , w ) log 2 [ P ( e p , w | e p ) p ( e p ) p ( w | e p ) ] , Wherein, c (e p, be w) calculate at emotion mark the front number of documents including emotion word w in corpus; P (e p, w|e p) be the probability that emotion mark calculates that in corpus, emotion word w and front document occur jointly, its value is: n pthe quantity that emotion mark calculates front document in corpus; P (e p) be the probability that emotion mark calculates that in corpus, front document occurs, its value is: c (e p) be the quantity that emotion mark calculates front document in corpus, N dthe quantity that emotion mark calculates all documents in corpus; P (w|e p) be the probability that emotion mark calculates that in the front document of corpus, emotion word w occurs, its value is: c (w|e p) be the number of times that emotion mark calculates that in the front document of corpus, emotion word w occurs, words pit is total word frequency number that emotion mark calculates in the front document of corpus;
The computing formula of the co-occurrence mark of emotion word w and negative emotion is: wherein, c (e n, be w) calculate at emotion mark the negative number of documents including emotion word w in corpus; P (e n, w|e n) be the probability that emotion mark calculates that in corpus, emotion word w and negative document occur jointly, its value is: n nthe quantity that emotion mark calculates negative document in corpus; P (e n) be the probability that emotion mark calculates that in corpus, negative document occurs, its value is: c (e n) be the quantity that emotion mark calculates negative document in corpus; P (w|e n) be the probability that emotion mark calculates that in the negative document of corpus, emotion word w occurs, its value is: c (w|e n) be the number of times that emotion mark calculates that in the negative document of corpus, emotion word w occurs, words nit is total word frequency number that emotion mark calculates in the negative document of corpus.
4. method according to claim 2, is characterized in that, in described steps A 2, to the computing formula that the co-occurrence mark of emotion word w and positive emotion is normalized is: wherein, CP (e p, w) minthe minimum value of the co-occurrence mark of all emotion word and positive emotion, CP (e p, w) maxit is the maximal value of the co-occurrence mark of all emotion word and positive emotion;
To the computing formula that the co-occurrence mark of emotion word w and negative emotion is normalized be: wherein, CN (e n, w) minthe minimum value of the co-occurrence mark of all emotion word and negative emotion, CN (e n, w) maxit is the maximal value of the co-occurrence mark of all emotion word and negative emotion.
5. method according to claim 2, is characterized in that, in described steps A 3, the computing formula of the front polarity difference of emotion word w is: DValue (w, p, n)=(CP (e p, w)-CN (e n, w)) and β, wherein, CP (e p, w) be the co-occurrence mark of emotion word w and positive emotion, CN (e n, w) be the co-occurrence mark of emotion word w and negative emotion, β is Dynamic gene parameter.
6. method according to claim 1, is characterized in that, in described step B, according to the emotion mark of the emotion word that the non-field after adjustment limits, adds up the emotion mark of each clause, thus calculates the affection index value of text to be calculated, include further:
Step 1, the emotion mark of emotion word limited according to all non-field comprised in clause, calculate the emotion mark of described clause: FU=F (A1)+F (A2)+... + F (An), wherein A1, A2 ... An is the emotion word that the non-field comprised in clause limits respectively, F (A1), F (A2) ... F (An) be non-field limit emotion word A1, A2 ... emotion mark after An adjustment;
Step 2, judge whether the participle number of described clause is less than participle maximal value, if so, then the emotion mark FU of described clause is adjusted to 2*FU, then continues next step; If not, then next step is continued;
Step 3, the affection index value of adding up text to be calculated are the emotion mark sum of all clauses;
Step 4, judge whether the affection index value of described text to be calculated is greater than text emotion index maximal value, if so, then the affection index value of described text to be calculated is adjusted to text emotion index maximal value, this flow process terminates; If not, then next step is continued;
Step 5, judge whether the affection index value of described text to be calculated is less than text emotion Index Min, if so, then the affection index value of described text to be calculated is adjusted to text emotion Index Min.
7. a text emotion index computing system, is characterized in that, includes:
Non-field emotion dictionary construction device, for building the emotion dictionary that non-field limits, selects the emotion word that multiple non-field limits, and the emotion mark of the emotion word limited in described non-field and correspondence thereof is kept in the emotion dictionary of non-field restriction;
Text emotion exponential calculation device, for the text to be calculated of input being divided into multiple clause according to punctuation mark, and participle is carried out to each clause, then the emotion word of the non-field restriction comprised in each clause is found successively, the emotion mark of affection index computation rule to the emotion word that each non-field comprised in clause limits is used to adjust, finally according to the emotion mark of the emotion word of the non-field restriction after adjustment, add up the emotion mark of each clause, thus calculate the affection index value of text to be calculated
Text emotion exponential calculation device includes further:
Text input unit, for inputting text to be calculated, and sends to text emotion computing unit by described text to be calculated;
Text emotion computing unit, for text to be calculated is divided into multiple clause, and participle is carried out to each clause, the emotion word that the non-field of preserving in the emotion the dictionary then each participle in clause and non-field limited one by one limits contrasts, if consistent, then described clause and the emotion word of the non-field restriction of its correspondence are sent to emotion word score calculating unit; Receive the emotion mark of the emotion word of the non-field restriction that emotion word score calculating unit returns, and according to the emotion word that the non-field that each clause comprises limits, add up the emotion mark of each clause, finally according to the emotion mark of all clauses, add up the affection index value of text to be calculated;
Emotion word score calculating unit, the emotion word that the clause sent for receiving text emotion computing unit limits with the non-field of its correspondence, the emotion mark that the emotion word of described non-field restriction is corresponding is extracted from the emotion dictionary that non-field limits, then the emotion mark of affection index computation rule to the emotion word that the non-field comprised in clause limits is used to adjust, the emotion mark of the emotion word finally limited in the non-field after adjustment sends to text emotion computing unit, described affection index computation rule can based on Chinese parsing, the emotion word limited according to field non-in clause and adversative, negative word, the position of adverbial word in clause, the emotion mark of the emotion word that field non-in clause limits is adjusted.
8. system according to claim 7, is characterized in that, described non-field emotion dictionary construction device includes further:
Emotion co-occurrence score calculating unit, for obtaining multiple emotion word, and include the front/of emotion word or negative number of documents according in emotion mark calculating corpus, the probability that emotion word and front/or negative document occur jointly, the probability that front/or negative document occur, and the probability that in front/or negative document, emotion word occurs, calculate the co-occurrence mark of each emotion word and positive emotion/or negative emotion respectively, and the co-occurrence mark of described emotion word and positive emotion/or negative emotion is sent to normalization unit, described emotion mark calculates corpus and selects an Emotional Corpus including multiple FIELD Data,
Normalization unit, for using maximum-minimum method for normalizing, the emotion word calculate emotion co-occurrence score calculating unit and the co-occurrence mark of positive emotion/or negative emotion are normalized, and the co-occurrence mark of the emotion word after process and positive emotion/or negative emotion is sent to front polarity difference computing unit;
Front polarity difference computing unit, for the difference of the co-occurrence mark according to the emotion word after normalization cell processing and positive emotion, negative emotion, the front polarity calculating each emotion word is poor, and the front polarity difference of emotion word is sent to non-field emotion word judging unit;
Non-field emotion word judging unit, for arranging polarity difference limen value T, described T is a real number being greater than 0, and judge whether the front polarity difference of the emotion word that front polarity difference computing unit is sent is greater than T or is less than-T, if, then using the front polarity of described emotion word difference as its emotion mark, and described emotion word and emotion mark thereof to be kept in the emotion dictionary that non-field limits.
9. system according to claim 8, is characterized in that, the computing formula of the co-occurrence mark of emotion word w and positive emotion is: wherein, c (e p, be w) calculate at emotion mark the front number of documents including emotion word w in corpus; P (e p, w|e p) be the probability that emotion mark calculates that in corpus, emotion word w and front document occur jointly, its value is: n pthe quantity that emotion mark calculates front document in corpus; P (e p) be the probability that emotion mark calculates that in corpus, front document occurs, its value is: c (e p) be the quantity that emotion mark calculates front document in corpus, N dthe quantity that emotion mark calculates all documents in corpus; P (w|e p) be the probability that emotion mark calculates that in the front document of corpus, emotion word w occurs, its value is: c (w|e p) be the number of times that emotion mark calculates that in the front document of corpus, emotion word w occurs, words pit is total word frequency number that emotion mark calculates in the front document of corpus;
The computing formula of the co-occurrence mark of emotion word w and negative emotion is: wherein, c (e n, be w) calculate at emotion mark the negative number of documents including emotion word w in corpus; P (e n, w|e n) be the probability that emotion mark calculates that in corpus, emotion word w and negative document occur jointly, its value is: n nthe quantity that emotion mark calculates negative document in corpus; P (e n) be the probability that emotion mark calculates that in corpus, negative document occurs, its value is: c (e n) be the quantity that emotion mark calculates negative document in corpus; P (w|e n) be the probability that emotion mark calculates that in the negative document of corpus, emotion word w occurs, its value is: c (w|e n) be the number of times that emotion mark calculates that in the negative document of corpus, emotion word w occurs, words nit is total word frequency number that emotion mark calculates in the negative document of corpus.
10. system according to claim 8, is characterized in that, to the computing formula that the co-occurrence mark of emotion word and positive emotion is normalized is: CP ( e p , w ) = CP ( e p , w ) - CP ( e p , w ) min CP ( e p , w ) max CP ( e p , w ) min , Wherein, CP (e p, w) minthe minimum value of the co-occurrence mark of all emotion word and positive emotion, CP (e p, w) maxit is the maximal value of the co-occurrence mark of all emotion word and positive emotion;
To the computing formula that the co-occurrence mark of emotion word and negative emotion is normalized be: cN (e n, w) minthe minimum value of the co-occurrence mark of all emotion word and negative emotion, CN (e n, w) maxit is the maximal value of the co-occurrence mark of all emotion word and negative emotion.
11. systems according to claim 8, is characterized in that, the computing formula of the front polarity difference of emotion word w is: DValue (w, p, n)=(CP (e p, w)-CN (e n, w)) and β, wherein, CP (e p, w) be the co-occurrence mark of emotion word w and positive emotion, CN (e n, w) be the co-occurrence mark of emotion word w and negative emotion, β is Dynamic gene parameter.
CN201210404224.XA 2012-10-22 2012-10-22 Method and system for calculating text emotion index Active CN102929861B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210404224.XA CN102929861B (en) 2012-10-22 2012-10-22 Method and system for calculating text emotion index

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210404224.XA CN102929861B (en) 2012-10-22 2012-10-22 Method and system for calculating text emotion index

Publications (2)

Publication Number Publication Date
CN102929861A CN102929861A (en) 2013-02-13
CN102929861B true CN102929861B (en) 2015-07-22

Family

ID=47644662

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210404224.XA Active CN102929861B (en) 2012-10-22 2012-10-22 Method and system for calculating text emotion index

Country Status (1)

Country Link
CN (1) CN102929861B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105378707A (en) * 2013-04-11 2016-03-02 朗桑有限公司 Entity extraction feedback
CN103473380B (en) * 2013-09-30 2016-04-13 南京大学 A kind of computer version sensibility classification method
CN103678720B (en) * 2014-01-02 2017-02-22 中国标准化研究院 Method and device for processing user feedback data
CN106470363B (en) 2015-08-18 2019-09-13 阿里巴巴集团控股有限公司 Compare the method and device of race into row written broadcasting live
CN106855852B (en) * 2015-12-09 2020-09-15 北京国双科技有限公司 Statement emotion determining method and device
US10706368B2 (en) * 2015-12-30 2020-07-07 Veritas Technologies Llc Systems and methods for efficiently classifying data objects
CN107180021A (en) * 2016-03-09 2017-09-19 北京京东尚科信息技术有限公司 A kind of data processing method, system and its server
CN108090040B (en) * 2016-11-23 2021-08-17 北京国双科技有限公司 Text information classification method and system
CN107688651B (en) * 2017-08-31 2021-11-16 平安科技(深圳)有限公司 News emotion direction judgment method, electronic device and computer readable storage medium
CN107967337B (en) * 2017-12-05 2021-10-15 云南大学 Cross-domain emotion analysis method based on emotion polarity enhancement semantics
CN108304452B (en) * 2017-12-14 2021-03-26 腾讯科技(深圳)有限公司 Article processing method and device and storage medium
CN108877336A (en) * 2018-03-26 2018-11-23 深圳市波心幻海科技有限公司 Teaching method, cloud service platform and tutoring system based on augmented reality
CN108647281B (en) * 2018-05-03 2023-11-14 腾讯科技(深圳)有限公司 Webpage access risk detection and prompting method and device and computer equipment
CN110287405B (en) * 2019-05-21 2021-06-01 百度在线网络技术(北京)有限公司 Emotion analysis method, emotion analysis device and storage medium
CN110334182A (en) * 2019-06-24 2019-10-15 中国南方电网有限责任公司 Online service method with speech emotion recognition
CN113779258B (en) * 2021-11-10 2022-03-11 上海蜜度信息技术有限公司 Method for analyzing public satisfaction, storage medium and electronic device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101894102A (en) * 2010-07-16 2010-11-24 浙江工商大学 Method and device for analyzing emotion tendentiousness of subjective text
CN102323944A (en) * 2011-09-02 2012-01-18 苏州大学 Sentiment classification method based on polarity transfer rules
CN102682124A (en) * 2012-05-16 2012-09-19 苏州大学 Emotion classifying method and device for text

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101894102A (en) * 2010-07-16 2010-11-24 浙江工商大学 Method and device for analyzing emotion tendentiousness of subjective text
CN102323944A (en) * 2011-09-02 2012-01-18 苏州大学 Sentiment classification method based on polarity transfer rules
CN102682124A (en) * 2012-05-16 2012-09-19 苏州大学 Emotion classifying method and device for text

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
robability Adjustment Naive Bayes Algorithm Based on Nondomain-Specific Sentiment and Evaluation Word for Domain-Transfer Sentiment Analysis;Wen Fan et al.;《 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery》;20110728;第2卷;摘要、第1044-1045页第II,III节 *

Also Published As

Publication number Publication date
CN102929861A (en) 2013-02-13

Similar Documents

Publication Publication Date Title
CN102929861B (en) Method and system for calculating text emotion index
CN106919673B (en) Text mood analysis system based on deep learning
CN105183833B (en) Microblog text recommendation method and device based on user model
CN110427623A (en) Semi-structured document Knowledge Extraction Method, device, electronic equipment and storage medium
CN110888990B (en) Text recommendation method, device, equipment and medium
CN110110225B (en) Online education recommendation model based on user behavior data analysis and construction method
CN109086355B (en) Hot-spot association relation analysis method and system based on news subject term
CN105912576A (en) Emotion classification method and emotion classification system
CN110705247B (en) Based on x2-C text similarity calculation method
CN111309864B (en) User group emotional tendency migration dynamic analysis method for microblog hot topics
CN109446423B (en) System and method for judging sentiment of news and texts
CN104199845B (en) Line Evaluation based on agent model discusses sensibility classification method
Lin et al. Rumor detection with hierarchical recurrent convolutional neural network
CN109960791A (en) Judge the method and storage medium, terminal of text emotion
CN113962293A (en) LightGBM classification and representation learning-based name disambiguation method and system
CN110399603A (en) A kind of text-processing technical method and system based on sense-group division
Chang et al. A METHOD OF FINE-GRAINED SHORT TEXT SENTIMENT ANALYSIS BASED ON MACHINE LEARNING.
WO2023050652A1 (en) Text recognition-based method for determining esg index in region, and related product
Samsudin et al. Mining opinion in online messages
Wei et al. Online education recommendation model based on user behavior data analysis
CN104794209A (en) Chinese microblog sentiment classification method and system based on Markov logic network
Sanchez-Gomez et al. Sentiment-oriented query-focused text summarization addressed with a multi-objective optimization approach
Huang et al. Contribution of improved character embedding and latent posting styles to authorship attribution of short texts
Gan et al. Microblog sentiment analysis via user representative relationship under multi-interaction hybrid neural networks
CN115510269A (en) Video recommendation method, device, equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CP02 Change in the address of a patent holder
CP02 Change in the address of a patent holder

Address after: 310013, Zhejiang, Xihu District, Wensanlu Road, No. 398, 4 floor, Hangzhou

Patentee after: EB Information Technology Ltd.

Address before: 100191 Beijing, Zhichun Road, No. 9, hearing the building on the floor of the 7 floor,

Patentee before: EB Information Technology Ltd.

CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: 310013 4th floor, No.398 Wensan Road, Xihu District, Hangzhou City, Zhejiang Province

Patentee after: Xinxun Digital Technology (Hangzhou) Co.,Ltd.

Address before: 310013 4th floor, No.398 Wensan Road, Xihu District, Hangzhou City, Zhejiang Province

Patentee before: EB Information Technology Ltd.