CN103699626B - Method and system for analysing individual emotion tendency of microblog user - Google Patents

Method and system for analysing individual emotion tendency of microblog user Download PDF

Info

Publication number
CN103699626B
CN103699626B CN201310711626.9A CN201310711626A CN103699626B CN 103699626 B CN103699626 B CN 103699626B CN 201310711626 A CN201310711626 A CN 201310711626A CN 103699626 B CN103699626 B CN 103699626B
Authority
CN
China
Prior art keywords
word
topic
emotion
user
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310711626.9A
Other languages
Chinese (zh)
Other versions
CN103699626A (en
Inventor
王伟凝
刘剑聪
韦岗
王励
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China University of Technology SCUT
Original Assignee
South China University of Technology SCUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China University of Technology SCUT filed Critical South China University of Technology SCUT
Priority to CN201310711626.9A priority Critical patent/CN103699626B/en
Publication of CN103699626A publication Critical patent/CN103699626A/en
Application granted granted Critical
Publication of CN103699626B publication Critical patent/CN103699626B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

The invention discloses a method for analysing individual emotion tendency of microblog user. The method comprises the following steps of: acquiring data, separating words, loading a word bank and an emoticon bank, establishing the interested topic bank of the user, dividing short sentences, extracting emotion elements, establishing the individual locution list of the user, calculating a locution emotion value, calculating the topic emotion tendency of the user, and calculating the overall emotion tendency of the user. The invention further discloses a system for analysing individual emotion tendency of microblog user. According to the method and the system disclosed by the invention, an emotion analysis on the single microblog user is realized, and the emotion analysis on the user is combined with specific topics to avoid an indistinct and stiff analysis mode, thus the emotion analysis on the user is more meticulous and directional, and the accuracy of the emotion tendency analysis is improved.

Description

A kind of microblog users individualized emotion trend analysis method and system
Technical field
The present invention relates to microblog data process field, particularly to a kind of microblog users individualized emotion trend analysis method And system.
Background technology
Microblogging is the random platform of a freedom, and its information is short and small, issues rapid, user delivers oneself frequently by microblogging Subjective feeling to various events and comment object, share with other people oneself value, suggestion, emotion etc..In micro-blog information Contain a lot of emotion words, contain abundant emotion information.The environment freely loosening, makes emotion in user's microblog data Expression information can deeper into, exactly reflect user Sentiment orientation.
The sentiment analysis research work of Chinese microblogging, is primarily directed to certain particular event and theme is carried out at present, analysis All related micro-blog information texts, extract Emotion element, carry out statistical analysis, the emotion information of microblogging are carried out point Class, mark and prediction, achieve certain achievement.But current research is primarily upon the sentiment analysis of micro-blog information or colony uses The heartbeat conditions at family, for single microblog users Sentiment orientation analysis and research not yet carry out in a deep going way, not individually for The Sentiment orientation analysis of user.And, the analysis to Sentiment orientation, it does not refine to each specific aspect in social life yet, The specific aim that this results in sentiment analysis is not strong, the accuracy of analysis and prediction, comprehensive waits to improve further.
The emotional expression mode of microblogging is personalized it is necessary to the individualized feature introducing user just can obtain more accurately Analysis result.The much-talked-about topic conversion of microblogging is very fast, and active user is relatively stable.The trend of impact event development is institute There are the microblog users of participation, the emotion model of user is metastable.By the analysis to user individual emotion, Ke Yigeng Emotion, the development of predicted events and the change of accurate more meticulously labelling micro-blog information.The user feeling analysis information set up is also Can be with life-time service, the accumulation with data can be more and more accurate.
By the individualized emotion analytical technology to microblog users, can analyze judge they to hot issue, particularly By liking or hatred degree of, special object or product, excavate business therein and social value, before there is wide application Scape, such as 1) public sentiment monitoring, the trend analysis and prediction of much-talked-about topic, sentiment analysis of social group etc.;2) stock market, popular disease The trend analysis and prediction such as disease, election;3) user behavior analysis based on big data, such as propensity to consume, user preferences etc..Micro- The research of rich user individual Sentiment orientation analysis method has important learning value and social meaning.
Content of the invention
In order to overcome disadvantages mentioned above and the deficiency of prior art, it is an object of the invention to provide a kind of microblog users individual character Change Sentiment orientation analysis method it is achieved that sentiment analysis to microblogging unique user, make the sentiment analysis of user more careful, more Directional.
Another object of the present invention is to providing a kind of microblog users individualized emotion trend analysis system.
The purpose of the present invention is achieved through the following technical solutions:
A kind of microblog users individualized emotion trend analysis method, comprises the following steps:
(1) gather all data of the microblogging homepage of each user, be stored in data base;
(2) text data in the microblog data that step (1) is collected carries out participle, obtains participle set and part of speech mark Note;
(3) dictionary, emoticon storehouse needed for loading;Described dictionary includes hownet sentiment word lexicon, degree dictionary, negative Dictionary, personal pronoun dictionary, function word is connected dictionary, cyberspeak dictionary and classified lexicon;
(4) using the interlayer based on word frequency, aggregating algorithm sets up user's topic of interest storehouse upwards:
(4-1) set up topic tree: filter out the word not having topic meaning in user version data, obtain there is obvious topic The word of information, using classified lexicon, counts word frequency, sets up topic tree;Described topic tree is hierarchical structure, and ground floor is one Level topic classification, the second layer is two grades of sub-topic classification, and third layer is classified for three-level sub-topic;The described word not having topic meaning Language includes degree word, negative word, personal pronoun, function word, link word, adjective;
(4-2) according to topic tree, by the aggregating algorithm upwards of the interlayer based on word frequency, successively extract high frequency topic;
(4-3) set up a main split, for placing popular spy on the topic word that cannot be included into father's layer topic and network There are topic word or phrase, obtain conventional topic storehouse;Word in microblog data is mated with the word in conventional topic storehouse Corresponding, the topic word language that occurrence number in the microblog data of user is exceeded threshold value extracts, and also serves as high frequency topic;
(4-4) the high frequency topic obtaining step (4-2) and (4-3), as user's topic of interest word, sets up user's sense Interest topic storehouse;
(5) microblog data that step (1) is collected divide short sentence it is ensured that each short sentence at most contain one interested Topic word;
(6) extract the Emotion element in each short sentence, calculate the initial emotion value of each short sentence:
(6-1) set of words of short sentence is carried out mating mapping with each dictionary and emoticon storehouse, mark all kinds of emotions Element;Described Emotion element includes emotion word, degree word, negative word, punctuation mark, emoticon, wherein degree word and punctuate Symbol is all used for adjusting the degree of emotion word, and negative word is used for adjusting the polarity of emotion word;
(6-2) the emotion value of calculating short sentence Chinese version:
The weights of setting Emotion element: positive emotion word weights are "+1 ", negative emotion word weights are " -1 ";Negative word is weighed It is worth for " -1 ";Degree word and punctuation mark, according to the depth of its degree, arrange weights, and weights scope is between 0 to 3;Degree word The emotion word being affected with punctuation mark follows nearby principle, and that is, each degree word or punctuation mark affect apart from its nearest feelings The emotion degree of sense word;
Emotion value i of short sentence Chinese versionwordsComputational methods be:
i w o r d s = b · σ i = 1 m ( σ j = 1 n c i j · f i j ) · q i
In formula, qiRepresent i-th emotion word, cijRepresent and modify qiJ-th degree word weights, fijRepresent and modify qi? J negative word weights;If qiDo not attach degree word, then cijTake default value 1;If qiDo not attach negative word, then fijTake acquiescence Value 1;N takes modification qiDegree word number and modify qiNegative word number in maximum, m represents the number of emotion word, b table The corresponding weights of indicating point symbol, i, j are positive integer;
(6-3) calculate short sentence in emoticon emotion value:
The expression providing for microblogging operator, it is divided into front for the contribution of Sentiment orientation, negatively, neutral three kinds Situation: the weights of front emoticon are set to "+1 ", the weights of negative emoticon are set to " -1 ", the weights of neutral expression's symbol It is set to " 0 ";
Emoticon emotion value i in short sentencemarksComputational methods be:
i m a r k s = σ i = 1 l m i
In formula, miRepresent the expression of i-th table front, negative or neutral emotion, i is positive integer, and l is emoticon Number;
(6-4) calculate initial emotion value i of short sentence0:
i0=iwords+imarks
(7) to the text data after step (2) process, extract the word combination of high frequency using word slip window sampling, obtain To user individual idiom list;
(8) statistical analysiss are carried out to the initial emotion value of all short sentences comprising each bar idiom, draw the feelings of idiom Inductance value;
For every idiom, find out all short sentences containing this idiom, its initial emotion value sum-average arithmetic calculates Method is as follows:
i g = 1 p σ i = 1 p i 0 i
In formula, i0iFor the initial emotion value of i-th short sentence comprising this idiom, p is the short sentence number containing this idiom, igEmotion initial value for this idiom;
By igValue be mapped in [- 3,3], obtain emotion value i of idiom 'g, record usual in the personalization of this user In language emotion labelling table;
(9) calculate the individualized emotion value of each short sentence, computational methods are:
i = i 0 - σ i = 1 m ′ ( σ j = 1 n ′ c g i j · f g i j ) · q g i + σ k = 1 r i g k ′
In formula, i0For the initial emotion value of short sentence, qgiRepresent i-th word, cgijRepresent and modify qgiJ-th degree word Weights, fgijRepresent and modify qgiJ-th negative word weights;If qgiDo not attach degree word, then cgijTake default value 1;If qgiNot yet There is subsidiary negative word, then fgijTake default value 1;N' takes modification qgiThe number of degree word and modify qgiThe number of negative word in Maximum;M' represents the number of word, and i, j are positive integer;i'gkRepresent the emotion value of k-th idiom, r represents that this is short The number of idiom in sentence;
(10) Sentiment orientation of calculating user's topic of interest:
For any user topic of interest word in user's topic of interest storehouse, it is calculated as follows its emotion value:
i topic i = 1 w σ j = 1 w i j
ijFor comprising the individualized emotion value of j-th short sentence of this user's topic of interest word, w is to comprise this user to feel emerging The short sentence sum of funny remarks epigraph,Emotion value for this user's topic of interest word;WillValue be mapped in [- 3,3], Obtain the topic Sentiment orientation value of final user, using these values, set up user individual microblog topic emotion value list.
After carrying out step (10), can also follow the steps below:
A () repeats step (1)~(7), for the idiom adding new in user individual idiom list, by step Suddenly (8) are calculated emotion value i of this idiom 'g;It is reported in user individual idiom list for before this circulation In idiom, then by the following method update i 'g: to every idiom, carry out step (8) first, obtain the i of epicycle circulationg's Value of calculation ig_newIf, ig_prevThe i obtaining for last round of circulationgValue of calculation, then igIt is updated to:
ig1ig_prev2ig_new
In formula, ω1For ig_prevWeights, ω2For ig_newWeights;
By igValue be mapped in [- 3,3], obtain emotion value i of idiom 'g
B i ' that () obtains according to step (a)g, carry out step (9)~(10).
After carrying out step (10), can also follow the steps below:
Repeat step (1)~(9), for emerging in the new user's sense adding in user's topic of interest storehouse in this circulation Funny remarks is write inscription, and calculates the topic Sentiment orientation of user by the method for step (10);It is reported in user for before this circulation User's topic of interest word in topic of interest storehouse, then update the topic Sentiment orientation i of user by the following methodtopic: first Carry out step (10), obtain the i of epicycle circulationtopicValue of calculation itopic_newIf, itopic_prevLast round of circulation obtains itopicValue of calculation, then itopicIt is updated to:
itopic=ω '1itopic_prev+ω'2itopic_new
ω'1For itopic_prevWeights, ω '2For itopic_newWeights.
After carrying out step (10), also follow the steps below:
According to following formula calculating user's overall emotion propensity value:
i u s e r = 1 s σ i = 1 s i i
In formula, iiRepresent the emotion value of i-th short sentence, s represents short sentence sum.
All data of the described microblogging homepage gathering each user of step (1), particularly as follows:
In units of user, collect all data in its homepage;Described data includes oneself sending out in user home page face Cloth and forwarding microblog data, the data of " comment sending ", the data of "@mentions mine ", microblogging name and user of interest Microblogging name, self-introduction, spontaneous or forward the title in the url webpage and video link that comprise in microblogging.
Step (4-2) described according to topic tree, by the aggregating algorithm upwards of the interlayer based on word frequency, successively extract high frequency words Topic, particularly as follows:
To each topic word, if its occurrence number is higher than the threshold value setting, this topic is high frequency topic, otherwise, will The occurrence number of this topic word passes to father's layer topic word, successively calculates and extracts high frequency topic;The occurrence number of topic word includes this The occurrence number of topic itself and the occurrence number of the sub-topic of this topic.
The described data collecting step (1) of step (5) divides short sentence, particularly as follows:
Each microblog data of analysis, if not having topic word or pertaining only to a topic, whole piece in a microblog data Microblogging is as a short sentence;
If one in microblog data contain multiple topic word, combine punctuation mark degree of priority analyze: if two away from Have punctuate between maximum topic word, be then split into two short sentences at punctuate, if the maximum topic word of two distances it Between there is no punctuate, then checking time the topic word of big distance, if all there is no punctuate, not splitting, whole piece microblog data conduct One short sentence;Described distance is to reach another topic word tree number to be passed through from a topic word in topic tree;If There are multiple punctuates between two topic word, then choose and divided at the high punctuate of relative importance value: in punctuation mark relative importance value, fullstop > branch > comma.
Text data after the described process to step (2) of step (7), extracts the word combination of high frequency, particularly as follows:
If the length of window of sliding window is w, w is the word number that sliding window comprises, and successively takes 1,2,3,4 respectively;Profit Count in all short sentences with word sliding window, the total degree of each word or phrase appearance, will appear from number of times and be more than threshold value User individual idiom list listed in word or phrase;Collect daily statement word, set up daily statement phrase database;By daily table Predicate is rejected from user individual idiom list, obtains user individual idiom list.
Realize the microblog users individualized emotion trend analysis system of above-mentioned analysis method, including
Data acquisition module, for gathering all data of the microblogging homepage of each user, is stored in data base;
Word-dividing mode, for carrying out participle to the text data in the data collecting, obtains participle set and part of speech mark Note;
Dictionary load-on module, for loading required dictionary, emoticon storehouse;Described dictionary includes hownet emotion word word Storehouse, degree dictionary, negates dictionary, personal pronoun dictionary, and function word is connected dictionary, cyberspeak dictionary and classified lexicon;
Module is set up in user's topic of interest storehouse, for using the interlayer based on word frequency, aggregating algorithm sets up user's sense upwards Interest topic storehouse;
Short sentence division module, divide short sentence for the data that arrives data collecting module collected it is ensured that each short sentence extremely Contain a topic of interest word more;
Emotion element extraction module, for extracting the Emotion element in each short sentence, calculates the initial emotion of each short sentence Value;
Module is set up in user individual idiom list, for the data that data acquisition module is collected, using word Slip window sampling extracts the word combination of high frequency, obtains user individual idiom list;
Idiom emotion value computing module, for carrying out statistical to the emotion value of all short sentences comprising each bar idiom Analysis, draws the emotion value of idiom;
Short sentence emotion value computing module, for calculating the individualized emotion value of each short sentence;
User's topic Sentiment orientation computing module, for calculating the Sentiment orientation of each topic of interest of user.
Described microblog users individualized emotion trend analysis system, also includes user's overall emotion tendency computing module, Overall emotion for calculating user is inclined to.
Compared with prior art, the present invention has advantages below and a beneficial effect:
(1) present invention achieves sentiment analysis to microblogging unique user, by the sentiment analysis of user with to concrete topic Combine, it is to avoid general mechanical analytical model, make the sentiment analysis to user more careful, more directional.From user Habitual expression way consider, using user individual idiom as one of the element of sentiment analysis, be conducive to improving emotion The accuracy of trend analysis.
(2) present invention is directed to user, carries out sentiment analysis to its microblog data, user can be helped to be appreciated more fully from itself And the hobby of other users;When the new speech focus of one section of appearance in network, it is possible to use the result that this method obtains, quickly Draw interest level and the Sentiment orientation of this user, to predict speech and the reaction of user;Product development business, operator and wide Accuse business and utilize this method, user interested in pushed away commodity or service can be found out, contribute to product development business offer fuller The commodity of sufficient demand, contributing to operator provides more humane intimate service, contributes to advertiser and is directed to user's input advertisement; Market demand reference can be provided for many industries;Contribute to public sentiment monitoring.
Brief description
Fig. 1 is the flow chart of the microblog users individualized emotion trend analysis method of embodiments of the invention.
Fig. 2 is the data acquisition of microblog users individualized emotion trend analysis method and the pretreatment of embodiments of the invention The particular flow sheet of process.
Fig. 3 is the particular flow sheet setting up user's topic of interest storehouse of embodiments of the invention.
Fig. 4 is the schematic diagram of the topic tree set up by the microblog data of certain user in embodiments of the invention.
Fig. 5 is the particular flow sheet successively extracting high frequency topic in embodiments of the invention from topic tree.
Fig. 6 is the concrete setting schematic diagram of the emoticon weights of embodiments of the invention.
Fig. 7 is the exemplary plot (w takes 1) of the utilization word sliding window extraction word combination of embodiments of the invention.
Fig. 8 is the exemplary plot (w takes 3) of the utilization word sliding window extraction word combination of embodiments of the invention.
Fig. 9 is the idiom emotion value calculation flow chart of embodiments of the invention.
Figure 10 is step (7)~(8) of microblog users individualized emotion trend analysis method of embodiments of the invention Particular flow sheet.
Figure 11 is step (5)~(11) of microblog users individualized emotion trend analysis method of embodiments of the invention Particular flow sheet.
Figure 12 is the structural representation of the microblog users individualized emotion trend analysis system of embodiments of the invention.
Specific embodiment
With reference to embodiment, the present invention is described in further detail, but embodiments of the present invention not limited to this.
Embodiment
As shown in figure 1, the microblog users individualized emotion trend analysis method of the present embodiment, comprise the following steps:
(1) gather all microblog data of the microblogging homepage of each user, be stored in data base:
In units of user, collect all data in its homepage;Described data includes oneself sending out in user home page face Cloth and forwarding microblog data, the data of " comment sending ", the data of "@mentions mine ", microblogging name and user of interest Microblogging name, self-introduction, spontaneous or forward the title in the url webpage and video link that comprise in microblogging.
(2) text data in the microblog data that step (1) is collected is read, using Chinese in units of microblogging paragraph by paragraph The segmenting method of morphological analysis system ictclas carries out participle operation, obtains participle set and corresponding part-of-speech tagging.
(3) dictionary, emoticon storehouse needed for loading;Described dictionary includes hownet sentiment word lexicon, degree dictionary, negative Dictionary, personal pronoun dictionary, function word is connected dictionary, cyberspeak dictionary and classified lexicon;Wherein, hownet sentiment word lexicon, Degree dictionary, each word in negative dictionary is with weights;Chinese word is only taken, by it in hownet sentiment word lexicon In positive emotion word and front evaluates word be classified as positive emotion word, by negative emotion word therein and unfavorable ratings Word is classified as negative emotion word;With regard to degree word and negative word, using self-built degree word dictionary and negative word dictionary, wherein Degree word dictionary comprises degree word 219, and negative dictionary comprises negative word 48.Classified lexicon is using the qq input method of improvement Classified lexicon, is hierarchical tree structure;Dictionary ground floor be maximum kind, comprise game, disciplines, hobby, sports and amusement, Culture and arts, area etc., the dictionary second layer is the subclasses of ground floor, for example, is divided into reason below " disciplines " big class again Workers and peasants doctor, social sciences economy, education military affairs etc., dictionary third layer is the subclasses of second layer classification, such as divides below social sciences economy For equity fund, law, commodity, finance etc.;Under the catalogue of whole classified lexicon, have 6223 and do not subdivide dictionary;Expression Symbolic library passes through to collect the emoticon of microblog and provide weights to obtain.
Step (1)~(3) are data acquisition and preprocessing process, and particular flow sheet is shown in Fig. 2.
(4) using the interlayer based on word frequency, aggregating algorithm sets up user's topic of interest storehouse upwards, as shown in figure 3, step As follows:
(4-1) set up topic tree: filter out the word not having topic meaning in user version data, obtain there is obvious topic The word of information, using classified lexicon, counts word frequency, sets up topic tree;Described topic tree is hierarchical structure, and ground floor is one Level topic classification, the second layer is two grades of sub-topic classification, and third layer is classified for three-level sub-topic;The described word not having topic meaning Language includes degree word, negative word, personal pronoun, function word, link word, adjective;
Fig. 4 is the topic tree set up by the microblog data of certain user.In square frame, corresponding numeral is this topic word and its son The number of times that topic word occurs.
(4-2) according to topic tree, by the aggregating algorithm upwards of the interlayer based on word frequency, successively extract high frequency topic: to every One topic word, if its occurrence number is higher than the threshold value setting, this topic is high frequency topic, otherwise, going out this topic word Occurrence number passes to his father's layer topic word, successively calculates and extracts high frequency topic;The occurrence number of topic word includes this topic itself Occurrence number and the occurrence number of its sub-topic;
Operation example as shown in figure 5, in figure represents the high frequency topic extracting with dashed rectangle, extracted altogether four high Frequency topic.
(4-3) set up a main split, for placing popular spy on the topic word that cannot be included into father's layer topic and network There are topic word or phrase, obtain conventional topic storehouse, this branch does not have layer architecture;By the word in microblog data and conventional words Word in exam pool carries out coupling and corresponds to, and the topic word language that occurrence number in the microblog data of user is exceeded threshold value extracts Come, also serve as high frequency topic;
(4-4) the high frequency topic obtaining step (4-2) and (4-3), as user's topic of interest word, sets up user's sense Interest topic storehouse.
(5) data that step (1) collects is divided short sentence it is ensured that each short sentence at most contains a topic of interest Word;
The described data collecting step (1) divides short sentence, particularly as follows: each microblog data of analysis, if one is micro- There is no topic word in rich data or pertain only to a topic, then whole piece microblogging is as a short sentence;If one containing in microblog data There are multiple topic word, then combine punctuation mark degree of priority and analyze: if having punctuate between the topic word of two distance maximums, It is split into two short sentences at punctuate;If there is no punctuate between the topic word of two distance maximums, then check time big distance Topic word;If all there is no punctuate, do not split, whole piece microblog data is as a short sentence;Described distance is in topic tree In reach another topic word tree number to be passed through from a topic word;If having multiple punctuates between two topic word, Choose and divided at the high punctuate of relative importance value: in punctuation mark relative importance value, fullstop > branch > comma.
(6) extract the Emotion element in each short sentence, calculate the initial emotion value of each short sentence:
(6-1) set of words of short sentence is carried out mating mapping with each dictionary and emoticon storehouse, mark all kinds of emotions Element;Described Emotion element includes emotion word, degree word, negative word, punctuation mark, emoticon, wherein degree word and punctuate Symbol is all used for adjusting the degree of emotion word, and negative word is used for adjusting the polarity of emotion word;
(6-2) the emotion value of calculating short sentence Chinese version:
The weights of setting Emotion element: positive emotion word weights are "+1 ", negative emotion word weights are " -1 ";Negative word is weighed It is worth for " -1 ";Degree word and punctuation mark, according to the depth of its degree, arrange weights, and weights scope is 0 to 3;Degree word and mark The emotion word that point symbol is affected follows nearby principle, and that is, each degree word or punctuation mark affect apart from its nearest emotion word Emotion degree;
Weights setting citing such as table 1~4:
Table 1 hownet emotion word
Front word Language Negation words Language
Love is doted on Regret
Love and esteem Annoyed
Good and sound Dim
Degree word weights commonly used by table 2
Degree word Degree value
Too 3
Very 2.5
Very 2
Negative word weights commonly used by table 3
Table 4 punctuation mark weights
Punctuation mark Degree coefficient
.(fullstop) 1
, (comma) 1
!!!…! 2
???…? 1.5
Emotion value i of short sentence Chinese versionwordsComputational methods be:
i w o r d s = b · σ i = 1 m ( σ j = 1 n c i j · f i j ) · q i
In formula, qiRepresent i-th emotion word, cijRepresent and modify qiJ-th degree word weights, fijRepresent and modify qi? J negative word weights;If qiDo not attach degree word, then cijTake default value 1;If qiDo not attach negative word, then fijTake acquiescence Value 1;N takes modification qiDegree word number and modify qiNegative word number in maximum, m represents the number of emotion word, b table The corresponding weights of indicating point symbol, i, j are positive integer;
(6-3) calculate short sentence in emoticon emotion value:
The expression providing for microblogging operator, it is divided into front for the contribution of Sentiment orientation, negatively, neutral three kinds Situation: the weights of front emoticon are set to "+1 ", the weights of negative emoticon are set to " -1 ", the weights of neutral expression's symbol It is set to " 0 ";
The concrete setting citing of emoticon weights is as shown in Figure 6.
Emoticon emotion value i in short sentencemarksComputational methods be:
i m a r k s = σ i = 1 l m i
In formula, miRepresent the expression of i-th table front, negative or neutral emotion, i is positive integer, and l is emoticon Number;
(6-4) calculate initial emotion value i of short sentence0:
i0=iwords+imarks
(7) to the text data after step (2) process, extract the word combination of high frequency using word slip window sampling, obtain To user individual idiom list;
If the length of window of sliding window is w, w is the word number that sliding window comprises, and successively takes 1,2,3,4 respectively;Profit Count in all short sentences with word sliding window, the total degree of each word or phrase appearance, will appear from number of times and be more than threshold value (no Have different threshold values with window, length of window is less, required threshold value is higher) word or phrase list in user individual be used to Term list;Wherein, during w=1, only statistical disposition is carried out to adjective and modal particle.For phrase, with the most word of word number Group lists idiom list in.In addition, the no obvious feelings being caused due to Chinese grammer using self-built daily statement phrase database rejecting The daily statement phrase that sense is pointed to, specific practice is: artificially collects Chinese everyday expressions collocation, as " I ", " present " " rise Come " etc., to compare the daily statement phrase that the no obvious emotion rejected in idiom list is pointed to.Thus, obtain user personality Change idiom list.
Fig. 7 takes word sliding window when 1 to extract the exemplary plot of word combination for w.
Fig. 8 takes word sliding window when 3 to extract the exemplary plot of word combination for w.
(8) the initial emotion value of all short sentences comprising each bar idiom is carried out with statistical analysiss, calculates each bar idiom Emotion value, as shown in figure 9, detailed process is as follows:
For every idiom, find out all short sentences containing this idiom, its initial emotion value sum-average arithmetic calculates Method is as follows:
i g = 1 p σ i = 1 p i 0 i
In formula, i0iFor the initial emotion value of i-th short sentence comprising this idiom, p is the short sentence number containing this idiom, igEmotion initial value for this idiom;
In view of i in theorygValue be infinity, but its most of numerical value is distributed near 10 again, so according to Formula (1) is by igThe value of (x as in formula) is mapped in [- 3,3], obtains emotion value i of idiom 'g(y as in formula), Record is in the personalized idiom emotion labelling table of this user;
y = 0.42 x , 0 &le; x < 50 - 0.42 x , - 50 < x < 0 3 , 0 > 50 - 3 , x > - 50 - - - ( 1 )
The particular flow sheet of step (7)~(8) is shown in Figure 10.
(9) calculate the individualized emotion value of each short sentence, computational methods are: deduct comprise in initial emotion value idiom or The Emotion element value of calculation of any word in idiom, then adds the idiom calculating according to user individual idiom list Emotion value, obtain the individualized emotion value of short sentence it may be assumed that
i = i 0 - &sigma; i = 1 m &prime; ( &sigma; j = 1 n &prime; c g i j &centerdot; f g i j ) &centerdot; q g i + &sigma; k = 1 r i g k &prime;
In formula, i0The initial emotion value of the short sentence for above calculating, cgij、fgij、qgiIt is the correlometer of word in idiom Calculation value, qgiRepresent i-th word, cgijRepresent and modify qgiJ-th degree word weights, fgijRepresent and modify qgiJ-th negative Word weights;If qgiDo not attach degree word, then cgijTake default value 1;If qgiDo not attach negative word, then fgijTake default value 1; N' takes modification qgiThe number of degree word and modify qgiThe number of negative word in maximum, m' represents the number of word, i, J is positive integer;i'gkRepresent the emotion value of k-th idiom, r represents the number of idiom in this short sentence.
(10) Sentiment orientation of calculating user's topic of interest:
For any user topic of interest word in user's topic of interest storehouse, it is calculated as follows its emotion value:
i topic i = 1 w &sigma; j = 1 w i j
In formula, ijFor comprising the individualized emotion value of j-th short sentence of this user's topic of interest word, w is to comprise this use The short sentence sum of family topic of interest word,Emotion value for this user's topic of interest word;
Will according to formula (1)The value of (x as in formula) is mapped in [- 3,3], obtains final user interested Topic Sentiment orientation value (y as in formula).Using these values, set up user individual microblog topic emotion value list, such as table 5 Shown.
Table 5 user individual microblog topic emotion value list
After carrying out step (10), also the overall emotion tendency of user can be calculated, that is, follow the steps below:
(11) overall emotion calculating user is inclined to:
According to following formula calculating user's overall emotion propensity value:
i u s e r = 1 s &sigma; i = 1 s i i
In formula, iiRepresent the emotion value of i-th short sentence, s represents short sentence sum.
The particular flow sheet of step (5)~(11) is shown in Figure 11.
It is, in general, that micro-blog information amount is bigger, the topic word in user's topic of interest storehouse is then abundanter, the emotion of user Trend analysis are also more accurate.So, the Sentiment orientation analysis to microblog users should regularly repeat, and covers old result.This It is a kind of mechanism of the timing upgrading of the sentiment analysis to user, result can be made more comprehensively, accurately, more ageing.
Wherein, due in renewal process the list of user individual idiom have new idiom and add, the present embodiment Microblog users individualized emotion trend analysis method is updated to the analysis result of step (1)~(11) using following methods:
After carrying out step (11), also follow the steps below:
(12) step (1)~(7) are repeated, for the idiom adding new in user individual idiom list, by Step (8) is calculated emotion value i of this idiom 'g;It is reported in user individual idiom row for before this circulation Idiom in table, then update i ' by the following methodg: to every idiom, carry out step (8) first, obtain the i of epicycle circulationg Value of calculation ig_newIf, ig_prevThe i obtaining for last round of circulationgValue of calculation, then igIt is updated to:
ig1ig_prev2ig_new
In formula, ω1For ig_prevWeights, ω2For ig_newWeights;
By igValue be mapped in [- 3,3], obtain emotion value i of idiom 'g
(13) i ' being obtained according to step (12)g, carry out follow-up step (9)~(11).
Because in renewal process, user's topic of interest storehouse has new user's topic of interest word addition, the present embodiment Microblog users individualized emotion trend analysis method is updated to the analysis result of step (1)~(11) using following methods:
After carrying out step (11), also follow the steps below:
(14) step (1)~(9) are repeated, for the user in the new addition in user's topic of interest storehouse in this circulation Topic of interest word, is calculated the topic Sentiment orientation of user by the method for step (10);It is reported in for before this circulation User's topic of interest word in user's topic of interest storehouse, then update the topic Sentiment orientation i of user by the following methodtopic: Carry out step (10) first, obtain the i of epicycle circulationtopicValue of calculation itopic_newIf, itopic_prevObtain for last round of circulation itopicValue of calculation, then itopicIt is updated to:
itopic=ω '1itopic_prev+ω'2itopic_new
ω'1For itopic_prevWeights, ω '2For itopic_newWeights.
The update method in conventional topic storehouse: periodically check much-talked-about topic one hurdle in microblogging homepage, topic therein is added To in conventional topic storehouse;In addition, the renewal of qq input method classified lexicon is checked in timing, new entry is added in classified lexicon.
As shown in figure 12, the microblog users individualized emotion trend analysis system of the present embodiment, including
Data acquisition module, for gathering all data of the microblogging homepage of each user, is stored in data base;
Word-dividing mode, for carrying out participle to the text data in the data collecting, obtains participle set and part of speech mark Note;
Dictionary load-on module: for loading required dictionary, emoticon storehouse;Described dictionary includes hownet emotion word word Storehouse, degree dictionary, negates dictionary, personal pronoun dictionary, and function word is connected dictionary, cyberspeak dictionary and classified lexicon;
Module is set up in user's topic of interest storehouse: for using the interlayer based on word frequency, aggregating algorithm sets up user's sense upwards Interest topic storehouse:
Short sentence division module, divide short sentence for the data that arrives data collecting module collected it is ensured that each short sentence extremely Contain a topic of interest word more;
Emotion element extraction module, for extracting the Emotion element in each short sentence, calculates the initial emotion of each short sentence Value;
Module is set up in user individual idiom list, for the data that data acquisition module is collected, using word Slip window sampling extracts the word combination of high frequency, obtains user individual idiom list;
Idiom emotion value computing module, for carrying out statistical to the emotion value of all short sentences comprising each bar idiom Analysis, draws the emotion value of idiom:
Short sentence emotion value computing module, for calculating the individualized emotion value of each short sentence;
The topic Sentiment orientation computing module of user, for calculating the Sentiment orientation of each topic of interest of user;
User's overall emotion is inclined to computing module, and the overall emotion for calculating user is inclined to.
Above-described embodiment is the present invention preferably embodiment, but embodiments of the present invention are not subject to described embodiment Limit, other any spirit without departing from the present invention and the change made under principle, modification, replacement, combine, simplify, All should be equivalent substitute mode, be included within protection scope of the present invention.

Claims (10)

1. a kind of microblog users individualized emotion trend analysis method is it is characterised in that comprise the following steps:
(1) gather all data of the microblogging homepage of each user, be stored in data base;
(2) text data in the microblog data that step (1) is collected carries out participle, obtains participle set and part-of-speech tagging;
(3) dictionary, emoticon storehouse needed for loading;Described dictionary includes hownet sentiment word lexicon, degree dictionary, negative word Storehouse, personal pronoun dictionary, function word is connected dictionary, cyberspeak dictionary and classified lexicon;
(4) using the interlayer based on word frequency, aggregating algorithm sets up user's topic of interest storehouse upwards:
(4-1) set up topic tree: filter out the word not having topic meaning in user version data, obtain there is obvious topic information Word, using classified lexicon, count word frequency, set up topic tree;Described topic tree is hierarchical structure, and ground floor is talked about for one-level Topic classification, the second layer is two grades of sub-topic classification, and third layer is classified for three-level sub-topic;The described word bag not having topic meaning Include degree word, negative word, personal pronoun, function word, link word, adjective;
(4-2) according to topic tree, by the aggregating algorithm upwards of the interlayer based on word frequency, successively extract high frequency topic;
(4-3) set up a main split, for placing popular peculiar words on the topic word that cannot be included into father's layer topic and network Epigraph or phrase, obtain conventional topic storehouse;By the word in microblog data and the word in conventional topic storehouse carry out coupling corresponding, The topic word language that occurrence number in the microblog data of user is exceeded threshold value extracts, and also serves as high frequency topic;
(4-4) the high frequency topic obtaining step (4-2) and (4-3), as user's topic of interest word, sets up user interested Topic storehouse;
(5) microblog data that step (1) collects is divided short sentence it is ensured that each short sentence at most contains a topic of interest Word;
(6) extract the Emotion element in each short sentence, calculate the initial emotion value of each short sentence:
(6-1) set of words of short sentence is carried out mating mapping with each dictionary and emoticon storehouse, mark all kinds of Emotion elements; Described Emotion element includes emotion word, degree word, negative word, punctuation mark, emoticon, wherein degree word and punctuation mark all For adjusting the degree of emotion word, negative word is used for adjusting the polarity of emotion word;
(6-2) the emotion value of calculating short sentence Chinese version:
The weights of setting Emotion element: positive emotion word weights are "+1 ", negative emotion word weights are " -1 ";Negative word weights are “-1”;Degree word and punctuation mark, according to the depth of its degree, arrange weights, and weights scope is between 0 to 3;Degree word and mark The emotion word that point symbol is affected follows nearby principle, and that is, each degree word or punctuation mark affect apart from its nearest emotion word Emotion degree;
Emotion value i of short sentence Chinese versionwordsComputational methods be:
i w o r d s = b &centerdot; &sigma; i = 1 m ( &sigma; j = 1 n c i j &centerdot; f i j ) &centerdot; q i
In formula, qiRepresent i-th emotion word, cijRepresent and modify qiJ-th degree word weights, fijRepresent and modify qiJ-th no Determine word weights;If qiDo not attach degree word, then cijTake default value 1;If qiDo not attach negative word, then fijTake default value 1;n Take modification qiDegree word number and modify qiNegative word number in maximum, m represents the number of emotion word, and b represents punctuate The corresponding weights of symbol, i, j are positive integer;
(6-3) calculate short sentence in emoticon emotion value:
The expression providing for microblogging operator, it is divided into front for the contribution of Sentiment orientation, negatively, neutral three kinds of feelings Condition: the weights of front emoticon are set to "+1 ", the weights of negative emoticon are set to " -1 ", and the weights of neutral expression's symbol set For " 0 ";
Emoticon emotion value i in short sentencemarksComputational methods be:
i m a r k s = &sigma; i = 1 l m i
In formula, miRepresent the expression of i-th table front, negative or neutral emotion, i is positive integer, and l is emoticon number;
(6-4) calculate initial emotion value i of short sentence0:
i0=iwords+imarks
(7) to the text data after step (2) process, extract the word combination of high frequency using word slip window sampling, used Family personalization idiom list;
(8) statistical analysiss are carried out to the initial emotion value of all short sentences comprising each bar idiom, draw the emotion value of idiom;
For every idiom, find out all short sentences containing this idiom, by its initial emotion value sum-average arithmetic, computational methods As follows:
i g = 1 p &sigma; i = 1 p i 0 i
In formula, i0iFor the initial emotion value of i-th short sentence comprising this idiom, p is the short sentence number containing this idiom, igFor The emotion initial value of this idiom;
By igValue be mapped in [- 3,3], obtain emotion value i of idiom 'g, record is in the personalized idiom feelings of this user In sense labelling table;
(9) calculate the individualized emotion value of each short sentence, computational methods are:
i = i 0 - &sigma; i = 1 m &prime; ( &sigma; j = 1 n &prime; c g i j &centerdot; f g i j ) &centerdot; q g i + &sigma; k = 1 r i g k &prime;
In formula, i0For the initial emotion value of short sentence, qgiRepresent i-th word, cgijRepresent and modify qgiJ-th degree word weights, fgijRepresent and modify qgiJ-th negative word weights;If qgiDo not attach degree word, then cgijTake default value 1;If qgiNot attached Band negative word, then fgijTake default value 1;N' takes modification qgiThe number of degree word and modify qgiThe number of negative word in Big value, m' represents the number of word, and i, j are positive integer;i'gkRepresent the emotion value of k-th idiom, r represents in this short sentence The number of idiom;
(10) Sentiment orientation of calculating user's topic of interest:
For any user topic of interest word in user's topic of interest storehouse, it is calculated as follows its emotion value:
i topic i = 1 w &sigma; j = 1 w i j
ijFor comprising the individualized emotion value of j-th short sentence of this user's topic of interest word, w is to comprise this user words interested The short sentence sum of epigraph,Emotion value for this user's topic of interest word;WillValue be mapped in [- 3,3], obtain The topic Sentiment orientation value of final user, using these values, sets up user individual microblog topic emotion value list.
2. microblog users individualized emotion trend analysis method according to claim 1 is it is characterised in that carry out step (10), after, also follow the steps below:
A () repeats step (1)~(7), for the idiom adding new in user individual idiom list, by step (8) it is calculated emotion value i of this idiom 'g;It is reported in user individual idiom list for before this circulation Idiom, then by the following method update i 'g: to every idiom, carry out step (8) first, obtain the i of epicycle circulationgMeter Calculation value ig_newIf, ig_prevThe i obtaining for last round of circulationgValue of calculation, then igIt is updated to:
ig1ig_prev2ig_new
In formula, ω1For ig_prevWeights, ω2For ig_newWeights;
By igValue be mapped in [- 3,3], obtain emotion value i of idiom 'g
B i ' that () obtains according to step (a)g, carry out step (9)~(10).
3. microblog users individualized emotion trend analysis method according to claim 1 is it is characterised in that carry out step (10), after, also follow the steps below:
Repeat step (1)~(9), for the user's words interested in the new addition in user's topic of interest storehouse in this circulation Epigraph, is calculated the topic Sentiment orientation of user by the method for step (10);Emerging for being reported in user's sense before this circulation User's topic of interest word in funny remarks exam pool, then update the topic Sentiment orientation i of user by the following methodtopic: carry out first Step (10), obtains the i of epicycle circulationtopicValue of calculation itopic_newIf, itopic_prevThe i obtaining for last round of circulationtopic's Value of calculation, then itopicIt is updated to:
itopic=ω '1itopic_prev+ω'2itopic_new
ω'1For itopic_prevWeights, ω '2For itopic_newWeights.
4. the microblog users individualized emotion trend analysis method according to any one of claims 1 to 3 it is characterised in that After carrying out step (10), also follow the steps below:
According to following formula calculating user's overall emotion propensity value:
i u s e r = 1 s &sigma; i = 1 s i i
In formula, iiRepresent the emotion value of i-th short sentence, s represents short sentence sum.
5. microblog users individualized emotion trend analysis method according to claim 1 is it is characterised in that step (1) institute State all data of the microblogging homepage gathering each user, particularly as follows:
In units of user, collect all data in its homepage;Described data includes oneself issue in user home page face Micro- with user's of interest with the microblog data forwarding, the data of " comment sending ", the data of "@mentions mine ", microblogging name Rich name, self-introduction, spontaneous or forward the title in the url webpage and video link that comprise in microblogging.
6. microblog users individualized emotion trend analysis method according to claim 1 is it is characterised in that step (4-2) Described high frequency topic, by the aggregating algorithm upwards of the interlayer based on word frequency, is successively extracted according to topic tree, particularly as follows:
To each topic word, if its occurrence number is higher than the threshold value setting, this topic is high frequency topic, otherwise, this is talked about The occurrence number of epigraph passes to father's layer topic word, successively calculates and extracts high frequency topic;The occurrence number of topic word includes this topic The occurrence number of itself and the occurrence number of the sub-topic of this topic.
7. microblog users individualized emotion trend analysis method according to claim 1 is it is characterised in that step (5) institute State the data that step (1) is collected and divide short sentence, particularly as follows:
Each microblog data of analysis, if not having topic word or pertaining only to a topic, whole piece microblogging in a microblog data As a short sentence;
If one containing multiple topic word in microblog data, in conjunction with the analysis of punctuation mark degree of priority: if two distances are There is punctuate between big topic word, be then split into two short sentences at punctuate, if do not had between the topic word of two distance maximums There is punctuate, then checking time topic word of big distance, if all there is no punctuate, not splitting, whole piece microblog data is as one Short sentence;Described distance is to reach another topic word tree number to be passed through from a topic word in topic tree;If two There are multiple punctuates between topic word, then choose and divided at the high punctuate of relative importance value: in punctuation mark relative importance value, fullstop > point Number > comma.
8. microblog users individualized emotion trend analysis method according to claim 1 is it is characterised in that step (7) institute State the text data after step (2) is processed, extract the word combination of high frequency, particularly as follows:
If the length of window of sliding window is w, w is the word number that sliding window comprises, and successively takes 1,2,3,4 respectively;Using word Language sliding window counts in all short sentences, the total degree of each word or phrase appearance, will appear from the word that number of times is more than threshold value Or phrase lists user individual idiom list in;Collect daily statement word, set up daily statement phrase database;By daily statement word Reject from user individual idiom list, obtain user individual idiom list.
9. realize the microblog users individualized emotion trend analysis system of analysis method described in any one of claim 1~8, it is special Levy and be, including
Data acquisition module, for gathering all data of the microblogging homepage of each user, is stored in data base;
Word-dividing mode, for carrying out participle to the text data in the data collecting, obtains participle set and part-of-speech tagging;
Dictionary load-on module, for loading required dictionary, emoticon storehouse;Described dictionary includes hownet sentiment word lexicon, journey Degree dictionary, negates dictionary, personal pronoun dictionary, and function word is connected dictionary, cyberspeak dictionary and classified lexicon;
Module is set up in user's topic of interest storehouse, for using the interlayer based on word frequency upwards aggregating algorithm to set up user interested Topic storehouse;
Short sentence division module, for by data collecting module collected to data divide short sentence it is ensured that each short sentence at most contains There is a topic of interest word;
Emotion element extraction module, for extracting the Emotion element in each short sentence, calculates the initial emotion value of each short sentence;
Module is set up in user individual idiom list, for the data that data acquisition module is collected, is slided using word Window technique extracts the word combination of high frequency, obtains user individual idiom list;
Idiom emotion value computing module, for statistical analysiss are carried out to the emotion value of all short sentences comprising each bar idiom, Draw the emotion value of idiom;
Short sentence emotion value computing module, for calculating the individualized emotion value of each short sentence;
User's topic Sentiment orientation computing module, for calculating the Sentiment orientation of each topic of interest of user.
10. microblog users individualized emotion trend analysis system according to claim 9 is it is characterised in that also include using Family overall emotion tendency computing module, the overall emotion for calculating user is inclined to.
CN201310711626.9A 2013-12-20 2013-12-20 Method and system for analysing individual emotion tendency of microblog user Active CN103699626B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310711626.9A CN103699626B (en) 2013-12-20 2013-12-20 Method and system for analysing individual emotion tendency of microblog user

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310711626.9A CN103699626B (en) 2013-12-20 2013-12-20 Method and system for analysing individual emotion tendency of microblog user

Publications (2)

Publication Number Publication Date
CN103699626A CN103699626A (en) 2014-04-02
CN103699626B true CN103699626B (en) 2017-02-01

Family

ID=50361154

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310711626.9A Active CN103699626B (en) 2013-12-20 2013-12-20 Method and system for analysing individual emotion tendency of microblog user

Country Status (1)

Country Link
CN (1) CN103699626B (en)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105740224A (en) * 2014-12-11 2016-07-06 仲恺农业工程学院 Text analysis based user psychology early warning method and apparatus
CN104572616B (en) * 2014-12-23 2018-04-24 北京锐安科技有限公司 The definite method and apparatus of Text Orientation
CN104899298B (en) * 2015-06-09 2018-01-16 华东师范大学 A kind of microblog emotional analysis method based on large-scale corpus feature learning
CN104933130A (en) * 2015-06-12 2015-09-23 百度在线网络技术(北京)有限公司 Comment information marking method and comment information marking device
CN105022725B (en) * 2015-07-10 2018-04-20 河海大学 A kind of text emotion trend analysis method applied to finance Web fields
CN106709829B (en) * 2015-08-03 2020-06-02 科大讯飞股份有限公司 Learning situation diagnosis method and system based on online question bank
CN105095190B (en) * 2015-08-25 2018-01-12 众联数据技术(南京)有限公司 A kind of sentiment analysis method combined based on Chinese semantic structure and subdivision dictionary
CN106933789B (en) * 2015-12-30 2023-06-20 阿里巴巴集团控股有限公司 Travel attack generation method and generation system
CN105843796A (en) * 2016-03-28 2016-08-10 北京邮电大学 Microblog emotional tendency analysis method and device
CN106503220A (en) * 2016-10-28 2017-03-15 上海大学 A kind of microblogging emoticon affection computation method based on a mutual information
CN108062300A (en) * 2016-11-08 2018-05-22 中移(苏州)软件技术有限公司 A kind of method and device that Sentiment orientation analysis is carried out based on Chinese text
CN107391712A (en) * 2017-07-28 2017-11-24 王亚迪 A kind of network public opinion trend prediction analysis method
CN107943789A (en) * 2017-11-17 2018-04-20 新华网股份有限公司 Mood analysis method, device and the server of topic information
CN109524106A (en) * 2018-10-31 2019-03-26 北京指掌易科技有限公司 A kind of mental model for analyzing introgression by chat content
CN109446378A (en) * 2018-11-08 2019-03-08 北京奇艺世纪科技有限公司 Information recommendation method, Sentiment orientation determine method and device and electronic equipment
CN110189742B (en) * 2019-05-30 2021-10-08 芋头科技(杭州)有限公司 Method and related device for determining emotion audio frequency, emotion display and text-to-speech
CN111611455A (en) * 2020-05-22 2020-09-01 安徽理工大学 User group division method based on user emotional behavior characteristics under microblog hot topics
CN112116391A (en) * 2020-09-18 2020-12-22 北京达佳互联信息技术有限公司 Multimedia resource delivery method and device, computer equipment and storage medium
CN112684110A (en) * 2020-12-14 2021-04-20 江西省蚕桑茶叶研究所(江西省经济作物研究所) Tea sensory evaluation method based on favorite expressions

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102279890A (en) * 2011-09-02 2011-12-14 苏州大学 Sentiment word extracting and collecting method based on micro blog
CN102663046A (en) * 2012-03-29 2012-09-12 中国科学院自动化研究所 Sentiment analysis method oriented to micro-blog short text
JP2012221480A (en) * 2011-04-06 2012-11-12 L Is B Corp Message processing system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9069880B2 (en) * 2012-03-16 2015-06-30 Microsoft Technology Licensing, Llc Prediction and isolation of patterns across datasets

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012221480A (en) * 2011-04-06 2012-11-12 L Is B Corp Message processing system
CN102279890A (en) * 2011-09-02 2011-12-14 苏州大学 Sentiment word extracting and collecting method based on micro blog
CN102663046A (en) * 2012-03-29 2012-09-12 中国科学院自动化研究所 Sentiment analysis method oriented to micro-blog short text

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于多特征与复合分类法的中文微博情感分析;吴维等;《中国期刊全文数据库 北京信息科技大学学报(自然科学版)》;20130831;第28卷(第4期);全文 *

Also Published As

Publication number Publication date
CN103699626A (en) 2014-04-02

Similar Documents

Publication Publication Date Title
CN103699626B (en) Method and system for analysing individual emotion tendency of microblog user
Deng et al. Smart generation system of personalized advertising copy and its application to advertising practice and research
Araque et al. MoralStrength: Exploiting a moral lexicon and embedding similarity for moral foundations prediction
Pérez-Rosas et al. Experiments in open domain deception detection
Mostafa More than words: Social networks’ text mining for consumer brand sentiments
Batool et al. Precise tweet classification and sentiment analysis
Santos et al. Improving international attractiveness of higher education institutions based on text mining and sentiment analysis
Extra et al. Language proficiency and socio-cultural orientation of Turkish and Moroccan youngsters in the Netherlands
CN109829166B (en) People and host customer opinion mining method based on character-level convolutional neural network
CN102682120B (en) Method and device for acquiring essential article commented on network
CN103793503A (en) Opinion mining and classification method based on web texts
CN105843897A (en) Vertical domain-oriented intelligent question and answer system
CN104731770A (en) Chinese microblog emotion analysis method based on rules and statistical model
CN103917968A (en) System and method for managing opinion networks with interactive opinion flows
US20140156673A1 (en) Measuring and altering topic influence on edited and unedited media
Mangal et al. Analysis of users’ interest based on tweets
CN106202252A (en) Method, system are recommended in a kind of trip analyzed based on user emotion
Qi et al. Subjective well-being measurement based on Chinese grassroots blog text sentiment analysis
Zhao et al. An analysis of well-being determinants at the city level in China using big data
Märkle-Huß et al. Improving sentiment analysis with document-level semantic relationships from rhetoric discourse structures
Figueroa Male or female: What traits characterize questions prompted by each gender in community question answering?
Awrahman et al. Sentiment analysis and opinion mining within social networks using konstanz information miner
KR101007284B1 (en) System and method for searching opinion using internet
Gottipati et al. Analyzing tweets on new norm: work from home during COVID-19 outbreak
Maity et al. Language use matters: Analysis of the linguistic structure of question texts can characterize answerability in quora

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant