CN106780073A - A kind of community network maximizing influence start node choosing method for considering user behavior and emotion - Google Patents

A kind of community network maximizing influence start node choosing method for considering user behavior and emotion Download PDF

Info

Publication number
CN106780073A
CN106780073A CN201710019177.XA CN201710019177A CN106780073A CN 106780073 A CN106780073 A CN 106780073A CN 201710019177 A CN201710019177 A CN 201710019177A CN 106780073 A CN106780073 A CN 106780073A
Authority
CN
China
Prior art keywords
emotion
node
word
comment
participle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710019177.XA
Other languages
Chinese (zh)
Other versions
CN106780073B (en
Inventor
邓晓衡
曹德娟
潘琰
沈海澜
桂劲松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Central South University
Original Assignee
Central South University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Central South University filed Critical Central South University
Priority to CN201710019177.XA priority Critical patent/CN106780073B/en
Publication of CN106780073A publication Critical patent/CN106780073A/en
Application granted granted Critical
Publication of CN106780073B publication Critical patent/CN106780073B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a kind of community network maximizing influence start node choosing method for considering user behavior and emotion, use the evaluation data set with tag along sort, build emotional words training pattern and extend existing sentiment dictionary, behavior disposition analysis and Sentiment orientation analysis are carried out to community network node respectively, as the judging basis of node influence power, the influence power propagation model BSIS for considering user behavior and emotion is built, and maximum effect power marginal benefit node is solved with reference to greedy algorithm and be added in start node set.The present invention consider user behavior tendency and Sentiment orientation, it is more efficient, accurate, truly excavate maximizing influence start node.

Description

A kind of community network maximizing influence start node for considering user behavior and emotion Choosing method
Technical field
The invention belongs to field of computer technology, it is related to the community network influence power of a kind of consideration user behavior and emotion most Bigization start node choosing method.
Background technology
As community network is fast-developing, more and more people are by societies such as microblogging, circle of friends, Facebook, Twitter Net list up to oneself to other people, product, social event etc. personal views and view, exchange is shared with other people, by oneself In work, life, affect display to community network.In community network, the information of the exchange and interdynamic between user illustrates use Sentiment orientation between family, the emotion between usual user is directly connected to the factor of influence between user.According to community network The emotion influence relation of middle user, finds the maximum user group of influence power so that information obtains virus-type with minimum cost Propagate, influence power is spread in the way of " public praise effect ".Therefore sentiment analysis are carried out to community network user for influence power most Bigization research is significant.
Innovatory algorithm and model quilt of many based on IC models (independent cascade model) and LT models (linear threshold model) Propose to find maximizing influence start node.When influence power is propagated, user feeling was analyzed and based on the time in community network The user behavior tendency of delay is seldom considered.
Therefore, it is necessary to provide a kind of community network maximizing influence start node choosing for considering user behavior and emotion Take method.
The content of the invention
Technical problem solved by the invention is in view of the shortcomings of the prior art, to propose a kind of consideration user behavior and feelings The community network maximizing influence start node choosing method of sense, based on community network user behavior tendency and Sentiment orientation, Maximum marginal benefit node is solved as maximizing influence start node, it is as a result more accurate and reliable.
The technical scheme is that:
A kind of community network maximizing influence start node choosing method for considering user behavior and emotion, including it is following Step:
Step 1:Using the evaluation data set with tag along sort, emotional words training pattern is built, in existing emotion The emotional words vocabulary with emotion weight for obtaining being applied to network comment information sentiment analysis on the basis of dictionary extends To new sentiment dictionary;
Because the evaluation information of community network is all without label, therefore using information on commodity comment, root in electric business network Emotional semantic classification is carried out to evaluation information according to star is evaluated, 5 stars are evaluated for front, 1 star is unfavorable ratings;Herein, utilize Reptile crawls Jingdone district information on commodity comment, extracts commodity and evaluates positive negative sample, and keeps positive and negative sample size balanced.Evaluation information It is as follows that emotion vocabulary builds model step:
Commodity evaluation text is carried out Chinese word segmentation by step 1.1 using ANSJ, obtains indicating the word segmentation result of word part of speech, Statistics part of speech is the number of times that adjectival participle occurs in forward direction evaluates sample and negative sense evaluates sample respectively;
Step 1.2 counts positive and evaluates sample and negative sense evaluation sample size summation N, sets participle frequency threshold value r, maximum Emotion weight max;
Step 1.3 is adjectival participle word for part of speech, counts its occurrence number N in forward direction evaluates samplePWith Negative sense evaluates the times N occurred in sampleNIf,Perform step 1.4,1.5,1.6;
Step 1.4 calculates the positive probability P of word wordP:
Wherein PP∈(0,1);
Step 1.5 is according to its emotion weight of the positive probability calculation of participle word w1And w2
Wherein PP∈ (0,1), then w1∈(-1,1);
Wherein w2∈(-1,1);
Step 1.6 gives adjective participle word, and its comprehensive emotion weight is w:
Step 2:Data prediction and data pick-up are carried out to community network data set, user behavior record is extracted, calculated Consider the user behavior influence power of user behavior time delay;
In community network, if user node u and v are adjacent node, and the message that v is issued to u is by a series of times Postpone to produce a series of behaviors, wherein behavior is including thumb up, comment and forwards, then consider user's row of user behavior time delay For influence power B_Inf (u, v) is:
Wherein, λ ∈ (0,1), M (u) represents that u performed the massage set of behavior, and M (v) represents that user v performed behavior Massage set, | M (u) | represents the message bar number that u performed behavior, and | M (u) ∩ M (v) | represents that v performs row to the message that u is issued For bar number, t represent v pairs and u issue message process performing average time delay, T represent v pairs with its all neighbor node The average time delay of the message process performing of issue.
Step 3:According to expanded sentiment dictionary, sentiment analysis are carried out to comment information between community network user, Obtain the user feeling influence power based on user feeling tendency;
Two neighboring users node u and v are given, if u issues first message, v is evaluated it, and wherein one is evaluated Content commentjCarry out participle and obtain a series of participle set { word1,word2,...,wordn, go search each participle exist Corresponding participle emotion weight { w in sentiment dictionary1,w2,...,wn, ifSentiment dictionary, wi=0, then in this evaluation The emotion score value of appearance:
What deserves to be explained is, in comment information often occur with turnover meaning word, for example Chinese in " no Be, no " and English in the adverbial word such as " not ", this kind of word combined the emotion participle expression that it can be caused to modify with other emotion participles The meaning it is reverse;Accordingly, it is considered to arrive the semantic modified adverbial word of turnover, we analyze the context of this kind of word, will be with its phase Adjacent emotion participle is extracted and obtains turnover participle set { reverse_w1,reverse_w2,...,reverse_wm, go Search their emotion weight { r_w respectively in sentiment dictionary1,r_w2,...,r_wm, to commentjEmotion score value enter The emotion score value that row is corrected:
Therefore, if evaluation content commentjIn the positive emotion word that includes it is more, participle emotion weight accumulated value is higher And it is denseer to the positive emotion of u just, to represent v, know that can to derive u bigger to the influence power of v therefrom.If evaluation content commentjIn the negative sense emotion word that includes it is more, negative more of participle emotion weight accumulated value and be negative represent v to u negative senses Emotion is denseer, thus derives u bigger to the influence power of v.Therefore, two neighboring user node u and v, v first commenting to u are given By commentj, based on commentjEmotion score value derive u to the single emotion influence power of v:
Wherein ο ∈ (0,1);
Therefore comment information { comment of two neighboring user node u and v and a plurality of v to u is given1, comment2,...,commentn, the node emotion influence power being calculated is commented on according to every, average and draw u to v's Emotion influence power:
Step 4:Consider the influence power of the Sentiment orientation based on user behavior tendency, obtain the total influence power of user TotalInfu,v
TotalInfu,v=β B_Infu,v+(1-β)SentiInfu,v
Step 5:The total influence power and the propagation path of total influence power being calculated according to step 4, consider social network comprehensively The topological relation of network node, therefore u is φ to the propagating influence summation of vu,v(v):
There is the node set of behavior and emotion influence power in wherein I (v) expression network topology structures to v;
Step 6:The total influence power marginal benefits of calculate node u are σMarginal_Revenue(u):
Wherein, S represents start node set, and A represents the set of all user nodes in community network;
Community network interior joint marginal benefit is calculated simultaneously to sort by size, choose the maximum preceding nk node of marginal benefit according to Secondary to be inserted into queue Q, marginal benefit maximum node is inserted into S in ejection Q;
Step 7:Statistics S interior joint numbers, if | S | < k, all node limits are received in recalculating queue Q according to step 6 Benefit and more new sort, marginal benefit maximum node is inserted into S in ejection Q, if | S |=k, S are final start node collection Close;
Beneficial effect:
The present invention proposes a kind of community network maximizing influence start node selection for considering user behavior and emotion Method.Based on user behavior and Sentiment orientation in online community network, proposition is a kind of to consider two kinds of influence powers of factor Propagation model, is named as BSIS (Influence Spread Based on Users ' Behaviors and Sentiment) Model.The model is divided into three phases, and the first stage builds emotional words training pattern expanding sentiment dictionary, and second stage builds User behavior influence power, the phase III builds user feeling influence power.With reference to greedy algorithm and the maximum limit of BSIS model solutions Income node is used as influence power maximum start node.Commodity evaluating data is crawled from Jingdone district, is divided into evaluation according to star is evaluated Front is evaluated and the class of unfavorable ratings two, based on the existing sentiment dictionary of emotional words training pattern extension.Using Flickr and microblogging Social network data collection, realizes model proposed by the present invention and algorithm on Hadoop and Spark platforms.Test result indicate that, Influence power propagation model proposed by the present invention and algorithm compared to the method based on traditional IC, LT and CDNF models, with more preferable Influence power communication effect, the start node of wider array of influence power spread scope and Geng Gao chooses quality.
Brief description of the drawings
Fig. 1 is a kind of community network maximizing influence start node for considering user behavior and emotion proposed by the present invention The flow chart of choosing method;
Fig. 2 is to use in embodiment 1 Flickr data sets to be based on BSIS, and the different methods of this 4 kinds of CDNF, IC and LT are chosen Start node influence power communication effect comparison diagram;
Fig. 3 is to use in embodiment 1 microblog data collection to be based on the influence power that the start node that BSIS chooses produces to propagate effect Fruit is schemed;
Fig. 4 is to use in embodiment 1 Flickr data sets to be based on BSIS, and the different methods of this 4 kinds of CDNF, IC and LT are chosen The thumb up of reception of start node, forwarding, comment behavior perform number of times comparison diagram;
Fig. 5 be use in embodiment 1 microblog data collection be based on the thumb up of the reception of the start node that BSIS chooses, forwarding, Comment behavior performs number of times figure;
Specific embodiment
For a kind of more detailed description community network influence power for considering user behavior and emotion proposed by the present invention Start node choosing method is maximized, the present invention is further illustrated with example below in conjunction with the accompanying drawings.
The present invention proposes a kind of community network maximizing influence start node selection for considering user behavior and emotion Method.Based on user behavior and Sentiment orientation in online community network, a kind of influence power for considering both factors is proposed Propagation model, is named as BSIS (Influence Spread Based on Users ' Behaviors and Sentiment) Model.The model is divided into three phases, and the first stage builds emotional words training pattern expanding sentiment dictionary, and second stage builds User behavior influence power, the phase III builds user feeling influence power.With reference to greedy algorithm and the maximum limit of BSIS model solutions Income node is used as influence power maximum start node.Commodity evaluating data is crawled from Jingdone district website, will be evaluated according to star is evaluated It is divided into front to evaluate and the class of unfavorable ratings two, based on the existing sentiment dictionary of emotional words training pattern extension.Using Flickr and Microblogging social network data collection, realizes model proposed by the present invention and algorithm on Hadoop and Spark platforms.Experimental result table It is bright, influence power propagation model proposed by the present invention and algorithm compared to the method on traditional IC, LT and CDNF models, with more preferable Influence power communication effect, the start node of wider array of influence power spread scope and Geng Gao chooses quality.
Fig. 1 is a kind of community network maximizing influence start node for considering user behavior and emotion proposed by the present invention Choosing method flow chart, specific implementation step is as follows:
Step 1:Using the evaluation data set with tag along sort, emotional words training pattern is built, in existing emotion The emotional words vocabulary with emotion weight for obtaining being applied to network comment information sentiment analysis on the basis of dictionary extends To new sentiment dictionary;
Because the evaluation information of community network is all without label, therefore using information on commodity comment, root in electric business network Emotional semantic classification is carried out to evaluation information according to star is evaluated, 5 stars are evaluated for front, 1 star is unfavorable ratings;Herein, utilize Reptile crawls Jingdone district information on commodity comment, extracts commodity and evaluates positive negative sample, and keeps positive and negative sample size balanced.Evaluation information It is as follows that emotion vocabulary builds model step:
Commodity evaluation text is carried out Chinese word segmentation by step 1.1 using ANSJ, obtains indicating the word segmentation result of word part of speech, Statistics part of speech is the number of times that adjectival participle occurs in forward direction evaluates sample and negative sense evaluates sample respectively;
Step 1.2 counts positive and evaluates sample and negative sense evaluation sample size summation N, sets participle frequency threshold value r, maximum Emotion weight max;
Step 1.3 is adjectival participle word for part of speech, counts its occurrence number N in forward direction evaluates samplePWith Negative sense evaluates the times N occurred in sampleNIf,Perform step 1.4,1.5,1.6;
Step 1.4 calculates the positive probability P of word wordP:
Wherein PP∈(0,1);
Step 1.5 is according to its emotion weight of the positive probability calculation of participle word w1And w2
Wherein PP∈ (0,1), then w1∈(-1,1);
Wherein w2∈(-1,1);
Step 1.6 gives adjective participle word, and its comprehensive emotion weight is w:
Step 2:Data prediction and data pick-up are carried out to community network data set, user behavior record is extracted, calculated Consider the user behavior influence power of user behavior time delay;
In community network, if user node u and v are adjacent node, and the message that v is issued to u is by a series of times Postpone to produce a series of behaviors, wherein behavior is including thumb up, comment and forwards, then consider user's row of user behavior time delay For influence power B_Inf (u, v) is:
Wherein, λ ∈ (0,1), M (u) represents that u performed the massage set of behavior, and M (v) represents that user v performed behavior Massage set, | M (u) | represents the message bar number that u performed behavior, and | M (u) ∩ M (v) | represents that v performs row to the message that u is issued For bar number, t represent v pairs and u issue message process performing average time delay, T represent v pairs with its all neighbor node The average time delay of the message process performing of issue.
Step 3:According to expanded sentiment dictionary, sentiment analysis are carried out to comment information between community network user, Obtain the user feeling influence power based on user feeling tendency;
Two neighboring users node u and v are given, if u issues first message, v is evaluated it, and wherein one is evaluated Content commentjCarry out participle and obtain a series of participle set { word1,word2,...,wordn, go search each participle exist Corresponding participle emotion weight { w in sentiment dictionary1,w2,...,wn, ifSentiment dictionary, wi=0, then in this evaluation The emotion score value of appearance:
What deserves to be explained is, in comment information often occur with turnover meaning word, for example Chinese in " no Be, no " and English in the adverbial word such as " not ", this kind of word combined the emotion participle expression that it can be caused to modify with other emotion participles The meaning it is reverse;Accordingly, it is considered to arrive the semantic modified adverbial word of turnover, we analyze the context of this kind of word, will be with its phase Adjacent emotion participle is extracted and obtains turnover participle set { reverse_w1,reverse_w2,...,reverse_wm, go Search their emotion weight { r_w respectively in sentiment dictionary1,r_w2,...,r_wm, to commentjEmotion score value enter The emotion score value that row is corrected:
Therefore, if evaluation content commentjIn the positive emotion word that includes it is more, participle emotion weight accumulated value is higher And it is denseer to the positive emotion of u just, to represent v, know that can to derive u bigger to the influence power of v therefrom.If evaluation content commentjIn the negative sense emotion word that includes it is more, negative more of participle emotion weight accumulated value and be negative represent v to u negative senses Emotion is denseer, thus derives u bigger to the influence power of v.Therefore, two neighboring user node u and v, v first commenting to u are given By commentj, based on commentjEmotion score value derive u to the single emotion influence power of v:
Wherein ο ∈ (0,1);
Therefore comment information { comment of two neighboring user node u and v and a plurality of v to u is given1, comment2,...,commentn, the node emotion influence power being calculated is commented on according to every, take
Average value draws emotion influence powers of the u to v:
Step 4:Consider the influence power of the Sentiment orientation based on user behavior tendency, obtain the total influence power of user TotalInfu,v
TotalInfu,v=β B_Infu,v+(1-β)SentiInfu,v
Step 5:The total influence power and the propagation path of total influence power being calculated according to step 4, consider social network comprehensively The topological relation of network node, therefore u is φ to the propagating influence summation of vu,v(v):
There is the node set of behavior and emotion influence power in wherein I (v) expression network topology structures to v;
Step 6:The total influence power marginal benefits of calculate node u are σMarginal_Revenue(u):
Wherein, S represents start node set, and A represents the set of all user nodes in community network;
Community network interior joint marginal benefit is calculated simultaneously to sort by size, choose the maximum preceding nk node of marginal benefit according to Secondary to be inserted into queue Q, marginal benefit maximum node is inserted into S in ejection Q;
Step 7:Statistics S interior joint numbers, if | S | < k, all node limits are received in recalculating queue Q according to step 6 Benefit and more new sort, marginal benefit maximum node is inserted into S in ejection Q, if | S |=k, S are final start node collection Close;
Embodiment 1:
In this embodiment, crawl Jingdone district using reptile and do shopping network user comment data as emotional words training pattern Training set and test set, this data set evaluates and 14937 1 star unfavorable ratings comprising 17052 5 star fronts.The present embodiment Emotional words training pattern is built based on step 1, is gathered using 80% as training, 20% gathers as test, final training Obtain 468 emotional words with weight extend existing Taiwan Univ. Chinese emotion vocabulary NTUSD with SentiWordNet English emotion vocabularys.
In this embodiment, using Flickr (one, U.S. picture shares social network sites) social network data collection, the number Recorded comprising 40808 user nodes and 75269 user behaviors according to collection.Microblogging community network data set, the number are used simultaneously Recorded comprising 412952 user nodes and 500977 user behaviors according to collection.
The embodiment is real on Hadoop2.5.2 and Spark2.0.2 (12Core, 120GB Memory, 6Workers) It is existing, it is effectively quick BSIS influence powers propagation model and start node derivation algorithm to be realized, it is calculated maximizing influence Start node set.Using Liang Ge Fiel meeting Network data set, from influence power communication effect and user's true impact power scope Two aspect comparison of design experiments, by our result and traditional IC model (independent cascade model), LT models (linear threshold mould Type) and CDNF models (the credit distributed model based on nodal properties) contrast relatively come the correctness and validity of verification method. Side activation probability wherein in IC models between consecutive points is learnt by EM algorithms (EM algorithm), and LT models The side of adjacent node v and u is activated Probability p p (u, v) and is calculated using formula pp (u, v)=1/N (u), and wherein N (u) represents section The neighbor node number of point u.
As shown in Figure 2, left subgraph represented in Flickr community networks, is asked based on tetra- kinds of models of SBIS, CDNF, IC, LT Total influence power distribution map for being calculated of preceding 50 start nodes of solution, right subgraph is represented based on tetra- kinds of SBIS, CDNF, IC, LT The distribution map that total influence power summation of model solution start node increases with start node number.From the figure 3, it may be seen that left and right sub-chart Show in microblogging community network, total influence distribution map and total influence power of preceding 50 start nodes based on SBIS model solutions are total And growth chart.Represented by the checkings of Fig. 2 and 3, the behavial factor of user feeling, the start node tool of selection are considered due to SBIS There is bigger influence power communication effect.
In order to verify the real influence power that the start node that SBIS models are chosen is produced in true community network, this reality Thumb up, forwarding, the comment behavior number of times of the other users execution that example is received by the use of user are applied as true impact power criterion, Statistics chooses the real behavior execution time that start node set is received based on tetra- influence power propagation models of SBIS, CDNF, IC, LT Number.As shown in Figure 4, using Flickr data sets, left subgraph represents the first 50 initial sections chosen based on SBIS, CDNF, IC, LT Point receive thumb up, forwarding, comment behavior perform number of times distribution map, right subgraph represent use Flickr data sets, based on SBIS, CDNF, IC, LT choose thumb up, forwarding, the comment behavior execution number of times summation of start node reception with the growth of start node number Growth chart.As shown in Figure 5, left and right subgraph represents use microblog data collection, first 50 initial sections of BSIS model solutions respectively Thumb up, forwarding, comment behavior execution number of times distribution map and execution number of times summation growth chart that point is received.As seen from the figure, it is based on It is higher than other models that the behavior that the start node that SBIS influence powers propagation model is chosen is received performs number of times, therefore compared to other Model, SBIS can effectively excavate popular user, so as to ensure that the selection quality of start node, be produced in true community network Wider array of influence power spread scope, more accurately reflects influence power communication effect.
Knowable to being tested more than, at the beginning of the inventive method is based on user behavior and Sentiment orientation factor excavation maximizing influence Beginning node, can more truly reflect user force, and experiment shows that the inventive method chooses quality, influence in start node Power communication effect and influence power spread scope aspect have accuracy and reliability.

Claims (1)

1. a kind of community network maximizing influence start node choosing method for considering user behavior and emotion, its feature exists In comprising the following steps:
Step 1:Using the evaluation data set with tag along sort, emotional words training pattern is built, in existing sentiment dictionary On the basis of obtain being applied to the extension of the emotional words vocabulary with emotion weight of network comment information sentiment analysis and obtain new Sentiment dictionary;
Because the evaluation information of community network is all without label, therefore using information on commodity comment in electric business network, according to commenting Valency star carries out emotional semantic classification to evaluation information, and 5 stars are evaluated for front, and 1 star is unfavorable ratings;Herein, using reptile Jingdone district information on commodity comment is crawled, commodity is extracted and is evaluated positive negative sample, and keep positive and negative sample size balanced;Evaluation information emotion It is as follows that vocabulary builds model step:
Commodity evaluation text is carried out Chinese word segmentation by step 1.1 using ANSJ, obtains indicating the word segmentation result of word part of speech, is counted Part of speech is the number of times that adjectival participle occurs in forward direction evaluates sample and negative sense evaluates sample respectively;
Step 1.2 counts positive and evaluates sample and negative sense evaluation sample size summation N, sets participle frequency threshold value r, maximum emotion Weight max;
Step 1.3 is adjectival participle word for part of speech, counts its occurrence number N in forward direction evaluates samplePCommented with negative sense The times N occurred in valency sampleNIf,Perform step 1.4,1.5,1.6;
Step 1.4 calculates the positive probability P of word wordP:
Wherein PP∈(0,1);
Step 1.5 is according to its emotion weight of the positive probability calculation of participle word w1And w2
Wherein PP∈ (0,1), then w1∈(-1,1);
Wherein w2∈(-1,1);
Step 1.6 gives adjective participle word, and its comprehensive emotion weight is w:
Step 2:Data prediction and data pick-up are carried out to community network data set, user behavior record is extracted, is calculated and is considered The user behavior influence power of user behavior time delay;
In community network, if user node u and v are adjacent node, and the message that v is issued to u is by a series of time delays Producing a series of behaviors, wherein behavior includes thumb up, comment and forwards, then consider the user behavior shadow of user behavior time delay Ringing power B_Inf (u, v) is:
Wherein, λ ∈ (0,1), M (u) represents that u performed the massage set of behavior, and M (v) represents that user v performed the message of behavior Set, | M (u) | represents the message bar number that u performed behavior, and | M (u) ∩ M (v) | represents the message process performing that v is issued to u Bar number, t represents the v pairs of average time delay of the message process performing issued with u, and T represents v pairs with its all neighbor node issue Message process performing average time delay;
Step 3:According to expanded sentiment dictionary, sentiment analysis are carried out to comment information between community network user, obtained User feeling influence power based on user feeling tendency;
Two neighboring users node u and v are given, if u issues first message, v is evaluated it, to wherein one evaluation content commentjCarry out participle and obtain a series of participle set { word1,word2,...,wordn, remove to search each participle in emotion Corresponding participle emotion weight { w in dictionary1,w2,...,wn, ifwi=0, then feelings of this evaluation content Sense score value:
What deserves to be explained is, the word with turnover meaning often occurs in comment information, such as in Chinese " it is not, Adverbial word such as " not " or not and in English, this kind of word is combined the meaning that the emotion participle that it can be caused to modify is expressed with other emotion participles Think reverse;Accordingly, it is considered to arrive the semantic modified adverbial word of turnover, we analyze the context of this kind of word, will be adjacent thereto Emotion participle is extracted and obtains turnover participle set { reverse_w1,reverse_w2,...,reverse_wm, go to search Their emotion weight { r_w respectively in sentiment dictionary1,r_w2,...,r_wm, to commentjEmotion score value repaiied The emotion score value just corrected:
Therefore, if evaluation content commentjIn the positive emotion word that includes it is more, participle emotion weight accumulated value is higher and is Just, the positive emotion for representing v to u is denseer, knows that can to derive u bigger to the influence power of v therefrom;If evaluation content commentj In the negative sense emotion word that includes it is more, negative more of participle emotion weight accumulated value and be negative, it is denseer to u negative sense emotions to represent v, Thus u is derived bigger to the influence power of v;Therefore, the first comment of two neighboring user node u and v, v to u is given commentj, based on commentjEmotion score value derive u to the single emotion influence power of v:
Wherein ο ∈ (0,1);
Therefore comment information { comment of two neighboring user node u and v and a plurality of v to u is given1,comment2,..., commentn, the node emotion influence power being calculated, the emotion influence power for drawing u to v of averaging are commented on according to every:
Step 4:Consider the influence power of the Sentiment orientation based on user behavior tendency, obtain the total influence power of user TotalInfu,v
TotalInfu,v=β β _ Infu,v+(1-β)·SentiInfu,v
Step 5:The total influence power and the propagation path of total influence power being calculated according to step 4, consider community network section comprehensively The topological relation of point, therefore u is φ to the propagating influence summation of vu,v(v):
There is the node set of behavior and emotion influence power in wherein I (v) expression network topology structures to v;
Step 6:The total influence power marginal benefits of calculate node u are σMarginal_Revenue(u):
Wherein, S represents start node set, and A represents the set of all user nodes in community network;
Calculate community network interior joint marginal benefit and sort by size, choose the maximum preceding nk node of marginal benefit and insert successively Enter to queue Q, marginal benefit maximum node is inserted into S in ejection Q;
Step 7:Statistics S interior joint numbers, if | S | < k, in recalculating queue Q according to step 6, all node marginal benefits are simultaneously More new sort, marginal benefit maximum node is inserted into S in ejection Q, if | S |=k, S are final start node set.
CN201710019177.XA 2017-01-11 2017-01-11 Social network influence maximization initial node selection method considering user behaviors and emotions Active CN106780073B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710019177.XA CN106780073B (en) 2017-01-11 2017-01-11 Social network influence maximization initial node selection method considering user behaviors and emotions

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710019177.XA CN106780073B (en) 2017-01-11 2017-01-11 Social network influence maximization initial node selection method considering user behaviors and emotions

Publications (2)

Publication Number Publication Date
CN106780073A true CN106780073A (en) 2017-05-31
CN106780073B CN106780073B (en) 2021-05-25

Family

ID=58949265

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710019177.XA Active CN106780073B (en) 2017-01-11 2017-01-11 Social network influence maximization initial node selection method considering user behaviors and emotions

Country Status (1)

Country Link
CN (1) CN106780073B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108460010A (en) * 2018-01-17 2018-08-28 南京邮电大学 A kind of comprehensive grade model implementation method based on sentiment analysis
CN108549632A (en) * 2018-04-03 2018-09-18 重庆邮电大学 A kind of social network influence power propagation model construction method based on sentiment analysis
CN108804524A (en) * 2018-04-27 2018-11-13 成都信息工程大学 Emotion based on stratification taxonomic hierarchies differentiates and importance division methods
CN110138619A (en) * 2019-05-28 2019-08-16 湖南大学 A kind of start node choosing method that realizing maximizing influence and system
CN110598960A (en) * 2018-05-23 2019-12-20 北京国双科技有限公司 Entity-level emotion assessment method and device
CN110727881A (en) * 2019-10-23 2020-01-24 精硕科技(北京)股份有限公司 Method and device for determining target public figure
CN110738421A (en) * 2019-10-17 2020-01-31 西南大学 method for measuring influence of multi-layer network users based on shortest propagation path
CN112052995A (en) * 2020-08-31 2020-12-08 杭州电子科技大学 Social network user influence prediction method based on fusion emotional tendency theme
CN112487304A (en) * 2020-11-26 2021-03-12 杭州电子科技大学 Method for establishing influence propagation model based on viewpoint vectorization
CN113688202A (en) * 2021-07-30 2021-11-23 杭州网易云音乐科技有限公司 Emotion polarity analysis method and device, electronic equipment and computer storage medium
CN116245555A (en) * 2023-03-09 2023-06-09 清瑞网络科技(山东)有限责任公司 User information collecting and analyzing system based on big data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101770487A (en) * 2008-12-26 2010-07-07 聚友空间网络技术有限公司 Method and system for calculating user influence in social network
CN103530360A (en) * 2013-10-12 2014-01-22 广西师范学院 Network society influence maximization algorithm based on microblog text affective computing

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101770487A (en) * 2008-12-26 2010-07-07 聚友空间网络技术有限公司 Method and system for calculating user influence in social network
CN103530360A (en) * 2013-10-12 2014-01-22 广西师范学院 Network society influence maximization algorithm based on microblog text affective computing

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
XIAOHENG DENG等: "Credit Distribution and Influence Maximization in", 《2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY》 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108460010A (en) * 2018-01-17 2018-08-28 南京邮电大学 A kind of comprehensive grade model implementation method based on sentiment analysis
CN108549632A (en) * 2018-04-03 2018-09-18 重庆邮电大学 A kind of social network influence power propagation model construction method based on sentiment analysis
CN108804524B (en) * 2018-04-27 2020-03-27 成都信息工程大学 Emotion distinguishing and importance dividing method based on hierarchical classification system
CN108804524A (en) * 2018-04-27 2018-11-13 成都信息工程大学 Emotion based on stratification taxonomic hierarchies differentiates and importance division methods
CN110598960B (en) * 2018-05-23 2022-06-03 北京国双科技有限公司 Entity-level emotion assessment method and device
CN110598960A (en) * 2018-05-23 2019-12-20 北京国双科技有限公司 Entity-level emotion assessment method and device
CN110138619B (en) * 2019-05-28 2020-05-19 湖南大学 Initial node selection method and system for realizing influence maximization
CN110138619A (en) * 2019-05-28 2019-08-16 湖南大学 A kind of start node choosing method that realizing maximizing influence and system
CN110738421A (en) * 2019-10-17 2020-01-31 西南大学 method for measuring influence of multi-layer network users based on shortest propagation path
CN110738421B (en) * 2019-10-17 2023-08-22 西南大学 Multilayer network user influence measuring method based on shortest propagation path
CN110727881A (en) * 2019-10-23 2020-01-24 精硕科技(北京)股份有限公司 Method and device for determining target public figure
CN112052995A (en) * 2020-08-31 2020-12-08 杭州电子科技大学 Social network user influence prediction method based on fusion emotional tendency theme
CN112487304A (en) * 2020-11-26 2021-03-12 杭州电子科技大学 Method for establishing influence propagation model based on viewpoint vectorization
CN112487304B (en) * 2020-11-26 2022-05-03 杭州电子科技大学 Method for establishing influence propagation model based on viewpoint vectorization
CN113688202A (en) * 2021-07-30 2021-11-23 杭州网易云音乐科技有限公司 Emotion polarity analysis method and device, electronic equipment and computer storage medium
CN113688202B (en) * 2021-07-30 2024-03-15 杭州网易云音乐科技有限公司 Emotion polarity analysis method and device, electronic equipment and computer storage medium
CN116245555A (en) * 2023-03-09 2023-06-09 清瑞网络科技(山东)有限责任公司 User information collecting and analyzing system based on big data
CN116245555B (en) * 2023-03-09 2023-12-08 张家口巧工匠科技服务有限公司 User information collecting and analyzing system based on big data

Also Published As

Publication number Publication date
CN106780073B (en) 2021-05-25

Similar Documents

Publication Publication Date Title
CN106780073A (en) A kind of community network maximizing influence start node choosing method for considering user behavior and emotion
CN106980692B (en) Influence calculation method based on microblog specific events
Barbosa et al. Robust sentiment detection on twitter from biased and noisy data
CN108628833B (en) Method and device for determining summary of original content and method and device for recommending original content
CN103577989B (en) A kind of information classification approach and information classifying system based on product identification
CN110781668B (en) Text information type identification method and device
CN106940732A (en) A kind of doubtful waterborne troops towards microblogging finds method
Mhatre et al. Dimensionality reduction for sentiment analysis using pre-processing techniques
CN105183717B (en) A kind of OSN user feeling analysis methods based on random forest and customer relationship
CN104881458B (en) A kind of mask method and device of Web page subject
CN109815485B (en) Method and device for identifying emotion polarity of microblog short text and storage medium
CN105095419A (en) Method for maximizing influence of information to specific type of weibo users
WO2008022581A1 (en) Method and device for obtaining the new words and input method system
CN108960574A (en) Quality determination method, device, server and the storage medium of question and answer
CN113849598B (en) Social media false information detection method and detection system based on deep learning
CN105893484A (en) Microblog Spammer recognition method based on text characteristics and behavior characteristics
CN102789449A (en) Method and device for evaluating comment text
CN110909230A (en) Network hotspot analysis method and system
CN106156117A (en) Hidden community core communication circle detection towards particular topic finds method and system
Sadiq et al. High dimensional latent space variational autoencoders for fake news detection
Yao et al. Online deception detection refueled by real world data collection
De Zarate et al. Vocabulary-Based Method for Quantifying Controversy in Social Media.
Modi et al. Sentiment analysis of Twitter feeds using flask environment: A superior application of data analysis
CN106933798B (en) Information analysis method and device
Wu et al. Reducing noisy labels in weakly labeled data for visual sentiment analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant