CN103116611A - Social network opinion leader identification method - Google Patents

Social network opinion leader identification method Download PDF

Info

Publication number
CN103116611A
CN103116611A CN201310028159XA CN201310028159A CN103116611A CN 103116611 A CN103116611 A CN 103116611A CN 201310028159X A CN201310028159X A CN 201310028159XA CN 201310028159 A CN201310028159 A CN 201310028159A CN 103116611 A CN103116611 A CN 103116611A
Authority
CN
China
Prior art keywords
node
seed
child node
social networks
influence power
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310028159XA
Other languages
Chinese (zh)
Inventor
张璐
蔡皖东
蔡霖
彭冬
王塑
叶三成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xi'an tobacco monopoly bureau
Northwestern Polytechnical University
Original Assignee
Xi'an tobacco monopoly bureau
Northwestern Polytechnical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xi'an tobacco monopoly bureau, Northwestern Polytechnical University filed Critical Xi'an tobacco monopoly bureau
Priority to CN201310028159XA priority Critical patent/CN103116611A/en
Publication of CN103116611A publication Critical patent/CN103116611A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a social network opinion leader identification method which is used for solving the technical problem that the prior social network opinion leader identification method is low in efficiency. The technical scheme is that first, social network data are obtained; then, an information dissemination model is built according to social network characteristics; then, each node degree is calculated according to the information dissemination model and social network topological information; a node which is most influential is used as a first seed node; the left seed nodes in a new node assembly are detected to form a seed node assembly, wherein the seed nodes of the seed node assembly are arranged according to influence from great to small; the first n seed nodes are selected from the seed node assembly with the seed nodes arranged according to influence to constitute an opinion leader assembly, and accordingly, an opinion leader with different influence or charisma is identified. Because the seed node which is most influential in the social network is detected to identify the opinion leader, massive blind detection time is saved and identification efficiency is improved. Compared with the background art, identification efficiency is improved by 50-90 %.

Description

Social networks leader of opinion recognition methods
Technical field
The present invention relates to a kind of recognition methods, be specifically related to a kind of social networks leader of opinion recognition methods.
Background technology
Along with the development of Web2.0 technology, social networks (SNS) has become popular network application in the internet.At present, some extensive online social network sites, visit capacity as Facebook has surpassed Google, become the first website of the U.S., and domestic everybody net that is subjected to deeply that the university student welcomes, at present the registered user has reached 100,000,000, and day logs in 2,200 ten thousand person-times (on October 27th, 2009, everybody netted the data of announcement), and the data of alexa in Dec, 2009 website show that in front 15 of domestic and international website visiting amount, social network sites accounts for respectively 4 and 6.Social network sites has millions of online users every day, and this is comprising huge potential business opportunity, promotes their product such as some companies can utilize the social network sites online user.
In social networks, the influence power of planting child node is very important to promoting Information Communication.Some company or users that promote its product, service by virus-type marketing mode nourish very large interest to the influential kind child node of How to choose.Want at social network sites to be that its product is advertised such as A company, because advertising expenditure is limited, can only throw in K user, A company wishes that these initial users can like its product, and with them as kind of a child node, passing on from one to another mode with public praise in social networks affects their friend, allows their friend also like its product, and their friend further affects more friend by social networks, makes more user can like its product.A company wishes that certainly the initial user's (namely planting child node) who selects has considerable influence power, and the number that affects is many as much as possible, just can reach maximum advertisement benefit thereby spend a small amount of expense.As seen, plant child node and brought into play important effect in the spreading network information process, they are equivalent to the leader of opinion, and by their guiding and impact, local suggestion may develop and be network public opinion.The statistics demonstration, the most of user in network does not often participate in manufacturing and the propagation of information, and the leader of opinion is often followed in the decision that they make.Recognition network leader of opinion effectively, delivering guided bone information by the leader of opinion affects the place network user but not directly persuades them, can effectively trigger the influence power of whole network or society, for promoting Information Communication, improve demonstration effect and have important practical significance.
People have studied from different perspectives the social networks leader of opinion and have found and identification problem, identifying the leader of opinion by the kind child node that detects influence power maximum in social networks is wherein a kind of important method, and cause and concern and the attention of industry problems is summed up as the maximizing influence problem.
For the maximizing influence problem, at present derivation algorithm mainly is divided into two classes: (1) complex network algorithm, and such as based on node degree with based on the algorithm at center etc., this class algorithm Main Problems is that resulting kind of child node influence power is on the low side; (2) greedy algorithm, its subject matter be that counting yield is lower, computing time is unstable and extensibility relatively poor etc.
Kempe etc. are at document " Maximizing the spread of influence through a social network(SIGKDD, pages137-146, 2003) ", with the maximizing influence problem as a discrete optimization problems of device, proved that the maximizing influence problem is a NP difficult problem, and an approximate greedy algorithm proposed, its core concept is to choose the maximum node of influence power increment as kind of a child node at every turn, proved that influence power that greedy algorithm obtains kind of child node is not less than (1-1/e) of optimal algorithm, and studied the discrete Information Propagation Model of three kinds of differences and how to have sought the kind child node with maximum effect power in model.Experimental result shows, greedy algorithm obtain kind of child node influence power significantly higher than traditional based on node degree with based on the algorithm at center, but this greedy algorithm has a serious shortcoming, it is exactly the counting yield problem, such as the kind child node of (approximately 15000 nodes) search maximum effect power in social networks on a fairly large scale need to be calculated time a couple of days on a station server, and in extensive social networks, millions of nodes being arranged, may doubly increase by exponentially computing time.
Leskovec etc. are in document " Cost-effective outbreak detection in networks(SIGKDD; pages420-429; 2007) ", propose a CELF (Cost-Effective Lazy Forward selection) and optimize greedy method, the method is based on influence power and has the Submodular function feature extraction, namely the influence power of all nodes weakens along with interstitial content in the seed node set is increased in, and has monotone decline.The method is divided into two steps: first step is used for selecting first kind child node, is all searching for kind of a child node in node, selects the influence power maximum node to join in the seed node set; Second step is used for selecting the remaining child node of planting, and utilizes influence power to have this character of monotone decline and search for kind of a child node in the larger node of some effects power.Due to the minimizing of the search of the method in second step seed node space, so counting yield is greatly improved.The experimental result demonstration, in the situation that Search Results is identical, the CELF algorithm has improved nearly 700 times than the computing velocity of original greedy algorithm, but for large-scale social networks, its counting yield is still lower.
Wei Chen etc. are at document " Efficient influence maximization in social networks (SIGKDD, pp.199-208, 2009) ", a new greedy algorithm (NewGreedy) has been proposed, basic thought is in social network diagram, select dependence edge with factor of influence p between node, set up a brand-new subgraph, then select the subgraph moderate to count the node of maximum as kind of a child node, and a MixGreedy algorithm proposed, it is divided into two parts, first adopts the NewGreedy algorithm idea to choose first kind child node, second portion adopts the CELF algorithm idea to choose the remaining child node of planting.The MixGreedy algorithm combines the advantage of NewGreedy algorithm and CELF algorithm, and its counting yield increases than CELF algorithm.Owing to mutually not activating with factor of influence p between node in linear threshold model, the MixGreedy algorithm need to be tried to achieve kind of a child node from independent cascade model or cum rights cascade model, calculate again their influence power in linear threshold model, therefore its Search Results and other greedy algorithms differ larger sometimes, and extensibility is relatively poor in linear threshold model.
Summary of the invention
In order to overcome the inefficient deficiency of existing social networks leader of opinion recognition methods, the invention provides a kind of social networks leader of opinion recognition methods.the leader of opinion is found and identified to the method by the kind child node that detects influence power maximum in social networks, when planting the child node influence power in detecting social networks, the factors such as relevance according to the distribution of social networks degree of node and degree of node number and influence power, employing detects kind of child node in having the part of nodes of the high number of degrees and it calculates influence power, only need to detect and calculate in the larger node of small part influence power, therefore can save a large amount of blind detection time, reduce kind of a child node influence power computation complexity, improved social networks leader of opinion recognition efficiency.
The technical solution adopted for the present invention to solve the technical problems is: a kind of social networks leader of opinion recognition methods is characterized in comprising the following steps:
(1) utilize the web crawlers instrument, gather actual social network data from the internet.
(2) according to the social networks characteristic, the Information Propagation Models such as the independent cascade model of use carry out modeling analysis to the Information Communication process of social networks.In Information Propagation Model, input social network data and r parameter, wherein r(0<r≤1) be the number percent that highly several sections of points account for all nodes.
(3) according to information such as Information Propagation Model and social networks topologys, calculate each node number of degrees, and descending the sequence, select the node of the front r of sequence to form new node set.
(4) detect kind of a child node in new node set, and use Submodular function to calculate kind of a child node influence power, with the node of influence power maximum wherein as first kind child node.
(5) detect remaining kind child node in new node set, the same Submodular function that uses calculates kind of a child node influence power, in the process of each selection kind of child node, the large node of calculating section influence power, until all seed node selections are complete, form one and arrive greatly the seed node set of minispread by influence power.
(6) from the seed node set of influence power sequence, choose front n and plant child node, consist of leader of opinion's set, identify the leader of opinion who has Different Effects power or appeal in social networks.In formula, seed interstitial content in 0<n≤set.
Described r parameter value is between 0.01 to 0.2.
the invention has the beneficial effects as follows: owing to finding and identifying the leader of opinion by the kind child node that detects influence power maximum in social networks, when planting the child node influence power in detecting social networks, the factors such as relevance according to the distribution of social networks degree of node and degree of node number and influence power, employing detects kind of child node in having the part of nodes of the high number of degrees and it calculates influence power, only need to detect and calculate in the larger node of small part influence power, therefore can save a large amount of blind detection time, reduce kind of a child node influence power computation complexity, improved social networks leader of opinion recognition efficiency.Checking and actual test show by experiment, the present invention compares with background technology, in the situation that influence power is not suffered a loss, recognition efficiency has improved 50~90%, and be with good expansibility, be suitable for detecting the kind child node of maximum effect power in extensive social networks, thereby identify the leader of opinion.
Below in conjunction with drawings and Examples, the present invention is elaborated.
Description of drawings
Fig. 1 is the process flow diagram of social networks leader of opinion of the present invention recognition methods.
Embodiment
Key concept involved in the present invention is explained as follows:
(1) Information Propagation Model: social networks is a kind of complex network, and scientific and technological circle adopt Mathematical Modeling Methods that the complex network characteristic is analyzed usually, to describe spreading network information characteristic and inherent law.For social networks, usually adopt three kinds of Information Propagation Models such as independent cascade model, cum rights cascade model and linear threshold model to carry out modeling analysis to social networks kind child node influence power.Like this, how social networks kind child node maximizing influence Solve problems just converts in Information Propagation Model detection and Identification maximum effect power seed node problems to.
(2) influence power function: definition σ () is the influence power function, and S is the seed node set, and U is the search node set.σ (S) is the influence power of seed node set S, and namely S set affects the interstitial content size.
(3) Submodular function character:
Definition 1: if for any element x, y ∈ RK has f (x ∨ y)+f (x ∧ y)≤f (x)+f (y), function f: R k→ R is Submodular function.
Can be drawn the following conclusions by definition 1.
Conclusion 1: if f is Submodular function,
Figure BDA00002775745400041
F (A+j)-f (A) 〉=f (B+j)-f (B) is arranged.As seen, any Submodular function has dullness, the non-character such as negative.
Conclusion 2: in any one example of independent cascade model, cum rights cascade model and linear threshold model, influence power function σ () is a Submodular function.
With reference to Fig. 1.Social networks leader of opinion recognition methods concrete steps of the present invention are as follows:
1. obtain social network data.
Utilize the web crawlers instrument, gather actual social network data from the internet, extracting the network topological informations such as wherein node, connection, to deposit database in pending.
2. set up Information Propagation Model.
According to the social networks characteristic, the Information Propagation Models such as the independent cascade model of use carry out modeling analysis to the Information Communication process of social networks.In Information Propagation Model, the parameters such as input social network data and r, r(0<r≤1 wherein) account for the number percent of all nodes for highly several sections of points, determine according to parameters such as the interstitial content in social networks, seed interstitial content and network topologies, generally between 0.01 to 0.2.
3. the computing node number of degrees.
According to Information Propagation Model and social networks topology information, calculate each node number of degrees, and descending the sequence, select the node of the front r of sequence to form new node set.
4. select first kind child node.
Detect kind of a child node in new node set, and use Submodular function to calculate kind of a child node influence power, with the node of influence power maximum wherein as first kind child node.
5. select remaining kind child node.
Detect remaining kind child node in new node set, the same Submodular function that uses calculates kind of a child node influence power, in the process of each selection kind of child node, the large node of calculating section influence power, until all seed node selections are complete, form one and arrive greatly the seed node set of little sequence by influence power.
6. identify the leader of opinion.
From the seed node set of influence power sequence, choose seed interstitial content in front n(0<n≤set) individual kind of child node, consist of leader of opinion's set, thereby identify the leader of opinion who has had Different Effects power or appeal in social networks.
Use false code to describe specific algorithm of the present invention as follows:
Figure BDA00002775745400051
Figure BDA00002775745400061

Claims (2)

1. social networks leader of opinion recognition methods is characterized in that comprising the following steps:
(1) utilize the web crawlers instrument, gather actual social network data from the internet;
(2) according to the social networks characteristic, the Information Propagation Models such as the independent cascade model of use carry out modeling analysis to the Information Communication process of social networks; In Information Propagation Model, input social network data and r parameter, wherein r(0<r≤1) be the number percent that highly several sections of points account for all nodes;
(3) according to information such as Information Propagation Model and social networks topologys, calculate each node number of degrees, and descending the sequence, select the node of the front r of sequence to form new node set;
(4) detect kind of a child node in new node set, and use Submodular function to calculate kind of a child node influence power, with the node of influence power maximum wherein as first kind child node;
(5) detect remaining kind child node in new node set, the same Submodular function that uses calculates kind of a child node influence power, in the process of each selection kind of child node, the large node of calculating section influence power, until all seed node selections are complete, form one and arrive greatly the seed node set of minispread by influence power;
(6) from the seed node set of influence power sequence, choose front n and plant child node, consist of leader of opinion's set, identify the leader of opinion who has Different Effects power or appeal in social networks; In formula, seed interstitial content in 0<n≤set.
2. social networks leader of opinion according to claim 1 recognition methods is characterized in that: described r parameter value is between 0.01 to 0.2.
CN201310028159XA 2013-01-25 2013-01-25 Social network opinion leader identification method Pending CN103116611A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310028159XA CN103116611A (en) 2013-01-25 2013-01-25 Social network opinion leader identification method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310028159XA CN103116611A (en) 2013-01-25 2013-01-25 Social network opinion leader identification method

Publications (1)

Publication Number Publication Date
CN103116611A true CN103116611A (en) 2013-05-22

Family

ID=48414985

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310028159XA Pending CN103116611A (en) 2013-01-25 2013-01-25 Social network opinion leader identification method

Country Status (1)

Country Link
CN (1) CN103116611A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104134159A (en) * 2014-08-04 2014-11-05 中国科学院软件研究所 Method for predicting maximum information spreading range on basis of random model
CN104616200A (en) * 2015-02-11 2015-05-13 中南大学 Influence maximization initial node selecting method based on node characteristics
CN105138580A (en) * 2015-07-31 2015-12-09 中国科学院信息工程研究所 Method for minimizing network negative information influence based on blocked continuous boundary
CN105335892A (en) * 2015-10-30 2016-02-17 南京邮电大学 Realization method for discovering important users of social network
CN105959368A (en) * 2016-04-29 2016-09-21 成都信息工程大学 Social cloud hot spot resource prediction and disposition method
CN106204297A (en) * 2016-07-11 2016-12-07 深圳市中北明夷科技有限公司 A kind of recognition methods closing social propagation opinion leader and device
CN107123056A (en) * 2017-03-03 2017-09-01 华南理工大学 A kind of location-based social big data information maximization method
CN107507020A (en) * 2017-07-27 2017-12-22 上海交通大学 Obtain the maximized method of Internet communication influence power competitive advantage
CN108122168A (en) * 2016-11-28 2018-06-05 中国科学技术大学先进技术研究院 Seed node screening technique and device in social activity network
CN108710618A (en) * 2017-04-05 2018-10-26 中国科学院声学研究所 One kind is based on concurrent process and the desired propagation seed node selection method of influence power
CN110750721A (en) * 2019-10-21 2020-02-04 秒针信息技术有限公司 Information pushing method and device, electronic equipment and readable storage medium
CN111125453A (en) * 2019-12-27 2020-05-08 中国电子科技集团公司信息科学研究院 Opinion leader role identification method in social network based on subgraph isomorphism and storage medium
CN112100241A (en) * 2020-09-14 2020-12-18 哈尔滨工程大学 Social network dynamic influence maximization method based on theme
CN112667876A (en) * 2020-12-24 2021-04-16 湖北第二师范学院 Opinion leader group identification method based on PSOTVCF-Kmeans algorithm
US11468521B2 (en) 2016-10-31 2022-10-11 Tencent Technology (Shenzhen) Company Limited Social media account filtering method and apparatus

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101859315A (en) * 2010-04-30 2010-10-13 西北工业大学 Heuristic solving method for maximizing influence of social network
CN102663101A (en) * 2012-04-13 2012-09-12 北京交通大学 Sina microblog-based user grade sequencing algorithm
CN102890696A (en) * 2011-02-10 2013-01-23 微软公司 Social network based contextual ranking

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101859315A (en) * 2010-04-30 2010-10-13 西北工业大学 Heuristic solving method for maximizing influence of social network
CN102890696A (en) * 2011-02-10 2013-01-23 微软公司 Social network based contextual ranking
CN102663101A (en) * 2012-04-13 2012-09-12 北京交通大学 Sina microblog-based user grade sequencing algorithm

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104134159A (en) * 2014-08-04 2014-11-05 中国科学院软件研究所 Method for predicting maximum information spreading range on basis of random model
CN104134159B (en) * 2014-08-04 2017-10-24 中国科学院软件研究所 A kind of method that spread scope is maximized based on stochastic model information of forecasting
CN104616200A (en) * 2015-02-11 2015-05-13 中南大学 Influence maximization initial node selecting method based on node characteristics
CN104616200B (en) * 2015-02-11 2017-10-10 中南大学 A kind of maximizing influence start node choosing method based on nodal properties
CN105138580A (en) * 2015-07-31 2015-12-09 中国科学院信息工程研究所 Method for minimizing network negative information influence based on blocked continuous boundary
CN105138580B (en) * 2015-07-31 2018-11-23 中国科学院信息工程研究所 A kind of network negative information influence minimum method based on the company of blocking side
CN105335892A (en) * 2015-10-30 2016-02-17 南京邮电大学 Realization method for discovering important users of social network
CN105959368A (en) * 2016-04-29 2016-09-21 成都信息工程大学 Social cloud hot spot resource prediction and disposition method
CN105959368B (en) * 2016-04-29 2019-04-02 成都信息工程大学 A kind of method of social activity cloud hot point resource prediction and deployment
CN106204297A (en) * 2016-07-11 2016-12-07 深圳市中北明夷科技有限公司 A kind of recognition methods closing social propagation opinion leader and device
US11468521B2 (en) 2016-10-31 2022-10-11 Tencent Technology (Shenzhen) Company Limited Social media account filtering method and apparatus
CN108122168A (en) * 2016-11-28 2018-06-05 中国科学技术大学先进技术研究院 Seed node screening technique and device in social activity network
CN108122168B (en) * 2016-11-28 2020-11-13 中国科学技术大学先进技术研究院 Method and device for screening seed nodes in social activity network
CN107123056A (en) * 2017-03-03 2017-09-01 华南理工大学 A kind of location-based social big data information maximization method
CN108710618A (en) * 2017-04-05 2018-10-26 中国科学院声学研究所 One kind is based on concurrent process and the desired propagation seed node selection method of influence power
CN107507020B (en) * 2017-07-27 2021-10-08 上海交通大学 Method for obtaining network propagation influence competitive advantage maximization
CN107507020A (en) * 2017-07-27 2017-12-22 上海交通大学 Obtain the maximized method of Internet communication influence power competitive advantage
CN110750721A (en) * 2019-10-21 2020-02-04 秒针信息技术有限公司 Information pushing method and device, electronic equipment and readable storage medium
CN111125453A (en) * 2019-12-27 2020-05-08 中国电子科技集团公司信息科学研究院 Opinion leader role identification method in social network based on subgraph isomorphism and storage medium
CN111125453B (en) * 2019-12-27 2023-03-28 中国电子科技集团公司信息科学研究院 Opinion leader role identification method in social network based on subgraph isomorphism and storage medium
CN112100241A (en) * 2020-09-14 2020-12-18 哈尔滨工程大学 Social network dynamic influence maximization method based on theme
CN112100241B (en) * 2020-09-14 2023-07-25 哈尔滨工程大学 Social network dynamic influence maximization method based on theme
CN112667876A (en) * 2020-12-24 2021-04-16 湖北第二师范学院 Opinion leader group identification method based on PSOTVCF-Kmeans algorithm
CN112667876B (en) * 2020-12-24 2024-04-09 湖北第二师范学院 Opinion leader group identification method based on PSOTVCF-Kmeans algorithm

Similar Documents

Publication Publication Date Title
CN103116611A (en) Social network opinion leader identification method
CN103678669B (en) Evaluating system and method for community influence in social network
CN104134159B (en) A kind of method that spread scope is maximized based on stochastic model information of forecasting
CN103488678B (en) Friend recommendation system based on user sign-in similarity
CN102394798B (en) Multi-feature based prediction method of propagation behavior of microblog information and system thereof
CN103678613B (en) Method and device for calculating influence data
Kundu et al. A new centrality measure for influence maximization in social networks
CN101957834B (en) Content recommending method and device based on user characteristics
CN103412872B (en) Micro-blog social network information recommendation method based on limited node drive
US20170154267A1 (en) Discovering signature of electronic social networks
Pal et al. Centrality measures, upper bound, and influence maximization in large scale directed social networks
CN103064917A (en) Specific-tendency high-influence user group discovering method orienting microblog
CN102262681A (en) Method for identifying key blog sets in blog information spreading
CN105095419A (en) Method for maximizing influence of information to specific type of weibo users
CN105550275B (en) A kind of microblogging transfer amount prediction technique
CN103136331A (en) Micro blog network opinion leader identification method
CN107123055A (en) A kind of social big data information maximization method based on PageRank
CN105678590A (en) topN recommendation method for social network based on cloud model
CN109783805A (en) A kind of network community user recognition methods and device
Li et al. Social network user influence dynamics prediction
CN105335476A (en) Method and device for classifying hot event
Wang et al. A study on influential user identification in online social networks
CN102750288B (en) A kind of internet content recommend method and device
CN105589916A (en) Method for extracting explicit and implicit interest knowledge
CN103200073B (en) A kind of information processing method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20130522