CN104579790B - A kind of method determining link prediction reduction limit quantity - Google Patents

A kind of method determining link prediction reduction limit quantity Download PDF

Info

Publication number
CN104579790B
CN104579790B CN201510037313.9A CN201510037313A CN104579790B CN 104579790 B CN104579790 B CN 104579790B CN 201510037313 A CN201510037313 A CN 201510037313A CN 104579790 B CN104579790 B CN 104579790B
Authority
CN
China
Prior art keywords
limit
node
index
add
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510037313.9A
Other languages
Chinese (zh)
Other versions
CN104579790A (en
Inventor
张维明
周游
修保新
程光权
谢福利
朱先强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN201510037313.9A priority Critical patent/CN104579790B/en
Publication of CN104579790A publication Critical patent/CN104579790A/en
Application granted granted Critical
Publication of CN104579790B publication Critical patent/CN104579790B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention belongs to network reconfiguration and link prediction field, be specifically related to a kind of method determining link prediction reduction limit number, the present invention is based on the interlinking of link prediction and limit cluster, utilize the segmentation density index in the cluster of limit to determine the reduction limit quantity in link prediction process.Use link Predicting Technique to calculate the right connection probability of all nodes not connecting limit, and sort according to connection probability.Sound out in a network successively according to clooating sequence and add each limit, all to re-start limit cluster analysis to network, in the cluster process of limit, computed segmentation density index, by adding the change of network maximum fractionation density before and after a certain bar limit, and then determine whether add this limit.The invention solves after link prediction, determine the problem of reduction limit quantity; The principle of the invention is simple, clear process, be easy to realize, and can provide the support in decision-making, improve the efficiency of decision-making for policymaker.

Description

A kind of method determining link prediction reduction limit quantity
Technical field
The invention belongs to network reconfiguration and link prediction field, be specifically related to a kind of method determining link prediction reduction limit number.
Background technology
Link prediction is to be solved is to the disappearance reduction on limit and the problem of prediction in complex network, existing link prediction method, for the consideration of computational speed and computation complexity, uses the link prediction based on similitude in practical application.Existing link prediction technology cannot determine to reduce the quantity on limit.By existing method, policymaker can only the right connection probability of the company of calculating limit (not having limit to be directly connected in observation grid) node and connect the sequence of probability, and do not know to reduce the quantity on limit, but this is the problem that must solve in network reconfiguration process.
Introduce several existing link prediction index based on similitude below.
Node v in define grid G set of node x, the set of its neighbor node is designated as Γ (x), and the number of neighbor node is designated as k x.Link prediction method based on similitude is thought, two node v xand v ysimilitude s xylarger, between these two nodes, the possibility on the company of generation limit is larger.
(1) common neighbours' index
Common neighbours' similitude can be described as structural equivalence again, if namely two nodes have a lot of common neighbor node, so these two nodes are similar.Visible, structural equivalence pay close attention to be two some whether be in same environment.If it is more that two points share common neighbours, then their the more company of tending to limits.
Common neighbours' index mathematical definition is: two node v xand v ysimilitude be their common neighbours' number, that is:
s xy=|Γ(x)∩Γ(y)|
If consider the impact of the degree of two end nodes on similarity between two nodes from different perspectives, produce again following 6 kinds of similarity indices.
1) Salton index [1]
Salton index is also known as cosine similarity, and it reflects the ratio of common neighbours' number and two-end-point geometric mean, is defined as
s xy = | Γ ( x ) ∩ Γ ( y ) | k x k y
2) Jaccard index [2]
This index was just suggested before more than 100 years, its reflection be the ratio of two node common neighbours numbers and whole neighbours' quantity, be defined as
s xy = | Γ ( x ) ∩ Γ ( y ) | | Γ ( x ) ∪ Γ ( y ) |
3) Sorenson index [3]
This index is usually used in ecological data research, and it reflects that the ratio of common neighbours' number and two-end-point algebraic mean number is defined as
s xy = 2 × | Γ ( x ) ∩ Γ ( y ) | k x + k y
4) the favourable index of magnanimous node (HPI) [4]
This index is thought, spend larger node, the centrality shown in a network is more obvious, and easy and other nodes produce and connect limit, and it is defined as
s xy = | Γ ( x ) ∩ Γ ( y ) | min { k x , k y }
Because the less node of denominator degree of being determines, under this definition, between known magnanimous node and other nodes, more easily there is high similarity.
5) the unfavorable index of magnanimous node (HDI)
This index is contrary with HPI index, and it thinks that the large node of network moderate is unfavorable for the connection with other nodes.Contrary with HPI index, in similarity indices formula, denominator gets the maximum of two node degrees, is defined as
s xy = | Γ ( x ) ∩ Γ ( y ) | max { k x , k y }
6) LHN-I index [5]
This index is proposed by Leicht, Holme and Newman, is defined as
s xy = | Γ ( x ) ∩ Γ ( y ) | k x k y
Wherein, denominator k xk ybe proportional to node v xand v ythe desired value of common neighbours' number, i.e. E (| Γ (x) ∩ Γ (y) |).
(2) AA index [6]
Adamic-Adar index (AA index) considers the information of the degree of the common neighbours of two nodes.The contribution of common neighbours to similitude that its thought degree of being is little is larger.This is readily appreciated that in social networks: such as in microblogging, two people not theed least concerned can pay close attention to a star (having more magnanimous node), but two people of focus attentions equally on domestic consumer (having the node of less degree) are probably friends.AA index is that each common neighbor node gives a weighted value, is defined as
s xy = Σ z ∈ Γ ( x ) ∩ Γ ( y ) 1 log k z
(3) Resourse Distribute index index [7]
To consider in network G non-conterminous two node v arbitrarily xand v ysince network is communicated with, and can think v xsome information can be transmitted to v y, and their common neighbours just can regard the medium that information is transmitted as.Suppose the resource of each common neighbours Dou Youyige unit, and mean allocation is passed to its neighbours, then from v xv can be delivered to ynumber of resources just can be defined as node v xand v ysimilarity, that is:
s xy = Σ z ∈ Γ ( x ) ∩ Γ ( y ) 1 k z
[1]SaltonG,McGillMJ.Introductiontomoderninformationretrieval[J].1983.
[2]JaccardP.EtudecomparativedeladistributionfloraledansuneportiondesAlpesetduJura[M].Impr.Corbaz,1901.
[3]SorensenT.AmethodofestablishinggroupsofequalamplitudeinplantsociologybasedonsimilarityofspeciescontentanditsapplicationtoanalysesofthevegetatianonDanishcommons[J].BiologiskeSkrifter,1948,5(4):1-34
[4]RavaszE,SomeraAL,MongruDA,etal.Hierarchicalorganizationofmodularityinmetabolicnetworks[J].science,2002,297(5586):1551-1555.
[5]LeichtEA,HolmeP,NewmanMEJ.Vertexsimilarityinnetworks[J].PhysicalReviewE,2006,73(2):026120.
[6]AdamicLA,AdarE.Friendsandneighborsontheweb[J].Socialnetworks,2003,25(3):211-230.
[7]ZhouT,LüL,ZhangYC.Predictingmissinglinksvialocalinformation[J].TheEuropeanPhysicalJournalB,2009,71(4):623-630.
The present invention is intended to solve the problem determining reduction limit number in link prediction (link prediction is to be solved is to the disappearance reduction on limit and the problem of prediction in complex network) process.Method due to existing link prediction only gives the right connection probability of non-connected node in network, and one does not determine the method on reduction how many limits, and for policymaker, a kind of method of reduction limit quantity that can instruct has great importance.
Summary of the invention
Reduce limit count issue for solving in above-mentioned link reduction process, the invention provides and propose a kind of method can determining to reduce limit quantity, concrete technical scheme is as follows:
Determine a method for link prediction reduction limit quantity, comprise the following steps:
(1) initialization sets reduction limit quantity n as 0, selects link prediction similarity indices, does not connect the connection probability that the node on limit is right in computing network, and according to connection probability from big to small by node to sequence composition sequence collection;
(2) judge whether sequence sets is empty set, if empty set, goes to step (5), otherwise, from sequence sets, select the node pair of current connection maximum probability, judge whether to there is multiple node to connection maximum probability simultaneously; If exist, for connecting the equal node of probability to processing, in current network respectively heuristically these nodes between add a limit, calculate the maximum fractionation density after adding, the rear maximum fractionation density calculated is added from big to small according to souning out respectively, these being connected the equal node of probability to carrying out partial ordering's composition local sequence sets, entering step (3); If do not exist, directly enter step (4);
(3) selection node pair is successively concentrated from local sequence, calculate and compare this node between add the maximum fractionation density of a forward and backward network in limit, if add the maximum fractionation density behind this limit be more than or equal to interpolation before maximum fractionation density, then add this limit in a network, the value of n adds 1, if node corresponding to this limit is to being last node pair that local sequence is concentrated, from sequence sets, then delete all nodes pair that local sequence is concentrated, renewal sequence collection, goes to step (2); If add the maximum fractionation density behind this limit be less than interpolation before maximum fractionation density, then do not add this limit, go to step (5);
(4) calculate and compare this node between add the maximum fractionation density of network before and after a limit, if add the maximum fractionation density behind this limit be more than or equal to interpolation before maximum fractionation density, then add this limit in a network, the value of n adds 1, node corresponding for this limit is deleted from sequence sets, renewal sequence collection; Go to step (2); If add the maximum fractionation density behind this limit be less than interpolation before maximum fractionation density, then do not add this limit, go to step (5);
(5) end loop, Output rusults n value.
Adopt the technique effect that the present invention obtains: the invention solves after link prediction, determine the problem of reduction limit quantity; The principle of the invention is simple, clear process, be easy to realize, and can provide the support in decision-making, improve the efficiency of decision-making for policymaker;
Accompanying drawing explanation
Fig. 1 is that the present invention determines reduction limit quantitative approach flow chart;
Fig. 2 is limit of the present invention similitude exemplary graph;
Tu3Shi karate club network topological diagram.
Embodiment
Below in conjunction with the drawings and specific embodiments, the present invention will be further described.
As shown in Figure 1, the present invention is based on the interlinking of link prediction and limit cluster, utilize the segmentation density index in the cluster of limit to determine the reduction limit quantity in link prediction process.Link Predicting Technique to be used before the present invention to calculate the right connection probability of all nodes not connecting limit using, and sort according to connection probability.Sound out in a network successively according to clooating sequence and add each limit, all to re-start limit cluster analysis to network, in the cluster process of limit, computed segmentation density index, by adding the change of network maximum fractionation density before and after a certain bar limit, and then determine whether add this limit.
Introduce the computational methods of network maximum fractionation density below.
First according to the index that link prediction uses, limit similarity indices formula is determined.As shown in Figure 2, limit similitude is for adjacent edge (having two limits of public vertex).In the present embodiment, link prediction uses Jaccard index, and Similarity measures formula is: so in the cluster process of limit, limit similarity definition is wherein, e ik, e jkfor there being two limits of common node, Γ (i), Γ (j) are node v i, v jneighbor node set.
After determining limit similarity indices formula, the limit similitude of all adjacent edges in computing network.Before the cluster of limit, make every bar limit alone as a class.Maximum for limit similitude two limits (not also being classified as a class) are classified as a class, until all limits are all classified as a class in network by each cluster.
After each cluster, first calculate the Connection Density D of each class inside c:
Wherein, C represents the class that cluster obtains, m cthe number of the internal edges of representation class, n cthe number (concrete list of references AHNY-Y, BAGROWJP, LEHMANNS.Linkcommunitiesrevealmultiscalecomplexityinnetw orks.Nature, 2010,466:761-764.) of the point that representation class comprises.After each cluster, the segmentation density D of whole network is exactly the weighted average of the inside Connection Density of all subclasses, is calculated by following formula:
D = 2 M Σ C m C m C - ( n C - 1 ) ( n C - 2 ) ( n C - 1 ) .
Wherein, M represents limit number total in network.Network every bar limit before the cluster of limit, alone as a class, is all classified as in a class process to final all limits, and each cluster all calculates once splits density, and the maximum of all segmentation density is exactly the maximum fractionation density of network.
For karate club network, network topology is illustrated in fig. 3 shown below, and number in the figure 1-33 represents node.Perform step 1, initialization n=0, use Jaccard index to carry out link prediction, calculate each node to connection probability, and arrange from big to small, composition sequence collection is as follows:
Node pair Connect probability
{7,12} 0.571429
{4,5} 0.5
{6,9} 0.5
{13,15} 0.5
{24,25} 0.5
{24,26} 0.5
{25,26} 0.5
{24,27} 0.5
{25,27} 0.5
{26,27} 0.5
{24,28} 0.5
{25,28} 0.5
{26,28} 0.5
{27,28} 0.5
{12,14} 0.428571
{13,14} 0.4
{3,23} 0.041667
Perform step 2, sequence sets is not empty set, selects the node of connection maximum probability to { 7,12} does not exist with { 7,12} node, to connecting other equal nodes pair of probability, directly enters step 4.
Perform step 4, network adds limit, and { the maximum fractionation density after 7,12} is greater than interpolation limit { the maximum fractionation density before 7,12}, therefore { value of 7,12}, n adds 1 (now n=1) to add limit, by node to 7,12} deletes from sequence sets, and now sequence sets is as follows:
Node pair Connect probability
{4,5} 0.5
{6,9} 0.5
{13,15} 0.5
{24,25} 0.5
{24,26} 0.5
{25,26} 0.5
{24,27} 0.5
{25,27} 0.5
{26,27} 0.5
{24,28} 0.5
{25,28} 0.5
{26,28} 0.5
{27,28} 0.5
{12,14} 0.428571
{13,14} 0.4
{3,23} 0.041667
Enter step 2.
Perform step 2, sequence sets is not empty set, selects the node of connection maximum probability to { 4,, there is node to { 6,9}, { 13 in 5}, 15}, { 24,25}, { 24,26}, { 25,26}, { 24,27}, { 25,27}, { 26,27}, { 24,28}, { 25,28}, { 26,28}, { 27,28} and node are to { 4,5} has equal connection probability.Calculate the maximum fractionation density distinguished in a network after these limits of exploratory interpolation, arrange from big to small, composition local sequence sets, as follows:
Enter step 3.
Perform step 3, in order the exploratory limit of adding local sequence and concentrating.{ before the maximum fractionation density after 4,5} is greater than interpolation, so add limit, { 4,5}, n add 1 (now n=2), by node to { 4,5} deletes from the sequence of local to add limit.Now local sequence sets is:
{ before the maximum fractionation density after 6,9} is greater than interpolation, so add limit, { 6,9}, n add 1 (now n=3), by node to { 6,9} deletes from the sequence of local to add limit.Now local sequence sets is:
{ before the maximum fractionation density after 13,15} is greater than interpolation, so add limit, { 13,15}, n add 1 (now n=4), by node to { 13,15} deletes from the sequence of local to add limit.Now local sequence sets is
{ before the maximum fractionation density after 24,25} is less than interpolation, so do not add limit, { 24,25} goes to step 5 to add limit.
Perform step 5, delete all sequences collection, exit circulation, terminate.N=4, the limit of adding is { 7,12}, { 4,5}, { 6,9}, { 13,15}.
Limit similarity indices is relevant with the similarity indices used by the link prediction stage.In real world applications, different similarity indices is used to carry out link prediction, different limit similarity indices will be used to determine the quantity on reduction limit, the limit Similarity measures formula in link prediction use Jaccard index situation is given in this specification, but be not limited thereto, use other similarity indices situations when link prediction under, corresponding limit Similarity measures formula is as follows:
E in table xk, e ykfor there being two limits of common node, Γ (x), Γ (y) are node v x, v yneighbor node set.
Content described in this specification embodiment is only enumerating the way of realization that the present invention conceives; should not being considered as of protection scope of the present invention is only limitted to the concrete form that embodiment is stated, protection scope of the present invention also and conceive the equivalent technologies means that can expect according to the present invention in those skilled in the art.

Claims (2)

1. determine a method for link prediction reduction limit quantity, it is characterized in that, comprise the following steps:
(1) initialization sets reduction limit quantity n as 0, selects link prediction similarity indices, does not connect the connection probability that the node on limit is right in computing network, and according to connection probability from big to small by node to sequence composition sequence collection;
(2) judge whether sequence sets is empty set, if empty set, goes to step (5), otherwise, from sequence sets, select the node pair of current connection maximum probability, judge whether to there is multiple node to connection maximum probability simultaneously; If exist, for connecting the equal node of probability to processing, in current network respectively heuristically these nodes between add a limit, calculate the maximum fractionation density after adding, the rear maximum fractionation density calculated is added from big to small according to souning out respectively, these being connected the equal node of probability to carrying out partial ordering's composition local sequence sets, entering step (3); If do not exist, directly enter step (4);
(3) selection node pair is successively concentrated from local sequence, calculate and compare this node between add the maximum fractionation density of a forward and backward network in limit, if add the maximum fractionation density behind this limit be more than or equal to interpolation before maximum fractionation density, then add this limit in a network, the value of n adds 1, if node corresponding to this limit is to being last node pair that local sequence is concentrated, from sequence sets, then delete all nodes pair that local sequence is concentrated, renewal sequence collection, goes to step (2); If add the maximum fractionation density behind this limit be less than interpolation before maximum fractionation density, then do not add this limit, go to step (5);
(4) calculate and compare this node between add the maximum fractionation density of network before and after a limit, if add the maximum fractionation density behind this limit be more than or equal to interpolation before maximum fractionation density, then add this limit in a network, the value of n adds 1, node corresponding for this limit is deleted from sequence sets, renewal sequence collection; Go to step (2); If add the maximum fractionation density behind this limit be less than interpolation before maximum fractionation density, then do not add this limit, go to step (5);
(5) end loop, Output rusults n value.
2. a kind of method determining link prediction reduction limit quantity as claimed in claim 1, it is characterized in that, described link prediction similarity indices is any one index in Salton index, Jaccard index, Sorenson index, HPI, HDI, LHN-I index and AA index, wherein HPI represents the favourable index of magnanimous node, HDI represents the unfavorable index of magnanimous node, the index abbreviated name that LHN-I index expression Leicht, Holme and Newman propose, AA index expression Adamic-Adar index.
CN201510037313.9A 2015-01-26 2015-01-26 A kind of method determining link prediction reduction limit quantity Active CN104579790B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510037313.9A CN104579790B (en) 2015-01-26 2015-01-26 A kind of method determining link prediction reduction limit quantity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510037313.9A CN104579790B (en) 2015-01-26 2015-01-26 A kind of method determining link prediction reduction limit quantity

Publications (2)

Publication Number Publication Date
CN104579790A CN104579790A (en) 2015-04-29
CN104579790B true CN104579790B (en) 2016-01-20

Family

ID=53095055

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510037313.9A Active CN104579790B (en) 2015-01-26 2015-01-26 A kind of method determining link prediction reduction limit quantity

Country Status (1)

Country Link
CN (1) CN104579790B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104866558B (en) * 2015-05-18 2018-08-10 中国科学院计算技术研究所 A kind of social networks account mapping model training method and mapping method and system
CN105183796A (en) * 2015-08-24 2015-12-23 同济大学 Distributed link prediction method based on clustering
CN105228185B (en) * 2015-09-30 2017-03-22 中国人民解放军国防科学技术大学 Method used for identifying identity of fuzzy redundant node in communication network
CN107194069A (en) * 2017-05-23 2017-09-22 浙江工业大学 A kind of link prediction method favourable based on Bayesian Estimation and magnanimous node
CN110942178B (en) * 2019-11-01 2022-08-23 南京邮电大学 Charging pile recommendation method based on link prediction method of resource allocation index
CN112434437B (en) * 2020-12-02 2023-08-25 大连大学 Method for constructing equipment support super-network dynamic evolution model by considering node recombination

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101960329A (en) * 2006-12-14 2011-01-26 波音公司 Method and device for trilateration using los link prediction and pre-measurement los path filtering
CN103905246A (en) * 2014-03-06 2014-07-02 西安电子科技大学 Link prediction method based on grouping genetic algorithm

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101960329A (en) * 2006-12-14 2011-01-26 波音公司 Method and device for trilateration using los link prediction and pre-measurement los path filtering
CN103905246A (en) * 2014-03-06 2014-07-02 西安电子科技大学 Link prediction method based on grouping genetic algorithm

Also Published As

Publication number Publication date
CN104579790A (en) 2015-04-29

Similar Documents

Publication Publication Date Title
CN104579790B (en) A kind of method determining link prediction reduction limit quantity
Inglesi-Lotz et al. The role of renewable versus non-renewable energy to the level of CO2 emissions a panel analysis of sub-Saharan Africa’s Βig 10 electricity generators
CN103324980B (en) A kind of method for forecasting
Delvenne et al. Centrality measures and thermodynamic formalism for complex networks
CN103064917B (en) The high-impact customer group of a kind of specific tendency towards microblogging finds method
Valverde-Rebaza et al. Link prediction in complex networks based on cluster information
CN104933624A (en) Community discovery method of complex network and important node discovery method of community
CN106991617B (en) Microblog social relationship extraction algorithm based on information propagation
CN105095419A (en) Method for maximizing influence of information to specific type of weibo users
CN103179198B (en) Based on the topic influence individual method for digging of many relational networks
CN105550275A (en) Microblog forwarding quantity prediction method
CN103838804A (en) Social network user interest association rule mining method based on community division
CN107317704A (en) A kind of complex network important node sort method based on tight ness rating and structural hole
CN105976048A (en) Power transmission network extension planning method based on improved artificial bee colony algorithm
CN106411904B (en) A kind of network risks control method based on microstate prediction
CN104077412A (en) Micro-blog user interest prediction method based on multiple Markov chains
CN110995597A (en) Method and system for selecting safe link of power communication network
Yeh A sequential decomposition method for estimating flow in a multi-commodity, multistate network
CN105045967A (en) Group degree based sorting method and model evolution method for important nodes on complex network
CN106327026A (en) Wind power prediction optimization method
CN101859315A (en) Heuristic solving method for maximizing influence of social network
CN107463551A (en) The analysis method and device of social networks interpersonal relationships
CN104715034A (en) Weighed graph overlapping community discovery method based on central persons
CN106451431A (en) Transmission grid loss allocation method in hybrid transaction mode
Bagheri et al. FSIM: A fast and scalable influence maximization algorithm based on community detection

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant